INDIE HACKERS statistics (filtered by Stripe revenue verification )
I have an idea to build a project for data visualization and analysis Indie Hackers site. For the project I needed the data, but I did't have it.
A week ago I started building the project...
I've written a CLI program in Typescript. It consist of three commands "parse", "normalize" and "merge".
The "parse" command simply extracts the data and saves it to files.
The "normalize" command gets the data from the files and starts data normalization. The parsed data includes "country", "region", "city" fields written in different languages, which need to be normalized. For this task, I've written a function that transforms the data using Google geocoding. It is not fast process, because it needs to talk with the Google geocoding API, but to speed up the process, I created a cache based on LevelDB. The next step is split tags to the different fields like "category", "platform", "founders", etc. This step takes time to analyze the data. In the end, after completing all the normalization steps, I had all the data to start analyzing.
The "merge" command takes the normalized data and merges it into one file.
I'm not a data analyst at all, I'm a web developer :)
I started looking for data visualization tools.
Finally I've chosen Observablehq + MongoDB Charts
Note: the analysis is based on limited data grouped by 'wordpress' category and filtered by Stripe revenue verification!
The charts you can find in the original post https://observablehq.com/@pyoner/ih-summary