7
9 Comments

What is your dream data set? I'll try and get it for you.

There is a lot of data across the internet. If you could have access to something specific, what would you want?

During my day job and while putting together scrapers for my site, I stumble upon all kinds of interesting data and am constantly surprised about what is freely available to anyone with a little bit of web and scripting skills.

on September 16, 2021
  1. 1

    Just a small bug report: your website has some z-index issues with the background on macOS Safari.

  2. 1

    A list of all existing domain names of the .be tld.

      1. 1

        Interesting, thanks

      2. 1

        This comment was deleted 5 years ago.

  3. 1

    That's a big question!

    Right now, it would be a list of all residences in a few ZIP codes around my city, who owns them now, and when they last sold.

    I'm working on a real estate app with a friend, and a lot of this data is available via county records/maps online. I just don't know how to scrape it. We'd use this data to populate the app for our MVP.

    1. 1

      know python?

      a nice trick to speed up / jump start any scraping project is to google "<name of site you want to scrape> scraper site:github.com". if it is a popular site, someone will have likely already published a project.

      these might also be interesting sources for you:

  4. 1

    Hey Steve, what are some examples of the types of data you come across?

    1. 1

      lately, i have been working on putting together a dataset on rising/successful companies to add to my jobs site and so i'll use it as an example:

      • enrichment api's - clearbit for example can provide logos, locations, and social links for companies by name
      • venture capital / investment portfolio pages - pages like this one are on every vc's website. they are easy to scrape and provide great leads for new companies to look into

      other useful stuff:

      for non-standard example, i had been an solar engineer / analyst in a prior role. you can find weather data for free on government sites. combine this with pricing data supplied by solar manufacturers and you can model the profitability of a solar project.

      there is sooo information much you can find. what interests you?

Trending on Indie Hackers
I'm a lawyer who launched an AI contract tool on Product Hunt today — here's what building it as a non-technical founder actually felt like User Avatar 150 comments A simple way to keep AI automations from making bad decisions User Avatar 59 comments “This contract looked normal - but could cost millions” User Avatar 54 comments Never hire an SEO Agency for your Saas Startup User Avatar 44 comments 👉 The most expensive contract mistakes don’t feel risky User Avatar 41 comments The indie maker's dilemma: 2 months in, 700 downloads, and I'm stuck User Avatar 41 comments