Home
Starting Up
Tech
Creators
A.I.
Lifestyle
Money
Idea Board
Vibe Coding Tools
Products
Ideas DB
Case Studies DB
Subscribe to IH+
Starting Up
Tech
A.I.
Creators
Lifestyle
Money
Join
2
Likes
0
Bookmarks
5
Comments
Report
Where to find: AI Training Data?🤔
by
ericlamcrypto
Hey IH,
Where do you go to find training data and models for AI?
Should there be a marketplace for this?
yes
no
not sure
Vote
Trending on Indie Hackers
I spent $0 on marketing and got 1,200 website visitors - Here's my exact playbook
50 comments
Veo 3.1 vs Sora 2: AI Video Generation in 2025 🎬🤖
26 comments
I built eSIMKitStore — helping travelers stay online with instant QR-based eSIMs 🌍
20 comments
Codenhack Beta — Full Access + Referral
20 comments
🚀 Get Your Brand Featured on FaceSeek
18 comments
Day 6 - Slow days as a solo founder
15 comments
Depending on my usage, I would find my dataset in Kaggle. If I need a formal dataset (usually used in academic papers), then I would just go to a paper and find what dataset they used. Most of them are free.
For models, again it depends on my usage. For NLP related tasks, then there is OpenAI or Midjourney. For computer vision, there are in-built models from Tensorflow or Pytorch, otherwise I would find the github of related academic paper and get the model there.
For me, having a marketplace of these is kinda redundant. I mean it would be nice, but I'm not sure if it would add that much value. Hope you find it helpful.
Kaggle already exists and there is plenty of training data there.
thanks! noted
For data:
For models:
For some readers, the overhead of training a new model will justify using a tool like remyx.ai.
thank you