2
5 Comments

nanoGPT - is anyone playing with it?

Simon, a dude I follow on Twitter custom trained nanoGPT (a 4-layers model) to write in a Shakespeare style.
On his Mac. :)

Is it just a toy or you can do some useful stuff?

submitted this link to Icon for group Artificial Intelligence
Artificial Intelligence
on February 4, 2023
  1. 2

    I tried it with one of the GPT2 models on a small private project to do word synonyms, but it wasn't super for it. In comparison to the GPT3 api from OpenAI it's lacking the possibility to use semantic styles (positive, informal, catchy synonym etc.).

    I think it would be possible to use it to train specific types of writing styles with it, and it is made for being used with high grade consumer machines as you mention. The stuff I've done with Stable Diffusion or other stuff usually requires A100 GPU's and stuff.

    The shakespeare example anyone can try, it's even documented in the repo: https://github.com/karpathy/nanoGPT

    1. 1

      thanks, I kind of like the fact there is a bunch of models, simple and super complex so you can match a tool to a specific need; also environmental concerns...

  2. 2

    I didn't know about this. Thanks for sharing!

  3. 2

    Definitely think it can be useful. But it's a bad generalist AI and only really good for very specific use cases. Especially considering these are 124m params, which is quite low.

    Hoping to get this on Evoke. We just have stable diffusion for now, but are expanding into other AI models, so nanogpt definitely seems interesting.

    Also have an active AI discord if you're interested.

    1. 1

      thanks, discord joined.

Trending on Indie Hackers
Ideas are cheap. Execution is violent. User Avatar 25 comments Why I Pivoted from an AI Counseling Service to an AI Girlfriend Chat User Avatar 10 comments AI Visibility Is the New SEO for Indie Makers User Avatar 7 comments Product-led Growth User Avatar 6 comments Believing in your plan in 100% accuracy is Delusion. User Avatar 5 comments Validating an idea to help professionals reply safely to difficult work messages User Avatar 4 comments