1
1 Comment

AI Era: what LLM model do you choose?

Fina Money uses LLM to power up its answer to users' financial questions. Initially using OpenAI's API. Observed the slow response on GPT-4 model, it makes me think, are there any alternatives that we may consider to balance the workload?

However, not all LLM models have the same quality to achieve the accuracy we want, this makes me test out a list of models available, and have a sense about what the landscape looks like regarding

  • Accuracy
  • Speed

To make it short, I would stay with GPT-4-Turbo, though the speed is still a concern, but literally there is no another one that could replace, here is my test report to share with everyone, if you are looking at the same problem for your APP, it maybe useful, check it out , it tests out these models to have a sense of the landscape for comparison:

  • gpt-4-turbo
  • gpt-3.5-turbo
  • llama3-8b-8192
  • llama3-70b-8192
  • gemma-7b-it
  • mixtral-8x7b-32768

https://app.fina.money/doc/jM8LYvPkm07xxg

posted to Icon for group Developers
Developers
on May 8, 2024
  1. 1

    Right now probably:
    GPT-4 for balance,
    Claude for long-form reasoning,
    and Gemini for large-context workflows.

    Feels less like “best model” now and more about using the right one for the task.

Trending on Indie Hackers
Most founders don't have a product problem. They have a visibility problem User Avatar 106 comments Day 4: Why I Built a $199 Workspace Nobody Asked For User Avatar 55 comments Hi IH — quick update. The MVP is live. User Avatar 28 comments I Built a Football Sentiment Platform in 18 Days. The World Cup Starts in 7 Days. Now I Need Distribution. User Avatar 17 comments Built an n8n booking alert system — is cold outreach dead for B2B micro-tools? User Avatar 16 comments Day 7: 51 people answered my question. I wasn't ready for what they said. User Avatar 15 comments