8
23 Comments

Building a search engine for programmers

I’m considering building a search engine (yes, really) that is niche towards developers and professionals.

I’m personally annoyed and displeased with both Google and DuckDuckGo search results within this topic and I honestly think that I can do a better job.

It would take things such as the following into consideration:

  • Version. Always rank latest version of documentation over old ones.
  • Code examples and snippets increase ranking for relevant queries.
  • Show code snippets directly in search results.
  • Understand categories/topics on a more granular level.
  • Have higher understanding of brand and industry leaders. E.g official Python Docs should probably rank higher than someone’s tutorial on a query such as “Python reduce function”

That’s just a few of the ideas that I think could be used to enhance the search experience.

Does anyone else agree that they would like to see something like this? Any other ideas of how these type of queries could be enhanced and what features that could be used for ranking?

posted to Icon for group Ideas and Validation
Ideas and Validation
on December 18, 2019
  1. 3

    I agree that you could do better. But..

    • The problem is not that intense (Google works kinda okey-ish)
    • The market is very limited (~30 million software engineers in the world)
    • The solution is very hard (crawling and indexing the entire web)

    So, nice idea, but the above 3 criteria for commercially viable product point to suicide.

    1. 1

      I think despite the market being limited, devs are very tech savvy and if there was a search engine that provided much better tech-related search results than Google it would catch on.

    2. 1

      I agree with you that it’s not obvious that the problem is large enough for most people, it’s kind of a question mark and one of the reasons to why I posted here. Thanks for feedback.

      Regarding the other points I do not agree with you that they are problems or obstacles.

      • Plenty of dev focused businesses are major successes. I don’t see why a niche market of 30M people would be a problem.
      • An hard issue is the only issue worth solving. I’ve built 3 startups in the past and I’d only do another one if it was something inspiring and tricky to achieve.
      • A search engine with a niche does not need to index 100% of the web.
      1. 1

        My product is a Java developer tool for niche of 10m, so obviously I don't think that the developer market is overall "too small". But if you are building a true search engine, then the default assumption is that it will be ad funded, and for that 30 million is too small, IF you are seeking investment.

        1. 1

          I think that you have a lot of assumptions going into the definition of what a “true” search engine is. It’s simply a tool that allow you to find the information you are looking for.

          I think the ad revenue model might work quite well since it is a niche. An example of an ad revenue model within the same industry is StackOverflow that can highly specialize on job ads since they understand each user very well within the programmer niche.

          Thanks a lot for the feedback. In regards to if it’s too small for VC funding, you might be right! Great point.

          1. 1

            Well, your post started with "search engine" and by comparing to Google. That's a pretty direct way to make your readers assume basics of your product and business model...

            Anyway, just because you serve a niche doesn't mean you monetize significantly better: Google Ads are anyway targeted. If somebody searches for "Java exception" in Google that is just as valuable for advertiser as the same search in your product.

            The real reason why StackOverflow monetizes massively better per-user than search engines is that it is massively different than search engines: strong community that produces unique content, which is consumed on their own site, with correspondingly high engagement numbers.

            Also, please keep in mind that all this is from "Devil's Advocate" point of view. PersonallyI'd love to have better search engine for programming related searches & wish you will succeed.

            Good luck!

    3. 1

      This comment was deleted 6 years ago.

  2. 2

    This reminds me of Dash; https://kapeli.com/dash. I've been using this for searching documentation and code snippets. It has been super useful, and I'm a paying customer. What's nice about it is that it works offline and is super fast since docs are downloaded (and auto updated). And if what you search isn't in the docs, it uses Google right in the app. I believe it can scratch much of the same itch you describe.

    1. 2

      Very cool. I think their scope is much smaller than I envision, but you are right that they seem to attempt to scratch the same itch.

      Also interesting to see that there are multiple business models that might work for this type of project.

  3. 1

    I definitely want to see something like this! I would not focus on official documentation though, because most of that already allows navigating through versions etc.

    Instead, I would focus on other locations where less context is available. For example, StackOverflow or blog postings might be years old but still seem to be good enough for Google. If an engine could narrow this down, for example according to available versions at the posting time, and rank postings accordingly, this would help tremendously.

    I'll give you a concrete example where I keep experiencing such issues regularly. I've been an early adopter of Ember.js, a JavaScript framework. It has seen massive improvements over the years and lots of the information you find online is outdated and not valid anymore. I think this is particularly because the community is smaller and the influx of new content isn't just as big as e.g. for React, so old content isn't superseded so easily.

    Of course, there are tricks to narrow your Google results down to what you need. But a search engine that actually makes sense of the context, like version used or post date, and ranks results accordingly, would be super helpful.

    Admittedly, I am not sure if this is easy to monetize. Software engineers are good in blocking ads and I am not sure if the pain is big enough to pay for the service.

    1. 1

      Thanks a lot for such a positive message. Yeah I think versioning, both of docs but also of answers would be a tremendous benefit for developers.

      Another example is all the Angular vs AngularJS stuff that overlap a lot with each other.

      Hopefully a niched search engine like this would have a more granular understanding of the content within this industry.

  4. 1

    Great idea!
    I know he R language has a pretty good search engine: https://rseek.org/

    1. 1

      Super cool. I tried it out a bit and it seems very weighted towards R. It’s definitely something like that that I would aspire to create.

  5. 1

    We saw on the last months many Developer-focused companies go IPO or grow organically very well. I think that it's an excelent market.

    Crawling, indexing and understanding what the developer meant it's quite hard. Basically (I'm from Deep Learning field), you need to create a cross-reference corpora with "What the developer searched and found" x "What the code was" (basically 2 language models, and a good optimizer to align them).

    Also, did you checked out "Dash"?

    1. 1

      Thanks for the encouragement regarding market size. I’m currently an “AI Engineer” where I work with machine learning cases for some of the largest retailers in the world. I feel confident that I have the technical capability to create an MVP that is decent.

      I’ll definitely checkout Dash. Thanks.

  6. 1

    Sound awesome, specially code snippets directly on search results.

    However for me personally is a compromise between something specific or ease of access, sometimes I find it faster to search on google rather than going to the site directly even if I know is there because in a sense github search is a search engine but google does a better job.

    Google might not have best results for everything but they do a good damn job with focus keywords, site url and other search methods which you can narrow down.

    Also it is there directly on search bar which I don't see anywhere soon replacing with either duckduckgo or a specific dev search engine because what I will search depends very much on what I am doing.

    Where I would use such engine is if it was e.g directly on VS Code or my local environment.

    Ranking is a very complex story, for the docs sometimes people do a better job explaining shortly on a tutorial which is majority of how I learn things rather than official documentation.

    This is a rough idea on what I think , you might give it a try for me is a first time hearing something like this and would love to test.

    1. 1

      I think that you are right that search is a lot about convenience and getting the results quickly and easily. Your idea about integration with IDE sounds great.

      Regarding replacing Google in the search bar, I’m pretty sure that the millions of DuckDuckGo users our there use the browser plugins to search with the address bar and that they COMBINE their usage of DuckDuckGo with Google. Hopefully DuckDuckGo has opened the market somewhat in this regards.

  7. 1

    I get what your saying, I get how it's a bit of a better mouse trap, not convinced if I'd bother remembering it for a while, I don't see the pain that strong due there might be more at specific times... Possibly if you market it which teachings of development you might get some traction.

    Depending on you implementation details it could be a high project, but if you just sublet the searches and reorder them according to your rules, it could be half viable possibly.

    1. 1

      Thinking deeper, possibly searching for errors is more of a pain than the docs...

      1. 1

        Great point, error solving is probably one of the largest pains right now.

        Regarding implementation details, I would aspire to build my own crawler/parser/index and not reuse existing ones to have full control over rankings and data.

  8. 1

    Cool idea! It would be cool to have a "docs" tab that only contains documentation, similar to the Google "images" tab.

    1. 1

      Great idea! Hopefully doc style queries can be interpreted automatically.

      E.g. "python reduce" might indicate wanting to see documentation while "how to use python reduce function" might indicate wanting to see tutorials.

  9. 1

    Whilst I'm not a developer Invision have developed a similar niche targeted search engine for design inspiration which might inspire you in terms of UX rather than content: https://search.muz.li/?utm_source=Muzli_medium&utm_medium=muzli_medium_banner&utm_campaign=search_banner_main

    1. 1

      Thanks a lot, I'll definitely check it out.

Trending on Indie Hackers
I spent $0 on marketing and got 1,200 website visitors - Here's my exact playbook User Avatar 41 comments Why Early-Stage Founders Should Consider Skipping Prior Art Searches for Their Patent Applications User Avatar 22 comments I built eSIMKitStore — helping travelers stay online with instant QR-based eSIMs 🌍 User Avatar 20 comments Codenhack Beta — Full Access + Referral User Avatar 20 comments Veo 3.1 vs Sora 2: AI Video Generation in 2025 🎬🤖 User Avatar 18 comments Day 6 - Slow days as a solo founder User Avatar 13 comments