2
2 Comments

I built an open-source PII masking layer for LLM APIs — early traction, looking for design partners

I kept running into the same wall while talking to developers at healthtech and fintech companies: they wanted to use LLMs to automate workflows, but their data had names, emails, Aadhaar numbers, PAN cards, SSNs in it. Sending that to OpenAI or Anthropic felt wrong — legally and ethically.

Most teams were either skipping LLMs entirely or hand-rolling their own scrubbers. Neither felt like the right answer.

So I built Armos.

It wraps the OpenAI and Anthropic Python SDKs. Before your prompt goes out, PII is detected locally (nothing leaves your machine during detection), replaced with reversible tokens. The LLM sees tokens, responds with tokens, Armos swaps real values back. Your app gets the original text. The model never does.

The entire integration is one line:
client = ArmosOpenAI(OpenAI())

Where I am:

  • Just launched v1.2.1 on PyPI
  • Detects 10 entity types including India-specific ones (Aadhaar, PAN)
  • Got a warm lead from a tax automation company for a design partnership
  • HN post going up tomorrow

What I'm looking for:

  • Developers building on sensitive data (health, finance, legal, HR) who
    want to trial this early
  • Feedback on what's missing — entity types, framework integrations,
    async/streaming support
  • Honest criticism of the approach

Still early and rough around the edges. Would love to connect with anyone
hitting this problem.

GitHub: https://github.com/armos-ai/armos-python
Docs: https://armos.dev
pip install armos

posted to Icon for group Growth
Growth
on May 25, 2026
  1. 1

    This is a strong wedge because you are not selling “LLM security” in a vague way. You are solving a specific blocker that sensitive-data teams already feel: they want LLM automation, but they cannot casually send names, IDs, tax data, health data, or legal records into external models.

    The local detection plus reversible token layer is the right trust angle. I would make that the center of the positioning: Armos is not just a wrapper, it is the privacy boundary between regulated workflows and LLM APIs.

    One thing I’d pressure-test before the HN post and design partner conversations is the name. Armos is decent, but for healthtech, fintech, legal, and HR developers, the brand has to immediately feel secure, technical, and serious. This is infrastructure sitting between sensitive data and foundation models, so the name carries trust before the docs even do.

    Vroth .com would fit that layer better if you want it to feel like hard security infrastructure for LLM workflows, not just an open-source SDK. The product direction is strong enough that naming is not cosmetic here. It affects whether security-conscious developers read it as a real privacy layer or another early wrapper.

    1. 1

      Really appreciate this — the "privacy boundary between regulated workflows and LLM APIs" framing is sharper than how I've been positioning it. Stealing that.

      On the name — I hear you, and I don't disagree that names carry
      trust in security infra. But I'd rather not sweat it at this stage.

      No paid users, no enterprise contracts, nothing that makes a rebrand painful. If the
      product earns trust with the right teams, Armos won't have been the thing that stopped them. I'll revisit naming seriously before any real scaling push.

      What I'm more focused on right now is getting it in front of sensitive-data teams and letting them pressure-test the actual trust layer — the local detection, the reversible tokens, the zero PII to the model. That's where I want the feedback loop first.

      Are you building in any of these spaces? Would love to hear where you'd see this fitting or breaking.

Trending on Indie Hackers
AI runs 70% of my distribution. The exact stack. User Avatar 191 comments I used $30,983 of AI tokens last month in Claude code on $200/mo plan User Avatar 86 comments 30 days ago I posted here with $0 revenue. Here's what actually happened next. User Avatar 83 comments my reddit post got 600K+ views. here's exactly what i did User Avatar 43 comments I turned someone’s tweet into an app idea and it has made ~$3000 so far in 4 months. User Avatar 37 comments How to spot high-intent customers in 5 minutes, for free. User Avatar 35 comments