Skip to content
Dustin's AI Lab
Go back

PII Guard TW: A De-identification Tool Built for Taiwan

There's no off-the-shelf de-identification tool for Taiwan's PII formats. So I built one that keeps sensitive data on your machine while you send the rest to AI.


I was recently using AI to process client data, and the client was clear: sensitive information should not be sent to AI servers. But when I went looking for de-identification tools, I hit a wall—Taiwan’s PII formats (national ID numbers, local phone numbers, tax IDs) aren’t supported by any existing tool.

So I combined an open-source library with local adaptations and built pii-guard-tw. It automatically replaces PII in your documents with placeholders, lets you send the sanitized version to AI for processing, and then restores the original data afterward. Your real data never leaves your machine.

Supported PII Types

Supported File Formats

MCP Integration

There’s also an MCP server so you can plug it directly into Claude Code and use it seamlessly in your workflow.

Still very early stage—issues and PRs are welcome.

A Note for API / Enterprise Users

Claude API and Enterprise users can refer to Anthropic’s official ZDR (Zero Data Retention) policy—your data isn’t retained by default. For regular subscription users, besides using a de-identification tool, remember to go into your settings and turn off “Allow my data to be used for model training.” That way your data is only stored by Anthropic for 30 days instead of five years.


Share this post on:

Previous Post
Claude Code Source Leak: Three Secrets Unearthed by Reddit
Next Post
Anthropic's Trust Crisis and My Backup Plan