Does Safe-Doc store my documents?

No. Safe-Doc applies a no-retention approach: no document content kept in the database or logs. Temporary technical processing then purge. The processed file and mapping are downloaded to you.

How do I protect an NDA before analyzing it with AI?

Safe-Doc automatically pseudonymizes party names, amounts, dates and internal references in an NDA before sending to AI. You get a neutralized contract you can analyze freely with ChatGPT, Claude or Gemini, then re-inject the real values into the result.

Safe-Doc FAQ | Pseudonymisation, Shadow AI, GDPR, pricing and AI compatibility

Q: What is shadow AI?

Shadow AI is the use of external AI tools outside an official framework: no dedicated DPIA, no logging, no document protection before sending. It can be occasional (pasting a paragraph into ChatGPT) or massive (contracts, audits, HR files).

Q: How do I pseudonymize a PDF before sending it to ChatGPT?

With Safe-Doc: import the PDF, choose the protection level, review detected entities, export the pseudonymized text and paste it into ChatGPT. On return, import the JSON mapping to re-inject the real values into the response.

Q: What is the difference between pseudonymization and anonymization under GDPR?

Pseudonymization (GDPR Art. 4(5)) replaces identifying data with tokens while keeping a mapping that allows re-identification. Anonymization aims for irreversibility: if no cross-matching can re-identify, the data falls outside GDPR scope.

Q: Does Safe-Doc work with Microsoft Copilot?

Yes. Safe-Doc is AI-tool agnostic: ChatGPT, Claude, Gemini, Microsoft Copilot, Perplexity, Mistral. You pseudonymize with Safe-Doc, paste into the AI tool of your choice, retrieve the response and de-anonymize locally.

Q: How much does Safe-Doc cost?

Safe-Doc offers 4 plans: Solo at €35/month (1 seat, 1,200 pages), Cabinet at €149/month (3 seats, 6,400 pages), Cabinet+ at €399/month (10 seats, 20,000 pages), Enterprise on request. Add-ons: +€35/1,500 pages, +€35/extra seat.

Q: Do I need IT installation to deploy Safe-Doc?

No. Safe-Doc is a browser-accessible SaaS. No workstation installation, no IT integration required. An account is enough to get started in minutes.

Q: What is the AI Act and how does Safe-Doc help comply?

The European AI Act imposes transparency and risk-management obligations for AI system use, with a compliance deadline in August 2026. Safe-Doc helps by documenting each pseudonymization operation, reducing personal data exposure to third-party AI models, and enabling workflow traceability.

Q: Is Safe-Doc GDPR-compliant?

Safe-Doc is designed privacy-by-design: minimization, pseudonymization (GDPR Art. 4), traceability, EU hosting (Hetzner, Germany), 100% self-hosted AI, DPA available. It is a compliance aid, not a substitute for the controller's obligations.

Shadow AI & AI adoption

Why not simply ban ChatGPT, Claude or Gemini?

Because real usage already exists. In most organizations, employees use generative AI to move faster - often without a document security layer. That's Shadow AI: productive, but hard to audit.

Bans without alternatives create workarounds, lost productivity, and higher risk (people still send raw documents).

Safe-Doc doesn't replace internal policies. It enables controlled usage: pseudonymize before AI, stay in control, de-anonymize on the way back when needed.

What is Shadow AI?

Shadow AI is the use of external AI tools outside an official framework: no dedicated DPIA, no logging, no document protection before sending.

It can be small (pasting a paragraph) or massive (contracts, audits, HR files). Safe-Doc targets the latter - where document exposure is structural.

Does Safe-Doc replace ChatGPT, Claude or Mistral?

No. Safe-Doc is not an analysis model. It's a document security layer before and after the AI you already use.

You keep your usual tools on a pseudonymized or anonymized version - Safe-Doc doesn't lock you into a proprietary chat or a workspace that stores your case files.

How is Safe-Doc different from an "all-in-one" summary or analysis tool?

A summary tool processes content. Safe-Doc processes exposure risk: what goes to AI, what stays with you, and what can be re-identified later.

"Workspace + built-in chat" platforms often centralize files, history and mapping tables at the vendor. Safe-Doc inverts that: ephemeral processing, not a cloud archive of sensitive case files.

Product, workflow & modes

What does Safe-Doc do, concretely?

Safe-Doc follows four steps:

Import a document (file, pasted text, or Data Room batch)
Pseudonymize or anonymize depending on mode and level N1–N2 (N3 on roadmap)
Review detected entities (uncheck, adjust)
Export AI-ready text + JSON mapping (if reversible) + report

On return: paste the tokenized AI answer + import mapping to de-anonymize locally.

See the visual guide →

Single document vs Data Room: when to use which?

Single document - one contract, report or memo: fast, ideal for daily use.

Data Room - a homogeneous batch (contracts + annexes + exhibits): consistent pseudonymization - the same entity gets the same token across the batch. Essential for multi-document AI analysis.

What is the difference between Clean and Advanced?

Safe-Doc Clean

Everyday team AI usage
Pseudonymization or anonymization + metadata
Standard context reduction
Data Room included

Safe-Doc Advanced

High-sensitivity case files
Stronger multi-doc pseudonymization
Context + style reduction
Roadmap Q2 2026 - details →

What do levels N1–N2 and N3 mean?

N1–N2 : direct identifiers (names, emails, phones, IBAN…) and risky context (exact dates, amounts, locations, internal references)
N3 : stylistic fingerprint and weak signals (roadmap Q3 2026)

Higher levels make the document more generic - with less contextual precision to validate with the business.

What replacement modes are available?

Anonymized - masking oriented toward irreversibility
Pseudonymized - consistent tokens + exportable mapping
Fake data - plausible substitute values (depending on level)

Do I need IT installation to deploy Safe-Doc?

No. Safe-Doc is a browser-accessible SaaS. No workstation installation, no IT integration required. An account is enough to get started in minutes.

For team deployments (Cabinet or Cabinet+ plan), users receive an email invitation. No IT department involvement required for standard plans.

Pseudonymization & anonymization

What gets pseudonymized or anonymized?

Personal data: names, emails, phones, addresses, national IDs…
Confidential data: company names, internal references, contracts, amounts, IBAN…
Identifying context: exact dates, fine locations, rare combinations (by level)
Metadata: author, revision history, comments (DOCX), PDF layers

Exact scope depends on mode and level.

Full guide: pseudonymization vs anonymization →

Pseudonymize or anonymize: which to choose?

Pseudonymize if you need to reuse AI analysis with real values (mapping + de-anonymization).

Anonymize if you don't need re-identification and want fewer traces (no correspondence table to secure).

Safe-Doc offers both - not a single marketing choice.

Can we de-anonymize analysis from ChatGPT or Claude?

Yes in pseudonymized mode: import mapping_xxx.json, paste the AI answer with tokens, Safe-Doc re-injects values locally.

You remain responsible for protecting the mapping - don't share it on unsecured channels.

Is pseudonymization reversible?

Yes by design - that enables "analyze then re-identify". Reversibility relies on the mapping table; if it leaks, re-identification risk rises sharply.

Irreversible anonymization aims not to leave an exploitable link - but strict GDPR "anonymity" remains hard in practice on rich documents.

What is the difference between pseudonymization and anonymization under GDPR?

Pseudonymization (GDPR Art. 4(5)) replaces identifying data with tokens while keeping a mapping that allows re-identification. The data remains personal data under GDPR : processing stays subject to its obligations.

Anonymization aims for irreversibility: if no cross-matching can re-identify, the data falls outside GDPR scope. In practice on rich documents, that threshold is hard to reach.

Safe-Doc uses the right term : not approximate "anonymization" when re-injection is possible.

How do I pseudonymize a PDF before sending it to ChatGPT?

With Safe-Doc, the process takes 4 steps:

Import your PDF into Safe-Doc (drag-and-drop or upload)
Choose the protection level N1–N2
Review detected entities and export the pseudonymized text
Paste the pseudonymized text into ChatGPT : no sensitive data goes to OpenAI

On return, import the JSON mapping to re-inject the real values into ChatGPT's response.

See the step-by-step guide →

How do I protect an NDA or contract before analyzing it with AI?

An NDA typically contains party names, amounts, validity dates, confidentiality clauses and internal references : all elements that must not transit in plain text to a third-party AI model.

Safe-Doc automatically pseudonymizes all these entities before sending. You get a "neutralized" contract you can analyze freely with ChatGPT, Claude or Gemini : then re-inject the real values into the result.

Zero storage & architecture

Where do my documents go? Are they stored at Safe-Doc?

Safe-Doc is built on no-retention:

No document content kept in the database or application logs
temporary technical processing (upload, pipeline), then purge
processed file and mapping (if exported) are downloaded to you

Job metadata (status, counters, technical keys) may be kept for tracking - never the full document text.

Why is "zero trace" more reassuring than a document workspace?

Many LegalTech solutions offer a persistent project space: file versions, chat history, hosted mapping tables, long retention to re-identify later.

Convenient - but each upload increases risk surface: data lifetime at the vendor, support/backup access, DPA and sub-processor complexity.

Safe-Doc doesn't turn client files into a cloud vault. It's a technical pass-through: in → process → out → purge.

More: workspace vs zero trace →

Does Safe-Doc store my AI conversation history?

No. Safe-Doc doesn't replace ChatGPT: it doesn't store your prompts or third-party model answers. You keep exchanges in the AI tool you already use - Safe-Doc provides the upstream/downstream document layer.

Who is responsible for the mapping file?

You are. Mapping is generated for local export. Its protection (encryption, retention, internal sharing) is your organization's duty. Safe-Doc doesn't keep it as a central re-identification archive.

Security, GDPR & compliance

Is Safe-Doc GDPR-compliant?

Safe-Doc is designed privacy-by-design: minimization, pseudonymization (GDPR Art. 4), operation traceability, EU hosting (Hetzner, Germany), 100% self-hosted AI, DPA available.

It's a compliance aid - not a substitute for the controller's obligations (legal basis, retention, transparency, etc.).

Legal, ToU, DPA →

What is the AI Act and how does Safe-Doc help comply?

The European AI Act imposes transparency, documentation and risk-management obligations for AI system use : with a compliance deadline in August 2026 for high-risk systems.

Safe-Doc helps AI Act compliance by:

Documenting each pseudonymization operation (audit report)
Reducing personal data exposure to third-party AI models
Enabling workflow traceability for AI on sensitive documents

Safe-Doc is not a standalone AI Act compliance tool : consult your DPO for a full analysis.

Is it suitable for lawyers and professional secrecy?

Safe-Doc targets sensitive documents and controlled AI usage - including law firms. Pseudonymization before third-party AI reduces exposure, aligned with sector recommendations.

Lawyers and DPOs remain responsible for data choices, AI vendor selection and final review. Safe-Doc does not provide legal advice.

Where is data hosted?

Data processing infrastructure: European Union (Hetzner, Germany). Scaleway is used for transactional email only — no document hosting. No third-party AI API receives your content; you then send the pseudonymized version to the AI tool of your choice from your workstation.

Are my documents used to train AI models?

No. Safe-Doc doesn't use your files to train models. For external AI you use afterward, check your account terms (professional tiers with training opt-out).

What do Safe-Doc logs contain?

Technical indicators: job id, duration, counters (pages, PII replaced, warnings) - not document bodies, no plaintext PII in application logs.

Formats, detection & technical limits

Which formats are supported?

PDF and DOCX fully supported; text pasted in the UI. Scanned PDFs: OCR extraction depending on scan quality.

Detailed formats page →

Is detection infallible?

No. Any automatic engine has false positives (mask too much) and false negatives (miss an entity). Hence human review and residual risk indicators in the report.

What about metadata and invisible fingerprints?

Clean mode: common metadata removal (author, tool dates, Word comments, etc.). Stylistic fingerprint is Advanced territory (roadmap). No guarantee of removing all forensic traces.

Can I process health or classified data?

Only if your organization has appropriate legal basis and contractual framework. Safe-Doc is not HDS-certified or cleared for national defense secrets. When in doubt, consult your DPO or compliance officer.

AI tool compatibility

Does Safe-Doc work with Microsoft Copilot?

Yes. Safe-Doc is AI-tool agnostic : it works with any model or interface: ChatGPT, Claude, Gemini, Microsoft Copilot, Perplexity, Mistral and any other LLM accessible via web interface or API.

The principle is the same: pseudonymize with Safe-Doc, then copy-paste, call via API or connect through MCP to Copilot, retrieve the response and de-anonymize locally.

Which AI tools are compatible with Safe-Doc?

All AI tools accessible via web interface or API:

OpenAI: ChatGPT (all plans), GPT-4 API
Anthropic: Claude (claude.ai, API)
Google: Gemini, NotebookLM
Microsoft: Copilot, Azure OpenAI
Mistral, Perplexity, LLaMA via third-party interfaces

Safe-Doc does not integrate into these tools as a plugin : it operates upstream (pseudonymization) and downstream (de-anonymization). Three access modes: web UI, REST API or MCP server. No extension required.

What's the difference between Safe-Doc and Anonym-IA?

Both tools pseudonymize documents before AI. The main differences:

Market: Anonym-IA mainly targets lawyers; Safe-Doc targets all professionals handling sensitive documents (M&A, consulting, HR, finance, audit)
Architecture: Safe-Doc does not store your data or see your AI queries : unlike solutions with built-in AI chat where every query transits their servers
PII coverage: 55+ entity types detected across 6 countries
Formats: PDF, DOCX, OCR : multi-document Data Room

Plans & pricing

What plans and pricing?

Solo (€35/month · 1 seat · 1,200 pages), Cabinet (€149/month · 3 seats · 6,400 pages), Cabinet+ (€399/month · 10 seats · 20,000 pages). Enterprise on request. Add-ons: extra pages +€35/1,500 pages · extra seat +€35 (Cabinet and Cabinet+ only).

Pricing grid →

How much does Safe-Doc cost per user?

Safe-Doc offers 4 plans:

Solo : €35/month · 1 seat · 1,200 pages/month
Cabinet : €149/month · 3 seats · 6,400 pages/month (~€50/user)
Cabinet+ : €399/month · 10 seats · 20,000 pages/month (~€40/user)
Enterprise : on request · unlimited seats and pages · SSO · dedicated DPA

Add-ons: +€35 / 1,500 extra pages (all plans) · +€35 / extra seat (Cabinet and Cabinet+ only).

See full pricing →

Who is Safe-Doc for?

Legal, compliance, audit teams
Law firms and regulated professions
Finance, HR, M&A, operations
SMEs adopting AI without a dedicated Shadow AI function

Must we deploy company-wide at once?

No. Many customers start with a pilot team (legal, compliance, innovation) then extend usage rules: pseudonymize before any external AI on sensitive documents.

Limits, liability & warranties

Can Safe-Doc guarantee "zero risk" or absolute anonymity?

No. Safe-Doc significantly reduces exposure and re-identification risk - it doesn't remove all cross-matching possibilities (rare amounts, unique context, style).

The service is provided on a best-efforts basis. Human review and settings adapted to the case remain essential.

Who is liable if masking fails?

The user: level choice, entity validation, decision to send to third-party AI. Safe-Doc provides indicators and an action report - not legal validation of the final document.

Can I use Safe-Doc output without review?

Technically yes - not recommended. Any automated output should be reviewed before external transmission, especially on high-stakes matters (litigation, M&A, disciplinary).

Frequently asked questions

Safe-Doc Clean

Safe-Doc Advanced

A question not covered?