Contract check · Vendor / SaaS contract

Can my SaaS vendor use my company's data to train their AI model?

The short answer

Whether your SaaS vendor can use your company's data to train their AI models depends on the agreement you accepted. Some vendors include explicit opt-out provisions or prohibit AI training on customer data entirely; others include broad license grants that permit training on aggregated, anonymized, or de-identified data derived from your inputs. The distinction between training on your raw customer data and training on aggregated or anonymized derivatives is significant — and how 'anonymized' is defined in the agreement determines how meaningful that distinction is in practice. Prompts you send to an AI-powered feature are often treated differently from other customer data. Scan your agreement to see what AI training rights, if any, the vendor has reserved.

Scan your agreement — free preview Free preview · Full report $6.99 · One-time, no subscription required

No account requiredFile deleted after analysisNot legal advice

What SaaS AI training clauses commonly say

As AI features have become common in SaaS products, vendors have added or updated license grant language to address training rights. Some agreements explicitly state that customer data will not be used to train general AI models. Others include a license grant permitting use of 'aggregated, anonymized, or de-identified data' for model improvement, benchmarking, or developing new features. A third category — typically consumer or freemium tiers — may include a broad license grant with no AI-specific restriction. The clause that governs your company is whichever terms were in effect when you accepted.

The difference between training on your raw data and training on aggregated derivatives matters in practice. Raw data training could expose proprietary business information or customer personal data; aggregated training may be less direct but still derives value from your usage. Prompts submitted to AI-powered features — questions you ask the tool, data you paste into a chat interface — are sometimes treated as a separate category from 'customer data' defined elsewhere in the agreement, with broader vendor rights.

Why this is a high-priority clause to find

Businesses report discovering that employee use of AI-powered SaaS features — summarization tools, code assistants, contract analysis products — was governed by license terms allowing the vendor to use those inputs for model improvement. The practical concern includes competitive information submitted as prompts, customer data processed through AI features, and the inability to claw back data once it has been incorporated into training. Opt-out clauses exist in many business-tier agreements but may require affirmative action or a separate addendum to activate.

What to look for in your agreement

Questions to ask before signing

Why scan instead of guess

The general rule tells you the baseline. Your agreement tells you what you’re actually being asked to sign — and the wording is what binds. Dang reads the document and flags the clauses worth reviewing, in plain English.

The deterministic engine scores and decides what’s risky. The AI only enriches the plain-English wording — AI extracts, code decides, never the other way around.

Your original file is deleted promptly after processing — we keep only the report you can read. No account needed for a one-time scan. Free preview first; full report $6.99, one-time.

Common questions

What is the difference between training on raw data vs. aggregated data?

Raw data training uses the actual content you input — documents, queries, customer records — which may contain proprietary or regulated information. Aggregated or anonymized training derives statistical patterns from usage across many customers, with identifying information removed. How well 'anonymized' holds up in practice depends on the definition in the agreement and the technical methods used; this is worth clarifying with the vendor.

Does opting out of AI training affect how the software works?

That depends on the vendor and the specific feature. For features that rely on customer-specific training, an opt-out may limit personalization. For general product AI features, an opt-out typically removes your data from training pipelines without affecting core functionality. The agreement and vendor documentation are the sources to check for your specific tool.