Training Data Transparency

Last updated: March 6, 2026

At BasaltHQ, we are committed to transparency and compliance in our AI development practices. In accordance with California AB 2013, this page outlines the high-level origins, nature, and licensing of the datasets used to train or fine-tune our models since January 1, 2022.

Data Inventory & Origin Mapping

Dataset Category	Origin / Source	License Status	IP & Privacy Audit (CCPA)
General Web Conversations (Anonymized)	Public Domain Crawls	Open Source / Public Iteration	Cleared. Pre-processed to remove PII.
Proprietary Sales Interaction Data	Licensed Vendors	Commercial License	Cleared. Subject to vendor indemnification. No user PI included.
Opt-In Customer Telemetry	BasaltHQ Internal Platform	First-Party (Terms of Service)	Ongoing. Users have the right to opt-out. Data is aggregated and anonymized.
Synthetic Enterprise Scenarios	Internally Generated (Base Models)	Proprietary	Cleared. Wholly synthetic, no real-world IP or PII included.

Privacy & Copyright Safeguards

Our IP and Privacy Audit processes strictly flag any data ingestion that includes copyrighted works (without license) or Personal Information (PI) as defined by the California Consumer Privacy Act (CCPA).

Personal Information (PI): We employ automated scrubbing routines to remove names, contact information, and specific identifiers from any data used for fine-tuning. Opt-in customer data used for training is aggregated.
Copyrighted Works: We rely exclusively on licensed, proprietary, or public domain data specifically cleared for commercial AI training. We honor standard opt-out protocols (like robots.txt exclusions) in any public domain data curation.

User Opt-Out

BasaltHQ allows workspace administrators and individual users to opt-out of their tenant data being utilized for ongoing model fine-tuning. For more details on managing your data preferences, please refer to your Workspace Settings or our Privacy Policy.

Contact

For inquiries regarding our data transparency practices or AI compliance programs, please contact our privacy compliance team at [email protected].