Training Data Transparency
Last updated: March 6, 2026
At BasaltHQ, we are committed to transparency and compliance in our AI development practices. In accordance with California AB 2013, this page outlines the high-level origins, nature, and licensing of the datasets used to train or fine-tune our models since January 1, 2022.
Data Inventory & Origin Mapping
| Dataset Category | Origin / Source | License Status | IP & Privacy Audit (CCPA) |
|---|---|---|---|
| General Web Conversations (Anonymized) | Public Domain Crawls | Open Source / Public Iteration | Cleared. Pre-processed to remove PII. |
| Proprietary Sales Interaction Data | Licensed Vendors | Commercial License | Cleared. Subject to vendor indemnification. No user PI included. |
| Opt-In Customer Telemetry | BasaltHQ Internal Platform | First-Party (Terms of Service) | Ongoing. Users have the right to opt-out. Data is aggregated and anonymized. |
| Synthetic Enterprise Scenarios | Internally Generated (Base Models) | Proprietary | Cleared. Wholly synthetic, no real-world IP or PII included. |
Privacy & Copyright Safeguards
Our IP and Privacy Audit processes strictly flag any data ingestion that includes copyrighted works (without license) or Personal Information (PI) as defined by the California Consumer Privacy Act (CCPA).
- Personal Information (PI): We employ automated scrubbing routines to remove names, contact information, and specific identifiers from any data used for fine-tuning. Opt-in customer data used for training is aggregated.
- Copyrighted Works: We rely exclusively on licensed, proprietary, or public domain data specifically cleared for commercial AI training. We honor standard opt-out protocols (like robots.txt exclusions) in any public domain data curation.
User Opt-Out
BasaltHQ allows workspace administrators and individual users to opt-out of their tenant data being utilized for ongoing model fine-tuning. For more details on managing your data preferences, please refer to your Workspace Settings or our Privacy Policy.
Contact
For inquiries regarding our data transparency practices or AI compliance programs, please contact our privacy compliance team at [email protected].