SynthForge

Frequently Asked Questions

Everything you need to know about SynthForge

General

What is SynthForge?

SynthForge is a synthetic data generation platform that creates realistic, relationship-aware datasets. Describe what you need in plain English and let our AI agents design the schema, or take full control with our visual schema editor to build and customize tables, columns, and relationships yourself. Either way, SynthForge generates high-quality test data that respects your constraints and maintains referential integrity.

Who is SynthForge for?

SynthForge is designed for developers, QA engineers, data scientists, and anyone who needs realistic test data. Whether you're building a new application, testing database performance, creating demos, or training machine learning models, SynthForge can generate the data you need.

Is my data secure?

Yes. SynthForge generates entirely synthetic data, so no real personal information is ever used or stored. Your schema descriptions are processed securely, and generated datasets are stored with encryption. You maintain full control over your data and can delete it at any time.

Features & Capabilities

What types of data can I generate?

SynthForge supports over 100 unique field types across 12 categories including personal information (names, emails, addresses), financial data (credit cards, IBANs), healthcare codes (ICD-10, CPT), geographic data, dates, and more. Each field type generates contextually appropriate, realistic values.

How does the AI schema generation work?

When you describe your data needs, five specialized AI agents work in sequence: the Schema Architect converts your description to table definitions, the Relationship Mapper identifies foreign keys, the Constraint Validator ensures data integrity, the Data Generator creates realistic values, and the QA Agent validates the output quality.

Can I define relationships between tables?

Absolutely. SynthForge supports all standard relationship types: one-to-one (1:1), one-to-many (1:M), and many-to-many (M:M). Our AI automatically detects and creates foreign key relationships from your description, or you can use the visual schema editor to manually define them. Generated data always maintains referential integrity across all related tables.

What export formats are available?

You can export your generated data in multiple formats: CSV for spreadsheets, JSON for APIs, Parquet for big data workflows, and SQLite database files. For SQL exports, we generate optimized import scripts for PostgreSQL, MySQL, SQLite, SQL Server, MariaDB, DuckDB, and CockroachDB, each with proper CREATE TABLE statements, bulk loading commands, and foreign key constraints. All formats can be downloaded individually or as a compressed ZIP archive.

Are there pre-built templates available?

Yes! SynthForge includes built-in scenarios for common domains including banking systems, healthcare records, manufacturing data, e-commerce, and more. These templates provide a starting point that you can customize to match your specific requirements.

Technical

How many rows can I generate?

SynthForge can generate datasets of virtually any size. Our engine is optimized for high-throughput generation, often exceeding 100,000 rows per second. During early access, default limits are in place but you can easily request an increase at the click of a button!

Can I apply constraints to my data?

Yes. You can define constraints like minimum/maximum values, string length limits, regex patterns, enumerated values, date ranges, and distribution ratios. For example, you can specify that 45% of accounts should be "checking" and 40% "savings" types.

Does SynthForge support field correlations?

Yes. Field modifiers allow you to create correlations between fields. For example, product prices can vary based on category, or city names can be consistent with state values. This ensures your synthetic data has realistic internal relationships.

Getting Started

How do I get started?

Simply create an account and describe your data requirements in plain English. SynthForge will generate a schema for you to review and customize. Once you're satisfied, click generate to create your dataset and download it in your preferred format.

Ready to generate your first dataset?

Get started in minutes with our AI-powered data generation platform.

Launch SynthForge