Data Ingestion
Syncora lets you quickly upload sample data or connect external sources. It automatically detects the schema and prepares your data for high-quality synthetic data generation.
Supported Data Modalities
Syncora supports a wide range of data types to cover common machine learning and analytics use cases. Below are the currently supported modalities and their accepted file formats:
🔢 Tabular Data
Formats Supported: CSV
Use Cases: Ideal for structured data such as user profiles, transactions, and product catalogs.
Note: Syncora automatically infers column types and applies schema optimization.
🧾 JSON / JSONL
Formats Supported: TXT, PDF
Use Cases: Commonly used for model training (e.g., fine-tuning LLMs), dialog systems, and structured NLP inputs.
Note: Schema is auto-inferred. For best performance, make sure each JSONL record is a complete JSON value on its own line.
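To check that a file is line-delimited before uploading, a local pre-flight check along these lines may help. This is a minimal sketch and not part of Syncora: the function name `find_invalid_jsonl_lines` is hypothetical, and the check only confirms that every non-blank line parses as JSON on its own.

```python
import json


def find_invalid_jsonl_lines(text: str) -> list[int]:
    """Return the 1-based numbers of lines that do not parse as standalone JSON."""
    bad_lines = []
    for lineno, line in enumerate(text.split("\n"), start=1):
        if not line.strip():
            continue  # blank lines are skipped in this sketch
        try:
            json.loads(line)
        except json.JSONDecodeError:
            bad_lines.append(lineno)
    return bad_lines


if __name__ == "__main__":
    sample = '{"prompt": "hi", "completion": "hello"}\n{"prompt": "bye"\n'
    print(find_invalid_jsonl_lines(sample))
```

A pretty-printed JSON document that spans several lines fails this check on most of its lines, which is a common reason a file that looks valid is not actually line-delimited.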
⏱️ Time-Series Data
Formats Supported: CSV
Required Schema: Must include at least two columns – a timestamp and a value.
Use Cases: Forecasting, behavioral analytics, anomaly detection, etc.
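The required schema can be checked locally before upload. The sketch below rests on assumptions that this page does not state: the column names `timestamp` and `value` are placeholders, timestamps are assumed to be ISO 8601 strings, and values are assumed to be numeric. Adjust all three to match your data.

```python
import csv
import io
from datetime import datetime


def check_time_series_csv(text, timestamp_col="timestamp", value_col="value"):
    """Return a list of problems found; an empty list means the checks passed."""
    reader = csv.DictReader(io.StringIO(text))
    header = reader.fieldnames or []
    problems = []
    if len(header) < 2:
        problems.append("need at least two columns")
    for col in (timestamp_col, value_col):
        if col not in header:
            problems.append(f"missing column: {col}")
    if problems:
        return problems
    # Row numbers count the header as row 1 (assumes no multi-line fields).
    for rownum, row in enumerate(reader, start=2):
        try:
            datetime.fromisoformat(row[timestamp_col])
        except (ValueError, TypeError):
            problems.append(f"row {rownum}: unparseable timestamp")
        try:
            float(row[value_col])
        except (ValueError, TypeError):
            problems.append(f"row {rownum}: non-numeric value")
    return problems


if __name__ == "__main__":
    sample = "timestamp,value\n2024-01-01T00:00:00,1.5\n2024-01-01T01:00:00,2\n"
    print(check_time_series_csv(sample))
```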
🖼️ Images (Beta)
Formats Supported: PNG, JPG
Labeling: Labels are optional but can be provided as a separate file or embedded metadata.
Use Cases: Computer vision tasks like image classification and object detection.
Plan-Based Quotas
Syncora’s generation limits are based on your selected plan and apply to both input and output data volumes.
Free
Quota: 100 MB
Price: $0 per month
Suitable for beginners
Growth
Quota: 8 GB
Price: $499 per month
Designed for startups and smaller teams
Hyperscale
Quota: 40 GB
Price: $1999 per month
Suitable for large teams and model training workflows
Enterprise (upcoming)
Quota: Custom
Price: Custom
Includes VPC deployment and flexible billing options
⚠️ Quotas are calculated as the combined size of all uploaded inputs and generated outputs.
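As a rough illustration of the quota arithmetic, the sketch below adds up input and output sizes and compares the total with a plan limit. The plan limits come from the list above. Everything else is an assumption: the helper name `quota_usage` is hypothetical, and the limits are interpreted in decimal units (1 MB = 10^6 bytes, 1 GB = 10^9 bytes), which this page does not specify.

```python
# Plan limits from this page, converted to bytes under the decimal-unit assumption.
PLAN_QUOTA_BYTES = {
    "Free": 100 * 10**6,
    "Growth": 8 * 10**9,
    "Hyperscale": 40 * 10**9,
}


def quota_usage(plan, input_sizes, output_sizes):
    """Return (bytes_used, bytes_remaining) for the given plan.

    bytes_used is the combined size of all inputs and outputs;
    bytes_remaining is negative when the quota is exceeded.
    """
    used = sum(input_sizes) + sum(output_sizes)
    return used, PLAN_QUOTA_BYTES[plan] - used


if __name__ == "__main__":
    used, remaining = quota_usage("Free", [60_000_000], [30_000_000])
    print(f"used={used} bytes, remaining={remaining} bytes")
```

Because outputs count toward the same limit, a 60 MB upload on the Free plan leaves at most 40 MB for generated data under these assumptions.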