Data Ingestion

Syncora lets you quickly upload sample data or connect external sources. It automatically detects schema and prepares your data for high-quality synthetic generation.

Supported Data Modalities

Syncora supports a wide range of data types to cover common machine learning and analytics use cases. Below are the currently supported modalities and their accepted file formats:

🔢 Tabular Data

  • Formats Supported: CSV

  • Use Cases: Ideal for structured data such as user profiles, transactions, and product catalogs.

  • Note: Syncora automatically infers column types and applies schema optimization.

🧾 JSON / JSONL

  • Formats Supported: TXT, PDF

  • Use Cases: Commonly used for model training (e.g., fine-tuning LLMs), dialog systems, and structured NLP inputs.

  • Note: Schema is auto-inferred. Ensure JSONL is line-delimited for best performance.

⏱️ Time-Series Data

  • Formats Supported: CSV

  • Required Schema: Must include at least two columns – a timestamp and a value.

  • Use Cases: Forecasting, behavioral analytics, anomaly detection, etc.

🖼️ Images (Beta)

  • Formats Supported: PNG, JPG

  • Labeling: Labels are optional but can be provided as a separate file or embedded metadata.

  • Use Cases: Computer vision tasks like image classification and object detection.

Plan-Based Quotas

Syncora’s generation limits are based on your selected plan and apply to both input and output data volumes.

Plan
Monthly Quota
Monthly Charges
Special Features

Free

100 MB

$ 0 per month

Suitable for beginners

Growth

8  GB

$499 per month

Designed for startups and smaller teams

Hyperscale

40  GB

$1999 per month

Suitable for large teams and model training workflows

Enterprise(upcoming)

Custom

Custom

Includes VPC deployment and flexible billing options

⚠️ Quotas are calculated as the combined size of all uploaded inputs and generated outputs.

Last updated