Skip to Content

Datasets

Provide NEO with access to your data using multiple methods and formats

NEO handles real-world data automatically, supporting various access methods, file formats, and data sources.


Supported File Formats

FormatTypeDescriptionPlatformVS Code
CSVTabularStandard format for tabular data and time series
ParquetTabularRecommended for datasets >100MB
JSONStructuredIdeal for logs and nested data structures
ExcelTabularBusiness data and reportsUpload only ✅
ImagesVisualJPG, PNG, TIFF formats (ZIP)ZIP ✅
AudioAudioWAV, MP3, FLAC formats (ZIP)ZIP ✅

Data Access Methods

Platform Mode

MethodDescriptionLimit
File UploadDrag and drop directly in chatMax 50MB per file
Public URLsReference public dataset URLsAny size
Cloud StorageS3, GCS, Azure via Secrets ManagerNo size limit
GitHubAccess repository datasetsPublic & Private repos
KaggleCompetition datasetsVia API

VS Code Extension

Integrated Providers:

ProviderUse Case
Amazon S3Datasets and model checkpoints
Weights & BiasesExperiment tracking and artifacts
Hugging FaceModel hub access
KaggleCompetition data
GitHubRepository datasets

Quick Setup Guide

StepActionDetails
Step 1Choose Access MethodUpload, URL, cloud, or local files
Step 2Prepare Your DataUse supported formats (CSV, Parquet, JSON, etc.)
Step 3Reference in TaskInclude file path or URL in your task description

Example Tasks

CSV Dataset Example

Analyze the retail sales data in sales_data.csv (columns: date, product_id,
quantity, price, store_id) and forecast demand for each product category.
Include confidence intervals.

Parquet Dataset Example

Use the large transaction dataset in transactions.parquet (10M+ records)
to detect fraudulent transactions. Optimize for precision to minimize
false positives.

Cloud Storage Example

Analyze customer feedback from s3://company-data/feedback/2024/ and
perform sentiment analysis. Generate monthly sentiment trends.

Multi-Source Example

Combine customer_data.parquet, transactions.json, and product_images.zip
to build a personalized recommendation engine.

Best Practices


Troubleshooting

IssueSolution
File Not FoundVerify file path, check spelling, ensure file exists
Format Not SupportedConvert to CSV, Parquet, or JSON
File Too LargeUse cloud storage or VS Code Extension
Access DeniedVerify credentials and permissions

Need Help?

ResourceDescription
Getting StartedLearn how to submit your first task
Use CasesSee data examples in action
FAQFind answers to common questions