A curated list of data for reasoning AI
Updated Jun 12, 2024
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
A lightweight library for generating synthetic instruction-tuning datasets for your data without GPT.
Synthetic data generation for tabular data
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers who require high-quality outputs, full data ownership, and overall efficiency.
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Synthesizer - a code for creating synthetic astrophysical spectra
Generation and evaluation of synthetic time series datasets (also augmentations, visualizations, and a collection of popular datasets)
Software for evaluating the quality of synthetic data compared with real data.
Benchmarking synthetic data generation methods.
Open-source version of the TDspora synthetic data generation algorithm.
Synthetic Data Generation for mixed-type, multivariate time series.
The Gretel Python Client allows you to interact with the Gretel REST API.
Conditional GAN for generating synthetic tabular data.
Synthetic Patient Population Simulator
This project allows users to generate synthetic videos from CAD models, including .npy files with additional information. Models are loaded dynamically into a Blender scene, and the camera smoothly moves along spherical points to create the final video.
[CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator