🦖 Efficiently evolve your old fixed-length data files into more modern file formats, fully parallelized!
-
Updated
May 29, 2024 - Rust
🦖 Efficiently evolve your old fixed-length data files into more modern file formats, fully parallelized!
C++ Faker library for generating fake (but realistic) data.
A novel approach for synthesizing tabular data using pretrained large language models
GRADE evaluation and processing scripts
Data generation and property-based testing for Elixir. 🔮
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Synthetic data generation for tabular data
A library to model multivariate data using copulas.
A toolkit for test data generation
Snaplet Documentation
Random data generator based on JSON schemas
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.
Test data management tool for any data source, batch or real-time
Example API implementation for Data Caterer
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
☕️🎵 A Music-Centric social network database development
An R package for simulating data
A command line tool for extracting machine learning ready data from software binaries powered by Radare2
Generate strings that match a given regular expression
Add a description, image, and links to the data-generation topic page so that developers can more easily learn about it.
To associate your repository with the data-generation topic, visit your repo's landing page and select "manage topics."