What you can synthesize
Oumi incorporates data synthesis as an iterative, repeatable part of your machine learning workflow. You can rapidly prototype datasets, expand small or imbalanced data, and evolve training data alongside your models. Some examples of what you can build with Oumi’s data synthesis include:- Question-answer datasets for training chatbots
- Instruction-following datasets with varied complexity levels
- Domain-specific training data (legal, medical, technical)
- Conversation datasets with different personas or styles
- Data augmentation to expand existing small datasets
What’s next
How It Works
Learn how Oumi data synthesis works.
Recipes
Find out what goes inside a data synthesis recipe.