The standardized internal format used by Oumi to represent datasets, where data is structured as a sequence of messages with defined roles (e.g., user, assistant) and associated metadata.
Metadata that records the origin, transformations, and lineage of data within a dataset, helping ensure transparency, traceability, and reproducibility.
A scoring function or model that assesses the quality of model outputs according to specific criteria, such as accuracy, safety, or instruction adherence.
A configurable setting that influences how a machine learning model trains or generates predictions. Examples include learning rate, temperature, batch size, and max tokens.