Datasets
Learn how to create and manage datasets for use in your experiments with our SDKs
Datasets allow you to store and reuse well-defined data for use in experiments. Datasets can be stored and versioned in Galileo, and available for experiments running both in the console as well as in code.
Dataset fields
Each record in a Galileo dataset can have three top-level fields:
input
- Input variables that can be passed to your application to recreate a test case.output
- Reference outputs to evaluate your application. These can be the ground truth for BLEU, ROUGE, and Ground Truth Adherence metrics, or reference outputs for manual reference.metadata
- Additional data you can use to filter or group your dataset.
Create datasets
When you create a dataset, it is uploaded to Galileo and available to future experiments. Datasets need to have unique names, and are available to all projects across your organization.
See the create_dataset
Python SDK docs or createDataset
TypeScript SDK docs for more details.
Get existing datasets
Once a dataset has been created in Galileo, you can retrieve it to use in your experiments by name or ID.
See the get_dataset
Python SDK docs or getDataset
TypeScript SDK docs for more details.
Add rows to existing datasets
See the add_rows
Python SDK docs or addRowsToDataset
TypeScript SDK docs for more details.
List datasets
You can retrieve all the datasets for a project.
See the list_datasets
Python SDK docs or getDatasets
TypeScript SDK docs for more details.
Delete datasets
If a dataset is no longer needed, you can delete it by name or ID.
See the delete_dataset
Python SDK docs or deleteDataset
TypeScript SDK docs for more details.
Work with dataset versions
Galileo automatically creates new versions of datasets when they are modified. You can access different versions by getting the dataset history.
See the get_dataset_version_history
Python SDK docs for more details.
Use datasets in experiments
Datasets are primarily used for running experiments to evaluate the performance of your LLM applications: