Download Dataset & Dataset Format

In this page, we will introduce how to download the dataset and the dataset format.

Download

You can get the datasets from the GitHub Release Page.
If you just want to get one dataset, you can download it from the following links:

The dataset is stored in the form of a zip file. After downloading, you can unzip it and get the dataset. For more details about data format, please refer to the following section.

Single table fungi dataset

This dataset involves a single table. After unzipping the dataset, you will get the following files:

single_table_fungi_dataset
├── answers
│   ├── learn_knn.csv
│   └── test_knn.csv
├── bool_filter
│   ├── learn_filter.csv
│   └── test_filter.csv
├── metadata
│   └── base_metadata.csv
└── vectors
    ├── base_vectors.npy
    ├── learn_vectors.npy
    └── test_vectors.npy

There are four folders in the dataset, including answers, bool_filter, metadata and vectors. Now, let's explain the content of each folder.

Multi table fungi dataset

This dataset involves multiple tables. After unzipping the dataset, you will get the following files:

multi_table_movie_dataset
├── answers
│   ├── learn_knn.csv
│   └── test_knn.csv
├── bool_filter
│   ├── learn_filter.csv
│   └── test_filter.csv
├── metadata
│   ├── conversations.csv
│   ├── genres.csv
│   ├── movies.csv
│   ├── movies_genres.csv
│   ├── speakers.csv
│   └── utterances.csv
└── vectors
    ├── base_vectors.npy
    ├── learn_vectors.npy
    └── test_vectors.npy

Similar to the single table fungi dataset, there are four folders in the dataset, including answers, bool_filter, metadata and vectors. Now, let's explain the content of each folder.