Datasets:

imageomics
/

KABR

Tasks:

Video Classification

Size Categories: 1M<n<10M

Tags: zebra giraffe plains zebra

DOI:

License: cc0-1.0

Dataset card Files Files and versions Community

Dataset Viewer

Auto-converted to Parquet

Go to dataset viewer

Viewer

original_vido_id video_id frame_id path labels string
"ZG0001.1 1 1 ZG0001.1/1.jpg 0"
"ZG0001.1 1 2 ZG0001.1/2.jpg 0"
"ZG0001.1 1 3 ZG0001.1/3.jpg 0"
"ZG0001.1 1 4 ZG0001.1/4.jpg 0"
"ZG0001.1 1 5 ZG0001.1/5.jpg 0"
"ZG0001.1 1 6 ZG0001.1/6.jpg 0"
"ZG0001.1 1 7 ZG0001.1/7.jpg 0"
"ZG0001.1 1 8 ZG0001.1/8.jpg 0"
"ZG0001.1 1 9 ZG0001.1/9.jpg 0"
"ZG0001.1 1 10 ZG0001.1/10.jpg 0"
"ZG0001.1 1 11 ZG0001.1/11.jpg 0"
"ZG0001.1 1 12 ZG0001.1/12.jpg 0"
"ZG0001.1 1 13 ZG0001.1/13.jpg 0"
"ZG0001.1 1 14 ZG0001.1/14.jpg 0"
"ZG0001.1 1 15 ZG0001.1/15.jpg 0"
"ZG0001.1 1 16 ZG0001.1/16.jpg 0"
"ZG0001.1 1 17 ZG0001.1/17.jpg 0"
"ZG0001.1 1 18 ZG0001.1/18.jpg 0"
"ZG0001.1 1 19 ZG0001.1/19.jpg 0"
"ZG0001.1 1 20 ZG0001.1/20.jpg 0"
"ZG0001.1 1 21 ZG0001.1/21.jpg 0"
"ZG0001.1 1 22 ZG0001.1/22.jpg 0"
"ZG0001.1 1 23 ZG0001.1/23.jpg 0"
"ZG0001.1 1 24 ZG0001.1/24.jpg 0"
"ZG0001.1 1 25 ZG0001.1/25.jpg 0"
"ZG0001.1 1 26 ZG0001.1/26.jpg 0"
"ZG0001.1 1 27 ZG0001.1/27.jpg 0"
"ZG0001.1 1 28 ZG0001.1/28.jpg 0"
"ZG0001.1 1 29 ZG0001.1/29.jpg 0"
"ZG0001.1 1 30 ZG0001.1/30.jpg 0"
"ZG0001.1 1 31 ZG0001.1/31.jpg 0"
"ZG0001.1 1 32 ZG0001.1/32.jpg 0"
"ZG0001.1 1 33 ZG0001.1/33.jpg 0"
"ZG0001.1 1 34 ZG0001.1/34.jpg 0"
"ZG0001.1 1 35 ZG0001.1/35.jpg 0"
"ZG0001.1 1 36 ZG0001.1/36.jpg 0"
"ZG0001.1 1 37 ZG0001.1/37.jpg 0"
"ZG0001.1 1 38 ZG0001.1/38.jpg 0"
"ZG0001.1 1 39 ZG0001.1/39.jpg 0"
"ZG0001.1 1 40 ZG0001.1/40.jpg 0"
"ZG0001.1 1 41 ZG0001.1/41.jpg 0"
"ZG0001.1 1 42 ZG0001.1/42.jpg 0"
"ZG0001.1 1 43 ZG0001.1/43.jpg 0"
"ZG0001.1 1 44 ZG0001.1/44.jpg 0"
"ZG0001.1 1 45 ZG0001.1/45.jpg 0"
"ZG0001.1 1 46 ZG0001.1/46.jpg 0"
"ZG0001.1 1 47 ZG0001.1/47.jpg 0"
"ZG0001.1 1 48 ZG0001.1/48.jpg 0"
"ZG0001.1 1 49 ZG0001.1/49.jpg 0"
"ZG0001.1 1 50 ZG0001.1/50.jpg 0"
"ZG0001.1 1 51 ZG0001.1/51.jpg 0"
"ZG0001.1 1 52 ZG0001.1/52.jpg 0"
"ZG0001.1 1 53 ZG0001.1/53.jpg 0"
"ZG0001.1 1 54 ZG0001.1/54.jpg 0"
"ZG0001.1 1 55 ZG0001.1/55.jpg 0"
"ZG0001.1 1 56 ZG0001.1/56.jpg 3"
"ZG0001.1 1 57 ZG0001.1/57.jpg 3"
"ZG0001.1 1 58 ZG0001.1/58.jpg 3"
"ZG0001.1 1 59 ZG0001.1/59.jpg 3"
"ZG0001.1 1 60 ZG0001.1/60.jpg 3"
"ZG0001.1 1 61 ZG0001.1/61.jpg 3"
"ZG0001.1 1 62 ZG0001.1/62.jpg 3"
"ZG0001.1 1 63 ZG0001.1/63.jpg 3"
"ZG0001.1 1 64 ZG0001.1/64.jpg 3"
"ZG0001.1 1 65 ZG0001.1/65.jpg 3"
"ZG0001.1 1 66 ZG0001.1/66.jpg 3"
"ZG0001.1 1 67 ZG0001.1/67.jpg 3"
"ZG0001.1 1 68 ZG0001.1/68.jpg 3"
"ZG0001.1 1 69 ZG0001.1/69.jpg 3"
"ZG0001.1 1 70 ZG0001.1/70.jpg 3"
"ZG0001.1 1 71 ZG0001.1/71.jpg 3"
"ZG0001.1 1 72 ZG0001.1/72.jpg 3"
"ZG0001.1 1 73 ZG0001.1/73.jpg 3"
"ZG0001.1 1 74 ZG0001.1/74.jpg 3"
"ZG0001.1 1 75 ZG0001.1/75.jpg 3"
"ZG0001.1 1 76 ZG0001.1/76.jpg 3"
"ZG0001.1 1 77 ZG0001.1/77.jpg 3"
"ZG0001.1 1 78 ZG0001.1/78.jpg 3"
"ZG0001.1 1 79 ZG0001.1/79.jpg 3"
"ZG0001.1 1 80 ZG0001.1/80.jpg 3"
"ZG0001.1 1 81 ZG0001.1/81.jpg 3"
"ZG0001.1 1 82 ZG0001.1/82.jpg 3"
"ZG0001.1 1 83 ZG0001.1/83.jpg 3"
"ZG0001.1 1 84 ZG0001.1/84.jpg 3"
"ZG0001.1 1 85 ZG0001.1/85.jpg 3"
"ZG0001.1 1 86 ZG0001.1/86.jpg 3"
"ZG0001.1 1 87 ZG0001.1/87.jpg 3"
"ZG0001.1 1 88 ZG0001.1/88.jpg 3"
"ZG0001.1 1 89 ZG0001.1/89.jpg 3"
"ZG0001.1 1 90 ZG0001.1/90.jpg 3"
"ZG0001.2 2 1 ZG0001.2/1.jpg 3"
"ZG0001.2 2 2 ZG0001.2/2.jpg 3"
"ZG0001.2 2 3 ZG0001.2/3.jpg 3"
"ZG0001.2 2 4 ZG0001.2/4.jpg 3"
"ZG0001.2 2 5 ZG0001.2/5.jpg 3"
"ZG0001.2 2 6 ZG0001.2/6.jpg 3"
"ZG0001.2 2 7 ZG0001.2/7.jpg 3"
"ZG0001.2 2 8 ZG0001.2/8.jpg 3"
"ZG0001.2 2 9 ZG0001.2/9.jpg 3"
"ZG0001.2 2 10 ZG0001.2/10.jpg 3"

Dataset Card for KABR: In-Situ Dataset for Kenyan Animal Behavior Recognition from Drone Videos

Dataset Summary

We present a novel high-quality dataset for animal behavior recognition from drone videos. The dataset is focused on Kenyan wildlife and contains behaviors of giraffes, plains zebras, and Grevy's zebras. The dataset consists of more than 10 hours of annotated videos, and it includes eight different classes, encompassing seven types of animal behavior and an additional category for occluded instances. In the annotation process for this dataset, a team of 10 people was involved, with an expert zoologist overseeing the process. Each behavior was labeled based on its distinctive features, using a standardized set of criteria to ensure consistency and accuracy across the annotations. The dataset was collected using drones that flew over the animals in the Mpala Research Centre in Kenya, providing high-quality video footage of the animal's natural behaviors. The drone footage is captured at a resolution of 5472 x 3078 pixels, and the videos were recorded at a frame rate of 29.97 frames per second.

Supported Tasks and Leaderboards

The results of our evaluation using I3D, SlowFast, and X3D architectures are given in the table below. For each one, the model was trained for 120 epochs with batch size of 5. For more information on these results, see our [paper](coming soon).

Method	All	Giraffes	Plains Zebras	Grevy’s Zebras
I3D (16x5)	53.41	61.82	58.75	46.73
SlowFast (16x5, 4x5)	52.92	61.15	60.60	47.42
X3D (16x5)	61.9	65.1	63.11	51.16

Languages

English

Dataset Structure

Under KABR/dataset/image/, the data has been archived into .zip files, which are split into 2GB files. These must be recombined and extracted. After cloning and navigating into the repository, you can use the following commands to do the reconstruction:

cd KABR/dataset/image/
cat giraffes_part_* > giraffes.zip
md5sum giraffes.zip # Compare this to what's shown with `cat giraffes_md5.txt`
unzip giraffes.zip
rm -rf giraffes_part_*

# Similarly for `zebras_grevys_part_*` and `zebras_plains_part_*`

Alternatively, there is a download script, download.py, which allows a download of the entire dataset in its established format without requiring one to clone the repository (cloning requires at least double the size of the dataset to store). To proceed with this approach, download download.py to the system where you want to access the data. Then, in the same directory as the script, run the following to begin the download:

pip install requests
python download.py

This script then downloads all the files present in the repository (without making a clone of the .git directory, etc.), concatenates the part files to their ZIP archives, verifies the MD5 checksums, extracts, and cleans up so that the folder structure, as described below, is present.

Note that it will require approximately 116GB of free space to complete this process, though the final dataset will only take about 61GB of disk space (the script removes the extra files after checking the download was successful).

The KABR dataset follows the Charades format:

KABR
    /dataset
        /image
            /video_1
                /image_1.jpg
                /image_2.jpg
                ...
                /image_n.jpg
            /video_2
                /image_1.jpg
                /image_2.jpg
                ...
                /image_n.jpg
            ...
            /video_n
                /image_1.jpg
                /image_2.jpg
                /image_3.jpg
                ...
                /image_n.jpg
    /annotation
        /classes.json
        /train.csv
        /val.csv

The dataset can be directly loaded and processed by the SlowFast framework.

Informational Files

KABR/configs: examples of SlowFast framework configs.
KABR/annotation/distribution.xlsx: distribution of classes for all videos.

Scripts:

image2video.py: Encode image sequences into the original video.
- For example, [image/G0067.1, image/G0067.2, ..., image/G0067.24] will be encoded into video/G0067.mp4.
image2visual.py: Encode image sequences into the original video with corresponding annotations.
- For example, [image/G0067.1, image/G0067.2, ..., image/G0067.24] will be encoded into visual/G0067.mp4.

Data Instances

Naming: Within the image folder, the video_n folders are named as follows (X indicates a number):

G0XXX.X - Giraffes
ZP0XXX.X - Plains Zebras
ZG0XXX.X - Grevy's Zebras
Within each of these folders the images are simply X.jpg.

Note: The dataset consists of a total of 1,139,893 frames captured from drone videos. There are 488,638 frames of Grevy's zebras, 492,507 frames of plains zebras, and 158,748 frames of giraffes.

Data Fields

There are 14,764 unique behavioral sequences in the dataset. These consist of eight distinct behaviors:

Walk
Trot
Run: animal is moving at a cantor or gallop
Graze: animal is eating grass or other vegetation
Browse: animal is eating trees or bushes
Head Up: animal is looking around or observe surroundings
Auto-Groom: animal is grooming itself (licking, scratching, or rubbing)
Occluded: animal is not fully visible

Data Splits

Training and validation sets are indicated by their respective CSV files (train.csv and val.csv), located within the annotation folder.

Dataset Creation

Curation Rationale

We present a novel high-quality dataset for animal behavior recognition from drone videos. The dataset is focused on Kenyan wildlife and contains behaviors of giraffes, plains zebras, and Grevy's zebras. The dataset consists of more than 10 hours of annotated videos, and it includes eight different classes, encompassing seven types of animal behavior and an additional category for occluded instances. In the annotation process for this dataset, a team of 10 people was involved, with an expert zoologist overseeing the process. Each behavior was labeled based on its distinctive features, using a standardized set of criteria to ensure consistency and accuracy across the annotations. The dataset was collected using drones that flew over the animals in the Mpala Research Centre in Kenya, providing high-quality video footage of the animal's natural behaviors. We believe that this dataset will be a valuable resource for researchers working on animal behavior recognition, as it provides a diverse and high-quality set of annotated videos that can be used for evaluating deep learning models. Additionally, the dataset can be used to study the behavior patterns of Kenyan animals and can help to inform conservation efforts and wildlife management strategies.

We provide a detailed description of the dataset and its annotation process, along with some initial experiments on the dataset using conventional deep learning models. The results demonstrate the effectiveness of the dataset for animal behavior recognition and highlight the potential for further research in this area.

Source Data

Initial Data Collection and Normalization

Data was collected from 6 January 2023 through 21 January 2023 at the Mpala Research Centre in Kenya under a Nacosti research license. We used DJI Mavic 2S drones equipped with cameras to record 5.4K resolution videos (5472 x 3078 pixels) from varying altitudes and distances of 10 to 50 meters from the animals (distance was determined by circumstances and safety regulations).

Mini-scenes were extracted from these videos to reduce the impact of drone movement and facilitate human annotation. Animals were detected in frame using YOLOv8, then the SORT tracking algorithm was applied to follow their movement. A 400 by 300 pixel window, centered on the animal, was then extracted; this is the mini-scene.

Annotations

Annotation process

In the annotation process for this dataset, a team of 10 people was involved, with an expert zoologist overseeing the process. Each behavior was labeled based on its distinctive features, using a standardized set of criteria to ensure consistency and accuracy across the annotations.

Personal and Sensitive Information

Though there are endangered species included in this data, exact locations are not provided and their safety is assured by their location within the preserve.

Considerations for Using the Data

Other Known Limitations

This data exhibits a long-tailed distribution due to the natural variation in frequency of the observed behaviors.

Additional Information

Authors

Maksim Kholiavchenko (Rensselaer Polytechnic Institute) - ORCID: 0000-0001-6757-1957
Jenna Kline (The Ohio State University)
Michelle Ramirez (The Ohio State University)
Sam Stevens (The Ohio State University)
Alec Sheets (The Ohio State University) - ORCID: 0000-0002-3737-1484
Reshma Ramesh Babu (The Ohio State University) - ORCID: 0000-0002-2517-5347
Namrata Banerji (The Ohio State University) - ORCID: 0000-0001-6813-0010
Elizabeth Campolongo (Imageomics Institute) - ORCID: 0000-0003-0846-2413
Nina Van Tiel (Eidgenössische Technische Hochschule Zürich) - ORCID: 0000-0001-6393-5629
Jackson Miliko (Mpala Research Centre)
Eduardo Bessa (Universidade de Brasília) - ORCID: 0000-0003-0606-5860
Tanya Berger-Wolf (The Ohio State University) - ORCID: 0000-0001-7610-1412
Daniel Rubenstein (Princeton University) - ORCID: 0000-0001-9049-5219
Charles Stewart (Rensselaer Polytechnic Institute)

Licensing Information

This dataset is dedicated to the public domain for the benefit of scientific pursuits. We ask that you cite the dataset using the below citation if you make use of it in your research.

Citation Information

@misc{KABR_Data,
  author = {Maksim Kholiavchenko and et. al},
  title = {KABR: In-Situ Dataset for Kenyan Animal Behavior Recognition from Drone Videos},
  year = {2023},
  url = {https://huggingface.co/datasets/imageomics/KABR},
  doi = {10.57967/hf/1010},
  publisher = {Model Database}
}

Contributions

The Imageomics Institute is funded by the US National Science Foundation's Harnessing the Data Revolution (HDR) Institute program under Award #2118240 (Imageomics: A New Frontier of Biological Information Powered by Knowledge-Guided Machine Learning).

Downloads last month: 4

Edit dataset card

Evaluate models Model Database Leaderboard

Size of the auto-converted Parquet files:

11.6 MB

Number of rows:

1,139,893