id
string
audio
audio
speaker_id
string
normalized_text
string
sentiment
string
start_second
float64
end_second
float64
"id10059_229vKIGbxrI_00001"
"id10059"
"and i i don't believe in god no religion says yet i was"
"Neutral"
0
4.24
"id10059_229vKIGbxrI_00002"
"id10059"
"the question because of my mother till i was fourteen when i thought about it when i emerged with"
"Neutral"
0
5.8
"id10059_229vKIGbxrI_00003"
"id10059"
"from my own culture things changed i i think about it a lot i value our"
"Neutral"
0
5.67
"id10059_229vKIGbxrI_00004"
"id10059"
"of god what is a creator the almighty that uh"
"Neutral"
0.45
4.52
"id10059_229vKIGbxrI_00005"
"id10059"
"i don't wanna pinpoint what exactly god is i i didn't see the burning"
"Neutral"
0
4.2
"id10059_229vKIGbxrI_00006"
"id10059"
"it's everywhere including the philosophy i think god's everywhere from the"
"Neutral"
0
4.52
"id10059_229vKIGbxrI_00007"
"id10059"
"um in portraying a character you project a lot of what you wanna do dramatically so i did a lot with that with the tiger but also when you portray something you're imitating it you're interpreting it so i"
"Neutral"
0
12.52
"id10059_229vKIGbxrI_00008"
"id10059"
"of a great art so you have to explore you have to give a"
"Neutral"
0
3.15
"id10059_229vKIGbxrI_00009"
"id10059"
"we don't quite know yet maybe we'll throw it away in five years from now maybe five years from now that's all we do i i i i don't know but i'm a novice of that media i got very"
"Neutral"
0
9.56
"id10059_229vKIGbxrI_00010"
"id10059"
"you know when i started out my career nobody gave me books like you know i have to write"
"Neutral"
0
5.04
"id10059_229vKIGbxrI_00011"
"id10059"
"uh i think i i chose the book because"
"Neutral"
0
3.75
"id10059_229vKIGbxrI_00012"
"id10059"
"and done their research put their soul into it it make me want to do some"
"Neutral"
0
4.36
"id10059_229vKIGbxrI_00013"
"id10059"
"most of the time i read them that make me want to do a movie not be not necessarily because i think"
"Neutral"
0
5.6
"id10059_229vKIGbxrI_00014"
"id10059"
"can measure success in so many ways for me i i rather think i'm i'm myself a"
"Neutral"
0
4.92
"id10059_229vKIGbxrI_00015"
"id10059"
"you've been cursed like why why this movie chose me to to to be the vehicle what have i done why do i have to go"
"Neutral"
0
6.42
"id10059_229vKIGbxrI_00016"
"id10059"
"it's very hard to judge success i i don't wanna measure success by box office or what or even word of mouth"
"Neutral"
0
7.44
"id10059_229vKIGbxrI_00017"
"id10059"
"how i connect and my crew connect with the experience of making it is something else to me every every movie we go through that"
"Neutral"
0
8.64
"id10059_229vKIGbxrI_00018"
"id10059"
"with all sincerity that's a success uh what what a way to leave our"
"Neutral"
0
4.16
"id10059_229vKIGbxrI_00019"
"id10059"
"oh this one i definitely am proud of the more difficult the more i'm proud of but i love them like all my"
"Neutral"
0
5.56
"id10059_229vKIGbxrI_00020"
"id10059"
"little patches of flashback of my memories experience they're all put"
"Neutral"
0
3.88
"id10059_2iL0P9T7pYY_00001"
"id10059"
"theater sometimes in the melodramas or operatic movies i will cry so hard a whole"
"Neutral"
0
5.88
"id10059_2iL0P9T7pYY_00002"
"id10059"
"uh it started from big movie to like your mainstream movies so for a long time it was"
"Neutral"
0
6.54
"id10059_2iL0P9T7pYY_00003"
"id10059"
"between me and sight and sound and i had some theatrical background that helped the traumatic event"
"Neutral"
0
5.84
"id10059_2iL0P9T7pYY_00004"
"id10059"
"just when when i everybody started to listen to me"
"Neutral"
0
3.96
"id10059_2iL0P9T7pYY_00005"
"id10059"
"it's a script computation held by the government i thought i might win some"
"Neutral"
0
4.32
"id10059_2iL0P9T7pYY_00006"
"id10059"
"i pitched them the movie they pitched them to me they say what a a king of"
"Neutral"
0
4.01
"id10059_2iL0P9T7pYY_00007"
"id10059"
"we're gonna make this movie the whole time i was hoping that"
"Neutral"
0
4.32
"id10059_2iL0P9T7pYY_00008"
"id10059"
"uh i told him i spent an hour total in the movie is like the"
"Neutral"
0
4.23
"id10059_2iL0P9T7pYY_00009"
"id10059"
"if you want me to make another movie which i didn't want to end my career i feel like retiring after"
"Neutral"
0
5.52
"id10059_2iL0P9T7pYY_00010"
"id10059"
"says a bad my my older son with make me that trip he sets bad example"
"Neutral"
0
4.29
"id10059_2iL0P9T7pYY_00011"
"id10059"
"so i wanna i did the funeral and i didn't want it have time to to"
"Neutral"
0
6.06
"id10059_2iL0P9T7pYY_00012"
"id10059"
"and when it hit the shopping mall it became a hit i got nervous i think people are gonna lynch me for portraying gay"
"Neutral"
0
7.03
"id10059_2iL0P9T7pYY_00013"
"id10059"
"yeah i had some difficulties some feel hesitated or didn't call back"
"Neutral"
0
4
"id10059_qM5FO3hHEbk_00001"
"id10059"
"uh for obvious reasons but i like that kind of work i need to work instead of uh sitting at home feeling bad for myself i need something i need a direction"
"Neutral"
0
9.6
"id10059_qM5FO3hHEbk_00002"
"id10059"
"less of a production stress if make your focus on preserving the performance only"
"Neutral"
0
6.48
"id10059_qM5FO3hHEbk_00003"
"id10059"
"and whatever ambition since you have since the student days all you have to do is finish the day and make sure that"
"Neutral"
0
6.28
"id10059_qM5FO3hHEbk_00004"
"id10059"
"yeah it feels that way it feels like that project that story was to meet the"
"Neutral"
0
4.35
"id10059_qM5FO3hHEbk_00005"
"id10059"
"making the movie is about trusting the tale trusting each other that's"
"Neutral"
0
4.63
"id10059_qM5FO3hHEbk_00006"
"id10059"
"then the second time i met her she took me around in the northern part of wyoming showed me the towns the folks"
"Neutral"
0
7.26
"id10059_qM5FO3hHEbk_00007"
"id10059"
"thing brokeback mountain would be uh same thing with larry when even more"
"Neutral"
0
4.96
"id10059_qM5FO3hHEbk_00008"
"id10059"
"but once i was onboard once she said what she had to say the wyoming way whenever i talk about being gay in wyoming but just wyoming"
"Neutral"
0
9.32
"id10059_qM5FO3hHEbk_00009"
"id10059"
"and the vibe the the feeling of wyoming the the west not the your western movie west but realistic uh rural life of the american west and the spirit of"
"Neutral"
0
10.72
"id10059_qM5FO3hHEbk_00010"
"id10059"
"and the foundation she wants to make sure it's captured uh once"
"Neutral"
0
4.71
"id10059_qM5FO3hHEbk_00011"
"id10059"
"bother us on the script or anything i'm sure she struggled with the script writer early on and that's many years before i was on board but once i start making the movie and it was my movies she would give us notes about don't forget this and that many details like that's just"
"Neutral"
0
17.57
"id10059_qM5FO3hHEbk_00012"
"id10059"
"not not really uh i i did ride with the devil another movie which is a pre western"
"Neutral"
0
4.68
"id10059_qM5FO3hHEbk_00013"
"id10059"
"well it's hard to get the the right the proper finance money to"
"Neutral"
0
5
"id10059_qM5FO3hHEbk_00014"
"id10059"
"they met and and herding sheep in the mountain brokeback mountain wyoming tough"
"Neutral"
0
5.57
"id10059_qM5FO3hHEbk_00015"
"id10059"
"one there that gets too colder both in the tent it tipped over they didn't know what happened they thought it was one summer"
"Neutral"
0
7.4
"id10059_qM5FO3hHEbk_00016"
"id10059"
"of become a functional father and husband leave their"
"Neutral"
0
4
"id10059_qM5FO3hHEbk_00017"
"id10059"
"and she kept trying to to hide out going back to brokeback mountain because having a life together is not possible"
"Neutral"
0
6.32
"id10059_qM5FO3hHEbk_00018"
"id10059"
"i was surprised scene by scene how they carried the age you know twenty to"
"Neutral"
0
4.68
"id10059_qM5FO3hHEbk_00019"
"id10059"
"put put in regular preparation i want him to carry that western mood"
"Neutral"
0
4.68
"id10059_qM5FO3hHEbk_00020"
"id10059"
"sort of the other side of america we're more familiar with the romantic the"
"Neutral"
0
4.04
"id10059_qM5FO3hHEbk_00021"
"id10059"
"there struck me as he's a uh actor with weight it it really holds your"
"Neutral"
0
5.65
"id10059_qM5FO3hHEbk_00022"
"id10059"
"all i could do for the movie cause i shot it that way the flashback happened twice"
"Neutral"
0
4.52
"id10059_qM5FO3hHEbk_00023"
"id10059"
"then he'd go for precisions cause there's no coverage in this scene i wanna do a one"
"Neutral"
0
4.53
"id10059_qM5FO3hHEbk_00024"
"id10059"
"yeah they just sent and sometimes something magic happened it's not one person's plan they have to correspond to each other"
"Neutral"
0
6.68
"id10059_qM5FO3hHEbk_00025"
"id10059"
"who who knows which one is is the most perfect one i also had to see how it fit with everything else in the movie so i was guessing i always kept doing it kept"
"Neutral"
0
8.56
"id10059_qM5FO3hHEbk_00026"
"id10059"
"me and because the challenge i think the only way to get out of it is they have to be"
"Neutral"
0
6
"id10059_qM5FO3hHEbk_00027"
"id10059"
"i think it's i was very moved um um i was hardly moved i was professional i remember uh"
"Neutral"
0
5.28
"id10059_qM5FO3hHEbk_00028"
"id10059"
"they're kind of besides themselves very genuine and i was i was"
"Neutral"
0
4.16
"id10059_qM5FO3hHEbk_00029"
"id10059"
"they feel like a cowboy they feel like ranch hand they know each other so much more"
"Neutral"
0
5.6
"id10059_qM5FO3hHEbk_00030"
"id10059"
"somebody asked me um how do you feel after this shooting this"
"Neutral"
0
4.08
"id10059_qM5FO3hHEbk_00031"
"id10059"
"and on the set i feel right at we're doing a scene it feels quite"
"Neutral"
0
4.36
"id10059_qM5FO3hHEbk_00032"
"id10059"
"i craft a movie and lay it out reveal the story in the way i think nobody knows anything"
"Neutral"
0
5.86
"id10059_qM5FO3hHEbk_00033"
"id10059"
"that knows so much and have anticipations and how do you take that into calculation how do you take that"
"Neutral"
0
6.16
"id10059_qM5FO3hHEbk_00034"
"id10059"
"things are not always in control like you have a thousand sheep"
"Neutral"
0
4.16
"id10059_qM5FO3hHEbk_00035"
"id10059"
"and shooting is like buying groceries you try to get the best ingredient"
"Neutral"
0
4.1
"id10059_qM5FO3hHEbk_00036"
"id10059"
"very exciting about shooting is you prepare for months from your decision to make it to all the way to the day you're shooting that scene it's a long process a lot of anticipation so many people's effort now everything's"
"Neutral"
0
12.99
"id10059_qM5FO3hHEbk_00037"
"id10059"
"and they're not in control you're battling it that moment's very precious especially"
"Neutral"
0
5
"id10059_qM5FO3hHEbk_00038"
"id10059"
"that moment is very precious to me i was in altered state which i am not in when i'm just laid back and editing"
"Neutral"
0
7.35
"id10059_qM5FO3hHEbk_00039"
"id10059"
"like in a i can shoot everything they still have to decide but still that's my decision"
"Neutral"
0
4.16
"id10059_qM5FO3hHEbk_00040"
"id10059"
"and help shaping it by observing it and finding it it's a little"
"Neutral"
0
4.26
"id10059_qM5FO3hHEbk_00041"
"id10059"
"and less uh politically really provocative i could get the movie i'm well"
"Neutral"
0
5.56
"id10059_qM5FO3hHEbk_00042"
"id10059"
"uh to do up to my standard in terms of productions that's a"
"Neutral"
0
3.79
"id10059_qM5FO3hHEbk_00043"
"id10059"
"you have to make things happen instead of two hundred people push you forward you push two hundred people you know something like that but it gets easier the more movie"
"Neutral"
0
7.91
"id10059_qM5FO3hHEbk_00044"
"id10059"
"i think the biggest battle is is is culture how do you make it how do you when a movie come out how did"
"Neutral"
0
6.73
"id10059_qM5FO3hHEbk_00045"
"id10059"
"but other than that it is unreal like yeah i feel like a little twilight"
"Neutral"
0
4.48
"id10059_qM5FO3hHEbk_00046"
"id10059"
"the less known material like brokeback mountain attracts me but that's also"
"Neutral"
0
4.84
"id10062_11wavAHn4cs_00001"
"id10062"
"you don't think about that person at all just because we're playing such different characters it's not we're not in a dystopian world our you know everything is different about these guys so"
"Neutral"
0
8.16
"id10062_11wavAHn4cs_00002"
"id10062"
"coping mechanisms i think that you know one could say that augustus is fearless but"
"Neutral"
0
4.8
"id10062_11wavAHn4cs_00003"
"id10062"
"and he is actually very very fearful of of dying without leaving his mark and hazel helps him"
"Neutral"
0
5.17
"id10062_11wavAHn4cs_00004"
"id10062"
"it's a like an idealistic way of thinking about things right everyone everyone wants to be not everyone but especially at a young age like you're told that like success is what you want and it you wanna change the world and then as you get"
"Neutral"
0
11.15
"id10062_11wavAHn4cs_00005"
"id10062"
"realizing you know even even if you do like leave a mark what does that what's the big deal like as hazel says oblivion is inevitable"
"Neutral"
0
7.64
"id10062_11wavAHn4cs_00006"
"id10062"
"um that yeah that's it's an i think it's a natural thing for young people to have so it's realistic that john writes a"
"Neutral"
0
6.17
"id10062_11wavAHn4cs_00007"
"id10062"
"any of my heroes that you know were were bad people or something i once heard a funny story someone someone said that they were like a"
"Neutral"
0
6.97
"id10062_11wavAHn4cs_00008"
"id10062"
"like a long time ago obviously a big fan of mickey mantle a huge fan and they waited and they waited outside of yankee stadium for him to show up and when he finally did they ran over to him and mickey mickey can i have an"
"Neutral"
0
9.91
"id10062_11wavAHn4cs_00009"
"id10062"
"they you know they're not obsessed with van houten they're just hazel's obsessed with this book"
"Neutral"
0
3.96
"id10062_11wavAHn4cs_00010"
"id10062"
"they're more you know they don't idolize in that way so when he does sorta spit on them they're sort of like screw you man we don't need you anymore what we have here is more"
"Neutral"
0
7.8
"id10062_EJCAgjbGKJQ_00001"
"id10062"
"i read the script about two years ago and i mean it was pretty much simultaneous that the the whole movie hype and the book rose together so it was um perfect timing like i i i actually read"
"Neutral"
0
9.76
"id10062_EJCAgjbGKJQ_00002"
"id10062"
"um which was when it was really getting popular and shortly it it became my favorite book and i really really loved it and once i read it i hadn't gotten the role yet and i was like i need to get"
"Neutral"
0
8.76
"id10062_EJCAgjbGKJQ_00003"
"id10062"
"i feel like everyone's dealt with cancer indirectly directly um someone they know or they know someone who knows someone um so that makes a story obviously very universal um but then also it's it's a above that it's a beautiful love story and i i think that's what makes our movie so special is that you leave the theater"
"Neutral"
0
18.16
"id10062_EJCAgjbGKJQ_00004"
"id10062"
"hopeful and um someone once told me that that once they left the theater that they called everyone they loved and told them that they love"
"Neutral"
0
6.6
"id10062_EJCAgjbGKJQ_00005"
"id10062"
"so it really is about appreciating what's in front of you and it's not just a sad story and i think that's"
"Neutral"
0
6.3
"id10062_EJCAgjbGKJQ_00006"
"id10062"
"i think that's always very interesting because especially as actors and when we're young and in school all they tell us to do is be natural and be as you know just be as natural as possible but sometimes people in real life have a have a"
"Neutral"
0
13.52
"id10062_EJCAgjbGKJQ_00007"
"id10062"
"and i think augustus in the beginning of the story definitely has a bluff and he's over the top and he"
"Neutral"
0
4.51
"id10062_EJCAgjbGKJQ_00008"
"id10062"
"that was i think that was important to us because it's not often that a studio movie would have their leading lady having a cannula in their mouth always or on the movie poster let alone and that that's something that hollywood doesn't do very often and we were really proud to to do something like that and really"
"Neutral"
0
17.28
"id10062_EJCAgjbGKJQ_00009"
"id10062"
"you know have the the real the main characters be the people with the disease and and show it and like not"
"Neutral"
0
5.78
"id10062_EJCAgjbGKJQ_00010"
"id10062"
"crazy feeling but you know i i mean i'm not gonna take credit for anything tom cruise is tom cruise and he's amazing uh it's just our story i think is a really special story and i couldn't be happier that you know that we"
"Neutral"
0
11.58
"id10062_EJCAgjbGKJQ_00011"
"id10062"
"when someone said you know you can always make art and make the best art you can but you can never guarantee you'll have an audience for it and something that we're all proud of me my costars shailene and nat um we're so proud because this is something we're all so happy with and proud of and this is the thing that we're getting the most audience for and getting recognized for and that"
"Neutral"
0
19.43
YAML Metadata Error: "configs[0]" must be of type object
YAML Metadata Error: "configs[1]" must be of type object

Dataset Card for SLUE

Dataset Summary

We introduce the Spoken Language Understanding Evaluation (SLUE) benchmark. The goals of our work are to

  • Track research progress on multiple SLU tasks
  • Facilitate the development of pre-trained representations by providing fine-tuning and eval sets for a variety of SLU tasks
  • Foster the open exchange of research by focusing on freely available datasets that all academic and industrial groups can easily use.

For this benchmark, we provide new annotation of publicly available, natural speech data for training and evaluation. We also provide a benchmark suite including code to download and pre-process the SLUE datasets, train the baseline models, and evaluate performance on SLUE tasks. Refer to Toolkit and Paper for more details.

Supported Tasks and Leaderboards

Automatic Speech Recognition (ASR)

Although this is not a SLU task, ASR can help analyze the performance of downstream SLU tasks on the same domain. Additionally, pipeline approaches depend on ASR outputs, making ASR relevant to SLU. ASR is evaluated using word error rate (WER).

Named Entity Recognition (NER)

Named entity recognition involves detecting the named entities and their tags (types) in a given sentence. We evaluate performance using micro-averaged F1 and label-F1 scores. The F1 score evaluates an unordered list of named entity phrase and tag pairs predicted for each sentence. Only the tag predictions are considered for label-F1.

Sentiment Analysis (SA)

Sentiment analysis refers to classifying a given speech segment as having negative, neutral, or positive sentiment. We evaluate SA using macro-averaged (unweighted) recall and F1 scores.[More Information Needed]

How-to-submit for your test set evaluation

See here https://asappresearch.github.io/slue-toolkit/how-to-submit.html

Languages

The language data in SLUE is in English.

Dataset Structure

Data Instances

voxpopuli

  • Size of downloaded dataset files: 398.45 MB
  • Size of the generated dataset: 5.81 MB
  • Total amount of disk used: 404.26 MB An example of 'train' looks as follows.
{'id': '20131007-0900-PLENARY-19-en_20131007-21:26:04_3',
 'audio': {'path': '/Users/username/.cache/huggingface/datasets/downloads/extracted/e35757b0971ac7ff5e2fcdc301bba0364857044be55481656e2ade6f7e1fd372/slue-voxpopuli/fine-tune/20131007-0900-PLENARY-19-en_20131007-21:26:04_3.ogg',
  'array': array([ 0.00132601,  0.00058881, -0.00052187, ...,  0.06857217,
          0.07835515,  0.07845446], dtype=float32),
  'sampling_rate': 16000},
 'speaker_id': 'None',
 'normalized_text': 'two thousand and twelve for instance the new brussels i regulation provides for the right for employees to sue several employers together and the right for employees to have access to courts in europe even if the employer is domiciled outside europe. the commission will',
 'raw_text': '2012. For instance, the new Brussels I Regulation provides for the right for employees to sue several employers together and the right for employees to have access to courts in Europe, even if the employer is domiciled outside Europe. The Commission will',
 'raw_ner': {'type': ['LOC', 'LOC', 'LAW', 'DATE'],
  'start': [227, 177, 28, 0],
  'length': [6, 6, 21, 4]},
 'normalized_ner': {'type': ['LOC', 'LOC', 'LAW', 'DATE'],
  'start': [243, 194, 45, 0],
  'length': [6, 6, 21, 23]},
 'raw_combined_ner': {'type': ['PLACE', 'PLACE', 'LAW', 'WHEN'],
  'start': [227, 177, 28, 0],
  'length': [6, 6, 21, 4]},
 'normalized_combined_ner': {'type': ['PLACE', 'PLACE', 'LAW', 'WHEN'],
  'start': [243, 194, 45, 0],
  'length': [6, 6, 21, 23]}}

voxceleb

  • Size of downloaded dataset files: 1.55 GB
  • Size of the generated dataset: 3.78 MB
  • Total amount of disk used: 1.55 GB An example of 'train' looks as follows.
{'id': 'id10059_229vKIGbxrI_00004',
 'audio': {'path': '/Users/felixwu/.cache/huggingface/datasets/downloads/extracted/400facb6d2f2496ebcd58a5ffe5fbf2798f363d1b719b888d28a29b872751626/slue-voxceleb/fine-tune_raw/id10059_229vKIGbxrI_00004.flac',
  'array': array([-0.00442505, -0.00204468,  0.00628662, ...,  0.00158691,
          0.00100708,  0.00033569], dtype=float32),
  'sampling_rate': 16000},
 'speaker_id': 'id10059',
 'normalized_text': 'of god what is a creator the almighty that uh',
 'sentiment': 'Neutral',
 'start_second': 0.45,
 'end_second': 4.52}

Data Fields

voxpopuli

  • id: a string id of an instance.
  • audio: audio feature of the raw audio. It is a dictionary containing the path to the downloaded audio file, the decoded audio array, and the sampling rate. Note that when accessing the audio column: dataset[0]["audio"] the audio file is automatically decoded and resampled to dataset.features["audio"].sampling_rate. Decoding and resampling of a large number of audio files might take a significant amount of time. Thus it is important to first query the sample index before the "audio" column, i.e. dataset[0]["audio"] should always be preferred over dataset["audio"][0].
  • speaker_id: a string of the speaker id.
  • raw_text: a string feature that contains the raw transcription of the audio.
  • normalized_text: a string feature that contains the normalized transcription of the audio which is used in the standard evaluation.
  • raw_ner: the NER annotation of the raw_text using the same 18 NER classes as OntoNotes.
  • normalized_ner: the NER annotation of the normalized_text using the same 18 NER classes as OntoNotes.
  • raw_combined_ner: the NER annotation of the raw_text using our 7 NER classes (WHEN, QUANT, PLACE, NORP, ORG, LAW, PERSON).
  • normalized_combined_ner: the NER annotation of the normalized_text using our 7 NER classes (WHEN, QUANT, PLACE, NORP, ORG, LAW, PERSON) which is used in the standard evaluation. Each NER annotation is a dictionary containing three lists: type, start, and length. type is a list of the NER tag types. start is a list of the start character position of each named entity in the corresponding text. length is a list of the number of characters of each named entity.

voxceleb

  • id: a string id of an instance.
  • audio: audio feature of the raw audio. Please use start_second and end_second to crop the transcribed segment. For example, dataset[0]["audio"]["array"][int(dataset[0]["start_second"] * dataset[0]["audio"]["sample_rate"]):int(dataset[0]["end_second"] * dataset[0]["audio"]["sample_rate"])]. It is a dictionary containing the path to the downloaded audio file, the decoded audio array, and the sampling rate. Note that when accessing the audio column: dataset[0]["audio"] the audio file is automatically decoded and resampled to dataset.features["audio"].sampling_rate. Decoding and resampling of a large number of audio files might take a significant amount of time. Thus it is important to first query the sample index before the "audio" column, i.e. dataset[0]["audio"] should always be preferred over dataset["audio"][0].
  • speaker_id: a string of the speaker id.
  • normalized_text: a string feature that contains the transcription of the audio segment.
  • sentiment: a string feature which can be Negative, Neutral, or Positive.
  • start_second: a float feature that specifies the start second of the audio segment.
  • end_second: a float feature that specifies the end second of the audio segment.

Data Splits

train validation test
voxpopuli 5000 1753 1842
voxceleb 5777 1454 3553
Here we use the standard split names in Huggingface's datasets, so the train and validation splits are the original fine-tune and dev splits of SLUE datasets, respectively.

Dataset Creation

Curation Rationale

[More Information Needed]

Source Data

Initial Data Collection and Normalization

[More Information Needed]

Who are the source language producers?

[More Information Needed]

Annotations

Annotation process

[More Information Needed]

Who are the annotators?

[More Information Needed]

Personal and Sensitive Information

[More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

[More Information Needed]

Discussion of Biases

[More Information Needed]

Other Known Limitations

[More Information Needed]

Additional Information

Dataset Curators

[More Information Needed]

Licensing Information

SLUE-VoxPopuli Dataset

SLUE-VoxPopuli dataset contains a subset of VoxPopuli dataset and the copyright of this subset remains the same with the original license, CC0. See also European Parliament's legal notice (https://www.europarl.europa.eu/legal-notice/en/)

Additionally, we provide named entity annotation (normalized_ner and raw_ner column in .tsv files) and it is covered with the same license as CC0.

SLUE-VoxCeleb Dataset

SLUE-VoxCeleb Dataset contains a subset of OXFORD VoxCeleb dataset and the copyright of this subset remains the same Creative Commons Attribution 4.0 International license as below. Additionally, we provide transcription, sentiment annotation and timestamp (start, end) that follows the same license to OXFORD VoxCeleb dataset.

Original License of OXFORD VGG VoxCeleb Dataset

VoxCeleb1 contains over 100,000 utterances for 1,251 celebrities, extracted from videos uploaded to YouTube.
VoxCeleb2 contains over a million utterances for 6,112 celebrities, extracted from videos uploaded to YouTube.

The speakers span a wide range of different ethnicities, accents, professions and ages.

We provide Youtube URLs, associated face detections, and timestamps, as well as cropped audio segments and cropped face videos from the dataset. The copyright of both the original and cropped versions of the videos remains with the original owners.

The data is covered under a Creative Commons Attribution 4.0 International license (Please read the license terms here. https://creativecommons.org/licenses/by/4.0/).

Downloading this dataset implies agreement to follow the same conditions for any modification and/or re-distribution of the dataset in any form.

Additionally any entity using this dataset agrees to the following conditions:

THIS DATASET IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

Please cite [1,2] below if you make use of the dataset.

[1] J. S. Chung, A. Nagrani, A. Zisserman
VoxCeleb2: Deep Speaker Recognition
INTERSPEECH, 2018.

[2] A. Nagrani, J. S. Chung, A. Zisserman VoxCeleb: a large-scale speaker identification dataset
INTERSPEECH, 2017

Citation Information

@inproceedings{shon2022slue,
  title={Slue: New benchmark tasks for spoken language understanding evaluation on natural speech},
  author={Shon, Suwon and Pasad, Ankita and Wu, Felix and Brusco, Pablo and Artzi, Yoav and Livescu, Karen and Han, Kyu J},
  booktitle={ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages={7927--7931},
  year={2022},
  organization={IEEE}
}

Contributions

Thanks to @fwu-asapp for adding this dataset.

Downloads last month
174