Dataset Preview
Only a preview of the rows is shown; the full dataset viewer is not available.

image_id (string)
question_id (int32)
question (string)
question_tokens (sequence)
image (image)
image_width (int32)
image_height (int32)
flickr_original_url (string)
flickr_300k_url (string)
answers (sequence)
image_classes (sequence)
set_name (string)
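The columns above can be read as a record type. The following is a minimal Python sketch of that reading, not part of the dataset's own tooling: the names `PreviewRow` and `majority_answer` are hypothetical, the `image` column is left out because the preview rows carry no value for it, and taking the most frequent annotator answer is only one plausible way to summarize the `answers` list.

```python
from collections import Counter
from typing import List, TypedDict


class PreviewRow(TypedDict):
    """Hypothetical typed view of one preview row (image column omitted)."""
    image_id: str
    question_id: int
    question: str
    question_tokens: List[str]
    image_width: int
    image_height: int
    flickr_original_url: str
    flickr_300k_url: str
    answers: List[str]
    image_classes: List[str]
    set_name: str


def majority_answer(answers: List[str]) -> str:
    """Return the most frequent answer after trimming stray whitespace."""
    if not answers:
        raise ValueError("answers must not be empty")
    cleaned = [a.strip() for a in answers]
    # most_common(1) yields a one-element list of (answer, count) pairs.
    return Counter(cleaned).most_common(1)[0][0]


# Values copied from the first row of the preview.
row: PreviewRow = {
    "image_id": "0054c91397f2fe05",
    "question_id": 0,
    "question": "what is the brand of phone?",
    "question_tokens": ["what", "is", "the", "brand", "of", "phone"],
    "image_width": 1024,
    "image_height": 730,
    "flickr_original_url": "https://farm6.staticflickr.com/2891/9134076951_f65b421097_o.jpg",
    "flickr_300k_url": "https://c4.staticflickr.com/3/2891/9134076951_9db89d3e0f_z.jpg",
    "answers": ["nokia", "nokia", "nokia", "nokia", "toshiba",
                "nokia", "nokia", "nokia", "nokia", "nokia"],
    "image_classes": ["Belt", "Headphones", "Goggles", "Scale", "Bottle opener",
                      "Mobile phone", "Mirror", "Digital clock", "Television",
                      "Telephone", "Tool", "Wheel", "Camera", "Watch", "Glasses",
                      "Aircraft"],
    "set_name": "train",
}

print(majority_answer(row["answers"]))  # prints "nokia"
```

Trimming whitespace matters here because some answers in the preview carry trailing spaces (for example "roatia " next to "roatia"); without it, those would be counted as different strings.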
"0054c91397f2fe05"
0
"what is the brand of phone?"
[ "what", "is", "the", "brand", "of", "phone" ]
1,024
730
"https://farm6.staticflickr.com/2891/9134076951_f65b421097_o.jpg"
"https://c4.staticflickr.com/3/2891/9134076951_9db89d3e0f_z.jpg"
[ "nokia", "nokia", "nokia", "nokia", "toshiba", "nokia", "nokia", "nokia", "nokia", "nokia" ]
[ "Belt", "Headphones", "Goggles", "Scale", "Bottle opener", "Mobile phone", "Mirror", "Digital clock", "Television", "Telephone", "Tool", "Wheel", "Camera", "Watch", "Glasses", "Aircraft" ]
"train"
"005635e119b9f32f"
1
"what type of plane is this?"
[ "what", "type", "of", "plane", "is", "this" ]
1,024
667
"https://c8.staticflickr.com/3/2579/5811451782_31f8649055_o.jpg"
"https://c8.staticflickr.com/3/2579/5811451782_f6c4633327_z.jpg"
[ "lape", "cargo", "ec-agg", "lape", "lape", "lape", "lape", "lape", "lape", "airplane" ]
[ "Vehicle", "Helicopter", "Airplane", "Bomb", "Aircraft" ]
"train"
"005635e119b9f32f"
2
"what are the letters on the tail section of the plane?"
[ "what", "are", "the", "letters", "on", "the", "tail", "section", "of", "the", "plane" ]
1,024
667
"https://c8.staticflickr.com/3/2579/5811451782_31f8649055_o.jpg"
"https://c8.staticflickr.com/3/2579/5811451782_f6c4633327_z.jpg"
[ "ec agg", "ec-agg", "ec", "ec-agg", "ec", "ec", "ec", "ec", "ec goeland", "ec" ]
[ "Vehicle", "Helicopter", "Airplane", "Bomb", "Aircraft" ]
"train"
"00685bc495504d61"
3
"who is this copyrighted by?"
[ "who", "is", "this", "copyrighted", "by" ]
786
1,024
"https://farm2.staticflickr.com/5067/5620759429_4ea686e643_o.jpg"
"https://c5.staticflickr.com/6/5067/5620759429_f43a649fb5_z.jpg"
[ "simon clancy", "simon ciancy", "simon clancy", "simon clancy", "the brand is bayard", "simon clancy", "simon clancy", "simon clancy", "simon clancy", "simon clancy" ]
[ "Vehicle", "Tower", "Airplane", "Aircraft" ]
"train"
"00685bc495504d61"
4
"what brand is on the plane?"
[ "what", "brand", "is", "on", "the", "plane" ]
786
1,024
"https://farm2.staticflickr.com/5067/5620759429_4ea686e643_o.jpg"
"https://c5.staticflickr.com/6/5067/5620759429_f43a649fb5_z.jpg"
[ "virgin is the brand on the plane.", "virgin mobile", "virgin", "virgin", "virgin", "virgin", "virgin", "virgin", "virgin", "virgin" ]
[ "Vehicle", "Tower", "Airplane", "Aircraft" ]
"train"
"006d10667d17b924"
5
"what year is shown in the photo?"
[ "what", "year", "is", "shown", "in", "the", "photo" ]
1,024
768
"https://farm5.staticflickr.com/6196/6118278128_09f43f09eb_o.jpg"
"https://c4.staticflickr.com/7/6196/6118278128_0fc8a3e349_z.jpg"
[ "2011", "2011", "2011", "2011", "2011", "2011", "2011", "2011", "2011", "2011" ]
[ "Person", "Woman", "Man", "Tree", "Clothing", "Airplane", "Human face", "Aircraft" ]
"train"
"006d10667d17b924"
6
"what type of meeting is it?"
[ "what", "type", "of", "meeting", "is", "it" ]
1,024
768
"https://farm5.staticflickr.com/6196/6118278128_09f43f09eb_o.jpg"
"https://c4.staticflickr.com/7/6196/6118278128_0fc8a3e349_z.jpg"
[ "rimini", "royal theater", "rimini", "rimini", "rimini meeting", "rimini", "rimini meeting", "rimini", "rimini", "rimini" ]
[ "Person", "Woman", "Man", "Tree", "Clothing", "Airplane", "Human face", "Aircraft" ]
"train"
"00c359f294f7dcd9"
7
"what is the name of this plane?"
[ "what", "is", "the", "name", "of", "this", "plane" ]
1,024
680
"https://c2.staticflickr.com/9/8103/8562272505_2ce50b5a35_o.jpg"
"https://c1.staticflickr.com/9/8103/8562272505_5b42f9d199_z.jpg"
[ "g-atco", "g-atco", "g-atco", "g-atco", "g-atco", "g atco", "g-atco", "g-atco", "g-atco", "g-atco" ]
[ "Vehicle", "Helicopter", "Airplane", "Aircraft" ]
"train"
"00c359f294f7dcd9"
8
"what is the plane's call sign?"
[ "what", "is", "the", "plane", "'", "s", "call", "sign" ]
1,024
680
"https://c2.staticflickr.com/9/8103/8562272505_2ce50b5a35_o.jpg"
"https://c1.staticflickr.com/9/8103/8562272505_5b42f9d199_z.jpg"
[ "g-atco", "g-atco", "g-atco", "g-atco", "g-atco", "g-atco", "g-atco", "g-atco", "g-atco", "g-atco" ]
[ "Vehicle", "Helicopter", "Airplane", "Aircraft" ]
"train"
"0122c5279a501df2"
9
"what letter is on the plane's tail?"
[ "what", "letter", "is", "on", "the", "plane", "'", "s", "tail" ]
1,024
683
"https://farm3.staticflickr.com/8673/16416250538_594bcb48d2_o.jpg"
"https://c7.staticflickr.com/9/8673/16416250538_9328760686_z.jpg"
[ "f", "f", "f", "f", "f", "f", "f", "f", "f", "f" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"0122c5279a501df2"
10
"what airline is this?"
[ "what", "airline", "is", "this" ]
1,024
683
"https://farm3.staticflickr.com/8673/16416250538_594bcb48d2_o.jpg"
"https://c7.staticflickr.com/9/8673/16416250538_9328760686_z.jpg"
[ "finn", "finn", "finn air", "finnair", "finn", "finnair", "finn", "finnair", "finnair", "unanswerable" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"015c537f722d115e"
11
"what airline is this plane for?"
[ "what", "airline", "is", "this", "plane", "for" ]
1,024
448
"https://c7.staticflickr.com/3/2682/4481270420_2b2e163583_o.jpg"
"https://c8.staticflickr.com/3/2682/4481270420_728ee098f0_z.jpg"
[ "airfrance", "air france", "airfrance", "air france", "airfrance", "airfrance", "spanish", "air france", "air france", "air france" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"0202faf23a9aae11"
12
"what does it say on the plane?"
[ "what", "does", "it", "say", "on", "the", "plane" ]
1,024
1,024
"https://farm2.staticflickr.com/5473/9450199255_515b079639_o.jpg"
"https://c1.staticflickr.com/6/5473/9450199255_43029a3cd1_z.jpg"
[ "croatia", "croatia", "roatia", "roatia", "croatia", "croatia", "croatia", "croatia", "roatia", "roatia " ]
[ "Building", "Person", "Vehicle", "Airplane", "Human face", "Aircraft" ]
"train"
"02b2daa20b00f5d3"
13
"what website is this jet associated with?"
[ "what", "website", "is", "this", "jet", "associated", "with" ]
1,024
682
"https://farm4.staticflickr.com/7403/13914609743_2004ef8b74_o.jpg"
"https://c3.staticflickr.com/8/7403/13914609743_23fdb967b3_z.jpg"
[ "jet2.com", "jet2.com", "jet2.com", "jet2.com", "jet2", "jet2.com", "jet2.com", "jet2.com", "jet2", "jet2.com" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"02b2daa20b00f5d3"
14
"what does this jet advertise as having?"
[ "what", "does", "this", "jet", "advertise", "as", "having" ]
1,024
682
"https://farm4.staticflickr.com/7403/13914609743_2004ef8b74_o.jpg"
"https://c3.staticflickr.com/8/7403/13914609743_23fdb967b3_z.jpg"
[ "friendly low fares", "friendly low fares", "friendly low fares", "friendly low fares", "friendly low fares", "friendly low fares", "friendly low fares", "friendly low fairs", "friendly low fares", "friendly low fares" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"02da7a42e971e3cb"
15
"what letters are embellished on the parachute?"
[ "what", "letters", "are", "embellished", "on", "the", "parachute" ]
1,024
903
"https://c4.staticflickr.com/5/4102/4759963602_47a83be793_o.jpg"
"https://c4.staticflickr.com/5/4102/4759963602_5ea2beab2f_z.jpg"
[ "raf", "raf", "raf", "raf", "r a f", "raf", "raf", "raf", "raf", "raf" ]
[ "Vehicle", "Helicopter", "Parachute", "Aircraft" ]
"train"
"02da7a42e971e3cb"
16
"what are the letters being displayed?"
[ "what", "are", "the", "letters", "being", "displayed" ]
1,024
903
"https://c4.staticflickr.com/5/4102/4759963602_47a83be793_o.jpg"
"https://c4.staticflickr.com/5/4102/4759963602_5ea2beab2f_z.jpg"
[ "raf", "raf", "raf", "raf", "raf", "raf", "raf", "raf", "raf", "raf" ]
[ "Vehicle", "Helicopter", "Parachute", "Aircraft" ]
"train"
"0300066daef7157a"
17
"what is the last number of the plane?"
[ "what", "is", "the", "last", "number", "of", "the", "plane" ]
1,024
683
"https://c3.staticflickr.com/6/5102/5650736526_9ec0ea741b_o.jpg"
"https://c2.staticflickr.com/6/5102/5650736526_28c63333c5_z.jpg"
[ "3", "3", "3", "traktor", "3", "3", "3", "3", "3", "3" ]
[ "Vehicle", "Ambulance", "Airplane", "Aircraft" ]
"train"
"0320299003ff65fb"
18
"what letters are visible on the tail of the helicopter?"
[ "what", "letters", "are", "visible", "on", "the", "tail", "of", "the", "helicopter" ]
1,024
584
"https://c4.staticflickr.com/9/8003/7476657322_eb2e6cf61c_o.jpg"
"https://c7.staticflickr.com/9/8003/7476657322_225c533aae_z.jpg"
[ "adx", "danger", "ad", "adx", "adx", "adx", "adx", "adx", "adx", "adx" ]
[ "Person", "Vehicle", "Helicopter", "Footwear", "Airplane", "Aircraft" ]
"train"
"0393c9d77b8215a3"
19
"what is the plane name?"
[ "what", "is", "the", "plane", "name" ]
1,024
532
"https://farm4.staticflickr.com/7317/8720879958_791373f9ce_o.jpg"
"https://c1.staticflickr.com/8/7317/8720879958_85e108e726_z.jpg"
[ "728tfw", "f-16", "wp", "wp", "wp", "wp", "wp", "wp", "wp", "unanswerable" ]
[ "Vehicle", "Boat", "Clock", "Airplane", "Aircraft" ]
"train"
"0393c9d77b8215a3"
20
"what is the plane number?"
[ "what", "is", "the", "plane", "number" ]
1,024
532
"https://farm4.staticflickr.com/7317/8720879958_791373f9ce_o.jpg"
"https://c1.staticflickr.com/8/7317/8720879958_85e108e726_z.jpg"
[ "728", "728tfw", "81", "81728", "728tfw", "unanswerable", "af91728tfw", "728tfw", "8", "wp" ]
[ "Vehicle", "Boat", "Clock", "Airplane", "Aircraft" ]
"train"
"0394933dcd965048"
21
"what does the train say?"
[ "what", "does", "the", "train", "say" ]
1,024
769
"https://c1.staticflickr.com/8/7271/7013089685_34455c39b6_o.jpg"
"https://c8.staticflickr.com/8/7271/7013089685_e28a82e463_z.jpg"
[ "ricklinghausen", "rocklinghausen ", "recklinghausen", "rockinghausen", "rocklinghausen", "recklinghausen ", "rocklinghausen", "rocklinghausen", "recklkinghausen", "recklinghausen " ]
[ "Train", "Boy", "Person", "Woman", "Man", "Vehicle", "Clothing", "Footwear", "Airplane", "Aircraft" ]
"train"
"03e38d16213c4067"
22
"what organization do these men work for?"
[ "what", "organization", "do", "these", "men", "work", "for" ]
1,024
682
"https://farm8.staticflickr.com/3383/3184901622_cf5f09010a_o.jpg"
"https://c3.staticflickr.com/4/3383/3184901622_ff62bb323c_z.jpg"
[ "politi", "politi", "politi", "politi", "politi", "politi", "politi", "politi", "politi", "police" ]
[ "Bus", "Ambulance", "Person", "Land vehicle", "Stretcher", "Vehicle", "Auto part", "Van", "Tire", "Car", "Aircraft" ]
"train"
"03e38d16213c4067"
23
"what two numbers can be seen on the white post on the right?"
[ "what", "two", "numbers", "can", "be", "seen", "on", "the", "white", "post", "on", "the", "right" ]
1,024
682
"https://farm8.staticflickr.com/3383/3184901622_cf5f09010a_o.jpg"
"https://c3.staticflickr.com/4/3383/3184901622_ff62bb323c_z.jpg"
[ "23", "25", "25 50", "unanswerable", "25, 50 ", "25 and 50", "25, 50", "25, 50", "23 50", "25-50" ]
[ "Bus", "Ambulance", "Person", "Land vehicle", "Stretcher", "Vehicle", "Auto part", "Van", "Tire", "Car", "Aircraft" ]
"train"
"0492ee5ac9a69515"
24
"what letters are on the craft?"
[ "what", "letters", "are", "on", "the", "craft" ]
1,024
683
"https://farm6.staticflickr.com/2909/14180467188_8880ca1448_o.jpg"
"https://c6.staticflickr.com/3/2909/14180467188_5d7ab99262_z.jpg"
[ "f-pdhv", "f-pdhv", "f-pdhv", "f-pdhv", "f-pdhv", "fpdhv", "f-pdhv", "f-pdhv", "f-pdhv", "f-pdhv" ]
[ "Person", "Vehicle", "Airplane", "Aircraft" ]
"train"
"0492ee5ac9a69515"
25
"what website is labeled on the plane?"
[ "what", "website", "is", "labeled", "on", "the", "plane" ]
1,024
683
"https://farm6.staticflickr.com/2909/14180467188_8880ca1448_o.jpg"
"https://c6.staticflickr.com/3/2909/14180467188_5d7ab99262_z.jpg"
[ "fpdhv", "www.verheesengineering.com", "unanswerable", "verhesengineering.com", "unanswerable", "verheesenginering.com", "www.verheesengineering.com", "www.verhwwsengineering.com", "2", "www.verheesengineering.com" ]
[ "Person", "Vehicle", "Airplane", "Aircraft" ]
"train"
"04dfad38d434409c"
26
"what happens if you pull?"
[ "what", "happens", "if", "you", "pull" ]
973
1,024
"https://c5.staticflickr.com/4/3315/3411908910_a8ecbfcaeb_o.jpg"
"https://c8.staticflickr.com/4/3315/3411908910_4c2f8e5fbc_z.jpg?zz=1"
[ "stop", "emergency stop", "emergency stop", "stop", "stop", "emergency stop", "emergency stop", "it is an emergency stop", "emergency stop", "unanswerable" ]
[ "Land vehicle", "Vehicle", "Airplane", "Car", "Aircraft" ]
"train"
"04dfad38d434409c"
27
"what does the sticker on the window say?"
[ "what", "does", "the", "sticker", "on", "the", "window", "say" ]
973
1,024
"https://c5.staticflickr.com/4/3315/3411908910_a8ecbfcaeb_o.jpg"
"https://c8.staticflickr.com/4/3315/3411908910_4c2f8e5fbc_z.jpg?zz=1"
[ "aa", "aa", "aa", "aa", "aa", "the sticker says emergency stop pull.", "aa", "emergency stop pull", "aa", "aa" ]
[ "Land vehicle", "Vehicle", "Airplane", "Car", "Aircraft" ]
"train"
"05a4fa87bfdaa243"
28
"what made this airplane?"
[ "what", "made", "this", "airplane" ]
1,024
682
"https://farm4.staticflickr.com/59/200317024_00b0454199_o.jpg"
"https://c1.staticflickr.com/1/59/200317024_00b0454199_z.jpg?zz=1"
[ "biman", "biman", "biman", "biman", "biman", "biman", "biman", "biman", "biman", "biman" ]
[ "Vehicle", "Rocket", "Airplane", "Aircraft" ]
"train"
"05a4fa87bfdaa243"
29
"what kind of plane is this?"
[ "what", "kind", "of", "plane", "is", "this" ]
1,024
682
"https://farm4.staticflickr.com/59/200317024_00b0454199_o.jpg"
"https://c1.staticflickr.com/1/59/200317024_00b0454199_z.jpg?zz=1"
[ "biman", "biman", "bitman", "biman", "biman", "dc 10-30", "biman", "biman", "biman", "biman" ]
[ "Vehicle", "Rocket", "Airplane", "Aircraft" ]
"train"
"05e8721b42640337"
30
"what are the numbers on the plane?"
[ "what", "are", "the", "numbers", "on", "the", "plane" ]
1,024
681
"https://farm8.staticflickr.com/5473/11998619305_9887d267f3_o.jpg"
"https://c6.staticflickr.com/6/5473/11998619305_f909aa2fc2_z.jpg"
[ "n337am", "n337am", "n337am", "n337am", "337", "n337am", "337", "337", "n337am", "337" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"06628eaed257fa24"
31
"what word does one of the hot balloons feature?"
[ "what", "word", "does", "one", "of", "the", "hot", "balloons", "feature" ]
1,024
683
"https://farm5.staticflickr.com/5546/10262386264_6915b9587b_o.jpg"
"https://c3.staticflickr.com/6/5546/10262386264_f40d3fc168_z.jpg"
[ "when", "when", "yamaha", "when", "when", "when pigs fly", "bobby j's", "when", "when", "when apes fly" ]
[ "Vehicle", "Balloon", "Parachute", "Aircraft" ]
"train"
"073f668cdc671c37"
32
"where is the plane going?"
[ "where", "is", "the", "plane", "going" ]
1,024
683
"https://farm8.staticflickr.com/8292/7496758474_eea4bc6745_o.jpg"
"https://c3.staticflickr.com/9/8292/7496758474_ef1827aaff_z.jpg"
[ "south african", "worth", "unanswerable", "unanswerable", "unanswerable", "unanswerable", "south africa", "south africa", "south africa", "answering does not require reading text in the image" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"073f668cdc671c37"
33
"what type of plane is this?"
[ "what", "type", "of", "plane", "is", "this" ]
1,024
683
"https://farm8.staticflickr.com/8292/7496758474_eea4bc6745_o.jpg"
"https://c3.staticflickr.com/9/8292/7496758474_ef1827aaff_z.jpg"
[ "south african", "hidehi matsui", "south african", "south african", "south african", "south african", "south african ", "south africa", "south african", "707" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"0782a21a2538f868"
34
"what does the white sign say?"
[ "what", "does", "the", "white", "sign", "say" ]
1,024
408
"https://c1.staticflickr.com/3/2645/3891762410_be9ed7d0c3_o.jpg"
"https://c7.staticflickr.com/3/2645/3891762410_3fe4d1e701_z.jpg"
[ "a380", "airbus a380", "unanswerable", "unanswerable", "unanswerable", "airbus a380", "airbus", "airbus", "unanswerable", "airbus" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"0782a21a2538f868"
35
"what model of plane is this?"
[ "what", "model", "of", "plane", "is", "this" ]
1,024
408
"https://c1.staticflickr.com/3/2645/3891762410_be9ed7d0c3_o.jpg"
"https://c7.staticflickr.com/3/2645/3891762410_3fe4d1e701_z.jpg"
[ "airbus", "a380", "airbus a380", "airbus a380", "airbus a380", "macbook air ", "airbus", "airbus a380", "airbus a380", "airbus" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"080858916c6e0c6c"
36
"what number is on the plane?"
[ "what", "number", "is", "on", "the", "plane" ]
1,024
683
"https://c5.staticflickr.com/7/6150/5937848980_881ac05f0c_o.jpg"
"https://c8.staticflickr.com/7/6150/5937848980_4ae6e52643_z.jpg"
[ "41", "41", "41", "41", "41", "41", "41", "41", "41", "41" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"080858916c6e0c6c"
37
"is there any other text on the plane?"
[ "is", "there", "any", "other", "text", "on", "the", "plane" ]
1,024
683
"https://c5.staticflickr.com/7/6150/5937848980_881ac05f0c_o.jpg"
"https://c8.staticflickr.com/7/6150/5937848980_4ae6e52643_z.jpg"
[ "yes", "yes", "41", "yes, luk", "41", "yes", "yes", "yes", "41", "yes" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"08d8e9951c1ea69e"
38
"what is the number and letter of the plane?"
[ "what", "is", "the", "number", "and", "letter", "of", "the", "plane" ]
1,024
683
"https://c7.staticflickr.com/6/5238/5878712742_ef6f46119e_o.jpg"
"https://c1.staticflickr.com/6/5238/5878712742_c5ce03dd9d_z.jpg"
[ "n328kf", "n328kf", "n328kf", "n328kf", "n328kf", "n328kf", "n328kf", "n328kf", "n328kf", "n328kff" ]
[ "Table", "Vehicle", "Airplane", "Aircraft" ]
"train"
"08d8e9951c1ea69e"
39
"which company owns that aircraft?"
[ "which", "company", "owns", "that", "aircraft" ]
1,024
683
"https://c7.staticflickr.com/6/5238/5878712742_ef6f46119e_o.jpg"
"https://c1.staticflickr.com/6/5238/5878712742_c5ce03dd9d_z.jpg"
[ "scaled composites", "spaceshipone", "unanswerable", "scaled composites", "spaceshipone", "spaceshipone", "scaled composites ", "scaled", "space ship one", "scaled" ]
[ "Table", "Vehicle", "Airplane", "Aircraft" ]
"train"
"096bce1a91c1624c"
40
"to which organization does this helicopter belong?"
[ "to", "which", "organization", "does", "this", "helicopter", "belong" ]
1,024
683
"https://c5.staticflickr.com/5/4101/4912152432_95b0f3f317_o.jpg"
"https://c6.staticflickr.com/5/4101/4912152432_a134f2da27_z.jpg"
[ "department of public safety", "public safety", "department of public safety ", "public safety", "public safety", "public safety", "department of public safety", "department of public safety", "utah department of public safety", "department of public safety" ]
[ "Vehicle", "Helicopter", "Aircraft" ]
"train"
"09d87dca721d23be"
41
"what kind of airline is this?"
[ "what", "kind", "of", "airline", "is", "this" ]
1,024
683
"https://c7.staticflickr.com/9/8065/8253258734_c09a5fba2f_o.jpg"
"https://c2.staticflickr.com/9/8065/8253258734_f5c7d25b95_z.jpg"
[ "lufthansa ", "lufthansa cargo", "lufthansa cargo", "lufthansa cargo", "luthansa cargo", "lufthansa cargo", "lufthansa", "lufthansa cargo", "lufthansa cargo", "luthsana cargo" ]
[ "Doll", "Vehicle", "Airplane", "Aircraft" ]
"train"
"0b187390a96228ff"
42
"what brand is this phone?"
[ "what", "brand", "is", "this", "phone" ]
1,024
768
"https://c8.staticflickr.com/4/3267/2841854688_5b102a5f47_o.jpg"
"https://c8.staticflickr.com/4/3267/2841854688_174d48f551_z.jpg?zz=1"
[ "fedex", "unanswerable", "fedex", "unanswerable", "unanswerable", "fedex", "unanswerable", "unanswerable", "fedex", "unanswerable" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"0b187390a96228ff"
43
"is the word express under the word fedex at the front of the plane?"
[ "is", "the", "word", "express", "under", "the", "word", "fedex", "at", "the", "front", "of", "the", "plane" ]
1,024
768
"https://c8.staticflickr.com/4/3267/2841854688_5b102a5f47_o.jpg"
"https://c8.staticflickr.com/4/3267/2841854688_174d48f551_z.jpg?zz=1"
[ "yes", "express", "yes", "yes", "yes", "express", "yes", "yes", "yes", "yes" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"0b2f523a4e734bec"
44
"what is the plane number?"
[ "what", "is", "the", "plane", "number" ]
1,024
683
"https://c5.staticflickr.com/8/7118/7768063976_fdc66843c6_o.jpg"
"https://c1.staticflickr.com/8/7118/7768063976_ea9351e86f_z.jpg"
[ "ec-634", "ec-634", "ec-634", "ec-634", "ec-354", "ec-634", "ec-634", "ec-634", "the taste of freedom", "634" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"0b2f523a4e734bec"
45
"what is the name of the plane?"
[ "what", "is", "the", "name", "of", "the", "plane" ]
1,024
683
"https://c5.staticflickr.com/8/7118/7768063976_fdc66843c6_o.jpg"
"https://c1.staticflickr.com/8/7118/7768063976_ea9351e86f_z.jpg"
[ "inta", "ec-634", "inta", "not all those who wander are lost", "inta", "inta", "ec-634", "inta", "inta", "inta ec-634" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"0e40cb14d4c4bbbc"
46
"what is the id number written on the rear of the plane?"
[ "what", "is", "the", "id", "number", "written", "on", "the", "rear", "of", "the", "plane" ]
1,024
683
"https://c5.staticflickr.com/6/5448/9320917774_4f524cc61e_o.jpg"
"https://c5.staticflickr.com/6/5448/9320917774_952331468c_z.jpg"
[ "d-chio", "d-chi0", "0", "0chio", "d-chio", "d-chio", "d-chio", "d-chio", "d-chio", "d-chio" ]
[ "Tree", "Vehicle", "Clothing", "Footwear", "Airplane", "Aircraft" ]
"train"
"0e45202f3462f604"
47
"what company is this for?"
[ "what", "company", "is", "this", "for" ]
1,024
768
"https://farm5.staticflickr.com/2081/2478405105_c9ea1fa2f9_o.jpg"
"https://c6.staticflickr.com/3/2081/2478405105_37f0295257_z.jpg"
[ "java", "java", "java", "java", "java", "java", "java", "java", "jave", "java" ]
[ "Vehicle", "Human eye", "Person", "Human mouth", "Human ear", "Human hair", "Human head", "Man", "Airplane", "Human face", "Human nose", "Aircraft" ]
"train"
"0e45202f3462f604"
48
"what does the sign want to add to java?"
[ "what", "does", "the", "sign", "want", "to", "add", "to", "java" ]
1,024
768
"https://farm5.staticflickr.com/2081/2478405105_c9ea1fa2f9_o.jpg"
"https://c6.staticflickr.com/3/2081/2478405105_37f0295257_z.jpg"
[ "you", "you", "you", "you", "you", "you", "you", "you", "you", "you" ]
[ "Vehicle", "Human eye", "Person", "Human mouth", "Human ear", "Human hair", "Human head", "Man", "Airplane", "Human face", "Human nose", "Aircraft" ]
"train"
"0fd38fd0f9bf1bee"
49
"the train number is?"
[ "the", "train", "number", "is" ]
1,024
764
"https://c7.staticflickr.com/4/3727/19693966404_b5cacb3d24_o.jpg"
"https://c6.staticflickr.com/4/3727/19693966404_90016a2fce_z.jpg"
[ "1450", "1450", "may 10th 2009", "1450", "1450", "1450", "450", "1450", "1450", "1450" ]
[ "Vehicle", "Land vehicle", "Plant", "Train", "Wheel", "Auto part", "Aircraft" ]
"train"
"0fef083ac0b5dff6"
50
"what letter is written on the side of the red propeller plane?"
[ "what", "letter", "is", "written", "on", "the", "side", "of", "the", "red", "propeller", "plane" ]
1,024
768
"https://farm7.staticflickr.com/3062/2888363565_8f3e5ef325_o.jpg"
"https://c5.staticflickr.com/4/3062/2888363565_1ae375f3ac_z.jpg?zz=1"
[ "a", "unanswerable", "unanswerable", "618", "no text in image", "a", "a", "unanswerable", "unanswerable", "unanswerable" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"0fef083ac0b5dff6"
51
"what 3 digit number is on the plane in the back?"
[ "what", "3", "digit", "number", "is", "on", "the", "plane", "in", "the", "back" ]
1,024
768
"https://farm7.staticflickr.com/3062/2888363565_8f3e5ef325_o.jpg"
"https://c5.staticflickr.com/4/3062/2888363565_1ae375f3ac_z.jpg?zz=1"
[ "987", "987", "pepsi", "987", "987", "612", "987", "987", "987", "997" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"10513fda50da5e6d"
52
"what is the id number on the tail of the plane?"
[ "what", "is", "the", "id", "number", "on", "the", "tail", "of", "the", "plane" ]
1,024
768
"https://c6.staticflickr.com/4/3236/3113140447_9a14368487_o.jpg"
"https://c1.staticflickr.com/4/3236/3113140447_7a98021e3c_z.jpg"
[ "513", "513", "513", "513", "xt5v3", "513", "513", "513", "513", "513" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1086a31c9367a124"
53
"what is the word on the side of this plane?"
[ "what", "is", "the", "word", "on", "the", "side", "of", "this", "plane" ]
1,024
682
"https://farm5.staticflickr.com/1008/1313880307_2c59c70ffa_o.jpg"
"https://c3.staticflickr.com/2/1008/1313880307_0734ab83fb_z.jpg"
[ "team ameristar.com", "teamaerostar.com", "teamaerostar.com", "teamaerostar.com", "teamaerostar.com", "34", "teamaerostar", "teamaerostar.com", "teamaerostar", "teamaerostar.com" ]
[ "Person", "Land vehicle", "Vehicle", "Airplane", "Car", "Aircraft" ]
"train"
"1086a31c9367a124"
54
"what number is this plane?"
[ "what", "number", "is", "this", "plane" ]
1,024
682
"https://farm5.staticflickr.com/1008/1313880307_2c59c70ffa_o.jpg"
"https://c3.staticflickr.com/2/1008/1313880307_0734ab83fb_z.jpg"
[ "34", "34", "34", "34", "34", "34", "34", "34", "34", "34" ]
[ "Person", "Land vehicle", "Vehicle", "Airplane", "Car", "Aircraft" ]
"train"
"111d7be56517ed46"
55
"what is written on the side of this airplane?"
[ "what", "is", "written", "on", "the", "side", "of", "this", "airplane" ]
1,024
683
"https://farm7.staticflickr.com/7420/9489993636_a69b8334dc_o.jpg"
"https://c4.staticflickr.com/8/7420/9489993636_0d401d5a13_z.jpg"
[ "cs-dkd", "cs-dkd", "cs-dkd", "cs-dkd", "cs-dkd", "c9-dkd", "cs-dkd", "cs-oko", "cs-dkd", "cs-dkd" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"111d7be56517ed46"
56
"what number is on the plane?"
[ "what", "number", "is", "on", "the", "plane" ]
1,024
683
"https://farm7.staticflickr.com/7420/9489993636_a69b8334dc_o.jpg"
"https://c4.staticflickr.com/8/7420/9489993636_0d401d5a13_z.jpg"
[ "csdkd", "cs-dk0", "8", "cs-0k0", "no number", "cs-dkd", "unanswerable", "cs-dkd", "cs-dkd", "xs-dkd" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1139456aa3f70a34"
57
"what number is on the silver plane?"
[ "what", "number", "is", "on", "the", "silver", "plane" ]
1,024
680
"https://farm5.staticflickr.com/5282/5304752422_73f0c89306_o.jpg"
"https://c6.staticflickr.com/6/5282/5304752422_796df24693_z.jpg"
[ "63", "63", "63", "63", "63", "63", "63", "63", "63", "63" ]
[ "Vehicle", "Clothing", "Airplane", "Aircraft" ]
"train"
"1252873867aa8a86"
58
"what identification number belongs to the big bomber plane in the center?"
[ "what", "identification", "number", "belongs", "to", "the", "big", "bomber", "plane", "in", "the", "center" ]
1,024
681
"https://farm7.staticflickr.com/4099/4879614318_cce9817c94_o.jpg"
"https://c7.staticflickr.com/5/4099/4879614318_6c19f98122_z.jpg"
[ "82", "82", "82", "82", "82", "82", "82", "82", "82", "82" ]
[ "Vehicle", "Wheel", "Airplane", "Aircraft" ]
"train"
"1252873867aa8a86"
59
"what letter is in the black circle?"
[ "what", "letter", "is", "in", "the", "black", "circle" ]
1,024
681
"https://farm7.staticflickr.com/4099/4879614318_cce9817c94_o.jpg"
"https://c7.staticflickr.com/5/4099/4879614318_6c19f98122_z.jpg"
[ "r", "r", "r", "r", "r", "r", "r", "r", "r", "r" ]
[ "Vehicle", "Wheel", "Airplane", "Aircraft" ]
"train"
"12e4f6ee148a112e"
60
"what airline is this plane for?"
[ "what", "airline", "is", "this", "plane", "for" ]
1,024
683
"https://c7.staticflickr.com/9/8514/8570025824_708ec5043f_o.jpg"
"https://c6.staticflickr.com/9/8514/8570025824_1108160a25_z.jpg"
[ "onurair", "onurair", "onurair", "onurair", "onurair", "onurair", "nurair", "onurair", "onurair", "onurair" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"12e4f6ee148a112e"
61
"what is the name of the airline?"
[ "what", "is", "the", "name", "of", "the", "airline" ]
1,024
683
"https://c7.staticflickr.com/9/8514/8570025824_708ec5043f_o.jpg"
"https://c6.staticflickr.com/9/8514/8570025824_1108160a25_z.jpg"
[ "onurair", "onurair", "onurair", "onurair", "onurair", "onurair", "onurair", "onurair", "onurair", "onurair" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1334b6b8fcfd8afb"
62
"what number is on the grey and yellow plane?"
[ "what", "number", "is", "on", "the", "grey", "and", "yellow", "plane" ]
1,024
737
"https://c4.staticflickr.com/4/3213/5783769965_095c1521cc_o.jpg"
"https://c1.staticflickr.com/4/3213/5783769965_c47052b1b9_z.jpg"
[ "481", "481", "48i", "481", "481", "481", "481", "481", "481", "481" ]
[ "Person", "Vehicle", "Building", "Airplane", "Aircraft" ]
"train"
"13dec395531b458f"
63
"two letters on the tail?"
[ "two", "letters", "on", "the", "tail" ]
1,024
683
"https://c8.staticflickr.com/4/3423/3254044550_01df334a26_o.jpg"
"https://c1.staticflickr.com/4/3423/3254044550_9937e4cbff_z.jpg"
[ "ah", "aj", "ah", "ah", "ah", "ah", "ah", "ah", "ah", "ah" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"13dec395531b458f"
64
"what is the number found on the head of the plane?"
[ "what", "is", "the", "number", "found", "on", "the", "head", "of", "the", "plane" ]
1,024
683
"https://c8.staticflickr.com/4/3423/3254044550_01df334a26_o.jpg"
"https://c1.staticflickr.com/4/3423/3254044550_9937e4cbff_z.jpg"
[ "106", "106", "106", "106", "106", "106", "106", "106", "106", "106" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"142076607dbdfa59"
65
"is that plane part of the star alliance?"
[ "is", "that", "plane", "part", "of", "the", "star", "alliance" ]
1,024
683
"https://c2.staticflickr.com/6/5618/20923572869_97e8af2aff_o.jpg"
"https://c5.staticflickr.com/6/5618/20923572869_957a38f2d4_z.jpg"
[ "yes", "yes", "star alliance", "yes", "yes", "yes", "sunkist", "yes", "yes", "stop" ]
[ "Building", "Vehicle", "Airplane", "Aircraft" ]
"train"
"14fa38ae8fea318d"
66
"who owns the blimp?"
[ "who", "owns", "the", "blimp" ]
1,024
681
"https://c6.staticflickr.com/8/7298/13898426779_e9bda6bb6f_o.jpg"
"https://c2.staticflickr.com/8/7298/13898426779_dcce47bbd3_z.jpg"
[ "u.s. navy", "u.s. navy", "u.s. navy", "us navy", "u.s. navy", "u.s. navy", "u.s. navy", "us navy", "us navy", "us navy" ]
[ "Vehicle", "Aircraft" ]
"train"
"1582c8538686d50d"
67
"what number is on the plane?"
[ "what", "number", "is", "on", "the", "plane" ]
1,024
683
"https://c6.staticflickr.com/8/7270/7733381602_7db0f9bc57_o.jpg"
"https://c5.staticflickr.com/8/7270/7733381602_60faa5782c_z.jpg"
[ "155", "155", "155", "155", "155", "55", "155", "155", "155", "155" ]
[ "Person", "Vehicle", "Clothing", "Footwear", "Airplane", "Aircraft" ]
"train"
"15a2517d9e904afa"
68
"what airline does this plane belong to?"
[ "what", "airline", "does", "this", "plane", "belong", "to" ]
1,024
683
"https://c3.staticflickr.com/4/3904/14622637909_597d459db0_o.jpg"
"https://c5.staticflickr.com/4/3904/14622637909_f25e608e95_z.jpg"
[ "iberia", "iberia", "iberia", "iberia", "iberia", "iberia", "iberia", "iberia", "iberia ", "iberia" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"15a48b2dd565998a"
69
"what is this?"
[ "what", "is", "this" ]
1,024
683
"https://c3.staticflickr.com/5/4115/4825846889_fda143463b_o.jpg"
"https://c2.staticflickr.com/5/4115/4825846889_35ab6efbe0_z.jpg"
[ "an a380 airbus", "airbus", "answering does not require reading text in the image", "airbus", "airbus a380", "airbus", "airplane", "airplane", "answering does not require reading text in the image", "answering does not require reading text in the image" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"15a48b2dd565998a"
70
"what is the identification number?"
[ "what", "is", "the", "identification", "number" ]
1,024
683
"https://c3.staticflickr.com/5/4115/4825846889_fda143463b_o.jpg"
"https://c2.staticflickr.com/5/4115/4825846889_35ab6efbe0_z.jpg"
[ "a380", "a380", "a380", "a380", "a380", "a380", "a380", "a380", "a380", "a380" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"15da5c794efe07b6"
71
"what is this planes website?"
[ "what", "is", "this", "planes", "website" ]
1,024
582
"https://farm7.staticflickr.com/5594/14944706458_b4993e9894_o.jpg"
"https://c8.staticflickr.com/6/5594/14944706458_99886c5f94_z.jpg"
[ "monarch.co.uk", "monarch.co.uk", "monarch.co.uk", "monarch.co.uk", "monarch.co.uk", "monarch.co.uk", "monarch.co.uk", "monarch.co.uk", "monach.co.uk", "monarch.co.uk" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"15da5c794efe07b6"
72
"what letter is the symbol on the tale of the plane?"
[ "what", "letter", "is", "the", "symbol", "on", "the", "tale", "of", "the", "plane" ]
1,024
582
"https://farm7.staticflickr.com/5594/14944706458_b4993e9894_o.jpg"
"https://c8.staticflickr.com/6/5594/14944706458_99886c5f94_z.jpg"
[ "m", "m", "m", "m", "m", "m", "m", "m", "m", "m" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"163c6f54edee23ae"
73
"what does it say on the jet?"
[ "what", "does", "it", "say", "on", "the", "jet" ]
1,024
680
"https://c7.staticflickr.com/8/7393/8727264764_9888660299_o.jpg"
"https://c3.staticflickr.com/8/7393/8727264764_04b6359b61_z.jpg"
[ "u.s. airforce", "us air force", "u.s. air force", "us air force", "u.s. air force", "us air force", "u.s. air force", "us air force", "us air force", "us air force" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"163c6f54edee23ae"
74
"what branch of the military is the plane from?"
[ "what", "branch", "of", "the", "military", "is", "the", "plane", "from" ]
1,024
680
"https://c7.staticflickr.com/8/7393/8727264764_9888660299_o.jpg"
"https://c3.staticflickr.com/8/7393/8727264764_04b6359b61_z.jpg"
[ "us air force", "u.s. air force", "air force", "air force", "air force", "us air force ", "u.s. air force", "us air force", "air force", "u.s. airforce" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1665ff941d73a03f"
75
"what is the plane number?"
[ "what", "is", "the", "plane", "number" ]
1,024
768
"https://c8.staticflickr.com/1/60/156227243_a0b122c650_o.jpg"
"https://c4.staticflickr.com/1/60/156227243_a0b122c650_z.jpg?zz=1"
[ "105", "105", "105", "105", "105", "105", "105", "105", "105", "105" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1665ff941d73a03f"
76
"what branch of the military is on the plane?"
[ "what", "branch", "of", "the", "military", "is", "on", "the", "plane" ]
1,024
768
"https://c8.staticflickr.com/1/60/156227243_a0b122c650_o.jpg"
"https://c4.staticflickr.com/1/60/156227243_a0b122c650_z.jpg?zz=1"
[ "navy", "navy", "navy", "105", "navy", "navy", "navy", "eugenie & napoleon iii", "navy", "navy" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"171d86fc6d86d5f0"
77
"what is the tail number of the jet?"
[ "what", "is", "the", "tail", "number", "of", "the", "jet" ]
1,024
627
"https://farm1.staticflickr.com/3901/18259467674_a4149a43ae_o.jpg"
"https://c5.staticflickr.com/4/3901/18259467674_fb79d7480d_z.jpg"
[ "13-143", "13-143", "13-143", "13-143", "13-143", "13-143", "13-143", "13 143", "13-143", "13-143" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"188d67c561540d6b"
78
"what is the name of the airline?"
[ "what", "is", "the", "name", "of", "the", "airline" ]
1,024
683
"https://c4.staticflickr.com/3/2910/14622747350_f8c31fbba0_o.jpg"
"https://c2.staticflickr.com/3/2910/14622747350_fe432e6388_z.jpg"
[ "air canada", "air canada", "air canada", "air canada", "air canada", "air canada", "air canada", "air canada", "air canada", "air canada" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"1a192511800af150"
79
"what number is on the back of this plane?"
[ "what", "number", "is", "on", "the", "back", "of", "this", "plane" ]
1,024
393
"https://farm1.staticflickr.com/7576/15026526724_3bca926dca_o.jpg"
"https://c7.staticflickr.com/8/7576/15026526724_ef993b26c0_z.jpg"
[ "3", "46100", "3", "3", "3", "3", "3", "3", "46100", "46100" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1a2e9a1c8d9432b6"
80
"what is the name on the plane?"
[ "what", "is", "the", "name", "on", "the", "plane" ]
1,024
529
"https://farm7.staticflickr.com/11/16042048_4b71d57a58_o.jpg"
"https://farm7.staticflickr.com/11/16042048_4b71d57a58_o.jpg"
[ "iberia ", "iberia", "iberia", "iberia", "iberia", "iberia", "iberia", "iberia", "iberia", "iberia" ]
[ "Land vehicle", "Vehicle", "Airplane", "Aircraft" ]
"train"
"1a2e9a1c8d9432b6"
81
"what is the model number of the plane printed on the rear?"
[ "what", "is", "the", "model", "number", "of", "the", "plane", "printed", "on", "the", "rear" ]
1,024
529
"https://farm7.staticflickr.com/11/16042048_4b71d57a58_o.jpg"
"https://farm7.staticflickr.com/11/16042048_4b71d57a58_o.jpg"
[ "sign of his misery", "ec-hul", "ec-hul", "ec hul", "ec-hul", "echul", "ec-hul", "hul", "hul", "ec-hul" ]
[ "Land vehicle", "Vehicle", "Airplane", "Aircraft" ]
"train"
"1b1c4a2fa6175cf0"
82
"what brand is the white balloon?"
[ "what", "brand", "is", "the", "white", "balloon" ]
1,024
683
"https://farm1.staticflickr.com/3783/10262236824_d713bbede4_o.jpg"
"https://c3.staticflickr.com/4/3783/10262236824_02a1022976_z.jpg"
[ "la ,esa", "lamesra", "lamesa", "lamesa", "lamesa", "la mesa", "lamesarv", "lamesa", "lamesa", "lamesah" ]
[ "Toy", "Balloon", "Vehicle", "Aircraft" ]
"train"
"1c28f57e08fc7858"
83
"does this plane state that it's easy?"
[ "does", "this", "plane", "state", "that", "it", "'", "s", "easy" ]
1,024
620
"https://c1.staticflickr.com/3/2828/9106837957_9136bd1b52_o.jpg"
"https://c7.staticflickr.com/3/2828/9106837957_41a172a688_z.jpg"
[ "no", "no", "no", "no", "no", "no", "yes", "no, ez", "unanswerable", "unanswerable" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1c28f57e08fc7858"
84
"what is the id number on the bottom of the wing?"
[ "what", "is", "the", "id", "number", "on", "the", "bottom", "of", "the", "wing" ]
1,024
620
"https://c1.staticflickr.com/3/2828/9106837957_9136bd1b52_o.jpg"
"https://c7.staticflickr.com/3/2828/9106837957_41a172a688_z.jpg"
[ "n5588n", "n5588n", "n5588n", "n5588n", "n5588n", "n5588n", "n5588n", "465688m", "n5588n", "n5588n" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1d1b9a571d21441e"
85
"what is the helicopter's number?"
[ "what", "is", "the", "helicopter", "'", "s", "number" ]
1,024
646
"https://c7.staticflickr.com/3/2105/5816184329_31a25758a6_o.jpg"
"https://c7.staticflickr.com/3/2105/5816184329_261a5522e4_z.jpg"
[ "1", "1", "1", "pp jdr", "1", "pp-jdr", "pp jdr", "22", "1", "2007" ]
[ "Vehicle", "Helicopter", "Aircraft" ]
"train"
"1d1b9a571d21441e"
86
"which website is listed on the picture?"
[ "which", "website", "is", "listed", "on", "the", "picture" ]
1,024
646
"https://c7.staticflickr.com/3/2105/5816184329_31a25758a6_o.jpg"
"https://c7.staticflickr.com/3/2105/5816184329_261a5522e4_z.jpg"
[ "flickr", "flickr.com/photos/degu_andre", "flickr.com", "flickr.com/photos/degu_andre", "flickr.com/photos/degu_andre", "flickr.com", "flickr.com/photos/degu_andre", "flickr.com", "flikr.com/photos/degu_andre", "flickr.com" ]
[ "Vehicle", "Helicopter", "Aircraft" ]
"train"
"1e4fcfbd0bb6e1e1"
87
"what is the slogan on the side say that life is for?"
[ "what", "is", "the", "slogan", "on", "the", "side", "say", "that", "life", "is", "for" ]
1,024
645
"https://farm2.staticflickr.com/3076/3118659187_482c4be28c_o.jpg"
"https://c6.staticflickr.com/4/3076/3118659187_e21f204141_z.jpg"
[ "sharing ", "life is for sharing", "sharing", "life is for sharing", "lite is for sharing", "sharing", "life is for sharing", "sharing", "airplane", "audite" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"1e4fcfbd0bb6e1e1"
88
"what brand is being advertised on the plane?"
[ "what", "brand", "is", "being", "advertised", "on", "the", "plane" ]
1,024
645
"https://farm2.staticflickr.com/3076/3118659187_482c4be28c_o.jpg"
"https://c6.staticflickr.com/4/3076/3118659187_e21f204141_z.jpg"
[ "t mobile", "t mobile", "t mobile", "t-mobile", "t mobile", "t-mobile", "t-mobile", "t-mobile", "t", "t-mobile" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"20db2e4f0602e5aa"
89
"what is the number on the tail wing?"
[ "what", "is", "the", "number", "on", "the", "tail", "wing" ]
1,024
680
"https://c3.staticflickr.com/8/7618/17149400576_9626a83e86_o.jpg"
"https://c8.staticflickr.com/8/7618/17149400576_c5675a3e5f_z.jpg"
[ "031", "031", "031", "031", "031", "031", "031", "031", "031", "031" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"210c80055b9f8e1a"
90
"what number is on the nose of this aircraft?"
[ "what", "number", "is", "on", "the", "nose", "of", "this", "aircraft" ]
1,024
768
"https://c3.staticflickr.com/3/2322/3543331109_191bfc86c8_o.jpg"
"https://c8.staticflickr.com/3/2322/3543331109_e8c689de46_z.jpg?zz=1"
[ "202", "202", "202", "202", "202", "202", "202", "202", "202", "202" ]
[ "Vehicle", "Clothing", "Airplane", "Aircraft" ]
"train"
"226d623d0c70664f"
91
"what is the identification number on the balloon?"
[ "what", "is", "the", "identification", "number", "on", "the", "balloon" ]
680
1,024
"https://c8.staticflickr.com/1/387/19416117025_35e54452b3_o.jpg"
"https://c1.staticflickr.com/1/387/19416117025_24e91b578f_z.jpg"
[ "00-b2w", "00-bzw", "00-bzw", "00-bzw", "00-bzw", "00-9zm", "00-bzw", "00-bzw", "00-bzw", "00-bzw" ]
[ "Toy", "Balloon", "Vehicle", "Clothing", "Aircraft" ]
"train"
"226d623d0c70664f"
92
"what is the last letter on the balloon?"
[ "what", "is", "the", "last", "letter", "on", "the", "balloon" ]
680
1,024
"https://c8.staticflickr.com/1/387/19416117025_35e54452b3_o.jpg"
"https://c1.staticflickr.com/1/387/19416117025_24e91b578f_z.jpg"
[ "w", "e", "e", "e", "e", "n", "e", "a", "e", "w" ]
[ "Toy", "Balloon", "Vehicle", "Clothing", "Aircraft" ]
"train"
"229c0c1a9abfcd9e"
93
"what is the airline this plane belongs to?"
[ "what", "is", "the", "airline", "this", "plane", "belongs", "to" ]
1,024
683
"https://c7.staticflickr.com/9/8088/8536501186_d1841cefc3_o.jpg"
"https://c4.staticflickr.com/9/8088/8536501186_f322c7e575_z.jpg"
[ "eva air", "eva air", "eva air", "eva air", "eva air", "eva air", "eva air", "eva air", "eva air", "eva air" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"229c0c1a9abfcd9e"
94
"what is the 6 digit numbers on the side of the plane?"
[ "what", "is", "the", "6", "digit", "numbers", "on", "the", "side", "of", "the", "plane" ]
1,024
683
"https://c7.staticflickr.com/9/8088/8536501186_d1841cefc3_o.jpg"
"https://c4.staticflickr.com/9/8088/8536501186_f322c7e575_z.jpg"
[ "777-300", "777-300", "777300", "777-900", "777-300", "777300", "777 300", "777-300", "777-300", "777-300" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"
"233a9ff6b3175939"
95
"what year was this photograph taken?"
[ "what", "year", "was", "this", "photograph", "taken" ]
1,024
790
"https://c2.staticflickr.com/1/193/493783526_571bf5d618_o.jpg"
"https://c4.staticflickr.com/1/193/493783526_c693ecc6a9_z.jpg?zz=1"
[ "1949", "1949", "1949", "1949", "1949", "1949", "1949", "1949", "1949", "1949" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"233a9ff6b3175939"
96
"what is the model of this aircraft?"
[ "what", "is", "the", "model", "of", "this", "aircraft" ]
1,024
790
"https://c2.staticflickr.com/1/193/493783526_571bf5d618_o.jpg"
"https://c4.staticflickr.com/1/193/493783526_c693ecc6a9_z.jpg?zz=1"
[ "17 gc mcrexf ", "452967", "c-87", "b52", "c-87", "17 gc mcrexf 19jan49 c-87", "452907", "452907", "unanswerable", "452967" ]
[ "Tree", "Vehicle", "Airplane", "Aircraft" ]
"train"
"24456069fd6847a8"
97
"what is the model airplane model number?"
[ "what", "is", "the", "model", "airplane", "model", "number" ]
768
768
"https://c1.staticflickr.com/4/3414/5707967525_a9a8db5e1b_o.jpg"
"https://c2.staticflickr.com/4/3414/5707967525_62463523ab_z.jpg"
[ "v268", "107", "s107", "s107 or s1076", "s107", "v268", "s107", "s107", "s107", "s107" ]
[ "Person", "Musical instrument", "Musical keyboard", "Vehicle", "Helicopter", "Clothing", "Jeans", "Aircraft" ]
"train"
"2489dc9e42de1b36"
98
"what number is in red on the plane?"
[ "what", "number", "is", "in", "red", "on", "the", "plane" ]
1,024
683
"https://farm7.staticflickr.com/4019/4654240354_19b1ec2f0b_o.jpg"
"https://c3.staticflickr.com/5/4019/4654240354_57c0a3a6c7_z.jpg"
[ "4858", "1", "4858", "48592", "4858", "4858", "unanswerable", "48592", "not a question", "4859" ]
[ "Ski", "Vehicle", "Airplane", "Aircraft" ]
"train"
"24ad0e52d2f80d92"
99
"what's the number written on the plane?"
[ "what", "'", "s", "the", "number", "written", "on", "the", "plane" ]
1,024
768
"https://farm3.staticflickr.com/16/22937592_00d9b5a8b6_o.jpg"
"https://c2.staticflickr.com/1/16/22937592_00d9b5a8b6_z.jpg?zz=1"
[ "63", "63", "63", "63", "63", "63", "cool stuff", "63", "63", "63" ]
[ "Vehicle", "Airplane", "Aircraft" ]
"train"

Dataset Card for TextVQA

Dataset Summary

TextVQA requires models to read and reason about text in images in order to answer questions about them. Specifically, models need to incorporate a new modality, the text present in the images, and reason over it to answer TextVQA questions. The TextVQA dataset contains 45,336 questions over 28,408 images from the OpenImages dataset. The dataset uses the VQA accuracy metric for evaluation.

Supported Tasks and Leaderboards

  • visual-question-answering: The dataset can be used for Visual Question Answering tasks, where the goal is to answer a question about a given image. For the TextVQA dataset specifically, the questions require reading and reasoning about the scene text in the image.

Languages

The questions in the dataset are in English.

Dataset Structure

Data Instances

A typical sample contains the question in the question field, an image object in the image field, the OpenImages image id in image_id, and a lot of other useful metadata. Each question comes with 10 answers, stored in the answers field. For the test set, the answers field contains 10 empty strings because the answers are not available.

An example looks like this:

  {'question': 'who is this copyrighted by?',
   'image_id': '00685bc495504d61',
   'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=384x512 at 0x276021C5EB8>,
   'image_classes': ['Vehicle', 'Tower', 'Airplane', 'Aircraft'],
   'flickr_original_url': 'https://farm2.staticflickr.com/5067/5620759429_4ea686e643_o.jpg',
   'flickr_300k_url': 'https://c5.staticflickr.com/6/5067/5620759429_f43a649fb5_z.jpg',
   'image_width': 786,
   'image_height': 1024,
   'answers': ['simon clancy',
    'simon ciancy',
    'simon clancy',
    'simon clancy',
    'the brand is bayard',
    'simon clancy',
    'simon clancy',
    'simon clancy',
    'simon clancy',
    'simon clancy'],
   'question_tokens': ['who', 'is', 'this', 'copyrighted', 'by'],
   'question_id': 3,
   'set_name': 'train'
  },
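
As a rough illustration of working with such a record, the sketch below picks the most frequent answer among the ten annotations. The helper name majority_answer is hypothetical and not part of the dataset or of any library, and the record is a trimmed copy of the example above.

```python
from collections import Counter


def majority_answer(record):
    """Return the most common non-empty answer, or None if there is none."""
    # Strip whitespace, since some annotated answers carry stray spaces.
    answers = [a.strip() for a in record["answers"] if a.strip()]
    if not answers:
        # Test-set records hold ten empty strings instead of answers.
        return None
    return Counter(answers).most_common(1)[0][0]


record = {
    "question": "who is this copyrighted by?",
    "answers": [
        "simon clancy", "simon ciancy", "simon clancy", "simon clancy",
        "the brand is bayard", "simon clancy", "simon clancy",
        "simon clancy", "simon clancy", "simon clancy",
    ],
}
print(majority_answer(record))
```

A majority vote is only one way to collapse the ten answers into a single label; the evaluation metric itself considers all ten.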

Data Fields

  • question: string, the question that is being asked about the image
  • image_id: string, id of the image, which is the same as the OpenImages id
  • image: A PIL.Image.Image object containing the image about which the question is being asked. Note that accessing the image column, as in dataset[0]["image"], automatically decodes the image file. Decoding a large number of image files can take a significant amount of time, so it is important to query the sample index before the "image" column, i.e. dataset[0]["image"] should always be preferred over dataset["image"][0].
  • image_classes: List[str], The OpenImages classes to which the image belongs.
  • flickr_original_url: string, URL to original image on Flickr
  • flickr_300k_url: string, URL to resized and low-resolution image on Flickr.
  • image_width: int, Width of the original image.
  • image_height: int, Height of the original image.
  • question_tokens: List[str], The question, pre-tokenized into a list of tokens.
  • answers: List[str], List of 10 human-annotated answers for the question, collected from 10 different users. For the test set, whose answers are not available, the list contains empty strings.
  • question_id: int, Unique id of the question.
  • set_name: string, the set to which this question belongs.

Data Splits

There are three splits: train, validation and test. The train and validation sets share images with the OpenImages train set and have their answers available. For the test set, the answers field is a list of ten empty strings. To get inference results and numbers on the test set, you need to go to the EvalAI leaderboard and upload your predictions there. Please see the instructions at https://textvqa.org/challenge/.

Dataset Creation

Curation Rationale

From the paper:

Studies have shown that a dominant class of questions asked by visually impaired users on images of their surroundings involves reading text in the image. But today’s VQA models can not read! Our paper takes a first step towards addressing this problem. First, we introduce a new “TextVQA” dataset to facilitate progress on this important problem. Existing datasets either have a small proportion of questions about text (e.g., the VQA dataset) or are too small (e.g., the VizWiz dataset). TextVQA contains 45,336 questions on 28,408 images that require reasoning about text to answer.

Source Data

Initial Data Collection and Normalization

The initial images were sourced from the OpenImages v4 dataset. They were first filtered with automatic heuristics based on an OCR system, keeping only images in which at least some text was detected. See the annotation process section for the next stages.

Who are the source language producers?

English-speaking crowdsourced annotators

Annotations

Annotation process

After the automatic filtering for images that contain text, human annotators manually verified that the images did contain text. In the next stage, annotators were asked to write questions involving the scene text in each image; two questions were collected per image whenever possible. In the final stage, ten different human annotators answered each question from the previous stage.

Who are the annotators?

Annotators are from one of the major data collection platforms such as AMT. Exact details are not mentioned in the paper.

Personal and Sensitive Information

The dataset has PII issues similar to those of OpenImages and can at times contain human faces, license plates, and documents. Using the provided image_classes data field is one option for trying to filter out some of this information.
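
A minimal sketch of such a filter is shown below, operating on plain dictionaries that carry only the fields it needs. The function name and the blocklist are assumptions made for illustration: "Person" does appear among the image_classes in this dataset's preview rows, but the other two labels are assumed OpenImages class names and should be checked against the actual class list before use.

```python
# Hypothetical blocklist. "Person" is seen in the preview rows; the other
# labels are assumed OpenImages class names and need to be verified.
SENSITIVE_CLASSES = {"Person", "Human face", "Vehicle registration plate"}


def without_sensitive_classes(records, blocked=SENSITIVE_CLASSES):
    """Keep only records whose image_classes share nothing with `blocked`."""
    return [r for r in records if blocked.isdisjoint(r["image_classes"])]


records = [
    {"question_id": 68, "image_classes": ["Vehicle", "Airplane", "Aircraft"]},
    {"question_id": 97, "image_classes": ["Person", "Vehicle", "Helicopter", "Aircraft"]},
]
kept = without_sensitive_classes(records)
print([r["question_id"] for r in kept])
```

Because image_classes are coarse labels, a filter like this can only remove some of the sensitive images, not all of them.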

Considerations for Using the Data

Social Impact of Dataset

The paper helped establish the importance of scene text recognition and reasoning in general-purpose machine learning applications and has led to many follow-up works, including TextCaps and TextOCR. Similar datasets have been introduced over time, some focusing on sight-disabled users, such as VizWiz, and others on the same problem as TextVQA, such as STVQA, DocVQA and OCRVQA. Currently, most methods train on a combination of TextVQA and STVQA to achieve state-of-the-art performance on both datasets.

Discussion of Biases

The paper discusses question-only bias, where a model is able to answer the question without even looking at the image; this was a major issue with the original VQA dataset. Outlier bias in the answers is mitigated by collecting 10 different answers per question, all of which the evaluation metric takes into consideration.

Other Known Limitations

  • The dataset is English-only but does include images with non-English Latin characters, so it can involve some multilingual understanding.
  • Performance on the dataset also depends on the quality of the OCR system used, as OCR errors can directly lead to wrong answers.
  • The metric used for calculating accuracy is the same as VQA accuracy. It relies on exact matching against the given answers and thus gives no partial credit for near-miss errors introduced by OCR, such as a single wrong character.
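
To make the last point concrete, the sketch below implements the commonly quoted simplified form of VQA accuracy, min(number of annotators who gave the answer / 3, 1). This is not the official evaluation script, which additionally normalizes answers and averages over subsets of annotators; the function name and the light lowercase/strip normalization are assumptions. The answers are taken from question 92 in the preview rows.

```python
def vqa_accuracy(prediction, human_answers):
    """Simplified VQA accuracy: min(matching annotators / 3, 1)."""
    pred = prediction.strip().lower()
    # Exact string match only; a near miss counts as no match at all.
    matches = sum(1 for a in human_answers if a.strip().lower() == pred)
    return min(matches / 3.0, 1.0)


# Ten human answers to "what is the last letter on the balloon?"
answers = ["w", "e", "e", "e", "e", "n", "e", "a", "e", "w"]

print(vqa_accuracy("e", answers))   # given by six annotators
print(vqa_accuracy("w", answers))   # given by two annotators
print(vqa_accuracy("ee", answers))  # near miss, matches nobody
```

An answer given by at least three annotators receives full credit, while a prediction that differs from every annotated answer by even one character receives none.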

Additional Information

Dataset Curators

  • Amanpreet Singh
  • Vivek Natarajan
  • Meet Shah
  • Yu Jiang
  • Xinlei Chen
  • Dhruv Batra
  • Devi Parikh
  • Marcus Rohrbach

Licensing Information

CC BY 4.0

Citation Information

@inproceedings{singh2019towards,
    title={Towards VQA Models That Can Read},
    author={Singh, Amanpreet and Natarajan, Vivek and Shah, Meet and Jiang, Yu and Chen, Xinlei and Batra, Dhruv and Parikh, Devi and Rohrbach, Marcus},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    pages={8317-8326},
    year={2019}
}

Contributions

Thanks to @apsdehal for adding this dataset.
