Datasets:
task
string
| input
string
| output
string
| options
sequence
| pageTitle
string
| outputColName
string
| url
string
| wdcFile
string
|
---|---|---|---|---|---|---|---|
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-001 [Transcript ID] ENST00000342085 [bp] 7241 [Protein] 556aa [Translation ID] ENSP00000344220 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS10472 [UniProt] O15530 [RefSeq] NM_002613 NP_002604 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL1 - APPRIS candidate principal isoform.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P1" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-002 [Transcript ID] ENST00000268673 [bp] 4728 [Protein] 429aa [Translation ID] ENSP00000268673 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS10473 [UniProt] O15530 [RefSeq] NM_031268 NP_112558 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-013 [Transcript ID] ENST00000441549 [bp] 1729 [Protein] 454aa [Translation ID] ENSP00000395357 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS58411 [UniProt] O15530 [RefSeq] NM_001261816 NP_001248745 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-003 [Transcript ID] ENST00000389224 [bp] 2416 [Protein] 529aa [Translation ID] ENSP00000373876 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] E9PER6 [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-008 [Transcript ID] ENST00000461815 [bp] 611 [Protein] 149aa [Translation ID] ENSP00000455551 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] H3BQ10 [RefSeq] - [Flags] " | "3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-015 [Transcript ID] ENST00000566659 [bp] 451 [Protein] 88aa [Translation ID] ENSP00000455492 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] H3BPW1 [RefSeq] - [Flags] " | "3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-006 [Transcript ID] ENST00000474706 [bp] 699 [Protein] 35aa [Translation ID] ENSP00000455025 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] H3BNV5 [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-007 [Transcript ID] ENST00000492021 [bp] 552 [Protein] 55aa [Translation ID] ENSP00000455684 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] H3BQA3 [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-012 [Transcript ID] ENST00000478708 [bp] 445 [Protein] 40aa [Translation ID] ENSP00000455438 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] H3BPR5 [RefSeq] - [Flags] " | "5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-011 [Transcript ID] ENST00000471311 [bp] 621 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-017 [Transcript ID] ENST00000561962 [bp] 570 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 4, for transcripts supported by an EST flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:4" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-009 [Transcript ID] ENST00000460496 [bp] 569 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 4, for transcripts supported by an EST flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:4" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-016 [Transcript ID] ENST00000462923 [bp] 550 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-010 [Transcript ID] ENST00000464702 [bp] 511 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-018 [Transcript ID] ENST00000569721 [bp] 330 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-005 [Transcript ID] ENST00000491073 [bp] 2387 [Protein] No protein [Translation ID] - [Biotype] Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"b54d194c_t__PDPK1_013__ENST00000441549___Flags" | "[Name] PDPK1-014 [Transcript ID] ENST00000570136 [bp] 1125 [Protein] No protein [Translation ID] - [Biotype] Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-001 [Transcript ID] ENST00000342085 [bp] 7241 [Protein] 556aa [Translation ID] ENSP00000344220 [CCDS] CCDS10472 [UniProt] O15530 [RefSeq] NM_002613 NP_002604 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL1 - APPRIS candidate principal isoform.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P1 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-002 [Transcript ID] ENST00000268673 [bp] 4728 [Protein] 429aa [Translation ID] ENSP00000268673 [CCDS] CCDS10473 [UniProt] O15530 [RefSeq] NM_031268 NP_112558 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-013 [Transcript ID] ENST00000441549 [bp] 1729 [Protein] 454aa [Translation ID] ENSP00000395357 [CCDS] CCDS58411 [UniProt] O15530 [RefSeq] NM_001261816 NP_001248745 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-003 [Transcript ID] ENST00000389224 [bp] 2416 [Protein] 529aa [Translation ID] ENSP00000373876 [CCDS] - [UniProt] E9PER6 [RefSeq] - [Flags] Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-008 [Transcript ID] ENST00000461815 [bp] 611 [Protein] 149aa [Translation ID] ENSP00000455551 [CCDS] - [UniProt] H3BQ10 [RefSeq] - [Flags] 3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-015 [Transcript ID] ENST00000566659 [bp] 451 [Protein] 88aa [Translation ID] ENSP00000455492 [CCDS] - [UniProt] H3BPW1 [RefSeq] - [Flags] 3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-006 [Transcript ID] ENST00000474706 [bp] 699 [Protein] 35aa [Translation ID] ENSP00000455025 [CCDS] - [UniProt] H3BNV5 [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5 [Biotype] " | "Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-007 [Transcript ID] ENST00000492021 [bp] 552 [Protein] 55aa [Translation ID] ENSP00000455684 [CCDS] - [UniProt] H3BQA3 [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5 [Biotype] " | "Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-012 [Transcript ID] ENST00000478708 [bp] 445 [Protein] 40aa [Translation ID] ENSP00000455438 [CCDS] - [UniProt] H3BPR5 [RefSeq] - [Flags] 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5 [Biotype] " | "Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-011 [Transcript ID] ENST00000471311 [bp] 621 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-017 [Transcript ID] ENST00000561962 [bp] 570 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 4, for transcripts supported by an EST flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:4 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-009 [Transcript ID] ENST00000460496 [bp] 569 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 4, for transcripts supported by an EST flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:4 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-016 [Transcript ID] ENST00000462923 [bp] 550 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-010 [Transcript ID] ENST00000464702 [bp] 511 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-018 [Transcript ID] ENST00000569721 [bp] 330 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-005 [Transcript ID] ENST00000491073 [bp] 2387 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1 [Biotype] " | "Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"7bf46dfc_t__PDPK1_013__ENST00000441549___Biotype" | "[Name] PDPK1-014 [Transcript ID] ENST00000570136 [bp] 1125 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2 [Biotype] " | "Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PDPK1-013 (ENST00000441549)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000140992;r=16:2587965-2653189;t=ENST00000441549" | "36/1438042988308.23_20150728002308-00100-ip-10-236-191-2_430498039_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-001 [Transcript ID] ENST00000368196 [bp] 2701 [Protein] 790aa [Translation ID] ENSP00000357179 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS30891 [UniProt] P04629 X5DR71 [RefSeq] NM_001012331 NP_001012331 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicALTERNATIVE2 - APPRIS candidate principal isoform that appears to be conserved in fewer than three tested non-primate species.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS ALT2" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-004 [Transcript ID] ENST00000392302 [bp] 2609 [Protein] 760aa [Translation ID] ENSP00000376120 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS30890 [UniProt] P04629 [RefSeq] NM_001007792 NP_001007793 [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-007 [Transcript ID] ENST00000524377 [bp] 2432 [Protein] 796aa [Translation ID] ENSP00000431418 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS1161 [UniProt] P04629 [RefSeq] NM_002529 NP_002520 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL3 - APPRIS candidate principal isoform (earliest CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P3" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-006 [Transcript ID] ENST00000358660 [bp] 2492 [Protein] 793aa [Translation ID] ENSP00000351486 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] J3KP20 [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicALTERNATIVE2 - APPRIS candidate principal isoform that appears to be conserved in fewer than three tested non-primate species.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS ALT2" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-002 [Transcript ID] ENST00000497019 [bp] 2508 [Protein] 388aa [Translation ID] ENSP00000436804 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] E9PQG0 [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-011 [Transcript ID] ENST00000531606 [bp] 642 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-008 [Transcript ID] ENST00000489021 [bp] 570 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 4, for transcripts supported by an EST flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:4" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-005 [Transcript ID] ENST00000530298 [bp] 3052 [Protein] No protein [Translation ID] - [Biotype] Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-010 [Transcript ID] ENST00000534682 [bp] 844 [Protein] No protein [Translation ID] - [Biotype] Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 4, for transcripts supported by an EST flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:4" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"1c8892c1_t__NTRK1_002__ENST00000497019___Flags" | "[Name] NTRK1-009 [Transcript ID] ENST00000533630 [bp] 645 [Protein] No protein [Translation ID] - [Biotype] Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3" | [] | "Ensembl genome browser 81: Homo sapiens - Summary - Transcript: NTRK1-002 (ENST00000497019)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000198400;r=1:156815679-156881849;t=ENST00000497019" | "36/1438042989443.69_20150728002309-00321-ip-10-236-191-2_428770504_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000378748 [bp] 3521 [Protein] 577aa [Translation ID] ENSP00000368022 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS7094 [UniProt] Q96CV9 [RefSeq] NM_001008211 NM_001008213 NP_001008212 NP_001008214 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL2 - APPRIS candidate principal isoform (CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P2 [Name] " | "OPTN-005" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000378747 [bp] 3400 [Protein] 577aa [Translation ID] ENSP00000368021 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS7094 [UniProt] Q96CV9 [RefSeq] NM_001008212 NP_001008213 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL2 - APPRIS candidate principal isoform (CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P2 [Name] " | "OPTN-008" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000378757 [bp] 3321 [Protein] 577aa [Translation ID] ENSP00000368032 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS7094 [UniProt] Q96CV9 [RefSeq] NM_021980 NP_068815 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL2 - APPRIS candidate principal isoform (CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P2 [Name] " | "OPTN-007" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000263036 [bp] 2464 [Protein] 577aa [Translation ID] ENSP00000263036 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS7094 [UniProt] Q96CV9 [RefSeq] - [Flags] Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL2 - APPRIS candidate principal isoform (CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P2 [Name] " | "OPTN-006" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000378752 [bp] 3488 [Protein] 571aa [Translation ID] ENSP00000368027 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] Q96CV9 [RefSeq] - [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicALTERNATIVE1 - APPRIS candidate principal isoform that is conserved in at least three tested non-primate species.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS ALT1 [Name] " | "OPTN-004" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000378764 [bp] 2498 [Protein] 571aa [Translation ID] ENSP00000368040 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] Q96CV9 [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicALTERNATIVE1 - APPRIS candidate principal isoform that is conserved in at least three tested non-primate species.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS ALT1 [Name] " | "OPTN-009" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000424614 [bp] 664 [Protein] 126aa [Translation ID] ENSP00000400356 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] H7C1H4 [RefSeq] - [Flags] 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Name] " | "OPTN-001" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000486862 [bp] 423 [Protein] 107aa [Translation ID] ENSP00000481473 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] A0A087WY28 [RefSeq] - [Flags] 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Name] " | "OPTN-012" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000430081 [bp] 848 [Protein] 62aa [Translation ID] ENSP00000414747 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] X6RKL2 [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5 [Name] " | "OPTN-011" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000482140 [bp] 667 [Protein] 58aa [Translation ID] ENSP00000484961 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] A0A087X2G2 [RefSeq] - [Flags] Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Name] " | "OPTN-002" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000469025 [bp] 642 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Name] " | "OPTN-003" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"d1939a7e_pt__OPTN_006__ENST00000263036___Name" | "[Transcript ID] ENST00000487935 [bp] 511 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Name] " | "OPTN-010" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Name" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-001 [Transcript ID] ENST00000308677 [bp] 2677 [Protein] 351aa [Translation ID] ENSP00000309591 [CCDS] CCDS12304 [UniProt] A0A024R7J0 P17612 [RefSeq] NM_002730 NP_002721 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL1 - APPRIS candidate principal isoform.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P1 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-003 [Transcript ID] ENST00000589994 [bp] 1127 [Protein] 343aa [Translation ID] ENSP00000466651 [CCDS] CCDS12305 [UniProt] P17612 [RefSeq] NM_207518 NP_997401 [Flags] Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-002 [Transcript ID] ENST00000590853 [bp] 1101 [Protein] 124aa [Translation ID] ENSP00000466976 [CCDS] - [UniProt] K7ENJ5 [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-006 [Transcript ID] ENST00000593092 [bp] 922 [Protein] 207aa [Translation ID] ENSP00000466289 [CCDS] - [UniProt] Q15136 [RefSeq] - [Flags] 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-010 [Transcript ID] ENST00000587372 [bp] 824 [Protein] 251aa [Translation ID] ENSP00000468352 [CCDS] - [UniProt] K7ERP6 [RefSeq] - [Flags] 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-008 [Transcript ID] ENST00000589284 [bp] 403 [Protein] 25aa [Translation ID] ENSP00000466660 [CCDS] - [UniProt] K7EMV1 [RefSeq] - [Flags] 3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incompleteTranscript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-005 [Transcript ID] ENST00000350356 [bp] 3105 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-007 [Transcript ID] ENST00000587533 [bp] 2095 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level Not Analysed. Pseudogenes, single exon transcripts, HLA, T-cell receptor and Ig transcripts are not analysed and therefore not given any of the TSL categories.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:NA [Biotype] " | "Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-004 [Transcript ID] ENST00000536649 [bp] 1524 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2 [Biotype] " | "Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"12b6ea4a___PRKACA_002__ENST00000590853___Biotype" | "[Name] PRKACA-009 [Transcript ID] ENST00000588209 [bp] 942 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron" | [] | "Ensembl genome browser 81: Homo sapiens - Exons - Transcript: PRKACA-002 (ENST00000590853)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/Exons?db=core;g=ENSG00000072062;r=19:14202848-14228544;t=ENST00000590853" | "36/1438042988598.68_20150728002308-00199-ip-10-236-191-2_423629085_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-001 [Transcript ID] ENST00000342085 [bp] 7241 [Protein] 556aa [Translation ID] ENSP00000344220 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS10472 [UniProt] O15530 [RefSeq] NM_002613 NP_002604 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL1 - APPRIS candidate principal isoform.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P1" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-002 [Transcript ID] ENST00000268673 [bp] 4728 [Protein] 429aa [Translation ID] ENSP00000268673 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS10473 [UniProt] O15530 [RefSeq] NM_031268 NP_112558 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-013 [Transcript ID] ENST00000441549 [bp] 1729 [Protein] 454aa [Translation ID] ENSP00000395357 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS58411 [UniProt] O15530 [RefSeq] NM_001261816 NP_001248745 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-003 [Transcript ID] ENST00000389224 [bp] 2416 [Protein] 529aa [Translation ID] ENSP00000373876 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] E9PER6 [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-008 [Transcript ID] ENST00000461815 [bp] 611 [Protein] 149aa [Translation ID] ENSP00000455551 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] H3BQ10 [RefSeq] - [Flags] " | "3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-015 [Transcript ID] ENST00000566659 [bp] 451 [Protein] 88aa [Translation ID] ENSP00000455492 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] H3BPW1 [RefSeq] - [Flags] " | "3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-006 [Transcript ID] ENST00000474706 [bp] 699 [Protein] 35aa [Translation ID] ENSP00000455025 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] H3BNV5 [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-007 [Transcript ID] ENST00000492021 [bp] 552 [Protein] 55aa [Translation ID] ENSP00000455684 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] H3BQA3 [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-012 [Transcript ID] ENST00000478708 [bp] 445 [Protein] 40aa [Translation ID] ENSP00000455438 [Biotype] Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay [CCDS] - [UniProt] H3BPR5 [RefSeq] - [Flags] " | "5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-011 [Transcript ID] ENST00000471311 [bp] 621 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-017 [Transcript ID] ENST00000561962 [bp] 570 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 4, for transcripts supported by an EST flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:4" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-009 [Transcript ID] ENST00000460496 [bp] 569 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 4, for transcripts supported by an EST flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:4" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-016 [Transcript ID] ENST00000462923 [bp] 550 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-010 [Transcript ID] ENST00000464702 [bp] 511 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-018 [Transcript ID] ENST00000569721 [bp] 330 [Protein] No protein [Translation ID] - [Biotype] Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-005 [Transcript ID] ENST00000491073 [bp] 2387 [Protein] No protein [Translation ID] - [Biotype] Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"b6dc9cb5_t__PDPK1_017__ENST00000561962___Flags" | "[Name] PDPK1-014 [Transcript ID] ENST00000570136 [bp] 1125 [Protein] No protein [Translation ID] - [Biotype] Alternatively spliced transcript that is believed to contain intronic sequence relative to other coding transcripts in a given locus.Retained intron [CCDS] - [UniProt] - [RefSeq] - [Flags] " | "Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2" | [] | "Ensembl genome browser 81: Homo sapiens - Not Available - Transcript: PDPK1-017 (ENST00000561962)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Sequence_Protein?db=core;g=ENSG00000140992;r=16:2600604-2615615;t=ENST00000561962" | "36/1438042988598.68_20150728002308-00065-ip-10-236-191-2_425130887_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-005 [Transcript ID] ENST00000378748 [bp] 3521 [Protein] 577aa [Translation ID] ENSP00000368022 [CCDS] CCDS7094 [UniProt] Q96CV9 [RefSeq] NM_001008211 NM_001008213 NP_001008212 NP_001008214 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL2 - APPRIS candidate principal isoform (CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P2 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-008 [Transcript ID] ENST00000378747 [bp] 3400 [Protein] 577aa [Translation ID] ENSP00000368021 [CCDS] CCDS7094 [UniProt] Q96CV9 [RefSeq] NM_001008212 NP_001008213 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL2 - APPRIS candidate principal isoform (CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P2 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-007 [Transcript ID] ENST00000378757 [bp] 3321 [Protein] 577aa [Translation ID] ENSP00000368032 [CCDS] CCDS7094 [UniProt] Q96CV9 [RefSeq] NM_021980 NP_068815 [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL2 - APPRIS candidate principal isoform (CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P2 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-006 [Transcript ID] ENST00000263036 [bp] 2464 [Protein] 577aa [Translation ID] ENSP00000263036 [CCDS] CCDS7094 [UniProt] Q96CV9 [RefSeq] - [Flags] Transcript Support Level 2, when transcripts are supported by multiple ESTs or by an mRNA flagged as suspect.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:2The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL2 - APPRIS candidate principal isoform (CCDS).APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P2 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-004 [Transcript ID] ENST00000378752 [bp] 3488 [Protein] 571aa [Translation ID] ENSP00000368027 [CCDS] - [UniProt] Q96CV9 [RefSeq] - [Flags] Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicALTERNATIVE1 - APPRIS candidate principal isoform that is conserved in at least three tested non-primate species.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS ALT1 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-009 [Transcript ID] ENST00000378764 [bp] 2498 [Protein] 571aa [Translation ID] ENSP00000368040 [CCDS] - [UniProt] Q96CV9 [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicALTERNATIVE1 - APPRIS candidate principal isoform that is conserved in at least three tested non-primate species.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS ALT1 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-001 [Transcript ID] ENST00000424614 [bp] 664 [Protein] 126aa [Translation ID] ENSP00000400356 [CCDS] - [UniProt] H7C1H4 [RefSeq] - [Flags] 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-012 [Transcript ID] ENST00000486862 [bp] 423 [Protein] 107aa [Translation ID] ENSP00000481473 [CCDS] - [UniProt] A0A087WY28 [RefSeq] - [Flags] 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incompleteTranscript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Genes and/or transcript that contains an open reading frame (ORF).Protein coding" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-011 [Transcript ID] ENST00000430081 [bp] 848 [Protein] 62aa [Translation ID] ENSP00000414747 [CCDS] - [UniProt] X6RKL2 [RefSeq] - [Flags] Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5 [Biotype] " | "Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-002 [Transcript ID] ENST00000482140 [bp] 667 [Protein] 58aa [Translation ID] ENSP00000484961 [CCDS] - [UniProt] A0A087X2G2 [RefSeq] - [Flags] Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Transcript is thought to undergo nonsense mediated decay, a process which detects nonsense mutations and prevents the expression of truncated or erroneous proteins. Nonsense mediated decay" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-003 [Transcript ID] ENST00000469025 [bp] 642 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"686cb3bc_pt__OPTN_006__ENST00000263036___Biotype" | "[Name] OPTN-010 [Transcript ID] ENST00000487935 [bp] 511 [Protein] No protein [Translation ID] - [CCDS] - [UniProt] - [RefSeq] - [Flags] Transcript Support Level 3, when transcripts are supported by a single EST only.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:3 [Biotype] " | "Transcripts that don't contain an open reading frame (ORF) and cannot be placed in one of the other categories.Processed transcript" | [] | "Ensembl genome browser 81: Homo sapiens - Protein summary - Transcript: OPTN-006 (ENST00000263036)" | "Biotype" | "http://www.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000123240;r=10:13099449-13136923;t=ENST00000263036" | "36/1438042988308.23_20150728002308-00283-ip-10-236-191-2_429208942_0.json" |
"a7245e00_t__PARK2_003__ENST00000338468___Flags" | "[Name] PARK2-004 [Transcript ID] ENST00000366898 [bp] 4180 [Protein] 465aa [Translation ID] ENSP00000355865 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS5281 [UniProt] O60260 X5DR79 [RefSeq] NM_004562 NP_004553 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basicPRINCIPAL1 - APPRIS candidate principal isoform.APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods.APPRIS P1" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PARK2-003 (ENST00000338468)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000185345;r=6:161771131-163148700;t=ENST00000338468" | "36/1438042988308.23_20150728002308-00054-ip-10-236-191-2_414618671_0.json" |
"a7245e00_t__PARK2_003__ENST00000338468___Flags" | "[Name] PARK2-005 [Transcript ID] ENST00000366897 [bp] 2877 [Protein] 437aa [Translation ID] ENSP00000355863 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS5282 [UniProt] O60260 [RefSeq] NM_013987 NP_054642 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PARK2-003 (ENST00000338468)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000185345;r=6:161771131-163148700;t=ENST00000338468" | "36/1438042988308.23_20150728002308-00054-ip-10-236-191-2_414618671_0.json" |
"a7245e00_t__PARK2_003__ENST00000338468___Flags" | "[Name] PARK2-006 [Transcript ID] ENST00000366896 [bp] 2514 [Protein] 316aa [Translation ID] ENSP00000355862 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] CCDS5283 [UniProt] O60260 [RefSeq] NM_013988 NP_054643 [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PARK2-003 (ENST00000338468)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000185345;r=6:161771131-163148700;t=ENST00000338468" | "36/1438042988308.23_20150728002308-00054-ip-10-236-191-2_414618671_0.json" |
"a7245e00_t__PARK2_003__ENST00000338468___Flags" | "[Name] PARK2-002 [Transcript ID] ENST00000366892 [bp] 1505 [Protein] 368aa [Translation ID] ENSP00000355858 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] B1AKC3 [RefSeq] - [Flags] " | "Transcript Support Level 5, for transcripts that are not supported at all by either an mRNA or an EST.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:5The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PARK2-003 (ENST00000338468)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000185345;r=6:161771131-163148700;t=ENST00000338468" | "36/1438042988308.23_20150728002308-00054-ip-10-236-191-2_414618671_0.json" |
"a7245e00_t__PARK2_003__ENST00000338468___Flags" | "[Name] PARK2-003 [Transcript ID] ENST00000338468 [bp] 1276 [Protein] 274aa [Translation ID] ENSP00000343589 [Biotype] Genes and/or transcript that contains an open reading frame (ORF).Protein coding [CCDS] - [UniProt] O60260 [RefSeq] - [Flags] " | "Transcript Support Level 1, when transcripts are supported by at least one non-suspect mRNA.The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users. The method relies on the primary data that can support full-length transcript structure: mRNA and EST alignments supplied by UCSC and Ensembl.TSL:1The GENCODE set is the gene set for human and mouse. GENCODE Basic is a subset of representative transcripts (splice variants).GENCODE basic" | [] | "Ensembl genome browser 81: Homo sapiens - ID History - Transcript: PARK2-003 (ENST00000338468)" | "Flags" | "http://www.ensembl.org/Homo_sapiens/Transcript/Idhistory/Protein?db=core;g=ENSG00000185345;r=6:161771131-163148700;t=ENST00000338468" | "36/1438042988308.23_20150728002308-00054-ip-10-236-191-2_414618671_0.json" |
Dataset Card for "UnpredicTable-ensembl-org" - Dataset of Few-shot Tasks from Tables
Dataset Summary
The UnpredicTable dataset consists of web tables formatted as few-shot tasks for fine-tuning language models to improve their few-shot performance.
There are several dataset versions available:
UnpredicTable-full: Starting from the initial WTC corpus of 50M tables, we apply our tables-to-tasks procedure to produce our resulting dataset, UnpredicTable-full, which comprises 413,299 tasks from 23,744 unique websites.
UnpredicTable-unique: This is the same as UnpredicTable-full but filtered to have a maximum of one task per website. UnpredicTable-unique contains exactly 23,744 tasks from 23,744 websites.
UnpredicTable-5k: This dataset contains 5k random tables from the full dataset.
UnpredicTable data subsets based on a manual human quality rating (please see our publication for details of the ratings):
UnpredicTable data subsets based on the website of origin:
- UnpredicTable-baseball-fantasysports-yahoo-com
- UnpredicTable-bulbapedia-bulbagarden-net
- UnpredicTable-cappex-com
- UnpredicTable-cram-com
- UnpredicTable-dividend-com
- UnpredicTable-dummies-com
- UnpredicTable-en-wikipedia-org
- UnpredicTable-ensembl-org
- UnpredicTable-gamefaqs-com
- UnpredicTable-mgoblog-com
- UnpredicTable-mmo-champion-com
- UnpredicTable-msdn-microsoft-com
- UnpredicTable-phonearena-com
- UnpredicTable-sittercity-com
- UnpredicTable-sporcle-com
- UnpredicTable-studystack-com
- UnpredicTable-support-google-com
- UnpredicTable-w3-org
- UnpredicTable-wiki-openmoko-org
- UnpredicTable-wkdu-org
UnpredicTable data subsets based on clustering (for the clustering details please see our publication):
- UnpredicTable-cluster00
- UnpredicTable-cluster01
- UnpredicTable-cluster02
- UnpredicTable-cluster03
- UnpredicTable-cluster04
- UnpredicTable-cluster05
- UnpredicTable-cluster06
- UnpredicTable-cluster07
- UnpredicTable-cluster08
- UnpredicTable-cluster09
- UnpredicTable-cluster10
- UnpredicTable-cluster11
- UnpredicTable-cluster12
- UnpredicTable-cluster13
- UnpredicTable-cluster14
- UnpredicTable-cluster15
- UnpredicTable-cluster16
- UnpredicTable-cluster17
- UnpredicTable-cluster18
- UnpredicTable-cluster19
- UnpredicTable-cluster20
- UnpredicTable-cluster21
- UnpredicTable-cluster22
- UnpredicTable-cluster23
- UnpredicTable-cluster24
- UnpredicTable-cluster25
- UnpredicTable-cluster26
- UnpredicTable-cluster27
- UnpredicTable-cluster28
- UnpredicTable-cluster29
- UnpredicTable-cluster-noise
Supported Tasks and Leaderboards
Since the tables come from the web, the distribution of tasks and topics is very broad. The shape of our dataset is very wide, i.e., we have 1000's of tasks, while each task has only a few examples, compared to most current NLP datasets which are very deep, i.e., 10s of tasks with many examples. This implies that our dataset covers a broad range of potential tasks, e.g., multiple-choice, question-answering, table-question-answering, text-classification, etc.
The intended use of this dataset is to improve few-shot performance by fine-tuning/pre-training on our dataset.
Languages
English
Dataset Structure
Data Instances
Each task is represented as a jsonline file and consists of several few-shot examples. Each example is a dictionary containing a field 'task', which identifies the task, followed by an 'input', 'options', and 'output' field. The 'input' field contains several column elements of the same row in the table, while the 'output' field is a target which represents an individual column of the same row. Each task contains several such examples which can be concatenated as a few-shot task. In the case of multiple choice classification, the 'options' field contains the possible classes that a model needs to choose from.
There are also additional meta-data fields such as 'pageTitle', 'title', 'outputColName', 'url', 'wdcFile'.
Data Fields
'task': task identifier
'input': column elements of a specific row in the table.
'options': for multiple choice classification, it provides the options to choose from.
'output': target column element of the same row as input.
'pageTitle': the title of the page containing the table.
'outputColName': output column name
'url': url to the website containing the table
'wdcFile': WDC Web Table Corpus file
Data Splits
The UnpredicTable datasets do not come with additional data splits.
Dataset Creation
Curation Rationale
Few-shot training on multi-task datasets has been demonstrated to improve language models' few-shot learning (FSL) performance on new tasks, but it is unclear which training tasks lead to effective downstream task adaptation. Few-shot learning datasets are typically produced with expensive human curation, limiting the scale and diversity of the training tasks available to study. As an alternative source of few-shot data, we automatically extract 413,299 tasks from diverse internet tables. We provide this as a research resource to investigate the relationship between training data and few-shot learning.
Source Data
Initial Data Collection and Normalization
We use internet tables from the English-language Relational Subset of the WDC Web Table Corpus 2015 (WTC). The WTC dataset tables were extracted from the July 2015 Common Crawl web corpus (http://webdatacommons.org/webtables/2015/EnglishStatistics.html). The dataset contains 50,820,165 tables from 323,160 web domains. We then convert the tables into few-shot learning tasks. Please see our publication for more details on the data collection and conversion pipeline.
Who are the source language producers?
The dataset is extracted from WDC Web Table Corpora.
Annotations
Annotation process
Manual annotation was only carried out for the UnpredicTable-rated-low, UnpredicTable-rated-medium, and UnpredicTable-rated-high data subsets to rate task quality. Detailed instructions of the annotation instructions can be found in our publication.
Who are the annotators?
Annotations were carried out by a lab assistant.
Personal and Sensitive Information
The data was extracted from WDC Web Table Corpora, which in turn extracted tables from the Common Crawl. We did not filter the data in any way. Thus any user identities or otherwise sensitive information (e.g., data that reveals racial or ethnic origins, sexual orientations, religious beliefs, political opinions or union memberships, or locations; financial or health data; biometric or genetic data; forms of government identification, such as social security numbers; criminal history, etc.) might be contained in our dataset.
Considerations for Using the Data
Social Impact of Dataset
This dataset is intended for use as a research resource to investigate the relationship between training data and few-shot learning. As such, it contains high- and low-quality data, as well as diverse content that may be untruthful or inappropriate. Without careful investigation, it should not be used for training models that will be deployed for use in decision-critical or user-facing situations.
Discussion of Biases
Since our dataset contains tables that are scraped from the web, it will also contain many toxic, racist, sexist, and otherwise harmful biases and texts. We have not run any analysis on the biases prevalent in our datasets. Neither have we explicitly filtered the content. This implies that a model trained on our dataset may potentially reflect harmful biases and toxic text that exist in our dataset.
Other Known Limitations
No additional known limitations.
Additional Information
Dataset Curators
Jun Shern Chan, Michael Pieler, Jonathan Jao, Jérémy Scheurer, Ethan Perez
Licensing Information
Apache 2.0
Citation Information
@misc{chan2022few,
author = {Chan, Jun Shern and Pieler, Michael and Jao, Jonathan and Scheurer, Jérémy and Perez, Ethan},
title = {Few-shot Adaptation Works with UnpredicTable Data},
publisher={arXiv},
year = {2022},
url = {https://arxiv.org/abs/2208.01009}
}
- Downloads last month
- 89