BLASTX nr result

ID: Catharanthus23_contig00023095 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00023095
         (1181 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006363542.1| PREDICTED: uncharacterized protein LOC102594...   388   e-105
ref|XP_004237074.1| PREDICTED: uncharacterized protein LOC101258...   385   e-104
gb|EMJ24449.1| hypothetical protein PRUPE_ppa009381mg [Prunus pe...   367   5e-99
ref|XP_004168837.1| PREDICTED: uncharacterized LOC101208049 [Cuc...   366   9e-99
ref|XP_004135215.1| PREDICTED: uncharacterized protein LOC101208...   365   2e-98
ref|XP_002267988.1| PREDICTED: uncharacterized protein LOC100255...   364   4e-98
ref|XP_006438521.1| hypothetical protein CICLE_v10032274mg [Citr...   363   7e-98
ref|XP_002520192.1| conserved hypothetical protein [Ricinus comm...   363   7e-98
ref|XP_004298596.1| PREDICTED: uncharacterized protein LOC101310...   356   1e-95
gb|EOY00279.1| Cofactor assembly of complex C [Theobroma cacao]       355   2e-95
ref|XP_002311785.2| hypothetical protein POPTR_0008s19620g [Popu...   353   1e-94
ref|XP_006301638.1| hypothetical protein CARUB_v10022081mg [Caps...   349   1e-93
gb|EXB74736.1| hypothetical protein L484_011015 [Morus notabilis]     349   1e-93
ref|NP_176193.2| cofactor assembly of complex C [Arabidopsis tha...   347   5e-93
gb|ESW29543.1| hypothetical protein PHAVU_002G078400g [Phaseolus...   347   7e-93
ref|XP_006392225.1| hypothetical protein EUTSA_v10023990mg [Eutr...   346   9e-93
ref|XP_002888168.1| hypothetical protein ARALYDRAFT_475325 [Arab...   345   3e-92
ref|XP_003519789.2| PREDICTED: uncharacterized protein LOC100776...   341   4e-91
gb|AAD39336.1|AC007258_25 Hypothetical protein [Arabidopsis thal...   340   9e-91
ref|XP_004237075.1| PREDICTED: uncharacterized protein LOC101258...   333   8e-89

>ref|XP_006363542.1| PREDICTED: uncharacterized protein LOC102594748 [Solanum tuberosum]
          Length = 288

 Score =  388 bits (996), Expect = e-105
 Identities = 191/243 (78%), Positives = 214/243 (88%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRG KP+REW+A+WVS NDD VRS+PIYVGG+SLLAVLFNRTVSGIAPVADASSSQSRAD
Sbjct: 46  YRGMKPRREWIADWVSSNDDLVRSMPIYVGGLSLLAVLFNRTVSGIAPVADASSSQSRAD 105

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LLTLGLAVTNILNGLVWLSIRPKSIS V+P+G+ECQ I S+LPDFV+SELLWAW SLS V
Sbjct: 106 LLTLGLAVTNILNGLVWLSIRPKSISVVNPNGVECQRIASHLPDFVISELLWAWNSLSDV 165

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCC+SLVIVYD +CILQ G+AA S S            L++GSLYQG++KS SQSYLANL
Sbjct: 166 TCCKSLVIVYDGKCILQTGFAAASLSNGSDAVAVDSNKLIEGSLYQGVLKSASQSYLANL 225

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP KSELPFLPSNTQAVILQPLG KG+A+IGGD IRGFT+SDQAWI+L+GEKLDATL 
Sbjct: 226 SLYPGKSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTSSDQAWITLIGEKLDATLT 285

Query: 730 KVM 738
           KV+
Sbjct: 286 KVI 288


>ref|XP_004237074.1| PREDICTED: uncharacterized protein LOC101258059 isoform 1 [Solanum
           lycopersicum]
          Length = 280

 Score =  385 bits (989), Expect = e-104
 Identities = 189/243 (77%), Positives = 212/243 (87%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRG KP+REW+A+WVS NDD VRS+PIYVGG+SLLAVLFNRT+SGIAPVADASSSQSRAD
Sbjct: 38  YRGMKPRREWIADWVSNNDDLVRSMPIYVGGLSLLAVLFNRTLSGIAPVADASSSQSRAD 97

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LLTLGLAVTNILNGLVWLSIRPKSIS V+P G+ECQ I S+LPDFV+SELLWAW SLS V
Sbjct: 98  LLTLGLAVTNILNGLVWLSIRPKSISVVNPKGVECQRIASHLPDFVISELLWAWNSLSDV 157

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCCRSLV+VYD +CILQ G+AA S S             ++GSLYQG++KS SQSYLANL
Sbjct: 158 TCCRSLVVVYDGKCILQTGFAAASLSNGSDAVAVDSNKFIEGSLYQGVLKSASQSYLANL 217

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP KSELPFLPSNTQAVILQPLG KG+A+IGGD IRGFT+SDQAWI+L+GEKLDATL 
Sbjct: 218 SLYPGKSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTSSDQAWITLIGEKLDATLT 277

Query: 730 KVM 738
           KV+
Sbjct: 278 KVI 280


>gb|EMJ24449.1| hypothetical protein PRUPE_ppa009381mg [Prunus persica]
          Length = 295

 Score =  367 bits (942), Expect = 5e-99
 Identities = 186/245 (75%), Positives = 208/245 (84%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRGPKP+R WVA+WVS NDD VRSLPIY GG SLLAVLFNRTVS IA VADASSSQSRAD
Sbjct: 40  YRGPKPQRNWVADWVSNNDDAVRSLPIYAGGASLLAVLFNRTVSDIALVADASSSQSRAD 99

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LLTLGLAVTNIL GLVWLSIRPKSIS V+P+GIEC+ + S LP  ++SELLWAWESLS V
Sbjct: 100 LLTLGLAVTNILAGLVWLSIRPKSISKVNPEGIECERMHSNLPHTLLSELLWAWESLSDV 159

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCCRSLVIVYD RCILQIG+AA S++            LMQGSLY+G+MKS  QSYLANL
Sbjct: 160 TCCRSLVIVYDGRCILQIGFAAESSNGDGKAVSVDADKLMQGSLYRGVMKSGVQSYLANL 219

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
            LYP KSELPFLPSNTQAVILQPLG KG+ +IGGD +RGFT++DQAWISL+GEKLD+TLA
Sbjct: 220 YLYPGKSELPFLPSNTQAVILQPLGDKGITIIGGDTVRGFTSADQAWISLIGEKLDSTLA 279

Query: 730 KVMDD 744
           K +D+
Sbjct: 280 KYVDN 284


>ref|XP_004168837.1| PREDICTED: uncharacterized LOC101208049 [Cucumis sativus]
          Length = 300

 Score =  366 bits (940), Expect = 9e-99
 Identities = 185/244 (75%), Positives = 205/244 (84%)
 Frame = +1

Query: 1   QGDYRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQS 180
           Q  YRGPKP    VA+WVS NDD VRSLPIYVGGISLL VLFNR VSGIAPVADASSSQS
Sbjct: 41  QRSYRGPKPSTNLVADWVSNNDDTVRSLPIYVGGISLLIVLFNRAVSGIAPVADASSSQS 100

Query: 181 RADLLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESL 360
           RADLLTLGLAVTN+L GLVWLSIRPKSI+PV+P G+E + ICS LPD V SELLW W+SL
Sbjct: 101 RADLLTLGLAVTNVLAGLVWLSIRPKSITPVNPLGVENERICSSLPDRVTSELLWVWKSL 160

Query: 361 SFVTCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYL 540
           S VTCCRSLV+VYD +CI Q+G+AA SA             LMQGSLY+G++KS++QSYL
Sbjct: 161 SEVTCCRSLVVVYDGKCIFQVGFAAESAEGNGEAEHVEAGKLMQGSLYKGVLKSQAQSYL 220

Query: 541 ANLSLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDA 720
           ANLSLYP KSELPFLPSNTQAVILQPLG KG+A+IGG+ IRGFTTSDQAWIS VGEKLDA
Sbjct: 221 ANLSLYPGKSELPFLPSNTQAVILQPLGEKGIAIIGGNTIRGFTTSDQAWISFVGEKLDA 280

Query: 721 TLAK 732
           TL+K
Sbjct: 281 TLSK 284


>ref|XP_004135215.1| PREDICTED: uncharacterized protein LOC101208049 [Cucumis sativus]
          Length = 300

 Score =  365 bits (937), Expect = 2e-98
 Identities = 185/244 (75%), Positives = 204/244 (83%)
 Frame = +1

Query: 1   QGDYRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQS 180
           Q  YRGPKP    VA+WVS NDD VRSLPIYVGGISLL VLFNR VSGIAPVADASSSQS
Sbjct: 41  QRSYRGPKPSTNLVADWVSNNDDTVRSLPIYVGGISLLIVLFNRAVSGIAPVADASSSQS 100

Query: 181 RADLLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESL 360
           RADLLTLGLAVTN+L GLVWLSIRPKSI+PV+P G+E + ICS LPD V SELLW W+SL
Sbjct: 101 RADLLTLGLAVTNVLAGLVWLSIRPKSITPVNPLGVENERICSSLPDRVTSELLWVWKSL 160

Query: 361 SFVTCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYL 540
           S VTCCRSLV+VYD  CI Q+G+AA SA             LMQGSLY+G++KS++QSYL
Sbjct: 161 SEVTCCRSLVVVYDGTCIFQVGFAAESAEGNGEAEHVEAGKLMQGSLYKGVLKSQAQSYL 220

Query: 541 ANLSLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDA 720
           ANLSLYP KSELPFLPSNTQAVILQPLG KG+A+IGG+ IRGFTTSDQAWIS VGEKLDA
Sbjct: 221 ANLSLYPGKSELPFLPSNTQAVILQPLGEKGIAIIGGNTIRGFTTSDQAWISFVGEKLDA 280

Query: 721 TLAK 732
           TL+K
Sbjct: 281 TLSK 284


>ref|XP_002267988.1| PREDICTED: uncharacterized protein LOC100255235 [Vitis vinifera]
           gi|296086476|emb|CBI32065.3| unnamed protein product
           [Vitis vinifera]
          Length = 291

 Score =  364 bits (934), Expect = 4e-98
 Identities = 182/253 (71%), Positives = 211/253 (83%)
 Frame = +1

Query: 1   QGDYRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQS 180
           QG YRGPKP+R+W+A+WVS NDD VRSLPIYVGG+SLL+VLFNRT+SGIAPVADASSSQS
Sbjct: 35  QGRYRGPKPRRDWLADWVSNNDDTVRSLPIYVGGVSLLSVLFNRTISGIAPVADASSSQS 94

Query: 181 RADLLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESL 360
           RADLLTLGLAVTNIL GLVWLSIRPKSIS V+P G+E + I   +   +VSELLW WESL
Sbjct: 95  RADLLTLGLAVTNILAGLVWLSIRPKSISVVNPQGVESRRIYPNIAASLVSELLWVWESL 154

Query: 361 SFVTCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYL 540
           S VTCCRSLVI+YD  CILQIG AA S++            LMQGSLYQG+MKS +QSYL
Sbjct: 155 SVVTCCRSLVILYDSICILQIGMAAESSAGDGEAMAVDATKLMQGSLYQGVMKSGAQSYL 214

Query: 541 ANLSLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDA 720
           ANLSLYP KSELPFLPSNTQAVILQP+G KG+ +IGGD IRGFT SDQAWI+L+GEK+DA
Sbjct: 215 ANLSLYPGKSELPFLPSNTQAVILQPIGDKGIIIIGGDTIRGFTASDQAWITLIGEKVDA 274

Query: 721 TLAKVMDDISANV 759
           +L+K ++++   V
Sbjct: 275 SLSKYVNNLPVAV 287


>ref|XP_006438521.1| hypothetical protein CICLE_v10032274mg [Citrus clementina]
           gi|568860477|ref|XP_006483743.1| PREDICTED:
           uncharacterized protein LOC102610709 isoform X1 [Citrus
           sinensis] gi|557540717|gb|ESR51761.1| hypothetical
           protein CICLE_v10032274mg [Citrus clementina]
          Length = 294

 Score =  363 bits (932), Expect = 7e-98
 Identities = 184/253 (72%), Positives = 209/253 (82%)
 Frame = +1

Query: 1   QGDYRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQS 180
           QG YRGPKP ++ VA+WV  NDD VRSLPIYVGG SLLAVLFNRTVSGIAPVADASSSQS
Sbjct: 39  QGGYRGPKPSQDLVADWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQS 98

Query: 181 RADLLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESL 360
           RADLL + LAVT+IL GLVWLSIRPKSI+ V+P G+ECQ+ICSYLPD VVSELLWAWESL
Sbjct: 99  RADLLAISLAVTSILTGLVWLSIRPKSITVVNPKGVECQMICSYLPDSVVSELLWAWESL 158

Query: 361 SFVTCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYL 540
           S VTCCRSLV+VYD  C+LQIG AA S +            L QGS+Y G+M+SK+Q YL
Sbjct: 159 SAVTCCRSLVVVYDGICLLQIGMAAESGN-SGEAVIVDASKLTQGSVYLGVMRSKAQRYL 217

Query: 541 ANLSLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDA 720
           ANL LYP +SELPFLPSNTQAVILQPLG KG+A+IGGD IRGFTTSDQ+WI+ +GEKLDA
Sbjct: 218 ANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDA 277

Query: 721 TLAKVMDDISANV 759
           TLAK  + + + V
Sbjct: 278 TLAKYSNIVPSMV 290


>ref|XP_002520192.1| conserved hypothetical protein [Ricinus communis]
           gi|223540684|gb|EEF42247.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 336

 Score =  363 bits (932), Expect = 7e-98
 Identities = 186/244 (76%), Positives = 204/244 (83%)
 Frame = +1

Query: 1   QGDYRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQS 180
           QG YRGPKPKR+ VA+WVS NDD VRSLPIYVGG SLLAVLFNR  SGIAPVADASSSQS
Sbjct: 81  QGKYRGPKPKRDLVADWVSNNDDTVRSLPIYVGGASLLAVLFNRAASGIAPVADASSSQS 140

Query: 181 RADLLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESL 360
           RADLLTLGLAVTNIL GL+WLSI+PKSIS V+P G+ECQII S+LPD+VVSELLWAWESL
Sbjct: 141 RADLLTLGLAVTNILAGLIWLSIKPKSISLVNPQGVECQIILSHLPDYVVSELLWAWESL 200

Query: 361 SFVTCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYL 540
           S  TCCRSLV+VYD  C LQIG AA S +            LMQGSL Q + KS +QSYL
Sbjct: 201 SAATCCRSLVVVYDCVCFLQIGMAAESPN-KGEALSVDAAKLMQGSLVQAIKKSGAQSYL 259

Query: 541 ANLSLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDA 720
           ANLSLYP ++ELPFLP NTQAVILQPLG KGVA+IGGD IRGFTTSDQAWI+ +GEKLD+
Sbjct: 260 ANLSLYPGRTELPFLPLNTQAVILQPLGDKGVAIIGGDTIRGFTTSDQAWITFIGEKLDS 319

Query: 721 TLAK 732
           TLAK
Sbjct: 320 TLAK 323


>ref|XP_004298596.1| PREDICTED: uncharacterized protein LOC101310411 [Fragaria vesca
           subsp. vesca]
          Length = 293

 Score =  356 bits (913), Expect = 1e-95
 Identities = 183/248 (73%), Positives = 206/248 (83%)
 Frame = +1

Query: 7   DYRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRA 186
           DYRGPKPK   VA+WVS NDD  RSLPIY GG+SLLAVLFNRT S IA VADASSSQSRA
Sbjct: 39  DYRGPKPKTNLVADWVSNNDDAARSLPIYAGGVSLLAVLFNRTASDIALVADASSSQSRA 98

Query: 187 DLLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSF 366
           DLLTLGLAVTNIL GLVWLSIRPKSIS V P+G+EC+ + S LP+ +VSELLWAWESLS+
Sbjct: 99  DLLTLGLAVTNILAGLVWLSIRPKSISKVDPEGVECRRMRSDLPETLVSELLWAWESLSY 158

Query: 367 VTCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLAN 546
            TCCRSLVIVY  R +LQIG+AA +AS            L+QGSLY+G+MKS  QSYLAN
Sbjct: 159 ATCCRSLVIVYGGRNVLQIGFAA-AASDGGEAVAVDADKLIQGSLYRGVMKSGVQSYLAN 217

Query: 547 LSLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATL 726
           L+LYP KSELPFLPSNTQAVILQPLG  G+A+IGGD IRGFT SDQAWISL+GEKLDATL
Sbjct: 218 LALYPGKSELPFLPSNTQAVILQPLGDNGIAIIGGDTIRGFTASDQAWISLIGEKLDATL 277

Query: 727 AKVMDDIS 750
           A+ +D++S
Sbjct: 278 ARCVDNLS 285


>gb|EOY00279.1| Cofactor assembly of complex C [Theobroma cacao]
          Length = 289

 Score =  355 bits (912), Expect = 2e-95
 Identities = 177/249 (71%), Positives = 205/249 (82%)
 Frame = +1

Query: 1   QGDYRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQS 180
           +G  +GPKP R+W+A WVSK D+ VRSLPIYVGG SLL VLFNR VSGIAPVADASSSQS
Sbjct: 36  RGSDKGPKPSRDWIAHWVSKKDEAVRSLPIYVGGASLLTVLFNRAVSGIAPVADASSSQS 95

Query: 181 RADLLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESL 360
           RADLLTLGLAVT+IL GLVWLSI+PKSI+PV P G+ECQ+  S L ++VVSE+ WAWESL
Sbjct: 96  RADLLTLGLAVTSILTGLVWLSIQPKSITPVDPQGVECQVFYSQLSEWVVSEIFWAWESL 155

Query: 361 SFVTCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYL 540
           S +TCCRSLVI+YD +CI+QIG AA S +            LMQGSL  G++KS +Q YL
Sbjct: 156 STITCCRSLVIIYDCKCIVQIGAAAKSPN-DGEPVIVDAAKLMQGSLCIGVLKSGAQRYL 214

Query: 541 ANLSLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDA 720
           ANLSLYP +SELPFLPSNTQAVILQPLG KG+A++GGD IRGFTTSDQAWI+ +GEKLDA
Sbjct: 215 ANLSLYPGRSELPFLPSNTQAVILQPLGDKGIAILGGDTIRGFTTSDQAWITFIGEKLDA 274

Query: 721 TLAKVMDDI 747
           TLAK M D+
Sbjct: 275 TLAKFMSDM 283


>ref|XP_002311785.2| hypothetical protein POPTR_0008s19620g [Populus trichocarpa]
           gi|550333478|gb|EEE89152.2| hypothetical protein
           POPTR_0008s19620g [Populus trichocarpa]
          Length = 312

 Score =  353 bits (905), Expect = 1e-94
 Identities = 179/237 (75%), Positives = 199/237 (83%)
 Frame = +1

Query: 22  KPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRADLLTL 201
           KPKR+W A+W+S NDD VR  PI+ GG SLLAVL NRTV+GIAPVADASSSQSRADLLTL
Sbjct: 65  KPKRDWAADWISNNDDTVRGFPIFFGGASLLAVLVNRTVTGIAPVADASSSQSRADLLTL 124

Query: 202 GLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFVTCCR 381
           GLAVTNIL GLVWL+IRPKSIS V+P G+EC+ I S+LPDFVVSELLWAWESLS VTCCR
Sbjct: 125 GLAVTNILTGLVWLTIRPKSISVVNPLGVECRFIFSHLPDFVVSELLWAWESLSGVTCCR 184

Query: 382 SLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANLSLYP 561
           SLV+VYD RCILQIG AA S +            LMQGSLYQ  +KS SQSYLANLSLYP
Sbjct: 185 SLVVVYDCRCILQIGVAAESVN--NEALAVDAAKLMQGSLYQAAIKSASQSYLANLSLYP 242

Query: 562 AKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLAK 732
            +SELPFLP NTQAVILQPLG KG+ +IGGD IRGFT+SDQAWI+L+GEKL+ATLAK
Sbjct: 243 GRSELPFLPLNTQAVILQPLGDKGIVIIGGDTIRGFTSSDQAWITLIGEKLEATLAK 299


>ref|XP_006301638.1| hypothetical protein CARUB_v10022081mg [Capsella rubella]
           gi|482570348|gb|EOA34536.1| hypothetical protein
           CARUB_v10022081mg [Capsella rubella]
          Length = 297

 Score =  349 bits (896), Expect = 1e-93
 Identities = 174/241 (72%), Positives = 202/241 (83%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRGPKP +  VA+++S+NDD  RSLPIYVGG SLLAVLFNRTVSGIAPVADASSSQSRAD
Sbjct: 45  YRGPKPSKNLVADFISRNDDLARSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRAD 104

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LL LGLAVTN+L GLVWLSIRPKSI+PV+P+G+EC+++ S LP  +VSELLWAWESL   
Sbjct: 105 LLALGLAVTNLLTGLVWLSIRPKSITPVNPEGVECKVVESDLPTSIVSELLWAWESLKVA 164

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCC+SLVIVYD  C++QIG  A S              LMQGS+Y+G+MKSK+QSYLANL
Sbjct: 165 TCCKSLVIVYDGICLIQIGMVAESPE-DKKGVIVNTDKLMQGSVYRGVMKSKAQSYLANL 223

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP +SELPFLP+NTQAVILQPLG KG+AVIGG+ IRGFT+SDQAWIS +GEKLDATL 
Sbjct: 224 SLYPGRSELPFLPANTQAVILQPLGDKGIAVIGGNTIRGFTSSDQAWISAIGEKLDATLG 283

Query: 730 K 732
           +
Sbjct: 284 R 284


>gb|EXB74736.1| hypothetical protein L484_011015 [Morus notabilis]
          Length = 286

 Score =  349 bits (895), Expect = 1e-93
 Identities = 179/246 (72%), Positives = 200/246 (81%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRGPKP ++WVA+WVSKNDD VRSLPIYVGG+SLLAVLFNRT+SGIAPVADASSSQSRAD
Sbjct: 45  YRGPKPTKDWVADWVSKNDDLVRSLPIYVGGVSLLAVLFNRTLSGIAPVADASSSQSRAD 104

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LLTLGLAVTNIL GLV             P+GIECQI+   LP+ VVSELLW W+SLS V
Sbjct: 105 LLTLGLAVTNILAGLV------------DPEGIECQIVNPDLPNPVVSELLWFWQSLSDV 152

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCCRSLVIVY+  CI+Q+G+AA S              L+QGSLY G+MKS +QSYLANL
Sbjct: 153 TCCRSLVIVYNSSCIIQLGFAAESLKSDRKPASVDASKLVQGSLYHGVMKSGAQSYLANL 212

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP KSELPFLPSNTQA ILQPLG KG+AVIGGD IRGFTTSDQAWISL+GEKLDATLA
Sbjct: 213 SLYPGKSELPFLPSNTQAAILQPLGEKGIAVIGGDTIRGFTTSDQAWISLIGEKLDATLA 272

Query: 730 KVMDDI 747
           K +D++
Sbjct: 273 KYLDNL 278


>ref|NP_176193.2| cofactor assembly of complex C [Arabidopsis thaliana]
           gi|42571921|ref|NP_974051.1| cofactor assembly of
           complex C [Arabidopsis thaliana]
           gi|34146872|gb|AAQ62444.1| At1g59840 [Arabidopsis
           thaliana] gi|51968666|dbj|BAD43025.1| hypothetical
           protein [Arabidopsis thaliana]
           gi|51968878|dbj|BAD43131.1| hypothetical protein
           [Arabidopsis thaliana] gi|62320426|dbj|BAD94887.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332195505|gb|AEE33626.1| cofactor assembly of complex
           C [Arabidopsis thaliana] gi|332195506|gb|AEE33627.1|
           cofactor assembly of complex C [Arabidopsis thaliana]
          Length = 297

 Score =  347 bits (890), Expect = 5e-93
 Identities = 175/245 (71%), Positives = 203/245 (82%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           Y GPKP++  VA+++SKNDD VRSLPIYVGG SLLAVLFNRTVSGIAPVADASSSQSRAD
Sbjct: 45  YEGPKPRKNLVADFISKNDDLVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRAD 104

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LL LGLAVTN+L GLVWLSIRPKSI+PV+P G+EC+++ S LP  +VSELLWAWESL   
Sbjct: 105 LLALGLAVTNLLTGLVWLSIRPKSITPVNPKGVECKVVESDLPASMVSELLWAWESLKVA 164

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCC+SLVIVY+  C++QIG  A S              LMQGS+Y+G+MKSK+QSYLANL
Sbjct: 165 TCCKSLVIVYNGICLIQIGMVAESPE-DKKTVIVKTDKLMQGSVYRGVMKSKAQSYLANL 223

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP +SELPFLP+NTQAVILQPLG KG+AVIGG+ IRGFT+SDQAWIS +GEKLDATL 
Sbjct: 224 SLYPGRSELPFLPANTQAVILQPLGDKGIAVIGGNTIRGFTSSDQAWISSIGEKLDATLG 283

Query: 730 KVMDD 744
           +   D
Sbjct: 284 RYFVD 288


>gb|ESW29543.1| hypothetical protein PHAVU_002G078400g [Phaseolus vulgaris]
          Length = 304

 Score =  347 bits (889), Expect = 7e-93
 Identities = 169/241 (70%), Positives = 199/241 (82%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRGP PKR+++A+WVS+NDD VR+LPIYVG  SL AVL NR +SGIAPVA+A SSQSRAD
Sbjct: 53  YRGPTPKRDFLADWVSRNDDVVRTLPIYVGSASLFAVLINRALSGIAPVANAGSSQSRAD 112

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LLTLGLAVTNIL GLVWLSIRPKSI+ V+P G+EC+  C+ LP+  ++ELLW WESLS  
Sbjct: 113 LLTLGLAVTNILTGLVWLSIRPKSITVVNPRGVECKRFCTTLPEVALNELLWVWESLSDA 172

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCCRSLV+VY+  C+LQIG+AA S+             LMQGS+YQG+MKS +QSYLANL
Sbjct: 173 TCCRSLVVVYESSCVLQIGFAADSSPGDGEAVSVDVNKLMQGSVYQGVMKSGAQSYLANL 232

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP KSELPFLP NTQAVILQPLG KG+A+IGGD IRG+T SDQAWI+ +GEKLD+TLA
Sbjct: 233 SLYPGKSELPFLPLNTQAVILQPLGDKGIAIIGGDTIRGYTASDQAWITYIGEKLDSTLA 292

Query: 730 K 732
           K
Sbjct: 293 K 293


>ref|XP_006392225.1| hypothetical protein EUTSA_v10023990mg [Eutrema salsugineum]
           gi|557088731|gb|ESQ29511.1| hypothetical protein
           EUTSA_v10023990mg [Eutrema salsugineum]
          Length = 288

 Score =  346 bits (888), Expect = 9e-93
 Identities = 175/247 (70%), Positives = 203/247 (82%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRGPKP +  VA++VS+NDD VRSLPIYVGG SLLAVLFNR VSGIAPVADASSSQSRAD
Sbjct: 43  YRGPKPSKNLVADFVSRNDDSVRSLPIYVGGASLLAVLFNRAVSGIAPVADASSSQSRAD 102

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LL LGLAVTN+L GLVWLSIRPKSI+PV P+G+EC+++ S LP  +VSELLWAW SL   
Sbjct: 103 LLALGLAVTNLLTGLVWLSIRPKSITPVQPEGVECKVVESDLPASIVSELLWAWVSLKAA 162

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCC+SLVIVY+  C++QIG  A S              LMQGS+YQG+MKSK+QSYLANL
Sbjct: 163 TCCKSLVIVYNGICLIQIGMVAESPQ-DKKAIIVNTDKLMQGSVYQGVMKSKAQSYLANL 221

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP +SELPFLP+NTQAVILQPLG KG+AVIGG+ IRGFT+SDQAWIS +GEKLDATL 
Sbjct: 222 SLYPGRSELPFLPANTQAVILQPLGDKGIAVIGGNTIRGFTSSDQAWISSIGEKLDATLG 281

Query: 730 KVMDDIS 750
           +   D++
Sbjct: 282 RYFIDVA 288


>ref|XP_002888168.1| hypothetical protein ARALYDRAFT_475325 [Arabidopsis lyrata subsp.
           lyrata] gi|297334009|gb|EFH64427.1| hypothetical protein
           ARALYDRAFT_475325 [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score =  345 bits (884), Expect = 3e-92
 Identities = 175/245 (71%), Positives = 201/245 (82%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRGPKP +  VA+++SKNDD VRSLPIYVGG SLLAVLFNRTVSGIAPVADASSSQSRAD
Sbjct: 46  YRGPKPSKNLVADFISKNDDLVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRAD 105

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LL LGLAVTN+L GLVWLSIRPKSI+PV PDG+E +++ S LP  VVSELLWAWES    
Sbjct: 106 LLALGLAVTNLLTGLVWLSIRPKSITPVQPDGVEWKVVESGLPASVVSELLWAWESFKVA 165

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCC+SLVIVY+  C++QIG  A S              LMQGS+Y+G+MKSK+QSYLANL
Sbjct: 166 TCCKSLVIVYNGICLIQIGMVAESPE-DKKAVVVNTDKLMQGSVYRGVMKSKAQSYLANL 224

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP +SELPFLP+NTQAVILQPLG KG+AVIGG+ IRGFT++DQAWIS +GEKLDATL 
Sbjct: 225 SLYPGRSELPFLPANTQAVILQPLGDKGIAVIGGNTIRGFTSADQAWISSIGEKLDATLG 284

Query: 730 KVMDD 744
           +   D
Sbjct: 285 RYFID 289


>ref|XP_003519789.2| PREDICTED: uncharacterized protein LOC100776666 isoform X1 [Glycine
           max]
          Length = 300

 Score =  341 bits (874), Expect = 4e-91
 Identities = 165/243 (67%), Positives = 198/243 (81%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRGP PKR ++A+WVS+NDD VR+LPIYVG  SL A+L NR++SGIAPV DA SSQSRAD
Sbjct: 49  YRGPTPKRPFLADWVSQNDDVVRTLPIYVGSASLFAILLNRSLSGIAPVVDAGSSQSRAD 108

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LLTLGLAVTNIL GLVWLS++PKSI+ V+P G+EC+ +C+ LP+   +ELLW WESLS  
Sbjct: 109 LLTLGLAVTNILAGLVWLSVKPKSITVVNPRGVECKSLCATLPEVARNELLWVWESLSDA 168

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCCRSLV+VY+  C+LQIG+AA S+             LMQGS+YQG+MKS +QSYLANL
Sbjct: 169 TCCRSLVVVYESSCVLQIGFAAESSPGDGDAVSVDANKLMQGSVYQGVMKSGAQSYLANL 228

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP KSELPFLPSNTQAVILQPLG KG+A+IGGD IRGFT SDQ WI+ +G KLD++LA
Sbjct: 229 SLYPGKSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTGSDQVWITYIGAKLDSSLA 288

Query: 730 KVM 738
           K +
Sbjct: 289 KYL 291


>gb|AAD39336.1|AC007258_25 Hypothetical protein [Arabidopsis thaliana]
          Length = 305

 Score =  340 bits (871), Expect = 9e-91
 Identities = 175/253 (69%), Positives = 203/253 (80%), Gaps = 8/253 (3%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           Y GPKP++  VA+++SKNDD VRSLPIYVGG SLLAVLFNRTVSGIAPVADASSSQSRAD
Sbjct: 45  YEGPKPRKNLVADFISKNDDLVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRAD 104

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LL LGLAVTN+L GLVWLSIRPKSI+PV+P G+EC+++ S LP  +VSELLWAWESL   
Sbjct: 105 LLALGLAVTNLLTGLVWLSIRPKSITPVNPKGVECKVVESDLPASMVSELLWAWESLKVA 164

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSK-------- 525
           TCC+SLVIVY+  C++QIG  A S              LMQGS+Y+G+MKSK        
Sbjct: 165 TCCKSLVIVYNGICLIQIGMVAESPE-DKKTVIVKTDKLMQGSVYRGVMKSKARKNNDCL 223

Query: 526 SQSYLANLSLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVG 705
           S+SYLANLSLYP +SELPFLP+NTQAVILQPLG KG+AVIGG+ IRGFT+SDQAWIS +G
Sbjct: 224 SESYLANLSLYPGRSELPFLPANTQAVILQPLGDKGIAVIGGNTIRGFTSSDQAWISSIG 283

Query: 706 EKLDATLAKVMDD 744
           EKLDATL +   D
Sbjct: 284 EKLDATLGRYFVD 296


>ref|XP_004237075.1| PREDICTED: uncharacterized protein LOC101258059 isoform 2 [Solanum
           lycopersicum]
          Length = 260

 Score =  333 bits (854), Expect = 8e-89
 Identities = 172/243 (70%), Positives = 192/243 (79%)
 Frame = +1

Query: 10  YRGPKPKREWVAEWVSKNDDFVRSLPIYVGGISLLAVLFNRTVSGIAPVADASSSQSRAD 189
           YRG KP+REW+A+WVS NDD VRS+PIYVGG+SLLAVLFNRT+SGIAPVADASSSQSRAD
Sbjct: 38  YRGMKPRREWIADWVSNNDDLVRSMPIYVGGLSLLAVLFNRTLSGIAPVADASSSQSRAD 97

Query: 190 LLTLGLAVTNILNGLVWLSIRPKSISPVSPDGIECQIICSYLPDFVVSELLWAWESLSFV 369
           LLTLGLAVTNILNGLVWLSIRPKSIS                    V  +  AW SLS V
Sbjct: 98  LLTLGLAVTNILNGLVWLSIRPKSIS--------------------VPFVFRAWNSLSDV 137

Query: 370 TCCRSLVIVYDKRCILQIGYAAVSASXXXXXXXXXXXXLMQGSLYQGLMKSKSQSYLANL 549
           TCCRSLV+VYD +CILQ G+AA S S             ++GSLYQG++KS SQSYLANL
Sbjct: 138 TCCRSLVVVYDGKCILQTGFAAASLSNGSDAVAVDSNKFIEGSLYQGVLKSASQSYLANL 197

Query: 550 SLYPAKSELPFLPSNTQAVILQPLGSKGVAVIGGDIIRGFTTSDQAWISLVGEKLDATLA 729
           SLYP KSELPFLPSNTQAVILQPLG KG+A+IGGD IRGFT+SDQAWI+L+GEKLDATL 
Sbjct: 198 SLYPGKSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTSSDQAWITLIGEKLDATLT 257

Query: 730 KVM 738
           KV+
Sbjct: 258 KVI 260


Top