BLASTX nr result
ID: Catharanthus22_contig00015321
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00015321
         (1607 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                       (bits)  Value

ref|XP_006349054.1| PREDICTED: uncharacterized protein LOC102583...  328   4e-87
ref|XP_004250995.1| PREDICTED: uncharacterized protein LOC101266...  317   7e-84
ref|XP_002272430.2| PREDICTED: uncharacterized protein LOC100249...  315   4e-83
gb|EOY33633.1| Mitochondria isoform 2 [Theobroma cacao]              304   6e-80
gb|EOY33632.1| Mitochondria isoform 1 [Theobroma cacao] gi|50878...  304   6e-80
ref|XP_004145771.1| PREDICTED: uncharacterized protein LOC101222...  298   3e-78
ref|XP_004294051.1| PREDICTED: uncharacterized protein LOC101304...  296   2e-77
gb|EOY33635.1| Mitochondria isoform 4 [Theobroma cacao]              295   4e-77
ref|XP_006424400.1| hypothetical protein CICLE_v10029076mg [Citr...  289   2e-75
ref|XP_004487551.1| PREDICTED: uncharacterized protein LOC101494...  285   5e-74
ref|XP_002313557.1| hypothetical protein POPTR_0009s00700g [Popu...  278   6e-72
ref|XP_003542941.1| PREDICTED: uncharacterized protein LOC100793...  273   1e-70
gb|EPS73853.1| hypothetical protein M569_00916 [Genlisea aurea]      265   4e-68
gb|ESW19918.1| hypothetical protein PHAVU_006G166300g [Phaseolus...  264   9e-68
ref|XP_006838715.1| hypothetical protein AMTR_s00002p00251510 [A...  263   1e-67
ref|XP_006417714.1| hypothetical protein EUTSA_v10008429mg [Eutr...  259   2e-66
ref|XP_006349055.1| PREDICTED: uncharacterized protein LOC102583...  256   2e-65
ref|NP_172300.2| uncharacterized protein [Arabidopsis thaliana] ...  254   7e-65
ref|XP_002892448.1| hypothetical protein ARALYDRAFT_470883 [Arab...
254 7e-65 ref|XP_006304092.1| hypothetical protein CARUB_v10009984mg [Caps... 254 1e-64 >ref|XP_006349054.1| PREDICTED: uncharacterized protein LOC102583204 isoform X1 [Solanum tuberosum] Length = 272 Score = 328 bits (841), Expect = 4e-87 Identities = 166/252 (65%), Positives = 196/252 (77%), Gaps = 1/252 (0%) Frame = +3 Query: 72 MLRLKRLSSATT-AVLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAIEKE 248 ML+LKR SSA T A+LNS+ + E + S+I ++P +WTSNRF D YQLGNK AIEKE Sbjct: 1 MLKLKRFSSAVTPAILNSRKKFQREEKLVSLIALQNPYRWTSNRFFDIYQLGNKEAIEKE 60 Query: 249 RARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPIT 428 RARL DE+NRGYFADI E+K+HGGK VKFPALEV +SDGS LKLPIT Sbjct: 61 RARLKDEMNRGYFADINELKEHGGKIATANKIIIPAMVAVKFPALEVIHSDGSNLKLPIT 120 Query: 429 SGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWL 608 S G +K +A KASL+C+SFRA+SQAMIDSW+ PF+D F S +VQL+E+S IDSW Sbjct: 121 STGDGVEANKLEASKASLMCVSFRASSQAMIDSWSKPFLDTFKDSKRVQLYEISLIDSWF 180 Query: 609 LTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSF 788 LT +P+KKLLLR+MRKSNP +SKD L RQ+VYSFGDHYYFRKEL ILNLLTGY+FL+D F Sbjct: 181 LTLSPVKKLLLRMMRKSNPHESKDVLHRQIVYSFGDHYYFRKELKILNLLTGYMFLVDKF 240 Query: 789 GRIRWQGFGLAT 824 GRIRWQG GLAT Sbjct: 241 GRIRWQGSGLAT 252 >ref|XP_004250995.1| PREDICTED: uncharacterized protein LOC101266482 [Solanum lycopersicum] Length = 272 Score = 317 bits (813), Expect = 7e-84 Identities = 160/252 (63%), Positives = 194/252 (76%), Gaps = 1/252 (0%) Frame = +3 Query: 72 MLRLKRLSSATT-AVLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAIEKE 248 ML+LKR SSA + A+ NS+ + + + S+I ++P +W SNRFLD YQLGNK AIEKE Sbjct: 1 MLKLKRFSSAVSPAIFNSRNKFQRKEKLISLIALQNPYRWISNRFLDIYQLGNKEAIEKE 60 Query: 249 RARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPIT 428 RARL DE+ RGYFADI E+K+HGGK VKFPALEV +SDGS +KLPIT Sbjct: 61 RARLKDEMTRGYFADINELKEHGGKIATANKIIIPAMAAVKFPALEVIHSDGSNVKLPIT 120 Query: 429 SGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWL 608 S G +K +A KASL+CLSFRA+SQAMIDSW+ PF+D F S +VQL+E+S IDSW+ Sbjct: 121 
STGDGVEANKLEASKASLMCLSFRASSQAMIDSWSKPFLDTFKDSKRVQLYEISLIDSWV 180 Query: 609 LTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSF 788 LT +P+KKLLLR+MRKSNP +SKD + RQ+VYSFGDHYYFRKEL ILNLLTGY+FL+D F Sbjct: 181 LTLSPVKKLLLRMMRKSNPHESKDVVHRQIVYSFGDHYYFRKELKILNLLTGYMFLVDKF 240 Query: 789 GRIRWQGFGLAT 824 GRIRWQ GLAT Sbjct: 241 GRIRWQASGLAT 252 >ref|XP_002272430.2| PREDICTED: uncharacterized protein LOC100249926 [Vitis vinifera] gi|297734874|emb|CBI17108.3| unnamed protein product [Vitis vinifera] Length = 272 Score = 315 bits (807), Expect = 4e-83 Identities = 165/254 (64%), Positives = 186/254 (73%), Gaps = 4/254 (1%) Frame = +3 Query: 72 MLRLKRL----SSATTAVLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAI 239 MLRL RL +S ++ L S+ E ++PS + TS RFLD YQLGNK A Sbjct: 1 MLRLNRLILNSASTRSSTLLSRQLGSHEPPSLPLLPSHHLAHRTSTRFLDIYQLGNKEAF 60 Query: 240 EKERARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKL 419 EKERARLADE+NRGYFAD+ E KQHGGK +KFPALEVNYSDG LKL Sbjct: 61 EKERARLADEMNRGYFADMSEFKQHGGKIAMANKIIIPAMAAMKFPALEVNYSDGRSLKL 120 Query: 420 PITSGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFID 599 PI+S G E T K D PKASLLCLSFRA+SQAMIDSW+ PF D FS S VQL+EVSF+D Sbjct: 121 PISSHGNEAGTSKLDIPKASLLCLSFRASSQAMIDSWSKPFFDAFSDSKNVQLYEVSFVD 180 Query: 600 SWLLTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLL 779 SW L+ NPIK+LLLRIM+KS P D K LQRQ+VYSFGDHYYFRKEL ILNLLTGY+FL+ Sbjct: 181 SWFLSLNPIKRLLLRIMKKSKP-DGKSVLQRQIVYSFGDHYYFRKELKILNLLTGYMFLV 239 Query: 780 DSFGRIRWQGFGLA 821 D FGRIRWQGFGLA Sbjct: 240 DKFGRIRWQGFGLA 253 >gb|EOY33633.1| Mitochondria isoform 2 [Theobroma cacao] Length = 280 Score = 304 bits (779), Expect = 6e-80 Identities = 163/252 (64%), Positives = 183/252 (72%), Gaps = 1/252 (0%) Frame = +3 Query: 72 MLRLKRLSSATTA-VLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAIEKE 248 MLR+ R+ S T A +L K L E Q P + ++ +SNRFLD YQLGNK AIEKE Sbjct: 1 MLRVNRVVSQTRASILTCKQLLNHE-QKLLPFPPQHFARKSSNRFLDIYQLGNKEAIEKE 59 Query: 249 RARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPIT 428 
RARLADE+NRGYFADI E+KQHGGK VKFP LEV YSDG LKLPI Sbjct: 60 RARLADEMNRGYFADISELKQHGGKIAVANKIIIPTMAAVKFPGLEVTYSDGRTLKLPIV 119 Query: 429 SGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWL 608 S G +K PK SL+CLSFRA+SQ MID+W PF + FS+S VQL+EVSFIDSWL Sbjct: 120 SNGDRVDAEKLAVPKVSLVCLSFRASSQKMIDTWCTPFSEAFSNSKDVQLYEVSFIDSWL 179 Query: 609 LTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSF 788 L RNPIK+LLLR MRKS + KDALQRQ+VYSFGDHYYFRKEL ILNLLTGYIFLLD Sbjct: 180 LCRNPIKRLLLRTMRKSIDGE-KDALQRQIVYSFGDHYYFRKELKILNLLTGYIFLLDKL 238 Query: 789 GRIRWQGFGLAT 824 GR+RWQGFGLAT Sbjct: 239 GRVRWQGFGLAT 250 >gb|EOY33632.1| Mitochondria isoform 1 [Theobroma cacao] gi|508786378|gb|EOY33634.1| Mitochondria isoform 1 [Theobroma cacao] Length = 268 Score = 304 bits (779), Expect = 6e-80 Identities = 163/252 (64%), Positives = 183/252 (72%), Gaps = 1/252 (0%) Frame = +3 Query: 72 MLRLKRLSSATTA-VLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAIEKE 248 MLR+ R+ S T A +L K L E Q P + ++ +SNRFLD YQLGNK AIEKE Sbjct: 1 MLRVNRVVSQTRASILTCKQLLNHE-QKLLPFPPQHFARKSSNRFLDIYQLGNKEAIEKE 59 Query: 249 RARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPIT 428 RARLADE+NRGYFADI E+KQHGGK VKFP LEV YSDG LKLPI Sbjct: 60 RARLADEMNRGYFADISELKQHGGKIAVANKIIIPTMAAVKFPGLEVTYSDGRTLKLPIV 119 Query: 429 SGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWL 608 S G +K PK SL+CLSFRA+SQ MID+W PF + FS+S VQL+EVSFIDSWL Sbjct: 120 SNGDRVDAEKLAVPKVSLVCLSFRASSQKMIDTWCTPFSEAFSNSKDVQLYEVSFIDSWL 179 Query: 609 LTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSF 788 L RNPIK+LLLR MRKS + KDALQRQ+VYSFGDHYYFRKEL ILNLLTGYIFLLD Sbjct: 180 LCRNPIKRLLLRTMRKSIDGE-KDALQRQIVYSFGDHYYFRKELKILNLLTGYIFLLDKL 238 Query: 789 GRIRWQGFGLAT 824 GR+RWQGFGLAT Sbjct: 239 GRVRWQGFGLAT 250 >ref|XP_004145771.1| PREDICTED: uncharacterized protein LOC101222490 [Cucumis sativus] gi|449496215|ref|XP_004160075.1| PREDICTED: uncharacterized LOC101222490 [Cucumis sativus] Length = 272 Score = 298 bits (764), Expect = 3e-78 Identities = 151/224 
(67%), Positives = 170/224 (75%), Gaps = 2/224 (0%) Frame = +3 Query: 159 IIPSKSPSQWTSNRFLDFYQLGNKAAIEKERARLADELNRGYFADIKEMKQHGGKXXXXX 338 + PS+ +Q TSNRFLD YQLGNK AIEKERARLADE+NRGYFAD+ E+KQHGGK Sbjct: 31 VFPSQHLAQLTSNRFLDIYQLGNKTAIEKERARLADEINRGYFADMSELKQHGGKIAAAN 90 Query: 339 XXXXXXXXXVKFPALEVNYSDGSILKLPITSGGT--EPATDKPDAPKASLLCLSFRANSQ 512 VKFP EV+YSDG LKLPI S E + P A+LLCLSFRANSQ Sbjct: 91 KILIPAMAAVKFPEFEVSYSDGKTLKLPIKSDVNVIEGNSSPSGLPMATLLCLSFRANSQ 150 Query: 513 AMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWLLTRNPIKKLLLRIMRKSNPRDSKDALQR 692 AMIDSW+ F++ FS SN VQL+EVSFIDSW L RNPIKKLLLR+MRKS+ D+LQR Sbjct: 151 AMIDSWSASFLNAFSSSNNVQLYEVSFIDSWFLCRNPIKKLLLRLMRKSSGNAQNDSLQR 210 Query: 693 QVVYSFGDHYYFRKELNILNLLTGYIFLLDSFGRIRWQGFGLAT 824 Q+VYSFGDHYYFRKEL ILNLLTGY+FL+D GRIRWQGFGLAT Sbjct: 211 QIVYSFGDHYYFRKELKILNLLTGYVFLVDKLGRIRWQGFGLAT 254 >ref|XP_004294051.1| PREDICTED: uncharacterized protein LOC101304550 [Fragaria vesca subsp. vesca] Length = 273 Score = 296 bits (757), Expect = 2e-77 Identities = 147/217 (67%), Positives = 169/217 (77%), Gaps = 2/217 (0%) Frame = +3 Query: 180 SQWTSNRFLDFYQLGNKAAIEKERARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXX 359 +Q TSNRF DFY+LGNKAA+EKERARLADELNRGYFAD+ ++K+HGGK Sbjct: 39 AQRTSNRFFDFYKLGNKAAVEKERARLADELNRGYFADMDDLKKHGGKVSESNKILFPAM 98 Query: 360 XXVKFPALEVNYSDGSILKLPITSGGTEPATDK--PDAPKASLLCLSFRANSQAMIDSWT 533 VKFP LEV YSDG +KLP+ G E D D PKASL CLSFRA++Q MIDSW+ Sbjct: 99 VAVKFPDLEVTYSDGKAVKLPLRPNGNENVADANVSDLPKASLYCLSFRASAQGMIDSWS 158 Query: 534 VPFVDRFSHSNKVQLFEVSFIDSWLLTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFG 713 VPFVD FS S +VQLFEVSFID WLL R PIK+LLLRIM+K ++KDA++RQ+VYSFG Sbjct: 159 VPFVDAFSGSKEVQLFEVSFIDQWLLCRTPIKQLLLRIMKKPKHDENKDAVKRQIVYSFG 218 Query: 714 DHYYFRKELNILNLLTGYIFLLDSFGRIRWQGFGLAT 824 DHYYFRKEL ILNLLTGY+FLLD FGRIRWQG G+AT Sbjct: 219 DHYYFRKELRILNLLTGYVFLLDKFGRIRWQGSGVAT 255 >gb|EOY33635.1| Mitochondria isoform 4 [Theobroma cacao] Length = 264 Score = 295 bits (755), Expect = 4e-77 Identities = 161/252 (63%), Positives = 181/252 (71%), Gaps = 1/252 
(0%) Frame = +3 Query: 72 MLRLKRLSSATTA-VLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAIEKE 248 MLR+ R+ S T A +L K L E Q P + ++ +SNRFLD YQLGNK AIEKE Sbjct: 1 MLRVNRVVSQTRASILTCKQLLNHE-QKLLPFPPQHFARKSSNRFLDIYQLGNKEAIEKE 59 Query: 249 RARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPIT 428 RARLADE+NRGYFADI E+KQHG VKFP LEV YSDG LKLPI Sbjct: 60 RARLADEMNRGYFADISELKQHG----VANKIIIPTMAAVKFPGLEVTYSDGRTLKLPIV 115 Query: 429 SGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWL 608 S G +K PK SL+CLSFRA+SQ MID+W PF + FS+S VQL+EVSFIDSWL Sbjct: 116 SNGDRVDAEKLAVPKVSLVCLSFRASSQKMIDTWCTPFSEAFSNSKDVQLYEVSFIDSWL 175 Query: 609 LTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSF 788 L RNPIK+LLLR MRKS + KDALQRQ+VYSFGDHYYFRKEL ILNLLTGYIFLLD Sbjct: 176 LCRNPIKRLLLRTMRKSIDGE-KDALQRQIVYSFGDHYYFRKELKILNLLTGYIFLLDKL 234 Query: 789 GRIRWQGFGLAT 824 GR+RWQGFGLAT Sbjct: 235 GRVRWQGFGLAT 246 >ref|XP_006424400.1| hypothetical protein CICLE_v10029076mg [Citrus clementina] gi|567863494|ref|XP_006424401.1| hypothetical protein CICLE_v10029076mg [Citrus clementina] gi|557526334|gb|ESR37640.1| hypothetical protein CICLE_v10029076mg [Citrus clementina] gi|557526335|gb|ESR37641.1| hypothetical protein CICLE_v10029076mg [Citrus clementina] Length = 265 Score = 289 bits (740), Expect = 2e-75 Identities = 151/252 (59%), Positives = 183/252 (72%), Gaps = 1/252 (0%) Frame = +3 Query: 72 MLRLKRLSSATTA-VLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAIEKE 248 M R+ RL + T A ++ SK L E + + P + +S RFLD YQLGNK A+EKE Sbjct: 1 MFRINRLINQTRASLITSKQLLTHEHK---LFPQHYAKK-SSTRFLDIYQLGNKQAVEKE 56 Query: 249 RARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPIT 428 RARLADE+NRGYFAD+ E+K+HGGK VKFP L+V+YSD + LKLP+ Sbjct: 57 RARLADEMNRGYFADVAELKKHGGKIATANKIIIPALAAVKFPDLDVSYSDRTTLKLPVC 116 Query: 429 SGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWL 608 S G DK PK SL+CL+FRA+SQAM+DSW+ PF + FS S V L+EVSFIDSWL Sbjct: 117 SSGDVANADKAAIPKVSLVCLTFRASSQAMVDSWSSPFFEAFSDSKNVHLYEVSFIDSWL 176 Query: 609 
LTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSF 788 L R+PIK++LL+IMRKS ++ALQRQ+VYSFGDHYYFRKEL ILNLLTGYIFLLD F Sbjct: 177 LCRSPIKRILLKIMRKSKDA-GENALQRQIVYSFGDHYYFRKELKILNLLTGYIFLLDKF 235 Query: 789 GRIRWQGFGLAT 824 GRIRWQGFG+AT Sbjct: 236 GRIRWQGFGMAT 247 >ref|XP_004487551.1| PREDICTED: uncharacterized protein LOC101494939 [Cicer arietinum] Length = 265 Score = 285 bits (728), Expect = 5e-74 Identities = 149/252 (59%), Positives = 181/252 (71%), Gaps = 1/252 (0%) Frame = +3 Query: 72 MLRLKRLSSATTAVLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAIEKER 251 M LK+L+ + + NS K + P PS++ S+ T RF D ++LGNK AI KER Sbjct: 1 MQGLKQLTRRCSCIRNSIVPEKFHYSP----PSQNLSRLTPKRFFDLHKLGNKEAIAKER 56 Query: 252 ARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPITS 431 ARL DE+NRGYFAD+ E KQHGGK VKFP +EV++SDG +KLPI Sbjct: 57 ARLNDEMNRGYFADMAEFKQHGGKVAAANKVIIPAMAAVKFPDIEVSFSDGKTMKLPIRV 116 Query: 432 GGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWLL 611 +DK PKASL+CLSFRA SQ MI+SW+VPF FS+SN VQL++VSFIDSWLL Sbjct: 117 SDNPVDSDKSSVPKASLVCLSFRAISQEMINSWSVPFAKAFSNSNNVQLYQVSFIDSWLL 176 Query: 612 TRNPIKKLLLRIMRKSN-PRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSF 788 R+PIK+LLLR M+K N +SKDALQ ++VYSFGDHYYFRKEL ILNLLTGYIFLLD+F Sbjct: 177 CRSPIKRLLLRAMKKPNLNEESKDALQGKMVYSFGDHYYFRKELKILNLLTGYIFLLDNF 236 Query: 789 GRIRWQGFGLAT 824 GR+RWQGFG+AT Sbjct: 237 GRVRWQGFGVAT 248 >ref|XP_002313557.1| hypothetical protein POPTR_0009s00700g [Populus trichocarpa] gi|222849965|gb|EEE87512.1| hypothetical protein POPTR_0009s00700g [Populus trichocarpa] Length = 265 Score = 278 bits (710), Expect = 6e-72 Identities = 149/254 (58%), Positives = 176/254 (69%), Gaps = 3/254 (1%) Frame = +3 Query: 72 MLRLKRLSSATTAVLNSKYFLKPEFQPCSIIPSKSPSQW---TSNRFLDFYQLGNKAAIE 242 MLRL RL T + FL + Q ++ S Q T RFLD Y++GNKAAIE Sbjct: 1 MLRLNRLIKHTNKWTRTSTFLSSQQQTQAVFDSSVSCQHFFRTQIRFLDIYKIGNKAAIE 60 Query: 243 KERARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLP 422 KERARLADELNRGYFADI E K+HGGK VKFP ++VNYS+G+ LKLP Sbjct: 61 
KERARLADELNRGYFADISEFKKHGGKIAVANKIIIPAVAAVKFPDVKVNYSNGTSLKLP 120 Query: 423 ITSGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDS 602 I S G D A+L+CLSFRA+SQ MI+SW++PF++ F + V L+EVSFIDS Sbjct: 121 IRSDGNVVGAD------ATLMCLSFRASSQEMINSWSMPFLEAFRDAKNVHLYEVSFIDS 174 Query: 603 WLLTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLD 782 W L NPIKK+LLR+MRKS+ D DALQ+Q+VYSFGDHYY RK+L ILNLLTGYIFLLD Sbjct: 175 WFLCLNPIKKMLLRMMRKSD-TDGNDALQKQIVYSFGDHYYMRKDLRILNLLTGYIFLLD 233 Query: 783 SFGRIRWQGFGLAT 824 FGRIRW GFGLAT Sbjct: 234 KFGRIRWGGFGLAT 247 >ref|XP_003542941.1| PREDICTED: uncharacterized protein LOC100793428 isoform X1 [Glycine max] Length = 269 Score = 273 bits (699), Expect = 1e-70 Identities = 135/223 (60%), Positives = 163/223 (73%) Frame = +3 Query: 156 SIIPSKSPSQWTSNRFLDFYQLGNKAAIEKERARLADELNRGYFADIKEMKQHGGKXXXX 335 S +PS+ ++ T RF D +QLGNK AIEKERARLADE+ RGYFAD+ E K+H GK Sbjct: 29 SPVPSQHLARLTPKRFFDLHQLGNKEAIEKERARLADEMTRGYFADMAEFKKHAGKIAVA 88 Query: 336 XXXXXXXXXXVKFPALEVNYSDGSILKLPITSGGTEPATDKPDAPKASLLCLSFRANSQA 515 KFP EV+++DG +KLPI +DK PKASL+CLSFRA+SQ Sbjct: 89 NKLIIPAMVATKFPDFEVSFTDGKTMKLPIRVSDRAVDSDKSSVPKASLVCLSFRASSQE 148 Query: 516 MIDSWTVPFVDRFSHSNKVQLFEVSFIDSWLLTRNPIKKLLLRIMRKSNPRDSKDALQRQ 695 MI+SW+VPF + F SN V L++VSFIDSWLL R PIK+LLL M+K + +SKD LQ+Q Sbjct: 149 MINSWSVPFTEAFRKSNDVHLYQVSFIDSWLLCRAPIKRLLLWTMKKPSHHESKDTLQQQ 208 Query: 696 VVYSFGDHYYFRKELNILNLLTGYIFLLDSFGRIRWQGFGLAT 824 +VYSFGDHYYFRKEL ILNLLTGYIFLLD+FGR+RWQGFG AT Sbjct: 209 IVYSFGDHYYFRKELRILNLLTGYIFLLDNFGRVRWQGFGSAT 251 >gb|EPS73853.1| hypothetical protein M569_00916 [Genlisea aurea] Length = 260 Score = 265 bits (677), Expect = 4e-68 Identities = 142/250 (56%), Positives = 169/250 (67%) Frame = +3 Query: 75 LRLKRLSSATTAVLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKAAIEKERA 254 +RLK L T + + + + I ++PS+ SNRFLD YQ GNK AI KERA Sbjct: 1 MRLKGLQ-LTPRFFKAGFAINDVVESFGRISFENPSRLISNRFLDIYQFGNKEAILKERA 59 Query: 255 RLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPITSG 434 RL DE++RGYFADI 
EMKQHGGK +KFP L+V SD + LKLPITS Sbjct: 60 RLKDEMSRGYFADISEMKQHGGKIASANKIIVPITAALKFPTLQVCNSDRTNLKLPITSD 119 Query: 435 GTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWLLT 614 G D PKASLLCLSFRA SQ M+DSWT+PF++ F HS + L+EVSFIDSWLL Sbjct: 120 GRSW-----DVPKASLLCLSFRATSQGMVDSWTLPFLNTFGHSKHISLYEVSFIDSWLLC 174 Query: 615 RNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSFGR 794 +PIKKLLL+IM+K P ++ R +VYSFGDHY+FRKEL ILNLLTGY FL+D GR Sbjct: 175 SSPIKKLLLKIMKKPKPAEASH--HRHLVYSFGDHYHFRKELKILNLLTGYFFLVDDGGR 232 Query: 795 IRWQGFGLAT 824 IRWQGFG AT Sbjct: 233 IRWQGFGSAT 242 >gb|ESW19918.1| hypothetical protein PHAVU_006G166300g [Phaseolus vulgaris] Length = 275 Score = 264 bits (674), Expect = 9e-68 Identities = 143/257 (55%), Positives = 175/257 (68%), Gaps = 4/257 (1%) Frame = +3 Query: 66 AKMLRLKRL----SSATTAVLNSKYFLKPEFQPCSIIPSKSPSQWTSNRFLDFYQLGNKA 233 AKM+ LKRL SS V+ S + + + C + S+ ++ T RFLD +Q NK Sbjct: 4 AKMVGLKRLIRRRSSLRDYVVGS--VAEKDHRHCPL-HSQHLARLTPKRFLDLHQFVNKK 60 Query: 234 AIEKERARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSIL 413 AI +ERAR+ DE+ RGYFAD+ E KQHGGK VKFP EV++SDG + Sbjct: 61 AIAEERARIGDEMKRGYFADMAEFKQHGGKIGLASKVIIPAMVAVKFPDFEVSFSDGKTV 120 Query: 414 KLPITSGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSF 593 KLPI +DK PKASL+CLSFRANSQ MI+SW+VPF++ F S V L++VSF Sbjct: 121 KLPIRVSDFAVDSDKSSVPKASLVCLSFRANSQEMINSWSVPFLEAFRKSKGVHLYQVSF 180 Query: 594 IDSWLLTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIF 773 IDSWLL PIK+ LL M+K +SKD LQ+Q+VYSFGDHYYFRKEL ILNLLTGYIF Sbjct: 181 IDSWLLCLPPIKRFLLWTMKKPINDESKDTLQKQMVYSFGDHYYFRKELQILNLLTGYIF 240 Query: 774 LLDSFGRIRWQGFGLAT 824 LLD+FGR+RWQGFGLAT Sbjct: 241 LLDNFGRVRWQGFGLAT 257 >ref|XP_006838715.1| hypothetical protein AMTR_s00002p00251510 [Amborella trichopoda] gi|548841221|gb|ERN01284.1| hypothetical protein AMTR_s00002p00251510 [Amborella trichopoda] Length = 270 Score = 263 bits (673), Expect = 1e-67 Identities = 132/211 (62%), Positives = 156/211 (73%) Frame = +3 Query: 192 
SNRFLDFYQLGNKAAIEKERARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVK 371 S RF D Y+ GNK AI+KER RL+DE+NRGYFAD+ E+K+HGGK K Sbjct: 42 SPRFFDIYRFGNKEAIKKERERLSDEMNRGYFADMSELKKHGGKIGMANKTIVPSMVAKK 101 Query: 372 FPALEVNYSDGSILKLPITSGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDR 551 FPAL+V +S+G +KLPI E ++ P +SLLCL FRA+SQAMIDSW+ PF D Sbjct: 102 FPALDVEFSNGRKIKLPIAYEEKESNANQMAIPHSSLLCLHFRASSQAMIDSWSKPFEDA 161 Query: 552 FSHSNKVQLFEVSFIDSWLLTRNPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFR 731 FS+S VQL+EVSFIDSW L+ +PI+ LLLR MRKS+ K LQ+Q+VYSFGDHYYFR Sbjct: 162 FSNSRNVQLYEVSFIDSWFLSLSPIRSLLLRTMRKSDFDSEKQTLQKQMVYSFGDHYYFR 221 Query: 732 KELNILNLLTGYIFLLDSFGRIRWQGFGLAT 824 KEL ILNLLTGYIFLLD FGRIRWQGFGLAT Sbjct: 222 KELQILNLLTGYIFLLDRFGRIRWQGFGLAT 252 >ref|XP_006417714.1| hypothetical protein EUTSA_v10008429mg [Eutrema salsugineum] gi|557095485|gb|ESQ36067.1| hypothetical protein EUTSA_v10008429mg [Eutrema salsugineum] Length = 278 Score = 259 bits (663), Expect = 2e-66 Identities = 130/222 (58%), Positives = 159/222 (71%), Gaps = 1/222 (0%) Frame = +3 Query: 162 IPSKSPS-QWTSNRFLDFYQLGNKAAIEKERARLADELNRGYFADIKEMKQHGGKXXXXX 338 +PS+ P+ + T+ FLDFYQ GNK AI ERAR+ DE+NRGYFAD+K+ K+HGGK Sbjct: 40 LPSQMPALRSTTRSFLDFYQFGNKKAIADERARINDEMNRGYFADMKDFKEHGGKIAAAS 99 Query: 339 XXXXXXXXXVKFPALEVNYSDGSILKLPITSGGTEPATDKPDAPKASLLCLSFRANSQAM 518 +KFP L V +S+G ILKLPI S E + D PK SL+CLSFRA+SQ M Sbjct: 100 KTIIPAVSAMKFPELAVTFSNGKILKLPIASNSKEVNIESLDVPKVSLVCLSFRASSQEM 159 Query: 519 IDSWTVPFVDRFSHSNKVQLFEVSFIDSWLLTRNPIKKLLLRIMRKSNPRDSKDALQRQV 698 I SW+ PF++ F +QLFEVSFID WLL PIKKLLLR+++K N +S LQRQ+ Sbjct: 160 ISSWSKPFLESFGDRKDLQLFEVSFIDKWLLGLAPIKKLLLRVLQKPNNSES-SVLQRQI 218 Query: 699 VYSFGDHYYFRKELNILNLLTGYIFLLDSFGRIRWQGFGLAT 824 VYSFGDHY+FRK++ +LNLLTGYI LLD FGRIRWQGFG AT Sbjct: 219 VYSFGDHYHFRKQMKVLNLLTGYILLLDKFGRIRWQGFGKAT 260 >ref|XP_006349055.1| PREDICTED: uncharacterized protein LOC102583204 isoform X2 [Solanum tuberosum] Length = 205 Score = 256 bits (654), Expect = 2e-65 Identities = 125/185 (67%), Positives = 146/185 
(78%) Frame = +3 Query: 270 LNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPITSGGTEPA 449 +NRGYFADI E+K+HGGK VKFPALEV +SDGS LKLPITS G Sbjct: 1 MNRGYFADINELKEHGGKIATANKIIIPAMVAVKFPALEVIHSDGSNLKLPITSTGDGVE 60 Query: 450 TDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWLLTRNPIK 629 +K +A KASL+C+SFRA+SQAMIDSW+ PF+D F S +VQL+E+S IDSW LT +P+K Sbjct: 61 ANKLEASKASLMCVSFRASSQAMIDSWSKPFLDTFKDSKRVQLYEISLIDSWFLTLSPVK 120 Query: 630 KLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSFGRIRWQG 809 KLLLR+MRKSNP +SKD L RQ+VYSFGDHYYFRKEL ILNLLTGY+FL+D FGRIRWQG Sbjct: 121 KLLLRMMRKSNPHESKDVLHRQIVYSFGDHYYFRKELKILNLLTGYMFLVDKFGRIRWQG 180 Query: 810 FGLAT 824 GLAT Sbjct: 181 SGLAT 185 >ref|NP_172300.2| uncharacterized protein [Arabidopsis thaliana] gi|110736304|dbj|BAF00122.1| hypothetical protein [Arabidopsis thaliana] gi|332190141|gb|AEE28262.1| uncharacterized protein AT1G08220 [Arabidopsis thaliana] Length = 274 Score = 254 bits (649), Expect = 7e-65 Identities = 128/222 (57%), Positives = 156/222 (70%), Gaps = 1/222 (0%) Frame = +3 Query: 162 IPSKSPS-QWTSNRFLDFYQLGNKAAIEKERARLADELNRGYFADIKEMKQHGGKXXXXX 338 +PS+ P+ + T+ FLDFY+ GNK AIE ERARL DE+NRGYFAD+KE K+HGGK Sbjct: 36 LPSQMPALRSTTRSFLDFYKFGNKKAIEDERARLNDEMNRGYFADMKEFKEHGGKIAAAN 95 Query: 339 XXXXXXXXXVKFPALEVNYSDGSILKLPITSGGTEPATDKPDAPKASLLCLSFRANSQAM 518 +KFP L V +S+G LKLPI E T+ PK SL+CLSFRA+SQ M Sbjct: 96 KTIIPAASAIKFPVLAVTFSNGKSLKLPIAPNSNEVDTESLVVPKVSLVCLSFRASSQEM 155 Query: 519 IDSWTVPFVDRFSHSNKVQLFEVSFIDSWLLTRNPIKKLLLRIMRKSNPRDSKDALQRQV 698 I SW+ PF++ F + +QLFEVSFID WLL PI+KLLLR+++K N + LQRQV Sbjct: 156 ISSWSKPFLESFGNRKDLQLFEVSFIDKWLLGLAPIRKLLLRVLQKPN-NNENSVLQRQV 214 Query: 699 VYSFGDHYYFRKELNILNLLTGYIFLLDSFGRIRWQGFGLAT 824 Y+FGDHYYFRKE+ +LNLLTGYI LLD GRIRWQGFG AT Sbjct: 215 GYAFGDHYYFRKEIKVLNLLTGYILLLDKSGRIRWQGFGTAT 256 >ref|XP_002892448.1| hypothetical protein ARALYDRAFT_470883 [Arabidopsis lyrata subsp. lyrata] gi|297338290|gb|EFH68707.1| hypothetical protein ARALYDRAFT_470883 [Arabidopsis lyrata subsp. 
lyrata] Length = 274 Score = 254 bits (649), Expect = 7e-65 Identities = 133/249 (53%), Positives = 163/249 (65%), Gaps = 1/249 (0%) Frame = +3 Query: 81 LKRLSSATTAVLNSKYFLKPEFQPCSIIPSKSPS-QWTSNRFLDFYQLGNKAAIEKERAR 257 LK S +T + F ++ +PSK P+ + T+ FLDFYQ GNK AIE ER R Sbjct: 9 LKHRSVSTLNKHHQGVFSFTRYENHDSLPSKMPALRSTTRSFLDFYQFGNKKAIEDERTR 68 Query: 258 LADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPITSGG 437 L DE+NRGYFAD+KE K+HGGK +KFP L V YS+G L LPIT Sbjct: 69 LNDEMNRGYFADMKEFKEHGGKIAAANKILIPAASAMKFPVLAVTYSNGQRLNLPITPNS 128 Query: 438 TEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSWLLTR 617 E T+ PK SL+CLSFRA+SQ MI SW+ PF++ F + +QLFEVSFID WLL Sbjct: 129 NEVDTESLAVPKVSLVCLSFRASSQEMISSWSKPFLETFGNRKDLQLFEVSFIDKWLLGL 188 Query: 618 NPIKKLLLRIMRKSNPRDSKDALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLDSFGRI 797 PI+KLLLR+++K N + LQRQ VY+FGDHY FRK++ +LNLLTGYI LLD GRI Sbjct: 189 APIRKLLLRVLQKPN-NNENSVLQRQAVYAFGDHYNFRKQIKVLNLLTGYILLLDKSGRI 247 Query: 798 RWQGFGLAT 824 RWQGFG AT Sbjct: 248 RWQGFGTAT 256 >ref|XP_006304092.1| hypothetical protein CARUB_v10009984mg [Capsella rubella] gi|482572803|gb|EOA36990.1| hypothetical protein CARUB_v10009984mg [Capsella rubella] Length = 276 Score = 254 bits (648), Expect = 1e-64 Identities = 131/254 (51%), Positives = 176/254 (69%), Gaps = 2/254 (0%) Frame = +3 Query: 69 KMLRLKRLSSATTAVLNSKYFLKPEFQPCSIIPSKSPS-QWTSNRFLDFYQLGNKAAIEK 245 ++L+ + +S+ + + F+ ++ +P++ P+ + TS FLDFY+ GNK AIE Sbjct: 7 RILKHRSVSAFSYRNQHQGLFISTRYENHDSLPTQMPALRSTSRSFLDFYKFGNKKAIED 66 Query: 246 ERARLADELNRGYFADIKEMKQHGGKXXXXXXXXXXXXXXVKFPALEVNYSDGSILKLPI 425 ERARL DE+NRGYFAD+KE ++HGGK +KFPAL V +++G LPI Sbjct: 67 ERARLNDEMNRGYFADMKEFREHGGKIAAANKTIIPAVSAMKFPALAVTFANGESQTLPI 126 Query: 426 TSGGTEPATDKPDAPKASLLCLSFRANSQAMIDSWTVPFVDRFSHSNKVQLFEVSFIDSW 605 TS E T+ PK SL+CLSFRA+SQ MI SW+ PF++ F + +QLFEVSFID W Sbjct: 127 TSNNNEVNTESLAVPKLSLVCLSFRASSQEMISSWSKPFLESFGNRKDLQLFEVSFIDKW 186 Query: 606 LLTRNPIKKLLLRIMRKSNPRDSKD-ALQRQVVYSFGDHYYFRKELNILNLLTGYIFLLD 782 LL +PIKKLLLR+++K 
P++S++ LQRQVVY+FGDHY FRK++ +LNLLTGYI LLD Sbjct: 187 LLGLSPIKKLLLRVLQK--PKNSENHVLQRQVVYAFGDHYNFRKQIKVLNLLTGYILLLD 244 Query: 783 SFGRIRWQGFGLAT 824 GRIRWQGFG AT Sbjct: 245 KSGRIRWQGFGTAT 258
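
Note on the alignment statistics: every alignment header in this report carries counts of the form "Identities = 166/252 (65%), Positives = 196/252 (77%), Gaps = 1/252 (0%)". The sketch below is not part of the BLAST output; it is a minimal illustration of how such a header line could be parsed and the percentages recomputed from the counts. The names STATS_RE and parse_alignment_stats are hypothetical. The percentages printed in this report are consistent with truncation rather than rounding (166/252 is about 65.9% but is reported as 65%), so the sketch uses integer division; that is an inference from the numbers shown here, not documented behaviour.

```python
import re

# Hypothetical pattern for one legacy-BLAST alignment header, e.g.
# "Score = 328 bits (841), Expect = 4e-87 Identities = 166/252 (65%), ..."
# The "Gaps" field is optional because ungapped alignments omit it.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
)


def parse_alignment_stats(header: str) -> dict:
    """Extract score, E-value and match counts from one alignment header."""
    m = STATS_RE.search(header)
    if m is None:
        raise ValueError("no alignment statistics found in header")

    # Some legacy reports print very small E-values as "e-100";
    # prefix "1" so float() accepts them (assumption about the input).
    evalue_text = m["evalue"]
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text

    length = int(m["length"])
    ident = int(m["ident"])
    pos = int(m["pos"])
    gaps = int(m["gaps"]) if m["gaps"] else 0

    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(evalue_text),
        "identities": ident,
        "positives": pos,
        "gaps": gaps,
        "length": length,
        # Integer division truncates, matching the percentages in this report.
        "pct_identity": 100 * ident // length,
        "pct_positives": 100 * pos // length,
        "pct_gaps": 100 * gaps // length,
    }


if __name__ == "__main__":
    top_hit = (
        "Score = 328 bits (841), Expect = 4e-87 "
        "Identities = 166/252 (65%), Positives = 196/252 (77%), "
        "Gaps = 1/252 (0%) Frame = +3"
    )
    stats = parse_alignment_stats(top_hit)
    print(stats["pct_identity"], stats["pct_positives"], stats["pct_gaps"])
```

Applied to the top hit (XP_006349054.1), the recomputed values agree with the 65%, 77% and 0% printed in its header. For production parsing of BLAST reports, an established parser is likely a better choice than a hand-written regular expression.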