BLASTX nr result
ID: Catharanthus23_contig00015627
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00015627 (923 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EXB91557.1| hypothetical protein L484_016173 [Morus notabilis] 282 1e-73 gb|EMJ04821.1| hypothetical protein PRUPE_ppa003792mg [Prunus pe... 276 8e-72 ref|XP_006341381.1| PREDICTED: uncharacterized protein LOC102605... 273 7e-71 gb|EOY31262.1| Uncharacterized protein TCM_038232 [Theobroma cacao] 273 9e-71 ref|XP_002268417.1| PREDICTED: uncharacterized protein LOC100267... 272 1e-70 ref|XP_004235915.1| PREDICTED: uncharacterized protein LOC101263... 268 2e-69 ref|XP_003546591.1| PREDICTED: uncharacterized protein LOC100809... 268 2e-69 ref|XP_006587134.1| PREDICTED: uncharacterized protein LOC100797... 268 3e-69 ref|XP_006587133.1| PREDICTED: uncharacterized protein LOC100797... 268 3e-69 ref|XP_003533868.1| PREDICTED: uncharacterized protein LOC100797... 268 3e-69 ref|XP_006409707.1| hypothetical protein EUTSA_v10022640mg [Eutr... 267 4e-69 ref|XP_006453730.1| hypothetical protein CICLE_v10007959mg [Citr... 264 3e-68 ref|XP_004287238.1| PREDICTED: uncharacterized protein LOC101297... 263 7e-68 ref|XP_006473929.1| PREDICTED: uncharacterized protein LOC102612... 263 9e-68 ref|NP_178935.2| uncharacterized protein [Arabidopsis thaliana] ... 262 1e-67 gb|EOY25962.1| Uncharacterized protein TCM_027327 [Theobroma cacao] 261 2e-67 ref|XP_002885852.1| hypothetical protein ARALYDRAFT_899530 [Arab... 261 2e-67 ref|XP_006587132.1| PREDICTED: uncharacterized protein LOC100797... 261 3e-67 gb|EOY29411.1| Uncharacterized protein TCM_036954 [Theobroma cacao] 259 1e-66 ref|XP_004488469.1| PREDICTED: uncharacterized protein LOC101501... 258 2e-66 >gb|EXB91557.1| hypothetical protein L484_016173 [Morus notabilis] Length = 310 Score = 282 bits (722), Expect = 1e-73 Identities = 153/253 (60%), Positives = 182/253 (71%) Frame = -2 Query: 760 GGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKH 581 GGF D IV WE RSL E NSSL+LA RTRR+DPL+ Y GGWNIS+ H Sbjct: 41 GGF----DDIVRWETTRRSLAE-EAVDNSSLILAEGRTRRRDPLDGFTRYNGGWNISNSH 95 Query: 580 YFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFT 401 Y+AS AFTA PLF++A IWFV FG C R+ YGYSRTAYALSLILL LFT Sbjct: 96 YWASVAFTAVPLFVIAGIWFVVFGLSLSLICLCYCCCPREPYGYSRTAYALSLILLILFT 155 Query: 400 IATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLP 221 IA +VG IVLYTGQGK HDSTKDTL YVV Q+D TVENLRNVSDYL AAK+I VD + LP Sbjct: 156 IAAIVGCIVLYTGQGKLHDSTKDTLNYVVHQADVTVENLRNVSDYLAAAKRIGVDAVFLP 215 Query: 220 SDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLF 41 +V+++IDQ+ +KI+S++ TL T+ N +I+D L+ +R LII AAVMLFL LGFLF Sbjct: 216 EEVQNKIDQIQTKINSSATTLSKTTQDNSMQIQDGLDSMRLALIIVAAVMLFLAFLGFLF 275 Query: 40 SILGLQCLVSVLV 2 S+ GL+ LV +LV Sbjct: 276 SVFGLRVLVYILV 288 >gb|EMJ04821.1| hypothetical protein PRUPE_ppa003792mg [Prunus persica] Length = 548 Score = 276 bits (706), Expect = 8e-72 Identities = 152/254 (59%), Positives = 183/254 (72%), Gaps = 1/254 (0%) Frame = -2 Query: 760 GGFEELK-DGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDK 584 GG E + G V W+ + RSL E +NSSL+LA +RT RKDPL+ YTGGWNIS+ Sbjct: 43 GGESEYEYGGAVKWDTR-RSLAE-GTVQNSSLILAEKRTYRKDPLDGFKKYTGGWNISND 100 Query: 583 HYFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLF 404 HY+AS FTA P F++A +WFV FG C R+ YGYSRTAYALSLI L LF Sbjct: 101 HYWASVGFTAAPFFVIAGVWFVLFGLSLSFICLCYCCCPREPYGYSRTAYALSLIFLILF 160 Query: 403 TIATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISL 224 T+A +VG IVLYTGQGKFH ST +TL YVV Q+D+TVENLRN+S YL AAK+I VD + L Sbjct: 161 TLAAIVGCIVLYTGQGKFHSSTTNTLKYVVSQADTTVENLRNLSGYLGAAKRIGVDAVFL 220 Query: 223 PSDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFL 44 P+DV+S ID V +KI+SAS TL D+T+KN RI+D L+ +R LII AAVMLFL LGFL Sbjct: 221 PADVQSNIDNVLTKINSASNTLSDKTEKNSKRIQDGLDSMRLALIIVAAVMLFLAFLGFL 280 Query: 43 FSILGLQCLVSVLV 2 FSILG+Q LV LV Sbjct: 281 FSILGMQVLVYFLV 294 >ref|XP_006341381.1| PREDICTED: uncharacterized protein LOC102605696 [Solanum tuberosum] Length = 537 Score = 273 bits (698), Expect = 7e-71 Identities = 148/245 (60%), Positives = 180/245 (73%) Frame = -2 Query: 736 GIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKHYFASAAFT 557 G+VS + R L E N+SLVLAAERT R+DPLE+ +YYTGGWNIS+ HY+AS A+T Sbjct: 40 GVVS-SVSKRFLAEDNATGNASLVLAAERTHRRDPLEDRSYYTGGWNISNSHYWASVAYT 98 Query: 556 ATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFTIATVVGSI 377 +PLF++A WFVA G CRR YGYSRTAYALSLILLS FTIA ++GSI Sbjct: 99 GSPLFVIALFWFVACGISLLLVCICCCCCRRGRYGYSRTAYALSLILLSAFTIAAIIGSI 158 Query: 376 VLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLPSDVKSRID 197 LYTGQGKFHD TK TL YVV+Q+ STV+NLRNVS+ L AK + QI LP +V++ ID Sbjct: 159 FLYTGQGKFHDGTKHTLDYVVQQAGSTVDNLRNVSNILAVAKHTGISQIFLPQNVQNNID 218 Query: 196 QVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLFSILGLQCL 17 +V +KI SA+ TLE ET+KNK I +L++VR+ILII AAVML L LGFL SILGLQ + Sbjct: 219 KVDTKISSAAETLETETEKNKKNIMIVLDLVRRILIIVAAVMLGLAALGFLLSILGLQFI 278 Query: 16 VSVLV 2 V +LV Sbjct: 279 VYILV 283 >gb|EOY31262.1| Uncharacterized protein TCM_038232 [Theobroma cacao] Length = 551 Score = 273 bits (697), Expect = 9e-71 Identities = 147/270 (54%), Positives = 185/270 (68%), Gaps = 7/270 (2%) Frame = -2 Query: 790 ASPVANHHFSGGFEELKDG------IVSWE-MKTRSLLELRNGKNSSLVLAAERTRRKDP 632 AS V H SG ++ G ++ W+ ++ R+L E G NSSL+LA ERTRR+DP Sbjct: 30 ASNVRTLHISGYYKSFVFGEREHTEMLPWKKIERRNLAEGSEGDNSSLILAGERTRRRDP 89 Query: 631 LENLNYYTGGWNISDKHYFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYG 452 L+N YTGGWNIS++HY+AS FTA P F +AA+WFV F C+ NYG Sbjct: 90 LDNFKKYTGGWNISNEHYWASVGFTAAPFFAIAAVWFVIFALCLFIICIRHCCCQLDNYG 149 Query: 451 YSRTAYALSLILLSLFTIATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVS 272 YSRTAYALSLILL LFTIA +VG +VLYTGQGKFH STK+TL Y V ++D T E+LRNVS Sbjct: 150 YSRTAYALSLILLILFTIAAIVGCVVLYTGQGKFHGSTKNTLDYAVNKADVTAESLRNVS 209 Query: 271 DYLDAAKQIQVDQISLPSDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKIL 92 DYL AAK+I VD + L D++ ID + KI+S++ TL +T NK++I++ L+ +R L Sbjct: 210 DYLSAAKKISVDSVFLAPDIQKSIDDIEKKINSSATTLSTQTGDNKDKIQNGLDSMRLAL 269 Query: 91 IIAAAVMLFLTLLGFLFSILGLQCLVSVLV 2 II AA MLFL LGFLFSILGLQ LV LV Sbjct: 270 IIVAAAMLFLAFLGFLFSILGLQFLVYTLV 299 >ref|XP_002268417.1| PREDICTED: uncharacterized protein LOC100267143 [Vitis vinifera] gi|296084555|emb|CBI25576.3| unnamed protein product [Vitis vinifera] Length = 551 Score = 272 bits (696), Expect = 1e-70 Identities = 145/268 (54%), Positives = 186/268 (69%) Frame = -2 Query: 805 SSANGASPVANHHFSGGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLE 626 S + P+ GG E G+V W KTR L +G NSSL+LAA+RT RKDP + Sbjct: 33 SQLHHPQPLRVQEVFGGREN--GGLVPW--KTRRSLAEGSGDNSSLILAAKRTHRKDPSD 88 Query: 625 NLNYYTGGWNISDKHYFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYS 446 +YYTGGWNIS+ HY+AS ++TA P F++ IWFV FG CRR+ YGYS Sbjct: 89 GFSYYTGGWNISNGHYWASVSYTAVPFFVLGGIWFVLFGLCLSLICLCYCCCRREPYGYS 148 Query: 445 RTAYALSLILLSLFTIATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDY 266 RTAYALSLILL LFTIA ++G +VLYTGQGKFH ST TL YVV Q+++T ENL+NVS+Y Sbjct: 149 RTAYALSLILLILFTIAAIIGCVVLYTGQGKFHGSTTSTLGYVVDQAETTSENLKNVSEY 208 Query: 265 LDAAKQIQVDQISLPSDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILII 86 L AAK+I + LP++V++ ID +KI++++ TL+ +T+KN I+D+L+ VR LI+ Sbjct: 209 LSAAKRIGIGSSFLPANVQTNIDHAETKINASATTLDTKTQKNSKDIQDLLDAVRLALIV 268 Query: 85 AAAVMLFLTLLGFLFSILGLQCLVSVLV 2 AAVML L LGFLFSILGLQCLV LV Sbjct: 269 LAAVMLLLVFLGFLFSILGLQCLVYFLV 296 >ref|XP_004235915.1| PREDICTED: uncharacterized protein LOC101263891 [Solanum lycopersicum] Length = 536 Score = 268 bits (686), Expect = 2e-69 Identities = 146/245 (59%), Positives = 176/245 (71%) Frame = -2 Query: 736 GIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKHYFASAAFT 557 G+V+ + R L E NSSLVLAAERT R+DPLE+ +YYTGGWNIS+ HY+AS A+T Sbjct: 40 GVVT-SVSKRFLAENSATGNSSLVLAAERTHRRDPLEDRSYYTGGWNISNDHYWASVAYT 98 Query: 556 ATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFTIATVVGSI 377 PLF++A WF A G CRR YGYSR AYALSLI LS FTIA ++GSI Sbjct: 99 GAPLFIIALFWFFACGISLFLVCICCCCCRRGRYGYSRAAYALSLIFLSAFTIAAIIGSI 158 Query: 376 VLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLPSDVKSRID 197 LYTGQGKFHD TKDTL YVV+Q+ STV+NLRNVS L AK + QI LP +V++ ID Sbjct: 159 FLYTGQGKFHDGTKDTLDYVVQQAGSTVDNLRNVSKILAVAKHTGISQIFLPQNVQNNID 218 Query: 196 QVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLFSILGLQCL 17 +V +KI SA+ TLE ET+KNK I +L++VR+ILII AAVML L LGFL SILGLQ + Sbjct: 219 KVDTKISSAAETLETETEKNKKDIMVVLDLVRRILIIVAAVMLGLAALGFLLSILGLQFI 278 Query: 16 VSVLV 2 V +LV Sbjct: 279 VYILV 283 >ref|XP_003546591.1| PREDICTED: uncharacterized protein LOC100809927 [Glycine max] Length = 539 Score = 268 bits (685), Expect = 2e-69 Identities = 140/253 (55%), Positives = 176/253 (69%) Frame = -2 Query: 760 GGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKH 581 GG E ++ W+ + RS+ E N+SL+LA +RT RKDPL+N N YTGGWNIS++H Sbjct: 44 GGEGEHYHEVLPWKTR-RSMAEEEATSNASLILAQKRTTRKDPLDNFNRYTGGWNISNRH 102 Query: 580 YFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFT 401 Y AS FTA P F+VAA+WFV FG C R+ YGYSR AYALSLI L LFT Sbjct: 103 YIASVVFTAVPFFVVAAVWFVVFGLSLSLICLCYCCCPREPYGYSRLAYALSLIFLILFT 162 Query: 400 IATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLP 221 +A +VG ++LYT QGKFH ST TL YVV Q+D T ENLRNVS YLDAA++I VD + LP Sbjct: 163 LAAIVGCVLLYTAQGKFHGSTTSTLKYVVSQADFTAENLRNVSHYLDAAQKIGVDAVFLP 222 Query: 220 SDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLF 41 DV+ ID+V +KI+S++ L +TK+N I+D+++ +R L+I AAVMLFL LGFLF Sbjct: 223 GDVQKNIDEVQTKINSSAAELSSKTKENSETIKDVIDAMRLALVIVAAVMLFLAFLGFLF 282 Query: 40 SILGLQCLVSVLV 2 SI GLQ LV LV Sbjct: 283 SIFGLQGLVYFLV 295 >ref|XP_006587134.1| PREDICTED: uncharacterized protein LOC100797376 isoform X4 [Glycine max] Length = 433 Score = 268 bits (684), Expect = 3e-69 Identities = 139/253 (54%), Positives = 178/253 (70%) Frame = -2 Query: 760 GGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKH 581 GG E ++ W+ + RS+ E N+SL+LA RT RKDPL+N N+YTGGWNIS++H Sbjct: 46 GGEGEYYHEVLPWKTR-RSMAEEEATSNTSLILAQRRTTRKDPLDNYNHYTGGWNISNRH 104 Query: 580 YFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFT 401 Y AS FTA P F+VAA+WFV FG C R+ YGYSR AYALSLI L LFT Sbjct: 105 YIASVVFTAVPFFVVAAVWFVIFGLSLSFICLCYCCCPREPYGYSRLAYALSLIFLVLFT 164 Query: 400 IATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLP 221 +A +VG ++LYT QGKFH ST +TL +V Q+D T ENLRNVSDYLDAA++I V+ + LP Sbjct: 165 LAAIVGCVLLYTAQGKFHGSTTNTLKSLVSQADFTAENLRNVSDYLDAAQKIGVEAVFLP 224 Query: 220 SDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLF 41 +DV+ ID+V KI+S++ L +TKKN I+D+++ +R+ L+I AAVMLFL LGFLF Sbjct: 225 ADVQKNIDEVQMKINSSAADLSSKTKKNSETIKDVIDAMRRDLVILAAVMLFLAFLGFLF 284 Query: 40 SILGLQCLVSVLV 2 SI GLQ LV LV Sbjct: 285 SIFGLQGLVYFLV 297 >ref|XP_006587133.1| PREDICTED: uncharacterized protein LOC100797376 isoform X3 [Glycine max] Length = 464 Score = 268 bits (684), Expect = 3e-69 Identities = 139/253 (54%), Positives = 178/253 (70%) Frame = -2 Query: 760 GGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKH 581 GG E ++ W+ + RS+ E N+SL+LA RT RKDPL+N N+YTGGWNIS++H Sbjct: 46 GGEGEYYHEVLPWKTR-RSMAEEEATSNTSLILAQRRTTRKDPLDNYNHYTGGWNISNRH 104 Query: 580 YFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFT 401 Y AS FTA P F+VAA+WFV FG C R+ YGYSR AYALSLI L LFT Sbjct: 105 YIASVVFTAVPFFVVAAVWFVIFGLSLSFICLCYCCCPREPYGYSRLAYALSLIFLVLFT 164 Query: 400 IATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLP 221 +A +VG ++LYT QGKFH ST +TL +V Q+D T ENLRNVSDYLDAA++I V+ + LP Sbjct: 165 LAAIVGCVLLYTAQGKFHGSTTNTLKSLVSQADFTAENLRNVSDYLDAAQKIGVEAVFLP 224 Query: 220 SDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLF 41 +DV+ ID+V KI+S++ L +TKKN I+D+++ +R+ L+I AAVMLFL LGFLF Sbjct: 225 ADVQKNIDEVQMKINSSAADLSSKTKKNSETIKDVIDAMRRDLVILAAVMLFLAFLGFLF 284 Query: 40 SILGLQCLVSVLV 2 SI GLQ LV LV Sbjct: 285 SIFGLQGLVYFLV 297 >ref|XP_003533868.1| PREDICTED: uncharacterized protein LOC100797376 isoform X1 [Glycine max] Length = 541 Score = 268 bits (684), Expect = 3e-69 Identities = 139/253 (54%), Positives = 178/253 (70%) Frame = -2 Query: 760 GGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKH 581 GG E ++ W+ + RS+ E N+SL+LA RT RKDPL+N N+YTGGWNIS++H Sbjct: 46 GGEGEYYHEVLPWKTR-RSMAEEEATSNTSLILAQRRTTRKDPLDNYNHYTGGWNISNRH 104 Query: 580 YFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFT 401 Y AS FTA P F+VAA+WFV FG C R+ YGYSR AYALSLI L LFT Sbjct: 105 YIASVVFTAVPFFVVAAVWFVIFGLSLSFICLCYCCCPREPYGYSRLAYALSLIFLVLFT 164 Query: 400 IATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLP 221 +A +VG ++LYT QGKFH ST +TL +V Q+D T ENLRNVSDYLDAA++I V+ + LP Sbjct: 165 LAAIVGCVLLYTAQGKFHGSTTNTLKSLVSQADFTAENLRNVSDYLDAAQKIGVEAVFLP 224 Query: 220 SDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLF 41 +DV+ ID+V KI+S++ L +TKKN I+D+++ +R+ L+I AAVMLFL LGFLF Sbjct: 225 ADVQKNIDEVQMKINSSAADLSSKTKKNSETIKDVIDAMRRDLVILAAVMLFLAFLGFLF 284 Query: 40 SILGLQCLVSVLV 2 SI GLQ LV LV Sbjct: 285 SIFGLQGLVYFLV 297 >ref|XP_006409707.1| hypothetical protein EUTSA_v10022640mg [Eutrema salsugineum] gi|557110869|gb|ESQ51160.1| hypothetical protein EUTSA_v10022640mg [Eutrema salsugineum] Length = 532 Score = 267 bits (683), Expect = 4e-69 Identities = 138/246 (56%), Positives = 173/246 (70%), Gaps = 1/246 (0%) Frame = -2 Query: 736 GIVSWEMKTRS-LLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKHYFASAAF 560 G+ W + NG+NSSL+L+A+RT+RKDP EN YTGGWNIS+ HY++S A+ Sbjct: 33 GVEEWRTSVNERFMAEENGENSSLILSAKRTKRKDPTENFKLYTGGWNISNSHYWSSVAY 92 Query: 559 TATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFTIATVVGS 380 TA P ++AA+WFV FG C RQ YGYSR AYALSLILL FTIA VVG Sbjct: 93 TAMPFVVIAAVWFVFFGLSLSIICFCFCCCARQPYGYSRVAYALSLILLISFTIAAVVGC 152 Query: 379 IVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLPSDVKSRI 200 + LYTGQGKFH ST DTL YVVRQ++ T ENLRNVSDYL+AAK+I V I LP DV S I Sbjct: 153 VFLYTGQGKFHTSTSDTLDYVVRQANFTSENLRNVSDYLNAAKKIDVQSIVLPGDVLSSI 212 Query: 199 DQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLFSILGLQC 20 D + KI+S++ TL +T +N+ +I+++L+ +R LII AAVMLFL +GFL S+ GL+C Sbjct: 213 DNIQGKINSSATTLSVKTMENQEKIQNVLDNMRLALIIIAAVMLFLAFIGFLLSVFGLRC 272 Query: 19 LVSVLV 2 LV LV Sbjct: 273 LVYTLV 278 >ref|XP_006453730.1| hypothetical protein CICLE_v10007959mg [Citrus clementina] gi|557556956|gb|ESR66970.1| hypothetical protein CICLE_v10007959mg [Citrus clementina] Length = 536 Score = 264 bits (675), Expect = 3e-68 Identities = 141/245 (57%), Positives = 175/245 (71%) Frame = -2 Query: 736 GIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKHYFASAAFT 557 G+ W++K R L E NS+L+LA +RT+RKDP++N + Y GGWNIS+KHY+AS FT Sbjct: 51 GVSLWKLK-RYLAE-EPTDNSTLILAEKRTQRKDPIDNFHKYKGGWNISNKHYWASVGFT 108 Query: 556 ATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFTIATVVGSI 377 A P F++A IWFV FG CRR+ YGYSRT YALSLILL FTI+ +VG I Sbjct: 109 AAPFFIIAGIWFVVFGLSLCFMCLHYCCCRREPYGYSRTCYALSLILLVFFTISAIVGCI 168 Query: 376 VLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLPSDVKSRID 197 VLYTGQGKFH ST DTL YVV+Q+ T E+L+NVSDYL AAK I V+ ++L DV+S ID Sbjct: 169 VLYTGQGKFHSSTLDTLNYVVKQAHITSESLQNVSDYLAAAKTIGVNSVTLAPDVQSNID 228 Query: 196 QVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLFSILGLQCL 17 ++ KI+S++ TL +TKKN I+D L+ V LII AAVMLFL LGFLFS+ GLQCL Sbjct: 229 KIDRKINSSATTLSYQTKKNSKDIKDALDSVGLALIIVAAVMLFLAFLGFLFSVFGLQCL 288 Query: 16 VSVLV 2 V LV Sbjct: 289 VYFLV 293 >ref|XP_004287238.1| PREDICTED: uncharacterized protein LOC101297283 [Fragaria vesca subsp. vesca] Length = 542 Score = 263 bits (672), Expect = 7e-68 Identities = 143/253 (56%), Positives = 181/253 (71%) Frame = -2 Query: 760 GGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKH 581 GG E G+V + + RSL++ N +NSSL+LA ERTRR+DPL++ + YTGGWNIS+ H Sbjct: 39 GGVER---GVVGLDTR-RSLVD-ENLQNSSLILAEERTRRRDPLDDFHKYTGGWNISNDH 93 Query: 580 YFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFT 401 Y+AS A+TA P F+VA +WFV FG C R+ YGYSRTAYALSLILL FT Sbjct: 94 YWASVAWTAVPFFVVAGLWFVIFGLSLTLICFCYCCCPREPYGYSRTAYALSLILLIFFT 153 Query: 400 IATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLP 221 +A +VG IVLYTGQ KFH ST TL YVV Q+D+TVENLRN+S YL AAK+I VD + LP Sbjct: 154 VAAIVGCIVLYTGQAKFHSSTTKTLNYVVSQADTTVENLRNLSGYLGAAKRIGVDSVFLP 213 Query: 220 SDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLF 41 SDV+S ID +K++SA+ L + T+KN RI+D L+ +R LI+ AAVML L +GFLF Sbjct: 214 SDVQSNIDTTITKLNSAATKLSNTTEKNSKRIDDGLDSIRLALIVIAAVMLCLAFIGFLF 273 Query: 40 SILGLQCLVSVLV 2 S+LG+Q V LV Sbjct: 274 SVLGMQACVYFLV 286 >ref|XP_006473929.1| PREDICTED: uncharacterized protein LOC102612751 [Citrus sinensis] Length = 536 Score = 263 bits (671), Expect = 9e-68 Identities = 140/245 (57%), Positives = 175/245 (71%) Frame = -2 Query: 736 GIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKHYFASAAFT 557 G+ W++K R L E NS+L+LA +RT+RKDP++N + Y GGWNIS+KHY+AS FT Sbjct: 51 GVSLWKLK-RYLAE-EPTDNSTLILAEKRTQRKDPIDNFHKYKGGWNISNKHYWASVGFT 108 Query: 556 ATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFTIATVVGSI 377 A P F++A IWFV FG CRR+ YGYSRT YALSLILL FTI+ +VG I Sbjct: 109 AAPFFIIAGIWFVVFGLSLCFICLHYCCCRREPYGYSRTCYALSLILLVFFTISAIVGCI 168 Query: 376 VLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLPSDVKSRID 197 VLYTGQGKFH ST DTL YVV+Q+ T E+L+NVSDYL AAK I V+ ++L DV+S ID Sbjct: 169 VLYTGQGKFHSSTLDTLNYVVKQAHITSESLQNVSDYLAAAKTIGVNSVTLAPDVQSNID 228 Query: 196 QVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLFSILGLQCL 17 ++ KI+S++ TL +TKKN I+D L+ V LII AAV+LFL LGFLFS+ GLQCL Sbjct: 229 KIDRKINSSATTLSYQTKKNSKDIKDALDSVGLALIIVAAVILFLAFLGFLFSVFGLQCL 288 Query: 16 VSVLV 2 V LV Sbjct: 289 VYFLV 293 >ref|NP_178935.2| uncharacterized protein [Arabidopsis thaliana] gi|25083238|gb|AAN72054.1| unknown protein [Arabidopsis thaliana] gi|330251102|gb|AEC06196.1| uncharacterized protein AT2G12400 [Arabidopsis thaliana] Length = 541 Score = 262 bits (670), Expect = 1e-67 Identities = 136/260 (52%), Positives = 176/260 (67%) Frame = -2 Query: 781 VANHHFSGGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGG 602 V N G EE + ++ ++ +G+NSSL+LAA+RTRRKDP +N YTGG Sbjct: 34 VGNEEERGVEEEWRTSVIE------RVIAEESGENSSLILAAKRTRRKDPADNFKLYTGG 87 Query: 601 WNISDKHYFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSL 422 WNIS+ HY S +TA P ++A +WFV FG C RQ+YGYSR AYALSL Sbjct: 88 WNISNSHYLTSVGYTAAPFIIIALVWFVFFGLSLSLICLCYCCCARQSYGYSRVAYALSL 147 Query: 421 ILLSLFTIATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQ 242 ILL FTIA ++G + LYTGQGKFH ST DTL YVV Q++ T ENLRNVSDYL+AAK++ Sbjct: 148 ILLISFTIAAIIGCVFLYTGQGKFHASTTDTLDYVVSQANLTSENLRNVSDYLNAAKKVD 207 Query: 241 VDQISLPSDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFL 62 V LP DV S ID + KI+S++ TL +T +N+++I+++L+I+R L+I AAVMLFL Sbjct: 208 VQSSILPQDVLSSIDNIQGKINSSATTLSVKTMENQDKIQNVLDIMRLALVIIAAVMLFL 267 Query: 61 TLLGFLFSILGLQCLVSVLV 2 +GFL SI GLQCLV LV Sbjct: 268 AFIGFLLSIFGLQCLVYTLV 287 >gb|EOY25962.1| Uncharacterized protein TCM_027327 [Theobroma cacao] Length = 449 Score = 261 bits (668), Expect = 2e-67 Identities = 141/269 (52%), Positives = 186/269 (69%), Gaps = 1/269 (0%) Frame = -2 Query: 805 SSANGASPVANHHFSGGFEELKDGIVSWEMKTRSL-LELRNGKNSSLVLAAERTRRKDPL 629 S A+ + V N G E K G+VS M+ + + ++ KNS+LVLAAERT RKDPL Sbjct: 26 SRASLSYHVPNKPNPGVMGESKYGVVSDGMRRSVVGVYVKAMKNSTLVLAAERTHRKDPL 85 Query: 628 ENLNYYTGGWNISDKHYFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGY 449 +N NYY GGWNIS+KHYF+S FTA PLFL+AA WF+ FG CRRQ+Y Y Sbjct: 86 DNFNYYKGGWNISEKHYFSSVGFTAAPLFLIAAFWFLGFGMCLLVITLCHCCCRRQHYDY 145 Query: 448 SRTAYALSLILLSLFTIATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSD 269 S+T Y LSLI L+LFTIA V+G IVLY GQGKFH ST TL YVV Q+D+TV+ L+NVS+ Sbjct: 146 SQTIYLLSLIFLTLFTIAAVIGCIVLYVGQGKFHTSTTVTLEYVVEQADTTVDKLKNVSE 205 Query: 268 YLDAAKQIQVDQISLPSDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILI 89 YL+AAKQIQV+QI LP +++ I++V KI+ ++ LE ++K+N +I +L+ V LI Sbjct: 206 YLEAAKQIQVNQIFLPPNIQGNIERVDKKINDSAKILERKSKENSEKIRHVLDYVSLALI 265 Query: 88 IAAAVMLFLTLLGFLFSILGLQCLVSVLV 2 I A+VML + LGF FS+ G++ V +LV Sbjct: 266 IIASVMLLMAFLGFSFSVSGMRFCVYILV 294 >ref|XP_002885852.1| hypothetical protein ARALYDRAFT_899530 [Arabidopsis lyrata subsp. lyrata] gi|297331692|gb|EFH62111.1| hypothetical protein ARALYDRAFT_899530 [Arabidopsis lyrata subsp. lyrata] Length = 540 Score = 261 bits (668), Expect = 2e-67 Identities = 134/253 (52%), Positives = 174/253 (68%) Frame = -2 Query: 760 GGFEELKDGIVSWEMKTRSLLELRNGKNSSLVLAAERTRRKDPLENLNYYTGGWNISDKH 581 GG EE + ++ ++ +G+NSSL+LAA+RT+RKDP +N +YTGGWNIS+ H Sbjct: 40 GGVEEWRTSVIE------RVIAEESGENSSLILAAKRTKRKDPNDNFKFYTGGWNISNSH 93 Query: 580 YFASAAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFT 401 Y S +TA P ++A +WFV FG C RQ YGYSR AYALSLILL FT Sbjct: 94 YLTSVGYTAVPFIIIAVVWFVFFGLSLSLICLCYCCCARQPYGYSRVAYALSLILLISFT 153 Query: 400 IATVVGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLP 221 IA ++G I LYTGQGKFH ST DTL YVV Q++ T ENLRNVSDYL+AAK++ V LP Sbjct: 154 IAAIIGCIFLYTGQGKFHASTTDTLEYVVSQANLTSENLRNVSDYLNAAKKVDVQSSILP 213 Query: 220 SDVKSRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLF 41 DV S ID + KI+S++ TL +T +N+++I+++L+ +R L+I AAVMLFL +GFL Sbjct: 214 QDVLSSIDNIQGKINSSATTLSVKTMENQDKIQNVLDSMRLALVIIAAVMLFLAFIGFLL 273 Query: 40 SILGLQCLVSVLV 2 SI GLQCLV LV Sbjct: 274 SIFGLQCLVYTLV 286 >ref|XP_006587132.1| PREDICTED: uncharacterized protein LOC100797376 isoform X2 [Glycine max] Length = 478 Score = 261 bits (666), Expect = 3e-67 Identities = 132/226 (58%), Positives = 166/226 (73%) Frame = -2 Query: 679 NSSLVLAAERTRRKDPLENLNYYTGGWNISDKHYFASAAFTATPLFLVAAIWFVAFGXXX 500 N+SL+LA RT RKDPL+N N+YTGGWNIS++HY AS FTA P F+VAA+WFV FG Sbjct: 9 NTSLILAQRRTTRKDPLDNYNHYTGGWNISNRHYIASVVFTAVPFFVVAAVWFVIFGLSL 68 Query: 499 XXXXXXXXXCRRQNYGYSRTAYALSLILLSLFTIATVVGSIVLYTGQGKFHDSTKDTLLY 320 C R+ YGYSR AYALSLI L LFT+A +VG ++LYT QGKFH ST +TL Sbjct: 69 SFICLCYCCCPREPYGYSRLAYALSLIFLVLFTLAAIVGCVLLYTAQGKFHGSTTNTLKS 128 Query: 319 VVRQSDSTVENLRNVSDYLDAAKQIQVDQISLPSDVKSRIDQVHSKIDSASFTLEDETKK 140 +V Q+D T ENLRNVSDYLDAA++I V+ + LP+DV+ ID+V KI+S++ L +TKK Sbjct: 129 LVSQADFTAENLRNVSDYLDAAQKIGVEAVFLPADVQKNIDEVQMKINSSAADLSSKTKK 188 Query: 139 NKNRIEDILEIVRKILIIAAAVMLFLTLLGFLFSILGLQCLVSVLV 2 N I+D+++ +R+ L+I AAVMLFL LGFLFSI GLQ LV LV Sbjct: 189 NSETIKDVIDAMRRDLVILAAVMLFLAFLGFLFSIFGLQGLVYFLV 234 >gb|EOY29411.1| Uncharacterized protein TCM_036954 [Theobroma cacao] Length = 557 Score = 259 bits (662), Expect = 1e-66 Identities = 138/249 (55%), Positives = 175/249 (70%), Gaps = 7/249 (2%) Frame = -2 Query: 727 SWEMKTRSLLELRNGK-------NSSLVLAAERTRRKDPLENLNYYTGGWNISDKHYFAS 569 SWE+ RS++E G+ +SSLVLAAERT RKDPL YTGGWNI ++HY+AS Sbjct: 58 SWEITRRSVVEGPVGEPIPVVEVSSSLVLAAERTYRKDPLNGFKRYTGGWNIRERHYWAS 117 Query: 568 AAFTATPLFLVAAIWFVAFGXXXXXXXXXXXXCRRQNYGYSRTAYALSLILLSLFTIATV 389 AFTA PLF +AAIWFV FG C+R YGYSR AYA+SLI L LF +A + Sbjct: 118 VAFTAVPLFAIAAIWFVGFGLCLLLIFLCYFCCKRPPYGYSRIAYAISLIFLILFAVAAI 177 Query: 388 VGSIVLYTGQGKFHDSTKDTLLYVVRQSDSTVENLRNVSDYLDAAKQIQVDQISLPSDVK 209 +G IVLY GQG+FHDST TL YVV Q+D TV LR+VSD L +AKQI VD++ LPS+V Sbjct: 178 LGCIVLYVGQGRFHDSTTKTLQYVVNQADMTVGKLRDVSDALASAKQIGVDKVFLPSNVL 237 Query: 208 SRIDQVHSKIDSASFTLEDETKKNKNRIEDILEIVRKILIIAAAVMLFLTLLGFLFSILG 29 + ID++ +KI+S++ TL D+T N + I D+L+ VR LI+ AA+ML LT LGFLFS+ G Sbjct: 238 TDIDEIGAKINSSASTLADKTVDNSDDIRDLLDSVRVALIVVAAIMLLLTFLGFLFSVFG 297 Query: 28 LQCLVSVLV 2 +Q LV +LV Sbjct: 298 MQLLVYILV 306 >ref|XP_004488469.1| PREDICTED: uncharacterized protein LOC101501423 [Cicer arietinum] Length = 477 Score = 258 bits (660), Expect = 2e-66 Identities = 134/226 (59%), Positives = 162/226 (71%) Frame = -2 Query: 679 NSSLVLAAERTRRKDPLENLNYYTGGWNISDKHYFASAAFTATPLFLVAAIWFVAFGXXX 500 N+SL+LA +RT RKDPL+N N Y GGWNIS+ HY AS FTA P F+VAA+W V FG Sbjct: 9 NASLILAQKRTSRKDPLDNFNRYIGGWNISNNHYIASVVFTAVPFFIVAAVWLVLFGLCL 68 Query: 499 XXXXXXXXXCRRQNYGYSRTAYALSLILLSLFTIATVVGSIVLYTGQGKFHDSTKDTLLY 320 C R+ YGYSR AYALSL LL LFT+A +VG +VLYT QGKFH ST +TL Y Sbjct: 69 SFICFCYCCCPREPYGYSRVAYALSLFLLILFTLAAIVGCVVLYTAQGKFHGSTSNTLEY 128 Query: 319 VVRQSDSTVENLRNVSDYLDAAKQIQVDQISLPSDVKSRIDQVHSKIDSASFTLEDETKK 140 VV Q+D T ENLRNVSDYLDAAK I VD + LP DV++ ID V +KI+S++ L +TK Sbjct: 129 VVSQADFTAENLRNVSDYLDAAKNIGVDAVFLPGDVQNNIDNVKTKINSSAVELSTKTKD 188 Query: 139 NKNRIEDILEIVRKILIIAAAVMLFLTLLGFLFSILGLQCLVSVLV 2 N +I+D ++ +R L+I AAVMLF+ LGFLFSILGLQ LV LV Sbjct: 189 NSEKIKDGIDGMRLALVILAAVMLFIAFLGFLFSILGLQGLVYFLV 234