BLASTX nr result
ID: Atropa21_contig00032144
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00032144
         (1479 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006351162.1| PREDICTED: uncharacterized protein LOC102591...  610   e-172
ref|XP_004234776.1| PREDICTED: 3-dehydroquinate synthase-like [S...  595   e-167
ref|XP_006603860.1| PREDICTED: uncharacterized protein LOC100806...  485   e-134
ref|XP_003554373.1| PREDICTED: uncharacterized protein LOC100806...  485   e-134
ref|XP_002282990.2| PREDICTED: 3-dehydroquinate synthase-like [V...  479   e-132
emb|CBI22182.3| unnamed protein product [Vitis vinifera]             479   e-132
gb|ESW23206.1| hypothetical protein PHAVU_004G027100g [Phaseolus...  479   e-132
ref|XP_004302345.1| PREDICTED: 3-dehydroquinate synthase-like [F...  478   e-132
gb|EXB94290.1| 3-dehydroquinate synthase [Morus notabilis]           470   e-130
ref|XP_006482557.1| PREDICTED: uncharacterized protein LOC102626...  466   e-128
ref|XP_002517488.1| conserved hypothetical protein [Ricinus comm...  463   e-128
ref|XP_004147467.1| PREDICTED: 3-dehydroquinate synthase-like [C...  462   e-127
gb|EOY03402.1| Prokaryotic-type isoform 3 [Theobroma cacao]          461   e-127
ref|XP_002323844.2| hypothetical protein POPTR_0017s11670g [Popu...  457   e-126
ref|XP_006827144.1| hypothetical protein AMTR_s00010p00251120 [A...  446   e-123
gb|EOY03401.1| Prokaryotic-type, putative isoform 2 [Theobroma c...  445   e-122
gb|EOY03400.1| Prokaryotic-type, putative isoform 1 [Theobroma c...  445   e-122
ref|NP_001030791.1| uncharacterized protein [Arabidopsis thalian...  441   e-121
ref|NP_189518.2| uncharacterized protein [Arabidopsis thaliana] ...
441 e-121 ref|XP_002877130.1| hypothetical protein ARALYDRAFT_322953 [Arab... 439 e-120 >ref|XP_006351162.1| PREDICTED: uncharacterized protein LOC102591464 [Solanum tuberosum] Length = 394 Score = 610 bits (1573), Expect = e-172 Identities = 323/396 (81%), Positives = 339/396 (85%), Gaps = 1/396 (0%) Frame = +1 Query: 64 MAMVLSSLLISYPNNKVPGKWKNCRYVDLSLNFSSTERTFARVARMCAFTCSKSKK-TVW 240 M M+L SL +SYP KV GKW+NCR F T R VA+MCAFT S SKK TVW Sbjct: 1 MDMLLPSLSLSYP--KVAGKWQNCR------KFLGTNR----VAKMCAFTPSNSKKKTVW 48 Query: 241 IWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHEQQTVSAF 420 IWTENKQVMTA+VERGWNTFIFPS+RQDLALEWSSIA+I PLF+EEGR DHE ++V+AF Sbjct: 49 IWTENKQVMTAAVERGWNTFIFPSNRQDLALEWSSIAVIYPLFVEEGRQIDHEHKSVAAF 108 Query: 421 AXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSKTQSEAQV 600 A ADKVVV+LLDWQVIPAENIVA FQGTQ TVL VSKTQSEAQV Sbjct: 109 AEISSPQQLEQFQISEEQADKVVVNLLDWQVIPAENIVADFQGTQTTVLVVSKTQSEAQV 168 Query: 601 FLEALEHGLGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATGMGDRVCV 780 FLEALEHGLGGVVMKVEDVGAILELKGYFD+RR+ DSLLNLTKA I+H+Q TGMGDRVCV Sbjct: 169 FLEALEHGLGGVVMKVEDVGAILELKGYFDRRRDVDSLLNLTKAIISHIQVTGMGDRVCV 228 Query: 781 DICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYL 960 DICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYL Sbjct: 229 DICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYL 288 Query: 961 SELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILVEAKVESENESYSILLQNAETVGFV 1140 SELKSGKEVIV DQRGMQRTAIVGRVKVETR LILVEAKVESENESYSILLQNAETVG V Sbjct: 289 SELKSGKEVIVVDQRGMQRTAIVGRVKVETRPLILVEAKVESENESYSILLQNAETVGLV 348 Query: 1141 STRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 S GEGHQRT IPVTSLKVGDEVLLLLQGGARHTG Sbjct: 349 SPLHGEGHQRTTIPVTSLKVGDEVLLLLQGGARHTG 384 >ref|XP_004234776.1| PREDICTED: 3-dehydroquinate synthase-like [Solanum lycopersicum] Length = 394 Score = 595 bits (1535), Expect = e-167 Identities = 318/396 (80%), Positives = 337/396 (85%), Gaps = 1/396 (0%) Frame = +1 Query: 64 MAMVLSSLLISYPNNKVPGKWKNCRYVDLSLNFSSTERTFARVARMCAFTCSKSKK-TVW 240 
M ++L SL S+P K GK +NCR L +N RVARMCAFT S SKK TVW Sbjct: 1 MDILLPSLSHSFP--KFAGKRQNCRKF-LGIN---------RVARMCAFTPSNSKKKTVW 48 Query: 241 IWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHEQQTVSAF 420 IWTENKQVMTA+VE GWNTFIFPS+RQDLALEWSSIA+I+P+FI+EGRL DHE ++V+AF Sbjct: 49 IWTENKQVMTAAVEGGWNTFIFPSNRQDLALEWSSIAVIHPVFIKEGRLIDHEHKSVAAF 108 Query: 421 AXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSKTQSEAQV 600 A +DKVVV+LLDWQVIPAENIVAAFQGTQ TVLAVSK QSEAQ Sbjct: 109 AEISSPQQLEQFQISEEQSDKVVVNLLDWQVIPAENIVAAFQGTQTTVLAVSKNQSEAQA 168 Query: 601 FLEALEHGLGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATGMGDRVCV 780 FLEALEHGLGGVVMKVEDVGAILELKGYFD+RRE DSLLNLTKA ITH+Q TGMGDRVCV Sbjct: 169 FLEALEHGLGGVVMKVEDVGAILELKGYFDRRREVDSLLNLTKAIITHIQVTGMGDRVCV 228 Query: 781 DICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYL 960 DICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYL Sbjct: 229 DICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYL 288 Query: 961 SELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILVEAKVESENESYSILLQNAETVGFV 1140 SELKSGKEVIV DQRGMQRTAIVGRVKVETR LILVEAKVESENESYSILLQNAETVG V Sbjct: 289 SELKSGKEVIVVDQRGMQRTAIVGRVKVETRPLILVEAKVESENESYSILLQNAETVGLV 348 Query: 1141 STRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 S GEGHQRT IPVTSL+VG EVLLLLQGGARHTG Sbjct: 349 SPLHGEGHQRTTIPVTSLEVGSEVLLLLQGGARHTG 384 >ref|XP_006603860.1| PREDICTED: uncharacterized protein LOC100806285 isoform X2 [Glycine max] Length = 440 Score = 485 bits (1248), Expect = e-134 Identities = 250/388 (64%), Positives = 290/388 (74%), Gaps = 13/388 (3%) Frame = +1 Query: 124 WKNCRYVDLSLNFSSTE---RTFARVARMCAFTCS----------KSKKTVWIWTENKQV 264 W N R +L N +S +T R CS K K VWIWT NKQV Sbjct: 45 WNNIRRTNLCSNVNSLRYSGKTLLRHRHKYYNPCSSMASSLDESGKRSKRVWIWTSNKQV 104 Query: 265 MTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHEQQTVSAFAXXXXXXX 444 MTA+VERGWNTF+FPSH + LA +WSSIA+I PLF+ EG + D + + V+ Sbjct: 105 MTAAVERGWNTFVFPSHHRQLAHDWSSIAVICPLFVNEGEVLDGQNKRVATIFDVSTPEE 164 Query: 445 
XXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHG 624 A+ +VV+LLDWQVIPAENI+AAFQ +Q TV A+S SEAQVFLEALEHG Sbjct: 165 LEELRPENEQAENIVVNLLDWQVIPAENIIAAFQRSQNTVFAISNNTSEAQVFLEALEHG 224 Query: 625 LGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATGMGDRVCVDICSLMRP 804 L G++MKVEDV +LELK YFD+R EE +LL+LTKA +TH+QA GMGDRVCVD+CSLMRP Sbjct: 225 LDGIIMKVEDVEPVLELKEYFDRRMEESNLLSLTKATVTHIQAAGMGDRVCVDLCSLMRP 284 Query: 805 GEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKE 984 GEGLLVGSFARGLFLVHSECLESNYI+SRPFRVNAGPVHAYVAVPGG+T YLSELKSGKE Sbjct: 285 GEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGRTCYLSELKSGKE 344 Query: 985 VIVADQRGMQRTAIVGRVKVETRQLILVEAKVESENESYSILLQNAETVGFVSTRQGEGH 1164 VI+ D +G QR AIVGRVK+E+R LILVEAK+ES+N+S SILLQNAETV V T QG Sbjct: 345 VIIVDHQGRQRIAIVGRVKIESRPLILVEAKIESDNQSISILLQNAETVALVCTPQGNTL 404 Query: 1165 QRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 +T IPVTSLKVGDE+LL +QGGARHTG Sbjct: 405 LKTSIPVTSLKVGDEILLRVQGGARHTG 432 >ref|XP_003554373.1| PREDICTED: uncharacterized protein LOC100806285 isoform X1 [Glycine max] Length = 442 Score = 485 bits (1248), Expect = e-134 Identities = 250/388 (64%), Positives = 290/388 (74%), Gaps = 13/388 (3%) Frame = +1 Query: 124 WKNCRYVDLSLNFSSTE---RTFARVARMCAFTCS----------KSKKTVWIWTENKQV 264 W N R +L N +S +T R CS K K VWIWT NKQV Sbjct: 45 WNNIRRTNLCSNVNSLRYSGKTLLRHRHKYYNPCSSMASSLDESGKRSKRVWIWTSNKQV 104 Query: 265 MTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHEQQTVSAFAXXXXXXX 444 MTA+VERGWNTF+FPSH + LA +WSSIA+I PLF+ EG + D + + V+ Sbjct: 105 MTAAVERGWNTFVFPSHHRQLAHDWSSIAVICPLFVNEGEVLDGQNKRVATIFDVSTPEE 164 Query: 445 XXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHG 624 A+ +VV+LLDWQVIPAENI+AAFQ +Q TV A+S SEAQVFLEALEHG Sbjct: 165 LEELRPENEQAENIVVNLLDWQVIPAENIIAAFQRSQNTVFAISNNTSEAQVFLEALEHG 224 Query: 625 LGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATGMGDRVCVDICSLMRP 804 L G++MKVEDV +LELK YFD+R EE +LL+LTKA +TH+QA GMGDRVCVD+CSLMRP Sbjct: 225 LDGIIMKVEDVEPVLELKEYFDRRMEESNLLSLTKATVTHIQAAGMGDRVCVDLCSLMRP 284 Query: 805 
GEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKE 984 GEGLLVGSFARGLFLVHSECLESNYI+SRPFRVNAGPVHAYVAVPGG+T YLSELKSGKE Sbjct: 285 GEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGRTCYLSELKSGKE 344 Query: 985 VIVADQRGMQRTAIVGRVKVETRQLILVEAKVESENESYSILLQNAETVGFVSTRQGEGH 1164 VI+ D +G QR AIVGRVK+E+R LILVEAK+ES+N+S SILLQNAETV V T QG Sbjct: 345 VIIVDHQGRQRIAIVGRVKIESRPLILVEAKIESDNQSISILLQNAETVALVCTPQGNTL 404 Query: 1165 QRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 +T IPVTSLKVGDE+LL +QGGARHTG Sbjct: 405 LKTSIPVTSLKVGDEILLRVQGGARHTG 432 >ref|XP_002282990.2| PREDICTED: 3-dehydroquinate synthase-like [Vitis vinifera] Length = 368 Score = 479 bits (1234), Expect = e-132 Identities = 242/344 (70%), Positives = 281/344 (81%), Gaps = 1/344 (0%) Frame = +1 Query: 220 KSKKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHE 399 + K VWIWTE+KQVMTA+VERGWNTFIF ++LA EWSSIALI+PLFI+EG+LFD E Sbjct: 15 RQHKVVWIWTESKQVMTAAVERGWNTFIFLPDHRELATEWSSIALIHPLFIKEGKLFDSE 74 Query: 400 QQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSK 579 + V+ AD V+++LLDWQVIPAENIVAAFQG+ TV A+SK Sbjct: 75 GRGVATVYDVTSPQQLQLLQPEDKQADNVIINLLDWQVIPAENIVAAFQGSHITVFAISK 134 Query: 580 TQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATG 759 + SEAQ+FLEALE GLGGVV+KVED A+LELK YFD+R E++++L+LTKA IT + +G Sbjct: 135 SPSEAQIFLEALEQGLGGVVLKVEDATAVLELKDYFDRRNEDNNILSLTKATITQIHISG 194 Query: 760 MGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVP 939 MGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+SRPFRVNAGPVHAYVA+P Sbjct: 195 MGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIP 254 Query: 940 GGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILVEAKVESENES-YSILLQ 1116 GGKT YLSEL +GKEVIV DQ G QRTAIVGRVK+ETR LILVEAK +S+N + YS+LLQ Sbjct: 255 GGKTCYLSELVTGKEVIVVDQNGKQRTAIVGRVKIETRPLILVEAKGDSDNGTLYSVLLQ 314 Query: 1117 NAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 NAETV + QG G+Q+ IPVTSLKVGDEVLL LQGGARHTG Sbjct: 315 NAETVALICPSQGSGYQKKAIPVTSLKVGDEVLLRLQGGARHTG 358 >emb|CBI22182.3| unnamed protein product [Vitis 
vinifera] Length = 998 Score = 479 bits (1234), Expect = e-132 Identities = 242/344 (70%), Positives = 281/344 (81%), Gaps = 1/344 (0%) Frame = +1 Query: 220 KSKKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHE 399 + K VWIWTE+KQVMTA+VERGWNTFIF ++LA EWSSIALI+PLFI+EG+LFD E Sbjct: 645 RQHKVVWIWTESKQVMTAAVERGWNTFIFLPDHRELATEWSSIALIHPLFIKEGKLFDSE 704 Query: 400 QQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSK 579 + V+ AD V+++LLDWQVIPAENIVAAFQG+ TV A+SK Sbjct: 705 GRGVATVYDVTSPQQLQLLQPEDKQADNVIINLLDWQVIPAENIVAAFQGSHITVFAISK 764 Query: 580 TQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATG 759 + SEAQ+FLEALE GLGGVV+KVED A+LELK YFD+R E++++L+LTKA IT + +G Sbjct: 765 SPSEAQIFLEALEQGLGGVVLKVEDATAVLELKDYFDRRNEDNNILSLTKATITQIHISG 824 Query: 760 MGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVP 939 MGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+SRPFRVNAGPVHAYVA+P Sbjct: 825 MGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIP 884 Query: 940 GGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILVEAKVESENES-YSILLQ 1116 GGKT YLSEL +GKEVIV DQ G QRTAIVGRVK+ETR LILVEAK +S+N + YS+LLQ Sbjct: 885 GGKTCYLSELVTGKEVIVVDQNGKQRTAIVGRVKIETRPLILVEAKGDSDNGTLYSVLLQ 944 Query: 1117 NAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 NAETV + QG G+Q+ IPVTSLKVGDEVLL LQGGARHTG Sbjct: 945 NAETVALICPSQGSGYQKKAIPVTSLKVGDEVLLRLQGGARHTG 988 >gb|ESW23206.1| hypothetical protein PHAVU_004G027100g [Phaseolus vulgaris] Length = 439 Score = 479 bits (1233), Expect = e-132 Identities = 238/343 (69%), Positives = 275/343 (80%) Frame = +1 Query: 220 KSKKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHE 399 K K VWIWT NKQVMTA+VERGWNTF+FPSH + LA EWS IA+I PLF+ E + D + Sbjct: 87 KPSKRVWIWTSNKQVMTAAVERGWNTFVFPSHHRQLAREWSEIAVICPLFVNEEEVLDEQ 146 Query: 400 QQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSK 579 + V+ A+ +VV+LLDWQVIPAENI+AAFQ +QKTV A+S Sbjct: 147 NKRVATIFDVSNPEELEGLRPEDEHAESIVVNLLDWQVIPAENIIAAFQRSQKTVFAISN 206 Query: 580 
TQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATG 759 SEAQ+FLEALEHGL G+VMK+EDV +LELK YFD+R EE +LL+LTKA +TH+Q TG Sbjct: 207 NTSEAQLFLEALEHGLDGIVMKIEDVEPVLELKAYFDRRMEESNLLSLTKATVTHIQGTG 266 Query: 760 MGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVP 939 MGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+SRPFRVNAGPVHAYVAVP Sbjct: 267 MGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVP 326 Query: 940 GGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILVEAKVESENESYSILLQN 1119 G +TSYLSELKSGKEVIV DQ+G QR AIVGRVK+E+R LILVEAK+ES+ ++ SILLQN Sbjct: 327 GSRTSYLSELKSGKEVIVVDQKGHQRIAIVGRVKIESRPLILVEAKIESDTQTISILLQN 386 Query: 1120 AETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 AETV V QG +T IPVTSLKVGDE+LL +QGGARHTG Sbjct: 387 AETVALVCPPQGNTVLKTAIPVTSLKVGDEILLRVQGGARHTG 429 >ref|XP_004302345.1| PREDICTED: 3-dehydroquinate synthase-like [Fragaria vesca subsp. vesca] Length = 403 Score = 478 bits (1229), Expect = e-132 Identities = 253/382 (66%), Positives = 296/382 (77%), Gaps = 6/382 (1%) Frame = +1 Query: 121 KWKN-CRYVDL----SLNFSSTERTFARVARMCAFTCSKSKKTVWIWTENKQVMTASVER 285 KW N CR + S+ +T+ + VA + SKKTVW+WTE+KQVMTA+VER Sbjct: 16 KWSNICRLISSHNRHSMEAKATQNS--SVASSSTMSFRSSKKTVWVWTESKQVMTAAVER 73 Query: 286 GWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXX 465 GWNTF+F S Q LA +WSSIALI+PL ++EG +FD E V+ Sbjct: 74 GWNTFVFQS--QKLADDWSSIALIDPLLMKEGGIFDSENTRVATVFEVSSPEELEQLQPE 131 Query: 466 XXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMK 645 + VVVDLLDWQVIPAENIVAAFQG+QKTV AVSKT EAQVF EALEHGLGGVV+K Sbjct: 132 NGVGENVVVDLLDWQVIPAENIVAAFQGSQKTVFAVSKTPVEAQVFFEALEHGLGGVVLK 191 Query: 646 VEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVG 825 VEDV A+L+LK YFD+R E ++L+LTKA +T VQ GMGDRVCVD+CSLMRPGEGLLVG Sbjct: 192 VEDVQAVLDLKDYFDRRDEVGNILSLTKAIVTGVQVAGMGDRVCVDLCSLMRPGEGLLVG 251 Query: 826 SFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQR 1005 SFARGLFLVHSECLESNYI+SRPFRVNAGPVHAYVAVPGGKTSYLSELK+GKEVI+ DQ Sbjct: 252 
SFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELKAGKEVILVDQE 311 Query: 1006 GMQRTAIVGRVKVETRQLILVEAKVESENES-YSILLQNAETVGFVSTRQGEGHQRTVIP 1182 G QRTAIVGR K+ETR LILVEAK+ S++++ YSIL+QNAETV V ++ G ++T IP Sbjct: 312 GHQRTAIVGRAKIETRPLILVEAKMCSDDQTIYSILVQNAETVALVCPKKESGGRKTAIP 371 Query: 1183 VTSLKVGDEVLLLLQGGARHTG 1248 VTSLKVGDE++L LQGGARHTG Sbjct: 372 VTSLKVGDEIMLRLQGGARHTG 393 >gb|EXB94290.1| 3-dehydroquinate synthase [Morus notabilis] Length = 424 Score = 470 bits (1210), Expect = e-130 Identities = 244/362 (67%), Positives = 284/362 (78%), Gaps = 4/362 (1%) Frame = +1 Query: 175 RTFARVARMCAFTCSKSK---KTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSS 345 RT V M + T S S K VWIWTENKQVMTA+VERGWNTFIF + L+ +WSS Sbjct: 53 RTRPVVVTMSSCTRSYSSGPSKRVWIWTENKQVMTAAVERGWNTFIFSPESRKLSDDWSS 112 Query: 346 IALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAE 525 IA+I+PL++EEG +FD E + + + + VVVDLLDWQVIPAE Sbjct: 113 IAVISPLYLEEGGIFDGENKRIGSIFGISNNQELELLQPEKGLGENVVVDLLDWQVIPAE 172 Query: 526 NIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREE 705 NIVAAFQG+ +TV A+SK SEAQ+FLEALE GLGGVV+KVED AILELK YFD+R + Sbjct: 173 NIVAAFQGSDRTVFAISKNSSEAQIFLEALEQGLGGVVLKVEDAKAILELKEYFDRRNDM 232 Query: 706 DSLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYIS 885 ++L+LTKA IT VQ GMGDRVCVD+CS+MRPGEGLLVGSFARGLFLVHSECLE NYI+ Sbjct: 233 SNILSLTKATITRVQVAGMGDRVCVDLCSIMRPGEGLLVGSFARGLFLVHSECLEWNYIA 292 Query: 886 SRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLIL 1065 SRPFRVNAGPVHAYVA+PGGKT YLSELK GKEVIV +Q+G QR AIVGRVK+ETR LIL Sbjct: 293 SRPFRVNAGPVHAYVAIPGGKTCYLSELKVGKEVIVVNQKGQQRNAIVGRVKIETRPLIL 352 Query: 1066 VEAKVESENES-YSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARH 1242 VEAK++S++++ YSILLQNAETV VS QG+G Q IPVTSLKVGDEV+L +QGGARH Sbjct: 353 VEAKLDSDSQTLYSILLQNAETVALVSPFQGDGLQNAAIPVTSLKVGDEVVLRVQGGARH 412 Query: 1243 TG 1248 TG Sbjct: 413 TG 414 >ref|XP_006482557.1| PREDICTED: uncharacterized protein LOC102626217 isoform X1 [Citrus sinensis] Length = 401 Score = 466 bits (1198), 
Expect = e-128 Identities = 249/404 (61%), Positives = 296/404 (73%), Gaps = 9/404 (2%) Frame = +1 Query: 64 MAMVLSSLLISYPNNKVP------GKWKNCRYVDLSLNFSSTERTFARVARMCAFTCSKS 225 MA++LSS +S + ++P KW R S F+ MC+ + S S Sbjct: 1 MALLLSSSFVS--STQLPFSTFNTDKWNTGRVNKNSYCFT-----------MCSVSNSSS 47 Query: 226 KKT--VWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHE 399 K VWIWTE+KQVMTA+VERGWNTF+F S Q LA++WS+IAL++PLFI+EG ++D Sbjct: 48 SKPKRVWIWTESKQVMTAAVERGWNTFVFLSENQQLAIDWSTIALLDPLFIKEGEVYDSG 107 Query: 400 QQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSK 579 + V + A+ +V+DL DWQVIPAENIVA+FQG+ KTV A+SK Sbjct: 108 DRRVGSIIEVSTPQELQQLQPADGQAENIVIDLPDWQVIPAENIVASFQGSGKTVFAISK 167 Query: 580 TQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATG 759 T SEAQ+FLEALE GLGG+V+KVEDV A+L LK YFD R E +LL+L KA +T V G Sbjct: 168 TPSEAQIFLEALEQGLGGIVLKVEDVKAVLALKEYFDGRNEVSNLLSLMKATVTRVDVAG 227 Query: 760 MGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVP 939 MGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+SRPFRVNAGPVHAYV VP Sbjct: 228 MGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVLVP 287 Query: 940 GGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILVEAKVESENES-YSILLQ 1116 GGKT YLSELKSGKEVIV DQ+G QRTA+VGRVK+E+R LILVEAK S +++ Y I+LQ Sbjct: 288 GGKTCYLSELKSGKEVIVVDQKGRQRTAVVGRVKIESRPLILVEAKTNSGDQTLYGIILQ 347 Query: 1117 NAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 NAETV VS +G G Q IPVTSLKVGDEVLL +QG ARHTG Sbjct: 348 NAETVALVSPCKGTGEQEKAIPVTSLKVGDEVLLRVQGAARHTG 391 >ref|XP_002517488.1| conserved hypothetical protein [Ricinus communis] gi|223543499|gb|EEF45030.1| conserved hypothetical protein [Ricinus communis] Length = 419 Score = 463 bits (1191), Expect = e-128 Identities = 239/378 (63%), Positives = 289/378 (76%), Gaps = 3/378 (0%) Frame = +1 Query: 124 WKNCRYVDLSLNFSS--TERTFARVARMCAFTCSKSKKTVWIWTENKQVMTASVERGWNT 297 W +C L N +S + +R+ + K KK VWIWTENKQVMTA+VERGWNT Sbjct: 33 WNSCNSRKLKTNHNSFVAMSSLNNASRISSGDYDKLKK-VWIWTENKQVMTAAVERGWNT 91 Query: 298 
FIFPSHRQDLALEWSSIALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXA 477 FIF ++LA EWSS A+I PLF++E + D E + V+A A Sbjct: 92 FIFCYKCRELADEWSSTAMIYPLFVKEDEILDGENKRVAATFDISTPQELEQFQLENAQA 151 Query: 478 DKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDV 657 + +VV+LLDWQ+IPAENIVAAFQG+QKTV AVSKT SEA+VFLEALEHGLGG++++VEDV Sbjct: 152 ENIVVNLLDWQIIPAENIVAAFQGSQKTVFAVSKTPSEAKVFLEALEHGLGGIILRVEDV 211 Query: 658 GAILELKGYFDKRREEDSLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFAR 837 A+ ELK YFD+R E ++L LTKA ++ +QA GMGDRVCVD+CSLMRPGEGLLVGSFAR Sbjct: 212 EAVFELKNYFDRRNEASNVLILTKATVSKIQAAGMGDRVCVDLCSLMRPGEGLLVGSFAR 271 Query: 838 GLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQR 1017 GLFLVHSECLESNYI+SRPFRVNAGPV+AY++VPGGKT YLSEL++GKEVIV DQ+G R Sbjct: 272 GLFLVHSECLESNYIASRPFRVNAGPVNAYISVPGGKTCYLSELRAGKEVIVVDQKGQLR 331 Query: 1018 TAIVGRVKVETRQLILVEAKVESENES-YSILLQNAETVGFVSTRQGEGHQRTVIPVTSL 1194 TAIVGRVK+E+R L+L+EAK++S+ ++ YSI LQNAETV V QG G Q IPVT+L Sbjct: 332 TAIVGRVKIESRPLVLLEAKIDSDYQTVYSIFLQNAETVALVPPCQGNGTQNVAIPVTAL 391 Query: 1195 KVGDEVLLLLQGGARHTG 1248 KVGDEVLL LQG ARHTG Sbjct: 392 KVGDEVLLRLQGAARHTG 409 >ref|XP_004147467.1| PREDICTED: 3-dehydroquinate synthase-like [Cucumis sativus] gi|449520920|ref|XP_004167480.1| PREDICTED: 3-dehydroquinate synthase-like [Cucumis sativus] Length = 423 Score = 462 bits (1189), Expect = e-127 Identities = 238/357 (66%), Positives = 279/357 (78%), Gaps = 8/357 (2%) Frame = +1 Query: 202 CAFTCSKS-------KKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSIALIN 360 C++T S S K VWIW+E +QVMTA+VERGW+TFIF H +LA EWSSIALI+ Sbjct: 58 CSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIH 117 Query: 361 PLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAENIVAA 540 PLFI+E + D E + +++ AD VVVDL DWQ+IPAENIVAA Sbjct: 118 PLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAA 177 Query: 541 FQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREEDSLLN 720 FQG+QKTV A+SKT EAQ+FLEALEHGLGGV++KVED A+ +LK YFD+R E +LLN Sbjct: 178 
FQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLN 237 Query: 721 LTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFR 900 LTKA IT + GMGDRVCVD+CSLMRPGEGLLVGS+ARGLFL+HSECLESNYI+SRPFR Sbjct: 238 LTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFR 297 Query: 901 VNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILVEAKV 1080 VNAGPVHAYVAVPGGKTSYLSEL++G EVIV DQ G QRTAIVGRVK+ETRQLILV+AK Sbjct: 298 VNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKR 357 Query: 1081 ESENES-YSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 +S+ ++ YS+LLQNAETV V QG +++ IPVTSLKVGDEV L LQG ARHTG Sbjct: 358 DSDEQTPYSVLLQNAETVALVCPGQG-NNEKKAIPVTSLKVGDEVFLRLQGEARHTG 413 >gb|EOY03402.1| Prokaryotic-type isoform 3 [Theobroma cacao] Length = 419 Score = 461 bits (1186), Expect = e-127 Identities = 241/361 (66%), Positives = 274/361 (75%), Gaps = 10/361 (2%) Frame = +1 Query: 196 RMCAFTCSKS---------KKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSI 348 RMC+ S S K VWIWTEN QVMTA+VERGWNTFIF S Q L EWSSI Sbjct: 49 RMCSVAASDSPVSTALYEQSKRVWIWTENSQVMTAAVERGWNTFIFSSQNQGLVNEWSSI 108 Query: 349 ALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAEN 528 A I+PL I+EG +FD + V+ VV+DLLDWQVIPAEN Sbjct: 109 AFIDPLIIKEGGIFDSAGKRVATIFEVSTPADLKKVQSEDEHTGNVVIDLLDWQVIPAEN 168 Query: 529 IVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREED 708 IVA QG+Q T AVSK+ +EAQ+FLEALEHGLGGVV+K EDV A+L+LK YFD+R E Sbjct: 169 IVAELQGSQTTAFAVSKSPAEAQLFLEALEHGLGGVVLKAEDVKAVLDLKEYFDRRNEVH 228 Query: 709 SLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISS 888 + L+L+KA +T V A GMGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+S Sbjct: 229 NRLSLSKATVTQVHAVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIAS 288 Query: 889 RPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILV 1068 RPFRVNAGPVH YVAVPGGKTSYLSELK+GKEVIV DQ+G +TAIVGRVK+ETR LILV Sbjct: 289 RPFRVNAGPVHTYVAVPGGKTSYLSELKAGKEVIVVDQKGKLKTAIVGRVKIETRPLILV 348 Query: 1069 EAKVESENES-YSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHT 1245 EAK ++ 
+++ YSILLQNAETV V T +G Q+T IPVTSLKVGDEVLL LQG ARHT Sbjct: 349 EAKRDANDQTVYSILLQNAETVALVCTHKGNTMQKTAIPVTSLKVGDEVLLRLQGAARHT 408 Query: 1246 G 1248 G Sbjct: 409 G 409 >ref|XP_002323844.2| hypothetical protein POPTR_0017s11670g [Populus trichocarpa] gi|550320061|gb|EEF03977.2| hypothetical protein POPTR_0017s11670g [Populus trichocarpa] Length = 411 Score = 457 bits (1176), Expect = e-126 Identities = 237/364 (65%), Positives = 276/364 (75%), Gaps = 15/364 (4%) Frame = +1 Query: 202 CAFTCSKS---------------KKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALE 336 C TCS S K VWIWTE+KQVMTA+VERGWNTFIF S+ + LA++ Sbjct: 45 CVTTCSSSTSVFTMSSSGGSYEKSKRVWIWTESKQVMTAAVERGWNTFIFLSNHRQLAID 104 Query: 337 WSSIALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVI 516 WSS + INPLFIEEG + D E + V+ A+ V+++LLDWQ+I Sbjct: 105 WSSFSFINPLFIEEGEVLDGENKRVATIFEVSTPQELQQLQPENGQAENVIINLLDWQII 164 Query: 517 PAENIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKR 696 PAENIVAAFQG+QKTVLA+SKT SEAQ+FLEALEHGLGGVV+KVEDV A+++LK Y D+R Sbjct: 165 PAENIVAAFQGSQKTVLAISKTHSEAQIFLEALEHGLGGVVLKVEDVEAVIKLKEYCDRR 224 Query: 697 REEDSLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESN 876 E +LL+LTKA IT VQ GMGDRVCVD+CSLM+PGEGLLVGSFARGLFLVHSECLESN Sbjct: 225 NEATNLLSLTKATITRVQVAGMGDRVCVDLCSLMKPGEGLLVGSFARGLFLVHSECLESN 284 Query: 877 YISSRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQ 1056 YI+SRPFRVNAGPVHAYV++PGG+T YLSELK+G+EV VADQ G RTAIVGRVK+ETR Sbjct: 285 YIASRPFRVNAGPVHAYVSIPGGRTCYLSELKAGEEVSVADQNGQLRTAIVGRVKIETRP 344 Query: 1057 LILVEAKVESENESYSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGA 1236 LILVEAK + + YSI LQNAETV + + IPVTSLKVGDEVLL +QGGA Sbjct: 345 LILVEAKSDDQT-VYSIFLQNAETVALIPPCE------AAIPVTSLKVGDEVLLRIQGGA 397 Query: 1237 RHTG 1248 RHTG Sbjct: 398 RHTG 401 >ref|XP_006827144.1| hypothetical protein AMTR_s00010p00251120 [Amborella trichopoda] gi|548831573|gb|ERM94381.1| hypothetical protein AMTR_s00010p00251120 [Amborella trichopoda] Length = 414 Score = 446 bits (1148), Expect = e-123 Identities = 
226/344 (65%), Positives = 265/344 (77%), Gaps = 4/344 (1%) Frame = +1 Query: 229 KTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSIALINPLFIEEGRLFDHEQQT 408 K VW+WTE K VMTA+VERGWNTF+F SH + LA EWSSIA+I PLFI+EG +FD E + Sbjct: 61 KAVWVWTEKKDVMTAAVERGWNTFVFSSHSRKLADEWSSIAMIKPLFIQEGEIFDSENKR 120 Query: 409 VSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAENIVAAFQGTQKTVLAVSKTQS 588 ++ + A+ VV+ L+DWQVIPAENIVA FQG+Q VLA+ KT S Sbjct: 121 IAIVSEISCPEQLEQLQLLDGQAENVVISLMDWQVIPAENIVAVFQGSQTKVLAIGKTPS 180 Query: 589 EAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREEDSLLNLTKAKITHVQATGMGD 768 EAQ+FLEALE GL GVV+K+ED IL+LK YFD+R E ++L+L KA ++ VQ GMGD Sbjct: 181 EAQLFLEALEQGLSGVVLKIEDSEVILKLKEYFDRRNEVKNVLSLVKATVSQVQVAGMGD 240 Query: 769 RVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAVPGGK 948 RVCVD+C+LMRPGEGLLVGS+ARGL LVHSECL S+YISSRPFRVNAGPVHAYVAVPGGK Sbjct: 241 RVCVDLCTLMRPGEGLLVGSYARGLLLVHSECLASSYISSRPFRVNAGPVHAYVAVPGGK 300 Query: 949 TSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILVEAKVE----SENESYSILLQ 1116 T YLSEL+SGKEVIV D G QRTA+VGRVK+ETR LILVEAK++ + YSILLQ Sbjct: 301 TCYLSELQSGKEVIVVDLNGRQRTAVVGRVKIETRPLILVEAKLQIDDSDDKTKYSILLQ 360 Query: 1117 NAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLLQGGARHTG 1248 NAETVG V Q H + IPVT+LKVGDEVLL +QGGARHTG Sbjct: 361 NAETVGLVCPFQVGKHNMSAIPVTTLKVGDEVLLRVQGGARHTG 404 >gb|EOY03401.1| Prokaryotic-type, putative isoform 2 [Theobroma cacao] Length = 415 Score = 445 bits (1145), Expect = e-122 Identities = 236/362 (65%), Positives = 266/362 (73%), Gaps = 17/362 (4%) Frame = +1 Query: 196 RMCAFTCSKS---------KKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSI 348 RMC+ S S K VWIWTEN QVMTA+VERGWNTFIF S Q L EWSSI Sbjct: 49 RMCSVAASDSPVSTALYEQSKRVWIWTENSQVMTAAVERGWNTFIFSSQNQGLVNEWSSI 108 Query: 349 ALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAEN 528 A I+PL I+EG +FD + V+ VV+DLLDWQVIPAEN Sbjct: 109 AFIDPLIIKEGGIFDSAGKRVATIFEVSTPADLKKVQSEDEHTGNVVIDLLDWQVIPAEN 168 Query: 529 IVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREED 708 IVA QG+Q T AVSK+ +EAQ+FLEALEHGLGGVV+K EDV A+L+LK YFD+R E 
Sbjct: 169 IVAELQGSQTTAFAVSKSPAEAQLFLEALEHGLGGVVLKAEDVKAVLDLKEYFDRRNEVH 228 Query: 709 SLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISS 888 + L+L+KA +T V A GMGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+S Sbjct: 229 NRLSLSKATVTQVHAVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIAS 288 Query: 889 RPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILV 1068 RPFRVNAGPVH YVAVPGGKTSYLSELK+GKEVIV DQ+G +TAIVGRVK+ETR LILV Sbjct: 289 RPFRVNAGPVHTYVAVPGGKTSYLSELKAGKEVIVVDQKGKLKTAIVGRVKIETRPLILV 348 Query: 1069 EAKV--------ESENESYSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLL 1224 EAK ++ YSILLQNAETV V T +G Q+T IPVTSLKVGDEVLL L Sbjct: 349 EAKYWTLLPQRDANDQTVYSILLQNAETVALVCTHKGNTMQKTAIPVTSLKVGDEVLLRL 408 Query: 1225 QG 1230 QG Sbjct: 409 QG 410 >gb|EOY03400.1| Prokaryotic-type, putative isoform 1 [Theobroma cacao] Length = 423 Score = 445 bits (1145), Expect = e-122 Identities = 236/362 (65%), Positives = 266/362 (73%), Gaps = 17/362 (4%) Frame = +1 Query: 196 RMCAFTCSKS---------KKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLALEWSSI 348 RMC+ S S K VWIWTEN QVMTA+VERGWNTFIF S Q L EWSSI Sbjct: 49 RMCSVAASDSPVSTALYEQSKRVWIWTENSQVMTAAVERGWNTFIFSSQNQGLVNEWSSI 108 Query: 349 ALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQVIPAEN 528 A I+PL I+EG +FD + V+ VV+DLLDWQVIPAEN Sbjct: 109 AFIDPLIIKEGGIFDSAGKRVATIFEVSTPADLKKVQSEDEHTGNVVIDLLDWQVIPAEN 168 Query: 529 IVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDKRREED 708 IVA QG+Q T AVSK+ +EAQ+FLEALEHGLGGVV+K EDV A+L+LK YFD+R E Sbjct: 169 IVAELQGSQTTAFAVSKSPAEAQLFLEALEHGLGGVVLKAEDVKAVLDLKEYFDRRNEVH 228 Query: 709 SLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISS 888 + L+L+KA +T V A GMGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+S Sbjct: 229 NRLSLSKATVTQVHAVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIAS 288 Query: 889 RPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVETRQLILV 1068 RPFRVNAGPVH YVAVPGGKTSYLSELK+GKEVIV DQ+G +TAIVGRVK+ETR LILV Sbjct: 289 RPFRVNAGPVHTYVAVPGGKTSYLSELKAGKEVIVVDQKGKLKTAIVGRVKIETRPLILV 348 Query: 1069 
EAKV--------ESENESYSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLL 1224 EAK ++ YSILLQNAETV V T +G Q+T IPVTSLKVGDEVLL L Sbjct: 349 EAKYWTLLPQRDANDQTVYSILLQNAETVALVCTHKGNTMQKTAIPVTSLKVGDEVLLRL 408 Query: 1225 QG 1230 QG Sbjct: 409 QG 410 >ref|NP_001030791.1| uncharacterized protein [Arabidopsis thaliana] gi|222424331|dbj|BAH20122.1| AT3G28760 [Arabidopsis thaliana] gi|332643967|gb|AEE77488.1| uncharacterized protein AT3G28760 [Arabidopsis thaliana] Length = 444 Score = 441 bits (1135), Expect = e-121 Identities = 231/368 (62%), Positives = 279/368 (75%), Gaps = 8/368 (2%) Frame = +1 Query: 169 TERTFAR--VARMCAFTC----SKSKKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLA 330 ++RTF++ V +M A T K+KK VWIWT K+VMT +VERGWNTFIF S + L+ Sbjct: 68 SKRTFSQRIVVKMSASTLPMNLGKAKK-VWIWTMCKEVMTVAVERGWNTFIFSSDNRKLS 126 Query: 331 LEWSSIALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQ 510 EWSSIAL++ LFIEE ++ D V++ + +V+D LDW+ Sbjct: 127 NEWSSIALMDTLFIEEKKVIDGTGNVVASVFEVSTPEELRSLNIENEQIENIVLDFLDWK 186 Query: 511 VIPAENIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFD 690 IPAEN+VAA QG++KTV AVS T SEA++FLEALEHGLGG+++K EDV A+L+LK YFD Sbjct: 187 SIPAENLVAALQGSEKTVFAVSNTPSEAKLFLEALEHGLGGIILKSEDVKAVLDLKEYFD 246 Query: 691 KRREEDSLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLE 870 KR EE L+LT+A IT VQ GMGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLE Sbjct: 247 KRNEESDTLSLTEATITRVQMVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLE 306 Query: 871 SNYISSRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVET 1050 SNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL++G+EVIV DQ+G QRTA+VGRVK+E Sbjct: 307 SNYIESRPFRVNAGPVHAYVAVPGGKTCYLSELRTGREVIVVDQKGKQRTAVVGRVKIEK 366 Query: 1051 RQLILVEAKVESENES--YSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLL 1224 R LI+VEAK+ ++ E YSI+LQNAETV V+ Q RT +PVTSLK GD+VL+ L Sbjct: 367 RPLIVVEAKLSTKEEETVYSIILQNAETVALVTPHQVNSSGRTAVPVTSLKPGDQVLIRL 426 Query: 1225 QGGARHTG 1248 QGGARHTG Sbjct: 427 QGGARHTG 434 >ref|NP_189518.2| uncharacterized protein [Arabidopsis thaliana] gi|27754381|gb|AAO22639.1| unknown protein [Arabidopsis 
thaliana] gi|28973463|gb|AAO64056.1| unknown protein [Arabidopsis thaliana] gi|332643966|gb|AEE77487.1| uncharacterized protein AT3G28760 [Arabidopsis thaliana] Length = 422 Score = 441 bits (1135), Expect = e-121 Identities = 231/368 (62%), Positives = 279/368 (75%), Gaps = 8/368 (2%) Frame = +1 Query: 169 TERTFAR--VARMCAFTC----SKSKKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLA 330 ++RTF++ V +M A T K+KK VWIWT K+VMT +VERGWNTFIF S + L+ Sbjct: 46 SKRTFSQRIVVKMSASTLPMNLGKAKK-VWIWTMCKEVMTVAVERGWNTFIFSSDNRKLS 104 Query: 331 LEWSSIALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQ 510 EWSSIAL++ LFIEE ++ D V++ + +V+D LDW+ Sbjct: 105 NEWSSIALMDTLFIEEKKVIDGTGNVVASVFEVSTPEELRSLNIENEQIENIVLDFLDWK 164 Query: 511 VIPAENIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFD 690 IPAEN+VAA QG++KTV AVS T SEA++FLEALEHGLGG+++K EDV A+L+LK YFD Sbjct: 165 SIPAENLVAALQGSEKTVFAVSNTPSEAKLFLEALEHGLGGIILKSEDVKAVLDLKEYFD 224 Query: 691 KRREEDSLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLE 870 KR EE L+LT+A IT VQ GMGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLE Sbjct: 225 KRNEESDTLSLTEATITRVQMVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLE 284 Query: 871 SNYISSRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVET 1050 SNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL++G+EVIV DQ+G QRTA+VGRVK+E Sbjct: 285 SNYIESRPFRVNAGPVHAYVAVPGGKTCYLSELRTGREVIVVDQKGKQRTAVVGRVKIEK 344 Query: 1051 RQLILVEAKVESENES--YSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLL 1224 R LI+VEAK+ ++ E YSI+LQNAETV V+ Q RT +PVTSLK GD+VL+ L Sbjct: 345 RPLIVVEAKLSTKEEETVYSIILQNAETVALVTPHQVNSSGRTAVPVTSLKPGDQVLIRL 404 Query: 1225 QGGARHTG 1248 QGGARHTG Sbjct: 405 QGGARHTG 412 >ref|XP_002877130.1| hypothetical protein ARALYDRAFT_322953 [Arabidopsis lyrata subsp. lyrata] gi|297322968|gb|EFH53389.1| hypothetical protein ARALYDRAFT_322953 [Arabidopsis lyrata subsp. 
lyrata] Length = 426 Score = 439 bits (1130), Expect = e-120 Identities = 228/368 (61%), Positives = 279/368 (75%), Gaps = 8/368 (2%) Frame = +1 Query: 169 TERTFAR--VARMCAFTC----SKSKKTVWIWTENKQVMTASVERGWNTFIFPSHRQDLA 330 ++RTF++ +M A T K+KK VWIWTE K+ MT +VERGWNTFIF S ++L+ Sbjct: 50 SKRTFSQKLAVKMSASTLPMNLGKAKK-VWIWTECKEAMTVAVERGWNTFIFSSDNRELS 108 Query: 331 LEWSSIALINPLFIEEGRLFDHEQQTVSAFAXXXXXXXXXXXXXXXXXADKVVVDLLDWQ 510 EWSSIAL++ LFIEE ++ D V++ A+ +V+D LDW+ Sbjct: 109 NEWSSIALMDTLFIEEDQVVDSMGNVVASVFEVSTPEELRNLKIENDQAENIVLDFLDWK 168 Query: 511 VIPAENIVAAFQGTQKTVLAVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFD 690 IPAEN+VAA QG++KTVLA+S T SEA++FLEALEHGL G+++K EDV A+L+LK YFD Sbjct: 169 SIPAENLVAALQGSEKTVLAISNTPSEAKLFLEALEHGLSGIILKSEDVKAVLDLKEYFD 228 Query: 691 KRREEDSLLNLTKAKITHVQATGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLE 870 KR EE L+LT+A IT VQ GMGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLE Sbjct: 229 KRNEESDTLSLTEATITRVQMVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLE 288 Query: 871 SNYISSRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVADQRGMQRTAIVGRVKVET 1050 SNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL++G+EVIV DQ+G QRTA+VGRVK+E Sbjct: 289 SNYIESRPFRVNAGPVHAYVAVPGGKTCYLSELRTGREVIVVDQKGKQRTAVVGRVKIEK 348 Query: 1051 RQLILVEAKVESENES--YSILLQNAETVGFVSTRQGEGHQRTVIPVTSLKVGDEVLLLL 1224 R LILVE K+ ++ E +SI+LQNAETV V+ Q +T +PVTSLK GD+VL+ L Sbjct: 349 RPLILVEVKLSAKEEETVFSIILQNAETVALVTPHQVNSSGKTAVPVTSLKPGDQVLIRL 408 Query: 1225 QGGARHTG 1248 QGGARHTG Sbjct: 409 QGGARHTG 416