BLASTX nr result
ID: Akebia26_contig00022751
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia26_contig00022751 (1371 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI22182.3| unnamed protein product [Vitis vinifera] 548 e-153 ref|XP_006482557.1| PREDICTED: uncharacterized protein LOC102626... 547 e-153 ref|XP_002282990.2| PREDICTED: 3-dehydroquinate synthase-like [V... 545 e-152 ref|XP_006827144.1| hypothetical protein AMTR_s00010p00251120 [A... 540 e-151 ref|XP_004302345.1| PREDICTED: 3-dehydroquinate synthase-like [F... 538 e-150 ref|XP_002323844.2| hypothetical protein POPTR_0017s11670g [Popu... 533 e-149 ref|XP_007032476.1| Prokaryotic-type isoform 3 [Theobroma cacao]... 531 e-148 ref|XP_004147467.1| PREDICTED: 3-dehydroquinate synthase-like [C... 530 e-148 ref|XP_002517488.1| conserved hypothetical protein [Ricinus comm... 523 e-146 ref|XP_003554373.1| PREDICTED: uncharacterized protein LOC100806... 520 e-145 ref|XP_006603860.1| PREDICTED: uncharacterized protein LOC100806... 517 e-144 gb|EYU34096.1| hypothetical protein MIMGU_mgv1a007488mg [Mimulus... 514 e-143 ref|XP_007151212.1| hypothetical protein PHAVU_004G027100g [Phas... 513 e-142 ref|XP_006351162.1| PREDICTED: uncharacterized protein LOC102591... 511 e-142 gb|EXB94290.1| 3-dehydroquinate synthase [Morus notabilis] 508 e-141 ref|XP_004234776.1| PREDICTED: 3-dehydroquinate synthase-like [S... 505 e-140 ref|XP_007032475.1| Prokaryotic-type, putative isoform 2 [Theobr... 503 e-140 ref|XP_007032474.1| Prokaryotic-type, putative isoform 1 [Theobr... 503 e-140 ref|NP_001030791.1| uncharacterized protein [Arabidopsis thalian... 499 e-138 ref|NP_189518.2| uncharacterized protein [Arabidopsis thaliana] ... 499 e-138 >emb|CBI22182.3| unnamed protein product [Vitis vinifera] Length = 998 Score = 548 bits (1411), Expect = e-153 Identities = 280/377 (74%), Positives = 319/377 (84%), Gaps = 2/377 (0%) Frame = +2 Query: 131 EVNSNATMCTYVPT--TFENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANE 304 + +S TMC+ + T Y Q K VWIWTESKQVMTAAVERGW+TFIF ++R LA E Sbjct: 624 QFSSRVTMCSSHSSSVTSAGYRQHKVVWIWTESKQVMTAAVERGWNTFIFLPDHRELATE 683 Query: 305 WSSIALLYPLFVENGELLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVI 484 WSSIAL++PLF++ G+L DSE + VAT+ ++ SP+QLQ LQP D+QA NV+I+LLDWQVI Sbjct: 684 WSSIALIHPLFIKEGKLFDSEGRGVATVYDVTSPQQLQLLQPEDKQADNVIINLLDWQVI 743 Query: 485 PAENIVAAFQGSQKTVFAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGR 664 PAENIVAAFQGS TVFAISK+P EAQIFLEALEQGLGGVVLKVED AVLELKDY D R Sbjct: 744 PAENIVAAFQGSHITVFAISKSPSEAQIFLEALEQGLGGVVLKVEDATAVLELKDYFDRR 803 Query: 665 NEARHLLNLVKVTITQVQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESN 844 NE ++L+L K TITQ+ + GMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESN Sbjct: 804 NEDNNILSLTKATITQIHISGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESN 863 Query: 845 YIASRPFRVNAGPVHAYVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRP 1024 YIASRPFRVNAGPVHAYVA+PGGKT YLSEL GKEVIVVDQ+G+QRTAIVGRVKIE RP Sbjct: 864 YIASRPFRVNAGPVHAYVAIPGGKTCYLSELVTGKEVIVVDQNGKQRTAIVGRVKIETRP 923 Query: 1025 LILVEAKASNGSDNQIVYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQ 1204 LILVEAK SDN +Y+++LQNAETV L+CP + + K AIPVTSLKVGDEV+LR+Q Sbjct: 924 LILVEAKGD--SDNGTLYSVLLQNAETVALICPSQGSGYQKKAIPVTSLKVGDEVLLRLQ 981 Query: 1205 GEARHTGIEIQEFILEK 1255 G ARHTGIEIQEFI+EK Sbjct: 982 GGARHTGIEIQEFIVEK 998 >ref|XP_006482557.1| PREDICTED: uncharacterized protein LOC102626217 isoform X1 [Citrus sinensis] Length = 401 Score = 547 bits (1409), Expect = e-153 Identities = 289/415 (69%), Positives = 334/415 (80%) Frame = +2 Query: 8 MMALLNSSLSLRISKPMISFTPQTGNRCRWIHSATLMNVGYEVNSNATMCTYVPTTFENY 187 M LL+SS +S + F+ T N +W N G VN N+ T + + Sbjct: 1 MALLLSSSF---VSSTQLPFS--TFNTDKW-------NTG-RVNKNSYCFTMCSVSNSSS 47 Query: 188 EQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELLDSE 367 +PK+VWIWTESKQVMTAAVERGW+TF+F SEN+ LA +WS+IALL PLF++ GE+ DS Sbjct: 48 SKPKRVWIWTESKQVMTAAVERGWNTFVFLSENQQLAIDWSTIALLDPLFIKEGEVYDSG 107 Query: 368 NKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFAISK 547 ++RV +I E+ +P++LQQLQP D QA+N+VI L DWQVIPAENIVA+FQGS KTVFAISK Sbjct: 108 DRRVGSIIEVSTPQELQQLQPADGQAENIVIDLPDWQVIPAENIVASFQGSGKTVFAISK 167 Query: 548 TPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQVVG 727 TP EAQIFLEALEQGLGG+VLKVEDV AVL LK+Y DGRNE +LL+L+K T+T+V V G Sbjct: 168 TPSEAQIFLEALEQGLGGIVLKVEDVKAVLALKEYFDGRNEVSNLLSLMKATVTRVDVAG 227 Query: 728 MGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVP 907 MGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV VP Sbjct: 228 MGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVLVP 287 Query: 908 GGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVYNII 1087 GGKT YLSEL +GKEVIVVDQ GRQRTA+VGRVKIE RPLILVEAK ++G +Q +Y II Sbjct: 288 GGKTCYLSELKSGKEVIVVDQKGRQRTAVVGRVKIESRPLILVEAKTNSG--DQTLYGII 345 Query: 1088 LQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILE 1252 LQNAETV LV P + + AIPVTSLKVGDEV+LRVQG ARHTGIEIQEFI+E Sbjct: 346 LQNAETVALVSPCKGTGEQEKAIPVTSLKVGDEVLLRVQGAARHTGIEIQEFIVE 400 >ref|XP_002282990.2| PREDICTED: 3-dehydroquinate synthase-like [Vitis vinifera] Length = 368 Score = 545 bits (1403), Expect = e-152 Identities = 276/361 (76%), Positives = 311/361 (86%) Frame = +2 Query: 173 TFENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGE 352 T Y Q K VWIWTESKQVMTAAVERGW+TFIF ++R LA EWSSIAL++PLF++ G+ Sbjct: 10 TSAGYRQHKVVWIWTESKQVMTAAVERGWNTFIFLPDHRELATEWSSIALIHPLFIKEGK 69 Query: 353 LLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTV 532 L DSE + VAT+ ++ SP+QLQ LQP D+QA NV+I+LLDWQVIPAENIVAAFQGS TV Sbjct: 70 LFDSEGRGVATVYDVTSPQQLQLLQPEDKQADNVIINLLDWQVIPAENIVAAFQGSHITV 129 Query: 533 FAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQ 712 FAISK+P EAQIFLEALEQGLGGVVLKVED AVLELKDY D RNE ++L+L K TITQ Sbjct: 130 FAISKSPSEAQIFLEALEQGLGGVVLKVEDATAVLELKDYFDRRNEDNNILSLTKATITQ 189 Query: 713 VQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHA 892 + + GMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHA Sbjct: 190 IHISGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHA 249 Query: 893 YVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQI 1072 YVA+PGGKT YLSEL GKEVIVVDQ+G+QRTAIVGRVKIE RPLILVEAK SDN Sbjct: 250 YVAIPGGKTCYLSELVTGKEVIVVDQNGKQRTAIVGRVKIETRPLILVEAKGD--SDNGT 307 Query: 1073 VYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILE 1252 +Y+++LQNAETV L+CP + + K AIPVTSLKVGDEV+LR+QG ARHTGIEIQEFI+E Sbjct: 308 LYSVLLQNAETVALICPSQGSGYQKKAIPVTSLKVGDEVLLRLQGGARHTGIEIQEFIVE 367 Query: 1253 K 1255 K Sbjct: 368 K 368 >ref|XP_006827144.1| hypothetical protein AMTR_s00010p00251120 [Amborella trichopoda] gi|548831573|gb|ERM94381.1| hypothetical protein AMTR_s00010p00251120 [Amborella trichopoda] Length = 414 Score = 540 bits (1391), Expect = e-151 Identities = 276/417 (66%), Positives = 339/417 (81%), Gaps = 2/417 (0%) Frame = +2 Query: 11 MALLNSSLSLRISKPMISFTPQTGNRCRWIHSATL-MNVGYEVNSNATMCTYVPTTFENY 187 MA+L S+ S ++ +P ++ + G+ C + S L M ++ + T FE Y Sbjct: 1 MAILLSA-SQKLFRPPLAL--KIGDNCHSVWSCPLKMASRDQLQAKCQAMMPSSTNFEIY 57 Query: 188 EQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELLDSE 367 + PK VW+WTE K VMTAAVERGW+TF+F+S +R LA+EWSSIA++ PLF++ GE+ DSE Sbjct: 58 DPPKAVWVWTEKKDVMTAAVERGWNTFVFSSHSRKLADEWSSIAMIKPLFIQEGEIFDSE 117 Query: 368 NKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFAISK 547 NKR+A +SEI P+QL+QLQ +D QA+NVVISL+DWQVIPAENIVA FQGSQ V AI K Sbjct: 118 NKRIAIVSEISCPEQLEQLQLLDGQAENVVISLMDWQVIPAENIVAVFQGSQTKVLAIGK 177 Query: 548 TPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQVVG 727 TP EAQ+FLEALEQGL GVVLK+ED + +L+LK+Y D RNE +++L+LVK T++QVQV G Sbjct: 178 TPSEAQLFLEALEQGLSGVVLKIEDSEVILKLKEYFDRRNEVKNVLSLVKATVSQVQVAG 237 Query: 728 MGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVP 907 MGDRVCVDLC+LMRPGEGLLVGS+ARGL LVHSECL S+YI+SRPFRVNAGPVHAYVAVP Sbjct: 238 MGDRVCVDLCTLMRPGEGLLVGSYARGLLLVHSECLASSYISSRPFRVNAGPVHAYVAVP 297 Query: 908 GGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKAS-NGSDNQIVYNI 1084 GGKT YLSEL +GKEVIVVD +GRQRTA+VGRVKIE RPLILVEAK + SD++ Y+I Sbjct: 298 GGKTCYLSELQSGKEVIVVDLNGRQRTAVVGRVKIETRPLILVEAKLQIDDSDDKTKYSI 357 Query: 1085 ILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILEK 1255 +LQNAETVGLVCP++ + +AIPVT+LKVGDEV+LRVQG ARHTGIEIQEFI+EK Sbjct: 358 LLQNAETVGLVCPFQVGKHNMSAIPVTTLKVGDEVLLRVQGGARHTGIEIQEFIIEK 414 >ref|XP_004302345.1| PREDICTED: 3-dehydroquinate synthase-like [Fragaria vesca subsp. vesca] Length = 403 Score = 538 bits (1387), Expect = e-150 Identities = 279/400 (69%), Positives = 327/400 (81%), Gaps = 3/400 (0%) Frame = +2 Query: 65 FTPQT---GNRCRWIHSATLMNVGYEVNSNATMCTYVPTTFENYEQPKKVWIWTESKQVM 235 FTP T N CR I S ++ + N+++ + +F + + K VW+WTESKQVM Sbjct: 10 FTPPTDKWSNICRLISSHNRHSMEAKATQNSSVASSSTMSFRSSK--KTVWVWTESKQVM 67 Query: 236 TAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELLDSENKRVATISEICSPKQL 415 TAAVERGW+TF+F S+ LA++WSSIAL+ PL ++ G + DSEN RVAT+ E+ SP++L Sbjct: 68 TAAVERGWNTFVFQSQK--LADDWSSIALIDPLLMKEGGIFDSENTRVATVFEVSSPEEL 125 Query: 416 QQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFAISKTPLEAQIFLEALEQGL 595 +QLQP + +NVV+ LLDWQVIPAENIVAAFQGSQKTVFA+SKTP+EAQ+F EALE GL Sbjct: 126 EQLQPENGVGENVVVDLLDWQVIPAENIVAAFQGSQKTVFAVSKTPVEAQVFFEALEHGL 185 Query: 596 GGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQVVGMGDRVCVDLCSLMRPG 775 GGVVLKVEDV AVL+LKDY D R+E ++L+L K +T VQV GMGDRVCVDLCSLMRPG Sbjct: 186 GGVVLKVEDVQAVLDLKDYFDRRDEVGNILSLTKAIVTGVQVAGMGDRVCVDLCSLMRPG 245 Query: 776 EGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELHAGKEV 955 EGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL AGKEV Sbjct: 246 EGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELKAGKEV 305 Query: 956 IVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVYNIILQNAETVGLVCPYRAN 1135 I+VDQ G QRTAIVGR KIE RPLILVEAK SD+Q +Y+I++QNAETV LVCP + + Sbjct: 306 ILVDQEGHQRTAIVGRAKIETRPLILVEAKMC--SDDQTIYSILVQNAETVALVCPKKES 363 Query: 1136 ESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILEK 1255 KTAIPVTSLKVGDE+MLR+QG ARHTGIEIQEFI+EK Sbjct: 364 GGRKTAIPVTSLKVGDEIMLRLQGGARHTGIEIQEFIVEK 403 >ref|XP_002323844.2| hypothetical protein POPTR_0017s11670g [Populus trichocarpa] gi|550320061|gb|EEF03977.2| hypothetical protein POPTR_0017s11670g [Populus trichocarpa] Length = 411 Score = 533 bits (1374), Expect = e-149 Identities = 287/420 (68%), Positives = 336/420 (80%), Gaps = 5/420 (1%) Frame = +2 Query: 8 MMALLNSS--LSLRISKPMISFTPQTGNRCRW-IHSATLMNVGYEVN--SNATMCTYVPT 172 M LL+S+ L K FTP T R ++ TL+ V S++T + + Sbjct: 1 MATLLSSTSFLGFPFPKHFSYFTPLTDKRNSLRLNKETLLRYSCCVTTCSSSTSVFTMSS 60 Query: 173 TFENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGE 352 + +YE+ K+VWIWTESKQVMTAAVERGW+TFIF S +R LA +WSS + + PLF+E GE Sbjct: 61 SGGSYEKSKRVWIWTESKQVMTAAVERGWNTFIFLSNHRQLAIDWSSFSFINPLFIEEGE 120 Query: 353 LLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTV 532 +LD ENKRVATI E+ +P++LQQLQP + QA+NV+I+LLDWQ+IPAENIVAAFQGSQKTV Sbjct: 121 VLDGENKRVATIFEVSTPQELQQLQPENGQAENVIINLLDWQIIPAENIVAAFQGSQKTV 180 Query: 533 FAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQ 712 AISKT EAQIFLEALE GLGGVVLKVEDV+AV++LK+Y D RNEA +LL+L K TIT+ Sbjct: 181 LAISKTHSEAQIFLEALEHGLGGVVLKVEDVEAVIKLKEYCDRRNEATNLLSLTKATITR 240 Query: 713 VQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHA 892 VQV GMGDRVCVDLCSLM+PGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHA Sbjct: 241 VQVAGMGDRVCVDLCSLMKPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHA 300 Query: 893 YVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQI 1072 YV++PGG+T YLSEL AG+EV V DQ+G+ RTAIVGRVKIE RPLILVEAK SD+Q Sbjct: 301 YVSIPGGRTCYLSELKAGEEVSVADQNGQLRTAIVGRVKIETRPLILVEAK----SDDQT 356 Query: 1073 VYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILE 1252 VY+I LQNAETV L+ P A AIPVTSLKVGDEV+LR+QG ARHTGIEIQEFI+E Sbjct: 357 VYSIFLQNAETVALIPPCEA------AIPVTSLKVGDEVLLRIQGGARHTGIEIQEFIVE 410 >ref|XP_007032476.1| Prokaryotic-type isoform 3 [Theobroma cacao] gi|508711505|gb|EOY03402.1| Prokaryotic-type isoform 3 [Theobroma cacao] Length = 419 Score = 531 bits (1368), Expect = e-148 Identities = 271/377 (71%), Positives = 309/377 (81%), Gaps = 4/377 (1%) Frame = +2 Query: 134 VNSNATMCTYV----PTTFENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLAN 301 +NS+ MC+ P + YEQ K+VWIWTE+ QVMTAAVERGW+TFIF+S+N+ L N Sbjct: 44 INSSVRMCSVAASDSPVSTALYEQSKRVWIWTENSQVMTAAVERGWNTFIFSSQNQGLVN 103 Query: 302 EWSSIALLYPLFVENGELLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQV 481 EWSSIA + PL ++ G + DS KRVATI E+ +P L+++Q DE NVVI LLDWQV Sbjct: 104 EWSSIAFIDPLIIKEGGIFDSAGKRVATIFEVSTPADLKKVQSEDEHTGNVVIDLLDWQV 163 Query: 482 IPAENIVAAFQGSQKTVFAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDG 661 IPAENIVA QGSQ T FA+SK+P EAQ+FLEALE GLGGVVLK EDV AVL+LK+Y D Sbjct: 164 IPAENIVAELQGSQTTAFAVSKSPAEAQLFLEALEHGLGGVVLKAEDVKAVLDLKEYFDR 223 Query: 662 RNEARHLLNLVKVTITQVQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES 841 RNE + L+L K T+TQV VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES Sbjct: 224 RNEVHNRLSLSKATVTQVHAVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES 283 Query: 842 NYIASRPFRVNAGPVHAYVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKR 1021 NYIASRPFRVNAGPVH YVAVPGGKTSYLSEL AGKEVIVVDQ G+ +TAIVGRVKIE R Sbjct: 284 NYIASRPFRVNAGPVHTYVAVPGGKTSYLSELKAGKEVIVVDQKGKLKTAIVGRVKIETR 343 Query: 1022 PLILVEAKASNGSDNQIVYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRV 1201 PLILVEAK +++Q VY+I+LQNAETV LVC ++ N KTAIPVTSLKVGDEV+LR+ Sbjct: 344 PLILVEAK--RDANDQTVYSILLQNAETVALVCTHKGNTMQKTAIPVTSLKVGDEVLLRL 401 Query: 1202 QGEARHTGIEIQEFILE 1252 QG ARHTGIEIQEFILE Sbjct: 402 QGAARHTGIEIQEFILE 418 >ref|XP_004147467.1| PREDICTED: 3-dehydroquinate synthase-like [Cucumis sativus] gi|449520920|ref|XP_004167480.1| PREDICTED: 3-dehydroquinate synthase-like [Cucumis sativus] Length = 423 Score = 530 bits (1364), Expect = e-148 Identities = 270/369 (73%), Positives = 308/369 (83%), Gaps = 2/369 (0%) Frame = +2 Query: 155 CTYVPTT--FENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLY 328 C+Y ++ E K VWIW+E +QVMTAAVERGW TFIF+ N LA+EWSSIAL++ Sbjct: 58 CSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIH 117 Query: 329 PLFVENGELLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAA 508 PLF++ +LD E++ +A++ E+ +P+QL+QLQP A VV+ L DWQ+IPAENIVAA Sbjct: 118 PLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAA 177 Query: 509 FQGSQKTVFAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLN 688 FQGSQKTVFAISKTP+EAQIFLEALE GLGGV+LKVED +AV +LKDY D RNEA +LLN Sbjct: 178 FQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLN 237 Query: 689 LVKVTITQVQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFR 868 L K TITQ+ VVGMGDRVCVDLCSLMRPGEGLLVGS+ARGLFL+HSECLESNYIASRPFR Sbjct: 238 LTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFR 297 Query: 869 VNAGPVHAYVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKA 1048 VNAGPVHAYVAVPGGKTSYLSEL AG EVIVVDQ GRQRTAIVGRVKIE R LILV+AK Sbjct: 298 VNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK- 356 Query: 1049 SNGSDNQIVYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGI 1228 SD Q Y+++LQNAETV LVCP + N K AIPVTSLKVGDEV LR+QGEARHTGI Sbjct: 357 -RDSDEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGI 414 Query: 1229 EIQEFILEK 1255 EIQEFI+EK Sbjct: 415 EIQEFIVEK 423 >ref|XP_002517488.1| conserved hypothetical protein [Ricinus communis] gi|223543499|gb|EEF45030.1| conserved hypothetical protein [Ricinus communis] Length = 419 Score = 523 bits (1346), Expect = e-146 Identities = 274/420 (65%), Positives = 330/420 (78%), Gaps = 5/420 (1%) Frame = +2 Query: 8 MMALLNSSLSLRISKPMIS--FTPQTGNRCRWIHSATLMNVGYEVNSNATMCTYVPTT-- 175 M LL SS + I +S F PQ G+ +S + NS M + + Sbjct: 1 MAVLLPSSANTTILPKQLSTAFPPQPGSLNILWNSCNSRKLKTNHNSFVAMSSLNNASRI 60 Query: 176 -FENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGE 352 +Y++ KKVWIWTE+KQVMTAAVERGW+TFIF + R LA+EWSS A++YPLFV+ E Sbjct: 61 SSGDYDKLKKVWIWTENKQVMTAAVERGWNTFIFCYKCRELADEWSSTAMIYPLFVKEDE 120 Query: 353 LLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTV 532 +LD ENKRVA +I +P++L+Q Q + QA+N+V++LLDWQ+IPAENIVAAFQGSQKTV Sbjct: 121 ILDGENKRVAATFDISTPQELEQFQLENAQAENIVVNLLDWQIIPAENIVAAFQGSQKTV 180 Query: 533 FAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQ 712 FA+SKTP EA++FLEALE GLGG++L+VEDV+AV ELK+Y D RNEA ++L L K T+++ Sbjct: 181 FAVSKTPSEAKVFLEALEHGLGGIILRVEDVEAVFELKNYFDRRNEASNVLILTKATVSK 240 Query: 713 VQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHA 892 +Q GMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPV+A Sbjct: 241 IQAAGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVNA 300 Query: 893 YVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQI 1072 Y++VPGGKT YLSEL AGKEVIVVDQ G+ RTAIVGRVKIE RPL+L+EAK SD Q Sbjct: 301 YISVPGGKTCYLSELRAGKEVIVVDQKGQLRTAIVGRVKIESRPLVLLEAKID--SDYQT 358 Query: 1073 VYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILE 1252 VY+I LQNAETV LV P + N + AIPVT+LKVGDEV+LR+QG ARHTGIEIQEFI+E Sbjct: 359 VYSIFLQNAETVALVPPCQGNGTQNVAIPVTALKVGDEVLLRLQGAARHTGIEIQEFIVE 418 >ref|XP_003554373.1| PREDICTED: uncharacterized protein LOC100806285 isoform X1 [Glycine max] Length = 442 Score = 520 bits (1338), Expect = e-145 Identities = 259/359 (72%), Positives = 308/359 (85%) Frame = +2 Query: 179 ENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELL 358 E+ ++ K+VWIWT +KQVMTAAVERGW+TF+F S +R LA++WSSIA++ PLFV GE+L Sbjct: 87 ESGKRSKRVWIWTSNKQVMTAAVERGWNTFVFPSHHRQLAHDWSSIAVICPLFVNEGEVL 146 Query: 359 DSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFA 538 D +NKRVATI ++ +P++L++L+P +EQA+N+V++LLDWQVIPAENI+AAFQ SQ TVFA Sbjct: 147 DGQNKRVATIFDVSTPEELEELRPENEQAENIVVNLLDWQVIPAENIIAAFQRSQNTVFA 206 Query: 539 ISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQ 718 IS EAQ+FLEALE GL G+++KVEDV+ VLELK+Y D R E +LL+L K T+T +Q Sbjct: 207 ISNNTSEAQVFLEALEHGLDGIIMKVEDVEPVLELKEYFDRRMEESNLLSLTKATVTHIQ 266 Query: 719 VVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV 898 GMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV Sbjct: 267 AAGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV 326 Query: 899 AVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVY 1078 AVPGG+T YLSEL +GKEVI+VD GRQR AIVGRVKIE RPLILVEAK SDNQ + Sbjct: 327 AVPGGRTCYLSELKSGKEVIIVDHQGRQRIAIVGRVKIESRPLILVEAKIE--SDNQSI- 383 Query: 1079 NIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILEK 1255 +I+LQNAETV LVC + N LKT+IPVTSLKVGDE++LRVQG ARHTGIEIQEFI+EK Sbjct: 384 SILLQNAETVALVCTPQGNTLLKTSIPVTSLKVGDEILLRVQGGARHTGIEIQEFIVEK 442 >ref|XP_006603860.1| PREDICTED: uncharacterized protein LOC100806285 isoform X2 [Glycine max] Length = 440 Score = 517 bits (1331), Expect = e-144 Identities = 258/357 (72%), Positives = 306/357 (85%) Frame = +2 Query: 179 ENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELL 358 E+ ++ K+VWIWT +KQVMTAAVERGW+TF+F S +R LA++WSSIA++ PLFV GE+L Sbjct: 87 ESGKRSKRVWIWTSNKQVMTAAVERGWNTFVFPSHHRQLAHDWSSIAVICPLFVNEGEVL 146 Query: 359 DSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFA 538 D +NKRVATI ++ +P++L++L+P +EQA+N+V++LLDWQVIPAENI+AAFQ SQ TVFA Sbjct: 147 DGQNKRVATIFDVSTPEELEELRPENEQAENIVVNLLDWQVIPAENIIAAFQRSQNTVFA 206 Query: 539 ISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQ 718 IS EAQ+FLEALE GL G+++KVEDV+ VLELK+Y D R E +LL+L K T+T +Q Sbjct: 207 ISNNTSEAQVFLEALEHGLDGIIMKVEDVEPVLELKEYFDRRMEESNLLSLTKATVTHIQ 266 Query: 719 VVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV 898 GMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV Sbjct: 267 AAGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV 326 Query: 899 AVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVY 1078 AVPGG+T YLSEL +GKEVI+VD GRQR AIVGRVKIE RPLILVEAK SDNQ + Sbjct: 327 AVPGGRTCYLSELKSGKEVIIVDHQGRQRIAIVGRVKIESRPLILVEAKIE--SDNQSI- 383 Query: 1079 NIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFIL 1249 +I+LQNAETV LVC + N LKT+IPVTSLKVGDE++LRVQG ARHTGIEIQEFIL Sbjct: 384 SILLQNAETVALVCTPQGNTLLKTSIPVTSLKVGDEILLRVQGGARHTGIEIQEFIL 440 >gb|EYU34096.1| hypothetical protein MIMGU_mgv1a007488mg [Mimulus guttatus] Length = 405 Score = 514 bits (1323), Expect = e-143 Identities = 263/356 (73%), Positives = 301/356 (84%), Gaps = 1/356 (0%) Frame = +2 Query: 191 QPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELLDSEN 370 Q KKVW+WTE K+VMTAAVERGW+TFIF R LA +WSSIALLYPLF+E G L D E+ Sbjct: 54 QKKKVWVWTEKKEVMTAAVERGWNTFIFPHHFRELAADWSSIALLYPLFIEEGGLFDGEH 113 Query: 371 KRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFAISKT 550 K++A EI SP+QL++LQP+DE A NVVI+LLDWQVIPAENIVAA QG+QKTVFA+SKT Sbjct: 114 KKIAAFFEISSPEQLEKLQPLDELADNVVINLLDWQVIPAENIVAAIQGTQKTVFAVSKT 173 Query: 551 PLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQVVGM 730 EAQ F EALEQGLGGVVLK EDV+++LELKDY++ RNE +L L K +T V++VGM Sbjct: 174 SSEAQTFFEALEQGLGGVVLKTEDVESILELKDYLERRNEEGSVLELTKARVTNVEMVGM 233 Query: 731 GDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPG 910 GDRVCVD+CS+M+PGEGLLVGSFARGLFLVHSECLESNYI+SRPFRVNAGPVHAYVA+PG Sbjct: 234 GDRVCVDICSIMKPGEGLLVGSFARGLFLVHSECLESNYISSRPFRVNAGPVHAYVAIPG 293 Query: 911 GKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVYNIIL 1090 GKTSYLSEL AGKEVIVVDQ+GRQR AIVGRVKIE R LILVEAK D + Y+I+L Sbjct: 294 GKTSYLSELKAGKEVIVVDQNGRQRIAIVGRVKIETRQLILVEAK--RDEDKETSYSILL 351 Query: 1091 QNAETVGLV-CPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILEK 1255 QNAETV LV P N+ + AIPVTSLK+GDE++LRVQG ARHTGIEIQEFILEK Sbjct: 352 QNAETVALVSSPGDGNQ--RRAIPVTSLKLGDEILLRVQGGARHTGIEIQEFILEK 405 >ref|XP_007151212.1| hypothetical protein PHAVU_004G027100g [Phaseolus vulgaris] gi|561024521|gb|ESW23206.1| hypothetical protein PHAVU_004G027100g [Phaseolus vulgaris] Length = 439 Score = 513 bits (1320), Expect = e-142 Identities = 260/359 (72%), Positives = 302/359 (84%) Frame = +2 Query: 179 ENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELL 358 E+ + K+VWIWT +KQVMTAAVERGW+TF+F S +R LA EWS IA++ PLFV E+L Sbjct: 84 ESGKPSKRVWIWTSNKQVMTAAVERGWNTFVFPSHHRQLAREWSEIAVICPLFVNEEEVL 143 Query: 359 DSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFA 538 D +NKRVATI ++ +P++L+ L+P DE A+++V++LLDWQVIPAENI+AAFQ SQKTVFA Sbjct: 144 DEQNKRVATIFDVSNPEELEGLRPEDEHAESIVVNLLDWQVIPAENIIAAFQRSQKTVFA 203 Query: 539 ISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQ 718 IS EAQ+FLEALE GL G+V+K+EDV+ VLELK Y D R E +LL+L K T+T +Q Sbjct: 204 ISNNTSEAQLFLEALEHGLDGIVMKIEDVEPVLELKAYFDRRMEESNLLSLTKATVTHIQ 263 Query: 719 VVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV 898 GMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV Sbjct: 264 GTGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYV 323 Query: 899 AVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVY 1078 AVPG +TSYLSEL +GKEVIVVDQ G QR AIVGRVKIE RPLILVEAK SD Q + Sbjct: 324 AVPGSRTSYLSELKSGKEVIVVDQKGHQRIAIVGRVKIESRPLILVEAKIE--SDTQTI- 380 Query: 1079 NIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILEK 1255 +I+LQNAETV LVCP + N LKTAIPVTSLKVGDE++LRVQG ARHTGIEIQEFI+EK Sbjct: 381 SILLQNAETVALVCPPQGNTVLKTAIPVTSLKVGDEILLRVQGGARHTGIEIQEFIVEK 439 >ref|XP_006351162.1| PREDICTED: uncharacterized protein LOC102591464 [Solanum tuberosum] Length = 394 Score = 511 bits (1317), Expect = e-142 Identities = 262/373 (70%), Positives = 302/373 (80%) Frame = +2 Query: 137 NSNATMCTYVPTTFENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSI 316 N A MC + P+ + K VWIWTE+KQVMTAAVERGW+TFIF S ++LA EWSSI Sbjct: 29 NRVAKMCAFTPSN----SKKKTVWIWTENKQVMTAAVERGWNTFIFPSNRQDLALEWSSI 84 Query: 317 ALLYPLFVENGELLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAEN 496 A++YPLFVE G +D E+K VA +EI SP+QL+Q Q +EQA VV++LLDWQVIPAEN Sbjct: 85 AVIYPLFVEEGRQIDHEHKSVAAFAEISSPQQLEQFQISEEQADKVVVNLLDWQVIPAEN 144 Query: 497 IVAAFQGSQKTVFAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEAR 676 IVA FQG+Q TV +SKT EAQ+FLEALE GLGGVV+KVEDV A+LELK Y D R + Sbjct: 145 IVADFQGTQTTVLVVSKTQSEAQVFLEALEHGLGGVVMKVEDVGAILELKGYFDRRRDVD 204 Query: 677 HLLNLVKVTITQVQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIAS 856 LLNL K I+ +QV GMGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+S Sbjct: 205 SLLNLTKAIISHIQVTGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYISS 264 Query: 857 RPFRVNAGPVHAYVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILV 1036 RPFRVNAGPVHAYVAVPGGKTSYLSEL +GKEVIVVDQ G QRTAIVGRVK+E RPLILV Sbjct: 265 RPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVVDQRGMQRTAIVGRVKVETRPLILV 324 Query: 1037 EAKASNGSDNQIVYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEAR 1216 EAK + +++ Y+I+LQNAETVGLV P +T IPVTSLKVGDEV+L +QG AR Sbjct: 325 EAKVESENES---YSILLQNAETVGLVSPLHGEGHQRTTIPVTSLKVGDEVLLLLQGGAR 381 Query: 1217 HTGIEIQEFILEK 1255 HTGIEI+EFI+EK Sbjct: 382 HTGIEIKEFIVEK 394 >gb|EXB94290.1| 3-dehydroquinate synthase [Morus notabilis] Length = 424 Score = 508 bits (1307), Expect = e-141 Identities = 257/352 (73%), Positives = 299/352 (84%) Frame = +2 Query: 197 KKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELLDSENKR 376 K+VWIWTE+KQVMTAAVERGW+TFIF+ E+R L+++WSSIA++ PL++E G + D ENKR Sbjct: 74 KRVWIWTENKQVMTAAVERGWNTFIFSPESRKLSDDWSSIAVISPLYLEEGGIFDGENKR 133 Query: 377 VATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFAISKTPL 556 + +I I + ++L+ LQP +NVV+ LLDWQVIPAENIVAAFQGS +TVFAISK Sbjct: 134 IGSIFGISNNQELELLQPEKGLGENVVVDLLDWQVIPAENIVAAFQGSDRTVFAISKNSS 193 Query: 557 EAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQVVGMGD 736 EAQIFLEALEQGLGGVVLKVED A+LELK+Y D RN+ ++L+L K TIT+VQV GMGD Sbjct: 194 EAQIFLEALEQGLGGVVLKVEDAKAILELKEYFDRRNDMSNILSLTKATITRVQVAGMGD 253 Query: 737 RVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK 916 RVCVDLCS+MRPGEGLLVGSFARGLFLVHSECLE NYIASRPFRVNAGPVHAYVA+PGGK Sbjct: 254 RVCVDLCSIMRPGEGLLVGSFARGLFLVHSECLEWNYIASRPFRVNAGPVHAYVAIPGGK 313 Query: 917 TSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVYNIILQN 1096 T YLSEL GKEVIVV+Q G+QR AIVGRVKIE RPLILVEAK SD+Q +Y+I+LQN Sbjct: 314 TCYLSELKVGKEVIVVNQKGQQRNAIVGRVKIETRPLILVEAKLD--SDSQTLYSILLQN 371 Query: 1097 AETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILE 1252 AETV LV P++ + AIPVTSLKVGDEV+LRVQG ARHTGIEIQEFI+E Sbjct: 372 AETVALVSPFQGDGLQNAAIPVTSLKVGDEVVLRVQGGARHTGIEIQEFIVE 423 >ref|XP_004234776.1| PREDICTED: 3-dehydroquinate synthase-like [Solanum lycopersicum] Length = 394 Score = 505 bits (1301), Expect = e-140 Identities = 258/374 (68%), Positives = 302/374 (80%) Frame = +2 Query: 134 VNSNATMCTYVPTTFENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSS 313 +N A MC + P+ + K VWIWTE+KQVMTAAVE GW+TFIF S ++LA EWSS Sbjct: 28 INRVARMCAFTPSN----SKKKTVWIWTENKQVMTAAVEGGWNTFIFPSNRQDLALEWSS 83 Query: 314 IALLYPLFVENGELLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAE 493 IA+++P+F++ G L+D E+K VA +EI SP+QL+Q Q +EQ+ VV++LLDWQVIPAE Sbjct: 84 IAVIHPVFIKEGRLIDHEHKSVAAFAEISSPQQLEQFQISEEQSDKVVVNLLDWQVIPAE 143 Query: 494 NIVAAFQGSQKTVFAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEA 673 NIVAAFQG+Q TV A+SK EAQ FLEALE GLGGVV+KVEDV A+LELK Y D R E Sbjct: 144 NIVAAFQGTQTTVLAVSKNQSEAQAFLEALEHGLGGVVMKVEDVGAILELKGYFDRRREV 203 Query: 674 RHLLNLVKVTITQVQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIA 853 LLNL K IT +QV GMGDRVCVD+CSLMRPGEGLLVGSFARGLFLVHSECLESNYI+ Sbjct: 204 DSLLNLTKAIITHIQVTGMGDRVCVDICSLMRPGEGLLVGSFARGLFLVHSECLESNYIS 263 Query: 854 SRPFRVNAGPVHAYVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLIL 1033 SRPFRVNAGPVHAYVAVPGGKTSYLSEL +GKEVIVVDQ G QRTAIVGRVK+E RPLIL Sbjct: 264 SRPFRVNAGPVHAYVAVPGGKTSYLSELKSGKEVIVVDQRGMQRTAIVGRVKVETRPLIL 323 Query: 1034 VEAKASNGSDNQIVYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEA 1213 VEAK + +++ Y+I+LQNAETVGLV P +T IPVTSL+VG EV+L +QG A Sbjct: 324 VEAKVESENES---YSILLQNAETVGLVSPLHGEGHQRTTIPVTSLEVGSEVLLLLQGGA 380 Query: 1214 RHTGIEIQEFILEK 1255 RHTGIEI+EFI+EK Sbjct: 381 RHTGIEIKEFIVEK 394 >ref|XP_007032475.1| Prokaryotic-type, putative isoform 2 [Theobroma cacao] gi|508711504|gb|EOY03401.1| Prokaryotic-type, putative isoform 2 [Theobroma cacao] Length = 415 Score = 503 bits (1295), Expect = e-140 Identities = 257/367 (70%), Positives = 295/367 (80%), Gaps = 9/367 (2%) Frame = +2 Query: 134 VNSNATMCTYV----PTTFENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLAN 301 +NS+ MC+ P + YEQ K+VWIWTE+ QVMTAAVERGW+TFIF+S+N+ L N Sbjct: 44 INSSVRMCSVAASDSPVSTALYEQSKRVWIWTENSQVMTAAVERGWNTFIFSSQNQGLVN 103 Query: 302 EWSSIALLYPLFVENGELLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQV 481 EWSSIA + PL ++ G + DS KRVATI E+ +P L+++Q DE NVVI LLDWQV Sbjct: 104 EWSSIAFIDPLIIKEGGIFDSAGKRVATIFEVSTPADLKKVQSEDEHTGNVVIDLLDWQV 163 Query: 482 IPAENIVAAFQGSQKTVFAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDG 661 IPAENIVA QGSQ T FA+SK+P EAQ+FLEALE GLGGVVLK EDV AVL+LK+Y D Sbjct: 164 IPAENIVAELQGSQTTAFAVSKSPAEAQLFLEALEHGLGGVVLKAEDVKAVLDLKEYFDR 223 Query: 662 RNEARHLLNLVKVTITQVQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES 841 RNE + L+L K T+TQV VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES Sbjct: 224 RNEVHNRLSLSKATVTQVHAVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES 283 Query: 842 NYIASRPFRVNAGPVHAYVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKR 1021 NYIASRPFRVNAGPVH YVAVPGGKTSYLSEL AGKEVIVVDQ G+ +TAIVGRVKIE R Sbjct: 284 NYIASRPFRVNAGPVHTYVAVPGGKTSYLSELKAGKEVIVVDQKGKLKTAIVGRVKIETR 343 Query: 1022 PLILVEAK-----ASNGSDNQIVYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDE 1186 PLILVEAK +++Q VY+I+LQNAETV LVC ++ N KTAIPVTSLKVGDE Sbjct: 344 PLILVEAKYWTLLPQRDANDQTVYSILLQNAETVALVCTHKGNTMQKTAIPVTSLKVGDE 403 Query: 1187 VMLRVQG 1207 V+LR+QG Sbjct: 404 VLLRLQG 410 >ref|XP_007032474.1| Prokaryotic-type, putative isoform 1 [Theobroma cacao] gi|508711503|gb|EOY03400.1| Prokaryotic-type, putative isoform 1 [Theobroma cacao] Length = 423 Score = 503 bits (1295), Expect = e-140 Identities = 257/367 (70%), Positives = 295/367 (80%), Gaps = 9/367 (2%) Frame = +2 Query: 134 VNSNATMCTYV----PTTFENYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLAN 301 +NS+ MC+ P + YEQ K+VWIWTE+ QVMTAAVERGW+TFIF+S+N+ L N Sbjct: 44 INSSVRMCSVAASDSPVSTALYEQSKRVWIWTENSQVMTAAVERGWNTFIFSSQNQGLVN 103 Query: 302 EWSSIALLYPLFVENGELLDSENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQV 481 EWSSIA + PL ++ G + DS KRVATI E+ +P L+++Q DE NVVI LLDWQV Sbjct: 104 EWSSIAFIDPLIIKEGGIFDSAGKRVATIFEVSTPADLKKVQSEDEHTGNVVIDLLDWQV 163 Query: 482 IPAENIVAAFQGSQKTVFAISKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDG 661 IPAENIVA QGSQ T FA+SK+P EAQ+FLEALE GLGGVVLK EDV AVL+LK+Y D Sbjct: 164 IPAENIVAELQGSQTTAFAVSKSPAEAQLFLEALEHGLGGVVLKAEDVKAVLDLKEYFDR 223 Query: 662 RNEARHLLNLVKVTITQVQVVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES 841 RNE + L+L K T+TQV VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES Sbjct: 224 RNEVHNRLSLSKATVTQVHAVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLES 283 Query: 842 NYIASRPFRVNAGPVHAYVAVPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKR 1021 NYIASRPFRVNAGPVH YVAVPGGKTSYLSEL AGKEVIVVDQ G+ +TAIVGRVKIE R Sbjct: 284 NYIASRPFRVNAGPVHTYVAVPGGKTSYLSELKAGKEVIVVDQKGKLKTAIVGRVKIETR 343 Query: 1022 PLILVEAK-----ASNGSDNQIVYNIILQNAETVGLVCPYRANESLKTAIPVTSLKVGDE 1186 PLILVEAK +++Q VY+I+LQNAETV LVC ++ N KTAIPVTSLKVGDE Sbjct: 344 PLILVEAKYWTLLPQRDANDQTVYSILLQNAETVALVCTHKGNTMQKTAIPVTSLKVGDE 403 Query: 1187 VMLRVQG 1207 V+LR+QG Sbjct: 404 VLLRLQG 410 >ref|NP_001030791.1| uncharacterized protein [Arabidopsis thaliana] gi|222424331|dbj|BAH20122.1| AT3G28760 [Arabidopsis thaliana] gi|332643967|gb|AEE77488.1| uncharacterized protein AT3G28760 [Arabidopsis thaliana] Length = 444 Score = 499 bits (1285), Expect = e-138 Identities = 247/357 (69%), Positives = 299/357 (83%) Frame = +2 Query: 182 NYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELLD 361 N + KKVWIWT K+VMT AVERGW+TFIF+S+NR L+NEWSSIAL+ LF+E +++D Sbjct: 88 NLGKAKKVWIWTMCKEVMTVAVERGWNTFIFSSDNRKLSNEWSSIALMDTLFIEEKKVID 147 Query: 362 SENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFAI 541 VA++ E+ +P++L+ L +EQ +N+V+ LDW+ IPAEN+VAA QGS+KTVFA+ Sbjct: 148 GTGNVVASVFEVSTPEELRSLNIENEQIENIVLDFLDWKSIPAENLVAALQGSEKTVFAV 207 Query: 542 SKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQV 721 S TP EA++FLEALE GLGG++LK EDV AVL+LK+Y D RNE L+L + TIT+VQ+ Sbjct: 208 SNTPSEAKLFLEALEHGLGGIILKSEDVKAVLDLKEYFDKRNEESDTLSLTEATITRVQM 267 Query: 722 VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVA 901 VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYI SRPFRVNAGPVHAYVA Sbjct: 268 VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIESRPFRVNAGPVHAYVA 327 Query: 902 VPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVYN 1081 VPGGKT YLSEL G+EVIVVDQ G+QRTA+VGRVKIEKRPLI+VEAK S + + VY+ Sbjct: 328 VPGGKTCYLSELRTGREVIVVDQKGKQRTAVVGRVKIEKRPLIVVEAKLST-KEEETVYS 386 Query: 1082 IILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILE 1252 IILQNAETV LV P++ N S +TA+PVTSLK GD+V++R+QG ARHTGIEIQEFI+E Sbjct: 387 IILQNAETVALVTPHQVNSSGRTAVPVTSLKPGDQVLIRLQGGARHTGIEIQEFIVE 443 >ref|NP_189518.2| uncharacterized protein [Arabidopsis thaliana] gi|27754381|gb|AAO22639.1| unknown protein [Arabidopsis thaliana] gi|28973463|gb|AAO64056.1| unknown protein [Arabidopsis thaliana] gi|332643966|gb|AEE77487.1| uncharacterized protein AT3G28760 [Arabidopsis thaliana] Length = 422 Score = 499 bits (1285), Expect = e-138 Identities = 247/357 (69%), Positives = 299/357 (83%) Frame = +2 Query: 182 NYEQPKKVWIWTESKQVMTAAVERGWDTFIFTSENRNLANEWSSIALLYPLFVENGELLD 361 N + KKVWIWT K+VMT AVERGW+TFIF+S+NR L+NEWSSIAL+ LF+E +++D Sbjct: 66 NLGKAKKVWIWTMCKEVMTVAVERGWNTFIFSSDNRKLSNEWSSIALMDTLFIEEKKVID 125 Query: 362 SENKRVATISEICSPKQLQQLQPVDEQAKNVVISLLDWQVIPAENIVAAFQGSQKTVFAI 541 VA++ E+ +P++L+ L +EQ +N+V+ LDW+ IPAEN+VAA QGS+KTVFA+ Sbjct: 126 GTGNVVASVFEVSTPEELRSLNIENEQIENIVLDFLDWKSIPAENLVAALQGSEKTVFAV 185 Query: 542 SKTPLEAQIFLEALEQGLGGVVLKVEDVDAVLELKDYMDGRNEARHLLNLVKVTITQVQV 721 S TP EA++FLEALE GLGG++LK EDV AVL+LK+Y D RNE L+L + TIT+VQ+ Sbjct: 186 SNTPSEAKLFLEALEHGLGGIILKSEDVKAVLDLKEYFDKRNEESDTLSLTEATITRVQM 245 Query: 722 VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIASRPFRVNAGPVHAYVA 901 VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYI SRPFRVNAGPVHAYVA Sbjct: 246 VGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIESRPFRVNAGPVHAYVA 305 Query: 902 VPGGKTSYLSELHAGKEVIVVDQSGRQRTAIVGRVKIEKRPLILVEAKASNGSDNQIVYN 1081 VPGGKT YLSEL G+EVIVVDQ G+QRTA+VGRVKIEKRPLI+VEAK S + + VY+ Sbjct: 306 VPGGKTCYLSELRTGREVIVVDQKGKQRTAVVGRVKIEKRPLIVVEAKLST-KEEETVYS 364 Query: 1082 IILQNAETVGLVCPYRANESLKTAIPVTSLKVGDEVMLRVQGEARHTGIEIQEFILE 1252 IILQNAETV LV P++ N S +TA+PVTSLK GD+V++R+QG ARHTGIEIQEFI+E Sbjct: 365 IILQNAETVALVTPHQVNSSGRTAVPVTSLKPGDQVLIRLQGGARHTGIEIQEFIVE 421