BLASTX nr result
ID: Achyranthes23_contig00006309
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00006309 (2783 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

emb|CBI16022.3| unnamed protein product [Vitis vinifera]  316  3e-83
ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II tra...  295  6e-77
ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citr...  291  8e-76
ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus c...  274  1e-70
ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Popu...  271  1e-69
gb|EOY33856.1| Uncharacterized protein isoform 7 [Theobroma cacao]  261  1e-66
gb|EOY33851.1| Uncharacterized protein isoform 2 [Theobroma caca...  261  1e-66
gb|EOY33857.1| Uncharacterized protein isoform 8 [Theobroma cacao]  260  3e-66
ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Popu...  252  7e-64
gb|EMJ06149.1| hypothetical protein PRUPE_ppa000292mg [Prunus pe...  236  3e-59
ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214...  223  3e-55
ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205...  223  3e-55
emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]  221  2e-54
ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227...  220  2e-54
gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis]  214  2e-52
ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314...
209 7e-51 gb|EOY33855.1| Uncharacterized protein isoform 6 [Theobroma cacao] 197 2e-47 gb|EOY33854.1| Uncharacterized protein isoform 5 [Theobroma cacao] 197 2e-47 gb|EOY33850.1| Uncharacterized protein isoform 1 [Theobroma cacao] 197 2e-47 ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [A... 184 2e-43 >emb|CBI16022.3| unnamed protein product [Vitis vinifera] Length = 1669 Score = 316 bits (810), Expect = 3e-83 Identities = 226/625 (36%), Positives = 299/625 (47%), Gaps = 71/625 (11%) Frame = +1 Query: 802 VDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADN 981 +D GR+Q P+QYGP+VQ RP A S ++ P P + A Sbjct: 1020 LDGGRHQPPPMQYGPTVQQRPAAPSSGQAMPPPGLVHNAPVPGQPSTQLQP-----QALG 1074 Query: 982 SQPSSIKHPPGRVPHENASGAMVTPGLSGPF----------------PRPDNMGYY-QAS 1110 P + G HE G ++ PG + F P + G+Y Q Sbjct: 1075 LLPHPAQQSRGSFHHEIPPGGILGPGSAASFGRGLSHFAPPQRSFEPPSVVSQGHYNQGH 1134 Query: 1111 MPPYQAGQPQNPAGEPFGGSSFAAQRPGALDSHVGVREREPA---DFEQRPPYPMENEKF 1281 P AG + GE G G+ DSH G+ R P D +QRP P+E+E F Sbjct: 1135 GLPSHAGPSRISQGELIGRPPLGPLPAGSFDSHGGMMVRAPPHGPDGQQRPVNPVESEIF 1194 Query: 1282 PVQRPGSFDGRKPESLPHGSLDRAAYGPVQP-GVQLGAMKIGGPPAHDSMSAPGMRDERG 1458 RP FDGR+ +S GS +R +G QP GVQ M++ G +S G++DER Sbjct: 1195 SNPRPNYFDGRQSDSHIPGSSERGPFG--QPSGVQSNMMRMNGGLGIESSLPVGLQDERF 1252 Query: 1459 MPFPEERLKRIPHREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPS 1638 PE + H +F +D ++F RSSH ++ K G F SS PLD G + D Sbjct: 1253 KSLPEPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFGNYFSSSRPLDRGSQGFVMDAAQ 1312 Query: 1639 RPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHR 1818 +KAP GF D G K S+ G+G RF PP HP GER R F +D++GR D R Sbjct: 1313 GLLDKAPLGFNYDSGFK--SSAGTGTSRFFPPPHPGGDGERSRAVGFHEDNVGRSDMA-R 1369 Query: 1819 ADFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGA--------------------- 1935 + G+ P YGR MDG PRSP R+F G+P FG Sbjct: 1370 THPNFLGSVPEYGRHHMDGLNPRSPTREFSGIPHRGFGGLSGVPGRQSDLDDIDGRESRR 1429 Query: 1936 FGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEIDVPG---------------NLRVGGP 2070 FG+G +F S +SRFPV P+HL+RGE++ PG +LR G 
Sbjct: 1430 FGEGSKTFNLPS-----DESRFPVLPSHLRRGELEGPGELVMADPIASRPAPHHLRGGDL 1484 Query: 2071 RNQDMLPNHLRR-DLVGPRN----MHMGDPT------KPRMGEPPLARNFPQHLPFGESF 2217 QD+LP+HL+R + G RN + G+P PRMGE NFP L GESF Sbjct: 1485 IGQDILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFLGHPRMGELSGPGNFPSRLSAGESF 1544 Query: 2218 GG-EKPGHPLAGEPXXXXXXXXXXXXHEGGFY-PDEMEPFDDPRKWKPVGI-MCRICKVE 2388 GG K GHP GEP ++ GF P +ME FD+ RK KP+ + CRIC ++ Sbjct: 1545 GGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNSRKRKPLSMAWCRICNID 1604 Query: 2389 CGTVEGLDLHSQSREHQRKARDMVL 2463 C TV+GLD+HSQ+REHQ+ A D+VL Sbjct: 1605 CETVDGLDMHSQTREHQQMAMDIVL 1629 >ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X1 [Citrus sinensis] gi|568870502|ref|XP_006488441.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X2 [Citrus sinensis] gi|568870504|ref|XP_006488442.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X3 [Citrus sinensis] gi|568870506|ref|XP_006488443.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X4 [Citrus sinensis] Length = 1392 Score = 295 bits (756), Expect = 6e-77 Identities = 281/845 (33%), Positives = 360/845 (42%), Gaps = 88/845 (10%) Frame = +1 Query: 304 VRPPQL-------NQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHD- 459 VRP QL NQ+ S +NQ+ +SEQQAG +P + +++V K H+ Sbjct: 616 VRPAQLGANQSSSNQSNLSWTSNQVQLSSEQQAGATSKPEM-----SEKNEVAVKIAHER 670 Query: 460 --QNNPNKVAK----NLVGGSGAGVLMN-------------------EAKMNAESTLDSG 564 +++ K AK + G A V M E K N T Sbjct: 671 EAESSSEKTAKTDNFDTPGPEAAAVGMKVPKSETDVKAAVDEIKTEVEDKTNVVDTSSKE 730 Query: 565 FDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTK---IGAEDHKDVRKKSEVQESKQTA 735 F + +P+ E V+E G K + I E+H EVQE Sbjct: 731 FVTDRESHIAENVQPINKMVKEEVIENVEGQKDSANVDIKQEEHS---VSKEVQEEPLLK 787 Query: 736 KSGAPNMPQSNLSTQVHGTNASVDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQ 915 S Q ++ V Q + P P+ Q + G S + + ++ Q Sbjct: 788 TSTMQQGTQFGEQSEKVQKEQKVPQAQGAQGPGAVPPAGQAQAGGFVQSAPSLYGSSTLQ 847 Query: 
916 PTIXXXXXXXXXXXXXFNSADNSQPSSIKHPP-GRVPHENASG--------AMVTPG--- 1059 + PS + PP G VP A A V PG Sbjct: 848 QR-------------------PAAPSIFQAPPPGAVPQTQAPTQFRPPMFKAEVPPGGIP 888 Query: 1060 LSGP---FPR-PDNMGYYQASM-PPYQAGQ-PQN---PAGEPFGGSSFAAQRPGALDSHV 1212 +SGP F R P + G +Q S PP A Q P N P P GG + DSHV Sbjct: 889 VSGPAASFGRGPGHNGPHQHSFEPPLVAPQGPYNLGHPHPSPVGGPPQRSVPLSGFDSHV 948 Query: 1213 GVRERE------PADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQP 1374 G P D +Q P PME E F QRPG DGR+ +S GS R+ GP Sbjct: 949 GTMVGPAYGPGGPMDLKQ-PSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQRSPLGPPS- 1006 Query: 1375 GVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPRKF 1527 G + M++ G P + +RDER FP+ RL P EFE+D ++F Sbjct: 1007 GTRSNMMRMNGGPGSE------LRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLKQF 1060 Query: 1528 PRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSAVG 1707 R SH +A P KLG+ F S P D GPH Y D RPFE+ G DPGLK+D Sbjct: 1061 SRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGPRPFER---GLSYDPGLKLDPMGA 1117 Query: 1708 SGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGH-RADFSGPGAGPGYGRSRMDGFPP 1884 S P RFLP +H DD+ GR D H DF PG YGR M G P Sbjct: 1118 SAPSRFLPAYH--------------DDAAGRSDSSHAHPDFPRPGRA--YGRRHMGGLSP 1161 Query: 1885 RSPGRDFPGLPSGTFGAFGD--------GGNSFP--AESFGKSIHDSRFPVPPNHLQRGE 2034 RS R+F G G G+ G GG F + G S HDSRFPV P+HL+RGE Sbjct: 1162 RSSFREFCGF-GGLPGSLGGSRSVREDIGGREFRRFGDPIGNSFHDSRFPVLPSHLRRGE 1220 Query: 2035 IDVPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPFGE 2211 + PG R G Q+ LP+HLRR + +GP N+ +G+ T G P AR E Sbjct: 1221 FEGPG--RTGDLIGQEFLPSHLRRGEPLGPHNLRLGE-TVGLGGFPGPARM--------E 1269 Query: 2212 SFGGEKPGH---PLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCRIC 2379 GG PG+ P GEP ++GGFY +ME D+ RK KP + CRIC Sbjct: 1270 ELGG--PGNFPPPRLGEPGFRSSFSRQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCRIC 1327 Query: 2380 KVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSSFQ 2559 KV+C TV+GLDLHSQ+REHQ+ A DMVL D + RN +F Sbjct: 1328 KVDCETVDGLDLHSQTREHQKMAMDMVLSIKQNAKKQKLTSGDRC-STDDANKSRNVNFD 1386 Query: 2560 GRGNK 2574 GRG K Sbjct: 1387 GRGKK 
1391 >ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citrus clementina] gi|557526921|gb|ESR38227.1| hypothetical protein CICLE_v10027683mg [Citrus clementina] Length = 1392 Score = 291 bits (746), Expect = 8e-76 Identities = 280/847 (33%), Positives = 360/847 (42%), Gaps = 90/847 (10%) Frame = +1 Query: 304 VRPPQL--NQTYPSRVN-----NQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHD- 459 VRP QL NQ+ ++ N NQ+ +SEQQAG +P + +++V K H+ Sbjct: 616 VRPAQLGANQSSSNQSNLFWTSNQVQLSSEQQAGATSKPEM-----SEKNEVAVKIAHER 670 Query: 460 --QNNPNKVAK----NLVGGSGAGVLMN-------------------EAKMNAESTLDSG 564 +++ K AK + G A V M E K N T Sbjct: 671 EAESSSEKTAKTDNFDTPGPEAAAVGMKVPKSETDVKAAVDEIKTEVEDKTNVVDTSSKE 730 Query: 565 FDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTK---IGAEDHKDVRKKSEVQESKQTA 735 F + +P+ E V+E G K + I E+H EVQE Sbjct: 731 FVTDRESHIAENVQPINKMVKEEVIENVEGQKDSANVDIKQEEHS---VSKEVQEEPLLK 787 Query: 736 KSGAPNMPQSNLSTQVHGTNASVDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQ 915 S Q ++ V Q + P P+ Q + G S + + ++ Q Sbjct: 788 TSTMQQGTQFGEQSEKVQKEQKVPQAQGAQGPGAVPPAGQAQAGGFVQSAPSLYGSSTLQ 847 Query: 916 PTIXXXXXXXXXXXXXFNSADNSQPSSIKHPP-GRVPHENASG--------AMVTPG--- 1059 + PS + PP G VP A A V PG Sbjct: 848 QR-------------------PAAPSIFQAPPPGAVPQTQAPTQFRPPMFKAEVPPGGIP 888 Query: 1060 LSGP---FPR-PDNMGYYQASM-PPYQAGQPQNPAG------EPFGGSSFAAQRPGALDS 1206 +SGP F R P + G +Q S PP A PQ P P GG + DS Sbjct: 889 VSGPAASFGRGPGHNGPHQHSFEPPLVA--PQGPYNLGHLHPSPVGGPPQRSVPLSGFDS 946 Query: 1207 HVGVRERE------PADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSLDRAAYGPV 1368 HVG P D +Q P PME E F QRPG DGR+ +S GS R+ GP Sbjct: 947 HVGTMVGPAYGPGGPMDLKQ-PSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQRSPLGPP 1005 Query: 1369 QPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPR 1521 G + M++ G P + +RDER FP+ RL P EFE+D + Sbjct: 1006 S-GTRSNMMRMNGGPGSE------LRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLK 1058 Query: 1522 KFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSA 1701 +F R SH +A P KLG+ F S P D GPH Y D RPFE+ G DPGLK+D Sbjct: 1059 
QFSRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGPRPFER---GLSYDPGLKLDPM 1115 Query: 1702 VGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGH-RADFSGPGAGPGYGRSRMDGF 1878 S P RFLP +H DD+ GR D H DF PG YGR M G Sbjct: 1116 GASAPSRFLPAYH--------------DDAAGRSDSSHAHPDFPRPGRA--YGRRHMGGL 1159 Query: 1879 PPRSPGRDFPGLPSGTFGAFGD--------GGNSFP--AESFGKSIHDSRFPVPPNHLQR 2028 PRS R+F G G G+ G GG F + G S HDSRFPV P+HL+R Sbjct: 1160 SPRSSFREFCGF-GGLPGSLGGSRSVREDIGGREFRRFGDPIGNSFHDSRFPVLPSHLRR 1218 Query: 2029 GEIDVPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205 GE + PG R G Q+ LP+HLRR + +GP N+ +G+ T G P AR Sbjct: 1219 GEFEGPG--RTGDLIGQEFLPSHLRRGEPLGPHNLRLGE-TVGLGGFPGPARM------- 1268 Query: 2206 GESFGGEKPGH---PLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCR 2373 E GG PG+ P GEP ++GGFY +ME D+ RK KP + CR Sbjct: 1269 -EELGG--PGNFPPPRLGEPGFRSSFSHQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCR 1325 Query: 2374 ICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSS 2553 ICKV+C TV+GLDLHSQ+REHQ+ A DMVL D + RN + Sbjct: 1326 ICKVDCETVDGLDLHSQTREHQKMAMDMVLSIKQNAKKQKLTSGDRC-STDDANKSRNVN 1384 Query: 2554 FQGRGNK 2574 F GRG K Sbjct: 1385 FDGRGKK 1391 >ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus communis] gi|223540292|gb|EEF41863.1| hypothetical protein RCOM_0731250 [Ricinus communis] Length = 1329 Score = 274 bits (701), Expect = 1e-70 Identities = 270/919 (29%), Positives = 369/919 (40%), Gaps = 61/919 (6%) Frame = +1 Query: 1 HSQQPGYPFQHRPGVXXXXXXXXXXXXS----FAGH------------GPYMQPQQPTAA 132 H+QQPG P P + F G G YMQ Sbjct: 508 HAQQPGLPVHQLPVMQSVQQPIHQQYVQQQPPFPGQALGPVQNQVHQQGAYMQQH----L 563 Query: 133 HGHAXXXXXXXXXXXX----NYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXXXXXXXX 300 HGH+ N HG Q + AQ+ RP Sbjct: 564 HGHSQLRPQGPSHAYTQPLQNVPLPHGTQAHQAQNLGGRP----PYGVPTYPHPHSSVGM 619 Query: 301 XVRPPQLNQTYPS----RVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQNN 468 VRP Q+ S R NNQM +SEQ +G + SRPT D +++KS ++ Sbjct: 620 QVRPMQVGADQQSGNAFRANNQMQLSSEQPSGAI---SRPTS-NRQGDDIIEKSSEADSS 675 Query: 469 
PNKVAKNLVGGSGAGVLMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPS 648 K V + ++ S L S +S KP++ D ++ + E Sbjct: 676 SQK-----------NVRRDPNDLDVASGLGSDVSDLKTVISESNLKPVDDD-NKSINEVK 723 Query: 649 PGSKSTKIGAEDHKDVRKKSEVQESKQTAKSGAPNMPQSNLSTQVHGTNASVDQGRNQLH 828 + K G +D KD+ E K K G P M L H + S+ R + Sbjct: 724 ---EEPKKGNDDQKDISNTDNDAEDKGV-KDG-PVMKNRPLPEAEHLEDQSMKSQRGRNV 778 Query: 829 PIQYGPSVQLR-----PGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNSQPS 993 Q+ L G S S P +Q QP Sbjct: 779 TPQHSGGFILHGQVQGEGLAQPSHSIPIAEQGKQ-----------------------QPP 815 Query: 994 SIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQ-PQNP----AGEP 1158 I H P + +++T G G+ A + P G P P AG Sbjct: 816 VIPHGPSALQQRPIGSSLLTAPPPGSLHHGQIPGHPSARVRPLGPGHIPHGPEVSSAGMT 875 Query: 1159 FGGSSFAAQRPGALDSHVGVR------EREPADFEQRPPYPMENEKFPVQRPGSFDGRKP 1320 GS+ R G SH G++ P+ + R PY + + F QRP DG++ Sbjct: 876 GLGSTPITGRGG---SHYGLQGTYTQGHALPSQAD-RTPYGHDTDMFANQRPNYTDGKRL 931 Query: 1321 ESLPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP-- 1494 + L Q G+ AM++ G P DS SA G+RD+R PF +E + P Sbjct: 932 DPLGQ-----------QSGMHSNAMRMNGAPGMDSSSALGLRDDRFRPFSDEYMNPFPKD 980 Query: 1495 -------HREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEK 1653 REFE+D + F R S + ++K G F SS PLD GP +K Sbjct: 981 PSQRIVDRREFEEDLKHFSRPSDLDTQSTTKFGANFSSSRPLDRGP-----------LDK 1029 Query: 1654 APHGFERDPGLKMDSAVGSGPLRFLPPFH------PNDVGERGRPPTFPDDSMGRGDFGH 1815 HG D G+K++S G P RF PP+H PND+ ER F D+++GR Sbjct: 1030 GLHGPNYDSGMKLESLGGPPPSRFFPPYHHDGLMHPNDIAERSIG--FHDNTLGRQPDSV 1087 Query: 1816 RA--DFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFG--DGGNSFPAESFGKS 1983 RA +F GPG Y R DG PRSPGRD+PG+ S FGA D + + FG S Sbjct: 1088 RAHPEFFGPGRR--YDRRHRDGMAPRSPGRDYPGVSSRGFGAIPGLDDIDGRESRRFGDS 1145 Query: 1984 IHDSRFPVPPNHLQRGEIDVPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRM 2160 H SRFPV P+H++ GE + P +QD NH RR + +G NM + R+ Sbjct: 1146 FHGSRFPVLPSHMRMGEFEGP---------SQDGFSNHFRRGEHLGHHNM------RNRL 1190 Query: 2161 GEPPLARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDP 2340 GEP FP G+ G +P GEP 
+GG Y E+E FD+ Sbjct: 1191 GEPIGFGAFPGPAGMGDLSGTGNFFNPRLGEPGFRSSFSFKGFPGDGGIYAGELESFDNS 1250 Query: 2341 RKWKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATF 2517 R+ K + CRICKV+C TVEGLDLHSQ+REHQ++A DMV+ + Sbjct: 1251 RRRKSSSMGWCRICKVDCETVEGLDLHSQTREHQKRAMDMVVTIKQNAKKQKLANNDHS- 1309 Query: 2518 EGRDGGRPRNSSFQGRGNK 2574 D + +N+S +GRGNK Sbjct: 1310 SVDDASKSKNTSIEGRGNK 1328 >ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa] gi|550331020|gb|ERP56830.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa] Length = 1315 Score = 271 bits (692), Expect = 1e-69 Identities = 263/914 (28%), Positives = 358/914 (39%), Gaps = 56/914 (6%) Frame = +1 Query: 1 HSQQPGYPFQHRPGVXXXXXXXXXXXXS----FAGH------------GPYMQPQQ---- 120 H+ QPG P Q RPG+ F+G GPY+Q QQ Sbjct: 500 HAHQPGLPVQQRPGMQPTPQPMHQQYAQHQQPFSGQPWGAVHNQAHQQGPYVQQQQLHPL 559 Query: 121 ----PTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXXXX 288 P N HGA + A+S A P Sbjct: 560 TQLRPQGLPQSFQQPSHAYPHPQQNVLLPHGAHPHQAKSLAVGP------GLPAQSYPQS 613 Query: 289 XXXXXVRPPQLNQTYPS----RVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMH 456 VR Q+ S + NNQ+ +S+QQ+G + R D+E K Sbjct: 614 ASGMQVRSIQIGANQQSGNILKTNNQVELSSDQQSG-VSSRQRQGDIE--------KGAE 664 Query: 457 DQNNPNKVAKNLVGGSGAGVLMNEAKMNA-ESTLDSGFDANDNKVSGMGSKPLESDASEG 633 + + K K + AG+ + ++M +S D + NK +G ES Sbjct: 665 GELSAQKTIKKELNDLDAGLAADASEMKTIKSESDLKQVDDKNKPTGEAKDVPES----- 719 Query: 634 VLEPSPGSKSTKIGAEDHKDVRKKSEVQESKQTAKSGAPNMPQSNLSTQVHGTNASVDQG 813 L + G S K E+H+D +Q S A + + LS H ++ Sbjct: 720 -LAAANGESSIKQVKEEHRD-------GADEQNDVSNADH-EKVELSVSEHKDGPLLETA 770 Query: 814 RNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNS--Q 987 + L P + S P+ Q ++ D + Sbjct: 771 PSHLEEQIMKLQKDKTPTSQSFGGFPPNGHVQSQSV---------------SAVDQGKLE 815 Query: 988 PSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFGG 1167 P I H P ++V GP G+ PP Q G+ Sbjct: 816 PLPIHHGPSAAQQRPVGPSLVQASPLGPPHHMQLPGH-----PPTQHGR----------- 859 Query: 1168 
SSFAAQRPGALDSHVGVRE-------REPADFEQRPPYPMENEKFPVQRPGSFDGRKPES 1326 PG + SH G + P+ E+ P + E F QRP DGR+ Sbjct: 860 -----LGPGHVPSHYGPPQGAYPHAPAPPSQGERTPSHVHEATMFANQRPKYPDGRQ--- 911 Query: 1327 LPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIPHR-E 1503 G + + G +S + DE PFP H+ E Sbjct: 912 ----------------GTYSNVVGMNGAQGPNSDRFSSLPDEHLNPFPRGPAHHNVHQGE 955 Query: 1504 FEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPG 1683 FE+D + FPR SH + P K + FPSS PLD GP + DG RP +K HGF D G Sbjct: 956 FEEDLKHFPRPSHLDTEPVPKSSSHFPSSRPLDRGPRGFGVDGAPRPLDKGSHGFNYDSG 1015 Query: 1684 LKMDSAVGSGPLRFLPPFHPNDV---GERGRPPTFPDDSMGRGDFGH-RADFSGPGAGPG 1851 L M+ GS P RF PP+H + + + D GR DF R F GP PG Sbjct: 1016 LNMEPLGGSAPPRFFPPYHHDKALHPSDAEVSLGYHDSLAGRSDFARTRPGFLGPPI-PG 1074 Query: 1852 YGRSRMDGFPPRSPGRDFPGLPSGTFGAF-------GDGGNSFPAESFGKSIHDSRFPVP 2010 Y MD PRSP RD+PG+P+ FGA G + F + F S+ DSRFPV Sbjct: 1075 YDHRHMDNLAPRSPVRDYPGMPTRRFGALPGLDDIDGRDPHRF-GDKFSSSLRDSRFPVF 1133 Query: 2011 PNHLQRGEIDVPGNLRVGGPRNQDML-----PNHLRR-DLVGPRNMHMGDPTKPRMGEPP 2172 P+HL+RGE++ PGNL +G + D++ P HLRR + +GPRN+ P+ +GEP Sbjct: 1134 PSHLRRGELEGPGNLHMGEHLSGDLMGHDGRPAHLRRGEHLGPRNL----PSHLWVGEPG 1189 Query: 2173 LARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWK 2352 FP H GE G H GEP GG Y +++ FD+ RK K Sbjct: 1190 NFGAFPGHARMGELAGPGNFYHHQLGEPGFRSSF--------GGNYAGDLQFFDNSRKRK 1241 Query: 2353 PVGIMCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDG 2532 P CRICKV+C TVE LDLHSQ+REHQ+ A DMV+ + D Sbjct: 1242 PSMGWCRICKVDCETVEALDLHSQTREHQKMALDMVVTIKQNAKKHKSTPCHHS-SLEDK 1300 Query: 2533 GRPRNSSFQGRGNK 2574 + RN+SF+GRGNK Sbjct: 1301 SKSRNASFEGRGNK 1314 >gb|EOY33856.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 975 Score = 261 bits (667), Expect = 1e-66 Identities = 270/907 (29%), Positives = 356/907 (39%), Gaps = 83/907 (9%) Frame = +1 Query: 103 YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282 Y QPQQ A HA P HG Q AA Sbjct: 178 
YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 220 Query: 283 XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462 V+P L PS N + + Q +G QP ++ D+ V + D Sbjct: 221 ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 275 Query: 463 NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603 ++P K ++ GA V N AK+ A+ T D G D+N +S + Sbjct: 276 SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 335 Query: 604 KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717 ES + G +P + T ED KDV K +Q Sbjct: 336 P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 393 Query: 718 ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813 E+K Q K G P P N S+QV +VDQG Sbjct: 394 EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 453 Query: 814 RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984 R+Q + YG + Q RP ++ ++ P P ++Q P + +N Sbjct: 454 RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 513 Query: 985 QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164 P S P G GP+ N G PP +G P+ GEP Sbjct: 514 PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 551 Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344 G S+ A DSH P+ P S + ++ Sbjct: 552 GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 589 Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500 D P G+ DS S +R ER P +E + P HR Sbjct: 590 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 636 Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680 +FE+D + FPR SH + P K G+ SS PLD GPH + D R EK PHGF DP Sbjct: 637 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 696 Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860 +GSGP RFLPP+HP+D GE RP P D++GR DF G P YGR Sbjct: 697 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 740 Query: 1861 
SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040 RMDGF RSPGR++PG+ FG G G+ + RFP P HL RG + Sbjct: 741 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 795 Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205 + +LR NQD P + RR + VG NM P R+GEP +F H Sbjct: 796 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 851 Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCR 2373 GE FGG PG HP GEP ++GG Y M+ F++ RK KP+ + CR Sbjct: 852 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCR 908 Query: 2374 ICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSS 2553 ICK++C TVEGLDLHSQ+REHQ+ A DMV+ + D + +N Sbjct: 909 ICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQNAKKQKLTSSDHSIR-NDTSKSKNVK 967 Query: 2554 FQGRGNK 2574 F+GR NK Sbjct: 968 FEGRVNK 974 >gb|EOY33851.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786596|gb|EOY33852.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786597|gb|EOY33853.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1408 Score = 261 bits (667), Expect = 1e-66 Identities = 270/907 (29%), Positives = 356/907 (39%), Gaps = 83/907 (9%) Frame = +1 Query: 103 YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282 Y QPQQ A HA P HG Q AA Sbjct: 611 YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 653 Query: 283 XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462 V+P L PS N + + Q +G QP ++ D+ V + D Sbjct: 654 ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 708 Query: 463 NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603 ++P K ++ GA V N AK+ A+ T D G D+N +S + Sbjct: 709 SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 768 Query: 604 KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717 ES + G +P + T ED KDV K +Q Sbjct: 769 P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 826 Query: 718 ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813 E+K Q 
K G P P N S+QV +VDQG Sbjct: 827 EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 886 Query: 814 RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984 R+Q + YG + Q RP ++ ++ P P ++Q P + +N Sbjct: 887 RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 946 Query: 985 QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164 P S P G GP+ N G PP +G P+ GEP Sbjct: 947 PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 984 Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344 G S+ A DSH P+ P S + ++ Sbjct: 985 GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 1022 Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500 D P G+ DS S +R ER P +E + P HR Sbjct: 1023 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 1069 Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680 +FE+D + FPR SH + P K G+ SS PLD GPH + D R EK PHGF DP Sbjct: 1070 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 1129 Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860 +GSGP RFLPP+HP+D GE RP P D++GR DF G P YGR Sbjct: 1130 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 1173 Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040 RMDGF RSPGR++PG+ FG G G+ + RFP P HL RG + Sbjct: 1174 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 1228 Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205 + +LR NQD P + RR + VG NM P R+GEP +F H Sbjct: 1229 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 1284 Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCR 2373 GE FGG PG HP GEP ++GG Y M+ F++ RK KP+ + CR Sbjct: 1285 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCR 1341 Query: 2374 ICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSS 2553 ICK++C TVEGLDLHSQ+REHQ+ A DMV+ + D + +N Sbjct: 1342 
ICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQNAKKQKLTSSDHSIR-NDTSKSKNVK 1400 Query: 2554 FQGRGNK 2574 F+GR NK Sbjct: 1401 FEGRVNK 1407 >gb|EOY33857.1| Uncharacterized protein isoform 8 [Theobroma cacao] Length = 972 Score = 260 bits (664), Expect = 3e-66 Identities = 270/907 (29%), Positives = 355/907 (39%), Gaps = 83/907 (9%) Frame = +1 Query: 103 YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282 Y QPQQ A HA P HG Q AA Sbjct: 178 YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 220 Query: 283 XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462 V+P L PS N + + Q +G QP ++ D+ V + D Sbjct: 221 ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 275 Query: 463 NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603 ++P K ++ GA V N AK+ A+ T D G D+N +S + Sbjct: 276 SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 335 Query: 604 KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717 ES + G +P + T ED KDV K +Q Sbjct: 336 P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 393 Query: 718 ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813 E+K Q K G P P N S+QV +VDQG Sbjct: 394 EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 453 Query: 814 RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984 R+Q + YG + Q RP ++ ++ P P ++Q P + +N Sbjct: 454 RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 513 Query: 985 QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164 P S P G GP+ N G PP +G P+ GEP Sbjct: 514 PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 551 Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344 G S+ A DSH P+ P S + ++ Sbjct: 552 GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 589 Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500 D P G+ DS S +R ER P +E + P HR Sbjct: 590 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 636 Query: 1501 
EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680 +FE+D + FPR SH + P K G+ SS PLD GPH + D R EK PHGF DP Sbjct: 637 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 696 Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860 +GSGP RFLPP+HP+D GE RP P D++GR DF G P YGR Sbjct: 697 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 740 Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040 RMDGF RSPGR++PG+ FG G G+ + RFP P HL RG + Sbjct: 741 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 795 Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205 + +LR NQD P + RR + VG NM P R+GEP +F H Sbjct: 796 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 851 Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCR 2373 GE FGG PG HP GEP ++GG Y M+ F++ RK KP+ + CR Sbjct: 852 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCR 908 Query: 2374 ICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSS 2553 ICK++C TVEGLDLHSQ+REHQ+ A DMV+ D + +N Sbjct: 909 ICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQNAKKQKLDHSIR----NDTSKSKNVK 964 Query: 2554 FQGRGNK 2574 F+GR NK Sbjct: 965 FEGRVNK 971 >ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa] gi|222845587|gb|EEE83134.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa] Length = 1327 Score = 252 bits (643), Expect = 7e-64 Identities = 255/854 (29%), Positives = 340/854 (39%), Gaps = 27/854 (3%) Frame = +1 Query: 94 HGPYMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXX 273 HGP QQP+ A+ H N GA + AQS A Sbjct: 568 HGPVQSFQQPSHAYPHPQQ----------NVPLPRGAHPHQAQSLAVGTGVSPHGVLSVQ 617 Query: 274 XXXXXXXXXXVRPPQLNQTYPS----RVNNQMSAASEQQAGHLQQP--SRPTDVENPRDQ 435 RP Q+ S + NNQ+ +SEQQA +P R D+E + Sbjct: 618 SYPQSTAVMQARPVQIGANQQSGNILKTNNQVEFSSEQQAWVASRPISERQGDIEKGAEG 677 Query: 436 VVDKSMHDQNNPNKVAKNLVGGSGAGVLMNEAK-MNAESTLDSGFDANDNKVSGMGSKPL 612 + S H N K L G GA +E K + +ES L 
D +NK +G Sbjct: 678 --ESSAH--NTIKKELNELDAGLGASA--SEMKTIKSESDLKQVDD--ENKPTG------ 723 Query: 613 ESDASEGVLEPSPGSKSTKIGAEDHKDVRKKSEVQESKQTAKSGAPNMPQSNLSTQVHGT 792 E+ G + G S K EDH+DV K + + K + +LS + G Sbjct: 724 EAKDIPGAPAAANGEPSIKQVKEDHRDVTDKQKDISNADQKKV------ELSLSEYMDGK 777 Query: 793 NASVDQGRNQLHPIQYGPSVQLRPGAVSMSKSTP--HPFNSQQPTIXXXXXXXXXXXXXF 966 + ++ PS S TP F P Sbjct: 778 DGL---------SLETAPSHLEEQSKKSQKDKTPTSQGFGGFPP---------------- 812 Query: 967 NSADNSQPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASM-PPYQAGQPQN 1143 N SQP S+ V + G + RP + QA PP+ P + Sbjct: 813 NGHMQSQPVSV------VDQGKLHPLPIHQGPAALQQRPVGPSWLQAPHGPPHHMQLPGH 866 Query: 1144 PAGEPFGGSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPE 1323 P S PG + SH G + + P E V F ++P Sbjct: 867 PP------SHHGRLPPGHMPSHYGPPQ---GPYTHAPTSQGERTSSYVHETSMFGNQRP- 916 Query: 1324 SLPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIPHR- 1500 S P G + G+ A+ G +S DE PFP + +R H+ Sbjct: 917 SYPGG----------RQGILSNAVGTNGAQDPNSDRFRSFPDEHLNPFPHDPARRNAHQG 966 Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680 EFE+D + F S + P K G F SS PLD GPH + DG + +K HG D Sbjct: 967 EFEEDLKHFTAPSCLDTKPVPKSGGHFSSSRPLDRGPHGFGVDGAPKHLDKGSHGLNYDS 1026 Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPP---TFPDDSMGRGDFGH-RADFSGPGAGP 1848 GL ++ GS P RF PP H + R F D+ GR DF R GP P Sbjct: 1027 GLNVEPLGGSAPPRFFPPIHHDRTLHRSEAEGSLGFHDNLAGRTDFARTRPGLLGPPM-P 1085 Query: 1849 GYGRSRMDGFPPRSPGRDFPGLPSGTFGA---FGDGGNSFPAES---FGKSIHDSRFPVP 2010 GY MD PRSPGRD+PG+ FGA D P S S+HDSRFP+ Sbjct: 1086 GYDHRDMDNLAPRSPGRDYPGMSMQRFGALPGLDDIDGRAPQRSSDPITSSLHDSRFPLF 1145 Query: 2011 PNHLQRGEIDVPGNLRVGGPRNQDML-----PNHLRR-DLVGPRNMHMGDPTKPRMGEPP 2172 P+HL+RGE++ PGN +G + D++ P HLRR + +GPRN P+ R+GE Sbjct: 1146 PSHLRRGELNGPGNFHMGEHLSGDLMGHDGWPAHLRRGERLGPRN----PPSHLRLGERG 1201 Query: 2173 LARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWK 2352 +FP H GE G H GEP GG Y +++ ++ RK K Sbjct: 1202 GFGSFPGHARMGELAGPGNLYHQQLGEPGFRSSF--------GGSYAGDLQYSENSRKRK 1253 Query: 
2353 PVGIMCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDG 2532 CRICKV+C T EGLDLHSQ+REHQ+ A DMV+ + D Sbjct: 1254 SSMGWCRICKVDCETFEGLDLHSQTREHQKMAMDMVVTIKQNVKKHKSAPSDHS-SLEDT 1312 Query: 2533 GRPRNSSFQGRGNK 2574 + RN+SF+GRGNK Sbjct: 1313 SKLRNASFEGRGNK 1326 >gb|EMJ06149.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica] Length = 1334 Score = 236 bits (603), Expect = 3e-59 Identities = 220/747 (29%), Positives = 300/747 (40%), Gaps = 43/747 (5%) Frame = +1 Query: 349 NQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQNNPNKVAKNLVGGSGAGVLMNE 528 NQ + G S PT E +Q + Q N KV ++ G+ + V+ + Sbjct: 643 NQNNMVRTNNLGQSGANSGPTTSERQAEQ--ESEFSAQQNAKKVVHDV--GTASAVVADA 698 Query: 529 AKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTKIGAEDHKDVRKKS 708 A+S D N+NK +G K ++ D S + P + + G K + K+ Sbjct: 699 EVKTAKSETDMKSIDNENKPTGE-DKTIQGDTSSKEI---PDIHALENGESVSKSILKEE 754 Query: 709 EVQESKQTAKSGAPNMPQSNLSTQVHGTNASV--DQGRNQLHPIQYGPSVQLRPGAVSMS 882 V + + +M Q L ++ A + +QG P + S + Sbjct: 755 GVDGTLDHSNVSISDMKQRELK-EIPSEEAQLREEQGWMLQKDASGDPQPFIGTDEGSQA 813 Query: 883 KSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNSQPSSIKHPPGRVPHENASGAMVTPGL 1062 ST P + Q + P ++ PPG H G + P Sbjct: 814 VSTSAPISDQGKHLPHHGPTTLPQRP-------GAPLLLQVPPGPPCHTQGPGHHLRP-- 864 Query: 1063 SGPFPRPDNMGYYQASMPPYQAGQPQNPAGEP--FGGSSFAAQRPGALDSHVGVREREPA 1236 GP P P+ + + P G FG SS A + G S Sbjct: 865 PGPAHVPGQ---------PFHSSEHFQPHGGNLGFGASSGRASQYGPQGS---------I 906 Query: 1237 DFEQRPPYPMENE-KFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPGVQLGAMKIGGPP 1413 + + P+ NE P+ +FD G + RAA G+ +++ G P Sbjct: 907 ELQSVTPHGPYNEGHLPLPPTSAFDSHG------GMMSRAAPIGQPSGIHPNMLRMNGTP 960 Query: 1414 AHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPRKFPRSSHFEAGPSSK 1566 DS S G RDER FP ERL P EFEDD ++FPR S+ ++ P +K Sbjct: 961 GLDSSSTHGPRDERFKAFPGERLNPFPVDPTRHVIDRVEFEDDLKQFPRPSYLDSEPVAK 1020 Query: 1567 LGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPF--- 1737 G SRPF++APHGF+ D G D G+ P RFL P+ Sbjct: 1021 FGNY------------------SSRPFDRAPHGFKYDSGPHTDPLAGTAPSRFLSPYRLG 1062 Query: 1738 
---HPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGRSRMDGFPPRSPGRDFP 1908 H ND G+ GR + + G DF GR +DG PRSP RD+P Sbjct: 1063 GSVHGNDAGDFGRM----EPTHGHPDF--------------VGRRLVDGLAPRSPVRDYP 1104 Query: 1909 GLPSGTFGAFGDG---GNSFP--AESFGKSIHDSRFPVPPNHLQRGEIDVPGNLRVGGPR 2073 GLP F FG G F + G H+ RF P H +RGE + PGNLR+ R Sbjct: 1105 GLPPHGFRGFGPDDFDGREFHRFGDPLGNQFHEGRFSNLPGHFRRGEFEGPGNLRMVDHR 1164 Query: 2074 NQDML-----PNHLRR-DLVGPRNM-----------HMGDPTKPRMGEPPLARNFPQHLP 2202 D + P HLRR D +GP N+ HMGD P EP Sbjct: 1165 RNDFIGQDGHPGHLRRGDHLGPHNLREPLGFGSRHSHMGDMAGPGNFEP----------- 1213 Query: 2203 FGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCRIC 2379 F G +P HP GEP ++G Y ++E FD RK KP + CRIC Sbjct: 1214 ----FRGNRPNHPRLGEPGFRSSFSLQRFPNDGT-YTGDLESFDHSRKRKPASMGWCRIC 1268 Query: 2380 KVECGTVEGLDLHSQSREHQRKARDMV 2460 KV+C TVEGLDLHSQ+REHQ+ A DMV Sbjct: 1269 KVDCETVEGLDLHSQTREHQKMAMDMV 1295 >ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214768 [Cucumis sativus] Length = 1177 Score = 223 bits (569), Expect = 3e-55 Identities = 225/737 (30%), Positives = 310/737 (42%), Gaps = 51/737 (6%) Frame = +1 Query: 517 LMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTKIGAEDHKDV 696 L+ E K N E + + D ++ SK +++D S G PS G+ ++ GA + Sbjct: 502 LVIENKGNQE---EFKISSQDTELREEQSKRMQNDTS-GTPHPSSGTNESQQGATTTSSL 557 Query: 697 RKKSEVQESKQTAKSGAPNMPQSNLSTQVHGTNAS-----VDQGRNQLHPIQYGPSVQLR 861 S ++ + P PQ+ TQ+ S V R+Q P Y S L+ Sbjct: 558 ILGSPGMLNQHGYQDKNP--PQTG-GTQIGAAVTSHPASLVAHTRHQTPPSSYVSSA-LQ 613 Query: 862 PGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNSQP--SSIKHPPGRVPHENA 1035 G + S P P Q A QP S G +P E+ Sbjct: 614 HGVAAPSLPGPPPGPYHQAQFSNNPSMQVRPRAPGLVAHPGQPFNPSESFHLGGIP-ESG 672 Query: 1036 SGAMVTPGLSGPFPRP------DNMGYYQASMPPYQAGQPQNPAGEPFGGSSFAAQRPGA 1197 S + GL P+ + Y S P G + G+P G + F ++ PGA Sbjct: 673 SASSFGRGLGQYGPQQALERSIGSQATYSLSQPSASQGGSKMSLGDPVG-AHFRSKLPGA 731 Query: 1198 LDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPG 1377 DS + E QRP +P+E E F QRP D P ++ H + P G Sbjct: 732 
FDSRGLLHAPEAQIGVQRPIHPLEAEIFSNQRP-RLDSHLPGTMEH-------HPPHLTG 783 Query: 1378 VQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPRKFP 1530 + + + G P DS S G+RDER EE+L P + ED R+FP Sbjct: 784 IPPNVLPLNGAPGPDSSSKLGLRDERFKLLHEEQLNSFPLDPARRPINQTDAEDILRQFP 843 Query: 1531 RSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGS 1710 R SH E+ + ++G RPF++ HG D GL +D A S Sbjct: 844 RPSHLESELAQRIGNY------------------SLRPFDRGVHGQNFDTGLTIDGAAAS 885 Query: 1711 GPLRFLPPFH------PNDVGERGRPPTFPDDSMGRGDF--GHRADFSGPGAGPGYGRSR 1866 R LPP H P D RP F +DS G+ D GH +DF PG+ YGR Sbjct: 886 ---RVLPPRHIGGALYPTDAE---RPIAFYEDSTGQADRSRGH-SDFPAPGS---YGRRF 935 Query: 1867 MDGFPPRSPGRDFPGLPSGTFGAFGD---GGNSFPAESFGK--SIHDSRFPVPPNHLQRG 2031 +DGF PRSP ++ G G G G G FP FG S +SRFP+ +HLQRG Sbjct: 936 VDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFP-HHFGDPLSFRESRFPIFRSHLQRG 994 Query: 2032 EIDVPGNLRVG---------------GPRNQDMLPNHLRRDLVGPRNMHMGDPTKPRMGE 2166 + + GN R+ GPR+ LP HLR +G P R+G+ Sbjct: 995 DFESSGNFRMSEHLRTGDLIGQDRHFGPRS---LPGHLR---LGELTAFGSHPGHSRIGD 1048 Query: 2167 PPLARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRK 2346 + NF PFG GG +P +P GEP +G F+ ++E FD+ RK Sbjct: 1049 LSVLGNFE---PFG---GGHRPNNPRLGEPGFRSSFSRQGLVDDGRFFAGDVESFDNSRK 1102 Query: 2347 WKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEG 2523 KP+ + CRICKV+C TVEGL+LHSQ+REHQ+ A DMV + E Sbjct: 1103 RKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIKQNAKKHKVTPNDHSSE- 1161 Query: 2524 RDGGRPRNSSFQGRGNK 2574 G+ +N + RG K Sbjct: 1162 --DGKSKNVGLESRGKK 1176 >ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205914 [Cucumis sativus] Length = 1434 Score = 223 bits (569), Expect = 3e-55 Identities = 225/737 (30%), Positives = 310/737 (42%), Gaps = 51/737 (6%) Frame = +1 Query: 517 LMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTKIGAEDHKDV 696 L+ E K N E + + D ++ SK +++D S G PS G+ ++ GA + Sbjct: 759 LVIENKGNQE---EFKISSQDTELREEQSKRMQNDTS-GTPHPSSGTNESQQGATTTSSL 814 Query: 697 RKKSEVQESKQTAKSGAPNMPQSNLSTQVHGTNAS-----VDQGRNQLHPIQYGPSVQLR 
861 S ++ + P PQ+ TQ+ S V R+Q P Y S L+ Sbjct: 815 ILGSPGMLNQHGYQDKNP--PQTG-GTQIGAAVTSHPASLVAHTRHQTPPSSYVSSA-LQ 870 Query: 862 PGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNSQP--SSIKHPPGRVPHENA 1035 G + S P P Q A QP S G +P E+ Sbjct: 871 HGVAAPSLPGPPPGPYHQAQFSNNPSMQVRPRAPGLVAHPGQPFNPSESFHLGGIP-ESG 929 Query: 1036 SGAMVTPGLSGPFPRP------DNMGYYQASMPPYQAGQPQNPAGEPFGGSSFAAQRPGA 1197 S + GL P+ + Y S P G + G+P G + F ++ PGA Sbjct: 930 SASSFGRGLGQYGPQQALERSIGSQATYSLSQPSASQGGSKMSLGDPVG-AHFRSKLPGA 988 Query: 1198 LDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPG 1377 DS + E QRP +P+E E F QRP D P ++ H + P G Sbjct: 989 FDSRGLLHAPEAQIGVQRPIHPLEAEIFSNQRP-RLDSHLPGTMEH-------HPPHLTG 1040 Query: 1378 VQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPRKFP 1530 + + + G P DS S G+RDER EE+L P + ED R+FP Sbjct: 1041 IPPNVLPLNGAPGPDSSSKLGLRDERFKLLHEEQLNSFPLDPARRPINQTDAEDILRQFP 1100 Query: 1531 RSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGS 1710 R SH E+ + ++G RPF++ HG D GL +D A S Sbjct: 1101 RPSHLESELAQRIGNY------------------SLRPFDRGVHGQNFDTGLTIDGAAAS 1142 Query: 1711 GPLRFLPPFH------PNDVGERGRPPTFPDDSMGRGDF--GHRADFSGPGAGPGYGRSR 1866 R LPP H P D RP F +DS G+ D GH +DF PG+ YGR Sbjct: 1143 ---RVLPPRHIGGALYPTDAE---RPIAFYEDSTGQADRSRGH-SDFPAPGS---YGRRF 1192 Query: 1867 MDGFPPRSPGRDFPGLPSGTFGAFGD---GGNSFPAESFGK--SIHDSRFPVPPNHLQRG 2031 +DGF PRSP ++ G G G G G FP FG S +SRFP+ +HLQRG Sbjct: 1193 VDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFP-HHFGDPLSFRESRFPIFRSHLQRG 1251 Query: 2032 EIDVPGNLRVG---------------GPRNQDMLPNHLRRDLVGPRNMHMGDPTKPRMGE 2166 + + GN R+ GPR+ LP HLR +G P R+G+ Sbjct: 1252 DFESSGNFRMSEHLRTGDLIGQDRHFGPRS---LPGHLR---LGELTAFGSHPGHSRIGD 1305 Query: 2167 PPLARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRK 2346 + NF PFG GG +P +P GEP +G F+ ++E FD+ RK Sbjct: 1306 LSVLGNFE---PFG---GGHRPNNPRLGEPGFRSSFSRQGLVDDGRFFAGDVESFDNSRK 1359 Query: 2347 WKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEG 2523 KP+ + CRICKV+C TVEGL+LHSQ+REHQ+ A DMV + E Sbjct: 1360 
RKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIKQNAKKHKVTPNDHSSE- 1418 Query: 2524 RDGGRPRNSSFQGRGNK 2574 G+ +N + RG K Sbjct: 1419 --DGKSKNVGLESRGKK 1433 >emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera] Length = 1131 Score = 221 bits (562), Expect = 2e-54 Identities = 188/582 (32%), Positives = 248/582 (42%), Gaps = 28/582 (4%) Frame = +1 Query: 802 VDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADN 981 +D GR+Q P+QYGP+VQ RP A S ++ P P + A Sbjct: 591 LDGGRHQPPPMQYGPTVQQRPAAPSSGQAMPPPGLVHNAPVPGQPSTQLQP-----QALG 645 Query: 982 SQPSSIKHPPGRVPHENASGAMVTPGLSGPF----------------PRPDNMGYY-QAS 1110 P + G HE G ++ PG + F P + G+Y Q Sbjct: 646 LLPHPAQQSRGSFHHEIPPGGILGPGSAASFGRGLSHFAPPQRSFEPPSVVSQGHYNQGH 705 Query: 1111 MPPYQAGQPQNPAGEPFGGSSFAAQRPGALDSHVGVREREPA---DFEQRPPYPMENEKF 1281 P AG + GE G G+ DSH G+ R P D +QRP P+E+E F Sbjct: 706 GLPSHAGPSRISQGELIGRPPLGPLPAGSFDSHGGMMVRAPPHGPDGQQRPVNPVESEIF 765 Query: 1282 PVQRPGSFDGRKPESLPHGSLDRAAYGPVQP-GVQLGAMKIGGPPAHDSMSAPGMRDERG 1458 RP FDGR+ +S GS +R +G QP G Q M++ G +S G++DER Sbjct: 766 SNPRPNYFDGRQSDSHIPGSSERGPFG--QPSGXQSNMMRMNGGLGIESSLPVGLQDERF 823 Query: 1459 MPFPEERLKRIPHREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPS 1638 PE + H +F +D ++F RSSH ++ K G F SS PLD G + D Sbjct: 824 KSLPEPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFGNYFSSSRPLDRGSQGFVMDAAQ 883 Query: 1639 RPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHR 1818 +KAP GF D G K S+ G+G R +D+ DD GR Sbjct: 884 GLLDKAPLGFNYDSGFK--SSAGTGTSR------QSDL----------DDIDGR------ 919 Query: 1819 ADFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSR 1998 G GY + R FP LPS R Sbjct: 920 ---ESRRFGEGYQTFNLPSDESR-----FPVLPS-----------------------HLR 948 Query: 1999 FPVPPNHLQRGE----IDVPGNLRVGGPRNQDMLPNHLRRDLVGPRNMHMGDPTKPRMGE 2166 + P+HLQRGE ++PG LR G P L + PRMGE Sbjct: 949 RDILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFLGH-------------------PRMGE 989 Query: 2167 PPLARNFPQHLPFGESFGG-EKPGHPLAGEPXXXXXXXXXXXXHEGGFY-PDEMEPFDDP 2340 NFP L GESFGG K GHP GEP ++ GF P +ME FD+ Sbjct: 990 
LSGPGNFPSRLSAGESFGGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNS 1049 Query: 2341 RKWKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMVL 2463 RK KP+ + CRIC ++C TV+GLD+HSQ+REHQ+ A D+VL Sbjct: 1050 RKRKPLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMDIVL 1091 >ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227701 [Cucumis sativus] Length = 538 Score = 220 bits (561), Expect = 2e-54 Identities = 179/530 (33%), Positives = 239/530 (45%), Gaps = 38/530 (7%) Frame = +1 Query: 1099 YQASMPPYQAGQPQNPAGEPFGGSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEK 1278 Y S P G + G+P G + F ++ PGA DS + E QRP +P+E E Sbjct: 61 YSLSQPSASQGGSKMSLGDPVG-AHFRSKLPGAFDSRGLLHAPEAQIGVQRPIHPLEAEI 119 Query: 1279 FPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERG 1458 F QRP D P ++ H + P G+ + + G P DS S G+RDER Sbjct: 120 FSNQRP-RLDSHLPGTMEH-------HPPHLTGIPPNVLPLNGAPGPDSSSKLGLRDERF 171 Query: 1459 MPFPEERLKRIP---------HREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGP 1611 EE+L P + ED R+FPR SH E+ + ++G Sbjct: 172 KLLHEEQLNSFPLDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY----------- 220 Query: 1612 HVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFH------PNDVGERGRPP 1773 RPF++ HG D GL +D A S R LPP H P D RP Sbjct: 221 -------SLRPFDRGVHGQNFDTGLTIDGAAAS---RVLPPRHIGGALYPTDAE---RPI 267 Query: 1774 TFPDDSMGRGDF--GHRADFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFGD- 1944 F +DS G+ D GH +DF PG+ YGR +DGF PRSP ++ G G G G Sbjct: 268 AFYEDSTGQADRSRGH-SDFPAPGS---YGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVE 323 Query: 1945 --GGNSFPAESFGK--SIHDSRFPVPPNHLQRGEIDVPGNLRVG---------------G 2067 G FP FG S +SRFP+ +HLQRG+ + GN R+ G Sbjct: 324 EIDGQDFP-HHFGDPLSFRESRFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQDRHFG 382 Query: 2068 PRNQDMLPNHLRRDLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPFGESFGGEKPGHPLA 2247 PR+ LP HLR +G P R+G+ + NF PFG GG +P +P Sbjct: 383 PRS---LPGHLR---LGELTAFGSHPGHSRIGDLSVLGNFE---PFG---GGHRPNNPRL 430 Query: 2248 GEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCRICKVECGTVEGLDLHSQ 2424 GEP +G F+ ++E FD+ RK KP+ + CRICKV+C TVEGL+LHSQ Sbjct: 431 GEPGFRSSFSRQGLVDDGRFFAGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQ 
490 Query: 2425 SREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSSFQGRGNK 2574 +REHQ+ A DMV + E G+ +N + RG K Sbjct: 491 TREHQKMAMDMVQSIKQNAKKHKVTPNDHSSE---DGKSKNVGLESRGKK 537 >gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis] Length = 1320 Score = 214 bits (545), Expect = 2e-52 Identities = 240/821 (29%), Positives = 321/821 (39%), Gaps = 72/821 (8%) Frame = +1 Query: 322 NQTYPSRVNNQMSAASEQQAG-------HLQQPSRPTDVENPRDQVVDKSMHDQNNPNKV 480 NQ + NNQM SE+ +G ++Q ++ + + +VV S + KV Sbjct: 584 NQNNILKTNNQMKLPSEEHSGANSTATMSIRQGNQDFVKGSAQQEVVASS----HKTVKV 639 Query: 481 AKNLVGGSGAGVLMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDAS-EGVLEPSPGS 657 N S +L N ++ E + D K + KP+ + E L+ S Sbjct: 640 GTNN-SDSVLDLLANVGEVKTEKS------KTDLKSTDPVVKPMMKEEDVESTLKNSSNG 692 Query: 658 KSTKIGAEDHKDVRK------KSEVQESKQTAKSGAPNMPQSNLSTQVHGTNASVDQ--- 810 KS K+ AED KDV K K+ E K S P + SV Sbjct: 693 KSGKVVAEDKKDVLKVEPEKMKNSTVEDKDVGGSLQKKSPLQAVERHEGQGGDSVKDAAS 752 Query: 811 GRNQLHPIQYGPSVQ-LRPGAVSMSKSTP---------HPFNSQQPTIXXXXXXXXXXXX 960 G ++ + PS Q LR A +P H P Sbjct: 753 GSDRASKVVPTPSAQILRSPASGGEVKSPYSRSVQVQGHQLPGPPPLSQVPPPGPPHKTQ 812 Query: 961 XFNSADN----SQPSSIKHPPGRVPHENASGAMVTPGLSGPFPR-PDNMGYYQAS--MPP 1119 F ++ P HPPG +P G + PF R P+ G Q S + Sbjct: 813 EFGASQTHCRPQVPGDPLHPPGSIP-----------GSAIPFGRGPNQYGPNQQSSELQS 861 Query: 1120 YQAGQPQNPA---------GEPFGGSSFAAQRPGALDSHVGVREREPADFEQRPPYPMEN 1272 +P NP GEP G S +P A +SH G+ R P P Sbjct: 862 LAPQRPYNPGPFGAFRLSQGEPTGAESSGVLQPRAFNSHGGMMAR---------PTPHGP 912 Query: 1273 EKFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDE 1452 E F QRP D R P+ GSL+ A+ G+ ++ DS+S G RDE Sbjct: 913 EMFSNQRPDFMDSRGPDPHFAGSLEHGAHSQ-SFGIHPNMTRMNDSHGFDSLSTLGPRDE 971 Query: 1453 RGMPFPEERLKRIPHREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDG 1632 R PFP P EFEDD ++FPR Sbjct: 972 RFNPFPAGPN---PRAEFEDDLKQFPR--------------------------------- 995 Query: 1633 PSRPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERG-RPPTFPDDSMGRGD- 1806 PF++ HG + GLKMDS VGS P R L P++ + G R D+ GR D Sbjct: 996 
---PFDRGLHGLKYHTGLKMDSGVGSVPSRSLSPYNGGGANDGGDRLGWHRGDAFGRMDP 1052 Query: 1807 -FGHRADFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPA------ 1965 GH DF GPG G Y R RMD RSP R+ PG+ G G G + Sbjct: 1053 TRGH-LDFLGPGLG--YDRRRMDSLASRSPIREHPGI--SLRGFVGPGPDDIHGRELRRF 1107 Query: 1966 -ESFGKSIHDSRFPVPPNHLQRGEIDVPGNLRVGGPRNQDMLPNHLRRDLVGPRNM---- 2130 E F S H+SRF + P HL+RGE + P N+ +G +HLR DL+G + Sbjct: 1108 GEPFDSSFHESRFSMLPGHLRRGEFEGPRNMGMG---------DHLRNDLIGRDGLSGPL 1158 Query: 2131 ----HMGD-PTKPRMGEPPLARNFPQHLPFGE--------SFG-GEKPGHPLAGEPXXXX 2268 HMGD +GEP +H E SFG G+ P P GEP Sbjct: 1159 RWGEHMGDFHGHFHLGEPVGFGAHSRHARIREIGGPGSFDSFGRGDGPSFPHLGEPGFRS 1218 Query: 2269 XXXXXXXXHEGGFYPDEMEPFDDPRKWK-PVGIMCRICKVECGTVEGLDLHSQSREHQRK 2445 G + +++ FD RK K P CRICKV+C TVEGL+LHSQ+REHQ+ Sbjct: 1219 RFSSHGFPTGDGIFTEDLA-FDKSRKRKLPTMGWCRICKVDCETVEGLELHSQTREHQKM 1277 Query: 2446 ARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSSFQGRG 2568 A DMV+ + G D +PR++ +G G Sbjct: 1278 AMDMVVAIKQNAKKQKLTFGDQSSLG-DASQPRSAGTEGHG 1317 >ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314450 [Fragaria vesca subsp. 
vesca] Length = 1316 Score = 209 bits (531), Expect = 7e-51 Identities = 226/765 (29%), Positives = 318/765 (41%), Gaps = 52/765 (6%) Frame = +1 Query: 322 NQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQNNPNKVAKNLVGG 501 NQ R NNQ+ + SRPT P ++ + S + V+ +V Sbjct: 610 NQNNIGRTNNQVQPGAN---------SRPTMTTRPAEKEAELSAKNGAQDVGVSSAVVAD 660 Query: 502 SGAGVLMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTKIGAE 681 S A + +E ++ +ST D +++++ S G+K E S+G+L + S+S E Sbjct: 661 SEAKTVKSE--VDIKSTDDGNKPSSEDR-SYQGTK--EIPESKGMLGANGESESKPTLKE 715 Query: 682 DHKDVRKKSEVQESK--QTAKSGAPNMPQSNLS-----------TQVHGTN--------A 798 + D + ++ K + GA + P S + Q+HG + Sbjct: 716 EGVDSTLE-DLSNGKLGELVAEGAKDAPSSGMKLGEHKEMPPEEAQLHGVKDKKLQKVVS 774 Query: 799 SVDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSAD 978 S ++G +Q I P Q++ G + M S P QQ + Sbjct: 775 STEEG-SQTVSISSAPIGQVQAGGL-MQPSHPGSAILQQKPGAPPLLQVPSSGPPHHILG 832 Query: 979 NSQPSSIKHP--PGRVP------HENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQ 1134 + QP + P PG VP E+ G + G Y S P +G Sbjct: 833 SGQPLAHVRPQGPGHVPGHPSHLSEHFQSPRGNLGFAASSANASQHGPYNQSHAPPHSGA 892 Query: 1135 PQNPAGEPFGGSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGR 1314 P+ P PF A P A DSH G+ R PY E + +QRP Sbjct: 893 PRGP---PF------APPPSAFDSHGGIMARAA-------PYGHEGQ-MGLQRPAF---- 931 Query: 1315 KPESLPHGSLDRAAYGPVQP-GVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRI 1491 +++ A G QP G+ +++ G P +S S G+RDER P+ RL Sbjct: 932 --------QMEQGATG--QPSGIISNMLRMNGNPGFESSSTLGLRDERFKALPDGRLNPF 981 Query: 1492 PHRE--------FEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPF 1647 P FEDD ++FPR S ++ P KLG SR F Sbjct: 982 PGDPTRVISRVGFEDDLKQFPRPSFLDSEPLPKLGNY------------------SSRAF 1023 Query: 1648 EKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADF 1827 ++ P G D L +D A GS P RFL P+ G G +D++G DFG Sbjct: 1024 DRRPFGVNYDTRLNIDPAAGSAP-RFLSPY-----GHAGL--IHANDTIGHPDFG----- 1070 Query: 1828 SGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFGDG---GNSFP--AESFGKSIHD 1992 GR MDG RSP RD+PG+PS F FG G F + G+ HD Sbjct: 1071 
---------GRRLMDGLARRSPIRDYPGIPS-RFRGFGPDDFDGREFHRFGDPLGREFHD 1120 Query: 1993 SRFPVPPNHLQRGEIDVPGNLRVGGPRNQDMLPN-----HLRR-DLVGPRNMHMGDPTKP 2154 +RFP H +RGE + PGN+RV D++ HL+R + +GP N+ P Sbjct: 1121 NRFP--NQHFRRGEFEGPGNMRVDDRMRNDLIGQDGHLGHLQRGEHLGPHNL----PGHL 1174 Query: 2155 RMGEPPLARNFPQHLPFG--ESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEP 2328 M E P+H G ESF G + HP GEP ++G Y E+E Sbjct: 1175 HMREHVGFGVHPRHAGPGSFESFIGNRANHPRLGEPGFRSSFSLKRFPNDGT-YAGELES 1233 Query: 2329 FDDPRKWKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMV 2460 FD RK KP + CRICKV C TVEGLD+HSQ+REHQR A +MV Sbjct: 1234 FDHSRKRKPASMGWCRICKVNCETVEGLDVHSQTREHQRMAMEMV 1278 >gb|EOY33855.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 1345 Score = 197 bits (502), Expect = 2e-47 Identities = 233/818 (28%), Positives = 305/818 (37%), Gaps = 82/818 (10%) Frame = +1 Query: 103 YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282 Y QPQQ A HA P HG Q AA Sbjct: 611 YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 653 Query: 283 XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462 V+P L PS N + + Q +G QP ++ D+ V + D Sbjct: 654 ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 708 Query: 463 NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603 ++P K ++ GA V N AK+ A+ T D G D+N +S + Sbjct: 709 SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 768 Query: 604 KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717 ES + G +P + T ED KDV K +Q Sbjct: 769 P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 826 Query: 718 ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813 E+K Q K G P P N S+QV +VDQG Sbjct: 827 EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 886 Query: 814 RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984 R+Q + YG + Q RP ++ ++ P P ++Q P + +N Sbjct: 887 RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 946 Query: 985 QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 
1164 P S P G GP+ N G PP +G P+ GEP Sbjct: 947 PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 984 Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344 G S+ A DSH P+ P S + ++ Sbjct: 985 GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 1022 Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500 D P G+ DS S +R ER P +E + P HR Sbjct: 1023 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 1069 Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680 +FE+D + FPR SH + P K G+ SS PLD GPH + D R EK PHGF DP Sbjct: 1070 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 1129 Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860 +GSGP RFLPP+HP+D GE RP P D++GR DF G P YGR Sbjct: 1130 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 1173 Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040 RMDGF RSPGR++PG+ FG G G+ + RFP P HL RG + Sbjct: 1174 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 1228 Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205 + +LR NQD P + RR + VG NM P R+GEP +F H Sbjct: 1229 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 1284 Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFY 2310 GE FGG PG HP GEP ++GG Y Sbjct: 1285 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIY 1319 >gb|EOY33854.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 1358 Score = 197 bits (502), Expect = 2e-47 Identities = 233/818 (28%), Positives = 305/818 (37%), Gaps = 82/818 (10%) Frame = +1 Query: 103 YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282 Y QPQQ A HA P HG Q AA Sbjct: 611 YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 653 Query: 283 XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462 V+P L PS N + + Q +G QP ++ D+ V + D Sbjct: 654 ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 708 Query: 463 
NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603 ++P K ++ GA V N AK+ A+ T D G D+N +S + Sbjct: 709 SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 768 Query: 604 KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717 ES + G +P + T ED KDV K +Q Sbjct: 769 P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 826 Query: 718 ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813 E+K Q K G P P N S+QV +VDQG Sbjct: 827 EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 886 Query: 814 RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984 R+Q + YG + Q RP ++ ++ P P ++Q P + +N Sbjct: 887 RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 946 Query: 985 QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164 P S P G GP+ N G PP +G P+ GEP Sbjct: 947 PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 984 Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344 G S+ A DSH P+ P S + ++ Sbjct: 985 GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 1022 Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500 D P G+ DS S +R ER P +E + P HR Sbjct: 1023 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 1069 Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680 +FE+D + FPR SH + P K G+ SS PLD GPH + D R EK PHGF DP Sbjct: 1070 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 1129 Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860 +GSGP RFLPP+HP+D GE RP P D++GR DF G P YGR Sbjct: 1130 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 1173 Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040 RMDGF RSPGR++PG+ FG G G+ + RFP P HL RG + Sbjct: 1174 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 1228 Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205 + +LR NQD P + RR + VG NM P R+GEP +F H Sbjct: 
1229 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 1284 Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFY 2310 GE FGG PG HP GEP ++GG Y Sbjct: 1285 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIY 1319 >gb|EOY33850.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1326 Score = 197 bits (502), Expect = 2e-47 Identities = 233/818 (28%), Positives = 305/818 (37%), Gaps = 82/818 (10%) Frame = +1 Query: 103 YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282 Y QPQQ A HA P HG Q AA Sbjct: 611 YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 653 Query: 283 XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462 V+P L PS N + + Q +G QP ++ D+ V + D Sbjct: 654 ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 708 Query: 463 NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603 ++P K ++ GA V N AK+ A+ T D G D+N +S + Sbjct: 709 SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 768 Query: 604 KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717 ES + G +P + T ED KDV K +Q Sbjct: 769 P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 826 Query: 718 ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813 E+K Q K G P P N S+QV +VDQG Sbjct: 827 EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 886 Query: 814 RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984 R+Q + YG + Q RP ++ ++ P P ++Q P + +N Sbjct: 887 RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 946 Query: 985 QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164 P S P G GP+ N G PP +G P+ GEP Sbjct: 947 PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 984 Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344 G S+ A DSH P+ P S + ++ Sbjct: 985 GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 1022 Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500 D P G+ DS S +R ER P +E + P HR Sbjct: 1023 
DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 1069 Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680 +FE+D + FPR SH + P K G+ SS PLD GPH + D R EK PHGF DP Sbjct: 1070 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 1129 Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860 +GSGP RFLPP+HP+D GE RP P D++GR DF G P YGR Sbjct: 1130 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 1173 Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040 RMDGF RSPGR++PG+ FG G G+ + RFP P HL RG + Sbjct: 1174 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 1228 Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205 + +LR NQD P + RR + VG NM P R+GEP +F H Sbjct: 1229 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 1284 Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFY 2310 GE FGG PG HP GEP ++GG Y Sbjct: 1285 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIY 1319 >ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [Amborella trichopoda] gi|548851351|gb|ERN09627.1| hypothetical protein AMTR_s00029p00190880 [Amborella trichopoda] Length = 1626 Score = 184 bits (466), Expect = 2e-43 Identities = 175/589 (29%), Positives = 235/589 (39%), Gaps = 59/589 (10%) Frame = +1 Query: 988 PSSIKHPPGRVPHENASGAMVTPGLSGPFPR-PDNMGYYQASMPPY-QAGQPQNPAGEPF 1161 P I+ PPG H V G G R P+ +G S+PP + P P + Sbjct: 1071 PDMIEKPPGPPLHHGPLHPGVQTGGPGDIGRGPNQLGMPPPSLPPQGHSSVPMYPPSKHA 1130 Query: 1162 GGSSFAAQRPGALDSHVGVREREPA---DFEQRPPYPMEN-EKFPVQRPGSFDGRKPESL 1329 G G D + R P D + P PM++ + F RPG FDGR+P+ Sbjct: 1131 PGERLPGPPSGPFDGPGSMMPRAPVHGIDNQMGRP-PMDHVDTFLKNRPGYFDGRQPDVH 1189 Query: 1330 PHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIPH---- 1497 DRA YG V G+ +S G+ +ER P PE+R K +P Sbjct: 1190 QSLPSDRAPYGLVNGAAGKGSN------VPESAFPHGLPEERFGPLPEDRFKHLPEDGLK 1243 Query: 1498 ----------------------REFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGP 1611 REFE+D +KFPRS H + P+S+ F S 
P H P Sbjct: 1244 KPLPDDHFRPYALDPSRRAIDRREFEEDLKKFPRSGHLDGEPASRYDGYFSSRNPSGHSP 1303 Query: 1612 HVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDS 1791 G + + P G P G L D+G+R +P F D Sbjct: 1304 RSLERPGLNLDAPRYPEGMSVPPY----RGAGGSSL---------DLGDRSKPGGFHGDL 1350 Query: 1792 MGR--GDFGHRADFSGPGAGPGYGRSRMDGF-PPRSPGRDFPGLP--------------- 1917 +GR G R+D+ GP P RS DG PPRSP RD+ G+ Sbjct: 1351 IGRKLDTTGARSDYGGPF--PEVSRSHRDGLGPPRSPVRDYAGVRVSGVRPDYAGIPHPL 1408 Query: 1918 SGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEIDVPGNLRVGGPRNQDMLPNH 2097 G G G A +F IH + P P E +P R+ P H Sbjct: 1409 DGLGGREPLGFGEQRARAFLDPIHGGKIPSGPF-----ESRLPIPSRIAESAGFGDFPGH 1463 Query: 2098 LRR-DLVGPRNMHMGD-PTKPRMGEPPLARNFPQHLPFGESFGG----EKPGHPLAGEPX 2259 LR D GP + G+ P+ R E + N P HL GE+ G +PG + G P Sbjct: 1464 LRGGDPFGPSHFRSGELPSHLRGRELAGSGNLPPHLRIGEAMGPGGHLREPGFGMQGYPK 1523 Query: 2260 XXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCRICKVECGTVEGLDLHSQSREH 2436 + G F P +++ + RK KP CRICKV+C TVEGLDLHSQ+REH Sbjct: 1524 DGGFY------NPGSFPPSDVDALEYSRKRKPGSTGWCRICKVDCETVEGLDLHSQTREH 1577 Query: 2437 QRKARDMVLXXXXXXXXXXXXXXXAT--FEGRDGGRPRNSSFQGRGNKR 2577 Q+ A DMVL + + + R +SF+ RG++R Sbjct: 1578 QKMAMDMVLSIKQDSAKKQKLYGSSEDHVPQEEPTKGRRASFESRGSRR 1626