BLASTX nr result
ID: Atropa21_contig00009254
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00009254 (2377 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006347527.1| PREDICTED: uncharacterized protein LOC102592...  1151   0.0
ref|XP_006347526.1| PREDICTED: uncharacterized protein LOC102592...  1151   0.0
ref|XP_004235030.1| PREDICTED: uncharacterized protein LOC101252...  1091   0.0
ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853...   193   3e-46
gb|EMJ21784.1| hypothetical protein PRUPE_ppa000352mg [Prunus pe...   164   2e-37
ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301...   146   5e-32
ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus c...   143   4e-31
ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628...   139   7e-30
ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citr...   135   1e-28
ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citr...   135   1e-28
ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citr...   135   1e-28
ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Popu...   132   5e-28
ref|XP_006441269.1| hypothetical protein CICLE_v10018632mg [Citr...   127   2e-26
ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Popu...   123   4e-25
gb|EOY23726.1| Uncharacterized protein isoform 6 [Theobroma cacao]    122   5e-25
gb|EOY23723.1| Uncharacterized protein isoform 3 [Theobroma cacao]    122   5e-25
gb|EOY23722.1| Uncharacterized protein isoform 2 [Theobroma cacao]    122   5e-25
gb|EOY23721.1| Uncharacterized protein isoform 1 [Theobroma caca...
122 5e-25 gb|EOY23725.1| Uncharacterized protein isoform 5 [Theobroma cacao] 122 9e-25 gb|EOY23728.1| Uncharacterized protein isoform 8, partial [Theob... 110 2e-21 >ref|XP_006347527.1| PREDICTED: uncharacterized protein LOC102592566 isoform X2 [Solanum tuberosum] Length = 1166 Score = 1151 bits (2978), Expect = 0.0 Identities = 598/813 (73%), Positives = 658/813 (80%), Gaps = 44/813 (5%) Frame = +2 Query: 71 PLAPPFTVDRSNSKPGST---------YTGSVPFGQQSWQQ-YTDPSTTGYNFFPKHEIV 220 PLAPPFTVDR+NSK GST YTG+VPFGQ SWQ +PS TGYNFFP V Sbjct: 21 PLAPPFTVDRTNSKTGSTQLLNFSDSSYTGTVPFGQ-SWQYGAANPSPTGYNFFPS---V 76 Query: 221 TDSMPTTC----MPEFTPTDSVKPSSNNLWSTSNQTANVSTDTYS----GYYAPYVPSIV 376 TDS+PTTC PEF+P DSV+P S+ WSTSN T + STDTYS GYYAPYVPSIV Sbjct: 77 TDSVPTTCNMPLSPEFSPADSVEPGSH-FWSTSNPTVHASTDTYSFGREGYYAPYVPSIV 135 Query: 377 TNDSPSAAFNEGLFDAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDV 556 +N+ PSAAFNE D +P+SG+I V+ SSQVDYTQSLSGLEYP HWS ++KV DGKQD Sbjct: 136 SNEHPSAAFNEPSLDVLPNSGSIHVDASSQVDYTQSLSGLEYP--HWSFFSKVADGKQDE 193 Query: 557 RKGV---------NAGASFGYMNCTSQGNSLEGVNIAGEDSGALSGNFTDGVYTGPSSMG 709 R GV NAGAS+GY NC SQGNSLEGVNIA EDSGA GNF DGVYTGPSSMG Sbjct: 194 RNGVDGSFSLGNVNAGASYGYRNCMSQGNSLEGVNIAREDSGA--GNFIDGVYTGPSSMG 251 Query: 710 HMDAKPYIPQEPVYPSFNSKTAVGSILPVSCQAGLSLGSSNNYLNYENPFTPHEKFFQPI 889 HMDAK Y+ QEP+Y S NS+TA+GSILPVSCQ GLSLGSSNNYLNYENPFTPHEKFFQP+ Sbjct: 252 HMDAKSYLTQEPIYQSLNSETAMGSILPVSCQVGLSLGSSNNYLNYENPFTPHEKFFQPL 311 Query: 890 DSCPRDTTSTSKFSPVVVIRPAPSGSRFFAQKTD--------KTGASNSEKSDVCDLLNK 1045 DSCPRDTTSTSK SPVVVIRPAPSGSRFFA K D KTGA+NSEKSDVCDLL K Sbjct: 312 DSCPRDTTSTSKSSPVVVIRPAPSGSRFFAPKIDLHKNVDICKTGATNSEKSDVCDLL-K 370 Query: 1046 GEETRLPIDSQVEGFALGTGPPLDFGKIKDIFYASSSIKNQCPSHPCGSNGIEIAVKESY 1225 +ETRLPIDS ++ F+LG+ PLDF KIK+IF+ASSS+ N C + PC SN IEIAVKE Sbjct: 371 SQETRLPIDSPIKEFSLGSSTPLDFDKIKNIFFASSSVNNLCSTRPCSSNSIEIAVKERS 430 Query: 1226 GSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKL 1405 GSQAP +SAPPVTF EKCSDALDLHN N DSPCWKGAPAF IS DSV+A SPC F K+ Sbjct: 431 
GSQAPCASAPPVTFAEKCSDALDLHNPNVDSPCWKGAPAFRISLGDSVDASSPCLFTSKV 490 Query: 1406 ECSDFGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEHR 1585 E +DF QSNPLFPPAE+SG+TSLKK GE+NLH+HNVYAG GLS P+ GTGTNNY TEE R Sbjct: 491 EFADFSQSNPLFPPAEYSGKTSLKKLGEENLHNHNVYAGNGLSVPSVGTGTNNYTTEELR 550 Query: 1586 TNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDC---------LSVDEHK 1738 T DV ++TF MDLSS+ G+ KFSEDLNKPSKGY+LPQYSENDC LSVD H+ Sbjct: 551 TIDVTKETFVPMDLSSNGGIPKFSEDLNKPSKGYSLPQYSENDCQLQYSWGKHLSVDGHQ 610 Query: 1739 YGHTKHSLTESFVHSGLNLNDTLEGGVVALDAAENVLRSPASQEDAKQAQPYEVGSSPKL 1918 YG KH+L E ++H+GL+LNDTLEGGVVALDAAENVLRSPASQEDAKQAQ Y++GSSPKL Sbjct: 611 YGPKKHNLPEGYMHTGLSLNDTLEGGVVALDAAENVLRSPASQEDAKQAQQYQMGSSPKL 670 Query: 1919 DVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKIETKDTMISQH 2098 DVQTLV AI+NLSELLKSQCL NACLL+ QD D LK AITNLGACT KKIETKDTM+SQH Sbjct: 671 DVQTLVHAIHNLSELLKSQCLANACLLEGQDIDTLKSAITNLGACTAKKIETKDTMVSQH 730 Query: 2099 DTFEKFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQPVHEDKSKNNGKTAENSPLLTPV 2278 DTFEKF ES S+MGT TGHPQFMEEV W+SCGL NQP EDKSKNNGK ENS LLTP Sbjct: 731 DTFEKFEESRRSFMGTETGHPQFMEEVAWDSCGLDNQPTPEDKSKNNGKKTENSALLTPA 790 Query: 2279 DELGGSNEEQVVQAIKKVLNENFLSDEGMQPQA 2377 D+LG SNEEQVVQAIKKVLNENFLSDEGMQPQA Sbjct: 791 DDLGDSNEEQVVQAIKKVLNENFLSDEGMQPQA 823 >ref|XP_006347526.1| PREDICTED: uncharacterized protein LOC102592566 isoform X1 [Solanum tuberosum] Length = 1173 Score = 1151 bits (2978), Expect = 0.0 Identities = 598/813 (73%), Positives = 658/813 (80%), Gaps = 44/813 (5%) Frame = +2 Query: 71 PLAPPFTVDRSNSKPGST---------YTGSVPFGQQSWQQ-YTDPSTTGYNFFPKHEIV 220 PLAPPFTVDR+NSK GST YTG+VPFGQ SWQ +PS TGYNFFP V Sbjct: 21 PLAPPFTVDRTNSKTGSTQLLNFSDSSYTGTVPFGQ-SWQYGAANPSPTGYNFFPS---V 76 Query: 221 TDSMPTTC----MPEFTPTDSVKPSSNNLWSTSNQTANVSTDTYS----GYYAPYVPSIV 376 TDS+PTTC PEF+P DSV+P S+ WSTSN T + STDTYS GYYAPYVPSIV Sbjct: 77 TDSVPTTCNMPLSPEFSPADSVEPGSH-FWSTSNPTVHASTDTYSFGREGYYAPYVPSIV 135 Query: 377 TNDSPSAAFNEGLFDAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDV 556 +N+ PSAAFNE D +P+SG+I V+ 
SSQVDYTQSLSGLEYP HWS ++KV DGKQD Sbjct: 136 SNEHPSAAFNEPSLDVLPNSGSIHVDASSQVDYTQSLSGLEYP--HWSFFSKVADGKQDE 193 Query: 557 RKGV---------NAGASFGYMNCTSQGNSLEGVNIAGEDSGALSGNFTDGVYTGPSSMG 709 R GV NAGAS+GY NC SQGNSLEGVNIA EDSGA GNF DGVYTGPSSMG Sbjct: 194 RNGVDGSFSLGNVNAGASYGYRNCMSQGNSLEGVNIAREDSGA--GNFIDGVYTGPSSMG 251 Query: 710 HMDAKPYIPQEPVYPSFNSKTAVGSILPVSCQAGLSLGSSNNYLNYENPFTPHEKFFQPI 889 HMDAK Y+ QEP+Y S NS+TA+GSILPVSCQ GLSLGSSNNYLNYENPFTPHEKFFQP+ Sbjct: 252 HMDAKSYLTQEPIYQSLNSETAMGSILPVSCQVGLSLGSSNNYLNYENPFTPHEKFFQPL 311 Query: 890 DSCPRDTTSTSKFSPVVVIRPAPSGSRFFAQKTD--------KTGASNSEKSDVCDLLNK 1045 DSCPRDTTSTSK SPVVVIRPAPSGSRFFA K D KTGA+NSEKSDVCDLL K Sbjct: 312 DSCPRDTTSTSKSSPVVVIRPAPSGSRFFAPKIDLHKNVDICKTGATNSEKSDVCDLL-K 370 Query: 1046 GEETRLPIDSQVEGFALGTGPPLDFGKIKDIFYASSSIKNQCPSHPCGSNGIEIAVKESY 1225 +ETRLPIDS ++ F+LG+ PLDF KIK+IF+ASSS+ N C + PC SN IEIAVKE Sbjct: 371 SQETRLPIDSPIKEFSLGSSTPLDFDKIKNIFFASSSVNNLCSTRPCSSNSIEIAVKERS 430 Query: 1226 GSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKL 1405 GSQAP +SAPPVTF EKCSDALDLHN N DSPCWKGAPAF IS DSV+A SPC F K+ Sbjct: 431 GSQAPCASAPPVTFAEKCSDALDLHNPNVDSPCWKGAPAFRISLGDSVDASSPCLFTSKV 490 Query: 1406 ECSDFGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEHR 1585 E +DF QSNPLFPPAE+SG+TSLKK GE+NLH+HNVYAG GLS P+ GTGTNNY TEE R Sbjct: 491 EFADFSQSNPLFPPAEYSGKTSLKKLGEENLHNHNVYAGNGLSVPSVGTGTNNYTTEELR 550 Query: 1586 TNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDC---------LSVDEHK 1738 T DV ++TF MDLSS+ G+ KFSEDLNKPSKGY+LPQYSENDC LSVD H+ Sbjct: 551 TIDVTKETFVPMDLSSNGGIPKFSEDLNKPSKGYSLPQYSENDCQLQYSWGKHLSVDGHQ 610 Query: 1739 YGHTKHSLTESFVHSGLNLNDTLEGGVVALDAAENVLRSPASQEDAKQAQPYEVGSSPKL 1918 YG KH+L E ++H+GL+LNDTLEGGVVALDAAENVLRSPASQEDAKQAQ Y++GSSPKL Sbjct: 611 YGPKKHNLPEGYMHTGLSLNDTLEGGVVALDAAENVLRSPASQEDAKQAQQYQMGSSPKL 670 Query: 1919 DVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKIETKDTMISQH 2098 DVQTLV AI+NLSELLKSQCL NACLL+ QD D LK AITNLGACT KKIETKDTM+SQH Sbjct: 671 
DVQTLVHAIHNLSELLKSQCLANACLLEGQDIDTLKSAITNLGACTAKKIETKDTMVSQH 730 Query: 2099 DTFEKFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQPVHEDKSKNNGKTAENSPLLTPV 2278 DTFEKF ES S+MGT TGHPQFMEEV W+SCGL NQP EDKSKNNGK ENS LLTP Sbjct: 731 DTFEKFEESRRSFMGTETGHPQFMEEVAWDSCGLDNQPTPEDKSKNNGKKTENSALLTPA 790 Query: 2279 DELGGSNEEQVVQAIKKVLNENFLSDEGMQPQA 2377 D+LG SNEEQVVQAIKKVLNENFLSDEGMQPQA Sbjct: 791 DDLGDSNEEQVVQAIKKVLNENFLSDEGMQPQA 823 >ref|XP_004235030.1| PREDICTED: uncharacterized protein LOC101252062 [Solanum lycopersicum] Length = 1175 Score = 1091 bits (2822), Expect = 0.0 Identities = 571/814 (70%), Positives = 644/814 (79%), Gaps = 45/814 (5%) Frame = +2 Query: 71 PLAPPFTVDRSNSKPGST---------YTGSVPFGQQSWQQYT-DPSTTGYNFFPKHEIV 220 PLAPPFTVDRSNSK ST YTG+VPFGQ SWQ DPS TGYNFFP V Sbjct: 21 PLAPPFTVDRSNSKTVSTQLLNFSDSSYTGTVPFGQ-SWQYAAADPSPTGYNFFPS---V 76 Query: 221 TDSMPTTC----MPEFTPTDSVKPSSNNLWSTSNQTANVSTDTYS----GYYAPYVPSIV 376 TDS+PTTC PEFTP DSV+P S+ WST N T N ST+TYS GYYA YVPS+V Sbjct: 77 TDSVPTTCNMPLSPEFTPADSVEPGSH-FWSTPNPTVNASTETYSFGREGYYAAYVPSLV 135 Query: 377 TNDSPSAAFNEGLFDAVPSSGNISVNVSS-QVDYTQSLSGLEYPVSHWSVWTKVGDGKQD 553 +N+ PS+AFNE D +P+SGNI V+ SS QVDYTQ+LSGLEYP HWS ++KV DGKQ+ Sbjct: 136 SNEHPSSAFNEPSLDVLPNSGNIHVDASSSQVDYTQTLSGLEYP--HWSFFSKVADGKQE 193 Query: 554 VRKGV---------NAGASFGYMNCTSQGNSLEGVNIAGEDSGALSGNFTDGVYTGPSSM 706 +KGV N GAS+GY NC S+GNSLEG NI E+SGA NF DGVYTGPSS+ Sbjct: 194 EKKGVDGSFSSGNVNVGASYGYRNCMSKGNSLEGANIPRENSGA--ANFIDGVYTGPSSI 251 Query: 707 GHMDAKPYIPQEPVYPSFNSKTAVGSILPVSCQAGLSLGSSNNYLNYENPFTPHEKFFQP 886 GHMDAK Y+ QEP+Y S S+TA+GS PVSCQ GLSLGSS+NYLNY+NPFTPH KFFQP Sbjct: 252 GHMDAKSYLTQEPIYQSLTSETAMGSFSPVSCQVGLSLGSSSNYLNYKNPFTPHGKFFQP 311 Query: 887 IDSCPRDTTSTSKFSPVVVIRPAPSGSRFFAQKTD--------KTGASNSEKSDVCDLLN 1042 +DSCPRDTTSTSK SPV+V RPAPSGSRFFA K D KTGA+N+EKSDVC++L Sbjct: 312 LDSCPRDTTSTSKSSPVLVFRPAPSGSRFFAPKIDLHKNVDICKTGATNTEKSDVCNVL- 370 Query: 1043 KGEETRLPIDSQVEGFALGTGPPLDFGKIKDIFYASSSIKNQCPSHPCGSNGIEIAVKES 1222 K +ETRLPIDS ++ F+LG+ P DF KIK+ F+ASSS+ N C + PC SN 
IEIAVKE Sbjct: 371 KSQETRLPIDSPIKEFSLGSSTPPDFDKIKNNFFASSSVNNLCSTRPCSSNSIEIAVKER 430 Query: 1223 YGSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCK 1402 GSQAP +SAPPVT EKCSDALDLHN N DSPCWKGAPAF +S SDSVEAPSPC K Sbjct: 431 SGSQAPCASAPPVTSAEKCSDALDLHNPNVDSPCWKGAPAFRVSLSDSVEAPSPCILTSK 490 Query: 1403 LECSDFGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEH 1582 +E SDFGQSN LFPPAE+SG+TSLKK GE+NLH+HNVYAG GLS P+ GT TNNY TEE Sbjct: 491 VEFSDFGQSNHLFPPAEYSGKTSLKKLGEENLHNHNVYAGNGLSVPSVGTVTNNYTTEEL 550 Query: 1583 RTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDC---------LSVDEH 1735 RT DV + TF +DLSS+ +LKFSEDLNKPSKGY+LPQYSENDC LSVD H Sbjct: 551 RTIDVTKGTFVPVDLSSNGVILKFSEDLNKPSKGYSLPQYSENDCQKQYSWGEHLSVDCH 610 Query: 1736 KYGHTKHSLTESFVHSGLNLNDTLEGGVVALDAAENVLRSPASQEDAKQAQPYEVGSSPK 1915 +YG KH+L E ++H+GLNLNDTLEGGVVALDAAENVLRSPASQEDAKQAQPY++GSSPK Sbjct: 611 QYGPKKHNLPEGYMHTGLNLNDTLEGGVVALDAAENVLRSPASQEDAKQAQPYQMGSSPK 670 Query: 1916 LDVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKIETKDTMISQ 2095 LDVQTLV AI+NLSELLKSQCL NACLL+ QD+D LK AITNLGACT KKIETKDTM+++ Sbjct: 671 LDVQTLVHAIHNLSELLKSQCLPNACLLEGQDYDTLKSAITNLGACTVKKIETKDTMVTE 730 Query: 2096 HDTFEKFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQPVHEDKSKNNGKTAENSPLLTP 2275 HDTFE+ ES SYMGT TG+PQFMEEV +SCGL NQP+ EDKSKNNGK ENSPLLT Sbjct: 731 HDTFERLKESHRSYMGTETGNPQFMEEVARDSCGLDNQPMPEDKSKNNGKKTENSPLLTS 790 Query: 2276 VDELGGSNEEQVVQAIKKVLNENFLSDEGMQPQA 2377 D+LG SNEEQVVQAIKKVLNENFLSDEGMQPQA Sbjct: 791 ADDLGDSNEEQVVQAIKKVLNENFLSDEGMQPQA 824 >ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853355 [Vitis vinifera] gi|302143995|emb|CBI23100.3| unnamed protein product [Vitis vinifera] Length = 1167 Score = 193 bits (491), Expect = 3e-46 Identities = 239/873 (27%), Positives = 363/873 (41%), Gaps = 105/873 (12%) Frame = +2 Query: 74 LAPPFTVDRSNSKP---------GSTYTGSVPFGQQSWQQYTDPSTTGYNFFPKHEIVTD 226 LAPPFTVDR SKP STY +W P + ++F D Sbjct: 21 LAPPFTVDRPVSKPLSNPLVNFTESTYAAPFNSSLHNWVHPQSPVSRP-DYFSNPNSAVD 79 Query: 227 
SMPTTCMPEF------------TPTDSVKPSSNNLWSTSN-----QTANVSTDTYS---- 343 S+ T +P +P + P S+ + ++ + TD +S Sbjct: 80 SVQATGVPPSNAYRYSVSQPVNSPVVHLPPLSHIVSGIAHLPPLSPIVSAGTDVFSFGQC 139 Query: 344 ------------GYYAPYVPSIVTNDSPSAAFNEGLFDAVPSSGNISVNVSSQVD-YTQS 484 YY PYV + ++SP NE +D + +S +N SS +D YTQS Sbjct: 140 SDRMKTSLVEAKPYYPPYVAPAIEDNSPLVVLNEPNYDLLSTSHAAHLNGSSSLDDYTQS 199 Query: 485 LSGLEYPVSHWSVWTKVGDGKQDVR---------KGVNAGASFGYMNCTSQGN-SLEGVN 634 +SGLEYP W + D +Q + K N S Y + +QG+ + EGV+ Sbjct: 200 MSGLEYPSRWCGFWNGLADIEQGKKVELDESLCSKESNFVGSSIYRSYINQGDPTAEGVS 259 Query: 635 IAGEDSGALSGNFTDGV----YTGPSSMGHMDAKPYIPQEP----VYPSFNSKTAVGS-- 784 + E S + D + G S H + K + + V F + +GS Sbjct: 260 NSEEGSVLSDRKYVDILGRDNCVGSLSPDHFNNKSFYEPKANPMVVSLDFPRTSFLGSTS 319 Query: 785 ILPVSCQAGL-SLGSSNNYLNYENPFTP-HEKFFQPIDSCPRDTTSTSKFSPVVVIRP-- 952 +LP + SL N NY P + +EK F+ IDSC D S +K SP +VIRP Sbjct: 320 VLPETPHPRAPSLEPVTNSWNYRKPQSALYEKCFRKIDSCVDDPVSKAKSSPAIVIRPPA 379 Query: 953 -APS--GSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQVEGFALGTGPPLDFG 1123 +PS G F+ + + NSE L N EE +P+ S+ T Sbjct: 380 NSPSSLGVNSFSSR-NMICTDNSENVSGHHLSNM-EEPHIPVISEGRELYSDTSQLNGHW 437 Query: 1124 KIKD--IFYASSSIKNQCPSHPCGSNGIEIAVKESYGSQAPY-----------SSAPPVT 1264 + D +SS+ K++ ++ G + ++ Q P+ +S V Sbjct: 438 QRNDHLSMESSSTKKHELLNNEMGVKETDNLLRARSELQIPHLNVEDGFSFSPNSIEAVN 497 Query: 1265 FTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKLECSD-FG-QSNPL 1438 + S+ LD +N DSPCWKG+ SP + EA SP +LE D F Q + + Sbjct: 498 SIDNTSETLDHYNPAVDSPCWKGSITSHFSPFEVSEALSPHNLMEQLEALDGFNLQGHHI 557 Query: 1439 FP-PAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEHRTNDVKEKTFE 1615 FP ++ + S K E+ + NV GL + N+ + E R+ D KT Sbjct: 558 FPLNSDDAVNVSSLKPNENTEYHKNVCGENGLLPSWKRPSVVNHPSREQRSLDAF-KTGP 616 Query: 1616 HMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDCL--------SVDEHKY-GHTKHSLTE 1768 + SS + S D+ +P + ++L S++D L S +E K+ K S Sbjct: 617 YCQKLSSGDGNQSSNDIIQPKRDHSLLNSSKSDNLELSHTMRQSFEEVKFTSERKLSSGV 676 Query: 1769 SFVHSGLNLNDTLEGGVV--ALDAAENVLRSPASQEDA--KQAQPYEVGSSPKLDVQTLV 1936 +G N+ND G EN+ SP S 
+DA K + S+PK+DV L+ Sbjct: 677 GVEVTGNNINDVSRDGSSHETYHLTENISCSPLSGDDASTKLTKQPASESTPKIDVHMLI 736 Query: 1937 RAIYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKIETKDTMISQH------ 2098 + +LS LL S C NA L EQDH+ LK I N AC TKK + S H Sbjct: 737 NTVQDLSVLLLSHCSDNAFSLKEQDHETLKRVIDNFDACLTKKGQKIAEQGSSHFLGELP 796 Query: 2099 DTFEKFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQPVHEDKSKNNGKTAENSPLLTPV 2278 D + S P +G ++ S G + S + K + S ++ V Sbjct: 797 DLNKSASASWP--LGKKVADANVEDQFHCQSDHKGKRHC----SVSGNKDEKLSDFVSLV 850 Query: 2279 DELGGSNEEQVVQAIKKVLNENFLSDEGMQPQA 2377 ++ N++ +QAI+K+L++NF +E PQA Sbjct: 851 NDEDTVNDDSTIQAIRKILDKNFHDEEETDPQA 883 >gb|EMJ21784.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica] Length = 1254 Score = 164 bits (414), Expect = 2e-37 Identities = 214/854 (25%), Positives = 338/854 (39%), Gaps = 92/854 (10%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGST---------YTGSVPFGQQSWQQYTDPSTTGYNFFPKHEIVTD 226 LAPPFTVDRS KP S+ Y + +W + P TG NFF + Sbjct: 25 LAPPFTVDRSVPKPISSPLVDVTETPYVAPLNSSSHNWLP-SHPPITGSNFFANPTPEFN 83 Query: 227 SMPTTCMPEFT-------------PTDSVKPSSNNLWSTSNQTANVSTDTYSG--YYAPY 361 S+P++ + P +++ P+S+N ++ V+T YY Y Sbjct: 84 SLPSSNAYRYAGSQIVDPPNTTLPPLNTITPASSNAFTYDQSLDAVATSFVEAKPYYPSY 143 Query: 362 VPSIVTNDSPSAAFNEGLFDAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGD 541 + + DSP ++ +D + ++ ++ S+ DYTQ L+Y +W + + Sbjct: 144 LSPTIHGDSPLVVPDQPSYDWLSTTHFAPLDGCSRKDYTQRPPDLKYTAQWGGLWNGLSE 203 Query: 542 GKQDVR---------KGVNAGASFGYMNCTSQ----GNSLE-------GVNIAGEDSGAL 661 +Q + K + SF Y N +Q NSL G+N G + Sbjct: 204 WEQGKQGDFDGSFCSKKTDVSGSFLYKNFMNQEPHSSNSLNSFEEASHGINTLGWEKPGG 263 Query: 662 SGNFTDGVYTGPSSMGHMDAKPYIPQEPVY-PSFNSKTAVGSILPVSCQAGLSLGSSNNY 838 SGN H+ K + + + PS SK+ +GS L V + L SS Sbjct: 264 SGN------------AHLGDKSLVGKNSKFTPSDFSKSVMGS-LSVVPEPHLKAPSSQCV 310 Query: 839 LNYENPFTPHE--KFFQPIDSCPRDTTSTSKFSPVVVIRPAPSGSRFFAQKTDK----TG 1000 N TP+ Q +D+ TS S+ SP R G++ T Sbjct: 311 TKTSNCKTPYSVSSETQQLDASLDYITSISESSPAFATRTPALGTKLSEPGTGLFRRLNF 370 Query: 1001 
ASNSEKSDVCDLLNKG-EETRLP--------IDSQVEGFALGTGPPLDFGKIKDIFYA-S 1150 S++ +D D + G +E+ LP DS GF LG KD F A S Sbjct: 371 ISDAADTDHGDYYSSGVQESHLPQISEGKVLFDSSQLGFHLGA---------KDCFSAES 421 Query: 1151 SSIKNQCPSHPCGSNGIEIAVKESY------------------GSQAPYSSAPPVTFTEK 1276 SS +N+ S N I K+++ G + + + + Sbjct: 422 SSARNEELS-----NNRNIINKDAWDKVFKAKPGLQNSHVGLDGFKMAFKTNETINSFLS 476 Query: 1277 CSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKLECSDFGQSNPLFPPAEH 1456 SD +D +N DSPCWKG P C SP + E P K +CS P+FP + Sbjct: 477 SSDNVDPNNPGVDSPCWKGVPGSCFSPFGASEDGVPEQIKKLEDCSGLNIHMPMFPLSAG 536 Query: 1457 SGRTSLKKSGEDNLHSHNVYAGI--GLSAPAQGTGTNNYITEEHRTNDVKEKTFEHMDLS 1630 +S K N +N + + GL P + N EH+ ++ + T++ + S Sbjct: 537 ENVSSQKPI--KNAVEYNEFGWLENGLRPPLKRYSVANSAFGEHKWDNSVKTTYD-AETS 593 Query: 1631 SSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDEH--KYGHTKHSLTESFVHSG------ 1786 G + + L++ G ++ L D H + GH + L + Sbjct: 594 HDRGPQSYRDGLHQSGNG------DKSLGLLDDSHAMQQGHGEDGLATEVKQTWSCVADV 647 Query: 1787 -LNLNDTLEGGV--VALDAAENVLRSPASQEDAKQAQPYEVGSSPKLDVQTLVRAIYNLS 1957 LN NDT+E G V ENVL S A K ++ S K+DVQ LV + NLS Sbjct: 648 KLNANDTMEYGSSHVPSHVVENVLCSSAEDAATKLSKSNGEESMLKVDVQMLVDTLKNLS 707 Query: 1958 ELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKIETKDTMISQHDTFEKFGESGPSY 2137 ELL + C C L + D LK I NL C +K +E K + + + TF++ + Y Sbjct: 708 ELLLTNCSNGLCQLKKTDIATLKAVINNLHICISKNVE-KWSPMQESPTFQQ--NTSQCY 764 Query: 2138 MGTGTGHPQFMEEVTWNSCGLGNQPVHEDKSKNNGKTAENSPLLTPVDELGGSNEEQVVQ 2317 H ++P+ + + + + +D + E+++ Q Sbjct: 765 AELSEHHKVLS----------ADRPLSASAPDIQDQVIGSIHVKSDIDVV---KEDKMTQ 811 Query: 2318 AIKKVLNENFLSDE 2359 AIK++L+ENF S+E Sbjct: 812 AIKEILSENFHSEE 825 >ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301835 [Fragaria vesca subsp. 
vesca] Length = 1218 Score = 146 bits (368), Expect = 5e-32 Identities = 203/850 (23%), Positives = 335/850 (39%), Gaps = 88/850 (10%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGST--YTGSVPFGQQSWQQYT-DPSTTGYNFFPKHEIVT------- 223 LAPPFTV+R KP S+ P + + Q Y P++T +N+ P H + Sbjct: 24 LAPPFTVERPVPKPISSPLVESFTPLVEVTEQPYAAPPNSTLHNWLPPHSPSSVPNFFTN 83 Query: 224 -----DSMPTTCMPEFT-------------PTDSVKPSSNNLWSTSNQTANVSTDTYSG- 346 DS+P++ + P +SV S+N +S + +T Sbjct: 84 PPPAFDSVPSSNAYRYAGLPTVDSFSTNLPPMNSVSMPSSNAFSYDQRLDVAATSFVEAK 143 Query: 347 -YYAPYVPSIVTNDSPSAAFNEGLFDAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSV 523 YY Y+ + D+P ++ +D + +S ++ SS +YTQ S +Y S Sbjct: 144 PYYPSYLSPTIHGDNPVVPPDQPSYDWLSTSQFAPLDGSSHKEYTQRPSSSKYTAQWGSS 203 Query: 524 WTKVGD---GKQ-----DVRKGVNAGASFGYMNCTSQ----GNSLEGVNIAGEDSGALSG 667 W + GKQ R N ++ Y N +Q NSL+ + + S Sbjct: 204 WNGPAEWEQGKQGQFDGSFRPKENDVSNLPYNNYLNQEPHSSNSLKSYGV----NEVASH 259 Query: 668 NFTDGVYTGPSSMGHMDAKPYIPQEPVYPSFN-SKTAVGSI-----LPVSCQAGLSLGSS 829 N D + G + H+ K ++ + + + +K +GS+ +P + +G S Sbjct: 260 NIPD--WNGSVNAEHLGDKSFVGRNSKFSPIDFTKPTMGSLSVVPEIPSKAPSSPFIGKS 317 Query: 830 NNYLNYENPFTPHEKFFQPIDSCPRDTTSTSKFSPVVVIRPAPSGSR-------FFAQKT 988 ++ E + D+ D TS SK SP +IRP G++ F + Sbjct: 318 TYGVSCEK---------RQHDASWNDVTSISKSSPASIIRPPAIGTKSSEPKMGLFKRLN 368 Query: 989 DKTGASNSEKSDVCDL----LNKGEETRLPIDSQVEGFALGTGPP--LDFGKIKDIFYAS 1150 A+N++ L + ++P DS G LG P ++ KD + Sbjct: 369 SGRDAANADHGGYYPSQESHLPQSFVDKVPFDSSQLGIHLGRIDPFSVESSSTKDTALPN 428 Query: 1151 S-SIKNQCPSHPCGSN-GIEIAVKESYGSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPC 1324 + SI N H G+ + + G A + + S+ +D +N DSPC Sbjct: 429 NGSISNDPLDHLFKVKPGLPNSHVKPDGFDAAVNINDSINSFLNSSENVDPNNPAVDSPC 488 Query: 1325 WKGAPAFCISPSDSVEAPSPCYFKCKLECSDFGQSNPLFPPAEHSGRTSLKKSGEDNLHS 1504 WKG SP + E P K C+ + P+ S +K E N Sbjct: 489 WKGVRGSRFSPFKASEEGGPEKMKKLEGCNGLNLNMPMIFSLNTCENISTQKPVEYNEFG 548 Query: 1505 H--NVYAGIGLSAPAQGTGTNNYITEEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPS 1678 N G GL P + + N EH+ +D + T+ + + G+ + +N P Sbjct: 549 
WLGNGLLGNGLPLPLKKSSVENSAFGEHKLDDTTKTTY-YRESGHDRGLHGY---INTPH 604 Query: 1679 KGYNLPQYSENDCLSVDEHKY---------GHTKHSLTESFVHSG---LNLNDTLEGGVV 1822 G S + S EH Y G T S ++ LN+NDTLE G Sbjct: 605 SG------SGDKSSSPFEHSYIVQEGCGEGGLTTESKNTTWSVGADVKLNINDTLECGSS 658 Query: 1823 ALDAAENVLRSPASQE-DAKQAQPYEVGSSPKLDVQTLVRAIYNLSELLKSQCLTNACLL 1999 EN SP+ ++ D K Y S+ +D+Q LV + +LSE+L C ++C L Sbjct: 659 HTSPIENTFCSPSVEDADTKLTTSYGEESNMNMDIQMLVNKMNSLSEVLLVNCSNSSCQL 718 Query: 2000 DEQDHDALKHAITNLGACTTKKIETKDTMIS----QHDTFEKFGESGPSYMGTGTGHPQF 2167 ++D DALK I NL +C K E +M Q T + E PQ Sbjct: 719 KKKDIDALKAVINNLNSCILKHDEDFLSMPESPPIQQSTIKYIEELCKPNKALSPDMPQL 778 Query: 2168 ME------EVTWNSCGLGNQPVHEDKSKNNGKTAENSPLLTPVDELGGSNEEQVVQAIKK 2329 + + + G+ H++ KN+ + + + +D + +E++ Q IKK Sbjct: 779 TKIFAPSIQDPLHLQGVQKVKNHDNLVKNDDEVISSVSAKSDIDFV---KQEEMTQDIKK 835 Query: 2330 VLNENFLSDE 2359 +L+ENF +D+ Sbjct: 836 ILSENFHTDD 845 >ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus communis] gi|223539484|gb|EEF41073.1| hypothetical protein RCOM_0756330 [Ricinus communis] Length = 1125 Score = 143 bits (360), Expect = 4e-31 Identities = 199/825 (24%), Positives = 323/825 (39%), Gaps = 58/825 (7%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGSVPFGQQSWQQYTDPSTTGYNFFPKHEIVTDSMPTTCMPE 253 LAPPFTVDRS KP T + S + +P +++F + DS Sbjct: 47 LAPPFTVDRSVPKPLVDLTEPTSY-HHSLHNWVNPHQPEFDYFVIQKPELDSN------- 98 Query: 254 FTPTDSVKPSSNNLWSTSNQTANVSTDTY-------SGYYA-PYVPSIVTNDSPSAAFNE 409 S N ++SN +VSTD+ +G A PY PS T SP+ N+ Sbjct: 99 ---------SYNRYSASSNPHVSVSTDSVLYGQSGVTGLEAKPYYPS--TYISPAIG-ND 146 Query: 410 GLFDAVP--------SSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKG 565 VP S+ +S ++ S DYTQSLSG W+ + DG D + Sbjct: 147 CSLGGVPHHSDYGLLSASRVSTSIGSSEDYTQSLSGQ---------WSGMWDGLTDWLQS 197 Query: 566 VNAGASFGYMNCTSQGNSLEGVNIAGEDSGALSGNFTDGV----YTGPSSMGHMDAKPYI 733 + + + N + G+ + S + D V + +G +D K ++ Sbjct: 198 EQVQLDGSFCSKETYMNQVAGLYASESTSKYEASQSADTVGRETQIESAGVGKLDYKSFL 257 Query: 734 
PQEPVYPSFN----SKTAVGSILPVSCQAGLSLGSSNNYLNYENPFTP-HEKFFQPIDSC 898 + + + S A ++P +C S + N++ N+ P++ +EK + D+ Sbjct: 258 GENRKFTPSDYPTPSSLASTLLVPETCSQVPSKKAVNSW-NHHMPYSASNEKCLRRHDAT 316 Query: 899 PRDTTSTSKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQ 1078 D + SP VVI+P + K T + K C+ + E R I S+ Sbjct: 317 SSDIATILYSSPAVVIKPPEHNKG--SLKNVNTSSDGDNKDFSCNSPSVVVEPRPFITSK 374 Query: 1079 VEGFALGTGPPLDFGKIKDIFYASSSIKNQ-----------CPSHPCGSNGIEIAVKESY 1225 + GK + SS KN+ H G + S Sbjct: 375 GSVCYDASQVSFHLGKTDQVIANFSSAKNEELSSNQNASMDVSGHFAGEKPVIQVPCTSL 434 Query: 1226 GSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKL 1405 G + + + +++LD +N DSPCWKGAP S + EA +P K Sbjct: 435 GGISLVDKNEAIDPAKNHTESLDHYNPAVDSPCWKGAPVSNFSQLEVSEAVTPQNMKNLE 494 Query: 1406 ECSDFGQSN--PLFPPAEHSGRTSLKKSGEDNLH----SHNVYAGIGLSAPAQGT----- 1552 CS ++ + + S +K+ E ++ S Y+ + P Sbjct: 495 ACSGSNHQGYQTFSVSSDDAVKVSPEKTSEKSIQQKGWSLENYSASSMKRPLADNMLHRE 554 Query: 1553 GTNNYITEEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDE 1732 G ++++ N K F + +S + K +D N LPQ + C + Sbjct: 555 GIDHFVN--FGANCTKPSLFHQVQISDDALPNKSFDDSNG-----KLPQNEKQSC---ES 604 Query: 1733 HKYGHTKHSLTE-SFVHSGLNLNDTLE--GGVVALDAAENVLRSPASQEDAKQAQPYEVG 1903 K+ +S S G+N+ND + V A E+VL SP S + A G Sbjct: 605 GKWTTESNSAPVISVADVGMNMNDDPDECSSHVPFHAVEHVLSSPPSADSASIKLTKACG 664 Query: 1904 --SSPKLDVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKIE-- 2071 S+ K ++T++ + NLSELL + C L E D +ALK I+NL C K +E Sbjct: 665 GVSTQKTYIRTVIDTMQNLSELLIFHLSNDLCDLKEDDSNALKGMISNLELCMLKNVERM 724 Query: 2072 --TKDTMISQHDTFEKFGESGPSYMGT-GTGHPQFMEEVTWNSCGLGNQPVHEDKSKNNG 2242 T++++I + D + G+S GT G G + + Q V ++ + ++G Sbjct: 725 TSTQESIIPERDGAQLSGKSSKLQKGTNGNGFLISRSDPLEFQYSVKYQHVQDEHNISSG 784 Query: 2243 KTAENSPLLTPVDELGGS-NEEQVVQAIKKVLNENFLSDEGMQPQ 2374 K E V +++ QAIK L ENF +E +PQ Sbjct: 785 KNDETLSSYVSVRAAADMLKRDKMTQAIKNALTENFHGEEETEPQ 829 >ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628429 [Citrus sinensis] Length = 1065 Score = 139 bits (349), Expect = 7e-30 
Identities = 207/828 (25%), Positives = 341/828 (41%), Gaps = 66/828 (7%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGS----VPFGQQSWQQYTDPSTTGYNFFPKHEIVTDSMPTT 241 LAPPFTVDRS SKP T + ++ + GY+F P T +P Sbjct: 23 LAPPFTVDRSVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYSFNPPS---TAHIP-- 77 Query: 242 CMPEFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYV-PSIVTNDSPSAAFNEGLF 418 P P SS +S+ + + + YY YV P+ T D Sbjct: 78 --PPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPTKYTYD----------- 124 Query: 419 DAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNAGASFGYMN 598 DY QSLS L W K+ G+ K +N Y + Sbjct: 125 -----------------DYAQSLSSLWDASREWEFGRKLELGESFCAKEMNVPDLSIYQD 167 Query: 599 CTSQG-NSLEGVNIAGEDSGALSGNFTDGVYTGPSSMGHMDAKPYIPQ--EPVYPSFNSK 769 QG +S +G+N + + L ++ + G + +D K + Q E + ++ K Sbjct: 168 YADQGAHSSKGLNTFEQKNNNLDMLGSEQ-HQGSINREQLDYKSFTGQISEFMPVEYSRK 226 Query: 770 TAVGSILPVSCQAGLSLGSSNNYLNYENPFTPH-EKFFQPIDSCPRDTTSTSKFSPVVVI 946 + GS L+ +++ P+ EK + P D +S K SPV VI Sbjct: 227 SVHGSTSFFPETYSLTSFEQGRSWSHQTPYGASCEKGAKQHGISPNDISSVKKSSPVHVI 286 Query: 947 R---------PAPSGSRFFAQKTDKTGASNSEKSDVCDLLN-KGEETRLPIDSQVEGFAL 1096 + P +GS + + ASN S++ + E ++ D+ F L Sbjct: 287 KSQAVCSSLSPPSTGSFNNLENSSGAIASNDNLSNMKEFYPLHSSEGKVHFDAGQVSFHL 346 Query: 1097 GTG----PPLDFGKIKDIFYASSSIKNQCPSHPCGSNGIEIAVKESYGSQAPYSSAPPVT 1264 G P L K + + S IK+ P G++I ++ + Sbjct: 347 ERGSHIFPKLPLEKKEKLSSNVSVIKDPLKEKP----GLQIPDIGPGSVSLMLANNGAIN 402 Query: 1265 FTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKLECSDFGQSNPLFP 1444 +E S++LD +N DSPCWKGAP + SP +S P K+E S F Sbjct: 403 CSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVES-SGPVTLQHINKIEACSGSNS---FG 457 Query: 1445 PAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEHR-TNDVKEKTFEHM 1621 P ++SG+ S +K + + + + Y + + + N + EEH +D+K +++ Sbjct: 458 PTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDHDLKTGSYQ-- 515 Query: 1622 DLSSSSGV-LKFSEDLNKPSKGYNLPQYSENDCLSVDEHK--YGHTKHSLT--------E 1768 + SS G+ ++FS+ ++KP + Y S ++ H+ Y ++ LT Sbjct: 516 -MKSSCGLGVQFSDYIDKPRQDYVHANNSADEFKFRPFHQVQYDTVENKLTFERKCELGS 574 Query: 1769 
SFVHSGLNLNDTLEG--GVVALDAAENVLRSPASQE--DAKQAQPYEVGSSPKLDVQTLV 1936 GL++N T EG V L A E+VL SP+S E A+ + + +P++ V+TL+ Sbjct: 575 GVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLI 634 Query: 1937 RAIYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKI----ETKDTMISQHDT 2104 +++NLSELL C + C L E D +ALK + NL C +K++ ++++++Q + Sbjct: 635 SSMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLLTQKSS 694 Query: 2105 FEKFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQP----VHEDKSKN--NGKTAENSPL 2266 E E + G PQ T + + NQP V E +S + GK E Sbjct: 695 -EFIREFPELHEGVTVSSPQ----ETKAAFSVLNQPNYQHVQEQRSPDIAAGKKIEKCSD 749 Query: 2267 LTP-----------------VDELGGSNEEQVVQAIKKVLNENFLSDE 2359 T D+ ++ + QAIKKVL++NF+ +E Sbjct: 750 FTSQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVKEE 797 >ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543534|gb|ESR54512.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 842 Score = 135 bits (339), Expect = 1e-28 Identities = 205/826 (24%), Positives = 337/826 (40%), Gaps = 64/826 (7%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGS----VPFGQQSWQQYTDPSTTGYNFFPKHEIVTDSMPTT 241 LAPPFTVDRS SKP T + ++ + GY+F P T +P Sbjct: 23 LAPPFTVDRSVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYSFNPPS---TAHIP-- 77 Query: 242 CMPEFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYV-PSIVTNDSPSAAFNEGLF 418 P P SS +S+ + + + YY YV P+ T D Sbjct: 78 --PPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPTKYTYD----------- 124 Query: 419 DAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNAGASFGYMN 598 DY QSLS L + W K+ G+ K +N Y + Sbjct: 125 -----------------DYAQSLSSL-WDSREWEFSRKLELGESFCSKEMNVPDLSIYQD 166 Query: 599 CTSQG-NSLEGVNIAGEDSGALSGNFTDGVYTGPSSMGHMDAKPYIPQ--EPVYPSFNSK 769 QG +S +G+N + + L ++ + G + +D K + Q E + ++ K Sbjct: 167 YADQGAHSSKGLNTFEQKNNNLDMLGSEQ-HQGSINREQLDYKSFTGQISEFMPVEYSRK 225 Query: 770 TAVGSILPVSCQAGLSLGSSNNYLNYENPFTPH-EKFFQPIDSCPRDTTSTSKFSPVVVI 946 + GS L+ +++ P+ EK + P D +S K SPV V+ Sbjct: 226 SVHGSTSLFPETYSLTSYEQGRSWSHQTPYGASCEKGAKQHGISPNDISSVKKSSPVHVV 285 Query: 947 
RP-------APSGSRFFAQKTDKTG--ASNSEKSDVCDLLN-KGEETRLPIDSQVEGFAL 1096 + +P + F + +G ASN S++ + E ++ D+ F L Sbjct: 286 KSQAVFTSLSPPSTVSFNNLENSSGVIASNDNLSNMKEFYPLHSSEGKVHFDAGQVSFHL 345 Query: 1097 GTG----PPLDFGKIKDIFYASSSIKNQCPSHPCGSNGIEIAVKESYGSQAPYSSAPPVT 1264 G P L F K + + S IK+ P G++I ++ + Sbjct: 346 ERGSHIFPKLPFEKKEKLSSNVSVIKDPLKEKP----GLQIPDIGPGSVSLMLANNRAIN 401 Query: 1265 FTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKLECSDFGQSNPLFP 1444 +E S++LD +N DSPCWKGAP + SP +S P K+E S Sbjct: 402 CSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVES-SGPVTLQHINKIEACSGSNS---IG 456 Query: 1445 PAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEHRTNDVKEKTFEHMD 1624 P ++SG+ S +K + + + + Y + + + N + EEH + + F M Sbjct: 457 PTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMK 516 Query: 1625 LSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDEHK--YGHTKHSLT--------ESF 1774 S GV +FS+ ++KP + Y S ++ H+ Y ++ LT Sbjct: 517 SSYGLGV-QFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVENKLTFERKCELGSGV 575 Query: 1775 VHSGLNLNDTLEG--GVVALDAAENVLRSPASQE--DAKQAQPYEVGSSPKLDVQTLVRA 1942 GL++N T EG V L A E+VL SP+S E A+ + + +P++ V+TL+ Sbjct: 576 ADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLIST 635 Query: 1943 IYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKI----ETKDTMISQHDTFE 2110 ++NLSELL C + C L E D +ALK + NL C +K++ ++++++Q + E Sbjct: 636 MHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLLTQKSS-E 694 Query: 2111 KFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQP----VHEDKSKN--NGKTAENSPLLT 2272 E + G P + T + + NQP V E +S + GK +E T Sbjct: 695 FIREFPELHEGVTVSSP----KETKAAFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFT 750 Query: 2273 P-----------------VDELGGSNEEQVVQAIKKVLNENFLSDE 2359 D+ ++ + QAIKKVL++NF+ +E Sbjct: 751 SQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEE 796 >ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543533|gb|ESR54511.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 1064 Score = 135 bits (339), Expect = 1e-28 Identities = 205/826 (24%), Positives = 337/826 (40%), Gaps = 64/826 (7%) Frame = +2 
Query: 74 LAPPFTVDRSNSKPGSTYTGS----VPFGQQSWQQYTDPSTTGYNFFPKHEIVTDSMPTT 241 LAPPFTVDRS SKP T + ++ + GY+F P T +P Sbjct: 23 LAPPFTVDRSVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYSFNPPS---TAHIP-- 77 Query: 242 CMPEFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYV-PSIVTNDSPSAAFNEGLF 418 P P SS +S+ + + + YY YV P+ T D Sbjct: 78 --PPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPTKYTYD----------- 124 Query: 419 DAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNAGASFGYMN 598 DY QSLS L + W K+ G+ K +N Y + Sbjct: 125 -----------------DYAQSLSSL-WDSREWEFSRKLELGESFCSKEMNVPDLSIYQD 166 Query: 599 CTSQG-NSLEGVNIAGEDSGALSGNFTDGVYTGPSSMGHMDAKPYIPQ--EPVYPSFNSK 769 QG +S +G+N + + L ++ + G + +D K + Q E + ++ K Sbjct: 167 YADQGAHSSKGLNTFEQKNNNLDMLGSEQ-HQGSINREQLDYKSFTGQISEFMPVEYSRK 225 Query: 770 TAVGSILPVSCQAGLSLGSSNNYLNYENPFTPH-EKFFQPIDSCPRDTTSTSKFSPVVVI 946 + GS L+ +++ P+ EK + P D +S K SPV V+ Sbjct: 226 SVHGSTSLFPETYSLTSYEQGRSWSHQTPYGASCEKGAKQHGISPNDISSVKKSSPVHVV 285 Query: 947 RP-------APSGSRFFAQKTDKTG--ASNSEKSDVCDLLN-KGEETRLPIDSQVEGFAL 1096 + +P + F + +G ASN S++ + E ++ D+ F L Sbjct: 286 KSQAVFTSLSPPSTVSFNNLENSSGVIASNDNLSNMKEFYPLHSSEGKVHFDAGQVSFHL 345 Query: 1097 GTG----PPLDFGKIKDIFYASSSIKNQCPSHPCGSNGIEIAVKESYGSQAPYSSAPPVT 1264 G P L F K + + S IK+ P G++I ++ + Sbjct: 346 ERGSHIFPKLPFEKKEKLSSNVSVIKDPLKEKP----GLQIPDIGPGSVSLMLANNRAIN 401 Query: 1265 FTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKLECSDFGQSNPLFP 1444 +E S++LD +N DSPCWKGAP + SP +S P K+E S Sbjct: 402 CSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVES-SGPVTLQHINKIEACSGSNS---IG 456 Query: 1445 PAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEHRTNDVKEKTFEHMD 1624 P ++SG+ S +K + + + + Y + + + N + EEH + + F M Sbjct: 457 PTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMK 516 Query: 1625 LSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDEHK--YGHTKHSLT--------ESF 1774 S GV +FS+ ++KP + Y S ++ H+ Y ++ LT Sbjct: 517 SSYGLGV-QFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVENKLTFERKCELGSGV 575 Query: 1775 VHSGLNLNDTLEG--GVVALDAAENVLRSPASQE--DAKQAQPYEVGSSPKLDVQTLVRA 1942 GL++N T EG V L A E+VL SP+S E A+ + + 
+P++ V+TL+ Sbjct: 576 ADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLIST 635 Query: 1943 IYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKI----ETKDTMISQHDTFE 2110 ++NLSELL C + C L E D +ALK + NL C +K++ ++++++Q + E Sbjct: 636 MHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLLTQKSS-E 694 Query: 2111 KFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQP----VHEDKSKN--NGKTAENSPLLT 2272 E + G P + T + + NQP V E +S + GK +E T Sbjct: 695 FIREFPELHEGVTVSSP----KETKAAFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFT 750 Query: 2273 P-----------------VDELGGSNEEQVVQAIKKVLNENFLSDE 2359 D+ ++ + QAIKKVL++NF+ +E Sbjct: 751 SQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEE 796 >ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543530|gb|ESR54508.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 1041 Score = 135 bits (339), Expect = 1e-28 Identities = 205/826 (24%), Positives = 337/826 (40%), Gaps = 64/826 (7%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGS----VPFGQQSWQQYTDPSTTGYNFFPKHEIVTDSMPTT 241 LAPPFTVDRS SKP T + ++ + GY+F P T +P Sbjct: 23 LAPPFTVDRSVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYSFNPPS---TAHIP-- 77 Query: 242 CMPEFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYV-PSIVTNDSPSAAFNEGLF 418 P P SS +S+ + + + YY YV P+ T D Sbjct: 78 --PPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPTKYTYD----------- 124 Query: 419 DAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNAGASFGYMN 598 DY QSLS L + W K+ G+ K +N Y + Sbjct: 125 -----------------DYAQSLSSL-WDSREWEFSRKLELGESFCSKEMNVPDLSIYQD 166 Query: 599 CTSQG-NSLEGVNIAGEDSGALSGNFTDGVYTGPSSMGHMDAKPYIPQ--EPVYPSFNSK 769 QG +S +G+N + + L ++ + G + +D K + Q E + ++ K Sbjct: 167 YADQGAHSSKGLNTFEQKNNNLDMLGSEQ-HQGSINREQLDYKSFTGQISEFMPVEYSRK 225 Query: 770 TAVGSILPVSCQAGLSLGSSNNYLNYENPFTPH-EKFFQPIDSCPRDTTSTSKFSPVVVI 946 + GS L+ +++ P+ EK + P D +S K SPV V+ Sbjct: 226 SVHGSTSLFPETYSLTSYEQGRSWSHQTPYGASCEKGAKQHGISPNDISSVKKSSPVHVV 285 Query: 947 RP-------APSGSRFFAQKTDKTG--ASNSEKSDVCDLLN-KGEETRLPIDSQVEGFAL 1096 + +P + F + +G ASN S++ + E ++ D+ F L Sbjct: 286 
KSQAVFTSLSPPSTVSFNNLENSSGVIASNDNLSNMKEFYPLHSSEGKVHFDAGQVSFHL 345 Query: 1097 GTG----PPLDFGKIKDIFYASSSIKNQCPSHPCGSNGIEIAVKESYGSQAPYSSAPPVT 1264 G P L F K + + S IK+ P G++I ++ + Sbjct: 346 ERGSHIFPKLPFEKKEKLSSNVSVIKDPLKEKP----GLQIPDIGPGSVSLMLANNRAIN 401 Query: 1265 FTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKLECSDFGQSNPLFP 1444 +E S++LD +N DSPCWKGAP + SP +S P K+E S Sbjct: 402 CSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVES-SGPVTLQHINKIEACSGSNS---IG 456 Query: 1445 PAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEHRTNDVKEKTFEHMD 1624 P ++SG+ S +K + + + + Y + + + N + EEH + + F M Sbjct: 457 PTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMK 516 Query: 1625 LSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDEHK--YGHTKHSLT--------ESF 1774 S GV +FS+ ++KP + Y S ++ H+ Y ++ LT Sbjct: 517 SSYGLGV-QFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVENKLTFERKCELGSGV 575 Query: 1775 VHSGLNLNDTLEG--GVVALDAAENVLRSPASQE--DAKQAQPYEVGSSPKLDVQTLVRA 1942 GL++N T EG V L A E+VL SP+S E A+ + + +P++ V+TL+ Sbjct: 576 ADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLIST 635 Query: 1943 IYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKI----ETKDTMISQHDTFE 2110 ++NLSELL C + C L E D +ALK + NL C +K++ ++++++Q + E Sbjct: 636 MHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLLTQKSS-E 694 Query: 2111 KFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQP----VHEDKSKN--NGKTAENSPLLT 2272 E + G P + T + + NQP V E +S + GK +E T Sbjct: 695 FIREFPELHEGVTVSSP----KETKAAFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFT 750 Query: 2273 P-----------------VDELGGSNEEQVVQAIKKVLNENFLSDE 2359 D+ ++ + QAIKKVL++NF+ +E Sbjct: 751 SQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEE 796 >ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa] gi|550321678|gb|EEF06077.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa] Length = 1236 Score = 132 bits (333), Expect = 5e-28 Identities = 198/816 (24%), Positives = 325/816 (39%), Gaps = 50/816 (6%) Frame = +2 Query: 77 APPFTVDRSNSKP-----GSTYTGSVPFGQQSW-QQYTDPSTTGYNFFPKHEIVTDSMPT 238 APPFTVDRS +K +TY S+ +W + + + FP + 
DS+P+ Sbjct: 27 APPFTVDRSAAKSLLDLTETTYPVSLNPSLHNWVTSNSHIPNSRPDLFPIPNLEFDSVPS 86 Query: 239 TCMPEFTPTDSVKPSSNNLWSTSNQTA-----NVSTDTYSGYY-APYVPSIVTNDSPSAA 400 ++ + S+ L S S N S YY + YV + +D Sbjct: 87 PPAFGYSSPTQMPSMSHPLVSASTDAVLYVQGNPSIVEAEPYYPSSYVSPAIASDGSLKI 146 Query: 401 FNEGLFDAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNAGA 580 N+ ++ + +S + N SS+ DY+QSL LE+P +W V D Q + ++ G Sbjct: 147 PNQSGYELLSTSHVGTSNGSSRDDYSQSLVVLEHPAQWSGLWEGVTDWHQSKKMQLDGGF 206 Query: 581 SFGYMNCTSQGNSL-----------EGVNIAGEDSGALSGNFTDGVYTGPSSMGHMDAKP 727 S N +QG S G+N+ G + +T +S G MD K Sbjct: 207 S-AKENFINQGFSAFKDISKCEETSLGINVVGRQT-----------HTESASTGQMDYKA 254 Query: 728 YIPQEPVYPSFNSKTAVGSILP-VSCQAGLSLGSS---NNYLNYENPFTPHEKFFQPIDS 895 ++ ++P + T + P V+ QA + SS N+ +N + K + D+ Sbjct: 255 FLGEKPKFMPAGYSTPSPLVFPSVAPQAYPQVPSSNVVNSPINQMPDVILYGKSSRKRDA 314 Query: 896 CPRDTTSTSKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDS 1075 P D+ +K SPVVV+R G ++ K TG EK + + +E I S Sbjct: 315 SPNDSMPVTKPSPVVVVR--SPGQDTYSFKNMNTGCDGDEKGNNSSSV---QEPNPFISS 369 Query: 1076 QVEGFALGTGPPLDFGKIKDIFYASSSIKNQCPSHP-CGSNGIEIAVKESYGSQA----- 1237 + + F + + D SS N+ PS+ + + K ++ Sbjct: 370 EGKVFYDSSQINFHLKQNDDYLAEISSKNNELPSNKNISVDFFDQLFKAKMDNKVLRRNL 429 Query: 1238 -----PYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCK 1402 + E S++LD +N DSPCWKGAP +S + E P K Sbjct: 430 DFFNLAMDGHEAIGSVENTSESLDHYNPAVDSPCWKGAPVSHLSAFEISEVVDPLIPKKV 489 Query: 1403 LECSDFGQSNP-LFPPAEHSGRTSL--KKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYIT 1573 C+ P +FP A + + K+S +H +S + Sbjct: 490 EACNGLSPQGPQIFPSATNDAVKACPEKQSNISVPLNHESLEHQQVSLFKRPLDAKVLFR 549 Query: 1574 EEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYN-LPQYS--ENDCLSVDEHKYG 1744 EE D K + + S + S+ ++ ++ + L ++ + S+++ ++ Sbjct: 550 EE---IDDAGKYGPYQRIPSYCHEAQISDVIDDETRKESILSDFNSLHTEQRSLEDGEWP 606 Query: 1745 HTKHSLTESFVHSGLNLNDTLEGGVVALDAAENVLRSPASQEDAKQAQPYEVG--SSPKL 1918 K+S V +N + V A E VL SP S E A G S K+ Sbjct: 607 SKKNSYVAD-VRRKINDDPDDCSSHVPFHAIEQVLCSPPSSEHAPAQHTQSQGEESLSKM 665 Query: 1919 
DVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTK----KIETKDTM 2086 +TLV ++NL+ELL + C L ++D D LK I NL C +K KI T++++ Sbjct: 666 HARTLVDTMHNLAELLLFYSSNDTCELKDEDFDVLKDVINNLDICISKNLERKISTQESL 725 Query: 2087 ISQHDTFEKFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQPVHEDKSKNNGKTAENSPL 2266 I Q T + G+ Y G F +E + DK K K + + Sbjct: 726 IPQQATSQFHGKLSDLYKGQ-LEFQHFEDE--------EEHKIASDKRKE--KLSNWAST 774 Query: 2267 LTPVDELGGSNEEQVVQAIKKVLNENFLSDEGMQPQ 2374 D + ++ + QAIKKVL +NF +E + Q Sbjct: 775 RCAADTV---KDDNMTQAIKKVLAKNFPIEEESESQ 807 >ref|XP_006441269.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|567897564|ref|XP_006441270.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543531|gb|ESR54509.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543532|gb|ESR54510.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 807 Score = 127 bits (319), Expect = 2e-26 Identities = 178/702 (25%), Positives = 288/702 (41%), Gaps = 37/702 (5%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGS----VPFGQQSWQQYTDPSTTGYNFFPKHEIVTDSMPTT 241 LAPPFTVDRS SKP T + ++ + GY+F P T +P Sbjct: 23 LAPPFTVDRSVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYSFNPPS---TAHIP-- 77 Query: 242 CMPEFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYV-PSIVTNDSPSAAFNEGLF 418 P P SS +S+ + + + YY YV P+ T D Sbjct: 78 --PPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPTKYTYD----------- 124 Query: 419 DAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNAGASFGYMN 598 DY QSLS L + W K+ G+ K +N Y + Sbjct: 125 -----------------DYAQSLSSL-WDSREWEFSRKLELGESFCSKEMNVPDLSIYQD 166 Query: 599 CTSQG-NSLEGVNIAGEDSGALSGNFTDGVYTGPSSMGHMDAKPYIPQ--EPVYPSFNSK 769 QG +S +G+N + + L ++ + G + +D K + Q E + ++ K Sbjct: 167 YADQGAHSSKGLNTFEQKNNNLDMLGSEQ-HQGSINREQLDYKSFTGQISEFMPVEYSRK 225 Query: 770 TAVGSILPVSCQAGLSLGSSNNYLNYENPFTPH-EKFFQPIDSCPRDTTSTSKFSPVVVI 946 + GS L+ +++ P+ EK + P D +S K SPV V+ Sbjct: 226 SVHGSTSLFPETYSLTSYEQGRSWSHQTPYGASCEKGAKQHGISPNDISSVKKSSPVHVV 285 Query: 947 RP-------APSGSRFFAQKTDKTG--ASNSEKSDVCDLLN-KGEETRLPIDSQVEGFAL 
1096 + +P + F + +G ASN S++ + E ++ D+ F L Sbjct: 286 KSQAVFTSLSPPSTVSFNNLENSSGVIASNDNLSNMKEFYPLHSSEGKVHFDAGQVSFHL 345 Query: 1097 GTG----PPLDFGKIKDIFYASSSIKNQCPSHPCGSNGIEIAVKESYGSQAPYSSAPPVT 1264 G P L F K + + S IK+ P G++I ++ + Sbjct: 346 ERGSHIFPKLPFEKKEKLSSNVSVIKDPLKEKP----GLQIPDIGPGSVSLMLANNRAIN 401 Query: 1265 FTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKLECSDFGQSNPLFP 1444 +E S++LD +N DSPCWKGAP + SP +S P K+E S Sbjct: 402 CSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVES-SGPVTLQHINKIEACSGSNS---IG 456 Query: 1445 PAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITEEHRTNDVKEKTFEHMD 1624 P ++SG+ S +K + + + + Y + + + N + EEH + + F M Sbjct: 457 PTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMK 516 Query: 1625 LSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDEHK--YGHTKHSLT--------ESF 1774 S GV +FS+ ++KP + Y S ++ H+ Y ++ LT Sbjct: 517 SSYGLGV-QFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVENKLTFERKCELGSGV 575 Query: 1775 VHSGLNLNDTLEG--GVVALDAAENVLRSPASQE--DAKQAQPYEVGSSPKLDVQTLVRA 1942 GL++N T EG V L A E+VL SP+S E A+ + + +P++ V+TL+ Sbjct: 576 ADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLIST 635 Query: 1943 IYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKI 2068 ++NLSELL C + C L E D +ALK + NL C +K++ Sbjct: 636 MHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRM 677 >ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa] gi|550326088|gb|EEE96055.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa] Length = 1227 Score = 123 bits (308), Expect = 4e-25 Identities = 203/814 (24%), Positives = 329/814 (40%), Gaps = 46/814 (5%) Frame = +2 Query: 71 PLAPPFTVDRSNSKP-----GSTYTGSVPFGQQSWQQYTDPSTTGY-NFFPKHEIVTDSM 232 PLAPPFTVDRS +KP TY S+ +W + FP + +S+ Sbjct: 25 PLAPPFTVDRSVAKPLLDLTEPTYPVSLNPSLHNWATSNSHIPNSRPDLFPLPNLEFNSI 84 Query: 233 PTTCMPEFT-PTDSVKPSSNNLWSTSNQT-----ANVSTDTYSGYY-APYVPSIVTNDSP 391 P+ + ++ PT V ++ L S +N S YY + YV + +D Sbjct: 85 PSPNVFGYSSPTPQVTSKNHPLVLASTDAVLYGQSNPSLVEAVPYYPSSYVSPAIGSDGH 144 Query: 392 SAAFNEGLFDAVPSSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVN 571 
++ ++ + +S + N SS DYTQS GLE+ +W V D Q + ++ Sbjct: 145 LKIPHQSGYELLSNSYVGTSNGSSHDDYTQSSLGLEHATQWSGLWEGVTDWNQSKKLQLD 204 Query: 572 AGASFGYMNCTSQG-NSLEGVNIAGEDSGALSGNFTDGVYTGPSSMGHMDAKPYIPQEPV 748 G N +QG ++ + V+ E S + ++TG +S G +D K ++ ++ Sbjct: 205 GGFC-EKENFINQGFSAFKDVSKCEETSLGID-MVGRQMHTGSASTGQLDYKAFLVEK-- 260 Query: 749 YPSFNSKTAVGSILPVSCQAGLSLGSSNNYLNYEN----PFTPHEKFFQPIDSCPRDTTS 916 P T I P + SS+N +N N T + K + D+ D Sbjct: 261 -PKSMPTTPPSLIFPPTAPQAYPQVSSSNVVNSPNNQMRHVTSYGKSSRKRDASSNDRMP 319 Query: 917 TSKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQVEGFAL 1096 K SP VVIR P G ++ K G EK + + +E I S +G Sbjct: 320 MMKPSPAVVIR--PPGQDRYSFKNINAGTDGDEKDFAGNNTSFAQEPNPFISS--KGKVC 375 Query: 1097 GTGPPLDFG-KIKDIFYASSSIKNQ---CPSHPCGSNGIEIAVKESYGSQAP-------- 1240 ++F K D +A KN + + ++ +E ++ P Sbjct: 376 YDSSQVNFHLKQNDDSFAEVPSKNHEELLSNKNISIDFLDKLFREKMENRVPCKNLDFFN 435 Query: 1241 -----YSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKCKL 1405 + +A V T S++LD + DSPCWKGAP S + E +P + K+ Sbjct: 436 LAMDGHEAAGSVEIT---SESLDHYFPAVDSPCWKGAPVSLPSAFEGSEVVNP---QNKV 489 Query: 1406 E-CSDFGQSNPLFPPA--EHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTGTNNYITE 1576 E C+ P P+ + + +K ++ +N ++ + N + Sbjct: 490 EACNGLNLQGPQISPSTTNDAVKDCPEKQSNISMTFNNESLEHRPASSFKRPLVANVLFR 549 Query: 1577 EHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYS--ENDCLSVDEHKYGHT 1750 E + VK + SS + S+ +++P K LP + S++E ++ Sbjct: 550 EGIDDAVKYGPCQRK--SSYCNEAQISDVIDEPRKESILPDFKPVHTKQKSLEEGEWPSK 607 Query: 1751 KHSLTESFVHSGLNLNDTLEGGVVALDAAENVLRSPASQEDA-KQAQPYEVG-SSPKLDV 1924 K+S + V +N N V A E+VL SP S E A Q +VG SS K+ Sbjct: 608 KNS-DVAGVRRKINDNPDDCSSHVPYHAIEHVLCSPPSSEHAPAQHTQSQVGESSSKMHA 666 Query: 1925 QTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAITNLGACTTKKIE----TKDTMIS 2092 +TLV ++NLSELL + C L ++D D L I NL +K E T++++I Sbjct: 667 RTLVDTMHNLSELLLFYSSNDTCELKDEDFDVLNDVINNLDIFISKNSERKNSTQESLIP 726 Query: 2093 QHDTFEKFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQPVHEDKSKNNGKTAENSPLLT 2272 + T + G+ Y G +F C + + E K K + + T Sbjct: 727 
RRATSQSPGKLSELY----KGQLEFQHFEDEKECKIVSD---ERKEKLSNFVSMRGATDT 779 Query: 2273 PVDELGGSNEEQVVQAIKKVLNENFLSDEGMQPQ 2374 D + V QAIKKVL +NF E + Q Sbjct: 780 VKD-------DNVTQAIKKVLAQNFPIKEESESQ 806 >gb|EOY23726.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 827 Score = 122 bits (307), Expect = 5e-25 Identities = 218/839 (25%), Positives = 320/839 (38%), Gaps = 72/839 (8%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGSVPFGQQ-SWQQYTDPSTTGYNFFPKHEIVTDSMPTTCMP 250 LAPPFTVDRS KP +T V G+ +W DS P T Sbjct: 33 LAPPFTVDRSIPKPAATPL--VDLGEPLNW--------------------LDSNPYT--- 67 Query: 251 EFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYVPSIVTNDSPSAAFNEGLFDAVP 430 F + +L T + N ++D + Y PS V+ FNE Sbjct: 68 -FNSPQPAQLPQLDLEPTPTPSYNQNSDLFEP--KTYYPSYVSPPLHVPTFNE------- 117 Query: 431 SSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNA--GASFGYMNCT 604 QSL GL+ H + W G G D KG A G SF Y+ T Sbjct: 118 ----------------QSLPGLD----HTAQW---GGGLWDWEKGKPAQLGGSF-YLKET 153 Query: 605 SQGNS---LEGVNIAGEDSGALSG--NFTDGVYT------GPSSMGHMDAKPYIPQEPVY 751 S S ++ +N+ S +L + +Y+ GP+++ +D P + Q P + Sbjct: 154 SVAPSSIYMDHINLGAHPSKSLKTCEETSYNIYSPREDQAGPANIEKLDYNPVLGQNPSF 213 Query: 752 -PSFNSKTAVGSILPVSCQAGLS---LGSSNNYLNYENPFTPHEKFFQPIDSCPRDTTST 919 P KT+V +A L L N N+ TP+EK + + D+ + Sbjct: 214 MPVDYLKTSVIGSSSAISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPS 273 Query: 920 SKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQV----EG 1087 K SP VVIRP G+ + ASNS + +T L +++ Sbjct: 274 VKSSPGVVIRPPAVGT--------SSSASNSVSFKNVNTGINATDTNLAGNNRFIVEEPR 325 Query: 1088 FALGTGPPLDFGKIKDIF------YASS---------SIKNQCPSHPCGS-NGIEIAVKE 1219 F G +F I+ F Y S S +N + G+ +G+ ++ Sbjct: 326 FLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNLSRIS 385 Query: 1220 SYGSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKC 1399 + + V E ++LD +N DSPCWKGAPA SP S E P Sbjct: 386 PDNFSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE-PVAVQLAK 444 Query: 1400 KLECSD--------FGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTG 1555 KLE D F SN SG K+GE + N G + + 
Sbjct: 445 KLEACDGSNGLVLKFISSNTANMVKHPSG-----KAGEILMSDENGNVEDGSMSSLKLPP 499 Query: 1556 TNNYITEEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDE- 1732 + +EH ++ K H + +SS+ +KFS++ ++ K Y L S VDE Sbjct: 500 VSIPSFKEHEPDEAG-KAGSHKNKASSACEVKFSDNASEWKKDYVLFDKS------VDEV 552 Query: 1733 HKYGHTKHS-LTESFVHSG-------------LNLNDTLEGGV--VALDAAENVLRSPAS 1864 K HT L E + S + +ND G V+ A +++ +P+S Sbjct: 553 EKASHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSS 612 Query: 1865 QEDAKQAQPYEVGSSP--KLDVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAIT 2038 ED +G P + LV + NLSELL C AC L EQD +L+ I Sbjct: 613 VEDVSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVIN 672 Query: 2039 NLGACTTKKIETKDTMISQHDTF----EKFGES---GPSYMGTGTGHPQFMEEVTWNSCG 2197 NL C +K I + + H + +K G+ + GT TG PQ + Sbjct: 673 NLDTCMSKNIGQETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQV---AAIDVLS 729 Query: 2198 LGNQPVHEDKSKNNGKTAENSPLLTPVDELGGSNEEQVVQAIKKVLNENFLSDEGMQPQ 2374 Q + K + K +E + + D +++ QAIKKVL ENF E PQ Sbjct: 730 QHTQVKRKHFGKKDEKCSEFVSVRSGTDI--KVKNDKMTQAIKKVLIENFHEKEETHPQ 786 >gb|EOY23723.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1068 Score = 122 bits (307), Expect = 5e-25 Identities = 218/839 (25%), Positives = 320/839 (38%), Gaps = 72/839 (8%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGSVPFGQQ-SWQQYTDPSTTGYNFFPKHEIVTDSMPTTCMP 250 LAPPFTVDRS KP +T V G+ +W DS P T Sbjct: 22 LAPPFTVDRSIPKPAATPL--VDLGEPLNW--------------------LDSNPYT--- 56 Query: 251 EFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYVPSIVTNDSPSAAFNEGLFDAVP 430 F + +L T + N ++D + Y PS V+ FNE Sbjct: 57 -FNSPQPAQLPQLDLEPTPTPSYNQNSDLFEP--KTYYPSYVSPPLHVPTFNE------- 106 Query: 431 SSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNA--GASFGYMNCT 604 QSL GL+ H + W G G D KG A G SF Y+ T Sbjct: 107 ----------------QSLPGLD----HTAQW---GGGLWDWEKGKPAQLGGSF-YLKET 142 Query: 605 SQGNS---LEGVNIAGEDSGALSG--NFTDGVYT------GPSSMGHMDAKPYIPQEPVY 751 S S ++ +N+ S +L + +Y+ GP+++ +D P + Q P + Sbjct: 143 SVAPSSIYMDHINLGAHPSKSLKTCEETSYNIYSPREDQAGPANIEKLDYNPVLGQNPSF 202 Query: 752 
-PSFNSKTAVGSILPVSCQAGLS---LGSSNNYLNYENPFTPHEKFFQPIDSCPRDTTST 919 P KT+V +A L L N N+ TP+EK + + D+ + Sbjct: 203 MPVDYLKTSVIGSSSAISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPS 262 Query: 920 SKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQV----EG 1087 K SP VVIRP G+ + ASNS + +T L +++ Sbjct: 263 VKSSPGVVIRPPAVGT--------SSSASNSVSFKNVNTGINATDTNLAGNNRFIVEEPR 314 Query: 1088 FALGTGPPLDFGKIKDIF------YASS---------SIKNQCPSHPCGS-NGIEIAVKE 1219 F G +F I+ F Y S S +N + G+ +G+ ++ Sbjct: 315 FLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNLSRIS 374 Query: 1220 SYGSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKC 1399 + + V E ++LD +N DSPCWKGAPA SP S E P Sbjct: 375 PDNFSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE-PVAVQLAK 433 Query: 1400 KLECSD--------FGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTG 1555 KLE D F SN SG K+GE + N G + + Sbjct: 434 KLEACDGSNGLVLKFISSNTANMVKHPSG-----KAGEILMSDENGNVEDGSMSSLKLPP 488 Query: 1556 TNNYITEEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDE- 1732 + +EH ++ K H + +SS+ +KFS++ ++ K Y L S VDE Sbjct: 489 VSIPSFKEHEPDEAG-KAGSHKNKASSACEVKFSDNASEWKKDYVLFDKS------VDEV 541 Query: 1733 HKYGHTKHS-LTESFVHSG-------------LNLNDTLEGGV--VALDAAENVLRSPAS 1864 K HT L E + S + +ND G V+ A +++ +P+S Sbjct: 542 EKASHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSS 601 Query: 1865 QEDAKQAQPYEVGSSP--KLDVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAIT 2038 ED +G P + LV + NLSELL C AC L EQD +L+ I Sbjct: 602 VEDVSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVIN 661 Query: 2039 NLGACTTKKIETKDTMISQHDTF----EKFGES---GPSYMGTGTGHPQFMEEVTWNSCG 2197 NL C +K I + + H + +K G+ + GT TG PQ + Sbjct: 662 NLDTCMSKNIGQETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQV---AAIDVLS 718 Query: 2198 LGNQPVHEDKSKNNGKTAENSPLLTPVDELGGSNEEQVVQAIKKVLNENFLSDEGMQPQ 2374 Q + K + K +E + + D +++ QAIKKVL ENF E PQ Sbjct: 719 QHTQVKRKHFGKKDEKCSEFVSVRSGTDI--KVKNDKMTQAIKKVLIENFHEKEETHPQ 775 >gb|EOY23722.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1017 Score = 122 bits (307), Expect = 
5e-25 Identities = 218/839 (25%), Positives = 320/839 (38%), Gaps = 72/839 (8%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGSVPFGQQ-SWQQYTDPSTTGYNFFPKHEIVTDSMPTTCMP 250 LAPPFTVDRS KP +T V G+ +W DS P T Sbjct: 33 LAPPFTVDRSIPKPAATPL--VDLGEPLNW--------------------LDSNPYT--- 67 Query: 251 EFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYVPSIVTNDSPSAAFNEGLFDAVP 430 F + +L T + N ++D + Y PS V+ FNE Sbjct: 68 -FNSPQPAQLPQLDLEPTPTPSYNQNSDLFEP--KTYYPSYVSPPLHVPTFNE------- 117 Query: 431 SSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNA--GASFGYMNCT 604 QSL GL+ H + W G G D KG A G SF Y+ T Sbjct: 118 ----------------QSLPGLD----HTAQW---GGGLWDWEKGKPAQLGGSF-YLKET 153 Query: 605 SQGNS---LEGVNIAGEDSGALSG--NFTDGVYT------GPSSMGHMDAKPYIPQEPVY 751 S S ++ +N+ S +L + +Y+ GP+++ +D P + Q P + Sbjct: 154 SVAPSSIYMDHINLGAHPSKSLKTCEETSYNIYSPREDQAGPANIEKLDYNPVLGQNPSF 213 Query: 752 -PSFNSKTAVGSILPVSCQAGLS---LGSSNNYLNYENPFTPHEKFFQPIDSCPRDTTST 919 P KT+V +A L L N N+ TP+EK + + D+ + Sbjct: 214 MPVDYLKTSVIGSSSAISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPS 273 Query: 920 SKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQV----EG 1087 K SP VVIRP G+ + ASNS + +T L +++ Sbjct: 274 VKSSPGVVIRPPAVGT--------SSSASNSVSFKNVNTGINATDTNLAGNNRFIVEEPR 325 Query: 1088 FALGTGPPLDFGKIKDIF------YASS---------SIKNQCPSHPCGS-NGIEIAVKE 1219 F G +F I+ F Y S S +N + G+ +G+ ++ Sbjct: 326 FLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNLSRIS 385 Query: 1220 SYGSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKC 1399 + + V E ++LD +N DSPCWKGAPA SP S E P Sbjct: 386 PDNFSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE-PVAVQLAK 444 Query: 1400 KLECSD--------FGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTG 1555 KLE D F SN SG K+GE + N G + + Sbjct: 445 KLEACDGSNGLVLKFISSNTANMVKHPSG-----KAGEILMSDENGNVEDGSMSSLKLPP 499 Query: 1556 TNNYITEEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDE- 1732 + +EH ++ K H + +SS+ +KFS++ ++ K Y L S VDE Sbjct: 500 VSIPSFKEHEPDEAG-KAGSHKNKASSACEVKFSDNASEWKKDYVLFDKS------VDEV 552 Query: 1733 
HKYGHTKHS-LTESFVHSG-------------LNLNDTLEGGV--VALDAAENVLRSPAS 1864 K HT L E + S + +ND G V+ A +++ +P+S Sbjct: 553 EKASHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSS 612 Query: 1865 QEDAKQAQPYEVGSSP--KLDVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAIT 2038 ED +G P + LV + NLSELL C AC L EQD +L+ I Sbjct: 613 VEDVSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVIN 672 Query: 2039 NLGACTTKKIETKDTMISQHDTF----EKFGES---GPSYMGTGTGHPQFMEEVTWNSCG 2197 NL C +K I + + H + +K G+ + GT TG PQ + Sbjct: 673 NLDTCMSKNIGQETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQV---AAIDVLS 729 Query: 2198 LGNQPVHEDKSKNNGKTAENSPLLTPVDELGGSNEEQVVQAIKKVLNENFLSDEGMQPQ 2374 Q + K + K +E + + D +++ QAIKKVL ENF E PQ Sbjct: 730 QHTQVKRKHFGKKDEKCSEFVSVRSGTDI--KVKNDKMTQAIKKVLIENFHEKEETHPQ 786 >gb|EOY23721.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508776468|gb|EOY23724.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1079 Score = 122 bits (307), Expect = 5e-25 Identities = 218/839 (25%), Positives = 320/839 (38%), Gaps = 72/839 (8%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGSVPFGQQ-SWQQYTDPSTTGYNFFPKHEIVTDSMPTTCMP 250 LAPPFTVDRS KP +T V G+ +W DS P T Sbjct: 33 LAPPFTVDRSIPKPAATPL--VDLGEPLNW--------------------LDSNPYT--- 67 Query: 251 EFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYVPSIVTNDSPSAAFNEGLFDAVP 430 F + +L T + N ++D + Y PS V+ FNE Sbjct: 68 -FNSPQPAQLPQLDLEPTPTPSYNQNSDLFEP--KTYYPSYVSPPLHVPTFNE------- 117 Query: 431 SSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNA--GASFGYMNCT 604 QSL GL+ H + W G G D KG A G SF Y+ T Sbjct: 118 ----------------QSLPGLD----HTAQW---GGGLWDWEKGKPAQLGGSF-YLKET 153 Query: 605 SQGNS---LEGVNIAGEDSGALSG--NFTDGVYT------GPSSMGHMDAKPYIPQEPVY 751 S S ++ +N+ S +L + +Y+ GP+++ +D P + Q P + Sbjct: 154 SVAPSSIYMDHINLGAHPSKSLKTCEETSYNIYSPREDQAGPANIEKLDYNPVLGQNPSF 213 Query: 752 -PSFNSKTAVGSILPVSCQAGLS---LGSSNNYLNYENPFTPHEKFFQPIDSCPRDTTST 919 P KT+V +A L L N N+ TP+EK + + D+ + Sbjct: 214 MPVDYLKTSVIGSSSAISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPS 273 Query: 920 
SKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQV----EG 1087 K SP VVIRP G+ + ASNS + +T L +++ Sbjct: 274 VKSSPGVVIRPPAVGT--------SSSASNSVSFKNVNTGINATDTNLAGNNRFIVEEPR 325 Query: 1088 FALGTGPPLDFGKIKDIF------YASS---------SIKNQCPSHPCGS-NGIEIAVKE 1219 F G +F I+ F Y S S +N + G+ +G+ ++ Sbjct: 326 FLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNLSRIS 385 Query: 1220 SYGSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKC 1399 + + V E ++LD +N DSPCWKGAPA SP S E P Sbjct: 386 PDNFSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE-PVAVQLAK 444 Query: 1400 KLECSD--------FGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTG 1555 KLE D F SN SG K+GE + N G + + Sbjct: 445 KLEACDGSNGLVLKFISSNTANMVKHPSG-----KAGEILMSDENGNVEDGSMSSLKLPP 499 Query: 1556 TNNYITEEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDE- 1732 + +EH ++ K H + +SS+ +KFS++ ++ K Y L S VDE Sbjct: 500 VSIPSFKEHEPDEAG-KAGSHKNKASSACEVKFSDNASEWKKDYVLFDKS------VDEV 552 Query: 1733 HKYGHTKHS-LTESFVHSG-------------LNLNDTLEGGV--VALDAAENVLRSPAS 1864 K HT L E + S + +ND G V+ A +++ +P+S Sbjct: 553 EKASHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSS 612 Query: 1865 QEDAKQAQPYEVGSSP--KLDVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAIT 2038 ED +G P + LV + NLSELL C AC L EQD +L+ I Sbjct: 613 VEDVSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVIN 672 Query: 2039 NLGACTTKKIETKDTMISQHDTF----EKFGES---GPSYMGTGTGHPQFMEEVTWNSCG 2197 NL C +K I + + H + +K G+ + GT TG PQ + Sbjct: 673 NLDTCMSKNIGQETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQV---AAIDVLS 729 Query: 2198 LGNQPVHEDKSKNNGKTAENSPLLTPVDELGGSNEEQVVQAIKKVLNENFLSDEGMQPQ 2374 Q + K + K +E + + D +++ QAIKKVL ENF E PQ Sbjct: 730 QHTQVKRKHFGKKDEKCSEFVSVRSGTDI--KVKNDKMTQAIKKVLIENFHEKEETHPQ 786 >gb|EOY23725.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 1059 Score = 122 bits (305), Expect = 9e-25 Identities = 217/832 (26%), Positives = 319/832 (38%), Gaps = 65/832 (7%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGSVPFGQQ-SWQQYTDPSTTGYNFFPKHEIVTDSMPTTCMP 250 LAPPFTVDRS KP +T V G+ 
+W DS P T Sbjct: 33 LAPPFTVDRSIPKPAATPL--VDLGEPLNW--------------------LDSNPYT--- 67 Query: 251 EFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYVPSIVTNDSPSAAFNEGLFDAVP 430 F + +L T + N ++D + Y PS V+ FNE Sbjct: 68 -FNSPQPAQLPQLDLEPTPTPSYNQNSDLFEP--KTYYPSYVSPPLHVPTFNE------- 117 Query: 431 SSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNA--GASFGYMNCT 604 QSL GL+ H + W G G D KG A G SF Y+ T Sbjct: 118 ----------------QSLPGLD----HTAQW---GGGLWDWEKGKPAQLGGSF-YLKET 153 Query: 605 SQGNS---LEGVNIAGEDSGALSG--NFTDGVYT------GPSSMGHMDAKPYIPQEPVY 751 S S ++ +N+ S +L + +Y+ GP+++ +D P + Q P + Sbjct: 154 SVAPSSIYMDHINLGAHPSKSLKTCEETSYNIYSPREDQAGPANIEKLDYNPVLGQNPSF 213 Query: 752 -PSFNSKTAVGSILPVSCQAGLS---LGSSNNYLNYENPFTPHEKFFQPIDSCPRDTTST 919 P KT+V +A L L N N+ TP+EK + + D+ + Sbjct: 214 MPVDYLKTSVIGSSSAISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPS 273 Query: 920 SKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQV----EG 1087 K SP VVIRP G+ + ASNS + +T L +++ Sbjct: 274 VKSSPGVVIRPPAVGT--------SSSASNSVSFKNVNTGINATDTNLAGNNRFIVEEPR 325 Query: 1088 FALGTGPPLDFGKIKDIF------YASS---------SIKNQCPSHPCGS-NGIEIAVKE 1219 F G +F I+ F Y S S +N + G+ +G+ ++ Sbjct: 326 FLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNLSRIS 385 Query: 1220 SYGSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKC 1399 + + V E ++LD +N DSPCWKGAPA SP S E P Sbjct: 386 PDNFSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE-PVAVQLAK 444 Query: 1400 KLECSD--------FGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTG 1555 KLE D F SN SG K+GE + N G + + Sbjct: 445 KLEACDGSNGLVLKFISSNTANMVKHPSG-----KAGEILMSDENGNVEDGSMSSLKLPP 499 Query: 1556 TNNYITEEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDE- 1732 + +EH ++ K H + +SS+ +KFS++ ++ K Y L S VDE Sbjct: 500 VSIPSFKEHEPDEAG-KAGSHKNKASSACEVKFSDNASEWKKDYVLFDKS------VDEV 552 Query: 1733 HKYGHTKHS-LTESFVHSG-------------LNLNDTLEGGV--VALDAAENVLRSPAS 1864 K HT L E + S + +ND G V+ A +++ +P+S Sbjct: 553 EKASHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSS 612 Query: 1865 
QEDAKQAQPYEVGSSP--KLDVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAIT 2038 ED +G P + LV + NLSELL C AC L EQD +L+ I Sbjct: 613 VEDVSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVIN 672 Query: 2039 NLGACTTKKIETKDTMISQHDTFEKFGESGPSYMGTGTGHPQFMEEVTWNSCGLGNQPVH 2218 NL C +K I ++T++S+ + GT TG PQ + Q Sbjct: 673 NLDTCMSKNI-GQETLLSE------------LHKGTSTGSPQV---AAIDVLSQHTQVKR 716 Query: 2219 EDKSKNNGKTAENSPLLTPVDELGGSNEEQVVQAIKKVLNENFLSDEGMQPQ 2374 + K + K +E + + D +++ QAIKKVL ENF E PQ Sbjct: 717 KHFGKKDEKCSEFVSVRSGTDI--KVKNDKMTQAIKKVLIENFHEKEETHPQ 766 >gb|EOY23728.1| Uncharacterized protein isoform 8, partial [Theobroma cacao] Length = 828 Score = 110 bits (276), Expect = 2e-21 Identities = 195/744 (26%), Positives = 287/744 (38%), Gaps = 65/744 (8%) Frame = +2 Query: 74 LAPPFTVDRSNSKPGSTYTGSVPFGQQ-SWQQYTDPSTTGYNFFPKHEIVTDSMPTTCMP 250 LAPPFTVDRS KP +T V G+ +W DS P T Sbjct: 22 LAPPFTVDRSIPKPAATPL--VDLGEPLNW--------------------LDSNPYT--- 56 Query: 251 EFTPTDSVKPSSNNLWSTSNQTANVSTDTYSGYYAPYVPSIVTNDSPSAAFNEGLFDAVP 430 F + +L T + N ++D + Y PS V+ FNE Sbjct: 57 -FNSPQPAQLPQLDLEPTPTPSYNQNSDLFEP--KTYYPSYVSPPLHVPTFNE------- 106 Query: 431 SSGNISVNVSSQVDYTQSLSGLEYPVSHWSVWTKVGDGKQDVRKGVNA--GASFGYMNCT 604 QSL GL+ H + W G G D KG A G SF Y+ T Sbjct: 107 ----------------QSLPGLD----HTAQW---GGGLWDWEKGKPAQLGGSF-YLKET 142 Query: 605 SQGNS---LEGVNIAGEDSGALSG--NFTDGVYT------GPSSMGHMDAKPYIPQEPVY 751 S S ++ +N+ S +L + +Y+ GP+++ +D P + Q P + Sbjct: 143 SVAPSSIYMDHINLGAHPSKSLKTCEETSYNIYSPREDQAGPANIEKLDYNPVLGQNPSF 202 Query: 752 -PSFNSKTAVGSILPVSCQAGLS---LGSSNNYLNYENPFTPHEKFFQPIDSCPRDTTST 919 P KT+V +A L L N N+ TP+EK + + D+ + Sbjct: 203 MPVDYLKTSVIGSSSAISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPS 262 Query: 920 SKFSPVVVIRPAPSGSRFFAQKTDKTGASNSEKSDVCDLLNKGEETRLPIDSQV----EG 1087 K SP VVIRP G+ + ASNS + +T L +++ Sbjct: 263 VKSSPGVVIRPPAVGT--------SSSASNSVSFKNVNTGINATDTNLAGNNRFIVEEPR 314 Query: 1088 FALGTGPPLDFGKIKDIF------YASS---------SIKNQCPSHPCGS-NGIEIAVKE 1219 F G +F I+ F Y S S +N + G+ +G+ ++ Sbjct: 315 
FLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNLSRIS 374 Query: 1220 SYGSQAPYSSAPPVTFTEKCSDALDLHNLNEDSPCWKGAPAFCISPSDSVEAPSPCYFKC 1399 + + V E ++LD +N DSPCWKGAPA SP S E P Sbjct: 375 PDNFSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE-PVAVQLAK 433 Query: 1400 KLECSD--------FGQSNPLFPPAEHSGRTSLKKSGEDNLHSHNVYAGIGLSAPAQGTG 1555 KLE D F SN SG K+GE + N G + + Sbjct: 434 KLEACDGSNGLVLKFISSNTANMVKHPSG-----KAGEILMSDENGNVEDGSMSSLKLPP 488 Query: 1556 TNNYITEEHRTNDVKEKTFEHMDLSSSSGVLKFSEDLNKPSKGYNLPQYSENDCLSVDE- 1732 + +EH ++ K H + +SS+ +KFS++ ++ K Y L S VDE Sbjct: 489 VSIPSFKEHEPDEAG-KAGSHKNKASSACEVKFSDNASEWKKDYVLFDKS------VDEV 541 Query: 1733 HKYGHTKHS-LTESFVHSG-------------LNLNDTLEGGV--VALDAAENVLRSPAS 1864 K HT L E + S + +ND G V+ A +++ +P+S Sbjct: 542 EKASHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSS 601 Query: 1865 QEDAKQAQPYEVGSSP--KLDVQTLVRAIYNLSELLKSQCLTNACLLDEQDHDALKHAIT 2038 ED +G P + LV + NLSELL C AC L EQD +L+ I Sbjct: 602 VEDVSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVIN 661 Query: 2039 NLGACTTKKIETKDTMISQHDTFE 2110 NL C +K I ++T++S+ D E Sbjct: 662 NLDTCMSKNI-GQETLLSELDLSE 684