BLASTX nr result
ID: Akebia23_contig00001954
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00001954 (3834 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853... 337 3e-89 ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Popu... 234 2e-58 ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Popu... 233 4e-58 ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prun... 231 2e-57 ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma... 229 6e-57 ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma... 228 2e-56 ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma... 228 2e-56 ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus c... 222 1e-54 gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis] 221 2e-54 ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma... 199 8e-48 ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301... 199 8e-48 ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628... 179 1e-41 ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citr... 178 2e-41 ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citr... 177 3e-41 ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citr... 155 1e-34 ref|XP_007039225.1| Uncharacterized protein isoform 6 [Theobroma... 143 5e-31 ref|XP_004235030.1| PREDICTED: uncharacterized protein LOC101252... 132 2e-27 ref|XP_006347526.1| PREDICTED: uncharacterized protein LOC102592... 125 1e-25 ref|XP_002300521.2| hypothetical protein POPTR_0001s45660g [Popu... 124 3e-25 ref|XP_006347527.1| PREDICTED: uncharacterized protein LOC102592... 122 1e-24 >ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853355 [Vitis vinifera] gi|302143995|emb|CBI23100.3| unnamed protein product [Vitis vinifera] Length = 1167 Score = 337 bits (863), Expect = 3e-89 Identities = 354/1275 (27%), Positives = 519/1275 (40%), Gaps = 163/1275 (12%) Frame = +3 Query: 219 LAPPFTVGA-YSIPKKNSLGHFVEHPY---FDESHHYLGFGSSPFDNWPPLQPDTSLPNP 386 LAPPFTV S P N L +F E Y F+ S H NW Q S P+ Sbjct: 21 LAPPFTVDRPVSKPLSNPLVNFTESTYAAPFNSSLH----------NWVHPQSPVSRPDY 70 Query: 387 FNDPNLRTDAIGGMSPPYEGTYGLPT-----------PTLNYYST---------PSIT-- 500 F++PN D++ P Y P L++ + P ++ Sbjct: 71 FSNPNSAVDSVQATGVPPSNAYRYSVSQPVNSPVVHLPPLSHIVSGIAHLPPLSPIVSAG 130 Query: 501 --------------------RPYFPEYPSTTVHDNSSSVVFNEP---------------- 572 +PY+P Y + + DNS VV NEP Sbjct: 131 TDVFSFGQCSDRMKTSLVEAKPYYPPYVAPAIEDNSPLVVLNEPNYDLLSTSHAAHLNGS 190 Query: 573 -------QSQSGKMAPLSPGGNLWTGMSGEEQWRR---------KDKNEAGSLNYNSSLN 704 QS SG P S W G++ EQ ++ K+ N GS Y S +N Sbjct: 191 SSLDDYTQSMSGLEYP-SRWCGFWNGLADIEQGKKVELDESLCSKESNFVGSSIYRSYIN 249 Query: 705 QGTPAVDGLSKCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNM 884 QG P +G+S E+ S + K VD + R +G S D + K V + Sbjct: 250 QGDPTAEGVSNSEEGSVLSDRKYVDILGRDNCVGSLSPDHFNNKSFYEPKANPMVVSLDF 309 Query: 885 SRTSVSDPTSLFQEMPYHPVTSSESVLKSWSHLNPNTASYERCVSDLDSIATNPTVFYPS 1064 RTS TS+ E P+ S E V SW++ P +A YE+C +DS +P S Sbjct: 310 PRTSFLGSTSVLPETPHPRAPSLEPVTNSWNYRKPQSALYEKCFRKIDSCVDDPV----S 365 Query: 1065 SIYSSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDAC-DNKEASSS---------RTQV 1214 SS A V P NS + S +N F C DN E S V Sbjct: 366 KAKSSPAIVIRPPANSPS------SLGVNSFSSRNMICTDNSENVSGHHLSNMEEPHIPV 419 Query: 1215 HTEGKEGCGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITD 1394 +EG+E DTS+ N + N+ SM+S K N + + ++L + + QI Sbjct: 420 ISEGRELYSDTSQLNGHWQRNDHLSMESSSTKKHELLNNEMGVKETDNLLRARSELQIPH 479 Query: 1395 VNLPNAFTLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAPTSQHFPCGITEA---- 1562 +N+ + F+ ETLD +N +DSPCWKG+ TS P ++EA Sbjct: 480 LNVEDGFSF-SPNSIEAVNSIDNTSETLDHYNPAVDSPCWKGSITSHFSPFEVSEALSPH 538 Query: 1563 ------GRFNGLNLQGPQILPFDA-------------NLVYSENGSAEDGLSYFPKRSSP 1685 +G NLQG I P ++ N Y +N E+GL KR S Sbjct: 539 NLMEQLEALDGFNLQGHHIFPLNSDDAVNVSSLKPNENTEYHKNVCGENGLLPSWKRPSV 598 Query: 1686 DVSLSTEKYELKDVVKAG-------SDHSEMRNENVIQCFFPSNLKNCSETKS---SQMM 1835 S E+ L D K G S + ++IQ +L N S++ + S M Sbjct: 599 VNHPSREQRSL-DAFKTGPYCQKLSSGDGNQSSNDIIQPKRDHSLLNSSKSDNLELSHTM 657 Query: 1836 KMNQNEDK--------------DPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIE 1973 + + E K G ++ D + D SSH T+ E IS SGD + Sbjct: 658 RQSFEEVKFTSERKLSSGVGVEVTGNNINDVSRDGSSHETYHLTENISCSPLSGDDASTK 717 Query: 1974 LGEPLVGASDTLGGCQLPRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLN 2150 L + S P+I + +++N + +LS LLLS+ S++ +LKE DH L+ V++ Sbjct: 718 LTKQPASES-------TPKIDVHMLINTVQDLSVLLLSHCSDNAFSLKEQDHETLKRVID 770 Query: 2151 NLDAYVSKVG----------LVRPIPESDLSQSGTFSFGKLTVPNESQITNAEANDVQNQ 2300 N DA ++K G + +P+ + S S ++ GK A+AN V++Q Sbjct: 771 NFDACLTKKGQKIAEQGSSHFLGELPDLNKSASASWPLGKKV---------ADAN-VEDQ 820 Query: 2301 CDFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIKKALMENFHEEDE-H 2477 + +D IQAI+K L +NFH+E+E Sbjct: 821 FHCQSDHKGKRHCSVSGNKDEKLSDFVSLVNDEDTVNDDSTIQAIRKILDKNFHDEEETD 880 Query: 2478 PQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQTKDVVGMPINADKLSGSKI 2657 PQ L++NLWLE+EAALCS++ + RF RMK++MEK +T+D++ I+ +K S SK+ Sbjct: 881 PQALLYRNLWLEAEAALCSISYRARFDRMKIEMEKF-KLRKTEDLLKNTIDVEKQSSSKV 939 Query: 2658 SVDLYRDDMVASKTKGSVTSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRVDNSISI 2837 S D+ D + + + +I+ D+ + S A D V+DR+ ILK R +NS S+ Sbjct: 940 SSDISMVDKFEREAQENPVPDITIEDSPNVTTMSHAAD----VVDRFHILKRRYENSDSL 995 Query: 2838 NKEEKLPDPVDEAKAAPCSMEDKAQSARLKFEMERCNQNHVEDVVGLPINVDQLQSSKVS 3017 N ++ V + S KVS Sbjct: 996 NSKD----------------------------------------------VGKQSSCKVS 1009 Query: 3018 FVLDGDDMAASKAKGTVTPNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISIN 3197 ++ DD A AK +PNIST ++QS + VM RFRIL+ R D + +N Sbjct: 1010 HDMNSDDNLAPAAKDDHSPNIST--STQSDD----------VMARFRILKCRADKSNPMN 1057 Query: 3198 KEGRLPNPCDIGAYPGSESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDEADLDP 3377 E + P PE VD+ FA + W FI+ R ED L P Sbjct: 1058 AERQQP-----------------------PEEVDLEFAGKGSHWMFIK-DRVED-VTLGP 1092 Query: 3378 DLS-EKQDPVKD-FSTFVSD--------------DEPETQFYDTDRFGTWVPTSGYDSPS 3509 DL + KD F +++ D D+P Q ++R +P D S Sbjct: 1093 DLQVHIANHTKDRFDSYLDDFDCEIVKEFHEHAMDDPVIQLPRSNRLQNQLPAGFSDGSS 1152 Query: 3510 SDWEHVLKEELAWPN 3554 +DWEHVLKEEL N Sbjct: 1153 ADWEHVLKEELPGGN 1167 >ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa] gi|550321678|gb|EEF06077.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa] Length = 1236 Score = 234 bits (597), Expect = 2e-58 Identities = 335/1307 (25%), Positives = 503/1307 (38%), Gaps = 181/1307 (13%) Frame = +3 Query: 165 YGMGYKGXXXXXXXXXXPLAPPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFGSSPFD 344 YG G APPFTV + L + S H +S Sbjct: 8 YGYNNGGGSSSSSSNLSASAPPFTVDRSAAKSLLDLTETTYPVSLNPSLHNWVTSNSHIP 67 Query: 345 NWPPLQPDTSLPNPFNDPNLRTDAIGGMSPPYEGTYGLPT--PTLNY------------- 479 N S P+ F PNL D++ SPP G Y PT P++++ Sbjct: 68 N--------SRPDLFPIPNLEFDSVP--SPPAFG-YSSPTQMPSMSHPLVSASTDAVLYV 116 Query: 480 YSTPSITR--PYFPE-YPSTTVHDNSSSVVFNEPQSQ--SGKMAPLSPGGNL-------- 620 PSI PY+P Y S + + S + N+ + S S G + Sbjct: 117 QGNPSIVEAEPYYPSSYVSPAIASDGSLKIPNQSGYELLSTSHVGTSNGSSRDDYSQSLV 176 Query: 621 -------WTGM-SGEEQWRRKDKNE--AGSLNYNSSLNQGTPAVDGLSKCEDSSAVWLGK 770 W+G+ G W + K + G + +NQG A +SKCE++S LG Sbjct: 177 VLEHPAQWSGLWEGVTDWHQSKKMQLDGGFSAKENFINQGFSAFKDISKCEETS---LG- 232 Query: 771 CVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTSLFQEMPY-HPVT 947 ++ + R+ H S +D K F + S P P +P Sbjct: 233 -INVVGRQTHTESASTGQMDYKAFLGEKPK-----FMPAGYSTPSPLVFPSVAPQAYPQV 286 Query: 948 SSESVLKSWSHLNPNTASYERCVSDLDSIATNPT-VFYPSSIYSSLAHVYNSPGNSSTFE 1124 S +V+ S + P+ Y + D+ + V PS + V SPG + Sbjct: 287 PSSNVVNSPINQMPDVILYGKSSRKRDASPNDSMPVTKPSPVV-----VVRSPGQDT--- 338 Query: 1125 KEFLSENMNHFEKPWDACDNKEASSSRTQVH-------TEGKEGCGDTSRTNNGFDVNNQ 1283 + +NMN CD E ++ + V +EGK D+S+ N N+ Sbjct: 339 --YSFKNMN------TGCDGDEKGNNSSSVQEPNPFISSEGKVFY-DSSQINFHLKQNDD 389 Query: 1284 FSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAFTLVHTXXXXXXXXXXX 1463 + + K + +I+ + F+ L+ K +++ NL F + Sbjct: 390 YLAEISSKNNELPSNKNISVDFFDQLFKAKMDNKVLRRNLD--FFNLAMDGHEAIGSVEN 447 Query: 1464 XXETLDQFNSVLDSPCWKGAPTSQHFPCGITEA---------GRFNGLNLQGPQILPFDA 1616 E+LD +N +DSPCWKGAP S I+E NGL+ QGPQI P Sbjct: 448 TSESLDHYNPAVDSPCWKGAPVSHLSAFEISEVVDPLIPKKVEACNGLSPQGPQIFPSAT 507 Query: 1617 NLVYSENGSAEDGLSYFPKRSSPD---VSLSTE--------KYELKDVVKAG-------- 1739 N + +S S + VSL + E+ D K G Sbjct: 508 NDAVKACPEKQSNISVPLNHESLEHQQVSLFKRPLDAKVLFREEIDDAGKYGPYQRIPSY 567 Query: 1740 ------SD--HSEMRNENVIQCFFPSNLKNCSETKSSQMMKMNQNEDKDPGIDVKDAALD 1895 SD E R E+++ F + + S K N D + D D Sbjct: 568 CHEAQISDVIDDETRKESILSDFNSLHTEQRSLEDGEWPSKKNSYV-ADVRRKINDDPDD 626 Query: 1896 DSSHVTFGDGERISRLLSSGDSVPIELGEPLVGASDTLGGCQLPRIGLRVVVNAMHNLSE 2075 SSHV F E++ S + P + + + G L ++ R +V+ MHNL+E Sbjct: 627 CSSHVPFHAIEQVLCSPPSSEHAPAQ-------HTQSQGEESLSKMHARTLVDTMHNLAE 679 Query: 2076 LLLSYSE-DVNALKEHDHVVLRHVLNNLDAYVSKVGLVRPIP--ESDLSQSGTFSF-GKL 2243 LLL YS D LK+ D VL+ V+NNLD +SK L R I ES + Q T F GKL Sbjct: 680 LLLFYSSNDTCELKDEDFDVLKDVINNLDICISK-NLERKISTQESLIPQQATSQFHGKL 738 Query: 2244 TVPNESQITNAEANDVQNQCDFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDM 2423 + + + Q +F E + + +D+M Sbjct: 739 S------------DLYKGQLEFQHFEDEEEHKIASDKRKEKLSNWASTRCAADTVKDDNM 786 Query: 2424 IQAIKKALMENFH-EEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQ 2600 QAIKK L +NF EE+ Q L++NLWLE+EA+LCS+ RF RMK++MEK ++Q Sbjct: 787 TQAIKKVLAKNFPIEEESESQILLYRNLWLEAEASLCSVNYMARFNRMKIEMEKGHSQKA 846 Query: 2601 TK--------------------DVVGMPI-NADKLSGSKISVDLYRDDMVA--------- 2690 + D G P+ + L S +S + + DD++A Sbjct: 847 NEKSMVLENLSRPKVSSDILPADDKGSPVQDVSFLDSSILSRNSHSDDVMARFHILKSRV 906 Query: 2691 ------------------------------SKTKGSVTSNISTHDTSQSSNTSCAEDVEA 2780 TK S N+S D+ S +S A+DV + Sbjct: 907 DDSNSMSTSAVEKLSSSKVSPDLNLVDKLACDTKDSTKPNVSIQDSHMSGTSSNADDVSS 966 Query: 2781 ---SVMDRYRILKCRVDNSIS-----------------INKEEKLPDPVDEAKAAPCSME 2900 V+ R+ ILKCRVDNS S +NK +K+ ++ +++ Sbjct: 967 HADDVIARFHILKCRVDNSSSGNTSAMEKLSSSKVSPDLNKVDKMVYDTKDSTKPHITIQ 1026 Query: 2901 DKAQSAR-------------LKFEMERCNQNHVEDVVGLPINVDQLQSSKVSFVLDGDDM 3041 D + R L+ ++ CN ++ + LP SSKVS L Sbjct: 1027 DSPMAGRSSHADDVMARFRTLEGRVDNCNSVNISAMEKLP-------SSKVSSNLSNVGK 1079 Query: 3042 AASKAKGTVTPNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNP 3221 +AK + P+I+ D+ S S ED+++++M R IL+ R + S+ E P Sbjct: 1080 LTVEAKDSTKPDITKQDSPLPSTSSHAEDIEAAIMARLLILKHRDGCSSSLEMEEHQPES 1139 Query: 3222 CDIGAYPGSESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDEADLDPDLSEKQDP 3401 D G Y G ++D+ P IR + D +E + Sbjct: 1140 IDNG-YTSLRRDVPMGKGGLKDSILDVNME------PVIR--------NYPADSAEDKST 1184 Query: 3402 VKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEEL 3542 VK+F FV+DD +TQ T+RFG YDS SSDWEHVLKEE+ Sbjct: 1185 VKEFRLFVNDD-AKTQSSLTNRFGDQPHAGWYDSCSSDWEHVLKEEI 1230 >ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa] gi|550326088|gb|EEE96055.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa] Length = 1227 Score = 233 bits (595), Expect = 4e-58 Identities = 341/1298 (26%), Positives = 502/1298 (38%), Gaps = 171/1298 (13%) Frame = +3 Query: 165 YGMGYKGXXXXXXXXXXPLAPPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFGSSP-F 341 YG G PLAPPFTV V P D + +P Sbjct: 8 YGYNNGGGSSFSSSNLSPLAPPFTVD-----------RSVAKPLLDLTEPTYPVSLNPSL 56 Query: 342 DNWPPLQPD--TSLPNPFNDPNLRTDAIGGMSPPYEGTYGLPTP---------------- 467 NW S P+ F PNL ++I SP G Y PTP Sbjct: 57 HNWATSNSHIPNSRPDLFPLPNLEFNSIP--SPNVFG-YSSPTPQVTSKNHPLVLASTDA 113 Query: 468 TLNYYSTPSITR--PYFPE-YPSTTVHDN--------------SSSVVFNEPQSQSGKMA 596 L S PS+ PY+P Y S + + S+S V S Sbjct: 114 VLYGQSNPSLVEAVPYYPSSYVSPAIGSDGHLKIPHQSGYELLSNSYVGTSNGSSHDDYT 173 Query: 597 PLSPG---GNLWTGM-SGEEQWRRKDKNE--AGSLNYNSSLNQGTPAVDGLSKCEDSSAV 758 S G W+G+ G W + K + G + +NQG A +SKCE++S Sbjct: 174 QSSLGLEHATQWSGLWEGVTDWNQSKKLQLDGGFCEKENFINQGFSAFKDVSKCEETS-- 231 Query: 759 WLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTSLFQEMP-- 932 LG +D + R+ H G S LD K P +M T P+ +F Sbjct: 232 -LG--IDMVGRQMHTGSASTGQLDYKAFLVEK------PKSMPTTP---PSLIFPPTAPQ 279 Query: 933 YHPVTSSESVLKSWSHLNPNTASYERCVSDLDSIATNPTVFYPSSIYSSLAHVYNSPGNS 1112 +P SS +V+ S ++ + SY + D+ + + + S A V PG Sbjct: 280 AYPQVSSSNVVNSPNNQMRHVTSYGKSSRKRDASSNDRMPM----MKPSPAVVIRPPGQD 335 Query: 1113 STFEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTEGKEGCGDTSRTNNGFDVNNQFSM 1292 K ++ + EK + + A + ++GK C D+S+ N N+ Sbjct: 336 RYSFKN-INAGTDGDEKDFAGNNTSFAQEPNPFISSKGKV-CYDSSQVNFHLKQNDDSFA 393 Query: 1293 DSLMKKDFSSYRN-SITDEPFNHLYNPKFKSQITDVNLPNAFTLVHTXXXXXXXXXXXXX 1469 + K N +I+ + + L+ K ++++ NL F + Sbjct: 394 EVPSKNHEELLSNKNISIDFLDKLFREKMENRVPCKNLD--FFNLAMDGHEAAGSVEITS 451 Query: 1470 ETLDQFNSVLDSPCWKGAPTSQHF---------PCGITEAGRFNGLNLQGPQILPFDANL 1622 E+LD + +DSPCWKGAP S P EA NGLNLQGPQI P N Sbjct: 452 ESLDHYFPAVDSPCWKGAPVSLPSAFEGSEVVNPQNKVEA--CNGLNLQGPQISPSTTND 509 Query: 1623 VYSENGSAEDGLSYFPKRSS-----------PDVSLSTEKYELKDVVKAGSDH------- 1748 + + +S S P V+ + + D VK G Sbjct: 510 AVKDCPEKQSNISMTFNNESLEHRPASSFKRPLVANVLFREGIDDAVKYGPCQRKSSYCN 569 Query: 1749 --------SEMRNENVIQCFFPSNLKNCSETKSSQMMKMNQNEDKDPGI--DVKDAALDD 1898 E R E+++ F P + K S + K N + G+ + D D Sbjct: 570 EAQISDVIDEPRKESILPDFKPVHTKQKSLEEGEWPSKKNSDV---AGVRRKINDNPDDC 626 Query: 1899 SSHVTFGDGERISRLLSSGDSVPIELGEPLVGASDTLGGCQLPRIGLRVVVNAMHNLSEL 2078 SSHV + E + S + P + + VG S + ++ R +V+ MHNLSEL Sbjct: 627 SSHVPYHAIEHVLCSPPSSEHAPAQHTQSQVGESSS-------KMHARTLVDTMHNLSEL 679 Query: 2079 LLSYSE-DVNALKEHDHVVLRHVLNNLDAYVSKVGLVRPIPESDL--SQSGTFSFGKLTV 2249 LL YS D LK+ D VL V+NNLD ++SK + + L ++ + S GKL+ Sbjct: 680 LLFYSSNDTCELKDEDFDVLNDVINNLDIFISKNSERKNSTQESLIPRRATSQSPGKLSE 739 Query: 2250 PNESQITNAEANDVQNQCDFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQ 2429 + Q+ D + +C + E K + +D++ Q Sbjct: 740 LYKGQLEFQHFED-EKECKIVSDERKEKLS-----------NFVSMRGATDTVKDDNVTQ 787 Query: 2430 AIKKALMENFH-EEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQTK 2606 AIKK L +NF +E+ Q L+KNLWLE+EA+LC + C RF R+K+++EK ++Q + Sbjct: 788 AIKKVLAQNFPIKEESESQILLYKNLWLEAEASLCVVNCMDRFNRLKIEIEKGSSQKVNE 847 Query: 2607 DVVGMPI------NADKLSGSKISVDL-----------------------YRDDMVA--- 2690 P+ + L G K+S D+ + DD++A Sbjct: 848 FSSAAPVVPENSMIMENLLGPKVSSDILPAEDEGSPVHNVPDSSILSRNSHSDDVMARFH 907 Query: 2691 ---SKTKGSVTSNIS-------------------THDTSQSSNTSC--------AEDVEA 2780 S+ S + N S HDT SS + A Sbjct: 908 IIKSRVDDSNSLNTSAMDLSSPKVSPDLNKVDKFAHDTKDSSKSHISFQDSIRGASSHAD 967 Query: 2781 SVMDRYRILKCRVDNSISIN--------KEEKLPDPVDEAKAAPCSMEDKAQSARLKFEM 2936 +VMDR+ ILKCRV+NS S+N PD K A +D S ++ Sbjct: 968 NVMDRFHILKCRVENSSSVNTATGGILASSMVSPDQNQVDKLAH-DTKDSIMSYTIQDSP 1026 Query: 2937 ERCNQNHVEDVV-------GLPIN--------VDQLQSSKVSFVLDGDDMAASKAKGTVT 3071 +H +DV+ G N V++L SSKVS L+ K ++ Sbjct: 1027 MSGRSSHADDVMTRFCILNGRDDNSNSVTISAVEKLSSSKVSSDLNKVSKLTDDTKDSIK 1086 Query: 3072 PNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNPCDIGAYPGSE 3251 +++T D+S SS S EDV++SV IL+ R N+ S++ E D G Y Sbjct: 1087 ADVTTQDSSMSSASSQAEDVEASV-----ILKHRDGNSSSLDMEEHQRVSIDNG-YMDLI 1140 Query: 3252 SVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDEADLDPDLSEKQDPVKDFSTFVSD 3431 + +D+ P I R D +E + VK+F F++D Sbjct: 1141 RLARMNKDGTKDRTLDVNME------PLIPNFRA--------DSTEDKPTVKEFRLFIND 1186 Query: 3432 DEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEELA 3545 D ETQ TDRFG YDS SSDWEHVLKEELA Sbjct: 1187 D-VETQSRLTDRFGDQSHAGWYDSCSSDWEHVLKEELA 1223 >ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica] gi|462417047|gb|EMJ21784.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica] Length = 1254 Score = 231 bits (590), Expect = 2e-57 Identities = 338/1295 (26%), Positives = 529/1295 (40%), Gaps = 187/1295 (14%) Frame = +3 Query: 219 LAPPFTVGAYSIPKKNS--LGHFVEHPYF----DESHHYLGF-----GSSPFDNWPPLQP 365 LAPPFTV S+PK S L E PY SH++L GS+ F N P Sbjct: 25 LAPPFTVDR-SVPKPISSPLVDVTETPYVAPLNSSSHNWLPSHPPITGSNFFAN--PTPE 81 Query: 366 DTSLPNP----------FNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSI-TRPYF 512 SLP+ + PN + ++P + +L+ +T + +PY+ Sbjct: 82 FNSLPSSNAYRYAGSQIVDPPNTTLPPLNTITPASSNAFTYDQ-SLDAVATSFVEAKPYY 140 Query: 513 PEYPSTTVHDNSSSVVFNEPQSQ---SGKMAPL------------------SPGGNLWTG 629 P Y S T+H +S VV ++P + APL + G LW G Sbjct: 141 PSYLSPTIHGDSPLVVPDQPSYDWLSTTHFAPLDGCSRKDYTQRPPDLKYTAQWGGLWNG 200 Query: 630 MSGEEQWRRKD---------KNEAGSLNYNSSLNQGTPAVDGLSKCEDSS--AVWLGKCV 776 +S EQ ++ D + +GS Y + +NQ + + L+ E++S LG Sbjct: 201 LSEWEQGKQGDFDGSFCSKKTDVSGSFLYKNFMNQEPHSSNSLNSFEEASHGINTLGWEK 260 Query: 777 DSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTSLFQEMPYHPVTSSE 956 + H+G KS+ ++K TP + S+ SV S+ E P+ SS+ Sbjct: 261 PGGSGNAHLGDKSLVGKNSKF----------TPSDFSK-SVMGSLSVVPE-PHLKAPSSQ 308 Query: 957 SVLKSWSHLNPNTASYERCVSDLDSIATNPTVFYPSSIY-SSLAHVYNSP--GNSSTFEK 1127 V K+ + P + S E LD+ ++ Y +SI SS A +P G + Sbjct: 309 CVTKTSNCKTPYSVSSE--TQQLDA-----SLDYITSISESSPAFATRTPALGTKLSEPG 361 Query: 1128 EFLSENMNHFEKPWDACDNKEASSSRTQVH----TEGKEGCGDTSRTNNGFDVNNQFSMD 1295 L +N D SS + H +EGK D+S+ + FS + Sbjct: 362 TGLFRRLNFISDAADTDHGDYYSSGVQESHLPQISEGKV-LFDSSQLGFHLGAKDCFSAE 420 Query: 1296 S--LMKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAFTLVHTXXXXXXXXXXXXX 1469 S ++ S+ RN I + ++ ++ K Q + V L T Sbjct: 421 SSSARNEELSNNRNIINKDAWDKVFKAKPGLQNSHVGLDGFKMAFKTNETINSFLSSS-- 478 Query: 1470 ETLDQFNSVLDSPCWKGAPTSQHFPCGITEAG---------RFNGLNLQGPQI------- 1601 + +D N +DSPCWKG P S P G +E G +GLN+ P Sbjct: 479 DNVDPNNPGVDSPCWKGVPGSCFSPFGASEDGVPEQIKKLEDCSGLNIHMPMFPLSAGEN 538 Query: 1602 ----LPFDANLVYSENGSAEDGLSYFPKRSSPDVSLSTEKYELKDVVKAGSDHSEMRNEN 1769 P + Y+E G E+GL KR S S E ++ + VK D +E ++ Sbjct: 539 VSSQKPIKNAVEYNEFGWLENGLRPPLKRYSVANSAFGE-HKWDNSVKTTYD-AETSHDR 596 Query: 1770 VIQCFFPSNLKNCSETKS------SQMMKMNQNED-------------KDPGIDVKDAAL 1892 Q + ++ + KS S M+ ED D ++ D Sbjct: 597 GPQSYRDGLHQSGNGDKSLGLLDDSHAMQQGHGEDGLATEVKQTWSCVADVKLNANDTME 656 Query: 1893 DDSSHVTFGDGERISRLLSSGDSVPIELGEPLVGASDTLGGCQLPRIGLRVVVNAMHNLS 2072 SSHV E + L SS + +L S + G + ++ ++++V+ + NLS Sbjct: 657 YGSSHVPSHVVENV--LCSSAEDAATKL-------SKSNGEESMLKVDVQMLVDTLKNLS 707 Query: 2073 ELLLSY-SEDVNALKEHDHVVLRHVLNNLDAYVSK-VGLVRPIPESDLSQSGTFS-FGKL 2243 ELLL+ S + LK+ D L+ V+NNL +SK V P+ ES Q T + +L Sbjct: 708 ELLLTNCSNGLCQLKKTDIATLKAVINNLHICISKNVEKWSPMQESPTFQQNTSQCYAEL 767 Query: 2244 T----VPNESQITNAEANDVQNQCDFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXX 2411 + V + + +A A D+Q+Q Sbjct: 768 SEHHKVLSADRPLSASAPDIQDQV----------------------IGSIHVKSDIDVVK 805 Query: 2412 NDDMIQAIKKALMENFHEEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNN 2591 D M QAIK+ L ENFH E+ PQ L+KNLWLE+EA LCS+ K RF R+K++M+KC Sbjct: 806 EDKMTQAIKEILSENFHSEETDPQVLLYKNLWLEAEAVLCSINYKARFNRVKIEMDKCKA 865 Query: 2592 QHQTKDVVGMPINADKLSGSKISVDLYRDDMVASKTKGSVTSNISTHDTSQSSNTSCAED 2771 ++ +KDV + K S S++S D + + + +G TSN+ Sbjct: 866 EN-SKDVFEYTADMMKQSKSEVSPDSNPVNPLTPEAQGCPTSNVPDLPILSQ-------- 916 Query: 2772 VEASVMDRYRILKCRVDNSISIN-------KEEKLPDPVDEAKAAP---------CSMED 2903 E V+ R+ IL+ RV+N+ SIN + P+P + AP S++D Sbjct: 917 -EDEVLARFDILRGRVENTNSINASNAAELSSKASPEPSKVERIAPEANGTPSPGISIQD 975 Query: 2904 KAQSARL----KFEMERCNQNHV------EDVVGLPINVDQLQSSKVSFVLDGDDMAASK 3053 + S+ + +E + H+ + +N+++ S KVS D + + Sbjct: 976 SSISSTIGVTDDYEASVMARFHILRDRVEKSKFISAVNMEEPSSPKVSLEPKTDVIVPDR 1035 Query: 3054 AKGTVTPNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISINKEG-RLPNP-CD 3227 G+ + D+ S S D ++SVM R IL+SR+DN ++ EG +LP P + Sbjct: 1036 NDGSASEFNLFQDSPPSITTSHANDCEASVMSRLHILKSRVDNCSDMHTEGQQLPEPKIE 1095 Query: 3228 IGAYPGSESVQ-NFGIK---------------------------------------PQLP 3287 + A S+S+ F I+ QLP Sbjct: 1096 VIAPDTSDSLMPEFSIQDSPVSRATSQANDCEASVMSRLHILKSRVDNSSYMHREGKQLP 1155 Query: 3288 ELVDMGFAERRVKWPFIRMMRTEDEADL---------DPDLSE-KQDPVKDFSTFVSDDE 3437 E+ +G A +R WP I +D+ D SE K D K+F FV DD Sbjct: 1156 EIGGLGNAGKRHPWPIISKRSEGGSSDIKEQPILRSFKADNSEGKLDTAKEFHLFVEDD- 1214 Query: 3438 PETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEEL 3542 P TQ++ + +P G+D+ SSDWEHV+KEE+ Sbjct: 1215 PLTQYFRIHKPANQLPAGGHDNSSSDWEHVMKEEI 1249 >ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508776469|gb|EOY23725.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 1059 Score = 229 bits (585), Expect = 6e-57 Identities = 313/1196 (26%), Positives = 478/1196 (39%), Gaps = 84/1196 (7%) Frame = +3 Query: 219 LAPPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFGSSPFDNWPPLQPDT-SLPNPFND 395 LAPPFTV SIPK + P D LG P NW P T + P P Sbjct: 33 LAPPFTVDR-SIPKPAAT------PLVD-----LG---EPL-NWLDSNPYTFNSPQPAQL 76 Query: 396 PNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRP--YFPEYPSTTVHDNSSSVVFNE 569 P L + PTPT +Y + P Y+P Y S +H + FNE Sbjct: 77 PQLDLE---------------PTPTPSYNQNSDLFEPKTYYPSYVSPPLHVPT----FNE 117 Query: 570 PQSQSGKMAPLSPGGNLWTGMSGEE-----QWRRKDKNEAGSLNYNSSLNQGTPAVDGLS 734 QS G GG LW G+ + K+ + A S Y +N G L Sbjct: 118 -QSLPGLDHTAQWGGGLWDWEKGKPAQLGGSFYLKETSVAPSSIYMDHINLGAHPSKSLK 176 Query: 735 KCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTS 914 CE++S R+ GP +I+ LD + P + +TSV +S Sbjct: 177 TCEETSYNIYSP------REDQAGPANIEKLD--YNPVLGQNPSFMPVDYLKTSVIGSSS 228 Query: 915 LFQEM-----PYHPVTSSESVLKSWSHLNPNTASYERCVSD-LDSIATNP-TVFYPSSIY 1073 E P + V + ++ + + +SD + S+ ++P V P ++ Sbjct: 229 AISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVVIRPPAVG 288 Query: 1074 SSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTE------GKEG 1235 +S +S NS +F+ N A D A ++R V G + Sbjct: 289 TS-----SSASNSVSFKNVNTGIN---------ATDTNLAGNNRFIVEEPRFLFNFGSKN 334 Query: 1236 CGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAF 1415 D + + D N S +S + S RN +D F S+I+ N AF Sbjct: 335 EFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNL-SRISPDNFSLAF 393 Query: 1416 TLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAPTSQHFPCGITE------AGRFNG 1577 E+LD +N +DSPCWKGAP S + P G +E A + Sbjct: 394 E-----NNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEA 448 Query: 1578 LNLQGPQILPFDAN----------------LVYSENGSAEDGLSYFPKRSSPDVSL-STE 1706 + +L F ++ L+ ENG+ EDG K P VS+ S + Sbjct: 449 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKL--PPVSIPSFK 506 Query: 1707 KYELKDVVKAGS---------------DHSEMRNENVIQCFFPSNLKNCSETK------- 1820 ++E + KAGS + SE + + V+ ++ S T Sbjct: 507 EHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEG 566 Query: 1821 --SSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLVG 1994 +S+ + ++ D + + D + SSHV+ + +S SS + V + Sbjct: 567 RLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK------- 619 Query: 1995 ASDTLGGCQLPRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAYVS 2171 + LG + + V+V+ M NLSELLL + S + L+E D L V+NNLD +S Sbjct: 620 HTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMS 679 Query: 2172 KVGLVRPIPESDLSQSGTFSFGKLTVPNESQITNAEANDVQNQCDFNASEPKMHSTMXXX 2351 K + G+ T+ +E + + D + ++ Sbjct: 680 K------------------NIGQETLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKHFGK 721 Query: 2352 XXXXXXXXXXXXXXXXXXXXNDDMIQAIKKALMENFHEEDE-HPQTSLFKNLWLESEAAL 2528 ND M QAIKK L+ENFHE++E HPQ L+KNLWLE+EAAL Sbjct: 722 KDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAAL 781 Query: 2529 CSMACKTRFARMKMKMEKCNNQHQTKDVVGMPINADKLSGSKISVDLYRDDMVASKTKGS 2708 CS+ R+ MK+++EKC + KD+ + DK+S SK+S DL + + + + + Sbjct: 782 CSINYMARYNNMKIEIEKCKLDTE-KDLSEDTPDEDKISRSKLSADLDTNKKLTAIAESA 840 Query: 2709 VTSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRVDNSISINKEEKLPDPVDEAKAAP 2888 T ++S + +S+++ A+DV A R+ +LK R++NS S+ Sbjct: 841 PTLDVSNQNFPIASSSNHADDVTA----RFHVLKHRLNNSYSV----------------- 879 Query: 2889 CSMEDKAQSARLKFEMERCNQNHVEDVVGLPINVDQLQSSKVSFVLDGDDMAASKAKGTV 3068 H D D+L SSK+S D D A++ K + Sbjct: 880 ----------------------HTRD-------ADELSSSKLSLDSDAVDKLATEVKDSS 910 Query: 3069 TPNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNPCDIGAYPGS 3248 T ++ T D+ T+DV++S+M R IL+SR + + N+ + P Sbjct: 911 TSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQKP----------- 959 Query: 3249 ESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDEAD-------LDP-------DLS 3386 LPE+VD+GFA ++ + P ED AD L+ D + Sbjct: 960 -----------LPEVVDLGFAGKKKQIPI-----DEDTADDGVLGFNLESVSQNQVVDYA 1003 Query: 3387 EKQDPVKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEELAWPN 3554 +Q VKDF V D Q + R G + YDS SSDWEHVLKEEL+ N Sbjct: 1004 GEQSVVKDFHLCVKHD-CTIQSPKSTRLGNQLSAGWYDSCSSDWEHVLKEELSGQN 1058 >ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508776467|gb|EOY23723.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1068 Score = 228 bits (581), Expect = 2e-56 Identities = 317/1203 (26%), Positives = 485/1203 (40%), Gaps = 91/1203 (7%) Frame = +3 Query: 219 LAPPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFGSSPFDNWPPLQPDT-SLPNPFND 395 LAPPFTV SIPK + P D LG P NW P T + P P Sbjct: 22 LAPPFTVDR-SIPKPAAT------PLVD-----LG---EPL-NWLDSNPYTFNSPQPAQL 65 Query: 396 PNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRP--YFPEYPSTTVHDNSSSVVFNE 569 P L + PTPT +Y + P Y+P Y S +H + FNE Sbjct: 66 PQLDLE---------------PTPTPSYNQNSDLFEPKTYYPSYVSPPLHVPT----FNE 106 Query: 570 PQSQSGKMAPLSPGGNLWTGMSGEE-----QWRRKDKNEAGSLNYNSSLNQGTPAVDGLS 734 QS G GG LW G+ + K+ + A S Y +N G L Sbjct: 107 -QSLPGLDHTAQWGGGLWDWEKGKPAQLGGSFYLKETSVAPSSIYMDHINLGAHPSKSLK 165 Query: 735 KCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTS 914 CE++S R+ GP +I+ LD + P + +TSV +S Sbjct: 166 TCEETSYNIYSP------REDQAGPANIEKLD--YNPVLGQNPSFMPVDYLKTSVIGSSS 217 Query: 915 LFQEM-----PYHPVTSSESVLKSWSHLNPNTASYERCVSD-LDSIATNP-TVFYPSSIY 1073 E P + V + ++ + + +SD + S+ ++P V P ++ Sbjct: 218 AISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVVIRPPAVG 277 Query: 1074 SSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTE------GKEG 1235 +S +S NS +F+ N A D A ++R V G + Sbjct: 278 TS-----SSASNSVSFKNVNTGIN---------ATDTNLAGNNRFIVEEPRFLFNFGSKN 323 Query: 1236 CGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAF 1415 D + + D N S +S + S RN +D F S+I+ N AF Sbjct: 324 EFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNL-SRISPDNFSLAF 382 Query: 1416 TLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAPTSQHFPCGITE------AGRFNG 1577 E+LD +N +DSPCWKGAP S + P G +E A + Sbjct: 383 E-----NNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEA 437 Query: 1578 LNLQGPQILPFDAN----------------LVYSENGSAEDGLSYFPKRSSPDVSL-STE 1706 + +L F ++ L+ ENG+ EDG K P VS+ S + Sbjct: 438 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKL--PPVSIPSFK 495 Query: 1707 KYELKDVVKAGS---------------DHSEMRNENVIQCFFPSNLKNCSETK------- 1820 ++E + KAGS + SE + + V+ ++ S T Sbjct: 496 EHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEG 555 Query: 1821 --SSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLVG 1994 +S+ + ++ D + + D + SSHV+ + +S SS + V + Sbjct: 556 RLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK------- 608 Query: 1995 ASDTLGGCQLPRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAYVS 2171 + LG + + V+V+ M NLSELLL + S + L+E D L V+NNLD +S Sbjct: 609 HTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMS 668 Query: 2172 K-VGLVRPIPESDLSQSGTFSF------GKLTVPNESQITNAEANDVQNQCDFNASEPKM 2330 K +G E+ LS+ F G+ ++ +E + + D + ++ Sbjct: 669 KNIG-----QETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQHTQV 723 Query: 2331 HSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIKKALMENFHEEDE-HPQTSLFKNLW 2507 ND M QAIKK L+ENFHE++E HPQ L+KNLW Sbjct: 724 KRKHFGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLW 783 Query: 2508 LESEAALCSMACKTRFARMKMKMEKCNNQHQTKDVVGMPINADKLSGSKISVDLYRDDMV 2687 LE+EAALCS+ R+ MK+++EKC + KD+ + DK+S SK+S DL + + Sbjct: 784 LEAEAALCSINYMARYNNMKIEIEKCKLDTE-KDLSEDTPDEDKISRSKLSADLDTNKKL 842 Query: 2688 ASKTKGSVTSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRVDNSISINKEEKLPDPV 2867 + + + T ++S + +S+++ A+DV A R+ +LK R++NS S+ Sbjct: 843 TAIAESAPTLDVSNQNFPIASSSNHADDVTA----RFHVLKHRLNNSYSV---------- 888 Query: 2868 DEAKAAPCSMEDKAQSARLKFEMERCNQNHVEDVVGLPINVDQLQSSKVSFVLDGDDMAA 3047 H D D+L SSK+S D D A Sbjct: 889 -----------------------------HTRD-------ADELSSSKLSLDSDAVDKLA 912 Query: 3048 SKAKGTVTPNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNPCD 3227 ++ K + T ++ T D+ T+DV++S+M R IL+SR + + N+ + P Sbjct: 913 TEVKDSSTSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQKP---- 968 Query: 3228 IGAYPGSESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDEAD-------LDP--- 3377 LPE+VD+GFA ++ + P ED AD L+ Sbjct: 969 ------------------LPEVVDLGFAGKKKQIPI-----DEDTADDGVLGFNLESVSQ 1005 Query: 3378 ----DLSEKQDPVKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEELA 3545 D + +Q VKDF V D Q + R G + YDS SSDWEHVLKEEL+ Sbjct: 1006 NQVVDYAGEQSVVKDFHLCVKHD-CTIQSPKSTRLGNQLSAGWYDSCSSDWEHVLKEELS 1064 Query: 3546 WPN 3554 N Sbjct: 1065 GQN 1067 >ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590674635|ref|XP_007039223.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508776465|gb|EOY23721.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508776468|gb|EOY23724.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1079 Score = 228 bits (581), Expect = 2e-56 Identities = 317/1203 (26%), Positives = 485/1203 (40%), Gaps = 91/1203 (7%) Frame = +3 Query: 219 LAPPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFGSSPFDNWPPLQPDT-SLPNPFND 395 LAPPFTV SIPK + P D LG P NW P T + P P Sbjct: 33 LAPPFTVDR-SIPKPAAT------PLVD-----LG---EPL-NWLDSNPYTFNSPQPAQL 76 Query: 396 PNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRP--YFPEYPSTTVHDNSSSVVFNE 569 P L + PTPT +Y + P Y+P Y S +H + FNE Sbjct: 77 PQLDLE---------------PTPTPSYNQNSDLFEPKTYYPSYVSPPLHVPT----FNE 117 Query: 570 PQSQSGKMAPLSPGGNLWTGMSGEE-----QWRRKDKNEAGSLNYNSSLNQGTPAVDGLS 734 QS G GG LW G+ + K+ + A S Y +N G L Sbjct: 118 -QSLPGLDHTAQWGGGLWDWEKGKPAQLGGSFYLKETSVAPSSIYMDHINLGAHPSKSLK 176 Query: 735 KCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTS 914 CE++S R+ GP +I+ LD + P + +TSV +S Sbjct: 177 TCEETSYNIYSP------REDQAGPANIEKLD--YNPVLGQNPSFMPVDYLKTSVIGSSS 228 Query: 915 LFQEM-----PYHPVTSSESVLKSWSHLNPNTASYERCVSD-LDSIATNP-TVFYPSSIY 1073 E P + V + ++ + + +SD + S+ ++P V P ++ Sbjct: 229 AISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVVIRPPAVG 288 Query: 1074 SSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTE------GKEG 1235 +S +S NS +F+ N A D A ++R V G + Sbjct: 289 TS-----SSASNSVSFKNVNTGIN---------ATDTNLAGNNRFIVEEPRFLFNFGSKN 334 Query: 1236 CGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAF 1415 D + + D N S +S + S RN +D F S+I+ N AF Sbjct: 335 EFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNL-SRISPDNFSLAF 393 Query: 1416 TLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAPTSQHFPCGITE------AGRFNG 1577 E+LD +N +DSPCWKGAP S + P G +E A + Sbjct: 394 E-----NNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEA 448 Query: 1578 LNLQGPQILPFDAN----------------LVYSENGSAEDGLSYFPKRSSPDVSL-STE 1706 + +L F ++ L+ ENG+ EDG K P VS+ S + Sbjct: 449 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKL--PPVSIPSFK 506 Query: 1707 KYELKDVVKAGS---------------DHSEMRNENVIQCFFPSNLKNCSETK------- 1820 ++E + KAGS + SE + + V+ ++ S T Sbjct: 507 EHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEG 566 Query: 1821 --SSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLVG 1994 +S+ + ++ D + + D + SSHV+ + +S SS + V + Sbjct: 567 RLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK------- 619 Query: 1995 ASDTLGGCQLPRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAYVS 2171 + LG + + V+V+ M NLSELLL + S + L+E D L V+NNLD +S Sbjct: 620 HTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMS 679 Query: 2172 K-VGLVRPIPESDLSQSGTFSF------GKLTVPNESQITNAEANDVQNQCDFNASEPKM 2330 K +G E+ LS+ F G+ ++ +E + + D + ++ Sbjct: 680 KNIG-----QETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQHTQV 734 Query: 2331 HSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIKKALMENFHEEDE-HPQTSLFKNLW 2507 ND M QAIKK L+ENFHE++E HPQ L+KNLW Sbjct: 735 KRKHFGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLW 794 Query: 2508 LESEAALCSMACKTRFARMKMKMEKCNNQHQTKDVVGMPINADKLSGSKISVDLYRDDMV 2687 LE+EAALCS+ R+ MK+++EKC + KD+ + DK+S SK+S DL + + Sbjct: 795 LEAEAALCSINYMARYNNMKIEIEKCKLDTE-KDLSEDTPDEDKISRSKLSADLDTNKKL 853 Query: 2688 ASKTKGSVTSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRVDNSISINKEEKLPDPV 2867 + + + T ++S + +S+++ A+DV A R+ +LK R++NS S+ Sbjct: 854 TAIAESAPTLDVSNQNFPIASSSNHADDVTA----RFHVLKHRLNNSYSV---------- 899 Query: 2868 DEAKAAPCSMEDKAQSARLKFEMERCNQNHVEDVVGLPINVDQLQSSKVSFVLDGDDMAA 3047 H D D+L SSK+S D D A Sbjct: 900 -----------------------------HTRD-------ADELSSSKLSLDSDAVDKLA 923 Query: 3048 SKAKGTVTPNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNPCD 3227 ++ K + T ++ T D+ T+DV++S+M R IL+SR + + N+ + P Sbjct: 924 TEVKDSSTSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQKP---- 979 Query: 3228 IGAYPGSESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDEAD-------LDP--- 3377 LPE+VD+GFA ++ + P ED AD L+ Sbjct: 980 ------------------LPEVVDLGFAGKKKQIPI-----DEDTADDGVLGFNLESVSQ 1016 Query: 3378 ----DLSEKQDPVKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEELA 3545 D + +Q VKDF V D Q + R G + YDS SSDWEHVLKEEL+ Sbjct: 1017 NQVVDYAGEQSVVKDFHLCVKHD-CTIQSPKSTRLGNQLSAGWYDSCSSDWEHVLKEELS 1075 Query: 3546 WPN 3554 N Sbjct: 1076 GQN 1078 >ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus communis] gi|223539484|gb|EEF41073.1| hypothetical protein RCOM_0756330 [Ricinus communis] Length = 1125 Score = 222 bits (565), Expect = 1e-54 Identities = 328/1247 (26%), Positives = 499/1247 (40%), Gaps = 106/1247 (8%) Frame = +3 Query: 120 VRLRSLELD*SKIMMYGMGYKGXXXXXXXXXXPLAPPFTVGAYSIPKKNSLGHFVEHPYF 299 + + +E+D + GY G LAPPFTV S+PK L E + Sbjct: 18 LNFKGVEIDEMMMGQGSFGYGGGSSNLSA----LAPPFTVDR-SVPKP--LVDLTEPTSY 70 Query: 300 DES-HHYLGFGSSPFDNWPPLQP--DTSLPNPFNDPNLRTDAIGGMSPPY--EGTYGLPT 464 S H+++ FD + +P D++ N ++ + ++ S Y G GL Sbjct: 71 HHSLHNWVNPHQPEFDYFVIQKPELDSNSYNRYSASSNPHVSVSTDSVLYGQSGVTGLEA 130 Query: 465 PTL--NYYSTPSITR-------PYFPEYPSTTVHDNSSSVVFNEPQSQSGKMAPLSPGGN 617 + Y +P+I P+ +Y + S+S+ +E +QS LS Sbjct: 131 KPYYPSTYISPAIGNDCSLGGVPHHSDYGLLSASRVSTSIGSSEDYTQS-----LS---G 182 Query: 618 LWTGM-SGEEQWRRKDKNEA-GSL-NYNSSLNQGTPAVDGLSKCEDSSAVWLGKCVDSMA 788 W+GM G W + ++ + GS + + +NQ V GL E +S + D++ Sbjct: 183 QWSGMWDGLTDWLQSEQVQLDGSFCSKETYMNQ----VAGLYASESTSKYEASQSADTVG 238 Query: 789 RKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTSLFQEMPYHPVTSSESVLK 968 R+ I + LD K TP + S T L E S+ + Sbjct: 239 RETQIESAGVGKLDYK--SFLGENRKFTPSDYPTPSSLASTLLVPETCSQ--VPSKKAVN 294 Query: 969 SWSHLNPNTASYERCVSDLDSIATNPTVFYPSSIYSSLAHVYNSPGNSSTFEKEFLSENM 1148 SW+H P +AS E+C+ D+ +++ +YSS A V P ++ K + + Sbjct: 295 SWNHHMPYSASNEKCLRRHDATSSDIATI----LYSSPAVVIKPPEHNKGSLKNVNTSSD 350 Query: 1149 NHFEKPWDACDNKEASSSRTQVHTEGK-------EGCGDTSRTNNGFDVNNQF--SMDSL 1301 DNK+ S + V E + C D S+ + +Q + S Sbjct: 351 G---------DNKDFSCNSPSVVVEPRPFITSKGSVCYDASQVSFHLGKTDQVIANFSSA 401 Query: 1302 MKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAFTLVHTXXXXXXXXXXXXXETLD 1481 ++ SS +N+ D H K Q+ +L +LV E+LD Sbjct: 402 KNEELSSNQNASMDVS-GHFAGEKPVIQVPCTSL-GGISLVDKNEAIDPAKNHT--ESLD 457 Query: 1482 QFNSVLDSPCWKGAPTSQHFPCGITEAGRFNGLNLQGPQILPFDANLVYSENGSAEDGLS 1661 +N +DSPCWKGAP S ++EA + P + + + +GS G Sbjct: 458 HYNPAVDSPCWKGAPVSNFSQLEVSEA------------VTPQNMKNLEACSGSNHQGYQ 505 Query: 1662 YFPKRSSPDVSLSTEKYELKDVVKAG--------SDHSEMRNENVIQ------------- 1778 F S V +S EK K + + G S +N++ Sbjct: 506 TFSVSSDDAVKVSPEKTSEKSIQQKGWSLENYSASSMKRPLADNMLHREGIDHFVNFGAN 565 Query: 1779 CFFPS----------NLKNCSETKSSQMMKMNQNED------------------KDPGID 1874 C PS L N S S+ + N+ + D G++ Sbjct: 566 CTKPSLFHQVQISDDALPNKSFDDSNGKLPQNEKQSCESGKWTTESNSAPVISVADVGMN 625 Query: 1875 VKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLVGASDTLGGCQLPRIGLRVVVN 2054 + D + SSHV F E + S DS I+L + GG + +R V++ Sbjct: 626 MNDDPDECSSHVPFHAVEHVLSSPPSADSASIKL-------TKACGGVSTQKTYIRTVID 678 Query: 2055 AMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAYVSKVGLVRP-------IPESDL 2210 M NLSELL+ + S D+ LKE D L+ +++NL+ + K + R IPE D Sbjct: 679 TMQNLSELLIFHLSNDLCDLKEDDSNALKGMISNLELCMLK-NVERMTSTQESIIPERDG 737 Query: 2211 SQ-SGTFS-FGKLTVPNESQITNAEANDVQN-------QCDFNASEPKMHSTMXXXXXXX 2363 +Q SG S K T N I+ ++ + Q Q + N S K T+ Sbjct: 738 AQLSGKSSKLQKGTNGNGFLISRSDPLEFQYSVKYQHVQDEHNISSGKNDETLSSYVSVR 797 Query: 2364 XXXXXXXXXXXXXXXXNDDMIQAIKKALMENFH-EEDEHPQTSLFKNLWLESEAALCSMA 2540 D M QAIK AL ENFH EE+ PQ L+KNLWLE+EA+LC + Sbjct: 798 AAADMLK---------RDKMTQAIKNALTENFHGEEETEPQVLLYKNLWLEAEASLCYAS 848 Query: 2541 CKTRFARMKMKMEKCNNQHQTKDVVGMPINA---DKLSGSKISVDLYRDDMVASKTKGSV 2711 C RF R+K +MEKC+++ G P N +KLS S I D +++AS TKGS Sbjct: 849 CMARFNRIKSEMEKCDSEKAN----GSPENCMVEEKLSKSNIRSDPCTGNVLASNTKGSP 904 Query: 2712 TSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRVDNSISINKEEKLPDPVDEAKAAPC 2891 + S ++S +S A+DV A RY ILK RVD++ ++ Sbjct: 905 LPDTSIPESSILCTSSHADDVTA----RYHILKYRVDSTNAV------------------ 942 Query: 2892 SMEDKAQSARLKFEMERCNQNHVEDVVGLPINVDQLQSSKVSFVLDGDDMAASKAKGTVT 3071 N + ++ ++G + D+L SS+ S + + + K Sbjct: 943 ------------------NTSSLDKMLG---SADKLSSSQFSPCPNNVEKGVCEEKDGQK 981 Query: 3072 PNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNPCDIGAYPGSE 3251 P+IS D+ S+ S DV++SVM RF IL+ R DN S++KE Sbjct: 982 PDISIQDSLVSNTTSHLNDVEASVMARFHILKCRDDN-FSMHKE---------------- 1024 Query: 3252 SVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDEA----------DLDPDLSEKQDP 3401 + E VD+G+ WP TED D + +E + P Sbjct: 1025 ---------ESTESVDLGYVGLPRHWP-TGTDETEDRVLDVNMRTHLQHHDCNFTEDKLP 1074 Query: 3402 VKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEEL 3542 VK+F FV DD P D +R G S D SSDWEHVL EEL Sbjct: 1075 VKEFHLFVKDD-PVIGSRDINRLGDQSHASFCDG-SSDWEHVLLEEL 1119 >gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis] Length = 1159 Score = 221 bits (563), Expect = 2e-54 Identities = 306/1235 (24%), Positives = 483/1235 (39%), Gaps = 124/1235 (10%) Frame = +3 Query: 222 APPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFG-SSPFDNWPPLQPDTSLPNPFNDP 398 APPFT I + + G+ ++S SS NW P + TS N F+DP Sbjct: 26 APPFT-----IDRSPNTGYMPLVDLLEQSSRTGTLNNSSSLHNWLPPRSPTSETNFFSDP 80 Query: 399 NLRTDAIGGMSPPYEGTYGLPTPTLNYYST---------------PSITRPYFPEYPSTT 533 NL +++ + PY Y L + ST + +PY+ + S Sbjct: 81 NLELNSVASPNNPY--NYASLNTHLPHLSTSVSASADAFSYAQCGDGVAKPYYFSFLSPP 138 Query: 534 VHDNSSSVV-----------------------FNEPQSQ-SGKMAPLSPGGNLWTGMSGE 641 + S VV N+ SQ SG + G LW G S Sbjct: 139 TQKDGSLVVPDQTSYDWLSSSSHVAVTALDGSSNKDYSQRSGDSKKPAQWGGLWNGFSEW 198 Query: 642 EQWRR----------KDKNEAGSLNYNSSLNQGTPAVDGLSKCEDSSAVWLGK----CVD 779 EQ + K+ + S Y + +NQ T + GL++ E++ + GK ++ Sbjct: 199 EQGNQGLFDGSFCCSKESDIPVSSMYENFMNQETHSPKGLNRGEEAMRLNRGKEAFHGIN 258 Query: 780 SMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTSLFQEMPYHPVTSSES 959 ++ H G + + + K P + SR S + + P + ES Sbjct: 259 NLDSDKHGGSVNAENFNDK--SFSGKTSNFLPADCSR-------SFLESLSGFPDSGLES 309 Query: 960 ----VLKSWSHLNPNTASYERCVSD--LDSIATNPT-VFYPSSIYSSLAHVYNSPGNSST 1118 + S H P AS E+ + DS ++PT V P S SP N++ Sbjct: 310 PCFMIGTSSGHQIPYGASNEKHLKQHATDSAKSSPTPVIGPPVAGSGF-----SPSNNAP 364 Query: 1119 FEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTEGKEGCGDTSRTNNGFDVNNQFSMDS 1298 F+ N+ + D C K S + +G + D+S+ + D+++ S+ S Sbjct: 365 FK----IVNLGSCKTDADMCSKKAPSF----IDADGVKPAFDSSKLSIHLDIDDPASLGS 416 Query: 1299 LMKKDFSSY-RNSITDEPFNHLYNPKFKSQITDVNLPNAFTLVHTXXXXXXXXXXXXXET 1475 + K+ + I+ + +H+ PK Q + N+P+ + E Sbjct: 417 YVTKNEEMLNKECISSDTLHHVLIPKSGPQTS--NVPHEGFKLDLNTNENINSVEDSSEN 474 Query: 1476 LDQFNSVLDSPCWKGAPT--SQHFPCGITEAGR---FNGLNLQGPQILPFDA-------- 1616 +D +N +DSPCWKG P S F + E R F+ N+Q QI + Sbjct: 475 VDHYNHAVDSPCWKGVPATRSSPFDASVPETKRQEVFSNSNVQTKQIFQLNTGDKVSSQK 534 Query: 1617 ---NLVYSENGSAEDGLSYFPKRSSPDVSLSTEKYELKDVVKAGSD-------HSEMRNE 1766 N++ E GS E+GL FP +SP + + D+VK GSD HS +E Sbjct: 535 RNDNMMCHEFGSPENGLE-FPLNTSPAAKSTFSDRKSDDIVKIGSDLETKGIQHSNDIHE 593 Query: 1767 NVIQCFFPSNLKNCSETKSSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFGDGERISRLL 1946 + S CS+ KSS + N + ++ +A S + F IS Sbjct: 594 H------GSRSTGCSDLKSSLNGEQNIQRNGLISENINEALQCVSPRLPFPMENIIS--- 644 Query: 1947 SSGDSVPIELGEPLVGASDTLGGCQLPRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHD 2123 SS + +L + G S P I + V+V+ + NLSELLL + + LK+ D Sbjct: 645 SSVEDASTKLNKSNEGPSS-------PTIDVPVLVSTIRNLSELLLFHCTSGSYQLKQKD 697 Query: 2124 HVVLRHVLNNLDAYVSKVGLVRPIPESDLSQSGTFSF-----GKLTVPNESQITNAEAND 2288 ++ +++NL SK + S+ T + K N+ Q+T Sbjct: 698 LETIQSMIDNLSVCASKNSEKTVSTQDSTSEKYTSDYLGDKNHKGFTLNKLQVTKTAGPI 757 Query: 2289 VQNQCDFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIKKALMENF-HE 2465 + D N K + D IQA+KK L +NF +E Sbjct: 758 LDLLADQNVH--KGNKYYVAGKENDELLDSVSVRADVDIVDEDKAIQALKKVLTDNFDYE 815 Query: 2466 EDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQTKDVVGMPINA--DK 2639 E+ PQ L+KNLWLE+EAALCSM+CK RF R+K++ME ++KD G I DK Sbjct: 816 EEASPQALLYKNLWLEAEAALCSMSCKARFNRVKLEMEN-PKLPKSKDAHGNTITTEMDK 874 Query: 2640 LSGSKISVDLYRDDMVASKTKGSVTSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRV 2819 +S S++S DL + ++ K KG T+ SQ S+ + VMDR++IL+CR Sbjct: 875 VSRSEVSPDLNGANTLSPKAKGCATTK------SQESSVLSTNAEDDDVMDRFQILRCRA 928 Query: 2820 DNSISINKEEKLPDPVDEAKAAPCSMEDKAQSARLKFEMERCNQNHVEDVVGLPINVDQL 2999 S +K D K +P Sbjct: 929 KKSNYGIVADK--DKPSSPKVSP------------------------------------- 949 Query: 3000 QSSKVSFVLDGDDMAASKAKGTVTPNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRID 3179 S+KV +L A++ G+ P+I +S SS + D ++SVM RF IL+SR D Sbjct: 950 HSNKVGKILP----EANEETGSSKPDIRRQASSNSSTDKPSNDYEASVMARFHILKSRGD 1005 Query: 3180 NTISINKEGRLPNPCDIGAYPGSESVQNFGIKP-------------------QLPELVDM 3302 N ++ +G+L D V + ++P + P +D Sbjct: 1006 NCSPLSTQGQLAENVDGSTIGSKSEVGSSCVEPEPTLQHHDADSTEGQLTGGEFPMFIDY 1065 Query: 3303 GFAERRVK-----------WPFIRMMRTEDEADLDPDLSEKQDPVKDFSTFVSDDEPETQ 3449 + + W F R+ + D D +E Q +F F+ D + +Q Sbjct: 1066 DSMSQSHRPNRRENSLLAGW-FDRVSSEWEHVGNDADSTEGQLTGGEFPMFI-DYDSMSQ 1123 Query: 3450 FYDTDRFGTWVPTSGYDSPSSDWEHVLKEELAWPN 3554 + +R + +D SS+WEHV KEEL N Sbjct: 1124 SHRPNRRENSLLAGWFDRVSSEWEHVGKEELGLQN 1158 >ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508776466|gb|EOY23722.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1017 Score = 199 bits (506), Expect = 8e-48 Identities = 273/1009 (27%), Positives = 412/1009 (40%), Gaps = 90/1009 (8%) Frame = +3 Query: 219 LAPPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFGSSPFDNWPPLQPDT-SLPNPFND 395 LAPPFTV SIPK + P D LG P NW P T + P P Sbjct: 33 LAPPFTVDR-SIPKPAAT------PLVD-----LG---EPL-NWLDSNPYTFNSPQPAQL 76 Query: 396 PNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRP--YFPEYPSTTVHDNSSSVVFNE 569 P L + PTPT +Y + P Y+P Y S +H + FNE Sbjct: 77 PQLDLE---------------PTPTPSYNQNSDLFEPKTYYPSYVSPPLHVPT----FNE 117 Query: 570 PQSQSGKMAPLSPGGNLWTGMSGEE-----QWRRKDKNEAGSLNYNSSLNQGTPAVDGLS 734 QS G GG LW G+ + K+ + A S Y +N G L Sbjct: 118 -QSLPGLDHTAQWGGGLWDWEKGKPAQLGGSFYLKETSVAPSSIYMDHINLGAHPSKSLK 176 Query: 735 KCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTS 914 CE++S R+ GP +I+ LD + P + +TSV +S Sbjct: 177 TCEETSYNIYSP------REDQAGPANIEKLD--YNPVLGQNPSFMPVDYLKTSVIGSSS 228 Query: 915 LFQEM-----PYHPVTSSESVLKSWSHLNPNTASYERCVSD-LDSIATNP-TVFYPSSIY 1073 E P + V + ++ + + +SD + S+ ++P V P ++ Sbjct: 229 AISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVVIRPPAVG 288 Query: 1074 SSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTE------GKEG 1235 +S +S NS +F+ N A D A ++R V G + Sbjct: 289 TS-----SSASNSVSFKNVNTGIN---------ATDTNLAGNNRFIVEEPRFLFNFGSKN 334 Query: 1236 CGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAF 1415 D + + D N S +S + S RN +D F S+I+ N AF Sbjct: 335 EFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNL-SRISPDNFSLAF 393 Query: 1416 TLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAPTSQHFPCGITE------AGRFNG 1577 E+LD +N +DSPCWKGAP S + P G +E A + Sbjct: 394 E-----NNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEA 448 Query: 1578 LNLQGPQILPFDAN----------------LVYSENGSAEDGLSYFPKRSSPDVSL-STE 1706 + +L F ++ L+ ENG+ EDG K P VS+ S + Sbjct: 449 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKL--PPVSIPSFK 506 Query: 1707 KYELKDVVKAGS---------------DHSEMRNENVIQCFFPSNLKNCSETK------- 1820 ++E + KAGS + SE + + V+ ++ S T Sbjct: 507 EHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEG 566 Query: 1821 --SSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLVG 1994 +S+ + ++ D + + D + SSHV+ + +S SS + V + Sbjct: 567 RLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK------- 619 Query: 1995 ASDTLGGCQLPRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAYVS 2171 + LG + + V+V+ M NLSELLL + S + L+E D L V+NNLD +S Sbjct: 620 HTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMS 679 Query: 2172 K-VGLVRPIPESDLSQSGTFSF------GKLTVPNESQITNAEANDVQNQCDFNASEPKM 2330 K +G E+ LS+ F G+ ++ +E + + D + ++ Sbjct: 680 KNIG-----QETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQHTQV 734 Query: 2331 HSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIKKALMENFHEEDE-HPQTSLFKNLW 2507 ND M QAIKK L+ENFHE++E HPQ L+KNLW Sbjct: 735 KRKHFGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLW 794 Query: 2508 LESEAALCSMACKTRFARMKMKMEKC------NNQHQTKDVVGMPINADKLSGSKISVDL 2669 LE+EAALCS+ R+ MK+++EKC + T D + +AD+LS SK+S+D Sbjct: 795 LEAEAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPDEDKISRDADELSSSKLSLDS 854 Query: 2670 YRDDMVASKTKGSVTSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRVDNSISINKEE 2849 D +A++ K S TS++ T D+ +DVEAS+M R ILK R + + N+ E Sbjct: 855 DAVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEME 914 Query: 2850 K--LPDPVD-----EAKAAPCSMEDKAQSARLKFEMERCNQNHVEDVVG 2975 + LP+ VD + K P ED A L F +E +QN V D G Sbjct: 915 QKPLPEVVDLGFAGKKKQIPID-EDTADDGVLGFNLESVSQNQVVDYAG 962 Score = 94.7 bits (234), Expect = 3e-16 Identities = 78/249 (31%), Positives = 117/249 (46%), Gaps = 21/249 (8%) Frame = +3 Query: 2871 EAKAAPCSMEDKAQSARLKFEMERCNQNHVEDVVG-------LPINVDQLQSSKVSFVLD 3029 EA+AA CS+ A+ +K E+E+C + +D+ + + D+L SSK+S D Sbjct: 796 EAEAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPDEDKISRDADELSSSKLSLDSD 855 Query: 3030 GDDMAASKAKGTVTPNISTLDTSQSSNMSVTEDVKSSVMDRFRILQSRIDNTISINKEGR 3209 D A++ K + T ++ T D+ T+DV++S+M R IL+SR + + N+ + Sbjct: 856 AVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQ 915 Query: 3210 LPNPCDIGAYPGSESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDEAD------- 3368 P LPE+VD+GFA ++ + P ED AD Sbjct: 916 KP----------------------LPEVVDLGFAGKKKQIPI-----DEDTADDGVLGFN 948 Query: 3369 LDP-------DLSEKQDPVKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHV 3527 L+ D + +Q VKDF V D Q + R G + YDS SSDWEHV Sbjct: 949 LESVSQNQVVDYAGEQSVVKDFHLCVKHD-CTIQSPKSTRLGNQLSAGWYDSCSSDWEHV 1007 Query: 3528 LKEELAWPN 3554 LKEEL+ N Sbjct: 1008 LKEELSGQN 1016 >ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301835 [Fragaria vesca subsp. vesca] Length = 1218 Score = 199 bits (506), Expect = 8e-48 Identities = 322/1275 (25%), Positives = 499/1275 (39%), Gaps = 167/1275 (13%) Frame = +3 Query: 219 LAPPFTVGAYSIPKKNSLGHFVEH--PYFDESHH-YLGFGSSPFDNWPPLQPDTSLPNPF 389 LAPPFTV +PK S VE P + + Y +S NW P +S+PN F Sbjct: 24 LAPPFTVER-PVPKPIS-SPLVESFTPLVEVTEQPYAAPPNSTLHNWLPPHSPSSVPNFF 81 Query: 390 NDPNLRTDAIGGMSPPYEGTY---GLPT--------PTLNYYSTPSIT------------ 500 +P D++ P Y GLPT P +N S PS Sbjct: 82 TNPPPAFDSV-----PSSNAYRYAGLPTVDSFSTNLPPMNSVSMPSSNAFSYDQRLDVAA 136 Query: 501 ------RPYFPEYPSTTVHDNSSSVVFNEPQSQ---SGKMAPLSPG-------------- 611 +PY+P Y S T+H ++ V ++P + + APL Sbjct: 137 TSFVEAKPYYPSYLSPTIHGDNPVVPPDQPSYDWLSTSQFAPLDGSSHKEYTQRPSSSKY 196 Query: 612 ----GNLWTGMSGEEQWR--------RKDKNEAGSLNYNSSLNQGTPAVDGLSKCEDSSA 755 G+ W G + EQ + R +N+ +L YN+ LNQ + + L + Sbjct: 197 TAQWGSSWNGPAEWEQGKQGQFDGSFRPKENDVSNLPYNNYLNQEPHSSNSLKSYGVNEV 256 Query: 756 V------WLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTSL 917 W G H+G KS ++K S P + ++ ++ S+ Sbjct: 257 ASHNIPDWNGS-----VNAEHLGDKSFVGRNSKFS----------PIDFTKPTMGS-LSV 300 Query: 918 FQEMPYHPVTSSESVLKSWSHLNPNTASYERCVSDLDSIATNPTVFYPSSIYSSLAHVYN 1097 E+P SS + KS ++ ++ +D+ SI+ SS A + Sbjct: 301 VPEIP-SKAPSSPFIGKSTYGVSCEKRQHDASWNDVTSISK-----------SSPASIIR 348 Query: 1098 SP--GNSSTFEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTEGKEGCG-DTSRTNNGF 1268 P G S+ K L + +N +A S + + + D+S+ Sbjct: 349 PPAIGTKSSEPKMGLFKRLNSGRDAANADHGGYYPSQESHLPQSFVDKVPFDSSQLGIHL 408 Query: 1269 DVNNQFSMDSLMKKDFSSYRN-SITDEPFNHLYNPKFKSQITDVNLPNAFTLVHTXXXXX 1445 + FS++S KD + N SI+++P +HL+ K + V P+ F Sbjct: 409 GRIDPFSVESSSTKDTALPNNGSISNDPLDHLFKVKPGLPNSHVK-PDGFDAA-VNINDS 466 Query: 1446 XXXXXXXXETLDQFNSVLDSPCWKGAPTSQHFPCGITEAGR---------FNGLNLQGPQ 1598 E +D N +DSPCWKG S+ P +E G NGLNL P Sbjct: 467 INSFLNSSENVDPNNPAVDSPCWKGVRGSRFSPFKASEEGGPEKMKKLEGCNGLNLNMPM 526 Query: 1599 ILPFDA--------NLVYSE-----NGSAEDGLSYFPKRSSPDVSLSTEKYELKDVVKAG 1739 I + + Y+E NG +GL K+SS + S E ++L D K Sbjct: 527 IFSLNTCENISTQKPVEYNEFGWLGNGLLGNGLPLPLKKSSVENSAFGE-HKLDDTTKT- 584 Query: 1740 SDHSEMRNENVIQCFFPSNLKNCSETKSSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFG 1919 + + E ++ + + + + S KSS + + + G + +S + T+ Sbjct: 585 TYYRESGHDRGLHGYI-NTPHSGSGDKSSSPFEHSYIVQEGCG---EGGLTTESKNTTWS 640 Query: 1920 DGE----RISRLLSSGDS--VPIE--LGEPLVGASDT-----LGGCQLPRIGLRVVVNAM 2060 G I+ L G S PIE P V +DT G + ++++VN M Sbjct: 641 VGADVKLNINDTLECGSSHTSPIENTFCSPSVEDADTKLTTSYGEESNMNMDIQMLVNKM 700 Query: 2061 HNLSELLL-SYSEDVNALKEHDHVVLRHVLNNLDAYVSKVGL-VRPIPESDLSQSGTFSF 2234 ++LSE+LL + S LK+ D L+ V+NNL++ + K +PES Q T + Sbjct: 701 NSLSEVLLVNCSNSSCQLKKKDIDALKAVINNLNSCILKHDEDFLSMPESPPIQQSTIKY 760 Query: 2235 -------GKLTVPNESQITNAEANDVQNQCDFNASEP-KMHSTMXXXXXXXXXXXXXXXX 2390 K P+ Q+T A +Q+ + K H + Sbjct: 761 IEELCKPNKALSPDMPQLTKIFAPSIQDPLHLQGVQKVKNHDNLVKNDDEVISSVSAKSD 820 Query: 2391 XXXXXXXNDDMIQAIKKALMENFHEEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKM 2570 ++M Q IKK L ENFH +D HPQT L+KNLWLE+EA +CS K RF R+K Sbjct: 821 IDFVK--QEEMTQDIKKILSENFHTDDTHPQTLLYKNLWLEAEAVICSTNYKARFNRLKT 878 Query: 2571 KMEKCNNQHQTKDVVGMPINADKLSGSKISVDLYRDDMVASKTKGSVTSNISTHDTSQSS 2750 +MEKC Q+KDV + S S++ V+ + + S+ +GS ++ Q S Sbjct: 879 EMEKC-KADQSKDVFEHTADMMTQSRSEVCVNSNPVEKLTSEVQGSPLPKLNL----QES 933 Query: 2751 NTSCAEDVEASVMDRYRILKCRVDNSISINKE---------EKLPDPVD----EAKAAP- 2888 T D +VM R+ +L+ R++N S+N +PD VD EA A P Sbjct: 934 PTLTQGD--DNVMARFHVLRNRIENLSSVNATFGDESSSTLSLVPDKVDEVAPEADARPS 991 Query: 2889 --CSMEDKAQS--------------ARLKFEMERC-NQNHVEDVVGLPINVDQLQSSKVS 3017 S++D S AR +R N + D NV+ SSKVS Sbjct: 992 PRISLQDSPTSSITGLSNDYEASVMARFHIIRDRVENSKFISDA-----NVEDTASSKVS 1046 Query: 3018 FVLDGDDMAASKAKGTVTPNISTLD---------TSQSSNMSVTEDVKSSVMDRFRILQS 3170 + ++ A + ++ D S S+ + SV+ RF IL+S Sbjct: 1047 REHEAEEGACETSDDGPIQELNIQDYPGSVQDYPVSTSTTTGHAYQYEDSVLARFNILKS 1106 Query: 3171 RIDNTISINKEGRLPNPCDIGAYPGSESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMR 3350 R+DN I G +L E VD+G+A +R P I R Sbjct: 1107 RVDNCSDIPTVG------------------------ELLESVDLGYAGKRNLGPII-CNR 1141 Query: 3351 TED-----------EADLDPDLSEKQDPVKDFSTFVSDDEPETQFYDTDRFGTWVPTSGY 3497 +ED ++ + + K K+F FV DD ++ P Sbjct: 1142 SEDGSSDVKEQPVLQSHIADNSKGKCMDAKEFHLFVEDDPGHMINRPANQLSAGSPD--- 1198 Query: 3498 DSPSSDWEHVLKEEL 3542 S SSDWEHV+KEE+ Sbjct: 1199 QSTSSDWEHVMKEEV 1213 >ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628429 [Citrus sinensis] Length = 1065 Score = 179 bits (453), Expect = 1e-41 Identities = 253/956 (26%), Positives = 376/956 (39%), Gaps = 86/956 (8%) Frame = +3 Query: 219 LAPPFTVGAYSIPK------KNSLGHFVEHPY-FDESHHYLGFGSSPFDNWPPLQPDTSL 377 LAPPFTV S+ K + L HP FD H +G S + P P T+ Sbjct: 23 LAPPFTVDR-SVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYS----FNP--PSTAH 75 Query: 378 PNPFNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRPYFPEYPSTTVHDNSSSV 557 P +P T A S G P+ N PY+P Y S T + Sbjct: 76 IPPPENPIPITSA----SSFLYGQSSDAIPSANLVEA----NPYYPSYVSPTKY------ 121 Query: 558 VFNEPQSQSGKMAPLSPGGNLWTGMSGEEQWRRKDKNEAGSLNYNSSLNQGTPAVDGLSK 737 +++ + S + E + K+ N Y +QG + GL+ Sbjct: 122 TYDDYAQSLSSLWDASREWEFGRKLELGESFCAKEMNVPDLSIYQDYADQGAHSSKGLNT 181 Query: 738 CEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTSL 917 E + +D + + H G + + LD K P SR SV TS Sbjct: 182 FEQKN-----NNLDMLGSEQHQGSINREQLDYK--SFTGQISEFMPVEYSRKSVHGSTSF 234 Query: 918 FQEMPYHPVTSSESVLKSWSHLNPNTASYERCVSDLDSIATNPTVFYPSSIY-SSLAHVY 1094 F E + +TS E +SWSH P AS E+ I+ N SS+ SS HV Sbjct: 235 FPET--YSLTSFEQG-RSWSHQTPYGASCEKGAKQ-HGISPNDI----SSVKKSSPVHVI 286 Query: 1095 NSPGNSSTFEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTEGKEGCGDTSRTNNGFDV 1274 S S+ N N S+ + EG F + Sbjct: 287 KSQAVCSSLSPPSTGSFNNLENSSGAIASNDNLSNMKEFYPLHSSEGKVHFDAGQVSFHL 346 Query: 1275 ---NNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAFTLVHTXXXXX 1445 ++ F L KK+ S S+ +P K QI D+ P + +L+ Sbjct: 347 ERGSHIFPKLPLEKKEKLSSNVSVIKDPLKE----KPGLQIPDIG-PGSVSLM-LANNGA 400 Query: 1446 XXXXXXXXETLDQFNSVLDSPCWKGAP------------TSQHFPCGITEAGRFNGLNLQ 1589 E+LD +N +DSPCWKGAP T QH I + +G N Sbjct: 401 INCSEGSSESLDHYNPAVDSPCWKGAPDYHSPVESSGPVTLQH----INKIEACSGSNSF 456 Query: 1590 GP-----QILPFDAN--LVYSENGSAEDGLSYFPKRSSP----------DVSLSTEKYEL 1718 GP ++ P + Y E+G E+ PKRSS D L T Y++ Sbjct: 457 GPTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDHDLKTGSYQM 516 Query: 1719 KDVVKAGSDHSEMRNENVIQCFFPSNLKNCSETKSSQMMKMNQNEDK------------- 1859 K G S+ ++ +N + + + ++ + E+K Sbjct: 517 KSSCGLGVQFSDYIDKPRQDYVHANNSADEFKFRPFHQVQYDTVENKLTFERKCELGSGV 576 Query: 1860 -DPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLVGASDTLGGCQL-PRI 2033 D G+ + + SSHV E + SS ++VP L + L G QL P++ Sbjct: 577 ADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNK--------LHGEQLAPQM 628 Query: 2034 GLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAYVSK-VGLVRPIPESD 2207 +R ++++MHNLSELLL + S D+ LKEHD L+ V+NNLD +SK +G PI ES Sbjct: 629 CVRTLISSMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESL 688 Query: 2208 LSQSGT----------------------FSFGKLTVPNESQITNAEAND------VQNQC 2303 L+Q + +F L PN + + D ++ Sbjct: 689 LTQKSSEFIREFPELHEGVTVSSPQETKAAFSVLNQPNYQHVQEQRSPDIAAGKKIEKCS 748 Query: 2304 DFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIKKALMENF-HEEDEHP 2480 DF S +D+M QAIKK L +NF EEDE Sbjct: 749 DFT-------SQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVKEEDEKL 801 Query: 2481 QTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQTKDVVGMPINADKLSGSKIS 2660 Q L++NLWLE+EAALC++ K RF RMK+++E C + KD+ +KLS + S Sbjct: 802 QVLLYRNLWLEAEAALCAINYKARFNRMKIELENC-KLLKAKDLSENTSELEKLSQTTFS 860 Query: 2661 VDLYRDDMVASKTKGSVTSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRVDNS 2828 DL+ + + + K T ++S D ++++S +DV A R++ILKC+ S Sbjct: 861 PDLHAVNKLPPQVKDDTTQDVSVRDFPIANSSSHPDDVVA----RFQILKCQESKS 912 >ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543530|gb|ESR54508.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 1041 Score = 178 bits (451), Expect = 2e-41 Identities = 271/1050 (25%), Positives = 406/1050 (38%), Gaps = 134/1050 (12%) Frame = +3 Query: 219 LAPPFTVGAYSIPK------KNSLGHFVEHPY-FDESHHYLGFGSSPFDNWPPLQPDTSL 377 LAPPFTV S+ K + L HP FD H +G S + P P T+ Sbjct: 23 LAPPFTVDR-SVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYS----FNP--PSTAH 75 Query: 378 PNPFNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRPYFPEYPSTTVHDNSSSV 557 P +P T A S G P+ N PY+P Y S T + Sbjct: 76 IPPPENPIPITSA----SSFLYGQSSDAIPSANLVEA----NPYYPSYVSPTKYTYDDYA 127 Query: 558 VFNEPQSQSGKMAPLSPGGNLWTGMSGE--------EQWRRKDKNEAGSLNYNSSLNQGT 713 QS S +LW E E + K+ N Y +QG Sbjct: 128 -----QSLS----------SLWDSREWEFSRKLELGESFCSKEMNVPDLSIYQDYADQGA 172 Query: 714 PAVDGLSKCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRT 893 + GL+ E + +D + + H G + + LD K P SR Sbjct: 173 HSSKGLNTFEQKN-----NNLDMLGSEQHQGSINREQLDYK--SFTGQISEFMPVEYSRK 225 Query: 894 SVSDPTSLFQEMPYHPVTSSESVLKSWSHLNPNTASYERCV-------SDLDSIATNPTV 1052 SV TSLF E + +TS E +SWSH P AS E+ +D+ S+ + V Sbjct: 226 SVHGSTSLFPET--YSLTSYEQG-RSWSHQTPYGASCEKGAKQHGISPNDISSVKKSSPV 282 Query: 1053 FYPSS--IYSSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACDNKEA----SSSRTQV 1214 S +++SL SP ++ +F EN + D N + SS +V Sbjct: 283 HVVKSQAVFTSL-----SPPSTVSFNN---LENSSGVIASNDNLSNMKEFYPLHSSEGKV 334 Query: 1215 HTEGKEGCGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITD 1394 H + + R ++ F F + + S ++ + ++P QI D Sbjct: 335 HFDAGQVSFHLERGSHIFP-KLPFEKKEKLSSNVSVIKDPLKEKP---------GLQIPD 384 Query: 1395 VNLPNAFTLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAP------------TSQH 1538 + P + +L+ E+LD +N +DSPCWKGAP T QH Sbjct: 385 IG-PGSVSLM-LANNRAINCSEGSSESLDHYNPAVDSPCWKGAPDYHSPVESSGPVTLQH 442 Query: 1539 FP-----CGITEAGRFNGLNLQGPQILPFDANLVYSENGSAEDGLSYFPKRSSP------ 1685 G G + PQ P D + Y E+G E+ PKRSS Sbjct: 443 INKIEACSGSNSIGPTDNSGKVSPQ-KPSDYSF-YQEHGYLENDPESSPKRSSRANLLFE 500 Query: 1686 ----DVSLSTEKYELKDVVKAGSDHSEMRNENVIQCFFPSNLKNCSETKSSQMMKMNQNE 1853 D L T Y++K G S+ ++ +N + + + ++ + E Sbjct: 501 EHGYDRDLKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVE 560 Query: 1854 DK--------------DPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLV 1991 +K D G+ + + SSHV E + SS ++VP L + Sbjct: 561 NKLTFERKCELGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNK--- 617 Query: 1992 GASDTLGGCQL-PRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAY 2165 L G QL P++ +R +++ MHNLSELLL + S D+ LKEHD L+ V+NNLD Sbjct: 618 -----LHGEQLAPQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKC 672 Query: 2166 VSK-VGLVRPIPESDLSQSGT----------------------FSFGKLTVPNESQITNA 2276 +SK +G PI ES L+Q + +F L PN + Sbjct: 673 ISKRMGPEAPIQESLLTQKSSEFIREFPELHEGVTVSSPKETKAAFSVLNQPNYQHVQEQ 732 Query: 2277 EANDV-----QNQC-DFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIK 2438 + D+ +C DF S +D+M QAIK Sbjct: 733 RSPDIAAGKKSEKCSDFT-------SQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIK 785 Query: 2439 KALMENF-HEEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQTKDVV 2615 K L +NF EEDE Q L++NLWLE+EAALCS+ K RF RMK+++E C + + Sbjct: 786 KVLSDNFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRMKIELENCKLLKAKVNKL 845 Query: 2616 GMPINADKLSGSKI------SVDLYRDDMVAS---------------------------K 2696 + D + ++ + DD+VA + Sbjct: 846 PPQVKDDSTQDVSVHDFPIANISSHPDDVVARSQILKCQESESHANQRPTADEVDNFLFE 905 Query: 2697 TKGSVTSNISTHDTSQSSNTSCAEDVEASVMDRYRILKCRVDNSISINKEEKLPDPVDEA 2876 + T ST S +++TS A+DVEASV+ R+ ILK R++NS N +++ V Sbjct: 906 ARNDQTPPTSTCSLSNATSTSKADDVEASVIARFHILKNRIENSSCSNMGDQILPQV--- 962 Query: 2877 KAAPCSMEDKAQSARLKFEMERCNQNHVED 2966 A E+ E+ R + NH++D Sbjct: 963 --AFKLFENGTSDVNTGPELHRNSSNHMQD 990 >ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543533|gb|ESR54511.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 1064 Score = 177 bits (450), Expect = 3e-41 Identities = 259/971 (26%), Positives = 384/971 (39%), Gaps = 101/971 (10%) Frame = +3 Query: 219 LAPPFTVGAYSIPK------KNSLGHFVEHPY-FDESHHYLGFGSSPFDNWPPLQPDTSL 377 LAPPFTV S+ K + L HP FD H +G S + P P T+ Sbjct: 23 LAPPFTVDR-SVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYS----FNP--PSTAH 75 Query: 378 PNPFNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRPYFPEYPSTTVHDNSSSV 557 P +P T A S G P+ N PY+P Y S T + Sbjct: 76 IPPPENPIPITSA----SSFLYGQSSDAIPSANLVEA----NPYYPSYVSPTKYTYDDYA 127 Query: 558 VFNEPQSQSGKMAPLSPGGNLWTGMSGE--------EQWRRKDKNEAGSLNYNSSLNQGT 713 QS S +LW E E + K+ N Y +QG Sbjct: 128 -----QSLS----------SLWDSREWEFSRKLELGESFCSKEMNVPDLSIYQDYADQGA 172 Query: 714 PAVDGLSKCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRT 893 + GL+ E + +D + + H G + + LD K P SR Sbjct: 173 HSSKGLNTFEQKN-----NNLDMLGSEQHQGSINREQLDYK--SFTGQISEFMPVEYSRK 225 Query: 894 SVSDPTSLFQEMPYHPVTSSESVLKSWSHLNPNTASYERCV-------SDLDSIATNPTV 1052 SV TSLF E + +TS E +SWSH P AS E+ +D+ S+ + V Sbjct: 226 SVHGSTSLFPET--YSLTSYEQG-RSWSHQTPYGASCEKGAKQHGISPNDISSVKKSSPV 282 Query: 1053 FYPSS--IYSSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACDNKEA----SSSRTQV 1214 S +++SL SP ++ +F EN + D N + SS +V Sbjct: 283 HVVKSQAVFTSL-----SPPSTVSFNN---LENSSGVIASNDNLSNMKEFYPLHSSEGKV 334 Query: 1215 HTEGKEGCGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITD 1394 H + + R ++ F F + + S ++ + ++P QI D Sbjct: 335 HFDAGQVSFHLERGSHIFP-KLPFEKKEKLSSNVSVIKDPLKEKP---------GLQIPD 384 Query: 1395 VNLPNAFTLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAP------------TSQH 1538 + P + +L+ E+LD +N +DSPCWKGAP T QH Sbjct: 385 IG-PGSVSLM-LANNRAINCSEGSSESLDHYNPAVDSPCWKGAPDYHSPVESSGPVTLQH 442 Query: 1539 FP-----CGITEAGRFNGLNLQGPQILPFDANLVYSENGSAEDGLSYFPKRSSP------ 1685 G G + PQ P D + Y E+G E+ PKRSS Sbjct: 443 INKIEACSGSNSIGPTDNSGKVSPQ-KPSDYSF-YQEHGYLENDPESSPKRSSRANLLFE 500 Query: 1686 ----DVSLSTEKYELKDVVKAGSDHSEMRNENVIQCFFPSNLKNCSETKSSQMMKMNQNE 1853 D L T Y++K G S+ ++ +N + + + ++ + E Sbjct: 501 EHGYDRDLKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVE 560 Query: 1854 DK--------------DPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLV 1991 +K D G+ + + SSHV E + SS ++VP L + Sbjct: 561 NKLTFERKCELGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNK--- 617 Query: 1992 GASDTLGGCQL-PRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAY 2165 L G QL P++ +R +++ MHNLSELLL + S D+ LKEHD L+ V+NNLD Sbjct: 618 -----LHGEQLAPQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKC 672 Query: 2166 VSK-VGLVRPIPESDLSQSGT----------------------FSFGKLTVPNESQITNA 2276 +SK +G PI ES L+Q + +F L PN + Sbjct: 673 ISKRMGPEAPIQESLLTQKSSEFIREFPELHEGVTVSSPKETKAAFSVLNQPNYQHVQEQ 732 Query: 2277 EANDV-----QNQC-DFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIK 2438 + D+ +C DF + +D+M QAIK Sbjct: 733 RSPDIAAGKKSEKCSDFTSQGGHAERVKDDDMTQVHKDDAERVK-------DDNMTQAIK 785 Query: 2439 KALMENF-HEEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQTKDVV 2615 K L +NF EEDE Q L++NLWLE+EAALCS+ K RF RMK+++E C + KD Sbjct: 786 KVLSDNFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRMKIELENC-KLLKAKDFS 844 Query: 2616 GMPINADKLSGSKISVDLYRDDMVASKTKGSVTSNISTHDTSQSSNTSCAEDVEASVMDR 2795 +KLS + S DL+ + + + K T ++S HD ++ +S +DV A R Sbjct: 845 ENTSELEKLSQTTFSPDLHAVNKLPPQVKDDSTQDVSVHDFPIANISSHPDDVVA----R 900 Query: 2796 YRILKCRVDNS 2828 +ILKC+ S Sbjct: 901 SQILKCQESES 911 >ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543534|gb|ESR54512.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 842 Score = 155 bits (392), Expect = 1e-34 Identities = 237/890 (26%), Positives = 346/890 (38%), Gaps = 101/890 (11%) Frame = +3 Query: 219 LAPPFTVGAYSIPK------KNSLGHFVEHPY-FDESHHYLGFGSSPFDNWPPLQPDTSL 377 LAPPFTV S+ K + L HP FD H +G S + P P T+ Sbjct: 23 LAPPFTVDR-SVSKPLVDLTEPPLNWLNTHPLNFDSVHSSNAYGYS----FNP--PSTAH 75 Query: 378 PNPFNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRPYFPEYPSTTVHDNSSSV 557 P +P T A S G P+ N PY+P Y S T + Sbjct: 76 IPPPENPIPITSA----SSFLYGQSSDAIPSANLVEA----NPYYPSYVSPTKYTYDDYA 127 Query: 558 VFNEPQSQSGKMAPLSPGGNLWTGMSGE--------EQWRRKDKNEAGSLNYNSSLNQGT 713 QS S +LW E E + K+ N Y +QG Sbjct: 128 -----QSLS----------SLWDSREWEFSRKLELGESFCSKEMNVPDLSIYQDYADQGA 172 Query: 714 PAVDGLSKCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRT 893 + GL+ E + +D + + H G + + LD K P SR Sbjct: 173 HSSKGLNTFEQKN-----NNLDMLGSEQHQGSINREQLDYK--SFTGQISEFMPVEYSRK 225 Query: 894 SVSDPTSLFQEMPYHPVTSSESVLKSWSHLNPNTASYERCV-------SDLDSIATNPTV 1052 SV TSLF E + +TS E +SWSH P AS E+ +D+ S+ + V Sbjct: 226 SVHGSTSLFPET--YSLTSYEQG-RSWSHQTPYGASCEKGAKQHGISPNDISSVKKSSPV 282 Query: 1053 FYPSS--IYSSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACDNKEA----SSSRTQV 1214 S +++SL SP ++ +F EN + D N + SS +V Sbjct: 283 HVVKSQAVFTSL-----SPPSTVSFNN---LENSSGVIASNDNLSNMKEFYPLHSSEGKV 334 Query: 1215 HTEGKEGCGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITD 1394 H + + R ++ F F + + S ++ + ++P QI D Sbjct: 335 HFDAGQVSFHLERGSHIFP-KLPFEKKEKLSSNVSVIKDPLKEKP---------GLQIPD 384 Query: 1395 VNLPNAFTLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAP------------TSQH 1538 + P + +L+ E+LD +N +DSPCWKGAP T QH Sbjct: 385 IG-PGSVSLM-LANNRAINCSEGSSESLDHYNPAVDSPCWKGAPDYHSPVESSGPVTLQH 442 Query: 1539 FP-----CGITEAGRFNGLNLQGPQILPFDANLVYSENGSAEDGLSYFPKRSSP------ 1685 G G + PQ P D + Y E+G E+ PKRSS Sbjct: 443 INKIEACSGSNSIGPTDNSGKVSPQ-KPSDYSF-YQEHGYLENDPESSPKRSSRANLLFE 500 Query: 1686 ----DVSLSTEKYELKDVVKAGSDHSEMRNENVIQCFFPSNLKNCSETKSSQMMKMNQNE 1853 D L T Y++K G S+ ++ +N + + + ++ + E Sbjct: 501 EHGYDRDLKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVE 560 Query: 1854 DK--------------DPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLV 1991 +K D G+ + + SSHV E + SS ++VP L + Sbjct: 561 NKLTFERKCELGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARLNK--- 617 Query: 1992 GASDTLGGCQL-PRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAY 2165 L G QL P++ +R +++ MHNLSELLL + S D+ LKEHD L+ V+NNLD Sbjct: 618 -----LHGEQLAPQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKC 672 Query: 2166 VSK-VGLVRPIPESDLSQSGT----------------------FSFGKLTVPNESQITNA 2276 +SK +G PI ES L+Q + +F L PN + Sbjct: 673 ISKRMGPEAPIQESLLTQKSSEFIREFPELHEGVTVSSPKETKAAFSVLNQPNYQHVQEQ 732 Query: 2277 EANDV-----QNQC-DFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIK 2438 + D+ +C DF S +D+M QAIK Sbjct: 733 RSPDIAAGKKSEKCSDFT-------SQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIK 785 Query: 2439 KALMENF-HEEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKC 2585 K L +NF EEDE Q L++NLWLE+EAALCS+ K RF RMK+++E C Sbjct: 786 KVLSDNFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRMKIELENC 835 >ref|XP_007039225.1| Uncharacterized protein isoform 6 [Theobroma cacao] gi|508776470|gb|EOY23726.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 827 Score = 143 bits (361), Expect = 5e-31 Identities = 225/866 (25%), Positives = 342/866 (39%), Gaps = 77/866 (8%) Frame = +3 Query: 219 LAPPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFGSSPFDNWPPLQPDT-SLPNPFND 395 LAPPFTV SIPK + P D LG P NW P T + P P Sbjct: 33 LAPPFTVDR-SIPKPAAT------PLVD-----LG---EPL-NWLDSNPYTFNSPQPAQL 76 Query: 396 PNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRP--YFPEYPSTTVHDNSSSVVFNE 569 P L + PTPT +Y + P Y+P Y S +H + FNE Sbjct: 77 PQLDLE---------------PTPTPSYNQNSDLFEPKTYYPSYVSPPLHVPT----FNE 117 Query: 570 PQSQSGKMAPLSPGGNLWTGMSGEE-----QWRRKDKNEAGSLNYNSSLNQGTPAVDGLS 734 QS G GG LW G+ + K+ + A S Y +N G L Sbjct: 118 -QSLPGLDHTAQWGGGLWDWEKGKPAQLGGSFYLKETSVAPSSIYMDHINLGAHPSKSLK 176 Query: 735 KCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVTPFNMSRTSVSDPTS 914 CE++S R+ GP +I+ LD + P + +TSV +S Sbjct: 177 TCEETSYNIYSP------REDQAGPANIEKLD--YNPVLGQNPSFMPVDYLKTSVIGSSS 228 Query: 915 LFQEM-----PYHPVTSSESVLKSWSHLNPNTASYERCVSD-LDSIATNP-TVFYPSSIY 1073 E P + V + ++ + + +SD + S+ ++P V P ++ Sbjct: 229 AISEANLQAPPLNLVNCKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVVIRPPAVG 288 Query: 1074 SSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACDNKEASSSRTQVHTE------GKEG 1235 +S +S NS +F+ N A D A ++R V G + Sbjct: 289 TS-----SSASNSVSFKNVNTGIN---------ATDTNLAGNNRFIVEEPRFLFNFGSKN 334 Query: 1236 CGDTSRTNNGFDVNNQFSMDSLMKKDFSSYRNSITDEPFNHLYNPKFKSQITDVNLPNAF 1415 D + + D N S +S + S RN +D F S+I+ N AF Sbjct: 335 EFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMASDNFFGAKSGVNL-SRISPDNFSLAF 393 Query: 1416 TLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGAPTSQHFPCGITE------AGRFNG 1577 E+LD +N +DSPCWKGAP S + P G +E A + Sbjct: 394 E-----NNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEA 448 Query: 1578 LNLQGPQILPFDAN----------------LVYSENGSAEDGLSYFPKRSSPDVSL-STE 1706 + +L F ++ L+ ENG+ EDG K P VS+ S + Sbjct: 449 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKL--PPVSIPSFK 506 Query: 1707 KYELKDVVKAGS---------------DHSEMRNENVIQCFFPSNLKNCSETK------- 1820 ++E + KAGS + SE + + V+ ++ S T Sbjct: 507 EHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEG 566 Query: 1821 --SSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLVG 1994 +S+ + ++ D + + D + SSHV+ + +S SS + V + Sbjct: 567 RLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK------- 619 Query: 1995 ASDTLGGCQLPRIGLRVVVNAMHNLSELLLSY-SEDVNALKEHDHVVLRHVLNNLDAYVS 2171 + LG + + V+V+ M NLSELLL + S + L+E D L V+NNLD +S Sbjct: 620 HTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMS 679 Query: 2172 K-VGLVRPIPESDLSQSGTFSF------GKLTVPNESQITNAEANDVQNQCDFNASEPKM 2330 K +G E+ LS+ F G+ ++ +E + + D + ++ Sbjct: 680 KNIG-----QETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQHTQV 734 Query: 2331 HSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAIKKALMENFHEEDE-HPQTSLFKNLW 2507 ND M QAIKK L+ENFHE++E HPQ L+KNLW Sbjct: 735 KRKHFGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLW 794 Query: 2508 LESEAALCSMACKTRFARMKMKMEKC 2585 LE+EAALCS+ R+ MK+++EKC Sbjct: 795 LEAEAALCSINYMARYNNMKIEIEKC 820 >ref|XP_004235030.1| PREDICTED: uncharacterized protein LOC101252062 [Solanum lycopersicum] Length = 1175 Score = 132 bits (331), Expect = 2e-27 Identities = 293/1253 (23%), Positives = 452/1253 (36%), Gaps = 145/1253 (11%) Frame = +3 Query: 216 PLAPPFTVG-AYSIPKKNSLGHFVEHPY-----FDESHHYLGFGSSPFD-NWPPLQPDTS 374 PLAPPFTV + S L +F + Y F +S Y SP N+ P D S Sbjct: 21 PLAPPFTVDRSNSKTVSTQLLNFSDSSYTGTVPFGQSWQYAAADPSPTGYNFFPSVTD-S 79 Query: 375 LPNPFNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYS-TPSITRP-YFPEYPSTTVHDNS 548 +P N P P + P PT+N + T S R Y+ Y + V + Sbjct: 80 VPTTCNMPLSPEFTPADSVEPGSHFWSTPNPTVNASTETYSFGREGYYAAYVPSLVSNEH 139 Query: 549 SSVVFNEPQ----------------SQSGKMAPLS----PGGNLWTGMSGEEQWRRK--- 659 S FNEP SQ LS P + ++ ++ +Q +K Sbjct: 140 PSSAFNEPSLDVLPNSGNIHVDASSSQVDYTQTLSGLEYPHWSFFSKVADGKQEEKKGVD 199 Query: 660 ------DKNEAGSLNYNSSLNQGTPAVDGLSKCEDSSAVWLGKCVDSMARKCHIGPKSID 821 + N S Y + +++G E+S A +D + + GP SI Sbjct: 200 GSFSSGNVNVGASYGYRNCMSKGNSLEGANIPRENSGAA---NFIDGV----YTGPSSIG 252 Query: 822 LLDTKVSXXXXXXXXVTPFNMSRTSVSDPTSLFQEMPYHPVTSSESVLKSWSHLNPNTAS 1001 +D K P S TS + S F + S S ++ NP T Sbjct: 253 HMDAK------SYLTQEPIYQSLTSETAMGS-FSPVSCQVGLSLGSSSNYLNYKNPFTP- 304 Query: 1002 YERCVSDLDSIATNPTVFYPSSIYSSLAHVYNSPGNSSTFEKEFLSENMNHFEKPWDACD 1181 + + LDS + T S+ SS V+ + S F + + N D C Sbjct: 305 HGKFFQPLDSCPRDTT----STSKSSPVLVFRPAPSGSRFFAPKIDLHKNV-----DICK 355 Query: 1182 NKEASSSRTQVHTEGKEGCGDTSRTNNGFDVNN---QFSMDSLMKKDFSSYRNSI-TDEP 1349 ++ ++ V C +++ +FS+ S DF +N+ Sbjct: 356 TGATNTEKSDV-------CNVLKSQETRLPIDSPIKEFSLGSSTPPDFDKIKNNFFASSS 408 Query: 1350 FNHLYN--PKFKSQITDVNLPNAFTLVHTXXXXXXXXXXXXXETLDQFNSVLDSPCWKGA 1523 N+L + P + I + + + LD N +DSPCWKGA Sbjct: 409 VNNLCSTRPCSSNSIEIAVKERSGSQAPCASAPPVTSAEKCSDALDLHNPNVDSPCWKGA 468 Query: 1524 PTSQ---------HFPCGITEAGRFNGLNLQGPQILPFDAN-----------LVYSENGS 1643 P + PC +T F+ P + + +++ N Sbjct: 469 PAFRVSLSDSVEAPSPCILTSKVEFSDFGQSNHLFPPAEYSGKTSLKKLGEENLHNHNVY 528 Query: 1644 AEDGLSYFPKRSSPDVSLSTEKYELKDVVKAGSDHSEMRNENVIQCFF-----PS---NL 1799 A +GLS P + + +TE+ DV K ++ + VI F PS +L Sbjct: 529 AGNGLSV-PSVGTVTNNYTTEELRTIDVTKGTFVPVDLSSNGVILKFSEDLNKPSKGYSL 587 Query: 1800 KNCSETKSSQMMKMNQNEDKD-----------------PGIDVKDAALDDSSHVTFGDGE 1928 SE + ++ D G+++ D + V E Sbjct: 588 PQYSENDCQKQYSWGEHLSVDCHQYGPKKHNLPEGYMHTGLNLNDTL--EGGVVALDAAE 645 Query: 1929 RISRLLSSGDSV----PIELGEPLVGASDTLGGCQLPRIGLRVVVNAMHNLSELLLSYS- 2093 + R +S + P ++G P++ ++ +V+A+HNLSELL S Sbjct: 646 NVLRSPASQEDAKQAQPYQMGSS-------------PKLDVQTLVHAIHNLSELLKSQCL 692 Query: 2094 EDVNALKEHDHVVLRHVLNNLDAYVSKVGLVRPIPESDLSQSGTFSFGKLTVPNESQITN 2273 + L+ D+ L+ + NL A V+ I D + +F +L + S + Sbjct: 693 PNACLLEGQDYDTLKSAITNLGACT-----VKKIETKDTMVTEHDTFERLKESHRSYMGT 747 Query: 2274 AEAND------VQNQCDFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMIQAI 2435 N ++ C + ++P + ++QAI Sbjct: 748 ETGNPQFMEEVARDSCGLD-NQPMPEDKSKNNGKKTENSPLLTSADDLGDSNEEQVVQAI 806 Query: 2436 KKALMENF-HEEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQTKDV 2612 KK L ENF +E PQ LFKNLWLE+EA LCS++ K+RF RMK++MEK Sbjct: 807 KKVLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRMKIEMEK---------- 856 Query: 2613 VGMPINADKLSGSKISVDLYRDDMVASKTKGSVTSNISTHDTSQSSNTSCAEDVEASVMD 2792 + S DL + VA + K S IS+ S SS V+ S+M+ Sbjct: 857 ------------HRFSQDLNLNSSVAPEAKNDSASKISSQSPSTSSKN---VHVDYSLME 901 Query: 2793 RYRILKCRVD--NSISINKEEKLPDPVDEAKAAPCSMEDKAQSARLKFEMERCNQNH--- 2957 R+ IL R + NS KEE D K D S +K + R N+ Sbjct: 902 RFNILNRREEKLNSSFFMKEEN-----DSVKVG----SDSEDSVTMKLNILRKQGNNFSS 952 Query: 2958 ---------------VEDVVGLPINV-----DQLQSSKVSFVLDGDDMAASKAKGTVTPN 3077 ED V N+ + L+SS + D D+ A+ A+ +V Sbjct: 953 SFMQEKKASDIVSSDTEDSVMERFNILRRREENLKSSFMGEKKD-QDVIANDAEDSVKVR 1011 Query: 3078 ISTL----DTSQSSNMS-------VTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNPC 3224 ++ L D SS M VT D + SVM RF +L R DN S E + Sbjct: 1012 LNILRQREDNLNSSFMEETKDPDMVTNDAEDSVMARFNVLTRRGDNLNSPFMEVKK---- 1067 Query: 3225 DIG-AYPGSESVQNFGI-------KPQLPELVDMGFAERRVKWPFIRMMRTEDEADLDPD 3380 D+ GS ++N G+ + ++D F I + D Sbjct: 1068 DLNMVAAGSADMENHGMINGEVSNDQRANVVIDPYFYHHS-----INSSEGYNSFGSYTD 1122 Query: 3381 LSEKQDPVKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEE 3539 D +K F V+DD P R G + YD+ SSDWEHV K+E Sbjct: 1123 -GSGYDSMKQFLLSVADD-PIVHSNRKARLGNHHSSGLYDNSSSDWEHVAKDE 1173 >ref|XP_006347526.1| PREDICTED: uncharacterized protein LOC102592566 isoform X1 [Solanum tuberosum] Length = 1173 Score = 125 bits (315), Expect = 1e-25 Identities = 291/1246 (23%), Positives = 459/1246 (36%), Gaps = 138/1246 (11%) Frame = +3 Query: 216 PLAPPFTVG-AYSIPKKNSLGHFVEHPY-----FDESHHYLGFGSSPFD-NWPPLQPDTS 374 PLAPPFTV S L +F + Y F +S Y SP N+ P D S Sbjct: 21 PLAPPFTVDRTNSKTGSTQLLNFSDSSYTGTVPFGQSWQYGAANPSPTGYNFFPSVTD-S 79 Query: 375 LPNPFNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYS-TPSITRP-YFPEYPSTTVHDNS 548 +P N P + P + PT++ + T S R Y+ Y + V + Sbjct: 80 VPTTCNMPLSPEFSPADSVEPGSHFWSTSNPTVHASTDTYSFGREGYYAPYVPSIVSNEH 139 Query: 549 SSVVFNEPQ---------------SQSGKMAPLS----PGGNLWTGMSGEEQWRRK---- 659 S FNEP SQ LS P + ++ ++ +Q R Sbjct: 140 PSAAFNEPSLDVLPNSGSIHVDASSQVDYTQSLSGLEYPHWSFFSKVADGKQDERNGVDG 199 Query: 660 -----DKNEAGSLNYNSSLNQGTPAVDGLSKCEDSSAVWLGKCVDSMARKCHIGPKSIDL 824 + N S Y + ++QG +++G++ + S G +D + + GP S+ Sbjct: 200 SFSLGNVNAGASYGYRNCMSQGN-SLEGVNIAREDSGA--GNFIDGV----YTGPSSMGH 252 Query: 825 LDTKVSXXXXXXXXVTPFNMSRTSVSDPTSLFQEMPYH-----PVTSSESVLKSWSHLNP 989 +D K S++ T++ +P + SS + L ++ NP Sbjct: 253 MDAKSYLTQEPIYQ---------SLNSETAMGSILPVSCQVGLSLGSSNNYL---NYENP 300 Query: 990 NTASYERCVSDLDSIATNPTVFYPSSIYSSLAHVYNSPGNSSTFEKEF-LSENMN----- 1151 T +E+ LDS + T +S S + + +P S F + L +N++ Sbjct: 301 FTP-HEKFFQPLDSCPRDTT---STSKSSPVVVIRPAPSGSRFFAPKIDLHKNVDICKTG 356 Query: 1152 --HFEKPWDACDNKEASSSRTQVHTEGKEGCGDTSRTNNGFDVNNQFSMDSLMKKDFSSY 1325 + EK D CD ++ +R + + KE FS+ S DF Sbjct: 357 ATNSEKS-DVCDLLKSQETRLPIDSPIKE-----------------FSLGSSTPLDFDKI 398 Query: 1326 RNSI-TDEPFNHLYN--PKFKSQITDVNLPNAFTLVHTXXXXXXXXXXXXXETLDQFNSV 1496 +N N+L + P + I + + + LD N Sbjct: 399 KNIFFASSSVNNLCSTRPCSSNSIEIAVKERSGSQAPCASAPPVTFAEKCSDALDLHNPN 458 Query: 1497 LDSPCWKGAPTSQ---------HFPCGITEAGRFNGLNLQGPQILPFDAN---------- 1619 +DSPCWKGAP + PC T F + P P + + Sbjct: 459 VDSPCWKGAPAFRISLGDSVDASSPCLFTSKVEFADFSQSNPLFPPAEYSGKTSLKKLGE 518 Query: 1620 -LVYSENGSAEDGLSYFPKRSSPDVSLSTEKYELKDVVKAGSDHSEMRNENVIQCFFPSN 1796 +++ N A +GLS P + + +TE+ DV K ++ + I F Sbjct: 519 ENLHNHNVYAGNGLSV-PSVGTGTNNYTTEELRTIDVTKETFVPMDLSSNGGIPKFSEDL 577 Query: 1797 LKNCSETKSSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIEL 1976 K +K + + ++N+ + K ++D + G + L + L Sbjct: 578 NK---PSKGYSLPQYSENDCQLQYSWGKHLSVDGHQY-----GPKKHNLPEGYMHTGLSL 629 Query: 1977 GEPLVGASDTLGGCQ----------------------LPRIGLRVVVNAMHNLSELLLSY 2090 + L G L + P++ ++ +V+A+HNLSELL S Sbjct: 630 NDTLEGGVVALDAAENVLRSPASQEDAKQAQQYQMGSSPKLDVQTLVHAIHNLSELLKSQ 689 Query: 2091 S-EDVNALKEHDHVVLRHVLNNLDAYVSKVGLVRPIPESDLSQSGTFSFGKLTVPNESQI 2267 + L+ D L+ + NL A +K I D S +F K S Sbjct: 690 CLANACLLEGQDIDTLKSAITNLGACTAK-----KIETKDTMVSQHDTFEKFEESRRS-F 743 Query: 2268 TNAEANDVQ-------NQCDFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMI 2426 E Q + C + ++P + ++ Sbjct: 744 MGTETGHPQFMEEVAWDSCGLD-NQPTPEDKSKNNGKKTENSALLTPADDLGDSNEEQVV 802 Query: 2427 QAIKKALMENF-HEEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQT 2603 QAIKK L ENF +E PQ LFKNLWLE+EA LCS++ K+RF RMK++MEK Sbjct: 803 QAIKKVLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRMKIEMEK------- 855 Query: 2604 KDVVGMPINADKLSGSKISVDLYRDDMVASKTKGSVTSNISTHDTSQSSNTSCAEDVEAS 2783 + S +L + VA + + S I+T S SS + +D S Sbjct: 856 ---------------HRFSQELNLNSSVAPEAENDSASKITTQSPSTSSKSVHIDD---S 897 Query: 2784 VMDRYRILKCRVDN-SISINKEEK-----LPDPVDEAKAAPCSMEDKAQSARLKFEMERC 2945 VM+R+ IL R + S S KEE D D + + ++ F E+ Sbjct: 898 VMERFNILNRREEKLSSSFMKEENDSVKVGSDSEDSVTMRLNILRKQGNNSSSSFMQEKK 957 Query: 2946 NQNHV----EDVVGLPINV-----DQLQSSKVSFVLDGDDMAASKAKGTVTPNISTLDTS 3098 + V ED V N+ D L+SS + D D+ A+ A+ +V ++ L Sbjct: 958 ASDIVSSDTEDSVMERFNILRRREDNLKSSFMGEKKD-QDVVANDAEDSVKVRLNILRQR 1016 Query: 3099 QSSNMS-----------VTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNPCDIGAYPG 3245 + + S VT D + SVM RF +L R DN S E + D+ A G Sbjct: 1017 EDNLNSSFTEETKDPDMVTNDAEDSVMARFNVLTHRGDNLNSPFME--VKKDLDMVA-AG 1073 Query: 3246 SESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDE--------ADLDPDLSEKQDP 3401 S ++N G+ E+ A ++ F E AD D Sbjct: 1074 SADMENHGLIN--GEVSGYQRANVVIEPYFYHHSINSSEGYNSFGSYAD-----GSGYDS 1126 Query: 3402 VKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEE 3539 +K F V+DD P R G + YD+ SSDWEHV K+E Sbjct: 1127 MKQFLLSVADD-PIVHSNRKARLGNHHSSGLYDNSSSDWEHVAKDE 1171 >ref|XP_002300521.2| hypothetical protein POPTR_0001s45660g [Populus trichocarpa] gi|550349961|gb|EEE85326.2| hypothetical protein POPTR_0001s45660g [Populus trichocarpa] Length = 911 Score = 124 bits (311), Expect = 3e-25 Identities = 212/893 (23%), Positives = 345/893 (38%), Gaps = 116/893 (12%) Frame = +3 Query: 216 PLAPPFTVGAYSIPKKNSLGHFVEHPYFDESHHYLGFGSSPF---DNWPP--LQPDTSLP 380 PLA PFT+ K N+ + D S F +PP LQ Sbjct: 15 PLARPFTLS-----KPNNSSFSPSNSELDRSFSSFSLEGDSFAFYPQYPPGVLQVSADFS 69 Query: 381 NPFNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYSTPSITRPYFPEYPSTTVHDNSSSVV 560 + D +L+ + S + ++ + +S T Y+P+YPS + ++ + Sbjct: 70 SLARDSSLQDTSFNPSSSMLDRSF-------SSFSLEGDTFAYYPQYPSGVLQGSADFGL 122 Query: 561 FNEPQSQSG-----KMAPLSPGGNLWTGMSGEEQWRRKDKNEAGSLNYNSSLNQ------ 707 F E +S K L + + G W KDK+ S+ + S Q Sbjct: 123 FTESKSDFDAVPVTKSTELGYEAHKSGDLRGILHW--KDKHGGFSMFNDDSTKQALVHML 180 Query: 708 -----GTPAVDGLSKCEDSSAVWLGKCVDSMARKCHIGPKSIDLLDTKVSXXXXXXXXVT 872 G+PA +GL ++S GK + + PK +D++ + Sbjct: 181 FIFPAGSPA-EGLKLSPETSDSLCGKLSGISLKDHEVRPKRTREIDSQCVPISLKFSTTS 239 Query: 873 PFNMSRTSVSDPTSLFQEMPYHPVTSSESVLKSWSHLNPNTASYERCVSD-LDSIATNPT 1049 N S + DP S +P SV SWS + N A + R +S LD A Sbjct: 240 DLNSSAI-LQDPQSGINYLP-------PSV--SWSSCDTNIAYFGRSLSQQLDFHAAKQN 289 Query: 1050 VFYPSSIYSSLAHVYNSPGNSSTFEKEF---LSENM----------NHF--------EKP 1166 V PSS +SL + + P +ST F LSEN+ N+F +KP Sbjct: 290 V-PPSSDINSLPVLVSEPSVASTGYLPFNHVLSENLDSDGDGGVSKNNFLGYGQASLKKP 348 Query: 1167 WDACDNKEASSSRTQVHTEGKEGCGDTSRTNNGFD----VNNQFSM-------DSLMKKD 1313 D + + +GKEG T+ + ++ + D ++ D Sbjct: 349 HAVVDKSKEVFHNKVLTDKGKEGKMGKPVTHKVMEPVPMAKSELQITCPSPPIDLTLEVD 408 Query: 1314 FSS----------------YRNSITDEPFNHLYNPKFKSQITDVNLPNAFTLVHTXXXXX 1445 S +T E + K + QIT +L TL + Sbjct: 409 KSKEVFHHKVLADKGKEGKLGKPVTHEVMEPVPMAKSELQITCPSLLIDLTL-ESLGIKE 467 Query: 1446 XXXXXXXXETLDQFNSVLDSPCWKGAPTSQHFPCGITEAGRFNGL----------NLQGP 1595 + +++ +S LDSPCWKG ++ C ++ F L N P Sbjct: 468 SDPIENSSKIINENDSDLDSPCWKGKLAAEQSSCEVSVPDNFQHLKSEQEACSYLNPLAP 527 Query: 1596 QILPFDANLVYSENGSAEDGLSYFP--KRSSPDVSLSTEKYELKDVVKAGSDHSEMRNEN 1769 P + G+ DG F K +S V+L + + L+ AGS SE + Sbjct: 528 HFFPSSDKQKVNYCGNEGDGNDCFSFQKTASSVVNLVSREQRLQHSATAGSSSSEQSSIT 587 Query: 1770 VIQCFFPSNLKN-----CSETKSSQM------MKMNQNEDKDP--------------GID 1874 C+ ++ N +++ SS M + + ED G Sbjct: 588 EAHCYSDMHVPNKEYELLTDSSSSSMHGSSCVVLPSVLEDYFTSSGQLLTGQCVGGFGKA 647 Query: 1875 VKDAALDDSSHVTFGDGERISRLLSSGDSVPIELGEPLVGASDTLGGCQLPRIGLRVVVN 2054 +KD A + S+ V+ + + S + V +L E GA+ L C PR+ ++VV Sbjct: 648 IKDTAPNGSTSVSLFASKHVFDSSSCREGVSTDLSETYGGATKPL--CSPPRLDFQIVVK 705 Query: 2055 AMHNLSELLL-SYSEDVNALKEHDHVVLRHVLNNLDAYV-SKVGLVRPIPESDLSQSGTF 2228 M+ LSELL+ + + D+++L EH+H +++ +++NL + ++VG + ES + ++ Sbjct: 706 TMNELSELLMQNCTNDLDSLNEHEHDIIKRIIHNLTLCIRNRVGEHTLMSESSHPHT-SY 764 Query: 2229 SFGKLTVPN-----ESQITNAEANDVQNQCDF-NASEPKMHSTMXXXXXXXXXXXXXXXX 2390 K T N E Q T +A V ++ N E +M ST Sbjct: 765 CVRKSTHLNKCSNMELQTTRTKAVMVSHELGHQNKHERQMSSTSFRERFLDSLNARNGGF 824 Query: 2391 XXXXXXXNDDMIQAIKKALMENFH-EEDEHPQTSLFKNLWLESEAALCSMACK 2546 N+D+ Q +KAL ++ EE+E+PQ +KNLWLE+EAALCSM K Sbjct: 825 NK-----NEDITQVNEKALEGHYELEEEENPQVLFYKNLWLEAEAALCSMKYK 872 >ref|XP_006347527.1| PREDICTED: uncharacterized protein LOC102592566 isoform X2 [Solanum tuberosum] Length = 1166 Score = 122 bits (307), Expect = 1e-24 Identities = 290/1246 (23%), Positives = 456/1246 (36%), Gaps = 138/1246 (11%) Frame = +3 Query: 216 PLAPPFTVG-AYSIPKKNSLGHFVEHPY-----FDESHHYLGFGSSPFD-NWPPLQPDTS 374 PLAPPFTV S L +F + Y F +S Y SP N+ P D S Sbjct: 21 PLAPPFTVDRTNSKTGSTQLLNFSDSSYTGTVPFGQSWQYGAANPSPTGYNFFPSVTD-S 79 Query: 375 LPNPFNDPNLRTDAIGGMSPPYEGTYGLPTPTLNYYS-TPSITRP-YFPEYPSTTVHDNS 548 +P N P + P + PT++ + T S R Y+ Y + V + Sbjct: 80 VPTTCNMPLSPEFSPADSVEPGSHFWSTSNPTVHASTDTYSFGREGYYAPYVPSIVSNEH 139 Query: 549 SSVVFNEPQ---------------SQSGKMAPLS----PGGNLWTGMSGEEQWRRK---- 659 S FNEP SQ LS P + ++ ++ +Q R Sbjct: 140 PSAAFNEPSLDVLPNSGSIHVDASSQVDYTQSLSGLEYPHWSFFSKVADGKQDERNGVDG 199 Query: 660 -----DKNEAGSLNYNSSLNQGTPAVDGLSKCEDSSAVWLGKCVDSMARKCHIGPKSIDL 824 + N S Y + ++QG +++G++ + S G +D + + GP S+ Sbjct: 200 SFSLGNVNAGASYGYRNCMSQGN-SLEGVNIAREDSGA--GNFIDGV----YTGPSSMGH 252 Query: 825 LDTKVSXXXXXXXXVTPFNMSRTSVSDPTSLFQEMPYH-----PVTSSESVLKSWSHLNP 989 +D K S++ T++ +P + SS + L ++ NP Sbjct: 253 MDAKSYLTQEPIYQ---------SLNSETAMGSILPVSCQVGLSLGSSNNYL---NYENP 300 Query: 990 NTASYERCVSDLDSIATNPTVFYPSSIYSSLAHVYNSPGNSSTFEKEF-LSENMN----- 1151 T +E+ LDS + T +S S + + +P S F + L +N++ Sbjct: 301 FTP-HEKFFQPLDSCPRDTT---STSKSSPVVVIRPAPSGSRFFAPKIDLHKNVDICKTG 356 Query: 1152 --HFEKPWDACDNKEASSSRTQVHTEGKEGCGDTSRTNNGFDVNNQFSMDSLMKKDFSSY 1325 + EK D CD ++ +R + + KE FS+ S DF Sbjct: 357 ATNSEKS-DVCDLLKSQETRLPIDSPIKE-----------------FSLGSSTPLDFDKI 398 Query: 1326 RNSI-TDEPFNHLYN--PKFKSQITDVNLPNAFTLVHTXXXXXXXXXXXXXETLDQFNSV 1496 +N N+L + P + I + + + LD N Sbjct: 399 KNIFFASSSVNNLCSTRPCSSNSIEIAVKERSGSQAPCASAPPVTFAEKCSDALDLHNPN 458 Query: 1497 LDSPCWKGAPTSQ---------HFPCGITEAGRFNGLNLQGPQILPFDAN---------- 1619 +DSPCWKGAP + PC T F + P P + + Sbjct: 459 VDSPCWKGAPAFRISLGDSVDASSPCLFTSKVEFADFSQSNPLFPPAEYSGKTSLKKLGE 518 Query: 1620 -LVYSENGSAEDGLSYFPKRSSPDVSLSTEKYELKDVVKAGSDHSEMRNENVIQCFFPSN 1796 +++ N A +GLS P + + +TE+ DV K ++ + I F Sbjct: 519 ENLHNHNVYAGNGLSV-PSVGTGTNNYTTEELRTIDVTKETFVPMDLSSNGGIPKFSEDL 577 Query: 1797 LKNCSETKSSQMMKMNQNEDKDPGIDVKDAALDDSSHVTFGDGERISRLLSSGDSVPIEL 1976 K +K + + ++N+ + K ++D + G + L + L Sbjct: 578 NK---PSKGYSLPQYSENDCQLQYSWGKHLSVDGHQY-----GPKKHNLPEGYMHTGLSL 629 Query: 1977 GEPLVGASDTLGGCQ----------------------LPRIGLRVVVNAMHNLSELLLSY 2090 + L G L + P++ ++ +V+A+HNLSELL S Sbjct: 630 NDTLEGGVVALDAAENVLRSPASQEDAKQAQQYQMGSSPKLDVQTLVHAIHNLSELLKSQ 689 Query: 2091 S-EDVNALKEHDHVVLRHVLNNLDAYVSKVGLVRPIPESDLSQSGTFSFGKLTVPNESQI 2267 + L+ D L+ + NL A +K I D S +F K S Sbjct: 690 CLANACLLEGQDIDTLKSAITNLGACTAK-----KIETKDTMVSQHDTFEKFEESRRS-F 743 Query: 2268 TNAEANDVQ-------NQCDFNASEPKMHSTMXXXXXXXXXXXXXXXXXXXXXXXNDDMI 2426 E Q + C + ++P + ++ Sbjct: 744 MGTETGHPQFMEEVAWDSCGLD-NQPTPEDKSKNNGKKTENSALLTPADDLGDSNEEQVV 802 Query: 2427 QAIKKALMENF-HEEDEHPQTSLFKNLWLESEAALCSMACKTRFARMKMKMEKCNNQHQT 2603 QAIKK L ENF +E PQ LFKNLWLE+EA LCS++ K+RF RMK++MEK Sbjct: 803 QAIKKVLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRMKIEMEK------- 855 Query: 2604 KDVVGMPINADKLSGSKISVDLYRDDMVASKTKGSVTSNISTHDTSQSSNTSCAEDVEAS 2783 +R VA + + S I+T S SS + +D S Sbjct: 856 ----------------------HRFSQVAPEAENDSASKITTQSPSTSSKSVHIDD---S 890 Query: 2784 VMDRYRILKCRVDN-SISINKEEK-----LPDPVDEAKAAPCSMEDKAQSARLKFEMERC 2945 VM+R+ IL R + S S KEE D D + + ++ F E+ Sbjct: 891 VMERFNILNRREEKLSSSFMKEENDSVKVGSDSEDSVTMRLNILRKQGNNSSSSFMQEKK 950 Query: 2946 NQNHV----EDVVGLPINV-----DQLQSSKVSFVLDGDDMAASKAKGTVTPNISTLDTS 3098 + V ED V N+ D L+SS + D D+ A+ A+ +V ++ L Sbjct: 951 ASDIVSSDTEDSVMERFNILRRREDNLKSSFMGEKKD-QDVVANDAEDSVKVRLNILRQR 1009 Query: 3099 QSSNMS-----------VTEDVKSSVMDRFRILQSRIDNTISINKEGRLPNPCDIGAYPG 3245 + + S VT D + SVM RF +L R DN S E + D+ A G Sbjct: 1010 EDNLNSSFTEETKDPDMVTNDAEDSVMARFNVLTHRGDNLNSPFME--VKKDLDMVA-AG 1066 Query: 3246 SESVQNFGIKPQLPELVDMGFAERRVKWPFIRMMRTEDE--------ADLDPDLSEKQDP 3401 S ++N G+ E+ A ++ F E AD D Sbjct: 1067 SADMENHGLIN--GEVSGYQRANVVIEPYFYHHSINSSEGYNSFGSYAD-----GSGYDS 1119 Query: 3402 VKDFSTFVSDDEPETQFYDTDRFGTWVPTSGYDSPSSDWEHVLKEE 3539 +K F V+DD P R G + YD+ SSDWEHV K+E Sbjct: 1120 MKQFLLSVADD-PIVHSNRKARLGNHHSSGLYDNSSSDWEHVAKDE 1164