BLASTX nr result
ID: Achyranthes23_contig00001598
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00001598 (3162 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

gb|EMJ21784.1| hypothetical protein PRUPE_ppa000352mg [Prunus pe...  180  4e-42
ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853...  167  3e-38
ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Popu...  150  3e-33
ref|XP_006581690.1| PREDICTED: uncharacterized protein LOC100807...  145  8e-32
gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis]  144  2e-31
ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301...  139  6e-30
ref|XP_003526770.2| PREDICTED: uncharacterized protein LOC100807...  139  8e-30
ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citr...  134  3e-28
ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Popu...  134  3e-28
gb|EOY23722.1| Uncharacterized protein isoform 2 [Theobroma cacao]  133  4e-28
gb|EOY23725.1| Uncharacterized protein isoform 5 [Theobroma cacao]  131  2e-27
ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus c...  131  2e-27
gb|EOY23723.1| Uncharacterized protein isoform 3 [Theobroma cacao]  130  4e-27
gb|EOY23721.1| Uncharacterized protein isoform 1 [Theobroma caca...  130  4e-27
ref|XP_003523306.2| PREDICTED: uncharacterized protein LOC100778...  128  1e-26
ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citr...  125  1e-25
gb|EOY23726.1| Uncharacterized protein isoform 6 [Theobroma cacao]  124  3e-25
ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628...  120  4e-24
ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citr...
108 1e-20 ref|XP_006347527.1| PREDICTED: uncharacterized protein LOC102592... 102 8e-19 >gb|EMJ21784.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica] Length = 1254 Score = 180 bits (456), Expect = 4e-42 Identities = 232/988 (23%), Positives = 405/988 (40%), Gaps = 132/988 (13%) Frame = -1 Query: 2979 SASVSKETPQLQEPVLKSISSFCGSQKGNVLSP---EGHFREPGFFMFGCASNTSTFPKS 2809 S SV + EP LK+ SS C ++ N +P ++ + S + + P Sbjct: 287 SKSVMGSLSVVPEPHLKAPSSQCVTKTSNCKTPYSVSSETQQLDASLDYITSISESSPAF 346 Query: 2808 ITELPYGGT-ISPVGSTPSAVKRNSMGLDAADQSNTTGGYNFTYQMESQM-------LLG 2653 T P GT +S G+ +R + DAAD + G Y + ES + +L Sbjct: 347 ATRTPALGTKLSEPGT--GLFRRLNFISDAADTDH--GDYYSSGVQESHLPQISEGKVLF 402 Query: 2652 SKRHSSIRIRKPSSMSTVASSCQLGEIGDSR-LENKVSSEQM-KISPQLPHQ-LSLHGLN 2482 + S +SS + E+ ++R + NK + +++ K P L + + L G Sbjct: 403 DSSQLGFHLGAKDCFSAESSSARNEELSNNRNIINKDAWDKVFKAKPGLQNSHVGLDGFK 462 Query: 2481 IEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHNC 2302 + E + S +S D +NP DSPCWKG + SPFG SE + +KK+ +C Sbjct: 463 MAFKTNETINSFLSSSDNVDPNNPGVDSPCWKGVPGSCFSPFGASEDGVPEQ-IKKLEDC 521 Query: 2301 ---------------KSMSSQD---------------------LQNSSVVSDAFGSKDFS 2230 +++SSQ L+ SV + AFG + Sbjct: 522 SGLNIHMPMFPLSAGENVSSQKPIKNAVEYNEFGWLENGLRPPLKRYSVANSAFGEHKWD 581 Query: 2229 SRKVNETTSFEKPPVNDCALNSNKD----CGSANPSRFDLNTSDGFQFAEDCYASLVECN 2062 + + T+++ +D S +D G+ + S L+ S Q E Sbjct: 582 N---SVKTTYDAETSHDRGPQSYRDGLHQSGNGDKSLGLLDDSHAMQQGHGEDGLATEVK 638 Query: 2061 MLNKSKADISEALPSEECENRTPVAEESLSGLPSSIHNNVSDLLPGNKASL--KSDGEDS 1888 AD+ N E S +PS + NV + A+ KS+GE+S Sbjct: 639 QTWSCVADVKL--------NANDTMEYGSSHVPSHVVENVLCSSAEDAATKLSKSNGEES 690 Query: 1887 NSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQTIAT 1708 ++D+ +L+ T+ NLSE L C + Q+ + +K ++ NL+ + + + Sbjct: 691 MLKVDVQMLVDTLKNLSELLLTNCSNGLCQLKKTDIATLKAVINNLHICI----SKNVEK 746 Query: 1707 TVLSQPDKLFLKEFP--VVEMVNDTEALAASLKLGGTTEDFSEYISSSS--KDDENSFQD 1540 Q F + E+ + L+A L + D + + S K D + ++ Sbjct: 747 
WSPMQESPTFQQNTSQCYAELSEHHKVLSADRPLSASAPDIQDQVIGSIHVKSDIDVVKE 806 Query: 1539 VNMMEAVKKLISEDFQEEDVDQQKLLYKNLWLEAEASLCLMTARARFLRMKTE------- 1381 M +A+K+++SE+F E+ D Q LLYKNLWLEAEA LC + +ARF R+K E Sbjct: 807 DKMTQAIKEILSENFHSEETDPQVLLYKNLWLEAEAVLCSINYKARFNRVKIEMDKCKAE 866 Query: 1380 ------------MKRDHDGIKEKDNPTS--------------PNIPVDIHKDIELGNDNL 1279 MK+ + NP + P++P+ +D L ++ Sbjct: 867 NSKDVFEYTADMMKQSKSEVSPDSNPVNPLTPEAQGCPTSNVPDLPILSQEDEVLARFDI 926 Query: 1278 HSPHL--GEYLKPTTTSHIKVFHDPK-SQVGRVFSESERSTSPKNETSVSSLYVSGTKEN 1108 + + + + + P+ S+V R+ E+ + SP S+ +S T Sbjct: 927 LRGRVENTNSINASNAAELSSKASPEPSKVERIAPEANGTPSP--GISIQDSSISST--- 981 Query: 1107 PCSTSHANDVDTRVMARYQVLKNRVGGP---NTLNYE-------SNQPSGEHKCP----- 973 +D + VMAR+ +L++RV + +N E S +P + P Sbjct: 982 ---IGVTDDYEASVMARFHILRDRVEKSKFISAVNMEEPSSPKVSLEPKTDVIVPDRNDG 1038 Query: 972 -SSSFDVGNTSPASSTNIKSNDLQISDFGAYQILKGRDDTSSCLNFEGQQLPGDENYLPG 796 +S F++ SP S T +ND + S ILK R D S ++ EGQQLP + + Sbjct: 1039 SASEFNLFQDSPPSITTSHANDCEASVMSRLHILKSRVDNCSDMHTEGQQLPEPKIEVIA 1098 Query: 795 NEFTNG---DKSLQDSQMSNSLGVTAESEASIMARFRLIQSRKDHSNFLINDSA------ 643 + ++ + S+QDS +S + + EAS+M+R +++SR D+S+++ + Sbjct: 1099 PDTSDSLMPEFSIQDSPVSRATSQANDCEASVMSRLHILKSRVDNSSYMHREGKQLPEIG 1158 Query: 642 ---------PAQASAVVFEDGTTDMDE--YMTSFGPYPQLPNDEDVNNFDIGANEQAFDP 496 P + E G++D+ E + SF + F + + Sbjct: 1159 GLGNAGKRHPWPIISKRSEGGSSDIKEQPILRSFKADNSEGKLDTAKEFHLFVEDDPLTQ 1218 Query: 495 ILKTSRMSSVILEGWYANNCSSSDWQHI 412 + + ++ + G + N SSSDW+H+ Sbjct: 1219 YFRIHKPANQLPAGGHDN--SSSDWEHV 1244 >ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853355 [Vitis vinifera] gi|302143995|emb|CBI23100.3| unnamed protein product [Vitis vinifera] Length = 1167 Score = 167 bits (423), Expect = 3e-38 Identities = 198/807 (24%), Positives = 336/807 (41%), Gaps = 87/807 (10%) Frame = -1 Query: 2979 SASVSKETPQLQEPVLKSISSFCGSQKGNVLSPEGHFREPGFFMFGCASNTSTFPKSITE 2800 S SV ETP + P L+ +++ +K E FR+ + C + + KS Sbjct: 317 
STSVLPETPHPRAPSLEPVTNSWNYRKPQSALYEKCFRK----IDSCVDDPVSKAKSSPA 372 Query: 2799 LPYGGTISPVGSTPSAVKRNSMG---LDAADQSNTTGGYNFTYQMESQMLL---GSKRHS 2638 + I P ++PS++ NS + D S G++ + E + + G + +S Sbjct: 373 I----VIRPPANSPSSLGVNSFSSRNMICTDNSENVSGHHLSNMEEPHIPVISEGRELYS 428 Query: 2637 SI-----RIRKPSSMSTVASSCQLGEIGDSRLENKVSSEQMKISP--QLPHQLSLHGLNI 2479 ++ +S +SS + E+ ++ + K + ++ Q+PH G + Sbjct: 429 DTSQLNGHWQRNDHLSMESSSTKKHELLNNEMGVKETDNLLRARSELQIPHLNVEDGFSF 488 Query: 2478 EDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHNCK 2299 + +EAV S++ S D +NPA DSPCWKG+ ++ SPF SE++ L++++ Sbjct: 489 SPNSIEAVNSIDNTSETLDHYNPAVDSPCWKGSITSHFSPFEVSEALSPHNLMEQLEALD 548 Query: 2298 SMSSQDLQNSSVVSDAFGSKDFSSRKVNETTSFEKPPVNDCALNS--------------- 2164 + Q + SD + + SS K NE T + K N C N Sbjct: 549 GFNLQGHHIFPLNSD--DAVNVSSLKPNENTEYHK---NVCGENGLLPSWKRPSVVNHPS 603 Query: 2163 ----NKDCGSANPSRFDLNTSDGFQFAEDCYASLVECNMLNKSKAD---ISEALPSEECE 2005 + D P L++ DG Q + D + ++LN SK+D +S + E Sbjct: 604 REQRSLDAFKTGPYCQKLSSGDGNQSSNDIIQPKRDHSLLNSSKSDNLELSHTMRQSFEE 663 Query: 2004 NRTPVAEESLSGLPSSI-HNNVSDL--------------------LPGNKAS---LKSDG 1897 + + SG+ + NN++D+ L G+ AS K Sbjct: 664 VKFTSERKLSSGVGVEVTGNNINDVSRDGSSHETYHLTENISCSPLSGDDASTKLTKQPA 723 Query: 1896 EDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT 1717 +S +ID+++L+ T+ +LS L +C N + E+ +K+++ N +A L GQ Sbjct: 724 SESTPKIDVHMLINTVQDLSVLLLSHCSDNAFSLKEQDHETLKRVIDNFDA-CLTKKGQK 782 Query: 1716 IATTVLSQPDKLFLKEFPVV-----------------------EMVNDTEALAASLKLGG 1606 IA Q FL E P + +D + G Sbjct: 783 IA----EQGSSHFLGELPDLNKSASASWPLGKKVADANVEDQFHCQSDHKGKRHCSVSGN 838 Query: 1605 TTEDFSEYISSSSKDDENSFQDVNMMEAVKKLISEDF-QEEDVDQQKLLYKNLWLEAEAS 1429 E S+++ S +DE++ D + ++A++K++ ++F EE+ D Q LLY+NLWLEAEA+ Sbjct: 839 KDEKLSDFV--SLVNDEDTVNDDSTIQAIRKILDKNFHDEEETDPQALLYRNLWLEAEAA 896 Query: 1428 LCLMTARARFLRMKTEMKRDHDGIKEKDNPTSPNIPVDIHKDIELGNDNLHSPHLGEYLK 1249 LC ++ RARF RMK EM++ + K +L LK Sbjct: 897 LCSISYRARFDRMKIEMEK-----------------FKLRKTEDL-------------LK 926 Query: 1248 
PTTTSHIKVFHDPKSQVGRVFSESERSTSPKNETSVSSLYVSGTKENPCSTSHANDVDTR 1069 T I V S+V S ++ E V + + + N + SHA D Sbjct: 927 NT----IDVEKQSSSKVSSDISMVDKFEREAQENPVPDITIEDS-PNVTTMSHAAD---- 977 Query: 1068 VMARYQVLKNRVGGPNTLNYESNQPSGEHKCPSSSFDVGNTSPAS----STNIKSNDLQI 901 V+ R+ +LK R ++LN + K N +PA+ S NI ++ Sbjct: 978 VVDRFHILKRRYENSDSLNSKDVGKQSSCKVSHDMNSDDNLAPAAKDDHSPNISTSTQSD 1037 Query: 900 SDFGAYQILKGRDDTSSCLNFEGQQLP 820 ++ILK R D S+ +N E QQ P Sbjct: 1038 DVMARFRILKCRADKSNPMNAERQQPP 1064 >ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa] gi|550321678|gb|EEF06077.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa] Length = 1236 Score = 150 bits (379), Expect = 3e-33 Identities = 200/826 (24%), Positives = 337/826 (40%), Gaps = 130/826 (15%) Frame = -1 Query: 2499 SLHGLNIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALV 2320 +L N+ EA+ SVE S D +NPA DSPCWKGA + LS F SE VD + Sbjct: 428 NLDFFNLAMDGHEAIGSVENTSESLDHYNPAVDSPCWKGAPVSHLSAFEISEVVDP-LIP 486 Query: 2319 KKIHNCKSMSSQDLQN-SSVVSDAFGSKDFSSRKVN----------ETTSFEKPPVNDCA 2173 KK+ C +S Q Q S +DA + ++ + S K P++ Sbjct: 487 KKVEACNGLSPQGPQIFPSATNDAVKACPEKQSNISVPLNHESLEHQQVSLFKRPLDAKV 546 Query: 2172 LNSNK--DCGSANPSR------FDLNTSDGFQFAEDCYASLVECNMLNKSKADISEA-LP 2020 L + D G P + + SD + L + N L+ + + + P Sbjct: 547 LFREEIDDAGKYGPYQRIPSYCHEAQISDVIDDETRKESILSDFNSLHTEQRSLEDGEWP 606 Query: 2019 S-----------------EECENRTP--VAEESLSGLPSSIHNNVSDLLPGNKASLKSDG 1897 S ++C + P E+ L PSS H +S G Sbjct: 607 SKKNSYVADVRRKINDDPDDCSSHVPFHAIEQVLCSPPSSEHAPAQHT--------QSQG 658 Query: 1896 EDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT 1717 E+S S++ L+ TM NL+E L Y ++ ++ ++ +K ++ NL+ + + Sbjct: 659 EESLSKMHARTLVDTMHNLAELLLFYSSNDTCELKDEDFDVLKDVINNLDICISKNLERK 718 Query: 1716 IATTVLSQPDK-----------LFLKEFPVVEMVNDTEALAASLKLGGTTEDFSEYISSS 1570 I+T P + L+ + ++ E AS K E S + S+ Sbjct: 719 ISTQESLIPQQATSQFHGKLSDLYKGQLEFQHFEDEEEHKIASDK---RKEKLSNWASTR 775 Query: 1569 
SKDDENSFQDVNMMEAVKKLISEDFQ-EEDVDQQKLLYKNLWLEAEASLCLMTARARFLR 1393 D + +D NM +A+KK+++++F EE+ + Q LLY+NLWLEAEASLC + ARF R Sbjct: 776 CAAD--TVKDDNMTQAIKKVLAKNFPIEEESESQILLYRNLWLEAEASLCSVNYMARFNR 833 Query: 1392 MKTEMKRDH-DGIKEK----DNPTSPNIPVDI-------------------------HKD 1303 MK EM++ H EK +N + P + DI H D Sbjct: 834 MKIEMEKGHSQKANEKSMVLENLSRPKVSSDILPADDKGSPVQDVSFLDSSILSRNSHSD 893 Query: 1302 IELGNDNLHSPHLGEYLKPTTTSHIKVFHDPKSQ----VGRVFSESERSTSPKNETSVSS 1135 + ++ + + +T++ K+ S V ++ +++ ST P S+ Sbjct: 894 DVMARFHILKSRVDDSNSMSTSAVEKLSSSKVSPDLNLVDKLACDTKDSTKP--NVSIQD 951 Query: 1134 LYVSGTKENPCS-TSHANDVDTRVMARYQVLKNRVGGPNTLNYESNQPSGEHKCPSSSFD 958 ++SGT N +SHA+D V+AR+ +LK RV N S S K SS Sbjct: 952 SHMSGTSSNADDVSSHADD----VIARFHILKCRVD-----NSSSGNTSAMEKLSSSKVS 1002 Query: 957 ---------VGNTSPASSTNIKSNDLQISD--------FGAYQILKGRDDTSSCLNFEG- 832 V +T ++ +I D ++ ++ L+GR D + +N Sbjct: 1003 PDLNKVDKMVYDTKDSTKPHITIQDSPMAGRSSHADDVMARFRTLEGRVDNCNSVNISAM 1062 Query: 831 QQLPGD----------ENYLPGNEFTNGDKSLQDSQMSNSLGVTAESEASIMARFRLIQS 682 ++LP + + + T D + QDS + ++ + EA+IMAR +++ Sbjct: 1063 EKLPSSKVSSNLSNVGKLTVEAKDSTKPDITKQDSPLPSTSSHAEDIEAAIMARLLILKH 1122 Query: 681 R---------KDHSNFLINDSAPAQASAVVFEDG-------TTDMDEYMTSFGPYPQLPN 550 R ++H I++ + V G +M+ + ++ P + Sbjct: 1123 RDGCSSSLEMEEHQPESIDNGYTSLRRDVPMGKGGLKDSILDVNMEPVIRNY-PADSAED 1181 Query: 549 DEDVNNFDIGANEQAFDPILKTSRMSSVILEGWYANNCSSSDWQHI 412 V F + N+ A T+R GWY ++C SSDW+H+ Sbjct: 1182 KSTVKEFRLFVNDDAKTQSSLTNRFGDQPHAGWY-DSC-SSDWEHV 1225 >ref|XP_006581690.1| PREDICTED: uncharacterized protein LOC100807937 isoform X2 [Glycine max] Length = 1067 Score = 145 bits (367), Expect = 8e-32 Identities = 182/679 (26%), Positives = 285/679 (41%), Gaps = 67/679 (9%) Frame = -1 Query: 2643 HSSIRIRKPSSMSTVASSCQLGEIGDSRLENKVSSEQMKISPQL--PHQLSLHGLNIEDS 2470 H +R +PSS + I D + V+ + S + PH ++ L + S Sbjct: 340 HMHLRRNEPSSSNKAM-------ISDKNVSRNVADYIFRESHEFQNPHA-NMDNLRLGLS 391 Query: 2469 DVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHNCKSMS 2290 +E V VEK+ G D 
NPAEDSPCWKGAS+ R S F S ++ + + KK + S+ Sbjct: 392 AIEDVNFVEKSFEGGDRCNPAEDSPCWKGASAARFSHFEPSAALSQEYVHKKESSFGSVI 451 Query: 2289 SQ------DLQNSSVVS--DAFGSKDFSSRKVNETTSFEKPPVNDCALNSNKDCGSANPS 2134 + D +N+ S ++ G + + + +S P + + C S + Sbjct: 452 KEPQNYLLDTENNMKKSCGNSNGFQMHTGIVYQDRSSAGSPRRFSVTKFAPEYCKSGSA- 510 Query: 2133 RFDLNTSDG-FQFAEDCYASLVECNMLNKSKADISEALPSEECENRTPVAEESLSGLPSS 1957 +DG FQ C L + + K K + +CE+ + L L Sbjct: 511 -----LNDGPFQSKPSCDFGLQQYVDITKMKENTVPPAKPTDCESGSSQMGLQLVDLKEF 565 Query: 1956 IHNNVSDLL--------PGNKASLKSDGEDSNSRIDINVLLRTMINLSETLRCYCFSNRA 1801 I LL P N A G+ S ++D+ +LL M NLSE L +C ++ Sbjct: 566 ITQKQQALLCTVLDATTPENSA-----GKASTEKLDVQMLLDRMQNLSELLLSHCLNDAC 620 Query: 1800 QVSEKQSLAVKQILCNLNASMLLMSGQTIATT---VLSQPD-------------KLFLK- 1672 + E+ +K ++ NLN L + IA + +QP+ LK Sbjct: 621 EWKEQDCNVLKNVISNLNTCAL--KNEQIAPVQECLFNQPETSKHAGESRKFRQNSCLKR 678 Query: 1671 --------EFPVVEMVNDTEALAA-SLKLGGTTEDFSEYISSSSKDDENSFQDVNMMEAV 1519 E +E N A A + G S+ IS D E + D NM + + Sbjct: 679 PQLTKIGPESSKIEFENPLVAEANFCFRSGKPHRKLSDSISPRV-DTEMTKAD-NMTKDL 736 Query: 1518 KKLISEDFQEED---VDQQKLLYKNLWLEAEASLCLMTARARFLRMKTEMKRDHDGIKEK 1348 K+++SE+F +D + Q +LYKNLWLEAEA+LC + RAR+ +MK EM D KEK Sbjct: 737 KRILSENFHGDDDEGAEPQTVLYKNLWLEAEATLCSVYYRARYNQMKIEM--DKHSYKEK 794 Query: 1347 DNPTSPNIPVDIHKDIELGNDNLHSPHLGEYLKPTTTSHIKVFHDPKSQVGRVFSESERS 1168 V I + + S Y P +++ +K P V + E R Sbjct: 795 VMEKQSKSEV-----IPTLSQSQSSATKVHYPNPDSSADLKF---PVLDVTNL-EELSRL 845 Query: 1167 TSPKNETSVSSLYVSGTKEN----------PCST--SHANDVDTRVMARYQVLKNRVGGP 1024 + +++ G +N PCS + ND + +MARYQVLK R+ Sbjct: 846 NISTDMNKSNAITPEGRGQNLDSFIDNYLVPCSVNKTERNDESSVMMARYQVLKARIDQS 905 Query: 1023 NTLNYESNQP-------SGEHKCPSSSFDVGNTSPASSTNIKSNDLQISDFGAYQILKGR 865 +T+ +P S + + ++ SP N S + + S + ILK R Sbjct: 906 STVTTNLEEPLDVADSSSPRGRDNQNQVNLCQDSPIPEKN--SAEYETSVLARFHILKSR 963 Query: 864 DDTSSCLNFEGQQLPGDEN 808 D+ SS ++ EG+QL GDE+ Sbjct: 964 DEGSSSISSEGKQLHGDES 982 >gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis] Length = 
1159 Score = 144 bits (364), Expect = 2e-31 Identities = 192/777 (24%), Positives = 320/777 (41%), Gaps = 71/777 (9%) Frame = -1 Query: 2529 KISPQ---LPHQLSLHGLNIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSP 2359 K PQ +PH+ G ++ + E + SVE +S D +N A DSPCWKG + R SP Sbjct: 442 KSGPQTSNVPHE----GFKLDLNTNENINSVEDSSENVDHYNHAVDSPCWKGVPATRSSP 497 Query: 2358 FGHS------ESVDSDALVK-----KIHNCKSMSSQDLQNSSVVSDAFGSKDFSSRKVNE 2212 F S + V S++ V+ +++ +SSQ +N +++ FGS + Sbjct: 498 FDASVPETKRQEVFSNSNVQTKQIFQLNTGDKVSSQK-RNDNMMCHEFGSPENGLEFPLN 556 Query: 2211 TTSFEKPPVNDCALNSNKDCGSANPSRFDLNTSDGFQFAEDCYASLVECNMLNKSKADIS 2032 T+ K +D + GS DL T G Q + D + E + +D+ Sbjct: 557 TSPAAKSTFSDRKSDDIVKIGS------DLETK-GIQHSNDIH----EHGSRSTGCSDLK 605 Query: 2031 EALPSEECENRTPVAEESLSG--------LPSSIHNNVSDLLPGNKASL-KSDGEDSNSR 1879 +L E+ R + E+++ LP + N +S + L KS+ S+ Sbjct: 606 SSLNGEQNIQRNGLISENINEALQCVSPRLPFPMENIISSSVEDASTKLNKSNEGPSSPT 665 Query: 1878 IDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQTIATTVL 1699 ID+ VL+ T+ NLSE L +C S Q+ +K ++ ++ NL+ S +T++T Sbjct: 666 IDVPVLVSTIRNLSELLLFHCTSGSYQLKQKDLETIQSMIDNLSVCASKNSEKTVSTQDS 725 Query: 1698 SQP-----------------DKLFLKEF--PVVEMVNDTEALAASLKLGGTTEDFSEYIS 1576 + +KL + + P+++++ D + E+ S Sbjct: 726 TSEKYTSDYLGDKNHKGFTLNKLQVTKTAGPILDLLADQNVHKGNKYYVAGKENDELLDS 785 Query: 1575 SSSKDDENSFQDVNMMEAVKKLISEDFQ-EEDVDQQKLLYKNLWLEAEASLCLMTARARF 1399 S + D + + ++A+KK+++++F EE+ Q LLYKNLWLEAEA+LC M+ +ARF Sbjct: 786 VSVRADVDIVDEDKAIQALKKVLTDNFDYEEEASPQALLYKNLWLEAEAALCSMSCKARF 845 Query: 1398 LRMKTEMKRDHDGIKEKDNPTSPNIPVDIHKDIELGNDNLHSPHLGEYLKPTTTSHIKVF 1219 R+K EM +NP P D H GN TT KV Sbjct: 846 NRVKLEM----------ENPKLPK-SKDAH-----GN-------------TITTEMDKVS 876 Query: 1218 HDPKSQVGRVFSESERSTSPKNETSVSSLYVSGTKENPCSTSHANDVDTRVMARYQVLKN 1039 SE + N S + + TK S N D VM R+Q+L+ Sbjct: 877 R----------SEVSPDLNGANTLSPKAKGCATTKSQESSVLSTNAEDDDVMDRFQILRC 926 Query: 1038 RVGGPN-TLNYESNQPSGEHKCPSS----------SFDVGNTSP--------ASSTNIKS 916 R N + + ++PS P S + + G++ P SST+ S Sbjct: 927 
RAKKSNYGIVADKDKPSSPKVSPHSNKVGKILPEANEETGSSKPDIRRQASSNSSTDKPS 986 Query: 915 NDLQISDFGAYQILKGRDDTSSCLNFEGQQLPGDENYLPGNEFTNG------DKSLQDSQ 754 ND + S + ILK R D S L+ +GQ + G++ G + +LQ Sbjct: 987 NDYEASVMARFHILKSRGDNCSPLSTQGQLAENVDGSTIGSKSEVGSSCVEPEPTLQHHD 1046 Query: 753 MSNSLGVTAESEASIMARF-RLIQSRKDH--SNFLINDSAPAQASAVVFEDGTTDMDEYM 583 ++ G E + + + QS + + N L+ +S D E Sbjct: 1047 ADSTEGQLTGGEFPMFIDYDSMSQSHRPNRRENSLLAGWFDRVSSEWEHVGNDADSTEGQ 1106 Query: 582 TSFGPYPQLPNDEDVNNFDIGANEQAFDPILKTSRMSSVILEGWYANNCSSSDWQHI 412 + G +P D + Q+ P +R + +L GW+ + SS+W+H+ Sbjct: 1107 LTGGEFPMF--------IDYDSMSQSHRP----NRRENSLLAGWF--DRVSSEWEHV 1149 >ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301835 [Fragaria vesca subsp. vesca] Length = 1218 Score = 139 bits (351), Expect = 6e-30 Identities = 204/839 (24%), Positives = 335/839 (39%), Gaps = 117/839 (13%) Frame = -1 Query: 2835 SNTSTFPKSITELPYGGTISPVGSTPSA--VKRNSMGLDAADQSNTTGGYNFTYQMESQM 2662 S + + P SI P GT S S P KR + G DAA+ + GGY + + Sbjct: 337 SISKSSPASIIRPPAIGTKS---SEPKMGLFKRLNSGRDAANADH--GGYYPSQESHLPQ 391 Query: 2661 LLGSK-----RHSSIRIRKPSSMSTVASSCQLGEI-GDSRLENKVSSEQMKISPQLPHQ- 2503 K I + + S +SS + + + + N K+ P LP+ Sbjct: 392 SFVDKVPFDSSQLGIHLGRIDPFSVESSSTKDTALPNNGSISNDPLDHLFKVKPGLPNSH 451 Query: 2502 LSLHGLNIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDAL 2323 + G + + +++ S +S D +NPA DSPCWKG +R SPF SE + + Sbjct: 452 VKPDGFDAAVNINDSINSFLNSSENVDPNNPAVDSPCWKGVRGSRFSPFKASEEGGPEKM 511 Query: 2322 VK---------------KIHNCKSMSSQD----------------------LQNSSVVSD 2254 K ++ C+++S+Q L+ SSV + Sbjct: 512 KKLEGCNGLNLNMPMIFSLNTCENISTQKPVEYNEFGWLGNGLLGNGLPLPLKKSSVENS 571 Query: 2253 AFGSKDFSSRKVNETT--SFEKPPVNDCALN---SNKDCGSANPSRFDLNTSDGFQFAED 2089 AFG K+++TT ++ + +D L+ + GS + S S + E Sbjct: 572 AFGE-----HKLDDTTKTTYYRESGHDRGLHGYINTPHSGSGDKSSSPFEHS--YIVQEG 624 Query: 2088 CYASLVECNMLNKS-------KADISEALPSEECENRTPVAEESLSGLPSSIHNNVSDLL 1930 C + N + K +I++ L EC + E+ PS V D Sbjct: 625 
CGEGGLTTESKNTTWSVGADVKLNINDTL---ECGSSHTSPIENTFCSPS-----VED-- 674 Query: 1929 PGNKASLKSDGEDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNL 1750 + S GE+SN +DI +L+ M +LSE L C ++ Q+ +K A+K ++ NL Sbjct: 675 -ADTKLTTSYGEESNMNMDIQMLVNKMNSLSEVLLVNCSNSSCQLKKKDIDALKAVINNL 733 Query: 1749 NASMLLMSGQTIA---------TTV-----LSQPDKLFLKEFPVVEMVNDTEALAASLKL 1612 N+ +L ++ +T+ L +P+K + P + + ++ L L Sbjct: 734 NSCILKHDEDFLSMPESPPIQQSTIKYIEELCKPNKALSPDMPQLTKIF-APSIQDPLHL 792 Query: 1611 GGT---------TEDFSEYISS-SSKDDENSFQDVNMMEAVKKLISEDFQEEDVDQQKLL 1462 G ++ E ISS S+K D + + M + +KK++SE+F +D Q LL Sbjct: 793 QGVQKVKNHDNLVKNDDEVISSVSAKSDIDFVKQEEMTQDIKKILSENFHTDDTHPQTLL 852 Query: 1461 YKNLWLEAEASLCLMTARARFLRMKTEMKRDHDGIKEKDNPTSPNIPVDIHKDIELGNDN 1282 YKNLWLEAEA +C +ARF R+KTEM++ D KD+ Sbjct: 853 YKNLWLEAEAVICSTNYKARFNRLKTEMEK---------------CKADQSKDV------ 891 Query: 1281 LHSPHLGEYLKPTTTSHIKVFHDPKSQVGRVFSESERSTSPKNETSVSSLYVSGTKENPC 1102 H + + + S + V +P V ++ SE + S PK S G Sbjct: 892 --FEHTAD-MMTQSRSEVCVNSNP---VEKLTSEVQGSPLPKLNLQESPTLTQG------ 939 Query: 1101 STSHANDVDTRVMARYQVLKNRVGGPNTLN--YESNQPSGEHKCPSSSFDVG-------- 952 D VMAR+ VL+NR+ +++N + S P +V Sbjct: 940 --------DDNVMARFHVLRNRIENLSSVNATFGDESSSTLSLVPDKVDEVAPEADARPS 991 Query: 951 -----NTSPASSTNIKSNDLQISDFGAYQILKGRDDTSSCLN------------------ 841 SP SS SND + S + I++ R + S ++ Sbjct: 992 PRISLQDSPTSSITGLSNDYEASVMARFHIIRDRVENSKFISDANVEDTASSKVSREHEA 1051 Query: 840 FEGQQLPGDENYLPGNEFTNGDKSLQDSQMSNS--LGVTAESEASIMARFRLIQSRKDH 670 EG D+ + + S+QD +S S G + E S++ARF +++SR D+ Sbjct: 1052 EEGACETSDDGPIQELNIQDYPGSVQDYPVSTSTTTGHAYQYEDSVLARFNILKSRVDN 1110 >ref|XP_003526770.2| PREDICTED: uncharacterized protein LOC100807937 isoform X1 [Glycine max] Length = 1097 Score = 139 bits (350), Expect = 8e-30 Identities = 185/707 (26%), Positives = 287/707 (40%), Gaps = 95/707 (13%) Frame = -1 Query: 2643 HSSIRIRKPSSMSTVASSCQLGEIGDSRLENKVSSEQMKISPQL--PHQLSLHGLNIEDS 2470 H +R +PSS + I D + V+ + S + PH ++ L + S Sbjct: 340 
HMHLRRNEPSSSNKAM-------ISDKNVSRNVADYIFRESHEFQNPHA-NMDNLRLGLS 391 Query: 2469 DVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHNCKSMS 2290 +E V VEK+ G D NPAEDSPCWKGAS+ R S F S ++ + + KK + S+ Sbjct: 392 AIEDVNFVEKSFEGGDRCNPAEDSPCWKGASAARFSHFEPSAALSQEYVHKKESSFGSVI 451 Query: 2289 SQ------DLQNS---------------SVV----SDAFGSKDFSSRKVNETTSFEKPPV 2185 + D +N+ +V S A + FS K + Sbjct: 452 KEPQNYLLDTENNMKKSCGNSNGFQMHTGIVYQDRSSAGSPRRFSVTKFAPEYCKSGSAL 511 Query: 2184 NDCALNSNKDCG------------------SANPSRFDLNTSD-GFQFAEDCYASLVECN 2062 ND S C A P+ + +S G Q D + + Sbjct: 512 NDGPFQSKPSCDFGLQQYVDITKMKENTVPPAKPTDCESGSSQMGLQLV-DLKEFITQKQ 570 Query: 2061 MLNKSKADISEALPSEEC-ENRTPVAEESLSGLPSSIHNNVSDLLPGNKASLKSDGEDSN 1885 D++ C E + E + LPSS+ + + P N A G+ S Sbjct: 571 QALLCTGDVNSGCNVNNCSEYDSSHTAEHVLPLPSSVLDATT---PENSA-----GKAST 622 Query: 1884 SRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQTIATT 1705 ++D+ +LL M NLSE L +C ++ + E+ +K ++ NLN L + IA Sbjct: 623 EKLDVQMLLDRMQNLSELLLSHCLNDACEWKEQDCNVLKNVISNLNTCAL--KNEQIAPV 680 Query: 1704 ---VLSQPD-------------KLFLK---------EFPVVEMVNDTEALAA-SLKLGGT 1603 + +QP+ LK E +E N A A + G Sbjct: 681 QECLFNQPETSKHAGESRKFRQNSCLKRPQLTKIGPESSKIEFENPLVAEANFCFRSGKP 740 Query: 1602 TEDFSEYISSSSKDDENSFQDVNMMEAVKKLISEDFQEED---VDQQKLLYKNLWLEAEA 1432 S+ IS D E + D NM + +K+++SE+F +D + Q +LYKNLWLEAEA Sbjct: 741 HRKLSDSISPRV-DTEMTKAD-NMTKDLKRILSENFHGDDDEGAEPQTVLYKNLWLEAEA 798 Query: 1431 SLCLMTARARFLRMKTEMKRDHDGIKEKDNPTSPNIPVDIHKDIELGNDNLHSPHLGEYL 1252 +LC + RAR+ +MK EM D KEK V I + + S Y Sbjct: 799 TLCSVYYRARYNQMKIEM--DKHSYKEKVMEKQSKSEV-----IPTLSQSQSSATKVHYP 851 Query: 1251 KPTTTSHIKVFHDPKSQVGRVFSESERSTSPKNETSVSSLYVSGTKEN----------PC 1102 P +++ +K P V + E R + +++ G +N PC Sbjct: 852 NPDSSADLKF---PVLDVTNL-EELSRLNISTDMNKSNAITPEGRGQNLDSFIDNYLVPC 907 Query: 1101 ST--SHANDVDTRVMARYQVLKNRVGGPNTLNYESNQP-------SGEHKCPSSSFDVGN 949 S + ND + +MARYQVLK R+ +T+ +P S + + ++ Sbjct: 908 SVNKTERNDESSVMMARYQVLKARIDQSSTVTTNLEEPLDVADSSSPRGRDNQNQVNLCQ 967 Query: 948 
TSPASSTNIKSNDLQISDFGAYQILKGRDDTSSCLNFEGQQLPGDEN 808 SP N S + + S + ILK RD+ SS ++ EG+QL GDE+ Sbjct: 968 DSPIPEKN--SAEYETSVLARFHILKSRDEGSSSISSEGKQLHGDES 1012 >ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543530|gb|ESR54508.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 1041 Score = 134 bits (337), Expect = 3e-28 Identities = 172/647 (26%), Positives = 262/647 (40%), Gaps = 100/647 (15%) Frame = -1 Query: 2460 AVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHNCKSMSS-Q 2284 A+ E +S D +NPA DSPCWKGA SP S V + KI C +S Sbjct: 399 AINCSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVESSGPVTLQH-INKIEACSGSNSIG 456 Query: 2283 DLQNSSVVS------------------DAFGSKDFSSRK--VNETTSFEKP--------- 2191 NS VS D S SSR + E +++ Sbjct: 457 PTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMK 516 Query: 2190 -------PVNDCALNSNKDCGSANPSRFDLNTSDGFQFA--EDCYASLVECNMLNKSKAD 2038 +DC +D AN N++D F+F VE + + K + Sbjct: 517 SSYGLGVQFSDCIDKPRQDYVHAN------NSADEFKFRPFHQVQYDSVENKLTFERKCE 570 Query: 2037 ISEALP---------SEECENRTPV-AEESLSGLPSSIHNNVSDLLPGNKASLKSDGEDS 1888 + + SE C + P+ A E + PSS+ + +P L GE Sbjct: 571 LGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSV-----EAVPARLNKLH--GEQL 623 Query: 1887 NSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT--I 1714 ++ + L+ TM NLSE L +C ++ + E A+K ++ NL+ + G I Sbjct: 624 APQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPI 683 Query: 1713 ATTVLSQPDKLFLKEFP------VVEMVNDTEALAASLK----------------LGGTT 1600 ++L+Q F++EFP V +T+A + L G + Sbjct: 684 QESLLTQKSSEFIREFPELHEGVTVSSPKETKAAFSVLNQPNYQHVQEQRSPDIAAGKKS 743 Query: 1599 EDFSEYISSSS--------------KDDENSFQDVNMMEAVKKLISEDF-QEEDVDQQKL 1465 E S++ S KDD +D NM +A+KK++S++F +EED Q L Sbjct: 744 EKCSDFTSQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVL 803 Query: 1464 LYKNLWLEAEASLCLMTARARFLRMKTEMKRDHDGIKEKDNPTSPNIPVDIHKDIE---- 1297 LY+NLWLEAEA+LC + +ARF RMK E++ + +K K N P + D +D+ Sbjct: 804 LYRNLWLEAEAALCSINYKARFNRMKIELE-NCKLLKAKVNKLPPQVKDDSTQDVSVHDF 862 Query: 1296 
-LGNDNLHSPHL---GEYLKPTTTSHIKVFHDPKSQVGRVFSESERSTSPKNETSVSSLY 1129 + N + H + + LK + +V E+ +P TS SL Sbjct: 863 PIANISSHPDDVVARSQILKCQESESHANQRPTADEVDNFLFEARNDQTP--PTSTCSL- 919 Query: 1128 VSGTKENPCSTSHANDVDTRVMARYQVLKNRVGGPNTLNY-ESNQPSGEHKCPSSSFDVG 952 N STS A+DV+ V+AR+ +LKNR+ + N + P K + Sbjct: 920 -----SNATSTSKADDVEASVIARFHILKNRIENSSCSNMGDQILPQVAFKLFENGTSDV 974 Query: 951 NTSP---ASSTNIKSNDLQISDFGAYQILKGRDDTSSCLNFEGQQLP 820 NT P +S+N + L + +F L S LN G QLP Sbjct: 975 NTGPELHRNSSNHMQDKLTVKEFH----LNDAVIQSPRLNKLGNQLP 1017 >ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa] gi|550326088|gb|EEE96055.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa] Length = 1227 Score = 134 bits (337), Expect = 3e-28 Identities = 193/810 (23%), Positives = 315/810 (38%), Gaps = 114/810 (14%) Frame = -1 Query: 2499 SLHGLNIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALV 2320 +L N+ EA SVE S D + PA DSPCWKGA + S F SE V+ Sbjct: 430 NLDFFNLAMDGHEAAGSVEITSESLDHYFPAVDSPCWKGAPVSLPSAFEGSEVVNPQ--- 486 Query: 2319 KKIHNCKSMSSQDLQNS-SVVSDAFGSKDFSSRKVN-------------ETTSFEKPPVN 2182 K+ C ++ Q Q S S +DA KD ++ N +SF++P V Sbjct: 487 NKVEACNGLNLQGPQISPSTTNDAV--KDCPEKQSNISMTFNNESLEHRPASSFKRPLVA 544 Query: 2181 DCALNSNKD-------CGSANPSRFDLNTSD------------GFQFAEDCYASLVECNM 2059 + D C + + SD F+ SL E Sbjct: 545 NVLFREGIDDAVKYGPCQRKSSYCNEAQISDVIDEPRKESILPDFKPVHTKQKSLEEGEW 604 Query: 2058 LNKSKADISEALPS-----EECENRTP--VAEESLSGLPSSIHNNVSDLLPGNKASLKSD 1900 +K +D++ ++C + P E L PSS H +S Sbjct: 605 PSKKNSDVAGVRRKINDNPDDCSSHVPYHAIEHVLCSPPSSEHAPAQHT--------QSQ 656 Query: 1899 GEDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQ 1720 +S+S++ L+ TM NLSE L Y ++ ++ ++ + ++ NL+ + S + Sbjct: 657 VGESSSKMHARTLVDTMHNLSELLLFYSSNDTCELKDEDFDVLNDVINNLDIFISKNSER 716 Query: 1719 TIAT--------TVLSQPDKLFLKEFPVVEMVNDTEALAASLKLGGTTEDFSEYISSSSK 1564 +T P KL +E + + + E S ++S Sbjct: 717 KNSTQESLIPRRATSQSPGKLSELYKGQLEFQHFEDEKECKIVSDERKEKLSNFVSMRGA 776 Query: 1563 
DDENSFQDVNMMEAVKKLISEDFQ-EEDVDQQKLLYKNLWLEAEASLCLMTARARFLRMK 1387 D + +D N+ +A+KK+++++F +E+ + Q LLYKNLWLEAEASLC++ RF R+K Sbjct: 777 TD--TVKDDNVTQAIKKVLAQNFPIKEESESQILLYKNLWLEAEASLCVVNCMDRFNRLK 834 Query: 1386 TEMKRDHDGIKEKDNPTSPNIPVDIHKDIELGNDNLHSPHLGEYLKPTTTSHIKVFHDPK 1207 E+++ G +K N S PV + + +NL P + + P V + P Sbjct: 835 IEIEK---GSSQKVNEFSSAAPVVPENSMIM--ENLLGPKVSSDILPAEDEGSPVHNVPD 889 Query: 1206 SQVGRVFSESE----------RSTSPKNETSVSSLYVSGTKENP--------------CS 1099 S + S S+ N + S++ +S K +P S Sbjct: 890 SSILSRNSHSDDVMARFHIIKSRVDDSNSLNTSAMDLSSPKVSPDLNKVDKFAHDTKDSS 949 Query: 1098 TSHANDVDT---------RVMARYQVLKNRVGGPNTLNYESNQPSGEHKCPSSSFDVGNT 946 SH + D+ VM R+ +LK RV +++N + V Sbjct: 950 KSHISFQDSIRGASSHADNVMDRFHILKCRVENSSSVNTATGGILASSMVSPDQNQVDKL 1009 Query: 945 SPASSTNIKSNDLQISDFGA-----------YQILKGRDDTSSCL------NFEGQQLPG 817 + + +I S +Q S + IL GRDD S+ + ++ Sbjct: 1010 AHDTKDSIMSYTIQDSPMSGRSSHADDVMTRFCILNGRDDNSNSVTISAVEKLSSSKVSS 1069 Query: 816 DENYL-----PGNEFTNGDKSLQDSQMSNSLGVTAESEASIMARFRLIQS----RKDHSN 664 D N + + D + QDS MS++ + EAS++ + R S ++H Sbjct: 1070 DLNKVSKLTDDTKDSIKADVTTQDSSMSSASSQAEDVEASVILKHRDGNSSSLDMEEHQR 1129 Query: 663 FLI-NDSAPAQASAVVFEDGTTD--MDEYMTSFGPYPQLPNDED---VNNFDIGANEQAF 502 I N A + +DGT D +D M P + + ED V F + N+ Sbjct: 1130 VSIDNGYMDLIRLARMNKDGTKDRTLDVNMEPLIPNFRADSTEDKPTVKEFRLFINDDVE 1189 Query: 501 DPILKTSRMSSVILEGWYANNCSSSDWQHI 412 T R GWY ++C SSDW+H+ Sbjct: 1190 TQSRLTDRFGDQSHAGWY-DSC-SSDWEHV 1217 >gb|EOY23722.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1017 Score = 133 bits (335), Expect = 4e-28 Identities = 181/734 (24%), Positives = 308/734 (41%), Gaps = 85/734 (11%) Frame = -1 Query: 2982 TSASVSKETPQLQEPVLKSISSFCGSQKGNVLSP-EGHFREPGFFMFGCASNTSTFPKSI 2806 +S+++S+ LQ P L ++ C + + +P E R+ G + + + P + Sbjct: 226 SSSAISEAN--LQAPPLNLVN--CKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVV 281 Query: 2805 TELPYGGTISPVGSTPSAVKRNSMGLDAADQSNTTGGYNFTYQMESQMLL--GSK----- 2647 P GT S ++ S K + G++A D +N G F + E + L GSK Sbjct: 282 
IRPPAVGTSSSASNSVS-FKNVNTGINATD-TNLAGNNRFIVE-EPRFLFNFGSKNEFDP 338 Query: 2646 -RHSSIR-----IRKPSSMSTVASSCQLGEIGDSRLENKVSSEQMKISPQLPHQLSLHGL 2485 +HS + + SS ST S + D+ K +ISP Sbjct: 339 IQHSFLLDGNCYMSGESSTSTEKLSTR-NMASDNFFGAKSGVNLSRISPD--------NF 389 Query: 2484 NIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHN 2305 ++ + EAV +VE + D +NP DSPCWKGA ++ SPFG SE V + L KK+ Sbjct: 390 SLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPV-AVQLAKKLEA 448 Query: 2304 C--------KSMSSQDLQNSSVVSDAFGSKDFSSRKVN----ETTSFEKPPVNDCALNSN 2161 C K +SS S G S N +S + PPV+ + + Sbjct: 449 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEH 508 Query: 2160 K--DCGSANPSRFDLNTSDGFQFAEDCYASLVECNMLNKSKADISEALPSEE-------- 2011 + + G A + +++ +F+++ + + +KS ++ +A + + Sbjct: 509 EPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRL 568 Query: 2010 -----CENRTPVAE-----ESLSGLPSS------------IHNNVSDLLPGNKASLKSDG 1897 C + T VA+ +SG SS ++V D+ + L G Sbjct: 569 ASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKHTKFL---G 625 Query: 1896 EDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT 1717 ++ S I+VL+ TM NLSE L +C + ++ E+ ++++++ NL+ M GQ Sbjct: 626 KEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQE 685 Query: 1716 IATTVLSQPDKLFLKEFPVVEMVNDTEALAASLKLGGTT--------------------- 1600 T+LS+ K++ FP+ + N E+L + L G +T Sbjct: 686 ---TLLSELHKVW---FPMSKK-NGQESLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKH 738 Query: 1599 -----EDFSEYISSSSKDDENSFQDVNMMEAVKKLISEDFQE-EDVDQQKLLYKNLWLEA 1438 E SE++S S D D M +A+KK++ E+F E E+ Q LLYKNLWLEA Sbjct: 739 FGKKDEKCSEFVSVRSGTDIKVKND-KMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEA 797 Query: 1437 EASLCLMTARARFLRMKTEMKRDHDGIKEKDNPTSPNIPVDIHKDIELGNDNLHSPHLGE 1258 EA+LC + AR+ MK E+++ +D KD L D + Sbjct: 798 EAALCSINYMARYNNMKIEIEK---------------CKLDTEKD--LSEDTPDEDKISR 840 Query: 1257 YLKPTTTSHIKVFHDPKSQVGRVFSESERSTSPKNETSVSSLYVSGTKENPCSTSHANDV 1078 ++S + + D ++ +S S+ ++ V P + H +DV Sbjct: 841 DADELSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPV-----------PGTACHTDDV 889 Query: 1077 DTRVMARYQVLKNR 
1036 + +M R +LK+R Sbjct: 890 EASIMTRLHILKSR 903 >gb|EOY23725.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 1059 Score = 131 bits (330), Expect = 2e-27 Identities = 181/747 (24%), Positives = 312/747 (41%), Gaps = 98/747 (13%) Frame = -1 Query: 2982 TSASVSKETPQLQEPVLKSISSFCGSQKGNVLSP-EGHFREPGFFMFGCASNTSTFPKSI 2806 +S+++S+ LQ P L ++ C + + +P E R+ G + + + P + Sbjct: 226 SSSAISEAN--LQAPPLNLVN--CKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVV 281 Query: 2805 TELPYGGTISPVGSTPSAVKRNSMGLDAADQSNTTGGYNFTYQMESQMLL--GSK----- 2647 P GT S ++ S K + G++A D +N G F + E + L GSK Sbjct: 282 IRPPAVGTSSSASNSVS-FKNVNTGINATD-TNLAGNNRFIVE-EPRFLFNFGSKNEFDP 338 Query: 2646 -RHSSIR-----IRKPSSMSTVASSCQLGEIGDSRLENKVSSEQMKISPQLPHQLSLHGL 2485 +HS + + SS ST S + D+ K +ISP Sbjct: 339 IQHSFLLDGNCYMSGESSTSTEKLSTR-NMASDNFFGAKSGVNLSRISPD--------NF 389 Query: 2484 NIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHN 2305 ++ + EAV +VE + D +NP DSPCWKGA ++ SPFG SE V + L KK+ Sbjct: 390 SLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPV-AVQLAKKLEA 448 Query: 2304 C--------KSMSSQDLQNSSVVSDAFGSKDFSSRKVN----ETTSFEKPPVNDCALNSN 2161 C K +SS S G S N +S + PPV+ + + Sbjct: 449 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEH 508 Query: 2160 K--DCGSANPSRFDLNTSDGFQFAEDCYASLVECNMLNKSKADISEALPSEE-------- 2011 + + G A + +++ +F+++ + + +KS ++ +A + + Sbjct: 509 EPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRL 568 Query: 2010 -----CENRTPVAE-----ESLSGLPSS------------IHNNVSDLLPGNKASLKSDG 1897 C + T VA+ +SG SS ++V D+ + L G Sbjct: 569 ASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKHTKFL---G 625 Query: 1896 EDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT 1717 ++ S I+VL+ TM NLSE L +C + ++ E+ ++++++ NL+ M GQ Sbjct: 626 KEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQE 685 Query: 1716 IATTVLSQPDKLFLKEFPVVEMVNDTEALAASLKLGGTTEDFSEYISSSSKDDENSFQDV 1537 + L + + +++++ + G E SE++S S D D Sbjct: 686 TLLSELHKGTSTGSPQVAAIDVLSQHTQVKRK-HFGKKDEKCSEFVSVRSGTDIKVKND- 743 Query: 1536 
NMMEAVKKLISEDFQE-EDVDQQKLLYKNLWLEAEASLCLMTARARFLRMKTEM---KRD 1369 M +A+KK++ E+F E E+ Q LLYKNLWLEAEA+LC + AR+ MK E+ K D Sbjct: 744 KMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYMARYNNMKIEIEKCKLD 803 Query: 1368 HDGIKEKDNPTSPNI-------PVDIHK----------DIELGNDNLHSPHLGEYLKPTT 1240 + +D P I +D +K +++ N N + T Sbjct: 804 TEKDLSEDTPDEDKISRSKLSADLDTNKKLTAIAESAPTLDVSNQNFPIASSSNHADDVT 863 Query: 1239 TSHIKVFHDPKSQVGRVFS----ESERSTSPK---------------NETSVSSLYVSGT 1117 FH K ++ +S +++ +S K ++S SSL + Sbjct: 864 AR----FHVLKHRLNNSYSVHTRDADELSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDS 919 Query: 1116 KENPCSTSHANDVDTRVMARYQVLKNR 1036 P + H +DV+ +M R +LK+R Sbjct: 920 PV-PGTACHTDDVEASIMTRLHILKSR 945 >ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus communis] gi|223539484|gb|EEF41073.1| hypothetical protein RCOM_0756330 [Ricinus communis] Length = 1125 Score = 131 bits (329), Expect = 2e-27 Identities = 171/641 (26%), Positives = 261/641 (40%), Gaps = 91/641 (14%) Frame = -1 Query: 2499 SLHGLNIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALV 2320 SL G+++ D + EA+ + ++ D +NPA DSPCWKGA + S SE+V + Sbjct: 433 SLGGISLVDKN-EAIDPAKNHTESLDHYNPAVDSPCWKGAPVSNFSQLEVSEAVTPQNM- 490 Query: 2319 KKIHNCKSMSSQDLQNSSVVSD---AFGSKDFSSRKV--------NETTSFEKPPVNDCA 2173 K + C + Q Q SV SD + S + + N + S K P+ D Sbjct: 491 KNLEACSGSNHQGYQTFSVSSDDAVKVSPEKTSEKSIQQKGWSLENYSASSMKRPLADNM 550 Query: 2172 LNSNKDCGSANPSRFDLNTSDGFQFAEDCYASLVECNMLNKSKADISEALPSEE---CE- 2005 L+ G + F N + F + + + + NKS D + LP E CE Sbjct: 551 LHRE---GIDHFVNFGANCTKPSLFHQ---VQISDDALPNKSFDDSNGKLPQNEKQSCES 604 Query: 2004 -------NRTPVA-------------EESLSGLP-SSIHNNVSDLLPGNKASLK---SDG 1897 N PV +E S +P ++ + +S + AS+K + G Sbjct: 605 GKWTTESNSAPVISVADVGMNMNDDPDECSSHVPFHAVEHVLSSPPSADSASIKLTKACG 664 Query: 1896 EDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASML------ 1735 S + I ++ TM NLSE L + ++ + E S A+K ++ NL ML Sbjct: 665 GVSTQKTYIRTVIDTMQNLSELLIFHLSNDLCDLKEDDSNALKGMISNLELCMLKNVERM 724 Query: 1734 
----------------------LMSGQTIATTVLSQPDKLFLKEFPVVEMVNDTEALAAS 1621 L G ++S+ D L + + V D +++ Sbjct: 725 TSTQESIIPERDGAQLSGKSSKLQKGTNGNGFLISRSDPLEFQYSVKYQHVQDEHNISS- 783 Query: 1620 LKLGGTTEDFSEYISSSSKDDENSFQDVNMMEAVKKLISEDFQ-EEDVDQQKLLYKNLWL 1444 G E S Y+S + D + M +A+K ++E+F EE+ + Q LLYKNLWL Sbjct: 784 ---GKNDETLSSYVSVRAAAD--MLKRDKMTQAIKNALTENFHGEEETEPQVLLYKNLWL 838 Query: 1443 EAEASLCLMTARARFLRMKTEMKRDHDGIKEKDNPTSPNIPVDIHKDIELGNDNLHSPHL 1264 EAEASLC + ARF R+K+EM++ EK N + N V + +L N+ S Sbjct: 839 EAEASLCYASCMARFNRIKSEMEKCD---SEKANGSPENCMV----EEKLSKSNIRS--- 888 Query: 1263 GEYLKPTTTSHIKVFHDPKSQVGRVFSESERSTSPKNETSVSSLYVSGTKENPCSTSHAN 1084 DP + G V + + + SP +TS+ + C++SHA+ Sbjct: 889 ----------------DPCT--GNVLASNTKG-SPLPDTSIPESSIL------CTSSHAD 923 Query: 1083 DVDTRVMARYQVLKNRVGGPNTLNYES--NQPSGEHKCPSSSF----------------- 961 D V ARY +LK RV N +N S K SS F Sbjct: 924 D----VTARYHILKYRVDSTNAVNTSSLDKMLGSADKLSSSQFSPCPNNVEKGVCEEKDG 979 Query: 960 ---DVG-NTSPASSTNIKSNDLQISDFGAYQILKGRDDTSS 850 D+ S S+T ND++ S + ILK RDD S Sbjct: 980 QKPDISIQDSLVSNTTSHLNDVEASVMARFHILKCRDDNFS 1020 >gb|EOY23723.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1068 Score = 130 bits (327), Expect = 4e-27 Identities = 191/773 (24%), Positives = 322/773 (41%), Gaps = 124/773 (16%) Frame = -1 Query: 2982 TSASVSKETPQLQEPVLKSISSFCGSQKGNVLSP-EGHFREPGFFMFGCASNTSTFPKSI 2806 +S+++S+ LQ P L ++ C + + +P E R+ G + + + P + Sbjct: 215 SSSAISEAN--LQAPPLNLVN--CKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVV 270 Query: 2805 TELPYGGTISPVGSTPSAVKRNSMGLDAADQSNTTGGYNFTYQMESQMLL--GSK----- 2647 P GT S ++ S K + G++A D +N G F + E + L GSK Sbjct: 271 IRPPAVGTSSSASNSVS-FKNVNTGINATD-TNLAGNNRFIVE-EPRFLFNFGSKNEFDP 327 Query: 2646 -RHSSIR-----IRKPSSMSTVASSCQLGEIGDSRLENKVSSEQMKISPQLPHQLSLHGL 2485 +HS + + SS ST S + D+ K +ISP Sbjct: 328 IQHSFLLDGNCYMSGESSTSTEKLSTR-NMASDNFFGAKSGVNLSRISPD--------NF 378 Query: 2484 NIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHN 2305 ++ + EAV +VE + D +NP DSPCWKGA ++ SPFG SE V + L KK+ Sbjct: 
379 SLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPV-AVQLAKKLEA 437 Query: 2304 C--------KSMSSQDLQNSSVVSDAFGSKDFSSRKVN----ETTSFEKPPVNDCALNSN 2161 C K +SS S G S N +S + PPV+ + + Sbjct: 438 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEH 497 Query: 2160 K--DCGSANPSRFDLNTSDGFQFAEDCYASLVECNMLNKSKADISEALPSEE-------- 2011 + + G A + +++ +F+++ + + +KS ++ +A + + Sbjct: 498 EPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRL 557 Query: 2010 -----CENRTPVAE-----ESLSGLPSS------------IHNNVSDLLPGNKASLKSDG 1897 C + T VA+ +SG SS ++V D+ + L G Sbjct: 558 ASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKHTKFL---G 614 Query: 1896 EDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT 1717 ++ S I+VL+ TM NLSE L +C + ++ E+ ++++++ NL+ M GQ Sbjct: 615 KEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQE 674 Query: 1716 IATTVLSQPDKLFLKEFPVVEMVNDTEALAASLKLGGTT--------------------- 1600 T+LS+ K++ FP+ + N E+L + L G +T Sbjct: 675 ---TLLSELHKVW---FPMSKK-NGQESLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKH 727 Query: 1599 -----EDFSEYISSSSKDDENSFQDVNMMEAVKKLISEDFQE-EDVDQQKLLYKNLWLEA 1438 E SE++S S D D M +A+KK++ E+F E E+ Q LLYKNLWLEA Sbjct: 728 FGKKDEKCSEFVSVRSGTDIKVKND-KMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEA 786 Query: 1437 EASLCLMTARARFLRMKTEM---KRDHDGIKEKDNPTSPNI-------PVDIHK------ 1306 EA+LC + AR+ MK E+ K D + +D P I +D +K Sbjct: 787 EAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPDEDKISRSKLSADLDTNKKLTAIA 846 Query: 1305 ----DIELGNDNLHSPHLGEYLKPTTTSHIKVFHDPKSQVGRVFS----ESERSTSPK-- 1156 +++ N N + T FH K ++ +S +++ +S K Sbjct: 847 ESAPTLDVSNQNFPIASSSNHADDVTAR----FHVLKHRLNNSYSVHTRDADELSSSKLS 902 Query: 1155 -------------NETSVSSLYVSGTKENPCSTSHANDVDTRVMARYQVLKNR 1036 ++S SSL + P + H +DV+ +M R +LK+R Sbjct: 903 LDSDAVDKLATEVKDSSTSSLQTQDSPV-PGTACHTDDVEASIMTRLHILKSR 954 >gb|EOY23721.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508776468|gb|EOY23724.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1079 Score = 130 bits (327), Expect = 4e-27 Identities = 191/773 (24%), 
Positives = 322/773 (41%), Gaps = 124/773 (16%) Frame = -1 Query: 2982 TSASVSKETPQLQEPVLKSISSFCGSQKGNVLSP-EGHFREPGFFMFGCASNTSTFPKSI 2806 +S+++S+ LQ P L ++ C + + +P E R+ G + + + P + Sbjct: 226 SSSAISEAN--LQAPPLNLVN--CKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVV 281 Query: 2805 TELPYGGTISPVGSTPSAVKRNSMGLDAADQSNTTGGYNFTYQMESQMLL--GSK----- 2647 P GT S ++ S K + G++A D +N G F + E + L GSK Sbjct: 282 IRPPAVGTSSSASNSVS-FKNVNTGINATD-TNLAGNNRFIVE-EPRFLFNFGSKNEFDP 338 Query: 2646 -RHSSIR-----IRKPSSMSTVASSCQLGEIGDSRLENKVSSEQMKISPQLPHQLSLHGL 2485 +HS + + SS ST S + D+ K +ISP Sbjct: 339 IQHSFLLDGNCYMSGESSTSTEKLSTR-NMASDNFFGAKSGVNLSRISPD--------NF 389 Query: 2484 NIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHN 2305 ++ + EAV +VE + D +NP DSPCWKGA ++ SPFG SE V + L KK+ Sbjct: 390 SLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPV-AVQLAKKLEA 448 Query: 2304 C--------KSMSSQDLQNSSVVSDAFGSKDFSSRKVN----ETTSFEKPPVNDCALNSN 2161 C K +SS S G S N +S + PPV+ + + Sbjct: 449 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEH 508 Query: 2160 K--DCGSANPSRFDLNTSDGFQFAEDCYASLVECNMLNKSKADISEALPSEE-------- 2011 + + G A + +++ +F+++ + + +KS ++ +A + + Sbjct: 509 EPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRL 568 Query: 2010 -----CENRTPVAE-----ESLSGLPSS------------IHNNVSDLLPGNKASLKSDG 1897 C + T VA+ +SG SS ++V D+ + L G Sbjct: 569 ASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKHTKFL---G 625 Query: 1896 EDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT 1717 ++ S I+VL+ TM NLSE L +C + ++ E+ ++++++ NL+ M GQ Sbjct: 626 KEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQE 685 Query: 1716 IATTVLSQPDKLFLKEFPVVEMVNDTEALAASLKLGGTT--------------------- 1600 T+LS+ K++ FP+ + N E+L + L G +T Sbjct: 686 ---TLLSELHKVW---FPMSKK-NGQESLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKH 738 Query: 1599 -----EDFSEYISSSSKDDENSFQDVNMMEAVKKLISEDFQE-EDVDQQKLLYKNLWLEA 1438 E SE++S S D D M +A+KK++ E+F E E+ Q LLYKNLWLEA Sbjct: 739 FGKKDEKCSEFVSVRSGTDIKVKND-KMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEA 797 Query: 1437 
EASLCLMTARARFLRMKTEM---KRDHDGIKEKDNPTSPNI-------PVDIHK------ 1306 EA+LC + AR+ MK E+ K D + +D P I +D +K Sbjct: 798 EAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPDEDKISRSKLSADLDTNKKLTAIA 857 Query: 1305 ----DIELGNDNLHSPHLGEYLKPTTTSHIKVFHDPKSQVGRVFS----ESERSTSPK-- 1156 +++ N N + T FH K ++ +S +++ +S K Sbjct: 858 ESAPTLDVSNQNFPIASSSNHADDVTAR----FHVLKHRLNNSYSVHTRDADELSSSKLS 913 Query: 1155 -------------NETSVSSLYVSGTKENPCSTSHANDVDTRVMARYQVLKNR 1036 ++S SSL + P + H +DV+ +M R +LK+R Sbjct: 914 LDSDAVDKLATEVKDSSTSSLQTQDSPV-PGTACHTDDVEASIMTRLHILKSR 965 >ref|XP_003523306.2| PREDICTED: uncharacterized protein LOC100778126 [Glycine max] Length = 1048 Score = 128 bits (322), Expect = 1e-26 Identities = 149/633 (23%), Positives = 255/633 (40%), Gaps = 71/633 (11%) Frame = -1 Query: 2499 SLHGLNIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFG----------H 2350 ++ L + + E V+K+ G D NPAEDSPCWKGAS+ R S F H Sbjct: 376 NVDNLRLRPNATEGANFVQKSFEGVDQCNPAEDSPCWKGASAARFSHFEPSAALPQEYVH 435 Query: 2349 SESVDSDALVKKIHNCKSMSSQDLQNSSVVSDAFGSKDFSSRKVNETTSFEKPP------ 2188 + + +++++ N + +++ S S+ + + ++ VN+ S P Sbjct: 436 KKEISFGSIIQEPQNILLDTENNMKKSGENSNGYQTH---TKIVNQERSSAGSPRKFSVT 492 Query: 2187 ------------VNDCALNSNKDCGSANPSRFDLNTSDGFQFAE---------DC----- 2086 VND S CG F L+ D + E DC Sbjct: 493 KFAPEYFKSGSAVNDGPFQSKPSCG------FGLHYLDITKMKENTVPPAKPTDCASGSS 546 Query: 2085 -----YASLVECNMLNKSKA-----DISEALPSEEC-ENRTPVAEESLSGLPSSIHNNVS 1939 + L E + K +A D+ C E + + E + PSS+ + + Sbjct: 547 QMGLQHVDLKEFIIFQKQQALVCTGDVDSGCNVNNCSEYSSSCSAEHVPPSPSSVVDTTT 606 Query: 1938 DLLPGNKASLKSDGEDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQIL 1759 P N A + S ++++ +LL T+ NLSE L +C ++ ++ E+ +K ++ Sbjct: 607 T--PENSAR-----KVSTEKLNVQMLLDTLQNLSELLLYHCLNDACELKERDCNILKNVI 659 Query: 1758 CNLNASMLLMSGQTIATTVLSQPDKLFLKEFPVVEMVNDTEALAASLKLGGTTEDFSEYI 1579 NLN L + Q ++ + F + + K G + +F + Sbjct: 660 SNLNTCALKNAEQ------IAPAQECFFNQ-------------PETSKSAGESREFHQNA 700 Query: 1578 SSSSKD--DENSFQDVNMMEAVKKLISEDFQEED--VDQQKLLYKNLWLEAEASLCLMTA 1411 S + NM + +K+++SE+F 
++D + Q +LYKNLWLEAEA+LC + Sbjct: 701 SFKRPQLTKTEMTKACNMTKDLKRILSENFHDDDEGAEPQTVLYKNLWLEAEAALCSVYY 760 Query: 1410 RARFLRMKTEMKRDHDGIKEKDNPTSPNIPVDIHKDIELGNDNLHSPHLG-------EYL 1252 +AR+ ++K EM + KE + + + + + +H P+ L Sbjct: 761 KARYNQIKIEMDKHSYQEKEMEKQSKSEVVPSLSQSQSFAT-KVHHPNPDSSAALKFRVL 819 Query: 1251 KPTTTSHIKVFHDPKSQVGRVFSESERSTSPKNETSVSSLYVSGTKENPCSTSHA--NDV 1078 T + + E ++ +++ +V PCS A ND Sbjct: 820 DATNLEELSCLNISTDMNKPNAMTPEGKGGQNLDSFINNYFV------PCSDDEAERND- 872 Query: 1077 DTRVMARYQVLKNRVGGPNTLNYESNQPSGEHKCP-----SSSFDVGNTSPASSTNIKSN 913 ++ VMARYQVLK RV + N E + P + ++ SP N Sbjct: 873 ESSVMARYQVLKARVDQSSIDNLEEPLDIADKSSPRGRDNQNQVNLSQDSPIPEKN--CT 930 Query: 912 DLQISDFGAYQILKGRDDTSSCLNFEGQQLPGD 814 D + S + ILK R + SS + EG+QL GD Sbjct: 931 DYETSVLARFHILKSRIEGSSSTS-EGKQLDGD 962 >ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] gi|557543533|gb|ESR54511.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 1064 Score = 125 bits (314), Expect = 1e-25 Identities = 170/667 (25%), Positives = 264/667 (39%), Gaps = 120/667 (17%) Frame = -1 Query: 2460 AVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHNCKSMSS-Q 2284 A+ E +S D +NPA DSPCWKGA SP S V + KI C +S Sbjct: 399 AINCSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVESSGPVTLQH-INKIEACSGSNSIG 456 Query: 2283 DLQNSSVVS------------------DAFGSKDFSSRK--VNETTSFEKP--------- 2191 NS VS D S SSR + E +++ Sbjct: 457 PTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMK 516 Query: 2190 -------PVNDCALNSNKDCGSANPSRFDLNTSDGFQFA--EDCYASLVECNMLNKSKAD 2038 +DC +D AN N++D F+F VE + + K + Sbjct: 517 SSYGLGVQFSDCIDKPRQDYVHAN------NSADEFKFRPFHQVQYDSVENKLTFERKCE 570 Query: 2037 ISEALP---------SEECENRTPV-AEESLSGLPSSIHNNVSDLLPGNKASLKSDGEDS 1888 + + SE C + P+ A E + PSS+ + +P L GE Sbjct: 571 LGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSV-----EAVPARLNKLH--GEQL 623 Query: 1887 NSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT--I 1714 ++ + L+ TM NLSE L +C ++ + E A+K ++ NL+ + G I Sbjct: 624 
APQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPI 683 Query: 1713 ATTVLSQPDKLFLKEFP------VVEMVNDTEALAASLK----------------LGGTT 1600 ++L+Q F++EFP V +T+A + L G + Sbjct: 684 QESLLTQKSSEFIREFPELHEGVTVSSPKETKAAFSVLNQPNYQHVQEQRSPDIAAGKKS 743 Query: 1599 EDFSEYISSSS--------------KDDENSFQDVNMMEAVKKLISEDF-QEEDVDQQKL 1465 E S++ S KDD +D NM +A+KK++S++F +EED Q L Sbjct: 744 EKCSDFTSQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVL 803 Query: 1464 LYKNLWLEAEASLCLMTARARFLRMKTEM--------KRDHDGIKEKDNPTSPNIPVDIH 1309 LY+NLWLEAEA+LC + +ARF RMK E+ K + E + + D+H Sbjct: 804 LYRNLWLEAEAALCSINYKARFNRMKIELENCKLLKAKDFSENTSELEKLSQTTFSPDLH 863 Query: 1308 K----DIELGNDNLHSPHLGEYLKPTTTSHIKVFHDPKSQVGR------VFSESERSTSP 1159 ++ +D+ + ++ +SH P V R SES + P Sbjct: 864 AVNKLPPQVKDDSTQDVSVHDFPIANISSH------PDDVVARSQILKCQESESHANQRP 917 Query: 1158 KNETSVSSLYVSGTKENP----------CSTSHANDVDTRVMARYQVLKNRVGGPNTLNY 1009 + + L+ + + P STS A+DV+ V+AR+ +LKNR+ + N Sbjct: 918 TADEVDNFLFEARNDQTPPTSTCSLSNATSTSKADDVEASVIARFHILKNRIENSSCSNM 977 Query: 1008 -ESNQPSGEHKCPSSSFDVGNTSP---ASSTNIKSNDLQISDFGAYQILKGRDDTSSCLN 841 + P K + NT P +S+N + L + +F L S LN Sbjct: 978 GDQILPQVAFKLFENGTSDVNTGPELHRNSSNHMQDKLTVKEFH----LNDAVIQSPRLN 1033 Query: 840 FEGQQLP 820 G QLP Sbjct: 1034 KLGNQLP 1040 >gb|EOY23726.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 827 Score = 124 bits (311), Expect = 3e-25 Identities = 162/622 (26%), Positives = 271/622 (43%), Gaps = 85/622 (13%) Frame = -1 Query: 2982 TSASVSKETPQLQEPVLKSISSFCGSQKGNVLSP-EGHFREPGFFMFGCASNTSTFPKSI 2806 +S+++S+ LQ P L ++ C + + +P E R+ G + + + P + Sbjct: 226 SSSAISEAN--LQAPPLNLVN--CKNNHVQISTPYEKPLRQHGTTLSDSIPSVKSSPGVV 281 Query: 2805 TELPYGGTISPVGSTPSAVKRNSMGLDAADQSNTTGGYNFTYQMESQMLL--GSK----- 2647 P GT S ++ S K + G++A D +N G F + E + L GSK Sbjct: 282 IRPPAVGTSSSASNSVS-FKNVNTGINATD-TNLAGNNRFIVE-EPRFLFNFGSKNEFDP 338 Query: 2646 -RHSSIR-----IRKPSSMSTVASSCQLGEIGDSRLENKVSSEQMKISPQLPHQLSLHGL 2485 +HS + + SS ST S + D+ K +ISP Sbjct: 339 
IQHSFLLDGNCYMSGESSTSTEKLSTR-NMASDNFFGAKSGVNLSRISPD--------NF 389 Query: 2484 NIEDSDVEAVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHN 2305 ++ + EAV +VE + D +NP DSPCWKGA ++ SPFG SE V + L KK+ Sbjct: 390 SLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPV-AVQLAKKLEA 448 Query: 2304 C--------KSMSSQDLQNSSVVSDAFGSKDFSSRKVN----ETTSFEKPPVNDCALNSN 2161 C K +SS S G S N +S + PPV+ + + Sbjct: 449 CDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEH 508 Query: 2160 K--DCGSANPSRFDLNTSDGFQFAEDCYASLVECNMLNKSKADISEALPSEE-------- 2011 + + G A + +++ +F+++ + + +KS ++ +A + + Sbjct: 509 EPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRL 568 Query: 2010 -----CENRTPVAE-----ESLSGLPSS------------IHNNVSDLLPGNKASLKSDG 1897 C + T VA+ +SG SS ++V D+ + L G Sbjct: 569 ASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTKHTKFL---G 625 Query: 1896 EDSNSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT 1717 ++ S I+VL+ TM NLSE L +C + ++ E+ ++++++ NL+ M GQ Sbjct: 626 KEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQE 685 Query: 1716 IATTVLSQPDKLFLKEFPVVEMVNDTEALAASLKLGGTT--------------------- 1600 T+LS+ K++ FP+ + N E+L + L G +T Sbjct: 686 ---TLLSELHKVW---FPMSKK-NGQESLLSELHKGTSTGSPQVAAIDVLSQHTQVKRKH 738 Query: 1599 -----EDFSEYISSSSKDDENSFQDVNMMEAVKKLISEDFQE-EDVDQQKLLYKNLWLEA 1438 E SE++S S D D M +A+KK++ E+F E E+ Q LLYKNLWLEA Sbjct: 739 FGKKDEKCSEFVSVRSGTDIKVKND-KMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEA 797 Query: 1437 EASLCLMTARARFLRMKTEMKR 1372 EA+LC + AR+ MK E+++ Sbjct: 798 EAALCSINYMARYNNMKIEIEK 819 >ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628429 [Citrus sinensis] Length = 1065 Score = 120 bits (301), Expect = 4e-24 Identities = 170/665 (25%), Positives = 259/665 (38%), Gaps = 118/665 (17%) Frame = -1 Query: 2460 AVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHNCKSMSS-Q 2284 A+ E +S D +NPA DSPCWKGA SP S V + KI C +S Sbjct: 400 AINCSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVESSGPVTLQH-INKIEACSGSNSFG 457 Query: 2283 
DLQNSSVVSDAFGSKDFS----------------SRKVNETTSFEKP----PVNDCALNS 2164 NS VS S D+S R FE+ + + Sbjct: 458 PTDNSGKVSPQKPS-DYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDHDLKTGSYQM 516 Query: 2163 NKDCGSA-------NPSRFDL----NTSDGFQFA--EDCYASLVECNMLNKSKADISEAL 2023 CG + R D N++D F+F VE + + K ++ + Sbjct: 517 KSSCGLGVQFSDYIDKPRQDYVHANNSADEFKFRPFHQVQYDTVENKLTFERKCELGSGV 576 Query: 2022 P---------SEECENRTPV-AEESLSGLPSSIHNNVSDLLPGNKASLKSDGEDSNSRID 1873 SE C + P+ A E + PSS+ + +P L GE ++ Sbjct: 577 ADVGLSINGTSEGCSSHVPLHATEHVLSSPSSV-----EAVPARLNKLH--GEQLAPQMC 629 Query: 1872 INVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT--IATTVL 1699 + L+ +M NLSE L +C ++ + E A+K ++ NL+ + G I ++L Sbjct: 630 VRTLISSMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPIQESLL 689 Query: 1698 SQPDKLFLKEFP------VVEMVNDTEALAASLK----------------LGGTTEDFSE 1585 +Q F++EFP V +T+A + L G E S+ Sbjct: 690 TQKSSEFIREFPELHEGVTVSSPQETKAAFSVLNQPNYQHVQEQRSPDIAAGKKIEKCSD 749 Query: 1584 YISSSS--------------KDDENSFQDVNMMEAVKKLISEDF-QEEDVDQQKLLYKNL 1450 + S KDD +D NM +A+KK++S++F +EED Q LLY+NL Sbjct: 750 FTSQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVKEEDEKLQVLLYRNL 809 Query: 1449 WLEAEASLCLMTARARFLRMKTEM--------KRDHDGIKEKDNPTSPNIPVDIH----- 1309 WLEAEA+LC + +ARF RMK E+ K + E + + D+H Sbjct: 810 WLEAEAALCAINYKARFNRMKIELENCKLLKAKDLSENTSELEKLSQTTFSPDLHAVNKL 869 Query: 1308 --------------KDIELGNDNLHSPHLGEYLKPTTTSHIKVFHDPK---SQVGRVFSE 1180 +D + N + H + + K + K +V E Sbjct: 870 PPQVKDDTTQDVSVRDFPIANSSSHPDDVVARFQILKCQESKSHANQKPTADEVDNFLFE 929 Query: 1179 SERSTSPKNETSVSSLYVSGTKENPCSTSHANDVDTRVMARYQVLKNRVGGPNTLNY-ES 1003 + +P TS SL N STS A+DV+ V+AR+ +LKNR+ + N + Sbjct: 930 ARNDQTP--PTSTCSL------SNATSTSKADDVEASVIARFHILKNRIENSSCSNMGDQ 981 Query: 1002 NQPSGEHKCPSSSFDVGNTSPA----SSTNIKSNDLQISDFGAYQILKGRDDTSSCLNFE 835 P K + NT P SST+++ + L + +F L S LN Sbjct: 982 ILPQVAFKLFENGTSDVNTGPELHRNSSTHMQ-DKLTVKEFH----LNDAVIQSPRLNKL 1036 Query: 834 GQQLP 820 G QLP Sbjct: 1037 GNQLP 1041 >ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] 
gi|557543534|gb|ESR54512.1| hypothetical protein CICLE_v10018632mg [Citrus clementina] Length = 842 Score = 108 bits (271), Expect = 1e-20 Identities = 123/450 (27%), Positives = 186/450 (41%), Gaps = 88/450 (19%) Frame = -1 Query: 2460 AVYSVEKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDALVKKIHNCKSMSS-Q 2284 A+ E +S D +NPA DSPCWKGA SP S V + KI C +S Sbjct: 399 AINCSEGSSESLDHYNPAVDSPCWKGAPDYH-SPVESSGPVTLQH-INKIEACSGSNSIG 456 Query: 2283 DLQNSSVVS------------------DAFGSKDFSSRK--VNETTSFEKP--------- 2191 NS VS D S SSR + E +++ Sbjct: 457 PTDNSGKVSPQKPSDYSFYQEHGYLENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMK 516 Query: 2190 -------PVNDCALNSNKDCGSANPSRFDLNTSDGFQFA--EDCYASLVECNMLNKSKAD 2038 +DC +D AN N++D F+F VE + + K + Sbjct: 517 SSYGLGVQFSDCIDKPRQDYVHAN------NSADEFKFRPFHQVQYDSVENKLTFERKCE 570 Query: 2037 ISEALP---------SEECENRTPV-AEESLSGLPSSIHNNVSDLLPGNKASLKSDGEDS 1888 + + SE C + P+ A E + PSS+ + +P L GE Sbjct: 571 LGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSV-----EAVPARLNKLH--GEQL 623 Query: 1887 NSRIDINVLLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQT--I 1714 ++ + L+ TM NLSE L +C ++ + E A+K ++ NL+ + G I Sbjct: 624 APQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPEAPI 683 Query: 1713 ATTVLSQPDKLFLKEFP------VVEMVNDTEALAASLK----------------LGGTT 1600 ++L+Q F++EFP V +T+A + L G + Sbjct: 684 QESLLTQKSSEFIREFPELHEGVTVSSPKETKAAFSVLNQPNYQHVQEQRSPDIAAGKKS 743 Query: 1599 EDFSEYISSSS--------------KDDENSFQDVNMMEAVKKLISEDF-QEEDVDQQKL 1465 E S++ S KDD +D NM +A+KK++S++F +EED Q L Sbjct: 744 EKCSDFTSQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVL 803 Query: 1464 LYKNLWLEAEASLCLMTARARFLRMKTEMK 1375 LY+NLWLEAEA+LC + +ARF RMK E++ Sbjct: 804 LYRNLWLEAEAALCSINYKARFNRMKIELE 833 >ref|XP_006347527.1| PREDICTED: uncharacterized protein LOC102592566 isoform X2 [Solanum tuberosum] Length = 1166 Score = 102 bits (255), Expect = 8e-19 Identities = 191/792 (24%), Positives = 316/792 (39%), Gaps = 113/792 (14%) Frame = -1 Query: 2445 EKNSAGSDSHNPAEDSPCWKGASSNRLSPFGHSESVDSDAL-VKKIHNCKSMSSQDLQNS 2269 EK S D HNP 
DSPCWKGA + R+S G S S L K+ S L Sbjct: 446 EKCSDALDLHNPNVDSPCWKGAPAFRIS-LGDSVDASSPCLFTSKVEFADFSQSNPLFPP 504 Query: 2268 SVVSDA-----FGSKDFSSRKVNETTSFEKPPVNDCALN-SNKDCGSANPSR-----FDL 2122 + S G ++ + V P V N + ++ + + ++ DL Sbjct: 505 AEYSGKTSLKKLGEENLHNHNVYAGNGLSVPSVGTGTNNYTTEELRTIDVTKETFVPMDL 564 Query: 2121 NTSDGF-QFAEDCYASLVECNMLNKSKADISEALPSE-ECENRTPVAEE-SLSG------ 1969 +++ G +F+ED LNK S SE +C+ + + S+ G Sbjct: 565 SSNGGIPKFSED----------LNKPSKGYSLPQYSENDCQLQYSWGKHLSVDGHQYGPK 614 Query: 1968 ---LPSS-IHNNVS--DLLPGNKASLKS---------DGED----------SNSRIDINV 1864 LP +H +S D L G +L + ED S+ ++D+ Sbjct: 615 KHNLPEGYMHTGLSLNDTLEGGVVALDAAENVLRSPASQEDAKQAQQYQMGSSPKLDVQT 674 Query: 1863 LLRTMINLSETLRCYCFSNRAQVSEKQSLAVKQILCNLNASMLLMSGQTIAT--TVLSQP 1690 L+ + NLSE L+ C +N + + +K + NL A + + I T T++SQ Sbjct: 675 LVHAIHNLSELLKSQCLANACLLEGQDIDTLKSAITNLGAC----TAKKIETKDTMVSQH 730 Query: 1689 D---------KLFL-KEFPVVEMVNDTEALAASLKLGGTTEDFSEYISSSSK-------- 1564 D + F+ E + + + + L T ED S+ ++ Sbjct: 731 DTFEKFEESRRSFMGTETGHPQFMEEVAWDSCGLDNQPTPEDKSKNNGKKTENSALLTPA 790 Query: 1563 DDENSFQDVNMMEAVKKLISEDF-QEEDVDQQKLLYKNLWLEAEASLCLMTARARFLRMK 1387 DD + +++A+KK+++E+F +E + Q LL+KNLWLEAEA LC ++ ++RF RMK Sbjct: 791 DDLGDSNEEQVVQAIKKVLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRMK 850 Query: 1386 TEMK--RDHDGIKEKDNPTSPNI----PVDIHKDIELGNDNLHSPHLGEYLKPTTTSHIK 1225 EM+ R E +N ++ I P K + + + + ++ + +S Sbjct: 851 IEMEKHRFSQVAPEAENDSASKITTQSPSTSSKSVHIDDSVMERFNILNRREEKLSSSFM 910 Query: 1224 VFHDPKSQVGRVFSESERSTS------PKNETSVSSLYVSGTKENPCSTSHANDVDTRVM 1063 + +VG S+SE S + K + SS ++ +E S ++D + VM Sbjct: 911 KEENDSVKVG---SDSEDSVTMRLNILRKQGNNSSSSFM---QEKKASDIVSSDTEDSVM 964 Query: 1062 ARYQVLKNRVGGPNTLNYESNQPS---GEHKCPSSSFDVGNTSPASSTNIKSNDLQISDF 892 R+ +L+ R E N S GE K ++ +ND + S Sbjct: 965 ERFNILRRR---------EDNLKSSFMGEKK---------------DQDVVANDAEDSVK 1000 Query: 891 GAYQILKGRDDTSSCLNFEGQQLPGDENYLPGNEFTNGDKSLQDSQMSNSLGVTAESEAS 712 IL+ R+D + + FT K D M VT ++E S Sbjct: 1001 
VRLNILRQREDNLN------------------SSFTEETK---DPDM-----VTNDAEDS 1034 Query: 711 IMARFRLIQSRKDHSNF-----------------------LIND--SAPAQASAVV---F 616 +MARF ++ R D+ N LIN S +A+ V+ F Sbjct: 1035 VMARFNVLTHRGDNLNSPFMEVKKDLDMVAAGSADMENHGLINGEVSGYQRANVVIEPYF 1094 Query: 615 EDGTTDMDEYMTSFGPYPQLPNDEDVNNFDIGANEQAFDPIL---KTSRMSSVILEGWYA 445 + + E SFG Y + + F + A DPI+ + +R+ + G Y Sbjct: 1095 YHHSINSSEGYNSFGSYADGSGYDSMKQFLLSV---ADDPIVHSNRKARLGNHHSSGLYD 1151 Query: 444 NNCSSSDWQHIS 409 N SSSDW+H++ Sbjct: 1152 N--SSSDWEHVA 1161