BLASTX nr result
ID: Angelica27_contig00012010
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00012010
         (2830 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

XP_017219078.1 PREDICTED: uncharacterized protein LOC108196342 [...  1171   0.0
CDP04052.1 unnamed protein product [Coffea canephora]                 608   0.0
XP_009765208.1 PREDICTED: uncharacterized protein LOC104216778 i...   598   0.0
XP_019248229.1 PREDICTED: uncharacterized protein LOC109227495 i...   585   0.0
XP_009603872.1 PREDICTED: uncharacterized protein LOC104098765 i...   583   0.0
XP_011090068.1 PREDICTED: uncharacterized protein LOC105170843 [...   571   0.0
XP_015077047.1 PREDICTED: uncharacterized protein LOC107020999 [...   568   0.0
XP_006350879.1 PREDICTED: uncharacterized protein LOC102602843 [...   566   0.0
XP_004242484.1 PREDICTED: uncharacterized protein LOC101246260 [...   565   0.0
XP_011080657.1 PREDICTED: uncharacterized protein LOC105163859 [...   562   0.0
XP_019178455.1 PREDICTED: uncharacterized protein LOC109173649 [...   551   e-179
XP_012841200.1 PREDICTED: uncharacterized protein LOC105961493 [...   545   e-178
XP_002518281.1 PREDICTED: uncharacterized protein LOC8258097 iso...   544   e-177
XP_016577610.1 PREDICTED: uncharacterized protein LOC107875414 [...   545   e-177
XP_007017068.2 PREDICTED: uncharacterized protein LOC18591082 is...   541   e-176
EOY34687.1 NT domain of poly(A) polymerase and terminal uridylyl...   538   e-174
APA20307.1 PAP/OAS1 substrate-binding domain superfamily protein...   536   e-174
XP_006371669.1 hypothetical protein POPTR_0019s14930g [Populus t...   535   e-174
XP_011041798.1 PREDICTED: uncharacterized protein LOC105137670 [...
533 e-173 EOY34688.1 NT domain of poly(A) polymerase and terminal uridylyl... 530 e-172 >XP_017219078.1 PREDICTED: uncharacterized protein LOC108196342 [Daucus carota subsp. sativus] XP_017219079.1 PREDICTED: uncharacterized protein LOC108196342 [Daucus carota subsp. sativus] KZM88454.1 hypothetical protein DCAR_025529 [Daucus carota subsp. sativus] Length = 779 Score = 1171 bits (3029), Expect = 0.0 Identities = 595/771 (77%), Positives = 644/771 (83%) Frame = -2 Query: 2517 VAADPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPY 2338 V ADPTAI EDVWETAE VAW EVL CIHPT DS+EKR+DVIDYV RLIRRS+GVEVFPY Sbjct: 18 VRADPTAISEDVWETAEAVAWHEVLECIHPTLDSQEKRRDVIDYVQRLIRRSLGVEVFPY 77 Query: 2337 GSVPLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVK 2158 GSVPLKTYLPDGDIDLTALSTPK DDSLCRDVLAVMH EQLNANAEFEVRDTQFIDAEVK Sbjct: 78 GSVPLKTYLPDGDIDLTALSTPKGDDSLCRDVLAVMHGEQLNANAEFEVRDTQFIDAEVK 137 Query: 2157 LVKCLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGA 1978 LVKCLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGK+HLFKHSIILIKAWCYYESRILGA Sbjct: 138 LVKCLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKEHLFKHSIILIKAWCYYESRILGA 197 Query: 1977 HHGLISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYL 1798 HHGLISTYALESLILYIFHMFNS+LNGPLAALHRFLVYYSKFDWENY ISLNGPVC S+L Sbjct: 198 HHGLISTYALESLILYIFHMFNSSLNGPLAALHRFLVYYSKFDWENYCISLNGPVCISFL 257 Query: 1797 PDLVVDNXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLG 1618 PDLVVDN EFLRNCVDMF+V PK+SESN RAF KKNLNIIDPLK NNNLG Sbjct: 258 PDLVVDNPGNDEEPMLSEEFLRNCVDMFSVLPKSSESNFRAFPKKNLNIIDPLKGNNNLG 317 Query: 1617 RSVNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHISGDE 1438 RSVNRGNYFRIRSAFKFGARKLGNIL+LPRDRLAEEI NFFENTLERHGR+NA IS DE Sbjct: 318 RSVNRGNYFRIRSAFKFGARKLGNILSLPRDRLAEEIMNFFENTLERHGRKNAPDISADE 377 Query: 1437 LARLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTVSSQMVVQTCY 1258 L RLP TVYS D++HLN+FNGN H M GLEDNIR ASRNEP Y+NSTVSSQMV Sbjct: 378 LERLPVTVYSNDEVHLNKFNGNLVHDMLGLEDNIRYASRNEPDRYSNSTVSSQMVQA--- 434 Query: 1257 
TPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNIDSFVTNFNAGSFGDLA 1078 +PEGT+ +EK+YD+LV DR L +NTN EP+FS PG DN DSF TNF AG FG+ A Sbjct: 435 SPEGTLG--GHYEKEYDELVTDRCLHLQNTNSEPNFSSPGRDNCDSFSTNFKAGCFGESA 492 Query: 1077 ISSPEDTFSDSISVDLRKKILDGNSDDTESLNLADLTGDYDSHIRSLLYGQCCHGIALSS 898 ISS ED SD SVD +KK LD N DDTE +NLADL+GDYDSHIRSLLYGQCCHG ALSS Sbjct: 493 ISSQEDILSDGPSVDFKKKTLDDNLDDTEGVNLADLSGDYDSHIRSLLYGQCCHGYALSS 552 Query: 897 PAKCSTLLSSPTQLQNKPWDTVRQNLPVIWKMNSSDAAFGHPPYAVDNSNPSTAGFALVE 718 PAKCSTLLS PT +Q+KPWDTVRQ P++W++NSSDAAF PPYAVDNSN ST GF LVE Sbjct: 553 PAKCSTLLSFPTHVQSKPWDTVRQYFPILWRLNSSDAAFVQPPYAVDNSNQSTDGFGLVE 612 Query: 717 RRKARGTGTYIPHLKRNSSRERPSQARRNQLQSNRCQVQKNTSNNIVDTTSPKTLDANSI 538 RR+ARGTGTYIPHL RNS PSQ RRNQLQ R QV T NIVD+TSP+TLDANS+ Sbjct: 613 RRRARGTGTYIPHLTRNSFNYPPSQVRRNQLQDTRGQVPNYTYYNIVDSTSPRTLDANSV 672 Query: 537 RINAFEGGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKWELKFGSFGSLAEGDCS 358 R N+FEGGD QD +QP+NS+GNGT+ WELKFGSFG LAEG S Sbjct: 673 RTNSFEGGDQM---SVKEQSSFEKQDSPNQPVNSYGNGTSRANWELKFGSFGPLAEGS-S 728 Query: 357 LGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLNLENDDDFPPLSS 205 LGLS+++AG SVSSP S A + S++V+GN RDAENFLNL+ND+DFPPLSS Sbjct: 729 LGLSESVAGVSVSSPVSSAVRNSEDVEGNKERDAENFLNLQNDEDFPPLSS 779 >CDP04052.1 unnamed protein product [Coffea canephora] Length = 862 Score = 608 bits (1569), Expect = 0.0 Identities = 376/849 (44%), Positives = 503/849 (59%), Gaps = 82/849 (9%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP++I ++ AE+ A EVL IHPT DSEEKRKDVIDYV R+IR S+G EVFPYGSV Sbjct: 40 DPSSIKQENLGVAEETAL-EVLNAIHPTMDSEEKRKDVIDYVQRIIRNSLGFEVFPYGSV 98 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLT S+P ++ DVL+++ E+ N NAE+EV+DTQFIDAEVKLVK Sbjct: 99 PLKTYLPDGDIDLTIFSSPYVEEFWASDVLSILQREEQNENAEYEVKDTQFIDAEVKLVK 158 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 CL NIVIDISFNQLGGL TLCFLEQVDRL+GKDHLFK SIILIKAWCYYESRILGAHHG Sbjct: 159 
CLAQNIVIDISFNQLGGLCTLCFLEQVDRLVGKDHLFKRSIILIKAWCYYESRILGAHHG 218 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+L+LYIFH+F+S+L+GP A L+RFL Y+SKFDWENY ISLNGPVC S LP++ Sbjct: 219 LISTYALETLVLYIFHLFHSSLHGPFAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPEI 278 Query: 1788 VVDN-XXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 V EFLRNC++MF+V + E+N R F+ K+LNIIDPLKE NNLGRS Sbjct: 279 VAQTPHNEGTELLLSDEFLRNCMEMFSVRSRDLETNSRLFLPKHLNIIDPLKEYNNLGRS 338 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGR------QNASHI 1450 V+RGN++RIRSAFK+GAR+L IL+LP+D +A+EI+ FF NTL+ HGR Q+ + + Sbjct: 339 VHRGNFYRIRSAFKYGARRLEQILSLPKDEIADEIQKFFGNTLQMHGRNCGSDTQDYALL 398 Query: 1449 SGDELARL----PFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSA-------------SR 1321 G+ + L P V S DD+ L + + +E + S + Sbjct: 399 FGEGSSTLYSSSPAAVLSEDDMLLRSSTSDLESDGLLMEGHYNSVQLYRCSSDLDSLQTM 458 Query: 1320 NEPGW--------------------------------------YANSTVS--------SQ 1279 +EPG+ Y N + S SQ Sbjct: 459 SEPGYSLDGTPVSRYLCNGDSCDLATHNSLDFRTSNVTSDYSPYINYSGSGFGQYHHLSQ 518 Query: 1278 MVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVE-PDFSLPGSDNIDSFVTNFN 1102 + + Y G S + + D+L +D+ L + +++ D S +D++D F + + Sbjct: 519 LYLPKSYAENGHCSQTYSSDGEEDELGLDQWLEQKVNHLDLVDTSQSCADSLDGFCSCSS 578 Query: 1101 AGSFGDLAISSPEDTFSDSISVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLLYGQ 925 A+SSP +++ D+R++ G+ D E LN LA+L+GDYDSHIRSLLYGQ Sbjct: 579 -------AVSSPRTNILENLLPDIRERD-SGSVVDAEPLNPLANLSGDYDSHIRSLLYGQ 630 Query: 924 CCHGIALSSPAKCSTLLSSPTQLQN-----KPWDTVRQNLPV----IWKMNSSDAAFGHP 772 CC+G ALS+P LS+P+ L++ KPWDTVR ++P+ +MNSS G P Sbjct: 631 CCNGFALSAPG-----LSNPSVLRSRFGNKKPWDTVRHSMPLKQGSFSQMNSSTTIVGSP 685 Query: 771 PYAVDNSNPSTAGFALVERRKARGTGTYIPHLKRNSSRERPSQAR-RNQLQSNRCQVQKN 595 + P + F E+ KARGTGTY+P +K SSR+RPSQ R R + Q +++ Sbjct: 686 ---ASSHAPPVSAFPSEEKHKARGTGTYLP-IKNCSSRDRPSQGRPRFKATGTPFQSERH 741 Query: 594 TSNNIVDTTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTI 415 T +N P ++ NS+ E Q LHS+ ++ NG + Sbjct: 742 
TQDN---GFVPAFMETNSLPKGGRELLHAWCPVQPRTKSGVSCQSLHSKAGDTCANGFSN 798 Query: 414 PKWELKFGSFGSLAEGDCSLGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLNLE 235 ++FGS G+LA+ + + ++ A+P ++ R A+ LE Sbjct: 799 SMRRIEFGSLGNLADDVILAPCNACLLSSNNQKNATP------DLTKEEARVADQLFGLE 852 Query: 234 NDDDFPPLS 208 N+D+FPPLS Sbjct: 853 NEDEFPPLS 861 >XP_009765208.1 PREDICTED: uncharacterized protein LOC104216778 isoform X1 [Nicotiana sylvestris] XP_016477613.1 PREDICTED: uncharacterized protein LOC107799056 isoform X1 [Nicotiana tabacum] Length = 849 Score = 598 bits (1542), Expect = 0.0 Identities = 380/846 (44%), Positives = 505/846 (59%), Gaps = 76/846 (8%) Frame = -2 Query: 2517 VAADPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPY 2338 +++DP + E+ W AE+ A +EV+ C+HPT D+EEKRKDVIDYV LI+ S+G EVFPY Sbjct: 19 LSSDPLTVTENCWAVAEE-ATEEVVNCLHPTLDTEEKRKDVIDYVQSLIKFSLGCEVFPY 77 Query: 2337 GSVPLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVK 2158 GSVPLKTYLPDGDIDLT S+ ++++L RDVLAV+ E+ +AE++VRDTQFIDAEVK Sbjct: 78 GSVPLKTYLPDGDIDLTVFSSSITEETLGRDVLAVLQEEEQKEDAEYDVRDTQFIDAEVK 137 Query: 2157 LVKCLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGA 1978 LVKCLV N VIDISFNQLGGL TLCFLEQVDRL+GK+HLFK SIILIKAWCYYESRILGA Sbjct: 138 LVKCLVQNTVIDISFNQLGGLCTLCFLEQVDRLVGKNHLFKRSIILIKAWCYYESRILGA 197 Query: 1977 HHGLISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYL 1798 HHGLISTYALE+L+L+IFH F+S+LNGPLA L+RFL YYSKFDW+NY ISLNGP+C S Sbjct: 198 HHGLISTYALETLVLFIFHQFHSSLNGPLAVLYRFLDYYSKFDWDNYCISLNGPICKS-S 256 Query: 1797 PDLVVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNL 1621 PD++V+ EFLRNC++MF+V + E + R F +K LNI+DPLKENNNL Sbjct: 257 PDILVEMPDHISNNLLLSEEFLRNCMEMFSVPSRGLEVDTRVFQQKYLNIVDPLKENNNL 316 Query: 1620 GRSVNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGR------QNA 1459 GRSVNRG++ RI+ AF++GARKLGNIL+LP DR+A+EIK FF NT+ER Q++ Sbjct: 317 GRSVNRGSFCRIQRAFRYGARKLGNILSLPSDRVADEIKKFFANTIERRRHNLLADLQDS 376 Query: 1458 SHISGDE---LARLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTV 1288 I GDE + P Y+ + + L +G+F++ 
L+ +S S NE + V Sbjct: 377 GLIYGDEDTCSSLSPAESYADNRMLLKSSDGDFEN--DSLKKVFKSMS-NELSSNLMNGV 433 Query: 1287 SSQMVVQTCYTPEGTV---SFVDRFEKDY---DDLVMDRA--------------LRARNT 1168 SS MV ++ P+ F+ E D+ D L +D A +R Sbjct: 434 SSDMVSESGSFPDDAALSGFFLSTDESDHSASDPLNLDVANGTYDCCFNGNSMNFLSRKH 493 Query: 1167 NVEPDF---------------SLPGSDNIDSFV------------------TNFNAGSF- 1090 +P F +L SD DS + T+++ + Sbjct: 494 YHDPPFHFNKSCVENGNSGPENLCQSDLSDSGLWVETRECALECSSIYQSGTDYSESVWS 553 Query: 1089 GDLAISSPEDTFSDSISVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLLYGQCCHG 913 G ISSP+ + S+S+D+ ++ L + D E+LN L DL+GDYDSHIRSLLYGQCC+G Sbjct: 554 GGSVISSPKTSILGSLSLDIGERDLASTAGDVEALNPLVDLSGDYDSHIRSLLYGQCCYG 613 Query: 912 IALSSPAKCSTLLSSPTQLQNKP-WDTVRQNLPV----IWKMNSSDAAFGHPPYAVDNSN 748 ALS+P S SSP+Q QNK WDTVRQ++P+ W+ N + G + DN Sbjct: 614 FALSAPVLNSP--SSPSQCQNKHFWDTVRQSMPLRQNSFWQTNVNGMLVGPAVHRPDNYL 671 Query: 747 PSTAGFALVERRKARGTGTYIPHLKRNSSRERPSQARRNQLQSNRCQVQKNTSNNIVDTT 568 PSTA ++ A+G GTY P+ + R R + N Q+ ++S + + Sbjct: 672 PSTATLGSEKKETAQGIGTYFPNSNYSFQEIRCKGRTRIKAPRNHGQLHMHSSTH-SNKW 730 Query: 567 SPKTLDAN-----SIRINAFEGG-DHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKW 406 P DAN ++ I+A + + +DLH N + T I Sbjct: 731 VPSLSDANRSEKCTVEISAVQSSVQVRGKFAVASQSDKFLKDLHD---NDFSHSTCI--- 784 Query: 405 ELKFGSFGSLAEGDCSLGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLNLENDD 226 ++FGS G+L+E S D + SV Q + V R AE+ L L+N+D Sbjct: 785 -IEFGSLGNLSEDALSCTSHDVLLIPSVPQKV----QLPESVCSKQERAAEHLLRLKNED 839 Query: 225 DFPPLS 208 +FPPLS Sbjct: 840 EFPPLS 845 >XP_019248229.1 PREDICTED: uncharacterized protein LOC109227495 isoform X1 [Nicotiana attenuata] OIT02880.1 hypothetical protein A4A49_29192 [Nicotiana attenuata] Length = 851 Score = 585 bits (1508), Expect = 0.0 Identities = 373/846 (44%), Positives = 488/846 (57%), Gaps = 76/846 (8%) Frame = -2 Query: 2517 VAADPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPY 2338 +++DP + E+ W AE+ A +EV+ C+HPT D+EEKRKDVIDYV RLI+ S+G EVFPY Sbjct: 19 LSSDPLTVTENCWAVAEE-ATEEVVNCVHPTLDTEEKRKDVIDYVQRLIKFSLGCEVFPY 77 Query: 2337 
GSVPLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVK 2158 GSVPLKTYLPDGDIDLT S+ ++++L RDVLAV+ E+ +AE++V+DTQFIDAEVK Sbjct: 78 GSVPLKTYLPDGDIDLTVFSSSIAEETLGRDVLAVLQEEEQKEDAEYDVKDTQFIDAEVK 137 Query: 2157 LVKCLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGA 1978 LVKCLV N VIDISFNQLGGL TLCFLEQVDRL+GK+HLFK SIILIKAWCYYESRILGA Sbjct: 138 LVKCLVQNTVIDISFNQLGGLCTLCFLEQVDRLVGKNHLFKRSIILIKAWCYYESRILGA 197 Query: 1977 HHGLISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYL 1798 HHGLISTYALE+L+L+IFH+F+S+LNGPLA L+RFL YYSKFDW+NY ISLNGPVC S Sbjct: 198 HHGLISTYALETLVLFIFHLFHSSLNGPLAVLYRFLDYYSKFDWDNYCISLNGPVCKS-S 256 Query: 1797 PDLVVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNL 1621 PD++V+ EFLRNCV+MF+V + E + R F +K LNI+DPLKENNNL Sbjct: 257 PDILVEMPDHISNNLLLSEEFLRNCVEMFSVPSRGLEGDTRVFQQKYLNIVDPLKENNNL 316 Query: 1620 GRSVNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGR------QNA 1459 GRSVNRG++ RI+ AF++GARKLGNIL+LP DR+A+EIK FF NT+ER Q++ Sbjct: 317 GRSVNRGSFCRIQRAFRYGARKLGNILSLPSDRVADEIKKFFANTIERRRHNLLADLQDS 376 Query: 1458 SHISGDELARLPFT-VYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTVSS 1282 I GDE + SY D + + + D L+ +S S NE + VSS Sbjct: 377 GLIFGDEDTCSSLSPAESYADNRMLLKSSDCDFENDSLKKVFKSIS-NELSSNLMNGVSS 435 Query: 1281 QMVVQTCYTPEGTV-------------SFVDRFEKDYDDLVMDRALRARNTNV--EPDFS 1147 +V ++ P+ S D D + D + N + Sbjct: 436 DLVSESGSFPDDAALSGFCLSTDASDHSASDPLNLDVANGTYDCCFNGNSMNFLSRKHYH 495 Query: 1146 LPGSDNIDSFVTNFNAG---------SFGDLAISSPE------------DTFSDSI---- 1042 P S V N N+G S L + + E +S+S+ Sbjct: 496 APPFHFNKSCVENGNSGPENLCQSDLSDSGLWVETRECALECSSIYQSGTDYSESVWSGG 555 Query: 1041 --------------SVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLLYGQCCHGIA 907 S+D+ ++ L + D E+LN L DL+GDYD HIRSLLYGQCC+G A Sbjct: 556 SVISSPKTSILESLSLDIAERDLASTAGDVEALNPLVDLSGDYDCHIRSLLYGQCCYGFA 615 Query: 906 LSSPAKCSTLLSSPTQLQNKP-WDTVRQNLPV----IWKMNSSDAAFGHPPYAVDNSNPS 742 LS+P S SSP+Q QNK WDTVRQ++P+ W+ N + G + +NS PS Sbjct: 616 LSAPVLNSPTPSSPSQCQNKHFWDTVRQSMPLRQNSFWQTNVNGMLVGPAVHRPNNSLPS 675 Query: 741 
TAGFALVERRKARGTGTYIPHLKRNSSRERPSQARRNQLQSNRCQVQKNT---SNNIVD- 574 TA ++ A+G GTY P+ + R R + N Q+ ++ SN V Sbjct: 676 TATLGSEKKETAQGIGTYFPNSNYSFQEIRCKGRTRIKAPRNHGQLHMHSGTHSNKWVPA 735 Query: 573 ----TTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKW 406 S K + S ++ +GG +D H+ SH + T Sbjct: 736 LSDANRSEKCTEEISAMQSSVQGGG---KFSVASESDKLLKDFHANDF-SHSSCT----- 786 Query: 405 ELKFGSFGSLAEGDCSLGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLNLENDD 226 ++FGS G+L+E S D + SV Q + V R AE L L+N+D Sbjct: 787 -IEFGSLGNLSEDALSCTSHDVLLIPSVPQKV----QLPESVCSKQERAAEYSLRLKNED 841 Query: 225 DFPPLS 208 +FPPLS Sbjct: 842 EFPPLS 847 >XP_009603872.1 PREDICTED: uncharacterized protein LOC104098765 isoform X1 [Nicotiana tomentosiformis] Length = 848 Score = 583 bits (1503), Expect = 0.0 Identities = 368/843 (43%), Positives = 492/843 (58%), Gaps = 73/843 (8%) Frame = -2 Query: 2517 VAADPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPY 2338 ++ DP + ED AE+ A +EV+ C+HPT D+EEKRK VIDYV RLI+ S+G EVFPY Sbjct: 19 LSPDPLTVTEDCLAVAEE-ATEEVVNCVHPTLDTEEKRKAVIDYVQRLIKFSLGCEVFPY 77 Query: 2337 GSVPLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVK 2158 GSVPLKTYLPDGDIDLT S+ ++++ RDVLAV+ E+ AE++V+D QFIDAEVK Sbjct: 78 GSVPLKTYLPDGDIDLTVFSSSITEETFGRDVLAVLQEEEQKEVAEYDVKDPQFIDAEVK 137 Query: 2157 LVKCLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGA 1978 LVKCLV N VIDISFNQLGGL TLCFLEQVDRL+GK+HLFK SIILIKAWCYYESRILGA Sbjct: 138 LVKCLVQNTVIDISFNQLGGLCTLCFLEQVDRLVGKNHLFKRSIILIKAWCYYESRILGA 197 Query: 1977 HHGLISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYL 1798 HHGLISTYALE+L+L+IFH+F+S+LNGPLA L+RFL YYSKFDW+NY ISLNGPVC S Sbjct: 198 HHGLISTYALETLVLFIFHLFHSSLNGPLAVLYRFLDYYSKFDWDNYCISLNGPVCKS-S 256 Query: 1797 PDLVVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNL 1621 PD++V+ FLRNC++MF+V + E + R F +K LNI+DPLKENNNL Sbjct: 257 PDILVEMPDHISNNLLLSEGFLRNCMEMFSVPSRGLEVDTRVFQQKYLNIVDPLKENNNL 316 Query: 1620 GRSVNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLER------HGRQNA 1459 GRSVNRG++ RI+ AF++GARKLGNIL+LP 
DR+A+EIK FF NT+ER Q++ Sbjct: 317 GRSVNRGSFCRIQRAFRYGARKLGNILSLPSDRVADEIKKFFANTIERCRHNLLADLQDS 376 Query: 1458 SHISGDE---LARLPFTVYSYDDLHLNQFNGNFDHH-------------MSGLEDNIRSA 1327 I GDE + P YS + + L +G+F++ S L + + S Sbjct: 377 CLIFGDEDTCSSLSPAESYSDNRMLLKSSDGDFENDSLKKVFKSISNELSSNLMNGVSSD 436 Query: 1326 SRNEPGWYANSTVSSQMVVQT---------------------CYTPEGTVSFVDR----- 1225 +E G + + S + T C +++F+ R Sbjct: 437 MVSESGSFPDDAALSGFCLSTDASDHSASDPLNLDVANGTYDCCFNGNSMNFLSRKHYHD 496 Query: 1224 ----FEKDY-------DDLVMDRALRARNTNVEP-DFSLPGSDNIDSFVTNFNAGSFGDL 1081 F K Y + + L VE +F+L S S + G Sbjct: 497 PPFHFNKSYVENGNSGPENLCQSDLSDYGLWVETREFALECSSIYQSGTDYSESVWSGGS 556 Query: 1080 AISSPEDTFSDSISVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLLYGQCCHGIAL 904 ISSP+ + +S+ +D+ ++ L + D E+LN L DL+GDYDSHIRSLLYGQCC+G +L Sbjct: 557 VISSPKTSILESLFLDIGERDLASTAGDVEALNPLVDLSGDYDSHIRSLLYGQCCYGFSL 616 Query: 903 SSPAKCSTLLSSPTQLQNKP-WDTVRQNLPV----IWKMNSSDAAFGHPPYAVDNSNPST 739 S+P S SSP+Q QNK WDTVRQ++P+ W+ N + G + DNS PST Sbjct: 617 SAPVLNSP--SSPSQCQNKHFWDTVRQSMPLRQNSFWQTNVNGMLVGPAVHRPDNSLPST 674 Query: 738 AGFALVERRKARGTGTYIPHLKRNSSRERPSQARRNQLQSNRCQVQKNTSNNIVDTTSPK 559 A + A+G GTY P+ + R R + N CQ+ ++ + + P Sbjct: 675 APLGSEKEETAQGIGTYFPNSNYSFQEIRCKGRTRIKAPRNHCQLHMHSGTH-SNKWVPA 733 Query: 558 TLDAN-----SIRINAFE-GGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKWELK 397 DAN ++ I+A + + +D H+ N + + I ++ Sbjct: 734 LSDANRSEKCTVEISAVQSSAQGRGKFAVASQSDKLLKDFHA---NDFSHSSCI----IE 786 Query: 396 FGSFGSLAEGDCSLGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLNLENDDDFP 217 FGS G+L+E S D + +S P Q + V R AE+ L L+N+D+FP Sbjct: 787 FGSLGNLSEDALSCTSHDVLL---ISVPQKV--QLPESVCSKQERAAEHPLRLKNEDEFP 841 Query: 216 PLS 208 PLS Sbjct: 842 PLS 844 >XP_011090068.1 PREDICTED: uncharacterized protein LOC105170843 [Sesamum indicum] Length = 824 Score = 571 bits (1472), Expect = 0.0 Identities = 343/762 (45%), Positives = 446/762 (58%), Gaps = 49/762 (6%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP +I E+ AE+ A ++VL C+HP 
DSEEKR+DVIDYV L++ + EV YGSV Sbjct: 37 DPASISEECLCAAEEAA-EQVLNCVHPRLDSEEKRRDVIDYVQLLVKSHLNCEVVSYGSV 95 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLT L PK+D+ L DVLA++ E+ N NAEFEV+DTQFIDAEVKLVK Sbjct: 96 PLKTYLPDGDIDLTILKGPKADECLAHDVLALLQGEEKNENAEFEVKDTQFIDAEVKLVK 155 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 CLV NIVIDISFNQLGGLSTLCFLEQVDRL+G++HLFK SIIL+K WCYYESRILGAHHG Sbjct: 156 CLVRNIVIDISFNQLGGLSTLCFLEQVDRLVGRNHLFKRSIILVKTWCYYESRILGAHHG 215 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+LILYIF++F+S+L+GPLA LHRFL YYS+FDWENY IS+ GP+C LPD+ Sbjct: 216 LISTYALETLILYIFNLFHSSLSGPLAVLHRFLDYYSQFDWENYCISVKGPICKLSLPDI 275 Query: 1788 VVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 VV FL +C++MF+VS + E+ +AF K+LNI+DPLKENNNLGRS Sbjct: 276 VVKMPESERKDLLLSEAFLEDCMEMFSVSSRGLEAQPKAFQTKHLNIVDPLKENNNLGRS 335 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHI------ 1450 V+RGN++RIRSAFK+GARKLG IL PRD++A+EI FF NT RH Q+ S I Sbjct: 336 VHRGNFYRIRSAFKYGARKLGQILLRPRDKVADEIHKFFANTRARHENQHRSSIRRLALE 395 Query: 1449 -----SGDELARLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTVS 1285 S P ++S DDL L +FD++ L +N+ Y+ VS Sbjct: 396 FDDEESLTSSLSSPVELFSDDDLLLQSSVSDFDNYSVDLGQRPILEFKNDMDGYSIIEVS 455 Query: 1284 SQMVVQTCYTPEGTVSFVDRFEKDYDDL----------VMD--------RALRARNTNVE 1159 S+ + CY+ +G D D L MD +L + + + Sbjct: 456 SETASEACYSSDGIFISGHSITGDKDSLATPNSGWRNGTMDYISSSNYSASLSENHCHKQ 515 Query: 1158 PDFSLPGSDNIDSFVTNFNAGSFGDLAISSP--------EDTFSDSISVDLRKKILDGNS 1003 DFS S + A G LA +S E+ + +S+D R+ Sbjct: 516 HDFSSKSSTENEI----LKAHQIG-LACASEKFGLKSWLENKGLEDLSLDFRETDSMSAG 570 Query: 1002 DDTESLN-LADLTGDYDSHIRSLLYGQCCHGIALSSPAKCSTLLSSPTQLQNKPWDTVRQ 826 ++E+ N LADLTGDYD+HIRSLL GQ CH +S C+ S+ KPWD VRQ Sbjct: 571 GESEAFNPLADLTGDYDNHIRSLLRGQLCHAFTISEHVVCNPASSTSAIHTKKPWDIVRQ 630 Query: 825 NLPV----IWKMNSSDAAFGHPPYAVDNSNPSTAGFALVERRKARGTGTYIPHLKRNSSR 658 ++P+ +MNS + H + +S F 
+KARGTGTY PH+ Sbjct: 631 SMPLRENKFCQMNSHTISTEHKMHPGFDSGLLGTAFQFEANQKARGTGTYFPHV-NVLYM 689 Query: 657 ERPSQAR-RNQLQSNRCQVQKNTSNNIVDTTSPKTLDANSIRINAFEGGDHQLXXXXXXX 481 +RPSQ + RN+ N + + + N + S ++ N+ E G H++ Sbjct: 690 DRPSQMKGRNKAPGNHNRHHRYSQINGLYPASSQS--------NSSENGSHEICPVRSRS 741 Query: 480 XXXXXQDLHSQPI-----NSHGNGTTIPKWELKFGSFGSLAE 370 D+ Q +H NG ++FGS G+LAE Sbjct: 742 QGQRRSDIKCQSPRLVGGRNHTNGYLSGSCRIEFGSIGNLAE 783 >XP_015077047.1 PREDICTED: uncharacterized protein LOC107020999 [Solanum pennellii] Length = 843 Score = 568 bits (1465), Expect = 0.0 Identities = 364/845 (43%), Positives = 489/845 (57%), Gaps = 79/845 (9%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP+A+ ED W AE+ A EV+ C+HPT D+EEKRKDV+DYV RLIR S+G EVF YGSV Sbjct: 23 DPSAVTEDCWAVAEE-AVQEVVNCVHPTLDTEEKRKDVVDYVQRLIRCSLGCEVFSYGSV 81 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLT +P +++L RDVLAV+ E+L N E++V+D QFIDAEVKLVK Sbjct: 82 PLKTYLPDGDIDLTVFGSPVVEETLARDVLAVLQEEELKGNTEYDVKDPQFIDAEVKLVK 141 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 C+V N VIDISFNQLGGLSTLCFLEQVDRL+GK+HLFK SIILIKAWCYYESR+LGAHHG Sbjct: 142 CIVRNTVIDISFNQLGGLSTLCFLEQVDRLVGKNHLFKRSIILIKAWCYYESRVLGAHHG 201 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+L+L+IF +F+S+LNGPLA L+RFL YYSKFDW+NY ISLNGPVC S LP+L Sbjct: 202 LISTYALETLVLFIFQLFHSSLNGPLAVLYRFLDYYSKFDWDNYCISLNGPVCKSSLPEL 261 Query: 1788 VVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 V+ EFLRN +MF+V + ES+ R F +K LNIIDPLKENNNLGRS Sbjct: 262 FVEMPDYISNELLLSEEFLRNSAEMFSVPSRGLESDTRPFQQKYLNIIDPLKENNNLGRS 321 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERH------GRQNASHI 1450 V++GN +RI+ AFK+GARKLG+IL P D++A+EIK FF NT+ERH Q ++ I Sbjct: 322 VSKGNLYRIQRAFKYGARKLGDILLSPYDKVADEIKKFFANTIERHRLNLVAELQYSNLI 381 Query: 1449 SGDE---LARLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTV--- 1288 GDE + P Y+ + L +G+F++ 
D+++ A + +S + Sbjct: 382 FGDEDTCSSLSPAEFYANARMLLKSSDGDFEN------DSLKKAYTSISNELLSSLMNGA 435 Query: 1287 SSQMVVQT-CYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNID---- 1123 SS+MV +T ++ + VS ++ D L L +N D S G+ Sbjct: 436 SSEMVSETGSFSDDALVSGFCQYRYANDPLA-SVPLNLGVSNGSYDCSSNGNSMSSLSWK 494 Query: 1122 -----------SFVTNFNAG--------SFGDLAISSP-----------------EDTFS 1051 S V N N+G S L + +P ED +S Sbjct: 495 HYYAPPFYFNKSSVENGNSGPELCQSDLSGSCLGVETPECPQESSSIYKAGTDCSEDFWS 554 Query: 1050 -------------DSISVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLLYGQCCHG 913 +S+++D+ ++ L + D E++N L DL+GDYDSHIRSLLYGQCC+G Sbjct: 555 GGSEISSPRTSVLESVTLDIGERDLASTAGDIEAINPLVDLSGDYDSHIRSLLYGQCCYG 614 Query: 912 IALSSPAKCSTLLSSPTQLQNKP-WDTVRQNLPV----IWKMNSSDAAFGHPPYAVD-NS 751 LS+P S SSP+ QNK WDTVRQ++P+ W+ N + P N+ Sbjct: 615 CYLSAPVLNSP--SSPSPSQNKNFWDTVRQSIPLRKNSFWQTNGNGMLVVEPAVRPSGNA 672 Query: 750 NPSTAGFALVERRKARGTGTYIPHLKRNSSRERPSQARRNQLQSNRCQVQKNTSNNIVDT 571 S A ++ A+GTG Y P K +ER +++ + Q ++ + + Sbjct: 673 LSSDATLRSGKKEMAQGTGIYFP--KTEYQQERRKGRTKSKALGSHGQFHLHSGTHSYEV 730 Query: 570 TSPKTLDANSI-RINAFEGGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKWELKF 394 + I + + GG +L SH N + ++F Sbjct: 731 AFSDANHSEEISAVKSSVGGREKLASSSQSGGLLE---------ESHANAFSNSSCRIEF 781 Query: 393 GSFGSLAEGDCSLGLSDTIAGTS---VSSPASPAG-QTSKEVKGNHGRDAENFLNLENDD 226 GS G+L+E D ++ TS + P++P Q + GRDAE+ L L+N+D Sbjct: 782 GSLGNLSE--------DVLSHTSRDVILIPSAPQKVQLPEPACSKQGRDAEHSLRLKNED 833 Query: 225 DFPPL 211 +FPPL Sbjct: 834 EFPPL 838 >XP_006350879.1 PREDICTED: uncharacterized protein LOC102602843 [Solanum tuberosum] Length = 844 Score = 567 bits (1460), Expect = 0.0 Identities = 359/836 (42%), Positives = 478/836 (57%), Gaps = 70/836 (8%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP+A+ ED W AE+ A EV+ C+HPT D+EEKRKDV+DYV RLIR ++G EVF YGSV Sbjct: 23 DPSAVTEDSWAVAEE-AVQEVVNCVHPTLDTEEKRKDVVDYVQRLIRCTLGCEVFSYGSV 81 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLT +P +++L RDVLAV+ 
E+L N E++V+D QFIDAEVKLVK Sbjct: 82 PLKTYLPDGDIDLTVFGSPVIEETLARDVLAVLQEEELKENTEYDVKDPQFIDAEVKLVK 141 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 C+V N VIDISFNQLGGLSTLCFLEQVDRL+GK+HLFK SIILIKAWCYYESR+LGAHHG Sbjct: 142 CIVRNTVIDISFNQLGGLSTLCFLEQVDRLVGKNHLFKRSIILIKAWCYYESRVLGAHHG 201 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+L+L+IF +F+S+LNGPLA L+RFL YYSKFDW+ Y ISLNGPVC S LP+L Sbjct: 202 LISTYALETLVLFIFQLFHSSLNGPLAVLYRFLDYYSKFDWDKYCISLNGPVCKSSLPEL 261 Query: 1788 VVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 V+ EFLRN +MF+V + ES+ R F +K LNIIDPLKENNNLGRS Sbjct: 262 FVEMPDYISNELLLSEEFLRNSAEMFSVPSRGLESDTRPFQQKYLNIIDPLKENNNLGRS 321 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERH------GRQNASHI 1450 V++GN +RI+ AFK+GARKLG+IL P D++A+EIK FF NT+ERH Q +S I Sbjct: 322 VSKGNLYRIQRAFKYGARKLGDILLSPDDKVADEIKKFFANTIERHRLNHVAELQYSSLI 381 Query: 1449 SGDE---LARLPFTVYSYDDLHLNQFNGNFDHH-------------MSGLEDNIRSASRN 1318 GDE + P Y+ + L +G+F++ +S L + S + Sbjct: 382 FGDEDTCSSLSPAEFYANARMLLKSSDGDFENDSLKKAYTSISNELLSSLMNGASSEMVS 441 Query: 1317 EPG--------------WYANSTVSSQMV-------VQTCYTPEGTVSFVDRFEKDYDDL 1201 E G YAN ++S + C + ++S + ++ Y Sbjct: 442 ENGSFSDDALVSGFCQYRYANDPLASVPLNLGVSNGSYDCSSNGNSMSSLS-WKHYYARP 500 Query: 1200 VMDRALRARNTNVEPDFSLPGSDNIDSFV------------TNFNAGS-------FGDLA 1078 N N EP+ L SD DS + + + AG+ G Sbjct: 501 FYFNKSSVENGNCEPELCL--SDLSDSCLGVETPKCPQESSSIYQAGTDYSEDFWSGGSE 558 Query: 1077 ISSPEDTFSDSISVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLLYGQCCHGIALS 901 ISSP + +S+++D+ ++ L + D E++N L DL+GDYDSHIRSLLYGQCC+G LS Sbjct: 559 ISSPRTSVLESVTLDIGERDLASIAGDIEAINPLVDLSGDYDSHIRSLLYGQCCYGCYLS 618 Query: 900 SPAKCSTLLSSPTQLQNKP-WDTVRQNLPV----IWKMNSSDAAFGHP-PYAVDNSNPST 739 +P S SSP+ QNK WDTVRQ++P+ W+ N + P N+ S Sbjct: 619 APVLNSP--SSPSPSQNKNFWDTVRQSIPLRKNSFWQTNGNGMLVVKPAAQPSGNALSSD 676 Query: 738 AGFALVERRKARGTGTYIPHLKRNSSRERPSQARRNQLQSNRCQVQKNTSNNIVDTTSPK 559 A ++ A+GTGTY + + + R + + + + T ++ S 
Sbjct: 677 ATLGSDKKEMAQGTGTYFLNTEYHQERRKGRTKSKALGSHGQFHLHSGTHSHECVAFSDA 736 Query: 558 TLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKWELKFGSFGS 379 + + G +L SH N + ++FGS G+ Sbjct: 737 NHSEEISAVKSSVEGHEKLASSSQSDGLLE---------ESHANAFSNSSCRIEFGSLGN 787 Query: 378 LAEGDCSLGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLNLENDDDFPPL 211 L+ S D + SV Q S+ GRDAE+ L L+N+D+FPPL Sbjct: 788 LSGDVLSHTSRDVVLIPSVPQKV----QLSQPACSKLGRDAEHSLRLKNEDEFPPL 839 >XP_004242484.1 PREDICTED: uncharacterized protein LOC101246260 [Solanum lycopersicum] Length = 844 Score = 565 bits (1456), Expect = 0.0 Identities = 368/848 (43%), Positives = 493/848 (58%), Gaps = 82/848 (9%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP+A+ ED W AE+ A EV+ C+HPT D+EEKRKDV+D+V RLIR S+G EVF YGSV Sbjct: 23 DPSAVTEDCWAVAEE-AVQEVVNCVHPTLDTEEKRKDVVDHVQRLIRCSLGCEVFSYGSV 81 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLT +P +++L RDVLAV+ E+L N E++V+D QFIDAEVKLVK Sbjct: 82 PLKTYLPDGDIDLTVFGSPVVEETLARDVLAVLQEEELKGNTEYDVKDPQFIDAEVKLVK 141 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 C+V N VIDISFNQLGGLSTLCFLEQVDRL+GK+HLFK SIILIKAWCYYESR+LGAHHG Sbjct: 142 CIVRNTVIDISFNQLGGLSTLCFLEQVDRLVGKNHLFKRSIILIKAWCYYESRVLGAHHG 201 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+L+L+IF +F+S+LNGPLA L+RFL YYSKFDW+NY ISLNGPVC S LP+L Sbjct: 202 LISTYALETLVLFIFQLFHSSLNGPLAVLYRFLDYYSKFDWDNYCISLNGPVCKSSLPEL 261 Query: 1788 VVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 V+ EFLRN +MF+V + ES+ R F +K LNIIDPLKENNNLGRS Sbjct: 262 FVEMPDYISNELLLSEEFLRNSAEMFSVPSRGLESDTRPFQQKYLNIIDPLKENNNLGRS 321 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERH------GRQNASHI 1450 V++GN +RI+ AFK+GARKLG+IL P D++A+E K FF NT+ERH Q ++ I Sbjct: 322 VSKGNLYRIQRAFKYGARKLGDILLSPYDKVADETKKFFANTIERHRLNLVAELQYSNLI 381 Query: 1449 SGDE---LARLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTV--- 1288 GDE + P 
Y+ + L +G+F++ D+++ A + +S + Sbjct: 382 FGDEDTCSSLSPAEFYANARMLLKSSDGDFEN------DSLKKAYTSISNELLSSLMNGA 435 Query: 1287 SSQMVVQT-CYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNID---- 1123 SS+MV +T ++ + VS ++ D L L +N D S G+ Sbjct: 436 SSEMVSETGSFSDDALVSGFCQYRYANDPLA-SVPLNLGVSNGSYDCSSNGNSMSSLSWK 494 Query: 1122 -----------SFVTNFNAG--------SFGDLAISSP-----------------EDTFS 1051 S V N N G S L + +P ED +S Sbjct: 495 HYYAPPFYFNKSSVENGNRGPELCQSDLSGSCLGVETPECPQESSSIYKAGTDCSEDFWS 554 Query: 1050 -------------DSISVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLLYGQCCHG 913 +S+++D+ ++ L + D E++N L DL+GDYDSHIRSLLYGQCC+G Sbjct: 555 GGSEISSPRTSVLESVTLDIGERDLASTAGDIEAINPLVDLSGDYDSHIRSLLYGQCCYG 614 Query: 912 IALSSPAKCSTLLSSPTQLQNKP-WDTVRQNLPV----IWKMNSSDAAFGHPPYAVD-NS 751 LS+P S SSP+ QNK WDTVRQ++P+ W+ N + P N+ Sbjct: 615 CYLSAPVLNSP--SSPSPSQNKNFWDTVRQSIPLGKNSFWQTNGNGMLVVEPAARPSGNA 672 Query: 750 NPSTAGFALVERRKARGTGTYIPHLKRNSSRERPSQARRNQLQSNRCQVQKNTSNNIVDT 571 S A ++ A+GTG Y P K +ER +++ + Q ++ + + Sbjct: 673 LSSDATLRSGKKEMAQGTGIYFP--KTEYQQERRKGRTKSKALGSHGQFHLHSGTHSYEC 730 Query: 570 TSPKTLDAN-SIRINAFE---GGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKWE 403 + DAN S I+A + GG +L SH N + Sbjct: 731 VA--FSDANHSEEISAVKSSVGGREKLASSSQSGGLLE---------ESHANAFSNSSCR 779 Query: 402 LKFGSFGSLAEGDCSLGLSDTIAGTS---VSSPASPAG-QTSKEVKGNHGRDAENFLNLE 235 ++FGS G+L+E D ++ TS + P++P Q S+ GRDAE+ L L+ Sbjct: 780 IEFGSLGNLSE--------DVLSHTSRDVILIPSAPQKVQLSEPACSKQGRDAEHSLRLK 831 Query: 234 NDDDFPPL 211 N+D+FPPL Sbjct: 832 NEDEFPPL 839 >XP_011080657.1 PREDICTED: uncharacterized protein LOC105163859 [Sesamum indicum] Length = 852 Score = 562 bits (1448), Expect = 0.0 Identities = 354/850 (41%), Positives = 463/850 (54%), Gaps = 84/850 (9%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP ++ E+ AE+ A +VL C+HPT DSEEKR+D++DYV RLI+ + EVFPYGSV Sbjct: 30 DPASLSEESLSAAEEAA-QQVLNCVHPTLDSEEKRRDIVDYVQRLIKSHLNCEVFPYGSV 88 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 
PLKTYLPDGDIDLTAL P +++SL DV A++ E+ + NAE++V+DTQ+IDAEVKLVK Sbjct: 89 PLKTYLPDGDIDLTALKVPNAEESLPHDVFALLQREEKSENAEYQVKDTQYIDAEVKLVK 148 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 CLV NIVIDISFNQLGGLSTLCFLEQVDRL+G++HLFK SIIL+KAWCYYESRILGAHHG Sbjct: 149 CLVRNIVIDISFNQLGGLSTLCFLEQVDRLVGRNHLFKRSIILVKAWCYYESRILGAHHG 208 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+LILYIFH+++S+L+GPLA L+RFL YYS+FDWENY ISL GP C S LPD+ Sbjct: 209 LISTYALETLILYIFHLYHSSLSGPLAVLYRFLDYYSQFDWENYCISLKGPACKSSLPDI 268 Query: 1788 VVDN-XXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 VV+ EFL NC++MF+VS + E N +AF K+LNIIDPLKENNNLGRS Sbjct: 269 VVETPESGWNNLMLSEEFLENCIEMFSVSSRTPEGNPKAFQAKHLNIIDPLKENNNLGRS 328 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHI------ 1450 V+RGN++RIRSAFK+GARKLG +L PRD++A I +FF NTL RHG S Sbjct: 329 VHRGNFYRIRSAFKYGARKLGQVLLQPRDKVANGICDFFTNTLARHGNDFRSSFQCLTLE 388 Query: 1449 SGDELARL-----PFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTVS 1285 SGDE + P + S DD+ L + D + ++R +NE Y + Sbjct: 389 SGDEESSTASLSSPVELLSEDDMLLKSSASDAD------KGSVRFELKNETDRYLTIELP 442 Query: 1284 SQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGS---------- 1135 S+M + Y+ EG +S + YD + + +LR ++ S Sbjct: 443 SEMA--SGYSAEGVISGHHIAGETYDLVTSNSSLRNGTSDYASSSDYTSSVSWNHCHESL 500 Query: 1134 -DNIDSFVTNFNAGSFGDLAISSPE---------------------------DTFSDSIS 1039 I + + G+F +++ D D Sbjct: 501 YSGISAEIGQLRTGTFHQSGLATTASLKFGISSWLKYIEENGKISSTYQWCMDNSHDGCL 560 Query: 1038 VDLRKKILDGNSDDTESLN------------------LADLTGDYDSHIRSLLYGQCCHG 913 L I N D SL LADL+GDYDSHIRSLLYGQ CHG Sbjct: 561 TGLGSSIPKANILDNLSLEFREMDLTSLGGESEAFNPLADLSGDYDSHIRSLLYGQLCHG 620 Query: 912 IALSSPAKCSTLLSSPTQLQNKPWDTVRQNLPVIWKMNSSDAAFGHPPYAVDNSNP---- 745 +L + C T L KPWD VRQ++P W+ S + P +V+ S Sbjct: 621 FSLFTSEVCHTPSFPSRLLNKKPWDIVRQSMP-FWRSQFSKVS--SRPVSVEQSRRLAAD 677 Query: 744 ---STAGFALVERRKARGTGTYIPHLKRNSSRERPSQAR-RNQLQSNRCQVQK-NTSNNI 580 S++ F K RGTGTY PH+K RE+ Q R R 
+ N Q + N + Sbjct: 678 SAFSSSAFRSEGMHKTRGTGTYFPHVK-GFYREKSLQGRSRYKALGNPNQFHRYGRCNGL 736 Query: 579 VDTTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIP---- 412 + F G H++ D+ Q S G+G Sbjct: 737 YPALGTPSY---------FGNGIHEVLPARSKSKSRGRLDVQCQSPRSVGDGNQASGDLN 787 Query: 411 -KWELKFGSFGSLAEG--DCSLGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLN 241 ++FGS G+L E S + + G S+ + + + V G ++ Sbjct: 788 GSCRIEFGSVGNLGEEVISTSAHVCGSALGVSLETQCTKKLMKQERVPGPS-------IH 840 Query: 240 LENDDDFPPL 211 L+N+ DFPPL Sbjct: 841 LKNEVDFPPL 850 >XP_019178455.1 PREDICTED: uncharacterized protein LOC109173649 [Ipomoea nil] Length = 850 Score = 551 bits (1420), Expect = e-179 Identities = 366/860 (42%), Positives = 474/860 (55%), Gaps = 94/860 (10%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP+AI E+ W +AE++ E++ CIHPT DSEEKRKDV++Y+ +LIR S+ EVFPYGSV Sbjct: 23 DPSAIAEECWTSAEEMI-QELVNCIHPTMDSEEKRKDVMEYIQKLIRDSLACEVFPYGSV 81 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLTAL+ P +++ L DVLA++ E+ N E+EVRDTQFIDAEVKLVK Sbjct: 82 PLKTYLPDGDIDLTALTAPNTEEFLVHDVLALLREEEKKENVEYEVRDTQFIDAEVKLVK 141 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 CLV +IVIDISFNQLGGL +LCFLEQVDRLIGK+HLFKHSI+LIK+WCYYESRILG+HHG Sbjct: 142 CLVQDIVIDISFNQLGGLCSLCFLEQVDRLIGKNHLFKHSIMLIKSWCYYESRILGSHHG 201 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+LILYIF F+S+LNGPLA L+RFL YYS+FDWE Y ISLNGPVC + LP++ Sbjct: 202 LISTYALETLILYIFQFFHSSLNGPLAVLYRFLDYYSRFDWEKYCISLNGPVCKASLPEI 261 Query: 1788 VVDNXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRSV 1609 VV EFLRNC++MF+ +P N R F +K+LNIIDPLKE NNLGRSV Sbjct: 262 VVSIPDNGGNLLLSEEFLRNCMEMFS-APSRVIGNTRVFKQKHLNIIDPLKETNNLGRSV 320 Query: 1608 NRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQ---NASHI---- 1450 +RGNY+RIRSAFK+GARKLG IL D++ + I FF NT++RHG N H Sbjct: 321 HRGNYYRIRSAFKYGARKLGRILLSSPDKIGDGITKFFANTIDRHGHNVFYNLKHSSLKC 380 Query: 1449 
----SGDELARLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTVSS 1282 SG + P S D++ LN G ++ ED S RNE AN + + Sbjct: 381 CPNGSGVLSSPSPAEFISEDEMPLNSSFGYCENDNFEWEDKCGSVLRNE----ANKLMKT 436 Query: 1281 QMVVQTC--YTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEP---------------D 1153 V C T TVS D DD AL +TN D Sbjct: 437 ---VSECSSLTIGATVS-GHGLSGDTDDPACSHALNPSSTNSMSNCSTSGNCSDSLSGLD 492 Query: 1152 FSLPGSDNIDSFVTN--------FNAGSF------------------------------- 1090 +S P ++ S N F +G F Sbjct: 493 YSAPEFHSLKSSAINGSCKNWALFQSGQFDYVYGNPGFGSCIDQGEFPLENSSIYQSVTD 552 Query: 1089 -------GDLAISSPEDTFSDSISVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLL 934 GD A S+P+ + +S+S+D R++ L + D E LN LADLTGDYD HIRSL+ Sbjct: 553 YSESICSGDSATSTPKTSILESLSLDFRERDLASIASDLEVLNPLADLTGDYDCHIRSLI 612 Query: 933 YGQCCHGIAL--------------SSPAKCSTLLSSPTQLQNKPWDTVRQNLPVIWKMNS 796 YGQCCHG AL + A C+ + S T QN T + PVI + Sbjct: 613 YGQCCHGYALMASLLFDPSTQSHFQNQAFCNAVRQSSTPGQNSVAKTNMR--PVIVR--- 667 Query: 795 SDAAFGHPPYAVDNSNPSTAGFALVERR-KARGTGTYIPHLKRNSSRERPSQAR-RNQLQ 622 P+ +NPS++ E + KA ++P++ + RERP + + RN+ Sbjct: 668 --------PFVSSPTNPSSSTVTRSEEKPKAPLPQPFVPNM-HHYFRERPWKPKGRNKEF 718 Query: 621 SNRCQVQKNTSNNIVDTTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPI 442 + Q T++N P +AN + + + H P Sbjct: 719 GSDVQFHNGTNSN---GWVPVLSEANCLENDGRQEFSHAQPSCKKRGKFSTPNQYDHHPA 775 Query: 441 -NSHGNGTTIPKWELKFGSFGSLAEGDCSLGLSDTIAGTSVSS--PASPAGQTSKEVKGN 271 +S NG E++FGS G L E D G S T++S P++ KE Sbjct: 776 GDSDENGLANLLCEIEFGSLGKLPE-DFLSGSSRDRTPTTLSDKVPSAKPALCRKE---- 830 Query: 270 HGRDAENFLNLENDDDFPPL 211 R A+ +L+N+++FPPL Sbjct: 831 --RFADQSFHLKNEEEFPPL 848 >XP_012841200.1 PREDICTED: uncharacterized protein LOC105961493 [Erythranthe guttata] Length = 765 Score = 545 bits (1404), Expect = e-178 Identities = 347/789 (43%), Positives = 460/789 (58%), Gaps = 23/789 (2%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP + E+ A + A +V+ C+HPT DSEEKR+DVIDYV RLI+ I EVFPYGSV Sbjct: 22 DPAPLSEECLSAAAEAAAQQVVNCVHPTLDSEEKRRDVIDYVQRLIKSQINCEVFPYGSV 81 
Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLTA+ + ++ L +V A++ E+ N NAEF+V+D QFIDAEVKLVK Sbjct: 82 PLKTYLPDGDIDLTAVKGLEGEEVLAHEVFALLQREEKNENAEFQVKDPQFIDAEVKLVK 141 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 CLV NIVIDISFNQLGGLSTLCFLEQVDRL+G++HLFK SIIL+KAWCYYESR+LGAHHG Sbjct: 142 CLVQNIVIDISFNQLGGLSTLCFLEQVDRLVGRNHLFKRSIILVKAWCYYESRVLGAHHG 201 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+LILYIFH+F+S+L+GPL+ L++FL YYS+FDWENY +SL GPVC S LPD+ Sbjct: 202 LISTYALETLILYIFHLFHSSLSGPLSVLYKFLEYYSQFDWENYCVSLKGPVCKSSLPDI 261 Query: 1788 VVDN-XXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 VV EFL NC++MF+VS + E +AF K LNIIDPLKENNNLGRS Sbjct: 262 VVKTPESERKDLMLSEEFLENCMEMFSVSSRVVEGKPKAFQTKYLNIIDPLKENNNLGRS 321 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHIS----- 1447 V+RGN++RIRSAFK+GARKLG + P+D++A+EI FF +T+ RHG S Sbjct: 322 VHRGNFYRIRSAFKYGARKLGKVFQQPKDKIADEISEFFADTIARHGSDYRSGTQGLTLE 381 Query: 1446 -GDE-----LARLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTVS 1285 GDE + P + S DD+ L + D+ GLE +NE ++ S Sbjct: 382 FGDEDSSTAYSSSPVELSSEDDIILKSSVCD-DNDSLGLE------FKNESNRHSELATS 434 Query: 1284 SQMVVQTCYTPE-GTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNIDSFVTN 1108 + ++ PE + S + + L A N D L DN + Sbjct: 435 NSSLING--VPEYASSSNNSSWNHSHKPFYCSSKLSAENEKFCLDTWL---DNREENGET 489 Query: 1107 FNAGSFGDLAISSPEDTFSDSISVDLRKKILDGNSDDTESLN-LADLTGDYDSHIRSLLY 931 N + L S + F+D I++D ++ L ++E+ N LADL GDY+SHIRSLLY Sbjct: 490 NNTYQW-CLDNSHAKSNFTDHIALDFKEMDLKSVGGESEAFNPLADLRGDYESHIRSLLY 548 Query: 930 GQCCHGIALSSPAKCSTLLSSPTQLQNKP--WDTVRQNLPVIWKMNSSDAAFGHPPYAVD 757 GQ CHG +LS+ T P+++ NK DT+ Q +P S P+ ++ Sbjct: 549 GQLCHGFSLSTSVAYHTPF-LPSRVSNKKPLSDTLPQPMPACRVQFSQ---MSSTPFQIE 604 Query: 756 NSNPSTAGFALVERRKARGTGTYIPHLKRNSSRERPSQA-RRNQL----QSNRCQVQKNT 592 + A FA +R ++GTGTY PH+ +RERP+Q RN+ N K Sbjct: 605 QNVGQLANFA---QRASQGTGTYFPHVNA-FNRERPTQGWSRNKAPPHNHHNHFHRHKRV 660 
Query: 591 SNNIVDTTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIP 412 + + T A S N+ G D + Q+ SQ S NG Sbjct: 661 NGSCAYT-------APSQPKNSENGVDEVV-----PSARRRSQESMSQSPRSVNNGDQKS 708 Query: 411 K--WELKFGSFGSLAEGDCSLGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLNL 238 + ++FGS G+LAE + IA +S ++ T KE R + ++L Sbjct: 709 RKLSRIEFGSIGNLAE--------EVIAASSGVIASTQCSSTKKE------RVSVPIVHL 754 Query: 237 ENDDDFPPL 211 +N+D+FPPL Sbjct: 755 KNEDEFPPL 763 >XP_002518281.1 PREDICTED: uncharacterized protein LOC8258097 isoform X1 [Ricinus communis] EEF44041.1 nucleic acid binding protein, putative [Ricinus communis] Length = 821 Score = 544 bits (1402), Expect = e-177 Identities = 340/806 (42%), Positives = 465/806 (57%), Gaps = 39/806 (4%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP I E+ WE AEQ +++ IHPT +++ RK V++YV LI+ S+G +VFPYGSV Sbjct: 42 DPALISEENWERAEQATL-QIVYRIHPTVEADCNRKHVVEYVQSLIQSSLGFQVFPYGSV 100 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLTA+ P D+ DV AV+ E+ N +A ++V+D FIDAEVKL+K Sbjct: 101 PLKTYLPDGDIDLTAIINPAGVDASVSDVHAVLRREEQNRDAPYKVKDVHFIDAEVKLIK 160 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 C+V +IV+DISFNQLGGLSTLCFLEQVD+LIGK HLFK SIILIKAWCYYESRILGAHHG Sbjct: 161 CIVHDIVVDISFNQLGGLSTLCFLEQVDQLIGKSHLFKRSIILIKAWCYYESRILGAHHG 220 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+LILYIFH+F+S+LNGPL L+RFL Y+SKFDW+NY ISLNGPVC S LP + Sbjct: 221 LISTYALETLILYIFHLFHSSLNGPLMVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPKI 280 Query: 1788 VVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 V + EFLRN V M +V ++ E N R F +K+LNI+DPL+ENNNLGRS Sbjct: 281 VAEPPETGRGNLLLDDEFLRNSVKMLSVPSRSPEMNSRPFTQKHLNIVDPLRENNNLGRS 340 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHISG---- 1444 VNRGN++RIRSAFK+GARKLG+IL+L DR+ E+ FF NTL+RHG + +H+ Sbjct: 341 VNRGNFYRIRSAFKYGARKLGHILSLQSDRMINELDKFFANTLDRHGSNSLTHVKSSCLV 400 Query: 1443 
------DELARLPFTVYSYDDLHLNQFNGN-----FDHHMSGLEDNIR----SASRNEPG 1309 D L+ + S +D + + F+ SG N S+ E G Sbjct: 401 SPTGNFDNLSSSSLSDTSSEDSIVQKSTAGCSVRPFETSCSGNSHNASHFYLSSLHGEDG 460 Query: 1308 WYANSTVSSQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDN 1129 + + + + +G +S + E + V++ + +N E SL Sbjct: 461 KFESGISDGTTLAN--FVIDGQISCTEWSESKENHFVINNS-ACSCSNHEGKTSL--CST 515 Query: 1128 IDSFVTNFNAGSFGDLAISSPEDTFSDSISVDLRKKILDGNSDDTESLNLADLTGDYDSH 949 I S V N + +LA ++ E F+ + K +L DLTGDYDSH Sbjct: 516 IPSLVNNISE----NLAPTTAERDFASISQIPRSFK------------SLLDLTGDYDSH 559 Query: 948 IRSLLYGQCCHGIALSSPA-KCSTLLSSPTQLQNKPWDTVRQNLPVIWKMNS---SDAAF 781 ++S+ +GQ C A+S+P CS ++P PW+TVRQ+L + ++S ++ F Sbjct: 560 LKSVKFGQGCCFFAVSAPVLPCSP--TAPHSKNKNPWETVRQSLQLKRNVHSQINTNGIF 617 Query: 780 GHPPYAVDNSNPSTAGFALVERRKARGTGTYIPHLKRNSSRERPSQARR-NQLQSNRCQV 604 GH + +++ P T F+ E+RK RGTGTYIP++ +S+RERPS RR N + +N + Sbjct: 618 GHQQHFLNHLVPFTTAFSSEEKRKQRGTGTYIPNMSYHSNRERPSSERRKNHVTANNGDL 677 Query: 603 QKNTSNNIVDTTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPIN----- 439 + T +N + T P IN+++ G H+L ++ Sbjct: 678 HRRTRDNGLAATRP--------GINSYQHG-HELSEAEYPYLGNGKPVPSEVQLSQSFVW 728 Query: 438 --SHGNGTTIPKWELKFG------SFGSLAEGDCSLGLSDTIAGTSVSSPASPAGQTSKE 283 S NG + P + FG SL E + S + SSP A + + Sbjct: 729 GPSSANGFSRPSERIDFGGQELQLQEASLQERVPTQDSSTSSTLVFPSSPEVTAAERREP 788 Query: 282 VKGN-HGRDAENFLNLENDDDFPPLS 208 V N R A +L+++ DFPPLS Sbjct: 789 VLQNVQERAASESYHLKDEVDFPPLS 814 >XP_016577610.1 PREDICTED: uncharacterized protein LOC107875414 [Capsicum annuum] Length = 836 Score = 545 bits (1403), Expect = e-177 Identities = 357/842 (42%), Positives = 468/842 (55%), Gaps = 72/842 (8%) Frame = -2 Query: 2517 VAADPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPY 2338 V DP+ + ED W E+ A EV+ C+HPT D+EEKRKDV+DYV RLIR S+G EVF Y Sbjct: 21 VNPDPSQVKEDCWVVGEE-AVQEVVNCVHPTLDTEEKRKDVVDYVQRLIRCSLGCEVFSY 79 Query: 2337 GSVPLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVK 2158 GSVPLKTYLPDGDIDLT S+P +++L RDVLAV+ E+L N E++V+D QFIDAEVK Sbjct: 
80 GSVPLKTYLPDGDIDLTVFSSPIIEETLARDVLAVLQEEELKENTEYDVKDPQFIDAEVK 139 Query: 2157 LVKCLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGA 1978 LVKC+V NIVIDIS NQLGGL TLCFLEQVDR++GK+HLFK SIILIKAWCYYESR+LGA Sbjct: 140 LVKCIVQNIVIDISVNQLGGLCTLCFLEQVDRIVGKNHLFKRSIILIKAWCYYESRVLGA 199 Query: 1977 HHGLISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYL 1798 HHGLISTYALE L+L+IF +F+S+LNGPL+ L+RFL YY KFDW+NY ISLNGPV S L Sbjct: 200 HHGLISTYALEILVLFIFQLFHSSLNGPLSVLYRFLDYYCKFDWDNYCISLNGPVRKSSL 259 Query: 1797 PDLVVD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNL 1621 P + V+ EFLR+ ++MF+V P+ E++ R F +K LNIIDPLKENNNL Sbjct: 260 PAIFVEMPDYITKKLLLSEEFLRDSMEMFSVPPRGLETDTRPFQQKYLNIIDPLKENNNL 319 Query: 1620 GRSVNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERH------GRQNA 1459 GRSV++GN +RI+ AFK+GARKLG IL P D++A+EIKNFF NT+ERH Q + Sbjct: 320 GRSVSKGNLYRIQRAFKYGARKLGEILLSPSDKVADEIKNFFANTIERHILNHVADLQYS 379 Query: 1458 SHISGDE---LARLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTV 1288 S I GDE + P Y+ + L +G+F++ L+ +S S NE + V Sbjct: 380 SLILGDEDTCSSLSPAEFYANARMLLKSSDGDFEN--DSLKKVFKSIS-NESSSNLMNGV 436 Query: 1287 SSQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNIDSFVTN 1108 S MV + +G +D +D L N D+ G+ + Sbjct: 437 PSDMVSENASFSDGAAVSGFCPYRDANDSSASVPLNLDVANGTYDYCSNGTSMSSLSWKH 496 Query: 1107 FNAGSF------------------GDLAISS----------------------PEDTFSD 1048 + A F DL+ S ED +S Sbjct: 497 YYAPPFHFNKSCVETGNCAENLCQRDLSDSGLWVETTECPQESSNIYQAGADYSEDVWSG 556 Query: 1047 SISVDLRK-KILDGNSDDTESLNLADLTGD-------------YDSHIRSLLYGQCCHGI 910 ++ K IL+ + + E + A GD YDSHIRSLLYGQCC+G Sbjct: 557 GSAISSPKTSILESLTLNIEERDTASAAGDIESMNPLVDLSGDYDSHIRSLLYGQCCYGF 616 Query: 909 ALSSPAKCSTLLSSPTQLQN-KPWDTVRQNLPVIWKMNSSDAAFGH----PPYAVDNSN- 748 LS+P S SSP QN W+TVRQ++P+ + NS +G+ P A + N Sbjct: 617 VLSAPVLNSP--SSPPPSQNITYWNTVRQSMPL--RQNSFWQTYGNVMLVEPAAQPSGNA 672 Query: 747 -PSTAGFALVERRKARGTGTYIPHLKRNSSRERPSQARRNQLQSN-RCQVQKNTSNNIVD 574 PSTA E+ KA+GTGTY P+ + R + R S+ + ++ N S + Sbjct: 673 
LPSTATLGSEEKEKAQGTGTYFPNAEYRQERRFKGRTRSKAPGSHGQLHLKSNHSEKCTE 732 Query: 573 TTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKWELKF 394 S + + G +L L N+ N + + ++F Sbjct: 733 EIS---------AVKSSVQGREKLAIPSQSDNF-----LEDSRANAFSNSSCV----IEF 774 Query: 393 GSFGSLAEGDCSLGLSDTIAGTSVSSPASPAGQTSKEVKGNHGRDAENFLNLENDDDFPP 214 GS G+L+EG S D + SV + Q SK GR A L L+N+D+FPP Sbjct: 775 GSLGNLSEGVLSRTSRDVLLNPSVPQQS----QLSKPACSKQGRAAVPLLGLKNEDEFPP 830 Query: 213 LS 208 LS Sbjct: 831 LS 832 >XP_007017068.2 PREDICTED: uncharacterized protein LOC18591082 isoform X1 [Theobroma cacao] Length = 836 Score = 541 bits (1393), Expect = e-176 Identities = 356/836 (42%), Positives = 470/836 (56%), Gaps = 70/836 (8%) Frame = -2 Query: 2505 PTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSVP 2326 P +I D W++AE+ A ++ + PT D++ KRK++++YV RLI+ +G +VFPYGSVP Sbjct: 39 PCSIARDSWDSAEETA-RRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVP 97 Query: 2325 LKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVKC 2146 LKTYLPDGDIDLT LS+P +D+L DV A++ E+ N A + V+D IDAEVKLVKC Sbjct: 98 LKTYLPDGDIDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKC 157 Query: 2145 LVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHGL 1966 LV +IV+DISFNQLGGL TLCFLEQ+DRL+GKDHLFK SIILIKAWCYYESRILGAHHGL Sbjct: 158 LVQDIVVDISFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGL 217 Query: 1965 ISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDLV 1786 ISTYALE+L+LYIFH+F+S+LNGP+A L+RFL Y+SKFDWENY ISLNGPVC S LPD+V Sbjct: 218 ISTYALETLVLYIFHLFHSSLNGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIV 277 Query: 1785 VD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRSV 1609 + EFLR C++MF+V K E+N R F K+LNIIDPLKENNNLGRSV Sbjct: 278 AEVPENVGNNPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSV 337 Query: 1608 NRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHISGDELAR 1429 NRGNY+RIRSAFK+GA KL IL LPR+R+ +E+ FF NTLERHG ++H++G + Sbjct: 338 NRGNYYRIRSAFKYGAHKLEQILILPRERIPDELVKFFANTLERHG---SNHLTG--MQN 392 Query: 1428 
LPFT--VYSYDDLH----LNQFNGNFDHHMSGLEDNIRSASRNEPGWYANS--------- 1294 LP T YD + + +GN+ + N+ S++ G A S Sbjct: 393 LPSTSDARGYDHVMPSPCASMCSGNY---LFAKSINVGSSNNRMSGSIAASGSRYKLGCP 449 Query: 1293 --TVSSQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNIDS 1120 ++SQ+V + D + V+ L ++ N D S P S N+ + Sbjct: 450 FDVLTSQVVPEKKSNVNRNAVSGHCHPGDAKEFVLSGLLAMKSENDSSD-SFPPSSNLGA 508 Query: 1119 --FVTNFNAGSFGDLAI-SSPEDTFSDSISVD--------------------LRKKILDG 1009 V G + I +S + T +DSI+ D + K+ L G Sbjct: 509 SLSVKPRTCRQMGMVEIGNSFKSTLTDSIAADDMSFALKPYSKNDTLAASNVVCKRELAG 568 Query: 1008 NSDDTESL-NLADLTGDYDSHIRSLLYGQCCHGIALSSPAKCSTLLSSPTQLQNKPWDTV 832 D+ESL +L DLTGDYD SLLYGQ CH ++SSP SP W+T+ Sbjct: 569 IFGDSESLKSLLDLTGDYDGQFWSLLYGQYCHLFSVSSPV-------SPHLQNENHWETI 621 Query: 831 RQNLPVIWKMNS---------SDAAFGHPPYAVDNSNPSTAGFALVERRKARGTGTYIPH 679 Q++P+ + S S F PP AV + S E +K RGTGTYIP Sbjct: 622 EQSIPLKQDLYSQRDSNGILGSQFCFSKPPVAVHTALDS-------EDKKKRGTGTYIPS 674 Query: 678 LKRNSSRERPSQARRNQLQSNRC--QVQKNTSNNIVDTTSPKTLDANSIRINAFEGGDHQ 505 +K S+RER S R Q++R Q+Q+ T+N T + + + G H+ Sbjct: 675 IKYRSNRERHSSG-RGIFQASRAYSQLQRYTNNKGSATVQQE--------MALSQEGSHE 725 Query: 504 LXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKWEL-----------KFGSFGSLAEGDCS 358 L + P N+H ++ W L +F S S E + Sbjct: 726 L----SPKEYPALGPVKFGPPNTHPPYPSV--WGLCAASGLNCPPERFESESSSLELQST 779 Query: 357 -----LGLSDTIAGTSVSSPASPAGQTSKEV-KGNHGRDAENFLNLENDDDFPPLS 208 L D S S PA Q++K V + N DA +L+N+ DFPPLS Sbjct: 780 NMPEDNALPDPCTCGSTPSVMIPAAQSAKPVLESNQESDAGLSYHLKNEHDFPPLS 835 >EOY34687.1 NT domain of poly(A) polymerase and terminal uridylyl transferase-containing protein, putative isoform 1 [Theobroma cacao] Length = 836 Score = 538 bits (1385), Expect = e-174 Identities = 354/836 (42%), Positives = 470/836 (56%), Gaps = 70/836 (8%) Frame = -2 Query: 2505 PTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSVP 2326 P +I + W++AE+ A ++ + PT D++ KRK++++YV RLI+ +G +VFPYGSVP Sbjct: 39 PCSIARESWDSAEETA-RRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVP 97 Query: 2325 
LKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVKC 2146 LKTYLPDGDIDLT LS+P +D+L DV A++ E+ N A + V+D IDAEVKLVKC Sbjct: 98 LKTYLPDGDIDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKC 157 Query: 2145 LVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHGL 1966 LV +IV+DISFNQLGGL TLCFLEQ+DRL+GKDHLFK SIILIKAWCYYESRILGAHHGL Sbjct: 158 LVQDIVVDISFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGL 217 Query: 1965 ISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDLV 1786 ISTYALE+L+LYIFH+F+S+L GP+A L+RFL Y+SKFDWENY ISLNGPVC S LPD+V Sbjct: 218 ISTYALETLVLYIFHLFHSSLTGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIV 277 Query: 1785 VD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRSV 1609 + EFLR C++MF+V K E+N R F K+LNIIDPLKENNNLGRSV Sbjct: 278 AEVPENVGNNPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSV 337 Query: 1608 NRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHISGDELAR 1429 NRGNY+RIRSAFK+GA KL IL LPR+R+ +E+ FF NTLERHG ++H++G + Sbjct: 338 NRGNYYRIRSAFKYGAHKLEQILILPRERIPDELVKFFANTLERHG---SNHLTG--MQN 392 Query: 1428 LPFT--VYSYDDLH----LNQFNGNFDHHMSGLEDNIRSASRNEPGWYANS--------- 1294 LP T YD + + +GN+ + N+ S++ G A S Sbjct: 393 LPSTSDARGYDHVMPSPCASMCSGNY---LFAKSINVGSSNNRMSGSIAASGSRYKLGCP 449 Query: 1293 --TVSSQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNIDS 1120 ++SQ+V + + D + V+ L ++ N D S P S N+ + Sbjct: 450 FDVLTSQVVPEKKANVNRNAVSGNCHPGDAKEFVLSGLLAMKSENDSSD-SFPPSSNLGA 508 Query: 1119 --FVTNFNAGSFGDLAI-SSPEDTFSDSISVD--------------------LRKKILDG 1009 V G + I +S + T +DSI+ D + K+ L G Sbjct: 509 SLSVKPRTCRQMGMVEIGNSFKSTLTDSIAADDMSFALKPYSKNDTLAASNVVCKRELAG 568 Query: 1008 NSDDTESL-NLADLTGDYDSHIRSLLYGQCCHGIALSSPAKCSTLLSSPTQLQNKPWDTV 832 D+ESL +L DLTGDYD SLLYGQ CH ++SSP SP W+T+ Sbjct: 569 IFGDSESLKSLLDLTGDYDGQFWSLLYGQYCHLFSVSSPV-------SPHLQNENHWETI 621 Query: 831 RQNLPVIWKMNS---------SDAAFGHPPYAVDNSNPSTAGFALVERRKARGTGTYIPH 679 Q++P+ + S S F PP AV + S E +K RGTGTYIP Sbjct: 622 EQSIPLKQDLYSQRDSNGILGSQFCFSKPPVAVHTALDS-------EDKKKRGTGTYIPS 674 Query: 678 
LKRNSSRERPSQARRNQLQSNRC--QVQKNTSNNIVDTTSPKTLDANSIRINAFEGGDHQ 505 +K S+RER S R Q++R Q+Q+ T+N T + + + G H+ Sbjct: 675 IKYRSNRERHSSG-RGIFQASRAYSQLQRYTNNKGSATVQQE--------MALSQEGSHE 725 Query: 504 LXXXXXXXXXXXXQDLHSQPINSHGNGTTIPKWEL-----------KFGSFGSLAEGDCS 358 L + P N+H ++ W L +F S S E + Sbjct: 726 L----SPKEYPALGPVKFGPPNTHPPYPSV--WGLCAASGLNCPPERFESESSSLELQST 779 Query: 357 -----LGLSDTIAGTSVSSPASPAGQTSKEV-KGNHGRDAENFLNLENDDDFPPLS 208 L D S S PA Q++K V + N DA +L+N+ DFPPLS Sbjct: 780 NMPEDNALPDPCTCGSTPSVMIPAAQSAKPVLESNQESDAGLSYHLKNEHDFPPLS 835 >APA20307.1 PAP/OAS1 substrate-binding domain superfamily protein [Populus tomentosa] Length = 807 Score = 536 bits (1380), Expect = e-174 Identities = 339/795 (42%), Positives = 459/795 (57%), Gaps = 28/795 (3%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP +I E+ WE AE+ E++ IHPT +S KRK VI YV RLI+ S+G EVFPYGSV Sbjct: 48 DPWSIVEENWERAEEFT-REIVYRIHPTVESNFKRKQVIGYVQRLIKSSLGFEVFPYGSV 106 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLTA+S+P +++L D+ AV+ E+LN ++ FEV+D IDAEVKL+K Sbjct: 107 PLKTYLPDGDIDLTAISSPAIEEALVSDIHAVLRREELNEDSTFEVKDVHCIDAEVKLIK 166 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 C+V N V+DISFNQLGGL TLCFLE+VD+L+GK+HLFK SIILIKAWCYYESRILGAHHG Sbjct: 167 CIVQNTVVDISFNQLGGLCTLCFLEEVDQLVGKNHLFKRSIILIKAWCYYESRILGAHHG 226 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+LILYIFH+F+ +LNGPLA L+RFL Y+SKFDWENY ISLNGPVC S LP++ Sbjct: 227 LISTYALETLILYIFHLFHCSLNGPLAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPNI 286 Query: 1788 VVDN-XXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 V + EFL++CVD F+V + E N R F +K+LNI+DPLKENNNLGRS Sbjct: 287 VAEPLENGQGELLLSDEFLKDCVDRFSVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRS 346 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHISGDELA 1432 VNRGN+FRIRSAFK+GARKLG IL LP++R+A+E+K FF NTL+RHG + + ELA Sbjct: 347 
VNRGNFFRIRSAFKYGARKLGQILLLPKERIADELKIFFANTLDRHGSDYWTEVGNSELA 406 Query: 1431 ------------RLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTV 1288 S DD+HL + NG +D+ E + + + PG Sbjct: 407 SGARSSDNSVSLSSHSNTCSEDDIHL-KLNGGYDNDTLFSEKSNHTPPLHFPG------- 458 Query: 1287 SSQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNIDSFVTN 1108 EG + F D + + R EP + S N T Sbjct: 459 ----------LSEGNREILINFSADDEMSCIFRP--------EPKQNHFQSSNSVCSCTK 500 Query: 1107 FNAGSFGDLAISSPEDTFSDSISVDLRKKILDGNSDDTESL-NLADLTGDYDSHIRSLLY 931 + +P D +++S +K G + +++ L +L L GD++ H++SL Y Sbjct: 501 HEGIAPSVSTTPNPADNVPENLSTTRVEKDFAGINGNSQPLKSLLGLRGDHNGHLQSLAY 560 Query: 930 GQCCHGIALSSP-AKCSTLLSSPTQLQNKPWDTVRQNLPVIWKMN-----SSDAAFGHPP 769 Q CH A+S+P C ++L P W+TV+Q+L + K N +++ FG Sbjct: 561 SQYCHMHAVSAPIPPCPSML--PLSENKNGWETVQQSLQL--KQNGHSQMNTNYVFGTQL 616 Query: 768 YAVDNSNPSTAGFALVERRKARGTGTYIPHLKRNSSR-ERPSQAR-RNQLQSNRCQVQKN 595 Y V+ P A E+ K RGTGTYIP++ +SSR +R S R R Q Q Q+ K Sbjct: 617 YCVNPGGPFRAATDSEEKNKRRGTGTYIPNMSYHSSRGDRLSLGRGRTQPQVYHGQLHKY 676 Query: 594 TSNNIVDTTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPI---NSHGNG 424 N + P TL ++ N + + + HS P +S+ NG Sbjct: 677 AHENGL----PTTLQEKNLSENGHDLSEAEYPHLGNGKPVPIEAH-HSYPSVWGSSNANG 731 Query: 423 TTIPKWELKFGSFG-SLAEGDCSLGLSDTIA--GTSVSSPASPAGQTSKEVKGNHGRDAE 253 ++ GS G EG S + ++ GTS +SP + + + ++ R Sbjct: 732 SSRAFVRTDCGSRGLQHPEGPPSTSDLEVLSCPGTSATSPVASTAKDLEILENEQERALL 791 Query: 252 NFLNLENDDDFPPLS 208 +L+++ FPPL+ Sbjct: 792 QQYHLKDNVHFPPLT 806 >XP_006371669.1 hypothetical protein POPTR_0019s14930g [Populus trichocarpa] ERP49466.1 hypothetical protein POPTR_0019s14930g [Populus trichocarpa] Length = 808 Score = 535 bits (1379), Expect = e-174 Identities = 334/798 (41%), Positives = 468/798 (58%), Gaps = 31/798 (3%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP +I E+ WE AE+ E++ IHPT +S KRK +I YV RLI+ S+G EVFPYGSV Sbjct: 49 DPWSIVEENWERAEEFT-REIVYRIHPTVESNFKRKQIIGYVQRLIKSSLGFEVFPYGSV 107 Query: 2328 
PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLT++S+P +++L D+ AV+ E+LN ++ FEV+D IDAEVKL+K Sbjct: 108 PLKTYLPDGDIDLTSISSPAIEEALVSDIHAVLRREELNEDSTFEVKDVHCIDAEVKLIK 167 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 C+V N V+DISFNQLGGL TLCFLE+VDRL+GK+HLFK SIILIKAWCYYESRILGAHHG Sbjct: 168 CIVQNTVVDISFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHG 227 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+LILYIFH+F+ +LNGPLA L+RFL Y+SKFDWENY ISLNGPVC S LP++ Sbjct: 228 LISTYALETLILYIFHLFHCSLNGPLAVLYRFLEYFSKFDWENYCISLNGPVCKSSLPNI 287 Query: 1788 VVDN-XXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 V + EFL++C D F+V + E N R F +K+LNI+DPLKENNNLGRS Sbjct: 288 VAEPLENGQGELLLSDEFLKDCADRFSVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRS 347 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHISGDELA 1432 VNRGN+FRIRSAFK+GARKLG IL LP++R+A+E+K FF NTL+RHG + + ELA Sbjct: 348 VNRGNFFRIRSAFKYGARKLGQILLLPKERIADELKIFFANTLDRHGSDYWTEVGNSELA 407 Query: 1431 ------------RLPFTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTV 1288 S DD+HL + NG +D+ E + + + PG S Sbjct: 408 SGARSSDNSVSRSSHSDTCSEDDMHL-KLNGGYDNDTLFSEKSNHTPPLHFPGL---SEG 463 Query: 1287 SSQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRA--RNTNVEPDFSLPGSDNIDSFV 1114 + +M++ ++ + +S + R E + ++ + ++ + P S Sbjct: 464 NREMLIN--FSADDEMSCIFRPEPKQNHFQNSNSVCSCTKHEGIAPSVS----------- 510 Query: 1113 TNFNAGSFGDLAISSPEDTFSDSISVDLRKKILDGNSDDTESL-NLADLTGDYDSHIRSL 937 +P D +++S +K G + +++ L +L L GD++ H++SL Sbjct: 511 -----------TTPNPADNVPENLSTTRVEKDFAGITGNSQPLKSLLGLRGDHNGHLQSL 559 Query: 936 LYGQCCHGIALSSP-AKCSTLLSSPTQLQNKPWDTVRQNLPVIWKMN-----SSDAAFGH 775 Y Q CH A+S+P C ++L P W+TV+Q+L + K N +++ FG Sbjct: 560 AYSQYCHMHAVSAPIPPCPSML--PLSENKNRWETVQQSLQL--KQNGHSQMNTNHIFGT 615 Query: 774 PPYAVDNSNPSTAGFALVERRKARGTGTYIPHLKRNSSR-ERPSQAR-RNQLQSNRCQVQ 601 Y V+ P A E++ RGTGTYIP++ +SSR +R S R R Q Q+N Q+ Sbjct: 616 QLYCVNPGGPFRAATDSEEKKIRRGTGTYIPNMSYHSSRGDRLSLGRGRTQPQANHGQLH 675 
Query: 600 KNTSNNIVDTTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPI---NSHG 430 K T N + P TL ++ + + + + HS P +S+ Sbjct: 676 KYTHENGL----PTTLQEKNLSEHGHDLSEAEYPHLGNGKPVPLEAH-HSYPSVWGSSNA 730 Query: 429 NGTTIPKWELKFGSFGSLAEGDCSLGLSDTIA----GTSVSSPASPAGQTSKEVKGNHGR 262 NG++ GS G L + SD + GTS +SP + + + ++ R Sbjct: 731 NGSSRAFVRTDCGSRG-LQHPEGPPSTSDLVVLSCPGTSATSPVASTAKDLEILENEQER 789 Query: 261 DAENFLNLENDDDFPPLS 208 +L+++ FPPL+ Sbjct: 790 ALLQQYHLKDNVHFPPLT 807 >XP_011041798.1 PREDICTED: uncharacterized protein LOC105137670 [Populus euphratica] Length = 807 Score = 533 bits (1372), Expect = e-173 Identities = 337/796 (42%), Positives = 465/796 (58%), Gaps = 29/796 (3%) Frame = -2 Query: 2508 DPTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSV 2329 DP +I E+ WE AE+ E++ IHPT +S KRK VI YV RLI+ S+G EVFPYGSV Sbjct: 49 DPWSIVEENWERAEEFT-REIVYRIHPTVESNFKRKQVIGYVQRLIKSSLGFEVFPYGSV 107 Query: 2328 PLKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVK 2149 PLKTYLPDGDIDLTA+S+P +++L D+ AV+ E+LN ++ FEV+D IDAEVKL+K Sbjct: 108 PLKTYLPDGDIDLTAISSPAIEEALVSDIHAVLRREELNEDSTFEVKDVHCIDAEVKLIK 167 Query: 2148 CLVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHG 1969 C+V N V+DISFNQLGGL TLCFLE+VD+L+GK+HLFK SIILIKAWCYYESRILGAHHG Sbjct: 168 CIVQNTVVDISFNQLGGLCTLCFLEEVDQLVGKNHLFKRSIILIKAWCYYESRILGAHHG 227 Query: 1968 LISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDL 1789 LISTYALE+LILYIFH+F+ +LNGPLA L+RFL Y+SKFDWENY ISLNGPVC S LP++ Sbjct: 228 LISTYALETLILYIFHLFHCSLNGPLAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPNI 287 Query: 1788 VVDN-XXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRS 1612 V + EFL++CVD F+V + E N R F +K+LNI+DPLKENNNLGRS Sbjct: 288 VAEPLENGQGELLLSDEFLKDCVDRFSVQSRKPEMNSRPFPQKHLNIVDPLKENNNLGRS 347 Query: 1611 VNRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHG---------RQNA 1459 VNRGN+FRIRSAFK+GARKLG IL LP++R+++E+K FF NTL+R G + A Sbjct: 348 VNRGNFFRIRSAFKYGARKLGQILLLPKERISDELKIFFANTLDRLGSDYLTEGGNSELA 407 Query: 1458 
SHISGDELARLP--FTVYSYDDLHLNQFNGNFDHHMSGLEDNIRSASRNEPGWYANSTVS 1285 S S D L S DD+HL + NG +D+ E + + + PG S + Sbjct: 408 SGASSDNSVSLSSHSNTCSEDDMHL-KLNGGYDNDTLFSEKSNHTPPLHFPGL---SEGN 463 Query: 1284 SQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNIDSFVTNF 1105 +M++ E + F ++++ ++ + P S Sbjct: 464 REMLINFSADDEMSCIFTPEPKQNHFQSSNSVCSCTKHEGIAPSVS-------------- 509 Query: 1104 NAGSFGDLAISSPEDTFSDSISVDLRKKILDGNSDDTE-SLNLADLTGDYDSHIRSLLYG 928 +P D +++S +K G + +++ S +L L GD++ H++SL Y Sbjct: 510 --------TTPNPADNVPENLSTTRVEKDFAGITGNSQPSKSLLGLRGDHNGHLQSLAYS 561 Query: 927 QCCHGIALSSP-AKCSTLLSSPTQLQNKPWDTVRQNLPVIWKMN-----SSDAAFGHPPY 766 Q CH A+S+P C ++L P W+TV+Q+L + K N +++ FG Y Sbjct: 562 QYCHVHAVSAPIPPCPSML--PLSENKNSWETVQQSLQL--KQNGHSQMNTNHVFGTQLY 617 Query: 765 AVDNSNPSTAGFALVERRKARGTGTYIPHLKRNSSR-ERPSQAR-RNQLQSNRCQVQKNT 592 V+ P A E++K RGTGTYIP++ +SSR +R S R R Q Q+N Q+ K Sbjct: 618 CVNPGGPFRAATDSEEKKKRRGTGTYIPNMSYHSSRGDRLSLGRGRTQPQANHGQLHKYA 677 Query: 591 SNNIVDTTSPKTLDANSIRINAFEGGDHQLXXXXXXXXXXXXQDLHSQPI---NSHGNGT 421 N + P TL ++ + + + + HS P +S+ NG+ Sbjct: 678 HENGL----PTTLQEKNLSEHGHDLSEAEYPHLGNGKPVPLEAH-HSYPSVWGSSNANGS 732 Query: 420 TIPKWELKFGSFG-SLAEGDCSLGLSDTIA----GTSVSSPASPAGQTSKEVKGNHGRDA 256 + GS G EG S SD + GTS +SP + + + ++ R Sbjct: 733 SRAFVRTDCGSRGFQHPEGPPS--TSDLVVLSCPGTSATSPVASTAKDLEILENEQERAL 790 Query: 255 ENFLNLENDDDFPPLS 208 +L+++ FPPL+ Sbjct: 791 LQQYHLKDNVHFPPLT 806 >EOY34688.1 NT domain of poly(A) polymerase and terminal uridylyl transferase-containing protein, putative isoform 2 [Theobroma cacao] Length = 836 Score = 530 bits (1366), Expect = e-172 Identities = 321/693 (46%), Positives = 421/693 (60%), Gaps = 53/693 (7%) Frame = -2 Query: 2505 PTAIGEDVWETAEQVAWDEVLACIHPTYDSEEKRKDVIDYVHRLIRRSIGVEVFPYGSVP 2326 P +I + W++AE+ A ++ + PT D++ KRK++++YV RLI+ +G +VFPYGSVP Sbjct: 39 PCSIARESWDSAEETA-RRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVP 97 Query: 2325 LKTYLPDGDIDLTALSTPKSDDSLCRDVLAVMHAEQLNANAEFEVRDTQFIDAEVKLVKC 2146 LKTYLPDGDIDLT LS+P +D+L DV A++ E+ N A + 
V+D IDAEVKLVKC Sbjct: 98 LKTYLPDGDIDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKC 157 Query: 2145 LVDNIVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKHSIILIKAWCYYESRILGAHHGL 1966 LV +IV+DISFNQLGGL TLCFLEQ+DRL+GKDHLFK SIILIKAWCYYESRILGAHHGL Sbjct: 158 LVQDIVVDISFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGL 217 Query: 1965 ISTYALESLILYIFHMFNSTLNGPLAALHRFLVYYSKFDWENYSISLNGPVCTSYLPDLV 1786 ISTYALE+L+LYIFH+F+S+L GP+A L+RFL Y+SKFDWENY ISLNGPVC S LPD+V Sbjct: 218 ISTYALETLVLYIFHLFHSSLTGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIV 277 Query: 1785 VD-NXXXXXXXXXXXEFLRNCVDMFAVSPKASESNMRAFIKKNLNIIDPLKENNNLGRSV 1609 + EFLR C++MF+V K E+N R F K+LNIIDPLKENNNLGRSV Sbjct: 278 AEVPENVGNNPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSV 337 Query: 1608 NRGNYFRIRSAFKFGARKLGNILNLPRDRLAEEIKNFFENTLERHGRQNASHISGDELAR 1429 NRGNY+RIRSAFK+GA KL IL LPR+R+ +E+ FF NTLERHG ++H++G + Sbjct: 338 NRGNYYRIRSAFKYGAHKLEQILILPRERIPDELVKFFANTLERHG---SNHLTG--MQN 392 Query: 1428 LPFT--VYSYDDLH----LNQFNGNFDHHMSGLEDNIRSASRNEPGWYANS--------- 1294 LP T YD + + +GN+ + N+ S++ G A S Sbjct: 393 LPSTSDARGYDHVMPSPCASMCSGNY---LFAKSINVGSSNNRMSGSIAASGSRYKLGCP 449 Query: 1293 --TVSSQMVVQTCYTPEGTVSFVDRFEKDYDDLVMDRALRARNTNVEPDFSLPGSDNIDS 1120 ++SQ+V + + D + V+ L ++ N D S P S N+ + Sbjct: 450 FDVLTSQVVPEKKANVNRNAVSGNCHPGDAKEFVLSGLLAMKSENDSSD-SFPPSSNLGA 508 Query: 1119 --FVTNFNAGSFGDLAI-SSPEDTFSDSISVD--------------------LRKKILDG 1009 V G + I +S + T +DSI+ D + K+ L G Sbjct: 509 SLSVKPRTCRQMGMVEIGNSFKSTLTDSIAADDMSFALKPYSKNDTLAASNVVCKRELAG 568 Query: 1008 NSDDTESL-NLADLTGDYDSHIRSLLYGQCCHGIALSSPAKCSTLLSSPTQLQNKPWDTV 832 D+ESL +L DLTGDYD SLLYGQ CH ++SSP SP W+T+ Sbjct: 569 IFGDSESLKSLLDLTGDYDGQFWSLLYGQYCHLFSVSSPV-------SPHLQNENHWETI 621 Query: 831 RQNLPVIWKMNS---------SDAAFGHPPYAVDNSNPSTAGFALVERRKARGTGTYIPH 679 Q++P+ + S S F PP AV + S E +K RGTGTYIP Sbjct: 622 EQSIPLKQDLYSQRDSNGILGSQFCFSKPPVAVHTALDS-------EDKKKRGTGTYIPS 674 Query: 678 LKRNSSRERPSQARRNQLQSNRC--QVQKNTSN 586 +K S+RER S R Q++R Q+Q+ T+N Sbjct: 675 IKYRSNRERHSSG-RGIFQASRAYSQLQRYTNN 706