BLASTX nr result

ID: Catharanthus23_contig00010269 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00010269
         (2367 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282310.2| PREDICTED: uncharacterized protein LOC100266...   629   e-177
ref|XP_004238182.1| PREDICTED: uncharacterized protein LOC101262...   627   e-177
ref|XP_006354919.1| PREDICTED: uncharacterized protein LOC102581...   623   e-175
gb|EOY14571.1| Uncharacterized protein isoform 1 [Theobroma cacao]    593   e-166
ref|XP_002510268.1| transcription factor, putative [Ricinus comm...   585   e-164
ref|XP_006374991.1| hypothetical protein POPTR_0014s03410g [Popu...   580   e-162
ref|XP_006374990.1| hypothetical protein POPTR_0014s03410g [Popu...   578   e-162
gb|EMJ24107.1| hypothetical protein PRUPE_ppa003634mg [Prunus pe...   577   e-162
ref|XP_006473370.1| PREDICTED: uncharacterized protein LOC102623...   577   e-161
ref|XP_004291137.1| PREDICTED: uncharacterized protein LOC101314...   550   e-153
gb|EOY14573.1| Uncharacterized protein isoform 3 [Theobroma cacao]    550   e-153
ref|XP_006473371.1| PREDICTED: uncharacterized protein LOC102623...   541   e-151
ref|XP_004141244.1| PREDICTED: uncharacterized protein LOC101211...   540   e-151
gb|EOY14572.1| Uncharacterized protein isoform 2 [Theobroma cacao]    539   e-150
gb|EXB75617.1| hypothetical protein L484_026093 [Morus notabilis]     536   e-149
ref|XP_006578008.1| PREDICTED: uncharacterized protein LOC100807...   499   e-138
ref|XP_006581234.1| PREDICTED: uncharacterized protein LOC100818...   495   e-137
gb|EOX91357.1| Uncharacterized protein isoform 1 [Theobroma caca...   495   e-137
ref|XP_002327110.1| predicted protein [Populus trichocarpa]           493   e-136
ref|XP_006374994.1| hypothetical protein POPTR_0014s03410g [Popu...   493   e-136

>ref|XP_002282310.2| PREDICTED: uncharacterized protein LOC100266128 [Vitis vinifera]
            gi|302142367|emb|CBI19570.3| unnamed protein product
            [Vitis vinifera]
          Length = 573

 Score =  629 bits (1622), Expect = e-177
 Identities = 331/573 (57%), Positives = 408/573 (71%), Gaps = 5/573 (0%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MNTR RT  LQ+++AP+K  K+KVE+QG   M++ +A  NRRG  R+RK ALQQDVDKLK
Sbjct: 1    MNTRVRT-KLQNMKAPMKHDKDKVEMQGTRGMDAKRATANRRGPSRDRKMALQQDVDKLK 59

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            +KLRHEENVHRALERAF                                       V+FR
Sbjct: 60   KKLRHEENVHRALERAFNRPLGALPRLPPYLPPCTLELLAEVAILEEEVVRLEEQVVHFR 119

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            QGLYQEAVYISSSK+N+++ A+  + Y + N K+ Q+K  ++  D+   S+ R       
Sbjct: 120  QGLYQEAVYISSSKKNMESLADLYNPYLMRNSKKDQTKFLVQTVDNSATSATRDAPSPPA 179

Query: 1425 NGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLD---MDQ 1255
            +  GKEN+ ++NSTK+ ++    KAQ + TP KR P E+   +KHLD +KLQL+   +DQ
Sbjct: 180  DRRGKENQSYANSTKNNKRDPNNKAQKISTPVKRPPIEHGSAEKHLDSQKLQLENRVVDQ 239

Query: 1254 DS-KGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXX 1078
            ++ + RT    +E+ S DD PNKISE +L+CL SIFLRMS  ++R  ++           
Sbjct: 240  ENAETRTSLTPDERLSADDKPNKISEDILRCLFSIFLRMSTLKSRGTSENLPSLPSLASH 299

Query: 1077 XXED-ADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXX 901
               +  + +DPYGI +EFGKRDIGPYKHLF +++SSIN NRT  S               
Sbjct: 300  GSGEETELQDPYGICSEFGKRDIGPYKHLFSIQASSINLNRTANSLFLVHRLKRLLGKLA 359

Query: 900  XXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAIT 721
                +GLTHQEKLAFWIN YNSCMMNA+LEHGIP +PEMVV LM+KATINVGGH LNAIT
Sbjct: 360  SVNLQGLTHQEKLAFWINTYNSCMMNAFLEHGIPGNPEMVVELMRKATINVGGHLLNAIT 419

Query: 720  IEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTAS 541
            IEHFILRLPYH KYTF KGAKNDEM+ RS++GLELSEPLVTFALSCGSWSSPAVR+YTAS
Sbjct: 420  IEHFILRLPYHIKYTFPKGAKNDEMTARSIYGLELSEPLVTFALSCGSWSSPAVRVYTAS 479

Query: 540  QVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELG 361
            QVENELE+AK+EYLQAA+GISTTK + AIPKLLDWY+LDFAKD +SF+DWICLQLPSELG
Sbjct: 480  QVENELEVAKREYLQAAVGISTTK-LFAIPKLLDWYLLDFAKDFESFLDWICLQLPSELG 538

Query: 360  KEAINCLERDRNEPLSQFLEIMPYEFNFRYLLH 262
            KEAI CLER  +EPLSQF++++PYEF+FRYLLH
Sbjct: 539  KEAIKCLERGNSEPLSQFVQVIPYEFSFRYLLH 571


>ref|XP_004238182.1| PREDICTED: uncharacterized protein LOC101262306 [Solanum
            lycopersicum]
          Length = 572

 Score =  627 bits (1616), Expect = e-177
 Identities = 332/573 (57%), Positives = 405/573 (70%), Gaps = 5/573 (0%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MN R RT+ LQS++ P K  KEKVE+QG   M++ K   NRR  IRERK AL QDVDKLK
Sbjct: 1    MNARVRTS-LQSMKTPSKNVKEKVEMQGNRKMSTDKTPINRRKAIRERKMALLQDVDKLK 59

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            +KLRHEENVHRALERAF                                       V+FR
Sbjct: 60   KKLRHEENVHRALERAFTRPLGALPRLPPYLPPNTLELLAEVAVLEEEVVRLEEKVVHFR 119

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            QGLY EAVYISSSKRN+DN  +T +Q Q+ + KQKQ+KLS + E +  + SGRHL  L D
Sbjct: 120  QGLYHEAVYISSSKRNMDNVTDTVEQNQVKSPKQKQTKLSPQLESNSASFSGRHLPSLSD 179

Query: 1425 NGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDMDQDSK 1246
            +   KEN   S STKSK +S   K +  +TP K+ P E R  +K +DP+KLQL+      
Sbjct: 180  DSCLKENHSLS-STKSKHRSVNAKVKTARTPVKKLPAENRLAEKRVDPQKLQLEDQVMYH 238

Query: 1245 GRTPN----IGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXX 1078
            G          + + S D++PN ISE++LKCL +IFLRMS+++ R+ AD           
Sbjct: 239  GSLEERIFVTQDRKPSPDESPNTISENILKCLSNIFLRMSSRKGRTTADTLPSLTGYNSC 298

Query: 1077 XXEDA-DFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXX 901
               +  +F DPYGI ++F +RDIGPYKHL+ VE+SS+NPNRTTIS               
Sbjct: 299  ESIEKKEFGDPYGICSKFERRDIGPYKHLYAVEASSVNPNRTTISVFLVRRLKLLLEKLA 358

Query: 900  XXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAIT 721
                +GL+HQEKLAFWINIYNSCMMNA+LE+G+PE+PEMVVALMQKATINV GH LNAIT
Sbjct: 359  SANLQGLSHQEKLAFWINIYNSCMMNAFLEYGLPENPEMVVALMQKATINVSGHLLNAIT 418

Query: 720  IEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTAS 541
            IEHFILRLPYHSK+TF KG KNDEM+ RS+FGLELSEPLVTFALSCGS+SSPAVR+YTA+
Sbjct: 419  IEHFILRLPYHSKFTFAKGVKNDEMTARSIFGLELSEPLVTFALSCGSFSSPAVRVYTAA 478

Query: 540  QVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELG 361
             +ENEL++AKKEYLQA++G+ST+K++VAIPKLLDWY+LDFAKDL+S +DWICLQLP+E G
Sbjct: 479  NIENELQVAKKEYLQASVGVSTSKKLVAIPKLLDWYLLDFAKDLESLLDWICLQLPNEHG 538

Query: 360  KEAINCLERDRNEPLSQFLEIMPYEFNFRYLLH 262
            KEAINCLER  NEPLS  L+I+PYEF+FRYLLH
Sbjct: 539  KEAINCLERKNNEPLSNVLQIVPYEFSFRYLLH 571


>ref|XP_006354919.1| PREDICTED: uncharacterized protein LOC102581774 [Solanum tuberosum]
          Length = 570

 Score =  623 bits (1606), Expect = e-175
 Identities = 331/571 (57%), Positives = 405/571 (70%), Gaps = 3/571 (0%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MN R RT+ LQS++ P K  KEKVE+QG   M++ K   NRR  IRERK AL QDVDKLK
Sbjct: 1    MNARVRTS-LQSMKTPSKNVKEKVEMQGNRKMSTEKTPINRRKAIRERKMALLQDVDKLK 59

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            +KLRHEENVHRALERAF                                       V+FR
Sbjct: 60   KKLRHEENVHRALERAFTRPLGALPRLPPYLPPNTLELLAEVAVLEEEVVRLEEKVVHFR 119

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            QGLY EAVYISSSKRN+DN  +T +Q Q+ + KQKQ+KLS + E +  + SGRHL  L D
Sbjct: 120  QGLYHEAVYISSSKRNMDNVTDTIEQNQVKSPKQKQTKLSPQLESNSASFSGRHLPSLSD 179

Query: 1425 NGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDMDQDS- 1249
            +   KEN   S STKSK +S   K + ++TP K+ P E R  +K +DP+KLQ  +     
Sbjct: 180  DSCLKENHSLS-STKSKHRSVNAKVKTVRTPVKKLPAENRLAEKRVDPQKLQDQVMYHGS 238

Query: 1248 -KGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXXXX 1072
             + R     + + S D++PN ISE++LKCL +IFLRMS+++ R+ AD             
Sbjct: 239  LEERIFVTQDRKPSPDESPNTISENILKCLSNIFLRMSSRKGRTTADTLPSLTGYNSCES 298

Query: 1071 EDA-DFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXXXX 895
             +  +F DPYGI ++F +RDIGPYKHL+ VE+SS+NPNRTTIS                 
Sbjct: 299  IEKKEFGDPYGICSKFERRDIGPYKHLYAVEASSVNPNRTTISVFLVRRLKLLLEKLASA 358

Query: 894  XXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAITIE 715
              +GL+HQEKLAFWINIYNSCMMNA+LE+G+PE+PEMVVALMQKATI V GH LNAITIE
Sbjct: 359  NLQGLSHQEKLAFWINIYNSCMMNAFLEYGLPENPEMVVALMQKATIKVSGHLLNAITIE 418

Query: 714  HFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTASQV 535
            HFILRLPYHSK+TF KG KNDEM+ RSVFGLELSEPLVTFALSCGS+SSPAVR+YTA+ +
Sbjct: 419  HFILRLPYHSKFTFAKGVKNDEMTARSVFGLELSEPLVTFALSCGSFSSPAVRVYTAANI 478

Query: 534  ENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELGKE 355
            ENEL++AKKEYLQA++G+ST+K++VAIPKLLDWY+LDFAKDL+S +DWICLQLP+E GKE
Sbjct: 479  ENELQVAKKEYLQASVGVSTSKKLVAIPKLLDWYLLDFAKDLESLLDWICLQLPNEHGKE 538

Query: 354  AINCLERDRNEPLSQFLEIMPYEFNFRYLLH 262
            AINCLER  NEPLS  L+I+PYEF+FRYLLH
Sbjct: 539  AINCLERKNNEPLSNVLQIVPYEFSFRYLLH 569


>gb|EOY14571.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 566

 Score =  593 bits (1528), Expect = e-166
 Identities = 320/574 (55%), Positives = 393/574 (68%), Gaps = 5/574 (0%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MNTR R A+LQS + P K +KEKV++Q      + KA  NRR + +ERK  LQQDVDKLK
Sbjct: 1    MNTRVR-ASLQSRKVPGKHEKEKVDMQETKPTVATKAMKNRRASSKERKMVLQQDVDKLK 59

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            +KLR EEN+HRALERAF                                       V+FR
Sbjct: 60   KKLRQEENIHRALERAFNRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVRLEEKVVHFR 119

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            Q LYQEAVYISSSKRN+DNSA+ C+     + K +Q K+          S  RHL    D
Sbjct: 120  QDLYQEAVYISSSKRNMDNSADLCEPSLDKSPKPEQPKILTRD-----TSMARHLQSFSD 174

Query: 1425 NGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDMDQDSK 1246
            +G GKEN+  +NSTKS + S   K+Q+++TP +R   + +P +K +DP+KLQL+     +
Sbjct: 175  DGRGKENQSCTNSTKSNKGSLVHKSQSVRTPVERPLIDSKPAEKRIDPQKLQLECRIRDQ 234

Query: 1245 GRTP----NIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXX 1078
            G T     +  +E+  GDD PNK+SE ++KCL SIFLRMS+ + +S A+           
Sbjct: 235  GNTEARIISTPDERRLGDDEPNKVSEELVKCLSSIFLRMSSTKRKSTAEGSPSLSMLGSQ 294

Query: 1077 XXED-ADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXX 901
               +  +F+DPYG  + FG+RDIGPYK+LF +++ SINPNRT+ S               
Sbjct: 295  ESSEETEFRDPYGTCSNFGRRDIGPYKNLFSIDAGSINPNRTSKSLFLLRRLKLLLERLA 354

Query: 900  XXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAIT 721
                  L HQEKLAFWINIYNSCMMNA+LEHG+P+SP+MVV LM+KATINVGG  LNAIT
Sbjct: 355  SSNLLNLNHQEKLAFWINIYNSCMMNAFLEHGVPDSPKMVVELMRKATINVGGRLLNAIT 414

Query: 720  IEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTAS 541
            IEHFILRLPYHSK+ F KG KNDEM+ RS+FGLELSEPLVTFALSCGSWSSPAVR+YTAS
Sbjct: 415  IEHFILRLPYHSKFIFSKGVKNDEMTARSIFGLELSEPLVTFALSCGSWSSPAVRVYTAS 474

Query: 540  QVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELG 361
            QVENELE+AK+EYLQAA+GIS+TK   AIPKLLDWY+LDFAKDLDS +DWICLQLPSELG
Sbjct: 475  QVENELEVAKREYLQAAVGISSTK--FAIPKLLDWYLLDFAKDLDSLLDWICLQLPSELG 532

Query: 360  KEAINCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
            KEAI  LER ++E LSQF++IMPYEF+FRYLL T
Sbjct: 533  KEAIKYLERAKSESLSQFVQIMPYEFSFRYLLCT 566


>ref|XP_002510268.1| transcription factor, putative [Ricinus communis]
            gi|223550969|gb|EEF52455.1| transcription factor,
            putative [Ricinus communis]
          Length = 533

 Score =  585 bits (1508), Expect = e-164
 Identities = 313/550 (56%), Positives = 373/550 (67%), Gaps = 1/550 (0%)
 Frame = -1

Query: 1905 KEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLKQKLRHEENVHRALERAFXXX 1726
            KEK+E++G    N+ KA   R+ + RERK +LQQDVDKLK+KLR+EENVHRALERAF   
Sbjct: 2    KEKIEIKGNKQRNATKAAKTRQASSRERKISLQQDVDKLKKKLRYEENVHRALERAFNRP 61

Query: 1725 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFRQGLYQEAVYISSSKRNVDNS 1546
                                                V+FRQ LYQEAVYISSSKRNV++ 
Sbjct: 62   LGALPRLPPYLPASTLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVYISSSKRNVESF 121

Query: 1545 AETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYDNGLGKENEVFSNSTKSKQKS 1366
            A+  D  Q NN KQ   K      D                G  KEN++ +NS K+ + S
Sbjct: 122  ADLYDLSQNNNSKQANIKTIARNID----------------GQEKENQLCTNSVKNNKSS 165

Query: 1365 SGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDMDQDSKGRTPNIGNEQTSGDDNPNKI 1186
            S  KAQ  KTP K+ P E + ++K LDP+KLQ+  +   + R  +  +E  S +DNPNKI
Sbjct: 166  SIHKAQPGKTPMKKHPIENKQIEKCLDPQKLQVSQENPKEARNVSTADEHLSANDNPNKI 225

Query: 1185 SESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXXXXEDA-DFKDPYGILTEFGKRDIG 1009
            SE ++KCL +IFLRMS+++TR  AD              +  + +DPY I +E GK+DIG
Sbjct: 226  SEDIVKCLSNIFLRMSSRKTRRTADNLSFLSSLVSQENGEEIECRDPYSICSEVGKKDIG 285

Query: 1008 PYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXXXXXXKGLTHQEKLAFWINIYNSCM 829
            PYKHLF +E+ +INPNRT+ S                   + LTHQEKLAFWINIYNSCM
Sbjct: 286  PYKHLFAIEAGTINPNRTSNSLFLLHRLKLLLGKLASVNLQNLTHQEKLAFWINIYNSCM 345

Query: 828  MNAYLEHGIPESPEMVVALMQKATINVGGHFLNAITIEHFILRLPYHSKYTFVKGAKNDE 649
            MNA+LEHGIPESPEMVVALMQKATINVGGH LNAITIEHFILRLPYH KY F KG KNDE
Sbjct: 346  MNAFLEHGIPESPEMVVALMQKATINVGGHSLNAITIEHFILRLPYHLKYAFSKGTKNDE 405

Query: 648  MSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTASQVENELEMAKKEYLQAAIGISTTK 469
            M+ RS FGLELSEPLVTFALSCGSWSSPAVR+YTAS+VENEL+ AK+EYLQAA+G ST K
Sbjct: 406  MTARSKFGLELSEPLVTFALSCGSWSSPAVRVYTASEVENELDAAKREYLQAAVGFSTRK 465

Query: 468  RVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELGKEAINCLERDRNEPLSQFLEIMPY 289
               AIPKLLDWY+LDFAKDL+S +DWICLQLPSELGKEAI CLER ++EP SQF++IMPY
Sbjct: 466  --FAIPKLLDWYLLDFAKDLESLLDWICLQLPSELGKEAIKCLERGKSEPHSQFVQIMPY 523

Query: 288  EFNFRYLLHT 259
            EF+FRYLL T
Sbjct: 524  EFSFRYLLCT 533


>ref|XP_006374991.1| hypothetical protein POPTR_0014s03410g [Populus trichocarpa]
            gi|566202237|ref|XP_006374992.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
            gi|550323305|gb|ERP52788.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
            gi|550323306|gb|ERP52789.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
          Length = 573

 Score =  580 bits (1495), Expect = e-162
 Identities = 319/575 (55%), Positives = 390/575 (67%), Gaps = 6/575 (1%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MNTR RT  L S++AP+K +KEKV +QG     + KA   R+ + RERK ALQ+DVDKLK
Sbjct: 1    MNTRVRT-RLHSMKAPMKHEKEKVGMQGSKPNVAKKAANKRQASSRERKIALQEDVDKLK 59

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            ++LRHEEN+ RALERAF                                       V+FR
Sbjct: 60   KQLRHEENIRRALERAFSRPLGALPRLPPYLPRTTLELLAEVAVLEEEVVRLEEQVVHFR 119

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            Q LYQEAVY+SSSKRNV++ ++    Y   N K  QSK   +  D+   S+ RHL  L  
Sbjct: 120  QDLYQEAVYMSSSKRNVESVSDLYHLYPNKNPKPDQSKSLAQNVDESATSTIRHLPSLSA 179

Query: 1425 NGLGKENEVFS-NSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDM---- 1261
            +G GKEN   + NS K+ + SS  KAQ  +   KR   + RP +K LD  K QL+     
Sbjct: 180  DGTGKENAFSTANSRKNSKGSSINKAQTSRNMVKRPSEDNRPAEKKLDSHKSQLECRVPD 239

Query: 1260 DQDSKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXX 1081
             ++++ R+    +E  +GD +PNK+SE +LKCL SIFLRMS+   R  AD          
Sbjct: 240  QENAEARSHVTASEGVTGDASPNKLSEDILKCLSSIFLRMSSMNNRRTADNLSFLSTLVS 299

Query: 1080 XXXED-ADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXX 904
               E+ A+ +DPYGI +EFGKRDIGPYK LF +ES +INPNRT+ S              
Sbjct: 300  QENEEEAECQDPYGICSEFGKRDIGPYKRLFSIESGTINPNRTSNSLFLLHRLELLFGKL 359

Query: 903  XXXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAI 724
                 + LTHQ+KLAFWINIYNSCMMNA+LEHGIPESPE VV LM+KATIN+GGH LNAI
Sbjct: 360  ASVNLQNLTHQKKLAFWINIYNSCMMNAFLEHGIPESPETVVELMRKATINIGGHLLNAI 419

Query: 723  TIEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTA 544
            TIEHFILRLPY+SKYT  KGAKNDEM+ R+ FGLELSEPLV+FAL CGSWSSPAVR+YTA
Sbjct: 420  TIEHFILRLPYYSKYTISKGAKNDEMAARNKFGLELSEPLVSFALCCGSWSSPAVRVYTA 479

Query: 543  SQVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSEL 364
            +QVENELE AK++YLQAAIGI+T+K   AIPKLLDWY+LDFAKDL+S +DWICLQLPSEL
Sbjct: 480  AQVENELEEAKRDYLQAAIGITTSK--FAIPKLLDWYLLDFAKDLESLLDWICLQLPSEL 537

Query: 363  GKEAINCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
            GKEAINCLE+ +NEP S F+++MPYEF FRYLL+T
Sbjct: 538  GKEAINCLEKGKNEPHSHFVQVMPYEFGFRYLLYT 572


>ref|XP_006374990.1| hypothetical protein POPTR_0014s03410g [Populus trichocarpa]
            gi|550323304|gb|ERP52787.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
          Length = 572

 Score =  578 bits (1490), Expect = e-162
 Identities = 320/575 (55%), Positives = 390/575 (67%), Gaps = 6/575 (1%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MNTR RT  L S++AP+K +KEKV +QG     + KA   R+ + RERK ALQ+DVDKLK
Sbjct: 1    MNTRVRT-RLHSMKAPMKHEKEKVGMQGSKPNVAKKAANKRQASSRERKIALQEDVDKLK 59

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            ++LRHEEN+ RALERAF                                       V+FR
Sbjct: 60   KQLRHEENIRRALERAFSRPLGALPRLPPYLPRTTLELLAEVAVLEEEVVRLEEQVVHFR 119

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            Q LYQEAVY+SSSKRNV++ ++    Y   N K  QSK   +  D+   S+ RHL  L D
Sbjct: 120  QDLYQEAVYMSSSKRNVESVSDLYHLYPNKNPKPDQSKSLAQNVDESATSTIRHLPSLSD 179

Query: 1425 NGLGKENEVFS-NSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDM---- 1261
             G GKEN   + NS K+ + SS  KAQ  +   KR   + RP +K LD  K QL+     
Sbjct: 180  -GTGKENAFSTANSRKNSKGSSINKAQTSRNMVKRPSEDNRPAEKKLDSHKSQLECRVPD 238

Query: 1260 DQDSKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXX 1081
             ++++ R+    +E  +GD +PNK+SE +LKCL SIFLRMS+   R  AD          
Sbjct: 239  QENAEARSHVTASEGVTGDASPNKLSEDILKCLSSIFLRMSSMNNRRTADNLSFLSTLVS 298

Query: 1080 XXXED-ADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXX 904
               E+ A+ +DPYGI +EFGKRDIGPYK LF +ES +INPNRT+ S              
Sbjct: 299  QENEEEAECQDPYGICSEFGKRDIGPYKRLFSIESGTINPNRTSNSLFLLHRLELLFGKL 358

Query: 903  XXXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAI 724
                 + LTHQ+KLAFWINIYNSCMMNA+LEHGIPESPE VV LM+KATIN+GGH LNAI
Sbjct: 359  ASVNLQNLTHQKKLAFWINIYNSCMMNAFLEHGIPESPETVVELMRKATINIGGHLLNAI 418

Query: 723  TIEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTA 544
            TIEHFILRLPY+SKYT  KGAKNDEM+ R+ FGLELSEPLV+FAL CGSWSSPAVR+YTA
Sbjct: 419  TIEHFILRLPYYSKYTISKGAKNDEMAARNKFGLELSEPLVSFALCCGSWSSPAVRVYTA 478

Query: 543  SQVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSEL 364
            +QVENELE AK++YLQAAIGI+T+K   AIPKLLDWY+LDFAKDL+S +DWICLQLPSEL
Sbjct: 479  AQVENELEEAKRDYLQAAIGITTSK--FAIPKLLDWYLLDFAKDLESLLDWICLQLPSEL 536

Query: 363  GKEAINCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
            GKEAINCLE+ +NEP S F+++MPYEF FRYLL+T
Sbjct: 537  GKEAINCLEKGKNEPHSHFVQVMPYEFGFRYLLYT 571


>gb|EMJ24107.1| hypothetical protein PRUPE_ppa003634mg [Prunus persica]
          Length = 560

 Score =  577 bits (1487), Expect = e-162
 Identities = 312/572 (54%), Positives = 390/572 (68%), Gaps = 3/572 (0%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MNTR +T  LQS++ PLK +KEKV++QG    ++ K  TN RG+ RERK ALQQDVDKLK
Sbjct: 1    MNTRVKT-RLQSMKPPLKNEKEKVKMQGSKLRDTAKTITNGRGSRRERKLALQQDVDKLK 59

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            +KLRHEENVHRAL+RAF                                       V+FR
Sbjct: 60   KKLRHEENVHRALQRAFTRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQLVHFR 119

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            + LYQEAV IS+SKR ++ SA+ CD Y I N KQ+Q K   +  +    ++ +H     D
Sbjct: 120  KDLYQEAVNISTSKRKMETSADLCDSYPIKNPKQEQPKSQAQKANKSTNATEKHWPSPSD 179

Query: 1425 NGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLD---MDQ 1255
            +  GKEN+  +NSTK   + S +    ++TP KR P +++  +K  DP+KLQL+   MDQ
Sbjct: 180  DKQGKENQSSNNSTKKNNEKSLIHKAQVRTPVKRPPIDHKTAEKRSDPQKLQLEYRVMDQ 239

Query: 1254 DSKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXXX 1075
            +S      + ++  SGD++PNKISE++LKCL SI +RMS+ +  S               
Sbjct: 240  ESA----QVPDKAMSGDESPNKISENILKCLSSILMRMSSAKG-SAESLPSFSTLAAQEN 294

Query: 1074 XEDADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXXXX 895
             E  +  DPY I +EFG+RDIGPYK L  +E+ +INPN+T  +                 
Sbjct: 295  NEPKESWDPYAICSEFGRRDIGPYKQLHAIEAETINPNQTANALFLLRRLKLLLRKLASV 354

Query: 894  XXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAITIE 715
              + L+HQEKLAFWINIYNSCMMNA+LEHGIPESPE++VALMQKATINVGGH LNAITIE
Sbjct: 355  NLQHLSHQEKLAFWINIYNSCMMNAFLEHGIPESPEIIVALMQKATINVGGHLLNAITIE 414

Query: 714  HFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTASQV 535
            HFILRLPYHSKY    G KNDE + RS+FGLELSEPLVTFALSCGSWSSPAVR+YTASQV
Sbjct: 415  HFILRLPYHSKY----GTKNDEKTARSIFGLELSEPLVTFALSCGSWSSPAVRVYTASQV 470

Query: 534  ENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELGKE 355
            ENELE+AK+EYLQAA+GIS+TK   AIPKLL+WY+ DFAKD++S +DWICLQLPSELGK+
Sbjct: 471  ENELEVAKREYLQAAVGISSTK--FAIPKLLNWYLPDFAKDIESLLDWICLQLPSELGKD 528

Query: 354  AINCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
            AI  LER +NEPLSQ ++I+PYEF+FRYLL T
Sbjct: 529  AIKLLERGKNEPLSQVVQIIPYEFSFRYLLST 560


>ref|XP_006473370.1| PREDICTED: uncharacterized protein LOC102623920 isoform X1 [Citrus
            sinensis]
          Length = 573

 Score =  577 bits (1486), Expect = e-161
 Identities = 316/575 (54%), Positives = 390/575 (67%), Gaps = 7/575 (1%)
 Frame = -1

Query: 1965 MNTRERTA-NLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKL 1789
            M+ + RT+   QS  APL   KEKVE+       + KA  +RR +  +RK ALQQDVDKL
Sbjct: 1    MSRKLRTSLQCQSFEAPLNKDKEKVEMPITKVAGARKATASRRASNAQRKYALQQDVDKL 60

Query: 1788 KQKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNF 1609
            K+KLRHEENVHRALERAF                                       V+F
Sbjct: 61   KKKLRHEENVHRALERAFSRPLGALPRLPPYLPPSTKELLAEVAVLEEEVVRLEEQVVHF 120

Query: 1608 RQGLYQEAVYISSSKRNVDNSAETCDQ-YQINNRKQKQSKLSLEAEDDLIASSGRHLALL 1432
            RQ LY+EAVYISSSK+N+++S + CD      N KQ+QSK           S+ R LA L
Sbjct: 121  RQDLYREAVYISSSKKNMESSIDLCDPCVDDTNSKQEQSKFLARNVGRSTTSAIRQLAAL 180

Query: 1431 YDNGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQL----D 1264
              +G GKEN++ +NS K K+ SS  K Q  +TP KR   + +   +HLDP+K+QL     
Sbjct: 181  SADGRGKENQLCTNSMK-KKGSSVHKVQTGRTPVKRPSNDCKQTMRHLDPQKIQLVCRLQ 239

Query: 1263 MDQDSKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSA-KRTRSMADXXXXXXXX 1087
              ++   RT ++ +E+ SGDD PN+ISE +++CL +I LRMS+ KR  +  +        
Sbjct: 240  NPENEGARTISVTDERESGDDGPNRISEDIVRCLSTILLRMSSGKRKGTSENLHFLSTLA 299

Query: 1086 XXXXXEDADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXX 907
                 E+ + +DPYGI  +FGKRDIGPYKHL  +E+ SI+ NRT+ S             
Sbjct: 300  SEESNEETESQDPYGICLQFGKRDIGPYKHLLAIEADSIDTNRTSSSMFLVRRLKILLGK 359

Query: 906  XXXXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNA 727
                  + L HQEKLAFWINIYNSCMMNA+LE+GIPESPEMVVALMQKATI VGGH LNA
Sbjct: 360  IASVNLENLNHQEKLAFWINIYNSCMMNAFLENGIPESPEMVVALMQKATIRVGGHLLNA 419

Query: 726  ITIEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYT 547
            ITIEHFILRLPYHSKYTF KGAKNDEM+ R +FGLELSEPLVTFALSCGSWSSPAVR+YT
Sbjct: 420  ITIEHFILRLPYHSKYTFSKGAKNDEMTARFMFGLELSEPLVTFALSCGSWSSPAVRVYT 479

Query: 546  ASQVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSE 367
            AS+VE+ELE+AK+EYLQAA+GIS+ K  +AIPKLLDWY+LDFAKD +S +DWICLQ+P E
Sbjct: 480  ASEVESELEVAKREYLQAAVGISSEK--LAIPKLLDWYLLDFAKDFESLLDWICLQVPCE 537

Query: 366  LGKEAINCLERDRNEPLSQFLEIMPYEFNFRYLLH 262
            LGK+AI CLER +NEPLSQF+++MPYEF+FRYLLH
Sbjct: 538  LGKKAIKCLERGKNEPLSQFIQVMPYEFSFRYLLH 572


>ref|XP_004291137.1| PREDICTED: uncharacterized protein LOC101314149 [Fragaria vesca
            subsp. vesca]
          Length = 563

 Score =  550 bits (1417), Expect = e-153
 Identities = 309/576 (53%), Positives = 380/576 (65%), Gaps = 7/576 (1%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKV---ELQGKNTMNSVKAKTNRRGTIRERKKALQQDVD 1795
            MNTR RT  LQS++AP K  ++ +   E+QG    N  KA +  + + RERK ALQQDVD
Sbjct: 1    MNTRVRT-KLQSMKAPTKKNEKDIKVEEMQGSKERNITKAISLGKASRRERKLALQQDVD 59

Query: 1794 KLKQKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXV 1615
            KLK+KLRHEENVHRAL+RAF                                       V
Sbjct: 60   KLKKKLRHEENVHRALQRAFNRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVV 119

Query: 1614 NFRQGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLAL 1435
            +FR+ LYQEAV IS+SKR+++ SAE CD     N K   S +  +       S   H   
Sbjct: 120  HFRKDLYQEAVNISTSKRSLETSAELCDSNPKKNHKFHGSIVDTDTAVKHSPSPSDHK-- 177

Query: 1434 LYDNGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDM-- 1261
                   +EN+  + S K+ +KS       ++TP+KR P + +   K LD  KLQL++  
Sbjct: 178  -------QENQSCNPSMKNNKKSLIHSNAQVRTPAKRPPVDPKIAQKRLDSPKLQLEVRV 230

Query: 1260 --DQDSKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXX 1087
               + ++ R  +I  ++ SGDD+PNKISE++LKCL SIF+RMS+ +  +  +        
Sbjct: 231  TEQESTEARLSSIPEKKPSGDDSPNKISENILKCLSSIFMRMSSAKGIT-ENQPSFSTLG 289

Query: 1086 XXXXXEDADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXX 907
                 E  +F DPYGI +EFG+RDIGPYK L  VE+ SINPNRT  S             
Sbjct: 290  IQQSNEKPEFWDPYGICSEFGRRDIGPYKQLHAVEARSINPNRTASSLFLLRRLKLLLGK 349

Query: 906  XXXXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNA 727
                  K L HQEKLAFWINIYNSCMMNA+LEHGIPE PE++VALMQKATINVGGH L+A
Sbjct: 350  LASVNLKSLGHQEKLAFWINIYNSCMMNAFLEHGIPERPEIIVALMQKATINVGGHLLSA 409

Query: 726  ITIEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYT 547
            ITIEHFILRLPYHSKYTF KGAKNDE + RS+F LELSEPLVTFALSCGSWSSPAVR+YT
Sbjct: 410  ITIEHFILRLPYHSKYTFSKGAKNDENTARSIFALELSEPLVTFALSCGSWSSPAVRVYT 469

Query: 546  ASQVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSE 367
            ASQV+NELE+AK+EYLQAA+GIS++K   AIPKLLDWY+LD+AKDL+S +DWICLQLPSE
Sbjct: 470  ASQVDNELEIAKREYLQAAVGISSSK--FAIPKLLDWYLLDYAKDLESLLDWICLQLPSE 527

Query: 366  LGKEAINCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
            LGKEAI  LE  + +P SQF++IMPYEF+FRYLLHT
Sbjct: 528  LGKEAIKFLEGAKTDPHSQFVQIMPYEFSFRYLLHT 563


>gb|EOY14573.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 546

 Score =  550 bits (1416), Expect = e-153
 Identities = 306/574 (53%), Positives = 374/574 (65%), Gaps = 5/574 (0%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MNTR R A+LQS + P K +KEKV++Q      + KA  NRR + +ERK  LQQDVDKLK
Sbjct: 1    MNTRVR-ASLQSRKVPGKHEKEKVDMQETKPTVATKAMKNRRASSKERKMVLQQDVDKLK 59

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            +KLR EEN+HRALERAF                                       V+FR
Sbjct: 60   KKLRQEENIHRALERAFNRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVRLEEKVVHFR 119

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            Q LYQEAVYISSSKRN+DNSA+ C+     + K +Q K+          S  RHL    D
Sbjct: 120  QDLYQEAVYISSSKRNMDNSADLCEPSLDKSPKPEQPKILTRD-----TSMARHLQSFSD 174

Query: 1425 NGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDMDQDSK 1246
            +G GKEN+  +NSTKS + S   K+Q+++TP +R   + +P +K +DP+KLQL+     +
Sbjct: 175  DGRGKENQSCTNSTKSNKGSLVHKSQSVRTPVERPLIDSKPAEKRIDPQKLQLECRIRDQ 234

Query: 1245 GRTP----NIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXX 1078
            G T     +  +E+  GDD PNK+SE ++KCL SIFLRMS+ + +S A+           
Sbjct: 235  GNTEARIISTPDERRLGDDEPNKVSEELVKCLSSIFLRMSSTKRKSTAEGSPSLSMLGSQ 294

Query: 1077 XXED-ADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXX 901
               +  +F+DPYG  + FG+RDIGPYK+LF +++ SINPNRT+ S               
Sbjct: 295  ESSEETEFRDPYGTCSNFGRRDIGPYKNLFSIDAGSINPNRTSKSLFLLRRLKLLLERLA 354

Query: 900  XXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAIT 721
                  L HQEKLAFWINIYNSCMMNA                    TINVGG  LNAIT
Sbjct: 355  SSNLLNLNHQEKLAFWINIYNSCMMNA--------------------TINVGGRLLNAIT 394

Query: 720  IEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTAS 541
            IEHFILRLPYHSK+ F KG KNDEM+ RS+FGLELSEPLVTFALSCGSWSSPAVR+YTAS
Sbjct: 395  IEHFILRLPYHSKFIFSKGVKNDEMTARSIFGLELSEPLVTFALSCGSWSSPAVRVYTAS 454

Query: 540  QVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELG 361
            QVENELE+AK+EYLQAA+GIS+TK   AIPKLLDWY+LDFAKDLDS +DWICLQLPSELG
Sbjct: 455  QVENELEVAKREYLQAAVGISSTK--FAIPKLLDWYLLDFAKDLDSLLDWICLQLPSELG 512

Query: 360  KEAINCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
            KEAI  LER ++E LSQF++IMPYEF+FRYLL T
Sbjct: 513  KEAIKYLERAKSESLSQFVQIMPYEFSFRYLLCT 546


>ref|XP_006473371.1| PREDICTED: uncharacterized protein LOC102623920 isoform X2 [Citrus
            sinensis]
          Length = 539

 Score =  541 bits (1395), Expect = e-151
 Identities = 292/519 (56%), Positives = 358/519 (68%), Gaps = 6/519 (1%)
 Frame = -1

Query: 1800 VDKLKQKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1621
            VDKLK+KLRHEENVHRALERAF                                      
Sbjct: 23   VDKLKKKLRHEENVHRALERAFSRPLGALPRLPPYLPPSTKELLAEVAVLEEEVVRLEEQ 82

Query: 1620 XVNFRQGLYQEAVYISSSKRNVDNSAETCDQ-YQINNRKQKQSKLSLEAEDDLIASSGRH 1444
             V+FRQ LY+EAVYISSSK+N+++S + CD      N KQ+QSK           S+ R 
Sbjct: 83   VVHFRQDLYREAVYISSSKKNMESSIDLCDPCVDDTNSKQEQSKFLARNVGRSTTSAIRQ 142

Query: 1443 LALLYDNGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQL- 1267
            LA L  +G GKEN++ +NS K K+ SS  K Q  +TP KR   + +   +HLDP+K+QL 
Sbjct: 143  LAALSADGRGKENQLCTNSMK-KKGSSVHKVQTGRTPVKRPSNDCKQTMRHLDPQKIQLV 201

Query: 1266 ---DMDQDSKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSA-KRTRSMADXXXX 1099
                  ++   RT ++ +E+ SGDD PN+ISE +++CL +I LRMS+ KR  +  +    
Sbjct: 202  CRLQNPENEGARTISVTDERESGDDGPNRISEDIVRCLSTILLRMSSGKRKGTSENLHFL 261

Query: 1098 XXXXXXXXXEDADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXX 919
                     E+ + +DPYGI  +FGKRDIGPYKHL  +E+ SI+ NRT+ S         
Sbjct: 262  STLASEESNEETESQDPYGICLQFGKRDIGPYKHLLAIEADSIDTNRTSSSMFLVRRLKI 321

Query: 918  XXXXXXXXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGH 739
                      + L HQEKLAFWINIYNSCMMNA+LE+GIPESPEMVVALMQKATI VGGH
Sbjct: 322  LLGKIASVNLENLNHQEKLAFWINIYNSCMMNAFLENGIPESPEMVVALMQKATIRVGGH 381

Query: 738  FLNAITIEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAV 559
             LNAITIEHFILRLPYHSKYTF KGAKNDEM+ R +FGLELSEPLVTFALSCGSWSSPAV
Sbjct: 382  LLNAITIEHFILRLPYHSKYTFSKGAKNDEMTARFMFGLELSEPLVTFALSCGSWSSPAV 441

Query: 558  RIYTASQVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQ 379
            R+YTAS+VE+ELE+AK+EYLQAA+GIS+ K  +AIPKLLDWY+LDFAKD +S +DWICLQ
Sbjct: 442  RVYTASEVESELEVAKREYLQAAVGISSEK--LAIPKLLDWYLLDFAKDFESLLDWICLQ 499

Query: 378  LPSELGKEAINCLERDRNEPLSQFLEIMPYEFNFRYLLH 262
            +P ELGK+AI CLER +NEPLSQF+++MPYEF+FRYLLH
Sbjct: 500  VPCELGKKAIKCLERGKNEPLSQFIQVMPYEFSFRYLLH 538


>ref|XP_004141244.1| PREDICTED: uncharacterized protein LOC101211254 [Cucumis sativus]
          Length = 547

 Score =  540 bits (1392), Expect = e-151
 Identities = 303/570 (53%), Positives = 372/570 (65%), Gaps = 4/570 (0%)
 Frame = -1

Query: 1956 RERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLKQKL 1777
            R+    LQS+RA    +K  V++   N +++ KA T+ R + R+RK ALQQDVDKLK+KL
Sbjct: 3    RKGRTRLQSMRASANHEKGNVDMPEANFLDAAKASTSGRVSSRQRKVALQQDVDKLKKKL 62

Query: 1776 RHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFRQGL 1597
            RHEENV RAL+RAF                                       V FRQ L
Sbjct: 63   RHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVLFRQDL 122

Query: 1596 YQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYDNGL 1417
            YQEAV ISSSK+ ++ S +       NN KQ QSKLS++  D+++               
Sbjct: 123  YQEAVNISSSKKTMELSPK-------NNSKQAQSKLSVQKTDNVV--------------- 160

Query: 1416 GKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDM----DQDS 1249
            GKENE   NST + + SS  K   +KTP K+ P   +  +K   PK L L+      +++
Sbjct: 161  GKENESRMNSTSNNKGSSIKKIHTIKTPVKKPPVRNKSSEKPNSPK-LNLENRTANPENA 219

Query: 1248 KGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXXXXE 1069
            + R     +++ SGDD+PN ISE++LKCL SI LRMS+ + R   +             E
Sbjct: 220  EARQLRAPDDKVSGDDSPNSISENILKCLSSILLRMSSIKNRGATESLHLFSMVTTMQTE 279

Query: 1068 DADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXXXXXX 889
            + D  DPYGI +EFG+RDIGPYK++  VE+ SIN  RTT S                   
Sbjct: 280  ETDLPDPYGICSEFGRRDIGPYKNVHTVEACSINTKRTTNSLFLFQRLKLLLGKLASVNL 339

Query: 888  KGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAITIEHF 709
            + LTHQEKLAFWINIYNSCM+NA+LEHGIPESPEMVVALMQKATINV GH LNAITIEHF
Sbjct: 340  QRLTHQEKLAFWINIYNSCMINAFLEHGIPESPEMVVALMQKATINVSGHLLNAITIEHF 399

Query: 708  ILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTASQVEN 529
            ILRLPYHS+Y F K AK DE + RS+FGLELSEPLVTFALSCGSWSSPAVR+YTASQVEN
Sbjct: 400  ILRLPYHSQYAFSKSAKYDEKTFRSIFGLELSEPLVTFALSCGSWSSPAVRVYTASQVEN 459

Query: 528  ELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELGKEAI 349
            ELE+AK+EYL+AA+GIS+ K    IPKLLDWY+LDFAKDLDS VDW+CLQLPSELGKEAI
Sbjct: 460  ELELAKREYLEAAVGISSEK--FGIPKLLDWYLLDFAKDLDSLVDWVCLQLPSELGKEAI 517

Query: 348  NCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
              +E  RN+PLSQF++++PYEF+FRYLL T
Sbjct: 518  KLMEGRRNQPLSQFVKVIPYEFSFRYLLCT 547


>gb|EOY14572.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 533

 Score =  539 bits (1389), Expect = e-150
 Identities = 288/521 (55%), Positives = 353/521 (67%), Gaps = 5/521 (0%)
 Frame = -1

Query: 1866 SVKAKTNRRGTIRERKKALQQDVDKLKQKLRHEENVHRALERAFXXXXXXXXXXXXXXXX 1687
            + KA  NRR + +ERK  LQQDVDKLK+KLR EEN+HRALERAF                
Sbjct: 9    ATKAMKNRRASSKERKMVLQQDVDKLKKKLRQEENIHRALERAFNRPLGALPRLPPYLPP 68

Query: 1686 XXXXXXXXXXXXXXXXXXXXXXXVNFRQGLYQEAVYISSSKRNVDNSAETCDQYQINNRK 1507
                                   V+FRQ LYQEAVYISSSKRN+DNSA+ C+     + K
Sbjct: 69   STLELLAEVAVLEEEVVRLEEKVVHFRQDLYQEAVYISSSKRNMDNSADLCEPSLDKSPK 128

Query: 1506 QKQSKLSLEAEDDLIASSGRHLALLYDNGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSK 1327
             +Q K+          S  RHL    D+G GKEN+  +NSTKS + S   K+Q+++TP +
Sbjct: 129  PEQPKILTRD-----TSMARHLQSFSDDGRGKENQSCTNSTKSNKGSLVHKSQSVRTPVE 183

Query: 1326 RQPFEYRPVDKHLDPKKLQLDMDQDSKGRTP----NIGNEQTSGDDNPNKISESVLKCLL 1159
            R   + +P +K +DP+KLQL+     +G T     +  +E+  GDD PNK+SE ++KCL 
Sbjct: 184  RPLIDSKPAEKRIDPQKLQLECRIRDQGNTEARIISTPDERRLGDDEPNKVSEELVKCLS 243

Query: 1158 SIFLRMSAKRTRSMADXXXXXXXXXXXXXED-ADFKDPYGILTEFGKRDIGPYKHLFVVE 982
            SIFLRMS+ + +S A+              +  +F+DPYG  + FG+RDIGPYK+LF ++
Sbjct: 244  SIFLRMSSTKRKSTAEGSPSLSMLGSQESSEETEFRDPYGTCSNFGRRDIGPYKNLFSID 303

Query: 981  SSSINPNRTTISXXXXXXXXXXXXXXXXXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGI 802
            + SINPNRT+ S                     L HQEKLAFWINIYNSCMMNA+LEHG+
Sbjct: 304  AGSINPNRTSKSLFLLRRLKLLLERLASSNLLNLNHQEKLAFWINIYNSCMMNAFLEHGV 363

Query: 801  PESPEMVVALMQKATINVGGHFLNAITIEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGL 622
            P+SP+MVV LM+KATINVGG  LNAITIEHFILRLPYHSK+ F KG KNDEM+ RS+FGL
Sbjct: 364  PDSPKMVVELMRKATINVGGRLLNAITIEHFILRLPYHSKFIFSKGVKNDEMTARSIFGL 423

Query: 621  ELSEPLVTFALSCGSWSSPAVRIYTASQVENELEMAKKEYLQAAIGISTTKRVVAIPKLL 442
            ELSEPLVTFALSCGSWSSPAVR+YTASQVENELE+AK+EYLQAA+GIS+TK   AIPKLL
Sbjct: 424  ELSEPLVTFALSCGSWSSPAVRVYTASQVENELEVAKREYLQAAVGISSTK--FAIPKLL 481

Query: 441  DWYMLDFAKDLDSFVDWICLQLPSELGKEAINCLERDRNEP 319
            DWY+LDFAKDLDS +DWICLQLPSELGKEAI  LER + +P
Sbjct: 482  DWYLLDFAKDLDSLLDWICLQLPSELGKEAIKYLERAKRKP 522


>gb|EXB75617.1| hypothetical protein L484_026093 [Morus notabilis]
          Length = 549

 Score =  536 bits (1380), Expect = e-149
 Identities = 291/573 (50%), Positives = 369/573 (64%), Gaps = 5/573 (0%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MNT+ RT    +    L+P K    L+  +                       + VDKLK
Sbjct: 1    MNTKTRTTTTITTATRLQPGKTAPTLKNSS-----------------------EKVDKLK 37

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            +KLRHEE+VHRALERAF                                       V+FR
Sbjct: 38   KKLRHEESVHRALERAFNRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVHFR 97

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            Q LYQEAVYISSSKRN++NSA++ D   + + + +  K  L    +   +  +HL    D
Sbjct: 98   QDLYQEAVYISSSKRNIENSADSHDPRPVKSPRPELPKFLLAPMGNSATTRTKHLRTNSD 157

Query: 1425 NGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDM----D 1258
            +  GKEN+  +NSTK+ + SS  K+Q M+   KR P + +  +K  DP+KLQL+     +
Sbjct: 158  DRQGKENQSCTNSTKNSKGSSIHKSQTMRASVKRPPADQKSAEKSSDPQKLQLESRVTDN 217

Query: 1257 QDSKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXX 1078
            + ++ RT  +   + S DD+PN+ISE+++KCL +IFLRMS+ + R   +           
Sbjct: 218  ESAEARTCTVQETKVSEDDSPNRISENIMKCLCNIFLRMSSVKNRGGDESCPTFSNLATQ 277

Query: 1077 XXEDA-DFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXX 901
              ++  +F DPYGI +EFGKRDIG YK    +++SSIN NRT  S               
Sbjct: 278  ESKEKREFGDPYGITSEFGKRDIGKYKQFLSIDASSINLNRTANSLFLLRRLKLLFEKLA 337

Query: 900  XXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAIT 721
                + L+HQEKLAFWINIYNSCMMN +LEHGIPESPEMV ALMQKA +NVGGH LNAIT
Sbjct: 338  SVNLENLSHQEKLAFWINIYNSCMMNPFLEHGIPESPEMVAALMQKAIVNVGGHLLNAIT 397

Query: 720  IEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTAS 541
            IEHFILRLPYHSKYTF KGAKNDE + RS+FGLELSEPLVTFALSCGSWSSPAVR+Y+A+
Sbjct: 398  IEHFILRLPYHSKYTFSKGAKNDEKTARSIFGLELSEPLVTFALSCGSWSSPAVRVYSAA 457

Query: 540  QVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELG 361
            QVENELE+AK+EYL+AA+GIS++K    IPKLLDWY+LDFAKDL+S +DWICLQLPSELG
Sbjct: 458  QVENELEVAKREYLEAAVGISSSK--FKIPKLLDWYLLDFAKDLESLLDWICLQLPSELG 515

Query: 360  KEAINCLERDRNEPLSQFLEIMPYEFNFRYLLH 262
            KEA+ CLER + +PLSQF+ IMPY+FNFRYLL+
Sbjct: 516  KEALKCLERGKIQPLSQFVHIMPYDFNFRYLLY 548


>ref|XP_006578008.1| PREDICTED: uncharacterized protein LOC100807554 isoform X1 [Glycine
            max]
          Length = 587

 Score =  499 bits (1284), Expect = e-138
 Identities = 294/578 (50%), Positives = 356/578 (61%), Gaps = 5/578 (0%)
 Frame = -1

Query: 1977 E*RKMNTRERTANLQSVRAPLKP--QKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQ 1804
            E +KMNT  R   LQ  +AP       EK E++G +   +  A    + + ++RK ALQQ
Sbjct: 20   EVQKMNTSVRP-RLQHRKAPTPTTVSHEKAEMRGSSRGGANNAVKGGKTSSKDRKLALQQ 78

Query: 1803 DVDKLKQKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1624
            DVD+LK+KL+ EEN+HRALERAF                                     
Sbjct: 79   DVDRLKKKLKREENIHRALERAFNRPLGALPRLPPYLPPYTLGLLAEVAVLEEEIVRLEE 138

Query: 1623 XXVNFRQGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRH 1444
              V+FRQ LYQEAVY+SSSK  ++ SA   +    ++ K  + K   +  DD   S  R 
Sbjct: 139  QVVHFRQDLYQEAVYMSSSKMKLEQSARVNNASPNSSPKLGKLKSLSQTMDDAATSETRP 198

Query: 1443 LALLYDNGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQ-- 1270
               L  +  GKEN+  +NS KS ++S+  K Q  K+P K+ P + + + K  DP K Q  
Sbjct: 199  TTTLPKDRHGKENQSCTNSFKSNKQST-CKGQTTKSPIKKLPIDNKSLQKRRDPPKKQQE 257

Query: 1269 LDMDQDSKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSA-KRTRSMADXXXXXX 1093
            L +         N+  E   GD++PN ISE++LKCL SI LRMSA K   S AD      
Sbjct: 258  LRLKDQPIAEVRNL-RENPQGDESPNIISENILKCLSSIILRMSAAKNLDSTADVPPLRT 316

Query: 1092 XXXXXXXEDADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXX 913
                   E  +F DPY I  EFGKRDIGPYK L  +E+ S +P RT  S           
Sbjct: 317  PKSKNCVEGIEFWDPYSICLEFGKRDIGPYKQLRSIETKSFDPKRTAKSLFLLHRLKLLL 376

Query: 912  XXXXXXXXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFL 733
                    + L HQEKLAFWINIYNSCMMNAY+E+GIPESPEMV ALMQKATINVGGH L
Sbjct: 377  RKLACVNIENLNHQEKLAFWINIYNSCMMNAYIENGIPESPEMVAALMQKATINVGGHLL 436

Query: 732  NAITIEHFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRI 553
            +A TIEH ILRLPYH K+T  KG KN E      +GLELSEPLVTFALSCG+WSSPAVRI
Sbjct: 437  SATTIEHCILRLPYHWKFTLSKGGKNHE-----TYGLELSEPLVTFALSCGTWSSPAVRI 491

Query: 552  YTASQVENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLP 373
            YTASQVENELEMAK+EYLQAA+GIS +K    IPKLLDWY+LDFAKDL+S +DWICLQLP
Sbjct: 492  YTASQVENELEMAKREYLQAAVGISISK--FLIPKLLDWYLLDFAKDLESLLDWICLQLP 549

Query: 372  SELGKEAINCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
            S++GKEAI  LE+ +  PLSQF+ IMPYEFNFRYLL T
Sbjct: 550  SDVGKEAIKFLEKRKTGPLSQFVHIMPYEFNFRYLLCT 587


>ref|XP_006581234.1| PREDICTED: uncharacterized protein LOC100818982 isoform X1 [Glycine
            max]
          Length = 563

 Score =  495 bits (1274), Expect = e-137
 Identities = 289/572 (50%), Positives = 349/572 (61%), Gaps = 3/572 (0%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKPQKEKVELQGKNTMNSVKAKTNRRGTIRERKKALQQDVDKLK 1786
            MNT  R   L   +AP     EK E++G +      A    + + ++RK ALQQDVD+LK
Sbjct: 1    MNTSVRPRLLHRKKAPTLLSHEKAEMRGSSRGGENNAAKGGKASSKDRKLALQQDVDRLK 60

Query: 1785 QKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVNFR 1606
            +KL  EEN+HRALERAF                                       V+FR
Sbjct: 61   KKLSREENIHRALERAFNRPLGALPRLPPYLPPYTLGLLAEVAVLEEEIVRLEEQVVHFR 120

Query: 1605 QGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYD 1426
            Q LYQEAVY+SSSK  ++ SA   +    ++ K  + K   ++ DD   S  R    L  
Sbjct: 121  QDLYQEAVYMSSSKMKLEQSAGVNNANPTSSPKLGKLKSLSQSMDDTATSETRPTTTLPK 180

Query: 1425 NGLGKENEVFSNSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQ--LDMDQD 1252
            +  GKEN+  ++S+KS  K S  K Q  K+P K+ P + + + K  DP K Q  L +   
Sbjct: 181  DRHGKENQSCTSSSKSS-KQSICKGQTTKSPIKKLPIDNKSLQKRRDPPKKQQELRLKDQ 239

Query: 1251 SKGRTPNIGNEQTSGDDNPNKISESVLKCLLSIFLRMSA-KRTRSMADXXXXXXXXXXXX 1075
                  N+  E   GD+ PN ISE++LKCL +I LRMSA K   S AD            
Sbjct: 240  PIAEVRNL-RENPQGDECPNIISENILKCLSNIILRMSAAKNLDSTADVPPFRTPKSKNC 298

Query: 1074 XEDADFKDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXXXX 895
             E ++F DPY I  EFGKRD GP+K L  +E+ S +P RT  S                 
Sbjct: 299  VEGSEFWDPYSICLEFGKRDSGPFKQLRSIEAKSFDPKRTAKSLFLLHRLKLLLRKLACV 358

Query: 894  XXKGLTHQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAITIE 715
              + L HQEKLAFWINIYNSCMMNAYLE GIPESPEMVVALM KATINVGGH L+A TIE
Sbjct: 359  NIENLNHQEKLAFWINIYNSCMMNAYLEKGIPESPEMVVALMHKATINVGGHLLSATTIE 418

Query: 714  HFILRLPYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTASQV 535
            H ILRLPYH K+T  KG KN E      +GLELSEPLVTFALSCG+WSSPAVRIY ASQV
Sbjct: 419  HCILRLPYHWKFTLSKGGKNHE-----TYGLELSEPLVTFALSCGTWSSPAVRIYRASQV 473

Query: 534  ENELEMAKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELGKE 355
            ENELEMAKKEYLQAA+GIS +K    IPKLLDWY+LDFAKDL+S +DWICLQLPS++GKE
Sbjct: 474  ENELEMAKKEYLQAAVGISISK--FLIPKLLDWYLLDFAKDLESLLDWICLQLPSDVGKE 531

Query: 354  AINCLERDRNEPLSQFLEIMPYEFNFRYLLHT 259
            AI  LE+ + EPLSQ+++IMPYEFNFRYLL T
Sbjct: 532  AIKFLEKRKTEPLSQYVQIMPYEFNFRYLLCT 563


>gb|EOX91357.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508699462|gb|EOX91358.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 616

 Score =  495 bits (1274), Expect = e-137
 Identities = 292/617 (47%), Positives = 375/617 (60%), Gaps = 50/617 (8%)
 Frame = -1

Query: 1965 MNTRERTANLQSVRAPLKP---QKEKVEL-QGKNTMNSVKAKTNRRGTIRERKKALQQDV 1798
            MNTR RTA+ QS++APL     +KEK+E  QG   + + KA TNRR + RERK AL QDV
Sbjct: 2    MNTRVRTAH-QSMKAPLSHDSNKKEKMEKSQGGRALGTGKALTNRRRSNRERKMALLQDV 60

Query: 1797 DKLKQKLRHEENVHRALERAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1618
            DKLK+KLRHEENVHRALERAF                                       
Sbjct: 61   DKLKRKLRHEENVHRALERAFTRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQV 120

Query: 1617 VNFRQGLYQEAVYISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLA 1438
            VNFRQGLYQEAVY +SSKRNV+N  E+ +Q  + + K ++SK     E   + + G+   
Sbjct: 121  VNFRQGLYQEAVY-ASSKRNVENLNESIEQSPVRSSKHQRSKSLSVNEMSSVTTIGKPQP 179

Query: 1437 LL----------------YDNGL-----------------------GKENEVFSNSTKSK 1375
             L                  NGL                       GKEN+ F+N+ K K
Sbjct: 180  SLARSVSSRKLLPPDTTNERNGLCFSRPTNGRQASTKLNSASGDVRGKENQSFANAVKDK 239

Query: 1374 QKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLD---MDQDSKGRTPNIGNEQ--TS 1210
            Q S   K   + TP KR P ++   +K LD  K QLD   +DQ+    +P+  ++   + 
Sbjct: 240  Q-SPEKKITKVVTPVKRLPTKHESANKCLDALKSQLDGRLVDQERAQESPSGSSDDKVSE 298

Query: 1209 GDDNPNKISESVLKCLLSIFLRMSAKRTRSMAD--XXXXXXXXXXXXXEDADFKDPYGIL 1036
             D  PNKISE  ++CL SIF+R+S  + RS+                  +++F+DPYGI 
Sbjct: 299  ADSTPNKISEDTVRCLCSIFVRLSTLKDRSVESGILPSQSAANSYEISRESEFQDPYGIC 358

Query: 1035 TEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXXXXXXKGLTHQEKLAF 856
            ++   RDIGPYK+L  +E+++++ +R   +                    GL+HQ+KLAF
Sbjct: 359  SDSKTRDIGPYKNLCTIEANTVDLSRRMNALFLIHRLKFLLGKLTSVNLDGLSHQQKLAF 418

Query: 855  WINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAITIEHFILRLPYHSKYT 676
            WIN YNSCMMNA LEHGIPE+PE VV LMQKATI VGGH LNAITIEHFILRLP+H K+T
Sbjct: 419  WINTYNSCMMNAILEHGIPETPESVVGLMQKATIVVGGHLLNAITIEHFILRLPFHLKFT 478

Query: 675  FVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTASQVENELEMAKKEYLQ 496
              K AKNDEM  R++FGLE SEPLVT+AL+CGSWSSPAVR+YTAS VE+ELE AK++YLQ
Sbjct: 479  CSKAAKNDEMKARNIFGLEWSEPLVTYALACGSWSSPAVRVYTASHVEDELETAKRDYLQ 538

Query: 495  AAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELGKEAINCLERDRNEPL 316
            AA+ IS T +++ IPKLLDWY+LDFAKDL+S +DW+CLQL +EL  EA+ CLER   EPL
Sbjct: 539  AAVAISRTNKLI-IPKLLDWYLLDFAKDLESLLDWVCLQLTNELRNEAVKCLERKGKEPL 597

Query: 315  SQFLEIMPYEFNFRYLL 265
            SQ +++MPY+F+FR LL
Sbjct: 598  SQLVQVMPYDFSFRLLL 614


>ref|XP_002327110.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  493 bits (1269), Expect = e-136
 Identities = 260/446 (58%), Positives = 319/446 (71%), Gaps = 6/446 (1%)
 Frame = -1

Query: 1578 ISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYDNGLGKENEV 1399
            +SSSKRNV++ ++    Y   N K  QSK   +  D+   S+ RHL  L  +G GKEN  
Sbjct: 1    MSSSKRNVESVSDLYHLYPNKNPKPDQSKSLAQNVDESATSTIRHLPSLSADGTGKENAF 60

Query: 1398 FSNSTKSKQKSSGL-KAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDM----DQDSKGRTP 1234
             + +++ K K S + KAQ  +   KR   + RP +K LD  K QL+      ++++ R+ 
Sbjct: 61   STANSRKKSKGSSINKAQTSRNMVKRPSEDNRPAEKKLDSHKSQLECRVPDQENAEARSH 120

Query: 1233 NIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXXXXED-ADF 1057
               +E  SGD +PNK+SE +LKCL SIF+RMS+   R  AD             E+ A+ 
Sbjct: 121  VTASEGVSGDASPNKLSEDILKCLSSIFVRMSSMNNRRTADNLSFLSTLVSQENEEEAEC 180

Query: 1056 KDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXXXXXXKGLT 877
            +DPYGI +EFGKRDIGPYK LF +ES +INPNRT+ S                   + LT
Sbjct: 181  QDPYGICSEFGKRDIGPYKRLFSIESGTINPNRTSNSLFLLHRLELLFGKLASVNLQNLT 240

Query: 876  HQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAITIEHFILRL 697
            HQ+KLAFWINIYNSCMMNA+LEHGIPESPE VV LM+KATIN+GGH LNAITIEHFILRL
Sbjct: 241  HQKKLAFWINIYNSCMMNAFLEHGIPESPETVVELMRKATINIGGHLLNAITIEHFILRL 300

Query: 696  PYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTASQVENELEM 517
            PY+SKYT  KGAKNDEM+ R+ FGLELSEPLV+FAL CGSWSSPAVR+YTA+QVENELE 
Sbjct: 301  PYYSKYTISKGAKNDEMAARNKFGLELSEPLVSFALCCGSWSSPAVRVYTAAQVENELEE 360

Query: 516  AKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELGKEAINCLE 337
            AK++YLQAAIGI+T+K   AIPKLLDWY+LDFAKDL+S +DWICLQLPSELGKEAINCLE
Sbjct: 361  AKRDYLQAAIGITTSK--FAIPKLLDWYLLDFAKDLESLLDWICLQLPSELGKEAINCLE 418

Query: 336  RDRNEPLSQFLEIMPYEFNFRYLLHT 259
              +NEP S F+++MPYEF FRYLL+T
Sbjct: 419  NGKNEPHSHFVQVMPYEFGFRYLLYT 444


>ref|XP_006374994.1| hypothetical protein POPTR_0014s03410g [Populus trichocarpa]
            gi|550323308|gb|ERP52791.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
          Length = 445

 Score =  493 bits (1268), Expect = e-136
 Identities = 262/446 (58%), Positives = 320/446 (71%), Gaps = 6/446 (1%)
 Frame = -1

Query: 1578 ISSSKRNVDNSAETCDQYQINNRKQKQSKLSLEAEDDLIASSGRHLALLYDNGLGKENEV 1399
            +SSSKRNV++ ++    Y   N K  QSK   +  D+   S+ RHL  L  +G GKEN  
Sbjct: 1    MSSSKRNVESVSDLYHLYPNKNPKPDQSKSLAQNVDESATSTIRHLPSLSADGTGKENAF 60

Query: 1398 FS-NSTKSKQKSSGLKAQNMKTPSKRQPFEYRPVDKHLDPKKLQLDM----DQDSKGRTP 1234
             + NS K+ + SS  KAQ  +   KR   + RP +K LD  K QL+      ++++ R+ 
Sbjct: 61   STANSRKNSKGSSINKAQTSRNMVKRPSEDNRPAEKKLDSHKSQLECRVPDQENAEARSH 120

Query: 1233 NIGNEQTSGDDNPNKISESVLKCLLSIFLRMSAKRTRSMADXXXXXXXXXXXXXED-ADF 1057
               +E  +GD +PNK+SE +LKCL SIFLRMS+   R  AD             E+ A+ 
Sbjct: 121  VTASEGVTGDASPNKLSEDILKCLSSIFLRMSSMNNRRTADNLSFLSTLVSQENEEEAEC 180

Query: 1056 KDPYGILTEFGKRDIGPYKHLFVVESSSINPNRTTISXXXXXXXXXXXXXXXXXXXKGLT 877
            +DPYGI +EFGKRDIGPYK LF +ES +INPNRT+ S                   + LT
Sbjct: 181  QDPYGICSEFGKRDIGPYKRLFSIESGTINPNRTSNSLFLLHRLELLFGKLASVNLQNLT 240

Query: 876  HQEKLAFWINIYNSCMMNAYLEHGIPESPEMVVALMQKATINVGGHFLNAITIEHFILRL 697
            HQ+KLAFWINIYNSCMMNA+LEHGIPESPE VV LM+KATIN+GGH LNAITIEHFILRL
Sbjct: 241  HQKKLAFWINIYNSCMMNAFLEHGIPESPETVVELMRKATINIGGHLLNAITIEHFILRL 300

Query: 696  PYHSKYTFVKGAKNDEMSTRSVFGLELSEPLVTFALSCGSWSSPAVRIYTASQVENELEM 517
            PY+SKYT  KGAKNDEM+ R+ FGLELSEPLV+FAL CGSWSSPAVR+YTA+QVENELE 
Sbjct: 301  PYYSKYTISKGAKNDEMAARNKFGLELSEPLVSFALCCGSWSSPAVRVYTAAQVENELEE 360

Query: 516  AKKEYLQAAIGISTTKRVVAIPKLLDWYMLDFAKDLDSFVDWICLQLPSELGKEAINCLE 337
            AK++YLQAAIGI+T+K   AIPKLLDWY+LDFAKDL+S +DWICLQLPSELGKEAINCLE
Sbjct: 361  AKRDYLQAAIGITTSK--FAIPKLLDWYLLDFAKDLESLLDWICLQLPSELGKEAINCLE 418

Query: 336  RDRNEPLSQFLEIMPYEFNFRYLLHT 259
            + +NEP S F+++MPYEF FRYLL+T
Sbjct: 419  KGKNEPHSHFVQVMPYEFGFRYLLYT 444

