BLASTX nr result

ID: Rehmannia23_contig00018345 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00018345
         (1747 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004238182.1| PREDICTED: uncharacterized protein LOC101262...   453   e-125
ref|XP_006354919.1| PREDICTED: uncharacterized protein LOC102581...   448   e-123
ref|XP_002282310.2| PREDICTED: uncharacterized protein LOC100266...   446   e-122
gb|EOY14571.1| Uncharacterized protein isoform 1 [Theobroma cacao]    427   e-117
ref|XP_002510268.1| transcription factor, putative [Ricinus comm...   422   e-115
ref|XP_006473370.1| PREDICTED: uncharacterized protein LOC102623...   421   e-115
gb|EMJ24107.1| hypothetical protein PRUPE_ppa003634mg [Prunus pe...   420   e-114
gb|EMJ24106.1| hypothetical protein PRUPE_ppa003634mg [Prunus pe...   420   e-114
ref|XP_006374990.1| hypothetical protein POPTR_0014s03410g [Popu...   410   e-112
ref|XP_006374991.1| hypothetical protein POPTR_0014s03410g [Popu...   410   e-112
ref|XP_004291137.1| PREDICTED: uncharacterized protein LOC101314...   406   e-110
gb|EOY14572.1| Uncharacterized protein isoform 2 [Theobroma cacao]    404   e-110
ref|XP_004141244.1| PREDICTED: uncharacterized protein LOC101211...   400   e-108
gb|EOY14573.1| Uncharacterized protein isoform 3 [Theobroma cacao]    392   e-106
ref|XP_006473371.1| PREDICTED: uncharacterized protein LOC102623...   386   e-104
gb|EXB75617.1| hypothetical protein L484_026093 [Morus notabilis]     379   e-102
gb|EPS58536.1| hypothetical protein M569_16277, partial [Genlise...   378   e-102
ref|XP_002301197.2| hypothetical protein POPTR_0002s12990g, part...   362   2e-97
ref|XP_006601299.1| PREDICTED: uncharacterized protein LOC100801...   360   8e-97
ref|XP_006595957.1| PREDICTED: uncharacterized protein LOC100817...   357   9e-96

>ref|XP_004238182.1| PREDICTED: uncharacterized protein LOC101262306 [Solanum
            lycopersicum]
          Length = 572

 Score =  453 bits (1166), Expect = e-125
 Identities = 254/472 (53%), Positives = 304/472 (64%), Gaps = 66/472 (13%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MN + R SL  MK   K+ KE+VEMQ    M  +K   +RR +IRERKMAL QDVDKLKK
Sbjct: 1    MNARVRTSLQSMKTPSKNVKEKVEMQGNRKMSTDKTPINRRKAIRERKMALLQDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LRHEENVHRALERAFTRPLGALPRLPPYLP +T                      HFRQ
Sbjct: 61   KLRHEENVHRALERAFTRPLGALPRLPPYLPPNTLELLAEVAVLEEEVVRLEEKVVHFRQ 120

Query: 525  GLYQEAVYNSSSKK---NIDNT-------------------------------------- 581
            GLY EAVY SSSK+   N+ +T                                      
Sbjct: 121  GLYHEAVYISSSKRNMDNVTDTVEQNQVKSPKQKQTKLSPQLESNSASFSGRHLPSLSDD 180

Query: 582  --------------KQHSPNSKVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXXX 719
                          K  S N+KV++ +TPV+K P + RL ++ +DPQ+LQL         
Sbjct: 181  SCLKENHSLSSTKSKHRSVNAKVKTARTPVKKLPAENRLAEKRVDPQKLQLEDQVMYHGS 240

Query: 720  XXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNK-------SITSF---ES 869
               +    Q+   +  +SPN ISE+ILKCL NIFLRMSS+K +       S+T +   ES
Sbjct: 241  LEERIFVTQDRKPSPDESPNTISENILKCLSNIFLRMSSRKGRTTADTLPSLTGYNSCES 300

Query: 870  LAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKI 1049
            +   E+ DPY +CSKF +RDIGPYKHL A+EA S++PNRTTISVFLV+RLKLL ++LA  
Sbjct: 301  IEKKEFGDPYGICSKFERRDIGPYKHLYAVEASSVNPNRTTISVFLVRRLKLLLEKLASA 360

Query: 1050 NLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIE 1229
            NL+ LSHQEKLAFWINIYNSCMMNAF+EYG+ E+ E VV LMQKAT+NV GH+LNAITIE
Sbjct: 361  NLQGLSHQEKLAFWINIYNSCMMNAFLEYGLPENPEMVVALMQKATINVSGHLLNAITIE 420

Query: 1230 HFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            HFILR PYHSK+TF+KG K D MT RS+ GLELSEPLVTFALSCGS+SSPAV
Sbjct: 421  HFILRLPYHSKFTFAKGVKNDEMTARSIFGLELSEPLVTFALSCGSFSSPAV 472


>ref|XP_006354919.1| PREDICTED: uncharacterized protein LOC102581774 [Solanum tuberosum]
          Length = 570

 Score =  448 bits (1152), Expect = e-123
 Identities = 255/472 (54%), Positives = 303/472 (64%), Gaps = 66/472 (13%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MN + R SL  MK   K+ KE+VEMQ    M  EK   +RR +IRERKMAL QDVDKLKK
Sbjct: 1    MNARVRTSLQSMKTPSKNVKEKVEMQGNRKMSTEKTPINRRKAIRERKMALLQDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LRHEENVHRALERAFTRPLGALPRLPPYLP +T                      HFRQ
Sbjct: 61   KLRHEENVHRALERAFTRPLGALPRLPPYLPPNTLELLAEVAVLEEEVVRLEEKVVHFRQ 120

Query: 525  GLYQEAVYNSSSKK---NIDNT-------------------------------------- 581
            GLY EAVY SSSK+   N+ +T                                      
Sbjct: 121  GLYHEAVYISSSKRNMDNVTDTIEQNQVKSPKQKQTKLSPQLESNSASFSGRHLPSLSDD 180

Query: 582  --------------KQHSPNSKVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXXX 719
                          K  S N+KV++V+TPV+K P + RL ++ +DPQ+LQ          
Sbjct: 181  SCLKENHSLSSTKSKHRSVNAKVKTVRTPVKKLPAENRLAEKRVDPQKLQ--DQVMYHGS 238

Query: 720  XXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNK-------SITSF---ES 869
               +    Q+   +  +SPN ISE+ILKCL NIFLRMSS+K +       S+T +   ES
Sbjct: 239  LEERIFVTQDRKPSPDESPNTISENILKCLSNIFLRMSSRKGRTTADTLPSLTGYNSCES 298

Query: 870  LAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKI 1049
            +   E+ DPY +CSKF +RDIGPYKHL A+EA S++PNRTTISVFLV+RLKLL ++LA  
Sbjct: 299  IEKKEFGDPYGICSKFERRDIGPYKHLYAVEASSVNPNRTTISVFLVRRLKLLLEKLASA 358

Query: 1050 NLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIE 1229
            NL+ LSHQEKLAFWINIYNSCMMNAF+EYG+ E+ E VV LMQKAT+ V GH+LNAITIE
Sbjct: 359  NLQGLSHQEKLAFWINIYNSCMMNAFLEYGLPENPEMVVALMQKATIKVSGHLLNAITIE 418

Query: 1230 HFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            HFILR PYHSK+TF+KG K D MT RSV GLELSEPLVTFALSCGS+SSPAV
Sbjct: 419  HFILRLPYHSKFTFAKGVKNDEMTARSVFGLELSEPLVTFALSCGSFSSPAV 470


>ref|XP_002282310.2| PREDICTED: uncharacterized protein LOC100266128 [Vitis vinifera]
            gi|302142367|emb|CBI19570.3| unnamed protein product
            [Vitis vinifera]
          Length = 573

 Score =  446 bits (1148), Expect = e-122
 Identities = 250/473 (52%), Positives = 296/473 (62%), Gaps = 67/473 (14%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MNT+ R  L  MKA +KH+K++VEMQ    MDA++   +RRG  R+RKMALQQDVDKLKK
Sbjct: 1    MNTRVRTKLQNMKAPMKHDKDKVEMQGTRGMDAKRATANRRGPSRDRKMALQQDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LRHEENVHRALERAF RPLGALPRLPPYLP  T                      HFRQ
Sbjct: 61   KLRHEENVHRALERAFNRPLGALPRLPPYLPPCTLELLAEVAILEEEVVRLEEQVVHFRQ 120

Query: 525  GLYQEAVYNSSSKKNI-----------------DNTK----------------------- 584
            GLYQEAVY SSSKKN+                 D TK                       
Sbjct: 121  GLYQEAVYISSSKKNMESLADLYNPYLMRNSKKDQTKFLVQTVDNSATSATRDAPSPPAD 180

Query: 585  ----------------QHSPNSKVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXX 716
                            +  PN+K Q + TPV++ PI+    ++HLD Q+LQL        
Sbjct: 181  RRGKENQSYANSTKNNKRDPNNKAQKISTPVKRPPIEHGSAEKHLDSQKLQLENRVVDQE 240

Query: 717  XXXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNK----------SITSFE 866
                + S   +E  +  D PNKISE IL+CL +IFLRMS+ K++          S+ S  
Sbjct: 241  NAETRTSLTPDERLSADDKPNKISEDILRCLFSIFLRMSTLKSRGTSENLPSLPSLASHG 300

Query: 867  SLAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAK 1046
            S    E  DPY +CS+FGKRDIGPYKHL +I+A SI+ NRT  S+FLV RLK L  +LA 
Sbjct: 301  SGEETELQDPYGICSEFGKRDIGPYKHLFSIQASSINLNRTANSLFLVHRLKRLLGKLAS 360

Query: 1047 INLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITI 1226
            +NL+ L+HQEKLAFWIN YNSCMMNAF+E+GI  + E VVELM+KAT+NVGGH+LNAITI
Sbjct: 361  VNLQGLTHQEKLAFWINTYNSCMMNAFLEHGIPGNPEMVVELMRKATINVGGHLLNAITI 420

Query: 1227 EHFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            EHFILR PYH KYTF KG K D MT RS+ GLELSEPLVTFALSCGSWSSPAV
Sbjct: 421  EHFILRLPYHIKYTFPKGAKNDEMTARSIYGLELSEPLVTFALSCGSWSSPAV 473


>gb|EOY14571.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 566

 Score =  427 bits (1099), Expect = e-117
 Identities = 243/468 (51%), Positives = 291/468 (62%), Gaps = 62/468 (13%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MNT+ R SL   K   KHEKE+V+MQ      A K   +RR S +ERKM LQQDVDKLKK
Sbjct: 1    MNTRVRASLQSRKVPGKHEKEKVDMQETKPTVATKAMKNRRASSKERKMVLQQDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LR EEN+HRALERAF RPLGALPRLPPYLP  T                      HFRQ
Sbjct: 61   KLRQEENIHRALERAFNRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVRLEEKVVHFRQ 120

Query: 525  GLYQEAVYNSSSKKNIDNT----------------------------------------- 581
             LYQEAVY SSSK+N+DN+                                         
Sbjct: 121  DLYQEAVYISSSKRNMDNSADLCEPSLDKSPKPEQPKILTRDTSMARHLQSFSDDGRGKE 180

Query: 582  KQHSPNS----------KVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXXXXXXK 731
             Q   NS          K QSV+TPV +  I  +  ++ +DPQ+LQL            +
Sbjct: 181  NQSCTNSTKSNKGSLVHKSQSVRTPVERPLIDSKPAEKRIDPQKLQLECRIRDQGNTEAR 240

Query: 732  NSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNKS----------ITSFESLAYA 881
              +  +E     D PNK+SE ++KCL +IFLRMSS K KS          + S ES    
Sbjct: 241  IISTPDERRLGDDEPNKVSEELVKCLSSIFLRMSSTKRKSTAEGSPSLSMLGSQESSEET 300

Query: 882  EYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKINLKD 1061
            E+ DPY  CS FG+RDIGPYK+L +I+A SI+PNRT+ S+FL++RLKLL +RLA  NL +
Sbjct: 301  EFRDPYGTCSNFGRRDIGPYKNLFSIDAGSINPNRTSKSLFLLRRLKLLLERLASSNLLN 360

Query: 1062 LSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIEHFIL 1241
            L+HQEKLAFWINIYNSCMMNAF+E+G+ +S + VVELM+KAT+NVGG +LNAITIEHFIL
Sbjct: 361  LNHQEKLAFWINIYNSCMMNAFLEHGVPDSPKMVVELMRKATINVGGRLLNAITIEHFIL 420

Query: 1242 RQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            R PYHSK+ FSKG K D MT RS+ GLELSEPLVTFALSCGSWSSPAV
Sbjct: 421  RLPYHSKFIFSKGVKNDEMTARSIFGLELSEPLVTFALSCGSWSSPAV 468


>ref|XP_002510268.1| transcription factor, putative [Ricinus communis]
            gi|223550969|gb|EEF52455.1| transcription factor,
            putative [Ricinus communis]
          Length = 533

 Score =  422 bits (1086), Expect = e-115
 Identities = 238/438 (54%), Positives = 291/438 (66%), Gaps = 51/438 (11%)
 Frame = +3

Query: 222  KERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKKRLRHEENVHRALERAFTRP 401
            KE++E++     +A K   +R+ S RERK++LQQDVDKLKK+LR+EENVHRALERAF RP
Sbjct: 2    KEKIEIKGNKQRNATKAAKTRQASSRERKISLQQDVDKLKKKLRYEENVHRALERAFNRP 61

Query: 402  LGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQGLYQEAVYNSSSKKNID-- 575
            LGALPRLPPYLP  T                      HFRQ LYQEAVY SSSK+N++  
Sbjct: 62   LGALPRLPPYLPASTLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVYISSSKRNVESF 121

Query: 576  ----------NTKQHS----------------------PNSKVQSV------KTPVRKHP 641
                      N+KQ +                       N+K  S+      KTP++KHP
Sbjct: 122  ADLYDLSQNNNSKQANIKTIARNIDGQEKENQLCTNSVKNNKSSSIHKAQPGKTPMKKHP 181

Query: 642  IKCRLGKRHLDPQRLQLXXXXXXXXXXXXKNSANQEEVSTVGDSPNKISESILKCLMNIF 821
            I+ +  ++ LDPQ+LQ+            +N +  +E  +  D+PNKISE I+KCL NIF
Sbjct: 182  IENKQIEKCLDPQKLQVSQENPKEA----RNVSTADEHLSANDNPNKISEDIVKCLSNIF 237

Query: 822  LRMSSKKNK----------SITSFESLAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAIS 971
            LRMSS+K +          S+ S E+    E  DPY++CS+ GK+DIGPYKHL AIEA +
Sbjct: 238  LRMSSRKTRRTADNLSFLSSLVSQENGEEIECRDPYSICSEVGKKDIGPYKHLFAIEAGT 297

Query: 972  IDPNRTTISVFLVQRLKLLFQRLAKINLKDLSHQEKLAFWINIYNSCMMNAFIEYGITES 1151
            I+PNRT+ S+FL+ RLKLL  +LA +NL++L+HQEKLAFWINIYNSCMMNAF+E+GI ES
Sbjct: 298  INPNRTSNSLFLLHRLKLLLGKLASVNLQNLTHQEKLAFWINIYNSCMMNAFLEHGIPES 357

Query: 1152 AEKVVELMQKATLNVGGHVLNAITIEHFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELS 1328
             E VV LMQKAT+NVGGH LNAITIEHFILR PYH KY FSKGTK D MT RS  GLELS
Sbjct: 358  PEMVVALMQKATINVGGHSLNAITIEHFILRLPYHLKYAFSKGTKNDEMTARSKFGLELS 417

Query: 1329 EPLVTFALSCGSWSSPAV 1382
            EPLVTFALSCGSWSSPAV
Sbjct: 418  EPLVTFALSCGSWSSPAV 435


>ref|XP_006473370.1| PREDICTED: uncharacterized protein LOC102623920 isoform X1 [Citrus
            sinensis]
          Length = 573

 Score =  421 bits (1081), Expect = e-115
 Identities = 245/475 (51%), Positives = 290/475 (61%), Gaps = 69/475 (14%)
 Frame = +3

Query: 165  MNTKARRSLH--PMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKL 338
            M+ K R SL     +A L  +KE+VEM    +  A K   SRR S  +RK ALQQDVDKL
Sbjct: 1    MSRKLRTSLQCQSFEAPLNKDKEKVEMPITKVAGARKATASRRASNAQRKYALQQDVDKL 60

Query: 339  KKRLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHF 518
            KK+LRHEENVHRALERAF+RPLGALPRLPPYLP  T                      HF
Sbjct: 61   KKKLRHEENVHRALERAFSRPLGALPRLPPYLPPSTKELLAEVAVLEEEVVRLEEQVVHF 120

Query: 519  RQGLYQEAVYNSSSKKN-----------IDNT---------------------------- 581
            RQ LY+EAVY SSSKKN           +D+T                            
Sbjct: 121  RQDLYREAVYISSSKKNMESSIDLCDPCVDDTNSKQEQSKFLARNVGRSTTSAIRQLAAL 180

Query: 582  -----------------KQHSPNSKVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXX 710
                             K+ S   KVQ+ +TPV++    C+   RHLDPQ++QL      
Sbjct: 181  SADGRGKENQLCTNSMKKKGSSVHKVQTGRTPVKRPSNDCKQTMRHLDPQKIQLVCRLQN 240

Query: 711  XXXXXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNK----------SITS 860
                  +  +  +E  +  D PN+ISE I++CL  I LRMSS K K          ++ S
Sbjct: 241  PENEGARTISVTDERESGDDGPNRISEDIVRCLSTILLRMSSGKRKGTSENLHFLSTLAS 300

Query: 861  FESLAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRL 1040
             ES    E  DPY +C +FGKRDIGPYKHL+AIEA SID NRT+ S+FLV+RLK+L  ++
Sbjct: 301  EESNEETESQDPYGICLQFGKRDIGPYKHLLAIEADSIDTNRTSSSMFLVRRLKILLGKI 360

Query: 1041 AKINLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAI 1220
            A +NL++L+HQEKLAFWINIYNSCMMNAF+E GI ES E VV LMQKAT+ VGGH+LNAI
Sbjct: 361  ASVNLENLNHQEKLAFWINIYNSCMMNAFLENGIPESPEMVVALMQKATIRVGGHLLNAI 420

Query: 1221 TIEHFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            TIEHFILR PYHSKYTFSKG K D MT R + GLELSEPLVTFALSCGSWSSPAV
Sbjct: 421  TIEHFILRLPYHSKYTFSKGAKNDEMTARFMFGLELSEPLVTFALSCGSWSSPAV 475


>gb|EMJ24107.1| hypothetical protein PRUPE_ppa003634mg [Prunus persica]
          Length = 560

 Score =  420 bits (1079), Expect = e-114
 Identities = 245/472 (51%), Positives = 299/472 (63%), Gaps = 66/472 (13%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MNT+ +  L  MK  LK+EKE+V+MQ   + D  K  T+ RGS RERK+ALQQDVDKLKK
Sbjct: 1    MNTRVKTRLQSMKPPLKNEKEKVKMQGSKLRDTAKTITNGRGSRRERKLALQQDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LRHEENVHRAL+RAFTRPLGALPRLPPYLP +T                      HFR+
Sbjct: 61   KLRHEENVHRALQRAFTRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQLVHFRK 120

Query: 525  GLYQEAVYNSSSKKNID------------NTKQHSPNSKVQS------------------ 614
             LYQEAV  S+SK+ ++            N KQ  P S+ Q                   
Sbjct: 121  DLYQEAVNISTSKRKMETSADLCDSYPIKNPKQEQPKSQAQKANKSTNATEKHWPSPSDD 180

Query: 615  --------------------------VKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXX 716
                                      V+TPV++ PI  +  ++  DPQ+LQL        
Sbjct: 181  KQGKENQSSNNSTKKNNEKSLIHKAQVRTPVKRPPIDHKTAEKRSDPQKLQLEYRVMD-- 238

Query: 717  XXXXKNSANQEEVSTVGD-SPNKISESILKCLMNIFLRMSSKKN--KSITSFESLAYAEY 887
                + SA   + +  GD SPNKISE+ILKCL +I +RMSS K   +S+ SF +LA  E 
Sbjct: 239  ----QESAQVPDKAMSGDESPNKISENILKCLSSILMRMSSAKGSAESLPSFSTLAAQEN 294

Query: 888  S------DPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKI 1049
            +      DPY +CS+FG+RDIGPYK L AIEA +I+PN+T  ++FL++RLKLL ++LA +
Sbjct: 295  NEPKESWDPYAICSEFGRRDIGPYKQLHAIEAETINPNQTANALFLLRRLKLLLRKLASV 354

Query: 1050 NLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIE 1229
            NL+ LSHQEKLAFWINIYNSCMMNAF+E+GI ES E +V LMQKAT+NVGGH+LNAITIE
Sbjct: 355  NLQHLSHQEKLAFWINIYNSCMMNAFLEHGIPESPEIIVALMQKATINVGGHLLNAITIE 414

Query: 1230 HFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            HFILR PYHSKY    GTK D  T RS+ GLELSEPLVTFALSCGSWSSPAV
Sbjct: 415  HFILRLPYHSKY----GTKNDEKTARSIFGLELSEPLVTFALSCGSWSSPAV 462


>gb|EMJ24106.1| hypothetical protein PRUPE_ppa003634mg [Prunus persica]
          Length = 498

 Score =  420 bits (1079), Expect = e-114
 Identities = 245/472 (51%), Positives = 299/472 (63%), Gaps = 66/472 (13%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MNT+ +  L  MK  LK+EKE+V+MQ   + D  K  T+ RGS RERK+ALQQDVDKLKK
Sbjct: 1    MNTRVKTRLQSMKPPLKNEKEKVKMQGSKLRDTAKTITNGRGSRRERKLALQQDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LRHEENVHRAL+RAFTRPLGALPRLPPYLP +T                      HFR+
Sbjct: 61   KLRHEENVHRALQRAFTRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQLVHFRK 120

Query: 525  GLYQEAVYNSSSKKNID------------NTKQHSPNSKVQS------------------ 614
             LYQEAV  S+SK+ ++            N KQ  P S+ Q                   
Sbjct: 121  DLYQEAVNISTSKRKMETSADLCDSYPIKNPKQEQPKSQAQKANKSTNATEKHWPSPSDD 180

Query: 615  --------------------------VKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXX 716
                                      V+TPV++ PI  +  ++  DPQ+LQL        
Sbjct: 181  KQGKENQSSNNSTKKNNEKSLIHKAQVRTPVKRPPIDHKTAEKRSDPQKLQLEYRVMD-- 238

Query: 717  XXXXKNSANQEEVSTVGD-SPNKISESILKCLMNIFLRMSSKKN--KSITSFESLAYAEY 887
                + SA   + +  GD SPNKISE+ILKCL +I +RMSS K   +S+ SF +LA  E 
Sbjct: 239  ----QESAQVPDKAMSGDESPNKISENILKCLSSILMRMSSAKGSAESLPSFSTLAAQEN 294

Query: 888  S------DPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKI 1049
            +      DPY +CS+FG+RDIGPYK L AIEA +I+PN+T  ++FL++RLKLL ++LA +
Sbjct: 295  NEPKESWDPYAICSEFGRRDIGPYKQLHAIEAETINPNQTANALFLLRRLKLLLRKLASV 354

Query: 1050 NLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIE 1229
            NL+ LSHQEKLAFWINIYNSCMMNAF+E+GI ES E +V LMQKAT+NVGGH+LNAITIE
Sbjct: 355  NLQHLSHQEKLAFWINIYNSCMMNAFLEHGIPESPEIIVALMQKATINVGGHLLNAITIE 414

Query: 1230 HFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            HFILR PYHSKY    GTK D  T RS+ GLELSEPLVTFALSCGSWSSPAV
Sbjct: 415  HFILRLPYHSKY----GTKNDEKTARSIFGLELSEPLVTFALSCGSWSSPAV 462


>ref|XP_006374990.1| hypothetical protein POPTR_0014s03410g [Populus trichocarpa]
            gi|550323304|gb|ERP52787.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
          Length = 572

 Score =  410 bits (1055), Expect = e-112
 Identities = 234/473 (49%), Positives = 291/473 (61%), Gaps = 67/473 (14%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MNT+ R  LH MKA +KHEKE+V MQ      A+K    R+ S RERK+ALQ+DVDKLKK
Sbjct: 1    MNTRVRTRLHSMKAPMKHEKEKVGMQGSKPNVAKKAANKRQASSRERKIALQEDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LRHEEN+ RALERAF+RPLGALPRLPPYLP  T                      HFRQ
Sbjct: 61   QLRHEENIRRALERAFSRPLGALPRLPPYLPRTTLELLAEVAVLEEEVVRLEEQVVHFRQ 120

Query: 525  GLYQEAVYNSSSKKNIDN-------------------------------TKQHSPN---- 599
             LYQEAVY SSSK+N+++                               T +H P+    
Sbjct: 121  DLYQEAVYMSSSKRNVESVSDLYHLYPNKNPKPDQSKSLAQNVDESATSTIRHLPSLSDG 180

Query: 600  ---------------------SKVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXX 716
                                 +K Q+ +  V++     R  ++ LD  + QL        
Sbjct: 181  TGKENAFSTANSRKNSKGSSINKAQTSRNMVKRPSEDNRPAEKKLDSHKSQLECRVPDQE 240

Query: 717  XXXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNK----------SITSFE 866
                ++     E  T   SPNK+SE ILKCL +IFLRMSS  N+          ++ S E
Sbjct: 241  NAEARSHVTASEGVTGDASPNKLSEDILKCLSSIFLRMSSMNNRRTADNLSFLSTLVSQE 300

Query: 867  SLAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAK 1046
            +   AE  DPY +CS+FGKRDIGPYK L +IE+ +I+PNRT+ S+FL+ RL+LLF +LA 
Sbjct: 301  NEEEAECQDPYGICSEFGKRDIGPYKRLFSIESGTINPNRTSNSLFLLHRLELLFGKLAS 360

Query: 1047 INLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITI 1226
            +NL++L+HQ+KLAFWINIYNSCMMNAF+E+GI ES E VVELM+KAT+N+GGH+LNAITI
Sbjct: 361  VNLQNLTHQKKLAFWINIYNSCMMNAFLEHGIPESPETVVELMRKATINIGGHLLNAITI 420

Query: 1227 EHFILRQPYHSKYTFSKGTKYDVM-TRSVLGLELSEPLVTFALSCGSWSSPAV 1382
            EHFILR PY+SKYT SKG K D M  R+  GLELSEPLV+FAL CGSWSSPAV
Sbjct: 421  EHFILRLPYYSKYTISKGAKNDEMAARNKFGLELSEPLVSFALCCGSWSSPAV 473


>ref|XP_006374991.1| hypothetical protein POPTR_0014s03410g [Populus trichocarpa]
            gi|566202237|ref|XP_006374992.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
            gi|550323305|gb|ERP52788.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
            gi|550323306|gb|ERP52789.1| hypothetical protein
            POPTR_0014s03410g [Populus trichocarpa]
          Length = 573

 Score =  410 bits (1054), Expect = e-112
 Identities = 234/474 (49%), Positives = 291/474 (61%), Gaps = 68/474 (14%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MNT+ R  LH MKA +KHEKE+V MQ      A+K    R+ S RERK+ALQ+DVDKLKK
Sbjct: 1    MNTRVRTRLHSMKAPMKHEKEKVGMQGSKPNVAKKAANKRQASSRERKIALQEDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LRHEEN+ RALERAF+RPLGALPRLPPYLP  T                      HFRQ
Sbjct: 61   QLRHEENIRRALERAFSRPLGALPRLPPYLPRTTLELLAEVAVLEEEVVRLEEQVVHFRQ 120

Query: 525  GLYQEAVYNSSSKKNIDN-------------------------------TKQHSPN---- 599
             LYQEAVY SSSK+N+++                               T +H P+    
Sbjct: 121  DLYQEAVYMSSSKRNVESVSDLYHLYPNKNPKPDQSKSLAQNVDESATSTIRHLPSLSAD 180

Query: 600  ----------------------SKVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXX 713
                                  +K Q+ +  V++     R  ++ LD  + QL       
Sbjct: 181  GTGKENAFSTANSRKNSKGSSINKAQTSRNMVKRPSEDNRPAEKKLDSHKSQLECRVPDQ 240

Query: 714  XXXXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNK----------SITSF 863
                 ++     E  T   SPNK+SE ILKCL +IFLRMSS  N+          ++ S 
Sbjct: 241  ENAEARSHVTASEGVTGDASPNKLSEDILKCLSSIFLRMSSMNNRRTADNLSFLSTLVSQ 300

Query: 864  ESLAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLA 1043
            E+   AE  DPY +CS+FGKRDIGPYK L +IE+ +I+PNRT+ S+FL+ RL+LLF +LA
Sbjct: 301  ENEEEAECQDPYGICSEFGKRDIGPYKRLFSIESGTINPNRTSNSLFLLHRLELLFGKLA 360

Query: 1044 KINLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAIT 1223
             +NL++L+HQ+KLAFWINIYNSCMMNAF+E+GI ES E VVELM+KAT+N+GGH+LNAIT
Sbjct: 361  SVNLQNLTHQKKLAFWINIYNSCMMNAFLEHGIPESPETVVELMRKATINIGGHLLNAIT 420

Query: 1224 IEHFILRQPYHSKYTFSKGTKYDVM-TRSVLGLELSEPLVTFALSCGSWSSPAV 1382
            IEHFILR PY+SKYT SKG K D M  R+  GLELSEPLV+FAL CGSWSSPAV
Sbjct: 421  IEHFILRLPYYSKYTISKGAKNDEMAARNKFGLELSEPLVSFALCCGSWSSPAV 474


>ref|XP_004291137.1| PREDICTED: uncharacterized protein LOC101314149 [Fragaria vesca
            subsp. vesca]
          Length = 563

 Score =  406 bits (1043), Expect = e-110
 Identities = 235/465 (50%), Positives = 291/465 (62%), Gaps = 59/465 (12%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERV---EMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDK 335
            MNT+ R  L  MKA  K  ++ +   EMQ     +  K  +  + S RERK+ALQQDVDK
Sbjct: 1    MNTRVRTKLQSMKAPTKKNEKDIKVEEMQGSKERNITKAISLGKASRRERKLALQQDVDK 60

Query: 336  LKKRLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXH 515
            LKK+LRHEENVHRAL+RAF RPLGALPRLPPYLP +T                      H
Sbjct: 61   LKKKLRHEENVHRALQRAFNRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVH 120

Query: 516  FRQGLYQEAVYNSSSKKNI------------------------DNTKQHSPN-------- 599
            FR+ LYQEAV  S+SK+++                        D   +HSP+        
Sbjct: 121  FRKDLYQEAVNISTSKRSLETSAELCDSNPKKNHKFHGSIVDTDTAVKHSPSPSDHKQEN 180

Query: 600  ------------SKVQS---VKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXXXXXXKN 734
                        S + S   V+TP ++ P+  ++ ++ LD  +LQL            + 
Sbjct: 181  QSCNPSMKNNKKSLIHSNAQVRTPAKRPPVDPKIAQKRLDSPKLQLEVRVTEQESTEARL 240

Query: 735  SANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKN--KSITSFESLAYA------EYS 890
            S+  E+  +  DSPNKISE+ILKCL +IF+RMSS K   ++  SF +L         E+ 
Sbjct: 241  SSIPEKKPSGDDSPNKISENILKCLSSIFMRMSSAKGITENQPSFSTLGIQQSNEKPEFW 300

Query: 891  DPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKINLKDLSH 1070
            DPY +CS+FG+RDIGPYK L A+EA SI+PNRT  S+FL++RLKLL  +LA +NLK L H
Sbjct: 301  DPYGICSEFGRRDIGPYKQLHAVEARSINPNRTASSLFLLRRLKLLLGKLASVNLKSLGH 360

Query: 1071 QEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIEHFILRQP 1250
            QEKLAFWINIYNSCMMNAF+E+GI E  E +V LMQKAT+NVGGH+L+AITIEHFILR P
Sbjct: 361  QEKLAFWINIYNSCMMNAFLEHGIPERPEIIVALMQKATINVGGHLLSAITIEHFILRLP 420

Query: 1251 YHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            YHSKYTFSKG K D  T RS+  LELSEPLVTFALSCGSWSSPAV
Sbjct: 421  YHSKYTFSKGAKNDENTARSIFALELSEPLVTFALSCGSWSSPAV 465


>gb|EOY14572.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 533

 Score =  404 bits (1038), Expect = e-110
 Identities = 228/436 (52%), Positives = 273/436 (62%), Gaps = 62/436 (14%)
 Frame = +3

Query: 261  AEKPRTSRRGSIRERKMALQQDVDKLKKRLRHEENVHRALERAFTRPLGALPRLPPYLPT 440
            A K   +RR S +ERKM LQQDVDKLKK+LR EEN+HRALERAF RPLGALPRLPPYLP 
Sbjct: 9    ATKAMKNRRASSKERKMVLQQDVDKLKKKLRQEENIHRALERAFNRPLGALPRLPPYLPP 68

Query: 441  HTXXXXXXXXXXXXXXXXXXXXXXHFRQGLYQEAVYNSSSKKNIDNT------------- 581
             T                      HFRQ LYQEAVY SSSK+N+DN+             
Sbjct: 69   STLELLAEVAVLEEEVVRLEEKVVHFRQDLYQEAVYISSSKRNMDNSADLCEPSLDKSPK 128

Query: 582  ----------------------------KQHSPNS----------KVQSVKTPVRKHPIK 647
                                         Q   NS          K QSV+TPV +  I 
Sbjct: 129  PEQPKILTRDTSMARHLQSFSDDGRGKENQSCTNSTKSNKGSLVHKSQSVRTPVERPLID 188

Query: 648  CRLGKRHLDPQRLQLXXXXXXXXXXXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLR 827
             +  ++ +DPQ+LQL            +  +  +E     D PNK+SE ++KCL +IFLR
Sbjct: 189  SKPAEKRIDPQKLQLECRIRDQGNTEARIISTPDERRLGDDEPNKVSEELVKCLSSIFLR 248

Query: 828  MSSKKNKS----------ITSFESLAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISID 977
            MSS K KS          + S ES    E+ DPY  CS FG+RDIGPYK+L +I+A SI+
Sbjct: 249  MSSTKRKSTAEGSPSLSMLGSQESSEETEFRDPYGTCSNFGRRDIGPYKNLFSIDAGSIN 308

Query: 978  PNRTTISVFLVQRLKLLFQRLAKINLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAE 1157
            PNRT+ S+FL++RLKLL +RLA  NL +L+HQEKLAFWINIYNSCMMNAF+E+G+ +S +
Sbjct: 309  PNRTSKSLFLLRRLKLLLERLASSNLLNLNHQEKLAFWINIYNSCMMNAFLEHGVPDSPK 368

Query: 1158 KVVELMQKATLNVGGHVLNAITIEHFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEP 1334
             VVELM+KAT+NVGG +LNAITIEHFILR PYHSK+ FSKG K D MT RS+ GLELSEP
Sbjct: 369  MVVELMRKATINVGGRLLNAITIEHFILRLPYHSKFIFSKGVKNDEMTARSIFGLELSEP 428

Query: 1335 LVTFALSCGSWSSPAV 1382
            LVTFALSCGSWSSPAV
Sbjct: 429  LVTFALSCGSWSSPAV 444


>ref|XP_004141244.1| PREDICTED: uncharacterized protein LOC101211254 [Cucumis sativus]
          Length = 547

 Score =  400 bits (1027), Expect = e-108
 Identities = 232/452 (51%), Positives = 282/452 (62%), Gaps = 46/452 (10%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            M+ K R  L  M+A   HEK  V+M     +DA K  TS R S R+RK+ALQQDVDKLKK
Sbjct: 1    MDRKGRTRLQSMRASANHEKGNVDMPEANFLDAAKASTSGRVSSRQRKVALQQDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LRHEENV RAL+RAFTRPLGALPRLPP+LP +                        FRQ
Sbjct: 61   KLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVLFRQ 120

Query: 525  GLYQEAVYNSSSKKNID-----NTKQ-----------------------------HSPNS 602
             LYQEAV  SSSKK ++     N+KQ                              S   
Sbjct: 121  DLYQEAVNISSSKKTMELSPKNNSKQAQSKLSVQKTDNVVGKENESRMNSTSNNKGSSIK 180

Query: 603  KVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXXXXXXKNSANQEEVSTVGDSPNK 782
            K+ ++KTPV+K P++ +  ++   P +L L            +     ++  +  DSPN 
Sbjct: 181  KIHTIKTPVKKPPVRNKSSEKPNSP-KLNLENRTANPENAEARQLRAPDDKVSGDDSPNS 239

Query: 783  ISESILKCLMNIFLRMSSKKNKSITSFESL-----------AYAEYSDPYNLCSKFGKRD 929
            ISE+ILKCL +I LRMSS KN+  T  ESL              +  DPY +CS+FG+RD
Sbjct: 240  ISENILKCLSSILLRMSSIKNRGAT--ESLHLFSMVTTMQTEETDLPDPYGICSEFGRRD 297

Query: 930  IGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKINLKDLSHQEKLAFWINIYNS 1109
            IGPYK++  +EA SI+  RTT S+FL QRLKLL  +LA +NL+ L+HQEKLAFWINIYNS
Sbjct: 298  IGPYKNVHTVEACSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINIYNS 357

Query: 1110 CMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIEHFILRQPYHSKYTFSKGTKY 1289
            CM+NAF+E+GI ES E VV LMQKAT+NV GH+LNAITIEHFILR PYHS+Y FSK  KY
Sbjct: 358  CMINAFLEHGIPESPEMVVALMQKATINVSGHLLNAITIEHFILRLPYHSQYAFSKSAKY 417

Query: 1290 DVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            D  T RS+ GLELSEPLVTFALSCGSWSSPAV
Sbjct: 418  DEKTFRSIFGLELSEPLVTFALSCGSWSSPAV 449


>gb|EOY14573.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 546

 Score =  392 bits (1008), Expect = e-106
 Identities = 232/468 (49%), Positives = 274/468 (58%), Gaps = 62/468 (13%)
 Frame = +3

Query: 165  MNTKARRSLHPMKAHLKHEKERVEMQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKK 344
            MNT+ R SL   K   KHEKE+V+MQ      A K   +RR S +ERKM LQQDVDKLKK
Sbjct: 1    MNTRVRASLQSRKVPGKHEKEKVDMQETKPTVATKAMKNRRASSKERKMVLQQDVDKLKK 60

Query: 345  RLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQ 524
            +LR EEN+HRALERAF RPLGALPRLPPYLP  T                      HFRQ
Sbjct: 61   KLRQEENIHRALERAFNRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVRLEEKVVHFRQ 120

Query: 525  GLYQEAVYNSSSKKNIDNT----------------------------------------- 581
             LYQEAVY SSSK+N+DN+                                         
Sbjct: 121  DLYQEAVYISSSKRNMDNSADLCEPSLDKSPKPEQPKILTRDTSMARHLQSFSDDGRGKE 180

Query: 582  KQHSPNS----------KVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXXXXXXK 731
             Q   NS          K QSV+TPV +  I  +  ++ +DPQ+LQL            +
Sbjct: 181  NQSCTNSTKSNKGSLVHKSQSVRTPVERPLIDSKPAEKRIDPQKLQLECRIRDQGNTEAR 240

Query: 732  NSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNKS----------ITSFESLAYA 881
              +  +E     D PNK+SE ++KCL +IFLRMSS K KS          + S ES    
Sbjct: 241  IISTPDERRLGDDEPNKVSEELVKCLSSIFLRMSSTKRKSTAEGSPSLSMLGSQESSEET 300

Query: 882  EYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKINLKD 1061
            E+ DPY  CS FG+RDIGPYK+L +I+A SI+PNRT+ S+FL++RLKLL +RLA  NL +
Sbjct: 301  EFRDPYGTCSNFGRRDIGPYKNLFSIDAGSINPNRTSKSLFLLRRLKLLLERLASSNLLN 360

Query: 1062 LSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIEHFIL 1241
            L+HQEKLAFWINIYNSCMMN                    AT+NVGG +LNAITIEHFIL
Sbjct: 361  LNHQEKLAFWINIYNSCMMN--------------------ATINVGGRLLNAITIEHFIL 400

Query: 1242 RQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            R PYHSK+ FSKG K D MT RS+ GLELSEPLVTFALSCGSWSSPAV
Sbjct: 401  RLPYHSKFIFSKGVKNDEMTARSIFGLELSEPLVTFALSCGSWSSPAV 448


>ref|XP_006473371.1| PREDICTED: uncharacterized protein LOC102623920 isoform X2 [Citrus
            sinensis]
          Length = 539

 Score =  386 bits (992), Expect = e-104
 Identities = 220/419 (52%), Positives = 259/419 (61%), Gaps = 67/419 (15%)
 Frame = +3

Query: 327  VDKLKKRLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXXXX 506
            VDKLKK+LRHEENVHRALERAF+RPLGALPRLPPYLP  T                    
Sbjct: 23   VDKLKKKLRHEENVHRALERAFSRPLGALPRLPPYLPPSTKELLAEVAVLEEEVVRLEEQ 82

Query: 507  XXHFRQGLYQEAVYNSSSKKN-----------IDNT------------------------ 581
              HFRQ LY+EAVY SSSKKN           +D+T                        
Sbjct: 83   VVHFRQDLYREAVYISSSKKNMESSIDLCDPCVDDTNSKQEQSKFLARNVGRSTTSAIRQ 142

Query: 582  ---------------------KQHSPNSKVQSVKTPVRKHPIKCRLGKRHLDPQRLQLXX 698
                                 K+ S   KVQ+ +TPV++    C+   RHLDPQ++QL  
Sbjct: 143  LAALSADGRGKENQLCTNSMKKKGSSVHKVQTGRTPVKRPSNDCKQTMRHLDPQKIQLVC 202

Query: 699  XXXXXXXXXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNK---------- 848
                      +  +  +E  +  D PN+ISE I++CL  I LRMSS K K          
Sbjct: 203  RLQNPENEGARTISVTDERESGDDGPNRISEDIVRCLSTILLRMSSGKRKGTSENLHFLS 262

Query: 849  SITSFESLAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLKLL 1028
            ++ S ES    E  DPY +C +FGKRDIGPYKHL+AIEA SID NRT+ S+FLV+RLK+L
Sbjct: 263  TLASEESNEETESQDPYGICLQFGKRDIGPYKHLLAIEADSIDTNRTSSSMFLVRRLKIL 322

Query: 1029 FQRLAKINLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGGHV 1208
              ++A +NL++L+HQEKLAFWINIYNSCMMNAF+E GI ES E VV LMQKAT+ VGGH+
Sbjct: 323  LGKIASVNLENLNHQEKLAFWINIYNSCMMNAFLENGIPESPEMVVALMQKATIRVGGHL 382

Query: 1209 LNAITIEHFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            LNAITIEHFILR PYHSKYTFSKG K D MT R + GLELSEPLVTFALSCGSWSSPAV
Sbjct: 383  LNAITIEHFILRLPYHSKYTFSKGAKNDEMTARFMFGLELSEPLVTFALSCGSWSSPAV 441


>gb|EXB75617.1| hypothetical protein L484_026093 [Morus notabilis]
          Length = 549

 Score =  379 bits (972), Expect = e-102
 Identities = 215/421 (51%), Positives = 259/421 (61%), Gaps = 67/421 (15%)
 Frame = +3

Query: 321  QDVDKLKKRLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXXXXXXXXXXXXXXXX 500
            + VDKLKK+LRHEE+VHRALERAF RPLGALPRLPPYLP +T                  
Sbjct: 31   EKVDKLKKKLRHEESVHRALERAFNRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLE 90

Query: 501  XXXXHFRQGLYQEAVYNSSSKKNIDN--------------------------------TK 584
                HFRQ LYQEAVY SSSK+NI+N                                TK
Sbjct: 91   EQVVHFRQDLYQEAVYISSSKRNIENSADSHDPRPVKSPRPELPKFLLAPMGNSATTRTK 150

Query: 585  QHSPNS------------------------KVQSVKTPVRKHPIKCRLGKRHLDPQRLQL 692
                NS                        K Q+++  V++ P   +  ++  DPQ+LQL
Sbjct: 151  HLRTNSDDRQGKENQSCTNSTKNSKGSSIHKSQTMRASVKRPPADQKSAEKSSDPQKLQL 210

Query: 693  XXXXXXXXXXXXKNSANQEEVSTVGDSPNKISESILKCLMNIFLRMSSKKNK-------- 848
                        +    QE   +  DSPN+ISE+I+KCL NIFLRMSS KN+        
Sbjct: 211  ESRVTDNESAEARTCTVQETKVSEDDSPNRISENIMKCLCNIFLRMSSVKNRGGDESCPT 270

Query: 849  --SITSFESLAYAEYSDPYNLCSKFGKRDIGPYKHLIAIEAISIDPNRTTISVFLVQRLK 1022
              ++ + ES    E+ DPY + S+FGKRDIG YK  ++I+A SI+ NRT  S+FL++RLK
Sbjct: 271  FSNLATQESKEKREFGDPYGITSEFGKRDIGKYKQFLSIDASSINLNRTANSLFLLRRLK 330

Query: 1023 LLFQRLAKINLKDLSHQEKLAFWINIYNSCMMNAFIEYGITESAEKVVELMQKATLNVGG 1202
            LLF++LA +NL++LSHQEKLAFWINIYNSCMMN F+E+GI ES E V  LMQKA +NVGG
Sbjct: 331  LLFEKLASVNLENLSHQEKLAFWINIYNSCMMNPFLEHGIPESPEMVAALMQKAIVNVGG 390

Query: 1203 HVLNAITIEHFILRQPYHSKYTFSKGTKYDVMT-RSVLGLELSEPLVTFALSCGSWSSPA 1379
            H+LNAITIEHFILR PYHSKYTFSKG K D  T RS+ GLELSEPLVTFALSCGSWSSPA
Sbjct: 391  HLLNAITIEHFILRLPYHSKYTFSKGAKNDEKTARSIFGLELSEPLVTFALSCGSWSSPA 450

Query: 1380 V 1382
            V
Sbjct: 451  V 451


>gb|EPS58536.1| hypothetical protein M569_16277, partial [Genlisea aurea]
          Length = 468

 Score =  378 bits (971), Expect = e-102
 Identities = 215/382 (56%), Positives = 257/382 (67%), Gaps = 14/382 (3%)
 Frame = +3

Query: 279  SRRGSIRERKMALQQDVDKLKKRLRHEENVHRALERAFTRPLGALPRLPPYLPTHTXXXX 458
            SRR SIRERKMALQQDVDKL+K+LRHEENVHRALERAFTRPLGALPRLP YLP +T    
Sbjct: 1    SRRSSIRERKMALQQDVDKLQKKLRHEENVHRALERAFTRPLGALPRLPSYLPPYTLELL 60

Query: 459  XXXXXXXXXXXXXXXXXXHFRQGLYQEAVYNSSSKKNI----DNTKQHSPNSKVQSVKTP 626
                               FRQGLYQEAVY SSSK+      D  +   P SK +     
Sbjct: 61   AEVAVLEEEVVRLEEKVVRFRQGLYQEAVYTSSSKRPAAVDGDPIQLADPASKARDEG-- 118

Query: 627  VRKHPIKCRLGKRHLDPQRLQLXXXXXXXXXXXXKNSANQEEVSTVGDSPNKISESILKC 806
             +     C   KR   P  ++             +N + Q+E ++   +PN+ISE+ILKC
Sbjct: 119  -QASTTSCLRNKRPNSPA-VESKLSADPKLQAGRRNGSLQDEAAS---NPNRISETILKC 173

Query: 807  LMNIFLRMSSKKNKSITSFESLAY-------AEYSDPYNLCSKFGKRDIGPYKHLIAIEA 965
            LMNIFLRMS  ++ S ++ ESL         A++ DPY +CS+FG RDIGPYKH  A+EA
Sbjct: 174  LMNIFLRMS--RSSSSSAAESLQQHSRTPPSADFKDPYGICSRFGARDIGPYKHSFAVEA 231

Query: 966  ISIDPN-RTTISVFLVQRLKLLFQRLAKINLKDLS-HQEKLAFWINIYNSCMMNAFIEYG 1139
             S++ N +T    FLVQRLK L ++LA + LK L+ HQ+KLAFWIN+YNSCMM  F+E+G
Sbjct: 232  ASVNRNLKTNNHAFLVQRLKQLLEKLAAVELKGLTTHQDKLAFWINVYNSCMMKGFLEFG 291

Query: 1140 ITESAEKVVELMQKATLNVGGHVLNAITIEHFILRQPYHSKYTFSKGTKYDVMT-RSVLG 1316
            I +S   VV LMQKAT+NVGGHVLN++TIEHFILR PYHSKYTFSKG K D  T RS+ G
Sbjct: 292  IPDSPAMVVSLMQKATVNVGGHVLNSVTIEHFILRLPYHSKYTFSKGMKSDETTARSLFG 351

Query: 1317 LELSEPLVTFALSCGSWSSPAV 1382
            LE SEPLVTFALSCGSWSSPAV
Sbjct: 352  LEFSEPLVTFALSCGSWSSPAV 373


>ref|XP_002301197.2| hypothetical protein POPTR_0002s12990g, partial [Populus trichocarpa]
            gi|550344891|gb|EEE80470.2| hypothetical protein
            POPTR_0002s12990g, partial [Populus trichocarpa]
          Length = 444

 Score =  362 bits (930), Expect = 2e-97
 Identities = 215/444 (48%), Positives = 264/444 (59%), Gaps = 69/444 (15%)
 Frame = +3

Query: 255  MDAEKPRTS--RRGSIRERKMALQQDVDKLKKRLRHEENVHRALERAFTRPLGALPRLPP 428
            M   KP  +  RR S RERK+ALQQDVD LKK+LRHEEN+HRALERAF+RPLGALPRLPP
Sbjct: 1    MQGSKPNVAKNRRISSRERKIALQQDVDNLKKQLRHEENIHRALERAFSRPLGALPRLPP 60

Query: 429  YLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQGLYQEAVYNSSSKK-------------- 566
            YLP  T                      +F+Q LYQEAV+ SSSK+              
Sbjct: 61   YLPRATLELLAEVAVLEEEVVQLEEQIVYFKQDLYQEAVHISSSKRNMGSFSDLYNLYRI 120

Query: 567  -------------NID---------------------------NTKQHSPNS--KVQSVK 620
                         N+D                           +TK +  +S  K Q+ K
Sbjct: 121  KNPKPDQLKSSAQNLDKSATSMISHLPSLSNGTGKENAFSTANSTKNNKGSSIHKAQTSK 180

Query: 621  TPVRKHPIKCRLGKRHLDPQRLQLXXXXXXXXXXXXKNSANQEEVSTVGDSPNKISESIL 800
               +   +     ++ LD  +LQL            +      E  +  DSPNK+SE I+
Sbjct: 181  NMFKIPAVNNGSAEKTLDSPKLQLERRVTGQENVEARTVVTPGERLSGDDSPNKVSEDIM 240

Query: 801  KCLMNIFLRMSSKKNK----------SITSFESLAYAEYSDPYNLCSKFGKRDIGPYKHL 950
            KCL +IFLRMSS KNK          ++   E+    E  DPY +CS+FG RDIG YK L
Sbjct: 241  KCLSSIFLRMSSVKNKPTADDLPFSSTLVPQENGKEIECRDPYGICSEFGNRDIGSYKRL 300

Query: 951  IAIEAISIDPNRTTISVFLVQRLKLLFQRLAKINLKDLSHQEKLAFWINIYNSCMMNAFI 1130
             +IE  +I+PNRT+ S+FL+ RL+LL  +LA +NL++LSHQEKLAFWINIYNSCMMNAF+
Sbjct: 301  FSIEPGAINPNRTSNSLFLLHRLELLLGKLASVNLQNLSHQEKLAFWINIYNSCMMNAFL 360

Query: 1131 EYGITESAEKVVELMQKATLNVGGHVLNAITIEHFILRQPYHSKYTFSKGTKYDVM-TRS 1307
            E+GI ES E VVELM+KAT+N+GGH+LNAITIEHFILR PY+SKYT SKG K D M  R+
Sbjct: 361  EHGIPESPEMVVELMRKATINIGGHLLNAITIEHFILRLPYYSKYTISKGAKNDEMAARN 420

Query: 1308 VLGLELSEPLVTFALSCGSWSSPA 1379
              GLELSEPLV+FAL CGSWSSPA
Sbjct: 421  KFGLELSEPLVSFALRCGSWSSPA 444


>ref|XP_006601299.1| PREDICTED: uncharacterized protein LOC100801978 isoform X1 [Glycine
            max] gi|571539451|ref|XP_006601300.1| PREDICTED:
            uncharacterized protein LOC100801978 isoform X2 [Glycine
            max] gi|571539455|ref|XP_006601301.1| PREDICTED:
            uncharacterized protein LOC100801978 isoform X3 [Glycine
            max]
          Length = 542

 Score =  360 bits (925), Expect = 8e-97
 Identities = 215/449 (47%), Positives = 265/449 (59%), Gaps = 67/449 (14%)
 Frame = +3

Query: 237  MQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKKRLRHEENVHRALERAFTRPLGALP 416
            M +G + +A K   S + S RERK+ALQQDVD+LKK+LRHEEN+HRALERAF RPLGALP
Sbjct: 1    MMQGSLRNANK---SGKASSRERKLALQQDVDRLKKQLRHEENIHRALERAFNRPLGALP 57

Query: 417  RLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQGLYQEAVYNSSSKKNIDNTKQHSP 596
            RLPPYLP +                       HFRQ LYQEAVY SSS + ++N+    P
Sbjct: 58   RLPPYLPPYILALLAEVAVLEEEIVRLEEQVVHFRQDLYQEAVYMSSSMRKLENSVSAPP 117

Query: 597  NSKVQSVKTP-------------------------------------------------- 626
            N    ++ +P                                                  
Sbjct: 118  NKSNPTLDSPKLDKLKSLTQTAGNSTATSETKPTTTLTEDRQGKENQSCTNSSKSRQQSS 177

Query: 627  --VRKHPIKC----RLGKRHLDPQRLQLXXXXXXXXXXXXKNSANQEEVSTVGDSPNKIS 788
              + K PIK      L KR   P+R Q              +S ++   S    SPN IS
Sbjct: 178  NQMNKTPIKNIDSQSLQKRLDHPKRKQEPRVNNQQIADVRNHSPHKN--SPEAQSPNIIS 235

Query: 789  ESILKCLMNIFLRMSSKKNKSITSFESLAY----------AEYSDPYNLCSKFGKRDIGP 938
            E+ILKCL NI LRMS+ KN   T   +  +          A++ DPY +C +FGKRDIGP
Sbjct: 236  ENILKCLSNILLRMSAVKNPGSTCDMAPLWDLKPQNCDEEADFWDPYGICLEFGKRDIGP 295

Query: 939  YKHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKINLKDLSHQEKLAFWINIYNSCMM 1118
            Y+ L AI+A S +P RT  ++FL+ RLKLLF+++A +NL++L+HQEKLAFWINIYNSCMM
Sbjct: 296  YRQLCAIDAKSFNPKRTANTLFLLHRLKLLFRKVASVNLENLNHQEKLAFWINIYNSCMM 355

Query: 1119 NAFIEYGITESAEKVVELMQKATLNVGGHVLNAITIEHFILRQPYHSKYTFSKGTKYDVM 1298
            NAFIE GI E+ +  V LM+KAT+NVGGHVL+A TIEHFILR PYH K+TFSKGTK   M
Sbjct: 356  NAFIENGIPENPQMAVALMRKATINVGGHVLSATTIEHFILRLPYHWKFTFSKGTKNHQM 415

Query: 1299 T-RSVLGLELSEPLVTFALSCGSWSSPAV 1382
            T RS+ GLELSEPLVTFALS G+WSSPAV
Sbjct: 416  TARSIYGLELSEPLVTFALSSGTWSSPAV 444


>ref|XP_006595957.1| PREDICTED: uncharacterized protein LOC100817917 isoform X1 [Glycine
            max] gi|571508206|ref|XP_006595958.1| PREDICTED:
            uncharacterized protein LOC100817917 isoform X2 [Glycine
            max]
          Length = 543

 Score =  357 bits (916), Expect = 9e-96
 Identities = 211/448 (47%), Positives = 258/448 (57%), Gaps = 66/448 (14%)
 Frame = +3

Query: 237  MQRGWIMDAEKPRTSRRGSIRERKMALQQDVDKLKKRLRHEENVHRALERAFTRPLGALP 416
            M  G + +A K   SR+ S RERK+ALQQDVD LKK+LRHEEN+HRALERAF RPLGALP
Sbjct: 1    MMHGSLRNANK---SRKASSRERKLALQQDVDTLKKKLRHEENIHRALERAFNRPLGALP 57

Query: 417  RLPPYLPTHTXXXXXXXXXXXXXXXXXXXXXXHFRQGLYQEAVYNSSSKKNIDNTKQHSP 596
            RLPPYLP +                       HFRQ LYQEAVY SSS + ++N+    P
Sbjct: 58   RLPPYLPPYIPALLAEVAVLEEEIVRLEEQVVHFRQDLYQEAVYMSSSMRKLENSVSAPP 117

Query: 597  -----------------------NSKVQSVKTPVRKHPIKCRLGKRHLD-----PQRLQL 692
                                   NS   S   P    P   R GK +         R Q 
Sbjct: 118  NKSNPTMDSPKLDKLKSLTQTTGNSTATSATKPTTTLPDDNRQGKENQSCTNSSKSRKQS 177

Query: 693  XXXXXXXXXXXXKNSANQEEV---------------------------STVGDSPNKISE 791
                         N + Q+++                           S    SPN ISE
Sbjct: 178  SNQTNKTPIKKINNQSLQKKLDHPKRKKEPKVKNQQVADVRNHSPHKNSPEAQSPNIISE 237

Query: 792  SILKCLMNIFLRMSSKKNKSITSFESLAY----------AEYSDPYNLCSKFGKRDIGPY 941
            +ILKCL NI LRMS+ KN   T      +           E+ DPY +C +FGKRDIGPY
Sbjct: 238  NILKCLSNIILRMSALKNPGSTCDMPPVWDLKPHNRDEGTEFGDPYGICLEFGKRDIGPY 297

Query: 942  KHLIAIEAISIDPNRTTISVFLVQRLKLLFQRLAKINLKDLSHQEKLAFWINIYNSCMMN 1121
            K L +I+  S +P RT  ++FL+ RLKLLF++LA +NL++L+HQEKLAFWINIYNSCMMN
Sbjct: 298  KQLWSIDVKSFNPKRTANTLFLLHRLKLLFRKLASVNLENLNHQEKLAFWINIYNSCMMN 357

Query: 1122 AFIEYGITESAEKVVELMQKATLNVGGHVLNAITIEHFILRQPYHSKYTFSKGTK-YDVM 1298
            AFIE GI E+ +  V LM+KAT+NVGGHVL+A TIEHFILR PYH ++TFSKGTK +++ 
Sbjct: 358  AFIENGIPENPQMAVALMRKATINVGGHVLSATTIEHFILRLPYHWRFTFSKGTKNHEMK 417

Query: 1299 TRSVLGLELSEPLVTFALSCGSWSSPAV 1382
             RS+ G+ELSEPLVTFALS G+WSSPAV
Sbjct: 418  ARSIYGMELSEPLVTFALSSGTWSSPAV 445

