BLASTX nr result

ID: Achyranthes23_contig00026130 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00026130
         (1364 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN68624.1| hypothetical protein VITISV_010682 [Vitis vinifera]   264   6e-68
emb|CBI21048.3| unnamed protein product [Vitis vinifera]              260   1e-66
ref|XP_002285150.2| PREDICTED: uncharacterized protein LOC100266...   260   1e-66
ref|XP_002515508.1| conserved hypothetical protein [Ricinus comm...   252   2e-64
ref|XP_006450423.1| hypothetical protein CICLE_v10008661mg [Citr...   251   4e-64
ref|XP_006483371.1| PREDICTED: uncharacterized protein LOC102623...   250   9e-64
gb|EOY29473.1| Uncharacterized protein isoform 3 [Theobroma cacao]    250   9e-64
gb|EOY29471.1| Uncharacterized protein isoform 1 [Theobroma cacao]    248   6e-63
gb|EOY29472.1| Uncharacterized protein isoform 2 [Theobroma cacao]    247   9e-63
gb|EXB40145.1| hypothetical protein L484_004495 [Morus notabilis]     243   2e-61
gb|EMJ23923.1| hypothetical protein PRUPE_ppa006474mg [Prunus pe...   241   5e-61
ref|XP_002309630.1| hypothetical protein POPTR_0006s27080g [Popu...   240   1e-60
ref|XP_002324862.2| hypothetical protein POPTR_0018s01770g [Popu...   238   4e-60
ref|NP_680756.1| uncharacterized protein [Arabidopsis thaliana] ...   234   5e-59
ref|XP_002869285.1| hypothetical protein ARALYDRAFT_491499 [Arab...   234   5e-59
ref|XP_006284395.1| hypothetical protein CARUB_v10005566mg, part...   233   1e-58
ref|XP_006412493.1| hypothetical protein EUTSA_v10026043mg [Eutr...   233   2e-58
ref|XP_004292983.1| PREDICTED: uncharacterized protein LOC101295...   229   2e-57
emb|CBI28217.3| unnamed protein product [Vitis vinifera]              225   3e-56
gb|EPS60827.1| hypothetical protein M569_13973, partial [Genlise...   223   1e-55

>emb|CAN68624.1| hypothetical protein VITISV_010682 [Vitis vinifera]
          Length = 526

 Score =  264 bits (675), Expect = 6e-68
 Identities = 189/433 (43%), Positives = 230/433 (53%), Gaps = 68/433 (15%)
 Frame = +3

Query: 69   FFLSLKMPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKL 248
            F  + +MPRPGPRPYECVRRAWHS+RHQPIRGSLIQEIFR+ NEIHS  T+K KEWQEKL
Sbjct: 19   FNFNKRMPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKL 78

Query: 249  PIVVLKAEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAAL 428
            PIVVLKAEEI+YSKANSE EYMDLKTL DR NDAINTIIRR++ T ETGEFLQPCIEA+L
Sbjct: 79   PIVVLKAEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDEST-ETGEFLQPCIEASL 137

Query: 429  HLGCVPRRTSRSQRNINPRGYLSGN---------SDLTNVTEGNPKLTSPLVPHYASFL- 578
            +LGC  RR SRSQRN NPR YL+ +         S L N  +GN    S ++  YA+F+ 
Sbjct: 138  NLGCPQRRASRSQRNNNPRCYLTPSTQEPISISPSILENSPQGNHTTISQVMSRYATFIK 197

Query: 579  ---------GKEPHNLALPVKTCLNNEPFLCSQNTLPHQGSY--------SSQXXXXXXX 707
                     G EPH+ A     C   + FL S    P  G+         +S        
Sbjct: 198  PSSMSVIQPGLEPHSTAFHNNDCPTXK-FLFSSENCPPSGNKCLQMEVYPASNLCAVYPL 256

Query: 708  XXXXXSQPVEPQLMLG--DHSRNNRPE---MATPRNFF--LSNEENSRNRRDEPKPPINL 866
                  Q  E Q   G   H ++N  E   M T +N F    +     ++ D      N 
Sbjct: 257  YDGNQLQCEESQCGFGVQSHPKSNPMEPAGMGTIQNLFSYAIDPTKKPSQTDFGHVTENS 316

Query: 867  PTYECDLSLRLG---------LSSTHQGREDDSS-----GSK-----QKLKETFPMY--H 983
            P  +CDLSLRLG          +S  Q  ED  S     GSK      ++ + FP +   
Sbjct: 317  PKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDLSPRVDKQFPFFPRG 376

Query: 984  ETDGPL---------RDDSLEINLMLRKRKAKFFPSYSNEGMSNSGQLIAPRP*LMGR-- 1130
             TD PL           ++L +   +RKRKA    SY  E      Q   P   L GR  
Sbjct: 377  NTDDPLDSCLSKRSSEGENLNMEATMRKRKAVI--SYPLEDRQFCCQPKLPYNYLPGRMR 434

Query: 1131 --ENCRLVDWSFI 1163
              +  R ++W F+
Sbjct: 435  NADEGRQINWGFL 447


>emb|CBI21048.3| unnamed protein product [Vitis vinifera]
          Length = 451

 Score =  260 bits (664), Expect = 1e-66
 Identities = 177/385 (45%), Positives = 213/385 (55%), Gaps = 64/385 (16%)
 Frame = +3

Query: 87   MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
            MPRPGPRPYECVRRAWHS+RHQPIRGSLIQEIFR+ NEIHS  T+K KEWQEKLPIVVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPIVVLK 60

Query: 267  AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
            AEEI+YSKANSE EYMDLKTL DR NDAINTIIRR++ T ETGEFLQPCIEA+L+LGC  
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDEST-ETGEFLQPCIEASLNLGCPQ 119

Query: 447  RRTSRSQRNINPRGYLSGN---------SDLTNVTEGNPKLTSPLVPHYASFL------- 578
            RR SRSQRN NPR YL+ +         S L N  +GN    S ++  YA+F+       
Sbjct: 120  RRASRSQRNNNPRCYLTPSTQEPISISPSILENSPQGNHTTISQVMSRYATFIKPSSMSV 179

Query: 579  ---GKEPHNLALPVKTCLNNEPFLCSQNTLPHQGSY--------SSQXXXXXXXXXXXXS 725
               G EPH+ A     C  ++ FL S    P  G+         +S              
Sbjct: 180  IQPGLEPHSTAFHNNDCPTSK-FLFSSENCPPSGNKCLQMEVYPASNVCAVYPLYDGNQL 238

Query: 726  QPVEPQLMLG--DHSRNNRPE---MATPRNFF--LSNEENSRNRRDEPKPPINLPTYECD 884
            Q  E Q   G   H ++N  E   M T +N F    +     ++ D      N P  +CD
Sbjct: 239  QCEESQCGFGVQSHPKSNPMEPAGMGTIQNLFSYAIDPTKKPSQTDFGHVTENSPKIDCD 298

Query: 885  LSLRLG---------LSSTHQGREDDSS-----GSK-----QKLKETFPMY--HETDGPL 1001
            LSLRLG          +S  Q  ED  S     GSK      ++ + FP +    TD PL
Sbjct: 299  LSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDLSPQVDKQFPFFPRGNTDDPL 358

Query: 1002 ---------RDDSLEINLMLRKRKA 1049
                       ++L +   +RKRKA
Sbjct: 359  DSCLSKRSSEGENLNMEATMRKRKA 383


>ref|XP_002285150.2| PREDICTED: uncharacterized protein LOC100266444 [Vitis vinifera]
          Length = 414

 Score =  260 bits (664), Expect = 1e-66
 Identities = 177/385 (45%), Positives = 213/385 (55%), Gaps = 64/385 (16%)
 Frame = +3

Query: 87   MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
            MPRPGPRPYECVRRAWHS+RHQPIRGSLIQEIFR+ NEIHS  T+K KEWQEKLPIVVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPIVVLK 60

Query: 267  AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
            AEEI+YSKANSE EYMDLKTL DR NDAINTIIRR++ T ETGEFLQPCIEA+L+LGC  
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDEST-ETGEFLQPCIEASLNLGCPQ 119

Query: 447  RRTSRSQRNINPRGYLSGN---------SDLTNVTEGNPKLTSPLVPHYASFL------- 578
            RR SRSQRN NPR YL+ +         S L N  +GN    S ++  YA+F+       
Sbjct: 120  RRASRSQRNNNPRCYLTPSTQEPISISPSILENSPQGNHTTISQVMSRYATFIKPSSMSV 179

Query: 579  ---GKEPHNLALPVKTCLNNEPFLCSQNTLPHQGSY--------SSQXXXXXXXXXXXXS 725
               G EPH+ A     C  ++ FL S    P  G+         +S              
Sbjct: 180  IQPGLEPHSTAFHNNDCPTSK-FLFSSENCPPSGNKCLQMEVYPASNVCAVYPLYDGNQL 238

Query: 726  QPVEPQLMLG--DHSRNNRPE---MATPRNFF--LSNEENSRNRRDEPKPPINLPTYECD 884
            Q  E Q   G   H ++N  E   M T +N F    +     ++ D      N P  +CD
Sbjct: 239  QCEESQCGFGVQSHPKSNPMEPAGMGTIQNLFSYAIDPTKKPSQTDFGHVTENSPKIDCD 298

Query: 885  LSLRLG---------LSSTHQGREDDSS-----GSK-----QKLKETFPMY--HETDGPL 1001
            LSLRLG          +S  Q  ED  S     GSK      ++ + FP +    TD PL
Sbjct: 299  LSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDLSPQVDKQFPFFPRGNTDDPL 358

Query: 1002 ---------RDDSLEINLMLRKRKA 1049
                       ++L +   +RKRKA
Sbjct: 359  DSCLSKRSSEGENLNMEATMRKRKA 383


>ref|XP_002515508.1| conserved hypothetical protein [Ricinus communis]
            gi|223545452|gb|EEF46957.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 391

 Score =  252 bits (644), Expect = 2e-64
 Identities = 161/366 (43%), Positives = 210/366 (57%), Gaps = 43/366 (11%)
 Frame = +3

Query: 87   MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
            MPRPGPRPYECVRRAWHS+RHQP+RGSLIQEIFR+ NE+HSP T+K KEWQEKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPVRGSLIQEIFRVVNEVHSPATKKNKEWQEKLPVVVLK 60

Query: 267  AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
            AEEI+YSKANSE EYMDLKTL +R ND INTI+RR++ T ETG+ L PCIEAAL+LGC P
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWERVNDVINTIVRRDEST-ETGDLLLPCIEAALNLGCTP 119

Query: 447  RRTSRSQRNINPRGYLS-GNSDLTNVTEGNPKLTSPL--VPHY----------ASFLGKE 587
            RRTSRSQRN NPR YL+ G  +   +    P  T+ L  +P+Y          ++ LG E
Sbjct: 120  RRTSRSQRNCNPRCYLTPGTQEPNTLPPAMPTHTTSLQCIPNYLDLIKPAIVNSTHLGSE 179

Query: 588  PHNLALPVKTCLNNEPFLCSQNTLPHQGSYSSQXXXXXXXXXXXXSQPVEPQLMLGDHSR 767
              NL     +  +N+  L + N      +Y+              S       ++   S 
Sbjct: 180  LQNLVCQNISVTSNKFLLATDNGC--LSNYNQSFPMENYPMSSLYSVYPLCYGLIPVSST 237

Query: 768  NNRPEMATPRNFFLSNEENS-RNRRDEPKPPINLPTYECDLSLRLGLSSTH--------- 917
                ++   +N F   ++ + +  + +P+ P+N     CDLSLRLG  S           
Sbjct: 238  LEPGKVGVEQNLFSFGDDAAVKFNQPDPQSPLNQHETGCDLSLRLGSLSAAPLPSDKNRQ 297

Query: 918  -QGREDDSSGSKQK-LKETFPMYHETDGPLR-------DDSLE-----------INLMLR 1037
             Q  ED+  GS Q+ +K    M H TD  L        D+SL+           +N +++
Sbjct: 298  LQDFEDNGHGSFQEGIKFKTQMQH-TDDELHFVTRLNTDNSLDSCSSKLSEHASVNGIIK 356

Query: 1038 KRKAKF 1055
            KRKA F
Sbjct: 357  KRKADF 362


>ref|XP_006450423.1| hypothetical protein CICLE_v10008661mg [Citrus clementina]
           gi|557553649|gb|ESR63663.1| hypothetical protein
           CICLE_v10008661mg [Citrus clementina]
          Length = 377

 Score =  251 bits (642), Expect = 4e-64
 Identities = 148/280 (52%), Positives = 173/280 (61%), Gaps = 8/280 (2%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPYECVRRAWHSERHQP+RGSLIQEIFR+ NEIHS  T+K KEWQEKLP+VVLK
Sbjct: 1   MPRPGPRPYECVRRAWHSERHQPMRGSLIQEIFRVVNEIHSEATKKNKEWQEKLPVVVLK 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           +EEI+YSKANSE EYMDLKTL DRTNDAINTIIR ++ TE TGE L PCIEAAL+LGC+P
Sbjct: 61  SEEIMYSKANSEAEYMDLKTLLDRTNDAINTIIRLDESTE-TGELLPPCIEAALNLGCLP 119

Query: 447 RRTSRSQRNINPRGYLSGN----SDLTNVTEGNPKLTSPLVPHYASFLGKE---PHNLAL 605
           RRTSRSQRN NPR YL+      S++ NV +GN  + S  +  Y SF+ +      NL +
Sbjct: 120 RRTSRSQRNNNPRCYLNTGIQEPSNVENVPQGNHSVQSQGMAPYCSFMKQTMSATQNLVV 179

Query: 606 P-VKTCLNNEPFLCSQNTLPHQGSYSSQXXXXXXXXXXXXSQPVEPQLMLGDHSRNNRPE 782
             +  C N  PF  SQN  P                    + P  P      +    + E
Sbjct: 180 QNINGCANKLPF-ASQNVPPSGNKQCFSLE----------NYPAAPSAYPLYYGTCFKFE 228

Query: 783 MATPRNFFLSNEENSRNRRDEPKPPINLPTYECDLSLRLG 902
              P      N  +   +R     P N     CDLSLRLG
Sbjct: 229 EIPPGLENFPNPTSKNTQRYIKDTPDNPQDIGCDLSLRLG 268


>ref|XP_006483371.1| PREDICTED: uncharacterized protein LOC102623950 [Citrus sinensis]
          Length = 377

 Score =  250 bits (639), Expect = 9e-64
 Identities = 148/280 (52%), Positives = 172/280 (61%), Gaps = 8/280 (2%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPYECVRRAWHSERHQP+RGSLIQEIFR+ NEIHS  T+K KEWQEKLP+VVLK
Sbjct: 1   MPRPGPRPYECVRRAWHSERHQPMRGSLIQEIFRVVNEIHSEATKKNKEWQEKLPVVVLK 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           +EEI+YSKANSE EYMDLKTL DRTNDAINTIIR ++ TE TGE L PCIEAAL+LGC+P
Sbjct: 61  SEEIMYSKANSEAEYMDLKTLLDRTNDAINTIIRLDESTE-TGELLPPCIEAALNLGCMP 119

Query: 447 RRTSRSQRNINPRGYLSGN----SDLTNVTEGNPKLTSPLVPHYASFLGKE---PHNLAL 605
           RRTSRSQRN NPR YL+      S++ NV +GN  + S  +  Y SF+ +      NL  
Sbjct: 120 RRTSRSQRNNNPRCYLNTGIQEPSNVENVPQGNHLVQSQGMAPYCSFMKQTMSATQNLVF 179

Query: 606 P-VKTCLNNEPFLCSQNTLPHQGSYSSQXXXXXXXXXXXXSQPVEPQLMLGDHSRNNRPE 782
             +  C N  PF  SQN  P                    + P  P      +    + E
Sbjct: 180 QNINGCANKLPF-ASQNVPPSGNKQCFSLE----------NYPAAPSAYPLYYGTCFKFE 228

Query: 783 MATPRNFFLSNEENSRNRRDEPKPPINLPTYECDLSLRLG 902
              P      N  +   +R     P N     CDLSLRLG
Sbjct: 229 EIPPGLENFPNPTSKNTQRYIKDTPDNPQDIGCDLSLRLG 268


>gb|EOY29473.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 447

 Score =  250 bits (639), Expect = 9e-64
 Identities = 167/394 (42%), Positives = 202/394 (51%), Gaps = 61/394 (15%)
 Frame = +3

Query: 78   SLKMPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIV 257
            SLKMPRPGPRPY C RRAWHS+RHQP+RGSLIQEIFR+ NEIHS  T+K KEWQEKLP+V
Sbjct: 40   SLKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVV 99

Query: 258  VLKAEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLG 437
            VLKAEEI+YSKANSE EYMDLK+L DRTNDAINTII+R++ T ETGE LQPCIEAAL+LG
Sbjct: 100  VLKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDEST-ETGELLQPCIEAALNLG 158

Query: 438  CVPRRTSRSQRNINPRGYLS-GNSDLTNVTEGNPKLTSPLVPHYASF----------LGK 584
            C PRRT RSQRN NPR YLS G  +  N T+ N       +  Y+ F          LG 
Sbjct: 159  CTPRRTLRSQRNCNPRCYLSPGTQEAENTTQANLTTNPNFMASYSGFMKSTIMNVTHLGS 218

Query: 585  EPHNLALPVKTCLNNE-PFLCSQNTLPHQGSYSSQXXXXXXXXXXXXSQPVEPQLMLGDH 761
            E          C   + PF      LP     S+                V P L  G+H
Sbjct: 219  ESQKHIAQDSNCTTYKFPFASENGPLP-----SNSQCLPMEKYPPPNLYSVYP-LYYGNH 272

Query: 762  ---------------SRNNRPE---MATPRNFFLS--NEENSRNRRDEPKPPINLPTYEC 881
                           S +N  E   M    N F S  +  N+ N+ D      N     C
Sbjct: 273  LKFEEMQHGFGIFPKSISNTVEPAKMGVIDNLFSSDVDSSNNMNQTDVSNTSNNPHENAC 332

Query: 882  DLSLRL---------------------GLSSTHQGREDDSSGSKQKLKETFPMYHETD-- 992
            DLSLRL                     G +S    R  D + S  K+  +FP  +  D  
Sbjct: 333  DLSLRLGPLSIPCLSVGKSRPQVIEDTGSTSLEWNRFGDLTPSIDKMLSSFPRSNRDDPL 392

Query: 993  ------GPLRDDSLEINLMLRKRKAKFFPSYSNE 1076
                    L  + + ++  +RKRK  + P+   +
Sbjct: 393  NSSLNRWSLEGEHVNVDATMRKRKTVYGPTVDQQ 426


>gb|EOY29471.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 417

 Score =  248 bits (632), Expect = 6e-63
 Identities = 165/374 (44%), Positives = 198/374 (52%), Gaps = 41/374 (10%)
 Frame = +3

Query: 78   SLKMPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIV 257
            SLKMPRPGPRPY C RRAWHS+RHQP+RGSLIQEIFR+ NEIHS  T+K KEWQEKLP+V
Sbjct: 40   SLKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVV 99

Query: 258  VLKAEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLG 437
            VLKAEEI+YSKANSE EYMDLK+L DRTNDAINTII+R++ T ETGE LQPCIEAAL+LG
Sbjct: 100  VLKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDEST-ETGELLQPCIEAALNLG 158

Query: 438  CVPRRTSRSQRNINPRGYLS-GNSDLTNVTEGNPKLTSPLVPHYASF----------LGK 584
            C PRRT RSQRN NPR YLS G  +  N T+ N       +  Y+ F          LG 
Sbjct: 159  CTPRRTLRSQRNCNPRCYLSPGTQEAENTTQANLTTNPNFMASYSGFMKSTIMNVTHLGS 218

Query: 585  EPHNLALPVKTCLNNE-PFLCSQNTLPHQGSYSSQXXXXXXXXXXXXSQPVEPQLMLGDH 761
            E          C   + PF      LP     S+                V P L  G+H
Sbjct: 219  ESQKHIAQDSNCTTYKFPFASENGPLP-----SNSQCLPMEKYPPPNLYSVYP-LYYGNH 272

Query: 762  ---------------SRNNRPE---MATPRNFFLS--NEENSRNRRDEPKPPINLPTYEC 881
                           S +N  E   M    N F S  +  N+ N+ D      N     C
Sbjct: 273  LKFEEMQHGFGIFPKSISNTVEPAKMGVIDNLFSSDVDSSNNMNQTDVSNTSNNPHENAC 332

Query: 882  DLSLRLGL---------SSTHQGREDDSSGSKQKLKETFPMYHETDGPLRDDSLEINLML 1034
            DLSLRLG           S  Q  ED  S S +  + +          L  + + ++  +
Sbjct: 333  DLSLRLGPLSIPCLSVGKSRPQVIEDTGSTSLEWNRWS----------LEGEHVNVDATM 382

Query: 1035 RKRKAKFFPSYSNE 1076
            RKRK  + P+   +
Sbjct: 383  RKRKTVYGPTVDQQ 396


>gb|EOY29472.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 359

 Score =  247 bits (630), Expect = 9e-63
 Identities = 153/307 (49%), Positives = 175/307 (57%), Gaps = 32/307 (10%)
 Frame = +3

Query: 78  SLKMPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIV 257
           SLKMPRPGPRPY C RRAWHS+RHQP+RGSLIQEIFR+ NEIHS  T+K KEWQEKLP+V
Sbjct: 40  SLKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVV 99

Query: 258 VLKAEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLG 437
           VLKAEEI+YSKANSE EYMDLK+L DRTNDAINTII+R++ T ETGE LQPCIEAAL+LG
Sbjct: 100 VLKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDEST-ETGELLQPCIEAALNLG 158

Query: 438 CVPRRTSRSQRNINPRGYLS-GNSDLTNVTEGNPKLTSPLVPHYASF----------LGK 584
           C PRRT RSQRN NPR YLS G  +  N T+ N       +  Y+ F          LG 
Sbjct: 159 CTPRRTLRSQRNCNPRCYLSPGTQEAENTTQANLTTNPNFMASYSGFMKSTIMNVTHLGS 218

Query: 585 EPHNLALPVKTCLNNE-PFLCSQNTLPHQGSYSSQXXXXXXXXXXXXSQPVEPQLMLGDH 761
           E          C   + PF      LP     S+                V P L  G+H
Sbjct: 219 ESQKHIAQDSNCTTYKFPFASENGPLP-----SNSQCLPMEKYPPPNLYSVYP-LYYGNH 272

Query: 762 ---------------SRNNRPE---MATPRNFFLS--NEENSRNRRDEPKPPINLPTYEC 881
                          S +N  E   M    N F S  +  N+ N+ D      N     C
Sbjct: 273 LKFEEMQHGFGIFPKSISNTVEPAKMGVIDNLFSSDVDSSNNMNQTDVSNTSNNPHENAC 332

Query: 882 DLSLRLG 902
           DLSLRLG
Sbjct: 333 DLSLRLG 339


>gb|EXB40145.1| hypothetical protein L484_004495 [Morus notabilis]
          Length = 374

 Score =  243 bits (619), Expect = 2e-61
 Identities = 119/171 (69%), Positives = 139/171 (81%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPYECVRRAWHS+RHQPIRGSLI+EIFR+ANEIHS  T++ KEWQEKLP+VVLK
Sbjct: 1   MPRPGPRPYECVRRAWHSDRHQPIRGSLIKEIFRVANEIHSSSTKQNKEWQEKLPMVVLK 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EYMDLKTL DRTNDAINTIIRR++ T ETGEFLQPCIEAAL+LGC P
Sbjct: 61  AEEIMYSKANSEAEYMDLKTLWDRTNDAINTIIRRDEST-ETGEFLQPCIEAALNLGCTP 119

Query: 447 RRTSRSQRNINPRGYLSGNSDLTNVTEGNPKLTSPLVPHYASFLGKEPHNL 599
           RR+SRSQRN +PR YLS N+   + +  +           ++ L  +P +L
Sbjct: 120 RRSSRSQRNCHPRCYLSPNTPDVSPSMADNSANGSTFVRPSNHLSSDPRSL 170


>gb|EMJ23923.1| hypothetical protein PRUPE_ppa006474mg [Prunus persica]
          Length = 410

 Score =  241 bits (615), Expect = 5e-61
 Identities = 150/333 (45%), Positives = 186/333 (55%), Gaps = 43/333 (12%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPR GPRPYECVRRAWHSERHQP+RGSLI+EIFR+ NEIHS  TRK KEWQ+KLPIVVLK
Sbjct: 1   MPRSGPRPYECVRRAWHSERHQPMRGSLIKEIFRVVNEIHSSATRKNKEWQDKLPIVVLK 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EYMDLKTL DRTNDAINTIIRR++GT ETG+FLQPCIEAAL+LGC+P
Sbjct: 61  AEEIMYSKANSEAEYMDLKTLWDRTNDAINTIIRRDEGT-ETGDFLQPCIEAALNLGCIP 119

Query: 447 RRTSRSQRNINPRGYL---------SGNSDLTNVTEGNPKLTSPLVPHYASF-------- 575
           RRTSRSQR+ NP  YL            S + N ++ +    S   PH  +F        
Sbjct: 120 RRTSRSQRHANPSCYLIPITSDVPGISPSVVENASQRDYTSNSQYRPHCPNFVKPKSMTT 179

Query: 576 -LGKEPHNLALPVKTCLNNEPFLCSQNTLPH------------QGSYSS----------- 683
            LG E     +    C   +  + S+N  P               ++SS           
Sbjct: 180 QLGFESRFPVVQNNDCTTMKFRIASENIPPSGYDQFSPRESMATSNFSSYPLHYRNFPQF 239

Query: 684 -QXXXXXXXXXXXXSQPVEPQLMLGDHSRNNRPEMATPRNFFLSNEENSRNRRDEPKPPI 860
            +            S P+EP             +M    N   + ++++ N + + +   
Sbjct: 240 EELKPGFVILPKPVSDPIEP------------AKMGVISNLLCNGDKSNDNTQTDTRDYT 287

Query: 861 NLP-TYECDLSLRLGLSSTHQGREDDSSGSKQK 956
             P T  CDLSLRLG  ST     ++S   + K
Sbjct: 288 ENPCTVGCDLSLRLGPLSTQHSIGENSQPEEVK 320


>ref|XP_002309630.1| hypothetical protein POPTR_0006s27080g [Populus trichocarpa]
           gi|222855606|gb|EEE93153.1| hypothetical protein
           POPTR_0006s27080g [Populus trichocarpa]
          Length = 407

 Score =  240 bits (612), Expect = 1e-60
 Identities = 144/302 (47%), Positives = 174/302 (57%), Gaps = 30/302 (9%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPYECVRRAWHS+RHQPIRGSLIQEIFR+ NE HS  T+K KEWQEKLP+VVLK
Sbjct: 1   MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHSSTTKKNKEWQEKLPVVVLK 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EYM+LKTL DRTNDAINTIIRR++ T E GE LQPCIEAAL+LGC P
Sbjct: 61  AEEIMYSKANSEAEYMELKTLWDRTNDAINTIIRRDEST-EIGELLQPCIEAALNLGCTP 119

Query: 447 RRTSRSQRNINPRGYLSGNSDLTN---------VTEGNPKLTSPLVPHYASFLGKEPHNL 599
           RR SRSQRN NP  YLS ++   N           + N    S ++P+Y+S +     N 
Sbjct: 120 RRASRSQRNCNPSFYLSPSTQEPNTLSSGSVHSAIQANRTSNSHVLPNYSSMVKPIIMNS 179

Query: 600 ALPVKTCLN--------NEPFLCSQNTLPHQGSYSSQXXXXXXXXXXXXSQP------VE 737
             P     +        +  FL   +++P   +                  P      +E
Sbjct: 180 TPPGSESQDFVGQSNGTSNRFLFIDDSIPLSNANQCLPLGNYRIPSLCSVYPLYYGCCLE 239

Query: 738 PQLMLGDHSRN-----NRPEMATPRNFFLSNEEN--SRNRRDEPKPPINLPTYECDLSLR 896
           PQ   G   +         ++A  +NFF  NE+        D    P+      CDLSLR
Sbjct: 240 PQRGCGALPKTFPGTMEPVKVAVMQNFFPCNEDTPVKTCHADHKDSPLQPQEIGCDLSLR 299

Query: 897 LG 902
           LG
Sbjct: 300 LG 301


>ref|XP_002324862.2| hypothetical protein POPTR_0018s01770g [Populus trichocarpa]
           gi|550317816|gb|EEF03427.2| hypothetical protein
           POPTR_0018s01770g [Populus trichocarpa]
          Length = 448

 Score =  238 bits (607), Expect = 4e-60
 Identities = 149/311 (47%), Positives = 175/311 (56%), Gaps = 38/311 (12%)
 Frame = +3

Query: 84  KMPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVL 263
           KMPRPGPRPYECVRRAWHS+RHQPIRGSLIQEIFR+ NE H P T+K KEWQEKLP+VVL
Sbjct: 41  KMPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHCPATKKNKEWQEKLPVVVL 100

Query: 264 KAEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCV 443
           KAEEI+YSKANSE EYMDLKTL DR NDAINTIIRR++   ETGE LQPCIEAAL+LGC 
Sbjct: 101 KAEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESL-ETGELLQPCIEAALNLGCT 159

Query: 444 PRRTSRSQRNINPRGYLSGNSDLTNVTEGNPKLTSPLVPHYA--------SFLGKEPHNL 599
           PRR SRSQRN N R YLS ++  +N         SP   H A        S   ++  NL
Sbjct: 160 PRRASRSQRNCNLRFYLSPSTQESNT-------LSPAAVHNAIRANHISNSHCLRDYSNL 212

Query: 600 ALPVKTCLNNEPFLCSQNTLPHQGSYSS------------------------QXXXXXXX 707
             P  T +N+ P       L  QG+ +S                        +       
Sbjct: 213 VKP--TIMNSAPSGSESQDLVGQGNDTSNRFLFRSDNIPPSNVNRCLPLENYRIPSLCSV 270

Query: 708 XXXXXSQPVEPQLMLGDHSRN-----NRPEMATPRNFFLSNEENS-RNRRDEPKPPINLP 869
                   +EPQ   G   +         ++   +NFF  NE+   R  +   K  +   
Sbjct: 271 YPLYYGSCLEPQRGCGALPKTFPGTIEPVKVVAVQNFFPCNEDTPVRTSQVGHKDCLQPQ 330

Query: 870 TYECDLSLRLG 902
             ECDLSLRLG
Sbjct: 331 EIECDLSLRLG 341


>ref|NP_680756.1| uncharacterized protein [Arabidopsis thaliana]
           gi|28392894|gb|AAO41883.1| unknown protein [Arabidopsis
           thaliana] gi|28827750|gb|AAO50719.1| unknown protein
           [Arabidopsis thaliana] gi|332660636|gb|AEE86036.1|
           uncharacterized protein AT4G32295 [Arabidopsis thaliana]
          Length = 238

 Score =  234 bits (598), Expect = 5e-59
 Identities = 114/140 (81%), Positives = 123/140 (87%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPY+C+RRAWHS+RHQP+RG LIQEIFRI  EIHS  TRK  EWQEKLP+VVL+
Sbjct: 1   MPRPGPRPYDCIRRAWHSDRHQPMRGLLIQEIFRIVCEIHSQSTRKNTEWQEKLPVVVLR 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EYMD+KTL DRTNDAINTIIR  D T ETGEFLQPCIEAALHLGC P
Sbjct: 61  AEEIMYSKANSEAEYMDMKTLLDRTNDAINTIIRL-DETTETGEFLQPCIEAALHLGCTP 119

Query: 447 RRTSRSQRNINPRGYLSGNS 506
           RR SRSQRNINPR YLS +S
Sbjct: 120 RRASRSQRNINPRCYLSQDS 139


>ref|XP_002869285.1| hypothetical protein ARALYDRAFT_491499 [Arabidopsis lyrata subsp.
           lyrata] gi|297315121|gb|EFH45544.1| hypothetical protein
           ARALYDRAFT_491499 [Arabidopsis lyrata subsp. lyrata]
          Length = 242

 Score =  234 bits (598), Expect = 5e-59
 Identities = 114/140 (81%), Positives = 123/140 (87%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPY+C+RRAWHS+RHQP+RG LIQEIFRI  EIHS  TRK  EWQEKLP+VVL+
Sbjct: 1   MPRPGPRPYDCIRRAWHSDRHQPMRGLLIQEIFRIVCEIHSQSTRKNTEWQEKLPVVVLR 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EYMD+KTL DRTNDAINTIIR  D T ETGEFLQPCIEAALHLGC P
Sbjct: 61  AEEIMYSKANSEAEYMDMKTLLDRTNDAINTIIRL-DETTETGEFLQPCIEAALHLGCTP 119

Query: 447 RRTSRSQRNINPRGYLSGNS 506
           RR SRSQRNINPR YLS +S
Sbjct: 120 RRASRSQRNINPRCYLSQDS 139


>ref|XP_006284395.1| hypothetical protein CARUB_v10005566mg, partial [Capsella rubella]
           gi|482553100|gb|EOA17293.1| hypothetical protein
           CARUB_v10005566mg, partial [Capsella rubella]
          Length = 253

 Score =  233 bits (594), Expect = 1e-58
 Identities = 113/147 (76%), Positives = 129/147 (87%), Gaps = 1/147 (0%)
 Frame = +3

Query: 84  KMPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVL 263
           KMPRPGPRPY+C+RRAWHS+RHQP+RG LIQEIFRI  EIHS  T+K  EWQEKLP+VVL
Sbjct: 11  KMPRPGPRPYDCIRRAWHSDRHQPMRGLLIQEIFRIVCEIHSQSTKKNTEWQEKLPVVVL 70

Query: 264 KAEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCV 443
           +AEEI+YSKANSE EYMD+ TL DRTN+AINTIIR  D T E+GEFLQPCIEAALHLGC 
Sbjct: 71  RAEEIMYSKANSEAEYMDMTTLLDRTNEAINTIIRL-DETTESGEFLQPCIEAALHLGCT 129

Query: 444 PRRTSRSQRNINPRGYLS-GNSDLTNV 521
           PR+TSRSQRNINPR YLS G+++L N+
Sbjct: 130 PRKTSRSQRNINPRCYLSQGSTNLDNI 156


>ref|XP_006412493.1| hypothetical protein EUTSA_v10026043mg [Eutrema salsugineum]
           gi|557113663|gb|ESQ53946.1| hypothetical protein
           EUTSA_v10026043mg [Eutrema salsugineum]
          Length = 256

 Score =  233 bits (593), Expect = 2e-58
 Identities = 115/145 (79%), Positives = 124/145 (85%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPY+C+RRAWHS+RHQP+RG LIQEIFRI  EIHS  T+K  EWQEKLP+VVL+
Sbjct: 1   MPRPGPRPYDCIRRAWHSDRHQPMRGLLIQEIFRIVCEIHSQSTKKNTEWQEKLPVVVLR 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EYMDL TL DRTNDAINTIIR  D T ETGEFLQPCIEAALHLGC P
Sbjct: 61  AEEIMYSKANSEAEYMDLNTLLDRTNDAINTIIRL-DETTETGEFLQPCIEAALHLGCTP 119

Query: 447 RRTSRSQRNINPRGYLSGNSDLTNV 521
           RR SRSQRNINPR YLS   D TN+
Sbjct: 120 RRASRSQRNINPRCYLS-QQDSTNL 143


>ref|XP_004292983.1| PREDICTED: uncharacterized protein LOC101295200 [Fragaria vesca
           subsp. vesca]
          Length = 343

 Score =  229 bits (584), Expect = 2e-57
 Identities = 143/295 (48%), Positives = 169/295 (57%), Gaps = 19/295 (6%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPR GPRPY+C+RRAWHSERHQP+RGSLI+EIF + NEIHS  TRK KEWQEKLPIVVLK
Sbjct: 1   MPRSGPRPYDCIRRAWHSERHQPMRGSLIKEIFSVVNEIHSSATRKNKEWQEKLPIVVLK 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EY DLKTL DR NDAINTIIRR++  E   EFLQPCIEAAL+LGCV 
Sbjct: 61  AEEIMYSKANSEAEYTDLKTLWDRANDAINTIIRRDESIETGEEFLQPCIEAALNLGCVA 120

Query: 447 RRTSRSQRNINPRGYLSG-NSDLTNVTE---------GNPKLTSPLV---PHYASFLGKE 587
           RR SRSQR  NPR YLS   SD+ +V E            K   P+     H  S   K+
Sbjct: 121 RRASRSQRYSNPRCYLSPITSDVPSVAEKGSQKDHTPHRSKFVKPITINSSHLGSESTKK 180

Query: 588 PHNLALPVKTCLNNEPFLCS------QNTLPHQGSYSSQXXXXXXXXXXXXSQPVEPQLM 749
           P +++  V  C  ++   CS       + +P    Y               S P      
Sbjct: 181 PISVSENVPPCGYDQ---CSPRDTQATSNIPSYPLYYGNCPQFEELKHGFVSLPKPVSKP 237

Query: 750 LGDHSRNNRPEMATPRNFFLSNEENSRNRRDEPKPPINLPTYECDLSLRLGLSST 914
           L     +  P +   R+    N    +  RD P    +L    CDLSLRLG  S+
Sbjct: 238 LEPARTSGVPNLFRSRD--KPNYNTQKGARDCPDQTPDL--VGCDLSLRLGSLSS 288


>emb|CBI28217.3| unnamed protein product [Vitis vinifera]
          Length = 302

 Score =  225 bits (574), Expect = 3e-56
 Identities = 144/300 (48%), Positives = 173/300 (57%), Gaps = 25/300 (8%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPYECVRRAWHS+RHQP+RGS+IQ+IFR+  + HS  T+K +EWQEKLPIVVLK
Sbjct: 1   MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRVVTDTHSSATKKNREWQEKLPIVVLK 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EYMDL TL DR NDA+NTIIRR++ T ETGE L PCIEAAL+LGCVP
Sbjct: 61  AEEIMYSKANSETEYMDLGTLWDRVNDAVNTIIRRDEST-ETGELLPPCIEAALNLGCVP 119

Query: 447 RRTSRSQRNINPRGYLSGNSD---------LTN-VTEGNPKLTSPLVPHYASF--LGKEP 590
            R SRSQR+ NPR YL+  +          L N V E  P+L  P   +  +F  L  + 
Sbjct: 120 VRASRSQRHNNPRSYLTHRTQEPTSVSPRVLDNAVNERCPQLQPPSAGNQLTFGRLNMDS 179

Query: 591 HNLALPV-KTCLNNEPFLCSQN-TLPHQ----GSYSSQXXXXXXXXXXXXSQPVEPQLML 752
            +L L   +    N     ++N   P++    GS  S               P    L  
Sbjct: 180 THLVLDSDRHVTQNNSLATTRNFHFPYENFPLGSNQSMTVETNTPLNFGSVYP----LYY 235

Query: 753 GDHSRNNR-------PEMATPRNFFLSNEENSRNRRDEPKPPINLPTYECDLSLRLGLSS 911
           G H +N         PE A     F+     +     EP         ECDLSLRLGLSS
Sbjct: 236 GTHFQNEESHLGFQMPETANANTVFVGAPIGTSIA--EPS------EMECDLSLRLGLSS 287


>gb|EPS60827.1| hypothetical protein M569_13973, partial [Genlisea aurea]
          Length = 136

 Score =  223 bits (569), Expect = 1e-55
 Identities = 107/137 (78%), Positives = 118/137 (86%)
 Frame = +3

Query: 87  MPRPGPRPYECVRRAWHSERHQPIRGSLIQEIFRIANEIHSPETRKKKEWQEKLPIVVLK 266
           MPRPGPRPYEC RRAWHS+RHQPIRGSLIQEIFR+ NE+H   TRK  EWQEKLPIVVL+
Sbjct: 1   MPRPGPRPYECFRRAWHSDRHQPIRGSLIQEIFRLVNEVHCSSTRKNMEWQEKLPIVVLR 60

Query: 267 AEEILYSKANSEVEYMDLKTLGDRTNDAINTIIRREDGTEETGEFLQPCIEAALHLGCVP 446
           AEEI+YSKANSE EY DL TL +R NDAI+TIIRR D + ETGE L+PCIEAALHLGC P
Sbjct: 61  AEEIMYSKANSEAEYSDLSTLWNRVNDAIDTIIRR-DESSETGELLRPCIEAALHLGCTP 119

Query: 447 RRTSRSQRNINPRGYLS 497
           RR+SRSQRN +PR YLS
Sbjct: 120 RRSSRSQRNDSPRNYLS 136


Top