BLASTX nr result

ID: Catharanthus22_contig00010262

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00010262
         (1669 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004231006.1| PREDICTED: uncharacterized protein LOC101256...   381   e-103
ref|XP_006359720.1| PREDICTED: uncharacterized protein LOC102579...   375   e-101
gb|EPS72390.1| hypothetical protein M569_02372, partial [Genlise...   334   8e-89
gb|ADN34011.1| translation initiation factor [Cucumis melo subsp...   308   6e-81
ref|XP_004147022.1| PREDICTED: uncharacterized protein LOC101214...   304   9e-80
gb|EMJ00816.1| hypothetical protein PRUPE_ppa007168mg [Prunus pe...   290   2e-75
ref|XP_002275899.1| PREDICTED: uncharacterized protein LOC100262...   263   1e-67
emb|CAN77162.1| hypothetical protein VITISV_029831 [Vitis vinifera]   259   2e-66
ref|XP_006411661.1| hypothetical protein EUTSA_v10025358mg [Eutr...   256   2e-65
ref|XP_006283781.1| hypothetical protein CARUB_v10004871mg [Caps...   251   9e-64
ref|NP_195583.1| glycine-rich protein [Arabidopsis thaliana] gi|...   244   8e-62
ref|XP_004292157.1| PREDICTED: uncharacterized protein LOC101302...   243   1e-61
ref|NP_001190956.1| glycine-rich protein [Arabidopsis thaliana] ...   243   2e-61
ref|XP_002528261.1| translation initiation factor, putative [Ric...   240   2e-60
ref|XP_002868857.1| hypothetical protein ARALYDRAFT_490636 [Arab...   235   5e-59
ref|XP_003540800.1| PREDICTED: uncharacterized protein LOC100820...   233   2e-58
gb|EOX97749.1| Glycine-rich protein, putative [Theobroma cacao]       232   3e-58
gb|ESW03680.1| hypothetical protein PHAVU_011G033300g [Phaseolus...   231   5e-58
ref|XP_004505768.1| PREDICTED: uncharacterized protein LOC101488...   230   2e-57
ref|XP_002313735.2| hypothetical protein POPTR_0009s13280g [Popu...   227   1e-56

>ref|XP_004231006.1| PREDICTED: uncharacterized protein LOC101256330 [Solanum
            lycopersicum]
          Length = 422

 Score =  381 bits (978), Expect = e-103
 Identities = 224/421 (53%), Positives = 277/421 (65%), Gaps = 23/421 (5%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 1423
            MAATVSAWAKPGAWALDSEE+E EL +++  + +N  H + G +G  +DFPSL       
Sbjct: 1    MAATVSAWAKPGAWALDSEENELELQKEESVKVEN--HSNGGGAGGLADFPSLAAAATTK 58

Query: 1422 XXXXXKGQTLSLQEFASFGSVKQ--TSQAPASKGLTPDELLALPTGPRQRSAEELDRNKL 1249
                   QTLSLQEF+++ + KQ  T+ A ++KGLTP+E+L LPTGPR+R+AEELD+++L
Sbjct: 59   TKKKKP-QTLSLQEFSTYSAAKQSQTAAAASTKGLTPEEVLMLPTGPRERTAEELDQSRL 117

Query: 1248 GNGFRSYGNSYDRPG-RGSSDEQPR----RRDSNRDLAPSRADEIDDWGAAKKSTVGNXX 1084
            G GFRSYG  YDR G RGSSD+  R    RRD++R++APSRADE DDWGAAKK++ GN  
Sbjct: 118  GGGFRSYG--YDRQGGRGSSDDSRRQGGFRRDTDREIAPSRADETDDWGAAKKTSAGNGF 175

Query: 1083 XXXXXXXXXXXF-SDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGADTDNWTK 907
                       F SDSQS+AD+ DNW + K FVPS GRR+DRR  F SNG  +D+D WTK
Sbjct: 176  ERRGERGERGGFFSDSQSKADESDNWGANKAFVPSSGRRFDRRVSFGSNGSDSDSDRWTK 235

Query: 906  RKEEE-GRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE-----GVGRPRLN 745
            RKEEE GR+F S GGAFDSLRER+GG    SNG G DS+ WGKKREE     G GRP+LN
Sbjct: 236  RKEEEGGRRFASGGGAFDSLRERRGG--YDSNG-GVDSENWGKKREENGVSAGGGRPKLN 292

Query: 744  LQPRSLPVSDIQQEN---------GTAAKPKGSNPFGDARPREEVLKEKGQDPRXXXXXX 592
            LQPR+LP+S+ QQ              AKPKG+NPFG ARPREEVLKEKG+D +      
Sbjct: 293  LQPRTLPLSEGQQNGNEPVPAPVPAPVAKPKGTNPFGAARPREEVLKEKGRDWKEIDQKL 352

Query: 591  XXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPPQSDEKTED 412
                  E + S   AP                + ED+TE++WRKPE  E PP S E+T +
Sbjct: 353  ESLKVKEASESSDGAP--IPKKAWGSPNGKLIFREDKTEKSWRKPELNEVPPSSAEETVN 410

Query: 411  E 409
            E
Sbjct: 411  E 411


>ref|XP_006359720.1| PREDICTED: uncharacterized protein LOC102579361 [Solanum tuberosum]
          Length = 452

 Score =  375 bits (963), Expect = e-101
 Identities = 227/447 (50%), Positives = 278/447 (62%), Gaps = 46/447 (10%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 1423
            MAATVSAWAKPGAWALDSEE+E EL +++  + +N  H + G +G  +DFPSL       
Sbjct: 1    MAATVSAWAKPGAWALDSEENELELQKEESVKVEN--HSNGGGAGGLADFPSLAAAATTK 58

Query: 1422 XXXXXKGQTLSLQEFASFGSVK--QTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKL 1249
                   QTLSLQEF+++ + K  QT+ A A+KGLTP+E+L LPTGPR+R+AEELD+++L
Sbjct: 59   TKKKKP-QTLSLQEFSTYSAAKKSQTAAAAATKGLTPEEVLMLPTGPRERTAEELDQSRL 117

Query: 1248 GNGFRSYGNSYDRP--------------------------GRGSSDEQPR----RRDSNR 1159
            G GFRSYG  YD                            GRGSSD+  R    RRD++R
Sbjct: 118  GGGFRSYG--YDNSIFGLTPDKVLMLPTSPRERTAEELGQGRGSSDDSHRQGGFRRDTDR 175

Query: 1158 DLAPSRADEIDDWGAAKKSTVGNXXXXXXXXXXXXXF-SDSQSRADDVDNWASKKTFVPS 982
            ++APSRADE DDWGAAKK++ GN             F SDSQS+ D+ DNWA+ K FVPS
Sbjct: 176  EIAPSRADETDDWGAAKKTSAGNGFERRGERGERGGFFSDSQSKVDESDNWAANKAFVPS 235

Query: 981  EGRRYDRRGGFESNGGGADTDNWTKRKEEE-GRKFPSSGGAFDSLRERKGGSFEQSNGNG 805
             GRR+DRRG F SNG  +D+D WTKRKEEE GR+F S GGAFDSLRER+GG    SNG G
Sbjct: 236  SGRRFDRRGSFGSNGSDSDSDRWTKRKEEEGGRRFASGGGAFDSLRERRGG--YDSNG-G 292

Query: 804  PDSDIWGKKREE-----GVGRPRLNLQPRSLPVSDIQQENGTAA-------KPKGSNPFG 661
             DS+ WGKKREE     G GRP+LNLQPR+LP+S+ QQ     A       KPKG+NPFG
Sbjct: 293  VDSENWGKKREENGVSAGGGRPKLNLQPRTLPLSEGQQNGNEPAPVPVPVVKPKGANPFG 352

Query: 660  DARPREEVLKEKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDR 481
             ARPREEVLKEKGQD +            E + S   AP                + ED+
Sbjct: 353  AARPREEVLKEKGQDWKEIDQKIESLKVKEASESIDGAP--IVKKAWGSPNGKLIFREDK 410

Query: 480  TERAWRKPETVEAPPQSDEKTEDEPVE 400
            TE++WRKPE  E PP S E+T +  VE
Sbjct: 411  TEKSWRKPELNEVPPSSAEETVNGAVE 437


>gb|EPS72390.1| hypothetical protein M569_02372, partial [Genlisea aurea]
          Length = 375

 Score =  334 bits (856), Expect = 8e-89
 Identities = 196/341 (57%), Positives = 236/341 (69%), Gaps = 12/341 (3%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELL-QQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 1426
            MAATVS W KPGAWALDSEE+E+EL+ +  K+E      +S+G  G + +FPSL      
Sbjct: 1    MAATVSVWGKPGAWALDSEENESELIPKDDKEESSIAIGKSDG--GETEEFPSLSAAVSK 58

Query: 1425 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 1246
                    QT+SLQ F+ +G+ K +     +KGLTPDELL LPTGPR+RSAEEL+RNKLG
Sbjct: 59   KPKKKK-AQTVSLQHFSLYGATKPSPSE--NKGLTPDELLMLPTGPRERSAEELERNKLG 115

Query: 1245 NGFRSYGNSYDRPGRGSSDEQPRR---RDSNRDLAPSRADEIDDWGAAKK-STVGNXXXX 1078
             GFRSYG        G  D+Q RR   R+SNRD APSRADE DDWGA KK S+ G+    
Sbjct: 116  GGFRSYGG-------GIRDDQQRRNFNRESNRDFAPSRADETDDWGATKKFSSSGSGFDR 168

Query: 1077 XXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRG--GFESNGGGADTDNWTKR 904
                     F+DSQSRAD+VDNWAS K+FVPS+ RR DR+   GF++N  G D+ +W KR
Sbjct: 169  KERGDRGGFFTDSQSRADEVDNWASSKSFVPSDPRRNDRKPGFGFDTNNNGIDSSSWMKR 228

Query: 903  KEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE---GVGRPRLNLQPR 733
            KEEEGRK    GGAFDSLRER+GG FE S     DSD WG+++EE   G  RP+LNLQPR
Sbjct: 229  KEEEGRKV--VGGAFDSLRERRGGGFEPSR---VDSDNWGRRKEEVSIGGSRPKLNLQPR 283

Query: 732  SLPVSDIQQ-ENGTAAKPK-GSNPFGDARPREEVLKEKGQD 616
            +LPV + Q+ E GTA+KPK GSNPFG+ARPREEVLKEKGQD
Sbjct: 284  TLPVDEGQKSETGTASKPKGGSNPFGEARPREEVLKEKGQD 324


>gb|ADN34011.1| translation initiation factor [Cucumis melo subsp. melo]
          Length = 405

 Score =  308 bits (788), Expect = 6e-81
 Identities = 194/434 (44%), Positives = 245/434 (56%), Gaps = 31/434 (7%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 1423
            MAATVS W KPGAWALD+EEHEAELL   KD+ D   HQS      S+DFPSL       
Sbjct: 1    MAATVSPWGKPGAWALDAEEHEAELL---KDQQDQSRHQSEP----SADFPSLAAAAATK 53

Query: 1422 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGN 1243
                  GQ++ L EF ++G  +  +Q+   KGLT ++L+ LPTGPRQR+AEE+DRN+LG 
Sbjct: 54   PKKKK-GQSIPLSEFQTYGGPRPAAQSTDPKGLTAEDLMMLPTGPRQRTAEEMDRNRLGG 112

Query: 1242 GFRSYGNS--YDRPGRGSSDE-------------QPRR------RDSNRDLAPSRADEID 1126
            GF+S+G +  YDR  R S+ E             + RR      R+  R+  PSRADEID
Sbjct: 113  GFKSWGQNSLYDRGNRYSNSEDSPNSRRSSRVFDESRRSNDGSDREFRRESLPSRADEID 172

Query: 1125 DWGAAKKSTVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRR-----YDR 961
            DWGA KK  +GN             F    S+AD+ D+W S K+F PSEGRR      +R
Sbjct: 173  DWGAGKKPMMGNGFERRERGGGGGFFDSHSSKADESDSWVSSKSFTPSEGRRSGGFDRER 232

Query: 960  RGGFESNGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGK 781
            RGGF ++GGGAD+DNW       GRK   S GA   + E         NG G DSD WGK
Sbjct: 233  RGGFPTSGGGADSDNW-------GRK---SDGARAGMGE---------NGGGADSDNWGK 273

Query: 780  KRE---EGVG-RPRLNLQPRSLPVSDIQQE-NGTAAKPKGSNPFGDARPREEVLKEKGQD 616
            K E    G+G RPRLNLQPRS+P+++  QE +G A KPKGSNPFG+ARPREEVL EKGQD
Sbjct: 274  KSEGVRSGIGERPRLNLQPRSIPLNNGNQEASGVAVKPKGSNPFGNARPREEVLAEKGQD 333

Query: 615  PRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPP 436
             +            +T     ++                  P+  + R+WRKPE+ ++ P
Sbjct: 334  WKKIDEQLGSMKIKDTVERAETSSGASFERRKGFGVRSGRSPD--SGRSWRKPESADSRP 391

Query: 435  QSDEKTEDEPVEND 394
            QS E  ED P E +
Sbjct: 392  QSAELVEDGPAEEN 405


>ref|XP_004147022.1| PREDICTED: uncharacterized protein LOC101214573 [Cucumis sativus]
            gi|449489695|ref|XP_004158389.1| PREDICTED:
            uncharacterized LOC101214573 [Cucumis sativus]
          Length = 405

 Score =  304 bits (778), Expect = 9e-80
 Identities = 194/434 (44%), Positives = 245/434 (56%), Gaps = 31/434 (7%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 1423
            MAATVS W KPGAWALD+EEHEAELL   KD+ +   HQ       S+DFPSL       
Sbjct: 1    MAATVSPWGKPGAWALDAEEHEAELL---KDQEEQSRHQEEP----SADFPSLAAAAATK 53

Query: 1422 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGN 1243
                  GQ++ L EF ++G  K ++Q+   KGLT ++L+ LPTGPRQR+AEE+DRN+LG 
Sbjct: 54   PKKKK-GQSIPLSEFQTYGGPKPSAQSSDPKGLTAEDLMMLPTGPRQRTAEEMDRNRLGG 112

Query: 1242 GFRSYGNS--YDRPGRGSSDE-------------QPRR------RDSNRDLAPSRADEID 1126
            GF+S+G +  YDR  R S+ E             + RR      R+  R+  PSRADEID
Sbjct: 113  GFKSWGQNSLYDRGNRYSNSEDSPNSRRSSRVFDESRRTNDGSDREFRRESLPSRADEID 172

Query: 1125 DWGAAKKSTVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRR-----YDR 961
            DWGA KK  VGN             F    S+AD+ D+W S K+F PSEGRR      +R
Sbjct: 173  DWGAGKKPMVGNGFERRERGGGGGFFDSHSSKADESDSWVSSKSFTPSEGRRSGGFDRER 232

Query: 960  RGGFESNGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGK 781
            RGGF ++GGGAD+DNW       GRK     GA       +GG  E  NG   DS+ WGK
Sbjct: 233  RGGFPTSGGGADSDNW-------GRK---PDGA-------RGGIGE--NGGSADSENWGK 273

Query: 780  KRE---EGVG-RPRLNLQPRSLPVSDIQQE-NGTAAKPKGSNPFGDARPREEVLKEKGQD 616
            + E    G+G RPRLNLQPRS+P+++  QE +G A KPKGSNPFG+ARPREEVL EKGQD
Sbjct: 274  RSEGVRSGIGERPRLNLQPRSIPLNNGNQEASGVAVKPKGSNPFGNARPREEVLAEKGQD 333

Query: 615  PRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPP 436
             +            +T     ++                  P+  + R WRKPE+VE+ P
Sbjct: 334  WKKIDEQLESVKIKDTVERAETSSGASFERKKGFGARSGRSPD--SGRTWRKPESVESRP 391

Query: 435  QSDEKTEDEPVEND 394
            QS E  ED P E +
Sbjct: 392  QSAELVEDGPAEEN 405


>gb|EMJ00816.1| hypothetical protein PRUPE_ppa007168mg [Prunus persica]
          Length = 379

 Score =  290 bits (741), Expect = 2e-75
 Identities = 189/413 (45%), Positives = 229/413 (55%), Gaps = 17/413 (4%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 1423
            MAATVS WAKPGAWAL +EE +AEL Q    E  N  H     S   +D+PSL       
Sbjct: 1    MAATVSPWAKPGAWALAAEEQDAELEQ----ETQNARHVVEPPS---ADYPSLSVAATAK 53

Query: 1422 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGN 1243
                 KGQ +SL EF +FG+ K  +Q    +GLT  + + LPTGPR+R+AEELDRN+LG 
Sbjct: 54   PKKKNKGQKISLAEFTAFGAPKPVAQP---EGLTHQDRMHLPTGPRERTAEELDRNRLGG 110

Query: 1242 GFRSYGNSYDRPGRGSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKSTVGNXXXXXXXXX 1063
            GFRSYG+                     D   SRADEIDDWGAAKKSTVGN         
Sbjct: 111  GFRSYGS---------------------DRGNSRADEIDDWGAAKKSTVGNGFERRERGA 149

Query: 1062 XXXXFSDSQSRADDVDNWASKKTFVPSEGRRY---------DRRGGFESNGGGADTDNWT 910
                F  SQS+AD+ D+W S K+ V SEGRR+         +R+ GF S+ GGAD+DNW 
Sbjct: 150  GGSFFGGSQSKADESDSWVSNKSSVSSEGRRFGASGGGFDRERKVGFTSD-GGADSDNWG 208

Query: 909  KRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKRE-------EGVGRPR 751
            ++KEE      + G  FD  RER+ G    SNG G DS++WGKK+E       E  GRPR
Sbjct: 209  RKKEES-----NGGSGFD--RERRVGFV--SNGGGADSEVWGKKKEESNGGLSESTGRPR 259

Query: 750  LNLQPRSLPVS-DIQQENGTAAKPKGSNPFGDARPREEVLKEKGQDPRXXXXXXXXXXXX 574
            LNLQPR+LPVS +    + T  K KGSNPFG+ARPREEVL EKG+D +            
Sbjct: 260  LNLQPRTLPVSNETSPGSTTVPKSKGSNPFGEARPREEVLAEKGKDWKKIDEELESVKIK 319

Query: 573  ETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPPQSDEKTE 415
            E A  D S                     DRTERAWRKP+  +A PQS E+ E
Sbjct: 320  EVAERDHSPSFGKRSFGIGNGRAG-----DRTERAWRKPDVADARPQSAEENE 367


>ref|XP_002275899.1| PREDICTED: uncharacterized protein LOC100262348 [Vitis vinifera]
          Length = 401

 Score =  263 bits (673), Expect = 1e-67
 Identities = 179/430 (41%), Positives = 236/430 (54%), Gaps = 33/430 (7%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNH---QSNGTSGISSDFPSLXXXX 1432
            MAATVS W K GAWALDSEEHE ELLQQQ+D+  NG     +       S+DFP+L    
Sbjct: 1    MAATVSPWGKAGAWALDSEEHEDELLQQQRDDKVNGEFSGGEGRQAPEASADFPTLATAA 60

Query: 1431 XXXXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNK 1252
                     GQTLSL EF++FG+ K ++Q   +KGLT ++L+ LPTGPRQRSAEELDR +
Sbjct: 61   ATKSKKKK-GQTLSLSEFSAFGAGK-SAQPSQTKGLTHEDLMMLPTGPRQRSAEELDRGR 118

Query: 1251 LGNGFRSYGN--SYD----RPGRGSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKSTVGN 1090
            LG GFRSYG+  SY+    R G G     PR         P  ++E    G  + S+   
Sbjct: 119  LGGGFRSYGSNGSYEGGRSRYGGGEDSANPR-------WGPRGSEERRQGGFGRDSS--- 168

Query: 1089 XXXXXXXXXXXXXFSDSQSRADDVDNW-ASKKTFVPSEGRRYDRRGGFESNGGGADTDNW 913
                            + SRAD++D+W A+KK+ V +   R DR G F+S     ++ +W
Sbjct: 169  -------------RELAPSRADEIDDWGAAKKSTVGNGFERRDRGGFFDSQSRADESASW 215

Query: 912  TKRKE---EEGRKFPSSGGAFDSLRERKGGSFEQSNGNG-PDSDIWGKKREEGVG----- 760
               K     EGR+F   GG F+SLRER+GG    S+G G  DS+ WG+K+EEG G     
Sbjct: 216  VSNKSFTPSEGRRF-GGGGGFESLRERRGGFDSASDGGGGADSESWGRKKEEGSGNANGS 274

Query: 759  ---RPRLNLQPRSLPVSDIQQE-NGTAAKPKGSNPFGDARPREEVLKEKGQD-----PRX 607
               RP+L LQPR++PV+D QQ  +G+ AKPKG NPFG+ARPREEVL EKGQD      + 
Sbjct: 275  AGSRPKLILQPRTVPVNDGQQPGSGSVAKPKGPNPFGEARPREEVLAEKGQDWKEIEEKL 334

Query: 606  XXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPET-----VEA 442
                           +DG +                S PE R+E++WRKPE+      + 
Sbjct: 335  ESVKLKDVGSPGVGQTDGPS---FGKRSFGSGNARASLPESRSEKSWRKPESEDVRAAKT 391

Query: 441  PPQSDEKTED 412
              + +EKT+D
Sbjct: 392  EDEHEEKTQD 401


>emb|CAN77162.1| hypothetical protein VITISV_029831 [Vitis vinifera]
          Length = 1434

 Score =  259 bits (662), Expect = 2e-66
 Identities = 175/412 (42%), Positives = 227/412 (55%), Gaps = 28/412 (6%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNH---QSNGTSGISSDFPSLXXXX 1432
            MAATVS W K GAWALDSEEHE ELLQQQ+D+  NG     +       S+DFP+L    
Sbjct: 1    MAATVSPWGKAGAWALDSEEHEDELLQQQRDDKVNGEFSGGEGRQAPEASADFPTLATAA 60

Query: 1431 XXXXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNK 1252
                     GQTLSL EF++FG+ K ++Q   +KGLT ++L+ LPTGPRQRSAEELDR +
Sbjct: 61   ATKSKKKK-GQTLSLSEFSAFGAGK-SAQPSQTKGLTHEDLMMLPTGPRQRSAEELDRGR 118

Query: 1251 LGNGFRSYGN--SYD----RPGRGSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKSTVGN 1090
            LG GFRSYG+  SY+    R G G     PR         P  ++E    G  + S+   
Sbjct: 119  LGGGFRSYGSNGSYEGGRSRYGGGEDSANPR-------WGPRGSEERRQGGFGRDSS--- 168

Query: 1089 XXXXXXXXXXXXXFSDSQSRADDVDNW-ASKKTFVPSEGRRYDRRGGFESNGGGADTDNW 913
                            + SRAD++D+W A+KK+ V +   R DR G F+S     ++ +W
Sbjct: 169  -------------RELAPSRADEIDDWGAAKKSTVGNGFERRDRGGFFDSQSRADESASW 215

Query: 912  TKRKE---EEGRKFPSSGGAFDSLRERKGGSFEQSNGNG-PDSDIWGKKREEGVG----- 760
               K     EGR+F   GG F+SLRER+GG    S+G G  DS+ WG+K+EEG G     
Sbjct: 216  VSNKSFTPSEGRRF-GGGGGFESLRERRGGFDSASDGGGGADSESWGRKKEEGSGNANGS 274

Query: 759  ---RPRLNLQPRSLPVSDIQQE-NGTAAKPKGSNPFGDARPREEVLKEKGQD-----PRX 607
               RP+L LQPR++PV+D QQ  +G+ AKPKG NPFG+ARPREEVL EKGQD      + 
Sbjct: 275  AGSRPKLILQPRTVPVNDGQQPGSGSVAKPKGPNPFGEARPREEVLAEKGQDWKEIEEKL 334

Query: 606  XXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPET 451
                           +DG +                S PE R E++WRKPE+
Sbjct: 335  ESVKLKDVGSPGVGQTDGPS---FGKRSFGSGNARASLPESRXEKSWRKPES 383


>ref|XP_006411661.1| hypothetical protein EUTSA_v10025358mg [Eutrema salsugineum]
            gi|557112831|gb|ESQ53114.1| hypothetical protein
            EUTSA_v10025358mg [Eutrema salsugineum]
          Length = 404

 Score =  256 bits (654), Expect = 2e-65
 Identities = 186/434 (42%), Positives = 230/434 (52%), Gaps = 27/434 (6%)
 Frame = -2

Query: 1599 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 1420
            AA  S WAKPGAWALD+EE+EAEL QQ        ++Q+N     SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALDAEENEAELQQQSL-----ASNQTNS----SSDFPSLAAAATTKT 53

Query: 1419 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 1240
                 GQTLSL EFA++GSVK  S AP ++ LT DEL++LPTGPR+RSAEELDR+KLG G
Sbjct: 54   KKKK-GQTLSLAEFATYGSVKAAS-APKTERLTHDELVSLPTGPRERSAEELDRSKLGGG 111

Query: 1239 FRSYGNSYDR--PGRGSSDEQPRR------RDSNRDLAPSRADEIDDWGAAKKSTVGNXX 1084
            FRSYG    R    R S D + R       R+S RD  PSRADE D+W A KK   GN  
Sbjct: 112  FRSYGRDDSRWSSSRVSEDGEKRGGGFNRDRESGRDSGPSRADETDNWAAGKKPVGGNGF 171

Query: 1083 XXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGADTDNWTKR 904
                        S SQS+AD+VD+W S K   PSE RR        SNGG    D + +R
Sbjct: 172  ERRERGGGFFE-SQSQSKADEVDSWVSSK---PSEPRRIS-----SSNGGA---DRFERR 219

Query: 903  KEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE-------GVGRPRLN 745
                        G+F+SL   +   +    G G DSD WG++REE       G  RPRL 
Sbjct: 220  ------------GSFESLSRNRDSQY---GGGGSDSDSWGRRREEIGAPPPSGGSRPRLV 264

Query: 744  LQPRSLPVS-----DIQQENG---TAAKPKGSNPFGDARPREEVLKEKGQDPRXXXXXXX 589
            LQPR+LPV+     D+  E+    T  KPKG+NPFG+ARPREEVL EKGQD +       
Sbjct: 265  LQPRTLPVAAPAIVDVNPESAVTVTVEKPKGANPFGNARPREEVLAEKGQDWKEIEEKLD 324

Query: 588  XXXXXETA----WSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPPQSDEK 421
                 + A     SD  +P                  +DRTER+WRK     +  QS+E 
Sbjct: 325  AVKLKDVAAAIEKSDERSPGKMGFGLGNGRN------DDRTERSWRK-----STEQSEEG 373

Query: 420  TEDEPVENDDVDKE 379
             + E    ++ +KE
Sbjct: 374  AQQEEPVVEEANKE 387


>ref|XP_006283781.1| hypothetical protein CARUB_v10004871mg [Capsella rubella]
            gi|482552486|gb|EOA16679.1| hypothetical protein
            CARUB_v10004871mg [Capsella rubella]
          Length = 427

 Score =  251 bits (640), Expect = 9e-64
 Identities = 188/457 (41%), Positives = 235/457 (51%), Gaps = 48/457 (10%)
 Frame = -2

Query: 1599 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 1420
            AA  S WAKPGAWAL++EEHE EL QQ    P N       ++G SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEDELKQQAP--PSN----QKSSAGDSSDFPSLAAAATTKT 56

Query: 1419 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 1240
                 GQT+SL EFAS+GS K  + AP ++ LT  EL+ALPTGPR+RSAEELDR+KLG G
Sbjct: 57   KKKK-GQTISLAEFASYGSAK-AAPAPQTERLTQAELVALPTGPRERSAEELDRSKLGGG 114

Query: 1239 FRSYG---------NSYDRPGRGSSDEQPRR------RDSNRDLAPSRADEIDDWGAAKK 1105
            FRSYG         NS     R S D + R       R+S+RD  PSRADE D+WGA KK
Sbjct: 115  FRSYGGGRYGDDSSNSRWGSSRVSEDGERRGGGFNRDRESSRDSGPSRADEDDNWGATKK 174

Query: 1104 STVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGAD 925
               G+             F DSQS+AD+VD+W S K   PSE RRY       SNGGG  
Sbjct: 175  PIGGSGFERRERGGGGGGFFDSQSKADEVDSWVSTK---PSEPRRY-----VSSNGGG-- 224

Query: 924  TDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREEGVG----- 760
             D + KR            G+F+SL   +   +    G G +SD WG++REE  G     
Sbjct: 225  -DRFEKR------------GSFESLSRTRDSQY---GGGGSESDTWGRRREESAGADGAP 268

Query: 759  -------RPRLNLQPRSLPVSD-------IQQENGTAA---KPKGSNPFGDARPREEVLK 631
                   RPRL LQPR+LPV+        ++ E+       KPKG+NPFG+ARPREEVL 
Sbjct: 269  PPSSGGSRPRLVLQPRTLPVAVPVAVVEVVKPESPVMVAVDKPKGANPFGNARPREEVLA 328

Query: 630  EKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRK-PE 454
            EKGQD +            + A +    P                  +DRT ++WRK  E
Sbjct: 329  EKGQDWKEIDEKLEADKLKDVA-AAFEKPDEKSPGKLGFGLGNGRKDDDRTGKSWRKSTE 387

Query: 453  TVEAPPQSD---------EKTEDEP-VENDDVDKEPQ 373
              E   + D         E+TE+EP VE +D  +E +
Sbjct: 388  QSEEGAEEDEASVEEAKKEETEEEPAVEEEDKKEETE 424


>ref|NP_195583.1| glycine-rich protein [Arabidopsis thaliana]
            gi|4467158|emb|CAB37527.1| putative protein [Arabidopsis
            thaliana] gi|7270854|emb|CAB80535.1| putative protein
            [Arabidopsis thaliana] gi|17065142|gb|AAL32725.1|
            putative protein [Arabidopsis thaliana]
            gi|20259814|gb|AAM13254.1| putative protein [Arabidopsis
            thaliana] gi|332661567|gb|AEE86967.1| glycine-rich
            protein [Arabidopsis thaliana]
          Length = 452

 Score =  244 bits (623), Expect = 8e-62
 Identities = 175/441 (39%), Positives = 223/441 (50%), Gaps = 33/441 (7%)
 Frame = -2

Query: 1599 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 1420
            AA  S WAKPGAWAL++EEHEAEL QQ    P   N +S+     SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEAELKQQ----PSPTNQKSSAED--SSDFPSLAAAATTKT 56

Query: 1419 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 1240
                 GQT+SL EFA++G+ K    AP ++ LT  EL+ALPTGPR+RSAEELDR+KLG G
Sbjct: 57   KKKK-GQTISLAEFATYGTAK-AKPAPQTERLTQAELVALPTGPRERSAEELDRSKLGGG 114

Query: 1239 FRSYG-------NSYDRPGRGSSDEQPRR--------RDSNRDLAPSRADEIDDWGAAKK 1105
            FRSYG       +S  R G     E   R        R+ +RD  PSRADE D+W AAKK
Sbjct: 115  FRSYGGGRYGDESSSSRWGSSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNWAAAKK 174

Query: 1104 STVGN-XXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGA 928
               GN               S SQS+AD+VD+W S K   PSE RR+       SNGGG 
Sbjct: 175  PISGNGFERRERGSGGGFFESQSQSKADEVDSWVSTK---PSEPRRF-----VSSNGGGG 226

Query: 927  DTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE------- 769
              D + KR            G+F+SL   +   +    G G +SD WG++REE       
Sbjct: 227  --DRFEKR------------GSFESLSRNRDSQY--GGGGGSESDTWGRRREESGAANGS 270

Query: 768  ----GVGRPRLNLQPRSLPVSDIQQENGTA------AKPKGSNPFGDARPREEVLKEKGQ 619
                G  RPRL LQPR+LPV+ ++     +       KPKG+NPFG+ARPREEVL EKGQ
Sbjct: 271  PPPSGGSRPRLVLQPRTLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQ 330

Query: 618  DPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAP 439
            D +            + A +    P                  E+R ER+WRK       
Sbjct: 331  DWKEIDEKLEAEKLKDIA-AAMEKPNEKSTGKMGFGLGNGRKDEERIERSWRKSTEHSEE 389

Query: 438  PQSDEKTEDEPVENDDVDKEP 376
               +E+   E  + ++ + +P
Sbjct: 390  DAQEEEPAVEGAKKEETEDKP 410


>ref|XP_004292157.1| PREDICTED: uncharacterized protein LOC101302425 [Fragaria vesca
            subsp. vesca]
          Length = 422

 Score =  243 bits (621), Expect = 1e-61
 Identities = 184/450 (40%), Positives = 230/450 (51%), Gaps = 40/450 (8%)
 Frame = -2

Query: 1602 MAATVSA-WAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 1426
            MAATVS+ WAKPGAWALD+EEHEAEL QQ K E               +DFPSL      
Sbjct: 1    MAATVSSPWAKPGAWALDAEEHEAELEQQTKIETQP-----------LADFPSL-SAAAA 48

Query: 1425 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 1246
                  KGQ +SL EF +FG  K     P   GLT ++ L LPTGPR+R+AEELDR++  
Sbjct: 49   KPKKKSKGQKVSLAEFTTFGGPKPVQAEPV--GLTHEDRLVLPTGPRERTAEELDRSR-- 104

Query: 1245 NGFRSYGNSYDRPGRGSSDEQ--PRRRD--------SNRDLAPSRADEIDDWGAAKKSTV 1096
             GFRSYG   DR  R  S+ +   +RR+        ++RD APSRADE DDWG  KKS V
Sbjct: 105  -GFRSYGG--DRVNREESNSKWGSQRREGGGFGGEKTDRD-APSRADEADDWGVGKKS-V 159

Query: 1095 GNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPS------EGRRYDRRGGFESNGG 934
            GN                SQS+AD+ D+W S K+   S       G   +R+ GF SNGG
Sbjct: 160  GNGFERRERAGFGF---GSQSKADESDSWVSNKSSFSSLRSGGGGGFERERKVGFASNGG 216

Query: 933  GADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREEG---- 766
            GAD+++W +++EE       S G F+  RERK G    SNG G D++ WG++REE     
Sbjct: 217  GADSESWGRKREE-------SNGGFE--RERKVGLEFNSNGGGADAESWGRRREESNGGT 267

Query: 765  --VGRPRLNLQPRSLPVS----DIQQENGTAA---------KPKGSNPFGDARPREEVLK 631
               GRPRLNLQPR+LPV+     +  E    A         +P+ +NPFG ARPREEVL 
Sbjct: 268  ETTGRPRLNLQPRTLPVTLPPPVVSDETSPVAAPVAPEIVPRPRSTNPFGAARPREEVLA 327

Query: 630  EKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPET 451
            EKGQD +                   +                     DRTE AWRKP  
Sbjct: 328  EKGQDWKKIDEQLESVKLK----EKEAVAAEGESFGKRSFGMGSGRSGDRTEGAWRKPVV 383

Query: 450  VEAP----PQSDEKTEDEPVENDDVDKEPQ 373
             EA     PQS    E E   ++  + EP+
Sbjct: 384  AEAEGEARPQSAGNDEIESRSSNSEELEPE 413


>ref|NP_001190956.1| glycine-rich protein [Arabidopsis thaliana]
            gi|332661568|gb|AEE86968.1| glycine-rich protein
            [Arabidopsis thaliana]
          Length = 465

 Score =  243 bits (620), Expect = 2e-61
 Identities = 178/441 (40%), Positives = 225/441 (51%), Gaps = 33/441 (7%)
 Frame = -2

Query: 1599 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 1420
            AA  S WAKPGAWAL++EEHEAEL QQ    P   N +S+     SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEAELKQQ----PSPTNQKSSAED--SSDFPSLAAAATTKT 56

Query: 1419 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 1240
                 GQT+SL EFA++G+ K    AP ++ LT  EL+ALPTGPR+RSAEELDR+KLG G
Sbjct: 57   KKKK-GQTISLAEFATYGTAK-AKPAPQTERLTQAELVALPTGPRERSAEELDRSKLGGG 114

Query: 1239 FRSYG-------NSYDRPGRGSSDEQPRR--------RDSNRDLAPSRADEIDDWGAAKK 1105
            FRSYG       +S  R G     E   R        R+ +RD  PSRADE D+W AAKK
Sbjct: 115  FRSYGGGRYGDESSSSRWGSSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNWAAAKK 174

Query: 1104 STVGN-XXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGA 928
               GN               S SQS+AD+VD+W S K   PSE RR+       SNGGG 
Sbjct: 175  PISGNGFERRERGSGGGFFESQSQSKADEVDSWVSTK---PSEPRRF-----VSSNGGGG 226

Query: 927  DTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE------- 769
              D + KR            G+F+SL   +   +    G G +SD WG++REE       
Sbjct: 227  --DRFEKR------------GSFESLSRNRDSQY--GGGGGSESDTWGRRREESGAANGS 270

Query: 768  ----GVGRPRLNLQPRSLPVSDIQQENGTA------AKPKGSNPFGDARPREEVLKEKGQ 619
                G  RPRL LQPR+LPV+ ++     +       KPKG+NPFG+ARPREEVL EKGQ
Sbjct: 271  PPPSGGSRPRLVLQPRTLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQ 330

Query: 618  DPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAP 439
            D +            + A +    P                  E+R ER+WRK  ++ + 
Sbjct: 331  DWKEIDEKLEAEKLKDIA-AAMEKPNEKSTGKMGFGLGNGRKDEERIERSWRKSFSLHSY 389

Query: 438  PQSDEKTEDEPVENDDVDKEP 376
             + D     E  E D  ++EP
Sbjct: 390  MEVD-VLNTEHSEEDAQEEEP 409


>ref|XP_002528261.1| translation initiation factor, putative [Ricinus communis]
            gi|223532298|gb|EEF34099.1| translation initiation
            factor, putative [Ricinus communis]
          Length = 396

 Score =  240 bits (612), Expect = 2e-60
 Identities = 163/350 (46%), Positives = 202/350 (57%), Gaps = 21/350 (6%)
 Frame = -2

Query: 1602 MAATVSA-WAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 1426
            MAATVS+ W KPGAWALD+EEHE EL Q++ D   +            SDFPSL      
Sbjct: 1    MAATVSSPWGKPGAWALDAEEHEDELKQERLDSQQDKE----------SDFPSLSVAATK 50

Query: 1425 XXXXXXKGQTLSLQEFASFGSVKQTSQ-APASKGLTPDELLALPTGPRQRSAEELDRNKL 1249
                    QTLSL EFA++ S   + Q +  S+GLT ++LL LPTGPRQRSAEELDR++L
Sbjct: 51   QPKKKK-NQTLSLAEFATYSSAAASQQPSQHSRGLTHEDLLNLPTGPRQRSAEELDRSRL 109

Query: 1248 GNGFRSYG----NSYDR-----PGRGSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKS-T 1099
            G GF+SYG    N  D       G G+S    R RDS+R+L  SRADEIDDW   KKS +
Sbjct: 110  GGGFKSYGMNSRNGDDAGNSRWGGGGNSRVSSRDRDSSRELVLSRADEIDDWSKTKKSPS 169

Query: 1098 VGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGADTD 919
             GN               DSQS+AD+ D+W + K   P E RR+   GG    GGG    
Sbjct: 170  FGNERRERSSSFF-----DSQSKADESDSWVANK---PMETRRF---GG--GGGGGGSNG 216

Query: 918  NWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNG-PDSDIWGKKREE--------G 766
             + +R            G+FDSL   + GS   SNG G  DSD WG+K+E+        G
Sbjct: 217  GFERR------------GSFDSLSRDRYGS---SNGGGAADSDNWGRKKEDSNGMGSVSG 261

Query: 765  VGRPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREEVLKEKGQD 616
            + RP+L LQPRSLP+S+   +NG   KPKGS+PFG+ARPREEVL EKG+D
Sbjct: 262  IARPKLVLQPRSLPISN---DNGVGMKPKGSSPFGNARPREEVLAEKGKD 308


>ref|XP_002868857.1| hypothetical protein ARALYDRAFT_490636 [Arabidopsis lyrata subsp.
            lyrata] gi|297314693|gb|EFH45116.1| hypothetical protein
            ARALYDRAFT_490636 [Arabidopsis lyrata subsp. lyrata]
          Length = 419

 Score =  235 bits (599), Expect = 5e-59
 Identities = 169/440 (38%), Positives = 217/440 (49%), Gaps = 31/440 (7%)
 Frame = -2

Query: 1599 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 1420
            AA  S WAKPGAWAL++EEHEAEL QQ        +      +G SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEAELKQQAPPSTQKSS------AGDSSDFPSLAAAATTKT 56

Query: 1419 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 1240
                  QT+SL EFA++GS K  +Q   ++ LT  EL++LPTGPR+RSA+ELDR+KLG G
Sbjct: 57   KKKK-AQTISLAEFATYGSAKAAAQ---TERLTQAELVSLPTGPRERSADELDRSKLGGG 112

Query: 1239 FRSYG-------NSYDRPGRGSSDEQPRR--------RDSNRDLAPSRADEIDDWGAAKK 1105
            FRSYG       +S  R G     E   R        R+ +RD  PSRADE D+W AAKK
Sbjct: 113  FRSYGGGRYGDESSSSRWGSSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNWAAAKK 172

Query: 1104 STVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFES------ 943
               GN             F +SQS+AD+VD+W S K   PSE RRY++RG FES      
Sbjct: 173  PIGGNGFERRERGAGGGFF-ESQSKADEVDSWVSSK---PSEPRRYEKRGSFESLSRNRD 228

Query: 942  ----NGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKR 775
                 GG +D+D W +R+E                        E S  NG  S   G   
Sbjct: 229  SQYGGGGSSDSDTWGRRRE------------------------ESSGANGVPSPTAG--- 261

Query: 774  EEGVGRPRLNLQPRSLPVSDIQQENGTA------AKPKGSNPFGDARPREEVLKEKGQDP 613
                 RPRL LQPR+LPV+ ++     +       KPKG+NPFG+ARPREEVL EKGQD 
Sbjct: 262  ----SRPRLVLQPRTLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQDW 317

Query: 612  RXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPPQ 433
            +            + A +                       ++RTER+WRK         
Sbjct: 318  KEIDEKLEADKLKDVAAAIEKPDEKSPGKMGGFGLGNGRKDDERTERSWRK--------- 368

Query: 432  SDEKTEDEPVENDDVDKEPQ 373
            S E++E+EP   +   +E +
Sbjct: 369  STEQSEEEPAVEEAKKEEAE 388


>ref|XP_003540800.1| PREDICTED: uncharacterized protein LOC100820014 [Glycine max]
          Length = 380

 Score =  233 bits (594), Expect = 2e-58
 Identities = 172/443 (38%), Positives = 213/443 (48%), Gaps = 35/443 (7%)
 Frame = -2

Query: 1602 MAATVS-AWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 1426
            MAATVS AW+KPGAWALDSEEHEAELLQQ  + P++            +DFPSL      
Sbjct: 1    MAATVSSAWSKPGAWALDSEEHEAELLQQNNNNPNDKP---------LADFPSLAAAAAT 51

Query: 1425 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 1246
                    QT SL EF +        Q P          + LPTGPRQR+AEELDR +LG
Sbjct: 52   KPKKKK-AQTYSLAEFTAKPDSAFADQDP----------VVLPTGPRQRTAEELDRTRLG 100

Query: 1245 NGFRSYGNSYDRPGRGSSD----------------EQPRR-----RDSNRDLAPSRADEI 1129
             GFR+YG   DRP R +S                 ++PRR     RDSNR+L PSRA   
Sbjct: 101  GGFRNYG---DRPNRNNSSGGDESSNSRWGSSRVSDEPRRNGFGARDSNRELPPSRA--- 154

Query: 1128 DDWGAAKKSTVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEG---RRYDRR 958
                                              D+ DNWA+ K   PS G   R  D+ 
Sbjct: 155  ----------------------------------DETDNWAAAKK--PSGGFERRERDKG 178

Query: 957  GGFESNGGGADTDNWTKRKE---EEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIW 787
            G F+S     ++D+W   K     EGR+F S+GG F+  RER+   F  S G   DSD W
Sbjct: 179  GFFDSQSRADESDSWVSNKSFVPSEGRRFGSNGGGFE--RERRVVGFGSSGG--ADSDNW 234

Query: 786  GKKREEGV-------GRPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREEVLKE 628
              K+ E         GRP+L LQPR++ VSD   +   A KPKG NPFG+ARPRE+VL E
Sbjct: 235  NTKKGESNVGSESVGGRPKLVLQPRTVSVSDEGVDGNNAGKPKGVNPFGEARPREQVLAE 294

Query: 627  KGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETV 448
            KGQD +            E +  DG                    PE RTER+WRKP+  
Sbjct: 295  KGQDWKKIDEQLESVKIKEASGGDGFGKRGFGSSNGGGGRATL--PESRTERSWRKPQFD 352

Query: 447  EAPPQSDEKTEDEPVENDDVDKE 379
            +  P+S EK EDEP +  +V+ E
Sbjct: 353  DDRPKSAEKVEDEPDQKKEVEDE 375


>gb|EOX97749.1| Glycine-rich protein, putative [Theobroma cacao]
          Length = 369

 Score =  232 bits (592), Expect = 3e-58
 Identities = 173/439 (39%), Positives = 225/439 (51%), Gaps = 32/439 (7%)
 Frame = -2

Query: 1602 MAATVSA-WAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 1426
            MAATVS+ W KPGAWALD+EEHEAEL QQ ++  D+ + +        +DFPSL      
Sbjct: 1    MAATVSSPWGKPGAWALDAEEHEAELQQQDQNHGDSSSEKH-------ADFPSLATAAAA 53

Query: 1425 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 1246
                    QTLSL EF ++G+ K +        LT ++LL LPTGPRQRS EELDRN+LG
Sbjct: 54   KTKKKK-SQTLSLAEFTTYGAAKPSEPTR----LTHEDLLVLPTGPRQRSPEELDRNRLG 108

Query: 1245 NGFRSYG-NSYDRPGRGSSDE------QPRRRDSNRDLAPSRADEIDDWGAAKKST-VGN 1090
             GF+SYG N Y+  G  SS        +   RDSNR++APSRADEID+W +AKKST  GN
Sbjct: 109  GGFKSYGSNRYNSNGDDSSSNGRWGSSRASNRDSNREIAPSRADEIDNWASAKKSTSTGN 168

Query: 1089 XXXXXXXXXXXXXFS--DSQSRADDVDNWASKKTFVPSEGRRYDRR--GGFESNGGGADT 922
                             DSQS+AD+VDNWA+ K++  S      RR  GGFE        
Sbjct: 169  GFGGGFERRERGGGGFFDSQSKADEVDNWAANKSY-KSANEAPPRRFGGGFERRS----- 222

Query: 921  DNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE------GVG 760
                               +FDSL         QS  +  D D WGKK+EE      G  
Sbjct: 223  -------------------SFDSL---------QSRDSPRDLDNWGKKKEESGSAGSGGV 254

Query: 759  RPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREEVLKEKGQDPRXXXXXXXXXX 580
            RPRL LQPR+  V++  ++  T AKP+G+NPFG+ARPREEVLKEKG+D +          
Sbjct: 255  RPRLVLQPRT--VTEEGKKEATLAKPRGANPFGEARPREEVLKEKGKDWKEIDEKLEAVK 312

Query: 579  XXET-------------AWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAP 439
              ET             ++ +G AP                      ER+WRK ++VEA 
Sbjct: 313  IKETVAVTERGERGGKVSFGNGRAP---------------------VERSWRKSDSVEAV 351

Query: 438  PQSDEKTEDEPVENDDVDK 382
                ++++    EN  V++
Sbjct: 352  AADADQSQSS--ENGHVEE 368


>gb|ESW03680.1| hypothetical protein PHAVU_011G033300g [Phaseolus vulgaris]
          Length = 371

 Score =  231 bits (590), Expect = 5e-58
 Identities = 174/442 (39%), Positives = 212/442 (47%), Gaps = 39/442 (8%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 1423
            MAATVSAW+KPGAWA+DSEEHEAELLQQ         H +   +    DFPSL       
Sbjct: 1    MAATVSAWSKPGAWAIDSEEHEAELLQQSTI------HDTKPLA----DFPSLAVAAATK 50

Query: 1422 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGN 1243
                   QT+SL EF +        Q P          + LPTGPRQR+AEELDR +LG 
Sbjct: 51   PKKKK-AQTISLAEFTAKPDTSFADQDP----------VVLPTGPRQRTAEELDRTRLGG 99

Query: 1242 GFRSYGNSYDRPGRGSS--------------DEQPRR------RDSNRDLAPSRADEIDD 1123
            GFRSYG   DRP R SS               ++PRR      RDSNR+LAPSRA     
Sbjct: 100  GFRSYG---DRPNRNSSGDDSSNSRWGSSRVSDEPRRNGSFGARDSNRELAPSRA----- 151

Query: 1122 WGAAKKSTVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEG---RRYDRRGG 952
                                            D+ DNWA+ K   PS G   +  DR G 
Sbjct: 152  --------------------------------DETDNWAAAKK--PSGGFERKERDRGGF 177

Query: 951  FESNGGGADTDNWTKRKEE--EGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKK 778
            F+S     ++++W   K      R+F S+GG F+  RER+   F  S G   DS+ W KK
Sbjct: 178  FDSQSRADESESWVSNKSSGPSERRFGSNGGGFE--RERRVVGFGSSGG--ADSEDWNKK 233

Query: 777  REE--------------GVGRPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREE 640
            + E              G GRPRL LQPRSL VS+ +  +G   KPKG NPFG+ARPRE+
Sbjct: 234  KGESNVGTETVSVGVGVGGGRPRLVLQPRSLSVSN-EGPDGNVGKPKGVNPFGEARPREQ 292

Query: 639  VLKEKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRK 460
            VL EKGQD +            ETA  D                     PE RTER+WRK
Sbjct: 293  VLAEKGQDWKKIDEQLDSMKIKETAGGDSFGKRSFGSSNGGGRPAL---PESRTERSWRK 349

Query: 459  PETVEAPPQSDEKTEDEPVEND 394
            P++ +  P+S EK EDE VE +
Sbjct: 350  PQSDDESPKSAEKVEDEHVEEN 371


>ref|XP_004505768.1| PREDICTED: uncharacterized protein LOC101488662 [Cicer arietinum]
          Length = 384

 Score =  230 bits (586), Expect = 2e-57
 Identities = 180/448 (40%), Positives = 214/448 (47%), Gaps = 45/448 (10%)
 Frame = -2

Query: 1602 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 1423
            MAATVSAW+KPGAWALDSEEHEAELLQQ  +         N T  ++ +FPSL       
Sbjct: 1    MAATVSAWSKPGAWALDSEEHEAELLQQTNN---------NDTKPLA-EFPSLAVAAATK 50

Query: 1422 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEEL--DRNKL 1249
                   QTLSL EF +      T Q P            LPTGPRQR+AEEL  DR ++
Sbjct: 51   PKKKK-AQTLSLAEFTAKPLSSFTQQDPVD----------LPTGPRQRTAEELERDRTRI 99

Query: 1248 GNGFRSYGNSYDRPGRGSSDEQPR-----------------RRDSNRDLAP-SRADEIDD 1123
            G GFRSYG+  +R G G      R                  RDSNR+ AP SRADEID+
Sbjct: 100  GGGFRSYGDRPNRTGGGDEGSNSRWGSSRVSDDLRRNNSFGSRDSNRESAPPSRADEIDN 159

Query: 1122 WGAAKKSTVG-----NXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRR 958
            W AAKK++VG                   F DSQSRAD+ D+W S K+FVPS        
Sbjct: 160  WAAAKKTSVGVGNGFERRERDNRERGGGGFFDSQSRADESDSWVSSKSFVPS-------- 211

Query: 957  GGFESNGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKK 778
                                 EGR+F  SGG F+  RERK G        G DSD W KK
Sbjct: 212  ---------------------EGRRFGGSGGGFE--RERKVGF---GTSGGADSDNWNKK 245

Query: 777  ---------REEGV--GRPRLNLQPRSLPVSDIQQE---------NGTAAKPKGSNPFGD 658
                     R E V  GRPRL LQPRS+  S+  Q          +G  AKPKG+NPFG+
Sbjct: 246  KGEFSVGSERNESVAGGRPRLVLQPRSVSASNENQNQDVAAAAVVSGNVAKPKGANPFGE 305

Query: 657  ARPREEVLKEKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRT 478
            ARPRE+VL EKGQD +            ET   +G                     EDR+
Sbjct: 306  ARPREQVLAEKGQDWKKIDEQLESMKIKETV-VEGFGKRGFGSGNGRG--------EDRS 356

Query: 477  ERAWRKPETVEAPPQSDEKTEDEPVEND 394
            ER+WRK  + +   +S EK ED  VE +
Sbjct: 357  ERSWRKSPSEDGLSESAEKVEDVHVEEN 384


>ref|XP_002313735.2| hypothetical protein POPTR_0009s13280g [Populus trichocarpa]
            gi|550331640|gb|EEE87690.2| hypothetical protein
            POPTR_0009s13280g [Populus trichocarpa]
          Length = 387

 Score =  227 bits (579), Expect = 1e-56
 Identities = 163/438 (37%), Positives = 216/438 (49%), Gaps = 27/438 (6%)
 Frame = -2

Query: 1602 MAATVSA-WAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 1426
            MAATVS+ W+KPGAWALD+EEHEAEL Q+ ++        +    G++ +FPSL      
Sbjct: 1    MAATVSSPWSKPGAWALDAEEHEAELQQEHENSQQASTLAAQPLGGVA-EFPSLAAAAAT 59

Query: 1425 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 1246
                  K QTLSL EF+++   K + +        PD L  LPT PR+RSAEELDR +LG
Sbjct: 60   KQPKKKKNQTLSLAEFSNYSLAKSSHE--------PD-LFNLPTRPRERSAEELDRARLG 110

Query: 1245 NGFRSYGNSYDRPGR---------GSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKSTVG 1093
             GF+SYG SY   G          G+ + +   R+S+++ APSRADEIDDW   KKS  G
Sbjct: 111  GGFKSYGLSYRNGGEESNSRWGGGGNGNSRVSNRESSKEFAPSRADEIDDWSKTKKSPAG 170

Query: 1092 NXXXXXXXXXXXXXFSDSQSRADDVDNWASKKT-------FVPSEGRRYDRRGGFES--- 943
            N             F DSQS+AD+  +W S KT       FV +    ++RRG +++   
Sbjct: 171  NVYERRERERGSSFF-DSQSKADESASWVSNKTTNDGPRRFVGANNGGFERRGSYDTLSR 229

Query: 942  -----NGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKK 778
                 +GG AD+DNW ++K+E          +F+S      GS  +              
Sbjct: 230  ERHGFSGGAADSDNWGRKKDE----------SFNS------GSVGE-------------- 259

Query: 777  REEGVGRPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREEVLKEKGQDPRXXXX 598
                  RP+L LQPR+LPVSD    NG   KPKGSNPFGDARPREEVLKEKG D +    
Sbjct: 260  ------RPKLKLQPRTLPVSD---GNGAVEKPKGSNPFGDARPREEVLKEKGMDYKEIDE 310

Query: 597  XXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVE--APPQSDE 424
                         D                          ER+WRKP+  +  + PQS E
Sbjct: 311  KLDSVKISSERSKDVERSDSFGKRGFGIGRGG-----SGNERSWRKPDVADSGSRPQSAE 365

Query: 423  KTEDEPVENDDVDKEPQM 370
             TE+     D +  E ++
Sbjct: 366  TTENGNNAEDGLATEDEV 383

