BLASTX nr result

ID: Catharanthus23_contig00009478 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00009478
         (2723 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004231006.1| PREDICTED: uncharacterized protein LOC101256...   374   e-101
ref|XP_006359720.1| PREDICTED: uncharacterized protein LOC102579...   368   7e-99
gb|EPS72390.1| hypothetical protein M569_02372, partial [Genlise...   334   1e-88
gb|ADN34011.1| translation initiation factor [Cucumis melo subsp...   297   1e-77
ref|XP_004147022.1| PREDICTED: uncharacterized protein LOC101214...   293   2e-76
gb|EMJ00816.1| hypothetical protein PRUPE_ppa007168mg [Prunus pe...   285   1e-73
ref|XP_002275899.1| PREDICTED: uncharacterized protein LOC100262...   260   3e-66
emb|CAN77162.1| hypothetical protein VITISV_029831 [Vitis vinifera]   259   4e-66
ref|XP_006411661.1| hypothetical protein EUTSA_v10025358mg [Eutr...   252   7e-64
ref|XP_006283781.1| hypothetical protein CARUB_v10004871mg [Caps...   244   1e-61
ref|NP_195583.1| glycine-rich protein [Arabidopsis thaliana] gi|...   240   2e-60
ref|NP_001190956.1| glycine-rich protein [Arabidopsis thaliana] ...   240   2e-60
ref|XP_002528261.1| translation initiation factor, putative [Ric...   240   3e-60
ref|XP_004292157.1| PREDICTED: uncharacterized protein LOC101302...   239   6e-60
gb|EOX97749.1| Glycine-rich protein, putative [Theobroma cacao]       231   1e-57
ref|XP_002868857.1| hypothetical protein ARALYDRAFT_490636 [Arab...   231   2e-57
ref|XP_002313735.2| hypothetical protein POPTR_0009s13280g [Popu...   223   3e-55
ref|XP_004505768.1| PREDICTED: uncharacterized protein LOC101488...   220   3e-54
ref|XP_006851959.1| hypothetical protein AMTR_s00041p00189930 [A...   218   1e-53
gb|ESW03680.1| hypothetical protein PHAVU_011G033300g [Phaseolus...   218   1e-53

>ref|XP_004231006.1| PREDICTED: uncharacterized protein LOC101256330 [Solanum
            lycopersicum]
          Length = 422

 Score =  374 bits (961), Expect = e-101
 Identities = 220/412 (53%), Positives = 271/412 (65%), Gaps = 23/412 (5%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 2477
            MAATVSAWAKPGAWALDSEE+E EL +++  + +N  H + G +G  +DFPSL       
Sbjct: 1    MAATVSAWAKPGAWALDSEENELELQKEESVKVEN--HSNGGGAGGLADFPSLAAAATTK 58

Query: 2476 XXXXXKGQTLSLQEFASFGSVKQ--TSQAPASKGLTPDELLALPTGPRQRSAEELDRNKL 2303
                   QTLSLQEF+++ + KQ  T+ A ++KGLTP+E+L LPTGPR+R+AEELD+++L
Sbjct: 59   TKKKKP-QTLSLQEFSTYSAAKQSQTAAAASTKGLTPEEVLMLPTGPRERTAEELDQSRL 117

Query: 2302 GNGFRSYGNSYDRPG-RGSSDEQPR----RRDSNRDLAPSRADEIDDWGAAKKSTVGNXX 2138
            G GFRSYG  YDR G RGSSD+  R    RRD++R++APSRADE DDWGAAKK++ GN  
Sbjct: 118  GGGFRSYG--YDRQGGRGSSDDSRRQGGFRRDTDREIAPSRADETDDWGAAKKTSAGNGF 175

Query: 2137 XXXXXXXXXXXF-SDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGADTDNWTK 1961
                       F SDSQS+AD+ DNW + K FVPS GRR+DRR  F SNG  +D+D WTK
Sbjct: 176  ERRGERGERGGFFSDSQSKADESDNWGANKAFVPSSGRRFDRRVSFGSNGSDSDSDRWTK 235

Query: 1960 RKEEE-GRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE-----GVGRPRLN 1799
            RKEEE GR+F S GGAFDSLRER+GG    SNG G DS+ WGKKREE     G GRP+LN
Sbjct: 236  RKEEEGGRRFASGGGAFDSLRERRGG--YDSNG-GVDSENWGKKREENGVSAGGGRPKLN 292

Query: 1798 LQPRSLPVSDIQQEN---------GTAAKPKGSNPFGDARPREEVLKEKGQDPRXXXXXX 1646
            LQPR+LP+S+ QQ              AKPKG+NPFG ARPREEVLKEKG+D +      
Sbjct: 293  LQPRTLPLSEGQQNGNEPVPAPVPAPVAKPKGTNPFGAARPREEVLKEKGRDWKEIDQKL 352

Query: 1645 XXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPP 1490
                  E + S   AP                + ED+TE++WRKPE  E PP
Sbjct: 353  ESLKVKEASESSDGAP--IPKKAWGSPNGKLIFREDKTEKSWRKPELNEVPP 402


>ref|XP_006359720.1| PREDICTED: uncharacterized protein LOC102579361 [Solanum tuberosum]
          Length = 452

 Score =  368 bits (945), Expect = 7e-99
 Identities = 222/435 (51%), Positives = 271/435 (62%), Gaps = 46/435 (10%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 2477
            MAATVSAWAKPGAWALDSEE+E EL +++  + +N  H + G +G  +DFPSL       
Sbjct: 1    MAATVSAWAKPGAWALDSEENELELQKEESVKVEN--HSNGGGAGGLADFPSLAAAATTK 58

Query: 2476 XXXXXKGQTLSLQEFASFGSVK--QTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKL 2303
                   QTLSLQEF+++ + K  QT+ A A+KGLTP+E+L LPTGPR+R+AEELD+++L
Sbjct: 59   TKKKKP-QTLSLQEFSTYSAAKKSQTAAAAATKGLTPEEVLMLPTGPRERTAEELDQSRL 117

Query: 2302 GNGFRSYGNSYDRP--------------------------GRGSSDEQPR----RRDSNR 2213
            G GFRSYG  YD                            GRGSSD+  R    RRD++R
Sbjct: 118  GGGFRSYG--YDNSIFGLTPDKVLMLPTSPRERTAEELGQGRGSSDDSHRQGGFRRDTDR 175

Query: 2212 DLAPSRADEIDDWGAAKKSTVGNXXXXXXXXXXXXXF-SDSQSRADDVDNWASKKTFVPS 2036
            ++APSRADE DDWGAAKK++ GN             F SDSQS+ D+ DNWA+ K FVPS
Sbjct: 176  EIAPSRADETDDWGAAKKTSAGNGFERRGERGERGGFFSDSQSKVDESDNWAANKAFVPS 235

Query: 2035 EGRRYDRRGGFESNGGGADTDNWTKRKEEE-GRKFPSSGGAFDSLRERKGGSFEQSNGNG 1859
             GRR+DRRG F SNG  +D+D WTKRKEEE GR+F S GGAFDSLRER+GG    SNG G
Sbjct: 236  SGRRFDRRGSFGSNGSDSDSDRWTKRKEEEGGRRFASGGGAFDSLRERRGG--YDSNG-G 292

Query: 1858 PDSDIWGKKREE-----GVGRPRLNLQPRSLPVSDIQQENGTAA-------KPKGSNPFG 1715
             DS+ WGKKREE     G GRP+LNLQPR+LP+S+ QQ     A       KPKG+NPFG
Sbjct: 293  VDSENWGKKREENGVSAGGGRPKLNLQPRTLPLSEGQQNGNEPAPVPVPVVKPKGANPFG 352

Query: 1714 DARPREEVLKEKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDR 1535
             ARPREEVLKEKGQD +            E + S   AP                + ED+
Sbjct: 353  AARPREEVLKEKGQDWKEIDQKIESLKVKEASESIDGAP--IVKKAWGSPNGKLIFREDK 410

Query: 1534 TERAWRKPETVEAPP 1490
            TE++WRKPE  E PP
Sbjct: 411  TEKSWRKPELNEVPP 425


>gb|EPS72390.1| hypothetical protein M569_02372, partial [Genlisea aurea]
          Length = 375

 Score =  334 bits (856), Expect = 1e-88
 Identities = 196/341 (57%), Positives = 236/341 (69%), Gaps = 12/341 (3%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELL-QQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 2480
            MAATVS W KPGAWALDSEE+E+EL+ +  K+E      +S+G  G + +FPSL      
Sbjct: 1    MAATVSVWGKPGAWALDSEENESELIPKDDKEESSIAIGKSDG--GETEEFPSLSAAVSK 58

Query: 2479 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 2300
                    QT+SLQ F+ +G+ K +     +KGLTPDELL LPTGPR+RSAEEL+RNKLG
Sbjct: 59   KPKKKK-AQTVSLQHFSLYGATKPSPSE--NKGLTPDELLMLPTGPRERSAEELERNKLG 115

Query: 2299 NGFRSYGNSYDRPGRGSSDEQPRR---RDSNRDLAPSRADEIDDWGAAKK-STVGNXXXX 2132
             GFRSYG        G  D+Q RR   R+SNRD APSRADE DDWGA KK S+ G+    
Sbjct: 116  GGFRSYGG-------GIRDDQQRRNFNRESNRDFAPSRADETDDWGATKKFSSSGSGFDR 168

Query: 2131 XXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRG--GFESNGGGADTDNWTKR 1958
                     F+DSQSRAD+VDNWAS K+FVPS+ RR DR+   GF++N  G D+ +W KR
Sbjct: 169  KERGDRGGFFTDSQSRADEVDNWASSKSFVPSDPRRNDRKPGFGFDTNNNGIDSSSWMKR 228

Query: 1957 KEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE---GVGRPRLNLQPR 1787
            KEEEGRK    GGAFDSLRER+GG FE S     DSD WG+++EE   G  RP+LNLQPR
Sbjct: 229  KEEEGRKV--VGGAFDSLRERRGGGFEPSR---VDSDNWGRRKEEVSIGGSRPKLNLQPR 283

Query: 1786 SLPVSDIQQ-ENGTAAKPK-GSNPFGDARPREEVLKEKGQD 1670
            +LPV + Q+ E GTA+KPK GSNPFG+ARPREEVLKEKGQD
Sbjct: 284  TLPVDEGQKSETGTASKPKGGSNPFGEARPREEVLKEKGQD 324


>gb|ADN34011.1| translation initiation factor [Cucumis melo subsp. melo]
          Length = 405

 Score =  297 bits (761), Expect = 1e-77
 Identities = 188/421 (44%), Positives = 238/421 (56%), Gaps = 31/421 (7%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 2477
            MAATVS W KPGAWALD+EEHEAELL   KD+ D   HQS      S+DFPSL       
Sbjct: 1    MAATVSPWGKPGAWALDAEEHEAELL---KDQQDQSRHQSEP----SADFPSLAAAAATK 53

Query: 2476 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGN 2297
                  GQ++ L EF ++G  +  +Q+   KGLT ++L+ LPTGPRQR+AEE+DRN+LG 
Sbjct: 54   PKKKK-GQSIPLSEFQTYGGPRPAAQSTDPKGLTAEDLMMLPTGPRQRTAEEMDRNRLGG 112

Query: 2296 GFRSYGNS--YDRPGRGSSDE-------------QPRR------RDSNRDLAPSRADEID 2180
            GF+S+G +  YDR  R S+ E             + RR      R+  R+  PSRADEID
Sbjct: 113  GFKSWGQNSLYDRGNRYSNSEDSPNSRRSSRVFDESRRSNDGSDREFRRESLPSRADEID 172

Query: 2179 DWGAAKKSTVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRR-----YDR 2015
            DWGA KK  +GN             F    S+AD+ D+W S K+F PSEGRR      +R
Sbjct: 173  DWGAGKKPMMGNGFERRERGGGGGFFDSHSSKADESDSWVSSKSFTPSEGRRSGGFDRER 232

Query: 2014 RGGFESNGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGK 1835
            RGGF ++GGGAD+DNW       GRK   S GA   + E         NG G DSD WGK
Sbjct: 233  RGGFPTSGGGADSDNW-------GRK---SDGARAGMGE---------NGGGADSDNWGK 273

Query: 1834 KRE---EGVG-RPRLNLQPRSLPVSDIQQE-NGTAAKPKGSNPFGDARPREEVLKEKGQD 1670
            K E    G+G RPRLNLQPRS+P+++  QE +G A KPKGSNPFG+ARPREEVL EKGQD
Sbjct: 274  KSEGVRSGIGERPRLNLQPRSIPLNNGNQEASGVAVKPKGSNPFGNARPREEVLAEKGQD 333

Query: 1669 PRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPP 1490
             +            +T     ++                  P+  + R+WRKPE+ ++ P
Sbjct: 334  WKKIDEQLGSMKIKDTVERAETSSGASFERRKGFGVRSGRSPD--SGRSWRKPESADSRP 391

Query: 1489 Q 1487
            Q
Sbjct: 392  Q 392


>ref|XP_004147022.1| PREDICTED: uncharacterized protein LOC101214573 [Cucumis sativus]
            gi|449489695|ref|XP_004158389.1| PREDICTED:
            uncharacterized LOC101214573 [Cucumis sativus]
          Length = 405

 Score =  293 bits (751), Expect = 2e-76
 Identities = 188/421 (44%), Positives = 238/421 (56%), Gaps = 31/421 (7%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 2477
            MAATVS W KPGAWALD+EEHEAELL   KD+ +   HQ       S+DFPSL       
Sbjct: 1    MAATVSPWGKPGAWALDAEEHEAELL---KDQEEQSRHQEEP----SADFPSLAAAAATK 53

Query: 2476 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGN 2297
                  GQ++ L EF ++G  K ++Q+   KGLT ++L+ LPTGPRQR+AEE+DRN+LG 
Sbjct: 54   PKKKK-GQSIPLSEFQTYGGPKPSAQSSDPKGLTAEDLMMLPTGPRQRTAEEMDRNRLGG 112

Query: 2296 GFRSYGNS--YDRPGRGSSDE-------------QPRR------RDSNRDLAPSRADEID 2180
            GF+S+G +  YDR  R S+ E             + RR      R+  R+  PSRADEID
Sbjct: 113  GFKSWGQNSLYDRGNRYSNSEDSPNSRRSSRVFDESRRTNDGSDREFRRESLPSRADEID 172

Query: 2179 DWGAAKKSTVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRR-----YDR 2015
            DWGA KK  VGN             F    S+AD+ D+W S K+F PSEGRR      +R
Sbjct: 173  DWGAGKKPMVGNGFERRERGGGGGFFDSHSSKADESDSWVSSKSFTPSEGRRSGGFDRER 232

Query: 2014 RGGFESNGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGK 1835
            RGGF ++GGGAD+DNW       GRK     GA       +GG  E  NG   DS+ WGK
Sbjct: 233  RGGFPTSGGGADSDNW-------GRK---PDGA-------RGGIGE--NGGSADSENWGK 273

Query: 1834 KRE---EGVG-RPRLNLQPRSLPVSDIQQE-NGTAAKPKGSNPFGDARPREEVLKEKGQD 1670
            + E    G+G RPRLNLQPRS+P+++  QE +G A KPKGSNPFG+ARPREEVL EKGQD
Sbjct: 274  RSEGVRSGIGERPRLNLQPRSIPLNNGNQEASGVAVKPKGSNPFGNARPREEVLAEKGQD 333

Query: 1669 PRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPP 1490
             +            +T     ++                  P+  + R WRKPE+VE+ P
Sbjct: 334  WKKIDEQLESVKIKDTVERAETSSGASFERKKGFGARSGRSPD--SGRTWRKPESVESRP 391

Query: 1489 Q 1487
            Q
Sbjct: 392  Q 392


>gb|EMJ00816.1| hypothetical protein PRUPE_ppa007168mg [Prunus persica]
          Length = 379

 Score =  285 bits (728), Expect = 1e-73
 Identities = 186/407 (45%), Positives = 225/407 (55%), Gaps = 17/407 (4%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 2477
            MAATVS WAKPGAWAL +EE +AEL Q    E  N  H     S   +D+PSL       
Sbjct: 1    MAATVSPWAKPGAWALAAEEQDAELEQ----ETQNARHVVEPPS---ADYPSLSVAATAK 53

Query: 2476 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGN 2297
                 KGQ +SL EF +FG+ K  +Q    +GLT  + + LPTGPR+R+AEELDRN+LG 
Sbjct: 54   PKKKNKGQKISLAEFTAFGAPKPVAQP---EGLTHQDRMHLPTGPRERTAEELDRNRLGG 110

Query: 2296 GFRSYGNSYDRPGRGSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKSTVGNXXXXXXXXX 2117
            GFRSYG+                     D   SRADEIDDWGAAKKSTVGN         
Sbjct: 111  GFRSYGS---------------------DRGNSRADEIDDWGAAKKSTVGNGFERRERGA 149

Query: 2116 XXXXFSDSQSRADDVDNWASKKTFVPSEGRRY---------DRRGGFESNGGGADTDNWT 1964
                F  SQS+AD+ D+W S K+ V SEGRR+         +R+ GF S+ GGAD+DNW 
Sbjct: 150  GGSFFGGSQSKADESDSWVSNKSSVSSEGRRFGASGGGFDRERKVGFTSD-GGADSDNWG 208

Query: 1963 KRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKRE-------EGVGRPR 1805
            ++KEE      + G  FD  RER+ G    SNG G DS++WGKK+E       E  GRPR
Sbjct: 209  RKKEES-----NGGSGFD--RERRVGFV--SNGGGADSEVWGKKKEESNGGLSESTGRPR 259

Query: 1804 LNLQPRSLPVS-DIQQENGTAAKPKGSNPFGDARPREEVLKEKGQDPRXXXXXXXXXXXX 1628
            LNLQPR+LPVS +    + T  K KGSNPFG+ARPREEVL EKG+D +            
Sbjct: 260  LNLQPRTLPVSNETSPGSTTVPKSKGSNPFGEARPREEVLAEKGKDWKKIDEELESVKIK 319

Query: 1627 ETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPETVEAPPQ 1487
            E A  D S                     DRTERAWRKP+  +A PQ
Sbjct: 320  EVAERDHSPSFGKRSFGIGNGRAG-----DRTERAWRKPDVADARPQ 361


>ref|XP_002275899.1| PREDICTED: uncharacterized protein LOC100262348 [Vitis vinifera]
          Length = 401

 Score =  260 bits (664), Expect = 3e-66
 Identities = 175/412 (42%), Positives = 228/412 (55%), Gaps = 28/412 (6%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNH---QSNGTSGISSDFPSLXXXX 2486
            MAATVS W K GAWALDSEEHE ELLQQQ+D+  NG     +       S+DFP+L    
Sbjct: 1    MAATVSPWGKAGAWALDSEEHEDELLQQQRDDKVNGEFSGGEGRQAPEASADFPTLATAA 60

Query: 2485 XXXXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNK 2306
                     GQTLSL EF++FG+ K ++Q   +KGLT ++L+ LPTGPRQRSAEELDR +
Sbjct: 61   ATKSKKKK-GQTLSLSEFSAFGAGK-SAQPSQTKGLTHEDLMMLPTGPRQRSAEELDRGR 118

Query: 2305 LGNGFRSYGN--SYD----RPGRGSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKSTVGN 2144
            LG GFRSYG+  SY+    R G G     PR         P  ++E    G  + S+   
Sbjct: 119  LGGGFRSYGSNGSYEGGRSRYGGGEDSANPR-------WGPRGSEERRQGGFGRDSS--- 168

Query: 2143 XXXXXXXXXXXXXFSDSQSRADDVDNW-ASKKTFVPSEGRRYDRRGGFESNGGGADTDNW 1967
                            + SRAD++D+W A+KK+ V +   R DR G F+S     ++ +W
Sbjct: 169  -------------RELAPSRADEIDDWGAAKKSTVGNGFERRDRGGFFDSQSRADESASW 215

Query: 1966 TKRKE---EEGRKFPSSGGAFDSLRERKGGSFEQSNGNG-PDSDIWGKKREEGVG----- 1814
               K     EGR+F   GG F+SLRER+GG    S+G G  DS+ WG+K+EEG G     
Sbjct: 216  VSNKSFTPSEGRRF-GGGGGFESLRERRGGFDSASDGGGGADSESWGRKKEEGSGNANGS 274

Query: 1813 ---RPRLNLQPRSLPVSDIQQE-NGTAAKPKGSNPFGDARPREEVLKEKGQD-----PRX 1661
               RP+L LQPR++PV+D QQ  +G+ AKPKG NPFG+ARPREEVL EKGQD      + 
Sbjct: 275  AGSRPKLILQPRTVPVNDGQQPGSGSVAKPKGPNPFGEARPREEVLAEKGQDWKEIEEKL 334

Query: 1660 XXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPET 1505
                           +DG +                S PE R+E++WRKPE+
Sbjct: 335  ESVKLKDVGSPGVGQTDGPS---FGKRSFGSGNARASLPESRSEKSWRKPES 383


>emb|CAN77162.1| hypothetical protein VITISV_029831 [Vitis vinifera]
          Length = 1434

 Score =  259 bits (662), Expect = 4e-66
 Identities = 175/412 (42%), Positives = 227/412 (55%), Gaps = 28/412 (6%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNH---QSNGTSGISSDFPSLXXXX 2486
            MAATVS W K GAWALDSEEHE ELLQQQ+D+  NG     +       S+DFP+L    
Sbjct: 1    MAATVSPWGKAGAWALDSEEHEDELLQQQRDDKVNGEFSGGEGRQAPEASADFPTLATAA 60

Query: 2485 XXXXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNK 2306
                     GQTLSL EF++FG+ K ++Q   +KGLT ++L+ LPTGPRQRSAEELDR +
Sbjct: 61   ATKSKKKK-GQTLSLSEFSAFGAGK-SAQPSQTKGLTHEDLMMLPTGPRQRSAEELDRGR 118

Query: 2305 LGNGFRSYGN--SYD----RPGRGSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKSTVGN 2144
            LG GFRSYG+  SY+    R G G     PR         P  ++E    G  + S+   
Sbjct: 119  LGGGFRSYGSNGSYEGGRSRYGGGEDSANPR-------WGPRGSEERRQGGFGRDSS--- 168

Query: 2143 XXXXXXXXXXXXXFSDSQSRADDVDNW-ASKKTFVPSEGRRYDRRGGFESNGGGADTDNW 1967
                            + SRAD++D+W A+KK+ V +   R DR G F+S     ++ +W
Sbjct: 169  -------------RELAPSRADEIDDWGAAKKSTVGNGFERRDRGGFFDSQSRADESASW 215

Query: 1966 TKRKE---EEGRKFPSSGGAFDSLRERKGGSFEQSNGNG-PDSDIWGKKREEGVG----- 1814
               K     EGR+F   GG F+SLRER+GG    S+G G  DS+ WG+K+EEG G     
Sbjct: 216  VSNKSFTPSEGRRF-GGGGGFESLRERRGGFDSASDGGGGADSESWGRKKEEGSGNANGS 274

Query: 1813 ---RPRLNLQPRSLPVSDIQQE-NGTAAKPKGSNPFGDARPREEVLKEKGQD-----PRX 1661
               RP+L LQPR++PV+D QQ  +G+ AKPKG NPFG+ARPREEVL EKGQD      + 
Sbjct: 275  AGSRPKLILQPRTVPVNDGQQPGSGSVAKPKGPNPFGEARPREEVLAEKGQDWKEIEEKL 334

Query: 1660 XXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPET 1505
                           +DG +                S PE R E++WRKPE+
Sbjct: 335  ESVKLKDVGSPGVGQTDGPS---FGKRSFGSGNARASLPESRXEKSWRKPES 383


>ref|XP_006411661.1| hypothetical protein EUTSA_v10025358mg [Eutrema salsugineum]
            gi|557112831|gb|ESQ53114.1| hypothetical protein
            EUTSA_v10025358mg [Eutrema salsugineum]
          Length = 404

 Score =  252 bits (643), Expect = 7e-64
 Identities = 180/407 (44%), Positives = 218/407 (53%), Gaps = 27/407 (6%)
 Frame = -2

Query: 2653 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 2474
            AA  S WAKPGAWALD+EE+EAEL QQ        ++Q+N     SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALDAEENEAELQQQSL-----ASNQTNS----SSDFPSLAAAATTKT 53

Query: 2473 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 2294
                 GQTLSL EFA++GSVK  S AP ++ LT DEL++LPTGPR+RSAEELDR+KLG G
Sbjct: 54   KKKK-GQTLSLAEFATYGSVKAAS-APKTERLTHDELVSLPTGPRERSAEELDRSKLGGG 111

Query: 2293 FRSYGNSYDR--PGRGSSDEQPRR------RDSNRDLAPSRADEIDDWGAAKKSTVGNXX 2138
            FRSYG    R    R S D + R       R+S RD  PSRADE D+W A KK   GN  
Sbjct: 112  FRSYGRDDSRWSSSRVSEDGEKRGGGFNRDRESGRDSGPSRADETDNWAAGKKPVGGNGF 171

Query: 2137 XXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGADTDNWTKR 1958
                        S SQS+AD+VD+W S K   PSE RR        SNGG    D + +R
Sbjct: 172  ERRERGGGFFE-SQSQSKADEVDSWVSSK---PSEPRRIS-----SSNGGA---DRFERR 219

Query: 1957 KEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE-------GVGRPRLN 1799
                        G+F+SL   +   +    G G DSD WG++REE       G  RPRL 
Sbjct: 220  ------------GSFESLSRNRDSQY---GGGGSDSDSWGRRREEIGAPPPSGGSRPRLV 264

Query: 1798 LQPRSLPVS-----DIQQENG---TAAKPKGSNPFGDARPREEVLKEKGQDPRXXXXXXX 1643
            LQPR+LPV+     D+  E+    T  KPKG+NPFG+ARPREEVL EKGQD +       
Sbjct: 265  LQPRTLPVAAPAIVDVNPESAVTVTVEKPKGANPFGNARPREEVLAEKGQDWKEIEEKLD 324

Query: 1642 XXXXXETA----WSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRK 1514
                 + A     SD  +P                  +DRTER+WRK
Sbjct: 325  AVKLKDVAAAIEKSDERSPGKMGFGLGNGRN------DDRTERSWRK 365


>ref|XP_006283781.1| hypothetical protein CARUB_v10004871mg [Capsella rubella]
            gi|482552486|gb|EOA16679.1| hypothetical protein
            CARUB_v10004871mg [Capsella rubella]
          Length = 427

 Score =  244 bits (624), Expect = 1e-61
 Identities = 168/365 (46%), Positives = 203/365 (55%), Gaps = 37/365 (10%)
 Frame = -2

Query: 2653 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 2474
            AA  S WAKPGAWAL++EEHE EL QQ    P N       ++G SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEDELKQQAP--PSN----QKSSAGDSSDFPSLAAAATTKT 56

Query: 2473 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 2294
                 GQT+SL EFAS+GS K  + AP ++ LT  EL+ALPTGPR+RSAEELDR+KLG G
Sbjct: 57   KKKK-GQTISLAEFASYGSAK-AAPAPQTERLTQAELVALPTGPRERSAEELDRSKLGGG 114

Query: 2293 FRSYG---------NSYDRPGRGSSDEQPRR------RDSNRDLAPSRADEIDDWGAAKK 2159
            FRSYG         NS     R S D + R       R+S+RD  PSRADE D+WGA KK
Sbjct: 115  FRSYGGGRYGDDSSNSRWGSSRVSEDGERRGGGFNRDRESSRDSGPSRADEDDNWGATKK 174

Query: 2158 STVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGAD 1979
               G+             F DSQS+AD+VD+W S K   PSE RRY       SNGGG  
Sbjct: 175  PIGGSGFERRERGGGGGGFFDSQSKADEVDSWVSTK---PSEPRRY-----VSSNGGG-- 224

Query: 1978 TDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREEGVG----- 1814
             D + KR            G+F+SL   +   +    G G +SD WG++REE  G     
Sbjct: 225  -DRFEKR------------GSFESLSRTRDSQY---GGGGSESDTWGRRREESAGADGAP 268

Query: 1813 -------RPRLNLQPRSLPVSD-------IQQENGTAA---KPKGSNPFGDARPREEVLK 1685
                   RPRL LQPR+LPV+        ++ E+       KPKG+NPFG+ARPREEVL 
Sbjct: 269  PPSSGGSRPRLVLQPRTLPVAVPVAVVEVVKPESPVMVAVDKPKGANPFGNARPREEVLA 328

Query: 1684 EKGQD 1670
            EKGQD
Sbjct: 329  EKGQD 333


>ref|NP_195583.1| glycine-rich protein [Arabidopsis thaliana]
            gi|4467158|emb|CAB37527.1| putative protein [Arabidopsis
            thaliana] gi|7270854|emb|CAB80535.1| putative protein
            [Arabidopsis thaliana] gi|17065142|gb|AAL32725.1|
            putative protein [Arabidopsis thaliana]
            gi|20259814|gb|AAM13254.1| putative protein [Arabidopsis
            thaliana] gi|332661567|gb|AEE86967.1| glycine-rich
            protein [Arabidopsis thaliana]
          Length = 452

 Score =  240 bits (613), Expect = 2e-60
 Identities = 172/413 (41%), Positives = 213/413 (51%), Gaps = 33/413 (7%)
 Frame = -2

Query: 2653 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 2474
            AA  S WAKPGAWAL++EEHEAEL QQ    P   N +S+     SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEAELKQQ----PSPTNQKSSAED--SSDFPSLAAAATTKT 56

Query: 2473 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 2294
                 GQT+SL EFA++G+ K    AP ++ LT  EL+ALPTGPR+RSAEELDR+KLG G
Sbjct: 57   KKKK-GQTISLAEFATYGTAK-AKPAPQTERLTQAELVALPTGPRERSAEELDRSKLGGG 114

Query: 2293 FRSYG-------NSYDRPGRGSSDEQPRR--------RDSNRDLAPSRADEIDDWGAAKK 2159
            FRSYG       +S  R G     E   R        R+ +RD  PSRADE D+W AAKK
Sbjct: 115  FRSYGGGRYGDESSSSRWGSSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNWAAAKK 174

Query: 2158 STVGN-XXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGA 1982
               GN               S SQS+AD+VD+W S K   PSE RR+       SNGGG 
Sbjct: 175  PISGNGFERRERGSGGGFFESQSQSKADEVDSWVSTK---PSEPRRF-----VSSNGGGG 226

Query: 1981 DTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE------- 1823
              D + KR            G+F+SL   +   +    G G +SD WG++REE       
Sbjct: 227  --DRFEKR------------GSFESLSRNRDSQY--GGGGGSESDTWGRRREESGAANGS 270

Query: 1822 ----GVGRPRLNLQPRSLPVSDIQQENGTA------AKPKGSNPFGDARPREEVLKEKGQ 1673
                G  RPRL LQPR+LPV+ ++     +       KPKG+NPFG+ARPREEVL EKGQ
Sbjct: 271  PPPSGGSRPRLVLQPRTLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQ 330

Query: 1672 DPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRK 1514
            D +            + A +    P                  E+R ER+WRK
Sbjct: 331  DWKEIDEKLEAEKLKDIA-AAMEKPNEKSTGKMGFGLGNGRKDEERIERSWRK 382


>ref|NP_001190956.1| glycine-rich protein [Arabidopsis thaliana]
            gi|332661568|gb|AEE86968.1| glycine-rich protein
            [Arabidopsis thaliana]
          Length = 465

 Score =  240 bits (613), Expect = 2e-60
 Identities = 172/413 (41%), Positives = 213/413 (51%), Gaps = 33/413 (7%)
 Frame = -2

Query: 2653 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 2474
            AA  S WAKPGAWAL++EEHEAEL QQ    P   N +S+     SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEAELKQQ----PSPTNQKSSAED--SSDFPSLAAAATTKT 56

Query: 2473 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 2294
                 GQT+SL EFA++G+ K    AP ++ LT  EL+ALPTGPR+RSAEELDR+KLG G
Sbjct: 57   KKKK-GQTISLAEFATYGTAK-AKPAPQTERLTQAELVALPTGPRERSAEELDRSKLGGG 114

Query: 2293 FRSYG-------NSYDRPGRGSSDEQPRR--------RDSNRDLAPSRADEIDDWGAAKK 2159
            FRSYG       +S  R G     E   R        R+ +RD  PSRADE D+W AAKK
Sbjct: 115  FRSYGGGRYGDESSSSRWGSSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNWAAAKK 174

Query: 2158 STVGN-XXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGA 1982
               GN               S SQS+AD+VD+W S K   PSE RR+       SNGGG 
Sbjct: 175  PISGNGFERRERGSGGGFFESQSQSKADEVDSWVSTK---PSEPRRF-----VSSNGGGG 226

Query: 1981 DTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE------- 1823
              D + KR            G+F+SL   +   +    G G +SD WG++REE       
Sbjct: 227  --DRFEKR------------GSFESLSRNRDSQY--GGGGGSESDTWGRRREESGAANGS 270

Query: 1822 ----GVGRPRLNLQPRSLPVSDIQQENGTA------AKPKGSNPFGDARPREEVLKEKGQ 1673
                G  RPRL LQPR+LPV+ ++     +       KPKG+NPFG+ARPREEVL EKGQ
Sbjct: 271  PPPSGGSRPRLVLQPRTLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQ 330

Query: 1672 DPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRK 1514
            D +            + A +    P                  E+R ER+WRK
Sbjct: 331  DWKEIDEKLEAEKLKDIA-AAMEKPNEKSTGKMGFGLGNGRKDEERIERSWRK 382


>ref|XP_002528261.1| translation initiation factor, putative [Ricinus communis]
            gi|223532298|gb|EEF34099.1| translation initiation
            factor, putative [Ricinus communis]
          Length = 396

 Score =  240 bits (612), Expect = 3e-60
 Identities = 163/350 (46%), Positives = 202/350 (57%), Gaps = 21/350 (6%)
 Frame = -2

Query: 2656 MAATVSA-WAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 2480
            MAATVS+ W KPGAWALD+EEHE EL Q++ D   +            SDFPSL      
Sbjct: 1    MAATVSSPWGKPGAWALDAEEHEDELKQERLDSQQDKE----------SDFPSLSVAATK 50

Query: 2479 XXXXXXKGQTLSLQEFASFGSVKQTSQ-APASKGLTPDELLALPTGPRQRSAEELDRNKL 2303
                    QTLSL EFA++ S   + Q +  S+GLT ++LL LPTGPRQRSAEELDR++L
Sbjct: 51   QPKKKK-NQTLSLAEFATYSSAAASQQPSQHSRGLTHEDLLNLPTGPRQRSAEELDRSRL 109

Query: 2302 GNGFRSYG----NSYDR-----PGRGSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKS-T 2153
            G GF+SYG    N  D       G G+S    R RDS+R+L  SRADEIDDW   KKS +
Sbjct: 110  GGGFKSYGMNSRNGDDAGNSRWGGGGNSRVSSRDRDSSRELVLSRADEIDDWSKTKKSPS 169

Query: 2152 VGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFESNGGGADTD 1973
             GN               DSQS+AD+ D+W + K   P E RR+   GG    GGG    
Sbjct: 170  FGNERRERSSSFF-----DSQSKADESDSWVANK---PMETRRF---GG--GGGGGGSNG 216

Query: 1972 NWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNG-PDSDIWGKKREE--------G 1820
             + +R            G+FDSL   + GS   SNG G  DSD WG+K+E+        G
Sbjct: 217  GFERR------------GSFDSLSRDRYGS---SNGGGAADSDNWGRKKEDSNGMGSVSG 261

Query: 1819 VGRPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREEVLKEKGQD 1670
            + RP+L LQPRSLP+S+   +NG   KPKGS+PFG+ARPREEVL EKG+D
Sbjct: 262  IARPKLVLQPRSLPISN---DNGVGMKPKGSSPFGNARPREEVLAEKGKD 308


>ref|XP_004292157.1| PREDICTED: uncharacterized protein LOC101302425 [Fragaria vesca
            subsp. vesca]
          Length = 422

 Score =  239 bits (609), Expect = 6e-60
 Identities = 177/423 (41%), Positives = 219/423 (51%), Gaps = 36/423 (8%)
 Frame = -2

Query: 2656 MAATVSA-WAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 2480
            MAATVS+ WAKPGAWALD+EEHEAEL QQ K E               +DFPSL      
Sbjct: 1    MAATVSSPWAKPGAWALDAEEHEAELEQQTKIETQP-----------LADFPSL-SAAAA 48

Query: 2479 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 2300
                  KGQ +SL EF +FG  K     P   GLT ++ L LPTGPR+R+AEELDR++  
Sbjct: 49   KPKKKSKGQKVSLAEFTTFGGPKPVQAEPV--GLTHEDRLVLPTGPRERTAEELDRSR-- 104

Query: 2299 NGFRSYGNSYDRPGRGSSDEQ--PRRRD--------SNRDLAPSRADEIDDWGAAKKSTV 2150
             GFRSYG   DR  R  S+ +   +RR+        ++RD APSRADE DDWG  KKS V
Sbjct: 105  -GFRSYGG--DRVNREESNSKWGSQRREGGGFGGEKTDRD-APSRADEADDWGVGKKS-V 159

Query: 2149 GNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPS------EGRRYDRRGGFESNGG 1988
            GN                SQS+AD+ D+W S K+   S       G   +R+ GF SNGG
Sbjct: 160  GNGFERRERAGFGF---GSQSKADESDSWVSNKSSFSSLRSGGGGGFERERKVGFASNGG 216

Query: 1987 GADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREEG---- 1820
            GAD+++W +++EE       S G F+  RERK G    SNG G D++ WG++REE     
Sbjct: 217  GADSESWGRKREE-------SNGGFE--RERKVGLEFNSNGGGADAESWGRRREESNGGT 267

Query: 1819 --VGRPRLNLQPRSLPVS----DIQQENGTAA---------KPKGSNPFGDARPREEVLK 1685
               GRPRLNLQPR+LPV+     +  E    A         +P+ +NPFG ARPREEVL 
Sbjct: 268  ETTGRPRLNLQPRTLPVTLPPPVVSDETSPVAAPVAPEIVPRPRSTNPFGAARPREEVLA 327

Query: 1684 EKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRKPET 1505
            EKGQD +                   +                     DRTE AWRKP  
Sbjct: 328  EKGQDWKKIDEQLESVKLK----EKEAVAAEGESFGKRSFGMGSGRSGDRTEGAWRKPVV 383

Query: 1504 VEA 1496
             EA
Sbjct: 384  AEA 386


>gb|EOX97749.1| Glycine-rich protein, putative [Theobroma cacao]
          Length = 369

 Score =  231 bits (589), Expect = 1e-57
 Identities = 157/348 (45%), Positives = 196/348 (56%), Gaps = 19/348 (5%)
 Frame = -2

Query: 2656 MAATVSA-WAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 2480
            MAATVS+ W KPGAWALD+EEHEAEL QQ ++  D+ + +        +DFPSL      
Sbjct: 1    MAATVSSPWGKPGAWALDAEEHEAELQQQDQNHGDSSSEKH-------ADFPSLATAAAA 53

Query: 2479 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 2300
                    QTLSL EF ++G+ K +        LT ++LL LPTGPRQRS EELDRN+LG
Sbjct: 54   KTKKKK-SQTLSLAEFTTYGAAKPSEPTR----LTHEDLLVLPTGPRQRSPEELDRNRLG 108

Query: 2299 NGFRSYG-NSYDRPGRGSSDE------QPRRRDSNRDLAPSRADEIDDWGAAKKST-VGN 2144
             GF+SYG N Y+  G  SS        +   RDSNR++APSRADEID+W +AKKST  GN
Sbjct: 109  GGFKSYGSNRYNSNGDDSSSNGRWGSSRASNRDSNREIAPSRADEIDNWASAKKSTSTGN 168

Query: 2143 XXXXXXXXXXXXXFS--DSQSRADDVDNWASKKTFVPSEGRRYDRR--GGFESNGGGADT 1976
                             DSQS+AD+VDNWA+ K++  S      RR  GGFE        
Sbjct: 169  GFGGGFERRERGGGGFFDSQSKADEVDNWAANKSY-KSANEAPPRRFGGGFERRS----- 222

Query: 1975 DNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREE------GVG 1814
                               +FDSL         QS  +  D D WGKK+EE      G  
Sbjct: 223  -------------------SFDSL---------QSRDSPRDLDNWGKKKEESGSAGSGGV 254

Query: 1813 RPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREEVLKEKGQD 1670
            RPRL LQPR+  V++  ++  T AKP+G+NPFG+ARPREEVLKEKG+D
Sbjct: 255  RPRLVLQPRT--VTEEGKKEATLAKPRGANPFGEARPREEVLKEKGKD 300


>ref|XP_002868857.1| hypothetical protein ARALYDRAFT_490636 [Arabidopsis lyrata subsp.
            lyrata] gi|297314693|gb|EFH45116.1| hypothetical protein
            ARALYDRAFT_490636 [Arabidopsis lyrata subsp. lyrata]
          Length = 419

 Score =  231 bits (588), Expect = 2e-57
 Identities = 163/411 (39%), Positives = 205/411 (49%), Gaps = 31/411 (7%)
 Frame = -2

Query: 2653 AATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXX 2474
            AA  S WAKPGAWAL++EEHEAEL QQ        +      +G SSDFPSL        
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEAELKQQAPPSTQKSS------AGDSSDFPSLAAAATTKT 56

Query: 2473 XXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGNG 2294
                  QT+SL EFA++GS K  +Q   ++ LT  EL++LPTGPR+RSA+ELDR+KLG G
Sbjct: 57   KKKK-AQTISLAEFATYGSAKAAAQ---TERLTQAELVSLPTGPRERSADELDRSKLGGG 112

Query: 2293 FRSYG-------NSYDRPGRGSSDEQPRR--------RDSNRDLAPSRADEIDDWGAAKK 2159
            FRSYG       +S  R G     E   R        R+ +RD  PSRADE D+W AAKK
Sbjct: 113  FRSYGGGRYGDESSSSRWGSSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNWAAAKK 172

Query: 2158 STVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRRGGFES------ 1997
               GN             F +SQS+AD+VD+W S K   PSE RRY++RG FES      
Sbjct: 173  PIGGNGFERRERGAGGGFF-ESQSKADEVDSWVSSK---PSEPRRYEKRGSFESLSRNRD 228

Query: 1996 ----NGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKR 1829
                 GG +D+D W +R+E                        E S  NG  S   G   
Sbjct: 229  SQYGGGGSSDSDTWGRRRE------------------------ESSGANGVPSPTAG--- 261

Query: 1828 EEGVGRPRLNLQPRSLPVSDIQQENGTA------AKPKGSNPFGDARPREEVLKEKGQDP 1667
                 RPRL LQPR+LPV+ ++     +       KPKG+NPFG+ARPREEVL EKGQD 
Sbjct: 262  ----SRPRLVLQPRTLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQDW 317

Query: 1666 RXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRK 1514
            +            + A +                       ++RTER+WRK
Sbjct: 318  KEIDEKLEADKLKDVAAAIEKPDEKSPGKMGGFGLGNGRKDDERTERSWRK 368


>ref|XP_002313735.2| hypothetical protein POPTR_0009s13280g [Populus trichocarpa]
            gi|550331640|gb|EEE87690.2| hypothetical protein
            POPTR_0009s13280g [Populus trichocarpa]
          Length = 387

 Score =  223 bits (569), Expect = 3e-55
 Identities = 148/354 (41%), Positives = 192/354 (54%), Gaps = 25/354 (7%)
 Frame = -2

Query: 2656 MAATVSA-WAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXX 2480
            MAATVS+ W+KPGAWALD+EEHEAEL Q+ ++        +    G++ +FPSL      
Sbjct: 1    MAATVSSPWSKPGAWALDAEEHEAELQQEHENSQQASTLAAQPLGGVA-EFPSLAAAAAT 59

Query: 2479 XXXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLG 2300
                  K QTLSL EF+++   K + +        PD L  LPT PR+RSAEELDR +LG
Sbjct: 60   KQPKKKKNQTLSLAEFSNYSLAKSSHE--------PD-LFNLPTRPRERSAEELDRARLG 110

Query: 2299 NGFRSYGNSYDRPGR---------GSSDEQPRRRDSNRDLAPSRADEIDDWGAAKKSTVG 2147
             GF+SYG SY   G          G+ + +   R+S+++ APSRADEIDDW   KKS  G
Sbjct: 111  GGFKSYGLSYRNGGEESNSRWGGGGNGNSRVSNRESSKEFAPSRADEIDDWSKTKKSPAG 170

Query: 2146 NXXXXXXXXXXXXXFSDSQSRADDVDNWASKKT-------FVPSEGRRYDRRGGFES--- 1997
            N             F DSQS+AD+  +W S KT       FV +    ++RRG +++   
Sbjct: 171  NVYERRERERGSSFF-DSQSKADESASWVSNKTTNDGPRRFVGANNGGFERRGSYDTLSR 229

Query: 1996 -----NGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKK 1832
                 +GG AD+DNW ++K+E          +F+S      GS  +              
Sbjct: 230  ERHGFSGGAADSDNWGRKKDE----------SFNS------GSVGE-------------- 259

Query: 1831 REEGVGRPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREEVLKEKGQD 1670
                  RP+L LQPR+LPVSD    NG   KPKGSNPFGDARPREEVLKEKG D
Sbjct: 260  ------RPKLKLQPRTLPVSD---GNGAVEKPKGSNPFGDARPREEVLKEKGMD 304


>ref|XP_004505768.1| PREDICTED: uncharacterized protein LOC101488662 [Cicer arietinum]
          Length = 384

 Score =  220 bits (560), Expect = 3e-54
 Identities = 173/426 (40%), Positives = 203/426 (47%), Gaps = 45/426 (10%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 2477
            MAATVSAW+KPGAWALDSEEHEAELLQQ  +         N T  ++ +FPSL       
Sbjct: 1    MAATVSAWSKPGAWALDSEEHEAELLQQTNN---------NDTKPLA-EFPSLAVAAATK 50

Query: 2476 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEEL--DRNKL 2303
                   QTLSL EF +      T Q P            LPTGPRQR+AEEL  DR ++
Sbjct: 51   PKKKK-AQTLSLAEFTAKPLSSFTQQDPVD----------LPTGPRQRTAEELERDRTRI 99

Query: 2302 GNGFRSYGNSYDRPGRGSSDEQPR-----------------RRDSNRDLAP-SRADEIDD 2177
            G GFRSYG+  +R G G      R                  RDSNR+ AP SRADEID+
Sbjct: 100  GGGFRSYGDRPNRTGGGDEGSNSRWGSSRVSDDLRRNNSFGSRDSNRESAPPSRADEIDN 159

Query: 2176 WGAAKKSTVG-----NXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEGRRYDRR 2012
            W AAKK++VG                   F DSQSRAD+ D+W S K+FVPS        
Sbjct: 160  WAAAKKTSVGVGNGFERRERDNRERGGGGFFDSQSRADESDSWVSSKSFVPS-------- 211

Query: 2011 GGFESNGGGADTDNWTKRKEEEGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKK 1832
                                 EGR+F  SGG F+  RERK G        G DSD W KK
Sbjct: 212  ---------------------EGRRFGGSGGGFE--RERKVGF---GTSGGADSDNWNKK 245

Query: 1831 ---------REEGV--GRPRLNLQPRSLPVSDIQQE---------NGTAAKPKGSNPFGD 1712
                     R E V  GRPRL LQPRS+  S+  Q          +G  AKPKG+NPFG+
Sbjct: 246  KGEFSVGSERNESVAGGRPRLVLQPRSVSASNENQNQDVAAAAVVSGNVAKPKGANPFGE 305

Query: 1711 ARPREEVLKEKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRT 1532
            ARPRE+VL EKGQD +            ET   +G                     EDR+
Sbjct: 306  ARPREQVLAEKGQDWKKIDEQLESMKIKETV-VEGFGKRGFGSGNGRG--------EDRS 356

Query: 1531 ERAWRK 1514
            ER+WRK
Sbjct: 357  ERSWRK 362


>ref|XP_006851959.1| hypothetical protein AMTR_s00041p00189930 [Amborella trichopoda]
            gi|548855542|gb|ERN13426.1| hypothetical protein
            AMTR_s00041p00189930 [Amborella trichopoda]
          Length = 413

 Score =  218 bits (555), Expect = 1e-53
 Identities = 174/430 (40%), Positives = 212/430 (49%), Gaps = 47/430 (10%)
 Frame = -2

Query: 2647 TVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXXXXX 2468
            T SAWAKPGAWAL+SEE E+        E +        T    SDFPSL          
Sbjct: 2    TTSAWAKPGAWALESEESESM-------EAEAPPAAQKTTEKPQSDFPSLALAASAKTSK 54

Query: 2467 XXKGQTLSLQEFAS-----FGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNK- 2306
              K Q LSL EF +     + +  +++   +S+GLTPDELL LPTGPR+R+AEEL+R + 
Sbjct: 55   KKKSQPLSLAEFTTGKQVAYSAKPRSAIIDSSRGLTPDELLRLPTGPRERTAEELERGRG 114

Query: 2305 -LGNGFRSYGNSYDRPGRGSSDE--QPRRRDSNRDLAPSRADEIDDWGAAKKSTVGNXXX 2135
             LG GF+SYG   DR  RG   E      R       PSRADEIDDWGA K+        
Sbjct: 115  SLGGGFQSYGR--DRADRGPRREAFDDDARGGRMPPPPSRADEIDDWGATKRQMA---PP 169

Query: 2134 XXXXXXXXXXFSDSQSRADDVDNW-ASKKTFVPS-EGRRYDRRGGFES----NGGGADTD 1973
                      FSDSQSRAD+ DNW ASKK+FVPS E RR    GGFE+         + D
Sbjct: 170  ASQERRSSNFFSDSQSRADESDNWGASKKSFVPSMEPRRL--VGGFENFRERESLADEVD 227

Query: 1972 NWTKRKEE-----EGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKKREEGVGRP 1808
            NWT  K+      E R+   SGG F++ RER+G       G+  DS    ++REEG  RP
Sbjct: 228  NWTSAKKSFAPSVEPRR---SGGGFENYREREGSRV----GSRFDSSC-QREREEGGQRP 279

Query: 1807 RLNLQPRSLPVSD-----IQQENGTAAKPKG--------------SNPFGDARPREEVLK 1685
            RL LQPR+LPV D     + Q     + P G              SNPFG+ARPREEVL 
Sbjct: 280  RLVLQPRTLPVGDGDGPVVLQLRAKVSNPLGEGDGPVVLQPRAKVSNPFGEARPREEVLA 339

Query: 1684 EKGQDPRXXXXXXXXXXXXETA---WSDG-----SAPXXXXXXXXXXXXXXXSWPEDRTE 1529
            EKGQD +            +T     SDG                           +RTE
Sbjct: 340  EKGQDWKKIAEQLDSMKIKDTVHKELSDGVSVERGGEGGFIRRPFGMGNGVGGMARERTE 399

Query: 1528 RAWRKPETVE 1499
            R+WRKPE +E
Sbjct: 400  RSWRKPEPIE 409


>gb|ESW03680.1| hypothetical protein PHAVU_011G033300g [Phaseolus vulgaris]
          Length = 371

 Score =  218 bits (554), Expect = 1e-53
 Identities = 166/429 (38%), Positives = 203/429 (47%), Gaps = 39/429 (9%)
 Frame = -2

Query: 2656 MAATVSAWAKPGAWALDSEEHEAELLQQQKDEPDNGNHQSNGTSGISSDFPSLXXXXXXX 2477
            MAATVSAW+KPGAWA+DSEEHEAELLQQ         H +   +    DFPSL       
Sbjct: 1    MAATVSAWSKPGAWAIDSEEHEAELLQQSTI------HDTKPLA----DFPSLAVAAATK 50

Query: 2476 XXXXXKGQTLSLQEFASFGSVKQTSQAPASKGLTPDELLALPTGPRQRSAEELDRNKLGN 2297
                   QT+SL EF +        Q P          + LPTGPRQR+AEELDR +LG 
Sbjct: 51   PKKKK-AQTISLAEFTAKPDTSFADQDP----------VVLPTGPRQRTAEELDRTRLGG 99

Query: 2296 GFRSYGNSYDRPGRGSS--------------DEQPRR------RDSNRDLAPSRADEIDD 2177
            GFRSYG   DRP R SS               ++PRR      RDSNR+LAPSRA     
Sbjct: 100  GFRSYG---DRPNRNSSGDDSSNSRWGSSRVSDEPRRNGSFGARDSNRELAPSRA----- 151

Query: 2176 WGAAKKSTVGNXXXXXXXXXXXXXFSDSQSRADDVDNWASKKTFVPSEG---RRYDRRGG 2006
                                            D+ DNWA+ K   PS G   +  DR G 
Sbjct: 152  --------------------------------DETDNWAAAKK--PSGGFERKERDRGGF 177

Query: 2005 FESNGGGADTDNWTKRKEE--EGRKFPSSGGAFDSLRERKGGSFEQSNGNGPDSDIWGKK 1832
            F+S     ++++W   K      R+F S+GG F+  RER+   F  S G   DS+ W KK
Sbjct: 178  FDSQSRADESESWVSNKSSGPSERRFGSNGGGFE--RERRVVGFGSSGG--ADSEDWNKK 233

Query: 1831 REE--------------GVGRPRLNLQPRSLPVSDIQQENGTAAKPKGSNPFGDARPREE 1694
            + E              G GRPRL LQPRSL VS+ +  +G   KPKG NPFG+ARPRE+
Sbjct: 234  KGESNVGTETVSVGVGVGGGRPRLVLQPRSLSVSN-EGPDGNVGKPKGVNPFGEARPREQ 292

Query: 1693 VLKEKGQDPRXXXXXXXXXXXXETAWSDGSAPXXXXXXXXXXXXXXXSWPEDRTERAWRK 1514
            VL EKGQD +            ETA  D                     PE RTER+WRK
Sbjct: 293  VLAEKGQDWKKIDEQLDSMKIKETAGGDSFGKRSFGSSNGGGRPAL---PESRTERSWRK 349

Query: 1513 PETVEAPPQ 1487
            P++ +  P+
Sbjct: 350  PQSDDESPK 358


Top