BLASTX nr result

ID: Rehmannia22_contig00024819 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00024819
         (1841 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004240703.1| PREDICTED: uncharacterized protein LOC101255...   271   9e-70
ref|XP_002283083.2| PREDICTED: uncharacterized protein LOC100249...   268   5e-69
emb|CBI21108.3| unnamed protein product [Vitis vinifera]              268   5e-69
ref|XP_006355909.1| PREDICTED: large proline-rich protein bag6-B...   267   1e-68
gb|EOY31281.1| Ubiquitin-like superfamily protein, putative isof...   244   1e-61
gb|EOY31280.1| Ubiquitin-like superfamily protein, putative isof...   244   1e-61
gb|EOY31278.1| Ubiquitin-like superfamily protein, putative isof...   244   1e-61
gb|EOY31277.1| Ubiquitin-like superfamily protein, putative isof...   244   1e-61
gb|EOY31276.1| Ubiquitin-like superfamily protein, putative isof...   244   1e-61
gb|EOY31275.1| Ubiquitin-like superfamily protein, putative isof...   244   1e-61
gb|EOY31274.1| Ubiquitin-like superfamily protein, putative isof...   244   1e-61
gb|EMJ05831.1| hypothetical protein PRUPE_ppa002041mg [Prunus pe...   242   3e-61
ref|XP_002532506.1| scythe/bat3, putative [Ricinus communis] gi|...   240   2e-60
ref|XP_006289444.1| hypothetical protein CARUB_v10002958mg [Caps...   238   7e-60
ref|XP_006394775.1| hypothetical protein EUTSA_v10003804mg [Eutr...   237   1e-59
gb|EPS60717.1| hypothetical protein M569_14085, partial [Genlise...   236   2e-59
ref|NP_197909.4| ubiquitin-like superfamily protein [Arabidopsis...   236   2e-59
gb|ESW27099.1| hypothetical protein PHAVU_003G173700g [Phaseolus...   235   4e-59
ref|XP_002331046.1| predicted protein [Populus trichocarpa] gi|5...   233   3e-58
ref|XP_003609499.1| Large proline-rich protein BAT3 [Medicago tr...   230   2e-57

>ref|XP_004240703.1| PREDICTED: uncharacterized protein LOC101255405 [Solanum
            lycopersicum]
          Length = 706

 Score =  271 bits (692), Expect = 9e-70
 Identities = 175/427 (40%), Positives = 234/427 (54%), Gaps = 85/427 (19%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            M SNG E++++  S   EC ETT++IKIK LDS+T++++VDKCV V  LKE+IA++ GVL
Sbjct: 1    MVSNGAEDVQICGSGEAECPETTVEIKIKMLDSQTYTLRVDKCVPVPALKEQIATVTGVL 60

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            +E+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+   PS +++PD  A          
Sbjct: 61   TEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQ---PSSDSTPDPQATASASNAGYS 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGID------- 637
             GN++ P M +      + GDG FPDLNR+V+AVL +FG  I   G   EGID       
Sbjct: 118  QGNRVSPDMVVGTYSSSDHGDGIFPDLNRIVTAVLGSFG--IASAGGGNEGIDLHGFGPA 175

Query: 638  ---------------SDT--------------------------IPGSLTTLSEYISNLR 694
                           +DT                          IP SLTTL++Y+S+L 
Sbjct: 176  SLGNIRDSGRSQTEQADTRDQSNVTNSASARSTDVPPEALQAPVIPDSLTTLTQYLSHLT 235

Query: 695  REFIANAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQA 874
             EF ANA GQ+  + + G+  ++   LE      G RG  T   LAE++  TRQL  EQ 
Sbjct: 236  VEFRANARGQSETTQSAGVHLADRTALEATAHSIGERGFPTPASLAEVIILTRQLFMEQV 295

Query: 875  TECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTP 1054
             EC            ++T+  ER RI S A+R+G +F+++G++LLELGR   T++MG+TP
Sbjct: 296  VECLSQFSTLLENQANVTNPGERMRIQSYALRTGGLFRNIGAMLLELGRTAMTLRMGETP 355

Query: 1055 ADALVNAGSPVYIRPTG-------------------ASVGPGDN-------------LPR 1138
            ADA+VNAG  V++   G                    SVG   N             +PR
Sbjct: 356  ADAVVNAGPAVFVSTAGPNPIMVQPLPFQPNTSFGAVSVGTVQNNTGFSGGSVSSGFIPR 415

Query: 1139 NIDITIR 1159
            NIDI IR
Sbjct: 416  NIDIRIR 422


>ref|XP_002283083.2| PREDICTED: uncharacterized protein LOC100249152 [Vitis vinifera]
          Length = 708

 Score =  268 bits (686), Expect = 5e-69
 Identities = 174/428 (40%), Positives = 231/428 (53%), Gaps = 86/428 (20%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS GG+ + +S S   +CSE T++IKIKTLDS+T++++VDKC+ V  LKE+IAS+ GVL
Sbjct: 1    MGSTGGDEVMISGSGEAQCSEATVEIKIKTLDSQTYTLRVDKCMPVPALKEQIASVTGVL 60

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRG+VLKDDQLLSAY+VEDGHTLHLVVR+P  PS E+ PD+ A          
Sbjct: 61   SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVVRQPFPPSSESLPDNSATDPASNTLRN 120

Query: 494  XGNQLGPGMPII-EPGDGAFPDLNRVVSAVLNAFGFRITRFGSSG--------------- 625
             G  +G  + ++ E GDG  PDL+R+VSAVL++FG    R GS G               
Sbjct: 121  QGFHVGSSVVVLSEQGDGV-PDLSRIVSAVLSSFGVNNVRSGSEGADPRDPVPERGSGTP 179

Query: 626  -------------------------------------EGIDSDTIPGSLTTLSEYISNLR 694
                                                 E +    IP SLTTLS+Y+ N+R
Sbjct: 180  GLSGLRDSSRQQPNQAATIDQSNPLNGASLLPNDVTLEQLQPPVIPDSLTTLSQYLRNMR 239

Query: 695  REFIANAGGQNTDSTNDGLPGSNLHDLE-VLPCCSGCRGVRTVMYLAELLSSTRQLLGEQ 871
             EF  +  G   +S   G+ G ++ + E  L       G+ T   LAE++ STRQ+L EQ
Sbjct: 240  HEFGGSVRGHGNNSA-AGIHGCDVQNSEATLQSDVTQGGLPTPASLAEVILSTRQILIEQ 298

Query: 872  ATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQT 1051
            A E           H ++TD L R  I S+A+R G+I ++LG+LLLELGR   T++MGQT
Sbjct: 299  AAEDLSQLTRQLENHANVTDPLARRSIQSNALRLGAILRNLGALLLELGRTTMTLRMGQT 358

Query: 1052 PADALVNAGSPVYIRPTG----------------------ASVGPGDN----------LP 1135
            P DA+VNAG  ++I  +G                       +V PG            LP
Sbjct: 359  PNDAVVNAGPALFISTSGPNPIMVQPLPFHPGTSFGAIPMGTVQPGSGFSSGTLRSGFLP 418

Query: 1136 RNIDITIR 1159
            RNIDI IR
Sbjct: 419  RNIDIRIR 426


>emb|CBI21108.3| unnamed protein product [Vitis vinifera]
          Length = 573

 Score =  268 bits (686), Expect = 5e-69
 Identities = 174/428 (40%), Positives = 231/428 (53%), Gaps = 86/428 (20%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS GG+ + +S S   +CSE T++IKIKTLDS+T++++VDKC+ V  LKE+IAS+ GVL
Sbjct: 1    MGSTGGDEVMISGSGEAQCSEATVEIKIKTLDSQTYTLRVDKCMPVPALKEQIASVTGVL 60

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRG+VLKDDQLLSAY+VEDGHTLHLVVR+P  PS E+ PD+ A          
Sbjct: 61   SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVVRQPFPPSSESLPDNSATDPASNTLRN 120

Query: 494  XGNQLGPGMPII-EPGDGAFPDLNRVVSAVLNAFGFRITRFGSSG--------------- 625
             G  +G  + ++ E GDG  PDL+R+VSAVL++FG    R GS G               
Sbjct: 121  QGFHVGSSVVVLSEQGDGV-PDLSRIVSAVLSSFGVNNVRSGSEGADPRDPVPERGSGTP 179

Query: 626  -------------------------------------EGIDSDTIPGSLTTLSEYISNLR 694
                                                 E +    IP SLTTLS+Y+ N+R
Sbjct: 180  GLSGLRDSSRQQPNQAATIDQSNPLNGASLLPNDVTLEQLQPPVIPDSLTTLSQYLRNMR 239

Query: 695  REFIANAGGQNTDSTNDGLPGSNLHDLE-VLPCCSGCRGVRTVMYLAELLSSTRQLLGEQ 871
             EF  +  G   +S   G+ G ++ + E  L       G+ T   LAE++ STRQ+L EQ
Sbjct: 240  HEFGGSVRGHGNNSA-AGIHGCDVQNSEATLQSDVTQGGLPTPASLAEVILSTRQILIEQ 298

Query: 872  ATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQT 1051
            A E           H ++TD L R  I S+A+R G+I ++LG+LLLELGR   T++MGQT
Sbjct: 299  AAEDLSQLTRQLENHANVTDPLARRSIQSNALRLGAILRNLGALLLELGRTTMTLRMGQT 358

Query: 1052 PADALVNAGSPVYIRPTG----------------------ASVGPGDN----------LP 1135
            P DA+VNAG  ++I  +G                       +V PG            LP
Sbjct: 359  PNDAVVNAGPALFISTSGPNPIMVQPLPFHPGTSFGAIPMGTVQPGSGFSSGTLRSGFLP 418

Query: 1136 RNIDITIR 1159
            RNIDI IR
Sbjct: 419  RNIDIRIR 426


>ref|XP_006355909.1| PREDICTED: large proline-rich protein bag6-B-like isoform X1 [Solanum
            tuberosum] gi|565378956|ref|XP_006355910.1| PREDICTED:
            large proline-rich protein bag6-B-like isoform X2
            [Solanum tuberosum]
          Length = 703

 Score =  267 bits (682), Expect = 1e-68
 Identities = 172/425 (40%), Positives = 235/425 (55%), Gaps = 83/425 (19%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            M SNG E++++  S   EC ETT++IKIK LDS+T++++VDKCV V  LKE+IA++ GVL
Sbjct: 1    MVSNGAEDVQICGSGEAECPETTVEIKIKMLDSQTYTLRVDKCVPVPALKEQIATVTGVL 60

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            +E+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+   PS +++PD  A          
Sbjct: 61   TEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQ---PSSDSTPDPQATASASSAGYS 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGID------- 637
             GN++ PG+ +      + GDG FPDLNR+V+AVL +FG  I   G    GID       
Sbjct: 118  QGNRVSPGVVVGTYSSSDHGDGIFPDLNRIVTAVLGSFG--IASAGGGNAGIDLHGFGPA 175

Query: 638  ---------------SDT-----------------------IPGSLTTLSEYISNLRREF 703
                           +DT                       IP SLTTL++Y+S+L  EF
Sbjct: 176  SLGNIRDSGRSQTEQADTRDQSSVTTSASARPTDVPLQAPVIPDSLTTLTQYLSHLTAEF 235

Query: 704  IANAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATEC 883
             AN  GQ+  + + G+  ++   LE      G RG  T   LAE++  TRQL  EQ  EC
Sbjct: 236  RANVRGQSETTQSAGVHVADRTALEATTHSIGERGFPTPASLAEVIILTRQLFMEQVVEC 295

Query: 884  XXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTPADA 1063
                        ++T+  ER RI S A+R+G +F+++G++LLELGR   T++MG+TP DA
Sbjct: 296  LSQFSTLLENQANVTNPGERMRIQSYALRTGGLFRNIGAMLLELGRTTMTLRMGETPDDA 355

Query: 1064 LVNAG----------SPVYIRP-----------------------TGASVGPGDNLPRNI 1144
            +VNAG          +P+ ++P                       +G SV  G  +PRNI
Sbjct: 356  VVNAGPAVFVSTAGPNPIMVQPLPFQPNTSFGAVPVGTVQNNTGFSGGSVSSG-FIPRNI 414

Query: 1145 DITIR 1159
            DI IR
Sbjct: 415  DIRIR 419


>gb|EOY31281.1| Ubiquitin-like superfamily protein, putative isoform 8 [Theobroma
            cacao] gi|508784026|gb|EOY31282.1| Ubiquitin-like
            superfamily protein, putative isoform 8 [Theobroma cacao]
            gi|508784027|gb|EOY31283.1| Ubiquitin-like superfamily
            protein, putative isoform 8 [Theobroma cacao]
          Length = 575

 Score =  244 bits (622), Expect = 1e-61
 Identities = 174/439 (39%), Positives = 226/439 (51%), Gaps = 97/439 (22%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 634
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 635  ----------DSD----------------------------------TIPGSLTTLSEYI 682
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 683  SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 835
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 836  LLSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 1015
            +L +TRQLL EQA EC            ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 1016 GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 1129
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 1130 ---------LPRNIDITIR 1159
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31280.1| Ubiquitin-like superfamily protein, putative isoform 7 [Theobroma
            cacao]
          Length = 725

 Score =  244 bits (622), Expect = 1e-61
 Identities = 174/439 (39%), Positives = 226/439 (51%), Gaps = 97/439 (22%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 634
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 635  ----------DSD----------------------------------TIPGSLTTLSEYI 682
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 683  SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 835
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 836  LLSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 1015
            +L +TRQLL EQA EC            ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 1016 GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 1129
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 1130 ---------LPRNIDITIR 1159
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31278.1| Ubiquitin-like superfamily protein, putative isoform 5 [Theobroma
            cacao]
          Length = 729

 Score =  244 bits (622), Expect = 1e-61
 Identities = 174/439 (39%), Positives = 226/439 (51%), Gaps = 97/439 (22%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 634
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 635  ----------DSD----------------------------------TIPGSLTTLSEYI 682
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 683  SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 835
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 836  LLSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 1015
            +L +TRQLL EQA EC            ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 1016 GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 1129
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 1130 ---------LPRNIDITIR 1159
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31277.1| Ubiquitin-like superfamily protein, putative isoform 4 [Theobroma
            cacao]
          Length = 724

 Score =  244 bits (622), Expect = 1e-61
 Identities = 174/439 (39%), Positives = 226/439 (51%), Gaps = 97/439 (22%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 634
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 635  ----------DSD----------------------------------TIPGSLTTLSEYI 682
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 683  SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 835
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 836  LLSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 1015
            +L +TRQLL EQA EC            ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 1016 GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 1129
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 1130 ---------LPRNIDITIR 1159
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31276.1| Ubiquitin-like superfamily protein, putative isoform 3 [Theobroma
            cacao]
          Length = 730

 Score =  244 bits (622), Expect = 1e-61
 Identities = 174/439 (39%), Positives = 226/439 (51%), Gaps = 97/439 (22%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 634
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 635  ----------DSD----------------------------------TIPGSLTTLSEYI 682
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 683  SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 835
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 836  LLSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 1015
            +L +TRQLL EQA EC            ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 1016 GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 1129
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 1130 ---------LPRNIDITIR 1159
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31275.1| Ubiquitin-like superfamily protein, putative isoform 2 [Theobroma
            cacao] gi|508784028|gb|EOY31284.1| Ubiquitin-like
            superfamily protein, putative isoform 2 [Theobroma cacao]
          Length = 579

 Score =  244 bits (622), Expect = 1e-61
 Identities = 174/439 (39%), Positives = 226/439 (51%), Gaps = 97/439 (22%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 634
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 635  ----------DSD----------------------------------TIPGSLTTLSEYI 682
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 683  SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 835
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 836  LLSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 1015
            +L +TRQLL EQA EC            ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 1016 GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 1129
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 1130 ---------LPRNIDITIR 1159
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31274.1| Ubiquitin-like superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|508784023|gb|EOY31279.1| Ubiquitin-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 724

 Score =  244 bits (622), Expect = 1e-61
 Identities = 174/439 (39%), Positives = 226/439 (51%), Gaps = 97/439 (22%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 634
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 635  ----------DSD----------------------------------TIPGSLTTLSEYI 682
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 683  SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 835
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 836  LLSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 1015
            +L +TRQLL EQA EC            ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 1016 GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 1129
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 1130 ---------LPRNIDITIR 1159
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EMJ05831.1| hypothetical protein PRUPE_ppa002041mg [Prunus persica]
            gi|462400164|gb|EMJ05832.1| hypothetical protein
            PRUPE_ppa002041mg [Prunus persica]
          Length = 725

 Score =  242 bits (618), Expect = 3e-61
 Identities = 177/440 (40%), Positives = 230/440 (52%), Gaps = 94/440 (21%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGSN  E+I + E    E SETT++IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSNSAEHIPMCEQI--EGSETTVEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+P+ PSPE  P+HP           
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPILPSPEGLPNHPGTDPASSTSRG 118

Query: 494  XGNQLGPG-------MPIIEPGDGAFPDLNRVVSAVLNAFG------------------- 595
              +Q+ PG       MP+   GDG  P+++R++SA+L + G                   
Sbjct: 119  Q-HQVAPGVVIETFSMPV--QGDGFPPEISRIISAILGSIGLPNISGGSEGIEVRRPERT 175

Query: 596  ------FRITRFGSSGEG----------------------IDSDTIPGSLTTLSEYISNL 691
                  F  ++F S   G                      +    IP SLTTLS+Y+S+L
Sbjct: 176  PGLSGMFDFSQFQSEQAGPRGPSDRSNGTFGHPTDFSLGTLPPLVIPDSLTTLSQYLSHL 235

Query: 692  RREF--IANAGGQ-NTDSTNDGLPGSNLHDLEVLPCCSGCR--GVRTVMYLAELLSSTRQ 856
            RREF  IA  GG+  T +T      SN          +G R  G+ +   LAE++ STR 
Sbjct: 236  RREFEAIARDGGRGQTAATLRTEESSNASS------HTGARQEGLPSPASLAEVMRSTRL 289

Query: 857  LLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTM 1036
            LL EQ  E             ++TD   R    S A R+G++F +LG+ LLELGR   T+
Sbjct: 290  LLLEQVGESLLQFASQLENQVNVTDPSARFSAQSSASRTGALFHNLGAFLLELGRTTMTL 349

Query: 1037 QMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDNL------ 1132
            Q+GQTP+DA+VNAG  V+I PTG                       +V PG  L      
Sbjct: 350  QLGQTPSDAVVNAGPAVFISPTGPNPIMVQPLPFQSGMSFGAIPMGAVQPGSGLVNGLGT 409

Query: 1133 ---PRNIDITIR----AVTP 1171
               PR IDI IR    A TP
Sbjct: 410  GFVPRRIDIQIRRGSSATTP 429


>ref|XP_002532506.1| scythe/bat3, putative [Ricinus communis] gi|223527781|gb|EEF29882.1|
            scythe/bat3, putative [Ricinus communis]
          Length = 709

 Score =  240 bits (612), Expect = 2e-60
 Identities = 165/427 (38%), Positives = 219/427 (51%), Gaps = 85/427 (19%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGS+G +  K+  +D  E SETTI+IK+KTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSDGAQ--KIPGTDVAEGSETTIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+PV PS +   +H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVIPSSDGLSNHSATDPASSTSR- 117

Query: 494  XGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGID------- 637
                + P + I      + GDG  P+++R+VSAVL +FGF     GS GEG+D       
Sbjct: 118  --GHVAPSVVIETFSMPDQGDGVPPEISRIVSAVLGSFGF--PNIGSGGEGVDVARERDQ 173

Query: 638  ----------------------SD--------------------TIPGSLTTLSEYISNL 691
                                  SD                     IP SLTTLS+Y+S++
Sbjct: 174  HRSAAASPEAAQLQPEQGSRIQSDRSQSVFGLPTTVSLGSLHPPIIPDSLTTLSQYLSHM 233

Query: 692  RREFIANAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQ 871
            RREF      +  +         +    E LP         T  YLAE+++S+RQ + EQ
Sbjct: 234  RREFNTIEATRRDEQRETNSTSRSGTGQERLP---------TPAYLAEVITSSRQFINEQ 284

Query: 872  ATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQT 1051
              EC            ++TD+  R  I S A R+G    +LG+ LLELGR   T+++GQ 
Sbjct: 285  VAECLQQLARQLENQANVTDSAARLNIQSSAWRTGVQLHNLGAFLLELGRTTMTLRLGQA 344

Query: 1052 PADALVNAGSPVYIRPTG----------------------ASVGPGDN---------LPR 1138
            P++A+VNAG  V+I P+G                       SV PG           LPR
Sbjct: 345  PSEAVVNAGPAVFISPSGPNPLMVQPLPFQTGASFGALPLGSVQPGSGLVNGIGTGFLPR 404

Query: 1139 NIDITIR 1159
             IDI IR
Sbjct: 405  RIDIQIR 411


>ref|XP_006289444.1| hypothetical protein CARUB_v10002958mg [Capsella rubella]
            gi|482558150|gb|EOA22342.1| hypothetical protein
            CARUB_v10002958mg [Capsella rubella]
          Length = 660

 Score =  238 bits (607), Expect = 7e-60
 Identities = 157/412 (38%), Positives = 221/412 (53%), Gaps = 70/412 (16%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MG NG + I V   +A +C+   ++IKIKTLDS+T+++ VDKCV V  LKE++A++ GV+
Sbjct: 1    MGDNGKDEIMV---EASQCAGAMVEIKIKTLDSQTYTLHVDKCVPVPALKEQVATVTGVV 57

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            +E+QRLICRGKV+KDDQLLSAY+VEDGHTLHLVVR+PV P  E+S  + A          
Sbjct: 58   TEQQRLICRGKVMKDDQLLSAYHVEDGHTLHLVVRQPVPPISESSTSNAAADPALSAGDS 117

Query: 494  XGNQ----LGPGMPIIEPGDGAFPDLNRVVSAVLNAFGF------------------RIT 607
             G+Q    +     I E  DG + DL ++VSAVL + G                   R++
Sbjct: 118  QGSQRSRVVVGSFNIAEQADGVYSDLGQIVSAVLGSLGISNPEGGIEGIEAMGPLHERLS 177

Query: 608  RFGSSGEGIDSD-----------------------TIPGSLTTLSEYISNLRREFIANAG 718
            R        DS                         IP SLTTLSEY+++LR+EF AN  
Sbjct: 178  RSSGPTSARDSSGGRSATPNTVNQTSTPLTSSQPAVIPDSLTTLSEYLNHLRQEFAAN-- 235

Query: 719  GQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATECXXXXX 898
            G N ++  D    +++ +++     +G   +    +LAE+L STRQLL  +  +C     
Sbjct: 236  GSNANNLQDS--ENSMGNVQDSASTTGESRIPRPSHLAEVLQSTRQLLIGEVADCLSNLS 293

Query: 899  XXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTPADALVNAG 1078
                 H ++TD   R    S+ ++SGS+ ++LG  LLELGR    +++GQTP DA+VNAG
Sbjct: 294  RQLVDHVNVTDPSTRRLCQSNMLQSGSLLENLGISLLELGRATMMLRLGQTPDDAVVNAG 353

Query: 1079 SPVYIRPT----------GASVG------------PGDNL---PRNIDITIR 1159
              V+I PT          G S+G             G +L   PRNI+I IR
Sbjct: 354  PAVFISPTRRNPLASTRQGTSIGGLQAGTAHSNPFAGQSLASAPRNIEIRIR 405


>ref|XP_006394775.1| hypothetical protein EUTSA_v10003804mg [Eutrema salsugineum]
            gi|557091414|gb|ESQ32061.1| hypothetical protein
            EUTSA_v10003804mg [Eutrema salsugineum]
          Length = 646

 Score =  237 bits (605), Expect = 1e-59
 Identities = 143/377 (37%), Positives = 203/377 (53%), Gaps = 45/377 (11%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MG +G + I V E +A +C+   ++I IKTLDS+T +++VDKCV V  LKE+IAS+ GV+
Sbjct: 1    MGQSGKDEIIVREMEASQCASAMVEINIKTLDSQTHTLRVDKCVPVPALKEQIASVTGVV 60

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            +E+QRLICRG+V+KDDQLLSAY+VEDGHTLH+VVR+P+ P  E++  + A          
Sbjct: 61   TEQQRLICRGQVMKDDQLLSAYHVEDGHTLHMVVRQPIPPLSESAASNAAADPALSAGDS 120

Query: 494  XGNQ----LGPGMPIIEPGDGAFPDLNRVVSAVLNAFGF------------------RIT 607
             G+Q    +     I E  DGA+ DL ++VSAVL + G                   R++
Sbjct: 121  QGSQRSRVVVGSFNIAEQADGAYSDLGQIVSAVLGSLGISNTEGAFEGIEALGPLHERLS 180

Query: 608  RFGSSGEGIDSD-----------------------TIPGSLTTLSEYISNLRREFIANAG 718
            R        DS                         +P SLTTLSEY+++LR+EF  N  
Sbjct: 181  RSSGPDAARDSSGARSVTPNAMDQTSTPLTSSQPAAVPDSLTTLSEYLNHLRQEFATNGS 240

Query: 719  GQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATECXXXXX 898
              N    ++   G+          C     +    +LAE+L STRQLL  +  +C     
Sbjct: 241  NANNLQNSENSMGNVQASGSTTEECR----IPRPSHLAEVLQSTRQLLTGEVADCISHVA 296

Query: 899  XXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTPADALVNAG 1078
                 H ++TD   R    S+ + SGS+ ++LG LLLELGR    +++GQTP DA+VNAG
Sbjct: 297  RQLLDHRNVTDPSTRRLCQSNMLHSGSLLENLGILLLELGRATMMLRLGQTPDDAVVNAG 356

Query: 1079 SPVYIRPTGASVGPGDN 1129
              V+I PTG +  P  +
Sbjct: 357  PAVFISPTGRNPLPSQS 373


>gb|EPS60717.1| hypothetical protein M569_14085, partial [Genlisea aurea]
          Length = 346

 Score =  236 bits (603), Expect = 2e-59
 Identities = 151/351 (43%), Positives = 207/351 (58%), Gaps = 47/351 (13%)
 Frame = +2

Query: 143  NGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVLSEK 322
            +GG  IK+  + A ECS T ++IKIKTLDS+TF+++VDKCV V ELK +IAS+ GV+ E+
Sbjct: 8    DGGAQIKLPLNGA-ECSGTMVEIKIKTLDSQTFTLRVDKCVPVPELKRQIASVTGVVMEQ 66

Query: 323  QRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXXXGN 502
            QRLICRG+VLKDDQLLSAY+VEDGHTLHLVVR+P+ P  + S D  A            N
Sbjct: 67   QRLICRGRVLKDDQLLSAYHVEDGHTLHLVVRQPIVPLADMSGD-TATDGLPSSGPVPSN 125

Query: 503  QLGPGM-----PIIEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGIDSDT------- 646
            ++ PG+      +++  D  FP+LN++VSAV N+ G  +TR GS GEGID +        
Sbjct: 126  RVRPGVLIGSFNVLDDIDREFPNLNQIVSAVFNSIG--VTRNGSGGEGIDLNVMHLFAHS 183

Query: 647  -----------------------------------IPGSLTTLSEYISNLRREFIANAGG 721
                                               IP SL+TL +Y ++LR++       
Sbjct: 184  LFSPLQRPPSELPGGLSSDQTASAAASLDSIQPPIIPDSLSTLLQYANHLRQD------- 236

Query: 722  QNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATECXXXXXX 901
            +N+ ST  GLP ++  +L         R + T   LA ++SSTR+LL  +A+EC      
Sbjct: 237  ENSQST--GLPVADGLELGAGVTSGESRRLLTPESLARVMSSTRELLCGEASECLQQLEG 294

Query: 902  XXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTP 1054
                H+S+TD +ERSRI   A+RSG +FQ+LG LLLELGR I T++MG+TP
Sbjct: 295  QLESHSSVTDIVERSRIQYRAVRSGVLFQNLGGLLLELGRTIMTVRMGRTP 345


>ref|NP_197909.4| ubiquitin-like superfamily protein [Arabidopsis thaliana]
            gi|332006037|gb|AED93420.1| ubiquitin-like superfamily
            protein [Arabidopsis thaliana]
          Length = 658

 Score =  236 bits (602), Expect = 2e-59
 Identities = 146/371 (39%), Positives = 208/371 (56%), Gaps = 42/371 (11%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MG NG + I V   +A +C+   ++IKIKTLDS+T++++VDKCV V  LKE++AS+ GV+
Sbjct: 1    MGDNGKDEIMV---EASQCAGAMVEIKIKTLDSQTYTLRVDKCVPVPALKEQVASVTGVV 57

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVA-PSPENSPDHPAIXXXXXXXX 490
            +E+QRLICRGKV+KDDQLLSAY+VEDGHTLHLVVR+PV+  S  N+   PA+        
Sbjct: 58   TEQQRLICRGKVMKDDQLLSAYHVEDGHTLHLVVRQPVSESSTSNAAADPALSAGDSQGS 117

Query: 491  XXGNQLGPGMPIIEPGDGAFPDLNRVVSAVLNAFGF------------------RITRFG 616
                 +     I E  DG + DL ++VSAVL + G                   R++R  
Sbjct: 118  QRSRVVVGSFNIAEQADGVYSDLGQIVSAVLGSLGISNPEGGIEGIDDMGPLHERLSRSS 177

Query: 617  SSGEGIDSD-----------------------TIPGSLTTLSEYISNLRREFIANAGGQN 727
              G   DS                         IP SLTTLSEY+++LR+EF AN  G N
Sbjct: 178  GPGTARDSSGGRSATPNAVDQTSTPLASSQPAAIPDSLTTLSEYLNHLRQEFAAN--GSN 235

Query: 728  TDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATECXXXXXXXX 907
             ++  D    +++ +++     +G   +    +LAE+L STRQLL  +  +C        
Sbjct: 236  ANNLQDS--ENSVGNVQDSASTTGESRIPRPSHLAEVLQSTRQLLIGEVADCLSNLSRQL 293

Query: 908  XXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTPADALVNAGSPV 1087
              H ++TD   R    S+ ++SGS+ + LG  LLELGR    +++GQTP DA+V+AG  V
Sbjct: 294  VDHVNVTDPPTRRLCQSNMLQSGSLLESLGISLLELGRATMMLRLGQTPDDAVVDAGPAV 353

Query: 1088 YIRPTGASVGP 1120
            +I PTG +  P
Sbjct: 354  FISPTGRNPLP 364


>gb|ESW27099.1| hypothetical protein PHAVU_003G173700g [Phaseolus vulgaris]
          Length = 717

 Score =  235 bits (600), Expect = 4e-59
 Identities = 161/390 (41%), Positives = 211/390 (54%), Gaps = 66/390 (16%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGSNG E I ++ S   E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSNGTEKIPINNSA--ESSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLIC+GKVLKDDQLLSAY+VEDGHTLHLVVR+P  P P + P+H            
Sbjct: 59   SERQRLICQGKVLKDDQLLSAYHVEDGHTLHLVVRQPDLPPPGSLPNHSVTEPNSSSSLS 118

Query: 494  XGNQLGPGMPIIE------PGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGID------ 637
              +Q+ PG+  IE       GDG  P++NR+VSAVL + G +     +SGEGID      
Sbjct: 119  HSSQVAPGV-FIETFNVPFQGDGVAPEINRIVSAVLGSIGLQNF---ASGEGIDVREHDS 174

Query: 638  ----------------------------SD--------TIPGSL------------TTLS 673
                                        SD         +P SL            TTLS
Sbjct: 175  QGPGRISGSSGIFDSSHPQPEQSGFRILSDRSRNAFGAPVPVSLGSLQPPVIPDSLTTLS 234

Query: 674  EYISNLRREF--IANAGGQNTDSTNDGLPGSNLHDLEVLPCC----SGCRGVRTVMYLAE 835
            +Y+S++  EF  I   GG    +           ++E  P      S   G  +   LAE
Sbjct: 235  QYLSHISHEFDAIVREGGDIAQA------AEAQRNVETRPVSSRSGSAPEGFSSPTLLAE 288

Query: 836  LLSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 1015
            +L STR+++ EQA EC             ITD L RS I S A+R+G +F +LG+ LLEL
Sbjct: 289  VLLSTRRMIVEQAGECLLQLSRQLENQADITDPLLRSSIQSRALRTGVLFYNLGAFLLEL 348

Query: 1016 GRVITTMQMGQTPADALVNAGSPVYIRPTG 1105
            GR   T+++GQTP +A+VN G  V+I P G
Sbjct: 349  GRTTMTLRLGQTPTEAVVNGGPAVFISPNG 378


>ref|XP_002331046.1| predicted protein [Populus trichocarpa]
            gi|566174703|ref|XP_006381061.1| ubiquitin family protein
            [Populus trichocarpa] gi|550335564|gb|ERP58858.1|
            ubiquitin family protein [Populus trichocarpa]
          Length = 733

 Score =  233 bits (593), Expect = 3e-58
 Identities = 151/380 (39%), Positives = 206/380 (54%), Gaps = 56/380 (14%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGSN  +  K+ ++   E SET I+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSNVAD--KIPKAGEAEGSETNIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+P+  S E   +HP           
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPIPQSSEGLSNHPGNDPASGSSRH 118

Query: 494  XGNQLGPGM-----PIIEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSG----------- 625
             G Q+ P +      + + GDG  P+++R+VSAVL +FGF     GS G           
Sbjct: 119  TG-QVAPSVVIETFSVPDQGDGVPPEISRIVSAVLGSFGFSNIEGGSEGVDVVRERGMSR 177

Query: 626  ---EGIDSDT------------------------------------IPGSLTTLSEYISN 688
                G  +DT                                    IP SLTTLS+Y+S+
Sbjct: 178  TSAAGGSTDTSQLQSEQTGTRGLSDRAQNIFGLPSAVSLGSMNPPVIPDSLTTLSQYLSH 237

Query: 689  LRREFIA-NAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLG 865
            +RREF A    G+N    +       +    +    +G   + T   LAE++ S+RQLL 
Sbjct: 238  MRREFDAIGRVGENNAQADAARRTEQIDSNSMSQSGTGQERLPTPASLAEVIRSSRQLLT 297

Query: 866  EQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMG 1045
            EQ  EC             ITD   R    S A+R+G    +LG+LLLELGR I T+++G
Sbjct: 298  EQVAECLLQLAMQLENQADITDPAVRHTTQSSALRTGVQLHNLGALLLELGRTIMTLRLG 357

Query: 1046 QTPADALVNAGSPVYIRPTG 1105
            Q P++A+VNAG  ++I  +G
Sbjct: 358  QAPSEAIVNAGPAIFINQSG 377


>ref|XP_003609499.1| Large proline-rich protein BAT3 [Medicago truncatula]
            gi|355510554|gb|AES91696.1| Large proline-rich protein
            BAT3 [Medicago truncatula]
          Length = 787

 Score =  230 bits (586), Expect = 2e-57
 Identities = 155/389 (39%), Positives = 213/389 (54%), Gaps = 65/389 (16%)
 Frame = +2

Query: 134  MGSNGGENIKVSESDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 313
            MGSN  E  K+  +++   SETTI+IK+KTLDS+T++++VDK + V  LKE+IA++ GVL
Sbjct: 74   MGSNSVE--KIPSNNSTGSSETTIEIKLKTLDSQTYTLRVDKQMPVPALKEQIATVTGVL 131

Query: 314  SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 493
            SE+QRLIC+GKVLKDDQLLSAY+VEDGHTLHLVVR+P  P P + P+H A          
Sbjct: 132  SEQQRLICQGKVLKDDQLLSAYHVEDGHTLHLVVRQPDLPPPGSLPNHAATEPNSSTSHS 191

Query: 494  XGNQLGPG-------MPIIEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI------ 634
               Q+ PG       +PI   GDG  P++NR+VSAVL +    +  F S GEGI      
Sbjct: 192  HSTQVAPGVFIETFNVPI--QGDGVPPEINRIVSAVLGSI-TGLPNFASGGEGIVVREHD 248

Query: 635  ----------------------------------DSDTIPGSLT--------------TL 670
                                              ++  +P S++              TL
Sbjct: 249  SQGLGRTLDSSGTSDPFRPLLDQTGSQSVSDRLRNTFGLPASVSLGSLQPPVIPGSLTTL 308

Query: 671  SEYISNLRREF--IANAG--GQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAEL 838
            S+Y+S++ REF  I   G  GQ  ++ ++   GS        P  S    + +   LAE+
Sbjct: 309  SQYLSHMSREFDTIVREGGNGQAAEAHSNEAVGSGSS-----PLGSTAENLPSPASLAEV 363

Query: 839  LSSTRQLLGEQATECXXXXXXXXXXHTSITDALERSRILSDAIRSGSIFQHLGSLLLELG 1018
            L STRQ++ EQA EC             ITDA  RS   S A+R+G +F +LG+ LLELG
Sbjct: 364  LLSTRQMIIEQAGECILQLARQLQNQADITDAPSRSTTQSRALRTGLLFYNLGAFLLELG 423

Query: 1019 RVITTMQMGQTPADALVNAGSPVYIRPTG 1105
            R   T+++GQTP++A+VN G  V+I PTG
Sbjct: 424  RTTMTLRLGQTPSEAVVNGGPAVFISPTG 452


Top