BLASTX nr result

ID: Rehmannia23_contig00017691 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00017691
         (1815 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004240703.1| PREDICTED: uncharacterized protein LOC101255...   269   3e-69
ref|XP_002283083.2| PREDICTED: uncharacterized protein LOC100249...   266   2e-68
emb|CBI21108.3| unnamed protein product [Vitis vinifera]              266   2e-68
ref|XP_006355909.1| PREDICTED: large proline-rich protein bag6-B...   265   5e-68
gb|EMJ05831.1| hypothetical protein PRUPE_ppa002041mg [Prunus pe...   244   7e-62
gb|EOY31281.1| Ubiquitin-like superfamily protein, putative isof...   243   3e-61
gb|EOY31280.1| Ubiquitin-like superfamily protein, putative isof...   243   3e-61
gb|EOY31278.1| Ubiquitin-like superfamily protein, putative isof...   243   3e-61
gb|EOY31277.1| Ubiquitin-like superfamily protein, putative isof...   243   3e-61
gb|EOY31276.1| Ubiquitin-like superfamily protein, putative isof...   243   3e-61
gb|EOY31275.1| Ubiquitin-like superfamily protein, putative isof...   243   3e-61
gb|EOY31274.1| Ubiquitin-like superfamily protein, putative isof...   243   3e-61
ref|XP_002532506.1| scythe/bat3, putative [Ricinus communis] gi|...   239   3e-60
ref|XP_006289444.1| hypothetical protein CARUB_v10002958mg [Caps...   238   6e-60
ref|XP_006394775.1| hypothetical protein EUTSA_v10003804mg [Eutr...   238   8e-60
ref|NP_197909.4| ubiquitin-like superfamily protein [Arabidopsis...   236   2e-59
gb|EPS60717.1| hypothetical protein M569_14085, partial [Genlise...   235   4e-59
gb|ESW27099.1| hypothetical protein PHAVU_003G173700g [Phaseolus...   234   7e-59
ref|XP_002331046.1| predicted protein [Populus trichocarpa] gi|5...   232   4e-58
ref|XP_004139265.1| PREDICTED: uncharacterized protein LOC101210...   228   5e-57

>ref|XP_004240703.1| PREDICTED: uncharacterized protein LOC101255405 [Solanum
            lycopersicum]
          Length = 706

 Score =  269 bits (687), Expect = 3e-69
 Identities = 175/427 (40%), Positives = 234/427 (54%), Gaps = 85/427 (19%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            M SNG E++++      EC ETT++IKIK LDS+T++++VDKCV V  LKE+IA++ GVL
Sbjct: 1    MVSNGAEDVQICGSGEAECPETTVEIKIKMLDSQTYTLRVDKCVPVPALKEQIATVTGVL 60

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            +E+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+   PS +++PD  A          
Sbjct: 61   TEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQ---PSSDSTPDPQATASASNAGYS 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGID------- 1198
             GN++ P M +      + GDG FPDLNR+V+AVL +FG  I   G   EGID       
Sbjct: 118  QGNRVSPDMVVGTYSSSDHGDGIFPDLNRIVTAVLGSFG--IASAGGGNEGIDLHGFGPA 175

Query: 1197 ---------------SDT--------------------------IPGSLTTLSEYISNLR 1141
                           +DT                          IP SLTTL++Y+S+L 
Sbjct: 176  SLGNIRDSGRSQTEQADTRDQSNVTNSASARSTDVPPEALQAPVIPDSLTTLTQYLSHLT 235

Query: 1140 REFIANAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQA 961
             EF ANA GQ+  + + G+  ++   LE      G RG  T   LAE++  TRQL  EQ 
Sbjct: 236  VEFRANARGQSETTQSAGVHLADRTALEATAHSIGERGFPTPASLAEVIILTRQLFMEQV 295

Query: 960  TECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTP 781
             EC         N  ++T+  ER RI S A+R+G +F+++G++LLELGR   T++MG+TP
Sbjct: 296  VECLSQFSTLLENQANVTNPGERMRIQSYALRTGGLFRNIGAMLLELGRTAMTLRMGETP 355

Query: 780  ADALVNAGSPVYIRPTG-------------------ASVGPGDN-------------LPR 697
            ADA+VNAG  V++   G                    SVG   N             +PR
Sbjct: 356  ADAVVNAGPAVFVSTAGPNPIMVQPLPFQPNTSFGAVSVGTVQNNTGFSGGSVSSGFIPR 415

Query: 696  NIDITIR 676
            NIDI IR
Sbjct: 416  NIDIRIR 422


>ref|XP_002283083.2| PREDICTED: uncharacterized protein LOC100249152 [Vitis vinifera]
          Length = 708

 Score =  266 bits (681), Expect = 2e-68
 Identities = 174/428 (40%), Positives = 231/428 (53%), Gaps = 86/428 (20%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS GG+ + +S     +CSE T++IKIKTLDS+T++++VDKC+ V  LKE+IAS+ GVL
Sbjct: 1    MGSTGGDEVMISGSGEAQCSEATVEIKIKTLDSQTYTLRVDKCMPVPALKEQIASVTGVL 60

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRG+VLKDDQLLSAY+VEDGHTLHLVVR+P  PS E+ PD+ A          
Sbjct: 61   SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVVRQPFPPSSESLPDNSATDPASNTLRN 120

Query: 1341 SGNQLGPGMPII-EPGDGAFPDLNRVVSAVLNAFGFRITRFGSSG--------------- 1210
             G  +G  + ++ E GDG  PDL+R+VSAVL++FG    R GS G               
Sbjct: 121  QGFHVGSSVVVLSEQGDGV-PDLSRIVSAVLSSFGVNNVRSGSEGADPRDPVPERGSGTP 179

Query: 1209 -------------------------------------EGIDSDTIPGSLTTLSEYISNLR 1141
                                                 E +    IP SLTTLS+Y+ N+R
Sbjct: 180  GLSGLRDSSRQQPNQAATIDQSNPLNGASLLPNDVTLEQLQPPVIPDSLTTLSQYLRNMR 239

Query: 1140 REFIANAGGQNTDSTNDGLPGSNLHDLE-VLPCCSGCRGVRTVMYLAELLSSTRQLLGEQ 964
             EF  +  G   +S   G+ G ++ + E  L       G+ T   LAE++ STRQ+L EQ
Sbjct: 240  HEFGGSVRGHGNNSA-AGIHGCDVQNSEATLQSDVTQGGLPTPASLAEVILSTRQILIEQ 298

Query: 963  ATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQT 784
            A E          NH ++TD L R  I S+A+R G+I ++LG+LLLELGR   T++MGQT
Sbjct: 299  AAEDLSQLTRQLENHANVTDPLARRSIQSNALRLGAILRNLGALLLELGRTTMTLRMGQT 358

Query: 783  PADALVNAGSPVYIRPTG----------------------ASVGPGDN----------LP 700
            P DA+VNAG  ++I  +G                       +V PG            LP
Sbjct: 359  PNDAVVNAGPALFISTSGPNPIMVQPLPFHPGTSFGAIPMGTVQPGSGFSSGTLRSGFLP 418

Query: 699  RNIDITIR 676
            RNIDI IR
Sbjct: 419  RNIDIRIR 426


>emb|CBI21108.3| unnamed protein product [Vitis vinifera]
          Length = 573

 Score =  266 bits (681), Expect = 2e-68
 Identities = 174/428 (40%), Positives = 231/428 (53%), Gaps = 86/428 (20%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS GG+ + +S     +CSE T++IKIKTLDS+T++++VDKC+ V  LKE+IAS+ GVL
Sbjct: 1    MGSTGGDEVMISGSGEAQCSEATVEIKIKTLDSQTYTLRVDKCMPVPALKEQIASVTGVL 60

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRG+VLKDDQLLSAY+VEDGHTLHLVVR+P  PS E+ PD+ A          
Sbjct: 61   SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVVRQPFPPSSESLPDNSATDPASNTLRN 120

Query: 1341 SGNQLGPGMPII-EPGDGAFPDLNRVVSAVLNAFGFRITRFGSSG--------------- 1210
             G  +G  + ++ E GDG  PDL+R+VSAVL++FG    R GS G               
Sbjct: 121  QGFHVGSSVVVLSEQGDGV-PDLSRIVSAVLSSFGVNNVRSGSEGADPRDPVPERGSGTP 179

Query: 1209 -------------------------------------EGIDSDTIPGSLTTLSEYISNLR 1141
                                                 E +    IP SLTTLS+Y+ N+R
Sbjct: 180  GLSGLRDSSRQQPNQAATIDQSNPLNGASLLPNDVTLEQLQPPVIPDSLTTLSQYLRNMR 239

Query: 1140 REFIANAGGQNTDSTNDGLPGSNLHDLE-VLPCCSGCRGVRTVMYLAELLSSTRQLLGEQ 964
             EF  +  G   +S   G+ G ++ + E  L       G+ T   LAE++ STRQ+L EQ
Sbjct: 240  HEFGGSVRGHGNNSA-AGIHGCDVQNSEATLQSDVTQGGLPTPASLAEVILSTRQILIEQ 298

Query: 963  ATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQT 784
            A E          NH ++TD L R  I S+A+R G+I ++LG+LLLELGR   T++MGQT
Sbjct: 299  AAEDLSQLTRQLENHANVTDPLARRSIQSNALRLGAILRNLGALLLELGRTTMTLRMGQT 358

Query: 783  PADALVNAGSPVYIRPTG----------------------ASVGPGDN----------LP 700
            P DA+VNAG  ++I  +G                       +V PG            LP
Sbjct: 359  PNDAVVNAGPALFISTSGPNPIMVQPLPFHPGTSFGAIPMGTVQPGSGFSSGTLRSGFLP 418

Query: 699  RNIDITIR 676
            RNIDI IR
Sbjct: 419  RNIDIRIR 426


>ref|XP_006355909.1| PREDICTED: large proline-rich protein bag6-B-like isoform X1 [Solanum
            tuberosum] gi|565378956|ref|XP_006355910.1| PREDICTED:
            large proline-rich protein bag6-B-like isoform X2
            [Solanum tuberosum]
          Length = 703

 Score =  265 bits (677), Expect = 5e-68
 Identities = 172/425 (40%), Positives = 235/425 (55%), Gaps = 83/425 (19%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            M SNG E++++      EC ETT++IKIK LDS+T++++VDKCV V  LKE+IA++ GVL
Sbjct: 1    MVSNGAEDVQICGSGEAECPETTVEIKIKMLDSQTYTLRVDKCVPVPALKEQIATVTGVL 60

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            +E+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+   PS +++PD  A          
Sbjct: 61   TEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQ---PSSDSTPDPQATASASSAGYS 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGID------- 1198
             GN++ PG+ +      + GDG FPDLNR+V+AVL +FG  I   G    GID       
Sbjct: 118  QGNRVSPGVVVGTYSSSDHGDGIFPDLNRIVTAVLGSFG--IASAGGGNAGIDLHGFGPA 175

Query: 1197 ---------------SDT-----------------------IPGSLTTLSEYISNLRREF 1132
                           +DT                       IP SLTTL++Y+S+L  EF
Sbjct: 176  SLGNIRDSGRSQTEQADTRDQSSVTTSASARPTDVPLQAPVIPDSLTTLTQYLSHLTAEF 235

Query: 1131 IANAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATEC 952
             AN  GQ+  + + G+  ++   LE      G RG  T   LAE++  TRQL  EQ  EC
Sbjct: 236  RANVRGQSETTQSAGVHVADRTALEATTHSIGERGFPTPASLAEVIILTRQLFMEQVVEC 295

Query: 951  XXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTPADA 772
                     N  ++T+  ER RI S A+R+G +F+++G++LLELGR   T++MG+TP DA
Sbjct: 296  LSQFSTLLENQANVTNPGERMRIQSYALRTGGLFRNIGAMLLELGRTTMTLRMGETPDDA 355

Query: 771  LVNAG----------SPVYIRP-----------------------TGASVGPGDNLPRNI 691
            +VNAG          +P+ ++P                       +G SV  G  +PRNI
Sbjct: 356  VVNAGPAVFVSTAGPNPIMVQPLPFQPNTSFGAVPVGTVQNNTGFSGGSVSSG-FIPRNI 414

Query: 690  DITIR 676
            DI IR
Sbjct: 415  DIRIR 419


>gb|EMJ05831.1| hypothetical protein PRUPE_ppa002041mg [Prunus persica]
            gi|462400164|gb|EMJ05832.1| hypothetical protein
            PRUPE_ppa002041mg [Prunus persica]
          Length = 725

 Score =  244 bits (624), Expect = 7e-62
 Identities = 178/440 (40%), Positives = 232/440 (52%), Gaps = 94/440 (21%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGSN  E+I +  C+  E SETT++IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSNSAEHIPM--CEQIEGSETTVEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+P+ PSPE  P+HP           
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPILPSPEGLPNHPGTDPASSTSRG 118

Query: 1341 SGNQLGPG-------MPIIEPGDGAFPDLNRVVSAVLNAFG------------------- 1240
              +Q+ PG       MP+   GDG  P+++R++SA+L + G                   
Sbjct: 119  Q-HQVAPGVVIETFSMPV--QGDGFPPEISRIISAILGSIGLPNISGGSEGIEVRRPERT 175

Query: 1239 ------FRITRFGSSGEG----------------------IDSDTIPGSLTTLSEYISNL 1144
                  F  ++F S   G                      +    IP SLTTLS+Y+S+L
Sbjct: 176  PGLSGMFDFSQFQSEQAGPRGPSDRSNGTFGHPTDFSLGTLPPLVIPDSLTTLSQYLSHL 235

Query: 1143 RREF--IANAGGQ-NTDSTNDGLPGSNLHDLEVLPCCSGCR--GVRTVMYLAELLSSTRQ 979
            RREF  IA  GG+  T +T      SN          +G R  G+ +   LAE++ STR 
Sbjct: 236  RREFEAIARDGGRGQTAATLRTEESSNASS------HTGARQEGLPSPASLAEVMRSTRL 289

Query: 978  LLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTM 799
            LL EQ  E          N  ++TD   R    S A R+G++F +LG+ LLELGR   T+
Sbjct: 290  LLLEQVGESLLQFASQLENQVNVTDPSARFSAQSSASRTGALFHNLGAFLLELGRTTMTL 349

Query: 798  QMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDNL------ 703
            Q+GQTP+DA+VNAG  V+I PTG                       +V PG  L      
Sbjct: 350  QLGQTPSDAVVNAGPAVFISPTGPNPIMVQPLPFQSGMSFGAIPMGAVQPGSGLVNGLGT 409

Query: 702  ---PRNIDITIR----AVTP 664
               PR IDI IR    A TP
Sbjct: 410  GFVPRRIDIQIRRGSSATTP 429


>gb|EOY31281.1| Ubiquitin-like superfamily protein, putative isoform 8 [Theobroma
            cacao] gi|508784026|gb|EOY31282.1| Ubiquitin-like
            superfamily protein, putative isoform 8 [Theobroma cacao]
            gi|508784027|gb|EOY31283.1| Ubiquitin-like superfamily
            protein, putative isoform 8 [Theobroma cacao]
          Length = 575

 Score =  243 bits (619), Expect = 3e-61
 Identities = 174/439 (39%), Positives = 227/439 (51%), Gaps = 97/439 (22%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 1201
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 1200 ----------DSD----------------------------------TIPGSLTTLSEYI 1153
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 1152 SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 1000
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 999  LLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 820
            +L +TRQLL EQA EC         +  ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 819  GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 706
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 705  ---------LPRNIDITIR 676
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31280.1| Ubiquitin-like superfamily protein, putative isoform 7 [Theobroma
            cacao]
          Length = 725

 Score =  243 bits (619), Expect = 3e-61
 Identities = 174/439 (39%), Positives = 227/439 (51%), Gaps = 97/439 (22%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 1201
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 1200 ----------DSD----------------------------------TIPGSLTTLSEYI 1153
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 1152 SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 1000
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 999  LLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 820
            +L +TRQLL EQA EC         +  ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 819  GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 706
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 705  ---------LPRNIDITIR 676
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31278.1| Ubiquitin-like superfamily protein, putative isoform 5 [Theobroma
            cacao]
          Length = 729

 Score =  243 bits (619), Expect = 3e-61
 Identities = 174/439 (39%), Positives = 227/439 (51%), Gaps = 97/439 (22%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 1201
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 1200 ----------DSD----------------------------------TIPGSLTTLSEYI 1153
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 1152 SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 1000
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 999  LLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 820
            +L +TRQLL EQA EC         +  ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 819  GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 706
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 705  ---------LPRNIDITIR 676
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31277.1| Ubiquitin-like superfamily protein, putative isoform 4 [Theobroma
            cacao]
          Length = 724

 Score =  243 bits (619), Expect = 3e-61
 Identities = 174/439 (39%), Positives = 227/439 (51%), Gaps = 97/439 (22%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 1201
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 1200 ----------DSD----------------------------------TIPGSLTTLSEYI 1153
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 1152 SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 1000
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 999  LLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 820
            +L +TRQLL EQA EC         +  ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 819  GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 706
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 705  ---------LPRNIDITIR 676
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31276.1| Ubiquitin-like superfamily protein, putative isoform 3 [Theobroma
            cacao]
          Length = 730

 Score =  243 bits (619), Expect = 3e-61
 Identities = 174/439 (39%), Positives = 227/439 (51%), Gaps = 97/439 (22%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 1201
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 1200 ----------DSD----------------------------------TIPGSLTTLSEYI 1153
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 1152 SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 1000
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 999  LLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 820
            +L +TRQLL EQA EC         +  ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 819  GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 706
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 705  ---------LPRNIDITIR 676
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31275.1| Ubiquitin-like superfamily protein, putative isoform 2 [Theobroma
            cacao] gi|508784028|gb|EOY31284.1| Ubiquitin-like
            superfamily protein, putative isoform 2 [Theobroma cacao]
          Length = 579

 Score =  243 bits (619), Expect = 3e-61
 Identities = 174/439 (39%), Positives = 227/439 (51%), Gaps = 97/439 (22%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 1201
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 1200 ----------DSD----------------------------------TIPGSLTTLSEYI 1153
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 1152 SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 1000
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 999  LLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 820
            +L +TRQLL EQA EC         +  ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 819  GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 706
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 705  ---------LPRNIDITIR 676
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>gb|EOY31274.1| Ubiquitin-like superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|508784023|gb|EOY31279.1| Ubiquitin-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 724

 Score =  243 bits (619), Expect = 3e-61
 Identities = 174/439 (39%), Positives = 227/439 (51%), Gaps = 97/439 (22%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS G +  KV      E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSTGAD--KVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+PV PS + SP H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSP-HSANDSASGTSRG 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI-------- 1201
              N + P + I      + GDG  P+++R+VSAVL +FGF     G+ G  +        
Sbjct: 118  HSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREHGSQRL 177

Query: 1200 ----------DSD----------------------------------TIPGSLTTLSEYI 1153
                      DS                                    IP SL TLS+Y+
Sbjct: 178  ERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYL 237

Query: 1152 SNLRREF--IANAGGQNTDSTN-------DGLPGSNLHDLEVLPCCSGCRGVRTVMYLAE 1000
            S+LRREF  I  AGG++  + +       D  P SN   ++         G+ T   LAE
Sbjct: 238  SHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQ--------EGLPTPASLAE 289

Query: 999  LLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 820
            +L +TRQLL EQA EC         +  ++TD+  R    S A R+G + Q+LGSL LEL
Sbjct: 290  VLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLEL 349

Query: 819  GRVITTMQMGQTPADALVNAGSPVYIRPTG----------------------ASVGPGDN 706
            GR   T+++GQTP++A+VNAG  V+I P+G                       +V PG  
Sbjct: 350  GRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSG 409

Query: 705  ---------LPRNIDITIR 676
                     LPR IDI IR
Sbjct: 410  LVNGLGTGLLPRRIDIQIR 428


>ref|XP_002532506.1| scythe/bat3, putative [Ricinus communis] gi|223527781|gb|EEF29882.1|
            scythe/bat3, putative [Ricinus communis]
          Length = 709

 Score =  239 bits (610), Expect = 3e-60
 Identities = 166/427 (38%), Positives = 219/427 (51%), Gaps = 85/427 (19%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGS+G +  K+   D  E SETTI+IK+KTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSDGAQ--KIPGTDVAEGSETTIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+PV PS +   +H A          
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVIPSSDGLSNHSATDPASSTSR- 117

Query: 1341 SGNQLGPGMPI-----IEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGID------- 1198
                + P + I      + GDG  P+++R+VSAVL +FGF     GS GEG+D       
Sbjct: 118  --GHVAPSVVIETFSMPDQGDGVPPEISRIVSAVLGSFGF--PNIGSGGEGVDVARERDQ 173

Query: 1197 ----------------------SD--------------------TIPGSLTTLSEYISNL 1144
                                  SD                     IP SLTTLS+Y+S++
Sbjct: 174  HRSAAASPEAAQLQPEQGSRIQSDRSQSVFGLPTTVSLGSLHPPIIPDSLTTLSQYLSHM 233

Query: 1143 RREFIANAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQ 964
            RREF      +  +         +    E LP         T  YLAE+++S+RQ + EQ
Sbjct: 234  RREFNTIEATRRDEQRETNSTSRSGTGQERLP---------TPAYLAEVITSSRQFINEQ 284

Query: 963  ATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQT 784
              EC         N  ++TD+  R  I S A R+G    +LG+ LLELGR   T+++GQ 
Sbjct: 285  VAECLQQLARQLENQANVTDSAARLNIQSSAWRTGVQLHNLGAFLLELGRTTMTLRLGQA 344

Query: 783  PADALVNAGSPVYIRPTG----------------------ASVGPGDN---------LPR 697
            P++A+VNAG  V+I P+G                       SV PG           LPR
Sbjct: 345  PSEAVVNAGPAVFISPSGPNPLMVQPLPFQTGASFGALPLGSVQPGSGLVNGIGTGFLPR 404

Query: 696  NIDITIR 676
             IDI IR
Sbjct: 405  RIDIQIR 411


>ref|XP_006289444.1| hypothetical protein CARUB_v10002958mg [Capsella rubella]
            gi|482558150|gb|EOA22342.1| hypothetical protein
            CARUB_v10002958mg [Capsella rubella]
          Length = 660

 Score =  238 bits (607), Expect = 6e-60
 Identities = 157/412 (38%), Positives = 222/412 (53%), Gaps = 70/412 (16%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MG NG + I V   +A +C+   ++IKIKTLDS+T+++ VDKCV V  LKE++A++ GV+
Sbjct: 1    MGDNGKDEIMV---EASQCAGAMVEIKIKTLDSQTYTLHVDKCVPVPALKEQVATVTGVV 57

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            +E+QRLICRGKV+KDDQLLSAY+VEDGHTLHLVVR+PV P  E+S  + A          
Sbjct: 58   TEQQRLICRGKVMKDDQLLSAYHVEDGHTLHLVVRQPVPPISESSTSNAAADPALSAGDS 117

Query: 1341 SGNQ----LGPGMPIIEPGDGAFPDLNRVVSAVLNAFGF------------------RIT 1228
             G+Q    +     I E  DG + DL ++VSAVL + G                   R++
Sbjct: 118  QGSQRSRVVVGSFNIAEQADGVYSDLGQIVSAVLGSLGISNPEGGIEGIEAMGPLHERLS 177

Query: 1227 RFGSSGEGIDSD-----------------------TIPGSLTTLSEYISNLRREFIANAG 1117
            R        DS                         IP SLTTLSEY+++LR+EF AN  
Sbjct: 178  RSSGPTSARDSSGGRSATPNTVNQTSTPLTSSQPAVIPDSLTTLSEYLNHLRQEFAAN-- 235

Query: 1116 GQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATECXXXXX 937
            G N ++  D    +++ +++     +G   +    +LAE+L STRQLL  +  +C     
Sbjct: 236  GSNANNLQDS--ENSMGNVQDSASTTGESRIPRPSHLAEVLQSTRQLLIGEVADCLSNLS 293

Query: 936  XXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTPADALVNAG 757
                +H ++TD   R    S+ ++SGS+ ++LG  LLELGR    +++GQTP DA+VNAG
Sbjct: 294  RQLVDHVNVTDPSTRRLCQSNMLQSGSLLENLGISLLELGRATMMLRLGQTPDDAVVNAG 353

Query: 756  SPVYIRPT----------GASVG------------PGDNL---PRNIDITIR 676
              V+I PT          G S+G             G +L   PRNI+I IR
Sbjct: 354  PAVFISPTRRNPLASTRQGTSIGGLQAGTAHSNPFAGQSLASAPRNIEIRIR 405


>ref|XP_006394775.1| hypothetical protein EUTSA_v10003804mg [Eutrema salsugineum]
            gi|557091414|gb|ESQ32061.1| hypothetical protein
            EUTSA_v10003804mg [Eutrema salsugineum]
          Length = 646

 Score =  238 bits (606), Expect = 8e-60
 Identities = 173/518 (33%), Positives = 245/518 (47%), Gaps = 94/518 (18%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MG +G + I V E +A +C+   ++I IKTLDS+T +++VDKCV V  LKE+IAS+ GV+
Sbjct: 1    MGQSGKDEIIVREMEASQCASAMVEINIKTLDSQTHTLRVDKCVPVPALKEQIASVTGVV 60

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            +E+QRLICRG+V+KDDQLLSAY+VEDGHTLH+VVR+P+ P  E++  + A          
Sbjct: 61   TEQQRLICRGQVMKDDQLLSAYHVEDGHTLHMVVRQPIPPLSESAASNAAADPALSAGDS 120

Query: 1341 SGNQ----LGPGMPIIEPGDGAFPDLNRVVSAVLNAFGF------------------RIT 1228
             G+Q    +     I E  DGA+ DL ++VSAVL + G                   R++
Sbjct: 121  QGSQRSRVVVGSFNIAEQADGAYSDLGQIVSAVLGSLGISNTEGAFEGIEALGPLHERLS 180

Query: 1227 RFGSSGEGIDSD-----------------------TIPGSLTTLSEYISNLRREFIANAG 1117
            R        DS                         +P SLTTLSEY+++LR+EF  N  
Sbjct: 181  RSSGPDAARDSSGARSVTPNAMDQTSTPLTSSQPAAVPDSLTTLSEYLNHLRQEFATNGS 240

Query: 1116 GQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATECXXXXX 937
              N    ++   G+          C     +    +LAE+L STRQLL  +  +C     
Sbjct: 241  NANNLQNSENSMGNVQASGSTTEECR----IPRPSHLAEVLQSTRQLLTGEVADCISHVA 296

Query: 936  XXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTPADALVNAG 757
                +H ++TD   R    S+ + SGS+ ++LG LLLELGR    +++GQTP DA+VNAG
Sbjct: 297  RQLLDHRNVTDPSTRRLCQSNMLHSGSLLENLGILLLELGRATMMLRLGQTPDDAVVNAG 356

Query: 756  SPVYIRPT------------GASVG------------PGDNL---PRNIDITIRA----- 673
              V+I PT            GAS+G             G +L   PRNI+I IR      
Sbjct: 357  PAVFISPTGRNPLPSQSSRLGASIGGLQAGTAHSSAFTGPSLASAPRNIEIRIRTGSWMP 416

Query: 672  -------------VTPREAIXXXXXXXXXXXXXXPVDSGGQDG----VISANGNSPRGSR 544
                          TP +AI               V S         VI      P+GS 
Sbjct: 417  SGGANQREETTTQQTPGQAIPSAASIITDSVPSMRVPSENPRNPVALVIPVVARYPQGSS 476

Query: 543  VADNHLRPECNTEQPMPDSATQQETTQVLGGNGSKDSA 430
              +          QP+ +S+ Q ++T   G  G  +S+
Sbjct: 477  -GERSSTGIDGVHQPVAESSRQPQSTSTPGREGDSNSS 513


>ref|NP_197909.4| ubiquitin-like superfamily protein [Arabidopsis thaliana]
            gi|332006037|gb|AED93420.1| ubiquitin-like superfamily
            protein [Arabidopsis thaliana]
          Length = 658

 Score =  236 bits (602), Expect = 2e-59
 Identities = 146/371 (39%), Positives = 209/371 (56%), Gaps = 42/371 (11%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MG NG + I V   +A +C+   ++IKIKTLDS+T++++VDKCV V  LKE++AS+ GV+
Sbjct: 1    MGDNGKDEIMV---EASQCAGAMVEIKIKTLDSQTYTLRVDKCVPVPALKEQVASVTGVV 57

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVA-PSPENSPDHPAIXXXXXXXX 1345
            +E+QRLICRGKV+KDDQLLSAY+VEDGHTLHLVVR+PV+  S  N+   PA+        
Sbjct: 58   TEQQRLICRGKVMKDDQLLSAYHVEDGHTLHLVVRQPVSESSTSNAAADPALSAGDSQGS 117

Query: 1344 XSGNQLGPGMPIIEPGDGAFPDLNRVVSAVLNAFGF------------------RITRFG 1219
                 +     I E  DG + DL ++VSAVL + G                   R++R  
Sbjct: 118  QRSRVVVGSFNIAEQADGVYSDLGQIVSAVLGSLGISNPEGGIEGIDDMGPLHERLSRSS 177

Query: 1218 SSGEGIDSD-----------------------TIPGSLTTLSEYISNLRREFIANAGGQN 1108
              G   DS                         IP SLTTLSEY+++LR+EF AN  G N
Sbjct: 178  GPGTARDSSGGRSATPNAVDQTSTPLASSQPAAIPDSLTTLSEYLNHLRQEFAAN--GSN 235

Query: 1107 TDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATECXXXXXXXX 928
             ++  D    +++ +++     +G   +    +LAE+L STRQLL  +  +C        
Sbjct: 236  ANNLQDS--ENSVGNVQDSASTTGESRIPRPSHLAEVLQSTRQLLIGEVADCLSNLSRQL 293

Query: 927  XNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTPADALVNAGSPV 748
             +H ++TD   R    S+ ++SGS+ + LG  LLELGR    +++GQTP DA+V+AG  V
Sbjct: 294  VDHVNVTDPPTRRLCQSNMLQSGSLLESLGISLLELGRATMMLRLGQTPDDAVVDAGPAV 353

Query: 747  YIRPTGASVGP 715
            +I PTG +  P
Sbjct: 354  FISPTGRNPLP 364


>gb|EPS60717.1| hypothetical protein M569_14085, partial [Genlisea aurea]
          Length = 346

 Score =  235 bits (600), Expect = 4e-59
 Identities = 150/351 (42%), Positives = 207/351 (58%), Gaps = 47/351 (13%)
 Frame = -1

Query: 1692 NGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVLSEK 1513
            +GG  IK+   +  ECS T ++IKIKTLDS+TF+++VDKCV V ELK +IAS+ GV+ E+
Sbjct: 8    DGGAQIKLP-LNGAECSGTMVEIKIKTLDSQTFTLRVDKCVPVPELKRQIASVTGVVMEQ 66

Query: 1512 QRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXXSGN 1333
            QRLICRG+VLKDDQLLSAY+VEDGHTLHLVVR+P+ P  + S D  A            N
Sbjct: 67   QRLICRGRVLKDDQLLSAYHVEDGHTLHLVVRQPIVPLADMSGD-TATDGLPSSGPVPSN 125

Query: 1332 QLGPGM-----PIIEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGIDSDT------- 1189
            ++ PG+      +++  D  FP+LN++VSAV N+ G  +TR GS GEGID +        
Sbjct: 126  RVRPGVLIGSFNVLDDIDREFPNLNQIVSAVFNSIG--VTRNGSGGEGIDLNVMHLFAHS 183

Query: 1188 -----------------------------------IPGSLTTLSEYISNLRREFIANAGG 1114
                                               IP SL+TL +Y ++LR++       
Sbjct: 184  LFSPLQRPPSELPGGLSSDQTASAAASLDSIQPPIIPDSLSTLLQYANHLRQD------- 236

Query: 1113 QNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLGEQATECXXXXXX 934
            +N+ ST  GLP ++  +L         R + T   LA ++SSTR+LL  +A+EC      
Sbjct: 237  ENSQST--GLPVADGLELGAGVTSGESRRLLTPESLARVMSSTRELLCGEASECLQQLEG 294

Query: 933  XXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMGQTP 781
               +H+S+TD +ERSRI   A+RSG +FQ+LG LLLELGR I T++MG+TP
Sbjct: 295  QLESHSSVTDIVERSRIQYRAVRSGVLFQNLGGLLLELGRTIMTVRMGRTP 345


>gb|ESW27099.1| hypothetical protein PHAVU_003G173700g [Phaseolus vulgaris]
          Length = 717

 Score =  234 bits (598), Expect = 7e-59
 Identities = 161/390 (41%), Positives = 213/390 (54%), Gaps = 66/390 (16%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGSNG E I ++  ++ E SETTI+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSNGTEKIPIN--NSAESSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLIC+GKVLKDDQLLSAY+VEDGHTLHLVVR+P  P P + P+H            
Sbjct: 59   SERQRLICQGKVLKDDQLLSAYHVEDGHTLHLVVRQPDLPPPGSLPNHSVTEPNSSSSLS 118

Query: 1341 SGNQLGPGMPIIE------PGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGID------ 1198
              +Q+ PG+  IE       GDG  P++NR+VSAVL + G +     +SGEGID      
Sbjct: 119  HSSQVAPGV-FIETFNVPFQGDGVAPEINRIVSAVLGSIGLQNF---ASGEGIDVREHDS 174

Query: 1197 ----------------------------SD--------TIPGSL------------TTLS 1162
                                        SD         +P SL            TTLS
Sbjct: 175  QGPGRISGSSGIFDSSHPQPEQSGFRILSDRSRNAFGAPVPVSLGSLQPPVIPDSLTTLS 234

Query: 1161 EYISNLRREF--IANAGGQNTDSTNDGLPGSNLHDLEVLPCC----SGCRGVRTVMYLAE 1000
            +Y+S++  EF  I   GG    +           ++E  P      S   G  +   LAE
Sbjct: 235  QYLSHISHEFDAIVREGGDIAQA------AEAQRNVETRPVSSRSGSAPEGFSSPTLLAE 288

Query: 999  LLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLEL 820
            +L STR+++ EQA EC         N   ITD L RS I S A+R+G +F +LG+ LLEL
Sbjct: 289  VLLSTRRMIVEQAGECLLQLSRQLENQADITDPLLRSSIQSRALRTGVLFYNLGAFLLEL 348

Query: 819  GRVITTMQMGQTPADALVNAGSPVYIRPTG 730
            GR   T+++GQTP +A+VN G  V+I P G
Sbjct: 349  GRTTMTLRLGQTPTEAVVNGGPAVFISPNG 378


>ref|XP_002331046.1| predicted protein [Populus trichocarpa]
            gi|566174703|ref|XP_006381061.1| ubiquitin family protein
            [Populus trichocarpa] gi|550335564|gb|ERP58858.1|
            ubiquitin family protein [Populus trichocarpa]
          Length = 733

 Score =  232 bits (592), Expect = 4e-58
 Identities = 152/380 (40%), Positives = 207/380 (54%), Gaps = 56/380 (14%)
 Frame = -1

Query: 1701 MGSNGGENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGVL 1522
            MGSN  +  K+ +    E SET I+IKIKTLDS+T++++VDK + V  LKE+IAS+ GVL
Sbjct: 1    MGSNVAD--KIPKAGEAEGSETNIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVL 58

Query: 1521 SEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXXX 1342
            SE+QRLICRGKVLKDDQLLSAY+VEDGHTLH+VVR+P+  S E   +HP           
Sbjct: 59   SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPIPQSSEGLSNHPGNDPASGSSRH 118

Query: 1341 SGNQLGPGM-----PIIEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSG----------- 1210
            +G Q+ P +      + + GDG  P+++R+VSAVL +FGF     GS G           
Sbjct: 119  TG-QVAPSVVIETFSVPDQGDGVPPEISRIVSAVLGSFGFSNIEGGSEGVDVVRERGMSR 177

Query: 1209 ---EGIDSDT------------------------------------IPGSLTTLSEYISN 1147
                G  +DT                                    IP SLTTLS+Y+S+
Sbjct: 178  TSAAGGSTDTSQLQSEQTGTRGLSDRAQNIFGLPSAVSLGSMNPPVIPDSLTTLSQYLSH 237

Query: 1146 LRREFIA-NAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCRGVRTVMYLAELLSSTRQLLG 970
            +RREF A    G+N    +       +    +    +G   + T   LAE++ S+RQLL 
Sbjct: 238  MRREFDAIGRVGENNAQADAARRTEQIDSNSMSQSGTGQERLPTPASLAEVIRSSRQLLT 297

Query: 969  EQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLLLELGRVITTMQMG 790
            EQ  EC         N   ITD   R    S A+R+G    +LG+LLLELGR I T+++G
Sbjct: 298  EQVAECLLQLAMQLENQADITDPAVRHTTQSSALRTGVQLHNLGALLLELGRTIMTLRLG 357

Query: 789  QTPADALVNAGSPVYIRPTG 730
            Q P++A+VNAG  ++I  +G
Sbjct: 358  QAPSEAIVNAGPAIFINQSG 377


>ref|XP_004139265.1| PREDICTED: uncharacterized protein LOC101210096 [Cucumis sativus]
            gi|449493679|ref|XP_004159408.1| PREDICTED:
            uncharacterized protein LOC101228995 [Cucumis sativus]
          Length = 709

 Score =  228 bits (582), Expect = 5e-57
 Identities = 172/449 (38%), Positives = 223/449 (49%), Gaps = 100/449 (22%)
 Frame = -1

Query: 1701 MGSNG-GENIKVSECDADECSETTIDIKIKTLDSKTFSVQVDKCVSVAELKEKIASLNGV 1525
            MGSN  GE     E D    SETTI+IK+KTLDS+ ++++VDK + V  LKE+IAS+ GV
Sbjct: 1    MGSNFIGEATSCGEADG---SETTIEIKLKTLDSQIYTLRVDKQMPVPALKEQIASVTGV 57

Query: 1524 LSEKQRLICRGKVLKDDQLLSAYNVEDGHTLHLVVREPVAPSPENSPDHPAIXXXXXXXX 1345
            LSE+QRLICRGKVLKDDQLLSAY+VEDGHTLHLVVR+P+ PS E  P+ P          
Sbjct: 58   LSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPS-ETLPNRPETDPNSSTSR 116

Query: 1344 XSGNQLGPG-------MPIIEPGDGAFPDLNRVVSAVLNAFGFRITRFGSSGEGI----- 1201
               N++ PG       MP+   GDG  P++NR+VSAVL++ G   +  GS G  +     
Sbjct: 117  VHSNRVAPGVVIETFSMPV--QGDGMPPEINRIVSAVLSSIGLSNSVTGSDGMDVVREID 174

Query: 1200 ----------------------DSDTIPGS--------------------------LTTL 1165
                                  D+ + P S                          LTTL
Sbjct: 175  QQRSGERVIAAGMIDLNQHQSGDNGSRPLSDRFHGTSGHPSIPSLGSFPPPVIPDSLTTL 234

Query: 1164 SEYISNLRREF--IANAGGQNTDSTNDGLPGSNLHDLEVLPCCSGCR------GVRTVMY 1009
            S+ + N+RR+F  I   GG N   T       N+H  E     S  R         T   
Sbjct: 235  SQNLGNMRRDFENIGRVGGNNAQET-------NIHGDEESSSNSSSRPSTTQESFPTPAS 287

Query: 1008 LAELLSSTRQLLGEQATECXXXXXXXXXNHTSITDALERSRILSDAIRSGSIFQHLGSLL 829
            LAE++ STRQ+L  + +EC         NH ++TD   R    S A RSG +F +LG+ L
Sbjct: 288  LAEVMLSTRQMLTGEVSECLLQLARQLENHRNVTDPTLRMNTQSSAWRSGVLFNNLGAYL 347

Query: 828  LELGRVITTMQMGQTPADALVNAGSPVYIRPTG--------------ASVGP-------- 715
            LELGR + T++MGQ P++A+VNAG  V+I  TG              AS+GP        
Sbjct: 348  LELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPVPMGTMQP 407

Query: 714  ---------GDNLPRNIDITIRAVTPREA 655
                        LPR IDI IR  +P  A
Sbjct: 408  GSALIHGLGSGFLPRRIDIQIRRGSPTTA 436

