BLASTX nr result

ID: Lithospermum23_contig00019776 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00019776
         (1203 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

OAY51980.1 hypothetical protein MANES_04G047900 [Manihot esculenta]   190   1e-54
XP_007150455.1 hypothetical protein PHAVU_005G154800g [Phaseolus...   184   1e-52
XP_012065219.1 PREDICTED: uncharacterized protein LOC105628422 [...   183   3e-52
XP_014521905.1 PREDICTED: uncharacterized protein LOC106778455 [...   182   7e-52
EOY04786.1 Uncharacterized protein TCM_019960 [Theobroma cacao]       182   9e-52
XP_017423244.1 PREDICTED: uncharacterized protein LOC108332474 [...   182   1e-51
XP_016699093.1 PREDICTED: uncharacterized protein LOC107914640 [...   181   2e-51
KHN33288.1 hypothetical protein glysoja_005330 [Glycine soja]         181   3e-51
XP_017974955.1 PREDICTED: uncharacterized protein LOC18602424 is...   181   4e-51
XP_003546819.1 PREDICTED: uncharacterized protein LOC100801419 [...   180   4e-51
XP_012455983.1 PREDICTED: uncharacterized protein LOC105777320 [...   180   5e-51
XP_016677700.1 PREDICTED: uncharacterized protein LOC107896910 [...   179   1e-50
OMO90318.1 hypothetical protein CCACVL1_07407 [Corchorus capsula...   180   1e-50
XP_011091621.1 PREDICTED: uncharacterized protein LOC105172002 i...   179   2e-50
XP_015583637.1 PREDICTED: uncharacterized protein LOC8284759 [Ri...   178   4e-50
XP_003543574.1 PREDICTED: uncharacterized protein LOC100819879 [...   177   6e-50
XP_011466862.1 PREDICTED: uncharacterized protein LOC101313817 [...   176   2e-49
XP_004486880.1 PREDICTED: uncharacterized protein LOC101498979 [...   175   5e-49
XP_019053511.1 PREDICTED: uncharacterized protein LOC104598244 i...   175   7e-49
KZV34811.1 hypothetical protein F511_00713 [Dorcoceras hygrometr...   174   7e-49

>OAY51980.1 hypothetical protein MANES_04G047900 [Manihot esculenta]
          Length = 240

 Score =  190 bits (482), Expect = 1e-54
 Identities = 113/244 (46%), Positives = 149/244 (61%), Gaps = 16/244 (6%)
 Frame = -1

Query: 918 MTNPSGV---EKNHGSVSAYNATPTR------------SLLFNQGISADWTQEEQAILEE 784
           M NPSG+   + NH   S   A PT             +L  N GIS DW+ EEQAILE+
Sbjct: 1   MANPSGIHHQDANHTPSSIVGANPTNGHGNSVPERSVGALKHNPGISTDWSAEEQAILED 60

Query: 783 GLIQYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKV 604
           GL+Q+A++P ++RYAKI L L NKT+RDVALRC WM T         E  +AR++K+ K 
Sbjct: 61  GLVQFATDPSVSRYAKIALQLPNKTVRDVALRCRWM-TKKEQSKRRKEDNLARKSKDKK- 118

Query: 603 TMEKAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQI 424
             E+ + PSA  S   +       +  +     D GIS   + G    LL+QNA+  NQI
Sbjct: 119 --ERVVDPSAKVSPFMARPNIPPYSTPVIPMDYDDGISCKAIGGITGELLEQNAKVLNQI 176

Query: 423 SMNIATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSA 247
           S N++T  +QENIGLL +TRDNI KI++ L++MP+ MKQM PLP+ LNEELA++ILP S 
Sbjct: 177 SANLSTMQLQENIGLLCQTRDNILKIMNQLNDMPDIMKQMPPLPVKLNEELANTILPHSN 236

Query: 246 LQMK 235
           L MK
Sbjct: 237 LPMK 240


>XP_007150455.1 hypothetical protein PHAVU_005G154800g [Phaseolus vulgaris]
           ESW22449.1 hypothetical protein PHAVU_005G154800g
           [Phaseolus vulgaris]
          Length = 231

 Score =  184 bits (467), Expect = 1e-52
 Identities = 108/229 (47%), Positives = 141/229 (61%), Gaps = 4/229 (1%)
 Frame = -1

Query: 918 MTNPSGVEKNHGSVSAYNATPTRSLL---FNQGISADWTQEEQAILEEGLIQYASEPVIN 748
           M NPSG  + H  VS+     + + L    N GIS DWT EEQAILE+GL +YASE  I 
Sbjct: 1   MANPSGNHQEHTHVSSSAPETSGAALAMKHNPGISLDWTAEEQAILEDGLSKYASESNIV 60

Query: 747 RYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSATP 568
           RYAKI L L++KT+RDVALR  WM           +  +AR++K+ K   E+   P+   
Sbjct: 61  RYAKIALQLQHKTVRDVALRVRWMNKKENSKRRKDDHNLARKSKDKK---ERVSDPAVKS 117

Query: 567 SFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQEN 388
           S  A+    +     + T   D GISY  + GP   LL+QNAQA NQIS N++   +QEN
Sbjct: 118 SSFAARSNVSPYAPPMITMDNDDGISYTAIGGPTGELLEQNAQALNQISTNLSAFQVQEN 177

Query: 387 IGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSAL 244
           I L  +TRDNI KI++ L++ P  MKQM PLP+ +NEELA+SILPR+AL
Sbjct: 178 INLFCQTRDNILKIVNELNDSPEVMKQMPPLPVKVNEELANSILPRTAL 226


>XP_012065219.1 PREDICTED: uncharacterized protein LOC105628422 [Jatropha curcas]
           KDP43904.1 hypothetical protein JCGZ_20914 [Jatropha
           curcas]
          Length = 240

 Score =  183 bits (465), Expect = 3e-52
 Identities = 112/244 (45%), Positives = 144/244 (59%), Gaps = 16/244 (6%)
 Frame = -1

Query: 918 MTNPSGV---EKNHGSVSAYNATPTR------------SLLFNQGISADWTQEEQAILEE 784
           M NPSG    + NH S S     PT             +L  N GIS DW+ EEQAILE+
Sbjct: 1   MANPSGTHHHDANHASSSFNGTNPTNGHGNSVSESSGTALKHNPGISPDWSLEEQAILED 60

Query: 783 GLIQYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKV 604
           GL +YA+E  I RYAKI + L+NKT+RDVALRC WM T         E  +AR++K+ K 
Sbjct: 61  GLSKYAAESNIIRYAKIAMQLQNKTVRDVALRCRWM-TKKENSKRRKEENLARKSKDKK- 118

Query: 603 TMEKAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQI 424
             E+ I P    S   +          +     D GISY  + G    LL+QNA+AFNQI
Sbjct: 119 --ERVIDPPLKASHFMARPNVHPYATPMIPMDYDDGISYKAIGGVTGELLEQNAKAFNQI 176

Query: 423 SMNIATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSA 247
           S N++T  +Q+NI LL +TRDNI KI+  L++MP+ MKQM PLP+ LN+ELA++ILPR  
Sbjct: 177 SANLSTMQIQDNISLLCQTRDNILKIMDELNDMPDVMKQMPPLPVKLNDELANTILPRPN 236

Query: 246 LQMK 235
           L MK
Sbjct: 237 LPMK 240


>XP_014521905.1 PREDICTED: uncharacterized protein LOC106778455 [Vigna radiata var.
           radiata]
          Length = 231

 Score =  182 bits (462), Expect = 7e-52
 Identities = 107/229 (46%), Positives = 140/229 (61%), Gaps = 4/229 (1%)
 Frame = -1

Query: 918 MTNPSGVEKNHGSVSAYNATPTRSLL---FNQGISADWTQEEQAILEEGLIQYASEPVIN 748
           M NPSG  + H  VS+     + + L    N GIS DWT EEQAILE+GL +YASE  I 
Sbjct: 1   MANPSGNHQEHTHVSSSAPETSGAALAMKHNPGISLDWTAEEQAILEDGLSKYASESNIV 60

Query: 747 RYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSATP 568
           RYAKI L L++KT+RDVALR  WM           +  + R++K+ K   E+   P+   
Sbjct: 61  RYAKIALQLQHKTVRDVALRVRWMNKKENSKRRKDDHNLTRKSKDKK---ERVSDPAVKS 117

Query: 567 SFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQEN 388
           S  A+    +     + T   D GISY  + GP   LL+QNAQA NQIS N++   +QEN
Sbjct: 118 SNFAARSNVSPYAPPMITMDNDDGISYTAIGGPTGELLEQNAQALNQISTNLSAFQVQEN 177

Query: 387 IGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSAL 244
           I L  +TRDNI KI++ L++ P  MKQM PLP+ +NEELA+SILPR+AL
Sbjct: 178 INLFCQTRDNILKIVNELNDSPEVMKQMPPLPVKVNEELANSILPRTAL 226


>EOY04786.1 Uncharacterized protein TCM_019960 [Theobroma cacao]
          Length = 240

 Score =  182 bits (462), Expect = 9e-52
 Identities = 109/242 (45%), Positives = 145/242 (59%), Gaps = 14/242 (5%)
 Frame = -1

Query: 918 MTNPSG---VEKNH-------GSVSAYNATPTRS---LLFNQGISADWTQEEQAILEEGL 778
           M NP G    E NH       G++S  +  P  S   +  N GI+ DWT EEQAIL+EGL
Sbjct: 1   MANPPGNHQQEANHASSSFNGGNLSNGSTIPDSSGSGMKHNPGIALDWTLEEQAILDEGL 60

Query: 777 IQYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTM 598
            ++ASE  I RYAKI + L+NKT+RDVALRC WM           E  +AR++K+ K   
Sbjct: 61  KKFASESSIIRYAKIAMQLQNKTVRDVALRCRWMTKKENSKRRKEEHNLARKSKDKK--- 117

Query: 597 EKAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISM 418
           E+   PS  P+  A+          +     D GI Y  + G    LL+QNAQAFNQIS 
Sbjct: 118 ERVADPSTKPAHFAARPNVPPYAPPMIPMDYDDGIPYKAIGGATGELLEQNAQAFNQISA 177

Query: 417 NIATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSALQ 241
           N+A   +QEN+GLL +TRDNI+KI++ L++MP+ MKQM PLP+ +N+ELA +ILP S   
Sbjct: 178 NLAAFQIQENVGLLCQTRDNIFKIMNDLNDMPDIMKQMPPLPVKVNDELAGTILPPSTHM 237

Query: 240 MK 235
           M+
Sbjct: 238 MQ 239


>XP_017423244.1 PREDICTED: uncharacterized protein LOC108332474 [Vigna angularis]
           KOM44429.1 hypothetical protein LR48_Vigan05g203400
           [Vigna angularis] BAT91754.1 hypothetical protein
           VIGAN_07037700 [Vigna angularis var. angularis]
          Length = 231

 Score =  182 bits (461), Expect = 1e-51
 Identities = 107/229 (46%), Positives = 140/229 (61%), Gaps = 4/229 (1%)
 Frame = -1

Query: 918 MTNPSGVEKNHGSVSAYNATPTRSLL---FNQGISADWTQEEQAILEEGLIQYASEPVIN 748
           M NPSG  + H  VS+     + + L    N GIS DWT EEQAILE+GL +YASE  I 
Sbjct: 1   MANPSGNHQEHTHVSSSAPETSGAALAMKHNPGISLDWTAEEQAILEDGLSKYASESNIV 60

Query: 747 RYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSATP 568
           RYAKI L L++KT+RDVALR  WM           +  + R++K+ K   E+   P+   
Sbjct: 61  RYAKIALQLQHKTVRDVALRVRWMNKKENSKRRKDDHNLTRKSKDKK---ERVSDPAVKS 117

Query: 567 SFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQEN 388
           S  A+    +     + T   D GISY  + GP   LL+QNAQA NQIS N++   +QEN
Sbjct: 118 SNFAARSNVSPYAPPMITMDNDDGISYTAIGGPTGDLLEQNAQALNQISTNLSAFQVQEN 177

Query: 387 IGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSAL 244
           I L  +TRDNI KI++ L++ P  MKQM PLP+ +NEELA+SILPR+AL
Sbjct: 178 INLFCQTRDNILKIVNELNDSPEVMKQMPPLPVKVNEELANSILPRTAL 226


>XP_016699093.1 PREDICTED: uncharacterized protein LOC107914640 [Gossypium
           hirsutum]
          Length = 239

 Score =  181 bits (460), Expect = 2e-51
 Identities = 108/240 (45%), Positives = 143/240 (59%), Gaps = 13/240 (5%)
 Frame = -1

Query: 918 MTNPSGVEKNHGS-VSAYNA-----------TPTRSLLFNQGISADWTQEEQAILEEGLI 775
           M NP G  +   +  S++N            T    +  N GIS DWT EEQAIL++GL 
Sbjct: 1   MANPPGNHQQEANQASSFNGAHLNNGNPVPETSGSGMKHNPGISLDWTLEEQAILDDGLK 60

Query: 774 QYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTME 595
           +YASEP I RYAKI L L+NKT+RDVALRC WM           E  +AR++K+ K   E
Sbjct: 61  KYASEPSIIRYAKIALQLQNKTVRDVALRCRWMTKKENSKRRKEEHNIARKSKDKK---E 117

Query: 594 KAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMN 415
           +   P+A P+  A+    +     +     D GI Y  + G    LL+QNA AFNQIS N
Sbjct: 118 RVADPTAKPTQFAARPNLSPYAPPMIPMDYDDGIPYRAIGGVTGELLEQNAHAFNQISAN 177

Query: 414 IATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSALQM 238
           +A   +QENIGLL +TRDNI KI++ L+++P+ MKQM  LP+ LN+ELAS+ILP S+  M
Sbjct: 178 LAAFQIQENIGLLCQTRDNILKIMNELNDIPDVMKQMQVLPVKLNDELASTILPPSSHPM 237


>KHN33288.1 hypothetical protein glysoja_005330 [Glycine soja]
          Length = 231

 Score =  181 bits (458), Expect = 3e-51
 Identities = 106/230 (46%), Positives = 139/230 (60%), Gaps = 5/230 (2%)
 Frame = -1

Query: 918 MTNPSGVEKNHGSVSAYNATPTR----SLLFNQGISADWTQEEQAILEEGLIQYASEPVI 751
           M NPSG  + H  V + +A  T     ++  N GIS DWT EEQAILE+GL +YASE  I
Sbjct: 1   MANPSGNHQEHTHVVSSSAPETSGAALAMKHNPGISLDWTAEEQAILEDGLSKYASESNI 60

Query: 750 NRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSAT 571
            RYAKI L L++KT+RDVALR  WM           +  + R++K+ K   E+   P+  
Sbjct: 61  VRYAKIALQLQHKTVRDVALRVRWMNKKENSKRRKDDHNLTRKSKDKK---ERVSDPAVK 117

Query: 570 PSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQE 391
            S   +    +     + T   D GISY  + GP   LL+QNAQA NQIS N++   +QE
Sbjct: 118 SSNFTARSNVSPYAPPMITMDNDDGISYTAIGGPTGDLLEQNAQALNQISTNLSAFQVQE 177

Query: 390 NIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSAL 244
           NI L  +TRDNI KI++ L++ P  MKQM PLP+ +NEELASSILPR+ L
Sbjct: 178 NINLFCQTRDNILKIMNELNDSPEVMKQMPPLPVKVNEELASSILPRTNL 227


>XP_017974955.1 PREDICTED: uncharacterized protein LOC18602424 isoform X1
           [Theobroma cacao] XP_007033860.2 PREDICTED:
           uncharacterized protein LOC18602424 isoform X1
           [Theobroma cacao]
          Length = 240

 Score =  181 bits (458), Expect = 4e-51
 Identities = 108/242 (44%), Positives = 145/242 (59%), Gaps = 14/242 (5%)
 Frame = -1

Query: 918 MTNPSG---VEKNH-------GSVSAYNATPTRS---LLFNQGISADWTQEEQAILEEGL 778
           M NP G    E NH       G++S  +  P  S   +  N GI+ DWT EEQAIL+EGL
Sbjct: 1   MANPPGNHQQEANHASSSFNGGNLSNGSTIPDSSGSGMKHNPGIALDWTLEEQAILDEGL 60

Query: 777 IQYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTM 598
            ++ASE  I RYAKI + L+NKT+RDVALRC WM           E  +AR++K+ K   
Sbjct: 61  KKFASESSIIRYAKIAMQLQNKTVRDVALRCRWMTKKENSKRRKEEHNLARKSKDKK--- 117

Query: 597 EKAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISM 418
           E+   PS  P+  A+          +     D GI Y  + G    LL++NAQAFNQIS 
Sbjct: 118 ERVADPSTKPAHFAARPNVPPYAPPMIPMDYDDGIPYKAIGGATGELLERNAQAFNQISA 177

Query: 417 NIATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSALQ 241
           N+A   +QEN+GLL +TRDNI+KI++ L++MP+ MKQM PLP+ +N+ELA +ILP S   
Sbjct: 178 NLAAFQIQENVGLLCQTRDNIFKIMNDLNDMPDIMKQMPPLPVKVNDELAGTILPPSTHM 237

Query: 240 MK 235
           M+
Sbjct: 238 MQ 239


>XP_003546819.1 PREDICTED: uncharacterized protein LOC100801419 [Glycine max]
           KRH10100.1 hypothetical protein GLYMA_15G028300 [Glycine
           max]
          Length = 231

 Score =  180 bits (457), Expect = 4e-51
 Identities = 106/230 (46%), Positives = 138/230 (60%), Gaps = 5/230 (2%)
 Frame = -1

Query: 918 MTNPSGVEKNHGSVSAYNATPTR----SLLFNQGISADWTQEEQAILEEGLIQYASEPVI 751
           M NPSG  + H  V + +A  T     ++  N GIS DWT EEQAILE+GL +YASE  I
Sbjct: 1   MANPSGNHQEHTHVVSSSAPETSGAALAMKHNPGISLDWTAEEQAILEDGLSKYASESNI 60

Query: 750 NRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSAT 571
            RYAKI L L+ KT+RDVALR  WM           +  + R++K+ K   E+   P+  
Sbjct: 61  VRYAKIALQLQQKTVRDVALRVRWMNKKENSKRRKDDHNLTRKSKDKK---ERVSDPAVK 117

Query: 570 PSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQE 391
            S   +    +     + T   D GISY  + GP   LL+QNAQA NQIS N++   +QE
Sbjct: 118 SSNFTARSNVSPYAPPMITMDNDDGISYTAIGGPTGDLLEQNAQALNQISTNLSAFQVQE 177

Query: 390 NIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSAL 244
           NI L  +TRDNI KI++ L++ P  MKQM PLP+ +NEELASSILPR+ L
Sbjct: 178 NINLFCQTRDNILKIMNELNDSPEVMKQMPPLPVKVNEELASSILPRTNL 227


>XP_012455983.1 PREDICTED: uncharacterized protein LOC105777320 [Gossypium
           raimondii] KJB71725.1 hypothetical protein
           B456_011G138900 [Gossypium raimondii]
          Length = 239

 Score =  180 bits (457), Expect = 5e-51
 Identities = 107/240 (44%), Positives = 143/240 (59%), Gaps = 13/240 (5%)
 Frame = -1

Query: 918 MTNPSGVEKNHGS-VSAYNA-----------TPTRSLLFNQGISADWTQEEQAILEEGLI 775
           M NP G  +   +  S++N            T    +  N GIS DWT EEQAIL++GL 
Sbjct: 1   MANPPGNHQQEANQASSFNGAHLNNGNPVPETSGSGMKHNPGISLDWTLEEQAILDDGLK 60

Query: 774 QYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTME 595
           +YASEP I RYAKI L L+NKT+RDVALRC WM           E  +AR++K+ K   E
Sbjct: 61  KYASEPSIIRYAKIALQLQNKTVRDVALRCRWMTKKENSKRRKEEHNIARKSKDKK---E 117

Query: 594 KAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMN 415
           +   P+A P+  A+    +     +     D GI Y  + G    LL+QNA AFNQIS N
Sbjct: 118 RVADPTAKPTQFAARPNLSPYAPPMIPMDYDDGIPYRAIGGVTGELLEQNAHAFNQISAN 177

Query: 414 IATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSALQM 238
           +A   +QENIGLL +TRDNI KI++ L+++P+ MKQM  LP+ LN+ELA++ILP S+  M
Sbjct: 178 LAAFQIQENIGLLCQTRDNILKIMNELNDIPDVMKQMQVLPVKLNDELANTILPPSSHPM 237


>XP_016677700.1 PREDICTED: uncharacterized protein LOC107896910 [Gossypium
           hirsutum] XP_017647133.1 PREDICTED: uncharacterized
           protein LOC108487332 [Gossypium arboreum] KHG27629.1
           Histone H2A deubiquitinase MYSM1 [Gossypium arboreum]
          Length = 239

 Score =  179 bits (455), Expect = 1e-50
 Identities = 107/240 (44%), Positives = 142/240 (59%), Gaps = 13/240 (5%)
 Frame = -1

Query: 918 MTNPSGVEKNHGS-VSAYNA-----------TPTRSLLFNQGISADWTQEEQAILEEGLI 775
           M NP G  +   +  S++N            T    +  N GIS DWT EEQAIL++GL 
Sbjct: 1   MANPPGNHQQEANQASSFNGAHLNNGNPVPETSGSGMKHNPGISLDWTLEEQAILDDGLK 60

Query: 774 QYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTME 595
           +YASEP I RYAKI L L+NKT+RDVALRC WM           E  +AR++K+ K   E
Sbjct: 61  KYASEPSIIRYAKIALQLQNKTVRDVALRCRWMTKKENSKRRKEEHNIARKSKDKK---E 117

Query: 594 KAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMN 415
           +   P+A P+  A+          +     D GI Y  + G    LL+QNA AFNQIS N
Sbjct: 118 RVADPTAKPTQFAARPSLPPYAPPMIPMDYDDGIPYRAIGGVTGELLEQNAHAFNQISAN 177

Query: 414 IATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSALQM 238
           +A   +QENIGLL +TRDNI KI++ L+++P+ MKQM  LP+ LN+ELA++ILP S+  M
Sbjct: 178 LAAFQIQENIGLLCQTRDNILKIMNELNDIPDIMKQMQVLPVKLNDELANTILPPSSHPM 237


>OMO90318.1 hypothetical protein CCACVL1_07407 [Corchorus capsularis]
          Length = 256

 Score =  180 bits (456), Expect = 1e-50
 Identities = 106/236 (44%), Positives = 142/236 (60%), Gaps = 15/236 (6%)
 Frame = -1

Query: 918 MTNPSG---VEKNHGSVSAYNA-----------TPTRSLLFNQGISADWTQEEQAILEEG 781
           M NP G    E NH S S++NA           T    +  N GIS DWT EEQAILE+G
Sbjct: 1   MANPPGNHQQEPNHAS-SSFNAANLSNSNPIPETSVSGMKHNPGISVDWTLEEQAILEDG 59

Query: 780 LIQYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVT 601
           L  + SEP I RYAKI + L+NKT+RDVALRC WM           E   AR++K+ K  
Sbjct: 60  LKTFGSEPNITRYAKIAMQLQNKTVRDVALRCRWMNKKENSKRRKEEHNSARKSKDKK-- 117

Query: 600 MEKAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQIS 421
            E+   PS+  +  AS    +     + +   + GI Y  + GP   LL+QN QAF QIS
Sbjct: 118 -ERVADPSSNLAQFASRPNISPYAPPMISMDYEDGIPYKAIGGPTGELLEQNVQAFTQIS 176

Query: 420 MNIATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILP 256
            N+A+  +QEN+GLL +TRDNI +I+ +L++MP+ M+QM PLP+ +NEELA++ILP
Sbjct: 177 ANLASFQIQENLGLLCQTRDNILRIMKNLNDMPDIMRQMPPLPVKVNEELANTILP 232


>XP_011091621.1 PREDICTED: uncharacterized protein LOC105172002 isoform X2 [Sesamum
           indicum]
          Length = 235

 Score =  179 bits (453), Expect = 2e-50
 Identities = 111/228 (48%), Positives = 140/228 (61%), Gaps = 7/228 (3%)
 Frame = -1

Query: 897 EKNHGSVSAYNATPTR------SLLFNQGISADWTQEEQAILEEGLIQYASEPVINRYAK 736
           ++ HG  S YN           +    Q IS DWT EEQ ILEEGL +YASE  I RYAK
Sbjct: 14  QQGHGPASPYNGNSVHGEWGMPTFQHQQTISMDWTPEEQTILEEGLAKYASESNIIRYAK 73

Query: 735 IGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSATPSFIA 556
           I + L+NKT+RDVALRC WM           +   AR+ K+ K   EK +   A PS ++
Sbjct: 74  IAVQLKNKTVRDVALRCRWMTKKEISKRRKEDFS-ARKCKDRK---EKVVDSLAKPSRLS 129

Query: 555 SHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQENIGLL 376
              GF+    G+ +N  D  ISYN +AG  R LLQQNA AF QIS N+AT  + ENIGLL
Sbjct: 130 IQSGFSHAP-GMVSNCNDDTISYNDIAGITRQLLQQNAWAFKQISANLATHQIHENIGLL 188

Query: 375 FRTRDNIYKILSSLDEM-PNMKQMHPLPITLNEELASSILPRSALQMK 235
            + RDNI KIL++L++M P MK+M PLP  LNEELA+SILP +  Q++
Sbjct: 189 SQARDNILKILTNLNDMGPTMKKMPPLP-KLNEELANSILPPTTFQIQ 235


>XP_015583637.1 PREDICTED: uncharacterized protein LOC8284759 [Ricinus communis]
          Length = 242

 Score =  178 bits (451), Expect = 4e-50
 Identities = 111/247 (44%), Positives = 145/247 (58%), Gaps = 19/247 (7%)
 Frame = -1

Query: 918 MTNPSGV----EKNHGSVSAYNA--------------TPTRSLLFNQGISADWTQEEQAI 793
           M NPSGV    E NH S S++N               T   +L  N GIS DWT EEQAI
Sbjct: 1   MANPSGVHHQQEGNHAS-SSFNGGNIPTNGHGNSGPETSGTNLKHNPGISTDWTLEEQAI 59

Query: 792 LEEGLIQYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKE 613
           LE+ L QYA+E  + RYAKI + L+NKT+RDVALRC WM T         E  +AR++K+
Sbjct: 60  LEDALNQYAAESSVIRYAKIAVQLQNKTVRDVALRCRWM-TKKEYSKRRKEDSLARKSKD 118

Query: 612 IKVTMEKAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAF 433
            K   E+   PS   S   +          +   + D G++YNG+ G    LL QNA+A 
Sbjct: 119 KK---ERVTDPSVKASRFMARSNVHPYATSMIPMEYDDGMAYNGIDGVTGELLDQNAKAL 175

Query: 432 NQISMNIATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILP 256
           + IS N++T  +QENI LL +TRDNI KI++ +++MP  M+QM PLP+ LNEELA +ILP
Sbjct: 176 DHISANLSTMQLQENISLLCQTRDNILKIMNEMNDMPELMEQMPPLPVKLNEELADTILP 235

Query: 255 RSALQMK 235
           R  L MK
Sbjct: 236 RPNLPMK 242


>XP_003543574.1 PREDICTED: uncharacterized protein LOC100819879 [Glycine max]
           KHN30087.1 hypothetical protein glysoja_010465 [Glycine
           soja] KRH23232.1 hypothetical protein GLYMA_13G346000
           [Glycine max]
          Length = 232

 Score =  177 bits (449), Expect = 6e-50
 Identities = 104/230 (45%), Positives = 137/230 (59%), Gaps = 5/230 (2%)
 Frame = -1

Query: 918 MTNPSGVEKNHGSVSAYNATPTR----SLLFNQGISADWTQEEQAILEEGLIQYASEPVI 751
           M NPSG  + H  V + +A  T     ++  N GIS DWT EEQAILE+GL +YASE  I
Sbjct: 2   MANPSGNHQEHTHVVSSSAPETSGAALAMKHNPGISLDWTAEEQAILEDGLSKYASESNI 61

Query: 750 NRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSAT 571
            RYAKI L L+ KT+RDVALR  WM           +  + R++K+ K   E+   P+  
Sbjct: 62  VRYAKIALQLQQKTVRDVALRVRWMNKKENSKRRKDDHNLTRKSKDKK---ERVSDPAVK 118

Query: 570 PSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQE 391
            S   +    +     +     D GISY  + GP   LL+QNAQA NQIS N++   +QE
Sbjct: 119 SSNFVARSNVSPYAPPMIAMDNDDGISYTAIGGPTGDLLEQNAQALNQISTNLSAFQVQE 178

Query: 390 NIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSAL 244
           NI L  +TRDNI KI++ L++ P  MKQM PLP+ +NEELA+SILPR+ L
Sbjct: 179 NINLFCQTRDNILKIMNELNDSPEVMKQMPPLPVKVNEELANSILPRTNL 228


>XP_011466862.1 PREDICTED: uncharacterized protein LOC101313817 [Fragaria vesca
           subsp. vesca] XP_011466863.1 PREDICTED: uncharacterized
           protein LOC101313817 [Fragaria vesca subsp. vesca]
          Length = 239

 Score =  176 bits (446), Expect = 2e-49
 Identities = 107/242 (44%), Positives = 141/242 (58%), Gaps = 14/242 (5%)
 Frame = -1

Query: 918 MTNPSG--VEKNHGSVSAY----NATPTRS-------LLFNQGISADWTQEEQAILEEGL 778
           M NPSG   E +H S S      N+TP  +       +  N GIS DWT EEQ IL++GL
Sbjct: 1   MANPSGNHQEPSHASSSFNPTNGNSTPVSAPPESSGAMKHNPGISMDWTAEEQVILDDGL 60

Query: 777 IQYASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTM 598
             YA+E  I RYAKI + L+NKT+RDVALRC WM           EL +AR++K+ K   
Sbjct: 61  ANYATEANIIRYAKIAMQLQNKTVRDVALRCRWMTKKENSKRRKEELNLARKSKDKK--- 117

Query: 597 EKAISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISM 418
           E+ +  SA PS   +    A     + +   D GISY  + G    LL+QNAQA NQIS 
Sbjct: 118 ERVVDTSAKPSHFTTRPNVAPYVPPIISMDNDDGISYKAIGGVTGELLEQNAQALNQISA 177

Query: 417 NIATQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSALQ 241
           N+    +QENI L  + RDNI KI++ L++MP+ MKQM PLP+ +NEEL + +LP  A  
Sbjct: 178 NLQAFQIQENINLFCQARDNILKIMNDLNDMPDVMKQMPPLPVKVNEELVNHVLPPPAHP 237

Query: 240 MK 235
           M+
Sbjct: 238 MQ 239


>XP_004486880.1 PREDICTED: uncharacterized protein LOC101498979 [Cicer arietinum]
          Length = 237

 Score =  175 bits (443), Expect = 5e-49
 Identities = 105/236 (44%), Positives = 140/236 (59%), Gaps = 11/236 (4%)
 Frame = -1

Query: 918 MTNPSGVEKNHGSVSAY----NATPTRS------LLFNQGISADWTQEEQAILEEGLIQY 769
           M NPSG   +   V+      N+ P  S      +  N GIS DWT EEQA LE GL +Y
Sbjct: 1   MANPSGTGNHQEQVNPSSFNGNSVPESSSGLAMNMKHNPGISLDWTPEEQATLENGLSKY 60

Query: 768 ASEPVINRYAKIGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKA 589
           A+E  I RYAKI L L+NKT+RDVALR  WM           +  ++R++K+ K   E+ 
Sbjct: 61  ATESNIVRYAKIALQLQNKTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKK---ERV 117

Query: 588 ISPSATPSFIASHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIA 409
             P+A  S  A+          + T   D GISY  + GP   LL+QNAQA +QIS N++
Sbjct: 118 SDPAAKSSHFAARPNVPPYAPPMITMDNDDGISYAAIGGPTGELLEQNAQALSQISANLS 177

Query: 408 TQHMQENIGLLFRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRSAL 244
           +  +QENI LL +TRDNI +I++ L++ P  MKQM PLP+ +NEELA+SILPR+ L
Sbjct: 178 SLQIQENINLLCQTRDNILRIMNELNDSPEVMKQMPPLPVKMNEELANSILPRTPL 233


>XP_019053511.1 PREDICTED: uncharacterized protein LOC104598244 isoform X3 [Nelumbo
           nucifera]
          Length = 255

 Score =  175 bits (444), Expect = 7e-49
 Identities = 102/223 (45%), Positives = 139/223 (62%), Gaps = 2/223 (0%)
 Frame = -1

Query: 912 NPSGVEKNHGSVSAYNATPTRSLLFNQGISADWTQEEQAILEEGLIQYASEPVINRYAKI 733
           NPS    N G+V   N+ P  +L  N G+S++WT EEQ+IL+EGL +YASE +I RYAKI
Sbjct: 32  NPSN--GNSGAVPE-NSAPANALKHNPGLSSEWTAEEQSILDEGLSKYASESIIVRYAKI 88

Query: 732 GLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSA-TPSFIA 556
            + L++KT+RDVALRC WM           +  + R++K+ K   EK +  S  + S +A
Sbjct: 89  AMQLQDKTVRDVALRCRWMTKKESGKRRKEDHNLTRKSKDKK---EKVVESSVKSSSHLA 145

Query: 555 SHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQENIGLL 376
           +          +     D GISY  + G    LL+QNAQ FNQIS N+A   +QENI L 
Sbjct: 146 ARPNVPPYALPMIAMDNDDGISYRDIGGTTGQLLEQNAQFFNQISANLANYQLQENINLF 205

Query: 375 FRTRDNIYKILSSLDEMPN-MKQMHPLPITLNEELASSILPRS 250
            + RDNI  IL+ L++MP+ MKQM PLP+ +NEELA+SILPR+
Sbjct: 206 CQARDNILTILNDLNDMPSIMKQMPPLPVKINEELANSILPRT 248


>KZV34811.1 hypothetical protein F511_00713 [Dorcoceras hygrometricum]
          Length = 235

 Score =  174 bits (442), Expect = 7e-49
 Identities = 108/224 (48%), Positives = 138/224 (61%), Gaps = 7/224 (3%)
 Frame = -1

Query: 897 EKNHGSVSAYNATPTR------SLLFNQGISADWTQEEQAILEEGLIQYASEPVINRYAK 736
           ++ H SVS YN +         +    Q IS +WT EEQ +LEEGL +YA E  I RYAK
Sbjct: 14  QQGHSSVSPYNGSSVSGEWGVPTFQHQQTISMEWTPEEQTMLEEGLAKYALESNIIRYAK 73

Query: 735 IGLLLENKTIRDVALRCTWMRTXXXXXXXXXELGVARRNKEIKVTMEKAISPSATPSFIA 556
           I + L NKT+RDVALRC WM           +   AR++KE K   EK +  SA PS  A
Sbjct: 74  IAVQLNNKTVRDVALRCRWMTKKEITKRKKDDFS-ARKSKERK---EKVVDSSAKPSRFA 129

Query: 555 SHQGFAACTQGLHTNKIDVGISYNGVAGPVRHLLQQNAQAFNQISMNIATQHMQENIGLL 376
              GF+  + G+ ++  D GISYN V G  R L+QQN+ AF QIS N+A   + ENIGLL
Sbjct: 130 IQSGFSHAS-GMVSDSYDDGISYNDVTGATRQLIQQNSWAFKQISSNLAAHQIHENIGLL 188

Query: 375 FRTRDNIYKILSSLDEMP-NMKQMHPLPITLNEELASSILPRSA 247
            + R+NIYKIL++L+ M   MK+M PLP  +NEELAS+ILP SA
Sbjct: 189 NQARENIYKILNNLNGMGLTMKKMPPLP-KVNEELASTILPPSA 231

