BLASTX nr result

ID: Rehmannia27_contig00049119 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00049119
         (639 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011081952.1| PREDICTED: DNA cross-link repair protein SNM...   293   3e-92
ref|XP_012855786.1| PREDICTED: DNA cross-link repair 1A protein ...   200   1e-56
ref|XP_009794231.1| PREDICTED: uncharacterized protein LOC104241...   163   1e-44
ref|XP_009588874.1| PREDICTED: uncharacterized protein LOC104086...   153   1e-39
ref|XP_015083439.1| PREDICTED: uncharacterized protein LOC107026...   141   3e-35
ref|XP_004245204.1| PREDICTED: uncharacterized protein LOC101266...   140   7e-35
ref|XP_006361524.1| PREDICTED: uncharacterized protein LOC102596...   139   9e-35
ref|XP_007012472.1| Sterile alpha motif domain-containing protei...   127   1e-30
ref|XP_007012473.1| Sterile alpha motif domain-containing protei...   127   1e-30
ref|XP_007012470.1| Sterile alpha motif domain-containing protei...   127   1e-30
ref|XP_007012471.1| Sterile alpha motif domain-containing protei...   127   1e-30
ref|XP_007012469.1| Sterile alpha motif domain-containing protei...   127   1e-30
ref|XP_007012468.1| Sterile alpha motif domain-containing protei...   127   1e-30
emb|CDP17885.1| unnamed protein product [Coffea canephora]            127   2e-30
gb|KDO73678.1| hypothetical protein CISIN_1g0048772mg, partial [...   118   3e-29
ref|XP_002309453.1| sterile alpha motif domain-containing family...   121   3e-28
ref|XP_006474528.1| PREDICTED: DNA cross-link repair 1A protein ...   119   9e-28
ref|XP_012451377.1| PREDICTED: DNA cross-link repair protein SNM...   116   1e-26
ref|XP_012451376.1| PREDICTED: DNA cross-link repair protein SNM...   116   1e-26
ref|XP_010648406.1| PREDICTED: uncharacterized protein LOC100256...   116   1e-26

>ref|XP_011081952.1| PREDICTED: DNA cross-link repair protein SNM1 [Sesamum indicum]
          Length = 712

 Score =  293 bits (750), Expect = 3e-92
 Identities = 149/214 (69%), Positives = 164/214 (76%), Gaps = 1/214 (0%)
 Frame = -1

Query: 639 PPRLKPHSSTTLRPSEKLKKQKPINPGKENRLFHETEEADLDCGLDSIEPTLDLLNPKGD 460
           PPRLK H+S TL P + LK +K  NPGKEN  F ETE +DL CGLDSIEPTLDLLNPKG 
Sbjct: 51  PPRLKLHNSNTLCPPKNLKNKKSNNPGKENCFFDETE-SDLGCGLDSIEPTLDLLNPKGI 109

Query: 459 SDYSHSKKSAESKLFKPCGEEREKLCDEE-FYEGSSQLDVLLKLCADVDEQGNNNAMDDS 283
            DY  +  S ES+L K  GEE    CDEE F EGS+Q DVLLKLCA+VDE GN +  DDS
Sbjct: 110 GDYLRNSYSIESRLLKHRGEEEANACDEELFEEGSTQFDVLLKLCAEVDEPGNASYRDDS 169

Query: 282 EGKCVVFICCPICGADISGLSDDLRQIHTNECLDLVEGPTEVAATSDGRGAHQCPGQVLD 103
           EGKC V ICCP+CGADISGL DDLRQIHTNECLD +EG T+VA   D  G +QCPGQVLD
Sbjct: 170 EGKCDVSICCPLCGADISGLRDDLRQIHTNECLDKLEGSTDVAVRDDELGTYQCPGQVLD 229

Query: 102 DSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFVR 1
            SP KS K+ VD SPVVEWLRNLGLAKYEEIF+R
Sbjct: 230 GSPHKSVKEAVDASPVVEWLRNLGLAKYEEIFIR 263


>ref|XP_012855786.1| PREDICTED: DNA cross-link repair 1A protein [Erythranthe guttata]
          Length = 749

 Score =  200 bits (508), Expect = 1e-56
 Identities = 127/269 (47%), Positives = 144/269 (53%), Gaps = 57/269 (21%)
 Frame = -1

Query: 639 PPRLKPHSSTTLRPSEKLKKQKPINPGKENRLFHETEEADLDCGLDSIEPTLDLLNPKGD 460
           PPRLKP SSTTLR S++ K+ KP+NPGKEN L     E     GL SIEPTLD L+PKG 
Sbjct: 47  PPRLKPRSSTTLRQSKRPKRGKPVNPGKENCLLFNEIEGAFVGGLKSIEPTLDWLSPKGV 106

Query: 459 SDYSHSKKSAESKL---------------------------------------------- 418
            D   S  S ESKL                                              
Sbjct: 107 CDNLQSNNSIESKLLDPFIQGEEEEIDLESIEPNLQLFNPKGVIDYLRCNNSVESRLLQP 166

Query: 417 FKPCGEE---------REKLCDEEFYEGSSQLDVLLKLCADVDEQGNNNAMDDSEGKCVV 265
           F+P  EE          EK+ DEEF E SSQLD LLKLC +VD + N+N           
Sbjct: 167 FRPEEEEDEVVVVEEGEEKIFDEEFSERSSQLDALLKLCEEVDVESNSN----------- 215

Query: 264 FICCPICGADISGLSDDLRQIHTNECLDLVEGPTEVAA--TSDGRGAHQCPGQVLDDSPI 91
                +CGADISGLSDD RQIHTNECLD VEG   VA   ++D    HQ PG V+D SP+
Sbjct: 216 -----VCGADISGLSDDQRQIHTNECLDSVEGSANVAVAVSNDDTRTHQGPGHVVDGSPL 270

Query: 90  KSAKKVVDVSPVVEWLRNLGLAKYEEIFV 4
           KSA K  ++S VVEWLRNLGLAKYEEIFV
Sbjct: 271 KSATKAGNLSSVVEWLRNLGLAKYEEIFV 299


>ref|XP_009794231.1| PREDICTED: uncharacterized protein LOC104241024 [Nicotiana
           sylvestris]
          Length = 467

 Score =  163 bits (412), Expect = 1e-44
 Identities = 99/216 (45%), Positives = 137/216 (63%), Gaps = 9/216 (4%)
 Frame = -1

Query: 621 HSSTTLRPSEKLKKQKPINPGKENRLFHETEEADLDCGLDSIEPTLDLLN--PKGDSDYS 448
           H     RP++K K+Q  I+   E+    E E+ DL  GLDSIE T+D  +   + +++  
Sbjct: 100 HRLDNSRPTKKPKQQPLISEKSES----EFEDLDLCHGLDSIESTIDCCSRAQRTENEKE 155

Query: 447 HSK----KSAESKLFKPCGEEREKLCDEEFYEGSSQLDVLLKLCADVDEQGN---NNAMD 289
             K    KS E++L    G   E+   +E  E  S+LD+LLKLC + +++G+   +  ++
Sbjct: 156 LKKGYLFKSIEARLLNSDGGFEER---KEESEECSELDLLLKLCGEEEDEGDGVESFGLE 212

Query: 288 DSEGKCVVFICCPICGADISGLSDDLRQIHTNECLDLVEGPTEVAATSDGRGAHQCPGQV 109
           D  G     +CCP+CGADIS LS D+R++HTNECLD  E P  V  T++   + QCPGQV
Sbjct: 213 DEYG----LLCCPLCGADISDLSGDMREVHTNECLDNEETPAHVV-TANNDVSFQCPGQV 267

Query: 108 LDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFVR 1
           L+DSP +S K+V+ VSPVVEWLRNLGLAKYEEIFVR
Sbjct: 268 LNDSPCQSPKEVIRVSPVVEWLRNLGLAKYEEIFVR 303


>ref|XP_009588874.1| PREDICTED: uncharacterized protein LOC104086333 [Nicotiana
           tomentosiformis]
          Length = 744

 Score =  153 bits (387), Expect = 1e-39
 Identities = 98/216 (45%), Positives = 133/216 (61%), Gaps = 9/216 (4%)
 Frame = -1

Query: 621 HSSTTLRPSEKLKKQKPINPGKENRLFHETEEADLDCGLDSIEPTLDLLN--PKGDSDYS 448
           H   + RP++K  KQ+P+   K    F   E+ DL  GLDSIE T+D  +   + +++  
Sbjct: 98  HKLDSYRPTKK-PKQQPLISEKSKSGF---EDLDLCHGLDSIESTIDCCSRTQRTENEEE 153

Query: 447 HSK----KSAESKLFKPCGEEREKLCDEEFYEGSSQLDVLLKLCADVDEQGNNN---AMD 289
             K    KS E++L        E+   +E  E  S+LD+LLKLC + +++G+      + 
Sbjct: 154 LKKGYLFKSIEARLLNSNDGLEER---KEELEECSELDLLLKLCGEEEDEGDGVECFGLG 210

Query: 288 DSEGKCVVFICCPICGADISGLSDDLRQIHTNECLDLVEGPTEVAATSDGRGAHQCPGQV 109
           D  G     ICCP+CGADIS LS D+R++HTNECLD  E P  V  T++   + QCPGQV
Sbjct: 211 DEYG----LICCPLCGADISDLSGDMREVHTNECLDNEETPAHVV-TANNDVSVQCPGQV 265

Query: 108 LDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFVR 1
           L+DSP +S K+VV V PVVEWL+NLGLAKYEEIFVR
Sbjct: 266 LNDSPRQSPKEVVRVLPVVEWLQNLGLAKYEEIFVR 301


>ref|XP_015083439.1| PREDICTED: uncharacterized protein LOC107026855 [Solanum pennellii]
          Length = 770

 Score =  141 bits (355), Expect = 3e-35
 Identities = 89/191 (46%), Positives = 117/191 (61%), Gaps = 14/191 (7%)
 Frame = -1

Query: 531 EEADLDCGLDSIEPTLDLLNP------KGDSDYSHSKKSAESKLFKPCGEEREKLCDEEF 370
           E+ DL  GLD+IE T+D  +       + +    +  KS E++L    G   E+   EE 
Sbjct: 135 EDLDLGHGLDNIESTIDCCSGVQRATNEEELKRGYLFKSIEARLLNSNGGFEER--KEEE 192

Query: 369 YEGSSQLDVLLKLCADVDE------QGNNNAMDDSEG--KCVVFICCPICGADISGLSDD 214
            E  S+LD+LLKLC + DE        + +  ++  G  K    ICCP+CGADIS LS +
Sbjct: 193 SEECSELDLLLKLCGEEDEVYCDALTADPHRQEECLGLDKEYGLICCPLCGADISDLSGE 252

Query: 213 LRQIHTNECLDLVEGPTEVAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNL 34
           +R +HTNECLD  E P +V  T++   + QCPGQVL+DSP    K+VV +SPVVEWLRNL
Sbjct: 253 MRLVHTNECLDKDETPADV-VTANNDVSFQCPGQVLNDSP--CPKEVVHMSPVVEWLRNL 309

Query: 33  GLAKYEEIFVR 1
           GLAKYEEIFVR
Sbjct: 310 GLAKYEEIFVR 320


>ref|XP_004245204.1| PREDICTED: uncharacterized protein LOC101266356 [Solanum
           lycopersicum]
          Length = 770

 Score =  140 bits (352), Expect = 7e-35
 Identities = 99/246 (40%), Positives = 133/246 (54%), Gaps = 39/246 (15%)
 Frame = -1

Query: 621 HSSTTLRPSEKLKKQKPINPGKE-----------------NRLFHETE----EADLDCGL 505
           H   + RP++K  KQ P++  K+                 N   H++E    + DL  GL
Sbjct: 85  HGLDSSRPTKK-PKQHPVSVEKDSLAPVVFEKSDENGKRLNSAHHKSESDFEDLDLGHGL 143

Query: 504 DSIEPTLDLLNP------KGDSDYSHSKKSAESKLFKPCGEEREKLCDEEFYEGSSQLDV 343
           D+IE T+D  +       + +    +  KS E++L    G   E+   EE  E  S+LD+
Sbjct: 144 DNIESTIDCCSGVKRATNEEELKRGYLFKSIEARLLNSNGGLEER--KEEESEECSELDL 201

Query: 342 LLKLCADVDE------------QGNNNAMDDSEGKCVVFICCPICGADISGLSDDLRQIH 199
           LLKLC + DE            Q     +D+  G     ICCP+CGADIS LS ++R +H
Sbjct: 202 LLKLCGEEDEVYCDALTADPHRQEECLELDEEYG----LICCPLCGADISDLSGEMRLVH 257

Query: 198 TNECLDLVEGPTEVAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKY 19
           TNECLD  E P +V  T++   + QCPGQVL+DSP    K+VV +SPVVEWLRNLGL KY
Sbjct: 258 TNECLDKDETPADV-VTANNDVSIQCPGQVLNDSP--CPKEVVHMSPVVEWLRNLGLPKY 314

Query: 18  EEIFVR 1
           EEIFVR
Sbjct: 315 EEIFVR 320


>ref|XP_006361524.1| PREDICTED: uncharacterized protein LOC102596611 [Solanum tuberosum]
          Length = 769

 Score =  139 bits (351), Expect = 9e-35
 Identities = 88/191 (46%), Positives = 116/191 (60%), Gaps = 14/191 (7%)
 Frame = -1

Query: 531 EEADLDCGLDSIEPTLDLLNP------KGDSDYSHSKKSAESKLFKPCGEEREKLCDEEF 370
           E+ DL  GLD+IE T+D  +       + +    +  KS E++L    G   E+   EE 
Sbjct: 135 EDLDLGHGLDNIESTIDCCSGVQRTTNEEELKRGYLFKSIEARLLNSNGAFEER--KEEE 192

Query: 369 YEGSSQLDVLLKLCADVDEQGNNNAMDD--SEGKCVVF------ICCPICGADISGLSDD 214
            E  S+LD+LLKLC + DE   +    D   + +C+        ICCP+CGADIS LS +
Sbjct: 193 PEECSELDLLLKLCGEEDEVYGDALTADLHRQEECLGLDEEYGLICCPLCGADISDLSGE 252

Query: 213 LRQIHTNECLDLVEGPTEVAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNL 34
           +R +HTNECLD  E P  V  T++   + QCPGQVL+DSP    K+VV +SPVVEWL+NL
Sbjct: 253 MRLVHTNECLDKDETPVNV-VTANNDVSFQCPGQVLNDSP--CPKEVVHMSPVVEWLQNL 309

Query: 33  GLAKYEEIFVR 1
           GLAKYEEIFVR
Sbjct: 310 GLAKYEEIFVR 320


>ref|XP_007012472.1| Sterile alpha motif domain-containing protein isoform 5, partial
           [Theobroma cacao] gi|508782835|gb|EOY30091.1| Sterile
           alpha motif domain-containing protein isoform 5, partial
           [Theobroma cacao]
          Length = 680

 Score =  127 bits (320), Expect = 1e-30
 Identities = 91/241 (37%), Positives = 131/241 (54%), Gaps = 31/241 (12%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPINPGKENR------LFHETEEADLD--CGLD----SIEPT 487
           LKP  S T RP  K  K+    PGKEN       +    ++ DLD  C LD    SI  +
Sbjct: 46  LKP--SNTPRPPSKKPKRPDNPPGKENTAVVTIPITRSNDQPDLDETCSLDLIPSSINCS 103

Query: 486 LDLLNPKG-DSDY---SHSKK------------SAESKLFKPCGEEREKLCDEEFYEGSS 355
            +L + +  DSDY      KK            S ES+L +P  E  E+  ++  ++  +
Sbjct: 104 FNLTSAQDRDSDYVKCDEKKKELLELNKGYLCNSIESRLIRPRSELSEEFGED--FDEDN 161

Query: 354 QLDVLLKLCADVDEQGNNNAMDDSEGKCV--VFICCPICGADISGLSDDLRQIHTNECLD 181
           +LD LLKLC DV+E+   ++ D+ E   +    + CP+CG +ISGL+++ R +H N+CLD
Sbjct: 162 ELDALLKLCNDVEEEKEEDSGDEKESNVLDNSLVQCPLCGVNISGLNEEHRLVHINDCLD 221

Query: 180 LVEGPTE-VAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFV 4
            VE P + V          QC  +V+D  P+ S ++VVDVSPVV+WL NLGLA+Y + FV
Sbjct: 222 KVENPGQNVVFPPSVDREFQCVPEVVDGPPL-SPRQVVDVSPVVKWLSNLGLARYADAFV 280

Query: 3   R 1
           R
Sbjct: 281 R 281


>ref|XP_007012473.1| Sterile alpha motif domain-containing protein isoform 6, partial
           [Theobroma cacao] gi|508782836|gb|EOY30092.1| Sterile
           alpha motif domain-containing protein isoform 6, partial
           [Theobroma cacao]
          Length = 686

 Score =  127 bits (320), Expect = 1e-30
 Identities = 91/241 (37%), Positives = 131/241 (54%), Gaps = 31/241 (12%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPINPGKENR------LFHETEEADLD--CGLD----SIEPT 487
           LKP  S T RP  K  K+    PGKEN       +    ++ DLD  C LD    SI  +
Sbjct: 53  LKP--SNTPRPPSKKPKRPDNPPGKENTAVVTIPITRSNDQPDLDETCSLDLIPSSINCS 110

Query: 486 LDLLNPKG-DSDY---SHSKK------------SAESKLFKPCGEEREKLCDEEFYEGSS 355
            +L + +  DSDY      KK            S ES+L +P  E  E+  ++  ++  +
Sbjct: 111 FNLTSAQDRDSDYVKCDEKKKELLELNKGYLCNSIESRLIRPRSELSEEFGED--FDEDN 168

Query: 354 QLDVLLKLCADVDEQGNNNAMDDSEGKCV--VFICCPICGADISGLSDDLRQIHTNECLD 181
           +LD LLKLC DV+E+   ++ D+ E   +    + CP+CG +ISGL+++ R +H N+CLD
Sbjct: 169 ELDALLKLCNDVEEEKEEDSGDEKESNVLDNSLVQCPLCGVNISGLNEEHRLVHINDCLD 228

Query: 180 LVEGPTE-VAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFV 4
            VE P + V          QC  +V+D  P+ S ++VVDVSPVV+WL NLGLA+Y + FV
Sbjct: 229 KVENPGQNVVFPPSVDREFQCVPEVVDGPPL-SPRQVVDVSPVVKWLSNLGLARYADAFV 287

Query: 3   R 1
           R
Sbjct: 288 R 288


>ref|XP_007012470.1| Sterile alpha motif domain-containing protein isoform 3 [Theobroma
           cacao] gi|508782833|gb|EOY30089.1| Sterile alpha motif
           domain-containing protein isoform 3 [Theobroma cacao]
          Length = 703

 Score =  127 bits (320), Expect = 1e-30
 Identities = 91/241 (37%), Positives = 131/241 (54%), Gaps = 31/241 (12%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPINPGKENR------LFHETEEADLD--CGLD----SIEPT 487
           LKP  S T RP  K  K+    PGKEN       +    ++ DLD  C LD    SI  +
Sbjct: 41  LKP--SNTPRPPSKKPKRPDNPPGKENTAVVTIPITRSNDQPDLDETCSLDLIPSSINCS 98

Query: 486 LDLLNPKG-DSDY---SHSKK------------SAESKLFKPCGEEREKLCDEEFYEGSS 355
            +L + +  DSDY      KK            S ES+L +P  E  E+  ++  ++  +
Sbjct: 99  FNLTSAQDRDSDYVKCDEKKKELLELNKGYLCNSIESRLIRPRSELSEEFGED--FDEDN 156

Query: 354 QLDVLLKLCADVDEQGNNNAMDDSEGKCV--VFICCPICGADISGLSDDLRQIHTNECLD 181
           +LD LLKLC DV+E+   ++ D+ E   +    + CP+CG +ISGL+++ R +H N+CLD
Sbjct: 157 ELDALLKLCNDVEEEKEEDSGDEKESNVLDNSLVQCPLCGVNISGLNEEHRLVHINDCLD 216

Query: 180 LVEGPTE-VAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFV 4
            VE P + V          QC  +V+D  P+ S ++VVDVSPVV+WL NLGLA+Y + FV
Sbjct: 217 KVENPGQNVVFPPSVDREFQCVPEVVDGPPL-SPRQVVDVSPVVKWLSNLGLARYADAFV 275

Query: 3   R 1
           R
Sbjct: 276 R 276


>ref|XP_007012471.1| Sterile alpha motif domain-containing protein isoform 4 [Theobroma
           cacao] gi|508782834|gb|EOY30090.1| Sterile alpha motif
           domain-containing protein isoform 4 [Theobroma cacao]
          Length = 727

 Score =  127 bits (320), Expect = 1e-30
 Identities = 91/241 (37%), Positives = 131/241 (54%), Gaps = 31/241 (12%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPINPGKENR------LFHETEEADLD--CGLD----SIEPT 487
           LKP  S T RP  K  K+    PGKEN       +    ++ DLD  C LD    SI  +
Sbjct: 41  LKP--SNTPRPPSKKPKRPDNPPGKENTAVVTIPITRSNDQPDLDETCSLDLIPSSINCS 98

Query: 486 LDLLNPKG-DSDY---SHSKK------------SAESKLFKPCGEEREKLCDEEFYEGSS 355
            +L + +  DSDY      KK            S ES+L +P  E  E+  ++  ++  +
Sbjct: 99  FNLTSAQDRDSDYVKCDEKKKELLELNKGYLCNSIESRLIRPRSELSEEFGED--FDEDN 156

Query: 354 QLDVLLKLCADVDEQGNNNAMDDSEGKCV--VFICCPICGADISGLSDDLRQIHTNECLD 181
           +LD LLKLC DV+E+   ++ D+ E   +    + CP+CG +ISGL+++ R +H N+CLD
Sbjct: 157 ELDALLKLCNDVEEEKEEDSGDEKESNVLDNSLVQCPLCGVNISGLNEEHRLVHINDCLD 216

Query: 180 LVEGPTE-VAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFV 4
            VE P + V          QC  +V+D  P+ S ++VVDVSPVV+WL NLGLA+Y + FV
Sbjct: 217 KVENPGQNVVFPPSVDREFQCVPEVVDGPPL-SPRQVVDVSPVVKWLSNLGLARYADAFV 275

Query: 3   R 1
           R
Sbjct: 276 R 276


>ref|XP_007012469.1| Sterile alpha motif domain-containing protein isoform 2 [Theobroma
           cacao] gi|508782832|gb|EOY30088.1| Sterile alpha motif
           domain-containing protein isoform 2 [Theobroma cacao]
          Length = 745

 Score =  127 bits (320), Expect = 1e-30
 Identities = 91/241 (37%), Positives = 131/241 (54%), Gaps = 31/241 (12%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPINPGKENR------LFHETEEADLD--CGLD----SIEPT 487
           LKP  S T RP  K  K+    PGKEN       +    ++ DLD  C LD    SI  +
Sbjct: 41  LKP--SNTPRPPSKKPKRPDNPPGKENTAVVTIPITRSNDQPDLDETCSLDLIPSSINCS 98

Query: 486 LDLLNPKG-DSDY---SHSKK------------SAESKLFKPCGEEREKLCDEEFYEGSS 355
            +L + +  DSDY      KK            S ES+L +P  E  E+  ++  ++  +
Sbjct: 99  FNLTSAQDRDSDYVKCDEKKKELLELNKGYLCNSIESRLIRPRSELSEEFGED--FDEDN 156

Query: 354 QLDVLLKLCADVDEQGNNNAMDDSEGKCV--VFICCPICGADISGLSDDLRQIHTNECLD 181
           +LD LLKLC DV+E+   ++ D+ E   +    + CP+CG +ISGL+++ R +H N+CLD
Sbjct: 157 ELDALLKLCNDVEEEKEEDSGDEKESNVLDNSLVQCPLCGVNISGLNEEHRLVHINDCLD 216

Query: 180 LVEGPTE-VAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFV 4
            VE P + V          QC  +V+D  P+ S ++VVDVSPVV+WL NLGLA+Y + FV
Sbjct: 217 KVENPGQNVVFPPSVDREFQCVPEVVDGPPL-SPRQVVDVSPVVKWLSNLGLARYADAFV 275

Query: 3   R 1
           R
Sbjct: 276 R 276


>ref|XP_007012468.1| Sterile alpha motif domain-containing protein isoform 1 [Theobroma
           cacao] gi|508782831|gb|EOY30087.1| Sterile alpha motif
           domain-containing protein isoform 1 [Theobroma cacao]
          Length = 838

 Score =  127 bits (320), Expect = 1e-30
 Identities = 91/241 (37%), Positives = 131/241 (54%), Gaps = 31/241 (12%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPINPGKENR------LFHETEEADLD--CGLD----SIEPT 487
           LKP  S T RP  K  K+    PGKEN       +    ++ DLD  C LD    SI  +
Sbjct: 41  LKP--SNTPRPPSKKPKRPDNPPGKENTAVVTIPITRSNDQPDLDETCSLDLIPSSINCS 98

Query: 486 LDLLNPKG-DSDY---SHSKK------------SAESKLFKPCGEEREKLCDEEFYEGSS 355
            +L + +  DSDY      KK            S ES+L +P  E  E+  ++  ++  +
Sbjct: 99  FNLTSAQDRDSDYVKCDEKKKELLELNKGYLCNSIESRLIRPRSELSEEFGED--FDEDN 156

Query: 354 QLDVLLKLCADVDEQGNNNAMDDSEGKCV--VFICCPICGADISGLSDDLRQIHTNECLD 181
           +LD LLKLC DV+E+   ++ D+ E   +    + CP+CG +ISGL+++ R +H N+CLD
Sbjct: 157 ELDALLKLCNDVEEEKEEDSGDEKESNVLDNSLVQCPLCGVNISGLNEEHRLVHINDCLD 216

Query: 180 LVEGPTE-VAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFV 4
            VE P + V          QC  +V+D  P+ S ++VVDVSPVV+WL NLGLA+Y + FV
Sbjct: 217 KVENPGQNVVFPPSVDREFQCVPEVVDGPPL-SPRQVVDVSPVVKWLSNLGLARYADAFV 275

Query: 3   R 1
           R
Sbjct: 276 R 276


>emb|CDP17885.1| unnamed protein product [Coffea canephora]
          Length = 749

 Score =  127 bits (319), Expect = 2e-30
 Identities = 97/243 (39%), Positives = 125/243 (51%), Gaps = 36/243 (14%)
 Frame = -1

Query: 621 HSSTTLRPSEKLKKQKPINPGKEN----------RLFHETE----EADLD----CGLDSI 496
           +SS    P +    ++ INPGKEN            F E +    E  LD    CGLDSI
Sbjct: 62  NSSDLPLPKKVKNTEQKINPGKENIWVSSNPSGPSFFREDDKTIDEFKLDLAGSCGLDSI 121

Query: 495 EPTLDL-LNPKGDSDYSHSKKSAESKLFKPCGEEREKLCDEEFYEGSSQLDVLLKLC-AD 322
           E T+D   N K  ++    +   E       G    K   E+   G++ LD+LLKLC AD
Sbjct: 122 ESTIDCQANGKLKNNEERKESGLEESGKGQWGGNEYK---EDSEGGTAHLDLLLKLCDAD 178

Query: 321 VDEQG---------NNNAMD-------DSEGKCVVFICCPICGADISGLSDDLRQIHTNE 190
            D+           +++ +D       + E      ICCP+CG DISGLSD+LRQ+HTNE
Sbjct: 179 SDQDVECSEKVSTCSDDGLDFREACGFEEEEVDERLICCPLCGNDISGLSDELRQVHTNE 238

Query: 189 CLDLVEGPTEVAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEI 10
           CLD  E   E     + +  H  P  VLD SP +S++KVV   PV+EWL NLGLAKYEEI
Sbjct: 239 CLDKGETANENLRNQE-KATHIVP-FVLDGSPRQSSRKVVAAFPVLEWLHNLGLAKYEEI 296

Query: 9   FVR 1
           FVR
Sbjct: 297 FVR 299


>gb|KDO73678.1| hypothetical protein CISIN_1g0048772mg, partial [Citrus sinensis]
          Length = 269

 Score =  118 bits (296), Expect = 3e-29
 Identities = 87/229 (37%), Positives = 116/229 (50%), Gaps = 19/229 (8%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPI-NPGKENRLFHETEEADLDCGLDSIEPTLDLLNPKGDSD 454
           LKP ++    PS   KK KP+ N GKEN +      +D  C L++I  ++D   P    D
Sbjct: 42  LKPSNN----PSRPSKKPKPVTNLGKENNIEGFYLNSDETCSLEAIPSSIDCTRPTACVD 97

Query: 453 YSHS-----------------KKSAESKLFKPCGEEREKLCDEEFYEGSSQLDVLLKLCA 325
             HS                 + S ES+L +P   +     + E  E  + LDVLLKLC 
Sbjct: 98  IDHSPECEEIKEILKVNEGYLRNSVESRLLRPRAADCRLSEESEEEEEDAVLDVLLKLCD 157

Query: 324 DVDEQGNNNAMDDSEGKCVVFICCPICGADISGLSDDLRQIHTNECLDLVEGPT-EVAAT 148
             D   N N +D+S       + CP+CG DIS L+++LRQ HTN CLD  E    +V   
Sbjct: 158 KNDV--NCNKIDES-------VRCPLCGIDISDLNEELRQAHTNNCLDKCENQAQDVVFP 208

Query: 147 SDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFVR 1
              RG    P   +D    +S +K VDVSPVVE+L +LGLA+YEE FVR
Sbjct: 209 KHERGPRLEP--EIDLGLGRSPQKAVDVSPVVEFLHSLGLARYEEAFVR 255


>ref|XP_002309453.1| sterile alpha motif domain-containing family protein [Populus
           trichocarpa] gi|222855429|gb|EEE92976.1| sterile alpha
           motif domain-containing family protein [Populus
           trichocarpa]
          Length = 740

 Score =  121 bits (303), Expect = 3e-28
 Identities = 91/232 (39%), Positives = 120/232 (51%), Gaps = 31/232 (13%)
 Frame = -1

Query: 603 RPSEKLKKQKPINPGKEN------RLFHETEEA------DLDCGLDSIEPTLDLL----- 475
           RPS+K KK  P NPGKEN       L+ +TE        D +C LD IE ++D       
Sbjct: 44  RPSKKPKK--PPNPGKENIDPNSLLLYQKTESGANDFNLDENCSLDFIESSIDCTVSSKV 101

Query: 474 -NPKGDSDYSHSKK----------SAESKLFKPCGE-EREKLCDEEFYEGSSQLDVLLKL 331
            N K DS     +K          S E++L K   +     + +EE +E +S+LD L+KL
Sbjct: 102 GNEKFDSGSGKKEKLEVSGGYLCNSIEARLMKSRVDYSGVNVGNEEDFEENSELDALIKL 161

Query: 330 CADVDE-QGNNNAMDDSEGKCVVFICCPICGADISGLSDDLRQIHTNECLDLVEGP-TEV 157
           C + +E +       +  G    F+ CP+CG DIS LS++ R +HTNECLD  E   T V
Sbjct: 162 CTEEEESEAREKIKVNCNGDECCFVLCPLCGTDISDLSEEFRLVHTNECLDKEENSVTYV 221

Query: 156 AATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFVR 1
               D       P  V  + P+   KKVV VSPVV+WLRNLGL +YEE FVR
Sbjct: 222 VLGGDDGRPEVVPRGV--EGPVCGPKKVV-VSPVVKWLRNLGLERYEEDFVR 270


>ref|XP_006474528.1| PREDICTED: DNA cross-link repair 1A protein [Citrus sinensis]
          Length = 728

 Score =  119 bits (299), Expect = 9e-28
 Identities = 87/229 (37%), Positives = 117/229 (51%), Gaps = 19/229 (8%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPI-NPGKENRLFHETEEADLDCGLDSIEPTLDLLNPKGDSD 454
           LKP ++    PS   KK KP+ N GKEN +      +D  C L++I  ++D   P    D
Sbjct: 45  LKPSNN----PSRPSKKPKPVTNLGKENNIEGFYLNSDETCSLEAIPSSIDCTRPTACVD 100

Query: 453 YSHS-----------------KKSAESKLFKPCGEEREKLCDEEFYEGSSQLDVLLKLCA 325
             HS                 + S ES+L +P   +     + E  E  ++LDVLLKLC 
Sbjct: 101 VDHSPECEEIKEILKVNEGYLRNSVESRLLRPRAADCSLSEESEEEEEDAELDVLLKLCD 160

Query: 324 DVDEQGNNNAMDDSEGKCVVFICCPICGADISGLSDDLRQIHTNECLDLVEGPT-EVAAT 148
             D   N N +D+S       + CP+CG DIS L+++LRQ HTN CLD  E    +V   
Sbjct: 161 KNDV--NCNKIDES-------VRCPLCGIDISDLNEELRQAHTNNCLDKCENQAQDVVFP 211

Query: 147 SDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFVR 1
              RG    P   +D    +S +K VDVSPVVE+L +LGLA+YEE FVR
Sbjct: 212 RHERGPRLEP--EIDLGLGRSPQKAVDVSPVVEFLHSLGLARYEEAFVR 258


>ref|XP_012451377.1| PREDICTED: DNA cross-link repair protein SNM1 isoform X2 [Gossypium
           raimondii]
          Length = 732

 Score =  116 bits (291), Expect = 1e-26
 Identities = 91/255 (35%), Positives = 120/255 (47%), Gaps = 45/255 (17%)
 Frame = -1

Query: 630 LKPHS-STTLRPSEK-----LKKQKPINPGKENRLFHE---TEEADLD-----CGLDSIE 493
           LKP+S  T L+PS        K+    N GKEN +      T   DL      CGLD I 
Sbjct: 33  LKPNSHKTPLKPSNPPHPSFKKRNHAANSGKENAVVSTVPATRSDDLPVLRDICGLDLIP 92

Query: 492 PTLD-------LLNPKGDSDYSHSKK-------------SAESKLFKPCGEEREKLCDEE 373
            ++D         N + D+     KK             S ES+L +P  E  E  C  E
Sbjct: 93  SSIDSSFDSTSAQNKESDTVKCDEKKMESLELTKGYMCNSVESRLIRPISELSEGFC--E 150

Query: 372 FYEGSSQLDVLLKLCADVDEQGNNNAMDDSEGKCVV---------FICCPICGADISGLS 220
             E   +LD LLKLC +V+E+    + ++ E   +           + CP+CG DIS L+
Sbjct: 151 VCEEDEELDELLKLCDEVEEKEEETSREEEEDNGIEQERNAEDNGSVPCPLCGVDISNLN 210

Query: 219 DDLRQIHTNECLDLVEGPTE--VAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEW 46
           ++ R +HTN CLD VE P    V  +S     H  P  V  D P+ S ++VVDVSPVV W
Sbjct: 211 EEQRLVHTNGCLDKVENPPPKVVIPSSVDSELHSLPEVV--DGPLLSPRQVVDVSPVVNW 268

Query: 45  LRNLGLAKYEEIFVR 1
           L  LGLAKY   FV+
Sbjct: 269 LSGLGLAKYAAAFVQ 283


>ref|XP_012451376.1| PREDICTED: DNA cross-link repair protein SNM1 isoform X1 [Gossypium
           raimondii] gi|763798444|gb|KJB65399.1| hypothetical
           protein B456_010G093400 [Gossypium raimondii]
          Length = 752

 Score =  116 bits (291), Expect = 1e-26
 Identities = 91/255 (35%), Positives = 120/255 (47%), Gaps = 45/255 (17%)
 Frame = -1

Query: 630 LKPHS-STTLRPSEK-----LKKQKPINPGKENRLFHE---TEEADLD-----CGLDSIE 493
           LKP+S  T L+PS        K+    N GKEN +      T   DL      CGLD I 
Sbjct: 33  LKPNSHKTPLKPSNPPHPSFKKRNHAANSGKENAVVSTVPATRSDDLPVLRDICGLDLIP 92

Query: 492 PTLD-------LLNPKGDSDYSHSKK-------------SAESKLFKPCGEEREKLCDEE 373
            ++D         N + D+     KK             S ES+L +P  E  E  C  E
Sbjct: 93  SSIDSSFDSTSAQNKESDTVKCDEKKMESLELTKGYMCNSVESRLIRPISELSEGFC--E 150

Query: 372 FYEGSSQLDVLLKLCADVDEQGNNNAMDDSEGKCVV---------FICCPICGADISGLS 220
             E   +LD LLKLC +V+E+    + ++ E   +           + CP+CG DIS L+
Sbjct: 151 VCEEDEELDELLKLCDEVEEKEEETSREEEEDNGIEQERNAEDNGSVPCPLCGVDISNLN 210

Query: 219 DDLRQIHTNECLDLVEGPTE--VAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEW 46
           ++ R +HTN CLD VE P    V  +S     H  P  V  D P+ S ++VVDVSPVV W
Sbjct: 211 EEQRLVHTNGCLDKVENPPPKVVIPSSVDSELHSLPEVV--DGPLLSPRQVVDVSPVVNW 268

Query: 45  LRNLGLAKYEEIFVR 1
           L  LGLAKY   FV+
Sbjct: 269 LSGLGLAKYAAAFVQ 283


>ref|XP_010648406.1| PREDICTED: uncharacterized protein LOC100256089 isoform X3 [Vitis
           vinifera]
          Length = 590

 Score =  116 bits (290), Expect = 1e-26
 Identities = 90/240 (37%), Positives = 125/240 (52%), Gaps = 30/240 (12%)
 Frame = -1

Query: 630 LKPHSSTTLRPSEKLKKQKPINPGKEN-------RLFHETEEADLDCGL--DSIEPTLDL 478
           LKP S ++ RPS++ K      PGKEN       R   E EE         DSIE  L  
Sbjct: 20  LKP-SDSSRRPSKRPKAAATAAPGKENVPPSRKKRDCSEREELKSKGSYLCDSIESRLLN 78

Query: 477 LNPKGD----------SDYSHSKKSAESKLFKPC----GEEREKLCDEEFYEGSSQLDVL 340
               GD          S+ S+S  S ES+L K      G+     C EE  E   QLDVL
Sbjct: 79  ARSGGDGNITCGFSEESEGSYSCNSVESRLLKSRSGGDGDGNGGFC-EESDEDFEQLDVL 137

Query: 339 LKLCADVDEQGNNNAM-------DDSEGKCVVFICCPICGADISGLSDDLRQIHTNECLD 181
           ++LC++ +E+ +++           SEG+ +V   CP+C  DIS L+D+LRQ+HTN CLD
Sbjct: 138 IRLCSEGEEEPDSDGFRFREQRGSGSEGRGLVR--CPLCEIDISDLNDELRQVHTNGCLD 195

Query: 180 LVEGPTEVAATSDGRGAHQCPGQVLDDSPIKSAKKVVDVSPVVEWLRNLGLAKYEEIFVR 1
            +E    +    +G    Q P    D SP+++ +KVVDVSPV+ W+ +LGL +YEE F+R
Sbjct: 196 RLEADNVLR---NGDRECQFPQPFNDGSPVQTHQKVVDVSPVIGWIHSLGLGRYEEAFIR 252

