BLASTX nr result

ID: Zanthoxylum22_contig00010738 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00010738
         (819 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KDO86797.1| hypothetical protein CISIN_1g039468mg [Citrus sin...   503   e-140
ref|XP_006444605.1| hypothetical protein CICLE_v10018980mg [Citr...   503   e-140
ref|XP_006444604.1| hypothetical protein CICLE_v10018980mg [Citr...   503   e-140
ref|XP_010271006.1| PREDICTED: uncharacterized protein LOC104607...   324   4e-86
ref|XP_012082990.1| PREDICTED: uncharacterized protein LOC105642...   316   1e-83
ref|XP_012082989.1| PREDICTED: uncharacterized protein LOC105642...   316   1e-83
gb|KDP28332.1| hypothetical protein JCGZ_14103 [Jatropha curcas]      316   1e-83
ref|XP_011005949.1| PREDICTED: uncharacterized protein LOC105112...   285   2e-74
ref|XP_002302803.2| hypothetical protein POPTR_0002s18900g [Popu...   284   5e-74
ref|XP_002320261.2| zinc finger family protein [Populus trichoca...   284   6e-74
ref|XP_011040349.1| PREDICTED: uncharacterized protein LOC105136...   283   1e-73
ref|XP_007051274.1| MuDR family transposase, putative isoform 2 ...   283   1e-73
ref|XP_007051273.1| MuDR family transposase, putative isoform 1 ...   283   1e-73
ref|XP_010241459.1| PREDICTED: uncharacterized protein LOC104586...   277   6e-72
ref|XP_010241463.1| PREDICTED: uncharacterized protein LOC104586...   276   1e-71
ref|XP_010241621.1| PREDICTED: uncharacterized protein LOC104586...   275   3e-71
ref|XP_010241619.1| PREDICTED: uncharacterized protein LOC104586...   275   3e-71
ref|XP_010241617.1| PREDICTED: uncharacterized protein LOC104586...   275   3e-71
ref|XP_012443064.1| PREDICTED: uncharacterized protein LOC105767...   274   6e-71
ref|XP_012443073.1| PREDICTED: uncharacterized protein LOC105767...   274   6e-71

>gb|KDO86797.1| hypothetical protein CISIN_1g039468mg [Citrus sinensis]
          Length = 1282

 Score =  503 bits (1294), Expect = e-140
 Identities = 247/272 (90%), Positives = 253/272 (93%)
 Frame = -2

Query: 818  RTEGAESVSPSDNLIALPSVSGDGVLLNTLFPVASDSGDTRLLSNLAPAGSDPRQQKLIK 639
            RT GAESV+PSDNLI LPSVSGD VLLNTLFP ASD+GDTR LSN APAGSD RQQKLIK
Sbjct: 644  RTTGAESVTPSDNLIPLPSVSGDMVLLNTLFPAASDAGDTRQLSNSAPAGSDLRQQKLIK 703

Query: 638  SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS 459
            SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS
Sbjct: 704  SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS 763

Query: 458  RVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEIAEEIR 279
            RVPTTQLFQIKTMN  HTC AGPDPTTRSKASRKLLASIVKEKLLE PKCKPKEIAEEIR
Sbjct: 764  RVPTTQLFQIKTMNGMHTCKAGPDPTTRSKASRKLLASIVKEKLLEAPKCKPKEIAEEIR 823

Query: 278  RDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTREDLSF 99
            RDFGIELGYVKAWRALENARGESQ SYK+SYNQLPWLVDKILETNP SVVTLSTREDLSF
Sbjct: 824  RDFGIELGYVKAWRALENARGESQVSYKDSYNQLPWLVDKILETNPGSVVTLSTREDLSF 883

Query: 98   HQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
            H LFV   ASLYGF+NGCRPLIFL++  IKSK
Sbjct: 884  HHLFVALHASLYGFQNGCRPLIFLDSFPIKSK 915


>ref|XP_006444605.1| hypothetical protein CICLE_v10018980mg [Citrus clementina]
           gi|568878875|ref|XP_006492409.1| PREDICTED:
           uncharacterized protein LOC102626371 isoform X1 [Citrus
           sinensis] gi|568878877|ref|XP_006492410.1| PREDICTED:
           uncharacterized protein LOC102626371 isoform X2 [Citrus
           sinensis] gi|557546867|gb|ESR57845.1| hypothetical
           protein CICLE_v10018980mg [Citrus clementina]
          Length = 757

 Score =  503 bits (1294), Expect = e-140
 Identities = 247/272 (90%), Positives = 253/272 (93%)
 Frame = -2

Query: 818 RTEGAESVSPSDNLIALPSVSGDGVLLNTLFPVASDSGDTRLLSNLAPAGSDPRQQKLIK 639
           RT GAESV+PSDNLI LPSVSGD VLLNTLFP ASD+GDTR LSN APAGSD RQQKLIK
Sbjct: 119 RTTGAESVTPSDNLIPLPSVSGDMVLLNTLFPAASDAGDTRQLSNSAPAGSDLRQQKLIK 178

Query: 638 SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS 459
           SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS
Sbjct: 179 SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS 238

Query: 458 RVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEIAEEIR 279
           RVPTTQLFQIKTMN  HTC AGPDPTTRSKASRKLLASIVKEKLLE PKCKPKEIAEEIR
Sbjct: 239 RVPTTQLFQIKTMNGMHTCKAGPDPTTRSKASRKLLASIVKEKLLEAPKCKPKEIAEEIR 298

Query: 278 RDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTREDLSF 99
           RDFGIELGYVKAWRALENARGESQ SYK+SYNQLPWLVDKILETNP SVVTLSTREDLSF
Sbjct: 299 RDFGIELGYVKAWRALENARGESQVSYKDSYNQLPWLVDKILETNPGSVVTLSTREDLSF 358

Query: 98  HQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
           H LFV   ASLYGF+NGCRPLIFL++  IKSK
Sbjct: 359 HHLFVALHASLYGFQNGCRPLIFLDSFPIKSK 390


>ref|XP_006444604.1| hypothetical protein CICLE_v10018980mg [Citrus clementina]
           gi|557546866|gb|ESR57844.1| hypothetical protein
           CICLE_v10018980mg [Citrus clementina]
          Length = 648

 Score =  503 bits (1294), Expect = e-140
 Identities = 247/272 (90%), Positives = 253/272 (93%)
 Frame = -2

Query: 818 RTEGAESVSPSDNLIALPSVSGDGVLLNTLFPVASDSGDTRLLSNLAPAGSDPRQQKLIK 639
           RT GAESV+PSDNLI LPSVSGD VLLNTLFP ASD+GDTR LSN APAGSD RQQKLIK
Sbjct: 10  RTTGAESVTPSDNLIPLPSVSGDMVLLNTLFPAASDAGDTRQLSNSAPAGSDLRQQKLIK 69

Query: 638 SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS 459
           SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS
Sbjct: 70  SWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHAS 129

Query: 458 RVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEIAEEIR 279
           RVPTTQLFQIKTMN  HTC AGPDPTTRSKASRKLLASIVKEKLLE PKCKPKEIAEEIR
Sbjct: 130 RVPTTQLFQIKTMNGMHTCKAGPDPTTRSKASRKLLASIVKEKLLEAPKCKPKEIAEEIR 189

Query: 278 RDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTREDLSF 99
           RDFGIELGYVKAWRALENARGESQ SYK+SYNQLPWLVDKILETNP SVVTLSTREDLSF
Sbjct: 190 RDFGIELGYVKAWRALENARGESQVSYKDSYNQLPWLVDKILETNPGSVVTLSTREDLSF 249

Query: 98  HQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
           H LFV   ASLYGF+NGCRPLIFL++  IKSK
Sbjct: 250 HHLFVALHASLYGFQNGCRPLIFLDSFPIKSK 281


>ref|XP_010271006.1| PREDICTED: uncharacterized protein LOC104607156 [Nelumbo nucifera]
          Length = 724

 Score =  324 bits (831), Expect = 4e-86
 Identities = 150/228 (65%), Positives = 185/228 (81%)
 Frame = -2

Query: 686 NLAPAGSDPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRV 507
           N    G D  QQKL + W+N ITG+HQ+FN+V+DFRDAL KYS+AHGF Y FK N+  RV
Sbjct: 134 NAVEDGEDVGQQKLTRLWENSITGLHQQFNSVNDFRDALRKYSIAHGFAYMFKNNDSRRV 193

Query: 506 TAKCKAEGCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKL 327
           +AKCKAEGCPWR+HAS++ TTQLF+IK MN THTC AG   T RS+A++KL+ASIVK+KL
Sbjct: 194 SAKCKAEGCPWRVHASKLSTTQLFRIKKMNATHTCGAGTGTTNRSQATKKLVASIVKDKL 253

Query: 326 LETPKCKPKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILET 147
            ++P  +PKEIA +IRRDFGIEL Y + WR +E AR E QGSYKESYNQLPWL DK++E 
Sbjct: 254 RDSPNYRPKEIANDIRRDFGIELRYSQVWRGMETAREELQGSYKESYNQLPWLCDKMVEA 313

Query: 146 NPSSVVTLSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
           NP SV TL TR+DLSFH+LFV   ASL+GFE+GCRPL+FL+T+T+KS+
Sbjct: 314 NPGSVATLITRDDLSFHRLFVALHASLFGFEHGCRPLLFLDTMTLKSR 361


>ref|XP_012082990.1| PREDICTED: uncharacterized protein LOC105642692 isoform X2
           [Jatropha curcas]
          Length = 643

 Score =  316 bits (809), Expect = 1e-83
 Identities = 151/241 (62%), Positives = 186/241 (77%)
 Frame = -2

Query: 725 PVASDSGDTRLLSNLAPAGSDPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHG 546
           PVAS  GD    + +A    D    KL KSW NCITG+HQ+FNNV + RDAL +YS+AHG
Sbjct: 36  PVASIGGDMDQPNPVAIEIKDAGHHKLFKSWANCITGLHQQFNNVQELRDALRRYSIAHG 95

Query: 545 FTYCFKKNEGLRVTAKCKAEGCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKA 366
           F   FK N+  RV+AKCKAEGCPWRIHAS++ TT LF+IK +NE HTC AG     R +A
Sbjct: 96  FKCKFKHNDATRVSAKCKAEGCPWRIHASKLSTTPLFRIKKLNEIHTCGAGTGRLNRPQA 155

Query: 365 SRKLLASIVKEKLLETPKCKPKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESY 186
           SRKL+A+IVKEKL ++P  +PKEIA +IR++FGIEL Y +AWR +E AR E QGSYKE+Y
Sbjct: 156 SRKLVATIVKEKLKDSPNFRPKEIANQIRQEFGIELRYSQAWRGMETAREELQGSYKEAY 215

Query: 185 NQLPWLVDKILETNPSSVVTLSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKS 6
           NQLPWL +KI+ETNP SV TL TRED SFH+LFV   AS+YGF+NGCRPL+FL+++T+KS
Sbjct: 216 NQLPWLCEKIVETNPGSVATLITREDQSFHRLFVALHASIYGFQNGCRPLLFLDSVTLKS 275

Query: 5   K 3
           K
Sbjct: 276 K 276


>ref|XP_012082989.1| PREDICTED: uncharacterized protein LOC105642692 isoform X1
           [Jatropha curcas]
          Length = 740

 Score =  316 bits (809), Expect = 1e-83
 Identities = 151/241 (62%), Positives = 186/241 (77%)
 Frame = -2

Query: 725 PVASDSGDTRLLSNLAPAGSDPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHG 546
           PVAS  GD    + +A    D    KL KSW NCITG+HQ+FNNV + RDAL +YS+AHG
Sbjct: 133 PVASIGGDMDQPNPVAIEIKDAGHHKLFKSWANCITGLHQQFNNVQELRDALRRYSIAHG 192

Query: 545 FTYCFKKNEGLRVTAKCKAEGCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKA 366
           F   FK N+  RV+AKCKAEGCPWRIHAS++ TT LF+IK +NE HTC AG     R +A
Sbjct: 193 FKCKFKHNDATRVSAKCKAEGCPWRIHASKLSTTPLFRIKKLNEIHTCGAGTGRLNRPQA 252

Query: 365 SRKLLASIVKEKLLETPKCKPKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESY 186
           SRKL+A+IVKEKL ++P  +PKEIA +IR++FGIEL Y +AWR +E AR E QGSYKE+Y
Sbjct: 253 SRKLVATIVKEKLKDSPNFRPKEIANQIRQEFGIELRYSQAWRGMETAREELQGSYKEAY 312

Query: 185 NQLPWLVDKILETNPSSVVTLSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKS 6
           NQLPWL +KI+ETNP SV TL TRED SFH+LFV   AS+YGF+NGCRPL+FL+++T+KS
Sbjct: 313 NQLPWLCEKIVETNPGSVATLITREDQSFHRLFVALHASIYGFQNGCRPLLFLDSVTLKS 372

Query: 5   K 3
           K
Sbjct: 373 K 373


>gb|KDP28332.1| hypothetical protein JCGZ_14103 [Jatropha curcas]
          Length = 738

 Score =  316 bits (809), Expect = 1e-83
 Identities = 151/241 (62%), Positives = 186/241 (77%)
 Frame = -2

Query: 725 PVASDSGDTRLLSNLAPAGSDPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHG 546
           PVAS  GD    + +A    D    KL KSW NCITG+HQ+FNNV + RDAL +YS+AHG
Sbjct: 131 PVASIGGDMDQPNPVAIEIKDAGHHKLFKSWANCITGLHQQFNNVQELRDALRRYSIAHG 190

Query: 545 FTYCFKKNEGLRVTAKCKAEGCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKA 366
           F   FK N+  RV+AKCKAEGCPWRIHAS++ TT LF+IK +NE HTC AG     R +A
Sbjct: 191 FKCKFKHNDATRVSAKCKAEGCPWRIHASKLSTTPLFRIKKLNEIHTCGAGTGRLNRPQA 250

Query: 365 SRKLLASIVKEKLLETPKCKPKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESY 186
           SRKL+A+IVKEKL ++P  +PKEIA +IR++FGIEL Y +AWR +E AR E QGSYKE+Y
Sbjct: 251 SRKLVATIVKEKLKDSPNFRPKEIANQIRQEFGIELRYSQAWRGMETAREELQGSYKEAY 310

Query: 185 NQLPWLVDKILETNPSSVVTLSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKS 6
           NQLPWL +KI+ETNP SV TL TRED SFH+LFV   AS+YGF+NGCRPL+FL+++T+KS
Sbjct: 311 NQLPWLCEKIVETNPGSVATLITREDQSFHRLFVALHASIYGFQNGCRPLLFLDSVTLKS 370

Query: 5   K 3
           K
Sbjct: 371 K 371


>ref|XP_011005949.1| PREDICTED: uncharacterized protein LOC105112077 [Populus
           euphratica]
          Length = 762

 Score =  285 bits (730), Expect = 2e-74
 Identities = 133/241 (55%), Positives = 181/241 (75%)
 Frame = -2

Query: 725 PVASDSGDTRLLSNLAPAGSDPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHG 546
           P + ++G    L+  A       Q K +K W+NCITG+HQ+FNNV + RDAL KYS+A G
Sbjct: 159 PASENAGRFNTLAAAAGEIESVGQLKRVKLWENCITGLHQQFNNVREVRDALRKYSIAQG 218

Query: 545 FTYCFKKNEGLRVTAKCKAEGCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKA 366
           FT  FKKN+ +RV+ KC  +GCPWRI ASR+ TT LF+IK +NE HTC AG    +  +A
Sbjct: 219 FTVKFKKNDSMRVSVKCSVDGCPWRIFASRLSTTHLFRIKRLNEIHTCGAGTGTDSHPRA 278

Query: 365 SRKLLASIVKEKLLETPKCKPKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESY 186
           S+K++  IVKEKL ++P  KPKEIA +I+++FGIEL Y +  R +E A  E QGSY+E+Y
Sbjct: 279 SKKVVEGIVKEKLHDSPNVKPKEIANQIQQEFGIELRYSQVRRWMEAATEEIQGSYREAY 338

Query: 185 NQLPWLVDKILETNPSSVVTLSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKS 6
           NQLPWL +KI+ETNP + ++L+TREDLSFH+LFV F ASL+GF++GCRPL+FL+T++++S
Sbjct: 339 NQLPWLCEKIVETNPGTAISLNTREDLSFHRLFVAFHASLHGFQSGCRPLLFLDTMSLQS 398

Query: 5   K 3
           K
Sbjct: 399 K 399


>ref|XP_002302803.2| hypothetical protein POPTR_0002s18900g [Populus trichocarpa]
           gi|550345338|gb|EEE82076.2| hypothetical protein
           POPTR_0002s18900g [Populus trichocarpa]
          Length = 760

 Score =  284 bits (727), Expect = 5e-74
 Identities = 133/241 (55%), Positives = 180/241 (74%)
 Frame = -2

Query: 725 PVASDSGDTRLLSNLAPAGSDPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHG 546
           P + ++G    L+  A       Q K +K W+NCITG+HQ+FNNV + RDAL KYS+A G
Sbjct: 157 PASENAGRFNTLAAAAGEIESVGQLKRVKLWENCITGLHQQFNNVREVRDALRKYSIAQG 216

Query: 545 FTYCFKKNEGLRVTAKCKAEGCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKA 366
           FT  FKKN+ +RV+ KC  +GCPWRI ASR+ TT LF+IK +NE HTC AG    +  +A
Sbjct: 217 FTVKFKKNDSMRVSVKCSVDGCPWRIFASRLSTTHLFRIKRLNEIHTCGAGTGTDSHPRA 276

Query: 365 SRKLLASIVKEKLLETPKCKPKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESY 186
           S+K++  IVKEKL ++P  KPKEIA +I+++FGIEL Y +  R +E A  E QGSY+E+Y
Sbjct: 277 SKKVVEGIVKEKLHDSPNVKPKEIANQIQQEFGIELRYSQVRRWMEAATEEIQGSYREAY 336

Query: 185 NQLPWLVDKILETNPSSVVTLSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKS 6
           NQLPWL +KI+ETNP + V+L+TREDL FH+LFV F ASL+GF++GCRPL+FL+T++++S
Sbjct: 337 NQLPWLCEKIVETNPGTAVSLNTREDLGFHRLFVAFHASLHGFQSGCRPLLFLDTMSLQS 396

Query: 5   K 3
           K
Sbjct: 397 K 397


>ref|XP_002320261.2| zinc finger family protein [Populus trichocarpa]
           gi|550323954|gb|EEE98576.2| zinc finger family protein
           [Populus trichocarpa]
          Length = 580

 Score =  284 bits (726), Expect = 6e-74
 Identities = 129/216 (59%), Positives = 172/216 (79%)
 Frame = -2

Query: 650 KLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWR 471
           K +K W+NCITG+HQ+FNNV + RDA  KYS+A GFT  FKKN+ +RV+AKC  +GCPWR
Sbjct: 2   KRVKLWENCITGLHQQFNNVREVRDAFRKYSIAQGFTIKFKKNDSMRVSAKCSVDGCPWR 61

Query: 470 IHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEIA 291
           I ASR+ TT LF+IK +NE HTC AG    +  +AS+K++  IVKEKL ++P  KPKEIA
Sbjct: 62  IFASRLSTTHLFRIKRLNEIHTCGAGTSTDSHPRASKKVVEGIVKEKLRDSPNVKPKEIA 121

Query: 290 EEIRRDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTRE 111
            +I+++FGIEL Y +  R +E A  E QGSYKE+YNQLPWL +KI+ETNP + V+L+TRE
Sbjct: 122 NQIQQEFGIELRYSQVRRWMEAATEEIQGSYKEAYNQLPWLCEKIVETNPGTAVSLNTRE 181

Query: 110 DLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
           DLSFH+LF+ F ASL+GF++GCRPL+FL+T++++SK
Sbjct: 182 DLSFHRLFIAFHASLHGFQSGCRPLLFLDTMSLQSK 217


>ref|XP_011040349.1| PREDICTED: uncharacterized protein LOC105136632 [Populus
           euphratica] gi|743791035|ref|XP_011040358.1| PREDICTED:
           uncharacterized protein LOC105136632 [Populus
           euphratica]
          Length = 756

 Score =  283 bits (724), Expect = 1e-73
 Identities = 129/218 (59%), Positives = 171/218 (78%)
 Frame = -2

Query: 656 QQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCP 477
           Q K +K W+NCITG+HQ+FNNV + RDA  KYS+A GFT  FKKN+ +RV+AKC  +GCP
Sbjct: 176 QMKRVKLWENCITGLHQQFNNVREVRDAFRKYSIAQGFTIKFKKNDSMRVSAKCSVDGCP 235

Query: 476 WRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKE 297
           WRI ASR+ TT LF+IK +NE HTC AG    +  +AS+K++  IVKEKL ++P  KPKE
Sbjct: 236 WRIFASRLSTTHLFRIKRLNEIHTCGAGTSTDSHPRASKKVVEGIVKEKLRDSPNVKPKE 295

Query: 296 IAEEIRRDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLST 117
           IA +I+ +FGIEL Y +  R +E A  E QGSYKE+YNQLPWL +KI+ETNP + V+L+T
Sbjct: 296 IANQIQEEFGIELRYSQVRRWMEAATEEIQGSYKEAYNQLPWLCEKIVETNPGTAVSLNT 355

Query: 116 REDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
           RED SFH+LF+ F ASL+GF++GCRPL+FL+T++++SK
Sbjct: 356 REDQSFHRLFIAFHASLHGFQSGCRPLLFLDTMSLQSK 393


>ref|XP_007051274.1| MuDR family transposase, putative isoform 2 [Theobroma cacao]
           gi|590720229|ref|XP_007051275.1| MuDR family
           transposase, putative isoform 2 [Theobroma cacao]
           gi|508703535|gb|EOX95431.1| MuDR family transposase,
           putative isoform 2 [Theobroma cacao]
           gi|508703536|gb|EOX95432.1| MuDR family transposase,
           putative isoform 2 [Theobroma cacao]
          Length = 638

 Score =  283 bits (724), Expect = 1e-73
 Identities = 142/269 (52%), Positives = 187/269 (69%), Gaps = 3/269 (1%)
 Frame = -2

Query: 803 ESVSPSDNLIALPSVSGDGVLLNTLFPVASDSGDTRLLSNL---APAGSDPRQQKLIKSW 633
           E V+  D+     SVSGD   L+ L  +AS S D    +NL   AP   D   QKL+KSW
Sbjct: 4   EPVTSPDSFTHAASVSGDTEQLDWLASIASVSRDMNHPNNLTPEAPNDKDNGLQKLVKSW 63

Query: 632 KNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHASRV 453
           +NC+TG+ Q+FNNV+DFR AL ++S+AHGF Y FK N    V A CKAEGCPW I A+R+
Sbjct: 64  ENCLTGLDQQFNNVYDFRVALNRFSIAHGFKYTFKTNNARYVIATCKAEGCPWSIQAARL 123

Query: 452 PTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEIAEEIRRD 273
            TT+LF IK M+ETH+C AG       + S KL+  +VKEKL + P  KP+EIA+EI +D
Sbjct: 124 STTKLFLIKKMSETHSCGAGKSSARCPQVSSKLVKILVKEKLRDAPHAKPREIADEILQD 183

Query: 272 FGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTREDLSFHQ 93
           +G +  Y + WR +E  + + Q  Y+E YNQLP LV +++E NP S+ TL TREDLSFH+
Sbjct: 184 YGFKARYSQVWRGVETVKEKHQVPYEEGYNQLPSLVKQMVENNPGSIATLFTREDLSFHR 243

Query: 92  LFVTFRASLYGFENGCRPLIFLETLTIKS 6
           LFV+F+ASL+GF+NGCRPL+FL+T+TIKS
Sbjct: 244 LFVSFQASLHGFKNGCRPLLFLDTMTIKS 272


>ref|XP_007051273.1| MuDR family transposase, putative isoform 1 [Theobroma cacao]
           gi|508703534|gb|EOX95430.1| MuDR family transposase,
           putative isoform 1 [Theobroma cacao]
          Length = 756

 Score =  283 bits (724), Expect = 1e-73
 Identities = 142/269 (52%), Positives = 187/269 (69%), Gaps = 3/269 (1%)
 Frame = -2

Query: 803 ESVSPSDNLIALPSVSGDGVLLNTLFPVASDSGDTRLLSNL---APAGSDPRQQKLIKSW 633
           E V+  D+     SVSGD   L+ L  +AS S D    +NL   AP   D   QKL+KSW
Sbjct: 122 EPVTSPDSFTHAASVSGDTEQLDWLASIASVSRDMNHPNNLTPEAPNDKDNGLQKLVKSW 181

Query: 632 KNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHASRV 453
           +NC+TG+ Q+FNNV+DFR AL ++S+AHGF Y FK N    V A CKAEGCPW I A+R+
Sbjct: 182 ENCLTGLDQQFNNVYDFRVALNRFSIAHGFKYTFKTNNARYVIATCKAEGCPWSIQAARL 241

Query: 452 PTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEIAEEIRRD 273
            TT+LF IK M+ETH+C AG       + S KL+  +VKEKL + P  KP+EIA+EI +D
Sbjct: 242 STTKLFLIKKMSETHSCGAGKSSARCPQVSSKLVKILVKEKLRDAPHAKPREIADEILQD 301

Query: 272 FGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTREDLSFHQ 93
           +G +  Y + WR +E  + + Q  Y+E YNQLP LV +++E NP S+ TL TREDLSFH+
Sbjct: 302 YGFKARYSQVWRGVETVKEKHQVPYEEGYNQLPSLVKQMVENNPGSIATLFTREDLSFHR 361

Query: 92  LFVTFRASLYGFENGCRPLIFLETLTIKS 6
           LFV+F+ASL+GF+NGCRPL+FL+T+TIKS
Sbjct: 362 LFVSFQASLHGFKNGCRPLLFLDTMTIKS 390


>ref|XP_010241459.1| PREDICTED: uncharacterized protein LOC104586057 isoform X1 [Nelumbo
           nucifera] gi|720078803|ref|XP_010241460.1| PREDICTED:
           uncharacterized protein LOC104586057 isoform X1 [Nelumbo
           nucifera]
          Length = 757

 Score =  277 bits (709), Expect = 6e-72
 Identities = 144/277 (51%), Positives = 191/277 (68%), Gaps = 5/277 (1%)
 Frame = -2

Query: 818 RTEGAESVSPSDNLIALPSVSGDGVLLNTLFPVASD-SGDTRLLSNLA--PA--GSDPRQ 654
           RT  +E+V+P D  +  P  +      +      +D + D  ++ ++A  PA    D + 
Sbjct: 119 RTTLSEAVTPVDAPVDAPMDTVVDAPTDINIDTPNDITTDAAIVMSIATPPAITSVDSKH 178

Query: 653 QKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPW 474
            K  K W+N ITGV QRF++VH+FR+AL +YS+AHGF Y +KKN+  RVT KCK EGCPW
Sbjct: 179 NKAKKQWENAITGVDQRFSSVHEFREALRRYSIAHGFAYKYKKNDSHRVTVKCKTEGCPW 238

Query: 473 RIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEI 294
           RIHASR+ TTQL  IK MN THTC  G   TT  +A+R  +ASI+KEKL E+P  KPK+I
Sbjct: 239 RIHASRLSTTQLICIKKMNPTHTCE-GEVATTGYQATRSWVASIIKEKLKESPNYKPKDI 297

Query: 293 AEEIRRDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTR 114
           A +IRR++GI+L Y +AWR  E AR + QGSYKE+Y+QLP+  +KI+ETNP S  T +T+
Sbjct: 298 ANDIRREYGIQLNYSQAWRGKEIAREQLQGSYKEAYSQLPFFCEKIMETNPGSFATFTTK 357

Query: 113 EDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
           ED SFH+LFV F ASL GF+ GCRPLIFL++  + SK
Sbjct: 358 EDSSFHRLFVAFHASLSGFQQGCRPLIFLDSTPLNSK 394


>ref|XP_010241463.1| PREDICTED: uncharacterized protein LOC104586057 isoform X2 [Nelumbo
           nucifera]
          Length = 621

 Score =  276 bits (707), Expect = 1e-71
 Identities = 132/221 (59%), Positives = 167/221 (75%)
 Frame = -2

Query: 665 DPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAE 486
           D +  K  K W+N ITGV QRF++VH+FR+AL +YS+AHGF Y +KKN+  RVT KCK E
Sbjct: 39  DSKHNKAKKQWENAITGVDQRFSSVHEFREALRRYSIAHGFAYKYKKNDSHRVTVKCKTE 98

Query: 485 GCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCK 306
           GCPWRIHASR+ TTQL  IK MN THTC  G   TT  +A+R  +ASI+KEKL E+P  K
Sbjct: 99  GCPWRIHASRLSTTQLICIKKMNPTHTCE-GEVATTGYQATRSWVASIIKEKLKESPNYK 157

Query: 305 PKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVT 126
           PK+IA +IRR++GI+L Y +AWR  E AR + QGSYKE+Y+QLP+  +KI+ETNP S  T
Sbjct: 158 PKDIANDIRREYGIQLNYSQAWRGKEIAREQLQGSYKEAYSQLPFFCEKIMETNPGSFAT 217

Query: 125 LSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
            +T+ED SFH+LFV F ASL GF+ GCRPLIFL++  + SK
Sbjct: 218 FTTKEDSSFHRLFVAFHASLSGFQQGCRPLIFLDSTPLNSK 258


>ref|XP_010241621.1| PREDICTED: uncharacterized protein LOC104586161 isoform X3 [Nelumbo
           nucifera]
          Length = 836

 Score =  275 bits (703), Expect = 3e-71
 Identities = 129/221 (58%), Positives = 169/221 (76%)
 Frame = -2

Query: 665 DPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAE 486
           D   +KLI  WKN ITGV Q+F+ VH+FRDAL KYS+AH F Y  KKNE  R TAKC+A+
Sbjct: 252 DAGVKKLITLWKNGITGVGQQFSGVHEFRDALRKYSIAHHFMYILKKNEASRATAKCRAD 311

Query: 485 GCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCK 306
           GC WRIHAS VPTTQ F IK MN+THTC  G +    S +++  LASI++++L ++P  K
Sbjct: 312 GCTWRIHASWVPTTQTFTIKRMNKTHTC--GGNIGKCSPSTKNWLASIIRDRLQDSPHYK 369

Query: 305 PKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVT 126
           PK+IA+EI RDFGIEL Y + WR +ENAR + QGSYK++YNQLPW  +KI+ETNP S+  
Sbjct: 370 PKDIADEICRDFGIELNYSQVWRGVENARAQLQGSYKDAYNQLPWFCEKIVETNPGSICN 429

Query: 125 LSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
            +T++DLSF  LF++F ASL+GF+NGCRP++FL++  +KSK
Sbjct: 430 FTTKDDLSFQHLFLSFHASLFGFKNGCRPILFLDSTPLKSK 470


>ref|XP_010241619.1| PREDICTED: uncharacterized protein LOC104586161 isoform X2 [Nelumbo
           nucifera] gi|720079297|ref|XP_010241620.1| PREDICTED:
           uncharacterized protein LOC104586161 isoform X2 [Nelumbo
           nucifera]
          Length = 862

 Score =  275 bits (703), Expect = 3e-71
 Identities = 129/221 (58%), Positives = 169/221 (76%)
 Frame = -2

Query: 665 DPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAE 486
           D   +KLI  WKN ITGV Q+F+ VH+FRDAL KYS+AH F Y  KKNE  R TAKC+A+
Sbjct: 278 DAGVKKLITLWKNGITGVGQQFSGVHEFRDALRKYSIAHHFMYILKKNEASRATAKCRAD 337

Query: 485 GCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCK 306
           GC WRIHAS VPTTQ F IK MN+THTC  G +    S +++  LASI++++L ++P  K
Sbjct: 338 GCTWRIHASWVPTTQTFTIKRMNKTHTC--GGNIGKCSPSTKNWLASIIRDRLQDSPHYK 395

Query: 305 PKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVT 126
           PK+IA+EI RDFGIEL Y + WR +ENAR + QGSYK++YNQLPW  +KI+ETNP S+  
Sbjct: 396 PKDIADEICRDFGIELNYSQVWRGVENARAQLQGSYKDAYNQLPWFCEKIVETNPGSICN 455

Query: 125 LSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
            +T++DLSF  LF++F ASL+GF+NGCRP++FL++  +KSK
Sbjct: 456 FTTKDDLSFQHLFLSFHASLFGFKNGCRPILFLDSTPLKSK 496


>ref|XP_010241617.1| PREDICTED: uncharacterized protein LOC104586161 isoform X1 [Nelumbo
           nucifera] gi|720079291|ref|XP_010241618.1| PREDICTED:
           uncharacterized protein LOC104586161 isoform X1 [Nelumbo
           nucifera]
          Length = 867

 Score =  275 bits (703), Expect = 3e-71
 Identities = 129/221 (58%), Positives = 169/221 (76%)
 Frame = -2

Query: 665 DPRQQKLIKSWKNCITGVHQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAE 486
           D   +KLI  WKN ITGV Q+F+ VH+FRDAL KYS+AH F Y  KKNE  R TAKC+A+
Sbjct: 283 DAGVKKLITLWKNGITGVGQQFSGVHEFRDALRKYSIAHHFMYILKKNEASRATAKCRAD 342

Query: 485 GCPWRIHASRVPTTQLFQIKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCK 306
           GC WRIHAS VPTTQ F IK MN+THTC  G +    S +++  LASI++++L ++P  K
Sbjct: 343 GCTWRIHASWVPTTQTFTIKRMNKTHTC--GGNIGKCSPSTKNWLASIIRDRLQDSPHYK 400

Query: 305 PKEIAEEIRRDFGIELGYVKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVT 126
           PK+IA+EI RDFGIEL Y + WR +ENAR + QGSYK++YNQLPW  +KI+ETNP S+  
Sbjct: 401 PKDIADEICRDFGIELNYSQVWRGVENARAQLQGSYKDAYNQLPWFCEKIVETNPGSICN 460

Query: 125 LSTREDLSFHQLFVTFRASLYGFENGCRPLIFLETLTIKSK 3
            +T++DLSF  LF++F ASL+GF+NGCRP++FL++  +KSK
Sbjct: 461 FTTKDDLSFQHLFLSFHASLFGFKNGCRPILFLDSTPLKSK 501


>ref|XP_012443064.1| PREDICTED: uncharacterized protein LOC105767976 isoform X1
           [Gossypium raimondii] gi|823220726|ref|XP_012443065.1|
           PREDICTED: uncharacterized protein LOC105767976 isoform
           X1 [Gossypium raimondii]
           gi|823220728|ref|XP_012443067.1| PREDICTED:
           uncharacterized protein LOC105767976 isoform X1
           [Gossypium raimondii] gi|823220730|ref|XP_012443068.1|
           PREDICTED: uncharacterized protein LOC105767976 isoform
           X1 [Gossypium raimondii]
           gi|823220732|ref|XP_012443069.1| PREDICTED:
           uncharacterized protein LOC105767976 isoform X1
           [Gossypium raimondii] gi|823220734|ref|XP_012443070.1|
           PREDICTED: uncharacterized protein LOC105767976 isoform
           X1 [Gossypium raimondii]
           gi|823220736|ref|XP_012443071.1| PREDICTED:
           uncharacterized protein LOC105767976 isoform X1
           [Gossypium raimondii] gi|823220738|ref|XP_012443072.1|
           PREDICTED: uncharacterized protein LOC105767976 isoform
           X1 [Gossypium raimondii]
          Length = 773

 Score =  274 bits (700), Expect = 6e-71
 Identities = 136/263 (51%), Positives = 176/263 (66%), Gaps = 14/263 (5%)
 Frame = -2

Query: 749 GVLLNTLFPVASDSGDTRLLSNLAPAGSDPRQQ--------------KLIKSWKNCITGV 612
           G++   + P AS SGDT  L + A   +D   Q              KL+KSW+NC+TG+
Sbjct: 146 GMVDEPVTPAASVSGDTEQLDSSASLTTDVDNQNNFTPEAPNANALQKLVKSWENCLTGL 205

Query: 611 HQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHASRVPTTQLFQ 432
            QRFNN HDFR AL K+S+AHGF Y FK N    + A CKAEGCPW I A+R+ TT+LF 
Sbjct: 206 EQRFNNAHDFRVALNKFSIAHGFEYTFKTNRSRYIIANCKAEGCPWTIQAARLSTTKLFL 265

Query: 431 IKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEIAEEIRRDFGIELGY 252
           IK M+ETHTC AG   +   K S KL+  +VKEKL ++P  KP+EI  EI +D+G +  Y
Sbjct: 266 IKKMSETHTCGAGNSSSRHPKVSSKLVKFLVKEKLRDSPNAKPREIINEILQDYGFKARY 325

Query: 251 VKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTREDLSFHQLFVTFRA 72
              WR +E+A+ + Q SY E YNQ+P L  +I+E NP S+ TL T EDLSFH LFV+ +A
Sbjct: 326 AHVWRGVESAKEKPQVSYDEGYNQVPSLFKQIIENNPGSMATLVTGEDLSFHLLFVSLQA 385

Query: 71  SLYGFENGCRPLIFLETLTIKSK 3
           SL+GF+NGCRPL+FL+T+TIKSK
Sbjct: 386 SLHGFKNGCRPLLFLDTMTIKSK 408


>ref|XP_012443073.1| PREDICTED: uncharacterized protein LOC105767976 isoform X2
           [Gossypium raimondii] gi|823220742|ref|XP_012443074.1|
           PREDICTED: uncharacterized protein LOC105767976 isoform
           X2 [Gossypium raimondii]
           gi|823220744|ref|XP_012443075.1| PREDICTED:
           uncharacterized protein LOC105767976 isoform X2
           [Gossypium raimondii] gi|823220746|ref|XP_012443076.1|
           PREDICTED: uncharacterized protein LOC105767976 isoform
           X2 [Gossypium raimondii]
           gi|823220748|ref|XP_012443077.1| PREDICTED:
           uncharacterized protein LOC105767976 isoform X2
           [Gossypium raimondii] gi|763789286|gb|KJB56282.1|
           hypothetical protein B456_009G114800 [Gossypium
           raimondii] gi|763789287|gb|KJB56283.1| hypothetical
           protein B456_009G114800 [Gossypium raimondii]
          Length = 745

 Score =  274 bits (700), Expect = 6e-71
 Identities = 136/263 (51%), Positives = 176/263 (66%), Gaps = 14/263 (5%)
 Frame = -2

Query: 749 GVLLNTLFPVASDSGDTRLLSNLAPAGSDPRQQ--------------KLIKSWKNCITGV 612
           G++   + P AS SGDT  L + A   +D   Q              KL+KSW+NC+TG+
Sbjct: 118 GMVDEPVTPAASVSGDTEQLDSSASLTTDVDNQNNFTPEAPNANALQKLVKSWENCLTGL 177

Query: 611 HQRFNNVHDFRDALYKYSVAHGFTYCFKKNEGLRVTAKCKAEGCPWRIHASRVPTTQLFQ 432
            QRFNN HDFR AL K+S+AHGF Y FK N    + A CKAEGCPW I A+R+ TT+LF 
Sbjct: 178 EQRFNNAHDFRVALNKFSIAHGFEYTFKTNRSRYIIANCKAEGCPWTIQAARLSTTKLFL 237

Query: 431 IKTMNETHTCNAGPDPTTRSKASRKLLASIVKEKLLETPKCKPKEIAEEIRRDFGIELGY 252
           IK M+ETHTC AG   +   K S KL+  +VKEKL ++P  KP+EI  EI +D+G +  Y
Sbjct: 238 IKKMSETHTCGAGNSSSRHPKVSSKLVKFLVKEKLRDSPNAKPREIINEILQDYGFKARY 297

Query: 251 VKAWRALENARGESQGSYKESYNQLPWLVDKILETNPSSVVTLSTREDLSFHQLFVTFRA 72
              WR +E+A+ + Q SY E YNQ+P L  +I+E NP S+ TL T EDLSFH LFV+ +A
Sbjct: 298 AHVWRGVESAKEKPQVSYDEGYNQVPSLFKQIIENNPGSMATLVTGEDLSFHLLFVSLQA 357

Query: 71  SLYGFENGCRPLIFLETLTIKSK 3
           SL+GF+NGCRPL+FL+T+TIKSK
Sbjct: 358 SLHGFKNGCRPLLFLDTMTIKSK 380

