BLASTX nr result

ID: Rehmannia32_contig00008694 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia32_contig00008694
         (1615 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PIN09560.1| DNA oxidative demethylase [Handroanthus impetigin...   267   7e-80
ref|XP_011080840.1| uncharacterized protein LOC105164002 isoform...   250   1e-72
ref|XP_012856810.1| PREDICTED: uncharacterized protein LOC105976...   233   4e-67
ref|XP_020550347.1| uncharacterized protein LOC105164002 isoform...   186   8e-50
ref|XP_022889009.1| uncharacterized protein LOC111404405 [Olea e...   137   7e-32
ref|XP_019196582.1| PREDICTED: uncharacterized protein LOC109190...   122   8e-26
ref|XP_018839226.1| PREDICTED: uncharacterized protein LOC109004...   119   3e-25
gb|EOY27334.1| 2-oxoglutarate-dependent dioxygenase family prote...   112   3e-23
ref|XP_011036495.1| PREDICTED: uncharacterized protein LOC105133...   113   4e-23
gb|PON43568.1| Alkylated DNA repair protein AlkB [Parasponia and...   112   5e-23
gb|EOY27333.1| 2-oxoglutarate-dependent dioxygenase family prote...   112   9e-23
ref|XP_011036496.1| PREDICTED: uncharacterized protein LOC105133...   112   1e-22
gb|OMO70560.1| Oxoglutarate/iron-dependent dioxygenase [Corchoru...   110   2e-22
ref|XP_007024711.2| PREDICTED: uncharacterized protein LOC185962...   111   2e-22
ref|XP_011036494.1| PREDICTED: uncharacterized protein LOC105133...   112   2e-22
ref|XP_011036493.1| PREDICTED: uncharacterized protein LOC105133...   112   2e-22
ref|XP_011036492.1| PREDICTED: uncharacterized protein LOC105133...   112   2e-22
ref|XP_011036491.1| PREDICTED: uncharacterized protein LOC105133...   112   2e-22
ref|XP_010033676.1| PREDICTED: uncharacterized protein LOC104422...   109   3e-22
ref|XP_021293825.1| uncharacterized protein LOC110423786 [Herran...   110   4e-22

>gb|PIN09560.1| DNA oxidative demethylase [Handroanthus impetiginosus]
          Length = 479

 Score =  267 bits (683), Expect = 7e-80
 Identities = 153/276 (55%), Positives = 188/276 (68%), Gaps = 13/276 (4%)
 Frame = +2

Query: 827  GRNVVRRSPV-AHPDGASPG--SGSSEIGKINSEISSVDGVTKKFSGLLSSDYLHGPNAV 997
            GR V    PV AH DGAS    SGSSEIG+ NSE ++V G+TKKFSG+ +S   HGP++ 
Sbjct: 31   GRVVGYYKPVGAHVDGASLSHESGSSEIGRRNSETAAVHGLTKKFSGMSTSGCPHGPSSE 90

Query: 998  PHGVGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQ 1177
             HGVGN+SPN G+SK  +SV E  +QTPK LDLPSV  PG++N F SL THSE+NATTIQ
Sbjct: 91   AHGVGNISPNVGSSKQEQSVPE--DQTPKSLDLPSVASPGYDNDFPSLSTHSEVNATTIQ 148

Query: 1178 EHSEAE--KSWDQDEQN--------QTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKS 1327
                 +  +S D +E+N        + +D G  DG+ PQ  Q  K   E T G  EYE S
Sbjct: 149  VTMTIQMTQSPDVEEENCSVSHKNKENIDFGFQDGRSPQVVQTEKHVFEGTSGNIEYENS 208

Query: 1328 DFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYI 1507
                 GFSF ICEE +  + KLK+ L VKN+A+R+E KR+ +G  I+  R GMILLK Y+
Sbjct: 209  GLHHKGFSFDICEETNSKVVKLKSSLFVKNKAIRNEAKRRMEGNKIRIQRPGMILLKDYL 268

Query: 1508 SLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615
            SLKDQVK+IK+CRDLGRG GGFYQPGYRDGA +HLK
Sbjct: 269  SLKDQVKVIKTCRDLGRGPGGFYQPGYRDGAMMHLK 304



 Score =  116 bits (290), Expect = 4e-24
 Identities = 63/106 (59%), Positives = 75/106 (70%), Gaps = 2/106 (1%)
 Frame = +2

Query: 443 PVVAHPDGASPG--SGYSEIGRINSELLSVDGVTKKFNGLLSSDYPNGPNAEPHGVGNMS 616
           PV AH DGAS    SG SEIGR NSE  +V G+TKKF+G+ +S  P+GP++E HGVGN+S
Sbjct: 39  PVGAHVDGASLSHESGSSEIGRRNSETAAVHGLTKKFSGMSTSGCPHGPSSEAHGVGNIS 98

Query: 617 PNNGNPKVVDSVSEAENQTPKPLDLPSVVGPGYSYGFPSFPTHSEL 754
           PN G+ K   SV   E+QTPK LDLPSV  PGY   FPS  THSE+
Sbjct: 99  PNVGSSKQEQSV--PEDQTPKSLDLPSVASPGYDNDFPSLSTHSEV 142



 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 61/124 (49%), Positives = 72/124 (58%), Gaps = 19/124 (15%)
 Frame = +2

Query: 35  GWRS-SPRPTGHYVVRRGPVVAHPDGASPG--SGSSEIGRINSEIS-------------- 163
           G+R  SPR  G  V    PV AH DGAS    SGSSEIGR NSE +              
Sbjct: 21  GYRGRSPRAAGRVVGYYKPVGAHVDGASLSHESGSSEIGRRNSETAAVHGLTKKFSGMST 80

Query: 164 SGNPYGPSAKPSGV--VNTNNGNPKVEESVSEAENQTPKPLDLPSVVGPGYGYGFSSLPT 337
           SG P+GPS++  GV  ++ N G+ K E+SV E  +QTPK LDLPSV  PGY   F SL T
Sbjct: 81  SGCPHGPSSEAHGVGNISPNVGSSKQEQSVPE--DQTPKSLDLPSVASPGYDNDFPSLST 138

Query: 338 HSEL 349
           HSE+
Sbjct: 139 HSEV 142


>ref|XP_011080840.1| uncharacterized protein LOC105164002 isoform X1 [Sesamum indicum]
 ref|XP_020550346.1| uncharacterized protein LOC105164002 isoform X1 [Sesamum indicum]
          Length = 535

 Score =  250 bits (638), Expect = 1e-72
 Identities = 153/306 (50%), Positives = 181/306 (59%), Gaps = 54/306 (17%)
 Frame = +2

Query: 860  HPDGASP--GSGSSEIGKINSEISSVDGVTKKFSGLLSSDYLHGPNAVPHGVGNMSPNNG 1033
            H D AS   GSGSS  G+ NSEI+S+D +TKKF G L SDY +GP+A P G G ++ + G
Sbjct: 56   HSDDASSAHGSGSSTNGRRNSEIASLDSITKKFDGALKSDYAYGPSAEPLGAG-IASDAG 114

Query: 1034 NSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQE----------- 1180
             SKL ++V  AE+Q PKPL+LP V  P ++N F SL  HSELNA TIQ            
Sbjct: 115  ISKLVQNVPIAEDQMPKPLELPFVTRPCYDNDFPSLSAHSELNARTIQMIQAQDVKEDNC 174

Query: 1181 ----------HSEAEKS-----------WDQD--------------------EQNQTMDC 1237
                      H + E S           + QD                    E +QT   
Sbjct: 175  TTYTPLVDMLHKKGEPSQTTGPGSQVGRFSQDVKEDTCTTYSPLVDMLHKKGEPSQTTGP 234

Query: 1238 GLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKN 1417
                G+  QE QRGK T+E    KGEYE SDFQ  GFSF ICEER +++ KLK+PL VKN
Sbjct: 235  DAQVGRFSQEVQRGKSTTEGIIEKGEYENSDFQHKGFSFDICEERSRSVVKLKSPLHVKN 294

Query: 1418 RAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDG 1597
            RAMR+E KR   G NI+  R GMIL+KGY+SL DQVKLI SCRDLGRG GGFYQPGY DG
Sbjct: 295  RAMRNERKRHMVGDNIKIFRPGMILIKGYLSLMDQVKLIMSCRDLGRGPGGFYQPGYGDG 354

Query: 1598 AKLHLK 1615
            AKLHLK
Sbjct: 355  AKLHLK 360



 Score = 97.1 bits (240), Expect = 1e-17
 Identities = 55/105 (52%), Positives = 67/105 (63%), Gaps = 2/105 (1%)
 Frame = +2

Query: 446 VVAHPDGASP--GSGYSEIGRINSELLSVDGVTKKFNGLLSSDYPNGPNAEPHGVGNMSP 619
           V  H D AS   GSG S  GR NSE+ S+D +TKKF+G L SDY  GP+AEP G G ++ 
Sbjct: 53  VADHSDDASSAHGSGSSTNGRRNSEIASLDSITKKFDGALKSDYAYGPSAEPLGAG-IAS 111

Query: 620 NNGNPKVVDSVSEAENQTPKPLDLPSVVGPGYSYGFPSFPTHSEL 754
           + G  K+V +V  AE+Q PKPL+LP V  P Y   FPS   HSEL
Sbjct: 112 DAGISKLVQNVPIAEDQMPKPLELPFVTRPCYDNDFPSLSAHSEL 156



 Score = 65.9 bits (159), Expect = 1e-07
 Identities = 46/104 (44%), Positives = 57/104 (54%), Gaps = 17/104 (16%)
 Frame = +2

Query: 89  VVAHPDGASP--GSGSSEIGRINSEISSGNP--------------YGPSAKPSGV-VNTN 217
           V  H D AS   GSGSS  GR NSEI+S +               YGPSA+P G  + ++
Sbjct: 53  VADHSDDASSAHGSGSSTNGRRNSEIASLDSITKKFDGALKSDYAYGPSAEPLGAGIASD 112

Query: 218 NGNPKVEESVSEAENQTPKPLDLPSVVGPGYGYGFSSLPTHSEL 349
            G  K+ ++V  AE+Q PKPL+LP V  P Y   F SL  HSEL
Sbjct: 113 AGISKLVQNVPIAEDQMPKPLELPFVTRPCYDNDFPSLSAHSEL 156


>ref|XP_012856810.1| PREDICTED: uncharacterized protein LOC105976070 [Erythranthe guttata]
 gb|EYU21292.1| hypothetical protein MIMGU_mgv1a006814mg [Erythranthe guttata]
          Length = 430

 Score =  233 bits (593), Expect = 4e-67
 Identities = 131/262 (50%), Positives = 165/262 (62%), Gaps = 4/262 (1%)
 Frame = +2

Query: 842  RRSPV-AHPDGASPG--SGSSEIGKINSEISSVDGVTKKFSGLLSSDYLHGPNAVPHGVG 1012
            R SP+ AH + +SPG  SGSS     NSE+++VD VTKKFS + SSDY  GPN+ P GV 
Sbjct: 32   RHSPIGAHSNESSPGNGSGSSRSETRNSEVAAVDNVTKKFSDMSSSDYQQGPNSEPRGVK 91

Query: 1013 NMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHSEA 1192
              SPN+GN +  ++++                 PG +N F SL T S             
Sbjct: 92   TASPNDGNPRRVQNIA-----------------PGFDNDFPSLSTESA------------ 122

Query: 1193 EKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEER 1372
                      QT+DC  ++G++P+E + GK TS+ T GK  +E SD QQ G SF ICE+R
Sbjct: 123  ----------QTIDCSFMNGKLPKEVESGKSTSDGTSGKSGFENSDSQQKGCSFDICEQR 172

Query: 1373 DKNIPKLKTPLLVKNRAMRSEMKRQAQGV-NIQTCRSGMILLKGYISLKDQVKLIKSCRD 1549
            D+N+ KLKTPL VKN+A R+EMKR+ +G  NIQ  R GMILLK Y+S+ DQVKLIK+CRD
Sbjct: 173  DRNVVKLKTPLHVKNKAARNEMKRRTEGYNNIQNLRPGMILLKNYLSVSDQVKLIKACRD 232

Query: 1550 LGRGHGGFYQPGYRDGAKLHLK 1615
            LGRG GGFYQPGY DGAKL LK
Sbjct: 233  LGRGCGGFYQPGYSDGAKLQLK 254



 Score = 82.4 bits (202), Expect = 5e-13
 Identities = 47/110 (42%), Positives = 62/110 (56%), Gaps = 2/110 (1%)
 Frame = +2

Query: 425 NAVRRSPVVAHPDGASPG--SGYSEIGRINSELLSVDGVTKKFNGLLSSDYPNGPNAEPH 598
           +A R SP+ AH + +SPG  SG S     NSE+ +VD VTKKF+ + SSDY  GPN+EP 
Sbjct: 29  SADRHSPIGAHSNESSPGNGSGSSRSETRNSEVAAVDNVTKKFSDMSSSDYQQGPNSEPR 88

Query: 599 GVGNMSPNNGNPKVVDSVSEAENQTPKPLDLPSVVGPGYSYGFPSFPTHS 748
           GV   SPN+GNP+ V +++                 PG+   FPS  T S
Sbjct: 89  GVKTASPNDGNPRRVQNIA-----------------PGFDNDFPSLSTES 121


>ref|XP_020550347.1| uncharacterized protein LOC105164002 isoform X2 [Sesamum indicum]
          Length = 407

 Score =  186 bits (472), Expect = 8e-50
 Identities = 114/231 (49%), Positives = 130/231 (56%), Gaps = 52/231 (22%)
 Frame = +2

Query: 1079 PKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQE---------------------HSEAE 1195
            PKPL+LP V  P ++N F SL  HSELNA TIQ                      H + E
Sbjct: 2    PKPLELPFVTRPCYDNDFPSLSAHSELNARTIQMIQAQDVKEDNCTTYTPLVDMLHKKGE 61

Query: 1196 KS-----------WDQD--------------------EQNQTMDCGLLDGQVPQEFQRGK 1282
             S           + QD                    E +QT       G+  QE QRGK
Sbjct: 62   PSQTTGPGSQVGRFSQDVKEDTCTTYSPLVDMLHKKGEPSQTTGPDAQVGRFSQEVQRGK 121

Query: 1283 CTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVN 1462
             T+E    KGEYE SDFQ  GFSF ICEER +++ KLK+PL VKNRAMR+E KR   G N
Sbjct: 122  STTEGIIEKGEYENSDFQHKGFSFDICEERSRSVVKLKSPLHVKNRAMRNERKRHMVGDN 181

Query: 1463 IQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615
            I+  R GMIL+KGY+SL DQVKLI SCRDLGRG GGFYQPGY DGAKLHLK
Sbjct: 182  IKIFRPGMILIKGYLSLMDQVKLIMSCRDLGRGPGGFYQPGYGDGAKLHLK 232


>ref|XP_022889009.1| uncharacterized protein LOC111404405 [Olea europaea var. sylvestris]
          Length = 395

 Score =  137 bits (344), Expect = 7e-32
 Identities = 100/277 (36%), Positives = 140/277 (50%), Gaps = 13/277 (4%)
 Frame = +2

Query: 824  TGRNVV---RRSPVAHPDGASPGSG---SSEIGKINSEISSVDGVTKKFSGLLSSDYLHG 985
            +GR V+   R++P   P G S  S    S+E  K+    + ++ V+ K+    S+     
Sbjct: 44   SGRGVLQYRRKNPETTPVGVSCPSNIISSAENLKLECNTTKLEYVSPKYQESASTP---- 99

Query: 986  PNAVPHGVGNMSPNNGNSKLA-------ESVSEAENQTPKPLDLPSVVGPGHNNVFSSLP 1144
            P+ +     N+      +KL        ESVS   N   +      + G G  +   SLP
Sbjct: 100  PSNIISSAENLKLECNTTKLEYVSPKYQESVSTPPNMNERKTHQEQLRGAGKESF--SLP 157

Query: 1145 THSELNATTIQEHSEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEK 1324
                L  T+  + S  +K   Q   + + +         QE Q     +E T G G    
Sbjct: 158  ---HLGGTSPVDASYTKKGPSQSSVSDSKE-----SHFGQETQSRTSATENTVGDGRPND 209

Query: 1325 SDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGY 1504
            SD  Q G+SF IC E+  +  KL+ PLLVKNR  ++EMKR+ +G NI+  R G++LLK Y
Sbjct: 210  SDSLQKGYSFDICVEKIGSFVKLQPPLLVKNREKKNEMKRRTEGENIKVLRPGVVLLKCY 269

Query: 1505 ISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615
            + L DQVKL+K CRDLG G GGFYQPGYRDGA+L LK
Sbjct: 270  LPLMDQVKLVKMCRDLGLGSGGFYQPGYRDGAQLRLK 306


>ref|XP_019196582.1| PREDICTED: uncharacterized protein LOC109190540 [Ipomoea nil]
          Length = 545

 Score =  122 bits (305), Expect = 8e-26
 Identities = 85/266 (31%), Positives = 128/266 (48%), Gaps = 10/266 (3%)
 Frame = +2

Query: 848  SPVAHPDGASPGSGSSEIGKINSEISS---VDGVTKKFSGLLSSDYLHGPNAVPHG---- 1006
            +P  H  G +P   + E      E S     +G++KKF+ L   D  +  +  P      
Sbjct: 133  NPYRHRGGFTPSGHNVERRSSERERSGDTLANGISKKFASLSPLDNPYRNDDSPQSKFSC 192

Query: 1007 VGNMSPNNGNSKLAESVSEAEN---QTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQ 1177
            VGN      NS + +  + + +   Q   P   P+V+                       
Sbjct: 193  VGNPMQVKHNSSIKQPFTSSASGFKQKDSPWSCPAVISC--------------------- 231

Query: 1178 EHSEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFY 1357
              S   K++ ++E    ++ G    ++ +E  + + +S+     G+  K+        F 
Sbjct: 232  --SPVAKTFLKNESIHAVNSGGFGKRLSEEINQSEQSSKEEANNGDKSKN----LDVGFD 285

Query: 1358 ICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIK 1537
            IC+ER  N+ KLKTPL VKN+  R+E+KR  +  NI+    GM+LLK +ISL DQVK++ 
Sbjct: 286  ICQERAGNLIKLKTPLHVKNKEKRNEIKRSMEVQNIKILCDGMVLLKSFISLLDQVKIVN 345

Query: 1538 SCRDLGRGHGGFYQPGYRDGAKLHLK 1615
            +CR LG G GGFYQPGY DGAKLHLK
Sbjct: 346  TCRKLGIGPGGFYQPGYNDGAKLHLK 371


>ref|XP_018839226.1| PREDICTED: uncharacterized protein LOC109004968 [Juglans regia]
          Length = 470

 Score =  119 bits (298), Expect = 3e-25
 Identities = 73/154 (47%), Positives = 85/154 (55%), Gaps = 3/154 (1%)
 Frame = +2

Query: 1163 ATTIQEHSEAEK---SWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDF 1333
            A  I+ H E  K   S  Q E          DG  P +F   KC+S    G    E S+ 
Sbjct: 148  ADGIKSHDELSKLRISGQQSESQLPYKSAKKDGPSPMKFP--KCSS----GCDNSEYSEH 201

Query: 1334 QQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISL 1513
                 +F IC  R      LK PLLVKNR  R+E+KR  +G      RSGM+LLK +IS 
Sbjct: 202  SAALHAFDICPPRASTSVVLKPPLLVKNRDRRNEIKRSMEGQTGTVLRSGMVLLKSHISS 261

Query: 1514 KDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615
             DQVK++K CRDLG G GGFYQPGYRDGAKLHLK
Sbjct: 262  SDQVKIVKICRDLGLGPGGFYQPGYRDGAKLHLK 295


>gb|EOY27334.1| 2-oxoglutarate-dependent dioxygenase family protein, putative isoform
            2 [Theobroma cacao]
          Length = 378

 Score =  112 bits (279), Expect = 3e-23
 Identities = 62/125 (49%), Positives = 79/125 (63%)
 Frame = +2

Query: 1241 LLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNR 1420
            L D   P E  + K + + + G G+   ++ Q     F IC  +      LK  LLVKNR
Sbjct: 163  LQDESEPSESSQ-KMSPQNSAGFGDSVHTECQVVVDPFDICLSKAGTPVMLKPSLLVKNR 221

Query: 1421 AMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGA 1600
              R+E+KR  +G N    RSGM+LLK Y+SL DQVK++K+CR+LG G GGFYQPGYRDGA
Sbjct: 222  EKRNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDGA 281

Query: 1601 KLHLK 1615
            KLHLK
Sbjct: 282  KLHLK 286


>ref|XP_011036495.1| PREDICTED: uncharacterized protein LOC105133990 isoform X5 [Populus
            euphratica]
          Length = 501

 Score =  113 bits (283), Expect = 4e-23
 Identities = 78/211 (36%), Positives = 108/211 (51%)
 Frame = +2

Query: 983  GPNAVPHGVGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELN 1162
            G N     V + S N G+S L  ++S + +     +  P   G     V  SL  +S + 
Sbjct: 134  GSNQSDCSVASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI- 192

Query: 1163 ATTIQEHSEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQY 1342
               +Q  +E+  S+            + +GQ+ QE       S   G  G  E+    + 
Sbjct: 193  --PLQNQNESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE- 237

Query: 1343 GFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQ 1522
               F IC  +     KLK  LLVKNR  R++++R A GVN Q  RSGM+LLK Y+SL DQ
Sbjct: 238  --PFDICLPKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQ 295

Query: 1523 VKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615
            +K+IK CRD+G G GGFYQP YRDG ++HLK
Sbjct: 296  IKIIKLCRDIGLGPGGFYQPVYRDGGRMHLK 326


>gb|PON43568.1| Alkylated DNA repair protein AlkB [Parasponia andersonii]
          Length = 433

 Score =  112 bits (280), Expect = 5e-23
 Identities = 89/263 (33%), Positives = 124/263 (47%), Gaps = 6/263 (2%)
 Frame = +2

Query: 842  RRSPVAHPDGASPGSGSSEIGKI---NSEISSVDGVTKKFSGL---LSSDYLHGPNAVPH 1003
            RR   + P G+S G     +  +   + +ISS DG     SG+   ++S+  H  N+ P 
Sbjct: 26   RRHHNSEPRGSSSGGKDRFVYAVKIKDGQISS-DGKIGASSGVKSSIASELAHEENSTPF 84

Query: 1004 GVGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEH 1183
               N S    +S + E  S+      K  D         +++   L +   LN       
Sbjct: 85   SAANCS---ASSHMIEERSQIAQTPTKFTD---------DDMKLKLDSSINLNI------ 126

Query: 1184 SEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYIC 1363
                 S    E + T  CG   G  P + Q+   +   +      E ++       F IC
Sbjct: 127  -----SCQDVEISLTTKCGEKGGPSPLKGQKTPASDRKS------ENTEASSAFAPFDIC 175

Query: 1364 EERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSC 1543
              +  ++ KLK PLL KNR  R+E KR  +G N    R GM++LK +ISL DQVK++K C
Sbjct: 176  PTKAGSV-KLKPPLLAKNRERRNETKRVMEGPNGSVIRPGMVILKSHISLSDQVKVVKQC 234

Query: 1544 RDLGRGHGGFYQPGYRDGAKLHL 1612
            RDLG G GGFYQPGYRDGAKLHL
Sbjct: 235  RDLGVGPGGFYQPGYRDGAKLHL 257


>gb|EOY27333.1| 2-oxoglutarate-dependent dioxygenase family protein, putative isoform
            1 [Theobroma cacao]
          Length = 461

 Score =  112 bits (279), Expect = 9e-23
 Identities = 62/125 (49%), Positives = 79/125 (63%)
 Frame = +2

Query: 1241 LLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNR 1420
            L D   P E  + K + + + G G+   ++ Q     F IC  +      LK  LLVKNR
Sbjct: 163  LQDESEPSESSQ-KMSPQNSAGFGDSVHTECQVVVDPFDICLSKAGTPVMLKPSLLVKNR 221

Query: 1421 AMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGA 1600
              R+E+KR  +G N    RSGM+LLK Y+SL DQVK++K+CR+LG G GGFYQPGYRDGA
Sbjct: 222  EKRNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDGA 281

Query: 1601 KLHLK 1615
            KLHLK
Sbjct: 282  KLHLK 286


>ref|XP_011036496.1| PREDICTED: uncharacterized protein LOC105133990 isoform X6 [Populus
            euphratica]
          Length = 499

 Score =  112 bits (279), Expect = 1e-22
 Identities = 76/203 (37%), Positives = 106/203 (52%)
 Frame = +2

Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186
            V + S N G+S L  ++S + +     +  P   G     V  SL  +S +    +Q  +
Sbjct: 140  VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 196

Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366
            E+  S+            + +GQ+ QE       S   G  G  E+    +    F IC 
Sbjct: 197  ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 241

Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546
             +     KLK  LLVKNR  R++++R A GVN Q  RSGM+LLK Y+SL DQ+K+IK CR
Sbjct: 242  PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 301

Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615
            D+G G GGFYQP YRDG ++HLK
Sbjct: 302  DIGLGPGGFYQPVYRDGGRMHLK 324


>gb|OMO70560.1| Oxoglutarate/iron-dependent dioxygenase [Corchorus capsularis]
          Length = 396

 Score =  110 bits (275), Expect = 2e-22
 Identities = 78/225 (34%), Positives = 107/225 (47%), Gaps = 25/225 (11%)
 Frame = +2

Query: 1016 MSPNNGNSKLAESVSEAENQTPKPLD----------LPSVVGPGHNNV---FSSL----- 1141
            MSP +G+      VS      PKPL           LP   G  +  +   F SL     
Sbjct: 1    MSPASGSRYSKHDVSPVYEYRPKPLPDGSGIGKQNHLPEATGTSNTVLKDDFPSLSCQSG 60

Query: 1142 -------PTHSELNATTIQEHSEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVT 1300
                   P  ++     ++E +E+  S    + ++ ++      +     +R    S  T
Sbjct: 61   YKGPWPDPRRTQFEPLRVEEETESCASLLHHDLSRKVNISYSVDEYKPTQERSPQISTST 120

Query: 1301 GGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRS 1480
            G   +    D Q     F IC  +   +  LK  LLVKNR  R+EMKR  +G +    RS
Sbjct: 121  GDSVD----DLQAVIKPFDICPVKTGTLVMLKPSLLVKNREKRNEMKRSMEGESGIVLRS 176

Query: 1481 GMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615
            GM+LLK Y+SL DQVK+ K+CR+LG   GGFYQPGYRDGAKL+LK
Sbjct: 177  GMVLLKNYLSLSDQVKIAKTCRELGLASGGFYQPGYRDGAKLNLK 221


>ref|XP_007024711.2| PREDICTED: uncharacterized protein LOC18596274 [Theobroma cacao]
          Length = 461

 Score =  111 bits (277), Expect = 2e-22
 Identities = 61/123 (49%), Positives = 78/123 (63%)
 Frame = +2

Query: 1247 DGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAM 1426
            D   P E  + K + + + G G+   ++ Q     F IC  +      LK  LLVKNR  
Sbjct: 165  DESEPSESSQ-KMSPQNSAGFGDSVHTECQVVVDPFDICLSKAGTPVMLKPSLLVKNREK 223

Query: 1427 RSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKL 1606
            R+E+KR  +G N    RSGM+LLK Y+SL DQVK++K+CR+LG G GGFYQPGYRDGAKL
Sbjct: 224  RNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDGAKL 283

Query: 1607 HLK 1615
            HLK
Sbjct: 284  HLK 286


>ref|XP_011036494.1| PREDICTED: uncharacterized protein LOC105133990 isoform X4 [Populus
            euphratica]
          Length = 595

 Score =  112 bits (279), Expect = 2e-22
 Identities = 76/203 (37%), Positives = 106/203 (52%)
 Frame = +2

Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186
            V + S N G+S L  ++S + +     +  P   G     V  SL  +S +    +Q  +
Sbjct: 236  VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 292

Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366
            E+  S+            + +GQ+ QE       S   G  G  E+    +    F IC 
Sbjct: 293  ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 337

Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546
             +     KLK  LLVKNR  R++++R A GVN Q  RSGM+LLK Y+SL DQ+K+IK CR
Sbjct: 338  PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 397

Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615
            D+G G GGFYQP YRDG ++HLK
Sbjct: 398  DIGLGPGGFYQPVYRDGGRMHLK 420


>ref|XP_011036493.1| PREDICTED: uncharacterized protein LOC105133990 isoform X3 [Populus
            euphratica]
          Length = 599

 Score =  112 bits (279), Expect = 2e-22
 Identities = 76/203 (37%), Positives = 106/203 (52%)
 Frame = +2

Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186
            V + S N G+S L  ++S + +     +  P   G     V  SL  +S +    +Q  +
Sbjct: 240  VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 296

Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366
            E+  S+            + +GQ+ QE       S   G  G  E+    +    F IC 
Sbjct: 297  ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 341

Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546
             +     KLK  LLVKNR  R++++R A GVN Q  RSGM+LLK Y+SL DQ+K+IK CR
Sbjct: 342  PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 401

Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615
            D+G G GGFYQP YRDG ++HLK
Sbjct: 402  DIGLGPGGFYQPVYRDGGRMHLK 424


>ref|XP_011036492.1| PREDICTED: uncharacterized protein LOC105133990 isoform X2 [Populus
            euphratica]
          Length = 603

 Score =  112 bits (279), Expect = 2e-22
 Identities = 76/203 (37%), Positives = 106/203 (52%)
 Frame = +2

Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186
            V + S N G+S L  ++S + +     +  P   G     V  SL  +S +    +Q  +
Sbjct: 244  VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 300

Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366
            E+  S+            + +GQ+ QE       S   G  G  E+    +    F IC 
Sbjct: 301  ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 345

Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546
             +     KLK  LLVKNR  R++++R A GVN Q  RSGM+LLK Y+SL DQ+K+IK CR
Sbjct: 346  PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 405

Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615
            D+G G GGFYQP YRDG ++HLK
Sbjct: 406  DIGLGPGGFYQPVYRDGGRMHLK 428


>ref|XP_011036491.1| PREDICTED: uncharacterized protein LOC105133990 isoform X1 [Populus
            euphratica]
          Length = 613

 Score =  112 bits (279), Expect = 2e-22
 Identities = 76/203 (37%), Positives = 106/203 (52%)
 Frame = +2

Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186
            V + S N G+S L  ++S + +     +  P   G     V  SL  +S +    +Q  +
Sbjct: 254  VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 310

Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366
            E+  S+            + +GQ+ QE       S   G  G  E+    +    F IC 
Sbjct: 311  ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 355

Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546
             +     KLK  LLVKNR  R++++R A GVN Q  RSGM+LLK Y+SL DQ+K+IK CR
Sbjct: 356  PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 415

Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615
            D+G G GGFYQP YRDG ++HLK
Sbjct: 416  DIGLGPGGFYQPVYRDGGRMHLK 438


>ref|XP_010033676.1| PREDICTED: uncharacterized protein LOC104422912 isoform X1
            [Eucalyptus grandis]
          Length = 372

 Score =  109 bits (272), Expect = 3e-22
 Identities = 74/177 (41%), Positives = 88/177 (49%), Gaps = 6/177 (3%)
 Frame = +2

Query: 1103 VVGPGHNNVFSSLPTHSELNA----TTIQEHSEAEKSWDQDEQNQTMDCGLLDGQVPQEF 1270
            V GP H     S P   ELN       I E    E   D    N+       + Q P   
Sbjct: 25   VGGPAHRR---SSPADPELNLGMNRIQIDETHSKETENDSLSPNKNSYFPPSELQQPSHS 81

Query: 1271 QRGKC--TSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKR 1444
              GK   T  + G +      +F   G  F IC  +      LK  LLVKNR  R+E KR
Sbjct: 82   NSGKSEETKSIAGFEDSVPAEEFTGVG-RFDICVPQVGTPVMLKPSLLVKNREKRNEEKR 140

Query: 1445 QAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615
              +  N +  R GM+LLK Y+S+ DQVK++K CRDLG G GGFYQPGYRDGAKLHLK
Sbjct: 141  SLEEHNWRILRPGMVLLKSYLSVGDQVKIVKLCRDLGLGAGGFYQPGYRDGAKLHLK 197


>ref|XP_021293825.1| uncharacterized protein LOC110423786 [Herrania umbratica]
          Length = 461

 Score =  110 bits (274), Expect = 4e-22
 Identities = 62/125 (49%), Positives = 78/125 (62%)
 Frame = +2

Query: 1241 LLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNR 1420
            L D   P E  R K + + + G G+   ++ Q     F IC  +      LK  LLVKNR
Sbjct: 163  LQDESEPSESYR-KMSPQNSAGFGDSVHTECQVVVEPFDICLSKAGTPVMLKPSLLVKNR 221

Query: 1421 AMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGA 1600
              R+E+KR  +G N    RSGM+LLK Y+SL DQVK++K+CR+LG G GGFYQPGYRDGA
Sbjct: 222  EKRNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDGA 281

Query: 1601 KLHLK 1615
            KL LK
Sbjct: 282  KLQLK 286


Top