BLASTX nr result
ID: Rehmannia32_contig00008694
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia32_contig00008694
         (1615 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|PIN09560.1| DNA oxidative demethylase [Handroanthus impetigin...   267   7e-80
ref|XP_011080840.1| uncharacterized protein LOC105164002 isoform...   250   1e-72
ref|XP_012856810.1| PREDICTED: uncharacterized protein LOC105976...   233   4e-67
ref|XP_020550347.1| uncharacterized protein LOC105164002 isoform...   186   8e-50
ref|XP_022889009.1| uncharacterized protein LOC111404405 [Olea e...   137   7e-32
ref|XP_019196582.1| PREDICTED: uncharacterized protein LOC109190...   122   8e-26
ref|XP_018839226.1| PREDICTED: uncharacterized protein LOC109004...   119   3e-25
gb|EOY27334.1| 2-oxoglutarate-dependent dioxygenase family prote...   112   3e-23
ref|XP_011036495.1| PREDICTED: uncharacterized protein LOC105133...   113   4e-23
gb|PON43568.1| Alkylated DNA repair protein AlkB [Parasponia and...   112   5e-23
gb|EOY27333.1| 2-oxoglutarate-dependent dioxygenase family prote...   112   9e-23
ref|XP_011036496.1| PREDICTED: uncharacterized protein LOC105133...   112   1e-22
gb|OMO70560.1| Oxoglutarate/iron-dependent dioxygenase [Corchoru...   110   2e-22
ref|XP_007024711.2| PREDICTED: uncharacterized protein LOC185962...   111   2e-22
ref|XP_011036494.1| PREDICTED: uncharacterized protein LOC105133...   112   2e-22
ref|XP_011036493.1| PREDICTED: uncharacterized protein LOC105133...   112   2e-22
ref|XP_011036492.1| PREDICTED: uncharacterized protein LOC105133...
112 2e-22 ref|XP_011036491.1| PREDICTED: uncharacterized protein LOC105133... 112 2e-22 ref|XP_010033676.1| PREDICTED: uncharacterized protein LOC104422... 109 3e-22 ref|XP_021293825.1| uncharacterized protein LOC110423786 [Herran... 110 4e-22 >gb|PIN09560.1| DNA oxidative demethylase [Handroanthus impetiginosus] Length = 479 Score = 267 bits (683), Expect = 7e-80 Identities = 153/276 (55%), Positives = 188/276 (68%), Gaps = 13/276 (4%) Frame = +2 Query: 827 GRNVVRRSPV-AHPDGASPG--SGSSEIGKINSEISSVDGVTKKFSGLLSSDYLHGPNAV 997 GR V PV AH DGAS SGSSEIG+ NSE ++V G+TKKFSG+ +S HGP++ Sbjct: 31 GRVVGYYKPVGAHVDGASLSHESGSSEIGRRNSETAAVHGLTKKFSGMSTSGCPHGPSSE 90 Query: 998 PHGVGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQ 1177 HGVGN+SPN G+SK +SV E +QTPK LDLPSV PG++N F SL THSE+NATTIQ Sbjct: 91 AHGVGNISPNVGSSKQEQSVPE--DQTPKSLDLPSVASPGYDNDFPSLSTHSEVNATTIQ 148 Query: 1178 EHSEAE--KSWDQDEQN--------QTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKS 1327 + +S D +E+N + +D G DG+ PQ Q K E T G EYE S Sbjct: 149 VTMTIQMTQSPDVEEENCSVSHKNKENIDFGFQDGRSPQVVQTEKHVFEGTSGNIEYENS 208 Query: 1328 DFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYI 1507 GFSF ICEE + + KLK+ L VKN+A+R+E KR+ +G I+ R GMILLK Y+ Sbjct: 209 GLHHKGFSFDICEETNSKVVKLKSSLFVKNKAIRNEAKRRMEGNKIRIQRPGMILLKDYL 268 Query: 1508 SLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615 SLKDQVK+IK+CRDLGRG GGFYQPGYRDGA +HLK Sbjct: 269 SLKDQVKVIKTCRDLGRGPGGFYQPGYRDGAMMHLK 304 Score = 116 bits (290), Expect = 4e-24 Identities = 63/106 (59%), Positives = 75/106 (70%), Gaps = 2/106 (1%) Frame = +2 Query: 443 PVVAHPDGASPG--SGYSEIGRINSELLSVDGVTKKFNGLLSSDYPNGPNAEPHGVGNMS 616 PV AH DGAS SG SEIGR NSE +V G+TKKF+G+ +S P+GP++E HGVGN+S Sbjct: 39 PVGAHVDGASLSHESGSSEIGRRNSETAAVHGLTKKFSGMSTSGCPHGPSSEAHGVGNIS 98 Query: 617 PNNGNPKVVDSVSEAENQTPKPLDLPSVVGPGYSYGFPSFPTHSEL 754 PN G+ K SV E+QTPK LDLPSV PGY FPS THSE+ Sbjct: 99 PNVGSSKQEQSV--PEDQTPKSLDLPSVASPGYDNDFPSLSTHSEV 142 Score = 87.8 bits (216), Expect = 1e-14 Identities = 61/124 (49%), Positives = 72/124 (58%), Gaps = 
19/124 (15%) Frame = +2 Query: 35 GWRS-SPRPTGHYVVRRGPVVAHPDGASPG--SGSSEIGRINSEIS-------------- 163 G+R SPR G V PV AH DGAS SGSSEIGR NSE + Sbjct: 21 GYRGRSPRAAGRVVGYYKPVGAHVDGASLSHESGSSEIGRRNSETAAVHGLTKKFSGMST 80 Query: 164 SGNPYGPSAKPSGV--VNTNNGNPKVEESVSEAENQTPKPLDLPSVVGPGYGYGFSSLPT 337 SG P+GPS++ GV ++ N G+ K E+SV E +QTPK LDLPSV PGY F SL T Sbjct: 81 SGCPHGPSSEAHGVGNISPNVGSSKQEQSVPE--DQTPKSLDLPSVASPGYDNDFPSLST 138 Query: 338 HSEL 349 HSE+ Sbjct: 139 HSEV 142 >ref|XP_011080840.1| uncharacterized protein LOC105164002 isoform X1 [Sesamum indicum] ref|XP_020550346.1| uncharacterized protein LOC105164002 isoform X1 [Sesamum indicum] Length = 535 Score = 250 bits (638), Expect = 1e-72 Identities = 153/306 (50%), Positives = 181/306 (59%), Gaps = 54/306 (17%) Frame = +2 Query: 860 HPDGASP--GSGSSEIGKINSEISSVDGVTKKFSGLLSSDYLHGPNAVPHGVGNMSPNNG 1033 H D AS GSGSS G+ NSEI+S+D +TKKF G L SDY +GP+A P G G ++ + G Sbjct: 56 HSDDASSAHGSGSSTNGRRNSEIASLDSITKKFDGALKSDYAYGPSAEPLGAG-IASDAG 114 Query: 1034 NSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQE----------- 1180 SKL ++V AE+Q PKPL+LP V P ++N F SL HSELNA TIQ Sbjct: 115 ISKLVQNVPIAEDQMPKPLELPFVTRPCYDNDFPSLSAHSELNARTIQMIQAQDVKEDNC 174 Query: 1181 ----------HSEAEKS-----------WDQD--------------------EQNQTMDC 1237 H + E S + QD E +QT Sbjct: 175 TTYTPLVDMLHKKGEPSQTTGPGSQVGRFSQDVKEDTCTTYSPLVDMLHKKGEPSQTTGP 234 Query: 1238 GLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKN 1417 G+ QE QRGK T+E KGEYE SDFQ GFSF ICEER +++ KLK+PL VKN Sbjct: 235 DAQVGRFSQEVQRGKSTTEGIIEKGEYENSDFQHKGFSFDICEERSRSVVKLKSPLHVKN 294 Query: 1418 RAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDG 1597 RAMR+E KR G NI+ R GMIL+KGY+SL DQVKLI SCRDLGRG GGFYQPGY DG Sbjct: 295 RAMRNERKRHMVGDNIKIFRPGMILIKGYLSLMDQVKLIMSCRDLGRGPGGFYQPGYGDG 354 Query: 1598 AKLHLK 1615 AKLHLK Sbjct: 355 AKLHLK 360 Score = 97.1 bits (240), Expect = 1e-17 Identities = 55/105 (52%), Positives = 67/105 (63%), Gaps = 2/105 (1%) Frame = +2 Query: 446 
VVAHPDGASP--GSGYSEIGRINSELLSVDGVTKKFNGLLSSDYPNGPNAEPHGVGNMSP 619 V H D AS GSG S GR NSE+ S+D +TKKF+G L SDY GP+AEP G G ++ Sbjct: 53 VADHSDDASSAHGSGSSTNGRRNSEIASLDSITKKFDGALKSDYAYGPSAEPLGAG-IAS 111 Query: 620 NNGNPKVVDSVSEAENQTPKPLDLPSVVGPGYSYGFPSFPTHSEL 754 + G K+V +V AE+Q PKPL+LP V P Y FPS HSEL Sbjct: 112 DAGISKLVQNVPIAEDQMPKPLELPFVTRPCYDNDFPSLSAHSEL 156 Score = 65.9 bits (159), Expect = 1e-07 Identities = 46/104 (44%), Positives = 57/104 (54%), Gaps = 17/104 (16%) Frame = +2 Query: 89 VVAHPDGASP--GSGSSEIGRINSEISSGNP--------------YGPSAKPSGV-VNTN 217 V H D AS GSGSS GR NSEI+S + YGPSA+P G + ++ Sbjct: 53 VADHSDDASSAHGSGSSTNGRRNSEIASLDSITKKFDGALKSDYAYGPSAEPLGAGIASD 112 Query: 218 NGNPKVEESVSEAENQTPKPLDLPSVVGPGYGYGFSSLPTHSEL 349 G K+ ++V AE+Q PKPL+LP V P Y F SL HSEL Sbjct: 113 AGISKLVQNVPIAEDQMPKPLELPFVTRPCYDNDFPSLSAHSEL 156 >ref|XP_012856810.1| PREDICTED: uncharacterized protein LOC105976070 [Erythranthe guttata] gb|EYU21292.1| hypothetical protein MIMGU_mgv1a006814mg [Erythranthe guttata] Length = 430 Score = 233 bits (593), Expect = 4e-67 Identities = 131/262 (50%), Positives = 165/262 (62%), Gaps = 4/262 (1%) Frame = +2 Query: 842 RRSPV-AHPDGASPG--SGSSEIGKINSEISSVDGVTKKFSGLLSSDYLHGPNAVPHGVG 1012 R SP+ AH + +SPG SGSS NSE+++VD VTKKFS + SSDY GPN+ P GV Sbjct: 32 RHSPIGAHSNESSPGNGSGSSRSETRNSEVAAVDNVTKKFSDMSSSDYQQGPNSEPRGVK 91 Query: 1013 NMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHSEA 1192 SPN+GN + ++++ PG +N F SL T S Sbjct: 92 TASPNDGNPRRVQNIA-----------------PGFDNDFPSLSTESA------------ 122 Query: 1193 EKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEER 1372 QT+DC ++G++P+E + GK TS+ T GK +E SD QQ G SF ICE+R Sbjct: 123 ----------QTIDCSFMNGKLPKEVESGKSTSDGTSGKSGFENSDSQQKGCSFDICEQR 172 Query: 1373 DKNIPKLKTPLLVKNRAMRSEMKRQAQGV-NIQTCRSGMILLKGYISLKDQVKLIKSCRD 1549 D+N+ KLKTPL VKN+A R+EMKR+ +G NIQ R GMILLK Y+S+ DQVKLIK+CRD Sbjct: 173 DRNVVKLKTPLHVKNKAARNEMKRRTEGYNNIQNLRPGMILLKNYLSVSDQVKLIKACRD 232 Query: 1550 LGRGHGGFYQPGYRDGAKLHLK 1615 LGRG GGFYQPGY DGAKL 
LK Sbjct: 233 LGRGCGGFYQPGYSDGAKLQLK 254 Score = 82.4 bits (202), Expect = 5e-13 Identities = 47/110 (42%), Positives = 62/110 (56%), Gaps = 2/110 (1%) Frame = +2 Query: 425 NAVRRSPVVAHPDGASPG--SGYSEIGRINSELLSVDGVTKKFNGLLSSDYPNGPNAEPH 598 +A R SP+ AH + +SPG SG S NSE+ +VD VTKKF+ + SSDY GPN+EP Sbjct: 29 SADRHSPIGAHSNESSPGNGSGSSRSETRNSEVAAVDNVTKKFSDMSSSDYQQGPNSEPR 88 Query: 599 GVGNMSPNNGNPKVVDSVSEAENQTPKPLDLPSVVGPGYSYGFPSFPTHS 748 GV SPN+GNP+ V +++ PG+ FPS T S Sbjct: 89 GVKTASPNDGNPRRVQNIA-----------------PGFDNDFPSLSTES 121 >ref|XP_020550347.1| uncharacterized protein LOC105164002 isoform X2 [Sesamum indicum] Length = 407 Score = 186 bits (472), Expect = 8e-50 Identities = 114/231 (49%), Positives = 130/231 (56%), Gaps = 52/231 (22%) Frame = +2 Query: 1079 PKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQE---------------------HSEAE 1195 PKPL+LP V P ++N F SL HSELNA TIQ H + E Sbjct: 2 PKPLELPFVTRPCYDNDFPSLSAHSELNARTIQMIQAQDVKEDNCTTYTPLVDMLHKKGE 61 Query: 1196 KS-----------WDQD--------------------EQNQTMDCGLLDGQVPQEFQRGK 1282 S + QD E +QT G+ QE QRGK Sbjct: 62 PSQTTGPGSQVGRFSQDVKEDTCTTYSPLVDMLHKKGEPSQTTGPDAQVGRFSQEVQRGK 121 Query: 1283 CTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVN 1462 T+E KGEYE SDFQ GFSF ICEER +++ KLK+PL VKNRAMR+E KR G N Sbjct: 122 STTEGIIEKGEYENSDFQHKGFSFDICEERSRSVVKLKSPLHVKNRAMRNERKRHMVGDN 181 Query: 1463 IQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615 I+ R GMIL+KGY+SL DQVKLI SCRDLGRG GGFYQPGY DGAKLHLK Sbjct: 182 IKIFRPGMILIKGYLSLMDQVKLIMSCRDLGRGPGGFYQPGYGDGAKLHLK 232 >ref|XP_022889009.1| uncharacterized protein LOC111404405 [Olea europaea var. 
sylvestris] Length = 395 Score = 137 bits (344), Expect = 7e-32 Identities = 100/277 (36%), Positives = 140/277 (50%), Gaps = 13/277 (4%) Frame = +2 Query: 824 TGRNVV---RRSPVAHPDGASPGSG---SSEIGKINSEISSVDGVTKKFSGLLSSDYLHG 985 +GR V+ R++P P G S S S+E K+ + ++ V+ K+ S+ Sbjct: 44 SGRGVLQYRRKNPETTPVGVSCPSNIISSAENLKLECNTTKLEYVSPKYQESASTP---- 99 Query: 986 PNAVPHGVGNMSPNNGNSKLA-------ESVSEAENQTPKPLDLPSVVGPGHNNVFSSLP 1144 P+ + N+ +KL ESVS N + + G G + SLP Sbjct: 100 PSNIISSAENLKLECNTTKLEYVSPKYQESVSTPPNMNERKTHQEQLRGAGKESF--SLP 157 Query: 1145 THSELNATTIQEHSEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEK 1324 L T+ + S +K Q + + + QE Q +E T G G Sbjct: 158 ---HLGGTSPVDASYTKKGPSQSSVSDSKE-----SHFGQETQSRTSATENTVGDGRPND 209 Query: 1325 SDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGY 1504 SD Q G+SF IC E+ + KL+ PLLVKNR ++EMKR+ +G NI+ R G++LLK Y Sbjct: 210 SDSLQKGYSFDICVEKIGSFVKLQPPLLVKNREKKNEMKRRTEGENIKVLRPGVVLLKCY 269 Query: 1505 ISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615 + L DQVKL+K CRDLG G GGFYQPGYRDGA+L LK Sbjct: 270 LPLMDQVKLVKMCRDLGLGSGGFYQPGYRDGAQLRLK 306 >ref|XP_019196582.1| PREDICTED: uncharacterized protein LOC109190540 [Ipomoea nil] Length = 545 Score = 122 bits (305), Expect = 8e-26 Identities = 85/266 (31%), Positives = 128/266 (48%), Gaps = 10/266 (3%) Frame = +2 Query: 848 SPVAHPDGASPGSGSSEIGKINSEISS---VDGVTKKFSGLLSSDYLHGPNAVPHG---- 1006 +P H G +P + E E S +G++KKF+ L D + + P Sbjct: 133 NPYRHRGGFTPSGHNVERRSSERERSGDTLANGISKKFASLSPLDNPYRNDDSPQSKFSC 192 Query: 1007 VGNMSPNNGNSKLAESVSEAEN---QTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQ 1177 VGN NS + + + + + Q P P+V+ Sbjct: 193 VGNPMQVKHNSSIKQPFTSSASGFKQKDSPWSCPAVISC--------------------- 231 Query: 1178 EHSEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFY 1357 S K++ ++E ++ G ++ +E + + +S+ G+ K+ F Sbjct: 232 --SPVAKTFLKNESIHAVNSGGFGKRLSEEINQSEQSSKEEANNGDKSKN----LDVGFD 285 Query: 1358 ICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIK 1537 IC+ER N+ KLKTPL VKN+ R+E+KR + NI+ GM+LLK +ISL DQVK++ Sbjct: 286 
ICQERAGNLIKLKTPLHVKNKEKRNEIKRSMEVQNIKILCDGMVLLKSFISLLDQVKIVN 345 Query: 1538 SCRDLGRGHGGFYQPGYRDGAKLHLK 1615 +CR LG G GGFYQPGY DGAKLHLK Sbjct: 346 TCRKLGIGPGGFYQPGYNDGAKLHLK 371 >ref|XP_018839226.1| PREDICTED: uncharacterized protein LOC109004968 [Juglans regia] Length = 470 Score = 119 bits (298), Expect = 3e-25 Identities = 73/154 (47%), Positives = 85/154 (55%), Gaps = 3/154 (1%) Frame = +2 Query: 1163 ATTIQEHSEAEK---SWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDF 1333 A I+ H E K S Q E DG P +F KC+S G E S+ Sbjct: 148 ADGIKSHDELSKLRISGQQSESQLPYKSAKKDGPSPMKFP--KCSS----GCDNSEYSEH 201 Query: 1334 QQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISL 1513 +F IC R LK PLLVKNR R+E+KR +G RSGM+LLK +IS Sbjct: 202 SAALHAFDICPPRASTSVVLKPPLLVKNRDRRNEIKRSMEGQTGTVLRSGMVLLKSHISS 261 Query: 1514 KDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615 DQVK++K CRDLG G GGFYQPGYRDGAKLHLK Sbjct: 262 SDQVKIVKICRDLGLGPGGFYQPGYRDGAKLHLK 295 >gb|EOY27334.1| 2-oxoglutarate-dependent dioxygenase family protein, putative isoform 2 [Theobroma cacao] Length = 378 Score = 112 bits (279), Expect = 3e-23 Identities = 62/125 (49%), Positives = 79/125 (63%) Frame = +2 Query: 1241 LLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNR 1420 L D P E + K + + + G G+ ++ Q F IC + LK LLVKNR Sbjct: 163 LQDESEPSESSQ-KMSPQNSAGFGDSVHTECQVVVDPFDICLSKAGTPVMLKPSLLVKNR 221 Query: 1421 AMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGA 1600 R+E+KR +G N RSGM+LLK Y+SL DQVK++K+CR+LG G GGFYQPGYRDGA Sbjct: 222 EKRNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDGA 281 Query: 1601 KLHLK 1615 KLHLK Sbjct: 282 KLHLK 286 >ref|XP_011036495.1| PREDICTED: uncharacterized protein LOC105133990 isoform X5 [Populus euphratica] Length = 501 Score = 113 bits (283), Expect = 4e-23 Identities = 78/211 (36%), Positives = 108/211 (51%) Frame = +2 Query: 983 GPNAVPHGVGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELN 1162 G N V + S N G+S L ++S + + + P G V SL +S + Sbjct: 134 
GSNQSDCSVASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI- 192 Query: 1163 ATTIQEHSEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQY 1342 +Q +E+ S+ + +GQ+ QE S G G E+ + Sbjct: 193 --PLQNQNESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE- 237 Query: 1343 GFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQ 1522 F IC + KLK LLVKNR R++++R A GVN Q RSGM+LLK Y+SL DQ Sbjct: 238 --PFDICLPKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQ 295 Query: 1523 VKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615 +K+IK CRD+G G GGFYQP YRDG ++HLK Sbjct: 296 IKIIKLCRDIGLGPGGFYQPVYRDGGRMHLK 326 >gb|PON43568.1| Alkylated DNA repair protein AlkB [Parasponia andersonii] Length = 433 Score = 112 bits (280), Expect = 5e-23 Identities = 89/263 (33%), Positives = 124/263 (47%), Gaps = 6/263 (2%) Frame = +2 Query: 842 RRSPVAHPDGASPGSGSSEIGKI---NSEISSVDGVTKKFSGL---LSSDYLHGPNAVPH 1003 RR + P G+S G + + + +ISS DG SG+ ++S+ H N+ P Sbjct: 26 RRHHNSEPRGSSSGGKDRFVYAVKIKDGQISS-DGKIGASSGVKSSIASELAHEENSTPF 84 Query: 1004 GVGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEH 1183 N S +S + E S+ K D +++ L + LN Sbjct: 85 SAANCS---ASSHMIEERSQIAQTPTKFTD---------DDMKLKLDSSINLNI------ 126 Query: 1184 SEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYIC 1363 S E + T CG G P + Q+ + + E ++ F IC Sbjct: 127 -----SCQDVEISLTTKCGEKGGPSPLKGQKTPASDRKS------ENTEASSAFAPFDIC 175 Query: 1364 EERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSC 1543 + ++ KLK PLL KNR R+E KR +G N R GM++LK +ISL DQVK++K C Sbjct: 176 PTKAGSV-KLKPPLLAKNRERRNETKRVMEGPNGSVIRPGMVILKSHISLSDQVKVVKQC 234 Query: 1544 RDLGRGHGGFYQPGYRDGAKLHL 1612 RDLG G GGFYQPGYRDGAKLHL Sbjct: 235 RDLGVGPGGFYQPGYRDGAKLHL 257 >gb|EOY27333.1| 2-oxoglutarate-dependent dioxygenase family protein, putative isoform 1 [Theobroma cacao] Length = 461 Score = 112 bits (279), Expect = 9e-23 Identities = 62/125 (49%), Positives = 79/125 (63%) Frame = +2 Query: 1241 LLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNR 1420 L D P E + K + + + G G+ ++ Q F IC + 
LK LLVKNR Sbjct: 163 LQDESEPSESSQ-KMSPQNSAGFGDSVHTECQVVVDPFDICLSKAGTPVMLKPSLLVKNR 221 Query: 1421 AMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGA 1600 R+E+KR +G N RSGM+LLK Y+SL DQVK++K+CR+LG G GGFYQPGYRDGA Sbjct: 222 EKRNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDGA 281 Query: 1601 KLHLK 1615 KLHLK Sbjct: 282 KLHLK 286 >ref|XP_011036496.1| PREDICTED: uncharacterized protein LOC105133990 isoform X6 [Populus euphratica] Length = 499 Score = 112 bits (279), Expect = 1e-22 Identities = 76/203 (37%), Positives = 106/203 (52%) Frame = +2 Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186 V + S N G+S L ++S + + + P G V SL +S + +Q + Sbjct: 140 VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 196 Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366 E+ S+ + +GQ+ QE S G G E+ + F IC Sbjct: 197 ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 241 Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546 + KLK LLVKNR R++++R A GVN Q RSGM+LLK Y+SL DQ+K+IK CR Sbjct: 242 PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 301 Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615 D+G G GGFYQP YRDG ++HLK Sbjct: 302 DIGLGPGGFYQPVYRDGGRMHLK 324 >gb|OMO70560.1| Oxoglutarate/iron-dependent dioxygenase [Corchorus capsularis] Length = 396 Score = 110 bits (275), Expect = 2e-22 Identities = 78/225 (34%), Positives = 107/225 (47%), Gaps = 25/225 (11%) Frame = +2 Query: 1016 MSPNNGNSKLAESVSEAENQTPKPLD----------LPSVVGPGHNNV---FSSL----- 1141 MSP +G+ VS PKPL LP G + + F SL Sbjct: 1 MSPASGSRYSKHDVSPVYEYRPKPLPDGSGIGKQNHLPEATGTSNTVLKDDFPSLSCQSG 60 Query: 1142 -------PTHSELNATTIQEHSEAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVT 1300 P ++ ++E +E+ S + ++ ++ + +R S T Sbjct: 61 YKGPWPDPRRTQFEPLRVEEETESCASLLHHDLSRKVNISYSVDEYKPTQERSPQISTST 120 Query: 1301 GGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRS 1480 G + D Q F IC + + LK LLVKNR R+EMKR +G + RS Sbjct: 121 
GDSVD----DLQAVIKPFDICPVKTGTLVMLKPSLLVKNREKRNEMKRSMEGESGIVLRS 176 Query: 1481 GMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615 GM+LLK Y+SL DQVK+ K+CR+LG GGFYQPGYRDGAKL+LK Sbjct: 177 GMVLLKNYLSLSDQVKIAKTCRELGLASGGFYQPGYRDGAKLNLK 221 >ref|XP_007024711.2| PREDICTED: uncharacterized protein LOC18596274 [Theobroma cacao] Length = 461 Score = 111 bits (277), Expect = 2e-22 Identities = 61/123 (49%), Positives = 78/123 (63%) Frame = +2 Query: 1247 DGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAM 1426 D P E + K + + + G G+ ++ Q F IC + LK LLVKNR Sbjct: 165 DESEPSESSQ-KMSPQNSAGFGDSVHTECQVVVDPFDICLSKAGTPVMLKPSLLVKNREK 223 Query: 1427 RSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKL 1606 R+E+KR +G N RSGM+LLK Y+SL DQVK++K+CR+LG G GGFYQPGYRDGAKL Sbjct: 224 RNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDGAKL 283 Query: 1607 HLK 1615 HLK Sbjct: 284 HLK 286 >ref|XP_011036494.1| PREDICTED: uncharacterized protein LOC105133990 isoform X4 [Populus euphratica] Length = 595 Score = 112 bits (279), Expect = 2e-22 Identities = 76/203 (37%), Positives = 106/203 (52%) Frame = +2 Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186 V + S N G+S L ++S + + + P G V SL +S + +Q + Sbjct: 236 VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 292 Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366 E+ S+ + +GQ+ QE S G G E+ + F IC Sbjct: 293 ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 337 Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546 + KLK LLVKNR R++++R A GVN Q RSGM+LLK Y+SL DQ+K+IK CR Sbjct: 338 PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 397 Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615 D+G G GGFYQP YRDG ++HLK Sbjct: 398 DIGLGPGGFYQPVYRDGGRMHLK 420 >ref|XP_011036493.1| PREDICTED: uncharacterized protein LOC105133990 isoform X3 [Populus euphratica] Length = 599 Score = 112 bits (279), Expect = 2e-22 Identities = 76/203 (37%), Positives = 106/203 
(52%) Frame = +2 Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186 V + S N G+S L ++S + + + P G V SL +S + +Q + Sbjct: 240 VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 296 Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366 E+ S+ + +GQ+ QE S G G E+ + F IC Sbjct: 297 ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 341 Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546 + KLK LLVKNR R++++R A GVN Q RSGM+LLK Y+SL DQ+K+IK CR Sbjct: 342 PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 401 Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615 D+G G GGFYQP YRDG ++HLK Sbjct: 402 DIGLGPGGFYQPVYRDGGRMHLK 424 >ref|XP_011036492.1| PREDICTED: uncharacterized protein LOC105133990 isoform X2 [Populus euphratica] Length = 603 Score = 112 bits (279), Expect = 2e-22 Identities = 76/203 (37%), Positives = 106/203 (52%) Frame = +2 Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186 V + S N G+S L ++S + + + P G V SL +S + +Q + Sbjct: 244 VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 300 Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366 E+ S+ + +GQ+ QE S G G E+ + F IC Sbjct: 301 ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 345 Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546 + KLK LLVKNR R++++R A GVN Q RSGM+LLK Y+SL DQ+K+IK CR Sbjct: 346 PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 405 Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615 D+G G GGFYQP YRDG ++HLK Sbjct: 406 DIGLGPGGFYQPVYRDGGRMHLK 428 >ref|XP_011036491.1| PREDICTED: uncharacterized protein LOC105133990 isoform X1 [Populus euphratica] Length = 613 Score = 112 bits (279), Expect = 2e-22 Identities = 76/203 (37%), Positives = 106/203 (52%) Frame = +2 Query: 1007 VGNMSPNNGNSKLAESVSEAENQTPKPLDLPSVVGPGHNNVFSSLPTHSELNATTIQEHS 1186 V + S N G+S L ++S + + + P G V SL +S + +Q + Sbjct: 254 
VASNSVNKGDSALMPAISGSRDMKSDIVSTPLEKGAKDGAVDFSLKFNSTI---PLQNQN 310 Query: 1187 EAEKSWDQDEQNQTMDCGLLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICE 1366 E+ S+ + +GQ+ QE S G G E+ + F IC Sbjct: 311 ESHLSFQ----------AVANGQLSQEKDPNIVVS--AGYSGYSEQRAVVE---PFDICL 355 Query: 1367 ERDKNIPKLKTPLLVKNRAMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCR 1546 + KLK LLVKNR R++++R A GVN Q RSGM+LLK Y+SL DQ+K+IK CR Sbjct: 356 PKTGTTLKLKPSLLVKNREKRNDVRRAAGGVNGQILRSGMVLLKNYLSLHDQIKIIKLCR 415 Query: 1547 DLGRGHGGFYQPGYRDGAKLHLK 1615 D+G G GGFYQP YRDG ++HLK Sbjct: 416 DIGLGPGGFYQPVYRDGGRMHLK 438 >ref|XP_010033676.1| PREDICTED: uncharacterized protein LOC104422912 isoform X1 [Eucalyptus grandis] Length = 372 Score = 109 bits (272), Expect = 3e-22 Identities = 74/177 (41%), Positives = 88/177 (49%), Gaps = 6/177 (3%) Frame = +2 Query: 1103 VVGPGHNNVFSSLPTHSELNA----TTIQEHSEAEKSWDQDEQNQTMDCGLLDGQVPQEF 1270 V GP H S P ELN I E E D N+ + Q P Sbjct: 25 VGGPAHRR---SSPADPELNLGMNRIQIDETHSKETENDSLSPNKNSYFPPSELQQPSHS 81 Query: 1271 QRGKC--TSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNRAMRSEMKR 1444 GK T + G + +F G F IC + LK LLVKNR R+E KR Sbjct: 82 NSGKSEETKSIAGFEDSVPAEEFTGVG-RFDICVPQVGTPVMLKPSLLVKNREKRNEEKR 140 Query: 1445 QAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGAKLHLK 1615 + N + R GM+LLK Y+S+ DQVK++K CRDLG G GGFYQPGYRDGAKLHLK Sbjct: 141 SLEEHNWRILRPGMVLLKSYLSVGDQVKIVKLCRDLGLGAGGFYQPGYRDGAKLHLK 197 >ref|XP_021293825.1| uncharacterized protein LOC110423786 [Herrania umbratica] Length = 461 Score = 110 bits (274), Expect = 4e-22 Identities = 62/125 (49%), Positives = 78/125 (62%) Frame = +2 Query: 1241 LLDGQVPQEFQRGKCTSEVTGGKGEYEKSDFQQYGFSFYICEERDKNIPKLKTPLLVKNR 1420 L D P E R K + + + G G+ ++ Q F IC + LK LLVKNR Sbjct: 163 LQDESEPSESYR-KMSPQNSAGFGDSVHTECQVVVEPFDICLSKAGTPVMLKPSLLVKNR 221 Query: 1421 AMRSEMKRQAQGVNIQTCRSGMILLKGYISLKDQVKLIKSCRDLGRGHGGFYQPGYRDGA 1600 R+E+KR +G N RSGM+LLK Y+SL DQVK++K+CR+LG G GGFYQPGYRDGA Sbjct: 222 EKRNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDGA 281 Query: 1601 KLHLK 
1615 KL LK Sbjct: 282 KLQLK 286