BLASTX nr result

ID: Astragalus23_contig00014825 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00014825
         (1332 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003543632.1| PREDICTED: probable prolyl 4-hydroxylase 7 [...   522   0.0  
ref|XP_003554232.1| PREDICTED: probable prolyl 4-hydroxylase 7 [...   520   0.0  
gb|ACU19077.1| unknown [Glycine max]                                  519   0.0  
ref|XP_020236148.1| probable prolyl 4-hydroxylase 7 [Cajanus cajan]   517   0.0  
ref|XP_004489361.1| PREDICTED: probable prolyl 4-hydroxylase 7 [...   515   e-180
ref|XP_014512275.1| probable prolyl 4-hydroxylase 6 [Vigna radia...   514   e-180
ref|XP_007151176.1| hypothetical protein PHAVU_004G024300g [Phas...   513   e-179
ref|XP_017437484.1| PREDICTED: probable prolyl 4-hydroxylase 7 [...   513   e-179
gb|KYP46194.1| Prolyl 4-hydroxylase subunit alpha-2 [Cajanus cajan]   504   e-175
ref|XP_020988167.1| probable prolyl 4-hydroxylase 6 [Arachis dur...   499   e-174
ref|XP_019442620.1| PREDICTED: probable prolyl 4-hydroxylase 7 [...   495   e-172
ref|XP_003618430.1| prolyl 4-hydroxylase alpha-like protein [Med...   488   e-170
gb|KOM56749.1| hypothetical protein LR48_Vigan10g264100 [Vigna a...   473   e-164
gb|PON54831.1| Isopenicillin N synthase [Trema orientalis]            471   e-163
gb|PON59436.1| Isopenicillin N synthase [Parasponia andersonii]       470   e-163
ref|XP_002281420.1| PREDICTED: probable prolyl 4-hydroxylase 6 [...   470   e-162
ref|XP_012489568.1| PREDICTED: probable prolyl 4-hydroxylase 7 [...   464   e-160
ref|XP_021808801.1| probable prolyl 4-hydroxylase 7 [Prunus avium]    464   e-160
ref|XP_007215695.1| probable prolyl 4-hydroxylase 7 [Prunus pers...   459   e-158
ref|XP_016695213.1| PREDICTED: probable prolyl 4-hydroxylase 7 [...   458   e-158

>ref|XP_003543632.1| PREDICTED: probable prolyl 4-hydroxylase 7 [Glycine max]
 gb|KHN13254.1| Prolyl 4-hydroxylase subunit alpha-1 [Glycine soja]
 gb|KRH18535.1| hypothetical protein GLYMA_13G066500 [Glycine max]
          Length = 318

 Score =  522 bits (1344), Expect = 0.0
 Identities = 247/298 (82%), Positives = 267/298 (89%), Gaps = 1/298 (0%)
 Frame = -3

Query: 1252 SSIRLPGSEND-KPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SSIRLPG + D K THGSV   NRG S VKFDPTR  QLSW PRAFLYK FLS+EECDHL
Sbjct: 21   SSIRLPGLDQDAKATHGSVLRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHL 80

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            + LAK KLEKSMVADNESGKSI SEVRTSSGMFLN+AQDE+V  IE RIA WTFLPIENG
Sbjct: 81   ITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENG 140

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ES+QILHYENGQKYEPHFDYFHDKANQV+GGHRIATVLMYLS+++KGGETIFPN+  KL 
Sbjct: 141  ESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLL 200

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            QPKD+SWSECAH+GYAVKP KGDALLFFSLHL+A+TD KSLHGSCPVIEGEKWSATKWIH
Sbjct: 201  QPKDESWSECAHKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIH 260

Query: 535  VSDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            VSDF+KP  ++D+GDCVDENENC +WA VGECEKNPLYMVG EGVKG CMKSCNVCSS
Sbjct: 261  VSDFQKPIKQVDSGDCVDENENCPRWAKVGECEKNPLYMVGGEGVKGSCMKSCNVCSS 318


>ref|XP_003554232.1| PREDICTED: probable prolyl 4-hydroxylase 7 [Glycine max]
 gb|KHN34187.1| Prolyl 4-hydroxylase subunit alpha-2 [Glycine soja]
 gb|KRG93463.1| hypothetical protein GLYMA_19G018100 [Glycine max]
          Length = 319

 Score =  520 bits (1339), Expect = 0.0
 Identities = 241/298 (80%), Positives = 270/298 (90%), Gaps = 1/298 (0%)
 Frame = -3

Query: 1252 SSIRLPGSEND-KPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SS+RLPG + D K THGSV   NRG S VKFDPTR  QLSW PRAFLYK FLSEEECDHL
Sbjct: 22   SSLRLPGLDQDAKATHGSVLRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHL 81

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            + LAK KLEKSMVADN+SGKSI S++RTSSGMFLN+AQDE+V  IE RIA WTFLP+ENG
Sbjct: 82   IVLAKDKLEKSMVADNDSGKSIMSDIRTSSGMFLNKAQDEIVAGIEARIAAWTFLPVENG 141

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ES+QILHYENGQKYEPHFDYFHDKANQV+GGHRIATVLMYLS+++KGGETIFPN++ KL 
Sbjct: 142  ESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLL 201

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            QPKD+SWSECAH+GYAVKP+KGDALLFFSLHL+A+TD KSLHGSCPVIEGEKWSATKWIH
Sbjct: 202  QPKDESWSECAHKGYAVKPQKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIH 261

Query: 535  VSDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            VSDFEKP+ ++DNG+CVDENENC +WA VGEC+KNPLYMVG EGV+G CMKSCNVC+S
Sbjct: 262  VSDFEKPFKQVDNGECVDENENCPRWAKVGECDKNPLYMVGGEGVRGSCMKSCNVCTS 319


>gb|ACU19077.1| unknown [Glycine max]
          Length = 318

 Score =  519 bits (1336), Expect = 0.0
 Identities = 246/298 (82%), Positives = 266/298 (89%), Gaps = 1/298 (0%)
 Frame = -3

Query: 1252 SSIRLPGSEND-KPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SSIRLPG + D K THGSV   NRG S VKFDPTR  QLSW PRAFLYK FLS+EECDHL
Sbjct: 21   SSIRLPGLDQDAKATHGSVLRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHL 80

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            + LAK KLEKSMVADNESGKSI SEVRTSSGMFLN+AQDE+V  IE RIA WTFLPIENG
Sbjct: 81   ITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENG 140

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ES+QILHYENGQKYEPHFDYFHDKANQV+GGHRIATVLMYLS+++KGGETIF N+  KL 
Sbjct: 141  ESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFSNAKAKLL 200

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            QPKD+SWSECAH+GYAVKP KGDALLFFSLHL+A+TD KSLHGSCPVIEGEKWSATKWIH
Sbjct: 201  QPKDESWSECAHKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIH 260

Query: 535  VSDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            VSDF+KP  ++D+GDCVDENENC +WA VGECEKNPLYMVG EGVKG CMKSCNVCSS
Sbjct: 261  VSDFQKPIKQVDSGDCVDENENCPRWAKVGECEKNPLYMVGGEGVKGSCMKSCNVCSS 318


>ref|XP_020236148.1| probable prolyl 4-hydroxylase 7 [Cajanus cajan]
          Length = 324

 Score =  517 bits (1332), Expect = 0.0
 Identities = 241/298 (80%), Positives = 267/298 (89%), Gaps = 1/298 (0%)
 Frame = -3

Query: 1252 SSIRLPGSEND-KPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SSIRLPG + + K THGSV    RG S VKFDPTR  QLSW PRAFLYK FLSE+ECDHL
Sbjct: 27   SSIRLPGLDQESKATHGSVLRLKRGVSSVKFDPTRVTQLSWSPRAFLYKGFLSEKECDHL 86

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            +NLAK KLEKSMVADNESGKSI SEVRTSSGMFLN+AQDE+V DIE RI+ WTFLP+ENG
Sbjct: 87   INLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLNKAQDEIVTDIEARISAWTFLPVENG 146

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ES+Q+LHYENG+KYEPHFDYFHDKANQVLGGHR+ATVLMYLSN++KGGETIFPNS+ KL 
Sbjct: 147  ESMQVLHYENGEKYEPHFDYFHDKANQVLGGHRVATVLMYLSNVEKGGETIFPNSEAKLL 206

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            QPKD+SWSECAH GYAVKP+KGDALLFFSLHL+ATTD KSLHGSCPVIEGEKWSATKWIH
Sbjct: 207  QPKDESWSECAHNGYAVKPQKGDALLFFSLHLDATTDTKSLHGSCPVIEGEKWSATKWIH 266

Query: 535  VSDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            VSDFEKP  ++ +G CVDEN+NC +WA +GECEKNPLYMVG +GV+G CMKSCNVCSS
Sbjct: 267  VSDFEKPVRQVGSGGCVDENDNCPRWAKIGECEKNPLYMVGGDGVRGNCMKSCNVCSS 324


>ref|XP_004489361.1| PREDICTED: probable prolyl 4-hydroxylase 7 [Cicer arietinum]
          Length = 321

 Score =  515 bits (1326), Expect = e-180
 Identities = 242/299 (80%), Positives = 267/299 (89%), Gaps = 2/299 (0%)
 Frame = -3

Query: 1252 SSIRLPGSENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLV 1073
            SSIRLPG   DK T GSV GA +  SKVKFDPTR  QLSW PRAFLYK FLS+EECDHL 
Sbjct: 23   SSIRLPGLVEDKTTGGSVVGATKHGSKVKFDPTRVTQLSWSPRAFLYKNFLSDEECDHLK 82

Query: 1072 NLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGE 893
             LAK KLEKSMVADNESGKSI S+VRTSSGMFL +AQDE+V  IE RIA WTFLPIENGE
Sbjct: 83   ILAKDKLEKSMVADNESGKSIESDVRTSSGMFLGKAQDEIVSGIEDRIAAWTFLPIENGE 142

Query: 892  SIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQ 713
            SIQ+LHY++G+KYEPHFD+FHDKANQ+LGGHRIATVLMYLSN++KGGETIFPN++G LSQ
Sbjct: 143  SIQVLHYQHGEKYEPHFDFFHDKANQILGGHRIATVLMYLSNVEKGGETIFPNAEGTLSQ 202

Query: 712  PKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHV 533
            PKD+SWSECAH GYAVKP+KGDALLFFSLHLNATTD  SLHGSCPVIEGEKWSATKWIHV
Sbjct: 203  PKDESWSECAHNGYAVKPQKGDALLFFSLHLNATTDANSLHGSCPVIEGEKWSATKWIHV 262

Query: 532  SDFEKPYGKID--NGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            +DFEKP G++D   G+CVDEN+NCA+WA VGEC+KNPLYM+G EGVKGKCMKSCNVCSS
Sbjct: 263  ADFEKPVGRLDVEEGECVDENDNCARWAKVGECDKNPLYMIGREGVKGKCMKSCNVCSS 321


>ref|XP_014512275.1| probable prolyl 4-hydroxylase 6 [Vigna radiata var. radiata]
          Length = 319

 Score =  514 bits (1325), Expect = e-180
 Identities = 242/299 (80%), Positives = 267/299 (89%), Gaps = 2/299 (0%)
 Frame = -3

Query: 1252 SSIRLPGSEND-KPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SSIRLPG + + K THGS  G N+G   VKFDPTR  QLSW PRAFLYK FL EEECDHL
Sbjct: 21   SSIRLPGVDQEAKATHGSGLGLNKGVCSVKFDPTRVTQLSWNPRAFLYKGFLKEEECDHL 80

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            ++LAK KLEKSMVADNESGKSI SEVRTSSGMFLN+AQDE V DIE R++ WTFLP+ENG
Sbjct: 81   ISLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLNKAQDETVADIECRVSAWTFLPVENG 140

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ES+Q+LHYENG+KYEPHFDYFHDKANQ++GGHRIATVLMYLSN++KGGETIFPNS+GKL 
Sbjct: 141  ESMQVLHYENGEKYEPHFDYFHDKANQIMGGHRIATVLMYLSNVEKGGETIFPNSEGKLL 200

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            QPKDD+WSECAH+GYAVKP KGDALLFFSL+L+ATTD  SLHGSCPVIEGEKWSATKWIH
Sbjct: 201  QPKDDTWSECAHKGYAVKPRKGDALLFFSLNLDATTDTNSLHGSCPVIEGEKWSATKWIH 260

Query: 535  VSDFEKPYGKID-NGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            VSDFEKP   ++ +GDCVDENENC +WA VGECEKNPLYMVG EGVKGKCMKSCNVCSS
Sbjct: 261  VSDFEKPVRSLESSGDCVDENENCYRWAKVGECEKNPLYMVGGEGVKGKCMKSCNVCSS 319


>ref|XP_007151176.1| hypothetical protein PHAVU_004G024300g [Phaseolus vulgaris]
 gb|ESW23170.1| hypothetical protein PHAVU_004G024300g [Phaseolus vulgaris]
          Length = 318

 Score =  513 bits (1321), Expect = e-179
 Identities = 239/298 (80%), Positives = 265/298 (88%), Gaps = 1/298 (0%)
 Frame = -3

Query: 1252 SSIRLPGSEND-KPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SSIRLPG + + K THGSV     G S VKFDPTR  QLSW PRAFLYK FLSEEECDHL
Sbjct: 21   SSIRLPGLDQEAKSTHGSVLRMKTGVSSVKFDPTRVTQLSWNPRAFLYKGFLSEEECDHL 80

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            + LAK KLE SMVADNESGKS+ SEVRTSSGMFLN+AQD++V DIE RI+ WTFLPIENG
Sbjct: 81   ITLAKDKLEISMVADNESGKSVMSEVRTSSGMFLNKAQDKIVADIEARISAWTFLPIENG 140

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ES+Q+LHYENGQKYEPHFDYFHDKANQ++GGHR+ATVLMYLSN+ KGGETIFPNS+ KL 
Sbjct: 141  ESMQVLHYENGQKYEPHFDYFHDKANQIMGGHRVATVLMYLSNVGKGGETIFPNSEAKLL 200

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            QPKDD+WSECAH+GYAVKPEKGDALLFFSLHL+ATTD  SLHGSCPVIEGEKWSATKWIH
Sbjct: 201  QPKDDTWSECAHKGYAVKPEKGDALLFFSLHLDATTDANSLHGSCPVIEGEKWSATKWIH 260

Query: 535  VSDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            VSDFEKP   ++ GDCVD+NENC++WA +GECEKNPLYMVG+ GV+GKCMKSCNVCSS
Sbjct: 261  VSDFEKPVISVEGGDCVDDNENCSRWAKIGECEKNPLYMVGSAGVRGKCMKSCNVCSS 318


>ref|XP_017437484.1| PREDICTED: probable prolyl 4-hydroxylase 7 [Vigna angularis]
 dbj|BAU01150.1| hypothetical protein VIGAN_11031700 [Vigna angularis var. angularis]
          Length = 319

 Score =  513 bits (1320), Expect = e-179
 Identities = 242/299 (80%), Positives = 265/299 (88%), Gaps = 2/299 (0%)
 Frame = -3

Query: 1252 SSIRLPG-SENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SSIRLPG  E  K THGS  G N+G   VKFDPTR  QLSW PRAFLYK FL EEECDH+
Sbjct: 21   SSIRLPGVDEEAKATHGSGLGLNKGVCSVKFDPTRVTQLSWNPRAFLYKGFLKEEECDHV 80

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            + LAK KLEKSMVADNESGKSI SEVRTSSGMFLN+AQDE V DIE R++ WTFLP+ENG
Sbjct: 81   IALAKDKLEKSMVADNESGKSIMSEVRTSSGMFLNKAQDETVADIECRVSAWTFLPVENG 140

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ES+Q+LHYENG+KYEPHFDYFHDKANQ++GGHRIATVLMYLSN++KGGETIFPNS+GKL 
Sbjct: 141  ESMQVLHYENGEKYEPHFDYFHDKANQIMGGHRIATVLMYLSNVEKGGETIFPNSEGKLF 200

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            QPKDD+WSECAH+GYAVKP KGDALLFFSL+L+ATTD  SLHGSCPVIEGEKWSATKWIH
Sbjct: 201  QPKDDTWSECAHKGYAVKPRKGDALLFFSLNLDATTDSNSLHGSCPVIEGEKWSATKWIH 260

Query: 535  VSDFEKPYGKIDN-GDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            VSDFEKP   +++ GDCVDENENC +WA VGECEKNPLYMVG EGVKGKCMKSCNVCSS
Sbjct: 261  VSDFEKPVRSLESTGDCVDENENCYRWAKVGECEKNPLYMVGGEGVKGKCMKSCNVCSS 319


>gb|KYP46194.1| Prolyl 4-hydroxylase subunit alpha-2 [Cajanus cajan]
          Length = 373

 Score =  504 bits (1299), Expect = e-175
 Identities = 232/283 (81%), Positives = 256/283 (90%)
 Frame = -3

Query: 1210 HGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLVNLAKGKLEKSMVAD 1031
            HGSV    RG S VKFDPTR  QLSW PRAFLYK FLSE+ECDHL+NLAK KLEKSMVAD
Sbjct: 91   HGSVLRLKRGVSSVKFDPTRVTQLSWSPRAFLYKGFLSEKECDHLINLAKDKLEKSMVAD 150

Query: 1030 NESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGESIQILHYENGQKYE 851
            NESGKSI SEVRTSSGMFLN+AQDE+V DIE RI+ WTFLP+ENGES+Q+LHYENG+KYE
Sbjct: 151  NESGKSIMSEVRTSSGMFLNKAQDEIVTDIEARISAWTFLPVENGESMQVLHYENGEKYE 210

Query: 850  PHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQPKDDSWSECAHRGY 671
            PHFDYFHDKANQVLGGHR+ATVLMYLSN++KGGETIFPNS+ KL QPKD+SWSECAH GY
Sbjct: 211  PHFDYFHDKANQVLGGHRVATVLMYLSNVEKGGETIFPNSEAKLLQPKDESWSECAHNGY 270

Query: 670  AVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHVSDFEKPYGKIDNGD 491
            AVKP+KGDALLFFSLHL+ATTD KSLHGSCPVIEGEKWSATKWIHVSDFEKP  ++ +G 
Sbjct: 271  AVKPQKGDALLFFSLHLDATTDTKSLHGSCPVIEGEKWSATKWIHVSDFEKPVRQVGSGG 330

Query: 490  CVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            CVDEN+NC +WA +GECEKNPLYMVG +GV+G CMKSCNVCSS
Sbjct: 331  CVDENDNCPRWAKIGECEKNPLYMVGGDGVRGNCMKSCNVCSS 373


>ref|XP_020988167.1| probable prolyl 4-hydroxylase 6 [Arachis duranensis]
          Length = 320

 Score =  499 bits (1285), Expect = e-174
 Identities = 238/297 (80%), Positives = 262/297 (88%), Gaps = 2/297 (0%)
 Frame = -3

Query: 1252 SSIRLPGSENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLV 1073
            SSIRL  S+  K THGS+   NRG S V FDPTR IQLSW+PRAF+YKKFL++EECDHL+
Sbjct: 25   SSIRLHSSDATKTTHGSLLRLNRGGSSV-FDPTRVIQLSWQPRAFIYKKFLTDEECDHLI 83

Query: 1072 NLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGE 893
             LAK KLEKS+VADNESGKSI SEVRTSSGMFL + QDEVV  IE RIA WTFLPIENGE
Sbjct: 84   TLAKDKLEKSVVADNESGKSIESEVRTSSGMFLGKGQDEVVAAIEARIAAWTFLPIENGE 143

Query: 892  SIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQ 713
            SIQILHYENGQKYEPHFDYFHDKANQ++GGHRIATVLMYLSN++KGGETIFPN++ K SQ
Sbjct: 144  SIQILHYENGQKYEPHFDYFHDKANQIMGGHRIATVLMYLSNVEKGGETIFPNAEAKESQ 203

Query: 712  PKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHV 533
            PKD+SWSECAH+GYAVKPEKGDALLFFSLHL+ATTD +SLHGSCPVIEGEKWSATKWIHV
Sbjct: 204  PKDESWSECAHKGYAVKPEKGDALLFFSLHLDATTDSRSLHGSCPVIEGEKWSATKWIHV 263

Query: 532  SDFEKPYGKIDN--GDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVC 368
            +DFEKP  K ++  GDC D NENC++WA VGECEKNPLYMVGN  VKG CMKSCNVC
Sbjct: 264  ADFEKPIKKFESGGGDCADLNENCSRWARVGECEKNPLYMVGNGDVKGYCMKSCNVC 320


>ref|XP_019442620.1| PREDICTED: probable prolyl 4-hydroxylase 7 [Lupinus angustifolius]
 gb|OIW12430.1| hypothetical protein TanjilG_04179 [Lupinus angustifolius]
          Length = 318

 Score =  495 bits (1275), Expect = e-172
 Identities = 239/301 (79%), Positives = 265/301 (88%), Gaps = 4/301 (1%)
 Frame = -3

Query: 1252 SSIRLPGSENDK---PTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECD 1082
            SSIRLP + +D+    T GSV        KVKFDPTR  QLSW PRAFL+K FLS+EECD
Sbjct: 22   SSIRLPTTTDDEHKTTTDGSVLRLK----KVKFDPTRVTQLSWEPRAFLHKGFLSDEECD 77

Query: 1081 HLVNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIE 902
            HL+ LAK KLEKSMVADNESGKSIASEVRTSSGMFL++AQDE+V +IE RIA WTFLPIE
Sbjct: 78   HLIVLAKDKLEKSMVADNESGKSIASEVRTSSGMFLSKAQDEIVSNIEARIATWTFLPIE 137

Query: 901  NGESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGK 722
            NGES+Q+LHYE+GQKYEPHFDYFHDKANQV+GGHR+ATVLMYLSN++KGGETIFPNS+GK
Sbjct: 138  NGESMQVLHYEHGQKYEPHFDYFHDKANQVMGGHRVATVLMYLSNVEKGGETIFPNSEGK 197

Query: 721  LSQPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKW 542
            LSQPKDD+WSECAH+GYAVKPEKGDALLFFSLHL+ATTD KSLHGSCPVIEGEKWSATKW
Sbjct: 198  LSQPKDDTWSECAHKGYAVKPEKGDALLFFSLHLDATTDTKSLHGSCPVIEGEKWSATKW 257

Query: 541  IHVSDFEKPY-GKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCS 365
            IHVSDFEKP   ++D+  C DENENC KWA +GECEKNPLYMVGN  VKG CMKSCNVC+
Sbjct: 258  IHVSDFEKPIKRRVDSSGCSDENENCPKWANIGECEKNPLYMVGNGEVKGYCMKSCNVCT 317

Query: 364  S 362
            S
Sbjct: 318  S 318


>ref|XP_003618430.1| prolyl 4-hydroxylase alpha-like protein [Medicago truncatula]
 gb|ACJ85356.1| unknown [Medicago truncatula]
 gb|AES74648.1| prolyl 4-hydroxylase alpha-like protein [Medicago truncatula]
 gb|AFK35284.1| unknown [Medicago truncatula]
          Length = 313

 Score =  488 bits (1256), Expect = e-170
 Identities = 236/302 (78%), Positives = 264/302 (87%), Gaps = 2/302 (0%)
 Frame = -3

Query: 1261 SVSSSIRLPG-SENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEEC 1085
            S  SSIRLPG  E +K T GSVFGA     KVKFDPTR  QLSW PRAFLYK FL++EEC
Sbjct: 18   SFVSSIRLPGLEEGNKITRGSVFGA-----KVKFDPTRVTQLSWSPRAFLYKNFLTDEEC 72

Query: 1084 DHLVNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPI 905
            DHL+ L+K KLEKSMVADNESGKSI SEVRTSSGMFLN+ QDE+V  IE RIA WTFLP+
Sbjct: 73   DHLIELSKDKLEKSMVADNESGKSIQSEVRTSSGMFLNKQQDEIVSGIEARIAAWTFLPV 132

Query: 904  ENGESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDG 725
            ENGES+Q+LHY NG+KYEPHFD+FHDKANQ LGGHR+ATVLMYLSN++KGGETIFP+++G
Sbjct: 133  ENGESMQVLHYMNGEKYEPHFDFFHDKANQRLGGHRVATVLMYLSNVEKGGETIFPHAEG 192

Query: 724  KLSQPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATK 545
            KLSQPKD+SWSECAH+GYAVKP KGDALLFFSLHL+ATTD KSLHGSCPVIEGEKWSATK
Sbjct: 193  KLSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDATTDSKSLHGSCPVIEGEKWSATK 252

Query: 544  WIHVSDFEKPYGK-IDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVC 368
            WIHV+DFEKP  + +++  C DENENCA+WA VGECEKNPLYMVG +G  GKCMKSCNVC
Sbjct: 253  WIHVADFEKPVRQALEDRVCADENENCARWAKVGECEKNPLYMVG-KGGNGKCMKSCNVC 311

Query: 367  SS 362
            SS
Sbjct: 312  SS 313


>gb|KOM56749.1| hypothetical protein LR48_Vigan10g264100 [Vigna angularis]
          Length = 312

 Score =  473 bits (1216), Expect = e-164
 Identities = 231/299 (77%), Positives = 251/299 (83%), Gaps = 2/299 (0%)
 Frame = -3

Query: 1252 SSIRLPG-SENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SSIRLPG  E  K THGS  G N+G   VKFDPTR  QLSW PRAFLYK FL EEECDH+
Sbjct: 21   SSIRLPGVDEEAKATHGSGLGLNKGVCSVKFDPTRVTQLSWNPRAFLYKGFLKEEECDHV 80

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            + LAK KLEKSMVADNESGKSI SEVRTSSGMFLN+AQ          I        ENG
Sbjct: 81   IALAKDKLEKSMVADNESGKSIMSEVRTSSGMFLNKAQGWYRQFYSAVIR-------ENG 133

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ES+Q+LHYENG+KYEPHFDYFHDKANQ++GGHRIATVLMYLSN++KGGETIFPNS+GKL 
Sbjct: 134  ESMQVLHYENGEKYEPHFDYFHDKANQIMGGHRIATVLMYLSNVEKGGETIFPNSEGKLF 193

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            QPKDD+WSECAH+GYAVKP KGDALLFFSL+L+ATTD  SLHGSCPVIEGEKWSATKWIH
Sbjct: 194  QPKDDTWSECAHKGYAVKPRKGDALLFFSLNLDATTDSNSLHGSCPVIEGEKWSATKWIH 253

Query: 535  VSDFEKPYGKIDN-GDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            VSDFEKP   +++ GDCVDENENC +WA VGECEKNPLYMVG EGVKGKCMKSCNVCSS
Sbjct: 254  VSDFEKPVRSLESTGDCVDENENCYRWAKVGECEKNPLYMVGGEGVKGKCMKSCNVCSS 312


>gb|PON54831.1| Isopenicillin N synthase [Trema orientalis]
          Length = 320

 Score =  471 bits (1211), Expect = e-163
 Identities = 223/297 (75%), Positives = 250/297 (84%)
 Frame = -3

Query: 1252 SSIRLPGSENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLV 1073
            S++R+P    DK T  S+     G S V FDPTR  QLSWRPRAFLYK FLSEEECDHLV
Sbjct: 24   SAVRVPKWLGDKKTEDSLIRMKTGASSVGFDPTRVTQLSWRPRAFLYKGFLSEEECDHLV 83

Query: 1072 NLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGE 893
             LAK KLEKSMVADNESGKSI SEVRTSSGMFL +AQD+VV DIE RIA WTFLP ENGE
Sbjct: 84   ALAKDKLEKSMVADNESGKSIMSEVRTSSGMFLQKAQDKVVADIEARIAAWTFLPEENGE 143

Query: 892  SIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQ 713
            S+QILHYENG+KYEPHFDYFHDKANQ LGGHR+ATVLMYLSNI+KGGETIFPNS+ K+SQ
Sbjct: 144  SMQILHYENGEKYEPHFDYFHDKANQELGGHRVATVLMYLSNIEKGGETIFPNSEAKMSQ 203

Query: 712  PKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHV 533
            PKDDSWS+CA  GYAVKP KGDALLFFSLH ++TTD  SLHGSCPVIEGEKWSATKWIHV
Sbjct: 204  PKDDSWSDCAKNGYAVKPYKGDALLFFSLHPDSTTDSNSLHGSCPVIEGEKWSATKWIHV 263

Query: 532  SDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
             +F+KP  +  +G+C DEN NC++WA +GEC+KNP+YMVG+E + G C KSCN CSS
Sbjct: 264  RNFDKPVKRSGSGECTDENANCSQWAKIGECKKNPVYMVGSEELPGFCRKSCNACSS 320


>gb|PON59436.1| Isopenicillin N synthase [Parasponia andersonii]
          Length = 320

 Score =  470 bits (1210), Expect = e-163
 Identities = 222/297 (74%), Positives = 250/297 (84%)
 Frame = -3

Query: 1252 SSIRLPGSENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLV 1073
            S++R+P    DK T  S+     G S V FDPTR  QLSWRPRAFLYK FLSEEECDHL+
Sbjct: 24   SAVRVPKWLGDKKTEDSLIRMKTGASSVGFDPTRVTQLSWRPRAFLYKGFLSEEECDHLI 83

Query: 1072 NLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGE 893
             LAK KLEKSMVADNESGKSI SEVRTSSGMFL +AQD+VV DIE RIA WTFLP ENGE
Sbjct: 84   ALAKDKLEKSMVADNESGKSITSEVRTSSGMFLQKAQDKVVADIEARIAAWTFLPEENGE 143

Query: 892  SIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQ 713
            S+QILHYENG+KYEPHFDYFHDKANQ LGGHR+ATVLMYLSNI+KGGETIFPNS+ K+ Q
Sbjct: 144  SMQILHYENGEKYEPHFDYFHDKANQELGGHRVATVLMYLSNIEKGGETIFPNSEAKMLQ 203

Query: 712  PKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHV 533
            PKDDSWS+CA  GYAVKP KGDALLFFSLH ++TTD KSLHGSCPVIEGEKWSATKWIHV
Sbjct: 204  PKDDSWSDCAKNGYAVKPYKGDALLFFSLHPDSTTDSKSLHGSCPVIEGEKWSATKWIHV 263

Query: 532  SDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
             +F+KP  +  +G+C DEN NC++WA +GEC+KNP+YMVG+E + G C KSCN CSS
Sbjct: 264  RNFDKPVKRSGSGECTDENANCSQWAKIGECKKNPVYMVGSEELPGFCRKSCNACSS 320


>ref|XP_002281420.1| PREDICTED: probable prolyl 4-hydroxylase 6 [Vitis vinifera]
 emb|CBI35001.3| unnamed protein product, partial [Vitis vinifera]
          Length = 316

 Score =  470 bits (1209), Expect = e-162
 Identities = 221/297 (74%), Positives = 248/297 (83%)
 Frame = -3

Query: 1252 SSIRLPGSENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLV 1073
            SS++ PG   +K T GSV G         FDPTR  QLSWRPRAFLYK FLSEEECDHL+
Sbjct: 20   SSLQFPGWVGEKKTGGSVLGLKPRGFASGFDPTRVTQLSWRPRAFLYKGFLSEEECDHLI 79

Query: 1072 NLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGE 893
             LAK KLEKSMVADNESGKSI SEVRTSSGMFL +AQDE+V DIE RIA WTFLP+ENGE
Sbjct: 80   TLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLLKAQDEIVADIEARIAAWTFLPVENGE 139

Query: 892  SIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQ 713
            SIQILHYENG+KYEPHFDYFHDK NQ+LGGHRIATVLMYL+ +++GGET+FPNS+G+ SQ
Sbjct: 140  SIQILHYENGEKYEPHFDYFHDKVNQLLGGHRIATVLMYLATVEEGGETVFPNSEGRFSQ 199

Query: 712  PKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHV 533
            PKDDSWS+CA +GYAV P+KGDALLFFSLH +ATTDP SLHGSCPVI GEKWSATKWIHV
Sbjct: 200  PKDDSWSDCAKKGYAVNPKKGDALLFFSLHPDATTDPSSLHGSCPVIAGEKWSATKWIHV 259

Query: 532  SDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
              F+KP  +   G+CVDE+E+C KWA VGECEKNP+YMVG+E   G C KSC VCSS
Sbjct: 260  RSFDKPSKRGAQGECVDEDEHCPKWAAVGECEKNPVYMVGSENSDGFCRKSCGVCSS 316


>ref|XP_012489568.1| PREDICTED: probable prolyl 4-hydroxylase 7 [Gossypium raimondii]
 gb|KJB40817.1| hypothetical protein B456_007G078100 [Gossypium raimondii]
          Length = 307

 Score =  464 bits (1195), Expect = e-160
 Identities = 220/283 (77%), Positives = 237/283 (83%)
 Frame = -3

Query: 1210 HGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLVNLAKGKLEKSMVAD 1031
            +GSV    RG S V FDPTR  QLSW PRAFLYK FLS EECDHL+ LAK KLEKSMVAD
Sbjct: 25   NGSVLEMKRGTSSVPFDPTRVTQLSWHPRAFLYKGFLSSEECDHLITLAKDKLEKSMVAD 84

Query: 1030 NESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGESIQILHYENGQKYE 851
            NESG SI SEVRTSSGMFL +AQDEVV DIE RIA WTFLP+ENGES+QILHYENGQKYE
Sbjct: 85   NESGDSIESEVRTSSGMFLQKAQDEVVADIEARIAAWTFLPVENGESMQILHYENGQKYE 144

Query: 850  PHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQPKDDSWSECAHRGY 671
            PHFDYFHDKANQ LGGHRIATVLMYLS++  GGET+FPN++GKLSQPKDDSWS+CA  GY
Sbjct: 145  PHFDYFHDKANQELGGHRIATVLMYLSDVDSGGETVFPNAEGKLSQPKDDSWSDCAKNGY 204

Query: 670  AVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHVSDFEKPYGKIDNGD 491
            AVKP KGDALLFFSLHL+ATTD  SLHGSCPVI+GEKWSATKWIHV  F+    +  NGD
Sbjct: 205  AVKPRKGDALLFFSLHLDATTDSDSLHGSCPVIKGEKWSATKWIHVRSFDTAKRQSVNGD 264

Query: 490  CVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            CVDENENCA WA  GECEKNP YM+G+E   G C KSC VCSS
Sbjct: 265  CVDENENCATWASAGECEKNPSYMIGSEDYYGYCRKSCKVCSS 307


>ref|XP_021808801.1| probable prolyl 4-hydroxylase 7 [Prunus avium]
          Length = 319

 Score =  464 bits (1194), Expect = e-160
 Identities = 222/298 (74%), Positives = 247/298 (82%)
 Frame = -3

Query: 1255 SSSIRLPGSENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHL 1076
            SS  R+P    +K T GSV    RG S   FDPTR IQLSWRPRAFLYK FLSEEECDHL
Sbjct: 22   SSRSRVPILIEEKKTEGSVLRLRRGASSATFDPTRVIQLSWRPRAFLYKGFLSEEECDHL 81

Query: 1075 VNLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENG 896
            + +AK KLEKSMVADNESGKSI SEVRTSSGMFL +AQDEVV +IE RIA WTFLPIENG
Sbjct: 82   IEIAKDKLEKSMVADNESGKSIESEVRTSSGMFLQKAQDEVVANIEARIAAWTFLPIENG 141

Query: 895  ESIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLS 716
            ESIQILHYE+GQKYEPHFDYFHDKANQ LGGHR+ATVLMYLSN++KGGET+FPN++ ++S
Sbjct: 142  ESIQILHYEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETVFPNTESQMS 201

Query: 715  QPKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIH 536
            Q KD+  S+CA +GY+VKP KGDALLFFSLH +ATTDP SLHGSCPVIEGEKWSATKWIH
Sbjct: 202  QSKDEDASDCAKQGYSVKPYKGDALLFFSLHPDATTDPSSLHGSCPVIEGEKWSATKWIH 261

Query: 535  VSDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            V  FEK      +GDCVD N+NC  WA  GECEKNP YMVG+EG+ G C KSCN+CSS
Sbjct: 262  VRSFEKSIKHAVSGDCVDGNDNCPLWAKAGECEKNPTYMVGSEGLPGFCRKSCNMCSS 319


>ref|XP_007215695.1| probable prolyl 4-hydroxylase 7 [Prunus persica]
 gb|ONI19362.1| hypothetical protein PRUPE_3G274700 [Prunus persica]
          Length = 319

 Score =  459 bits (1181), Expect = e-158
 Identities = 218/297 (73%), Positives = 244/297 (82%)
 Frame = -3

Query: 1252 SSIRLPGSENDKPTHGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLV 1073
            S  R+P    +K T GSV    RG S   FDPTR  QLSW PRAFLYK FLSEEECDHL+
Sbjct: 23   SRSRVPILIEEKKTEGSVLRLRRGASSATFDPTRVTQLSWHPRAFLYKGFLSEEECDHLI 82

Query: 1072 NLAKGKLEKSMVADNESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGE 893
             +AK KLEKSMVADNESGKSI SEVRTSSGMFL ++QDEVV +IE RIA WTFLPIENGE
Sbjct: 83   EIAKNKLEKSMVADNESGKSIESEVRTSSGMFLQKSQDEVVANIEARIAAWTFLPIENGE 142

Query: 892  SIQILHYENGQKYEPHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQ 713
            SIQILHYE+GQKYEPHFDYFHDKANQ LGGHR+ATVLMYLSN++KGGET+FPN++ ++SQ
Sbjct: 143  SIQILHYEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETVFPNTEAQMSQ 202

Query: 712  PKDDSWSECAHRGYAVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHV 533
             KDD  S+CA +GY+VKP KGDALLFFSLH +ATTDP SLHGSCPVIEGEKWSATKWIHV
Sbjct: 203  SKDDDASDCAKQGYSVKPYKGDALLFFSLHPDATTDPSSLHGSCPVIEGEKWSATKWIHV 262

Query: 532  SDFEKPYGKIDNGDCVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
              FEK      +GDC DEN+NC  WA  GECEKNP YMVG++G+ G C KSCN+CSS
Sbjct: 263  RSFEKSLKHAVSGDCADENDNCPLWAKAGECEKNPTYMVGSKGLPGFCRKSCNMCSS 319


>ref|XP_016695213.1| PREDICTED: probable prolyl 4-hydroxylase 7 [Gossypium hirsutum]
          Length = 307

 Score =  458 bits (1179), Expect = e-158
 Identities = 217/283 (76%), Positives = 235/283 (83%)
 Frame = -3

Query: 1210 HGSVFGANRGDSKVKFDPTRAIQLSWRPRAFLYKKFLSEEECDHLVNLAKGKLEKSMVAD 1031
            +GSV    RG S   FDPTR  QLSW PRAFLYK FLS EECDHL+ LAK KLEKSMVAD
Sbjct: 25   NGSVLEMKRGTSSAPFDPTRVTQLSWHPRAFLYKGFLSSEECDHLITLAKDKLEKSMVAD 84

Query: 1030 NESGKSIASEVRTSSGMFLNRAQDEVVFDIETRIAKWTFLPIENGESIQILHYENGQKYE 851
            NESG SI SEVRTSSGMFL +AQDEVV DIE RIA WTFLP ENGES+QI+HYENGQKYE
Sbjct: 85   NESGDSIESEVRTSSGMFLQKAQDEVVADIEARIAAWTFLPAENGESMQIIHYENGQKYE 144

Query: 850  PHFDYFHDKANQVLGGHRIATVLMYLSNIQKGGETIFPNSDGKLSQPKDDSWSECAHRGY 671
            PHFDYFHDKANQ LGGHRIATVLMYLS+++ GGET+FPN++GKLSQPKDDSWS+CA  GY
Sbjct: 145  PHFDYFHDKANQELGGHRIATVLMYLSDVESGGETVFPNAEGKLSQPKDDSWSDCAKNGY 204

Query: 670  AVKPEKGDALLFFSLHLNATTDPKSLHGSCPVIEGEKWSATKWIHVSDFEKPYGKIDNGD 491
            AVKP KGDALLFFSLHL+ATTD  SLHGSCPVI+GEKWSATKWIHV  F+    +  N D
Sbjct: 205  AVKPRKGDALLFFSLHLDATTDSDSLHGSCPVIKGEKWSATKWIHVRSFDTAKRQSVNRD 264

Query: 490  CVDENENCAKWAIVGECEKNPLYMVGNEGVKGKCMKSCNVCSS 362
            CVDENENCA WA  GECEKNP YM+G+E   G C KSC VCSS
Sbjct: 265  CVDENENCATWASAGECEKNPSYMIGSEDYYGYCRKSCKVCSS 307

