BLASTX nr result

ID: Catharanthus22_contig00003483 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00003483
         (933 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006356692.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   389   e-106
ref|XP_004241065.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   388   e-105
dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]        387   e-105
ref|XP_004229482.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   386   e-105
ref|XP_006365272.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   381   e-103
ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like prot...   377   e-102
gb|EXC05706.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notab...   376   e-102
ref|XP_004296772.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   376   e-102
ref|XP_006419737.1| hypothetical protein CICLE_v10005535mg [Citr...   375   e-101
gb|EOY06341.1| 2-oxoglutarate and Fe(II)-dependent oxygenase sup...   375   e-101
ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative...   374   e-101
ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like prot...   372   e-101
ref|XP_002312720.1| oxidoreductase family protein [Populus trich...   371   e-100
gb|EMJ24403.1| hypothetical protein PRUPE_ppa009336mg [Prunus pe...   370   e-100
ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   369   e-100
ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   369   1e-99
ref|NP_001241485.1| uncharacterized protein LOC100783075 precurs...   367   3e-99
ref|NP_001276206.1| uncharacterized protein LOC100818794 precurs...   367   5e-99
ref|NP_001242363.1| uncharacterized protein LOC100796794 precurs...   366   6e-99
ref|XP_004508327.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   363   7e-98
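
The summary table above can be read programmatically. The following is a minimal sketch, assuming each row is laid out as identifier, free-text description, bit score, and E-value separated by runs of spaces, as in this report. The names `SUMMARY_LINE`, `parse_evalue`, and `parse_summary_line` are illustrative and not part of BLAST. Legacy BLAST prints some E-values without a leading mantissa (`e-106`), so the sketch prepends `1` before converting.

```python
import re

# Assumed row layout: identifier, description, bit score, E-value.
# The description may itself contain digits and spaces, so the two
# numeric columns are anchored to the end of the line.
SUMMARY_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\S+)$")


def parse_evalue(text):
    """Convert an E-value string to float; 'e-106' is read as 1e-106."""
    return float("1" + text if text.startswith("e") else text)


def parse_summary_line(line):
    """Return (identifier, description, bit_score, e_value), or None."""
    match = SUMMARY_LINE.match(line.rstrip())
    if match is None:
        return None
    ident, description, bits, evalue = match.groups()
    return ident, description, int(bits), parse_evalue(evalue)


# Three rows copied from the table above.
sample = [
    "ref|XP_006356692.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   389   e-106",
    "dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]        387   e-105",
    "ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   369   1e-99",
]
hits = [parse_summary_line(line) for line in sample]
```

Descriptions in this table are truncated with `...`, so the full definition line has to be taken from the per-hit records that follow.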

>ref|XP_006356692.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum
           tuberosum]
          Length = 295

 Score =  389 bits (1000), Expect = e-106
 Identities = 189/257 (73%), Positives = 215/257 (83%), Gaps = 7/257 (2%)
 Frame = +3

Query: 183 FLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAV 362
           FL   ++ +++SS SAIINPSK KQISW+PRAFVY GFLTDEEC+HLISLAKSELKRSAV
Sbjct: 12  FLIFIIAFIHESSSSAIINPSKSKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAV 71

Query: 363 ADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQK 521
           ADN SG+SK SEVRTSSGMFI                     PKENGE+IQVLRYE  QK
Sbjct: 72  ADNESGKSKHSEVRTSSGMFISKAKDPIVSGIEDKIATWTFLPKENGEEIQVLRYEEGQK 131

Query: 522 YDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDC 701
           Y+PHYDYF DKVN+ARGGHR ATVLMYL+DVEKGGETVFP AEE+ RRRS+ AD+ LS+C
Sbjct: 132 YEPHYDYFVDKVNIARGGHRFATVLMYLTDVEKGGETVFPKAEESHRRRSMAADDSLSEC 191

Query: 702 AKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGS 881
           AKKG+AVKPRKGDALLF+SLHP+A PDP SLHGGCPV++GEKWSATKWIHVD+FDKT+G+
Sbjct: 192 AKKGIAVKPRKGDALLFYSLHPNATPDPISLHGGCPVLQGEKWSATKWIHVDSFDKTVGT 251

Query: 882 SDSCTDANENCERWAAL 932
             +CTDA+ENCERWAAL
Sbjct: 252 DGNCTDADENCERWAAL 268
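
The Identities / Positives / Gaps line of the hit above can be checked by recomputing each percentage from its fraction. The following is a small sketch with illustrative names (`STATS`, `parse_stats`). It assumes the printed percentage is the truncated integer value, which matches this report (189/257 is about 73.5% and is shown as 73%), though that is an observation from this output rather than a documented guarantee.

```python
import re

# Matches the statistics line printed under each hit header.
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def parse_stats(line):
    """Return counts, alignment length, printed and recomputed percentages."""
    match = STATS.search(line)
    if match is None:
        raise ValueError("not an Identities/Positives/Gaps line")
    numbers = [int(group) for group in match.groups()]
    stats = {}
    for i, name in enumerate(("identities", "positives", "gaps")):
        count, length, shown = numbers[3 * i : 3 * i + 3]
        stats[name] = {
            "count": count,
            "length": length,
            "shown_pct": shown,
            # Integer division truncates, as the report appears to do.
            "floor_pct": 100 * count // length,
        }
    return stats


line = " Identities = 189/257 (73%), Positives = 215/257 (83%), Gaps = 7/257 (2%)"
stats = parse_stats(line)
```
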


>ref|XP_004241065.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum
           lycopersicum]
          Length = 295

 Score =  388 bits (997), Expect = e-105
 Identities = 189/257 (73%), Positives = 215/257 (83%), Gaps = 7/257 (2%)
 Frame = +3

Query: 183 FLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAV 362
           FL   ++  ++S+ SAIINPSK KQISW+PRAFVY GFLTDEEC+HLISLAKSELKRSAV
Sbjct: 12  FLIFIIAFTHESTSSAIINPSKSKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAV 71

Query: 363 ADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQK 521
           ADN SGESK SEVRTSSGMFI                     PKENGE+IQVLRYE  QK
Sbjct: 72  ADNESGESKHSEVRTSSGMFISKAKDPIVSGIEDKIATWTFLPKENGEEIQVLRYEEGQK 131

Query: 522 YDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDC 701
           Y+PHYDYF DKVN+ARGGHR+ATVLMYL+DVEKGGETVFP AEE+ RRRS+ AD+ LS+C
Sbjct: 132 YEPHYDYFVDKVNIARGGHRLATVLMYLTDVEKGGETVFPKAEESHRRRSMAADDSLSEC 191

Query: 702 AKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGS 881
           AKKG+AVKPRKGDALLFFSL+P+A PDP SLHGGCPV++GEKWSATKWIHVD+FDKT+G+
Sbjct: 192 AKKGIAVKPRKGDALLFFSLYPNATPDPISLHGGCPVLQGEKWSATKWIHVDSFDKTVGT 251

Query: 882 SDSCTDANENCERWAAL 932
             +CTDA+ENCERWAAL
Sbjct: 252 DGNCTDADENCERWAAL 268
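
In the alignments above, Query coordinates count nucleotides of the 933-letter contig, while Sbjct coordinates count amino acids. The sketch below, with illustrative function names, shows how the reported frame and the row lengths follow from those coordinates for a forward-strand hit. It assumes gap characters (`-`) consume no query nucleotides and that `X` characters (presumably masked low-complexity residues) still count as translated residues.

```python
def forward_frame(query_start):
    """Reading frame (1, 2 or 3) of a forward-strand hit from its 1-based start."""
    return (query_start - 1) % 3 + 1


def codons_spanned(query_start, query_end):
    """Number of codons covered by an inclusive nucleotide range."""
    span = query_end - query_start + 1
    if span % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return span // 3


def residues_in_row(aligned_query):
    """Translated residues in one alignment row, ignoring gap characters."""
    return len(aligned_query.replace("-", ""))


# Second row of the alignment above: Query 363 ... 521.
row = "ADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQK"

frame = forward_frame(183)            # first row starts at nucleotide 183
codons = codons_spanned(363, 521)     # nucleotides covered by the second row
residues = residues_in_row(row)       # residues printed in the second row
```

The 60-column row holds 53 residues plus 7 gap columns, which agrees with the 159 nucleotides between positions 363 and 521 and with the "Gaps = 7/257" figure in the hit header.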


>dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score =  387 bits (994), Expect = e-105
 Identities = 188/257 (73%), Positives = 213/257 (82%), Gaps = 7/257 (2%)
 Frame = +3

Query: 183 FLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAV 362
           FL   ++ V +SS SAIINPSK KQISW+PRAFVY GFLTDEEC+HLISLAKSELKRSAV
Sbjct: 11  FLLFIIAFVRESSSSAIINPSKAKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAV 70

Query: 363 ADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQK 521
           ADN SG SK SEVRTSSGMFIP                    PKENGE+IQVLRYE  QK
Sbjct: 71  ADNESGNSKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQVLRYEEGQK 130

Query: 522 YDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDC 701
           Y+PHYDYF DKVN+ARGGHR+ATVLMYL++VEKGGETVFP AEE+ RRRS+IAD+ LS+C
Sbjct: 131 YEPHYDYFVDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSEC 190

Query: 702 AKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGS 881
           AKKG+ VKPRKGDALLF+SLHP+A PDP SLHGGCPVI+GEKWSATKWIHVD+FDKT+ +
Sbjct: 191 AKKGIPVKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWSATKWIHVDSFDKTVDT 250

Query: 882 SDSCTDANENCERWAAL 932
             +C+D +ENCERWAAL
Sbjct: 251 EGNCSDRDENCERWAAL 267


>ref|XP_004229482.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum
           lycopersicum]
          Length = 295

 Score =  386 bits (991), Expect = e-105
 Identities = 194/273 (71%), Positives = 217/273 (79%), Gaps = 8/273 (2%)
 Frame = +3

Query: 138 LKLFSSMIKFWQLILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECD 317
           + +FS +  F     F  + +  V  SS SAIINPSKVKQISW+PRAFVY GFLTDEEC+
Sbjct: 1   MNIFSQIFTF-----FFFLIVVFVTKSSCSAIINPSKVKQISWKPRAFVYEGFLTDEECN 55

Query: 318 HLISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKE 476
           HLISLAK ELKRSAVADN SGESKLSEVRTSSGMFI                     PKE
Sbjct: 56  HLISLAKKELKRSAVADNESGESKLSEVRTSSGMFISKAKDPIVTGIEEKIATWTFLPKE 115

Query: 477 NGEDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEET 656
           NGEDIQVLRYE  Q+Y+PHYDYFTDKVN+ RGGHR+ATVLMYLSDVEKGGETVFP AE +
Sbjct: 116 NGEDIQVLRYEEGQRYEPHYDYFTDKVNIVRGGHRLATVLMYLSDVEKGGETVFPEAEVS 175

Query: 657 SRRRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSA 836
           +RRRS+ AD+ LS+CAK+G+AVKPRKGDALLFFSLHP+A+PDP SLHGGCPV+EGEKWSA
Sbjct: 176 TRRRSMAADDSLSECAKRGIAVKPRKGDALLFFSLHPNAVPDPMSLHGGCPVMEGEKWSA 235

Query: 837 TKWIHVDNFDKTLGSSDS-CTDANENCERWAAL 932
           TKWIHVD+FDKT+ S    C D NENCERWAAL
Sbjct: 236 TKWIHVDSFDKTVDSEGGHCADHNENCERWAAL 268


>ref|XP_006365272.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum
           tuberosum]
          Length = 295

 Score =  381 bits (979), Expect = e-103
 Identities = 192/265 (72%), Positives = 213/265 (80%), Gaps = 9/265 (3%)
 Frame = +3

Query: 165 FWQLILFLSIALSI-VNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKS 341
           F Q+  FL   +++ V  SS SAIINPSKVKQISW+PRAFVY GFLTDEEC+HL+SLAK 
Sbjct: 4   FSQIFTFLFFLIAVFVTKSSCSAIINPSKVKQISWKPRAFVYEGFLTDEECNHLVSLAKK 63

Query: 342 ELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVL 500
           ELKRSAVADN SGESKLSEVRTSSGMFI                     P ENGEDIQVL
Sbjct: 64  ELKRSAVADNDSGESKLSEVRTSSGMFISKAKDPIVTGIEEKIATWTFLPTENGEDIQVL 123

Query: 501 RYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIA 680
           RYE  Q+Y+PH+DYFTDKVN+ RGGHR+ATVLMYLSDVEKGGET FP AE ++RRRS+ A
Sbjct: 124 RYEEGQRYEPHHDYFTDKVNIVRGGHRLATVLMYLSDVEKGGETAFPEAEVSTRRRSMAA 183

Query: 681 DEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDN 860
           D  LS+CAKKG+AVKPRKGDALLFFSLHP+A+PDP SLHGGCPVIEGEKWSATKWIHVD+
Sbjct: 184 DNSLSECAKKGIAVKPRKGDALLFFSLHPNAVPDPMSLHGGCPVIEGEKWSATKWIHVDS 243

Query: 861 FDKTLGSSDS-CTDANENCERWAAL 932
           FDKT+ S    C D NENCERWAAL
Sbjct: 244 FDKTVESEGGHCADHNENCERWAAL 268


>ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula] gi|355483100|gb|AES64303.1| Prolyl
           4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 301

 Score =  377 bits (969), Expect = e-102
 Identities = 186/260 (71%), Positives = 211/260 (81%), Gaps = 7/260 (2%)
 Frame = +3

Query: 174 LILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKR 353
           L+L   I  ++ + +  SAII+P+KVKQ+SW+PRAFVY+GFLTD ECDHLIS+AKSELKR
Sbjct: 15  LLLICKIHFALGSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKR 74

Query: 354 SAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEP 512
           SAVADNLSGESKLSEVRTSSGMFI                     PKENGEDIQVLRYE 
Sbjct: 75  SAVADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEH 134

Query: 513 RQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDL 692
            QKYDPHYDYF DKVN+ARGGHR+ATVLMYL++V KGGETVFPNAEE+ R +    DEDL
Sbjct: 135 GQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDL 194

Query: 693 SDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKT 872
           S+C KKGVAVKPR+GDALLFFSLHP+AIPD  SLH GCPVIEGEKWSATKWIHVD+FDKT
Sbjct: 195 SECGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKT 254

Query: 873 LGSSDSCTDANENCERWAAL 932
           +G+   CTD +E+CERWAAL
Sbjct: 255 VGAGGDCTDQHESCERWAAL 274


>gb|EXC05706.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis]
          Length = 300

 Score =  376 bits (966), Expect = e-102
 Identities = 191/273 (69%), Positives = 214/273 (78%), Gaps = 14/273 (5%)
 Frame = +3

Query: 156 MIKFW-QLILFLSIALSIVNDSSGS------AIINPSKVKQISWRPRAFVYRGFLTDEEC 314
           M K W QL LFL    S  ++SS S      +IINPSKVKQ+SW+PRAFVY GFLTD EC
Sbjct: 1   MSKLWVQLFLFLLSISSSFHESSSSYAGSAASIINPSKVKQVSWKPRAFVYEGFLTDLEC 60

Query: 315 DHLISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PK 473
           DHLISLAKSELKRSAVADN+SG+SKLSEVRTSSGMFIP                    PK
Sbjct: 61  DHLISLAKSELKRSAVADNVSGKSKLSEVRTSSGMFIPKAKDPIVAGIEDKISTWTFLPK 120

Query: 474 ENGEDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEE 653
           ENGED+QVLRYE  QKYDPHYDYF DKVN+ARGGHRIATVLMYL+DV KGGETVFP+AEE
Sbjct: 121 ENGEDMQVLRYEHGQKYDPHYDYFADKVNIARGGHRIATVLMYLTDVVKGGETVFPSAEE 180

Query: 654 TSRRRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWS 833
           +   ++   D+DLS+CAKKG+AVKPR+GDALLFFSL P+A+PD  SLH GCPVIEGEKWS
Sbjct: 181 SHHHKASTTDDDLSECAKKGIAVKPRRGDALLFFSLLPTAVPDTISLHAGCPVIEGEKWS 240

Query: 834 ATKWIHVDNFDKTLGSSDSCTDANENCERWAAL 932
           ATKWIHVD+FDK L +   CTD NE+CERWAAL
Sbjct: 241 ATKWIHVDSFDKDLSAGGKCTDQNESCERWAAL 273


>ref|XP_004296772.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Fragaria
           vesca subsp. vesca]
          Length = 294

 Score =  376 bits (965), Expect = e-102
 Identities = 185/259 (71%), Positives = 209/259 (80%), Gaps = 7/259 (2%)
 Frame = +3

Query: 177 ILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRS 356
           + FLS+ LS+ + +S +  +NPSKVKQISW PRAFVY G L++ ECDHLIS+AKSELKRS
Sbjct: 10  LCFLSLLLSLTS-ASATFTVNPSKVKQISWNPRAFVYEGLLSELECDHLISIAKSELKRS 68

Query: 357 AVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPR 515
           AVADNLSG+SKLSEVRTSSGMFIP                    PKENGEDIQVLRYEP 
Sbjct: 69  AVADNLSGQSKLSEVRTSSGMFIPKAKDHIVAGIEDKLATWTFLPKENGEDIQVLRYEPG 128

Query: 516 QKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLS 695
           QKY+PHYDYF DKVN+ARGGHRIATVLMYL+DV KGGETVFP AEE  RR++ + D  LS
Sbjct: 129 QKYEPHYDYFADKVNIARGGHRIATVLMYLTDVAKGGETVFPLAEEVHRRKASVPDASLS 188

Query: 696 DCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTL 875
           DCAKKG+AVKPR+GDALLFFSLHP+AIPD  SLH GCPVIEGEKWSATKWIHVD+FD  L
Sbjct: 189 DCAKKGIAVKPRRGDALLFFSLHPNAIPDENSLHAGCPVIEGEKWSATKWIHVDSFDNIL 248

Query: 876 GSSDSCTDANENCERWAAL 932
            +  +CTD NE+CERWAAL
Sbjct: 249 DTGGNCTDLNESCERWAAL 267


>ref|XP_006419737.1| hypothetical protein CICLE_v10005535mg [Citrus clementina]
           gi|557521610|gb|ESR32977.1| hypothetical protein
           CICLE_v10005535mg [Citrus clementina]
          Length = 296

 Score =  375 bits (964), Expect = e-101
 Identities = 186/256 (72%), Positives = 206/256 (80%), Gaps = 7/256 (2%)
 Frame = +3

Query: 186 LSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVA 365
           LS +L I    S +AIINPSKVKQISW+PRAFVY GFLTD ECDHLI+LAKS+LKRSAVA
Sbjct: 14  LSFSLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA 73

Query: 366 DNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKY 524
           DNLSGESKLS+VRTSSG FIP                    PKENGEDIQVLRYE  QKY
Sbjct: 74  DNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKY 133

Query: 525 DPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCA 704
           +PHYDYF+DKVN+ RGGHR+ATVLMYLSDV KGGETVFPNAEE  RRR+   ++DLS+CA
Sbjct: 134 EPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA 193

Query: 705 KKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSS 884
           KKG+AVKPR+GDALLFFSLH +AIPDP SLH GCPVIEGEKWSATKWIHVD+FDK +   
Sbjct: 194 KKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEG 253

Query: 885 DSCTDANENCERWAAL 932
             CTD N +CERWAAL
Sbjct: 254 GDCTDNNASCERWAAL 269


>gb|EOY06341.1| 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein
           [Theobroma cacao]
          Length = 287

 Score =  375 bits (964), Expect = e-101
 Identities = 182/243 (74%), Positives = 203/243 (83%), Gaps = 7/243 (2%)
 Frame = +3

Query: 225 SAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVADNLSGESKLSEVR 404
           S+IINP+K KQ+SW+PRAFVY GFLTD ECDHLISLAKSELKRSAVADN+SG+SKLSEVR
Sbjct: 33  SSIINPAKAKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNVSGKSKLSEVR 92

Query: 405 TSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKYDPHYDYFTDKVNV 563
           TSSGMFI                     PKENGEDIQVLRYE  QKYDPHYDYF DKVN+
Sbjct: 93  TSSGMFISKGKDPIVAGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDPHYDYFVDKVNI 152

Query: 564 ARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCAKKGVAVKPRKGDA 743
           ARGGHRIATVLMYL+DV KGGET+FP AEE+SRR++   D+DLS+CAKKG+AVKPR+GDA
Sbjct: 153 ARGGHRIATVLMYLTDVTKGGETIFPQAEESSRRKTPATDDDLSECAKKGIAVKPRRGDA 212

Query: 744 LLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSSDSCTDANENCERW 923
           LLFFSL P+AIPDP SLH GCPVIEGEKWSATKWIHVD+FDK L +  +CTD NE+CERW
Sbjct: 213 LLFFSLSPTAIPDPSSLHAGCPVIEGEKWSATKWIHVDSFDKNLEAGGNCTDLNESCERW 272

Query: 924 AAL 932
           AAL
Sbjct: 273 AAL 275


>ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
           gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha
           subunit, putative [Ricinus communis]
          Length = 297

 Score =  374 bits (959), Expect = e-101
 Identities = 184/260 (70%), Positives = 209/260 (80%), Gaps = 7/260 (2%)
 Frame = +3

Query: 174 LILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKR 353
           L++ L    S     S ++II+PSKVKQ+SW+PRAFVY GFLTD ECDHLISLAKSELKR
Sbjct: 11  LLISLIFHKSSSYPGSPTSIIDPSKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKR 70

Query: 354 SAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEP 512
           SAVADN SG+SKLSEVRTSSGMFI                     PKENGED+QVLRYE 
Sbjct: 71  SAVADNESGKSKLSEVRTSSGMFIAKGKDPIIAGIEEKISTWTFLPKENGEDLQVLRYEH 130

Query: 513 RQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDL 692
            QKYDPHYDYF DK+N+ARGGHR+ATVLMYLSDV KGGETVFPNAEE  RR++  + EDL
Sbjct: 131 GQKYDPHYDYFADKINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDL 190

Query: 693 SDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKT 872
           S+CAKKG++VKPR+GDALLFFSLHP+AIPDP SLH GCPVIEGEKWSATKWIHVD+FDK 
Sbjct: 191 SECAKKGISVKPRRGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIHVDSFDKN 250

Query: 873 LGSSDSCTDANENCERWAAL 932
           + +  +CTD NE+CERWAAL
Sbjct: 251 IEAGGNCTDKNESCERWAAL 270


>ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula] gi|355483101|gb|AES64304.1| Prolyl
           4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 303

 Score =  372 bits (956), Expect = e-101
 Identities = 186/262 (70%), Positives = 211/262 (80%), Gaps = 9/262 (3%)
 Frame = +3

Query: 174 LILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKR 353
           L+L   I  ++ + +  SAII+P+KVKQ+SW+PRAFVY+GFLTD ECDHLIS+AKSELKR
Sbjct: 15  LLLICKIHFALGSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKR 74

Query: 354 SAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEP 512
           SAVADNLSGESKLSEVRTSSGMFI                     PKENGEDIQVLRYE 
Sbjct: 75  SAVADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEH 134

Query: 513 RQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAE--ETSRRRSVIADE 686
            QKYDPHYDYF DKVN+ARGGHR+ATVLMYL++V KGGETVFPNAE  E+ R +    DE
Sbjct: 135 GQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDE 194

Query: 687 DLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFD 866
           DLS+C KKGVAVKPR+GDALLFFSLHP+AIPD  SLH GCPVIEGEKWSATKWIHVD+FD
Sbjct: 195 DLSECGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFD 254

Query: 867 KTLGSSDSCTDANENCERWAAL 932
           KT+G+   CTD +E+CERWAAL
Sbjct: 255 KTVGAGGDCTDQHESCERWAAL 276


>ref|XP_002312720.1| oxidoreductase family protein [Populus trichocarpa]
           gi|222852540|gb|EEE90087.1| oxidoreductase family
           protein [Populus trichocarpa]
          Length = 300

 Score =  371 bits (953), Expect = e-100
 Identities = 181/260 (69%), Positives = 211/260 (81%), Gaps = 7/260 (2%)
 Frame = +3

Query: 174 LILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKR 353
           L +F  +  SI    + S+IINP+KVKQ+SW+PRAFVY GFLTD ECDHLISLAKSELKR
Sbjct: 14  LSIFSILHKSISYPGTSSSIINPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKR 73

Query: 354 SAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEP 512
           SAVADN SG+SKLSEVRTSSGMFI                     P+ENGEDIQVLRYE 
Sbjct: 74  SAVADNESGKSKLSEVRTSSGMFITKAKDPIVAGIEDKIATWTFLPRENGEDIQVLRYEH 133

Query: 513 RQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDL 692
            QKYDPHYDYF+DKVN+ARGGHR+ATVLMYL+DVEKGGETVFP+AEE  RR++ ++ EDL
Sbjct: 134 GQKYDPHYDYFSDKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDL 193

Query: 693 SDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKT 872
           S+CA+KG+AVKPR+GDALLFFSL+P+A+PD  S+H GCPVIEGEKWSATKWIHVD+FDK 
Sbjct: 194 SECARKGIAVKPRRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKWSATKWIHVDSFDKN 253

Query: 873 LGSSDSCTDANENCERWAAL 932
           L +  +CTD NE+C RWAAL
Sbjct: 254 LEAGGNCTDQNESCGRWAAL 273


>gb|EMJ24403.1| hypothetical protein PRUPE_ppa009336mg [Prunus persica]
          Length = 297

 Score =  370 bits (951), Expect = e-100
 Identities = 184/270 (68%), Positives = 207/270 (76%), Gaps = 11/270 (4%)
 Frame = +3

Query: 156 MIKFWQLILFLSIALSIVNDSSGSA----IINPSKVKQISWRPRAFVYRGFLTDEECDHL 323
           M + W  + F    LSI + S  ++     +NPSKV+QISW PRAFVY G LTD ECDHL
Sbjct: 1   MTRVWLQLFFFFFLLSISSSSYAASPHTFTVNPSKVRQISWNPRAFVYEGLLTDAECDHL 60

Query: 324 ISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENG 482
           IS+AKSELKRSAVADNLSG+SKLSEVRTSSGMFIP                    PKENG
Sbjct: 61  ISIAKSELKRSAVADNLSGQSKLSEVRTSSGMFIPKAKDPIVAGIEDKIATWTFLPKENG 120

Query: 483 EDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSR 662
           EDIQVLRYEP QKY+PHYDYF DKVN+ARGGHRIATVLMYL+DV +GGETVFP AE  SR
Sbjct: 121 EDIQVLRYEPGQKYEPHYDYFADKVNIARGGHRIATVLMYLTDVTRGGETVFPEAEVPSR 180

Query: 663 RRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATK 842
           R++   D  LS+CAKKG+AVKPR+GDALLFFSL P A+PD  SLH GCPVIEGEKWSATK
Sbjct: 181 RKASEVDHSLSECAKKGIAVKPRRGDALLFFSLTPHAVPDENSLHAGCPVIEGEKWSATK 240

Query: 843 WIHVDNFDKTLGSSDSCTDANENCERWAAL 932
           WIHVD+FDK L +S +C D NE+CERWAAL
Sbjct: 241 WIHVDSFDKNLDASGNCADLNESCERWAAL 270


>ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
           vinifera] gi|297736941|emb|CBI26142.3| unnamed protein
           product [Vitis vinifera]
          Length = 298

 Score =  369 bits (947), Expect = e-100
 Identities = 185/271 (68%), Positives = 215/271 (79%), Gaps = 12/271 (4%)
 Frame = +3

Query: 156 MIKFWQLILFLSIALSIVNDSSGSAI-----INPSKVKQISWRPRAFVYRGFLTDEECDH 320
           M+   Q +L L I+ +I+  SS  A      ++ +KV+QISW+PRAFVY GFL++EECDH
Sbjct: 1   MVSSLQFLLLLWISSTILEFSSSYADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDH 60

Query: 321 LISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKEN 479
           LISLAKSELKRSAVADN+SG+S+LSEVRTSSGMFI                     PK+N
Sbjct: 61  LISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDN 120

Query: 480 GEDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETS 659
           GED+QVLRYEP QKYD HYDYF DKVN+ARGGHRIATVLMYLSDV KGGETVFP AEE S
Sbjct: 121 GEDMQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPS 180

Query: 660 RRRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSAT 839
           RR+ +  ++DLS+CA+KG+AVKPRKGDALLFFSLHP+AIPDP SLHGGCPVIEGEKWSAT
Sbjct: 181 RRKPLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSAT 240

Query: 840 KWIHVDNFDKTLGSSDSCTDANENCERWAAL 932
           KWIHVD+FDK L    +CTD N++CERWAAL
Sbjct: 241 KWIHVDSFDKILKPGGNCTDENDSCERWAAL 271


>ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Glycine max]
          Length = 301

 Score =  369 bits (946), Expect = 1e-99
 Identities = 180/245 (73%), Positives = 198/245 (80%), Gaps = 7/245 (2%)
 Frame = +3

Query: 219 SGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVADNLSGESKLSE 398
           S SAII+PSKVKQ+SW+PRAFVY GFLT+ ECDHLIS+AKSELKRSAVADNLSGESKLSE
Sbjct: 30  SASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE 89

Query: 399 VRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKYDPHYDYFTDKV 557
           VRTSSGMFIP                    PKENGEDIQVLRYE  QKYDPHYDYF DKV
Sbjct: 90  VRTSSGMFIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYFADKV 149

Query: 558 NVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCAKKGVAVKPRKG 737
           N+ARGGHR+ATVLMYL+DV KGGETVFPNAEE+ R R     EDLS+CA+KG+AVKPR+G
Sbjct: 150 NIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIAVKPRRG 209

Query: 738 DALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSSDSCTDANENCE 917
           DALLFFSL+P+AIPD  SLH GCPVIEGEKWSATKWIHVD+FDK +     C D  ENC+
Sbjct: 210 DALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSFDKMVADGGDCNDKQENCD 269

Query: 918 RWAAL 932
           RWA L
Sbjct: 270 RWATL 274


>ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
           gi|571532068|ref|XP_006600167.1| PREDICTED:
           uncharacterized protein LOC100783075 isoform X1 [Glycine
           max] gi|255645457|gb|ACU23224.1| unknown [Glycine max]
          Length = 298

 Score =  367 bits (942), Expect = 3e-99
 Identities = 185/270 (68%), Positives = 209/270 (77%), Gaps = 9/270 (3%)
 Frame = +3

Query: 150 SSMIKFWQLILFLSIALSIVNDSSGSA--IINPSKVKQISWRPRAFVYRGFLTDEECDHL 323
           SS + F   +L +S    +    +GSA  I+NPSKVKQISW+PRAFVY GFLTD ECDHL
Sbjct: 2   SSRVWFLLFLLLISKCHQVWGSYAGSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHL 61

Query: 324 ISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENG 482
           ISLAKSELKRSAVADNLSGES+LS+VRTSSGMFI                     PKENG
Sbjct: 62  ISLAKSELKRSAVADNLSGESQLSDVRTSSGMFISKNKDPIISGIEDKISSWTFLPKENG 121

Query: 483 EDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSR 662
           EDIQVLRYE  QKYDPHYDYFTDKVN+ARGGHRIATVLMYL++V KGGETVFP+AEE  R
Sbjct: 122 EDIQVLRYEHGQKYDPHYDYFTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPR 181

Query: 663 RRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATK 842
           RR      DLS+CAKKG+AVKP +GDALLFFSLH +A PD  SLH GCPVIEGEKWSATK
Sbjct: 182 RRGTETSSDLSECAKKGIAVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATK 241

Query: 843 WIHVDNFDKTLGSSDSCTDANENCERWAAL 932
           WIHVD+FDKT+G+   C+D + +CERWA+L
Sbjct: 242 WIHVDSFDKTVGAGGDCSDHHVSCERWASL 271


>ref|NP_001276206.1| uncharacterized protein LOC100818794 precursor [Glycine max]
           gi|255641919|gb|ACU21228.1| unknown [Glycine max]
          Length = 301

 Score =  367 bits (941), Expect = 5e-99
 Identities = 178/245 (72%), Positives = 201/245 (82%), Gaps = 7/245 (2%)
 Frame = +3

Query: 219 SGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVADNLSGESKLSE 398
           S SAII+PSKVKQ+SW+PRAFVY GFLT+ ECDHLIS+AKSELKRSAVADNLSGESKLSE
Sbjct: 30  SASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE 89

Query: 399 VRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKYDPHYDYFTDKV 557
           VRTSSGMFIP                    PKENGEDIQVLRYE  QKYDPHYDYF DKV
Sbjct: 90  VRTSSGMFIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKV 149

Query: 558 NVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCAKKGVAVKPRKG 737
           N+ARGGHR+ATVLMYL+DV KGGETVFP+AEE+ R +    +E+LS+CA+KG+AVKPR+G
Sbjct: 150 NIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRG 209

Query: 738 DALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSSDSCTDANENCE 917
           DALLFFSL+P+AIPD  SLH GCPVIEGEKWSAT+WIHVD+FDK +G    C D +ENCE
Sbjct: 210 DALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIHVDSFDKVVGDGGDCNDKHENCE 269

Query: 918 RWAAL 932
           RWA L
Sbjct: 270 RWATL 274


>ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
           gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score =  366 bits (940), Expect = 6e-99
 Identities = 180/245 (73%), Positives = 199/245 (81%), Gaps = 7/245 (2%)
 Frame = +3

Query: 219 SGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVADNLSGESKLSE 398
           S S++INPSKVKQISW+PRAFVY GFLTD ECDHLISLAKSELKRSAVADNLSGES+LS+
Sbjct: 26  SASSVINPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSD 85

Query: 399 VRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKYDPHYDYFTDKV 557
           VRTSSGMFI                     PKENGEDIQV RYE  QKYDPHYDYFTDKV
Sbjct: 86  VRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDYFTDKV 145

Query: 558 NVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCAKKGVAVKPRKG 737
           N+ARGGHRIATVLMYL+DV KGGETVFP+AEE  RRR      DLS+CAKKG+AVKPR+G
Sbjct: 146 NIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRG 205

Query: 738 DALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSSDSCTDANENCE 917
           DALLFFSLH +A PD  SLH GCPVIEGEKWSATKWIHVD+FDKT+G+   C+D + +CE
Sbjct: 206 DALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIHVDSFDKTVGAGGDCSDNHVSCE 265

Query: 918 RWAAL 932
           RWA+L
Sbjct: 266 RWASL 270


>ref|XP_004508327.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cicer
           arietinum]
          Length = 297

 Score =  363 bits (931), Expect = 7e-98
 Identities = 184/270 (68%), Positives = 209/270 (77%), Gaps = 11/270 (4%)
 Frame = +3

Query: 156 MIKFWQLILFLSIALSIVNDSS----GSAIINPSKVKQISWRPRAFVYRGFLTDEECDHL 323
           MIK W L+L + I+ +    SS     S+IINPSKVKQISW PRAFVY+GFLTD ECDHL
Sbjct: 1   MIKVWFLLLLVLISQTDEVHSSYAGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHL 60

Query: 324 ISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENG 482
           ISLAKSELKRSAVADNLSG+SKLS+VRTSSGMFI                     PKENG
Sbjct: 61  ISLAKSELKRSAVADNLSGDSKLSDVRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENG 120

Query: 483 EDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSR 662
           EDIQVLRYE  QKYDPHYDYFTDKVN+A+GGHR  TVLMYL++V KGGET+FP A+E  R
Sbjct: 121 EDIQVLRYEHGQKYDPHYDYFTDKVNIAQGGHRFVTVLMYLTNVTKGGETMFPVAKEPPR 180

Query: 663 RRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATK 842
           RR      DLS+CAKKG+AVKPR+GDALLFFSLH +A PD  SLH GCPVIEGEKWSATK
Sbjct: 181 RRGSETSSDLSECAKKGIAVKPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATK 240

Query: 843 WIHVDNFDKTLGSSDSCTDANENCERWAAL 932
           WIHVD+FDK +G+   C+D +E+CERWA+L
Sbjct: 241 WIHVDSFDKNVGAGGGCSDQHESCERWASL 270

