BLASTX nr result

ID: Rehmannia24_contig00017700 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00017700
         (942 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004234409.1| PREDICTED: uncharacterized protein LOC101255...   288   3e-75
ref|XP_006353874.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   280   4e-73
ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alph...   271   2e-70
ref|XP_004309201.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   271   3e-70
emb|CBI22704.3| unnamed protein product [Vitis vinifera]              268   2e-69
ref|XP_002318810.1| ShTK domain-containing family protein [Popul...   268   3e-69
ref|XP_006490420.1| PREDICTED: transmembrane prolyl 4-hydroxylas...   260   6e-67
ref|XP_006421954.1| hypothetical protein CICLE_v10005478mg [Citr...   260   6e-67
ref|XP_004515255.1| PREDICTED: uncharacterized protein LOC101510...   255   2e-65
ref|XP_004515254.1| PREDICTED: uncharacterized protein LOC101510...   255   2e-65
gb|EXC19145.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notab...   253   7e-65
ref|XP_006599568.1| PREDICTED: uncharacterized protein LOC100795...   251   2e-64
ref|XP_003548177.2| PREDICTED: uncharacterized protein LOC100795...   251   2e-64
gb|EOY23228.1| Oxoglutarate/iron-dependent oxygenase, putative [...   248   2e-63
ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative...   246   4e-63
ref|XP_006587295.1| PREDICTED: uncharacterized protein LOC100775...   246   1e-62
ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775...   246   1e-62
gb|ESW24239.1| hypothetical protein PHAVU_004G113700g [Phaseolus...   241   2e-61
gb|ESW24238.1| hypothetical protein PHAVU_004G113700g [Phaseolus...   241   2e-61
ref|XP_006413291.1| hypothetical protein EUTSA_v10025829mg [Eutr...   230   7e-58

>ref|XP_004234409.1| PREDICTED: uncharacterized protein LOC101255367 [Solanum
           lycopersicum]
          Length = 306

 Score =  288 bits (736), Expect = 3e-75
 Identities = 146/256 (57%), Positives = 183/256 (71%), Gaps = 4/256 (1%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDDDSPKIETNNIPANFGVSV 759
           SR   L+ + RVFLYRDF+S EE D+LIS V   +  N S  D+  ++  N P   G+ V
Sbjct: 54  SRVVQLSWRPRVFLYRDFMSAEETDHLISSVHGMR--NGSTIDNASVDAVNFPT-MGIPV 110

Query: 758 DADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLAT 579
           DA D  +  I ERISAWTFLPK NSK + VLH G E+SK NY+YF   S  +   PL+AT
Sbjct: 111 DAKDPTSSRIEERISAWTFLPKGNSKPLHVLHSGRESSKGNYSYFEMNSTLKSSEPLMAT 170

Query: 578 VILYLSNISRGGQIHFPQSENEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSSLHA 399
           VILYLSN+++GGQI FP+SEN++LSDCTK+++  RP+KGNAIVFFN+HL+A+PDRSS HA
Sbjct: 171 VILYLSNVTQGGQILFPESENKILSDCTKSSDSLRPTKGNAIVFFNVHLDASPDRSSSHA 230

Query: 398 RCPVLEGDMWCATKLFYLKDIST----XXXXXXXXXXXXXENCSRWAAIGECQRNSIFMI 231
           RCPV++G+MW A K FYL+ I+                  ENC+RWAA GEC+RN +FM+
Sbjct: 231 RCPVIDGEMWYAIKFFYLRSITVQKDPLQSDGDTYCTDEDENCTRWAATGECERNPVFMV 290

Query: 230 GSPDYYGTCRKSCNAC 183
           GSPDYYGTCRKSCNAC
Sbjct: 291 GSPDYYGTCRKSCNAC 306


>ref|XP_006353874.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum
           tuberosum]
          Length = 306

 Score =  280 bits (717), Expect = 4e-73
 Identities = 143/256 (55%), Positives = 178/256 (69%), Gaps = 4/256 (1%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDDDSPKIETNNIPANFGVSV 759
           SR   L+ + RVFLYRDFLS EE D+LIS V   +  N S  D+  ++    P   G+ +
Sbjct: 54  SRVVQLSWRPRVFLYRDFLSAEETDHLISLVHGTR--NSSTIDNASVDAVKFPT-MGIPL 110

Query: 758 DADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLAT 579
           DA D  +  I ERISAWTFLPK NSK + VLH   E+ K NY YF   S  +   PL+AT
Sbjct: 111 DAKDPTSSRIEERISAWTFLPKGNSKPLHVLHSERESLKGNYGYFERNSTLKSSEPLMAT 170

Query: 578 VILYLSNISRGGQIHFPQSENEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSSLHA 399
           VILYLSN+++GGQI FP+SEN++LSDCTK+ +  RP+KGNAIVFFN+HL+A+PDRSS HA
Sbjct: 171 VILYLSNVTQGGQILFPESENKILSDCTKSRDSLRPTKGNAIVFFNVHLDASPDRSSSHA 230

Query: 398 RCPVLEGDMWCATKLFYLKDIST----XXXXXXXXXXXXXENCSRWAAIGECQRNSIFMI 231
           RCPV++G+MW A K FYL+ I+                  ENC+RWAA GEC+RN +FM+
Sbjct: 231 RCPVIDGEMWYAIKFFYLRSITVQKDPLQSDGDTYCTDEDENCTRWAATGECERNPVFMV 290

Query: 230 GSPDYYGTCRKSCNAC 183
           GSPDYYGTCRKSCNAC
Sbjct: 291 GSPDYYGTCRKSCNAC 306


>ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
           vinifera]
          Length = 312

 Score =  271 bits (694), Expect = 2e-70
 Identities = 137/258 (53%), Positives = 183/258 (70%), Gaps = 6/258 (2%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDD-DSPKIETNNIPANFGVS 762
           SR   L+ + R FLYR FLS+EECD+LIS    KK    ++  DS  +    +  +    
Sbjct: 55  SRVIQLSWQPRAFLYRGFLSDEECDHLISLALGKKEELATNGGDSGNVVLKRLLKSSEGP 114

Query: 761 VDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLA 582
           +  DDE+A  I +RISAWTFLPKENS+ + V+ +  EN+KQ YNYF N+S  + G PL+A
Sbjct: 115 LYIDDEVAARIEKRISAWTFLPKENSEPLEVVQYQFENAKQKYNYFSNKSTSKFGEPLMA 174

Query: 581 TVILYLSNISRGGQIHFPQSENE--MLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSS 408
           TV+L+LSN++RGG++ FP+SE++  +LSDCT++++  RP KGNAI+FFN+H NA+PD+SS
Sbjct: 175 TVLLHLSNVTRGGELFFPESESKSGILSDCTESSSGLRPVKGNAILFFNVHPNASPDKSS 234

Query: 407 LHARCPVLEGDMWCATKLFYLKDI---STXXXXXXXXXXXXXENCSRWAAIGECQRNSIF 237
            +ARCPVLEG+MWCATK F+L+ I   +              ENC +WA+IGECQRN I+
Sbjct: 235 SYARCPVLEGEMWCATKFFHLRAIGRENVSFKLDGGECTDEDENCPKWASIGECQRNPIY 294

Query: 236 MIGSPDYYGTCRKSCNAC 183
           MIGSPDYYGTCRKSCN C
Sbjct: 295 MIGSPDYYGTCRKSCNVC 312


>ref|XP_004309201.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Fragaria
           vesca subsp. vesca]
          Length = 310

 Score =  271 bits (692), Expect = 3e-70
 Identities = 139/258 (53%), Positives = 178/258 (68%), Gaps = 6/258 (2%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSD-DDSPKIETNNIPANFGVS 762
           SR   L+ + RVFLY  FLS+EECD+LI         + +D D+S    TN +  +  + 
Sbjct: 53  SRVVQLSWRPRVFLYEGFLSDEECDHLIYLANGGDGKSSTDYDESGNSNTNRMLKSLELP 112

Query: 761 VDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLA 582
           ++ +D I  TI E+ISAWTFLPKENS+++ VLH+  E  ++NYNYF N S  +   PLLA
Sbjct: 113 LNQEDGIVSTIEEKISAWTFLPKENSRALQVLHYDLEEVEKNYNYFGNGSTLEQSEPLLA 172

Query: 581 TVILYLSNISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSS 408
           TV+LYLSNI+RGG+I FP+SE  ++  S C K+N+I +P KGNAI+FFNLH NA+PD+SS
Sbjct: 173 TVVLYLSNITRGGEILFPESELKSKAWSGCGKSNSILKPIKGNAILFFNLHPNASPDKSS 232

Query: 407 LHARCPVLEGDMWCATKLFYLKDI---STXXXXXXXXXXXXXENCSRWAAIGECQRNSIF 237
            HARCPVLEG+MWCATKLF+ K I    +             ++C RWA IGECQRN +F
Sbjct: 233 SHARCPVLEGEMWCATKLFHAKAIPREHSLSNSGNRECTDEDDSCPRWADIGECQRNPVF 292

Query: 236 MIGSPDYYGTCRKSCNAC 183
           MIGS DYYGTCRKSCN C
Sbjct: 293 MIGSDDYYGTCRKSCNVC 310


>emb|CBI22704.3| unnamed protein product [Vitis vinifera]
          Length = 317

 Score =  268 bits (686), Expect = 2e-69
 Identities = 137/263 (52%), Positives = 182/263 (69%), Gaps = 11/263 (4%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDD-DSPKIETNNIPANFGVS 762
           SR   L+ + R FLYR FLS+EECD+LIS    KK    ++  DS  +    +  +    
Sbjct: 55  SRVIQLSWQPRAFLYRGFLSDEECDHLISLALGKKEELATNGGDSGNVVLKRLLKSSEGP 114

Query: 761 VDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLA 582
           +  DDE+A  I +RISAWTFLPKENS+ + V+ +  EN+KQ YNYF N+S  + G PL+A
Sbjct: 115 LYIDDEVAARIEKRISAWTFLPKENSEPLEVVQYQFENAKQKYNYFSNKSTSKFGEPLMA 174

Query: 581 TVILYLSNISRGGQIHFPQSE-------NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNAT 423
           TV+L+LSN++RGG++ FP+SE       + +LSDCT++++  RP KGNAI+FFN+H NA+
Sbjct: 175 TVLLHLSNVTRGGELFFPESELKNSQSKSGILSDCTESSSGLRPVKGNAILFFNVHPNAS 234

Query: 422 PDRSSLHARCPVLEGDMWCATKLFYLKDI---STXXXXXXXXXXXXXENCSRWAAIGECQ 252
           PD+SS +ARCPVLEG+MWCATK F+L+ I   +              ENC +WA+IGECQ
Sbjct: 235 PDKSSSYARCPVLEGEMWCATKFFHLRAIGRENVSFKLDGGECTDEDENCPKWASIGECQ 294

Query: 251 RNSIFMIGSPDYYGTCRKSCNAC 183
           RN I+MIGSPDYYGTCRKSCN C
Sbjct: 295 RNPIYMIGSPDYYGTCRKSCNVC 317


>ref|XP_002318810.1| ShTK domain-containing family protein [Populus trichocarpa]
           gi|222859483|gb|EEE97030.1| ShTK domain-containing
           family protein [Populus trichocarpa]
          Length = 310

 Score =  268 bits (684), Expect = 3e-69
 Identities = 130/255 (50%), Positives = 179/255 (70%), Gaps = 3/255 (1%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVR-EKKSYNVSDDDSPKIETNNIPANFGVS 762
           SR   ++ + RVF+Y+ FL++EECD+LIS  +  K++    DDDS +IE N + A+    
Sbjct: 56  SRVVTVSWQPRVFVYKGFLTDEECDHLISLAQGTKETSEGKDDDSGRIERNRLFASSTSL 115

Query: 761 VDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLA 582
           ++ DD I   I ER+SAWT LPKENSK + V+H+G E++K  ++YF N+SA     PL+A
Sbjct: 116 LNMDDNILSRIEERVSAWTLLPKENSKPLQVMHYGIEDAKNYFDYFGNKSAIISSEPLMA 175

Query: 581 TVILYLSNISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSS 408
           T++ YLSN+++GG+I FP+SE  N++ SDCTK ++  RP KGNAI+FF +H N +PD  S
Sbjct: 176 TLVFYLSNVTQGGEIFFPKSEVKNKIWSDCTKISDSLRPIKGNAILFFTVHPNTSPDMGS 235

Query: 407 LHARCPVLEGDMWCATKLFYLKDISTXXXXXXXXXXXXXENCSRWAAIGECQRNSIFMIG 228
            H+RCPVLEG+MW ATK FYL+ I               ENC  WAA+GEC++N ++MIG
Sbjct: 236 SHSRCPVLEGEMWYATKKFYLRAIKVFSDSEGSECTDEDENCPSWAALGECEKNPVYMIG 295

Query: 227 SPDYYGTCRKSCNAC 183
           SPDY+GTCRKSCNAC
Sbjct: 296 SPDYFGTCRKSCNAC 310


>ref|XP_006490420.1| PREDICTED: transmembrane prolyl 4-hydroxylase-like [Citrus
           sinensis]
          Length = 313

 Score =  260 bits (664), Expect = 6e-67
 Identities = 127/258 (49%), Positives = 173/258 (67%), Gaps = 6/258 (2%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVR-EKKSYNVSDDDSPKIETNNIPANFGVS 762
           SR T ++ + RVFLYR  LS EECD+LIS     +K Y  + +D   +  N   ++F   
Sbjct: 56  SRVTQISWRPRVFLYRGLLSNEECDHLISLGHGAEKKYKRTGEDPENVSKNKQNSSFRTE 115

Query: 761 VDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLA 582
           ++ +D+I   I E+I  WTFLPKENSK + V+ +G + +K+N +YF N+SA  +  PL+A
Sbjct: 116 LNIEDDIVARIEEKILTWTFLPKENSKPVHVMRYGLDEAKENLDYFGNKSALGLSQPLMA 175

Query: 581 TVILYLSNISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSS 408
           TV+LYLSN+++GG++ FP SE  ++M SDC KT+N+ RP KGNAI+FF +H NA PD SS
Sbjct: 176 TVVLYLSNVTQGGELLFPNSEEKDKMWSDCAKTSNVLRPVKGNAILFFTVHPNAAPDESS 235

Query: 407 LHARCPVLEGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRNSIF 237
            H RCPVLEG+MW A K F +K  +                 +NC  WAA+GECQRN ++
Sbjct: 236 SHTRCPVLEGEMWSAVKFFQVKAANAEEVLIGSDSNECTDEDDNCPHWAAVGECQRNPVY 295

Query: 236 MIGSPDYYGTCRKSCNAC 183
           M+GSPDYYGTCRKSC+AC
Sbjct: 296 MLGSPDYYGTCRKSCHAC 313


>ref|XP_006421954.1| hypothetical protein CICLE_v10005478mg [Citrus clementina]
           gi|557523827|gb|ESR35194.1| hypothetical protein
           CICLE_v10005478mg [Citrus clementina]
          Length = 312

 Score =  260 bits (664), Expect = 6e-67
 Identities = 127/258 (49%), Positives = 173/258 (67%), Gaps = 6/258 (2%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVR-EKKSYNVSDDDSPKIETNNIPANFGVS 762
           SR T ++ + RVFLYR  LS EECD+LIS     +K Y  + +D   +  N   ++F   
Sbjct: 55  SRVTQISWRPRVFLYRGLLSNEECDHLISLGHGAEKKYKRTGEDPENVSKNKQNSSFRTE 114

Query: 761 VDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLA 582
           ++ +D+I   I E+I  WTFLPKENSK + V+ +G + +K+N +YF N+SA  +  PL+A
Sbjct: 115 LNIEDDIVARIEEKILTWTFLPKENSKPVHVMRYGLDEAKENLDYFGNKSALGLSQPLMA 174

Query: 581 TVILYLSNISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSS 408
           TV+LYLSN+++GG++ FP SE  ++M SDC KT+N+ RP KGNAI+FF +H NA PD SS
Sbjct: 175 TVVLYLSNVTQGGELLFPNSEEKDKMWSDCAKTSNVLRPVKGNAILFFTVHPNAAPDESS 234

Query: 407 LHARCPVLEGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRNSIF 237
            H RCPVLEG+MW A K F +K  +                 +NC  WAA+GECQRN ++
Sbjct: 235 SHTRCPVLEGEMWSAVKFFQVKAANAEEVLIGSDSNECTDEDDNCPHWAAVGECQRNPVY 294

Query: 236 MIGSPDYYGTCRKSCNAC 183
           M+GSPDYYGTCRKSC+AC
Sbjct: 295 MLGSPDYYGTCRKSCHAC 312


>ref|XP_004515255.1| PREDICTED: uncharacterized protein LOC101510244 isoform X2 [Cicer
           arietinum]
          Length = 302

 Score =  255 bits (651), Expect = 2e-65
 Identities = 127/250 (50%), Positives = 169/250 (67%), Gaps = 8/250 (3%)
 Frame = -2

Query: 908 RVFLYRDFLSEEECDYLISW---VREKKSYNVSDDDSPKIETNNIPANFGVSVDADDEIA 738
           RVFLY+ FLS++ECDYLI+    VREK S N    +               S+D +D+I 
Sbjct: 64  RVFLYKGFLSDKECDYLIALARDVREKSSGNGGHSEEDD-----------TSLDMNDDIV 112

Query: 737 KTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLATVILYLSN 558
           K I ER+S WTFLPKENSK + ++H+G E  +QN +YF N++      PL+AT++LYLSN
Sbjct: 113 KRIEERLSVWTFLPKENSKPLDIMHYGLEKDRQNIDYFTNKTKLDSNGPLMATIVLYLSN 172

Query: 557 ISRGGQIHFPQS--ENEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSSLHARCPVL 384
            ++GGQ+ FP+S  ++   S+C  T++I +P KGNAI+FF+L+LNA+PD++S HARCPVL
Sbjct: 173 STQGGQVLFPESVPKSSSWSNCGNTSDILQPVKGNAILFFSLNLNASPDKTSFHARCPVL 232

Query: 383 EGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRNSIFMIGSPDYY 213
           +GDMW A K FY + IS                 +NCS WAA+GECQRN ++MIGSPDYY
Sbjct: 233 KGDMWSAIKFFYARPISGGKVSATPDVEECTDEDDNCSAWAALGECQRNPVYMIGSPDYY 292

Query: 212 GTCRKSCNAC 183
           GTCRKSCN C
Sbjct: 293 GTCRKSCNVC 302


>ref|XP_004515254.1| PREDICTED: uncharacterized protein LOC101510244 isoform X1 [Cicer
           arietinum]
          Length = 303

 Score =  255 bits (651), Expect = 2e-65
 Identities = 127/250 (50%), Positives = 169/250 (67%), Gaps = 8/250 (3%)
 Frame = -2

Query: 908 RVFLYRDFLSEEECDYLISW---VREKKSYNVSDDDSPKIETNNIPANFGVSVDADDEIA 738
           RVFLY+ FLS++ECDYLI+    VREK S N    +               S+D +D+I 
Sbjct: 65  RVFLYKGFLSDKECDYLIALARDVREKSSGNGGHSEEDD-----------TSLDMNDDIV 113

Query: 737 KTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLATVILYLSN 558
           K I ER+S WTFLPKENSK + ++H+G E  +QN +YF N++      PL+AT++LYLSN
Sbjct: 114 KRIEERLSVWTFLPKENSKPLDIMHYGLEKDRQNIDYFTNKTKLDSNGPLMATIVLYLSN 173

Query: 557 ISRGGQIHFPQS--ENEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSSLHARCPVL 384
            ++GGQ+ FP+S  ++   S+C  T++I +P KGNAI+FF+L+LNA+PD++S HARCPVL
Sbjct: 174 STQGGQVLFPESVPKSSSWSNCGNTSDILQPVKGNAILFFSLNLNASPDKTSFHARCPVL 233

Query: 383 EGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRNSIFMIGSPDYY 213
           +GDMW A K FY + IS                 +NCS WAA+GECQRN ++MIGSPDYY
Sbjct: 234 KGDMWSAIKFFYARPISGGKVSATPDVEECTDEDDNCSAWAALGECQRNPVYMIGSPDYY 293

Query: 212 GTCRKSCNAC 183
           GTCRKSCN C
Sbjct: 294 GTCRKSCNVC 303


>gb|EXC19145.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis]
          Length = 356

 Score =  253 bits (646), Expect = 7e-65
 Identities = 135/254 (53%), Positives = 168/254 (66%), Gaps = 6/254 (2%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDDD-SPKIETNNIPANFGVS 762
           SR   L+ + RVFLY+DFLS+EECDYLIS V ++   + SD + S    T          
Sbjct: 56  SRVVQLSWRPRVFLYQDFLSDEECDYLISLVHKRNEKSSSDGNGSGDTITKGQLKGSETP 115

Query: 761 VDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLA 582
            D  DE+   I ERISAWTFLPKEN K++ V  +  E+S+++ NYF N S  Q   PL+A
Sbjct: 116 DDIVDEVVSRIEERISAWTFLPKENGKALQVWRYENEDSQKDLNYFGNSSLLQQSKPLIA 175

Query: 581 TVILYLSNISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSS 408
           TVILYLSN++ GGQI FP SE  + + SDCTK++NI RP+KGNAI+FFN+H + +PD SS
Sbjct: 176 TVILYLSNVAHGGQILFPDSEVKDNIWSDCTKSDNILRPTKGNAILFFNIHPDTSPDPSS 235

Query: 407 LHARCPVLEGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRNSIF 237
            HARCPV EG MWCATKLF+ K I    T             ENC RWAA GEC+RN +F
Sbjct: 236 SHARCPVQEGQMWCATKLFHAKAIGGEVTSSKSYDGECSDQDENCPRWAATGECERNPVF 295

Query: 236 MIGSPDYYGTCRKS 195
           M+GSPDYYGT  K+
Sbjct: 296 MVGSPDYYGTYLKA 309


>ref|XP_006599568.1| PREDICTED: uncharacterized protein LOC100795761 isoform X2 [Glycine
           max]
          Length = 300

 Score =  251 bits (642), Expect = 2e-64
 Identities = 132/261 (50%), Positives = 176/261 (67%), Gaps = 9/261 (3%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISW---VREKKSYNVSDDDSPKIETNNIPANFG 768
           SR   ++ + RVFLY+ FLS++ECDYL+S    V+EK S N    +   +ET        
Sbjct: 51  SRVVQISWQPRVFLYKGFLSDKECDYLVSLAYAVKEKSSGNGGLSEG--VET-------- 100

Query: 767 VSVDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPL 588
            S+D +D+I   I ER+S W FLPKE SK + V+H+GPE + +N +YF N++  ++  PL
Sbjct: 101 -SLDMEDDILARIEERLSVWAFLPKEYSKPLQVMHYGPEQNGRNLDYFTNKTQLELSGPL 159

Query: 587 LATVILYLSN-ISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPD 417
           +AT+ILYLSN +++GGQI FP+S   +   S C+ ++NI +P KGNAI+FF+LH +A+PD
Sbjct: 160 MATIILYLSNDVTQGGQILFPESVPGSSSWSSCSNSSNILQPVKGNAILFFSLHPSASPD 219

Query: 416 RSSLHARCPVLEGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRN 246
           +SS HARCPVLEGDMW A K FY K IS                 ++C  WAA+GECQRN
Sbjct: 220 KSSFHARCPVLEGDMWSAIKYFYAKPISRGKVSATLDGGECTDEDDSCPAWAAVGECQRN 279

Query: 245 SIFMIGSPDYYGTCRKSCNAC 183
            +FMIGSPDYYGTCRKSCNAC
Sbjct: 280 PVFMIGSPDYYGTCRKSCNAC 300


>ref|XP_003548177.2| PREDICTED: uncharacterized protein LOC100795761 isoform X1 [Glycine
           max]
          Length = 301

 Score =  251 bits (642), Expect = 2e-64
 Identities = 132/261 (50%), Positives = 176/261 (67%), Gaps = 9/261 (3%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISW---VREKKSYNVSDDDSPKIETNNIPANFG 768
           SR   ++ + RVFLY+ FLS++ECDYL+S    V+EK S N    +   +ET        
Sbjct: 52  SRVVQISWQPRVFLYKGFLSDKECDYLVSLAYAVKEKSSGNGGLSEG--VET-------- 101

Query: 767 VSVDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPL 588
            S+D +D+I   I ER+S W FLPKE SK + V+H+GPE + +N +YF N++  ++  PL
Sbjct: 102 -SLDMEDDILARIEERLSVWAFLPKEYSKPLQVMHYGPEQNGRNLDYFTNKTQLELSGPL 160

Query: 587 LATVILYLSN-ISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPD 417
           +AT+ILYLSN +++GGQI FP+S   +   S C+ ++NI +P KGNAI+FF+LH +A+PD
Sbjct: 161 MATIILYLSNDVTQGGQILFPESVPGSSSWSSCSNSSNILQPVKGNAILFFSLHPSASPD 220

Query: 416 RSSLHARCPVLEGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRN 246
           +SS HARCPVLEGDMW A K FY K IS                 ++C  WAA+GECQRN
Sbjct: 221 KSSFHARCPVLEGDMWSAIKYFYAKPISRGKVSATLDGGECTDEDDSCPAWAAVGECQRN 280

Query: 245 SIFMIGSPDYYGTCRKSCNAC 183
            +FMIGSPDYYGTCRKSCNAC
Sbjct: 281 PVFMIGSPDYYGTCRKSCNAC 301


>gb|EOY23228.1| Oxoglutarate/iron-dependent oxygenase, putative [Theobroma cacao]
          Length = 353

 Score =  248 bits (633), Expect = 2e-63
 Identities = 127/258 (49%), Positives = 169/258 (65%), Gaps = 6/258 (2%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVS-DDDSPKIETNNIPANFGVS 762
           SR   L  + RVFLY  FLS+EECD+LIS     K   +  +DD   + TN    +    
Sbjct: 96  SRVMQLLWQPRVFLYNGFLSDEECDHLISLGHGAKEGILGINDDRVNVGTNRQLTSSEPL 155

Query: 761 VDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLA 582
           ++ +D++   I ERIS WTFLP++N + + V   G E ++QN +YF N S   +  PL+A
Sbjct: 156 LNTEDKVLAMIEERISTWTFLPRDNGEPLQVRRHGLEGTEQNLDYFGNISTLALSEPLMA 215

Query: 581 TVILYLSNISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSS 408
           T+ILYLSN++RGG+I FP +E  +++ SDC K++NI +P KGNAI+FF  HLNA+PD SS
Sbjct: 216 TLILYLSNVTRGGEILFPHAEPRSKIWSDCAKSSNIVKPVKGNAILFFTTHLNASPDGSS 275

Query: 407 LHARCPVLEGDMWCATKLFYLKDI---STXXXXXXXXXXXXXENCSRWAAIGECQRNSIF 237
            HARCPVLEG+MW ATK F L+ +                   NC +WAA+GECQRN +F
Sbjct: 276 SHARCPVLEGEMWFATKFFCLRAVKGDKVSFDSDGNECVDEDANCPQWAALGECQRNPVF 335

Query: 236 MIGSPDYYGTCRKSCNAC 183
           M+GSPDYYGTCRK+CNAC
Sbjct: 336 MVGSPDYYGTCRKTCNAC 353


>ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
           gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha
           subunit, putative [Ricinus communis]
          Length = 309

 Score =  246 bits (627), Expect(2) = 4e-63
 Identities = 125/253 (49%), Positives = 171/253 (67%), Gaps = 6/253 (2%)
 Frame = -2

Query: 923 LASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDDDSPKIETNNIP-ANFGVSVDADD 747
           L+ + RVFLY+ FL++EECD LIS     K  +    D  +   NNI  A+        D
Sbjct: 61  LSWRPRVFLYKGFLTDEECDRLISLAHGAKEISKGKGDGSR---NNIQLASSESRSHIYD 117

Query: 746 EIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLATVILY 567
           ++   I ERISAWTF+PKENSK + V+H+G E ++++++YF N++     + L+AT++LY
Sbjct: 118 DLLARIEERISAWTFIPKENSKPLQVMHYGIEEAREHFDYFDNKTLIS-NVSLMATLVLY 176

Query: 566 LSNISRGGQIHFPQSE--NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSSLHARC 393
           LSN++RGG+I FP+SE  +++ SDCTK ++I RP KGNA++ FN HLNA+ D  S H RC
Sbjct: 177 LSNVTRGGEILFPKSELKDKVWSDCTKDSSILRPVKGNAVLIFNAHLNASADSRSTHGRC 236

Query: 392 PVLEGDMWCATKLFYLK---DISTXXXXXXXXXXXXXENCSRWAAIGECQRNSIFMIGSP 222
           PVLEG+MWCATK F ++   +  +             +NC +WAA+GECQRN IFM GSP
Sbjct: 237 PVLEGEMWCATKQFLVRATNEEKSLPDSDGSDCTDEDDNCPKWAALGECQRNPIFMTGSP 296

Query: 221 DYYGTCRKSCNAC 183
           DYYGTCRKSCNAC
Sbjct: 297 DYYGTCRKSCNAC 309



 Score = 23.5 bits (49), Expect(2) = 4e-63
 Identities = 9/10 (90%), Positives = 9/10 (90%)
 Frame = -3

Query: 940 QVVQLSWHPR 911
           QVVQLSW PR
Sbjct: 57  QVVQLSWRPR 66


>ref|XP_006587295.1| PREDICTED: uncharacterized protein LOC100775928 isoform X2 [Glycine
           max]
          Length = 301

 Score =  246 bits (627), Expect = 1e-62
 Identities = 129/261 (49%), Positives = 175/261 (67%), Gaps = 9/261 (3%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISW---VREKKSYNVSDDDSPKIETNNIPANFG 768
           SR   ++ + RVFLY+ FLS++ECDYL+S    V+EK S N    +   +ET        
Sbjct: 52  SRVVQISWQPRVFLYKGFLSDKECDYLVSLAYAVKEKSSGNGGFSEG--VET-------- 101

Query: 767 VSVDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPL 588
             +D +D+I   I ER+S W FLPKE SK + V+H+GPE + +N +YF N++  ++  PL
Sbjct: 102 -FLDIEDDILARIEERLSLWAFLPKEYSKPLQVMHYGPEPNGRNLDYFTNKTQLELSGPL 160

Query: 587 LATVILYLSNIS-RGGQIHFPQS--ENEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPD 417
           +AT++LYLSN + +GGQI FP+S   +   S C+ ++NI +P KGNAI+FF+LH +A+PD
Sbjct: 161 MATIVLYLSNAATQGGQILFPESVPRSSSWSSCSNSSNILQPVKGNAILFFSLHPSASPD 220

Query: 416 RSSLHARCPVLEGDMWCATKLFYLKDIST---XXXXXXXXXXXXXENCSRWAAIGECQRN 246
           ++S HARCPVLEG+MW A K FY K IS+                +NC  WAA+GECQRN
Sbjct: 221 KNSFHARCPVLEGNMWSAIKYFYAKPISSGEVSAISDGGECTDEDDNCPAWAAMGECQRN 280

Query: 245 SIFMIGSPDYYGTCRKSCNAC 183
            +FMIGSPDYYGTCRKSCNAC
Sbjct: 281 PVFMIGSPDYYGTCRKSCNAC 301


>ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775928 isoform X1 [Glycine
           max]
          Length = 302

 Score =  246 bits (627), Expect = 1e-62
 Identities = 129/261 (49%), Positives = 175/261 (67%), Gaps = 9/261 (3%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISW---VREKKSYNVSDDDSPKIETNNIPANFG 768
           SR   ++ + RVFLY+ FLS++ECDYL+S    V+EK S N    +   +ET        
Sbjct: 53  SRVVQISWQPRVFLYKGFLSDKECDYLVSLAYAVKEKSSGNGGFSEG--VET-------- 102

Query: 767 VSVDADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPL 588
             +D +D+I   I ER+S W FLPKE SK + V+H+GPE + +N +YF N++  ++  PL
Sbjct: 103 -FLDIEDDILARIEERLSLWAFLPKEYSKPLQVMHYGPEPNGRNLDYFTNKTQLELSGPL 161

Query: 587 LATVILYLSNIS-RGGQIHFPQS--ENEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPD 417
           +AT++LYLSN + +GGQI FP+S   +   S C+ ++NI +P KGNAI+FF+LH +A+PD
Sbjct: 162 MATIVLYLSNAATQGGQILFPESVPRSSSWSSCSNSSNILQPVKGNAILFFSLHPSASPD 221

Query: 416 RSSLHARCPVLEGDMWCATKLFYLKDIST---XXXXXXXXXXXXXENCSRWAAIGECQRN 246
           ++S HARCPVLEG+MW A K FY K IS+                +NC  WAA+GECQRN
Sbjct: 222 KNSFHARCPVLEGNMWSAIKYFYAKPISSGEVSAISDGGECTDEDDNCPAWAAMGECQRN 281

Query: 245 SIFMIGSPDYYGTCRKSCNAC 183
            +FMIGSPDYYGTCRKSCNAC
Sbjct: 282 PVFMIGSPDYYGTCRKSCNAC 302


>gb|ESW24239.1| hypothetical protein PHAVU_004G113700g [Phaseolus vulgaris]
          Length = 294

 Score =  241 bits (616), Expect = 2e-61
 Identities = 124/257 (48%), Positives = 165/257 (64%), Gaps = 5/257 (1%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDDDSPKIETNNIPANFGVSV 759
           SR   ++ + RVFLY+ FLS++EC+YLIS    +K  +                N G S+
Sbjct: 52  SRVVQISWQPRVFLYKGFLSDKECEYLISLAYAEKEKS--------------SGNGGTSL 97

Query: 758 DADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLAT 579
           + +D+I   I ER+S WTFLPKENSK + V+ +G E + Q   YF N++  ++  PL+AT
Sbjct: 98  EMEDDILARIEERLSIWTFLPKENSKPLQVMQYGSEENDQTLYYFTNKTNLELSGPLMAT 157

Query: 578 VILYLSNISRGGQIHFPQS--ENEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSSL 405
           V+LYLS+ ++GGQI FP+S   +   S C+ +N   +P KGNAI+FF+LH +A+PD+SS 
Sbjct: 158 VVLYLSDSTQGGQILFPESVPRSSSWSSCSNSNKTLQPVKGNAILFFSLHPSASPDKSSF 217

Query: 404 HARCPVLEGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRNSIFM 234
           H+RCPVLEGDMW A K FY K IS                 ++C  WAA GECQRN +FM
Sbjct: 218 HSRCPVLEGDMWSAIKYFYAKPISRGKVSAILDDDECTDQDDSCPAWAAKGECQRNPVFM 277

Query: 233 IGSPDYYGTCRKSCNAC 183
           IGSPDYYGTCRKSCNAC
Sbjct: 278 IGSPDYYGTCRKSCNAC 294


>gb|ESW24238.1| hypothetical protein PHAVU_004G113700g [Phaseolus vulgaris]
          Length = 293

 Score =  241 bits (616), Expect = 2e-61
 Identities = 124/257 (48%), Positives = 165/257 (64%), Gaps = 5/257 (1%)
 Frame = -2

Query: 938 SRSTLLASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDDDSPKIETNNIPANFGVSV 759
           SR   ++ + RVFLY+ FLS++EC+YLIS    +K  +                N G S+
Sbjct: 51  SRVVQISWQPRVFLYKGFLSDKECEYLISLAYAEKEKS--------------SGNGGTSL 96

Query: 758 DADDEIAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLAT 579
           + +D+I   I ER+S WTFLPKENSK + V+ +G E + Q   YF N++  ++  PL+AT
Sbjct: 97  EMEDDILARIEERLSIWTFLPKENSKPLQVMQYGSEENDQTLYYFTNKTNLELSGPLMAT 156

Query: 578 VILYLSNISRGGQIHFPQS--ENEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSSL 405
           V+LYLS+ ++GGQI FP+S   +   S C+ +N   +P KGNAI+FF+LH +A+PD+SS 
Sbjct: 157 VVLYLSDSTQGGQILFPESVPRSSSWSSCSNSNKTLQPVKGNAILFFSLHPSASPDKSSF 216

Query: 404 HARCPVLEGDMWCATKLFYLKDIS---TXXXXXXXXXXXXXENCSRWAAIGECQRNSIFM 234
           H+RCPVLEGDMW A K FY K IS                 ++C  WAA GECQRN +FM
Sbjct: 217 HSRCPVLEGDMWSAIKYFYAKPISRGKVSAILDDDECTDQDDSCPAWAAKGECQRNPVFM 276

Query: 233 IGSPDYYGTCRKSCNAC 183
           IGSPDYYGTCRKSCNAC
Sbjct: 277 IGSPDYYGTCRKSCNAC 293


>ref|XP_006413291.1| hypothetical protein EUTSA_v10025829mg [Eutrema salsugineum]
           gi|557114461|gb|ESQ54744.1| hypothetical protein
           EUTSA_v10025829mg [Eutrema salsugineum]
          Length = 303

 Score =  230 bits (586), Expect = 7e-58
 Identities = 119/250 (47%), Positives = 159/250 (63%), Gaps = 3/250 (1%)
 Frame = -2

Query: 923 LASKVRVFLYRDFLSEEECDYLISWVREKKSYNVSDDDSPKIETNNIPANFGVSVDADDE 744
           L+ + RVFLYR FLSEEECD+L S  +E    N  D D     +++     G +++  D 
Sbjct: 59  LSWQPRVFLYRGFLSEEECDHLKSLRKENSEVNSGDADGMTQLSSS-----GYALNVPDP 113

Query: 743 IAKTIVERISAWTFLPKENSKSMSVLHFGPENSKQNYNYFHNESAEQVGLPLLATVILYL 564
           +   I ERISAWTFLP+ENS  + V  +  E S +  +YF  ES+ +    LLATVILY+
Sbjct: 114 VVAGIEERISAWTFLPRENSGPIKVTSYASEKSGKKLDYFGEESSSETHESLLATVILYV 173

Query: 563 SNISRGGQIHFPQSE---NEMLSDCTKTNNIFRPSKGNAIVFFNLHLNATPDRSSLHARC 393
           S+ + GG++ FP SE    +  S+C++T NI RP KGNA++FF  HLNA+ D++S H RC
Sbjct: 174 SDTTEGGELLFPNSELKPKKSWSECSETGNILRPVKGNAVLFFTRHLNASLDQTSTHFRC 233

Query: 392 PVLEGDMWCATKLFYLKDISTXXXXXXXXXXXXXENCSRWAAIGECQRNSIFMIGSPDYY 213
           PVL+G++  ATKL Y K                 ENC RWA +GEC++N +FMIGSPDY+
Sbjct: 234 PVLKGELLVATKLIYAKKQERNDESGGGECSDEDENCRRWAELGECKKNPVFMIGSPDYF 293

Query: 212 GTCRKSCNAC 183
           GTCRKSCNAC
Sbjct: 294 GTCRKSCNAC 303

