BLASTX nr result

ID: Cephaelis21_contig00002144 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00002144
         (1113 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alph...   293   5e-77
emb|CBI22704.3| unnamed protein product [Vitis vinifera]              290   4e-76
ref|XP_002318810.1| predicted protein [Populus trichocarpa] gi|2...   280   4e-73
ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative...   277   4e-72
ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775...   269   1e-69

>ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
           vinifera]
          Length = 312

 Score =  293 bits (750), Expect = 5e-77
 Identities = 151/292 (51%), Positives = 197/292 (67%), Gaps = 24/292 (8%)
 Frame = -3

Query: 877 SAETIRKELRTKEVGGQQINKRSGG---IDVIDPSRVTQVAWRPRVFLYKSFLSDEECDY 707
           S + IRKELR  +V  Q+   + G     + +DPSRV Q++W+PR FLY+ FLSDEECD+
Sbjct: 21  STQVIRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQLSWQPRAFLYRGFLSDEECDH 80

Query: 706 LIYW-LQRKKSSSVAGGD---------LKEID----TEDEVFQRIDERISAWTFIPKENG 569
           LI   L +K+  +  GGD         LK  +     +DEV  RI++RISAWTF+PKEN 
Sbjct: 81  LISLALGKKEELATNGGDSGNVVLKRLLKSSEGPLYIDDEVAARIEKRISAWTFLPKENS 140

Query: 568 RPVRVLHFDGEESKQNHKNFGEEFIKLPSAPLMATVVLYLSNVSQGGQILFPQSENE--I 395
            P+ V+ +  E +KQ +  F  +       PLMATV+L+LSNV++GG++ FP+SE++  I
Sbjct: 141 EPLEVVQYQFENAKQKYNYFSNKSTSKFGEPLMATVLLHLSNVTRGGELFFPESESKSGI 200

Query: 394 LSDCTRSSMALKPTKGNAVVFFNAHLSAAPDKSSSHARCPLFGGDMWCATKFFHLRAIRR 215
           LSDCT SS  L+P KGNA++FFN H +A+PDKSSS+ARCP+  G+MWCATKFFHLRAI R
Sbjct: 201 LSDCTESSSGLRPVKGNAILFFNVHPNASPDKSSSYARCPVLEGEMWCATKFFHLRAIGR 260

Query: 214 -----XXXXXXXXXXXENCPSWAASGECQRNPVFMVGSPDYYGTCRKSCNAC 74
                           ENCP WA+ GECQRNP++M+GSPDYYGTCRKSCN C
Sbjct: 261 ENVSFKLDGGECTDEDENCPKWASIGECQRNPIYMIGSPDYYGTCRKSCNVC 312


>emb|CBI22704.3| unnamed protein product [Vitis vinifera]
          Length = 317

 Score =  290 bits (742), Expect = 4e-76
 Identities = 151/297 (50%), Positives = 196/297 (65%), Gaps = 29/297 (9%)
 Frame = -3

Query: 877 SAETIRKELRTKEVGGQQINKRSGG---IDVIDPSRVTQVAWRPRVFLYKSFLSDEECDY 707
           S + IRKELR  +V  Q+   + G     + +DPSRV Q++W+PR FLY+ FLSDEECD+
Sbjct: 21  STQVIRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQLSWQPRAFLYRGFLSDEECDH 80

Query: 706 LIYW-LQRKKSSSVAGGD---------LKEID----TEDEVFQRIDERISAWTFIPKENG 569
           LI   L +K+  +  GGD         LK  +     +DEV  RI++RISAWTF+PKEN 
Sbjct: 81  LISLALGKKEELATNGGDSGNVVLKRLLKSSEGPLYIDDEVAARIEKRISAWTFLPKENS 140

Query: 568 RPVRVLHFDGEESKQNHKNFGEEFIKLPSAPLMATVVLYLSNVSQGGQILFPQSE----- 404
            P+ V+ +  E +KQ +  F  +       PLMATV+L+LSNV++GG++ FP+SE     
Sbjct: 141 EPLEVVQYQFENAKQKYNYFSNKSTSKFGEPLMATVLLHLSNVTRGGELFFPESELKNSQ 200

Query: 403 --NEILSDCTRSSMALKPTKGNAVVFFNAHLSAAPDKSSSHARCPLFGGDMWCATKFFHL 230
             + ILSDCT SS  L+P KGNA++FFN H +A+PDKSSS+ARCP+  G+MWCATKFFHL
Sbjct: 201 SKSGILSDCTESSSGLRPVKGNAILFFNVHPNASPDKSSSYARCPVLEGEMWCATKFFHL 260

Query: 229 RAIRR-----XXXXXXXXXXXENCPSWAASGECQRNPVFMVGSPDYYGTCRKSCNAC 74
           RAI R                ENCP WA+ GECQRNP++M+GSPDYYGTCRKSCN C
Sbjct: 261 RAIGRENVSFKLDGGECTDEDENCPKWASIGECQRNPIYMIGSPDYYGTCRKSCNVC 317


>ref|XP_002318810.1| predicted protein [Populus trichocarpa] gi|222859483|gb|EEE97030.1|
           predicted protein [Populus trichocarpa]
          Length = 310

 Score =  280 bits (717), Expect = 4e-73
 Identities = 143/302 (47%), Positives = 196/302 (64%), Gaps = 20/302 (6%)
 Frame = -3

Query: 919 FLIVFLSDALSGCLSAETIRKELRTKEVGGQQINKRSGGIDV--IDPSRVTQVAWRPRVF 746
           F+++ L+   S C    + RKELR KE   + + +    I    +DPSRV  V+W+PRVF
Sbjct: 10  FMVLTLTTQFSLCFGKSS-RKELRNKEAHLETMIQFGSSIQTNWVDPSRVVTVSWQPRVF 68

Query: 745 LYKSFLSDEECDYLIYWLQRKKSSSVAGGD--------------LKEIDTEDEVFQRIDE 608
           +YK FL+DEECD+LI   Q  K +S    D                 ++ +D +  RI+E
Sbjct: 69  VYKGFLTDEECDHLISLAQGTKETSEGKDDDSGRIERNRLFASSTSLLNMDDNILSRIEE 128

Query: 607 RISAWTFIPKENGRPVRVLHFDGEESKQNHKNFGEEFIKLPSAPLMATVVLYLSNVSQGG 428
           R+SAWT +PKEN +P++V+H+  E++K     FG +   + S PLMAT+V YLSNV+QGG
Sbjct: 129 RVSAWTLLPKENSKPLQVMHYGIEDAKNYFDYFGNKSAIISSEPLMATLVFYLSNVTQGG 188

Query: 427 QILFPQSE--NEILSDCTRSSMALKPTKGNAVVFFNAHLSAAPDKSSSHARCPLFGGDMW 254
           +I FP+SE  N+I SDCT+ S +L+P KGNA++FF  H + +PD  SSH+RCP+  G+MW
Sbjct: 189 EIFFPKSEVKNKIWSDCTKISDSLRPIKGNAILFFTVHPNTSPDMGSSHSRCPVLEGEMW 248

Query: 253 CATKFFHLRAIR--RXXXXXXXXXXXENCPSWAASGECQRNPVFMVGSPDYYGTCRKSCN 80
            ATK F+LRAI+              ENCPSWAA GEC++NPV+M+GSPDY+GTCRKSCN
Sbjct: 249 YATKKFYLRAIKVFSDSEGSECTDEDENCPSWAALGECEKNPVYMIGSPDYFGTCRKSCN 308

Query: 79  AC 74
           AC
Sbjct: 309 AC 310


>ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
           gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha
           subunit, putative [Ricinus communis]
          Length = 309

 Score =  277 bits (708), Expect = 4e-72
 Identities = 146/305 (47%), Positives = 195/305 (63%), Gaps = 20/305 (6%)
 Frame = -3

Query: 928 FSFFLIVFLSDALSGCLSAETIRKELRTKEVGGQQINKRSGGIDV--IDPSRVTQVAWRP 755
           +   L+V ++ A      AE+IRKELR KEV  + I +    +    I   +V Q++WRP
Sbjct: 6   YFLLLVVLIASAPFHFCFAESIRKELRDKEVKHETIIQLGSSVQTNRISLLQVVQLSWRP 65

Query: 754 RVFLYKSFLSDEECDYLIYWLQRKKSSSVAGGDLKEIDTE-----------DEVFQRIDE 608
           RVFLYK FL+DEECD LI      K  S   GD    + +           D++  RI+E
Sbjct: 66  RVFLYKGFLTDEECDRLISLAHGAKEISKGKGDGSRNNIQLASSESRSHIYDDLLARIEE 125

Query: 607 RISAWTFIPKENGRPVRVLHFDGEESKQNHKNFGEEFIKLPSAPLMATVVLYLSNVSQGG 428
           RISAWTFIPKEN +P++V+H+  EE++++   F  + + + +  LMAT+VLYLSNV++GG
Sbjct: 126 RISAWTFIPKENSKPLQVMHYGIEEAREHFDYFDNKTL-ISNVSLMATLVLYLSNVTRGG 184

Query: 427 QILFPQSE--NEILSDCTRSSMALKPTKGNAVVFFNAHLSAAPDKSSSHARCPLFGGDMW 254
           +ILFP+SE  +++ SDCT+ S  L+P KGNAV+ FNAHL+A+ D  S+H RCP+  G+MW
Sbjct: 185 EILFPKSELKDKVWSDCTKDSSILRPVKGNAVLIFNAHLNASADSRSTHGRCPVLEGEMW 244

Query: 253 CATKFFHLRAIRR-----XXXXXXXXXXXENCPSWAASGECQRNPVFMVGSPDYYGTCRK 89
           CATK F +RA                   +NCP WAA GECQRNP+FM GSPDYYGTCRK
Sbjct: 245 CATKQFLVRATNEEKSLPDSDGSDCTDEDDNCPKWAALGECQRNPIFMTGSPDYYGTCRK 304

Query: 88  SCNAC 74
           SCNAC
Sbjct: 305 SCNAC 309


>ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775928 [Glycine max]
          Length = 302

 Score =  269 bits (687), Expect = 1e-69
 Identities = 143/299 (47%), Positives = 189/299 (63%), Gaps = 14/299 (4%)
 Frame = -3

Query: 928 FSFFLIVFLSDALSGCLSAETIRKELRTKEVGGQQINKRSGGI-DVIDPSRVTQVAWRPR 752
           F FF ++  S         E+ RKELR+K+    Q+ + S    + I+PSRV Q++W+PR
Sbjct: 11  FVFFFLIATS-------LTESSRKELRSKQETALQMLEHSIHYSNRINPSRVVQISWQPR 63

Query: 751 VFLYKSFLSDEECDYLIYWLQRKKSSSVAGGDLKE-----IDTEDEVFQRIDERISAWTF 587
           VFLYK FLSD+ECDYL+      K  S   G   E     +D ED++  RI+ER+S W F
Sbjct: 64  VFLYKGFLSDKECDYLVSLAYAVKEKSSGNGGFSEGVETFLDIEDDILARIEERLSLWAF 123

Query: 586 IPKENGRPVRVLHFDGEESKQNHKNFGEEFIKLPSAPLMATVVLYLSNVS-QGGQILFPQ 410
           +PKE  +P++V+H+  E + +N   F  +     S PLMAT+VLYLSN + QGGQILFP+
Sbjct: 124 LPKEYSKPLQVMHYGPEPNGRNLDYFTNKTQLELSGPLMATIVLYLSNAATQGGQILFPE 183

Query: 409 S--ENEILSDCTRSSMALKPTKGNAVVFFNAHLSAAPDKSSSHARCPLFGGDMWCATKFF 236
           S   +   S C+ SS  L+P KGNA++FF+ H SA+PDK+S HARCP+  G+MW A K+F
Sbjct: 184 SVPRSSSWSSCSNSSNILQPVKGNAILFFSLHPSASPDKNSFHARCPVLEGNMWSAIKYF 243

Query: 235 HLRAIRRXXXXXXXXXXXE-----NCPSWAASGECQRNPVFMVGSPDYYGTCRKSCNAC 74
           + + I                   NCP+WAA GECQRNPVFM+GSPDYYGTCRKSCNAC
Sbjct: 244 YAKPISSGEVSAISDGGECTDEDDNCPAWAAMGECQRNPVFMIGSPDYYGTCRKSCNAC 302
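
Note on the percentages in the alignment headers: the values shown in parentheses after Identities, Positives and Gaps appear to be truncated, not rounded (for example, 151/292 is 51.7% but is reported as 51%). The sketch below recomputes them from the five header lines of this report. The truncation rule is an assumption inferred from these numbers, not something taken from BLAST documentation, and the names HEADER_LINES, parse_stats and floor_percent are hypothetical helpers introduced only for illustration.

```python
import re

# Statistics lines copied verbatim from the five alignments in this report.
HEADER_LINES = [
    " Identities = 151/292 (51%), Positives = 197/292 (67%), Gaps = 24/292 (8%)",
    " Identities = 151/297 (50%), Positives = 196/297 (65%), Gaps = 29/297 (9%)",
    " Identities = 143/302 (47%), Positives = 196/302 (64%), Gaps = 20/302 (6%)",
    " Identities = 146/305 (47%), Positives = 195/305 (63%), Gaps = 20/305 (6%)",
    " Identities = 143/299 (47%), Positives = 189/299 (63%), Gaps = 14/299 (4%)",
]

# Matches e.g. "Identities = 151/292 (51%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, reported_percent)} for one line."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in STAT.finditer(line)
    }


def floor_percent(count, length):
    """Percentage with the fractional part dropped (assumed reporting rule)."""
    return 100 * count // length


def mismatches(lines):
    """List every statistic whose reported percentage differs from the floor value."""
    bad = []
    for line in lines:
        for name, (count, length, reported) in parse_stats(line).items():
            if floor_percent(count, length) != reported:
                bad.append((name, count, length, reported))
    return bad


if __name__ == "__main__":
    for line in HEADER_LINES:
        for name, (count, length, reported) in parse_stats(line).items():
            exact = 100.0 * count / length
            print(f"{name:10s} {count:3d}/{length:3d}  exact={exact:5.2f}%  reported={reported}%")
    print("mismatches:", mismatches(HEADER_LINES))
```

Under that assumption every reported percentage in this report is reproduced, so the counts (for example 151/292) are the more precise figures to quote when comparing hits.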

