BLASTX nr result

ID: Cephaelis21_contig00012328 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00012328
         (952 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002321853.1| predicted protein [Populus trichocarpa] gi|2...   190   4e-46
emb|CAN66820.1| hypothetical protein VITISV_003496 [Vitis vinifera]   170   4e-40
ref|XP_002510900.1| hypothetical protein RCOM_1498790 [Ricinus c...   160   5e-37
ref|XP_002864122.1| hydroxyproline-rich glycoprotein family prot...   126   9e-27
ref|NP_199981.2| hydroxyproline-rich glycoprotein family protein...   122   2e-25

>ref|XP_002321853.1| predicted protein [Populus trichocarpa] gi|222868849|gb|EEF05980.1|
           predicted protein [Populus trichocarpa]
          Length = 333

 Score =  190 bits (483), Expect = 4e-46
 Identities = 123/281 (43%), Positives = 150/281 (53%), Gaps = 7/281 (2%)
 Frame = +2

Query: 122 FRMERKEPSDATTRKRAKQSISVPFIWEEFPGIPKKNWXXXXXXXXXXXXXXX-YIASIP 298
           F+M   E  D++ +K  +Q  SVPF+WE  PG+ K++W                 IAS+P
Sbjct: 18  FKMAGLEVIDSSRKKHIRQPPSVPFLWEVRPGVAKRDWKPEVSSVTPVQLPPVKLIASVP 77

Query: 299 FKWEEKXXXXXXXXXXXXXXXTIAPAQESFPL------SPRRSRPCENSWSCLTDQDGDE 460
           F WEEK                I P      L      S       +       +  GDE
Sbjct: 78  FNWEEKPGKPLSCFSQSPESAFITPQANLLALPWHVTCSQGDDNHKQEDGDSGEENFGDE 137

Query: 461 IDMLESYPETCESETDDSFTSAAPSLVANGLVPTVAISNAVPVQLISETGFHSEQLQTPA 640
             M  S  E+   ETD+SF+SA  SL+AN +V +VAIS AVPVQ  S T   + Q +TP+
Sbjct: 138 QVMFNSDLESFSFETDESFSSAQ-SLLANCMVSSVAISTAVPVQTTSPTDDSNGQQETPS 196

Query: 641 SPASEKDSSTSSYETGATSLAGASFLEWLFPLLTPQSNFLEKAGHSEKSDSHIQTRKHGN 820
           SP SE DSSTSSY TG +SL GA+FLEWLFPL TP+S FL KA H  K         +  
Sbjct: 197 SPPSETDSSTSSYATGVSSLEGAAFLEWLFPLYTPKSGFLGKASHPRKES--FTPELNSR 254

Query: 821 DFYCEKNYSASITKPQTLGELILMSRRFSYQRKAVPMRTQN 943
           DF  E+N S  I KP TLGELI+MSRR S QRKAV MR QN
Sbjct: 255 DFDYERNSSVMIRKPLTLGELIMMSRRRSCQRKAVQMRKQN 295


>emb|CAN66820.1| hypothetical protein VITISV_003496 [Vitis vinifera]
          Length = 341

 Score =  170 bits (431), Expect = 4e-40
 Identities = 124/302 (41%), Positives = 152/302 (50%), Gaps = 33/302 (10%)
 Frame = +2

Query: 140 EPSDATTRKRAKQSISVPFIWEEFPGIPKKNWXXXXXXXXXXXXXXX------------- 280
           E   ++ RK+ +Q  SVPF+WEE PGIPKK+W                            
Sbjct: 3   EQQVSSNRKQIRQPPSVPFLWEEKPGIPKKDWKPEVTAVNPPPPPPPPPPPPPPPPPPPP 62

Query: 281 --------------YIASIPFKWEEKXXXXXXXXXXXXXXXTIAPAQESFPLSPRRSRPC 418
                          IASIPF WEEK                  P  +S  L P +   C
Sbjct: 63  PPPPPPPPPPPPIKLIASIPFTWEEKPGKPLPFFSG-------TPHDDSLLLFPPKKLVC 115

Query: 419 ENSWSCLTDQD-----GDEID-MLESYPETCESETDDSFTSAAPSLVANGLVPTVAISNA 580
            +S S    +D      DE D + ES  E    ETDDSF+SA PSL+AN L+ TVAIS A
Sbjct: 116 CSSLSDADSKDYEDDGDDEHDGIFESDFEAFGFETDDSFSSA-PSLLANRLMSTVAISTA 174

Query: 581 VPVQLISETGFHSEQLQTPASPASEKDSSTSSYETGATSLAGASFLEWLFPLLTPQSNFL 760
           VPVQ  S     ++Q ++P+SPASE +SSTS Y TG TSL G+SFL+ LFPL  P S FL
Sbjct: 175 VPVQKTSLNEDSNDQPESPSSPASETNSSTSXYATGTTSLVGSSFLDCLFPLFPPNSGFL 234

Query: 761 EKAGHSEKSDSHIQTRKHGNDFYCEKNYSASITKPQTLGELILMSRRFSYQRKAVPMRTQ 940
            K G  E S    + +  G D   E N S  + +  TLGELI+ SRR SY+RKAV MR  
Sbjct: 235 AKVGCPEGSPPPPELQNKGLD--RETNSSVIVRRAPTLGELIMKSRRRSYRRKAVQMRKH 292

Query: 941 NL 946
           NL
Sbjct: 293 NL 294


>ref|XP_002510900.1| hypothetical protein RCOM_1498790 [Ricinus communis]
           gi|223550015|gb|EEF51502.1| hypothetical protein
           RCOM_1498790 [Ricinus communis]
          Length = 278

 Score =  160 bits (404), Expect = 5e-37
 Identities = 116/281 (41%), Positives = 144/281 (51%), Gaps = 7/281 (2%)
 Frame = +2

Query: 128 MERKEPSDATTRKRAKQSISVPFIWEEFPGIPKKNWXXXXXXXXXXXXXXXY--IASIPF 301
           M   E  +A+ RK  +Q   VPF+WEE PGI KK+W                  IAS+PF
Sbjct: 1   MTENEIIEASKRKHIRQPPFVPFLWEERPGIAKKDWKPVVSSVTTLALPPPVKLIASVPF 60

Query: 302 KWEEKXXXXXXXXXXXXXXXTIAPAQE--SFPLSPRRSRPCE--NSWSCLTDQDGD-EID 466
            WEEK                 A      S P+  +R   CE  N      D  G+ E  
Sbjct: 61  NWEEKPGKPLPCFSQPPMESPPATLNSLPSPPMYYQRCDDCEFNNENRAGHDNYGEKEEG 120

Query: 467 MLESYPETCESETDDSFTSAAPSLVANGLVPTVAISNAVPVQLISETGFHSEQLQTPASP 646
           + +   E+   ETDDS +SA PSL+AN LV +VA+S+AVPV          + L+TP+SP
Sbjct: 121 IFDLDIESFSFETDDSLSSA-PSLLANCLVSSVAVSDAVPV----------DHLETPSSP 169

Query: 647 ASEKDSSTSSYETGATSLAGASFLEWLFPLLTPQSNFLEKAGHSEKSDSHIQTRKHGNDF 826
           AS+ DSSTSSY TG +SL GAS LE LFPL  P S FLE   HS K        ++ N  
Sbjct: 170 ASDTDSSTSSYATGISSLTGASLLECLFPLYAPDSGFLETVAHSTKGSLIATEVQNCNSN 229

Query: 827 YCEKNYSASITKPQTLGELILMSRRFSYQRKAVPMRTQNLP 949
               N   +   P TLGELI+MSRR S QRKA+ M  +NLP
Sbjct: 230 RASDNIVTTKRTP-TLGELIMMSRRRSCQRKAIQMGNRNLP 269


>ref|XP_002864122.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
           subsp. lyrata] gi|297309957|gb|EFH40381.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 343

 Score =  126 bits (316), Expect = 9e-27
 Identities = 105/313 (33%), Positives = 136/313 (43%), Gaps = 41/313 (13%)
 Frame = +2

Query: 128 MERKEPSDATT-RKRAKQSISVPFIWEEFPGIPKKNW--------XXXXXXXXXXXXXXX 280
           M   EP +    RK+ +Q  SVPFIWEE PG PKKNW                       
Sbjct: 1   MSEMEPKETKPPRKQLRQPPSVPFIWEERPGFPKKNWQPSLATFVPSPPLLPPPVPVPVK 60

Query: 281 YIASIPFKWEE--------KXXXXXXXXXXXXXXXTIAPAQESFPLSPRRSRPCENSWS- 433
            + S+PF+WEE                        T  P     P+  ++       W  
Sbjct: 61  LVTSVPFRWEETPGKPLPPSSNDPPQLPHPPLETATTTPLPPPVPVPVKQVTSVPFDWEE 120

Query: 434 -------CLTDQDGDEI-----------DMLESYPETCESETDDSFTSAAPSLVANGLVP 559
                  C  D +  E+             +E+  +  +  + DSF S+ PSL+A     
Sbjct: 121 TPGQPYPCFVDTNPPELLDQPLPPPPMYGEVETSSDIFDDASSDSF-SSVPSLLATN--R 177

Query: 560 TVAISNAVPVQLISETGFHSEQLQTPASPASEKDSSTSSYETGATSLAGASFLEWLFPLL 739
           +V+IS AV V    +   +      P SPA E D STSSY TGA+SL GASFLE LFP L
Sbjct: 178 SVSISGAVAVDEFDD-NLNRVTRSMPTSPAYESDDSTSSYMTGASSLVGASFLEKLFPRL 236

Query: 740 TPQSNFLEKAGHSEKSDSHIQTRK-HGNDFYCEKNYSASI----TKPQTLGELILMSRRF 904
            P    LEK   ++  D  + T   H       ++ + SI      PQTLGELI+MSRR 
Sbjct: 237 LP----LEKVKSADSEDVQVSTHPLHEEVKLTTESDNMSIGFPVRAPQTLGELIMMSRRR 292

Query: 905 SYQRKAVPMRTQN 943
           SY R+AV MR QN
Sbjct: 293 SYMRRAVEMRKQN 305


>ref|NP_199981.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana] gi|332008731|gb|AED96114.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana]
          Length = 343

 Score =  122 bits (305), Expect = 2e-25
 Identities = 105/312 (33%), Positives = 135/312 (43%), Gaps = 40/312 (12%)
 Frame = +2

Query: 128 MERKEPSDATTRKRAKQSISVPFIWEEFPGIPKKNW--------XXXXXXXXXXXXXXXY 283
           ME KE      RK+ +Q  SVPFIWEE PG PKKNW                        
Sbjct: 4   MEPKETKPP--RKQLRQPPSVPFIWEERPGFPKKNWQPSLATFVPSPPPLPPPIPVPVKL 61

Query: 284 IASIPFKWEEKXXXXXXXXXXXXXXXTIAPAQESFPL----------------------S 397
           + S+PF+WEE                   P + + P                       +
Sbjct: 62  VTSVPFRWEETPGKPLPASSNDPPQLPHPPLETATPTPLPPPVPVPVKQVTSVPFDWEET 121

Query: 398 PRRSRPC--ENSWSCLTDQDGDEIDM---LESYPETCESETDDSFTSAAPSLVANGLVPT 562
           P +  PC  + S   L DQ      M   +E+  +  +  + DSF S+ PSL+A     +
Sbjct: 122 PGQPYPCFVDTSPPELLDQPLPPPPMYGDVETSSDIFDDASSDSF-SSVPSLLATN--RS 178

Query: 563 VAISNAVPVQLISETGFHSEQLQTPASPASEKDSSTSSYETGATSLAGASFLEWLFPLLT 742
           V+IS AV V    +   ++     P SPA E D STSSY TGA+SL GASFLE LFP L 
Sbjct: 179 VSISGAVAVDEFDD-NLNTVTSSMPTSPAYESDDSTSSYMTGASSLVGASFLEKLFPRLL 237

Query: 743 PQSNFLEKAGHSEKSDSHIQTRKHGNDFYC-----EKNYSASITKPQTLGELILMSRRFS 907
           P     EK   +   D  + T     +          +    +  PQTLGELI+MSRR S
Sbjct: 238 PS----EKVKAAVSEDVQVSTHPLHEEVKLTTETDNMSIGFPVRTPQTLGELIMMSRRRS 293

Query: 908 YQRKAVPMRTQN 943
           Y R+AV MR QN
Sbjct: 294 YMRRAVEMRKQN 305


Top