BLASTX nr result

ID: Angelica22_contig00025170

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00025170
         (640 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003528505.1| PREDICTED: uncharacterized protein LOC100802...    73   5e-11
ref|XP_003627267.1| hypothetical protein MTR_8g020530 [Medicago ...    71   1e-10
ref|NP_177404.1| hydroxyproline-rich glycoprotein family protein...    70   3e-10
ref|XP_002304137.1| predicted protein [Populus trichocarpa] gi|2...    69   7e-10
ref|NP_683431.1| proline-rich family protein [Arabidopsis thalia...    66   6e-09

>ref|XP_003528505.1| PREDICTED: uncharacterized protein LOC100802107 [Glycine max]
          Length = 161

 Score = 72.8 bits (177), Expect = 5e-11
 Identities = 44/99 (44%), Positives = 48/99 (48%), Gaps = 19/99 (19%)
 Frame = -1

Query: 490 PPPRKTSNHPPPLHDKAL--VPQHGL-----------------HTYHKHKTPPRPPPSED 368
           PPP   S  PPP     L  VP  G                     H+   PP PPP   
Sbjct: 57  PPPLSLSPPPPPSKSMTLSFVPPPGAPPPYPNNMPNVRRPPHRRRRHRRPPPPPPPPPPH 116

Query: 367 KLNLGKKVGLLFTGIVAILQVCVVAFLVIKSRQMFKDQD 251
           K+N GKKVGLLF GI AI+QV VV FLVIK RQ+ K  D
Sbjct: 117 KMNAGKKVGLLFVGIAAIMQVGVVGFLVIKRRQLLKSDD 155


>ref|XP_003627267.1| hypothetical protein MTR_8g020530 [Medicago truncatula]
           gi|355521289|gb|AET01743.1| hypothetical protein
           MTR_8g020530 [Medicago truncatula]
          Length = 168

 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 42/95 (44%), Positives = 50/95 (52%), Gaps = 14/95 (14%)
 Frame = -1

Query: 493 TPPPRKTSNHPPPLHDKALVPQHG------------LHTYHKHKT--PPRPPPSEDKLNL 356
           TPPP +  N PPP     + PQ+             L  +H H    PP  PP +  +N 
Sbjct: 66  TPPPPQDFNSPPPPTLPDIPPQNQNPSPISSPPPPHLRRWHDHVQLPPPLAPPPQHSMNA 125

Query: 355 GKKVGLLFTGIVAILQVCVVAFLVIKSRQMFKDQD 251
           GKKVGLLF GI AI+QV  V FLVIK RQ+ K  D
Sbjct: 126 GKKVGLLFVGIAAIMQVGFVGFLVIKRRQLLKTND 160


>ref|NP_177404.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana] gi|334183868|ref|NP_001185384.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana]
           gi|12323771|gb|AAG51851.1|AC010926_14 hypothetical
           protein; 72245-71838 [Arabidopsis thaliana]
           gi|38454042|gb|AAR20715.1| At1g72600 [Arabidopsis
           thaliana] gi|45592904|gb|AAS68106.1| At1g72600
           [Arabidopsis thaliana] gi|332197225|gb|AEE35346.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana] gi|332197226|gb|AEE35347.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana]
          Length = 135

 Score = 70.5 bits (171), Expect = 3e-10
 Identities = 37/86 (43%), Positives = 44/86 (51%), Gaps = 11/86 (12%)
 Frame = -1

Query: 490 PPPRKTSNHPPPLHDKALVPQHGLHTYHKH-----------KTPPRPPPSEDKLNLGKKV 344
           PPP   S  PPP    A+     +H  H H           K PP PPP + K+N GK V
Sbjct: 42  PPPPPLSLSPPPSPITAIESNKAIHEKHHHRRKKWRQRRHHKHPPPPPPKKQKVNTGKTV 101

Query: 343 GLLFTGIVAILQVCVVAFLVIKSRQM 266
           GL F G+ A LQV V AFL+ K RQ+
Sbjct: 102 GLFFAGVAAALQVVVAAFLIFKRRQL 127


>ref|XP_002304137.1| predicted protein [Populus trichocarpa] gi|222841569|gb|EEE79116.1|
           predicted protein [Populus trichocarpa]
          Length = 167

 Score = 68.9 bits (167), Expect = 7e-10
 Identities = 37/81 (45%), Positives = 48/81 (59%), Gaps = 1/81 (1%)
 Frame = -1

Query: 490 PPPRKTSNHPPPLHDKALVPQHGLHTYHKHKTPPRPPPSED-KLNLGKKVGLLFTGIVAI 314
           PP +K    PPP  D++      +     H  PP PPPS++ ++N GKK+GLLF GI AI
Sbjct: 83  PPRKKLQPPPPPPRDRST---GNVMRRRSHPPPPPPPPSKNHQMNSGKKIGLLFVGIAAI 139

Query: 313 LQVCVVAFLVIKSRQMFKDQD 251
           LQ+ VV FL  K RQ+ K  D
Sbjct: 140 LQIGVVGFLAYKRRQLLKIND 160


>ref|NP_683431.1| proline-rich family protein [Arabidopsis thaliana]
           gi|332194942|gb|AEE33063.1| proline-rich family protein
           [Arabidopsis thaliana]
          Length = 169

 Score = 65.9 bits (159), Expect = 6e-09
 Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
 Frame = -1

Query: 490 PPPRKTSNHPPPLHDKALVPQHGLHTYHKHKTPPRPP--PSEDKLNLGKKVGLLFTGIVA 317
           PPPR   + PPP   +  +P+       +H  PPR P  P  D LN GK VGL+F G++A
Sbjct: 96  PPPR---SQPPPKPPQKNLPR-------RHPPPPRSPEKPKRDGLNKGKTVGLVFVGLIA 145

Query: 316 ILQVCVVAFLVIKSRQMFKDQDVN 245
           +LQV VV FLV K +Q+   +D N
Sbjct: 146 MLQVIVVVFLVFKRKQLLSLKDTN 169

