BLASTX nr result

ID: Rehmannia23_contig00005823 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00005823
         (463 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   105   6e-21
gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus pe...    84   1e-14
ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutr...    74   2e-11
gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily p...    73   3e-11
ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citr...    71   1e-10
gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]      70   3e-10
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...    69   6e-10
ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containi...    65   1e-08
ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar...    65   1e-08
ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containi...    62   6e-08
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...    61   1e-07
ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Caps...    60   2e-07
ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containi...    58   1e-06

>ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610
           [Vitis vinifera]
          Length = 763

 Score =  105 bits (262), Expect = 6e-21
 Identities = 61/141 (43%), Positives = 86/141 (60%)
 Frame = -3

Query: 425 GFKSGPSKCKIFSLCPRGTVNFLFKPKKGSFGAAFALTWALEEPEVGNNVGDNEELGRLN 246
           G  SG SK KIF LC R         K+GSFGA+FAL WALE+  +GN     E+   ++
Sbjct: 79  GLLSGYSKLKIFLLCER---------KRGSFGASFALAWALEQQAIGNEFV-KEDSNSIH 128

Query: 245 DVSDNRDGIEFTHAQVDDVKEINENDSHDNDDDQKSGEKTENGNNRRVDVRALAFKLHSS 66
            ++ N + ++    +VD  ++ +END+ +  + +K+GE  E   +R VDVRALA  L  +
Sbjct: 129 SLAGNTETVDIDCLKVDGARDGDENDNEEEKEAEKNGEVIEE-KSRNVDVRALAHGLEFA 187

Query: 65  KNADDVEKVLKEKGNLPLQVY 3
             ADDVE+VLK+K  LPLQVY
Sbjct: 188 TTADDVEEVLKDKVELPLQVY 208


>gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica]
          Length = 734

 Score = 84.3 bits (207), Expect = 1e-14
 Identities = 56/143 (39%), Positives = 79/143 (55%), Gaps = 2/143 (1%)
 Frame = -3

Query: 425 GFKSGPSKCKIFSLCPRGTVNFLFKPKKGSFGAAFALTWALEEPEVGNNVGDNEELG--R 252
           G  SG SK K   +C         + KK SFGA+F + WALEE  +GN++   E     R
Sbjct: 79  GCFSGYSKLKPARIC---------QSKKRSFGASFVVAWALEEQAIGNDIVIEESTSEHR 129

Query: 251 LNDVSDNRDGIEFTHAQVDDVKEINENDSHDNDDDQKSGEKTENGNNRRVDVRALAFKLH 72
           L+   +++ G++  H  VD+     E     N+ D ++G       N ++DVRALA  L 
Sbjct: 130 LSGEGESK-GVD--HLIVDEA----EGGEDKNEVDVRNGGANWEQKNEKIDVRALALSLQ 182

Query: 71  SSKNADDVEKVLKEKGNLPLQVY 3
            +K ADDVE VLK+KG+LPLQV+
Sbjct: 183 FAKTADDVEVVLKDKGDLPLQVF 205


>ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum]
           gi|557101036|gb|ESQ41399.1| hypothetical protein
           EUTSA_v10015672mg [Eutrema salsugineum]
          Length = 688

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 52/139 (37%), Positives = 69/139 (49%), Gaps = 1/139 (0%)
 Frame = -3

Query: 416 SGPSKCKIFSLCPRGTVNFLFKPKKGSFGAAFALTWALEEPEVGNNVGDNEELGRLNDVS 237
           S   K +  ++ P   V FL +PKK   G++  + WA E+ E+G      EE+ R     
Sbjct: 56  SSNRKFEGLAINPSTKVLFLCEPKKSLSGSSVGVGWATEQRELG------EEVSR----- 104

Query: 236 DNRDGIEFTHAQVDDVKEINENDS-HDNDDDQKSGEKTENGNNRRVDVRALAFKLHSSKN 60
                        +D   +  +DS H        GEKT    N RVDVR LA+ L ++K 
Sbjct: 105 -------------EDSSSVTASDSDHSKSQAVTGGEKT----NARVDVRELAYSLRAAKT 147

Query: 59  ADDVEKVLKEKGNLPLQVY 3
           ADDV+ VLKEKG LPLQVY
Sbjct: 148 ADDVDVVLKEKGELPLQVY 166


>gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative
           [Theobroma cacao]
          Length = 741

 Score = 73.2 bits (178), Expect = 3e-11
 Identities = 54/155 (34%), Positives = 72/155 (46%), Gaps = 8/155 (5%)
 Frame = -3

Query: 443 LSSYFRGFKSGPS--------KCKIFSLCPRGTVNFLFKPKKGSFGAAFALTWALEEPEV 288
           LSSY R  +SG          +C          V    +PK+GS     AL WALE+ E+
Sbjct: 52  LSSYSRFSRSGTCYRNLNCSLRCGFLCWYSELKVVLFCEPKRGSSRGLVALAWALEQQEI 111

Query: 287 GNNVGDNEELGRLNDVSDNRDGIEFTHAQVDDVKEINENDSHDNDDDQKSGEKTENGNNR 108
           GN +   E        S +RDG              N N+  + + D  S  + E   + 
Sbjct: 112 GNELEREE--------SHSRDGD-------------NGNEDKNEEMDASSEGEVELEESA 150

Query: 107 RVDVRALAFKLHSSKNADDVEKVLKEKGNLPLQVY 3
           R+DVRALA  L  +K ADD+EKVLK+   LPLQV+
Sbjct: 151 RLDVRALASSLQFAKTADDIEKVLKDMDELPLQVH 185


>ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citrus clementina]
           gi|568831365|ref|XP_006469938.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At3g46610-like [Citrus sinensis]
           gi|557549828|gb|ESR60457.1| hypothetical protein
           CICLE_v10014357mg [Citrus clementina]
          Length = 768

 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 47/144 (32%), Positives = 74/144 (51%), Gaps = 9/144 (6%)
 Frame = -3

Query: 407 SKCKIFSLCPRGTVNFLFKPKKGSFGAAFALTWALEEPEVGNNVGDNEELGRLNDVSDNR 228
           SKC+  S      +    +PKK  FGA+    W++E+ E+GN        G L +  ++ 
Sbjct: 76  SKCEFLSGFSSHKLVLFCEPKKSYFGASVMFAWSMEQQEIGN--------GLLVEEPNSA 127

Query: 227 DGIEF-THAQVDDVKEINENDSHDNDDDQKSGEKTENGNNR--------RVDVRALAFKL 75
           DG+   T + + D + ++  +   ++ +Q   E+ E    R        RVDV+ALA  L
Sbjct: 128 DGLLVETESDIVDYRSVHRVEDTGDNGNQVESEEVEIIGERGVGKQKSGRVDVKALAQSL 187

Query: 74  HSSKNADDVEKVLKEKGNLPLQVY 3
             +K ADDVE+VLK+ G LP QV+
Sbjct: 188 WHTKTADDVEEVLKDMGELPPQVH 211


>gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]
          Length = 737

 Score = 70.1 bits (170), Expect = 3e-10
 Identities = 53/144 (36%), Positives = 73/144 (50%), Gaps = 3/144 (2%)
 Frame = -3

Query: 425 GFKSGPSKCKIFSLCPRGTVNFLFKPKK-GSFGAAFALTWALEEPEVGNNVGDNEELGRL 249
           GF  G SK K+   C         KPKK  S GA+ AL  ALEE  VG+ +   EEL   
Sbjct: 81  GFLFGFSKLKVARFC---------KPKKKSSLGASVALAGALEEQAVGSAIRI-EELDSE 130

Query: 248 NDVSDNRDGIEFTHAQVDDVKEINENDSHDND--DDQKSGEKTENGNNRRVDVRALAFKL 75
             +S           +++   + N ++  +N   +D  S EK+      +VDVR LA  L
Sbjct: 131 CSLSGKLSDGHLLLGRIESGDDNNGDEEQENKVIEDVGSEEKSREEKGGKVDVRELASSL 190

Query: 74  HSSKNADDVEKVLKEKGNLPLQVY 3
             +K ADDV++VLK+KG LP QV+
Sbjct: 191 RFAKTADDVDEVLKDKGELPPQVF 214


>ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223533700|gb|EEF35435.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 671

 Score = 68.9 bits (167), Expect = 6e-10
 Identities = 42/118 (35%), Positives = 65/118 (55%), Gaps = 6/118 (5%)
 Frame = -3

Query: 338 SFGAAFALTWALEEPEVGNNVG------DNEELGRLNDVSDNRDGIEFTHAQVDDVKEIN 177
           SF ++ A  WAL++ ++ +         D+  LG+    S+  D       +++D  + N
Sbjct: 3   SFRSSIAFAWALQKQDISSEFHGVEPSLDDGLLGK----SEKEDVNPHNLGRLEDSDDDN 58

Query: 176 ENDSHDNDDDQKSGEKTENGNNRRVDVRALAFKLHSSKNADDVEKVLKEKGNLPLQVY 3
            N   + + D +S E       R +DVR+LA  LHS++ ADDVE+VLK+KG LPLQVY
Sbjct: 59  NNQEDNIELDLRSKEGVGEEKCRSIDVRSLARSLHSAQTADDVEEVLKDKGELPLQVY 116


>ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46610-like [Solanum tuberosum]
          Length = 740

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 46/121 (38%), Positives = 66/121 (54%), Gaps = 3/121 (2%)
 Frame = -3

Query: 356 FKPKKGSFGAAFALTWALEEPEVGNNVGDNEELGRLNDVSDNRDGIE-FTHAQVDDVKEI 180
           F+P+K     +FALT A EE ++  +V            +    G+E FT  Q+++   +
Sbjct: 73  FRPQKKD---SFALTQASEEKDIHCDVVKQNS----QSFTSGEGGVEGFTCVQLEEKGNL 125

Query: 179 NENDSHDNDDDQKSGEKTENGN--NRRVDVRALAFKLHSSKNADDVEKVLKEKGNLPLQV 6
             N  +D+D D    E+ E G     +VDVRALA  LH  K AD+V++VLK+K  LPLQV
Sbjct: 126 TNNIEYDDDGDV-GNEEDEAGRVKGEKVDVRALAQSLHFVKTADEVDEVLKDKIELPLQV 184

Query: 5   Y 3
           Y
Sbjct: 185 Y 185


>ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At3g46610 gi|6523064|emb|CAB62331.1| hypothetical
           protein [Arabidopsis thaliana]
           gi|332644660|gb|AEE78181.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 665

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 48/140 (34%), Positives = 63/140 (45%)
 Frame = -3

Query: 422 FKSGPSKCKIFSLCPRGTVNFLFKPKKGSFGAAFALTWALEEPEVGNNVGDNEELGRLND 243
           F S  S      +     V FL +PK+   G++F + WA E+ E+        ELG    
Sbjct: 47  FGSSSSISSFIFVSSNRKVLFLCEPKRSLLGSSFGVGWATEQREL--------ELGE--- 95

Query: 242 VSDNRDGIEFTHAQVDDVKEINENDSHDNDDDQKSGEKTENGNNRRVDVRALAFKLHSSK 63
                                 E  S ++      GEK    NN RVDVR LAF L ++K
Sbjct: 96  ----------------------EEVSTEDLSSANGGEK----NNLRVDVRELAFSLRAAK 129

Query: 62  NADDVEKVLKEKGNLPLQVY 3
            ADDV+ VLK+KG LPLQV+
Sbjct: 130 TADDVDAVLKDKGELPLQVF 149


>ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46610-like [Solanum lycopersicum]
          Length = 742

 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 45/123 (36%), Positives = 68/123 (55%), Gaps = 5/123 (4%)
 Frame = -3

Query: 356 FKP-KKGSFGAAFALTWALEEPEVGNNVGDNEELGRLNDVSDNRDGIE-FTHAQVDDVKE 183
           F+P KK SFG + AL  A  E ++  ++     L      +    G+E FT  Q+++  +
Sbjct: 73  FRPQKKDSFGPSCALAQASGEKDIDCDIVKQNSLS----FTSGEGGVEGFTCVQLEEKGD 128

Query: 182 INENDSHDN---DDDQKSGEKTENGNNRRVDVRALAFKLHSSKNADDVEKVLKEKGNLPL 12
           +  N  +D+   ++D+    K E     +VDVRALA  LH  K AD+V++VLK+K  LPL
Sbjct: 129 LTNNVEYDDVVSEEDEAGIVKGE-----KVDVRALAQSLHFVKTADEVDEVLKDKVELPL 183

Query: 11  QVY 3
           QVY
Sbjct: 184 QVY 186


>ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297319497|gb|EFH49919.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 674

 Score = 61.2 bits (147), Expect = 1e-07
 Identities = 43/126 (34%), Positives = 58/126 (46%)
 Frame = -3

Query: 380 PRGTVNFLFKPKKGSFGAAFALTWALEEPEVGNNVGDNEELGRLNDVSDNRDGIEFTHAQ 201
           P   V FL +PK+   G++  + WA E+ E+G                            
Sbjct: 68  PTSKVLFLCEPKRNLSGSSVGVGWATEQRELG---------------------------- 99

Query: 200 VDDVKEINENDSHDNDDDQKSGEKTENGNNRRVDVRALAFKLHSSKNADDVEKVLKEKGN 21
               +E++  DS         GEKT    N RVDVR LA+ L ++K ADDV+ V+KE G 
Sbjct: 100 ----EEVSTEDS-SYPQTVNGGEKT----NSRVDVRELAYSLRAAKTADDVDIVIKEMGE 150

Query: 20  LPLQVY 3
           LPLQVY
Sbjct: 151 LPLQVY 156


>ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Capsella rubella]
           gi|482561642|gb|EOA25833.1| hypothetical protein
           CARUB_v10019206mg [Capsella rubella]
          Length = 673

 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 31/64 (48%), Positives = 41/64 (64%), Gaps = 2/64 (3%)
 Frame = -3

Query: 188 KEINENDSHDNDDDQKSGEKTENG--NNRRVDVRALAFKLHSSKNADDVEKVLKEKGNLP 15
           +E++  DS  +  D    +    G  NN RV+VR LAF L ++K ADDV+ VLKEKG LP
Sbjct: 91  EEVSTEDSSSSSVDHSEPQAVNGGEKNNSRVNVRELAFSLRAAKTADDVDAVLKEKGELP 150

Query: 14  LQVY 3
           LQV+
Sbjct: 151 LQVF 154


>ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46610-like [Fragaria vesca subsp. vesca]
          Length = 657

 Score = 57.8 bits (138), Expect = 1e-06
 Identities = 41/110 (37%), Positives = 55/110 (50%), Gaps = 3/110 (2%)
 Frame = -3

Query: 323 FALTWALEEPEVGNNVGDNEEL---GRLNDVSDNRDGIEFTHAQVDDVKEINENDSHDND 153
           F   WALEE ++G+ V         G L +      G+E +          +E D     
Sbjct: 38  FVSAWALEEQDIGDEVSVENSTSGNGLLAECGSREVGMEGS----------DEVDGRSGG 87

Query: 152 DDQKSGEKTENGNNRRVDVRALAFKLHSSKNADDVEKVLKEKGNLPLQVY 3
           +     EK+E      VDVRALA +L  +K ADDVE+VLKE G+LPLQV+
Sbjct: 88  EGGNWEEKSEV-----VDVRALASRLQFAKTADDVEEVLKEMGDLPLQVF 132


Top