BLASTX nr result

ID: Forsythia21_contig00050772

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00050772
         (436 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDP11859.1| unnamed protein product [Coffea canephora]            103   3e-20
ref|XP_012852438.1| PREDICTED: pentatricopeptide repeat-containi...    93   6e-17
ref|XP_011100283.1| PREDICTED: pentatricopeptide repeat-containi...    91   3e-16
ref|XP_002522705.1| pentatricopeptide repeat-containing protein,...    83   8e-14
ref|XP_002319950.1| hypothetical protein POPTR_0013s14820g [Popu...    77   6e-12
ref|XP_002276196.1| PREDICTED: pentatricopeptide repeat-containi...    75   1e-11
ref|XP_004304934.2| PREDICTED: pentatricopeptide repeat-containi...    73   7e-11
gb|KHN27212.1| Pentatricopeptide repeat-containing protein [Glyc...    73   7e-11
ref|XP_011020996.1| PREDICTED: pentatricopeptide repeat-containi...    72   1e-10
ref|XP_012078984.1| PREDICTED: pentatricopeptide repeat-containi...    72   1e-10
ref|XP_012078983.1| PREDICTED: pentatricopeptide repeat-containi...    72   1e-10
ref|XP_007139254.1| hypothetical protein PHAVU_008G014200g [Phas...    71   2e-10
ref|XP_003551818.1| PREDICTED: pentatricopeptide repeat-containi...    71   2e-10
ref|XP_009353362.1| PREDICTED: pentatricopeptide repeat-containi...    71   3e-10
ref|XP_008379591.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...    69   9e-10
ref|XP_008230115.1| PREDICTED: pentatricopeptide repeat-containi...    69   9e-10
ref|XP_007033247.1| Mitochondrial editing factor 22 [Theobroma c...    69   2e-09
ref|XP_012436686.1| PREDICTED: pentatricopeptide repeat-containi...    68   2e-09
ref|XP_010105982.1| hypothetical protein L484_007616 [Morus nota...    65   2e-09
ref|XP_010542851.1| PREDICTED: pentatricopeptide repeat-containi...    67   4e-09

>emb|CDP11859.1| unnamed protein product [Coffea canephora]
          Length = 747

 Score =  103 bits (258), Expect = 3e-20
 Identities = 58/102 (56%), Positives = 68/102 (66%), Gaps = 1/102 (0%)
 Frame = -1

Query: 307 RAFIFPTPP-SVLRKLCSSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHL 131
           + F FP    S +    SSLP  LD  F +N NL   Y T EP  YYTSLL KS +K  L
Sbjct: 36  KLFEFPASKFSCIGYYYSSLPLPLDHNFDNNNNL---YYTFEPEAYYTSLLSKSTHKSFL 92

Query: 130 NQIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEF 5
           NQIHAQL+  GLQNNG+II KFI+VGSN+ EI YA ++FDEF
Sbjct: 93  NQIHAQLFTFGLQNNGYIITKFIHVGSNIGEIIYARKIFDEF 134


>ref|XP_012852438.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           [Erythranthe guttatus]
          Length = 710

 Score = 93.2 bits (230), Expect = 6e-17
 Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
 Frame = -1

Query: 277 VLRKLCSSLPDHLDQLFSSNKNL-CSTYDTVEPGTYYTSLLKKSVNKIHLNQIHAQLYIH 101
           V R+   S     D  F   + + C ++ + +  +++TSLL+KS +  HLNQIH QLYIH
Sbjct: 6   VQRRTIKSRIKIFDLYFKFRRGIHCCSHSSTDDESFFTSLLEKSAHTAHLNQIHNQLYIH 65

Query: 100 GLQNNGFIIAKFINVGSNLKEICYAHQVFDEFP 2
           GL  NGFII KFIN  SNL+EI YA  VFDEFP
Sbjct: 66  GLHENGFIITKFINTSSNLREIDYARHVFDEFP 98


>ref|XP_011100283.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           [Sesamum indicum]
          Length = 715

 Score = 90.9 bits (224), Expect = 3e-16
 Identities = 46/78 (58%), Positives = 56/78 (71%)
 Frame = -1

Query: 235 QLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFIIAKFINV 56
           +LFSS     S Y  +   ++YTSLL+KS + IHL QIH QLY HGLQNNGFI+ K I+V
Sbjct: 30  RLFSSR----SHYGRISAESFYTSLLEKSTHIIHLRQIHCQLYTHGLQNNGFIVTKLIHV 85

Query: 55  GSNLKEICYAHQVFDEFP 2
            SNLKEI YA  +F+EFP
Sbjct: 86  TSNLKEIHYARHLFEEFP 103


>ref|XP_002522705.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223538055|gb|EEF39667.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 501

 Score = 82.8 bits (203), Expect = 8e-14
 Identities = 43/86 (50%), Positives = 57/86 (66%)
 Frame = -1

Query: 259 SSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGF 80
           SSLP HL   F++  +LC  +D   P T++ SL+  SVN+ HLNQIHAQL++  LQ NGF
Sbjct: 24  SSLPRHLHSCFNTG-HLCFRFD---PCTFFASLIDNSVNRSHLNQIHAQLFVSRLQYNGF 79

Query: 79  IIAKFINVGSNLKEICYAHQVFDEFP 2
           +I K +N  + L EI YA  VFD +P
Sbjct: 80  LITKLVNCCATLGEIRYARNVFDYYP 105


>ref|XP_002319950.1| hypothetical protein POPTR_0013s14820g [Populus trichocarpa]
           gi|222858326|gb|EEE95873.1| hypothetical protein
           POPTR_0013s14820g [Populus trichocarpa]
          Length = 746

 Score = 76.6 bits (187), Expect = 6e-12
 Identities = 44/102 (43%), Positives = 65/102 (63%), Gaps = 4/102 (3%)
 Frame = -1

Query: 295 FPTPPSVLRKL----CSSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLN 128
           F  P S L+ L     SSL  +L   F++NK+ C+   T +P  +Y SL+  S++K HLN
Sbjct: 36  FRFPASTLKFLETHYSSSL--NLTTHFNNNKDDCNE-STFKPDKFYASLIDDSIHKTHLN 92

Query: 127 QIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEFP 2
           QI+A+L + GLQ  GF+IAK +N  SN+ E+  A ++FD+FP
Sbjct: 93  QIYAKLLVTGLQYGGFLIAKLVNKASNIGEVSCARKLFDKFP 134


>ref|XP_002276196.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           [Vitis vinifera] gi|296081235|emb|CBI17979.3| unnamed
           protein product [Vitis vinifera]
          Length = 742

 Score = 75.5 bits (184), Expect = 1e-11
 Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 3/101 (2%)
 Frame = -1

Query: 295 FPTPPSVLRKLCSSLP---DHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLNQ 125
           FP          SSLP   DH D +          Y   +  ++++SLL  SV+K HLNQ
Sbjct: 39  FPATLFKFLNFYSSLPLPLDHSDYI---------PYSGFDFDSFFSSLLDHSVHKRHLNQ 89

Query: 124 IHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEFP 2
           IHAQL + GL  +GF++ KF+N   N+ EI YA +VFDEFP
Sbjct: 90  IHAQLVVSGLVESGFLVTKFVNASWNIGEIGYARKVFDEFP 130


>ref|XP_004304934.2| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           [Fragaria vesca subsp. vesca]
           gi|764597985|ref|XP_011466238.1| PREDICTED:
           pentatricopeptide repeat-containing protein At3g12770
           [Fragaria vesca subsp. vesca]
           gi|764597989|ref|XP_011466239.1| PREDICTED:
           pentatricopeptide repeat-containing protein At3g12770
           [Fragaria vesca subsp. vesca]
          Length = 740

 Score = 73.2 bits (178), Expect = 7e-11
 Identities = 44/100 (44%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
 Frame = -1

Query: 301 FIFPTPPSVLR-KLCSSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLNQ 125
           F FP P S +     SS P  LD  FS N  L   YD+      + S +  S  + HL Q
Sbjct: 31  FHFPLPFSFISPNHYSSSPLSLDSYFSDNHTLHFGYDSESS---FASFIDSSTRRTHLTQ 87

Query: 124 IHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEF 5
           IHAQL + G Q + F+I K +N  SNL  I YA QVFDEF
Sbjct: 88  IHAQLLVLGFQASAFLITKLVNCSSNLGHISYARQVFDEF 127


>gb|KHN27212.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 433

 Score = 73.2 bits (178), Expect = 7e-11
 Identities = 40/93 (43%), Positives = 54/93 (58%)
 Frame = -1

Query: 283 PSVLRKLCSSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLNQIHAQLYI 104
           P V + LC S   H +     N   C   D+     +Y SL+  S +K H +QIH QL I
Sbjct: 4   PQVCKYLCFSSALHPEHFV--NHGHCFNSDS-----FYASLIDNSTHKRHRDQIHNQLVI 56

Query: 103 HGLQNNGFIIAKFINVGSNLKEICYAHQVFDEF 5
            GLQ+NGF++ K +N  SNL +ICYA ++FDEF
Sbjct: 57  SGLQHNGFLMTKVVNGSSNLGQICYARKLFDEF 89


>ref|XP_011020996.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           [Populus euphratica]
          Length = 746

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 33/74 (44%), Positives = 51/74 (68%)
 Frame = -1

Query: 223 SNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFIIAKFINVGSNL 44
           +NK   S  ++ +P  +Y SL+  S++K HLNQI+A+L + GLQ  GF+IAK +N  SN+
Sbjct: 61  NNKKDDSNENSFKPDKFYASLIDDSIHKTHLNQIYAKLLVTGLQYGGFLIAKLVNKASNI 120

Query: 43  KEICYAHQVFDEFP 2
            E+  A ++FD+FP
Sbjct: 121 GEVSCARKLFDKFP 134


>ref|XP_012078984.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g12770-like isoform X2 [Jatropha curcas]
          Length = 714

 Score = 72.0 bits (175), Expect = 1e-10
 Identities = 36/59 (61%), Positives = 42/59 (71%)
 Frame = -1

Query: 178 TYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEFP 2
           T+Y SL   SV+K +L QIHAQL +  LQ NGF+IAKF+N  SNL EI YA  VFD FP
Sbjct: 44  TFYASLTDNSVDKSNLYQIHAQLLVSQLQYNGFLIAKFVNCSSNLGEISYARNVFDCFP 102


>ref|XP_012078983.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g12770-like isoform X1 [Jatropha curcas]
          Length = 728

 Score = 72.0 bits (175), Expect = 1e-10
 Identities = 36/59 (61%), Positives = 42/59 (71%)
 Frame = -1

Query: 178 TYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEFP 2
           T+Y SL   SV+K +L QIHAQL +  LQ NGF+IAKF+N  SNL EI YA  VFD FP
Sbjct: 44  TFYASLTDNSVDKSNLYQIHAQLLVSQLQYNGFLIAKFVNCSSNLGEISYARNVFDCFP 102


>ref|XP_007139254.1| hypothetical protein PHAVU_008G014200g [Phaseolus vulgaris]
           gi|593331658|ref|XP_007139255.1| hypothetical protein
           PHAVU_008G014200g [Phaseolus vulgaris]
           gi|561012387|gb|ESW11248.1| hypothetical protein
           PHAVU_008G014200g [Phaseolus vulgaris]
           gi|561012388|gb|ESW11249.1| hypothetical protein
           PHAVU_008G014200g [Phaseolus vulgaris]
          Length = 727

 Score = 71.2 bits (173), Expect = 2e-10
 Identities = 31/63 (49%), Positives = 45/63 (71%)
 Frame = -1

Query: 193 TVEPGTYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVF 14
           +V   ++Y SL+  S +K HL+QIH QL + GLQ+NGF++ K +N  SNL +ICYA ++F
Sbjct: 52  SVNSDSFYASLIDNSTHKRHLDQIHNQLVVSGLQHNGFLMTKLVNGSSNLAQICYARKLF 111

Query: 13  DEF 5
           D F
Sbjct: 112 DGF 114


>ref|XP_003551818.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g12770-like [Glycine max]
          Length = 727

 Score = 71.2 bits (173), Expect = 2e-10
 Identities = 31/58 (53%), Positives = 44/58 (75%)
 Frame = -1

Query: 178 TYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEF 5
           ++Y SL+  S +K HL+QIH +L I GLQ+NGF++ K +N  SNL +ICYA ++FDEF
Sbjct: 57  SFYASLIDNSTHKRHLDQIHNRLVISGLQHNGFLMTKLVNGSSNLGQICYARKLFDEF 114


>ref|XP_009353362.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g12770-like [Pyrus x bretschneideri]
          Length = 753

 Score = 70.9 bits (172), Expect = 3e-10
 Identities = 44/101 (43%), Positives = 58/101 (57%), Gaps = 4/101 (3%)
 Frame = -1

Query: 295 FPTPPSV----LRKLCSSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLN 128
           F TP S     LR  CSS P  LD     +++L   +D   P +++ SL+  S  +  L 
Sbjct: 44  FETPASFSFTFLRHYCSS-PLRLDPYCYDDQHLRHGHD---PDSFFASLIDSSTRESQLG 99

Query: 127 QIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEF 5
           QIHAQL + GL ++GF+I K +N  SNL  I YA QVFDEF
Sbjct: 100 QIHAQLVVLGLIDSGFLITKLVNASSNLGYIWYARQVFDEF 140


>ref|XP_008379591.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At3g12770-like [Malus domestica]
          Length = 758

 Score = 69.3 bits (168), Expect = 9e-10
 Identities = 44/101 (43%), Positives = 57/101 (56%), Gaps = 4/101 (3%)
 Frame = -1

Query: 295 FPTPPSV----LRKLCSSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLN 128
           F TP S     +R  CSS P  LD     +++L    D   P +++ SL+  S  K  L 
Sbjct: 29  FETPASFSFTFIRHYCSS-PLRLDPYCYDDQHLRHGXD---PDSFFGSLIDGSTRKSQLG 84

Query: 127 QIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEF 5
           QIHAQL + GL ++GF+I K +N  SNL  I YA QVFDEF
Sbjct: 85  QIHAQLVVLGLIDSGFLITKLVNASSNLGYISYARQVFDEF 125


>ref|XP_008230115.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           [Prunus mume]
          Length = 740

 Score = 69.3 bits (168), Expect = 9e-10
 Identities = 39/90 (43%), Positives = 54/90 (60%)
 Frame = -1

Query: 274 LRKLCSSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLNQIHAQLYIHGL 95
           L   CSS    LD  +  ++ L  +YD+    + + SL+  S  K HL QIHAQL + GL
Sbjct: 42  LSHYCSSAL-RLDPCYYGDQQLHYSYDS---DSSFASLIDGSTQKSHLGQIHAQLLVLGL 97

Query: 94  QNNGFIIAKFINVGSNLKEICYAHQVFDEF 5
           Q++GF+I K +N  SNL  + YA +VFDEF
Sbjct: 98  QDSGFLITKLVNASSNLGFVTYARRVFDEF 127


>ref|XP_007033247.1| Mitochondrial editing factor 22 [Theobroma cacao]
           gi|508712276|gb|EOY04173.1| Mitochondrial editing factor
           22 [Theobroma cacao]
          Length = 735

 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 41/103 (39%), Positives = 54/103 (52%)
 Frame = -1

Query: 310 LRAFIFPTPPSVLRKLCSSLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHL 131
           ++AF FP  P    K            FSS+ N  + + T    ++Y +LL  S    HL
Sbjct: 33  VKAFEFPATPFNFLKP-----------FSSHDN-STIFHTFNFDSFYANLLDSSTRNAHL 80

Query: 130 NQIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEFP 2
            QIHA+L +  +  NGF+I K IN   NL EI YA +VFDEFP
Sbjct: 81  TQIHAKLVLLDIHQNGFLITKLINSAVNLGEISYARKVFDEFP 123


>ref|XP_012436686.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           [Gossypium raimondii] gi|763781109|gb|KJB48180.1|
           hypothetical protein B456_008G055100 [Gossypium
           raimondii]
          Length = 732

 Score = 68.2 bits (165), Expect = 2e-09
 Identities = 33/85 (38%), Positives = 54/85 (63%)
 Frame = -1

Query: 256 SLPDHLDQLFSSNKNLCSTYDTVEPGTYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFI 77
           S P +  +  SSN N  + +      ++Y +L+  S +KI++ Q+HA+L + G+Q NGF+
Sbjct: 37  STPFNFVKHLSSNDNP-TIFHNFNFDSFYANLIDNSSHKINITQVHAKLLLLGIQQNGFL 95

Query: 76  IAKFINVGSNLKEICYAHQVFDEFP 2
           ++K +N   NL EI YA +VFD+FP
Sbjct: 96  VSKLVNAAVNLGEISYARKVFDKFP 120


>ref|XP_010105982.1| hypothetical protein L484_007616 [Morus notabilis]
           gi|587919468|gb|EXC06936.1| hypothetical protein
           L484_007616 [Morus notabilis]
          Length = 734

 Score = 64.7 bits (156), Expect(2) = 2e-09
 Identities = 31/58 (53%), Positives = 41/58 (70%)
 Frame = -1

Query: 178 TYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFDEF 5
           ++Y  LL  S +K HL+QIHAQL I GLQ NGF+I K +NV S++    YA ++FDEF
Sbjct: 64  SFYAYLLGSSTHKKHLDQIHAQLLISGLQQNGFLITKLVNVSSDIGCNFYARKLFDEF 121



 Score = 23.5 bits (49), Expect(2) = 2e-09
 Identities = 7/20 (35%), Positives = 11/20 (55%)
 Frame = -3

Query: 356 LYKQCNHLYVFYCFVATGIY 297
           +Y  CNH  ++YC+     Y
Sbjct: 47  IYPSCNHGRLYYCYGDISFY 66


>ref|XP_010542851.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           isoform X1 [Tarenaya hassleriana]
          Length = 694

 Score = 67.4 bits (163), Expect = 4e-09
 Identities = 29/63 (46%), Positives = 42/63 (66%)
 Frame = -1

Query: 190 VEPGTYYTSLLKKSVNKIHLNQIHAQLYIHGLQNNGFIIAKFINVGSNLKEICYAHQVFD 11
           +   ++Y SL+  S  K HL Q+HA+L + G Q +GF+I K I+  S L ++CYA QVFD
Sbjct: 20  IHSDSFYASLIDSSTRKGHLRQVHARLLLLGFQFSGFLITKLIHASSELGDMCYARQVFD 79

Query: 10  EFP 2
           +FP
Sbjct: 80  DFP 82

