BLASTX nr result

ID: Catharanthus22_contig00037825 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00037825
         (426 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002515835.1| pentatricopeptide repeat-containing protein,...    96   4e-18
ref|XP_002271725.2| PREDICTED: pentatricopeptide repeat-containi...    87   3e-15
emb|CBI40338.3| unnamed protein product [Vitis vinifera]               87   3e-15
emb|CAN79811.1| hypothetical protein VITISV_018821 [Vitis vinifera]    87   3e-15
ref|XP_004235474.1| PREDICTED: pentatricopeptide repeat-containi...    86   4e-15
gb|EOY16968.1| Tetratricopeptide repeat (TPR)-like superfamily p...    85   1e-14
gb|EOY16969.1| Tetratricopeptide repeat (TPR)-like superfamily p...    84   3e-14
ref|XP_006471568.1| PREDICTED: pentatricopeptide repeat-containi...    83   3e-14
ref|XP_006363979.1| PREDICTED: pentatricopeptide repeat-containi...    83   4e-14
ref|XP_004137054.1| PREDICTED: pentatricopeptide repeat-containi...    81   2e-13
ref|XP_003637572.1| Pentatricopeptide repeat-containing protein ...    74   2e-11
gb|EXB63285.1| hypothetical protein L484_012475 [Morus notabilis]      72   6e-11
ref|XP_004498096.1| PREDICTED: pentatricopeptide repeat-containi...    72   1e-10
ref|XP_004301456.1| PREDICTED: pentatricopeptide repeat-containi...    70   2e-10
ref|XP_006400228.1| hypothetical protein EUTSA_v10012670mg [Eutr...    66   4e-09
ref|NP_197188.1| pentatricopeptide repeat-containing protein [Ar...    64   3e-08
gb|ESW13450.1| hypothetical protein PHAVU_008G197200g [Phaseolus...    63   4e-08
ref|XP_002871739.1| pentatricopeptide repeat-containing protein ...    63   5e-08
gb|EPS66947.1| hypothetical protein M569_07825, partial [Genlise...    62   6e-08
ref|XP_006287060.1| hypothetical protein CARUB_v10000209mg [Caps...    62   6e-08

>ref|XP_002515835.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223544990|gb|EEF46504.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 655

 Score = 96.3 bits (238), Expect = 4e-18
 Identities = 53/142 (37%), Positives = 80/142 (56%), Gaps = 16/142 (11%)
 Frame = +3

Query: 48  LPSKLNKFSTLALARCINNF-FSTAAVLADCNPQFQSR--YHEYLLEAVYSRF------- 197
           L SK   FST + +  + +    +  +L  C   FQS   + + +++ + S F       
Sbjct: 21  LTSKHRFFSTTSGSIALTSTPVISPTLLKQCKSIFQSHLIHQQAIVQGLLSHFSLNLIST 80

Query: 198 ------PENAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGY 359
                 P +A+++LQCL  S S V++WN LI+  V L    ++L  FR M RL+W PD Y
Sbjct: 81  YLALNAPSHALSLLQCLTPSPSAVYWWNALIRRAVRLGLLQHSLSLFRTMRRLNWSPDHY 140

Query: 360 TYPYILKACGELPSFRHGACLH 425
           T+P++ KACGELPSF HG+C+H
Sbjct: 141 TFPFVFKACGELPSFLHGSCIH 162


>ref|XP_002271725.2| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Vitis vinifera]
          Length = 852

 Score = 86.7 bits (213), Expect = 3e-15
 Identities = 38/76 (50%), Positives = 55/76 (72%)
 Frame = +3

Query: 198 PENAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYIL 377
           P  A+++L+ L  SS  VF+WN LI+ +V+L    + L+ +R M RL W+PD YT+P++L
Sbjct: 74  PAKALSVLRRLHPSSHTVFWWNQLIRRSVHLGFLEDVLQLYRRMQRLGWRPDHYTFPFVL 133

Query: 378 KACGELPSFRHGACLH 425
           KACGE+PSFR GA +H
Sbjct: 134 KACGEIPSFRCGASVH 149


>emb|CBI40338.3| unnamed protein product [Vitis vinifera]
          Length = 487

 Score = 86.7 bits (213), Expect = 3e-15
 Identities = 38/76 (50%), Positives = 55/76 (72%)
 Frame = +3

Query: 198 PENAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYIL 377
           P  A+++L+ L  SS  VF+WN LI+ +V+L    + L+ +R M RL W+PD YT+P++L
Sbjct: 74  PAKALSVLRRLHPSSHTVFWWNQLIRRSVHLGFLEDVLQLYRRMQRLGWRPDHYTFPFVL 133

Query: 378 KACGELPSFRHGACLH 425
           KACGE+PSFR GA +H
Sbjct: 134 KACGEIPSFRCGASVH 149


>emb|CAN79811.1| hypothetical protein VITISV_018821 [Vitis vinifera]
          Length = 871

 Score = 86.7 bits (213), Expect = 3e-15
 Identities = 38/76 (50%), Positives = 55/76 (72%)
 Frame = +3

Query: 198 PENAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYIL 377
           P  A+++L+ L  SS  VF+WN LI+ +V+L    + L+ +R M RL W+PD YT+P++L
Sbjct: 93  PAKALSVLRRLHPSSHTVFWWNQLIRRSVHLGFLEDVLQLYRRMQRLGWRPDHYTFPFVL 152

Query: 378 KACGELPSFRHGACLH 425
           KACGE+PSFR GA +H
Sbjct: 153 KACGEIPSFRCGASVH 168


>ref|XP_004235474.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Solanum lycopersicum]
          Length = 843

 Score = 86.3 bits (212), Expect = 4e-15
 Identities = 41/63 (65%), Positives = 44/63 (69%)
 Frame = +3

Query: 237 SSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKACGELPSFRHGA 416
           SS  VFYWN LIK  V LR + +AL  FR MLRLDW PDGYTYPYILKACGEL     G 
Sbjct: 76  SSQVVFYWNNLIKRCVLLRHHESALVLFREMLRLDWNPDGYTYPYILKACGELRFLLFGE 135

Query: 417 CLH 425
            +H
Sbjct: 136 SVH 138


>gb|EOY16968.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1
           [Theobroma cacao]
          Length = 862

 Score = 84.7 bits (208), Expect = 1e-14
 Identities = 51/152 (33%), Positives = 82/152 (53%), Gaps = 17/152 (11%)
 Frame = +3

Query: 21  YCNMLMSSLLP--SKLNKFSTLALARCINNFFSTAAVLADCNPQFQSR--YHEYLLEAVY 188
           + N ++ S  P  SK     +L ++   +   STAA+L  C    Q++  + + L++ + 
Sbjct: 9   HLNRMLRSFFPLSSKSRPSGSLTIS-LFSTTTSTAALLQKCKSLVQAKLIHQQLLIQGLS 67

Query: 189 SRFP-------------ENAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHM 329
             F               ++I++LQ    S S VF+WN+LI+ +++L    + L  FR M
Sbjct: 68  HHFATHLISAYLTHHASSHSISLLQRFTPSPSAVFFWNSLIRRSLHLGFSHDVLTLFRRM 127

Query: 330 LRLDWKPDGYTYPYILKACGELPSFRHGACLH 425
           L L   PD YT+P++LKACG+LPSFR GA +H
Sbjct: 128 LSLGCSPDHYTFPFVLKACGQLPSFRRGAAVH 159


>gb|EOY16969.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 2
           [Theobroma cacao]
          Length = 850

 Score = 83.6 bits (205), Expect = 3e-14
 Identities = 45/120 (37%), Positives = 69/120 (57%), Gaps = 15/120 (12%)
 Frame = +3

Query: 111 STAAVLADCNPQFQSR--YHEYLLEAVYSRFP-------------ENAIAMLQCLPTSSS 245
           STAA+L  C    Q++  + + L++ +   F               ++I++LQ    S S
Sbjct: 28  STAALLQKCKSLVQAKLIHQQLLIQGLSHHFATHLISAYLTHHASSHSISLLQRFTPSPS 87

Query: 246 NVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKACGELPSFRHGACLH 425
            VF+WN+LI+ +++L    + L  FR ML L   PD YT+P++LKACG+LPSFR GA +H
Sbjct: 88  AVFFWNSLIRRSLHLGFSHDVLTLFRRMLSLGCSPDHYTFPFVLKACGQLPSFRRGAAVH 147


>ref|XP_006471568.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Citrus sinensis]
          Length = 860

 Score = 83.2 bits (204), Expect = 3e-14
 Identities = 51/139 (36%), Positives = 76/139 (54%), Gaps = 4/139 (2%)
 Frame = +3

Query: 21  YCNMLMSSLLPSKLNKFSTLALARCINNFFSTAAVLADCNPQFQSRYH---EYLLEAVYS 191
           + N+ + S+  +   K ++L L +C +    T   L       Q+  H    +L+ A  S
Sbjct: 23  FTNIKLFSVTTTPCIKITSLLLRQCKS---LTQVYLIHQQIIVQNLTHVPPSHLIAAYVS 79

Query: 192 R-FPENAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYP 368
              P  A+++LQ +  S  +VF+WN LI+  V LR   NA R F  M+R  W PD YT+P
Sbjct: 80  HNAPSPALSLLQRISPSPFSVFWWNALIRRAVRLRLPDNAFRLFLQMMRRGWHPDEYTFP 139

Query: 369 YILKACGELPSFRHGACLH 425
           ++LKACGELPS R G+ +H
Sbjct: 140 FVLKACGELPSSRCGSSVH 158


>ref|XP_006363979.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like isoform X1 [Solanum tuberosum]
           gi|565396768|ref|XP_006363980.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g16860-like isoform X2 [Solanum tuberosum]
          Length = 843

 Score = 82.8 bits (203), Expect = 4e-14
 Identities = 39/63 (61%), Positives = 44/63 (69%)
 Frame = +3

Query: 237 SSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKACGELPSFRHGA 416
           SS  VFYWN LIK +V LR + +AL  FR MLRLDW  DGYTYPY+LKACGEL     G 
Sbjct: 76  SSQVVFYWNNLIKRSVILRHHESALVLFREMLRLDWNADGYTYPYVLKACGELRFLLCGE 135

Query: 417 CLH 425
            +H
Sbjct: 136 SVH 138


>ref|XP_004137054.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Cucumis sativus]
           gi|449479088|ref|XP_004155501.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g16860-like [Cucumis sativus]
          Length = 855

 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 37/73 (50%), Positives = 50/73 (68%)
 Frame = +3

Query: 207 AIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKAC 386
           A+++LQ L  S S VF+WN LI+ +V L    + L  +  M RL W PD YT+P++LKAC
Sbjct: 78  AVSLLQRLIPSHSTVFWWNALIRRSVKLGLLDDTLGFYCQMQRLGWLPDHYTFPFVLKAC 137

Query: 387 GELPSFRHGACLH 425
           GE+PS RHGA +H
Sbjct: 138 GEIPSLRHGASVH 150


>ref|XP_003637572.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355503507|gb|AES84710.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 833

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 34/75 (45%), Positives = 49/75 (65%), Gaps = 1/75 (1%)
 Frame = +3

Query: 204 NAIAMLQCLPTSS-SNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILK 380
           NAI +L+   T S S+V++WN LI++ ++      ALR FR M  L W PD YT+P++ K
Sbjct: 75  NAILLLEKNVTPSHSSVYWWNQLIRHALHFNSPNTALRLFRRMKTLHWTPDHYTFPFVFK 134

Query: 381 ACGELPSFRHGACLH 425
           ACGE+ +F  GA +H
Sbjct: 135 ACGEISNFELGASIH 149


>gb|EXB63285.1| hypothetical protein L484_012475 [Morus notabilis]
          Length = 858

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 33/74 (44%), Positives = 47/74 (63%)
 Frame = +3

Query: 204 NAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKA 383
           +A+ +L+ L  S  +VF+WN  I+  V        L  ++ M RL W+PD YT+P++LKA
Sbjct: 82  HAVVLLEPLEPSPFSVFWWNQFIRRAVGSGLLNEVLGLYQRMHRLGWRPDEYTFPFVLKA 141

Query: 384 CGELPSFRHGACLH 425
           CGEL SFR GA +H
Sbjct: 142 CGELSSFRLGASVH 155


>ref|XP_004498096.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Cicer arietinum]
          Length = 855

 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 31/74 (41%), Positives = 47/74 (63%)
 Frame = +3

Query: 204 NAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKA 383
           NA+++LQ L  S S+VF+WN LI+ +++       L  +  M  L W PD YT+P++ KA
Sbjct: 78  NALSLLQTLHPSPSSVFWWNQLIRQSLHFNSPHVVLHLYCRMKTLHWSPDHYTFPFVFKA 137

Query: 384 CGELPSFRHGACLH 425
           CG++ SF  GA +H
Sbjct: 138 CGDVLSFNLGASIH 151


>ref|XP_004301456.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Fragaria vesca subsp. vesca]
          Length = 850

 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 3/87 (3%)
 Frame = +3

Query: 174 LEAVYSRF--PENAIAMLQCLPTSS-SNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDW 344
           L A Y  F  P +A+++L+ L T   + V++WN LI++ V      + L  +  M RL W
Sbjct: 61  LIAAYLSFNAPSHALSLLERLATPRPAAVYWWNVLIRSAVRSGFLEHVLSLYSRMQRLGW 120

Query: 345 KPDGYTYPYILKACGELPSFRHGACLH 425
           KPD YTYP++ KACGEL S R G   H
Sbjct: 121 KPDHYTYPFVFKACGELGSLRRGEAAH 147


>ref|XP_006400228.1| hypothetical protein EUTSA_v10012670mg [Eutrema salsugineum]
           gi|557101318|gb|ESQ41681.1| hypothetical protein
           EUTSA_v10012670mg [Eutrema salsugineum]
          Length = 853

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 29/73 (39%), Positives = 43/73 (58%)
 Frame = +3

Query: 207 AIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKAC 386
           A+++L   P S + V++WN+LI+          +L  FR M  L W PD YT+P++ KAC
Sbjct: 78  AVSLLHRFPPSEAGVYHWNSLIRVYCDNDRVSESLSLFRLMHSLSWTPDNYTFPFVFKAC 137

Query: 387 GELPSFRHGACLH 425
           G++ SFR G   H
Sbjct: 138 GDISSFRCGVSAH 150


>ref|NP_197188.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75174141|sp|Q9LFL5.1|PP390_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g16860 gi|9755687|emb|CAC01699.1| putative protein
           [Arabidopsis thaliana] gi|332004967|gb|AED92350.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 850

 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 3/77 (3%)
 Frame = +3

Query: 204 NAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRC---FRHMLRLDWKPDGYTYPYI 374
           +A+++L+  P S + V++WN+LI++     D G A +C   F  M  L W PD YT+P++
Sbjct: 77  HAVSLLRRFPPSDAGVYHWNSLIRS---YGDNGCANKCLYLFGLMHSLSWTPDNYTFPFV 133

Query: 375 LKACGELPSFRHGACLH 425
            KACGE+ S R G   H
Sbjct: 134 FKACGEISSVRCGESAH 150


>gb|ESW13450.1| hypothetical protein PHAVU_008G197200g [Phaseolus vulgaris]
          Length = 863

 Score = 63.2 bits (152), Expect = 4e-08
 Identities = 32/73 (43%), Positives = 41/73 (56%)
 Frame = +3

Query: 207 AIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKAC 386
           AI +L+ LP S S+VF+WN LI+  ++L         FR M  L W PD YTYP++ K C
Sbjct: 92  AILLLERLPPSPSSVFWWNQLIRRALHLGTPRKVFALFRRMKSLGWTPDHYTYPFLFKGC 151

Query: 387 GELPSFRHGACLH 425
             L     GA LH
Sbjct: 152 SFLSL---GASLH 161


>ref|XP_002871739.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297317576|gb|EFH47998.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 850

 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 32/80 (40%), Positives = 45/80 (56%), Gaps = 6/80 (7%)
 Frame = +3

Query: 204 NAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGN------ALRCFRHMLRLDWKPDGYTY 365
           +A+++L+  P S + V++WN+LI      R YGN       L  F  M  L W PD YT+
Sbjct: 77  HAVSLLRRFPPSDAGVYHWNSLI------RSYGNNGRANKCLSSFCLMHSLSWTPDNYTF 130

Query: 366 PYILKACGELPSFRHGACLH 425
           P++ KACGE+ S R G   H
Sbjct: 131 PFVFKACGEISSVRCGDSSH 150


>gb|EPS66947.1| hypothetical protein M569_07825, partial [Genlisea aurea]
          Length = 779

 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 28/73 (38%), Positives = 41/73 (56%)
 Frame = +3

Query: 207 AIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRCFRHMLRLDWKPDGYTYPYILKAC 386
           AI+ L+ +      V+YWN +I+  ++  +   AL  F  M RL W PD YTYP++ KAC
Sbjct: 1   AISFLRRIVPCRYYVYYWNKIIRRCLFSGEQRKALLVFDEMRRLGWVPDEYTYPFVFKAC 60

Query: 387 GELPSFRHGACLH 425
           G+L     G  +H
Sbjct: 61  GDLSLLTTGVSVH 73


>ref|XP_006287060.1| hypothetical protein CARUB_v10000209mg [Capsella rubella]
           gi|482555766|gb|EOA19958.1| hypothetical protein
           CARUB_v10000209mg [Capsella rubella]
          Length = 850

 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 31/77 (40%), Positives = 45/77 (58%), Gaps = 3/77 (3%)
 Frame = +3

Query: 204 NAIAMLQCLPTSSSNVFYWNTLIKNTVYLRDYGNALRC---FRHMLRLDWKPDGYTYPYI 374
           +A+++L   P S S V++WN+LI+   +  + G A  C   FR M  L W PD YT+P++
Sbjct: 77  SAVSLLCRFPPSDSGVYHWNSLIR---FHGENGRASECISLFRLMHSLSWTPDNYTFPFV 133

Query: 375 LKACGELPSFRHGACLH 425
            KACGE+ S   G   H
Sbjct: 134 FKACGEISSVICGVSAH 150


Top