BLASTX nr result

ID: Catharanthus22_contig00026054

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00026054
         (550 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY21924.1| Pentatricopeptide repeat-containing protein, puta...   106   4e-21
ref|XP_006354771.1| PREDICTED: pentatricopeptide repeat-containi...   100   3e-19
emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera]   100   4e-19
ref|XP_004242174.1| PREDICTED: pentatricopeptide repeat-containi...    99   9e-19
ref|XP_003634263.1| PREDICTED: pentatricopeptide repeat-containi...    98   2e-18
emb|CBI15198.3| unnamed protein product [Vitis vinifera]               98   2e-18
ref|XP_004301139.1| PREDICTED: pentatricopeptide repeat-containi...    97   3e-18
ref|XP_002514728.1| pentatricopeptide repeat-containing protein,...    92   6e-17
ref|XP_006440635.1| hypothetical protein CICLE_v10024595mg [Citr...    92   1e-16
ref|XP_004167903.1| PREDICTED: pentatricopeptide repeat-containi...    89   9e-16
ref|NP_200948.1| pentatricopeptide repeat-containing protein [Ar...    69   1e-09
ref|XP_002864731.1| pentatricopeptide repeat-containing protein ...    68   1e-09
ref|XP_006394512.1| hypothetical protein EUTSA_v10005336mg [Eutr...    67   3e-09
gb|EMJ12012.1| hypothetical protein PRUPE_ppa016961mg, partial [...    63   5e-08

>gb|EOY21924.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
           cacao]
          Length = 676

 Score =  106 bits (264), Expect = 4e-21
 Identities = 59/101 (58%), Positives = 71/101 (70%), Gaps = 2/101 (1%)
 Frame = +2

Query: 254 LNSKTQEEALEIFHS--ATINPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFR 427
           LNS+T  +AL +F+S    INP KNL+ +SAIIH LT AK Y  ARCLIK LI+ LQ   
Sbjct: 41  LNSQTPHQALNLFNSNIKLINPSKNLEPYSAIIHVLTGAKLYTDARCLIKYLIKTLQSSL 100

Query: 428 KPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
           KPRRAC  +F ALS+L +   T NV+G LIIA  EMGLI+E
Sbjct: 101 KPRRACHLIFNALSKLQTSKFTPNVFGSLIIAFSEMGLIEE 141


>ref|XP_006354771.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g61400-like [Solanum tuberosum]
          Length = 675

 Score =  100 bits (248), Expect = 3e-19
 Identities = 53/101 (52%), Positives = 72/101 (71%), Gaps = 2/101 (1%)
 Frame = +2

Query: 254 LNSKTQEEALEIFHSATI--NPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFR 427
           LN+KT  EA+ +F+S  +  +P K+L +HSAIIH LT A+ Y+ ARCLIK LIE L+K  
Sbjct: 44  LNAKTCSEAMRLFNSTILRTDPTKDLTLHSAIIHYLTRARLYLDARCLIKRLIENLRKNS 103

Query: 428 KPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
            PR+ CS +F  L ++ S  S+ NV+GVLIIAL EMG +D+
Sbjct: 104 NPRKVCSLIFNDLGKIDS-GSSCNVFGVLIIALSEMGFVDD 143


>emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera]
          Length = 1060

 Score = 99.8 bits (247), Expect = 4e-19
 Identities = 63/145 (43%), Positives = 83/145 (57%), Gaps = 3/145 (2%)
 Frame = +2

Query: 125 MLKIFPPKSKSVLSKRAPIHPHRFLQF-XXXXXXXXXXXXXXXXLNSKTQEEALEIFHSA 301
           MLK FPPKS+ + +K +                           L  +T  +ALE+FHS 
Sbjct: 1   MLKSFPPKSRRIYAKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHSV 60

Query: 302 T--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKPRRACSSVFKALSQL 475
           +   +  KN +++SAIIH LT AK Y  ARCL++DLI+ LQK R+  R C SVF  LS+L
Sbjct: 61  SRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQCLQKSRR-SRICCSVFNVLSRL 119

Query: 476 GSDNSTSNVYGVLIIALCEMGLIDE 550
            S   T NV+GVLIIA  EMGL++E
Sbjct: 120 ESSKFTPNVFGVLIIAFSEMGLVEE 144


>ref|XP_004242174.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g61400-like [Solanum lycopersicum]
          Length = 691

 Score = 98.6 bits (244), Expect = 9e-19
 Identities = 53/101 (52%), Positives = 71/101 (70%), Gaps = 2/101 (1%)
 Frame = +2

Query: 254 LNSKTQEEALEIFHSATI--NPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFR 427
           LN+K   EA+ ++ SA +  +P K+L +HSAIIH LT A+ Y+ ARCLIK LIE L+K  
Sbjct: 44  LNAKNCSEAMRLYDSAIVKTDPTKDLTLHSAIIHYLTRARLYLDARCLIKRLIENLRKNS 103

Query: 428 KPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
            PR+ CS VF  L ++ S  S+ NV+GVLIIAL EMG +D+
Sbjct: 104 NPRKVCSLVFNDLGKIDS-GSSCNVFGVLIIALSEMGFVDD 143


>ref|XP_003634263.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g61400-like [Vitis vinifera]
          Length = 665

 Score = 97.8 bits (242), Expect = 2e-18
 Identities = 62/145 (42%), Positives = 82/145 (56%), Gaps = 3/145 (2%)
 Frame = +2

Query: 125 MLKIFPPKSKSVLSKRAPIHPHRFLQF-XXXXXXXXXXXXXXXXLNSKTQEEALEIFHSA 301
           MLK FPPKS+ + +K +                           L  +T  +ALE+FHS 
Sbjct: 1   MLKSFPPKSRRIYAKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHSV 60

Query: 302 T--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKPRRACSSVFKALSQL 475
           +   +  KN +++SAIIH LT AK Y  ARCL++DLI+ LQ  R+  R C SVF  LS+L
Sbjct: 61  SRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQCLQNSRR-SRICCSVFNVLSRL 119

Query: 476 GSDNSTSNVYGVLIIALCEMGLIDE 550
            S   T NV+GVLIIA  EMGL++E
Sbjct: 120 ESSKFTPNVFGVLIIAFSEMGLVEE 144


>emb|CBI15198.3| unnamed protein product [Vitis vinifera]
          Length = 948

 Score = 97.8 bits (242), Expect = 2e-18
 Identities = 62/145 (42%), Positives = 82/145 (56%), Gaps = 3/145 (2%)
 Frame = +2

Query: 125 MLKIFPPKSKSVLSKRAPIHPHRFLQF-XXXXXXXXXXXXXXXXLNSKTQEEALEIFHSA 301
           MLK FPPKS+ + +K +                           L  +T  +ALE+FHS 
Sbjct: 1   MLKSFPPKSRRIYAKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHSV 60

Query: 302 T--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKPRRACSSVFKALSQL 475
           +   +  KN +++SAIIH LT AK Y  ARCL++DLI+ LQ  R+  R C SVF  LS+L
Sbjct: 61  SRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQCLQNSRR-SRICCSVFNVLSRL 119

Query: 476 GSDNSTSNVYGVLIIALCEMGLIDE 550
            S   T NV+GVLIIA  EMGL++E
Sbjct: 120 ESSKFTPNVFGVLIIAFSEMGLVEE 144


>ref|XP_004301139.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g61400-like [Fragaria vesca subsp. vesca]
          Length = 631

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 53/101 (52%), Positives = 70/101 (69%), Gaps = 2/101 (1%)
 Frame = +2

Query: 254 LNSKTQEEALEIFHSAT--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFR 427
           LN ++  EALEIF+SAT  +   K+ +++SAI H L  AK Y+ AR LIK+LI++LQK  
Sbjct: 27  LNCRSPAEALEIFNSATKQVGARKDFQLYSAITHVLVSAKLYVKARLLIKELIQDLQKSC 86

Query: 428 KPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
           K RRAC  V+ ALS L     + NV+GVLI AL EMGL++E
Sbjct: 87  KSRRACELVYNALSGLERSRVSRNVFGVLINALSEMGLVEE 127


>ref|XP_002514728.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223546332|gb|EEF47834.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 517

 Score = 92.4 bits (228), Expect = 6e-17
 Identities = 62/146 (42%), Positives = 77/146 (52%), Gaps = 4/146 (2%)
 Frame = +2

Query: 125 MLKIFPPKSKSVLSKRAPIHPHRFLQFXXXXXXXXXXXXXXXXLNSKTQEEALEIFHSAT 304
           MLK+FPPK    L +          +                 L+SKT E+AL+ F S  
Sbjct: 1   MLKLFPPKHSLKLVET---------KIRFFTSLSPPSDLTTIILHSKTPEQALDTFTSVL 51

Query: 305 I----NPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKPRRACSSVFKALSQ 472
                NP K L ++SA+IH LT AK Y  ARCL KDLI+ L +   PRR  S VF ALSQ
Sbjct: 52  KQNPNNPTKKLHLYSAVIHYLTGAKVYPTARCLTKDLIQTLLQSCTPRRVNSLVFNALSQ 111

Query: 473 LGSDNSTSNVYGVLIIALCEMGLIDE 550
           L       +V+GVLIIA  E+GL+DE
Sbjct: 112 LRGSKFNPSVFGVLIIAFSEVGLVDE 137


>ref|XP_006440635.1| hypothetical protein CICLE_v10024595mg [Citrus clementina]
           gi|557542897|gb|ESR53875.1| hypothetical protein
           CICLE_v10024595mg [Citrus clementina]
          Length = 697

 Score = 91.7 bits (226), Expect = 1e-16
 Identities = 62/162 (38%), Positives = 77/162 (47%), Gaps = 20/162 (12%)
 Frame = +2

Query: 125 MLKIFPPKSKSVL------------------SKRAPIHPHRFLQFXXXXXXXXXXXXXXX 250
           MLKIFPPK K                     S   P  P   L                 
Sbjct: 1   MLKIFPPKQKLTFIDIKINKQPLITLRSISESSSMPSTPFSSLSSSSSSSLPPRSNLTNA 60

Query: 251 XLNSKTQEEALEIFHSAT--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKF 424
            LNSKT  +AL +F+S++  +NP K+L   +AI + L  AK Y +ARCLIKD+ E L K 
Sbjct: 61  ILNSKTPNQALVLFNSSSKKLNPTKSLAPFAAIFYVLANAKLYKNARCLIKDVTENLLKS 120

Query: 425 RKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
           RKP   C SVF AL+ L       +V+  LIIA  EMG I+E
Sbjct: 121 RKPHHVCYSVFNALNSLEIPKFNPSVFSTLIIAFSEMGHIEE 162


>ref|XP_004167903.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g61400-like [Cucumis sativus]
          Length = 645

 Score = 88.6 bits (218), Expect = 9e-16
 Identities = 47/99 (47%), Positives = 64/99 (64%)
 Frame = +2

Query: 254 LNSKTQEEALEIFHSATINPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKP 433
           LN ++  +ALE F++A   P KN++++SAIIH L  +K   HAR L+ DL++ L K  KP
Sbjct: 38  LNCRSPWKALEFFNAA---PEKNIQLYSAIIHVLVGSKLLSHARYLLNDLVQNLVKSHKP 94

Query: 434 RRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
             AC   F  LS+L S   T NVYG LII LC+M L++E
Sbjct: 95  YHACQLAFSELSRLKSSKFTPNVYGELIIVLCKMELVEE 133


>ref|NP_200948.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75171473|sp|Q9FLJ4.1|PP440_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g61400 gi|9757861|dbj|BAB08495.1| unnamed protein
           product [Arabidopsis thaliana]
           gi|332010079|gb|AED97462.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 654

 Score = 68.6 bits (166), Expect = 1e-09
 Identities = 37/102 (36%), Positives = 59/102 (57%), Gaps = 3/102 (2%)
 Frame = +2

Query: 254 LNSKTQEEALEIFHSAT---INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKF 424
           L  ++ EEA ++F +++   ++   +L+  SA+IH LT A  Y  ARCLIK LIE L++ 
Sbjct: 49  LKCRSAEEAFKLFETSSRSRVSKSNDLQSFSAVIHVLTGAHKYTLARCLIKSLIERLKRH 108

Query: 425 RKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
            +P      +F AL  + S   +  V+ +LI+   EMGL +E
Sbjct: 109 SEPSNMSHRLFNALEDIQSPKFSIGVFSLLIMEFLEMGLFEE 150


>ref|XP_002864731.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297310566|gb|EFH40990.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 732

 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 37/102 (36%), Positives = 59/102 (57%), Gaps = 3/102 (2%)
 Frame = +2

Query: 254 LNSKTQEEALEIFHSATIN---PHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKF 424
           L  ++ EEA  +F +++I+      +L+  SA+IH LT A  Y  ARCLIK LIE L+++
Sbjct: 83  LKCRSAEEAFRLFETSSISRLSKTTDLQSFSAVIHVLTGAHKYTLARCLIKSLIERLRRY 142

Query: 425 RKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
            +P      +F AL  + S   +  V+ +LI+   EMGL ++
Sbjct: 143 SEPTNISHRLFNALEDIQSPKFSIGVFSLLIMEFLEMGLFED 184


>ref|XP_006394512.1| hypothetical protein EUTSA_v10005336mg [Eutrema salsugineum]
           gi|557091151|gb|ESQ31798.1| hypothetical protein
           EUTSA_v10005336mg [Eutrema salsugineum]
          Length = 665

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 39/102 (38%), Positives = 59/102 (57%), Gaps = 3/102 (2%)
 Frame = +2

Query: 254 LNSKTQEEALEIFHSAT---INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKF 424
           L  ++ EEAL +F +++   I+   +L+  SA+IH LT A+ +  ARCLIK LIE L++ 
Sbjct: 55  LKCRSAEEALRLFETSSRLKISNINDLRSFSALIHVLTGAQKFTVARCLIKRLIESLRRQ 114

Query: 425 RKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
            KP      VF AL  + +      V+ +LI+ L EMG  +E
Sbjct: 115 NKPTNVSYRVFNALEDIQTPEFCIGVFSLLIMELVEMGWFEE 156


>gb|EMJ12012.1| hypothetical protein PRUPE_ppa016961mg, partial [Prunus persica]
          Length = 548

 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 32/55 (58%), Positives = 39/55 (70%)
 Frame = +2

Query: 386 CLIKDLIEELQKFRKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550
           CLIKDLI +LQKF  P  AC  VF AL+ L S   + +V+GVLII L EMGL++E
Sbjct: 1   CLIKDLIHDLQKFCNPSLACHLVFDALNCLESSRFSPDVFGVLIIGLSEMGLVEE 55

