BLASTX nr result

ID: Catharanthus22_contig00016640 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00016640
         (578 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006352928.1| PREDICTED: pentatricopeptide repeat-containi...   133   4e-29
ref|XP_004245945.1| PREDICTED: pentatricopeptide repeat-containi...   130   2e-28
gb|EXC26223.1| hypothetical protein L484_022794 [Morus notabilis]     128   1e-27
ref|XP_002518995.1| GTP-binding  protein alpha subunit, gna, put...   125   8e-27
ref|XP_002275784.1| PREDICTED: pentatricopeptide repeat-containi...   122   9e-26
gb|EOY25237.1| Tetratricopeptide repeat-like superfamily protein...   120   3e-25
ref|XP_006432677.1| hypothetical protein CICLE_v10000638mg [Citr...   119   7e-25
ref|XP_004136211.1| PREDICTED: pentatricopeptide repeat-containi...   115   6e-24
gb|EMJ12073.1| hypothetical protein PRUPE_ppa003110mg [Prunus pe...   108   1e-21
gb|ESW30211.1| hypothetical protein PHAVU_002G134100g [Phaseolus...   105   8e-21
ref|XP_006368339.1| pentatricopeptide repeat-containing family p...   103   3e-20
ref|XP_004300183.1| PREDICTED: pentatricopeptide repeat-containi...   102   9e-20
emb|CDL67990.1| putative pentatricopeptide repeat-containing pro...   101   2e-19
ref|XP_006572948.1| PREDICTED: pentatricopeptide repeat-containi...   101   2e-19
ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containi...   100   2e-19
ref|XP_004512460.1| PREDICTED: pentatricopeptide repeat-containi...    98   2e-18
ref|XP_003533450.1| PREDICTED: pentatricopeptide repeat-containi...    96   7e-18
gb|AFK43998.1| unknown [Lotus japonicus]                               90   5e-16
ref|NP_174474.1| pentatricopeptide repeat-containing protein [Ar...    83   6e-14
dbj|BAE98745.1| hypothetical protein [Arabidopsis thaliana]            83   6e-14

>ref|XP_006352928.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Solanum tuberosum]
          Length = 605

 Score =  133 bits (334), Expect = 4e-29
 Identities = 63/94 (67%), Positives = 77/94 (81%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           MV TSVL+QT F +P++ +AK  EF+FS KEQE IS+IKKC S+ + K VHGQILKLGF+
Sbjct: 1   MVRTSVLYQTPFLIPKEYHAKAQEFNFSLKEQEWISMIKKCNSMRELKQVHGQILKLGFI 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
            SSFC+ +LL+TCALS+WGSMDYAC IF  IDDP
Sbjct: 61  CSSFCSGNLLSTCALSEWGSMDYACLIFDEIDDP 94


>ref|XP_004245945.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Solanum lycopersicum]
          Length = 605

 Score =  130 bits (328), Expect = 2e-28
 Identities = 62/94 (65%), Positives = 76/94 (80%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           MV TSVL+QT F +P++ +AK  E +FS KEQE IS+IKKC ++ + K VHGQILKLGF+
Sbjct: 1   MVRTSVLYQTPFLIPKEYHAKAQELNFSLKEQEWISMIKKCNNMRELKQVHGQILKLGFI 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
            SSFCA +LL+TCALS+WGSMDYAC IF  IDDP
Sbjct: 61  CSSFCAGNLLSTCALSEWGSMDYACLIFDEIDDP 94


>gb|EXC26223.1| hypothetical protein L484_022794 [Morus notabilis]
          Length = 605

 Score =  128 bits (321), Expect = 1e-27
 Identities = 56/94 (59%), Positives = 75/94 (79%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M GTSVL+QTH  +P  +  ++PEF  S KEQEC+SL+K+C+S+ + K +H QILK+G L
Sbjct: 1   MTGTSVLNQTHLLLPAKEPIQSPEFHLSLKEQECLSLLKRCKSVRELKQIHVQILKIGLL 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
             SFCA +L+ATCALSDWGSMDYACSIF+++ +P
Sbjct: 61  GDSFCAGNLVATCALSDWGSMDYACSIFRHVKEP 94


>ref|XP_002518995.1| GTP-binding  protein alpha subunit, gna, putative [Ricinus communis]
            gi|223541982|gb|EEF43528.1| GTP-binding protein alpha
            subunit, gna, putative [Ricinus communis]
          Length = 1203

 Score =  125 bits (314), Expect = 8e-27
 Identities = 54/94 (57%), Positives = 73/94 (77%)
 Frame = +1

Query: 295  MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
            M+GT+VLHQ H  +P +D +++PE     KEQEC+SL+K+C+++ +F+  H QILK GF 
Sbjct: 862  MIGTTVLHQIHILLPPEDPSESPEVSLRVKEQECLSLLKRCQNMEEFRQAHAQILKWGFF 921

Query: 475  WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
             + FCAS+L+ATCALS WGSMDYACSIF+ ID P
Sbjct: 922  SNPFCASNLVATCALSHWGSMDYACSIFRQIDQP 955


>ref|XP_002275784.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
           [Vitis vinifera] gi|297742017|emb|CBI33804.3| unnamed
           protein product [Vitis vinifera]
          Length = 605

 Score =  122 bits (305), Expect = 9e-26
 Identities = 54/93 (58%), Positives = 72/93 (77%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M+ TSVLHQTH  + ++D  ++PE  F   E+EC+SL+KKC ++ +FK  H +ILKLG  
Sbjct: 1   MIRTSVLHQTHVLVSREDPPQSPELSFKLGEKECVSLLKKCSNMEEFKQSHARILKLGLF 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDD 573
             SFCAS+L+ATCALSDWGSMDYACSIF+ +D+
Sbjct: 61  GDSFCASNLVATCALSDWGSMDYACSIFRQMDE 93


>gb|EOY25237.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao]
          Length = 703

 Score =  120 bits (300), Expect = 3e-25
 Identities = 55/97 (56%), Positives = 71/97 (73%)
 Frame = +1

Query: 286 EGRMVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKL 465
           + RM GTSVL QT FF    D  ++ E     KEQEC S++K+C+++ +F+  H QI+K 
Sbjct: 96  DNRMPGTSVLQQTKFFSLPADPPQSLELSLRLKEQECFSILKRCKNMEEFRQAHAQIVKW 155

Query: 466 GFLWSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
           GF W+SFCAS+L+A CALSD GSMDYACSIFQ ID+P
Sbjct: 156 GFFWNSFCASNLVAACALSDGGSMDYACSIFQQIDEP 192


>ref|XP_006432677.1| hypothetical protein CICLE_v10000638mg [Citrus clementina]
           gi|568834767|ref|XP_006471474.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g31920-like [Citrus sinensis]
           gi|557534799|gb|ESR45917.1| hypothetical protein
           CICLE_v10000638mg [Citrus clementina]
          Length = 605

 Score =  119 bits (297), Expect = 7e-25
 Identities = 52/94 (55%), Positives = 69/94 (73%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M  TSVLHQ+      ++  K PE +   KEQEC++++K C++L +FK VH  +LK GF 
Sbjct: 1   MTRTSVLHQSLLLTQPEEPPKGPELNLRLKEQECLTILKTCKNLEEFKKVHAHVLKWGFF 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
           W+ FCAS+L+ATCALS WGSMDYACSIF+ ID+P
Sbjct: 61  WNPFCASNLVATCALSHWGSMDYACSIFRQIDEP 94


>ref|XP_004136211.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Cucumis sativus]
           gi|449508034|ref|XP_004163198.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g31920-like [Cucumis sativus]
          Length = 606

 Score =  115 bits (289), Expect = 6e-24
 Identities = 55/95 (57%), Positives = 69/95 (72%), Gaps = 1/95 (1%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNP-EFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGF 471
           M+GTSVL+  H  +P  D  ++  E +   KEQE + L+KKC+SL +FK VH QILK G 
Sbjct: 1   MMGTSVLNYNHHLLPSKDLPQSSSELNLKQKEQEYLCLVKKCKSLEEFKQVHVQILKFGL 60

Query: 472 LWSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
              SFC+SS+LATCALSDW SMDYACSIFQ +D+P
Sbjct: 61  FLDSFCSSSVLATCALSDWNSMDYACSIFQQLDEP 95


>gb|EMJ12073.1| hypothetical protein PRUPE_ppa003110mg [Prunus persica]
          Length = 602

 Score =  108 bits (269), Expect = 1e-21
 Identities = 52/94 (55%), Positives = 65/94 (69%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M G  VL+QTH F+P       PE    SKEQE +SL+K+CR++ + K VH  ILKLG  
Sbjct: 1   MTGAPVLNQTHLFLPSKTPLGCPETSSRSKEQESLSLLKRCRNMEELKQVHAHILKLGHF 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
             SFCA +L+AT ALS WGSMD+ACSIFQ I++P
Sbjct: 61  CDSFCAGNLVATSALSAWGSMDHACSIFQQINEP 94


>gb|ESW30211.1| hypothetical protein PHAVU_002G134100g [Phaseolus vulgaris]
          Length = 605

 Score =  105 bits (262), Expect = 8e-21
 Identities = 50/94 (53%), Positives = 67/94 (71%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M GTSVL Q+H     ++  +N E +    EQ  +SL+K+C+S+ +FK VH QILKLG  
Sbjct: 1   MSGTSVLCQSHLLSLPNNPPQNSELNAKFNEQGWLSLLKRCKSMEEFKQVHAQILKLGLF 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
             SFC S+L+ATCALS WGSM+YACSIF+ I++P
Sbjct: 61  LDSFCGSNLVATCALSRWGSMEYACSIFRQIEEP 94


>ref|XP_006368339.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550346246|gb|ERP64908.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 602

 Score =  103 bits (257), Expect = 3e-20
 Identities = 50/95 (52%), Positives = 66/95 (69%), Gaps = 1/95 (1%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M+GT VL +  F    +   +        KEQEC+SL+K+C+++ +FK VH Q+LK    
Sbjct: 1   MIGTPVLQKIRFLSLPEAPTQTTGLSLKLKEQECLSLMKRCKNMEEFKQVHAQVLK---- 56

Query: 475 W-SSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
           W +SFCAS+L+ATCALSDWGSMDYACSIF+ ID P
Sbjct: 57  WENSFCASNLVATCALSDWGSMDYACSIFRQIDQP 91


>ref|XP_004300183.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Fragaria vesca subsp. vesca]
          Length = 606

 Score =  102 bits (253), Expect = 9e-20
 Identities = 51/95 (53%), Positives = 66/95 (69%), Gaps = 1/95 (1%)
 Frame = +1

Query: 295 MVGTSVLH-QTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGF 471
           M GT+VL+ QTH  +P  D   + E  F  KEQE +SL+K+C++L +FK VH  ILKLG 
Sbjct: 1   MTGTTVLNLQTHLLLPVKDPPGSQELSFRLKEQESLSLLKRCKNLEEFKQVHSHILKLGV 60

Query: 472 LWSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
              SF A +L+AT  LS WGSMDYACSIF+ I++P
Sbjct: 61  SCDSFVAGNLVATNVLSAWGSMDYACSIFEQIEEP 95


>emb|CDL67990.1| putative pentatricopeptide repeat-containing protein At1g31920,
           partial [Olea europaea]
          Length = 199

 Score =  101 bits (251), Expect = 2e-19
 Identities = 41/57 (71%), Positives = 52/57 (91%)
 Frame = +1

Query: 406 IKKCRSLMDFKLVHGQILKLGFLWSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
           +KKC+++ +FK +HGQI+K GFLWSSFC+S+LLATCALS+WGSMDYACSIF  I+DP
Sbjct: 3   LKKCKNMQEFKQIHGQIIKFGFLWSSFCSSNLLATCALSEWGSMDYACSIFHQIEDP 59


>ref|XP_006572948.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Glycine max]
          Length = 605

 Score =  101 bits (251), Expect = 2e-19
 Identities = 47/94 (50%), Positives = 66/94 (70%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M GTSVL Q+H     +   ++ E +    EQ  +SL+K+C+S+ +FK VH  ILKLG  
Sbjct: 1   MSGTSVLCQSHLLSLPNSPLQSSELNAKFNEQGWLSLLKRCKSMEEFKKVHAHILKLGLF 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
           + SFC S+L+A+CALS WGSM+YACSIF+ I++P
Sbjct: 61  YDSFCGSNLVASCALSRWGSMEYACSIFRQIEEP 94


>ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Glycine max]
          Length = 605

 Score =  100 bits (250), Expect = 2e-19
 Identities = 47/94 (50%), Positives = 65/94 (69%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M GTSVL Q+H     +   ++ E +    EQ  +SL+K+C+S+ +FK VH  ILKLG  
Sbjct: 1   MSGTSVLCQSHLLSLPNSPPQSSELNAKFNEQGWLSLLKRCKSMEEFKQVHAHILKLGLF 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
           + SFC S+L+A+CALS WGSM+YACSIF  I++P
Sbjct: 61  YDSFCGSNLVASCALSRWGSMEYACSIFSQIEEP 94


>ref|XP_004512460.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Cicer arietinum]
          Length = 606

 Score = 97.8 bits (242), Expect = 2e-18
 Identities = 44/94 (46%), Positives = 64/94 (68%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M GT+ L+QTHF +  ++  ++ E   S  E+  + L+K+C ++ +FK VH   LK G  
Sbjct: 1   MTGTTALNQTHFLLLTNNSHQSFELSKSFNEKGWLCLLKRCNNMEEFKQVHAYFLKCGIF 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
           + SFC S+L+ATCAL+ WGSMDYACSIF  I++P
Sbjct: 61  FDSFCGSNLVATCALTKWGSMDYACSIFTQIEEP 94


>ref|XP_003533450.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Glycine max]
          Length = 604

 Score = 95.9 bits (237), Expect = 7e-18
 Identities = 47/94 (50%), Positives = 66/94 (70%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M  TSVL Q+HF    ++  ++ E +     Q  +SL+K+C+S+ +FK VH  ILKLG  
Sbjct: 1   MSWTSVLCQSHFLSLPNNPPQSSELNAKFNVQG-LSLLKRCKSMEEFKQVHAHILKLGLF 59

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNIDDP 576
           + SFC S+L+ATCALS WGSM+YACSIF+ I++P
Sbjct: 60  YDSFCGSNLVATCALSRWGSMEYACSIFRQIEEP 93


>gb|AFK43998.1| unknown [Lotus japonicus]
          Length = 94

 Score = 89.7 bits (221), Expect = 5e-16
 Identities = 43/91 (47%), Positives = 56/91 (61%)
 Frame = +1

Query: 295 MVGTSVLHQTHFFMPQDDYAKNPEFDFSSKEQECISLIKKCRSLMDFKLVHGQILKLGFL 474
           M  T+VL QTH         +  E      EQ    L+K+C+S+ +FK VH  +LKLGF 
Sbjct: 1   MTRTTVLSQTHLLSLPSTPPQCSELSTRFNEQGWYPLLKRCKSMEEFKQVHAHVLKLGFF 60

Query: 475 WSSFCASSLLATCALSDWGSMDYACSIFQNI 567
             SFC S+L+ATCAL+ WGSM+YACSIF  +
Sbjct: 61  CDSFCGSNLVATCALAKWGSMEYACSIFSRL 91


>ref|NP_174474.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75169173|sp|Q9C6T2.1|PPR68_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g31920 gi|12321292|gb|AAG50713.1|AC079041_6
           PPR-repeat protein, putative [Arabidopsis thaliana]
           gi|332193295|gb|AEE31416.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 606

 Score = 82.8 bits (203), Expect = 6e-14
 Identities = 44/82 (53%), Positives = 57/82 (69%), Gaps = 3/82 (3%)
 Frame = +1

Query: 340 QDDYAKNPEFD-FSSKEQECISLIKKCRSLMDFKLVHGQILKLG-FLWSSFCASSLLATC 513
           +DD   NPE + F  KEQEC+ L+K+C ++ +FK VH + +KL  F  SSF ASS+LA C
Sbjct: 14  RDDLTHNPEVNNFGGKEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKC 73

Query: 514 ALSDW-GSMDYACSIFQNIDDP 576
           A S W  SM+YA SIF+ IDDP
Sbjct: 74  AHSGWENSMNYAASIFRGIDDP 95


>dbj|BAE98745.1| hypothetical protein [Arabidopsis thaliana]
          Length = 527

 Score = 82.8 bits (203), Expect = 6e-14
 Identities = 44/82 (53%), Positives = 57/82 (69%), Gaps = 3/82 (3%)
 Frame = +1

Query: 340 QDDYAKNPEFD-FSSKEQECISLIKKCRSLMDFKLVHGQILKLG-FLWSSFCASSLLATC 513
           +DD   NPE + F  KEQEC+ L+K+C ++ +FK VH + +KL  F  SSF ASS+LA C
Sbjct: 14  RDDLTHNPEVNNFGGKEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKC 73

Query: 514 ALSDW-GSMDYACSIFQNIDDP 576
           A S W  SM+YA SIF+ IDDP
Sbjct: 74  AHSGWENSMNYAASIFRGIDDP 95


Top