BLASTX nr result

ID: Catharanthus23_contig00018526 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00018526
         (1265 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004247885.1| PREDICTED: pentatricopeptide repeat-containi...   345   2e-92
ref|XP_002276778.1| PREDICTED: pentatricopeptide repeat-containi...   345   2e-92
ref|XP_006360926.1| PREDICTED: pentatricopeptide repeat-containi...   343   9e-92
ref|XP_002281924.1| PREDICTED: pentatricopeptide repeat-containi...   342   2e-91
ref|XP_002517255.1| conserved hypothetical protein [Ricinus comm...   337   5e-90
ref|XP_002875809.1| pentatricopeptide repeat-containing protein ...   333   7e-89
gb|EOY33905.1| Pentatricopeptide repeat superfamily protein [The...   333   9e-89
gb|EMJ06956.1| hypothetical protein PRUPE_ppa010266mg [Prunus pe...   333   1e-88
ref|XP_006404445.1| hypothetical protein EUTSA_v10010650mg [Eutr...   332   2e-88
ref|XP_004291520.1| PREDICTED: pentatricopeptide repeat-containi...   330   6e-88
ref|XP_002314145.2| pentatricopeptide repeat-containing family p...   330   8e-88
ref|NP_190271.1| pentatricopeptide repeat-containing protein [Ar...   330   8e-88
ref|XP_006291711.1| hypothetical protein CARUB_v10017876mg [Caps...   328   2e-87
ref|XP_003537970.1| PREDICTED: pentatricopeptide repeat-containi...   326   1e-86
ref|XP_006858620.1| hypothetical protein AMTR_s00066p00023060 [A...   326   1e-86
dbj|BAK03258.1| predicted protein [Hordeum vulgare subsp. vulgare]    325   2e-86
ref|XP_006591821.1| PREDICTED: uncharacterized protein LOC100306...   325   3e-86
ref|XP_004145279.1| PREDICTED: pentatricopeptide repeat-containi...   325   3e-86
ref|XP_004507153.1| PREDICTED: pentatricopeptide repeat-containi...   323   1e-85
gb|ESW04024.1| hypothetical protein PHAVU_011G060800g [Phaseolus...   322   3e-85

>ref|XP_004247885.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like [Solanum lycopersicum]
          Length = 256

 Score =  345 bits (886), Expect = 2e-92
 Identities = 175/254 (68%), Positives = 199/254 (78%), Gaps = 1/254 (0%)
 Frame = +3

Query: 117 ISRARINSTPIFHILPRLSYISHPFTRKQMSFSLLSPPPWY-NYARFGVLRVQAYHDGRP 293
           ISR +I S     +L  LS    P     ++ S    P W   ++R  V  V+ YHDGRP
Sbjct: 6   ISRLKIPSL----LLKHLSVSPLPSISSTLTSSSSCSPSWVPEFSRSSVRDVRWYHDGRP 61

Query: 294 RGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQE 473
           RG LWRGKK+IGKEAL+VI+GL+RFKDDEEKL KF+KTHV RLLK+DMIAVLNELERQ+E
Sbjct: 62  RGSLWRGKKLIGKEALFVILGLRRFKDDEEKLDKFVKTHVLRLLKMDMIAVLNELERQEE 121

Query: 474 VSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYT 653
           VSLAVK+F V+QKQ WY+PDVYLYKDLII+LAR +KMD  M+LWE M+KEDLFPD QT+T
Sbjct: 122 VSLAVKVFWVIQKQAWYQPDVYLYKDLIIALARRRKMDDAMKLWESMRKEDLFPDCQTFT 181

Query: 654 EVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFP 833
           EVIRGFLRDGSP DAMNI+EDMKKSP PPEELPFR+           RNRVKQDFEEIFP
Sbjct: 182 EVIRGFLRDGSPADAMNIFEDMKKSPYPPEELPFRVLLKGLLPHPLLRNRVKQDFEEIFP 241

Query: 834 DRHIYDPPEEIFGL 875
           DRHIYDPPEEIFGL
Sbjct: 242 DRHIYDPPEEIFGL 255


>ref|XP_002276778.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870
           [Vitis vinifera] gi|297745785|emb|CBI15841.3| unnamed
           protein product [Vitis vinifera]
          Length = 252

 Score =  345 bits (886), Expect = 2e-92
 Identities = 170/239 (71%), Positives = 193/239 (80%), Gaps = 9/239 (3%)
 Frame = +3

Query: 189 FTRKQMSFSLLSPPPWYNYARFGVLRVQA---------YHDGRPRGPLWRGKKMIGKEAL 341
           F+ K +   + SP       +F VL V +         YHDGRPRGPLWRGKK+IGKEAL
Sbjct: 14  FSAKILQSLIKSPIKESTKFQFPVLEVASNKPLFGLKHYHDGRPRGPLWRGKKLIGKEAL 73

Query: 342 YVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEW 521
           +VI+GLKRFKDDEEKL+KFIK+HV RLLK+DMIAVL ELERQ+EV+LAV++F+V QKQ+W
Sbjct: 74  FVILGLKRFKDDEEKLRKFIKSHVLRLLKMDMIAVLTELERQEEVTLAVEVFRVFQKQDW 133

Query: 522 YKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAM 701
           YKPDVYLYKDLII+LA+ KKMD  M+LWE M+KEDLFPD QTYTEVIRGFLR GSP DAM
Sbjct: 134 YKPDVYLYKDLIIALAKCKKMDNAMQLWESMRKEDLFPDYQTYTEVIRGFLRHGSPADAM 193

Query: 702 NIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFGLH 878
           NIYEDMKKSPDPPEELPFRI           RNRVKQDFEEIFPDRH+YDPPEEIFG+H
Sbjct: 194 NIYEDMKKSPDPPEELPFRILLKGLLPHPLLRNRVKQDFEEIFPDRHVYDPPEEIFGIH 252


>ref|XP_006360926.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like [Solanum tuberosum]
          Length = 256

 Score =  343 bits (880), Expect = 9e-92
 Identities = 174/254 (68%), Positives = 198/254 (77%), Gaps = 1/254 (0%)
 Frame = +3

Query: 117 ISRARINSTPIFHILPRLSYISHPFTRKQMSFSLLSPPPWY-NYARFGVLRVQAYHDGRP 293
           ISR +I S     +L  LS    P     ++ S      W    +R  V  V+ YHDGRP
Sbjct: 6   ISRLKIPSL----LLKHLSVSPLPSVSSTLTSSSSCSTSWVPELSRSSVRDVRWYHDGRP 61

Query: 294 RGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQE 473
           RGPLWRGKK+IGKEAL+VI+GL+RF+DDEEKL KF+KTHV RLLK+DMIAVLNELERQ+E
Sbjct: 62  RGPLWRGKKLIGKEALFVILGLRRFRDDEEKLDKFVKTHVLRLLKMDMIAVLNELERQEE 121

Query: 474 VSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYT 653
           VSLAVK+F V+QKQ WY+PDVYLYKDLII+LAR +KMD  M+LWE M+KEDLFPD QT+T
Sbjct: 122 VSLAVKVFWVIQKQAWYQPDVYLYKDLIIALARRRKMDDAMKLWESMRKEDLFPDCQTFT 181

Query: 654 EVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFP 833
           EVIRGFLRDGSP DAMNI+EDMKKSP PPEELPFR+           RNRVKQDFEEIFP
Sbjct: 182 EVIRGFLRDGSPADAMNIFEDMKKSPYPPEELPFRVLLKGLLPHPLLRNRVKQDFEEIFP 241

Query: 834 DRHIYDPPEEIFGL 875
           DRHIYDPPEEIFGL
Sbjct: 242 DRHIYDPPEEIFGL 255


>ref|XP_002281924.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870
           isoform 1 [Vitis vinifera]
           gi|359478900|ref|XP_003632183.1| PREDICTED:
           pentatricopeptide repeat-containing protein At3g46870
           isoform 2 [Vitis vinifera] gi|297745987|emb|CBI16043.3|
           unnamed protein product [Vitis vinifera]
          Length = 252

 Score =  342 bits (877), Expect = 2e-91
 Identities = 161/207 (77%), Positives = 183/207 (88%)
 Frame = +3

Query: 258 VLRVQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDM 437
           +L  + YHDGRPRGPLWRGKK+IGKEAL+VI+GLKRFKDDEE L+KFIK+HV RLLK+DM
Sbjct: 46  LLGSKLYHDGRPRGPLWRGKKLIGKEALFVILGLKRFKDDEENLRKFIKSHVLRLLKMDM 105

Query: 438 IAVLNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMK 617
           +AVL ELERQ+EVSLAV++F+V++KQ+WYKPDVYLYKDLII+LA+ KKMD  M+LWE M+
Sbjct: 106 VAVLTELERQEEVSLAVEVFRVIRKQDWYKPDVYLYKDLIIALAKCKKMDDAMQLWESMR 165

Query: 618 KEDLFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXR 797
           KEDLFPD QTYTEVIRGFLR GSP DAMNIYEDMKKSPDPPEELPFRI           R
Sbjct: 166 KEDLFPDYQTYTEVIRGFLRYGSPADAMNIYEDMKKSPDPPEELPFRILLKGLLPHPLLR 225

Query: 798 NRVKQDFEEIFPDRHIYDPPEEIFGLH 878
           NRVKQDFEEIFPDRH+YDPPEEIFG+H
Sbjct: 226 NRVKQDFEEIFPDRHVYDPPEEIFGVH 252


>ref|XP_002517255.1| conserved hypothetical protein [Ricinus communis]
           gi|223543626|gb|EEF45155.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 258

 Score =  337 bits (865), Expect = 5e-90
 Identities = 157/213 (73%), Positives = 184/213 (86%)
 Frame = +3

Query: 237 YNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVF 416
           +N++      ++ YHDGRPRGPLWRGKK+IGKEAL+VI+GLKRFKD+EEKL KFIKTHV 
Sbjct: 45  HNHSHNSFSGLRQYHDGRPRGPLWRGKKLIGKEALFVILGLKRFKDEEEKLDKFIKTHVL 104

Query: 417 RLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTM 596
           RLLK+DMIAVL ELERQ+EVSLA K+F+++QKQ+WY PDVYLYKDLII+L RS KMDQ M
Sbjct: 105 RLLKMDMIAVLTELERQEEVSLATKVFQIIQKQDWYNPDVYLYKDLIIALTRSGKMDQAM 164

Query: 597 ELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXX 776
           +LWE M+ E+LFPDSQ YTE+IRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRI     
Sbjct: 165 KLWEAMRSENLFPDSQMYTELIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRILLKGL 224

Query: 777 XXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFGL 875
                 RNRVKQD+EE+FP++H+YDPPEEIFG+
Sbjct: 225 LPHPLLRNRVKQDYEELFPEKHVYDPPEEIFGV 257


>ref|XP_002875809.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297321647|gb|EFH52068.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 257

 Score =  333 bits (855), Expect = 7e-89
 Identities = 157/220 (71%), Positives = 186/220 (84%)
 Frame = +3

Query: 213 SLLSPPPWYNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQ 392
           +LL P P   +  F    V  +HDGRPRGPLWRGKK+IGKEAL+VI+GLKR KDD+EKLQ
Sbjct: 40  TLLHPIPPKPFTVF----VSRFHDGRPRGPLWRGKKLIGKEALFVILGLKRLKDDDEKLQ 95

Query: 393 KFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLAR 572
           KFIKTHVFRLLKLDM+AV+ ELERQ+E +LA+KMF+V+QKQEWY+PDV++YKDLI+SLA+
Sbjct: 96  KFIKTHVFRLLKLDMLAVIGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAK 155

Query: 573 SKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELP 752
           SK+MD+ M LWE MKKE+LFPDSQTYTEVIRGFLRDG P DAMN+YEDM KSPDPPEELP
Sbjct: 156 SKRMDEAMALWEKMKKENLFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELP 215

Query: 753 FRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFG 872
           FR+           RN+VK+DFEE+FP++H YDPPEEIFG
Sbjct: 216 FRVLLKGLLPHPLLRNKVKKDFEELFPEKHAYDPPEEIFG 255


>gb|EOY33905.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao]
          Length = 287

 Score =  333 bits (854), Expect = 9e-89
 Identities = 154/202 (76%), Positives = 181/202 (89%)
 Frame = +3

Query: 267 VQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAV 446
           ++ YHDGRPRGPLW+GKK+IGKEAL+VI+GLKRFKDD++KLQKFIKTHV RLLK+++IAV
Sbjct: 84  LKQYHDGRPRGPLWKGKKLIGKEALFVILGLKRFKDDDDKLQKFIKTHVLRLLKMELIAV 143

Query: 447 LNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKED 626
           L ELERQ+E SLAVK+F+V+QKQ+WYKPDVYLYKDLII+LAR KKMD+ M+LWE M+KE+
Sbjct: 144 LTELERQEETSLAVKVFQVIQKQDWYKPDVYLYKDLIIALARFKKMDEAMKLWEYMRKEE 203

Query: 627 LFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRV 806
           LFPDSQTYTE+IRGFLRDGSP DAMNIYEDM KSPDPPEELPFRI           RN+V
Sbjct: 204 LFPDSQTYTEIIRGFLRDGSPADAMNIYEDMIKSPDPPEELPFRILLKGLLPHPLLRNKV 263

Query: 807 KQDFEEIFPDRHIYDPPEEIFG 872
           K+DFEE+FP++H YDPPEEIFG
Sbjct: 264 KKDFEELFPEKHAYDPPEEIFG 285


>gb|EMJ06956.1| hypothetical protein PRUPE_ppa010266mg [Prunus persica]
          Length = 256

 Score =  333 bits (853), Expect = 1e-88
 Identities = 158/200 (79%), Positives = 174/200 (87%)
 Frame = +3

Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455
           YHDGRPRGPLWRGKK+IGKEALYVI GLKRFKDDEEKL KFIK HV RLLK+D+IAVL E
Sbjct: 55  YHDGRPRGPLWRGKKLIGKEALYVISGLKRFKDDEEKLGKFIKNHVLRLLKMDLIAVLTE 114

Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635
           LERQ+EV+LA+K+F V++KQ+WY PDVYLYKDLIISLARSKKMD  M LW+ MKKEDLFP
Sbjct: 115 LERQEEVNLAIKVFNVIRKQDWYNPDVYLYKDLIISLARSKKMDDVMLLWDGMKKEDLFP 174

Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815
           DSQTYTEVIRGFL  GSP DAMNIYEDMK SPDPPEELPFRI           RNRVKQD
Sbjct: 175 DSQTYTEVIRGFLSSGSPADAMNIYEDMKNSPDPPEELPFRILLKGLLPHPLLRNRVKQD 234

Query: 816 FEEIFPDRHIYDPPEEIFGL 875
           FEE+FP++H+YDPPEEIFG+
Sbjct: 235 FEELFPEQHVYDPPEEIFGV 254


>ref|XP_006404445.1| hypothetical protein EUTSA_v10010650mg [Eutrema salsugineum]
           gi|557105564|gb|ESQ45898.1| hypothetical protein
           EUTSA_v10010650mg [Eutrema salsugineum]
          Length = 257

 Score =  332 bits (851), Expect = 2e-88
 Identities = 157/237 (66%), Positives = 191/237 (80%)
 Frame = +3

Query: 162 PRLSYISHPFTRKQMSFSLLSPPPWYNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKEAL 341
           P +  IS  +  K  +   +SP P+  +A         +HDGRPRGPLWRGKK+IGKEAL
Sbjct: 26  PTIHRISFSYLIKPKTLHPVSPKPFTVFAA-------QFHDGRPRGPLWRGKKLIGKEAL 78

Query: 342 YVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEW 521
           +VI+GLKR K+D+EKL KFIKTHVFRLLKLDM+AV+ ELERQ+E +LA+KMF+V+QKQEW
Sbjct: 79  FVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQEETALAIKMFEVIQKQEW 138

Query: 522 YKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAM 701
           Y+PDV++YKDLI+SLA+SK+MD+ M LWE MKKE+LFPDSQTYTEVIRGFLRDG P DAM
Sbjct: 139 YQPDVFMYKDLIVSLAKSKRMDEAMGLWEKMKKENLFPDSQTYTEVIRGFLRDGCPADAM 198

Query: 702 NIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFG 872
           N+YEDM KSPDPPEELPFR+           RN+VK+DFEE+FP++H YDPPEEIFG
Sbjct: 199 NVYEDMLKSPDPPEELPFRVLLKGLLPHPLLRNKVKKDFEELFPEKHAYDPPEEIFG 255


>ref|XP_004291520.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like [Fragaria vesca subsp. vesca]
          Length = 255

 Score =  330 bits (847), Expect = 6e-88
 Identities = 156/199 (78%), Positives = 175/199 (87%)
 Frame = +3

Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455
           YHDGRPRGPLWRGKK+IGKEALYVI GLKRFKDDEE L KFIK+HV RLLKLDMIAVL E
Sbjct: 55  YHDGRPRGPLWRGKKLIGKEALYVISGLKRFKDDEETLGKFIKSHVLRLLKLDMIAVLTE 114

Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635
           LERQ+EV+LA+K+F V++KQ+WYKPDVYLYKDLIISLA+SKKMD  M LW+ MKKEDLFP
Sbjct: 115 LERQEEVNLAIKVFNVIRKQDWYKPDVYLYKDLIISLAKSKKMDDVMVLWDCMKKEDLFP 174

Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815
           DSQT+TEVIRGFL  GSP DAMNIYEDMK+SPDPPE+LPFRI           RNRVKQD
Sbjct: 175 DSQTFTEVIRGFLSTGSPADAMNIYEDMKQSPDPPEQLPFRILLKGLLPHPLLRNRVKQD 234

Query: 816 FEEIFPDRHIYDPPEEIFG 872
           FEE+FP++H+YDPP+EIFG
Sbjct: 235 FEELFPEQHVYDPPQEIFG 253


>ref|XP_002314145.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550331008|gb|EEE88100.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 257

 Score =  330 bits (846), Expect = 8e-88
 Identities = 167/256 (65%), Positives = 203/256 (79%), Gaps = 3/256 (1%)
 Frame = +3

Query: 117 ISRARI-NSTPIFHILPRLSYISHPFTRKQMSFSLLSPPPWYNYARFGV-LRVQAYHDGR 290
           +SR++I N + +  IL  L+       + Q+  S  S     N   F V   ++ YHDGR
Sbjct: 6   LSRSKIPNFSSVIVILQNLTTKQSIIDQTQLPLSKAS-----NIQSFLVPAGLRQYHDGR 60

Query: 291 PRGPLWRGKKMIGKEALYVIMGLKRFK-DDEEKLQKFIKTHVFRLLKLDMIAVLNELERQ 467
           PRGPLWRGKK+IGKEAL+VI+GLKRFK DD+EKL +FIKTHVFRLLKLDMIAVL+ELERQ
Sbjct: 61  PRGPLWRGKKLIGKEALFVILGLKRFKNDDDEKLDRFIKTHVFRLLKLDMIAVLSELERQ 120

Query: 468 QEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQT 647
           +EVSLAVK+F+V+QKQ+WYKPDVYLYKDLI++L ++ KM++ M+LWE M+ EDLFPDSQ 
Sbjct: 121 EEVSLAVKIFRVIQKQDWYKPDVYLYKDLIMALLKTGKMEEAMKLWEDMRNEDLFPDSQM 180

Query: 648 YTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEI 827
           YTE IRG+LRDGSP DAMNIYEDMKKSPDPPEELPFRI           RNRVKQD+EE+
Sbjct: 181 YTEAIRGYLRDGSPADAMNIYEDMKKSPDPPEELPFRILLKGLLPHPLLRNRVKQDYEEL 240

Query: 828 FPDRHIYDPPEEIFGL 875
           FP++H+YDPPEEIFG+
Sbjct: 241 FPEKHVYDPPEEIFGI 256


>ref|NP_190271.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75266318|sp|Q9STF9.1|PP266_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At3g46870 gi|545719887|pdb|4LEU|A Chain A, Crystal
           Structure Of Tha8-like Protein From Arabidopsis Thaliana
           gi|5541672|emb|CAB51178.1| putative protein [Arabidopsis
           thaliana] gi|26450732|dbj|BAC42475.1| unknown protein
           [Arabidopsis thaliana] gi|28950815|gb|AAO63331.1|
           At3g46870 [Arabidopsis thaliana]
           gi|332644692|gb|AEE78213.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 257

 Score =  330 bits (846), Expect = 8e-88
 Identities = 155/220 (70%), Positives = 185/220 (84%)
 Frame = +3

Query: 213 SLLSPPPWYNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQ 392
           +LL P P   +  F    V  +HDGRPRGPLWRGKK+IGKEAL+VI+GLKR K+D+EKL 
Sbjct: 40  TLLHPIPPKPFTVF----VSRFHDGRPRGPLWRGKKLIGKEALFVILGLKRLKEDDEKLD 95

Query: 393 KFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLAR 572
           KFIKTHVFRLLKLDM+AV+ ELERQ+E +LA+KMF+V+QKQEWY+PDV++YKDLI+SLA+
Sbjct: 96  KFIKTHVFRLLKLDMLAVIGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAK 155

Query: 573 SKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELP 752
           SK+MD+ M LWE MKKE+LFPDSQTYTEVIRGFLRDG P DAMN+YEDM KSPDPPEELP
Sbjct: 156 SKRMDEAMALWEKMKKENLFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELP 215

Query: 753 FRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFG 872
           FR+           RN+VK+DFEE+FP++H YDPPEEIFG
Sbjct: 216 FRVLLKGLLPHPLLRNKVKKDFEELFPEKHAYDPPEEIFG 255


>ref|XP_006291711.1| hypothetical protein CARUB_v10017876mg [Capsella rubella]
           gi|565467664|ref|XP_006291712.1| hypothetical protein
           CARUB_v10017876mg [Capsella rubella]
           gi|482560418|gb|EOA24609.1| hypothetical protein
           CARUB_v10017876mg [Capsella rubella]
           gi|482560419|gb|EOA24610.1| hypothetical protein
           CARUB_v10017876mg [Capsella rubella]
          Length = 257

 Score =  328 bits (842), Expect = 2e-87
 Identities = 150/202 (74%), Positives = 178/202 (88%)
 Frame = +3

Query: 267 VQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAV 446
           V  +HDGRPRGPLWRGKK+IGKEAL+VI+GLKR K+D+EKLQKFIKTHV RLLKLDM+AV
Sbjct: 54  VSRFHDGRPRGPLWRGKKLIGKEALFVILGLKRLKEDDEKLQKFIKTHVLRLLKLDMLAV 113

Query: 447 LNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKED 626
           + ELERQ+E +LA+KMF+V+QKQEWY+PDV++YKDLI+SLA+SK+MD+ M LWE MKKE+
Sbjct: 114 IGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMGLWEKMKKEN 173

Query: 627 LFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRV 806
           LFPDSQTYTEVIRGFLRDG P DAMN+YEDM KSPDPPEELPFR+           RN+V
Sbjct: 174 LFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELPFRVLLKGLLPHPLLRNKV 233

Query: 807 KQDFEEIFPDRHIYDPPEEIFG 872
           K+DFEE+FP++H YDPPEEIFG
Sbjct: 234 KKDFEELFPEKHAYDPPEEIFG 255


>ref|XP_003537970.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like [Glycine max]
          Length = 255

 Score =  326 bits (836), Expect = 1e-86
 Identities = 154/199 (77%), Positives = 172/199 (86%)
 Frame = +3

Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455
           YHDGRPRGPLW+GKK+IGKEAL+VI G KRF DDE+KL KFIKTHV RLLK+DMIAVL E
Sbjct: 55  YHDGRPRGPLWKGKKLIGKEALFVISGFKRFNDDEDKLHKFIKTHVLRLLKMDMIAVLTE 114

Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635
           LERQ++VSLA+ MFKVMQKQ+WYKPD YLYKDLII+LAR+KKMD+ ++LWE M+KE+LFP
Sbjct: 115 LERQEQVSLALMMFKVMQKQDWYKPDAYLYKDLIIALARAKKMDEVLQLWESMRKENLFP 174

Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815
           DSQTYTEVIRGFL  GSP DAMNIYEDMK SPDPPEELPFRI           RN+VKQD
Sbjct: 175 DSQTYTEVIRGFLNYGSPADAMNIYEDMKNSPDPPEELPFRILLKGLLPHPLLRNKVKQD 234

Query: 816 FEEIFPDRHIYDPPEEIFG 872
           FEEIFPD  IYDPP+EIFG
Sbjct: 235 FEEIFPDSSIYDPPQEIFG 253


>ref|XP_006858620.1| hypothetical protein AMTR_s00066p00023060 [Amborella trichopoda]
           gi|548862731|gb|ERN20087.1| hypothetical protein
           AMTR_s00066p00023060 [Amborella trichopoda]
          Length = 265

 Score =  326 bits (835), Expect = 1e-86
 Identities = 149/203 (73%), Positives = 178/203 (87%)
 Frame = +3

Query: 267 VQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAV 446
           ++ YHDGRPRGPLWRGKK+IGKEAL++I GLKRFKDDE +L KF+K+HV RLLK+DM+AV
Sbjct: 62  MRCYHDGRPRGPLWRGKKLIGKEALFIISGLKRFKDDEGQLDKFVKSHVSRLLKMDMVAV 121

Query: 447 LNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKED 626
           L ELERQ+EV LA+K+F+V+QK+ WYKPDVYLYKDLII+LARSK+M+  M++WE M++ED
Sbjct: 122 LCELERQEEVILALKIFRVIQKENWYKPDVYLYKDLIIALARSKRMEDAMQIWECMRRED 181

Query: 627 LFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRV 806
           LFPDSQTYTEVIRGFLR GSP DAMNIYEDMK SP+PPEELP+R+           RNR+
Sbjct: 182 LFPDSQTYTEVIRGFLRYGSPADAMNIYEDMKNSPEPPEELPYRVLLKGLLPHPLLRNRI 241

Query: 807 KQDFEEIFPDRHIYDPPEEIFGL 875
           KQDFEE+FPDRH+YDPPEEIFGL
Sbjct: 242 KQDFEEMFPDRHVYDPPEEIFGL 264


>dbj|BAK03258.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 251

 Score =  325 bits (833), Expect = 2e-86
 Identities = 156/241 (64%), Positives = 190/241 (78%), Gaps = 2/241 (0%)
 Frame = +3

Query: 162 PRLSYIS--HPFTRKQMSFSLLSPPPWYNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKE 335
           P++  +S  H   R ++   LL P P+  + R       A+HDGRPRGPLWR KK+IGKE
Sbjct: 18  PKIPTLSPPHGLFRGEVPVRLLPPQPFGEWRR-------AFHDGRPRGPLWRSKKLIGKE 70

Query: 336 ALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQ 515
           AL+ I GLKRFK DEEKL++FIK HV RLLK D +AVL ELERQ+EV L+VKMF+++QK+
Sbjct: 71  ALFAIQGLKRFKGDEEKLREFIKRHVARLLKADKLAVLGELERQEEVDLSVKMFRIIQKE 130

Query: 516 EWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGD 695
           +WYKPDVY+YKDLIISLA+ KKMD+ M++W  MK+E+LFPDSQTY EVIRGFLR GSP D
Sbjct: 131 DWYKPDVYMYKDLIISLAKCKKMDEAMDIWGNMKEENLFPDSQTYAEVIRGFLRYGSPSD 190

Query: 696 AMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFGL 875
           AMNIYEDMK+SPDPPEELPFR+           RNRVKQDFEE+FP+RHIYDPP++IFG+
Sbjct: 191 AMNIYEDMKRSPDPPEELPFRVLLKGLLPHPLLRNRVKQDFEELFPERHIYDPPDDIFGM 250

Query: 876 H 878
           H
Sbjct: 251 H 251


>ref|XP_006591821.1| PREDICTED: uncharacterized protein LOC100306428 isoform X1 [Glycine
           max]
          Length = 255

 Score =  325 bits (832), Expect = 3e-86
 Identities = 153/199 (76%), Positives = 172/199 (86%)
 Frame = +3

Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455
           YHDGRPRGPLW+GKK+IGKEAL+VI G KRF DDE+KL KFIKTHV RLLK+DMIAVL E
Sbjct: 55  YHDGRPRGPLWKGKKLIGKEALFVISGFKRFNDDEDKLHKFIKTHVLRLLKMDMIAVLTE 114

Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635
           LERQ++VSLA+ MFKVMQKQ+WYKPD YLYKDLII+LAR+KKMD+ ++LWE M++E+LFP
Sbjct: 115 LERQEQVSLALMMFKVMQKQDWYKPDAYLYKDLIIALARAKKMDEVLQLWESMREENLFP 174

Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815
           DSQTYTEVIRGFL  GSP DAMNIYEDMK SPDPPEELPFRI           RN+VKQD
Sbjct: 175 DSQTYTEVIRGFLNYGSPADAMNIYEDMKNSPDPPEELPFRILLKGLLPHPLLRNKVKQD 234

Query: 816 FEEIFPDRHIYDPPEEIFG 872
           FEEIFPD  IYDPP+EIFG
Sbjct: 235 FEEIFPDSSIYDPPQEIFG 253


>ref|XP_004145279.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like [Cucumis sativus]
           gi|449475090|ref|XP_004154371.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At3g46870-like [Cucumis sativus]
           gi|449508514|ref|XP_004163333.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At3g46870-like [Cucumis sativus]
          Length = 255

 Score =  325 bits (832), Expect = 3e-86
 Identities = 150/200 (75%), Positives = 177/200 (88%)
 Frame = +3

Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455
           YHDGRPRGPLWR +K IGKEAL+VI GLKRFK+DEEK +KF+K+HV RLLKLDM+AVL E
Sbjct: 55  YHDGRPRGPLWRSRKAIGKEALFVIQGLKRFKEDEEKFEKFMKSHVSRLLKLDMVAVLGE 114

Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635
           LERQ+EV+LAVK+F++++KQ+WYKPDVY+YKDLII+LARSKKMD  M+LWE M++E+LFP
Sbjct: 115 LERQEEVALAVKIFRLIRKQDWYKPDVYIYKDLIIALARSKKMDDAMKLWESMREENLFP 174

Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815
           DSQTYTEVIRGFLR GSP DAMN+YEDMKKSPDPP+ELPFRI           RNRVKQD
Sbjct: 175 DSQTYTEVIRGFLRYGSPSDAMNVYEDMKKSPDPPDELPFRILLKGLLPHPLLRNRVKQD 234

Query: 816 FEEIFPDRHIYDPPEEIFGL 875
           FEE+FPD+H++DPPEEIF L
Sbjct: 235 FEELFPDQHVFDPPEEIFSL 254


>ref|XP_004507153.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like [Cicer arietinum]
          Length = 261

 Score =  323 bits (827), Expect = 1e-85
 Identities = 152/199 (76%), Positives = 173/199 (86%)
 Frame = +3

Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455
           YHDGRPRGPLWRGKK+IGKEAL+VI GLKRFKD+E+ L KFIKTHV RLLK+D+IAVL E
Sbjct: 61  YHDGRPRGPLWRGKKLIGKEALFVISGLKRFKDEEDTLHKFIKTHVLRLLKMDLIAVLTE 120

Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635
           LERQQEVSLA+ +F VMQKQ+WYKPD++LYKDLII+LAR+K+MD  ++LWE M+KE+LFP
Sbjct: 121 LERQQEVSLALMVFNVMQKQDWYKPDMFLYKDLIIALARAKRMDDVLQLWESMRKENLFP 180

Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815
           DSQTYTEVIRGFL +GSP DAMNIYEDMK SPDPPEELPFRI           RN+VKQD
Sbjct: 181 DSQTYTEVIRGFLSNGSPADAMNIYEDMKNSPDPPEELPFRILLKGLLPHPLLRNKVKQD 240

Query: 816 FEEIFPDRHIYDPPEEIFG 872
           FEEIFPD  IYDPP+EIFG
Sbjct: 241 FEEIFPDSSIYDPPQEIFG 259


>gb|ESW04024.1| hypothetical protein PHAVU_011G060800g [Phaseolus vulgaris]
          Length = 254

 Score =  322 bits (824), Expect = 3e-85
 Identities = 150/199 (75%), Positives = 174/199 (87%)
 Frame = +3

Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455
           YHDGRPRGPLW+GKK+IGKEAL+V++GLKRFKDD++KLQKFIK+HV RLLK+DMIAVL E
Sbjct: 54  YHDGRPRGPLWKGKKLIGKEALFVVLGLKRFKDDQDKLQKFIKSHVLRLLKMDMIAVLTE 113

Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635
           LERQ++VSLA+ MFKVMQKQ+WYKPD YLYKDLII+LARSKKM++   LWE M+KE+LFP
Sbjct: 114 LERQEQVSLALMMFKVMQKQDWYKPDTYLYKDLIIALARSKKMEEVSYLWESMRKENLFP 173

Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815
           DSQT+TEVIRGFL  GSP DAM++YEDMK SPDPP+ELPFRI           RN+VKQD
Sbjct: 174 DSQTFTEVIRGFLNYGSPADAMDVYEDMKNSPDPPDELPFRILLKGLLPHPLLRNKVKQD 233

Query: 816 FEEIFPDRHIYDPPEEIFG 872
           FEEIFPD  IYDPP+EIFG
Sbjct: 234 FEEIFPDSSIYDPPQEIFG 252

