BLASTX nr result
ID: Catharanthus23_contig00018526
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00018526 (1265 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004247885.1| PREDICTED: pentatricopeptide repeat-containi...   345   2e-92
ref|XP_002276778.1| PREDICTED: pentatricopeptide repeat-containi...   345   2e-92
ref|XP_006360926.1| PREDICTED: pentatricopeptide repeat-containi...   343   9e-92
ref|XP_002281924.1| PREDICTED: pentatricopeptide repeat-containi...   342   2e-91
ref|XP_002517255.1| conserved hypothetical protein [Ricinus comm...   337   5e-90
ref|XP_002875809.1| pentatricopeptide repeat-containing protein ...   333   7e-89
gb|EOY33905.1| Pentatricopeptide repeat superfamily protein [The...   333   9e-89
gb|EMJ06956.1| hypothetical protein PRUPE_ppa010266mg [Prunus pe...   333   1e-88
ref|XP_006404445.1| hypothetical protein EUTSA_v10010650mg [Eutr...   332   2e-88
ref|XP_004291520.1| PREDICTED: pentatricopeptide repeat-containi...   330   6e-88
ref|XP_002314145.2| pentatricopeptide repeat-containing family p...   330   8e-88
ref|NP_190271.1| pentatricopeptide repeat-containing protein [Ar...   330   8e-88
ref|XP_006291711.1| hypothetical protein CARUB_v10017876mg [Caps...   328   2e-87
ref|XP_003537970.1| PREDICTED: pentatricopeptide repeat-containi...   326   1e-86
ref|XP_006858620.1| hypothetical protein AMTR_s00066p00023060 [A...   326   1e-86
dbj|BAK03258.1| predicted protein [Hordeum vulgare subsp. vulgare]    325   2e-86
ref|XP_006591821.1| PREDICTED: uncharacterized protein LOC100306...   325   3e-86
ref|XP_004145279.1| PREDICTED: pentatricopeptide repeat-containi...
325 3e-86 ref|XP_004507153.1| PREDICTED: pentatricopeptide repeat-containi... 323 1e-85 gb|ESW04024.1| hypothetical protein PHAVU_011G060800g [Phaseolus... 322 3e-85 >ref|XP_004247885.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870-like [Solanum lycopersicum] Length = 256 Score = 345 bits (886), Expect = 2e-92 Identities = 175/254 (68%), Positives = 199/254 (78%), Gaps = 1/254 (0%) Frame = +3 Query: 117 ISRARINSTPIFHILPRLSYISHPFTRKQMSFSLLSPPPWY-NYARFGVLRVQAYHDGRP 293 ISR +I S +L LS P ++ S P W ++R V V+ YHDGRP Sbjct: 6 ISRLKIPSL----LLKHLSVSPLPSISSTLTSSSSCSPSWVPEFSRSSVRDVRWYHDGRP 61 Query: 294 RGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQE 473 RG LWRGKK+IGKEAL+VI+GL+RFKDDEEKL KF+KTHV RLLK+DMIAVLNELERQ+E Sbjct: 62 RGSLWRGKKLIGKEALFVILGLRRFKDDEEKLDKFVKTHVLRLLKMDMIAVLNELERQEE 121 Query: 474 VSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYT 653 VSLAVK+F V+QKQ WY+PDVYLYKDLII+LAR +KMD M+LWE M+KEDLFPD QT+T Sbjct: 122 VSLAVKVFWVIQKQAWYQPDVYLYKDLIIALARRRKMDDAMKLWESMRKEDLFPDCQTFT 181 Query: 654 EVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFP 833 EVIRGFLRDGSP DAMNI+EDMKKSP PPEELPFR+ RNRVKQDFEEIFP Sbjct: 182 EVIRGFLRDGSPADAMNIFEDMKKSPYPPEELPFRVLLKGLLPHPLLRNRVKQDFEEIFP 241 Query: 834 DRHIYDPPEEIFGL 875 DRHIYDPPEEIFGL Sbjct: 242 DRHIYDPPEEIFGL 255 >ref|XP_002276778.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870 [Vitis vinifera] gi|297745785|emb|CBI15841.3| unnamed protein product [Vitis vinifera] Length = 252 Score = 345 bits (886), Expect = 2e-92 Identities = 170/239 (71%), Positives = 193/239 (80%), Gaps = 9/239 (3%) Frame = +3 Query: 189 FTRKQMSFSLLSPPPWYNYARFGVLRVQA---------YHDGRPRGPLWRGKKMIGKEAL 341 F+ K + + SP +F VL V + YHDGRPRGPLWRGKK+IGKEAL Sbjct: 14 FSAKILQSLIKSPIKESTKFQFPVLEVASNKPLFGLKHYHDGRPRGPLWRGKKLIGKEAL 73 Query: 342 YVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEW 521 +VI+GLKRFKDDEEKL+KFIK+HV RLLK+DMIAVL ELERQ+EV+LAV++F+V QKQ+W Sbjct: 74 
FVILGLKRFKDDEEKLRKFIKSHVLRLLKMDMIAVLTELERQEEVTLAVEVFRVFQKQDW 133 Query: 522 YKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAM 701 YKPDVYLYKDLII+LA+ KKMD M+LWE M+KEDLFPD QTYTEVIRGFLR GSP DAM Sbjct: 134 YKPDVYLYKDLIIALAKCKKMDNAMQLWESMRKEDLFPDYQTYTEVIRGFLRHGSPADAM 193 Query: 702 NIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFGLH 878 NIYEDMKKSPDPPEELPFRI RNRVKQDFEEIFPDRH+YDPPEEIFG+H Sbjct: 194 NIYEDMKKSPDPPEELPFRILLKGLLPHPLLRNRVKQDFEEIFPDRHVYDPPEEIFGIH 252 >ref|XP_006360926.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870-like [Solanum tuberosum] Length = 256 Score = 343 bits (880), Expect = 9e-92 Identities = 174/254 (68%), Positives = 198/254 (77%), Gaps = 1/254 (0%) Frame = +3 Query: 117 ISRARINSTPIFHILPRLSYISHPFTRKQMSFSLLSPPPWY-NYARFGVLRVQAYHDGRP 293 ISR +I S +L LS P ++ S W +R V V+ YHDGRP Sbjct: 6 ISRLKIPSL----LLKHLSVSPLPSVSSTLTSSSSCSTSWVPELSRSSVRDVRWYHDGRP 61 Query: 294 RGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQE 473 RGPLWRGKK+IGKEAL+VI+GL+RF+DDEEKL KF+KTHV RLLK+DMIAVLNELERQ+E Sbjct: 62 RGPLWRGKKLIGKEALFVILGLRRFRDDEEKLDKFVKTHVLRLLKMDMIAVLNELERQEE 121 Query: 474 VSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYT 653 VSLAVK+F V+QKQ WY+PDVYLYKDLII+LAR +KMD M+LWE M+KEDLFPD QT+T Sbjct: 122 VSLAVKVFWVIQKQAWYQPDVYLYKDLIIALARRRKMDDAMKLWESMRKEDLFPDCQTFT 181 Query: 654 EVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFP 833 EVIRGFLRDGSP DAMNI+EDMKKSP PPEELPFR+ RNRVKQDFEEIFP Sbjct: 182 EVIRGFLRDGSPADAMNIFEDMKKSPYPPEELPFRVLLKGLLPHPLLRNRVKQDFEEIFP 241 Query: 834 DRHIYDPPEEIFGL 875 DRHIYDPPEEIFGL Sbjct: 242 DRHIYDPPEEIFGL 255 >ref|XP_002281924.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870 isoform 1 [Vitis vinifera] gi|359478900|ref|XP_003632183.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870 isoform 2 [Vitis vinifera] gi|297745987|emb|CBI16043.3| unnamed protein product [Vitis vinifera] Length = 252 Score = 342 bits (877), Expect = 2e-91 Identities = 161/207 
(77%), Positives = 183/207 (88%) Frame = +3 Query: 258 VLRVQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDM 437 +L + YHDGRPRGPLWRGKK+IGKEAL+VI+GLKRFKDDEE L+KFIK+HV RLLK+DM Sbjct: 46 LLGSKLYHDGRPRGPLWRGKKLIGKEALFVILGLKRFKDDEENLRKFIKSHVLRLLKMDM 105 Query: 438 IAVLNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMK 617 +AVL ELERQ+EVSLAV++F+V++KQ+WYKPDVYLYKDLII+LA+ KKMD M+LWE M+ Sbjct: 106 VAVLTELERQEEVSLAVEVFRVIRKQDWYKPDVYLYKDLIIALAKCKKMDDAMQLWESMR 165 Query: 618 KEDLFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXR 797 KEDLFPD QTYTEVIRGFLR GSP DAMNIYEDMKKSPDPPEELPFRI R Sbjct: 166 KEDLFPDYQTYTEVIRGFLRYGSPADAMNIYEDMKKSPDPPEELPFRILLKGLLPHPLLR 225 Query: 798 NRVKQDFEEIFPDRHIYDPPEEIFGLH 878 NRVKQDFEEIFPDRH+YDPPEEIFG+H Sbjct: 226 NRVKQDFEEIFPDRHVYDPPEEIFGVH 252 >ref|XP_002517255.1| conserved hypothetical protein [Ricinus communis] gi|223543626|gb|EEF45155.1| conserved hypothetical protein [Ricinus communis] Length = 258 Score = 337 bits (865), Expect = 5e-90 Identities = 157/213 (73%), Positives = 184/213 (86%) Frame = +3 Query: 237 YNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVF 416 +N++ ++ YHDGRPRGPLWRGKK+IGKEAL+VI+GLKRFKD+EEKL KFIKTHV Sbjct: 45 HNHSHNSFSGLRQYHDGRPRGPLWRGKKLIGKEALFVILGLKRFKDEEEKLDKFIKTHVL 104 Query: 417 RLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTM 596 RLLK+DMIAVL ELERQ+EVSLA K+F+++QKQ+WY PDVYLYKDLII+L RS KMDQ M Sbjct: 105 RLLKMDMIAVLTELERQEEVSLATKVFQIIQKQDWYNPDVYLYKDLIIALTRSGKMDQAM 164 Query: 597 ELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXX 776 +LWE M+ E+LFPDSQ YTE+IRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRI Sbjct: 165 KLWEAMRSENLFPDSQMYTELIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRILLKGL 224 Query: 777 XXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFGL 875 RNRVKQD+EE+FP++H+YDPPEEIFG+ Sbjct: 225 LPHPLLRNRVKQDYEELFPEKHVYDPPEEIFGV 257 >ref|XP_002875809.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] gi|297321647|gb|EFH52068.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 257 Score = 333 bits (855), Expect = 7e-89 Identities = 157/220 (71%), Positives = 186/220 (84%) Frame = +3 Query: 213 SLLSPPPWYNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQ 392 +LL P P + F V +HDGRPRGPLWRGKK+IGKEAL+VI+GLKR KDD+EKLQ Sbjct: 40 TLLHPIPPKPFTVF----VSRFHDGRPRGPLWRGKKLIGKEALFVILGLKRLKDDDEKLQ 95 Query: 393 KFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLAR 572 KFIKTHVFRLLKLDM+AV+ ELERQ+E +LA+KMF+V+QKQEWY+PDV++YKDLI+SLA+ Sbjct: 96 KFIKTHVFRLLKLDMLAVIGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAK 155 Query: 573 SKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELP 752 SK+MD+ M LWE MKKE+LFPDSQTYTEVIRGFLRDG P DAMN+YEDM KSPDPPEELP Sbjct: 156 SKRMDEAMALWEKMKKENLFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELP 215 Query: 753 FRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFG 872 FR+ RN+VK+DFEE+FP++H YDPPEEIFG Sbjct: 216 FRVLLKGLLPHPLLRNKVKKDFEELFPEKHAYDPPEEIFG 255 >gb|EOY33905.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao] Length = 287 Score = 333 bits (854), Expect = 9e-89 Identities = 154/202 (76%), Positives = 181/202 (89%) Frame = +3 Query: 267 VQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAV 446 ++ YHDGRPRGPLW+GKK+IGKEAL+VI+GLKRFKDD++KLQKFIKTHV RLLK+++IAV Sbjct: 84 LKQYHDGRPRGPLWKGKKLIGKEALFVILGLKRFKDDDDKLQKFIKTHVLRLLKMELIAV 143 Query: 447 LNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKED 626 L ELERQ+E SLAVK+F+V+QKQ+WYKPDVYLYKDLII+LAR KKMD+ M+LWE M+KE+ Sbjct: 144 LTELERQEETSLAVKVFQVIQKQDWYKPDVYLYKDLIIALARFKKMDEAMKLWEYMRKEE 203 Query: 627 LFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRV 806 LFPDSQTYTE+IRGFLRDGSP DAMNIYEDM KSPDPPEELPFRI RN+V Sbjct: 204 LFPDSQTYTEIIRGFLRDGSPADAMNIYEDMIKSPDPPEELPFRILLKGLLPHPLLRNKV 263 Query: 807 KQDFEEIFPDRHIYDPPEEIFG 872 K+DFEE+FP++H YDPPEEIFG Sbjct: 264 KKDFEELFPEKHAYDPPEEIFG 285 >gb|EMJ06956.1| hypothetical protein PRUPE_ppa010266mg [Prunus persica] 
Length = 256 Score = 333 bits (853), Expect = 1e-88 Identities = 158/200 (79%), Positives = 174/200 (87%) Frame = +3 Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455 YHDGRPRGPLWRGKK+IGKEALYVI GLKRFKDDEEKL KFIK HV RLLK+D+IAVL E Sbjct: 55 YHDGRPRGPLWRGKKLIGKEALYVISGLKRFKDDEEKLGKFIKNHVLRLLKMDLIAVLTE 114 Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635 LERQ+EV+LA+K+F V++KQ+WY PDVYLYKDLIISLARSKKMD M LW+ MKKEDLFP Sbjct: 115 LERQEEVNLAIKVFNVIRKQDWYNPDVYLYKDLIISLARSKKMDDVMLLWDGMKKEDLFP 174 Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815 DSQTYTEVIRGFL GSP DAMNIYEDMK SPDPPEELPFRI RNRVKQD Sbjct: 175 DSQTYTEVIRGFLSSGSPADAMNIYEDMKNSPDPPEELPFRILLKGLLPHPLLRNRVKQD 234 Query: 816 FEEIFPDRHIYDPPEEIFGL 875 FEE+FP++H+YDPPEEIFG+ Sbjct: 235 FEELFPEQHVYDPPEEIFGV 254 >ref|XP_006404445.1| hypothetical protein EUTSA_v10010650mg [Eutrema salsugineum] gi|557105564|gb|ESQ45898.1| hypothetical protein EUTSA_v10010650mg [Eutrema salsugineum] Length = 257 Score = 332 bits (851), Expect = 2e-88 Identities = 157/237 (66%), Positives = 191/237 (80%) Frame = +3 Query: 162 PRLSYISHPFTRKQMSFSLLSPPPWYNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKEAL 341 P + IS + K + +SP P+ +A +HDGRPRGPLWRGKK+IGKEAL Sbjct: 26 PTIHRISFSYLIKPKTLHPVSPKPFTVFAA-------QFHDGRPRGPLWRGKKLIGKEAL 78 Query: 342 YVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEW 521 +VI+GLKR K+D+EKL KFIKTHVFRLLKLDM+AV+ ELERQ+E +LA+KMF+V+QKQEW Sbjct: 79 FVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQEETALAIKMFEVIQKQEW 138 Query: 522 YKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAM 701 Y+PDV++YKDLI+SLA+SK+MD+ M LWE MKKE+LFPDSQTYTEVIRGFLRDG P DAM Sbjct: 139 YQPDVFMYKDLIVSLAKSKRMDEAMGLWEKMKKENLFPDSQTYTEVIRGFLRDGCPADAM 198 Query: 702 NIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFG 872 N+YEDM KSPDPPEELPFR+ RN+VK+DFEE+FP++H YDPPEEIFG Sbjct: 199 NVYEDMLKSPDPPEELPFRVLLKGLLPHPLLRNKVKKDFEELFPEKHAYDPPEEIFG 255 >ref|XP_004291520.1| PREDICTED: pentatricopeptide 
repeat-containing protein At3g46870-like [Fragaria vesca subsp. vesca] Length = 255 Score = 330 bits (847), Expect = 6e-88 Identities = 156/199 (78%), Positives = 175/199 (87%) Frame = +3 Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455 YHDGRPRGPLWRGKK+IGKEALYVI GLKRFKDDEE L KFIK+HV RLLKLDMIAVL E Sbjct: 55 YHDGRPRGPLWRGKKLIGKEALYVISGLKRFKDDEETLGKFIKSHVLRLLKLDMIAVLTE 114 Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635 LERQ+EV+LA+K+F V++KQ+WYKPDVYLYKDLIISLA+SKKMD M LW+ MKKEDLFP Sbjct: 115 LERQEEVNLAIKVFNVIRKQDWYKPDVYLYKDLIISLAKSKKMDDVMVLWDCMKKEDLFP 174 Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815 DSQT+TEVIRGFL GSP DAMNIYEDMK+SPDPPE+LPFRI RNRVKQD Sbjct: 175 DSQTFTEVIRGFLSTGSPADAMNIYEDMKQSPDPPEQLPFRILLKGLLPHPLLRNRVKQD 234 Query: 816 FEEIFPDRHIYDPPEEIFG 872 FEE+FP++H+YDPP+EIFG Sbjct: 235 FEELFPEQHVYDPPQEIFG 253 >ref|XP_002314145.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550331008|gb|EEE88100.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 257 Score = 330 bits (846), Expect = 8e-88 Identities = 167/256 (65%), Positives = 203/256 (79%), Gaps = 3/256 (1%) Frame = +3 Query: 117 ISRARI-NSTPIFHILPRLSYISHPFTRKQMSFSLLSPPPWYNYARFGV-LRVQAYHDGR 290 +SR++I N + + IL L+ + Q+ S S N F V ++ YHDGR Sbjct: 6 LSRSKIPNFSSVIVILQNLTTKQSIIDQTQLPLSKAS-----NIQSFLVPAGLRQYHDGR 60 Query: 291 PRGPLWRGKKMIGKEALYVIMGLKRFK-DDEEKLQKFIKTHVFRLLKLDMIAVLNELERQ 467 PRGPLWRGKK+IGKEAL+VI+GLKRFK DD+EKL +FIKTHVFRLLKLDMIAVL+ELERQ Sbjct: 61 PRGPLWRGKKLIGKEALFVILGLKRFKNDDDEKLDRFIKTHVFRLLKLDMIAVLSELERQ 120 Query: 468 QEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQT 647 +EVSLAVK+F+V+QKQ+WYKPDVYLYKDLI++L ++ KM++ M+LWE M+ EDLFPDSQ Sbjct: 121 EEVSLAVKIFRVIQKQDWYKPDVYLYKDLIMALLKTGKMEEAMKLWEDMRNEDLFPDSQM 180 Query: 648 YTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEI 827 YTE IRG+LRDGSP DAMNIYEDMKKSPDPPEELPFRI RNRVKQD+EE+ Sbjct: 181 
YTEAIRGYLRDGSPADAMNIYEDMKKSPDPPEELPFRILLKGLLPHPLLRNRVKQDYEEL 240 Query: 828 FPDRHIYDPPEEIFGL 875 FP++H+YDPPEEIFG+ Sbjct: 241 FPEKHVYDPPEEIFGI 256 >ref|NP_190271.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75266318|sp|Q9STF9.1|PP266_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g46870 gi|545719887|pdb|4LEU|A Chain A, Crystal Structure Of Tha8-like Protein From Arabidopsis Thaliana gi|5541672|emb|CAB51178.1| putative protein [Arabidopsis thaliana] gi|26450732|dbj|BAC42475.1| unknown protein [Arabidopsis thaliana] gi|28950815|gb|AAO63331.1| At3g46870 [Arabidopsis thaliana] gi|332644692|gb|AEE78213.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 257 Score = 330 bits (846), Expect = 8e-88 Identities = 155/220 (70%), Positives = 185/220 (84%) Frame = +3 Query: 213 SLLSPPPWYNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQ 392 +LL P P + F V +HDGRPRGPLWRGKK+IGKEAL+VI+GLKR K+D+EKL Sbjct: 40 TLLHPIPPKPFTVF----VSRFHDGRPRGPLWRGKKLIGKEALFVILGLKRLKEDDEKLD 95 Query: 393 KFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLAR 572 KFIKTHVFRLLKLDM+AV+ ELERQ+E +LA+KMF+V+QKQEWY+PDV++YKDLI+SLA+ Sbjct: 96 KFIKTHVFRLLKLDMLAVIGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAK 155 Query: 573 SKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELP 752 SK+MD+ M LWE MKKE+LFPDSQTYTEVIRGFLRDG P DAMN+YEDM KSPDPPEELP Sbjct: 156 SKRMDEAMALWEKMKKENLFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELP 215 Query: 753 FRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFG 872 FR+ RN+VK+DFEE+FP++H YDPPEEIFG Sbjct: 216 FRVLLKGLLPHPLLRNKVKKDFEELFPEKHAYDPPEEIFG 255 >ref|XP_006291711.1| hypothetical protein CARUB_v10017876mg [Capsella rubella] gi|565467664|ref|XP_006291712.1| hypothetical protein CARUB_v10017876mg [Capsella rubella] gi|482560418|gb|EOA24609.1| hypothetical protein CARUB_v10017876mg [Capsella rubella] gi|482560419|gb|EOA24610.1| hypothetical protein CARUB_v10017876mg [Capsella rubella] Length = 257 Score = 328 bits (842), Expect = 
2e-87 Identities = 150/202 (74%), Positives = 178/202 (88%) Frame = +3 Query: 267 VQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAV 446 V +HDGRPRGPLWRGKK+IGKEAL+VI+GLKR K+D+EKLQKFIKTHV RLLKLDM+AV Sbjct: 54 VSRFHDGRPRGPLWRGKKLIGKEALFVILGLKRLKEDDEKLQKFIKTHVLRLLKLDMLAV 113 Query: 447 LNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKED 626 + ELERQ+E +LA+KMF+V+QKQEWY+PDV++YKDLI+SLA+SK+MD+ M LWE MKKE+ Sbjct: 114 IGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMGLWEKMKKEN 173 Query: 627 LFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRV 806 LFPDSQTYTEVIRGFLRDG P DAMN+YEDM KSPDPPEELPFR+ RN+V Sbjct: 174 LFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELPFRVLLKGLLPHPLLRNKV 233 Query: 807 KQDFEEIFPDRHIYDPPEEIFG 872 K+DFEE+FP++H YDPPEEIFG Sbjct: 234 KKDFEELFPEKHAYDPPEEIFG 255 >ref|XP_003537970.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870-like [Glycine max] Length = 255 Score = 326 bits (836), Expect = 1e-86 Identities = 154/199 (77%), Positives = 172/199 (86%) Frame = +3 Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455 YHDGRPRGPLW+GKK+IGKEAL+VI G KRF DDE+KL KFIKTHV RLLK+DMIAVL E Sbjct: 55 YHDGRPRGPLWKGKKLIGKEALFVISGFKRFNDDEDKLHKFIKTHVLRLLKMDMIAVLTE 114 Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635 LERQ++VSLA+ MFKVMQKQ+WYKPD YLYKDLII+LAR+KKMD+ ++LWE M+KE+LFP Sbjct: 115 LERQEQVSLALMMFKVMQKQDWYKPDAYLYKDLIIALARAKKMDEVLQLWESMRKENLFP 174 Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815 DSQTYTEVIRGFL GSP DAMNIYEDMK SPDPPEELPFRI RN+VKQD Sbjct: 175 DSQTYTEVIRGFLNYGSPADAMNIYEDMKNSPDPPEELPFRILLKGLLPHPLLRNKVKQD 234 Query: 816 FEEIFPDRHIYDPPEEIFG 872 FEEIFPD IYDPP+EIFG Sbjct: 235 FEEIFPDSSIYDPPQEIFG 253 >ref|XP_006858620.1| hypothetical protein AMTR_s00066p00023060 [Amborella trichopoda] gi|548862731|gb|ERN20087.1| hypothetical protein AMTR_s00066p00023060 [Amborella trichopoda] Length = 265 Score = 326 bits (835), Expect = 1e-86 Identities = 149/203 (73%), 
Positives = 178/203 (87%) Frame = +3 Query: 267 VQAYHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAV 446 ++ YHDGRPRGPLWRGKK+IGKEAL++I GLKRFKDDE +L KF+K+HV RLLK+DM+AV Sbjct: 62 MRCYHDGRPRGPLWRGKKLIGKEALFIISGLKRFKDDEGQLDKFVKSHVSRLLKMDMVAV 121 Query: 447 LNELERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKED 626 L ELERQ+EV LA+K+F+V+QK+ WYKPDVYLYKDLII+LARSK+M+ M++WE M++ED Sbjct: 122 LCELERQEEVILALKIFRVIQKENWYKPDVYLYKDLIIALARSKRMEDAMQIWECMRRED 181 Query: 627 LFPDSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRV 806 LFPDSQTYTEVIRGFLR GSP DAMNIYEDMK SP+PPEELP+R+ RNR+ Sbjct: 182 LFPDSQTYTEVIRGFLRYGSPADAMNIYEDMKNSPEPPEELPYRVLLKGLLPHPLLRNRI 241 Query: 807 KQDFEEIFPDRHIYDPPEEIFGL 875 KQDFEE+FPDRH+YDPPEEIFGL Sbjct: 242 KQDFEEMFPDRHVYDPPEEIFGL 264 >dbj|BAK03258.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 251 Score = 325 bits (833), Expect = 2e-86 Identities = 156/241 (64%), Positives = 190/241 (78%), Gaps = 2/241 (0%) Frame = +3 Query: 162 PRLSYIS--HPFTRKQMSFSLLSPPPWYNYARFGVLRVQAYHDGRPRGPLWRGKKMIGKE 335 P++ +S H R ++ LL P P+ + R A+HDGRPRGPLWR KK+IGKE Sbjct: 18 PKIPTLSPPHGLFRGEVPVRLLPPQPFGEWRR-------AFHDGRPRGPLWRSKKLIGKE 70 Query: 336 ALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNELERQQEVSLAVKMFKVMQKQ 515 AL+ I GLKRFK DEEKL++FIK HV RLLK D +AVL ELERQ+EV L+VKMF+++QK+ Sbjct: 71 ALFAIQGLKRFKGDEEKLREFIKRHVARLLKADKLAVLGELERQEEVDLSVKMFRIIQKE 130 Query: 516 EWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFPDSQTYTEVIRGFLRDGSPGD 695 +WYKPDVY+YKDLIISLA+ KKMD+ M++W MK+E+LFPDSQTY EVIRGFLR GSP D Sbjct: 131 DWYKPDVYMYKDLIISLAKCKKMDEAMDIWGNMKEENLFPDSQTYAEVIRGFLRYGSPSD 190 Query: 696 AMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQDFEEIFPDRHIYDPPEEIFGL 875 AMNIYEDMK+SPDPPEELPFR+ RNRVKQDFEE+FP+RHIYDPP++IFG+ Sbjct: 191 AMNIYEDMKRSPDPPEELPFRVLLKGLLPHPLLRNRVKQDFEELFPERHIYDPPDDIFGM 250 Query: 876 H 878 H Sbjct: 251 H 251 >ref|XP_006591821.1| PREDICTED: uncharacterized protein LOC100306428 isoform X1 [Glycine max] Length = 255 Score = 325 bits (832), Expect = 3e-86 Identities = 153/199 
(76%), Positives = 172/199 (86%) Frame = +3 Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455 YHDGRPRGPLW+GKK+IGKEAL+VI G KRF DDE+KL KFIKTHV RLLK+DMIAVL E Sbjct: 55 YHDGRPRGPLWKGKKLIGKEALFVISGFKRFNDDEDKLHKFIKTHVLRLLKMDMIAVLTE 114 Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635 LERQ++VSLA+ MFKVMQKQ+WYKPD YLYKDLII+LAR+KKMD+ ++LWE M++E+LFP Sbjct: 115 LERQEQVSLALMMFKVMQKQDWYKPDAYLYKDLIIALARAKKMDEVLQLWESMREENLFP 174 Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815 DSQTYTEVIRGFL GSP DAMNIYEDMK SPDPPEELPFRI RN+VKQD Sbjct: 175 DSQTYTEVIRGFLNYGSPADAMNIYEDMKNSPDPPEELPFRILLKGLLPHPLLRNKVKQD 234 Query: 816 FEEIFPDRHIYDPPEEIFG 872 FEEIFPD IYDPP+EIFG Sbjct: 235 FEEIFPDSSIYDPPQEIFG 253 >ref|XP_004145279.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870-like [Cucumis sativus] gi|449475090|ref|XP_004154371.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870-like [Cucumis sativus] gi|449508514|ref|XP_004163333.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870-like [Cucumis sativus] Length = 255 Score = 325 bits (832), Expect = 3e-86 Identities = 150/200 (75%), Positives = 177/200 (88%) Frame = +3 Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455 YHDGRPRGPLWR +K IGKEAL+VI GLKRFK+DEEK +KF+K+HV RLLKLDM+AVL E Sbjct: 55 YHDGRPRGPLWRSRKAIGKEALFVIQGLKRFKEDEEKFEKFMKSHVSRLLKLDMVAVLGE 114 Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635 LERQ+EV+LAVK+F++++KQ+WYKPDVY+YKDLII+LARSKKMD M+LWE M++E+LFP Sbjct: 115 LERQEEVALAVKIFRLIRKQDWYKPDVYIYKDLIIALARSKKMDDAMKLWESMREENLFP 174 Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815 DSQTYTEVIRGFLR GSP DAMN+YEDMKKSPDPP+ELPFRI RNRVKQD Sbjct: 175 DSQTYTEVIRGFLRYGSPSDAMNVYEDMKKSPDPPDELPFRILLKGLLPHPLLRNRVKQD 234 Query: 816 FEEIFPDRHIYDPPEEIFGL 875 FEE+FPD+H++DPPEEIF L Sbjct: 235 FEELFPDQHVFDPPEEIFSL 254 >ref|XP_004507153.1| PREDICTED: pentatricopeptide 
repeat-containing protein At3g46870-like [Cicer arietinum] Length = 261 Score = 323 bits (827), Expect = 1e-85 Identities = 152/199 (76%), Positives = 173/199 (86%) Frame = +3 Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455 YHDGRPRGPLWRGKK+IGKEAL+VI GLKRFKD+E+ L KFIKTHV RLLK+D+IAVL E Sbjct: 61 YHDGRPRGPLWRGKKLIGKEALFVISGLKRFKDEEDTLHKFIKTHVLRLLKMDLIAVLTE 120 Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635 LERQQEVSLA+ +F VMQKQ+WYKPD++LYKDLII+LAR+K+MD ++LWE M+KE+LFP Sbjct: 121 LERQQEVSLALMVFNVMQKQDWYKPDMFLYKDLIIALARAKRMDDVLQLWESMRKENLFP 180 Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815 DSQTYTEVIRGFL +GSP DAMNIYEDMK SPDPPEELPFRI RN+VKQD Sbjct: 181 DSQTYTEVIRGFLSNGSPADAMNIYEDMKNSPDPPEELPFRILLKGLLPHPLLRNKVKQD 240 Query: 816 FEEIFPDRHIYDPPEEIFG 872 FEEIFPD IYDPP+EIFG Sbjct: 241 FEEIFPDSSIYDPPQEIFG 259 >gb|ESW04024.1| hypothetical protein PHAVU_011G060800g [Phaseolus vulgaris] Length = 254 Score = 322 bits (824), Expect = 3e-85 Identities = 150/199 (75%), Positives = 174/199 (87%) Frame = +3 Query: 276 YHDGRPRGPLWRGKKMIGKEALYVIMGLKRFKDDEEKLQKFIKTHVFRLLKLDMIAVLNE 455 YHDGRPRGPLW+GKK+IGKEAL+V++GLKRFKDD++KLQKFIK+HV RLLK+DMIAVL E Sbjct: 54 YHDGRPRGPLWKGKKLIGKEALFVVLGLKRFKDDQDKLQKFIKSHVLRLLKMDMIAVLTE 113 Query: 456 LERQQEVSLAVKMFKVMQKQEWYKPDVYLYKDLIISLARSKKMDQTMELWEMMKKEDLFP 635 LERQ++VSLA+ MFKVMQKQ+WYKPD YLYKDLII+LARSKKM++ LWE M+KE+LFP Sbjct: 114 LERQEQVSLALMMFKVMQKQDWYKPDTYLYKDLIIALARSKKMEEVSYLWESMRKENLFP 173 Query: 636 DSQTYTEVIRGFLRDGSPGDAMNIYEDMKKSPDPPEELPFRIXXXXXXXXXXXRNRVKQD 815 DSQT+TEVIRGFL GSP DAM++YEDMK SPDPP+ELPFRI RN+VKQD Sbjct: 174 DSQTFTEVIRGFLNYGSPADAMDVYEDMKNSPDPPDELPFRILLKGLLPHPLLRNKVKQD 233 Query: 816 FEEIFPDRHIYDPPEEIFG 872 FEEIFPD IYDPP+EIFG Sbjct: 234 FEEIFPDSSIYDPPQEIFG 252