BLASTX nr result

ID: Mentha24_contig00017401 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00017401
         (1086 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU42962.1| hypothetical protein MIMGU_mgv1a021045mg [Mimulus...   583   e-164
ref|XP_006345374.1| PREDICTED: pentatricopeptide repeat-containi...   516   e-144
emb|CBI17752.3| unnamed protein product [Vitis vinifera]              513   e-143
ref|XP_002304774.1| pentatricopeptide repeat-containing family p...   456   e-126
ref|XP_007220734.1| hypothetical protein PRUPE_ppa023145mg [Prun...   450   e-124
ref|NP_193806.1| pentatricopeptide repeat-containing protein [Ar...   450   e-124
ref|XP_006413862.1| hypothetical protein EUTSA_v10024515mg [Eutr...   449   e-124
ref|XP_002869928.1| pentatricopeptide repeat-containing protein ...   445   e-122
ref|XP_004140286.1| PREDICTED: pentatricopeptide repeat-containi...   444   e-122
gb|EXC10461.1| hypothetical protein L484_008628 [Morus notabilis]     441   e-121
ref|XP_006283187.1| hypothetical protein CARUB_v10004218mg, part...   441   e-121
ref|XP_003598903.1| Pentatricopeptide repeat-containing protein ...   439   e-121
ref|XP_004229293.1| PREDICTED: pentatricopeptide repeat-containi...   434   e-119
ref|XP_006482966.1| PREDICTED: pentatricopeptide repeat-containi...   430   e-118
ref|XP_004494981.1| PREDICTED: pentatricopeptide repeat-containi...   430   e-118
ref|XP_007008770.1| Pentatricopeptide repeat (PPR-like) superfam...   429   e-118
ref|XP_006605814.1| PREDICTED: pentatricopeptide repeat-containi...   427   e-117
ref|XP_006438906.1| hypothetical protein CICLE_v10030824mg [Citr...   427   e-117
ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containi...   422   e-116
ref|XP_004515007.1| PREDICTED: pentatricopeptide repeat-containi...   421   e-115

>gb|EYU42962.1| hypothetical protein MIMGU_mgv1a021045mg [Mimulus guttatus]
          Length = 726

 Score =  583 bits (1502), Expect = e-164
 Identities = 281/360 (78%), Positives = 319/360 (88%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFFIS 185
            +RPTVRGGLFSNRQT+NP+ F R +AA  EPFDL+KWDP+DE+N K PY KDPSE+FF  
Sbjct: 29   SRPTVRGGLFSNRQTVNPENFRRRTAAH-EPFDLQKWDPDDEANRKPPYGKDPSEKFFSL 87

Query: 186  AKNLSPIARYIVDAFRTHKNWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFFNW 365
            AKNLSPIARYIVDAFR HK W+ +LV ELNRLRRVTP LV EVLKFP+VDPR+SSKFF+W
Sbjct: 88   AKNLSPIARYIVDAFRKHKQWSPQLVQELNRLRRVTPTLVTEVLKFPDVDPRVSSKFFHW 147

Query: 366  AGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAK 545
            AGKQKGY+HDFACYNA+AYFLNR+NHFR+ADQ+PELMHMQGKPP+EKQFEILIRMHAD+ 
Sbjct: 148  AGKQKGYKHDFACYNAYAYFLNRSNHFREADQLPELMHMQGKPPTEKQFEILIRMHADSN 207

Query: 546  RGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYM 725
            RGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKT+HLDLAMSVY DFKE+GL EEN+TYM
Sbjct: 208  RGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTNHLDLAMSVYRDFKEEGLSEENVTYM 267

Query: 726  ILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKD 905
            ILIKGLCKAGR+DE+F L+D+MRKNL KPDVFAYTAMVKVL++EGNL+GCL VW+EM KD
Sbjct: 268  ILIKGLCKAGRLDEMFDLVDRMRKNLCKPDVFAYTAMVKVLVSEGNLNGCLTVWEEMKKD 327

Query: 906  RIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRVGSA 1085
             ++PD MAY+TL+MALC+G  VD              LIDRA+YGSLIEAYVVDG+VGSA
Sbjct: 328  GVEPDSMAYSTLIMALCEGKFVDKGYELFKEMKGRNCLIDRAIYGSLIEAYVVDGKVGSA 387



 Score = 73.2 bits (178), Expect = 2e-10
 Identities = 48/201 (23%), Positives = 91/201 (45%)
 Frame = +3

Query: 372 KQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRG 551
           K+ G +     YN     L + NH   A  V      +G       + ILI+    A R 
Sbjct: 220 KKFGVKPRVFLYNRIMDALVKTNHLDLAMSVYRDFKEEGLSEENVTYMILIKGLCKAGRL 279

Query: 552 LRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMIL 731
             +  + ++M+K   KP VF Y  ++  LV   +L+  ++V+ + K+DG+  ++M Y  L
Sbjct: 280 DEMFDLVDRMRKNLCKPDVFAYTAMVKVLVSEGNLNGCLTVWEEMKKDGVEPDSMAYSTL 339

Query: 732 IKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRI 911
           I  LC+   +D+ + L  +M+      D   Y ++++  + +G +     +  +++    
Sbjct: 340 IMALCEGKFVDKGYELFKEMKGRNCLIDRAIYGSLIEAYVVDGKVGSACDLLKDLINSGY 399

Query: 912 DPDVMAYTTLVMALCKGNRVD 974
             D+  Y +L+  LC    VD
Sbjct: 400 RADLAIYNSLIKGLCNSKLVD 420


>ref|XP_006345374.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Solanum tuberosum]
          Length = 720

 Score =  516 bits (1329), Expect = e-144
 Identities = 253/362 (69%), Positives = 295/362 (81%), Gaps = 2/362 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSR--PSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFF 179
            +RPTV+GGLFSNRQTINP + ++  PS+ +   F L+KWDP+  S  +    +DPS+ FF
Sbjct: 22   HRPTVQGGLFSNRQTINPNRTTKNSPSSVTQGDFQLQKWDPDGVSGQQS---RDPSQEFF 78

Query: 180  ISAKNLSPIARYIVDAFRTHKNWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFF 359
              A+ LSPIARYIVD+FR H NW A L+A+LN LRRVTPKLV EVLK PN+DP++SSKFF
Sbjct: 79   SLAQRLSPIARYIVDSFRKHGNWGAPLLADLNSLRRVTPKLVTEVLKHPNLDPKISSKFF 138

Query: 360  NWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHAD 539
             WAGKQKGYRHDF+CYNAFAY LNRAN FR ADQVPELMHMQGKPPSEKQFEILIRMH D
Sbjct: 139  YWAGKQKGYRHDFSCYNAFAYGLNRANQFRTADQVPELMHMQGKPPSEKQFEILIRMHGD 198

Query: 540  AKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMT 719
            A RGLRV+YVYEKMKKFGVKPRVFLYNRIMDALVKT+HLD+AMSVY DFK+DGLVEE+MT
Sbjct: 199  ANRGLRVYYVYEKMKKFGVKPRVFLYNRIMDALVKTNHLDMAMSVYDDFKKDGLVEESMT 258

Query: 720  YMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEML 899
            +MILIKGLCK GRMDEVF LL +MR+N  KPDVFAYTAMVK+L+AE NLDGC  VW EM 
Sbjct: 259  FMILIKGLCKLGRMDEVFELLGRMRENRCKPDVFAYTAMVKILVAERNLDGCSKVWKEMQ 318

Query: 900  KDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRVG 1079
            +D ++PDV+AY+T +  LCK N+VD              LIDR +YGSLIE++V +G+VG
Sbjct: 319  QDAVEPDVIAYSTFIAGLCKNNQVDKGYELFKEMKQKNILIDRGIYGSLIESFVANGKVG 378

Query: 1080 SA 1085
             A
Sbjct: 379  LA 380



 Score = 68.9 bits (167), Expect = 3e-09
 Identities = 47/201 (23%), Positives = 88/201 (43%)
 Frame = +3

Query: 372 KQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRG 551
           K+ G +     YN     L + NH   A  V +     G       F ILI+      R 
Sbjct: 213 KKFGVKPRVFLYNRIMDALVKTNHLDMAMSVYDDFKKDGLVEESMTFMILIKGLCKLGRM 272

Query: 552 LRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMIL 731
             V  +  +M++   KP VF Y  ++  LV   +LD    V+ + ++D +  + + Y   
Sbjct: 273 DEVFELLGRMRENRCKPDVFAYTAMVKILVAERNLDGCSKVWKEMQQDAVEPDVIAYSTF 332

Query: 732 IKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRI 911
           I GLCK  ++D+ + L  +M++     D   Y ++++  +A G +     +  ++++   
Sbjct: 333 IAGLCKNNQVDKGYELFKEMKQKNILIDRGIYGSLIESFVANGKVGLACDLLKDLIESGY 392

Query: 912 DPDVMAYTTLVMALCKGNRVD 974
             D+  Y +++  LC   R D
Sbjct: 393 RADLAIYNSIIEGLCNAKRTD 413


>emb|CBI17752.3| unnamed protein product [Vitis vinifera]
          Length = 729

 Score =  513 bits (1320), Expect = e-143
 Identities = 257/360 (71%), Positives = 291/360 (80%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFFIS 185
            NRPTV GGLFSNR T+NPK  +  +  +   F+L+ WDP+    L +P  K P ERFF  
Sbjct: 25   NRPTVHGGLFSNRTTLNPKPPTLQNPTTH--FNLQNWDPDSPKALAIPPSKTPCERFFDI 82

Query: 186  AKNLSPIARYIVDAFRTHKNWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFFNW 365
            AKNLSPIARYI D+FR H+NW   +VA+LN+LRRVTP LVAEVLK    DP + SKFF+W
Sbjct: 83   AKNLSPIARYICDSFRKHRNWGPPVVADLNKLRRVTPVLVAEVLKV-QTDPVICSKFFHW 141

Query: 366  AGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAK 545
            AGKQKGY+H+FA YNAFAY LNR+N FR ADQVPELM+MQGKPPSEKQFEILIRMH DA 
Sbjct: 142  AGKQKGYKHNFASYNAFAYCLNRSNQFRAADQVPELMNMQGKPPSEKQFEILIRMHIDAN 201

Query: 546  RGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYM 725
            RGLRV+YVYEKMKKFG+KPRVFLYNRIMD LVKT HLDLAMSVY DFKEDGLVEE++TYM
Sbjct: 202  RGLRVYYVYEKMKKFGIKPRVFLYNRIMDGLVKTGHLDLAMSVYEDFKEDGLVEESVTYM 261

Query: 726  ILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKD 905
            IL+KGLCKAGR+DEV  LLD+MR NL KPDVFAYTAMVKVL+AEGNLDGCL VW+EM KD
Sbjct: 262  ILVKGLCKAGRIDEVLELLDRMRGNLCKPDVFAYTAMVKVLVAEGNLDGCLRVWEEMRKD 321

Query: 906  RIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRVGSA 1085
            +++PDVMAYTTLV ALC GNRV               LIDRA+YGSLIE +VV+ RVGSA
Sbjct: 322  KVEPDVMAYTTLVAALCNGNRVGEGFELFKEMKQKKYLIDRAIYGSLIEGFVVNERVGSA 381


>ref|XP_002304774.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222842206|gb|EEE79753.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 728

 Score =  456 bits (1173), Expect = e-126
 Identities = 230/368 (62%), Positives = 280/368 (76%), Gaps = 8/368 (2%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSE----- 170
            NRP VRGGLF+NRQT+ P+    P     +PFDL KWDP+      LP+   PS+     
Sbjct: 28   NRPVVRGGLFTNRQTVKPQPPKNPITPF-KPFDLHKWDPQQN----LPHQPQPSKPQSPR 82

Query: 171  --RFFISAKNLSPIARYIVDAFRTHKN-WNAELVAELNRLRRVTPKLVAEVLKFPNVDPR 341
                   ++ LSPIAR+I+DAFR ++N W  E+V EL +LRRVTP LVAEVLK  N +P+
Sbjct: 83   SRHSLALSQRLSPIARFILDAFRKNRNQWGPEVVTELCKLRRVTPDLVAEVLKVEN-NPQ 141

Query: 342  LSSKFFNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEIL 521
            L++KFF+WAGKQKG++H FA YNAFAY LNR+N FR ADQ+PELM  QGKPP+EKQFEIL
Sbjct: 142  LATKFFHWAGKQKGFKHTFASYNAFAYNLNRSNFFRAADQLPELMEAQGKPPTEKQFEIL 201

Query: 522  IRMHADAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGL 701
            IRMH+DA RGLRV+YVY+KM KFGVKPRVFLYNRIMD+L+KT HLDLA+SVY DF+ DGL
Sbjct: 202  IRMHSDANRGLRVYYVYQKMVKFGVKPRVFLYNRIMDSLIKTGHLDLALSVYEDFRRDGL 261

Query: 702  VEENMTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLM 881
            VEE++TYMILIKGLCKAGR++E+  +L +MR+NL KPDVFAYTAMV+ L  EGNLD CL 
Sbjct: 262  VEESVTYMILIKGLCKAGRIEEMMEVLGRMRENLCKPDVFAYTAMVRALAGEGNLDACLR 321

Query: 882  VWDEMLKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYV 1061
            VW+EM +D ++PDVMAY TLV ALCKG RVD              LIDR +YG L+EA+V
Sbjct: 322  VWEEMKRDGVEPDVMAYVTLVTALCKGGRVDKGYEVFKEMKGRRILIDRGIYGILVEAFV 381

Query: 1062 VDGRVGSA 1085
             DG++G A
Sbjct: 382  ADGKIGLA 389



 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 55/195 (28%), Positives = 89/195 (45%), Gaps = 5/195 (2%)
 Frame = +3

Query: 405 YNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRVHYVYEKMK 584
           YN     L +  H   A  V E     G       + ILI+    A R   +  V  +M+
Sbjct: 233 YNRIMDSLIKTGHLDLALSVYEDFRRDGLVEESVTYMILIKGLCKAGRIEEMMEVLGRMR 292

Query: 585 KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKGLCKAGRMD 764
           +   KP VF Y  ++ AL    +LD  + V+ + K DG+  + M Y+ L+  LCK GR+D
Sbjct: 293 ENLCKPDVFAYTAMVRALAGEGNLDACLRVWEEMKRDGVEPDVMAYVTLVTALCKGGRVD 352

Query: 765 EVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLD-GCLMVWDEMLKDRID----PDVMA 929
           + + +  +M+      D   Y  +V+  +A+G +   C     ++LKD +D     D+  
Sbjct: 353 KGYEVFKEMKGRRILIDRGIYGILVEAFVADGKIGLAC-----DLLKDLVDSGYRADLRI 407

Query: 930 YTTLVMALCKGNRVD 974
           Y +L+   C   RVD
Sbjct: 408 YNSLIEGFCNVKRVD 422



 Score = 57.8 bits (138), Expect = 8e-06
 Identities = 46/198 (23%), Positives = 84/198 (42%)
 Frame = +3

Query: 381 GYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRV 560
           GYR D   YN+             A ++ ++   +G     K    L+  +A+ K+    
Sbjct: 400 GYRADLRIYNSLIEGFCNVKRVDKAHKLFQVTVQEGLERDFKTVNPLLMSYAEMKKMDDF 459

Query: 561 HYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKG 740
             + ++M+K G      L       + K +   +A+ V+ D K  G     + Y IL++ 
Sbjct: 460 CKLLKQMEKLGFSVFDDLSKFFSYVVGKPERTMMALEVFEDLKVKGYSSVPI-YNILMEA 518

Query: 741 LCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRIDPD 920
           L   G M     L  +M K+L+KPD   Y+  +   + +GN+    +  +++++    P 
Sbjct: 519 LLTIGEMKRALSLFGEM-KDLNKPDSTTYSIAIICFVEDGNIQEACVSHNKIVEMFCVPS 577

Query: 921 VMAYTTLVMALCKGNRVD 974
           V AY +L   LC    +D
Sbjct: 578 VAAYCSLAKGLCDNGEID 595


>ref|XP_007220734.1| hypothetical protein PRUPE_ppa023145mg [Prunus persica]
            gi|462417196|gb|EMJ21933.1| hypothetical protein
            PRUPE_ppa023145mg [Prunus persica]
          Length = 721

 Score =  450 bits (1157), Expect = e-124
 Identities = 231/361 (63%), Positives = 279/361 (77%), Gaps = 1/361 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFFIS 185
            NRP VRGGLFSNR ++  +++   +A   +PF+L KWDP    +       +P++   +S
Sbjct: 25   NRPRVRGGLFSNRVSLPNRRYPI-AAPQPQPFELSKWDPHLPQSSPSTSSSNPADTTLLS 83

Query: 186  AKNLSPIARYIVDAFRTHKN-WNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFFN 362
               LSPIAR+I+DAFR ++N W   +V+EL +LRRVTP LVAEVLK  N DP  +SKFF+
Sbjct: 84   F--LSPIARFILDAFRKNQNHWGPPVVSELRKLRRVTPDLVAEVLKVQN-DPVSASKFFH 140

Query: 363  WAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADA 542
            WAGKQKG++H +A YNA AY LNR+N FR ADQVPELM  QGKPPSEKQFEILIRMH+DA
Sbjct: 141  WAGKQKGFKHTYASYNALAYCLNRSNRFRSADQVPELMDSQGKPPSEKQFEILIRMHSDA 200

Query: 543  KRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTY 722
             RGLRV+YVYEKMKKFGVKPRVFLYNRIMDALVK+ +LDLA+SVY DF+ DGLVEE++T+
Sbjct: 201  NRGLRVYYVYEKMKKFGVKPRVFLYNRIMDALVKSGYLDLALSVYEDFRGDGLVEESVTF 260

Query: 723  MILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLK 902
            MILIKGLCK GRMDE+ +LL++MR NL KPDVFAYTAMVKVLI+EGNLDGCL VW+EM +
Sbjct: 261  MILIKGLCKMGRMDEMLQLLERMRVNLCKPDVFAYTAMVKVLISEGNLDGCLRVWEEMKR 320

Query: 903  DRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRVGS 1082
            DR+  DVMAY TLV  LCKG RV+              LIDRA+YG LIE +V D +VG+
Sbjct: 321  DRVGADVMAYATLVTGLCKGGRVEKGYKLFREMKVKGFLIDRAIYGVLIEGFVADRKVGA 380

Query: 1083 A 1085
            A
Sbjct: 381  A 381


>ref|NP_193806.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75211707|sp|Q9SVH3.1|PP328_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g20740 gi|5262214|emb|CAB45840.1| putative protein
            [Arabidopsis thaliana] gi|7268870|emb|CAB79074.1|
            putative protein [Arabidopsis thaliana]
            gi|332658957|gb|AEE84357.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 727

 Score =  450 bits (1157), Expect = e-124
 Identities = 234/366 (63%), Positives = 277/366 (75%), Gaps = 6/366 (1%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRP---SAASDEPFDLRKWDPEDESNLKLPYVKDPSERF 176
            NRPTV GGLFSNRQ+I P+   +P   S A   PFDLRKWDPE      LP    PS   
Sbjct: 28   NRPTVYGGLFSNRQSI-PRVSPQPQSNSLAHRTPFDLRKWDPETH----LPPPSPPSHST 82

Query: 177  FISA--KNLSPIARYIVDAFRTHKN-WNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLS 347
             ISA  + LSPIAR+++DAFR ++N W   +V+ELN+LRRVTP +VAEVLK  N D  ++
Sbjct: 83   VISAASERLSPIARFVLDAFRKNRNHWGPSVVSELNKLRRVTPSIVAEVLKLGN-DAAVA 141

Query: 348  SKFFNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIR 527
            +KFF+WAGKQKGY+HDFA YNAFAY LNR  HFR ADQ+PELM  QG+PPSEKQFEILIR
Sbjct: 142  AKFFHWAGKQKGYKHDFAAYNAFAYCLNRNGHFRAADQLPELMDSQGRPPSEKQFEILIR 201

Query: 528  MHADAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVE 707
            MHAD +RGLRV+YVYEKMKKFG KPRVFLYNRIMDALVK  + DLA++VY DFKEDGLVE
Sbjct: 202  MHADNRRGLRVYYVYEKMKKFGFKPRVFLYNRIMDALVKNGYFDLALAVYEDFKEDGLVE 261

Query: 708  ENMTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVW 887
            E+ T+MIL+KGLCKAGR++E+  +L +MR+NL KPDVFAYTAM+K L++EGNLD  L VW
Sbjct: 262  ESTTFMILVKGLCKAGRIEEMLEILQRMRENLCKPDVFAYTAMIKTLVSEGNLDASLRVW 321

Query: 888  DEMLKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVD 1067
            DEM +D I PDVMAY TLV+ LCK  RV+              LIDR +Y  LIE +V D
Sbjct: 322  DEMRRDEIKPDVMAYGTLVVGLCKDGRVERGYELFMEMKGKQILIDREIYRVLIEGFVAD 381

Query: 1068 GRVGSA 1085
            G+V SA
Sbjct: 382  GKVRSA 387



 Score = 81.3 bits (199), Expect = 7e-13
 Identities = 56/235 (23%), Positives = 103/235 (43%)
 Frame = +3

Query: 372  KQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRG 551
            K+ G++     YN     L +  +F  A  V E     G       F IL++    A R 
Sbjct: 220  KKFGFKPRVFLYNRIMDALVKNGYFDLALAVYEDFKEDGLVEESTTFMILVKGLCKAGRI 279

Query: 552  LRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMIL 731
              +  + ++M++   KP VF Y  ++  LV   +LD ++ V+ + + D +  + M Y  L
Sbjct: 280  EEMLEILQRMRENLCKPDVFAYTAMIKTLVSEGNLDASLRVWDEMRRDEIKPDVMAYGTL 339

Query: 732  IKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRI 911
            + GLCK GR++  + L  +M+      D   Y  +++  +A+G +     +W++++    
Sbjct: 340  VVGLCKDGRVERGYELFMEMKGKQILIDREIYRVLIEGFVADGKVRSACNLWEDLVDSGY 399

Query: 912  DPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRV 1076
              D+  Y  ++  LC  N+VD                D      ++ AYVV  R+
Sbjct: 400  IADIGIYNAVIKGLCSVNQVDKAYKLFQVAIEEELEPDFETLSPIMVAYVVMNRL 454


>ref|XP_006413862.1| hypothetical protein EUTSA_v10024515mg [Eutrema salsugineum]
            gi|557115032|gb|ESQ55315.1| hypothetical protein
            EUTSA_v10024515mg [Eutrema salsugineum]
          Length = 735

 Score =  449 bits (1155), Expect = e-124
 Identities = 225/363 (61%), Positives = 273/363 (75%), Gaps = 3/363 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDE--PFDLRKWDPEDESNLKLPYVKDPSERFF 179
            NRP V GGLFSNRQ ++      PS A  +  PFDLRKWDPE     +      PS    
Sbjct: 28   NRPVVHGGLFSNRQYLSRDPPQSPSNAVADRIPFDLRKWDPESRLPSERASSSSPSTSIS 87

Query: 180  ISAKNLSPIARYIVDAFRTHKN-WNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKF 356
             +++ LSPIAR+++DAFR ++N W   +V+ELN+LRRVTP +VAEVLK  N D  +S+KF
Sbjct: 88   AASERLSPIARFVLDAFRKNRNRWGPSVVSELNKLRRVTPSIVAEVLKVGN-DAAVSAKF 146

Query: 357  FNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHA 536
            F+WAGKQKGY+HDFA YNAFAY LNR  HFR ADQ+PELM  QG+PPSEKQFEILIRMH+
Sbjct: 147  FHWAGKQKGYKHDFAAYNAFAYCLNRTGHFRAADQLPELMDSQGRPPSEKQFEILIRMHS 206

Query: 537  DAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENM 716
            D KRGLRV+YVYEKMKKFG KPRVFLYNRIMDAL+KT + DLA++VY DFKEDGLVEE+ 
Sbjct: 207  DNKRGLRVYYVYEKMKKFGFKPRVFLYNRIMDALMKTGYFDLALAVYEDFKEDGLVEEST 266

Query: 717  TYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEM 896
            T+MIL+KGLCK+GRM+E+  +L +MR+NL +PDVFAYTAM+K L++EGN+D  L VWDEM
Sbjct: 267  TFMILVKGLCKSGRMEEMLEILQRMRENLCRPDVFAYTAMIKTLVSEGNMDASLRVWDEM 326

Query: 897  LKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRV 1076
             +D + PDVMAY TLVM LCK  RV+              LIDR +Y  LIE +V DG+V
Sbjct: 327  KRDEVKPDVMAYGTLVMGLCKDGRVEKGYELFMEMKEKQILIDRDIYRVLIEGFVADGKV 386

Query: 1077 GSA 1085
             SA
Sbjct: 387  RSA 389



 Score = 77.0 bits (188), Expect = 1e-11
 Identities = 53/235 (22%), Positives = 103/235 (43%)
 Frame = +3

Query: 372  KQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRG 551
            K+ G++     YN     L +  +F  A  V E     G       F IL++    + R 
Sbjct: 222  KKFGFKPRVFLYNRIMDALMKTGYFDLALAVYEDFKEDGLVEESTTFMILVKGLCKSGRM 281

Query: 552  LRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMIL 731
              +  + ++M++   +P VF Y  ++  LV   ++D ++ V+ + K D +  + M Y  L
Sbjct: 282  EEMLEILQRMRENLCRPDVFAYTAMIKTLVSEGNMDASLRVWDEMKRDEVKPDVMAYGTL 341

Query: 732  IKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRI 911
            + GLCK GR+++ + L  +M++     D   Y  +++  +A+G +     +W +++    
Sbjct: 342  VMGLCKDGRVEKGYELFMEMKEKQILIDRDIYRVLIEGFVADGKVRSACDLWKDLVDSGY 401

Query: 912  DPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRV 1076
              D+  Y  ++  LC   +VD                D      ++ AYVV  R+
Sbjct: 402  IADLGIYNAIIKGLCTVKQVDKAYKLFQIATEEELEPDFETLSPIMVAYVVMKRL 456


>ref|XP_002869928.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297315764|gb|EFH46187.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 731

 Score =  445 bits (1145), Expect = e-122
 Identities = 228/365 (62%), Positives = 272/365 (74%), Gaps = 5/365 (1%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPS--AASDEPFDLRKWDPEDESNLKLPYVKDPSERFF 179
            NRP V GGLFS RQ+++      PS   A   PFDLRKWDPE    L+      PS    
Sbjct: 28   NRPIVYGGLFSTRQSLSRDSPQSPSNAVAHRTPFDLRKWDPETHLPLERSSSSPPSHSTV 87

Query: 180  ISA--KNLSPIARYIVDAFRTHKN-WNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSS 350
            ISA  + LSPIAR+++DAFR ++N W   +V+ELN+LRRVTP +VAEVLK  N D   ++
Sbjct: 88   ISAASERLSPIARFVLDAFRKNRNHWGPSVVSELNKLRRVTPSIVAEVLKLGN-DATAAA 146

Query: 351  KFFNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRM 530
            KFF+WAGKQKGY+HDFA YNAFAY LNR  HFR ADQ+PELM  QG+PPSEKQFEILIRM
Sbjct: 147  KFFHWAGKQKGYKHDFAAYNAFAYCLNRNGHFRAADQLPELMDSQGRPPSEKQFEILIRM 206

Query: 531  HADAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEE 710
            HAD +RGLRV+YVYEKMKKFG KPRVFLYNRIMDALVK  + DLA++VY DFKEDGLVEE
Sbjct: 207  HADNRRGLRVYYVYEKMKKFGFKPRVFLYNRIMDALVKNGYFDLALAVYEDFKEDGLVEE 266

Query: 711  NMTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWD 890
            + T+MIL+KGLCKAGR++E+  +L +MR+NL KPDVFAYTAM+K L++EGNLD  L VWD
Sbjct: 267  STTFMILVKGLCKAGRIEEMLEILQRMRENLCKPDVFAYTAMIKTLVSEGNLDASLRVWD 326

Query: 891  EMLKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDG 1070
            EM +D I PDVMAY TLV+ LCK  R++              LIDR +Y  LIE +V DG
Sbjct: 327  EMKRDEIKPDVMAYGTLVVGLCKDGRIERGYELFMEMKGKQILIDREIYRVLIEGFVADG 386

Query: 1071 RVGSA 1085
            +V SA
Sbjct: 387  KVRSA 391



 Score = 80.1 bits (196), Expect = 1e-12
 Identities = 57/235 (24%), Positives = 102/235 (43%)
 Frame = +3

Query: 372  KQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRG 551
            K+ G++     YN     L +  +F  A  V E     G       F IL++    A R 
Sbjct: 224  KKFGFKPRVFLYNRIMDALVKNGYFDLALAVYEDFKEDGLVEESTTFMILVKGLCKAGRI 283

Query: 552  LRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMIL 731
              +  + ++M++   KP VF Y  ++  LV   +LD ++ V+ + K D +  + M Y  L
Sbjct: 284  EEMLEILQRMRENLCKPDVFAYTAMIKTLVSEGNLDASLRVWDEMKRDEIKPDVMAYGTL 343

Query: 732  IKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRI 911
            + GLCK GR++  + L  +M+      D   Y  +++  +A+G +     +W +++    
Sbjct: 344  VVGLCKDGRIERGYELFMEMKGKQILIDREIYRVLIEGFVADGKVRSACDLWKDLVDSGY 403

Query: 912  DPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRV 1076
              D+  Y  ++  LC  N+VD                D      ++ AYVV  R+
Sbjct: 404  IADLGIYNAVIKGLCSVNQVDKAYNLFQVAIEEELEPDFETLSPIMVAYVVMNRL 458


>ref|XP_004140286.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Cucumis sativus] gi|449531474|ref|XP_004172711.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g20740-like [Cucumis sativus]
          Length = 726

 Score =  444 bits (1141), Expect = e-122
 Identities = 225/364 (61%), Positives = 274/364 (75%), Gaps = 4/364 (1%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFFIS 185
            +RPTV GG F+NR+++ P    +P++   +PF L  WDP+  S  +       S+ FF +
Sbjct: 23   HRPTVYGGFFTNRRSLPPPSPHQPTSPKPQPFLLHNWDPDLPSQKRSNLPSSTSDAFFST 82

Query: 186  AKNLSPIARYIVDAFRTHKN-WNAELVAELNRLRRVTPKLVAEVLKFPN---VDPRLSSK 353
            +  LSPIAR+IVD FR ++N W   +++ELN+LRRVTP LVAEVLK  +    +  L+SK
Sbjct: 83   SLRLSPIARFIVDVFRKNQNQWGPPVISELNKLRRVTPDLVAEVLKASHRRDSNSILASK 142

Query: 354  FFNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMH 533
            FF WAGKQKG+ H FA YNAFAY LNR N FR ADQ+PELM  QGKPPSEKQFEILIRMH
Sbjct: 143  FFYWAGKQKGFHHTFASYNAFAYCLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRMH 202

Query: 534  ADAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEEN 713
             DA RGLRV+YVYEKMKKFGV PRVFLYNRI+DALVKTDHLDLA++VY DF+E+GLVEE+
Sbjct: 203  CDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEES 262

Query: 714  MTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDE 893
            +T+MILIKGLCKAGR+DE+  LL +MR NL KPDVFAYTAMVKVL ++ NL+GCL VWDE
Sbjct: 263  VTFMILIKGLCKAGRVDEMLELLARMRANLCKPDVFAYTAMVKVLASKDNLEGCLRVWDE 322

Query: 894  MLKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGR 1073
            M  DR++PDVMAY TL++ LCK  R                LIDRA+YG+LIEA+V D +
Sbjct: 323  MRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK 382

Query: 1074 VGSA 1085
            VG A
Sbjct: 383  VGLA 386



 Score = 62.8 bits (151), Expect = 2e-07
 Identities = 46/190 (24%), Positives = 82/190 (43%)
 Frame = +3

Query: 405 YNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRVHYVYEKMK 584
           YN     L + +H   A  V       G       F ILI+    A R   +  +  +M+
Sbjct: 230 YNRILDALVKTDHLDLALTVYRDFQENGLVEESVTFMILIKGLCKAGRVDEMLELLARMR 289

Query: 585 KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKGLCKAGRMD 764
               KP VF Y  ++  L   D+L+  + V+ + + D +  + M Y  LI GLCK GR  
Sbjct: 290 ANLCKPDVFAYTAMVKVLASKDNLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQ 349

Query: 765 EVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRIDPDVMAYTTLV 944
           + + L  +M+      D   Y  +++  + +  +     ++ +++      D+  Y +L+
Sbjct: 350 KGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLFKDLVDSGYRADLGIYHSLI 409

Query: 945 MALCKGNRVD 974
             LC  N+VD
Sbjct: 410 KGLCNVNQVD 419


>gb|EXC10461.1| hypothetical protein L484_008628 [Morus notabilis]
          Length = 716

 Score =  441 bits (1133), Expect = e-121
 Identities = 221/361 (61%), Positives = 269/361 (74%), Gaps = 1/361 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFFIS 185
            NRPTVRGGLFSNRQ++ P++   P      P DL KWDP    +        P+  F   
Sbjct: 25   NRPTVRGGLFSNRQSLKPRQ--NPHHHHKPPSDLSKWDPHLLPSPSSTTTTTPTLSF--- 79

Query: 186  AKNLSPIARYIVDAFR-THKNWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFFN 362
               LSPIAR+I DAFR  H  W   +V EL++LRRVTP LV EVLK    DP L+SKFF+
Sbjct: 80   ---LSPIARFITDAFRKNHSKWGPPVVTELHKLRRVTPNLVTEVLKV-QTDPSLASKFFH 135

Query: 363  WAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADA 542
            WAGKQKGYRH+FA YNAFAY LNR + +R ADQVP LM  QGKPPSEKQFEILIRMH+DA
Sbjct: 136  WAGKQKGYRHNFASYNAFAYCLNRGDRYRSADQVPHLMEAQGKPPSEKQFEILIRMHSDA 195

Query: 543  KRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTY 722
             RGLRV+Y YE MKKFG+KPRVFL+NR+MDALV+T +LDLA+SVY DFKE GLVEE++T+
Sbjct: 196  NRGLRVYYAYENMKKFGIKPRVFLFNRVMDALVRTGYLDLALSVYGDFKEAGLVEESVTF 255

Query: 723  MILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLK 902
            MILIKGLCKAGR++E+  +L +MR  L KPDVFAYTAMV+V++ EGNLDGCL VW+EM  
Sbjct: 256  MILIKGLCKAGRVEEMLEVLGRMRGELCKPDVFAYTAMVRVMVGEGNLDGCLRVWEEMRS 315

Query: 903  DRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRVGS 1082
            DR++PDV+AY T++  LCKG RV+              L+DRA+YG+L++A+V DG+VG 
Sbjct: 316  DRVEPDVIAYGTVIAGLCKGGRVEKGYELFKEMKGKGALVDRAIYGALVKAFVEDGKVGL 375

Query: 1083 A 1085
            A
Sbjct: 376  A 376



 Score = 69.3 bits (168), Expect = 3e-09
 Identities = 43/155 (27%), Positives = 76/155 (49%)
 Frame = +3

Query: 510 FEILIRMHADAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFK 689
           F ILI+    A R   +  V  +M+    KP VF Y  ++  +V   +LD  + V+ + +
Sbjct: 255 FMILIKGLCKAGRVEEMLEVLGRMRGELCKPDVFAYTAMVRVMVGEGNLDGCLRVWEEMR 314

Query: 690 EDGLVEENMTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLD 869
            D +  + + Y  +I GLCK GR+++ + L  +M+   +  D   Y A+VK  + +G + 
Sbjct: 315 SDRVEPDVIAYGTVIAGLCKGGRVEKGYELFKEMKGKGALVDRAIYGALVKAFVEDGKVG 374

Query: 870 GCLMVWDEMLKDRIDPDVMAYTTLVMALCKGNRVD 974
               V+ +++      D+  Y  L+  LC   RVD
Sbjct: 375 LACDVFKDLVNSGYRADLDIYNYLIQGLCNAKRVD 409



 Score = 58.2 bits (139), Expect = 6e-06
 Identities = 45/198 (22%), Positives = 86/198 (43%)
 Frame = +3

Query: 381 GYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRV 560
           GYR D   YN     L  A     A ++  +   +G  P+      ++  +A+ ++    
Sbjct: 387 GYRADLDIYNYLIQGLCNAKRVDKAYKLFRVTVQEGLGPNFVTINPILLCYAEMRKIDEF 446

Query: 561 HYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKG 740
             +  +M+K G+     L       + K D L +A+ V+ D K  G    ++ Y IL++ 
Sbjct: 447 CDLLVQMQKLGISVVDDLTKFFSFVVRKGDGLKMALEVFEDLKVRGYYSVSI-YNILMEA 505

Query: 741 LCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRIDPD 920
             K     +   LL++M+   ++PD   Y+  ++  + EG+L       +++++    P 
Sbjct: 506 FYKTEMAKKALSLLNEMKDMNAQPDSSTYSVAIECFVEEGDLKEACACHNKIIEMSCVPS 565

Query: 921 VMAYTTLVMALCKGNRVD 974
           V AY +L   LC    +D
Sbjct: 566 VSAYCSLARGLCNIGEID 583


>ref|XP_006283187.1| hypothetical protein CARUB_v10004218mg, partial [Capsella rubella]
            gi|482551892|gb|EOA16085.1| hypothetical protein
            CARUB_v10004218mg, partial [Capsella rubella]
          Length = 745

 Score =  441 bits (1133), Expect = e-121
 Identities = 228/366 (62%), Positives = 273/366 (74%), Gaps = 6/366 (1%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTIN---PKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERF 176
            NRP V GGLFSNRQ+++   P+  S  + A   PFDLRKWDPE     +      PS   
Sbjct: 42   NRPIVYGGLFSNRQSLSRDSPQPQSN-AVAHRTPFDLRKWDPESHLPSERASSSPPSHST 100

Query: 177  FISA--KNLSPIARYIVDAFRTHKN-WNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLS 347
             ISA  + LSPIAR+++DAFR ++N W   +V+ELN+LRRVTP +VAEVLK  N D  ++
Sbjct: 101  GISAASERLSPIARFVLDAFRKNRNHWGPSVVSELNKLRRVTPSIVAEVLKLGN-DAAVA 159

Query: 348  SKFFNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIR 527
            +KFF+WAGKQKGYRHDFA YNAFAY LNR  HFR ADQ+PELM  QG+PPSEKQFEILIR
Sbjct: 160  AKFFHWAGKQKGYRHDFASYNAFAYCLNRNGHFRAADQLPELMDSQGRPPSEKQFEILIR 219

Query: 528  MHADAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVE 707
            MHAD KRGLRV+YVYEKMKKFG KPRVFLYNRIMDALVK  + DLA++VY DFKEDGLVE
Sbjct: 220  MHADNKRGLRVYYVYEKMKKFGFKPRVFLYNRIMDALVKNGYFDLALAVYEDFKEDGLVE 279

Query: 708  ENMTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVW 887
            E+ T+MIL+KGLCKAGR++E+  +L +MR NL KPDVFAYTAM+K L++EGN+D  L VW
Sbjct: 280  ESTTFMILVKGLCKAGRIEEMLEILQRMRANLCKPDVFAYTAMIKTLVSEGNMDASLQVW 339

Query: 888  DEMLKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVD 1067
            DEM +D I PDVMAY TLV  LC+  RV+              LIDR +Y  LIE +V +
Sbjct: 340  DEMKRDEIKPDVMAYGTLVTGLCRDGRVERGYELFMEMKEKQILIDREIYRVLIEGFVAE 399

Query: 1068 GRVGSA 1085
            G+V SA
Sbjct: 400  GKVRSA 405



 Score = 82.4 bits (202), Expect = 3e-13
 Identities = 56/235 (23%), Positives = 103/235 (43%)
 Frame = +3

Query: 372  KQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRG 551
            K+ G++     YN     L +  +F  A  V E     G       F IL++    A R 
Sbjct: 238  KKFGFKPRVFLYNRIMDALVKNGYFDLALAVYEDFKEDGLVEESTTFMILVKGLCKAGRI 297

Query: 552  LRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMIL 731
              +  + ++M+    KP VF Y  ++  LV   ++D ++ V+ + K D +  + M Y  L
Sbjct: 298  EEMLEILQRMRANLCKPDVFAYTAMIKTLVSEGNMDASLQVWDEMKRDEIKPDVMAYGTL 357

Query: 732  IKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRI 911
            + GLC+ GR++  + L  +M++     D   Y  +++  +AEG +     +W++++    
Sbjct: 358  VTGLCRDGRVERGYELFMEMKEKQILIDREIYRVLIEGFVAEGKVRSACNLWEDLVDSGY 417

Query: 912  DPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRV 1076
              D+  Y  ++  LC  N+VD                D      ++ AYVV  R+
Sbjct: 418  IADLGIYNAVIKGLCSVNQVDKAYKLFQIAIDEELEPDFETLSPILVAYVVMNRL 472


>ref|XP_003598903.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355487951|gb|AES69154.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 767

 Score =  439 bits (1130), Expect = e-121
 Identities = 224/364 (61%), Positives = 271/364 (74%), Gaps = 4/364 (1%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPE--DESNLKLPYVKDPSERFF 179
            NRPTVRGGLFSNR+T+ P K    S      F ++KWDP    + N   P      E  F
Sbjct: 24   NRPTVRGGLFSNRKTLTPPK--PKSTKPTNSFQIQKWDPHFLSQPNSPSPSPSPSPEATF 81

Query: 180  ISAKNLSPIARYIVDAFR-THKNWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKF 356
             ++  LSPIAR+I+DAFR  + NW   +V ELN+LRRVTP LVAEVLK    +P L+ KF
Sbjct: 82   SASLRLSPIARFILDAFRKNNNNWGPPVVTELNKLRRVTPTLVAEVLKV-QTNPTLAFKF 140

Query: 357  FNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHA 536
            F+W  KQKGY H+FA YNAF Y LNRANHFR ADQ+PELM  QGKPPSEKQFEILIRMH+
Sbjct: 141  FHWVEKQKGYHHNFASYNAFTYCLNRANHFRAADQLPELMDAQGKPPSEKQFEILIRMHS 200

Query: 537  DAKRGLRVHYVYEKMK-KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEEN 713
            DA RGLRV++VY+KM+ KFGVKPRVFLYNRIMDALVKT HLDLA+SVY DF+EDGLVEE+
Sbjct: 201  DAGRGLRVYHVYDKMRNKFGVKPRVFLYNRIMDALVKTGHLDLALSVYNDFREDGLVEES 260

Query: 714  MTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDE 893
            +T+MILIKGLCK G++DE+  +L +MR+ L KPDVFAYTA+V++++ EGNLDGCL VW E
Sbjct: 261  VTFMILIKGLCKGGKIDEMLEVLGRMREKLCKPDVFAYTALVRIMVKEGNLDGCLRVWKE 320

Query: 894  MLKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGR 1073
            M +DR+DPDVMAY T++  L KG RV               LIDRA+YGSL+E++V   +
Sbjct: 321  MKRDRVDPDVMAYGTIIGGLAKGGRVSEGYELFKEMKSKGHLIDRAIYGSLVESFVAGNK 380

Query: 1074 VGSA 1085
            VG A
Sbjct: 381  VGLA 384



 Score = 68.6 bits (166), Expect = 4e-09
 Identities = 48/198 (24%), Positives = 89/198 (44%)
 Frame = +3

Query: 381 GYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRV 560
           GYR D   YN     L   N    A ++ ++   +G  P     + L+  +A+AKR    
Sbjct: 395 GYRADLGMYNNLIEGLCNLNKVEKAYKLFQVTIQEGLEPDFLSVKPLLLAYAEAKRMEEF 454

Query: 561 HYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKG 740
             + EKMKK G  P +   ++    LV+    ++A+ ++   KE   V   + Y I ++ 
Sbjct: 455 FMLLEKMKKLGF-PVIDDLSKFFSHLVEKKGPEMALEIFTHLKEKSYVSVEI-YNIFMES 512

Query: 741 LCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRIDPD 920
           L  +G++++   L D+++ +  +PD   Y   +  L+  G +       +++++    P 
Sbjct: 513 LHLSGKVEKALSLFDEIKGSDLEPDSSTYNIAILCLVDHGQIKEACECHNKIIEMSSIPS 572

Query: 921 VMAYTTLVMALCKGNRVD 974
           V AY  L   LC    +D
Sbjct: 573 VAAYNCLAKGLCNIGEID 590



 Score = 61.6 bits (148), Expect = 5e-07
 Identities = 53/194 (27%), Positives = 85/194 (43%), Gaps = 4/194 (2%)
 Frame = +3

Query: 405 YNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRVHYVYEKMK 584
           YN     L +  H   A  V       G       F ILI+      +   +  V  +M+
Sbjct: 228 YNRIMDALVKTGHLDLALSVYNDFREDGLVEESVTFMILIKGLCKGGKIDEMLEVLGRMR 287

Query: 585 KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKGLCKAGRMD 764
           +   KP VF Y  ++  +VK  +LD  + V+ + K D +  + M Y  +I GL K GR+ 
Sbjct: 288 EKLCKPDVFAYTALVRIMVKEGNLDGCLRVWKEMKRDRVDPDVMAYGTIIGGLAKGGRVS 347

Query: 765 EVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRID----PDVMAY 932
           E + L  +M+      D   Y ++V+  +A GN  G      ++LKD +      D+  Y
Sbjct: 348 EGYELFKEMKSKGHLIDRAIYGSLVESFVA-GNKVGLAF---DLLKDLVSSGYRADLGMY 403

Query: 933 TTLVMALCKGNRVD 974
             L+  LC  N+V+
Sbjct: 404 NNLIEGLCNLNKVE 417


>ref|XP_004229293.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Solanum lycopersicum]
          Length = 1256

 Score =  434 bits (1115), Expect = e-119
 Identities = 224/362 (61%), Positives = 261/362 (72%), Gaps = 2/362 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSR--PSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFF 179
            +RPTV+GGLFSNRQTINP   ++  PS  +   F L+KWDP++ S  K    +DPS+ FF
Sbjct: 593  HRPTVQGGLFSNRQTINPNLTTKNSPSPVTQGDFQLQKWDPDEVSGQKS---RDPSQEFF 649

Query: 180  ISAKNLSPIARYIVDAFRTHKNWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFF 359
              A+ LSPIARYIVD+FR H  W A L+A+LN LRRVTPKLV EVLK PN+DP++SSKFF
Sbjct: 650  SLAQRLSPIARYIVDSFRKHGKWGAPLLADLNTLRRVTPKLVTEVLKHPNLDPKISSKFF 709

Query: 360  NWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHAD 539
             WAGKQKGYRHDF+CYNAFAY LNRAN FR ADQVPELMHMQGKPPSEKQFEILIRMH D
Sbjct: 710  YWAGKQKGYRHDFSCYNAFAYGLNRANQFRTADQVPELMHMQGKPPSEKQFEILIRMHGD 769

Query: 540  AKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMT 719
            A RGLRV+YVYEKMKKFGVKPRVFLYNRIMDALVKT+HLDLAMSVY DFK+DGLVEE++T
Sbjct: 770  ANRGLRVYYVYEKMKKFGVKPRVFLYNRIMDALVKTNHLDLAMSVYDDFKKDGLVEESIT 829

Query: 720  YMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEML 899
            +MILIKGLCK GRMDEVF +  +M+++  +PDV AY                        
Sbjct: 830  FMILIKGLCKFGRMDEVFEVWKEMQQDAVEPDVIAY------------------------ 865

Query: 900  KDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRVG 1079
                       +T +  LCK N+VD              LIDR +YGSLIE++V  G+VG
Sbjct: 866  -----------STFIAGLCKNNQVDKGYELFKEMKQKKILIDRGIYGSLIESFVASGKVG 914

Query: 1080 SA 1085
             A
Sbjct: 915  LA 916


>ref|XP_006482966.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Citrus sinensis]
          Length = 721

 Score =  430 bits (1105), Expect = e-118
 Identities = 226/363 (62%), Positives = 274/363 (75%), Gaps = 3/363 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTI-NPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSE-RFF 179
            NRPTV GG FSNRQ++ NP   S P  +  +PF+++KWDP    N K      PS+ + F
Sbjct: 24   NRPTVYGGFFSNRQSLRNPNSTSEPHQS--QPFNVQKWDPHYLPNQKTQ--SPPSDPKTF 79

Query: 180  ISAKNLSPIARYIVDAFRTHK-NWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKF 356
               ++LSPIAR+I DAFR ++ +W   +V EL++LRRVTP LVAEVLK  N +P L+SKF
Sbjct: 80   QLQRHLSPIARFITDAFRKNQFHWGPRVVTELSKLRRVTPDLVAEVLKVEN-NPTLASKF 138

Query: 357  FNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHA 536
            F+WAGKQKGY+H+FA YNA AY L+R N FR ADQVPELM  QGKPP+EKQFEILIRMHA
Sbjct: 139  FHWAGKQKGYKHNFASYNALAYCLSRNNLFRAADQVPELMDSQGKPPTEKQFEILIRMHA 198

Query: 537  DAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENM 716
            D  RGLRV +VY+KMKKFG+ PRVFLYN+IMDALVKT+ LDLA+SVY +FK  GLVEE++
Sbjct: 199  DCNRGLRVFHVYQKMKKFGILPRVFLYNKIMDALVKTNCLDLALSVYEEFKGHGLVEESV 258

Query: 717  TYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEM 896
            TYMILIKGLCKAGR+ E+  +L+KMR+NL KPDVFAYTAM++VL AE NLD CL VW+EM
Sbjct: 259  TYMILIKGLCKAGRIAEMLEILEKMRRNLCKPDVFAYTAMIRVLAAERNLDACLRVWEEM 318

Query: 897  LKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRV 1076
             KD ++ DVMAY TL+M LCKG RV               LIDRA+YG LIE  V +G+V
Sbjct: 319  KKDLVEADVMAYVTLIMGLCKGGRVVRGYELFREMKENGILIDRAIYGVLIEGLVGEGKV 378

Query: 1077 GSA 1085
            G A
Sbjct: 379  GKA 381



 Score = 72.0 bits (175), Expect = 4e-10
 Identities = 54/194 (27%), Positives = 90/194 (46%), Gaps = 4/194 (2%)
 Frame = +3

Query: 405 YNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRVHYVYEKMK 584
           YN     L + N    A  V E     G       + ILI+    A R   +  + EKM+
Sbjct: 225 YNKIMDALVKTNCLDLALSVYEEFKGHGLVEESVTYMILIKGLCKAGRIAEMLEILEKMR 284

Query: 585 KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKGLCKAGRMD 764
           +   KP VF Y  ++  L    +LD  + V+ + K+D +  + M Y+ LI GLCK GR+ 
Sbjct: 285 RNLCKPDVFAYTAMIRVLAAERNLDACLRVWEEMKKDLVEADVMAYVTLIMGLCKGGRVV 344

Query: 765 EVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRID----PDVMAY 932
             + L  +M++N    D   Y  +++ L+ EG +        ++LKD +D     D+  Y
Sbjct: 345 RGYELFREMKENGILIDRAIYGVLIEGLVGEGKVGKAC----DLLKDLVDSGYRADLGIY 400

Query: 933 TTLVMALCKGNRVD 974
            +++  LC+  + D
Sbjct: 401 NSIIGGLCRVKQFD 414


>ref|XP_004494981.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Cicer arietinum]
          Length = 720

 Score =  430 bits (1105), Expect = e-118
 Identities = 221/363 (60%), Positives = 269/363 (74%), Gaps = 3/363 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLK-LPYVKDPSERFFI 182
            NRPTVRGGLFSNRQT+ P K    S  +  PF+++KWDP   S     P     SE  F 
Sbjct: 24   NRPTVRGGLFSNRQTLTPPK----SKTTSRPFEIQKWDPHFLSQQNPSPPPSPSSEASFS 79

Query: 183  SAKNLSPIARYIVDAFRTHK-NWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFF 359
             +  LSPIAR+IVDAFR +   W   ++ ELN+LRRV P LVAEVLK    +P L+ KFF
Sbjct: 80   PSLRLSPIARFIVDAFRKNSYKWGPSVITELNKLRRVPPNLVAEVLKV-QTNPTLAFKFF 138

Query: 360  NWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHAD 539
            +W   QKGY H+FA +NAFAY LNRANHF  ADQ+PELM  QGKPPSEKQFEILIRMH+D
Sbjct: 139  HWVENQKGYHHNFASFNAFAYCLNRANHFHAADQLPELMDAQGKPPSEKQFEILIRMHSD 198

Query: 540  AKRGLRVHYVYEKMK-KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENM 716
            A RGLR ++VY+KM+ KFGVKPRVFLYNRIMDALVKT HLDLA+SVY DF+EDGLVEE++
Sbjct: 199  AGRGLRAYHVYDKMRNKFGVKPRVFLYNRIMDALVKTRHLDLALSVYNDFREDGLVEESV 258

Query: 717  TYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEM 896
            T+M+L+KGLCKAGR+ E+  +L +MR+ L KPDVFAYTA+V++++AEGNLDGCL VW+EM
Sbjct: 259  TFMVLVKGLCKAGRIGEMLEVLGRMREKLYKPDVFAYTALVRIMVAEGNLDGCLRVWEEM 318

Query: 897  LKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRV 1076
             +D + PDVMAY T++  L K  RV               LIDRA+YGSLIE++VV  +V
Sbjct: 319  KRDGVVPDVMAYDTIIGGLAKEGRVKEGYELFKEMKSKGHLIDRAIYGSLIESFVVGNKV 378

Query: 1077 GSA 1085
            G A
Sbjct: 379  GLA 381



 Score = 65.1 bits (157), Expect = 5e-08
 Identities = 52/194 (26%), Positives = 87/194 (44%), Gaps = 4/194 (2%)
 Frame = +3

Query: 405 YNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRVHYVYEKMK 584
           YN     L +  H   A  V       G       F +L++    A R   +  V  +M+
Sbjct: 225 YNRIMDALVKTRHLDLALSVYNDFREDGLVEESVTFMVLVKGLCKAGRIGEMLEVLGRMR 284

Query: 585 KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKGLCKAGRMD 764
           +   KP VF Y  ++  +V   +LD  + V+ + K DG+V + M Y  +I GL K GR+ 
Sbjct: 285 EKLYKPDVFAYTALVRIMVAEGNLDGCLRVWEEMKRDGVVPDVMAYDTIIGGLAKEGRVK 344

Query: 765 EVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRID----PDVMAY 932
           E + L  +M+      D   Y ++++  +  GN  G      ++LKD ++     D+  Y
Sbjct: 345 EGYELFKEMKSKGHLIDRAIYGSLIESFVV-GNKVGLAF---DLLKDLVNSGYRADLGIY 400

Query: 933 TTLVMALCKGNRVD 974
             L+  LC  N+V+
Sbjct: 401 NNLIKGLCNLNKVE 414


>ref|XP_007008770.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao] gi|508725683|gb|EOY17580.1| Pentatricopeptide
            repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 716

 Score =  429 bits (1104), Expect = e-118
 Identities = 227/362 (62%), Positives = 268/362 (74%), Gaps = 2/362 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTIN-PKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFFI 182
            NRP V GGLFSNRQ +  P    +PS     PFDLRKWDP   S    P    PS     
Sbjct: 25   NRPVVYGGLFSNRQILKTPPTPPQPSP----PFDLRKWDPYYLSQNPSP----PSTPNPY 76

Query: 183  SAKNLSPIARYIVDAFRTHK-NWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFF 359
              + LSPIAR+IVDAFR ++  W   +V ELN+LRRVT  LVAEVLK  N DP L+SKFF
Sbjct: 77   QNRKLSPIARFIVDAFRKNQYTWGPTVVFELNKLRRVTASLVAEVLKVEN-DPVLASKFF 135

Query: 360  NWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHAD 539
            +WAGKQKG++H+FA YNA AY LNR   FR ADQ+PELM  QGK P+EKQFEILIRMHAD
Sbjct: 136  HWAGKQKGFKHNFASYNALAYCLNRNGRFRAADQLPELMDSQGKQPTEKQFEILIRMHAD 195

Query: 540  AKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMT 719
              RG RV+YVY+KMK FG+KPRVFLYNRIMDALVKT +LDLA+SVY DF+ DGLVEE++T
Sbjct: 196  NNRGQRVYYVYQKMKNFGIKPRVFLYNRIMDALVKTGYLDLALSVYEDFRGDGLVEESIT 255

Query: 720  YMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEML 899
            +MILIKGLCKAGR++E+  +L +MR+ L KPDVFAYTAMV++L++E NLDGCL+VW+EM 
Sbjct: 256  FMILIKGLCKAGRIEEMLEVLGRMREKLCKPDVFAYTAMVRILVSEKNLDGCLLVWEEME 315

Query: 900  KDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRVG 1079
            +D ++PDVMAY TLV  LCKG RV               LIDRA YG LIE +V DG+VG
Sbjct: 316  RDGVEPDVMAYVTLVTGLCKGGRVQRGYELFREMKDKGILIDRATYGVLIEGFVKDGKVG 375

Query: 1080 SA 1085
            SA
Sbjct: 376  SA 377



 Score = 70.9 bits (172), Expect = 9e-10
 Identities = 55/205 (26%), Positives = 90/205 (43%), Gaps = 4/205 (1%)
 Frame = +3

Query: 372 KQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRG 551
           K  G +     YN     L +  +   A  V E     G       F ILI+    A R 
Sbjct: 210 KNFGIKPRVFLYNRIMDALVKTGYLDLALSVYEDFRGDGLVEESITFMILIKGLCKAGRI 269

Query: 552 LRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMIL 731
             +  V  +M++   KP VF Y  ++  LV   +LD  + V+ + + DG+  + M Y+ L
Sbjct: 270 EEMLEVLGRMREKLCKPDVFAYTAMVRILVSEKNLDGCLLVWEEMERDGVEPDVMAYVTL 329

Query: 732 IKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRI 911
           + GLCK GR+   + L  +M+      D   Y  +++  + +G +        ++LKD +
Sbjct: 330 VTGLCKGGRVQRGYELFREMKDKGILIDRATYGVLIEGFVKDGKVGSAC----DLLKDLV 385

Query: 912 D----PDVMAYTTLVMALCKGNRVD 974
           D     D+  Y +L+  LC   RVD
Sbjct: 386 DSGYRADLGIYNSLIEGLCDARRVD 410


>ref|XP_006605814.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            isoform X2 [Glycine max] gi|571565751|ref|XP_003555182.2|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g20740-like isoform X1 [Glycine max]
          Length = 764

 Score =  427 bits (1099), Expect = e-117
 Identities = 217/360 (60%), Positives = 272/360 (75%), Gaps = 3/360 (0%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFFIS 185
            NRPTVRGGLFSNRQT+NP   S+P   + +PF+++ WDP   SN       +PS     S
Sbjct: 70   NRPTVRGGLFSNRQTLNPNP-SQPKPTT-KPFNIKNWDPHFLSNPN----SNPSPSTLSS 123

Query: 186  AK-NLSPIARYIVDAFRTHKN-WNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFF 359
            A   LSPIAR+IVDAFR + N W   + AEL++LRR+TP LVAEVLK    +  L+SKFF
Sbjct: 124  ASLRLSPIARFIVDAFRRNDNKWCPNVAAELSKLRRITPNLVAEVLKV-QTNHTLASKFF 182

Query: 360  NWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHAD 539
            +WAG Q+GY H+FA YNA AY LNR + FR ADQ+PELM  QGKPPSEKQFEILIRMH+D
Sbjct: 183  HWAGSQRGYHHNFASYNALAYCLNRHHQFRAADQLPELMESQGKPPSEKQFEILIRMHSD 242

Query: 540  AKRGLRVHYVYEKMK-KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENM 716
            A RGLRV++VYEKM+ KFGVKPRVFLYNR+MDALV+T HLDLA+SVY D KEDGLVEE++
Sbjct: 243  ANRGLRVYHVYEKMRNKFGVKPRVFLYNRVMDALVRTGHLDLALSVYDDLKEDGLVEESV 302

Query: 717  TYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEM 896
            T+M+L+KGLCK GR+DE+  +L +MR+ L KPDVFAYTA+VK+L+  GNLD CL VW+EM
Sbjct: 303  TFMVLVKGLCKCGRIDEMLEVLGRMRERLCKPDVFAYTALVKILVPAGNLDACLRVWEEM 362

Query: 897  LKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRV 1076
             +DR++PDV AY T+++ L KG RV               L+DR +YG+L+EA+V +G+V
Sbjct: 363  KRDRVEPDVKAYATMIVGLAKGGRVQEGYELFREMKGKGCLVDRVIYGALVEAFVAEGKV 422



 Score = 70.1 bits (170), Expect = 2e-09
 Identities = 50/189 (26%), Positives = 83/189 (43%)
 Frame = +3

Query: 405 YNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRVHYVYEKMK 584
           YN     L R  H   A  V + +   G       F +L++      R   +  V  +M+
Sbjct: 269 YNRVMDALVRTGHLDLALSVYDDLKEDGLVEESVTFMVLVKGLCKCGRIDEMLEVLGRMR 328

Query: 585 KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKGLCKAGRMD 764
           +   KP VF Y  ++  LV   +LD  + V+ + K D +  +   Y  +I GL K GR+ 
Sbjct: 329 ERLCKPDVFAYTALVKILVPAGNLDACLRVWEEMKRDRVEPDVKAYATMIVGLAKGGRVQ 388

Query: 765 EVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRIDPDVMAYTTLV 944
           E + L  +M+      D   Y A+V+  +AEG ++    +  +++      D+  Y  L+
Sbjct: 389 EGYELFREMKGKGCLVDRVIYGALVEAFVAEGKVELAFDLLKDLVSSGYRADLGIYICLI 448

Query: 945 MALCKGNRV 971
             LC  NRV
Sbjct: 449 EGLCNLNRV 457



 Score = 64.7 bits (156), Expect = 6e-08
 Identities = 50/198 (25%), Positives = 86/198 (43%)
 Frame = +3

Query: 381  GYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRV 560
            GYR D   Y      L   N  + A ++ +L   +G  P     + L+  +A+A R    
Sbjct: 436  GYRADLGIYICLIEGLCNLNRVQKAYKLFQLTVREGLEPDFLTVKPLLVAYAEANRMEEF 495

Query: 561  HYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKG 740
              + E+M+K G  P +   ++    LV+     +A+  +   KE G V   + Y I +  
Sbjct: 496  CKLLEQMQKLGF-PVIADLSKFFSVLVEKKGPIMALETFGQLKEKGHVSVEI-YNIFMDS 553

Query: 741  LCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRIDPD 920
            L K G + +   L D+M+    KPD F Y   +  L+  G +       + +++    P 
Sbjct: 554  LHKIGEVKKALSLFDEMKGLSLKPDSFTYCTAILCLVDLGEIKEACACHNRIIEMSCIPS 613

Query: 921  VMAYTTLVMALCKGNRVD 974
            V AY++L   LC+   +D
Sbjct: 614  VAAYSSLTKGLCQIGEID 631


>ref|XP_006438906.1| hypothetical protein CICLE_v10030824mg [Citrus clementina]
            gi|557541102|gb|ESR52146.1| hypothetical protein
            CICLE_v10030824mg [Citrus clementina]
          Length = 721

 Score =  427 bits (1097), Expect = e-117
 Identities = 227/365 (62%), Positives = 273/365 (74%), Gaps = 5/365 (1%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTI-NPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSE-RFF 179
            NRPTV GG FSNRQ++ NP   S P  +  +PF+++KWDP    + K      PS+ + F
Sbjct: 24   NRPTVYGGFFSNRQSLRNPNSTSEPHQS--QPFNVQKWDPHYLPSQKTQ--SPPSDPKTF 79

Query: 180  ISAKNLSPIARYIVDAFRTHKN---WNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSS 350
               ++LSPIAR+I DAF  HKN   W   +V EL++LRRVTP LVAEVLK  N +P L+S
Sbjct: 80   QLQRHLSPIARFITDAF--HKNQFHWGPRVVTELSKLRRVTPDLVAEVLKVEN-NPTLAS 136

Query: 351  KFFNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRM 530
            KFF+WAGKQKGY+H+FA YNA AY L+R N FR ADQVPELM  QGKPP+EKQFEILIRM
Sbjct: 137  KFFHWAGKQKGYKHNFASYNALAYCLSRNNLFRAADQVPELMDSQGKPPTEKQFEILIRM 196

Query: 531  HADAKRGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEE 710
            HAD  RGLRV +VY+KMKKFG+ PRVFLYN+IMDALVKT+ LDLA+SVY +FK  GLVEE
Sbjct: 197  HADCNRGLRVFHVYQKMKKFGILPRVFLYNKIMDALVKTNCLDLALSVYEEFKGHGLVEE 256

Query: 711  NMTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWD 890
            ++TYMILIKGLCKAGR+ E+  +L+KMR+NL KPDVFAYTAM++VL AE NLD CL VW+
Sbjct: 257  SVTYMILIKGLCKAGRIAEMLEILEKMRRNLCKPDVFAYTAMIRVLAAERNLDACLRVWE 316

Query: 891  EMLKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDG 1070
            EM KD ++ DVMAY TL+M LCKG RV               LIDRA+YG LIE  V +G
Sbjct: 317  EMKKDLVEADVMAYVTLIMGLCKGGRVVRGYKLFREMKENGILIDRAIYGVLIEGLVGEG 376

Query: 1071 RVGSA 1085
            +VG A
Sbjct: 377  KVGKA 381



 Score = 72.8 bits (177), Expect = 2e-10
 Identities = 54/194 (27%), Positives = 91/194 (46%), Gaps = 4/194 (2%)
 Frame = +3

Query: 405 YNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRVHYVYEKMK 584
           YN     L + N    A  V E     G       + ILI+    A R   +  + EKM+
Sbjct: 225 YNKIMDALVKTNCLDLALSVYEEFKGHGLVEESVTYMILIKGLCKAGRIAEMLEILEKMR 284

Query: 585 KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKGLCKAGRMD 764
           +   KP VF Y  ++  L    +LD  + V+ + K+D +  + M Y+ LI GLCK GR+ 
Sbjct: 285 RNLCKPDVFAYTAMIRVLAAERNLDACLRVWEEMKKDLVEADVMAYVTLIMGLCKGGRVV 344

Query: 765 EVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRID----PDVMAY 932
             ++L  +M++N    D   Y  +++ L+ EG +        ++LKD +D     D+  Y
Sbjct: 345 RGYKLFREMKENGILIDRAIYGVLIEGLVGEGKVGKAC----DLLKDLVDSGYRADLGIY 400

Query: 933 TTLVMALCKGNRVD 974
            +++  LC+  + D
Sbjct: 401 NSIIGGLCRVKQFD 414


>ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Vitis vinifera]
          Length = 1294

 Score =  422 bits (1086), Expect = e-116
 Identities = 223/360 (61%), Positives = 256/360 (71%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPEDESNLKLPYVKDPSERFFIS 185
            NRPTV GGLFSNR T+NPK  +  +  +   F+L+ WDP+    L +P  K P ERFF  
Sbjct: 564  NRPTVHGGLFSNRTTLNPKPPTLQNPTTH--FNLQNWDPDSPKALAIPPSKTPCERFFDI 621

Query: 186  AKNLSPIARYIVDAFRTHKNWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKFFNW 365
            AKNLSPIARYI D+FR H+NW   +VA+LN+LRRVTP LVAEVLK    DP + SKFF+W
Sbjct: 622  AKNLSPIARYICDSFRKHRNWGPPVVADLNKLRRVTPVLVAEVLKV-QTDPVICSKFFHW 680

Query: 366  AGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAK 545
            AGKQKGY+H+FA YNAFAY LNR+N FR ADQVPELM+MQGKPPSEKQFEILIRMH DA 
Sbjct: 681  AGKQKGYKHNFASYNAFAYCLNRSNQFRAADQVPELMNMQGKPPSEKQFEILIRMHIDAN 740

Query: 546  RGLRVHYVYEKMKKFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYM 725
            RGLRV+YVYEKMKKFG+KPRVFLYNRIMD LVKT HLDLAMSVY DFKEDGLVEE++TYM
Sbjct: 741  RGLRVYYVYEKMKKFGIKPRVFLYNRIMDGLVKTGHLDLAMSVYEDFKEDGLVEESVTYM 800

Query: 726  ILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKD 905
            IL+KGLCKAGR+DEV  + ++MRK+  +PDV AY                          
Sbjct: 801  ILVKGLCKAGRIDEVLEVWEEMRKDKVEPDVMAY-------------------------- 834

Query: 906  RIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGRVGSA 1085
                     TTLV ALC GNRV               LIDRA+YGSLIE +VV+ RVGSA
Sbjct: 835  ---------TTLVAALCNGNRVGEGFELFKEMKQKKYLIDRAIYGSLIEGFVVNERVGSA 885


>ref|XP_004515007.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Cicer arietinum]
          Length = 720

 Score =  421 bits (1081), Expect = e-115
 Identities = 218/364 (59%), Positives = 267/364 (73%), Gaps = 4/364 (1%)
 Frame = +3

Query: 6    NRPTVRGGLFSNRQTINPKKFSRPSAASDEPFDLRKWDPE--DESNLKLPYVKDPSERFF 179
            NRPTVRGGLFSNRQT+ P K   P   S  PF+++KWDP    + N   P    P+  F 
Sbjct: 24   NRPTVRGGLFSNRQTLTPPK---PKTTS-RPFEIQKWDPHFLSQQNPSPPPSPSPAASFS 79

Query: 180  ISAKNLSPIARYIVDAFRTHK-NWNAELVAELNRLRRVTPKLVAEVLKFPNVDPRLSSKF 356
             S + LSPI R+IVDAFR +   W   ++ EL++ RRV P LVAEVLK    +P ++ KF
Sbjct: 80   ASLR-LSPIVRFIVDAFRKNGYKWGPSVITELSKFRRVPPNLVAEVLKV-QTNPTIAFKF 137

Query: 357  FNWAGKQKGYRHDFACYNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHA 536
            F W   QKGY H+FA +NAFAY LNRANHF  ADQ+PELM  QGKPPSEKQFEILIRMH+
Sbjct: 138  FRWVENQKGYHHNFASFNAFAYCLNRANHFHAADQLPELMDAQGKPPSEKQFEILIRMHS 197

Query: 537  DAKRGLRVHYVYEKMK-KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEEN 713
            DA RGLRV++VY+KM+ KFGVKPRVFLYNRIMDALVKT HLDLA+SVY DF+EDGLVEE+
Sbjct: 198  DAGRGLRVYHVYDKMRNKFGVKPRVFLYNRIMDALVKTGHLDLALSVYNDFREDGLVEES 257

Query: 714  MTYMILIKGLCKAGRMDEVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDE 893
            +TYM+L+KGLCKAGR+ E+  +L +MR+ L KPDV AYTA+V++++AEGNLDGCL VW+E
Sbjct: 258  VTYMVLVKGLCKAGRIGEMLEVLGRMREKLCKPDVCAYTALVRIMVAEGNLDGCLRVWEE 317

Query: 894  MLKDRIDPDVMAYTTLVMALCKGNRVDXXXXXXXXXXXXXXLIDRAVYGSLIEAYVVDGR 1073
            M +D + PDVMAY T++  L K  RV               LIDRA+YGSLIE++V   +
Sbjct: 318  MKRDGVVPDVMAYGTVIGGLAKEGRVKEGYELFKEMKSKGHLIDRAIYGSLIESFVAGNK 377

Query: 1074 VGSA 1085
            VG A
Sbjct: 378  VGLA 381



 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 45/190 (23%), Positives = 82/190 (43%)
 Frame = +3

Query: 405 YNAFAYFLNRANHFRDADQVPELMHMQGKPPSEKQFEILIRMHADAKRGLRVHYVYEKMK 584
           YN     L +  H   A  V       G       + +L++    A R   +  V  +M+
Sbjct: 225 YNRIMDALVKTGHLDLALSVYNDFREDGLVEESVTYMVLVKGLCKAGRIGEMLEVLGRMR 284

Query: 585 KFGVKPRVFLYNRIMDALVKTDHLDLAMSVYIDFKEDGLVEENMTYMILIKGLCKAGRMD 764
           +   KP V  Y  ++  +V   +LD  + V+ + K DG+V + M Y  +I GL K GR+ 
Sbjct: 285 EKLCKPDVCAYTALVRIMVAEGNLDGCLRVWEEMKRDGVVPDVMAYGTVIGGLAKEGRVK 344

Query: 765 EVFRLLDKMRKNLSKPDVFAYTAMVKVLIAEGNLDGCLMVWDEMLKDRIDPDVMAYTTLV 944
           E + L  +M+      D   Y ++++  +A   +     +  +++      D+  Y  L+
Sbjct: 345 EGYELFKEMKSKGHLIDRAIYGSLIESFVAGNKVGLAFDLLRDLVNSGYRADLGIYNNLI 404

Query: 945 MALCKGNRVD 974
             LC  N+V+
Sbjct: 405 EGLCNLNKVE 414

