BLASTX nr result

ID: Atropa21_contig00005017 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00005017
         (2051 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006347992.1| PREDICTED: pentatricopeptide repeat-containi...   973   0.0  
ref|XP_004229730.1| PREDICTED: pentatricopeptide repeat-containi...   966   0.0  
ref|XP_002270492.1| PREDICTED: pentatricopeptide repeat-containi...   669   0.0  
ref|XP_002301239.2| pentatricopeptide repeat-containing family p...   666   0.0  
ref|XP_002327026.1| predicted protein [Populus trichocarpa]           665   0.0  
ref|XP_006375170.1| pentatricopeptide repeat-containing family p...   664   0.0  
ref|XP_002526471.1| pentatricopeptide repeat-containing protein,...   663   0.0  
gb|EOX96514.1| Tetratricopeptide repeat (TPR)-like superfamily p...   661   0.0  
gb|EMJ20170.1| hypothetical protein PRUPE_ppa003822mg [Prunus pe...   660   0.0  
ref|XP_006445447.1| hypothetical protein CICLE_v10019658mg [Citr...   653   0.0  
ref|XP_004307244.1| PREDICTED: pentatricopeptide repeat-containi...   650   0.0  
gb|EXB38379.1| hypothetical protein L484_008037 [Morus notabilis]     647   0.0  
ref|XP_004133941.1| PREDICTED: pentatricopeptide repeat-containi...   636   e-180
ref|XP_006418504.1| hypothetical protein EUTSA_v10007383mg [Eutr...   631   e-178
ref|XP_006306047.1| hypothetical protein CARUB_v10011354mg [Caps...   630   e-178
ref|XP_003530115.1| PREDICTED: pentatricopeptide repeat-containi...   622   e-175
ref|XP_002892022.1| pentatricopeptide repeat-containing protein ...   618   e-174
ref|NP_171717.2| pentatricopeptide repeat-containing protein [Ar...   617   e-174
ref|XP_003520417.1| PREDICTED: pentatricopeptide repeat-containi...   617   e-174
gb|AAM19786.1| At1g02150/T7I23.8 [Arabidopsis thaliana] gi|29028...   617   e-174

>ref|XP_006347992.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Solanum tuberosum]
          Length = 545

 Score =  973 bits (2516), Expect = 0.0
 Identities = 486/545 (89%), Positives = 501/545 (91%), Gaps = 4/545 (0%)
 Frame = +1

Query: 64   MLLQPTTAIKPPHHKIETHVXXXXXXXXXXXXXXGFCNLSGLT----CPKNHSVIITCSS 231
            MLLQPTT +KPPH K E +V              GFCNL G T    C KNH  +I+CSS
Sbjct: 1    MLLQPTTTVKPPHQKTENYVSFSSSLSYSLSFPSGFCNLGGFTKPLMCSKNHHSVISCSS 60

Query: 232  ISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWELSRVI 411
              QVHSYGT+DYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWELSRVI
Sbjct: 61   TPQVHSYGTVDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWELSRVI 120

Query: 412  KELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIKLPDTL 591
            KELRKFRRYKLA EVYEWMNNR ERFRLTTSDTAIQLDLIAKVHG+SSAEEYF KLPDTL
Sbjct: 121  KELRKFRRYKLAFEVYEWMNNRPERFRLTTSDTAIQLDLIAKVHGISSAEEYFEKLPDTL 180

Query: 592  KDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYNKVELV 771
            KDKRIYGSLLNAFVR+RKKEQAESL+DKMRNRGYTDHALPFNVMMTLYMNLKDYNKVE V
Sbjct: 181  KDKRIYGSLLNAFVRSRKKEQAESLLDKMRNRGYTDHALPFNVMMTLYMNLKDYNKVESV 240

Query: 772  VSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTMATMYI 951
            VSEMKEK+IPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDIN NWTTFSTMATMYI
Sbjct: 241  VSEMKEKKIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINPNWTTFSTMATMYI 300

Query: 952  KLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFPTIPNL 1131
            KLG+LKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP IPNL
Sbjct: 301  KLGELKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFPNIPNL 360

Query: 1132 GYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASAFFDQM 1311
            GYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGF DKASAFFDQM
Sbjct: 361  GYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFVDKASAFFDQM 420

Query: 1312 VEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSILRLCEQ 1491
            + AGGKPNSMT EILAEGHIR+RRISEALSCL DAVST GSKSWRPKPATVSSILRLCEQ
Sbjct: 421  IGAGGKPNSMTCEILAEGHIRDRRISEALSCLKDAVSTEGSKSWRPKPATVSSILRLCEQ 480

Query: 1492 EEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDNDEGSEILLNQ 1671
            E+DTQ+KEAL EVLKQVGCLDDEKYMSYIPLSNGT TSS PEIEKDTSDN EGS+ILLNQ
Sbjct: 481  EDDTQNKEALLEVLKQVGCLDDEKYMSYIPLSNGTITSSEPEIEKDTSDNGEGSDILLNQ 540

Query: 1672 LQESL 1686
            LQESL
Sbjct: 541  LQESL 545


>ref|XP_004229730.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Solanum lycopersicum]
          Length = 545

 Score =  966 bits (2497), Expect = 0.0
 Identities = 481/545 (88%), Positives = 501/545 (91%), Gaps = 4/545 (0%)
 Frame = +1

Query: 64   MLLQPTTAIKPPHHKIETHVXXXXXXXXXXXXXXGFCNLSGLT----CPKNHSVIITCSS 231
            MLLQPTT +KPPH K E +V              GFCNL G T    C KNH  +I+CSS
Sbjct: 1    MLLQPTTTVKPPHQKTEKYVSFSSSLSYSLSFPSGFCNLGGFTKPLMCSKNHHSVISCSS 60

Query: 232  ISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWELSRVI 411
             SQVHSYGT+DYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWELSRVI
Sbjct: 61   TSQVHSYGTVDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWELSRVI 120

Query: 412  KELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIKLPDTL 591
            KELRKFRRYKLA EVYEWMNNR ERFRLTTSDTAIQLDLIAKVHG+SSAEEYF KLPDTL
Sbjct: 121  KELRKFRRYKLAFEVYEWMNNRPERFRLTTSDTAIQLDLIAKVHGISSAEEYFDKLPDTL 180

Query: 592  KDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYNKVELV 771
            KDKRIYGSLLNAFVR+RKKEQAESL+DKMRNRGYTDHALPFNVMMTLYMNLKDY+KVE V
Sbjct: 181  KDKRIYGSLLNAFVRSRKKEQAESLLDKMRNRGYTDHALPFNVMMTLYMNLKDYDKVESV 240

Query: 772  VSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTMATMYI 951
            VSEMKEK+IPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDIN NWTTFSTMATMYI
Sbjct: 241  VSEMKEKRIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINPNWTTFSTMATMYI 300

Query: 952  KLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFPTIPNL 1131
            KLGQ+KKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKE+VLRIWKTYQSQFP IPNL
Sbjct: 301  KLGQMKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEDVLRIWKTYQSQFPNIPNL 360

Query: 1132 GYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASAFFDQM 1311
            GYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGF DKASAFFDQM
Sbjct: 361  GYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFVDKASAFFDQM 420

Query: 1312 VEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSILRLCEQ 1491
            + AGGKPNSMT EILAEGHIR+RRISEALSCL DAVS+ GSKSWRPKPATVSSILRLCEQ
Sbjct: 421  IGAGGKPNSMTCEILAEGHIRDRRISEALSCLKDAVSSEGSKSWRPKPATVSSILRLCEQ 480

Query: 1492 EEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDNDEGSEILLNQ 1671
            E+D Q+KE L EVLKQVGCLDDEKYMSYIPLSNG++TSS  EIEKDTSDNDEGS+ILLNQ
Sbjct: 481  EDDIQNKEVLLEVLKQVGCLDDEKYMSYIPLSNGSFTSSEREIEKDTSDNDEGSDILLNQ 540

Query: 1672 LQESL 1686
            LQESL
Sbjct: 541  LQESL 545


>ref|XP_002270492.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150
            [Vitis vinifera]
          Length = 527

 Score =  669 bits (1725), Expect = 0.0
 Identities = 332/504 (65%), Positives = 400/504 (79%), Gaps = 8/504 (1%)
 Frame = +1

Query: 199  KNHSVIITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGK 378
            + HS  ITCS ISQ+HSYGT+DYERRP+VKWNA+Y+RIS+ + PE GS SVLNQWENEGK
Sbjct: 30   RKHS--ITCS-ISQIHSYGTVDYERRPLVKWNAVYRRISLMENPEMGSASVLNQWENEGK 86

Query: 379  KVTKWELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSA 558
            ++TKWEL RV+KELRKF+R+K+ALEVYEWMNNR ERFRL++SD AIQLDLIAKV GVSSA
Sbjct: 87   RLTKWELCRVVKELRKFKRFKMALEVYEWMNNRGERFRLSSSDAAIQLDLIAKVCGVSSA 146

Query: 559  EEYFIKLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYM 738
            E+YF +LPDTLKDKRIYG+LLNA+V+A+ +++AE LI+K+RN+GY    LPFNVMMTLYM
Sbjct: 147  EDYFSRLPDTLKDKRIYGALLNAYVQAKMRDKAEILIEKLRNKGYATTPLPFNVMMTLYM 206

Query: 739  NLKDYNKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNW 918
            NLK+ +KV+ ++SEM  K I LDIYSYNIWLSSC S    E+ME+V EQM L+  IN NW
Sbjct: 207  NLKELDKVQSMISEMMNKNIQLDIYSYNIWLSSCEST---ERMEQVFEQMKLERTINPNW 263

Query: 919  TTFSTMATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKT 1098
            TTFSTMATMYIKLGQ +KAE+ LK VESRIT RDR+PYHYLISLYGS G K EV R W  
Sbjct: 264  TTFSTMATMYIKLGQFEKAEECLKKVESRITNRDRMPYHYLISLYGSTGNKAEVYRAWNI 323

Query: 1099 YQSQFPTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGF 1278
            Y+S+FP IPNLGYH++ISSLVR+ D+EGAEKIY+EWL VK  YDPRIGNLLLG YV++GF
Sbjct: 324  YKSKFPNIPNLGYHALISSLVRVGDLEGAEKIYEEWLSVKSSYDPRIGNLLLGCYVKEGF 383

Query: 1279 ADKASAFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPA 1458
             +KA  F D M+EAGGKPNS T EILAEG+   ++IS+ALSC   AV   GS  W+PKP 
Sbjct: 384  LEKAEGFLDHMIEAGGKPNSTTWEILAEGNTGVKKISDALSCFKRAVLAEGSNGWKPKPV 443

Query: 1459 TVSSILRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTS- 1635
             VS+ L LCE+E DT +KEAL  +L+Q+GCL+DE Y S   L  G+ T +    EKD + 
Sbjct: 444  NVSAFLDLCEEEADTATKEALMGLLRQMGCLEDEPYASLFGLHTGSVTGNELSNEKDRTG 503

Query: 1636 -------DNDEGSEILLNQLQESL 1686
                   D D+G+E+LLNQ Q  L
Sbjct: 504  ADKDIDEDEDDGAEMLLNQFQSGL 527


>ref|XP_002301239.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550344984|gb|EEE80512.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 539

 Score =  666 bits (1719), Expect = 0.0
 Identities = 323/497 (64%), Positives = 405/497 (81%), Gaps = 6/497 (1%)
 Frame = +1

Query: 214  IITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKW 393
            +ITCS ISQ+H+YGT+DYERRP++KWNAIY+RIS+ + PE GS SVLNQWEN+GK++TKW
Sbjct: 44   VITCS-ISQIHNYGTVDYERRPMMKWNAIYRRISLMENPELGSGSVLNQWENDGKRLTKW 102

Query: 394  ELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFI 573
            EL RV+KELRK++RY+ ALEVY+WMNNR ERF L+ SD AIQLDLIAKV GVSSAE++F+
Sbjct: 103  ELCRVVKELRKYKRYQQALEVYDWMNNRQERFGLSPSDAAIQLDLIAKVRGVSSAEDFFL 162

Query: 574  KLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDY 753
            +LP+T KD+RIYG+LLNA+VR R +E+AESLID+MR + Y  HALP+NVMMTLYMN+ +Y
Sbjct: 163  RLPNTFKDRRIYGALLNAYVRNRMREKAESLIDEMRGKDYVTHALPYNVMMTLYMNINEY 222

Query: 754  NKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFST 933
            +KV+L++SEM EK I LDIYSYNIWLSSCG QGS +KME+V EQM  D  IN NWTTFST
Sbjct: 223  DKVDLIISEMNEKNIKLDIYSYNIWLSSCGLQGSADKMEQVFEQMKSDGSINPNWTTFST 282

Query: 934  MATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQF 1113
            MATMYIK+G+ +KAED L+ VESRITGRDRIPYHYL+SLYG++G KEEV R+W  Y+S F
Sbjct: 283  MATMYIKMGKFEKAEDCLRRVESRITGRDRIPYHYLLSLYGNVGNKEEVYRVWNIYKSIF 342

Query: 1114 PTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKAS 1293
            P+IPNLGYH++ISSLVR+DDIEGAEKIY+EWL +K  YDPRI NL +  +V +G  DKA 
Sbjct: 343  PSIPNLGYHAMISSLVRMDDIEGAEKIYEEWLSIKTSYDPRIANLFMAAFVYQGNLDKAE 402

Query: 1294 AFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSI 1473
            +FFD M+E GGKPNS + EILA+GHI ERR SEALSCL +A +T GSKSW+P PA VSS 
Sbjct: 403  SFFDHMLEEGGKPNSHSWEILAQGHISERRTSEALSCLKEAFATPGSKSWKPNPANVSSF 462

Query: 1474 LRLCEQEEDTQSKEALFEVLKQVGCLDDEKY--MSYIPLSNGTYTS----SGPEIEKDTS 1635
             +LCE+E D  SKEAL   L+Q G L D+ Y  +  +P++    ++    +  +I+ + +
Sbjct: 463  FKLCEEEVDMASKEALASFLRQSGHLKDKAYALLLGMPVTGDELSTKEERTEDQIDNEEN 522

Query: 1636 DNDEGSEILLNQLQESL 1686
            D D GSE+L++QLQ SL
Sbjct: 523  DGDNGSEMLVSQLQGSL 539


>ref|XP_002327026.1| predicted protein [Populus trichocarpa]
          Length = 539

 Score =  665 bits (1716), Expect = 0.0
 Identities = 321/497 (64%), Positives = 403/497 (81%), Gaps = 6/497 (1%)
 Frame = +1

Query: 214  IITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKW 393
            +I CS ISQVH+YGT+DYERRP++KWN IY+RIS+ + PE GS SVLN+WENEGK++TKW
Sbjct: 44   VIICS-ISQVHNYGTVDYERRPMIKWNGIYRRISLMENPELGSGSVLNRWENEGKRLTKW 102

Query: 394  ELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFI 573
            EL RV+KELRK++RY+ ALEVY+WM NR ERFRL+ SD AIQLDLIAKV GVS+AE++F+
Sbjct: 103  ELCRVVKELRKYKRYQQALEVYDWMKNRQERFRLSPSDAAIQLDLIAKVRGVSTAEDFFL 162

Query: 574  KLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDY 753
             LP+T KD+R+YG+LLNA+V+ R +E+AE+L D+MR++GY  HALPFNV MTLYMN+K+Y
Sbjct: 163  SLPNTFKDRRVYGALLNAYVQNRMREKAETLFDEMRDKGYVTHALPFNVTMTLYMNIKEY 222

Query: 754  NKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFST 933
            +KV+L++SEM EK I LDIYSYNIWLSSCGSQGS +KME+V EQM  D  IN NWTTFST
Sbjct: 223  DKVDLMISEMNEKNIKLDIYSYNIWLSSCGSQGSADKMEQVYEQMKSDRSINPNWTTFST 282

Query: 934  MATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQF 1113
            MATMYIK+GQ +KAED L+ VESRITGRDRIPYHYL+SLYG++G KEEV R+W  Y+S F
Sbjct: 283  MATMYIKMGQFEKAEDCLRRVESRITGRDRIPYHYLLSLYGNVGNKEEVYRVWNIYKSIF 342

Query: 1114 PTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKAS 1293
            P+IPNLGYH++ISSLVRLDDIEGAEKIY+EWL +K  YDPRI NL +  YV +G  D+A 
Sbjct: 343  PSIPNLGYHAIISSLVRLDDIEGAEKIYEEWLSIKTSYDPRIANLFIAAYVYQGNLDEAK 402

Query: 1294 AFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSI 1473
            +FFD M+E GGKPNS T EILA+GHI ERR SEALSCL +A  T GSKSW+P PA V+S 
Sbjct: 403  SFFDHMLEDGGKPNSNTWEILAQGHISERRTSEALSCLKEAFVTPGSKSWKPNPANVTSF 462

Query: 1474 LRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYI--PLSNGTYTS----SGPEIEKDTS 1635
             +LCE+E D  +KEAL   L+Q G L D+ Y S +  P++    ++    +G +I+ +  
Sbjct: 463  FKLCEEEADMANKEALEGFLRQSGHLKDKAYASLLGMPVTGDELSTKEDGTGDQIDNEED 522

Query: 1636 DNDEGSEILLNQLQESL 1686
            D D+G+E+L++ LQ SL
Sbjct: 523  DEDDGAEMLVSHLQGSL 539


>ref|XP_006375170.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550323489|gb|ERP52967.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 539

 Score =  664 bits (1712), Expect = 0.0
 Identities = 320/497 (64%), Positives = 403/497 (81%), Gaps = 6/497 (1%)
 Frame = +1

Query: 214  IITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKW 393
            +I CS ISQVH+YGT+DYERRP++KWN IY+RIS+ + PE GS SVLN+WENEGK++TKW
Sbjct: 44   VIICS-ISQVHNYGTVDYERRPMIKWNGIYRRISLMENPELGSGSVLNRWENEGKRLTKW 102

Query: 394  ELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFI 573
            EL RV+KELRK++RY+ ALEVY+WM NR ERFRL+ SD AIQLDLIAKV GVS+AE++F+
Sbjct: 103  ELCRVVKELRKYKRYQQALEVYDWMKNRQERFRLSPSDAAIQLDLIAKVRGVSTAEDFFL 162

Query: 574  KLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDY 753
             LP+T KD+R+YG+LLNA+V+ R +E+AE+L D+MR++GY  HALPFNV MTLYMN+K+Y
Sbjct: 163  SLPNTFKDRRVYGALLNAYVQNRMREKAETLFDEMRDKGYVTHALPFNVTMTLYMNIKEY 222

Query: 754  NKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFST 933
            +KV+L++SEM EK I LDIYSYNIWLSSCGSQGS +KME+V EQM  D  IN NWTTFST
Sbjct: 223  DKVDLMISEMNEKNIKLDIYSYNIWLSSCGSQGSADKMEQVYEQMKSDRSINPNWTTFST 282

Query: 934  MATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQF 1113
            MATMYIK+GQ +KAED L+ VESRITGRDRIPYHYL+SLYG++G KEEV R+W  Y+S F
Sbjct: 283  MATMYIKMGQFEKAEDCLRRVESRITGRDRIPYHYLLSLYGNVGNKEEVYRVWNIYKSIF 342

Query: 1114 PTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKAS 1293
            P+IPNLGYH++ISSLVRLDDIEGAEKI++EWL +K  YDPRI NL +  YV +G  D+A 
Sbjct: 343  PSIPNLGYHAIISSLVRLDDIEGAEKIFEEWLSIKTSYDPRIANLFIAAYVYQGNLDEAK 402

Query: 1294 AFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSI 1473
            +FFD M+E GGKPNS T EILA+GHI ERR SEALSCL +A  T GSKSW+P PA V+S 
Sbjct: 403  SFFDHMLEDGGKPNSNTWEILAQGHISERRTSEALSCLKEAFVTPGSKSWKPNPANVTSF 462

Query: 1474 LRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYI--PLSNGTYTS----SGPEIEKDTS 1635
             +LCE+E D  +KEAL   L+Q G L D+ Y S +  P++    ++    +G +I+ +  
Sbjct: 463  FKLCEEEADMANKEALEGFLRQSGHLKDKAYASLLGMPVTGDELSTKEDRTGDQIDNEED 522

Query: 1636 DNDEGSEILLNQLQESL 1686
            D D+G+E+L++ LQ SL
Sbjct: 523  DEDDGAEMLVSHLQGSL 539


>ref|XP_002526471.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223534146|gb|EEF35862.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 533

 Score =  663 bits (1711), Expect = 0.0
 Identities = 325/490 (66%), Positives = 401/490 (81%), Gaps = 3/490 (0%)
 Frame = +1

Query: 217  ITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWE 396
            ITCS IS+VHSYGT+DYERRP++KWN++Y+RIS+ + PE G+ +VLN+ E +GKK+TKWE
Sbjct: 45   ITCS-ISKVHSYGTVDYERRPMIKWNSVYRRISLMEKPELGAATVLNEMEKDGKKLTKWE 103

Query: 397  LSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIK 576
            L RV+KELRK++R+K ALEVY+WMNNR ERFRL+ SD AIQLDL+AKV GVSSAE+YF++
Sbjct: 104  LCRVVKELRKYKRHKQALEVYDWMNNREERFRLSASDAAIQLDLVAKVRGVSSAEDYFMR 163

Query: 577  LPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYN 756
            L D +KD+R+YG+LLN++V+AR +E+AESLI+KMR + YT HALPFNVMMTLYMNLK+Y+
Sbjct: 164  LSDNVKDRRVYGALLNSYVKARMREKAESLIEKMRKKDYTTHALPFNVMMTLYMNLKEYD 223

Query: 757  KVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTM 936
            KV++++SEM  K I LDIYSYNIWLSS GSQGSIE+ME+V EQM LD+ IN NWTTFSTM
Sbjct: 224  KVDMMISEMMAKNIRLDIYSYNIWLSSRGSQGSIERMEEVYEQMKLDSTINPNWTTFSTM 283

Query: 937  ATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP 1116
            ATMYIK+GQL+KAED L+ VESRITGRDRIPYHYL+SLYG++G KEE+ R+W  Y+S F 
Sbjct: 284  ATMYIKMGQLEKAEDCLRRVESRITGRDRIPYHYLLSLYGNVGNKEEIYRVWNIYKSIFA 343

Query: 1117 TIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASA 1296
            TIPNLGYH++ISSLVR+DDIEGAEKIY+EWLPVK  YDPRIGNLL+G+YVR G  DKA +
Sbjct: 344  TIPNLGYHAIISSLVRMDDIEGAEKIYEEWLPVKSSYDPRIGNLLMGWYVRGGNLDKAES 403

Query: 1297 FFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSIL 1476
            FFD M+E GGKPNS T EILA+GH RE+RISEALSC  +A    GSKSW+PKP  +SS  
Sbjct: 404  FFDHMMEVGGKPNSSTWEILADGHTREKRISEALSCFKEAFLAQGSKSWKPKPVIISSFF 463

Query: 1477 RLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKD-TSDND--E 1647
            +LCE+E D  S   L ++L Q G L+D+ Y S I     +  S+    EKD T D +  E
Sbjct: 464  KLCEEEADMASTGVLEDLLAQSGYLEDKTYASLI---GSSVPSNELSTEKDRTGDRNEVE 520

Query: 1648 GSEILLNQLQ 1677
             +E  LNQLQ
Sbjct: 521  ENETFLNQLQ 530


>gb|EOX96514.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1
            [Theobroma cacao]
          Length = 549

 Score =  661 bits (1706), Expect = 0.0
 Identities = 320/491 (65%), Positives = 395/491 (80%), Gaps = 10/491 (2%)
 Frame = +1

Query: 217  ITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWE 396
            +TCS ISQ+HSYGT+DYERRP++KWNAIYK+IS+ + PE GS SVLN+WE  G+K+TKWE
Sbjct: 45   VTCS-ISQIHSYGTVDYERRPMIKWNAIYKKISLMENPELGSASVLNEWEKGGRKLTKWE 103

Query: 397  LSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIK 576
            L RV+KELRK++RYK ALEVY+WMNNR ERFRL+ SD AIQLDLIAKV GVSSAE++F++
Sbjct: 104  LCRVVKELRKYKRYKQALEVYDWMNNRGERFRLSASDAAIQLDLIAKVRGVSSAEDFFVQ 163

Query: 577  LPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYN 756
            LPDT+KDKRIYG+LLNA+VRA+ +++AE+LID MR +GY  H LPFNVMMTLYMNLK+Y+
Sbjct: 164  LPDTMKDKRIYGALLNAYVRAKMRDKAETLIDNMRGKGYAMHPLPFNVMMTLYMNLKEYD 223

Query: 757  KVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTM 936
            KVE +VSEM EK I LDIYSYNIWLSSCGSQGS+EKME+V EQM  D  IN NWTTFSTM
Sbjct: 224  KVESMVSEMMEKNIRLDIYSYNIWLSSCGSQGSVEKMEEVYEQMKQDQSINPNWTTFSTM 283

Query: 937  ATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP 1116
            ATMYIK+G  +KAE+ L++VESRITGRDRIPYHYLISLYG +G +EEV R+WK Y+S FP
Sbjct: 284  ATMYIKMGLTEKAEECLRNVESRITGRDRIPYHYLISLYGGVGNREEVYRVWKVYKSIFP 343

Query: 1117 TIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASA 1296
            +IPNLG+H+VISSLVR  DI+GAE+IY+EWL VK  YDPRI NLL+G+YV++G  DKA +
Sbjct: 344  SIPNLGFHAVISSLVRAGDIQGAERIYEEWLTVKTSYDPRIANLLMGWYVKEGNLDKAES 403

Query: 1297 FFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSIL 1476
             F  + E GGKPNS + EILAEGHI E+RI +ALSCL DA +T GS+ WRPKP +VS+  
Sbjct: 404  LFSHIAEVGGKPNSSSWEILAEGHILEKRIPDALSCLKDAFATEGSRGWRPKPTSVSAFF 463

Query: 1477 RLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTS------- 1635
             LCE++ D  S+E    +L+Q GCL +E Y S I LS    + S  E+ +D +       
Sbjct: 464  NLCEEKVDMASREVFVGLLRQSGCLKNEAYASLIGLSEEALSES--ELPRDKNRKSSYSS 521

Query: 1636 ---DNDEGSEI 1659
               + D+GSE+
Sbjct: 522  SDENQDDGSEV 532


>gb|EMJ20170.1| hypothetical protein PRUPE_ppa003822mg [Prunus persica]
          Length = 546

 Score =  660 bits (1702), Expect = 0.0
 Identities = 322/497 (64%), Positives = 399/497 (80%), Gaps = 10/497 (2%)
 Frame = +1

Query: 217  ITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWE 396
            I+CS ISQVH+YGT+DYERRP+VKWNAIY++IS+ D PE  S  VLNQWE EG+K+TKWE
Sbjct: 47   ISCS-ISQVHNYGTVDYERRPMVKWNAIYRKISLTDDPEVRSADVLNQWEKEGRKLTKWE 105

Query: 397  LSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIK 576
            L RV+KELRK++RY  ALEVY+WM+NR ERFR++TSD AIQLDL+AKV GV+SAE YF+ 
Sbjct: 106  LCRVVKELRKYKRYDRALEVYDWMSNRGERFRISTSDAAIQLDLVAKVRGVASAENYFLS 165

Query: 577  LPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYN 756
            LPDTLKD+RIYG+LLNA+VR R KE+AESL+DKMR++G+   +LPFNVMMTLYMNLK+Y+
Sbjct: 166  LPDTLKDRRIYGALLNAYVRTRMKEKAESLLDKMRSKGHALQSLPFNVMMTLYMNLKEYD 225

Query: 757  KVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTM 936
            KV+ ++SEM EK I LDIYSYNIWLSS GSQGS E+ME+V EQM LD  +N NWTTFSTM
Sbjct: 226  KVDSIISEMMEKNIQLDIYSYNIWLSSRGSQGSEERMEQVFEQMKLDRTVNPNWTTFSTM 285

Query: 937  ATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP 1116
            ATMYIK+GQL+KAE  LK VESRITGRDRIPYHYL+SLYG++G KEE+ R+W  Y+S FP
Sbjct: 286  ATMYIKMGQLEKAEACLKKVESRITGRDRIPYHYLLSLYGNVGNKEELYRVWNIYKSSFP 345

Query: 1117 TIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASA 1296
            +IPNLGYH+++SSL+R+ D+EGAEKIY+EWL VK  YDPRI N+ + YY++ G  +KA +
Sbjct: 346  SIPNLGYHAIMSSLLRVGDVEGAEKIYEEWLTVKSTYDPRIANVFIAYYIKDGDFEKAQS 405

Query: 1297 FFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSIL 1476
            F+D MV+ GGKPNS T E LAEGHI E+RISEALSC  +A S  GSKSWRPKP  VS+ L
Sbjct: 406  FYDHMVDVGGKPNSTTWETLAEGHIEEQRISEALSCWKEAFSAEGSKSWRPKPVNVSAFL 465

Query: 1477 RLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKD----TSDND 1644
             LCEQE ++ SKE    +LKQ G L ++ Y S I L++   +     ++KD    T D+D
Sbjct: 466  ELCEQEANSVSKEFFMGLLKQSGQLKNKSYASLIGLADEDVSDDDLSLKKDRTNITKDDD 525

Query: 1645 ------EGSEILLNQLQ 1677
                  +GSE+LLN+LQ
Sbjct: 526  DEKEAGDGSELLLNELQ 542


>ref|XP_006445447.1| hypothetical protein CICLE_v10019658mg [Citrus clementina]
            gi|568819745|ref|XP_006464406.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g02150-like [Citrus sinensis]
            gi|557547709|gb|ESR58687.1| hypothetical protein
            CICLE_v10019658mg [Citrus clementina]
          Length = 535

 Score =  653 bits (1685), Expect = 0.0
 Identities = 313/488 (64%), Positives = 394/488 (80%)
 Frame = +1

Query: 214  IITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKW 393
            +I CS +SQ+HSYGT+D+ERRP++KWNAI++++S+ D P+ GS SVLN WE  G+ +TKW
Sbjct: 49   VIKCS-MSQIHSYGTVDFERRPMIKWNAIFRKLSLMDNPQLGSASVLNDWEKGGRSLTKW 107

Query: 394  ELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFI 573
            EL RV+KELRKFRRYK ALEVY+WMNNR ERFRL+ SD AIQLDLIAKVHGV+SAE++F+
Sbjct: 108  ELCRVVKELRKFRRYKHALEVYDWMNNRGERFRLSASDAAIQLDLIAKVHGVASAEDFFL 167

Query: 574  KLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDY 753
             LPDTLKD+R+YG+LLNA+VRAR +  AE LIDKMR++GY  H+LP+NVMMTLYM +K+Y
Sbjct: 168  SLPDTLKDRRVYGALLNAYVRARMRGNAELLIDKMRDKGYAVHSLPYNVMMTLYMKIKEY 227

Query: 754  NKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFST 933
            ++VE +VSEMKEK I LD+YSYNIWLSSCGSQGS EKME V E M +D  +N NWTTFST
Sbjct: 228  DEVESMVSEMKEKGIRLDVYSYNIWLSSCGSQGSTEKMEGVFELMKVDKAVNPNWTTFST 287

Query: 934  MATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQF 1113
            MATMYIK+GQ++KAE+SL+ VESRITGRDR+PYHYL+SLYGS+GKKEEV R+W  Y+S F
Sbjct: 288  MATMYIKMGQVEKAEESLRRVESRITGRDRVPYHYLLSLYGSVGKKEEVYRVWNLYRSVF 347

Query: 1114 PTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKAS 1293
            P + NLGYH++ISSL R+ DIEG EKI++EWL VK  YDPRI NL++ +YV++G  DKA 
Sbjct: 348  PGVTNLGYHAMISSLARIGDIEGMEKIFEEWLSVKSSYDPRIANLMMSWYVKEGNFDKAE 407

Query: 1294 AFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSI 1473
            AFF+ ++E GGKPNS + E LAEGHIRERRI EALSCL  A +  G+KSWRPKP  V + 
Sbjct: 408  AFFNSIIEEGGKPNSTSWETLAEGHIRERRILEALSCLKGAFAAEGAKSWRPKPVNVINF 467

Query: 1474 LRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDNDEGS 1653
             + CE+E D  SKEA   +L+Q G   ++ YMS I L++     +    +K+  D+DE S
Sbjct: 468  FKACEEESDMGSKEAFVALLRQPGYRKEKDYMSLIGLTDEAVAENN---KKNDEDSDEDS 524

Query: 1654 EILLNQLQ 1677
            E+LL+QLQ
Sbjct: 525  EMLLSQLQ 532


>ref|XP_004307244.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Fragaria vesca subsp. vesca]
          Length = 541

 Score =  650 bits (1676), Expect = 0.0
 Identities = 323/514 (62%), Positives = 399/514 (77%), Gaps = 11/514 (2%)
 Frame = +1

Query: 178  LSGLTCPK--NHSVIITCSSISQVHSYGTLDYERRPIVKWNAIYKRISM-NDGPERGSVS 348
            +  L+ P+  N+  +   SSISQVH+YGT+DYERRPIVKWNAIY++IS+  D PE  + S
Sbjct: 28   IPSLSLPRSINYQRLTISSSISQVHNYGTVDYERRPIVKWNAIYRKISLLADDPELNASS 87

Query: 349  VLNQWENEGKKVTKWELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDL 528
            VLNQWE EGKK++KWEL RV+KELRKF+RY  ALEVY+WM NRAERFR ++SD AIQLDL
Sbjct: 88   VLNQWEKEGKKLSKWELCRVVKELRKFKRYGRALEVYDWMINRAERFRFSSSDAAIQLDL 147

Query: 529  IAKVHGVSSAEEYFIKLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHAL 708
            + KV GVSSAE YF+ LPD LKDKRIYG+LLNA+VRA+ +E+AESL+DKMR++G+  H L
Sbjct: 148  VGKVRGVSSAENYFLSLPDNLKDKRIYGALLNAYVRAKMQEKAESLLDKMRSKGHALHPL 207

Query: 709  PFNVMMTLYMNLKDYNKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQM 888
            PFNVMMTLYMNLK+Y KVE ++SEM EK I LDIYSYNIWLSS GSQGS E+ME+V EQM
Sbjct: 208  PFNVMMTLYMNLKEYEKVESIISEMMEKNIQLDIYSYNIWLSSRGSQGSAERMEQVFEQM 267

Query: 889  NLDTDINLNWTTFSTMATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGK 1068
             LD  IN NWTTFSTMATMYIK+G  +KAE  LK VESRITGRDRIPYHYL+SLYG +G 
Sbjct: 268  KLDRTINPNWTTFSTMATMYIKMGLFEKAEACLKKVESRITGRDRIPYHYLLSLYGGVGN 327

Query: 1069 KEEVLRIWKTYQSQFPTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNL 1248
            K+E+ R+W  Y+S FP+IPNLGYH++I++L+R+ D+EGAEKI++EWL VK  YDPRI NL
Sbjct: 328  KDEIYRVWNVYKSSFPSIPNLGYHAIIAALIRVGDVEGAEKIFEEWLTVKPSYDPRIVNL 387

Query: 1249 LLGYYVRKGFADKASAFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTL 1428
             +  Y+ +G  DKA +FFD MVEAGGKPNS T E LAEGHI E+RISEALSC  +A    
Sbjct: 388  FIVSYIEEGDFDKAQSFFDNMVEAGGKPNSSTWEALAEGHIEEKRISEALSCWKEAFMAE 447

Query: 1429 GSKSWRPKPATVSSILRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSS 1608
            GSKSWRPKP  V++    CEQE D +SKE    +L+Q G L ++ Y   + LS+   + +
Sbjct: 448  GSKSWRPKPVNVTTFYEFCEQEGDLRSKEIFLGLLRQSGQLKNKSYALLVGLSDEDSSDN 507

Query: 1609 GPEIEKDT-SDN-------DEGSEILLNQLQESL 1686
               +EKD+ +DN       D+GS++LLNQL  +L
Sbjct: 508  DISLEKDSINDNQDGDEKSDDGSDMLLNQLHSTL 541


>gb|EXB38379.1| hypothetical protein L484_008037 [Morus notabilis]
          Length = 546

 Score =  647 bits (1668), Expect = 0.0
 Identities = 318/508 (62%), Positives = 394/508 (77%), Gaps = 11/508 (2%)
 Frame = +1

Query: 193  CPKNHS----VIITCS-SISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLN 357
            C  NH+    + I CS S SQ+HSYGT+DYERRP+VKWNAIYKRIS+ + PE GS +VL+
Sbjct: 38   CSVNHNRRLRLGIFCSISQSQIHSYGTVDYERRPMVKWNAIYKRISLMEKPELGSGTVLS 97

Query: 358  QWENEGKKVTKWELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAK 537
            QWE EG++++KWEL RV+KELRK++R+  ALEVY+WMNNR ERFRL++SD AIQLDLI K
Sbjct: 98   QWEREGRQLSKWELCRVVKELRKYKRFDRALEVYDWMNNRGERFRLSSSDAAIQLDLIGK 157

Query: 538  VHGVSSAEEYFIKLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFN 717
            V G+SSAE +F+ L DT KD+RIYG+LLNA+V+AR KE+AESL+D+MR +GY  H+LPFN
Sbjct: 158  VRGISSAENFFLSLSDTSKDRRIYGALLNAYVQARMKEKAESLLDRMRGKGYAIHSLPFN 217

Query: 718  VMMTLYMNLKDYNKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLD 897
            VMMTLYMNLK+Y KV+ +VSEM +K I LD+YSYNIWLS CGSQGS E ME+V EQM  D
Sbjct: 218  VMMTLYMNLKEYKKVDAMVSEMMDKNIQLDVYSYNIWLSCCGSQGSAEGMEQVFEQMQQD 277

Query: 898  TDINLNWTTFSTMATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEE 1077
              IN NWTTFSTMATMYIK+GQ +KAE+ L+ VESRITGRDRIPYHYL+SLYGS+G KEE
Sbjct: 278  KSINPNWTTFSTMATMYIKMGQFQKAEECLRKVESRITGRDRIPYHYLLSLYGSVGNKEE 337

Query: 1078 VLRIWKTYQSQFPTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLG 1257
            + R+WK Y++ FP+IPNLGYH++ISSL+R+ DIEGAE IY+EWLPVK  YDPRI NL + 
Sbjct: 338  IYRVWKVYKAIFPSIPNLGYHAIISSLLRIGDIEGAENIYNEWLPVKSSYDPRIANLFMS 397

Query: 1258 YYVRKGFADKASAFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSK 1437
            YYVR G  +KA++  D ++E GGKPNS T EILA GH  ERRISEALS   +A +  G+K
Sbjct: 398  YYVRNGNLEKATSLVDHIIEVGGKPNSATWEILAAGHTGERRISEALSYWKEAFAAEGAK 457

Query: 1438 SWRPKPATVSSILRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSG-- 1611
            +WRPKP  VS+ L LCEQE D + KE L  +L++ G L D+ Y S++  S+     +G  
Sbjct: 458  NWRPKPVNVSAFLDLCEQEADLECKEVLVGLLREAGYLKDQSYASFVGFSHEAINDNGIT 517

Query: 1612 ---PEIEKDTSDN-DEGSEILLNQLQES 1683
                  E D  +N D+ S IL NQLQ S
Sbjct: 518  SVDVSFENDNDENKDDESGILFNQLQGS 545


>ref|XP_004133941.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Cucumis sativus] gi|449525818|ref|XP_004169913.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g02150-like [Cucumis sativus]
          Length = 537

 Score =  636 bits (1641), Expect = e-180
 Identities = 307/477 (64%), Positives = 382/477 (80%)
 Frame = +1

Query: 217  ITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWE 396
            +TCS ISQVHSYGT+D+ERRP+ KWNAIY+RIS+ + PE GS SVLNQWENEGK +TKWE
Sbjct: 46   VTCS-ISQVHSYGTVDFERRPMFKWNAIYRRISLMENPELGSASVLNQWENEGKNITKWE 104

Query: 397  LSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIK 576
            LSRV+KELRK++R++ ALE+Y+WM+NR ERFRLTTSD AIQLDLI+KV G+ SAEEYF++
Sbjct: 105  LSRVVKELRKYKRFERALEIYDWMSNREERFRLTTSDAAIQLDLISKVRGIKSAEEYFLR 164

Query: 577  LPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYN 756
            LP+ LKD+RIYG+LLNA+ + R++E+AE+L++KMR +G+T H LPFNVMMTLYMN+K+Y 
Sbjct: 165  LPNHLKDRRIYGALLNAYAKGRQREKAENLLEKMRTKGFTTHPLPFNVMMTLYMNVKEYE 224

Query: 757  KVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTM 936
            KVE +VSEM E  I LDIYSYNIWLSSCG QGS EKME+V EQM  D  IN NWTTFSTM
Sbjct: 225  KVESLVSEMTENSIQLDIYSYNIWLSSCGLQGSTEKMEEVYEQMKQDRTINANWTTFSTM 284

Query: 937  ATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP 1116
            ATMYIK+G ++KAE+ L+ VESRI GRDRIPYHYLISLYGS+G KEE+ R+W  Y++ FP
Sbjct: 285  ATMYIKMGLMEKAEECLRRVESRIVGRDRIPYHYLISLYGSVGNKEEMYRVWNIYKNVFP 344

Query: 1117 TIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASA 1296
            TIPNLGYH++IS+L+R+ D+EGAEKIY+EWL VK  YDPRI NL +G+YV++G   KA +
Sbjct: 345  TIPNLGYHAIISALIRVGDVEGAEKIYEEWLTVKSTYDPRIANLFIGWYVKEGNTSKAES 404

Query: 1297 FFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSIL 1476
            FFD MVE GGKPNS T EIL + H +E R+S+AL+   +A S  GSKSWRPKP  V +  
Sbjct: 405  FFDHMVEVGGKPNSSTWEILVDRHTKEGRVSDALASWKEAFSAEGSKSWRPKPYNVLAYF 464

Query: 1477 RLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDNDE 1647
             LCE+E D  SKE L  +L+Q   L D+ Y S I L + T  ++    EK ++ NDE
Sbjct: 465  DLCEKEGDIASKEVLVGLLRQPKYLQDKTYASLIGLLDETIDNNEVS-EKGSNINDE 520


>ref|XP_006418504.1| hypothetical protein EUTSA_v10007383mg [Eutrema salsugineum]
            gi|557096275|gb|ESQ36857.1| hypothetical protein
            EUTSA_v10007383mg [Eutrema salsugineum]
          Length = 517

 Score =  631 bits (1628), Expect = e-178
 Identities = 301/490 (61%), Positives = 383/490 (78%)
 Frame = +1

Query: 217  ITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWE 396
            I CS ISQV+ YGT+DYERRPI++WNAIYK+IS+ + PE G+ SVLNQWE  G+K+TKWE
Sbjct: 41   IVCS-ISQVYGYGTVDYERRPIIQWNAIYKKISLMEKPELGAASVLNQWEKGGRKLTKWE 99

Query: 397  LSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIK 576
            L RV+KELRK++R   ALEVY+WMNNR ERFRL+ SD AIQLDLI KV G+S AEE+F+ 
Sbjct: 100  LCRVVKELRKYKRPNQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGISDAEEFFLS 159

Query: 577  LPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYN 756
            LP+  KD+R+YGSLLNA+VRA+ +E+AE+LIDKMR +GY  H LPFNVMMTLYMNL++Y+
Sbjct: 160  LPENFKDRRVYGSLLNAYVRAKSREKAEALIDKMREKGYALHPLPFNVMMTLYMNLREYD 219

Query: 757  KVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTM 936
            KV+ +V EMK+K I LDIYSYNIWLSSCGS GS+EKME+V +QM  D  IN NWTTFSTM
Sbjct: 220  KVDAMVYEMKQKDIRLDIYSYNIWLSSCGSHGSVEKMEQVYQQMKSDVSINPNWTTFSTM 279

Query: 937  ATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP 1116
            ATMYIK+G+ +KAED+L+ VE+RITGR+RIPYHYL+SLYGS+G K+E+ R+W  Y+S  P
Sbjct: 280  ATMYIKMGENEKAEDALRKVEARITGRNRIPYHYLLSLYGSVGNKKELYRVWNVYKSVVP 339

Query: 1117 TIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASA 1296
            +IPNLGYH+++SSLVR+ DI+GAEK+Y+EWLPVK  YDPRI NLL+  YV+    DKA  
Sbjct: 340  SIPNLGYHALVSSLVRMGDIQGAEKVYEEWLPVKSSYDPRIPNLLMNVYVKNDQLDKAEG 399

Query: 1297 FFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSIL 1476
             FD M+E GGKP+S T EILA GH R+R I+EAL+CL +A S  GS +WRPK   +S   
Sbjct: 400  LFDHMIEMGGKPSSSTWEILAHGHTRKRNITEALTCLKEAFSAEGSSNWRPKVFMLSGFF 459

Query: 1477 RLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDNDEGSE 1656
            +LCE+E D  SKEA+ E+L+Q G L D+ Y + I            +  +++    EG++
Sbjct: 460  KLCEEESDVASKEAVLELLRQSGHLQDKSYQALI------------DDAQESESESEGTD 507

Query: 1657 ILLNQLQESL 1686
            +LL QLQ+ L
Sbjct: 508  VLLTQLQDDL 517


>ref|XP_006306047.1| hypothetical protein CARUB_v10011354mg [Capsella rubella]
            gi|482574758|gb|EOA38945.1| hypothetical protein
            CARUB_v10011354mg [Capsella rubella]
          Length = 524

 Score =  630 bits (1624), Expect = e-178
 Identities = 305/496 (61%), Positives = 389/496 (78%), Gaps = 1/496 (0%)
 Frame = +1

Query: 196  PKNHSVIITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEG 375
            P   +  I CS ISQV+ YGT+DYERRPI++WNAIYK+IS+ + PE G+ SVLNQWE  G
Sbjct: 34   PSKKTAAIVCS-ISQVYGYGTVDYERRPIIQWNAIYKKISLMEKPELGAASVLNQWEKGG 92

Query: 376  KKVTKWELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSS 555
            +K+TKWEL RV+KELRK++R   ALEVY+WMNNR ERFRL+ SD AIQLDLI KV G+S 
Sbjct: 93   RKLTKWELCRVVKELRKYKRPNQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGISD 152

Query: 556  AEEYFIKLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLY 735
            AEE+F+ LP+T KD+R+YGSLLNA+VRA+ +E+AE+L++ MR +GY  H LPFNVMMTLY
Sbjct: 153  AEEFFLTLPETFKDRRVYGSLLNAYVRAKSREKAEALLNTMREKGYALHPLPFNVMMTLY 212

Query: 736  MNLKDYNKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLN 915
            MNL++Y+KV+ +V EMK+K I LDIYSYNIWLSSCGS GS+EKME V +QM  D  IN N
Sbjct: 213  MNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVAINPN 272

Query: 916  WTTFSTMATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWK 1095
            WTTFSTMATMYIK+G+++KAED+L+ VE+RITGR+RIPYHYL+SLYGS+G K+E+ R+W 
Sbjct: 273  WTTFSTMATMYIKMGEIEKAEDALRKVEARITGRNRIPYHYLLSLYGSVGNKKELYRVWN 332

Query: 1096 TYQSQFPTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKG 1275
             Y+S  P+IPNLGYH+++SSLVR+ DIEGAEK+Y+EWLPVK  YDPRI NLL+  YV+  
Sbjct: 333  VYKSVAPSIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNVYVKND 392

Query: 1276 FADKASAFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKP 1455
              +KA   FD MVE GGKP+S T EILA+GH R+R I EAL+CL  A S  GS +WRPK 
Sbjct: 393  QLEKAEGLFDHMVEMGGKPSSSTWEILADGHTRKRCIPEALTCLRKAFSAEGSSNWRPKV 452

Query: 1456 ATVSSILRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPL-SNGTYTSSGPEIEKDT 1632
              +S   +LCE+E D  SKEA+ E+L+Q G L D+ Y + I +  N T  +S    E D 
Sbjct: 453  LMLSGFFKLCEEESDITSKEAVLELLRQAGHLQDKSYQALIDVDENRTVNNS----ENDA 508

Query: 1633 SDNDEGSEILLNQLQE 1680
             ++D G+++LL+QLQ+
Sbjct: 509  HESD-GTDVLLSQLQD 523


>ref|XP_003530115.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Glycine max]
          Length = 546

 Score =  622 bits (1605), Expect = e-175
 Identities = 304/493 (61%), Positives = 387/493 (78%), Gaps = 1/493 (0%)
 Frame = +1

Query: 208  SVIITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVT 387
            SV+ TCS IS++HSYGT+DYERRPIV WN +Y+RIS+N  P+ GS  VLNQWENEG+ +T
Sbjct: 54   SVVTTCS-ISKIHSYGTVDYERRPIVGWNDVYRRISLNPNPQVGSAEVLNQWENEGRHLT 112

Query: 388  KWELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEY 567
            KWELSRV+KELRK++R++ ALEVY+WMNNR ERFR++ SD AIQLDLIAKV G+SSAE +
Sbjct: 113  KWELSRVVKELRKYKRFRRALEVYDWMNNRPERFRVSESDAAIQLDLIAKVRGLSSAEAF 172

Query: 568  FIKLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLK 747
            F+ L D LKDK+ YG+LLN +V +R KE+AESL D MR++GY  HALPFNVMMTLYMNL 
Sbjct: 173  FLSLEDKLKDKKTYGALLNVYVHSRSKEKAESLFDTMRSKGYVIHALPFNVMMTLYMNLN 232

Query: 748  DYNKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTF 927
            +Y KV+++ SEM EK I LDIY+YNIWLSSCGSQGS+EKME+V EQM  D  I  NW+TF
Sbjct: 233  EYAKVDILASEMMEKNIQLDIYTYNIWLSSCGSQGSVEKMEQVFEQMEKDPSIIPNWSTF 292

Query: 928  STMATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQS 1107
            STMA+MYI++ Q +KAE+ L+ VE RI GRDRIP+HYL+SLYGS+GKK+EV R+W TY+S
Sbjct: 293  STMASMYIRMDQNEKAEECLRKVEGRIKGRDRIPFHYLLSLYGSVGKKDEVCRVWNTYKS 352

Query: 1108 QFPTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADK 1287
             FP+IPNLGYH++ISSLV+LDDIE AEK+Y+EW+ VK  YDPRIGNLL+G+YV+KG  DK
Sbjct: 353  IFPSIPNLGYHAIISSLVKLDDIEVAEKLYEEWISVKSSYDPRIGNLLIGWYVKKGDTDK 412

Query: 1288 ASAFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDA-VSTLGSKSWRPKPATV 1464
            A +FF+QM+  G  PNS T EIL+EGHI ++RISEA+SCL +A ++  GSKSWRPKP+ +
Sbjct: 413  ALSFFEQMLNDGCIPNSNTWEILSEGHIADKRISEAMSCLKEAFMAAGGSKSWRPKPSYL 472

Query: 1465 SSILRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDND 1644
            S+ L LC++++D +S E L  +L+Q      + Y S I  S+        +   D  D++
Sbjct: 473  SAFLELCQEQDDMESAEVLIGLLRQSKFNKSKVYASLIGSSDELPKIDTADRTDDAVDSE 532

Query: 1645 EGSEILLNQLQES 1683
                 LLNQL  S
Sbjct: 533  NMDNDLLNQLGSS 545


>ref|XP_002892022.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297337864|gb|EFH68281.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 523

 Score =  618 bits (1593), Expect = e-174
 Identities = 300/490 (61%), Positives = 379/490 (77%)
 Frame = +1

Query: 217  ITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWE 396
            I CS ISQV+ YGT+DYERRPIV+WNAIYK+IS+ + PE G+ SVLNQWE  G+K+TKWE
Sbjct: 42   IVCS-ISQVYGYGTVDYERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKGGRKLTKWE 100

Query: 397  LSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIK 576
            L RV+KELRK++R   ALEVY+WMNNR ERFRL+ SD AIQLDLI KV G+S AE++F+ 
Sbjct: 101  LCRVVKELRKYKRPNQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGISDAEQFFLT 160

Query: 577  LPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYN 756
            LP+  KD+R+YGSLLNA+VRA+ +E+AE+L+  MR++GY  H LPFNVMMTLYMNL++Y+
Sbjct: 161  LPENFKDRRVYGSLLNAYVRAKSREKAEALLHTMRDKGYALHPLPFNVMMTLYMNLREYD 220

Query: 757  KVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTM 936
            KV+ +V EMK+K I LDIYSYNIWLSSCGS GS+EKME V +QM  D  IN NWTTFSTM
Sbjct: 221  KVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSINPNWTTFSTM 280

Query: 937  ATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP 1116
            ATMYIK+G+ +KAED+L+ VE+RITGR+RIPYHYL+SLYGS+G K+E+ R+W  Y+S  P
Sbjct: 281  ATMYIKMGETEKAEDALRKVEARITGRNRIPYHYLLSLYGSVGNKKELYRVWNVYKSVVP 340

Query: 1117 TIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASA 1296
            +IPNLGYH+++SSL R+ DIEGAEK+Y+EWLPVK  YDPRI NLL+  YV+    +KA  
Sbjct: 341  SIPNLGYHALVSSLARMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNVYVKNDQLEKAEG 400

Query: 1297 FFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSIL 1476
             FD MVE GGKP+S T EILA+GH R+R I EAL+CL  A S  GS +WRPK   +S   
Sbjct: 401  LFDHMVEMGGKPSSSTWEILADGHTRKRCIPEALTCLRKAFSAEGSSNWRPKVLMLSGFF 460

Query: 1477 RLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDNDEGSE 1656
            +LCE+E D  SKEA+ E+L+Q G L+D+ Y + I +     T +  EI+   +D      
Sbjct: 461  KLCEEESDVTSKEAVLELLRQSGHLEDKAYQALIDVDENR-TENNSEIDAHETD------ 513

Query: 1657 ILLNQLQESL 1686
             LL QLQ+ L
Sbjct: 514  ALLTQLQDDL 523


>ref|NP_171717.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806400|sp|Q8LPS6.2|PPR3_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g02150 gi|2317908|gb|AAC24372.1| Unknown protein
            [Arabidopsis thaliana] gi|332189272|gb|AEE27393.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 524

 Score =  617 bits (1592), Expect = e-174
 Identities = 301/490 (61%), Positives = 381/490 (77%)
 Frame = +1

Query: 217  ITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWE 396
            I CS ISQV+ YGT+DYERRPIV+WNAIYK+IS+ + PE G+ SVLNQWE  G+K+TKWE
Sbjct: 43   IVCS-ISQVYGYGTVDYERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWE 101

Query: 397  LSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIK 576
            L RV+KELRK++R   ALEVY+WMNNR ERFRL+ SD AIQLDLI KV G+  AEE+F++
Sbjct: 102  LCRVVKELRKYKRANQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQ 161

Query: 577  LPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYN 756
            LP+  KD+R+YGSLLNA+VRA+ +E+AE+L++ MR++GY  H LPFNVMMTLYMNL++Y+
Sbjct: 162  LPENFKDRRVYGSLLNAYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYD 221

Query: 757  KVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTM 936
            KV+ +V EMK+K I LDIYSYNIWLSSCGS GS+EKME V +QM  D  I  NWTTFSTM
Sbjct: 222  KVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTM 281

Query: 937  ATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP 1116
            ATMYIK+G+ +KAED+L+ VE+RITGR+RIPYHYL+SLYGSLG K+E+ R+W  Y+S  P
Sbjct: 282  ATMYIKMGETEKAEDALRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVP 341

Query: 1117 TIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASA 1296
            +IPNLGYH+++SSLVR+ DIEGAEK+Y+EWLPVK  YDPRI NLL+  YV+    + A  
Sbjct: 342  SIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEG 401

Query: 1297 FFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSIL 1476
             FD MVE GGKP+S T EILA GH R+R ISEAL+CL +A S  GS +WRPK   +S   
Sbjct: 402  LFDHMVEMGGKPSSSTWEILAVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFF 461

Query: 1477 RLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDNDEGSE 1656
            +LCE+E D  SKEA+ E+L+Q G L+D+ Y++ I +     T +  EI+   +D      
Sbjct: 462  KLCEEESDVTSKEAVLELLRQSGDLEDKSYLALIDVDENR-TVNNSEIDAHETD------ 514

Query: 1657 ILLNQLQESL 1686
             LL QLQ+ L
Sbjct: 515  ALLTQLQDDL 524


>ref|XP_003520417.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Glycine max]
          Length = 555

 Score =  617 bits (1591), Expect = e-174
 Identities = 306/506 (60%), Positives = 386/506 (76%), Gaps = 11/506 (2%)
 Frame = +1

Query: 190  TCPKNHSVIITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWEN 369
            TC      ++TCS IS +HSYGT+DYERRPIV+WN +Y+RIS+N  P+ GS  VLNQWEN
Sbjct: 47   TCFHPRLSVVTCS-ISNIHSYGTVDYERRPIVRWNDVYRRISLNQNPQVGSAEVLNQWEN 105

Query: 370  EGKKVTKWELSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGV 549
            EG+ +TKWELSRV+KELRK++R+  ALEVY+WMNNR ERFR++ SD AIQLDLIAKV GV
Sbjct: 106  EGRHLTKWELSRVVKELRKYKRFPRALEVYDWMNNRPERFRVSESDAAIQLDLIAKVRGV 165

Query: 550  SSAEEYFIKLPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMT 729
            SSAE +F+ L D LKDKR YG+LLN +V +R KE+AESL D MR++GY  HALP NVMMT
Sbjct: 166  SSAEAFFLSLEDKLKDKRTYGALLNVYVHSRSKEKAESLFDTMRSKGYVIHALPINVMMT 225

Query: 730  LYMNLKDYNKVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDIN 909
            LYMNL +Y KV+++ SEM EK I LDIY+YNIWLSSCGSQGS+EKME+V EQM  D  I 
Sbjct: 226  LYMNLNEYAKVDMLASEMMEKNIQLDIYTYNIWLSSCGSQGSVEKMEQVFEQMERDPTIV 285

Query: 910  LNWTTFSTMATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRI 1089
             NW+TFST+A+MYI++ Q +KAE  L+ VE RI GRDRIP+HYL+SLYGS+GKK+EV R+
Sbjct: 286  PNWSTFSTLASMYIRMNQNEKAEKCLRKVEGRIKGRDRIPFHYLLSLYGSVGKKDEVYRV 345

Query: 1090 WKTYQSQFPTIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVR 1269
            W TY+S FP IPNLGYH++ISSLV+LDDIEGAEK+Y+EW+ VK  YDPRIGNLL+G+YV+
Sbjct: 346  WNTYKSIFPRIPNLGYHAIISSLVKLDDIEGAEKLYEEWISVKSSYDPRIGNLLMGWYVK 405

Query: 1270 KGFADKASAFFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTL-GSKSWR 1446
            K   DKA +FF+Q+   G  PNS T EIL+EGHI ++RISEALSCL +A     GSKSWR
Sbjct: 406  KDDTDKALSFFEQISNDGCIPNSNTWEILSEGHIADKRISEALSCLKEAFMVAGGSKSWR 465

Query: 1447 PKPATVSSILRLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYI-----PLSNGTYTSSG 1611
            PKP+ +S+ L LC+++ D +S E L  +L+Q      + Y S I      + NG   S  
Sbjct: 466  PKPSYLSAFLELCQEQNDMESAEVLIGLLRQSKFSKIKVYASIIGSPDCTIDNGELQSKI 525

Query: 1612 PEIEK-----DTSDNDEGSEILLNQL 1674
               ++     D+ + D+ S++LLNQL
Sbjct: 526  DITDRTDDAVDSENMDDDSQMLLNQL 551


>gb|AAM19786.1| At1g02150/T7I23.8 [Arabidopsis thaliana] gi|29028736|gb|AAO64747.1|
            At1g02150/T7I23.8 [Arabidopsis thaliana]
          Length = 524

 Score =  617 bits (1590), Expect = e-174
 Identities = 300/490 (61%), Positives = 381/490 (77%)
 Frame = +1

Query: 217  ITCSSISQVHSYGTLDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWE 396
            I CS ISQV+ YGT+DYERRPIV+WNAIYK+IS+ + PE G+ SVLNQWE  G+K+TKWE
Sbjct: 43   IVCS-ISQVYGYGTVDYERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWE 101

Query: 397  LSRVIKELRKFRRYKLALEVYEWMNNRAERFRLTTSDTAIQLDLIAKVHGVSSAEEYFIK 576
            L RV+KELRK++R   A+EVY+WMNNR ERFRL+ SD AIQLDLI KV G+  AEE+F++
Sbjct: 102  LCRVVKELRKYKRANQAIEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQ 161

Query: 577  LPDTLKDKRIYGSLLNAFVRARKKEQAESLIDKMRNRGYTDHALPFNVMMTLYMNLKDYN 756
            LP+  KD+R+YGSLLNA+VRA+ +E+AE+L++ MR++GY  H LPFNVMMTLYMNL++Y+
Sbjct: 162  LPENFKDRRVYGSLLNAYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYD 221

Query: 757  KVELVVSEMKEKQIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINLNWTTFSTM 936
            KV+ +V EMK+K I LDIYSYNIWLSSCGS GS+EKME V +QM  D  I  NWTTFSTM
Sbjct: 222  KVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTM 281

Query: 937  ATMYIKLGQLKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFP 1116
            ATMYIK+G+ +KAED+L+ VE+RITGR+RIPYHYL+SLYGSLG K+E+ R+W  Y+S  P
Sbjct: 282  ATMYIKMGETEKAEDALRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVP 341

Query: 1117 TIPNLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFADKASA 1296
            +IPNLGYH+++SSLVR+ DIEGAEK+Y+EWLPVK  YDPRI NLL+  YV+    + A  
Sbjct: 342  SIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEG 401

Query: 1297 FFDQMVEAGGKPNSMTMEILAEGHIRERRISEALSCLNDAVSTLGSKSWRPKPATVSSIL 1476
             FD MVE GGKP+S T EILA GH R+R ISEAL+CL +A S  GS +WRPK   +S   
Sbjct: 402  LFDHMVEMGGKPSSSTWEILAVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFF 461

Query: 1477 RLCEQEEDTQSKEALFEVLKQVGCLDDEKYMSYIPLSNGTYTSSGPEIEKDTSDNDEGSE 1656
            +LCE+E D  SKEA+ E+L+Q G L+D+ Y++ I +     T +  EI+   +D      
Sbjct: 462  KLCEEESDVTSKEAVLELLRQSGDLEDKSYLALIDVDENR-TVNNSEIDAHETD------ 514

Query: 1657 ILLNQLQESL 1686
             LL QLQ+ L
Sbjct: 515  ALLTQLQDDL 524

