BLASTX nr result

ID: Cocculus23_contig00001470 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00001470
         (1710 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007052358.1| Tetratricopeptide repeat (TPR)-like superfam...   521   e-145
ref|XP_007052357.1| Tetratricopeptide repeat (TPR)-like superfam...   520   e-145
gb|EXB38379.1| hypothetical protein L484_008037 [Morus notabilis]     512   e-142
ref|XP_002526471.1| pentatricopeptide repeat-containing protein,...   511   e-142
ref|XP_006347992.1| PREDICTED: pentatricopeptide repeat-containi...   511   e-142
ref|XP_002270492.1| PREDICTED: pentatricopeptide repeat-containi...   511   e-142
ref|XP_004229730.1| PREDICTED: pentatricopeptide repeat-containi...   506   e-140
emb|CAN75355.1| hypothetical protein VITISV_002476 [Vitis vinifera]   506   e-140
ref|XP_007218971.1| hypothetical protein PRUPE_ppa003822mg [Prun...   505   e-140
ref|XP_004307244.1| PREDICTED: pentatricopeptide repeat-containi...   504   e-140
ref|XP_006445447.1| hypothetical protein CICLE_v10019658mg [Citr...   499   e-138
ref|XP_006375170.1| pentatricopeptide repeat-containing family p...   496   e-137
emb|CBI16683.3| unnamed protein product [Vitis vinifera]              493   e-137
ref|XP_006418504.1| hypothetical protein EUTSA_v10007383mg [Eutr...   493   e-136
ref|XP_006306047.1| hypothetical protein CARUB_v10011354mg [Caps...   492   e-136
ref|XP_002301239.2| pentatricopeptide repeat-containing family p...   491   e-136
gb|EYU40343.1| hypothetical protein MIMGU_mgv1a004109mg [Mimulus...   488   e-135
ref|XP_004133941.1| PREDICTED: pentatricopeptide repeat-containi...   487   e-135
ref|XP_002892022.1| pentatricopeptide repeat-containing protein ...   484   e-134
ref|NP_171717.2| pentatricopeptide repeat-containing protein [Ar...   480   e-133

>ref|XP_007052358.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 2
            [Theobroma cacao] gi|590724061|ref|XP_007052359.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 2 [Theobroma cacao] gi|508704619|gb|EOX96515.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 2 [Theobroma cacao] gi|508704620|gb|EOX96516.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 2 [Theobroma cacao]
          Length = 420

 Score =  521 bits (1342), Expect = e-145
 Identities = 248/416 (59%), Positives = 324/416 (77%), Gaps = 6/416 (1%)
 Frame = -1

Query: 1695 VYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLNAYV 1516
            VY+WM+ +G+RF+LSASDAAIQLDLI+KVRGV SAE++F +L  ++ DKR Y ALLNAYV
Sbjct: 2    VYDWMNNRGERFRLSASDAAIQLDLIAKVRGVSSAEDFFVQLPDTMKDKRIYGALLNAYV 61

Query: 1515 SAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIALDLY 1336
             AKM++KAE L++ MR KGYAMHPLP N+MMTLYM LKEY+KV S+VSEM+EKNI LD+Y
Sbjct: 62   RAKMRDKAETLIDNMRGKGYAMHPLPFNVMMTLYMNLKEYDKVESMVSEMMEKNIRLDIY 121

Query: 1335 SYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDSLQM 1156
            SYNIWL++CG+ GS+EKME   EQMKQD+++NPNWTT+STMATMY+ +G  +KA++ L+ 
Sbjct: 122  SYNIWLSSCGSQGSVEKMEEVYEQMKQDQSINPNWTTFSTMATMYIKMGLTEKAEECLRN 181

Query: 1155 VESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIRLDD 976
            VES++TGRDR+PYHYL+SLYG VG +EEVYR+W  YKS F SIPNLG+HA+I+SL+R  D
Sbjct: 182  VESRITGRDRIPYHYLISLYGGVGNREEVYRVWKVYKSIFPSIPNLGFHAVISSLVRAGD 241

Query: 975  IGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYTWEF 796
            I GAE+IYEEWL+ +T++DPR+ NLL+ WYV+ G L+KAE+L  ++ EVGGK NS +WE 
Sbjct: 242  IQGAERIYEEWLTVKTSYDPRIANLLMGWYVKEGNLDKAESLFSHIAEVGGKPNSSSWEI 301

Query: 795  LAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALVDVL 616
            LAEGHI EK+I  ALSC+K A   EG++ W+P P +VS FF+LCE++ D AS+E  V +L
Sbjct: 302  LAEGHILEKRIPDALSCLKDAFATEGSRGWRPKPTSVSAFFNLCEEKVDMASREVFVGLL 361

Query: 615  RQIGCFEKEAYMSQV----SAYGVDDLGLNSVDKDGIDVG--GNGDGTHILLNQLE 466
            RQ GC + EAY S +     A    +L  +   K           DG+ +L+NQL+
Sbjct: 362  RQSGCLKNEAYASLIGLSEEALSESELPRDKNRKSSYSSSDENQDDGSEVLINQLQ 417


>ref|XP_007052357.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1
            [Theobroma cacao] gi|508704618|gb|EOX96514.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 1 [Theobroma cacao]
          Length = 549

 Score =  520 bits (1340), Expect = e-145
 Identities = 242/380 (63%), Positives = 312/380 (82%)
 Frame = -1

Query: 1710 KLALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTAL 1531
            K AL VY+WM+ +G+RF+LSASDAAIQLDLI+KVRGV SAE++F +L  ++ DKR Y AL
Sbjct: 118  KQALEVYDWMNNRGERFRLSASDAAIQLDLIAKVRGVSSAEDFFVQLPDTMKDKRIYGAL 177

Query: 1530 LNAYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNI 1351
            LNAYV AKM++KAE L++ MR KGYAMHPLP N+MMTLYM LKEY+KV S+VSEM+EKNI
Sbjct: 178  LNAYVRAKMRDKAETLIDNMRGKGYAMHPLPFNVMMTLYMNLKEYDKVESMVSEMMEKNI 237

Query: 1350 ALDLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAK 1171
             LD+YSYNIWL++CG+ GS+EKME   EQMKQD+++NPNWTT+STMATMY+ +G  +KA+
Sbjct: 238  RLDIYSYNIWLSSCGSQGSVEKMEEVYEQMKQDQSINPNWTTFSTMATMYIKMGLTEKAE 297

Query: 1170 DSLQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASL 991
            + L+ VES++TGRDR+PYHYL+SLYG VG +EEVYR+W  YKS F SIPNLG+HA+I+SL
Sbjct: 298  ECLRNVESRITGRDRIPYHYLISLYGGVGNREEVYRVWKVYKSIFPSIPNLGFHAVISSL 357

Query: 990  IRLDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANS 811
            +R  DI GAE+IYEEWL+ +T++DPR+ NLL+ WYV+ G L+KAE+L  ++ EVGGK NS
Sbjct: 358  VRAGDIQGAERIYEEWLTVKTSYDPRIANLLMGWYVKEGNLDKAESLFSHIAEVGGKPNS 417

Query: 810  YTWEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEA 631
             +WE LAEGHI EK+I  ALSC+K A   EG++ W+P P +VS FF+LCE++ D AS+E 
Sbjct: 418  SSWEILAEGHILEKRIPDALSCLKDAFATEGSRGWRPKPTSVSAFFNLCEEKVDMASREV 477

Query: 630  LVDVLRQIGCFEKEAYMSQV 571
             V +LRQ GC + EAY S +
Sbjct: 478  FVGLLRQSGCLKNEAYASLI 497


>gb|EXB38379.1| hypothetical protein L484_008037 [Morus notabilis]
          Length = 546

 Score =  512 bits (1318), Expect = e-142
 Identities = 253/419 (60%), Positives = 322/419 (76%), Gaps = 4/419 (0%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM+ +G+RF+LS+SDAAIQLDLI KVRG+ SAE +F  LS +  D+R Y ALLN
Sbjct: 127  ALEVYDWMNNRGERFRLSSSDAAIQLDLIGKVRGISSAENFFLSLSDTSKDRRIYGALLN 186

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV A+MKEKAE L++ MR KGYA+H LP N+MMTLYM LKEY+KV ++VSEM++KNI L
Sbjct: 187  AYVQARMKEKAESLLDRMRGKGYAIHSLPFNVMMTLYMNLKEYKKVDAMVSEMMDKNIQL 246

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL+ CG+ GS E ME   EQM+QDK++NPNWTT+STMATMY+ +GQ QKA++ 
Sbjct: 247  DVYSYNIWLSCCGSQGSAEGMEQVFEQMQQDKSINPNWTTFSTMATMYIKMGQFQKAEEC 306

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VES++TGRDR+PYHYLLSLYGSVG KEE+YR+W  YK+ F SIPNLGYHAII+SL+R
Sbjct: 307  LRKVESRITGRDRIPYHYLLSLYGSVGNKEEIYRVWKVYKAIFPSIPNLGYHAIISSLLR 366

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            + DI GAE IY EWL  ++++DPR+ NL + +YVRNG LEKA +LVD++ EVGGK NS T
Sbjct: 367  IGDIEGAENIYNEWLPVKSSYDPRIANLFMSYYVRNGNLEKATSLVDHIIEVGGKPNSAT 426

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LA GH  E++I+ ALS  K+A  AEGAKNW+P P NVS F  LCEQE+D   KE LV
Sbjct: 427  WEILAAGHTGERRISEALSYWKEAFAAEGAKNWRPKPVNVSAFLDLCEQEADLECKEVLV 486

Query: 624  DVLRQIGCFEKEAYMSQV--SAYGVDDLGLNSVDK--DGIDVGGNGDGTHILLNQLERS 460
             +LR+ G  + ++Y S V  S   ++D G+ SVD   +  +     D + IL NQL+ S
Sbjct: 487  GLLREAGYLKDQSYASFVGFSHEAINDNGITSVDVSFENDNDENKDDESGILFNQLQGS 545


>ref|XP_002526471.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223534146|gb|EEF35862.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 533

 Score =  511 bits (1317), Expect = e-142
 Identities = 250/416 (60%), Positives = 323/416 (77%), Gaps = 1/416 (0%)
 Frame = -1

Query: 1710 KLALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTAL 1531
            K AL VY+WM+ + +RF+LSASDAAIQLDL++KVRGV SAE+YF RLS ++ D+R Y AL
Sbjct: 118  KQALEVYDWMNNREERFRLSASDAAIQLDLVAKVRGVSSAEDYFMRLSDNVKDRRVYGAL 177

Query: 1530 LNAYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNI 1351
            LN+YV A+M+EKAE L+E+MR K Y  H LP N+MMTLYM LKEY+KV  ++SEM+ KNI
Sbjct: 178  LNSYVKARMREKAESLIEKMRKKDYTTHALPFNVMMTLYMNLKEYDKVDMMISEMMAKNI 237

Query: 1350 ALDLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAK 1171
             LD+YSYNIWL++ G+ GSIE+ME   EQMK D T+NPNWTT+STMATMY+ +GQ +KA+
Sbjct: 238  RLDIYSYNIWLSSRGSQGSIERMEEVYEQMKLDSTINPNWTTFSTMATMYIKMGQLEKAE 297

Query: 1170 DSLQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASL 991
            D L+ VES++TGRDR+PYHYLLSLYG+VG KEE+YR+WN YKS F++IPNLGYHAII+SL
Sbjct: 298  DCLRRVESRITGRDRIPYHYLLSLYGNVGNKEEIYRVWNIYKSIFATIPNLGYHAIISSL 357

Query: 990  IRLDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANS 811
            +R+DDI GAEKIYEEWL  ++++DPR+ NLL+ WYVR G L+KAE+  D++ EVGGK NS
Sbjct: 358  VRMDDIEGAEKIYEEWLPVKSSYDPRIGNLLMGWYVRGGNLDKAESFFDHMMEVGGKPNS 417

Query: 810  YTWEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEA 631
             TWE LA+GH REK+I+ ALSC K+A LA+G+K+WKP P  +S FF LCE+E+D AS   
Sbjct: 418  STWEILADGHTREKRISEALSCFKEAFLAQGSKSWKPKPVIISSFFKLCEEEADMASTGV 477

Query: 630  LVDVLRQIGCFEKEAYMSQV-SAYGVDDLGLNSVDKDGIDVGGNGDGTHILLNQLE 466
            L D+L Q G  E + Y S + S+   ++L   S +KD        +     LNQL+
Sbjct: 478  LEDLLAQSGYLEDKTYASLIGSSVPSNEL---STEKDRTGDRNEVEENETFLNQLQ 530


>ref|XP_006347992.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Solanum tuberosum]
          Length = 545

 Score =  511 bits (1315), Expect = e-142
 Identities = 248/419 (59%), Positives = 321/419 (76%), Gaps = 1/419 (0%)
 Frame = -1

Query: 1710 KLALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTAL 1531
            KLA  VYEWM+ + +RF+L+ SD AIQLDLI+KV G+ SAEEYF +L  +L DKR Y +L
Sbjct: 130  KLAFEVYEWMNNRPERFRLTTSDTAIQLDLIAKVHGISSAEEYFEKLPDTLKDKRIYGSL 189

Query: 1530 LNAYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNI 1351
            LNA+V ++ KE+AE L+++MR +GY  H LP N+MMTLYM LK+Y KV S+VSEM EK I
Sbjct: 190  LNAFVRSRKKEQAESLLDKMRNRGYTDHALPFNVMMTLYMNLKDYNKVESVVSEMKEKKI 249

Query: 1350 ALDLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAK 1171
             LD+YSYNIWL++CG+ GSIEKME  LEQM  D  +NPNWTT+STMATMY+ LG+ +KA+
Sbjct: 250  PLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINPNWTTFSTMATMYIKLGELKKAE 309

Query: 1170 DSLQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASL 991
            DSL+ VES++TGRDR+PYHYL+SLYGS+G+KEEV RIW TY+S F +IPNLGYH++I+SL
Sbjct: 310  DSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFPNIPNLGYHSVISSL 369

Query: 990  IRLDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANS 811
            +RLDDI GAEKIY+EWL  +  +DPR+ NLLL +YVR G ++KA A  D +   GGK NS
Sbjct: 370  VRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFVDKASAFFDQMIGAGGKPNS 429

Query: 810  YTWEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEA 631
             T E LAEGHIR+++I+ ALSC+K A   EG+K+W+P P  VS    LCEQE DT +KEA
Sbjct: 430  MTCEILAEGHIRDRRISEALSCLKDAVSTEGSKSWRPKPATVSSILRLCEQEDDTQNKEA 489

Query: 630  LVDVLRQIGCFEKEAYMSQVS-AYGVDDLGLNSVDKDGIDVGGNGDGTHILLNQLERSL 457
            L++VL+Q+GC + E YMS +  + G        ++KD  D   NG+G+ ILLNQL+ SL
Sbjct: 490  LLEVLKQVGCLDDEKYMSYIPLSNGTITSSEPEIEKDTSD---NGEGSDILLNQLQESL 545


>ref|XP_002270492.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150
            [Vitis vinifera]
          Length = 527

 Score =  511 bits (1315), Expect = e-142
 Identities = 254/424 (59%), Positives = 320/424 (75%), Gaps = 6/424 (1%)
 Frame = -1

Query: 1710 KLALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTAL 1531
            K+AL VYEWM+ +G+RF+LS+SDAAIQLDLI+KV GV SAE+YFSRL  +L DKR Y AL
Sbjct: 107  KMALEVYEWMNNRGERFRLSSSDAAIQLDLIAKVCGVSSAEDYFSRLPDTLKDKRIYGAL 166

Query: 1530 LNAYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNI 1351
            LNAYV AKM++KAE L+E++R KGYA  PLP N+MMTLYM LKE +KV S++SEM+ KNI
Sbjct: 167  LNAYVQAKMRDKAEILIEKLRNKGYATTPLPFNVMMTLYMNLKELDKVQSMISEMMNKNI 226

Query: 1350 ALDLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAK 1171
             LD+YSYNIWL++C    S E+ME   EQMK ++T+NPNWTT+STMATMY+ LGQ +KA+
Sbjct: 227  QLDIYSYNIWLSSCE---STERMEQVFEQMKLERTINPNWTTFSTMATMYIKLGQFEKAE 283

Query: 1170 DSLQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASL 991
            + L+ VES++T RDR+PYHYL+SLYGS G K EVYR WN YKS F +IPNLGYHA+I+SL
Sbjct: 284  ECLKKVESRITNRDRMPYHYLISLYGSTGNKAEVYRAWNIYKSKFPNIPNLGYHALISSL 343

Query: 990  IRLDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANS 811
            +R+ D+ GAEKIYEEWLS ++++DPR+ NLLL  YV+ G LEKAE  +D++ E GGK NS
Sbjct: 344  VRVGDLEGAEKIYEEWLSVKSSYDPRIGNLLLGCYVKEGFLEKAEGFLDHMIEAGGKPNS 403

Query: 810  YTWEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEA 631
             TWE LAEG+   K+I+ ALSC K+A LAEG+  WKP P NVS F  LCE+E+DTA+KEA
Sbjct: 404  TTWEILAEGNTGVKKISDALSCFKRAVLAEGSNGWKPKPVNVSAFLDLCEEEADTATKEA 463

Query: 630  LVDVLRQIGCFEKEAYMSQVSAYGVDDLG---LNSVDKDGIDVG---GNGDGTHILLNQL 469
            L+ +LRQ+GC E E Y S    +     G    N  D+ G D        DG  +LLNQ 
Sbjct: 464  LMGLLRQMGCLEDEPYASLFGLHTGSVTGNELSNEKDRTGADKDIDEDEDDGAEMLLNQF 523

Query: 468  ERSL 457
            +  L
Sbjct: 524  QSGL 527


>ref|XP_004229730.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Solanum lycopersicum]
          Length = 545

 Score =  506 bits (1302), Expect = e-140
 Identities = 245/419 (58%), Positives = 320/419 (76%), Gaps = 1/419 (0%)
 Frame = -1

Query: 1710 KLALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTAL 1531
            KLA  VYEWM+ + +RF+L+ SD AIQLDLI+KV G+ SAEEYF +L  +L DKR Y +L
Sbjct: 130  KLAFEVYEWMNNRPERFRLTTSDTAIQLDLIAKVHGISSAEEYFDKLPDTLKDKRIYGSL 189

Query: 1530 LNAYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNI 1351
            LNA+V ++ KE+AE L+++MR +GY  H LP N+MMTLYM LK+Y+KV S+VSEM EK I
Sbjct: 190  LNAFVRSRKKEQAESLLDKMRNRGYTDHALPFNVMMTLYMNLKDYDKVESVVSEMKEKRI 249

Query: 1350 ALDLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAK 1171
             LD+YSYNIWL++CG+ GSIEKME  LEQM  D  +NPNWTT+STMATMY+ LGQ +KA+
Sbjct: 250  PLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINPNWTTFSTMATMYIKLGQMKKAE 309

Query: 1170 DSLQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASL 991
            DSL+ VES++TGRDR+PYHYL+SLYGS+G+KE+V RIW TY+S F +IPNLGYH++I+SL
Sbjct: 310  DSLKSVESRITGRDRIPYHYLISLYGSLGKKEDVLRIWKTYQSQFPNIPNLGYHSVISSL 369

Query: 990  IRLDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANS 811
            +RLDDI GAEKIY+EWL  +  +DPR+ NLLL +YVR G ++KA A  D +   GGK NS
Sbjct: 370  VRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFVDKASAFFDQMIGAGGKPNS 429

Query: 810  YTWEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEA 631
             T E LAEGHIR+++I+ ALSC+K A  +EG+K+W+P P  VS    LCEQE D  +KE 
Sbjct: 430  MTCEILAEGHIRDRRISEALSCLKDAVSSEGSKSWRPKPATVSSILRLCEQEDDIQNKEV 489

Query: 630  LVDVLRQIGCFEKEAYMSQVS-AYGVDDLGLNSVDKDGIDVGGNGDGTHILLNQLERSL 457
            L++VL+Q+GC + E YMS +  + G        ++KD  D   N +G+ ILLNQL+ SL
Sbjct: 490  LLEVLKQVGCLDDEKYMSYIPLSNGSFTSSEREIEKDTSD---NDEGSDILLNQLQESL 545


>emb|CAN75355.1| hypothetical protein VITISV_002476 [Vitis vinifera]
          Length = 736

 Score =  506 bits (1302), Expect = e-140
 Identities = 251/419 (59%), Positives = 316/419 (75%), Gaps = 6/419 (1%)
 Frame = -1

Query: 1695 VYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLNAYV 1516
            VYEWM+ +G+RF+LS+SDAAIQLDLI+KV GV SAE+YFSRL  +L DKR Y ALLNAYV
Sbjct: 321  VYEWMNNRGERFRLSSSDAAIQLDLIAKVCGVSSAEDYFSRLPDTLKDKRIYGALLNAYV 380

Query: 1515 SAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIALDLY 1336
             AKM++KAE L+E++R KGYA  PLP N+MMTLYM LKE +KV S++SEM+ KNI LD+Y
Sbjct: 381  QAKMRDKAEILIEKLRNKGYATTPLPFNVMMTLYMNLKELDKVQSMISEMMNKNIQLDIY 440

Query: 1335 SYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDSLQM 1156
            SYNIWL++C    S E+ME   EQMK ++T+NPNWTT+STMATMY+ LGQ +KA++ L+ 
Sbjct: 441  SYNIWLSSCE---STERMEQVFEQMKLERTINPNWTTFSTMATMYIKLGQFEKAEECLKK 497

Query: 1155 VESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIRLDD 976
            VES++T RDR+PYHYL+SLYGS G K EVYR WN YKS F +IPNLGYHA+I+SL+R+ D
Sbjct: 498  VESRITNRDRMPYHYLISLYGSTGNKAEVYRAWNIYKSKFPNIPNLGYHALISSLVRVGD 557

Query: 975  IGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYTWEF 796
            + GAEKIYEEWLS ++++DPR+ NLLL  YV+ G LEKAE  +D++ E GGK NS TWE 
Sbjct: 558  LEGAEKIYEEWLSVKSSYDPRIGNLLLGCYVKEGFLEKAEGFLDHMIEAGGKPNSTTWEI 617

Query: 795  LAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALVDVL 616
            LAEG+   K+I+ ALSC K+A LAEG+  WKP P NVS F  LCE+E+DTA+KEAL+ +L
Sbjct: 618  LAEGNTGVKKISDALSCFKRAVLAEGSNGWKPKPVNVSAFLDLCEEEADTATKEALMGLL 677

Query: 615  RQIGCFEKEAYMSQVSAYGVDDLG---LNSVDKDGIDVG---GNGDGTHILLNQLERSL 457
            RQ+GC E E Y S    +     G    N  D+ G D        DG  +LLNQ +  L
Sbjct: 678  RQMGCLEDEPYASLFGLHTGSVTGNELSNEKDRTGADKDIDEDEDDGAEMLLNQFQSGL 736


>ref|XP_007218971.1| hypothetical protein PRUPE_ppa003822mg [Prunus persica]
            gi|462415433|gb|EMJ20170.1| hypothetical protein
            PRUPE_ppa003822mg [Prunus persica]
          Length = 546

 Score =  505 bits (1300), Expect = e-140
 Identities = 243/421 (57%), Positives = 326/421 (77%), Gaps = 8/421 (1%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM  +G+RF++S SDAAIQLDL++KVRGV SAE YF  L  +L D+R Y ALLN
Sbjct: 122  ALEVYDWMSNRGERFRISTSDAAIQLDLVAKVRGVASAENYFLSLPDTLKDRRIYGALLN 181

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV  +MKEKAE L+++MR+KG+A+  LP N+MMTLYM LKEY+KV S++SEM+EKNI L
Sbjct: 182  AYVRTRMKEKAESLLDKMRSKGHALQSLPFNVMMTLYMNLKEYDKVDSIISEMMEKNIQL 241

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++ G+ GS E+ME   EQMK D+TVNPNWTT+STMATMY+ +GQ +KA+  
Sbjct: 242  DIYSYNIWLSSRGSQGSEERMEQVFEQMKLDRTVNPNWTTFSTMATMYIKMGQLEKAEAC 301

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VES++TGRDR+PYHYLLSLYG+VG KEE+YR+WN YKS F SIPNLGYHAI++SL+R
Sbjct: 302  LKKVESRITGRDRIPYHYLLSLYGNVGNKEELYRVWNIYKSSFPSIPNLGYHAIMSSLLR 361

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            + D+ GAEKIYEEWL+ ++T+DPR+ N+ + +Y+++G  EKA++  D++ +VGGK NS T
Sbjct: 362  VGDVEGAEKIYEEWLTVKSTYDPRIANVFIAYYIKDGDFEKAQSFYDHMVDVGGKPNSTT 421

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LAEGHI E++I+ ALSC K+A  AEG+K+W+P P NVS F  LCEQE+++ SKE  +
Sbjct: 422  WETLAEGHIEEQRISEALSCWKEAFSAEGSKSWRPKPVNVSAFLELCEQEANSVSKEFFM 481

Query: 624  DVLRQIGCFEKEAYMSQVSA----YGVDDLGL----NSVDKDGIDVGGNGDGTHILLNQL 469
             +L+Q G  + ++Y S +         DDL L     ++ KD  D    GDG+ +LLN+L
Sbjct: 482  GLLKQSGQLKNKSYASLIGLADEDVSDDDLSLKKDRTNITKDDDDEKEAGDGSELLLNEL 541

Query: 468  E 466
            +
Sbjct: 542  Q 542


>ref|XP_004307244.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Fragaria vesca subsp. vesca]
          Length = 541

 Score =  504 bits (1299), Expect = e-140
 Identities = 246/422 (58%), Positives = 320/422 (75%), Gaps = 6/422 (1%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM  + +RF+ S+SDAAIQLDL+ KVRGV SAE YF  L  +L DKR Y ALLN
Sbjct: 120  ALEVYDWMINRAERFRFSSSDAAIQLDLVGKVRGVSSAENYFLSLPDNLKDKRIYGALLN 179

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV AKM+EKAE L+++MR+KG+A+HPLP N+MMTLYM LKEYEKV S++SEM+EKNI L
Sbjct: 180  AYVRAKMQEKAESLLDKMRSKGHALHPLPFNVMMTLYMNLKEYEKVESIISEMMEKNIQL 239

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++ G+ GS E+ME   EQMK D+T+NPNWTT+STMATMY+ +G  +KA+  
Sbjct: 240  DIYSYNIWLSSRGSQGSAERMEQVFEQMKLDRTINPNWTTFSTMATMYIKMGLFEKAEAC 299

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VES++TGRDR+PYHYLLSLYG VG K+E+YR+WN YKS F SIPNLGYHAIIA+LIR
Sbjct: 300  LKKVESRITGRDRIPYHYLLSLYGGVGNKDEIYRVWNVYKSSFPSIPNLGYHAIIAALIR 359

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            + D+ GAEKI+EEWL+ + ++DPR+ NL +  Y+  G  +KA++  DN+ E GGK NS T
Sbjct: 360  VGDVEGAEKIFEEWLTVKPSYDPRIVNLFIVSYIEEGDFDKAQSFFDNMVEAGGKPNSST 419

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LAEGHI EK+I+ ALSC K+A +AEG+K+W+P P NV+ F+  CEQE D  SKE  +
Sbjct: 420  WEALAEGHIEEKRISEALSCWKEAFMAEGSKSWRPKPVNVTTFYEFCEQEGDLRSKEIFL 479

Query: 624  DVLRQIGCFEKEAYMSQVSAYGVDDLGLN-SVDKDGIDVGGNG-----DGTHILLNQLER 463
             +LRQ G  + ++Y   V     D    + S++KD I+   +G     DG+ +LLNQL  
Sbjct: 480  GLLRQSGQLKNKSYALLVGLSDEDSSDNDISLEKDSINDNQDGDEKSDDGSDMLLNQLHS 539

Query: 462  SL 457
            +L
Sbjct: 540  TL 541


>ref|XP_006445447.1| hypothetical protein CICLE_v10019658mg [Citrus clementina]
            gi|568819745|ref|XP_006464406.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g02150-like [Citrus sinensis]
            gi|557547709|gb|ESR58687.1| hypothetical protein
            CICLE_v10019658mg [Citrus clementina]
          Length = 535

 Score =  499 bits (1285), Expect = e-138
 Identities = 245/415 (59%), Positives = 321/415 (77%)
 Frame = -1

Query: 1710 KLALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTAL 1531
            K AL VY+WM+ +G+RF+LSASDAAIQLDLI+KV GV SAE++F  L  +L D+R Y AL
Sbjct: 123  KHALEVYDWMNNRGERFRLSASDAAIQLDLIAKVHGVASAEDFFLSLPDTLKDRRVYGAL 182

Query: 1530 LNAYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNI 1351
            LNAYV A+M+  AE L+++MR KGYA+H LP N+MMTLYMK+KEY++V S+VSEM EK I
Sbjct: 183  LNAYVRARMRGNAELLIDKMRDKGYAVHSLPYNVMMTLYMKIKEYDEVESMVSEMKEKGI 242

Query: 1350 ALDLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAK 1171
             LD+YSYNIWL++CG+ GS EKME   E MK DK VNPNWTT+STMATMY+ +GQ +KA+
Sbjct: 243  RLDVYSYNIWLSSCGSQGSTEKMEGVFELMKVDKAVNPNWTTFSTMATMYIKMGQVEKAE 302

Query: 1170 DSLQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASL 991
            +SL+ VES++TGRDRVPYHYLLSLYGSVG+KEEVYR+WN Y+S F  + NLGYHA+I+SL
Sbjct: 303  ESLRRVESRITGRDRVPYHYLLSLYGSVGKKEEVYRVWNLYRSVFPGVTNLGYHAMISSL 362

Query: 990  IRLDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANS 811
             R+ DI G EKI+EEWLS ++++DPR+ NL++ WYV+ G  +KAEA  +++ E GGK NS
Sbjct: 363  ARIGDIEGMEKIFEEWLSVKSSYDPRIANLMMSWYVKEGNFDKAEAFFNSIIEEGGKPNS 422

Query: 810  YTWEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEA 631
             +WE LAEGHIRE++I  ALSC+K A  AEGAK+W+P P NV  FF  CE+ESD  SKEA
Sbjct: 423  TSWETLAEGHIRERRILEALSCLKGAFAAEGAKSWRPKPVNVINFFKACEEESDMGSKEA 482

Query: 630  LVDVLRQIGCFEKEAYMSQVSAYGVDDLGLNSVDKDGIDVGGNGDGTHILLNQLE 466
             V +LRQ G  +++ YMS +   G+ D  +   +K   +   + + + +LL+QL+
Sbjct: 483  FVALLRQPGYRKEKDYMSLI---GLTDEAVAENNKKNDE--DSDEDSEMLLSQLQ 532


>ref|XP_006375170.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550323489|gb|ERP52967.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 539

 Score =  496 bits (1277), Expect = e-137
 Identities = 241/420 (57%), Positives = 314/420 (74%), Gaps = 4/420 (0%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM  + +RF+LS SDAAIQLDLI+KVRGV +AE++F  L  +  D+R Y ALLN
Sbjct: 120  ALEVYDWMKNRQERFRLSPSDAAIQLDLIAKVRGVSTAEDFFLSLPNTFKDRRVYGALLN 179

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV  +M+EKAE L +EMR KGY  H LP N+ MTLYM +KEY+KV  ++SEM EKNI L
Sbjct: 180  AYVQNRMREKAETLFDEMRDKGYVTHALPFNVTMTLYMNIKEYDKVDLMISEMNEKNIKL 239

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++CG+ GS +KME   EQMK D+++NPNWTT+STMATMY+ +GQ +KA+D 
Sbjct: 240  DIYSYNIWLSSCGSQGSADKMEQVYEQMKSDRSINPNWTTFSTMATMYIKMGQFEKAEDC 299

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VES++TGRDR+PYHYLLSLYG+VG KEEVYR+WN YKS F SIPNLGYHAII+SL+R
Sbjct: 300  LRRVESRITGRDRIPYHYLLSLYGNVGNKEEVYRVWNIYKSIFPSIPNLGYHAIISSLVR 359

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            LDDI GAEKI+EEWLS +T++DPR+ NL +  YV  G L++A++  D++ E GGK NS T
Sbjct: 360  LDDIEGAEKIFEEWLSIKTSYDPRIANLFIAAYVYQGNLDEAKSFFDHMLEDGGKPNSNT 419

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LA+GHI E++ + ALSC+K+A +  G+K+WKP P NV+ FF LCE+E+D A+KEAL 
Sbjct: 420  WEILAQGHISERRTSEALSCLKEAFVTPGSKSWKPNPANVTSFFKLCEEEADMANKEALE 479

Query: 624  DVLRQIGCFEKEAYMSQVSAYGVDDLGLNSVDKDGIDVGG----NGDGTHILLNQLERSL 457
              LRQ G  + +AY S +      D      D+ G  +        DG  +L++ L+ SL
Sbjct: 480  GFLRQSGHLKDKAYASLLGMPVTGDELSTKEDRTGDQIDNEEDDEDDGAEMLVSHLQGSL 539


>emb|CBI16683.3| unnamed protein product [Vitis vinifera]
          Length = 423

 Score =  493 bits (1270), Expect = e-137
 Identities = 239/377 (63%), Positives = 302/377 (80%)
 Frame = -1

Query: 1710 KLALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTAL 1531
            K+AL VYEWM+ +G+RF+LS+SDAAIQLDLI+KV GV SAE+YFSRL  +L DKR Y AL
Sbjct: 41   KMALEVYEWMNNRGERFRLSSSDAAIQLDLIAKVCGVSSAEDYFSRLPDTLKDKRIYGAL 100

Query: 1530 LNAYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNI 1351
            LNAYV AKM++KAE L+E++R KGYA  PLP N+MMTLYM LKE +KV S++SEM+ KNI
Sbjct: 101  LNAYVQAKMRDKAEILIEKLRNKGYATTPLPFNVMMTLYMNLKELDKVQSMISEMMNKNI 160

Query: 1350 ALDLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAK 1171
             LD+YSYNIWL++C    S E+ME   EQMK ++T+NPNWTT+STMATMY+ LGQ +KA+
Sbjct: 161  QLDIYSYNIWLSSCE---STERMEQVFEQMKLERTINPNWTTFSTMATMYIKLGQFEKAE 217

Query: 1170 DSLQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASL 991
            + L+ VES++T RDR+PYHYL+SLYGS G K EVYR WN YKS F +IPNLGYHA+I+SL
Sbjct: 218  ECLKKVESRITNRDRMPYHYLISLYGSTGNKAEVYRAWNIYKSKFPNIPNLGYHALISSL 277

Query: 990  IRLDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANS 811
            +R+ D+ GAEKIYEEWLS ++++DPR+ NLLL  YV+ G LEKAE  +D++ E GGK NS
Sbjct: 278  VRVGDLEGAEKIYEEWLSVKSSYDPRIGNLLLGCYVKEGFLEKAEGFLDHMIEAGGKPNS 337

Query: 810  YTWEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEA 631
             TWE LAEG+   K+I+ ALSC K+A LAEG+  WKP P NVS F  LCE+E+DTA+KEA
Sbjct: 338  TTWEILAEGNTGVKKISDALSCFKRAVLAEGSNGWKPKPVNVSAFLDLCEEEADTATKEA 397

Query: 630  LVDVLRQIGCFEKEAYM 580
            L+ +LRQ+G  +  A M
Sbjct: 398  LMGLLRQMGYEDDGAEM 414


>ref|XP_006418504.1| hypothetical protein EUTSA_v10007383mg [Eutrema salsugineum]
            gi|557096275|gb|ESQ36857.1| hypothetical protein
            EUTSA_v10007383mg [Eutrema salsugineum]
          Length = 517

 Score =  493 bits (1269), Expect = e-136
 Identities = 232/416 (55%), Positives = 319/416 (76%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM+ +G+RF+LSASDAAIQLDLI KVRG+  AEE+F  L  +  D+R Y +LLN
Sbjct: 116  ALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGISDAEEFFLSLPENFKDRRVYGSLLN 175

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV AK +EKAE L+++MR KGYA+HPLP N+MMTLYM L+EY+KV ++V EM +K+I L
Sbjct: 176  AYVRAKSREKAEALIDKMREKGYALHPLPFNVMMTLYMNLREYDKVDAMVYEMKQKDIRL 235

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++CG+ GS+EKME   +QMK D ++NPNWTT+STMATMY+ +G+++KA+D+
Sbjct: 236  DIYSYNIWLSSCGSHGSVEKMEQVYQQMKSDVSINPNWTTFSTMATMYIKMGENEKAEDA 295

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VE+++TGR+R+PYHYLLSLYGSVG K+E+YR+WN YKS   SIPNLGYHA+++SL+R
Sbjct: 296  LRKVEARITGRNRIPYHYLLSLYGSVGNKKELYRVWNVYKSVVPSIPNLGYHALVSSLVR 355

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            + DI GAEK+YEEWL  ++++DPR+ NLL+  YV+N  L+KAE L D++ E+GGK +S T
Sbjct: 356  MGDIQGAEKVYEEWLPVKSSYDPRIPNLLMNVYVKNDQLDKAEGLFDHMIEMGGKPSSST 415

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LA GH R++ IT AL+C+K+A  AEG+ NW+P    +S FF LCE+ESD ASKEA++
Sbjct: 416  WEILAHGHTRKRNITEALTCLKEAFSAEGSSNWRPKVFMLSGFFKLCEEESDVASKEAVL 475

Query: 624  DVLRQIGCFEKEAYMSQVSAYGVDDLGLNSVDKDGIDVGGNGDGTHILLNQLERSL 457
            ++LRQ G  + ++Y + +               D  +     +GT +LL QL+  L
Sbjct: 476  ELLRQSGHLQDKSYQALID--------------DAQESESESEGTDVLLTQLQDDL 517


>ref|XP_006306047.1| hypothetical protein CARUB_v10011354mg [Capsella rubella]
            gi|482574758|gb|EOA38945.1| hypothetical protein
            CARUB_v10011354mg [Capsella rubella]
          Length = 524

 Score =  492 bits (1266), Expect = e-136
 Identities = 232/413 (56%), Positives = 318/413 (76%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM+ +G+RF+LSASDAAIQLDLI KVRG+  AEE+F  L  +  D+R Y +LLN
Sbjct: 116  ALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGISDAEEFFLTLPETFKDRRVYGSLLN 175

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV AK +EKAE L+  MR KGYA+HPLP N+MMTLYM L+EY+KV ++V EM +K+I L
Sbjct: 176  AYVRAKSREKAEALLNTMREKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRL 235

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++CG++GS+EKME   +QMK D  +NPNWTT+STMATMY+ +G+ +KA+D+
Sbjct: 236  DIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVAINPNWTTFSTMATMYIKMGEIEKAEDA 295

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VE+++TGR+R+PYHYLLSLYGSVG K+E+YR+WN YKS   SIPNLGYHA+++SL+R
Sbjct: 296  LRKVEARITGRNRIPYHYLLSLYGSVGNKKELYRVWNVYKSVAPSIPNLGYHALVSSLVR 355

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            + DI GAEK+YEEWL  ++++DPR+ NLL+  YV+N  LEKAE L D++ E+GGK +S T
Sbjct: 356  MGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNVYVKNDQLEKAEGLFDHMVEMGGKPSSST 415

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LA+GH R++ I  AL+C++KA  AEG+ NW+P    +S FF LCE+ESD  SKEA++
Sbjct: 416  WEILADGHTRKRCIPEALTCLRKAFSAEGSSNWRPKVLMLSGFFKLCEEESDITSKEAVL 475

Query: 624  DVLRQIGCFEKEAYMSQVSAYGVDDLGLNSVDKDGIDVGGNGDGTHILLNQLE 466
            ++LRQ G  + ++Y + +      D+  N    +  +     DGT +LL+QL+
Sbjct: 476  ELLRQAGHLQDKSYQALI------DVDENRTVNNSENDAHESDGTDVLLSQLQ 522


>ref|XP_002301239.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550344984|gb|EEE80512.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 539

 Score =  491 bits (1265), Expect = e-136
 Identities = 245/420 (58%), Positives = 317/420 (75%), Gaps = 4/420 (0%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM+ + +RF LS SDAAIQLDLI+KVRGV SAE++F RL  +  D+R Y ALLN
Sbjct: 120  ALEVYDWMNNRQERFGLSPSDAAIQLDLIAKVRGVSSAEDFFLRLPNTFKDRRIYGALLN 179

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV  +M+EKAE L++EMR K Y  H LP N+MMTLYM + EY+KV  ++SEM EKNI L
Sbjct: 180  AYVRNRMREKAESLIDEMRGKDYVTHALPYNVMMTLYMNINEYDKVDLIISEMNEKNIKL 239

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++CG  GS +KME   EQMK D ++NPNWTT+STMATMY+ +G+ +KA+D 
Sbjct: 240  DIYSYNIWLSSCGLQGSADKMEQVFEQMKSDGSINPNWTTFSTMATMYIKMGKFEKAEDC 299

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VES++TGRDR+PYHYLLSLYG+VG KEEVYR+WN YKS F SIPNLGYHA+I+SL+R
Sbjct: 300  LRRVESRITGRDRIPYHYLLSLYGNVGNKEEVYRVWNIYKSIFPSIPNLGYHAMISSLVR 359

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            +DDI GAEKIYEEWLS +T++DPR+ NL +  +V  G L+KAE+  D++ E GGK NS++
Sbjct: 360  MDDIEGAEKIYEEWLSIKTSYDPRIANLFMAAFVYQGNLDKAESFFDHMLEEGGKPNSHS 419

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LA+GHI E++ + ALSC+K+A    G+K+WKP P NVS FF LCE+E D ASKEAL 
Sbjct: 420  WEILAQGHISERRTSEALSCLKEAFATPGSKSWKPNPANVSSFFKLCEEEVDMASKEALA 479

Query: 624  DVLRQIGCFEKEAY--MSQVSAYGVDDLGLNSVDKDGIDVGGN-GD-GTHILLNQLERSL 457
              LRQ G  + +AY  +  +   G +        +D ID   N GD G+ +L++QL+ SL
Sbjct: 480  SFLRQSGHLKDKAYALLLGMPVTGDELSTKEERTEDQIDNEENDGDNGSEMLVSQLQGSL 539


>gb|EYU40343.1| hypothetical protein MIMGU_mgv1a004109mg [Mimulus guttatus]
          Length = 543

 Score =  488 bits (1255), Expect = e-135
 Identities = 230/419 (54%), Positives = 312/419 (74%), Gaps = 1/419 (0%)
 Frame = -1

Query: 1710 KLALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTAL 1531
            K AL VY+WM+ + +R++++ SD AIQLDLI+KV G+ SAE YF +L  +L DKR Y +L
Sbjct: 128  KYALEVYDWMNNRAERYRITTSDTAIQLDLIAKVHGIASAEHYFLKLPDALKDKRIYGSL 187

Query: 1530 LNAYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNI 1351
            LN Y  ++M+EK+E LM+ MR+KGYA H LP N+MMTLYM LK++EK+ SL+SE+ EKNI
Sbjct: 188  LNVYARSRMREKSESLMDIMRSKGYASHALPFNVMMTLYMNLKDHEKLESLISELKEKNI 247

Query: 1350 ALDLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAK 1171
            ALD+Y+YNIWL++CGA G++EKME   E M  D  +NPNWTT+STMAT+Y+ LG  +KA+
Sbjct: 248  ALDIYTYNIWLSSCGAKGAVEKMEEVFELMSADPAINPNWTTFSTMATVYIKLGHLEKAE 307

Query: 1170 DSLQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASL 991
            D L+ +ES++TGRDR+PYHYL+SLYGS   K+EVYR+WN YK+ F +IPNLGYH +I++L
Sbjct: 308  DCLKKIESRVTGRDRLPYHYLISLYGSAHNKDEVYRVWNLYKASFFNIPNLGYHTVISAL 367

Query: 990  IRLDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANS 811
             R D++ GAEKIY+EWLS ++ FDPR+ N+LL  YVR G  +KAE +   + E GGK NS
Sbjct: 368  ARTDEMEGAEKIYDEWLSVKSFFDPRITNILLSSYVRKGLSQKAETMFGQMIEAGGKPNS 427

Query: 810  YTWEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEA 631
             TWE  AE HIR  +I+ ALSC+  ATLA+G+KNW+P P NVS    +CEQ++D ASK+A
Sbjct: 428  MTWEIFAEDHIRNTRISEALSCLNSATLADGSKNWRPNPSNVSSILKICEQQADVASKDA 487

Query: 630  LVDVLRQIGCFEKEAYMSQVSAYGVDDL-GLNSVDKDGIDVGGNGDGTHILLNQLERSL 457
            L+ +LR++GC    +YMS +     + + G  SV +D     G  DGT  LLN+L+ +L
Sbjct: 488  LLAILRRMGCLNDVSYMSYIPMLSGERIPGGVSVAEDS---DGGDDGTFGLLNELQETL 543


>ref|XP_004133941.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Cucumis sativus] gi|449525818|ref|XP_004169913.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g02150-like [Cucumis sativus]
          Length = 537

 Score =  487 bits (1253), Expect = e-135
 Identities = 232/405 (57%), Positives = 303/405 (74%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL +Y+WM  + +RF+L+ SDAAIQLDLISKVRG+ SAEEYF RL   L D+R Y ALLN
Sbjct: 121  ALEIYDWMSNREERFRLTTSDAAIQLDLISKVRGIKSAEEYFLRLPNHLKDRRIYGALLN 180

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AY   + +EKAE L+E+MRTKG+  HPLP N+MMTLYM +KEYEKV SLVSEM E +I L
Sbjct: 181  AYAKGRQREKAENLLEKMRTKGFTTHPLPFNVMMTLYMNVKEYEKVESLVSEMTENSIQL 240

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++CG  GS EKME   EQMKQD+T+N NWTT+STMATMY+ +G  +KA++ 
Sbjct: 241  DIYSYNIWLSSCGLQGSTEKMEEVYEQMKQDRTINANWTTFSTMATMYIKMGLMEKAEEC 300

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VES++ GRDR+PYHYL+SLYGSVG KEE+YR+WN YK+ F +IPNLGYHAII++LIR
Sbjct: 301  LRRVESRIVGRDRIPYHYLISLYGSVGNKEEMYRVWNIYKNVFPTIPNLGYHAIISALIR 360

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            + D+ GAEKIYEEWL+ ++T+DPR+ NL + WYV+ G   KAE+  D++ EVGGK NS T
Sbjct: 361  VGDVEGAEKIYEEWLTVKSTYDPRIANLFIGWYVKEGNTSKAESFFDHMVEVGGKPNSST 420

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE L + H +E +++ AL+  K+A  AEG+K+W+P P NV  +F LCE+E D ASKE LV
Sbjct: 421  WEILVDRHTKEGRVSDALASWKEAFSAEGSKSWRPKPYNVLAYFDLCEKEGDIASKEVLV 480

Query: 624  DVLRQIGCFEKEAYMSQVSAYGVDDLGLNSVDKDGIDVGGNGDGT 490
             +LRQ    + + Y S +     + +  N V + G ++    D T
Sbjct: 481  GLLRQPKYLQDKTYASLIGLLD-ETIDNNEVSEKGSNINDEIDKT 524


>ref|XP_002892022.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297337864|gb|EFH68281.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 523

 Score =  484 bits (1247), Expect = e-134
 Identities = 224/378 (59%), Positives = 303/378 (80%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM+ +G+RF+LSASDAAIQLDLI KVRG+  AE++F  L  +  D+R Y +LLN
Sbjct: 117  ALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGISDAEQFFLTLPENFKDRRVYGSLLN 176

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV AK +EKAE L+  MR KGYA+HPLP N+MMTLYM L+EY+KV ++V EM +K+I L
Sbjct: 177  AYVRAKSREKAEALLHTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRL 236

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++CG++GS+EKME   +QMK D ++NPNWTT+STMATMY+ +G+ +KA+D+
Sbjct: 237  DIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSINPNWTTFSTMATMYIKMGETEKAEDA 296

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VE+++TGR+R+PYHYLLSLYGSVG K+E+YR+WN YKS   SIPNLGYHA+++SL R
Sbjct: 297  LRKVEARITGRNRIPYHYLLSLYGSVGNKKELYRVWNVYKSVVPSIPNLGYHALVSSLAR 356

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            + DI GAEK+YEEWL  ++++DPR+ NLL+  YV+N  LEKAE L D++ E+GGK +S T
Sbjct: 357  MGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNVYVKNDQLEKAEGLFDHMVEMGGKPSSST 416

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LA+GH R++ I  AL+C++KA  AEG+ NW+P    +S FF LCE+ESD  SKEA++
Sbjct: 417  WEILADGHTRKRCIPEALTCLRKAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVL 476

Query: 624  DVLRQIGCFEKEAYMSQV 571
            ++LRQ G  E +AY + +
Sbjct: 477  ELLRQSGHLEDKAYQALI 494


>ref|NP_171717.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806400|sp|Q8LPS6.2|PPR3_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g02150 gi|2317908|gb|AAC24372.1| Unknown protein
            [Arabidopsis thaliana] gi|332189272|gb|AEE27393.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 524

 Score =  480 bits (1236), Expect = e-133
 Identities = 231/416 (55%), Positives = 319/416 (76%)
 Frame = -1

Query: 1704 ALGVYEWMDTKGDRFKLSASDAAIQLDLISKVRGVPSAEEYFSRLSRSLTDKRTYTALLN 1525
            AL VY+WM+ +G+RF+LSASDAAIQLDLI KVRG+P AEE+F +L  +  D+R Y +LLN
Sbjct: 118  ALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLN 177

Query: 1524 AYVSAKMKEKAEFLMEEMRTKGYAMHPLPLNLMMTLYMKLKEYEKVLSLVSEMVEKNIAL 1345
            AYV AK +EKAE L+  MR KGYA+HPLP N+MMTLYM L+EY+KV ++V EM +K+I L
Sbjct: 178  AYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRL 237

Query: 1344 DLYSYNIWLTTCGAMGSIEKMEAALEQMKQDKTVNPNWTTYSTMATMYMNLGQHQKAKDS 1165
            D+YSYNIWL++CG++GS+EKME   +QMK D ++ PNWTT+STMATMY+ +G+ +KA+D+
Sbjct: 238  DIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDA 297

Query: 1164 LQMVESKMTGRDRVPYHYLLSLYGSVGEKEEVYRIWNTYKSGFSSIPNLGYHAIIASLIR 985
            L+ VE+++TGR+R+PYHYLLSLYGS+G K+E+YR+W+ YKS   SIPNLGYHA+++SL+R
Sbjct: 298  LRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVR 357

Query: 984  LDDIGGAEKIYEEWLSARTTFDPRVCNLLLKWYVRNGCLEKAEALVDNVFEVGGKANSYT 805
            + DI GAEK+YEEWL  ++++DPR+ NLL+  YV+N  LE AE L D++ E+GGK +S T
Sbjct: 358  MGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSST 417

Query: 804  WEFLAEGHIREKQITGALSCIKKATLAEGAKNWKPTPRNVSLFFSLCEQESDTASKEALV 625
            WE LA GH R++ I+ AL+C++ A  AEG+ NW+P    +S FF LCE+ESD  SKEA++
Sbjct: 418  WEILAVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVL 477

Query: 624  DVLRQIGCFEKEAYMSQVSAYGVDDLGLNSVDKDGIDVGGNGDGTHILLNQLERSL 457
            ++LRQ G  E ++Y++ +    VD+    +V+   ID       T  LL QL+  L
Sbjct: 478  ELLRQSGDLEDKSYLALID---VDE--NRTVNNSEID----AHETDALLTQLQDDL 524


Top