BLASTX nr result

ID: Akebia25_contig00010484 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00010484
         (2491 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI17752.3| unnamed protein product [Vitis vinifera]             1092   0.0  
ref|XP_007220734.1| hypothetical protein PRUPE_ppa023145mg [Prun...  1005   0.0  
ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containi...  1003   0.0  
ref|XP_006345374.1| PREDICTED: pentatricopeptide repeat-containi...   983   0.0  
gb|EXC10461.1| hypothetical protein L484_008628 [Morus notabilis]     983   0.0  
ref|XP_007008770.1| Pentatricopeptide repeat (PPR-like) superfam...   981   0.0  
ref|XP_002304774.1| pentatricopeptide repeat-containing family p...   979   0.0  
ref|XP_006605814.1| PREDICTED: pentatricopeptide repeat-containi...   961   0.0  
gb|EYU42962.1| hypothetical protein MIMGU_mgv1a021045mg [Mimulus...   955   0.0  
ref|XP_003598903.1| Pentatricopeptide repeat-containing protein ...   947   0.0  
ref|XP_004494981.1| PREDICTED: pentatricopeptide repeat-containi...   941   0.0  
ref|XP_004515007.1| PREDICTED: pentatricopeptide repeat-containi...   939   0.0  
ref|XP_007149018.1| hypothetical protein PHAVU_005G033500g [Phas...   935   0.0  
ref|XP_004511291.1| PREDICTED: pentatricopeptide repeat-containi...   930   0.0  
ref|XP_006482966.1| PREDICTED: pentatricopeptide repeat-containi...   917   0.0  
ref|XP_004229293.1| PREDICTED: pentatricopeptide repeat-containi...   914   0.0  
ref|XP_006438906.1| hypothetical protein CICLE_v10030824mg [Citr...   912   0.0  
ref|XP_002513116.1| pentatricopeptide repeat-containing protein,...   910   0.0  
ref|XP_004305399.1| PREDICTED: pentatricopeptide repeat-containi...   889   0.0  
ref|XP_006413862.1| hypothetical protein EUTSA_v10024515mg [Eutr...   884   0.0  

>emb|CBI17752.3| unnamed protein product [Vitis vinifera]
          Length = 729

 Score = 1092 bits (2824), Expect = 0.0
 Identities = 527/727 (72%), Positives = 622/727 (85%), Gaps = 3/727 (0%)
 Frame = +2

Query: 35   MPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCS 214
            MPP+ QP    +K YF+YGHRKPSQNRPTVHGGLFSNR T+NP P       TL      
Sbjct: 1    MPPQPQPPK-PHKFYFFYGHRKPSQNRPTVHGGLFSNRTTLNPKP------PTLQNPTTH 53

Query: 215  IDINKWDPNSPQKF--PPTKTQSEKFFSIAQTLSPIARYICDSFRKHKNWCPSIVKDLDK 388
             ++  WDP+SP+    PP+KT  E+FF IA+ LSPIARYICDSFRKH+NW P +V DL+K
Sbjct: 54   FNLQNWDPDSPKALAIPPSKTPCERFFDIAKNLSPIARYICDSFRKHRNWGPPVVADLNK 113

Query: 389  LRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAADQ 568
            LRRV P LVAEVLK QTDP I S+FFHWAGKQKG++HNFASYNAFAYCLNR N FRAADQ
Sbjct: 114  LRRVTPVLVAEVLKVQTDPVICSKFFHWAGKQKGYKHNFASYNAFAYCLNRSNQFRAADQ 173

Query: 569  VPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRILDALV 748
            VP+LM MQG  P+EKQFEILIRMH DA RGLRVYY+YEKMK FG+KPRVFLYNRI+D LV
Sbjct: 174  VPELMNMQGKPPSEKQFEILIRMHIDANRGLRVYYVYEKMKKFGIKPRVFLYNRIMDGLV 233

Query: 749  KTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-NLCKPDVC 925
            KTGH+DLA+SVYEDFKEDGLVEESVT+MILVKGLCK GR++EV E+L RMR NLCKPDV 
Sbjct: 234  KTGHLDLAMSVYEDFKEDGLVEESVTYMILVKGLCKAGRIDEVLELLDRMRGNLCKPDVF 293

Query: 926  AYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFFKEM 1105
            AYTAM+++LV+EGNLDGCLR+WEEM+ D V+PDVMAYTTLV  LC GN+V +G+E FKEM
Sbjct: 294  AYTAMVKVLVAEGNLDGCLRVWEEMRKDKVEPDVMAYTTLVAALCNGNRVGEGFELFKEM 353

Query: 1106 REKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNANRV 1285
            ++K  LIDR++YGSLIE FVV+ ++GSACDLLKDL+DSGYRADL+IYNS++EG+CN  +V
Sbjct: 354  KQKKYLIDRAIYGSLIEGFVVNERVGSACDLLKDLMDSGYRADLAIYNSLIEGMCNVKQV 413

Query: 1286 DKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLSRFF 1465
            DKAYKLFQ+TV E L P+F+TV P+LVSYA+  RMDDFC LL +MQKLG PVIDDLS+FF
Sbjct: 414  DKAYKLFQVTVHESLEPNFLTVKPMLVSYAEMKRMDDFCSLLGQMQKLGFPVIDDLSKFF 473

Query: 1466 SFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDLDFE 1645
            S M+ KG+R++ ALEVFE LK KGYCSISIYNIL+ A+HR GE + AL+LF+++KD +F+
Sbjct: 474  SVMIEKGERLKLALEVFEHLKAKGYCSISIYNILMEAIHRTGEVKKALSLFDDIKDSNFK 533

Query: 1646 PDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAVLTL 1825
            PDSST+SN I CFV+VGDV+EAC+CYNK+ +M  +P+V+AY SLVKGLC+  E DA + L
Sbjct: 534  PDSSTYSNAIICFVEVGDVQEACACYNKIIEMCQLPSVAAYRSLVKGLCKSEEIDAAIML 593

Query: 1826 VRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIISGMC 2005
            VRDCL NVTSGPMEFKY LTILHACKSG+AEKVI+VLNEMM++GC PD+VTY A+ISGMC
Sbjct: 594  VRDCLANVTSGPMEFKYTLTILHACKSGNAEKVIDVLNEMMQEGCTPDEVTYSALISGMC 653

Query: 2006 NHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESKLKS 2185
             HGTLEEARKVF +MR+R LLTEAN+IVYDE+LI+HMKKKTA LVLSGLKFFGLESKL+S
Sbjct: 654  KHGTLEEARKVFSNMRERKLLTEANVIVYDEILIEHMKKKTADLVLSGLKFFGLESKLRS 713

Query: 2186 KGSTILP 2206
            KGST+LP
Sbjct: 714  KGSTLLP 720


>ref|XP_007220734.1| hypothetical protein PRUPE_ppa023145mg [Prunus persica]
            gi|462417196|gb|EMJ21933.1| hypothetical protein
            PRUPE_ppa023145mg [Prunus persica]
          Length = 721

 Score = 1005 bits (2598), Expect = 0.0
 Identities = 488/727 (67%), Positives = 599/727 (82%), Gaps = 2/727 (0%)
 Frame = +2

Query: 35   MPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCS 214
            MPP+S P    N   F++GHRKPSQNRP V GGLFSNR ++ PN  +  ++    P    
Sbjct: 1    MPPQSPPPKPQN-FTFFHGHRKPSQNRPRVRGGLFSNRVSL-PNRRYPIAAPQPQP---- 54

Query: 215  IDINKWDPNSPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN-WCPSIVKDLDKL 391
             +++KWDP+ PQ  P T + +    ++   LSPIAR+I D+FRK++N W P +V +L KL
Sbjct: 55   FELSKWDPHLPQSSPSTSSSNPADTTLLSFLSPIARFILDAFRKNQNHWGPPVVSELRKL 114

Query: 392  RRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAADQV 571
            RRV P+LVAEVLK Q DP  +S+FFHWAGKQKGF+H +ASYNA AYCLNR N FR+ADQV
Sbjct: 115  RRVTPDLVAEVLKVQNDPVSASKFFHWAGKQKGFKHTYASYNALAYCLNRSNRFRSADQV 174

Query: 572  PDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRILDALVK 751
            P+LM  QG  P+EKQFEILIRMHSDA RGLRVYY+YEKMK FGVKPRVFLYNRI+DALVK
Sbjct: 175  PELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVKPRVFLYNRIMDALVK 234

Query: 752  TGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-NLCKPDVCA 928
            +G++DLA+SVYEDF+ DGLVEESVTFMIL+KGLCK+GRM+E+ ++L RMR NLCKPDV A
Sbjct: 235  SGYLDLALSVYEDFRGDGLVEESVTFMILIKGLCKMGRMDEMLQLLERMRVNLCKPDVFA 294

Query: 929  YTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFFKEMR 1108
            YTAM+++L+SEGNLDGCLR+WEEMK D V  DVMAY TLVTGLCKG +VEKGY+ F+EM+
Sbjct: 295  YTAMVKVLISEGNLDGCLRVWEEMKRDRVGADVMAYATLVTGLCKGGRVEKGYKLFREMK 354

Query: 1109 EKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNANRVD 1288
             KG LIDR++YG LIE FV D K+G+ACDLLKDL+DSGYRADL IYNS++EGLCNA RVD
Sbjct: 355  VKGFLIDRAIYGVLIEGFVADRKVGAACDLLKDLMDSGYRADLGIYNSLIEGLCNAKRVD 414

Query: 1289 KAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLSRFFS 1468
            KAYK+F++TV+EGL PDF TVNPILVSYA+  RMD+FC +L +M+K   PVIDDLS+FFS
Sbjct: 415  KAYKIFRVTVQEGLQPDFATVNPILVSYAEMRRMDNFCDMLAEMEKFDFPVIDDLSKFFS 474

Query: 1469 FMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDLDFEP 1648
            FMVGK D V  ALEVF +LK KGY S+ IYNIL+G+LH+ G+ + AL+LF EMKD+D +P
Sbjct: 475  FMVGKEDGVPLALEVFGELKVKGYYSVGIYNILMGSLHKSGKVKKALSLFNEMKDVDLQP 534

Query: 1649 DSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAVLTLV 1828
            D+ST+S  I CFV+  D+ EAC+ +NK+ +MS +P++SAYCSL +GLC++GE D V+ LV
Sbjct: 535  DASTYSIAIMCFVEDEDIHEACASHNKIIEMSCVPSISAYCSLARGLCKVGEIDTVMLLV 594

Query: 1829 RDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIISGMCN 2008
            RDCL +VTSGPMEFKY+LTILHACKS +AEKVIEVLNEMM+QGCP DDV Y AIISGMC 
Sbjct: 595  RDCLASVTSGPMEFKYSLTILHACKSNNAEKVIEVLNEMMQQGCPLDDVIYSAIISGMCK 654

Query: 2009 HGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESKLKSK 2188
            HGT+EEA K+F ++++R LLTEAN+ VYDE+LI+H+KKKTA LV+SGLKFFGLESKLK+K
Sbjct: 655  HGTIEEAMKIFSNLKERKLLTEANMFVYDEVLIEHVKKKTADLVVSGLKFFGLESKLKAK 714

Query: 2189 GSTILPG 2209
            G  +L G
Sbjct: 715  GCKLLSG 721


>ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Vitis vinifera]
          Length = 1294

 Score = 1003 bits (2593), Expect = 0.0
 Identities = 495/738 (67%), Positives = 590/738 (79%), Gaps = 3/738 (0%)
 Frame = +2

Query: 2    KWKKRATTNPQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYK 181
            K+ + +  N +MPP+ QP    +K YF+YGHRKPSQNRPTVHGGLFSNR T+NP P    
Sbjct: 529  KFAEGSLKNSKMPPQPQPPK-PHKFYFFYGHRKPSQNRPTVHGGLFSNRTTLNPKP---- 583

Query: 182  SSKTLNPNQCSIDINKWDPNSPQKF--PPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN 355
               TL       ++  WDP+SP+    PP+KT  E+FF IA+ LSPIARYICDSFRKH+N
Sbjct: 584  --PTLQNPTTHFNLQNWDPDSPKALAIPPSKTPCERFFDIAKNLSPIARYICDSFRKHRN 641

Query: 356  WCPSIVKDLDKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCL 535
            W P +V DL+KLRRV P LVAEVLK QTDP I S+FFHWAGKQKG++HNFASYNAFAYCL
Sbjct: 642  WGPPVVADLNKLRRVTPVLVAEVLKVQTDPVICSKFFHWAGKQKGYKHNFASYNAFAYCL 701

Query: 536  NRLNMFRAADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRV 715
            NR N FRAADQVP+LM MQG  P+EKQFEILIRMH DA RGLRVYY+YEKMK FG+KPRV
Sbjct: 702  NRSNQFRAADQVPELMNMQGKPPSEKQFEILIRMHIDANRGLRVYYVYEKMKKFGIKPRV 761

Query: 716  FLYNRILDALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSR 895
            FLYNRI+D LVKTGH+DLA+SVYEDFKEDGLVEESVT+MILVKGLCK GR++EV E+   
Sbjct: 762  FLYNRIMDGLVKTGHLDLAMSVYEDFKEDGLVEESVTYMILVKGLCKAGRIDEVLEVWEE 821

Query: 896  MR-NLCKPDVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNK 1072
            MR +  +PDV AYT +                                   V  LC GN+
Sbjct: 822  MRKDKVEPDVMAYTTL-----------------------------------VAALCNGNR 846

Query: 1073 VEKGYEFFKEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNS 1252
            V +G+E FKEM++K  LIDR++YGSLIE FVV+ ++GSACDLLKDL+DSGYRADL+IYNS
Sbjct: 847  VGEGFELFKEMKQKKYLIDRAIYGSLIEGFVVNERVGSACDLLKDLMDSGYRADLAIYNS 906

Query: 1253 MVEGLCNANRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLG 1432
            ++EG+CN  +VDKAYKLFQ+TV E L P+F+TV P+LVSYA+  RMDDFC LL +MQKLG
Sbjct: 907  LIEGMCNVKQVDKAYKLFQVTVHESLEPNFLTVKPMLVSYAEMKRMDDFCSLLGQMQKLG 966

Query: 1433 LPVIDDLSRFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALT 1612
             PVIDDLS+FFS M+ KG+R++ ALEVFE LK KGYCSISIYNIL+ A+HR GE + AL+
Sbjct: 967  FPVIDDLSKFFSVMIEKGERLKLALEVFEHLKAKGYCSISIYNILMEAIHRTGEVKKALS 1026

Query: 1613 LFEEMKDLDFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLC 1792
            LF+++KD +F+PDSST+SN I CFV+VGDV+EAC+CYNK+ +M  +P+V+AY SLVKGLC
Sbjct: 1027 LFDDIKDSNFKPDSSTYSNAIICFVEVGDVQEACACYNKIIEMCQLPSVAAYRSLVKGLC 1086

Query: 1793 RIGETDAVLTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDD 1972
            +  E DA + LVRDCL NVTSGPMEFKY LTILHACKSG+AEKVI+VLNEMM++GC PD+
Sbjct: 1087 KSEEIDAAIMLVRDCLANVTSGPMEFKYTLTILHACKSGNAEKVIDVLNEMMQEGCTPDE 1146

Query: 1973 VTYFAIISGMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGL 2152
            VTY A+ISGMC HGTLEEARKVF +MR+R LLTEAN+IVYDE+LI+HMKKKTA LVLSGL
Sbjct: 1147 VTYSALISGMCKHGTLEEARKVFSNMRERKLLTEANVIVYDEILIEHMKKKTADLVLSGL 1206

Query: 2153 KFFGLESKLKSKGSTILP 2206
            KFFGLESKL+SKGST+LP
Sbjct: 1207 KFFGLESKLRSKGSTLLP 1224


>ref|XP_006345374.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Solanum tuberosum]
          Length = 720

 Score =  983 bits (2542), Expect = 0.0
 Identities = 479/727 (65%), Positives = 589/727 (81%), Gaps = 2/727 (0%)
 Frame = +2

Query: 35   MPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCS 214
            MPPKS  S    K YF+YGHRKP+Q+RPTV GGLFSNR+TINPN     S  ++   Q  
Sbjct: 1    MPPKSAQS----KPYFFYGHRKPTQHRPTVQGGLFSNRQTINPNRTTKNSPSSVT--QGD 54

Query: 215  IDINKWDPNSPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKNWCPSIVKDLDKLR 394
              + KWDP+       ++  S++FFS+AQ LSPIARYI DSFRKH NW   ++ DL+ LR
Sbjct: 55   FQLQKWDPDGVSG-QQSRDPSQEFFSLAQRLSPIARYIVDSFRKHGNWGAPLLADLNSLR 113

Query: 395  RVPPNLVAEVLK-AQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAADQV 571
            RV P LV EVLK    DP+ISS+FF+WAGKQKG+RH+F+ YNAFAY LNR N FR ADQV
Sbjct: 114  RVTPKLVTEVLKHPNLDPKISSKFFYWAGKQKGYRHDFSCYNAFAYGLNRANQFRTADQV 173

Query: 572  PDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRILDALVK 751
            P+LM MQG  P+EKQFEILIRMH DA RGLRVYY+YEKMK FGVKPRVFLYNRI+DALVK
Sbjct: 174  PELMHMQGKPPSEKQFEILIRMHGDANRGLRVYYVYEKMKKFGVKPRVFLYNRIMDALVK 233

Query: 752  TGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-NLCKPDVCA 928
            T H+D+A+SVY+DFK+DGLVEES+TFMIL+KGLCK+GRM+EVFE+L RMR N CKPDV A
Sbjct: 234  TNHLDMAMSVYDDFKKDGLVEESMTFMILIKGLCKLGRMDEVFELLGRMRENRCKPDVFA 293

Query: 929  YTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFFKEMR 1108
            YTAM++ILV+E NLDGC ++W+EM+ D V+PDV+AY+T + GLCK N+V+KGYE FKEM+
Sbjct: 294  YTAMVKILVAERNLDGCSKVWKEMQQDAVEPDVIAYSTFIAGLCKNNQVDKGYELFKEMK 353

Query: 1109 EKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNANRVD 1288
            +K  LIDR +YGSLIE+FV +GK+G ACDLLKDLI+SGYRADL+IYNS++EGLCNA R D
Sbjct: 354  QKNILIDRGIYGSLIESFVANGKVGLACDLLKDLIESGYRADLAIYNSIIEGLCNAKRTD 413

Query: 1289 KAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLSRFFS 1468
            +AYKLFQITV+E L PDF TV PILVSYA+  +MD+ C LLE++Q+L   + DDLS+FF+
Sbjct: 414  RAYKLFQITVQEDLCPDFSTVKPILVSYAESKKMDEICKLLEELQRLSHCISDDLSKFFT 473

Query: 1469 FMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDLDFEP 1648
            +MV KGDR+  ALEVFE LK K YC + IYNIL+ AL++ GE   ALTLF E++  D+EP
Sbjct: 474  YMVEKGDRIMIALEVFEYLKVKDYCGVPIYNILMEALYQNGEVNKALTLFSELRSSDYEP 533

Query: 1649 DSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAVLTLV 1828
            DSS +SN + CFV+VGDV+EA  CYN++K+MS IP+V+AY SLV GLC+IG+ D  + L+
Sbjct: 534  DSSAYSNAVQCFVEVGDVQEASICYNRIKEMSLIPSVAAYRSLVIGLCKIGQIDPAMMLI 593

Query: 1829 RDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIISGMCN 2008
            RDCLGNV SGP+EFK  LTI+H CK  DAEKV++VL+E++E+G  PD+  Y A+I GMC 
Sbjct: 594  RDCLGNVASGPIEFKCILTIIHVCKMNDAEKVMKVLDELLEEGFSPDNAVYCAVIYGMCK 653

Query: 2009 HGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESKLKSK 2188
            HGT+EEA+KVF SMR R  LTEA+L+VYDE+LIDHMKKKTA L+LSGLKFFGLESKLK+K
Sbjct: 654  HGTIEEAQKVFASMRKRKHLTEADLVVYDEMLIDHMKKKTADLLLSGLKFFGLESKLKAK 713

Query: 2189 GSTILPG 2209
            G T+L G
Sbjct: 714  GCTLLAG 720


>gb|EXC10461.1| hypothetical protein L484_008628 [Morus notabilis]
          Length = 716

 Score =  983 bits (2540), Expect = 0.0
 Identities = 474/728 (65%), Positives = 591/728 (81%), Gaps = 3/728 (0%)
 Frame = +2

Query: 29   PQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQ 208
            P  PP  +P     K YF+Y HRKPSQNRPTV GGLFSNR+++ P  N +   K  +   
Sbjct: 2    PAQPPPGKPQ----KFYFFYVHRKPSQNRPTVRGGLFSNRQSLKPRQNPHHHHKPPS--- 54

Query: 209  CSIDINKWDPNS-PQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRK-HKNWCPSIVKDL 382
               D++KWDP+  P     T T     F     LSPIAR+I D+FRK H  W P +V +L
Sbjct: 55   ---DLSKWDPHLLPSPSSTTTTTPTLSF-----LSPIARFITDAFRKNHSKWGPPVVTEL 106

Query: 383  DKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAA 562
             KLRRV PNLV EVLK QTDP ++S+FFHWAGKQKG+RHNFASYNAFAYCLNR + +R+A
Sbjct: 107  HKLRRVTPNLVTEVLKVQTDPSLASKFFHWAGKQKGYRHNFASYNAFAYCLNRGDRYRSA 166

Query: 563  DQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRILDA 742
            DQVP LM  QG  P+EKQFEILIRMHSDA RGLRVYY YE MK FG+KPRVFL+NR++DA
Sbjct: 167  DQVPHLMEAQGKPPSEKQFEILIRMHSDANRGLRVYYAYENMKKFGIKPRVFLFNRVMDA 226

Query: 743  LVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRN-LCKPD 919
            LV+TG++DLA+SVY DFKE GLVEESVTFMIL+KGLCK GR+ E+ E+L RMR  LCKPD
Sbjct: 227  LVRTGYLDLALSVYGDFKEAGLVEESVTFMILIKGLCKAGRVEEMLEVLGRMRGELCKPD 286

Query: 920  VCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFFK 1099
            V AYTAM+R++V EGNLDGCLR+WEEM+ D V+PDV+AY T++ GLCKG +VEKGYE FK
Sbjct: 287  VFAYTAMVRVMVGEGNLDGCLRVWEEMRSDRVEPDVIAYGTVIAGLCKGGRVEKGYELFK 346

Query: 1100 EMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNAN 1279
            EM+ KG L+DR++YG+L++AFV DGK+G ACD+ KDL++SGYRADL IYN +++GLCNA 
Sbjct: 347  EMKGKGALVDRAIYGALVKAFVEDGKVGLACDVFKDLVNSGYRADLDIYNYLIQGLCNAK 406

Query: 1280 RVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLSR 1459
            RVDKAYKLF++TV+EGL P+FVT+NPIL+ YA+  ++D+FC LL +MQKLG+ V+DDL++
Sbjct: 407  RVDKAYKLFRVTVQEGLGPNFVTINPILLCYAEMRKIDEFCDLLVQMQKLGISVVDDLTK 466

Query: 1460 FFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDLD 1639
            FFSF+V KGD ++ ALEVFEDLK +GY S+SIYNIL+ A ++   A+ AL+L  EMKD++
Sbjct: 467  FFSFVVRKGDGLKMALEVFEDLKVRGYYSVSIYNILMEAFYKTEMAKKALSLLNEMKDMN 526

Query: 1640 FEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAVL 1819
             +PDSST+S  I CFV+ GD+KEAC+C+NK+ +MS +P+VSAYCSL +GLC IGE DA +
Sbjct: 527  AQPDSSTYSVAIECFVEEGDLKEACACHNKIIEMSCVPSVSAYCSLARGLCNIGEIDAAM 586

Query: 1820 TLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIISG 1999
             LVRDCL +V+SG MEFKYALT+LHACKSG +EKVI VL+E+M++GCPPD+V   A+ISG
Sbjct: 587  MLVRDCLASVSSGSMEFKYALTVLHACKSGKSEKVIGVLDELMQEGCPPDNVVLSAVISG 646

Query: 2000 MCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESKL 2179
            MC HGT+EEARKVF ++R+R L++EA  IVYDE+LIDHMKKKTA LV+SGLKFFGLESKL
Sbjct: 647  MCRHGTIEEARKVFSNLRERKLMSEARTIVYDEILIDHMKKKTADLVVSGLKFFGLESKL 706

Query: 2180 KSKGSTIL 2203
            K+KGST+L
Sbjct: 707  KAKGSTLL 714


>ref|XP_007008770.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao] gi|508725683|gb|EOY17580.1| Pentatricopeptide
            repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 716

 Score =  981 bits (2536), Expect = 0.0
 Identities = 483/727 (66%), Positives = 592/727 (81%), Gaps = 4/727 (0%)
 Frame = +2

Query: 35   MPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCS 214
            MPPKS P+  T K YF+YGHRKPSQNRP V+GGLFSNR+ +   P          P Q S
Sbjct: 1    MPPKSLPAK-TPKPYFFYGHRKPSQNRPVVYGGLFSNRQILKTPPT---------PPQPS 50

Query: 215  --IDINKWDPNSPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHK-NWCPSIVKDLD 385
               D+ KWDP    + P   +    + +  + LSPIAR+I D+FRK++  W P++V +L+
Sbjct: 51   PPFDLRKWDPYYLSQNPSPPSTPNPYQN--RKLSPIARFIVDAFRKNQYTWGPTVVFELN 108

Query: 386  KLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAAD 565
            KLRRV  +LVAEVLK + DP ++S+FFHWAGKQKGF+HNFASYNA AYCLNR   FRAAD
Sbjct: 109  KLRRVTASLVAEVLKVENDPVLASKFFHWAGKQKGFKHNFASYNALAYCLNRNGRFRAAD 168

Query: 566  QVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRILDAL 745
            Q+P+LM  QG QPTEKQFEILIRMH+D  RG RVYY+Y+KMKNFG+KPRVFLYNRI+DAL
Sbjct: 169  QLPELMDSQGKQPTEKQFEILIRMHADNNRGQRVYYVYQKMKNFGIKPRVFLYNRIMDAL 228

Query: 746  VKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRN-LCKPDV 922
            VKTG++DLA+SVYEDF+ DGLVEES+TFMIL+KGLCK GR+ E+ E+L RMR  LCKPDV
Sbjct: 229  VKTGYLDLALSVYEDFRGDGLVEESITFMILIKGLCKAGRIEEMLEVLGRMREKLCKPDV 288

Query: 923  CAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFFKE 1102
             AYTAM+RILVSE NLDGCL +WEEM+ DGV+PDVMAY TLVTGLCKG +V++GYE F+E
Sbjct: 289  FAYTAMVRILVSEKNLDGCLLVWEEMERDGVEPDVMAYVTLVTGLCKGGRVQRGYELFRE 348

Query: 1103 MREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNANR 1282
            M++KG LIDR+ YG LIE FV DGK+GSACDLLKDL+DSGYRADL IYNS++EGLC+A R
Sbjct: 349  MKDKGILIDRATYGVLIEGFVKDGKVGSACDLLKDLVDSGYRADLGIYNSLIEGLCDARR 408

Query: 1283 VDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLSRF 1462
            VD+AYKLFQ+TV+EGL P+F TVNP+LV++A+  RM+DFC LLE+MQKLG  VIDDLS+F
Sbjct: 409  VDRAYKLFQVTVQEGLEPEFATVNPMLVAFAEMRRMNDFCKLLEQMQKLGFSVIDDLSKF 468

Query: 1463 FSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDLDF 1642
            FSF+VGK +R   A++VF++LK KGY  + IYNIL+ AL + G+ + AL+LF+EMK L+F
Sbjct: 469  FSFVVGKEERTVLAIQVFDELKVKGYTGVPIYNILMEALRKTGKVKQALSLFQEMKGLNF 528

Query: 1643 EPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAVLT 1822
            EPDSST+   I CFV+  ++KEAC C+N + +MS +P++ AY SL KGLC+IGE DA + 
Sbjct: 529  EPDSSTYGTAIICFVEDENIKEACVCHNNIIEMSCVPSIDAYYSLAKGLCKIGEIDAAMM 588

Query: 1823 LVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIISGM 2002
            LVRDCLGNVT+GPM FKYALT+LHACKSG  E V EVLNEMM++G PPD++ Y AIISGM
Sbjct: 589  LVRDCLGNVTNGPMAFKYALTVLHACKSG-GETVTEVLNEMMQEGWPPDNIIYSAIISGM 647

Query: 2003 CNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESKLK 2182
            C +GT+EEARKVF ++R R LLTEAN IVYDE+LI+HM+KK A LVLSGLKFFGLESKLK
Sbjct: 648  CKYGTIEEARKVFANLRTRKLLTEANTIVYDEILIEHMEKKAAELVLSGLKFFGLESKLK 707

Query: 2183 SKGSTIL 2203
            +KGST+L
Sbjct: 708  AKGSTLL 714


>ref|XP_002304774.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222842206|gb|EEE79753.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 728

 Score =  979 bits (2530), Expect = 0.0
 Identities = 482/732 (65%), Positives = 587/732 (80%), Gaps = 7/732 (0%)
 Frame = +2

Query: 29   PQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQ 208
            P  PP S+P     K YF+YGHRKPSQNRP V GGLF+NR+T+ P P        + P +
Sbjct: 5    PPPPPPSKPL----KPYFFYGHRKPSQNRPVVRGGLFTNRQTVKPQP----PKNPITPFK 56

Query: 209  CSIDINKWDP-----NSPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN-WCPSI 370
               D++KWDP     + PQ   P   +S    +++Q LSPIAR+I D+FRK++N W P +
Sbjct: 57   -PFDLHKWDPQQNLPHQPQPSKPQSPRSRHSLALSQRLSPIARFILDAFRKNRNQWGPEV 115

Query: 371  VKDLDKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNM 550
            V +L KLRRV P+LVAEVLK + +P+++++FFHWAGKQKGF+H FASYNAFAY LNR N 
Sbjct: 116  VTELCKLRRVTPDLVAEVLKVENNPQLATKFFHWAGKQKGFKHTFASYNAFAYNLNRSNF 175

Query: 551  FRAADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNR 730
            FRAADQ+P+LM  QG  PTEKQFEILIRMHSDA RGLRVYY+Y+KM  FGVKPRVFLYNR
Sbjct: 176  FRAADQLPELMEAQGKPPTEKQFEILIRMHSDANRGLRVYYVYQKMVKFGVKPRVFLYNR 235

Query: 731  ILDALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-NL 907
            I+D+L+KTGH+DLA+SVYEDF+ DGLVEESVT+MIL+KGLCK GR+ E+ E+L RMR NL
Sbjct: 236  IMDSLIKTGHLDLALSVYEDFRRDGLVEESVTYMILIKGLCKAGRIEEMMEVLGRMRENL 295

Query: 908  CKPDVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGY 1087
            CKPDV AYTAM+R L  EGNLD CLR+WEEMK DGV+PDVMAY TLVT LCKG +V+KGY
Sbjct: 296  CKPDVFAYTAMVRALAGEGNLDACLRVWEEMKRDGVEPDVMAYVTLVTALCKGGRVDKGY 355

Query: 1088 EFFKEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGL 1267
            E FKEM+ +  LIDR +YG L+EAFV DGKIG ACDLLKDL+DSGYRADL IYNS++EG 
Sbjct: 356  EVFKEMKGRRILIDRGIYGILVEAFVADGKIGLACDLLKDLVDSGYRADLRIYNSLIEGF 415

Query: 1268 CNANRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVID 1447
            CN  RVDKA+KLFQ+TV+EGL  DF TVNP+L+SYA+  +MDDFC LL++M+KLG  V D
Sbjct: 416  CNVKRVDKAHKLFQVTVQEGLERDFKTVNPLLMSYAEMKKMDDFCKLLKQMEKLGFSVFD 475

Query: 1448 DLSRFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEM 1627
            DLS+FFS++VGK +R   ALEVFEDLK KGY S+ IYNIL+ AL   GE + AL+LF EM
Sbjct: 476  DLSKFFSYVVGKPERTMMALEVFEDLKVKGYSSVPIYNILMEALLTIGEMKRALSLFGEM 535

Query: 1628 KDLDFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGET 1807
            KDL+ +PDS+T+S  I CFV+ G+++EAC  +NK+ +M  +P+V+AYCSL KGLC  GE 
Sbjct: 536  KDLN-KPDSTTYSIAIICFVEDGNIQEACVSHNKIVEMFCVPSVAAYCSLAKGLCDNGEI 594

Query: 1808 DAVLTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFA 1987
            DA + LVRDCL +V SGPMEFKY+LTILHACK+G AEKVI+VLNEMM++GC P++V Y A
Sbjct: 595  DAAMMLVRDCLASVESGPMEFKYSLTILHACKTGGAEKVIDVLNEMMQEGCTPNEVIYSA 654

Query: 1988 IISGMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGL 2167
            IISGMC HGT EEARKVF  +R R +LTEA  IV+DE+LI+HMKKKTA LVL+GLKFFGL
Sbjct: 655  IISGMCKHGTFEEARKVFTDLRQRKILTEAKTIVFDEILIEHMKKKTADLVLAGLKFFGL 714

Query: 2168 ESKLKSKGSTIL 2203
            ESKLK+ GST+L
Sbjct: 715  ESKLKAMGSTLL 726


>ref|XP_006605814.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            isoform X2 [Glycine max] gi|571565751|ref|XP_003555182.2|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g20740-like isoform X1 [Glycine max]
          Length = 764

 Score =  961 bits (2485), Expect = 0.0
 Identities = 471/729 (64%), Positives = 582/729 (79%), Gaps = 6/729 (0%)
 Frame = +2

Query: 38   PPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCSI 217
            PP + P   TNK YF+YGHR PSQNRPTV GGLFSNR+T+NPNP+  K      P     
Sbjct: 47   PPFTTPKP-TNKFYFFYGHRNPSQNRPTVRGGLFSNRQTLNPNPSQPK------PTTKPF 99

Query: 218  DINKWDPN---SPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN-WCPSIVKDLD 385
            +I  WDP+   +P   P   T S    S +  LSPIAR+I D+FR++ N WCP++  +L 
Sbjct: 100  NIKNWDPHFLSNPNSNPSPSTLS----SASLRLSPIARFIVDAFRRNDNKWCPNVAAELS 155

Query: 386  KLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAAD 565
            KLRR+ PNLVAEVLK QT+  ++S+FFHWAG Q+G+ HNFASYNA AYCLNR + FRAAD
Sbjct: 156  KLRRITPNLVAEVLKVQTNHTLASKFFHWAGSQRGYHHNFASYNALAYCLNRHHQFRAAD 215

Query: 566  QVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKN-FGVKPRVFLYNRILDA 742
            Q+P+LM  QG  P+EKQFEILIRMHSDA RGLRVY++YEKM+N FGVKPRVFLYNR++DA
Sbjct: 216  QLPELMESQGKPPSEKQFEILIRMHSDANRGLRVYHVYEKMRNKFGVKPRVFLYNRVMDA 275

Query: 743  LVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRN-LCKPD 919
            LV+TGH+DLA+SVY+D KEDGLVEESVTFM+LVKGLCK GR++E+ E+L RMR  LCKPD
Sbjct: 276  LVRTGHLDLALSVYDDLKEDGLVEESVTFMVLVKGLCKCGRIDEMLEVLGRMRERLCKPD 335

Query: 920  VCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFFK 1099
            V AYTA+++ILV  GNLD CLR+WEEMK D V+PDV AY T++ GL KG +V++GYE F+
Sbjct: 336  VFAYTALVKILVPAGNLDACLRVWEEMKRDRVEPDVKAYATMIVGLAKGGRVQEGYELFR 395

Query: 1100 EMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNAN 1279
            EM+ KGCL+DR +YG+L+EAFV +GK+  A DLLKDL+ SGYRADL IY  ++EGLCN N
Sbjct: 396  EMKGKGCLVDRVIYGALVEAFVAEGKVELAFDLLKDLVSSGYRADLGIYICLIEGLCNLN 455

Query: 1280 RVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLSR 1459
            RV KAYKLFQ+TVREGL PDF+TV P+LV+YA+ +RM++FC LLE+MQKLG PVI DLS+
Sbjct: 456  RVQKAYKLFQLTVREGLEPDFLTVKPLLVAYAEANRMEEFCKLLEQMQKLGFPVIADLSK 515

Query: 1460 FFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDLD 1639
            FFS +V K   +  ALE F  LK KG+ S+ IYNI + +LH+ GE + AL+LF+EMK L 
Sbjct: 516  FFSVLVEKKGPIM-ALETFGQLKEKGHVSVEIYNIFMDSLHKIGEVKKALSLFDEMKGLS 574

Query: 1640 FEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAVL 1819
             +PDS T+   I C VD+G++KEAC+C+N++ +MS IP+V+AY SL KGLC+IGE D  +
Sbjct: 575  LKPDSFTYCTAILCLVDLGEIKEACACHNRIIEMSCIPSVAAYSSLTKGLCQIGEIDEAM 634

Query: 1820 TLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIISG 1999
             LVRDCLGNV+ GP+EFKY+LTI+HACKS  AEKVI+VLNEM+EQGC  D+V Y +IISG
Sbjct: 635  LLVRDCLGNVSDGPLEFKYSLTIIHACKSNVAEKVIDVLNEMIEQGCSLDNVIYCSIISG 694

Query: 2000 MCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESKL 2179
            MC HGT+EEARKVF ++R+RN LTE+N IVYDELLIDHMKKKTA LVLS LKFFGLESKL
Sbjct: 695  MCKHGTIEEARKVFSNLRERNFLTESNTIVYDELLIDHMKKKTADLVLSSLKFFGLESKL 754

Query: 2180 KSKGSTILP 2206
            K+KG  +LP
Sbjct: 755  KAKGCKLLP 763


>gb|EYU42962.1| hypothetical protein MIMGU_mgv1a021045mg [Mimulus guttatus]
          Length = 726

 Score =  955 bits (2468), Expect = 0.0
 Identities = 465/729 (63%), Positives = 577/729 (79%), Gaps = 7/729 (0%)
 Frame = +2

Query: 35   MPPKSQPS---TITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPN 205
            MPP S P    T  NK YF+YGHRKP+Q+RPTV GGLFSNR+T+NP  NF + +    P 
Sbjct: 1    MPPPSPPPGALTKPNKPYFFYGHRKPTQSRPTVRGGLFSNRQTVNPE-NFRRRTAAHEP- 58

Query: 206  QCSIDINKWDPNSP--QKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKNWCPSIVKD 379
                D+ KWDP+    +K P  K  SEKFFS+A+ LSPIARYI D+FRKHK W P +V++
Sbjct: 59   ---FDLQKWDPDDEANRKPPYGKDPSEKFFSLAKNLSPIARYIVDAFRKHKQWSPQLVQE 115

Query: 380  LDKLRRVPPNLVAEVLK-AQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFR 556
            L++LRRV P LV EVLK    DPR+SS+FFHWAGKQKG++H+FA YNA+AY LNR N FR
Sbjct: 116  LNRLRRVTPTLVTEVLKFPDVDPRVSSKFFHWAGKQKGYKHDFACYNAYAYFLNRSNHFR 175

Query: 557  AADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRIL 736
             ADQ+P+LM MQG  PTEKQFEILIRMH+D+ RGLRV+Y+YEKMK FGVKPRVFLYNRI+
Sbjct: 176  EADQLPELMHMQGKPPTEKQFEILIRMHADSNRGLRVHYVYEKMKKFGVKPRVFLYNRIM 235

Query: 737  DALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-NLCK 913
            DALVKT H+DLA+SVY DFKE+GL EE+VT+MIL+KGLCK GR++E+F+++ RMR NLCK
Sbjct: 236  DALVKTNHLDLAMSVYRDFKEEGLSEENVTYMILIKGLCKAGRLDEMFDLVDRMRKNLCK 295

Query: 914  PDVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEF 1093
            PDV AYTAM+++LVSEGNL+GCL +WEEMK DGV+PD MAY+TL+  LC+G  V+KGYE 
Sbjct: 296  PDVFAYTAMVKVLVSEGNLNGCLTVWEEMKKDGVEPDSMAYSTLIMALCEGKFVDKGYEL 355

Query: 1094 FKEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCN 1273
            FKEM+ + CLIDR++YGSLIEA+VVDGK+GSACDLLKDLI+SGYRADL+IYNS+++GLCN
Sbjct: 356  FKEMKGRNCLIDRAIYGSLIEAYVVDGKVGSACDLLKDLINSGYRADLAIYNSLIKGLCN 415

Query: 1274 ANRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDL 1453
            +  VD+AYKLFQ  +RE L PDF TVNPIL+ YA+  ++ DFC LLE+M+KLG  + + L
Sbjct: 416  SKLVDRAYKLFQAAIREDLQPDFNTVNPILICYAELKKLHDFCKLLEQMEKLGFSINESL 475

Query: 1454 SRFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKD 1633
              FFS +V   D V  ALEVFE LK + Y ++ IYNIL+ AL + G+ + AL LF E+KD
Sbjct: 476  LDFFSCVVETNDGVATALEVFEFLKIRNYINVPIYNILMDALFKNGDEKKALLLFHELKD 535

Query: 1634 LDFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDA 1813
             D  PDSST    I C++ +GDV+EAC+ YN +K+MS +P++ AY +LVKGL  IGE DA
Sbjct: 536  ADLAPDSSTLCIAISCYIKIGDVREACNTYNTIKEMSSVPSLDAYYALVKGLSDIGEVDA 595

Query: 1814 VLTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAII 1993
             + LVRDCL +VT GPMEFKYALTI+H CKS DA KVIEV+ EM EQGC P  +T  A++
Sbjct: 596  AMVLVRDCLAHVTGGPMEFKYALTIIHVCKSNDARKVIEVVGEMAEQGCTPSSITCTAVV 655

Query: 1994 SGMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLES 2173
             GMC HG++EEAR VF SMR+   +TEA++IVYDE L+DHMK+ TA LVLSG+KFFGLES
Sbjct: 656  YGMCKHGSIEEARNVFLSMRESKFITEADVIVYDEFLVDHMKETTADLVLSGIKFFGLES 715

Query: 2174 KLKSKGSTI 2200
            KLK+KG  I
Sbjct: 716  KLKAKGVVI 724


>ref|XP_003598903.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355487951|gb|AES69154.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 767

 Score =  947 bits (2449), Expect = 0.0
 Identities = 475/728 (65%), Positives = 575/728 (78%), Gaps = 9/728 (1%)
 Frame = +2

Query: 35   MPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCS 214
            MPP  Q  T  NK YF+YGHRKPSQNRPTV GGLFSNRKT+ P     KS+K  N    S
Sbjct: 1    MPP--QTPTPPNKFYFFYGHRKPSQNRPTVRGGLFSNRKTLTPPKP--KSTKPTN----S 52

Query: 215  IDINKWDP------NSPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN-WCPSIV 373
              I KWDP      NSP   P      E  FS +  LSPIAR+I D+FRK+ N W P +V
Sbjct: 53   FQIQKWDPHFLSQPNSPS--PSPSPSPEATFSASLRLSPIARFILDAFRKNNNNWGPPVV 110

Query: 374  KDLDKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMF 553
             +L+KLRRV P LVAEVLK QT+P ++ +FFHW  KQKG+ HNFASYNAF YCLNR N F
Sbjct: 111  TELNKLRRVTPTLVAEVLKVQTNPTLAFKFFHWVEKQKGYHHNFASYNAFTYCLNRANHF 170

Query: 554  RAADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKN-FGVKPRVFLYNR 730
            RAADQ+P+LM  QG  P+EKQFEILIRMHSDAGRGLRVY++Y+KM+N FGVKPRVFLYNR
Sbjct: 171  RAADQLPELMDAQGKPPSEKQFEILIRMHSDAGRGLRVYHVYDKMRNKFGVKPRVFLYNR 230

Query: 731  ILDALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRN-L 907
            I+DALVKTGH+DLA+SVY DF+EDGLVEESVTFMIL+KGLCK G+++E+ E+L RMR  L
Sbjct: 231  IMDALVKTGHLDLALSVYNDFREDGLVEESVTFMILIKGLCKGGKIDEMLEVLGRMREKL 290

Query: 908  CKPDVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGY 1087
            CKPDV AYTA++RI+V EGNLDGCLR+W+EMK D VDPDVMAY T++ GL KG +V +GY
Sbjct: 291  CKPDVFAYTALVRIMVKEGNLDGCLRVWKEMKRDRVDPDVMAYGTIIGGLAKGGRVSEGY 350

Query: 1088 EFFKEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGL 1267
            E FKEM+ KG LIDR++YGSL+E+FV   K+G A DLLKDL+ SGYRADL +YN+++EGL
Sbjct: 351  ELFKEMKSKGHLIDRAIYGSLVESFVAGNKVGLAFDLLKDLVSSGYRADLGMYNNLIEGL 410

Query: 1268 CNANRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVID 1447
            CN N+V+KAYKLFQ+T++EGL PDF++V P+L++YA+  RM++F +LLEKM+KLG PVID
Sbjct: 411  CNLNKVEKAYKLFQVTIQEGLEPDFLSVKPLLLAYAEAKRMEEFFMLLEKMKKLGFPVID 470

Query: 1448 DLSRFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEM 1627
            DLS+FFS +V K      ALE+F  LK K Y S+ IYNI + +LH  G+ + AL+LF+E+
Sbjct: 471  DLSKFFSHLVEKKGP-EMALEIFTHLKEKSYVSVEIYNIFMESLHLSGKVEKALSLFDEI 529

Query: 1628 KDLDFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGET 1807
            K  D EPDSST++  I C VD G +KEAC C+NK+ +MS IP+V+AY  L KGLC IGE 
Sbjct: 530  KGSDLEPDSSTYNIAILCLVDHGQIKEACECHNKIIEMSSIPSVAAYNCLAKGLCNIGEI 589

Query: 1808 DAVLTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFA 1987
            D  + LVRDCLGNVTSGPMEFKY LTI+  CKS  AEK+I+VLNEMM++GC  D+V   A
Sbjct: 590  DEAMLLVRDCLGNVTSGPMEFKYCLTIIRMCKSNVAEKLIDVLNEMMQEGCSLDNVVCSA 649

Query: 1988 IISGMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGL 2167
            IISGMC +GT+EEARKVF  +R+R LLTE++ IVYDELLIDHMKKKTA LV+SGLKFFGL
Sbjct: 650  IISGMCKYGTIEEARKVFSILRERKLLTESDTIVYDELLIDHMKKKTADLVISGLKFFGL 709

Query: 2168 ESKLKSKG 2191
            ESKLKSKG
Sbjct: 710  ESKLKSKG 717


>ref|XP_004494981.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Cicer arietinum]
          Length = 720

 Score =  941 bits (2431), Expect = 0.0
 Identities = 468/725 (64%), Positives = 573/725 (79%), Gaps = 6/725 (0%)
 Frame = +2

Query: 50   QPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCSIDINK 229
            Q  T  NK YFYYGHRKPSQNRPTV GGLFSNR+T+ P     KS  T  P     +I K
Sbjct: 4    QTPTTPNKFYFYYGHRKPSQNRPTVRGGLFSNRQTLTPP----KSKTTSRP----FEIQK 55

Query: 230  WDPN---SPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHK-NWCPSIVKDLDKLRR 397
            WDP+        PP    SE  FS +  LSPIAR+I D+FRK+   W PS++ +L+KLRR
Sbjct: 56   WDPHFLSQQNPSPPPSPSSEASFSPSLRLSPIARFIVDAFRKNSYKWGPSVITELNKLRR 115

Query: 398  VPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAADQVPD 577
            VPPNLVAEVLK QT+P ++ +FFHW   QKG+ HNFAS+NAFAYCLNR N F AADQ+P+
Sbjct: 116  VPPNLVAEVLKVQTNPTLAFKFFHWVENQKGYHHNFASFNAFAYCLNRANHFHAADQLPE 175

Query: 578  LMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKN-FGVKPRVFLYNRILDALVKT 754
            LM  QG  P+EKQFEILIRMHSDAGRGLR Y++Y+KM+N FGVKPRVFLYNRI+DALVKT
Sbjct: 176  LMDAQGKPPSEKQFEILIRMHSDAGRGLRAYHVYDKMRNKFGVKPRVFLYNRIMDALVKT 235

Query: 755  GHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRN-LCKPDVCAY 931
             H+DLA+SVY DF+EDGLVEESVTFM+LVKGLCK GR+ E+ E+L RMR  L KPDV AY
Sbjct: 236  RHLDLALSVYNDFREDGLVEESVTFMVLVKGLCKAGRIGEMLEVLGRMREKLYKPDVFAY 295

Query: 932  TAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFFKEMRE 1111
            TA++RI+V+EGNLDGCLR+WEEMK DGV PDVMAY T++ GL K  +V++GYE FKEM+ 
Sbjct: 296  TALVRIMVAEGNLDGCLRVWEEMKRDGVVPDVMAYDTIIGGLAKEGRVKEGYELFKEMKS 355

Query: 1112 KGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNANRVDK 1291
            KG LIDR++YGSLIE+FVV  K+G A DLLKDL++SGYRADL IYN++++GLCN N+V+K
Sbjct: 356  KGHLIDRAIYGSLIESFVVGNKVGLAFDLLKDLVNSGYRADLGIYNNLIKGLCNLNKVEK 415

Query: 1292 AYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLSRFFSF 1471
            AYKLFQ+T++EGL PDF++V P+L++YA+  RM++F  LL+KM+KLG PVI+DLS+FFS 
Sbjct: 416  AYKLFQVTIQEGLEPDFLSVKPLLLAYAEAKRMEEFFKLLKKMEKLGFPVIEDLSKFFSH 475

Query: 1472 MVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDLDFEPD 1651
            +V K   V  +LEVF  LK KGY S+ IYN+L+ +L   GE + AL+LF+E+K  D +PD
Sbjct: 476  LVEKKGPVM-SLEVFTHLKEKGYVSVEIYNVLMDSLRLSGEVKKALSLFDEIKGSDMKPD 534

Query: 1652 SSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAVLTLVR 1831
            SST++  I C V  G+++EAC C+NK+ +MS IP+V+ Y  L KGLC IGE D  + LVR
Sbjct: 535  SSTYNIAILCLVARGEIQEACVCHNKIIEMSCIPSVAVYHRLAKGLCEIGEIDEAMMLVR 594

Query: 1832 DCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIISGMCNH 2011
            DCLGN TSGPMEFKY LT++H CK  DAEKVI+VLNEMM+QG P  +V   AIISGMC H
Sbjct: 595  DCLGNATSGPMEFKYCLTLIHICKFNDAEKVIDVLNEMMQQGFPLCNVVCSAIISGMCKH 654

Query: 2012 GTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESKLKSKG 2191
            GT+EEARKVF ++RDR LLTE++ IVYDELLIDHMKKKTA LV+SGLKFFGLESKLK KG
Sbjct: 655  GTIEEARKVFSNLRDRKLLTESDTIVYDELLIDHMKKKTADLVISGLKFFGLESKLKLKG 714

Query: 2192 STILP 2206
              +LP
Sbjct: 715  CKLLP 719


>ref|XP_004515007.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Cicer arietinum]
          Length = 720

 Score =  939 bits (2427), Expect = 0.0
 Identities = 467/730 (63%), Positives = 573/730 (78%), Gaps = 6/730 (0%)
 Frame = +2

Query: 35   MPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCS 214
            MPP  Q  T  NK YFYYGHR+PSQNRPTV GGLFSNR+T+ P     K   T  P    
Sbjct: 1    MPP--QTPTTPNKFYFYYGHRQPSQNRPTVRGGLFSNRQTLTPP----KPKTTSRP---- 50

Query: 215  IDINKWDPN---SPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHK-NWCPSIVKDL 382
             +I KWDP+        PP        FS +  LSPI R+I D+FRK+   W PS++ +L
Sbjct: 51   FEIQKWDPHFLSQQNPSPPPSPSPAASFSASLRLSPIVRFIVDAFRKNGYKWGPSVITEL 110

Query: 383  DKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAA 562
             K RRVPPNLVAEVLK QT+P I+ +FF W   QKG+ HNFAS+NAFAYCLNR N F AA
Sbjct: 111  SKFRRVPPNLVAEVLKVQTNPTIAFKFFRWVENQKGYHHNFASFNAFAYCLNRANHFHAA 170

Query: 563  DQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKN-FGVKPRVFLYNRILD 739
            DQ+P+LM  QG  P+EKQFEILIRMHSDAGRGLRVY++Y+KM+N FGVKPRVFLYNRI+D
Sbjct: 171  DQLPELMDAQGKPPSEKQFEILIRMHSDAGRGLRVYHVYDKMRNKFGVKPRVFLYNRIMD 230

Query: 740  ALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRN-LCKP 916
            ALVKTGH+DLA+SVY DF+EDGLVEESVT+M+LVKGLCK GR+ E+ E+L RMR  LCKP
Sbjct: 231  ALVKTGHLDLALSVYNDFREDGLVEESVTYMVLVKGLCKAGRIGEMLEVLGRMREKLCKP 290

Query: 917  DVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFF 1096
            DVCAYTA++RI+V+EGNLDGCLR+WEEMK DGV PDVMAY T++ GL K  +V++GYE F
Sbjct: 291  DVCAYTALVRIMVAEGNLDGCLRVWEEMKRDGVVPDVMAYGTVIGGLAKEGRVKEGYELF 350

Query: 1097 KEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNA 1276
            KEM+ KG LIDR++YGSLIE+FV   K+G A DLL+DL++SGYRADL IYN+++EGLCN 
Sbjct: 351  KEMKSKGHLIDRAIYGSLIESFVAGNKVGLAFDLLRDLVNSGYRADLGIYNNLIEGLCNL 410

Query: 1277 NRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLS 1456
            N+V+KAYKLFQ+T++EGL PDF++V  +L++YA+  RM++F  LL+KM+KLG P+IDDLS
Sbjct: 411  NKVEKAYKLFQVTIQEGLEPDFLSVKSLLLAYAEAKRMEEFFKLLKKMEKLGFPLIDDLS 470

Query: 1457 RFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDL 1636
            +FFS +V K   V  +LEVF  LK KGY S+ IYN+L+ +L   GE + AL+LF+E+K  
Sbjct: 471  KFFSHLVEKKGPVI-SLEVFIHLKEKGYVSVEIYNVLMDSLRLSGELKKALSLFDEIKGS 529

Query: 1637 DFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAV 1816
            D +PDSST++  I C VD G+++EAC C+NK+ +MS IP+V+ Y  L KGLC IGE D  
Sbjct: 530  DMKPDSSTYNIAILCLVDCGEIQEACVCHNKIIEMSCIPSVAVYHRLAKGLCEIGEIDEA 589

Query: 1817 LTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIIS 1996
            + LVRDCLGN TSGPMEFKY LT++H CK  DAEKVI+VLNEMM+QG P  +V   AIIS
Sbjct: 590  MMLVRDCLGNATSGPMEFKYCLTLIHICKFNDAEKVIDVLNEMMQQGFPLCNVVCSAIIS 649

Query: 1997 GMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESK 2176
            GMC HGT+EEARKVF ++R+R LLTE++ IVYDELLIDHMKKKTA LV+SGLKFFGLESK
Sbjct: 650  GMCKHGTIEEARKVFSNLRNRKLLTESDTIVYDELLIDHMKKKTADLVISGLKFFGLESK 709

Query: 2177 LKSKGSTILP 2206
            LKSKG  +LP
Sbjct: 710  LKSKGCKLLP 719


>ref|XP_007149018.1| hypothetical protein PHAVU_005G033500g [Phaseolus vulgaris]
            gi|561022282|gb|ESW21012.1| hypothetical protein
            PHAVU_005G033500g [Phaseolus vulgaris]
          Length = 715

 Score =  935 bits (2417), Expect = 0.0
 Identities = 467/735 (63%), Positives = 574/735 (78%), Gaps = 8/735 (1%)
 Frame = +2

Query: 29   PQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQ 208
            PQ+P  ++P    N  YF+YGHRKPSQNRPTV GGLFSNR+T+ P+      +K  N   
Sbjct: 3    PQVPQPNKP----NNFYFFYGHRKPSQNRPTVRGGLFSNRQTLTPSSKPNLKTKPFN--- 55

Query: 209  CSIDINKWDPN-----SPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN-WCPSI 370
                I  WDP+     SP+  PP+ T           LSPIAR+I D+FRK+ N WCP++
Sbjct: 56   ----IKDWDPHFLSNPSPRSSPPSPTLR---------LSPIARFIVDAFRKNDNKWCPNV 102

Query: 371  VKDLDKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNM 550
            V +L KLRRV PNLVAEVLK QT+  ++S+FFHWA  QKG+ HNFASYNA AYCLNR + 
Sbjct: 103  VAELKKLRRVTPNLVAEVLKVQTNHALASKFFHWANNQKGYHHNFASYNALAYCLNRSHQ 162

Query: 551  FRAADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKN-FGVKPRVFLYN 727
            FRAADQ+P+LM   G  P+EKQFEILIRMHSDA RGLRVYY+Y+KM+N FGVKPRVFLYN
Sbjct: 163  FRAADQLPELMDSHGRPPSEKQFEILIRMHSDANRGLRVYYVYDKMRNKFGVKPRVFLYN 222

Query: 728  RILDALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-N 904
            R++DAL KTGH+DL +SVY+DFKEDGLVEESVTFM+LVKGLCK GR++E+ E+L RMR +
Sbjct: 223  RVMDALFKTGHLDLGLSVYDDFKEDGLVEESVTFMLLVKGLCKGGRIDEMLEVLGRMRES 282

Query: 905  LCKPDVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKG 1084
            LCKPDV AYTA++RILV  G+LD CLR+WEEMK DGV  D  AY T++ GL KG +V++G
Sbjct: 283  LCKPDVFAYTALVRILVRAGDLDACLRVWEEMKRDGVVVDPKAYATMIVGLAKGGRVQEG 342

Query: 1085 YEFFKEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEG 1264
            YE FKEM+ KG L+DR +YG L+EAFV  GK+G A DLLKDL+ SGY ADL IYN ++EG
Sbjct: 343  YELFKEMKSKGFLVDRVIYGKLVEAFVAGGKVGLAFDLLKDLVSSGYTADLEIYNCLIEG 402

Query: 1265 LCNANRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVI 1444
            LCN  ++ KAYKLFQ+TV EGL PDF+TV P+LV+YA+ +RM++FC LLEKMQKLG PV+
Sbjct: 403  LCNLKKLQKAYKLFQVTVGEGLEPDFLTVKPLLVAYAEANRMEEFCKLLEKMQKLGFPVL 462

Query: 1445 DDLSRFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEE 1624
             DLS+FFS +V K      A+E F  LK KG+ S+ IYNIL  +L++ GE + AL+LF+E
Sbjct: 463  ADLSKFFSVLVEKNGPTM-AVEAFAHLKEKGHVSVEIYNILTDSLYKIGEEKKALSLFDE 521

Query: 1625 MKDLDFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGE 1804
            MK +  EPDS T+S VI C VD+G+++EAC C+NK+ +MS IP+V+AY SL KGLC+IGE
Sbjct: 522  MKSM-MEPDSITYSIVIQCLVDLGEIQEACVCHNKIIEMSCIPSVAAYRSLAKGLCKIGE 580

Query: 1805 TDAVLTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYF 1984
             D  + LVRDCLG+V+ GPMEFKY+LTI+HACKS DAEKVI VLNEMMEQGC  D+V Y 
Sbjct: 581  IDEAMMLVRDCLGSVSDGPMEFKYSLTIIHACKSNDAEKVIGVLNEMMEQGCSLDNVIYS 640

Query: 1985 AIISGMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFG 2164
            AIISGMC HGT+EEARKVF ++R+RN LTE++ IVY+ELLIDH K+KTA LVL  LKFFG
Sbjct: 641  AIISGMCKHGTIEEARKVFSNLRERNYLTESDTIVYEELLIDHTKRKTADLVLLSLKFFG 700

Query: 2165 LESKLKSKGSTILPG 2209
            LESKLK+KGS +LPG
Sbjct: 701  LESKLKAKGSKLLPG 715


>ref|XP_004511291.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            isoform X1 [Cicer arietinum]
            gi|502158821|ref|XP_004511292.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g20740-like isoform X2 [Cicer arietinum]
            gi|502158825|ref|XP_004511293.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g20740-like isoform X3 [Cicer arietinum]
          Length = 720

 Score =  930 bits (2403), Expect = 0.0
 Identities = 464/730 (63%), Positives = 571/730 (78%), Gaps = 6/730 (0%)
 Frame = +2

Query: 35   MPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQCS 214
            MPP  Q  T  NK YFYYGHRKPSQNRPTV GGLFSNR+T+ P     K + T  P    
Sbjct: 1    MPP--QTPTTPNKFYFYYGHRKPSQNRPTVRGGLFSNRQTLTPP----KPTTTSRP---- 50

Query: 215  IDINKWDPN---SPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHK-NWCPSIVKDL 382
             +I KWDP+        PP     E  FS +  LSPIAR+I D+FRK+   W PS++ +L
Sbjct: 51   FEIQKWDPHFLSQQNPSPPPPPSPEASFSASLRLSPIARFIVDAFRKNGYKWGPSVIAEL 110

Query: 383  DKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAA 562
            +KLRRVPPNLVAEVLK QT+P ++ +FFHW   QKG+ HNFAS+NAFAYCLNR N F AA
Sbjct: 111  NKLRRVPPNLVAEVLKVQTNPTLTFKFFHWVENQKGYHHNFASFNAFAYCLNRANHFHAA 170

Query: 563  DQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKN-FGVKPRVFLYNRILD 739
            DQ+P+LM   G  P+EKQFEILIRMH DAGRGLRVY+IY+KM+N FGVKPRVFLYN I+D
Sbjct: 171  DQLPELMDAHGKPPSEKQFEILIRMHCDAGRGLRVYHIYDKMRNKFGVKPRVFLYNTIMD 230

Query: 740  ALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRN-LCKP 916
            ALV+T H+DLA+SVY DF+EDGLVEESVTFM+LVKGLCK GR+ E+ E+L RMR  LCKP
Sbjct: 231  ALVRTRHLDLALSVYNDFREDGLVEESVTFMVLVKGLCKAGRIGEMLEVLGRMREKLCKP 290

Query: 917  DVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFF 1096
            DV AYTA++RI+V+EGNLDGCLR+WEEMK DGV  DVMAY T++ GL K  +V++GYE F
Sbjct: 291  DVFAYTALVRIMVAEGNLDGCLRVWEEMKRDGVVLDVMAYGTIIGGLAKEGRVKEGYELF 350

Query: 1097 KEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNA 1276
            KEM+ KG LIDR++YGSLIE+FV   K+G A DLLKDL++SGYRADL IYN++++GLCN 
Sbjct: 351  KEMKSKGHLIDRAIYGSLIESFVAGNKVGLAFDLLKDLVNSGYRADLGIYNNLIKGLCNL 410

Query: 1277 NRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLS 1456
            N+V+KAYKLFQ+T++EGL PDF++V P+L++YA+  RM++F  LL+KM+KLG PVIDDLS
Sbjct: 411  NKVEKAYKLFQVTIQEGLEPDFLSVKPLLLAYAEAKRMEEFYKLLKKMEKLGFPVIDDLS 470

Query: 1457 RFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDL 1636
            +FFS +V K   V  +LE+F  LK KGY S+ IYN+L+ +L   GE + AL+LF+E+K  
Sbjct: 471  KFFSHLVEKKGPVM-SLEIFTHLKEKGYVSVEIYNVLMDSLRLSGEVKKALSLFDEIKGS 529

Query: 1637 DFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAV 1816
              +PDSST++  I C +  G+++EAC C+NK+ +MS IP+V  Y  L KGLC IGE +  
Sbjct: 530  GMKPDSSTYNIAILCLIARGEIQEACVCHNKIIEMSCIPSVVVYHRLAKGLCEIGEIEEA 589

Query: 1817 LTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIIS 1996
            + LVRDCLGN TSGPMEFKY LT++H CK  DAEKVI+VLNEMM+QG P  +V   AIIS
Sbjct: 590  MMLVRDCLGNATSGPMEFKYCLTLVHICKFNDAEKVIDVLNEMMQQGFPLCNVVCSAIIS 649

Query: 1997 GMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESK 2176
            GMC HGT+EEARKVF ++RDR LLTE++ IVYDELLIDHMKKKTA LV+SGLKFFGLESK
Sbjct: 650  GMCKHGTIEEARKVFSNLRDRKLLTESDTIVYDELLIDHMKKKTADLVISGLKFFGLESK 709

Query: 2177 LKSKGSTILP 2206
            LKSKG  +LP
Sbjct: 710  LKSKGCKVLP 719


>ref|XP_006482966.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Citrus sinensis]
          Length = 721

 Score =  917 bits (2371), Expect = 0.0
 Identities = 460/732 (62%), Positives = 569/732 (77%), Gaps = 7/732 (0%)
 Frame = +2

Query: 29   PQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQ 208
            PQ PPK          YF+YGHRKPSQNRPTV+GG FSNR+++  NPN      T  P+Q
Sbjct: 6    PQRPPKP---------YFFYGHRKPSQNRPTVYGGFFSNRQSLR-NPN-----STSEPHQ 50

Query: 209  CS-IDINKWDPNSPQKFPPTKTQSE----KFFSIAQTLSPIARYICDSFRKHK-NWCPSI 370
                ++ KWDP+     P  KTQS     K F + + LSPIAR+I D+FRK++ +W P +
Sbjct: 51   SQPFNVQKWDPHY---LPNQKTQSPPSDPKTFQLQRHLSPIARFITDAFRKNQFHWGPRV 107

Query: 371  VKDLDKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNM 550
            V +L KLRRV P+LVAEVLK + +P ++S+FFHWAGKQKG++HNFASYNA AYCL+R N+
Sbjct: 108  VTELSKLRRVTPDLVAEVLKVENNPTLASKFFHWAGKQKGYKHNFASYNALAYCLSRNNL 167

Query: 551  FRAADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNR 730
            FRAADQVP+LM  QG  PTEKQFEILIRMH+D  RGLRV+++Y+KMK FG+ PRVFLYN+
Sbjct: 168  FRAADQVPELMDSQGKPPTEKQFEILIRMHADCNRGLRVFHVYQKMKKFGILPRVFLYNK 227

Query: 731  ILDALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-NL 907
            I+DALVKT  +DLA+SVYE+FK  GLVEESVT+MIL+KGLCK GR+ E+ EIL +MR NL
Sbjct: 228  IMDALVKTNCLDLALSVYEEFKGHGLVEESVTYMILIKGLCKAGRIAEMLEILEKMRRNL 287

Query: 908  CKPDVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGY 1087
            CKPDV AYTAMIR+L +E NLD CLR+WEEMK D V+ DVMAY TL+ GLCKG +V +GY
Sbjct: 288  CKPDVFAYTAMIRVLAAERNLDACLRVWEEMKKDLVEADVMAYVTLIMGLCKGGRVVRGY 347

Query: 1088 EFFKEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGL 1267
            E F+EM+E G LIDR++YG LIE  V +GK+G ACDLLKDL+DSGYRADL IYNS++ GL
Sbjct: 348  ELFREMKENGILIDRAIYGVLIEGLVGEGKVGKACDLLKDLVDSGYRADLGIYNSIIGGL 407

Query: 1268 CNANRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVID 1447
            C   + DKAYKLF++TV++ L PDF TVNP+LV  A+  RMD+F  LL + +KL   V  
Sbjct: 408  CRVKQFDKAYKLFEVTVQDDLAPDFSTVNPLLVCCAEMGRMDNFFKLLAQTEKLKFSVAA 467

Query: 1448 DLSRFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEM 1627
            DL +FF F+VGK +R+  AL+VFE+LK KGY S+ IYNIL+GAL   GE + AL LF EM
Sbjct: 468  DLEKFFEFLVGKEERIMMALDVFEELKGKGYSSVPIYNILMGALLEIGEVKKALYLFGEM 527

Query: 1628 KDLDFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGET 1807
            + L+ E +S + S  I C V+ G++ EAC C+NK+ +MS +P+V+AY  L KGLC+IGE 
Sbjct: 528  RGLNLEVNSLSFSIAIQCHVESGEILEACECHNKIIEMSQVPSVAAYNCLTKGLCKIGEI 587

Query: 1808 DAVLTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFA 1987
            DA + LVRDCLGNV SGP EFKYALTILH C+SG+AEK+IEVLNEM ++GCPP++V   A
Sbjct: 588  DAAMMLVRDCLGNVASGPTEFKYALTILHVCRSGEAEKIIEVLNEMTQEGCPPNEVICSA 647

Query: 1988 IISGMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGL 2167
            IISGMC HGTLEEARKVF ++ +R LLTEAN IVYDE+LI+HMKKKTA LVLSGLKFFGL
Sbjct: 648  IISGMCKHGTLEEARKVFTNLGERKLLTEANTIVYDEILIEHMKKKTADLVLSGLKFFGL 707

Query: 2168 ESKLKSKGSTIL 2203
            ESKLK+KG  +L
Sbjct: 708  ESKLKAKGCKLL 719


>ref|XP_004229293.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Solanum lycopersicum]
          Length = 1256

 Score =  914 bits (2361), Expect = 0.0
 Identities = 460/732 (62%), Positives = 559/732 (76%), Gaps = 3/732 (0%)
 Frame = +2

Query: 23   TNPQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNP 202
            TN +M  KS  S    K YF+YGHRKP+Q+RPTV GGLFSNR+TINPN     S   +  
Sbjct: 568  TNSKMAAKSAQS----KPYFFYGHRKPTQHRPTVQGGLFSNRQTINPNLTTKNSPSPVT- 622

Query: 203  NQCSIDINKWDPN--SPQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKNWCPSIVK 376
             Q    + KWDP+  S QK   ++  S++FFS+AQ LSPIARYI DSFRKH  W   ++ 
Sbjct: 623  -QGDFQLQKWDPDEVSGQK---SRDPSQEFFSLAQRLSPIARYIVDSFRKHGKWGAPLLA 678

Query: 377  DLDKLRRVPPNLVAEVLK-AQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMF 553
            DL+ LRRV P LV EVLK    DP+ISS+FF+WAGKQKG+RH+F+ YNAFAY LNR N F
Sbjct: 679  DLNTLRRVTPKLVTEVLKHPNLDPKISSKFFYWAGKQKGYRHDFSCYNAFAYGLNRANQF 738

Query: 554  RAADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRI 733
            R ADQVP+LM MQG  P+EKQFEILIRMH DA RGLRVYY+YEKMK FGVKPRVFLYNRI
Sbjct: 739  RTADQVPELMHMQGKPPSEKQFEILIRMHGDANRGLRVYYVYEKMKKFGVKPRVFLYNRI 798

Query: 734  LDALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRNLCK 913
            +DALVKT H+DLA+SVY+DFK+DGLVEES+TFMIL+KGLCK GRM+EVFE+         
Sbjct: 799  MDALVKTNHLDLAMSVYDDFKKDGLVEESITFMILIKGLCKFGRMDEVFEV--------- 849

Query: 914  PDVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEF 1093
                                     W+EM+ D V+PDV+AY+T + GLCK N+V+KGYE 
Sbjct: 850  -------------------------WKEMQQDAVEPDVIAYSTFIAGLCKNNQVDKGYEL 884

Query: 1094 FKEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCN 1273
            FKEM++K  LIDR +YGSLIE+FV  GK+G ACDLLKDLIDSGYRADL+IYNS++EGLCN
Sbjct: 885  FKEMKQKKILIDRGIYGSLIESFVASGKVGLACDLLKDLIDSGYRADLAIYNSIIEGLCN 944

Query: 1274 ANRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDL 1453
            A R D+AYKLFQITV+E L PDF TV PILVSYA+  +MD+ C LLE++Q+L   + DDL
Sbjct: 945  AKRTDRAYKLFQITVQEDLCPDFSTVKPILVSYAESKKMDEICKLLEELQRLSHCISDDL 1004

Query: 1454 SRFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKD 1633
            S+FF++MV K DR+  ALEVFE LK K YCS+ IYNIL+ AL++ GE   ALTLF E++ 
Sbjct: 1005 SKFFTYMVEKDDRIMIALEVFEYLKVKDYCSVPIYNILMEALYQNGEVNKALTLFSELRS 1064

Query: 1634 LDFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDA 1813
             D +PDSST+SN + CFV+VGDV+EA  CYN++K+MS IP+V+AY SLV GLC+IG+ D 
Sbjct: 1065 SDCKPDSSTYSNAVQCFVEVGDVQEASICYNRIKEMSLIPSVAAYRSLVIGLCKIGQIDP 1124

Query: 1814 VLTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAII 1993
             + L+ DCL NV SGPMEFKY LTI+H CK  DAEKV++VL+E++E+G  PD+  Y A+I
Sbjct: 1125 AMLLILDCLRNVASGPMEFKYILTIIHVCKMNDAEKVMKVLDELLEEGYSPDNAVYCAVI 1184

Query: 1994 SGMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLES 2173
             GMC HGT+EEA+KVF SMR R  LTEA+LIVYDE+LIDHMKKKTA L+LSGLKFFGLES
Sbjct: 1185 YGMCKHGTIEEAQKVFASMRKRKHLTEADLIVYDEMLIDHMKKKTADLLLSGLKFFGLES 1244

Query: 2174 KLKSKGSTILPG 2209
            KLK+KG T+L G
Sbjct: 1245 KLKAKGCTLLAG 1256


>ref|XP_006438906.1| hypothetical protein CICLE_v10030824mg [Citrus clementina]
            gi|557541102|gb|ESR52146.1| hypothetical protein
            CICLE_v10030824mg [Citrus clementina]
          Length = 721

 Score =  912 bits (2357), Expect = 0.0
 Identities = 457/732 (62%), Positives = 567/732 (77%), Gaps = 7/732 (0%)
 Frame = +2

Query: 29   PQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQ 208
            PQ PPK          YF+YGHRKPSQNRPTV+GG FSNR+++  NPN      T  P+Q
Sbjct: 6    PQRPPKP---------YFFYGHRKPSQNRPTVYGGFFSNRQSLR-NPN-----STSEPHQ 50

Query: 209  CS-IDINKWDPNSPQKFPPTKTQSE----KFFSIAQTLSPIARYICDSFRKHK-NWCPSI 370
                ++ KWDP+     P  KTQS     K F + + LSPIAR+I D+F K++ +W P +
Sbjct: 51   SQPFNVQKWDPHY---LPSQKTQSPPSDPKTFQLQRHLSPIARFITDAFHKNQFHWGPRV 107

Query: 371  VKDLDKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNM 550
            V +L KLRRV P+LVAEVLK + +P ++S+FFHWAGKQKG++HNFASYNA AYCL+R N+
Sbjct: 108  VTELSKLRRVTPDLVAEVLKVENNPTLASKFFHWAGKQKGYKHNFASYNALAYCLSRNNL 167

Query: 551  FRAADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNR 730
            FRAADQVP+LM  QG  PTEKQFEILIRMH+D  RGLRV+++Y+KMK FG+ PRVFLYN+
Sbjct: 168  FRAADQVPELMDSQGKPPTEKQFEILIRMHADCNRGLRVFHVYQKMKKFGILPRVFLYNK 227

Query: 731  ILDALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-NL 907
            I+DALVKT  +DLA+SVYE+FK  GLVEESVT+MIL+KGLCK GR+ E+ EIL +MR NL
Sbjct: 228  IMDALVKTNCLDLALSVYEEFKGHGLVEESVTYMILIKGLCKAGRIAEMLEILEKMRRNL 287

Query: 908  CKPDVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGY 1087
            CKPDV AYTAMIR+L +E NLD CLR+WEEMK D V+ DVMAY TL+ GLCKG +V +GY
Sbjct: 288  CKPDVFAYTAMIRVLAAERNLDACLRVWEEMKKDLVEADVMAYVTLIMGLCKGGRVVRGY 347

Query: 1088 EFFKEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGL 1267
            + F+EM+E G LIDR++YG LIE  V +GK+G ACDLLKDL+DSGYRADL IYNS++ GL
Sbjct: 348  KLFREMKENGILIDRAIYGVLIEGLVGEGKVGKACDLLKDLVDSGYRADLGIYNSIIGGL 407

Query: 1268 CNANRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVID 1447
            C   + DKAYKLF++TV++ L PDF TVNP+LV  A+  RMD+F  LL + +KL   V  
Sbjct: 408  CRVKQFDKAYKLFEVTVQDDLAPDFSTVNPLLVCCAEMGRMDNFFKLLAQTEKLKFSVAA 467

Query: 1448 DLSRFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEM 1627
            DL +FF F+VGK +R+  AL+VFE+LK KGY S+ IYNIL+GAL   GE + AL LF EM
Sbjct: 468  DLEKFFEFLVGKEERIMMALDVFEELKGKGYSSVPIYNILMGALLEIGEVKKALYLFGEM 527

Query: 1628 KDLDFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGET 1807
            + L+ E +S + S  I C V+ G++ EAC C+NK+ +M  +P+V+AY  L KGLC+IGE 
Sbjct: 528  RGLNLEVNSLSFSIAIQCHVESGEILEACECHNKIIEMYQVPSVAAYNCLTKGLCKIGEI 587

Query: 1808 DAVLTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFA 1987
            DA + LVRDCLGNV SGP EFKYALTILH C+SG+AEK+IEVLNEM ++GCPP++V   A
Sbjct: 588  DAAMMLVRDCLGNVASGPTEFKYALTILHVCRSGEAEKIIEVLNEMTQEGCPPNEVICSA 647

Query: 1988 IISGMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGL 2167
            IISGMC HGTLEEARKVF ++ +R LLTEAN IVYDE+LI+HMKKKTA LVLSGLKFFGL
Sbjct: 648  IISGMCKHGTLEEARKVFTNLGERKLLTEANTIVYDEILIEHMKKKTADLVLSGLKFFGL 707

Query: 2168 ESKLKSKGSTIL 2203
            ESKLK+KG  +L
Sbjct: 708  ESKLKAKGCKLL 719


>ref|XP_002513116.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548127|gb|EEF49619.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 1128

 Score =  910 bits (2353), Expect = 0.0
 Identities = 456/729 (62%), Positives = 557/729 (76%), Gaps = 4/729 (0%)
 Frame = +2

Query: 29   PQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQ 208
            P+MPP+S P     K YF+YGHRKPSQNRP V GGLFSNR+ I P      S K  NP  
Sbjct: 442  PKMPPQSLPPP---KPYFFYGHRKPSQNRPVVRGGLFSNRQIIKPQ----NSIKPKNP-- 492

Query: 209  CSIDINKWDPNSP---QKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN-WCPSIVK 376
               D+  WDP +P    K PP  +Q+    +++Q LSPI+R+I D+FRK+ N W P +V 
Sbjct: 493  VPFDLQNWDPQNPCPSSKSPPL-SQNHSLSTLSQRLSPISRFIRDAFRKNSNKWGPPVVA 551

Query: 377  DLDKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFR 556
            +L KLRRV P+LV+EVLK + DP ++S+FFHWAGKQKG+RHNFASYNA+AYCLNR + FR
Sbjct: 552  ELRKLRRVTPDLVSEVLKVENDPHLASQFFHWAGKQKGYRHNFASYNAYAYCLNRSSFFR 611

Query: 557  AADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRIL 736
            AADQ+P+LM  QG  PTEKQFEILIRMHSDA RGLRVY++Y+KMK FGVKPR FLYNRI+
Sbjct: 612  AADQLPELMDSQGKPPTEKQFEILIRMHSDANRGLRVYHVYQKMKKFGVKPRAFLYNRIM 671

Query: 737  DALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRNLCKP 916
            DAL+KT H+DLA+ VY+DFK DGLVE+SVT+MIL+KGLCK GR                 
Sbjct: 672  DALIKTAHLDLALVVYDDFKSDGLVEDSVTYMILIKGLCKFGR----------------- 714

Query: 917  DVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFF 1096
                             +D  + +WEEMK DGV+PDVMAY T+VTGLCKG +V +GYE F
Sbjct: 715  -----------------IDEMMEVWEEMKRDGVNPDVMAYATVVTGLCKGGRVAEGYELF 757

Query: 1097 KEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNA 1276
            KEM+E   LIDR++YG LIEAFV DGKIGSACDLL+ L+DSGYRADL IYNS++EGLCN 
Sbjct: 758  KEMKENKVLIDRAIYGVLIEAFVKDGKIGSACDLLQGLVDSGYRADLGIYNSLIEGLCNV 817

Query: 1277 NRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLS 1456
             RVDKA KLFQI V+EGL  DF TVNP+LVSYA+  RMD+FC LL +M++LG  V+DD+S
Sbjct: 818  KRVDKARKLFQIMVQEGLELDFKTVNPMLVSYAEMKRMDEFCKLLVQMERLGFSVMDDIS 877

Query: 1457 RFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDL 1636
            + FSF+V + + +  ALEVFE+LK KGY S+ IYN L+ AL + GE + AL+LF EMKDL
Sbjct: 878  KLFSFLVRREEIITLALEVFEELKVKGYISVLIYNTLMEALLKVGEVRKALSLFSEMKDL 937

Query: 1637 DFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAV 1816
            + EPDS+T+S  + CFV+ G+++EAC C+NK+ +MS +P+V+AYCSL KGLC IGE D  
Sbjct: 938  NCEPDSNTYSIAVICFVEDGNIQEACVCHNKIIEMSSVPSVAAYCSLTKGLCDIGEIDEA 997

Query: 1817 LTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIIS 1996
            + LVRDCLGNVTSGPMEFKY LT+LH C+SGDAEKVIEVLNEMM + CPP++V   AIIS
Sbjct: 998  MMLVRDCLGNVTSGPMEFKYTLTVLHVCRSGDAEKVIEVLNEMMHENCPPNEVILSAIIS 1057

Query: 1997 GMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESK 2176
            GMC HGTLEEARKVF ++R+R LLTEA  I YDE LI+HMKKKTA LV+SGLKFFGLESK
Sbjct: 1058 GMCKHGTLEEARKVFTNLRERKLLTEAKTIFYDERLIEHMKKKTADLVVSGLKFFGLESK 1117

Query: 2177 LKSKGSTIL 2203
            L++KG T+L
Sbjct: 1118 LRAKGCTLL 1126


>ref|XP_004305399.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Fragaria vesca subsp. vesca]
          Length = 1089

 Score =  889 bits (2296), Expect = 0.0
 Identities = 437/715 (61%), Positives = 549/715 (76%), Gaps = 2/715 (0%)
 Frame = +2

Query: 71   KLYFYYGHRKPSQNRPTVHGGLFSNRKTIN-PNPNFYKSSKTLNPNQCSIDINKWDPNSP 247
            K   +YGHRKPS+NRPTV G    NR +++ PNP          P     D++KW P++ 
Sbjct: 411  KFTLFYGHRKPSRNRPTVRG----NRLSLSQPNPKPIPIPTQSQP----FDLSKWHPHTN 462

Query: 248  QKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN-WCPSIVKDLDKLRRVPPNLVAEV 424
            Q  P T + S         LSPIAR+I D+FRK++N W P +V +L KLRRV P+LVAEV
Sbjct: 463  QSPPSTSSPSAAPVVSLSHLSPIARFILDAFRKNRNHWGPPVVAELHKLRRVTPDLVAEV 522

Query: 425  LKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRAADQVPDLMRMQGNQP 604
            LK Q DP  +S+ FHWAGKQKGF+H FASYNA  YCLNR + FR+ADQVPDLM  QG  P
Sbjct: 523  LKVQNDPVSASKLFHWAGKQKGFKHTFASYNALTYCLNRAHRFRSADQVPDLMDSQGKPP 582

Query: 605  TEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRILDALVKTGHVDLAISVY 784
            TEKQFEILIRMHSDA RGLRVY+++ KMK FGVKPRVFLYNR++DALV+TGH DLA+SVY
Sbjct: 583  TEKQFEILIRMHSDANRGLRVYHVFRKMKTFGVKPRVFLYNRVMDALVRTGHFDLALSVY 642

Query: 785  EDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMRNLCKPDVCAYTAMIRILVSEG 964
             DF+ DGLVEESVT+MIL+KG+CK GR++E+ ++L RMR                     
Sbjct: 643  HDFRGDGLVEESVTYMILIKGMCKCGRVDEMLQLLERMR--------------------- 681

Query: 965  NLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFFKEMREKGCLIDRSMYG 1144
                   +WEEM+ D V+ D MAY TLVTGLCKG +VEKGYE F+EM+EKG LIDR++YG
Sbjct: 682  -------VWEEMRRDRVEADAMAYVTLVTGLCKGGRVEKGYELFREMKEKGFLIDRAIYG 734

Query: 1145 SLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNANRVDKAYKLFQITVRE 1324
             L+E FV D K+G+ACDLLKDL+ SGYRADL IYNS+++GLC+  RVDKAYKLF++ V+E
Sbjct: 735  VLVEGFVEDRKVGAACDLLKDLVASGYRADLGIYNSLIKGLCDVKRVDKAYKLFRVAVQE 794

Query: 1325 GLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLSRFFSFMVGKGDRVRKA 1504
            GL PDF TVNPI+VSYAD  RMD+FC LL +M+K G PVIDDLS+FFS +  K D V  A
Sbjct: 795  GLGPDFATVNPIMVSYADMRRMDNFCDLLAQMEKCGRPVIDDLSKFFSMLTEKEDGVMMA 854

Query: 1505 LEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDLDFEPDSSTHSNVIPCF 1684
            LEVFE+LK K Y S++IYNILI  LH+ G+ + AL+LF+EMK  +F+PDSST+S+ I C+
Sbjct: 855  LEVFEELKVKSYYSVAIYNILIAGLHKFGKVKKALSLFDEMKSFNFQPDSSTYSSAIICY 914

Query: 1685 VDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAVLTLVRDCLGNVTSGPM 1864
            ++ GD+ EAC+C+NK+ +MS +P+V+AY SL KGL  +GE DAV+ LVRDCL +VTSGPM
Sbjct: 915  MEEGDLHEACACHNKIIEMSCVPSVAAYRSLAKGLFNVGEIDAVMLLVRDCLASVTSGPM 974

Query: 1865 EFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIISGMCNHGTLEEARKVFG 2044
             FKY+LT+L+AC+S +AEKVI+V+NEMM+QGCPPD V Y AIISGMC HGT+EEARKVF 
Sbjct: 975  AFKYSLTVLNACRSKNAEKVIDVMNEMMQQGCPPDHVVYSAIISGMCKHGTIEEARKVFS 1034

Query: 2045 SMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESKLKSKGSTILPG 2209
            ++++R L+TEAN+I+YD++LI+HMKKKTA L++SGLKFFGLESKLK+KG  +L G
Sbjct: 1035 NLKERKLITEANMILYDDMLIEHMKKKTADLIVSGLKFFGLESKLKAKGCNLLSG 1089


>ref|XP_006413862.1| hypothetical protein EUTSA_v10024515mg [Eutrema salsugineum]
            gi|557115032|gb|ESQ55315.1| hypothetical protein
            EUTSA_v10024515mg [Eutrema salsugineum]
          Length = 735

 Score =  884 bits (2284), Expect = 0.0
 Identities = 440/729 (60%), Positives = 557/729 (76%), Gaps = 4/729 (0%)
 Frame = +2

Query: 29   PQMPPKSQPSTITNKLYFYYGHRKPSQNRPTVHGGLFSNRKTINPNPNFYKSSKTLNPNQ 208
            P +P K      T K  F+YGHRKPSQNRP VHGGLFSNR+ ++ +P   +S      ++
Sbjct: 7    PNLPEK------TLKPNFFYGHRKPSQNRPVVHGGLFSNRQYLSRDPP--QSPSNAVADR 58

Query: 209  CSIDINKWDPNS--PQKFPPTKTQSEKFFSIAQTLSPIARYICDSFRKHKN-WCPSIVKD 379
               D+ KWDP S  P +   + + S    + ++ LSPIAR++ D+FRK++N W PS+V +
Sbjct: 59   IPFDLRKWDPESRLPSERASSSSPSTSISAASERLSPIARFVLDAFRKNRNRWGPSVVSE 118

Query: 380  LDKLRRVPPNLVAEVLKAQTDPRISSRFFHWAGKQKGFRHNFASYNAFAYCLNRLNMFRA 559
            L+KLRRV P++VAEVLK   D  +S++FFHWAGKQKG++H+FA+YNAFAYCLNR   FRA
Sbjct: 119  LNKLRRVTPSIVAEVLKVGNDAAVSAKFFHWAGKQKGYKHDFAAYNAFAYCLNRTGHFRA 178

Query: 560  ADQVPDLMRMQGNQPTEKQFEILIRMHSDAGRGLRVYYIYEKMKNFGVKPRVFLYNRILD 739
            ADQ+P+LM  QG  P+EKQFEILIRMHSD  RGLRVYY+YEKMK FG KPRVFLYNRI+D
Sbjct: 179  ADQLPELMDSQGRPPSEKQFEILIRMHSDNKRGLRVYYVYEKMKKFGFKPRVFLYNRIMD 238

Query: 740  ALVKTGHVDLAISVYEDFKEDGLVEESVTFMILVKGLCKVGRMNEVFEILSRMR-NLCKP 916
            AL+KTG+ DLA++VYEDFKEDGLVEES TFMILVKGLCK GRM E+ EIL RMR NLC+P
Sbjct: 239  ALMKTGYFDLALAVYEDFKEDGLVEESTTFMILVKGLCKSGRMEEMLEILQRMRENLCRP 298

Query: 917  DVCAYTAMIRILVSEGNLDGCLRIWEEMKIDGVDPDVMAYTTLVTGLCKGNKVEKGYEFF 1096
            DV AYTAMI+ LVSEGN+D  LR+W+EMK D V PDVMAY TLV GLCK  +VEKGYE F
Sbjct: 299  DVFAYTAMIKTLVSEGNMDASLRVWDEMKRDEVKPDVMAYGTLVMGLCKDGRVEKGYELF 358

Query: 1097 KEMREKGCLIDRSMYGSLIEAFVVDGKIGSACDLLKDLIDSGYRADLSIYNSMVEGLCNA 1276
             EM+EK  LIDR +Y  LIE FV DGK+ SACDL KDL+DSGY ADL IYN++++GLC  
Sbjct: 359  MEMKEKQILIDRDIYRVLIEGFVADGKVRSACDLWKDLVDSGYIADLGIYNAIIKGLCTV 418

Query: 1277 NRVDKAYKLFQITVREGLNPDFVTVNPILVSYADESRMDDFCILLEKMQKLGLPVIDDLS 1456
             +VDKAYKLFQI   E L PDF T++PI+V+Y    R+ DF  LLE++ + G PV D L+
Sbjct: 419  KQVDKAYKLFQIATEEELEPDFETLSPIMVAYVVMKRLSDFWNLLERIAESGYPVADYLT 478

Query: 1457 RFFSFMVGKGDRVRKALEVFEDLKTKGYCSISIYNILIGALHRKGEAQNALTLFEEMKDL 1636
            +FF  +    ++   AL+VF+ LKT+G+ S+S+YNIL+ AL++ G    +L+LF EM++ 
Sbjct: 479  QFFRLLCDDEEKRTLALDVFDVLKTQGHGSVSVYNILMEALYKMGNIHKSLSLFFEMREY 538

Query: 1637 DFEPDSSTHSNVIPCFVDVGDVKEACSCYNKMKDMSWIPTVSAYCSLVKGLCRIGETDAV 1816
             FEPDSS++S  I CFV+ GDV+EACSC+ K+ +MS +P+ SAY SL KGLC+IGE DAV
Sbjct: 539  GFEPDSSSYSIAISCFVEKGDVQEACSCHEKIIEMSCVPSTSAYLSLTKGLCQIGEIDAV 598

Query: 1817 LTLVRDCLGNVTSGPMEFKYALTILHACKSGDAEKVIEVLNEMMEQGCPPDDVTYFAIIS 1996
            + LVR+CLGNV SGPMEFKYAL + H CK  +AEKV+EVL+EM ++G    +V Y AIIS
Sbjct: 599  MKLVRECLGNVESGPMEFKYALRVCHVCKVNNAEKVMEVLDEMNQEGVCISEVIYCAIIS 658

Query: 1997 GMCNHGTLEEARKVFGSMRDRNLLTEANLIVYDELLIDHMKKKTAGLVLSGLKFFGLESK 2176
            GMC HGT++ AR+VF  ++ R ++TEA +IVYDE+LI+  KKKTA LVLSG+KFFGLESK
Sbjct: 659  GMCKHGTIKAAREVFAELKKRKVMTEAEMIVYDEMLIEQTKKKTADLVLSGIKFFGLESK 718

Query: 2177 LKSKGSTIL 2203
            L++KG  +L
Sbjct: 719  LRAKGCRLL 727

