BLASTX nr result

ID: Paeonia23_contig00004317 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00004317
         (3023 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containi...  1314   0.0  
ref|XP_007051141.1| S uncoupled 1 [Theobroma cacao] gi|508703402...  1254   0.0  
ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containi...  1244   0.0  
ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citr...  1243   0.0  
ref|XP_002515260.1| pentatricopeptide repeat-containing protein,...  1238   0.0  
ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Popu...  1227   0.0  
ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Popu...  1226   0.0  
gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]    1219   0.0  
ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containi...  1209   0.0  
ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containi...  1208   0.0  
ref|XP_007221553.1| hypothetical protein PRUPE_ppa001263mg [Prun...  1199   0.0  
ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Popu...  1198   0.0  
ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containi...  1187   0.0  
ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutr...  1150   0.0  
ref|XP_006355855.1| PREDICTED: pentatricopeptide repeat-containi...  1118   0.0  
ref|XP_004240564.1| PREDICTED: pentatricopeptide repeat-containi...  1114   0.0  
ref|XP_006444532.1| hypothetical protein CICLE_v10018807mg [Citr...  1102   0.0  
ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Caps...  1100   0.0  
ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containi...  1100   0.0  
ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidop...  1097   0.0  

>ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic [Vitis vinifera]
          Length = 867

 Score = 1314 bits (3400), Expect = 0.0
 Identities = 677/878 (77%), Positives = 742/878 (84%), Gaps = 9/878 (1%)
 Frame = +1

Query: 244  MASSTP-HCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXXX 420
            MAS TP HCSIT  KPYQN  +PQNP             W+  KVSLT P   P      
Sbjct: 1    MASPTPPHCSITAAKPYQNLHYPQNPTKNHHNNHH----WSSHKVSLTNPLPSPRNAAKP 56

Query: 421  XXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQL 600
                                FPSLS  PP +KSELTADFSGRRSTRFVSKMHFGR K   
Sbjct: 57   GAASPATATNRNS------NFPSLSPLPP-SKSELTADFSGRRSTRFVSKMHFGRPKTAA 109

Query: 601  SSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQ 777
            ++RH+S AEEAL+ +IR  +D+KG+++V L FES++ GSDDY F+LRELGNRGE  KA++
Sbjct: 110  AARHTSTAEEALRHAIRFASDDKGIDSVLLNFESRLCGSDDYTFLLRELGNRGEWAKAIR 169

Query: 778  CFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISA 957
            CFEFAVRRE+RRNEQGKLASAMISILGRLG+V+LA+ VFE  L  GYGNTVYA+SALISA
Sbjct: 170  CFEFAVRREQRRNEQGKLASAMISILGRLGQVELAKNVFETALNEGYGNTVYAFSALISA 229

Query: 958  YGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQP 1137
            YGRSG  DEAIKVFETMK+SGLKPNLVTYNAVIDACGKGGV+FN+AAEI +EMLRNG+QP
Sbjct: 230  YGRSGYCDEAIKVFETMKSSGLKPNLVTYNAVIDACGKGGVDFNRAAEIFDEMLRNGVQP 289

Query: 1138 DRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIM 1317
            DRITFNSLLAVC RGGLWEAARN FSEM+YRGI+QDIFTYNTLLDA CKGGQMDLAFQIM
Sbjct: 290  DRITFNSLLAVCGRGGLWEAARNLFSEMLYRGIEQDIFTYNTLLDAVCKGGQMDLAFQIM 349

Query: 1318 AEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKL 1497
            +EM  K+I PNVVTYST+IDGYAK GRLD+ALNL  EMKFA IGLDRVSYNTLLSIYAKL
Sbjct: 350  SEMPRKHIMPNVVTYSTVIDGYAKAGRLDEALNLFNEMKFASIGLDRVSYNTLLSIYAKL 409

Query: 1498 GRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTY 1677
            GRFE+ LNVC EMESSGIKKDAVTYNALLGGYGKQGKY+EVKRVF+EMK   + PNLLTY
Sbjct: 410  GRFEEALNVCKEMESSGIKKDAVTYNALLGGYGKQGKYEEVKRVFEEMKAERIFPNLLTY 469

Query: 1678 STLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTK 1857
            STLIDVYSKGGLY+EA+EVF+EFK+AGLKADVVLYSALIDALCKNGLVE AVS LDEMTK
Sbjct: 470  STLIDVYSKGGLYQEAMEVFREFKKAGLKADVVLYSALIDALCKNGLVESAVSFLDEMTK 529

Query: 1858 EGIRPNVVTYNSIIDAFGRSATA-------CGSNESLIEPSSLIALKDVSESNDEHREDN 2016
            EGIRPNVVTYNSIIDAFGRS +A         +N S +  SSL  ++D +ES    +EDN
Sbjct: 530  EGIRPNVVTYNSIIDAFGRSGSAECVIDPPYETNVSKMSSSSLKVVEDATESEVGDKEDN 589

Query: 2017 QIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCN 2196
            QIIKIFGQLAAEK+CHAK +NRGRQEILC+L VF KM EL+IKPNVVTFSAILNACSRCN
Sbjct: 590  QIIKIFGQLAAEKTCHAKKENRGRQEILCILAVFHKMHELDIKPNVVTFSAILNACSRCN 649

Query: 2197 SFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNAL 2376
            SFEDASMLLEELR FDNQVYGVAHGLLMG  DNVW QAQ LFDEVKQMDS TASAFYNAL
Sbjct: 650  SFEDASMLLEELRLFDNQVYGVAHGLLMGYGDNVWVQAQSLFDEVKQMDSSTASAFYNAL 709

Query: 2377 TDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIV 2556
            TDMLWHFGQ++GAQLVVLEGKRRHVWENMWS+SCLDLHLMSSGAARAMVHAWLL+IRSIV
Sbjct: 710  TDMLWHFGQRRGAQLVVLEGKRRHVWENMWSNSCLDLHLMSSGAARAMVHAWLLNIRSIV 769

Query: 2557 FEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVV 2736
            FEGHELP+L+SILTGWGKHSKVVGDG LRRAIEALLTGMGAPFRVA CNLGRFIS GAVV
Sbjct: 770  FEGHELPQLLSILTGWGKHSKVVGDGALRRAIEALLTGMGAPFRVAKCNLGRFISTGAVV 829

Query: 2737 AAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            AAWL+ESGTLKVLVLHDDR + + +   +I NL+TL L
Sbjct: 830  AAWLRESGTLKVLVLHDDRTNPDRARCSQISNLQTLPL 867


>ref|XP_007051141.1| S uncoupled 1 [Theobroma cacao] gi|508703402|gb|EOX95298.1| S
            uncoupled 1 [Theobroma cacao]
          Length = 866

 Score = 1254 bits (3244), Expect = 0.0
 Identities = 645/877 (73%), Positives = 718/877 (81%), Gaps = 8/877 (0%)
 Frame = +1

Query: 244  MASSTPHCSITGT-KPYQNHQFPQNPIXXXXXXXXXXXXWAH-RKVSLTKPSQVPHXXXX 417
            MAS+ PHCSIT T KPYQNHQ+PQN +                +K SL+KP   P     
Sbjct: 1    MASTPPHCSITATTKPYQNHQYPQNHLKNHRNHQNNHRNQTRPQKFSLSKPPPSP----- 55

Query: 418  XXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQ 597
                                T   LS++P P  S L  DFSGRRSTRFVSKMH GR K  
Sbjct: 56   ----CNAAKPATTAAAAAASTRSPLSQSPVPFPS-LAPDFSGRRSTRFVSKMHLGRPKTS 110

Query: 598  LSSRHSSVAEEALQESIRINDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQ 777
             ++RH+S+AEE LQ ++  N   GLE V + FESK+ GSDDY F+LRELGNRGE  KA++
Sbjct: 111  TNTRHTSIAEEVLQLALH-NGHSGLERVLVSFESKLCGSDDYTFLLRELGNRGEYEKAIK 169

Query: 778  CFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISA 957
            CF+FAVRRERR+ EQGKLASAMISILGRLGKV+LA+G+FE  L  GYGNTVYA+SALISA
Sbjct: 170  CFQFAVRRERRKTEQGKLASAMISILGRLGKVELAKGIFETALTEGYGNTVYAFSALISA 229

Query: 958  YGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQP 1137
            +GRSG SDEAIKVF++MKN+GLKPNLVTYNAVIDACGKGGVEF +  EI +EMLR+G+QP
Sbjct: 230  FGRSGYSDEAIKVFDSMKNNGLKPNLVTYNAVIDACGKGGVEFKRVVEIFDEMLRSGVQP 289

Query: 1138 DRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIM 1317
            DRITFNSLLAVCSRGGLWEAARN FSEMV+RGIDQDIFTYNTLLDA CKGGQMDLAF+IM
Sbjct: 290  DRITFNSLLAVCSRGGLWEAARNLFSEMVHRGIDQDIFTYNTLLDAVCKGGQMDLAFEIM 349

Query: 1318 AEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKL 1497
            AEM  KNI PNVVTYSTMIDGYAK GR DDALNL  EMKF GIGLDRVSYNT+LSIYAKL
Sbjct: 350  AEMPTKNILPNVVTYSTMIDGYAKAGRFDDALNLFNEMKFLGIGLDRVSYNTVLSIYAKL 409

Query: 1498 GRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTY 1677
            GRFE+ L++C EME SGI+KD VTYNALLGGYGKQGKYDEV+R+F+EMKT+ +SPNLLTY
Sbjct: 410  GRFEEALDICREMEGSGIRKDVVTYNALLGGYGKQGKYDEVRRLFEEMKTQKVSPNLLTY 469

Query: 1678 STLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTK 1857
            ST+IDVYSKGGLY EA++VF+EFK+ GLKADVVLYSALIDALCKNGLVE AVSLLDEMTK
Sbjct: 470  STVIDVYSKGGLYEEAMDVFREFKRVGLKADVVLYSALIDALCKNGLVESAVSLLDEMTK 529

Query: 1858 EGIRPNVVTYNSIIDAFGRSAT------ACGSNESLIEPSSLIALKDVSESNDEHREDNQ 2019
            EGIRPNVVTYNSIIDAFGRSAT      A G   +L   SS + +    E      EDNQ
Sbjct: 530  EGIRPNVVTYNSIIDAFGRSATSECAFDAGGEISALQTESSSLVIGHSIEGKARDGEDNQ 589

Query: 2020 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2199
            +IK FGQLAAEK   AK D RG+QEILC+L VFQKM ELEIKPNVVTFSAILNACSRC+S
Sbjct: 590  VIKFFGQLAAEKGGQAKKDCRGKQEILCILGVFQKMHELEIKPNVVTFSAILNACSRCDS 649

Query: 2200 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2379
            FEDASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFYNALT
Sbjct: 650  FEDASMLLEELRLFDNQVYGVAHGLLMGYRENVWIQAQSLFDEVKLMDSSTASAFYNALT 709

Query: 2380 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2559
            DMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAWLL+IRSI+F
Sbjct: 710  DMLWHFGQKRGAQLVVLEGKRRQVWENVWSNSCLDLHLMSSGAARAMVHAWLLNIRSIIF 769

Query: 2560 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2739
            EGHELPKL+SILTGWGKHSKVVGDG LRR +E+L TGMGAPFR+A CNLGRF+S G VV 
Sbjct: 770  EGHELPKLLSILTGWGKHSKVVGDGALRRTVESLFTGMGAPFRLAKCNLGRFVSTGPVVT 829

Query: 2740 AWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            AWL+ESGTLK+LVLHDDR   E + F +I NL+TLTL
Sbjct: 830  AWLRESGTLKLLVLHDDRTQPENTGFGQISNLQTLTL 866


>ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Citrus sinensis]
          Length = 877

 Score = 1244 bits (3219), Expect = 0.0
 Identities = 647/886 (73%), Positives = 715/886 (80%), Gaps = 17/886 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXX----WAHRKVSLTKPSQVPHXX 411
            MAS+ PHCSIT TKPYQNHQ+P N +                W   KVSLTKP   P   
Sbjct: 1    MASTPPHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPR 60

Query: 412  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 591
                                   F SLS  P  +KSEL  DFSGRRSTRFVSKMHFGR K
Sbjct: 61   NAPKPAATSTTVAPNPKP-----FHSLSPLPS-SKSELAPDFSGRRSTRFVSKMHFGRPK 114

Query: 592  AQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 768
              +S+RHS VAEEAL        D+  L ++   FE K+ G+DDY F+LRELGNRGE  K
Sbjct: 115  IAMSTRHSVVAEEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSK 174

Query: 769  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 948
            A+QCF FAV+RE R+N+QGKLASAMISILGRLGKVDLA+ +FE  L  GYGNTVYA+SAL
Sbjct: 175  AIQCFAFAVKREERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSAL 234

Query: 949  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1128
            ISAYGRSG   EAI VF +MK   LKPNLVTYNAVIDACGKGGV+F    EI ++MLRNG
Sbjct: 235  ISAYGRSGYCQEAISVFNSMKRYNLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNG 294

Query: 1129 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1308
            +QPDRITFNSLLAVCSRGGLWEAARN F+EMV+RGIDQDIFTYNTLLDA CKG QMDLAF
Sbjct: 295  VQPDRITFNSLLAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAF 354

Query: 1309 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1488
            +IMAEM AKNISPNVVTYSTMIDGYAK GRLDDALN+  EMKF GIGLDRVSYNT+LSIY
Sbjct: 355  EIMAEMPAKNISPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIY 414

Query: 1489 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1668
            AKLGRFE+ L VC EMESSGI+KDAVTYNALLGGYGKQGKYDEV+R+F++MK   +SPNL
Sbjct: 415  AKLGRFEEALLVCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNL 474

Query: 1669 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1848
            LTYSTLIDVYSKGGLY+EA+++F+EFKQAGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1849 MTKEGIRPNVVTYNSIIDAFGRSATA-CGSNE------SLIEPSSLIAL-----KDVSES 1992
            MTKEGIRPNVVTYNSIIDAFGRSAT  C  ++         E ++L A+     KDV E+
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRSATTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEA 594

Query: 1993 NDEHREDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAI 2172
                R DNQIIK+FGQL AEK+   K +NR RQEILC+L VFQKM +L+IKPNVVTFSAI
Sbjct: 595  G---RTDNQIIKVFGQLVAEKAGQGKKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAI 651

Query: 2173 LNACSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPT 2352
            LNACSRCNSFEDASMLLEELR FDNQVYGVAHGLLMG +DN+W QA  LFDEVK MDS T
Sbjct: 652  LNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSST 711

Query: 2353 ASAFYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAW 2532
            ASAFYNALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAW
Sbjct: 712  ASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAW 771

Query: 2533 LLDIRSIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGR 2712
            LL+I SIVFEGHELPKL+SILTGWGKHSKVVGDG LRRA+E LLTGMGAPF VANCNLGR
Sbjct: 772  LLNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGALRRAVEVLLTGMGAPFWVANCNLGR 831

Query: 2713 FISPGAVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            FIS G +VA+WL+ESGTLKVLVLHDDR H E + FD + N++TLTL
Sbjct: 832  FISTGPMVASWLRESGTLKVLVLHDDRTHSENAGFDEMLNMQTLTL 877


>ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citrus clementina]
            gi|557546795|gb|ESR57773.1| hypothetical protein
            CICLE_v10018807mg [Citrus clementina]
          Length = 877

 Score = 1243 bits (3217), Expect = 0.0
 Identities = 647/886 (73%), Positives = 715/886 (80%), Gaps = 17/886 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXX----WAHRKVSLTKPSQVPHXX 411
            MAS+ PHCSIT TKPYQNHQ+P N +                W   KVSLTKP   P   
Sbjct: 1    MASTPPHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPR 60

Query: 412  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 591
                                   F SLS  P  +KSEL  DFSGRRSTRFVSKMHFGR K
Sbjct: 61   NAPKPAATSTTVAPNPKP-----FHSLSPLPS-SKSELAPDFSGRRSTRFVSKMHFGRPK 114

Query: 592  AQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 768
              +S+RHS VAEEAL        D+  L ++   FE K+ G+DDY F+LRELGNRGE  K
Sbjct: 115  IAMSTRHSVVAEEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSK 174

Query: 769  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 948
            A+QCF FAV+RE R+N+QGKLASAMISILGRLGKVDLA+ +FE  L  GYGNTVYA+SAL
Sbjct: 175  AIQCFAFAVKREERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSAL 234

Query: 949  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1128
            ISAYGRSG   EAI VF +MK   LKPNLVTYNAVIDACGKGGV+F    EI ++MLRNG
Sbjct: 235  ISAYGRSGYCQEAISVFNSMKRYHLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNG 294

Query: 1129 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1308
            +QPDRITFNSLLAVCSRGGLWEAARN F+EMV+RGIDQDIFTYNTLLDA CKG QMDLAF
Sbjct: 295  VQPDRITFNSLLAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAF 354

Query: 1309 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1488
            +IMAEM AKNISPNVVTYSTMIDGYAK GRLDDALN+  EMKF GIGLDRVSYNT+LSIY
Sbjct: 355  EIMAEMPAKNISPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIY 414

Query: 1489 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1668
            AKLGRFE+ L VC EMESSGI+KDAVTYNALLGGYGKQGKYDEV+R+F++MK   +SPNL
Sbjct: 415  AKLGRFEEALLVCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNL 474

Query: 1669 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1848
            LTYSTLIDVYSKGGLY+EA+++F+EFKQAGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1849 MTKEGIRPNVVTYNSIIDAFGRSATA-CGSNE------SLIEPSSLIAL-----KDVSES 1992
            MTKEGIRPNVVTYNSIIDAFGRSAT  C  ++         E ++L A+     KDV E+
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRSATTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEA 594

Query: 1993 NDEHREDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAI 2172
                R DNQIIK+FGQL AEK+   K +NR RQEILC+L VFQKM +L+IKPNVVTFSAI
Sbjct: 595  G---RTDNQIIKVFGQLVAEKAGQGKKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAI 651

Query: 2173 LNACSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPT 2352
            LNACSRCNSFEDASMLLEELR FDNQVYGVAHGLLMG +DN+W QA  LFDEVK MDS T
Sbjct: 652  LNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSST 711

Query: 2353 ASAFYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAW 2532
            ASAFYNALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAW
Sbjct: 712  ASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAW 771

Query: 2533 LLDIRSIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGR 2712
            LL+I SIVFEGHELPKL+SILTGWGKHSKVVGDG LRRA+E LLTGMGAPF VANCNLGR
Sbjct: 772  LLNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGALRRAVEVLLTGMGAPFWVANCNLGR 831

Query: 2713 FISPGAVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            FIS G +VA+WL+ESGTLKVLVLHDDR H E + FD + N++TLTL
Sbjct: 832  FISTGPMVASWLRESGTLKVLVLHDDRTHSENAGFDEMLNMQTLTL 877


>ref|XP_002515260.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545740|gb|EEF47244.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 878

 Score = 1238 bits (3204), Expect = 0.0
 Identities = 640/862 (74%), Positives = 717/862 (83%), Gaps = 8/862 (0%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXXXX 423
            MAS+ PHCSIT TKPYQNHQ+PQN +            W ++KVSLTKP   P       
Sbjct: 1    MASTPPHCSITATKPYQNHQYPQNHLKNHRQTHHHR--WTNQKVSLTKPPLAPSPCNAPK 58

Query: 424  XXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQLS 603
                             PTF SLS      KS+L+ADFSGRRSTRFVSK+HFGR K  ++
Sbjct: 59   AAAAAAAATTTHHTPN-PTFHSLSPLQS-QKSDLSADFSGRRSTRFVSKLHFGRPKTNMN 116

Query: 604  SRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQC 780
             RH+SVA EALQ+ I+   D+K LENV L FES++ G DDY F+LRELGNRG+  KAV+C
Sbjct: 117  -RHTSVALEALQQVIQYGKDDKALENVLLNFESRLCGPDDYTFLLRELGNRGDSAKAVRC 175

Query: 781  FEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAY 960
            FEFAVRRE  +NEQGKLASAMIS LGRLGKV+LA+ VF+  LK GYG TVYA+SALISAY
Sbjct: 176  FEFAVRRESGKNEQGKLASAMISTLGRLGKVELAKAVFDTALKEGYGKTVYAFSALISAY 235

Query: 961  GRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPD 1140
            GRSG  +EAIKVF++MK++GL PNLVTYNAVIDACGKGGVEF +  EI + ML NG+QPD
Sbjct: 236  GRSGYCNEAIKVFDSMKSNGLMPNLVTYNAVIDACGKGGVEFKKVVEIFDGMLSNGVQPD 295

Query: 1141 RITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMA 1320
            RITFNSLLAVCSRGGLWEAAR  FS MV +GIDQDIFTYNTLLDA CKGGQMDLAF+IM+
Sbjct: 296  RITFNSLLAVCSRGGLWEAARRLFSAMVDKGIDQDIFTYNTLLDAVCKGGQMDLAFEIMS 355

Query: 1321 EMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLG 1500
            EM  KNI PNVVTYSTMIDGYAKVGRLDDALN+  EMKF G+GLDRVSYNTLLS+YAKLG
Sbjct: 356  EMPTKNILPNVVTYSTMIDGYAKVGRLDDALNMFNEMKFLGVGLDRVSYNTLLSVYAKLG 415

Query: 1501 RFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYS 1680
            RFE  L+VC EME++GI+KD VTYNALL GYGKQ +YDEV+RVF+EMK   +SPNLLTYS
Sbjct: 416  RFEQALDVCKEMENAGIRKDVVTYNALLAGYGKQYRYDEVRRVFEEMKRGRVSPNLLTYS 475

Query: 1681 TLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKE 1860
            TLIDVYSKGGLY+EA+EVF+EFKQAGLKADVVLYSALIDALCKNGLVE +V+LLDEMTKE
Sbjct: 476  TLIDVYSKGGLYKEAMEVFREFKQAGLKADVVLYSALIDALCKNGLVESSVTLLDEMTKE 535

Query: 1861 GIRPNVVTYNSIIDAFGRSATA-CGSNES------LIEPSSLIALKDVSESNDEHREDNQ 2019
            GIRPNVVTYNSIIDAFGRSA+A C  ++S       +E  S I +++  ES    +EDN+
Sbjct: 536  GIRPNVVTYNSIIDAFGRSASAQCVVDDSGETTALQVESLSSIVVQEAIESQAADKEDNR 595

Query: 2020 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2199
            II+IFG+LAAEK+C AK  N G+QEILC+L VFQKM EL+IKPNVVTFSAILNACSRC+S
Sbjct: 596  IIEIFGKLAAEKACEAK--NSGKQEILCILGVFQKMHELKIKPNVVTFSAILNACSRCDS 653

Query: 2200 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2379
            FEDASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFYNALT
Sbjct: 654  FEDASMLLEELRLFDNQVYGVAHGLLMGYRENVWLQAQSLFDEVKLMDSSTASAFYNALT 713

Query: 2380 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2559
            DMLWHFGQK+GAQLVVLEGKRR VWEN+WSDSCLDLHLMSSGAARAMVHAWLL+IRSIVF
Sbjct: 714  DMLWHFGQKRGAQLVVLEGKRRQVWENIWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVF 773

Query: 2560 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2739
            EGHELPKL+SILTGWGKHSKVVGD  LRRA+EALL GMGAPFR+A CNLGRFIS G+VVA
Sbjct: 774  EGHELPKLLSILTGWGKHSKVVGDSALRRAVEALLIGMGAPFRLAKCNLGRFISTGSVVA 833

Query: 2740 AWLKESGTLKVLVLHDDRAHQE 2805
            AWLKESGTL+VLVLHDDR H E
Sbjct: 834  AWLKESGTLEVLVLHDDRTHPE 855


>ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Populus trichocarpa]
            gi|550323986|gb|EEE99285.2| hypothetical protein
            POPTR_0014s11380g [Populus trichocarpa]
          Length = 875

 Score = 1227 bits (3175), Expect = 0.0
 Identities = 641/881 (72%), Positives = 716/881 (81%), Gaps = 12/881 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGT-KPYQNHQFPQNPIXXXXXXXXXXXX--W-AHRKVSLTKPSQVPHXX 411
            MAS+ PHCSITGT KPY N+ +P +                W A+++VSLTKP   P   
Sbjct: 1    MASTPPHCSITGTTKPYHNNPYPHSHFKNHRQTHHQNPHQRWTANQRVSLTKPPLPPSSR 60

Query: 412  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 591
                                 PTFPSL       KSEL +DFSGRRSTRFVSK++FGR +
Sbjct: 61   NAPKPPATTTTTTTTHHPQIHPTFPSLQSP----KSELASDFSGRRSTRFVSKLNFGRPR 116

Query: 592  AQLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 768
              + +RH+SVAEEALQ  I    DE  LENV L FES++SGSDDYIF+LRELGNRG+C K
Sbjct: 117  TTMGTRHTSVAEEALQNVIEYGKDEGALENVLLNFESRLSGSDDYIFLLRELGNRGDCKK 176

Query: 769  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 948
            A+ CFEFAV+RER++NEQGKLASAMIS LGRLGKV++A+ VFE  L  GYGNTVYA+SA+
Sbjct: 177  AICCFEFAVKRERKKNEQGKLASAMISTLGRLGKVEIAKSVFEAALIEGYGNTVYAFSAI 236

Query: 949  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1128
            ISAYGRSG  DEAIKVF++MK+ GLKPNLVTYNAVIDACGKGGVEF +  EI +EMLRNG
Sbjct: 237  ISAYGRSGYCDEAIKVFDSMKHYGLKPNLVTYNAVIDACGKGGVEFKRVVEIFDEMLRNG 296

Query: 1129 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1308
            +QPDRITFNSLLAVCSRGGLWEAAR+  SEM+ RGIDQDIFTYNTLLDA CKGGQMD+AF
Sbjct: 297  VQPDRITFNSLLAVCSRGGLWEAARSLSSEMLNRGIDQDIFTYNTLLDAVCKGGQMDMAF 356

Query: 1309 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1488
            +IM+EM AKNI PNVVTYSTMIDGYAK GR DDALNL  EMKF  I LDRVSYNTLLSIY
Sbjct: 357  EIMSEMPAKNILPNVVTYSTMIDGYAKAGRFDDALNLFNEMKFLCISLDRVSYNTLLSIY 416

Query: 1489 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1668
            AKLGRF++ L+VC EME+ GI+KD VTYNALLGGYGKQ KYDEV+RVF EMK   +SPNL
Sbjct: 417  AKLGRFQEALDVCREMENCGIRKDVVTYNALLGGYGKQCKYDEVRRVFGEMKAGRVSPNL 476

Query: 1669 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1848
            LTYSTLIDVYSKGGLYREA++VF+EFK+AGLKADVVLYSA+IDALCKNGLVE AVSLLDE
Sbjct: 477  LTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADVVLYSAVIDALCKNGLVESAVSLLDE 536

Query: 1849 MTKEGIRPNVVTYNSIIDAFGRSATA-------CGSNESLIEPSSLIALKDVSESNDEHR 2007
            MTKEGIRPNVVTYNSIIDAFGRSA           +++  IE  S   +++ ++S    R
Sbjct: 537  MTKEGIRPNVVTYNSIIDAFGRSAITESVVDDNVQTSQLQIESLSSGVVEEATKSLLADR 596

Query: 2008 EDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACS 2187
            E N+IIKIFGQLA EK+  AK  N   QE++C+L VF KM ELEIKPNVVTFSAILNACS
Sbjct: 597  EGNRIIKIFGQLAVEKAGQAK--NCSGQEMMCILAVFHKMHELEIKPNVVTFSAILNACS 654

Query: 2188 RCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFY 2367
            RCNSFEDASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFY
Sbjct: 655  RCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFY 714

Query: 2368 NALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIR 2547
            NALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAWLL+IR
Sbjct: 715  NALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWLLNIR 774

Query: 2548 SIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPG 2727
            SIVFEGHELPKL+SILTGWGKHSKVVGD TLRRAIEALL GMGAPFR+A CNLGRFIS G
Sbjct: 775  SIVFEGHELPKLLSILTGWGKHSKVVGDSTLRRAIEALLMGMGAPFRLAKCNLGRFISTG 834

Query: 2728 AVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            +VVAAWL+ESGTLKVLVLHD R  QE   F +  NL+TL L
Sbjct: 835  SVVAAWLRESGTLKVLVLHDHRTEQENLRFGQASNLQTLQL 875


>ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345388|gb|ERP64510.1| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 873

 Score = 1226 bits (3172), Expect = 0.0
 Identities = 635/881 (72%), Positives = 720/881 (81%), Gaps = 12/881 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGT-KPYQNHQFPQNPIXXXXXXXXXXXX--W-AHRKVSLTKPSQVPHXX 411
            MAS+ PHCSIT T K YQNH +P N +              W ++++VSL KP   P   
Sbjct: 1    MASTPPHCSITATTKHYQNHPYPHNQLKNHRQTHNQNPHQRWTSNQRVSLAKPPLPPSRN 60

Query: 412  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 591
                                 PTF S      P KSEL +DF GRRSTRFVSK+HFGR +
Sbjct: 61   APKPAATTTTTTTQHPQIH--PTFSSFQ----PPKSELVSDFPGRRSTRFVSKLHFGRPR 114

Query: 592  AQLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 768
              + +RH+SVA+EALQ  I    DE+ LENV L FES++SGSDDY+F+LRELGNRG+C K
Sbjct: 115  TTMGTRHTSVAQEALQNVIEYGKDERALENVLLNFESRLSGSDDYVFLLRELGNRGDCKK 174

Query: 769  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 948
            A+ CFEFAV+RER++NEQGKLASAMIS LGRLGKV++A+ VF+  L  GYGNTVYA+SA+
Sbjct: 175  AICCFEFAVKRERKKNEQGKLASAMISTLGRLGKVEMAKTVFKAALTEGYGNTVYAFSAI 234

Query: 949  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1128
            ISAYGRSG  +EAIK+F +MK+ GLKPNLVTYNAVIDACGKGGVEF +  EI +EMLRNG
Sbjct: 235  ISAYGRSGYCNEAIKIFYSMKDYGLKPNLVTYNAVIDACGKGGVEFKRVLEIFDEMLRNG 294

Query: 1129 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1308
            +QPDRITFNSLLAVCS+GGLWEAAR+   EMV RGIDQDIFTYNTLLDA CKGGQ+D+AF
Sbjct: 295  MQPDRITFNSLLAVCSKGGLWEAARSLSCEMVNRGIDQDIFTYNTLLDAVCKGGQLDMAF 354

Query: 1309 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1488
            +IM+EM AKNI PNVVTYSTMIDGYAK GRLDDA NL  EMKF GI LDRVSYNTLLSIY
Sbjct: 355  EIMSEMPAKNILPNVVTYSTMIDGYAKAGRLDDARNLFNEMKFLGISLDRVSYNTLLSIY 414

Query: 1489 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1668
            AKLGRFE+ ++VC EME+SGI+KD VTYNALLGGYGKQ KYD V++VF+EMK RH+SPNL
Sbjct: 415  AKLGRFEEAMDVCREMENSGIRKDVVTYNALLGGYGKQYKYDVVRKVFEEMKARHVSPNL 474

Query: 1669 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1848
            LTYSTLIDVYSKGGLYREA++VF+EFK+AGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1849 MTKEGIRPNVVTYNSIIDAFGRSAT-------ACGSNESLIEPSSLIALKDVSESNDEHR 2007
            MTKEGIRPNVVTYNSIIDAFGR AT       A  ++E  I+  S  A++  ++S    R
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRPATTESVVDDAGQTSELQIDSLSSSAVEKATKSLVADR 594

Query: 2008 EDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACS 2187
            EDN+IIKIFGQLAAEK+  AK  N G QE++C+L VF KM ELEIKPNVVTFSAILNACS
Sbjct: 595  EDNRIIKIFGQLAAEKAGQAK--NSGGQEMMCILGVFHKMHELEIKPNVVTFSAILNACS 652

Query: 2188 RCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFY 2367
            RCNSFE+ASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFY
Sbjct: 653  RCNSFEEASMLLEELRLFDNQVYGVAHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFY 712

Query: 2368 NALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIR 2547
            NALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAWLL++R
Sbjct: 713  NALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWLLNVR 772

Query: 2548 SIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPG 2727
            +IVFEGHE+PKL+SILTGWGKHSKVVGD TLRRA+EALL GMGAPFR A CNLGR IS G
Sbjct: 773  AIVFEGHEVPKLLSILTGWGKHSKVVGDSTLRRAVEALLMGMGAPFRSAKCNLGRLISTG 832

Query: 2728 AVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            +VVA+WL+ESGTLKVLVLHDDR HQE   F +I NL+ L L
Sbjct: 833  SVVASWLRESGTLKVLVLHDDRTHQENLRFGQISNLQMLQL 873


>gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]
          Length = 871

 Score = 1219 bits (3154), Expect = 0.0
 Identities = 628/878 (71%), Positives = 710/878 (80%), Gaps = 11/878 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNP---IXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXX 414
            MAS+ PHCSIT +KPYQ+HQ+ QNP                W  +KVSLTKPS  P    
Sbjct: 1    MASTPPHCSITASKPYQSHQYAQNPNLKSHHRHSNHRQGHQWTTQKVSLTKPSPSP---- 56

Query: 415  XXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKA 594
                                P F SL   P P KS+L A FSGRRSTRFVSKMH GR K 
Sbjct: 57   ---PPARNAAATPAQHASQNPAFHSLCSLPAP-KSDLAAVFSGRRSTRFVSKMHLGRPKT 112

Query: 595  QLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKA 771
             + SRH++VAEE LQ++I+   D+ G++NV L FE K+ GSDDY F+LRELGNRGEC KA
Sbjct: 113  TVGSRHTAVAEEVLQQAIQFGKDDLGIDNVLLSFEPKLCGSDDYTFLLRELGNRGECRKA 172

Query: 772  VQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALI 951
            ++CFEFAV RERR+ EQGKL SAMIS LGRLGKV+LAR VFE  L AGYGNTVY YSALI
Sbjct: 173  IRCFEFAVARERRKTEQGKLTSAMISTLGRLGKVELARDVFETALFAGYGNTVYTYSALI 232

Query: 952  SAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGI 1131
            SAYGRSG  +EA +V E+MK+SGLKPNLVTYNAVIDACGKGG EF +  EI +EMLRNG+
Sbjct: 233  SAYGRSGYWEEARRVVESMKDSGLKPNLVTYNAVIDACGKGGAEFKRVVEIFDEMLRNGV 292

Query: 1132 QPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQ 1311
            QPDRIT+NSLLAVCSRGGLWEAAR+ FSEMV R IDQDI+TYNTLLDA CKGGQMDLA Q
Sbjct: 293  QPDRITYNSLLAVCSRGGLWEAARSLFSEMVERQIDQDIYTYNTLLDAICKGGQMDLARQ 352

Query: 1312 IMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYA 1491
            IM+EM +K I PNVVTYSTMIDGYAK GRL+DALNL  EMK+  IGLDRV YNTLLSIYA
Sbjct: 353  IMSEMPSKKILPNVVTYSTMIDGYAKAGRLEDALNLFNEMKYLAIGLDRVLYNTLLSIYA 412

Query: 1492 KLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLL 1671
            KLGRFE+ L VC EMESSGI +D V+YNALLGGYGKQGKYDEVKR++++MK  H+SPNLL
Sbjct: 413  KLGRFEEALKVCKEMESSGIVRDVVSYNALLGGYGKQGKYDEVKRMYQDMKADHVSPNLL 472

Query: 1672 TYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEM 1851
            TYSTLIDVYSKGGLYREA+EVF+EFKQAGLKADVVLYS LI+ALCKNG+VE AVSLLDEM
Sbjct: 473  TYSTLIDVYSKGGLYREAMEVFREFKQAGLKADVVLYSELINALCKNGMVESAVSLLDEM 532

Query: 1852 TKEGIRPNVVTYNSIIDAFGRSAT-------ACGSNESLIEPSSLIALKDVSESNDEHRE 2010
            TKEGI PNV+TYNSIIDAFGR AT       A G NE   E SS I+ ++ +++   ++ 
Sbjct: 533  TKEGIMPNVITYNSIIDAFGRPATADSALGAAIGGNELETELSSSISNENANKNKAVNKG 592

Query: 2011 DNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSR 2190
            D+QIIK+FGQLAAE+  H K D + RQEILC+L VFQKM EL IKPNVVTFSAILNACSR
Sbjct: 593  DHQIIKMFGQLAAEQEGHTKKDKKIRQEILCILGVFQKMHELNIKPNVVTFSAILNACSR 652

Query: 2191 CNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYN 2370
            CNSFEDASMLLEELR FDNQVYGVAHGLLMG ++NVW +AQ LFDEVKQMDS TASAFYN
Sbjct: 653  CNSFEDASMLLEELRLFDNQVYGVAHGLLMGHRENVWLEAQSLFDEVKQMDSSTASAFYN 712

Query: 2371 ALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRS 2550
            ALTDMLWHFGQK+GAQLVVLEGKRR+VWE++WS+S LDLHLMSSGAARA++HAWLL+IRS
Sbjct: 713  ALTDMLWHFGQKRGAQLVVLEGKRRNVWESVWSNSFLDLHLMSSGAARALLHAWLLNIRS 772

Query: 2551 IVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGA 2730
            +VFEG ELP+L+SILTGWGKHSKVVGD  LRRAIE+LL  MGAPF  A CNLGRF SPG 
Sbjct: 773  VVFEGQELPRLLSILTGWGKHSKVVGDSALRRAIESLLISMGAPFEAAKCNLGRFTSPGP 832

Query: 2731 VVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTL 2844
            +VA WLKESGTLKVLVLHDDR+H + +    + NL+TL
Sbjct: 833  MVAGWLKESGTLKVLVLHDDRSHSQNA--KHVSNLQTL 868


>ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score = 1209 bits (3127), Expect = 0.0
 Identities = 618/878 (70%), Positives = 698/878 (79%), Gaps = 9/878 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAH-RKVSLTKPSQVPHXXXXX 420
            MAS+ PHCSIT  KPYQ HQ+PQN +            W    K  L KP          
Sbjct: 1    MASTPPHCSITAAKPYQTHQYPQNNLKNHRQNARQNGPWTTTHKFPLVKP--------LP 52

Query: 421  XXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQL 600
                              P FPSL   P  +KSEL ++FSGRRSTRFVSK HFGR K+ +
Sbjct: 53   STPGHSATKSTSTPLSQSPNFPSLCSLPT-SKSELASNFSGRRSTRFVSKFHFGRPKSSM 111

Query: 601  SSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQ 777
            ++RHS++AEE L + ++   D+  L+N+ L FESK+ GS+DY F+LRELGNRGECWKA++
Sbjct: 112  TTRHSAIAEEVLHQVLQFGKDDASLDNILLNFESKLCGSEDYTFLLRELGNRGECWKAIR 171

Query: 778  CFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISA 957
            CF+FA+ RE R+NE+GKLASAMIS LGRLGKV+LA+GVFE  L  GYGNTV+A+SALISA
Sbjct: 172  CFDFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISA 231

Query: 958  YGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQP 1137
            YG+SG  DEAIKVFE+MK SGLKPNLVTYNAVIDACGKGGVEF +  EI  EMLRNG+QP
Sbjct: 232  YGKSGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQP 291

Query: 1138 DRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIM 1317
            DRIT+NSLLAVCSRGGLWEAARN F+EM+ RGIDQD+FTYNTLLDA CKGGQMDLA++IM
Sbjct: 292  DRITYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIM 351

Query: 1318 AEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKL 1497
             EM  K I PNVVTYSTM DGYAK GRL+DALNL  EMKF GIGLDRVSYNTLLSIYAKL
Sbjct: 352  LEMPGKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKL 411

Query: 1498 GRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTY 1677
            GRFED L VC EM SSG+KKD VTYNALL GYGKQGK++EV RVFKEMK   + PNLLTY
Sbjct: 412  GRFEDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTY 471

Query: 1678 STLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTK 1857
            STLIDVYSKG LY EA+EVF+EFKQAGLKADVVLYS LI+ALCKNGLV+ AV LLDEMTK
Sbjct: 472  STLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTK 531

Query: 1858 EGIRPNVVTYNSIIDAFGRSATA-------CGSNESLIEPSSLIALKDVSESNDEHREDN 2016
            EGIRPNVVTYNSIIDAFGRS TA         SNE   E  + + ++ V ES + + +D 
Sbjct: 532  EGIRPNVVTYNSIIDAFGRSTTAEFLVDGVGASNERQSESPTFMLIEGVDES-EINWDDG 590

Query: 2017 QIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCN 2196
             + K + QL +EK   AK +  G++EI  +L VF+KM ELEIKPNVVTFSAILNACSRC 
Sbjct: 591  HVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKKMHELEIKPNVVTFSAILNACSRCK 650

Query: 2197 SFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNAL 2376
            S EDASMLLEELR FDNQVYGVAHGLLMG  +NVW QAQ LFDEVKQMDS TASAFYNAL
Sbjct: 651  SIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNAL 710

Query: 2377 TDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIV 2556
            TDMLWHFGQK+GAQLVVLEGKRR VWE +WSDSCLDLHLMSSGAARAMVHAWLL I S+V
Sbjct: 711  TDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVV 770

Query: 2557 FEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVV 2736
            FEGH+LPKL+SILTGWGKHSKVVGDG LRRAIEALLT MGAPFRVA CN+GR++S G+VV
Sbjct: 771  FEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALLTSMGAPFRVAKCNIGRYVSTGSVV 830

Query: 2737 AAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            AAWLKESGTLK+LVLHDDR H +T   D I  L+T++L
Sbjct: 831  AAWLKESGTLKLLVLHDDRTHPDTENMDLISKLQTISL 868


>ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score = 1208 bits (3126), Expect = 0.0
 Identities = 618/878 (70%), Positives = 698/878 (79%), Gaps = 9/878 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAH-RKVSLTKPSQVPHXXXXX 420
            MAS+ PHCSIT  KPYQ HQ+PQN +            W    K  L KP          
Sbjct: 1    MASTPPHCSITAAKPYQTHQYPQNNLKNHRQNARQNGPWTTTHKFPLVKP--------LP 52

Query: 421  XXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQL 600
                              P FPSL   P  +KSEL ++FSGRRSTRFVSK HFGR K+ +
Sbjct: 53   STPGHSATKSTSTPLSQSPNFPSLCSLPT-SKSELASNFSGRRSTRFVSKFHFGRPKSSM 111

Query: 601  SSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQ 777
            ++RHS++AEE L + ++   D+  L+N+ L FESK+ GS+DY F+LRELGNRGECWKA++
Sbjct: 112  TTRHSAIAEEVLHQVLQFGKDDASLDNILLNFESKLCGSEDYTFLLRELGNRGECWKAIR 171

Query: 778  CFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISA 957
            CF+FA+ RE R+NE+GKLASAMIS LGRLGKV+LA+GVFE  L  GYGNTV+A+SALISA
Sbjct: 172  CFDFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISA 231

Query: 958  YGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQP 1137
            YG+SG  DEAIKVFE+MK SGLKPNLVTYNAVIDACGKGGVEF +  EI  EMLRNG+QP
Sbjct: 232  YGKSGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQP 291

Query: 1138 DRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIM 1317
            DRIT+NSLLAVCSRGGLWEAARN F+EM+ RGIDQD+FTYNTLLDA CKGGQMDLA++IM
Sbjct: 292  DRITYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIM 351

Query: 1318 AEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKL 1497
             EM  K I PNVVTYSTM DGYAK GRL+DALNL  EMKF GIGLDRVSYNTLLSIYAKL
Sbjct: 352  LEMPGKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKL 411

Query: 1498 GRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTY 1677
            GRFED L VC EM SSG+KKD VTYNALL GYGKQGK++EV RVFKEMK   + PNLLTY
Sbjct: 412  GRFEDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTY 471

Query: 1678 STLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTK 1857
            STLIDVYSKG LY EA+EVF+EFKQAGLKADVVLYS LI+ALCKNGLV+ AV LLDEMTK
Sbjct: 472  STLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTK 531

Query: 1858 EGIRPNVVTYNSIIDAFGRSATA-------CGSNESLIEPSSLIALKDVSESNDEHREDN 2016
            EGIRPNVVTYNSIIDAFGRS TA         SNE   E  S + ++ V ES + + +D 
Sbjct: 532  EGIRPNVVTYNSIIDAFGRSTTAEFLVDGVGASNERQSESPSFMLIEGVDES-EINWDDG 590

Query: 2017 QIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCN 2196
             + K + QL +EK   AK +  G++EI  +L VF+KM ELEIKPNVVTFSAILNACSRC 
Sbjct: 591  HVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKKMHELEIKPNVVTFSAILNACSRCK 650

Query: 2197 SFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNAL 2376
            S EDASMLLEELR FDNQVYGVAHGLLMG  +NVW QAQ LFDEVKQMDS TASAFYNAL
Sbjct: 651  SIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNAL 710

Query: 2377 TDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIV 2556
            TDMLWHFGQK+GAQLVVLEGKRR VWE +WSDSCLDLHLMSSGAARAMVHAWLL I S+V
Sbjct: 711  TDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVV 770

Query: 2557 FEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVV 2736
            FEGH+LPKL+SILTGWGKHSKVVGDG LRRAIEALLT MGAPFRVA CN+GR++S G+VV
Sbjct: 771  FEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALLTSMGAPFRVAKCNIGRYVSTGSVV 830

Query: 2737 AAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            AAWLKESGTLK+LVLHDDR H ++   D I  L+T++L
Sbjct: 831  AAWLKESGTLKLLVLHDDRTHPDSENMDLISKLQTISL 868


>ref|XP_007221553.1| hypothetical protein PRUPE_ppa001263mg [Prunus persica]
            gi|462418303|gb|EMJ22752.1| hypothetical protein
            PRUPE_ppa001263mg [Prunus persica]
          Length = 868

 Score = 1199 bits (3103), Expect = 0.0
 Identities = 610/877 (69%), Positives = 702/877 (80%), Gaps = 8/877 (0%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXXXX 423
            MAS+ PHCSIT TKPYQ H++PQN              W  ++VSL KP  +P       
Sbjct: 1    MASTPPHCSITATKPYQTHRYPQNQHLKSQRQSRQSNQWTKQQVSLPKPLPLPSQAPRTA 60

Query: 424  XXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQLS 603
                              +F SL   P P KS+L   FSGRRSTRFVSKMH GR K  + 
Sbjct: 61   AKTPTATPTS--------SFSSLCPLPHP-KSDLVTAFSGRRSTRFVSKMHLGRPKTTMG 111

Query: 604  SRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQC 780
            S  S +AEEAL ++++  ND+  L+++ L F S++ GSDDY F+ RELGNRGECWKA++C
Sbjct: 112  SYRSPLAEEALHQAVQFGNDDLALDDILLSFHSRLCGSDDYTFLFRELGNRGECWKAIRC 171

Query: 781  FEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAY 960
            FEFAVRRE+RR EQGKLAS+MIS LGRLGKV+LA+ VF+  +  GYG TVY YSALI+AY
Sbjct: 172  FEFAVRREKRRTEQGKLASSMISTLGRLGKVELAKNVFQTAVNEGYGKTVYTYSALITAY 231

Query: 961  GRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPD 1140
            GR+G  +EAI+VFE+MK+SGLKPNLVTYNAVIDA GKGGVEF +  EI NEMLRNG QPD
Sbjct: 232  GRNGYCEEAIRVFESMKDSGLKPNLVTYNAVIDAYGKGGVEFKRVVEIFNEMLRNGEQPD 291

Query: 1141 RITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMA 1320
            RIT+NSLLAVCSRGGLWE ARN FSEMV RGIDQDI+TYNTL+DA CKGGQMDLA+QIM+
Sbjct: 292  RITYNSLLAVCSRGGLWEMARNLFSEMVDRGIDQDIYTYNTLIDAICKGGQMDLAYQIMS 351

Query: 1321 EMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLG 1500
            EM +KNI PNVVTYST+IDGYAK GRL+DAL+L  EMKF  IGLDRV YNTLLS+Y KLG
Sbjct: 352  EMPSKNILPNVVTYSTIIDGYAKAGRLEDALSLFNEMKFLAIGLDRVLYNTLLSLYGKLG 411

Query: 1501 RFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYS 1680
            RFED L VC EMES GI KD V+YNALLGGYGKQGKYD+ KR++ +MK   +SPN+LTYS
Sbjct: 412  RFEDALKVCKEMESVGIAKDVVSYNALLGGYGKQGKYDDAKRMYNQMKEERVSPNILTYS 471

Query: 1681 TLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKE 1860
            TLIDVYSKGGLY EA++VF+EFKQAGLKADVVLYS L++ALCKNGLVE AV LLDEMTKE
Sbjct: 472  TLIDVYSKGGLYMEAMKVFREFKQAGLKADVVLYSELVNALCKNGLVESAVLLLDEMTKE 531

Query: 1861 GIRPNVVTYNSIIDAFGRSAT-------ACGSNESLIEPSSLIALKDVSESNDEHREDNQ 2019
            GIRPNVVTYNSIIDAFGRSAT       A G      E SS ++  D        R DN+
Sbjct: 532  GIRPNVVTYNSIIDAFGRSATTECAADAAGGGIVLQTESSSSVSEGDAIGIQVGDRGDNR 591

Query: 2020 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2199
             +K+FGQLAAEK+ +AK D + RQEILC+L +FQKM EL+IKPNVVTFSAILNACSRCNS
Sbjct: 592  FMKMFGQLAAEKAGYAKTDRKVRQEILCILGIFQKMHELDIKPNVVTFSAILNACSRCNS 651

Query: 2200 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2379
            FEDASMLLEELR FDN+VYGVAHGLLMG +DNVW +A+ LFDEVKQMDS TASAFYNALT
Sbjct: 652  FEDASMLLEELRLFDNKVYGVAHGLLMGYRDNVWVKAESLFDEVKQMDSSTASAFYNALT 711

Query: 2380 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2559
            DMLWH+GQK+GAQLVVLEGKRR+VWE++WS+SCLDLHLMSSGAARAMVHAWLL+IRSIVF
Sbjct: 712  DMLWHYGQKQGAQLVVLEGKRRNVWESVWSNSCLDLHLMSSGAARAMVHAWLLNIRSIVF 771

Query: 2560 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2739
            EG +LP L+SILTGWGKHSKVVGD TLRRAIEALLT MGAPFRVA CNLGRFIS G++ A
Sbjct: 772  EGQQLPNLLSILTGWGKHSKVVGDSTLRRAIEALLTSMGAPFRVAKCNLGRFISTGSMAA 831

Query: 2740 AWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            AWL+ESGTL+VLVLHDDR   +++  ++  NL+ L L
Sbjct: 832  AWLRESGTLEVLVLHDDRTCPKSADLEQTSNLQALAL 868


>ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345387|gb|EEE80792.2| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 864

 Score = 1198 bits (3099), Expect = 0.0
 Identities = 626/881 (71%), Positives = 711/881 (80%), Gaps = 12/881 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGT-KPYQNHQFPQNPIXXXXXXXXXXXX--W-AHRKVSLTKPSQVPHXX 411
            MAS+ PHCSIT T K YQNH +P N +              W ++++VSL KP   P   
Sbjct: 1    MASTPPHCSITATTKHYQNHPYPHNQLKNHRQTHNQNPHQRWTSNQRVSLAKPPLPPSRN 60

Query: 412  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 591
                                 PTF S      P KSEL +DF GRRSTRFVSK+HFGR +
Sbjct: 61   APKPAATTTTTTTQHPQIH--PTFSSFQ----PPKSELVSDFPGRRSTRFVSKLHFGRPR 114

Query: 592  AQLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 768
              + +RH+SVA+EALQ  I    DE+ LENV L FES++SGSDDY+F+LRELGNRG+C K
Sbjct: 115  TTMGTRHTSVAQEALQNVIEYGKDERALENVLLNFESRLSGSDDYVFLLRELGNRGDCKK 174

Query: 769  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 948
            A+ CFEFAV+RER++NEQGKLASAMIS LGRLGKV++A+ VF+  L  GYGNTVYA+SA+
Sbjct: 175  AICCFEFAVKRERKKNEQGKLASAMISTLGRLGKVEMAKTVFKAALTEGYGNTVYAFSAI 234

Query: 949  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1128
            ISAYGRSG  +EAIK+F +MK+ GLKPNLVTYNAVIDACGKGGVEF +  EI +EMLRNG
Sbjct: 235  ISAYGRSGYCNEAIKIFYSMKDYGLKPNLVTYNAVIDACGKGGVEFKRVLEIFDEMLRNG 294

Query: 1129 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1308
            +QPDRITFNSLLAVCS+GGLWEAAR+   EMV RGIDQDIFTYNTLLDA CKGGQ+D+AF
Sbjct: 295  MQPDRITFNSLLAVCSKGGLWEAARSLSCEMVNRGIDQDIFTYNTLLDAVCKGGQLDMAF 354

Query: 1309 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1488
            +IM+EM AKNI PNVVTYSTMIDGYAK GRLDDA NL  EMKF GI LDRVSYNTLLSIY
Sbjct: 355  EIMSEMPAKNILPNVVTYSTMIDGYAKAGRLDDARNLFNEMKFLGISLDRVSYNTLLSIY 414

Query: 1489 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1668
            AKLGRFE+ ++VC EME+SGI+KD VTYNALLGGYGKQ KYD V++VF+EMK RH+SPNL
Sbjct: 415  AKLGRFEEAMDVCREMENSGIRKDVVTYNALLGGYGKQYKYDVVRKVFEEMKARHVSPNL 474

Query: 1669 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1848
            LTYSTLIDVYSKGGLYREA++VF+EFK+AGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1849 MTKEGIRPNVVTYNSIIDAFGRSAT-------ACGSNESLIEPSSLIALKDVSESNDEHR 2007
            MTKEGIRPNVVTYNSIIDAFGR AT       A  ++E  I+  S  A++  ++S    R
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRPATTESVVDDAGQTSELQIDSLSSSAVEKATKSLVADR 594

Query: 2008 EDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACS 2187
            EDN+IIKIFGQLAAEK+  AK  N G QE++C+L VF KM ELEIKPNVVTFSAILNACS
Sbjct: 595  EDNRIIKIFGQLAAEKAGQAK--NSGGQEMMCILGVFHKMHELEIKPNVVTFSAILNACS 652

Query: 2188 RCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFY 2367
            RCNSFE+ASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFY
Sbjct: 653  RCNSFEEASMLLEELRLFDNQVYGVAHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFY 712

Query: 2368 NALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIR 2547
            NALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAWLL++R
Sbjct: 713  NALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWLLNVR 772

Query: 2548 SIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPG 2727
            +IVFEGHE+PKL+         SKVVGD TLRRA+EALL GMGAPFR A CNLGR IS G
Sbjct: 773  AIVFEGHEVPKLL---------SKVVGDSTLRRAVEALLMGMGAPFRSAKCNLGRLISTG 823

Query: 2728 AVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            +VVA+WL+ESGTLKVLVLHDDR HQE   F +I NL+ L L
Sbjct: 824  SVVASWLRESGTLKVLVLHDDRTHQENLRFGQISNLQMLQL 864


>ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 870

 Score = 1187 bits (3071), Expect = 0.0
 Identities = 618/883 (69%), Positives = 701/883 (79%), Gaps = 14/883 (1%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXXXX 423
            MAS+ PHCSIT TKPYQ HQ+PQN                   VSL+KP  +P       
Sbjct: 1    MASTPPHCSITATKPYQTHQYPQNQRLKSHRQTRPTT----HHVSLSKPLPLP------- 49

Query: 424  XXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQLS 603
                             P   S S   PP KS+L + FSGRRSTR VSKMH GR K  + 
Sbjct: 50   -PRPPPRTVPKPASAAGPVPSSFSSLCPPAKSDLVSAFSGRRSTRMVSKMHLGRPKTTVG 108

Query: 604  SRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQC 780
            SRHS +AEEAL+ +IR   D+  L++V   FES++  SDD+ F+LRELGNRGECWKA++C
Sbjct: 109  SRHSPLAEEALETAIRFGKDDFALDDVLHSFESRLV-SDDFTFLLRELGNRGECWKAIRC 167

Query: 781  FEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAY 960
            FEFAVRRER+R EQGKLAS+MIS LGRLGKV+LA+ VF+  +  GYG TVY YSALISAY
Sbjct: 168  FEFAVRRERKRTEQGKLASSMISTLGRLGKVELAKNVFQTAVNEGYGRTVYTYSALISAY 227

Query: 961  GRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPD 1140
            GRSG  DEAI+V E+MK+SG+KPNLVTYNAVIDACGKGGVEF +  EI +EML+ G+QPD
Sbjct: 228  GRSGYCDEAIRVLESMKDSGVKPNLVTYNAVIDACGKGGVEFKKVVEIFDEMLKVGVQPD 287

Query: 1141 RITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMA 1320
            RIT+NSLLAVCSRGGLWEAARN FSEMV RGIDQDI+TYNTLLDA  KGGQMDLA++IM+
Sbjct: 288  RITYNSLLAVCSRGGLWEAARNLFSEMVDRGIDQDIYTYNTLLDAISKGGQMDLAYKIMS 347

Query: 1321 EMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLG 1500
            EM +KNI PNVVTYSTMIDGYAK GRL+DALNL  EMKF  IGLDRV YNTLLS+Y KLG
Sbjct: 348  EMPSKNILPNVVTYSTMIDGYAKAGRLEDALNLFNEMKFLAIGLDRVLYNTLLSLYGKLG 407

Query: 1501 RFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYS 1680
            RFE+ LNVC EMES GI KD V+YNALLGGYGKQGKYDEVK ++ EMK   +SPNLLTYS
Sbjct: 408  RFEEALNVCKEMESVGIAKDVVSYNALLGGYGKQGKYDEVKGLYNEMKVERVSPNLLTYS 467

Query: 1681 TLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKE 1860
            TLIDVYSKGGLY EA++VF+EFKQAGLKADVVLYS LI+ALCKNGLVE AVSLLDEMTKE
Sbjct: 468  TLIDVYSKGGLYAEAVKVFREFKQAGLKADVVLYSELINALCKNGLVESAVSLLDEMTKE 527

Query: 1861 GIRPNVVTYNSIIDAFGRSAT--------ACGSNESLIEPSSLIALK-DVSESNDEH--- 2004
            GIRPNVVTYNSIIDAFGR AT        ACG        SS+ A   D+S+ N ++   
Sbjct: 528  GIRPNVVTYNSIIDAFGRPATTVCAVDAGACGIVLRSESSSSISARDFDISDKNVQNEMR 587

Query: 2005 -REDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNA 2181
             RED +I+K+FGQL A+K+ +AK D + RQEILC+L VFQKM EL+IKPNVVTFSAILNA
Sbjct: 588  DREDTRIMKMFGQLTADKAGYAKKDRKVRQEILCILGVFQKMHELDIKPNVVTFSAILNA 647

Query: 2182 CSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASA 2361
            CSRCNSFEDASMLLEELR FDNQVYGVAHGLLMG + NVW +AQ LFDEVKQMD  TASA
Sbjct: 648  CSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGCRGNVWVKAQSLFDEVKQMDCSTASA 707

Query: 2362 FYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLD 2541
            FYNALTDMLWHFGQKKGAQLVVLEG+RR+VWEN WS+S LDLHLMSSGAARAMVHAWLL+
Sbjct: 708  FYNALTDMLWHFGQKKGAQLVVLEGERRNVWENAWSNSRLDLHLMSSGAARAMVHAWLLN 767

Query: 2542 IRSIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFIS 2721
            I SIV++G +LP L+SILTGWGKHSKVVGD  LRRA+EALLT MGAPFRV  CN+GRFIS
Sbjct: 768  IHSIVYQGQQLPNLLSILTGWGKHSKVVGDSALRRAVEALLTSMGAPFRVHECNIGRFIS 827

Query: 2722 PGAVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
             G+V AAWLKESGTL+VL+LHDDRA   ++ F +I +LR L L
Sbjct: 828  TGSVAAAWLKESGTLEVLMLHDDRAEPNSANFGQISDLRALAL 870


>ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutrema salsugineum]
            gi|557095737|gb|ESQ36319.1| hypothetical protein
            EUTSA_v10006755mg [Eutrema salsugineum]
          Length = 895

 Score = 1150 bits (2975), Expect = 0.0
 Identities = 603/892 (67%), Positives = 698/892 (78%), Gaps = 36/892 (4%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXX---WAHRKVSLTK--------- 387
            MAS+ PHCSIT TKPYQN+ +PQN +               WA ++ S +          
Sbjct: 1    MASTPPHCSITATKPYQNNPYPQNQLKNHRPSLHPPRYHRPWAPQRFSPSPLGGGTKGRG 60

Query: 388  PSQVPHXXXXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVS 567
             +  P                        P FP+LS    P KS+L+ DF+GRRSTRFVS
Sbjct: 61   SAPSPSSSSSAAVAAAAATTASGQLSQASPRFPALSPLQTP-KSDLSPDFAGRRSTRFVS 119

Query: 568  KMHFGRQKAQLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLREL 744
            KMHFGR K  ++SRHS VAE+AL  +I+ + +++GL+N+ L FESK+ GSDDY ++LREL
Sbjct: 120  KMHFGRPKTAMASRHSLVAEDALHHAIQFSGNDEGLQNLLLSFESKLCGSDDYTYILREL 179

Query: 745  GNRGECWKAVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGN 924
            GNRGE  KAV+ +EFAV+RERR+NEQGKLASAMIS LGRLGKV +A+ VFE  L  GYGN
Sbjct: 180  GNRGEFEKAVRFYEFAVKRERRKNEQGKLASAMISTLGRLGKVGIAKRVFETALADGYGN 239

Query: 925  TVYAYSALISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEI 1104
            TVYA+SA+ISAYGRSG  ++AIKVF +MK  GL+PNLVTYNAVIDACGKGG+EF Q AE 
Sbjct: 240  TVYAFSAIISAYGRSGYHEDAIKVFSSMKGHGLRPNLVTYNAVIDACGKGGMEFKQVAEF 299

Query: 1105 LNEMLRNGIQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCK 1284
             +EM RN +QPDRITFNSLLAVCSRGG WEAARN F EM+ RGI+QDIFTYNTLLDA CK
Sbjct: 300  FDEMQRNRVQPDRITFNSLLAVCSRGGSWEAARNLFDEMLNRGIEQDIFTYNTLLDAICK 359

Query: 1285 GGQMDLAFQIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVS 1464
            GGQMDLAF+I+A+M AKNI PNVVTYST+IDGYAK GR +DAL L GEMK+ GI LDRVS
Sbjct: 360  GGQMDLAFEILAQMPAKNIMPNVVTYSTVIDGYAKAGRFNDALTLFGEMKYLGIPLDRVS 419

Query: 1465 YNTLLSIYAKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMK 1644
            YNTL+SIYAKLGRFE+ L++  EM ++GI+KDAVTYNALLGGYGK  KYDEVK VF EMK
Sbjct: 420  YNTLVSIYAKLGRFEEALDIVKEMAAAGIRKDAVTYNALLGGYGKHEKYDEVKSVFAEMK 479

Query: 1645 TRHLSPNLLTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVE 1824
               + PNLLTYSTLIDVYSKGGLY+EA+E+F+EFK  GL+ADVVLYSALIDALCKNGLVE
Sbjct: 480  QERVLPNLLTYSTLIDVYSKGGLYKEAMEIFREFKSVGLRADVVLYSALIDALCKNGLVE 539

Query: 1825 IAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSATA-C-------GSN-----ESLIEPSSL 1965
             AVSLLDEMTKEGI PNVVTYNS+IDAFGRSAT  C       G+N     ES    S+ 
Sbjct: 540  SAVSLLDEMTKEGISPNVVTYNSMIDAFGRSATTECLADINEGGANGLEEDESFSSSSAS 599

Query: 1966 IALKD-----VSESNDEHR----EDNQIIKIFGQLAAEKSCHAKID-NRGRQEILCVLEV 2115
            ++  D     V E++   +    ED++I++IFGQL  E +   K D  +G QE+ C+LEV
Sbjct: 600  LSHTDSLSLAVGEADSLSKLTKTEDHRIVEIFGQLVTEGNNQIKRDCKQGVQELSCILEV 659

Query: 2116 FQKMQELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDN 2295
              KM ELEIKPNVVTFSAILNACSRCNSFE+ASMLLEELR FDN+VYGVAHGLLMG  +N
Sbjct: 660  CHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELRLFDNKVYGVAHGLLMGYNEN 719

Query: 2296 VWAQAQCLFDEVKQMDSPTASAFYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDS 2475
            VW QAQ LFDEVK MD  TASAFYNALTDMLWHFGQK+GAQ VVLEG+RR VWEN+WSDS
Sbjct: 720  VWIQAQSLFDEVKAMDGSTASAFYNALTDMLWHFGQKRGAQSVVLEGRRRKVWENVWSDS 779

Query: 2476 CLDLHLMSSGAARAMVHAWLLDIRSIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIE 2655
            CLDLHLMSSGAARAMVHAWLL+IRSIV+EGHELPKL+SILTGWGKHSKV+GDGTLRRA+E
Sbjct: 780  CLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKLLSILTGWGKHSKVMGDGTLRRAVE 839

Query: 2656 ALLTGMGAPFRVANCNLGRFISPGAVVAAWLKESGTLKVLVLHDDRAHQETS 2811
            ALL GMGAPF VA CN+GRF+S G+VVAAWL+ESGTLKVLVL +D  H+E S
Sbjct: 840  ALLRGMGAPFHVAKCNVGRFVSSGSVVAAWLRESGTLKVLVL-EDHKHEEAS 890


>ref|XP_006355855.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Solanum tuberosum]
          Length = 848

 Score = 1118 bits (2891), Expect = 0.0
 Identities = 586/873 (67%), Positives = 678/873 (77%), Gaps = 4/873 (0%)
 Frame = +1

Query: 244  MASSTP--HCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXX 417
            MASSTP  HC++T +KPY  H   Q               W+ +KVSL +P+   +    
Sbjct: 1    MASSTPPPHCALTTSKPYHPHPLTQTH-SHPNHRNNHQRHWSSQKVSLNRPAPPRNATHP 59

Query: 418  XXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQ 597
                               P F SLS +    KS+ +ADFSGRRSTRFVSKMHFGR K  
Sbjct: 60   PPSQT--------------PNFLSLSSS----KSDFSADFSGRRSTRFVSKMHFGRAKIS 101

Query: 598  LSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAV 774
             + RHSS AEEAL+E+IR   +E GL+ V L F SK+ GSDDY F+ RELGNRGE   A+
Sbjct: 102  GNGRHSSFAEEALEEAIRCCKNEAGLDQVLLTFGSKLLGSDDYTFLFRELGNRGEWLAAM 161

Query: 775  QCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALIS 954
            +CFEFAV RER+RNEQGKLAS+MISILGR GKVDLA  VFE  +  GYGNTVYAYSALIS
Sbjct: 162  RCFEFAVGRERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGNTVYAYSALIS 221

Query: 955  AYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQ 1134
            AY +SG  +EAI+VFETMK+SGLKPNLVTYNA+IDACGKGG +F +A+EI +EMLRNG+Q
Sbjct: 222  AYAKSGYCNEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLRNGVQ 281

Query: 1135 PDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQI 1314
            PDRITFNSLLAVCS  GLWE AR  F+EM+YRGIDQDI+TYNT LDA C GGQ+D+AF I
Sbjct: 282  PDRITFNSLLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDAACNGGQIDVAFDI 341

Query: 1315 MAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAK 1494
            M+EM AKNI PN VTYST+I G AK GRLD AL+L  EMK AGI LDRVSYNTLL+IYA 
Sbjct: 342  MSEMHAKNILPNQVTYSTVIRGCAKAGRLDRALSLFNEMKCAGITLDRVSYNTLLAIYAS 401

Query: 1495 LGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLT 1674
            LG+FE+ LNV  EMES GIKKD VTYNALL G+GKQG Y +VK++F EMK   LSPNLLT
Sbjct: 402  LGKFEEALNVSKEMESMGIKKDVVTYNALLDGFGKQGMYIKVKQLFAEMKAEKLSPNLLT 461

Query: 1675 YSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMT 1854
            YSTLI VY KG LY +A+EV+KEFK+ GLKADVV YS LIDALCK GLVE +  LL+EMT
Sbjct: 462  YSTLISVYLKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLLNEMT 521

Query: 1855 KEGIRPNVVTYNSIIDAFGRSAT-ACGSNESLIEPSSLIALKDVSESNDEHREDNQIIKI 2031
            KEGI+PNVVTYNSII+AFG SA+  CGS+      +    +  +S+S  E+ E++ I+KI
Sbjct: 522  KEGIQPNVVTYNSIINAFGESASNECGSD------NVTQIVSTISQSKWENTEEDNIVKI 575

Query: 2032 FGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNSFEDA 2211
            F QLAA+KS   K  N  RQ+ILC+L VF KM EL+IKPNVVTFSAILNACSRC+SF++A
Sbjct: 576  FEQLAAQKSASGKKTNAERQDILCILGVFHKMHELQIKPNVVTFSAILNACSRCSSFDEA 635

Query: 2212 SMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALTDMLW 2391
            S+LLEELR FDNQVYGVAHGLLMG+++ VWAQA  LF+EVKQMDS TASAFYNALTDMLW
Sbjct: 636  SLLLEELRIFDNQVYGVAHGLLMGQREGVWAQALSLFNEVKQMDSSTASAFYNALTDMLW 695

Query: 2392 HFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVFEGHE 2571
            HF QK+GAQLVVLEGKR  VWEN WS SCLDLHLMSSGAA AMVHAWLL IRSIVFEGHE
Sbjct: 696  HFDQKQGAQLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVFEGHE 755

Query: 2572 LPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVAAWLK 2751
            LPK++SILTGWGKHSK+ GDG L+RAIE LLT +GAPF+VA CN+GRFIS GAVV AWL+
Sbjct: 756  LPKMLSILTGWGKHSKITGDGALKRAIEGLLTSIGAPFQVAKCNIGRFISTGAVVTAWLR 815

Query: 2752 ESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            ESGTL+VLVL DD +H   + F +I NL+ LTL
Sbjct: 816  ESGTLEVLVLQDDTSHLRATRFGQISNLQQLTL 848


>ref|XP_004240564.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like isoform 1 [Solanum lycopersicum]
          Length = 841

 Score = 1114 bits (2881), Expect = 0.0
 Identities = 583/877 (66%), Positives = 678/877 (77%), Gaps = 8/877 (0%)
 Frame = +1

Query: 244  MASSTP--HCSITGTKPYQ----NHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPH 405
            MASSTP  HC++T +KPYQ    +H  P +              W+ +KVSL  P    H
Sbjct: 1    MASSTPPPHCALTTSKPYQPQTHSHPHPNH-------RNNHQRHWSSQKVSLNPPRNPNH 53

Query: 406  XXXXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGR 585
                                   P F SLS +    KS+ +ADFSGRRSTRFVSKMHFGR
Sbjct: 54   PSQT-------------------PNFLSLSSS----KSDFSADFSGRRSTRFVSKMHFGR 90

Query: 586  QKAQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGEC 762
             K   + RHSS A+EAL+E+IR  N+E GL+ V L F SK+ GSDDY F+ RELGNRGE 
Sbjct: 91   AKISGNGRHSSFAQEALEEAIRCCNNEAGLDQVLLTFGSKLVGSDDYTFLFRELGNRGEW 150

Query: 763  WKAVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYS 942
              A++CF+FAV RER+RNEQGKLAS+MISILGR GKVDLA  VFE  +  GYG+TVYAYS
Sbjct: 151  LAAMRCFQFAVGRERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGSTVYAYS 210

Query: 943  ALISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLR 1122
            ALISAY +SG  +EAI+VFETMK+SGLKPNLVTYNA+IDACGKGG +F +A+EI +EMLR
Sbjct: 211  ALISAYAKSGYCNEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLR 270

Query: 1123 NGIQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDL 1302
            NG+QPDRITFNSLLAVCS  GLWE AR  F+EM+YRGIDQDI+TYNT LD  C GGQ+D+
Sbjct: 271  NGVQPDRITFNSLLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDVACNGGQIDV 330

Query: 1303 AFQIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLS 1482
            AF IM+EM AKNI PN VTYST+I G AK GRLD AL+L  EMK AGI LDRVSYNTLL+
Sbjct: 331  AFDIMSEMHAKNILPNQVTYSTVIRGCAKAGRLDKALSLFNEMKCAGIKLDRVSYNTLLA 390

Query: 1483 IYAKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSP 1662
            IYA LG+FE+ LNV  EME  GIKKD VTYNALL G+GKQG Y +VK++F EMK   LSP
Sbjct: 391  IYASLGKFEEALNVSKEMEGMGIKKDVVTYNALLDGFGKQGMYTKVKQLFAEMKAEKLSP 450

Query: 1663 NLLTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLL 1842
            NLLTYSTLI VY KG LY +A+EV+KEFK+ GLKADVV YS LIDALCK GLVE +  LL
Sbjct: 451  NLLTYSTLISVYLKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLL 510

Query: 1843 DEMTKEGIRPNVVTYNSIIDAFGRSA-TACGSNESLIEPSSLIALKDVSESNDEHREDNQ 2019
            +EMTKEGI+PNVVTYNSII+AFG SA   CGS+      S+      +S+S  E+ E++ 
Sbjct: 511  NEMTKEGIQPNVVTYNSIINAFGESANNECGSDNVTHIVSA------ISQSKWENTEEDN 564

Query: 2020 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2199
            I+KIF QLAA+KS   K  N  RQ++LC+L VF KM EL+IKPNVVTFSAILNACSRC+S
Sbjct: 565  IVKIFEQLAAQKSASGKKTNAERQDMLCILGVFHKMHELQIKPNVVTFSAILNACSRCSS 624

Query: 2200 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2379
            F++AS+LLEELR FDNQVYGVAHGLLMG+++ VW+QA  LF+EVKQMDS TASAFYNALT
Sbjct: 625  FDEASLLLEELRLFDNQVYGVAHGLLMGQREGVWSQALSLFNEVKQMDSSTASAFYNALT 684

Query: 2380 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2559
            DMLWHF QK+GAQLVVLEGKR  VWEN WS SCLDLHLMSSGAA AMVHAWLL IRSIVF
Sbjct: 685  DMLWHFDQKQGAQLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVF 744

Query: 2560 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2739
            EGHELPK++SILTGWGKHSK+ GDG L+RAIE LLT +GAPF++A CN+GRFIS GAVV 
Sbjct: 745  EGHELPKMLSILTGWGKHSKITGDGALKRAIEGLLTSIGAPFQIAKCNIGRFISTGAVVT 804

Query: 2740 AWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2850
            AWL+ESGTL+VLVL DD +H   + FD+I NL+ LTL
Sbjct: 805  AWLRESGTLEVLVLQDDTSHLRATRFDQISNLQQLTL 841


>ref|XP_006444532.1| hypothetical protein CICLE_v10018807mg [Citrus clementina]
            gi|557546794|gb|ESR57772.1| hypothetical protein
            CICLE_v10018807mg [Citrus clementina]
          Length = 820

 Score = 1102 bits (2851), Expect = 0.0
 Identities = 578/798 (72%), Positives = 638/798 (79%), Gaps = 17/798 (2%)
 Frame = +1

Query: 244  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXX----WAHRKVSLTKPSQVPHXX 411
            MAS+ PHCSIT TKPYQNHQ+P N +                W   KVSLTKP   P   
Sbjct: 1    MASTPPHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPR 60

Query: 412  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 591
                                   F SLS  P  +KSEL  DFSGRRSTRFVSKMHFGR K
Sbjct: 61   NAPKPAATSTTVAPNPKP-----FHSLSPLPS-SKSELAPDFSGRRSTRFVSKMHFGRPK 114

Query: 592  AQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 768
              +S+RHS VAEEAL        D+  L ++   FE K+ G+DDY F+LRELGNRGE  K
Sbjct: 115  IAMSTRHSVVAEEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSK 174

Query: 769  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 948
            A+QCF FAV+RE R+N+QGKLASAMISILGRLGKVDLA+ +FE  L  GYGNTVYA+SAL
Sbjct: 175  AIQCFAFAVKREERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSAL 234

Query: 949  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1128
            ISAYGRSG   EAI VF +MK   LKPNLVTYNAVIDACGKGGV+F    EI ++MLRNG
Sbjct: 235  ISAYGRSGYCQEAISVFNSMKRYHLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNG 294

Query: 1129 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1308
            +QPDRITFNSLLAVCSRGGLWEAARN F+EMV+RGIDQDIFTYNTLLDA CKG QMDLAF
Sbjct: 295  VQPDRITFNSLLAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAF 354

Query: 1309 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1488
            +IMAEM AKNISPNVVTYSTMIDGYAK GRLDDALN+  EMKF GIGLDRVSYNT+LSIY
Sbjct: 355  EIMAEMPAKNISPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIY 414

Query: 1489 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1668
            AKLGRFE+ L VC EMESSGI+KDAVTYNALLGGYGKQGKYDEV+R+F++MK   +SPNL
Sbjct: 415  AKLGRFEEALLVCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNL 474

Query: 1669 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1848
            LTYSTLIDVYSKGGLY+EA+++F+EFKQAGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1849 MTKEGIRPNVVTYNSIIDAFGRSATA-CGSNE------SLIEPSSLIAL-----KDVSES 1992
            MTKEGIRPNVVTYNSIIDAFGRSAT  C  ++         E ++L A+     KDV E+
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRSATTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEA 594

Query: 1993 NDEHREDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAI 2172
                R DNQIIK+FGQL AEK+   K +NR RQEILC+L VFQKM +L+IKPNVVTFSAI
Sbjct: 595  G---RTDNQIIKVFGQLVAEKAGQGKKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAI 651

Query: 2173 LNACSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPT 2352
            LNACSRCNSFEDASMLLEELR FDNQVYGVAHGLLMG +DN+W QA  LFDEVK MDS T
Sbjct: 652  LNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSST 711

Query: 2353 ASAFYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAW 2532
            ASAFYNALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAW
Sbjct: 712  ASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAW 771

Query: 2533 LLDIRSIVFEGHELPKLI 2586
            LL+I SIVFEGHELPKL+
Sbjct: 772  LLNIHSIVFEGHELPKLL 789


>ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Capsella rubella]
            gi|482562350|gb|EOA26540.1| hypothetical protein
            CARUB_v10022597mg [Capsella rubella]
          Length = 932

 Score = 1100 bits (2846), Expect = 0.0
 Identities = 555/787 (70%), Positives = 648/787 (82%), Gaps = 12/787 (1%)
 Frame = +1

Query: 490  LSRAP-----PPNKSELTADFSGRRSTRFVSKMHFGRQKAQLSSRHSSVAEEALQESIRI 654
            LS+AP        KS+L++DFSGRRSTRFVSKMHFGR K  +++RHSS AE+ALQ +I  
Sbjct: 125  LSQAPNFAPLQTQKSDLSSDFSGRRSTRFVSKMHFGRPKTAMATRHSSAAEDALQNAIDF 184

Query: 655  N-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQCFEFAVRRERRRNEQGKL 831
            + D +   ++ L FESK+ GSDD  +++RELGNRGEC KAV  +EFAV+RERR+NEQGKL
Sbjct: 185  SGDSEMFHSLMLSFESKLCGSDDCTYIIRELGNRGECDKAVGFYEFAVKRERRKNEQGKL 244

Query: 832  ASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAYGRSGCSDEAIKVFETMK 1011
            ASAMIS LGR GKV +A+ +FE     GYGNTVYA+SALISAYGRSG  +EAI VF +MK
Sbjct: 245  ASAMISTLGRYGKVTIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFSSMK 304

Query: 1012 NSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPDRITFNSLLAVCSRGGLW 1191
            + GL+PNLVTYNAVIDACGKGG+EF Q A+  +EM +NG+QPDRITFNSLLAVCSRGGLW
Sbjct: 305  DHGLRPNLVTYNAVIDACGKGGMEFKQVAKFFDEMQKNGVQPDRITFNSLLAVCSRGGLW 364

Query: 1192 EAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMAEMSAKNISPNVVTYSTM 1371
            EAARN F EM  R I+QD+F+YNTLLDA CKGGQMDLAF+I+A+M AK I PNVV+YST+
Sbjct: 365  EAARNLFDEMSNRRIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPAKRIMPNVVSYSTV 424

Query: 1372 IDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLGRFEDVLNVCNEMESSGI 1551
            IDG+AK GR D+ALNL GEM++ GI LDRVSYNTLLSIY K+GR E+ L++  EM S GI
Sbjct: 425  IDGFAKAGRFDEALNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGI 484

Query: 1552 KKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYSTLIDVYSKGGLYREALE 1731
            KKD VTYNALLGGYGKQGKYDEVK+VF EMK  H+ PNLLTYSTLID YSKGGLY+EA+E
Sbjct: 485  KKDVVTYNALLGGYGKQGKYDEVKKVFAEMKREHVVPNLLTYSTLIDGYSKGGLYKEAME 544

Query: 1732 VFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKEGIRPNVVTYNSIIDAFG 1911
            +F+EFK AGL+ADVVLYSALIDALCKNGLV  AVSL+DEMTKEGI PNVVTYNSIIDAFG
Sbjct: 545  IFREFKSAGLRADVVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFG 604

Query: 1912 RSATACGSNE-SLIEPSSL----IALKDVSESNDEHREDNQIIKIFGQLAAEKSCHAKID 2076
            RSAT   S + S  E ++L    +AL   + S     E N++I++FGQL AE +     D
Sbjct: 605  RSATMERSADYSNGEANNLEVGSLALSSSALSKLTETEGNRVIQLFGQLTAESNNRMTKD 664

Query: 2077 -NRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRWFDNQV 2253
               G QE+ C+LEVF+KM +LEIKPNVVTFSAILNACSRCNSFEDASMLLEELR FDN+V
Sbjct: 665  CKEGMQELSCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKV 724

Query: 2254 YGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALTDMLWHFGQKKGAQLVVLE 2433
            YGV HGLLMG ++NVW QAQ LFD+V +MD  TASAFYNALTDMLWHFGQK+GA+LV LE
Sbjct: 725  YGVVHGLLMGERENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALE 784

Query: 2434 GKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVFEGHELPKLISILTGWGKH 2613
            G+ R VWEN+WSDSCLDLHLMSSGAARAMVHAWLL+IRSIV+EGHELPK++SILTGWGKH
Sbjct: 785  GRSRQVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKH 844

Query: 2614 SKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVAAWLKESGTLKVLVLHDDR 2793
            SKVVGDG LRRA+E LL GM APF ++ CN+GRFIS G+VVA WL+ES TLK+L+LHD +
Sbjct: 845  SKVVGDGALRRAVEVLLRGMDAPFHLSKCNMGRFISSGSVVATWLRESATLKLLILHDHK 904

Query: 2794 AHQETST 2814
                 ST
Sbjct: 905  TTTTAST 911


>ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like isoform 2 [Solanum lycopersicum]
          Length = 829

 Score = 1100 bits (2844), Expect = 0.0
 Identities = 575/860 (66%), Positives = 667/860 (77%), Gaps = 8/860 (0%)
 Frame = +1

Query: 244  MASSTP--HCSITGTKPYQ----NHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPH 405
            MASSTP  HC++T +KPYQ    +H  P +              W+ +KVSL  P    H
Sbjct: 1    MASSTPPPHCALTTSKPYQPQTHSHPHPNH-------RNNHQRHWSSQKVSLNPPRNPNH 53

Query: 406  XXXXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGR 585
                                   P F SLS +    KS+ +ADFSGRRSTRFVSKMHFGR
Sbjct: 54   PSQT-------------------PNFLSLSSS----KSDFSADFSGRRSTRFVSKMHFGR 90

Query: 586  QKAQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGEC 762
             K   + RHSS A+EAL+E+IR  N+E GL+ V L F SK+ GSDDY F+ RELGNRGE 
Sbjct: 91   AKISGNGRHSSFAQEALEEAIRCCNNEAGLDQVLLTFGSKLVGSDDYTFLFRELGNRGEW 150

Query: 763  WKAVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYS 942
              A++CF+FAV RER+RNEQGKLAS+MISILGR GKVDLA  VFE  +  GYG+TVYAYS
Sbjct: 151  LAAMRCFQFAVGRERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGSTVYAYS 210

Query: 943  ALISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLR 1122
            ALISAY +SG  +EAI+VFETMK+SGLKPNLVTYNA+IDACGKGG +F +A+EI +EMLR
Sbjct: 211  ALISAYAKSGYCNEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLR 270

Query: 1123 NGIQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDL 1302
            NG+QPDRITFNSLLAVCS  GLWE AR  F+EM+YRGIDQDI+TYNT LD  C GGQ+D+
Sbjct: 271  NGVQPDRITFNSLLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDVACNGGQIDV 330

Query: 1303 AFQIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLS 1482
            AF IM+EM AKNI PN VTYST+I G AK GRLD AL+L  EMK AGI LDRVSYNTLL+
Sbjct: 331  AFDIMSEMHAKNILPNQVTYSTVIRGCAKAGRLDKALSLFNEMKCAGIKLDRVSYNTLLA 390

Query: 1483 IYAKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSP 1662
            IYA LG+FE+ LNV  EME  GIKKD VTYNALL G+GKQG Y +VK++F EMK   LSP
Sbjct: 391  IYASLGKFEEALNVSKEMEGMGIKKDVVTYNALLDGFGKQGMYTKVKQLFAEMKAEKLSP 450

Query: 1663 NLLTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLL 1842
            NLLTYSTLI VY KG LY +A+EV+KEFK+ GLKADVV YS LIDALCK GLVE +  LL
Sbjct: 451  NLLTYSTLISVYLKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLL 510

Query: 1843 DEMTKEGIRPNVVTYNSIIDAFGRSA-TACGSNESLIEPSSLIALKDVSESNDEHREDNQ 2019
            +EMTKEGI+PNVVTYNSII+AFG SA   CGS+      S+      +S+S  E+ E++ 
Sbjct: 511  NEMTKEGIQPNVVTYNSIINAFGESANNECGSDNVTHIVSA------ISQSKWENTEEDN 564

Query: 2020 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2199
            I+KIF QLAA+KS   K  N  RQ++LC+L VF KM EL+IKPNVVTFSAILNACSRC+S
Sbjct: 565  IVKIFEQLAAQKSASGKKTNAERQDMLCILGVFHKMHELQIKPNVVTFSAILNACSRCSS 624

Query: 2200 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2379
            F++AS+LLEELR FDNQVYGVAHGLLMG+++ VW+QA  LF+EVKQMDS TASAFYNALT
Sbjct: 625  FDEASLLLEELRLFDNQVYGVAHGLLMGQREGVWSQALSLFNEVKQMDSSTASAFYNALT 684

Query: 2380 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2559
            DMLWHF QK+GAQLVVLEGKR  VWEN WS SCLDLHLMSSGAA AMVHAWLL IRSIVF
Sbjct: 685  DMLWHFDQKQGAQLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVF 744

Query: 2560 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2739
            EGHELPK++SILTGWGKHSK+ GDG L+RAIE LLT +GAPF++A CN+GRFIS GAVV 
Sbjct: 745  EGHELPKMLSILTGWGKHSKITGDGALKRAIEGLLTSIGAPFQIAKCNIGRFISTGAVVT 804

Query: 2740 AWLKESGTLKVLVLHDDRAH 2799
            AWL+ESGTL+VLVL DD +H
Sbjct: 805  AWLRESGTLEVLVLQDDTSH 824


>ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidopsis thaliana]
            gi|75206083|sp|Q9SIC9.1|PP178_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g31400, chloroplastic; Flags: Precursor
            gi|4589961|gb|AAD26479.1| unknown protein [Arabidopsis
            thaliana] gi|330253448|gb|AEC08542.1| genomes uncoupled 1
            protein [Arabidopsis thaliana]
          Length = 918

 Score = 1097 bits (2836), Expect = 0.0
 Identities = 554/783 (70%), Positives = 645/783 (82%), Gaps = 13/783 (1%)
 Frame = +1

Query: 505  PPN-------KSELTADFSGRRSTRFVSKMHFGRQKAQLSSRHSSVAEEALQESIRIN-D 660
            PPN       KS+L++DFSGRRSTRFVSKMHFGRQK  +++RHSS AE+ALQ +I  + D
Sbjct: 119  PPNFSPLQTPKSDLSSDFSGRRSTRFVSKMHFGRQKTTMATRHSSAAEDALQNAIDFSGD 178

Query: 661  EKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQCFEFAVRRERRRNEQGKLASA 840
            ++   ++ L FESK+ GSDD  +++RELGNR EC KAV  +EFAV+RERR+NEQGKLASA
Sbjct: 179  DEMFHSLMLSFESKLCGSDDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASA 238

Query: 841  MISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAYGRSGCSDEAIKVFETMKNSG 1020
            MIS LGR GKV +A+ +FE     GYGNTVYA+SALISAYGRSG  +EAI VF +MK  G
Sbjct: 239  MISTLGRYGKVTIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYG 298

Query: 1021 LKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPDRITFNSLLAVCSRGGLWEAA 1200
            L+PNLVTYNAVIDACGKGG+EF Q A+  +EM RNG+QPDRITFNSLLAVCSRGGLWEAA
Sbjct: 299  LRPNLVTYNAVIDACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAA 358

Query: 1201 RNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMAEMSAKNISPNVVTYSTMIDG 1380
            RN F EM  R I+QD+F+YNTLLDA CKGGQMDLAF+I+A+M  K I PNVV+YST+IDG
Sbjct: 359  RNLFDEMTNRRIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDG 418

Query: 1381 YAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLGRFEDVLNVCNEMESSGIKKD 1560
            +AK GR D+ALNL GEM++ GI LDRVSYNTLLSIY K+GR E+ L++  EM S GIKKD
Sbjct: 419  FAKAGRFDEALNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKD 478

Query: 1561 AVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYSTLIDVYSKGGLYREALEVFK 1740
             VTYNALLGGYGKQGKYDEVK+VF EMK  H+ PNLLTYSTLID YSKGGLY+EA+E+F+
Sbjct: 479  VVTYNALLGGYGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFR 538

Query: 1741 EFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSA 1920
            EFK AGL+ADVVLYSALIDALCKNGLV  AVSL+DEMTKEGI PNVVTYNSIIDAFGRSA
Sbjct: 539  EFKSAGLRADVVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSA 598

Query: 1921 T----ACGSNESLIEPSSLIALKDVSESNDEHREDNQIIKIFGQLAAEKSCHAKID-NRG 2085
            T    A  SN   + P S  AL  ++E+     E N++I++FGQL  E +     D   G
Sbjct: 599  TMDRSADYSNGGSL-PFSSSALSALTET-----EGNRVIQLFGQLTTESNNRTTKDCEEG 652

Query: 2086 RQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRWFDNQVYGVA 2265
             QE+ C+LEVF+KM +LEIKPNVVTFSAILNACSRCNSFEDASMLLEELR FDN+VYGV 
Sbjct: 653  MQELSCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVV 712

Query: 2266 HGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALTDMLWHFGQKKGAQLVVLEGKRR 2445
            HGLLMG+++NVW QAQ LFD+V +MD  TASAFYNALTDMLWHFGQK+GA+LV LEG+ R
Sbjct: 713  HGLLMGQRENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSR 772

Query: 2446 HVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVFEGHELPKLISILTGWGKHSKVV 2625
             VWEN+WSDSCLDLHLMSSGAARAMVHAWLL+IRSIV+EGHELPK++SILTGWGKHSKVV
Sbjct: 773  QVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVV 832

Query: 2626 GDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVAAWLKESGTLKVLVLHDDRAHQE 2805
            GDG LRRA+E LL GM APF ++ CN+GRF S G+VVA WL+ES TLK+L+LHD   H  
Sbjct: 833  GDGALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHD---HIT 889

Query: 2806 TST 2814
            T+T
Sbjct: 890  TAT 892


Top