BLASTX nr result

ID: Paeonia25_contig00004769 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00004769
         (2909 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containi...  1314   0.0  
ref|XP_007051141.1| S uncoupled 1 [Theobroma cacao] gi|508703402...  1254   0.0  
ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containi...  1244   0.0  
ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citr...  1243   0.0  
ref|XP_002515260.1| pentatricopeptide repeat-containing protein,...  1238   0.0  
ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Popu...  1227   0.0  
ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Popu...  1226   0.0  
gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]    1219   0.0  
ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containi...  1209   0.0  
ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containi...  1208   0.0  
ref|XP_007221553.1| hypothetical protein PRUPE_ppa001263mg [Prun...  1199   0.0  
ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Popu...  1198   0.0  
ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containi...  1187   0.0  
ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutr...  1150   0.0  
ref|XP_006355855.1| PREDICTED: pentatricopeptide repeat-containi...  1118   0.0  
ref|XP_004240564.1| PREDICTED: pentatricopeptide repeat-containi...  1114   0.0  
ref|XP_006444532.1| hypothetical protein CICLE_v10018807mg [Citr...  1102   0.0  
ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Caps...  1100   0.0  
ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containi...  1100   0.0  
ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidop...  1097   0.0  

>ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic [Vitis vinifera]
          Length = 867

 Score = 1314 bits (3400), Expect = 0.0
 Identities = 677/878 (77%), Positives = 742/878 (84%), Gaps = 9/878 (1%)
 Frame = +3

Query: 132  MASSTP-HCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXXX 308
            MAS TP HCSIT  KPYQN  +PQNP             W+  KVSLT P   P      
Sbjct: 1    MASPTPPHCSITAAKPYQNLHYPQNPTKNHHNNHH----WSSHKVSLTNPLPSPRNAAKP 56

Query: 309  XXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQL 488
                                FPSLS  PP +KSELTADFSGRRSTRFVSKMHFGR K   
Sbjct: 57   GAASPATATNRNS------NFPSLSPLPP-SKSELTADFSGRRSTRFVSKMHFGRPKTAA 109

Query: 489  SSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQ 665
            ++RH+S AEEAL+ +IR  +D+KG+++V L FES++ GSDDY F+LRELGNRGE  KA++
Sbjct: 110  AARHTSTAEEALRHAIRFASDDKGIDSVLLNFESRLCGSDDYTFLLRELGNRGEWAKAIR 169

Query: 666  CFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISA 845
            CFEFAVRRE+RRNEQGKLASAMISILGRLG+V+LA+ VFE  L  GYGNTVYA+SALISA
Sbjct: 170  CFEFAVRREQRRNEQGKLASAMISILGRLGQVELAKNVFETALNEGYGNTVYAFSALISA 229

Query: 846  YGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQP 1025
            YGRSG  DEAIKVFETMK+SGLKPNLVTYNAVIDACGKGGV+FN+AAEI +EMLRNG+QP
Sbjct: 230  YGRSGYCDEAIKVFETMKSSGLKPNLVTYNAVIDACGKGGVDFNRAAEIFDEMLRNGVQP 289

Query: 1026 DRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIM 1205
            DRITFNSLLAVC RGGLWEAARN FSEM+YRGI+QDIFTYNTLLDA CKGGQMDLAFQIM
Sbjct: 290  DRITFNSLLAVCGRGGLWEAARNLFSEMLYRGIEQDIFTYNTLLDAVCKGGQMDLAFQIM 349

Query: 1206 AEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKL 1385
            +EM  K+I PNVVTYST+IDGYAK GRLD+ALNL  EMKFA IGLDRVSYNTLLSIYAKL
Sbjct: 350  SEMPRKHIMPNVVTYSTVIDGYAKAGRLDEALNLFNEMKFASIGLDRVSYNTLLSIYAKL 409

Query: 1386 GRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTY 1565
            GRFE+ LNVC EMESSGIKKDAVTYNALLGGYGKQGKY+EVKRVF+EMK   + PNLLTY
Sbjct: 410  GRFEEALNVCKEMESSGIKKDAVTYNALLGGYGKQGKYEEVKRVFEEMKAERIFPNLLTY 469

Query: 1566 STLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTK 1745
            STLIDVYSKGGLY+EA+EVF+EFK+AGLKADVVLYSALIDALCKNGLVE AVS LDEMTK
Sbjct: 470  STLIDVYSKGGLYQEAMEVFREFKKAGLKADVVLYSALIDALCKNGLVESAVSFLDEMTK 529

Query: 1746 EGIRPNVVTYNSIIDAFGRSATA-------CGSNESLIEPSSLIALKDVSESNDEHREDN 1904
            EGIRPNVVTYNSIIDAFGRS +A         +N S +  SSL  ++D +ES    +EDN
Sbjct: 530  EGIRPNVVTYNSIIDAFGRSGSAECVIDPPYETNVSKMSSSSLKVVEDATESEVGDKEDN 589

Query: 1905 QIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCN 2084
            QIIKIFGQLAAEK+CHAK +NRGRQEILC+L VF KM EL+IKPNVVTFSAILNACSRCN
Sbjct: 590  QIIKIFGQLAAEKTCHAKKENRGRQEILCILAVFHKMHELDIKPNVVTFSAILNACSRCN 649

Query: 2085 SFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNAL 2264
            SFEDASMLLEELR FDNQVYGVAHGLLMG  DNVW QAQ LFDEVKQMDS TASAFYNAL
Sbjct: 650  SFEDASMLLEELRLFDNQVYGVAHGLLMGYGDNVWVQAQSLFDEVKQMDSSTASAFYNAL 709

Query: 2265 TDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIV 2444
            TDMLWHFGQ++GAQLVVLEGKRRHVWENMWS+SCLDLHLMSSGAARAMVHAWLL+IRSIV
Sbjct: 710  TDMLWHFGQRRGAQLVVLEGKRRHVWENMWSNSCLDLHLMSSGAARAMVHAWLLNIRSIV 769

Query: 2445 FEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVV 2624
            FEGHELP+L+SILTGWGKHSKVVGDG LRRAIEALLTGMGAPFRVA CNLGRFIS GAVV
Sbjct: 770  FEGHELPQLLSILTGWGKHSKVVGDGALRRAIEALLTGMGAPFRVAKCNLGRFISTGAVV 829

Query: 2625 AAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            AAWL+ESGTLKVLVLHDDR + + +   +I NL+TL L
Sbjct: 830  AAWLRESGTLKVLVLHDDRTNPDRARCSQISNLQTLPL 867


>ref|XP_007051141.1| S uncoupled 1 [Theobroma cacao] gi|508703402|gb|EOX95298.1| S
            uncoupled 1 [Theobroma cacao]
          Length = 866

 Score = 1254 bits (3244), Expect = 0.0
 Identities = 645/877 (73%), Positives = 718/877 (81%), Gaps = 8/877 (0%)
 Frame = +3

Query: 132  MASSTPHCSITGT-KPYQNHQFPQNPIXXXXXXXXXXXXWAH-RKVSLTKPSQVPHXXXX 305
            MAS+ PHCSIT T KPYQNHQ+PQN +                +K SL+KP   P     
Sbjct: 1    MASTPPHCSITATTKPYQNHQYPQNHLKNHRNHQNNHRNQTRPQKFSLSKPPPSP----- 55

Query: 306  XXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQ 485
                                T   LS++P P  S L  DFSGRRSTRFVSKMH GR K  
Sbjct: 56   ----CNAAKPATTAAAAAASTRSPLSQSPVPFPS-LAPDFSGRRSTRFVSKMHLGRPKTS 110

Query: 486  LSSRHSSVAEEALQESIRINDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQ 665
             ++RH+S+AEE LQ ++  N   GLE V + FESK+ GSDDY F+LRELGNRGE  KA++
Sbjct: 111  TNTRHTSIAEEVLQLALH-NGHSGLERVLVSFESKLCGSDDYTFLLRELGNRGEYEKAIK 169

Query: 666  CFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISA 845
            CF+FAVRRERR+ EQGKLASAMISILGRLGKV+LA+G+FE  L  GYGNTVYA+SALISA
Sbjct: 170  CFQFAVRRERRKTEQGKLASAMISILGRLGKVELAKGIFETALTEGYGNTVYAFSALISA 229

Query: 846  YGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQP 1025
            +GRSG SDEAIKVF++MKN+GLKPNLVTYNAVIDACGKGGVEF +  EI +EMLR+G+QP
Sbjct: 230  FGRSGYSDEAIKVFDSMKNNGLKPNLVTYNAVIDACGKGGVEFKRVVEIFDEMLRSGVQP 289

Query: 1026 DRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIM 1205
            DRITFNSLLAVCSRGGLWEAARN FSEMV+RGIDQDIFTYNTLLDA CKGGQMDLAF+IM
Sbjct: 290  DRITFNSLLAVCSRGGLWEAARNLFSEMVHRGIDQDIFTYNTLLDAVCKGGQMDLAFEIM 349

Query: 1206 AEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKL 1385
            AEM  KNI PNVVTYSTMIDGYAK GR DDALNL  EMKF GIGLDRVSYNT+LSIYAKL
Sbjct: 350  AEMPTKNILPNVVTYSTMIDGYAKAGRFDDALNLFNEMKFLGIGLDRVSYNTVLSIYAKL 409

Query: 1386 GRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTY 1565
            GRFE+ L++C EME SGI+KD VTYNALLGGYGKQGKYDEV+R+F+EMKT+ +SPNLLTY
Sbjct: 410  GRFEEALDICREMEGSGIRKDVVTYNALLGGYGKQGKYDEVRRLFEEMKTQKVSPNLLTY 469

Query: 1566 STLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTK 1745
            ST+IDVYSKGGLY EA++VF+EFK+ GLKADVVLYSALIDALCKNGLVE AVSLLDEMTK
Sbjct: 470  STVIDVYSKGGLYEEAMDVFREFKRVGLKADVVLYSALIDALCKNGLVESAVSLLDEMTK 529

Query: 1746 EGIRPNVVTYNSIIDAFGRSAT------ACGSNESLIEPSSLIALKDVSESNDEHREDNQ 1907
            EGIRPNVVTYNSIIDAFGRSAT      A G   +L   SS + +    E      EDNQ
Sbjct: 530  EGIRPNVVTYNSIIDAFGRSATSECAFDAGGEISALQTESSSLVIGHSIEGKARDGEDNQ 589

Query: 1908 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2087
            +IK FGQLAAEK   AK D RG+QEILC+L VFQKM ELEIKPNVVTFSAILNACSRC+S
Sbjct: 590  VIKFFGQLAAEKGGQAKKDCRGKQEILCILGVFQKMHELEIKPNVVTFSAILNACSRCDS 649

Query: 2088 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2267
            FEDASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFYNALT
Sbjct: 650  FEDASMLLEELRLFDNQVYGVAHGLLMGYRENVWIQAQSLFDEVKLMDSSTASAFYNALT 709

Query: 2268 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2447
            DMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAWLL+IRSI+F
Sbjct: 710  DMLWHFGQKRGAQLVVLEGKRRQVWENVWSNSCLDLHLMSSGAARAMVHAWLLNIRSIIF 769

Query: 2448 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2627
            EGHELPKL+SILTGWGKHSKVVGDG LRR +E+L TGMGAPFR+A CNLGRF+S G VV 
Sbjct: 770  EGHELPKLLSILTGWGKHSKVVGDGALRRTVESLFTGMGAPFRLAKCNLGRFVSTGPVVT 829

Query: 2628 AWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            AWL+ESGTLK+LVLHDDR   E + F +I NL+TLTL
Sbjct: 830  AWLRESGTLKLLVLHDDRTQPENTGFGQISNLQTLTL 866


>ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Citrus sinensis]
          Length = 877

 Score = 1244 bits (3219), Expect = 0.0
 Identities = 647/886 (73%), Positives = 715/886 (80%), Gaps = 17/886 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXX----WAHRKVSLTKPSQVPHXX 299
            MAS+ PHCSIT TKPYQNHQ+P N +                W   KVSLTKP   P   
Sbjct: 1    MASTPPHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPR 60

Query: 300  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 479
                                   F SLS  P  +KSEL  DFSGRRSTRFVSKMHFGR K
Sbjct: 61   NAPKPAATSTTVAPNPKP-----FHSLSPLPS-SKSELAPDFSGRRSTRFVSKMHFGRPK 114

Query: 480  AQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 656
              +S+RHS VAEEAL        D+  L ++   FE K+ G+DDY F+LRELGNRGE  K
Sbjct: 115  IAMSTRHSVVAEEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSK 174

Query: 657  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 836
            A+QCF FAV+RE R+N+QGKLASAMISILGRLGKVDLA+ +FE  L  GYGNTVYA+SAL
Sbjct: 175  AIQCFAFAVKREERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSAL 234

Query: 837  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1016
            ISAYGRSG   EAI VF +MK   LKPNLVTYNAVIDACGKGGV+F    EI ++MLRNG
Sbjct: 235  ISAYGRSGYCQEAISVFNSMKRYNLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNG 294

Query: 1017 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1196
            +QPDRITFNSLLAVCSRGGLWEAARN F+EMV+RGIDQDIFTYNTLLDA CKG QMDLAF
Sbjct: 295  VQPDRITFNSLLAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAF 354

Query: 1197 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1376
            +IMAEM AKNISPNVVTYSTMIDGYAK GRLDDALN+  EMKF GIGLDRVSYNT+LSIY
Sbjct: 355  EIMAEMPAKNISPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIY 414

Query: 1377 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1556
            AKLGRFE+ L VC EMESSGI+KDAVTYNALLGGYGKQGKYDEV+R+F++MK   +SPNL
Sbjct: 415  AKLGRFEEALLVCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNL 474

Query: 1557 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1736
            LTYSTLIDVYSKGGLY+EA+++F+EFKQAGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1737 MTKEGIRPNVVTYNSIIDAFGRSATA-CGSNE------SLIEPSSLIAL-----KDVSES 1880
            MTKEGIRPNVVTYNSIIDAFGRSAT  C  ++         E ++L A+     KDV E+
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRSATTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEA 594

Query: 1881 NDEHREDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAI 2060
                R DNQIIK+FGQL AEK+   K +NR RQEILC+L VFQKM +L+IKPNVVTFSAI
Sbjct: 595  G---RTDNQIIKVFGQLVAEKAGQGKKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAI 651

Query: 2061 LNACSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPT 2240
            LNACSRCNSFEDASMLLEELR FDNQVYGVAHGLLMG +DN+W QA  LFDEVK MDS T
Sbjct: 652  LNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSST 711

Query: 2241 ASAFYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAW 2420
            ASAFYNALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAW
Sbjct: 712  ASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAW 771

Query: 2421 LLDIRSIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGR 2600
            LL+I SIVFEGHELPKL+SILTGWGKHSKVVGDG LRRA+E LLTGMGAPF VANCNLGR
Sbjct: 772  LLNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGALRRAVEVLLTGMGAPFWVANCNLGR 831

Query: 2601 FISPGAVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            FIS G +VA+WL+ESGTLKVLVLHDDR H E + FD + N++TLTL
Sbjct: 832  FISTGPMVASWLRESGTLKVLVLHDDRTHSENAGFDEMLNMQTLTL 877


>ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citrus clementina]
            gi|557546795|gb|ESR57773.1| hypothetical protein
            CICLE_v10018807mg [Citrus clementina]
          Length = 877

 Score = 1243 bits (3217), Expect = 0.0
 Identities = 647/886 (73%), Positives = 715/886 (80%), Gaps = 17/886 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXX----WAHRKVSLTKPSQVPHXX 299
            MAS+ PHCSIT TKPYQNHQ+P N +                W   KVSLTKP   P   
Sbjct: 1    MASTPPHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPR 60

Query: 300  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 479
                                   F SLS  P  +KSEL  DFSGRRSTRFVSKMHFGR K
Sbjct: 61   NAPKPAATSTTVAPNPKP-----FHSLSPLPS-SKSELAPDFSGRRSTRFVSKMHFGRPK 114

Query: 480  AQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 656
              +S+RHS VAEEAL        D+  L ++   FE K+ G+DDY F+LRELGNRGE  K
Sbjct: 115  IAMSTRHSVVAEEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSK 174

Query: 657  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 836
            A+QCF FAV+RE R+N+QGKLASAMISILGRLGKVDLA+ +FE  L  GYGNTVYA+SAL
Sbjct: 175  AIQCFAFAVKREERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSAL 234

Query: 837  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1016
            ISAYGRSG   EAI VF +MK   LKPNLVTYNAVIDACGKGGV+F    EI ++MLRNG
Sbjct: 235  ISAYGRSGYCQEAISVFNSMKRYHLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNG 294

Query: 1017 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1196
            +QPDRITFNSLLAVCSRGGLWEAARN F+EMV+RGIDQDIFTYNTLLDA CKG QMDLAF
Sbjct: 295  VQPDRITFNSLLAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAF 354

Query: 1197 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1376
            +IMAEM AKNISPNVVTYSTMIDGYAK GRLDDALN+  EMKF GIGLDRVSYNT+LSIY
Sbjct: 355  EIMAEMPAKNISPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIY 414

Query: 1377 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1556
            AKLGRFE+ L VC EMESSGI+KDAVTYNALLGGYGKQGKYDEV+R+F++MK   +SPNL
Sbjct: 415  AKLGRFEEALLVCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNL 474

Query: 1557 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1736
            LTYSTLIDVYSKGGLY+EA+++F+EFKQAGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1737 MTKEGIRPNVVTYNSIIDAFGRSATA-CGSNE------SLIEPSSLIAL-----KDVSES 1880
            MTKEGIRPNVVTYNSIIDAFGRSAT  C  ++         E ++L A+     KDV E+
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRSATTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEA 594

Query: 1881 NDEHREDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAI 2060
                R DNQIIK+FGQL AEK+   K +NR RQEILC+L VFQKM +L+IKPNVVTFSAI
Sbjct: 595  G---RTDNQIIKVFGQLVAEKAGQGKKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAI 651

Query: 2061 LNACSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPT 2240
            LNACSRCNSFEDASMLLEELR FDNQVYGVAHGLLMG +DN+W QA  LFDEVK MDS T
Sbjct: 652  LNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSST 711

Query: 2241 ASAFYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAW 2420
            ASAFYNALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAW
Sbjct: 712  ASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAW 771

Query: 2421 LLDIRSIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGR 2600
            LL+I SIVFEGHELPKL+SILTGWGKHSKVVGDG LRRA+E LLTGMGAPF VANCNLGR
Sbjct: 772  LLNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGALRRAVEVLLTGMGAPFWVANCNLGR 831

Query: 2601 FISPGAVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            FIS G +VA+WL+ESGTLKVLVLHDDR H E + FD + N++TLTL
Sbjct: 832  FISTGPMVASWLRESGTLKVLVLHDDRTHSENAGFDEMLNMQTLTL 877


>ref|XP_002515260.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545740|gb|EEF47244.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 878

 Score = 1238 bits (3204), Expect = 0.0
 Identities = 640/862 (74%), Positives = 717/862 (83%), Gaps = 8/862 (0%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXXXX 311
            MAS+ PHCSIT TKPYQNHQ+PQN +            W ++KVSLTKP   P       
Sbjct: 1    MASTPPHCSITATKPYQNHQYPQNHLKNHRQTHHHR--WTNQKVSLTKPPLAPSPCNAPK 58

Query: 312  XXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQLS 491
                             PTF SLS      KS+L+ADFSGRRSTRFVSK+HFGR K  ++
Sbjct: 59   AAAAAAAATTTHHTPN-PTFHSLSPLQS-QKSDLSADFSGRRSTRFVSKLHFGRPKTNMN 116

Query: 492  SRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQC 668
             RH+SVA EALQ+ I+   D+K LENV L FES++ G DDY F+LRELGNRG+  KAV+C
Sbjct: 117  -RHTSVALEALQQVIQYGKDDKALENVLLNFESRLCGPDDYTFLLRELGNRGDSAKAVRC 175

Query: 669  FEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAY 848
            FEFAVRRE  +NEQGKLASAMIS LGRLGKV+LA+ VF+  LK GYG TVYA+SALISAY
Sbjct: 176  FEFAVRRESGKNEQGKLASAMISTLGRLGKVELAKAVFDTALKEGYGKTVYAFSALISAY 235

Query: 849  GRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPD 1028
            GRSG  +EAIKVF++MK++GL PNLVTYNAVIDACGKGGVEF +  EI + ML NG+QPD
Sbjct: 236  GRSGYCNEAIKVFDSMKSNGLMPNLVTYNAVIDACGKGGVEFKKVVEIFDGMLSNGVQPD 295

Query: 1029 RITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMA 1208
            RITFNSLLAVCSRGGLWEAAR  FS MV +GIDQDIFTYNTLLDA CKGGQMDLAF+IM+
Sbjct: 296  RITFNSLLAVCSRGGLWEAARRLFSAMVDKGIDQDIFTYNTLLDAVCKGGQMDLAFEIMS 355

Query: 1209 EMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLG 1388
            EM  KNI PNVVTYSTMIDGYAKVGRLDDALN+  EMKF G+GLDRVSYNTLLS+YAKLG
Sbjct: 356  EMPTKNILPNVVTYSTMIDGYAKVGRLDDALNMFNEMKFLGVGLDRVSYNTLLSVYAKLG 415

Query: 1389 RFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYS 1568
            RFE  L+VC EME++GI+KD VTYNALL GYGKQ +YDEV+RVF+EMK   +SPNLLTYS
Sbjct: 416  RFEQALDVCKEMENAGIRKDVVTYNALLAGYGKQYRYDEVRRVFEEMKRGRVSPNLLTYS 475

Query: 1569 TLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKE 1748
            TLIDVYSKGGLY+EA+EVF+EFKQAGLKADVVLYSALIDALCKNGLVE +V+LLDEMTKE
Sbjct: 476  TLIDVYSKGGLYKEAMEVFREFKQAGLKADVVLYSALIDALCKNGLVESSVTLLDEMTKE 535

Query: 1749 GIRPNVVTYNSIIDAFGRSATA-CGSNES------LIEPSSLIALKDVSESNDEHREDNQ 1907
            GIRPNVVTYNSIIDAFGRSA+A C  ++S       +E  S I +++  ES    +EDN+
Sbjct: 536  GIRPNVVTYNSIIDAFGRSASAQCVVDDSGETTALQVESLSSIVVQEAIESQAADKEDNR 595

Query: 1908 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2087
            II+IFG+LAAEK+C AK  N G+QEILC+L VFQKM EL+IKPNVVTFSAILNACSRC+S
Sbjct: 596  IIEIFGKLAAEKACEAK--NSGKQEILCILGVFQKMHELKIKPNVVTFSAILNACSRCDS 653

Query: 2088 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2267
            FEDASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFYNALT
Sbjct: 654  FEDASMLLEELRLFDNQVYGVAHGLLMGYRENVWLQAQSLFDEVKLMDSSTASAFYNALT 713

Query: 2268 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2447
            DMLWHFGQK+GAQLVVLEGKRR VWEN+WSDSCLDLHLMSSGAARAMVHAWLL+IRSIVF
Sbjct: 714  DMLWHFGQKRGAQLVVLEGKRRQVWENIWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVF 773

Query: 2448 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2627
            EGHELPKL+SILTGWGKHSKVVGD  LRRA+EALL GMGAPFR+A CNLGRFIS G+VVA
Sbjct: 774  EGHELPKLLSILTGWGKHSKVVGDSALRRAVEALLIGMGAPFRLAKCNLGRFISTGSVVA 833

Query: 2628 AWLKESGTLKVLVLHDDRAHQE 2693
            AWLKESGTL+VLVLHDDR H E
Sbjct: 834  AWLKESGTLEVLVLHDDRTHPE 855


>ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Populus trichocarpa]
            gi|550323986|gb|EEE99285.2| hypothetical protein
            POPTR_0014s11380g [Populus trichocarpa]
          Length = 875

 Score = 1227 bits (3175), Expect = 0.0
 Identities = 641/881 (72%), Positives = 716/881 (81%), Gaps = 12/881 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGT-KPYQNHQFPQNPIXXXXXXXXXXXX--W-AHRKVSLTKPSQVPHXX 299
            MAS+ PHCSITGT KPY N+ +P +                W A+++VSLTKP   P   
Sbjct: 1    MASTPPHCSITGTTKPYHNNPYPHSHFKNHRQTHHQNPHQRWTANQRVSLTKPPLPPSSR 60

Query: 300  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 479
                                 PTFPSL       KSEL +DFSGRRSTRFVSK++FGR +
Sbjct: 61   NAPKPPATTTTTTTTHHPQIHPTFPSLQSP----KSELASDFSGRRSTRFVSKLNFGRPR 116

Query: 480  AQLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 656
              + +RH+SVAEEALQ  I    DE  LENV L FES++SGSDDYIF+LRELGNRG+C K
Sbjct: 117  TTMGTRHTSVAEEALQNVIEYGKDEGALENVLLNFESRLSGSDDYIFLLRELGNRGDCKK 176

Query: 657  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 836
            A+ CFEFAV+RER++NEQGKLASAMIS LGRLGKV++A+ VFE  L  GYGNTVYA+SA+
Sbjct: 177  AICCFEFAVKRERKKNEQGKLASAMISTLGRLGKVEIAKSVFEAALIEGYGNTVYAFSAI 236

Query: 837  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1016
            ISAYGRSG  DEAIKVF++MK+ GLKPNLVTYNAVIDACGKGGVEF +  EI +EMLRNG
Sbjct: 237  ISAYGRSGYCDEAIKVFDSMKHYGLKPNLVTYNAVIDACGKGGVEFKRVVEIFDEMLRNG 296

Query: 1017 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1196
            +QPDRITFNSLLAVCSRGGLWEAAR+  SEM+ RGIDQDIFTYNTLLDA CKGGQMD+AF
Sbjct: 297  VQPDRITFNSLLAVCSRGGLWEAARSLSSEMLNRGIDQDIFTYNTLLDAVCKGGQMDMAF 356

Query: 1197 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1376
            +IM+EM AKNI PNVVTYSTMIDGYAK GR DDALNL  EMKF  I LDRVSYNTLLSIY
Sbjct: 357  EIMSEMPAKNILPNVVTYSTMIDGYAKAGRFDDALNLFNEMKFLCISLDRVSYNTLLSIY 416

Query: 1377 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1556
            AKLGRF++ L+VC EME+ GI+KD VTYNALLGGYGKQ KYDEV+RVF EMK   +SPNL
Sbjct: 417  AKLGRFQEALDVCREMENCGIRKDVVTYNALLGGYGKQCKYDEVRRVFGEMKAGRVSPNL 476

Query: 1557 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1736
            LTYSTLIDVYSKGGLYREA++VF+EFK+AGLKADVVLYSA+IDALCKNGLVE AVSLLDE
Sbjct: 477  LTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADVVLYSAVIDALCKNGLVESAVSLLDE 536

Query: 1737 MTKEGIRPNVVTYNSIIDAFGRSATA-------CGSNESLIEPSSLIALKDVSESNDEHR 1895
            MTKEGIRPNVVTYNSIIDAFGRSA           +++  IE  S   +++ ++S    R
Sbjct: 537  MTKEGIRPNVVTYNSIIDAFGRSAITESVVDDNVQTSQLQIESLSSGVVEEATKSLLADR 596

Query: 1896 EDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACS 2075
            E N+IIKIFGQLA EK+  AK  N   QE++C+L VF KM ELEIKPNVVTFSAILNACS
Sbjct: 597  EGNRIIKIFGQLAVEKAGQAK--NCSGQEMMCILAVFHKMHELEIKPNVVTFSAILNACS 654

Query: 2076 RCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFY 2255
            RCNSFEDASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFY
Sbjct: 655  RCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFY 714

Query: 2256 NALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIR 2435
            NALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAWLL+IR
Sbjct: 715  NALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWLLNIR 774

Query: 2436 SIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPG 2615
            SIVFEGHELPKL+SILTGWGKHSKVVGD TLRRAIEALL GMGAPFR+A CNLGRFIS G
Sbjct: 775  SIVFEGHELPKLLSILTGWGKHSKVVGDSTLRRAIEALLMGMGAPFRLAKCNLGRFISTG 834

Query: 2616 AVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            +VVAAWL+ESGTLKVLVLHD R  QE   F +  NL+TL L
Sbjct: 835  SVVAAWLRESGTLKVLVLHDHRTEQENLRFGQASNLQTLQL 875


>ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345388|gb|ERP64510.1| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 873

 Score = 1226 bits (3172), Expect = 0.0
 Identities = 635/881 (72%), Positives = 720/881 (81%), Gaps = 12/881 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGT-KPYQNHQFPQNPIXXXXXXXXXXXX--W-AHRKVSLTKPSQVPHXX 299
            MAS+ PHCSIT T K YQNH +P N +              W ++++VSL KP   P   
Sbjct: 1    MASTPPHCSITATTKHYQNHPYPHNQLKNHRQTHNQNPHQRWTSNQRVSLAKPPLPPSRN 60

Query: 300  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 479
                                 PTF S      P KSEL +DF GRRSTRFVSK+HFGR +
Sbjct: 61   APKPAATTTTTTTQHPQIH--PTFSSFQ----PPKSELVSDFPGRRSTRFVSKLHFGRPR 114

Query: 480  AQLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 656
              + +RH+SVA+EALQ  I    DE+ LENV L FES++SGSDDY+F+LRELGNRG+C K
Sbjct: 115  TTMGTRHTSVAQEALQNVIEYGKDERALENVLLNFESRLSGSDDYVFLLRELGNRGDCKK 174

Query: 657  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 836
            A+ CFEFAV+RER++NEQGKLASAMIS LGRLGKV++A+ VF+  L  GYGNTVYA+SA+
Sbjct: 175  AICCFEFAVKRERKKNEQGKLASAMISTLGRLGKVEMAKTVFKAALTEGYGNTVYAFSAI 234

Query: 837  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1016
            ISAYGRSG  +EAIK+F +MK+ GLKPNLVTYNAVIDACGKGGVEF +  EI +EMLRNG
Sbjct: 235  ISAYGRSGYCNEAIKIFYSMKDYGLKPNLVTYNAVIDACGKGGVEFKRVLEIFDEMLRNG 294

Query: 1017 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1196
            +QPDRITFNSLLAVCS+GGLWEAAR+   EMV RGIDQDIFTYNTLLDA CKGGQ+D+AF
Sbjct: 295  MQPDRITFNSLLAVCSKGGLWEAARSLSCEMVNRGIDQDIFTYNTLLDAVCKGGQLDMAF 354

Query: 1197 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1376
            +IM+EM AKNI PNVVTYSTMIDGYAK GRLDDA NL  EMKF GI LDRVSYNTLLSIY
Sbjct: 355  EIMSEMPAKNILPNVVTYSTMIDGYAKAGRLDDARNLFNEMKFLGISLDRVSYNTLLSIY 414

Query: 1377 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1556
            AKLGRFE+ ++VC EME+SGI+KD VTYNALLGGYGKQ KYD V++VF+EMK RH+SPNL
Sbjct: 415  AKLGRFEEAMDVCREMENSGIRKDVVTYNALLGGYGKQYKYDVVRKVFEEMKARHVSPNL 474

Query: 1557 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1736
            LTYSTLIDVYSKGGLYREA++VF+EFK+AGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1737 MTKEGIRPNVVTYNSIIDAFGRSAT-------ACGSNESLIEPSSLIALKDVSESNDEHR 1895
            MTKEGIRPNVVTYNSIIDAFGR AT       A  ++E  I+  S  A++  ++S    R
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRPATTESVVDDAGQTSELQIDSLSSSAVEKATKSLVADR 594

Query: 1896 EDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACS 2075
            EDN+IIKIFGQLAAEK+  AK  N G QE++C+L VF KM ELEIKPNVVTFSAILNACS
Sbjct: 595  EDNRIIKIFGQLAAEKAGQAK--NSGGQEMMCILGVFHKMHELEIKPNVVTFSAILNACS 652

Query: 2076 RCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFY 2255
            RCNSFE+ASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFY
Sbjct: 653  RCNSFEEASMLLEELRLFDNQVYGVAHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFY 712

Query: 2256 NALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIR 2435
            NALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAWLL++R
Sbjct: 713  NALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWLLNVR 772

Query: 2436 SIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPG 2615
            +IVFEGHE+PKL+SILTGWGKHSKVVGD TLRRA+EALL GMGAPFR A CNLGR IS G
Sbjct: 773  AIVFEGHEVPKLLSILTGWGKHSKVVGDSTLRRAVEALLMGMGAPFRSAKCNLGRLISTG 832

Query: 2616 AVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            +VVA+WL+ESGTLKVLVLHDDR HQE   F +I NL+ L L
Sbjct: 833  SVVASWLRESGTLKVLVLHDDRTHQENLRFGQISNLQMLQL 873


>gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]
          Length = 871

 Score = 1219 bits (3154), Expect = 0.0
 Identities = 628/878 (71%), Positives = 710/878 (80%), Gaps = 11/878 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNP---IXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXX 302
            MAS+ PHCSIT +KPYQ+HQ+ QNP                W  +KVSLTKPS  P    
Sbjct: 1    MASTPPHCSITASKPYQSHQYAQNPNLKSHHRHSNHRQGHQWTTQKVSLTKPSPSP---- 56

Query: 303  XXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKA 482
                                P F SL   P P KS+L A FSGRRSTRFVSKMH GR K 
Sbjct: 57   ---PPARNAAATPAQHASQNPAFHSLCSLPAP-KSDLAAVFSGRRSTRFVSKMHLGRPKT 112

Query: 483  QLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKA 659
             + SRH++VAEE LQ++I+   D+ G++NV L FE K+ GSDDY F+LRELGNRGEC KA
Sbjct: 113  TVGSRHTAVAEEVLQQAIQFGKDDLGIDNVLLSFEPKLCGSDDYTFLLRELGNRGECRKA 172

Query: 660  VQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALI 839
            ++CFEFAV RERR+ EQGKL SAMIS LGRLGKV+LAR VFE  L AGYGNTVY YSALI
Sbjct: 173  IRCFEFAVARERRKTEQGKLTSAMISTLGRLGKVELARDVFETALFAGYGNTVYTYSALI 232

Query: 840  SAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGI 1019
            SAYGRSG  +EA +V E+MK+SGLKPNLVTYNAVIDACGKGG EF +  EI +EMLRNG+
Sbjct: 233  SAYGRSGYWEEARRVVESMKDSGLKPNLVTYNAVIDACGKGGAEFKRVVEIFDEMLRNGV 292

Query: 1020 QPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQ 1199
            QPDRIT+NSLLAVCSRGGLWEAAR+ FSEMV R IDQDI+TYNTLLDA CKGGQMDLA Q
Sbjct: 293  QPDRITYNSLLAVCSRGGLWEAARSLFSEMVERQIDQDIYTYNTLLDAICKGGQMDLARQ 352

Query: 1200 IMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYA 1379
            IM+EM +K I PNVVTYSTMIDGYAK GRL+DALNL  EMK+  IGLDRV YNTLLSIYA
Sbjct: 353  IMSEMPSKKILPNVVTYSTMIDGYAKAGRLEDALNLFNEMKYLAIGLDRVLYNTLLSIYA 412

Query: 1380 KLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLL 1559
            KLGRFE+ L VC EMESSGI +D V+YNALLGGYGKQGKYDEVKR++++MK  H+SPNLL
Sbjct: 413  KLGRFEEALKVCKEMESSGIVRDVVSYNALLGGYGKQGKYDEVKRMYQDMKADHVSPNLL 472

Query: 1560 TYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEM 1739
            TYSTLIDVYSKGGLYREA+EVF+EFKQAGLKADVVLYS LI+ALCKNG+VE AVSLLDEM
Sbjct: 473  TYSTLIDVYSKGGLYREAMEVFREFKQAGLKADVVLYSELINALCKNGMVESAVSLLDEM 532

Query: 1740 TKEGIRPNVVTYNSIIDAFGRSAT-------ACGSNESLIEPSSLIALKDVSESNDEHRE 1898
            TKEGI PNV+TYNSIIDAFGR AT       A G NE   E SS I+ ++ +++   ++ 
Sbjct: 533  TKEGIMPNVITYNSIIDAFGRPATADSALGAAIGGNELETELSSSISNENANKNKAVNKG 592

Query: 1899 DNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSR 2078
            D+QIIK+FGQLAAE+  H K D + RQEILC+L VFQKM EL IKPNVVTFSAILNACSR
Sbjct: 593  DHQIIKMFGQLAAEQEGHTKKDKKIRQEILCILGVFQKMHELNIKPNVVTFSAILNACSR 652

Query: 2079 CNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYN 2258
            CNSFEDASMLLEELR FDNQVYGVAHGLLMG ++NVW +AQ LFDEVKQMDS TASAFYN
Sbjct: 653  CNSFEDASMLLEELRLFDNQVYGVAHGLLMGHRENVWLEAQSLFDEVKQMDSSTASAFYN 712

Query: 2259 ALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRS 2438
            ALTDMLWHFGQK+GAQLVVLEGKRR+VWE++WS+S LDLHLMSSGAARA++HAWLL+IRS
Sbjct: 713  ALTDMLWHFGQKRGAQLVVLEGKRRNVWESVWSNSFLDLHLMSSGAARALLHAWLLNIRS 772

Query: 2439 IVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGA 2618
            +VFEG ELP+L+SILTGWGKHSKVVGD  LRRAIE+LL  MGAPF  A CNLGRF SPG 
Sbjct: 773  VVFEGQELPRLLSILTGWGKHSKVVGDSALRRAIESLLISMGAPFEAAKCNLGRFTSPGP 832

Query: 2619 VVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTL 2732
            +VA WLKESGTLKVLVLHDDR+H + +    + NL+TL
Sbjct: 833  MVAGWLKESGTLKVLVLHDDRSHSQNA--KHVSNLQTL 868


>ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score = 1209 bits (3127), Expect = 0.0
 Identities = 618/878 (70%), Positives = 698/878 (79%), Gaps = 9/878 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAH-RKVSLTKPSQVPHXXXXX 308
            MAS+ PHCSIT  KPYQ HQ+PQN +            W    K  L KP          
Sbjct: 1    MASTPPHCSITAAKPYQTHQYPQNNLKNHRQNARQNGPWTTTHKFPLVKP--------LP 52

Query: 309  XXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQL 488
                              P FPSL   P  +KSEL ++FSGRRSTRFVSK HFGR K+ +
Sbjct: 53   STPGHSATKSTSTPLSQSPNFPSLCSLPT-SKSELASNFSGRRSTRFVSKFHFGRPKSSM 111

Query: 489  SSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQ 665
            ++RHS++AEE L + ++   D+  L+N+ L FESK+ GS+DY F+LRELGNRGECWKA++
Sbjct: 112  TTRHSAIAEEVLHQVLQFGKDDASLDNILLNFESKLCGSEDYTFLLRELGNRGECWKAIR 171

Query: 666  CFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISA 845
            CF+FA+ RE R+NE+GKLASAMIS LGRLGKV+LA+GVFE  L  GYGNTV+A+SALISA
Sbjct: 172  CFDFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISA 231

Query: 846  YGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQP 1025
            YG+SG  DEAIKVFE+MK SGLKPNLVTYNAVIDACGKGGVEF +  EI  EMLRNG+QP
Sbjct: 232  YGKSGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQP 291

Query: 1026 DRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIM 1205
            DRIT+NSLLAVCSRGGLWEAARN F+EM+ RGIDQD+FTYNTLLDA CKGGQMDLA++IM
Sbjct: 292  DRITYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIM 351

Query: 1206 AEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKL 1385
             EM  K I PNVVTYSTM DGYAK GRL+DALNL  EMKF GIGLDRVSYNTLLSIYAKL
Sbjct: 352  LEMPGKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKL 411

Query: 1386 GRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTY 1565
            GRFED L VC EM SSG+KKD VTYNALL GYGKQGK++EV RVFKEMK   + PNLLTY
Sbjct: 412  GRFEDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTY 471

Query: 1566 STLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTK 1745
            STLIDVYSKG LY EA+EVF+EFKQAGLKADVVLYS LI+ALCKNGLV+ AV LLDEMTK
Sbjct: 472  STLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTK 531

Query: 1746 EGIRPNVVTYNSIIDAFGRSATA-------CGSNESLIEPSSLIALKDVSESNDEHREDN 1904
            EGIRPNVVTYNSIIDAFGRS TA         SNE   E  + + ++ V ES + + +D 
Sbjct: 532  EGIRPNVVTYNSIIDAFGRSTTAEFLVDGVGASNERQSESPTFMLIEGVDES-EINWDDG 590

Query: 1905 QIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCN 2084
             + K + QL +EK   AK +  G++EI  +L VF+KM ELEIKPNVVTFSAILNACSRC 
Sbjct: 591  HVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKKMHELEIKPNVVTFSAILNACSRCK 650

Query: 2085 SFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNAL 2264
            S EDASMLLEELR FDNQVYGVAHGLLMG  +NVW QAQ LFDEVKQMDS TASAFYNAL
Sbjct: 651  SIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNAL 710

Query: 2265 TDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIV 2444
            TDMLWHFGQK+GAQLVVLEGKRR VWE +WSDSCLDLHLMSSGAARAMVHAWLL I S+V
Sbjct: 711  TDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVV 770

Query: 2445 FEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVV 2624
            FEGH+LPKL+SILTGWGKHSKVVGDG LRRAIEALLT MGAPFRVA CN+GR++S G+VV
Sbjct: 771  FEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALLTSMGAPFRVAKCNIGRYVSTGSVV 830

Query: 2625 AAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            AAWLKESGTLK+LVLHDDR H +T   D I  L+T++L
Sbjct: 831  AAWLKESGTLKLLVLHDDRTHPDTENMDLISKLQTISL 868


>ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score = 1208 bits (3126), Expect = 0.0
 Identities = 618/878 (70%), Positives = 698/878 (79%), Gaps = 9/878 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAH-RKVSLTKPSQVPHXXXXX 308
            MAS+ PHCSIT  KPYQ HQ+PQN +            W    K  L KP          
Sbjct: 1    MASTPPHCSITAAKPYQTHQYPQNNLKNHRQNARQNGPWTTTHKFPLVKP--------LP 52

Query: 309  XXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQL 488
                              P FPSL   P  +KSEL ++FSGRRSTRFVSK HFGR K+ +
Sbjct: 53   STPGHSATKSTSTPLSQSPNFPSLCSLPT-SKSELASNFSGRRSTRFVSKFHFGRPKSSM 111

Query: 489  SSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQ 665
            ++RHS++AEE L + ++   D+  L+N+ L FESK+ GS+DY F+LRELGNRGECWKA++
Sbjct: 112  TTRHSAIAEEVLHQVLQFGKDDASLDNILLNFESKLCGSEDYTFLLRELGNRGECWKAIR 171

Query: 666  CFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISA 845
            CF+FA+ RE R+NE+GKLASAMIS LGRLGKV+LA+GVFE  L  GYGNTV+A+SALISA
Sbjct: 172  CFDFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISA 231

Query: 846  YGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQP 1025
            YG+SG  DEAIKVFE+MK SGLKPNLVTYNAVIDACGKGGVEF +  EI  EMLRNG+QP
Sbjct: 232  YGKSGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQP 291

Query: 1026 DRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIM 1205
            DRIT+NSLLAVCSRGGLWEAARN F+EM+ RGIDQD+FTYNTLLDA CKGGQMDLA++IM
Sbjct: 292  DRITYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIM 351

Query: 1206 AEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKL 1385
             EM  K I PNVVTYSTM DGYAK GRL+DALNL  EMKF GIGLDRVSYNTLLSIYAKL
Sbjct: 352  LEMPGKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKL 411

Query: 1386 GRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTY 1565
            GRFED L VC EM SSG+KKD VTYNALL GYGKQGK++EV RVFKEMK   + PNLLTY
Sbjct: 412  GRFEDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTY 471

Query: 1566 STLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTK 1745
            STLIDVYSKG LY EA+EVF+EFKQAGLKADVVLYS LI+ALCKNGLV+ AV LLDEMTK
Sbjct: 472  STLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTK 531

Query: 1746 EGIRPNVVTYNSIIDAFGRSATA-------CGSNESLIEPSSLIALKDVSESNDEHREDN 1904
            EGIRPNVVTYNSIIDAFGRS TA         SNE   E  S + ++ V ES + + +D 
Sbjct: 532  EGIRPNVVTYNSIIDAFGRSTTAEFLVDGVGASNERQSESPSFMLIEGVDES-EINWDDG 590

Query: 1905 QIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCN 2084
             + K + QL +EK   AK +  G++EI  +L VF+KM ELEIKPNVVTFSAILNACSRC 
Sbjct: 591  HVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKKMHELEIKPNVVTFSAILNACSRCK 650

Query: 2085 SFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNAL 2264
            S EDASMLLEELR FDNQVYGVAHGLLMG  +NVW QAQ LFDEVKQMDS TASAFYNAL
Sbjct: 651  SIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNAL 710

Query: 2265 TDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIV 2444
            TDMLWHFGQK+GAQLVVLEGKRR VWE +WSDSCLDLHLMSSGAARAMVHAWLL I S+V
Sbjct: 711  TDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVV 770

Query: 2445 FEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVV 2624
            FEGH+LPKL+SILTGWGKHSKVVGDG LRRAIEALLT MGAPFRVA CN+GR++S G+VV
Sbjct: 771  FEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALLTSMGAPFRVAKCNIGRYVSTGSVV 830

Query: 2625 AAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            AAWLKESGTLK+LVLHDDR H ++   D I  L+T++L
Sbjct: 831  AAWLKESGTLKLLVLHDDRTHPDSENMDLISKLQTISL 868


>ref|XP_007221553.1| hypothetical protein PRUPE_ppa001263mg [Prunus persica]
            gi|462418303|gb|EMJ22752.1| hypothetical protein
            PRUPE_ppa001263mg [Prunus persica]
          Length = 868

 Score = 1199 bits (3103), Expect = 0.0
 Identities = 610/877 (69%), Positives = 702/877 (80%), Gaps = 8/877 (0%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXXXX 311
            MAS+ PHCSIT TKPYQ H++PQN              W  ++VSL KP  +P       
Sbjct: 1    MASTPPHCSITATKPYQTHRYPQNQHLKSQRQSRQSNQWTKQQVSLPKPLPLPSQAPRTA 60

Query: 312  XXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQLS 491
                              +F SL   P P KS+L   FSGRRSTRFVSKMH GR K  + 
Sbjct: 61   AKTPTATPTS--------SFSSLCPLPHP-KSDLVTAFSGRRSTRFVSKMHLGRPKTTMG 111

Query: 492  SRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQC 668
            S  S +AEEAL ++++  ND+  L+++ L F S++ GSDDY F+ RELGNRGECWKA++C
Sbjct: 112  SYRSPLAEEALHQAVQFGNDDLALDDILLSFHSRLCGSDDYTFLFRELGNRGECWKAIRC 171

Query: 669  FEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAY 848
            FEFAVRRE+RR EQGKLAS+MIS LGRLGKV+LA+ VF+  +  GYG TVY YSALI+AY
Sbjct: 172  FEFAVRREKRRTEQGKLASSMISTLGRLGKVELAKNVFQTAVNEGYGKTVYTYSALITAY 231

Query: 849  GRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPD 1028
            GR+G  +EAI+VFE+MK+SGLKPNLVTYNAVIDA GKGGVEF +  EI NEMLRNG QPD
Sbjct: 232  GRNGYCEEAIRVFESMKDSGLKPNLVTYNAVIDAYGKGGVEFKRVVEIFNEMLRNGEQPD 291

Query: 1029 RITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMA 1208
            RIT+NSLLAVCSRGGLWE ARN FSEMV RGIDQDI+TYNTL+DA CKGGQMDLA+QIM+
Sbjct: 292  RITYNSLLAVCSRGGLWEMARNLFSEMVDRGIDQDIYTYNTLIDAICKGGQMDLAYQIMS 351

Query: 1209 EMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLG 1388
            EM +KNI PNVVTYST+IDGYAK GRL+DAL+L  EMKF  IGLDRV YNTLLS+Y KLG
Sbjct: 352  EMPSKNILPNVVTYSTIIDGYAKAGRLEDALSLFNEMKFLAIGLDRVLYNTLLSLYGKLG 411

Query: 1389 RFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYS 1568
            RFED L VC EMES GI KD V+YNALLGGYGKQGKYD+ KR++ +MK   +SPN+LTYS
Sbjct: 412  RFEDALKVCKEMESVGIAKDVVSYNALLGGYGKQGKYDDAKRMYNQMKEERVSPNILTYS 471

Query: 1569 TLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKE 1748
            TLIDVYSKGGLY EA++VF+EFKQAGLKADVVLYS L++ALCKNGLVE AV LLDEMTKE
Sbjct: 472  TLIDVYSKGGLYMEAMKVFREFKQAGLKADVVLYSELVNALCKNGLVESAVLLLDEMTKE 531

Query: 1749 GIRPNVVTYNSIIDAFGRSAT-------ACGSNESLIEPSSLIALKDVSESNDEHREDNQ 1907
            GIRPNVVTYNSIIDAFGRSAT       A G      E SS ++  D        R DN+
Sbjct: 532  GIRPNVVTYNSIIDAFGRSATTECAADAAGGGIVLQTESSSSVSEGDAIGIQVGDRGDNR 591

Query: 1908 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2087
             +K+FGQLAAEK+ +AK D + RQEILC+L +FQKM EL+IKPNVVTFSAILNACSRCNS
Sbjct: 592  FMKMFGQLAAEKAGYAKTDRKVRQEILCILGIFQKMHELDIKPNVVTFSAILNACSRCNS 651

Query: 2088 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2267
            FEDASMLLEELR FDN+VYGVAHGLLMG +DNVW +A+ LFDEVKQMDS TASAFYNALT
Sbjct: 652  FEDASMLLEELRLFDNKVYGVAHGLLMGYRDNVWVKAESLFDEVKQMDSSTASAFYNALT 711

Query: 2268 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2447
            DMLWH+GQK+GAQLVVLEGKRR+VWE++WS+SCLDLHLMSSGAARAMVHAWLL+IRSIVF
Sbjct: 712  DMLWHYGQKQGAQLVVLEGKRRNVWESVWSNSCLDLHLMSSGAARAMVHAWLLNIRSIVF 771

Query: 2448 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2627
            EG +LP L+SILTGWGKHSKVVGD TLRRAIEALLT MGAPFRVA CNLGRFIS G++ A
Sbjct: 772  EGQQLPNLLSILTGWGKHSKVVGDSTLRRAIEALLTSMGAPFRVAKCNLGRFISTGSMAA 831

Query: 2628 AWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            AWL+ESGTL+VLVLHDDR   +++  ++  NL+ L L
Sbjct: 832  AWLRESGTLEVLVLHDDRTCPKSADLEQTSNLQALAL 868


>ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345387|gb|EEE80792.2| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 864

 Score = 1198 bits (3099), Expect = 0.0
 Identities = 626/881 (71%), Positives = 711/881 (80%), Gaps = 12/881 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGT-KPYQNHQFPQNPIXXXXXXXXXXXX--W-AHRKVSLTKPSQVPHXX 299
            MAS+ PHCSIT T K YQNH +P N +              W ++++VSL KP   P   
Sbjct: 1    MASTPPHCSITATTKHYQNHPYPHNQLKNHRQTHNQNPHQRWTSNQRVSLAKPPLPPSRN 60

Query: 300  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 479
                                 PTF S      P KSEL +DF GRRSTRFVSK+HFGR +
Sbjct: 61   APKPAATTTTTTTQHPQIH--PTFSSFQ----PPKSELVSDFPGRRSTRFVSKLHFGRPR 114

Query: 480  AQLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 656
              + +RH+SVA+EALQ  I    DE+ LENV L FES++SGSDDY+F+LRELGNRG+C K
Sbjct: 115  TTMGTRHTSVAQEALQNVIEYGKDERALENVLLNFESRLSGSDDYVFLLRELGNRGDCKK 174

Query: 657  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 836
            A+ CFEFAV+RER++NEQGKLASAMIS LGRLGKV++A+ VF+  L  GYGNTVYA+SA+
Sbjct: 175  AICCFEFAVKRERKKNEQGKLASAMISTLGRLGKVEMAKTVFKAALTEGYGNTVYAFSAI 234

Query: 837  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1016
            ISAYGRSG  +EAIK+F +MK+ GLKPNLVTYNAVIDACGKGGVEF +  EI +EMLRNG
Sbjct: 235  ISAYGRSGYCNEAIKIFYSMKDYGLKPNLVTYNAVIDACGKGGVEFKRVLEIFDEMLRNG 294

Query: 1017 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1196
            +QPDRITFNSLLAVCS+GGLWEAAR+   EMV RGIDQDIFTYNTLLDA CKGGQ+D+AF
Sbjct: 295  MQPDRITFNSLLAVCSKGGLWEAARSLSCEMVNRGIDQDIFTYNTLLDAVCKGGQLDMAF 354

Query: 1197 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1376
            +IM+EM AKNI PNVVTYSTMIDGYAK GRLDDA NL  EMKF GI LDRVSYNTLLSIY
Sbjct: 355  EIMSEMPAKNILPNVVTYSTMIDGYAKAGRLDDARNLFNEMKFLGISLDRVSYNTLLSIY 414

Query: 1377 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1556
            AKLGRFE+ ++VC EME+SGI+KD VTYNALLGGYGKQ KYD V++VF+EMK RH+SPNL
Sbjct: 415  AKLGRFEEAMDVCREMENSGIRKDVVTYNALLGGYGKQYKYDVVRKVFEEMKARHVSPNL 474

Query: 1557 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1736
            LTYSTLIDVYSKGGLYREA++VF+EFK+AGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1737 MTKEGIRPNVVTYNSIIDAFGRSAT-------ACGSNESLIEPSSLIALKDVSESNDEHR 1895
            MTKEGIRPNVVTYNSIIDAFGR AT       A  ++E  I+  S  A++  ++S    R
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRPATTESVVDDAGQTSELQIDSLSSSAVEKATKSLVADR 594

Query: 1896 EDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACS 2075
            EDN+IIKIFGQLAAEK+  AK  N G QE++C+L VF KM ELEIKPNVVTFSAILNACS
Sbjct: 595  EDNRIIKIFGQLAAEKAGQAK--NSGGQEMMCILGVFHKMHELEIKPNVVTFSAILNACS 652

Query: 2076 RCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFY 2255
            RCNSFE+ASMLLEELR FDNQVYGVAHGLLMG ++NVW QAQ LFDEVK MDS TASAFY
Sbjct: 653  RCNSFEEASMLLEELRLFDNQVYGVAHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFY 712

Query: 2256 NALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIR 2435
            NALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAWLL++R
Sbjct: 713  NALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWLLNVR 772

Query: 2436 SIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPG 2615
            +IVFEGHE+PKL+         SKVVGD TLRRA+EALL GMGAPFR A CNLGR IS G
Sbjct: 773  AIVFEGHEVPKLL---------SKVVGDSTLRRAVEALLMGMGAPFRSAKCNLGRLISTG 823

Query: 2616 AVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            +VVA+WL+ESGTLKVLVLHDDR HQE   F +I NL+ L L
Sbjct: 824  SVVASWLRESGTLKVLVLHDDRTHQENLRFGQISNLQMLQL 864


>ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 870

 Score = 1187 bits (3071), Expect = 0.0
 Identities = 618/883 (69%), Positives = 701/883 (79%), Gaps = 14/883 (1%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXXXX 311
            MAS+ PHCSIT TKPYQ HQ+PQN                   VSL+KP  +P       
Sbjct: 1    MASTPPHCSITATKPYQTHQYPQNQRLKSHRQTRPTT----HHVSLSKPLPLP------- 49

Query: 312  XXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQLS 491
                             P   S S   PP KS+L + FSGRRSTR VSKMH GR K  + 
Sbjct: 50   -PRPPPRTVPKPASAAGPVPSSFSSLCPPAKSDLVSAFSGRRSTRMVSKMHLGRPKTTVG 108

Query: 492  SRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQC 668
            SRHS +AEEAL+ +IR   D+  L++V   FES++  SDD+ F+LRELGNRGECWKA++C
Sbjct: 109  SRHSPLAEEALETAIRFGKDDFALDDVLHSFESRLV-SDDFTFLLRELGNRGECWKAIRC 167

Query: 669  FEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAY 848
            FEFAVRRER+R EQGKLAS+MIS LGRLGKV+LA+ VF+  +  GYG TVY YSALISAY
Sbjct: 168  FEFAVRRERKRTEQGKLASSMISTLGRLGKVELAKNVFQTAVNEGYGRTVYTYSALISAY 227

Query: 849  GRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPD 1028
            GRSG  DEAI+V E+MK+SG+KPNLVTYNAVIDACGKGGVEF +  EI +EML+ G+QPD
Sbjct: 228  GRSGYCDEAIRVLESMKDSGVKPNLVTYNAVIDACGKGGVEFKKVVEIFDEMLKVGVQPD 287

Query: 1029 RITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMA 1208
            RIT+NSLLAVCSRGGLWEAARN FSEMV RGIDQDI+TYNTLLDA  KGGQMDLA++IM+
Sbjct: 288  RITYNSLLAVCSRGGLWEAARNLFSEMVDRGIDQDIYTYNTLLDAISKGGQMDLAYKIMS 347

Query: 1209 EMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLG 1388
            EM +KNI PNVVTYSTMIDGYAK GRL+DALNL  EMKF  IGLDRV YNTLLS+Y KLG
Sbjct: 348  EMPSKNILPNVVTYSTMIDGYAKAGRLEDALNLFNEMKFLAIGLDRVLYNTLLSLYGKLG 407

Query: 1389 RFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYS 1568
            RFE+ LNVC EMES GI KD V+YNALLGGYGKQGKYDEVK ++ EMK   +SPNLLTYS
Sbjct: 408  RFEEALNVCKEMESVGIAKDVVSYNALLGGYGKQGKYDEVKGLYNEMKVERVSPNLLTYS 467

Query: 1569 TLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKE 1748
            TLIDVYSKGGLY EA++VF+EFKQAGLKADVVLYS LI+ALCKNGLVE AVSLLDEMTKE
Sbjct: 468  TLIDVYSKGGLYAEAVKVFREFKQAGLKADVVLYSELINALCKNGLVESAVSLLDEMTKE 527

Query: 1749 GIRPNVVTYNSIIDAFGRSAT--------ACGSNESLIEPSSLIALK-DVSESNDEH--- 1892
            GIRPNVVTYNSIIDAFGR AT        ACG        SS+ A   D+S+ N ++   
Sbjct: 528  GIRPNVVTYNSIIDAFGRPATTVCAVDAGACGIVLRSESSSSISARDFDISDKNVQNEMR 587

Query: 1893 -REDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNA 2069
             RED +I+K+FGQL A+K+ +AK D + RQEILC+L VFQKM EL+IKPNVVTFSAILNA
Sbjct: 588  DREDTRIMKMFGQLTADKAGYAKKDRKVRQEILCILGVFQKMHELDIKPNVVTFSAILNA 647

Query: 2070 CSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASA 2249
            CSRCNSFEDASMLLEELR FDNQVYGVAHGLLMG + NVW +AQ LFDEVKQMD  TASA
Sbjct: 648  CSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGCRGNVWVKAQSLFDEVKQMDCSTASA 707

Query: 2250 FYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLD 2429
            FYNALTDMLWHFGQKKGAQLVVLEG+RR+VWEN WS+S LDLHLMSSGAARAMVHAWLL+
Sbjct: 708  FYNALTDMLWHFGQKKGAQLVVLEGERRNVWENAWSNSRLDLHLMSSGAARAMVHAWLLN 767

Query: 2430 IRSIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFIS 2609
            I SIV++G +LP L+SILTGWGKHSKVVGD  LRRA+EALLT MGAPFRV  CN+GRFIS
Sbjct: 768  IHSIVYQGQQLPNLLSILTGWGKHSKVVGDSALRRAVEALLTSMGAPFRVHECNIGRFIS 827

Query: 2610 PGAVVAAWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
             G+V AAWLKESGTL+VL+LHDDRA   ++ F +I +LR L L
Sbjct: 828  TGSVAAAWLKESGTLEVLMLHDDRAEPNSANFGQISDLRALAL 870


>ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutrema salsugineum]
            gi|557095737|gb|ESQ36319.1| hypothetical protein
            EUTSA_v10006755mg [Eutrema salsugineum]
          Length = 895

 Score = 1150 bits (2975), Expect = 0.0
 Identities = 603/892 (67%), Positives = 698/892 (78%), Gaps = 36/892 (4%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXX---WAHRKVSLTK--------- 275
            MAS+ PHCSIT TKPYQN+ +PQN +               WA ++ S +          
Sbjct: 1    MASTPPHCSITATKPYQNNPYPQNQLKNHRPSLHPPRYHRPWAPQRFSPSPLGGGTKGRG 60

Query: 276  PSQVPHXXXXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVS 455
             +  P                        P FP+LS    P KS+L+ DF+GRRSTRFVS
Sbjct: 61   SAPSPSSSSSAAVAAAAATTASGQLSQASPRFPALSPLQTP-KSDLSPDFAGRRSTRFVS 119

Query: 456  KMHFGRQKAQLSSRHSSVAEEALQESIRIN-DEKGLENVFLCFESKMSGSDDYIFVLREL 632
            KMHFGR K  ++SRHS VAE+AL  +I+ + +++GL+N+ L FESK+ GSDDY ++LREL
Sbjct: 120  KMHFGRPKTAMASRHSLVAEDALHHAIQFSGNDEGLQNLLLSFESKLCGSDDYTYILREL 179

Query: 633  GNRGECWKAVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGN 812
            GNRGE  KAV+ +EFAV+RERR+NEQGKLASAMIS LGRLGKV +A+ VFE  L  GYGN
Sbjct: 180  GNRGEFEKAVRFYEFAVKRERRKNEQGKLASAMISTLGRLGKVGIAKRVFETALADGYGN 239

Query: 813  TVYAYSALISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEI 992
            TVYA+SA+ISAYGRSG  ++AIKVF +MK  GL+PNLVTYNAVIDACGKGG+EF Q AE 
Sbjct: 240  TVYAFSAIISAYGRSGYHEDAIKVFSSMKGHGLRPNLVTYNAVIDACGKGGMEFKQVAEF 299

Query: 993  LNEMLRNGIQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCK 1172
             +EM RN +QPDRITFNSLLAVCSRGG WEAARN F EM+ RGI+QDIFTYNTLLDA CK
Sbjct: 300  FDEMQRNRVQPDRITFNSLLAVCSRGGSWEAARNLFDEMLNRGIEQDIFTYNTLLDAICK 359

Query: 1173 GGQMDLAFQIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVS 1352
            GGQMDLAF+I+A+M AKNI PNVVTYST+IDGYAK GR +DAL L GEMK+ GI LDRVS
Sbjct: 360  GGQMDLAFEILAQMPAKNIMPNVVTYSTVIDGYAKAGRFNDALTLFGEMKYLGIPLDRVS 419

Query: 1353 YNTLLSIYAKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMK 1532
            YNTL+SIYAKLGRFE+ L++  EM ++GI+KDAVTYNALLGGYGK  KYDEVK VF EMK
Sbjct: 420  YNTLVSIYAKLGRFEEALDIVKEMAAAGIRKDAVTYNALLGGYGKHEKYDEVKSVFAEMK 479

Query: 1533 TRHLSPNLLTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVE 1712
               + PNLLTYSTLIDVYSKGGLY+EA+E+F+EFK  GL+ADVVLYSALIDALCKNGLVE
Sbjct: 480  QERVLPNLLTYSTLIDVYSKGGLYKEAMEIFREFKSVGLRADVVLYSALIDALCKNGLVE 539

Query: 1713 IAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSATA-C-------GSN-----ESLIEPSSL 1853
             AVSLLDEMTKEGI PNVVTYNS+IDAFGRSAT  C       G+N     ES    S+ 
Sbjct: 540  SAVSLLDEMTKEGISPNVVTYNSMIDAFGRSATTECLADINEGGANGLEEDESFSSSSAS 599

Query: 1854 IALKD-----VSESNDEHR----EDNQIIKIFGQLAAEKSCHAKID-NRGRQEILCVLEV 2003
            ++  D     V E++   +    ED++I++IFGQL  E +   K D  +G QE+ C+LEV
Sbjct: 600  LSHTDSLSLAVGEADSLSKLTKTEDHRIVEIFGQLVTEGNNQIKRDCKQGVQELSCILEV 659

Query: 2004 FQKMQELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDN 2183
              KM ELEIKPNVVTFSAILNACSRCNSFE+ASMLLEELR FDN+VYGVAHGLLMG  +N
Sbjct: 660  CHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELRLFDNKVYGVAHGLLMGYNEN 719

Query: 2184 VWAQAQCLFDEVKQMDSPTASAFYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDS 2363
            VW QAQ LFDEVK MD  TASAFYNALTDMLWHFGQK+GAQ VVLEG+RR VWEN+WSDS
Sbjct: 720  VWIQAQSLFDEVKAMDGSTASAFYNALTDMLWHFGQKRGAQSVVLEGRRRKVWENVWSDS 779

Query: 2364 CLDLHLMSSGAARAMVHAWLLDIRSIVFEGHELPKLISILTGWGKHSKVVGDGTLRRAIE 2543
            CLDLHLMSSGAARAMVHAWLL+IRSIV+EGHELPKL+SILTGWGKHSKV+GDGTLRRA+E
Sbjct: 780  CLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKLLSILTGWGKHSKVMGDGTLRRAVE 839

Query: 2544 ALLTGMGAPFRVANCNLGRFISPGAVVAAWLKESGTLKVLVLHDDRAHQETS 2699
            ALL GMGAPF VA CN+GRF+S G+VVAAWL+ESGTLKVLVL +D  H+E S
Sbjct: 840  ALLRGMGAPFHVAKCNVGRFVSSGSVVAAWLRESGTLKVLVL-EDHKHEEAS 890


>ref|XP_006355855.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Solanum tuberosum]
          Length = 848

 Score = 1118 bits (2891), Expect = 0.0
 Identities = 586/873 (67%), Positives = 678/873 (77%), Gaps = 4/873 (0%)
 Frame = +3

Query: 132  MASSTP--HCSITGTKPYQNHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPHXXXX 305
            MASSTP  HC++T +KPY  H   Q               W+ +KVSL +P+   +    
Sbjct: 1    MASSTPPPHCALTTSKPYHPHPLTQTH-SHPNHRNNHQRHWSSQKVSLNRPAPPRNATHP 59

Query: 306  XXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQKAQ 485
                               P F SLS +    KS+ +ADFSGRRSTRFVSKMHFGR K  
Sbjct: 60   PPSQT--------------PNFLSLSSS----KSDFSADFSGRRSTRFVSKMHFGRAKIS 101

Query: 486  LSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAV 662
             + RHSS AEEAL+E+IR   +E GL+ V L F SK+ GSDDY F+ RELGNRGE   A+
Sbjct: 102  GNGRHSSFAEEALEEAIRCCKNEAGLDQVLLTFGSKLLGSDDYTFLFRELGNRGEWLAAM 161

Query: 663  QCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALIS 842
            +CFEFAV RER+RNEQGKLAS+MISILGR GKVDLA  VFE  +  GYGNTVYAYSALIS
Sbjct: 162  RCFEFAVGRERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGNTVYAYSALIS 221

Query: 843  AYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQ 1022
            AY +SG  +EAI+VFETMK+SGLKPNLVTYNA+IDACGKGG +F +A+EI +EMLRNG+Q
Sbjct: 222  AYAKSGYCNEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLRNGVQ 281

Query: 1023 PDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQI 1202
            PDRITFNSLLAVCS  GLWE AR  F+EM+YRGIDQDI+TYNT LDA C GGQ+D+AF I
Sbjct: 282  PDRITFNSLLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDAACNGGQIDVAFDI 341

Query: 1203 MAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAK 1382
            M+EM AKNI PN VTYST+I G AK GRLD AL+L  EMK AGI LDRVSYNTLL+IYA 
Sbjct: 342  MSEMHAKNILPNQVTYSTVIRGCAKAGRLDRALSLFNEMKCAGITLDRVSYNTLLAIYAS 401

Query: 1383 LGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLT 1562
            LG+FE+ LNV  EMES GIKKD VTYNALL G+GKQG Y +VK++F EMK   LSPNLLT
Sbjct: 402  LGKFEEALNVSKEMESMGIKKDVVTYNALLDGFGKQGMYIKVKQLFAEMKAEKLSPNLLT 461

Query: 1563 YSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMT 1742
            YSTLI VY KG LY +A+EV+KEFK+ GLKADVV YS LIDALCK GLVE +  LL+EMT
Sbjct: 462  YSTLISVYLKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLLNEMT 521

Query: 1743 KEGIRPNVVTYNSIIDAFGRSAT-ACGSNESLIEPSSLIALKDVSESNDEHREDNQIIKI 1919
            KEGI+PNVVTYNSII+AFG SA+  CGS+      +    +  +S+S  E+ E++ I+KI
Sbjct: 522  KEGIQPNVVTYNSIINAFGESASNECGSD------NVTQIVSTISQSKWENTEEDNIVKI 575

Query: 1920 FGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNSFEDA 2099
            F QLAA+KS   K  N  RQ+ILC+L VF KM EL+IKPNVVTFSAILNACSRC+SF++A
Sbjct: 576  FEQLAAQKSASGKKTNAERQDILCILGVFHKMHELQIKPNVVTFSAILNACSRCSSFDEA 635

Query: 2100 SMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALTDMLW 2279
            S+LLEELR FDNQVYGVAHGLLMG+++ VWAQA  LF+EVKQMDS TASAFYNALTDMLW
Sbjct: 636  SLLLEELRIFDNQVYGVAHGLLMGQREGVWAQALSLFNEVKQMDSSTASAFYNALTDMLW 695

Query: 2280 HFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVFEGHE 2459
            HF QK+GAQLVVLEGKR  VWEN WS SCLDLHLMSSGAA AMVHAWLL IRSIVFEGHE
Sbjct: 696  HFDQKQGAQLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVFEGHE 755

Query: 2460 LPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVAAWLK 2639
            LPK++SILTGWGKHSK+ GDG L+RAIE LLT +GAPF+VA CN+GRFIS GAVV AWL+
Sbjct: 756  LPKMLSILTGWGKHSKITGDGALKRAIEGLLTSIGAPFQVAKCNIGRFISTGAVVTAWLR 815

Query: 2640 ESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            ESGTL+VLVL DD +H   + F +I NL+ LTL
Sbjct: 816  ESGTLEVLVLQDDTSHLRATRFGQISNLQQLTL 848


>ref|XP_004240564.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like isoform 1 [Solanum lycopersicum]
          Length = 841

 Score = 1114 bits (2881), Expect = 0.0
 Identities = 583/877 (66%), Positives = 678/877 (77%), Gaps = 8/877 (0%)
 Frame = +3

Query: 132  MASSTP--HCSITGTKPYQ----NHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPH 293
            MASSTP  HC++T +KPYQ    +H  P +              W+ +KVSL  P    H
Sbjct: 1    MASSTPPPHCALTTSKPYQPQTHSHPHPNH-------RNNHQRHWSSQKVSLNPPRNPNH 53

Query: 294  XXXXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGR 473
                                   P F SLS +    KS+ +ADFSGRRSTRFVSKMHFGR
Sbjct: 54   PSQT-------------------PNFLSLSSS----KSDFSADFSGRRSTRFVSKMHFGR 90

Query: 474  QKAQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGEC 650
             K   + RHSS A+EAL+E+IR  N+E GL+ V L F SK+ GSDDY F+ RELGNRGE 
Sbjct: 91   AKISGNGRHSSFAQEALEEAIRCCNNEAGLDQVLLTFGSKLVGSDDYTFLFRELGNRGEW 150

Query: 651  WKAVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYS 830
              A++CF+FAV RER+RNEQGKLAS+MISILGR GKVDLA  VFE  +  GYG+TVYAYS
Sbjct: 151  LAAMRCFQFAVGRERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGSTVYAYS 210

Query: 831  ALISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLR 1010
            ALISAY +SG  +EAI+VFETMK+SGLKPNLVTYNA+IDACGKGG +F +A+EI +EMLR
Sbjct: 211  ALISAYAKSGYCNEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLR 270

Query: 1011 NGIQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDL 1190
            NG+QPDRITFNSLLAVCS  GLWE AR  F+EM+YRGIDQDI+TYNT LD  C GGQ+D+
Sbjct: 271  NGVQPDRITFNSLLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDVACNGGQIDV 330

Query: 1191 AFQIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLS 1370
            AF IM+EM AKNI PN VTYST+I G AK GRLD AL+L  EMK AGI LDRVSYNTLL+
Sbjct: 331  AFDIMSEMHAKNILPNQVTYSTVIRGCAKAGRLDKALSLFNEMKCAGIKLDRVSYNTLLA 390

Query: 1371 IYAKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSP 1550
            IYA LG+FE+ LNV  EME  GIKKD VTYNALL G+GKQG Y +VK++F EMK   LSP
Sbjct: 391  IYASLGKFEEALNVSKEMEGMGIKKDVVTYNALLDGFGKQGMYTKVKQLFAEMKAEKLSP 450

Query: 1551 NLLTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLL 1730
            NLLTYSTLI VY KG LY +A+EV+KEFK+ GLKADVV YS LIDALCK GLVE +  LL
Sbjct: 451  NLLTYSTLISVYLKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLL 510

Query: 1731 DEMTKEGIRPNVVTYNSIIDAFGRSA-TACGSNESLIEPSSLIALKDVSESNDEHREDNQ 1907
            +EMTKEGI+PNVVTYNSII+AFG SA   CGS+      S+      +S+S  E+ E++ 
Sbjct: 511  NEMTKEGIQPNVVTYNSIINAFGESANNECGSDNVTHIVSA------ISQSKWENTEEDN 564

Query: 1908 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2087
            I+KIF QLAA+KS   K  N  RQ++LC+L VF KM EL+IKPNVVTFSAILNACSRC+S
Sbjct: 565  IVKIFEQLAAQKSASGKKTNAERQDMLCILGVFHKMHELQIKPNVVTFSAILNACSRCSS 624

Query: 2088 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2267
            F++AS+LLEELR FDNQVYGVAHGLLMG+++ VW+QA  LF+EVKQMDS TASAFYNALT
Sbjct: 625  FDEASLLLEELRLFDNQVYGVAHGLLMGQREGVWSQALSLFNEVKQMDSSTASAFYNALT 684

Query: 2268 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2447
            DMLWHF QK+GAQLVVLEGKR  VWEN WS SCLDLHLMSSGAA AMVHAWLL IRSIVF
Sbjct: 685  DMLWHFDQKQGAQLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVF 744

Query: 2448 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2627
            EGHELPK++SILTGWGKHSK+ GDG L+RAIE LLT +GAPF++A CN+GRFIS GAVV 
Sbjct: 745  EGHELPKMLSILTGWGKHSKITGDGALKRAIEGLLTSIGAPFQIAKCNIGRFISTGAVVT 804

Query: 2628 AWLKESGTLKVLVLHDDRAHQETSTFDRIPNLRTLTL 2738
            AWL+ESGTL+VLVL DD +H   + FD+I NL+ LTL
Sbjct: 805  AWLRESGTLEVLVLQDDTSHLRATRFDQISNLQQLTL 841


>ref|XP_006444532.1| hypothetical protein CICLE_v10018807mg [Citrus clementina]
            gi|557546794|gb|ESR57772.1| hypothetical protein
            CICLE_v10018807mg [Citrus clementina]
          Length = 820

 Score = 1102 bits (2851), Expect = 0.0
 Identities = 578/798 (72%), Positives = 638/798 (79%), Gaps = 17/798 (2%)
 Frame = +3

Query: 132  MASSTPHCSITGTKPYQNHQFPQNPIXXXXXXXXXXXX----WAHRKVSLTKPSQVPHXX 299
            MAS+ PHCSIT TKPYQNHQ+P N +                W   KVSLTKP   P   
Sbjct: 1    MASTPPHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPR 60

Query: 300  XXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGRQK 479
                                   F SLS  P  +KSEL  DFSGRRSTRFVSKMHFGR K
Sbjct: 61   NAPKPAATSTTVAPNPKP-----FHSLSPLPS-SKSELAPDFSGRRSTRFVSKMHFGRPK 114

Query: 480  AQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWK 656
              +S+RHS VAEEAL        D+  L ++   FE K+ G+DDY F+LRELGNRGE  K
Sbjct: 115  IAMSTRHSVVAEEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSK 174

Query: 657  AVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSAL 836
            A+QCF FAV+RE R+N+QGKLASAMISILGRLGKVDLA+ +FE  L  GYGNTVYA+SAL
Sbjct: 175  AIQCFAFAVKREERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSAL 234

Query: 837  ISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNG 1016
            ISAYGRSG   EAI VF +MK   LKPNLVTYNAVIDACGKGGV+F    EI ++MLRNG
Sbjct: 235  ISAYGRSGYCQEAISVFNSMKRYHLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNG 294

Query: 1017 IQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAF 1196
            +QPDRITFNSLLAVCSRGGLWEAARN F+EMV+RGIDQDIFTYNTLLDA CKG QMDLAF
Sbjct: 295  VQPDRITFNSLLAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAF 354

Query: 1197 QIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIY 1376
            +IMAEM AKNISPNVVTYSTMIDGYAK GRLDDALN+  EMKF GIGLDRVSYNT+LSIY
Sbjct: 355  EIMAEMPAKNISPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIY 414

Query: 1377 AKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNL 1556
            AKLGRFE+ L VC EMESSGI+KDAVTYNALLGGYGKQGKYDEV+R+F++MK   +SPNL
Sbjct: 415  AKLGRFEEALLVCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNL 474

Query: 1557 LTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDE 1736
            LTYSTLIDVYSKGGLY+EA+++F+EFKQAGLKADVVLYSALIDALCKNGLVE AVSLLDE
Sbjct: 475  LTYSTLIDVYSKGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDE 534

Query: 1737 MTKEGIRPNVVTYNSIIDAFGRSATA-CGSNE------SLIEPSSLIAL-----KDVSES 1880
            MTKEGIRPNVVTYNSIIDAFGRSAT  C  ++         E ++L A+     KDV E+
Sbjct: 535  MTKEGIRPNVVTYNSIIDAFGRSATTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEA 594

Query: 1881 NDEHREDNQIIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAI 2060
                R DNQIIK+FGQL AEK+   K +NR RQEILC+L VFQKM +L+IKPNVVTFSAI
Sbjct: 595  G---RTDNQIIKVFGQLVAEKAGQGKKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAI 651

Query: 2061 LNACSRCNSFEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPT 2240
            LNACSRCNSFEDASMLLEELR FDNQVYGVAHGLLMG +DN+W QA  LFDEVK MDS T
Sbjct: 652  LNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSST 711

Query: 2241 ASAFYNALTDMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAW 2420
            ASAFYNALTDMLWHFGQK+GAQLVVLEGKRR VWEN+WS+SCLDLHLMSSGAARAMVHAW
Sbjct: 712  ASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAW 771

Query: 2421 LLDIRSIVFEGHELPKLI 2474
            LL+I SIVFEGHELPKL+
Sbjct: 772  LLNIHSIVFEGHELPKLL 789


>ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Capsella rubella]
            gi|482562350|gb|EOA26540.1| hypothetical protein
            CARUB_v10022597mg [Capsella rubella]
          Length = 932

 Score = 1100 bits (2846), Expect = 0.0
 Identities = 555/787 (70%), Positives = 648/787 (82%), Gaps = 12/787 (1%)
 Frame = +3

Query: 378  LSRAP-----PPNKSELTADFSGRRSTRFVSKMHFGRQKAQLSSRHSSVAEEALQESIRI 542
            LS+AP        KS+L++DFSGRRSTRFVSKMHFGR K  +++RHSS AE+ALQ +I  
Sbjct: 125  LSQAPNFAPLQTQKSDLSSDFSGRRSTRFVSKMHFGRPKTAMATRHSSAAEDALQNAIDF 184

Query: 543  N-DEKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQCFEFAVRRERRRNEQGKL 719
            + D +   ++ L FESK+ GSDD  +++RELGNRGEC KAV  +EFAV+RERR+NEQGKL
Sbjct: 185  SGDSEMFHSLMLSFESKLCGSDDCTYIIRELGNRGECDKAVGFYEFAVKRERRKNEQGKL 244

Query: 720  ASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAYGRSGCSDEAIKVFETMK 899
            ASAMIS LGR GKV +A+ +FE     GYGNTVYA+SALISAYGRSG  +EAI VF +MK
Sbjct: 245  ASAMISTLGRYGKVTIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFSSMK 304

Query: 900  NSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPDRITFNSLLAVCSRGGLW 1079
            + GL+PNLVTYNAVIDACGKGG+EF Q A+  +EM +NG+QPDRITFNSLLAVCSRGGLW
Sbjct: 305  DHGLRPNLVTYNAVIDACGKGGMEFKQVAKFFDEMQKNGVQPDRITFNSLLAVCSRGGLW 364

Query: 1080 EAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMAEMSAKNISPNVVTYSTM 1259
            EAARN F EM  R I+QD+F+YNTLLDA CKGGQMDLAF+I+A+M AK I PNVV+YST+
Sbjct: 365  EAARNLFDEMSNRRIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPAKRIMPNVVSYSTV 424

Query: 1260 IDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLGRFEDVLNVCNEMESSGI 1439
            IDG+AK GR D+ALNL GEM++ GI LDRVSYNTLLSIY K+GR E+ L++  EM S GI
Sbjct: 425  IDGFAKAGRFDEALNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGI 484

Query: 1440 KKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYSTLIDVYSKGGLYREALE 1619
            KKD VTYNALLGGYGKQGKYDEVK+VF EMK  H+ PNLLTYSTLID YSKGGLY+EA+E
Sbjct: 485  KKDVVTYNALLGGYGKQGKYDEVKKVFAEMKREHVVPNLLTYSTLIDGYSKGGLYKEAME 544

Query: 1620 VFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKEGIRPNVVTYNSIIDAFG 1799
            +F+EFK AGL+ADVVLYSALIDALCKNGLV  AVSL+DEMTKEGI PNVVTYNSIIDAFG
Sbjct: 545  IFREFKSAGLRADVVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFG 604

Query: 1800 RSATACGSNE-SLIEPSSL----IALKDVSESNDEHREDNQIIKIFGQLAAEKSCHAKID 1964
            RSAT   S + S  E ++L    +AL   + S     E N++I++FGQL AE +     D
Sbjct: 605  RSATMERSADYSNGEANNLEVGSLALSSSALSKLTETEGNRVIQLFGQLTAESNNRMTKD 664

Query: 1965 -NRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRWFDNQV 2141
               G QE+ C+LEVF+KM +LEIKPNVVTFSAILNACSRCNSFEDASMLLEELR FDN+V
Sbjct: 665  CKEGMQELSCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKV 724

Query: 2142 YGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALTDMLWHFGQKKGAQLVVLE 2321
            YGV HGLLMG ++NVW QAQ LFD+V +MD  TASAFYNALTDMLWHFGQK+GA+LV LE
Sbjct: 725  YGVVHGLLMGERENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALE 784

Query: 2322 GKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVFEGHELPKLISILTGWGKH 2501
            G+ R VWEN+WSDSCLDLHLMSSGAARAMVHAWLL+IRSIV+EGHELPK++SILTGWGKH
Sbjct: 785  GRSRQVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKH 844

Query: 2502 SKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVAAWLKESGTLKVLVLHDDR 2681
            SKVVGDG LRRA+E LL GM APF ++ CN+GRFIS G+VVA WL+ES TLK+L+LHD +
Sbjct: 845  SKVVGDGALRRAVEVLLRGMDAPFHLSKCNMGRFISSGSVVATWLRESATLKLLILHDHK 904

Query: 2682 AHQETST 2702
                 ST
Sbjct: 905  TTTTAST 911


>ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like isoform 2 [Solanum lycopersicum]
          Length = 829

 Score = 1100 bits (2844), Expect = 0.0
 Identities = 575/860 (66%), Positives = 667/860 (77%), Gaps = 8/860 (0%)
 Frame = +3

Query: 132  MASSTP--HCSITGTKPYQ----NHQFPQNPIXXXXXXXXXXXXWAHRKVSLTKPSQVPH 293
            MASSTP  HC++T +KPYQ    +H  P +              W+ +KVSL  P    H
Sbjct: 1    MASSTPPPHCALTTSKPYQPQTHSHPHPNH-------RNNHQRHWSSQKVSLNPPRNPNH 53

Query: 294  XXXXXXXXXXXXXXXXXXXXXXXPTFPSLSRAPPPNKSELTADFSGRRSTRFVSKMHFGR 473
                                   P F SLS +    KS+ +ADFSGRRSTRFVSKMHFGR
Sbjct: 54   PSQT-------------------PNFLSLSSS----KSDFSADFSGRRSTRFVSKMHFGR 90

Query: 474  QKAQLSSRHSSVAEEALQESIRI-NDEKGLENVFLCFESKMSGSDDYIFVLRELGNRGEC 650
             K   + RHSS A+EAL+E+IR  N+E GL+ V L F SK+ GSDDY F+ RELGNRGE 
Sbjct: 91   AKISGNGRHSSFAQEALEEAIRCCNNEAGLDQVLLTFGSKLVGSDDYTFLFRELGNRGEW 150

Query: 651  WKAVQCFEFAVRRERRRNEQGKLASAMISILGRLGKVDLARGVFEIGLKAGYGNTVYAYS 830
              A++CF+FAV RER+RNEQGKLAS+MISILGR GKVDLA  VFE  +  GYG+TVYAYS
Sbjct: 151  LAAMRCFQFAVGRERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGSTVYAYS 210

Query: 831  ALISAYGRSGCSDEAIKVFETMKNSGLKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLR 1010
            ALISAY +SG  +EAI+VFETMK+SGLKPNLVTYNA+IDACGKGG +F +A+EI +EMLR
Sbjct: 211  ALISAYAKSGYCNEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLR 270

Query: 1011 NGIQPDRITFNSLLAVCSRGGLWEAARNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDL 1190
            NG+QPDRITFNSLLAVCS  GLWE AR  F+EM+YRGIDQDI+TYNT LD  C GGQ+D+
Sbjct: 271  NGVQPDRITFNSLLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDVACNGGQIDV 330

Query: 1191 AFQIMAEMSAKNISPNVVTYSTMIDGYAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLS 1370
            AF IM+EM AKNI PN VTYST+I G AK GRLD AL+L  EMK AGI LDRVSYNTLL+
Sbjct: 331  AFDIMSEMHAKNILPNQVTYSTVIRGCAKAGRLDKALSLFNEMKCAGIKLDRVSYNTLLA 390

Query: 1371 IYAKLGRFEDVLNVCNEMESSGIKKDAVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSP 1550
            IYA LG+FE+ LNV  EME  GIKKD VTYNALL G+GKQG Y +VK++F EMK   LSP
Sbjct: 391  IYASLGKFEEALNVSKEMEGMGIKKDVVTYNALLDGFGKQGMYTKVKQLFAEMKAEKLSP 450

Query: 1551 NLLTYSTLIDVYSKGGLYREALEVFKEFKQAGLKADVVLYSALIDALCKNGLVEIAVSLL 1730
            NLLTYSTLI VY KG LY +A+EV+KEFK+ GLKADVV YS LIDALCK GLVE +  LL
Sbjct: 451  NLLTYSTLISVYLKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLL 510

Query: 1731 DEMTKEGIRPNVVTYNSIIDAFGRSA-TACGSNESLIEPSSLIALKDVSESNDEHREDNQ 1907
            +EMTKEGI+PNVVTYNSII+AFG SA   CGS+      S+      +S+S  E+ E++ 
Sbjct: 511  NEMTKEGIQPNVVTYNSIINAFGESANNECGSDNVTHIVSA------ISQSKWENTEEDN 564

Query: 1908 IIKIFGQLAAEKSCHAKIDNRGRQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNS 2087
            I+KIF QLAA+KS   K  N  RQ++LC+L VF KM EL+IKPNVVTFSAILNACSRC+S
Sbjct: 565  IVKIFEQLAAQKSASGKKTNAERQDMLCILGVFHKMHELQIKPNVVTFSAILNACSRCSS 624

Query: 2088 FEDASMLLEELRWFDNQVYGVAHGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALT 2267
            F++AS+LLEELR FDNQVYGVAHGLLMG+++ VW+QA  LF+EVKQMDS TASAFYNALT
Sbjct: 625  FDEASLLLEELRLFDNQVYGVAHGLLMGQREGVWSQALSLFNEVKQMDSSTASAFYNALT 684

Query: 2268 DMLWHFGQKKGAQLVVLEGKRRHVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVF 2447
            DMLWHF QK+GAQLVVLEGKR  VWEN WS SCLDLHLMSSGAA AMVHAWLL IRSIVF
Sbjct: 685  DMLWHFDQKQGAQLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVF 744

Query: 2448 EGHELPKLISILTGWGKHSKVVGDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVA 2627
            EGHELPK++SILTGWGKHSK+ GDG L+RAIE LLT +GAPF++A CN+GRFIS GAVV 
Sbjct: 745  EGHELPKMLSILTGWGKHSKITGDGALKRAIEGLLTSIGAPFQIAKCNIGRFISTGAVVT 804

Query: 2628 AWLKESGTLKVLVLHDDRAH 2687
            AWL+ESGTL+VLVL DD +H
Sbjct: 805  AWLRESGTLEVLVLQDDTSH 824


>ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidopsis thaliana]
            gi|75206083|sp|Q9SIC9.1|PP178_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g31400, chloroplastic; Flags: Precursor
            gi|4589961|gb|AAD26479.1| unknown protein [Arabidopsis
            thaliana] gi|330253448|gb|AEC08542.1| genomes uncoupled 1
            protein [Arabidopsis thaliana]
          Length = 918

 Score = 1097 bits (2836), Expect = 0.0
 Identities = 554/783 (70%), Positives = 645/783 (82%), Gaps = 13/783 (1%)
 Frame = +3

Query: 393  PPN-------KSELTADFSGRRSTRFVSKMHFGRQKAQLSSRHSSVAEEALQESIRIN-D 548
            PPN       KS+L++DFSGRRSTRFVSKMHFGRQK  +++RHSS AE+ALQ +I  + D
Sbjct: 119  PPNFSPLQTPKSDLSSDFSGRRSTRFVSKMHFGRQKTTMATRHSSAAEDALQNAIDFSGD 178

Query: 549  EKGLENVFLCFESKMSGSDDYIFVLRELGNRGECWKAVQCFEFAVRRERRRNEQGKLASA 728
            ++   ++ L FESK+ GSDD  +++RELGNR EC KAV  +EFAV+RERR+NEQGKLASA
Sbjct: 179  DEMFHSLMLSFESKLCGSDDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASA 238

Query: 729  MISILGRLGKVDLARGVFEIGLKAGYGNTVYAYSALISAYGRSGCSDEAIKVFETMKNSG 908
            MIS LGR GKV +A+ +FE     GYGNTVYA+SALISAYGRSG  +EAI VF +MK  G
Sbjct: 239  MISTLGRYGKVTIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYG 298

Query: 909  LKPNLVTYNAVIDACGKGGVEFNQAAEILNEMLRNGIQPDRITFNSLLAVCSRGGLWEAA 1088
            L+PNLVTYNAVIDACGKGG+EF Q A+  +EM RNG+QPDRITFNSLLAVCSRGGLWEAA
Sbjct: 299  LRPNLVTYNAVIDACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAA 358

Query: 1089 RNFFSEMVYRGIDQDIFTYNTLLDAFCKGGQMDLAFQIMAEMSAKNISPNVVTYSTMIDG 1268
            RN F EM  R I+QD+F+YNTLLDA CKGGQMDLAF+I+A+M  K I PNVV+YST+IDG
Sbjct: 359  RNLFDEMTNRRIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDG 418

Query: 1269 YAKVGRLDDALNLLGEMKFAGIGLDRVSYNTLLSIYAKLGRFEDVLNVCNEMESSGIKKD 1448
            +AK GR D+ALNL GEM++ GI LDRVSYNTLLSIY K+GR E+ L++  EM S GIKKD
Sbjct: 419  FAKAGRFDEALNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKD 478

Query: 1449 AVTYNALLGGYGKQGKYDEVKRVFKEMKTRHLSPNLLTYSTLIDVYSKGGLYREALEVFK 1628
             VTYNALLGGYGKQGKYDEVK+VF EMK  H+ PNLLTYSTLID YSKGGLY+EA+E+F+
Sbjct: 479  VVTYNALLGGYGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFR 538

Query: 1629 EFKQAGLKADVVLYSALIDALCKNGLVEIAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSA 1808
            EFK AGL+ADVVLYSALIDALCKNGLV  AVSL+DEMTKEGI PNVVTYNSIIDAFGRSA
Sbjct: 539  EFKSAGLRADVVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSA 598

Query: 1809 T----ACGSNESLIEPSSLIALKDVSESNDEHREDNQIIKIFGQLAAEKSCHAKID-NRG 1973
            T    A  SN   + P S  AL  ++E+     E N++I++FGQL  E +     D   G
Sbjct: 599  TMDRSADYSNGGSL-PFSSSALSALTET-----EGNRVIQLFGQLTTESNNRTTKDCEEG 652

Query: 1974 RQEILCVLEVFQKMQELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRWFDNQVYGVA 2153
             QE+ C+LEVF+KM +LEIKPNVVTFSAILNACSRCNSFEDASMLLEELR FDN+VYGV 
Sbjct: 653  MQELSCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVV 712

Query: 2154 HGLLMGRKDNVWAQAQCLFDEVKQMDSPTASAFYNALTDMLWHFGQKKGAQLVVLEGKRR 2333
            HGLLMG+++NVW QAQ LFD+V +MD  TASAFYNALTDMLWHFGQK+GA+LV LEG+ R
Sbjct: 713  HGLLMGQRENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSR 772

Query: 2334 HVWENMWSDSCLDLHLMSSGAARAMVHAWLLDIRSIVFEGHELPKLISILTGWGKHSKVV 2513
             VWEN+WSDSCLDLHLMSSGAARAMVHAWLL+IRSIV+EGHELPK++SILTGWGKHSKVV
Sbjct: 773  QVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVV 832

Query: 2514 GDGTLRRAIEALLTGMGAPFRVANCNLGRFISPGAVVAAWLKESGTLKVLVLHDDRAHQE 2693
            GDG LRRA+E LL GM APF ++ CN+GRF S G+VVA WL+ES TLK+L+LHD   H  
Sbjct: 833  GDGALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHD---HIT 889

Query: 2694 TST 2702
            T+T
Sbjct: 890  TAT 892

