BLASTX nr result

ID: Atropa21_contig00024551

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00024551
         (3202 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006344988.1| PREDICTED: pentatricopeptide repeat-containi...  1627   0.0  
ref|XP_004236160.1| PREDICTED: pentatricopeptide repeat-containi...  1621   0.0  
ref|XP_002280557.1| PREDICTED: pentatricopeptide repeat-containi...  1313   0.0  
ref|XP_004301287.1| PREDICTED: pentatricopeptide repeat-containi...  1262   0.0  
gb|EMJ11568.1| hypothetical protein PRUPE_ppa001337mg [Prunus pe...  1260   0.0  
gb|EOY20555.1| Plastid transcriptionally active 2 isoform 1 [The...  1260   0.0  
ref|XP_006439718.1| hypothetical protein CICLE_v10018817mg [Citr...  1258   0.0  
ref|XP_006476695.1| PREDICTED: pentatricopeptide repeat-containi...  1255   0.0  
gb|EXB29767.1| hypothetical protein L484_008930 [Morus notabilis]    1236   0.0  
ref|XP_003525484.1| PREDICTED: pentatricopeptide repeat-containi...  1236   0.0  
ref|XP_006579551.1| PREDICTED: pentatricopeptide repeat-containi...  1231   0.0  
ref|XP_003549648.1| PREDICTED: pentatricopeptide repeat-containi...  1225   0.0  
ref|XP_004508810.1| PREDICTED: pentatricopeptide repeat-containi...  1224   0.0  
ref|XP_002322139.2| hypothetical protein POPTR_0015s08030g [Popu...  1222   0.0  
ref|XP_006600662.1| PREDICTED: pentatricopeptide repeat-containi...  1221   0.0  
ref|XP_004157803.1| PREDICTED: pentatricopeptide repeat-containi...  1215   0.0  
ref|XP_004152453.1| PREDICTED: pentatricopeptide repeat-containi...  1213   0.0  
ref|XP_006300609.1| hypothetical protein CARUB_v10019779mg [Caps...  1209   0.0  
ref|NP_177623.1| plastid transcriptionally active 2 [Arabidopsis...  1203   0.0  
ref|XP_006390383.1| hypothetical protein EUTSA_v10018112mg [Eutr...  1202   0.0  

>ref|XP_006344988.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Solanum tuberosum]
          Length = 860

 Score = 1627 bits (4212), Expect = 0.0
 Identities = 811/860 (94%), Positives = 830/860 (96%), Gaps = 1/860 (0%)
 Frame = -2

Query: 3054 MSLSYNSFSPVLTPAPTSHRFLFPSKIPNPYKLSVIHRRLLLTVAVRAKPKDLILGNPTV 2875
            MSLSYN+FS VLTP P SHRFLFP+KIPN YKL   HRRLLLTVAVRAKPKDLILGNPTV
Sbjct: 1    MSLSYNTFSQVLTPVPPSHRFLFPTKIPNYYKLPGFHRRLLLTVAVRAKPKDLILGNPTV 60

Query: 2874 TVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSL 2695
            TVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSL+DFSLVFKEFAARGDWQRSL
Sbjct: 61   TVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLSDFSLVFKEFAARGDWQRSL 120

Query: 2694 RLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINA 2515
            RLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKA+E+FDEM THSVARTVFSYT+IINA
Sbjct: 121  RLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAFEIFDEMSTHSVARTVFSYTAIINA 180

Query: 2514 YGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQP 2335
            YGRNGQYETSLQLLEKMKQE IVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEG QP
Sbjct: 181  YGRNGQYETSLQLLEKMKQENIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGIQP 240

Query: 2334 DLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELL 2155
            DLVTYNTLLSACSSR LEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELL
Sbjct: 241  DLVTYNTLLSACSSRELEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELL 300

Query: 2154 MEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKN 1975
            MEME GGTSPEVTSYNVLLEAYA LGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKN
Sbjct: 301  MEMEAGGTSPEVTSYNVLLEAYAHLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKN 360

Query: 1974 GRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETY 1795
            GRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETY
Sbjct: 361  GRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETY 420

Query: 1794 EGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNE 1615
            EGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVY  VIEAYGQAALYEEAVVAFNTMNE
Sbjct: 421  EGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYTAVIEAYGQAALYEEAVVAFNTMNE 480

Query: 1614 VGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEE 1435
            VGSRPMVETFNSLIH FAKGGLYKESEAIWFRMGEVGVPRNRDSFNG+IEGYRQGGQFEE
Sbjct: 481  VGSRPMVETFNSLIHTFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGLIEGYRQGGQFEE 540

Query: 1434 AVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLA 1255
            A+K+YVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLA
Sbjct: 541  AIKAYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLA 600

Query: 1254 IYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCE 1075
            IYAKSERWDMARELLN VMTNKTSDMHQ+IG+MIHGD DDENNWQMVEYVFDKLKSEGC 
Sbjct: 601  IYAKSERWDMARELLNDVMTNKTSDMHQIIGRMIHGDFDDENNWQMVEYVFDKLKSEGCG 660

Query: 1074 LSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACT 895
            LSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACT
Sbjct: 661  LSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACT 720

Query: 894  ALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFF 715
            A+SVWLNDMEELFHKGEELPQLAS+VVVRGQ EKSS+TRDLPVAKAAYSFLKDTVSSSF 
Sbjct: 721  AISVWLNDMEELFHKGEELPQLASIVVVRGQTEKSSVTRDLPVAKAAYSFLKDTVSSSFS 780

Query: 714  FPGWNKGRIVCQKTQLKRTF-SAEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSES 538
            FPGWNKGRIVCQ+TQLKRTF SAEPS+EASK D LIPLSN+PISLLG QTS S AKRSES
Sbjct: 781  FPGWNKGRIVCQRTQLKRTFSSAEPSAEASKGDRLIPLSNSPISLLGTQTSMSDAKRSES 840

Query: 537  VNADTERSTKSDSELMASSV 478
             NAD+ERST+ D ELMASSV
Sbjct: 841  ANADSERSTRPDPELMASSV 860


>ref|XP_004236160.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Solanum lycopersicum]
          Length = 860

 Score = 1621 bits (4198), Expect = 0.0
 Identities = 807/860 (93%), Positives = 829/860 (96%), Gaps = 1/860 (0%)
 Frame = -2

Query: 3054 MSLSYNSFSPVLTPAPTSHRFLFPSKIPNPYKLSVIHRRLLLTVAVRAKPKDLILGNPTV 2875
            MSLSYN+FS VLTP P SHR+LFP+KIPN YKL  +HRRLLLTVAVRAKPKDLILGNPTV
Sbjct: 1    MSLSYNTFSQVLTPVPPSHRYLFPAKIPNYYKLPGLHRRLLLTVAVRAKPKDLILGNPTV 60

Query: 2874 TVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSL 2695
            TVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSL
Sbjct: 61   TVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSL 120

Query: 2694 RLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINA 2515
            RLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKA+E+FDEM TH+VARTVFSYT+IIN+
Sbjct: 121  RLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAFEIFDEMSTHNVARTVFSYTAIINS 180

Query: 2514 YGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQP 2335
            YGRNGQYETSLQLLEKMKQE IVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEG QP
Sbjct: 181  YGRNGQYETSLQLLEKMKQENIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGIQP 240

Query: 2334 DLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELL 2155
            DLVTYNTLLSACSSR LEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELL
Sbjct: 241  DLVTYNTLLSACSSRELEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELL 300

Query: 2154 MEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKN 1975
            MEME GGTSPEVTSYNVLLEAYA LGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKN
Sbjct: 301  MEMEAGGTSPEVTSYNVLLEAYAHLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKN 360

Query: 1974 GRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETY 1795
            GRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETY
Sbjct: 361  GRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETY 420

Query: 1794 EGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNE 1615
            EGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVY  VIEAYGQAALYEEAVVAFNTMNE
Sbjct: 421  EGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYTAVIEAYGQAALYEEAVVAFNTMNE 480

Query: 1614 VGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEE 1435
            VGSRP+VETFNSLIH FAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEE
Sbjct: 481  VGSRPVVETFNSLIHTFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEE 540

Query: 1434 AVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLA 1255
            A+K+YVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLA
Sbjct: 541  AIKAYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLA 600

Query: 1254 IYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCE 1075
            IYAKSERWDMARELLN VMTNKTSDMHQ+IG+MIHGD DDENNWQMVEYVFDKLKSEGC 
Sbjct: 601  IYAKSERWDMARELLNDVMTNKTSDMHQIIGRMIHGDFDDENNWQMVEYVFDKLKSEGCG 660

Query: 1074 LSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACT 895
            LSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACT
Sbjct: 661  LSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACT 720

Query: 894  ALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFF 715
            A+S+WLNDMEELFHKGEELPQLAS+VVVRGQ EKSS+TRDLPVAKAAYSFLKDT+SSSF 
Sbjct: 721  AISIWLNDMEELFHKGEELPQLASIVVVRGQTEKSSVTRDLPVAKAAYSFLKDTISSSFS 780

Query: 714  FPGWNKGRIVCQKTQLKRTF-SAEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSES 538
            FPGWNKGRIVCQKTQLKRTF SAEPS EASK D LIPLSN+ ISLLG QTS S AKRSES
Sbjct: 781  FPGWNKGRIVCQKTQLKRTFSSAEPSVEASKGDRLIPLSNSLISLLGTQTSMSVAKRSES 840

Query: 537  VNADTERSTKSDSELMASSV 478
            VNAD+ERST+ D ELM SSV
Sbjct: 841  VNADSERSTRPDPELMTSSV 860


>ref|XP_002280557.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic [Vitis vinifera]
          Length = 869

 Score = 1313 bits (3399), Expect = 0.0
 Identities = 645/850 (75%), Positives = 743/850 (87%), Gaps = 2/850 (0%)
 Frame = -2

Query: 3021 LTPAPTSHRFLFPSKIPNPYKLSVIHRRLLLTVA-VRAKPKDLILGNPTVTVEKGKYSYD 2845
            L P P  +R LFP+K  + +     ++R+L + A +RAKPK+L+LGNP+VTVEKGKYSYD
Sbjct: 25   LRPNPNLNRHLFPAKATDFFG----YQRILASAARIRAKPKELVLGNPSVTVEKGKYSYD 80

Query: 2844 VETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSLRLFKYMQRQI 2665
            VETLINKLSSLPPRGSIARCLD FKNKLSL DF+LVFKEFA RGDWQRSLRLFKYMQRQI
Sbjct: 81   VETLINKLSSLPPRGSIARCLDVFKNKLSLNDFALVFKEFAQRGDWQRSLRLFKYMQRQI 140

Query: 2664 WCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINAYGRNGQYETS 2485
            WCKPNEHIYT+MIG+LGREGLL+K  E+FDEMP+H VA +VFS+T++INAYGRNGQY++S
Sbjct: 141  WCKPNEHIYTIMIGVLGREGLLEKCQEIFDEMPSHGVAPSVFSFTALINAYGRNGQYKSS 200

Query: 2484 LQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQPDLVTYNTLLS 2305
            L+LL++MK+E++ PSILTYNTVINSCARGG +WE LLGLFA+MRHEG Q D+VTYNTLLS
Sbjct: 201  LELLDRMKKERVSPSILTYNTVINSCARGGLDWEELLGLFAQMRHEGIQADIVTYNTLLS 260

Query: 2304 ACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELLMEMEVGGTSP 2125
            AC+ RGL DEAEMVFRTMNE G+LPD+TTYSYLVETFGKL +LEKVSELL EME GG+ P
Sbjct: 261  ACARRGLGDEAEMVFRTMNEGGILPDITTYSYLVETFGKLNRLEKVSELLKEMESGGSFP 320

Query: 2124 EVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKNGRYDQVRELF 1945
            ++TSYNVLLEA+A+ GS+KEAM VFRQMQ AGCV NA TYSILLNLYG++GRYD VR+LF
Sbjct: 321  DITSYNVLLEAHAQSGSIKEAMGVFRQMQGAGCVPNAATYSILLNLYGRHGRYDDVRDLF 380

Query: 1944 LEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETYEGLIYACGKG 1765
            LEMK SNTEP+A TYNILI VFGEGGYFKEVVTLFHDMVEE VEPNMETYEGLI+ACGKG
Sbjct: 381  LEMKVSNTEPNAATYNILINVFGEGGYFKEVVTLFHDMVEENVEPNMETYEGLIFACGKG 440

Query: 1764 GLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNEVGSRPMVETF 1585
            GLHEDAK+ILLHMN +G+VPSSK Y GVIEAYGQAALYEEA+VAFNTMNEVGS+P VET+
Sbjct: 441  GLHEDAKKILLHMNEKGVVPSSKAYTGVIEAYGQAALYEEALVAFNTMNEVGSKPTVETY 500

Query: 1584 NSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEEAVKSYVEMEK 1405
            NSLI  FAKGGLYKESEAI  +MG+ GV RNRD+FNG+IE +RQGGQFEEA+K+YVEMEK
Sbjct: 501  NSLIQMFAKGGLYKESEAILLKMGQSGVARNRDTFNGVIEAFRQGGQFEEAIKAYVEMEK 560

Query: 1404 ARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLAIYAKSERWDM 1225
            ARCDPDE+TLEAVLSVYCFAGLV+ESEEQF EIK+LGI PS++C CMMLA+YAK++RWD 
Sbjct: 561  ARCDPDEQTLEAVLSVYCFAGLVEESEEQFGEIKALGILPSVMCYCMMLAVYAKADRWDD 620

Query: 1224 ARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCELSMRFYNTLI 1045
            A +LL+ + TN+ S++HQVIGQMI GD DD++NWQMVEYVF+KLKSEGC L +RFYNTL+
Sbjct: 621  AHQLLDEMFTNRVSNIHQVIGQMIRGDYDDDSNWQMVEYVFEKLKSEGCSLGVRFYNTLL 680

Query: 1044 EALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACTALSVWLNDME 865
            EALWWLGQKERA RVLNEATKRGLFPELFR+NKLVWSVDVHRMW G ACTA+SVWLN+M 
Sbjct: 681  EALWWLGQKERATRVLNEATKRGLFPELFRKNKLVWSVDVHRMWEGAACTAISVWLNNMH 740

Query: 864  ELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFFFPGWNKGRIV 685
            E+F  G++LPQLAS VVVRG MEKSSITRD PVAK+AY+FL + VSSSF FPGWNKGRIV
Sbjct: 741  EMFISGDDLPQLASAVVVRGHMEKSSITRDFPVAKSAYAFLNE-VSSSFCFPGWNKGRIV 799

Query: 684  CQKTQLKRTFS-AEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSESVNADTERSTK 508
            CQ++QLKR  S  E  S+  K D +I LSN+P  L G  TS S  KR +  NAD ERS  
Sbjct: 800  CQRSQLKRILSVTEQHSDEYKKDRIITLSNSPFPLPGTNTSMSNVKRDQLSNADAERSIM 859

Query: 507  SDSELMASSV 478
            + +ELM S+V
Sbjct: 860  TRTELMTSTV 869


>ref|XP_004301287.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 862

 Score = 1262 bits (3266), Expect = 0.0
 Identities = 623/863 (72%), Positives = 727/863 (84%), Gaps = 10/863 (1%)
 Frame = -2

Query: 3036 SFSPVLT---PAPTSHRFLFPSKIPNPYKLSVI--HRRLL----LTVAVRAKPKDLILGN 2884
            + SP L+   P+P S     P   P P+ LS +  HR+ +    L+ +VRAKPKDLILGN
Sbjct: 2    TLSPTLSISRPSPLSAPI--PKLNPKPHHLSFLSGHRKFIHGQRLSFSVRAKPKDLILGN 59

Query: 2883 PTVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQ 2704
            P+VTVEKGKYSYDVETLINKLSSLPPRGSIARCLD FKNKLSL DF+LVFKEFAARGDWQ
Sbjct: 60   PSVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAARGDWQ 119

Query: 2703 RSLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSI 2524
            RSLRLFKYMQRQIWCKP+EHIYT+MI +LGREGLLDK  E+FDEMPT  V R+VFSYT++
Sbjct: 120  RSLRLFKYMQRQIWCKPSEHIYTIMISLLGREGLLDKCAEIFDEMPTQGVIRSVFSYTAL 179

Query: 2523 INAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEG 2344
            INAYGRNGQ+E SLQLL++MK++K+ P+ILTYNTV+N+CARGG +WEGLLGLFAEMRHEG
Sbjct: 180  INAYGRNGQFEMSLQLLDRMKKDKVSPNILTYNTVLNACARGGLDWEGLLGLFAEMRHEG 239

Query: 2343 TQPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVS 2164
             QPDLVTYNTLLSAC+ RGL DEAEMVFRTMNE G++PD+TTYSYLVETFGKL  LEKVS
Sbjct: 240  VQPDLVTYNTLLSACAGRGLGDEAEMVFRTMNEGGIVPDITTYSYLVETFGKLNNLEKVS 299

Query: 2163 ELLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLY 1984
            ELL  ME GG  P++TSYNVLLEAYA+LGS+KEAM VFRQMQ AGC+ANA TYSILLNLY
Sbjct: 300  ELLKGMESGGNLPDITSYNVLLEAYAQLGSIKEAMGVFRQMQEAGCMANAATYSILLNLY 359

Query: 1983 GKNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNM 1804
            G+ GRYD VRELFLEMK SN EPDA TYNILIQVFGEGGYF+EVVTLFHDMVEE +EPNM
Sbjct: 360  GRLGRYDDVRELFLEMKVSNAEPDAATYNILIQVFGEGGYFREVVTLFHDMVEENIEPNM 419

Query: 1803 ETYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNT 1624
            ETYEGLIYACGKGGLHEDAK ILLHMN +G+VPSSK Y G IEAYGQAALY+EA+VAFNT
Sbjct: 420  ETYEGLIYACGKGGLHEDAKNILLHMNEKGIVPSSKAYTGAIEAYGQAALYDEALVAFNT 479

Query: 1623 MNEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQ 1444
            MNEVGS P VE+FNSLIHA+A+GGLYKE+E +   MGE G+  N  SFNGMIE +RQGGQ
Sbjct: 480  MNEVGSSPSVESFNSLIHAYARGGLYKETEQVLSIMGEFGIAINASSFNGMIEAFRQGGQ 539

Query: 1443 FEEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCM 1264
            FEEA+K+YVEMEK RCDPDE TLEAVLSVY  AGLV+E EE F+EIK+ GI PS++C CM
Sbjct: 540  FEEAIKTYVEMEKRRCDPDECTLEAVLSVYSVAGLVNECEEHFEEIKASGILPSVMCYCM 599

Query: 1263 MLAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSE 1084
            MLA+YAK++RWD A +LLN ++TN+ S++HQV+GQMI GD DDE+NWQMVEYVFDKLKSE
Sbjct: 600  MLAVYAKTDRWDDANKLLNEMLTNRVSNIHQVMGQMIKGDYDDESNWQMVEYVFDKLKSE 659

Query: 1083 GCELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGG 904
            GC L MRFYNTLIEALWWLGQK+RA RVL+EAT+RGLFPEL R+NKLVWS+DVHRMW GG
Sbjct: 660  GCGLGMRFYNTLIEALWWLGQKQRAVRVLSEATQRGLFPELLRKNKLVWSIDVHRMWEGG 719

Query: 903  ACTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSS 724
            A  A+SVWLNDM E+F  GE+LP +A+VVVVRG+MEKSS T+DLPVAKAAYSFL+D +S 
Sbjct: 720  AYAAMSVWLNDMYEMFLNGEDLPHVATVVVVRGKMEKSSTTQDLPVAKAAYSFLQDNMSG 779

Query: 723  SFFFPGWNKGRIVCQKTQLKRTFSA-EPSSEASKCDILIPLSNTPISLLGKQTSTSAAKR 547
            +F FP WN GRI+CQ++QLK+  S+ EPS++ S    +  LSN+P    G + S +    
Sbjct: 780  AFNFPKWNNGRILCQRSQLKKLLSSIEPSTDGSSSKSICILSNSPFPPPGTKISPTDVDS 839

Query: 546  SESVNADTERSTKSDSELMASSV 478
                   ++ ++++ +EL+ S+V
Sbjct: 840  GRYNGTSSDATSRTRTELLTSTV 862


>gb|EMJ11568.1| hypothetical protein PRUPE_ppa001337mg [Prunus persica]
          Length = 850

 Score = 1260 bits (3261), Expect = 0.0
 Identities = 619/840 (73%), Positives = 719/840 (85%), Gaps = 5/840 (0%)
 Frame = -2

Query: 2982 SKIPNPYKLSVI----HRRLLLTVAVRAKPKDLILGNPTVTVEKGKYSYDVETLINKLSS 2815
            S  P P  LS +    H   ++T    + PKDLILGNP+VTVEKGKYSYDVETLINKLSS
Sbjct: 11   SSSPLPASLSNLKPKSHHLSVVTKTPDSSPKDLILGNPSVTVEKGKYSYDVETLINKLSS 70

Query: 2814 LPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYT 2635
            LPPRGSIARCLD FKNKLSL DF+LVFKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYT
Sbjct: 71   LPPRGSIARCLDIFKNKLSLNDFALVFKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYT 130

Query: 2634 LMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINAYGRNGQYETSLQLLEKMKQE 2455
            +MI +LGREGLLDK  EVFD+MP+  V R+VFSYT++INAYGRNGQYETSLQ L++MK++
Sbjct: 131  IMISLLGREGLLDKCSEVFDDMPSQGVVRSVFSYTALINAYGRNGQYETSLQFLDRMKKD 190

Query: 2454 KIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQPDLVTYNTLLSACSSRGLEDE 2275
            K+ PSILTYNTV+N+CARGG EWEGLLGLFAEMRHEG QPDLVTYNTLLSAC+ RGL DE
Sbjct: 191  KVSPSILTYNTVLNACARGGLEWEGLLGLFAEMRHEGIQPDLVTYNTLLSACAGRGLGDE 250

Query: 2274 AEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELLMEMEVGGTSPEVTSYNVLLE 2095
            AEMVFRTMNE G++PD+TTY YLVETFGKL KLEKVSELL EME GG  P++TSYNVLLE
Sbjct: 251  AEMVFRTMNEGGIVPDITTYRYLVETFGKLDKLEKVSELLKEMESGGNLPDITSYNVLLE 310

Query: 2094 AYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKNGRYDQVRELFLEMKTSNTEP 1915
            AYA+LGS++E+M VFRQMQAAGC+ NA TYSILLNLYG++GRYD VRELFLEMK SNTEP
Sbjct: 311  AYAQLGSIRESMGVFRQMQAAGCMPNAATYSILLNLYGRHGRYDDVRELFLEMKISNTEP 370

Query: 1914 DADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETYEGLIYACGKGGLHEDAKRIL 1735
            D  TYNILIQVFGEGGYFKEVVTLFHDMVEE +EPNMETYEGLIYACGKGGLHEDAK IL
Sbjct: 371  DPATYNILIQVFGEGGYFKEVVTLFHDMVEENIEPNMETYEGLIYACGKGGLHEDAKNIL 430

Query: 1734 LHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNEVGSRPMVETFNSLIHAFAKG 1555
            LHM+ +G+VPSSK Y GVIEAYGQAALY+EA+VAFNTMNEVGS+P VE++NSLI+AFA+G
Sbjct: 431  LHMSEKGIVPSSKAYTGVIEAYGQAALYDEALVAFNTMNEVGSKPSVESYNSLIYAFARG 490

Query: 1554 GLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEEAVKSYVEMEKARCDPDERTL 1375
            GLY+E+EA+   MGEVG  RN  +FNGMIE +RQGGQFEEA+K+YVEMEK RCD DE TL
Sbjct: 491  GLYRETEAVLSIMGEVGAARNVHTFNGMIEAFRQGGQFEEAIKAYVEMEKRRCDHDEWTL 550

Query: 1374 EAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLAIYAKSERWDMARELLNGVMT 1195
            EAVLSVYC AGLV+E EE FQE+K+ GI PS++C CMMLA+YA+++RWD A ELLN ++T
Sbjct: 551  EAVLSVYCVAGLVNECEEHFQEMKASGILPSVMCYCMMLAVYARNDRWDDANELLNEMLT 610

Query: 1194 NKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCELSMRFYNTLIEALWWLGQKE 1015
            N+ S++HQVIGQMI GD DD++NWQMVEYVFDKLKSEGC L MRFYNTL+EALWWLGQK+
Sbjct: 611  NRASNIHQVIGQMIKGDYDDDSNWQMVEYVFDKLKSEGCGLGMRFYNTLLEALWWLGQKQ 670

Query: 1014 RAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACTALSVWLNDMEELFHKGEELP 835
            RA RVLNEAT+RGLFPELFR+NKLV SVDVHRMW GGA  A+SVWLN+M E+F  GE+LP
Sbjct: 671  RAVRVLNEATQRGLFPELFRKNKLVGSVDVHRMWQGGAYAAMSVWLNNMYEMFLNGEDLP 730

Query: 834  QLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFFFPGWNKGRIVCQKTQLKRTF 655
             +A+VVVVRG+MEKSS+T+DLP+AKAAYSFL+D + SSF FP WNKGRI+CQ+ QLKR  
Sbjct: 731  NIATVVVVRGKMEKSSMTQDLPIAKAAYSFLEDNMPSSFSFPKWNKGRILCQRPQLKRIL 790

Query: 654  SA-EPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSESVNADTERSTKSDSELMASSV 478
            S+ EPS++ S+   +I LSN+    LG +TS+         +  ++   +  +EL+ S+V
Sbjct: 791  SSIEPSTDGSERKKIITLSNSLFPPLGTKTSSKDVNSGRYNDVTSDERLRIRTELLTSAV 850


>gb|EOY20555.1| Plastid transcriptionally active 2 isoform 1 [Theobroma cacao]
          Length = 859

 Score = 1260 bits (3260), Expect = 0.0
 Identities = 610/813 (75%), Positives = 708/813 (87%), Gaps = 1/813 (0%)
 Frame = -2

Query: 2916 RAKPKDLILGNPTVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLV 2737
            RAKP++L+LGNP+VTVEKGKYSYDVETLINKLSSLPPRGSIARCLD F+NKLSL DF+LV
Sbjct: 47   RAKPRELVLGNPSVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDVFRNKLSLNDFALV 106

Query: 2736 FKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHS 2557
            FKEFA RGDWQRSLRLFKYMQRQIWCKPNEHIYT+MI +LGREGLL+K  EVFDEMP+  
Sbjct: 107  FKEFAHRGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCREVFDEMPSQG 166

Query: 2556 VARTVFSYTSIINAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGL 2377
            V R+VF+YT++INAYGRNG Y  SL+LL+KMK++K++PSILTYNTVIN+CARGG +WEGL
Sbjct: 167  VTRSVFAYTALINAYGRNGAYNISLELLDKMKKDKVLPSILTYNTVINACARGGLDWEGL 226

Query: 2376 LGLFAEMRHEGTQPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVET 2197
            LGLFAEMRHEG QPD+VTYNTLLSAC++RGL +EAEMVFRTMNE G+LPD+TTYSYLVE+
Sbjct: 227  LGLFAEMRHEGIQPDIVTYNTLLSACANRGLGNEAEMVFRTMNEGGILPDLTTYSYLVES 286

Query: 2196 FGKLGKLEKVSELLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVAN 2017
            FGKLGKLEKVSELL EME GG  P++ SYNVLLEAYA+ GS+KEAM VF+QMQ AGC  N
Sbjct: 287  FGKLGKLEKVSELLKEMESGGNLPDIMSYNVLLEAYAKSGSIKEAMGVFKQMQVAGCAPN 346

Query: 2016 AETYSILLNLYGKNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFH 1837
            A TYSILLNLYG+NGRYD VRELFLEMK SNTEPDA TYNILIQVFGEGGYFKEVVTLFH
Sbjct: 347  ATTYSILLNLYGRNGRYDDVRELFLEMKESNTEPDAATYNILIQVFGEGGYFKEVVTLFH 406

Query: 1836 DMVEEKVEPNMETYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAA 1657
            DMVEE +EPN++TY+GLI+ACGKGGLHEDAK+ILLHMN + +VPSS+ Y GVIEAYGQAA
Sbjct: 407  DMVEENIEPNVKTYDGLIFACGKGGLHEDAKKILLHMNEKCIVPSSRAYTGVIEAYGQAA 466

Query: 1656 LYEEAVVAFNTMNEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFN 1477
            LYEE +VAFNTMNEV S P +ET+NSL+  FA+GGLYKE+ AI  RM E GV +NRDSFN
Sbjct: 467  LYEEVLVAFNTMNEVESNPTIETYNSLLQTFARGGLYKEANAILSRMNETGVAKNRDSFN 526

Query: 1476 GMIEGYRQGGQFEEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSL 1297
             +IE +RQGGQFE+A+K+YVEMEKARCDPDERTLEAVLSVYCFAGLVDES EQFQEIK+L
Sbjct: 527  ALIEAFRQGGQFEDAIKAYVEMEKARCDPDERTLEAVLSVYCFAGLVDESNEQFQEIKAL 586

Query: 1296 GIQPSIICCCMMLAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQM 1117
            G+ PS++C CMMLA+YAK +RWD A +L + ++TNK S++HQVIG+MI GD DD+ NWQM
Sbjct: 587  GVLPSVMCYCMMLAVYAKCDRWDDAYQLFDEMLTNKVSNIHQVIGKMIRGDYDDDANWQM 646

Query: 1116 VEYVFDKLKSEGCELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVW 937
            VEYVFDKL SEGC   +RFYN L+EALWWL QKERAARVLNEATKRGLFPELFR+NKLVW
Sbjct: 647  VEYVFDKLNSEGCGFGIRFYNALLEALWWLRQKERAARVLNEATKRGLFPELFRKNKLVW 706

Query: 936  SVDVHRMWPGGACTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKA 757
            SVDVHRMW GG  TA+S+WLN M+++F  G++LPQLA+VVV RGQMEKSSI RD+P AKA
Sbjct: 707  SVDVHRMWEGGTYTAVSIWLNSMQKMFLSGDDLPQLATVVVARGQMEKSSIARDIPTAKA 766

Query: 756  AYSFLKDTVSSSFFFPGWNKGRIVCQKTQLKRTFSAE-PSSEASKCDILIPLSNTPISLL 580
            AY+FL+D VSSSF FPGWNKGRIVCQ++QLKR  SA   SS+ SK D +I LSN PI  +
Sbjct: 767  AYTFLQDIVSSSFSFPGWNKGRIVCQRSQLKRILSATGSSSDESKADNIIALSNFPIPSM 826

Query: 579  GKQTSTSAAKRSESVNADTERSTKSDSELMASS 481
            G ++S    + ++  NA +E   +  +ELMA +
Sbjct: 827  GVKSSPGDVEYTQHDNAISETKMRR-TELMAGT 858


>ref|XP_006439718.1| hypothetical protein CICLE_v10018817mg [Citrus clementina]
            gi|557541980|gb|ESR52958.1| hypothetical protein
            CICLE_v10018817mg [Citrus clementina]
          Length = 871

 Score = 1258 bits (3256), Expect = 0.0
 Identities = 627/846 (74%), Positives = 721/846 (85%), Gaps = 4/846 (0%)
 Frame = -2

Query: 3003 SHRFLFPS-KIPNPYKLSVIHRRLLL--TVAVRAKPKDLILGNPTVTVEKGKYSYDVETL 2833
            +H FL  + ++P   ++    RR L   T+ VRAKPK+L+LG+PTVTVEKGKYSYDVETL
Sbjct: 26   NHSFLSGNNELPCTQRIFTSGRRSLTSGTLQVRAKPKELVLGSPTVTVEKGKYSYDVETL 85

Query: 2832 INKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSLRLFKYMQRQIWCKP 2653
            INKLSSLPPRGSIARCLD FKNKLSL DF+LVFKEFA RGDWQRSLRLFKYMQRQIWCKP
Sbjct: 86   INKLSSLPPRGSIARCLDMFKNKLSLNDFALVFKEFAQRGDWQRSLRLFKYMQRQIWCKP 145

Query: 2652 NEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINAYGRNGQYETSLQLL 2473
            +E IYT+MI +LGRE LLDKA EVF+EMP+  VAR+VFSYT++INAYGR+GQYETSL+LL
Sbjct: 146  SEQIYTIMISLLGRENLLDKASEVFEEMPSQGVARSVFSYTALINAYGRHGQYETSLELL 205

Query: 2472 EKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQPDLVTYNTLLSACSS 2293
            ++MK+EKI P+ILTYNTVIN+C RGG +WE LLGLFAEMRHEG QPD+VTYNTLLSAC  
Sbjct: 206  DRMKREKIAPNILTYNTVINACVRGGLDWEDLLGLFAEMRHEGIQPDIVTYNTLLSACGG 265

Query: 2292 RGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELLMEMEVGGTSPEVTS 2113
            RGL DEAEMVFRTMNE GVLPD+TT+SYLVETFGKLGKLEKVSELL EME GG  P+VT 
Sbjct: 266  RGLGDEAEMVFRTMNEGGVLPDLTTFSYLVETFGKLGKLEKVSELLREMESGGNLPDVTC 325

Query: 2112 YNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKNGRYDQVRELFLEMK 1933
            YNVLLEA+A++GS+KEAMDVFRQMQAAG VANA TYSILLNLYG+NGRYD VRELFLEMK
Sbjct: 326  YNVLLEAHAKMGSIKEAMDVFRQMQAAGSVANATTYSILLNLYGRNGRYDDVRELFLEMK 385

Query: 1932 TSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETYEGLIYACGKGGLHE 1753
             SNTEP+A TYNILIQVFGEGGYFKEVVTLFHDMVEE VEPNMETYEGLI+ACGKGGLHE
Sbjct: 386  ASNTEPNAATYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETYEGLIFACGKGGLHE 445

Query: 1752 DAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNEVGSRPMVETFNSLI 1573
            D K+ILL+MN +G VPSSK Y GVIEAYG AALYEEA+VAFNTMNEV S+P +ET+NSL+
Sbjct: 446  DVKKILLYMNERGTVPSSKAYTGVIEAYGLAALYEEALVAFNTMNEVESKPTIETYNSLL 505

Query: 1572 HAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEEAVKSYVEMEKARCD 1393
            H FA+GGLYKE +AI  RM E GV RN DSFN +IE +RQGG+FEEA+K+YVEMEK RCD
Sbjct: 506  HTFARGGLYKECQAILSRMSESGVARNSDSFNAVIEAFRQGGRFEEAIKAYVEMEKVRCD 565

Query: 1392 PDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLAIYAKSERWDMAREL 1213
            P+ERTLEAVLSVYCFAGLVDES+EQFQEIKS GI PS++C CM+LA+YAKS RWD A  L
Sbjct: 566  PNERTLEAVLSVYCFAGLVDESKEQFQEIKSSGILPSVMCYCMLLAVYAKSNRWDDAYGL 625

Query: 1212 LNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCELSMRFYNTLIEALW 1033
            L+ + TN+ S++HQV GQMI G+ DDE+NWQMVEYVFDKL  EG  L MRFYN L+EALW
Sbjct: 626  LDEMHTNRISNIHQVTGQMIKGEFDDESNWQMVEYVFDKLNCEGYGLGMRFYNALMEALW 685

Query: 1032 WLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACTALSVWLNDMEELFH 853
             LGQ+ERAARVL+EATKRGLFPELFR NKLVWSVDVHRMW GGA TA+SVWLN M E+F 
Sbjct: 686  CLGQRERAARVLDEATKRGLFPELFRHNKLVWSVDVHRMWEGGAYTAISVWLNKMYEMFM 745

Query: 852  KGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFFFPGWNKGRIVCQKT 673
             GE+LPQLA+VVVVRGQME++S T DLP+AKAAY+FL++  SS F FP WNKGRI+CQ+T
Sbjct: 746  MGEDLPQLATVVVVRGQMERTSTTEDLPIAKAAYTFLQENASSLFSFPQWNKGRIICQRT 805

Query: 672  QLKRTFSA-EPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSESVNADTERSTKSDSE 496
            QLKR  S  E SS+ SK D +I LSN+P S   ++ ST+  +     NA++E    + +E
Sbjct: 806  QLKRILSGRESSSDGSKKDNIISLSNSPFSPPDRKASTTGVRNGLFDNANSETKMSASTE 865

Query: 495  LMASSV 478
            LM S++
Sbjct: 866  LMTSTL 871


>ref|XP_006476695.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Citrus sinensis]
          Length = 871

 Score = 1255 bits (3247), Expect = 0.0
 Identities = 626/846 (73%), Positives = 720/846 (85%), Gaps = 4/846 (0%)
 Frame = -2

Query: 3003 SHRFLFPS-KIPNPYKLSVIHRRLLL--TVAVRAKPKDLILGNPTVTVEKGKYSYDVETL 2833
            +H FL  + ++P   ++    RR L   TV VRAKPK+L+LG+PTVTVEKGKYSYDVETL
Sbjct: 26   NHSFLSGNNELPCTQRIFTSRRRSLTSGTVQVRAKPKELVLGSPTVTVEKGKYSYDVETL 85

Query: 2832 INKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSLRLFKYMQRQIWCKP 2653
            INKLSSLPPRGSIARCLD FKNKLSL DF+LVFKEFA RGDWQRSLRLFKYMQRQIWCKP
Sbjct: 86   INKLSSLPPRGSIARCLDMFKNKLSLNDFALVFKEFAQRGDWQRSLRLFKYMQRQIWCKP 145

Query: 2652 NEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINAYGRNGQYETSLQLL 2473
            +E IYT+MI +LGRE LLDKA EVF+EMP+  V R+VFSYT++INAYGR+GQYETSL+LL
Sbjct: 146  SEQIYTIMISLLGRENLLDKASEVFEEMPSQGVPRSVFSYTALINAYGRHGQYETSLELL 205

Query: 2472 EKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQPDLVTYNTLLSACSS 2293
            ++MK+EKI P+ILTYNTVIN+C RGG +WE LLGLFAEMRHEG QPD+VTYNTLLSAC S
Sbjct: 206  DRMKREKIAPNILTYNTVINACVRGGLDWEDLLGLFAEMRHEGIQPDIVTYNTLLSACGS 265

Query: 2292 RGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELLMEMEVGGTSPEVTS 2113
            RGL DEAEMVFRTMNE GVLPD+TT+SYLVETFGKLGKLEKVSELL EME GG  P+VT 
Sbjct: 266  RGLGDEAEMVFRTMNEGGVLPDLTTFSYLVETFGKLGKLEKVSELLREMESGGNLPDVTC 325

Query: 2112 YNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKNGRYDQVRELFLEMK 1933
            YNVLLEA+A++GS+KEAMDVFRQMQAAG VANA TYSILLNLYG+NGRYD VRELFLEMK
Sbjct: 326  YNVLLEAHAKMGSIKEAMDVFRQMQAAGSVANATTYSILLNLYGRNGRYDDVRELFLEMK 385

Query: 1932 TSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETYEGLIYACGKGGLHE 1753
             SNTEP+A TYNILIQVFGEGGYFKEVVTLFHDMVEE VEPNMETYEGLI+ACGKGGLHE
Sbjct: 386  ASNTEPNAATYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETYEGLIFACGKGGLHE 445

Query: 1752 DAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNEVGSRPMVETFNSLI 1573
            D K+ILL+MN +G VPSSK Y GVIEAYG AALYEEA+VAFNTMNEV S+P +ET+NSL+
Sbjct: 446  DVKKILLYMNERGTVPSSKAYTGVIEAYGLAALYEEALVAFNTMNEVESKPTIETYNSLL 505

Query: 1572 HAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEEAVKSYVEMEKARCD 1393
            H F++GGLYKE +AI  RM E GV RN DSFN +IE +RQGG+FEEA+K+YVEMEK RCD
Sbjct: 506  HTFSRGGLYKECQAILSRMSESGVARNSDSFNAVIEAFRQGGRFEEAIKAYVEMEKVRCD 565

Query: 1392 PDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLAIYAKSERWDMAREL 1213
            P+ERTLEAVLSVYCFAGLVDES+EQFQEIKS GI PS++C CM+LA+YAKS RWD A  L
Sbjct: 566  PNERTLEAVLSVYCFAGLVDESKEQFQEIKSSGILPSVMCYCMLLAVYAKSNRWDDAYGL 625

Query: 1212 LNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCELSMRFYNTLIEALW 1033
            L+ + TN+ S++HQV GQMI G+ DDE+NWQMVEYVFDKL  EG  L MRFYN L+EALW
Sbjct: 626  LDEMYTNRISNIHQVTGQMIKGEFDDESNWQMVEYVFDKLNCEGYGLGMRFYNALLEALW 685

Query: 1032 WLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACTALSVWLNDMEELFH 853
             LG +ERAARVL+EATKRGLFPELFR NKLVWSVDVHRMW GGA TA+SVWLN M E+F 
Sbjct: 686  CLGLRERAARVLDEATKRGLFPELFRHNKLVWSVDVHRMWEGGAYTAISVWLNKMYEMFM 745

Query: 852  KGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFFFPGWNKGRIVCQKT 673
             GE+LPQLA+VVVVRG+ME++S T DLPVAKAAY+FL++  SS F FP WNKGRI+CQ+T
Sbjct: 746  MGEDLPQLATVVVVRGRMERTSTTEDLPVAKAAYTFLQENASSLFNFPQWNKGRIICQRT 805

Query: 672  QLKRTFSA-EPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSESVNADTERSTKSDSE 496
            QLKR  S  E SS+ SK D +I LSN+P S   ++ ST+  +     NA++E    + +E
Sbjct: 806  QLKRILSGRESSSDGSKKDNIISLSNSPFSPPDRKASTTGLRNGLFDNANSETKMSASTE 865

Query: 495  LMASSV 478
            LM S++
Sbjct: 866  LMTSTL 871


>gb|EXB29767.1| hypothetical protein L484_008930 [Morus notabilis]
          Length = 905

 Score = 1236 bits (3199), Expect = 0.0
 Identities = 626/905 (69%), Positives = 728/905 (80%), Gaps = 43/905 (4%)
 Frame = -2

Query: 3063 LTKMSLSYNSFSPV----LTPAPTSHRFLFPSKIPNPYK----LSVIHRRLLL------- 2929
            L   S+S  S SP+    + P+P  HR  F ++  +  +    LS   RR  L       
Sbjct: 3    LAAASMSIPSASPLPATLVKPSPLPHRLSFLTRTSDSLEQKRFLSSDRRREKLLTFLSGE 62

Query: 2928 --TVAVRAKPKDLILGNPTVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSL 2755
              + +VRAKPK++ILGNP VTVEKGKYSYDVETLINKLSSLPPRGSIARCLD FKNKLSL
Sbjct: 63   RRSFSVRAKPKEVILGNPAVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNKLSL 122

Query: 2754 TDFSLVFKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFD 2575
             DF+LVFKEFA RGDWQRSLRLFKYMQRQIWCKPNEHIYT+MI +LGREGLLDK+ E+FD
Sbjct: 123  NDFALVFKEFAQRGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKSAEIFD 182

Query: 2574 EMPTHSVARTVFSYTSIINAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGG 2395
            EMP+  V R+VFSYT++INAYGRNGQYETSLQLL++MK++K+ P+ILTYNTVIN+CARGG
Sbjct: 183  EMPSQGVVRSVFSYTALINAYGRNGQYETSLQLLDRMKKDKVSPNILTYNTVINACARGG 242

Query: 2394 YEWEGLLGLFAEMRHEGTQPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTY 2215
             +WEGLLGLFAEMRHEG QPDLVTYNTLL AC++RGL DEAEMVFRTMNE G++PD+TTY
Sbjct: 243  LDWEGLLGLFAEMRHEGIQPDLVTYNTLLGACANRGLGDEAEMVFRTMNEGGIVPDITTY 302

Query: 2214 SYLVETFGKLGKLEKVSELLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQA 2035
            S LVETFGKLGKLEKVSELL EME  G  P++TSYNVLLEAYA  GS+ EA+ VFRQMQ 
Sbjct: 303  SCLVETFGKLGKLEKVSELLKEMESRGNLPDITSYNVLLEAYAESGSISEAVGVFRQMQT 362

Query: 2034 AGCVANAETYSILLNLYGKNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKE 1855
            AGC+ NA TYSILLNLYGK GRY+ VRELFLEMK SNTEPDA TYNILIQVFGEGGYFKE
Sbjct: 363  AGCLPNANTYSILLNLYGKQGRYEDVRELFLEMKVSNTEPDAATYNILIQVFGEGGYFKE 422

Query: 1854 VVTLFHDMVEEKVEPNMETYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIE 1675
            VVTLFHDMVEE VEPNMETYEGLI ACGKGGLH DAK IL HMN +G+VPSSKVY GVIE
Sbjct: 423  VVTLFHDMVEENVEPNMETYEGLIIACGKGGLHGDAKIILNHMNEKGIVPSSKVYTGVIE 482

Query: 1674 AYGQAALYEEAVVAFNTMNEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPR 1495
            AYGQAALYEEA+VAFNTMNEVGSRP VET+NSLIHAF++GGLYKE+EAI  RMG   V R
Sbjct: 483  AYGQAALYEEALVAFNTMNEVGSRPSVETYNSLIHAFSRGGLYKEAEAILQRMGNSAVAR 542

Query: 1494 NRDSFNGMIEGYRQGGQFEEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQF 1315
            N D FN +IE +RQGGQ EEAVK+Y+EM K+RCDPDERTLEA+LSVYCFAGLVDE EE F
Sbjct: 543  NVDLFNSLIEAFRQGGQIEEAVKAYIEMGKSRCDPDERTLEALLSVYCFAGLVDECEEHF 602

Query: 1314 QEIKSLGIQPSIICCCMMLAIYAKSE-------------------------RWDMARELL 1210
            +EIK+ GI PS++C C MLA+YA+ +                         RWD A +LL
Sbjct: 603  KEIKASGILPSVMCYCTMLAVYARCDRIDRTLPQTLFYPNPPVPLDRWHRVRWDDAFKLL 662

Query: 1209 NGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCELSMRFYNTLIEALWW 1030
            + ++ NK S++HQVI QMI GD DD  NWQMVEYVFDKL SEGC L +RFYNTL+EALWW
Sbjct: 663  DEMLKNKASNIHQVIAQMIKGDYDDGTNWQMVEYVFDKLNSEGCGLGIRFYNTLLEALWW 722

Query: 1029 LGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACTALSVWLNDMEELFHK 850
            +GQKERA RVLNEATKRGLFPELFRRNKLVWS+DVHRMW GGACTA+SVWLNDM  +F  
Sbjct: 723  MGQKERAVRVLNEATKRGLFPELFRRNKLVWSIDVHRMWEGGACTAISVWLNDMFGMFKN 782

Query: 849  GEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFFFPGWNKGRIVCQKTQ 670
            G++LP +A+VVVVRG+ME+S   ++ P+AKA+YSFL++ + SSF FP WNKGRIVCQ++Q
Sbjct: 783  GDDLPHVATVVVVRGKMERSPSAQETPIAKASYSFLQENMFSSFGFPTWNKGRIVCQRSQ 842

Query: 669  LKRTFSA-EPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSESVNADTERSTKSDSEL 493
            LK+  S  E SSE SK D +I LSN+P+   G +  T+  + S   N++++  T + +EL
Sbjct: 843  LKQVLSGIESSSEKSKKDKIITLSNSPVP--GTKMPTNVMQSSRYNNSNSDAVTGTRAEL 900

Query: 492  MASSV 478
            + S+V
Sbjct: 901  LTSTV 905


>ref|XP_003525484.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 857

 Score = 1236 bits (3197), Expect = 0.0
 Identities = 616/862 (71%), Positives = 726/862 (84%), Gaps = 2/862 (0%)
 Frame = -2

Query: 3057 KMSLSYNSFSP-VLTPAPTSHRFLFPSKIPNPYKLSVIHRRLLLTVAVRAKPKDLILGNP 2881
            KM+L+ + FSP +LTPA T  + LF +  P+P       +R LL  A  AKP  LI  NP
Sbjct: 4    KMTLTLSPFSPTLLTPATTLRQLLFTNFTPSP-------KRRLLLQARAAKPNVLIPINP 56

Query: 2880 TVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQR 2701
            +VTVEKGKYSYDVETLIN+L++LPPRGSIARCLD FKNKLSL DF+LVFKEFA RGDWQR
Sbjct: 57   SVTVEKGKYSYDVETLINRLTALPPRGSIARCLDPFKNKLSLNDFALVFKEFAQRGDWQR 116

Query: 2700 SLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSII 2521
            SLRLFKYMQRQIWCKPNEHI+T+MI +LGREGLLDK  EVFDEMP++ V RTV+SYT+II
Sbjct: 117  SLRLFKYMQRQIWCKPNEHIHTIMITLLGREGLLDKCREVFDEMPSNGVVRTVYSYTAII 176

Query: 2520 NAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGT 2341
            NAYGRNGQ+  SL+LL  MKQE++ PSILTYNTVIN+CARGG +WEGLLGLFAEMRHEG 
Sbjct: 177  NAYGRNGQFHASLELLNGMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGI 236

Query: 2340 QPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSE 2161
            QPD++TYNTLL AC+ RGL DEAEMVFRTMNE+G++PD+ TYSYLV+TFGKL +LEKVSE
Sbjct: 237  QPDVITYNTLLGACAHRGLGDEAEMVFRTMNESGIVPDINTYSYLVQTFGKLNRLEKVSE 296

Query: 2160 LLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYG 1981
            LL EME GG  P++TSYNVLLEAYA LGS+KEAM VFRQMQAAGCVANA TYS+LLNLYG
Sbjct: 297  LLREMECGGNLPDITSYNVLLEAYAELGSIKEAMGVFRQMQAAGCVANAATYSVLLNLYG 356

Query: 1980 KNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNME 1801
            K+GRYD VR+LFLEMK SNT+PDA TYNILIQVFGEGGYFKEVVTLFHDM EE VEPNM+
Sbjct: 357  KHGRYDDVRDLFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMAEENVEPNMQ 416

Query: 1800 TYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTM 1621
            TYEGLI+ACGKGGL+EDAK+ILLHMN +G+VPSSK Y GVIEA+GQAALYEEA+V FNTM
Sbjct: 417  TYEGLIFACGKGGLYEDAKKILLHMNEKGVVPSSKAYTGVIEAFGQAALYEEALVMFNTM 476

Query: 1620 NEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQF 1441
            NEVGS P VET+NSLIHAFA+GGLYKE+EAI  RM E G+ R+  SFNG+IE +RQGGQ+
Sbjct: 477  NEVGSNPTVETYNSLIHAFARGGLYKEAEAILSRMNESGLKRDVHSFNGVIEAFRQGGQY 536

Query: 1440 EEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMM 1261
            EEAVKSYVEMEKA C+P+E TLEAVLS+YC AGLVDE EEQFQEIK+ GI PS++C CMM
Sbjct: 537  EEAVKSYVEMEKANCEPNELTLEAVLSIYCSAGLVDEGEEQFQEIKASGILPSVMCYCMM 596

Query: 1260 LAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEG 1081
            LA+YAK++R + A  L++ ++T + SD+HQVIGQMI GD DDE+NWQ+VEYVFDKL SEG
Sbjct: 597  LALYAKNDRLNDAYNLIDAMITMRVSDIHQVIGQMIKGDFDDESNWQIVEYVFDKLNSEG 656

Query: 1080 CELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGA 901
            C L MRFYN L+EALW + Q+ERAARVLNEA+KRGLFPELFR++KLVWSVDVHRM  GGA
Sbjct: 657  CGLGMRFYNALLEALWCMFQRERAARVLNEASKRGLFPELFRKSKLVWSVDVHRMSEGGA 716

Query: 900  CTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSS 721
             TALSVWLN++ E+   G++LP++A+VVVVRG MEK++  +D P+AKAA SFL+D V SS
Sbjct: 717  LTALSVWLNNVHEMSMTGDDLPEVATVVVVRGHMEKTTDAQDFPIAKAAISFLQDNVPSS 776

Query: 720  FFFPGWNKGRIVCQKTQLKRTFS-AEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRS 544
            F FPGWNKGRIVCQ++QL+R  S  E SS   K D LI LSNTP++  G  TS S A+  
Sbjct: 777  FAFPGWNKGRIVCQQSQLRRILSGTESSSSRKKMDKLISLSNTPLTTAGAITSKSDAQSG 836

Query: 543  ESVNADTERSTKSDSELMASSV 478
            ++   D+ R+  + +EL+ S++
Sbjct: 837  KANGVDS-RTDSTRTELLTSAI 857


>ref|XP_006579551.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like isoform X2 [Glycine max]
          Length = 858

 Score = 1231 bits (3185), Expect = 0.0
 Identities = 616/863 (71%), Positives = 726/863 (84%), Gaps = 3/863 (0%)
 Frame = -2

Query: 3057 KMSLSYNSFSP-VLTPAPTSHRFLFPSKIPNPYKLSVIHRRLLLTVAVRAKPKDLILGNP 2881
            KM+L+ + FSP +LTPA T  + LF +  P+P       +R LL  A  AKP  LI  NP
Sbjct: 4    KMTLTLSPFSPTLLTPATTLRQLLFTNFTPSP-------KRRLLLQARAAKPNVLIPINP 56

Query: 2880 TVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQR 2701
            +VTVEKGKYSYDVETLIN+L++LPPRGSIARCLD FKNKLSL DF+LVFKEFA RGDWQR
Sbjct: 57   SVTVEKGKYSYDVETLINRLTALPPRGSIARCLDPFKNKLSLNDFALVFKEFAQRGDWQR 116

Query: 2700 SLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSII 2521
            SLRLFKYMQRQIWCKPNEHI+T+MI +LGREGLLDK  EVFDEMP++ V RTV+SYT+II
Sbjct: 117  SLRLFKYMQRQIWCKPNEHIHTIMITLLGREGLLDKCREVFDEMPSNGVVRTVYSYTAII 176

Query: 2520 NAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGT 2341
            NAYGRNGQ+  SL+LL  MKQE++ PSILTYNTVIN+CARGG +WEGLLGLFAEMRHEG 
Sbjct: 177  NAYGRNGQFHASLELLNGMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGI 236

Query: 2340 QPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSE 2161
            QPD++TYNTLL AC+ RGL DEAEMVFRTMNE+G++PD+ TYSYLV+TFGKL +LEKVSE
Sbjct: 237  QPDVITYNTLLGACAHRGLGDEAEMVFRTMNESGIVPDINTYSYLVQTFGKLNRLEKVSE 296

Query: 2160 LLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYG 1981
            LL EME GG  P++TSYNVLLEAYA LGS+KEAM VFRQMQAAGCVANA TYS+LLNLYG
Sbjct: 297  LLREMECGGNLPDITSYNVLLEAYAELGSIKEAMGVFRQMQAAGCVANAATYSVLLNLYG 356

Query: 1980 KNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNME 1801
            K+GRYD VR+LFLEMK SNT+PDA TYNILIQVFGEGGYFKEVVTLFHDM EE VEPNM+
Sbjct: 357  KHGRYDDVRDLFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMAEENVEPNMQ 416

Query: 1800 TYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTM 1621
            TYEGLI+ACGKGGL+EDAK+ILLHMN +G+VPSSK Y GVIEA+GQAALYEEA+V FNTM
Sbjct: 417  TYEGLIFACGKGGLYEDAKKILLHMNEKGVVPSSKAYTGVIEAFGQAALYEEALVMFNTM 476

Query: 1620 NEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQF 1441
            NEVGS P VET+NSLIHAFA+GGLYKE+EAI  RM E G+ R+  SFNG+IE +RQGGQ+
Sbjct: 477  NEVGSNPTVETYNSLIHAFARGGLYKEAEAILSRMNESGLKRDVHSFNGVIEAFRQGGQY 536

Query: 1440 EEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMM 1261
            EEAVKSYVEMEKA C+P+E TLEAVLS+YC AGLVDE EEQFQEIK+ GI PS++C CMM
Sbjct: 537  EEAVKSYVEMEKANCEPNELTLEAVLSIYCSAGLVDEGEEQFQEIKASGILPSVMCYCMM 596

Query: 1260 LAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEG 1081
            LA+YAK++R + A  L++ ++T + SD+HQVIGQMI GD DDE+NWQ+VEYVFDKL SEG
Sbjct: 597  LALYAKNDRLNDAYNLIDAMITMRVSDIHQVIGQMIKGDFDDESNWQIVEYVFDKLNSEG 656

Query: 1080 CELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGA 901
            C L MRFYN L+EALW + Q+ERAARVLNEA+KRGLFPELFR++KLVWSVDVHRM  GGA
Sbjct: 657  CGLGMRFYNALLEALWCMFQRERAARVLNEASKRGLFPELFRKSKLVWSVDVHRMSEGGA 716

Query: 900  CTALSVWLNDMEELFHKGEELPQLASVVVV-RGQMEKSSITRDLPVAKAAYSFLKDTVSS 724
             TALSVWLN++ E+   G++LP++A+VVVV RG MEK++  +D P+AKAA SFL+D V S
Sbjct: 717  LTALSVWLNNVHEMSMTGDDLPEVATVVVVSRGHMEKTTDAQDFPIAKAAISFLQDNVPS 776

Query: 723  SFFFPGWNKGRIVCQKTQLKRTFS-AEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKR 547
            SF FPGWNKGRIVCQ++QL+R  S  E SS   K D LI LSNTP++  G  TS S A+ 
Sbjct: 777  SFAFPGWNKGRIVCQQSQLRRILSGTESSSSRKKMDKLISLSNTPLTTAGAITSKSDAQS 836

Query: 546  SESVNADTERSTKSDSELMASSV 478
             ++   D+ R+  + +EL+ S++
Sbjct: 837  GKANGVDS-RTDSTRTELLTSAI 858


>ref|XP_003549648.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 859

 Score = 1225 bits (3170), Expect = 0.0
 Identities = 611/860 (71%), Positives = 716/860 (83%), Gaps = 2/860 (0%)
 Frame = -2

Query: 3051 SLSYNSFSPV-LTPAPTSHRFLFPSKIPNPYKLSVIHRRLLLTVAVRAKPKDLILGNPTV 2875
            SLS    SP  LTP  T  +  F +  P+P       RR L   A   KP  LI  NP+V
Sbjct: 8    SLSVPHPSPFSLTPTTTLRQLFFTNFTPSP-------RRRLQLQARAGKPNVLIPINPSV 60

Query: 2874 TVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSL 2695
             VEKGKYSYDVETLIN++++LPPRGSIARCLD FKNKLSL DF+LVFKEFA RGDWQRSL
Sbjct: 61   AVEKGKYSYDVETLINRITALPPRGSIARCLDPFKNKLSLNDFALVFKEFAQRGDWQRSL 120

Query: 2694 RLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINA 2515
            RLFKYMQRQIWCKPNEHIYT+MI +LGREGLLDK  EVFDEMP++ VARTV+ YT++INA
Sbjct: 121  RLFKYMQRQIWCKPNEHIYTIMITLLGREGLLDKCREVFDEMPSNGVARTVYVYTAVINA 180

Query: 2514 YGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQP 2335
            YGRNGQ+  SL+LL  MKQE++ PSILTYNTVIN+CARGG +WEGLLGLFAEMRHEG QP
Sbjct: 181  YGRNGQFHASLELLNGMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQP 240

Query: 2334 DLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELL 2155
            D++TYNTLL AC+ RGL DEAEMVFRTMNE+G++PD+ TYSYLV+TFGKL +LEKVSELL
Sbjct: 241  DVITYNTLLGACAHRGLGDEAEMVFRTMNESGIVPDINTYSYLVQTFGKLNRLEKVSELL 300

Query: 2154 MEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKN 1975
             EME GG  P++TSYNVLLEAYA LGS+KEAMDVFRQMQAAGCVANA TYS+LLNLYGK+
Sbjct: 301  REMESGGNLPDITSYNVLLEAYAELGSIKEAMDVFRQMQAAGCVANAATYSVLLNLYGKH 360

Query: 1974 GRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETY 1795
            GRYD VR++FLEMK SNT+PDA TYNILIQVFGEGGYFKEVVTLFHDMVEE VEPNMETY
Sbjct: 361  GRYDDVRDIFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETY 420

Query: 1794 EGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNE 1615
            EGLI+ACGKGGL+EDAK+ILLHMN +G+VPSSK Y GVIEA+GQAALYEEA+V FNTMNE
Sbjct: 421  EGLIFACGKGGLYEDAKKILLHMNEKGIVPSSKAYTGVIEAFGQAALYEEALVVFNTMNE 480

Query: 1614 VGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEE 1435
            VGS P VET+NS IHAFA+GGLYKE+EAI  RM E G+ R+  SFNG+I+ +RQGGQ+EE
Sbjct: 481  VGSNPTVETYNSFIHAFARGGLYKEAEAILSRMNESGLKRDVHSFNGVIKAFRQGGQYEE 540

Query: 1434 AVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLA 1255
            AVKSYVEMEKA C+P+E TLE VLSVYC AGLVDESEEQFQEIK+ GI PS++C C+MLA
Sbjct: 541  AVKSYVEMEKANCEPNELTLEVVLSVYCSAGLVDESEEQFQEIKASGILPSVMCYCLMLA 600

Query: 1254 IYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCE 1075
            +YAK++R + A  L++ ++T + SD+HQ IGQMI GD DDE+NWQ+VEYVFDKL SEGC 
Sbjct: 601  LYAKNDRLNDAYNLIDEMITMRVSDIHQGIGQMIKGDFDDESNWQIVEYVFDKLNSEGCG 660

Query: 1074 LSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACT 895
            L MRFYN L+EALWW+ Q+ERAARVLNEA+KRGLFPELFR++KLVWSVDVHRM  GGA T
Sbjct: 661  LGMRFYNALLEALWWMFQRERAARVLNEASKRGLFPELFRKSKLVWSVDVHRMSEGGALT 720

Query: 894  ALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFF 715
            ALSVWLN+M E+   G +LP+LA+VVVVRG MEKS+  +D P+AKAA SFL+D V SSF 
Sbjct: 721  ALSVWLNNMHEMSRTGNDLPELATVVVVRGHMEKSTEAQDFPIAKAAISFLQDNVPSSFT 780

Query: 714  FPGWNKGRIVCQKTQLKRTFS-AEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSES 538
            FPGWNKGRIVCQ++QL+R  S  E SS   K D L+ LSNTP++  G  TS S  +  ++
Sbjct: 781  FPGWNKGRIVCQQSQLRRILSGTESSSSRKKMDKLVSLSNTPLTTAGVITSKSDVQSGKA 840

Query: 537  VNADTERSTKSDSELMASSV 478
             + D+ R+  + +EL+ S++
Sbjct: 841  NDVDS-RTDSTRTELLTSAI 859


>ref|XP_004508810.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Cicer arietinum]
          Length = 861

 Score = 1224 bits (3168), Expect = 0.0
 Identities = 609/854 (71%), Positives = 717/854 (83%), Gaps = 2/854 (0%)
 Frame = -2

Query: 3033 FSPVLTPAPTSHRFL-FPSKIPNPYKLSVIHRRLLLTVAVRAKPKDLILGNPTVTVEKGK 2857
            F P L  + T+ R L FP     P      H+   L    RAKP++LILGNP+VTVE GK
Sbjct: 17   FIPTLLDSNTTFRQLTFPISTTKPQ-----HK---LQFKARAKPRELILGNPSVTVESGK 68

Query: 2856 YSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSLRLFKYM 2677
            YSYDVETLIN+LSSLPPRGSIARCLD+FKNKLSL DFS+VFKEFA RGDWQRSLRLFKYM
Sbjct: 69   YSYDVETLINRLSSLPPRGSIARCLDSFKNKLSLNDFSVVFKEFAQRGDWQRSLRLFKYM 128

Query: 2676 QRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINAYGRNGQ 2497
            QRQIWCKPNEHIYT+MI +LGREGLLDK  EVFDEMP+  V R+VF+YT++INAYGRNGQ
Sbjct: 129  QRQIWCKPNEHIYTIMITLLGREGLLDKCREVFDEMPSQGVPRSVFAYTAVINAYGRNGQ 188

Query: 2496 YETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQPDLVTYN 2317
            ++TS++LL++MKQE++ PSILTYNTVIN+CARGG +WEGLLGLFAEMRHEG QPD++TYN
Sbjct: 189  FQTSVELLDRMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDVITYN 248

Query: 2316 TLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELLMEMEVG 2137
            TLLSAC+ RGL DEAEMVFRTMNE GV+PD+ TYSYLV TFGKL KLEKVSELL EME G
Sbjct: 249  TLLSACAHRGLGDEAEMVFRTMNEGGVVPDINTYSYLVHTFGKLNKLEKVSELLREMESG 308

Query: 2136 GTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKNGRYDQV 1957
            G  P+V+SYNVLLEAYA  GS+K+A+ VFRQMQ AGCV NA TYSILLNLYGK+GRYD V
Sbjct: 309  GNLPDVSSYNVLLEAYAESGSIKDAIGVFRQMQGAGCVPNAATYSILLNLYGKHGRYDDV 368

Query: 1956 RELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETYEGLIYA 1777
            R+LFLEMK SNT+PDA TYNILIQVFGEGGYFKEVVTLFHDMV+E VEPNMETYEGLI+A
Sbjct: 369  RDLFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMVDENVEPNMETYEGLIFA 428

Query: 1776 CGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNEVGSRPM 1597
            CGKGGL+EDAK+ILLHMN +G+VPSSK Y GVIEAYGQAALYEEA+VAFNTMNEVGS P 
Sbjct: 429  CGKGGLYEDAKKILLHMNERGVVPSSKAYTGVIEAYGQAALYEEALVAFNTMNEVGSNPT 488

Query: 1596 VETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEEAVKSYV 1417
            VET+NSL+ +FA+GGLYKE EAI FRMGE G+PR+  SFNG+IE  RQ GQ+EEAVK++V
Sbjct: 489  VETYNSLVRSFARGGLYKEVEAILFRMGESGLPRDVHSFNGVIEALRQAGQYEEAVKAHV 548

Query: 1416 EMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLAIYAKSE 1237
            EMEKA CD DE TLEAVLS+YC AGLVDESEEQFQEIK+ GI PS+ C CMMLA+YAK++
Sbjct: 549  EMEKANCDYDESTLEAVLSIYCAAGLVDESEEQFQEIKASGILPSVTCYCMMLALYAKND 608

Query: 1236 RWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCELSMRFY 1057
            R   A  LL+ ++T + SD+HQVIGQMI GD DDE+NWQ+VEY+FDKL S+GC L M+FY
Sbjct: 609  RSIDAYSLLDEMITTRVSDIHQVIGQMIKGDFDDESNWQIVEYIFDKLNSKGCGLGMKFY 668

Query: 1056 NTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACTALSVWL 877
            N L+EALWW+ Q+ERAARVLNEA+KRGLFPELFR+NKLVWSVDVHRM  G A TALS+WL
Sbjct: 669  NALLEALWWMYQRERAARVLNEASKRGLFPELFRKNKLVWSVDVHRMSEGAALTALSIWL 728

Query: 876  NDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSSFFFPGWNK 697
            ND++E+F  GE LP+LA+VVV RG+ME+S   +D P+AKAA+ FL+D VSS+F +PGWNK
Sbjct: 729  NDIQEMFMIGESLPELAAVVVARGKMEESIDAQDFPIAKAAFLFLQDIVSSAFTYPGWNK 788

Query: 696  GRIVCQKTQLKRTFSAEPSSEA-SKCDILIPLSNTPISLLGKQTSTSAAKRSESVNADTE 520
            GRIVCQ++QL+R  S   SS +  K D L+ LSN P++  G  TS S  +R ++ + D+ 
Sbjct: 789  GRIVCQQSQLRRILSGTGSSSSRKKMDKLVSLSNAPLTPAGAITSKSDVQRGKANDVDS- 847

Query: 519  RSTKSDSELMASSV 478
            R+  + +EL+ S+V
Sbjct: 848  RTDSTRTELLTSAV 861


>ref|XP_002322139.2| hypothetical protein POPTR_0015s08030g [Populus trichocarpa]
            gi|550322283|gb|EEF06266.2| hypothetical protein
            POPTR_0015s08030g [Populus trichocarpa]
          Length = 866

 Score = 1222 bits (3161), Expect = 0.0
 Identities = 603/862 (69%), Positives = 719/862 (83%), Gaps = 4/862 (0%)
 Frame = -2

Query: 3051 SLSYNSFSPVLTPAPTS-HRFLFPSKIPNPYKLSVIHRRLLLTVA--VRAKPKDLILGNP 2881
            SLS  S SP+ T +  S H F FP    +   +S    R   + A   RAKPK+L+LGNP
Sbjct: 6    SLSIPSPSPISTKSIKSKHTFPFPILPSHRRLVSFSSDRKAYSGAWKARAKPKELVLGNP 65

Query: 2880 TVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQR 2701
            +V VEKGKYSYDVETLINKLSSLPPRGSIARCLD FKNKLSL DF+LVFKEFA RGDWQR
Sbjct: 66   SVVVEKGKYSYDVETLINKLSSLPPRGSIARCLDVFKNKLSLNDFALVFKEFAQRGDWQR 125

Query: 2700 SLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSII 2521
            SLRLFK+MQRQIWCKPNEHIYT+MI +LGREGLL+K  ++F+EM  H V+R+VFSYT++I
Sbjct: 126  SLRLFKHMQRQIWCKPNEHIYTIMISLLGREGLLEKCSDIFEEMGAHGVSRSVFSYTALI 185

Query: 2520 NAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGT 2341
            N+YGRNG+YE SL+LLE+MK+E++ PSILTYNTVINSCARGG +WEGLLGLFAEMRHEG 
Sbjct: 186  NSYGRNGKYEVSLELLERMKKERVSPSILTYNTVINSCARGGLDWEGLLGLFAEMRHEGI 245

Query: 2340 QPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSE 2161
            QPD+VTYNTLL ACS+RGL DEAEMVFRTMNE GV+PD+TTY+YLV+TFGKL +L+KVSE
Sbjct: 246  QPDIVTYNTLLCACSNRGLGDEAEMVFRTMNEGGVVPDITTYTYLVDTFGKLNRLDKVSE 305

Query: 2160 LLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYG 1981
            LL EM   G  PE++SYNVLLEAYAR+G++++A  VFR MQ AGCV NAETYSILL LYG
Sbjct: 306  LLKEMASTGNVPEISSYNVLLEAYARIGNIEDATGVFRLMQEAGCVPNAETYSILLGLYG 365

Query: 1980 KNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNME 1801
            K+GRYD+VRELFLEMK SNTEPDA TYN LI VFGEGGYFKEVVTLFHDM EE VEPNME
Sbjct: 366  KHGRYDEVRELFLEMKVSNTEPDAATYNTLIDVFGEGGYFKEVVTLFHDMAEENVEPNME 425

Query: 1800 TYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTM 1621
            TYEGLI+ACGKGGLH+DAK+ILLHM+ +G++PSSK Y GVIEAYGQAA+YEEA+V  NTM
Sbjct: 426  TYEGLIFACGKGGLHDDAKKILLHMSEKGMIPSSKAYTGVIEAYGQAAMYEEALVTLNTM 485

Query: 1620 NEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQF 1441
            NE+GS+P +ET+N+LI+ FA+GGLYKE+EAI  +MG+ GV R RDSFNG+IEG+RQGGQF
Sbjct: 486  NEMGSKPTIETYNTLIYMFARGGLYKETEAILLKMGDFGVARERDSFNGVIEGFRQGGQF 545

Query: 1440 EEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMM 1261
            EEA+K+YVEMEK+R  PDERTLEAVLSVYC AGLVDES EQFQEIK+ GI P+++C CMM
Sbjct: 546  EEAIKAYVEMEKSRLVPDERTLEAVLSVYCIAGLVDESVEQFQEIKASGILPNVMCYCMM 605

Query: 1260 LAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEG 1081
            LA+YAKS+RW+ A ELL+ ++TN+ S++HQVIGQMI GD DD++NWQMVEYVFDKL SEG
Sbjct: 606  LAVYAKSDRWNEAYELLDEMLTNRASNIHQVIGQMIKGDFDDDSNWQMVEYVFDKLNSEG 665

Query: 1080 CELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGA 901
            C L MRFYNTL+EALWWLGQKERA RVL EATKRG FPELFR++KLVWSVD+HRMW G A
Sbjct: 666  CGLGMRFYNTLLEALWWLGQKERAVRVLGEATKRGHFPELFRKSKLVWSVDIHRMWEGSA 725

Query: 900  CTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSS 721
             TA+SVWLN+M E+F   +++PQLASV+VVRG +EKSS+ +D P+ KA +SFL+D V SS
Sbjct: 726  YTAISVWLNNMYEIFMNRQDIPQLASVIVVRGLLEKSSVAQDFPIGKAVHSFLQDIVPSS 785

Query: 720  FFFPGWNKGRIVCQKTQLKR-TFSAEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRS 544
            F + GWN GRI CQ++QLKR     E  S+ +K D  I L+N+P SL G +TS S  + S
Sbjct: 786  FSYSGWNNGRITCQRSQLKRFLLGTELVSDGTKKDKFIMLTNSPFSLAGTRTS-SDIETS 844

Query: 543  ESVNADTERSTKSDSELMASSV 478
                +++     + +ELM S+V
Sbjct: 845  LHNKSNSGARMGTSTELMTSTV 866


>ref|XP_006600662.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like isoform X2 [Glycine max]
          Length = 860

 Score = 1221 bits (3158), Expect = 0.0
 Identities = 611/861 (70%), Positives = 716/861 (83%), Gaps = 3/861 (0%)
 Frame = -2

Query: 3051 SLSYNSFSPV-LTPAPTSHRFLFPSKIPNPYKLSVIHRRLLLTVAVRAKPKDLILGNPTV 2875
            SLS    SP  LTP  T  +  F +  P+P       RR L   A   KP  LI  NP+V
Sbjct: 8    SLSVPHPSPFSLTPTTTLRQLFFTNFTPSP-------RRRLQLQARAGKPNVLIPINPSV 60

Query: 2874 TVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSL 2695
             VEKGKYSYDVETLIN++++LPPRGSIARCLD FKNKLSL DF+LVFKEFA RGDWQRSL
Sbjct: 61   AVEKGKYSYDVETLINRITALPPRGSIARCLDPFKNKLSLNDFALVFKEFAQRGDWQRSL 120

Query: 2694 RLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSIINA 2515
            RLFKYMQRQIWCKPNEHIYT+MI +LGREGLLDK  EVFDEMP++ VARTV+ YT++INA
Sbjct: 121  RLFKYMQRQIWCKPNEHIYTIMITLLGREGLLDKCREVFDEMPSNGVARTVYVYTAVINA 180

Query: 2514 YGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGTQP 2335
            YGRNGQ+  SL+LL  MKQE++ PSILTYNTVIN+CARGG +WEGLLGLFAEMRHEG QP
Sbjct: 181  YGRNGQFHASLELLNGMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQP 240

Query: 2334 DLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELL 2155
            D++TYNTLL AC+ RGL DEAEMVFRTMNE+G++PD+ TYSYLV+TFGKL +LEKVSELL
Sbjct: 241  DVITYNTLLGACAHRGLGDEAEMVFRTMNESGIVPDINTYSYLVQTFGKLNRLEKVSELL 300

Query: 2154 MEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKN 1975
             EME GG  P++TSYNVLLEAYA LGS+KEAMDVFRQMQAAGCVANA TYS+LLNLYGK+
Sbjct: 301  REMESGGNLPDITSYNVLLEAYAELGSIKEAMDVFRQMQAAGCVANAATYSVLLNLYGKH 360

Query: 1974 GRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETY 1795
            GRYD VR++FLEMK SNT+PDA TYNILIQVFGEGGYFKEVVTLFHDMVEE VEPNMETY
Sbjct: 361  GRYDDVRDIFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETY 420

Query: 1794 EGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTMNE 1615
            EGLI+ACGKGGL+EDAK+ILLHMN +G+VPSSK Y GVIEA+GQAALYEEA+V FNTMNE
Sbjct: 421  EGLIFACGKGGLYEDAKKILLHMNEKGIVPSSKAYTGVIEAFGQAALYEEALVVFNTMNE 480

Query: 1614 VGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEE 1435
            VGS P VET+NS IHAFA+GGLYKE+EAI  RM E G+ R+  SFNG+I+ +RQGGQ+EE
Sbjct: 481  VGSNPTVETYNSFIHAFARGGLYKEAEAILSRMNESGLKRDVHSFNGVIKAFRQGGQYEE 540

Query: 1434 AVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLA 1255
            AVKSYVEMEKA C+P+E TLE VLSVYC AGLVDESEEQFQEIK+ GI PS++C C+MLA
Sbjct: 541  AVKSYVEMEKANCEPNELTLEVVLSVYCSAGLVDESEEQFQEIKASGILPSVMCYCLMLA 600

Query: 1254 IYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEGCE 1075
            +YAK++R + A  L++ ++T + SD+HQ IGQMI GD DDE+NWQ+VEYVFDKL SEGC 
Sbjct: 601  LYAKNDRLNDAYNLIDEMITMRVSDIHQGIGQMIKGDFDDESNWQIVEYVFDKLNSEGCG 660

Query: 1074 LSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACT 895
            L MRFYN L+EALWW+ Q+ERAARVLNEA+KRGLFPELFR++KLVWSVDVHRM  GGA T
Sbjct: 661  LGMRFYNALLEALWWMFQRERAARVLNEASKRGLFPELFRKSKLVWSVDVHRMSEGGALT 720

Query: 894  ALSVWLNDMEELFHKGEELPQLASVVVV-RGQMEKSSITRDLPVAKAAYSFLKDTVSSSF 718
            ALSVWLN+M E+   G +LP+LA+VVVV RG MEKS+  +D P+AKAA SFL+D V SSF
Sbjct: 721  ALSVWLNNMHEMSRTGNDLPELATVVVVSRGHMEKSTEAQDFPIAKAAISFLQDNVPSSF 780

Query: 717  FFPGWNKGRIVCQKTQLKRTFS-AEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSE 541
             FPGWNKGRIVCQ++QL+R  S  E SS   K D L+ LSNTP++  G  TS S  +  +
Sbjct: 781  TFPGWNKGRIVCQQSQLRRILSGTESSSSRKKMDKLVSLSNTPLTTAGVITSKSDVQSGK 840

Query: 540  SVNADTERSTKSDSELMASSV 478
            + + D+ R+  + +EL+ S++
Sbjct: 841  ANDVDS-RTDSTRTELLTSAI 860


>ref|XP_004157803.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Cucumis sativus]
          Length = 864

 Score = 1215 bits (3143), Expect = 0.0
 Identities = 593/861 (68%), Positives = 715/861 (83%), Gaps = 10/861 (1%)
 Frame = -2

Query: 3030 SPVLTPAPTSHRFLFPSKIPNPYKLS--VIHRRLLL--------TVAVRAKPKDLILGNP 2881
            +P+ TP+    R+L   ++P   KLS   + RR              VRAK KDL+LGNP
Sbjct: 11   NPLRTPSTFQKRYLECQQLPFLSKLSNFSVRRRFFSDDWRLSSDVGKVRAKAKDLVLGNP 70

Query: 2880 TVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQR 2701
            +V VEKGKYSYDVETLINKLSSLPPRGSIARCLD FKN+LSL DFSLVFKEFAARGDWQR
Sbjct: 71   SVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFKEFAARGDWQR 130

Query: 2700 SLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSII 2521
            SLRLFKYMQRQIWCKPNEHIYT++I +LGREGLL+K  E+FDEM +  V R+VFSYT++I
Sbjct: 131  SLRLFKYMQRQIWCKPNEHIYTIIISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTALI 190

Query: 2520 NAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGT 2341
            NAYGRNGQYETSL+LLE+MK+E++ P+ILTYNTVIN+CARG  +WEGLLGLFAEMRHEG 
Sbjct: 191  NAYGRNGQYETSLELLERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGV 250

Query: 2340 QPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSE 2161
            QPDLVTYNTLLSAC++RGL DEAEMVF+TM E G++P++TTYSY+VETFGKLGKLEKV+ 
Sbjct: 251  QPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKVAM 310

Query: 2160 LLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYG 1981
            LL EME  G  P+++SYNVL+EA+A+LGS+KEAMDVF+QMQAAGCV NA TYSILLNLYG
Sbjct: 311  LLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYG 370

Query: 1980 KNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNME 1801
            K+GRYD VRELFL+MK S+ EPDA TYNILI+VFGEGGYFKEVVTLFHD+V+E ++PNME
Sbjct: 371  KHGRYDDVRELFLQMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVDENIDPNME 430

Query: 1800 TYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTM 1621
            TYEGL++ACGKGGLHEDAK+IL HMNG+G+VPSSK Y+G+IEAYGQAALY+EA+VAFNTM
Sbjct: 431  TYEGLVFACGKGGLHEDAKKILFHMNGKGIVPSSKAYSGLIEAYGQAALYDEALVAFNTM 490

Query: 1620 NEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQF 1441
            NEVGS+  ++T+NSLIH FA+GGLYKE EAI  RM E G+ RN  SF+G+IEGYRQ GQ+
Sbjct: 491  NEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGIIEGYRQSGQY 550

Query: 1440 EEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMM 1261
            EEA+K++VEMEK RC+ DE+TLE VL VYCFAGLVDES+EQF EIK+ GI PS++C CMM
Sbjct: 551  EEAIKAFVEMEKMRCELDEQTLEGVLGVYCFAGLVDESKEQFIEIKASGILPSVLCYCMM 610

Query: 1260 LAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEG 1081
            LA+YAK+ RWD A ELL+ ++  + S +HQVIGQMI GD DD++NWQMVEYVFDKL +EG
Sbjct: 611  LAVYAKNGRWDDASELLDEMIKTRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNAEG 670

Query: 1080 CELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGA 901
            C   MRFYNTL+EALWWLGQK RAARVL EATKRGLFPELFR++KLVWSVDVHRMW GGA
Sbjct: 671  CGFGMRFYNTLLEALWWLGQKGRAARVLTEATKRGLFPELFRQSKLVWSVDVHRMWEGGA 730

Query: 900  CTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSS 721
             TA+S+W+N M E+   GE+LPQLA+VVV RG +EK S  R+LP+A+A YSFL+D VSSS
Sbjct: 731  YTAVSLWVNKMNEMLMDGEDLPQLAAVVVGRGSLEKDSTARNLPIARAVYSFLQDNVSSS 790

Query: 720  FFFPGWNKGRIVCQKTQLKRTFSAEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSE 541
            F FPGWN  RI+CQ++QLK+  +A  S        +I L+N+P +L   + S S     E
Sbjct: 791  FSFPGWNNSRIICQQSQLKQLLTASSSE-------IIALNNSPFNLPEAKISRSGINNDE 843

Query: 540  SVNADTERSTKSDSELMASSV 478
              + D++ S ++ +EL+ ++V
Sbjct: 844  YKDVDSKSSNRTGTELLTTTV 864


>ref|XP_004152453.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Cucumis sativus]
          Length = 864

 Score = 1213 bits (3139), Expect = 0.0
 Identities = 592/861 (68%), Positives = 715/861 (83%), Gaps = 10/861 (1%)
 Frame = -2

Query: 3030 SPVLTPAPTSHRFLFPSKIPNPYKLS--VIHRRLLL--------TVAVRAKPKDLILGNP 2881
            +P+ TP+    R+L   ++P   KLS   + RR              VRAK KDL+LGNP
Sbjct: 11   NPLRTPSTFQKRYLECQQLPFLSKLSNFSVRRRFFSDDWRLSSDVGKVRAKAKDLVLGNP 70

Query: 2880 TVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQR 2701
            +V VEKGKYSYDVETLINKLSSLPPRGSIARCLD FKN+LSL DFSLVFKEFAARGDWQR
Sbjct: 71   SVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFKEFAARGDWQR 130

Query: 2700 SLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTHSVARTVFSYTSII 2521
            SLRLFKYMQRQIWCKPNEHIYT++I +LGREGLL+K  E+FDEM +  V R+VFSYT++I
Sbjct: 131  SLRLFKYMQRQIWCKPNEHIYTIIISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTALI 190

Query: 2520 NAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGT 2341
            NAYGRNGQYETSL+LLE+MK+E++ P+ILTYNTVIN+CARG  +WEGLLGLFAEMRHEG 
Sbjct: 191  NAYGRNGQYETSLELLERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGV 250

Query: 2340 QPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSE 2161
            QPDLVTYNTLLSAC++RGL DEAEMVF+TM E G++P++TTYSY+VETFGKLGKLEKV+ 
Sbjct: 251  QPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKVAM 310

Query: 2160 LLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYG 1981
            LL EME  G  P+++SYNVL+EA+A+LGS+KEAMDVF+QMQAAGCV NA TYSILLNLYG
Sbjct: 311  LLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYG 370

Query: 1980 KNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNME 1801
            K+GRYD VRELFL+MK S+ EPDA TYNILI+VFGEGGYFKEVVTLFHD+V+E ++PNME
Sbjct: 371  KHGRYDDVRELFLQMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVDENIDPNME 430

Query: 1800 TYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQAALYEEAVVAFNTM 1621
            TYEGL++ACGKGGLHEDAK+IL HMNG+G+VPSSK Y+G+IEAYGQAALY+EA+VAFNTM
Sbjct: 431  TYEGLVFACGKGGLHEDAKKILFHMNGKGIVPSSKAYSGLIEAYGQAALYDEALVAFNTM 490

Query: 1620 NEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQF 1441
            NEVGS+  ++T+NSLIH FA+GGLYKE EAI  RM E G+ RN  SF+G+IEGYRQ GQ+
Sbjct: 491  NEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGIIEGYRQSGQY 550

Query: 1440 EEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMM 1261
            EEA+K++VEMEK RC+ DE+TLE VL VYCFAGLVDES+EQF EIK+ GI PS++C CMM
Sbjct: 551  EEAIKAFVEMEKMRCELDEQTLEGVLGVYCFAGLVDESKEQFIEIKASGILPSVLCYCMM 610

Query: 1260 LAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQMVEYVFDKLKSEG 1081
            LA+YAK+ RWD A ELL+ ++  + S +HQVIGQMI GD DD++NWQMVEYVFDKL +EG
Sbjct: 611  LAVYAKNGRWDDASELLDEMIKTRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNAEG 670

Query: 1080 CELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGA 901
            C   MRFYNTL+EALWWLGQK RAARVL EATKRGLFPELFR++KLVWSVDVHRMW GGA
Sbjct: 671  CGFGMRFYNTLLEALWWLGQKGRAARVLTEATKRGLFPELFRQSKLVWSVDVHRMWEGGA 730

Query: 900  CTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAKAAYSFLKDTVSSS 721
             TA+S+W+N M E+   GE+LPQLA+VVV RG +EK S  R+LP+A+A YSFL+D VSSS
Sbjct: 731  YTAVSLWVNKMNEMLMDGEDLPQLAAVVVGRGSLEKDSTARNLPIARAVYSFLQDNVSSS 790

Query: 720  FFFPGWNKGRIVCQKTQLKRTFSAEPSSEASKCDILIPLSNTPISLLGKQTSTSAAKRSE 541
            F FPGWN  RI+CQ++QLK+  +A  S        +I L+N+P +L   + S S     +
Sbjct: 791  FSFPGWNNSRIICQQSQLKQLLTASSSE-------IIALNNSPFNLPEAKISRSGINNDK 843

Query: 540  SVNADTERSTKSDSELMASSV 478
              + D++ S ++ +EL+ ++V
Sbjct: 844  YKDVDSKSSNRTGTELLTTTV 864


>ref|XP_006300609.1| hypothetical protein CARUB_v10019779mg [Capsella rubella]
            gi|482569319|gb|EOA33507.1| hypothetical protein
            CARUB_v10019779mg [Capsella rubella]
          Length = 865

 Score = 1209 bits (3127), Expect = 0.0
 Identities = 586/814 (71%), Positives = 696/814 (85%), Gaps = 1/814 (0%)
 Frame = -2

Query: 2919 VRAKPKDLILGNPTVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSL 2740
            ++AK KDL+LGNP+V+VEKGKYSYDVE+LINKLSSLPPRGSIARCLD FKNKLSL DF+L
Sbjct: 51   IKAKTKDLVLGNPSVSVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFAL 110

Query: 2739 VFKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTH 2560
            VFKEFA R DWQRSLRLFKYMQRQIWCKPNEHIYT+MI +LGREGLLDK  EVFDEMP  
Sbjct: 111  VFKEFAGRSDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPGQ 170

Query: 2559 SVARTVFSYTSIINAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEG 2380
             V+R+VFSYT++INAYGRNG+YETSL+LL++MK EKI PSILTYNTVIN+CARGG +WEG
Sbjct: 171  GVSRSVFSYTALINAYGRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEG 230

Query: 2379 LLGLFAEMRHEGTQPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVE 2200
            LLGLFAEMRHEG Q D+VTYNTLLSAC+ RGL DEAEMVFRTMN+ G++PD+TTYS+LVE
Sbjct: 231  LLGLFAEMRHEGIQSDIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVE 290

Query: 2199 TFGKLGKLEKVSELLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVA 2020
            TFGKLG+LEKVS+LL EM  GG+ P++TSYNVLLEAYA+ GS+KE+M VF QMQAAGC  
Sbjct: 291  TFGKLGRLEKVSDLLSEMASGGSLPDITSYNVLLEAYAKSGSIKESMGVFHQMQAAGCTP 350

Query: 2019 NAETYSILLNLYGKNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLF 1840
            NA TYS+LLNL+G++GRYD VR+LFLEMK+SNT+PDA TYNILI+VFGEGGYFKEVVTLF
Sbjct: 351  NANTYSVLLNLFGQSGRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLF 410

Query: 1839 HDMVEEKVEPNMETYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQA 1660
            HDMVEE +EP+METYEG+I+ACGKGGL EDA++IL +M    +VPSSK Y GVIEA+GQA
Sbjct: 411  HDMVEENIEPDMETYEGIIFACGKGGLQEDARKILQYMTANDIVPSSKAYTGVIEAFGQA 470

Query: 1659 ALYEEAVVAFNTMNEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSF 1480
            ALYEEA+VAFNTM+EVGS P +ET++SL+++FA+GGL KESEAI  R+ + G+PRNRD+F
Sbjct: 471  ALYEEALVAFNTMHEVGSNPSIETYHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTF 530

Query: 1479 NGMIEGYRQGGQFEEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKS 1300
            N  IE Y+QGG+FEEAVK+YV+MEK+RCDPDERTLEAVLSVY FA LVDE  EQF+E+K+
Sbjct: 531  NAQIEAYKQGGRFEEAVKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKA 590

Query: 1299 LGIQPSIICCCMMLAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQ 1120
              I PSI+C CMMLA+Y K+ERWD   ELL  +++N+ S++HQVIGQMI GD DD++NWQ
Sbjct: 591  SDILPSIMCYCMMLAVYGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQ 650

Query: 1119 MVEYVFDKLKSEGCELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLV 940
            +VEYV DKL SEGC L +RFYN L++ALWWLGQKERAARVLNEATKRGLFPELFR+NKLV
Sbjct: 651  IVEYVLDKLNSEGCGLGIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLV 710

Query: 939  WSVDVHRMWPGGACTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAK 760
            WSVDVHRM  GG  TALSVWLNDM ++F  GE+LPQLA VV VRGQ+EKSS  R+ P+AK
Sbjct: 711  WSVDVHRMSEGGMYTALSVWLNDMNDMFLTGEDLPQLAVVVSVRGQLEKSSAARESPIAK 770

Query: 759  AAYSFLKDTVSSSFFFPGWNKGRIVCQKTQLKRTFSA-EPSSEASKCDILIPLSNTPISL 583
            AA+SFL+D VSSSF F GWN GRI+CQ++QLK+  S  EP+SE S+   L+ L+N+P+  
Sbjct: 771  AAFSFLQDHVSSSFSFTGWNGGRIMCQRSQLKQLLSTKEPTSEESQDKNLVALTNSPVFA 830

Query: 582  LGKQTSTSAAKRSESVNADTERSTKSDSELMASS 481
             G +TSTS           T+R T+   EL  S+
Sbjct: 831  AGTRTSTSKDTNHSDSGNPTQRRTRVKKELAGST 864


>ref|NP_177623.1| plastid transcriptionally active 2 [Arabidopsis thaliana]
            gi|75194055|sp|Q9S7Q2.1|PP124_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g74850, chloroplastic; AltName: Full=Protein PLASTID
            TRANSCRIPTIONALLY ACTIVE 2; Flags: Precursor
            gi|5882738|gb|AAD55291.1|AC008263_22 Contains 3 PF|01535
            DUF17 domains [Arabidopsis thaliana]
            gi|12323908|gb|AAG51934.1|AC013258_28 hypothetical
            protein; 81052-84129 [Arabidopsis thaliana]
            gi|332197518|gb|AEE35639.1| plastid transcriptionally
            active 2 [Arabidopsis thaliana]
          Length = 862

 Score = 1203 bits (3113), Expect = 0.0
 Identities = 589/814 (72%), Positives = 701/814 (86%), Gaps = 1/814 (0%)
 Frame = -2

Query: 2919 VRAKPKDLILGNPTVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSL 2740
            ++AK KDL+LGNP+V+VEKGKYSYDVE+LINKLSSLPPRGSIARCLD FKNKLSL DF+L
Sbjct: 51   IKAKTKDLVLGNPSVSVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFAL 110

Query: 2739 VFKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTH 2560
            VFKEFA RGDWQRSLRLFKYMQRQIWCKPNEHIYT+MI +LGREGLLDK  EVFDEMP+ 
Sbjct: 111  VFKEFAGRGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQ 170

Query: 2559 SVARTVFSYTSIINAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEG 2380
             V+R+VFSYT++INAYGRNG+YETSL+LL++MK EKI PSILTYNTVIN+CARGG +WEG
Sbjct: 171  GVSRSVFSYTALINAYGRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEG 230

Query: 2379 LLGLFAEMRHEGTQPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVE 2200
            LLGLFAEMRHEG QPD+VTYNTLLSAC+ RGL DEAEMVFRTMN+ G++PD+TTYS+LVE
Sbjct: 231  LLGLFAEMRHEGIQPDIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVE 290

Query: 2199 TFGKLGKLEKVSELLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVA 2020
            TFGKL +LEKV +LL EM  GG+ P++TSYNVLLEAYA+ GS+KEAM VF QMQAAGC  
Sbjct: 291  TFGKLRRLEKVCDLLGEMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTP 350

Query: 2019 NAETYSILLNLYGKNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLF 1840
            NA TYS+LLNL+G++GRYD VR+LFLEMK+SNT+PDA TYNILI+VFGEGGYFKEVVTLF
Sbjct: 351  NANTYSVLLNLFGQSGRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLF 410

Query: 1839 HDMVEEKVEPNMETYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQA 1660
            HDMVEE +EP+METYEG+I+ACGKGGLHEDA++IL +M    +VPSSK Y GVIEA+GQA
Sbjct: 411  HDMVEENIEPDMETYEGIIFACGKGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQA 470

Query: 1659 ALYEEAVVAFNTMNEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSF 1480
            ALYEEA+VAFNTM+EVGS P +ETF+SL+++FA+GGL KESEAI  R+ + G+PRNRD+F
Sbjct: 471  ALYEEALVAFNTMHEVGSNPSIETFHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTF 530

Query: 1479 NGMIEGYRQGGQFEEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKS 1300
            N  IE Y+QGG+FEEAVK+YV+MEK+RCDPDERTLEAVLSVY FA LVDE  EQF+E+K+
Sbjct: 531  NAQIEAYKQGGKFEEAVKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKA 590

Query: 1299 LGIQPSIICCCMMLAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQ 1120
              I PSI+C CMMLA+Y K+ERWD   ELL  +++N+ S++HQVIGQMI GD DD++NWQ
Sbjct: 591  SDILPSIMCYCMMLAVYGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQ 650

Query: 1119 MVEYVFDKLKSEGCELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLV 940
            +VEYV DKL SEGC L +RFYN L++ALWWLGQKERAARVLNEATKRGLFPELFR+NKLV
Sbjct: 651  IVEYVLDKLNSEGCGLGIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLV 710

Query: 939  WSVDVHRMWPGGACTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAK 760
            WSVDVHRM  GG  TALSVWLND+ ++  KG +LPQLA VV VRGQ+EKSS  R+ P+AK
Sbjct: 711  WSVDVHRMSEGGMYTALSVWLNDINDMLLKG-DLPQLAVVVSVRGQLEKSSAARESPIAK 769

Query: 759  AAYSFLKDTVSSSFFFPGWNKGRIVCQKTQLKRTFSA-EPSSEASKCDILIPLSNTPISL 583
            AA+SFL+D VSSSF F GWN GRI+CQ++QLK+  S  EP+SE S+   L+ L+N+PI  
Sbjct: 770  AAFSFLQDHVSSSFSFTGWNGGRIMCQRSQLKQLLSTKEPTSEESENKNLVALANSPIFA 829

Query: 582  LGKQTSTSAAKRSESVNADTERSTKSDSELMASS 481
             G + STS +  + S N  T+R T++  EL  S+
Sbjct: 830  AGTRASTS-SDTNHSGN-PTQRRTRTKKELAGST 861


>ref|XP_006390383.1| hypothetical protein EUTSA_v10018112mg [Eutrema salsugineum]
            gi|557086817|gb|ESQ27669.1| hypothetical protein
            EUTSA_v10018112mg [Eutrema salsugineum]
          Length = 863

 Score = 1202 bits (3109), Expect = 0.0
 Identities = 584/813 (71%), Positives = 697/813 (85%), Gaps = 1/813 (0%)
 Frame = -2

Query: 2919 VRAKPKDLILGNPTVTVEKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSL 2740
            ++AK KDL+LGNP+V+VEKGKYSYDVE+LINKLSSLPPRGSIARCLD FKNKLSL DF+L
Sbjct: 50   IKAKTKDLVLGNPSVSVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFAL 109

Query: 2739 VFKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAYEVFDEMPTH 2560
            VFKEFA RGDWQRSLRLFKYMQRQIWCKPNEHIYT+MI +LGREGLLDK  E+FDEMP+ 
Sbjct: 110  VFKEFAGRGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEIFDEMPSQ 169

Query: 2559 SVARTVFSYTSIINAYGRNGQYETSLQLLEKMKQEKIVPSILTYNTVINSCARGGYEWEG 2380
             VAR+VFSYT++INAYGRNG+YETSL+LL++MK EKI PSILTYNTVIN+CARGG +WEG
Sbjct: 170  GVARSVFSYTALINAYGRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEG 229

Query: 2379 LLGLFAEMRHEGTQPDLVTYNTLLSACSSRGLEDEAEMVFRTMNEAGVLPDVTTYSYLVE 2200
            LLGLFAEMRHEG QPD+VTYNTLLSAC+ RGL DEAEMVFRTMN+ G++PD+TTYS+LVE
Sbjct: 230  LLGLFAEMRHEGIQPDIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVE 289

Query: 2199 TFGKLGKLEKVSELLMEMEVGGTSPEVTSYNVLLEAYARLGSMKEAMDVFRQMQAAGCVA 2020
            TFGKL +L KVS+LL EM  GG+ P++TSYNVLLEAYA+ GS+KEAM VF QMQAAGC  
Sbjct: 290  TFGKLSRLVKVSDLLSEMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTP 349

Query: 2019 NAETYSILLNLYGKNGRYDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLF 1840
            NA TYS+LLNL+G++GRYD VR+LFLEMK+SNT+PDA TYNILI+VFGEGGYFKEVVTLF
Sbjct: 350  NANTYSVLLNLFGQSGRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLF 409

Query: 1839 HDMVEEKVEPNMETYEGLIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYNGVIEAYGQA 1660
            HDMVEE +EP+METYEG+I+ACGKGGLHEDA+++L +M  + +VPSSK Y GVIEA+GQA
Sbjct: 410  HDMVEENIEPDMETYEGIIFACGKGGLHEDARKVLQYMTAKDVVPSSKAYTGVIEAFGQA 469

Query: 1659 ALYEEAVVAFNTMNEVGSRPMVETFNSLIHAFAKGGLYKESEAIWFRMGEVGVPRNRDSF 1480
            ALYEEA+VAFNTM+EVGS P +ET++SL+++FA+GGL+KESE I  R+ + G+PRNRD+F
Sbjct: 470  ALYEEALVAFNTMHEVGSNPSIETYHSLLYSFARGGLFKESEVILSRLVDSGIPRNRDTF 529

Query: 1479 NGMIEGYRQGGQFEEAVKSYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKS 1300
            N  IE YRQGG+FEEAVK+YV+MEK+RCDPDERTLEAVLSVY  A LVDE  EQF+E+K+
Sbjct: 530  NAQIEAYRQGGKFEEAVKTYVDMEKSRCDPDERTLEAVLSVYSCARLVDECREQFEEMKA 589

Query: 1299 LGIQPSIICCCMMLAIYAKSERWDMARELLNGVMTNKTSDMHQVIGQMIHGDLDDENNWQ 1120
              I PSI+C CMML++Y K+ERW    ELL  +++N+ S++HQVIGQMI GD DD++NWQ
Sbjct: 590  SDILPSIMCYCMMLSVYGKTERWGDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQ 649

Query: 1119 MVEYVFDKLKSEGCELSMRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLV 940
            +VEYV DKL SEGC L +RFYN L++ALWWLGQKERAARVLNEATKRGLFPELFR+NKLV
Sbjct: 650  IVEYVLDKLNSEGCGLGIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLV 709

Query: 939  WSVDVHRMWPGGACTALSVWLNDMEELFHKGEELPQLASVVVVRGQMEKSSITRDLPVAK 760
             SVDVHRM  GG  TALSVWLND+ ++  KGE+LPQLA VV VRGQ+EKSS  R+ P+AK
Sbjct: 710  RSVDVHRMSEGGMYTALSVWLNDINDMLLKGEDLPQLAVVVSVRGQLEKSSAARESPIAK 769

Query: 759  AAYSFLKDTVSSSFFFPGWNKGRIVCQKTQLKRTFSA-EPSSEASKCDILIPLSNTPISL 583
            AA+SFL+D VSSSF F GWN GRI+CQ++QLK+  +  EP+SE S+   L+ LSN+PI  
Sbjct: 770  AAFSFLQDHVSSSFSFTGWNGGRIMCQRSQLKQLLATKEPTSEESQNKYLVALSNSPIFA 829

Query: 582  LGKQTSTSAAKRSESVNADTERSTKSDSELMAS 484
             G +TSTS+       N  ++R TK   EL  S
Sbjct: 830  AGTRTSTSSDTNHSGGN-PSQRRTKMKKELAGS 861

