BLASTX nr result

ID: Catharanthus22_contig00002061 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00002061
         (4787 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containi...   993   0.0  
ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containi...   976   0.0  
ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   909   0.0  
gb|EOY07712.1| Tetratricopeptide repeat (TPR)-like superfamily p...   896   0.0  
ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containi...   856   0.0  
ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containi...   855   0.0  
ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citr...   852   0.0  
gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]     850   0.0  
ref|XP_002326162.1| predicted protein [Populus trichocarpa]           850   0.0  
ref|XP_006381507.1| pentatricopeptide repeat-containing family p...   849   0.0  
ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   837   0.0  
ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Caps...   810   0.0  
ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutr...   805   0.0  
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                800   0.0  
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   799   0.0  
ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutr...   798   0.0  
ref|XP_002525196.1| pentatricopeptide repeat-containing protein,...   795   0.0  
ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [A...   768   0.0  
ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containi...   752   0.0  
gb|ESW34707.1| hypothetical protein PHAVU_001G174000g [Phaseolus...   743   0.0  

>ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum tuberosum]
          Length = 859

 Score =  993 bits (2567), Expect = 0.0
 Identities = 525/855 (61%), Positives = 636/855 (74%), Gaps = 12/855 (1%)
 Frame = -1

Query: 4739 MRDAVAILASSSVVLXXXXXXXXXXXXXHSQSST---KLKPL-LPRSFTRRTQSADSNSP 4572
            MR+A+ IL SSS+ L               + +T   K KP   P  FT  +      SP
Sbjct: 1    MREALVILPSSSITLPPNSSPTPPPHHRRYRRTTAKPKRKPTHSPSHFT--SSITTPQSP 58

Query: 4571 LLSTVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVVSGVKPSVFATLLNLK 4392
            LLST+RWDS S   N LKYYA+LAS L+ DGRF D LMIAESVVVSGV  + FA LLN+K
Sbjct: 59   LLSTLRWDSASGSCNGLKYYAELASKLAQDGRFDDSLMIAESVVVSGVNAAEFAALLNVK 118

Query: 4391 LVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCDEALE 4212
            LVS GI R+L E K+ S++++  G ++LG   ++L D  A+ AL  EC R +  C E  E
Sbjct: 119  LVSGGIVRLLEERKVGSVVELLNGAQQLGIDPLKLLDGDALNALSRECRRTMG-CGEIEE 177

Query: 4211 IVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCSIMIEFGK 4032
            +VSLMETL+  G  IK LV  S I+RLCV++R P+AA+RYA  FP   ++ C+I++EFGK
Sbjct: 178  VVSLMETLKGCGMPIKDLVKPSEILRLCVSQRKPNAAVRYAHIFPHVDIMFCTIILEFGK 237

Query: 4031 RRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAPNIYV 3852
            + DLVSALTVFE SKQNQ  PN+Y YRT +DVCGLCGD LKSRSIYE L+A KF PNIYV
Sbjct: 238  KGDLVSALTVFEASKQNQDTPNLYIYRTAIDVCGLCGDYLKSRSIYEGLIASKFTPNIYV 297

Query: 3851 FNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNEVRKI 3672
            FNSLMNVNA DL+YTL IYKQMQK+GV ADL SYNILLKSCCLA RVDLAK+IY E++ +
Sbjct: 298  FNSLMNVNACDLSYTLDIYKQMQKLGVPADLTSYNILLKSCCLATRVDLAKEIYGELKHL 357

Query: 3671 ESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACANAGL 3492
            E  G LKLDVFTYST+IKV ADA+MWQMAL+IK+DML AGV PNIVTW+SLISACANAGL
Sbjct: 358  EMAGALKLDVFTYSTLIKVFADAKMWQMALEIKKDMLSAGVTPNIVTWSSLISACANAGL 417

Query: 3491 VEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQKGISVDIH 3312
            V+QAIQLFEEML AGCEPNSQC NILLHACVE+CQYDRAFRLF+ WK N +QK    D  
Sbjct: 418  VDQAIQLFEEMLQAGCEPNSQCYNILLHACVEACQYDRAFRLFRSWKENALQKDNCEDFG 477

Query: 3311 CKTENILCANPNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNILMKACGTDYFRAKA 3132
             KT+N +  +P     +   + TS   H  F    PF PTTSTYNIL+KACG+DY+RAKA
Sbjct: 478  GKTDNTIDLSPTLVVSASIPTRTSASSHGHFSTRVPFRPTTSTYNILIKACGSDYYRAKA 537

Query: 3131 LMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYTTAIKVCV 2952
            LM+EMK  GLSPNHI+W+ILIDI GG GNVEGALQILR+M +AG+QPDVVTYTT IKVCV
Sbjct: 538  LMEEMKEVGLSPNHITWTILIDICGGSGNVEGALQILRAMREAGIQPDVVTYTTIIKVCV 597

Query: 2951 EQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKAGYKH 2772
            E K+ K AFSLFA MKRYQIKPNMVTYNTLLRARS+YGSLQEVQQCLAIYQ MRKAGYK 
Sbjct: 598  ENKDFKSAFSLFAAMKRYQIKPNMVTYNTLLRARSRYGSLQEVQQCLAIYQHMRKAGYKP 657

Query: 2771 NDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTSPELQNLLLEKVAVHLQDINAESLTID 2592
            NDYYLKQLI +WCEG++QN N+ K  F + + T    ++++L+KVA HLQ  +A S++I+
Sbjct: 658  NDYYLKQLIEQWCEGVIQNGNQRKYNFSTRNRTDLGPESMILDKVAEHLQKDSANSISIN 717

Query: 2591 LQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIF--EDVGSQ-TRHKGGVKEAIVK 2421
            L+GLSK+EARIVVLAVLRMI+E+++ GDS+K+D+ I    ++VG +    +  VKEAIVK
Sbjct: 718  LRGLSKVEARIVVLAVLRMIREKYTAGDSIKEDVQIFLGVQEVGIRAVGQESVVKEAIVK 777

Query: 2420 LLQYDLGLEVLSAGLRVKKDSESGGFENPFPSKDVLQGRD-----LPPKSPARRPAVLQR 2256
            LLQ+DLGLEV+SA  R+  D    G  +P    ++ +  +         SP R+P VLQ+
Sbjct: 778  LLQHDLGLEVISAASRIGNDRNQDGINHPDKHSNMEENAERVILRANVHSPTRKPVVLQK 837

Query: 2255 LKIKKASLNHWLQRK 2211
            ++I K SL  WL R+
Sbjct: 838  MRITKESLQSWLTRR 852


>ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum lycopersicum]
          Length = 857

 Score =  976 bits (2524), Expect = 0.0
 Identities = 525/858 (61%), Positives = 634/858 (73%), Gaps = 15/858 (1%)
 Frame = -1

Query: 4739 MRDAVAILASSSVVLXXXXXXXXXXXXXHSQSST---KLKPL-LPRSFTRRTQSADSNSP 4572
            MR+A+ IL SSS+ L             H + +T   K KP   P  FT  +      SP
Sbjct: 1    MREALVILPSSSITLPPNSSPTPPPHHRHYRRTTAKPKRKPTHSPSHFT--SSITTPQSP 58

Query: 4571 LLSTVRWDSPSQVR--NRLKYYADLASNLSDDGRFTDFLMIAESVVVSGVKPSVFATLLN 4398
            LLS++RWDS S     N LKYYA+LAS L+ DGRF D LMIAESVVVSGV    F  LLN
Sbjct: 59   LLSSLRWDSASASGSCNGLKYYAELASKLAQDGRFDDSLMIAESVVVSGVNAEEFTALLN 118

Query: 4397 LKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCDEA 4218
            +KLVS GI R+L E K+ S++++  G ++LG    +L DE +I AL  EC RR  +C E 
Sbjct: 119  VKLVSGGIVRLLEERKVGSVVELLNGAQQLGIDPSKLLDEDSINALSREC-RRTMQCSEI 177

Query: 4217 LEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCSIMIEF 4038
             E+VSLMETL+  G  IK LV  S I+RLCV++R P+AA+RYA  FP   ++ C+I++EF
Sbjct: 178  EEVVSLMETLRGCGMPIKDLVKPSEILRLCVSQRKPNAAVRYAHIFPHVDIMFCTIILEF 237

Query: 4037 GKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAPNI 3858
            GK+ DL SALTVFE SKQNQ  PN+Y YRT +DVCGLCGD LKSRSIYE L+A KF PNI
Sbjct: 238  GKKGDLASALTVFEASKQNQDTPNLYIYRTAIDVCGLCGDYLKSRSIYEGLIASKFTPNI 297

Query: 3857 YVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNEVR 3678
            YVFNSLMNVNA DL+YTL IYKQMQK+GV ADL SYNILLKSCCLA RVDLAK+IY E++
Sbjct: 298  YVFNSLMNVNACDLSYTLDIYKQMQKLGVPADLTSYNILLKSCCLATRVDLAKEIYGELK 357

Query: 3677 KIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACANA 3498
             +E  G LKLDVFTYST+IKV ADA+MWQMAL+IK+DML AGV PNIVTW+SLISACANA
Sbjct: 358  HLEMAGALKLDVFTYSTLIKVFADAKMWQMALEIKKDMLSAGVTPNIVTWSSLISACANA 417

Query: 3497 GLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQKGISVD 3318
            G+V+QAIQLFEEML AGCEPNSQC NILLHACVE+CQYDRAFRLF+ WK N +QK    D
Sbjct: 418  GVVDQAIQLFEEMLQAGCEPNSQCYNILLHACVEACQYDRAFRLFRSWKENALQKDKCED 477

Query: 3317 IHCKTENILCANPNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNILMKACGTDYFRA 3138
               KT+N +  +P     +   + TS   H+      PF PTTSTYNILMKACG+DY+RA
Sbjct: 478  YGGKTDNNIDLSPTLVVSASIPTRTSASSHRHISTRVPFIPTTSTYNILMKACGSDYYRA 537

Query: 3137 KALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYTTAIKV 2958
            KALM+EMK  GLSPNHI+W+ILIDI GG GNVEGALQILR M +AG+QPDVVTYTT IKV
Sbjct: 538  KALMEEMKEVGLSPNHITWTILIDICGGSGNVEGALQILRVMREAGIQPDVVTYTTIIKV 597

Query: 2957 CVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKAGY 2778
            CVE K+ K AFSLFA MKRYQIKPNMVTYNTLLRARS+YGSLQEVQQCLAIYQDMRKAGY
Sbjct: 598  CVENKDFKSAFSLFAAMKRYQIKPNMVTYNTLLRARSRYGSLQEVQQCLAIYQDMRKAGY 657

Query: 2777 KHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTSPELQNLLLEKVAVHLQDINAESLT 2598
            K NDYYLKQLI +WCEG++QN N+ K  F + + T    Q+++LEKVA HLQ  +A S++
Sbjct: 658  KPNDYYLKQLIEQWCEGVIQNANQRKYNFSTRNRTDLGPQSMILEKVAEHLQKDSANSIS 717

Query: 2597 IDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIF--EDVGSQ-TRHKGGVKEAI 2427
            I+L+GL+K+EARIVVLAVLRMI+E+++ GDS+KDD+ I    ++VG +  + +  VKEAI
Sbjct: 718  INLRGLTKVEARIVVLAVLRMIREKYTAGDSIKDDVQIFLGVKEVGIRAVKQESVVKEAI 777

Query: 2426 VKLLQYDLGLEVLSAGLRVKKDSESGGFENPFPSKDVLQGRD----LPPK--SPARRPAV 2265
            ++LLQ+DLGLEV+SA   +       G  +P      ++       L P   SP R+P V
Sbjct: 778  IQLLQHDLGLEVISAASTI-----GNGINHPDNKHSNMEENAERVILRPSVYSPTRKPVV 832

Query: 2264 LQRLKIKKASLNHWLQRK 2211
            LQ+++I K SL  WL R+
Sbjct: 833  LQKMRITKESLQSWLTRR 850


>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic [Vitis vinifera]
            gi|297741486|emb|CBI32618.3| unnamed protein product
            [Vitis vinifera]
          Length = 842

 Score =  909 bits (2350), Expect = 0.0
 Identities = 499/866 (57%), Positives = 610/866 (70%), Gaps = 20/866 (2%)
 Frame = -1

Query: 4739 MRDAVAILASSSVVLXXXXXXXXXXXXXHSQSSTKLKPLLPRSFTRRTQ----SADSNSP 4572
            MRD V  L  SS+VL               +   K KP L  S + R      S  S  P
Sbjct: 1    MRDLV--LLGSSIVLPPDPIPPHHRTKP--KPKPKPKPSLLTSTSARLSPPISSLRSRHP 56

Query: 4571 LLSTVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVVSGVKPSVFATLLNLK 4392
            LLS VRWD        L  Y+DLA+ L  DGRF DF  +AE++++SGV+ S       ++
Sbjct: 57   LLSDVRWD--------LNNYSDLATKLVQDGRFDDFSTMAETLILSGVELSQL-----VE 103

Query: 4391 LVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCDEALE 4212
            LVS GI  +L EG++  +++V + V KLG   +ELFD   +E L  EC RR+  C +  E
Sbjct: 104  LVSAGISGLLREGRVYCVVEVLRKVDKLGICPLELFDGSTLELLSKEC-RRILNCGQVEE 162

Query: 4211 IVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCSIMIEFGK 4032
            +V L+E L    F +K L++    I++CVNKR+P+ A+RYA   P A +L C+I+ EFGK
Sbjct: 163  VVELIEILDGFHFPVKKLLEPLDFIKICVNKRNPNLAVRYACILPHAQILFCTIIHEFGK 222

Query: 4031 RRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAPNIYV 3852
            +RDL SALT FE SKQ    PNMY YRT++DVCGLC    KSR IYEELLA K  PNIYV
Sbjct: 223  KRDLGSALTAFEASKQKLIGPNMYCYRTMIDVCGLCSHYQKSRYIYEELLAQKITPNIYV 282

Query: 3851 FNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNEVRKI 3672
            FNSLMNVN  DL+YT  +YK MQ +GVTAD+ASYNILLK+CC+AGRVDLA++IY EV+ +
Sbjct: 283  FNSLMNVNVHDLSYTFNVYKNMQNLGVTADMASYNILLKACCVAGRVDLAQEIYREVQNL 342

Query: 3671 ESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACANAGL 3492
            ES G+LKLDVFTYST+IKV ADA++WQMALKIKEDML AGVIPN VTW++LIS+CANAG+
Sbjct: 343  ESNGMLKLDVFTYSTIIKVFADAKLWQMALKIKEDMLSAGVIPNTVTWSALISSCANAGI 402

Query: 3491 VEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWK-------ANGIQK 3333
             EQAIQLF+EML AGCEPNSQC NILLHACVE+CQYDRAFRLF+ WK       + G   
Sbjct: 403  TEQAIQLFKEMLLAGCEPNSQCYNILLHACVEACQYDRAFRLFQSWKDSRFQEISGGTGN 462

Query: 3332 GISVDIHCKTENILCANPNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNILMKACGT 3153
            G +V +  K +N + + PN   +S          H SF  SFPF+PTT+TYNILMKACGT
Sbjct: 463  GNTVGVELKHQNCITSMPNCLSNSH---------HLSFSKSFPFTPTTTTYNILMKACGT 513

Query: 3152 DYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYT 2973
            DY+RAKALMDEMK  GLSPNHISWSILIDI GG GN+ GA++IL++M +AG++PDVV YT
Sbjct: 514  DYYRAKALMDEMKTAGLSPNHISWSILIDICGGTGNIVGAVRILKTMREAGIKPDVVAYT 573

Query: 2972 TAIKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDM 2793
            TAIK CVE KNLK AFSLFAEMKRYQI+PN+VTYNTLLRARS+YGSL EVQQCLAIYQ M
Sbjct: 574  TAIKYCVESKNLKIAFSLFAEMKRYQIQPNLVTYNTLLRARSRYGSLHEVQQCLAIYQHM 633

Query: 2792 RKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVS-HSGTSPELQNLLLEKVAVHLQDI 2616
            RKAGYK NDYYLK+LI EWCEG++Q+NN N+ +F S +       Q+LLLEKVA HLQ  
Sbjct: 634  RKAGYKSNDYYLKELIEEWCEGVIQDNNLNQSKFSSVNRADWGRPQSLLLEKVAAHLQKS 693

Query: 2615 NAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIF---EDVGSQTRHKG 2445
             AESL IDLQGL+++EARIVVLAVLRMIKE +  G  +KDD++II    +   +   H+ 
Sbjct: 694  VAESLAIDLQGLTQVEARIVVLAVLRMIKENYILGHPIKDDILIILGIKKVDANLVEHES 753

Query: 2444 GVKEAIVKLLQYDLGLEVLSAGLRVKKDS--ESGGFENPFPSKDVLQGRDLPP---KSPA 2280
             VK AI+KLLQ +LGLEV  AG ++  D     GG     P      GR+  P   +S  
Sbjct: 754  PVKGAIIKLLQDELGLEVAFAGPKIALDKRINLGGPPGSDPDWQEALGRNRLPTELESST 813

Query: 2279 RRPAVLQRLKIKKASLNHWLQRKGGS 2202
            RRPAVLQR K+ + SL+HWLQR+ G+
Sbjct: 814  RRPAVLQRFKVTRKSLDHWLQRRVGA 839


>gb|EOY07712.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao]
          Length = 858

 Score =  896 bits (2315), Expect = 0.0
 Identities = 476/800 (59%), Positives = 586/800 (73%), Gaps = 12/800 (1%)
 Frame = -1

Query: 4574 PLLST--VRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVVSGVKPSVFATLL 4401
            PLLS+  VRWD  S+  + LKYYADLAS L++DGR  DF MI E +V SGV      ++L
Sbjct: 64   PLLSSSSVRWDPTSRRSSLLKYYADLASKLAEDGRLEDFAMIVEMLVASGVNAPRIVSML 123

Query: 4400 NLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCDE 4221
            +++ VS G+   + EGK+ S+++V K V+KLG +  +L D   + +++ E  +R+    E
Sbjct: 124  SVQFVSKGVASNVQEGKVKSVVEVLKKVEKLGIAPSKLVDGFGLVSMKRE-FQRIVGSGE 182

Query: 4220 ALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCSIMIE 4041
              + V L+E L+   F+IK LVD S+II++CV+KR+P+ A+RYA   P A +L CSI+ E
Sbjct: 183  VEQAVDLLEALRGFQFTIKELVDPSYIIKVCVDKRNPNLAVRYACLLPHAKILFCSIISE 242

Query: 4040 FGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAPN 3861
            FGK+RDL SALT +E SK+N   PNMY YR I+D CGLCGD LKSR+IYE+L+  +  PN
Sbjct: 243  FGKKRDLASALTAYEASKKNLSGPNMYLYRAIIDACGLCGDYLKSRNIYEDLVNQRVTPN 302

Query: 3860 IYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNEV 3681
            IYVFNSLMNVNA DL YTL +YK MQ +G+TAD+ASYNILLK+CCLA RVDLA+DIYNEV
Sbjct: 303  IYVFNSLMNVNAHDLGYTLDVYKDMQNLGITADMASYNILLKACCLAQRVDLAQDIYNEV 362

Query: 3680 RKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACAN 3501
            + +ES GVLKLDVFTY T+IKV ADAR+WQMALKIKEDML AGV PN VTW+SLISACAN
Sbjct: 363  KHLESTGVLKLDVFTYCTIIKVFADARLWQMALKIKEDMLSAGVTPNTVTWSSLISACAN 422

Query: 3500 AGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQKGISV 3321
            AGLVEQA QLFEEM+  GCEPNSQCCNILLHACVE+ QYDRAFRLF  W   G Q+G + 
Sbjct: 423  AGLVEQAFQLFEEMILTGCEPNSQCCNILLHACVEASQYDRAFRLFHCW--TGGQEGFAG 480

Query: 3320 DIHCKTENILCANPNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNILMKACGTDYFR 3141
            +I    +++L     + R +  + + S   H SF   F F+PTT+TYNILMKAC TDY+R
Sbjct: 481  NI----DSVLGTKQLNNRTTSTALTNS--HHLSFAKKFSFTPTTATYNILMKACCTDYYR 534

Query: 3140 AKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYTTAIK 2961
            AKALMDEMK  GLSPNH+SWSILIDI  G GNVEGA+QIL++M+  G++PDVV YTTAIK
Sbjct: 535  AKALMDEMKSVGLSPNHVSWSILIDICRGSGNVEGAIQILKTMHVTGIKPDVVAYTTAIK 594

Query: 2960 VCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKAG 2781
            VCV  KNLK AFSLF EMKRY+++PN+VTYNTLLRARS+YGSL EVQQCLAIYQDMRKAG
Sbjct: 595  VCVGSKNLKLAFSLFEEMKRYRVQPNLVTYNTLLRARSRYGSLHEVQQCLAIYQDMRKAG 654

Query: 2780 YKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTSPEL-QNLLLEKVAVHLQDINAES 2604
            YK ND YLK+LI EWCEG+++ NN  +    S   T  E   +LLLEK+AVHLQ   AES
Sbjct: 655  YKSNDIYLKELIEEWCEGVIKENNHKREGLSSCKRTDLERPHSLLLEKIAVHLQMSTAES 714

Query: 2603 LTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIF---EDVGSQTRHKGGVKE 2433
              IDL+GL+K+EARIVVLAVLRMIKE H  G SVKDD++II    E   +  + K  VK+
Sbjct: 715  PAIDLRGLTKVEARIVVLAVLRMIKENHILGHSVKDDMLIILGVSERHANAAKQKSEVKD 774

Query: 2432 AIVKLLQYDLGLEVLSAGLRVKKDSESGGFENPFPSKDVL------QGRDLPPKSPARRP 2271
            A++KLLQ +LGLEVL    +VK        + P  +  VL            P S  RRP
Sbjct: 775  AVMKLLQDELGLEVLLVEPQVKNGLVD--LQTPIDADPVLLETVGKNSLSSKPLSSTRRP 832

Query: 2270 AVLQRLKIKKASLNHWLQRK 2211
             +LQRLK+ + SLNHWL R+
Sbjct: 833  VILQRLKVTRKSLNHWLWRR 852


>ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cucumis sativus]
          Length = 849

 Score =  856 bits (2212), Expect = 0.0
 Identities = 468/847 (55%), Positives = 592/847 (69%), Gaps = 11/847 (1%)
 Frame = -1

Query: 4721 ILASSSVVLXXXXXXXXXXXXXHSQSSTKLKPL---LPRSFTRRTQSADSNSPLLSTVRW 4551
            IL SSS  +                S + L P    LP  F+  T +  S   LLS+V  
Sbjct: 6    ILGSSSASIAGPRRYRHSHCKAPKSSLSNLSPTGTHLP--FSSHTSTRHSPPALLSSVEL 63

Query: 4550 D---SPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVVSGVKPSVFATLLNLKLVSV 4380
            D   + S  R  +++YA +AS L++ G+  DF M+ ESVVV+GV+PS F  +L ++LV+ 
Sbjct: 64   DIAGASSGGRIPIQHYAGVASKLAEGGKLEDFAMVVESVVVAGVEPSQFGAMLAVELVAK 123

Query: 4379 GIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCDEALEIVSL 4200
            GI R L EGK+ S++QV + V++LG SV+EL DE A+E+LR +C RR+AK  E  E+V L
Sbjct: 124  GISRCLREGKVWSVVQVLRKVEELGISVLELCDEPAVESLRRDC-RRMAKSGELEELVEL 182

Query: 4199 METLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCSIMIEFGKRRDL 4020
            ME L   GFS++ ++  S +I+LCV+ R+P  AIRYA   P A +L C+ + EFGK+RDL
Sbjct: 183  MEVLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPHADILFCTTINEFGKKRDL 242

Query: 4019 VSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAPNIYVFNSL 3840
             SA   +  SK N    NMY YRTI+DVCGLCGD  KSR+IY++L+     PNI+VFNSL
Sbjct: 243  KSAYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIYQDLVNQNVIPNIFVFNSL 302

Query: 3839 MNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNEVRKIESMG 3660
            MNVNA DL YT Q+YK MQ +GV AD+ASYNILLK+CCLAGRVDLA+DIY EV+ +E+ G
Sbjct: 303  MNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTG 362

Query: 3659 VLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACANAGLVEQA 3480
            VLKLDVFTYST++KV ADA++W+MAL++KEDM  AGV PN+VTW+SLIS+CAN+GLVE A
Sbjct: 363  VLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELA 422

Query: 3479 IQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQKGISVDIHCKTE 3300
            IQLFEEM+ AGCEPN+QCCN LLHACVE  Q+DRAFRLF+ WK   +  GI  +    T+
Sbjct: 423  IQLFEEMVSAGCEPNTQCCNTLLHACVEGRQFDRAFRLFRSWKEKELWDGI--ERKSSTD 480

Query: 3299 NILCANPNHERDSRKSSSTSILKHQ-SFPVSFPFSPTTSTYNILMKACGTDYFRAKALMD 3123
            N L A+   +  + K  +     HQ SF  +F F PT +TYNILMKACGTDY+ AKALM+
Sbjct: 481  NNLDADSTSQLCNTKMPNAPSHVHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALME 540

Query: 3122 EMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYTTAIKVCVEQK 2943
            EMK  GL+PNHISWSIL+DI G   +VE A+QIL +M  AGV PDVV YTTAIKVCVE K
Sbjct: 541  EMKSVGLTPNHISWSILVDICGRSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGK 600

Query: 2942 NLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKAGYKHNDY 2763
            N K AFSLF EMKR++I+PN+VTY+TLLRARS YGSL EVQQCLAIYQDMRK+G+K ND+
Sbjct: 601  NWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDH 660

Query: 2762 YLKQLIGEWCEGILQNNNRNKGQFVSHSGTS-PELQNLLLEKVAVHLQDINAESLTIDLQ 2586
            YLK+LI EWCEG++Q NN+   +    +     + + L+LEKVA HLQ   AESLTIDLQ
Sbjct: 661  YLKELIAEWCEGVIQKNNQQPVEITPCNKIDIGKPRCLILEKVADHLQKSFAESLTIDLQ 720

Query: 2585 GLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFEDVGSQT---RHKGGVKEAIVKLL 2415
             L+K+EARIVVLAVLRMIKE ++ G+SVKDD+ II E    +T        V++AI +LL
Sbjct: 721  ELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVNKVETDLVPQNFEVRDAITRLL 780

Query: 2414 QYDLGLEVLSAGLRVKKDSESGGFENPFPSKDVLQGRDLPPKSPARRPAVLQRLKIKKAS 2235
            Q +LGLEVL  G  +  D       +       L+G     K   R+PA +QRLK+ K S
Sbjct: 781  QDELGLEVLPTGPTIALDKVPNSESSKISHTTKLKGTMGRNKYFTRKPADVQRLKVTKKS 840

Query: 2234 LNHWLQR 2214
            L  WLQR
Sbjct: 841  LQDWLQR 847


>ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 847

 Score =  855 bits (2210), Expect = 0.0
 Identities = 470/835 (56%), Positives = 578/835 (69%), Gaps = 22/835 (2%)
 Frame = -1

Query: 4649 QSSTKLKPLLPRSFTRRTQSADSNSP---------LLSTVRWDSPSQVRNRLKYYADLAS 4497
            +S+   KP +P+  +    S+ S++P         L +  RWD        L Y+ADLAS
Sbjct: 27   RSTKPYKPPIPKLLSPSKSSSSSSAPPIISTRPPPLFAGTRWDPH---HTHLSYFADLAS 83

Query: 4496 NLSDDGRFTDFLMIAESVVVSGVKPSVFATLLNLKLVSVGIRRMLSEGKLVSLIQVFKGV 4317
             L+ DG+  DF M+ ESVV+SGVKPS F   L L +VS GI  +L +GK+  L++V   V
Sbjct: 84   KLARDGKLHDFSMLLESVVLSGVKPSQFTAALQLDMVSRGISGILKDGKVGGLVEVLVKV 143

Query: 4316 KKLGFSVVELFDELAIEALRHECHRRLAKCDEALEIVSLMETLQELGFSIKGLVDASHII 4137
             +LG   VELFD  A+E L   C  RL K  +  E+V LME L  L F I+ LVD S +I
Sbjct: 144  AELGVRPVELFDGYAMELLGAHC-LRLLKFKQVQELVELMEVLYGLHFPIRELVDPSEVI 202

Query: 4136 RLCVNKRDPDAAIRYAQNFPQAGMLLCSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMYA 3957
            + CV KR P  AIRYA  FP + ML C+IM EFGK+R L SALT +E SK+     NMY 
Sbjct: 203  KACVEKRRPKLAIRYACIFPHSHMLFCNIMYEFGKKRALASALTAYEASKEKLSGSNMYI 262

Query: 3956 YRTIVDVCGLCGDCLKSRSIYEELLACKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQKV 3777
            YRTI+DVCG+C D +KSR IYE+LL  K  PNIYVFNSLMNVN+ DL+YT  +YK MQ +
Sbjct: 263  YRTIIDVCGVCKDYMKSRYIYEDLLKQKVIPNIYVFNSLMNVNSHDLSYTFHVYKSMQNL 322

Query: 3776 GVTADLASYNILLKSCCLAGRVDLAKDIYNEVRKIESMGVLKLDVFTYSTMIKVLADARM 3597
            GVTADLA YNILLK+C LAGRVDLA+DIY EV+ +ES GVLKLDVFTYST++KV +DA+M
Sbjct: 323  GVTADLACYNILLKACSLAGRVDLAQDIYKEVQHLESTGVLKLDVFTYSTVVKVFSDAKM 382

Query: 3596 WQMALKIKEDMLLAGVIPNIVTWTSLISACANAGLVEQAIQLFEEMLWAGCEPNSQCCNI 3417
            W MAL +KEDM  AGVIPN VTW+S ISACANAGLV++AIQLFEEML A CEPNSQC NI
Sbjct: 383  WHMALNVKEDMQSAGVIPNTVTWSSFISACANAGLVDKAIQLFEEMLLASCEPNSQCFNI 442

Query: 3416 LLHACVESCQYDRAFRLFKGWKANGIQKGISVDIHCKTENILCANPNHERDSRKSSST-- 3243
            LLHACVE+CQYDRAFRLF  +K+N +Q+                  N++  +  SS+T  
Sbjct: 443  LLHACVEACQYDRAFRLFHSFKSNKLQETF--------------GKNYKGSAGSSSTTIP 488

Query: 3242 SILKHQSFPVSFPFSPTTSTYNILMKACGTDYFRAKALMDEMKMEGLSPNHISWSILIDI 3063
             I+   +F     F PTT+TYN LMKACG+DY+ AKALMDEMK  GL PN I+WSIL DI
Sbjct: 489  LIILPSNFAEGLSFKPTTTTYNTLMKACGSDYYHAKALMDEMKTVGLLPNQITWSILADI 548

Query: 3062 FGGKGNVEGALQILRSMYQAGVQPDVVTYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPN 2883
             G  GNV+GALQIL+SM  AG+QPDVV YTTAIK+CVE +NL  A  LFAEMK+YQI PN
Sbjct: 549  CGSSGNVQGALQILKSMRVAGIQPDVVAYTTAIKICVESENLDLALLLFAEMKKYQIHPN 608

Query: 2882 MVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRN 2703
            +VTYNTLLRARS+YGS+ EVQQCLAIYQDMRKAGYK NDYYL+QLI EWCEG++Q++   
Sbjct: 609  LVTYNTLLRARSRYGSVSEVQQCLAIYQDMRKAGYKPNDYYLEQLIEEWCEGVIQDSCPK 668

Query: 2702 KGQF-VSHSGTSPELQNLLLEKVAVHLQDINAESLTIDLQGLSKIEARIVVLAVLRMIKE 2526
            +G+F            +LLLEKVA HLQ   A++L +DLQGL+K+EARIVVLAVLRMIKE
Sbjct: 669  QGEFSYGDKADIGRPGSLLLEKVAEHLQQHIADTLAVDLQGLTKVEARIVVLAVLRMIKE 728

Query: 2525 QHSPGDSVKDDLMIIF----EDVGSQTRHKGGVKEAIVKLLQYDLGLEVLSAGLRVKKDS 2358
             +  GDSVKDD++I+     E  G  T H   VK+AI KLLQ +LGL+VLS   +V  D+
Sbjct: 729  NYILGDSVKDDMLIMVGVHDEVDGGSTAHNLEVKDAITKLLQDELGLKVLSTVPKVALDT 788

Query: 2357 ESGGFENPFPSKDVLQ----GRDLPPK--SPARRPAVLQRLKIKKASLNHWLQRK 2211
                 +N   S   L      ++L P+     RRP VL+RLK+ + SL  WL+++
Sbjct: 789  TIVS-QNTIDSDQNLDEKPLRKELQPELIYSTRRPVVLERLKVSRKSLQQWLRKR 842


>ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citrus clementina]
            gi|568853887|ref|XP_006480569.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Citrus sinensis]
            gi|557530964|gb|ESR42147.1| hypothetical protein
            CICLE_v10011055mg [Citrus clementina]
          Length = 850

 Score =  852 bits (2202), Expect = 0.0
 Identities = 466/830 (56%), Positives = 585/830 (70%), Gaps = 17/830 (2%)
 Frame = -1

Query: 4640 TKLKPLLPRSFTRRTQSADSN-SPLLSTVRWDSPSQVRNRLKYYADLASNLSDDGRFTDF 4464
            TKL PL   S      +  S+ + LLSTVR D  S    R  YYAD+AS L+ DGR  +F
Sbjct: 35   TKLFPLASSSSLSSIPTVHSSQTALLSTVRRDLSS----RNDYYADMASKLAKDGRLEEF 90

Query: 4463 LMIAESVVVSGVKPSVFATLLNLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELF 4284
             MI ESVVVS    S FA++L+L++V+ GI + + EG++  ++ V K + +LG + +ELF
Sbjct: 91   AMIVESVVVSEGNVSKFASMLSLEMVASGIVKSIGEGRIDCVVGVLKKLNELGVAPLELF 150

Query: 4283 DELAIEALRHECHRRLAKCDEALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDA 4104
                 + L++EC +RL    E    V LME L+E    +K L +   I++LCVNK D + 
Sbjct: 151  HGSGFKLLKNEC-QRLLDSGEVEMFVGLMEVLEEFRLPVKELDEEFRIVQLCVNKPDVNL 209

Query: 4103 AIRYAQNFPQAGMLLCSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLC 3924
            AIRYA   P+A +L C+ + EFGK+RDLVSAL  +E SK++   PNMY  RTI+DVCGLC
Sbjct: 210  AIRYACIVPRADILFCNFVREFGKKRDLVSALRAYEASKKHLSSPNMYICRTIIDVCGLC 269

Query: 3923 GDCLKSRSIYEELLACKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNI 3744
            GD +KSR+IYE+L +     NIYVFNSLMNVNA DL +TL++YK MQK+GV AD+ASYNI
Sbjct: 270  GDYMKSRAIYEDLRSQNVTLNIYVFNSLMNVNAHDLKFTLEVYKNMQKLGVMADMASYNI 329

Query: 3743 LLKSCCLAGRVDLAKDIYNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDM 3564
            LLK+CCLAG   LA++IY EV+ +E+ GVLKLDVFTYST++KV ADA+ WQMALK+KEDM
Sbjct: 330  LLKACCLAGNTVLAQEIYGEVKHLEAKGVLKLDVFTYSTIVKVFADAKWWQMALKVKEDM 389

Query: 3563 LLAGVIPNIVTWTSLISACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQY 3384
            L AGV PN +TW+SLI+ACANAGLVEQA+ LFEEM  AGCEPNSQCCNILL ACVE+CQ+
Sbjct: 390  LSAGVTPNTITWSSLINACANAGLVEQAMHLFEEMRQAGCEPNSQCCNILLQACVEACQF 449

Query: 3383 DRAFRLFKGWKANGIQKGISVDIHCKTENILCANPNHERDSRKSSSTSIL---KHQSFPV 3213
            DRAFRLF+ W  +  Q  +  D    T+ I  +N  H+     +++ + +    + SF  
Sbjct: 450  DRAFRLFRSWTLSKTQVALGEDYDGNTDRI--SNMEHKDKQSITNTPNFVPNSHYSSFDK 507

Query: 3212 SFPFSPTTSTYNILMKACGTDYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGA 3033
             F F PTT+TYNILMKAC TDY+R KALMDEM+  GLSPNHISW+ILID  GG GNVEGA
Sbjct: 508  RFSFKPTTTTYNILMKACCTDYYRVKALMDEMRTVGLSPNHISWTILIDACGGSGNVEGA 567

Query: 3032 LQILRSMYQAGVQPDVVTYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRA 2853
            LQIL+ M + G+ PDVV YTTAIKVCV  K LK AFSLF EMK YQI+PN+VTY TLLRA
Sbjct: 568  LQILKIMREDGMSPDVVAYTTAIKVCVRSKRLKLAFSLFEEMKHYQIQPNLVTYITLLRA 627

Query: 2852 RSQYGSLQEVQQCLAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQF-VSHSG 2676
            RS+YGSL EVQQCLA+YQDM KAGYK ND YLK++I EWCEG++Q+ N+N+G+  +    
Sbjct: 628  RSRYGSLHEVQQCLAVYQDMWKAGYKANDTYLKEVIEEWCEGVIQDKNQNQGEVTLCRRT 687

Query: 2675 TSPELQNLLLEKVAVHLQDINAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKD 2496
             S   Q+LLLEKVAVHLQ   AE+L IDLQGL+K+EARIVVLAVL+M+KE +S G  VKD
Sbjct: 688  NSQRPQSLLLEKVAVHLQKSAAENLAIDLQGLTKVEARIVVLAVLQMMKENYSLGVPVKD 747

Query: 2495 DLMIIF--EDVGS-QTRHKGGVKEAIVKLLQYDLGLEVLSAGLRVKK---------DSES 2352
            DLMI+     V   Q +H   VK+AI KLLQ DLGL+V   G  ++          DSES
Sbjct: 748  DLMIVLGPNKVNKIQAKHDLEVKDAITKLLQDDLGLKVFLDGPSIQHKNAHMQKLLDSES 807

Query: 2351 GGFENPFPSKDVLQGRDLPPKSPARRPAVLQRLKIKKASLNHWLQRKGGS 2202
                      ++ +   +  KS  RRP +LQRLK+ K SL+HWLQR+ GS
Sbjct: 808  ----------NMAKTLHIELKSSTRRPKILQRLKVPKKSLHHWLQRRVGS 847


>gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]
          Length = 822

 Score =  850 bits (2195), Expect = 0.0
 Identities = 458/816 (56%), Positives = 577/816 (70%), Gaps = 14/816 (1%)
 Frame = -1

Query: 4616 RSFTRRTQSADSNSPLLSTVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVV 4437
            R F   ++ + SN P  S       S VR+ L+++AD A     D +  D  ++ ES+ V
Sbjct: 29   RHFRSSSKFSPSNLPSRS-------SAVRSDLRHFADFAG----DAKLRDLSVVVESLAV 77

Query: 4436 SGVKPSVFATLLNLKLVSV--GIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEA 4263
            SGV  S   + L  +L S   GI  +L +GK+ S  ++   + +LGF  VE+FD  A+E 
Sbjct: 78   SGVDASRLRSALRAELASAEKGISAVLRDGKVRSFARLLGKLDELGFPPVEIFDGWALEL 137

Query: 4262 LRHECHRRLAKCDEALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQN 4083
            +R EC RR+ +C++  E+V L E L   GFSIK LV  S +I++CV KR+P  AIRYA  
Sbjct: 138  IRREC-RRILRCEQVEELVELFEVLSGYGFSIKELVKPSDVIKICVEKRNPKMAIRYACT 196

Query: 4082 FPQAGMLLCSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSR 3903
             P A ++ C  + EFGK+ DLVSAL   E SK+N    NMY YRTI+DVCG C D  KSR
Sbjct: 197  LPHAHIIFCDAVYEFGKKGDLVSALIAHEASKKNSTSTNMYLYRTIIDVCGRCHDYQKSR 256

Query: 3902 SIYEELLACKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCL 3723
             IYE+LL  K  PN+YVFNSLMNVNA D +YTL +YK MQ +GV AD+ASYNILLK+CCL
Sbjct: 257  YIYEDLLNEKVTPNVYVFNSLMNVNAHDFSYTLNVYKDMQNLGVQADMASYNILLKACCL 316

Query: 3722 AGRVDLAKDIYNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIP 3543
            AGRVDLA+DIY EV+ +ES G+LKLDVFTYST++KVLADA++WQMALK+KEDML AGV P
Sbjct: 317  AGRVDLAQDIYKEVQHLESTGLLKLDVFTYSTIVKVLADAKLWQMALKVKEDMLSAGVNP 376

Query: 3542 NIVTWTSLISACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLF 3363
            N VTW+SLISACANAG+V++A+QLFEEML AGC+PN+QCCNILLHACVE+CQYDRAFRLF
Sbjct: 377  NTVTWSSLISACANAGIVDKAVQLFEEMLLAGCKPNTQCCNILLHACVEACQYDRAFRLF 436

Query: 3362 KGWKANGIQKGISVDIHCKTENILCANPNHERDSRKS------SSTSILKHQSFPVSFPF 3201
            +  K N +Q+    D               +RDS +S      S +S L   +F    PF
Sbjct: 437  EFLKRNRVQETSEED------------GRGDRDSNQSAGVTSISQSSTLCGLNFARELPF 484

Query: 3200 SPTTSTYNILMKACGTDYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQIL 3021
            +PTT+TYNILMKACG+DY+ AKAL++EM+  GLSPN I+WSILIDI G  GNVEGALQIL
Sbjct: 485  TPTTTTYNILMKACGSDYYHAKALIEEMEAVGLSPNQITWSILIDICGDLGNVEGALQIL 544

Query: 3020 RSMYQAGVQPDVVTYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQY 2841
            ++M   G++PDVV YTT IKVCVE K+LK AF LFAEMKRYQI+PN+VTYNTLLRAR++Y
Sbjct: 545  KTMRATGIEPDVVAYTTVIKVCVESKDLKQAFELFAEMKRYQIQPNLVTYNTLLRARNRY 604

Query: 2840 GSLQEVQQCLAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTSPEL 2661
            GSLQEV+QCLA+YQDMR+AGY  NDYYLKQLI EWCEG++Q NN+N+ +  S + T  + 
Sbjct: 605  GSLQEVKQCLAVYQDMRRAGYNSNDYYLKQLIEEWCEGVIQGNNQNREESSSFNKTDKKR 664

Query: 2660 -QNLLLEKVAVHLQDINAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMI 2484
             Q+LLLEKVA HL+   AE+LT+D+QGL K+EARIVVLAVLRM+KE ++ G  VKDD++I
Sbjct: 665  PQSLLLEKVAEHLEKHIAETLTVDVQGLKKVEARIVVLAVLRMVKENYTMGYLVKDDMLI 724

Query: 2483 IF---EDVGSQTRHKGGVKEAIVKLLQYDLGLEVLSAGLRVKKDSESGGFENPFPSKDVL 2313
            I    +        +  VK+AI KLL+ +LGLEVLS GL+++ + +           D L
Sbjct: 725  IIGACKVDAVPDEQELEVKDAITKLLKDELGLEVLSTGLKIEPNRQ--------VDSDSL 776

Query: 2312 QGRDL--PPKSPARRPAVLQRLKIKKASLNHWLQRK 2211
               D     K   RRP V+QRLK+ K SL HWLQRK
Sbjct: 777  GSSDFSGEMKYSTRRPVVIQRLKVTKESLQHWLQRK 812


>ref|XP_002326162.1| predicted protein [Populus trichocarpa]
          Length = 828

 Score =  850 bits (2195), Expect = 0.0
 Identities = 452/801 (56%), Positives = 578/801 (72%), Gaps = 10/801 (1%)
 Frame = -1

Query: 4574 PLLSTVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVVSGVKPSVFATLLNL 4395
            PLLST+ +       + L Y+A+LAS L++DGR  DF+MIAESV+ SGV+PS F   L++
Sbjct: 54   PLLSTIPFRQNHNSSSLLDYHANLASKLAEDGRLQDFVMIAESVIASGVEPSSFVAALSV 113

Query: 4394 KLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCDEAL 4215
              V+ GI + L +G +  +++  K  ++LG S ++  D +AI+ L+ E   R+  C +  
Sbjct: 114  GPVAKGISKNLQQGNVDCVVRFLKKTEELGVSTLKFLDGVAIDLLKKE-FIRIVNCGDVE 172

Query: 4214 EIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLL-CSIMIEF 4038
            ++V +METL    FS K LVD S+II++CV+K +P  A+RYA  FP  G +L C+I+ EF
Sbjct: 173  QVVYIMETLAGFCFSFKELVDPSYIIKICVDKLNPKMAVRYAAIFPGEGRILFCNIISEF 232

Query: 4037 GKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAPNI 3858
            G++  L SAL  ++ +K     PNMY +RTI+DVCGLCGD +KSR IYE+L+  K  PN+
Sbjct: 233  GRKGHLDSALVAYDEAKHKLSVPNMYLHRTIIDVCGLCGDYMKSRYIYEDLINRKVIPNV 292

Query: 3857 YVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNEVR 3678
            YVFNSLMNVNA DL YT  ++K MQ +GVTAD+ASYNILLK+CC+AGRVDLAKDIY EV+
Sbjct: 293  YVFNSLMNVNAHDLGYTFSVFKNMQNLGVTADVASYNILLKACCIAGRVDLAKDIYREVK 352

Query: 3677 KIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACANA 3498
            ++ES  VLKLDVFTY  ++K+ ADA+MWQMALKIKEDML +GV PN+  W+SLISACANA
Sbjct: 353  QLESAEVLKLDVFTYCMIVKIFADAKMWQMALKIKEDMLSSGVTPNMHIWSSLISACANA 412

Query: 3497 GLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQKGISVD 3318
            GLVEQAIQLFEEML +GC+PNSQCCNILLHACV++CQYDRAFRLF+ WK +  Q+    D
Sbjct: 413  GLVEQAIQLFEEMLLSGCKPNSQCCNILLHACVQACQYDRAFRLFQCWKGSEAQEVFHGD 472

Query: 3317 IHCKTENILCANPNHERDSRKSSSTSI--LKHQSFPVSFPFSPTTSTYNILMKACGTDYF 3144
                 + I      H +    + +T +    H +F   FPF+PT +TY++LMKACG+DY 
Sbjct: 473  HSGNADEI-----EHAQKHCPNMTTIVPNSHHLNFIKKFPFTPTPATYHMLMKACGSDYH 527

Query: 3143 RAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYTTAI 2964
            RAKALMDEMK  G+SPNHISWSILIDI G  GNV GA+QIL++M  AGV+PDVV YTTAI
Sbjct: 528  RAKALMDEMKTVGISPNHISWSILIDICGVSGNVSGAVQILKNMRMAGVEPDVVAYTTAI 587

Query: 2963 KVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKA 2784
            KVCVE KNLK AFSLFAEMKR QI PN+VTYNTLLRAR++YGSL+EVQQCLAIYQDMRKA
Sbjct: 588  KVCVETKNLKLAFSLFAEMKRCQINPNLVTYNTLLRARTRYGSLREVQQCLAIYQDMRKA 647

Query: 2783 GYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTS-PELQNLLLEKVAVHLQDINAE 2607
            GYK NDYYLKQLI EWCEG++Q+NN+ +G F S   T     ++LLLEKVA HLQ+  +E
Sbjct: 648  GYKSNDYYLKQLIEEWCEGVIQDNNQIQGGFASCKRTDLGRPRSLLLEKVAAHLQNNISE 707

Query: 2606 SLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFE--DVGSQTRHKGGVKE 2433
            +L IDLQGL+K+EARIVVLAVLRMIKE ++ G SVK+D+ I  +   V   ++    VK 
Sbjct: 708  NLAIDLQGLTKVEARIVVLAVLRMIKENYTLGYSVKEDMWITLDVSKVDPASKRDSEVKN 767

Query: 2432 AIVKLLQYDLGLEVLSAGL----RVKKDSESGGFENPFPSKDVLQGRDLPPKSPARRPAV 2265
            AI++LL+ +LGLEVL A       +K DS+S                       +  P V
Sbjct: 768  AIIELLRNELGLEVLVAVPGHLDDIKTDSKS-----------------------SLDPVV 804

Query: 2264 LQRLKIKKASLNHWLQRKGGS 2202
             QRLK+++ SL+ WLQR+ G+
Sbjct: 805  TQRLKVRRKSLHEWLQRRAGA 825


>ref|XP_006381507.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550336211|gb|ERP59304.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 828

 Score =  849 bits (2193), Expect = 0.0
 Identities = 452/801 (56%), Positives = 578/801 (72%), Gaps = 10/801 (1%)
 Frame = -1

Query: 4574 PLLSTVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVVSGVKPSVFATLLNL 4395
            PLLST+ +       + L Y+A+LAS L++DGR  DF+MIAESV+ SGV+PS F   L++
Sbjct: 54   PLLSTIPFRQNHNSSSLLDYHANLASKLAEDGRLQDFVMIAESVIASGVEPSSFVAALSV 113

Query: 4394 KLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCDEAL 4215
              V+ GI + L +G +  +++  K  ++LG S ++  D +AI+ L+ E   R+  C +  
Sbjct: 114  GPVAKGISKNLQQGNVDCVVRFLKKTEELGVSTLKFLDGVAIDLLKKE-FIRIVNCGDVE 172

Query: 4214 EIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLL-CSIMIEF 4038
            ++V +METL    FS K LVD S+II++CV+K +P  A+RYA  FP  G +L C+I+ EF
Sbjct: 173  QVVYIMETLAGFCFSFKELVDPSYIIKICVDKLNPKMAVRYAAIFPGEGRILFCNIISEF 232

Query: 4037 GKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAPNI 3858
            G++  L SAL  ++ +K     PNMY +RTI+DVCGLCGD +KSR IYE+L+  K  PN+
Sbjct: 233  GRKGHLDSALVAYDEAKHKLSVPNMYLHRTIIDVCGLCGDYMKSRYIYEDLINRKVIPNV 292

Query: 3857 YVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNEVR 3678
            YVFNSLMNVNA DL YT  ++K MQ +GVTAD+ASYNILLK+CC+AGRVDLAKDIY EV+
Sbjct: 293  YVFNSLMNVNAHDLGYTFSVFKNMQNLGVTADVASYNILLKACCIAGRVDLAKDIYREVK 352

Query: 3677 KIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACANA 3498
            ++ES  VLKLDVFTY  ++K+ ADA+MWQMALKIKEDML +GV PN+  W+SLISACANA
Sbjct: 353  QLESAEVLKLDVFTYCMIVKIFADAKMWQMALKIKEDMLSSGVTPNMHIWSSLISACANA 412

Query: 3497 GLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQKGISVD 3318
            GLVEQAIQLFEEML +GC+PNSQCCNILLHACV++CQYDRAFRLF+ WK +  Q+    D
Sbjct: 413  GLVEQAIQLFEEMLLSGCKPNSQCCNILLHACVQACQYDRAFRLFQCWKGSEAQEVFHGD 472

Query: 3317 IHCKTENILCANPNHERDSRKSSSTSI--LKHQSFPVSFPFSPTTSTYNILMKACGTDYF 3144
                 + I      H +    + +T +    H +F   FPF+PT +TY++LMKACG+DY 
Sbjct: 473  HSGNADEI-----EHAQKHCPNMTTIVPNSHHLNFIKKFPFTPTPATYHMLMKACGSDYH 527

Query: 3143 RAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYTTAI 2964
            RAKALMDEMK  G+SPNHISWSILIDI G  GNV GA+QIL++M  AGV+PDVV YTTAI
Sbjct: 528  RAKALMDEMKTVGISPNHISWSILIDICGVSGNVSGAVQILKNMRLAGVEPDVVAYTTAI 587

Query: 2963 KVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKA 2784
            KVCVE KNLK AFSLFAEMKR QI PN+VTYNTLLRAR++YGSL+EVQQCLAIYQDMRKA
Sbjct: 588  KVCVETKNLKLAFSLFAEMKRCQINPNLVTYNTLLRARTRYGSLREVQQCLAIYQDMRKA 647

Query: 2783 GYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTS-PELQNLLLEKVAVHLQDINAE 2607
            GYK NDYYLKQLI EWCEG++Q+NN+ +G F S   T     ++LLLEKVA HLQ+  +E
Sbjct: 648  GYKSNDYYLKQLIEEWCEGVIQDNNQIQGGFASCKRTDLGRPRSLLLEKVAAHLQNNISE 707

Query: 2606 SLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFE--DVGSQTRHKGGVKE 2433
            +L IDLQGL+K+EARIVVLAVLRMIKE ++ G SVK+D+ I  +   V   ++    VK 
Sbjct: 708  NLAIDLQGLTKVEARIVVLAVLRMIKENYTLGYSVKEDMWITLDVSKVDPASKRDSEVKN 767

Query: 2432 AIVKLLQYDLGLEVLSAGL----RVKKDSESGGFENPFPSKDVLQGRDLPPKSPARRPAV 2265
            AI++LL+ +LGLEVL A       +K DS+S                       +  P V
Sbjct: 768  AIIELLRNELGLEVLVAVPGHLDDIKTDSKS-----------------------SLDPVV 804

Query: 2264 LQRLKIKKASLNHWLQRKGGS 2202
             QRLK+++ SL+ WLQR+ G+
Sbjct: 805  TQRLKVRRKSLHEWLQRRAGA 825


>ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At5g02830, chloroplastic-like [Cucumis sativus]
          Length = 855

 Score =  837 bits (2163), Expect = 0.0
 Identities = 462/853 (54%), Positives = 588/853 (68%), Gaps = 17/853 (1%)
 Frame = -1

Query: 4721 ILASSSVVLXXXXXXXXXXXXXHSQSSTKLKPL---LPRSFTRRTQSADSNSPLLSTVRW 4551
            IL SSS  +                S + L P    LP  F+  T +  S   LLS+V  
Sbjct: 6    ILGSSSASIAGPRRYRHSHCKAPKSSLSNLSPTGTHLP--FSSHTSTRHSPPALLSSVEL 63

Query: 4550 D---SPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVVSGVKPSVFATLLNLKLVSV 4380
            D   + S  R  +++YA +AS L++ G+  DF M+ ESVVV+GV+PS F  +L ++LV+ 
Sbjct: 64   DIAGASSGGRIPIQHYAGVASKLAEGGKLEDFAMVVESVVVAGVEPSQFGAMLAVELVAK 123

Query: 4379 GIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCDEALEIVSL 4200
            GI R L EGK+ S++QV + V++LG SV+EL DE A+E+LR +C RR+AK  E  E+V L
Sbjct: 124  GISRCLREGKVWSVVQVLRKVEELGISVLELCDEPAVESLRRDC-RRMAKSGELEELVEL 182

Query: 4199 METLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCSIMIEFGKRRDL 4020
            ME L   GFS++ ++  S +I+LCV+ R+P  AIRYA   P A +L C+ + EFGK+RDL
Sbjct: 183  MEVLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPHADILFCTTINEFGKKRDL 242

Query: 4019 VSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAPNIYVFNSL 3840
             SA   +  SK N    NMY YRTI+DVCGLCGD  KSR+IY++L+     PNI+VFNSL
Sbjct: 243  KSAYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIYQDLVNQNVTPNIFVFNSL 302

Query: 3839 MNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNEVRKIESMG 3660
            MNVNA DL YT Q+YK MQ +GV AD+ASYNILLK+CCLAGRVDLA+DIY EV+ +E+ G
Sbjct: 303  MNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTG 362

Query: 3659 VLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACANAGLVEQA 3480
            VLKLDVFTYST++KV ADA++W+MAL++KEDM  AGV PN+VTW+SLIS+CAN+GLVE A
Sbjct: 363  VLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELA 422

Query: 3479 IQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQKGISVDIHCKTE 3300
            IQLFEEM+ AGCEPN+QCCN LLHACVE  Q+DRAFRLF+ WK   +  GI  +    T+
Sbjct: 423  IQLFEEMVSAGCEPNTQCCNTLLHACVEGRQFDRAFRLFRSWKEKELWDGI--ERKSSTD 480

Query: 3299 NILCANPNHERDSRKSSSTSILKHQ-SFPVSFPFSPTTSTYNILMKACGTDYFRAKALMD 3123
            N L A+   +  + K  +     HQ SF  +  F PT +TYNILMKACGTDY+ AKALM+
Sbjct: 481  NNLDADSTSQLCTTKMPNAPSHVHQISFVGNLAFKPTITTYNILMKACGTDYYHAKALME 540

Query: 3122 EMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYTTAIKVCVE-- 2949
            EMK  GL+PNHISWSIL+DI G   +VE A+QIL +M  AGV PDVV YTTAIKV +   
Sbjct: 541  EMKSVGLTPNHISWSILVDICGRSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVSIPLA 600

Query: 2948 ----QKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKAG 2781
                + N K AFSLF EMK ++I+PN+VTY+TLLRARS YGSL EVQQCLAIYQDMRK+G
Sbjct: 601  VLVLKXNWKLAFSLFEEMKGFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSG 660

Query: 2780 YKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTS-PELQNLLLEKVAVHLQDINAES 2604
            +K ND+YLK+LI EWCEG++Q NN+   +    +     + + L+LEKVA HLQ   AES
Sbjct: 661  FKSNDHYLKELIAEWCEGVIQKNNQQPVEITPCNKIDIGKPRCLILEKVADHLQKSFAES 720

Query: 2603 LTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFEDVGSQT---RHKGGVKE 2433
            LTIDLQ L+K+EARIVVLAVLRMIKE ++ G+SVKDD+ II E    +T        V++
Sbjct: 721  LTIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVNKVETDLVPQNFEVRD 780

Query: 2432 AIVKLLQYDLGLEVLSAGLRVKKDSESGGFENPFPSKDVLQGRDLPPKSPARRPAVLQRL 2253
            AI +LLQ +LGLEVL  G  +  D       +       L+G     K   R+PA +QRL
Sbjct: 781  AITRLLQDELGLEVLPTGPTIALDKVPNSESSKISHTTKLKGTMGRNKYFTRKPADVQRL 840

Query: 2252 KIKKASLNHWLQR 2214
            K+ K SL  WLQR
Sbjct: 841  KVTKKSLQDWLQR 853


>ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Capsella rubella]
            gi|482555757|gb|EOA19949.1| hypothetical protein
            CARUB_v10000200mg [Capsella rubella]
          Length = 858

 Score =  810 bits (2093), Expect = 0.0
 Identities = 442/862 (51%), Positives = 573/862 (66%), Gaps = 19/862 (2%)
 Frame = -1

Query: 4739 MRDAVAILASSSVVLXXXXXXXXXXXXXH--------SQSSTKLKPLLPRSFTRRTQSAD 4584
            MRD V +  SSS +                       + S TKL P LP+  +       
Sbjct: 1    MRDLVIVFGSSSAITNPHHHSRRCYATAPDANRKSKPNPSLTKLLPSLPQQHSASPAPVS 60

Query: 4583 SNSPLLS----TVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVV-SGVKPS 4419
            +   L S     VRW         L+YYAD AS L++DGR  D  +IAE++   SG   +
Sbjct: 61   ATHSLSSHFSNVVRWLPDGS----LEYYADFASKLAEDGRIEDVALIAETLAAESGANVA 116

Query: 4418 VFATLLNLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRR 4239
             FA++++  L+S GI   L +GK+ S++   K ++K+G + ++L DE +++ +R +  R 
Sbjct: 117  RFASMVDFDLLSKGISSNLRQGKIESVVYTLKRIEKVGIAPLDLVDESSVKLMRKQ-FRA 175

Query: 4238 LAKCDEALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLL 4059
            +A   +  + + LME L  L F IK LVD   I++ CV+  +P+ AIRYA   P   +LL
Sbjct: 176  MANSVQVEKAIDLMEILAGLRFKIKELVDPFDIVKSCVDISNPELAIRYACLLPHTEILL 235

Query: 4058 CSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLA 3879
            C I++ FGK+ D+VS +T +E  KQ    PNMY  RT++DVCGLCGD +KSR IYE+LL 
Sbjct: 236  CRIILGFGKKGDMVSVMTAYEACKQILDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLK 295

Query: 3878 CKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAK 3699
                PNIYV NSLMNVN+ DL YTL++YK MQK+ VTAD+ SYNILLK+CCLAGRVDLA+
Sbjct: 296  ENVKPNIYVMNSLMNVNSHDLGYTLKVYKNMQKLDVTADMTSYNILLKTCCLAGRVDLAQ 355

Query: 3698 DIYNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSL 3519
            DIY E +++ES G+LKLD FTY T+IKV ADA+MW+ ALK+K+DM   GV PN  TW+SL
Sbjct: 356  DIYKEAKRMESSGLLKLDAFTYCTIIKVFADAKMWKWALKVKDDMKSVGVTPNTHTWSSL 415

Query: 3518 ISACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGI 3339
            ISACANAGLVEQA  LFEEML +GCEPNSQC NILLHACVE+CQYDRAFRLF+ WK + +
Sbjct: 416  ISACANAGLVEQANHLFEEMLASGCEPNSQCFNILLHACVEACQYDRAFRLFQSWKGSSV 475

Query: 3338 QKGISVDIHCKTENILCANPNHERD-SRKSSSTSILKHQSFPVSFPFSPTTSTYNILMKA 3162
            ++ +  D           N     D     ++ S   +      F F PTT+TYNIL+KA
Sbjct: 476  KEALYADKIVSKGRTFSPNKLKTNDPGSLVNNNSTSPYIQASNRFFFKPTTATYNILLKA 535

Query: 3161 CGTDYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVV 2982
            CGTDY+R K LMDEMK  GL+PN I+WS LID+ GG G+VEGA++ILR+M+ AG +PDVV
Sbjct: 536  CGTDYYRGKELMDEMKSLGLTPNQITWSTLIDMCGGSGDVEGAVRILRTMHSAGTRPDVV 595

Query: 2981 TYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIY 2802
             YTTAIK+C E K+LK AFSLF EM+RYQIKPN VTYNTLL+ARS+YGSL EV+QCLAIY
Sbjct: 596  AYTTAIKICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIY 655

Query: 2801 QDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGT-SPELQNLLLEKVAVHL 2625
            QDMRKAGYK ND++LK+LI EWCEG++Q N +++ +     G  +    +LL+EKVA HL
Sbjct: 656  QDMRKAGYKPNDHFLKELIEEWCEGVIQENGQSQNKISDQEGDHAGRPVSLLIEKVATHL 715

Query: 2624 QDINAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFEDVGSQT---R 2454
            Q+  A +L IDLQGL+K+EAR+VVLAVLRMIKE +  GD V DD++II     + T   +
Sbjct: 716  QERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYMRGDVVIDDVLIILGTSEANTDSGK 775

Query: 2453 HKGGVKEAIVKLLQYDLGLEVLSAGLR-VKKDSESGGFENPFPSKDVLQGRDLPPKSPAR 2277
                VKEA+VKLLQ +L L VL AG R +K+D+      N      +   +     S  R
Sbjct: 776  QDIAVKEALVKLLQEELSLVVLPAGQRNIKQDAHCVDDANQDTEHTLENTKSFISISSTR 835

Query: 2276 RPAVLQRLKIKKASLNHWLQRK 2211
            RPA+L+RL + KASL  WLQRK
Sbjct: 836  RPAILERLMVTKASLYQWLQRK 857


>ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099829|gb|ESQ40192.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 858

 Score =  805 bits (2078), Expect = 0.0
 Identities = 451/871 (51%), Positives = 582/871 (66%), Gaps = 28/871 (3%)
 Frame = -1

Query: 4739 MRDAVAILASSSVVLXXXXXXXXXXXXXHSQ------SSTKLKPLLPRSFTRRTQSADSN 4578
            MRD V +  SSSV+               ++      S TK  P LP+  T    S  S 
Sbjct: 1    MRDLVIVFGSSSVITNPHHRRCYATAPELNRKLKSTSSLTKQLPSLPQQHTSSPASVSSA 60

Query: 4577 SPLLS----TVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVV-SGVKPSVF 4413
            + L S     VRW     V    +YYAD AS L++DGR  D  +IAE++   SG   + F
Sbjct: 61   NALSSHFSDAVRWIPDGSV----EYYADFASKLAEDGRIQDVALIAETLAAESGANVARF 116

Query: 4412 ATLLNLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLA 4233
            A++++  L+S GI   L +GK+ S++   + ++K+G + ++L DE +++ +R    R +A
Sbjct: 117  ASMVDSDLLSKGISLNLRQGKIESVVYTLQRIEKVGIAPLDLVDESSVKLMRKH-FRAMA 175

Query: 4232 KCDEALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCS 4053
               +  + + LME L    F IK LVD   ++++CV+  +P  AIRYA   P   +LLC 
Sbjct: 176  NSVQVEKAIDLMEILAGFRFKIKELVDPFDVVKICVDISNPQLAIRYACLLPHTELLLCR 235

Query: 4052 IMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACK 3873
            I+  FGK+ D+VS LT +E  KQ    PNMY YRT++DVCGLCGD +KSR IYE+LL   
Sbjct: 236  IIHGFGKKGDMVSVLTAYEACKQILDNPNMYIYRTMIDVCGLCGDYVKSRYIYEDLLKEN 295

Query: 3872 FAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDI 3693
              PNIYV NSLMNVN+ DL YTL++YK MQK+ VTAD+ SYNILLK+CCLAGRVDLA+DI
Sbjct: 296  IKPNIYVMNSLMNVNSHDLGYTLKVYKNMQKLDVTADMTSYNILLKTCCLAGRVDLAQDI 355

Query: 3692 YNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLIS 3513
            Y E +++ES G+LKLD FTY T+IKV ADA+MW+MALK+KEDM   GV PN  TW+SLIS
Sbjct: 356  YKEAKRMESSGLLKLDAFTYCTIIKVFADAKMWKMALKVKEDMQSVGVTPNTHTWSSLIS 415

Query: 3512 ACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQK 3333
            ACANAGLVEQA  LFEEML +GCEPNSQC NILLHACVE+CQ+DRAFRLF+ WK +  ++
Sbjct: 416  ACANAGLVEQANHLFEEMLASGCEPNSQCFNILLHACVEACQFDRAFRLFQSWKGSSDKE 475

Query: 3332 GISVDIHCKTENILCAN--PNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNILMKAC 3159
             +  D      +I   N   NH   S  ++++S     S    F F PTT+TYNIL+KAC
Sbjct: 476  ALYADDITGKGSIFSPNKLKNHGNGSLVNTNSSPYIQAS--NRFFFKPTTATYNILLKAC 533

Query: 3158 GTDYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVT 2979
            GTDY+R K LMDEM+  GL+PN I+WS LIDI GG G+VEGA+ ILR+M+ AG +PDVV 
Sbjct: 534  GTDYYRGKELMDEMRSLGLAPNQITWSTLIDICGGSGDVEGAVGILRTMHSAGTRPDVVA 593

Query: 2978 YTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQ 2799
            YTTAIK+C E K+LK AFSLF EM+RYQIKPN VTYNTLL+ARS+YGSL EV+QCLAIYQ
Sbjct: 594  YTTAIKICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIYQ 653

Query: 2798 DMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTS-PELQNLLLEKVAVHLQ 2622
            DMRKAGYK ND++LK+LI EWCEG++Q N++++ +     GT+     +LL+EKVA HLQ
Sbjct: 654  DMRKAGYKPNDHFLKELIEEWCEGVIQENSQSQIKTSDQEGTNLGRPVSLLIEKVATHLQ 713

Query: 2621 DINAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFE------DVGSQ 2460
            +  A +L IDLQGL+K+EAR+VVLAVLRMIKE +  GD V DDL+II        D G Q
Sbjct: 714  ERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYIRGDVVTDDLLIILGTGEANIDPGKQ 773

Query: 2459 TRHKGGVKEAIVKLLQYDLGLEVLSAGLRVKKDSESGGFENPFPSKDVLQGRDLPPK--- 2289
               +  VK+ +V+LL+ +L L VL AG R   D       +     D  QG +L  +   
Sbjct: 774  ---EIAVKDVLVQLLKDELSLVVLPAGHRHVLDITL----DARCVDDADQGIELTSENTK 826

Query: 2288 -----SPARRPAVLQRLKIKKASLNHWLQRK 2211
                 S  RRPA+L+RL + KASL+ WLQRK
Sbjct: 827  SIVGISSTRRPAILERLMVTKASLHQWLQRK 857


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  800 bits (2067), Expect = 0.0
 Identities = 443/866 (51%), Positives = 573/866 (66%), Gaps = 23/866 (2%)
 Frame = -1

Query: 4739 MRDAVAILASSSVVLXXXXXXXXXXXXXH--------SQSSTKLKPLLPRSFTRRTQSAD 4584
            MRD V +  SSS +                       + S TKL P LP+  +    S  
Sbjct: 1    MRDFVIVFGSSSAITNPHHHHRRCYATAPESNRKTKSNSSFTKLLPSLPQQHSPSPASVS 60

Query: 4583 SNSPLLS----TVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVV-SGVKPS 4419
            +   L S     VRW         L+YYAD AS L++DGR  D  +IAE++   SG   +
Sbjct: 61   ATHSLSSHFSNVVRWIPDGS----LEYYADFASKLAEDGRIEDVALIAETLAAESGANVA 116

Query: 4418 VFATLLNLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRR 4239
             FA++++  L+S GI   L +GK+ S++   K ++K+G + ++L D+ +++ +R +  R 
Sbjct: 117  RFASMVDYDLLSKGISSNLRQGKIESVVYTLKRIEKVGIAPLDLVDDSSVKLMRKQ-FRA 175

Query: 4238 LAKCDEALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLL 4059
            +A   +  + + LME L  LGF IK LVD   +++ CV   +P  AIRYA   P   +LL
Sbjct: 176  MANSVQVEKAIDLMEILAGLGFKIKELVDPFDVVKSCVEISNPQLAIRYACLLPHTELLL 235

Query: 4058 CSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLA 3879
            C I+  FGK+ D+VS +T +E  KQ    PNMY  RT++DVCGLCGD +KSR IYE+LL 
Sbjct: 236  CRIIHGFGKKGDMVSVMTAYEACKQILDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLK 295

Query: 3878 CKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAK 3699
                PNIYV NSLMNVN+ DL YTL++YK MQ + VTAD+ SYNILLK+CCLAGRVDLA+
Sbjct: 296  ENIKPNIYVINSLMNVNSHDLGYTLKVYKNMQILDVTADMTSYNILLKTCCLAGRVDLAQ 355

Query: 3698 DIYNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSL 3519
            DIY E +++ES G+LKLD FTY T+IKV ADA+MW+ ALK+K+DM   GV PN  TW+SL
Sbjct: 356  DIYKEAKRMESSGLLKLDAFTYCTIIKVFADAKMWKWALKVKDDMKSVGVTPNTHTWSSL 415

Query: 3518 ISACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGI 3339
            ISACANAGLVEQA  LFEEML +GCEPNSQC NILLHACVE+CQYDRAFRLF+ WK + +
Sbjct: 416  ISACANAGLVEQANHLFEEMLASGCEPNSQCFNILLHACVEACQYDRAFRLFQSWKGSSV 475

Query: 3338 QKGISVDI-----HCKTENILCANPNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNI 3174
             + +  D         + NIL  N      +R S+S  I   +     F F PTT+TYNI
Sbjct: 476  NESLYADDIVSKGRTSSPNILKNNGPGSLVNRNSNSPYIQASK----RFCFKPTTATYNI 531

Query: 3173 LMKACGTDYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQ 2994
            L+KACGTDY+R K LMDEMK  GLSPN I+WS LID+ GG G+VEGA++ILR+M+ AG +
Sbjct: 532  LLKACGTDYYRGKELMDEMKSLGLSPNQITWSTLIDMCGGSGDVEGAVRILRTMHSAGTR 591

Query: 2993 PDVVTYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQC 2814
            PDVV YTTAIK+C E K LK AFSLF EM+RYQIKPN VTYNTLL+ARS+YGSL EV+QC
Sbjct: 592  PDVVAYTTAIKICAENKCLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQC 651

Query: 2813 LAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSG-TSPELQNLLLEKV 2637
            LAIYQDMR AGYK ND++LK+LI EWCEG++Q N R++ +     G  +    +LL+EKV
Sbjct: 652  LAIYQDMRNAGYKPNDHFLKELIEEWCEGVIQENGRSQDKISDQEGDNAGRPVSLLIEKV 711

Query: 2636 AVHLQDINAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFEDVGSQT 2457
            A H+Q+  A +L IDLQGL+KIEAR+VVLAVLRMIKE +  GD V DD++II     + T
Sbjct: 712  ATHMQERTAGNLAIDLQGLTKIEARLVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDEANT 771

Query: 2456 ---RHKGGVKEAIVKLLQYDLGLEVLSAGLR-VKKDSESGGFENPFPSKDVLQGRDLPPK 2289
               + +  V+EA+VKLL+ +L L VL AG R + +D+            D    +     
Sbjct: 772  VSGKQEITVQEALVKLLRDELSLVVLPAGQRNIIQDAHC------VDDADQENTKSFVSI 825

Query: 2288 SPARRPAVLQRLKIKKASLNHWLQRK 2211
            S  RRPA+L+RL + KASL  WLQR+
Sbjct: 826  SSTRRPAILERLMVTKASLYQWLQRR 851


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g02830, chloroplastic; Flags: Precursor
            gi|332003140|gb|AED90523.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  799 bits (2063), Expect = 0.0
 Identities = 442/866 (51%), Positives = 573/866 (66%), Gaps = 23/866 (2%)
 Frame = -1

Query: 4739 MRDAVAILASSSVVLXXXXXXXXXXXXXH--------SQSSTKLKPLLPRSFTRRTQSAD 4584
            MRD V +  SSS +                       + S TKL P LP+  +    S  
Sbjct: 1    MRDFVIVFGSSSAITNPHHHHRRCYATAPESNRKTKSNSSFTKLLPSLPQQHSPSPASVS 60

Query: 4583 SNSPLLS----TVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVV-SGVKPS 4419
            +   L S     VRW         L+YYAD AS L++DGR  D  +IAE++   SG   +
Sbjct: 61   ATHSLSSHFSNVVRWIPDGS----LEYYADFASKLAEDGRIEDVALIAETLAAESGANVA 116

Query: 4418 VFATLLNLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRR 4239
             FA++++  L+S GI   L +GK+ S++   K ++K+G + ++L D+ +++ +R +  R 
Sbjct: 117  RFASMVDYDLLSKGISSNLRQGKIESVVYTLKRIEKVGIAPLDLVDDSSVKLMRKQ-FRA 175

Query: 4238 LAKCDEALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLL 4059
            +A   +  + + LME L  LGF IK LVD   +++ CV   +P  AIRYA   P   +LL
Sbjct: 176  MANSVQVEKAIDLMEILAGLGFKIKELVDPFDVVKSCVEISNPQLAIRYACLLPHTELLL 235

Query: 4058 CSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLA 3879
            C I+  FGK+ D+VS +T +E  KQ    PNMY  RT++DVCGLCGD +KSR IYE+LL 
Sbjct: 236  CRIIHGFGKKGDMVSVMTAYEACKQILDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLK 295

Query: 3878 CKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAK 3699
                PNIYV NSLMNVN+ DL YTL++YK MQ + VTAD+ SYNILLK+CCLAGRVDLA+
Sbjct: 296  ENIKPNIYVINSLMNVNSHDLGYTLKVYKNMQILDVTADMTSYNILLKTCCLAGRVDLAQ 355

Query: 3698 DIYNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSL 3519
            DIY E +++ES G+LKLD FTY T+IKV ADA+MW+ ALK+K+DM   GV PN  TW+SL
Sbjct: 356  DIYKEAKRMESSGLLKLDAFTYCTIIKVFADAKMWKWALKVKDDMKSVGVTPNTHTWSSL 415

Query: 3518 ISACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGI 3339
            ISACANAGLVEQA  LFEEML +GCEPNSQC NILLHACVE+CQYDRAFRLF+ WK + +
Sbjct: 416  ISACANAGLVEQANHLFEEMLASGCEPNSQCFNILLHACVEACQYDRAFRLFQSWKGSSV 475

Query: 3338 QKGISVDI-----HCKTENILCANPNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNI 3174
             + +  D         + NIL  N      +R S+S  I   +     F F PTT+TYNI
Sbjct: 476  NESLYADDIVSKGRTSSPNILKNNGPGSLVNRNSNSPYIQASK----RFCFKPTTATYNI 531

Query: 3173 LMKACGTDYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQ 2994
            L+KACGTDY+R K LMDEMK  GLSPN I+WS LID+ GG G+VEGA++ILR+M+ AG +
Sbjct: 532  LLKACGTDYYRGKELMDEMKSLGLSPNQITWSTLIDMCGGSGDVEGAVRILRTMHSAGTR 591

Query: 2993 PDVVTYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQC 2814
            PDVV YTTAIK+C E K LK AFSLF EM+RYQIKPN VTYNTLL+ARS+YGSL EV+QC
Sbjct: 592  PDVVAYTTAIKICAENKCLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQC 651

Query: 2813 LAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSG-TSPELQNLLLEKV 2637
            LAIYQDMR AGYK ND++LK+LI EWCEG++Q N +++ +     G  +    +LL+EKV
Sbjct: 652  LAIYQDMRNAGYKPNDHFLKELIEEWCEGVIQENGQSQDKISDQEGDNAGRPVSLLIEKV 711

Query: 2636 AVHLQDINAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFEDVGSQT 2457
            A H+Q+  A +L IDLQGL+KIEAR+VVLAVLRMIKE +  GD V DD++II     + T
Sbjct: 712  ATHMQERTAGNLAIDLQGLTKIEARLVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDEANT 771

Query: 2456 ---RHKGGVKEAIVKLLQYDLGLEVLSAGLR-VKKDSESGGFENPFPSKDVLQGRDLPPK 2289
               + +  V+EA+VKLL+ +L L VL AG R + +D+            D    +     
Sbjct: 772  VSGKQEITVQEALVKLLRDELSLVVLPAGQRNIIQDAHC------VDDADQENTKSFVSI 825

Query: 2288 SPARRPAVLQRLKIKKASLNHWLQRK 2211
            S  RRPA+L+RL + KASL  WLQR+
Sbjct: 826  SSTRRPAILERLMVTKASLYQWLQRR 851


>ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099830|gb|ESQ40193.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 863

 Score =  798 bits (2062), Expect = 0.0
 Identities = 451/876 (51%), Positives = 582/876 (66%), Gaps = 33/876 (3%)
 Frame = -1

Query: 4739 MRDAVAILASSSVVLXXXXXXXXXXXXXHSQ------SSTKLKPLLPRSFTRRTQSADSN 4578
            MRD V +  SSSV+               ++      S TK  P LP+  T    S  S 
Sbjct: 1    MRDLVIVFGSSSVITNPHHRRCYATAPELNRKLKSTSSLTKQLPSLPQQHTSSPASVSSA 60

Query: 4577 SPLLS----TVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAESVVV-SGVKPSVF 4413
            + L S     VRW     V    +YYAD AS L++DGR  D  +IAE++   SG   + F
Sbjct: 61   NALSSHFSDAVRWIPDGSV----EYYADFASKLAEDGRIQDVALIAETLAAESGANVARF 116

Query: 4412 ATLLNLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLA 4233
            A++++  L+S GI   L +GK+ S++   + ++K+G + ++L DE +++ +R    R +A
Sbjct: 117  ASMVDSDLLSKGISLNLRQGKIESVVYTLQRIEKVGIAPLDLVDESSVKLMRKH-FRAMA 175

Query: 4232 KCDEALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCS 4053
               +  + + LME L    F IK LVD   ++++CV+  +P  AIRYA   P   +LLC 
Sbjct: 176  NSVQVEKAIDLMEILAGFRFKIKELVDPFDVVKICVDISNPQLAIRYACLLPHTELLLCR 235

Query: 4052 IMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACK 3873
            I+  FGK+ D+VS LT +E  KQ    PNMY YRT++DVCGLCGD +KSR IYE+LL   
Sbjct: 236  IIHGFGKKGDMVSVLTAYEACKQILDNPNMYIYRTMIDVCGLCGDYVKSRYIYEDLLKEN 295

Query: 3872 FAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDI 3693
              PNIYV NSLMNVN+ DL YTL++YK MQK+ VTAD+ SYNILLK+CCLAGRVDLA+DI
Sbjct: 296  IKPNIYVMNSLMNVNSHDLGYTLKVYKNMQKLDVTADMTSYNILLKTCCLAGRVDLAQDI 355

Query: 3692 YNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLIS 3513
            Y E +++ES G+LKLD FTY T+IKV ADA+MW+MALK+KEDM   GV PN  TW+SLIS
Sbjct: 356  YKEAKRMESSGLLKLDAFTYCTIIKVFADAKMWKMALKVKEDMQSVGVTPNTHTWSSLIS 415

Query: 3512 ACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQK 3333
            ACANAGLVEQA  LFEEML +GCEPNSQC NILLHACVE+CQ+DRAFRLF+ WK +  ++
Sbjct: 416  ACANAGLVEQANHLFEEMLASGCEPNSQCFNILLHACVEACQFDRAFRLFQSWKGSSDKE 475

Query: 3332 GISVDIHCKTENILCAN--PNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNILMKAC 3159
             +  D      +I   N   NH   S  ++++S     S    F F PTT+TYNIL+KAC
Sbjct: 476  ALYADDITGKGSIFSPNKLKNHGNGSLVNTNSSPYIQAS--NRFFFKPTTATYNILLKAC 533

Query: 3158 GTDYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVT 2979
            GTDY+R K LMDEM+  GL+PN I+WS LIDI GG G+VEGA+ ILR+M+ AG +PDVV 
Sbjct: 534  GTDYYRGKELMDEMRSLGLAPNQITWSTLIDICGGSGDVEGAVGILRTMHSAGTRPDVVA 593

Query: 2978 YTTAIK-----VCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQC 2814
            YTTAIK     +C E K+LK AFSLF EM+RYQIKPN VTYNTLL+ARS+YGSL EV+QC
Sbjct: 594  YTTAIKHAIFQICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQC 653

Query: 2813 LAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQFVSHSGTS-PELQNLLLEKV 2637
            LAIYQDMRKAGYK ND++LK+LI EWCEG++Q N++++ +     GT+     +LL+EKV
Sbjct: 654  LAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQENSQSQIKTSDQEGTNLGRPVSLLIEKV 713

Query: 2636 AVHLQDINAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIFE------ 2475
            A HLQ+  A +L IDLQGL+K+EAR+VVLAVLRMIKE +  GD V DDL+II        
Sbjct: 714  ATHLQERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYIRGDVVTDDLLIILGTGEANI 773

Query: 2474 DVGSQTRHKGGVKEAIVKLLQYDLGLEVLSAGLRVKKDSESGGFENPFPSKDVLQGRDLP 2295
            D G Q   +  VK+ +V+LL+ +L L VL AG R   D       +     D  QG +L 
Sbjct: 774  DPGKQ---EIAVKDVLVQLLKDELSLVVLPAGHRHVLDITL----DARCVDDADQGIELT 826

Query: 2294 PK--------SPARRPAVLQRLKIKKASLNHWLQRK 2211
             +        S  RRPA+L+RL + KASL+ WLQRK
Sbjct: 827  SENTKSIVGISSTRRPAILERLMVTKASLHQWLQRK 862


>ref|XP_002525196.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223535493|gb|EEF37162.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 786

 Score =  795 bits (2052), Expect = 0.0
 Identities = 422/743 (56%), Positives = 537/743 (72%), Gaps = 9/743 (1%)
 Frame = -1

Query: 4403 LNLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIEALRHECHRRLAKCD 4224
            L++  ++ GI + L E  + S++       +LG    +LFD  +++ L+ EC  R+    
Sbjct: 44   LHMIALAKGISKNLRERNVDSVVDALNTADQLGLPPSQLFDAASMDLLKTEC-LRIVNFG 102

Query: 4223 EALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQNFPQAGMLLCSIMI 4044
               +I+ LMETL    FSIK LV+ S +I+LCV++R+P  A+RYA+ FP  G+L+CSI+ 
Sbjct: 103  RLEDIILLMETLAGYSFSIKELVEPSRVIKLCVHQRNPHLAVRYARLFPHEGILMCSIVK 162

Query: 4043 EFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKSRSIYEELLACKFAP 3864
            +FGK+ DL SAL  +E   Q+   P+MY YR ++DVCGLCGD ++SR I+E++++ K  P
Sbjct: 163  QFGKKGDLDSALAAYEAYMQHSTVPDMYLYRALIDVCGLCGDYMQSRYIFEDIVSQKVIP 222

Query: 3863 NIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCCLAGRVDLAKDIYNE 3684
            NI+VFNSLMNVNA DL YTL +YK+MQ +GVTAD+ SYNILLKSC LAG+VDLA+DIY E
Sbjct: 223  NIFVFNSLMNVNAHDLGYTLHVYKKMQNLGVTADMTSYNILLKSCSLAGKVDLAQDIYRE 282

Query: 3683 VRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVIPNIVTWTSLISACA 3504
             +++E  G+LKLD FTY T+IK+ ADA++WQ+ALKIKEDML +GV PN  TW+SLISA A
Sbjct: 283  AKQLELAGLLKLDDFTYCTIIKIFADAKLWQLALKIKEDMLSSGVTPNTFTWSSLISASA 342

Query: 3503 NAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRLFKGWKANGIQKGIS 3324
            NAGLV+QAI+LFEEML AGC PNS CCNILLHACVE+CQYDRAFRLF  WK + IQ   +
Sbjct: 343  NAGLVDQAIKLFEEMLLAGCVPNSHCCNILLHACVEACQYDRAFRLFNAWKGSEIQNTFT 402

Query: 3323 VDIHCKTENILCA-NPNHERDSRKSSSTSILKHQSFPVSFPFSPTTSTYNILMKACGTDY 3147
             D +C  ++I  A +   +      +  S   H SF   FPF+P+++TYN LMKACG+DY
Sbjct: 403  TDYNCPVDDISSAMHACEDYIITVPNLASNSLHLSFLKKFPFTPSSATYNTLMKACGSDY 462

Query: 3146 FRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQILRSMYQAGVQPDVVTYTTA 2967
             RAKALMDEM+  GLSPNHISWSILIDI G  GN+EGA+QIL++M  AG++PDV+ YTTA
Sbjct: 463  NRAKALMDEMQAVGLSPNHISWSILIDICGSSGNMEGAIQILKNMRMAGIEPDVIAYTTA 522

Query: 2966 IKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRK 2787
            IKV VE KNLK AFSLFAEMKRYQ+KPN+VTY+TLLRAR++YGSL+EVQQCLAIYQDMRK
Sbjct: 523  IKVSVESKNLKMAFSLFAEMKRYQLKPNLVTYDTLLRARTRYGSLKEVQQCLAIYQDMRK 582

Query: 2786 AGYKHNDYYLKQLIGEWCEGILQNNNRNKGQF-VSHSGTSPELQNLLLEKVAVHLQDINA 2610
            AGYK ND YLKQLI EWCEG++Q+N++ +  F            +LLLEKVA HL    A
Sbjct: 583  AGYKSNDNYLKQLIEEWCEGVIQDNDQCQDDFKPCKRAEFGRPHSLLLEKVAAHLHHNVA 642

Query: 2609 ESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVKDDLMIIF----EDVGSQTRHKGG 2442
            ESL++DLQGL+K+EARIVVLAVLRM+KE +  G  VKDD+ I       DV   T+ K  
Sbjct: 643  ESLSVDLQGLTKVEARIVVLAVLRMVKENYIQGHLVKDDMSITLGIDKVDVLPATQ-KAE 701

Query: 2441 VKEAIVKLLQYDLGLEVLSAGLRVKKDSESGGFE---NPFPSKDVLQGRDLPPKSPARRP 2271
            VK+AI KLL  +LGLEVL    R   D E+   E   N + +     GR+    S ARRP
Sbjct: 702  VKDAIFKLLHNELGLEVLIVVPRYTADLET-DLEIPLNSYQNWSKSSGRENIRVSSARRP 760

Query: 2270 AVLQRLKIKKASLNHWLQRKGGS 2202
             VLQRLK+ + SL+ WLQRK G+
Sbjct: 761  LVLQRLKVTRNSLHSWLQRKAGA 783


>ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [Amborella trichopoda]
            gi|548832949|gb|ERM95718.1| hypothetical protein
            AMTR_s00023p00232870 [Amborella trichopoda]
          Length = 855

 Score =  768 bits (1982), Expect = 0.0
 Identities = 424/823 (51%), Positives = 551/823 (66%), Gaps = 17/823 (2%)
 Frame = -1

Query: 4619 PRSFTRRTQSAD--SNSPLLSTVRWDSPSQVRNRLKYYADLASNLSDDGRFTDFLMIAES 4446
            P++ T+ T S    S++PLLS +R D   Q  + LK+YA +AS L+++GR  +F M+AES
Sbjct: 34   PQNPTKTTLSLKYLSSTPLLSDIRPDLGLQNPSSLKFYASMASKLAENGRLDEFSMLAES 93

Query: 4445 VVVSGVKPSVFATLLNLKLVSVGIRRMLSEGKLVSLIQVFKGVKKLGFSVVELFDELAIE 4266
             + SG+ P  F   L++K VS G    L  G+  +++ V +   KLG     +FD  A  
Sbjct: 94   FIGSGMAPGHFVEALSIKHVSAGFALCLKNGEFDTVLGVMEKFDKLGICPSLIFDGSARR 153

Query: 4265 ALRHECHRRLAKCDEALEIVSLMETLQELGFSIKGLVDASHIIRLCVNKRDPDAAIRYAQ 4086
             L   C RR+   D   E V L+E      FS+K +V  + I++ C+++ DP  A RYA 
Sbjct: 154  LLLSAC-RRVLDGDNIGEFVRLVEIFAGYRFSVKDVVKPTFILQACIDRHDPFMAGRYAS 212

Query: 4085 NFPQAGMLLCSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRTIVDVCGLCGDCLKS 3906
              P A +    ++ EFGK++DL SAL  FEVSK     PNMY YR+I+D CG CGD LKS
Sbjct: 213  ILPHADVWFNFLICEFGKKKDLQSALVAFEVSKGKSVSPNMYIYRSIIDACGYCGDSLKS 272

Query: 3905 RSIYEELLACKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVTADLASYNILLKSCC 3726
            RSI+E+LL  K  PN +VFNSLMNVNA D  Y L IYKQM+K+GV AD+ASYN+LLK CC
Sbjct: 273  RSIFEDLLVQKITPNTFVFNSLMNVNAHDSHYALHIYKQMKKLGVAADMASYNVLLKVCC 332

Query: 3725 LAGRVDLAKDIYNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQMALKIKEDMLLAGVI 3546
            LAGRVDLA++IY E+ +    G LKLDV TYST+IKV ADA+MW+MA KIK+DM+ AGV 
Sbjct: 333  LAGRVDLAQEIYEEILQRALFGGLKLDVITYSTIIKVFADAKMWEMAFKIKDDMISAGVS 392

Query: 3545 PNIVTWTSLISACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLHACVESCQYDRAFRL 3366
            PNIVTW+SLISACANAGLVE+ IQ+ EEML  GCEPN+QCCNILL+ACVESCQ+DRAFR+
Sbjct: 393  PNIVTWSSLISACANAGLVERVIQVLEEMLVVGCEPNTQCCNILLNACVESCQFDRAFRI 452

Query: 3365 FKGWKANGIQKGISV-DIHCKT-----ENILCANPNHERDSRKSSSTSILKHQ-SFPVSF 3207
            F  WK NG   G +  +   KT     +N   ++ NHE      +S ++  H  +F    
Sbjct: 453  FHFWKQNGFSMGSNAKECGSKTVTDIKQNEYFSSGNHE---FHITSDALDPHDLNFSEVI 509

Query: 3206 PFSPTTSTYNILMKACGTDYFRAKALMDEMKMEGLSPNHISWSILIDIFGGKGNVEGALQ 3027
            PF PT +TYNILMKACGTDY+RA+ALMDEMK  GLSPNHISWSILIDI G   N++GA+Q
Sbjct: 510  PFKPTVATYNILMKACGTDYYRAQALMDEMKAGGLSPNHISWSILIDICGRSYNMKGAIQ 569

Query: 3026 ILRSMYQAGVQPDVVTYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPNMVTYNTLLRARS 2847
              +SMY AG+ PDVV YTTAIK CV  K  K AFSLF EMKR++++PN+VTYNTLL ARS
Sbjct: 570  AFKSMYNAGIIPDVVAYTTAIKACVGNKYFKMAFSLFEEMKRHRLQPNLVTYNTLLTARS 629

Query: 2846 QYGSLQEVQQCLAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNKGQF-VSHSGTS 2670
            +YGSL EV QCLAIYQDMRKAGY  ND +LK+L+ EWCEG++ +  +   +  +      
Sbjct: 630  RYGSLDEVLQCLAIYQDMRKAGYNSNDRFLKELLEEWCEGVISDKGKRWSELNIDKCDKG 689

Query: 2669 PEL---QNLLLEKVAVHLQDINAESLTIDLQGLSKIEARIVVLAVLRMIKEQHSPGDSVK 2499
             E+   Q+LLLEKVA +LQ+  AE+LTIDL+GL+K+EARI+VLA LRM+KE +  G  V+
Sbjct: 690  SEVYGPQSLLLEKVAAYLQENFAENLTIDLRGLTKVEARIIVLAKLRMLKENYILGKPVR 749

Query: 2498 DDLMIIFEDVGSQ---TRHKGGVKEAIVKLLQYDLGLEVLSA-GLRVKKDSESGGFENPF 2331
            DD++II  +  S       +  V++A++++LQ +LGL VL    L       +    +  
Sbjct: 750  DDMIIITANTRSNMDAAETELRVRDAVIRVLQGELGLSVLEGPELGELSTRHAHVISSLS 809

Query: 2330 PSKDVLQGRDLPPKSPARRPAVLQRLKIKKASLNHWLQRKGGS 2202
            P    +  R    +   RRP  +QRLKI + SLN WLQ+  G+
Sbjct: 810  PETLTMSKRPQLREYTTRRPVDVQRLKIPRRSLNLWLQKGVGT 852


>ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cicer arietinum]
          Length = 799

 Score =  752 bits (1942), Expect = 0.0
 Identities = 407/714 (57%), Positives = 497/714 (69%), Gaps = 15/714 (2%)
 Frame = -1

Query: 4307 GFSVVELFDELAIEALRHECHRRLAKCDEALEIVSLMETLQELGFSIKGLVDASHIIRLC 4128
            G S+   FDE A+  +  EC   +       E V LME L     SI  LV  S II+ C
Sbjct: 104  GISLSTQFDESAMSVIAKECSFMVTS-GHIQESVELMEVLSRYQLSIGELVQPSDIIKRC 162

Query: 4127 VNKRDPDAAIRYAQNFPQAGMLLCSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMYAYRT 3948
            V  R P+ A+RYA   PQA +L CSI+  FGK RDLVSAL  ++  K+N   PNMY YR 
Sbjct: 163  VLNRKPNLAVRYASLLPQAHILFCSIISGFGKSRDLVSALKAYDAMKKNLKRPNMYIYRA 222

Query: 3947 IVDVCGLCGDCLKSRSIYEELLACKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQKVGVT 3768
            I+DVCGLCGD +KSR IYE+LL  K  PNIYVFNSLMN NA D++YTL +Y+ MQKVG+ 
Sbjct: 223  IIDVCGLCGDFMKSRYIYEDLLNQKITPNIYVFNSLMNANAHDISYTLNLYQNMQKVGLK 282

Query: 3767 ADLASYNILLKSCCLAGRVDLAKDIYNEVRKIESMGVLKLDVFTYSTMIKVLADARMWQM 3588
             D+ SYNILLK+CC+AGRVDLA+D+Y E++ +ES+G LKLDVFTYST+IKV ADA++WQM
Sbjct: 283  PDMTSYNILLKACCVAGRVDLAQDMYKELKHLESIGQLKLDVFTYSTIIKVFADAKLWQM 342

Query: 3587 ALKIKEDMLLAGVIPNIVTWTSLISACANAGLVEQAIQLFEEMLWAGCEPNSQCCNILLH 3408
            ALKIK DMLLAGV  N V W+SLI+ACA+AGLVEQAIQLFEEML +GCEPN+QC NI+LH
Sbjct: 343  ALKIKHDMLLAGVSLNTVAWSSLINACAHAGLVEQAIQLFEEMLLSGCEPNTQCFNIILH 402

Query: 3407 ACVESCQYDRAFRLFKGWKANGIQKGISVDIHCKTENILCANPNHERDSRKSSSTSILKH 3228
            ACVE CQYDRAFR F  WK N        + H         N N E     S +T++ K 
Sbjct: 403  ACVEGCQYDRAFRFFYSWKGNKTLVSFG-ESH---------NSNAEEGGMDSVTTTVPKG 452

Query: 3227 ------QSFPVSFPFSPTTSTYNILMKACGTDYFRAKALMDEMKMEGLSPNHISWSILID 3066
                   SF   FPF PTTSTYN L+KACGT+Y+ AKAL++EMK  GLSPN ISWSILI+
Sbjct: 453  ISSSHIMSFTERFPFKPTTSTYNTLLKACGTNYYHAKALINEMKTVGLSPNQISWSILIN 512

Query: 3065 IFGGKGNVEGALQILRSMYQAGVQPDVVTYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKP 2886
            I GG  NVEGA++ILR+M  AGV+PDVV YTTAIKVCVE KN   A +L+ EMK Y+ +P
Sbjct: 513  ICGGSENVEGAIEILRTMIDAGVKPDVVAYTTAIKVCVESKNFTKALTLYEEMKSYETQP 572

Query: 2885 NMVTYNTLLRARSQYGSLQEVQQCLAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNR 2706
            N+VTYNTLLRARS+YGSL+EVQQCLAIYQDMRKAGYK NDYYL++LI EWCEG++Q+N  
Sbjct: 573  NLVTYNTLLRARSKYGSLREVQQCLAIYQDMRKAGYKPNDYYLEELIEEWCEGVIQDNEE 632

Query: 2705 NKGQFVSHSGTSPEL---QNLLLEKVAVHLQDINAESLTIDLQGLSKIEARIVVLAVLRM 2535
             + +F   S   PE+   ++LLLEK+A HL    A+ L ID+QGLSK+EAR+V+LAVLRM
Sbjct: 633  YEVEF--SSSKKPEIERPESLLLEKIAAHLLKRVADILAIDVQGLSKVEARLVILAVLRM 690

Query: 2534 IKEQHSPGDSVKDDLMIIF---EDVGSQTRHKGGVKEAIVKLLQYDLGLEVLSAGLRVKK 2364
            IKE ++ G SV DD++II    +   S  +    V+EA++KLL+ +LGLE L A  R   
Sbjct: 691  IKENYAFGHSVNDDILIIIGATKADESPAKEILEVQEAVIKLLRNELGLEALPAKTRFAP 750

Query: 2363 DSE---SGGFENPFPSKDVLQGRDLPPKSPARRPAVLQRLKIKKASLNHWLQRK 2211
                      EN  P+  V            RRPAVLQRLK+ K SL+ WLQR+
Sbjct: 751  SDSPKLQNTKENALPTTMVFH---------TRRPAVLQRLKVTKQSLHRWLQRR 795


>gb|ESW34707.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 809

 Score =  743 bits (1917), Expect = 0.0
 Identities = 407/773 (52%), Positives = 523/773 (67%), Gaps = 11/773 (1%)
 Frame = -1

Query: 4496 NLSDDGRFTD-FLMIAESVVVSGVKPSVFATLLNLKLVSVGIRRMLSEGKLVSLIQVFKG 4320
            +LS D +  + F ++    + SGV   V A ++ L +    +R         S++     
Sbjct: 53   SLSADSKLVEEFEVVDGDAIDSGVDAEVLAKMVLLGIQGNSVR---------SVVHTLNR 103

Query: 4319 VKKLGFSVVELFDELAIEALRHECHRRLAKCDEALEIVSLMETLQELGFSIKGLVDASHI 4140
            V+    S+    +  +I+A+  EC R L  C    E V LME L     SI+G V  S +
Sbjct: 104  VQDHSVSLASHLNGSSIDAIAKECCR-LVMCGHIEEAVELMEVLTRFKISIRGFVQPSDV 162

Query: 4139 IRLCVNKRDPDAAIRYAQNFPQAGMLLCSIMIEFGKRRDLVSALTVFEVSKQNQGCPNMY 3960
            I+ CV  R+P  A+RYA   P A +L CSI+ EFGKRRDL+SA   +E+SK++   PNMY
Sbjct: 163  IKRCVLSRNPILAVRYACLLPHAQILFCSIISEFGKRRDLISAFKAYELSKKHMNIPNMY 222

Query: 3959 AYRTIVDVCGLCGDCLKSRSIYEELLACKFAPNIYVFNSLMNVNASDLTYTLQIYKQMQK 3780
             YR I+D CGLC D +KSR IYE+LL  K  PNIYVFNSLMNVNA DL+YTL +Y+ MQ 
Sbjct: 223  MYRAIIDACGLCRDYMKSRYIYEDLLNQKITPNIYVFNSLMNVNAHDLSYTLNLYQNMQN 282

Query: 3779 VGVTADLASYNILLKSCCLAGRVDLAKDIYNEVRKIESMGVLKLDVFTYSTMIKVLADAR 3600
            +G+  D+ SYNILLK CC+AGRVDLA+DIY E++ +ES+G LKLDVFTYST+IKV ADAR
Sbjct: 283  LGLKPDMTSYNILLKGCCVAGRVDLAQDIYRELKHLESVGQLKLDVFTYSTIIKVFADAR 342

Query: 3599 MWQMALKIKEDMLLAGVIPNIVTWTSLISACANAGLVEQAIQLFEEMLWAGCEPNSQCCN 3420
            +WQMAL IK+DML AGV  NIV W+SLI+ACA+AGLVEQAIQLFEEML AG EPN+QC N
Sbjct: 343  LWQMALTIKQDMLSAGVSLNIVAWSSLINACAHAGLVEQAIQLFEEMLLAGREPNTQCFN 402

Query: 3419 ILLHACVESCQYDRAFRLFKGWKANGIQKGISVDIHCKTENILCANPNHERDSRKSSSTS 3240
            I+L+ACVE+CQYDRAFR F  WK   +        +  T   L  N     +    S++ 
Sbjct: 403  IILNACVEACQYDRAFRFFHSWKGKKMLGSFGEGCNNNTRQELVHNVTTVPNG--ISNSH 460

Query: 3239 ILKHQSFPVSFPFSPTTSTYNILMKACGTDYFRAKALMDEMKMEGLSPNHISWSILIDIF 3060
            IL   SF   FPF+PTT+TYNIL+KACGTDY+ AKAL+ EM+  GLSPN ISWS LIDI 
Sbjct: 461  IL---SFAERFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSTLIDIC 517

Query: 3059 GGKGNVEGALQILRSMYQAGVQPDVVTYTTAIKVCVEQKNLKFAFSLFAEMKRYQIKPNM 2880
            G   NVEGA++IL++M  AG++PDV+ YTTAIKVCVE KN   A +L+ EMK Y I+PN+
Sbjct: 518  GASANVEGAIEILKNMGDAGIKPDVIAYTTAIKVCVESKNFMQALALYKEMKSYHIRPNL 577

Query: 2879 VTYNTLLRARSQYGSLQEVQQCLAIYQDMRKAGYKHNDYYLKQLIGEWCEGILQNNNRNK 2700
            +TYNTLL+ARS+YGSL EVQQCLAIYQDMRKAGYK ND YL++LI EWCEG++Q+N   +
Sbjct: 578  ITYNTLLKARSKYGSLHEVQQCLAIYQDMRKAGYKPNDCYLEELIEEWCEGVIQDNREIQ 637

Query: 2699 GQFVSHSGTSPE-LQNLLLEKVAVHLQDINAESLTIDLQGLSKIEARIVVLAVLRMIKEQ 2523
            G+F S + +  E  Q+LLLEK+A HL    A+ L ID+QGL+K+EAR+VVLAVLRMIKE 
Sbjct: 638  GEFSSSNKSELEKSQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLAVLRMIKEN 697

Query: 2522 HSPGDSVKDDLMIIFEDV---GSQTRHKGGVKEAIVKLLQYDLGLEVLSAGLRVKKDSES 2352
            +S G S+ DD++I+        +  +    V+EAI+KLL+ +LGLE   A  R+   S++
Sbjct: 698  YSLGHSINDDILIVIGATKVDENPAKRILEVQEAILKLLRNELGLEAFPARTRLAL-SDT 756

Query: 2351 GGFENPFPSK---DVLQGRDLPPKS---PARRPAVLQRLKIKKASLNHWLQRK 2211
               +NP  +    + +   D  P S     RRP +L RLKI + SL  WL RK
Sbjct: 757  PKLKNPTLANLKIEAVPAEDALPTSMGFQTRRPGILVRLKITRKSLYSWLHRK 809


Top