BLASTX nr result

ID: Ephedra28_contig00004403 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00004403
         (2837 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006841446.1| hypothetical protein AMTR_s00003p00075520 [A...   939   0.0  
gb|EOX95298.1| S uncoupled 1 [Theobroma cacao]                        928   0.0  
ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containi...   927   0.0  
ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Popu...   923   0.0  
ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containi...   920   0.0  
ref|XP_002515260.1| pentatricopeptide repeat-containing protein,...   920   0.0  
ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citr...   920   0.0  
ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutr...   913   0.0  
ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Popu...   907   0.0  
gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]     894   0.0  
ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containi...   894   0.0  
ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containi...   893   0.0  
ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Caps...   890   0.0  
gb|EMJ22752.1| hypothetical protein PRUPE_ppa001263mg [Prunus pe...   888   0.0  
ref|XP_006410275.1| hypothetical protein EUTSA_v10016219mg [Eutr...   887   0.0  
ref|XP_002881173.1| pentatricopeptide repeat-containing protein ...   882   0.0  
ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidop...   882   0.0  
ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Popu...   879   0.0  
ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containi...   878   0.0  
ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containi...   855   0.0  

>ref|XP_006841446.1| hypothetical protein AMTR_s00003p00075520 [Amborella trichopoda]
            gi|548843467|gb|ERN03121.1| hypothetical protein
            AMTR_s00003p00075520 [Amborella trichopoda]
          Length = 857

 Score =  939 bits (2428), Expect = 0.0
 Identities = 487/826 (58%), Positives = 616/826 (74%), Gaps = 19/826 (2%)
 Frame = +1

Query: 97   VHHHHPTRRVV-SSALRTTSGPAHLS--------TISSNXXXXXXXXXXXQLGSDFCGKR 249
            VHHH P ++   +SA + TS  A  S        + SS+           +LGSDF G+R
Sbjct: 20   VHHHQPPQKFTFNSATKPTSKNASASHSLSPNFPSFSSSLSHPQTQKPKPELGSDFNGRR 79

Query: 250  PKRGISKNNLGRGNSMKCNQSLTAQAVLSDVLNAPLEQPVDNFLKGKRLVL---DDYVSL 420
              R +SK +  R        S  A+  L  +  A  +  V+  L      +   +D++ L
Sbjct: 80   STRFVSKMHFNRPKHGPKRHSSVAETALGHLTCADSDATVEAILTNLVFSVSSSEDFLFL 139

Query: 421  MKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKAD 600
            ++ELG+RGECSKA+ CF++AVSREKR  EQGKLV++MISILG+LG+VD+A++VF+ A+ D
Sbjct: 140  LRELGNRGECSKAIRCFEFAVSREKRRTEQGKLVSVMISILGRLGKVDIAREVFETARKD 199

Query: 601  GYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQ 780
            GYG +VYAFS+L++AYGRSG   EA+ VFE+M+ SG  PN+VTYN+VIDACGKGGV+F +
Sbjct: 200  GYGNSVYAFSSLINAYGRSGHCGEALGVFEMMRNSGFKPNLVTYNSVIDACGKGGVEFSR 259

Query: 781  AMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLD 960
            A+  F+EM +  VKPDRITFNSL+AVCSRGG W EA+  F +M+ RGI++D+FTYNTLLD
Sbjct: 260  ALKVFEEMEREGVKPDRITFNSLLAVCSRGGFWEEAKKCFNEMVFRGIDRDVFTYNTLLD 319

Query: 961  ALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPL 1140
            A+CKGGQM+LA   + DM  +N+ PN +TYST+IDGY KAGRL EAL+L+ EMK AGI L
Sbjct: 320  AVCKGGQMELALEIMSDMPSKNVLPNVVTYSTMIDGYFKAGRLEEALNLFQEMKLAGINL 379

Query: 1141 DRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALF 1320
            DRVSYNTLLS+YA+ G F++ALR+C EME AG K+D VTYN+LLGGYGKQGKYD VK LF
Sbjct: 380  DRVSYNTLLSIYARMGLFDDALRVCGEMERAGIKRDAVTYNSLLGGYGKQGKYDVVKHLF 439

Query: 1321 EEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKS 1500
            +EM+ + V PNVLTYSTLID+YSKGG   EA++V  EFK+ GLK DVVLYSA IDALCK+
Sbjct: 440  KEMKVEAVRPNVLTYSTLIDIYSKGGLLKEALEVFMEFKRVGLKADVVLYSALIDALCKN 499

Query: 1501 GSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSS----KGNSRSYGQ---SAEFLADN 1659
            G V  A  LLD+MT EGI PNVVT N IID+ GRS+    + +S   G+    +  +  +
Sbjct: 500  GLVESAFLLLDEMTGEGIRPNVVTYNCIIDAFGRSNQTQVQNDSYEMGKGPLDSSMIDSS 559

Query: 1660 DELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKSEDILGAVD 1839
             E+V     L+ VS   A++NE  ++    L     +      + + +K KS ++L  + 
Sbjct: 560  SEIV-----LAEVSRGMAKENEGIDHLVKMLGPPPLD--KRHPVIKNMKGKSHEMLCILA 612

Query: 1840 IFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRD 2019
            +FH+M ++ I+PNVVTFS ILNACSRC SF +ASMLLEELR+FD++VYGVAHGLLMG R 
Sbjct: 613  LFHKMHEMDIRPNVVTFSAILNACSRCHSFDDASMLLEELRLFDNQVYGVAHGLLMGLRK 672

Query: 2020 RVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCE 2199
             +W +AQ+LFDEV RMDS+TASAFYNALTDMLWHFGQR+GAQLVV++G+ RQVWEN WCE
Sbjct: 673  DIWVQAQSLFDEVRRMDSSTASAFYNALTDMLWHFGQRRGAQLVVMEGKRRQVWENVWCE 732

Query: 2200 SCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAV 2379
            SCLDLHLMSAGAAQAMVHAWLL+IRS+VFEG ELPKLL+ILTGWGKHSKV GDS+LR+A+
Sbjct: 733  SCLDLHLMSAGAAQAMVHAWLLTIRSVVFEGHELPKLLNILTGWGKHSKVAGDSSLRKAI 792

Query: 2380 ETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRDK 2517
            E LL ++GAPF +AKFN GRF STG VV AWL+ES TL+LLIL D+
Sbjct: 793  EALLTSIGAPFEVAKFNVGRFISTGAVVGAWLKESRTLKLLILHDE 838


>gb|EOX95298.1| S uncoupled 1 [Theobroma cacao]
          Length = 866

 Score =  928 bits (2398), Expect = 0.0
 Identities = 475/768 (61%), Positives = 584/768 (76%), Gaps = 4/768 (0%)
 Frame = +1

Query: 223  LGSDFCGKRPKRGISKNNLGRGN-SMKCNQSLTAQAVLSDVLN---APLEQPVDNFLKGK 390
            L  DF G+R  R +SK +LGR   S     +  A+ VL   L+   + LE+ + +F + K
Sbjct: 86   LAPDFSGRRSTRFVSKMHLGRPKTSTNTRHTSIAEEVLQLALHNGHSGLERVLVSF-ESK 144

Query: 391  RLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLA 570
                DDY  L++ELG+RGE  KA++CF++AV RE+R  EQGKL + MISILG+LG+V+LA
Sbjct: 145  LCGSDDYTFLLRELGNRGEYEKAIKCFQFAVRRERRKTEQGKLASAMISILGRLGKVELA 204

Query: 571  QQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDA 750
            + +F+ A  +GYG  VYAFSAL+SA+GRSG   EAI VF+ MK +G  PN+VTYNAVIDA
Sbjct: 205  KGIFETALTEGYGNTVYAFSALISAFGRSGYSDEAIKVFDSMKNNGLKPNLVTYNAVIDA 264

Query: 751  CGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQ 930
            CGKGGV+FK+ +  FDEM++  V+PDRITFNSL+AVCSRGG W  A++LF +M+HRGI+Q
Sbjct: 265  CGKGGVEFKRVVEIFDEMLRSGVQPDRITFNSLLAVCSRGGLWEAARNLFSEMVHRGIDQ 324

Query: 931  DIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLY 1110
            DIFTYNTLLDA+CKGGQMDLA   + +M  +NI PN +TYST+IDGYAKAGR ++AL+L+
Sbjct: 325  DIFTYNTLLDAVCKGGQMDLAFEIMAEMPTKNILPNVVTYSTMIDGYAKAGRFDDALNLF 384

Query: 1111 HEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQ 1290
            +EMK  GI LDRVSYNT+LS+YAK GRFEEAL ICREME +G +KD VTYNALLGGYGKQ
Sbjct: 385  NEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALDICREMEGSGIRKDVVTYNALLGGYGKQ 444

Query: 1291 GKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVLY 1470
            GKYD+V+ LFEEM+ Q+V+PN+LTYST+IDVYSKGG   EAM V  EFK+ GLK DVVLY
Sbjct: 445  GKYDEVRRLFEEMKTQKVSPNLLTYSTVIDVYSKGGLYEEAMDVFREFKRVGLKADVVLY 504

Query: 1471 SAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFL 1650
            SA IDALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GRS+            F 
Sbjct: 505  SALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSAT-------SECAFD 557

Query: 1651 ADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKSEDILG 1830
            A  +      E+ S V         R   +   ++   + A       ++  +  ++IL 
Sbjct: 558  AGGEISALQTESSSLVIGHSIEGKARDGEDNQVIKFFGQLAAEKGGQAKKDCRGKQEILC 617

Query: 1831 AVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMG 2010
             + +F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGVAHGLLMG
Sbjct: 618  ILGVFQKMHELEIKPNVVTFSAILNACSRCDSFEDASMLLEELRLFDNQVYGVAHGLLMG 677

Query: 2011 FRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENA 2190
            +R+ VW +AQ+LFDEV  MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ RQVWEN 
Sbjct: 678  YRENVWIQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENV 737

Query: 2191 WCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLR 2370
            W  SCLDLHLMS+GAA+AMVHAWLL+IRSI+FEG ELPKLLSILTGWGKHSKV GD  LR
Sbjct: 738  WSNSCLDLHLMSSGAARAMVHAWLLNIRSIIFEGHELPKLLSILTGWGKHSKVVGDGALR 797

Query: 2371 RAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
            R VE+L   +GAPF LAK N GRF STGPVV AWLRESGTL+LL+L D
Sbjct: 798  RTVESLFTGMGAPFRLAKCNLGRFVSTGPVVTAWLRESGTLKLLVLHD 845


>ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic [Vitis vinifera]
          Length = 867

 Score =  927 bits (2397), Expect = 0.0
 Identities = 479/773 (61%), Positives = 590/773 (76%), Gaps = 8/773 (1%)
 Frame = +1

Query: 220  QLGSDFCGKRPKRGISKNNLGRGNSMKC--NQSLTAQAVLSDVLNAPLEQPVDNFL---K 384
            +L +DF G+R  R +SK + GR  +     + S   +A+   +  A  ++ +D+ L   +
Sbjct: 83   ELTADFSGRRSTRFVSKMHFGRPKTAAAARHTSTAEEALRHAIRFASDDKGIDSVLLNFE 142

Query: 385  GKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVD 564
             +    DDY  L++ELG+RGE +KA+ CF++AV RE+R NEQGKL + MISILG+LG+V+
Sbjct: 143  SRLCGSDDYTFLLRELGNRGEWAKAIRCFEFAVRREQRRNEQGKLASAMISILGRLGQVE 202

Query: 565  LAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVI 744
            LA+ VF+ A  +GYG  VYAFSAL+SAYGRSG   EAI VFE MK SG  PN+VTYNAVI
Sbjct: 203  LAKNVFETALNEGYGNTVYAFSALISAYGRSGYCDEAIKVFETMKSSGLKPNLVTYNAVI 262

Query: 745  DACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGI 924
            DACGKGGVDF +A   FDEM++  V+PDRITFNSL+AVC RGG W  A++LF +ML+RGI
Sbjct: 263  DACGKGGVDFNRAAEIFDEMLRNGVQPDRITFNSLLAVCGRGGLWEAARNLFSEMLYRGI 322

Query: 925  EQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALD 1104
            EQDIFTYNTLLDA+CKGGQMDLA   + +M  ++I PN +TYSTVIDGYAKAGRL+EAL+
Sbjct: 323  EQDIFTYNTLLDAVCKGGQMDLAFQIMSEMPRKHIMPNVVTYSTVIDGYAKAGRLDEALN 382

Query: 1105 LYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYG 1284
            L++EMK A I LDRVSYNTLLS+YAK GRFEEAL +C+EME +G KKD VTYNALLGGYG
Sbjct: 383  LFNEMKFASIGLDRVSYNTLLSIYAKLGRFEEALNVCKEMESSGIKKDAVTYNALLGGYG 442

Query: 1285 KQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVV 1464
            KQGKY++VK +FEEM+ +R+ PN+LTYSTLIDVYSKGG   EAM+V  EFK+ GLK DVV
Sbjct: 443  KQGKYEEVKRVFEEMKAERIFPNLLTYSTLIDVYSKGGLYQEAMEVFREFKKAGLKADVV 502

Query: 1465 LYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAE 1644
            LYSA IDALCK+G V  A+  LD+MT EGI PNVVT NSIID+ GRS          SAE
Sbjct: 503  LYSALIDALCKNGLVESAVSFLDEMTKEGIRPNVVTYNSIIDAFGRSG---------SAE 553

Query: 1645 FLAD--NDELVRSLENLSCVSSSPARKNERSNNNAN-ALQIVARNAFASLSIPEEIKQKS 1815
             + D   +  V  + + S      A ++E  +   N  ++I  + A       ++  +  
Sbjct: 554  CVIDPPYETNVSKMSSSSLKVVEDATESEVGDKEDNQIIKIFGQLAAEKTCHAKKENRGR 613

Query: 1816 EDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAH 1995
            ++IL  + +FH+M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGVAH
Sbjct: 614  QEILCILAVFHKMHELDIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVAH 673

Query: 1996 GLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQ 2175
            GLLMG+ D VW +AQ+LFDEV +MDS+TASAFYNALTDMLWHFGQR+GAQLVVL+G+ R 
Sbjct: 674  GLLMGYGDNVWVQAQSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQLVVLEGKRRH 733

Query: 2176 VWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTG 2355
            VWEN W  SCLDLHLMS+GAA+AMVHAWLL+IRSIVFEG ELP+LLSILTGWGKHSKV G
Sbjct: 734  VWENMWSNSCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPQLLSILTGWGKHSKVVG 793

Query: 2356 DSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
            D  LRRA+E LL  +GAPF +AK N GRF STG VV AWLRESGTL++L+L D
Sbjct: 794  DGALRRAIEALLTGMGAPFRVAKCNLGRFISTGAVVAAWLRESGTLKVLVLHD 846


>ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Populus trichocarpa]
            gi|550323986|gb|EEE99285.2| hypothetical protein
            POPTR_0014s11380g [Populus trichocarpa]
          Length = 875

 Score =  923 bits (2385), Expect = 0.0
 Identities = 475/775 (61%), Positives = 594/775 (76%), Gaps = 10/775 (1%)
 Frame = +1

Query: 220  QLGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLN-----APLEQPVDNFL 381
            +L SDF G+R  R +SK N GR  + M    +  A+  L +V+        LE  + NF 
Sbjct: 93   ELASDFSGRRSTRFVSKLNFGRPRTTMGTRHTSVAEEALQNVIEYGKDEGALENVLLNF- 151

Query: 382  KGKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRV 561
            + +    DDY+ L++ELG+RG+C KA+ CF++AV RE++ NEQGKL + MIS LG+LG+V
Sbjct: 152  ESRLSGSDDYIFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMISTLGRLGKV 211

Query: 562  DLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAV 741
            ++A+ VF+ A  +GYG  VYAFSA++SAYGRSG   EAI VF+ MK  G  PN+VTYNAV
Sbjct: 212  EIAKSVFEAALIEGYGNTVYAFSAIISAYGRSGYCDEAIKVFDSMKHYGLKPNLVTYNAV 271

Query: 742  IDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRG 921
            IDACGKGGV+FK+ +  FDEM++  V+PDRITFNSL+AVCSRGG W  A+ L  +ML+RG
Sbjct: 272  IDACGKGGVEFKRVVEIFDEMLRNGVQPDRITFNSLLAVCSRGGLWEAARSLSSEMLNRG 331

Query: 922  IEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEAL 1101
            I+QDIFTYNTLLDA+CKGGQMD+A   + +M  +NI PN +TYST+IDGYAKAGR ++AL
Sbjct: 332  IDQDIFTYNTLLDAVCKGGQMDMAFEIMSEMPAKNILPNVVTYSTMIDGYAKAGRFDDAL 391

Query: 1102 DLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGY 1281
            +L++EMK   I LDRVSYNTLLS+YAK GRF+EAL +CREME+ G +KD VTYNALLGGY
Sbjct: 392  NLFNEMKFLCISLDRVSYNTLLSIYAKLGRFQEALDVCREMENCGIRKDVVTYNALLGGY 451

Query: 1282 GKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDV 1461
            GKQ KYD+V+ +F EM+  RV+PN+LTYSTLIDVYSKGG   EAM V  EFK+ GLK DV
Sbjct: 452  GKQCKYDEVRRVFGEMKAGRVSPNLLTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADV 511

Query: 1462 VLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSA 1641
            VLYSA IDALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GRS+   S       
Sbjct: 512  VLYSAVIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSAITES------- 564

Query: 1642 EFLADNDELVR-SLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKS- 1815
              + DN +  +  +E+LS      A K+  ++   N +  +    F  L++ +  + K+ 
Sbjct: 565  -VVDDNVQTSQLQIESLSSGVVEEATKSLLADREGNRIIKI----FGQLAVEKAGQAKNC 619

Query: 1816 --EDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGV 1989
              ++++  + +FH+M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGV
Sbjct: 620  SGQEMMCILAVFHKMHELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGV 679

Query: 1990 AHGLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRS 2169
            AHGLLMG+R+ VW +AQ+LFDEV  MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ 
Sbjct: 680  AHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKR 739

Query: 2170 RQVWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKV 2349
            RQVWEN W ESCLDLHLMS+GAA+AMVHAWLL+IRSIVFEG ELPKLLSILTGWGKHSKV
Sbjct: 740  RQVWENVWSESCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKV 799

Query: 2350 TGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
             GDSTLRRA+E LL+ +GAPF LAK N GRF STG VV AWLRESGTL++L+L D
Sbjct: 800  VGDSTLRRAIEALLMGMGAPFRLAKCNLGRFISTGSVVAAWLRESGTLKVLVLHD 854


>ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Citrus sinensis]
          Length = 877

 Score =  920 bits (2379), Expect = 0.0
 Identities = 491/864 (56%), Positives = 607/864 (70%), Gaps = 32/864 (3%)
 Frame = +1

Query: 19   PHCPVPAISQXXXXXXXXXXXXELQEVHHHHPTRRV-------------VSSALRTTSGP 159
            PHC + A                      HHP+ R              +S + R    P
Sbjct: 6    PHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPRNAPKP 65

Query: 160  AHLST-ISSNXXXXXXXXXXX----QLGSDFCGKRPKRGISKNNLGRGN-SMKCNQSLTA 321
            A  ST ++ N               +L  DF G+R  R +SK + GR   +M    S+ A
Sbjct: 66   AATSTTVAPNPKPFHSLSPLPSSKSELAPDFSGRRSTRFVSKMHFGRPKIAMSTRHSVVA 125

Query: 322  QAVLSDVLN-APLEQPVDNFLKGKRLVL---DDYVSLMKELGSRGECSKAVECFKWAVSR 489
            +  L  V   A  +  + + LK     L   DDY  L++ELG+RGE SKA++CF +AV R
Sbjct: 126  EEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSKAIQCFAFAVKR 185

Query: 490  EKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWK 669
            E+R N+QGKL + MISILG+LG+VDLA+ +F+ A  +GYG  VYAFSAL+SAYGRSG  +
Sbjct: 186  EERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSALISAYGRSGYCQ 245

Query: 670  EAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSL 849
            EAISVF  MKR    PN+VTYNAVIDACGKGGVDFK  +  FD+M++  V+PDRITFNSL
Sbjct: 246  EAISVFNSMKRYNLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNGVQPDRITFNSL 305

Query: 850  IAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNI 1029
            +AVCSRGG W  A++LF +M+HRGI+QDIFTYNTLLDA+CKG QMDLA   + +M  +NI
Sbjct: 306  LAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAFEIMAEMPAKNI 365

Query: 1030 YPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALR 1209
             PN +TYST+IDGYAKAGRL++AL+++ EMK  GI LDRVSYNT+LS+YAK GRFEEAL 
Sbjct: 366  SPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALL 425

Query: 1210 ICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYS 1389
            +C+EME +G +KD VTYNALLGGYGKQGKYD+V+ +FE+M+   V+PN+LTYSTLIDVYS
Sbjct: 426  VCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNLLTYSTLIDVYS 485

Query: 1390 KGGFQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVV 1569
            KGG   EAMQ+  EFKQ GLK DVVLYSA IDALCK+G V  A+ LLD+MT EGI PNVV
Sbjct: 486  KGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVV 545

Query: 1570 TCNSIIDSLGRSSKGNSRSYGQSAEFLADNDE----LVRSLENLSCVSSSPARKNERSNN 1737
            T NSIID+ GRS+         + E   D+ E      +   NL  + S   +  + +  
Sbjct: 546  TYNSIIDAFGRSA---------TTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEAGR 596

Query: 1738 NANAL-----QIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVIL 1902
              N +     Q+VA  A       ++  +  ++IL  + +F +M KL IKPNVVTFS IL
Sbjct: 597  TDNQIIKVFGQLVAEKAGQG----KKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAIL 652

Query: 1903 NACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATA 2082
            NACSRC SF++ASMLLEELR+FD++VYGVAHGLLMG+RD +W +A +LFDEV  MDS+TA
Sbjct: 653  NACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSSTA 712

Query: 2083 SAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWL 2262
            SAFYNALTDMLWHFGQ++GAQLVVL+G+ RQVWEN W ESCLDLHLMS+GAA+AMVHAWL
Sbjct: 713  SAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWL 772

Query: 2263 LSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRF 2442
            L+I SIVFEG ELPKLLSILTGWGKHSKV GD  LRRAVE LL  +GAPF +A  N GRF
Sbjct: 773  LNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGALRRAVEVLLTGMGAPFWVANCNLGRF 832

Query: 2443 TSTGPVVDAWLRESGTLELLILRD 2514
             STGP+V +WLRESGTL++L+L D
Sbjct: 833  ISTGPMVASWLRESGTLKVLVLHD 856


>ref|XP_002515260.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545740|gb|EEF47244.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 878

 Score =  920 bits (2379), Expect = 0.0
 Identities = 487/855 (56%), Positives = 608/855 (71%), Gaps = 23/855 (2%)
 Frame = +1

Query: 19   PHCPVPAISQXXXXXXXXXXXXELQEVHHHHPTRRVVS-----------SALRTTSGPAH 165
            PHC + A                 ++ HHH  T + VS           +A +  +  A 
Sbjct: 6    PHCSITATKPYQNHQYPQNHLKNHRQTHHHRWTNQKVSLTKPPLAPSPCNAPKAAAAAAA 65

Query: 166  LSTISSNXXXXXXXXXXXQ-----LGSDFCGKRPKRGISKNNLGRGNSMKCNQSLTAQAV 330
             +T               Q     L +DF G+R  R +SK + GR  +     +  A   
Sbjct: 66   ATTTHHTPNPTFHSLSPLQSQKSDLSADFSGRRSTRFVSKLHFGRPKTNMNRHTSVALEA 125

Query: 331  LSDVL-----NAPLEQPVDNFLKGKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREK 495
            L  V+     +  LE  + NF + +    DDY  L++ELG+RG+ +KAV CF++AV RE 
Sbjct: 126  LQQVIQYGKDDKALENVLLNF-ESRLCGPDDYTFLLRELGNRGDSAKAVRCFEFAVRRES 184

Query: 496  RSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEA 675
              NEQGKL + MIS LG+LG+V+LA+ VFD A  +GYGK VYAFSAL+SAYGRSG   EA
Sbjct: 185  GKNEQGKLASAMISTLGRLGKVELAKAVFDTALKEGYGKTVYAFSALISAYGRSGYCNEA 244

Query: 676  ISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIA 855
            I VF+ MK +G  PN+VTYNAVIDACGKGGV+FK+ +  FD M+   V+PDRITFNSL+A
Sbjct: 245  IKVFDSMKSNGLMPNLVTYNAVIDACGKGGVEFKKVVEIFDGMLSNGVQPDRITFNSLLA 304

Query: 856  VCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYP 1035
            VCSRGG W  A+ LF  M+ +GI+QDIFTYNTLLDA+CKGGQMDLA   + +M  +NI P
Sbjct: 305  VCSRGGLWEAARRLFSAMVDKGIDQDIFTYNTLLDAVCKGGQMDLAFEIMSEMPTKNILP 364

Query: 1036 NAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRIC 1215
            N +TYST+IDGYAK GRL++AL++++EMK  G+ LDRVSYNTLLSVYAK GRFE+AL +C
Sbjct: 365  NVVTYSTMIDGYAKVGRLDDALNMFNEMKFLGVGLDRVSYNTLLSVYAKLGRFEQALDVC 424

Query: 1216 REMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKG 1395
            +EME+AG +KD VTYNALL GYGKQ +YD+V+ +FEEM+  RV+PN+LTYSTLIDVYSKG
Sbjct: 425  KEMENAGIRKDVVTYNALLAGYGKQYRYDEVRRVFEEMKRGRVSPNLLTYSTLIDVYSKG 484

Query: 1396 GFQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTC 1575
            G   EAM+V  EFKQ GLK DVVLYSA IDALCK+G V  ++ LLD+MT EGI PNVVT 
Sbjct: 485  GLYKEAMEVFREFKQAGLKADVVLYSALIDALCKNGLVESSVTLLDEMTKEGIRPNVVTY 544

Query: 1576 NSIIDSLGRSSKGNSRSYGQSAEFLADN--DELVRSLENLSCVSSSPARKNERSNNNANA 1749
            NSIID+ GRS+         SA+ + D+  +     +E+LS +    A +++ ++   N 
Sbjct: 545  NSIIDAFGRSA---------SAQCVVDDSGETTALQVESLSSIVVQEAIESQAADKEDNR 595

Query: 1750 LQIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSF 1929
            + I      A+    E      ++IL  + +F +M +L IKPNVVTFS ILNACSRC SF
Sbjct: 596  I-IEIFGKLAAEKACEAKNSGKQEILCILGVFQKMHELKIKPNVVTFSAILNACSRCDSF 654

Query: 1930 KEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTD 2109
            ++ASMLLEELR+FD++VYGVAHGLLMG+R+ VW +AQ+LFDEV  MDS+TASAFYNALTD
Sbjct: 655  EDASMLLEELRLFDNQVYGVAHGLLMGYRENVWLQAQSLFDEVKLMDSSTASAFYNALTD 714

Query: 2110 MLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFE 2289
            MLWHFGQ++GAQLVVL+G+ RQVWEN W +SCLDLHLMS+GAA+AMVHAWLL+IRSIVFE
Sbjct: 715  MLWHFGQKRGAQLVVLEGKRRQVWENIWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVFE 774

Query: 2290 GRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDA 2469
            G ELPKLLSILTGWGKHSKV GDS LRRAVE LL+ +GAPF LAK N GRF STG VV A
Sbjct: 775  GHELPKLLSILTGWGKHSKVVGDSALRRAVEALLIGMGAPFRLAKCNLGRFISTGSVVAA 834

Query: 2470 WLRESGTLELLILRD 2514
            WL+ESGTLE+L+L D
Sbjct: 835  WLKESGTLEVLVLHD 849


>ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citrus clementina]
            gi|557546795|gb|ESR57773.1| hypothetical protein
            CICLE_v10018807mg [Citrus clementina]
          Length = 877

 Score =  920 bits (2377), Expect = 0.0
 Identities = 491/864 (56%), Positives = 607/864 (70%), Gaps = 32/864 (3%)
 Frame = +1

Query: 19   PHCPVPAISQXXXXXXXXXXXXELQEVHHHHPTRRV-------------VSSALRTTSGP 159
            PHC + A                      HHP+ R              +S + R    P
Sbjct: 6    PHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPRNAPKP 65

Query: 160  AHLST-ISSNXXXXXXXXXXX----QLGSDFCGKRPKRGISKNNLGRGN-SMKCNQSLTA 321
            A  ST ++ N               +L  DF G+R  R +SK + GR   +M    S+ A
Sbjct: 66   AATSTTVAPNPKPFHSLSPLPSSKSELAPDFSGRRSTRFVSKMHFGRPKIAMSTRHSVVA 125

Query: 322  QAVLSDVLN-APLEQPVDNFLKGKRLVL---DDYVSLMKELGSRGECSKAVECFKWAVSR 489
            +  L  V   A  +  + + LK     L   DDY  L++ELG+RGE SKA++CF +AV R
Sbjct: 126  EEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSKAIQCFAFAVKR 185

Query: 490  EKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWK 669
            E+R N+QGKL + MISILG+LG+VDLA+ +F+ A  +GYG  VYAFSAL+SAYGRSG  +
Sbjct: 186  EERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSALISAYGRSGYCQ 245

Query: 670  EAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSL 849
            EAISVF  MKR    PN+VTYNAVIDACGKGGVDFK  +  FD+M++  V+PDRITFNSL
Sbjct: 246  EAISVFNSMKRYHLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNGVQPDRITFNSL 305

Query: 850  IAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNI 1029
            +AVCSRGG W  A++LF +M+HRGI+QDIFTYNTLLDA+CKG QMDLA   + +M  +NI
Sbjct: 306  LAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAFEIMAEMPAKNI 365

Query: 1030 YPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALR 1209
             PN +TYST+IDGYAKAGRL++AL+++ EMK  GI LDRVSYNT+LS+YAK GRFEEAL 
Sbjct: 366  SPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALL 425

Query: 1210 ICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYS 1389
            +C+EME +G +KD VTYNALLGGYGKQGKYD+V+ +FE+M+   V+PN+LTYSTLIDVYS
Sbjct: 426  VCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNLLTYSTLIDVYS 485

Query: 1390 KGGFQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVV 1569
            KGG   EAMQ+  EFKQ GLK DVVLYSA IDALCK+G V  A+ LLD+MT EGI PNVV
Sbjct: 486  KGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVV 545

Query: 1570 TCNSIIDSLGRSSKGNSRSYGQSAEFLADNDE----LVRSLENLSCVSSSPARKNERSNN 1737
            T NSIID+ GRS+         + E   D+ E      +   NL  + S   +  + +  
Sbjct: 546  TYNSIIDAFGRSA---------TTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEAGR 596

Query: 1738 NANAL-----QIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVIL 1902
              N +     Q+VA  A       ++  +  ++IL  + +F +M KL IKPNVVTFS IL
Sbjct: 597  TDNQIIKVFGQLVAEKAGQG----KKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAIL 652

Query: 1903 NACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATA 2082
            NACSRC SF++ASMLLEELR+FD++VYGVAHGLLMG+RD +W +A +LFDEV  MDS+TA
Sbjct: 653  NACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSSTA 712

Query: 2083 SAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWL 2262
            SAFYNALTDMLWHFGQ++GAQLVVL+G+ RQVWEN W ESCLDLHLMS+GAA+AMVHAWL
Sbjct: 713  SAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWL 772

Query: 2263 LSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRF 2442
            L+I SIVFEG ELPKLLSILTGWGKHSKV GD  LRRAVE LL  +GAPF +A  N GRF
Sbjct: 773  LNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGALRRAVEVLLTGMGAPFWVANCNLGRF 832

Query: 2443 TSTGPVVDAWLRESGTLELLILRD 2514
             STGP+V +WLRESGTL++L+L D
Sbjct: 833  ISTGPMVASWLRESGTLKVLVLHD 856


>ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutrema salsugineum]
            gi|557095737|gb|ESQ36319.1| hypothetical protein
            EUTSA_v10006755mg [Eutrema salsugineum]
          Length = 895

 Score =  913 bits (2359), Expect = 0.0
 Identities = 471/784 (60%), Positives = 579/784 (73%), Gaps = 20/784 (2%)
 Frame = +1

Query: 223  LGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLN-APLEQPVDNFL---KG 387
            L  DF G+R  R +SK + GR  + M    SL A+  L   +  +  ++ + N L   + 
Sbjct: 105  LSPDFAGRRSTRFVSKMHFGRPKTAMASRHSLVAEDALHHAIQFSGNDEGLQNLLLSFES 164

Query: 388  KRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 567
            K    DDY  +++ELG+RGE  KAV  +++AV RE+R NEQGKL + MIS LG+LG+V +
Sbjct: 165  KLCGSDDYTYILRELGNRGEFEKAVRFYEFAVKRERRKNEQGKLASAMISTLGRLGKVGI 224

Query: 568  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 747
            A++VF+ A ADGYG  VYAFSA++SAYGRSG  ++AI VF  MK  G  PN+VTYNAVID
Sbjct: 225  AKRVFETALADGYGNTVYAFSAIISAYGRSGYHEDAIKVFSSMKGHGLRPNLVTYNAVID 284

Query: 748  ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 927
            ACGKGG++FKQ   +FDEM +  V+PDRITFNSL+AVCSRGG W  A++LF++ML+RGIE
Sbjct: 285  ACGKGGMEFKQVAEFFDEMQRNRVQPDRITFNSLLAVCSRGGSWEAARNLFDEMLNRGIE 344

Query: 928  QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1107
            QDIFTYNTLLDA+CKGGQMDLA   L  M  +NI PN +TYSTVIDGYAKAGR N+AL L
Sbjct: 345  QDIFTYNTLLDAICKGGQMDLAFEILAQMPAKNIMPNVVTYSTVIDGYAKAGRFNDALTL 404

Query: 1108 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1287
            + EMK  GIPLDRVSYNTL+S+YAK GRFEEAL I +EM  AG +KD VTYNALLGGYGK
Sbjct: 405  FGEMKYLGIPLDRVSYNTLVSIYAKLGRFEEALDIVKEMAAAGIRKDAVTYNALLGGYGK 464

Query: 1288 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVL 1467
              KYD+VK++F EM+ +RV PN+LTYSTLIDVYSKGG   EAM++  EFK  GL+ DVVL
Sbjct: 465  HEKYDEVKSVFAEMKQERVLPNLLTYSTLIDVYSKGGLYKEAMEIFREFKSVGLRADVVL 524

Query: 1468 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSK------------ 1611
            YSA IDALCK+G V  A+ LLD+MT EGI+PNVVT NS+ID+ GRS+             
Sbjct: 525  YSALIDALCKNGLVESAVSLLDEMTKEGISPNVVTYNSMIDAFGRSATTECLADINEGGA 584

Query: 1612 ---GNSRSYGQSAEFLADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFAS 1782
                   S+  S+  L+  D L  ++     +S     ++ R       L     N    
Sbjct: 585  NGLEEDESFSSSSASLSHTDSLSLAVGEADSLSKLTKTEDHRIVEIFGQLVTEGNN---- 640

Query: 1783 LSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELR 1962
              I  + KQ  +++   +++ H+M +L IKPNVVTFS ILNACSRC SF+EASMLLEELR
Sbjct: 641  -QIKRDCKQGVQELSCILEVCHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELR 699

Query: 1963 VFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGA 2142
            +FD++VYGVAHGLLMG+ + VW +AQ+LFDEV  MD +TASAFYNALTDMLWHFGQ++GA
Sbjct: 700  LFDNKVYGVAHGLLMGYNENVWIQAQSLFDEVKAMDGSTASAFYNALTDMLWHFGQKRGA 759

Query: 2143 QLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSIL 2322
            Q VVL+GR R+VWEN W +SCLDLHLMS+GAA+AMVHAWLL+IRSIV+EG ELPKLLSIL
Sbjct: 760  QSVVLEGRRRKVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKLLSIL 819

Query: 2323 TGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELL 2502
            TGWGKHSKV GD TLRRAVE LL  +GAPFH+AK N GRF S+G VV AWLRESGTL++L
Sbjct: 820  TGWGKHSKVMGDGTLRRAVEALLRGMGAPFHVAKCNVGRFVSSGSVVAAWLRESGTLKVL 879

Query: 2503 ILRD 2514
            +L D
Sbjct: 880  VLED 883


>ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345388|gb|ERP64510.1| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 873

 Score =  907 bits (2344), Expect = 0.0
 Identities = 466/773 (60%), Positives = 587/773 (75%), Gaps = 8/773 (1%)
 Frame = +1

Query: 220  QLGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLN-----APLEQPVDNFL 381
            +L SDF G+R  R +SK + GR  + M    +  AQ  L +V+        LE  + NF 
Sbjct: 91   ELVSDFPGRRSTRFVSKLHFGRPRTTMGTRHTSVAQEALQNVIEYGKDERALENVLLNF- 149

Query: 382  KGKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRV 561
            + +    DDYV L++ELG+RG+C KA+ CF++AV RE++ NEQGKL + MIS LG+LG+V
Sbjct: 150  ESRLSGSDDYVFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMISTLGRLGKV 209

Query: 562  DLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAV 741
            ++A+ VF  A  +GYG  VYAFSA++SAYGRSG   EAI +F  MK  G  PN+VTYNAV
Sbjct: 210  EMAKTVFKAALTEGYGNTVYAFSAIISAYGRSGYCNEAIKIFYSMKDYGLKPNLVTYNAV 269

Query: 742  IDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRG 921
            IDACGKGGV+FK+ +  FDEM++  ++PDRITFNSL+AVCS+GG W  A+ L  +M++RG
Sbjct: 270  IDACGKGGVEFKRVLEIFDEMLRNGMQPDRITFNSLLAVCSKGGLWEAARSLSCEMVNRG 329

Query: 922  IEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEAL 1101
            I+QDIFTYNTLLDA+CKGGQ+D+A   + +M  +NI PN +TYST+IDGYAKAGRL++A 
Sbjct: 330  IDQDIFTYNTLLDAVCKGGQLDMAFEIMSEMPAKNILPNVVTYSTMIDGYAKAGRLDDAR 389

Query: 1102 DLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGY 1281
            +L++EMK  GI LDRVSYNTLLS+YAK GRFEEA+ +CREME++G +KD VTYNALLGGY
Sbjct: 390  NLFNEMKFLGISLDRVSYNTLLSIYAKLGRFEEAMDVCREMENSGIRKDVVTYNALLGGY 449

Query: 1282 GKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDV 1461
            GKQ KYD V+ +FEEM+ + V+PN+LTYSTLIDVYSKGG   EAM V  EFK+ GLK DV
Sbjct: 450  GKQYKYDVVRKVFEEMKARHVSPNLLTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADV 509

Query: 1462 VLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNS--RSYGQ 1635
            VLYSA IDALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GR +   S     GQ
Sbjct: 510  VLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPATTESVVDDAGQ 569

Query: 1636 SAEFLADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKS 1815
            ++E           +++LS  +   A K+  ++   N + I      A+    +      
Sbjct: 570  TSEL---------QIDSLSSSAVEKATKSLVADREDNRI-IKIFGQLAAEKAGQAKNSGG 619

Query: 1816 EDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAH 1995
            ++++  + +FH+M +L IKPNVVTFS ILNACSRC SF+EASMLLEELR+FD++VYGVAH
Sbjct: 620  QEMMCILGVFHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELRLFDNQVYGVAH 679

Query: 1996 GLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQ 2175
            GLLMG+R+ VW +AQ+LFDEV  MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ RQ
Sbjct: 680  GLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQ 739

Query: 2176 VWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTG 2355
            VWEN W ESCLDLHLMS+GAA+AMVHAWLL++R+IVFEG E+PKLLSILTGWGKHSKV G
Sbjct: 740  VWENVWSESCLDLHLMSSGAARAMVHAWLLNVRAIVFEGHEVPKLLSILTGWGKHSKVVG 799

Query: 2356 DSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
            DSTLRRAVE LL+ +GAPF  AK N GR  STG VV +WLRESGTL++L+L D
Sbjct: 800  DSTLRRAVEALLMGMGAPFRSAKCNLGRLISTGSVVASWLRESGTLKVLVLHD 852


>gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]
          Length = 871

 Score =  894 bits (2311), Expect = 0.0
 Identities = 457/774 (59%), Positives = 583/774 (75%), Gaps = 10/774 (1%)
 Frame = +1

Query: 223  LGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLNAPLEQ-PVDNFL---KG 387
            L + F G+R  R +SK +LGR  + +    +  A+ VL   +    +   +DN L   + 
Sbjct: 89   LAAVFSGRRSTRFVSKMHLGRPKTTVGSRHTAVAEEVLQQAIQFGKDDLGIDNVLLSFEP 148

Query: 388  KRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 567
            K    DDY  L++ELG+RGEC KA+ CF++AV+RE+R  EQGKL + MIS LG+LG+V+L
Sbjct: 149  KLCGSDDYTFLLRELGNRGECRKAIRCFEFAVARERRKTEQGKLTSAMISTLGRLGKVEL 208

Query: 568  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 747
            A+ VF+ A   GYG  VY +SAL+SAYGRSG W+EA  V E MK SG  PN+VTYNAVID
Sbjct: 209  ARDVFETALFAGYGNTVYTYSALISAYGRSGYWEEARRVVESMKDSGLKPNLVTYNAVID 268

Query: 748  ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 927
            ACGKGG +FK+ +  FDEM++  V+PDRIT+NSL+AVCSRGG W  A+ LF +M+ R I+
Sbjct: 269  ACGKGGAEFKRVVEIFDEMLRNGVQPDRITYNSLLAVCSRGGLWEAARSLFSEMVERQID 328

Query: 928  QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1107
            QDI+TYNTLLDA+CKGGQMDLA   + +M  + I PN +TYST+IDGYAKAGRL +AL+L
Sbjct: 329  QDIYTYNTLLDAICKGGQMDLARQIMSEMPSKKILPNVVTYSTMIDGYAKAGRLEDALNL 388

Query: 1108 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1287
            ++EMK   I LDRV YNTLLS+YAK GRFEEAL++C+EME +G  +D V+YNALLGGYGK
Sbjct: 389  FNEMKYLAIGLDRVLYNTLLSIYAKLGRFEEALKVCKEMESSGIVRDVVSYNALLGGYGK 448

Query: 1288 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVL 1467
            QGKYD+VK ++++M+   V+PN+LTYSTLIDVYSKGG   EAM+V  EFKQ GLK DVVL
Sbjct: 449  QGKYDEVKRMYQDMKADHVSPNLLTYSTLIDVYSKGGLYREAMEVFREFKQAGLKADVVL 508

Query: 1468 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEF 1647
            YS  I+ALCK+G V  A+ LLD+MT EGI PNV+T NSIID+ GR +  +S     +   
Sbjct: 509  YSELINALCKNGMVESAVSLLDEMTKEGIMPNVITYNSIIDAFGRPATADS-----ALGA 563

Query: 1648 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEE-----IKQK 1812
                +EL   L   S +S+  A KN+  N   +  QI+    F  L+  +E      K+ 
Sbjct: 564  AIGGNELETELS--SSISNENANKNKAVNKGDH--QII--KMFGQLAAEQEGHTKKDKKI 617

Query: 1813 SEDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVA 1992
             ++IL  + +F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGVA
Sbjct: 618  RQEILCILGVFQKMHELNIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVA 677

Query: 1993 HGLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSR 2172
            HGLLMG R+ VW +AQ+LFDEV +MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ R
Sbjct: 678  HGLLMGHRENVWLEAQSLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRR 737

Query: 2173 QVWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVT 2352
             VWE+ W  S LDLHLMS+GAA+A++HAWLL+IRS+VFEG+ELP+LLSILTGWGKHSKV 
Sbjct: 738  NVWESVWSNSFLDLHLMSSGAARALLHAWLLNIRSVVFEGQELPRLLSILTGWGKHSKVV 797

Query: 2353 GDSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
            GDS LRRA+E+LL+++GAPF  AK N GRFTS GP+V  WL+ESGTL++L+L D
Sbjct: 798  GDSALRRAIESLLISMGAPFEAAKCNLGRFTSPGPMVAGWLKESGTLKVLVLHD 851


>ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score =  894 bits (2311), Expect = 0.0
 Identities = 467/802 (58%), Positives = 592/802 (73%), Gaps = 8/802 (0%)
 Frame = +1

Query: 133  SALRTTSGPAHLSTISSNXXXXXXXXXXXQLGSDFCGKRPKRGISKNNLGRG-NSMKCNQ 309
            SA ++TS P  LS   +            +L S+F G+R  R +SK + GR  +SM    
Sbjct: 58   SATKSTSTP--LSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMTTRH 115

Query: 310  SLTAQAVLSDVL-----NAPLEQPVDNFLKGKRLVLDDYVSLMKELGSRGECSKAVECFK 474
            S  A+ VL  VL     +A L+  + NF + K    +DY  L++ELG+RGEC KA+ CF 
Sbjct: 116  SAIAEEVLHQVLQFGKDDASLDNILLNF-ESKLCGSEDYTFLLRELGNRGECWKAIRCFD 174

Query: 475  WAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGR 654
            +A+ RE R NE+GKL + MIS LG+LG+V+LA+ VF+ A ++GYG  V+AFSAL+SAYG+
Sbjct: 175  FALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGK 234

Query: 655  SGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRI 834
            SG + EAI VFE MK SG  PN+VTYNAVIDACGKGGV+FK+ +  F+EM++  V+PDRI
Sbjct: 235  SGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQPDRI 294

Query: 835  TFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDM 1014
            T+NSL+AVCSRGG W  A++LF +M+ RGI+QD+FTYNTLLDA+CKGGQMDLA   + +M
Sbjct: 295  TYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIMLEM 354

Query: 1015 SERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRF 1194
              + I PN +TYST+ DGYAKAGRL +AL+LY+EMK  GI LDRVSYNTLLS+YAK GRF
Sbjct: 355  PGKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRF 414

Query: 1195 EEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTL 1374
            E+AL++C+EM  +G KKD VTYNALL GYGKQGK+++V  +F+EM+  RV PN+LTYSTL
Sbjct: 415  EDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTYSTL 474

Query: 1375 IDVYSKGGFQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGI 1554
            IDVYSKG    EAM+V  EFKQ GLK DVVLYS  I+ALCK+G V  A+ LLD+MT EGI
Sbjct: 475  IDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTKEGI 534

Query: 1555 TPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLAD--NDELVRSLENLSCVSSSPARKNER 1728
             PNVVT NSIID+ GRS+         +AEFL D       R  E+ S +      ++E 
Sbjct: 535  RPNVVTYNSIIDAFGRST---------TAEFLVDGVGASNERQSESPSFMLIEGVDESEI 585

Query: 1729 SNNNANALQIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVILNA 1908
            + ++ +  +   +         ++ +   E+I   + +F +M +L IKPNVVTFS ILNA
Sbjct: 586  NWDDGHVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKKMHELEIKPNVVTFSAILNA 645

Query: 1909 CSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATASA 2088
            CSRC S ++ASMLLEELR+FD++VYGVAHGLLMGF + VW +AQ LFDEV +MDS+TASA
Sbjct: 646  CSRCKSIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASA 705

Query: 2089 FYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWLLS 2268
            FYNALTDMLWHFGQ++GAQLVVL+G+ R+VWE  W +SCLDLHLMS+GAA+AMVHAWLL 
Sbjct: 706  FYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLG 765

Query: 2269 IRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTS 2448
            I S+VFEG +LPKLLSILTGWGKHSKV GD  LRRA+E LL ++GAPF +AK N GR+ S
Sbjct: 766  IHSVVFEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALLTSMGAPFRVAKCNIGRYVS 825

Query: 2449 TGPVVDAWLRESGTLELLILRD 2514
            TG VV AWL+ESGTL+LL+L D
Sbjct: 826  TGSVVAAWLKESGTLKLLVLHD 847


>ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score =  893 bits (2308), Expect = 0.0
 Identities = 466/802 (58%), Positives = 592/802 (73%), Gaps = 8/802 (0%)
 Frame = +1

Query: 133  SALRTTSGPAHLSTISSNXXXXXXXXXXXQLGSDFCGKRPKRGISKNNLGRG-NSMKCNQ 309
            SA ++TS P  LS   +            +L S+F G+R  R +SK + GR  +SM    
Sbjct: 58   SATKSTSTP--LSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMTTRH 115

Query: 310  SLTAQAVLSDVL-----NAPLEQPVDNFLKGKRLVLDDYVSLMKELGSRGECSKAVECFK 474
            S  A+ VL  VL     +A L+  + NF + K    +DY  L++ELG+RGEC KA+ CF 
Sbjct: 116  SAIAEEVLHQVLQFGKDDASLDNILLNF-ESKLCGSEDYTFLLRELGNRGECWKAIRCFD 174

Query: 475  WAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGR 654
            +A+ RE R NE+GKL + MIS LG+LG+V+LA+ VF+ A ++GYG  V+AFSAL+SAYG+
Sbjct: 175  FALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGK 234

Query: 655  SGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRI 834
            SG + EAI VFE MK SG  PN+VTYNAVIDACGKGGV+FK+ +  F+EM++  V+PDRI
Sbjct: 235  SGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQPDRI 294

Query: 835  TFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDM 1014
            T+NSL+AVCSRGG W  A++LF +M+ RGI+QD+FTYNTLLDA+CKGGQMDLA   + +M
Sbjct: 295  TYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIMLEM 354

Query: 1015 SERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRF 1194
              + I PN +TYST+ DGYAKAGRL +AL+LY+EMK  GI LDRVSYNTLLS+YAK GRF
Sbjct: 355  PGKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRF 414

Query: 1195 EEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTL 1374
            E+AL++C+EM  +G KKD VTYNALL GYGKQGK+++V  +F+EM+  RV PN+LTYSTL
Sbjct: 415  EDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTYSTL 474

Query: 1375 IDVYSKGGFQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGI 1554
            IDVYSKG    EAM+V  EFKQ GLK DVVLYS  I+ALCK+G V  A+ LLD+MT EGI
Sbjct: 475  IDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTKEGI 534

Query: 1555 TPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLAD--NDELVRSLENLSCVSSSPARKNER 1728
             PNVVT NSIID+ GRS+         +AEFL D       R  E+ + +      ++E 
Sbjct: 535  RPNVVTYNSIIDAFGRST---------TAEFLVDGVGASNERQSESPTFMLIEGVDESEI 585

Query: 1729 SNNNANALQIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVILNA 1908
            + ++ +  +   +         ++ +   E+I   + +F +M +L IKPNVVTFS ILNA
Sbjct: 586  NWDDGHVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKKMHELEIKPNVVTFSAILNA 645

Query: 1909 CSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATASA 2088
            CSRC S ++ASMLLEELR+FD++VYGVAHGLLMGF + VW +AQ LFDEV +MDS+TASA
Sbjct: 646  CSRCKSIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASA 705

Query: 2089 FYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWLLS 2268
            FYNALTDMLWHFGQ++GAQLVVL+G+ R+VWE  W +SCLDLHLMS+GAA+AMVHAWLL 
Sbjct: 706  FYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLG 765

Query: 2269 IRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTS 2448
            I S+VFEG +LPKLLSILTGWGKHSKV GD  LRRA+E LL ++GAPF +AK N GR+ S
Sbjct: 766  IHSVVFEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALLTSMGAPFRVAKCNIGRYVS 825

Query: 2449 TGPVVDAWLRESGTLELLILRD 2514
            TG VV AWL+ESGTL+LL+L D
Sbjct: 826  TGSVVAAWLKESGTLKLLVLHD 847


>ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Capsella rubella]
            gi|482562350|gb|EOA26540.1| hypothetical protein
            CARUB_v10022597mg [Capsella rubella]
          Length = 932

 Score =  890 bits (2299), Expect = 0.0
 Identities = 461/770 (59%), Positives = 575/770 (74%), Gaps = 6/770 (0%)
 Frame = +1

Query: 223  LGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLNAPLEQPVDNFL----KG 387
            L SDF G+R  R +SK + GR  + M    S  A+  L + ++   +  + + L    + 
Sbjct: 141  LSSDFSGRRSTRFVSKMHFGRPKTAMATRHSSAAEDALQNAIDFSGDSEMFHSLMLSFES 200

Query: 388  KRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 567
            K    DD   +++ELG+RGEC KAV  +++AV RE+R NEQGKL + MIS LG+ G+V +
Sbjct: 201  KLCGSDDCTYIIRELGNRGECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKVTI 260

Query: 568  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 747
            A+++F+ A A GYG  VYAFSAL+SAYGRSGL +EAISVF  MK  G  PN+VTYNAVID
Sbjct: 261  AKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFSSMKDHGLRPNLVTYNAVID 320

Query: 748  ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 927
            ACGKGG++FKQ   +FDEM K  V+PDRITFNSL+AVCSRGG W  A++LF++M +R IE
Sbjct: 321  ACGKGGMEFKQVAKFFDEMQKNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMSNRRIE 380

Query: 928  QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1107
            QD+F+YNTLLDA+CKGGQMDLA   L  M  + I PN ++YSTVIDG+AKAGR +EAL+L
Sbjct: 381  QDVFSYNTLLDAICKGGQMDLAFEILAQMPAKRIMPNVVSYSTVIDGFAKAGRFDEALNL 440

Query: 1108 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1287
            + EM+  GI LDRVSYNTLLS+Y K GR EEAL I REM   G KKD VTYNALLGGYGK
Sbjct: 441  FGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGK 500

Query: 1288 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVL 1467
            QGKYD+VK +F EM+ + V PN+LTYSTLID YSKGG   EAM++  EFK  GL+ DVVL
Sbjct: 501  QGKYDEVKKVFAEMKREHVVPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADVVL 560

Query: 1468 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEF 1647
            YSA IDALCK+G VG A+ L+D+MT EGI+PNVVT NSIID+ GRS+     +  +SA++
Sbjct: 561  YSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSA-----TMERSADY 615

Query: 1648 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVAR-NAFASLSIPEEIKQKSEDI 1824
               N E   +LE  S   SS A            +Q+  +  A ++  + ++ K+  +++
Sbjct: 616  --SNGE-ANNLEVGSLALSSSALSKLTETEGNRVIQLFGQLTAESNNRMTKDCKEGMQEL 672

Query: 1825 LGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLL 2004
               +++F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGV HGLL
Sbjct: 673  SCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLL 732

Query: 2005 MGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWE 2184
            MG R+ VW +AQ+LFD+V  MD +TASAFYNALTDMLWHFGQ++GA+LV L+GRSRQVWE
Sbjct: 733  MGERENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWE 792

Query: 2185 NAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDST 2364
            N W +SCLDLHLMS+GAA+AMVHAWLL+IRSIV+EG ELPK+LSILTGWGKHSKV GD  
Sbjct: 793  NVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGA 852

Query: 2365 LRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
            LRRAVE LL  + APFHL+K N GRF S+G VV  WLRES TL+LLIL D
Sbjct: 853  LRRAVEVLLRGMDAPFHLSKCNMGRFISSGSVVATWLRESATLKLLILHD 902


>gb|EMJ22752.1| hypothetical protein PRUPE_ppa001263mg [Prunus persica]
          Length = 868

 Score =  888 bits (2294), Expect = 0.0
 Identities = 464/805 (57%), Positives = 585/805 (72%), Gaps = 10/805 (1%)
 Frame = +1

Query: 130  SSALRTTSGPAHLSTISSNXXXXXXXXXXXQLGSDFCGKRPKRGISKNNLGRGNSM--KC 303
            S A RT +     +  SS             L + F G+R  R +SK +LGR  +     
Sbjct: 54   SQAPRTAAKTPTATPTSSFSSLCPLPHPKSDLVTAFSGRRSTRFVSKMHLGRPKTTMGSY 113

Query: 304  NQSLTAQAVLSDVLNAPLEQPVDNFLKGKRLVL---DDYVSLMKELGSRGECSKAVECFK 474
               L  +A+   V     +  +D+ L      L   DDY  L +ELG+RGEC KA+ CF+
Sbjct: 114  RSPLAEEALHQAVQFGNDDLALDDILLSFHSRLCGSDDYTFLFRELGNRGECWKAIRCFE 173

Query: 475  WAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGR 654
            +AV REKR  EQGKL + MIS LG+LG+V+LA+ VF  A  +GYGK VY +SAL++AYGR
Sbjct: 174  FAVRREKRRTEQGKLASSMISTLGRLGKVELAKNVFQTAVNEGYGKTVYTYSALITAYGR 233

Query: 655  SGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRI 834
            +G  +EAI VFE MK SG  PN+VTYNAVIDA GKGGV+FK+ +  F+EM++   +PDRI
Sbjct: 234  NGYCEEAIRVFESMKDSGLKPNLVTYNAVIDAYGKGGVEFKRVVEIFNEMLRNGEQPDRI 293

Query: 835  TFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDM 1014
            T+NSL+AVCSRGG W  A++LF +M+ RGI+QDI+TYNTL+DA+CKGGQMDLA   + +M
Sbjct: 294  TYNSLLAVCSRGGLWEMARNLFSEMVDRGIDQDIYTYNTLIDAICKGGQMDLAYQIMSEM 353

Query: 1015 SERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRF 1194
              +NI PN +TYST+IDGYAKAGRL +AL L++EMK   I LDRV YNTLLS+Y K GRF
Sbjct: 354  PSKNILPNVVTYSTIIDGYAKAGRLEDALSLFNEMKFLAIGLDRVLYNTLLSLYGKLGRF 413

Query: 1195 EEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTL 1374
            E+AL++C+EME  G  KD V+YNALLGGYGKQGKYD  K ++ +M+ +RV+PN+LTYSTL
Sbjct: 414  EDALKVCKEMESVGIAKDVVSYNALLGGYGKQGKYDDAKRMYNQMKEERVSPNILTYSTL 473

Query: 1375 IDVYSKGGFQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGI 1554
            IDVYSKGG   EAM+V  EFKQ GLK DVVLYS  ++ALCK+G V  A+ LLD+MT EGI
Sbjct: 474  IDVYSKGGLYMEAMKVFREFKQAGLKADVVLYSELVNALCKNGLVESAVLLLDEMTKEGI 533

Query: 1555 TPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLAD--NDELVRSLENLSCVSSSPA---RK 1719
             PNVVT NSIID+ GRS+         + E  AD     +V   E+ S VS   A   + 
Sbjct: 534  RPNVVTYNSIIDAFGRSA---------TTECAADAAGGGIVLQTESSSSVSEGDAIGIQV 584

Query: 1720 NERSNNNANALQIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVI 1899
             +R +N    +++  + A       +  ++  ++IL  + IF +M +L IKPNVVTFS I
Sbjct: 585  GDRGDN--RFMKMFGQLAAEKAGYAKTDRKVRQEILCILGIFQKMHELDIKPNVVTFSAI 642

Query: 1900 LNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSAT 2079
            LNACSRC SF++ASMLLEELR+FD++VYGVAHGLLMG+RD VW KA++LFDEV +MDS+T
Sbjct: 643  LNACSRCNSFEDASMLLEELRLFDNKVYGVAHGLLMGYRDNVWVKAESLFDEVKQMDSST 702

Query: 2080 ASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAW 2259
            ASAFYNALTDMLWH+GQ++GAQLVVL+G+ R VWE+ W  SCLDLHLMS+GAA+AMVHAW
Sbjct: 703  ASAFYNALTDMLWHYGQKQGAQLVVLEGKRRNVWESVWSNSCLDLHLMSSGAARAMVHAW 762

Query: 2260 LLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGR 2439
            LL+IRSIVFEG++LP LLSILTGWGKHSKV GDSTLRRA+E LL ++GAPF +AK N GR
Sbjct: 763  LLNIRSIVFEGQQLPNLLSILTGWGKHSKVVGDSTLRRAIEALLTSMGAPFRVAKCNLGR 822

Query: 2440 FTSTGPVVDAWLRESGTLELLILRD 2514
            F STG +  AWLRESGTLE+L+L D
Sbjct: 823  FISTGSMAAAWLRESGTLEVLVLHD 847


>ref|XP_006410275.1| hypothetical protein EUTSA_v10016219mg [Eutrema salsugineum]
            gi|557111444|gb|ESQ51728.1| hypothetical protein
            EUTSA_v10016219mg [Eutrema salsugineum]
          Length = 885

 Score =  887 bits (2293), Expect = 0.0
 Identities = 469/832 (56%), Positives = 585/832 (70%), Gaps = 18/832 (2%)
 Frame = +1

Query: 91   QEVHHHHP--------TRRVVSSALRTTSGPAHLSTISSNXXXXXXXXXXXQL----GSD 234
            Q  H+H P          R V+SA  ++S    ++T++S            Q      SD
Sbjct: 44   QPTHNHRPWLPQRITSCPRAVTSAPPSSSAAVSVATVASAQLSKTPTLSPLQTPKSDSSD 103

Query: 235  FCGKRPKRGISKNNLGRGNSMKCNQ-SLTAQAVLSDVLNAPLEQPVDNFL----KGKRLV 399
            F G+R  R +SK +LGR  +    + S  A+  L   ++   E  +   L    + K   
Sbjct: 104  FSGRRSTRFVSKMHLGRPKTTTATRRSSAAEDALRSAIDLSGEDEMFQSLLLSFESKLRG 163

Query: 400  LDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQV 579
             +DY  +++ELG+RGEC KAV  +++AV RE+R  EQGKL + MIS LG+LG+V +A+ V
Sbjct: 164  SEDYTFILRELGNRGECDKAVRFYEFAVIRERRRVEQGKLASAMISTLGRLGKVAIAKSV 223

Query: 580  FDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGK 759
            F+ A   GYG  VY FSA++SAYGRSG ++EAI VF+ MK  G  PN++TYNAVIDACGK
Sbjct: 224  FEAALDGGYGNTVYTFSAVISAYGRSGFYEEAIGVFDSMKSYGLKPNLITYNAVIDACGK 283

Query: 760  GGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIF 939
            GG++FKQ  G+FDEM +  V+PDRITFNSL+AVCSRGG W  A++LF++ML RGIEQD+F
Sbjct: 284  GGMEFKQVAGFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMLKRGIEQDVF 343

Query: 940  TYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEM 1119
            TYNTLLDA+CKGG+MDLA   L  M  + I PN ++YSTVIDG+AKAGR +EAL+L+ +M
Sbjct: 344  TYNTLLDAICKGGKMDLAFEILVQMPAKRILPNVVSYSTVIDGFAKAGRFDEALNLFDQM 403

Query: 1120 KNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQGKY 1299
            K  GI LDRVSYNTLLS+Y   GR +EAL I REM   G KKD VTYNALLGGYGKQ KY
Sbjct: 404  KYLGIALDRVSYNTLLSIYTTLGRSKEALDILREMASVGIKKDVVTYNALLGGYGKQRKY 463

Query: 1300 DQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVLYSAY 1479
            D+VK +F EM+   V PN+LTYSTLIDVYSKGG   EAM++  EFK  GL+ DVVLYSA 
Sbjct: 464  DEVKNVFAEMKRDHVLPNLLTYSTLIDVYSKGGLYKEAMEIFREFKSVGLRADVVLYSAL 523

Query: 1480 IDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLADN 1659
            IDALCK+G V  A+ L+ +MT EGI PNVVT NSIID+ GRS+   S   G       D 
Sbjct: 524  IDALCKNGLVSSAVSLIGEMTKEGIRPNVVTYNSIIDAFGRSATMKSAESG-------DG 576

Query: 1660 DELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLS-IPEEIKQKSEDILGAV 1836
                  + + +  SSS +   E  +N    +QI  +    S + +  + K+   ++   +
Sbjct: 577  GASTFEVGSSNIPSSSLSGLTETEDN--QIIQIFGQLTIESFNRMKNDCKEGMHELSCIL 634

Query: 1837 DIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFR 2016
            ++  +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD+RVYGV HGLLMG R
Sbjct: 635  EVIRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNRVYGVVHGLLMGHR 694

Query: 2017 DRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWC 2196
            + VW +AQ+LFD+V  MD +TASAFYNALTDMLWHFGQ++GAQ+V L+GRSRQVWEN W 
Sbjct: 695  ENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAQMVALEGRSRQVWENVWS 754

Query: 2197 ESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRA 2376
            ESCLDLHLMS+GAA+AMVHAWLL+IRSIV+EG ELPKLLSILTGWGKHSKV GD  LR A
Sbjct: 755  ESCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKLLSILTGWGKHSKVVGDGALRPA 814

Query: 2377 VETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRDKIVGKA 2532
            +E LL  + APFHL+K N GRFTS+G VV  WLRES TL+LLIL D I  KA
Sbjct: 815  IEALLRGMNAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHDHITTKA 866


>ref|XP_002881173.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297327012|gb|EFH57432.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 917

 Score =  882 bits (2280), Expect = 0.0
 Identities = 464/821 (56%), Positives = 590/821 (71%), Gaps = 11/821 (1%)
 Frame = +1

Query: 91   QEVHHHHPTRRVVSSALRTTSGPAHLSTI-----SSNXXXXXXXXXXXQLGSDFCGKRPK 255
            Q  +++H    V SS   +   P+ ++T+     S              L SDF G+R  
Sbjct: 83   QNPNYNHRPYGVSSSPRGSAPPPSSVATVAPAQLSQTPNFSPLQTPKSDLSSDFSGRRST 142

Query: 256  RGISKNNLGRGNS-MKCNQSLTAQAVLSDVLNAPLEQPVDNFL----KGKRLVLDDYVSL 420
            R +SK + GR  + M    S  A+  L + ++   +  + + L    + K    DD   +
Sbjct: 143  RFVSKMHFGRPKTTMATRHSSAAEDALQNAIDFSGDDEMFHSLMLSFESKLCGSDDCTYI 202

Query: 421  MKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKAD 600
            ++ELG+RGEC KAV  +++AV RE+R NEQGKL + MIS LG+ G+V +A+++F+ A + 
Sbjct: 203  IRELGNRGECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKVTIAKRIFETAFSG 262

Query: 601  GYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQ 780
            GYG  VYAFSAL+SAYGRSGL +EAISVF  MK  G  PN+VTYNAVIDACGKGG++FKQ
Sbjct: 263  GYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAVIDACGKGGMEFKQ 322

Query: 781  AMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLD 960
               +FDEM +  V+PDRITFNSL+AVCSRGG W  A++LF++M +R IEQD+F+YNTLLD
Sbjct: 323  VAKFFDEMQRNCVQPDRITFNSLLAVCSRGGLWEAARNLFDEMSNRRIEQDVFSYNTLLD 382

Query: 961  ALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPL 1140
            A+CKGGQMDLA   L  M  + I PN ++YSTVIDG+AKAGR +EAL+L+ EM+   I L
Sbjct: 383  AICKGGQMDLAFEILAQMPAKRIMPNVVSYSTVIDGFAKAGRFDEALNLFGEMRYLNIAL 442

Query: 1141 DRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALF 1320
            DRVSYNTLLS+Y K GR EEAL I REM   G KKD VTYNALLGGYGKQGKYD+VK +F
Sbjct: 443  DRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGKQGKYDEVKKVF 502

Query: 1321 EEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKS 1500
             EM+ + V PN+LTYSTLID YSKGG   EAM+V  EFK  GL+ DVVLYSA IDALCK+
Sbjct: 503  AEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEVFREFKSAGLRADVVLYSALIDALCKN 562

Query: 1501 GSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLADNDELVRSL 1680
            G VG A+ L+D+MT EGI+PNVVT NSIID+ GRS+     +  +SA++           
Sbjct: 563  GLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSA-----TMERSADYSNGG------- 610

Query: 1681 ENLSCVSSSPARKNERSNNNANALQIVAR-NAFASLSIPEEIKQKSEDILGAVDIFHRMK 1857
             +L   SS+ +   E   N    +Q+  +  +  +  + ++ K+  +++   +++F +M 
Sbjct: 611  -SLPFSSSALSELTETEGN--RVIQLFGQLTSEGNNRMTKDCKEGMQELSCILEVFRKMH 667

Query: 1858 KLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKA 2037
            +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGV HGLLMG R+ VW +A
Sbjct: 668  QLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMGQRENVWLQA 727

Query: 2038 QTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLH 2217
            Q+LFD+V  MD +TASAFYNALTDMLWHFGQ++GA+LV L+GRSRQVWEN W +SCLDLH
Sbjct: 728  QSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVWSDSCLDLH 787

Query: 2218 LMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLT 2397
            LMS+GAA+AMVHAWLL+IRSIV+EG ELPK+LSILTGWGKHSKV GD  L+RAVE LL  
Sbjct: 788  LMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGALKRAVEVLLRG 847

Query: 2398 LGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRDKI 2520
            + APFHL+K N GRFTS+G VV  WLRES TL+LLIL D I
Sbjct: 848  MDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHDHI 888


>ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidopsis thaliana]
            gi|75206083|sp|Q9SIC9.1|PP178_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g31400, chloroplastic; Flags: Precursor
            gi|4589961|gb|AAD26479.1| unknown protein [Arabidopsis
            thaliana] gi|330253448|gb|AEC08542.1| genomes uncoupled 1
            protein [Arabidopsis thaliana]
          Length = 918

 Score =  882 bits (2279), Expect = 0.0
 Identities = 456/772 (59%), Positives = 575/772 (74%), Gaps = 6/772 (0%)
 Frame = +1

Query: 223  LGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLNAPLEQPVDNFL----KG 387
            L SDF G+R  R +SK + GR  + M    S  A+  L + ++   +  + + L    + 
Sbjct: 132  LSSDFSGRRSTRFVSKMHFGRQKTTMATRHSSAAEDALQNAIDFSGDDEMFHSLMLSFES 191

Query: 388  KRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 567
            K    DD   +++ELG+R EC KAV  +++AV RE+R NEQGKL + MIS LG+ G+V +
Sbjct: 192  KLCGSDDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKVTI 251

Query: 568  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 747
            A+++F+ A A GYG  VYAFSAL+SAYGRSGL +EAISVF  MK  G  PN+VTYNAVID
Sbjct: 252  AKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAVID 311

Query: 748  ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 927
            ACGKGG++FKQ   +FDEM +  V+PDRITFNSL+AVCSRGG W  A++LF++M +R IE
Sbjct: 312  ACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRRIE 371

Query: 928  QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1107
            QD+F+YNTLLDA+CKGGQMDLA   L  M  + I PN ++YSTVIDG+AKAGR +EAL+L
Sbjct: 372  QDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEALNL 431

Query: 1108 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1287
            + EM+  GI LDRVSYNTLLS+Y K GR EEAL I REM   G KKD VTYNALLGGYGK
Sbjct: 432  FGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGK 491

Query: 1288 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVL 1467
            QGKYD+VK +F EM+ + V PN+LTYSTLID YSKGG   EAM++  EFK  GL+ DVVL
Sbjct: 492  QGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADVVL 551

Query: 1468 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEF 1647
            YSA IDALCK+G VG A+ L+D+MT EGI+PNVVT NSIID+ GRS+     +  +SA++
Sbjct: 552  YSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSA-----TMDRSADY 606

Query: 1648 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLS-IPEEIKQKSEDI 1824
             ++   L  S   LS ++ +   +          +Q+  +    S +   ++ ++  +++
Sbjct: 607  -SNGGSLPFSSSALSALTETEGNR---------VIQLFGQLTTESNNRTTKDCEEGMQEL 656

Query: 1825 LGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLL 2004
               +++F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGV HGLL
Sbjct: 657  SCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLL 716

Query: 2005 MGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWE 2184
            MG R+ VW +AQ+LFD+V  MD +TASAFYNALTDMLWHFGQ++GA+LV L+GRSRQVWE
Sbjct: 717  MGQRENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWE 776

Query: 2185 NAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDST 2364
            N W +SCLDLHLMS+GAA+AMVHAWLL+IRSIV+EG ELPK+LSILTGWGKHSKV GD  
Sbjct: 777  NVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGA 836

Query: 2365 LRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRDKI 2520
            LRRAVE LL  + APFHL+K N GRFTS+G VV  WLRES TL+LLIL D I
Sbjct: 837  LRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHDHI 888


>ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345387|gb|EEE80792.2| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 864

 Score =  879 bits (2271), Expect = 0.0
 Identities = 457/773 (59%), Positives = 578/773 (74%), Gaps = 8/773 (1%)
 Frame = +1

Query: 220  QLGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLN-----APLEQPVDNFL 381
            +L SDF G+R  R +SK + GR  + M    +  AQ  L +V+        LE  + NF 
Sbjct: 91   ELVSDFPGRRSTRFVSKLHFGRPRTTMGTRHTSVAQEALQNVIEYGKDERALENVLLNF- 149

Query: 382  KGKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRV 561
            + +    DDYV L++ELG+RG+C KA+ CF++AV RE++ NEQGKL + MIS LG+LG+V
Sbjct: 150  ESRLSGSDDYVFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMISTLGRLGKV 209

Query: 562  DLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAV 741
            ++A+ VF  A  +GYG  VYAFSA++SAYGRSG   EAI +F  MK  G  PN+VTYNAV
Sbjct: 210  EMAKTVFKAALTEGYGNTVYAFSAIISAYGRSGYCNEAIKIFYSMKDYGLKPNLVTYNAV 269

Query: 742  IDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRG 921
            IDACGKGGV+FK+ +  FDEM++  ++PDRITFNSL+AVCS+GG W  A+ L  +M++RG
Sbjct: 270  IDACGKGGVEFKRVLEIFDEMLRNGMQPDRITFNSLLAVCSKGGLWEAARSLSCEMVNRG 329

Query: 922  IEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEAL 1101
            I+QDIFTYNTLLDA+CKGGQ+D+A   + +M  +NI PN +TYST+IDGYAKAGRL++A 
Sbjct: 330  IDQDIFTYNTLLDAVCKGGQLDMAFEIMSEMPAKNILPNVVTYSTMIDGYAKAGRLDDAR 389

Query: 1102 DLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGY 1281
            +L++EMK  GI LDRVSYNTLLS+YAK GRFEEA+ +CREME++G +KD VTYNALLGGY
Sbjct: 390  NLFNEMKFLGISLDRVSYNTLLSIYAKLGRFEEAMDVCREMENSGIRKDVVTYNALLGGY 449

Query: 1282 GKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDV 1461
            GKQ KYD V+ +FEEM+ + V+PN+LTYSTLIDVYSKGG   EAM V  EFK+ GLK DV
Sbjct: 450  GKQYKYDVVRKVFEEMKARHVSPNLLTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADV 509

Query: 1462 VLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNS--RSYGQ 1635
            VLYSA IDALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GR +   S     GQ
Sbjct: 510  VLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPATTESVVDDAGQ 569

Query: 1636 SAEFLADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKS 1815
            ++E           +++LS  +   A K+  ++   N + I      A+    +      
Sbjct: 570  TSEL---------QIDSLSSSAVEKATKSLVADREDNRI-IKIFGQLAAEKAGQAKNSGG 619

Query: 1816 EDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAH 1995
            ++++  + +FH+M +L IKPNVVTFS ILNACSRC SF+EASMLLEELR+FD++VYGVAH
Sbjct: 620  QEMMCILGVFHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELRLFDNQVYGVAH 679

Query: 1996 GLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQ 2175
            GLLMG+R+ VW +AQ+LFDEV  MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ RQ
Sbjct: 680  GLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQ 739

Query: 2176 VWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTG 2355
            VWEN W ESCLDLHLMS+GAA+AMVHAWLL++R+IVFEG E+PKLL         SKV G
Sbjct: 740  VWENVWSESCLDLHLMSSGAARAMVHAWLLNVRAIVFEGHEVPKLL---------SKVVG 790

Query: 2356 DSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
            DSTLRRAVE LL+ +GAPF  AK N GR  STG VV +WLRESGTL++L+L D
Sbjct: 791  DSTLRRAVEALLMGMGAPFRSAKCNLGRLISTGSVVASWLRESGTLKVLVLHD 843


>ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 870

 Score =  878 bits (2268), Expect = 0.0
 Identities = 453/769 (58%), Positives = 569/769 (73%), Gaps = 5/769 (0%)
 Frame = +1

Query: 223  LGSDFCGKRPKRGISKNNLGRGNSM--KCNQSLTAQAVLSDVLNAPLEQPVDNFLKG--K 390
            L S F G+R  R +SK +LGR  +     +  L  +A+ + +     +  +D+ L     
Sbjct: 82   LVSAFSGRRSTRMVSKMHLGRPKTTVGSRHSPLAEEALETAIRFGKDDFALDDVLHSFES 141

Query: 391  RLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLA 570
            RLV DD+  L++ELG+RGEC KA+ CF++AV RE++  EQGKL + MIS LG+LG+V+LA
Sbjct: 142  RLVSDDFTFLLRELGNRGECWKAIRCFEFAVRRERKRTEQGKLASSMISTLGRLGKVELA 201

Query: 571  QQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDA 750
            + VF  A  +GYG+ VY +SAL+SAYGRSG   EAI V E MK SG  PN+VTYNAVIDA
Sbjct: 202  KNVFQTAVNEGYGRTVYTYSALISAYGRSGYCDEAIRVLESMKDSGVKPNLVTYNAVIDA 261

Query: 751  CGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQ 930
            CGKGGV+FK+ +  FDEM+K  V+PDRIT+NSL+AVCSRGG W  A++LF +M+ RGI+Q
Sbjct: 262  CGKGGVEFKKVVEIFDEMLKVGVQPDRITYNSLLAVCSRGGLWEAARNLFSEMVDRGIDQ 321

Query: 931  DIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLY 1110
            DI+TYNTLLDA+ KGGQMDLA   + +M  +NI PN +TYST+IDGYAKAGRL +AL+L+
Sbjct: 322  DIYTYNTLLDAISKGGQMDLAYKIMSEMPSKNILPNVVTYSTMIDGYAKAGRLEDALNLF 381

Query: 1111 HEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQ 1290
            +EMK   I LDRV YNTLLS+Y K GRFEEAL +C+EME  G  KD V+YNALLGGYGKQ
Sbjct: 382  NEMKFLAIGLDRVLYNTLLSLYGKLGRFEEALNVCKEMESVGIAKDVVSYNALLGGYGKQ 441

Query: 1291 GKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVLY 1470
            GKYD+VK L+ EM+ +RV+PN+LTYSTLIDVYSKGG   EA++V  EFKQ GLK DVVLY
Sbjct: 442  GKYDEVKGLYNEMKVERVSPNLLTYSTLIDVYSKGGLYAEAVKVFREFKQAGLKADVVLY 501

Query: 1471 SAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGR-SSKGNSRSYGQSAEF 1647
            S  I+ALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GR ++   +   G     
Sbjct: 502  SELINALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPATTVCAVDAGACGIV 561

Query: 1648 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKSEDIL 1827
            L        S  +   +S    +   R   +   +++  +         ++ ++  ++IL
Sbjct: 562  LRSESSSSISARDFD-ISDKNVQNEMRDREDTRIMKMFGQLTADKAGYAKKDRKVRQEIL 620

Query: 1828 GAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLM 2007
              + +F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGVAHGLLM
Sbjct: 621  CILGVFQKMHELDIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLM 680

Query: 2008 GFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWEN 2187
            G R  VW KAQ+LFDEV +MD +TASAFYNALTDMLWHFGQ+KGAQLVVL+G  R VWEN
Sbjct: 681  GCRGNVWVKAQSLFDEVKQMDCSTASAFYNALTDMLWHFGQKKGAQLVVLEGERRNVWEN 740

Query: 2188 AWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTL 2367
            AW  S LDLHLMS+GAA+AMVHAWLL+I SIV++G++LP LLSILTGWGKHSKV GDS L
Sbjct: 741  AWSNSRLDLHLMSSGAARAMVHAWLLNIHSIVYQGQQLPNLLSILTGWGKHSKVVGDSAL 800

Query: 2368 RRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
            RRAVE LL ++GAPF + + N GRF STG V  AWL+ESGTLE+L+L D
Sbjct: 801  RRAVEALLTSMGAPFRVHECNIGRFISTGSVAAAWLKESGTLEVLMLHD 849


>ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like isoform 2 [Solanum lycopersicum]
          Length = 829

 Score =  855 bits (2210), Expect = 0.0
 Identities = 444/769 (57%), Positives = 563/769 (73%), Gaps = 7/769 (0%)
 Frame = +1

Query: 229  SDFCGKRPKRGISKNNLGRGN-SMKCNQSLTAQAVLSDVLN-----APLEQPVDNFLKGK 390
            +DF G+R  R +SK + GR   S     S  AQ  L + +      A L+Q +  F  G 
Sbjct: 72   ADFSGRRSTRFVSKMHFGRAKISGNGRHSSFAQEALEEAIRCCNNEAGLDQVLLTF--GS 129

Query: 391  RLV-LDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 567
            +LV  DDY  L +ELG+RGE   A+ CF++AV RE++ NEQGKL + MISILG+ G+VDL
Sbjct: 130  KLVGSDDYTFLFRELGNRGEWLAAMRCFQFAVGRERKRNEQGKLASSMISILGRSGKVDL 189

Query: 568  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 747
            A++VF+ A +DGYG  VYA+SAL+SAY +SG   EAI VFE MK SG  PN+VTYNA+ID
Sbjct: 190  AEKVFENAVSDGYGSTVYAYSALISAYAKSGYCNEAIRVFETMKDSGLKPNLVTYNALID 249

Query: 748  ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 927
            ACGKGG DFK+A   FDEM++  V+PDRITFNSL+AVCS  G W  A+ LF +M++RGI+
Sbjct: 250  ACGKGGADFKRASEIFDEMLRNGVQPDRITFNSLLAVCSGAGLWETARGLFNEMIYRGID 309

Query: 928  QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1107
            QDI+TYNT LD  C GGQ+D+A   + +M  +NI PN +TYSTVI G AKAGRL++AL L
Sbjct: 310  QDIYTYNTFLDVACNGGQIDVAFDIMSEMHAKNILPNQVTYSTVIRGCAKAGRLDKALSL 369

Query: 1108 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1287
            ++EMK AGI LDRVSYNTLL++YA  G+FEEAL + +EME  G KKD VTYNALL G+GK
Sbjct: 370  FNEMKCAGIKLDRVSYNTLLAIYASLGKFEEALNVSKEMEGMGIKKDVVTYNALLDGFGK 429

Query: 1288 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGFQTEAMQVLNEFKQTGLKPDVVL 1467
            QG Y +VK LF EM+ ++++PN+LTYSTLI VY KG    +A++V  EFK+ GLK DVV 
Sbjct: 430  QGMYTKVKQLFAEMKAEKLSPNLLTYSTLISVYLKGALYHDAVEVYKEFKKQGLKADVVF 489

Query: 1468 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEF 1647
            YS  IDALCK G V  +  LL++MT EGI PNVVT NSII++ G S+     S       
Sbjct: 490  YSKLIDALCKKGLVEYSSLLLNEMTKEGIQPNVVTYNSIINAFGESANNECGS------- 542

Query: 1648 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKSEDIL 1827
                       +N++ + S+ ++    +    N ++I  + A    +  ++   + +D+L
Sbjct: 543  -----------DNVTHIVSAISQSKWENTEEDNIVKIFEQLAAQKSASGKKTNAERQDML 591

Query: 1828 GAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLM 2007
              + +FH+M +L IKPNVVTFS ILNACSRC SF EAS+LLEELR+FD++VYGVAHGLLM
Sbjct: 592  CILGVFHKMHELQIKPNVVTFSAILNACSRCSSFDEASLLLEELRLFDNQVYGVAHGLLM 651

Query: 2008 GFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWEN 2187
            G R+ VWS+A +LF+EV +MDS+TASAFYNALTDMLWHF Q++GAQLVVL+G+  +VWEN
Sbjct: 652  GQREGVWSQALSLFNEVKQMDSSTASAFYNALTDMLWHFDQKQGAQLVVLEGKRSEVWEN 711

Query: 2188 AWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTL 2367
             W  SCLDLHLMS+GAA AMVHAWLLSIRSIVFEG ELPK+LSILTGWGKHSK+TGD  L
Sbjct: 712  TWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVFEGHELPKMLSILTGWGKHSKITGDGAL 771

Query: 2368 RRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2514
            +RA+E LL ++GAPF +AK N GRF STG VV AWLRESGTLE+L+L+D
Sbjct: 772  KRAIEGLLTSIGAPFQIAKCNIGRFISTGAVVTAWLRESGTLEVLVLQD 820

