BLASTX nr result

ID: Ephedra26_contig00000389 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00000389
         (3235 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006841446.1| hypothetical protein AMTR_s00003p00075520 [A...   941   0.0  
gb|EOX95298.1| S uncoupled 1 [Theobroma cacao]                        929   0.0  
ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containi...   929   0.0  
ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Popu...   924   0.0  
ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containi...   922   0.0  
ref|XP_002515260.1| pentatricopeptide repeat-containing protein,...   922   0.0  
ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citr...   921   0.0  
ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutr...   914   0.0  
ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Popu...   909   0.0  
gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]     896   0.0  
ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containi...   896   0.0  
ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containi...   895   0.0  
ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Caps...   891   0.0  
gb|EMJ22752.1| hypothetical protein PRUPE_ppa001263mg [Prunus pe...   889   0.0  
ref|XP_006410275.1| hypothetical protein EUTSA_v10016219mg [Eutr...   889   0.0  
ref|XP_002881173.1| pentatricopeptide repeat-containing protein ...   884   0.0  
ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidop...   884   0.0  
ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Popu...   880   0.0  
ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containi...   879   0.0  
ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containi...   857   0.0  

>ref|XP_006841446.1| hypothetical protein AMTR_s00003p00075520 [Amborella trichopoda]
            gi|548843467|gb|ERN03121.1| hypothetical protein
            AMTR_s00003p00075520 [Amborella trichopoda]
          Length = 857

 Score =  941 bits (2432), Expect = 0.0
 Identities = 488/826 (59%), Positives = 617/826 (74%), Gaps = 19/826 (2%)
 Frame = +3

Query: 495  VHHHHPTRRVV-SSALRTTSGPAHLS--------TISSNXXXXXXXXXXXQLGSDFCGKR 647
            VHHH P ++   +SA + TS  A  S        + SS+           +LGSDF G+R
Sbjct: 20   VHHHQPPQKFTFNSATKPTSKNASASHSLSPNFPSFSSSLSHPQTQKPKPELGSDFNGRR 79

Query: 648  PKRGISKNNLGRGNSMKCNQSLTAQAVLSDVLNAPLEQPVDNFLKGKRLVL---DDYVSL 818
              R +SK +  R        S  A+  L  +  A  +  V+  L      +   +D++ L
Sbjct: 80   STRFVSKMHFNRPKHGPKRHSSVAETALGHLTCADSDATVEAILTNLVFSVSSSEDFLFL 139

Query: 819  MKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKAD 998
            ++ELG+RGECSKA+ CF++AVSREKR  EQGKLV++MISILG+LG+VD+A++VF+ A+ D
Sbjct: 140  LRELGNRGECSKAIRCFEFAVSREKRRTEQGKLVSVMISILGRLGKVDIAREVFETARKD 199

Query: 999  GYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQ 1178
            GYG +VYAFS+L++AYGRSG   EA+ VFE+M+ SG  PN+VTYN+VIDACGKGGV+F +
Sbjct: 200  GYGNSVYAFSSLINAYGRSGHCGEALGVFEMMRNSGFKPNLVTYNSVIDACGKGGVEFSR 259

Query: 1179 AMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLD 1358
            A+  F+EM +  VKPDRITFNSL+AVCSRGG W EA+  F +M+ RGI++D+FTYNTLLD
Sbjct: 260  ALKVFEEMEREGVKPDRITFNSLLAVCSRGGFWEEAKKCFNEMVFRGIDRDVFTYNTLLD 319

Query: 1359 ALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPL 1538
            A+CKGGQM+LA   + DM  +N+ PN +TYST+IDGY KAGRL EAL+L+ EMK AGI L
Sbjct: 320  AVCKGGQMELALEIMSDMPSKNVLPNVVTYSTMIDGYFKAGRLEEALNLFQEMKLAGINL 379

Query: 1539 DRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALF 1718
            DRVSYNTLLS+YA+ G F++ALR+C EME AG K+D VTYN+LLGGYGKQGKYD VK LF
Sbjct: 380  DRVSYNTLLSIYARMGLFDDALRVCGEMERAGIKRDAVTYNSLLGGYGKQGKYDVVKHLF 439

Query: 1719 EEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKS 1898
            +EM+ + V PNVLTYSTLID+YSKGGL  EA++V  EFK+ GLK DVVLYSA IDALCK+
Sbjct: 440  KEMKVEAVRPNVLTYSTLIDIYSKGGLLKEALEVFMEFKRVGLKADVVLYSALIDALCKN 499

Query: 1899 GSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSS----KGNSRSYGQ---SAEFLADN 2057
            G V  A  LLD+MT EGI PNVVT N IID+ GRS+    + +S   G+    +  +  +
Sbjct: 500  GLVESAFLLLDEMTGEGIRPNVVTYNCIIDAFGRSNQTQVQNDSYEMGKGPLDSSMIDSS 559

Query: 2058 DELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKSEDILGAVD 2237
             E+V     L+ VS   A++NE  ++    L     +      + + +K KS ++L  + 
Sbjct: 560  SEIV-----LAEVSRGMAKENEGIDHLVKMLGPPPLD--KRHPVIKNMKGKSHEMLCILA 612

Query: 2238 IFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRD 2417
            +FH+M ++ I+PNVVTFS ILNACSRC SF +ASMLLEELR+FD++VYGVAHGLLMG R 
Sbjct: 613  LFHKMHEMDIRPNVVTFSAILNACSRCHSFDDASMLLEELRLFDNQVYGVAHGLLMGLRK 672

Query: 2418 RVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCE 2597
             +W +AQ+LFDEV RMDS+TASAFYNALTDMLWHFGQR+GAQLVV++G+ RQVWEN WCE
Sbjct: 673  DIWVQAQSLFDEVRRMDSSTASAFYNALTDMLWHFGQRRGAQLVVMEGKRRQVWENVWCE 732

Query: 2598 SCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAV 2777
            SCLDLHLMSAGAAQAMVHAWLL+IRS+VFEG ELPKLL+ILTGWGKHSKV GDS+LR+A+
Sbjct: 733  SCLDLHLMSAGAAQAMVHAWLLTIRSVVFEGHELPKLLNILTGWGKHSKVAGDSSLRKAI 792

Query: 2778 ETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRDK 2915
            E LL ++GAPF +AKFN GRF STG VV AWL+ES TL+LLIL D+
Sbjct: 793  EALLTSIGAPFEVAKFNVGRFISTGAVVGAWLKESRTLKLLILHDE 838


>gb|EOX95298.1| S uncoupled 1 [Theobroma cacao]
          Length = 866

 Score =  929 bits (2402), Expect = 0.0
 Identities = 476/768 (61%), Positives = 585/768 (76%), Gaps = 4/768 (0%)
 Frame = +3

Query: 621  LGSDFCGKRPKRGISKNNLGRGN-SMKCNQSLTAQAVLSDVLN---APLEQPVDNFLKGK 788
            L  DF G+R  R +SK +LGR   S     +  A+ VL   L+   + LE+ + +F + K
Sbjct: 86   LAPDFSGRRSTRFVSKMHLGRPKTSTNTRHTSIAEEVLQLALHNGHSGLERVLVSF-ESK 144

Query: 789  RLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLA 968
                DDY  L++ELG+RGE  KA++CF++AV RE+R  EQGKL + MISILG+LG+V+LA
Sbjct: 145  LCGSDDYTFLLRELGNRGEYEKAIKCFQFAVRRERRKTEQGKLASAMISILGRLGKVELA 204

Query: 969  QQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDA 1148
            + +F+ A  +GYG  VYAFSAL+SA+GRSG   EAI VF+ MK +G  PN+VTYNAVIDA
Sbjct: 205  KGIFETALTEGYGNTVYAFSALISAFGRSGYSDEAIKVFDSMKNNGLKPNLVTYNAVIDA 264

Query: 1149 CGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQ 1328
            CGKGGV+FK+ +  FDEM++  V+PDRITFNSL+AVCSRGG W  A++LF +M+HRGI+Q
Sbjct: 265  CGKGGVEFKRVVEIFDEMLRSGVQPDRITFNSLLAVCSRGGLWEAARNLFSEMVHRGIDQ 324

Query: 1329 DIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLY 1508
            DIFTYNTLLDA+CKGGQMDLA   + +M  +NI PN +TYST+IDGYAKAGR ++AL+L+
Sbjct: 325  DIFTYNTLLDAVCKGGQMDLAFEIMAEMPTKNILPNVVTYSTMIDGYAKAGRFDDALNLF 384

Query: 1509 HEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQ 1688
            +EMK  GI LDRVSYNT+LS+YAK GRFEEAL ICREME +G +KD VTYNALLGGYGKQ
Sbjct: 385  NEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALDICREMEGSGIRKDVVTYNALLGGYGKQ 444

Query: 1689 GKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVLY 1868
            GKYD+V+ LFEEM+ Q+V+PN+LTYST+IDVYSKGGL  EAM V  EFK+ GLK DVVLY
Sbjct: 445  GKYDEVRRLFEEMKTQKVSPNLLTYSTVIDVYSKGGLYEEAMDVFREFKRVGLKADVVLY 504

Query: 1869 SAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFL 2048
            SA IDALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GRS+            F 
Sbjct: 505  SALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSAT-------SECAFD 557

Query: 2049 ADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKSEDILG 2228
            A  +      E+ S V         R   +   ++   + A       ++  +  ++IL 
Sbjct: 558  AGGEISALQTESSSLVIGHSIEGKARDGEDNQVIKFFGQLAAEKGGQAKKDCRGKQEILC 617

Query: 2229 AVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMG 2408
             + +F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGVAHGLLMG
Sbjct: 618  ILGVFQKMHELEIKPNVVTFSAILNACSRCDSFEDASMLLEELRLFDNQVYGVAHGLLMG 677

Query: 2409 FRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENA 2588
            +R+ VW +AQ+LFDEV  MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ RQVWEN 
Sbjct: 678  YRENVWIQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENV 737

Query: 2589 WCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLR 2768
            W  SCLDLHLMS+GAA+AMVHAWLL+IRSI+FEG ELPKLLSILTGWGKHSKV GD  LR
Sbjct: 738  WSNSCLDLHLMSSGAARAMVHAWLLNIRSIIFEGHELPKLLSILTGWGKHSKVVGDGALR 797

Query: 2769 RAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
            R VE+L   +GAPF LAK N GRF STGPVV AWLRESGTL+LL+L D
Sbjct: 798  RTVESLFTGMGAPFRLAKCNLGRFVSTGPVVTAWLRESGTLKLLVLHD 845


>ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic [Vitis vinifera]
          Length = 867

 Score =  929 bits (2401), Expect = 0.0
 Identities = 480/773 (62%), Positives = 591/773 (76%), Gaps = 8/773 (1%)
 Frame = +3

Query: 618  QLGSDFCGKRPKRGISKNNLGRGNSMKC--NQSLTAQAVLSDVLNAPLEQPVDNFL---K 782
            +L +DF G+R  R +SK + GR  +     + S   +A+   +  A  ++ +D+ L   +
Sbjct: 83   ELTADFSGRRSTRFVSKMHFGRPKTAAAARHTSTAEEALRHAIRFASDDKGIDSVLLNFE 142

Query: 783  GKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVD 962
             +    DDY  L++ELG+RGE +KA+ CF++AV RE+R NEQGKL + MISILG+LG+V+
Sbjct: 143  SRLCGSDDYTFLLRELGNRGEWAKAIRCFEFAVRREQRRNEQGKLASAMISILGRLGQVE 202

Query: 963  LAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVI 1142
            LA+ VF+ A  +GYG  VYAFSAL+SAYGRSG   EAI VFE MK SG  PN+VTYNAVI
Sbjct: 203  LAKNVFETALNEGYGNTVYAFSALISAYGRSGYCDEAIKVFETMKSSGLKPNLVTYNAVI 262

Query: 1143 DACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGI 1322
            DACGKGGVDF +A   FDEM++  V+PDRITFNSL+AVC RGG W  A++LF +ML+RGI
Sbjct: 263  DACGKGGVDFNRAAEIFDEMLRNGVQPDRITFNSLLAVCGRGGLWEAARNLFSEMLYRGI 322

Query: 1323 EQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALD 1502
            EQDIFTYNTLLDA+CKGGQMDLA   + +M  ++I PN +TYSTVIDGYAKAGRL+EAL+
Sbjct: 323  EQDIFTYNTLLDAVCKGGQMDLAFQIMSEMPRKHIMPNVVTYSTVIDGYAKAGRLDEALN 382

Query: 1503 LYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYG 1682
            L++EMK A I LDRVSYNTLLS+YAK GRFEEAL +C+EME +G KKD VTYNALLGGYG
Sbjct: 383  LFNEMKFASIGLDRVSYNTLLSIYAKLGRFEEALNVCKEMESSGIKKDAVTYNALLGGYG 442

Query: 1683 KQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVV 1862
            KQGKY++VK +FEEM+ +R+ PN+LTYSTLIDVYSKGGL  EAM+V  EFK+ GLK DVV
Sbjct: 443  KQGKYEEVKRVFEEMKAERIFPNLLTYSTLIDVYSKGGLYQEAMEVFREFKKAGLKADVV 502

Query: 1863 LYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAE 2042
            LYSA IDALCK+G V  A+  LD+MT EGI PNVVT NSIID+ GRS          SAE
Sbjct: 503  LYSALIDALCKNGLVESAVSFLDEMTKEGIRPNVVTYNSIIDAFGRSG---------SAE 553

Query: 2043 FLAD--NDELVRSLENLSCVSSSPARKNERSNNNAN-ALQIVARNAFASLSIPEEIKQKS 2213
             + D   +  V  + + S      A ++E  +   N  ++I  + A       ++  +  
Sbjct: 554  CVIDPPYETNVSKMSSSSLKVVEDATESEVGDKEDNQIIKIFGQLAAEKTCHAKKENRGR 613

Query: 2214 EDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAH 2393
            ++IL  + +FH+M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGVAH
Sbjct: 614  QEILCILAVFHKMHELDIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVAH 673

Query: 2394 GLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQ 2573
            GLLMG+ D VW +AQ+LFDEV +MDS+TASAFYNALTDMLWHFGQR+GAQLVVL+G+ R 
Sbjct: 674  GLLMGYGDNVWVQAQSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQLVVLEGKRRH 733

Query: 2574 VWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTG 2753
            VWEN W  SCLDLHLMS+GAA+AMVHAWLL+IRSIVFEG ELP+LLSILTGWGKHSKV G
Sbjct: 734  VWENMWSNSCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPQLLSILTGWGKHSKVVG 793

Query: 2754 DSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
            D  LRRA+E LL  +GAPF +AK N GRF STG VV AWLRESGTL++L+L D
Sbjct: 794  DGALRRAIEALLTGMGAPFRVAKCNLGRFISTGAVVAAWLRESGTLKVLVLHD 846


>ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Populus trichocarpa]
            gi|550323986|gb|EEE99285.2| hypothetical protein
            POPTR_0014s11380g [Populus trichocarpa]
          Length = 875

 Score =  924 bits (2389), Expect = 0.0
 Identities = 476/775 (61%), Positives = 595/775 (76%), Gaps = 10/775 (1%)
 Frame = +3

Query: 618  QLGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLN-----APLEQPVDNFL 779
            +L SDF G+R  R +SK N GR  + M    +  A+  L +V+        LE  + NF 
Sbjct: 93   ELASDFSGRRSTRFVSKLNFGRPRTTMGTRHTSVAEEALQNVIEYGKDEGALENVLLNF- 151

Query: 780  KGKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRV 959
            + +    DDY+ L++ELG+RG+C KA+ CF++AV RE++ NEQGKL + MIS LG+LG+V
Sbjct: 152  ESRLSGSDDYIFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMISTLGRLGKV 211

Query: 960  DLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAV 1139
            ++A+ VF+ A  +GYG  VYAFSA++SAYGRSG   EAI VF+ MK  G  PN+VTYNAV
Sbjct: 212  EIAKSVFEAALIEGYGNTVYAFSAIISAYGRSGYCDEAIKVFDSMKHYGLKPNLVTYNAV 271

Query: 1140 IDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRG 1319
            IDACGKGGV+FK+ +  FDEM++  V+PDRITFNSL+AVCSRGG W  A+ L  +ML+RG
Sbjct: 272  IDACGKGGVEFKRVVEIFDEMLRNGVQPDRITFNSLLAVCSRGGLWEAARSLSSEMLNRG 331

Query: 1320 IEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEAL 1499
            I+QDIFTYNTLLDA+CKGGQMD+A   + +M  +NI PN +TYST+IDGYAKAGR ++AL
Sbjct: 332  IDQDIFTYNTLLDAVCKGGQMDMAFEIMSEMPAKNILPNVVTYSTMIDGYAKAGRFDDAL 391

Query: 1500 DLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGY 1679
            +L++EMK   I LDRVSYNTLLS+YAK GRF+EAL +CREME+ G +KD VTYNALLGGY
Sbjct: 392  NLFNEMKFLCISLDRVSYNTLLSIYAKLGRFQEALDVCREMENCGIRKDVVTYNALLGGY 451

Query: 1680 GKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDV 1859
            GKQ KYD+V+ +F EM+  RV+PN+LTYSTLIDVYSKGGL  EAM V  EFK+ GLK DV
Sbjct: 452  GKQCKYDEVRRVFGEMKAGRVSPNLLTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADV 511

Query: 1860 VLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSA 2039
            VLYSA IDALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GRS+   S       
Sbjct: 512  VLYSAVIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSAITES------- 564

Query: 2040 EFLADNDELVR-SLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKS- 2213
              + DN +  +  +E+LS      A K+  ++   N +  +    F  L++ +  + K+ 
Sbjct: 565  -VVDDNVQTSQLQIESLSSGVVEEATKSLLADREGNRIIKI----FGQLAVEKAGQAKNC 619

Query: 2214 --EDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGV 2387
              ++++  + +FH+M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGV
Sbjct: 620  SGQEMMCILAVFHKMHELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGV 679

Query: 2388 AHGLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRS 2567
            AHGLLMG+R+ VW +AQ+LFDEV  MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ 
Sbjct: 680  AHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKR 739

Query: 2568 RQVWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKV 2747
            RQVWEN W ESCLDLHLMS+GAA+AMVHAWLL+IRSIVFEG ELPKLLSILTGWGKHSKV
Sbjct: 740  RQVWENVWSESCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKV 799

Query: 2748 TGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
             GDSTLRRA+E LL+ +GAPF LAK N GRF STG VV AWLRESGTL++L+L D
Sbjct: 800  VGDSTLRRAIEALLMGMGAPFRLAKCNLGRFISTGSVVAAWLRESGTLKVLVLHD 854


>ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Citrus sinensis]
          Length = 877

 Score =  922 bits (2383), Expect = 0.0
 Identities = 492/864 (56%), Positives = 608/864 (70%), Gaps = 32/864 (3%)
 Frame = +3

Query: 417  PHCPVPAISQXXXXXXXXXXXXELQEVHHHHPTRRV-------------VSSALRTTSGP 557
            PHC + A                      HHP+ R              +S + R    P
Sbjct: 6    PHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPRNAPKP 65

Query: 558  AHLST-ISSNXXXXXXXXXXX----QLGSDFCGKRPKRGISKNNLGRGN-SMKCNQSLTA 719
            A  ST ++ N               +L  DF G+R  R +SK + GR   +M    S+ A
Sbjct: 66   AATSTTVAPNPKPFHSLSPLPSSKSELAPDFSGRRSTRFVSKMHFGRPKIAMSTRHSVVA 125

Query: 720  QAVLSDVLN-APLEQPVDNFLKGKRLVL---DDYVSLMKELGSRGECSKAVECFKWAVSR 887
            +  L  V   A  +  + + LK     L   DDY  L++ELG+RGE SKA++CF +AV R
Sbjct: 126  EEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSKAIQCFAFAVKR 185

Query: 888  EKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWK 1067
            E+R N+QGKL + MISILG+LG+VDLA+ +F+ A  +GYG  VYAFSAL+SAYGRSG  +
Sbjct: 186  EERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSALISAYGRSGYCQ 245

Query: 1068 EAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSL 1247
            EAISVF  MKR    PN+VTYNAVIDACGKGGVDFK  +  FD+M++  V+PDRITFNSL
Sbjct: 246  EAISVFNSMKRYNLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNGVQPDRITFNSL 305

Query: 1248 IAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNI 1427
            +AVCSRGG W  A++LF +M+HRGI+QDIFTYNTLLDA+CKG QMDLA   + +M  +NI
Sbjct: 306  LAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAFEIMAEMPAKNI 365

Query: 1428 YPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALR 1607
             PN +TYST+IDGYAKAGRL++AL+++ EMK  GI LDRVSYNT+LS+YAK GRFEEAL 
Sbjct: 366  SPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALL 425

Query: 1608 ICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYS 1787
            +C+EME +G +KD VTYNALLGGYGKQGKYD+V+ +FE+M+   V+PN+LTYSTLIDVYS
Sbjct: 426  VCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNLLTYSTLIDVYS 485

Query: 1788 KGGLQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVV 1967
            KGGL  EAMQ+  EFKQ GLK DVVLYSA IDALCK+G V  A+ LLD+MT EGI PNVV
Sbjct: 486  KGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVV 545

Query: 1968 TCNSIIDSLGRSSKGNSRSYGQSAEFLADNDE----LVRSLENLSCVSSSPARKNERSNN 2135
            T NSIID+ GRS+         + E   D+ E      +   NL  + S   +  + +  
Sbjct: 546  TYNSIIDAFGRSA---------TTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEAGR 596

Query: 2136 NANAL-----QIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVIL 2300
              N +     Q+VA  A       ++  +  ++IL  + +F +M KL IKPNVVTFS IL
Sbjct: 597  TDNQIIKVFGQLVAEKAGQG----KKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAIL 652

Query: 2301 NACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATA 2480
            NACSRC SF++ASMLLEELR+FD++VYGVAHGLLMG+RD +W +A +LFDEV  MDS+TA
Sbjct: 653  NACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSSTA 712

Query: 2481 SAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWL 2660
            SAFYNALTDMLWHFGQ++GAQLVVL+G+ RQVWEN W ESCLDLHLMS+GAA+AMVHAWL
Sbjct: 713  SAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWL 772

Query: 2661 LSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRF 2840
            L+I SIVFEG ELPKLLSILTGWGKHSKV GD  LRRAVE LL  +GAPF +A  N GRF
Sbjct: 773  LNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGALRRAVEVLLTGMGAPFWVANCNLGRF 832

Query: 2841 TSTGPVVDAWLRESGTLELLILRD 2912
             STGP+V +WLRESGTL++L+L D
Sbjct: 833  ISTGPMVASWLRESGTLKVLVLHD 856


>ref|XP_002515260.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545740|gb|EEF47244.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 878

 Score =  922 bits (2383), Expect = 0.0
 Identities = 488/855 (57%), Positives = 609/855 (71%), Gaps = 23/855 (2%)
 Frame = +3

Query: 417  PHCPVPAISQXXXXXXXXXXXXELQEVHHHHPTRRVVS-----------SALRTTSGPAH 563
            PHC + A                 ++ HHH  T + VS           +A +  +  A 
Sbjct: 6    PHCSITATKPYQNHQYPQNHLKNHRQTHHHRWTNQKVSLTKPPLAPSPCNAPKAAAAAAA 65

Query: 564  LSTISSNXXXXXXXXXXXQ-----LGSDFCGKRPKRGISKNNLGRGNSMKCNQSLTAQAV 728
             +T               Q     L +DF G+R  R +SK + GR  +     +  A   
Sbjct: 66   ATTTHHTPNPTFHSLSPLQSQKSDLSADFSGRRSTRFVSKLHFGRPKTNMNRHTSVALEA 125

Query: 729  LSDVL-----NAPLEQPVDNFLKGKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREK 893
            L  V+     +  LE  + NF + +    DDY  L++ELG+RG+ +KAV CF++AV RE 
Sbjct: 126  LQQVIQYGKDDKALENVLLNF-ESRLCGPDDYTFLLRELGNRGDSAKAVRCFEFAVRRES 184

Query: 894  RSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEA 1073
              NEQGKL + MIS LG+LG+V+LA+ VFD A  +GYGK VYAFSAL+SAYGRSG   EA
Sbjct: 185  GKNEQGKLASAMISTLGRLGKVELAKAVFDTALKEGYGKTVYAFSALISAYGRSGYCNEA 244

Query: 1074 ISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIA 1253
            I VF+ MK +G  PN+VTYNAVIDACGKGGV+FK+ +  FD M+   V+PDRITFNSL+A
Sbjct: 245  IKVFDSMKSNGLMPNLVTYNAVIDACGKGGVEFKKVVEIFDGMLSNGVQPDRITFNSLLA 304

Query: 1254 VCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYP 1433
            VCSRGG W  A+ LF  M+ +GI+QDIFTYNTLLDA+CKGGQMDLA   + +M  +NI P
Sbjct: 305  VCSRGGLWEAARRLFSAMVDKGIDQDIFTYNTLLDAVCKGGQMDLAFEIMSEMPTKNILP 364

Query: 1434 NAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRIC 1613
            N +TYST+IDGYAK GRL++AL++++EMK  G+ LDRVSYNTLLSVYAK GRFE+AL +C
Sbjct: 365  NVVTYSTMIDGYAKVGRLDDALNMFNEMKFLGVGLDRVSYNTLLSVYAKLGRFEQALDVC 424

Query: 1614 REMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKG 1793
            +EME+AG +KD VTYNALL GYGKQ +YD+V+ +FEEM+  RV+PN+LTYSTLIDVYSKG
Sbjct: 425  KEMENAGIRKDVVTYNALLAGYGKQYRYDEVRRVFEEMKRGRVSPNLLTYSTLIDVYSKG 484

Query: 1794 GLQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTC 1973
            GL  EAM+V  EFKQ GLK DVVLYSA IDALCK+G V  ++ LLD+MT EGI PNVVT 
Sbjct: 485  GLYKEAMEVFREFKQAGLKADVVLYSALIDALCKNGLVESSVTLLDEMTKEGIRPNVVTY 544

Query: 1974 NSIIDSLGRSSKGNSRSYGQSAEFLADN--DELVRSLENLSCVSSSPARKNERSNNNANA 2147
            NSIID+ GRS+         SA+ + D+  +     +E+LS +    A +++ ++   N 
Sbjct: 545  NSIIDAFGRSA---------SAQCVVDDSGETTALQVESLSSIVVQEAIESQAADKEDNR 595

Query: 2148 LQIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSF 2327
            + I      A+    E      ++IL  + +F +M +L IKPNVVTFS ILNACSRC SF
Sbjct: 596  I-IEIFGKLAAEKACEAKNSGKQEILCILGVFQKMHELKIKPNVVTFSAILNACSRCDSF 654

Query: 2328 KEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTD 2507
            ++ASMLLEELR+FD++VYGVAHGLLMG+R+ VW +AQ+LFDEV  MDS+TASAFYNALTD
Sbjct: 655  EDASMLLEELRLFDNQVYGVAHGLLMGYRENVWLQAQSLFDEVKLMDSSTASAFYNALTD 714

Query: 2508 MLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFE 2687
            MLWHFGQ++GAQLVVL+G+ RQVWEN W +SCLDLHLMS+GAA+AMVHAWLL+IRSIVFE
Sbjct: 715  MLWHFGQKRGAQLVVLEGKRRQVWENIWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVFE 774

Query: 2688 GRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDA 2867
            G ELPKLLSILTGWGKHSKV GDS LRRAVE LL+ +GAPF LAK N GRF STG VV A
Sbjct: 775  GHELPKLLSILTGWGKHSKVVGDSALRRAVEALLIGMGAPFRLAKCNLGRFISTGSVVAA 834

Query: 2868 WLRESGTLELLILRD 2912
            WL+ESGTLE+L+L D
Sbjct: 835  WLKESGTLEVLVLHD 849


>ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citrus clementina]
            gi|557546795|gb|ESR57773.1| hypothetical protein
            CICLE_v10018807mg [Citrus clementina]
          Length = 877

 Score =  921 bits (2381), Expect = 0.0
 Identities = 492/864 (56%), Positives = 608/864 (70%), Gaps = 32/864 (3%)
 Frame = +3

Query: 417  PHCPVPAISQXXXXXXXXXXXXELQEVHHHHPTRRV-------------VSSALRTTSGP 557
            PHC + A                      HHP+ R              +S + R    P
Sbjct: 6    PHCSITATKPYQNHQYPHNHLKNNHHRQSHHPSSRPHWTSHKVSLTKPPLSPSPRNAPKP 65

Query: 558  AHLST-ISSNXXXXXXXXXXX----QLGSDFCGKRPKRGISKNNLGRGN-SMKCNQSLTA 719
            A  ST ++ N               +L  DF G+R  R +SK + GR   +M    S+ A
Sbjct: 66   AATSTTVAPNPKPFHSLSPLPSSKSELAPDFSGRRSTRFVSKMHFGRPKIAMSTRHSVVA 125

Query: 720  QAVLSDVLN-APLEQPVDNFLKGKRLVL---DDYVSLMKELGSRGECSKAVECFKWAVSR 887
            +  L  V   A  +  + + LK     L   DDY  L++ELG+RGE SKA++CF +AV R
Sbjct: 126  EEALHHVTAFARDDVSLGDILKKFEFKLCGADDYTFLLRELGNRGEWSKAIQCFAFAVKR 185

Query: 888  EKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWK 1067
            E+R N+QGKL + MISILG+LG+VDLA+ +F+ A  +GYG  VYAFSAL+SAYGRSG  +
Sbjct: 186  EERKNDQGKLASAMISILGRLGKVDLAKNIFETALNEGYGNTVYAFSALISAYGRSGYCQ 245

Query: 1068 EAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSL 1247
            EAISVF  MKR    PN+VTYNAVIDACGKGGVDFK  +  FD+M++  V+PDRITFNSL
Sbjct: 246  EAISVFNSMKRYHLKPNLVTYNAVIDACGKGGVDFKHVVEIFDDMLRNGVQPDRITFNSL 305

Query: 1248 IAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNI 1427
            +AVCSRGG W  A++LF +M+HRGI+QDIFTYNTLLDA+CKG QMDLA   + +M  +NI
Sbjct: 306  LAVCSRGGLWEAARNLFNEMVHRGIDQDIFTYNTLLDAICKGAQMDLAFEIMAEMPAKNI 365

Query: 1428 YPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALR 1607
             PN +TYST+IDGYAKAGRL++AL+++ EMK  GI LDRVSYNT+LS+YAK GRFEEAL 
Sbjct: 366  SPNVVTYSTMIDGYAKAGRLDDALNMFSEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALL 425

Query: 1608 ICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYS 1787
            +C+EME +G +KD VTYNALLGGYGKQGKYD+V+ +FE+M+   V+PN+LTYSTLIDVYS
Sbjct: 426  VCKEMESSGIRKDAVTYNALLGGYGKQGKYDEVRRMFEQMKADCVSPNLLTYSTLIDVYS 485

Query: 1788 KGGLQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVV 1967
            KGGL  EAMQ+  EFKQ GLK DVVLYSA IDALCK+G V  A+ LLD+MT EGI PNVV
Sbjct: 486  KGGLYKEAMQIFREFKQAGLKADVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVV 545

Query: 1968 TCNSIIDSLGRSSKGNSRSYGQSAEFLADNDE----LVRSLENLSCVSSSPARKNERSNN 2135
            T NSIID+ GRS+         + E   D+ E      +   NL  + S   +  + +  
Sbjct: 546  TYNSIIDAFGRSA---------TTECTVDDVERDLGKQKESANLDAMCSQDDKDVQEAGR 596

Query: 2136 NANAL-----QIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVIL 2300
              N +     Q+VA  A       ++  +  ++IL  + +F +M KL IKPNVVTFS IL
Sbjct: 597  TDNQIIKVFGQLVAEKAGQG----KKENRCRQEILCILGVFQKMHKLKIKPNVVTFSAIL 652

Query: 2301 NACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATA 2480
            NACSRC SF++ASMLLEELR+FD++VYGVAHGLLMG+RD +W +A +LFDEV  MDS+TA
Sbjct: 653  NACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYRDNIWVQALSLFDEVKLMDSSTA 712

Query: 2481 SAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWL 2660
            SAFYNALTDMLWHFGQ++GAQLVVL+G+ RQVWEN W ESCLDLHLMS+GAA+AMVHAWL
Sbjct: 713  SAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENVWSESCLDLHLMSSGAARAMVHAWL 772

Query: 2661 LSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRF 2840
            L+I SIVFEG ELPKLLSILTGWGKHSKV GD  LRRAVE LL  +GAPF +A  N GRF
Sbjct: 773  LNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGALRRAVEVLLTGMGAPFWVANCNLGRF 832

Query: 2841 TSTGPVVDAWLRESGTLELLILRD 2912
             STGP+V +WLRESGTL++L+L D
Sbjct: 833  ISTGPMVASWLRESGTLKVLVLHD 856


>ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutrema salsugineum]
            gi|557095737|gb|ESQ36319.1| hypothetical protein
            EUTSA_v10006755mg [Eutrema salsugineum]
          Length = 895

 Score =  914 bits (2363), Expect = 0.0
 Identities = 472/784 (60%), Positives = 580/784 (73%), Gaps = 20/784 (2%)
 Frame = +3

Query: 621  LGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLN-APLEQPVDNFL---KG 785
            L  DF G+R  R +SK + GR  + M    SL A+  L   +  +  ++ + N L   + 
Sbjct: 105  LSPDFAGRRSTRFVSKMHFGRPKTAMASRHSLVAEDALHHAIQFSGNDEGLQNLLLSFES 164

Query: 786  KRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 965
            K    DDY  +++ELG+RGE  KAV  +++AV RE+R NEQGKL + MIS LG+LG+V +
Sbjct: 165  KLCGSDDYTYILRELGNRGEFEKAVRFYEFAVKRERRKNEQGKLASAMISTLGRLGKVGI 224

Query: 966  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 1145
            A++VF+ A ADGYG  VYAFSA++SAYGRSG  ++AI VF  MK  G  PN+VTYNAVID
Sbjct: 225  AKRVFETALADGYGNTVYAFSAIISAYGRSGYHEDAIKVFSSMKGHGLRPNLVTYNAVID 284

Query: 1146 ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 1325
            ACGKGG++FKQ   +FDEM +  V+PDRITFNSL+AVCSRGG W  A++LF++ML+RGIE
Sbjct: 285  ACGKGGMEFKQVAEFFDEMQRNRVQPDRITFNSLLAVCSRGGSWEAARNLFDEMLNRGIE 344

Query: 1326 QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1505
            QDIFTYNTLLDA+CKGGQMDLA   L  M  +NI PN +TYSTVIDGYAKAGR N+AL L
Sbjct: 345  QDIFTYNTLLDAICKGGQMDLAFEILAQMPAKNIMPNVVTYSTVIDGYAKAGRFNDALTL 404

Query: 1506 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1685
            + EMK  GIPLDRVSYNTL+S+YAK GRFEEAL I +EM  AG +KD VTYNALLGGYGK
Sbjct: 405  FGEMKYLGIPLDRVSYNTLVSIYAKLGRFEEALDIVKEMAAAGIRKDAVTYNALLGGYGK 464

Query: 1686 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVL 1865
              KYD+VK++F EM+ +RV PN+LTYSTLIDVYSKGGL  EAM++  EFK  GL+ DVVL
Sbjct: 465  HEKYDEVKSVFAEMKQERVLPNLLTYSTLIDVYSKGGLYKEAMEIFREFKSVGLRADVVL 524

Query: 1866 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSK------------ 2009
            YSA IDALCK+G V  A+ LLD+MT EGI+PNVVT NS+ID+ GRS+             
Sbjct: 525  YSALIDALCKNGLVESAVSLLDEMTKEGISPNVVTYNSMIDAFGRSATTECLADINEGGA 584

Query: 2010 ---GNSRSYGQSAEFLADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFAS 2180
                   S+  S+  L+  D L  ++     +S     ++ R       L     N    
Sbjct: 585  NGLEEDESFSSSSASLSHTDSLSLAVGEADSLSKLTKTEDHRIVEIFGQLVTEGNN---- 640

Query: 2181 LSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELR 2360
              I  + KQ  +++   +++ H+M +L IKPNVVTFS ILNACSRC SF+EASMLLEELR
Sbjct: 641  -QIKRDCKQGVQELSCILEVCHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELR 699

Query: 2361 VFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGA 2540
            +FD++VYGVAHGLLMG+ + VW +AQ+LFDEV  MD +TASAFYNALTDMLWHFGQ++GA
Sbjct: 700  LFDNKVYGVAHGLLMGYNENVWIQAQSLFDEVKAMDGSTASAFYNALTDMLWHFGQKRGA 759

Query: 2541 QLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSIL 2720
            Q VVL+GR R+VWEN W +SCLDLHLMS+GAA+AMVHAWLL+IRSIV+EG ELPKLLSIL
Sbjct: 760  QSVVLEGRRRKVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKLLSIL 819

Query: 2721 TGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELL 2900
            TGWGKHSKV GD TLRRAVE LL  +GAPFH+AK N GRF S+G VV AWLRESGTL++L
Sbjct: 820  TGWGKHSKVMGDGTLRRAVEALLRGMGAPFHVAKCNVGRFVSSGSVVAAWLRESGTLKVL 879

Query: 2901 ILRD 2912
            +L D
Sbjct: 880  VLED 883


>ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345388|gb|ERP64510.1| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 873

 Score =  909 bits (2348), Expect = 0.0
 Identities = 467/773 (60%), Positives = 588/773 (76%), Gaps = 8/773 (1%)
 Frame = +3

Query: 618  QLGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLN-----APLEQPVDNFL 779
            +L SDF G+R  R +SK + GR  + M    +  AQ  L +V+        LE  + NF 
Sbjct: 91   ELVSDFPGRRSTRFVSKLHFGRPRTTMGTRHTSVAQEALQNVIEYGKDERALENVLLNF- 149

Query: 780  KGKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRV 959
            + +    DDYV L++ELG+RG+C KA+ CF++AV RE++ NEQGKL + MIS LG+LG+V
Sbjct: 150  ESRLSGSDDYVFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMISTLGRLGKV 209

Query: 960  DLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAV 1139
            ++A+ VF  A  +GYG  VYAFSA++SAYGRSG   EAI +F  MK  G  PN+VTYNAV
Sbjct: 210  EMAKTVFKAALTEGYGNTVYAFSAIISAYGRSGYCNEAIKIFYSMKDYGLKPNLVTYNAV 269

Query: 1140 IDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRG 1319
            IDACGKGGV+FK+ +  FDEM++  ++PDRITFNSL+AVCS+GG W  A+ L  +M++RG
Sbjct: 270  IDACGKGGVEFKRVLEIFDEMLRNGMQPDRITFNSLLAVCSKGGLWEAARSLSCEMVNRG 329

Query: 1320 IEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEAL 1499
            I+QDIFTYNTLLDA+CKGGQ+D+A   + +M  +NI PN +TYST+IDGYAKAGRL++A 
Sbjct: 330  IDQDIFTYNTLLDAVCKGGQLDMAFEIMSEMPAKNILPNVVTYSTMIDGYAKAGRLDDAR 389

Query: 1500 DLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGY 1679
            +L++EMK  GI LDRVSYNTLLS+YAK GRFEEA+ +CREME++G +KD VTYNALLGGY
Sbjct: 390  NLFNEMKFLGISLDRVSYNTLLSIYAKLGRFEEAMDVCREMENSGIRKDVVTYNALLGGY 449

Query: 1680 GKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDV 1859
            GKQ KYD V+ +FEEM+ + V+PN+LTYSTLIDVYSKGGL  EAM V  EFK+ GLK DV
Sbjct: 450  GKQYKYDVVRKVFEEMKARHVSPNLLTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADV 509

Query: 1860 VLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNS--RSYGQ 2033
            VLYSA IDALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GR +   S     GQ
Sbjct: 510  VLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPATTESVVDDAGQ 569

Query: 2034 SAEFLADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKS 2213
            ++E           +++LS  +   A K+  ++   N + I      A+    +      
Sbjct: 570  TSEL---------QIDSLSSSAVEKATKSLVADREDNRI-IKIFGQLAAEKAGQAKNSGG 619

Query: 2214 EDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAH 2393
            ++++  + +FH+M +L IKPNVVTFS ILNACSRC SF+EASMLLEELR+FD++VYGVAH
Sbjct: 620  QEMMCILGVFHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELRLFDNQVYGVAH 679

Query: 2394 GLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQ 2573
            GLLMG+R+ VW +AQ+LFDEV  MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ RQ
Sbjct: 680  GLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQ 739

Query: 2574 VWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTG 2753
            VWEN W ESCLDLHLMS+GAA+AMVHAWLL++R+IVFEG E+PKLLSILTGWGKHSKV G
Sbjct: 740  VWENVWSESCLDLHLMSSGAARAMVHAWLLNVRAIVFEGHEVPKLLSILTGWGKHSKVVG 799

Query: 2754 DSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
            DSTLRRAVE LL+ +GAPF  AK N GR  STG VV +WLRESGTL++L+L D
Sbjct: 800  DSTLRRAVEALLMGMGAPFRSAKCNLGRLISTGSVVASWLRESGTLKVLVLHD 852


>gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]
          Length = 871

 Score =  896 bits (2315), Expect = 0.0
 Identities = 458/774 (59%), Positives = 584/774 (75%), Gaps = 10/774 (1%)
 Frame = +3

Query: 621  LGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLNAPLEQ-PVDNFL---KG 785
            L + F G+R  R +SK +LGR  + +    +  A+ VL   +    +   +DN L   + 
Sbjct: 89   LAAVFSGRRSTRFVSKMHLGRPKTTVGSRHTAVAEEVLQQAIQFGKDDLGIDNVLLSFEP 148

Query: 786  KRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 965
            K    DDY  L++ELG+RGEC KA+ CF++AV+RE+R  EQGKL + MIS LG+LG+V+L
Sbjct: 149  KLCGSDDYTFLLRELGNRGECRKAIRCFEFAVARERRKTEQGKLTSAMISTLGRLGKVEL 208

Query: 966  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 1145
            A+ VF+ A   GYG  VY +SAL+SAYGRSG W+EA  V E MK SG  PN+VTYNAVID
Sbjct: 209  ARDVFETALFAGYGNTVYTYSALISAYGRSGYWEEARRVVESMKDSGLKPNLVTYNAVID 268

Query: 1146 ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 1325
            ACGKGG +FK+ +  FDEM++  V+PDRIT+NSL+AVCSRGG W  A+ LF +M+ R I+
Sbjct: 269  ACGKGGAEFKRVVEIFDEMLRNGVQPDRITYNSLLAVCSRGGLWEAARSLFSEMVERQID 328

Query: 1326 QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1505
            QDI+TYNTLLDA+CKGGQMDLA   + +M  + I PN +TYST+IDGYAKAGRL +AL+L
Sbjct: 329  QDIYTYNTLLDAICKGGQMDLARQIMSEMPSKKILPNVVTYSTMIDGYAKAGRLEDALNL 388

Query: 1506 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1685
            ++EMK   I LDRV YNTLLS+YAK GRFEEAL++C+EME +G  +D V+YNALLGGYGK
Sbjct: 389  FNEMKYLAIGLDRVLYNTLLSIYAKLGRFEEALKVCKEMESSGIVRDVVSYNALLGGYGK 448

Query: 1686 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVL 1865
            QGKYD+VK ++++M+   V+PN+LTYSTLIDVYSKGGL  EAM+V  EFKQ GLK DVVL
Sbjct: 449  QGKYDEVKRMYQDMKADHVSPNLLTYSTLIDVYSKGGLYREAMEVFREFKQAGLKADVVL 508

Query: 1866 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEF 2045
            YS  I+ALCK+G V  A+ LLD+MT EGI PNV+T NSIID+ GR +  +S     +   
Sbjct: 509  YSELINALCKNGMVESAVSLLDEMTKEGIMPNVITYNSIIDAFGRPATADS-----ALGA 563

Query: 2046 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEE-----IKQK 2210
                +EL   L   S +S+  A KN+  N   +  QI+    F  L+  +E      K+ 
Sbjct: 564  AIGGNELETELS--SSISNENANKNKAVNKGDH--QII--KMFGQLAAEQEGHTKKDKKI 617

Query: 2211 SEDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVA 2390
             ++IL  + +F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGVA
Sbjct: 618  RQEILCILGVFQKMHELNIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVA 677

Query: 2391 HGLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSR 2570
            HGLLMG R+ VW +AQ+LFDEV +MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ R
Sbjct: 678  HGLLMGHRENVWLEAQSLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRR 737

Query: 2571 QVWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVT 2750
             VWE+ W  S LDLHLMS+GAA+A++HAWLL+IRS+VFEG+ELP+LLSILTGWGKHSKV 
Sbjct: 738  NVWESVWSNSFLDLHLMSSGAARALLHAWLLNIRSVVFEGQELPRLLSILTGWGKHSKVV 797

Query: 2751 GDSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
            GDS LRRA+E+LL+++GAPF  AK N GRFTS GP+V  WL+ESGTL++L+L D
Sbjct: 798  GDSALRRAIESLLISMGAPFEAAKCNLGRFTSPGPMVAGWLKESGTLKVLVLHD 851


>ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score =  896 bits (2315), Expect = 0.0
 Identities = 468/802 (58%), Positives = 593/802 (73%), Gaps = 8/802 (0%)
 Frame = +3

Query: 531  SALRTTSGPAHLSTISSNXXXXXXXXXXXQLGSDFCGKRPKRGISKNNLGRG-NSMKCNQ 707
            SA ++TS P  LS   +            +L S+F G+R  R +SK + GR  +SM    
Sbjct: 58   SATKSTSTP--LSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMTTRH 115

Query: 708  SLTAQAVLSDVL-----NAPLEQPVDNFLKGKRLVLDDYVSLMKELGSRGECSKAVECFK 872
            S  A+ VL  VL     +A L+  + NF + K    +DY  L++ELG+RGEC KA+ CF 
Sbjct: 116  SAIAEEVLHQVLQFGKDDASLDNILLNF-ESKLCGSEDYTFLLRELGNRGECWKAIRCFD 174

Query: 873  WAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGR 1052
            +A+ RE R NE+GKL + MIS LG+LG+V+LA+ VF+ A ++GYG  V+AFSAL+SAYG+
Sbjct: 175  FALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGK 234

Query: 1053 SGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRI 1232
            SG + EAI VFE MK SG  PN+VTYNAVIDACGKGGV+FK+ +  F+EM++  V+PDRI
Sbjct: 235  SGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQPDRI 294

Query: 1233 TFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDM 1412
            T+NSL+AVCSRGG W  A++LF +M+ RGI+QD+FTYNTLLDA+CKGGQMDLA   + +M
Sbjct: 295  TYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIMLEM 354

Query: 1413 SERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRF 1592
              + I PN +TYST+ DGYAKAGRL +AL+LY+EMK  GI LDRVSYNTLLS+YAK GRF
Sbjct: 355  PGKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRF 414

Query: 1593 EEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTL 1772
            E+AL++C+EM  +G KKD VTYNALL GYGKQGK+++V  +F+EM+  RV PN+LTYSTL
Sbjct: 415  EDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTYSTL 474

Query: 1773 IDVYSKGGLQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGI 1952
            IDVYSKG L  EAM+V  EFKQ GLK DVVLYS  I+ALCK+G V  A+ LLD+MT EGI
Sbjct: 475  IDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTKEGI 534

Query: 1953 TPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLAD--NDELVRSLENLSCVSSSPARKNER 2126
             PNVVT NSIID+ GRS+         +AEFL D       R  E+ S +      ++E 
Sbjct: 535  RPNVVTYNSIIDAFGRST---------TAEFLVDGVGASNERQSESPSFMLIEGVDESEI 585

Query: 2127 SNNNANALQIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVILNA 2306
            + ++ +  +   +         ++ +   E+I   + +F +M +L IKPNVVTFS ILNA
Sbjct: 586  NWDDGHVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKKMHELEIKPNVVTFSAILNA 645

Query: 2307 CSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATASA 2486
            CSRC S ++ASMLLEELR+FD++VYGVAHGLLMGF + VW +AQ LFDEV +MDS+TASA
Sbjct: 646  CSRCKSIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASA 705

Query: 2487 FYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWLLS 2666
            FYNALTDMLWHFGQ++GAQLVVL+G+ R+VWE  W +SCLDLHLMS+GAA+AMVHAWLL 
Sbjct: 706  FYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLG 765

Query: 2667 IRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTS 2846
            I S+VFEG +LPKLLSILTGWGKHSKV GD  LRRA+E LL ++GAPF +AK N GR+ S
Sbjct: 766  IHSVVFEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALLTSMGAPFRVAKCNIGRYVS 825

Query: 2847 TGPVVDAWLRESGTLELLILRD 2912
            TG VV AWL+ESGTL+LL+L D
Sbjct: 826  TGSVVAAWLKESGTLKLLVLHD 847


>ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score =  895 bits (2312), Expect = 0.0
 Identities = 467/802 (58%), Positives = 593/802 (73%), Gaps = 8/802 (0%)
 Frame = +3

Query: 531  SALRTTSGPAHLSTISSNXXXXXXXXXXXQLGSDFCGKRPKRGISKNNLGRG-NSMKCNQ 707
            SA ++TS P  LS   +            +L S+F G+R  R +SK + GR  +SM    
Sbjct: 58   SATKSTSTP--LSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMTTRH 115

Query: 708  SLTAQAVLSDVL-----NAPLEQPVDNFLKGKRLVLDDYVSLMKELGSRGECSKAVECFK 872
            S  A+ VL  VL     +A L+  + NF + K    +DY  L++ELG+RGEC KA+ CF 
Sbjct: 116  SAIAEEVLHQVLQFGKDDASLDNILLNF-ESKLCGSEDYTFLLRELGNRGECWKAIRCFD 174

Query: 873  WAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGR 1052
            +A+ RE R NE+GKL + MIS LG+LG+V+LA+ VF+ A ++GYG  V+AFSAL+SAYG+
Sbjct: 175  FALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGK 234

Query: 1053 SGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRI 1232
            SG + EAI VFE MK SG  PN+VTYNAVIDACGKGGV+FK+ +  F+EM++  V+PDRI
Sbjct: 235  SGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQPDRI 294

Query: 1233 TFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDM 1412
            T+NSL+AVCSRGG W  A++LF +M+ RGI+QD+FTYNTLLDA+CKGGQMDLA   + +M
Sbjct: 295  TYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIMLEM 354

Query: 1413 SERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRF 1592
              + I PN +TYST+ DGYAKAGRL +AL+LY+EMK  GI LDRVSYNTLLS+YAK GRF
Sbjct: 355  PGKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRF 414

Query: 1593 EEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTL 1772
            E+AL++C+EM  +G KKD VTYNALL GYGKQGK+++V  +F+EM+  RV PN+LTYSTL
Sbjct: 415  EDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTYSTL 474

Query: 1773 IDVYSKGGLQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGI 1952
            IDVYSKG L  EAM+V  EFKQ GLK DVVLYS  I+ALCK+G V  A+ LLD+MT EGI
Sbjct: 475  IDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTKEGI 534

Query: 1953 TPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLAD--NDELVRSLENLSCVSSSPARKNER 2126
             PNVVT NSIID+ GRS+         +AEFL D       R  E+ + +      ++E 
Sbjct: 535  RPNVVTYNSIIDAFGRST---------TAEFLVDGVGASNERQSESPTFMLIEGVDESEI 585

Query: 2127 SNNNANALQIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVILNA 2306
            + ++ +  +   +         ++ +   E+I   + +F +M +L IKPNVVTFS ILNA
Sbjct: 586  NWDDGHVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKKMHELEIKPNVVTFSAILNA 645

Query: 2307 CSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSATASA 2486
            CSRC S ++ASMLLEELR+FD++VYGVAHGLLMGF + VW +AQ LFDEV +MDS+TASA
Sbjct: 646  CSRCKSIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASA 705

Query: 2487 FYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAWLLS 2666
            FYNALTDMLWHFGQ++GAQLVVL+G+ R+VWE  W +SCLDLHLMS+GAA+AMVHAWLL 
Sbjct: 706  FYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLG 765

Query: 2667 IRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGRFTS 2846
            I S+VFEG +LPKLLSILTGWGKHSKV GD  LRRA+E LL ++GAPF +AK N GR+ S
Sbjct: 766  IHSVVFEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALLTSMGAPFRVAKCNIGRYVS 825

Query: 2847 TGPVVDAWLRESGTLELLILRD 2912
            TG VV AWL+ESGTL+LL+L D
Sbjct: 826  TGSVVAAWLKESGTLKLLVLHD 847


>ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Capsella rubella]
            gi|482562350|gb|EOA26540.1| hypothetical protein
            CARUB_v10022597mg [Capsella rubella]
          Length = 932

 Score =  891 bits (2303), Expect = 0.0
 Identities = 462/770 (60%), Positives = 576/770 (74%), Gaps = 6/770 (0%)
 Frame = +3

Query: 621  LGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLNAPLEQPVDNFL----KG 785
            L SDF G+R  R +SK + GR  + M    S  A+  L + ++   +  + + L    + 
Sbjct: 141  LSSDFSGRRSTRFVSKMHFGRPKTAMATRHSSAAEDALQNAIDFSGDSEMFHSLMLSFES 200

Query: 786  KRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 965
            K    DD   +++ELG+RGEC KAV  +++AV RE+R NEQGKL + MIS LG+ G+V +
Sbjct: 201  KLCGSDDCTYIIRELGNRGECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKVTI 260

Query: 966  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 1145
            A+++F+ A A GYG  VYAFSAL+SAYGRSGL +EAISVF  MK  G  PN+VTYNAVID
Sbjct: 261  AKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFSSMKDHGLRPNLVTYNAVID 320

Query: 1146 ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 1325
            ACGKGG++FKQ   +FDEM K  V+PDRITFNSL+AVCSRGG W  A++LF++M +R IE
Sbjct: 321  ACGKGGMEFKQVAKFFDEMQKNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMSNRRIE 380

Query: 1326 QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1505
            QD+F+YNTLLDA+CKGGQMDLA   L  M  + I PN ++YSTVIDG+AKAGR +EAL+L
Sbjct: 381  QDVFSYNTLLDAICKGGQMDLAFEILAQMPAKRIMPNVVSYSTVIDGFAKAGRFDEALNL 440

Query: 1506 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1685
            + EM+  GI LDRVSYNTLLS+Y K GR EEAL I REM   G KKD VTYNALLGGYGK
Sbjct: 441  FGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGK 500

Query: 1686 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVL 1865
            QGKYD+VK +F EM+ + V PN+LTYSTLID YSKGGL  EAM++  EFK  GL+ DVVL
Sbjct: 501  QGKYDEVKKVFAEMKREHVVPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADVVL 560

Query: 1866 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEF 2045
            YSA IDALCK+G VG A+ L+D+MT EGI+PNVVT NSIID+ GRS+     +  +SA++
Sbjct: 561  YSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSA-----TMERSADY 615

Query: 2046 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVAR-NAFASLSIPEEIKQKSEDI 2222
               N E   +LE  S   SS A            +Q+  +  A ++  + ++ K+  +++
Sbjct: 616  --SNGE-ANNLEVGSLALSSSALSKLTETEGNRVIQLFGQLTAESNNRMTKDCKEGMQEL 672

Query: 2223 LGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLL 2402
               +++F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGV HGLL
Sbjct: 673  SCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLL 732

Query: 2403 MGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWE 2582
            MG R+ VW +AQ+LFD+V  MD +TASAFYNALTDMLWHFGQ++GA+LV L+GRSRQVWE
Sbjct: 733  MGERENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWE 792

Query: 2583 NAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDST 2762
            N W +SCLDLHLMS+GAA+AMVHAWLL+IRSIV+EG ELPK+LSILTGWGKHSKV GD  
Sbjct: 793  NVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGA 852

Query: 2763 LRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
            LRRAVE LL  + APFHL+K N GRF S+G VV  WLRES TL+LLIL D
Sbjct: 853  LRRAVEVLLRGMDAPFHLSKCNMGRFISSGSVVATWLRESATLKLLILHD 902


>gb|EMJ22752.1| hypothetical protein PRUPE_ppa001263mg [Prunus persica]
          Length = 868

 Score =  889 bits (2298), Expect = 0.0
 Identities = 465/805 (57%), Positives = 586/805 (72%), Gaps = 10/805 (1%)
 Frame = +3

Query: 528  SSALRTTSGPAHLSTISSNXXXXXXXXXXXQLGSDFCGKRPKRGISKNNLGRGNSM--KC 701
            S A RT +     +  SS             L + F G+R  R +SK +LGR  +     
Sbjct: 54   SQAPRTAAKTPTATPTSSFSSLCPLPHPKSDLVTAFSGRRSTRFVSKMHLGRPKTTMGSY 113

Query: 702  NQSLTAQAVLSDVLNAPLEQPVDNFLKGKRLVL---DDYVSLMKELGSRGECSKAVECFK 872
               L  +A+   V     +  +D+ L      L   DDY  L +ELG+RGEC KA+ CF+
Sbjct: 114  RSPLAEEALHQAVQFGNDDLALDDILLSFHSRLCGSDDYTFLFRELGNRGECWKAIRCFE 173

Query: 873  WAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKADGYGKNVYAFSALVSAYGR 1052
            +AV REKR  EQGKL + MIS LG+LG+V+LA+ VF  A  +GYGK VY +SAL++AYGR
Sbjct: 174  FAVRREKRRTEQGKLASSMISTLGRLGKVELAKNVFQTAVNEGYGKTVYTYSALITAYGR 233

Query: 1053 SGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQAMGYFDEMIKRHVKPDRI 1232
            +G  +EAI VFE MK SG  PN+VTYNAVIDA GKGGV+FK+ +  F+EM++   +PDRI
Sbjct: 234  NGYCEEAIRVFESMKDSGLKPNLVTYNAVIDAYGKGGVEFKRVVEIFNEMLRNGEQPDRI 293

Query: 1233 TFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLDALCKGGQMDLAASALCDM 1412
            T+NSL+AVCSRGG W  A++LF +M+ RGI+QDI+TYNTL+DA+CKGGQMDLA   + +M
Sbjct: 294  TYNSLLAVCSRGGLWEMARNLFSEMVDRGIDQDIYTYNTLIDAICKGGQMDLAYQIMSEM 353

Query: 1413 SERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPLDRVSYNTLLSVYAKFGRF 1592
              +NI PN +TYST+IDGYAKAGRL +AL L++EMK   I LDRV YNTLLS+Y K GRF
Sbjct: 354  PSKNILPNVVTYSTIIDGYAKAGRLEDALSLFNEMKFLAIGLDRVLYNTLLSLYGKLGRF 413

Query: 1593 EEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALFEEMRWQRVTPNVLTYSTL 1772
            E+AL++C+EME  G  KD V+YNALLGGYGKQGKYD  K ++ +M+ +RV+PN+LTYSTL
Sbjct: 414  EDALKVCKEMESVGIAKDVVSYNALLGGYGKQGKYDDAKRMYNQMKEERVSPNILTYSTL 473

Query: 1773 IDVYSKGGLQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKSGSVGEAIHLLDKMTAEGI 1952
            IDVYSKGGL  EAM+V  EFKQ GLK DVVLYS  ++ALCK+G V  A+ LLD+MT EGI
Sbjct: 474  IDVYSKGGLYMEAMKVFREFKQAGLKADVVLYSELVNALCKNGLVESAVLLLDEMTKEGI 533

Query: 1953 TPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLAD--NDELVRSLENLSCVSSSPA---RK 2117
             PNVVT NSIID+ GRS+         + E  AD     +V   E+ S VS   A   + 
Sbjct: 534  RPNVVTYNSIIDAFGRSA---------TTECAADAAGGGIVLQTESSSSVSEGDAIGIQV 584

Query: 2118 NERSNNNANALQIVARNAFASLSIPEEIKQKSEDILGAVDIFHRMKKLGIKPNVVTFSVI 2297
             +R +N    +++  + A       +  ++  ++IL  + IF +M +L IKPNVVTFS I
Sbjct: 585  GDRGDN--RFMKMFGQLAAEKAGYAKTDRKVRQEILCILGIFQKMHELDIKPNVVTFSAI 642

Query: 2298 LNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKAQTLFDEVTRMDSAT 2477
            LNACSRC SF++ASMLLEELR+FD++VYGVAHGLLMG+RD VW KA++LFDEV +MDS+T
Sbjct: 643  LNACSRCNSFEDASMLLEELRLFDNKVYGVAHGLLMGYRDNVWVKAESLFDEVKQMDSST 702

Query: 2478 ASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLHLMSAGAAQAMVHAW 2657
            ASAFYNALTDMLWH+GQ++GAQLVVL+G+ R VWE+ W  SCLDLHLMS+GAA+AMVHAW
Sbjct: 703  ASAFYNALTDMLWHYGQKQGAQLVVLEGKRRNVWESVWSNSCLDLHLMSSGAARAMVHAW 762

Query: 2658 LLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLTLGAPFHLAKFNEGR 2837
            LL+IRSIVFEG++LP LLSILTGWGKHSKV GDSTLRRA+E LL ++GAPF +AK N GR
Sbjct: 763  LLNIRSIVFEGQQLPNLLSILTGWGKHSKVVGDSTLRRAIEALLTSMGAPFRVAKCNLGR 822

Query: 2838 FTSTGPVVDAWLRESGTLELLILRD 2912
            F STG +  AWLRESGTLE+L+L D
Sbjct: 823  FISTGSMAAAWLRESGTLEVLVLHD 847


>ref|XP_006410275.1| hypothetical protein EUTSA_v10016219mg [Eutrema salsugineum]
            gi|557111444|gb|ESQ51728.1| hypothetical protein
            EUTSA_v10016219mg [Eutrema salsugineum]
          Length = 885

 Score =  889 bits (2297), Expect = 0.0
 Identities = 470/832 (56%), Positives = 586/832 (70%), Gaps = 18/832 (2%)
 Frame = +3

Query: 489  QEVHHHHP--------TRRVVSSALRTTSGPAHLSTISSNXXXXXXXXXXXQL----GSD 632
            Q  H+H P          R V+SA  ++S    ++T++S            Q      SD
Sbjct: 44   QPTHNHRPWLPQRITSCPRAVTSAPPSSSAAVSVATVASAQLSKTPTLSPLQTPKSDSSD 103

Query: 633  FCGKRPKRGISKNNLGRGNSMKCNQ-SLTAQAVLSDVLNAPLEQPVDNFL----KGKRLV 797
            F G+R  R +SK +LGR  +    + S  A+  L   ++   E  +   L    + K   
Sbjct: 104  FSGRRSTRFVSKMHLGRPKTTTATRRSSAAEDALRSAIDLSGEDEMFQSLLLSFESKLRG 163

Query: 798  LDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQV 977
             +DY  +++ELG+RGEC KAV  +++AV RE+R  EQGKL + MIS LG+LG+V +A+ V
Sbjct: 164  SEDYTFILRELGNRGECDKAVRFYEFAVIRERRRVEQGKLASAMISTLGRLGKVAIAKSV 223

Query: 978  FDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGK 1157
            F+ A   GYG  VY FSA++SAYGRSG ++EAI VF+ MK  G  PN++TYNAVIDACGK
Sbjct: 224  FEAALDGGYGNTVYTFSAVISAYGRSGFYEEAIGVFDSMKSYGLKPNLITYNAVIDACGK 283

Query: 1158 GGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIF 1337
            GG++FKQ  G+FDEM +  V+PDRITFNSL+AVCSRGG W  A++LF++ML RGIEQD+F
Sbjct: 284  GGMEFKQVAGFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMLKRGIEQDVF 343

Query: 1338 TYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEM 1517
            TYNTLLDA+CKGG+MDLA   L  M  + I PN ++YSTVIDG+AKAGR +EAL+L+ +M
Sbjct: 344  TYNTLLDAICKGGKMDLAFEILVQMPAKRILPNVVSYSTVIDGFAKAGRFDEALNLFDQM 403

Query: 1518 KNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQGKY 1697
            K  GI LDRVSYNTLLS+Y   GR +EAL I REM   G KKD VTYNALLGGYGKQ KY
Sbjct: 404  KYLGIALDRVSYNTLLSIYTTLGRSKEALDILREMASVGIKKDVVTYNALLGGYGKQRKY 463

Query: 1698 DQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVLYSAY 1877
            D+VK +F EM+   V PN+LTYSTLIDVYSKGGL  EAM++  EFK  GL+ DVVLYSA 
Sbjct: 464  DEVKNVFAEMKRDHVLPNLLTYSTLIDVYSKGGLYKEAMEIFREFKSVGLRADVVLYSAL 523

Query: 1878 IDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLADN 2057
            IDALCK+G V  A+ L+ +MT EGI PNVVT NSIID+ GRS+   S   G       D 
Sbjct: 524  IDALCKNGLVSSAVSLIGEMTKEGIRPNVVTYNSIIDAFGRSATMKSAESG-------DG 576

Query: 2058 DELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLS-IPEEIKQKSEDILGAV 2234
                  + + +  SSS +   E  +N    +QI  +    S + +  + K+   ++   +
Sbjct: 577  GASTFEVGSSNIPSSSLSGLTETEDN--QIIQIFGQLTIESFNRMKNDCKEGMHELSCIL 634

Query: 2235 DIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFR 2414
            ++  +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD+RVYGV HGLLMG R
Sbjct: 635  EVIRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNRVYGVVHGLLMGHR 694

Query: 2415 DRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWC 2594
            + VW +AQ+LFD+V  MD +TASAFYNALTDMLWHFGQ++GAQ+V L+GRSRQVWEN W 
Sbjct: 695  ENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAQMVALEGRSRQVWENVWS 754

Query: 2595 ESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRA 2774
            ESCLDLHLMS+GAA+AMVHAWLL+IRSIV+EG ELPKLLSILTGWGKHSKV GD  LR A
Sbjct: 755  ESCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKLLSILTGWGKHSKVVGDGALRPA 814

Query: 2775 VETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRDKIVGKA 2930
            +E LL  + APFHL+K N GRFTS+G VV  WLRES TL+LLIL D I  KA
Sbjct: 815  IEALLRGMNAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHDHITTKA 866


>ref|XP_002881173.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297327012|gb|EFH57432.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 917

 Score =  884 bits (2284), Expect = 0.0
 Identities = 465/821 (56%), Positives = 591/821 (71%), Gaps = 11/821 (1%)
 Frame = +3

Query: 489  QEVHHHHPTRRVVSSALRTTSGPAHLSTI-----SSNXXXXXXXXXXXQLGSDFCGKRPK 653
            Q  +++H    V SS   +   P+ ++T+     S              L SDF G+R  
Sbjct: 83   QNPNYNHRPYGVSSSPRGSAPPPSSVATVAPAQLSQTPNFSPLQTPKSDLSSDFSGRRST 142

Query: 654  RGISKNNLGRGNS-MKCNQSLTAQAVLSDVLNAPLEQPVDNFL----KGKRLVLDDYVSL 818
            R +SK + GR  + M    S  A+  L + ++   +  + + L    + K    DD   +
Sbjct: 143  RFVSKMHFGRPKTTMATRHSSAAEDALQNAIDFSGDDEMFHSLMLSFESKLCGSDDCTYI 202

Query: 819  MKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLAQQVFDRAKAD 998
            ++ELG+RGEC KAV  +++AV RE+R NEQGKL + MIS LG+ G+V +A+++F+ A + 
Sbjct: 203  IRELGNRGECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKVTIAKRIFETAFSG 262

Query: 999  GYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDACGKGGVDFKQ 1178
            GYG  VYAFSAL+SAYGRSGL +EAISVF  MK  G  PN+VTYNAVIDACGKGG++FKQ
Sbjct: 263  GYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAVIDACGKGGMEFKQ 322

Query: 1179 AMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQDIFTYNTLLD 1358
               +FDEM +  V+PDRITFNSL+AVCSRGG W  A++LF++M +R IEQD+F+YNTLLD
Sbjct: 323  VAKFFDEMQRNCVQPDRITFNSLLAVCSRGGLWEAARNLFDEMSNRRIEQDVFSYNTLLD 382

Query: 1359 ALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLYHEMKNAGIPL 1538
            A+CKGGQMDLA   L  M  + I PN ++YSTVIDG+AKAGR +EAL+L+ EM+   I L
Sbjct: 383  AICKGGQMDLAFEILAQMPAKRIMPNVVSYSTVIDGFAKAGRFDEALNLFGEMRYLNIAL 442

Query: 1539 DRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQGKYDQVKALF 1718
            DRVSYNTLLS+Y K GR EEAL I REM   G KKD VTYNALLGGYGKQGKYD+VK +F
Sbjct: 443  DRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGKQGKYDEVKKVF 502

Query: 1719 EEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVLYSAYIDALCKS 1898
             EM+ + V PN+LTYSTLID YSKGGL  EAM+V  EFK  GL+ DVVLYSA IDALCK+
Sbjct: 503  AEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEVFREFKSAGLRADVVLYSALIDALCKN 562

Query: 1899 GSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEFLADNDELVRSL 2078
            G VG A+ L+D+MT EGI+PNVVT NSIID+ GRS+     +  +SA++           
Sbjct: 563  GLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSA-----TMERSADYSNGG------- 610

Query: 2079 ENLSCVSSSPARKNERSNNNANALQIVAR-NAFASLSIPEEIKQKSEDILGAVDIFHRMK 2255
             +L   SS+ +   E   N    +Q+  +  +  +  + ++ K+  +++   +++F +M 
Sbjct: 611  -SLPFSSSALSELTETEGN--RVIQLFGQLTSEGNNRMTKDCKEGMQELSCILEVFRKMH 667

Query: 2256 KLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLMGFRDRVWSKA 2435
            +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGV HGLLMG R+ VW +A
Sbjct: 668  QLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMGQRENVWLQA 727

Query: 2436 QTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWENAWCESCLDLH 2615
            Q+LFD+V  MD +TASAFYNALTDMLWHFGQ++GA+LV L+GRSRQVWEN W +SCLDLH
Sbjct: 728  QSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVWSDSCLDLH 787

Query: 2616 LMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTLRRAVETLLLT 2795
            LMS+GAA+AMVHAWLL+IRSIV+EG ELPK+LSILTGWGKHSKV GD  L+RAVE LL  
Sbjct: 788  LMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGALKRAVEVLLRG 847

Query: 2796 LGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRDKI 2918
            + APFHL+K N GRFTS+G VV  WLRES TL+LLIL D I
Sbjct: 848  MDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHDHI 888


>ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidopsis thaliana]
            gi|75206083|sp|Q9SIC9.1|PP178_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g31400, chloroplastic; Flags: Precursor
            gi|4589961|gb|AAD26479.1| unknown protein [Arabidopsis
            thaliana] gi|330253448|gb|AEC08542.1| genomes uncoupled 1
            protein [Arabidopsis thaliana]
          Length = 918

 Score =  884 bits (2283), Expect = 0.0
 Identities = 457/772 (59%), Positives = 576/772 (74%), Gaps = 6/772 (0%)
 Frame = +3

Query: 621  LGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLNAPLEQPVDNFL----KG 785
            L SDF G+R  R +SK + GR  + M    S  A+  L + ++   +  + + L    + 
Sbjct: 132  LSSDFSGRRSTRFVSKMHFGRQKTTMATRHSSAAEDALQNAIDFSGDDEMFHSLMLSFES 191

Query: 786  KRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 965
            K    DD   +++ELG+R EC KAV  +++AV RE+R NEQGKL + MIS LG+ G+V +
Sbjct: 192  KLCGSDDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKVTI 251

Query: 966  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 1145
            A+++F+ A A GYG  VYAFSAL+SAYGRSGL +EAISVF  MK  G  PN+VTYNAVID
Sbjct: 252  AKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAVID 311

Query: 1146 ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 1325
            ACGKGG++FKQ   +FDEM +  V+PDRITFNSL+AVCSRGG W  A++LF++M +R IE
Sbjct: 312  ACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRRIE 371

Query: 1326 QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1505
            QD+F+YNTLLDA+CKGGQMDLA   L  M  + I PN ++YSTVIDG+AKAGR +EAL+L
Sbjct: 372  QDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEALNL 431

Query: 1506 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1685
            + EM+  GI LDRVSYNTLLS+Y K GR EEAL I REM   G KKD VTYNALLGGYGK
Sbjct: 432  FGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGK 491

Query: 1686 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVL 1865
            QGKYD+VK +F EM+ + V PN+LTYSTLID YSKGGL  EAM++  EFK  GL+ DVVL
Sbjct: 492  QGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADVVL 551

Query: 1866 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEF 2045
            YSA IDALCK+G VG A+ L+D+MT EGI+PNVVT NSIID+ GRS+     +  +SA++
Sbjct: 552  YSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSA-----TMDRSADY 606

Query: 2046 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLS-IPEEIKQKSEDI 2222
             ++   L  S   LS ++ +   +          +Q+  +    S +   ++ ++  +++
Sbjct: 607  -SNGGSLPFSSSALSALTETEGNR---------VIQLFGQLTTESNNRTTKDCEEGMQEL 656

Query: 2223 LGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLL 2402
               +++F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGV HGLL
Sbjct: 657  SCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLL 716

Query: 2403 MGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWE 2582
            MG R+ VW +AQ+LFD+V  MD +TASAFYNALTDMLWHFGQ++GA+LV L+GRSRQVWE
Sbjct: 717  MGQRENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWE 776

Query: 2583 NAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDST 2762
            N W +SCLDLHLMS+GAA+AMVHAWLL+IRSIV+EG ELPK+LSILTGWGKHSKV GD  
Sbjct: 777  NVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGA 836

Query: 2763 LRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRDKI 2918
            LRRAVE LL  + APFHL+K N GRFTS+G VV  WLRES TL+LLIL D I
Sbjct: 837  LRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHDHI 888


>ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345387|gb|EEE80792.2| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 864

 Score =  880 bits (2275), Expect = 0.0
 Identities = 458/773 (59%), Positives = 579/773 (74%), Gaps = 8/773 (1%)
 Frame = +3

Query: 618  QLGSDFCGKRPKRGISKNNLGRGNS-MKCNQSLTAQAVLSDVLN-----APLEQPVDNFL 779
            +L SDF G+R  R +SK + GR  + M    +  AQ  L +V+        LE  + NF 
Sbjct: 91   ELVSDFPGRRSTRFVSKLHFGRPRTTMGTRHTSVAQEALQNVIEYGKDERALENVLLNF- 149

Query: 780  KGKRLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRV 959
            + +    DDYV L++ELG+RG+C KA+ CF++AV RE++ NEQGKL + MIS LG+LG+V
Sbjct: 150  ESRLSGSDDYVFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMISTLGRLGKV 209

Query: 960  DLAQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAV 1139
            ++A+ VF  A  +GYG  VYAFSA++SAYGRSG   EAI +F  MK  G  PN+VTYNAV
Sbjct: 210  EMAKTVFKAALTEGYGNTVYAFSAIISAYGRSGYCNEAIKIFYSMKDYGLKPNLVTYNAV 269

Query: 1140 IDACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRG 1319
            IDACGKGGV+FK+ +  FDEM++  ++PDRITFNSL+AVCS+GG W  A+ L  +M++RG
Sbjct: 270  IDACGKGGVEFKRVLEIFDEMLRNGMQPDRITFNSLLAVCSKGGLWEAARSLSCEMVNRG 329

Query: 1320 IEQDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEAL 1499
            I+QDIFTYNTLLDA+CKGGQ+D+A   + +M  +NI PN +TYST+IDGYAKAGRL++A 
Sbjct: 330  IDQDIFTYNTLLDAVCKGGQLDMAFEIMSEMPAKNILPNVVTYSTMIDGYAKAGRLDDAR 389

Query: 1500 DLYHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGY 1679
            +L++EMK  GI LDRVSYNTLLS+YAK GRFEEA+ +CREME++G +KD VTYNALLGGY
Sbjct: 390  NLFNEMKFLGISLDRVSYNTLLSIYAKLGRFEEAMDVCREMENSGIRKDVVTYNALLGGY 449

Query: 1680 GKQGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDV 1859
            GKQ KYD V+ +FEEM+ + V+PN+LTYSTLIDVYSKGGL  EAM V  EFK+ GLK DV
Sbjct: 450  GKQYKYDVVRKVFEEMKARHVSPNLLTYSTLIDVYSKGGLYREAMDVFREFKKAGLKADV 509

Query: 1860 VLYSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNS--RSYGQ 2033
            VLYSA IDALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GR +   S     GQ
Sbjct: 510  VLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPATTESVVDDAGQ 569

Query: 2034 SAEFLADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKS 2213
            ++E           +++LS  +   A K+  ++   N + I      A+    +      
Sbjct: 570  TSEL---------QIDSLSSSAVEKATKSLVADREDNRI-IKIFGQLAAEKAGQAKNSGG 619

Query: 2214 EDILGAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAH 2393
            ++++  + +FH+M +L IKPNVVTFS ILNACSRC SF+EASMLLEELR+FD++VYGVAH
Sbjct: 620  QEMMCILGVFHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELRLFDNQVYGVAH 679

Query: 2394 GLLMGFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQ 2573
            GLLMG+R+ VW +AQ+LFDEV  MDS+TASAFYNALTDMLWHFGQ++GAQLVVL+G+ RQ
Sbjct: 680  GLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQ 739

Query: 2574 VWENAWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTG 2753
            VWEN W ESCLDLHLMS+GAA+AMVHAWLL++R+IVFEG E+PKLL         SKV G
Sbjct: 740  VWENVWSESCLDLHLMSSGAARAMVHAWLLNVRAIVFEGHEVPKLL---------SKVVG 790

Query: 2754 DSTLRRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
            DSTLRRAVE LL+ +GAPF  AK N GR  STG VV +WLRESGTL++L+L D
Sbjct: 791  DSTLRRAVEALLMGMGAPFRSAKCNLGRLISTGSVVASWLRESGTLKVLVLHD 843


>ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 870

 Score =  879 bits (2272), Expect = 0.0
 Identities = 454/769 (59%), Positives = 570/769 (74%), Gaps = 5/769 (0%)
 Frame = +3

Query: 621  LGSDFCGKRPKRGISKNNLGRGNSM--KCNQSLTAQAVLSDVLNAPLEQPVDNFLKG--K 788
            L S F G+R  R +SK +LGR  +     +  L  +A+ + +     +  +D+ L     
Sbjct: 82   LVSAFSGRRSTRMVSKMHLGRPKTTVGSRHSPLAEEALETAIRFGKDDFALDDVLHSFES 141

Query: 789  RLVLDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDLA 968
            RLV DD+  L++ELG+RGEC KA+ CF++AV RE++  EQGKL + MIS LG+LG+V+LA
Sbjct: 142  RLVSDDFTFLLRELGNRGECWKAIRCFEFAVRRERKRTEQGKLASSMISTLGRLGKVELA 201

Query: 969  QQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVIDA 1148
            + VF  A  +GYG+ VY +SAL+SAYGRSG   EAI V E MK SG  PN+VTYNAVIDA
Sbjct: 202  KNVFQTAVNEGYGRTVYTYSALISAYGRSGYCDEAIRVLESMKDSGVKPNLVTYNAVIDA 261

Query: 1149 CGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIEQ 1328
            CGKGGV+FK+ +  FDEM+K  V+PDRIT+NSL+AVCSRGG W  A++LF +M+ RGI+Q
Sbjct: 262  CGKGGVEFKKVVEIFDEMLKVGVQPDRITYNSLLAVCSRGGLWEAARNLFSEMVDRGIDQ 321

Query: 1329 DIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDLY 1508
            DI+TYNTLLDA+ KGGQMDLA   + +M  +NI PN +TYST+IDGYAKAGRL +AL+L+
Sbjct: 322  DIYTYNTLLDAISKGGQMDLAYKIMSEMPSKNILPNVVTYSTMIDGYAKAGRLEDALNLF 381

Query: 1509 HEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGKQ 1688
            +EMK   I LDRV YNTLLS+Y K GRFEEAL +C+EME  G  KD V+YNALLGGYGKQ
Sbjct: 382  NEMKFLAIGLDRVLYNTLLSLYGKLGRFEEALNVCKEMESVGIAKDVVSYNALLGGYGKQ 441

Query: 1689 GKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVLY 1868
            GKYD+VK L+ EM+ +RV+PN+LTYSTLIDVYSKGGL  EA++V  EFKQ GLK DVVLY
Sbjct: 442  GKYDEVKGLYNEMKVERVSPNLLTYSTLIDVYSKGGLYAEAVKVFREFKQAGLKADVVLY 501

Query: 1869 SAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGR-SSKGNSRSYGQSAEF 2045
            S  I+ALCK+G V  A+ LLD+MT EGI PNVVT NSIID+ GR ++   +   G     
Sbjct: 502  SELINALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPATTVCAVDAGACGIV 561

Query: 2046 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKSEDIL 2225
            L        S  +   +S    +   R   +   +++  +         ++ ++  ++IL
Sbjct: 562  LRSESSSSISARDFD-ISDKNVQNEMRDREDTRIMKMFGQLTADKAGYAKKDRKVRQEIL 620

Query: 2226 GAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLM 2405
              + +F +M +L IKPNVVTFS ILNACSRC SF++ASMLLEELR+FD++VYGVAHGLLM
Sbjct: 621  CILGVFQKMHELDIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLM 680

Query: 2406 GFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWEN 2585
            G R  VW KAQ+LFDEV +MD +TASAFYNALTDMLWHFGQ+KGAQLVVL+G  R VWEN
Sbjct: 681  GCRGNVWVKAQSLFDEVKQMDCSTASAFYNALTDMLWHFGQKKGAQLVVLEGERRNVWEN 740

Query: 2586 AWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTL 2765
            AW  S LDLHLMS+GAA+AMVHAWLL+I SIV++G++LP LLSILTGWGKHSKV GDS L
Sbjct: 741  AWSNSRLDLHLMSSGAARAMVHAWLLNIHSIVYQGQQLPNLLSILTGWGKHSKVVGDSAL 800

Query: 2766 RRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
            RRAVE LL ++GAPF + + N GRF STG V  AWL+ESGTLE+L+L D
Sbjct: 801  RRAVEALLTSMGAPFRVHECNIGRFISTGSVAAAWLKESGTLEVLMLHD 849


>ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like isoform 2 [Solanum lycopersicum]
          Length = 829

 Score =  857 bits (2214), Expect = 0.0
 Identities = 445/769 (57%), Positives = 564/769 (73%), Gaps = 7/769 (0%)
 Frame = +3

Query: 627  SDFCGKRPKRGISKNNLGRGN-SMKCNQSLTAQAVLSDVLN-----APLEQPVDNFLKGK 788
            +DF G+R  R +SK + GR   S     S  AQ  L + +      A L+Q +  F  G 
Sbjct: 72   ADFSGRRSTRFVSKMHFGRAKISGNGRHSSFAQEALEEAIRCCNNEAGLDQVLLTF--GS 129

Query: 789  RLV-LDDYVSLMKELGSRGECSKAVECFKWAVSREKRSNEQGKLVTIMISILGKLGRVDL 965
            +LV  DDY  L +ELG+RGE   A+ CF++AV RE++ NEQGKL + MISILG+ G+VDL
Sbjct: 130  KLVGSDDYTFLFRELGNRGEWLAAMRCFQFAVGRERKRNEQGKLASSMISILGRSGKVDL 189

Query: 966  AQQVFDRAKADGYGKNVYAFSALVSAYGRSGLWKEAISVFELMKRSGCAPNIVTYNAVID 1145
            A++VF+ A +DGYG  VYA+SAL+SAY +SG   EAI VFE MK SG  PN+VTYNA+ID
Sbjct: 190  AEKVFENAVSDGYGSTVYAYSALISAYAKSGYCNEAIRVFETMKDSGLKPNLVTYNALID 249

Query: 1146 ACGKGGVDFKQAMGYFDEMIKRHVKPDRITFNSLIAVCSRGGRWVEAQDLFEQMLHRGIE 1325
            ACGKGG DFK+A   FDEM++  V+PDRITFNSL+AVCS  G W  A+ LF +M++RGI+
Sbjct: 250  ACGKGGADFKRASEIFDEMLRNGVQPDRITFNSLLAVCSGAGLWETARGLFNEMIYRGID 309

Query: 1326 QDIFTYNTLLDALCKGGQMDLAASALCDMSERNIYPNAITYSTVIDGYAKAGRLNEALDL 1505
            QDI+TYNT LD  C GGQ+D+A   + +M  +NI PN +TYSTVI G AKAGRL++AL L
Sbjct: 310  QDIYTYNTFLDVACNGGQIDVAFDIMSEMHAKNILPNQVTYSTVIRGCAKAGRLDKALSL 369

Query: 1506 YHEMKNAGIPLDRVSYNTLLSVYAKFGRFEEALRICREMEDAGFKKDTVTYNALLGGYGK 1685
            ++EMK AGI LDRVSYNTLL++YA  G+FEEAL + +EME  G KKD VTYNALL G+GK
Sbjct: 370  FNEMKCAGIKLDRVSYNTLLAIYASLGKFEEALNVSKEMEGMGIKKDVVTYNALLDGFGK 429

Query: 1686 QGKYDQVKALFEEMRWQRVTPNVLTYSTLIDVYSKGGLQTEAMQVLNEFKQTGLKPDVVL 1865
            QG Y +VK LF EM+ ++++PN+LTYSTLI VY KG L  +A++V  EFK+ GLK DVV 
Sbjct: 430  QGMYTKVKQLFAEMKAEKLSPNLLTYSTLISVYLKGALYHDAVEVYKEFKKQGLKADVVF 489

Query: 1866 YSAYIDALCKSGSVGEAIHLLDKMTAEGITPNVVTCNSIIDSLGRSSKGNSRSYGQSAEF 2045
            YS  IDALCK G V  +  LL++MT EGI PNVVT NSII++ G S+     S       
Sbjct: 490  YSKLIDALCKKGLVEYSSLLLNEMTKEGIQPNVVTYNSIINAFGESANNECGS------- 542

Query: 2046 LADNDELVRSLENLSCVSSSPARKNERSNNNANALQIVARNAFASLSIPEEIKQKSEDIL 2225
                       +N++ + S+ ++    +    N ++I  + A    +  ++   + +D+L
Sbjct: 543  -----------DNVTHIVSAISQSKWENTEEDNIVKIFEQLAAQKSASGKKTNAERQDML 591

Query: 2226 GAVDIFHRMKKLGIKPNVVTFSVILNACSRCPSFKEASMLLEELRVFDSRVYGVAHGLLM 2405
              + +FH+M +L IKPNVVTFS ILNACSRC SF EAS+LLEELR+FD++VYGVAHGLLM
Sbjct: 592  CILGVFHKMHELQIKPNVVTFSAILNACSRCSSFDEASLLLEELRLFDNQVYGVAHGLLM 651

Query: 2406 GFRDRVWSKAQTLFDEVTRMDSATASAFYNALTDMLWHFGQRKGAQLVVLDGRSRQVWEN 2585
            G R+ VWS+A +LF+EV +MDS+TASAFYNALTDMLWHF Q++GAQLVVL+G+  +VWEN
Sbjct: 652  GQREGVWSQALSLFNEVKQMDSSTASAFYNALTDMLWHFDQKQGAQLVVLEGKRSEVWEN 711

Query: 2586 AWCESCLDLHLMSAGAAQAMVHAWLLSIRSIVFEGRELPKLLSILTGWGKHSKVTGDSTL 2765
             W  SCLDLHLMS+GAA AMVHAWLLSIRSIVFEG ELPK+LSILTGWGKHSK+TGD  L
Sbjct: 712  TWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVFEGHELPKMLSILTGWGKHSKITGDGAL 771

Query: 2766 RRAVETLLLTLGAPFHLAKFNEGRFTSTGPVVDAWLRESGTLELLILRD 2912
            +RA+E LL ++GAPF +AK N GRF STG VV AWLRESGTLE+L+L+D
Sbjct: 772  KRAIEGLLTSIGAPFQIAKCNIGRFISTGAVVTAWLRESGTLEVLVLQD 820

