BLASTX nr result

ID: Catharanthus23_contig00002621 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00002621
         (2636 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343621.1| PREDICTED: pentatricopeptide repeat-containi...  1100   0.0  
ref|XP_004242990.1| PREDICTED: pentatricopeptide repeat-containi...  1080   0.0  
ref|XP_002275298.1| PREDICTED: pentatricopeptide repeat-containi...  1028   0.0  
ref|XP_006474049.1| PREDICTED: pentatricopeptide repeat-containi...   977   0.0  
ref|XP_006453556.1| hypothetical protein CICLE_v10007511mg [Citr...   971   0.0  
gb|EXB42930.1| hypothetical protein L484_013952 [Morus notabilis]     970   0.0  
ref|XP_004288724.1| PREDICTED: pentatricopeptide repeat-containi...   962   0.0  
ref|XP_004152852.1| PREDICTED: pentatricopeptide repeat-containi...   958   0.0  
ref|XP_004163058.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   957   0.0  
ref|XP_003610281.1| Pentatricopeptide repeat-containing protein ...   952   0.0  
gb|EOY31480.1| Pentatricopeptide repeat (PPR) superfamily protei...   949   0.0  
ref|XP_004507756.1| PREDICTED: pentatricopeptide repeat-containi...   938   0.0  
gb|ESW26860.1| hypothetical protein PHAVU_003G154400g [Phaseolus...   932   0.0  
ref|XP_003550682.1| PREDICTED: pentatricopeptide repeat-containi...   929   0.0  
gb|EPS60782.1| hypothetical protein M569_14020, partial [Genlise...   919   0.0  
ref|XP_002867333.1| pentatricopeptide repeat-containing protein ...   919   0.0  
ref|NP_194799.1| pentatricopeptide repeat-containing protein [Ar...   917   0.0  
ref|XP_006285932.1| hypothetical protein CARUB_v10007444mg [Caps...   908   0.0  
ref|XP_006412675.1| hypothetical protein EUTSA_v10024455mg [Eutr...   886   0.0  
gb|EMJ05798.1| hypothetical protein PRUPE_ppa002987mg [Prunus pe...   840   0.0  

>ref|XP_006343621.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30700-like
            isoform X1 [Solanum tuberosum]
            gi|565353404|ref|XP_006343622.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g30700-like isoform X2 [Solanum tuberosum]
          Length = 791

 Score = 1100 bits (2846), Expect = 0.0
 Identities = 557/791 (70%), Positives = 633/791 (80%)
 Frame = +3

Query: 135  MICRAIAAAAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLK 314
            MI R IA+ AQRD NFF SL+N+ +TLSQLNQ HA LI NGLS +LI  TKLTHK  D K
Sbjct: 1    MIYRTIASIAQRDRNFFISLINQATTLSQLNQIHANLIRNGLSNDLITITKLTHKFSDFK 60

Query: 315  SLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYA 494
            S+S+AK LF   ++T P DLF YNV IRG SRN   ++A+SL+  L + + LKPD FT+A
Sbjct: 61   SISKAKNLFTTFNNTNPPDLFLYNVLIRGLSRNGLGVEALSLYLDLLKGSKLKPDNFTFA 120

Query: 495  FVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEP 674
            FVVS  SSSG E VGIL+HG VIVSG  SD+FVGSALVDMYMG SR+  A+KVFD I E 
Sbjct: 121  FVVSGFSSSGCEKVGILIHGHVIVSGFGSDVFVGSALVDMYMGFSRIGHAYKVFDGIPER 180

Query: 675  DTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFH 854
            D+VLWNTM+SGLV+NCCF + IQ+F  MV                        R GM+ H
Sbjct: 181  DSVLWNTMVSGLVRNCCFEESIQVFGDMVGRGTKFDSTTLAVVLTAVAELQDLRNGMLIH 240

Query: 855  SLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTE 1034
             LAVK G   H+YVLT LISMYSKC DVSTA+ LF  +  PDLIS NAMI+GF  NN+ E
Sbjct: 241  CLAVKMGYDVHEYVLTGLISMYSKCGDVSTAKLLFGMIREPDLISCNAMIAGFCFNNENE 300

Query: 1035 SAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALI 1214
            S+VRLF  LL+ G+KVNS+T+VGLIPVS PFGHL LTCSIHGF +KS +VS PS STAL 
Sbjct: 301  SSVRLFRELLVHGEKVNSSTIVGLIPVSCPFGHLTLTCSIHGFCVKSGMVSNPSVSTALT 360

Query: 1215 TVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPV 1394
            TVYSRLNE E AR+LFDESP KSLASWNAMISGYAQNG TE AI LFREMQ LDIHPNPV
Sbjct: 361  TVYSRLNEMELARRLFDESPKKSLASWNAMISGYAQNGLTEMAISLFREMQKLDIHPNPV 420

Query: 1395 TITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTI 1574
            TITSILSACAQLG  S+GKWVHDLIK+E FESN+YV TAL+DMYAKCG+IEEAR +FD+I
Sbjct: 421  TITSILSACAQLGTLSMGKWVHDLIKKEKFESNIYVLTALVDMYAKCGNIEEARQVFDSI 480

Query: 1575 AEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGE 1754
             EKNVVTWNAMIS YGLHG G EA++LF +ML SG++PT +TFL VLYACSHAGLVEEG+
Sbjct: 481  TEKNVVTWNAMISAYGLHGCGREALVLFDQMLHSGVSPTGVTFLCVLYACSHAGLVEEGQ 540

Query: 1755 KIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMI 1934
            KIFHSM+ + + +P PEHYACMVDLLGRAGKLE ALEFI +MP++PGPAEWGALLGACM+
Sbjct: 541  KIFHSMSHDHDTEPLPEHYACMVDLLGRAGKLENALEFIYEMPLEPGPAEWGALLGACMV 600

Query: 1935 HKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCT 2114
            HK+ +LA+ ASDKLF MD  SVG+YVLLSNIYSAD NY QAASVR++ K +NLAK+PGCT
Sbjct: 601  HKNIDLARLASDKLFAMDRGSVGYYVLLSNIYSADRNYCQAASVRKVLKNKNLAKTPGCT 660

Query: 2115 LIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMV 2294
            LIE+N   HVFTSSDQSHPQAA I+  LE+LM KMREAGF TET TALHDV      LMV
Sbjct: 661  LIEVNSYQHVFTSSDQSHPQAAAIYAKLEELMEKMREAGFHTETSTALHDVEEEEKELMV 720

Query: 2295 KVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHF 2474
            KVHSEKLAIAFG + SEP TEIRIIKNLRVC+DCHNFTKFVS +T R+IVVRDANRFHHF
Sbjct: 721  KVHSEKLAIAFGLLTSEPRTEIRIIKNLRVCVDCHNFTKFVSKVTDRVIVVRDANRFHHF 780

Query: 2475 KDGVCSCGDYW 2507
            KDG CSCGDYW
Sbjct: 781  KDGDCSCGDYW 791


>ref|XP_004242990.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30700-like
            [Solanum lycopersicum]
          Length = 791

 Score = 1080 bits (2793), Expect = 0.0
 Identities = 546/791 (69%), Positives = 629/791 (79%)
 Frame = +3

Query: 135  MICRAIAAAAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLK 314
            MI R IA+ AQRD NFF SL+N+ +TLSQLNQ HA LI NGLS +LI  TKLTHK  D K
Sbjct: 1    MIYRTIASIAQRDRNFFISLINQATTLSQLNQLHANLIRNGLSNDLITITKLTHKFSDFK 60

Query: 315  SLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYA 494
            S+S+AK LF   ++T P DLF YNV IRG SRN   ++A+SL+  L +   LKPD FT+A
Sbjct: 61   SISKAKNLFTTFNNTNPPDLFLYNVLIRGLSRNGLGVEALSLYLDLLKGNKLKPDNFTFA 120

Query: 495  FVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEP 674
            FVVS+ SSSG E VGIL+HG VIVSG  SD+FVGSALVDMYM  SR+  A+KVFD I E 
Sbjct: 121  FVVSSFSSSGCEKVGILIHGHVIVSGFGSDVFVGSALVDMYMRFSRIGHAYKVFDGIPER 180

Query: 675  DTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFH 854
            D+VLWNTM+SGLV+NCCF + +++F  MV                        R GM+ H
Sbjct: 181  DSVLWNTMVSGLVRNCCFEESLRVFGDMVGRGTGFDSTTLAVVLTAVAELQDLRNGMLIH 240

Query: 855  SLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTE 1034
             LAVK G   H+YVLT LIS+YSKC DV TA+ LF  ++ PDLIS NAMI+GF  N++ E
Sbjct: 241  CLAVKMGYDVHEYVLTGLISLYSKCGDVLTAKLLFGMIKEPDLISCNAMIAGFCFNDENE 300

Query: 1035 SAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALI 1214
            S+VRLF  LL+ G+KVNS+T+VGLIPVS PFGHL LTCSIHGF +K+ +V  PSASTAL 
Sbjct: 301  SSVRLFRELLVHGEKVNSSTIVGLIPVSCPFGHLNLTCSIHGFCVKTGMVLNPSASTALT 360

Query: 1215 TVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPV 1394
            TVYSRLNE E AR+LFDES  KSLASWNAMISGYAQNG TE AI LFREMQ LDI+PNP+
Sbjct: 361  TVYSRLNEMELARRLFDESTKKSLASWNAMISGYAQNGLTEMAISLFREMQKLDINPNPI 420

Query: 1395 TITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTI 1574
            TITSILSACAQLG  S+GKWVHDLIK+E FESN+YV TAL+DMYAKCG+IEEAR +FD+I
Sbjct: 421  TITSILSACAQLGTLSMGKWVHDLIKKEKFESNIYVLTALVDMYAKCGNIEEARQVFDSI 480

Query: 1575 AEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGE 1754
             EKNVVTWNAMIS YGLHG G EA++LF +ML SG++PT +T+L VLYACSHAGLVEEG 
Sbjct: 481  TEKNVVTWNAMISAYGLHGCGQEALVLFDQMLHSGVSPTGVTYLCVLYACSHAGLVEEGR 540

Query: 1755 KIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMI 1934
            KIFHSM  + + +P PEHYACMVDLLGRAGKLE ALEFI +MPI+PGPAEWGALLGACM+
Sbjct: 541  KIFHSMIHDHDTEPLPEHYACMVDLLGRAGKLEKALEFIYEMPIEPGPAEWGALLGACMV 600

Query: 1935 HKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCT 2114
            HK+T+LA+ ASDKLF MD  SVG+YVLLSNIYSAD NY QAASVR++ K +NLAK+PGCT
Sbjct: 601  HKNTDLARLASDKLFAMDRGSVGYYVLLSNIYSADRNYFQAASVRKVLKNKNLAKTPGCT 660

Query: 2115 LIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMV 2294
            LIE+NG  HVFTSSDQSHPQAA I+  LE+LM KMREAGF TET TALHDV      LMV
Sbjct: 661  LIEVNGYQHVFTSSDQSHPQAAAIYAKLEELMEKMREAGFHTETSTALHDVEEEEKELMV 720

Query: 2295 KVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHF 2474
            KVHSEKLAIA+G + SEP TEIRIIKNLRVC+DCHNFTKFVS +T R +VVRD NRFHHF
Sbjct: 721  KVHSEKLAIAYGLLTSEPRTEIRIIKNLRVCVDCHNFTKFVSKVTDRNVVVRDTNRFHHF 780

Query: 2475 KDGVCSCGDYW 2507
            KDG CSCGDYW
Sbjct: 781  KDGECSCGDYW 791


>ref|XP_002275298.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30700
            [Vitis vinifera]
          Length = 781

 Score = 1028 bits (2659), Expect = 0.0
 Identities = 529/792 (66%), Positives = 609/792 (76%), Gaps = 1/792 (0%)
 Frame = +3

Query: 135  MICRAIAAAAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLK 314
            M+ R IA+      N F +L+N+VSTL QLNQTHAQ+ILNGL  +L+  TKLTHKL  LK
Sbjct: 1    MLYRGIASTR----NLFLTLINRVSTLHQLNQTHAQIILNGLHNDLVTVTKLTHKLSHLK 56

Query: 315  SLSQAKLLFKCLSSTVPL-DLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTY 491
            ++ QA LLF    ST+P  DLF YNV IR FS NNSP  AVSL+ HLR+ T L+PD FTY
Sbjct: 57   AIDQASLLF----STIPNPDLFLYNVLIRAFSLNNSPSSAVSLYTHLRKSTPLEPDNFTY 112

Query: 492  AFVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISE 671
            AFV+S  SS GL   G+LLH   IV+G  SDLFVGSA+V  Y   SRV +A KVFD + E
Sbjct: 113  AFVISGASSLGL---GLLLHAHSIVAGFGSDLFVGSAIVACYFKFSRVAAARKVFDGMLE 169

Query: 672  PDTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIF 851
             DTVLWNTM+SGLVKN CF + I IF  MV                          GM  
Sbjct: 170  RDTVLWNTMVSGLVKNSCFDEAILIFGDMVKGGIGFDSTTVAAVLPGVAELQDLALGMGI 229

Query: 852  HSLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKT 1031
              LA+K G  SH YV+T L  +YSKC ++ TAR LF Q+  PDL+SYNAMISG++CNN+T
Sbjct: 230  QCLAMKVGFHSHAYVITGLACLYSKCGEIETARLLFGQIGQPDLVSYNAMISGYTCNNET 289

Query: 1032 ESAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTAL 1211
            ES+VRLF  LL+ G+KVNS+++VGLIPV  PFGHL LT  IHGF  KS +VS  S STAL
Sbjct: 290  ESSVRLFKELLVSGEKVNSSSIVGLIPVFFPFGHLHLTRCIHGFCTKSGVVSNSSVSTAL 349

Query: 1212 ITVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNP 1391
             TVYSRLNE E AR LFDES  KSLASWNAMISGYAQNG TE AI LF+EMQ  ++ PNP
Sbjct: 350  TTVYSRLNEIESARLLFDESSEKSLASWNAMISGYAQNGLTEKAISLFQEMQKCEVRPNP 409

Query: 1392 VTITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDT 1571
            VT+TSILSACAQLGA SLGKWVHDLI +E+FESN++VSTALIDMYAKCGSI EA+ LF  
Sbjct: 410  VTVTSILSACAQLGALSLGKWVHDLINRESFESNIFVSTALIDMYAKCGSITEAQRLFSM 469

Query: 1572 IAEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEG 1751
            + EKN VTWNAMISGYGLHG G+EA+ LF +ML S ++PT +TFLSVLYACSHAGLV EG
Sbjct: 470  MPEKNAVTWNAMISGYGLHGYGHEALNLFNEMLHSRVSPTGVTFLSVLYACSHAGLVREG 529

Query: 1752 EKIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACM 1931
            ++IF SM  +  F+P PEHYACMVDLLGRAG L+ AL+FI +MP++PGP  WGALLGACM
Sbjct: 530  DEIFRSMVHDHGFEPLPEHYACMVDLLGRAGNLDKALDFIRKMPVEPGPPVWGALLGACM 589

Query: 1932 IHKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGC 2111
            IHKD NLA+ ASDKLFE+DP++VG+YVLLSNIYSA  NYP+AASVR + K+R LAK+PGC
Sbjct: 590  IHKDANLARLASDKLFELDPQNVGYYVLLSNIYSAGQNYPEAASVRGVVKRRKLAKTPGC 649

Query: 2112 TLIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLM 2291
            TLIE+   LH+FTS DQSHPQA  I+ MLEKL GKMREAGFQTET TALHDV      LM
Sbjct: 650  TLIEVANTLHIFTSGDQSHPQATAIYAMLEKLTGKMREAGFQTETGTALHDVEEEEKELM 709

Query: 2292 VKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHH 2471
            VKVHSEKLAIAFG I SEPGTEIRIIKNLRVCLDCHN TKF+S IT+R+IVVRDANRFHH
Sbjct: 710  VKVHSEKLAIAFGLITSEPGTEIRIIKNLRVCLDCHNATKFISKITERVIVVRDANRFHH 769

Query: 2472 FKDGVCSCGDYW 2507
            FKDG+CSCGDYW
Sbjct: 770  FKDGICSCGDYW 781


>ref|XP_006474049.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30700-like
            [Citrus sinensis]
          Length = 784

 Score =  977 bits (2526), Expect = 0.0
 Identities = 497/778 (63%), Positives = 582/778 (74%), Gaps = 1/778 (0%)
 Frame = +3

Query: 177  NFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLKSLSQAKLLFKCLSS 356
            N F SLL    T SQL QTHAQ+I++G   +L   TKL H+L D K+   A+ LF  +  
Sbjct: 10   NLFLSLLKGAKTQSQLTQTHAQIIIHGFQNDLSTVTKLAHRLSDFKATCYARALFFSIPK 69

Query: 357  TVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAFVVSAISSSGLEYV 536
                DLF +NV IRGFS N  P  ++  + HLR+ T L PD FTY+FV+SA S+     +
Sbjct: 70   P---DLFLFNVIIRGFSNNEMPKSSICFYTHLRKNTALTPDNFTYSFVLSAASACCDRSI 126

Query: 537  GILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVLWNTMLSGLVK 716
            G+LLHG  IVSG  SDLFVG+ALVD+Y   S V+SA KVFD++ E DTVLWN+M+SGL+K
Sbjct: 127  GVLLHGHAIVSGYGSDLFVGAALVDLYFKFSWVKSARKVFDKMPEKDTVLWNSMISGLMK 186

Query: 717  NCCFYDCIQIFRHMVAND-RXXXXXXXXXXXXXXXXXXXXRTGMIFHSLAVKTGCQSHDY 893
            NCCF D I +F  MV N                       R GM    L +K G   H Y
Sbjct: 187  NCCFQDSIWVFGDMVRNGGTWLDSTSVAAVLPAVAEVQELRLGMEIQCLGLKLGFHDHVY 246

Query: 894  VLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAVRLFNALLLLG 1073
            VLT L+S YSKC +V  A  LF  +  PDLIS NAMISG++CN KTES++RLF  LL   
Sbjct: 247  VLTGLVSFYSKCGEVERAELLFRDIVRPDLISCNAMISGYTCNGKTESSLRLFRQLLASA 306

Query: 1074 DKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVYSRLNEFEFAR 1253
            ++VNS+T+VGLIPV  PFGHL LT  IH F +KS +VS  S  TAL TVYSRLNE E AR
Sbjct: 307  ERVNSSTIVGLIPVFYPFGHLHLTNCIHSFCLKSGIVSNSSVLTALSTVYSRLNEMEAAR 366

Query: 1254 KLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITSILSACAQLG 1433
            KLFDES  KSLASWNAMI+GY QNG TE AI LF+EMQA  + PNPVT++SILSACAQLG
Sbjct: 367  KLFDESSEKSLASWNAMIAGYTQNGLTEEAISLFQEMQASKVAPNPVTVSSILSACAQLG 426

Query: 1434 APSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEKNVVTWNAMIS 1613
            A SLGKWVH+L+K  NFESN+YVSTALIDMYAKCG+I EAR LFD ++ K+ VTWN MIS
Sbjct: 427  AISLGKWVHELVKSRNFESNIYVSTALIDMYAKCGNIVEARELFDLMSHKSEVTWNTMIS 486

Query: 1614 GYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIFHSMTREFNFK 1793
            GYGLHG G EA+ LF +ML SGI P+ +TFLSVLYACSHAGLV EG++IF SM  +  FK
Sbjct: 487  GYGLHGHGLEALQLFSEMLHSGIRPSGVTFLSVLYACSHAGLVREGDEIFQSMIHDHGFK 546

Query: 1794 PSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKDTNLAQFASDK 1973
            P  EHYACMVD+LGRAG+LE ALEFI  + ++PGPA WGALLGACMIHKDTNLA+ AS+K
Sbjct: 547  PLAEHYACMVDILGRAGQLEKALEFIKGLAVEPGPAVWGALLGACMIHKDTNLARVASEK 606

Query: 1974 LFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCTLIELNGILHVFTS 2153
            LFE+DP++VG++VLLSNIYSA+ +Y QAA+VRQ+ KKR LAK+PGCTLIE+ G  HVFTS
Sbjct: 607  LFELDPENVGYHVLLSNIYSAERDYLQAATVRQVVKKRKLAKAPGCTLIEVGGTPHVFTS 666

Query: 2154 SDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMVKVHSEKLAIAFGF 2333
             DQ HPQ+  I+ MLEKL GKMREAGFQTETVTALHDV      LM+KVHSEKLAIAFG 
Sbjct: 667  GDQLHPQSTAIYAMLEKLNGKMREAGFQTETVTALHDVEEEEKELMMKVHSEKLAIAFGL 726

Query: 2334 IASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHFKDGVCSCGDYW 2507
            IA+EPGTEIRIIKNLRVCLDCH  TKF+S +T R+IVVRDANRFHHFKDGVCSCGDYW
Sbjct: 727  IATEPGTEIRIIKNLRVCLDCHTATKFISKVTGRVIVVRDANRFHHFKDGVCSCGDYW 784


>ref|XP_006453556.1| hypothetical protein CICLE_v10007511mg [Citrus clementina]
            gi|557556782|gb|ESR66796.1| hypothetical protein
            CICLE_v10007511mg [Citrus clementina]
          Length = 784

 Score =  971 bits (2511), Expect = 0.0
 Identities = 495/778 (63%), Positives = 579/778 (74%), Gaps = 1/778 (0%)
 Frame = +3

Query: 177  NFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLKSLSQAKLLFKCLSS 356
            N F SLL    T SQL QTHAQ+I++G   +L   TKL H+L D K+   A+ LF  +  
Sbjct: 10   NLFLSLLKGAKTQSQLTQTHAQIIIHGFQNDLSTVTKLAHRLSDFKATCYARALFFSIPK 69

Query: 357  TVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAFVVSAISSSGLEYV 536
                DLF +NV IRGFS N  P  ++  + HLR+ T L PD FTY+FV+SA S+     +
Sbjct: 70   P---DLFLFNVIIRGFSNNEMPKSSICFYTHLRKNTALTPDNFTYSFVLSAASACCDRSI 126

Query: 537  GILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVLWNTMLSGLVK 716
            G+LLHG  IVSG  SDLFVG+ALVD+Y   S V+SA KVFD++ E DTVLWN+M+SGL+K
Sbjct: 127  GVLLHGHAIVSGYGSDLFVGAALVDLYFKFSWVKSARKVFDKMPEKDTVLWNSMISGLMK 186

Query: 717  NCCFYDCIQIFRHMVAND-RXXXXXXXXXXXXXXXXXXXXRTGMIFHSLAVKTGCQSHDY 893
            NCCF D I +F  MV N                       R GM    L +K G   H Y
Sbjct: 187  NCCFQDSIWVFGDMVRNGGTWLDSTSVAAVLPAVAEVQELRLGMEIQCLGLKLGFHDHVY 246

Query: 894  VLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAVRLFNALLLLG 1073
            VLT L+S YSKC +V  A  LF  +  PDLIS NAMISG++CN KTES++RLF  LL   
Sbjct: 247  VLTGLVSFYSKCGEVEGAELLFRDIVRPDLISCNAMISGYTCNGKTESSLRLFRQLLASA 306

Query: 1074 DKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVYSRLNEFEFAR 1253
            ++VNS+T+VGLIPV  PFGHL LT  IH F +KS +VS  S  TAL TVYSRLNE E AR
Sbjct: 307  ERVNSSTIVGLIPVFYPFGHLHLTNCIHNFCLKSGIVSNSSVLTALSTVYSRLNEMEAAR 366

Query: 1254 KLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITSILSACAQLG 1433
            KLFDES  KSLASWNAMI+GY QNG TE AI LF+EMQA  + PNPVT++SILSACAQLG
Sbjct: 367  KLFDESSEKSLASWNAMIAGYTQNGLTEEAISLFQEMQASKVAPNPVTVSSILSACAQLG 426

Query: 1434 APSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEKNVVTWNAMIS 1613
            A SLGKWVH+L+K  NFESN+YVSTALIDMYAKCG+I EAR LFD +  K+ VTWN MIS
Sbjct: 427  AISLGKWVHELVKSRNFESNIYVSTALIDMYAKCGNIVEARDLFDLMPHKSEVTWNTMIS 486

Query: 1614 GYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIFHSMTREFNFK 1793
            GYGLHG G EA+ LF +ML SGI P+ +TFLSVLYACSHAGLV EG++IF SM  +   K
Sbjct: 487  GYGLHGHGLEALQLFSEMLHSGIRPSGVTFLSVLYACSHAGLVREGDEIFQSMIHDHGLK 546

Query: 1794 PSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKDTNLAQFASDK 1973
            P  EHYACMVD+LGRAG+LE ALEFI  + ++PGPA WGALLGACMIHKDTNLA+ AS+K
Sbjct: 547  PLAEHYACMVDILGRAGQLEKALEFIKGLAVEPGPAVWGALLGACMIHKDTNLARVASEK 606

Query: 1974 LFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCTLIELNGILHVFTS 2153
            LFE+DP++VG++VLLSNIYSAD +Y QAA+VRQ+ KKR LAK+PGCTLIE+ G  HVFTS
Sbjct: 607  LFELDPENVGYHVLLSNIYSADRDYVQAATVRQVFKKRKLAKAPGCTLIEVGGTPHVFTS 666

Query: 2154 SDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMVKVHSEKLAIAFGF 2333
             DQ HPQ+  I+ MLEKL GKMREAGFQTETVTALHDV      LM+KVHSEKLAIAFG 
Sbjct: 667  GDQLHPQSTAIYAMLEKLNGKMREAGFQTETVTALHDVEEEEKELMMKVHSEKLAIAFGL 726

Query: 2334 IASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHFKDGVCSCGDYW 2507
            I++EPGTEIRIIKNLRVCLDCH  TKF+S +T R+IVVRDANRFHHFK GVCSCGDYW
Sbjct: 727  ISTEPGTEIRIIKNLRVCLDCHTATKFISKVTGRVIVVRDANRFHHFKGGVCSCGDYW 784


>gb|EXB42930.1| hypothetical protein L484_013952 [Morus notabilis]
          Length = 781

 Score =  970 bits (2508), Expect = 0.0
 Identities = 492/787 (62%), Positives = 587/787 (74%), Gaps = 1/787 (0%)
 Frame = +3

Query: 150  IAAAAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLKSLSQA 329
            I+ A  R  NF   +L    TL +L+QTHAQ I++GL  +L   TKLTHKL DLK++ QA
Sbjct: 2    ISGAGSR--NFLVGILKTAGTLPELSQTHAQAIVHGLQNDLAFLTKLTHKLSDLKAIRQA 59

Query: 330  KLLFKCLSSTVPL-DLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAFVVS 506
              LF     TVP  DLF +NV ++GFS N SP  A+SL+ HLR  T L PD+FTYAF VS
Sbjct: 60   CDLFL----TVPNPDLFLFNVLLKGFSTNKSPFSAISLYTHLRINTSLVPDEFTYAFAVS 115

Query: 507  AISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVL 686
            A SS      GILLH   +V G SS+LFVGSA VDMY+  S+V  A KVFD + E DTVL
Sbjct: 116  AASSFKDPRYGILLHAHSVVDGLSSNLFVGSAAVDMYLKFSKVELAQKVFDGMPERDTVL 175

Query: 687  WNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFHSLAV 866
            WNTM+SGLV+N CF D +++   MVA                         GM  H LA+
Sbjct: 176  WNTMISGLVRNFCFSDSVRVLADMVAGGTKFDSRTLAAVLPGVAELGELLLGMGIHGLAI 235

Query: 867  KTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAVR 1046
            K G  S  YV+T L+S+YSKC +   ARFLF QL YPDLI YNAMI+G++CNN TE+++R
Sbjct: 236  KVGLDSDVYVVTGLVSLYSKCGETDKARFLFGQLSYPDLICYNAMIAGYTCNNDTEASLR 295

Query: 1047 LFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVYS 1226
            LF  LL  G KVNS+T+VGLIPVS PFGHL+L  SI  F +K  ++S PS STAL TVY 
Sbjct: 296  LFKELLASGKKVNSSTIVGLIPVSSPFGHLQLASSIQSFCLKCGILSDPSVSTALTTVYC 355

Query: 1227 RLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITS 1406
            RLNE E AR++FDESP +SLASWN+MISGYAQNG TE AI LFREM  +   PNP+TITS
Sbjct: 356  RLNETESARQVFDESPERSLASWNSMISGYAQNGLTEAAISLFREMIPV-FSPNPITITS 414

Query: 1407 ILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEKN 1586
            ILSACAQLG  SLGKWVH LIK +N E N++V TAL+DMYAKCGSI EAR LFD +  K+
Sbjct: 415  ILSACAQLGTLSLGKWVHGLIKSKNLECNIFVKTALVDMYAKCGSITEARELFDLMTNKS 474

Query: 1587 VVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIFH 1766
            VVTWNAMISGYGLHG G++A  L+ +ML SG+ P  +TFLS+LYACSH+GLV EG++IF 
Sbjct: 475  VVTWNAMISGYGLHGHGHQAFKLYKEMLHSGVAPNGVTFLSILYACSHSGLVREGDEIFK 534

Query: 1767 SMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKDT 1946
            SM  +  F+P PEHYA MVD+LGRAG+LE ALEFI +MPI+PG   WG+LLGACM HKDT
Sbjct: 535  SMVCDHGFEPLPEHYASMVDILGRAGQLEQALEFIRRMPIEPGVVVWGSLLGACMTHKDT 594

Query: 1947 NLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCTLIEL 2126
             +A+ AS+K+FE+DP + G+YVLLSNIYS D N+P+AA VRQ+ K R L K+PGCTLIE+
Sbjct: 595  RIARLASEKIFELDPGNTGYYVLLSNIYSVDRNFPKAAMVRQLVKNRKLEKTPGCTLIEI 654

Query: 2127 NGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMVKVHS 2306
                HVFT+SD+SHP+A EI+ ML+KL GKM+EAGFQTETVTALHDV      LMVKVHS
Sbjct: 655  GETTHVFTASDRSHPRATEIYTMLDKLTGKMKEAGFQTETVTALHDVEEEEKELMVKVHS 714

Query: 2307 EKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHFKDGV 2486
            EKLAIAFG IA+EPGTEIRI+KNLRVCLDCHN TKF+S +T+R+IVVRDANRFHHFKDGV
Sbjct: 715  EKLAIAFGLIATEPGTEIRIVKNLRVCLDCHNATKFISKVTERVIVVRDANRFHHFKDGV 774

Query: 2487 CSCGDYW 2507
            CSCGDYW
Sbjct: 775  CSCGDYW 781


>ref|XP_004288724.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30700-like
            [Fragaria vesca subsp. vesca]
          Length = 790

 Score =  962 bits (2488), Expect = 0.0
 Identities = 493/794 (62%), Positives = 589/794 (74%), Gaps = 3/794 (0%)
 Frame = +3

Query: 135  MICRAIAAA---AQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLF 305
            MI R I+++   A RD N F S+L K +TLS L QTHAQ+ILNG   +L+I TKLTHK  
Sbjct: 1    MISRNISSSCSYAHRDRNLFLSILKKATTLSHLTQTHAQIILNGYQNDLVIITKLTHKFS 60

Query: 306  DLKSLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKF 485
            DLK++  A+ L   + S    DLF +NV I+GF+ N SPL +VSL+ HLR+ T L PDK+
Sbjct: 61   DLKAIRHARDL---VFSFPKPDLFLFNVLIKGFAANASPLSSVSLYTHLRKNTTLSPDKY 117

Query: 486  TYAFVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEI 665
            TYAF VSA S    E  G LLH   ++ G  S+L+VGS LVD Y   SRV  A KVFDE+
Sbjct: 118  TYAFAVSAASGFNDEKHGALLHAHSVIDGLGSNLYVGSILVDFYFKFSRVGYALKVFDEM 177

Query: 666  SEPDTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGM 845
             E DTV+WNTM+SGLV+NC   D +++FR MVA                       + G 
Sbjct: 178  PEKDTVVWNTMVSGLVRNCYNADAVRVFRDMVAGGTGFDSTTLATLLPAVAELQELKAGT 237

Query: 846  IFHSLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNN 1025
                LAVK G     +VLT L+S+YSKC ++ TAR +F  +  PDLI YNAMI+G++CNN
Sbjct: 238  WVQCLAVKAGFHDDVHVLTGLVSLYSKCGELETARRVFGMIGEPDLICYNAMIAGYTCNN 297

Query: 1026 KTESAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSAST 1205
            +T  AV LF  LL  G KVNS+T+VGL+PVS PFGHL+L+  +  F +KS +VS PS ST
Sbjct: 298  ETVLAVGLFKELLGYGKKVNSSTIVGLVPVSCPFGHLQLSGCLQSFCVKSGIVSHPSVST 357

Query: 1206 ALITVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHP 1385
            AL+TVY RLNE + AR+LFDES  ++LASWNAMISGY QNG T+TAI LFREM + +  P
Sbjct: 358  ALVTVYCRLNEIDSARQLFDESAKRTLASWNAMISGYTQNGHTDTAISLFREMMS-EFSP 416

Query: 1386 NPVTITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLF 1565
            NP T+T+ILSACAQLGA SLGKWVH LIK +N ESN+YV TAL+DMYAKCGSI EAR LF
Sbjct: 417  NPTTVTTILSACAQLGALSLGKWVHGLIKSKNLESNIYVLTALVDMYAKCGSIVEARQLF 476

Query: 1566 DTIAEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVE 1745
            D + EKNVVTWNAMISGYGLHG G+EA+ LF  ML SGI P+ ++FLSVLYACSHAGLV 
Sbjct: 477  DMMPEKNVVTWNAMISGYGLHGDGHEAMKLFNDMLDSGIPPSGVSFLSVLYACSHAGLVR 536

Query: 1746 EGEKIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGA 1925
            EG+ IFH M     F P  EHYACMVD+ GRAG+LE ALEFI +MP++PG A WGALLGA
Sbjct: 537  EGDDIFHRMVHNHKFVPLAEHYACMVDIFGRAGQLEKALEFIKKMPVEPGSAVWGALLGA 596

Query: 1926 CMIHKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSP 2105
            C IHKDT LA+ AS+KL E+DP + G+YVLLSNI SAD N+P+AASVRQ+AK RNLAK+P
Sbjct: 597  CKIHKDTKLARLASEKLLELDPDNTGYYVLLSNILSADGNFPKAASVRQVAKHRNLAKTP 656

Query: 2106 GCTLIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXX 2285
            GCTL+E+    HVFT  D+SHPQA  I++ML+KL GKMREAGFQTET TALHDV      
Sbjct: 657  GCTLLEIGDTQHVFTCGDRSHPQATAIYKMLDKLTGKMREAGFQTETGTALHDVEEEEKE 716

Query: 2286 LMVKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRF 2465
            LM+ VHSEKLAIAF  I++ PGTEIRIIKNLRVCLDCHN TKF+SMIT+R+IVVRDANRF
Sbjct: 717  LMMNVHSEKLAIAFALISTAPGTEIRIIKNLRVCLDCHNATKFISMITQRVIVVRDANRF 776

Query: 2466 HHFKDGVCSCGDYW 2507
            HHFKDGVCSCGDYW
Sbjct: 777  HHFKDGVCSCGDYW 790


>ref|XP_004152852.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30700-like
            [Cucumis sativus]
          Length = 788

 Score =  958 bits (2477), Expect = 0.0
 Identities = 492/792 (62%), Positives = 588/792 (74%), Gaps = 1/792 (0%)
 Frame = +3

Query: 135  MICRAIAAAAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLK 314
            MIC   A +A R   FF +LLN  +TLSQL Q  AQLIL+G+  +L   TKLTHK FDL 
Sbjct: 1    MICTNTATSAIRGQRFFLTLLNNATTLSQLLQIQAQLILHGIHYDLSSITKLTHKFFDLG 60

Query: 315  SLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYA 494
            +++  + LF  +S     DLF +NV IRGFS N  P  ++ L+ HLR++T L+PD FTYA
Sbjct: 61   AVAHVRQLFNKVSKP---DLFLFNVLIRGFSDNGLPKSSIFLYTHLRKKTNLRPDNFTYA 117

Query: 495  FVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEP 674
            F +SA S    E VG+LLH   IV G +S+LFVGSA+VD+Y   +R   A KVFD + E 
Sbjct: 118  FAISAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPER 177

Query: 675  DTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFH 854
            DTVLWNTM+SG  +N  F D I++F  M+                        R GM   
Sbjct: 178  DTVLWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQ 237

Query: 855  SLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTE 1034
             LA K G  S  YVLT LIS+YSKC      R LF+Q++ PDLISYNAMISG++ N++TE
Sbjct: 238  CLASKKGLHSDVYVLTGLISLYSKCGKSCKGRILFDQIDQPDLISYNAMISGYTFNHETE 297

Query: 1035 SAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALI 1214
            SAV LF  LL  G +VNS+T+VGLIPV  PF HL+L+  I   S+K  ++  PS STAL 
Sbjct: 298  SAVTLFRELLASGQRVNSSTLVGLIPVYLPFNHLQLSRLIQNLSLKIGIILQPSVSTALT 357

Query: 1215 TVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPV 1394
            TVY RLNE +FAR+LFDESP KSLASWNAMISGY QNG T+ AI LF+EM    + PNPV
Sbjct: 358  TVYCRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMP-QLSPNPV 416

Query: 1395 TITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTI 1574
            T+TSILSACAQLGA S+GKWVH LIK E  ESNVYVSTAL+DMYAKCGSI EAR LFD +
Sbjct: 417  TVTSILSACAQLGALSIGKWVHGLIKSERLESNVYVSTALVDMYAKCGSIVEARQLFDLM 476

Query: 1575 AEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGE 1754
             +KNVVTWNAMI+GYGLHG G EA+ LFY+ML SGI PT +TFLS+LYACSH+GLV EG 
Sbjct: 477  VDKNVVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVSEGN 536

Query: 1755 KIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMI 1934
            +IFHSM   + F+P  EHYACMVD+LGRAG+L  ALEFI +MP++PGPA WGALLGACMI
Sbjct: 537  EIFHSMANNYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMI 596

Query: 1935 HKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCT 2114
            HK+T +A  AS +LF++DP++VG+YVLLSNIYS D N+P+AASVRQ+ KKR LAK+PGCT
Sbjct: 597  HKNTEMANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCT 656

Query: 2115 LIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETV-TALHDVXXXXXXLM 2291
            LIE++   +VFTS D+SHPQA  IFEMLEKL GKMREAG+Q ETV TALHDV      LM
Sbjct: 657  LIEIDDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELM 716

Query: 2292 VKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHH 2471
            V VHSEKLAIAFG I+++PGTEIRIIKNLRVCLDCH  TKF+S IT+R+IVVRDANRFHH
Sbjct: 717  VNVHSEKLAIAFGLISTKPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHH 776

Query: 2472 FKDGVCSCGDYW 2507
            FK+G+CSCGDYW
Sbjct: 777  FKNGICSCGDYW 788


>ref|XP_004163058.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g30700-like [Cucumis sativus]
          Length = 788

 Score =  957 bits (2475), Expect = 0.0
 Identities = 492/792 (62%), Positives = 587/792 (74%), Gaps = 1/792 (0%)
 Frame = +3

Query: 135  MICRAIAAAAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLK 314
            MIC   A +A R   FF +LLN  +TLSQL Q  AQLIL+G+  +L   TKLTHK FDL 
Sbjct: 1    MICTNTATSAIRGQRFFLTLLNNATTLSQLLQIQAQLILHGIHYDLSSITKLTHKFFDLG 60

Query: 315  SLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYA 494
            +++  + LF  +S     DLF +NV IRGFS N  P  ++ L+ HLR+ T L+PD FTYA
Sbjct: 61   AVAHVRQLFNKVSKP---DLFLFNVLIRGFSDNGLPKSSIFLYTHLRKXTNLRPDNFTYA 117

Query: 495  FVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEP 674
            F +SA S    E VG+LLH   IV G +S+LFVGSA+VD+Y   +R   A KVFD + E 
Sbjct: 118  FAISAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPER 177

Query: 675  DTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFH 854
            DTVLWNTM+SG  +N  F D I++F  M+                        R GM   
Sbjct: 178  DTVLWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQ 237

Query: 855  SLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTE 1034
             LA K G  S  YVLT LIS+YSKC      R LF+Q++ PDLISYNAMISG++ N++TE
Sbjct: 238  CLASKKGLHSDVYVLTGLISLYSKCGKSCKGRILFDQIDQPDLISYNAMISGYTFNHETE 297

Query: 1035 SAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALI 1214
            SAV LF  LL  G +VNS+T+VGLIPV  PF HL+L+  I   S+K  ++  PS STAL 
Sbjct: 298  SAVTLFRELLASGQRVNSSTLVGLIPVYLPFNHLQLSRLIQNLSLKIGIILQPSVSTALT 357

Query: 1215 TVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPV 1394
            TVY RLNE +FAR+LFDESP KSLASWNAMISGY QNG T+ AI LF+EM    + PNPV
Sbjct: 358  TVYCRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMP-QLSPNPV 416

Query: 1395 TITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTI 1574
            T+TSILSACAQLGA S+GKWVH LIK E  ESNVYVSTAL+DMYAKCGSI EAR LFD +
Sbjct: 417  TVTSILSACAQLGALSIGKWVHGLIKSERLESNVYVSTALVDMYAKCGSIVEARQLFDLM 476

Query: 1575 AEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGE 1754
             +KNVVTWNAMI+GYGLHG G EA+ LFY+ML SGI PT +TFLS+LYACSH+GLV EG 
Sbjct: 477  VDKNVVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVSEGN 536

Query: 1755 KIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMI 1934
            +IFHSM   + F+P  EHYACMVD+LGRAG+L  ALEFI +MP++PGPA WGALLGACMI
Sbjct: 537  EIFHSMANNYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMI 596

Query: 1935 HKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCT 2114
            HK+T +A  AS +LF++DP++VG+YVLLSNIYS D N+P+AASVRQ+ KKR LAK+PGCT
Sbjct: 597  HKNTEMANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCT 656

Query: 2115 LIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETV-TALHDVXXXXXXLM 2291
            LIE++   +VFTS D+SHPQA  IFEMLEKL GKMREAG+Q ETV TALHDV      LM
Sbjct: 657  LIEIDDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELM 716

Query: 2292 VKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHH 2471
            V VHSEKLAIAFG I+++PGTEIRIIKNLRVCLDCH  TKF+S IT+R+IVVRDANRFHH
Sbjct: 717  VNVHSEKLAIAFGLISTKPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHH 776

Query: 2472 FKDGVCSCGDYW 2507
            FK+G+CSCGDYW
Sbjct: 777  FKNGICSCGDYW 788


>ref|XP_003610281.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355511336|gb|AES92478.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 783

 Score =  952 bits (2460), Expect = 0.0
 Identities = 475/777 (61%), Positives = 577/777 (74%)
 Frame = +3

Query: 177  NFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLKSLSQAKLLFKCLSS 356
            N   SL+NK ST   L QTHAQ ILNG   +L   TKLT KLFD  +   A+ LF    S
Sbjct: 13   NTLFSLINKASTFPHLAQTHAQFILNGYRFDLATLTKLTQKLFDFSATRHARALF---FS 69

Query: 357  TVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAFVVSAISSSGLEYV 536
                D+F +NV +RGFS N+SP  ++SL+ HLRR T L PD FTYAF V+A S+   +++
Sbjct: 70   VPKPDIFLFNVLVRGFSLNDSPSSSISLYTHLRRNTNLSPDNFTYAFAVAACSND--KHL 127

Query: 537  GILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVLWNTMLSGLVK 716
             +LLH   I+ G  S++FVGSALVD+Y   SRV  A KVFD + E DTVLWNTM++GLVK
Sbjct: 128  -MLLHAHSIIDGYGSNVFVGSALVDLYCKFSRVVYARKVFDGMPERDTVLWNTMINGLVK 186

Query: 717  NCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFHSLAVKTGCQSHDYV 896
            NCCF D IQ+FR MVA+                      + GM    LA+K G    DYV
Sbjct: 187  NCCFDDSIQLFREMVADGVRVDSSTVTAVLPAAAELQELKVGMGIQCLALKIGFGFCDYV 246

Query: 897  LTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAVRLFNALLLLGD 1076
            LT LIS+YSKC DV+TAR LF ++  PDLI+YNAMISGF+ N  TE +V+LF  LL  G+
Sbjct: 247  LTGLISLYSKCGDVNTARLLFRRINRPDLIAYNAMISGFTANGGTECSVKLFRELLFSGE 306

Query: 1077 KVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVYSRLNEFEFARK 1256
            +V+S+T+VGLIP+  PFGHL L CSIHGF +KS ++  P+ STA   +Y++LNE + AR 
Sbjct: 307  RVSSSTIVGLIPLHSPFGHLHLACSIHGFCVKSGIILNPTVSTAFTAIYNKLNEIDLARH 366

Query: 1257 LFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITSILSACAQLGA 1436
            LFDESP K++ +WNAMISGY QNG TETAI LF+EM   +  PN VTIT+ILSACAQLG+
Sbjct: 367  LFDESPEKTVVAWNAMISGYTQNGSTETAISLFKEMMKTEFTPNAVTITTILSACAQLGS 426

Query: 1437 PSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEKNVVTWNAMISG 1616
             S GKWVH LIK EN E N+YVSTAL+DMYAKCG+I EA  LFD+++EKN VTWN MI G
Sbjct: 427  LSFGKWVHHLIKSENLEPNIYVSTALVDMYAKCGNISEAWQLFDSMSEKNTVTWNTMIFG 486

Query: 1617 YGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIFHSMTREFNFKP 1796
            YGLHG G+EA+ L+ +ML  G  P+ +TFLSVLYACSHAGLV EGE+IFH+M  ++  +P
Sbjct: 487  YGLHGYGHEALKLYNEMLHLGYNPSAVTFLSVLYACSHAGLVGEGEEIFHNMVNKYRIEP 546

Query: 1797 SPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKDTNLAQFASDKL 1976
              EHYACMVD+LGR+G+LE ALEFI +MP++PGPA WG LLGACMIHKDT++A+ AS++L
Sbjct: 547  LIEHYACMVDILGRSGQLEKALEFIKKMPVEPGPAVWGTLLGACMIHKDTDIARLASERL 606

Query: 1977 FEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCTLIELNGILHVFTSS 2156
            FE+DP SVG+YVLLSNIYS + N+P+AAS+RQ+ KKR LAKSPGCTLIE+NG  HVF S 
Sbjct: 607  FELDPGSVGYYVLLSNIYSVERNFPKAASIRQVVKKRKLAKSPGCTLIEVNGTPHVFVSG 666

Query: 2157 DQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMVKVHSEKLAIAFGFI 2336
            D+SH  A +I+  LEKL GKMRE G+Q ETV ALHDV      L V VHSEKLAIAFG I
Sbjct: 667  DRSHSHATDIYAKLEKLTGKMREMGYQAETVPALHDVEEEEKELAVNVHSEKLAIAFGLI 726

Query: 2337 ASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHFKDGVCSCGDYW 2507
             +EPG EIRIIKNLRVCLDCH  TKF+S IT+R+IVVRDANRFHHFKDG+CSCGDYW
Sbjct: 727  TTEPGNEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKDGICSCGDYW 783


>gb|EOY31480.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 801

 Score =  949 bits (2452), Expect = 0.0
 Identities = 488/795 (61%), Positives = 590/795 (74%), Gaps = 3/795 (0%)
 Frame = +3

Query: 132  QMICRAIAA--AAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLF 305
            QM  ++IA+  +  R  NFF +LL K +TL QL QTHAQLILNG   +L   TKLTH+LF
Sbjct: 14   QMFSKSIASTYSPTRSRNFFLNLLKKSTTLPQLTQTHAQLILNGFRNDLSTITKLTHRLF 73

Query: 306  DLKSLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKF 485
            DL + S A+ +F  + +    DLF +NV I+GFS  +S    +SL+ HLR+ T L PD F
Sbjct: 74   DLNATSYARDVFLSIPNP---DLFLFNVLIKGFSNTHS----ISLYTHLRKCTRLNPDNF 126

Query: 486  TYAFVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEI 665
            TYAF +++ S+   E VG+ L+   +V G   DLFVG+A+VD Y    RV  A KVFD++
Sbjct: 127  TYAFAIASASTLSDEKVGMFLYEHAVVDGYGFDLFVGTAVVDFYFKIWRVELARKVFDKM 186

Query: 666  SEPDTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXR-TG 842
             E DTVLWN+M+SGLVKNCCF D I++FR M+ +                        +G
Sbjct: 187  PERDTVLWNSMISGLVKNCCFEDAIRVFRDMLEDGGIRLDSTSVAAVLPAFSELQELISG 246

Query: 843  MIFHSLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCN 1022
            M    LA+K G  SH YVLT LIS+YSK  ++  A+ LF ++  PDL+S NAMISG++ N
Sbjct: 247  MEVQCLALKLGFHSHVYVLTGLISLYSKGGEIEAAKLLFGEIGRPDLVSCNAMISGYTSN 306

Query: 1023 NKTESAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSAS 1202
             ++E +VRLF  LL  G+KVNS+T+VGLIPV  PFG+L LT  IH F +K   VS  S S
Sbjct: 307  GESECSVRLFKQLLGSGEKVNSSTIVGLIPVLSPFGYLNLTNCIHSFCVKYGFVSQSSVS 366

Query: 1203 TALITVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIH 1382
            TAL T YSRLNE E AR+LFDES  K+ ASWNAMISGY QNG TE AI LF+EMQ   + 
Sbjct: 367  TALTTAYSRLNEIESARQLFDESSEKTPASWNAMISGYTQNGLTEAAISLFQEMQMSKVG 426

Query: 1383 PNPVTITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGL 1562
            PNPVT+TSILSACAQLGA SLGKWVH L+K ++F+SN+YVSTALIDMYAKCGSI EAR L
Sbjct: 427  PNPVTLTSILSACAQLGALSLGKWVHGLVKSKSFDSNIYVSTALIDMYAKCGSIREARQL 486

Query: 1563 FDTIAEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLV 1742
            FD +  KNVVTWNAMISGYGLHG+G +A+ LF +ML SG++P  +TFLS+LYACSHAGLV
Sbjct: 487  FDLMLGKNVVTWNAMISGYGLHGQGQDALRLFSEMLHSGVSPNGVTFLSLLYACSHAGLV 546

Query: 1743 EEGEKIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLG 1922
            +EGE+IF SM     FKP  EHYACMVD+LGRAG+LE A +FI +MP++PGPAEWGALLG
Sbjct: 547  KEGEEIFRSMVHANQFKPLAEHYACMVDILGRAGQLEKAFKFIKEMPVEPGPAEWGALLG 606

Query: 1923 ACMIHKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKS 2102
            ACMIHKD  LA  AS++LFE+DP++VG+YVLLSN+YSA+ NYP AASVRQ  KKR LAK 
Sbjct: 607  ACMIHKDKKLAHVASERLFELDPENVGYYVLLSNLYSAERNYPLAASVRQNVKKRMLAKI 666

Query: 2103 PGCTLIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXX 2282
            PGCTLIE+    HVFTS D+SHPQA EI+ MLEKL+ KM+EAGFQTET TALHDV     
Sbjct: 667  PGCTLIEIGETPHVFTSGDRSHPQATEIYAMLEKLIRKMKEAGFQTETDTALHDVEEEEK 726

Query: 2283 XLMVKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANR 2462
             LMV VHSEKLAIAFG + ++PGTEIRI KNLRVC+DCH  TKF+S IT+R+IVVRDANR
Sbjct: 727  ELMVNVHSEKLAIAFGLVVTQPGTEIRIFKNLRVCVDCHTATKFISKITERVIVVRDANR 786

Query: 2463 FHHFKDGVCSCGDYW 2507
            FHHFKDGVCSCGDYW
Sbjct: 787  FHHFKDGVCSCGDYW 801


>ref|XP_004507756.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30700-like
            [Cicer arietinum]
          Length = 783

 Score =  938 bits (2425), Expect = 0.0
 Identities = 471/791 (59%), Positives = 576/791 (72%)
 Frame = +3

Query: 135  MICRAIAAAAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLK 314
            MI  +++    R  N   SL+NK  T   L QTHAQLILNG   +L   TKLT KLFD  
Sbjct: 1    MILNSMSGTLSR--NTLISLINKSLTFPHLAQTHAQLILNGYHFDLATITKLTQKLFDFG 58

Query: 315  SLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYA 494
            +   A+ +F    S    D+F +NV +RGFS N SP  ++SL+ HLRR T L PD FTYA
Sbjct: 59   ATRHARAIF---FSVPKPDIFLFNVLVRGFSLNASPSSSISLYTHLRRNTNLSPDNFTYA 115

Query: 495  FVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEP 674
            F ++A +    +++ +LLH   IV G  +++FVGSALVD+Y   SRV  A KVFD + E 
Sbjct: 116  FAIAACADD--KHL-MLLHAHSIVDGYGNNVFVGSALVDLYCKFSRVGFARKVFDGMLER 172

Query: 675  DTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFH 854
            DTVLWN+M++GLVKNCCF DC+Q+F  MVA                       R GM   
Sbjct: 173  DTVLWNSMINGLVKNCCFDDCVQLFVDMVAEGVRFDSTTVTAVLPAVAELQELRVGMGIQ 232

Query: 855  SLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTE 1034
             LA+K G   +DYV+T LIS+YSKC D  TAR LF  +  PDLI+YNAMISGF+ N + E
Sbjct: 233  CLALKKGFHFYDYVMTGLISLYSKCGDTKTARLLFGMIGKPDLIAYNAMISGFTSNGENE 292

Query: 1035 SAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALI 1214
             +V+LF  LL+ G+KV+S+T+VGLIP+  PFGHL L C IHGF +KS ++  P+ STA  
Sbjct: 293  CSVKLFRELLVSGEKVSSSTIVGLIPLPSPFGHLHLACLIHGFCLKSGIILNPTVSTAFT 352

Query: 1215 TVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPV 1394
             +Y++LNE + AR+LFDESP K++ +WNAMISGY QNG TETAI LF+EM   +  PN V
Sbjct: 353  AIYNKLNEIDMARQLFDESPEKTVVAWNAMISGYTQNGLTETAISLFQEMMKTEFTPNAV 412

Query: 1395 TITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTI 1574
            TIT+ILSACAQLG+ S GKWVH LIK +N E N+YVSTALIDMYAKCG+I +AR LFD +
Sbjct: 413  TITTILSACAQLGSLSFGKWVHQLIKSKNVEPNIYVSTALIDMYAKCGNILDARQLFDLM 472

Query: 1575 AEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGE 1754
             EKN VTWN MI GYGLHG G+EA+ LF +ML  G  P+ +TFLSVLYACSHAGLV EGE
Sbjct: 473  NEKNTVTWNTMIFGYGLHGYGHEALKLFNEMLHLGFNPSAVTFLSVLYACSHAGLVGEGE 532

Query: 1755 KIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMI 1934
            +IFH M  ++  +P  EHYACMVD+LGR+G+LE ALEFI  MP++PGPA WG LLGACMI
Sbjct: 533  EIFHDMVNKYRIEPLVEHYACMVDILGRSGQLEKALEFIRAMPVEPGPAIWGTLLGACMI 592

Query: 1935 HKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCT 2114
            HKDTN+A+ AS++LFE+DP SVG+YVLLSNIYS + N+P+AAS+RQ+ KKR LAKSPGCT
Sbjct: 593  HKDTNIARLASERLFELDPGSVGYYVLLSNIYSVERNFPKAASIRQVVKKRKLAKSPGCT 652

Query: 2115 LIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMV 2294
            LIE+NG  HVF S D+ H  +  I+  LEKL+ KMRE G+Q+ETVTALHDV      L  
Sbjct: 653  LIEVNGTPHVFVSGDRCHSHSTAIYAELEKLVAKMREIGYQSETVTALHDVEEEEKELTF 712

Query: 2295 KVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHF 2474
             VHSEKLAIAFG I +EPGTEIRIIKNLRVCLDCH  TKF+S IT+R+IVVRDANRFHHF
Sbjct: 713  NVHSEKLAIAFGLITTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHF 772

Query: 2475 KDGVCSCGDYW 2507
            KDG+CSCGDYW
Sbjct: 773  KDGICSCGDYW 783


>gb|ESW26860.1| hypothetical protein PHAVU_003G154400g [Phaseolus vulgaris]
          Length = 778

 Score =  932 bits (2409), Expect = 0.0
 Identities = 469/777 (60%), Positives = 572/777 (73%)
 Frame = +3

Query: 177  NFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLKSLSQAKLLFKCLSS 356
            N   +L+NK  T   L +THAQLI NG   +L   TKLT KLFD+ +   A+ LF    S
Sbjct: 9    NTLLALINKACTFPHLAETHAQLIRNGYHHDLATVTKLTQKLFDVGAAHHARALF---FS 65

Query: 357  TVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAFVVSAISSSGLEYV 536
                D+F +NV I+GFS   +   +VSL+ HLR+ T L PD FTYAF V+A     L   
Sbjct: 66   VPKPDIFLFNVVIKGFSFFPNA-SSVSLYTHLRKNTSLSPDNFTYAFAVAASPDDKL--- 121

Query: 537  GILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVLWNTMLSGLVK 716
            G+ LH   ++ G  S+LFV SALVD+Y   SRV  A KVFD++ E DTVLWNTM++GLV+
Sbjct: 122  GMCLHTHAVIDGFDSNLFVASALVDLYCKFSRVGYARKVFDKMLERDTVLWNTMITGLVR 181

Query: 717  NCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFHSLAVKTGCQSHDYV 896
            NCC+ D +Q+FR MVA                       + GM    LA+K G    DYV
Sbjct: 182  NCCYDDSVQVFRDMVAQGVQLDSTTVATVLPAVAEMEEGKVGMGIQCLALKLGFHFDDYV 241

Query: 897  LTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAVRLFNALLLLGD 1076
            LT LIS++SKC DV TA+ LF  ++ PDL+SYNAMISGFSCN +TES V+LF  LL+ G+
Sbjct: 242  LTGLISVFSKCGDVDTAKLLFGMIKKPDLVSYNAMISGFSCNGETESGVKLFRELLVSGE 301

Query: 1077 KVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVYSRLNEFEFARK 1256
            +V+S+T+VGLIPVS PFGHL L C IHGF +K  ++  PS STAL T+YSRLNE + AR+
Sbjct: 302  RVSSSTMVGLIPVSSPFGHLHLACCIHGFCVKLGIILHPSLSTALTTIYSRLNEIDLARR 361

Query: 1257 LFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITSILSACAQLGA 1436
            LFDES  K++A+WNAMISGY QNG TETAI LF+EM A +  PNPVTITSILSACAQLGA
Sbjct: 362  LFDESSEKTVAAWNAMISGYTQNGLTETAIALFQEMMATEFIPNPVTITSILSACAQLGA 421

Query: 1437 PSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEKNVVTWNAMISG 1616
             S GKWVH LI+ +N E N+YV TALIDMYAKCG+I EA  LFD+++EKN VTWN MI G
Sbjct: 422  LSFGKWVHQLIRSKNLEPNIYVLTALIDMYAKCGNILEAWQLFDSMSEKNTVTWNTMIFG 481

Query: 1617 YGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIFHSMTREFNFKP 1796
            YGLHG G+EA+ LF +ML  G  P+ +TFLS+LYACSH+GLV EG++IF++M  ++   P
Sbjct: 482  YGLHGYGHEALQLFNEMLELGFQPSSVTFLSILYACSHSGLVREGDEIFNAMVNKYRIVP 541

Query: 1797 SPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKDTNLAQFASDKL 1976
              EHYACMVD+LGRAG+LE ALEFI +MP++PGPA WG LLGACMIHKDT +A+ AS++L
Sbjct: 542  LAEHYACMVDILGRAGQLEKALEFIRRMPVEPGPAVWGTLLGACMIHKDTKIARMASERL 601

Query: 1977 FEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCTLIELNGILHVFTSS 2156
            FE+DP SVG+YVLLSNIYS + N+P+AASVR++ KKR L+K+PGCTLIE+NG  HVF S 
Sbjct: 602  FELDPGSVGYYVLLSNIYSVERNFPKAASVREVVKKRKLSKTPGCTLIEVNGSPHVFVSG 661

Query: 2157 DQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMVKVHSEKLAIAFGFI 2336
            D+SH Q   I+  LEKL  KMRE G+++ETVTALHDV      LM  VHSEKLAIAFG I
Sbjct: 662  DRSHSQTVAIYAKLEKLTSKMREMGYKSETVTALHDVEEEEKELMFNVHSEKLAIAFGLI 721

Query: 2337 ASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHFKDGVCSCGDYW 2507
             +EPGTEIRIIKNLRVCLDCH  TKF+S IT+R+IVVRDANRFHHFKDG CSCGDYW
Sbjct: 722  TTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKDGSCSCGDYW 778


>ref|XP_003550682.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30700-like
            [Glycine max]
          Length = 778

 Score =  929 bits (2400), Expect = 0.0
 Identities = 467/777 (60%), Positives = 566/777 (72%)
 Frame = +3

Query: 177  NFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLKSLSQAKLLFKCLSS 356
            N   +L++K  T   L +THAQLI NG   +L   TKLT KLFD+ +   A+ LF    S
Sbjct: 9    NTLLALISKACTFPHLAETHAQLIRNGYQHDLATVTKLTQKLFDVGATRHARALF---FS 65

Query: 357  TVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAFVVSAISSSGLEYV 536
                D+F +NV I+GFS +     ++S + HL + T L PD FTYAF +SA     L   
Sbjct: 66   VPKPDIFLFNVLIKGFSFSPDA-SSISFYTHLLKNTTLSPDNFTYAFAISASPDDNL--- 121

Query: 537  GILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVLWNTMLSGLVK 716
            G+ LH   +V G  S+LFV SALVD+Y   SRV  A KVFD++ + DTVLWNTM++GLV+
Sbjct: 122  GMCLHAHAVVDGFDSNLFVASALVDLYCKFSRVAYARKVFDKMPDRDTVLWNTMITGLVR 181

Query: 717  NCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFHSLAVKTGCQSHDYV 896
            NCC+ D +Q+F+ MVA                       + GM    LA+K G    DYV
Sbjct: 182  NCCYDDSVQVFKDMVAQGVRLDSTTVATVLPAVAEMQEVKVGMGIQCLALKLGFHFDDYV 241

Query: 897  LTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAVRLFNALLLLGD 1076
            LT LIS++SKC DV TAR LF  +  PDL+SYNA+ISGFSCN +TE AV+ F  LL+ G 
Sbjct: 242  LTGLISVFSKCEDVDTARLLFGMIRKPDLVSYNALISGFSCNGETECAVKYFRELLVSGQ 301

Query: 1077 KVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVYSRLNEFEFARK 1256
            +V+S+T+VGLIPVS PFGHL L C I GF +KS  +  PS STAL T+YSRLNE + AR+
Sbjct: 302  RVSSSTMVGLIPVSSPFGHLHLACCIQGFCVKSGTILQPSVSTALTTIYSRLNEIDLARQ 361

Query: 1257 LFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITSILSACAQLGA 1436
            LFDES  K++A+WNAMISGYAQ+G TE AI LF+EM   +  PNPVTITSILSACAQLGA
Sbjct: 362  LFDESSEKTVAAWNAMISGYAQSGLTEMAISLFQEMMTTEFTPNPVTITSILSACAQLGA 421

Query: 1437 PSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEKNVVTWNAMISG 1616
             S GK VH LIK +N E N+YVSTALIDMYAKCG+I EA  LFD  +EKN VTWN MI G
Sbjct: 422  LSFGKSVHQLIKSKNLEQNIYVSTALIDMYAKCGNISEASQLFDLTSEKNTVTWNTMIFG 481

Query: 1617 YGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIFHSMTREFNFKP 1796
            YGLHG G+EA+ LF +ML  G  P+ +TFLSVLYACSHAGLV EG++IFH+M  ++  +P
Sbjct: 482  YGLHGYGDEALKLFNEMLHLGFQPSSVTFLSVLYACSHAGLVREGDEIFHAMVNKYRIEP 541

Query: 1797 SPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKDTNLAQFASDKL 1976
              EHYACMVD+LGRAG+LE ALEFI +MP++PGPA WG LLGACMIHKDTNLA+ AS++L
Sbjct: 542  LAEHYACMVDILGRAGQLEKALEFIRKMPVEPGPAVWGTLLGACMIHKDTNLARVASERL 601

Query: 1977 FEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCTLIELNGILHVFTSS 2156
            FE+DP +VG+YVLLSNIYS + N+P+AASVR+  KKRNL+K+PGCTLIE+NG  HVF   
Sbjct: 602  FELDPGNVGYYVLLSNIYSVERNFPKAASVREAVKKRNLSKTPGCTLIEVNGTPHVFVCG 661

Query: 2157 DQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMVKVHSEKLAIAFGFI 2336
            D+SH Q   I+  LE+L GKMRE G+Q+ETVTALHDV      LM  VHSEKLAIAFG I
Sbjct: 662  DRSHSQTTSIYAKLEELTGKMREMGYQSETVTALHDVEEEEKELMFNVHSEKLAIAFGLI 721

Query: 2337 ASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHFKDGVCSCGDYW 2507
             +EPGTEIRIIKNLRVCLDCH  TKF+S IT+R+IVVRDANRFHHFKDG+CSCGDYW
Sbjct: 722  TTEPGTEIRIIKNLRVCLDCHAATKFISKITERVIVVRDANRFHHFKDGICSCGDYW 778


>gb|EPS60782.1| hypothetical protein M569_14020, partial [Genlisea aurea]
          Length = 774

 Score =  919 bits (2374), Expect = 0.0
 Identities = 470/779 (60%), Positives = 574/779 (73%), Gaps = 4/779 (0%)
 Frame = +3

Query: 183  FSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLKSLSQAKLLFKCLSSTV 362
            F SL+ K  T+S +NQ HAQ+I +GLS + II T +  KL D K+ +QAK LF    S  
Sbjct: 1    FGSLIKKAVTISHINQCHAQIIQSGLSGDCIIVTGIIQKLLDFKAATQAKRLFAGFRSP- 59

Query: 363  PLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAFVVSAIS---SSGLEY 533
              DLF YN  I+G    + PLD+++ +  L R T  KPD FT++ +V+A +   S G+E 
Sbjct: 60   --DLFLYNALIKGLK--DDPLDSLNAYTALLRVTPFKPDNFTFSSIVAAFAGFPSPGMEK 115

Query: 534  VGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVLWNTMLSGLV 713
            +G L+HG  ++SG  S++FVGSALVDMYM  S +R A KVFD I EPDTVLWNTMLSGLV
Sbjct: 116  LGRLIHGHAVISGFGSEVFVGSALVDMYMSFSWIRHARKVFDGIPEPDTVLWNTMLSGLV 175

Query: 714  KNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFHSLAVKTGCQSHDY 893
            KN  F D I +F  MV ++                      +GM+  +LA+K GC  +D+
Sbjct: 176  KNSYFDDAIHVFHDMVLSNVGFDSTTLAVVLSCLAELRQVSSGMMVLALALKFGCDFNDH 235

Query: 894  VLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAVRLFNALLLLG 1073
            ++T L+ +YSKC D+S AR +F     PDL++YNAMISG SCNN+TESAV LF+ LL  G
Sbjct: 236  LITGLVKLYSKCGDISKARSMFGLFINPDLVAYNAMISGLSCNNETESAVHLFHGLLSSG 295

Query: 1074 DKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVYSRLNEFEFAR 1253
             +V+S+T+VGLIP   P+GHL+LT SIHGF +KS  VS  S STAL++VYSRLNE   AR
Sbjct: 296  CRVSSSTIVGLIPTCHPYGHLELTQSIHGFCMKSCFVSHCSVSTALMSVYSRLNELNLAR 355

Query: 1254 KLFDESP-HKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITSILSACAQL 1430
            ++FDES   K+LASWNAMISGYAQNG TE A+ LFREMQ LDIHPNPVTI+SILSACAQL
Sbjct: 356  QIFDESSAKKNLASWNAMISGYAQNGQTEMAVSLFREMQKLDIHPNPVTISSILSACAQL 415

Query: 1431 GAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEKNVVTWNAMI 1610
            G  SLGKWVHDL K ENFESN++VSTALIDMYAKCGSIEEAR  FD + EKNVVTWNAMI
Sbjct: 416  GMLSLGKWVHDLAKSENFESNIFVSTALIDMYAKCGSIEEARIFFDAMKEKNVVTWNAMI 475

Query: 1611 SGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIFHSMTREFNF 1790
            S YGLHG GNEA+ L+  M  SGI+PT +TFLS+L+ACSHAGLVEEGE+IF SM  +  F
Sbjct: 476  SAYGLHGCGNEALRLYDDMAISGISPTGVTFLSILHACSHAGLVEEGERIFSSMVDDHGF 535

Query: 1791 KPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKDTNLAQFASD 1970
            +P+ EHYAC+VD+ GRAGKL+ A  FIN MPI PGP EWGALLGACMIHKD ++A  AS 
Sbjct: 536  EPTSEHYACLVDIFGRAGKLQKAFNFINNMPIVPGPGEWGALLGACMIHKDIDIAHVASR 595

Query: 1971 KLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCTLIELNGILHVFT 2150
            KL  +D +S GH+VLLSN+YS   +YPQAAS+R+  KK+NL K+PGC+LIE+ G  HVF 
Sbjct: 596  KLIALDHESAGHHVLLSNLYSISQSYPQAASLRETIKKKNLMKTPGCSLIEVKGHTHVFK 655

Query: 2151 SSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMVKVHSEKLAIAFG 2330
            S+D SHPQ   I+  LE L+ KM+++G+  ET+TALHDV      L VKVHSEKLAIAFG
Sbjct: 656  SNDLSHPQLQSIYSELEMLVWKMKDSGYIVETLTALHDVEDEEKELTVKVHSEKLAIAFG 715

Query: 2331 FIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHFKDGVCSCGDYW 2507
             + S    +IRIIKNLRVCLDCHNFTKFVS IT+R I+VRDANRFHHF++G CSC DYW
Sbjct: 716  ILMSGEKEDIRIIKNLRVCLDCHNFTKFVSKITERAIIVRDANRFHHFRNGACSCRDYW 774


>ref|XP_002867333.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297313169|gb|EFH43592.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 792

 Score =  919 bits (2374), Expect = 0.0
 Identities = 472/798 (59%), Positives = 577/798 (72%), Gaps = 7/798 (0%)
 Frame = +3

Query: 135  MICRAIAAAAQR------DGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTH 296
            M+ R ++AA           N F  L  + +++S L QTHAQ++L+G   ++ + TKLT 
Sbjct: 1    MLLRTVSAATAETTAALISKNNFLDLFKRSTSISHLAQTHAQIVLHGFRNDISLLTKLTQ 60

Query: 297  KLFDLKSLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKP 476
            +L DL ++  A+ +F  LS   P D+F +NV +RGFS N SP  ++++F HLR+ T LKP
Sbjct: 61   RLSDLGAIYYARDIF--LSVQRP-DVFLFNVLMRGFSVNESPHSSLAVFAHLRKSTDLKP 117

Query: 477  DKFTYAFVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVF 656
            +  TYAF +SA S    +  G ++HGQ IV GC S+L +GS +V MY    RV  A KVF
Sbjct: 118  NSSTYAFAISAASGFRDDRAGCVIHGQAIVDGCDSELLLGSNIVKMYFKFWRVEDARKVF 177

Query: 657  DEISEPDTVLWNTMLSGLVKNCCFYDCIQIFRHMVAND-RXXXXXXXXXXXXXXXXXXXX 833
            D + E DT+LWNTM+SG  KN  + + IQ+FR ++                         
Sbjct: 178  DRMPEKDTILWNTMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQEL 237

Query: 834  RTGMIFHSLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGF 1013
            R GM  HSLA KTGC SHDYVLT  IS+YSKC  +  A  LF +   PD+++YNAMI G+
Sbjct: 238  RLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIKMASTLFREFRRPDIVAYNAMIHGY 297

Query: 1014 SCNNKTESAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGP 1193
            + N +TE ++ LF  L+L G K+ S+T+V L+PVS   GHL L  +IHG+S+KS+ +S  
Sbjct: 298  TSNGETELSLSLFKELMLSGAKLKSSTLVSLVPVS---GHLMLIYAIHGYSLKSNFLSHT 354

Query: 1194 SASTALITVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQAL 1373
            S STAL TVYS+LNE E ARKLFDESP KSL SWNAMISGY QNG TE AI LFREMQ  
Sbjct: 355  SVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQNS 414

Query: 1374 DIHPNPVTITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEA 1553
            +  PNPVTIT ILSACAQLGA SLGKWVHDL++  +FES++YVSTALI MYAKCGSI EA
Sbjct: 415  EFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEA 474

Query: 1554 RGLFDTIAEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHA 1733
            R LFD + +KN VTWN MISGYGLHG G EA+ +F +ML SGI PT +TFL VLYACSHA
Sbjct: 475  RRLFDFMPKKNEVTWNTMISGYGLHGHGQEALTIFSEMLNSGIAPTPVTFLCVLYACSHA 534

Query: 1734 GLVEEGEKIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGA 1913
            GLV+EG++IF+SM   + F+PS +HYAC+VD+LGRAG L+ AL+FI  MPI+PGP+ W  
Sbjct: 535  GLVKEGDEIFNSMIHRYGFEPSVKHYACVVDILGRAGHLQRALQFIEAMPIQPGPSVWET 594

Query: 1914 LLGACMIHKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNL 2093
            LLGAC IHKDTNLA+  S+KLFE+DP +VG++VLLSNI+SAD NYPQAA+VRQ AKKR L
Sbjct: 595  LLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKL 654

Query: 2094 AKSPGCTLIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXX 2273
            AK+PG TLIE+    HVFTS DQSHPQ   I E LEKL GKMREAG+Q ET  ALHDV  
Sbjct: 655  AKAPGYTLIEIGETPHVFTSGDQSHPQVKAIHEKLEKLEGKMREAGYQPETELALHDVEE 714

Query: 2274 XXXXLMVKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRD 2453
                LMVKVHSE+LAIAFG IA+EPGTEIRIIKNLRVCLDCH  TK +S IT+R+IVVRD
Sbjct: 715  EERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTATKLISKITERVIVVRD 774

Query: 2454 ANRFHHFKDGVCSCGDYW 2507
            ANRFHHFKDGVCSCGDYW
Sbjct: 775  ANRFHHFKDGVCSCGDYW 792


>ref|NP_194799.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75208664|sp|Q9SUH6.1|PP341_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g30700; AltName: Full=Protein DYW9
            gi|5725434|emb|CAB52443.1| putative protein [Arabidopsis
            thaliana] gi|7269971|emb|CAB79788.1| putative protein
            [Arabidopsis thaliana] gi|332660398|gb|AEE85798.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 792

 Score =  917 bits (2371), Expect = 0.0
 Identities = 469/788 (59%), Positives = 573/788 (72%), Gaps = 1/788 (0%)
 Frame = +3

Query: 147  AIAAAAQRDGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKLFDLKSLSQ 326
            A   AA    N +     + +++S L QTHAQ+IL+G   ++ + TKLT +L DL ++  
Sbjct: 11   AETTAALISKNTYLDFFKRSTSISHLAQTHAQIILHGFRNDISLLTKLTQRLSDLGAIYY 70

Query: 327  AKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAFVVS 506
            A+ +F  LS   P D+F +NV +RGFS N SP  ++S+F HLR+ T LKP+  TYAF +S
Sbjct: 71   ARDIF--LSVQRP-DVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAIS 127

Query: 507  AISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVL 686
            A S    +  G ++HGQ +V GC S+L +GS +V MY    RV  A KVFD + E DT+L
Sbjct: 128  AASGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTIL 187

Query: 687  WNTMLSGLVKNCCFYDCIQIFRHMVAND-RXXXXXXXXXXXXXXXXXXXXRTGMIFHSLA 863
            WNTM+SG  KN  + + IQ+FR ++                         R GM  HSLA
Sbjct: 188  WNTMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLA 247

Query: 864  VKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAV 1043
             KTGC SHDYVLT  IS+YSKC  +     LF +   PD+++YNAMI G++ N +TE ++
Sbjct: 248  TKTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSL 307

Query: 1044 RLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVY 1223
             LF  L+L G ++ S+T+V L+PVS   GHL L  +IHG+ +KS+ +S  S STAL TVY
Sbjct: 308  SLFKELMLSGARLRSSTLVSLVPVS---GHLMLIYAIHGYCLKSNFLSHASVSTALTTVY 364

Query: 1224 SRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTIT 1403
            S+LNE E ARKLFDESP KSL SWNAMISGY QNG TE AI LFREMQ  +  PNPVTIT
Sbjct: 365  SKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTIT 424

Query: 1404 SILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEK 1583
             ILSACAQLGA SLGKWVHDL++  +FES++YVSTALI MYAKCGSI EAR LFD + +K
Sbjct: 425  CILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKK 484

Query: 1584 NVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIF 1763
            N VTWN MISGYGLHG+G EA+ +FY+ML SGITPT +TFL VLYACSHAGLV+EG++IF
Sbjct: 485  NEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIF 544

Query: 1764 HSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKD 1943
            +SM   + F+PS +HYACMVD+LGRAG L+ AL+FI  M I+PG + W  LLGAC IHKD
Sbjct: 545  NSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKD 604

Query: 1944 TNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPGCTLIE 2123
            TNLA+  S+KLFE+DP +VG++VLLSNI+SAD NYPQAA+VRQ AKKR LAK+PG TLIE
Sbjct: 605  TNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIE 664

Query: 2124 LNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXLMVKVH 2303
            +    HVFTS DQSHPQ  EI+E LEKL GKMREAG+Q ET  ALHDV      LMVKVH
Sbjct: 665  IGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELMVKVH 724

Query: 2304 SEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFHHFKDG 2483
            SE+LAIAFG IA+EPGTEIRIIKNLRVCLDCH  TK +S IT+R+IVVRDANRFHHFKDG
Sbjct: 725  SERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDG 784

Query: 2484 VCSCGDYW 2507
            VCSCGDYW
Sbjct: 785  VCSCGDYW 792


>ref|XP_006285932.1| hypothetical protein CARUB_v10007444mg [Capsella rubella]
            gi|482554637|gb|EOA18830.1| hypothetical protein
            CARUB_v10007444mg [Capsella rubella]
          Length = 790

 Score =  908 bits (2346), Expect = 0.0
 Identities = 466/796 (58%), Positives = 577/796 (72%), Gaps = 5/796 (0%)
 Frame = +3

Query: 135  MICRAIAAAAQR----DGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTHKL 302
            M+ R ++AA         N F  L  + ++++ L QTHAQ+I++G   ++ + TKLT +L
Sbjct: 1    MLLRTVSAATAETTVASKNNFLDLFKRSTSVAHLAQTHAQVIVHGFRYDISLLTKLTQRL 60

Query: 303  FDLKSLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDK 482
             DL ++  A+ LF  LS   P D+F +NV +RGFS N SP  ++S+F HLR+ T LKP+ 
Sbjct: 61   SDLGAIYYARDLF--LSVRRP-DVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTELKPNS 117

Query: 483  FTYAFVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDE 662
             TYAF +SA S    E  G ++HGQ +V GC S+L +GS +V MY    R  +A KVFD 
Sbjct: 118  STYAFAISAASGLRDERPGCVIHGQAVVDGCDSELLLGSNIVKMYFKFLRAGNARKVFDR 177

Query: 663  ISEPDTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRT- 839
            + E DTVLWNTM+SG  KN  + + IQ+FR ++++                       T 
Sbjct: 178  MPEKDTVLWNTMISGYRKNEMYEEAIQVFRDLISDSCIRLDTTTLLDILPAVAELQGLTL 237

Query: 840  GMIFHSLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSC 1019
            GM  HSLA KTGC SH+YVLT  IS+YSKC  +  A  LF +   PD+++YNAMI G++ 
Sbjct: 238  GMQIHSLATKTGCYSHNYVLTGFISLYSKCGKIKMATTLFREFHKPDVVAYNAMIHGYTS 297

Query: 1020 NNKTESAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSA 1199
            N +T  ++ LF  L+L G ++NS+T++ LIPVS   GHL L  +IHG+S+KS+ +S  S 
Sbjct: 298  NGETNLSLSLFKELVLSGQRLNSSTLMSLIPVS---GHLMLIYAIHGYSLKSNFLSHTSV 354

Query: 1200 STALITVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDI 1379
            ST+L TVYS+LNE E ARKLFDESP KSL SWNAMISGY QNG TE AI LFR MQ  + 
Sbjct: 355  STSLTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFRRMQKSEF 414

Query: 1380 HPNPVTITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARG 1559
             PNP TIT ILSACAQLG  SLGKWVHDL++  +FES++YVSTALI MYAKCGSI EAR 
Sbjct: 415  SPNPTTITCILSACAQLGVLSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARR 474

Query: 1560 LFDTIAEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGL 1739
            LFD +  KN VTWN MISGYGLHG G EA+ +F +ML SGI PT +TFL VLYACSHAGL
Sbjct: 475  LFDLMPRKNEVTWNTMISGYGLHGHGQEALNIFSEMLNSGILPTPVTFLCVLYACSHAGL 534

Query: 1740 VEEGEKIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALL 1919
            V+EG++IF+SM   + F+PS +HYAC+VD+LGRAG L+ AL+FI  MPI+PGP+ W  LL
Sbjct: 535  VKEGDEIFNSMIHRYGFEPSVKHYACVVDILGRAGHLQRALQFIEAMPIEPGPSVWETLL 594

Query: 1920 GACMIHKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAK 2099
            GAC IHKDTNLA+  S+KLFE+DP +VG++VLLSNI+SAD NYPQAA+VRQ AKKR LAK
Sbjct: 595  GACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAK 654

Query: 2100 SPGCTLIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXX 2279
            +PG TLIE+  + HVFTS DQSHPQ   I+E LEKL GKMREAG+Q ET  ALHDV    
Sbjct: 655  APGYTLIEIGEMPHVFTSGDQSHPQVKAIYERLEKLEGKMREAGYQPETELALHDVEEEE 714

Query: 2280 XXLMVKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDAN 2459
              LMVKVHSE+LAIAFG IA+EPGTEIRI+KNLRVCLDCH  TK +S IT+R+IVVRDAN
Sbjct: 715  RELMVKVHSERLAIAFGLIATEPGTEIRIMKNLRVCLDCHTATKLISKITERVIVVRDAN 774

Query: 2460 RFHHFKDGVCSCGDYW 2507
            RFHHFKDGVCSCGDYW
Sbjct: 775  RFHHFKDGVCSCGDYW 790


>ref|XP_006412675.1| hypothetical protein EUTSA_v10024455mg [Eutrema salsugineum]
            gi|557113845|gb|ESQ54128.1| hypothetical protein
            EUTSA_v10024455mg [Eutrema salsugineum]
          Length = 790

 Score =  886 bits (2290), Expect = 0.0
 Identities = 467/798 (58%), Positives = 565/798 (70%), Gaps = 7/798 (0%)
 Frame = +3

Query: 135  MICRAIAAAAQR------DGNFFSSLLNKVSTLSQLNQTHAQLILNGLSKNLIINTKLTH 296
            M+ R ++AA         + N F  L  +++ LS LNQTHAQ+IL+G   N+ + TKLT 
Sbjct: 1    MLLRTVSAATAETTVAVINKNNFFDLFKRLTCLSHLNQTHAQIILHG--NNIELLTKLTQ 58

Query: 297  KLFDLKSLSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKP 476
            +L DL ++  A+ LF  LS   P D F +NV IRGFS N  P  ++SLF  LR+ T LKP
Sbjct: 59   RLSDLGAIHYARDLF--LSVQRP-DEFLFNVLIRGFSNNKLPHSSLSLFALLRKSTDLKP 115

Query: 477  DKFTYAFVVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVF 656
            +  TY   +SA S    E  G ++H Q +V G  S+L VGS  V MY   SRV  A KVF
Sbjct: 116  NSSTYTCAISAASCFRDERPGRVIHSQAVVDGFDSELHVGSNFVKMYFKFSRVDDARKVF 175

Query: 657  DEISEPDTVLWNTMLSGLVKNCCFYDCIQIFRHMVAND-RXXXXXXXXXXXXXXXXXXXX 833
            D + E D VLWNTMLSG  +N  + + IQIFR M+                         
Sbjct: 176  DRMPEKDAVLWNTMLSGYRENEMYEESIQIFRDMINESCTRLDSTTVLNILPAVAELQEL 235

Query: 834  RTGMIFHSLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGF 1013
            R G+   SLA KTGC SHD+VLT  IS++SKC        LF +   PD+++YNAMI G+
Sbjct: 236  RLGLQILSLATKTGCYSHDFVLTGFISVFSKCGKTEVLSTLFREFRIPDVVAYNAMIHGY 295

Query: 1014 SCNNKTESAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGP 1193
            + N +TE ++ LF  L+L G ++NS+T+V LIPVS   GHL L  +IHG+S+KS  +   
Sbjct: 296  TSNGETELSLSLFKELVLSGTRLNSSTLVSLIPVS---GHLMLIYAIHGYSLKSGFLFHE 352

Query: 1194 SASTALITVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQAL 1373
            S  TAL TVY +LNE E ARK+FDESP KSLA+WNAMISGY QNG TE AI LFREMQ  
Sbjct: 353  SVPTALTTVYCKLNEMESARKIFDESPDKSLATWNAMISGYTQNGLTEDAISLFREMQKS 412

Query: 1374 DIHPNPVTITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEA 1553
            +  PNP+TIT ILSACAQLG  SLGKWVH L++   FES++YVSTALI MYAKCGSI EA
Sbjct: 413  EFSPNPITITCILSACAQLGTLSLGKWVHGLVRSTGFESSIYVSTALIGMYAKCGSIAEA 472

Query: 1554 RGLFDTIAEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHA 1733
            R LFD + +KN VTWN MISGYGLHG+G+EA+ +F +ML SGI PT +TFL VLYACSHA
Sbjct: 473  RRLFDLMPKKNEVTWNTMISGYGLHGQGHEALNIFSEMLNSGIAPTPVTFLCVLYACSHA 532

Query: 1734 GLVEEGEKIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGA 1913
            GLV+EG++IF+SM   + F+PS +HYACMVD+LGRAG L+ AL+FI  MPI+P P+ W  
Sbjct: 533  GLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMPIEPDPSVWQT 592

Query: 1914 LLGACMIHKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNL 2093
            LLGAC IHKDTNLA+  S+KLFE+DP +VG++VLLSNI+SAD NYPQAA+VRQ AKKR L
Sbjct: 593  LLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKL 652

Query: 2094 AKSPGCTLIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXX 2273
            AK+PG TLIE+    HVFTS DQSHPQ   I+E LEKL GKMREAG+Q ET  ALHDV  
Sbjct: 653  AKAPGYTLIEIGETPHVFTSGDQSHPQVKAIYEKLEKLEGKMREAGYQPETELALHDVEE 712

Query: 2274 XXXXLMVKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRD 2453
                LMVKVHSE+LAIAFG IA+EPGTEIRIIKNLRVCLDCH  TK +S IT+R+IVVRD
Sbjct: 713  EERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRD 772

Query: 2454 ANRFHHFKDGVCSCGDYW 2507
            ANRFHHFKDGVCSCGDYW
Sbjct: 773  ANRFHHFKDGVCSCGDYW 790


>gb|EMJ05798.1| hypothetical protein PRUPE_ppa002987mg [Prunus persica]
          Length = 614

 Score =  840 bits (2169), Expect = 0.0
 Identities = 413/613 (67%), Positives = 481/613 (78%)
 Frame = +3

Query: 669  EPDTVLWNTMLSGLVKNCCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMI 848
            E DTVLWNTM+SGLV+NC + D ++IFR MV                        + GM 
Sbjct: 3    EKDTVLWNTMISGLVRNCYYADSMRIFRDMVVGGTAFDSTTLATELPALAELQELKAGMG 62

Query: 849  FHSLAVKTGCQSHDYVLTSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNK 1028
             H LA+K G  S  +VLT L+S+YSKC ++ TAR LF  +  PDLI YNAMI+G++CNN+
Sbjct: 63   IHCLALKVGFHSDVHVLTGLVSLYSKCKELETARLLFGHITQPDLICYNAMIAGYTCNNE 122

Query: 1029 TESAVRLFNALLLLGDKVNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTA 1208
            T S+V LF  LL  G+KVNS+T+VGLIPVS PFGHL+LT S+  F +KS +VS PS STA
Sbjct: 123  TVSSVSLFRELLASGEKVNSSTIVGLIPVSSPFGHLQLTGSLQTFCVKSGIVSHPSVSTA 182

Query: 1209 LITVYSRLNEFEFARKLFDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPN 1388
             +TVY RLNE E AR+LFDESP K+LASWNAMI+GY QNG TETAI LFREM + +  PN
Sbjct: 183  FVTVYCRLNEIELARQLFDESPEKTLASWNAMIAGYTQNGLTETAISLFREMMS-EFSPN 241

Query: 1389 PVTITSILSACAQLGAPSLGKWVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFD 1568
            PVT+TSILSACAQLGA SLGKWVH LIK +N ESN+YV TAL+DMYAKCGSI EAR LFD
Sbjct: 242  PVTVTSILSACAQLGAISLGKWVHGLIKSKNLESNIYVLTALVDMYAKCGSIVEARKLFD 301

Query: 1569 TIAEKNVVTWNAMISGYGLHGRGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEE 1748
             + EKNVVTWNAMIS YGLHG G+EA+ LF +ML SGI P+ +TFLSVLYACSHAGLV E
Sbjct: 302  LMTEKNVVTWNAMISAYGLHGDGHEALKLFTEMLHSGIQPSGVTFLSVLYACSHAGLVRE 361

Query: 1749 GEKIFHSMTREFNFKPSPEHYACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGAC 1928
            GE++FH M     F+P  EHYACMVD+LGRAGKLE ALEFI +MP++PGPA WGALLGAC
Sbjct: 362  GEEVFHYMVHNHGFEPLAEHYACMVDILGRAGKLEKALEFIKEMPVEPGPAVWGALLGAC 421

Query: 1929 MIHKDTNLAQFASDKLFEMDPKSVGHYVLLSNIYSADHNYPQAASVRQIAKKRNLAKSPG 2108
            MIHK+T LA  AS++LFE+DP++ G+YVLLSNIYSAD N+P+AASVRQ+ K RNLAK+PG
Sbjct: 422  MIHKETELACVASERLFELDPENTGYYVLLSNIYSADRNFPKAASVRQVVKNRNLAKTPG 481

Query: 2109 CTLIELNGILHVFTSSDQSHPQAAEIFEMLEKLMGKMREAGFQTETVTALHDVXXXXXXL 2288
            CTL+E+    HVFT  DQSHP+A EI+ ML+KL GKM EAGFQTETVT LHDV      L
Sbjct: 482  CTLVEIGETPHVFTCGDQSHPRATEIYRMLDKLTGKMMEAGFQTETVTVLHDVEEEEKEL 541

Query: 2289 MVKVHSEKLAIAFGFIASEPGTEIRIIKNLRVCLDCHNFTKFVSMITKRLIVVRDANRFH 2468
            MVKVHSEKLAIAF  I + PGTEIRI KNLRVCLDCHN TKF+S IT+R+IVVRDANRFH
Sbjct: 542  MVKVHSEKLAIAFALIETAPGTEIRIFKNLRVCLDCHNATKFISKITERVIVVRDANRFH 601

Query: 2469 HFKDGVCSCGDYW 2507
            HFKDGVCSCGDYW
Sbjct: 602  HFKDGVCSCGDYW 614



 Score =  163 bits (413), Expect = 3e-37
 Identities = 114/425 (26%), Positives = 194/425 (45%), Gaps = 5/425 (1%)
 Frame = +3

Query: 369  DLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKF---TYAFVVSAISSSGLEYVG 539
            D   +N  I G  RN    D++ +F    R+ ++    F   T A  + A++       G
Sbjct: 5    DTVLWNTMISGLVRNCYYADSMRIF----RDMVVGGTAFDSTTLATELPALAELQELKAG 60

Query: 540  ILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPDTVLWNTMLSGLVKN 719
            + +H   +  G  SD+ V + LV +Y     + +A  +F  I++PD + +N M++G   N
Sbjct: 61   MGIHCLALKVGFHSDVHVLTGLVSLYSKCKELETARLLFGHITQPDLICYNAMIAGYTCN 120

Query: 720  CCFYDCIQIFRHMVANDRXXXXXXXXXXXXXXXXXXXXRTGMIFHSLAVKTGCQSHDYVL 899
                  + +FR ++A+                      +      +  VK+G  SH  V 
Sbjct: 121  NETVSSVSLFRELLASGEKVNSSTIVGLIPVSSPFGHLQLTGSLQTFCVKSGIVSHPSVS 180

Query: 900  TSLISMYSKCADVSTARFLFEQLEYPDLISYNAMISGFSCNNKTESAVRLFNALLLLGDK 1079
            T+ +++Y +  ++  AR LF++     L S+NAMI+G++ N  TE+A+ LF   ++    
Sbjct: 181  TAFVTVYCRLNEIELARQLFDESPEKTLASWNAMIAGYTQNGLTETAISLFRE-MMSEFS 239

Query: 1080 VNSNTVVGLIPVSDPFGHLKLTCSIHGFSIKSSLVSGPSASTALITVYSRLNEFEFARKL 1259
             N  TV  ++      G + L   +HG     +L S     TAL+ +Y++      ARKL
Sbjct: 240  PNPVTVTSILSACAQLGAISLGKWVHGLIKSKNLESNIYVLTALVDMYAKCGSIVEARKL 299

Query: 1260 FDESPHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITSILSACAQLGAP 1439
            FD    K++ +WNAMIS Y  +G    A+ LF EM    I P+ VT  S+L AC+  G  
Sbjct: 300  FDLMTEKNVVTWNAMISAYGLHGDGHEALKLFTEMLHSGIQPSGVTFLSVLYACSHAGLV 359

Query: 1440 SLGKWV-HDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTI-AEKNVVTWNAMIS 1613
              G+ V H ++    FE        ++D+  + G +E+A      +  E     W A++ 
Sbjct: 360  REGEEVFHYMVHNHGFEPLAEHYACMVDILGRAGKLEKALEFIKEMPVEPGPAVWGALLG 419

Query: 1614 GYGLH 1628
               +H
Sbjct: 420  ACMIH 424



 Score = 92.8 bits (229), Expect = 7e-16
 Identities = 68/255 (26%), Positives = 119/255 (46%)
 Frame = +3

Query: 1272 PHKSLASWNAMISGYAQNGFTETAIDLFREMQALDIHPNPVTITSILSACAQLGAPSLGK 1451
            P K    WN MISG  +N +   ++ +FR+M       +  T+ + L A A+L     G 
Sbjct: 2    PEKDTVLWNTMISGLVRNCYYADSMRIFRDMVVGGTAFDSTTLATELPALAELQELKAGM 61

Query: 1452 WVHDLIKQENFESNVYVSTALIDMYAKCGSIEEARGLFDTIAEKNVVTWNAMISGYGLHG 1631
             +H L  +  F S+V+V T L+ +Y+KC  +E AR LF  I + +++ +NAMI+GY  + 
Sbjct: 62   GIHCLALKVGFHSDVHVLTGLVSLYSKCKELETARLLFGHITQPDLICYNAMIAGYTCNN 121

Query: 1632 RGNEAVLLFYKMLGSGITPTVITFLSVLYACSHAGLVEEGEKIFHSMTREFNFKPSPEHY 1811
                +V LF ++L SG      T + ++   S  G ++    +  +   +      P   
Sbjct: 122  ETVSSVSLFRELLASGEKVNSSTIVGLIPVSSPFGHLQLTGSL-QTFCVKSGIVSHPSVS 180

Query: 1812 ACMVDLLGRAGKLETALEFINQMPIKPGPAEWGALLGACMIHKDTNLAQFASDKLFEMDP 1991
               V +  R  ++E A +  ++ P K   A W A++     +    L + A     EM  
Sbjct: 181  TAFVTVYCRLNEIELARQLFDESPEKT-LASWNAMIAG---YTQNGLTETAISLFREMMS 236

Query: 1992 KSVGHYVLLSNIYSA 2036
            +   + V +++I SA
Sbjct: 237  EFSPNPVTVTSILSA 251



 Score = 68.2 bits (165), Expect = 2e-08
 Identities = 40/148 (27%), Positives = 75/148 (50%)
 Frame = +3

Query: 318 LSQAKLLFKCLSSTVPLDLFYYNVQIRGFSRNNSPLDAVSLFHHLRRETILKPDKFTYAF 497
           L++ +L  +    +    L  +N  I G+++N     A+SLF  +  E    P+  T   
Sbjct: 190 LNEIELARQLFDESPEKTLASWNAMIAGYTQNGLTETAISLFREMMSE--FSPNPVTVTS 247

Query: 498 VVSAISSSGLEYVGILLHGQVIVSGCSSDLFVGSALVDMYMGSSRVRSAHKVFDEISEPD 677
           ++SA +  G   +G  +HG +      S+++V +ALVDMY     +  A K+FD ++E +
Sbjct: 248 ILSACAQLGAISLGKWVHGLIKSKNLESNIYVLTALVDMYAKCGSIVEARKLFDLMTEKN 307

Query: 678 TVLWNTMLSGLVKNCCFYDCIQIFRHMV 761
            V WN M+S    +   ++ +++F  M+
Sbjct: 308 VVTWNAMISAYGLHGDGHEALKLFTEML 335


Top