BLASTX nr result

ID: Paeonia23_contig00015702 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00015702
         (1968 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN84084.1| hypothetical protein VITISV_018999 [Vitis vinifera]   744   0.0  
emb|CBI40732.3| unnamed protein product [Vitis vinifera]              734   0.0  
ref|XP_007210680.1| hypothetical protein PRUPE_ppa023340mg [Prun...   716   0.0  
ref|XP_007018078.1| Pentatricopeptide repeat (PPR) superfamily p...   712   0.0  
ref|XP_006435387.1| hypothetical protein CICLE_v10000757mg [Citr...   678   0.0  
ref|XP_006473809.1| PREDICTED: putative pentatricopeptide repeat...   674   0.0  
ref|XP_004145547.1| PREDICTED: putative pentatricopeptide repeat...   635   e-179
ref|XP_003597616.1| hypothetical protein MTR_2g100200 [Medicago ...   633   e-179
ref|NP_199195.4| pentatricopeptide repeat-containing protein [Ar...   629   e-177
ref|XP_004486824.1| PREDICTED: putative pentatricopeptide repeat...   627   e-177
ref|XP_007158555.1| hypothetical protein PHAVU_002G162200g [Phas...   627   e-177
ref|XP_006279800.1| hypothetical protein CARUB_v10027962mg [Caps...   622   e-175
gb|EXC31210.1| hypothetical protein L484_005635 [Morus notabilis]     600   e-169
ref|XP_006403177.1| hypothetical protein EUTSA_v10003177mg [Eutr...   599   e-168
dbj|BAB11311.1| unnamed protein product [Arabidopsis thaliana]        568   e-159
ref|XP_002865400.1| pentatricopeptide repeat-containing protein ...   565   e-158
gb|EYU43538.1| hypothetical protein MIMGU_mgv1a024877mg [Mimulus...   550   e-154
ref|XP_006598344.1| PREDICTED: putative pentatricopeptide repeat...   533   e-149
ref|XP_002307761.2| hypothetical protein POPTR_0005s26850g [Popu...   519   e-144
ref|XP_006855725.1| hypothetical protein AMTR_s00044p00153760 [A...   499   e-138

>emb|CAN84084.1| hypothetical protein VITISV_018999 [Vitis vinifera]
          Length = 561

 Score =  744 bits (1922), Expect = 0.0
 Identities = 364/545 (66%), Positives = 432/545 (79%)
 Frame = -3

Query: 1888 WRSHCLSSSIPFFPFSTVPLPLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPIRYEAS 1709
            + +  L SS+  F FST+ +    L DEP  NQ K   N  ER VL +LS LLPI    S
Sbjct: 17   YHTRYLPSSVSLFQFSTLQVTSNPLMDEPTDNQIKRPSNFNERDVLYQLSGLLPICCNTS 76

Query: 1708 THNHFPESSPEKQLDSRSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXXXX 1529
                F E+SP++QL +R+VDGFLSP EKLRGVF+Q+LRGK AIE ALTN           
Sbjct: 77   ISKPFTENSPKEQLKTRAVDGFLSPGEKLRGVFIQRLRGKAAIELALTNVGIDLTIDIVS 136

Query: 1528 XXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEG 1349
               NRGNL GE MV FFNWA+K   IPKD+  Y++IIKALGRRKF +  V +L DM ++G
Sbjct: 137  EVXNRGNLGGEAMVXFFNWAVKQPTIPKDVDTYNVIIKALGRRKFIEFXVXVLKDMHIQG 196

Query: 1348 ITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGAAN 1169
            I+P +E+L IVMDSF++   VSKAI++ RNLEE G KCDTESLN+LLQCLCQRSHVGAAN
Sbjct: 197  ISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCDTESLNVLLQCLCQRSHVGAAN 256

Query: 1168 SVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLG 989
              F++MKG IP+N  TYNIII GWSK+G++ E+E+CLKAMVADGFSP+C TFSH++EGLG
Sbjct: 257  LFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKAMVADGFSPNCLTFSHLIEGLG 316

Query: 988  RSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVD 809
            R+ RI DA+E+F +M+E  C+P+  VYNA+I N+IS  DFDEC+KYY  M+S+N DPN+D
Sbjct: 317  RAGRIDDAVEVFHHMEETGCVPNACVYNALISNFISTRDFDECLKYYNFMVSSNCDPNMD 376

Query: 808  TFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKA 629
            T++KLI AFLKARKVADALEM D+M+ RG+IPTTG +TSF++PLC YGPPHAAMMIY+KA
Sbjct: 377  TYTKLIVAFLKARKVADALEMLDEMVGRGMIPTTGAITSFIEPLCQYGPPHAAMMIYKKA 436

Query: 628  RKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQL 449
            RKVGC+IS SAYKLLLMRLSRFGKCGMLL LWDEMQESGYSSD EVYEYVINGLCN  QL
Sbjct: 437  RKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESGYSSDTEVYEYVINGLCNIGQL 496

Query: 448  ENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWRS 269
            + AVLVMEE L KGFCPSRLI SKLNNKLLASNKV  AYKLFLKIK AR ++NAR++WR 
Sbjct: 497  DTAVLVMEESLXKGFCPSRLIRSKLNNKLLASNKVEMAYKLFLKIKXARQNDNARRFWRG 556

Query: 268  NGWHF 254
            NGWHF
Sbjct: 557  NGWHF 561


>emb|CBI40732.3| unnamed protein product [Vitis vinifera]
          Length = 520

 Score =  734 bits (1894), Expect = 0.0
 Identities = 356/519 (68%), Positives = 422/519 (81%)
 Frame = -3

Query: 1810 DEPEKNQTKNLVNIGERHVLSELSNLLPIRYEASTHNHFPESSPEKQLDSRSVDGFLSPE 1631
            DEP  NQ K   N  ER VL +LS LLPI    S    F E+SP++QL +R+VDGFLSP 
Sbjct: 2    DEPTDNQIKRPSNFNERDVLYQLSGLLPICCNTSISKPFTENSPKEQLKTRAVDGFLSPG 61

Query: 1630 EKLRGVFLQKLRGKIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEWMVVFFNWAIKHSKI 1451
            EKLRGVF+Q+LRGK AIE ALTN             +NRGNL GE MV+FFNWA+K   I
Sbjct: 62   EKLRGVFIQRLRGKAAIELALTNVGIDLTIDIVSEVINRGNLGGEAMVIFFNWAVKQPTI 121

Query: 1450 PKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGITPVFESLLIVMDSFVRVHHVSKAIQ 1271
            PKD+  Y++IIKALGRRKF + +V +L DM ++GI+P +E+L IVMDSF++   VSKAI+
Sbjct: 122  PKDVDTYNVIIKALGRRKFIEFVVKVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIE 181

Query: 1270 LLRNLEEIGSKCDTESLNILLQCLCQRSHVGAANSVFHSMKGKIPYNSTTYNIIISGWSK 1091
            + RNLEE G KCDTESLN+LLQCLCQRSHVGAAN  F++MKG IP+N  TYNIII GWSK
Sbjct: 182  MFRNLEEFGGKCDTESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSK 241

Query: 1090 FGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSERIADAIEIFDNMKENDCLPDTGV 911
            +G++ E+E+CLKAMVADGFSP+C TFSH++EGLGR+ RI DA+E+F +M+E  C+P+  V
Sbjct: 242  YGKIGEMERCLKAMVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACV 301

Query: 910  YNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKML 731
            YNA+I N+IS  DFDEC+KYY  M+S+N DPN+DT++KLI AFLKARKVADALEM D+M+
Sbjct: 302  YNALISNFISTRDFDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMV 361

Query: 730  ERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCG 551
             RG+IPTTG +TSF++PLC YGPPHAAMMIY+KARKVGC+IS SAYKLLLMRLSRFGKCG
Sbjct: 362  GRGMIPTTGAITSFIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCG 421

Query: 550  MLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLN 371
            MLL LWDEMQESGYSSD EVYEYVINGLCN  QL+ AVLVMEE L KGFCPSRLI SKLN
Sbjct: 422  MLLNLWDEMQESGYSSDTEVYEYVINGLCNIGQLDTAVLVMEESLHKGFCPSRLIRSKLN 481

Query: 370  NKLLASNKVGRAYKLFLKIKTARLSENARKYWRSNGWHF 254
            NKLLASNKV  AYKLFLKIK AR ++NAR++WR NGWHF
Sbjct: 482  NKLLASNKVEMAYKLFLKIKIARQNDNARRFWRGNGWHF 520


>ref|XP_007210680.1| hypothetical protein PRUPE_ppa023340mg [Prunus persica]
            gi|462406415|gb|EMJ11879.1| hypothetical protein
            PRUPE_ppa023340mg [Prunus persica]
          Length = 562

 Score =  716 bits (1849), Expect = 0.0
 Identities = 365/544 (67%), Positives = 432/544 (79%), Gaps = 1/544 (0%)
 Frame = -3

Query: 1882 SHCLSSSIPFFPFSTVPLPLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPI-RYEAST 1706
            S+ + S I    FST+     +L DE   ++ K+   + E  VL  LSNLLPI R  +ST
Sbjct: 22   SYLVHSPISSSLFSTLYAQSNSLHDE---HRIKSQSTLDESFVLDRLSNLLPISRSNSST 78

Query: 1705 HNHFPESSPEKQLDSRSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXXXXX 1526
               F  S+ +KQ++ R+VDGFL P+EKLRGVFLQKLRG  AIEHAL N            
Sbjct: 79   ATLFEPSNSDKQIEIRTVDGFLLPDEKLRGVFLQKLRGTAAIEHALDNGGVDLSVDVVAQ 138

Query: 1525 XVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGI 1346
             VNRG L  E M+VFFNWAI+   I K I  YHII+KALGRRKFF HM+ ILH M+ +GI
Sbjct: 139  VVNRGGLGAEAMLVFFNWAIRKPTIAKYIETYHIILKALGRRKFFTHMMQILHHMRAQGI 198

Query: 1345 TPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGAANS 1166
            +P  E++ IVMDSFVR  HVSKAIQ+ RNLEEIG +CDTESLN+LLQCLCQRSHVGAANS
Sbjct: 199  SPNLETISIVMDSFVRAQHVSKAIQMFRNLEEIGLECDTESLNLLLQCLCQRSHVGAANS 258

Query: 1165 VFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGR 986
              +S+KGKI +N  TYNIII GWS+ G V+EIE+ L+AMVADGFS D STFS ILEGLGR
Sbjct: 259  FLNSVKGKIQFNGNTYNIIIGGWSRHGRVSEIERILEAMVADGFSADSSTFSFILEGLGR 318

Query: 985  SERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDT 806
            + RI DA+EIFD+MK   C+PDT VYNAMI N+ISV +FDEC++YY+ M SN+ DPN+DT
Sbjct: 319  AGRIDDAVEIFDSMKGKGCMPDTRVYNAMISNFISVRNFDECVRYYKGMSSNSCDPNIDT 378

Query: 805  FSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKAR 626
            ++KLIAAFLKARKVA ALEMFD+ML RG++PTTGT+TSF++PLCSYGPP+AAMMIY+KAR
Sbjct: 379  YTKLIAAFLKARKVAGALEMFDEMLGRGLVPTTGTITSFIEPLCSYGPPYAAMMIYKKAR 438

Query: 625  KVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLE 446
            KVGC+IS SAYKLLLMRLSRFGKCGMLL +W++MQE GY+SD EVY+YVINGLCN   LE
Sbjct: 439  KVGCRISLSAYKLLLMRLSRFGKCGMLLNIWEDMQECGYASDKEVYDYVINGLCNIGHLE 498

Query: 445  NAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWRSN 266
            NAVLVMEE L+KGFCPSRL+YSKLNNKLLASNKV RAYKLFLKIK AR  +NA+++WRS 
Sbjct: 499  NAVLVMEESLQKGFCPSRLVYSKLNNKLLASNKVERAYKLFLKIKHARRYDNAQRFWRSK 558

Query: 265  GWHF 254
            GWHF
Sbjct: 559  GWHF 562


>ref|XP_007018078.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform
            1 [Theobroma cacao] gi|590595518|ref|XP_007018079.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative isoform 1 [Theobroma cacao]
            gi|590595521|ref|XP_007018080.1| Pentatricopeptide repeat
            (PPR) superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|590595525|ref|XP_007018081.1| Pentatricopeptide
            repeat (PPR) superfamily protein, putative isoform 1
            [Theobroma cacao] gi|508723406|gb|EOY15303.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative isoform 1 [Theobroma cacao]
            gi|508723407|gb|EOY15304.1| Pentatricopeptide repeat
            (PPR) superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|508723408|gb|EOY15305.1| Pentatricopeptide
            repeat (PPR) superfamily protein, putative isoform 1
            [Theobroma cacao] gi|508723409|gb|EOY15306.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative isoform 1 [Theobroma cacao]
          Length = 562

 Score =  712 bits (1839), Expect = 0.0
 Identities = 359/547 (65%), Positives = 439/547 (80%), Gaps = 3/547 (0%)
 Frame = -3

Query: 1885 RSH--CLSSSIPFFPFSTVPLPLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPIRYEA 1712
            R+H  C++S    F FST  L  +++K EP  NQ  N   + ER VL ELS+L    +  
Sbjct: 19   RNHLPCINSFSSAFSFST--LSDSSIK-EPSFNQISNQSTVDERRVLGELSDLFQFSHSN 75

Query: 1711 STHNH-FPESSPEKQLDSRSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXX 1535
            +T  + + ES P KQ++S +VD +L PEEKLRGVFLQKLRGK AIEHAL+N         
Sbjct: 76   ATVPYPYRESYPPKQIESGAVDEYLLPEEKLRGVFLQKLRGKTAIEHALSNVPVELSIDI 135

Query: 1534 XXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKM 1355
                VN GNL GE MV+FFNWA+K   I +DI  Y+IIIKALGRRKFFK M+  LHDM  
Sbjct: 136  IAKVVNIGNLGGEAMVLFFNWAMKQPGIARDIHSYYIIIKALGRRKFFKFMIETLHDMVK 195

Query: 1354 EGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGA 1175
            EGI P  E+L IVMDSF+R   V KAI+   NLEE+G K DT+SLN+LLQCLC+R+HVGA
Sbjct: 196  EGIKPDVETLSIVMDSFIRAQRVQKAIETFENLEELGLKRDTKSLNVLLQCLCRRAHVGA 255

Query: 1174 ANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEG 995
            ANS+F+++ GK+ +N  TYNI+ISGWSK G V++IE+ LKAM+AD F+PDCSTFS+++EG
Sbjct: 256  ANSLFNAVNGKVKFNCDTYNIMISGWSKLGRVSKIERILKAMIADEFTPDCSTFSYLIEG 315

Query: 994  LGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPN 815
            LGR+ RI DA+EIFD+MKE  C+PDT VYNAMI N+ISVG+FDECMKYY+ +L++N DP+
Sbjct: 316  LGRAGRIDDAVEIFDHMKEKGCIPDTRVYNAMISNFISVGNFDECMKYYKGLLNSNSDPD 375

Query: 814  VDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQ 635
            VDT++KLI+AFLKA+ VADALE+FD+ML +GI+PTTGT+TSF++PLCSYGPP+AAMM Y+
Sbjct: 376  VDTYTKLISAFLKAQNVADALEIFDEMLVQGIVPTTGTLTSFVEPLCSYGPPYAAMMFYK 435

Query: 634  KARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNE 455
            KARK GCKIS SAYKLLLMRLSRFGKCGMLL +WDEMQESG++SD+EVYE+VINGLCN  
Sbjct: 436  KARKFGCKISLSAYKLLLMRLSRFGKCGMLLNIWDEMQESGHTSDMEVYEHVINGLCNIG 495

Query: 454  QLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYW 275
             LENAVLVMEE LRKGFCPSR++YSKLNNKLLASN+V +AYKLFLKIK AR  ENAR+YW
Sbjct: 496  HLENAVLVMEEALRKGFCPSRVLYSKLNNKLLASNEVEKAYKLFLKIKNARRDENARRYW 555

Query: 274  RSNGWHF 254
            R+NGWHF
Sbjct: 556  RANGWHF 562


>ref|XP_006435387.1| hypothetical protein CICLE_v10000757mg [Citrus clementina]
            gi|557537509|gb|ESR48627.1| hypothetical protein
            CICLE_v10000757mg [Citrus clementina]
          Length = 551

 Score =  678 bits (1750), Expect = 0.0
 Identities = 348/547 (63%), Positives = 419/547 (76%), Gaps = 7/547 (1%)
 Frame = -3

Query: 1873 LSSSIPFFPFSTVPLPLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPIRYEASTHNHF 1694
            LSSS   F FST  +      +E   NQ KN+ ++ E HVL ELS+L    ++ S+HN F
Sbjct: 10   LSSSFSLFSFST-SVRSNLSYNELLSNQKKNMSSLDEHHVLKELSDL----FQISSHNSF 64

Query: 1693 PE------SSPEKQLDS-RSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXX 1535
            P       S+  K++DS R+VD FL PEE+LRGVFLQKL+GK  IE AL N         
Sbjct: 65   PNVYKESRSNSVKRIDSSRAVDEFLLPEERLRGVFLQKLKGKGVIEDALWNVNVDLSLDV 124

Query: 1534 XXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKM 1355
                VNRGNLSGE MV+FFNWAIKH  + KD+  Y++I+KALGRRKFF  M  +L DM  
Sbjct: 125  VGKVVNRGNLSGEAMVLFFNWAIKHPNVAKDVKSYNVIVKALGRRKFFDFMCNVLSDMAK 184

Query: 1354 EGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGA 1175
            EG+ P  E+L IVMDSF+R   V KAIQ+L  LE+ G K D ESLN++L CLCQR HVGA
Sbjct: 185  EGVNPDLETLSIVMDSFIRAGQVYKAIQMLGRLEDFGLKFDAESLNVVLWCLCQRLHVGA 244

Query: 1174 ANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEG 995
            A+S+F+SMKGKI +N  TYNI+ISGWSK G+V E+E+ LK +VA+GFSPD  TFS ++EG
Sbjct: 245  ASSLFNSMKGKILFNVMTYNIVISGWSKLGQVVEMERVLKEIVAEGFSPDSLTFSFLIEG 304

Query: 994  LGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPN 815
            LGR+ RI DAIE+FD MKE  C PDT  YNA+I NYISVGDFDECMKYY+ M SNN +PN
Sbjct: 305  LGRAGRIDDAIEVFDTMKEKGCGPDTNAYNAVISNYISVGDFDECMKYYKGMSSNNCEPN 364

Query: 814  VDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQ 635
            +DT+++LI+  LK+RKVADALE+F++ML+RGI+P+TGT+TSFL+PLCSYGPPHAAMM+Y+
Sbjct: 365  MDTYTRLISGLLKSRKVADALEVFEEMLDRGIVPSTGTITSFLEPLCSYGPPHAAMMMYK 424

Query: 634  KARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNE 455
            KARKVGCK+S +AYKLLL RLS FGKCGMLL LW EMQESGY SD E+YEYVI GLCN  
Sbjct: 425  KARKVGCKLSLTAYKLLLRRLSGFGKCGMLLDLWHEMQESGYPSDGEIYEYVIAGLCNIG 484

Query: 454  QLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYW 275
            QLENAVLVMEE LRKGFCPSRL+YSKL+NKLLASNK+  AY LF KIK AR ++ AR+ W
Sbjct: 485  QLENAVLVMEESLRKGFCPSRLVYSKLSNKLLASNKLESAYNLFRKIKIARQNDYARRLW 544

Query: 274  RSNGWHF 254
            RS GWHF
Sbjct: 545  RSKGWHF 551


>ref|XP_006473809.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g43820-like [Citrus sinensis]
          Length = 558

 Score =  674 bits (1739), Expect = 0.0
 Identities = 346/547 (63%), Positives = 418/547 (76%), Gaps = 7/547 (1%)
 Frame = -3

Query: 1873 LSSSIPFFPFSTVPLPLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPIRYEASTHNHF 1694
            LSSS   F FST  +      +E   NQ KN+ ++ E HVL ELS+L    ++ S+HN F
Sbjct: 17   LSSSFSLFLFST-SVRSNLSYNELLSNQKKNMSSLDEHHVLKELSDL----FQISSHNSF 71

Query: 1693 PE------SSPEKQLDS-RSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXX 1535
            P       S+  K++DS R+VD FL PEE+LRGVFLQKL+GK  IE AL N         
Sbjct: 72   PNVYKESRSNSVKRIDSSRAVDEFLLPEERLRGVFLQKLKGKGVIEDALWNVNVDLSLDV 131

Query: 1534 XXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKM 1355
                VNRGNLSGE MV+FFNWAIKH  + KD+  Y++I+KALGRRKFF  M  +L DM  
Sbjct: 132  VGKVVNRGNLSGEAMVLFFNWAIKHPNVAKDVKSYNVIVKALGRRKFFDFMCNVLSDMAK 191

Query: 1354 EGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGA 1175
            EG+ P  E+L IVMDSF+R   V KAIQ+L  LE+ G K D ESLN++L CLCQR HVGA
Sbjct: 192  EGVNPDLETLSIVMDSFIRAGQVYKAIQMLGRLEDFGLKFDAESLNVVLWCLCQRLHVGA 251

Query: 1174 ANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEG 995
            A+S+F+SMKGK+ +N  TYNI+ISGWSK G+V E+E+ LK +VA+GFSPD  TFS ++EG
Sbjct: 252  ASSLFNSMKGKVLFNVMTYNIVISGWSKLGQVVEMERVLKEIVAEGFSPDSLTFSFLIEG 311

Query: 994  LGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPN 815
            LGR+ RI DAIE+FD MKE  C PDT  YNA+I NYISVGDFDECMKYY+ M S N +PN
Sbjct: 312  LGRAGRIDDAIEVFDTMKEKGCGPDTNAYNAVISNYISVGDFDECMKYYKGMSSYNCEPN 371

Query: 814  VDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQ 635
            +DT+++LI+  LK+RKVADALE+F++ML+RGI+P+TGT+TSFL+PLCSYGPPHAAMM+Y+
Sbjct: 372  MDTYTRLISGLLKSRKVADALEVFEEMLDRGIVPSTGTITSFLEPLCSYGPPHAAMMMYK 431

Query: 634  KARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNE 455
            KARKVGCK+S +AYKLLL RLS FGKCGMLL LW EMQESGY SD E+YEYVI GLCN  
Sbjct: 432  KARKVGCKLSLTAYKLLLRRLSGFGKCGMLLDLWHEMQESGYPSDGEIYEYVIAGLCNIG 491

Query: 454  QLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYW 275
            QLENAVLVMEE LRKGFCPSRL+YSKL+NKLLASNK+  AY LF KIK AR ++ AR+ W
Sbjct: 492  QLENAVLVMEESLRKGFCPSRLVYSKLSNKLLASNKLESAYNLFRKIKIARQNDYARRLW 551

Query: 274  RSNGWHF 254
            RS GWHF
Sbjct: 552  RSKGWHF 558


>ref|XP_004145547.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g43820-like [Cucumis sativus]
          Length = 572

 Score =  635 bits (1637), Expect = e-179
 Identities = 319/531 (60%), Positives = 390/531 (73%)
 Frame = -3

Query: 1846 FSTVPLPLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPIRYEASTHNHFPESSPEKQL 1667
            FST+  P     D    N  +N   I ER V+SELS+LL +    S +N   E+S EKQ+
Sbjct: 43   FSTLDEPSNLFDDGLSGNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQM 102

Query: 1666 DSRSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEWMV 1487
              R+VDGFL PEEKLRGVFLQKL GK AIEHAL N             +N G+L  E MV
Sbjct: 103  PVRAVDGFLLPEEKLRGVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMV 162

Query: 1486 VFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGITPVFESLLIVMDS 1307
             FF WAIK   IPKD + Y+II+KALGRR FF  M+ +L++M  EG+    E + IV+DS
Sbjct: 163  TFFYWAIKQPSIPKDASSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDS 222

Query: 1306 FVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGAANSVFHSMKGKIPYNS 1127
             V+ H VSKA+Q  RNL+EIG KCDTE+LNILLQC+C+RSHVGAANS F+  KG IP+N 
Sbjct: 223  LVKGHQVSKALQFFRNLKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNV 282

Query: 1126 TTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSERIADAIEIFDN 947
             TYNI+I GWS++G   E+E+ LKAM  DGFSPDC T ++++E LGR+ +I DA++IFD 
Sbjct: 283  MTYNIVIGGWSRYGRHGEVEQMLKAMELDGFSPDCLTHTYLIECLGRANQIDDAVKIFDK 342

Query: 946  MKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARK 767
            M EN C PD   YNAMI N+I +GDFD+C+ YYE MLSN  +P+++T+S LI  FLKA+K
Sbjct: 343  MDENGCTPDVDAYNAMISNFICIGDFDQCLTYYERMLSNRCEPDMNTYSNLITGFLKAKK 402

Query: 766  VADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKL 587
            VADALEMFD+M+ R IIPTTG +TSF+Q  CSYGPPHAAM+IY+KARKVGC+IS +AYKL
Sbjct: 403  VADALEMFDEMVAR-IIPTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKL 461

Query: 586  LLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKG 407
            LLMRLS FGK GMLL +W+EMQESGY  DVE YE+ I+ LC   QLENAVLVMEECLR+G
Sbjct: 462  LLMRLSLFGKFGMLLNIWNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQG 521

Query: 406  FCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWRSNGWHF 254
            F PSR   SKLNNKLLA N+   AYKL+LKIK AR  EN ++ WR+ GWH+
Sbjct: 522  FFPSRRTRSKLNNKLLACNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572


>ref|XP_003597616.1| hypothetical protein MTR_2g100200 [Medicago truncatula]
            gi|124360397|gb|ABN08410.1| Pentatricopeptide repeat
            [Medicago truncatula] gi|355486664|gb|AES67867.1|
            hypothetical protein MTR_2g100200 [Medicago truncatula]
          Length = 527

 Score =  633 bits (1633), Expect = e-179
 Identities = 311/511 (60%), Positives = 388/511 (75%)
 Frame = -3

Query: 1786 KNLVNIGERHVLSELSNLLPIRYEASTHNHFPESSPEKQLDSRSVDGFLSPEEKLRGVFL 1607
            +N  N+ ER +L ++S LLPI             +P+ Q DS+S+DGFLSPE+KLRG+FL
Sbjct: 30   QNSSNLDERLILHQISQLLPIP---------TSKTPDSQSDSKSIDGFLSPEDKLRGIFL 80

Query: 1606 QKLRGKIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYH 1427
            QKL+GK AIE AL+N             +N GNL GE MV+FFNWA+K   +P+D+  YH
Sbjct: 81   QKLKGKAAIEQALSNVCIDVNVDIIGKVLNFGNLGGEAMVMFFNWALKQPMVPRDVGSYH 140

Query: 1426 IIIKALGRRKFFKHMVGILHDMKMEGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEI 1247
            +I+KALGRRKFF  M+ +L +M++ GI      L IV+DSFV   HVSKAIQL  NL+++
Sbjct: 141  VIVKALGRRKFFVFMMQVLDEMRLNGIKADLLMLSIVIDSFVNAGHVSKAIQLFGNLDDL 200

Query: 1246 GSKCDTESLNILLQCLCQRSHVGAANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIE 1067
            G   DTE LN+LL CLC+R HVGAA SVF+SMKGK+ +N  TYN+++ GWSK G VNEIE
Sbjct: 201  GLCRDTEVLNVLLSCLCRRCHVGAAASVFNSMKGKVSFNVDTYNVVVGGWSKLGRVNEIE 260

Query: 1066 KCLKAMVADGFSPDCSTFSHILEGLGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNY 887
            K +K M  +GFSPD +T +  LEGLGR+ R+ +A+E+F +MKE D    T +YNAMIFN+
Sbjct: 261  KVMKEMEVEGFSPDFNTLAFFLEGLGRAGRMDEAVEVFGSMKEKD----TAIYNAMIFNF 316

Query: 886  ISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTT 707
            IS+GDFD  MKYY  MLS+N +PN+ T+S++I AFL+ RKVADAL MFD+ML +G++P T
Sbjct: 317  ISIGDFDGFMKYYNGMLSDNCEPNIHTYSRMITAFLRTRKVADALLMFDEMLRQGVVPPT 376

Query: 706  GTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDE 527
            GT+TSF++ LCSYGPP+AAMMIY+K RK+ CKIS  AYK+LLMRLS+FGKCG LL +W E
Sbjct: 377  GTITSFIKQLCSYGPPYAAMMIYKKTRKLECKISMEAYKILLMRLSKFGKCGSLLSVWQE 436

Query: 526  MQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNK 347
            MQE GYSSDVEVYEY+I+GL N  QLENAVLVMEE LRKGFCPSRL+YSKL+NKLLASN 
Sbjct: 437  MQECGYSSDVEVYEYIISGLYNIGQLENAVLVMEEALRKGFCPSRLVYSKLSNKLLASNL 496

Query: 346  VGRAYKLFLKIKTARLSENARKYWRSNGWHF 254
              RAY+LFLKIK AR  +NAR YWR NGWHF
Sbjct: 497  TERAYRLFLKIKHARSLKNARSYWRDNGWHF 527


>ref|NP_199195.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|223635652|sp|P0C8R0.1|PP416_ARATH RecName:
            Full=Putative pentatricopeptide repeat-containing protein
            At5g43820 gi|332007631|gb|AED95014.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 546

 Score =  629 bits (1622), Expect = e-177
 Identities = 304/506 (60%), Positives = 387/506 (76%)
 Frame = -3

Query: 1771 IGERHVLSELSNLLPIRYEASTHNHFPESSPEKQLDSRSVDGFLSPEEKLRGVFLQKLRG 1592
            + E +VL+ELS+LLPI    ++ +   +SS + Q+   ++D FLS E+KLRGVFLQKL+G
Sbjct: 45   VDESYVLAELSSLLPISSNKTSVSK-EDSSSKNQV---AIDSFLSAEDKLRGVFLQKLKG 100

Query: 1591 KIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKA 1412
            K AI+ +L++             +NRGNLSGE MV FF+WA++   + KD+  Y +I++A
Sbjct: 101  KSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSVILRA 160

Query: 1411 LGRRKFFKHMVGILHDMKMEGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCD 1232
            LGRRK F  M+ +L  M  EG+ P  E L I MDSFVRVH+V +AI+L    E  G KC 
Sbjct: 161  LGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFGVKCS 220

Query: 1231 TESLNILLQCLCQRSHVGAANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKA 1052
            TES N LL+CLC+RSHV AA SVF++ KG IP++S +YNI+ISGWSK GEV E+EK LK 
Sbjct: 221  TESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEKVLKE 280

Query: 1051 MVADGFSPDCSTFSHILEGLGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGD 872
            MV  GF PDC ++SH++EGLGR+ RI D++EIFDN+K    +PD  VYNAMI N+IS  D
Sbjct: 281  MVESGFGPDCLSYSHLIEGLGRTGRINDSVEIFDNIKHKGNVPDANVYNAMICNFISARD 340

Query: 871  FDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTS 692
            FDE M+YY  ML    +PN++T+SKL++  +K RKV+DALE+F++ML RG++PTTG VTS
Sbjct: 341  FDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGVLPTTGLVTS 400

Query: 691  FLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESG 512
            FL+PLCSYGPPHAAM+IYQK+RK GC+IS SAYKLLL RLSRFGKCGMLL +WDEMQESG
Sbjct: 401  FLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEMQESG 460

Query: 511  YSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAY 332
            Y SDVEVYEY+++GLC    LENAVLVMEE +RKGFCP+R +YS+L++KL+ASNK   AY
Sbjct: 461  YPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNRFVYSRLSSKLMASNKTELAY 520

Query: 331  KLFLKIKTARLSENARKYWRSNGWHF 254
            KLFLKIK AR +ENAR +WRSNGWHF
Sbjct: 521  KLFLKIKKARATENARSFWRSNGWHF 546


>ref|XP_004486824.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g43820-like isoform X1 [Cicer arietinum]
            gi|502081302|ref|XP_004486825.1| PREDICTED: putative
            pentatricopeptide repeat-containing protein
            At5g43820-like isoform X2 [Cicer arietinum]
          Length = 539

 Score =  627 bits (1618), Expect = e-177
 Identities = 315/537 (58%), Positives = 399/537 (74%)
 Frame = -3

Query: 1864 SIPFFPFSTVPLPLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPIRYEASTHNHFPES 1685
            ++P    S +PL   +    P+     N  ++ ER VL ++S LLPI    ST  +   S
Sbjct: 20   TLPSISSSLIPLLHISSLHTPQ-----NSSHLDERLVLHQISQLLPI----STSKNRESS 70

Query: 1684 SPEKQLDSRSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXXXXXXVNRGNL 1505
              E    S+SVDGFLSPE+KLRG+FLQKL+GK  +E AL+              +N GNL
Sbjct: 71   VSE----SKSVDGFLSPEDKLRGIFLQKLKGKTTVEQALSGVCVDVNADIIGRVLNYGNL 126

Query: 1504 SGEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGITPVFESL 1325
             GE MV FFNWA+K   +P D+  YH+I+KALGRRKFF  M+ +L+DM++ GI      L
Sbjct: 127  GGEAMVTFFNWALKQPMVPNDVGTYHVIVKALGRRKFFVFMMQVLNDMRLNGIKADLFML 186

Query: 1324 LIVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGAANSVFHSMKG 1145
             IV+DSFV   HVSKAIQ+  NL+++G   DTE+LN+LL CLC+R HVGAA SVF+SMKG
Sbjct: 187  SIVIDSFVNAGHVSKAIQVFGNLDDLGLDRDTEALNVLLSCLCRRCHVGAAASVFNSMKG 246

Query: 1144 KIPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSERIADA 965
            K+ +N  TYN++  GWSK G VNEIE+ +K M  +GFSPD +T++  LEGLGR+ R+ +A
Sbjct: 247  KVIFNVATYNVVAGGWSKSGRVNEIERVMKEMEVEGFSPDFTTYAFYLEGLGRAGRMDEA 306

Query: 964  IEIFDNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAA 785
            +++F NMKE D    T  YNAMIFN+IS+G+FDECMKYY  M S+N +PN+DT++++I A
Sbjct: 307  VQVFCNMKEKD----TTTYNAMIFNFISIGNFDECMKYYNEMSSDNCEPNIDTYTRMITA 362

Query: 784  FLKARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKIS 605
            FL+ RKVADAL MFD+ML +G++P TGT++SF++ LCSYGPP+AAMMIY+KARK+ CKIS
Sbjct: 363  FLRTRKVADALLMFDEMLRQGVVPPTGTISSFIKRLCSYGPPYAAMMIYKKARKLECKIS 422

Query: 604  SSAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVME 425
              AYKLLLMRLS+FGKCG LL +W EMQE GYSSD+EVYEY+I+GL N  QLENAVLVME
Sbjct: 423  MEAYKLLLMRLSKFGKCGTLLSVWQEMQECGYSSDIEVYEYIISGLYNIGQLENAVLVME 482

Query: 424  ECLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWRSNGWHF 254
            E LRKGFCPSRL+YSKL+NKLLAS+K  RAY+LFLKIK AR  +NAR YWRSNGWHF
Sbjct: 483  EALRKGFCPSRLVYSKLSNKLLASDKTERAYRLFLKIKHARALKNARSYWRSNGWHF 539


>ref|XP_007158555.1| hypothetical protein PHAVU_002G162200g [Phaseolus vulgaris]
            gi|561031970|gb|ESW30549.1| hypothetical protein
            PHAVU_002G162200g [Phaseolus vulgaris]
          Length = 549

 Score =  627 bits (1616), Expect = e-177
 Identities = 307/533 (57%), Positives = 397/533 (74%), Gaps = 1/533 (0%)
 Frame = -3

Query: 1849 PFSTVPL-PLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPIRYEASTHNHFPESSPEK 1673
            P ++ P  PL++L   P  +   +  +I ER +  ++S+L PI    S  N   E     
Sbjct: 19   PLASTPCSPLSSLH-APPCSPHHHHPHIDERLIHDQISHLFPIPTSKS-QNTVSEPLKPS 76

Query: 1672 QLDSRSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEW 1493
             LD++SVD FL PE+KLRGVFLQKL+GK AIE AL+N             +N GNLSGE+
Sbjct: 77   HLDAKSVDAFLPPEDKLRGVFLQKLKGKAAIETALSNVGADVDVNILGKVLNNGNLSGEF 136

Query: 1492 MVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGITPVFESLLIVM 1313
            MV FFNWA+K   IP ++  YH+I+KALGRRKFF  M+G+L DM+  GI      L IV+
Sbjct: 137  MVTFFNWAVKLPGIPNEVGSYHVIVKALGRRKFFVFMMGVLCDMRKCGINGDLLLLSIVI 196

Query: 1312 DSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGAANSVFHSMKGKIPY 1133
            DSFVR  HVS+AIQ+  NL+++G + DTE+LN+LL CLC RSHVGAANSV +SMKGK+ +
Sbjct: 197  DSFVRAGHVSRAIQIFGNLDDLGVRRDTEALNVLLSCLCHRSHVGAANSVLNSMKGKVCF 256

Query: 1132 NSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSERIADAIEIF 953
            +  TYN++  GWSK G+V E+E+ ++ M  DG   DC TF  ++E LGR  R+ +A+E+F
Sbjct: 257  DVGTYNVVAGGWSKIGKVGEVERIMREMEVDGVGHDCRTFGFLMESLGRVGRMDEAVEVF 316

Query: 952  DNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKA 773
              M+E +C PDT  YNAMIFN++SVGDF+EC+KYY+ MLS+N +P++DTF ++I  FL+ 
Sbjct: 317  CGMREKNCQPDTAAYNAMIFNFVSVGDFEECIKYYKKMLSDNCEPDLDTFVRIITGFLRV 376

Query: 772  RKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSAY 593
            RKVADAL+MFD+ML RG++P+ G +T+F++ LCSYGPP+AA++IY+KARK+GC IS  AY
Sbjct: 377  RKVADALQMFDEMLRRGVVPSIGIITTFIKRLCSYGPPYAALVIYKKARKLGCMISMEAY 436

Query: 592  KLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLR 413
            K+LLMRLS  GKCG LL +W+EMQE GYSSD+EVYEY+I+GLCN  QLENAVLVMEE L 
Sbjct: 437  KILLMRLSEVGKCGTLLSIWEEMQECGYSSDLEVYEYIISGLCNVGQLENAVLVMEEALH 496

Query: 412  KGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWRSNGWHF 254
            KGFCPSRL+YSKL+N+LLA+ K  RAYKLFLKIK AR  ENAR YWRSNGWHF
Sbjct: 497  KGFCPSRLVYSKLSNRLLATEKTERAYKLFLKIKHARSLENARNYWRSNGWHF 549


>ref|XP_006279800.1| hypothetical protein CARUB_v10027962mg [Capsella rubella]
            gi|482548504|gb|EOA12698.1| hypothetical protein
            CARUB_v10027962mg [Capsella rubella]
          Length = 547

 Score =  622 bits (1603), Expect = e-175
 Identities = 300/506 (59%), Positives = 379/506 (74%)
 Frame = -3

Query: 1771 IGERHVLSELSNLLPIRYEASTHNHFPESSPEKQLDSRSVDGFLSPEEKLRGVFLQKLRG 1592
            + E +VLSELS+LLPI Y  ++     +     +    ++D FLSPEE++RGVFLQKL+G
Sbjct: 45   LDESYVLSELSSLLPISYNRTS---VAKEETVSRNQETAIDLFLSPEERIRGVFLQKLKG 101

Query: 1591 KIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKA 1412
            K AI+ +L++             VNRGNLSGE MV FFNWAI    + KD+  Y +I++A
Sbjct: 102  KFAIQKSLSSLGIGLSIEIVADVVNRGNLSGEAMVSFFNWAICEPGVSKDVDSYCVILRA 161

Query: 1411 LGRRKFFKHMVGILHDMKMEGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCD 1232
            LGRRKFF  M+ +L  M  EG+ P    L I MDSF +VH+V +AI+L    E  G  C+
Sbjct: 162  LGRRKFFSFMMDVLRGMLCEGVKPDLRCLTIAMDSFTKVHYVRRAIELFEESESFGVNCN 221

Query: 1231 TESLNILLQCLCQRSHVGAANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKA 1052
            TES N LL+CLC+RSHV AA SVF+S KG IP++  TYN++ISGWSK GE+ E+EK LK 
Sbjct: 222  TESFNALLRCLCERSHVTAAKSVFNSKKGNIPFDGLTYNVMISGWSKLGEIEEMEKVLKE 281

Query: 1051 MVADGFSPDCSTFSHILEGLGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGD 872
            MV  GF PDC ++SH++EGLGR+ RI D++EIFDN+K    +PD  VYNAMI N+IS  D
Sbjct: 282  MVESGFGPDCLSYSHLIEGLGRAGRINDSVEIFDNIKHKGSVPDANVYNAMICNFISARD 341

Query: 871  FDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTS 692
            FDE + YY  ML    +PN++T+SKL++  +K RKV+DALE+F++ML RG +PTTG VTS
Sbjct: 342  FDESVMYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGFLPTTGLVTS 401

Query: 691  FLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESG 512
            FL+PLCSYGPPHAAM+IYQK+RK GCKIS SAYKLLL RLSRFGKCGMLL +WDEMQE G
Sbjct: 402  FLKPLCSYGPPHAAMVIYQKSRKAGCKISESAYKLLLKRLSRFGKCGMLLNVWDEMQECG 461

Query: 511  YSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAY 332
            Y SDVEVYEY+++GLC    L+NAVLVMEE +RKGFCP+R +YS+L++KL+ASNK   AY
Sbjct: 462  YPSDVEVYEYIVDGLCIIGHLDNAVLVMEEAMRKGFCPNRFVYSRLSSKLMASNKTELAY 521

Query: 331  KLFLKIKTARLSENARKYWRSNGWHF 254
            KLFLKIK AR +ENAR++WRSNGWHF
Sbjct: 522  KLFLKIKKARATENARRFWRSNGWHF 547


>gb|EXC31210.1| hypothetical protein L484_005635 [Morus notabilis]
          Length = 591

 Score =  600 bits (1547), Expect = e-169
 Identities = 311/539 (57%), Positives = 389/539 (72%), Gaps = 6/539 (1%)
 Frame = -3

Query: 1861 IPFFPFSTVPLPLTTL---KDEPEKN-QTKNLVNIGERHVLSELSNLLPIRYEASTHNHF 1694
            +P+ P   +  P ++L    D P    +T+N   I ER VL EL++LLP+       +  
Sbjct: 25   LPYLPSPILSSPFSSLDGQSDTPNNEYRTRNQCVIDERSVLDELADLLPVLRGTPASDLH 84

Query: 1693 PESSPEKQLD-SRSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXXXXXXVN 1517
               + EK+++ +R+ DGFL PEEKLRGVFLQ LRGK AIE ALT+             VN
Sbjct: 85   KRGNSEKRVEITRAADGFLLPEEKLRGVFLQNLRGKTAIEQALTDVDVELNVEVVGKVVN 144

Query: 1516 RGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGITPV 1337
            RGNL  + MV+FFNWAI+   I KDI  YHII+KALGRRKF   MV +LH +++EG+ P 
Sbjct: 145  RGNLDDKKMVMFFNWAIRQPTISKDIDTYHIILKALGRRKFLNCMVEVLHQLRIEGVNPN 204

Query: 1336 FESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGAANSVFH 1157
             E+L IVMDS VR   VSKAI+  RNL+E+G  CDTESLN+LL+CLC+RSHVGAANS+ H
Sbjct: 205  LETLEIVMDSLVRARQVSKAIRTFRNLDELGLDCDTESLNVLLECLCRRSHVGAANSLLH 264

Query: 1156 SMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSER 977
            SMKGKIP+N  TYNI++SGW +FG V E+E+ L+ MV DG  PD ST S+++EGLGR+ R
Sbjct: 265  SMKGKIPFNGATYNIVMSGWCRFGRVGEMERILEMMVGDGIDPDGSTVSNLIEGLGRAGR 324

Query: 976  IADAIEIFDNMKE-NDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFS 800
            I DA++IF++MKE N  +PD+ VYNAMI NYI+VGD DEC+KYY  MLS+  +P++DT++
Sbjct: 325  IDDAVKIFEDMKEKNGWVPDSSVYNAMISNYIAVGDCDECVKYYNSMLSSACEPSIDTYT 384

Query: 799  KLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKV 620
            KLI AFLK R+VADALE+FD+ML+RG++P+TGTVTSF++PLCSYGPPHAAMM+Y+KA+KV
Sbjct: 385  KLIGAFLKVRRVADALELFDEMLDRGVVPSTGTVTSFIEPLCSYGPPHAAMMVYKKAKKV 444

Query: 619  GCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENA 440
            GC+IS SAYKLLL+RLSRFG                                   QLENA
Sbjct: 445  GCRISLSAYKLLLIRLSRFG-----------------------------------QLENA 469

Query: 439  VLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWRSNG 263
            VLVMEECLRKGFCPSRLI SKLNNKLLA NKV  AYKLFLK+K ARL +NAR+YWR+ G
Sbjct: 470  VLVMEECLRKGFCPSRLICSKLNNKLLALNKVEIAYKLFLKLKDARLEDNARRYWRAKG 528


>ref|XP_006403177.1| hypothetical protein EUTSA_v10003177mg [Eutrema salsugineum]
            gi|557104290|gb|ESQ44630.1| hypothetical protein
            EUTSA_v10003177mg [Eutrema salsugineum]
          Length = 541

 Score =  599 bits (1544), Expect = e-168
 Identities = 298/506 (58%), Positives = 377/506 (74%)
 Frame = -3

Query: 1771 IGERHVLSELSNLLPIRYEASTHNHFPESSPEKQLDSRSVDGFLSPEEKLRGVFLQKLRG 1592
            + E +VL+ELS+LLPI  + ST     + S   QL   +VD FLSPEEKLRGVFLQKL+G
Sbjct: 49   VDESYVLAELSSLLPISSKTSTAKD--DVSSRNQL---AVDSFLSPEEKLRGVFLQKLKG 103

Query: 1591 KIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKA 1412
            + A   ALT+             V+RGNLSGE MV FF+WAI+   + KD+  Y++I++A
Sbjct: 104  ETATRKALTSLGIDLSIETVSNVVDRGNLSGEAMVTFFDWAIREPGVSKDVESYYVILRA 163

Query: 1411 LGRRKFFKHMVGILHDMKMEGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCD 1232
            LGRRKFF  M  +L +M    + P  + L+I MDSF +  +V +AIQL    E+ G KC 
Sbjct: 164  LGRRKFFSFMTDVLREM----VNPDLKCLIIAMDSFAKARYVRRAIQLFEESEDFGVKCC 219

Query: 1231 TESLNILLQCLCQRSHVGAANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKA 1052
            TES N LLQCLC+RSHV AA+SVF++ KGKIP++  TYNI+ISGWSK GEV E+EK LK 
Sbjct: 220  TESFNALLQCLCERSHVSAASSVFNAKKGKIPFDVCTYNIMISGWSKLGEVGEMEKVLKE 279

Query: 1051 MVADGFSPDCSTFSHILEGLGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGD 872
            MV  GF P+  +FS+++EGLGR+ R+ D+++IFDNM     +PD  VYNAMI N+I   D
Sbjct: 280  MVESGFVPNGLSFSYLIEGLGRAGRVNDSVKIFDNMD----VPDANVYNAMICNFIFARD 335

Query: 871  FDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTS 692
            FDE ++YY  ML    +PN +T+SKL++  +K RK+ADALE++++ML RGI+PTTG VTS
Sbjct: 336  FDESVRYYRRMLDKGCEPNWETYSKLVSGLIKGRKIADALEIYEEMLSRGIVPTTGLVTS 395

Query: 691  FLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESG 512
            FL+PLC YGPPHAAM+IYQKARK GC+IS SAYKLLL RLS FGKCGMLL +WDEMQE  
Sbjct: 396  FLKPLCCYGPPHAAMVIYQKARKAGCRISQSAYKLLLKRLSGFGKCGMLLNVWDEMQECE 455

Query: 511  YSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAY 332
            YSSDVEVYEY+++GLCN   LENAVLVMEE +RKGFCP+R +YS+L+NKL++S K   AY
Sbjct: 456  YSSDVEVYEYIVDGLCNIGHLENAVLVMEEAMRKGFCPNRFVYSRLSNKLMSSRKTEMAY 515

Query: 331  KLFLKIKTARLSENARKYWRSNGWHF 254
            KLFLKIK ARL +NAR++WR NGWHF
Sbjct: 516  KLFLKIKEARLKDNARRFWRRNGWHF 541


>dbj|BAB11311.1| unnamed protein product [Arabidopsis thaliana]
          Length = 680

 Score =  568 bits (1463), Expect = e-159
 Identities = 275/467 (58%), Positives = 354/467 (75%)
 Frame = -3

Query: 1771 IGERHVLSELSNLLPIRYEASTHNHFPESSPEKQLDSRSVDGFLSPEEKLRGVFLQKLRG 1592
            + E +VL+ELS+LLPI    ++ +   +SS + Q+   ++D FLS E+KLRGVFLQKL+G
Sbjct: 45   VDESYVLAELSSLLPISSNKTSVSK-EDSSSKNQV---AIDSFLSAEDKLRGVFLQKLKG 100

Query: 1591 KIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKA 1412
            K AI+ +L++             +NRGNLSGE MV FF+WA++   + KD+  Y +I++A
Sbjct: 101  KSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSVILRA 160

Query: 1411 LGRRKFFKHMVGILHDMKMEGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCD 1232
            LGRRK F  M+ +L  M  EG+ P  E L I MDSFVRVH+V +AI+L    E  G KC 
Sbjct: 161  LGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFGVKCS 220

Query: 1231 TESLNILLQCLCQRSHVGAANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKA 1052
            TES N LL+CLC+RSHV AA SVF++ KG IP++S +YNI+ISGWSK GEV E+EK LK 
Sbjct: 221  TESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEKVLKE 280

Query: 1051 MVADGFSPDCSTFSHILEGLGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGD 872
            MV  GF PDC ++SH++EGLGR+ RI D++EIFDN+K    +PD  VYNAMI N+IS  D
Sbjct: 281  MVESGFGPDCLSYSHLIEGLGRTGRINDSVEIFDNIKHKGNVPDANVYNAMICNFISARD 340

Query: 871  FDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTS 692
            FDE M+YY  ML    +PN++T+SKL++  +K RKV+DALE+F++ML RG++PTTG VTS
Sbjct: 341  FDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGVLPTTGLVTS 400

Query: 691  FLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESG 512
            FL+PLCSYGPPHAAM+IYQK+RK GC+IS SAYKLLL RLSRFGKCGMLL +WDEMQESG
Sbjct: 401  FLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEMQESG 460

Query: 511  YSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLN 371
            Y SDVEVYEY+++GLC    LENAVLVMEE +RKGFCP+R +YS+L+
Sbjct: 461  YPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNRFVYSRLS 507


>ref|XP_002865400.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297311235|gb|EFH41659.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 675

 Score =  565 bits (1457), Expect = e-158
 Identities = 277/467 (59%), Positives = 349/467 (74%)
 Frame = -3

Query: 1771 IGERHVLSELSNLLPIRYEASTHNHFPESSPEKQLDSRSVDGFLSPEEKLRGVFLQKLRG 1592
            + E +VL+ELS+LLPI       +++   S   Q+   S+D FLSP EKLRGVFLQKL+G
Sbjct: 42   LDESYVLAELSSLLPISSSLVKEDNY---SSRNQV---SIDSFLSPAEKLRGVFLQKLKG 95

Query: 1591 KIAIEHALTNXXXXXXXXXXXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKA 1412
            K AI++ L++             +NRGNLSGE MV FFNWAI+   + KD+  Y +I++A
Sbjct: 96   KSAIQNCLSSLGIDLSIDIVSDVLNRGNLSGEAMVTFFNWAIREPGVSKDVDSYCVILRA 155

Query: 1411 LGRRKFFKHMVGILHDMKMEGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCD 1232
            LGRRKFF  M+ +L  M  EG+ P    L I MDSFVR H+V +AI+L    E  G KC 
Sbjct: 156  LGRRKFFSFMMDVLRGMVCEGVNPDLRCLTIAMDSFVRAHYVRRAIELFEESESYGVKCS 215

Query: 1231 TESLNILLQCLCQRSHVGAANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKA 1052
            TES N LL+CLC+RSHV AANSVF++ KGKIP++S +YNI+ISGWSK GE+  +EK LK 
Sbjct: 216  TESFNALLRCLCERSHVSAANSVFNAKKGKIPFDSCSYNIMISGWSKLGEIEGMEKVLKE 275

Query: 1051 MVADGFSPDCSTFSHILEGLGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGD 872
            MV  GF PDC ++SH++EGLGR+ RI D++EIFDNMK    + D  VYNAMI N+IS  D
Sbjct: 276  MVEGGFVPDCLSYSHLIEGLGRAGRINDSVEIFDNMKHKGSVLDANVYNAMICNFISARD 335

Query: 871  FDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTS 692
            FDE M+YY  ML    +PN++T+SKL++  +K RKV+DALE+F++ML RGI+PTTG VTS
Sbjct: 336  FDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGILPTTGLVTS 395

Query: 691  FLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESG 512
            FL+PLCSYGPPHAAM+IYQK+RK GC+IS SAYKLLL RLSRFGKCGMLL +WDEMQE G
Sbjct: 396  FLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEMQECG 455

Query: 511  YSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLN 371
            Y SDVEVYEY+++GLC    LENAVLVMEE +RKGFCP+R +YS+L+
Sbjct: 456  YPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNRFVYSRLS 502


>gb|EYU43538.1| hypothetical protein MIMGU_mgv1a024877mg [Mimulus guttatus]
          Length = 553

 Score =  550 bits (1417), Expect = e-154
 Identities = 285/562 (50%), Positives = 385/562 (68%), Gaps = 5/562 (0%)
 Frame = -3

Query: 1924 MLLQSQRLIGFIWRSHCLS---SSIPFFPFSTVPLPLTTLKDEPEKNQT-KNLVNIGERH 1757
            M L+      F +RS  ++   SSI  F FST+ + L         N T KN  N  E  
Sbjct: 1    MFLKRFSAKSFTYRSLLMNYRHSSISEFAFSTLEIDL---------NPTYKNHSNGDESR 51

Query: 1756 VLSELSNLLPIRYEASTHNHFPESSPEKQLD-SRSVDGFLSPEEKLRGVFLQKLRGKIAI 1580
            +LS+LS++ P             + P +Q + S +VD FL PE+KLRGVFLQ+  G+ AI
Sbjct: 52   ILSQLSDIFPTSISNPAAAAVAVNPPPRQSEISAAVDDFLPPEDKLRGVFLQRFSGETAI 111

Query: 1579 EHALTNXXXXXXXXXXXXXVNRGNLSGEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRR 1400
              AL+              +NRGNL G+ MV FFNWAI+   + K I  YH+++K+LGRR
Sbjct: 112  HRALSGVGVELNDDVFAKVLNRGNLCGKSMVAFFNWAIEQPDLSKGIDSYHVVLKSLGRR 171

Query: 1399 KFFKHMVGILHDMKMEGITPVFESLLIVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESL 1220
            KFF HM+ +L D++ +G+ P  E+L I MDS+VR   VSKA +    L++ G   + E+ 
Sbjct: 172  KFFVHMMEMLKDIRDKGMCPNSETLFIFMDSYVRARQVSKATKFFGELDKYGLVFNEETF 231

Query: 1219 NILLQCLCQRSHVGAANSVFHSMKGKIPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVAD 1040
             + L+CL QRS+V  A  +F+ M+ K+  +   YNIII GWSKFG V+EIEK LK MV +
Sbjct: 232  TVALKCLSQRSYVATACLLFNKMRDKVQCDCAMYNIIIGGWSKFGAVSEIEKYLKVMVDE 291

Query: 1039 GFSPDCSTFSHILEGLGRSERIADAIEIFDNMKENDCLPDTGVYNAMIFNYISVGDFDEC 860
            G  PDC T+S+++EG GR+ +I DA++IF  ++E       GVYNA+IFN I+ GD +  
Sbjct: 292  GVEPDCVTYSYVIEGFGRAGKIDDAVKIFKYLEEKGSGLSGGVYNAVIFNCIASGDINGA 351

Query: 859  MKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKMLERGIIPTTGTVTSFLQP 680
            +KYYE MLSN  +PN+DT+++ I  FLK+R+V+DA+ M D+ML RG+IP+TG +T F++P
Sbjct: 352  LKYYEEMLSNCFEPNIDTYTRFIVYFLKSRRVSDAIGMLDEMLGRGVIPSTGILTGFIEP 411

Query: 679  LCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSD 500
            LCSYGPP+AA+M+Y+KARK GC+IS +AYKLLL RLSRFGK GMLL + DEMQESGYSSD
Sbjct: 412  LCSYGPPYAALMVYKKARKAGCRISFTAYKLLLSRLSRFGKFGMLLNILDEMQESGYSSD 471

Query: 499  VEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFL 320
            ++VYEY+INGLCN  +LE AV VMEEC+RKGF P ++I SKLNN L+ SNKV  AYKLFL
Sbjct: 472  MQVYEYIINGLCNTGKLETAVKVMEECIRKGFYPGKIICSKLNNMLMDSNKVEVAYKLFL 531

Query: 319  KIKTARLSENARKYWRSNGWHF 254
            K++ AR++ENA++YWR+ GWHF
Sbjct: 532  KLRKARVNENAQRYWRAKGWHF 553


>ref|XP_006598344.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g43820-like [Glycine max]
          Length = 482

 Score =  533 bits (1374), Expect = e-149
 Identities = 278/517 (53%), Positives = 357/517 (69%), Gaps = 1/517 (0%)
 Frame = -3

Query: 1858 PFFPF-STVPLPLTTLKDEPEKNQTKNLVNIGERHVLSELSNLLPIRYEASTHNHFPESS 1682
            P  PF ST   PL++L   P  +   NL    +R VL +LS+L P     S +  FP   
Sbjct: 15   PHNPFPSTRRSPLSSLHAPPPHHDQPNL---DDRLVLDQLSHLFPTLTSKSQNPVFPNPH 71

Query: 1681 PEKQLDSRSVDGFLSPEEKLRGVFLQKLRGKIAIEHALTNXXXXXXXXXXXXXVNRGNLS 1502
            P     + +VD FL PE+KLRGVFLQKL+G+ AIE AL+N                    
Sbjct: 72   PNA---ANAVDAFLPPEDKLRGVFLQKLKGRAAIESALSN-------------------- 108

Query: 1501 GEWMVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGITPVFESLL 1322
                                       + ALGRRKFF  M+  L DM+   I      L 
Sbjct: 109  ---------------------------VAALGRRKFFDFMMDALCDMRRNAIDGDLFMLS 141

Query: 1321 IVMDSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGAANSVFHSMKGK 1142
            +V+DSFVR  HVS+AIQ+  NL+++G + DTE+LN+LL CLC+RSHVGAANSV +SMKGK
Sbjct: 142  VVVDSFVRAGHVSRAIQVFGNLDDLGVRRDTEALNVLLLCLCRRSHVGAANSVLNSMKGK 201

Query: 1141 IPYNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSERIADAI 962
            + ++  TYN +  GWS+FG V+E+E+ ++ M ADG  PDC TF  ++EGLGR  R+ +A+
Sbjct: 202  VDFDVGTYNAVAGGWSRFGRVSEVERVMREMEADGLRPDCRTFGFLIEGLGREGRMDEAV 261

Query: 961  EIFDNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAF 782
            EI   MKE +C PDT  YNA+IFN++SVGDF+EC+KYY  MLS+N +PN+DT++++I  F
Sbjct: 262  EILCGMKEMNCQPDTETYNAVIFNFVSVGDFEECIKYYNRMLSDNCEPNLDTYARMINRF 321

Query: 781  LKARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISS 602
            L+ARKVADAL MFD+ML RG++P+TGT+T+F++ LCSYGPP+AA+MIY+KARK+GC IS 
Sbjct: 322  LRARKVADALLMFDEMLRRGVVPSTGTITTFIKRLCSYGPPYAALMIYKKARKLGCVISM 381

Query: 601  SAYKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEE 422
             AYK+LLMRLS  GKCG LL +W+EMQE GYSSD+EVYE +I+GLCN  QLENAVLVMEE
Sbjct: 382  EAYKILLMRLSMVGKCGTLLSIWEEMQECGYSSDLEVYECIISGLCNVGQLENAVLVMEE 441

Query: 421  CLRKGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIK 311
             LRKGFCPSRL+YSKL+N+LLAS+K  RAYKLFLKIK
Sbjct: 442  ALRKGFCPSRLVYSKLSNRLLASDKSERAYKLFLKIK 478



 Score = 59.7 bits (143), Expect = 5e-06
 Identities = 56/280 (20%), Positives = 108/280 (38%)
 Frame = -3

Query: 1117 NIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSERIADAIEIFDNMKE 938
            ++++  + + G V+   +    +   G   D    + +L  L R   +  A  + ++MK 
Sbjct: 141  SVVVDSFVRAGHVSRAIQVFGNLDDLGVRRDTEALNVLLLCLCRRSHVGAANSVLNSMKG 200

Query: 937  NDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVAD 758
                 D G YNA+   +   G   E  +    M ++ + P+  TF  LI    +  ++ +
Sbjct: 201  KVDF-DVGTYNAVAGGWSRFGRVSEVERVMREMEADGLRPDCRTFGFLIEGLGREGRMDE 259

Query: 757  ALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLM 578
            A+E+   M E    P T T  + +    S G     +  Y +     C+ +   Y  ++ 
Sbjct: 260  AVEILCGMKEMNCQPDTETYNAVIFNFVSVGDFEECIKYYNRMLSDNCEPNLDTYARMIN 319

Query: 577  RLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCP 398
            R  R  K    L ++DEM   G           I  LC+      A+++ ++  + G   
Sbjct: 320  RFLRARKVADALLMFDEMLRRGVVPSTGTITTFIKRLCSYGPPYAALMIYKKARKLGCVI 379

Query: 397  SRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKY 278
            S   Y  L  +L    K G    ++ +++    S +   Y
Sbjct: 380  SMEAYKILLMRLSMVGKCGTLLSIWEEMQECGYSSDLEVY 419


>ref|XP_002307761.2| hypothetical protein POPTR_0005s26850g [Populus trichocarpa]
            gi|550339816|gb|EEE94757.2| hypothetical protein
            POPTR_0005s26850g [Populus trichocarpa]
          Length = 398

 Score =  519 bits (1337), Expect = e-144
 Identities = 244/370 (65%), Positives = 307/370 (82%), Gaps = 1/370 (0%)
 Frame = -3

Query: 1492 MVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGITPVFESLLIVM 1313
            M++FFNWAIK   I KD+  Y+++I+ALGRRKF   MV  LH++++EG++   E+  IV+
Sbjct: 1    MIMFFNWAIKQPMISKDVDSYNVVIRALGRRKFIDFMVKFLHELRVEGVSMNSETFSIVI 60

Query: 1312 DSFVRVHHVSKAIQLLRNLEE-IGSKCDTESLNILLQCLCQRSHVGAANSVFHSMKGKIP 1136
            DS VR   V KAIQ+  NLEE  G + D ESLN+LLQCLC+RSHVGAANS F+S+KGKIP
Sbjct: 61   DSLVRARRVYKAIQMFGNLEEEFGFERDAESLNVLLQCLCRRSHVGAANSYFNSVKGKIP 120

Query: 1135 YNSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSERIADAIEI 956
            +N  TYN+II GWSKFG V+E+++  + M  DGFSPDC +FS++LEGLGR+ +I DA+ I
Sbjct: 121  FNCMTYNVIIGGWSKFGRVSEMQRVFEEMEEDGFSPDCLSFSYLLEGLGRAGKIEDAVMI 180

Query: 955  FDNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLK 776
            F +++E  C+PDT VYNAMI N+ISVG+FDECMKYY  +LS N DPN+DT++++I+  +K
Sbjct: 181  FGSLEEKGCVPDTNVYNAMISNFISVGNFDECMKYYRCLLSKNCDPNIDTYTRMISGLIK 240

Query: 775  ARKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSA 596
            A KVADALEMFD+ML+RG++  TGTVTSF++PLCS+GPPHAAM+IY KARKVGCKIS SA
Sbjct: 241  ASKVADALEMFDEMLDRGMVTKTGTVTSFIEPLCSFGPPHAAMVIYTKARKVGCKISLSA 300

Query: 595  YKLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECL 416
            YKLLLMRLSRFGKCGM+LK+WDEMQESGYSSD+EVYEY+I+GLCN  Q ENAVLVMEE +
Sbjct: 301  YKLLLMRLSRFGKCGMMLKIWDEMQESGYSSDMEVYEYLISGLCNIGQFENAVLVMEESM 360

Query: 415  RKGFCPSRLI 386
            RKGFCPSR +
Sbjct: 361  RKGFCPSRCL 370



 Score = 86.3 bits (212), Expect = 5e-14
 Identities = 69/324 (21%), Positives = 139/324 (42%), Gaps = 5/324 (1%)
 Frame = -3

Query: 1234 DTESLNILLQCLCQRSHVGAANSVFHSMKGK-IPYNSTTYNIIISGWSKFGEVNEIEKCL 1058
            D +S N++++ L +R  +       H ++ + +  NS T++I+I    +   V +  +  
Sbjct: 17   DVDSYNVVIRALGRRKFIDFMVKFLHELRVEGVSMNSETFSIVIDSLVRARRVYKAIQMF 76

Query: 1057 KAMVAD-GFSPDCSTFSHILEGLGRSERIADAIEIFDNMKEN---DCLPDTGVYNAMIFN 890
              +  + GF  D  + + +L+ L R   +  A   F+++K     +C+     YN +I  
Sbjct: 77   GNLEEEFGFERDAESLNVLLQCLCRRSHVGAANSYFNSVKGKIPFNCM----TYNVIIGG 132

Query: 889  YISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMFDKMLERGIIPT 710
            +   G   E  + +E M  +   P+  +FS L+    +A K+ DA+ +F  + E+G +P 
Sbjct: 133  WSKFGRVSEMQRVFEEMEEDGFSPDCLSFSYLLEGLGRAGKIEDAVMIFGSLEEKGCVPD 192

Query: 709  TGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSRFGKCGMLLKLWD 530
            T    + +    S G     M  Y+      C  +   Y  ++  L +  K    L+++D
Sbjct: 193  TNVYNAMISNFISVGNFDECMKYYRCLLSKNCDPNIDTYTRMISGLIKASKVADALEMFD 252

Query: 529  EMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLIYSKLNNKLLASN 350
            EM + G  +        I  LC+      A+++  +  + G   S   Y  L  +L    
Sbjct: 253  EMLDRGMVTKTGTVTSFIEPLCSFGPPHAAMVIYTKARKVGCKISLSAYKLLLMRLSRFG 312

Query: 349  KVGRAYKLFLKIKTARLSENARKY 278
            K G   K++ +++ +  S +   Y
Sbjct: 313  KCGMMLKIWDEMQESGYSSDMEVY 336



 Score = 72.0 bits (175), Expect = 9e-10
 Identities = 50/218 (22%), Positives = 97/218 (44%), Gaps = 1/218 (0%)
 Frame = -3

Query: 922 DTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADALEMF 743
           D   YN +I         D  +K+   +    +  N +TFS +I + ++AR+V  A++MF
Sbjct: 17  DVDSYNVVIRALGRRKFIDFMVKFLHELRVEGVSMNSETFSIVIDSLVRARRVYKAIQMF 76

Query: 742 DKMLER-GIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSAYKLLLMRLSR 566
             + E  G      ++   LQ LC      AA   +    K     +   Y +++   S+
Sbjct: 77  GNLEEEFGFERDAESLNVLLQCLCRRSHVGAANSYFNSV-KGKIPFNCMTYNVIIGGWSK 135

Query: 565 FGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPSRLI 386
           FG+   + ++++EM+E G+S D   + Y++ GL    ++E+AV++      KG  P   +
Sbjct: 136 FGRVSEMQRVFEEMEEDGFSPDCLSFSYLLEGLGRAGKIEDAVMIFGSLEEKGCVPDTNV 195

Query: 385 YSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWR 272
           Y+ + +  ++        K +  + +     N   Y R
Sbjct: 196 YNAMISNFISVGNFDECMKYYRCLLSKNCDPNIDTYTR 233


>ref|XP_006855725.1| hypothetical protein AMTR_s00044p00153760 [Amborella trichopoda]
            gi|548859512|gb|ERN17192.1| hypothetical protein
            AMTR_s00044p00153760 [Amborella trichopoda]
          Length = 413

 Score =  499 bits (1284), Expect = e-138
 Identities = 234/413 (56%), Positives = 312/413 (75%)
 Frame = -3

Query: 1492 MVVFFNWAIKHSKIPKDIACYHIIIKALGRRKFFKHMVGILHDMKMEGITPVFESLLIVM 1313
            MV FF+WAI     PKD+  Y+I++++LGRRK+F HM  +LH M  EG  P  E++LIVM
Sbjct: 1    MVTFFSWAITQPSCPKDLQNYNILLRSLGRRKYFDHMERVLHHMNKEGPKPSLETMLIVM 60

Query: 1312 DSFVRVHHVSKAIQLLRNLEEIGSKCDTESLNILLQCLCQRSHVGAANSVFHSMKGKIPY 1133
             S+ R H VSKAIQ   NLEE G   DT + N+ L+ L +R HV  A S+ H+ +GKIP+
Sbjct: 61   GSYSRAHRVSKAIQYFENLEEFGLPSDTGAFNVFLKSLSERGHVRVATSLLHTFEGKIPF 120

Query: 1132 NSTTYNIIISGWSKFGEVNEIEKCLKAMVADGFSPDCSTFSHILEGLGRSERIADAIEIF 953
            ++TTY I+I GWS+ G ++E EK   AM+++GF PDCSTF+++LEGLGR+ RI +AI +F
Sbjct: 121  DTTTYTILIGGWSRLGRISETEKIWAAMLSNGFQPDCSTFNYLLEGLGRAGRIDNAIAVF 180

Query: 952  DNMKENDCLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKA 773
            ++M E  C P+T  YNAMI N+IS G  +EC+KYY  M   +  P++ T++K+I AF+K 
Sbjct: 181  ESMGEKGCPPNTSSYNAMICNFISCGALNECVKYYATMSEKHCAPDIVTYTKMIGAFIKV 240

Query: 772  RKVADALEMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAMMIYQKARKVGCKISSSAY 593
             +VADALEMFD ML RG+IP+TGT+TSF++PLC +GPPHAA+ IY+KA+KVGCK S  AY
Sbjct: 241  CRVADALEMFDSMLGRGVIPSTGTLTSFIEPLCKFGPPHAALEIYRKAKKVGCKFSVKAY 300

Query: 592  KLLLMRLSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLR 413
            KLLL RL+RFGKCG +L++WD+M+  G+SSD EVYE VI+G CN  QL+NAVL +EE L 
Sbjct: 301  KLLLGRLARFGKCGTVLRVWDDMRTDGHSSDKEVYECVIDGFCNIGQLDNAVLALEEALS 360

Query: 412  KGFCPSRLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWRSNGWHF 254
             GFCP+++IYSKLN KLL ++KV  AYKL++KIK AR +E +RKYW +NGWHF
Sbjct: 361  LGFCPNKVIYSKLNCKLLDASKVELAYKLYVKIKEARRNELSRKYWFANGWHF 413



 Score = 71.2 bits (173), Expect = 2e-09
 Identities = 48/230 (20%), Positives = 99/230 (43%), Gaps = 1/230 (0%)
 Frame = -3

Query: 931 CLPDTGVYNAMIFNYISVGDFDECMKYYELMLSNNIDPNVDTFSKLIAAFLKARKVADAL 752
           C  D   YN ++ +      FD   +    M      P+++T   ++ ++ +A +V+ A+
Sbjct: 14  CPKDLQNYNILLRSLGRRKYFDHMERVLHHMNKEGPKPSLETMLIVMGSYSRAHRVSKAI 73

Query: 751 EMFDKMLERGIIPTTGTVTSFLQPLCSYGPPHAAM-MIYQKARKVGCKISSSAYKLLLMR 575
           + F+ + E G+   TG    FL+ L   G    A  +++    K+     ++ Y +L+  
Sbjct: 74  QYFENLEEFGLPSDTGAFNVFLKSLSERGHVRVATSLLHTFEGKI--PFDTTTYTILIGG 131

Query: 574 LSRFGKCGMLLKLWDEMQESGYSSDVEVYEYVINGLCNNEQLENAVLVMEECLRKGFCPS 395
            SR G+     K+W  M  +G+  D   + Y++ GL    +++NA+ V E    KG  P+
Sbjct: 132 WSRLGRISETEKIWAAMLSNGFQPDCSTFNYLLEGLGRAGRIDNAIAVFESMGEKGCPPN 191

Query: 394 RLIYSKLNNKLLASNKVGRAYKLFLKIKTARLSENARKYWRSNGWHF*IC 245
              Y+ +    ++   +    K +  +     + +   Y +  G    +C
Sbjct: 192 TSSYNAMICNFISCGALNECVKYYATMSEKHCAPDIVTYTKMIGAFIKVC 241


Top