BLASTX nr result

ID: Catharanthus22_contig00001970 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00001970
         (2426 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY10909.1| Pentatricopeptide repeat (PPR-like) superfamily p...   777   0.0  
ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containi...   776   0.0  
emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]   774   0.0  
ref|XP_004304772.1| PREDICTED: pentatricopeptide repeat-containi...   772   0.0  
ref|XP_006361415.1| PREDICTED: pentatricopeptide repeat-containi...   769   0.0  
gb|EXB93167.1| hypothetical protein L484_024505 [Morus notabilis]     758   0.0  
ref|XP_004236781.1| PREDICTED: pentatricopeptide repeat-containi...   755   0.0  
ref|XP_002521980.1| pentatricopeptide repeat-containing protein,...   748   0.0  
gb|EMJ06317.1| hypothetical protein PRUPE_ppa004835mg [Prunus pe...   725   0.0  
ref|XP_006487702.1| PREDICTED: pentatricopeptide repeat-containi...   723   0.0  
ref|XP_006442665.1| hypothetical protein CICLE_v10019446mg [Citr...   722   0.0  
ref|XP_004142590.1| PREDICTED: pentatricopeptide repeat-containi...   714   0.0  
ref|XP_006371094.1| hypothetical protein POPTR_0019s03630g [Popu...   706   0.0  
ref|XP_006296608.1| hypothetical protein CARUB_v10013258mg [Caps...   704   0.0  
ref|XP_002884468.1| pentatricopeptide repeat-containing protein ...   703   0.0  
ref|NP_566237.1| pentatricopeptide repeat-containing protein [Ar...   702   0.0  
dbj|BAD95034.1| hypothetical protein [Arabidopsis thaliana]           702   0.0  
ref|XP_003590960.1| Pentatricopeptide repeat-containing protein ...   678   0.0  
ref|XP_003555568.1| PREDICTED: pentatricopeptide repeat-containi...   676   0.0  
ref|XP_006589209.1| PREDICTED: pentatricopeptide repeat-containi...   669   0.0  

>gb|EOY10909.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao]
          Length = 586

 Score =  777 bits (2007), Expect = 0.0
 Identities = 379/586 (64%), Positives = 483/586 (82%), Gaps = 9/586 (1%)
 Frame = +3

Query: 171  TILSAEFFPQPLPCTNSS-KP--NSQSTNKSLVRCRNER------KSRSPQRVKVCTENR 323
            T+ S E     LP T    KP  NS S + SLV C N        KSR+ Q+V+V  E R
Sbjct: 2    TLFSTELVTHSLPFTTQQLKPTSNSHSHHTSLVSCLNHESQDSSSKSRNNQKVRVSAETR 61

Query: 324  STQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSS 503
             T L S D  E + +KLLNRSCKAGK++EA YFLE MV +G +KPDV+LCTK+IKG F+ 
Sbjct: 62   PTHLLSFDFKETHLMKLLNRSCKAGKYNEAFYFLECMVGKG-YKPDVVLCTKMIKGFFNG 120

Query: 504  KQMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYN 683
            + +EKA RV++ILE YGEPDVFAYNA+ISGFCK N++D AN++L+RMR+RG SPD+VTYN
Sbjct: 121  RNVEKATRVIEILEKYGEPDVFAYNAIISGFCKMNRLDFANKVLDRMRSRGFSPDVVTYN 180

Query: 684  IMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSK 863
            IMIGS C+RGKL  A KV ++LL+DNCKP+V+TYTILIEAT+L+G I +AMKLLDEMLSK
Sbjct: 181  IMIGSFCSRGKLDSAYKVINQLLKDNCKPSVITYTILIEATMLQGEINEAMKLLDEMLSK 240

Query: 864  GLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDG 1043
            GL+PDM+TYN IIRG+C++GM++RA++F+ SL  +G +PDVISYNILLR LL++GKW +G
Sbjct: 241  GLRPDMFTYNAIIRGMCKDGMVNRAFKFVRSLKARGCQPDVISYNILLRVLLNQGKWAEG 300

Query: 1044 KKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISA 1223
            +KLV EM+S G EPNVVTYSIL+++LCR+GKLEEAVN+LK+M ++GLTPD ++Y+P+ISA
Sbjct: 301  EKLVTEMVSRGCEPNVVTYSILISSLCREGKLEEAVNVLKMMKERGLTPDAYSYDPLISA 360

Query: 1224 FCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPD 1403
            FC+EG++DLAI FLD MIS GCLPDIVNYNT+L+ +CKNGKA+QALE+F+KL E+GCPP+
Sbjct: 361  FCKEGRLDLAIEFLDCMISDGCLPDIVNYNTVLATLCKNGKAEQALEIFEKLREVGCPPN 420

Query: 1404 VSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLK 1583
            VS+YN M SALW++G++ KAL  +SEM++K + PDEIT+NSLISCLCRDGMVDEAIELL 
Sbjct: 421  VSSYNTMFSALWSSGDKVKALEMISEMLSKRIGPDEITYNSLISCLCRDGMVDEAIELLV 480

Query: 1584 DMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAG 1763
            DM  SG P TVI+YN VLLGLCK HRI+DAIE+L  MV K C+PNETTY+LL+EGIGFAG
Sbjct: 481  DMGCSGIPPTVISYNIVLLGLCKVHRINDAIEVLAAMVDKRCQPNETTYILLIEGIGFAG 540

Query: 1764 WRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK 1901
            WR+EAM+ A++L +   IS++SF+RL RTFP LDV K+ A ++++K
Sbjct: 541  WRSEAMELANALFRMEAISKDSFKRLNRTFPLLDVYKEFAGSDSNK 586


>ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Vitis vinifera]
          Length = 582

 Score =  776 bits (2004), Expect = 0.0
 Identities = 380/582 (65%), Positives = 481/582 (82%), Gaps = 5/582 (0%)
 Frame = +3

Query: 171  TILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCRNER-----KSRSPQRVKVCTENRSTQL 335
            TI S +FFP+  P     KP S S + S+V CRN        SR+  +V V  E R   L
Sbjct: 2    TIYSTDFFPRCPPFNPQLKPTSHSHHTSIVTCRNPNPNDGFNSRNAPKVGVSAEARPAHL 61

Query: 336  QSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQME 515
            QS D  E + +KLLNRSCKAGKF+E+LYFLE +VN+G + PDVILCTKLIKG F+ K +E
Sbjct: 62   QSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKG-YTPDVILCTKLIKGFFNFKNIE 120

Query: 516  KALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIG 695
            KA RV++ILE + EPDVFAYNAVISGFCK N+I++A ++LNRM+ARG  PDIVTYNIMIG
Sbjct: 121  KASRVMEILESHTEPDVFAYNAVISGFCKVNRIEAATQVLNRMKARGFLPDIVTYNIMIG 180

Query: 696  SLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQP 875
            SLCNR KLGLA KV D+LL DNC PTV+TYTILIEATI+EGGI +AMKLL+EML++GL P
Sbjct: 181  SLCNRRKLGLALKVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARGLLP 240

Query: 876  DMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLV 1055
            DMYTYN IIRG+C+EGM++RA E I+SL  KG KPDVISYNILLR+ L++GKW +G+KLV
Sbjct: 241  DMYTYNAIIRGMCKEGMVERAAELITSLTSKGCKPDVISYNILLRAFLNQGKWDEGEKLV 300

Query: 1056 AEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCRE 1235
            AEM S G EPN VTYSIL+++LCR G+++EA+++LK+M++K LTPDT++Y+P+ISA C+E
Sbjct: 301  AEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISALCKE 360

Query: 1236 GKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTY 1415
            G++DLAI  +DYMIS+GCLPDIVNYNTIL+A+CKNG A+QALE+F+KL  +GCPP+VS+Y
Sbjct: 361  GRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNVSSY 420

Query: 1416 NAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMER 1595
            N M+SALW+ G+R++AL  +  M++KG+DPDEIT+NSLISCLCRDG+V+EAI LL DME+
Sbjct: 421  NTMISALWSCGDRSRALGMVPAMISKGVDPDEITYNSLISCLCRDGLVEEAIGLLDDMEQ 480

Query: 1596 SGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAE 1775
            SGF  TVI+YN VLLGLCK  RIDDAI +  EM++KGCRPNETTY+LL+EGIGFAGWR E
Sbjct: 481  SGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGWRTE 540

Query: 1776 AMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK 1901
            AM+ A+SL  ++VIS++SF+RL +TFP LDV K+++ +ET K
Sbjct: 541  AMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETKK 582


>emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]
          Length = 592

 Score =  774 bits (1999), Expect = 0.0
 Identities = 379/584 (64%), Positives = 481/584 (82%), Gaps = 5/584 (0%)
 Frame = +3

Query: 165  MTTILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCRNER-----KSRSPQRVKVCTENRST 329
            + TI S +FFP   P +   KP S S + S+V CRN        SR+  +V V  E R  
Sbjct: 10   LMTIYSTDFFPHCPPFSPQLKPTSHSHHTSIVTCRNPNPNDGYNSRNSPKVGVSAEARPA 69

Query: 330  QLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQ 509
             LQS D  E + +KLLNRSCKAGKF+E+LYFLE +VN+G + PDVILCTKLIKG F+ K 
Sbjct: 70   HLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKG-YTPDVILCTKLIKGFFNFKN 128

Query: 510  MEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIM 689
            +EKA RV++ILE + EPDVFAYNAVISGFCK NQI++A ++LNRM+ARG  PDIVTYNIM
Sbjct: 129  IEKASRVMEILESHTEPDVFAYNAVISGFCKVNQIEAATQVLNRMKARGFLPDIVTYNIM 188

Query: 690  IGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGL 869
            IGSLCNR KLGLA  V D+LL DNC PTV+TYTILIEATI+EGGI +AMKLL+EML++GL
Sbjct: 189  IGSLCNRRKLGLALTVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARGL 248

Query: 870  QPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKK 1049
             PDMYTYN IIRG+C+EGM++RA E I+SL  KG +PDVISYNILLR+ L++GKW +G+K
Sbjct: 249  LPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCEPDVISYNILLRAFLNQGKWDEGEK 308

Query: 1050 LVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFC 1229
            LVAEM S G EPN VTYSIL+++LCR G+++EA+++LK+M++K LTPDT++Y+P+ISA C
Sbjct: 309  LVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISALC 368

Query: 1230 REGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVS 1409
            +EG++DLAI  +DYMIS+GCLPDIVNYNTIL+A+CKNG A+QALE+F+KL  +GCPP+VS
Sbjct: 369  KEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNVS 428

Query: 1410 TYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDM 1589
            +YN M+SALW+ G+R++AL  +  M++KG+DPDEIT+NSLISCLCRDG+V+EAI LL DM
Sbjct: 429  SYNTMISALWSCGDRSRALGMVPAMISKGIDPDEITYNSLISCLCRDGLVEEAIGLLDDM 488

Query: 1590 ERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWR 1769
            E+SGF  TVI+YN VLLGLCK  RIDDAI +  EM++KGCRPNETTY+LL+EGIGFAGWR
Sbjct: 489  EQSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGWR 548

Query: 1770 AEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK 1901
             EAM+ A+SL  ++VIS++SF+RL +TFP LDV K+++ +ET K
Sbjct: 549  TEAMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETKK 592


>ref|XP_004304772.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 586

 Score =  772 bits (1993), Expect = 0.0
 Identities = 371/582 (63%), Positives = 470/582 (80%), Gaps = 9/582 (1%)
 Frame = +3

Query: 174  ILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCR---------NERKSRSPQRVKVCTENRS 326
            I+S E  P     T+  KP S S + + + CR             SR+P RV V  E +S
Sbjct: 3    IVSTELLPHSFHTTSQLKPTSHSHHPTALSCRASSASSISNGRNSSRNPTRVSVSAEPKS 62

Query: 327  TQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSK 506
            TQLQ+ D  + + +K+LNRSCKAG+++EA+YFLE MVN+G +KPDVILCTKLIKG F+S+
Sbjct: 63   TQLQNYDFKDTHLMKVLNRSCKAGQYNEAIYFLELMVNKG-YKPDVILCTKLIKGFFNSR 121

Query: 507  QMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNI 686
             +EKA+RV++ILE YGEPD+FAYNA+ISGFCK N+I+SAN++L+RM+++G  PD+VTYNI
Sbjct: 122  NIEKAIRVMQILEQYGEPDLFAYNALISGFCKANRIESANKVLDRMKSQGFKPDVVTYNI 181

Query: 687  MIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKG 866
            MIGSLC+RGKLGLA +V D L+ DNCKPTV+TYTILIEA IL+GGI +AMKLLDEMLS+G
Sbjct: 182  MIGSLCSRGKLGLALQVMDRLVRDNCKPTVITYTILIEAIILDGGINEAMKLLDEMLSRG 241

Query: 867  LQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGK 1046
            L+PDMYTYN I+RG+CREGM+DRA+EF+     KG  P+VISYNILLR+LL+ GKW +G+
Sbjct: 242  LKPDMYTYNAIVRGMCREGMLDRAFEFVKCFDAKGCAPNVISYNILLRALLNRGKWEEGE 301

Query: 1047 KLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAF 1226
             LVA M + G EPNVVTYSIL++ LCRDGK+E+ +N+LK+M +KGLTPD ++Y+P+IS F
Sbjct: 302  NLVANMCARGCEPNVVTYSILISTLCRDGKVEDGMNVLKIMKEKGLTPDAYSYDPLISCF 361

Query: 1227 CREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDV 1406
            C+EG++DLAI  LD MIS GCLPDIVNYNT+L+A+CKNG ADQALE+F+ L E+GCPP+V
Sbjct: 362  CKEGRLDLAIELLDCMISDGCLPDIVNYNTVLAALCKNGSADQALEIFENLGEVGCPPNV 421

Query: 1407 STYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKD 1586
            S+YN M SALWN G+R +AL  +S+M++KG++PDEIT+NSLISCLCRDGMV+EAI LL D
Sbjct: 422  SSYNTMFSALWNCGDRVRALGMVSDMVSKGIEPDEITYNSLISCLCRDGMVNEAIGLLVD 481

Query: 1587 MERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGW 1766
            ME  GF  TVITYN VLLGL KA RI DAIE+   MV+KGCRPNETTY+LL+EGIGFAGW
Sbjct: 482  MEAGGFQPTVITYNIVLLGLSKARRIVDAIEVFTAMVEKGCRPNETTYILLIEGIGFAGW 541

Query: 1767 RAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTE 1892
            RAEAM+ A S+   + I  +SF+RL RTFP LDV K++  +E
Sbjct: 542  RAEAMELAKSVYSLSAICEDSFKRLSRTFPMLDVYKELTLSE 583


>ref|XP_006361415.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Solanum tuberosum]
          Length = 583

 Score =  770 bits (1987), Expect = 0.0
 Identities = 390/583 (66%), Positives = 481/583 (82%), Gaps = 11/583 (1%)
 Frame = +3

Query: 165  MTTILSAEFFPQPLPCTNSSKPNSQSTNKS-LVRC------RNERKSRSPQRVKVCTEN- 320
            MT I+ AE FPQ    +N+ KP SQS+  + +VRC      +++ K+R+P RVK+ +EN 
Sbjct: 1    MTRIIPAEIFPQCPFFSNNLKPKSQSSKHNFVVRCSSSSNDQSKVKTRNPLRVKISSENY 60

Query: 321  RSTQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFS 500
            R T          + +K+LN SCK GK+ E LY LE  +  G +KPDVILCTKLIKG  +
Sbjct: 61   RPT----------HDMKVLNWSCKVGKYDETLYLLECKLKSG-YKPDVILCTKLIKGFCN 109

Query: 501  SKQMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTY 680
            SK  +K ++V++ILE +GEPDVFAYNA+ISGFCK N+I+ AN++LNRM+ARG  PD VTY
Sbjct: 110  SKNSDKGVKVMQILEQFGEPDVFAYNALISGFCKMNKIEEANKVLNRMKARGFPPDSVTY 169

Query: 681  NIMIGSLCNRGKLGLAQKVFDELLEDN-CKPTVVTYTILIEATILEGGIRKAMKLLDEML 857
            NI+IGSLC+RGKLG A K+ D+L E+N CKPTV+TYTILIEATILEGGI +AMKLLDEML
Sbjct: 170  NILIGSLCDRGKLGSALKLLDQLKEENNCKPTVITYTILIEATILEGGIHEAMKLLDEML 229

Query: 858  SKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLS-EGKW 1034
            S+GLQPDMYTYN IIRG+CRE MMD+AYEF+ SLP KG KPDVISYNILLR+LL  +GKW
Sbjct: 230  SRGLQPDMYTYNAIIRGMCREKMMDQAYEFVRSLPSKGCKPDVISYNILLRALLHHKGKW 289

Query: 1035 RDGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPV 1214
             DG+KL+ EMLS G EPNVVTYSIL++ALCRDGKL+EA+NLLK+MMDKGLTPDTFTY+P+
Sbjct: 290  SDGEKLMNEMLSAGCEPNVVTYSILMSALCRDGKLDEAINLLKIMMDKGLTPDTFTYDPL 349

Query: 1215 ISAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGC 1394
            ISAFC+ G++DLAI FLDYMIS+GCLPDIVNYNTILS MCK GKAD+A+EVF+KL E+GC
Sbjct: 350  ISAFCKGGRLDLAIKFLDYMISNGCLPDIVNYNTILSTMCKKGKADEAMEVFEKLAEIGC 409

Query: 1395 PPDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIE 1574
            PPDVSTYN ++SALWNNG R +AL  +SEM+ KG+DPDEIT+N+LISCLCRDGMV+EA++
Sbjct: 410  PPDVSTYNTLMSALWNNGGRARALKMVSEMIEKGVDPDEITYNALISCLCRDGMVNEALD 469

Query: 1575 LLKDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIG 1754
            LL DME +GFP TVITYN +LLGLCKAHR+ +AIE+L EMV+KG RPNETTY+LL+EGIG
Sbjct: 470  LLGDMEGNGFPPTVITYNILLLGLCKAHRVVEAIEVLAEMVEKGRRPNETTYILLIEGIG 529

Query: 1755 FAGWRAEAMKAASSLLKKNVISRESFRRLKRTFPALDV-TKDV 1880
            F+G R +AM+ AS++  KN IS+ES +RL++TF   DV  KD+
Sbjct: 530  FSGRRVQAMEMASAIYHKNAISKESLQRLRKTFQVPDVYNKDI 572


>gb|EXB93167.1| hypothetical protein L484_024505 [Morus notabilis]
          Length = 587

 Score =  758 bits (1957), Expect = 0.0
 Identities = 365/583 (62%), Positives = 473/583 (81%), Gaps = 10/583 (1%)
 Frame = +3

Query: 174  ILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCRN---------ERKSRSPQRVKVCTENRS 326
            I+S EF PQ LP +   K ++   + + + CRN          +K++ P RV+V  E +S
Sbjct: 3    IISTEFLPQTLPFSPQPKQHTSRQSHTCLSCRNPSQSSTDIYRKKNKKPLRVRVSVETKS 62

Query: 327  TQLQSN-DSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSS 503
               QSN D +E + +K++NRSCK+GK++EALYFLE MV++G  KPDVILCTK+++G F+S
Sbjct: 63   PNSQSNSDFSESHLLKVINRSCKSGKYNEALYFLELMVSKG-FKPDVILCTKVMRGFFNS 121

Query: 504  KQMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYN 683
            + + KA+RV++ILE +GEPD+F+YNA+ISGFCK N+++ AN++L+RMR +G SPD +TYN
Sbjct: 122  RNIPKAIRVMEILEKHGEPDLFSYNAMISGFCKANRVELANKVLDRMRVQGFSPDTITYN 181

Query: 684  IMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSK 863
            IMIGSLC+RGK+ +A KV DELL DNCKP+V+TYTILIEATI EGG+ KAM++L+EMLS+
Sbjct: 182  IMIGSLCSRGKVDMAFKVLDELLRDNCKPSVITYTILIEATISEGGVDKAMEVLEEMLSR 241

Query: 864  GLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDG 1043
            GL PDM+TYN I+RG+CREGM+DRA+EF+ SL  KG  P+VISYNILLR+LL+ GKW DG
Sbjct: 242  GLLPDMFTYNAIVRGMCREGMLDRAFEFVRSLEAKGCSPNVISYNILLRALLNRGKWSDG 301

Query: 1044 KKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISA 1223
            +K++++M+S G EPNVVTYSIL++ LCRDGK+E+AVN+LK M +KG+TPD ++Y+P+ISA
Sbjct: 302  EKILSDMVSRGCEPNVVTYSILISTLCRDGKVEDAVNVLKAMKEKGITPDAYSYDPLISA 361

Query: 1224 FCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPD 1403
            FC+EG++DLAI F+DYMIS G LPDIVNYNTIL+A+CKNG AD ALE+F+KL E+GCPP 
Sbjct: 362  FCKEGRLDLAIEFMDYMISDGSLPDIVNYNTILAALCKNGNADHALEIFEKLGEVGCPPT 421

Query: 1404 VSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLK 1583
            VS+YN M SALWN GER KAL  +SEM++K ++PDEIT+NSLISCLCR+GMV+EAI LL 
Sbjct: 422  VSSYNTMFSALWNCGERIKALEMISEMVSKRINPDEITYNSLISCLCREGMVNEAIGLLI 481

Query: 1584 DMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAG 1763
            DME  GF  +VI+YN VLLGLCKA RIDDAIE+L  MV+KGCRPNETTY LL+EGIGFAG
Sbjct: 482  DMEAGGFKLSVISYNIVLLGLCKARRIDDAIELLAAMVEKGCRPNETTYTLLIEGIGFAG 541

Query: 1764 WRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTE 1892
            WR EAM  A+ L     IS  SF+RL +TFP LDV K++  +E
Sbjct: 542  WRVEAMGLANLLFDIEAISEHSFKRLNKTFPMLDVYKELTLSE 584


>ref|XP_004236781.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Solanum lycopersicum]
          Length = 575

 Score =  755 bits (1950), Expect = 0.0
 Identities = 378/565 (66%), Positives = 468/565 (82%), Gaps = 9/565 (1%)
 Frame = +3

Query: 213  TNSSKPNSQSTNKS-LVRCR--NERKS---RSPQRVKVCTENRSTQLQSNDSAEPNFVKL 374
            +N+ KP S+S+  + +VRC   NE      R+PQRVK+ +ENR     S+D      +K+
Sbjct: 14   SNNLKPKSESSKHNFVVRCSISNEESRVNIRNPQRVKISSENRG----SHD------MKV 63

Query: 375  LNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVYG 554
            LN SCK GK+ E LY LE  V  G +KPDVILCTKLIKG F+SK  +K ++V++ILE +G
Sbjct: 64   LNWSCKVGKYDETLYLLECKVKSG-YKPDVILCTKLIKGFFNSKNSDKGVKVMQILEQFG 122

Query: 555  EPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQK 734
            EPDVFAYNA++SGFCK N+I+ AN++LNRM+  G  PD VTYNI+IGSLC+RGKLG A  
Sbjct: 123  EPDVFAYNALVSGFCKMNKIEEANKVLNRMKTHGFPPDSVTYNILIGSLCDRGKLGSALM 182

Query: 735  VFDELLED-NCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGL 911
            + D+L E+ NCKPTV+TYTILIEATILEGGI +AMKLLDEMLS GLQPDMYTYN IIRG+
Sbjct: 183  LLDQLKEEHNCKPTVITYTILIEATILEGGIHEAMKLLDEMLSIGLQPDMYTYNAIIRGM 242

Query: 912  CREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSE-GKWRDGKKLVAEMLSTGSEPN 1088
            CRE MMD+AYEF+ SLP KG KPDVISYNILLR+LL   GKW DG+KL+ EML  G EPN
Sbjct: 243  CREKMMDQAYEFVRSLPSKGCKPDVISYNILLRALLHHRGKWSDGEKLMNEMLCAGCEPN 302

Query: 1089 VVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLD 1268
            VVTYSIL++ALCRDGKL+EA+NLLK+M+DKGLTPDTFTY+P+ISAFC+ G++D+AI FLD
Sbjct: 303  VVTYSILMSALCRDGKLDEAINLLKIMVDKGLTPDTFTYDPLISAFCKGGRLDMAIKFLD 362

Query: 1269 YMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNG 1448
            YMI++GCLPDIVNYNTILS MCK GKAD+A+EVF+KL E+GCPPDVSTYN ++SALWNNG
Sbjct: 363  YMITNGCLPDIVNYNTILSTMCKKGKADEAMEVFEKLAEIGCPPDVSTYNTLMSALWNNG 422

Query: 1449 ERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYN 1628
             R +AL  +SEM+ KG+DPDEIT+N+LISCLCRDGMV+EA++LL DME +GFP TVITYN
Sbjct: 423  GRARALKMVSEMIEKGVDPDEITYNALISCLCRDGMVNEALDLLGDMEGNGFPPTVITYN 482

Query: 1629 AVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKK 1808
             +LLGLCKAHR+ +AIE+L EMV+KGCRPNETTY+LL+EGIGF+G R +AM+ A+++  K
Sbjct: 483  ILLLGLCKAHRVVEAIEVLAEMVEKGCRPNETTYILLIEGIGFSGRRVQAMEMATAIYHK 542

Query: 1809 NVISRESFRRLKRTFPALDV-TKDV 1880
            N IS+ES +RL++TF   DV +KD+
Sbjct: 543  NAISKESLQRLRKTFQVPDVYSKDI 567


>ref|XP_002521980.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538784|gb|EEF40384.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 584

 Score =  748 bits (1931), Expect = 0.0
 Identities = 359/583 (61%), Positives = 471/583 (80%), Gaps = 6/583 (1%)
 Frame = +3

Query: 171  TILSAEFFPQPLPCTNSS-KPNSQSTNKSLVRC-----RNERKSRSPQRVKVCTENRSTQ 332
            T+ S EF P  +  T    KP S S + ++V C      +  K R+PQ+V+V  E R T 
Sbjct: 2    TLFSTEFLPHSISFTTQPLKPTSNSLHSTIVSCIRPELNDANKVRNPQKVRVSAETRQTH 61

Query: 333  LQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQM 512
            + S D  E + +KLLNRSC+AGK++E+LYFLE MV++G + PDVILCTKLIKG F+S+ +
Sbjct: 62   VLSFDFKEVHLMKLLNRSCRAGKYNESLYFLECMVDKG-YTPDVILCTKLIKGFFNSRNI 120

Query: 513  EKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMI 692
             KA RV++ILE YG+PDVFAYNA+ISGF K NQ+++AN +L+RM++RG  PD+VTYNIMI
Sbjct: 121  GKATRVMEILERYGKPDVFAYNALISGFIKANQLENANRVLDRMKSRGFLPDVVTYNIMI 180

Query: 693  GSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQ 872
            GS C+RGKL LA ++F+ELL+DNC+PTV+TYTILIEATIL+GGI  AMKLLDEMLSKGL+
Sbjct: 181  GSFCSRGKLDLALEIFEELLKDNCEPTVITYTILIEATILDGGIDVAMKLLDEMLSKGLE 240

Query: 873  PDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKL 1052
            PD  TYN IIRG+C+E M+D+A+E + SL  +G KPD+I+YNILLR+LLS GKW +G+KL
Sbjct: 241  PDTLTYNAIIRGMCKEMMVDKAFELLRSLSSRGCKPDIITYNILLRTLLSRGKWSEGEKL 300

Query: 1053 VAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCR 1232
            ++EM+S G +PNVVT+SIL+  LCRDGK+EEAVNLL+ M +KGL PD + Y+P+I+ FCR
Sbjct: 301  ISEMISIGCKPNVVTHSILIGTLCRDGKVEEAVNLLRSMKEKGLKPDAYCYDPLIAGFCR 360

Query: 1233 EGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVST 1412
            EG++DLA  FL+YMIS GCLPDIVNYNTI++ +C+ GKADQALEVF+KLDE+GCPP+VS+
Sbjct: 361  EGRLDLATEFLEYMISDGCLPDIVNYNTIMAGLCRTGKADQALEVFEKLDEVGCPPNVSS 420

Query: 1413 YNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDME 1592
            YN + SALW++G+R +AL  + +++N+G+DPDEIT+NSLISCLCRDGMVDEAIELL DM+
Sbjct: 421  YNTLFSALWSSGDRYRALEMILKLLNQGIDPDEITYNSLISCLCRDGMVDEAIELLVDMQ 480

Query: 1593 RSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRA 1772
               +   V++YN +LLGLCK +R +DAIE+L  M +KGC+PNETTY+LL+EGIGF+G RA
Sbjct: 481  SGRYRPNVVSYNIILLGLCKVNRANDAIEVLAAMTEKGCQPNETTYILLIEGIGFSGLRA 540

Query: 1773 EAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK 1901
            EAM+ A+SL   N IS +SF RL +TFP LDV KD+ +++ SK
Sbjct: 541  EAMELANSLHGMNAISEDSFNRLNKTFPLLDVYKDLTFSDGSK 583


>gb|EMJ06317.1| hypothetical protein PRUPE_ppa004835mg [Prunus persica]
          Length = 489

 Score =  725 bits (1871), Expect = 0.0
 Identities = 340/487 (69%), Positives = 426/487 (87%)
 Frame = +3

Query: 432  MVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQ 611
            MVN+G +KPDVILCTKLIKG F+S+ +EKA+RV++ILE YGEPD+F+YNA+ISGFCK N+
Sbjct: 1    MVNKG-YKPDVILCTKLIKGFFNSRNIEKAIRVMQILEKYGEPDLFSYNALISGFCKANR 59

Query: 612  IDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTI 791
            I+SAN++L+RMR++G SPD+VTYNIMIGSLC+RGKLGLA KV D+L++DNC+PTV+TYTI
Sbjct: 60   IESANKVLDRMRSQGFSPDVVTYNIMIGSLCSRGKLGLALKVMDQLVKDNCRPTVITYTI 119

Query: 792  LIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKG 971
            LIEATI++GGI +AMKLLDEMLS+GL+PDMYTYN +IRG+CREGM+DRA++F+ SL  KG
Sbjct: 120  LIEATIVDGGIDEAMKLLDEMLSRGLKPDMYTYNAVIRGMCREGMLDRAFQFVRSLDSKG 179

Query: 972  WKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAV 1151
              P+VISYNILLR+LL+ GKW +G+KLV  M S G EPNVVTYSIL++ LCRDGK+E+AV
Sbjct: 180  CPPNVISYNILLRALLNRGKWEEGEKLVTNMCSRGCEPNVVTYSILISTLCRDGKVEDAV 239

Query: 1152 NLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAM 1331
            N+LK+M  KGLTPD ++Y+P++SAFC+EG++DLAI FLDYMIS GCLPDIVNYNTIL+A+
Sbjct: 240  NVLKIMKKKGLTPDAYSYDPLVSAFCKEGRLDLAIEFLDYMISDGCLPDIVNYNTILAAL 299

Query: 1332 CKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDE 1511
            CK+GKADQAL++F+ L E+GCPP+VS+YN M SALWN G+R +AL  +SEM+ KG+ PDE
Sbjct: 300  CKSGKADQALQIFENLGEVGCPPNVSSYNTMFSALWNCGDRVRALGMVSEMVGKGIKPDE 359

Query: 1512 ITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEE 1691
            IT+NSLISCLCRDGMVDEAI LL DME  GF  TVI+YN +LLGLCK  R+ DAI++L E
Sbjct: 360  ITYNSLISCLCRDGMVDEAIGLLVDMETGGFQPTVISYNIILLGLCKTRRVVDAIQVLTE 419

Query: 1692 MVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVT 1871
            MV+KGCRPNETTY+LL+EGIGFAGWRAEAM+ A+S+     IS +SF+RL RTFP LDV 
Sbjct: 420  MVEKGCRPNETTYILLIEGIGFAGWRAEAMELANSVFSLRAISEDSFKRLNRTFPMLDVF 479

Query: 1872 KDVAYTE 1892
            K++  +E
Sbjct: 480  KELTLSE 486



 Score = 92.0 bits (227), Expect = 1e-15
 Identities = 58/197 (29%), Positives = 103/197 (52%), Gaps = 1/197 (0%)
 Frame = +3

Query: 372 LLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKIL-EV 548
           L++  CK G+   A+ FL+ M++ G   PD++    ++  L  S + ++AL++ + L EV
Sbjct: 260 LVSAFCKEGRLDLAIEFLDYMISDGC-LPDIVNYNTILAALCKSGKADQALQIFENLGEV 318

Query: 549 YGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLA 728
              P+V +YN + S          A  M++ M  +GI PD +TYN +I  LC  G +  A
Sbjct: 319 GCPPNVSSYNTMFSALWNCGDRVRALGMVSEMVGKGIKPDEITYNSLISCLCRDGMVDEA 378

Query: 729 QKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRG 908
             +  ++     +PTV++Y I++        +  A+++L EM+ KG +P+  TY  +I G
Sbjct: 379 IGLLVDMETGGFQPTVISYNIILLGLCKTRRVVDAIQVLTEMVEKGCRPNETTYILLIEG 438

Query: 909 LCREGMMDRAYEFISSL 959
           +   G    A E  +S+
Sbjct: 439 IGFAGWRAEAMELANSV 455



 Score = 85.1 bits (209), Expect = 1e-13
 Identities = 51/168 (30%), Positives = 86/168 (51%), Gaps = 1/168 (0%)
 Frame = +3

Query: 360 NFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKI 539
           N+  +L   CK+GK  +AL   EN+   G   P+V     +   L++     +AL ++  
Sbjct: 291 NYNTILAALCKSGKADQALQIFENLGEVGC-PPNVSSYNTMFSALWNCGDRVRALGMVSE 349

Query: 540 LEVYG-EPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGK 716
           +   G +PD   YN++IS  C+   +D A  +L  M   G  P +++YNI++  LC   +
Sbjct: 350 MVGKGIKPDEITYNSLISCLCRDGMVDEAIGLLVDMETGGFQPTVISYNIILLGLCKTRR 409

Query: 717 LGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLS 860
           +  A +V  E++E  C+P   TY +LIE     G   +AM+L + + S
Sbjct: 410 VVDAIQVLTEMVEKGCRPNETTYILLIEGIGFAGWRAEAMELANSVFS 457


>ref|XP_006487702.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Citrus sinensis]
          Length = 583

 Score =  723 bits (1867), Expect = 0.0
 Identities = 352/583 (60%), Positives = 465/583 (79%), Gaps = 6/583 (1%)
 Frame = +3

Query: 165  MTTILSAEFFPQPLPCTNSS----KPN-SQSTNKSLVRCRNERKSRSPQRVKVCTENR-S 326
            M  I S +F P+ L    S     KP  S S   ++V C N  KS    RV    E R +
Sbjct: 1    MALISSTDFVPRNLVIFTSQHQQLKPTTSHSVQSTVVSCINP-KSNERVRVSSSAETRPN 59

Query: 327  TQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSK 506
            T L S D  E  F+KL+ +S +AGKF E+LYF+E+MV  G  KPDV++CTKLIK  F  +
Sbjct: 60   THLLSFDVKETQFMKLIKKSFRAGKFDESLYFIESMVANGC-KPDVVMCTKLIKKFFQER 118

Query: 507  QMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNI 686
            +  KA+RV++ILE YGEPDVFAYNA+ISGFCK NQI+ AN++L+R+R+RG SPD+VTYNI
Sbjct: 119  KSNKAVRVMEILEKYGEPDVFAYNALISGFCKANQIELANKVLDRLRSRGFSPDVVTYNI 178

Query: 687  MIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKG 866
            MIGSLC+RG +  A KVFD+LL DNCKPTV+TYTILI+AT+LEG   KAMKLLDEML++G
Sbjct: 179  MIGSLCSRGMIESAFKVFDQLLRDNCKPTVITYTILIQATMLEGQTDKAMKLLDEMLARG 238

Query: 867  LQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGK 1046
            L PDM+T N IIRG+C++GM+ +A++F+ SL  +G +PDVISYN+LLR+LL+ GKW +G+
Sbjct: 239  LIPDMFTNNAIIRGMCKKGMVGQAFQFVRSLESRGCQPDVISYNMLLRTLLNMGKWEEGE 298

Query: 1047 KLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAF 1226
            KL+ EM+S G EPNVVTYSIL+++LCRDGK E+AV++L+   +KGLTPD ++Y+P+ISA+
Sbjct: 299  KLMTEMISRGLEPNVVTYSILISSLCRDGKTEDAVDVLRAAKEKGLTPDAYSYDPLISAY 358

Query: 1227 CREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDV 1406
            C++G++DLAI FLDYMIS GCLPDIVNYNTIL+A CKNG ADQALE+F+KL ++GCPP+V
Sbjct: 359  CKDGRLDLAIEFLDYMISDGCLPDIVNYNTILAAFCKNGNADQALEIFEKLSDVGCPPNV 418

Query: 1407 STYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKD 1586
            S+YN M SALW++G++ +AL  +SEM++KG++PDEIT+NSLISCLCRDGMVDEA+ LL D
Sbjct: 419  SSYNTMFSALWSSGDKIRALGMISEMLSKGIEPDEITYNSLISCLCRDGMVDEAVGLLVD 478

Query: 1587 MERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGW 1766
            ME + F  TVI+YN ++LG CK  RI+++IE+L  M +KGC+PNETTYVLL+EGIG+ GW
Sbjct: 479  MESTRFRPTVISYNIIILGFCKTRRINESIEVLAAMFEKGCKPNETTYVLLIEGIGYGGW 538

Query: 1767 RAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTET 1895
            RAEAM+ A++L+  + ISR++F+RL RTFP LDV K++++  T
Sbjct: 539  RAEAMELANALVSMHAISRDTFKRLNRTFPLLDVYKEISHLAT 581


>ref|XP_006442665.1| hypothetical protein CICLE_v10019446mg [Citrus clementina]
            gi|557544927|gb|ESR55905.1| hypothetical protein
            CICLE_v10019446mg [Citrus clementina]
          Length = 583

 Score =  722 bits (1864), Expect = 0.0
 Identities = 351/583 (60%), Positives = 463/583 (79%), Gaps = 6/583 (1%)
 Frame = +3

Query: 165  MTTILSAEFFPQPLPCTNSS----KPN-SQSTNKSLVRCRNERKSRSPQRVKVCTENR-S 326
            M  I S +F P+ L    S     KP  S S   ++V C N  KS    RV    E R +
Sbjct: 1    MALISSTDFVPRNLVIFTSQHQQQKPTTSHSVQSTVVSCINP-KSNERVRVSSSAETRPN 59

Query: 327  TQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSK 506
            T L S D  E  F+KL+ RS +AGKF E+LYF+E+MV  G  KPDV++CTKLIK  F  +
Sbjct: 60   THLLSFDVKETQFMKLIKRSFRAGKFDESLYFIESMVANGC-KPDVVMCTKLIKKFFQER 118

Query: 507  QMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNI 686
            +  KA+RV++ILE YGEPDVFAYNA+ISGFCK NQI+ AN++L+R+R+RG SPD+VTYNI
Sbjct: 119  KSNKAVRVMEILEKYGEPDVFAYNALISGFCKANQIELANKVLDRLRSRGFSPDVVTYNI 178

Query: 687  MIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKG 866
            MIGSLC+RG +    KVFD+LL DNCKPTV+TYTILI+AT+LEG   KAMKLLDEM ++G
Sbjct: 179  MIGSLCSRGMIESGFKVFDQLLRDNCKPTVITYTILIQATMLEGQTDKAMKLLDEMFARG 238

Query: 867  LQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGK 1046
            L PDM+T N IIRG+C++GM+ +A++F+ SL  +G +PDVISYN+LLR+LL+ GKW +G+
Sbjct: 239  LIPDMFTNNAIIRGMCKKGMVGQAFQFVRSLESRGCQPDVISYNMLLRTLLNMGKWEEGE 298

Query: 1047 KLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAF 1226
            KL+ EM+S G EPNVVTYSIL+++LCRDGK E+AV++L+   +KGLTPD ++Y+P+ISA+
Sbjct: 299  KLMTEMISRGLEPNVVTYSILISSLCRDGKTEDAVDVLRAAKEKGLTPDAYSYDPLISAY 358

Query: 1227 CREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDV 1406
            C++G++DLAI FLDYMIS GCLPDIVNYNTIL+A CKNG ADQALE+F+KL ++GCPP+V
Sbjct: 359  CKDGRLDLAIEFLDYMISDGCLPDIVNYNTILAAFCKNGNADQALEIFEKLSDVGCPPNV 418

Query: 1407 STYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKD 1586
            S+YN M SALW++G++ +AL  +SEM++KG++PDEIT+NSLISCLCRDGMVDEA+ LL D
Sbjct: 419  SSYNTMFSALWSSGDKIRALGMISEMLSKGIEPDEITYNSLISCLCRDGMVDEAVGLLVD 478

Query: 1587 MERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGW 1766
            ME + F  TV++YN ++LG CK  RI++AIE+L  M +KGC+PNETTYVLL+EGIG+ GW
Sbjct: 479  MESTRFRPTVVSYNIIILGFCKTRRINEAIEVLAAMFEKGCKPNETTYVLLIEGIGYGGW 538

Query: 1767 RAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTET 1895
            RAEAM+ A++L+  + ISR++F+RL RTFP LDV K++++  T
Sbjct: 539  RAEAMELANALVSMHAISRDTFKRLNRTFPLLDVYKEISHLAT 581


>ref|XP_004142590.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Cucumis sativus]
          Length = 581

 Score =  714 bits (1842), Expect = 0.0
 Identities = 346/581 (59%), Positives = 466/581 (80%), Gaps = 2/581 (0%)
 Frame = +3

Query: 174  ILSAEFFPQPLPCTNS-SKPN-SQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQSND 347
            + S+EF PQ L  TN  +KP   QS + S+  CR   K+   + V    E R     + D
Sbjct: 1    MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHL-RNVTSSAEFRQPHFPNLD 59

Query: 348  SAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALR 527
            + + + +KLLNRSC+AGK +E+LYFLE++V++G  KPDV+LCTKLIKG F+S+ ++KA+R
Sbjct: 60   NRDAHLMKLLNRSCRAGKHNESLYFLESVVSKG-FKPDVVLCTKLIKGFFNSRNLKKAMR 118

Query: 528  VLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCN 707
            V++ILE YG+PDV++YNA+ISGF K NQIDSAN++ +RMR+RG SPD+VTYNIMIGSLC+
Sbjct: 119  VMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCS 178

Query: 708  RGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYT 887
            RGKL LA +V DELL+D CKP+V+TYTILIEATILEG I +A++L DE++S+GL+PD+YT
Sbjct: 179  RGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYT 238

Query: 888  YNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEML 1067
            YN IIRG+C+EGM DRA +F+  L  +G  PDV+SYNILLRS L++ +W DG++L+ +M+
Sbjct: 239  YNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMV 298

Query: 1068 STGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKID 1247
             +G EPNVVT+SIL+++ CR+G++ EAVN+L++M +KGLTPD+++Y+P+ISAFC+EG++D
Sbjct: 299  LSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLD 358

Query: 1248 LAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMV 1427
            LAI +L+ M+S GCLPDIVNYNTIL+ +CK G AD AL+VF+KLDE+GCPP V  YN M 
Sbjct: 359  LAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMF 418

Query: 1428 SALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFP 1607
            SALW+ G + KAL  +SEM+ KG+DPDEIT+NSLISCLCRDG+VDEAI LL DME + F 
Sbjct: 419  SALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQ 478

Query: 1608 STVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKA 1787
             TVI++N VLLG+CKAHR+ + IE+L  MV+KGC PNET+YVLL+EGI +AGWRAEAM+ 
Sbjct: 479  PTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMEL 538

Query: 1788 ASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK*LI 1910
            A+SL +  VIS +S +RL +TFP LDV K ++ +E+   L+
Sbjct: 539  ANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLL 579


>ref|XP_006371094.1| hypothetical protein POPTR_0019s03630g [Populus trichocarpa]
            gi|550316702|gb|ERP48891.1| hypothetical protein
            POPTR_0019s03630g [Populus trichocarpa]
          Length = 586

 Score =  706 bits (1822), Expect = 0.0
 Identities = 347/581 (59%), Positives = 456/581 (78%), Gaps = 11/581 (1%)
 Frame = +3

Query: 171  TILSAEFFPQP--LPCTNSS-KPNSQSTNKSLVRC-------RNERKSRSPQRVKVCTEN 320
            T+ S EF       P T+   K +  S   ++V C        N      P+  +V  E 
Sbjct: 2    TMFSTEFISHSCSFPFTSKHFKLSLHSLQSNVVSCINPTHNDTNSNLGNPPKLRRVLPET 61

Query: 321  RSTQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFS 500
            + T + S D  E + +KLLNRSCKAGK +E+LYFLE MV +G ++PDVI+CTKLIKG F+
Sbjct: 62   KPTHVLSYDFKETHLMKLLNRSCKAGKCNESLYFLECMVAKG-YQPDVIMCTKLIKGFFN 120

Query: 501  SKQMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTY 680
            S+ +EKA RV++ILE +GEPDVFAYNAVISGFCK N+I+SA ++L+RM+ +G S D+VTY
Sbjct: 121  SRNIEKATRVMEILEKHGEPDVFAYNAVISGFCKANRIESAKKVLDRMKRKGFSQDVVTY 180

Query: 681  NIMIGSLCNRGKLGLAQKVFDELLEDN-CKPTVVTYTILIEATILEGGIRKAMKLLDEML 857
            NIMIG+ C++GK+ LA KVF+ELL+DN CKPT++TYTILIEA ILEGGI + +KLLDEML
Sbjct: 181  NIMIGTFCSKGKIDLALKVFEELLKDNNCKPTLITYTILIEAHILEGGIDEGLKLLDEML 240

Query: 858  SKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWR 1037
            S+GL+PD +TYN I+RGL +EG +++A+E + +L  +G KPDVI+YNILLR+LL +GKW 
Sbjct: 241  SRGLEPDTFTYNVIVRGLGKEGKVNQAFELVRTLNSRGCKPDVITYNILLRALLDQGKWY 300

Query: 1038 DGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVI 1217
            +G+KL+ EM S G EPNVVTYSIL+++LCRDGK+EE+VNL+K+M +KGLTPD + Y+P+I
Sbjct: 301  EGEKLMDEMFSRGCEPNVVTYSILISSLCRDGKIEESVNLVKVMKEKGLTPDAYCYDPLI 360

Query: 1218 SAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCP 1397
            +AFCREGK+D+AI FLDYMIS G LPDIVNYNTI++A+CKNG +D A+E+F KL+E+GCP
Sbjct: 361  AAFCREGKLDMAIKFLDYMISDGFLPDIVNYNTIMAALCKNGNSDHAVEIFGKLEEVGCP 420

Query: 1398 PDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIEL 1577
            P+VS+YN M+SALW +G+R +AL  +S+M++ G+DPD IT+NSLISCLCRDGMVDEAI L
Sbjct: 421  PNVSSYNTMLSALWGSGDRYRALGMISQMLSTGIDPDGITYNSLISCLCRDGMVDEAIGL 480

Query: 1578 LKDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGF 1757
            L DM    F   +++YN VLLGLCK HRIDDAIE+L  M++ GC+PNETTY LL+EGIGF
Sbjct: 481  LADMLSGRFQPNIVSYNIVLLGLCKVHRIDDAIEVLTAMIENGCQPNETTYTLLIEGIGF 540

Query: 1758 AGWRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDV 1880
            +G RA+AM+ A+SL   N IS  S++RL + FP LDV KD+
Sbjct: 541  SGSRAQAMELANSLYSMNAISEGSYKRLNKVFPLLDVYKDL 581


>ref|XP_006296608.1| hypothetical protein CARUB_v10013258mg [Capsella rubella]
            gi|482565317|gb|EOA29506.1| hypothetical protein
            CARUB_v10013258mg [Capsella rubella]
          Length = 607

 Score =  704 bits (1816), Expect = 0.0
 Identities = 337/554 (60%), Positives = 448/554 (80%), Gaps = 3/554 (0%)
 Frame = +3

Query: 216  NSSKPNSQSTNK-SLVRCRNERKSRSPQRVKVCTENRSTQLQSNDSA--EPNFVKLLNRS 386
            ++S PN  +    S        ++ + Q   V TE R  Q  S+     +   +K+ +RS
Sbjct: 44   SNSNPNHDNVKSFSSSGAARNLQAATTQDATVPTERRQHQTHSHSLGFRDTQMLKIFHRS 103

Query: 387  CKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVYGEPDV 566
            C++G + E+L+ LE+MV +G + PDVILCTKLIKG F+ + + KA+RV++ILE +G+PDV
Sbjct: 104  CRSGNYIESLHLLESMVRKG-YNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPDV 162

Query: 567  FAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFDE 746
            FAYNA+I+GFCK N+ID A  +L+RMR++G SPD VTYNIMIGSLC+RGKL LA KV D+
Sbjct: 163  FAYNALINGFCKMNRIDDATRVLDRMRSKGFSPDTVTYNIMIGSLCSRGKLVLALKVLDQ 222

Query: 747  LLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREGM 926
            LL DNC+PTV+TYTILIEAT+LEGG+ +A+KLLDEMLS+GL+PDM+TYNTIIRG+C+EGM
Sbjct: 223  LLSDNCQPTVITYTILIEATMLEGGVDEALKLLDEMLSRGLKPDMFTYNTIIRGMCKEGM 282

Query: 927  MDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYSI 1106
            +DRA+E + +L  +G +PDVISYNILLR+LL++GKW +G+KL+ +M S   +PNVVTYSI
Sbjct: 283  VDRAFEMVRNLELRGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSI 342

Query: 1107 LVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISSG 1286
            L+T LCRDGK+EEA+NLLKLM +KGL+PD ++Y+P+I+AFCREG++DLAI FL+ MIS G
Sbjct: 343  LITTLCRDGKIEEALNLLKLMKEKGLSPDAYSYDPLIAAFCREGRLDLAIEFLETMISDG 402

Query: 1287 CLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKAL 1466
            CLPDIVNYNT+L+ +CKNGKADQALE+F KL E+GC P+ S+YN M SALW++G++ +AL
Sbjct: 403  CLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRAL 462

Query: 1467 IKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLGL 1646
              +SEM+++G+DPDEIT+NS+ISCLCR+GMVDEA +LL DM    F  +V+TYN VLLG 
Sbjct: 463  HMISEMVSQGIDPDEITYNSMISCLCREGMVDEAFDLLVDMRSCEFHPSVVTYNIVLLGF 522

Query: 1647 CKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISRE 1826
            CKAHRI+DAI++LE MV  GCRPNE+TY +L+EGIGFAG+RAEAM+ A+ L++ + IS  
Sbjct: 523  CKAHRIEDAIDVLESMVGNGCRPNESTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEH 582

Query: 1827 SFRRLKRTFPALDV 1868
            SF+RL RTFP L+V
Sbjct: 583  SFKRLHRTFPLLNV 596


>ref|XP_002884468.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297330308|gb|EFH60727.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 598

 Score =  703 bits (1814), Expect = 0.0
 Identities = 338/558 (60%), Positives = 444/558 (79%)
 Frame = +3

Query: 216  NSSKPNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQSNDSAEPNFVKLLNRSCKA 395
            ++S PN  +  KS           +     + TE R    QS    +   +K+ +RSC++
Sbjct: 40   SNSNPNHDN-GKSFSSSGARNLQATTTDAAIPTERRQQHSQSLGFRDTQMLKIFHRSCRS 98

Query: 396  GKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVYGEPDVFAY 575
            G + E+L+ LE MV +G + PDVILCTKLIKG F+ + + KA+RV++ILE +G+PDVFAY
Sbjct: 99   GNYIESLHLLETMVRKG-YNPDVILCTKLIKGFFTLRNVPKAVRVMEILEKFGQPDVFAY 157

Query: 576  NAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFDELLE 755
            NA+I+GFCK N+ID A  +L+RMR++  SPD VTYNIMIGSLC+RGKL LA KV D+LL 
Sbjct: 158  NALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLDQLLS 217

Query: 756  DNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREGMMDR 935
            DNC+PTV+TYTILIEAT+LEGG+ +A+KLLDEMLS+GL+PDM+TYNTIIRG+C+EGM+DR
Sbjct: 218  DNCQPTVITYTILIEATMLEGGVDEALKLLDEMLSRGLKPDMFTYNTIIRGMCKEGMVDR 277

Query: 936  AYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYSILVT 1115
            A+E I +L  KG +PDVISYNILLR+LL++GKW +G+KL+ +M S   +PNVVTYSIL+T
Sbjct: 278  AFEMIRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILIT 337

Query: 1116 ALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISSGCLP 1295
             LCRDGK+EEA+NLLKLM +KGLTPD ++Y+P+I+AFCREG++D+AI FL+ MIS GCLP
Sbjct: 338  TLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLP 397

Query: 1296 DIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKALIKL 1475
            DIVNYNT+L+ +CKNGKADQALE+F KL E+GC P+ S+YN M SALW++G++ +AL  +
Sbjct: 398  DIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMI 457

Query: 1476 SEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLGLCKA 1655
             EM++ G+DPDEIT+NS+ISCLCR+GMVD+A ELL DM    F  +V+TYN VLLG CKA
Sbjct: 458  LEMVSNGIDPDEITYNSMISCLCREGMVDKAFELLVDMRSCEFHPSVVTYNIVLLGFCKA 517

Query: 1656 HRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISRESFR 1835
            HRI+DAI++L+ MV  GCRPNETTY +L+EGIGFAG+RAEAM+ A+ L++ N IS  SF+
Sbjct: 518  HRIEDAIDVLDSMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRINAISEYSFK 577

Query: 1836 RLKRTFPALDVTKDVAYT 1889
            RL RTFP L+V +  + T
Sbjct: 578  RLHRTFPLLNVLQRSSQT 595


>ref|NP_566237.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75207286|sp|Q9SR00.1|PP213_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g04760, chloroplastic; Flags: Precursor
            gi|6175176|gb|AAF04902.1|AC011437_17 hypothetical protein
            [Arabidopsis thaliana] gi|15810359|gb|AAL07067.1| unknown
            protein [Arabidopsis thaliana] gi|22136960|gb|AAM91709.1|
            unknown protein [Arabidopsis thaliana]
            gi|332640611|gb|AEE74132.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 602

 Score =  702 bits (1813), Expect = 0.0
 Identities = 339/562 (60%), Positives = 447/562 (79%)
 Frame = +3

Query: 204  LPCTNSSKPNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQSNDSAEPNFVKLLNR 383
            L  +NS+  N    + S    RN + + +     + TE R    QS    +   +K+ +R
Sbjct: 40   LTFSNSNPNNDNGRSFSSSGARNLQTTTTTDAT-LPTERRQQHSQSLGFRDTQMLKIFHR 98

Query: 384  SCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVYGEPD 563
            SC++G + E+L+ LE MV +G + PDVILCTKLIKG F+ + + KA+RV++ILE +G+PD
Sbjct: 99   SCRSGNYIESLHLLETMVRKG-YNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 157

Query: 564  VFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFD 743
            VFAYNA+I+GFCK N+ID A  +L+RMR++  SPD VTYNIMIGSLC+RGKL LA KV +
Sbjct: 158  VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 217

Query: 744  ELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREG 923
            +LL DNC+PTV+TYTILIEAT+LEGG+ +A+KL+DEMLS+GL+PDM+TYNTIIRG+C+EG
Sbjct: 218  QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 277

Query: 924  MMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYS 1103
            M+DRA+E + +L  KG +PDVISYNILLR+LL++GKW +G+KL+ +M S   +PNVVTYS
Sbjct: 278  MVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 337

Query: 1104 ILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISS 1283
            IL+T LCRDGK+EEA+NLLKLM +KGLTPD ++Y+P+I+AFCREG++D+AI FL+ MIS 
Sbjct: 338  ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 397

Query: 1284 GCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKA 1463
            GCLPDIVNYNT+L+ +CKNGKADQALE+F KL E+GC P+ S+YN M SALW++G++ +A
Sbjct: 398  GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 457

Query: 1464 LIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLG 1643
            L  + EMM+ G+DPDEIT+NS+ISCLCR+GMVDEA ELL DM    F  +V+TYN VLLG
Sbjct: 458  LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 517

Query: 1644 LCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISR 1823
             CKAHRI+DAI +LE MV  GCRPNETTY +L+EGIGFAG+RAEAM+ A+ L++ + IS 
Sbjct: 518  FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 577

Query: 1824 ESFRRLKRTFPALDVTKDVAYT 1889
             SF+RL RTFP L+V +  + T
Sbjct: 578  YSFKRLHRTFPLLNVLQRSSQT 599


>dbj|BAD95034.1| hypothetical protein [Arabidopsis thaliana]
          Length = 602

 Score =  702 bits (1812), Expect = 0.0
 Identities = 339/562 (60%), Positives = 447/562 (79%)
 Frame = +3

Query: 204  LPCTNSSKPNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQSNDSAEPNFVKLLNR 383
            L  +NS+  N    + S    RN + + +     + TE R    QS    +   +K+ +R
Sbjct: 40   LTFSNSNPNNDNGRSFSSSGARNLQTTTTTDAT-LPTERRQQHSQSLGFRDTQMLKIFHR 98

Query: 384  SCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVYGEPD 563
            SC++G + E+L+ LE MV +G + PDVILCTKLIKG F+ + + KA+RV++ILE +G+PD
Sbjct: 99   SCRSGNYIESLHLLETMVRKG-YNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 157

Query: 564  VFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFD 743
            VFAYNA+I+GFCK N+ID A  +L+RMR++  SPD VTYNIMIGSLC+RGKL LA KV +
Sbjct: 158  VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 217

Query: 744  ELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREG 923
            +LL DNC+PTV+TYTILIEAT+LEGG+ +A+KL+DEMLS+GL+PDM+TYNTIIRG+C+EG
Sbjct: 218  QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 277

Query: 924  MMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYS 1103
            M+DRA+E + +L  KG +PDVISYNILLR+LL++GKW +G+KL+ +M S   +PNVVTYS
Sbjct: 278  MVDRAFEMVRNLELKGSEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 337

Query: 1104 ILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISS 1283
            IL+T LCRDGK+EEA+NLLKLM +KGLTPD ++Y+P+I+AFCREG++D+AI FL+ MIS 
Sbjct: 338  ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 397

Query: 1284 GCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKA 1463
            GCLPDIVNYNT+L+ +CKNGKADQALE+F KL E+GC P+ S+YN M SALW++G++ +A
Sbjct: 398  GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 457

Query: 1464 LIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLG 1643
            L  + EMM+ G+DPDEIT+NS+ISCLCR+GMVDEA ELL DM    F  +V+TYN VLLG
Sbjct: 458  LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 517

Query: 1644 LCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISR 1823
             CKAHRI+DAI +LE MV  GCRPNETTY +L+EGIGFAG+RAEAM+ A+ L++ + IS 
Sbjct: 518  FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 577

Query: 1824 ESFRRLKRTFPALDVTKDVAYT 1889
             SF+RL RTFP L+V +  + T
Sbjct: 578  YSFKRLHRTFPLLNVLQRSSQT 599


>ref|XP_003590960.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355480008|gb|AES61211.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 590

 Score =  678 bits (1750), Expect = 0.0
 Identities = 331/587 (56%), Positives = 446/587 (75%), Gaps = 16/587 (2%)
 Frame = +3

Query: 171  TILSAEFFPQPLP----CTNSSKPNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQ 338
            T  S EF    L      ++ SKPN+     S++   NE  + + +R +    N   Q +
Sbjct: 2    TTFSTEFLSHTLNFRIHTSSHSKPNTIIITSSILFL-NEANNNNNKRRRRTNNNEQQQFR 60

Query: 339  SN-----------DSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLI 485
             N           D  + NF+K LNRSCK+ K+ E+LYFL++MVNRG +KPDVILCTKLI
Sbjct: 61   VNETKPTKHDQDYDFRDTNFMKTLNRSCKSAKYDESLYFLQHMVNRG-YKPDVILCTKLI 119

Query: 486  KGLFSSKQMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISP 665
            KG F+ K++EKA++V++ILE +G+PDVFAYNAVISGFCK +++D A+++L+RM+ RG  P
Sbjct: 120  KGFFNMKKIEKAIQVMEILEKHGKPDVFAYNAVISGFCKADRVDHASKVLDRMKKRGFEP 179

Query: 666  DIVTYNIMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLL 845
            D+VTYNI+IG+ C RG+L LA +V D+LL+DNCKPTV+TYTILIEATI +GGI +AMKLL
Sbjct: 180  DVVTYNILIGNFCGRGRLDLALRVMDQLLKDNCKPTVITYTILIEATITQGGIDEAMKLL 239

Query: 846  DEMLSKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSE 1025
            DEMLS+GL+PD YTYN ++ G+C+EGM+DRA+EF+S + + G    V +YNILLR LL+E
Sbjct: 240  DEMLSRGLRPDRYTYNVVVNGMCKEGMLDRAFEFLSRISKNGCVAGVSTYNILLRDLLNE 299

Query: 1026 GKWRDGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTY 1205
            GKW  G+KL+++ML  G EPN +TYS L+TALCRDGK++EA N+LK+M +K L PD ++Y
Sbjct: 300  GKWEYGEKLMSDMLVKGCEPNPITYSTLITALCRDGKIDEAKNVLKVMKEKALAPDGYSY 359

Query: 1206 EPVISAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDE 1385
            +P+ISA CREGK+DLAI FLD MIS G LPDI++YN+IL+++CKNG AD+AL +F+KL E
Sbjct: 360  DPLISALCREGKVDLAIEFLDDMISGGHLPDILSYNSILASLCKNGNADEALNIFEKLGE 419

Query: 1386 LGCPPDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDE 1565
            +GCPP+  +YN +  ALW++G++ +AL  + EM++ G+DPDEIT+NSLISCLCRDG+VD+
Sbjct: 420  VGCPPNAGSYNTLFGALWSSGDKIRALGMILEMLSNGIDPDEITYNSLISCLCRDGLVDQ 479

Query: 1566 AIELLKDM-ERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLV 1742
            AIELL DM E      TVI+YN VLLGLCK  RI DAIE+L  MV +GC PNETTY LL+
Sbjct: 480  AIELLVDMFESEKCQPTVISYNTVLLGLCKVQRIIDAIEVLAAMVNEGCLPNETTYTLLI 539

Query: 1743 EGIGFAGWRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVA 1883
            +GIGFAGWR +AM+ A+ L+  + IS +SF+R ++ FP  D  K++A
Sbjct: 540  QGIGFAGWRYDAMELANLLVNMDAISEDSFKRFQKIFPVFDAHKELA 586


>ref|XP_003555568.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Glycine max]
          Length = 576

 Score =  676 bits (1745), Expect = 0.0
 Identities = 333/573 (58%), Positives = 439/573 (76%), Gaps = 16/573 (2%)
 Frame = +3

Query: 177  LSAEFFPQPLPCTNSSK----PNSQST------------NKSLVRCRNERKSRSPQRVKV 308
            +S+EF    LP   +SK    PN  +T            N S  R  N   ++   RV  
Sbjct: 4    VSSEFLSHCLPLGTNSKRAWLPNPSNTVITCRIPLLNEDNPSKRRLNNNNNNKGHTRVT- 62

Query: 309  CTENRSTQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIK 488
             + +   Q Q  D  + + +K LNR CK GK++EALYFLE MV RG +KPDVILCTKLIK
Sbjct: 63   -SSDTRPQQQHYDFRDTHHMKALNRLCKTGKYTEALYFLEQMVKRG-YKPDVILCTKLIK 120

Query: 489  GLFSSKQMEKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPD 668
            GLF+SK+ EKA+RV++ILE YG+PD FAYNAVISGFC+ ++ D+AN ++ RM+ RG SPD
Sbjct: 121  GLFTSKRTEKAVRVMEILEQYGDPDSFAYNAVISGFCRSDRFDAANRVILRMKYRGFSPD 180

Query: 669  IVTYNIMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLD 848
            +VTYNI+IGSLC RGKL LA KV D+LLEDNC PTV+TYTILIEATI+ G I  AM+LLD
Sbjct: 181  VVTYNILIGSLCARGKLDLALKVMDQLLEDNCNPTVITYTILIEATIIHGSIDDAMRLLD 240

Query: 849  EMLSKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEG 1028
            EM+S+GLQPDMYTYN I+RG+C+ G++DRA+EF+S+L      P +  YN+LL+ LL+EG
Sbjct: 241  EMMSRGLQPDMYTYNVIVRGMCKRGLVDRAFEFVSNL---NTTPSLNLYNLLLKGLLNEG 297

Query: 1029 KWRDGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYE 1208
            +W  G++L+++M+  G EPN+VTYS+L+++LCRDGK  EAV++L++M +KGL PD + Y+
Sbjct: 298  RWEAGERLMSDMIVKGCEPNIVTYSVLISSLCRDGKAGEAVDVLRVMKEKGLNPDAYCYD 357

Query: 1209 PVISAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDEL 1388
            P+ISAFC+EGK+DLAI F+D MIS+G LPDIVNYNTI+ ++CK G+AD+AL +F KL+E+
Sbjct: 358  PLISAFCKEGKVDLAIGFVDDMISAGWLPDIVNYNTIMGSLCKKGRADEALNIFKKLEEV 417

Query: 1389 GCPPDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEA 1568
            GCPP+ S+YN M  ALW++G++ +AL  + EM++ G+DPD IT+NSLIS LCRDGMVDEA
Sbjct: 418  GCPPNASSYNTMFGALWSSGDKIRALTMILEMLSNGVDPDRITYNSLISSLCRDGMVDEA 477

Query: 1569 IELLKDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEG 1748
            I LL DMER+ +  TVI+YN VLLGLCKAHRI DAIE+L  MV  GC+PNETTY LLVEG
Sbjct: 478  IGLLVDMERTEWQPTVISYNIVLLGLCKAHRIVDAIEVLAVMVDNGCQPNETTYTLLVEG 537

Query: 1749 IGFAGWRAEAMKAASSLLKKNVISRESFRRLKR 1847
            +G+AGWR+ A++ A SL+  N IS++ FRRL++
Sbjct: 538  VGYAGWRSYAVELAKSLVSMNAISQDLFRRLQK 570


>ref|XP_006589209.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Glycine max]
          Length = 561

 Score =  669 bits (1726), Expect = 0.0
 Identities = 331/565 (58%), Positives = 436/565 (77%), Gaps = 4/565 (0%)
 Frame = +3

Query: 165  MTTILSAEFFPQPLPCTNSSK----PNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQ 332
            MTT+ S EF    LP   + K    PN  +T   ++ C N R +         ++ R  Q
Sbjct: 1    MTTV-STEFLSHTLPFQTNLKHAWHPNPTNT---VITCSNRRLNNKGHTKVTSSDTRPHQ 56

Query: 333  LQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQM 512
                D  + N +K LNR CK GK +EALYFLE MV  G +KPDVILCTKLIK LF+SK+ 
Sbjct: 57   --HYDFRDTNHIKSLNRLCKTGKCTEALYFLEQMVMNG-YKPDVILCTKLIKCLFTSKRT 113

Query: 513  EKALRVLKILEVYGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMI 692
            EKA+RV++ILE YGEPD FAYNAVISGFC+ ++ D+AN ++ RM+ RG SPD+VTYNI+I
Sbjct: 114  EKAVRVMEILEQYGEPDSFAYNAVISGFCRSDRFDAANGVILRMKNRGFSPDVVTYNILI 173

Query: 693  GSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQ 872
            GSLC RG L LA KV D+LLEDNC PT++TYTILIEATI+ GGI +AM+LLDEM+S+GLQ
Sbjct: 174  GSLCARGNLDLALKVMDQLLEDNCNPTLITYTILIEATIIHGGIDEAMRLLDEMMSRGLQ 233

Query: 873  PDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKL 1052
            PD+YTYN I+RG+C+ G++DRA+EF+S+L      P +  YN+LL+ LL+EG+W  G++L
Sbjct: 234  PDIYTYNVIVRGMCKRGLVDRAFEFVSNL---SITPSLNLYNLLLKGLLNEGRWEAGERL 290

Query: 1053 VAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCR 1232
            +++M+  G EPNVVTYS+L+++LCRDGK  EAV++L++M ++GL PD + Y+P+ISAFC+
Sbjct: 291  MSDMIVKGCEPNVVTYSVLISSLCRDGKAGEAVDVLRVMKERGLNPDAYCYDPLISAFCK 350

Query: 1233 EGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVST 1412
            EGK+DLAI F+D MIS+G LPDIVNYNTI+ ++CK G+AD+AL +F KL+E+GCPP+ S+
Sbjct: 351  EGKVDLAIGFVDDMISAGWLPDIVNYNTIMGSLCKKGRADEALNIFKKLEEVGCPPNASS 410

Query: 1413 YNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDME 1592
            YN M  ALW++G++ +AL  + EM++ G+DPD IT+NSLIS LCRDGMVDEAI LL DME
Sbjct: 411  YNTMFGALWSSGDKIRALGMILEMLSNGVDPDRITYNSLISSLCRDGMVDEAIGLLVDME 470

Query: 1593 RSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRA 1772
            RS +  TVI+YN VLLGLCKAHRI DAIE+L  MV  GC+PNETTY LLVEG+G+AGWR+
Sbjct: 471  RSEWQPTVISYNIVLLGLCKAHRIVDAIEVLAVMVDNGCQPNETTYTLLVEGVGYAGWRS 530

Query: 1773 EAMKAASSLLKKNVISRESFRRLKR 1847
             A++ A SL+  N IS++ FRRL++
Sbjct: 531  YAVELAKSLVSMNAISQDLFRRLQK 555


Top