BLASTX nr result

ID: Catharanthus23_contig00001463 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00001463
         (2117 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containi...   778   0.0  
emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]   776   0.0  
gb|EOY10909.1| Pentatricopeptide repeat (PPR-like) superfamily p...   775   0.0  
ref|XP_004304772.1| PREDICTED: pentatricopeptide repeat-containi...   770   0.0  
ref|XP_006361415.1| PREDICTED: pentatricopeptide repeat-containi...   768   0.0  
gb|EXB93167.1| hypothetical protein L484_024505 [Morus notabilis]     760   0.0  
ref|XP_004236781.1| PREDICTED: pentatricopeptide repeat-containi...   754   0.0  
ref|XP_002521980.1| pentatricopeptide repeat-containing protein,...   746   0.0  
gb|EMJ06317.1| hypothetical protein PRUPE_ppa004835mg [Prunus pe...   723   0.0  
ref|XP_006487702.1| PREDICTED: pentatricopeptide repeat-containi...   721   0.0  
ref|XP_006442665.1| hypothetical protein CICLE_v10019446mg [Citr...   720   0.0  
ref|XP_004142590.1| PREDICTED: pentatricopeptide repeat-containi...   712   0.0  
ref|XP_006371094.1| hypothetical protein POPTR_0019s03630g [Popu...   708   0.0  
ref|XP_006296608.1| hypothetical protein CARUB_v10013258mg [Caps...   702   0.0  
ref|XP_002884468.1| pentatricopeptide repeat-containing protein ...   701   0.0  
ref|NP_566237.1| pentatricopeptide repeat-containing protein [Ar...   701   0.0  
dbj|BAD95034.1| hypothetical protein [Arabidopsis thaliana]           701   0.0  
ref|XP_003590960.1| Pentatricopeptide repeat-containing protein ...   681   0.0  
ref|XP_003555568.1| PREDICTED: pentatricopeptide repeat-containi...   674   0.0  
gb|ESW16008.1| hypothetical protein PHAVU_007G121900g [Phaseolus...   671   0.0  

>ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Vitis vinifera]
          Length = 582

 Score =  778 bits (2010), Expect = 0.0
 Identities = 381/582 (65%), Positives = 481/582 (82%), Gaps = 5/582 (0%)
 Frame = +3

Query: 174  TILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCRNER-----KSRSPQRVKVCTENRSTQL 338
            TI S +FFP+  P     KP S S + S+V CRN        SR+  +V V  E R   L
Sbjct: 2    TIYSTDFFPRCPPFNPQLKPTSHSHHTSIVTCRNPNPNDGFNSRNAPKVGVSAEARPAHL 61

Query: 339  QSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQME 518
            QS D  E + +KLLNRSCKAGKF+E+LYFLE +VN+G + PDVILCTKLIKG F+ K +E
Sbjct: 62   QSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKG-YTPDVILCTKLIKGFFNFKNIE 120

Query: 519  KALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIG 698
            KA RV++ILE H EPDVFAYNAVISGFCK N+I++A ++LNRM+ARG  PDIVTYNIMIG
Sbjct: 121  KASRVMEILESHTEPDVFAYNAVISGFCKVNRIEAATQVLNRMKARGFLPDIVTYNIMIG 180

Query: 699  SLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQP 878
            SLCNR KLGLA KV D+LL DNC PTV+TYTILIEATI+EGGI +AMKLL+EML++GL P
Sbjct: 181  SLCNRRKLGLALKVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARGLLP 240

Query: 879  DMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLV 1058
            DMYTYN IIRG+C+EGM++RA E I+SL  KG KPDVISYNILLR+ L++GKW +G+KLV
Sbjct: 241  DMYTYNAIIRGMCKEGMVERAAELITSLTSKGCKPDVISYNILLRAFLNQGKWDEGEKLV 300

Query: 1059 AEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCRE 1238
            AEM S G EPN VTYSIL+++LCR G+++EA+++LK+M++K LTPDT++Y+P+ISA C+E
Sbjct: 301  AEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISALCKE 360

Query: 1239 GKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTY 1418
            G++DLAI  +DYMIS+GCLPDIVNYNTIL+A+CKNG A+QALE+F+KL  +GCPP+VS+Y
Sbjct: 361  GRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNVSSY 420

Query: 1419 NAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMER 1598
            N M+SALW+ G+R++AL  +  M++KG+DPDEIT+NSLISCLCRDG+V+EAI LL DME+
Sbjct: 421  NTMISALWSCGDRSRALGMVPAMISKGVDPDEITYNSLISCLCRDGLVEEAIGLLDDMEQ 480

Query: 1599 SGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAE 1778
            SGF  TVI+YN VLLGLCK  RIDDAI +  EM++KGCRPNETTY+LL+EGIGFAGWR E
Sbjct: 481  SGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGWRTE 540

Query: 1779 AMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK 1904
            AM+ A+SL  ++VIS++SF+RL +TFP LDV K+++ +ET K
Sbjct: 541  AMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETKK 582


>emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]
          Length = 592

 Score =  776 bits (2005), Expect = 0.0
 Identities = 380/584 (65%), Positives = 481/584 (82%), Gaps = 5/584 (0%)
 Frame = +3

Query: 168  MTTILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCRNER-----KSRSPQRVKVCTENRST 332
            + TI S +FFP   P +   KP S S + S+V CRN        SR+  +V V  E R  
Sbjct: 10   LMTIYSTDFFPHCPPFSPQLKPTSHSHHTSIVTCRNPNPNDGYNSRNSPKVGVSAEARPA 69

Query: 333  QLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQ 512
             LQS D  E + +KLLNRSCKAGKF+E+LYFLE +VN+G + PDVILCTKLIKG F+ K 
Sbjct: 70   HLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKG-YTPDVILCTKLIKGFFNFKN 128

Query: 513  MEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIM 692
            +EKA RV++ILE H EPDVFAYNAVISGFCK NQI++A ++LNRM+ARG  PDIVTYNIM
Sbjct: 129  IEKASRVMEILESHTEPDVFAYNAVISGFCKVNQIEAATQVLNRMKARGFLPDIVTYNIM 188

Query: 693  IGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGL 872
            IGSLCNR KLGLA  V D+LL DNC PTV+TYTILIEATI+EGGI +AMKLL+EML++GL
Sbjct: 189  IGSLCNRRKLGLALTVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARGL 248

Query: 873  QPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKK 1052
             PDMYTYN IIRG+C+EGM++RA E I+SL  KG +PDVISYNILLR+ L++GKW +G+K
Sbjct: 249  LPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCEPDVISYNILLRAFLNQGKWDEGEK 308

Query: 1053 LVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFC 1232
            LVAEM S G EPN VTYSIL+++LCR G+++EA+++LK+M++K LTPDT++Y+P+ISA C
Sbjct: 309  LVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISALC 368

Query: 1233 REGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVS 1412
            +EG++DLAI  +DYMIS+GCLPDIVNYNTIL+A+CKNG A+QALE+F+KL  +GCPP+VS
Sbjct: 369  KEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNVS 428

Query: 1413 TYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDM 1592
            +YN M+SALW+ G+R++AL  +  M++KG+DPDEIT+NSLISCLCRDG+V+EAI LL DM
Sbjct: 429  SYNTMISALWSCGDRSRALGMVPAMISKGIDPDEITYNSLISCLCRDGLVEEAIGLLDDM 488

Query: 1593 ERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWR 1772
            E+SGF  TVI+YN VLLGLCK  RIDDAI +  EM++KGCRPNETTY+LL+EGIGFAGWR
Sbjct: 489  EQSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGWR 548

Query: 1773 AEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK 1904
             EAM+ A+SL  ++VIS++SF+RL +TFP LDV K+++ +ET K
Sbjct: 549  TEAMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETKK 592


>gb|EOY10909.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao]
          Length = 586

 Score =  775 bits (2002), Expect = 0.0
 Identities = 378/586 (64%), Positives = 483/586 (82%), Gaps = 9/586 (1%)
 Frame = +3

Query: 174  TILSAEFFPQPLPCTNSS-KP--NSQSTNKSLVRCRNER------KSRSPQRVKVCTENR 326
            T+ S E     LP T    KP  NS S + SLV C N        KSR+ Q+V+V  E R
Sbjct: 2    TLFSTELVTHSLPFTTQQLKPTSNSHSHHTSLVSCLNHESQDSSSKSRNNQKVRVSAETR 61

Query: 327  STQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSS 506
             T L S D  E + +KLLNRSCKAGK++EA YFLE MV +G +KPDV+LCTK+IKG F+ 
Sbjct: 62   PTHLLSFDFKETHLMKLLNRSCKAGKYNEAFYFLECMVGKG-YKPDVVLCTKMIKGFFNG 120

Query: 507  KQMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYN 686
            + +EKA RV++ILE +GEPDVFAYNA+ISGFCK N++D AN++L+RMR+RG SPD+VTYN
Sbjct: 121  RNVEKATRVIEILEKYGEPDVFAYNAIISGFCKMNRLDFANKVLDRMRSRGFSPDVVTYN 180

Query: 687  IMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSK 866
            IMIGS C+RGKL  A KV ++LL+DNCKP+V+TYTILIEAT+L+G I +AMKLLDEMLSK
Sbjct: 181  IMIGSFCSRGKLDSAYKVINQLLKDNCKPSVITYTILIEATMLQGEINEAMKLLDEMLSK 240

Query: 867  GLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDG 1046
            GL+PDM+TYN IIRG+C++GM++RA++F+ SL  +G +PDVISYNILLR LL++GKW +G
Sbjct: 241  GLRPDMFTYNAIIRGMCKDGMVNRAFKFVRSLKARGCQPDVISYNILLRVLLNQGKWAEG 300

Query: 1047 KKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISA 1226
            +KLV EM+S G EPNVVTYSIL+++LCR+GKLEEAVN+LK+M ++GLTPD ++Y+P+ISA
Sbjct: 301  EKLVTEMVSRGCEPNVVTYSILISSLCREGKLEEAVNVLKMMKERGLTPDAYSYDPLISA 360

Query: 1227 FCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPD 1406
            FC+EG++DLAI FLD MIS GCLPDIVNYNT+L+ +CKNGKA+QALE+F+KL E+GCPP+
Sbjct: 361  FCKEGRLDLAIEFLDCMISDGCLPDIVNYNTVLATLCKNGKAEQALEIFEKLREVGCPPN 420

Query: 1407 VSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLK 1586
            VS+YN M SALW++G++ KAL  +SEM++K + PDEIT+NSLISCLCRDGMVDEAIELL 
Sbjct: 421  VSSYNTMFSALWSSGDKVKALEMISEMLSKRIGPDEITYNSLISCLCRDGMVDEAIELLV 480

Query: 1587 DMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAG 1766
            DM  SG P TVI+YN VLLGLCK HRI+DAIE+L  MV K C+PNETTY+LL+EGIGFAG
Sbjct: 481  DMGCSGIPPTVISYNIVLLGLCKVHRINDAIEVLAAMVDKRCQPNETTYILLIEGIGFAG 540

Query: 1767 WRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK 1904
            WR+EAM+ A++L +   IS++SF+RL RTFP LDV K+ A ++++K
Sbjct: 541  WRSEAMELANALFRMEAISKDSFKRLNRTFPLLDVYKEFAGSDSNK 586


>ref|XP_004304772.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 586

 Score =  770 bits (1988), Expect = 0.0
 Identities = 370/582 (63%), Positives = 470/582 (80%), Gaps = 9/582 (1%)
 Frame = +3

Query: 177  ILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCR---------NERKSRSPQRVKVCTENRS 329
            I+S E  P     T+  KP S S + + + CR             SR+P RV V  E +S
Sbjct: 3    IVSTELLPHSFHTTSQLKPTSHSHHPTALSCRASSASSISNGRNSSRNPTRVSVSAEPKS 62

Query: 330  TQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSK 509
            TQLQ+ D  + + +K+LNRSCKAG+++EA+YFLE MVN+G +KPDVILCTKLIKG F+S+
Sbjct: 63   TQLQNYDFKDTHLMKVLNRSCKAGQYNEAIYFLELMVNKG-YKPDVILCTKLIKGFFNSR 121

Query: 510  QMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNI 689
             +EKA+RV++ILE +GEPD+FAYNA+ISGFCK N+I+SAN++L+RM+++G  PD+VTYNI
Sbjct: 122  NIEKAIRVMQILEQYGEPDLFAYNALISGFCKANRIESANKVLDRMKSQGFKPDVVTYNI 181

Query: 690  MIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKG 869
            MIGSLC+RGKLGLA +V D L+ DNCKPTV+TYTILIEA IL+GGI +AMKLLDEMLS+G
Sbjct: 182  MIGSLCSRGKLGLALQVMDRLVRDNCKPTVITYTILIEAIILDGGINEAMKLLDEMLSRG 241

Query: 870  LQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGK 1049
            L+PDMYTYN I+RG+CREGM+DRA+EF+     KG  P+VISYNILLR+LL+ GKW +G+
Sbjct: 242  LKPDMYTYNAIVRGMCREGMLDRAFEFVKCFDAKGCAPNVISYNILLRALLNRGKWEEGE 301

Query: 1050 KLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAF 1229
             LVA M + G EPNVVTYSIL++ LCRDGK+E+ +N+LK+M +KGLTPD ++Y+P+IS F
Sbjct: 302  NLVANMCARGCEPNVVTYSILISTLCRDGKVEDGMNVLKIMKEKGLTPDAYSYDPLISCF 361

Query: 1230 CREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDV 1409
            C+EG++DLAI  LD MIS GCLPDIVNYNT+L+A+CKNG ADQALE+F+ L E+GCPP+V
Sbjct: 362  CKEGRLDLAIELLDCMISDGCLPDIVNYNTVLAALCKNGSADQALEIFENLGEVGCPPNV 421

Query: 1410 STYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKD 1589
            S+YN M SALWN G+R +AL  +S+M++KG++PDEIT+NSLISCLCRDGMV+EAI LL D
Sbjct: 422  SSYNTMFSALWNCGDRVRALGMVSDMVSKGIEPDEITYNSLISCLCRDGMVNEAIGLLVD 481

Query: 1590 MERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGW 1769
            ME  GF  TVITYN VLLGL KA RI DAIE+   MV+KGCRPNETTY+LL+EGIGFAGW
Sbjct: 482  MEAGGFQPTVITYNIVLLGLSKARRIVDAIEVFTAMVEKGCRPNETTYILLIEGIGFAGW 541

Query: 1770 RAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTE 1895
            RAEAM+ A S+   + I  +SF+RL RTFP LDV K++  +E
Sbjct: 542  RAEAMELAKSVYSLSAICEDSFKRLSRTFPMLDVYKELTLSE 583


>ref|XP_006361415.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Solanum tuberosum]
          Length = 583

 Score =  768 bits (1983), Expect = 0.0
 Identities = 390/583 (66%), Positives = 480/583 (82%), Gaps = 11/583 (1%)
 Frame = +3

Query: 168  MTTILSAEFFPQPLPCTNSSKPNSQSTNKS-LVRC------RNERKSRSPQRVKVCTEN- 323
            MT I+ AE FPQ    +N+ KP SQS+  + +VRC      +++ K+R+P RVK+ +EN 
Sbjct: 1    MTRIIPAEIFPQCPFFSNNLKPKSQSSKHNFVVRCSSSSNDQSKVKTRNPLRVKISSENY 60

Query: 324  RSTQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFS 503
            R T          + +K+LN SCK GK+ E LY LE  +  G +KPDVILCTKLIKG  +
Sbjct: 61   RPT----------HDMKVLNWSCKVGKYDETLYLLECKLKSG-YKPDVILCTKLIKGFCN 109

Query: 504  SKQMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTY 683
            SK  +K ++V++ILE  GEPDVFAYNA+ISGFCK N+I+ AN++LNRM+ARG  PD VTY
Sbjct: 110  SKNSDKGVKVMQILEQFGEPDVFAYNALISGFCKMNKIEEANKVLNRMKARGFPPDSVTY 169

Query: 684  NIMIGSLCNRGKLGLAQKVFDELLEDN-CKPTVVTYTILIEATILEGGIRKAMKLLDEML 860
            NI+IGSLC+RGKLG A K+ D+L E+N CKPTV+TYTILIEATILEGGI +AMKLLDEML
Sbjct: 170  NILIGSLCDRGKLGSALKLLDQLKEENNCKPTVITYTILIEATILEGGIHEAMKLLDEML 229

Query: 861  SKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLS-EGKW 1037
            S+GLQPDMYTYN IIRG+CRE MMD+AYEF+ SLP KG KPDVISYNILLR+LL  +GKW
Sbjct: 230  SRGLQPDMYTYNAIIRGMCREKMMDQAYEFVRSLPSKGCKPDVISYNILLRALLHHKGKW 289

Query: 1038 RDGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPV 1217
             DG+KL+ EMLS G EPNVVTYSIL++ALCRDGKL+EA+NLLK+MMDKGLTPDTFTY+P+
Sbjct: 290  SDGEKLMNEMLSAGCEPNVVTYSILMSALCRDGKLDEAINLLKIMMDKGLTPDTFTYDPL 349

Query: 1218 ISAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGC 1397
            ISAFC+ G++DLAI FLDYMIS+GCLPDIVNYNTILS MCK GKAD+A+EVF+KL E+GC
Sbjct: 350  ISAFCKGGRLDLAIKFLDYMISNGCLPDIVNYNTILSTMCKKGKADEAMEVFEKLAEIGC 409

Query: 1398 PPDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIE 1577
            PPDVSTYN ++SALWNNG R +AL  +SEM+ KG+DPDEIT+N+LISCLCRDGMV+EA++
Sbjct: 410  PPDVSTYNTLMSALWNNGGRARALKMVSEMIEKGVDPDEITYNALISCLCRDGMVNEALD 469

Query: 1578 LLKDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIG 1757
            LL DME +GFP TVITYN +LLGLCKAHR+ +AIE+L EMV+KG RPNETTY+LL+EGIG
Sbjct: 470  LLGDMEGNGFPPTVITYNILLLGLCKAHRVVEAIEVLAEMVEKGRRPNETTYILLIEGIG 529

Query: 1758 FAGWRAEAMKAASSLLKKNVISRESFRRLKRTFPALDV-TKDV 1883
            F+G R +AM+ AS++  KN IS+ES +RL++TF   DV  KD+
Sbjct: 530  FSGRRVQAMEMASAIYHKNAISKESLQRLRKTFQVPDVYNKDI 572


>gb|EXB93167.1| hypothetical protein L484_024505 [Morus notabilis]
          Length = 587

 Score =  760 bits (1963), Expect = 0.0
 Identities = 366/583 (62%), Positives = 473/583 (81%), Gaps = 10/583 (1%)
 Frame = +3

Query: 177  ILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCRN---------ERKSRSPQRVKVCTENRS 329
            I+S EF PQ LP +   K ++   + + + CRN          +K++ P RV+V  E +S
Sbjct: 3    IISTEFLPQTLPFSPQPKQHTSRQSHTCLSCRNPSQSSTDIYRKKNKKPLRVRVSVETKS 62

Query: 330  TQLQSN-DSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSS 506
               QSN D +E + +K++NRSCK+GK++EALYFLE MV++G  KPDVILCTK+++G F+S
Sbjct: 63   PNSQSNSDFSESHLLKVINRSCKSGKYNEALYFLELMVSKG-FKPDVILCTKVMRGFFNS 121

Query: 507  KQMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYN 686
            + + KA+RV++ILE HGEPD+F+YNA+ISGFCK N+++ AN++L+RMR +G SPD +TYN
Sbjct: 122  RNIPKAIRVMEILEKHGEPDLFSYNAMISGFCKANRVELANKVLDRMRVQGFSPDTITYN 181

Query: 687  IMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSK 866
            IMIGSLC+RGK+ +A KV DELL DNCKP+V+TYTILIEATI EGG+ KAM++L+EMLS+
Sbjct: 182  IMIGSLCSRGKVDMAFKVLDELLRDNCKPSVITYTILIEATISEGGVDKAMEVLEEMLSR 241

Query: 867  GLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDG 1046
            GL PDM+TYN I+RG+CREGM+DRA+EF+ SL  KG  P+VISYNILLR+LL+ GKW DG
Sbjct: 242  GLLPDMFTYNAIVRGMCREGMLDRAFEFVRSLEAKGCSPNVISYNILLRALLNRGKWSDG 301

Query: 1047 KKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISA 1226
            +K++++M+S G EPNVVTYSIL++ LCRDGK+E+AVN+LK M +KG+TPD ++Y+P+ISA
Sbjct: 302  EKILSDMVSRGCEPNVVTYSILISTLCRDGKVEDAVNVLKAMKEKGITPDAYSYDPLISA 361

Query: 1227 FCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPD 1406
            FC+EG++DLAI F+DYMIS G LPDIVNYNTIL+A+CKNG AD ALE+F+KL E+GCPP 
Sbjct: 362  FCKEGRLDLAIEFMDYMISDGSLPDIVNYNTILAALCKNGNADHALEIFEKLGEVGCPPT 421

Query: 1407 VSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLK 1586
            VS+YN M SALWN GER KAL  +SEM++K ++PDEIT+NSLISCLCR+GMV+EAI LL 
Sbjct: 422  VSSYNTMFSALWNCGERIKALEMISEMVSKRINPDEITYNSLISCLCREGMVNEAIGLLI 481

Query: 1587 DMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAG 1766
            DME  GF  +VI+YN VLLGLCKA RIDDAIE+L  MV+KGCRPNETTY LL+EGIGFAG
Sbjct: 482  DMEAGGFKLSVISYNIVLLGLCKARRIDDAIELLAAMVEKGCRPNETTYTLLIEGIGFAG 541

Query: 1767 WRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTE 1895
            WR EAM  A+ L     IS  SF+RL +TFP LDV K++  +E
Sbjct: 542  WRVEAMGLANLLFDIEAISEHSFKRLNKTFPMLDVYKELTLSE 584


>ref|XP_004236781.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Solanum lycopersicum]
          Length = 575

 Score =  754 bits (1946), Expect = 0.0
 Identities = 378/565 (66%), Positives = 467/565 (82%), Gaps = 9/565 (1%)
 Frame = +3

Query: 216  TNSSKPNSQSTNKS-LVRCR--NERKS---RSPQRVKVCTENRSTQLQSNDSAEPNFVKL 377
            +N+ KP S+S+  + +VRC   NE      R+PQRVK+ +ENR     S+D      +K+
Sbjct: 14   SNNLKPKSESSKHNFVVRCSISNEESRVNIRNPQRVKISSENRG----SHD------MKV 63

Query: 378  LNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVHG 557
            LN SCK GK+ E LY LE  V  G +KPDVILCTKLIKG F+SK  +K ++V++ILE  G
Sbjct: 64   LNWSCKVGKYDETLYLLECKVKSG-YKPDVILCTKLIKGFFNSKNSDKGVKVMQILEQFG 122

Query: 558  EPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQK 737
            EPDVFAYNA++SGFCK N+I+ AN++LNRM+  G  PD VTYNI+IGSLC+RGKLG A  
Sbjct: 123  EPDVFAYNALVSGFCKMNKIEEANKVLNRMKTHGFPPDSVTYNILIGSLCDRGKLGSALM 182

Query: 738  VFDELLED-NCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGL 914
            + D+L E+ NCKPTV+TYTILIEATILEGGI +AMKLLDEMLS GLQPDMYTYN IIRG+
Sbjct: 183  LLDQLKEEHNCKPTVITYTILIEATILEGGIHEAMKLLDEMLSIGLQPDMYTYNAIIRGM 242

Query: 915  CREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSE-GKWRDGKKLVAEMLSTGSEPN 1091
            CRE MMD+AYEF+ SLP KG KPDVISYNILLR+LL   GKW DG+KL+ EML  G EPN
Sbjct: 243  CREKMMDQAYEFVRSLPSKGCKPDVISYNILLRALLHHRGKWSDGEKLMNEMLCAGCEPN 302

Query: 1092 VVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLD 1271
            VVTYSIL++ALCRDGKL+EA+NLLK+M+DKGLTPDTFTY+P+ISAFC+ G++D+AI FLD
Sbjct: 303  VVTYSILMSALCRDGKLDEAINLLKIMVDKGLTPDTFTYDPLISAFCKGGRLDMAIKFLD 362

Query: 1272 YMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNG 1451
            YMI++GCLPDIVNYNTILS MCK GKAD+A+EVF+KL E+GCPPDVSTYN ++SALWNNG
Sbjct: 363  YMITNGCLPDIVNYNTILSTMCKKGKADEAMEVFEKLAEIGCPPDVSTYNTLMSALWNNG 422

Query: 1452 ERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYN 1631
             R +AL  +SEM+ KG+DPDEIT+N+LISCLCRDGMV+EA++LL DME +GFP TVITYN
Sbjct: 423  GRARALKMVSEMIEKGVDPDEITYNALISCLCRDGMVNEALDLLGDMEGNGFPPTVITYN 482

Query: 1632 AVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKK 1811
             +LLGLCKAHR+ +AIE+L EMV+KGCRPNETTY+LL+EGIGF+G R +AM+ A+++  K
Sbjct: 483  ILLLGLCKAHRVVEAIEVLAEMVEKGCRPNETTYILLIEGIGFSGRRVQAMEMATAIYHK 542

Query: 1812 NVISRESFRRLKRTFPALDV-TKDV 1883
            N IS+ES +RL++TF   DV +KD+
Sbjct: 543  NAISKESLQRLRKTFQVPDVYSKDI 567


>ref|XP_002521980.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538784|gb|EEF40384.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 584

 Score =  746 bits (1926), Expect = 0.0
 Identities = 358/583 (61%), Positives = 471/583 (80%), Gaps = 6/583 (1%)
 Frame = +3

Query: 174  TILSAEFFPQPLPCTNSS-KPNSQSTNKSLVRC-----RNERKSRSPQRVKVCTENRSTQ 335
            T+ S EF P  +  T    KP S S + ++V C      +  K R+PQ+V+V  E R T 
Sbjct: 2    TLFSTEFLPHSISFTTQPLKPTSNSLHSTIVSCIRPELNDANKVRNPQKVRVSAETRQTH 61

Query: 336  LQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQM 515
            + S D  E + +KLLNRSC+AGK++E+LYFLE MV++G + PDVILCTKLIKG F+S+ +
Sbjct: 62   VLSFDFKEVHLMKLLNRSCRAGKYNESLYFLECMVDKG-YTPDVILCTKLIKGFFNSRNI 120

Query: 516  EKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMI 695
             KA RV++ILE +G+PDVFAYNA+ISGF K NQ+++AN +L+RM++RG  PD+VTYNIMI
Sbjct: 121  GKATRVMEILERYGKPDVFAYNALISGFIKANQLENANRVLDRMKSRGFLPDVVTYNIMI 180

Query: 696  GSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQ 875
            GS C+RGKL LA ++F+ELL+DNC+PTV+TYTILIEATIL+GGI  AMKLLDEMLSKGL+
Sbjct: 181  GSFCSRGKLDLALEIFEELLKDNCEPTVITYTILIEATILDGGIDVAMKLLDEMLSKGLE 240

Query: 876  PDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKL 1055
            PD  TYN IIRG+C+E M+D+A+E + SL  +G KPD+I+YNILLR+LLS GKW +G+KL
Sbjct: 241  PDTLTYNAIIRGMCKEMMVDKAFELLRSLSSRGCKPDIITYNILLRTLLSRGKWSEGEKL 300

Query: 1056 VAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCR 1235
            ++EM+S G +PNVVT+SIL+  LCRDGK+EEAVNLL+ M +KGL PD + Y+P+I+ FCR
Sbjct: 301  ISEMISIGCKPNVVTHSILIGTLCRDGKVEEAVNLLRSMKEKGLKPDAYCYDPLIAGFCR 360

Query: 1236 EGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVST 1415
            EG++DLA  FL+YMIS GCLPDIVNYNTI++ +C+ GKADQALEVF+KLDE+GCPP+VS+
Sbjct: 361  EGRLDLATEFLEYMISDGCLPDIVNYNTIMAGLCRTGKADQALEVFEKLDEVGCPPNVSS 420

Query: 1416 YNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDME 1595
            YN + SALW++G+R +AL  + +++N+G+DPDEIT+NSLISCLCRDGMVDEAIELL DM+
Sbjct: 421  YNTLFSALWSSGDRYRALEMILKLLNQGIDPDEITYNSLISCLCRDGMVDEAIELLVDMQ 480

Query: 1596 RSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRA 1775
               +   V++YN +LLGLCK +R +DAIE+L  M +KGC+PNETTY+LL+EGIGF+G RA
Sbjct: 481  SGRYRPNVVSYNIILLGLCKVNRANDAIEVLAAMTEKGCQPNETTYILLIEGIGFSGLRA 540

Query: 1776 EAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK 1904
            EAM+ A+SL   N IS +SF RL +TFP LDV KD+ +++ SK
Sbjct: 541  EAMELANSLHGMNAISEDSFNRLNKTFPLLDVYKDLTFSDGSK 583


>gb|EMJ06317.1| hypothetical protein PRUPE_ppa004835mg [Prunus persica]
          Length = 489

 Score =  723 bits (1866), Expect = 0.0
 Identities = 339/487 (69%), Positives = 426/487 (87%)
 Frame = +3

Query: 435  MVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQ 614
            MVN+G +KPDVILCTKLIKG F+S+ +EKA+RV++ILE +GEPD+F+YNA+ISGFCK N+
Sbjct: 1    MVNKG-YKPDVILCTKLIKGFFNSRNIEKAIRVMQILEKYGEPDLFSYNALISGFCKANR 59

Query: 615  IDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTI 794
            I+SAN++L+RMR++G SPD+VTYNIMIGSLC+RGKLGLA KV D+L++DNC+PTV+TYTI
Sbjct: 60   IESANKVLDRMRSQGFSPDVVTYNIMIGSLCSRGKLGLALKVMDQLVKDNCRPTVITYTI 119

Query: 795  LIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKG 974
            LIEATI++GGI +AMKLLDEMLS+GL+PDMYTYN +IRG+CREGM+DRA++F+ SL  KG
Sbjct: 120  LIEATIVDGGIDEAMKLLDEMLSRGLKPDMYTYNAVIRGMCREGMLDRAFQFVRSLDSKG 179

Query: 975  WKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAV 1154
              P+VISYNILLR+LL+ GKW +G+KLV  M S G EPNVVTYSIL++ LCRDGK+E+AV
Sbjct: 180  CPPNVISYNILLRALLNRGKWEEGEKLVTNMCSRGCEPNVVTYSILISTLCRDGKVEDAV 239

Query: 1155 NLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAM 1334
            N+LK+M  KGLTPD ++Y+P++SAFC+EG++DLAI FLDYMIS GCLPDIVNYNTIL+A+
Sbjct: 240  NVLKIMKKKGLTPDAYSYDPLVSAFCKEGRLDLAIEFLDYMISDGCLPDIVNYNTILAAL 299

Query: 1335 CKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDE 1514
            CK+GKADQAL++F+ L E+GCPP+VS+YN M SALWN G+R +AL  +SEM+ KG+ PDE
Sbjct: 300  CKSGKADQALQIFENLGEVGCPPNVSSYNTMFSALWNCGDRVRALGMVSEMVGKGIKPDE 359

Query: 1515 ITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEE 1694
            IT+NSLISCLCRDGMVDEAI LL DME  GF  TVI+YN +LLGLCK  R+ DAI++L E
Sbjct: 360  ITYNSLISCLCRDGMVDEAIGLLVDMETGGFQPTVISYNIILLGLCKTRRVVDAIQVLTE 419

Query: 1695 MVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVT 1874
            MV+KGCRPNETTY+LL+EGIGFAGWRAEAM+ A+S+     IS +SF+RL RTFP LDV 
Sbjct: 420  MVEKGCRPNETTYILLIEGIGFAGWRAEAMELANSVFSLRAISEDSFKRLNRTFPMLDVF 479

Query: 1875 KDVAYTE 1895
            K++  +E
Sbjct: 480  KELTLSE 486



 Score = 92.4 bits (228), Expect = 7e-16
 Identities = 58/197 (29%), Positives = 103/197 (52%), Gaps = 1/197 (0%)
 Frame = +3

Query: 375 LLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKIL-EV 551
           L++  CK G+   A+ FL+ M++ G   PD++    ++  L  S + ++AL++ + L EV
Sbjct: 260 LVSAFCKEGRLDLAIEFLDYMISDGC-LPDIVNYNTILAALCKSGKADQALQIFENLGEV 318

Query: 552 HGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLA 731
              P+V +YN + S          A  M++ M  +GI PD +TYN +I  LC  G +  A
Sbjct: 319 GCPPNVSSYNTMFSALWNCGDRVRALGMVSEMVGKGIKPDEITYNSLISCLCRDGMVDEA 378

Query: 732 QKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRG 911
             +  ++     +PTV++Y I++        +  A+++L EM+ KG +P+  TY  +I G
Sbjct: 379 IGLLVDMETGGFQPTVISYNIILLGLCKTRRVVDAIQVLTEMVEKGCRPNETTYILLIEG 438

Query: 912 LCREGMMDRAYEFISSL 962
           +   G    A E  +S+
Sbjct: 439 IGFAGWRAEAMELANSV 455



 Score = 85.5 bits (210), Expect = 8e-14
 Identities = 51/168 (30%), Positives = 86/168 (51%), Gaps = 1/168 (0%)
 Frame = +3

Query: 363 NFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKI 542
           N+  +L   CK+GK  +AL   EN+   G   P+V     +   L++     +AL ++  
Sbjct: 291 NYNTILAALCKSGKADQALQIFENLGEVGC-PPNVSSYNTMFSALWNCGDRVRALGMVSE 349

Query: 543 LEVHG-EPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGK 719
           +   G +PD   YN++IS  C+   +D A  +L  M   G  P +++YNI++  LC   +
Sbjct: 350 MVGKGIKPDEITYNSLISCLCRDGMVDEAIGLLVDMETGGFQPTVISYNIILLGLCKTRR 409

Query: 720 LGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLS 863
           +  A +V  E++E  C+P   TY +LIE     G   +AM+L + + S
Sbjct: 410 VVDAIQVLTEMVEKGCRPNETTYILLIEGIGFAGWRAEAMELANSVFS 457


>ref|XP_006487702.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Citrus sinensis]
          Length = 583

 Score =  721 bits (1862), Expect = 0.0
 Identities = 351/583 (60%), Positives = 465/583 (79%), Gaps = 6/583 (1%)
 Frame = +3

Query: 168  MTTILSAEFFPQPLPCTNSS----KPN-SQSTNKSLVRCRNERKSRSPQRVKVCTENR-S 329
            M  I S +F P+ L    S     KP  S S   ++V C N  KS    RV    E R +
Sbjct: 1    MALISSTDFVPRNLVIFTSQHQQLKPTTSHSVQSTVVSCINP-KSNERVRVSSSAETRPN 59

Query: 330  TQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSK 509
            T L S D  E  F+KL+ +S +AGKF E+LYF+E+MV  G  KPDV++CTKLIK  F  +
Sbjct: 60   THLLSFDVKETQFMKLIKKSFRAGKFDESLYFIESMVANGC-KPDVVMCTKLIKKFFQER 118

Query: 510  QMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNI 689
            +  KA+RV++ILE +GEPDVFAYNA+ISGFCK NQI+ AN++L+R+R+RG SPD+VTYNI
Sbjct: 119  KSNKAVRVMEILEKYGEPDVFAYNALISGFCKANQIELANKVLDRLRSRGFSPDVVTYNI 178

Query: 690  MIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKG 869
            MIGSLC+RG +  A KVFD+LL DNCKPTV+TYTILI+AT+LEG   KAMKLLDEML++G
Sbjct: 179  MIGSLCSRGMIESAFKVFDQLLRDNCKPTVITYTILIQATMLEGQTDKAMKLLDEMLARG 238

Query: 870  LQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGK 1049
            L PDM+T N IIRG+C++GM+ +A++F+ SL  +G +PDVISYN+LLR+LL+ GKW +G+
Sbjct: 239  LIPDMFTNNAIIRGMCKKGMVGQAFQFVRSLESRGCQPDVISYNMLLRTLLNMGKWEEGE 298

Query: 1050 KLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAF 1229
            KL+ EM+S G EPNVVTYSIL+++LCRDGK E+AV++L+   +KGLTPD ++Y+P+ISA+
Sbjct: 299  KLMTEMISRGLEPNVVTYSILISSLCRDGKTEDAVDVLRAAKEKGLTPDAYSYDPLISAY 358

Query: 1230 CREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDV 1409
            C++G++DLAI FLDYMIS GCLPDIVNYNTIL+A CKNG ADQALE+F+KL ++GCPP+V
Sbjct: 359  CKDGRLDLAIEFLDYMISDGCLPDIVNYNTILAAFCKNGNADQALEIFEKLSDVGCPPNV 418

Query: 1410 STYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKD 1589
            S+YN M SALW++G++ +AL  +SEM++KG++PDEIT+NSLISCLCRDGMVDEA+ LL D
Sbjct: 419  SSYNTMFSALWSSGDKIRALGMISEMLSKGIEPDEITYNSLISCLCRDGMVDEAVGLLVD 478

Query: 1590 MERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGW 1769
            ME + F  TVI+YN ++LG CK  RI+++IE+L  M +KGC+PNETTYVLL+EGIG+ GW
Sbjct: 479  MESTRFRPTVISYNIIILGFCKTRRINESIEVLAAMFEKGCKPNETTYVLLIEGIGYGGW 538

Query: 1770 RAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTET 1898
            RAEAM+ A++L+  + ISR++F+RL RTFP LDV K++++  T
Sbjct: 539  RAEAMELANALVSMHAISRDTFKRLNRTFPLLDVYKEISHLAT 581


>ref|XP_006442665.1| hypothetical protein CICLE_v10019446mg [Citrus clementina]
            gi|557544927|gb|ESR55905.1| hypothetical protein
            CICLE_v10019446mg [Citrus clementina]
          Length = 583

 Score =  720 bits (1859), Expect = 0.0
 Identities = 350/583 (60%), Positives = 463/583 (79%), Gaps = 6/583 (1%)
 Frame = +3

Query: 168  MTTILSAEFFPQPLPCTNSS----KPN-SQSTNKSLVRCRNERKSRSPQRVKVCTENR-S 329
            M  I S +F P+ L    S     KP  S S   ++V C N  KS    RV    E R +
Sbjct: 1    MALISSTDFVPRNLVIFTSQHQQQKPTTSHSVQSTVVSCINP-KSNERVRVSSSAETRPN 59

Query: 330  TQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSK 509
            T L S D  E  F+KL+ RS +AGKF E+LYF+E+MV  G  KPDV++CTKLIK  F  +
Sbjct: 60   THLLSFDVKETQFMKLIKRSFRAGKFDESLYFIESMVANGC-KPDVVMCTKLIKKFFQER 118

Query: 510  QMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNI 689
            +  KA+RV++ILE +GEPDVFAYNA+ISGFCK NQI+ AN++L+R+R+RG SPD+VTYNI
Sbjct: 119  KSNKAVRVMEILEKYGEPDVFAYNALISGFCKANQIELANKVLDRLRSRGFSPDVVTYNI 178

Query: 690  MIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKG 869
            MIGSLC+RG +    KVFD+LL DNCKPTV+TYTILI+AT+LEG   KAMKLLDEM ++G
Sbjct: 179  MIGSLCSRGMIESGFKVFDQLLRDNCKPTVITYTILIQATMLEGQTDKAMKLLDEMFARG 238

Query: 870  LQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGK 1049
            L PDM+T N IIRG+C++GM+ +A++F+ SL  +G +PDVISYN+LLR+LL+ GKW +G+
Sbjct: 239  LIPDMFTNNAIIRGMCKKGMVGQAFQFVRSLESRGCQPDVISYNMLLRTLLNMGKWEEGE 298

Query: 1050 KLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAF 1229
            KL+ EM+S G EPNVVTYSIL+++LCRDGK E+AV++L+   +KGLTPD ++Y+P+ISA+
Sbjct: 299  KLMTEMISRGLEPNVVTYSILISSLCRDGKTEDAVDVLRAAKEKGLTPDAYSYDPLISAY 358

Query: 1230 CREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDV 1409
            C++G++DLAI FLDYMIS GCLPDIVNYNTIL+A CKNG ADQALE+F+KL ++GCPP+V
Sbjct: 359  CKDGRLDLAIEFLDYMISDGCLPDIVNYNTILAAFCKNGNADQALEIFEKLSDVGCPPNV 418

Query: 1410 STYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKD 1589
            S+YN M SALW++G++ +AL  +SEM++KG++PDEIT+NSLISCLCRDGMVDEA+ LL D
Sbjct: 419  SSYNTMFSALWSSGDKIRALGMISEMLSKGIEPDEITYNSLISCLCRDGMVDEAVGLLVD 478

Query: 1590 MERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGW 1769
            ME + F  TV++YN ++LG CK  RI++AIE+L  M +KGC+PNETTYVLL+EGIG+ GW
Sbjct: 479  MESTRFRPTVVSYNIIILGFCKTRRINEAIEVLAAMFEKGCKPNETTYVLLIEGIGYGGW 538

Query: 1770 RAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTET 1898
            RAEAM+ A++L+  + ISR++F+RL RTFP LDV K++++  T
Sbjct: 539  RAEAMELANALVSMHAISRDTFKRLNRTFPLLDVYKEISHLAT 581


>ref|XP_004142590.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Cucumis sativus]
          Length = 581

 Score =  712 bits (1837), Expect = 0.0
 Identities = 345/581 (59%), Positives = 466/581 (80%), Gaps = 2/581 (0%)
 Frame = +3

Query: 177  ILSAEFFPQPLPCTNS-SKPN-SQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQSND 350
            + S+EF PQ L  TN  +KP   QS + S+  CR   K+   + V    E R     + D
Sbjct: 1    MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHL-RNVTSSAEFRQPHFPNLD 59

Query: 351  SAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALR 530
            + + + +KLLNRSC+AGK +E+LYFLE++V++G  KPDV+LCTKLIKG F+S+ ++KA+R
Sbjct: 60   NRDAHLMKLLNRSCRAGKHNESLYFLESVVSKG-FKPDVVLCTKLIKGFFNSRNLKKAMR 118

Query: 531  VLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCN 710
            V++ILE +G+PDV++YNA+ISGF K NQIDSAN++ +RMR+RG SPD+VTYNIMIGSLC+
Sbjct: 119  VMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCS 178

Query: 711  RGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYT 890
            RGKL LA +V DELL+D CKP+V+TYTILIEATILEG I +A++L DE++S+GL+PD+YT
Sbjct: 179  RGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYT 238

Query: 891  YNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEML 1070
            YN IIRG+C+EGM DRA +F+  L  +G  PDV+SYNILLRS L++ +W DG++L+ +M+
Sbjct: 239  YNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMV 298

Query: 1071 STGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKID 1250
             +G EPNVVT+SIL+++ CR+G++ EAVN+L++M +KGLTPD+++Y+P+ISAFC+EG++D
Sbjct: 299  LSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLD 358

Query: 1251 LAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMV 1430
            LAI +L+ M+S GCLPDIVNYNTIL+ +CK G AD AL+VF+KLDE+GCPP V  YN M 
Sbjct: 359  LAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMF 418

Query: 1431 SALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFP 1610
            SALW+ G + KAL  +SEM+ KG+DPDEIT+NSLISCLCRDG+VDEAI LL DME + F 
Sbjct: 419  SALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQ 478

Query: 1611 STVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKA 1790
             TVI++N VLLG+CKAHR+ + IE+L  MV+KGC PNET+YVLL+EGI +AGWRAEAM+ 
Sbjct: 479  PTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMEL 538

Query: 1791 ASSLLKKNVISRESFRRLKRTFPALDVTKDVAYTETSK*LI 1913
            A+SL +  VIS +S +RL +TFP LDV K ++ +E+   L+
Sbjct: 539  ANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLL 579


>ref|XP_006371094.1| hypothetical protein POPTR_0019s03630g [Populus trichocarpa]
            gi|550316702|gb|ERP48891.1| hypothetical protein
            POPTR_0019s03630g [Populus trichocarpa]
          Length = 586

 Score =  708 bits (1828), Expect = 0.0
 Identities = 348/581 (59%), Positives = 456/581 (78%), Gaps = 11/581 (1%)
 Frame = +3

Query: 174  TILSAEFFPQP--LPCTNSS-KPNSQSTNKSLVRC-------RNERKSRSPQRVKVCTEN 323
            T+ S EF       P T+   K +  S   ++V C        N      P+  +V  E 
Sbjct: 2    TMFSTEFISHSCSFPFTSKHFKLSLHSLQSNVVSCINPTHNDTNSNLGNPPKLRRVLPET 61

Query: 324  RSTQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFS 503
            + T + S D  E + +KLLNRSCKAGK +E+LYFLE MV +G ++PDVI+CTKLIKG F+
Sbjct: 62   KPTHVLSYDFKETHLMKLLNRSCKAGKCNESLYFLECMVAKG-YQPDVIMCTKLIKGFFN 120

Query: 504  SKQMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTY 683
            S+ +EKA RV++ILE HGEPDVFAYNAVISGFCK N+I+SA ++L+RM+ +G S D+VTY
Sbjct: 121  SRNIEKATRVMEILEKHGEPDVFAYNAVISGFCKANRIESAKKVLDRMKRKGFSQDVVTY 180

Query: 684  NIMIGSLCNRGKLGLAQKVFDELLEDN-CKPTVVTYTILIEATILEGGIRKAMKLLDEML 860
            NIMIG+ C++GK+ LA KVF+ELL+DN CKPT++TYTILIEA ILEGGI + +KLLDEML
Sbjct: 181  NIMIGTFCSKGKIDLALKVFEELLKDNNCKPTLITYTILIEAHILEGGIDEGLKLLDEML 240

Query: 861  SKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWR 1040
            S+GL+PD +TYN I+RGL +EG +++A+E + +L  +G KPDVI+YNILLR+LL +GKW 
Sbjct: 241  SRGLEPDTFTYNVIVRGLGKEGKVNQAFELVRTLNSRGCKPDVITYNILLRALLDQGKWY 300

Query: 1041 DGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVI 1220
            +G+KL+ EM S G EPNVVTYSIL+++LCRDGK+EE+VNL+K+M +KGLTPD + Y+P+I
Sbjct: 301  EGEKLMDEMFSRGCEPNVVTYSILISSLCRDGKIEESVNLVKVMKEKGLTPDAYCYDPLI 360

Query: 1221 SAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCP 1400
            +AFCREGK+D+AI FLDYMIS G LPDIVNYNTI++A+CKNG +D A+E+F KL+E+GCP
Sbjct: 361  AAFCREGKLDMAIKFLDYMISDGFLPDIVNYNTIMAALCKNGNSDHAVEIFGKLEEVGCP 420

Query: 1401 PDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIEL 1580
            P+VS+YN M+SALW +G+R +AL  +S+M++ G+DPD IT+NSLISCLCRDGMVDEAI L
Sbjct: 421  PNVSSYNTMLSALWGSGDRYRALGMISQMLSTGIDPDGITYNSLISCLCRDGMVDEAIGL 480

Query: 1581 LKDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGF 1760
            L DM    F   +++YN VLLGLCK HRIDDAIE+L  M++ GC+PNETTY LL+EGIGF
Sbjct: 481  LADMLSGRFQPNIVSYNIVLLGLCKVHRIDDAIEVLTAMIENGCQPNETTYTLLIEGIGF 540

Query: 1761 AGWRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDV 1883
            +G RA+AM+ A+SL   N IS  S++RL + FP LDV KD+
Sbjct: 541  SGSRAQAMELANSLYSMNAISEGSYKRLNKVFPLLDVYKDL 581


>ref|XP_006296608.1| hypothetical protein CARUB_v10013258mg [Capsella rubella]
            gi|482565317|gb|EOA29506.1| hypothetical protein
            CARUB_v10013258mg [Capsella rubella]
          Length = 607

 Score =  702 bits (1812), Expect = 0.0
 Identities = 337/554 (60%), Positives = 447/554 (80%), Gaps = 3/554 (0%)
 Frame = +3

Query: 219  NSSKPNSQSTNK-SLVRCRNERKSRSPQRVKVCTENRSTQLQSNDSA--EPNFVKLLNRS 389
            ++S PN  +    S        ++ + Q   V TE R  Q  S+     +   +K+ +RS
Sbjct: 44   SNSNPNHDNVKSFSSSGAARNLQAATTQDATVPTERRQHQTHSHSLGFRDTQMLKIFHRS 103

Query: 390  CKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVHGEPDV 569
            C++G + E+L+ LE+MV +G + PDVILCTKLIKG F+ + + KA+RV++ILE  G+PDV
Sbjct: 104  CRSGNYIESLHLLESMVRKG-YNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPDV 162

Query: 570  FAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFDE 749
            FAYNA+I+GFCK N+ID A  +L+RMR++G SPD VTYNIMIGSLC+RGKL LA KV D+
Sbjct: 163  FAYNALINGFCKMNRIDDATRVLDRMRSKGFSPDTVTYNIMIGSLCSRGKLVLALKVLDQ 222

Query: 750  LLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREGM 929
            LL DNC+PTV+TYTILIEAT+LEGG+ +A+KLLDEMLS+GL+PDM+TYNTIIRG+C+EGM
Sbjct: 223  LLSDNCQPTVITYTILIEATMLEGGVDEALKLLDEMLSRGLKPDMFTYNTIIRGMCKEGM 282

Query: 930  MDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYSI 1109
            +DRA+E + +L  +G +PDVISYNILLR+LL++GKW +G+KL+ +M S   +PNVVTYSI
Sbjct: 283  VDRAFEMVRNLELRGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSI 342

Query: 1110 LVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISSG 1289
            L+T LCRDGK+EEA+NLLKLM +KGL+PD ++Y+P+I+AFCREG++DLAI FL+ MIS G
Sbjct: 343  LITTLCRDGKIEEALNLLKLMKEKGLSPDAYSYDPLIAAFCREGRLDLAIEFLETMISDG 402

Query: 1290 CLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKAL 1469
            CLPDIVNYNT+L+ +CKNGKADQALE+F KL E+GC P+ S+YN M SALW++G++ +AL
Sbjct: 403  CLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRAL 462

Query: 1470 IKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLGL 1649
              +SEM+++G+DPDEIT+NS+ISCLCR+GMVDEA +LL DM    F  +V+TYN VLLG 
Sbjct: 463  HMISEMVSQGIDPDEITYNSMISCLCREGMVDEAFDLLVDMRSCEFHPSVVTYNIVLLGF 522

Query: 1650 CKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISRE 1829
            CKAHRI+DAI++LE MV  GCRPNE+TY +L+EGIGFAG+RAEAM+ A+ L++ + IS  
Sbjct: 523  CKAHRIEDAIDVLESMVGNGCRPNESTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEH 582

Query: 1830 SFRRLKRTFPALDV 1871
            SF+RL RTFP L+V
Sbjct: 583  SFKRLHRTFPLLNV 596


>ref|XP_002884468.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297330308|gb|EFH60727.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 598

 Score =  701 bits (1810), Expect = 0.0
 Identities = 338/558 (60%), Positives = 443/558 (79%)
 Frame = +3

Query: 219  NSSKPNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQSNDSAEPNFVKLLNRSCKA 398
            ++S PN  +  KS           +     + TE R    QS    +   +K+ +RSC++
Sbjct: 40   SNSNPNHDN-GKSFSSSGARNLQATTTDAAIPTERRQQHSQSLGFRDTQMLKIFHRSCRS 98

Query: 399  GKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVHGEPDVFAY 578
            G + E+L+ LE MV +G + PDVILCTKLIKG F+ + + KA+RV++ILE  G+PDVFAY
Sbjct: 99   GNYIESLHLLETMVRKG-YNPDVILCTKLIKGFFTLRNVPKAVRVMEILEKFGQPDVFAY 157

Query: 579  NAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFDELLE 758
            NA+I+GFCK N+ID A  +L+RMR++  SPD VTYNIMIGSLC+RGKL LA KV D+LL 
Sbjct: 158  NALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLDQLLS 217

Query: 759  DNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREGMMDR 938
            DNC+PTV+TYTILIEAT+LEGG+ +A+KLLDEMLS+GL+PDM+TYNTIIRG+C+EGM+DR
Sbjct: 218  DNCQPTVITYTILIEATMLEGGVDEALKLLDEMLSRGLKPDMFTYNTIIRGMCKEGMVDR 277

Query: 939  AYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYSILVT 1118
            A+E I +L  KG +PDVISYNILLR+LL++GKW +G+KL+ +M S   +PNVVTYSIL+T
Sbjct: 278  AFEMIRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILIT 337

Query: 1119 ALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISSGCLP 1298
             LCRDGK+EEA+NLLKLM +KGLTPD ++Y+P+I+AFCREG++D+AI FL+ MIS GCLP
Sbjct: 338  TLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLP 397

Query: 1299 DIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKALIKL 1478
            DIVNYNT+L+ +CKNGKADQALE+F KL E+GC P+ S+YN M SALW++G++ +AL  +
Sbjct: 398  DIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMI 457

Query: 1479 SEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLGLCKA 1658
             EM++ G+DPDEIT+NS+ISCLCR+GMVD+A ELL DM    F  +V+TYN VLLG CKA
Sbjct: 458  LEMVSNGIDPDEITYNSMISCLCREGMVDKAFELLVDMRSCEFHPSVVTYNIVLLGFCKA 517

Query: 1659 HRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISRESFR 1838
            HRI+DAI++L+ MV  GCRPNETTY +L+EGIGFAG+RAEAM+ A+ L++ N IS  SF+
Sbjct: 518  HRIEDAIDVLDSMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRINAISEYSFK 577

Query: 1839 RLKRTFPALDVTKDVAYT 1892
            RL RTFP L+V +  + T
Sbjct: 578  RLHRTFPLLNVLQRSSQT 595


>ref|NP_566237.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75207286|sp|Q9SR00.1|PP213_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g04760, chloroplastic; Flags: Precursor
            gi|6175176|gb|AAF04902.1|AC011437_17 hypothetical protein
            [Arabidopsis thaliana] gi|15810359|gb|AAL07067.1| unknown
            protein [Arabidopsis thaliana] gi|22136960|gb|AAM91709.1|
            unknown protein [Arabidopsis thaliana]
            gi|332640611|gb|AEE74132.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 602

 Score =  701 bits (1809), Expect = 0.0
 Identities = 339/562 (60%), Positives = 446/562 (79%)
 Frame = +3

Query: 207  LPCTNSSKPNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQSNDSAEPNFVKLLNR 386
            L  +NS+  N    + S    RN + + +     + TE R    QS    +   +K+ +R
Sbjct: 40   LTFSNSNPNNDNGRSFSSSGARNLQTTTTTDAT-LPTERRQQHSQSLGFRDTQMLKIFHR 98

Query: 387  SCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVHGEPD 566
            SC++G + E+L+ LE MV +G + PDVILCTKLIKG F+ + + KA+RV++ILE  G+PD
Sbjct: 99   SCRSGNYIESLHLLETMVRKG-YNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 157

Query: 567  VFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFD 746
            VFAYNA+I+GFCK N+ID A  +L+RMR++  SPD VTYNIMIGSLC+RGKL LA KV +
Sbjct: 158  VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 217

Query: 747  ELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREG 926
            +LL DNC+PTV+TYTILIEAT+LEGG+ +A+KL+DEMLS+GL+PDM+TYNTIIRG+C+EG
Sbjct: 218  QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 277

Query: 927  MMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYS 1106
            M+DRA+E + +L  KG +PDVISYNILLR+LL++GKW +G+KL+ +M S   +PNVVTYS
Sbjct: 278  MVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 337

Query: 1107 ILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISS 1286
            IL+T LCRDGK+EEA+NLLKLM +KGLTPD ++Y+P+I+AFCREG++D+AI FL+ MIS 
Sbjct: 338  ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 397

Query: 1287 GCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKA 1466
            GCLPDIVNYNT+L+ +CKNGKADQALE+F KL E+GC P+ S+YN M SALW++G++ +A
Sbjct: 398  GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 457

Query: 1467 LIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLG 1646
            L  + EMM+ G+DPDEIT+NS+ISCLCR+GMVDEA ELL DM    F  +V+TYN VLLG
Sbjct: 458  LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 517

Query: 1647 LCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISR 1826
             CKAHRI+DAI +LE MV  GCRPNETTY +L+EGIGFAG+RAEAM+ A+ L++ + IS 
Sbjct: 518  FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 577

Query: 1827 ESFRRLKRTFPALDVTKDVAYT 1892
             SF+RL RTFP L+V +  + T
Sbjct: 578  YSFKRLHRTFPLLNVLQRSSQT 599


>dbj|BAD95034.1| hypothetical protein [Arabidopsis thaliana]
          Length = 602

 Score =  701 bits (1808), Expect = 0.0
 Identities = 339/562 (60%), Positives = 446/562 (79%)
 Frame = +3

Query: 207  LPCTNSSKPNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQSNDSAEPNFVKLLNR 386
            L  +NS+  N    + S    RN + + +     + TE R    QS    +   +K+ +R
Sbjct: 40   LTFSNSNPNNDNGRSFSSSGARNLQTTTTTDAT-LPTERRQQHSQSLGFRDTQMLKIFHR 98

Query: 387  SCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFSSKQMEKALRVLKILEVHGEPD 566
            SC++G + E+L+ LE MV +G + PDVILCTKLIKG F+ + + KA+RV++ILE  G+PD
Sbjct: 99   SCRSGNYIESLHLLETMVRKG-YNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 157

Query: 567  VFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTYNIMIGSLCNRGKLGLAQKVFD 746
            VFAYNA+I+GFCK N+ID A  +L+RMR++  SPD VTYNIMIGSLC+RGKL LA KV +
Sbjct: 158  VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 217

Query: 747  ELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLSKGLQPDMYTYNTIIRGLCREG 926
            +LL DNC+PTV+TYTILIEAT+LEGG+ +A+KL+DEMLS+GL+PDM+TYNTIIRG+C+EG
Sbjct: 218  QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 277

Query: 927  MMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRDGKKLVAEMLSTGSEPNVVTYS 1106
            M+DRA+E + +L  KG +PDVISYNILLR+LL++GKW +G+KL+ +M S   +PNVVTYS
Sbjct: 278  MVDRAFEMVRNLELKGSEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 337

Query: 1107 ILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVISAFCREGKIDLAITFLDYMISS 1286
            IL+T LCRDGK+EEA+NLLKLM +KGLTPD ++Y+P+I+AFCREG++D+AI FL+ MIS 
Sbjct: 338  ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 397

Query: 1287 GCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPPDVSTYNAMVSALWNNGERTKA 1466
            GCLPDIVNYNT+L+ +CKNGKADQALE+F KL E+GC P+ S+YN M SALW++G++ +A
Sbjct: 398  GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 457

Query: 1467 LIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELLKDMERSGFPSTVITYNAVLLG 1646
            L  + EMM+ G+DPDEIT+NS+ISCLCR+GMVDEA ELL DM    F  +V+TYN VLLG
Sbjct: 458  LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 517

Query: 1647 LCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFAGWRAEAMKAASSLLKKNVISR 1826
             CKAHRI+DAI +LE MV  GCRPNETTY +L+EGIGFAG+RAEAM+ A+ L++ + IS 
Sbjct: 518  FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 577

Query: 1827 ESFRRLKRTFPALDVTKDVAYT 1892
             SF+RL RTFP L+V +  + T
Sbjct: 578  YSFKRLHRTFPLLNVLQRSSQT 599


>ref|XP_003590960.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355480008|gb|AES61211.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 590

 Score =  681 bits (1756), Expect = 0.0
 Identities = 332/587 (56%), Positives = 446/587 (75%), Gaps = 16/587 (2%)
 Frame = +3

Query: 174  TILSAEFFPQPLP----CTNSSKPNSQSTNKSLVRCRNERKSRSPQRVKVCTENRSTQLQ 341
            T  S EF    L      ++ SKPN+     S++   NE  + + +R +    N   Q +
Sbjct: 2    TTFSTEFLSHTLNFRIHTSSHSKPNTIIITSSILFL-NEANNNNNKRRRRTNNNEQQQFR 60

Query: 342  SN-----------DSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLI 488
             N           D  + NF+K LNRSCK+ K+ E+LYFL++MVNRG +KPDVILCTKLI
Sbjct: 61   VNETKPTKHDQDYDFRDTNFMKTLNRSCKSAKYDESLYFLQHMVNRG-YKPDVILCTKLI 119

Query: 489  KGLFSSKQMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISP 668
            KG F+ K++EKA++V++ILE HG+PDVFAYNAVISGFCK +++D A+++L+RM+ RG  P
Sbjct: 120  KGFFNMKKIEKAIQVMEILEKHGKPDVFAYNAVISGFCKADRVDHASKVLDRMKKRGFEP 179

Query: 669  DIVTYNIMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLL 848
            D+VTYNI+IG+ C RG+L LA +V D+LL+DNCKPTV+TYTILIEATI +GGI +AMKLL
Sbjct: 180  DVVTYNILIGNFCGRGRLDLALRVMDQLLKDNCKPTVITYTILIEATITQGGIDEAMKLL 239

Query: 849  DEMLSKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSE 1028
            DEMLS+GL+PD YTYN ++ G+C+EGM+DRA+EF+S + + G    V +YNILLR LL+E
Sbjct: 240  DEMLSRGLRPDRYTYNVVVNGMCKEGMLDRAFEFLSRISKNGCVAGVSTYNILLRDLLNE 299

Query: 1029 GKWRDGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTY 1208
            GKW  G+KL+++ML  G EPN +TYS L+TALCRDGK++EA N+LK+M +K L PD ++Y
Sbjct: 300  GKWEYGEKLMSDMLVKGCEPNPITYSTLITALCRDGKIDEAKNVLKVMKEKALAPDGYSY 359

Query: 1209 EPVISAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDE 1388
            +P+ISA CREGK+DLAI FLD MIS G LPDI++YN+IL+++CKNG AD+AL +F+KL E
Sbjct: 360  DPLISALCREGKVDLAIEFLDDMISGGHLPDILSYNSILASLCKNGNADEALNIFEKLGE 419

Query: 1389 LGCPPDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDE 1568
            +GCPP+  +YN +  ALW++G++ +AL  + EM++ G+DPDEIT+NSLISCLCRDG+VD+
Sbjct: 420  VGCPPNAGSYNTLFGALWSSGDKIRALGMILEMLSNGIDPDEITYNSLISCLCRDGLVDQ 479

Query: 1569 AIELLKDM-ERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLV 1745
            AIELL DM E      TVI+YN VLLGLCK  RI DAIE+L  MV +GC PNETTY LL+
Sbjct: 480  AIELLVDMFESEKCQPTVISYNTVLLGLCKVQRIIDAIEVLAAMVNEGCLPNETTYTLLI 539

Query: 1746 EGIGFAGWRAEAMKAASSLLKKNVISRESFRRLKRTFPALDVTKDVA 1886
            +GIGFAGWR +AM+ A+ L+  + IS +SF+R ++ FP  D  K++A
Sbjct: 540  QGIGFAGWRYDAMELANLLVNMDAISEDSFKRFQKIFPVFDAHKELA 586


>ref|XP_003555568.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Glycine max]
          Length = 576

 Score =  674 bits (1740), Expect = 0.0
 Identities = 332/573 (57%), Positives = 439/573 (76%), Gaps = 16/573 (2%)
 Frame = +3

Query: 180  LSAEFFPQPLPCTNSSK----PNSQST------------NKSLVRCRNERKSRSPQRVKV 311
            +S+EF    LP   +SK    PN  +T            N S  R  N   ++   RV  
Sbjct: 4    VSSEFLSHCLPLGTNSKRAWLPNPSNTVITCRIPLLNEDNPSKRRLNNNNNNKGHTRVT- 62

Query: 312  CTENRSTQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIK 491
             + +   Q Q  D  + + +K LNR CK GK++EALYFLE MV RG +KPDVILCTKLIK
Sbjct: 63   -SSDTRPQQQHYDFRDTHHMKALNRLCKTGKYTEALYFLEQMVKRG-YKPDVILCTKLIK 120

Query: 492  GLFSSKQMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPD 671
            GLF+SK+ EKA+RV++ILE +G+PD FAYNAVISGFC+ ++ D+AN ++ RM+ RG SPD
Sbjct: 121  GLFTSKRTEKAVRVMEILEQYGDPDSFAYNAVISGFCRSDRFDAANRVILRMKYRGFSPD 180

Query: 672  IVTYNIMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLD 851
            +VTYNI+IGSLC RGKL LA KV D+LLEDNC PTV+TYTILIEATI+ G I  AM+LLD
Sbjct: 181  VVTYNILIGSLCARGKLDLALKVMDQLLEDNCNPTVITYTILIEATIIHGSIDDAMRLLD 240

Query: 852  EMLSKGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEG 1031
            EM+S+GLQPDMYTYN I+RG+C+ G++DRA+EF+S+L      P +  YN+LL+ LL+EG
Sbjct: 241  EMMSRGLQPDMYTYNVIVRGMCKRGLVDRAFEFVSNL---NTTPSLNLYNLLLKGLLNEG 297

Query: 1032 KWRDGKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYE 1211
            +W  G++L+++M+  G EPN+VTYS+L+++LCRDGK  EAV++L++M +KGL PD + Y+
Sbjct: 298  RWEAGERLMSDMIVKGCEPNIVTYSVLISSLCRDGKAGEAVDVLRVMKEKGLNPDAYCYD 357

Query: 1212 PVISAFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDEL 1391
            P+ISAFC+EGK+DLAI F+D MIS+G LPDIVNYNTI+ ++CK G+AD+AL +F KL+E+
Sbjct: 358  PLISAFCKEGKVDLAIGFVDDMISAGWLPDIVNYNTIMGSLCKKGRADEALNIFKKLEEV 417

Query: 1392 GCPPDVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEA 1571
            GCPP+ S+YN M  ALW++G++ +AL  + EM++ G+DPD IT+NSLIS LCRDGMVDEA
Sbjct: 418  GCPPNASSYNTMFGALWSSGDKIRALTMILEMLSNGVDPDRITYNSLISSLCRDGMVDEA 477

Query: 1572 IELLKDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEG 1751
            I LL DMER+ +  TVI+YN VLLGLCKAHRI DAIE+L  MV  GC+PNETTY LLVEG
Sbjct: 478  IGLLVDMERTEWQPTVISYNIVLLGLCKAHRIVDAIEVLAVMVDNGCQPNETTYTLLVEG 537

Query: 1752 IGFAGWRAEAMKAASSLLKKNVISRESFRRLKR 1850
            +G+AGWR+ A++ A SL+  N IS++ FRRL++
Sbjct: 538  VGYAGWRSYAVELAKSLVSMNAISQDLFRRLQK 570


>gb|ESW16008.1| hypothetical protein PHAVU_007G121900g [Phaseolus vulgaris]
          Length = 570

 Score =  671 bits (1730), Expect = 0.0
 Identities = 327/569 (57%), Positives = 438/569 (76%), Gaps = 8/569 (1%)
 Frame = +3

Query: 168  MTTILSAEFFPQPLPCTNSSKPNSQSTNKSLVRCR----NE----RKSRSPQRVKVCTEN 323
            MTT+ S EF    L    +SK        +++ CR    NE    ++  +  +      +
Sbjct: 1    MTTV-STEFLSHTLSLRTNSKGAWHPKPNTVITCRIPVLNEDNPSKRKNNYNKGNGRVSS 59

Query: 324  RSTQLQSNDSAEPNFVKLLNRSCKAGKFSEALYFLENMVNRGSHKPDVILCTKLIKGLFS 503
              T+ +  D  + + ++ LNR C+ GK++EALYFLE MV RG +KPDVILCTKLIKGLF+
Sbjct: 60   SDTRPRHYDFRDTHHMRALNRLCRTGKYTEALYFLEQMVKRG-YKPDVILCTKLIKGLFT 118

Query: 504  SKQMEKALRVLKILEVHGEPDVFAYNAVISGFCKGNQIDSANEMLNRMRARGISPDIVTY 683
            SK+ EKA++V++ILE HG+PD FAYNAVISGFC+ ++ D+AN +L RM+ RG SPD+VTY
Sbjct: 119  SKKTEKAVQVMEILEQHGDPDAFAYNAVISGFCRSDRFDAANGVLLRMKNRGFSPDVVTY 178

Query: 684  NIMIGSLCNRGKLGLAQKVFDELLEDNCKPTVVTYTILIEATILEGGIRKAMKLLDEMLS 863
            NI+IGSLC RGKL LA KV D+L++DNC PTV+TYTILIEATI+ G I KAMKLLDEM+S
Sbjct: 179  NILIGSLCARGKLDLAMKVMDQLMKDNCNPTVITYTILIEATIIHGVIDKAMKLLDEMVS 238

Query: 864  KGLQPDMYTYNTIIRGLCREGMMDRAYEFISSLPEKGWKPDVISYNILLRSLLSEGKWRD 1043
            +GLQPDMYTYN I+RG+C+ G++DRA+EF+ +L      P +  YN++L+ LL+EG+W+ 
Sbjct: 239  RGLQPDMYTYNVIVRGMCKRGLVDRAFEFVCNLSTT---PSLNLYNLVLKGLLNEGRWKT 295

Query: 1044 GKKLVAEMLSTGSEPNVVTYSILVTALCRDGKLEEAVNLLKLMMDKGLTPDTFTYEPVIS 1223
            G++L+++M+  G EPNVVTYS+L+ +LCRDGK  EAV+LLK+M +KGL+PD + Y+P+IS
Sbjct: 296  GERLMSDMMVKGCEPNVVTYSVLINSLCRDGKTGEAVDLLKVMKEKGLSPDAYCYDPLIS 355

Query: 1224 AFCREGKIDLAITFLDYMISSGCLPDIVNYNTILSAMCKNGKADQALEVFDKLDELGCPP 1403
            AFC+EGK+DLAI F+D M+S+G LPDI+NYNTI+ ++CK G+ D+AL +F KLDE+GCPP
Sbjct: 356  AFCKEGKVDLAIGFVDDMVSAGWLPDIINYNTIMGSLCKKGRGDEALSIFKKLDEVGCPP 415

Query: 1404 DVSTYNAMVSALWNNGERTKALIKLSEMMNKGMDPDEITFNSLISCLCRDGMVDEAIELL 1583
            +VS+YN M+ ALW++G++ +AL  + EM+N G+DPD IT+NSLISCLCRDGMVDEAI LL
Sbjct: 416  NVSSYNTMLGALWSSGDKIRALRMVLEMLNNGLDPDRITYNSLISCLCRDGMVDEAIGLL 475

Query: 1584 KDMERSGFPSTVITYNAVLLGLCKAHRIDDAIEILEEMVKKGCRPNETTYVLLVEGIGFA 1763
             DMERS +  TVI+YN VLLGLCKAHRI DAIE+L  MV  GC+PN+TTY LLVEGI +A
Sbjct: 476  VDMERSEWQPTVISYNIVLLGLCKAHRIVDAIEVLAVMVDNGCQPNQTTYTLLVEGISYA 535

Query: 1764 GWRAEAMKAASSLLKKNVISRESFRRLKR 1850
            GW ++A++ A SL     IS++ FRRL +
Sbjct: 536  GWPSDAVELAKSLSSMKAISQDLFRRLNK 564

