BLASTX nr result

ID: Catharanthus22_contig00019751 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00019751
         (2036 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004233900.1| PREDICTED: pentatricopeptide repeat-containi...   839   0.0  
ref|XP_006362578.1| PREDICTED: pentatricopeptide repeat-containi...   838   0.0  
gb|EMJ13661.1| hypothetical protein PRUPE_ppa015022mg, partial [...   804   0.0  
ref|XP_003631192.1| PREDICTED: pentatricopeptide repeat-containi...   783   0.0  
emb|CAN61637.1| hypothetical protein VITISV_008458 [Vitis vinifera]   782   0.0  
ref|XP_004293756.1| PREDICTED: pentatricopeptide repeat-containi...   780   0.0  
ref|XP_006468480.1| PREDICTED: pentatricopeptide repeat-containi...   771   0.0  
gb|EOY27563.1| Pentatricopeptide repeat superfamily protein [The...   771   0.0  
ref|XP_002528370.1| pentatricopeptide repeat-containing protein,...   768   0.0  
ref|XP_006448708.1| hypothetical protein CICLE_v10014445mg [Citr...   764   0.0  
ref|XP_004145475.1| PREDICTED: pentatricopeptide repeat-containi...   764   0.0  
gb|EXB61730.1| hypothetical protein L484_008796 [Morus notabilis]     763   0.0  
gb|EOY27561.1| Pentatricopeptide repeat (PPR-like) superfamily p...   762   0.0  
ref|XP_004157755.1| PREDICTED: uncharacterized protein LOC101223...   758   0.0  
ref|XP_002867913.1| predicted protein [Arabidopsis lyrata subsp....   755   0.0  
ref|NP_001190774.1| Pentatricopeptide repeat domain-containing p...   754   0.0  
emb|CAB52870.1| putative protein [Arabidopsis thaliana] gi|72687...   754   0.0  
ref|XP_006413926.1| hypothetical protein EUTSA_v10027430mg, part...   751   0.0  
ref|XP_003533559.1| PREDICTED: pentatricopeptide repeat-containi...   746   0.0  
ref|XP_002322376.2| hypothetical protein POPTR_0015s15360g [Popu...   743   0.0  

>ref|XP_004233900.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19890-like
            [Solanum lycopersicum]
          Length = 716

 Score =  839 bits (2168), Expect = 0.0
 Identities = 408/637 (64%), Positives = 506/637 (79%), Gaps = 2/637 (0%)
 Frame = +3

Query: 129  SGKWWRYRGLFTQTLFSHVSVRTMSHAHSDSVIPSH--SVVKTVRSLVCESYSRQQQKQN 302
            S  +WRY               T  H+ + + + S   SVV+ V SLV ESY + Q+  +
Sbjct: 38   SNPFWRY---------------TQFHSFTTNPLSSDFDSVVRRVCSLVSESYCKVQENTH 82

Query: 303  FRSIICKLNIPIDSEFLNPEEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIV 482
            F+S   KL +PIDSE L  E+AITV ASLADE GS++AL FFYWAIG++KF++FMR YIV
Sbjct: 83   FKSRHPKLKLPIDSECLTQEQAITVVASLADEGGSMLALSFFYWAIGYVKFRHFMRLYIV 142

Query: 483  LATCLIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVA 662
            LA  LIKNGNFER +EV+HCM++NF E+GMLKEAVDMVFEMQNQGLVL+  +LN +++V 
Sbjct: 143  LAIYLIKNGNFERTHEVMHCMLRNFCEVGMLKEAVDMVFEMQNQGLVLNAGSLNSVVSVV 202

Query: 663  AETGCVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDN 842
             E G V+ A  VFG+MC+RGV P++  FESMVV YCR+ R+ EADRWLSAMLERGFLVDN
Sbjct: 203  TEMGHVEMAEKVFGEMCDRGVCPDSFCFESMVVAYCRMGRVVEADRWLSAMLERGFLVDN 262

Query: 843  ATCTLIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEE 1022
            ATCTLI++++CEKG +NR LWIFNKL+E+G  PNVIN+T LINGLCK+G IK AFELLEE
Sbjct: 263  ATCTLILSVFCEKGSINRVLWIFNKLIELGLAPNVINYTCLINGLCKKGIIKHAFELLEE 322

Query: 1023 MVSKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEG 1202
            MV KG KPNV+THTALIDGLCKKGW +KAFRLFLKLV+SD+YKPNVHTYTAMIAGYCK+ 
Sbjct: 323  MVRKGLKPNVFTHTALIDGLCKKGWMDKAFRLFLKLVKSDNYKPNVHTYTAMIAGYCKQE 382

Query: 1203 KLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYN 1382
            KLNRAEMLL RM EQ L PNA++Y+ LIDGYCKVGN D +Y+L+  + + GL P+I  YN
Sbjct: 383  KLNRAEMLLSRMQEQELVPNANTYTALIDGYCKVGNFDVAYKLLRVMDEKGLAPSIFTYN 442

Query: 1383 CMIDSLCKKGRASEAYKLLKRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKMTKT 1562
             +ID LCKKGR  EAY++LK+G + G+S DLVT+TIL+S+ CK GD GQA A  SKM K 
Sbjct: 443  AVIDGLCKKGRVQEAYQMLKKGMQIGISPDLVTYTILMSQSCKLGDNGQAFALFSKMVKA 502

Query: 1563 SLMPDTHVYTILISAFCRQKRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMA 1742
             + PD H YT LI+A CRQK+M +SE++F+DA  LG+ P+ E  TSMI GY RDKNV+MA
Sbjct: 503  GIGPDMHTYTTLIAALCRQKKMKDSEKLFDDAVILGLIPTKETCTSMICGYCRDKNVAMA 562

Query: 1743 LKFYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYE 1922
             K++++M + GC+PDSLTYGA+ISGLCK+ K++EA+ LYN+M+DKG+ PCEVTRLT+AYE
Sbjct: 563  KKYFQRMGEYGCVPDSLTYGALISGLCKESKLDEARDLYNSMVDKGIPPCEVTRLTVAYE 622

Query: 1923 YCKKGEPSTAMVLLDRLDKKLWMRTFNTLIRKLCSEK 2033
            YCK  EP+  M LLD+L+KKLW+RT +TL+RKLCSEK
Sbjct: 623  YCKNNEPTITMGLLDKLEKKLWVRTVSTLVRKLCSEK 659



 Score =  191 bits (485), Expect = 1e-45
 Identities = 106/362 (29%), Positives = 193/362 (53%), Gaps = 4/362 (1%)
 Frame = +3

Query: 963  LINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLKLVRSD 1142
            ++   C+ G +K A +++ EM ++G   N  +  +++  + + G  E A ++F ++    
Sbjct: 163  MLRNFCEVGMLKEAVDMVFEMQNQGLVLNAGSLNSVVSVVTEMGHVEMAEKVFGEMCDRG 222

Query: 1143 HYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRS 1322
               P+   + +M+  YC+ G++  A+  L  MLE+G   +  + + ++  +C+ G+++R 
Sbjct: 223  -VCPDSFCFESMVVAYCRMGRVVEADRWLSAMLERGFLVDNATCTLILSVFCEKGSINRV 281

Query: 1323 YELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFETGLSADLVTFTILISE 1502
              + + + ++GL PN+  Y C+I+ LCKKG    A++LL+     GL  ++ T T LI  
Sbjct: 282  LWIFNKLIELGLAPNVINYTCLINGLCKKGIIKHAFELLEEMVRKGLKPNVFTHTALIDG 341

Query: 1503 CCKHGDMGQALAHLSKMTKT-SLMPDTHVYTILISAFCRQKRMTESERIFNDATKLGIAP 1679
             CK G M +A     K+ K+ +  P+ H YT +I+ +C+Q+++  +E + +   +  + P
Sbjct: 342  LCKKGWMDKAFRLFLKLVKSDNYKPNVHTYTAMIAGYCKQEKLNRAEMLLSRMQEQELVP 401

Query: 1680 STEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEAKILY 1859
            +   YT++I GY +  N  +A K    M ++G  P   TY A+I GLCK  +V+EA  + 
Sbjct: 402  NANTYTALIDGYCKVGNFDVAYKLLRVMDEKGLAPSIFTYNAVIDGLCKKGRVQEAYQML 461

Query: 1860 NTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLW---MRTFNTLIRKLCSE 2030
               M  G+SP  VT   +  + CK G+   A  L  ++ K      M T+ TLI  LC +
Sbjct: 462  KKGMQIGISPDLVTYTILMSQSCKLGDNGQAFALFSKMVKAGIGPDMHTYTTLIAALCRQ 521

Query: 2031 KK 2036
            KK
Sbjct: 522  KK 523


>ref|XP_006362578.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19890-like
            isoform X1 [Solanum tuberosum]
            gi|565393841|ref|XP_006362579.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g19890-like isoform X2 [Solanum tuberosum]
            gi|565393843|ref|XP_006362580.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g19890-like isoform X3 [Solanum tuberosum]
          Length = 716

 Score =  838 bits (2165), Expect = 0.0
 Identities = 409/635 (64%), Positives = 505/635 (79%)
 Frame = +3

Query: 129  SGKWWRYRGLFTQTLFSHVSVRTMSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFR 308
            S  +WRY      T F+  +   +S           SVVK V SLV ESY + Q+  +F+
Sbjct: 38   SNPFWRY------TQFNSFTTNPLSSDFD-------SVVKRVCSLVSESYCKVQENTHFK 84

Query: 309  SIICKLNIPIDSEFLNPEEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLA 488
            S   KL +PIDSE+L  E+AITV ASLADE GS++AL FFYWAIG++KF++FMR YIVLA
Sbjct: 85   SRHPKLKLPIDSEYLTQEQAITVVASLADEGGSMLALSFFYWAIGYVKFRHFMRLYIVLA 144

Query: 489  TCLIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAE 668
              LIKNGNFER +EV+H M++NF E+GMLKEAVDMVFEMQNQGLVL+  +LN +++VA E
Sbjct: 145  IYLIKNGNFERTHEVMHFMLRNFCEVGMLKEAVDMVFEMQNQGLVLNAGSLNSVVSVATE 204

Query: 669  TGCVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNAT 848
             G V+ A  VFG+MC+RGV P++  FESMVV YCR+ R+ EADRWLSAMLERGFLVDNAT
Sbjct: 205  MGHVEMAEKVFGEMCDRGVCPDSFCFESMVVAYCRMGRVVEADRWLSAMLERGFLVDNAT 264

Query: 849  CTLIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMV 1028
            CTLIM+++C+KG +NR LWIFNKL+E+G  PNVIN+T LINGLCK+G IK AFELLEEMV
Sbjct: 265  CTLIMSVFCDKGSVNRVLWIFNKLIELGLAPNVINYTCLINGLCKKGIIKHAFELLEEMV 324

Query: 1029 SKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKL 1208
             KG KPNV+THT LIDGLCKKGW +KAFRLFLKLV+SD+YKPNVHTYTAMIAGYCK+ KL
Sbjct: 325  RKGLKPNVFTHTVLIDGLCKKGWMDKAFRLFLKLVKSDNYKPNVHTYTAMIAGYCKQEKL 384

Query: 1209 NRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCM 1388
            NRAEMLL RM EQ L PNA++Y+ LIDGYCKVGN D +Y+L+  + + GL P+I  YN +
Sbjct: 385  NRAEMLLSRMQEQELVPNANTYTALIDGYCKVGNFDVAYKLLRVMDEKGLAPSIFTYNAV 444

Query: 1389 IDSLCKKGRASEAYKLLKRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSL 1568
            ID LCKKGR  EAY++LK+G +  +S DLVT+TIL+S+ CK GD GQA A  SKM K  +
Sbjct: 445  IDGLCKKGRVQEAYQMLKKGMQIEISPDLVTYTILMSQSCKLGDNGQAFALFSKMVKAGI 504

Query: 1569 MPDTHVYTILISAFCRQKRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALK 1748
             PD H YT LI+A CRQK+M +SE++F+DA  LG+ P+ E  TSMI GY RDKNV+MA K
Sbjct: 505  SPDMHTYTTLIAALCRQKKMKDSEKLFDDAVILGLIPTKETCTSMICGYCRDKNVAMAKK 564

Query: 1749 FYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYC 1928
            ++++M + GC+PDSLTYGA+ISGLCK+ K++EA+ LYN+M+DKG+ PCEVTRLT+AYEYC
Sbjct: 565  YFQRMGEYGCVPDSLTYGALISGLCKESKLDEARDLYNSMVDKGIPPCEVTRLTVAYEYC 624

Query: 1929 KKGEPSTAMVLLDRLDKKLWMRTFNTLIRKLCSEK 2033
            K  EP+ AM LLDRL+KKLW+RT +TL+RKLCSEK
Sbjct: 625  KNNEPTIAMGLLDRLEKKLWIRTVSTLVRKLCSEK 659



 Score =  188 bits (478), Expect = 7e-45
 Identities = 108/371 (29%), Positives = 196/371 (52%), Gaps = 4/371 (1%)
 Frame = +3

Query: 936  TPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAFR 1115
            T  V++F  ++   C+ G +K A +++ EM ++G   N  +  +++    + G  E A +
Sbjct: 156  THEVMHF--MLRNFCEVGMLKEAVDMVFEMQNQGLVLNAGSLNSVVSVATEMGHVEMAEK 213

Query: 1116 LFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGY 1295
            +F ++       P+   + +M+  YC+ G++  A+  L  MLE+G   +  + + ++  +
Sbjct: 214  VFGEMCDRG-VCPDSFCFESMVVAYCRMGRVVEADRWLSAMLERGFLVDNATCTLIMSVF 272

Query: 1296 CKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFETGLSADL 1475
            C  G+++R   + + + ++GL PN+  Y C+I+ LCKKG    A++LL+     GL  ++
Sbjct: 273  CDKGSVNRVLWIFNKLIELGLAPNVINYTCLINGLCKKGIIKHAFELLEEMVRKGLKPNV 332

Query: 1476 VTFTILISECCKHGDMGQALAHLSKMTKT-SLMPDTHVYTILISAFCRQKRMTESERIFN 1652
             T T+LI   CK G M +A     K+ K+ +  P+ H YT +I+ +C+Q+++  +E + +
Sbjct: 333  FTHTVLIDGLCKKGWMDKAFRLFLKLVKSDNYKPNVHTYTAMIAGYCKQEKLNRAEMLLS 392

Query: 1653 DATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISGLCKDL 1832
               +  + P+   YT++I GY +  N  +A K    M ++G  P   TY A+I GLCK  
Sbjct: 393  RMQEQELVPNANTYTALIDGYCKVGNFDVAYKLLRVMDEKGLAPSIFTYNAVIDGLCKKG 452

Query: 1833 KVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDK---KLWMRTFN 2003
            +V+EA  +    M   +SP  VT   +  + CK G+   A  L  ++ K      M T+ 
Sbjct: 453  RVQEAYQMLKKGMQIEISPDLVTYTILMSQSCKLGDNGQAFALFSKMVKAGISPDMHTYT 512

Query: 2004 TLIRKLCSEKK 2036
            TLI  LC +KK
Sbjct: 513  TLIAALCRQKK 523


>gb|EMJ13661.1| hypothetical protein PRUPE_ppa015022mg, partial [Prunus persica]
          Length = 688

 Score =  804 bits (2076), Expect = 0.0
 Identities = 397/641 (61%), Positives = 488/641 (76%), Gaps = 18/641 (2%)
 Frame = +3

Query: 168  TLFSHVSVRTMSHAHSD------------------SVIPSHSVVKTVRSLVCESYSRQQQ 293
            TLFS   +RT+S+ H D                  S   S S+V+T+ +LVC+SYS Q  
Sbjct: 30   TLFS---LRTLSYTHYDDPYSTTTITTATSTTSTSSSSQSQSLVRTICALVCQSYSPQT- 85

Query: 294  KQNFRSIICKLNIPIDSEFLNPEEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRF 473
              + RS   KLN+ ++++ L  E+AI+V ASLA+E GS+VAL FFYWAIG  KF+YFMR 
Sbjct: 86   --HLRSSPPKLNLDLNADSLTNEQAISVVASLAEEAGSMVALSFFYWAIGFPKFRYFMRL 143

Query: 474  YIVLATCLIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCIL 653
            YI  A  L  NGN ERA+EV+HCMV+NF EIG LKEA DMVFEMQNQGL+LS  TLNC+L
Sbjct: 144  YIFCAMSLFGNGNLERAHEVVHCMVRNFAEIGRLKEAADMVFEMQNQGLMLSTRTLNCVL 203

Query: 654  TVAAETGCVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFL 833
             +A + G V+ A N+F +MC RGV P++ S++SMVVGYCR  R+ E DRWLS MLERGF+
Sbjct: 204  GIACDLGLVEYAENLFEEMCVRGVSPDSLSYKSMVVGYCRNRRVLEVDRWLSKMLERGFV 263

Query: 834  VDNATCTLIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFEL 1013
            +DN T TLI++L+CEK          + ++ MG  PN+INFTSLI+GLC+RGSIK+AFE+
Sbjct: 264  LDNVTFTLIISLFCEK----------SLMIRMGVKPNLINFTSLIHGLCQRGSIKQAFEM 313

Query: 1014 LEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYC 1193
            LEEMV KGWKPNVYTHT LIDGLCKKGWTE+AFRLFLKLVRSD+YKPNVHTYTAMI GYC
Sbjct: 314  LEEMVRKGWKPNVYTHTGLIDGLCKKGWTERAFRLFLKLVRSDNYKPNVHTYTAMIRGYC 373

Query: 1194 KEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNIC 1373
            +E K++RAEMLL RM EQGL PN ++Y+TL+ G+CK GN DR+YELMD + K G  PNIC
Sbjct: 374  EEDKMSRAEMLLSRMKEQGLIPNTNTYTTLVSGHCKAGNFDRAYELMDIMGKEGFAPNIC 433

Query: 1374 IYNCMIDSLCKKGRASEAYKLLKRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKM 1553
             YN + DSLCKKGR  EAYKL+K+GF  GL AD VT+TI ISE CK GD+  AL   +KM
Sbjct: 434  TYNAVFDSLCKKGRVQEAYKLIKKGFRRGLEADRVTYTIFISEHCKRGDINGALVFFNKM 493

Query: 1554 TKTSLMPDTHVYTILISAFCRQKRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNV 1733
             K  L PD H YT LI+AFCRQK+M ESE+ F  + +LG  P+ E YTSMI GY RD+N+
Sbjct: 494  LKVGLQPDMHSYTTLIAAFCRQKKMKESEKFFELSVRLGSIPTKETYTSMICGYCRDENI 553

Query: 1734 SMALKFYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTI 1913
            ++A+KF+ +M   GC PDS TYGA+ISGLCK+ K+EEA+ LY+TMMDKGLSPCEVTRLT+
Sbjct: 554  ALAIKFFHRMGDHGCAPDSFTYGALISGLCKEEKLEEARRLYDTMMDKGLSPCEVTRLTL 613

Query: 1914 AYEYCKKGEPSTAMVLLDRLDKKLWMRTFNTLIRKLCSEKK 2036
            AY+YCKK + + AMVLL+RL+KKLW+RT NTL+RKLCSEKK
Sbjct: 614  AYKYCKKDDSAAAMVLLERLEKKLWIRTVNTLVRKLCSEKK 654


>ref|XP_003631192.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19890-like
            [Vitis vinifera]
          Length = 708

 Score =  783 bits (2021), Expect = 0.0
 Identities = 391/621 (62%), Positives = 473/621 (76%)
 Frame = +3

Query: 174  FSHVSVRTMSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFL 353
            + H    T S + S S   S SVV+T+ SLVC+SY    Q+ + R    KL++P+DSE L
Sbjct: 42   YIHDEPSTSSSSQSQS--HSQSVVRTICSLVCQSY---YQQTHVRFTPPKLHLPLDSESL 96

Query: 354  NPEEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEV 533
              ++AITV ASLADE GS+VAL F YWAIG  KF++FMR YIV AT LI N N ERANEV
Sbjct: 97   THDQAITVVASLADEAGSMVALSFLYWAIGFPKFRHFMRLYIVSATALIGNKNLERANEV 156

Query: 534  IHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMC 713
            + CMV NF E G LKEAV+MV EMQNQGLV S  TLNC+L VA   G V+ A N+F +MC
Sbjct: 157  MQCMVMNFAENGKLKEAVNMVVEMQNQGLVPSTQTLNCVLDVAVGMGLVEIAENMFVEMC 216

Query: 714  ERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLN 893
            +RGV P+  SF+ MVV  C + R+ EA+RWL+AM+ERGF+VDNATCTLI+  +C+KG +N
Sbjct: 217  QRGVSPDCVSFKLMVVACCNMGRVLEAERWLNAMVERGFIVDNATCTLIIDAFCQKGYVN 276

Query: 894  RALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALI 1073
            R +  F K+VEMG  PNVINFT+LINGLCK+GSIK+AFELLEEMV +GWKPNVYTHT LI
Sbjct: 277  RVVGYFWKMVEMGLAPNVINFTALINGLCKQGSIKQAFELLEEMVRRGWKPNVYTHTTLI 336

Query: 1074 DGLCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGL 1253
            DGLCKKGWTEKAFRLFLKLVRSD YKPNVHTYTAMI GYCKE KLNRAEMLL RM EQGL
Sbjct: 337  DGLCKKGWTEKAFRLFLKLVRSDGYKPNVHTYTAMINGYCKEDKLNRAEMLLSRMQEQGL 396

Query: 1254 TPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYK 1433
             PN ++Y+TLIDG+CKVGN  R+YELMD + K G  PNI  YN +ID LCKKG   EAY+
Sbjct: 397  VPNTNTYTTLIDGHCKVGNFVRAYELMDLMGKEGFSPNIYTYNAIIDGLCKKGSLDEAYR 456

Query: 1434 LLKRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFC 1613
            LL +    GL AD VT+TIL+S  C+  D  ++L   +KM K    PD H YT LIS FC
Sbjct: 457  LLNKVSVHGLQADGVTYTILMSVHCRQADTNRSLVFFNKMLKVGFTPDIHSYTTLISTFC 516

Query: 1614 RQKRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSL 1793
            RQK+M ESER+F +A  LG+ P+ + YTSMI GY R  N S+A+K +++M   GC PDS+
Sbjct: 517  RQKQMKESERLFEEAVSLGLIPTKKTYTSMICGYCRYGNTSLAVKLFQRMSNHGCAPDSI 576

Query: 1794 TYGAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRL 1973
            TYGA+ISGLCK+ K+++A+ LY+ MMDKGLSPCEVTRLT+AYEYCKK + STA+ +LDRL
Sbjct: 577  TYGALISGLCKESKLDDARNLYDAMMDKGLSPCEVTRLTLAYEYCKKDDSSTAINVLDRL 636

Query: 1974 DKKLWMRTFNTLIRKLCSEKK 2036
            +K+ W+RT NTL+RKLCSE K
Sbjct: 637  EKRQWIRTVNTLVRKLCSEGK 657



 Score =  102 bits (254), Expect = 6e-19
 Identities = 78/344 (22%), Positives = 147/344 (42%), Gaps = 66/344 (19%)
 Frame = +3

Query: 495  LIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETG 674
            L+++  ++        M+  + +   L  A  ++  MQ QGLV + +T   ++    + G
Sbjct: 355  LVRSDGYKPNVHTYTAMINGYCKEDKLNRAEMLLSRMQEQGLVPNTNTYTTLIDGHCKVG 414

Query: 675  CVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCT 854
                A+ +   M + G  PN  ++ +++ G C+   + EA R L+ +   G   D  T T
Sbjct: 415  NFVRAYELMDLMGKEGFSPNIYTYNAIIDGLCKKGSLDEAYRLLNKVSVHGLQADGVTYT 474

Query: 855  LIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSK 1034
            ++M+++C +   NR+L  FNK++++GFTP++ ++T+LI+  C++  +K +  L EE VS 
Sbjct: 475  ILMSVHCRQADTNRSLVFFNKMLKVGFTPDIHSYTTLISTFCRQKQMKESERLFEEAVSL 534

Query: 1035 GWKPNVYTHT-----------------------------------ALIDGLCKKGWTEKA 1109
            G  P   T+T                                   ALI GLCK+   + A
Sbjct: 535  GLIPTKKTYTSMICGYCRYGNTSLAVKLFQRMSNHGCAPDSITYGALISGLCKESKLDDA 594

Query: 1110 FRLF-------------LKLVRSDHY------------------KPNVHTYTAMIAGYCK 1196
              L+              +L  +  Y                  +  + T   ++   C 
Sbjct: 595  RNLYDAMMDKGLSPCEVTRLTLAYEYCKKDDSSTAINVLDRLEKRQWIRTVNTLVRKLCS 654

Query: 1197 EGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYE 1328
            EGKL+ A +   ++L++   PN +  + L       G +++ YE
Sbjct: 655  EGKLDMAALFFHKLLDK--EPNVNRVTLL-------GFMNKCYE 689


>emb|CAN61637.1| hypothetical protein VITISV_008458 [Vitis vinifera]
          Length = 708

 Score =  782 bits (2019), Expect = 0.0
 Identities = 390/621 (62%), Positives = 473/621 (76%)
 Frame = +3

Query: 174  FSHVSVRTMSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFL 353
            + H    T S + S S   S SVV+T+ SLVC+SY    Q+ + R    KL++P+DSE L
Sbjct: 42   YIHDEPSTSSSSQSQS--HSQSVVRTICSLVCQSY---YQQTHVRFTPPKLHLPLDSESL 96

Query: 354  NPEEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEV 533
              ++AITV ASLADE GS+VAL F YWAIG  KF++FMR YIV AT LI N N ERANEV
Sbjct: 97   THDQAITVVASLADEAGSMVALSFLYWAIGFPKFRHFMRLYIVSATALIGNKNLERANEV 156

Query: 534  IHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMC 713
            + CMV NF E G LKEAV+MV EMQNQGLV S  TLNC+L VA   G V+ A N+F +MC
Sbjct: 157  MQCMVMNFAENGKLKEAVNMVVEMQNQGLVXSTQTLNCVLDVAVGMGLVEIAENMFVEMC 216

Query: 714  ERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLN 893
            +RGV P+  SF+ MVV  C + R+ EA++WL+AM+ERGF+VDNATCTLI+  +C+KG +N
Sbjct: 217  QRGVSPDCVSFKLMVVACCNMGRVLEAEKWLNAMVERGFIVDNATCTLIIDAFCQKGYVN 276

Query: 894  RALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALI 1073
            R +  F K+VEMG  PNVINFT+LINGLCK+GSIK+AFELLEEMV +GWKPNVYTHT LI
Sbjct: 277  RVVGYFWKMVEMGLAPNVINFTALINGLCKQGSIKQAFELLEEMVRRGWKPNVYTHTTLI 336

Query: 1074 DGLCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGL 1253
            DGLCKKGWTEKAFRLFLKLVRSD YKPNVHTYTAMI GYCKE KLNRAEMLL RM EQGL
Sbjct: 337  DGLCKKGWTEKAFRLFLKLVRSDGYKPNVHTYTAMINGYCKEDKLNRAEMLLSRMQEQGL 396

Query: 1254 TPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYK 1433
             PN ++Y+TLIDG+CKVGN  R+YELMD + K G  PNI  YN +ID LCKKG   EAY+
Sbjct: 397  VPNTNTYTTLIDGHCKVGNFVRAYELMDLMGKEGFSPNIYTYNAIIDGLCKKGSLDEAYR 456

Query: 1434 LLKRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFC 1613
            LL +    GL AD VT+TIL+S  C+  D  ++L   +KM K    PD H YT LIS FC
Sbjct: 457  LLNKVSVHGLQADGVTYTILMSVHCRQADTNRSLVFFNKMLKVGFTPDIHSYTTLISXFC 516

Query: 1614 RQKRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSL 1793
            RQK+M ESER+F +A  LG+ P+ + YTSMI GY R  N S+A+K +++M   GC PDS+
Sbjct: 517  RQKQMKESERLFEEAVSLGLIPTKKTYTSMICGYCRYGNTSLAVKLFQRMSNHGCAPDSI 576

Query: 1794 TYGAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRL 1973
            TYGA+ISGLCK+ K+++A+ LY+ MMDKGLSPCEVTRLT+AYEYCKK + STA+ +LDRL
Sbjct: 577  TYGALISGLCKESKLDDARNLYDAMMDKGLSPCEVTRLTLAYEYCKKDDSSTAINVLDRL 636

Query: 1974 DKKLWMRTFNTLIRKLCSEKK 2036
            +K+ W+RT NTL+RKLCSE K
Sbjct: 637  EKRQWIRTVNTLVRKLCSEGK 657



 Score =  102 bits (255), Expect = 5e-19
 Identities = 78/344 (22%), Positives = 147/344 (42%), Gaps = 66/344 (19%)
 Frame = +3

Query: 495  LIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETG 674
            L+++  ++        M+  + +   L  A  ++  MQ QGLV + +T   ++    + G
Sbjct: 355  LVRSDGYKPNVHTYTAMINGYCKEDKLNRAEMLLSRMQEQGLVPNTNTYTTLIDGHCKVG 414

Query: 675  CVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCT 854
                A+ +   M + G  PN  ++ +++ G C+   + EA R L+ +   G   D  T T
Sbjct: 415  NFVRAYELMDLMGKEGFSPNIYTYNAIIDGLCKKGSLDEAYRLLNKVSVHGLQADGVTYT 474

Query: 855  LIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSK 1034
            ++M+++C +   NR+L  FNK++++GFTP++ ++T+LI+  C++  +K +  L EE VS 
Sbjct: 475  ILMSVHCRQADTNRSLVFFNKMLKVGFTPDIHSYTTLISXFCRQKQMKESERLFEEAVSL 534

Query: 1035 GWKPNVYTHT-----------------------------------ALIDGLCKKGWTEKA 1109
            G  P   T+T                                   ALI GLCK+   + A
Sbjct: 535  GLIPTKKTYTSMICGYCRYGNTSLAVKLFQRMSNHGCAPDSITYGALISGLCKESKLDDA 594

Query: 1110 FRLF-------------LKLVRSDHY------------------KPNVHTYTAMIAGYCK 1196
              L+              +L  +  Y                  +  + T   ++   C 
Sbjct: 595  RNLYDAMMDKGLSPCEVTRLTLAYEYCKKDDSSTAINVLDRLEKRQWIRTVNTLVRKLCS 654

Query: 1197 EGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYE 1328
            EGKL+ A +   ++L++   PN +  + L       G +++ YE
Sbjct: 655  EGKLDMAALFFHKLLDK--EPNVNRVTLL-------GFMNKCYE 689


>ref|XP_004293756.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19890-like
            [Fragaria vesca subsp. vesca]
          Length = 705

 Score =  780 bits (2014), Expect = 0.0
 Identities = 373/608 (61%), Positives = 474/608 (77%)
 Frame = +3

Query: 213  SDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEEAITVAASLA 392
            SDS   SHS+V  + S+V +SYS Q    +F+S    LN+ ++ + L  E AI+V ASLA
Sbjct: 50   SDSQSESHSLVTQICSMVYKSYSPQT---HFKSSPPILNLDLNPDSLTHEHAISVVASLA 106

Query: 393  DEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKNFGEIGM 572
             E GS+VAL FFYWA+G  KF+YFMR YI  A  +  NGN ER +EV+ CMV++F EIG 
Sbjct: 107  GEAGSMVALSFFYWAVGFTKFRYFMRLYIFCAMSIFGNGNLERTHEVVQCMVRSFAEIGR 166

Query: 573  LKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPNTCSFES 752
             KEA DMVF+MQNQGLVLS  TLNC++ +A E G ++ A NVF +M  RGV P+  SF+ 
Sbjct: 167  FKEAADMVFDMQNQGLVLSTRTLNCVVGIACEMGLMEYAENVFDEMSVRGVCPDGLSFKC 226

Query: 753  MVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLVEMG 932
            MVVGYCR   + E DRWLS M+ERGF++DNA+ TLI++++CEKG ++RA W F+K+ +MG
Sbjct: 227  MVVGYCRKGAVMEVDRWLSRMIERGFVLDNASFTLIVSVFCEKGFVSRASWCFDKMSKMG 286

Query: 933  FTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAF 1112
              PN++NFTSLI+GLCKRGS+K+AFE+LEEMV +GWKPNVYTHTALIDGLCKKGWTE+AF
Sbjct: 287  VKPNLVNFTSLIHGLCKRGSVKQAFEMLEEMVRRGWKPNVYTHTALIDGLCKKGWTERAF 346

Query: 1113 RLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDG 1292
            RLFLKLVRSD+YKPNVHTYTAMI+GYCKE K++RAEMLL RM EQ L PNA++Y+TL+ G
Sbjct: 347  RLFLKLVRSDNYKPNVHTYTAMISGYCKEEKMSRAEMLLSRMKEQELVPNAYTYTTLVYG 406

Query: 1293 YCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFETGLSAD 1472
            +CK GN +++Y+LMD +++ G  PNIC YN ++D LCKK R  EAYKL+K+GF  GL AD
Sbjct: 407  HCKAGNFEKAYQLMDVMSEEGFAPNICTYNAVMDCLCKKERVQEAYKLIKKGFRRGLQAD 466

Query: 1473 LVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTESERIFN 1652
             VT+TI ISE CK  D+  A A  +KM K  L PD H YT LI+AFCRQK+M ESE++F 
Sbjct: 467  RVTYTIFISEHCKQADIKGAQAFFNKMVKAGLEPDMHSYTTLIAAFCRQKKMKESEKLFE 526

Query: 1653 DATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISGLCKDL 1832
             A +LG+ P+ E YTSMI GY RD N+ +A+KF+ +M   GC PDS TYGA+ISGLCK+ 
Sbjct: 527  VAVRLGLIPTKETYTSMICGYCRDGNIVLAVKFFHRMSDHGCSPDSFTYGALISGLCKEE 586

Query: 1833 KVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMRTFNTLI 2012
            K++EA+ LY+TMMDKGLSPCEVTRLT+ ++YC+K + +TAMV+LDRL+KK W+RT NTL+
Sbjct: 587  KLDEARKLYDTMMDKGLSPCEVTRLTLTHKYCQKDDYATAMVILDRLEKKYWIRTVNTLV 646

Query: 2013 RKLCSEKK 2036
            RKLC EKK
Sbjct: 647  RKLCCEKK 654


>ref|XP_006468480.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19890-like
            [Citrus sinensis]
          Length = 707

 Score =  771 bits (1992), Expect = 0.0
 Identities = 376/608 (61%), Positives = 468/608 (76%)
 Frame = +3

Query: 213  SDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEEAITVAASLA 392
            S S  P  S+VKTV S+V ESY    Q+ + RS   +LN+ ID + L  E+AITV ASLA
Sbjct: 52   SSSPSPPQSLVKTVCSMVLESY---YQQFHLRSSPPRLNLQIDIDSLTHEQAITVVASLA 108

Query: 393  DEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKNFGEIGM 572
            +E GS+VAL FFYWAIG  KF++FMR YIV AT LI NGNFERA+EV+ CMV +F EIG 
Sbjct: 109  NEAGSMVALSFFYWAIGFAKFRHFMRLYIVCATSLISNGNFERAHEVMQCMVSSFAEIGR 168

Query: 573  LKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPNTCSFES 752
            LKE   MV EM N GL L   TLN ++ +A E G V+ A  VF +MC RGV  +  S++ 
Sbjct: 169  LKEGFSMVIEMTNNGLPLITSTLNRVVGIACEMGLVEYAEEVFDEMCARGVCADASSYKL 228

Query: 753  MVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLVEMG 932
            MVV YCR+ R++EADRWLSAML+RG ++DNAT TL++T +C+KG ++RA W F+K++  G
Sbjct: 229  MVVAYCRMGRVTEADRWLSAMLDRGAILDNATLTLLITAFCDKGFVSRAFWYFDKMIVKG 288

Query: 933  FTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAF 1112
              PN+INFTSLINGLCKRGSIK+AFELLEEMV KGWKPNVYTHT LIDGLCKKGWTEKAF
Sbjct: 289  LKPNLINFTSLINGLCKRGSIKQAFELLEEMVRKGWKPNVYTHTVLIDGLCKKGWTEKAF 348

Query: 1113 RLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDG 1292
            RLFLKLVRSD+YKPNVHTYTAMI+GYCKE K+NRAEMLL RM EQGL PN ++Y++LI G
Sbjct: 349  RLFLKLVRSDNYKPNVHTYTAMISGYCKEEKMNRAEMLLERMKEQGLLPNTNTYTSLIYG 408

Query: 1293 YCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFETGLSAD 1472
            +CKVGN +R+Y+LMD + K G  PNI  YN +ID LCKKGR  EAY+LLK+ F+  L AD
Sbjct: 409  HCKVGNFERAYDLMDLMGKEGCTPNIYAYNAIIDGLCKKGRVQEAYELLKKAFQRELQAD 468

Query: 1473 LVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTESERIFN 1652
             +T+TIL+SE  K  +  QAL    +M K  L PD H Y  LI+AFCRQK+M ESE+ F 
Sbjct: 469  KITYTILLSEHLKQAETKQALGLFCRMVKAGLNPDIHAYNTLIAAFCRQKKMKESEKFFQ 528

Query: 1653 DATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISGLCKDL 1832
            +A   G+ P+ E YTSMI GY RD N+S A+K++++M + GC PD++TYGA+ISGLCK  
Sbjct: 529  EAITAGLFPTKETYTSMICGYLRDGNISSAVKYFQRMNQIGCAPDNITYGALISGLCKQS 588

Query: 1833 KVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMRTFNTLI 2012
            K++EA   Y +M+ KG+SPCEVTR+T+AYEYCK+G+ +TAM++L+ LDKKLW+RT NTLI
Sbjct: 589  KLDEACQFYESMIGKGISPCEVTRVTLAYEYCKQGDSATAMIILESLDKKLWIRTVNTLI 648

Query: 2013 RKLCSEKK 2036
            RKLCSEK+
Sbjct: 649  RKLCSEKR 656


>gb|EOY27563.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao]
          Length = 738

 Score =  771 bits (1992), Expect = 0.0
 Identities = 384/641 (59%), Positives = 476/641 (74%), Gaps = 10/641 (1%)
 Frame = +3

Query: 144  RYRGLFTQTLFSHVSVRTMSHAHSD-------SVIPS---HSVVKTVRSLVCESYSRQQQ 293
            RY G+  +   + +     S+ H D       S  PS    S +KT+ S V ESY    Q
Sbjct: 50   RYHGIKPRLWTNPLFTLNPSYLHFDTNFIDTQSPTPSSEPQSFIKTICSQVYESY---HQ 106

Query: 294  KQNFRSIICKLNIPIDSEFLNPEEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRF 473
            + + R    KL + I+   L  E+AI++ ASLA+E GS+VAL FF+W +   KF+ F+R 
Sbjct: 107  QAHLRFSPPKLTLNINPYCLTHEQAISIVASLANEAGSMVALSFFHWVLEISKFRLFIRL 166

Query: 474  YIVLATCLIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCIL 653
            YIV AT LIKNGNF++ANEV+ C+V++F ++G LKEAV+MVFEMQN GL     TLNCIL
Sbjct: 167  YIVTATSLIKNGNFDKANEVMQCLVRSFAKVGRLKEAVEMVFEMQNHGLKPKAETLNCIL 226

Query: 654  TVAAETGCVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFL 833
             V  E G +D    VF +M ERGV  +  S++ MVVGYCR+  +SE D+WL+ ML RGF+
Sbjct: 227  GVGFEMGLLDYLEKVFDEMSERGVCGDCSSYKLMVVGYCRMGMVSEVDKWLTEMLGRGFI 286

Query: 834  VDNATCTLIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFEL 1013
            VDNATCTL+++L+CEKG  +RA W F+K+V+MGF PN+IN++ LINGLCKRGSIK+AF  
Sbjct: 287  VDNATCTLVISLFCEKGFASRASWYFDKMVKMGFKPNLINYSCLINGLCKRGSIKQAFGK 346

Query: 1014 LEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYC 1193
            LE+MV  GWKPNVY HTALIDGLC+KGWTEKAFRLFLKLVRSD+YK NVHTYT+MI+GYC
Sbjct: 347  LEDMVRAGWKPNVYIHTALIDGLCRKGWTEKAFRLFLKLVRSDNYKLNVHTYTSMISGYC 406

Query: 1194 KEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNIC 1373
            KE KLNRAEMLL RM EQGL PN ++Y+TLIDG+CKVGN DR+YE MD + K G  PNIC
Sbjct: 407  KEEKLNRAEMLLSRMKEQGLVPNTNTYTTLIDGHCKVGNFDRAYEFMDVMDKEGFAPNIC 466

Query: 1374 IYNCMIDSLCKKGRASEAYKLLKRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKM 1553
             YN +I  LCKKGR  EA++LL+ G   GL AD VT+TILI+E CK  D G+ LA   KM
Sbjct: 467  TYNAIIGGLCKKGRVEEAHELLRDGLLHGLQADRVTYTILITEHCKQADTGRVLAFFCKM 526

Query: 1554 TKTSLMPDTHVYTILISAFCRQKRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNV 1733
             K  L PD H Y  LI++FC+QK+M ESE +F +A +LG+ P+ E YTSMI GY RD NV
Sbjct: 527  VKGGLQPDMHSYNTLIASFCKQKKMKESENLFEEALRLGLVPTKETYTSMICGYSRDGNV 586

Query: 1734 SMALKFYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTI 1913
            S+ LKF+ KM   GC+PDS+ YG +ISGLCK+ ++EEA  LY TMMD+GLSPCEVTRLTI
Sbjct: 587  SLGLKFFSKMNDHGCVPDSIAYGTVISGLCKESRLEEACQLYETMMDRGLSPCEVTRLTI 646

Query: 1914 AYEYCKKGEPSTAMVLLDRLDKKLWMRTFNTLIRKLCSEKK 2036
            AYEYCKKG+ + AMV+L+RL+KKLWMRT NTLIRKLCSEKK
Sbjct: 647  AYEYCKKGDSAVAMVMLERLEKKLWMRTVNTLIRKLCSEKK 687


>ref|XP_002528370.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223532238|gb|EEF34042.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 712

 Score =  768 bits (1984), Expect = 0.0
 Identities = 382/623 (61%), Positives = 466/623 (74%)
 Frame = +3

Query: 168  TLFSHVSVRTMSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSE 347
            T F   S    S   S +  P  S V+++  LVCESY   QQ    +     LN+ I+  
Sbjct: 42   TTFIPTSPLPASPPQSLAPPPPESSVRSICLLVCESY---QQTSFSKPSSPSLNLEINPN 98

Query: 348  FLNPEEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERAN 527
             L  E+ ITV ASLA E GS+V+L FF W IG  KF++FMR YIV AT  + N N +RA 
Sbjct: 99   SLTHEQVITVVASLAQEAGSVVSLSFFNWVIGFSKFRHFMRLYIVCATTFLNNDNLDRAT 158

Query: 528  EVIHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGD 707
            EV+ CMV++F EIG LKEAV+MV EMQN GLVL    LN ++ VA   G VD A  VF +
Sbjct: 159  EVMQCMVRSFSEIGKLKEAVNMVIEMQNHGLVLKARILNFVIDVALALGFVDYAEKVFDE 218

Query: 708  MCERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGR 887
            M +R VVP++ S++ MVVGYCR+ RIS+ DRWL  M+ERG+ VDNATCTL+++ + EKG 
Sbjct: 219  MLDRAVVPDSTSYKLMVVGYCRMGRISDVDRWLKDMIERGYAVDNATCTLMISTFSEKGF 278

Query: 888  LNRALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTA 1067
            +NRA W F K V+MG  PN+INF+SLINGLCK GSIK+AFE+LEEMV KGWKPNVYTHTA
Sbjct: 279  VNRAFWYFKKWVQMGLNPNLINFSSLINGLCKIGSIKQAFEMLEEMVRKGWKPNVYTHTA 338

Query: 1068 LIDGLCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQ 1247
            LIDGLCKKGWTEKAFRLFLKLVRSD+YKPNV+TYT MI GYCKE KLNRAEMLL RM EQ
Sbjct: 339  LIDGLCKKGWTEKAFRLFLKLVRSDNYKPNVYTYTCMINGYCKEEKLNRAEMLLIRMKEQ 398

Query: 1248 GLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEA 1427
            GL PN ++Y+ LIDG+CK GN  R+YELMD + K G  PNI  YN +ID LCKKGR  EA
Sbjct: 399  GLVPNTNTYTCLIDGHCKAGNFGRAYELMDLMGKEGFTPNIFTYNAIIDGLCKKGRFPEA 458

Query: 1428 YKLLKRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISA 1607
            YKLL+RG ++GL AD VT+TILISE C+  D  QALA  S+M K  L PD H Y +LI+ 
Sbjct: 459  YKLLRRGLKSGLHADKVTYTILISEFCRQTDNKQALAIFSRMFKVGLQPDMHTYNVLIAT 518

Query: 1608 FCRQKRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPD 1787
            FCRQK++ ESE++F +A  LG+ P+ E YTSMI GY RD ++S A+KF+ KM   GC PD
Sbjct: 519  FCRQKKVEESEKLFEEAVGLGLLPTKETYTSMICGYCRDGHISSAIKFFHKMRDYGCKPD 578

Query: 1788 SLTYGAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLD 1967
            S+TYGA+ISGLC + K++EA  LY TM+D GLSPCEVTR+T+AYEYCK+G+ +TAM++L+
Sbjct: 579  SITYGALISGLCNESKLDEACQLYETMIDNGLSPCEVTRVTLAYEYCKQGDSATAMIILE 638

Query: 1968 RLDKKLWMRTFNTLIRKLCSEKK 2036
            RL+KKLW+RT NTLIRKLCSEKK
Sbjct: 639  RLEKKLWIRTVNTLIRKLCSEKK 661


>ref|XP_006448708.1| hypothetical protein CICLE_v10014445mg [Citrus clementina]
            gi|557551319|gb|ESR61948.1| hypothetical protein
            CICLE_v10014445mg [Citrus clementina]
          Length = 707

 Score =  764 bits (1974), Expect = 0.0
 Identities = 375/608 (61%), Positives = 465/608 (76%)
 Frame = +3

Query: 213  SDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEEAITVAASLA 392
            S S  P  S+VKTV S+V ESY +Q      RS   +LN+ ID + L  E+AITV ASLA
Sbjct: 52   SSSPSPPQSLVKTVCSMVLESYYQQFHS---RSSPPRLNLQIDIDSLTHEQAITVVASLA 108

Query: 393  DEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKNFGEIGM 572
            +E GS+VAL FFYWAIG  KF++FMR YIV AT LI NGNFERA+EV+ CMV  F EIG 
Sbjct: 109  NEAGSMVALSFFYWAIGFAKFRHFMRLYIVCATSLISNGNFERAHEVMQCMVSGFAEIGR 168

Query: 573  LKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPNTCSFES 752
            LKE   MV EM N GL L   TLN ++ +A ETG V+ A  VF +MC R V  +  S++ 
Sbjct: 169  LKEGFSMVIEMSNNGLPLITSTLNRVMGIACETGLVEYAEEVFDEMCARAVCADASSYKL 228

Query: 753  MVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLVEMG 932
            MVV YCR+ R++EADRWLSAML+RG ++DNAT TL++T +C+KG ++RA W F+K++  G
Sbjct: 229  MVVAYCRMGRVTEADRWLSAMLDRGAILDNATLTLLITAFCDKGFVSRAFWYFDKMIVKG 288

Query: 933  FTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAF 1112
              PN+INFTSLINGLCKRGSIK+AFELLEEMV KG KPNVYTHT LIDGLCKKGWTEKAF
Sbjct: 289  LKPNLINFTSLINGLCKRGSIKQAFELLEEMVRKGLKPNVYTHTVLIDGLCKKGWTEKAF 348

Query: 1113 RLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDG 1292
            RLFLKLVRSD+YKPNVHTYTAMI+GYCKE K+NRAEMLL RM EQGL PN ++Y++LI G
Sbjct: 349  RLFLKLVRSDNYKPNVHTYTAMISGYCKEEKMNRAEMLLERMKEQGLLPNTNTYTSLIYG 408

Query: 1293 YCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFETGLSAD 1472
            +CKVGN +R+Y+LMD + K G  PNI  YN +ID LCKKGR  EAY+LLK+ F+  L AD
Sbjct: 409  HCKVGNFERAYDLMDLMDKEGCTPNIYAYNAIIDGLCKKGRVQEAYELLKKAFQGELQAD 468

Query: 1473 LVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTESERIFN 1652
             +T+TIL+S   K  +  QAL    +M K  L PD H YT LI+AFCRQK+M ESE  F+
Sbjct: 469  KITYTILLSGHLKQAETKQALGLFCRMVKAGLNPDIHAYTTLIAAFCRQKKMKESENFFH 528

Query: 1653 DATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISGLCKDL 1832
            +    G+ P+ E YTSMI GY RD N+S A+K++++M + GC PD++TYGA+ISGLCK  
Sbjct: 529  EVITAGLFPTKETYTSMICGYLRDGNISSAVKYFQRMNQIGCAPDNITYGALISGLCKQS 588

Query: 1833 KVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMRTFNTLI 2012
            K++EA   Y +M+DKG+SPCEVTR+T+AYEYCK+G+ +TAM++L+ LDKKLW+RT NTLI
Sbjct: 589  KLDEACQFYESMIDKGISPCEVTRVTLAYEYCKQGDSATAMIVLESLDKKLWIRTVNTLI 648

Query: 2013 RKLCSEKK 2036
            RKLCSEK+
Sbjct: 649  RKLCSEKR 656


>ref|XP_004145475.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19890-like
            [Cucumis sativus]
          Length = 728

 Score =  764 bits (1974), Expect = 0.0
 Identities = 369/636 (58%), Positives = 479/636 (75%), Gaps = 10/636 (1%)
 Frame = +3

Query: 159  FTQTLFSHVSVRTMSHAHSDSVIPSH----------SVVKTVRSLVCESYSRQQQKQNFR 308
            F Q   +  S+   S  H DS+   H          S +K + SLV ++Y RQ    + R
Sbjct: 45   FQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQP---HLR 101

Query: 309  SIICKLNIPIDSEFLNPEEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLA 488
                KLN+ +D+  L  E+AI+  A LA EEGS+VAL FFYWA+G  KF+YFMR YIV  
Sbjct: 102  FSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCT 161

Query: 489  TCLIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAE 668
              L+   N ERA+EV+ CMV  F EIG LKEAVDM+ +M+NQGLVL+   +N I+ VAAE
Sbjct: 162  MSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAE 221

Query: 669  TGCVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNAT 848
               V+ A NVF +M  RGV P++C+++ ++VGYCR   + EADRW+  M+ERGF+VDNAT
Sbjct: 222  MRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNAT 281

Query: 849  CTLIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMV 1028
             TLI+T +CEK  +NRA+W F+K+ +MG +PN+IN++S+I+GLCKRGS+K+AFELLEEMV
Sbjct: 282  LTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMV 341

Query: 1029 SKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKL 1208
              GWKPNVYTHT+LI GLCKKGWTE+AFRLFLKL+RSD+YKPNVHTYTAMI+GYCKE KL
Sbjct: 342  KNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKL 401

Query: 1209 NRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCM 1388
            +RAEML  RM EQGL PN ++Y+TLIDG+CK GN  ++YELM+ ++  G  PN C YN +
Sbjct: 402  SRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSI 461

Query: 1389 IDSLCKKGRASEAYKLLKRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSL 1568
            +D LCK+GRA EA+KLL  GF+  + AD VT+TILISE CK  DM QAL  L+KM K   
Sbjct: 462  VDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGF 521

Query: 1569 MPDTHVYTILISAFCRQKRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALK 1748
             PD H+YT LI+AFCRQ  M +SE++F++  KLG+AP+ E YTSMI GY R+K VS+A+K
Sbjct: 522  QPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKKVSLAVK 581

Query: 1749 FYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYC 1928
            F++KM   GC PDS++YGA+ISGLCK+ +++EA+ LY+TM+DKGLSPCEVTR+T+ YEYC
Sbjct: 582  FFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYC 641

Query: 1929 KKGEPSTAMVLLDRLDKKLWMRTFNTLIRKLCSEKK 2036
            K  + ++AMV+L+RL+KKLW+RT +TLIRKLC EKK
Sbjct: 642  KTEDFASAMVILERLNKKLWIRTVHTLIRKLCCEKK 677


>gb|EXB61730.1| hypothetical protein L484_008796 [Morus notabilis]
          Length = 731

 Score =  763 bits (1969), Expect = 0.0
 Identities = 370/619 (59%), Positives = 476/619 (76%), Gaps = 2/619 (0%)
 Frame = +3

Query: 186  SVRTMSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEE 365
            S  + S + S S+  S S+++TV SLV ESY    Q  + R    KL + +D++ L  E+
Sbjct: 55   SSSSSSSSSSSSLSSSQSLIRTVCSLVFESY---YQHGHGRQSPPKLILNVDTDSLTHEQ 111

Query: 366  AITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCM 545
            AITV ASLADE GS+VAL FFYWAI   KF++FMR YIV A  LI NGN ERA+EV+ CM
Sbjct: 112  AITVVASLADEGGSMVALSFFYWAIEFSKFRHFMRLYIVCAMSLIGNGNLERAHEVMQCM 171

Query: 546  VKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGV 725
            + +F EIG LKEA DM+ ++QNQGL+L+ H LN ++ +A E   ++ A  +F +MC+R V
Sbjct: 172  LGSFAEIGRLKEAGDMILDLQNQGLMLTTHILNSVVRIAWEMNSIEYAEEMFEEMCQREV 231

Query: 726  VPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALW 905
             P+  S++SMVVGYCR+ R+ EAD+WLS ML++GF VDNAT TLI++ +C+KG  N ALW
Sbjct: 232  SPDPSSYKSMVVGYCRIGRVLEADKWLSEMLDKGFAVDNATLTLIISTFCKKGFANHALW 291

Query: 906  IFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLC 1085
             FNK++ MG +PN+IN+TSLINGLC+RGS+K+ FE+LEEMVSKGW+PNVYTHTALIDGLC
Sbjct: 292  FFNKMIGMGLSPNLINYTSLINGLCRRGSVKKGFEMLEEMVSKGWRPNVYTHTALIDGLC 351

Query: 1086 KKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNA 1265
            KKGWTEKAFRLFLKLVRSD+YKPNVHTYT+MI+GYC+E K+NRAEML  +M EQGL PN 
Sbjct: 352  KKGWTEKAFRLFLKLVRSDNYKPNVHTYTSMISGYCREEKMNRAEMLFSKMKEQGLVPNT 411

Query: 1266 HSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKR 1445
            ++Y+TLIDG+CK GN   +Y+LMD++   G  PNI  YN ++D L KKGR  +A+KL+K+
Sbjct: 412  NTYTTLIDGHCKAGNFKTAYQLMDSMRVDGFAPNIYTYNVVMDGLLKKGRIPDAHKLMKK 471

Query: 1446 GFETGLSADLVTFTILISECCKHGDMGQ--ALAHLSKMTKTSLMPDTHVYTILISAFCRQ 1619
                G+ +D+VT+TILISE CK G+     AL   +KM K  + PD H+YT LI+ FCRQ
Sbjct: 472  ASWDGVRSDIVTYTILISEHCKKGETTDTGALMLFNKMVKVGIQPDIHLYTSLIAFFCRQ 531

Query: 1620 KRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTY 1799
            KRM ESER F DA + G+ P+ E YTSMI GY RD+NV+MA KF+ +M   GCIPDS+ Y
Sbjct: 532  KRMAESERFFEDAIRYGLEPTKETYTSMICGYCRDENVAMASKFFRRMTGHGCIPDSIAY 591

Query: 1800 GAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDK 1979
            GA+ISGLCKD ++++A+ LY+TM+DKGLSPCEVTR+T+AYEYCKK   S AM +L+RLDK
Sbjct: 592  GALISGLCKDERLDDARRLYDTMVDKGLSPCEVTRVTLAYEYCKKENFSAAMAILERLDK 651

Query: 1980 KLWMRTFNTLIRKLCSEKK 2036
            +LW+RT NTLIRKLC+ KK
Sbjct: 652  RLWIRTVNTLIRKLCNNKK 670


>gb|EOY27561.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao]
          Length = 692

 Score =  762 bits (1968), Expect = 0.0
 Identities = 373/600 (62%), Positives = 458/600 (76%)
 Frame = +3

Query: 237  SVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEEAITVAASLADEEGSLVA 416
            S +KT+ S V ESY    Q+ + R    KL + I+   L  E+AI++ ASL +E GS+VA
Sbjct: 45   SFIKTICSQVYESY---HQQAHLRFSPPKLTLNINPYCLTHEQAISIVASLENEAGSMVA 101

Query: 417  LCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMV 596
            L FF+W +   KF+ FMR YIV AT LIKNGNF++ANEV+ C+V++F E+G LKEAV+MV
Sbjct: 102  LSFFHWVLEISKFRLFMRLYIVTATSLIKNGNFDKANEVMQCLVRSFAEVGRLKEAVEMV 161

Query: 597  FEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRV 776
            FEMQN GL     TLNCIL V  E G +D    VF +M ERGV  +  S++ MVVGYCR+
Sbjct: 162  FEMQNHGLKPKAETLNCILGVGFEMGLMDYLEKVFDEMSERGVCGDCSSYKLMVVGYCRM 221

Query: 777  SRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINF 956
              +SE  +WL+ ML RGF+VDNATCTL+++L+CEKG  +RA W F+K+V+MGF PN+IN+
Sbjct: 222  GMVSEVVKWLTEMLGRGFIVDNATCTLVISLFCEKGFASRASWYFDKMVKMGFKPNLINY 281

Query: 957  TSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLKLVR 1136
            + LINGLCKRGSIK+AF  LE+MV  GWKPNVY HTALIDGLC+KGWTEKAFRLFLKLVR
Sbjct: 282  SCLINGLCKRGSIKQAFGKLEDMVRAGWKPNVYIHTALIDGLCRKGWTEKAFRLFLKLVR 341

Query: 1137 SDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLD 1316
            SD+YK NV TYT+MI+GYCKE KLNRAEMLL RM EQGL PN ++Y+TLIDG+CKVGN D
Sbjct: 342  SDNYKLNVLTYTSMISGYCKEEKLNRAEMLLSRMKEQGLVPNTNTYTTLIDGHCKVGNFD 401

Query: 1317 RSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFETGLSADLVTFTILI 1496
            R+YE MD + K G  PNIC YN +I  LCKKGR  EA++LL+ G   GL AD VT+TILI
Sbjct: 402  RAYEFMDVMDKEGFAPNICTYNAIIGGLCKKGRVEEAHELLRDGLLHGLQADRVTYTILI 461

Query: 1497 SECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTESERIFNDATKLGIA 1676
            +E CK  D G+ LA   K  K  L PD H Y  LI++FC+QK+M ESE +F +A +LG+ 
Sbjct: 462  TEHCKQADTGRVLAFFCKTVKVGLQPDMHSYNTLIASFCKQKKMKESENLFEEALRLGLV 521

Query: 1677 PSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEAKIL 1856
            P+ E YTSMI GY RD NVS+ LKF+ KM   GC+PDS+ YG +ISGLCK+ ++EEA  L
Sbjct: 522  PTKETYTSMICGYSRDGNVSLGLKFFSKMNDHGCVPDSIAYGTVISGLCKESRLEEACQL 581

Query: 1857 YNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMRTFNTLIRKLCSEKK 2036
            Y TMMD+GLSPCEVTRLTIAYEYCKKG+ + AMV+L+RL+KKLWMRT NTLIRKLCSEKK
Sbjct: 582  YETMMDRGLSPCEVTRLTIAYEYCKKGDSAVAMVMLERLEKKLWMRTVNTLIRKLCSEKK 641


>ref|XP_004157755.1| PREDICTED: uncharacterized protein LOC101223774 [Cucumis sativus]
          Length = 1315

 Score =  758 bits (1956), Expect = 0.0
 Identities = 359/593 (60%), Positives = 464/593 (78%)
 Frame = +3

Query: 258  SLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEEAITVAASLADEEGSLVALCFFYWA 437
            SLV ++Y RQ    + R    KLN+ +D+  L  E+AI+  A LA EEGS+VAL FFYWA
Sbjct: 675  SLVLDTYLRQP---HLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWA 731

Query: 438  IGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKNFGEIGMLKEAVDMVFEMQNQG 617
            +G  KF+YFMR YIV    L+   N ERA+EV+ CMV  F EIG LKEAVDM+ +M+NQG
Sbjct: 732  VGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQG 791

Query: 618  LVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPNTCSFESMVVGYCRVSRISEAD 797
            LVL+   +N I+ VAAE   V+ A NVF +M  RGV P++C+++ ++VGYCR   + EAD
Sbjct: 792  LVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEAD 851

Query: 798  RWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLVEMGFTPNVINFTSLINGL 977
            RW+  M+ERGF+VDNAT TLI+T +CEK  +NRA+W F+K+ +MG +PN+IN++S+I+GL
Sbjct: 852  RWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGL 911

Query: 978  CKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLKLVRSDHYKPN 1157
            CKRGS+K+AFELLEEMV  GWKPNVYTHT+LI GLCKKGWTE+AFRLFLKL+RSD+YKPN
Sbjct: 912  CKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPN 971

Query: 1158 VHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMD 1337
            VHTYTAMI+GYCKE KL+RAEML  RM EQGL PN ++Y+TLIDG+CK GN  ++YELM+
Sbjct: 972  VHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELME 1031

Query: 1338 TVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFETGLSADLVTFTILISECCKHG 1517
             ++  G  PN C YN ++D LCK+GRA EA+KLL  GF+  + AD VT+TILISE CK  
Sbjct: 1032 LMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCKRA 1091

Query: 1518 DMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTESERIFNDATKLGIAPSTEMYT 1697
            DM QAL  L+KM K    PD H+YT LI+AFCRQ  M +SE++F++  KLG+AP+ E YT
Sbjct: 1092 DMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETYT 1151

Query: 1698 SMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEAKILYNTMMDK 1877
            SMI GY R+K VS+A+KF++KM   GC PDS++YGA+ISGLCK+ +++EA+ LY+TM+DK
Sbjct: 1152 SMICGYCREKKVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMIDK 1211

Query: 1878 GLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMRTFNTLIRKLCSEKK 2036
            GLSPCEVTR+T+ YEYCK  + ++AMV+L+RL+KKLW+RT +TLIRKLC EKK
Sbjct: 1212 GLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLCCEKK 1264


>ref|XP_002867913.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297313749|gb|EFH44172.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 724

 Score =  755 bits (1949), Expect = 0.0
 Identities = 370/619 (59%), Positives = 463/619 (74%)
 Frame = +3

Query: 180  HVSVRTMSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNP 359
            H S      + S S  PS S+VK+V SLV  SY RQ           ++N+  D+  L  
Sbjct: 58   HESSDVSPPSSSPSSPPSQSLVKSVCSLVYNSYLRQNHVIQSPH---RVNLDFDANSLTH 114

Query: 360  EEAITVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIH 539
            E+AITV ASLA E GS+VALCFFYWA+G  KF++FMR Y+V A  LI NGN ++A+EV+ 
Sbjct: 115  EQAITVVASLASESGSMVALCFFYWAVGFEKFRHFMRLYLVTADSLIANGNLQKAHEVMR 174

Query: 540  CMVKNFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCER 719
            CM++NF EIG L EAV MV +MQNQGL  S  T+NC+L +A E+G +D A NVF +M  R
Sbjct: 175  CMLRNFSEIGRLNEAVGMVMDMQNQGLSPSSITMNCVLEIAIESGLIDYAENVFDEMSVR 234

Query: 720  GVVPNTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRA 899
            GV P++ SF+ MV+G  R  +I EADRWLS M++RGF+ DNATCTLI++  CE G +NRA
Sbjct: 235  GVCPDSSSFKLMVIGCFRDGKIQEADRWLSGMIQRGFIPDNATCTLILSALCENGLVNRA 294

Query: 900  LWIFNKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDG 1079
            +W F K++++GF PN+INFTSLI+GLCK+GSIK+AFE+LEEMV  GWKPNVYTHTALIDG
Sbjct: 295  IWYFRKMIDLGFKPNLINFTSLIDGLCKKGSIKQAFEMLEEMVRNGWKPNVYTHTALIDG 354

Query: 1080 LCKKGWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTP 1259
            LCK+GWTEKAFRLFLKLVRSD YKPNVHTYT+MI GYCKE KLNRAEML  RM EQGL P
Sbjct: 355  LCKRGWTEKAFRLFLKLVRSDIYKPNVHTYTSMIGGYCKEDKLNRAEMLFSRMKEQGLFP 414

Query: 1260 NAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLL 1439
            N ++Y+TLI+G+CK GN DR+YELM+ +   G  PNI  YN +IDSLCKK RA EAY+LL
Sbjct: 415  NVNTYTTLINGHCKAGNFDRAYELMNLMDDEGFRPNIYTYNAVIDSLCKKSRAPEAYELL 474

Query: 1440 KRGFETGLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQ 1619
             + F  GL AD VT+TILI E CK  D+ QALA   +M KT    D  +  ILI+AFCRQ
Sbjct: 475  NKAFSCGLEADGVTYTILIQEQCKQSDIKQALAFFCRMNKTGFEADMRLNNILIAAFCRQ 534

Query: 1620 KRMTESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTY 1799
            K+M ESER+F     LG+ P+ E YTSMISGY ++ +  +ALK++  M + GC+PDS TY
Sbjct: 535  KKMKESERLFQLVVSLGLVPTKETYTSMISGYCKEGDFDLALKYFHNMKRHGCVPDSFTY 594

Query: 1800 GAMISGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDK 1979
            G++ISGLCK   V+EA  LY  M+D+GLSP EVTR+T+AYEYCK+ + ++AM++L+ LDK
Sbjct: 595  GSLISGLCKKSMVDEACKLYEAMIDRGLSPPEVTRVTLAYEYCKRNDSASAMIVLEPLDK 654

Query: 1980 KLWMRTFNTLIRKLCSEKK 2036
            KLW+RT  TL+RKLCSEKK
Sbjct: 655  KLWIRTVRTLVRKLCSEKK 673



 Score =  135 bits (339), Expect = 9e-29
 Identities = 96/384 (25%), Positives = 162/384 (42%), Gaps = 67/384 (17%)
 Frame = +3

Query: 567  GMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVV-PNTCS 743
            G +K+A +M+ EM   G   +V+T   ++    + G  + A  +F  +    +  PN  +
Sbjct: 324  GSIKQAFEMLEEMVRNGWKPNVYTHTALIDGLCKRGWTEKAFRLFLKLVRSDIYKPNVHT 383

Query: 744  FESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLV 923
            + SM+ GYC+  +++ A+   S M E+G   +  T T ++  +C+ G  +RA  + N + 
Sbjct: 384  YTSMIGGYCKEDKLNRAEMLFSRMKEQGLFPNVNTYTTLINGHCKAGNFDRAYELMNLMD 443

Query: 924  EMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTE 1103
            + GF PN+  + ++I+ LCK+     A+ELL +  S G + +  T+T LI   CK+   +
Sbjct: 444  DEGFRPNIYTYNAVIDSLCKKSRAPEAYELLNKAFSCGLEADGVTYTILIQEQCKQSDIK 503

Query: 1104 KAFRLFLK----------------------------------LVRSDHYKPNVHTYTAMI 1181
            +A   F +                                  LV S    P   TYT+MI
Sbjct: 504  QALAFFCRMNKTGFEADMRLNNILIAAFCRQKKMKESERLFQLVVSLGLVPTKETYTSMI 563

Query: 1182 AGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLV 1361
            +GYCKEG  + A      M   G  P++ +Y +LI G CK   +D + +L + +   GL 
Sbjct: 564  SGYCKEGDFDLALKYFHNMKRHGCVPDSFTYGSLISGLCKKSMVDEACKLYEAMIDRGLS 623

Query: 1362 P----------NICIYN----------------------CMIDSLCKKGRASEAYKLLKR 1445
            P            C  N                       ++  LC + +   A    ++
Sbjct: 624  PPEVTRVTLAYEYCKRNDSASAMIVLEPLDKKLWIRTVRTLVRKLCSEKKVGVAALFFQK 683

Query: 1446 GFETGLSADLVTFTILISECCKHG 1517
              E   SAD VT     + C + G
Sbjct: 684  LLEKDSSADRVTLAAFTTACSESG 707


>ref|NP_001190774.1| Pentatricopeptide repeat domain-containing protein [Arabidopsis
            thaliana] gi|223635614|sp|P0C8Q3.1|PP326_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g19890 gi|332658842|gb|AEE84242.1| Pentatricopeptide
            repeat domain-containing protein [Arabidopsis thaliana]
          Length = 701

 Score =  754 bits (1946), Expect = 0.0
 Identities = 366/613 (59%), Positives = 462/613 (75%)
 Frame = +3

Query: 198  MSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEEAITV 377
            +S   S S  PS  +VK+V SLVC SY RQ    +  S   ++N+  D+  L  E+AITV
Sbjct: 41   LSLPSSPSSSPSQCLVKSVCSLVCTSYLRQN---HVVSSPHRVNLDFDANSLTHEQAITV 97

Query: 378  AASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKNF 557
             ASLA E GS+VALCFFYWA+G  KF++FMR Y+V A  L+ NGN ++A+EV+ CM++NF
Sbjct: 98   VASLASESGSMVALCFFYWAVGFEKFRHFMRLYLVTADSLLANGNLQKAHEVMRCMLRNF 157

Query: 558  GEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPNT 737
             EIG L EAV MV +MQNQGL  S  T+NC+L +A E G ++ A NVF +M  RGVVP++
Sbjct: 158  SEIGRLNEAVGMVMDMQNQGLTPSSITMNCVLEIAVELGLIEYAENVFDEMSVRGVVPDS 217

Query: 738  CSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNK 917
             S++ MV+G  R  +I EADRWL+ M++RGF+ DNATCTLI+T  CE G +NRA+W F K
Sbjct: 218  SSYKLMVIGCFRDGKIQEADRWLTGMIQRGFIPDNATCTLILTALCENGLVNRAIWYFRK 277

Query: 918  LVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGW 1097
            ++++GF PN+INFTSLI+GLCK+GSIK+AFE+LEEMV  GWKPNVYTHTALIDGLCK+GW
Sbjct: 278  MIDLGFKPNLINFTSLIDGLCKKGSIKQAFEMLEEMVRNGWKPNVYTHTALIDGLCKRGW 337

Query: 1098 TEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYS 1277
            TEKAFRLFLKLVRSD YKPNVHTYT+MI GYCKE KLNRAEML  RM EQGL PN ++Y+
Sbjct: 338  TEKAFRLFLKLVRSDTYKPNVHTYTSMIGGYCKEDKLNRAEMLFSRMKEQGLFPNVNTYT 397

Query: 1278 TLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFET 1457
            TLI+G+CK G+  R+YELM+ +   G +PNI  YN  IDSLCKK RA EAY+LL + F  
Sbjct: 398  TLINGHCKAGSFGRAYELMNLMGDEGFMPNIYTYNAAIDSLCKKSRAPEAYELLNKAFSC 457

Query: 1458 GLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTES 1637
            GL AD VT+TILI E CK  D+ QALA   +M KT    D  +  ILI+AFCRQK+M ES
Sbjct: 458  GLEADGVTYTILIQEQCKQNDINQALAFFCRMNKTGFEADMRLNNILIAAFCRQKKMKES 517

Query: 1638 ERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISG 1817
            ER+F     LG+ P+ E YTSMIS Y ++ ++ +ALK++  M + GC+PDS TYG++ISG
Sbjct: 518  ERLFQLVVSLGLIPTKETYTSMISCYCKEGDIDLALKYFHNMKRHGCVPDSFTYGSLISG 577

Query: 1818 LCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMRT 1997
            LCK   V+EA  LY  M+D+GLSP EVTR+T+AYEYCK+ + + AM+LL+ LDKKLW+RT
Sbjct: 578  LCKKSMVDEACKLYEAMIDRGLSPPEVTRVTLAYEYCKRNDSANAMILLEPLDKKLWIRT 637

Query: 1998 FNTLIRKLCSEKK 2036
              TL+RKLCSEKK
Sbjct: 638  VRTLVRKLCSEKK 650



 Score =  129 bits (323), Expect = 6e-27
 Identities = 95/384 (24%), Positives = 158/384 (41%), Gaps = 67/384 (17%)
 Frame = +3

Query: 567  GMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVV-PNTCS 743
            G +K+A +M+ EM   G   +V+T   ++    + G  + A  +F  +       PN  +
Sbjct: 301  GSIKQAFEMLEEMVRNGWKPNVYTHTALIDGLCKRGWTEKAFRLFLKLVRSDTYKPNVHT 360

Query: 744  FESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLV 923
            + SM+ GYC+  +++ A+   S M E+G   +  T T ++  +C+ G   RA  + N + 
Sbjct: 361  YTSMIGGYCKEDKLNRAEMLFSRMKEQGLFPNVNTYTTLINGHCKAGSFGRAYELMNLMG 420

Query: 924  EMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTE 1103
            + GF PN+  + + I+ LCK+     A+ELL +  S G + +  T+T LI   CK+    
Sbjct: 421  DEGFMPNIYTYNAAIDSLCKKSRAPEAYELLNKAFSCGLEADGVTYTILIQEQCKQNDIN 480

Query: 1104 KAFRLFLK----------------------------------LVRSDHYKPNVHTYTAMI 1181
            +A   F +                                  LV S    P   TYT+MI
Sbjct: 481  QALAFFCRMNKTGFEADMRLNNILIAAFCRQKKMKESERLFQLVVSLGLIPTKETYTSMI 540

Query: 1182 AGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLV 1361
            + YCKEG ++ A      M   G  P++ +Y +LI G CK   +D + +L + +   GL 
Sbjct: 541  SCYCKEGDIDLALKYFHNMKRHGCVPDSFTYGSLISGLCKKSMVDEACKLYEAMIDRGLS 600

Query: 1362 P----------NICIYN----------------------CMIDSLCKKGRASEAYKLLKR 1445
            P            C  N                       ++  LC + +   A    ++
Sbjct: 601  PPEVTRVTLAYEYCKRNDSANAMILLEPLDKKLWIRTVRTLVRKLCSEKKVGVAALFFQK 660

Query: 1446 GFETGLSADLVTFTILISECCKHG 1517
              E   SAD VT     + C + G
Sbjct: 661  LLEKDSSADRVTLAAFTTACSESG 684


>emb|CAB52870.1| putative protein [Arabidopsis thaliana] gi|7268785|emb|CAB78991.1|
            putative protein [Arabidopsis thaliana]
          Length = 1302

 Score =  754 bits (1946), Expect = 0.0
 Identities = 366/613 (59%), Positives = 462/613 (75%)
 Frame = +3

Query: 198  MSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEEAITV 377
            +S   S S  PS  +VK+V SLVC SY RQ    +  S   ++N+  D+  L  E+AITV
Sbjct: 642  LSLPSSPSSSPSQCLVKSVCSLVCTSYLRQN---HVVSSPHRVNLDFDANSLTHEQAITV 698

Query: 378  AASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKNF 557
             ASLA E GS+VALCFFYWA+G  KF++FMR Y+V A  L+ NGN ++A+EV+ CM++NF
Sbjct: 699  VASLASESGSMVALCFFYWAVGFEKFRHFMRLYLVTADSLLANGNLQKAHEVMRCMLRNF 758

Query: 558  GEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPNT 737
             EIG L EAV MV +MQNQGL  S  T+NC+L +A E G ++ A NVF +M  RGVVP++
Sbjct: 759  SEIGRLNEAVGMVMDMQNQGLTPSSITMNCVLEIAVELGLIEYAENVFDEMSVRGVVPDS 818

Query: 738  CSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNK 917
             S++ MV+G  R  +I EADRWL+ M++RGF+ DNATCTLI+T  CE G +NRA+W F K
Sbjct: 819  SSYKLMVIGCFRDGKIQEADRWLTGMIQRGFIPDNATCTLILTALCENGLVNRAIWYFRK 878

Query: 918  LVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGW 1097
            ++++GF PN+INFTSLI+GLCK+GSIK+AFE+LEEMV  GWKPNVYTHTALIDGLCK+GW
Sbjct: 879  MIDLGFKPNLINFTSLIDGLCKKGSIKQAFEMLEEMVRNGWKPNVYTHTALIDGLCKRGW 938

Query: 1098 TEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYS 1277
            TEKAFRLFLKLVRSD YKPNVHTYT+MI GYCKE KLNRAEML  RM EQGL PN ++Y+
Sbjct: 939  TEKAFRLFLKLVRSDTYKPNVHTYTSMIGGYCKEDKLNRAEMLFSRMKEQGLFPNVNTYT 998

Query: 1278 TLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFET 1457
            TLI+G+CK G+  R+YELM+ +   G +PNI  YN  IDSLCKK RA EAY+LL + F  
Sbjct: 999  TLINGHCKAGSFGRAYELMNLMGDEGFMPNIYTYNAAIDSLCKKSRAPEAYELLNKAFSC 1058

Query: 1458 GLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTES 1637
            GL AD VT+TILI E CK  D+ QALA   +M KT    D  +  ILI+AFCRQK+M ES
Sbjct: 1059 GLEADGVTYTILIQEQCKQNDINQALAFFCRMNKTGFEADMRLNNILIAAFCRQKKMKES 1118

Query: 1638 ERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISG 1817
            ER+F     LG+ P+ E YTSMIS Y ++ ++ +ALK++  M + GC+PDS TYG++ISG
Sbjct: 1119 ERLFQLVVSLGLIPTKETYTSMISCYCKEGDIDLALKYFHNMKRHGCVPDSFTYGSLISG 1178

Query: 1818 LCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMRT 1997
            LCK   V+EA  LY  M+D+GLSP EVTR+T+AYEYCK+ + + AM+LL+ LDKKLW+RT
Sbjct: 1179 LCKKSMVDEACKLYEAMIDRGLSPPEVTRVTLAYEYCKRNDSANAMILLEPLDKKLWIRT 1238

Query: 1998 FNTLIRKLCSEKK 2036
              TL+RKLCSEKK
Sbjct: 1239 VRTLVRKLCSEKK 1251



 Score =  129 bits (323), Expect = 6e-27
 Identities = 95/384 (24%), Positives = 158/384 (41%), Gaps = 67/384 (17%)
 Frame = +3

Query: 567  GMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVV-PNTCS 743
            G +K+A +M+ EM   G   +V+T   ++    + G  + A  +F  +       PN  +
Sbjct: 902  GSIKQAFEMLEEMVRNGWKPNVYTHTALIDGLCKRGWTEKAFRLFLKLVRSDTYKPNVHT 961

Query: 744  FESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLV 923
            + SM+ GYC+  +++ A+   S M E+G   +  T T ++  +C+ G   RA  + N + 
Sbjct: 962  YTSMIGGYCKEDKLNRAEMLFSRMKEQGLFPNVNTYTTLINGHCKAGSFGRAYELMNLMG 1021

Query: 924  EMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTE 1103
            + GF PN+  + + I+ LCK+     A+ELL +  S G + +  T+T LI   CK+    
Sbjct: 1022 DEGFMPNIYTYNAAIDSLCKKSRAPEAYELLNKAFSCGLEADGVTYTILIQEQCKQNDIN 1081

Query: 1104 KAFRLFLK----------------------------------LVRSDHYKPNVHTYTAMI 1181
            +A   F +                                  LV S    P   TYT+MI
Sbjct: 1082 QALAFFCRMNKTGFEADMRLNNILIAAFCRQKKMKESERLFQLVVSLGLIPTKETYTSMI 1141

Query: 1182 AGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVGNLDRSYELMDTVAKVGLV 1361
            + YCKEG ++ A      M   G  P++ +Y +LI G CK   +D + +L + +   GL 
Sbjct: 1142 SCYCKEGDIDLALKYFHNMKRHGCVPDSFTYGSLISGLCKKSMVDEACKLYEAMIDRGLS 1201

Query: 1362 P----------NICIYN----------------------CMIDSLCKKGRASEAYKLLKR 1445
            P            C  N                       ++  LC + +   A    ++
Sbjct: 1202 PPEVTRVTLAYEYCKRNDSANAMILLEPLDKKLWIRTVRTLVRKLCSEKKVGVAALFFQK 1261

Query: 1446 GFETGLSADLVTFTILISECCKHG 1517
              E   SAD VT     + C + G
Sbjct: 1262 LLEKDSSADRVTLAAFTTACSESG 1285


>ref|XP_006413926.1| hypothetical protein EUTSA_v10027430mg, partial [Eutrema salsugineum]
            gi|557115096|gb|ESQ55379.1| hypothetical protein
            EUTSA_v10027430mg, partial [Eutrema salsugineum]
          Length = 677

 Score =  751 bits (1938), Expect = 0.0
 Identities = 365/614 (59%), Positives = 465/614 (75%), Gaps = 2/614 (0%)
 Frame = +3

Query: 201  SHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIIC--KLNIPIDSEFLNPEEAIT 374
            S + S S  PS S+VK+V SLVC SY RQ       +I+   ++N+ +D+  L  E+AIT
Sbjct: 18   SPSPSPSSSPSQSLVKSVCSLVCHSYLRQTH-----AILSPHRVNLDLDANSLTHEQAIT 72

Query: 375  VAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKN 554
            V ASLA E GS+VALCFFYW++G  KF +FMR Y+V A  LI NGN E+A+EV+ CM++N
Sbjct: 73   VVASLASEAGSMVALCFFYWSVGFEKFHHFMRLYLVTADSLIANGNMEKAHEVMRCMLRN 132

Query: 555  FGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPN 734
            F EIG L EAV MV +MQNQGL  S  TLNC+L +A E+G ++ A NVF +M  RGV P+
Sbjct: 133  FSEIGRLNEAVGMVMDMQNQGLSPSATTLNCVLEIAIESGLIEYAENVFDEMSVRGVCPD 192

Query: 735  TCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFN 914
            + S++ MV+G  R  +I EADRWL+ M++RGF+ DNATCTLI++  CE G +NRA+W F 
Sbjct: 193  SSSYKLMVIGCFREGKIQEADRWLNGMIQRGFVPDNATCTLILSALCENGLVNRAIWYFR 252

Query: 915  KLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKG 1094
            K++++G  PN+INFTSLI+GLCK+GSIK+AFE+LEEMV  GWKPNVYTHTALIDGLCK+G
Sbjct: 253  KMIDIGLKPNLINFTSLIDGLCKKGSIKQAFEMLEEMVRIGWKPNVYTHTALIDGLCKRG 312

Query: 1095 WTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSY 1274
            WTEKAFRLFLKLVRSD+YKPNV+TYT+MI GYCKE KLNRAEML  RM EQGL PN ++Y
Sbjct: 313  WTEKAFRLFLKLVRSDNYKPNVYTYTSMIGGYCKEDKLNRAEMLFTRMKEQGLIPNVNTY 372

Query: 1275 STLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFE 1454
            +TLI+G+CK GN DR+YELM+ + + G  PNI  YN ++DSLCKK RASEAY+LL + F 
Sbjct: 373  TTLINGHCKAGNFDRAYELMNLMGEEGFKPNIYTYNAVVDSLCKKSRASEAYELLNKAFS 432

Query: 1455 TGLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTE 1634
             GL AD VT+TILI E CK  D+ QALA   +M K     D  +  ILI+AFCRQK+M E
Sbjct: 433  AGLEADGVTYTILIQEQCKQSDINQALAFFCRMKKIGFEADMRLNNILIAAFCRQKQMKE 492

Query: 1635 SERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMIS 1814
            SE++F     LG+ P+ E YTSMISGY ++ ++ +AL++   M + GC+ DS TYG++IS
Sbjct: 493  SEKLFQYVVSLGLVPTKETYTSMISGYCKEGDIDLALRYLHNMKRHGCVADSFTYGSLIS 552

Query: 1815 GLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMR 1994
            GLCK   V+EA  LY  M+DKG+SP EVTR+TIAYEYCK+ + + AM+LL+ LDKKLW+R
Sbjct: 553  GLCKKSMVDEACKLYEAMIDKGISPSEVTRVTIAYEYCKRNDSANAMILLEPLDKKLWIR 612

Query: 1995 TFNTLIRKLCSEKK 2036
            T  TL+RKLCSEKK
Sbjct: 613  TVRTLVRKLCSEKK 626


>ref|XP_003533559.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19890-like
            isoform X1 [Glycine max] gi|571479155|ref|XP_006587776.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g19890-like isoform X2 [Glycine max]
            gi|571479157|ref|XP_006587777.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g19890-like isoform X3 [Glycine max]
          Length = 693

 Score =  746 bits (1927), Expect = 0.0
 Identities = 355/615 (57%), Positives = 459/615 (74%)
 Frame = +3

Query: 192  RTMSHAHSDSVIPSHSVVKTVRSLVCESYSRQQQKQNFRSIICKLNIPIDSEFLNPEEAI 371
            +T++H  S S +   S V  V SLV +SY        F      L++ +D   L  ++A+
Sbjct: 25   KTLTHITSPSCV--QSTVTRVCSLVYDSYHHHYNHARFSPPT--LHLDVDPNSLTHDQAV 80

Query: 372  TVAASLADEEGSLVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVK 551
            T+ ASLA + GS+VAL FF WAI   KF++F R YI  A  LI N NFE+A+EV+ CMVK
Sbjct: 81   TIVASLASDAGSMVALSFFNWAIASSKFRHFTRLYIACAASLISNKNFEKAHEVMQCMVK 140

Query: 552  NFGEIGMLKEAVDMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVP 731
            +F EIG +KEA++MV EM NQGL  S  TLN ++ +  E G V+ A N+F +MC RGV P
Sbjct: 141  SFAEIGRVKEAIEMVIEMHNQGLAPSTKTLNWVVKIVTEMGLVEYAENLFDEMCARGVQP 200

Query: 732  NTCSFESMVVGYCRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIF 911
            N  S+  MVVGYC++  + E+DRWL  M+ERGF+VDNAT +LI+  +CEKG + RALW F
Sbjct: 201  NCVSYRVMVVGYCKLGNVLESDRWLGGMIERGFVVDNATLSLIVREFCEKGFVTRALWYF 260

Query: 912  NKLVEMGFTPNVINFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKK 1091
             +  EMG  PN+INFT +I GLCKRGS+K+AFE+LEEMV +GWKPNVYTHTALIDGLCKK
Sbjct: 261  RRFCEMGLRPNLINFTCMIEGLCKRGSVKQAFEMLEEMVGRGWKPNVYTHTALIDGLCKK 320

Query: 1092 GWTEKAFRLFLKLVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHS 1271
            GWTEKAFRLFLKLVRS+++KPNV TYTAMI+GYC++ K+NRAEMLL RM EQGL PN ++
Sbjct: 321  GWTEKAFRLFLKLVRSENHKPNVLTYTAMISGYCRDEKMNRAEMLLSRMKEQGLAPNTNT 380

Query: 1272 YSTLIDGYCKVGNLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGF 1451
            Y+TLIDG+CK GN +R+YELM+ + + G  PN+C YN ++D LCKKGR  EAYK+LK GF
Sbjct: 381  YTTLIDGHCKAGNFERAYELMNVMNEEGFSPNVCTYNAIVDGLCKKGRVQEAYKVLKSGF 440

Query: 1452 ETGLSADLVTFTILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMT 1631
              GL AD VT+TILISE CK  ++ QAL   +KM K+ + PD H YT LI+ FCR+KRM 
Sbjct: 441  RNGLDADKVTYTILISEHCKQAEIKQALVLFNKMVKSGIQPDIHSYTTLIAVFCREKRMK 500

Query: 1632 ESERIFNDATKLGIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMI 1811
            ESE  F +A + G+ P+ + YTSMI GY R+ N+ +ALKF+ +M   GC  DS+TYGA+I
Sbjct: 501  ESEMFFEEAVRFGLVPTNKTYTSMICGYCREGNLRLALKFFHRMSDHGCASDSITYGALI 560

Query: 1812 SGLCKDLKVEEAKILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWM 1991
            SGLCK  K++EA+ LY+ M++KGL+PCEVTR+T+AYEYCK  +  +AMV+L+RL+KKLW+
Sbjct: 561  SGLCKQSKLDEARCLYDAMIEKGLTPCEVTRVTLAYEYCKIDDGCSAMVVLERLEKKLWV 620

Query: 1992 RTFNTLIRKLCSEKK 2036
            RT NTL+RKLCSE+K
Sbjct: 621  RTVNTLVRKLCSERK 635


>ref|XP_002322376.2| hypothetical protein POPTR_0015s15360g [Populus trichocarpa]
            gi|550322786|gb|EEF06503.2| hypothetical protein
            POPTR_0015s15360g [Populus trichocarpa]
          Length = 594

 Score =  743 bits (1919), Expect = 0.0
 Identities = 360/543 (66%), Positives = 427/543 (78%)
 Frame = +3

Query: 408  LVALCFFYWAIGHLKFKYFMRFYIVLATCLIKNGNFERANEVIHCMVKNFGEIGMLKEAV 587
            +VAL FF WAIG  KF+YFMRFYIV AT  I N NFERA+EV+ CMV+ F EIG  +EAV
Sbjct: 1    MVALSFFNWAIGFPKFRYFMRFYIVCATSFIGNENFERAHEVMDCMVRVFAEIGKFQEAV 60

Query: 588  DMVFEMQNQGLVLSVHTLNCILTVAAETGCVDTAHNVFGDMCERGVVPNTCSFESMVVGY 767
            +MV EM+N GLVL+V TLNC+  VA E G V  A NVF +M  RGV P++ S++ M + Y
Sbjct: 61   NMVIEMENHGLVLTVRTLNCVTGVAGEMGLVGYAENVFDEMRVRGVCPDSVSYKLMAIAY 120

Query: 768  CRVSRISEADRWLSAMLERGFLVDNATCTLIMTLYCEKGRLNRALWIFNKLVEMGFTPNV 947
            CR+ RIS+ DRWL  M+ RGF+VDNATCTL+++ +CEKG  +R  W F+K VE+G  PN+
Sbjct: 121  CRMGRISDTDRWLKEMVRRGFVVDNATCTLMISTFCEKGFASRVFWYFDKWVELGLKPNL 180

Query: 948  INFTSLINGLCKRGSIKRAFELLEEMVSKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLK 1127
            INFTSLINGLCKRGSIK+AFE+LEEMV KGWKPNVYTHTALIDGLCKKGWTEKAFRLFLK
Sbjct: 181  INFTSLINGLCKRGSIKQAFEMLEEMVKKGWKPNVYTHTALIDGLCKKGWTEKAFRLFLK 240

Query: 1128 LVRSDHYKPNVHTYTAMIAGYCKEGKLNRAEMLLGRMLEQGLTPNAHSYSTLIDGYCKVG 1307
            LVRSD YKPNVHTYT+MI GYCKE KLNRAEMLL RM EQGL PN  +Y+ LIDG+ K G
Sbjct: 241  LVRSDDYKPNVHTYTSMIHGYCKEDKLNRAEMLLSRMKEQGLVPNTKTYTCLIDGHSKAG 300

Query: 1308 NLDRSYELMDTVAKVGLVPNICIYNCMIDSLCKKGRASEAYKLLKRGFETGLSADLVTFT 1487
            N +++YELMD + K G   NI  YN  IDSLCKKGR  EA KLLK+GF  GL AD VT+T
Sbjct: 301  NFEKAYELMDLMGKEGFSANIFTYNAFIDSLCKKGRFLEACKLLKKGFRLGLQADTVTYT 360

Query: 1488 ILISECCKHGDMGQALAHLSKMTKTSLMPDTHVYTILISAFCRQKRMTESERIFNDATKL 1667
            ILISE C+  D  +AL   SKM K  + PD H Y  LI+AF RQ+RM ESE++F +A  L
Sbjct: 361  ILISELCRRADTREALVFFSKMFKAGVQPDMHTYNTLIAAFSRQRRMEESEKLFAEAVGL 420

Query: 1668 GIAPSTEMYTSMISGYFRDKNVSMALKFYEKMVKQGCIPDSLTYGAMISGLCKDLKVEEA 1847
            G+ P+ E YTSMI GY RD+NVS+ALKF+ +M   GC PDSLTYGA+ISGLCK+ K++EA
Sbjct: 421  GLVPTKETYTSMICGYCRDRNVSLALKFFNRMSDHGCTPDSLTYGALISGLCKESKLDEA 480

Query: 1848 KILYNTMMDKGLSPCEVTRLTIAYEYCKKGEPSTAMVLLDRLDKKLWMRTFNTLIRKLCS 2027
              LY  M+DKGLSPCEVTRLT+AYEYCK+ + +TAMV+L+RLDKKLW+RT NTLIRKLCS
Sbjct: 481  CQLYEAMVDKGLSPCEVTRLTLAYEYCKQDDSATAMVILERLDKKLWIRTVNTLIRKLCS 540

Query: 2028 EKK 2036
            E+K
Sbjct: 541  ERK 543

