BLASTX nr result

ID: Rehmannia23_contig00020933 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00020933
         (1227 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343601.1| PREDICTED: pentatricopeptide repeat-containi...   334   5e-89
ref|XP_004242995.1| PREDICTED: pentatricopeptide repeat-containi...   332   2e-88
ref|XP_003634022.1| PREDICTED: pentatricopeptide repeat-containi...   277   7e-72
gb|EPS64936.1| hypothetical protein M569_09839, partial [Genlise...   248   5e-63
gb|EOY31499.1| Tetratricopeptide repeat (TPR)-like superfamily p...   246   1e-62
ref|XP_004287149.1| PREDICTED: pentatricopeptide repeat-containi...   233   1e-58
gb|EMJ04907.1| hypothetical protein PRUPE_ppa019391mg, partial [...   215   3e-53
ref|XP_002528404.1| pentatricopeptide repeat-containing protein,...   214   6e-53
gb|EXB42922.1| Pentatricopeptide repeat-containing protein [Moru...   212   3e-52
ref|XP_006474045.1| PREDICTED: pentatricopeptide repeat-containi...   211   5e-52
ref|XP_004146719.1| PREDICTED: pentatricopeptide repeat-containi...   211   6e-52
ref|XP_006453565.1| hypothetical protein CICLE_v10007430mg [Citr...   207   7e-51
ref|XP_006285536.1| hypothetical protein CARUB_v10006977mg [Caps...   206   2e-50
ref|XP_002869359.1| pentatricopeptide repeat-containing protein ...   192   2e-46
gb|ESW24614.1| hypothetical protein PHAVU_004G145400g [Phaseolus...   191   4e-46
ref|NP_567856.1| pentatricopeptide repeat-containing protein [Ar...   188   3e-45
emb|CAA18211.1| puative protein [Arabidopsis thaliana] gi|726998...   188   3e-45
ref|XP_006412665.1| hypothetical protein EUTSA_v10024344mg [Eutr...   186   1e-44
ref|XP_004975413.1| PREDICTED: pentatricopeptide repeat-containi...   185   3e-44
ref|XP_006857035.1| hypothetical protein AMTR_s00065p00020910 [A...   181   4e-43

>ref|XP_006343601.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825,
           chloroplastic-like isoform X1 [Solanum tuberosum]
           gi|565353364|ref|XP_006343602.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g30825,
           chloroplastic-like isoform X2 [Solanum tuberosum]
          Length = 937

 Score =  334 bits (856), Expect = 5e-89
 Identities = 170/326 (52%), Positives = 222/326 (68%)
 Frame = -3

Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVSTHGALIVKPFCKLKHIRVSR 800
           MASLKL +  D+ S+ES KL     +L F  +  +       GA +V PFC LKHIRVSR
Sbjct: 1   MASLKLPLYVDS-SWESKKLNCTVKALNFTDSKCWVPSFLGGGAFVVSPFCNLKHIRVSR 59

Query: 799 LDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKTVR 620
           L+ E L+T E +LD   +D  E  + G+D+ + E  +   DS K K N+WK+FR  K V 
Sbjct: 60  LETEELETSELSLDNEGVDGFEGEL-GNDSFVTERPNLGRDSQKGKFNVWKRFRRVKKVP 118

Query: 619 KNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSERCNFI 440
           +++     F     K    ENPM+    NS   ++D  + VD    ++G + S ++CN I
Sbjct: 119 RDSNHRSSFRLKDRKNGMEENPMIAFDVNSDESVIDSQNGVDFPDENIGSDSSLDQCNAI 178

Query: 439 LEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMVSDSD 260
           L++LE+ ND +AL+FF WM+ NGKLK NVTAYN ILRVLGR+GDWDGAE MI EM  +S 
Sbjct: 179 LKELERGNDGKALSFFRWMRKNGKLKQNVTAYNLILRVLGRRGDWDGAEGMIKEMSMESG 238

Query: 259 CELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVVEEAE 80
           C+L Y++FNTLIYAC+K GLV+LG++WF MML+  VQPN+ATFGMLM+LYQKG  VEEAE
Sbjct: 239 CKLTYQVFNTLIYACHKKGLVELGAKWFHMMLENGVQPNIATFGMLMALYQKGWHVEEAE 298

Query: 79  YTFSQMRNLKITCQSAYSALITIYTR 2
           + FS MRNLKI CQSAYS+++TIYTR
Sbjct: 299 FAFSMMRNLKIMCQSAYSSMLTIYTR 324


>ref|XP_004242995.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825,
            chloroplastic-like [Solanum lycopersicum]
          Length = 1201

 Score =  332 bits (850), Expect = 2e-88
 Identities = 168/330 (50%), Positives = 223/330 (67%)
 Frame = -3

Query: 991  PVCFMASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVSTHGALIVKPFCKLKHI 812
            P+C MASLKL +  D+ S+ES KL      L F  +          GA +V PFC LKHI
Sbjct: 261  PLCLMASLKLPLYVDS-SWESKKLNCTVKPLIFTDSKCCVPSFLGGGAFVVSPFCNLKHI 319

Query: 811  RVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGA 632
            RVSRL+ E L+T E ++D   +D  E  + G+++ + E  +   DS K K N+W++FR  
Sbjct: 320  RVSRLETEELETSELSIDNEGVDGFEGEL-GNESFVTERPNLGRDSKKGKFNVWRRFRRV 378

Query: 631  KTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSER 452
            K V K++     F     KY   ENP +    NS   ++D  + VD    ++G + S ++
Sbjct: 379  KKVPKDSNYRSSFRLKDRKYGTEENPRIVFDVNSDENVIDSQNGVDFHDENIGSDSSLDQ 438

Query: 451  CNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMV 272
            CN IL++LE+ +D +AL+FF WM+ NGKLK NVTAYN ILRVLGR+GDWDGAE MI EM 
Sbjct: 439  CNAILKELERGDDGKALSFFRWMRKNGKLKQNVTAYNLILRVLGRRGDWDGAEGMIKEMS 498

Query: 271  SDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVV 92
             +S C+L Y++FNTLIYAC+K GLV+LG++WF MML+  VQPN+ATFG+LM+LYQKG  V
Sbjct: 499  MESGCKLTYQVFNTLIYACHKKGLVELGAKWFHMMLENGVQPNIATFGLLMALYQKGWHV 558

Query: 91   EEAEYTFSQMRNLKITCQSAYSALITIYTR 2
            EEAE+ FS MRNLKI CQSAYS+++TIYTR
Sbjct: 559  EEAEFAFSMMRNLKIMCQSAYSSMLTIYTR 588


>ref|XP_003634022.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825,
           chloroplastic-like [Vitis vinifera]
           gi|297745081|emb|CBI38673.3| unnamed protein product
           [Vitis vinifera]
          Length = 900

 Score =  277 bits (708), Expect = 7e-72
 Identities = 155/329 (47%), Positives = 214/329 (65%), Gaps = 3/329 (0%)
 Frame = -3

Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVSTHGAL-IVKPFCKLKHIRVS 803
           MASLK SVS D  +Y+SNK                  + S + +L I+  F ++K I +S
Sbjct: 1   MASLKFSVSVD--TYDSNKF-----------------HFSVNPSLPIINSFARVKPINIS 41

Query: 802 RLDNESLDTCESNLDGFSIDNLEKYV--AGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAK 629
           RL+ ES DT +SN     +DN++ +   +G +NLI+E  +F  D       IW++ +G K
Sbjct: 42  RLEAESWDTSDSNS---VVDNIKTWNKDSGSENLILESSNFRND-------IWRRVQGVK 91

Query: 628 TVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSERC 449
            VR+            +K++   N     ++ S N   D    +D++   +GPELS ERC
Sbjct: 92  RVRRRD--------PNSKFRSIRNDNGHEEQKSVNHFDDE---IDVNEYGIGPELSVERC 140

Query: 448 NFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMVS 269
           N IL+ LE+ +DS+ + FFEWM+ NGKL+ NV+AYN  LRVLGR+GDWD AE MI EM  
Sbjct: 141 NAILKGLERCSDSKTMKFFEWMRENGKLEGNVSAYNLALRVLGRRGDWDAAETMIWEMNG 200

Query: 268 DSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVVE 89
           DSDC++N++++NTLIYACYK G V+LG++WF++ML+  V+PNVATFGM+MSLYQKG  V 
Sbjct: 201 DSDCQVNFQVYNTLIYACYKQGHVELGTKWFRLMLENGVRPNVATFGMVMSLYQKGWNVA 260

Query: 88  EAEYTFSQMRNLKITCQSAYSALITIYTR 2
           ++EY FSQMR+  ITCQSAYSA+ITIYTR
Sbjct: 261 DSEYAFSQMRSFGITCQSAYSAMITIYTR 289


>gb|EPS64936.1| hypothetical protein M569_09839, partial [Genlisea aurea]
          Length = 865

 Score =  248 bits (632), Expect = 5e-63
 Identities = 139/271 (51%), Positives = 183/271 (67%)
 Frame = -3

Query: 814 IRVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRG 635
           I VS L+N+  D+ ES  +   +D+ +K    + +   +G+D           + K+ R 
Sbjct: 1   ITVSNLENDVPDSSESKSN---LDSRKK----NRDFTAQGKD-----------VSKQCRI 42

Query: 634 AKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSE 455
           AK  R++  ++LD H    K +K   P    Q+ S+   L   + + LD  DV PE + E
Sbjct: 43  AKMWREHKKQSLDPHLQSKKSRK-VRPTSLQQRASSGSALGSETDLCLDSWDVRPEETIE 101

Query: 454 RCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEM 275
           RCN ILE+LEKS+DS+A++FF+WM++N KLK NV A+N ILRVL RK DWDGAE ++ EM
Sbjct: 102 RCNMILERLEKSDDSKAISFFKWMRLNQKLKKNVIAHNVILRVLTRKDDWDGAEGLVKEM 161

Query: 274 VSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLV 95
           VSDS C LNY+IFNT+IYACYK GL D+ +RWFKMML+Y+V PNVAT+GMLMSLYQK   
Sbjct: 162 VSDSGCLLNYQIFNTVIYACYKKGLSDVATRWFKMMLNYQVDPNVATYGMLMSLYQKNWA 221

Query: 94  VEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           VEEAE T + MR LKITC SAYS++ITIY R
Sbjct: 222 VEEAESTLTHMRKLKITCNSAYSSMITIYIR 252


>gb|EOY31499.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
           cacao]
          Length = 916

 Score =  246 bits (629), Expect = 1e-62
 Identities = 146/331 (44%), Positives = 197/331 (59%), Gaps = 5/331 (1%)
 Frame = -3

Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVS----THGALIVKPFCKLKHI 812
           MASLKL +S D  + +S KL    N      +     + S    T  A  +    +LKH 
Sbjct: 1   MASLKLPISLD--TVDSKKLNFYVNPSHVPDHCSIFSFTSCIHVTKAASNLTSLTRLKHF 58

Query: 811 RVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDN-LIIEGQDFHGDSGKRKVNIWKKFRG 635
           +VSR + E  +  E +     I    K    ++N   +EGQ   G + K           
Sbjct: 59  KVSRFETEFPNIPEPSPVDKDIHFSSKIDLVNENPKFVEGQK--GQNPK----------- 105

Query: 634 AKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSE 455
            K +RKN      F RN N+ ++ +              +  +S +D+D++ + P L+  
Sbjct: 106 -KGIRKNVGFKFRFRRNRNEIEREDL------------FVHNNSGLDVDYSAIKPNLNLP 152

Query: 454 RCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEM 275
            CNFIL++LE+SNDS AL FFEWM+ NGKLK NVTAY  +LRVLGR+ DWD AE+M+ + 
Sbjct: 153 HCNFILKRLERSNDSNALRFFEWMRSNGKLKGNVTAYRLVLRVLGRREDWDAAEMMLRQA 212

Query: 274 VSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLV 95
             DS C+LN+++FNT+IYAC K GLV+LG++WF+MML++  +PNVATFGMLM LYQKG  
Sbjct: 213 NGDSGCKLNFQVFNTIIYACSKKGLVELGAKWFRMMLEHGFRPNVATFGMLMGLYQKGWN 272

Query: 94  VEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
             EAE+TFSQMRN  I CQSAYSA+ITIYTR
Sbjct: 273 ASEAEFTFSQMRNSGIVCQSAYSAMITIYTR 303


>ref|XP_004287149.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 885

 Score =  233 bits (594), Expect = 1e-58
 Identities = 137/300 (45%), Positives = 186/300 (62%), Gaps = 4/300 (1%)
 Frame = -3

Query: 889 SNTLFSGYVSTHGALIVKPFCKLKHIRVSRLDNESLDTCES----NLDGFSIDNLEKYVA 722
           S+  F+ +  +  +L+V    ++  I+V+R  +E L+  ES    N D  S   + K ++
Sbjct: 15  SSKKFNSFCYSRASLVVNSLNRVNAIKVNRFQSE-LNVAESLNEQNPD-CSRHEIGKGIS 72

Query: 721 GDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKTVRKNTIRNLDFHRNGNKYKKHENPMVPL 542
           G   L            KR+V +    R +K VRK                  EN  V  
Sbjct: 73  GTKRL-----------SKREVGLRSSSRKSKWVRKL-----------------ENVFVN- 103

Query: 541 QKNSANPILDGHSVVDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLK 362
                    DG    D+D++ +  ++S E CN IL++LE+S+D + L FFEWM++NGKLK
Sbjct: 104 ---------DGE--FDVDYSVIKSDMSLEHCNDILKRLERSSDFKTLKFFEWMRINGKLK 152

Query: 361 NNVTAYNSILRVLGRKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSR 182
            NV+A+NS+ RVLGR+ +WD AE +I EMV++  CELNY++FNTLIYAC K G V+LG++
Sbjct: 153 GNVSAFNSVFRVLGRRENWDAAENLIQEMVTEFGCELNYQVFNTLIYACSKLGRVELGAK 212

Query: 181 WFKMMLDYKVQPNVATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           WF MML+Y VQPNVATFGMLM+LYQKG  VEEAE+TFS+MRN  I CQSAYSA+ITIYTR
Sbjct: 213 WFAMMLEYGVQPNVATFGMLMALYQKGWNVEEAEFTFSRMRNFGIVCQSAYSAMITIYTR 272


>gb|EMJ04907.1| hypothetical protein PRUPE_ppa019391mg, partial [Prunus persica]
          Length = 766

 Score =  215 bits (548), Expect = 3e-53
 Identities = 97/152 (63%), Positives = 127/152 (83%)
 Frame = -3

Query: 457 ERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIME 278
           E CN IL++LE+ +D + L FFEWM+ NGKL+ NV+A+N +LRV+GR+ DWDGAE ++ E
Sbjct: 2   EHCNDILKRLERCSDVKTLRFFEWMRSNGKLERNVSAFNLVLRVMGRREDWDGAEKLVQE 61

Query: 277 MVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGL 98
           +++D  CELNY++FNTLIYAC K G ++LG +WF+MML+++VQPN+ATFGMLM LYQKG 
Sbjct: 62  VIADLGCELNYQVFNTLIYACCKLGRLELGGKWFRMMLEHEVQPNIATFGMLMVLYQKGW 121

Query: 97  VVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
            VEEAE+TF QMRN  I CQSAYS++ITIYTR
Sbjct: 122 NVEEAEFTFFQMRNFGILCQSAYSSMITIYTR 153


>ref|XP_002528404.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223532192|gb|EEF33997.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 955

 Score =  214 bits (545), Expect = 6e-53
 Identities = 136/332 (40%), Positives = 199/332 (59%), Gaps = 6/332 (1%)
 Frame = -3

Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTL---FSGYVSTHGALIVKPFCKLKHIR 809
           MASL+L++S D  +++S K     N L+  ++T     S    + GA I+        ++
Sbjct: 37  MASLRLTISLD--TFDSKKPNFSRNPLQLSTHTSPFSISSSTPSPGACIITTLTTFSPVK 94

Query: 808 VSRLDNESLDTCESNLDGFSIDNLEKYVAGD--DNLIIEGQDFHGDSGKRKVNIWKKFRG 635
           VSR++ E  +           D++    + D     I EG      + KR++   KK+RG
Sbjct: 95  VSRIETELFE-----------DDVVLSTSNDLPHECINEGLIDRNPNSKREIR--KKYRG 141

Query: 634 AKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSE 455
               +K   R + F  N   YK++      +++   +  ++G  + D++++ +   LS E
Sbjct: 142 G--AKKRGKRKVGFKFN---YKRNG-----IEQEIEDLFVEGGEL-DVNYSVIHCNLSLE 190

Query: 454 RCNFILEQLEK-SNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIME 278
            CN IL++LE+ S+D ++L FFEWM+ NGKL+ N+ AYN ILRVLGR+ DW  AE MI E
Sbjct: 191 HCNLILKRLERCSSDDKSLRFFEWMRNNGKLEKNLNAYNVILRVLGRREDWGTAERMIGE 250

Query: 277 MVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGL 98
           +      EL++R+FNTLIYAC + G + LG +WF+MML+  VQPN+ATFGMLM LYQKG 
Sbjct: 251 VSDSFGSELDFRVFNTLIYACSRRGNMLLGGKWFRMMLELGVQPNIATFGMLMGLYQKGW 310

Query: 97  VVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
            VEEAE+ FS+MR+  I CQSAYSA+ITIYTR
Sbjct: 311 NVEEAEFVFSKMRSFGIICQSAYSAMITIYTR 342


>gb|EXB42922.1| Pentatricopeptide repeat-containing protein [Morus notabilis]
          Length = 889

 Score =  212 bits (539), Expect = 3e-52
 Identities = 109/215 (50%), Positives = 151/215 (70%)
 Frame = -3

Query: 646 KFRGAKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPE 467
           KFRG+K   K  + +    + G K  + E  +  L  N      DG   +D++++ +  +
Sbjct: 74  KFRGSKKEAKRFLGS----KVGMKKNRWERELENLFVN------DGE--IDVNYSVIRSD 121

Query: 466 LSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVM 287
           LS E+CN +L++LE  +DS+ L FFEWM+ +GKL+ N++AYN + RVL RK DW  AE M
Sbjct: 122 LSLEQCNSVLKRLESCSDSKTLRFFEWMRSHGKLEGNISAYNLVFRVLSRKEDWGTAEKM 181

Query: 286 IMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQ 107
           I E+ ++  CE+ Y++FNTLIYAC K G V+LG++WF+MML++ V+PNVATFGMLM LYQ
Sbjct: 182 IWELKNELGCEMGYQVFNTLIYACSKLGRVELGAKWFRMMLEHGVRPNVATFGMLMGLYQ 241

Query: 106 KGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           K   VEEAE+TF++MR+L   CQSAYSALITIYTR
Sbjct: 242 KSWNVEEAEFTFTRMRDLGTVCQSAYSALITIYTR 276


>ref|XP_006474045.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825,
           chloroplastic-like [Citrus sinensis]
          Length = 915

 Score =  211 bits (537), Expect = 5e-52
 Identities = 138/332 (41%), Positives = 199/332 (59%), Gaps = 6/332 (1%)
 Frame = -3

Query: 979 MASLKL-SVSPDNNSYESNKLISGFNSLKFVSN-TLFSGYVSTHGALIVKPFCKLKHIRV 806
           MASLKL S+S D  + +S KL    N  +   +  +FS  +S     IV    ++KH++ 
Sbjct: 1   MASLKLLSISLD--TVDSRKLNFAANPPQLSDHFPIFSFTMSC----IVTASNRVKHVK- 53

Query: 805 SRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKT 626
             + +   D C  N    +   +E  V  +   +  G+     +  RKV      +G   
Sbjct: 54  -NVSSSETDLCSMNESKETDIGIENDVGSE---VFVGEC---SNVSRKVK-----KGRYG 101

Query: 625 VRKNTIRNLD----FHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSS 458
           V+K + R++D    F R+  + ++        +   AN   DG   +D++++ +G +LS 
Sbjct: 102 VKKGSKRDVDMSLRFRRSAREQER--------EYFFAN---DGE--LDVNYSVIGADLSL 148

Query: 457 ERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIME 278
           + CN IL++LEK +DS++L FFEWM+ NGKL+ NVTAYN +LRV  R+ DWD AE MI E
Sbjct: 149 DECNAILKRLEKYSDSKSLKFFEWMRTNGKLEKNVTAYNLVLRVFSRREDWDAAEKMIRE 208

Query: 277 MVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGL 98
           +      +LN+++FNTLIYAC K G V+LG++WF MML+  VQPNVATFGMLM LY+K  
Sbjct: 209 VRMSLGAKLNFQLFNTLIYACNKRGCVELGAKWFHMMLECDVQPNVATFGMLMGLYKKSW 268

Query: 97  VVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
            VEEAE+ F+QMR L + C+SAYSA+ITIYTR
Sbjct: 269 NVEEAEFAFNQMRKLGLVCESAYSAMITIYTR 300


>ref|XP_004146719.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825,
           chloroplastic-like [Cucumis sativus]
          Length = 894

 Score =  211 bits (536), Expect = 6e-52
 Identities = 129/330 (39%), Positives = 193/330 (58%), Gaps = 4/330 (1%)
 Frame = -3

Query: 979 MASLKLSVSPDNNSYESNKLISGFNS---LKFVSNTLFSGYVSTHGALIVKPFCKL-KHI 812
           MASLKLS S   +S++SNK     NS     + S    + ++  + + I+    ++ K  
Sbjct: 1   MASLKLSFSL--HSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPS 58

Query: 811 RVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGA 632
           +VS+++ ++ D  +S  D        + VA                  RK    K F   
Sbjct: 59  KVSQVEQDASDVSQSRFD--------EIVA------------------RK----KYFTSK 88

Query: 631 KTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSER 452
           K  ++    +  F RN N                 + IL     +D++++ +  +LS E 
Sbjct: 89  KPSKRAAGSHFSFSRNCN-----------------DNILFNGGELDVNYSTISSDLSLED 131

Query: 451 CNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMV 272
           CN IL++LEK NDS+ L FFEWM+ NGKLK+NV+AYN +LRVLGR+ DWD AE +I E+ 
Sbjct: 132 CNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDWDAAEKLIEEVR 191

Query: 271 SDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVV 92
           ++   +L++++FNTLIYACYKS  V+ G++WF+MML+ +VQPNVATFGMLM LYQK   +
Sbjct: 192 AELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGMLMGLYQKKCDI 251

Query: 91  EEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           +E+E+ F+QMRN  I C++AY+++ITIY R
Sbjct: 252 KESEFAFNQMRNFGIVCETAYASMITIYIR 281


>ref|XP_006453565.1| hypothetical protein CICLE_v10007430mg [Citrus clementina]
           gi|557556791|gb|ESR66805.1| hypothetical protein
           CICLE_v10007430mg [Citrus clementina]
          Length = 851

 Score =  207 bits (527), Expect = 7e-51
 Identities = 97/166 (58%), Positives = 129/166 (77%)
 Frame = -3

Query: 499 VDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLG 320
           +D++++ +G +LS + CN IL++LEK +DS++L FFEWM+ NGKL+ NV AYN +LRV  
Sbjct: 71  LDVNYSVIGADLSLDECNAILKRLEKYSDSKSLKFFEWMRTNGKLEKNVIAYNLVLRVFS 130

Query: 319 RKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNV 140
           R+ DWD AE MI E+      +LN+++FNTLIYAC K G V+LG++WF MML+  VQPNV
Sbjct: 131 RREDWDAAEKMIREVRMSLGTKLNFQLFNTLIYACNKRGCVELGAKWFHMMLECDVQPNV 190

Query: 139 ATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           ATFGMLM LY+K   VEEAE+ F+QMR L + C+SAYSA+ITIYTR
Sbjct: 191 ATFGMLMGLYKKSWSVEEAEFAFNQMRKLGLVCESAYSAMITIYTR 236


>ref|XP_006285536.1| hypothetical protein CARUB_v10006977mg [Capsella rubella]
           gi|482554241|gb|EOA18434.1| hypothetical protein
           CARUB_v10006977mg [Capsella rubella]
          Length = 907

 Score =  206 bits (523), Expect = 2e-50
 Identities = 125/327 (38%), Positives = 181/327 (55%), Gaps = 1/327 (0%)
 Frame = -3

Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSN-TLFSGYVSTHGALIVKPFCKLKHIRVS 803
           M SL+ S+  D   ++S +     N  +F     +FS   S   A  +    + + IRVS
Sbjct: 1   MGSLRFSIPLD--PFDSKRFHFSANPFQFPDQFPIFSVTSSYVPATRIGSLVRAEKIRVS 58

Query: 802 RLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKTV 623
           RLD E+ +T E+ +D  S   +E+                  S K K          +  
Sbjct: 59  RLDVEAEET-ENAIDSASAAKVER----------------SSSSKLKSGKTVSSGNKRGT 101

Query: 622 RKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSERCNF 443
           +K+ ++   F R     +  E             +L  +  +D++++ + P LS E CN 
Sbjct: 102 KKDVVKKFSFRRESINLELEE-------------LLVNNGEMDVNYSAIKPTLSLEHCNG 148

Query: 442 ILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMVSDS 263
           IL++LE  +DS A+ FF+WM  NGKL+ N +AY+ ILRVLGR+ DWD AE +I E+    
Sbjct: 149 ILKRLESCSDSNAVKFFDWMSCNGKLQGNFSAYSLILRVLGRRQDWDRAEDLIKELCGFQ 208

Query: 262 DCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVVEEA 83
             + ++++FNT+IYAC K G V LGS+WF++ML+  V+PNVAT GMLM LYQK   V+EA
Sbjct: 209 GFQQSFQVFNTVIYACAKKGNVKLGSKWFQLMLELGVRPNVATIGMLMGLYQKNWNVDEA 268

Query: 82  EYTFSQMRNLKITCQSAYSALITIYTR 2
           E+ FSQMR   I C+SAYSA+ITIYTR
Sbjct: 269 EFAFSQMRKFGIVCESAYSAMITIYTR 295


>ref|XP_002869359.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297315195|gb|EFH45618.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 906

 Score =  192 bits (489), Expect = 2e-46
 Identities = 120/326 (36%), Positives = 182/326 (55%)
 Frame = -3

Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVSTHGALIVKPFCKLKHIRVSR 800
           M SL+LS+  D   ++S +     N  +F          ++  A  +    ++K IRVSR
Sbjct: 1   MGSLRLSIPLD--PFDSKRFHFSANPFQFPDQVPIFSVSTSVPATRIGSLIRVKKIRVSR 58

Query: 799 LDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKTVR 620
           LD E+ +  E+ +D  S+ N+E+      N  ++G +      +R              +
Sbjct: 59  LDIEAKEA-ENAIDSDSV-NVER----SSNSKLKGSNTVTSGNQRGT------------K 100

Query: 619 KNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSERCNFI 440
           K+  R   F R  N  +  EN  V             +  +D++++ + P LS E  N I
Sbjct: 101 KDVARKFSFRRESNDLEL-ENLFV------------NNGEMDVNYSAIKPGLSLEHYNAI 147

Query: 439 LEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMVSDSD 260
           L++LE  +D+ A+ FF+WM+  GKL+ N  AY+ ILRVLGR+ +W+ AE +I E+     
Sbjct: 148 LKRLESCSDTNAIKFFDWMRCKGKLEGNFGAYSLILRVLGRREEWNRAEDLIEELCGFQG 207

Query: 259 CELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVVEEAE 80
            + ++++FNT+IYAC K G V L S+WF+MML+  V+PNVAT GMLM LYQK   V+EAE
Sbjct: 208 FQQSFQVFNTVIYACTKKGNVKLASKWFQMMLELGVRPNVATIGMLMGLYQKNWNVDEAE 267

Query: 79  YTFSQMRNLKITCQSAYSALITIYTR 2
           + FS MR  +I C+SAYS++ITIYTR
Sbjct: 268 FAFSHMRKFEIVCESAYSSMITIYTR 293


>gb|ESW24614.1| hypothetical protein PHAVU_004G145400g [Phaseolus vulgaris]
          Length = 852

 Score =  191 bits (486), Expect = 4e-46
 Identities = 96/167 (57%), Positives = 126/167 (75%), Gaps = 2/167 (1%)
 Frame = -3

Query: 496 DLDFNDVGPELSSERCNFILEQLEKS--NDSRALTFFEWMKVNGKLKNNVTAYNSILRVL 323
           D++F+    ELS+ +CN IL++LE+S  +D+  L+FFE M+  GKL+ N  AYN ILRV+
Sbjct: 75  DVEFSS---ELSTAQCNAILKRLEESAEDDAETLSFFEKMREGGKLERNAGAYNVILRVV 131

Query: 322 GRKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPN 143
            R+GDW+GAE +I EM +    EL++ +FNTLIYAC K  LV LG++WF+MMLDY V PN
Sbjct: 132 SRRGDWEGAEKLISEMKASFGSELSFNVFNTLIYACCKRNLVKLGTKWFRMMLDYGVAPN 191

Query: 142 VATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           VAT GMLM LY+KG  +EEAE+ FSQMR   I C+SAYS++ITIYTR
Sbjct: 192 VATVGMLMGLYRKGWNLEEAEFAFSQMRGFGIVCESAYSSMITIYTR 238


>ref|NP_567856.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|223635625|sp|O65567.2|PP342_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g30825, chloroplastic; Flags: Precursor
           gi|332660415|gb|AEE85815.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 904

 Score =  188 bits (478), Expect = 3e-45
 Identities = 89/166 (53%), Positives = 121/166 (72%)
 Frame = -3

Query: 499 VDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLG 320
           +D++++ + P  S E CN IL++LE  +D+ A+ FF+WM+ NGKL  N  AY+ ILRVLG
Sbjct: 126 IDVNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKLVGNFVAYSLILRVLG 185

Query: 319 RKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNV 140
           R+ +WD AE +I E+    + + +Y++FNT+IYAC K G V L S+WF MML++ V+PNV
Sbjct: 186 RREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLASKWFHMMLEFGVRPNV 245

Query: 139 ATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           AT GMLM LYQK   VEEAE+ FS MR   I C+SAYS++ITIYTR
Sbjct: 246 ATIGMLMGLYQKNWNVEEAEFAFSHMRKFGIVCESAYSSMITIYTR 291


>emb|CAA18211.1| puative protein [Arabidopsis thaliana] gi|7269983|emb|CAB79800.1|
           puative protein [Arabidopsis thaliana]
          Length = 1075

 Score =  188 bits (478), Expect = 3e-45
 Identities = 89/166 (53%), Positives = 121/166 (72%)
 Frame = -3

Query: 499 VDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLG 320
           +D++++ + P  S E CN IL++LE  +D+ A+ FF+WM+ NGKL  N  AY+ ILRVLG
Sbjct: 297 IDVNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKLVGNFVAYSLILRVLG 356

Query: 319 RKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNV 140
           R+ +WD AE +I E+    + + +Y++FNT+IYAC K G V L S+WF MML++ V+PNV
Sbjct: 357 RREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLASKWFHMMLEFGVRPNV 416

Query: 139 ATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           AT GMLM LYQK   VEEAE+ FS MR   I C+SAYS++ITIYTR
Sbjct: 417 ATIGMLMGLYQKNWNVEEAEFAFSHMRKFGIVCESAYSSMITIYTR 462


>ref|XP_006412665.1| hypothetical protein EUTSA_v10024344mg [Eutrema salsugineum]
           gi|557113835|gb|ESQ54118.1| hypothetical protein
           EUTSA_v10024344mg [Eutrema salsugineum]
          Length = 916

 Score =  186 bits (473), Expect = 1e-44
 Identities = 116/330 (35%), Positives = 179/330 (54%), Gaps = 4/330 (1%)
 Frame = -3

Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSN----TLFSGYVSTHGALIVKPFCKLKHI 812
           M SL+LS   D   ++S +     N  +F       ++ S   +T    I  P   +   
Sbjct: 1   MVSLRLSTPLD--PFDSKRFHFSANPFQFTDQFPIFSVTSSISATRTFTIGSPI-SVNKT 57

Query: 811 RVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGA 632
           RV+RLD E+ +  E+ +D  S          +D+ + E       S K K          
Sbjct: 58  RVARLDTEA-NEAENAIDRSS----------EDDSVSEASVGRSWSSKLKGGNNVTSSNK 106

Query: 631 KTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSER 452
           + ++K+  R   F R  N+ +                +   +  +D++++ + P+LS E 
Sbjct: 107 RGIKKDVTRKSSFRRESNELE-------------LEGLFVNNGEMDVNYSAMKPDLSLEH 153

Query: 451 CNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMV 272
            N IL++LE  +D+ A+ FF+WM+  GKL+ N+ AY+ ILRVL R+ +WD AE +I E+ 
Sbjct: 154 YNGILKRLECCSDTNAVKFFDWMRCKGKLEGNIVAYSLILRVLARREEWDRAEDLIKELC 213

Query: 271 SDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVV 92
                + ++++FNT+IYAC K G V LGS+WF++ML+  V+PNVAT GMLM LYQK   V
Sbjct: 214 GFQGFQQSFQVFNTVIYACSKKGNVKLGSKWFQLMLELGVRPNVATIGMLMGLYQKNRNV 273

Query: 91  EEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           +EAE+ F+ MR   I C+SAYSA+IT+YTR
Sbjct: 274 DEAEFAFTHMRRFGIVCESAYSAMITLYTR 303


>ref|XP_004975413.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825,
           chloroplastic-like isoform X1 [Setaria italica]
          Length = 957

 Score =  185 bits (470), Expect = 3e-44
 Identities = 100/223 (44%), Positives = 138/223 (61%)
 Frame = -3

Query: 670 KRKVNIWKKFRGAKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDL 491
           K++  +W++  G K +R++        ++G    +H    V   K+  N +L        
Sbjct: 132 KKEGKLWRRLGGGKKLRRHRAP-----KHGPGKDRHVRRSVV--KDDVNVVL-------- 176

Query: 490 DFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKG 311
             + +  E S E CN  L  LEK +D +AL FF+WMK NGKLK N  AY+  L+ +  K 
Sbjct: 177 --SCISQESSIEECNSALIHLEKHSDEKALNFFDWMKANGKLKGNAYAYHLALQAIAWKE 234

Query: 310 DWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATF 131
           +W  AE+++ EMV+DSDC L+ R FN LIY C K  L D G+RWF+MMLD +VQPNV+T 
Sbjct: 235 NWKMAELLLHEMVADSDCTLDARAFNGLIYVCAKRRLDDWGTRWFRMMLDSEVQPNVSTI 294

Query: 130 GMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           GMLM LYQK   + EAE+TF++MRN  I C +AYSA+IT+YTR
Sbjct: 295 GMLMGLYQKTGNLSEAEFTFAKMRNYNIKCVNAYSAMITLYTR 337


>ref|XP_006857035.1| hypothetical protein AMTR_s00065p00020910 [Amborella trichopoda]
           gi|548861118|gb|ERN18502.1| hypothetical protein
           AMTR_s00065p00020910 [Amborella trichopoda]
          Length = 903

 Score =  181 bits (460), Expect = 4e-43
 Identities = 102/226 (45%), Positives = 138/226 (61%)
 Frame = -3

Query: 679 DSGKRKVNIWKKFRGAKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSV 500
           +SG++   +WK+ RG K       R ++   +  +  K E     L +   + +    S 
Sbjct: 71  NSGRK---LWKRLRGFK-------RPIESEVSARRLAKTEQ-CPSLDRKDGDSLSSTESE 119

Query: 499 VDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLG 320
           ++   + + P  S E CN  L+ LEKSND++AL  FEWMK NGKL  N TAYN  LRVL 
Sbjct: 120 LEAKLSTLEPLSSIENCNNYLKLLEKSNDAKALQLFEWMKSNGKLDRNPTAYNLALRVLS 179

Query: 319 RKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNV 140
           RK DW  +E ++ EM + S+C  + ++FNTLIY C K  LV  G++WF+MML   V+PN 
Sbjct: 180 RKEDWKASEELLREMPTVSNCSPSSQMFNTLIYVCSKRELVGWGTKWFRMMLYCGVKPNQ 239

Query: 139 ATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2
           AT GMLMSLYQKG  +EEAE+T  QMR   + C  AYSA++TIYTR
Sbjct: 240 ATIGMLMSLYQKGGNLEEAEFTLGQMRTHGLHCCVAYSAMMTIYTR 285


Top