BLASTX nr result

ID: Rheum21_contig00010940 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00010940
         (2324 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003631269.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   588   e-165
emb|CBI28530.3| unnamed protein product [Vitis vinifera]              587   e-165
ref|XP_006494986.1| PREDICTED: pentatricopeptide repeat-containi...   575   e-161
ref|XP_006440653.1| hypothetical protein CICLE_v10023621mg [Citr...   573   e-160
gb|EOY21933.1| Pentatricopeptide repeat superfamily protein [The...   554   e-155
ref|XP_004508741.1| PREDICTED: pentatricopeptide repeat-containi...   553   e-154
ref|XP_004138304.1| PREDICTED: pentatricopeptide repeat-containi...   552   e-154
gb|EMJ11418.1| hypothetical protein PRUPE_ppa015814mg [Prunus pe...   551   e-154
gb|EXB38552.1| hypothetical protein L484_008580 [Morus notabilis]     548   e-153
ref|XP_002514722.1| pentatricopeptide repeat-containing protein,...   546   e-152
ref|XP_003622167.1| Pentatricopeptide repeat protein [Medicago t...   542   e-151
ref|XP_006280363.1| hypothetical protein CARUB_v10026291mg [Caps...   536   e-149
ref|XP_002866430.1| hypothetical protein ARALYDRAFT_496296 [Arab...   536   e-149
gb|ESW27283.1| hypothetical protein PHAVU_003G188300g [Phaseolus...   530   e-147
gb|AAM65325.1| unknown [Arabidopsis thaliana]                         528   e-147
ref|NP_200945.1| pentatricopeptide repeat-containing protein [Ar...   528   e-147
ref|XP_002318601.2| hypothetical protein POPTR_0012s07030g, part...   522   e-145
ref|XP_006394515.1| hypothetical protein EUTSA_v10004085mg [Eutr...   515   e-143
ref|XP_006849319.1| hypothetical protein AMTR_s00164p00020970 [A...   383   e-103
gb|EEC66969.1| hypothetical protein OsI_33629 [Oryza sativa Indi...   359   3e-96

>ref|XP_003631269.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At5g61370, mitochondrial-like [Vitis vinifera]
          Length = 505

 Score =  588 bits (1515), Expect = e-165
 Identities = 282/411 (68%), Positives = 343/411 (83%)
 Frame = +2

Query: 782  VQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKR 961
            +QE+CN+VS  VGSLDDLE+ LD+  A  TSS++ Q+++ CKN +P+RRLLRFF WS K+
Sbjct: 40   LQELCNVVSNGVGSLDDLEASLDRLDASFTSSLISQILDTCKNEAPTRRLLRFFLWSSKK 99

Query: 962  LNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDD 1141
             N  L D +FN+AIQ FAE KD +A++IL+SD+S EGR + AQTF  V + LV LGREDD
Sbjct: 100  FNCKLEDDDFNYAIQVFAEKKDLKAIDILVSDLSNEGREMKAQTFGIVAETLVSLGREDD 159

Query: 1142 ALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLH 1321
            ALG+FKNLDKF C +D  TVTAI+ ALC+KGHARRAEGV+RHH DKI GV+ C+Y+SL +
Sbjct: 160  ALGLFKNLDKFKCSYDSVTVTAIVNALCSKGHARRAEGVVRHHKDKILGVKPCIYRSLFY 219

Query: 1322 GWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEM 1501
            GWS ++NVKEARRIL+EMKS  +M DL+C+NTFL+CLCE NLK NPSGLVPEALNVMMEM
Sbjct: 220  GWSEQKNVKEARRILKEMKSVGIMPDLFCYNTFLRCLCERNLKSNPSGLVPEALNVMMEM 279

Query: 1502 RSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRF 1681
            RS +I P SISYNILLSCLGRTRRVKE+   ++LM++  C+PDWVSYYLVARVLYL+GRF
Sbjct: 280  RSNRITPTSISYNILLSCLGRTRRVKESCRILDLMKRLGCSPDWVSYYLVARVLYLTGRF 339

Query: 1682 GKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLL 1861
            GKGNQIV+EMIE G+ P  K Y++L+GVLCGVERVNYALE+FERMKRSS+G YGPVYD+L
Sbjct: 340  GKGNQIVDEMIEEGLVPDRKFYYDLIGVLCGVERVNYALEMFERMKRSSLGGYGPVYDVL 399

Query: 1862 IPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEE 2014
            IPKLCR GDF KG+ELWDEA R+GV L CS   LDPSIT+V+   R+D+E+
Sbjct: 400  IPKLCRSGDFGKGRELWDEATRVGVLLHCSSEVLDPSITKVFKPARKDEEK 450


>emb|CBI28530.3| unnamed protein product [Vitis vinifera]
          Length = 452

 Score =  587 bits (1514), Expect = e-165
 Identities = 282/410 (68%), Positives = 342/410 (83%)
 Frame = +2

Query: 782  VQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKR 961
            +QE+CN+VS  VGSLDDLE+ LD+  A  TSS++ Q+++ CKN +P+RRLLRFF WS K+
Sbjct: 9    LQELCNVVSNGVGSLDDLEASLDRLDASFTSSLISQILDTCKNEAPTRRLLRFFLWSSKK 68

Query: 962  LNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDD 1141
             N  L D +FN+AIQ FAE KD +A++IL+SD+S EGR + AQTF  V + LV LGREDD
Sbjct: 69   FNCKLEDDDFNYAIQVFAEKKDLKAIDILVSDLSNEGREMKAQTFGIVAETLVSLGREDD 128

Query: 1142 ALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLH 1321
            ALG+FKNLDKF C +D  TVTAI+ ALC+KGHARRAEGV+RHH DKI GV+ C+Y+SL +
Sbjct: 129  ALGLFKNLDKFKCSYDSVTVTAIVNALCSKGHARRAEGVVRHHKDKILGVKPCIYRSLFY 188

Query: 1322 GWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEM 1501
            GWS ++NVKEARRIL+EMKS  +M DL+C+NTFL+CLCE NLK NPSGLVPEALNVMMEM
Sbjct: 189  GWSEQKNVKEARRILKEMKSVGIMPDLFCYNTFLRCLCERNLKSNPSGLVPEALNVMMEM 248

Query: 1502 RSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRF 1681
            RS +I P SISYNILLSCLGRTRRVKE+   ++LM++  C+PDWVSYYLVARVLYL+GRF
Sbjct: 249  RSNRITPTSISYNILLSCLGRTRRVKESCRILDLMKRLGCSPDWVSYYLVARVLYLTGRF 308

Query: 1682 GKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLL 1861
            GKGNQIV+EMIE G+ P  K Y++L+GVLCGVERVNYALE+FERMKRSS+G YGPVYD+L
Sbjct: 309  GKGNQIVDEMIEEGLVPDRKFYYDLIGVLCGVERVNYALEMFERMKRSSLGGYGPVYDVL 368

Query: 1862 IPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDE 2011
            IPKLCR GDF KG+ELWDEA R+GV L CS   LDPSIT+V+   R+D+E
Sbjct: 369  IPKLCRSGDFGKGRELWDEATRVGVLLHCSSEVLDPSITKVFKPARKDEE 418


>ref|XP_006494986.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61370,
            mitochondrial-like [Citrus sinensis]
          Length = 495

 Score =  575 bits (1481), Expect = e-161
 Identities = 273/415 (65%), Positives = 343/415 (82%)
 Frame = +2

Query: 779  QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958
            +++E+C +VS+ +G LDDLE  L++    LTSS+V QV++ CK  +P+RRLLRFF WSCK
Sbjct: 46   ELKELCKVVSSTIGGLDDLELSLNQFTGSLTSSLVTQVIDSCKQEAPTRRLLRFFLWSCK 105

Query: 959  RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138
             ++  L DK++N AI+ FAE +D  A+ IL+SD+ KEGRV+++Q+F  +V+ LVKLGRED
Sbjct: 106  NMSASLEDKDYNHAIRVFAEKRDHTAMNILVSDLRKEGRVMESQSFGVLVETLVKLGRED 165

Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318
            +ALGIFKNL+KF C  D  TV+AI++ALC KGHARRAEGV+ HH DKISGVE C+Y+SL+
Sbjct: 166  EALGIFKNLEKFKCVQDSVTVSAIVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLI 225

Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498
            +GWS++ENVK AR+I++EMKS  +M DL+C+NTFL+ LCE NLK+NPSGLVPEALNVMME
Sbjct: 226  YGWSMQENVKAARKIIKEMKSAGIMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMME 285

Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678
            MRSY+I P SISYNILLSCLGRTRRVKE+   +E M+K+ C PDWVSYYLVARVLYLSGR
Sbjct: 286  MRSYRIAPTSISYNILLSCLGRTRRVKESCQVLEQMKKSGCAPDWVSYYLVARVLYLSGR 345

Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858
            FGKGN+IV+EMIE G+ P  K Y++L+G+LCGVERVN+ALELFERMKRSS+G YGPVYD+
Sbjct: 346  FGKGNKIVDEMIEEGLIPDRKFYYDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDV 405

Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEELRG 2023
            LIPK+CRGGDFVKG+ELWDEA  MG+ L CS   LDPSI EV+  +R+  E   G
Sbjct: 406  LIPKVCRGGDFVKGRELWDEAMVMGLTLSCSSNVLDPSIIEVFQPRRKPTESCLG 460


>ref|XP_006440653.1| hypothetical protein CICLE_v10023621mg [Citrus clementina]
            gi|557542915|gb|ESR53893.1| hypothetical protein
            CICLE_v10023621mg [Citrus clementina]
          Length = 488

 Score =  573 bits (1477), Expect = e-160
 Identities = 274/415 (66%), Positives = 343/415 (82%)
 Frame = +2

Query: 779  QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958
            +++E+C +VS+ +G LDDLE  L++    L+SS+V QV++ CK+ +P+RRLLRFF WSCK
Sbjct: 41   ELKELCKVVSSTIGGLDDLELSLNQFTGSLSSSLVTQVIDSCKHEAPTRRLLRFFLWSCK 100

Query: 959  RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138
             L+  L DK++N AI+ FAE KD  A+ IL+SD+ KEGRV++ Q+F  +V+ LVKLGRED
Sbjct: 101  NLSASLEDKDYNHAIRVFAEKKDHMAMNILVSDLRKEGRVMETQSFGVLVETLVKLGRED 160

Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318
            +ALGIFKNL+KF C  D  TV+AI++ALC KGHARRAEGV+ HH DKISGVE C+Y+SL+
Sbjct: 161  EALGIFKNLEKFKCVQDSVTVSAIVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLI 220

Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498
            +GWS++ENVK AR+I++EMKS   M DL+C+NTFL+ LCE NLK+NPSGLVPEALNVMME
Sbjct: 221  YGWSMQENVKAARKIIKEMKSAGFMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMME 280

Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678
            MRSY+I P SISYNILLSCLGRTRRVKE+   +E M+K+ C PDWVSYYLVARVLYLSGR
Sbjct: 281  MRSYRIAPTSISYNILLSCLGRTRRVKESCRVLEQMKKSGCAPDWVSYYLVARVLYLSGR 340

Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858
            FGKGN+IV+EMIE G+ P  K Y++L+G+LCGVERVN+ALELFERMKRSS+G YGPVYD+
Sbjct: 341  FGKGNKIVDEMIEEGLIPDRKFYYDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDV 400

Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEELRG 2023
            LIPK+C+GGDFVKG+ELWDEA  MG+ L CS   LDPSITEV+  +R+  E   G
Sbjct: 401  LIPKVCQGGDFVKGRELWDEAMVMGLTLSCSSNVLDPSITEVFHPRRKPTEGCLG 455


>gb|EOY21933.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao]
          Length = 487

 Score =  554 bits (1427), Expect = e-155
 Identities = 263/411 (63%), Positives = 330/411 (80%)
 Frame = +2

Query: 779  QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958
            + +E+C +VS+ +G LDDLES L++    L+  +V QV+  C+N +P+RRLLRFF WS K
Sbjct: 39   EFEELCKVVSSSMGGLDDLESSLNRFKLSLSPLLVTQVINSCENEAPTRRLLRFFLWSVK 98

Query: 959  RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138
             L+  L DK+ N  ++ FA+ KD  A+ IL+SDI   GR +++QTF  V ++LVKLGRED
Sbjct: 99   NLSSSLEDKDLNNVVRVFAKKKDHTAMGILVSDIRNRGRTMESQTFSVVAEMLVKLGRED 158

Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318
            +ALGIFKNL+KF CP D  ++TAI+ ALC KGHAR+AEGV+ HH D I+GVE C+Y+ LL
Sbjct: 159  EALGIFKNLEKFKCPRDSFSLTAIVNALCAKGHARKAEGVVYHHKDTIAGVEPCIYRCLL 218

Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498
            +GWS++ENVKEARR+++EMKS    LDLYC+NTFL+CLC  N K+NPSGLVPEALNVMME
Sbjct: 219  YGWSVQENVKEARRVIKEMKSAGFELDLYCYNTFLRCLCGKNAKRNPSGLVPEALNVMME 278

Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678
            MRS +I P S+SYNILLSCLGRTRRVKE+   +ELM+K  C PDW+SYYLVARVLYL+GR
Sbjct: 279  MRSQRIAPTSVSYNILLSCLGRTRRVKESCQILELMKKAGCAPDWISYYLVARVLYLTGR 338

Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858
            FGKGN+IV+EMIE G+ P  K Y++L+GVLCGVERVN+ALELFERMKRSS+G YGPVYD+
Sbjct: 339  FGKGNKIVDEMIEQGLTPDRKFYYDLIGVLCGVERVNFALELFERMKRSSLGGYGPVYDV 398

Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDE 2011
            LIPKLCRGGDF KG+ELWDEA   GV+L CS   LDPSITEV+   R+ ++
Sbjct: 399  LIPKLCRGGDFEKGRELWDEAVATGVSLSCSSDVLDPSITEVFKPTRKAEK 449


>ref|XP_004508741.1| PREDICTED: pentatricopeptide repeat-containing protein At3g22690-like
            [Cicer arietinum]
          Length = 1253

 Score =  553 bits (1424), Expect = e-154
 Identities = 277/456 (60%), Positives = 349/456 (76%)
 Frame = +2

Query: 638  RSA*VRN*MISQNSLARLSALLKPTNAWKFKLLYFCSCTXXXXXXXXQVQEICNLVSTPV 817
            RSA  +  M+  ++L R     K T+ ++   + F S T        Q+QE+CN+V++ V
Sbjct: 756  RSARQKMHMLLNSALKRFGLQNKSTHKFQLLSVSFYS-TLHSISAPPQLQELCNIVTSTV 814

Query: 818  GSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNF 997
            G LDDLE  L+K    + SS+V Q ++  K+ + +RRLLRFF WS K L+  L D ++N+
Sbjct: 815  GGLDDLELSLNKFKGSINSSLVAQAIDSIKHEAHTRRLLRFFLWSNKHLSRDLEDNDYNY 874

Query: 998  AIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFG 1177
            A++ FAE KD  A++ILL D+ KEGRV+DAQTF  V +  VKLG+ED+ALGIFKNLDK+ 
Sbjct: 875  ALRVFAEKKDYTAMDILLGDLKKEGRVMDAQTFGLVAETFVKLGKEDEALGIFKNLDKYK 934

Query: 1178 CPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEAR 1357
            C  D+ TVTAII ALC+KGHA+RAEGV+ HH DK+ GV  C+Y+SLL+GWS++ NVKEAR
Sbjct: 935  CFIDEFTVTAIINALCSKGHAKRAEGVVWHHKDKVKGVLPCIYRSLLYGWSVQRNVKEAR 994

Query: 1358 RILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISY 1537
            RI+QEMKS  V  DL C+NTFL+CLCE NL+ NPSGLVPEALNVMMEMR YK++P SISY
Sbjct: 995  RIIQEMKSNGVNPDLVCYNTFLRCLCERNLRHNPSGLVPEALNVMMEMRFYKVLPTSISY 1054

Query: 1538 NILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIE 1717
            NILLSCLG+TRRVKE+   +E M K+   PDWVSYYLVARVL+LSGRFGKG +IV++MIE
Sbjct: 1055 NILLSCLGKTRRVKESCQILEAMNKSGVAPDWVSYYLVARVLFLSGRFGKGKEIVDQMIE 1114

Query: 1718 AGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVK 1897
             G+ P  K Y++L+G+LCGVERVN+ALELFE+MK SS+G YGPVYD+LIPKLCRGG F K
Sbjct: 1115 KGLVPNHKFYYSLIGILCGVERVNHALELFEKMKGSSLGGYGPVYDVLIPKLCRGGAFEK 1174

Query: 1898 GKELWDEAERMGVALCCSRVALDPSITEVYVCKRQD 2005
            G+ELWDEA+ MG+ L CSR  LDPSITEVY  KR +
Sbjct: 1175 GRELWDEAKCMGITLQCSRDVLDPSITEVYKPKRPE 1210


>ref|XP_004138304.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61370,
            mitochondrial-like [Cucumis sativus]
            gi|449477571|ref|XP_004155060.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g61370,
            mitochondrial-like [Cucumis sativus]
          Length = 487

 Score =  552 bits (1422), Expect = e-154
 Identities = 263/412 (63%), Positives = 334/412 (81%)
 Frame = +2

Query: 782  VQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKR 961
            V ++C ++S  +G LD+LES L+KC   LTSS+V QV++  KN +P+RRLLRFF WS K+
Sbjct: 50   VSKLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKK 109

Query: 962  LNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDD 1141
            LN  L D++FN AI+ FA+ KD  AV ILLS++ K  R +D QTF  V +  VK+ RED+
Sbjct: 110  LNHTLEDEDFNNAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDE 169

Query: 1142 ALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLH 1321
            ALG+FKNL+K+ CPHD+ TV AIITALC+KGHA+RAEGV+ HH DKIS   SC+Y+SLL+
Sbjct: 170  ALGLFKNLEKYKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLY 229

Query: 1322 GWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEM 1501
            GWS+K+N KEARRIL+EMKS   M DL+C+NTFLKCLCE N++KNPSGLVPE+LNVMMEM
Sbjct: 230  GWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEM 289

Query: 1502 RSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRF 1681
            RSYKI PNSISYNILLSCL +TRRVKE+   +E+M++T C PD VSYYL+ARVL+L+GRF
Sbjct: 290  RSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRF 349

Query: 1682 GKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLL 1861
            GKG +IV+EMIE G+ P  K Y++L+G+LCGVER NYALELFE+MKRSS+G YGPVYD+L
Sbjct: 350  GKGREIVDEMIEEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVL 409

Query: 1862 IPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEEL 2017
            IPKLCRGG+F  G++LW+EA  MGV+L CS   LDPSIT+V+   R+ + ++
Sbjct: 410  IPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKI 461


>gb|EMJ11418.1| hypothetical protein PRUPE_ppa015814mg [Prunus persica]
          Length = 524

 Score =  551 bits (1421), Expect = e-154
 Identities = 264/411 (64%), Positives = 331/411 (80%)
 Frame = +2

Query: 779  QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958
            ++QE+C +VS  +G LDDLE  L+K    LTSS+V QV++ CK+ +P+RRLLRFF+W  K
Sbjct: 7    ELQELCTIVSRAIGGLDDLELSLNKFTGSLTSSLVTQVIDSCKSEAPTRRLLRFFSWCHK 66

Query: 959  RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138
             L+ GL DK++N+ I+ FAE KD  A+ ILLSD+ K GR ++AQTF  V   LVKLGRED
Sbjct: 67   NLDYGLKDKDYNYGIRVFAEKKDHTAMHILLSDLVKTGRAMEAQTFGLVAQALVKLGRED 126

Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318
            +ALG+FKNL  + CP D  TVT+I+ ALC++GHA+RAEGV+ HH DKI+G+E C+YKSLL
Sbjct: 127  EALGLFKNLSTYKCPQDGHTVTSIVNALCSRGHAKRAEGVVWHHRDKIAGIEPCIYKSLL 186

Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498
            +GWS++ENVKE RRI++EMKS  +M DL+C+NTFL+ LC  NLK NPSGLVPEALNVM+E
Sbjct: 187  YGWSVQENVKEERRIIKEMKSAGIMPDLFCYNTFLRSLCMKNLKCNPSGLVPEALNVMIE 246

Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678
            M++Y+I PNSISYNILLSCLGRTRRVKE+ N +E M+KT C+PDWVSYYLVARVLYLSGR
Sbjct: 247  MKTYRIFPNSISYNILLSCLGRTRRVKESCNILETMKKTGCSPDWVSYYLVARVLYLSGR 306

Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858
            FGKGN++V+EM+  G+ P  K Y++L+G+L G ER  YALELFERMK SS+G YGPVYD+
Sbjct: 307  FGKGNKMVDEMLAEGLQPNCKFYYDLIGILVGNERPYYALELFERMKASSLGGYGPVYDV 366

Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDE 2011
            LIPK CRGGDF KG+ELWDEA  MGV L CS   LDPSITEV+   R +++
Sbjct: 367  LIPKFCRGGDFEKGRELWDEAMAMGVTLRCSSDLLDPSITEVFKPTRNEEK 417


>gb|EXB38552.1| hypothetical protein L484_008580 [Morus notabilis]
          Length = 518

 Score =  548 bits (1411), Expect = e-153
 Identities = 270/441 (61%), Positives = 337/441 (76%), Gaps = 1/441 (0%)
 Frame = +2

Query: 698  LLKPTNAWKFKLLYFCSCTXXXXXXXX-QVQEICNLVSTPVGSLDDLESGLDKCGAPLTS 874
            LL+   A KF+ L   SC          ++QE+C +VS  +G LDDLES L      LTS
Sbjct: 13   LLRSFTAQKFRQL---SCLPNSNLSSASRLQELCTIVSRTIGGLDDLESSLSDFRGSLTS 69

Query: 875  SMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLS 1054
            S+V QV++ CK  +P+RRLLRFF WS K L   L DK++N AI+ FA  KD  A+EIL+S
Sbjct: 70   SLVTQVIDSCKTEAPTRRLLRFFLWSHKNLKCDLEDKDYNHAIRVFAGKKDHTALEILVS 129

Query: 1055 DISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKG 1234
            D+ K GR L++QT+  V + LVKLGRED+ALGIFKN DK+ CP +  TVTA++ ALC +G
Sbjct: 130  DLKKGGRALESQTYAIVAETLVKLGREDEALGIFKNSDKYKCPQNSFTVTAVVNALCAQG 189

Query: 1235 HARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFN 1414
            HA+RAEGV+ HH D+ISG+E C+Y+SLL+GWS +ENVKEARRI++EMKS  +  DL+C+N
Sbjct: 190  HAKRAEGVVGHHKDRISGMERCIYRSLLYGWSEQENVKEARRIIKEMKSAGINPDLFCYN 249

Query: 1415 TFLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNT 1594
            TFL+CLCE NLK+NPSGLVPEALNVMMEMRSY I PNSISYNILLSCLGR RRVKE    
Sbjct: 250  TFLRCLCERNLKRNPSGLVPEALNVMMEMRSYMITPNSISYNILLSCLGRARRVKEACQI 309

Query: 1595 IELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCG 1774
            +E M++  C+PDW+SYYLV RVLYL+ RFGKGN++V+EMI  G+ P  K Y++L+GVLCG
Sbjct: 310  LERMKQAGCSPDWMSYYLVIRVLYLTMRFGKGNKLVDEMIGEGLVPNCKFYYDLIGVLCG 369

Query: 1775 VERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSR 1954
            VER  YALELFE MK+ S+G YGPVYD+LIPKLCRGGDF KG+ELW EA  MGV  CCS 
Sbjct: 370  VERPYYALELFEHMKKRSLGGYGPVYDVLIPKLCRGGDFEKGRELWIEAMNMGVDFCCSS 429

Query: 1955 VALDPSITEVYVCKRQDDEEL 2017
              LDPSIT+V+   R+++E++
Sbjct: 430  DVLDPSITKVFKPTRKEEEKI 450


>ref|XP_002514722.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223546326|gb|EEF47828.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 479

 Score =  546 bits (1406), Expect = e-152
 Identities = 270/442 (61%), Positives = 344/442 (77%), Gaps = 3/442 (0%)
 Frame = +2

Query: 713  NAWKFKLLYFCSCTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQV 892
            NA K K       T        ++QEIC  VS+ +G LDDLES L+     LTS +V QV
Sbjct: 2    NANKSKHFVCLYSTISHNRVPLELQEICKAVSSSIGGLDDLESSLNGFRGNLTSQIVTQV 61

Query: 893  VEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEG 1072
            ++ CK+ +P+RRLLRFF WS KRL+  + D++FN AI+  AE KD  A++IL+SD+ KEG
Sbjct: 62   IDCCKHEAPTRRLLRFFLWSYKRLDFSMKDEDFNHAIRVLAEKKDHTAMQILISDLRKEG 121

Query: 1073 RVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAE 1252
            RV++ QTF  V + LVKLGRED+ALGIFKNLDKF CP D  TVTAIITALC +GHA++A 
Sbjct: 122  RVMEPQTFGLVAEALVKLGREDEALGIFKNLDKFKCPQDCETVTAIITALCAEGHAKKAY 181

Query: 1253 GVLRHHGDKISGV-ESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKC 1429
            GV+ HH DK+S V   C+Y+SL++GWS+++NVK AR ++QEMK   +  DL+C+NTFL+C
Sbjct: 182  GVVLHHKDKLSEVIRPCIYRSLIYGWSMQKNVKRAREVIQEMKRNGIKPDLFCYNTFLRC 241

Query: 1430 LCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMR 1609
            LCE N+++NPSGLVPE+LNVMMEMRSY+I PNSISYNILLSCLGR RRV+E+   +ELM+
Sbjct: 242  LCERNVERNPSGLVPESLNVMMEMRSYRIEPNSISYNILLSCLGRVRRVQESCKILELMK 301

Query: 1610 KTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVN 1789
            K+ C PDWVSYYLVA+VLYL+GRFGKGN+IV+EMIE  + P  K Y++L+G+LCGVERVN
Sbjct: 302  KSSCAPDWVSYYLVAKVLYLTGRFGKGNKIVDEMIERRLVPDRKFYYDLIGILCGVERVN 361

Query: 1790 YALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDP 1969
            +AL+LF++MKRSS G YGPVYDLLIPKLC GG+F KGKELWDEA  MGV + CS   LDP
Sbjct: 362  FALKLFDQMKRSSSGGYGPVYDLLIPKLCIGGNFEKGKELWDEAMAMGVTVHCSSEVLDP 421

Query: 1970 SITEVY--VCKRQDDEELRGRD 2029
            SIT+V+    K +++EE+R +D
Sbjct: 422  SITKVFEPTRKVEEEEEVRLQD 443


>ref|XP_003622167.1| Pentatricopeptide repeat protein [Medicago truncatula]
            gi|355497182|gb|AES78385.1| Pentatricopeptide repeat
            protein [Medicago truncatula]
          Length = 563

 Score =  542 bits (1397), Expect = e-151
 Identities = 269/436 (61%), Positives = 337/436 (77%), Gaps = 1/436 (0%)
 Frame = +2

Query: 701  LKPTNAWKFKLLYFCS-CTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSS 877
            L+ T+  KF+LL      T         +Q++C++V++ VG LDDLES L+K    LTS 
Sbjct: 86   LQNTSTHKFQLLSVSLFSTLHPISTPPLLQDLCDIVTSTVGGLDDLESCLNKFKGSLTSP 145

Query: 878  MVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSD 1057
            +V QV++  K+ + +RRLLRFF WS K L+  L DK++N+A++ F E KD  A++ILL D
Sbjct: 146  LVAQVIDSVKHEAHTRRLLRFFLWSNKNLSNDLEDKDYNYALRVFIEKKDYTAMDILLGD 205

Query: 1058 ISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGH 1237
              K+GRV++AQTF  V +  VKLG+ED+ALGIFKNLDK+ C  D+ TVTAII ALC+KGH
Sbjct: 206  FKKQGRVMEAQTFGVVAETYVKLGKEDEALGIFKNLDKYKCLIDEFTVTAIINALCSKGH 265

Query: 1238 ARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNT 1417
            A+RAEGV  HH DKI G   CVY+SLL+GWSL+ NVKE+RRI+QEMK+  V  DL C+NT
Sbjct: 266  AKRAEGVAWHHKDKIKGALPCVYRSLLYGWSLERNVKESRRIIQEMKTNGVTPDLVCYNT 325

Query: 1418 FLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTI 1597
            FL+CLCE NL+ NPSGLV EALNVMMEMRSYK+ P SISYNILLSCLG+TRRVKE+   +
Sbjct: 326  FLRCLCERNLRNNPSGLVLEALNVMMEMRSYKVFPTSISYNILLSCLGKTRRVKESCQIL 385

Query: 1598 ELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGV 1777
            E M K+   PDWVSYYLV+RVL+LSGRFGKG +IV++MIE G+ P  K Y++L+G+LCGV
Sbjct: 386  EAMNKSGVAPDWVSYYLVSRVLFLSGRFGKGKEIVDQMIEKGLVPNHKFYYSLIGILCGV 445

Query: 1778 ERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRV 1957
            ERVN+AL+LFE+MK SSVG YGPVYD+LIPKLCRGGDF KG+ELWDE   MG+ L CS+ 
Sbjct: 446  ERVNHALDLFEKMKGSSVGGYGPVYDVLIPKLCRGGDFEKGRELWDEGTYMGITLQCSKD 505

Query: 1958 ALDPSITEVYVCKRQD 2005
             LDPSITEVY+ KR +
Sbjct: 506  VLDPSITEVYIPKRPE 521


>ref|XP_006280363.1| hypothetical protein CARUB_v10026291mg [Capsella rubella]
            gi|482549067|gb|EOA13261.1| hypothetical protein
            CARUB_v10026291mg [Capsella rubella]
          Length = 490

 Score =  536 bits (1381), Expect = e-149
 Identities = 259/428 (60%), Positives = 323/428 (75%), Gaps = 2/428 (0%)
 Frame = +2

Query: 737  YFCS--CTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKN 910
            YFCS            ++QE   LVS+P+G LDDLE  L++     +S +V QV+E CKN
Sbjct: 23   YFCSHHLVDRLDHSSSELQEFIRLVSSPIGGLDDLEENLNRVSVSPSSKLVTQVIESCKN 82

Query: 911  NSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQ 1090
             +  RRLLRFF+WSCK L   L DKEFN  ++  AE KD  A++ILLSD+ KE R +D Q
Sbjct: 83   ETSPRRLLRFFSWSCKNLGSSLHDKEFNHVLRVLAEKKDNTAIQILLSDLRKENRAMDKQ 142

Query: 1091 TFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHH 1270
            TF  V + LVK+G+EDDA+GIFK LDKF CP D  TVTAII+ALC++GH +RA GV+ HH
Sbjct: 143  TFSIVAETLVKIGKEDDAIGIFKILDKFSCPQDSFTVTAIISALCSRGHVKRALGVMHHH 202

Query: 1271 GDKISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLK 1450
             D ISG E  VY+SLL GWS++ NVKEARR++Q+MKS  +  DL+CFN+ L CLCE N+ 
Sbjct: 203  KDAISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFCFNSLLTCLCERNVN 262

Query: 1451 KNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPD 1630
            +NPSGLVPEALN+M+EM+SYKI P SISYN LLSCLGRTRRVKE+   +E M+++ C+PD
Sbjct: 263  RNPSGLVPEALNIMLEMKSYKIQPTSISYNTLLSCLGRTRRVKESCQILEQMKRSGCDPD 322

Query: 1631 WVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFE 1810
              SYY V RVLYL+GRFGKGNQIV+EMIE  + P  K Y++L+GVLCGVERVN+AL+LFE
Sbjct: 323  TASYYFVVRVLYLTGRFGKGNQIVDEMIERELRPERKFYYDLIGVLCGVERVNFALQLFE 382

Query: 1811 RMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYV 1990
            +MKRSSVG YGPVYDLLIPKLC+GG+F KGKELW+EA  + V LC S   LDPS+TEV+ 
Sbjct: 383  KMKRSSVGGYGPVYDLLIPKLCKGGNFEKGKELWEEAMSLDVTLCSSIDLLDPSVTEVFK 442

Query: 1991 CKRQDDEE 2014
              ++ + E
Sbjct: 443  PMKKKEVE 450


>ref|XP_002866430.1| hypothetical protein ARALYDRAFT_496296 [Arabidopsis lyrata subsp.
            lyrata] gi|297312265|gb|EFH42689.1| hypothetical protein
            ARALYDRAFT_496296 [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  536 bits (1380), Expect = e-149
 Identities = 261/447 (58%), Positives = 335/447 (74%), Gaps = 2/447 (0%)
 Frame = +2

Query: 677  SLARLSALLKPTNAWKFKLLYFCS--CTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLD 850
            S+ R + ++  TN  K    YFCS            ++QE+  +VS+P+G LDDLE  L+
Sbjct: 3    SIVRSNGIVFVTNTIKLTR-YFCSHHLVDRPDRASTELQEVIRIVSSPIGGLDDLEKNLN 61

Query: 851  KCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDG 1030
            +     +S++V QV+E CKN +  RRLLRFF+WSCK L   + DKEFN  ++  AE KD 
Sbjct: 62   QVSVSPSSNLVTQVIESCKNETSPRRLLRFFSWSCKSLGSNVHDKEFNHVLRVLAEKKDH 121

Query: 1031 RAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAI 1210
             A++ILLSD+ +E R +D QTF  V + LVK+G+E+DA+GIFK LDKF CP D  TVTAI
Sbjct: 122  TAIQILLSDLRQENRAMDKQTFSIVAETLVKIGKEEDAIGIFKILDKFLCPQDSFTVTAI 181

Query: 1211 ITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKV 1390
            I+ALC++GH +RA GV+ HH D ISG E  VY+SLL GWS++ NVKEARR++Q+MKS  +
Sbjct: 182  ISALCSRGHVKRALGVMHHHKDAISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGI 241

Query: 1391 MLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTR 1570
              DL+CFN+ L CLCE N+ +NPSGLVPEALN+M+EMRSYKI P SISYNILLSCLGRTR
Sbjct: 242  TPDLFCFNSLLTCLCERNVNRNPSGLVPEALNIMLEMRSYKIQPTSISYNILLSCLGRTR 301

Query: 1571 RVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYH 1750
            RV+E+   +E M+++ C+PD  SYY V RVLYL+GRFGKGNQIV+EMIE G+ P  K Y+
Sbjct: 302  RVRESCQILEQMKRSGCDPDTASYYFVVRVLYLTGRFGKGNQIVDEMIERGLRPEHKFYY 361

Query: 1751 NLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERM 1930
            +L+GVLCGVERVN+AL+LFE+MKRSSV  YGPVYDLLIPKLC+GG+F KGKELW+EA  +
Sbjct: 362  DLIGVLCGVERVNFALQLFEKMKRSSVDGYGPVYDLLIPKLCKGGNFEKGKELWEEAMSL 421

Query: 1931 GVALCCSRVALDPSITEVYVCKRQDDE 2011
             V L CS   LDPS+TEV+   ++ +E
Sbjct: 422  NVTLSCSISLLDPSVTEVFKPMKKKEE 448


>gb|ESW27283.1| hypothetical protein PHAVU_003G188300g [Phaseolus vulgaris]
          Length = 494

 Score =  530 bits (1364), Expect = e-147
 Identities = 253/403 (62%), Positives = 322/403 (79%)
 Frame = +2

Query: 779  QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958
            Q+QE+C++V + VG LDDLE  L+K    LTSS+V Q ++  K+ + +RRLLRFF WS K
Sbjct: 45   QLQELCSVVVSTVGGLDDLEFSLNKFKDSLTSSLVAQAIDSSKHEAHTRRLLRFFLWSSK 104

Query: 959  RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138
             L+  L +K++N A++ FAE  D  A++IL+ D+ KEGRV+DA+TF  V D LVKLG+ED
Sbjct: 105  NLSHSLENKDYNHALRVFAEKNDYTAMDILMEDLKKEGRVMDAETFGLVADTLVKLGKED 164

Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318
             ALG+FKNLDK+ C  D+ TVTAII ALC+KGHA+RAEGV+ HH DKI+G + C+Y+SLL
Sbjct: 165  QALGVFKNLDKYKCSIDEFTVTAIINALCSKGHAKRAEGVVWHHRDKITGAKPCIYRSLL 224

Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498
            +GWS++ NVKEARRI++EMK+  V  DL C+NTFL+CLCE NL+ NPSGLVPEALNVMME
Sbjct: 225  YGWSVQRNVKEARRIIKEMKANGVTPDLLCYNTFLRCLCERNLRHNPSGLVPEALNVMME 284

Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678
            MRS ++ P  ISYNILLSCLG+TRRVKE+   +E M    C+PDWVSYYLVA+VL+LSGR
Sbjct: 285  MRSCRVFPTPISYNILLSCLGKTRRVKESCQILETMTNGGCDPDWVSYYLVAKVLFLSGR 344

Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858
            FGKG  IV++MI  G+ P  K Y++L+G+LCGVERVN+ALELFE+MK++S+G YGPVYD+
Sbjct: 345  FGKGKDIVDQMIGKGLMPNHKFYYSLIGILCGVERVNHALELFEKMKKNSMGGYGPVYDV 404

Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVY 1987
            LIPKLC GG+F KG+ELWDEA  MG+ L CS   LDPSIT+VY
Sbjct: 405  LIPKLCTGGNFEKGRELWDEATSMGIILQCSEDVLDPSITQVY 447


>gb|AAM65325.1| unknown [Arabidopsis thaliana]
          Length = 487

 Score =  528 bits (1361), Expect = e-147
 Identities = 256/425 (60%), Positives = 323/425 (76%)
 Frame = +2

Query: 737  YFCSCTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNS 916
            YFCS           + E+  +VS+PVG LDDLE  L++     +S++V QV+E CKN +
Sbjct: 23   YFCS-HHLVDRSETALHEVIRIVSSPVGGLDDLEENLNQVSVSPSSNLVTQVIESCKNET 81

Query: 917  PSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTF 1096
              RRLLRFF+WSCK L   L DKEFN+ ++  AE KD  A++ILLSD+ KE R +D QTF
Sbjct: 82   SPRRLLRFFSWSCKSLGSSLHDKEFNYVLRVLAEKKDHTAMQILLSDLRKENRAMDKQTF 141

Query: 1097 CDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGD 1276
              V + LVK+G+E+DA+GIFK LDKF CP D  TVTAII+ALC++GH +RA GV+ HH D
Sbjct: 142  SIVAETLVKIGKEEDAIGIFKILDKFSCPQDGFTVTAIISALCSRGHVKRALGVMHHHKD 201

Query: 1277 KISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKN 1456
             ISG E  VY+SLL GWS++ NVKEARR++Q+MKS  +  DL+CFN+ L CLCE N+ +N
Sbjct: 202  VISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFCFNSLLTCLCERNVNRN 261

Query: 1457 PSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWV 1636
            PSGLVPEALN+M+EMRSYKI P S+SYNILLSCLGRTRRV+E+   +E M+++ C+PD  
Sbjct: 262  PSGLVPEALNIMLEMRSYKIQPTSMSYNILLSCLGRTRRVRESCQILEQMKRSGCDPDTG 321

Query: 1637 SYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERM 1816
            SYY V RVLYL+GRFGKGNQIV+EMIE G  P  K Y++L+GVLCGVERVN+AL+LFE+M
Sbjct: 322  SYYFVVRVLYLTGRFGKGNQIVDEMIERGFRPERKFYYDLIGVLCGVERVNFALQLFEKM 381

Query: 1817 KRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCK 1996
            KRSSVG YG VYDLLIPKLC+GG+F KG+ELW+EA  + V L CS   LDPS+TEV+   
Sbjct: 382  KRSSVGGYGQVYDLLIPKLCKGGNFEKGRELWEEALSIDVTLSCSISLLDPSVTEVFKPM 441

Query: 1997 RQDDE 2011
            +  +E
Sbjct: 442  KMKEE 446


>ref|NP_200945.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75171474|sp|Q9FLJ6.1|PP439_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g61370, mitochondrial; Flags: Precursor
            gi|9757858|dbj|BAB08492.1| unnamed protein product
            [Arabidopsis thaliana] gi|17529064|gb|AAL38742.1| unknown
            protein [Arabidopsis thaliana] gi|23296891|gb|AAN13197.1|
            unknown protein [Arabidopsis thaliana]
            gi|332010076|gb|AED97459.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 487

 Score =  528 bits (1360), Expect = e-147
 Identities = 256/425 (60%), Positives = 323/425 (76%)
 Frame = +2

Query: 737  YFCSCTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNS 916
            YFCS           + E+  +VS+PVG LDDLE  L++     +S++V QV+E CKN +
Sbjct: 23   YFCS-HHLVDRSETALHEVIRIVSSPVGGLDDLEENLNQVSVSPSSNLVTQVIESCKNET 81

Query: 917  PSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTF 1096
              RRLLRFF+WSCK L   L DKEFN+ ++  AE KD  A++ILLSD+ KE R +D QTF
Sbjct: 82   SPRRLLRFFSWSCKSLGSSLHDKEFNYVLRVLAEKKDHTAMQILLSDLRKENRAMDKQTF 141

Query: 1097 CDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGD 1276
              V + LVK+G+E+DA+GIFK LDKF CP D  TVTAII+ALC++GH +RA GV+ HH D
Sbjct: 142  SIVAETLVKVGKEEDAIGIFKILDKFSCPQDGFTVTAIISALCSRGHVKRALGVMHHHKD 201

Query: 1277 KISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKN 1456
             ISG E  VY+SLL GWS++ NVKEARR++Q+MKS  +  DL+CFN+ L CLCE N+ +N
Sbjct: 202  VISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFCFNSLLTCLCERNVNRN 261

Query: 1457 PSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWV 1636
            PSGLVPEALN+M+EMRSYKI P S+SYNILLSCLGRTRRV+E+   +E M+++ C+PD  
Sbjct: 262  PSGLVPEALNIMLEMRSYKIQPTSMSYNILLSCLGRTRRVRESCQILEQMKRSGCDPDTG 321

Query: 1637 SYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERM 1816
            SYY V RVLYL+GRFGKGNQIV+EMIE G  P  K Y++L+GVLCGVERVN+AL+LFE+M
Sbjct: 322  SYYFVVRVLYLTGRFGKGNQIVDEMIERGFRPERKFYYDLIGVLCGVERVNFALQLFEKM 381

Query: 1817 KRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCK 1996
            KRSSVG YG VYDLLIPKLC+GG+F KG+ELW+EA  + V L CS   LDPS+TEV+   
Sbjct: 382  KRSSVGGYGQVYDLLIPKLCKGGNFEKGRELWEEALSIDVTLSCSISLLDPSVTEVFKPM 441

Query: 1997 RQDDE 2011
            +  +E
Sbjct: 442  KMKEE 446


>ref|XP_002318601.2| hypothetical protein POPTR_0012s07030g, partial [Populus trichocarpa]
            gi|550326549|gb|EEE96821.2| hypothetical protein
            POPTR_0012s07030g, partial [Populus trichocarpa]
          Length = 410

 Score =  522 bits (1344), Expect = e-145
 Identities = 256/399 (64%), Positives = 316/399 (79%), Gaps = 3/399 (0%)
 Frame = +2

Query: 794  CNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGG 973
            C ++S+ +G LDDLE  L++    LT  +V Q++  CK+ +PSRR+LRFF WS K L+  
Sbjct: 2    CKVISSWIGGLDDLELSLNQFKGQLTYPLVTQIINSCKHEAPSRRILRFFLWSNKVLDSE 61

Query: 974  -LVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALG 1150
             L D +FN  I+  AE KD   + IL+SD+ KEGRV+D QTF  V + LVKLGRED+ALG
Sbjct: 62   KLKDDDFNHVIRVLAEKKDHTGMRILISDLRKEGRVMDPQTFALVAETLVKLGREDEALG 121

Query: 1151 IFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHH-GDKISGVESCV-YKSLLHG 1324
            IFKNL+KF CP D   VTAII+ALC KGHA++A+GV  HH  +KISG+E CV Y+ LL+G
Sbjct: 122  IFKNLEKFKCPQDGFAVTAIISALCAKGHAKKAQGVFSHHKNNKISGLEPCVVYRCLLYG 181

Query: 1325 WSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEMR 1504
            WS++ENVKEAR+I+QEMK   ++ DL+C+NTFLKCLCE NLK+NPSGLVPEALNVMMEMR
Sbjct: 182  WSVQENVKEARKIIQEMKGDGLIPDLFCYNTFLKCLCERNLKRNPSGLVPEALNVMMEMR 241

Query: 1505 SYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRFG 1684
            SY+I PNSISYN LLS LGR RRVKE+   +E M+ T C PDWVSY+LVA+V+YL+GRFG
Sbjct: 242  SYRIEPNSISYNTLLSSLGRARRVKESYRMLETMKTTGCAPDWVSYFLVAKVMYLTGRFG 301

Query: 1685 KGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLLI 1864
            KGN+IV+EMI  G+ P  K Y+NL+GVLCGVERV+YALELFERMK SS+G YGPVYD+LI
Sbjct: 302  KGNEIVDEMIGQGLLPDRKFYYNLIGVLCGVERVSYALELFERMKTSSLGGYGPVYDILI 361

Query: 1865 PKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITE 1981
            PKLC+GGDF +G+ELW+EA  MGV+  CS   LDPSITE
Sbjct: 362  PKLCKGGDFERGRELWEEATAMGVSFSCSSDVLDPSITE 400


>ref|XP_006394515.1| hypothetical protein EUTSA_v10004085mg [Eutrema salsugineum]
            gi|557091154|gb|ESQ31801.1| hypothetical protein
            EUTSA_v10004085mg [Eutrema salsugineum]
          Length = 489

 Score =  515 bits (1326), Expect = e-143
 Identities = 245/412 (59%), Positives = 315/412 (76%)
 Frame = +2

Query: 779  QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958
            ++ E+  +VS+P+G LDDLE  L++     +S +V +V++ CK+ +  RRLLRFF+WSCK
Sbjct: 38   ELHEVIRIVSSPIGGLDDLEESLNQVSVSPSSKLVHKVIDSCKDETSPRRLLRFFSWSCK 97

Query: 959  RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138
             L   L DK FN  ++  AE KD  A++ILLSD+ K+ R +D QTF  V + LVK+GRE+
Sbjct: 98   NLGSCLEDKTFNHVLRVLAEKKDHTAIQILLSDLRKQNRAMDKQTFSLVAETLVKIGREE 157

Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318
            DA+GIFK LDKF C  D  TVTAII+ALC++GH +RA GV+ HH   ISG E  VY+SLL
Sbjct: 158  DAIGIFKILDKFSCQQDSFTVTAIISALCSRGHVKRALGVMHHHKALISGNELSVYRSLL 217

Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498
             GWS++ NVKEARR++Q+MKS ++  DL+C+NT L CLCE N+ +NPSGLVPEALN+M+E
Sbjct: 218  FGWSVQRNVKEARRVIQDMKSSRITPDLFCYNTMLTCLCERNVNRNPSGLVPEALNIMLE 277

Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678
            MRSYKI P  ISYNILLSCL RTRRVKE+   +E M+K+ C+PD  SYY V RVLYL+GR
Sbjct: 278  MRSYKIQPTCISYNILLSCLARTRRVKESCQILEQMKKSGCDPDTASYYFVVRVLYLTGR 337

Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858
            FGKGNQ V+EMIE G+ P  + Y++L+GVLCGV+RVN+AL+LF +MKRSSVG YGPVYDL
Sbjct: 338  FGKGNQTVDEMIERGLRPERRFYYDLIGVLCGVKRVNFALQLFAKMKRSSVGGYGPVYDL 397

Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEE 2014
            LIPKLC+GGDF KG+ELW+EA  + V L CS   LDPS+TEV+   ++  EE
Sbjct: 398  LIPKLCKGGDFEKGRELWEEAMSLDVTLSCSVDLLDPSLTEVFKPMKKKKEE 449


>ref|XP_006849319.1| hypothetical protein AMTR_s00164p00020970 [Amborella trichopoda]
            gi|548852840|gb|ERN10900.1| hypothetical protein
            AMTR_s00164p00020970 [Amborella trichopoda]
          Length = 459

 Score =  383 bits (984), Expect = e-103
 Identities = 185/397 (46%), Positives = 275/397 (69%)
 Frame = +2

Query: 815  VGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFN 994
            +G+LDD+ES L++    ++  +V QV+E C + + +RRLLRFFTWS K+    L D  FN
Sbjct: 35   IGNLDDIESNLNQSEILISPPLVTQVMESCTHRAQTRRLLRFFTWSAKQPTCKLPDTLFN 94

Query: 995  FAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKF 1174
             AI+ FA +KD RA+E+L++++ +E R +   T+  +   +V  G+ED A+GIFKN++K+
Sbjct: 95   HAIKLFASLKDLRAMELLVTELKRESRGMGIDTWAAIATTMVDHGKEDQAIGIFKNIEKY 154

Query: 1175 GCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEA 1354
             CP D+ ++  ++ ALC +GHAR+AEGV+ +  + +S ++S ++ +L+HGW +K   K+A
Sbjct: 155  RCPRDEKSLNLLVHALCARGHARKAEGVVWNAKNWVS-MDSYIFTTLIHGWCIKGEFKDA 213

Query: 1355 RRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSIS 1534
            RR+ +EM+S     +L  +++ ++C+C  NL+ NPS LV +   ++MEMRS  + P +IS
Sbjct: 214  RRVFEEMRSNGFSPNLVAYHSLIRCVCAKNLRINPSALVRDFFELVMEMRSNSVCPTTIS 273

Query: 1535 YNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMI 1714
            +NIL+S LGR RRVKE       M +  C+PD+VSY+LV R+LYL+GR GKGN++V+EMI
Sbjct: 274  FNILISYLGRARRVKEADQVFRAMVQEGCDPDYVSYFLVVRLLYLTGRMGKGNEMVDEMI 333

Query: 1715 EAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFV 1894
            + G+ P  + YH+L GVLCGVE+V++AL L  RMK +    YGP YDLLI KLC+GG F 
Sbjct: 334  QIGLKPKARFYHSLTGVLCGVEKVDHALWLLARMKENCSEVYGPTYDLLITKLCKGGKFE 393

Query: 1895 KGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQD 2005
             G++LWDEA   G  L CS   LDPS TEVY  KR++
Sbjct: 394  IGRKLWDEALERGAVLQCSVDLLDPSKTEVYKPKRKE 430


>gb|EEC66969.1| hypothetical protein OsI_33629 [Oryza sativa Indica Group]
          Length = 648

 Score =  359 bits (921), Expect = 3e-96
 Identities = 192/409 (46%), Positives = 267/409 (65%), Gaps = 15/409 (3%)
 Frame = +2

Query: 800  LVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPS-RRLLRFFTWSCKRLNGGL 976
            +V +  GSLD++   LD+ G P++ +MV +V++ C     S RRLLRF +W   +  GG+
Sbjct: 30   VVCSGAGSLDEVGGALDRLGVPVSPAMVARVIDACSERMGSGRRLLRFLSWCRSKDAGGI 89

Query: 977  VDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALGIF 1156
             D+  + AI A A M D  A+ I ++D  K+GR +  +TF  VV+ LVKLG+ED+A+ +F
Sbjct: 90   GDEALDSAIAALARMGDLTAMRIAVADAEKDGRRMSPETFTVVVEALVKLGKEDEAVRLF 149

Query: 1157 KNLDKFGCPHDK----------TTVTAIITALCTKGHARRAEGVLRHHGDKIS--GVESC 1300
            + L++      +          ++  A++ ALC KGHAR A+GV+ HH  ++S   + S 
Sbjct: 150  RGLERQRLLPQRDAGDGGEGVWSSSLAMVQALCMKGHAREAQGVVWHHKSELSVEPMVSI 209

Query: 1301 VYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEA 1480
            V +SLLHGW +  N KEARR+L ++KS    L L  FN +L CLC  NLK NPS LV EA
Sbjct: 210  VQRSLLHGWCVHGNAKEARRVLDDIKSSCTPLGLPSFNDYLHCLCHRNLKFNPSALVTEA 269

Query: 1481 LNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMR--KTRCNPDWVSYYLVA 1654
            ++V+ EMRSY + P++ S NILLSCLGR RRVKE+   + LMR  K  C+PDWVSYYLV 
Sbjct: 270  MDVLSEMRSYGVTPDASSLNILLSCLGRARRVKESYRILYLMREGKAGCSPDWVSYYLVV 329

Query: 1655 RVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVG 1834
            RVLYL+GR  +G ++V++M+E+GV P  K +H L+GVLCG E+V++ L++F  MKR  + 
Sbjct: 330  RVLYLTGRIIRGKRLVDDMLESGVLPTAKFFHGLIGVLCGTEKVDHGLDMFRLMKRCQLV 389

Query: 1835 EYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITE 1981
            +    YDLLI KLCR G F  GKELWD+A++ G  L CS   LDP  TE
Sbjct: 390  D-THTYDLLIEKLCRNGRFENGKELWDDAKKNGFMLGCSEDLLDPLKTE 437

