BLASTX nr result

ID: Rheum21_contig00000937 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00000937
         (1793 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containi...   340   1e-90
emb|CBI30774.3| unnamed protein product [Vitis vinifera]              332   4e-88
gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus pe...   328   5e-87
ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citr...   327   1e-86
gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [The...   320   1e-84
ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containi...   320   2e-84
ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containi...   320   2e-84
ref|XP_002316747.1| predicted protein [Populus trichocarpa]           320   2e-84
gb|ACU23441.1| unknown [Glycine max]                                  319   2e-84
ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containi...   315   3e-83
ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containi...   313   1e-82
ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containi...   313   2e-82
gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]     312   3e-82
gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus...   311   4e-82
ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containi...   308   4e-81
ref|XP_002526919.1| conserved hypothetical protein [Ricinus comm...   305   4e-80
ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [A...   289   3e-75
ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arab...   288   5e-75
ref|NP_001031667.1| pentatricopeptide repeat-containing protein ...   283   1e-73
ref|NP_567571.1| pentatricopeptide repeat-containing protein [Ar...   283   1e-73

>ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 300

 Score =  340 bits (872), Expect = 1e-90
 Identities = 175/285 (61%), Positives = 203/285 (71%), Gaps = 12/285 (4%)
 Frame = -1

Query: 1208 GSKQAKVADSSLRLRYGISFSRISPSMQISGLYPRRIKVQYSLDLNGPPCK--------- 1056
            G  Q K  +++L    G+S SR + S++ S     R  ++Y + LN    K         
Sbjct: 12   GINQIKPTEAALSRTVGLSNSRTALSLKSSSFLCVRNSLRYVVGLNMFDLKCCQKQSRQT 71

Query: 1055 ---SASXXXXXXXKPGKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWT 885
               S +       K G+NEHHLWKK+DS GSGQKALNLIRI+SDLPNEKEA++GALDKWT
Sbjct: 72   VMASKAMEKKIIKKAGRNEHHLWKKKDSAGSGQKALNLIRIVSDLPNEKEAIFGALDKWT 131

Query: 884  AWELEFPXXXXXXXXXXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIE 705
            AWE EFP               QW+RVIQVAKWMLSKGQGATMATYDTLLLAFD D R++
Sbjct: 132  AWETEFPLIAAAKALRILRRTCQWRRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRLD 191

Query: 704  EAESLWNMILHTHSRSISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRI 525
            EAESLWNMILHTH+RSISK LFSRMISLYDHH    KIIEVFADMEEL V+PD DTVRR+
Sbjct: 192  EAESLWNMILHTHTRSISKRLFSRMISLYDHHEMKTKIIEVFADMEELSVRPDEDTVRRV 251

Query: 524  ARAFQELGQVDKKQLVLKKYGLKWKYVHFKGERVRVRVEPWNEGE 390
            ARAFQE GQ DK +LVL++YG KWKY+HFKGERV+VR   W E +
Sbjct: 252  ARAFQEFGQEDKSKLVLRRYGCKWKYIHFKGERVKVRTNAWVEDD 296


>emb|CBI30774.3| unnamed protein product [Vitis vinifera]
          Length = 277

 Score =  332 bits (850), Expect = 4e-88
 Identities = 160/208 (76%), Positives = 177/208 (85%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLW+KRDS GSGQKALNL+RI+S+LPNEKEAVYGALDKWTAWE EFP        
Sbjct: 68   GKKEHHLWRKRDSIGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWETEFPLIAAAKAL 127

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 R+QWKRVIQVAKWMLSKGQGATM TYDTLLLAFD D R++EAESLWNMILHTH+R
Sbjct: 128  RILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDWRVDEAESLWNMILHTHTR 187

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SISK LFSRMISLYDHH+  +K+IEVFADMEELGVKPD DTVRR+A AFQ LGQ DK++L
Sbjct: 188  SISKQLFSRMISLYDHHDMRDKVIEVFADMEELGVKPDEDTVRRVACAFQTLGQEDKQKL 247

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNE 396
            VLKKY  KWKY+HF GERVRVR + W+E
Sbjct: 248  VLKKYQCKWKYIHFNGERVRVRRDAWDE 275


>gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus persica]
          Length = 224

 Score =  328 bits (841), Expect = 5e-87
 Identities = 157/210 (74%), Positives = 175/210 (83%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            G+ EHHLW+KRDS GSGQKALNL+RI+S LPNEKE VYGALDKWTAWE EFP        
Sbjct: 11   GRKEHHLWQKRDSAGSGQKALNLVRIVSGLPNEKETVYGALDKWTAWETEFPLIAAVKAL 70

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 RSQW RVIQVAKWMLSKGQGATM TYDTLLLAFD D+R++EAESLWNMILHTH+R
Sbjct: 71   RILRKRSQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNMILHTHTR 130

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SISK LFSRMISLYDHH+   KIIEVFADMEELGVKPD DTVRR+ARAF+ELGQ + K L
Sbjct: 131  SISKRLFSRMISLYDHHDKQNKIIEVFADMEELGVKPDEDTVRRVARAFKELGQEENKTL 190

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNEGE 390
            VL++Y  KWKY+HFKGERV+VR   W+E +
Sbjct: 191  VLRRYQCKWKYIHFKGERVKVRTNAWDEDD 220


>ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citrus clementina]
            gi|557552197|gb|ESR62826.1| hypothetical protein
            CICLE_v10016169mg [Citrus clementina]
          Length = 284

 Score =  327 bits (837), Expect = 1e-86
 Identities = 160/213 (75%), Positives = 175/213 (82%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK E HLW+KRDS GSGQKALNL+RI+S+LPNEK AVYGALDKWTAWE EFP        
Sbjct: 72   GKKEQHLWQKRDSAGSGQKALNLVRIVSELPNEKHAVYGALDKWTAWETEFPLIAAAKAL 131

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 R QW RVIQVAKWMLSKGQGATM TYDTLLLAFDKD R +EAESLWNMILHTH+R
Sbjct: 132  RILRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMILHTHTR 191

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SISK LFSRMISLYDHH+ P KIIEVFADMEELGV+PD DTVRRIA AFQ +GQ +K++L
Sbjct: 192  SISKRLFSRMISLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQDEKQKL 251

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNEGEADN 381
            VLKKY  KWKY+HFKGERVRVR + W E  + N
Sbjct: 252  VLKKYLSKWKYIHFKGERVRVRRDAWYESGSTN 284


>gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao]
          Length = 276

 Score =  320 bits (821), Expect = 1e-84
 Identities = 156/201 (77%), Positives = 172/201 (85%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GKNEHHLWKKRDS GSGQKALNL+RIIS LPNEKEAVYGALDKWTAWE EFP        
Sbjct: 68   GKNEHHLWKKRDSAGSGQKALNLVRIISQLPNEKEAVYGALDKWTAWETEFPLIAAAKAL 127

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 RSQW RVIQVAKWMLSKGQGATM TYDTLLLAFD D+R++EAESLWNMILH H+R
Sbjct: 128  RILRKRSQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHIHTR 187

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SISK LFSRMISLYDHHN  +KIIEVFADMEEL V+PD +TVR++ARAFQ+LGQ DK++L
Sbjct: 188  SISKRLFSRMISLYDHHNMQDKIIEVFADMEELCVRPDENTVRKVARAFQKLGQEDKQKL 247

Query: 479  VLKKYGLKWKYVHFKGERVRV 417
            VL++Y  KWKY+HF GERVRV
Sbjct: 248  VLRRYLSKWKYIHFNGERVRV 268


>ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Cucumis sativus]
          Length = 270

 Score =  320 bits (819), Expect = 2e-84
 Identities = 151/208 (72%), Positives = 173/208 (83%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK  HHLWKKRDS GSGQKALNL+RI+S  PNEKEAVYG L+KW AWE EFP        
Sbjct: 60   GKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKAL 119

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 RSQWKRVIQVAKWMLSKGQGATM TYDTLLLAFD D+R++EAESLWNMILHTH+R
Sbjct: 120  RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTR 179

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SISK +FSRMISLY+HH+  +KIIE+FADMEELGVKPD DTVRR+ RAFQ+LGQ D +++
Sbjct: 180  SISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKM 239

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNE 396
            V K+Y  +WKY+HFKGERVRVR + W+E
Sbjct: 240  VYKRYSCQWKYIHFKGERVRVRRDGWDE 267


>ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X1 [Glycine max]
            gi|571517206|ref|XP_006597502.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X2 [Glycine max]
          Length = 288

 Score =  320 bits (819), Expect = 2e-84
 Identities = 151/213 (70%), Positives = 172/213 (80%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWK RDS  SGQKAL L+R +  LPNEKEAVYGALDKWTAWE EFP        
Sbjct: 76   GKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEFPVIAVSKAL 135

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 R  W RVIQVAKWMLSKGQGATM TYDTLLLAFD D+R++EAESLWNMI+H H R
Sbjct: 136  KILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMIIHAHMR 195

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            S+SK LFSRMISLYDHHN P+KII+VFADMEEL +KPD DTVRR+ARAF+ELG  +K++L
Sbjct: 196  SVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRELGDEEKRKL 255

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNEGEADN 381
            V+K+YGLKWKY+HF GERVRVR E W + ++ N
Sbjct: 256  VIKQYGLKWKYIHFNGERVRVRTEAWEDNKSTN 288


>ref|XP_002316747.1| predicted protein [Populus trichocarpa]
          Length = 272

 Score =  320 bits (819), Expect = 2e-84
 Identities = 151/208 (72%), Positives = 176/208 (84%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLW+KRDS GSGQKALNL+RI+S+LPNEKEAVYGALDKWTAWE EFP        
Sbjct: 64   GKKEHHLWQKRDSAGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWETEFPLIAAAKAL 123

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 R QW RVIQVAKWMLSKGQGAT+ TYDTLLLAFDKD R++EA+SLWNMI+H H+R
Sbjct: 124  KILQQRRQWTRVIQVAKWMLSKGQGATLGTYDTLLLAFDKDDRVDEAKSLWNMIIHVHTR 183

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            S+SK LFSRMISLYDHHN  ++IIEVFADMEELGV+PD DTV R+ARAF++LGQ +K++L
Sbjct: 184  SMSKRLFSRMISLYDHHNMQDEIIEVFADMEELGVRPDEDTVWRVARAFKKLGQEEKREL 243

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNE 396
            VL++Y  KWKY+HF GERVRV+ + WNE
Sbjct: 244  VLERYLCKWKYIHFNGERVRVKRDGWNE 271


>gb|ACU23441.1| unknown [Glycine max]
          Length = 288

 Score =  319 bits (818), Expect = 2e-84
 Identities = 151/213 (70%), Positives = 172/213 (80%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWK RDS  SGQKAL L+R +  LPNEKEAVYGALDKWTAWE EFP        
Sbjct: 76   GKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEFPVIAVSKAL 135

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 R  W RVIQVAKWMLSKGQGATM TYDTLLLAFD D+R++EAESLWNMI+H H R
Sbjct: 136  KILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMIIHAHLR 195

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            S+SK LFSRMISLYDHHN P+KII+VFADMEEL +KPD DTVRR+ARAF+ELG  +K++L
Sbjct: 196  SVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRELGDEEKRKL 255

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNEGEADN 381
            V+K+YGLKWKY+HF GERVRVR E W + ++ N
Sbjct: 256  VIKQYGLKWKYIHFNGERVRVRTEAWEDNKSTN 288


>ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Citrus sinensis]
          Length = 281

 Score =  315 bits (808), Expect = 3e-83
 Identities = 177/283 (62%), Positives = 195/283 (68%), Gaps = 13/283 (4%)
 Frame = -1

Query: 1190 VADSSLRLRYGISFS-RISPSMQISGLYPRRIKVQYS-------LDLNGPPCKS-----A 1050
            +  SS  + +G S S RI P    SG      K+  S       L+ N  P  S     A
Sbjct: 2    ICRSSWVMGFGFSNSCRIPPLQTASGFSLLTTKLATSNPHLKCFLNQNKQPPVSNSNANA 61

Query: 1049 SXXXXXXXKPGKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELE 870
            S       K GK E HLW+KRDS GSGQKALNL+   S+LPNEK AVYGALDKWTAWE E
Sbjct: 62   SKKNKLVVKVGKKEQHLWQKRDSAGSGQKALNLV---SELPNEKHAVYGALDKWTAWETE 118

Query: 869  FPXXXXXXXXXXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESL 690
            FP             R QW RVIQVAKWMLSKGQGATM TYDTLLLAFDKD R +EAESL
Sbjct: 119  FPLIAAAKALRILRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESL 178

Query: 689  WNMILHTHSRSISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQ 510
            WNMILHT +RSISK LFSRMISLYDHH+ P KIIEVFADMEELGV+PD DTVRRIA AFQ
Sbjct: 179  WNMILHTQTRSISKRLFSRMISLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQ 238

Query: 509  ELGQVDKKQLVLKKYGLKWKYVHFKGERVRVRVEPWNEGEADN 381
             +GQ DK++LVLKKY  KWKY+HFKGERVRVR + W E  + N
Sbjct: 239  RVGQDDKQKLVLKKYLSKWKYIHFKGERVRVRRDAWYESGSTN 281


>ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Solanum lycopersicum]
          Length = 265

 Score =  313 bits (803), Expect = 1e-82
 Identities = 152/202 (75%), Positives = 168/202 (83%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWKKR+S GSGQKALNL+RIIS LPNEKE+VYGALDKW AWE EFP        
Sbjct: 60   GKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPLIAAAKAL 119

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 +  WKRVIQVAKWMLSKGQGATMATYD LLLAFD D R++EAE+LWNMILHT +R
Sbjct: 120  RILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMILHTSTR 179

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            S+SK LFSRMISLYDHH+ P+KI+EVFADMEELGVKPD DTVRR+ARAFQ LGQ D ++L
Sbjct: 180  SVSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDNQKL 239

Query: 479  VLKKYGLKWKYVHFKGERVRVR 414
            VLKKY  +WKYVHF GER RVR
Sbjct: 240  VLKKYQSRWKYVHFNGERARVR 261


>ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X1 [Solanum tuberosum]
            gi|565378234|ref|XP_006355564.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X2 [Solanum tuberosum]
            gi|565378236|ref|XP_006355565.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 265

 Score =  313 bits (801), Expect = 2e-82
 Identities = 152/202 (75%), Positives = 168/202 (83%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWKKR+S GSGQKALNL+RIIS LPNEKE+VYGALDKW AWE EFP        
Sbjct: 60   GKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWEAEFPLIAAAKAL 119

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 +  WKRVIQVAKWMLSKGQGATMATYD LLLAFD D R++EAE+LWNMILHT +R
Sbjct: 120  RILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMILHTSTR 179

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            S+SK LFSRMISLYDHH+ P+KI+EVFADMEELGVKPD DTV R+ARAFQ LGQ DK++L
Sbjct: 180  SVSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVGRVARAFQMLGQEDKQKL 239

Query: 479  VLKKYGLKWKYVHFKGERVRVR 414
            VLKKY  +WKYVHF GER RVR
Sbjct: 240  VLKKYQSRWKYVHFNGERARVR 261


>gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]
          Length = 326

 Score =  312 bits (799), Expect = 3e-82
 Identities = 172/321 (53%), Positives = 206/321 (64%), Gaps = 29/321 (9%)
 Frame = -1

Query: 1256 VCIIEANPMPYFSAAIGSKQAKVADSSLRLRYGISFSRISPSMQISGL---------YPR 1104
            V I   N +P    + G    +  ++ L  R+G S  +IS   + +G          +  
Sbjct: 5    VLISTGNSIPCHFLSPGINHVRTTEAVLPFRFGFSSCKISCFKKKTGFVLFATKGISFDD 64

Query: 1103 RIKVQYS-----LDLNGPPCKSASXXXXXXXK---------------PGKNEHHLWKKRD 984
            ++ + YS     +  NG P  S+                         GK E+HLWKK+D
Sbjct: 65   KLTMNYSHHNCSIKGNGEPLTSSKAIEKLQRLCIEFLYMEFRNLVKKTGKKEYHLWKKKD 124

Query: 983  STGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXXXXXXXRSQWKRV 804
            S GSGQKALNLIRI+S LPNEKE VYGAL+KW AWE EFP             RSQWKRV
Sbjct: 125  SAGSGQKALNLIRILSVLPNEKEVVYGALNKWIAWETEFPLIAAAKALRILRKRSQWKRV 184

Query: 803  IQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSRSISKWLFSRMIS 624
            IQVAKWMLSKGQG TM TYDTLLLAFD D+R++EAES WNMILHTH RSISK LFSRMI+
Sbjct: 185  IQVAKWMLSKGQGTTMGTYDTLLLAFDMDQRVDEAESFWNMILHTHKRSISKRLFSRMIA 244

Query: 623  LYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQLVLKKYGLKWKYV 444
            LYDHH+  +KIIEVFADMEEL V+ D DTVRR+A AFQ+LGQ +KK+L+L+KY  KWKYV
Sbjct: 245  LYDHHDVKDKIIEVFADMEELSVRLDEDTVRRVAYAFQKLGQEEKKKLLLRKYQCKWKYV 304

Query: 443  HFKGERVRVRVEPWNEGEADN 381
            HFKGER+RVR +P     ADN
Sbjct: 305  HFKGERIRVRRDP----SADN 321


>gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus vulgaris]
          Length = 289

 Score =  311 bits (798), Expect = 4e-82
 Identities = 148/213 (69%), Positives = 172/213 (80%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWK RDS  SGQKAL L+RI+S LPNEKEAVYGALDKW AWE EFP        
Sbjct: 77   GKKEHHLWKSRDSAQSGQKALTLVRIVSKLPNEKEAVYGALDKWIAWETEFPVIAAAKAL 136

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 R  W RVIQVAKWMLSKGQGATM T+DTLLLAFD D+R++EAESLWNMI+HTH R
Sbjct: 137  KILRKRGHWVRVIQVAKWMLSKGQGATMGTFDTLLLAFDMDQRVDEAESLWNMIIHTHMR 196

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            S+SK LFSRMIS+YD+H+ P+KIIEVFADMEEL VKPD DTVRR+ARAF ELG+ +K++L
Sbjct: 197  SVSKRLFSRMISIYDNHDMPDKIIEVFADMEELRVKPDEDTVRRVARAFTELGEEEKRKL 256

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNEGEADN 381
            V ++YG+KWKY+HF  ERVRVR E + + E+ N
Sbjct: 257  VARRYGIKWKYIHFNRERVRVRTEAYEDNESTN 289


>ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X5 [Cicer arietinum]
          Length = 287

 Score =  308 bits (790), Expect = 4e-81
 Identities = 145/208 (69%), Positives = 169/208 (81%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWK+R+S  SGQKAL L+R I +LPNEKE+VYGALDKWTAWE EFP        
Sbjct: 79   GKVEHHLWKRRNSAQSGQKALTLVRTICELPNEKESVYGALDKWTAWETEFPLVAAAKAL 138

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 R QW RVIQ+AKWMLSKGQGATM TYDTLLLAFD D+RI+EAESLWNMI+H H R
Sbjct: 139  NILRKRGQWVRVIQLAKWMLSKGQGATMGTYDTLLLAFDMDQRIDEAESLWNMIIHAHMR 198

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            S+SK LFSRMISLYDHHN  EKI+E+FADMEEL +KPD DTVR++  AF++LGQ +K++ 
Sbjct: 199  SVSKRLFSRMISLYDHHNLSEKIVEIFADMEELRIKPDEDTVRKVTNAFRKLGQEEKRKS 258

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNE 396
            V+K+YGLKWKY+HF GERVRVR + W E
Sbjct: 259  VIKRYGLKWKYIHFNGERVRVRRQAWEE 286


>ref|XP_002526919.1| conserved hypothetical protein [Ricinus communis]
            gi|223533738|gb|EEF35472.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 224

 Score =  305 bits (781), Expect = 4e-80
 Identities = 147/208 (70%), Positives = 169/208 (81%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWKKRDS  SG+KAL+L+RI+ +LP+EKE VYGALDKWTAWE EFP        
Sbjct: 17   GKEEHHLWKKRDSARSGEKALSLVRIVCELPDEKECVYGALDKWTAWETEFPLIAVAKGL 76

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                  +QW RVIQVAKWMLSKGQG TM TYDTLLLAFD D R++EA SLWNMILHTH R
Sbjct: 77   RILRKHNQWLRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDNRVDEAASLWNMILHTHVR 136

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SISK LFSRMISLYDHHN P+ IIE+FADMEELGV+PD DTVRR+ARAF+ELGQ +K++L
Sbjct: 137  SISKRLFSRMISLYDHHNMPDGIIEIFADMEELGVRPDEDTVRRVARAFKELGQEEKQKL 196

Query: 479  VLKKYGLKWKYVHFKGERVRVRVEPWNE 396
            VLK+Y  +WKY+HFKGER   +VE + E
Sbjct: 197  VLKRYMSRWKYIHFKGER-DAQVEAFQE 223


>ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda]
            gi|548851451|gb|ERN09727.1| hypothetical protein
            AMTR_s00029p00227910 [Amborella trichopoda]
          Length = 287

 Score =  289 bits (739), Expect = 3e-75
 Identities = 138/208 (66%), Positives = 169/208 (81%), Gaps = 1/208 (0%)
 Frame = -1

Query: 1016 KNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXXX 837
            K EHHLW KRDS GS QKALNL+RI+S + NEKEA+Y ALD+W AWE EFP         
Sbjct: 80   KKEHHLWMKRDSAGSSQKALNLVRIVSRVSNEKEAIYVALDEWAAWETEFPVIAAAKALG 139

Query: 836  XXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSRS 657
                R +W RVIQV+KW+LSKGQ  TM TYDTLLLAFD D R++EAE++WNMILHT++RS
Sbjct: 140  ILRKRRRWLRVIQVSKWLLSKGQVLTMGTYDTLLLAFDMDGRVDEAETIWNMILHTYTRS 199

Query: 656  ISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQLV 477
            ISK LFSRM+SLYDHH+ P+K++EVFADMEELGVKPD D+VRR+ARAFQ+LG+ +K++ V
Sbjct: 200  ISKRLFSRMMSLYDHHHIPDKLLEVFADMEELGVKPDQDSVRRVARAFQQLGEEEKQKQV 259

Query: 476  LKKYGLKWKYVHFKGERVRVRV-EPWNE 396
            L+KYGLK KY+HF GERVR++  E W+E
Sbjct: 260  LQKYGLKLKYIHFNGERVRIKAGENWDE 287


>ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp.
            lyrata] gi|297313803|gb|EFH44226.1| hypothetical protein
            ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata]
          Length = 284

 Score =  288 bits (737), Expect = 5e-75
 Identities = 137/202 (67%), Positives = 167/202 (82%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWKK DS GSGQKALNL+R++S LPNEKEAVYGAL+KW AWE+EFP        
Sbjct: 76   GKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKAL 135

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 RSQW RVIQ+AKWMLSKGQGATM TYDTLLLAFD D+R +EAESLWNMILHTH+R
Sbjct: 136  QILRKRSQWHRVIQLAKWMLSKGQGATMGTYDTLLLAFDMDQRADEAESLWNMILHTHTR 195

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SI + LF+RMI+LY H++  +K+IEVFADMEEL V+PD DT RR+ARAF+ELGQ + ++L
Sbjct: 196  SIPRRLFARMIALYAHYDLHDKVIEVFADMEELKVRPDEDTARRVARAFRELGQEENRKL 255

Query: 479  VLKKYGLKWKYVHFKGERVRVR 414
            +L++Y  ++KY++F GERVRV+
Sbjct: 256  ILRRYLSEFKYIYFNGERVRVK 277


>ref|NP_001031667.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332658716|gb|AEE84116.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 260

 Score =  283 bits (725), Expect = 1e-73
 Identities = 135/202 (66%), Positives = 163/202 (80%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWKK DS GSGQKALNL+R++S LPNEKEAVYGAL+KW AWE+EFP        
Sbjct: 52   GKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKAL 111

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 RSQW RVIQ+AKWMLSKGQGATM TYD LLLAFD D R +EAESLWNMILHTH+R
Sbjct: 112  QILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTR 171

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SI + LF+RMI+LY HH+  +K+IEVFADMEEL V PD D+ RR+ARAF+EL Q + ++L
Sbjct: 172  SIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEENRKL 231

Query: 479  VLKKYGLKWKYVHFKGERVRVR 414
            +L++Y  ++KY++F GERVRV+
Sbjct: 232  ILRRYLSEYKYIYFNGERVRVK 253


>ref|NP_567571.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|186512032|ref|NP_001119009.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|334186688|ref|NP_001190768.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|223635632|sp|Q2V3H0.2|PP322_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g18975, chloroplastic; Flags: Precursor
            gi|332658715|gb|AEE84115.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332658717|gb|AEE84117.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332658718|gb|AEE84118.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 287

 Score =  283 bits (725), Expect = 1e-73
 Identities = 135/202 (66%), Positives = 163/202 (80%)
 Frame = -1

Query: 1019 GKNEHHLWKKRDSTGSGQKALNLIRIISDLPNEKEAVYGALDKWTAWELEFPXXXXXXXX 840
            GK EHHLWKK DS GSGQKALNL+R++S LPNEKEAVYGAL+KW AWE+EFP        
Sbjct: 79   GKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKAL 138

Query: 839  XXXXXRSQWKRVIQVAKWMLSKGQGATMATYDTLLLAFDKDRRIEEAESLWNMILHTHSR 660
                 RSQW RVIQ+AKWMLSKGQGATM TYD LLLAFD D R +EAESLWNMILHTH+R
Sbjct: 139  QILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTR 198

Query: 659  SISKWLFSRMISLYDHHNHPEKIIEVFADMEELGVKPDADTVRRIARAFQELGQVDKKQL 480
            SI + LF+RMI+LY HH+  +K+IEVFADMEEL V PD D+ RR+ARAF+EL Q + ++L
Sbjct: 199  SIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEENRKL 258

Query: 479  VLKKYGLKWKYVHFKGERVRVR 414
            +L++Y  ++KY++F GERVRV+
Sbjct: 259  ILRRYLSEYKYIYFNGERVRVK 280

