BLASTX nr result

ID: Atropa21_contig00001606 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00001606
         (1690 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containi...   466   e-128
ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containi...   460   e-127
gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [The...   350   1e-93
ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containi...   338   3e-90
emb|CBI30774.3| unnamed protein product [Vitis vinifera]              338   4e-90
ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containi...   328   3e-87
ref|XP_002316747.1| predicted protein [Populus trichocarpa]           328   3e-87
gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus pe...   328   6e-87
ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citr...   327   1e-86
ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containi...   321   7e-85
ref|XP_002526919.1| conserved hypothetical protein [Ricinus comm...   320   9e-85
gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]     319   3e-84
ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containi...   312   3e-82
gb|ACU23441.1| unknown [Glycine max]                                  312   3e-82
gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus...   309   2e-81
ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containi...   309   2e-81
ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arab...   305   3e-80
ref|NP_567571.1| pentatricopeptide repeat-containing protein [Ar...   304   9e-80
ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [A...   302   3e-79
ref|NP_001031667.1| pentatricopeptide repeat-containing protein ...   302   3e-79

>ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Solanum lycopersicum]
          Length = 265

 Score =  466 bits (1199), Expect = e-128
 Identities = 233/265 (87%), Positives = 240/265 (90%)
 Frame = +3

Query: 423  MGASLQFEFFNCNLLLKGIKSTRLSEKLNVTSVLKHSQKQGELSLTISDAADQKKVKKAG 602
            MG SLQF FF+CN+LLKGI ST LS+KLNV+S LKHS+KQGELSLTISDAADQKKV+KAG
Sbjct: 1    MGGSLQFHFFSCNILLKGINSTGLSDKLNVSSALKHSKKQGELSLTISDAADQKKVQKAG 60

Query: 603  KVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXXXXXXXXX 782
            KVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFP         
Sbjct: 61   KVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPLIAAAKALR 120

Query: 783  XXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMILHTSTRS 962
                   WKRVIQVAKWMLSKGQGATMATYD LLLAFDMDNRVDEAETLWNMILHTSTRS
Sbjct: 121  ILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMILHTSTRS 180

Query: 963  VSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDKQKLV 1142
            VSKRLFSRMISLYDHHHVP KIVEVFADMEELGVKPDEDTVRRVARAFQMLGQED QKLV
Sbjct: 181  VSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDNQKLV 240

Query: 1143 LKKYQSRWKYIHFNGERARVRRDTE 1217
            LKKYQSRWKY+HFNGERARVRRD E
Sbjct: 241  LKKYQSRWKYVHFNGERARVRRDIE 265


>ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X1 [Solanum tuberosum]
            gi|565378234|ref|XP_006355564.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X2 [Solanum tuberosum]
            gi|565378236|ref|XP_006355565.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 265

 Score =  460 bits (1183), Expect = e-127
 Identities = 231/265 (87%), Positives = 237/265 (89%)
 Frame = +3

Query: 423  MGASLQFEFFNCNLLLKGIKSTRLSEKLNVTSVLKHSQKQGELSLTISDAADQKKVKKAG 602
            MG SLQF FF+CN+LLKGI ST LS+KLNVTS LK S+KQGELSLTISD ADQKKV+KAG
Sbjct: 1    MGGSLQFHFFSCNILLKGINSTGLSDKLNVTSALKDSKKQGELSLTISDTADQKKVQKAG 60

Query: 603  KVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXXXXXXXXX 782
            KVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWE EFP         
Sbjct: 61   KVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWEAEFPLIAAAKALR 120

Query: 783  XXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMILHTSTRS 962
                   WKRVIQVAKWMLSKGQGATMATYD LLLAFDMDNRVDEAETLWNMILHTSTRS
Sbjct: 121  ILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMILHTSTRS 180

Query: 963  VSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDKQKLV 1142
            VSKRLFSRMISLYDHHHVP KIVEVFADMEELGVKPDEDTV RVARAFQMLGQEDKQKLV
Sbjct: 181  VSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVGRVARAFQMLGQEDKQKLV 240

Query: 1143 LKKYQSRWKYIHFNGERARVRRDTE 1217
            LKKYQSRWKY+HFNGERARVRRD E
Sbjct: 241  LKKYQSRWKYVHFNGERARVRRDME 265


>gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao]
          Length = 276

 Score =  350 bits (897), Expect = 1e-93
 Identities = 175/231 (75%), Positives = 192/231 (83%)
 Frame = +3

Query: 516  SVLKHSQKQGELSLTISDAADQKKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPN 695
            S +K SQK GE SL IS+A ++K VKK GK EHHLWKKR+SAGSGQKALNLVRIIS LPN
Sbjct: 40   SYVKCSQKLGEQSLGISEAVEKKPVKKVGKNEHHLWKKRDSAGSGQKALNLVRIISQLPN 99

Query: 696  EKESVYGALDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYD 875
            EKE+VYGALDKW AWETEFP                W RVIQVAKWMLSKGQGATM TYD
Sbjct: 100  EKEAVYGALDKWTAWETEFPLIAAAKALRILRKRSQWLRVIQVAKWMLSKGQGATMGTYD 159

Query: 876  TLLLAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEE 1055
            TLLLAFDMD RVDEAE+LWNMILH  TRS+SKRLFSRMISLYDHH++  KI+EVFADMEE
Sbjct: 160  TLLLAFDMDKRVDEAESLWNMILHIHTRSISKRLFSRMISLYDHHNMQDKIIEVFADMEE 219

Query: 1056 LGVKPDEDTVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHFNGERARVRR 1208
            L V+PDE+TVR+VARAFQ LGQEDKQKLVL++Y S+WKYIHFNGER RV R
Sbjct: 220  LCVRPDENTVRKVARAFQKLGQEDKQKLVLRRYLSKWKYIHFNGERVRVTR 270


>ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 300

 Score =  338 bits (868), Expect = 3e-90
 Identities = 163/228 (71%), Positives = 187/228 (82%)
 Frame = +3

Query: 522  LKHSQKQGELSLTISDAADQKKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEK 701
            LK  QKQ   ++  S A ++K +KKAG+ EHHLWKK++SAGSGQKALNL+RI+S LPNEK
Sbjct: 61   LKCCQKQSRQTVMASKAMEKKIIKKAGRNEHHLWKKKDSAGSGQKALNLIRIVSDLPNEK 120

Query: 702  ESVYGALDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTL 881
            E+++GALDKW AWETEFP                W+RVIQVAKWMLSKGQGATMATYDTL
Sbjct: 121  EAIFGALDKWTAWETEFPLIAAAKALRILRRTCQWRRVIQVAKWMLSKGQGATMATYDTL 180

Query: 882  LLAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELG 1061
            LLAFDMDNR+DEAE+LWNMILHT TRS+SKRLFSRMISLYDHH +  KI+EVFADMEEL 
Sbjct: 181  LLAFDMDNRLDEAESLWNMILHTHTRSISKRLFSRMISLYDHHEMKTKIIEVFADMEELS 240

Query: 1062 VKPDEDTVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHFNGERARVR 1205
            V+PDEDTVRRVARAFQ  GQEDK KLVL++Y  +WKYIHF GER +VR
Sbjct: 241  VRPDEDTVRRVARAFQEFGQEDKSKLVLRRYGCKWKYIHFKGERVKVR 288


>emb|CBI30774.3| unnamed protein product [Vitis vinifera]
          Length = 277

 Score =  338 bits (867), Expect = 4e-90
 Identities = 166/214 (77%), Positives = 180/214 (84%)
 Frame = +3

Query: 570  AADQKKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETE 749
            A +++  KK GK EHHLW+KR+S GSGQKALNLVRI+S LPNEKE+VYGALDKW AWETE
Sbjct: 58   AVEKEISKKVGKKEHHLWRKRDSIGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWETE 117

Query: 750  FPXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETL 929
            FP                WKRVIQVAKWMLSKGQGATM TYDTLLLAFDMD RVDEAE+L
Sbjct: 118  FPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDWRVDEAESL 177

Query: 930  WNMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQ 1109
            WNMILHT TRS+SK+LFSRMISLYDHH +  K++EVFADMEELGVKPDEDTVRRVA AFQ
Sbjct: 178  WNMILHTHTRSISKQLFSRMISLYDHHDMRDKVIEVFADMEELGVKPDEDTVRRVACAFQ 237

Query: 1110 MLGQEDKQKLVLKKYQSRWKYIHFNGERARVRRD 1211
             LGQEDKQKLVLKKYQ +WKYIHFNGER RVRRD
Sbjct: 238  TLGQEDKQKLVLKKYQCKWKYIHFNGERVRVRRD 271


>ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Cucumis sativus]
          Length = 270

 Score =  328 bits (842), Expect = 3e-87
 Identities = 160/224 (71%), Positives = 178/224 (79%)
 Frame = +3

Query: 540  QGELSLTISDAADQKKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGA 719
            Q    LT     +++ VKK GK  HHLWKKR+SAGSGQKALNLVRI+S  PNEKE+VYG 
Sbjct: 40   QAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE 99

Query: 720  LDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDM 899
            L+KWIAWETEFP                WKRVIQVAKWMLSKGQGATM TYDTLLLAFDM
Sbjct: 100  LNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDM 159

Query: 900  DNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDED 1079
            D RVDEAE+LWNMILHT TRS+SKR+FSRMISLY+HH +  KI+E+FADMEELGVKPDED
Sbjct: 160  DKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDED 219

Query: 1080 TVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHFNGERARVRRD 1211
            TVRRV RAFQ LGQED +K+V K+Y  +WKYIHF GER RVRRD
Sbjct: 220  TVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGERVRVRRD 263


>ref|XP_002316747.1| predicted protein [Populus trichocarpa]
          Length = 272

 Score =  328 bits (842), Expect = 3e-87
 Identities = 164/263 (62%), Positives = 201/263 (76%)
 Frame = +3

Query: 423  MGASLQFEFFNCNLLLKGIKSTRLSEKLNVTSVLKHSQKQGELSLTISDAADQKKVKKAG 602
            + +   F     +  L+  + T L  K+     +K S+KQ +L+    +  ++K VKK+G
Sbjct: 7    LSSGFLFPSVKISFFLRTARLTSLEPKVTSALCVKCSKKQLKLNSRADE--NRKVVKKSG 64

Query: 603  KVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXXXXXXXXX 782
            K EHHLW+KR+SAGSGQKALNLVRI+S LPNEKE+VYGALDKW AWETEFP         
Sbjct: 65   KKEHHLWQKRDSAGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWETEFPLIAAAKALK 124

Query: 783  XXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMILHTSTRS 962
                   W RVIQVAKWMLSKGQGAT+ TYDTLLLAFD D+RVDEA++LWNMI+H  TRS
Sbjct: 125  ILQQRRQWTRVIQVAKWMLSKGQGATLGTYDTLLLAFDKDDRVDEAKSLWNMIIHVHTRS 184

Query: 963  VSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDKQKLV 1142
            +SKRLFSRMISLYDHH++  +I+EVFADMEELGV+PDEDTV RVARAF+ LGQE+K++LV
Sbjct: 185  MSKRLFSRMISLYDHHNMQDEIIEVFADMEELGVRPDEDTVWRVARAFKKLGQEEKRELV 244

Query: 1143 LKKYQSRWKYIHFNGERARVRRD 1211
            L++Y  +WKYIHFNGER RV+RD
Sbjct: 245  LERYLCKWKYIHFNGERVRVKRD 267


>gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus persica]
          Length = 224

 Score =  328 bits (840), Expect = 6e-87
 Identities = 158/209 (75%), Positives = 175/209 (83%)
 Frame = +3

Query: 579  QKKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPX 758
            +K +KK G+ EHHLW+KR+SAGSGQKALNLVRI+SGLPNEKE+VYGALDKW AWETEFP 
Sbjct: 4    RKTIKKVGRKEHHLWQKRDSAGSGQKALNLVRIVSGLPNEKETVYGALDKWTAWETEFPL 63

Query: 759  XXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNM 938
                           W RVIQVAKWMLSKGQGATM TYDTLLLAFDMD RVDEAE+LWNM
Sbjct: 64   IAAVKALRILRKRSQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNM 123

Query: 939  ILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLG 1118
            ILHT TRS+SKRLFSRMISLYDHH    KI+EVFADMEELGVKPDEDTVRRVARAF+ LG
Sbjct: 124  ILHTHTRSISKRLFSRMISLYDHHDKQNKIIEVFADMEELGVKPDEDTVRRVARAFKELG 183

Query: 1119 QEDKQKLVLKKYQSRWKYIHFNGERARVR 1205
            QE+ + LVL++YQ +WKYIHF GER +VR
Sbjct: 184  QEENKTLVLRRYQCKWKYIHFKGERVKVR 212


>ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citrus clementina]
            gi|557552197|gb|ESR62826.1| hypothetical protein
            CICLE_v10016169mg [Citrus clementina]
          Length = 284

 Score =  327 bits (838), Expect = 1e-86
 Identities = 159/210 (75%), Positives = 174/210 (82%)
 Frame = +3

Query: 582  KKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXX 761
            K V K GK E HLW+KR+SAGSGQKALNLVRI+S LPNEK +VYGALDKW AWETEFP  
Sbjct: 66   KLVVKVGKKEQHLWQKRDSAGSGQKALNLVRIVSELPNEKHAVYGALDKWTAWETEFPLI 125

Query: 762  XXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMI 941
                          W RVIQVAKWMLSKGQGATM TYDTLLLAFD D+R DEAE+LWNMI
Sbjct: 126  AAAKALRILRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMI 185

Query: 942  LHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQ 1121
            LHT TRS+SKRLFSRMISLYDHH +P KI+EVFADMEELGV+PDEDTVRR+A AFQ +GQ
Sbjct: 186  LHTHTRSISKRLFSRMISLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQ 245

Query: 1122 EDKQKLVLKKYQSRWKYIHFNGERARVRRD 1211
            ++KQKLVLKKY S+WKYIHF GER RVRRD
Sbjct: 246  DEKQKLVLKKYLSKWKYIHFKGERVRVRRD 275


>ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Citrus sinensis]
          Length = 281

 Score =  321 bits (822), Expect = 7e-85
 Identities = 162/229 (70%), Positives = 182/229 (79%), Gaps = 1/229 (0%)
 Frame = +3

Query: 528  HSQKQGELSLTISDAADQKK-VKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKE 704
            +  KQ  +S + ++A+ + K V K GK E HLW+KR+SAGSGQKALNLV   S LPNEK 
Sbjct: 47   NQNKQPPVSNSNANASKKNKLVVKVGKKEQHLWQKRDSAGSGQKALNLV---SELPNEKH 103

Query: 705  SVYGALDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLL 884
            +VYGALDKW AWETEFP                W RVIQVAKWMLSKGQGATM TYDTLL
Sbjct: 104  AVYGALDKWTAWETEFPLIAAAKALRILRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLL 163

Query: 885  LAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGV 1064
            LAFD D+R DEAE+LWNMILHT TRS+SKRLFSRMISLYDHH +P KI+EVFADMEELGV
Sbjct: 164  LAFDKDHRADEAESLWNMILHTQTRSISKRLFSRMISLYDHHDMPNKIIEVFADMEELGV 223

Query: 1065 KPDEDTVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHFNGERARVRRD 1211
            +PDEDTVRR+A AFQ +GQ+DKQKLVLKKY S+WKYIHF GER RVRRD
Sbjct: 224  RPDEDTVRRIASAFQRVGQDDKQKLVLKKYLSKWKYIHFKGERVRVRRD 272


>ref|XP_002526919.1| conserved hypothetical protein [Ricinus communis]
            gi|223533738|gb|EEF35472.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 224

 Score =  320 bits (821), Expect = 9e-85
 Identities = 156/209 (74%), Positives = 172/209 (82%)
 Frame = +3

Query: 567  DAADQKKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWET 746
            D   +K VKKAGK EHHLWKKR+SA SG+KAL+LVRI+  LP+EKE VYGALDKW AWET
Sbjct: 6    DDKSRKPVKKAGKEEHHLWKKRDSARSGEKALSLVRIVCELPDEKECVYGALDKWTAWET 65

Query: 747  EFPXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAET 926
            EFP                W RVIQVAKWMLSKGQG TM TYDTLLLAFDMDNRVDEA +
Sbjct: 66   EFPLIAVAKGLRILRKHNQWLRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDNRVDEAAS 125

Query: 927  LWNMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAF 1106
            LWNMILHT  RS+SKRLFSRMISLYDHH++P  I+E+FADMEELGV+PDEDTVRRVARAF
Sbjct: 126  LWNMILHTHVRSISKRLFSRMISLYDHHNMPDGIIEIFADMEELGVRPDEDTVRRVARAF 185

Query: 1107 QMLGQEDKQKLVLKKYQSRWKYIHFNGER 1193
            + LGQE+KQKLVLK+Y SRWKYIHF GER
Sbjct: 186  KELGQEEKQKLVLKRYMSRWKYIHFKGER 214


>gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]
          Length = 326

 Score =  319 bits (817), Expect = 3e-84
 Identities = 155/208 (74%), Positives = 172/208 (82%)
 Frame = +3

Query: 588  VKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXXXX 767
            VKK GK E+HLWKK++SAGSGQKALNL+RI+S LPNEKE VYGAL+KWIAWETEFP    
Sbjct: 109  VKKTGKKEYHLWKKKDSAGSGQKALNLIRILSVLPNEKEVVYGALNKWIAWETEFPLIAA 168

Query: 768  XXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMILH 947
                        WKRVIQVAKWMLSKGQG TM TYDTLLLAFDMD RVDEAE+ WNMILH
Sbjct: 169  AKALRILRKRSQWKRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDQRVDEAESFWNMILH 228

Query: 948  TSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQED 1127
            T  RS+SKRLFSRMI+LYDHH V  KI+EVFADMEEL V+ DEDTVRRVA AFQ LGQE+
Sbjct: 229  THKRSISKRLFSRMIALYDHHDVKDKIIEVFADMEELSVRLDEDTVRRVAYAFQKLGQEE 288

Query: 1128 KQKLVLKKYQSRWKYIHFNGERARVRRD 1211
            K+KL+L+KYQ +WKY+HF GER RVRRD
Sbjct: 289  KKKLLLRKYQCKWKYVHFKGERIRVRRD 316


>ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X1 [Glycine max]
            gi|571517206|ref|XP_006597502.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X2 [Glycine max]
          Length = 288

 Score =  312 bits (799), Expect = 3e-82
 Identities = 160/250 (64%), Positives = 184/250 (73%)
 Frame = +3

Query: 462  LLLKGIKSTRLSEKLNVTSVLKHSQKQGELSLTISDAADQKKVKKAGKVEHHLWKKRESA 641
            LLL    ST     L  TS ++ +  + + S       ++K  K  GK EHHLWK R+SA
Sbjct: 30   LLLGNKFSTMAVTALPKTSCIQCTIVRSKFSHKSGGPMEKKGKKTTGKKEHHLWKSRDSA 89

Query: 642  GSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQ 821
             SGQKAL LVR +  LPNEKE+VYGALDKW AWETEFP                W RVIQ
Sbjct: 90   QSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEFPVIAVSKALKILRKRGHWVRVIQ 149

Query: 822  VAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLY 1001
            VAKWMLSKGQGATM TYDTLLLAFDMD RVDEAE+LWNMI+H   RSVSKRLFSRMISLY
Sbjct: 150  VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMIIHAHMRSVSKRLFSRMISLY 209

Query: 1002 DHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHF 1181
            DHH++P KI++VFADMEEL +KPDEDTVRRVARAF+ LG E+K+KLV+K+Y  +WKYIHF
Sbjct: 210  DHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRELGDEEKRKLVIKQYGLKWKYIHF 269

Query: 1182 NGERARVRRD 1211
            NGER RVR +
Sbjct: 270  NGERVRVRTE 279


>gb|ACU23441.1| unknown [Glycine max]
          Length = 288

 Score =  312 bits (799), Expect = 3e-82
 Identities = 160/250 (64%), Positives = 184/250 (73%)
 Frame = +3

Query: 462  LLLKGIKSTRLSEKLNVTSVLKHSQKQGELSLTISDAADQKKVKKAGKVEHHLWKKRESA 641
            LLL    ST     L  TS ++ +  + + S       ++K  K  GK EHHLWK R+SA
Sbjct: 30   LLLGNKFSTMAVTALPKTSCIQCTIVRSKFSHKSGGPMEKKGKKTTGKKEHHLWKSRDSA 89

Query: 642  GSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQ 821
             SGQKAL LVR +  LPNEKE+VYGALDKW AWETEFP                W RVIQ
Sbjct: 90   QSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEFPVIAVSKALKILRKRGHWVRVIQ 149

Query: 822  VAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLY 1001
            VAKWMLSKGQGATM TYDTLLLAFDMD RVDEAE+LWNMI+H   RSVSKRLFSRMISLY
Sbjct: 150  VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMIIHAHLRSVSKRLFSRMISLY 209

Query: 1002 DHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHF 1181
            DHH++P KI++VFADMEEL +KPDEDTVRRVARAF+ LG E+K+KLV+K+Y  +WKYIHF
Sbjct: 210  DHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRELGDEEKRKLVIKQYGLKWKYIHF 269

Query: 1182 NGERARVRRD 1211
            NGER RVR +
Sbjct: 270  NGERVRVRTE 279


>gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus vulgaris]
          Length = 289

 Score =  309 bits (792), Expect = 2e-81
 Identities = 154/213 (72%), Positives = 173/213 (81%), Gaps = 1/213 (0%)
 Frame = +3

Query: 576  DQKKVKKA-GKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEF 752
            ++KK KK  GK EHHLWK R+SA SGQKAL LVRI+S LPNEKE+VYGALDKWIAWETEF
Sbjct: 68   EKKKGKKTTGKKEHHLWKSRDSAQSGQKALTLVRIVSKLPNEKEAVYGALDKWIAWETEF 127

Query: 753  PXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLW 932
            P                W RVIQVAKWMLSKGQGATM T+DTLLLAFDMD RVDEAE+LW
Sbjct: 128  PVIAAAKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTFDTLLLAFDMDQRVDEAESLW 187

Query: 933  NMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQM 1112
            NMI+HT  RSVSKRLFSRMIS+YD+H +P KI+EVFADMEEL VKPDEDTVRRVARAF  
Sbjct: 188  NMIIHTHMRSVSKRLFSRMISIYDNHDMPDKIIEVFADMEELRVKPDEDTVRRVARAFTE 247

Query: 1113 LGQEDKQKLVLKKYQSRWKYIHFNGERARVRRD 1211
            LG+E+K+KLV ++Y  +WKYIHFN ER RVR +
Sbjct: 248  LGEEEKRKLVARRYGIKWKYIHFNRERVRVRTE 280


>ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X5 [Cicer arietinum]
          Length = 287

 Score =  309 bits (792), Expect = 2e-81
 Identities = 157/242 (64%), Positives = 180/242 (74%)
 Frame = +3

Query: 483  STRLSEKLNVTSVLKHSQKQGELSLTISDAADQKKVKKAGKVEHHLWKKRESAGSGQKAL 662
            S  +S K + TS  +  Q +   ++      D+K  K  GKVEHHLWK+R SA SGQKAL
Sbjct: 41   SITISRKTSCTSC-RFVQSKSSPNVGRPVEKDKKGNKIKGKVEHHLWKRRNSAQSGQKAL 99

Query: 663  NLVRIISGLPNEKESVYGALDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQVAKWMLS 842
             LVR I  LPNEKESVYGALDKW AWETEFP                W RVIQ+AKWMLS
Sbjct: 100  TLVRTICELPNEKESVYGALDKWTAWETEFPLVAAAKALNILRKRGQWVRVIQLAKWMLS 159

Query: 843  KGQGATMATYDTLLLAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHHHVPY 1022
            KGQGATM TYDTLLLAFDMD R+DEAE+LWNMI+H   RSVSKRLFSRMISLYDHH++  
Sbjct: 160  KGQGATMGTYDTLLLAFDMDQRIDEAESLWNMIIHAHMRSVSKRLFSRMISLYDHHNLSE 219

Query: 1023 KIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHFNGERARV 1202
            KIVE+FADMEEL +KPDEDTVR+V  AF+ LGQE+K+K V+K+Y  +WKYIHFNGER RV
Sbjct: 220  KIVEIFADMEELRIKPDEDTVRKVTNAFRKLGQEEKRKSVIKRYGLKWKYIHFNGERVRV 279

Query: 1203 RR 1208
            RR
Sbjct: 280  RR 281


>ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp.
            lyrata] gi|297313803|gb|EFH44226.1| hypothetical protein
            ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata]
          Length = 284

 Score =  305 bits (782), Expect = 3e-80
 Identities = 149/229 (65%), Positives = 181/229 (79%), Gaps = 1/229 (0%)
 Frame = +3

Query: 525  KHSQKQ-GELSLTISDAADQKKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEK 701
            K S+KQ G+L +      +  ++KK GK EHHLWKK +SAGSGQKALNLVR++SGLPNEK
Sbjct: 53   KFSEKQAGKLDVA---TVNSNEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEK 109

Query: 702  ESVYGALDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTL 881
            E+VYGAL+KW+AWE EFP                W RVIQ+AKWMLSKGQGATM TYDTL
Sbjct: 110  EAVYGALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDTL 169

Query: 882  LLAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELG 1061
            LLAFDMD R DEAE+LWNMILHT TRS+ +RLF+RMI+LY H+ +  K++EVFADMEEL 
Sbjct: 170  LLAFDMDQRADEAESLWNMILHTHTRSIPRRLFARMIALYAHYDLHDKVIEVFADMEELK 229

Query: 1062 VKPDEDTVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHFNGERARVRR 1208
            V+PDEDT RRVARAF+ LGQE+ +KL+L++Y S +KYI+FNGER RV+R
Sbjct: 230  VRPDEDTARRVARAFRELGQEENRKLILRRYLSEFKYIYFNGERVRVKR 278


>ref|NP_567571.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|186512032|ref|NP_001119009.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|334186688|ref|NP_001190768.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|223635632|sp|Q2V3H0.2|PP322_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g18975, chloroplastic; Flags: Precursor
            gi|332658715|gb|AEE84115.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332658717|gb|AEE84117.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332658718|gb|AEE84118.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 287

 Score =  304 bits (778), Expect = 9e-80
 Identities = 155/265 (58%), Positives = 191/265 (72%), Gaps = 4/265 (1%)
 Frame = +3

Query: 426  GASLQFEFFNCNLLLK---GIKSTRLSEKLNVTSVLKHSQKQ-GELSLTISDAADQKKVK 593
            G S   EF   +LL     G  S+  +++       K S+K+ G+L        + K++K
Sbjct: 17   GLSKSQEFICFSLLQSPRCGSYSSLKTKRFGFCIRSKFSEKEAGKLDRGYVATVNSKEIK 76

Query: 594  KAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXXXXXX 773
            K GK EHHLWKK +SAGSGQKALNLVR++SGLPNEKE+VYGAL+KW+AWE EFP      
Sbjct: 77   KVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAK 136

Query: 774  XXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMILHTS 953
                      W RVIQ+AKWMLSKGQGATM TYD LLLAFDMD R DEAE+LWNMILHT 
Sbjct: 137  ALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHTH 196

Query: 954  TRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDKQ 1133
            TRS+ +RLF+RMI+LY HH +  K++EVFADMEEL V PDED+ RRVARAF+ L QE+ +
Sbjct: 197  TRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEENR 256

Query: 1134 KLVLKKYQSRWKYIHFNGERARVRR 1208
            KL+L++Y S +KYI+FNGER RV+R
Sbjct: 257  KLILRRYLSEYKYIYFNGERVRVKR 281


>ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda]
            gi|548851451|gb|ERN09727.1| hypothetical protein
            AMTR_s00029p00227910 [Amborella trichopoda]
          Length = 287

 Score =  302 bits (774), Expect = 3e-79
 Identities = 146/208 (70%), Positives = 169/208 (81%)
 Frame = +3

Query: 582  KKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPXX 761
            +K KK  K EHHLW KR+SAGS QKALNLVRI+S + NEKE++Y ALD+W AWETEFP  
Sbjct: 73   EKPKKLFKKEHHLWMKRDSAGSSQKALNLVRIVSRVSNEKEAIYVALDEWAAWETEFPVI 132

Query: 762  XXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRVDEAETLWNMI 941
                          W RVIQV+KW+LSKGQ  TM TYDTLLLAFDMD RVDEAET+WNMI
Sbjct: 133  AAAKALGILRKRRRWLRVIQVSKWLLSKGQVLTMGTYDTLLLAFDMDGRVDEAETIWNMI 192

Query: 942  LHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQ 1121
            LHT TRS+SKRLFSRM+SLYDHHH+P K++EVFADMEELGVKPD+D+VRRVARAFQ LG+
Sbjct: 193  LHTYTRSISKRLFSRMMSLYDHHHIPDKLLEVFADMEELGVKPDQDSVRRVARAFQQLGE 252

Query: 1122 EDKQKLVLKKYQSRWKYIHFNGERARVR 1205
            E+KQK VL+KY  + KYIHFNGER R++
Sbjct: 253  EEKQKQVLQKYGLKLKYIHFNGERVRIK 280


>ref|NP_001031667.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332658716|gb|AEE84116.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 260

 Score =  302 bits (773), Expect = 3e-79
 Identities = 144/225 (64%), Positives = 175/225 (77%)
 Frame = +3

Query: 534  QKQGELSLTISDAADQKKVKKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVY 713
            ++ G+L        + K++KK GK EHHLWKK +SAGSGQKALNLVR++SGLPNEKE+VY
Sbjct: 30   KEAGKLDRGYVATVNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVY 89

Query: 714  GALDKWIAWETEFPXXXXXXXXXXXXXXXXWKRVIQVAKWMLSKGQGATMATYDTLLLAF 893
            GAL+KW+AWE EFP                W RVIQ+AKWMLSKGQGATM TYD LLLAF
Sbjct: 90   GALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAF 149

Query: 894  DMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHHHVPYKIVEVFADMEELGVKPD 1073
            DMD R DEAE+LWNMILHT TRS+ +RLF+RMI+LY HH +  K++EVFADMEEL V PD
Sbjct: 150  DMDERADEAESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPD 209

Query: 1074 EDTVRRVARAFQMLGQEDKQKLVLKKYQSRWKYIHFNGERARVRR 1208
            ED+ RRVARAF+ L QE+ +KL+L++Y S +KYI+FNGER RV+R
Sbjct: 210  EDSARRVARAFRELNQEENRKLILRRYLSEYKYIYFNGERVRVKR 254


Top