BLASTX nr result
ID: Achyranthes22_contig00004859
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00004859
         (2309 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus pe...  313   2e-82
ref|XP_002316747.1| predicted protein [Populus trichocarpa]          312   4e-82
ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containi...  311   1e-81
ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containi...  310   2e-81
ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containi...  310   2e-81
emb|CBI30774.3| unnamed protein product [Vitis vinifera]             309   4e-81
ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containi...  306   2e-80
gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [The...  304   1e-79
ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citr...  302   5e-79
gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]    298   9e-78
ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containi...  297   2e-77
gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus...  296   2e-77
ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containi...  295   8e-77
gb|ACU23441.1| unknown [Glycine max]                                 295   8e-77
ref|XP_002526919.1| conserved hypothetical protein [Ricinus comm...  291   7e-76
ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containi...  291   9e-76
ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [A...  286   4e-74
ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arab...  283   2e-73
ref|NP_001031667.1| pentatricopeptide repeat-containing protein ...
280 2e-72 ref|NP_567571.1| pentatricopeptide repeat-containing protein [Ar... 280 2e-72 >gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus persica] Length = 224 Score = 313 bits (803), Expect = 2e-82 Identities = 151/208 (72%), Positives = 171/208 (82%), Gaps = 1/208 (0%) Frame = +3 Query: 1266 KSVKKPG-KEHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXX 1442 K++KK G KEHHLW+ RDSAGSG KA+NLVRIVSGLPNEKE VY ALD+W AWE EFP Sbjct: 5 KTIKKVGRKEHHLWQKRDSAGSGQKALNLVRIVSGLPNEKETVYGALDKWTAWETEFPLI 64 Query: 1443 XXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMI 1622 + QWVRVIQVAKWMLSKGQG TM TYD +LLA+D + R++EA SLWNMI Sbjct: 65 AAVKALRILRKRSQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNMI 124 Query: 1623 LHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQ 1802 LHTHTRSISKRLFSRMISLYDHHD KIIEVFADMEELGVKPDEDTV+R ++AF++LGQ Sbjct: 125 LHTHTRSISKRLFSRMISLYDHHDKQNKIIEVFADMEELGVKPDEDTVRRVARAFKELGQ 184 Query: 1803 EDKRNMVIHRYGCKWKYIHFNGERVKVR 1886 E+ + +V+ RY CKWKYIHF GERVKVR Sbjct: 185 EENKTLVLRRYQCKWKYIHFKGERVKVR 212 >ref|XP_002316747.1| predicted protein [Populus trichocarpa] Length = 272 Score = 312 bits (800), Expect = 4e-82 Identities = 153/249 (61%), Positives = 189/249 (75%), Gaps = 1/249 (0%) Frame = +3 Query: 1149 ARMVLGKPAFLLVNSLQCNSKSIVQQCSGSIENTSQRYGKSVKKPGK-EHHLWKHRDSAG 1325 AR+ +P ++C+ K + + + + K VKK GK EHHLW+ RDSAG Sbjct: 25 ARLTSLEPKVTSALCVKCSKKQL------KLNSRADENRKVVKKSGKKEHHLWQKRDSAG 78 Query: 1326 SGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXXXXXXXXXXXXXQRQWVRVIQV 1505 SG KA+NLVRIVS LPNEKEAVY ALD+W AWE EFP +RQW RVIQV Sbjct: 79 SGQKALNLVRIVSELPNEKEAVYGALDKWTAWETEFPLIAAAKALKILQQRRQWTRVIQV 138 Query: 1506 AKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMILHTHTRSISKRLFSRMISLYD 1685 AKWMLSKGQG T+ TYD +LLA+D++ R++EA SLWNMI+H HTRS+SKRLFSRMISLYD Sbjct: 139 AKWMLSKGQGATLGTYDTLLLAFDKDDRVDEAKSLWNMIIHVHTRSMSKRLFSRMISLYD 198 Query: 1686 HHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQEDKRNMVIHRYGCKWKYIHFN 1865 HH++ ++IIEVFADMEELGV+PDEDTV R ++AF+KLGQE+KR +V+ RY CKWKYIHFN Sbjct: 
199 HHNMQDEIIEVFADMEELGVRPDEDTVWRVARAFKKLGQEEKRELVLERYLCKWKYIHFN 258 Query: 1866 GERVKVRRE 1892 GERV+V+R+ Sbjct: 259 GERVRVKRD 267 >ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X1 [Solanum tuberosum] gi|565378234|ref|XP_006355564.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X2 [Solanum tuberosum] gi|565378236|ref|XP_006355565.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X3 [Solanum tuberosum] Length = 265 Score = 311 bits (796), Expect = 1e-81 Identities = 150/224 (66%), Positives = 182/224 (81%), Gaps = 1/224 (0%) Frame = +3 Query: 1224 QCSGSIENTSQRYGKSVKKPGK-EHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLA 1400 + S +I +T+ + K V+K GK EHHLWK R+SAGSG KA+NLVRI+SGLPNEKE+VY A Sbjct: 42 ELSLTISDTADQ--KKVQKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGA 99 Query: 1401 LDEWIAWELEFPXXXXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQ 1580 LD+WIAWE EFP QR W RVIQVAKWMLSKGQG TMATYDA+LLA+D Sbjct: 100 LDKWIAWEAEFPLIAAAKALRILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDM 159 Query: 1581 EGRIEEAGSLWNMILHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDED 1760 + R++EA +LWNMILHT TRS+SKRLFSRMISLYDHH +P+KI+EVFADMEELGVKPDED Sbjct: 160 DNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDED 219 Query: 1761 TVKRASQAFEKLGQEDKRNMVIHRYGCKWKYIHFNGERVKVRRE 1892 TV R ++AF+ LGQEDK+ +V+ +Y +WKY+HFNGER +VRR+ Sbjct: 220 TVGRVARAFQMLGQEDKQKLVLKKYQSRWKYVHFNGERARVRRD 263 >ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Solanum lycopersicum] Length = 265 Score = 310 bits (794), Expect = 2e-81 Identities = 146/210 (69%), Positives = 174/210 (82%), Gaps = 1/210 (0%) Frame = +3 Query: 1266 KSVKKPGK-EHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXX 1442 K V+K GK EHHLWK R+SAGSG KA+NLVRI+SGLPNEKE+VY ALD+WIAWE EFP Sbjct: 54 KKVQKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPLI 113 Query: 1443 
XXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMI 1622 QR W RVIQVAKWMLSKGQG TMATYDA+LLA+D + R++EA +LWNMI Sbjct: 114 AAAKALRILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMI 173 Query: 1623 LHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQ 1802 LHT TRS+SKRLFSRMISLYDHH +P+KI+EVFADMEELGVKPDEDTV+R ++AF+ LGQ Sbjct: 174 LHTSTRSVSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQ 233 Query: 1803 EDKRNMVIHRYGCKWKYIHFNGERVKVRRE 1892 ED + +V+ +Y +WKY+HFNGER +VRR+ Sbjct: 234 EDNQKLVLKKYQSRWKYVHFNGERARVRRD 263 >ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Cucumis sativus] Length = 270 Score = 310 bits (793), Expect = 2e-81 Identities = 150/208 (72%), Positives = 169/208 (81%), Gaps = 1/208 (0%) Frame = +3 Query: 1272 VKKPGKE-HHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXXXX 1448 VKK GKE HHLWK RDSAGSG KA+NLVRIVS PNEKEAVY L++WIAWE EFP Sbjct: 56 VKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAA 115 Query: 1449 XXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMILH 1628 + QW RVIQVAKWMLSKGQG TM TYD +LLA+D + R++EA SLWNMILH Sbjct: 116 AKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH 175 Query: 1629 THTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQED 1808 THTRSISKR+FSRMISLY+HHDL +KIIE+FADMEELGVKPDEDTV+R +AF+KLGQED Sbjct: 176 THTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQED 235 Query: 1809 KRNMVIHRYGCKWKYIHFNGERVKVRRE 1892 R MV RY C+WKYIHF GERV+VRR+ Sbjct: 236 NRKMVYKRYSCQWKYIHFKGERVRVRRD 263 >emb|CBI30774.3| unnamed protein product [Vitis vinifera] Length = 277 Score = 309 bits (791), Expect = 4e-81 Identities = 148/208 (71%), Positives = 171/208 (82%), Gaps = 1/208 (0%) Frame = +3 Query: 1275 KKPGK-EHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXXXXX 1451 KK GK EHHLW+ RDS GSG KA+NLVRIVS LPNEKEAVY ALD+W AWE EFP Sbjct: 65 KKVGKKEHHLWRKRDSIGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWETEFPLIAAA 124 Query: 1452 
XXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMILHT 1631 + QW RVIQVAKWMLSKGQG TM TYD +LLA+D + R++EA SLWNMILHT Sbjct: 125 KALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDWRVDEAESLWNMILHT 184 Query: 1632 HTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQEDK 1811 HTRSISK+LFSRMISLYDHHD+ +K+IEVFADMEELGVKPDEDTV+R + AF+ LGQEDK Sbjct: 185 HTRSISKQLFSRMISLYDHHDMRDKVIEVFADMEELGVKPDEDTVRRVACAFQTLGQEDK 244 Query: 1812 RNMVIHRYGCKWKYIHFNGERVKVRRES 1895 + +V+ +Y CKWKYIHFNGERV+VRR++ Sbjct: 245 QKLVLKKYQCKWKYIHFNGERVRVRRDA 272 >ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 300 Score = 306 bits (785), Expect = 2e-80 Identities = 146/208 (70%), Positives = 169/208 (81%), Gaps = 1/208 (0%) Frame = +3 Query: 1266 KSVKKPGK-EHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXX 1442 K +KK G+ EHHLWK +DSAGSG KA+NL+RIVS LPNEKEA++ ALD+W AWE EFP Sbjct: 81 KIIKKAGRNEHHLWKKKDSAGSGQKALNLIRIVSDLPNEKEAIFGALDKWTAWETEFPLI 140 Query: 1443 XXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMI 1622 QW RVIQVAKWMLSKGQG TMATYD +LLA+D + R++EA SLWNMI Sbjct: 141 AAAKALRILRRTCQWRRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRLDEAESLWNMI 200 Query: 1623 LHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQ 1802 LHTHTRSISKRLFSRMISLYDHH++ KIIEVFADMEEL V+PDEDTV+R ++AF++ GQ Sbjct: 201 LHTHTRSISKRLFSRMISLYDHHEMKTKIIEVFADMEELSVRPDEDTVRRVARAFQEFGQ 260 Query: 1803 EDKRNMVIHRYGCKWKYIHFNGERVKVR 1886 EDK +V+ RYGCKWKYIHF GERVKVR Sbjct: 261 EDKSKLVLRRYGCKWKYIHFKGERVKVR 288 >gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao] Length = 276 Score = 304 bits (778), Expect = 1e-79 Identities = 152/236 (64%), Positives = 184/236 (77%), Gaps = 1/236 (0%) Frame = +3 Query: 1185 VNSLQCNSKSIVQQCSGSIENTSQRYGKSVKKPGK-EHHLWKHRDSAGSGHKAMNLVRIV 1361 ++ ++C+ K + +Q G E + K VKK GK EHHLWK RDSAGSG KA+NLVRI+ Sbjct: 39 ISYVKCSQK-LGEQSLGISEAVEK---KPVKKVGKNEHHLWKKRDSAGSGQKALNLVRII 94 Query: 1362 
SGLPNEKEAVYLALDEWIAWELEFPXXXXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTT 1541 S LPNEKEAVY ALD+W AWE EFP + QW+RVIQVAKWMLSKGQG T Sbjct: 95 SQLPNEKEAVYGALDKWTAWETEFPLIAAAKALRILRKRSQWLRVIQVAKWMLSKGQGAT 154 Query: 1542 MATYDAVLLAYDQEGRIEEAGSLWNMILHTHTRSISKRLFSRMISLYDHHDLPEKIIEVF 1721 M TYD +LLA+D + R++EA SLWNMILH HTRSISKRLFSRMISLYDHH++ +KIIEVF Sbjct: 155 MGTYDTLLLAFDMDKRVDEAESLWNMILHIHTRSISKRLFSRMISLYDHHNMQDKIIEVF 214 Query: 1722 ADMEELGVKPDEDTVKRASQAFEKLGQEDKRNMVIHRYGCKWKYIHFNGERVKVRR 1889 ADMEEL V+PDE+TV++ ++AF+KLGQEDK+ +V+ RY KWKYIHFNGERV+V R Sbjct: 215 ADMEELCVRPDENTVRKVARAFQKLGQEDKQKLVLRRYLSKWKYIHFNGERVRVTR 270 >ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citrus clementina] gi|557552197|gb|ESR62826.1| hypothetical protein CICLE_v10016169mg [Citrus clementina] Length = 284 Score = 302 bits (773), Expect = 5e-79 Identities = 146/222 (65%), Positives = 175/222 (78%) Frame = +3 Query: 1230 SGSIENTSQRYGKSVKKPGKEHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDE 1409 + S N S++ VK KE HLW+ RDSAGSG KA+NLVRIVS LPNEK AVY ALD+ Sbjct: 55 ANSNANASKKNKLVVKVGKKEQHLWQKRDSAGSGQKALNLVRIVSELPNEKHAVYGALDK 114 Query: 1410 WIAWELEFPXXXXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGR 1589 W AWE EFP + QW+RVIQVAKWMLSKGQG TM TYD +LLA+D++ R Sbjct: 115 WTAWETEFPLIAAAKALRILRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHR 174 Query: 1590 IEEAGSLWNMILHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVK 1769 +EA SLWNMILHTHTRSISKRLFSRMISLYDHHD+P KIIEVFADMEELGV+PDEDTV+ Sbjct: 175 ADEAESLWNMILHTHTRSISKRLFSRMISLYDHHDMPNKIIEVFADMEELGVRPDEDTVR 234 Query: 1770 RASQAFEKLGQEDKRNMVIHRYGCKWKYIHFNGERVKVRRES 1895 R + AF+++GQ++K+ +V+ +Y KWKYIHF GERV+VRR++ Sbjct: 235 RIASAFQRVGQDEKQKLVLKKYLSKWKYIHFKGERVRVRRDA 276 >gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis] Length = 326 Score = 298 bits (762), Expect = 9e-78 Identities = 144/231 (62%), Positives = 178/231 (77%), Gaps = 1/231 (0%) Frame = +3 Query: 1203 NSKSIVQQCSGSIENTSQRYGKSVKKPGK-EHHLWKHRDSAGSGHKAMNLVRIVSGLPNE 1379 +SK+I + IE + VKK GK E+HLWK +DSAGSG 
KA+NL+RI+S LPNE Sbjct: 86 SSKAIEKLQRLCIEFLYMEFRNLVKKTGKKEYHLWKKKDSAGSGQKALNLIRILSVLPNE 145 Query: 1380 KEAVYLALDEWIAWELEFPXXXXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDA 1559 KE VY AL++WIAWE EFP + QW RVIQVAKWMLSKGQGTTM TYD Sbjct: 146 KEVVYGALNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGTTMGTYDT 205 Query: 1560 VLLAYDQEGRIEEAGSLWNMILHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEEL 1739 +LLA+D + R++EA S WNMILHTH RSISKRLFSRMI+LYDHHD+ +KIIEVFADMEEL Sbjct: 206 LLLAFDMDQRVDEAESFWNMILHTHKRSISKRLFSRMIALYDHHDVKDKIIEVFADMEEL 265 Query: 1740 GVKPDEDTVKRASQAFEKLGQEDKRNMVIHRYGCKWKYIHFNGERVKVRRE 1892 V+ DEDTV+R + AF+KLGQE+K+ +++ +Y CKWKY+HF GER++VRR+ Sbjct: 266 SVRLDEDTVRRVAYAFQKLGQEEKKKLLLRKYQCKWKYVHFKGERIRVRRD 316 >ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X5 [Cicer arietinum] Length = 287 Score = 297 bits (760), Expect = 2e-77 Identities = 146/246 (59%), Positives = 176/246 (71%), Gaps = 8/246 (3%) Frame = +3 Query: 1182 LVNSLQCNSKSIVQQCSGSIENTSQRYGKSVKKPGK--------EHHLWKHRDSAGSGHK 1337 L S+ + K+ C +S G+ V+K K EHHLWK R+SA SG K Sbjct: 38 LTTSITISRKTSCTSCRFVQSKSSPNVGRPVEKDKKGNKIKGKVEHHLWKRRNSAQSGQK 97 Query: 1338 AMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXXXXXXXXXXXXXQRQWVRVIQVAKWM 1517 A+ LVR + LPNEKE+VY ALD+W AWE EFP + QWVRVIQ+AKWM Sbjct: 98 ALTLVRTICELPNEKESVYGALDKWTAWETEFPLVAAAKALNILRKRGQWVRVIQLAKWM 157 Query: 1518 LSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMILHTHTRSISKRLFSRMISLYDHHDL 1697 LSKGQG TM TYD +LLA+D + RI+EA SLWNMI+H H RS+SKRLFSRMISLYDHH+L Sbjct: 158 LSKGQGATMGTYDTLLLAFDMDQRIDEAESLWNMIIHAHMRSVSKRLFSRMISLYDHHNL 217 Query: 1698 PEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQEDKRNMVIHRYGCKWKYIHFNGERV 1877 EKI+E+FADMEEL +KPDEDTV++ + AF KLGQE+KR VI RYG KWKYIHFNGERV Sbjct: 218 SEKIVEIFADMEELRIKPDEDTVRKVTNAFRKLGQEEKRKSVIKRYGLKWKYIHFNGERV 277 Query: 1878 KVRRES 1895 +VRR++ Sbjct: 278 RVRRQA 283 >gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus vulgaris] Length = 289 Score = 296 bits (759), Expect = 2e-77 Identities = 141/203 (69%), 
Positives = 165/203 (81%) Frame = +3 Query: 1287 KEHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXXXXXXXXXX 1466 KEHHLWK RDSA SG KA+ LVRIVS LPNEKEAVY ALD+WIAWE EFP Sbjct: 79 KEHHLWKSRDSAQSGQKALTLVRIVSKLPNEKEAVYGALDKWIAWETEFPVIAAAKALKI 138 Query: 1467 XXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMILHTHTRSI 1646 + WVRVIQVAKWMLSKGQG TM T+D +LLA+D + R++EA SLWNMI+HTH RS+ Sbjct: 139 LRKRGHWVRVIQVAKWMLSKGQGATMGTFDTLLLAFDMDQRVDEAESLWNMIIHTHMRSV 198 Query: 1647 SKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQEDKRNMVI 1826 SKRLFSRMIS+YD+HD+P+KIIEVFADMEEL VKPDEDTV+R ++AF +LG+E+KR +V Sbjct: 199 SKRLFSRMISIYDNHDMPDKIIEVFADMEELRVKPDEDTVRRVARAFTELGEEEKRKLVA 258 Query: 1827 HRYGCKWKYIHFNGERVKVRRES 1895 RYG KWKYIHFN ERV+VR E+ Sbjct: 259 RRYGIKWKYIHFNRERVRVRTEA 281 >ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X1 [Glycine max] gi|571517206|ref|XP_006597502.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X2 [Glycine max] Length = 288 Score = 295 bits (754), Expect = 8e-77 Identities = 157/287 (54%), Positives = 197/287 (68%), Gaps = 6/287 (2%) Frame = +3 Query: 1053 MMATSCKSANLFGAPYFSLTQTMVDFPSRISLARMVLGKPAFLLVNSLQCNSKSIVQQCS 1232 MM +C+S+ + + Q D P+R A ++ K + + V +L K+ QC+ Sbjct: 1 MMPITCESSIVSPLLLSRVCQAKTD-PTR---ALLLGNKFSTMAVTALP---KTSCIQCT 53 Query: 1233 GSIENTSQRYGKSVKKPGK------EHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVY 1394 S + G ++K GK EHHLWK RDSA SG KA+ LVR V LPNEKEAVY Sbjct: 54 IVRSKFSHKSGGPMEKKGKKTTGKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVY 113 Query: 1395 LALDEWIAWELEFPXXXXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAY 1574 ALD+W AWE EFP + WVRVIQVAKWMLSKGQG TM TYD +LLA+ Sbjct: 114 GALDKWTAWETEFPVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAF 173 Query: 1575 DQEGRIEEAGSLWNMILHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPD 1754 D + R++EA SLWNMI+H H RS+SKRLFSRMISLYDHH++P+KII+VFADMEEL +KPD Sbjct: 174 DMDKRVDEAESLWNMIIHAHMRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPD 233 Query: 
1755 EDTVKRASQAFEKLGQEDKRNMVIHRYGCKWKYIHFNGERVKVRRES 1895 EDTV+R ++AF +LG E+KR +VI +YG KWKYIHFNGERV+VR E+ Sbjct: 234 EDTVRRVARAFRELGDEEKRKLVIKQYGLKWKYIHFNGERVRVRTEA 280 >gb|ACU23441.1| unknown [Glycine max] Length = 288 Score = 295 bits (754), Expect = 8e-77 Identities = 157/287 (54%), Positives = 197/287 (68%), Gaps = 6/287 (2%) Frame = +3 Query: 1053 MMATSCKSANLFGAPYFSLTQTMVDFPSRISLARMVLGKPAFLLVNSLQCNSKSIVQQCS 1232 MM +C+S+ + + Q D P+R A ++ K + + V +L K+ QC+ Sbjct: 1 MMPITCESSIVSPLLLSRVCQAKTD-PTR---ALLLGNKFSTMAVTALP---KTSCIQCT 53 Query: 1233 GSIENTSQRYGKSVKKPGK------EHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVY 1394 S + G ++K GK EHHLWK RDSA SG KA+ LVR V LPNEKEAVY Sbjct: 54 IVRSKFSHKSGGPMEKKGKKTTGKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVY 113 Query: 1395 LALDEWIAWELEFPXXXXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAY 1574 ALD+W AWE EFP + WVRVIQVAKWMLSKGQG TM TYD +LLA+ Sbjct: 114 GALDKWTAWETEFPVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAF 173 Query: 1575 DQEGRIEEAGSLWNMILHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPD 1754 D + R++EA SLWNMI+H H RS+SKRLFSRMISLYDHH++P+KII+VFADMEEL +KPD Sbjct: 174 DMDKRVDEAESLWNMIIHAHLRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPD 233 Query: 1755 EDTVKRASQAFEKLGQEDKRNMVIHRYGCKWKYIHFNGERVKVRRES 1895 EDTV+R ++AF +LG E+KR +VI +YG KWKYIHFNGERV+VR E+ Sbjct: 234 EDTVRRVARAFRELGDEEKRKLVIKQYGLKWKYIHFNGERVRVRTEA 280 >ref|XP_002526919.1| conserved hypothetical protein [Ricinus communis] gi|223533738|gb|EEF35472.1| conserved hypothetical protein [Ricinus communis] Length = 224 Score = 291 bits (746), Expect = 7e-76 Identities = 139/204 (68%), Positives = 164/204 (80%), Gaps = 1/204 (0%) Frame = +3 Query: 1266 KSVKKPGKE-HHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXX 1442 K VKK GKE HHLWK RDSA SG KA++LVRIV LP+EKE VY ALD+W AWE EFP Sbjct: 11 KPVKKAGKEEHHLWKKRDSARSGEKALSLVRIVCELPDEKECVYGALDKWTAWETEFPLI 70 Query: 1443 XXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMI 1622 QW+RVIQVAKWMLSKGQGTTM TYD +LLA+D + R++EA SLWNMI Sbjct: 71 
AVAKGLRILRKHNQWLRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDNRVDEAASLWNMI 130 Query: 1623 LHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQ 1802 LHTH RSISKRLFSRMISLYDHH++P+ IIE+FADMEELGV+PDEDTV+R ++AF++LGQ Sbjct: 131 LHTHVRSISKRLFSRMISLYDHHNMPDGIIEIFADMEELGVRPDEDTVRRVARAFKELGQ 190 Query: 1803 EDKRNMVIHRYGCKWKYIHFNGER 1874 E+K+ +V+ RY +WKYIHF GER Sbjct: 191 EEKQKLVLKRYMSRWKYIHFKGER 214 >ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Citrus sinensis] Length = 281 Score = 291 bits (745), Expect = 9e-76 Identities = 144/222 (64%), Positives = 171/222 (77%) Frame = +3 Query: 1230 SGSIENTSQRYGKSVKKPGKEHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDE 1409 S S N S++ VK KE HLW+ RDSAGSG KA+NLV S LPNEK AVY ALD+ Sbjct: 55 SNSNANASKKNKLVVKVGKKEQHLWQKRDSAGSGQKALNLV---SELPNEKHAVYGALDK 111 Query: 1410 WIAWELEFPXXXXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGR 1589 W AWE EFP + QW+RVIQVAKWMLSKGQG TM TYD +LLA+D++ R Sbjct: 112 WTAWETEFPLIAAAKALRILRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHR 171 Query: 1590 IEEAGSLWNMILHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVK 1769 +EA SLWNMILHT TRSISKRLFSRMISLYDHHD+P KIIEVFADMEELGV+PDEDTV+ Sbjct: 172 ADEAESLWNMILHTQTRSISKRLFSRMISLYDHHDMPNKIIEVFADMEELGVRPDEDTVR 231 Query: 1770 RASQAFEKLGQEDKRNMVIHRYGCKWKYIHFNGERVKVRRES 1895 R + AF+++GQ+DK+ +V+ +Y KWKYIHF GERV+VRR++ Sbjct: 232 RIASAFQRVGQDDKQKLVLKKYLSKWKYIHFKGERVRVRRDA 273 >ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda] gi|548851451|gb|ERN09727.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda] Length = 287 Score = 286 bits (731), Expect = 4e-74 Identities = 129/200 (64%), Positives = 166/200 (83%) Frame = +3 Query: 1287 KEHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXXXXXXXXXX 1466 KEHHLW RDSAGS KA+NLVRIVS + NEKEA+Y+ALDEW AWE EFP Sbjct: 81 KEHHLWMKRDSAGSSQKALNLVRIVSRVSNEKEAIYVALDEWAAWETEFPVIAAAKALGI 140 Query: 1467 XXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMILHTHTRSI 1646 
+R+W+RVIQV+KW+LSKGQ TM TYD +LLA+D +GR++EA ++WNMILHT+TRSI Sbjct: 141 LRKRRRWLRVIQVSKWLLSKGQVLTMGTYDTLLLAFDMDGRVDEAETIWNMILHTYTRSI 200 Query: 1647 SKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQEDKRNMVI 1826 SKRLFSRM+SLYDHH +P+K++EVFADMEELGVKPD+D+V+R ++AF++LG+E+K+ V+ Sbjct: 201 SKRLFSRMMSLYDHHHIPDKLLEVFADMEELGVKPDQDSVRRVARAFQQLGEEEKQKQVL 260 Query: 1827 HRYGCKWKYIHFNGERVKVR 1886 +YG K KYIHFNGERV+++ Sbjct: 261 QKYGLKLKYIHFNGERVRIK 280 >ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata] gi|297313803|gb|EFH44226.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata] Length = 284 Score = 283 bits (724), Expect = 2e-73 Identities = 139/260 (53%), Positives = 187/260 (71%) Frame = +3 Query: 1116 TMVDFPSRISLARMVLGKPAFLLVNSLQCNSKSIVQQCSGSIENTSQRYGKSVKKPGKEH 1295 ++++FPS S +++ + F C ++ +G ++ + + K KEH Sbjct: 28 SLLEFPSCGSYSKLKTKRFGF-------CIRSKFSEKQAGKLDVATVNSNEIKKVGKKEH 80 Query: 1296 HLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXXXXXXXXXXXXX 1475 HLWK DSAGSG KA+NLVR++SGLPNEKEAVY AL++W+AWE+EFP Sbjct: 81 HLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILRK 140 Query: 1476 QRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMILHTHTRSISKR 1655 + QW RVIQ+AKWMLSKGQG TM TYD +LLA+D + R +EA SLWNMILHTHTRSI +R Sbjct: 141 RSQWHRVIQLAKWMLSKGQGATMGTYDTLLLAFDMDQRADEAESLWNMILHTHTRSIPRR 200 Query: 1656 LFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQEDKRNMVIHRY 1835 LF+RMI+LY H+DL +K+IEVFADMEEL V+PDEDT +R ++AF +LGQE+ R +++ RY Sbjct: 201 LFARMIALYAHYDLHDKVIEVFADMEELKVRPDEDTARRVARAFRELGQEENRKLILRRY 260 Query: 1836 GCKWKYIHFNGERVKVRRES 1895 ++KYI+FNGERV+V+R S Sbjct: 261 LSEFKYIYFNGERVRVKRYS 280 >ref|NP_001031667.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658716|gb|AEE84116.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 260 Score = 280 bits (717), Expect = 2e-72 Identities = 133/209 (63%), Positives = 166/209 (79%), Gaps = 1/209 (0%) Frame = +3 Query: 1266 
KSVKKPGK-EHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXX 1442 K +KK GK EHHLWK DSAGSG KA+NLVR++SGLPNEKEAVY AL++W+AWE+EFP Sbjct: 46 KEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPII 105 Query: 1443 XXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMI 1622 + QW RVIQ+AKWMLSKGQG TM TYD +LLA+D + R +EA SLWNMI Sbjct: 106 AAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMI 165 Query: 1623 LHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQ 1802 LHTHTRSI +RLF+RMI+LY HHDL +K+IEVFADMEEL V PDED+ +R ++AF +L Q Sbjct: 166 LHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQ 225 Query: 1803 EDKRNMVIHRYGCKWKYIHFNGERVKVRR 1889 E+ R +++ RY ++KYI+FNGERV+V+R Sbjct: 226 EENRKLILRRYLSEYKYIYFNGERVRVKR 254 >ref|NP_567571.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|186512032|ref|NP_001119009.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|334186688|ref|NP_001190768.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635632|sp|Q2V3H0.2|PP322_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g18975, chloroplastic; Flags: Precursor gi|332658715|gb|AEE84115.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658717|gb|AEE84117.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658718|gb|AEE84118.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 287 Score = 280 bits (717), Expect = 2e-72 Identities = 133/209 (63%), Positives = 166/209 (79%), Gaps = 1/209 (0%) Frame = +3 Query: 1266 KSVKKPGK-EHHLWKHRDSAGSGHKAMNLVRIVSGLPNEKEAVYLALDEWIAWELEFPXX 1442 K +KK GK EHHLWK DSAGSG KA+NLVR++SGLPNEKEAVY AL++W+AWE+EFP Sbjct: 73 KEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPII 132 Query: 1443 XXXXXXXXXXXQRQWVRVIQVAKWMLSKGQGTTMATYDAVLLAYDQEGRIEEAGSLWNMI 1622 + QW RVIQ+AKWMLSKGQG TM TYD +LLA+D + R +EA SLWNMI Sbjct: 133 AAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMI 192 
Query: 1623 LHTHTRSISKRLFSRMISLYDHHDLPEKIIEVFADMEELGVKPDEDTVKRASQAFEKLGQ 1802 LHTHTRSI +RLF+RMI+LY HHDL +K+IEVFADMEEL V PDED+ +R ++AF +L Q Sbjct: 193 LHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQ 252 Query: 1803 EDKRNMVIHRYGCKWKYIHFNGERVKVRR 1889 E+ R +++ RY ++KYI+FNGERV+V+R Sbjct: 253 EENRKLILRRYLSEYKYIYFNGERVRVKR 281