BLASTX nr result
ID: Catharanthus23_contig00018435
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00018435 (1429 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI30774.3| unnamed protein product [Vitis vinifera] 332 3e-88 ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containi... 324 5e-86 ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citr... 324 6e-86 gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [The... 323 8e-86 ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containi... 323 8e-86 gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus pe... 322 2e-85 ref|XP_002316747.1| predicted protein [Populus trichocarpa] 317 1e-83 ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containi... 313 9e-83 gb|ACU23441.1| unknown [Glycine max] 313 9e-83 ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containi... 313 1e-82 gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus... 312 2e-82 ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containi... 310 7e-82 ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containi... 310 7e-82 ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containi... 309 2e-81 ref|XP_002526919.1| conserved hypothetical protein [Ricinus comm... 308 3e-81 gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis] 303 1e-79 ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arab... 297 8e-78 ref|NP_001031667.1| pentatricopeptide repeat-containing protein ... 293 9e-77 ref|NP_567571.1| pentatricopeptide repeat-containing protein [Ar... 293 9e-77 ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [A... 285 3e-74 >emb|CBI30774.3| unnamed protein product [Vitis vinifera] Length = 277 Score = 332 bits (850), Expect = 3e-88 Identities = 165/221 (74%), Positives = 182/221 (82%), Gaps = 1/221 (0%) Frame = -2 Query: 1071 YNEKSKKTTQKA-RKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWE 895 Y K+ ++K +KEHHLW+KR+S SGQKALNLVRI+S LPNEKEAVYGALDKW AWE Sbjct: 56 YRAVEKEISKKVGKKEHHLWRKRDSIGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWE 115 Query: 894 AEFPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAE 715 EFP RNQW R+IQVAKWMLSKGQGATM TYD+LLLAFDMD RVDEAE Sbjct: 116 TEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDWRVDEAE 175 Query: 714 MLWNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARA 535 LWNMILHTHTRSISKQLFSRMISLYDHH M +K+I+VFADMEE+GVKPDEDTVR++A A Sbjct: 176 SLWNMILHTHTRSISKQLFSRMISLYDHHDMRDKVIEVFADMEELGVKPDEDTVRRVACA 235 Query: 534 FQTLGQADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDEWE 412 FQTLGQ DK+KLVL KY KWKYIHFNGERVRVRR D W+ Sbjct: 236 FQTLGQEDKQKLVLKKYQCKWKYIHFNGERVRVRR--DAWD 274 >ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X1 [Solanum tuberosum] gi|565378234|ref|XP_006355564.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X2 [Solanum tuberosum] gi|565378236|ref|XP_006355565.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X3 [Solanum tuberosum] Length = 265 Score = 324 bits (831), Expect = 5e-86 Identities = 159/210 (75%), Positives = 177/210 (84%), Gaps = 1/210 (0%) Frame = -2 Query: 1056 KKTTQKARK-EHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEFPX 880 +K QKA K EHHLW+KRES SGQKALNLVRIIS LPNEKE+VYGALDKWIAWEAEFP Sbjct: 53 QKKVQKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWEAEFPL 112 Query: 879 XXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLWNM 700 + W R+IQVAKWMLSKGQGATMATYD+LLLAFDMD RVDEAE LWNM Sbjct: 113 IAAAKALRILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNM 172 Query: 699 ILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQTLG 520 ILHT TRS+SK+LFSRMISLYDHH +P+KI++VFADMEE+GVKPDEDTV ++ARAFQ LG Sbjct: 173 ILHTSTRSVSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVGRVARAFQMLG 232 Query: 519 QADKRKLVLNKYLSKWKYIHFNGERVRVRR 430 Q DK+KLVL KY S+WKY+HFNGER RVRR Sbjct: 233 QEDKQKLVLKKYQSRWKYVHFNGERARVRR 262 >ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citrus clementina] gi|557552197|gb|ESR62826.1| hypothetical protein CICLE_v10016169mg [Citrus clementina] Length = 284 Score = 324 bits (830), Expect = 6e-86 Identities = 159/218 (72%), Positives = 177/218 (81%) Frame = -2 Query: 1068 NEKSKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAE 889 ++K+K + +KE HLWQKR+S SGQKALNLVRI+S LPNEK AVYGALDKW AWE E Sbjct: 62 SKKNKLVVKVGKKEQHLWQKRDSAGSGQKALNLVRIVSELPNEKHAVYGALDKWTAWETE 121 Query: 888 FPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEML 709 FP R QW+R+IQVAKWMLSKGQGATM TYD+LLLAFD D R DEAE L Sbjct: 122 FPLIAAAKALRILRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESL 181 Query: 708 WNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQ 529 WNMILHTHTRSISK+LFSRMISLYDHH MP KII+VFADMEE+GV+PDEDTVR+IA AFQ Sbjct: 182 WNMILHTHTRSISKRLFSRMISLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQ 241 Query: 528 TLGQADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDEW 415 +GQ +K+KLVL KYLSKWKYIHF GERVRVRR D W Sbjct: 242 RVGQDEKQKLVLKKYLSKWKYIHFKGERVRVRR--DAW 277 >gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao] Length = 276 Score = 323 bits (829), Expect = 8e-86 Identities = 161/217 (74%), Positives = 180/217 (82%), Gaps = 1/217 (0%) Frame = -2 Query: 1077 GLYNEKSKKTTQKARK-EHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIA 901 G+ KK +K K EHHLW+KR+S SGQKALNLVRIIS LPNEKEAVYGALDKW A Sbjct: 54 GISEAVEKKPVKKVGKNEHHLWKKRDSAGSGQKALNLVRIISQLPNEKEAVYGALDKWTA 113 Query: 900 WEAEFPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDE 721 WE EFP R+QW+R+IQVAKWMLSKGQGATM TYD+LLLAFDMD+RVDE Sbjct: 114 WETEFPLIAAAKALRILRKRSQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDE 173 Query: 720 AEMLWNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIA 541 AE LWNMILH HTRSISK+LFSRMISLYDHH+M +KII+VFADMEE+ V+PDE+TVRK+A Sbjct: 174 AESLWNMILHIHTRSISKRLFSRMISLYDHHNMQDKIIEVFADMEELCVRPDENTVRKVA 233 Query: 540 RAFQTLGQADKRKLVLNKYLSKWKYIHFNGERVRVRR 430 RAFQ LGQ DK+KLVL +YLSKWKYIHFNGERVRV R Sbjct: 234 RAFQKLGQEDKQKLVLRRYLSKWKYIHFNGERVRVTR 270 >ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Solanum lycopersicum] Length = 265 Score = 323 bits (829), Expect = 8e-86 Identities = 158/210 (75%), Positives = 176/210 (83%), Gaps = 1/210 (0%) Frame = -2 Query: 1056 KKTTQKARK-EHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEFPX 880 +K QKA K EHHLW+KRES SGQKALNLVRIIS LPNEKE+VYGALDKWIAWE EFP Sbjct: 53 QKKVQKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPL 112 Query: 879 XXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLWNM 700 + W R+IQVAKWMLSKGQGATMATYD+LLLAFDMD RVDEAE LWNM Sbjct: 113 IAAAKALRILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNM 172 Query: 699 ILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQTLG 520 ILHT TRS+SK+LFSRMISLYDHH +P+KI++VFADMEE+GVKPDEDTVR++ARAFQ LG Sbjct: 173 ILHTSTRSVSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVRRVARAFQMLG 232 Query: 519 QADKRKLVLNKYLSKWKYIHFNGERVRVRR 430 Q D +KLVL KY S+WKY+HFNGER RVRR Sbjct: 233 QEDNQKLVLKKYQSRWKYVHFNGERARVRR 262 >gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus persica] Length = 224 Score = 322 bits (825), Expect = 2e-85 Identities = 159/218 (72%), Positives = 178/218 (81%), Gaps = 1/218 (0%) Frame = -2 Query: 1062 KSKKTTQKA-RKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEF 886 K +KT +K RKEHHLWQKR+S SGQKALNLVRI+S LPNEKE VYGALDKW AWE EF Sbjct: 2 KCRKTIKKVGRKEHHLWQKRDSAGSGQKALNLVRIVSGLPNEKETVYGALDKWTAWETEF 61 Query: 885 PXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLW 706 P R+QWVR+IQVAKWMLSKGQGATM TYD+LLLAFDMDQRVDEAE LW Sbjct: 62 PLIAAVKALRILRKRSQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLW 121 Query: 705 NMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQT 526 NMILHTHTRSISK+LFSRMISLYDHH KII+VFADMEE+GVKPDEDTVR++ARAF+ Sbjct: 122 NMILHTHTRSISKRLFSRMISLYDHHDKQNKIIEVFADMEELGVKPDEDTVRRVARAFKE 181 Query: 525 LGQADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDEWE 412 LGQ + + LVL +Y KWKYIHF GERV+VR T+ W+ Sbjct: 182 LGQEENKTLVLRRYQCKWKYIHFKGERVKVR--TNAWD 217 >ref|XP_002316747.1| predicted protein [Populus trichocarpa] Length = 272 Score = 317 bits (811), Expect = 1e-83 Identities = 156/237 (65%), Positives = 182/237 (76%) Frame = -2 Query: 1125 VSCPNDMTEIISSTDKGLYNEKSKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALP 946 V C ++ S D E K + +KEHHLWQKR+S SGQKALNLVRI+S LP Sbjct: 40 VKCSKKQLKLNSRAD-----ENRKVVKKSGKKEHHLWQKRDSAGSGQKALNLVRIVSELP 94 Query: 945 NEKEAVYGALDKWIAWEAEFPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATY 766 NEKEAVYGALDKW AWE EFP R QW R+IQVAKWMLSKGQGAT+ TY Sbjct: 95 NEKEAVYGALDKWTAWETEFPLIAAAKALKILQQRRQWTRVIQVAKWMLSKGQGATLGTY 154 Query: 765 DSLLLAFDMDQRVDEAEMLWNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADME 586 D+LLLAFD D RVDEA+ LWNMI+H HTRS+SK+LFSRMISLYDHH+M ++II+VFADME Sbjct: 155 DTLLLAFDKDDRVDEAKSLWNMIIHVHTRSMSKRLFSRMISLYDHHNMQDEIIEVFADME 214 Query: 585 EVGVKPDEDTVRKIARAFQTLGQADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDEW 415 E+GV+PDEDTV ++ARAF+ LGQ +KR+LVL +YL KWKYIHFNGERVRV+R D W Sbjct: 215 ELGVRPDEDTVWRVARAFKKLGQEEKRELVLERYLCKWKYIHFNGERVRVKR--DGW 269 >ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X1 [Glycine max] gi|571517206|ref|XP_006597502.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X2 [Glycine max] Length = 288 Score = 313 bits (803), Expect = 9e-83 Identities = 155/222 (69%), Positives = 177/222 (79%) Frame = -2 Query: 1077 GLYNEKSKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAW 898 G +K KKTT K KEHHLW+ R+S +SGQKAL LVR + LPNEKEAVYGALDKW AW Sbjct: 65 GPMEKKGKKTTGK--KEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAW 122 Query: 897 EAEFPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEA 718 E EFP R WVR+IQVAKWMLSKGQGATM TYD+LLLAFDMD+RVDEA Sbjct: 123 ETEFPVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEA 182 Query: 717 EMLWNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIAR 538 E LWNMI+H H RS+SK+LFSRMISLYDHH+MP+KII VFADMEE+ +KPDEDTVR++AR Sbjct: 183 ESLWNMIIHAHMRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVAR 242 Query: 537 AFQTLGQADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDEWE 412 AF+ LG +KRKLV+ +Y KWKYIHFNGERVRVR T+ WE Sbjct: 243 AFRELGDEEKRKLVIKQYGLKWKYIHFNGERVRVR--TEAWE 282 >gb|ACU23441.1| unknown [Glycine max] Length = 288 Score = 313 bits (803), Expect = 9e-83 Identities = 155/222 (69%), Positives = 177/222 (79%) Frame = -2 Query: 1077 GLYNEKSKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAW 898 G +K KKTT K KEHHLW+ R+S +SGQKAL LVR + LPNEKEAVYGALDKW AW Sbjct: 65 GPMEKKGKKTTGK--KEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAW 122 Query: 897 EAEFPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEA 718 E EFP R WVR+IQVAKWMLSKGQGATM TYD+LLLAFDMD+RVDEA Sbjct: 123 ETEFPVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEA 182 Query: 717 EMLWNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIAR 538 E LWNMI+H H RS+SK+LFSRMISLYDHH+MP+KII VFADMEE+ +KPDEDTVR++AR Sbjct: 183 ESLWNMIIHAHLRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVAR 242 Query: 537 AFQTLGQADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDEWE 412 AF+ LG +KRKLV+ +Y KWKYIHFNGERVRVR T+ WE Sbjct: 243 AFRELGDEEKRKLVIKQYGLKWKYIHFNGERVRVR--TEAWE 282 >ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Citrus sinensis] Length = 281 Score = 313 bits (802), Expect = 1e-82 Identities = 164/259 (63%), Positives = 188/259 (72%) Frame = -2 Query: 1191 VQKAKKFIIFSEKARGIIPMKKVSCPNDMTEIISSTDKGLYNEKSKKTTQKARKEHHLWQ 1012 +Q A F + + K P K + +S+++ ++K+K + +KE HLWQ Sbjct: 22 LQTASGFSLLTTKLATSNPHLKCFLNQNKQPPVSNSNANA-SKKNKLVVKVGKKEQHLWQ 80 Query: 1011 KRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEFPXXXXXXXXXXXXXRNQW 832 KR+S SGQKALNLV S LPNEK AVYGALDKW AWE EFP R QW Sbjct: 81 KRDSAGSGQKALNLV---SELPNEKHAVYGALDKWTAWETEFPLIAAAKALRILRKRGQW 137 Query: 831 VRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLWNMILHTHTRSISKQLFSR 652 +R+IQVAKWMLSKGQGATM TYD+LLLAFD D R DEAE LWNMILHT TRSISK+LFSR Sbjct: 138 LRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMILHTQTRSISKRLFSR 197 Query: 651 MISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQTLGQADKRKLVLNKYLSKW 472 MISLYDHH MP KII+VFADMEE+GV+PDEDTVR+IA AFQ +GQ DK+KLVL KYLSKW Sbjct: 198 MISLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQDDKQKLVLKKYLSKW 257 Query: 471 KYIHFNGERVRVRRTTDEW 415 KYIHF GERVRVRR D W Sbjct: 258 KYIHFKGERVRVRR--DAW 274 >gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus vulgaris] Length = 289 Score = 312 bits (800), Expect = 2e-82 Identities = 153/211 (72%), Positives = 174/211 (82%) Frame = -2 Query: 1065 EKSKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEF 886 +K KKTT K KEHHLW+ R+S +SGQKAL LVRI+S LPNEKEAVYGALDKWIAWE EF Sbjct: 70 KKGKKTTGK--KEHHLWKSRDSAQSGQKALTLVRIVSKLPNEKEAVYGALDKWIAWETEF 127 Query: 885 PXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLW 706 P R WVR+IQVAKWMLSKGQGATM T+D+LLLAFDMDQRVDEAE LW Sbjct: 128 PVIAAAKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTFDTLLLAFDMDQRVDEAESLW 187 Query: 705 NMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQT 526 NMI+HTH RS+SK+LFSRMIS+YD+H MP+KII+VFADMEE+ VKPDEDTVR++ARAF Sbjct: 188 NMIIHTHMRSVSKRLFSRMISIYDNHDMPDKIIEVFADMEELRVKPDEDTVRRVARAFTE 247 Query: 525 LGQADKRKLVLNKYLSKWKYIHFNGERVRVR 433 LG+ +KRKLV +Y KWKYIHFN ERVRVR Sbjct: 248 LGEEEKRKLVARRYGIKWKYIHFNRERVRVR 278 >ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 300 Score = 310 bits (795), Expect = 7e-82 Identities = 153/215 (71%), Positives = 173/215 (80%), Gaps = 1/215 (0%) Frame = -2 Query: 1056 KKTTQKA-RKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEFPX 880 KK +KA R EHHLW+K++S SGQKALNL+RI+S LPNEKEA++GALDKW AWE EFP Sbjct: 80 KKIIKKAGRNEHHLWKKKDSAGSGQKALNLIRIVSDLPNEKEAIFGALDKWTAWETEFPL 139 Query: 879 XXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLWNM 700 QW R+IQVAKWMLSKGQGATMATYD+LLLAFDMD R+DEAE LWNM Sbjct: 140 IAAAKALRILRRTCQWRRVIQVAKWMLSKGQGATMATYDTLLLAFDMDNRLDEAESLWNM 199 Query: 699 ILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQTLG 520 ILHTHTRSISK+LFSRMISLYDHH M KII+VFADMEE+ V+PDEDTVR++ARAFQ G Sbjct: 200 ILHTHTRSISKRLFSRMISLYDHHEMKTKIIEVFADMEELSVRPDEDTVRRVARAFQEFG 259 Query: 519 QADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDEW 415 Q DK KLVL +Y KWKYIHF GERV+VR T+ W Sbjct: 260 QEDKSKLVLRRYGCKWKYIHFKGERVKVR--TNAW 292 >ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Cucumis sativus] Length = 270 Score = 310 bits (795), Expect = 7e-82 Identities = 158/266 (59%), Positives = 189/266 (71%), Gaps = 1/266 (0%) Frame = -2 Query: 1206 SHGFAVQKAKKFIIFSEKARGIIPMKKVSCPNDMTEIISSTDKGLYNEKSKKTTQKARKE 1027 S GF K I+ P + N + ++S + ++ +K KE Sbjct: 8 STGFDALMPKIDCIYYHNKFTFTPSSVICVHNQAAQPLTS-----FTTPERRVVKKVGKE 62 Query: 1026 -HHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEFPXXXXXXXXXXX 850 HHLW+KR+S SGQKALNLVRI+S PNEKEAVYG L+KWIAWE EFP Sbjct: 63 THHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRIL 122 Query: 849 XXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLWNMILHTHTRSIS 670 R+QW R+IQVAKWMLSKGQGATM TYD+LLLAFDMD+RVDEAE LWNMILHTHTRSIS Sbjct: 123 RKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSIS 182 Query: 669 KQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQTLGQADKRKLVLN 490 K++FSRMISLY+HH + +KII++FADMEE+GVKPDEDTVR++ RAFQ LGQ D RK+V Sbjct: 183 KRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMVYK 242 Query: 489 KYLSKWKYIHFNGERVRVRRTTDEWE 412 +Y +WKYIHF GERVRVRR D W+ Sbjct: 243 RYSCQWKYIHFKGERVRVRR--DGWD 266 >ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X5 [Cicer arietinum] Length = 287 Score = 309 bits (791), Expect = 2e-81 Identities = 154/242 (63%), Positives = 183/242 (75%), Gaps = 2/242 (0%) Frame = -2 Query: 1131 KKVSCPN-DMTEIISSTDKGLYNEKSKKTTQ-KARKEHHLWQKRESTKSGQKALNLVRII 958 +K SC + + SS + G EK KK + K + EHHLW++R S +SGQKAL LVR I Sbjct: 46 RKTSCTSCRFVQSKSSPNVGRPVEKDKKGNKIKGKVEHHLWKRRNSAQSGQKALTLVRTI 105 Query: 957 SALPNEKEAVYGALDKWIAWEAEFPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGAT 778 LPNEKE+VYGALDKW AWE EFP R QWVR+IQ+AKWMLSKGQGAT Sbjct: 106 CELPNEKESVYGALDKWTAWETEFPLVAAAKALNILRKRGQWVRVIQLAKWMLSKGQGAT 165 Query: 777 MATYDSLLLAFDMDQRVDEAEMLWNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVF 598 M TYD+LLLAFDMDQR+DEAE LWNMI+H H RS+SK+LFSRMISLYDHH++ EKI+++F Sbjct: 166 MGTYDTLLLAFDMDQRIDEAESLWNMIIHAHMRSVSKRLFSRMISLYDHHNLSEKIVEIF 225 Query: 597 ADMEEVGVKPDEDTVRKIARAFQTLGQADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDE 418 ADMEE+ +KPDEDTVRK+ AF+ LGQ +KRK V+ +Y KWKYIHFNGERVRVRR Sbjct: 226 ADMEELRIKPDEDTVRKVTNAFRKLGQEEKRKSVIKRYGLKWKYIHFNGERVRVRR--QA 283 Query: 417 WE 412 WE Sbjct: 284 WE 285 >ref|XP_002526919.1| conserved hypothetical protein [Ricinus communis] gi|223533738|gb|EEF35472.1| conserved hypothetical protein [Ricinus communis] Length = 224 Score = 308 bits (790), Expect = 3e-81 Identities = 145/209 (69%), Positives = 174/209 (83%), Gaps = 1/209 (0%) Frame = -2 Query: 1068 NEKSKKTTQKARKE-HHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEA 892 ++KS+K +KA KE HHLW+KR+S +SG+KAL+LVRI+ LP+EKE VYGALDKW AWE Sbjct: 6 DDKSRKPVKKAGKEEHHLWKKRDSARSGEKALSLVRIVCELPDEKECVYGALDKWTAWET 65 Query: 891 EFPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEM 712 EFP NQW+R+IQVAKWMLSKGQG TM TYD+LLLAFDMD RVDEA Sbjct: 66 EFPLIAVAKGLRILRKHNQWLRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDNRVDEAAS 125 Query: 711 LWNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAF 532 LWNMILHTH RSISK+LFSRMISLYDHH+MP+ II++FADMEE+GV+PDEDTVR++ARAF Sbjct: 126 LWNMILHTHVRSISKRLFSRMISLYDHHNMPDGIIEIFADMEELGVRPDEDTVRRVARAF 185 Query: 531 QTLGQADKRKLVLNKYLSKWKYIHFNGER 445 + LGQ +K+KLVL +Y+S+WKYIHF GER Sbjct: 186 KELGQEEKQKLVLKRYMSRWKYIHFKGER 214 >gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis] Length = 326 Score = 303 bits (776), Expect = 1e-79 Identities = 145/215 (67%), Positives = 171/215 (79%) Frame = -2 Query: 1074 LYNEKSKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWE 895 LY E + +KE+HLW+K++S SGQKALNL+RI+S LPNEKE VYGAL+KWIAWE Sbjct: 101 LYMEFRNLVKKTGKKEYHLWKKKDSAGSGQKALNLIRILSVLPNEKEVVYGALNKWIAWE 160 Query: 894 AEFPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAE 715 EFP R+QW R+IQVAKWMLSKGQG TM TYD+LLLAFDMDQRVDEAE Sbjct: 161 TEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDQRVDEAE 220 Query: 714 MLWNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARA 535 WNMILHTH RSISK+LFSRMI+LYDHH + +KII+VFADMEE+ V+ DEDTVR++A A Sbjct: 221 SFWNMILHTHKRSISKRLFSRMIALYDHHDVKDKIIEVFADMEELSVRLDEDTVRRVAYA 280 Query: 534 FQTLGQADKRKLVLNKYLSKWKYIHFNGERVRVRR 430 FQ LGQ +K+KL+L KY KWKY+HF GER+RVRR Sbjct: 281 FQKLGQEEKKKLLLRKYQCKWKYVHFKGERIRVRR 315 >ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata] gi|297313803|gb|EFH44226.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata] Length = 284 Score = 297 bits (760), Expect = 8e-78 Identities = 140/214 (65%), Positives = 175/214 (81%) Frame = -2 Query: 1059 SKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEFPX 880 S + + +KEHHLW+K +S SGQKALNLVR++S LPNEKEAVYGAL+KW+AWE EFP Sbjct: 69 SNEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPI 128 Query: 879 XXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLWNM 700 R+QW R+IQ+AKWMLSKGQGATM TYD+LLLAFDMDQR DEAE LWNM Sbjct: 129 IAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDTLLLAFDMDQRADEAESLWNM 188 Query: 699 ILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQTLG 520 ILHTHTRSI ++LF+RMI+LY H+ + +K+I+VFADMEE+ V+PDEDT R++ARAF+ LG Sbjct: 189 ILHTHTRSIPRRLFARMIALYAHYDLHDKVIEVFADMEELKVRPDEDTARRVARAFRELG 248 Query: 519 QADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDE 418 Q + RKL+L +YLS++KYI+FNGERVRV+R + E Sbjct: 249 QEENRKLILRRYLSEFKYIYFNGERVRVKRYSSE 282 >ref|NP_001031667.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658716|gb|AEE84116.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 260 Score = 293 bits (751), Expect = 9e-77 Identities = 139/214 (64%), Positives = 172/214 (80%) Frame = -2 Query: 1059 SKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEFPX 880 SK+ + +KEHHLW+K +S SGQKALNLVR++S LPNEKEAVYGAL+KW+AWE EFP Sbjct: 45 SKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPI 104 Query: 879 XXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLWNM 700 R+QW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD+R DEAE LWNM Sbjct: 105 IAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNM 164 Query: 699 ILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQTLG 520 ILHTHTRSI ++LF+RMI+LY HH + +K+I+VFADMEE+ V PDED+ R++ARAF+ L Sbjct: 165 ILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELN 224 Query: 519 QADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDE 418 Q + RKL+L +YLS++KYI+FNGERVRV+R E Sbjct: 225 QEENRKLILRRYLSEYKYIYFNGERVRVKRYFSE 258 >ref|NP_567571.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|186512032|ref|NP_001119009.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|334186688|ref|NP_001190768.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635632|sp|Q2V3H0.2|PP322_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g18975, chloroplastic; Flags: Precursor gi|332658715|gb|AEE84115.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658717|gb|AEE84117.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658718|gb|AEE84118.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 287 Score = 293 bits (751), Expect = 9e-77 Identities = 139/214 (64%), Positives = 172/214 (80%) Frame = -2 Query: 1059 SKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAEFPX 880 SK+ + +KEHHLW+K +S SGQKALNLVR++S LPNEKEAVYGAL+KW+AWE EFP Sbjct: 72 SKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPI 131 Query: 879 XXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEMLWNM 700 R+QW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD+R DEAE LWNM Sbjct: 132 IAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNM 191 Query: 699 ILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQTLG 520 ILHTHTRSI ++LF+RMI+LY HH + +K+I+VFADMEE+ V PDED+ R++ARAF+ L Sbjct: 192 ILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELN 251 Query: 519 QADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDE 418 Q + RKL+L +YLS++KYI+FNGERVRV+R E Sbjct: 252 QEENRKLILRRYLSEYKYIYFNGERVRVKRYFSE 285 >ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda] gi|548851451|gb|ERN09727.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda] Length = 287 Score = 285 bits (730), Expect = 3e-74 Identities = 139/219 (63%), Positives = 173/219 (78%) Frame = -2 Query: 1068 NEKSKKTTQKARKEHHLWQKRESTKSGQKALNLVRIISALPNEKEAVYGALDKWIAWEAE 889 +EK KK +KEHHLW KR+S S QKALNLVRI+S + NEKEA+Y ALD+W AWE E Sbjct: 72 DEKPKKLF---KKEHHLWMKRDSAGSSQKALNLVRIVSRVSNEKEAIYVALDEWAAWETE 128 Query: 888 FPXXXXXXXXXXXXXRNQWVRLIQVAKWMLSKGQGATMATYDSLLLAFDMDQRVDEAEML 709 FP R +W+R+IQV+KW+LSKGQ TM TYD+LLLAFDMD RVDEAE + Sbjct: 129 FPVIAAAKALGILRKRRRWLRVIQVSKWLLSKGQVLTMGTYDTLLLAFDMDGRVDEAETI 188 Query: 708 WNMILHTHTRSISKQLFSRMISLYDHHSMPEKIIQVFADMEEVGVKPDEDTVRKIARAFQ 529 WNMILHT+TRSISK+LFSRM+SLYDHH +P+K+++VFADMEE+GVKPD+D+VR++ARAFQ Sbjct: 189 WNMILHTYTRSISKRLFSRMMSLYDHHHIPDKLLEVFADMEELGVKPDQDSVRRVARAFQ 248 Query: 528 TLGQADKRKLVLNKYLSKWKYIHFNGERVRVRRTTDEWE 412 LG+ +K+K VL KY K KYIHFNGERVR+ + + W+ Sbjct: 249 QLGEEEKQKQVLQKYGLKLKYIHFNGERVRI-KAGENWD 286