BLASTX nr result
ID: Rauwolfia21_contig00021387
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00021387 (1252 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Sequences producing significant alignments: Score (bits) E Value
ref|XP_004246209.1| PREDICTED: pentatricopeptide repeat-containi... 283 1e-73
ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citr... 277 8e-72
ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containi... 267 6e-69
ref|XP_002519945.1| pentatricopeptide repeat-containing protein,... 253 1e-64
gb|EOY32970.1| Pentatricopeptide repeat-containing protein, puta... 253 2e-64
ref|XP_003612457.1| Pentatricopeptide repeat-containing protein ... 244 4e-62
ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containi... 241 3e-61
gb|AFK33630.1| unknown [Lotus japonicus] 239 2e-60
gb|ESW30051.1| hypothetical protein PHAVU_002G120500g [Phaseolus... 235 3e-59
ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containi... 235 3e-59
gb|EXB70628.1| hypothetical protein L484_023813 [Morus notabilis] 233 2e-58
emb|CAN69066.1| hypothetical protein VITISV_016070 [Vitis vinifera] 229 2e-57
ref|NP_174459.1| pentatricopeptide repeat-containing protein [Ar... 212 3e-52
ref|XP_002893686.1| pentatricopeptide repeat-containing protein ... 207 7e-51
ref|XP_006303657.1| hypothetical protein CARUB_v10011695mg [Caps... 204 8e-50
ref|XP_004292904.1| PREDICTED: pentatricopeptide repeat-containi... 200 9e-49
ref|XP_006415303.1| hypothetical protein EUTSA_v10009456mg [Eutr... 187 8e-45
ref|XP_006841553.1| hypothetical protein AMTR_s00003p00175270 [A... 176 1e-41
ref|NP_001059279.1| Os07g0244400 [Oryza sativa Japonica Group] g...
160 1e-36 gb|EAZ03360.1| hypothetical protein OsI_25499 [Oryza sativa Indi... 160 1e-36 >ref|XP_004246209.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Solanum lycopersicum] Length = 465 Score = 283 bits (724), Expect = 1e-73 Identities = 137/266 (51%), Positives = 183/266 (68%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIV 182 +LFDK VRN+ SWA IAG ENGE + LF+EMQS G ++ GI+V Sbjct: 66 QLFDKMRVRNSQSWAAMIAGCVENGECVGALRLFMEMQS---EAGNLCKCGDLIDDGILV 122 Query: 183 CVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESNE 362 CVLK CV+ MN E G+Q+HG ++K G V S L++FYGEFG LE+++ +F+ + Sbjct: 123 CVLKACVELMNLEFGRQIHGWLLKLGNCESMVLNSFLIKFYGEFGYLESADNVFDHVPHC 182 Query: 363 NMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQV 542 N VVWTARI N CKEE F+ A+ +FREM EGVK+N +TFSS+LKACGKL D GCCG+Q+ Sbjct: 183 NTVVWTARIGNLCKEEQFEGAIRIFREMVSEGVKKNSFTFSSILKACGKLRDAGCCGQQI 242 Query: 543 HACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQGL 722 HA ++K+GL+ D YV C LIDMY K G + DA+ VFN + ACWNA++ ++ G Sbjct: 243 HATSVKVGLDTDSYVLCSLIDMYGKYGLLKDARRVFNAREDKSNIACWNAMLMGCIQHGF 302 Query: 723 CIEAIKMLYAMRAAGLQPQESLLHEL 800 +EA+K+LY M+ AGLQP ESL++E+ Sbjct: 303 GVEAMKVLYEMKEAGLQPHESLINEV 328 >ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citrus clementina] gi|557539679|gb|ESR50723.1| hypothetical protein CICLE_v10033975mg [Citrus clementina] Length = 425 Score = 277 bits (708), Expect = 8e-72 Identities = 139/272 (51%), Positives = 185/272 (68%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIV 182 +LFD+ +R+ SWAV I GY + +Y+E I LF EM +R +G + + + IIV Sbjct: 142 QLFDEMPLRDFNSWAVMIVGYVDVADYQECITLFAEMM--KRKKGH---MLLVFPAWIIV 196 Query: 183 CVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESNE 362 CVLK CV TMN ELGKQVHGL+ K G +R SL+ FYG+F LE ++++F+Q Sbjct: 197 CVLKACVCTMNMELGKQVHGLLFKLGSSRNISLTGSLINFYGKFRCLEDADFVFSQLKRH 256 Query: 363 
NMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQV 542 N VVWTA+I N C+E F + + F+EMG+E +K+N YTFSSVLKACG + DDG CGRQV Sbjct: 257 NTVVWTAKIVNNCREGHFHQVFNDFKEMGRERIKKNSYTFSSVLKACGGVDDDGNCGRQV 316 Query: 543 HACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQGL 722 HA +K+GLE D YV CGL+DMY KC + DAK VF V K+ A WNA++ Y++ GL Sbjct: 317 HANIVKIGLESDEYVQCGLVDMYGKCRLLRDAKRVFELIVDKKNIASWNAMLMGYIRNGL 376 Query: 723 CIEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 +EA K LY M+A+G+Q QESL+++LR C S Sbjct: 377 YVEATKFLYLMKASGIQIQESLINDLRIACSS 408 >ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Vitis vinifera] Length = 414 Score = 267 bits (683), Expect = 6e-69 Identities = 134/279 (48%), Positives = 183/279 (65%), Gaps = 4/279 (1%) Frame = +3 Query: 6 LFDKSSV--RNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGII 179 +FDK +V +N+ SWA+ +A Y +NG Y E I LF++M + + I Sbjct: 134 MFDKMNVLNKNSISWAIMLAAYMDNGFYEEAIFLFVQMMELHST------IMLELPAWIF 187 Query: 180 VCVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESN 359 +CVLK CV TMN LGKQVHG ++K G A L+ FYG+F L+ ++++F+Q S Sbjct: 188 ICVLKACVHTMNLTLGKQVHGWLLKVGYATNLFLSCYLISFYGKFRCLDDADFVFDQTSE 247 Query: 360 ENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQ 539 N V+WTA++ N C+ E +A+ F EMG+ GVKRN +T+SSVL+ACG++ D G CGR Sbjct: 248 RNTVIWTAKMVNKCQGEYMHEALVAFTEMGRAGVKRNEFTYSSVLRACGRMKDHGRCGRL 307 Query: 540 VHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTC--VHDKDAACWNALINSYVK 713 +HA IKLGLE D YV CGL+DMY KCG + +A+ VF T + + CWNA++ Y++ Sbjct: 308 IHASTIKLGLESDIYVQCGLVDMYGKCGLLVEARRVFETVSDTNKTNIVCWNAMLTGYIR 367 Query: 714 QGLCIEAIKMLYAMRAAGLQPQESLLHELRSLCGSYEIE 830 GL IEAIK LY M+AAG+QPQESLL+ELR CGS +E Sbjct: 368 HGLYIEAIKFLYQMKAAGIQPQESLLNELRIACGSTTLE 406 >ref|XP_002519945.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223540991|gb|EEF42549.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 403 Score = 253 bits (646), Expect = 1e-64 Identities = 
125/273 (45%), Positives = 171/273 (62%), Gaps = 2/273 (0%) Frame = +3 Query: 6 LFDKSSVRNAY-SWAVTIAGYFENGEYREVIDLFLEMQSWERA-EGEFDDLNNIAVSGII 179 LFDK ++ + SW + I G F N +Y I+LF++M +G DLN + II Sbjct: 131 LFDKMPLKKDFISWVIVIVGCFSNSKYEAGINLFIDMLLQHSVYDGLMFDLNTWNI--II 188 Query: 180 VCVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESN 359 +C++K C+ +MN LGKQVHG++ K G + SLM FYG+ G LE +FN+ N Sbjct: 189 LCIIKCCIYSMNISLGKQVHGILFKVGLTSEISFNVSLMDFYGKLGCLEDVNSVFNKLDN 248 Query: 360 ENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQ 539 N WTA+I N C+ + F + + F+EMG+ G+KRN +T SSVL+AC ++GD G CG+Q Sbjct: 249 HNTATWTAKIVNSCRNQRFYEVIEDFKEMGEAGIKRNSFTVSSVLRACARMGDGGNCGKQ 308 Query: 540 VHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQG 719 VH IKLGLE D +V CGLI MY KCG + AK VF + + ACWNAL+ +YV+ Sbjct: 309 VHVIVIKLGLESDAFVQCGLIAMYGKCGMIRKAKKVFELVIDKTNTACWNALLMAYVRNE 368 Query: 720 LCIEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 L IEA+K+LY M AA +Q ESLL +R CG+ Sbjct: 369 LFIEAMKLLYQMEAAKIQVNESLLDHVRIACGT 401 >gb|EOY32970.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 413 Score = 253 bits (645), Expect = 2e-64 Identities = 124/271 (45%), Positives = 173/271 (63%) Frame = +3 Query: 6 LFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIVC 185 LFD+ +R+ SWA+ I G+ + I F+ M E +L S IIVC Sbjct: 150 LFDQMLLRDFNSWAIMIVACLHAGDSEQAIAYFVRM--------ERHNLLFKCPSWIIVC 201 Query: 186 VLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESNEN 365 +LK+CV T N LGKQVHG ++K G + + SL+ FYG+F L+ ++++FNQ S N Sbjct: 202 LLKSCVVTKNMGLGKQVHGQLLKLGASNDSSLSGSLINFYGKFRCLDDADFVFNQLSRRN 261 Query: 366 MVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQVH 545 V WTARI N C+E+ F K + F EMG++G+K+N +TFS V KAC ++ DDG GRQVH Sbjct: 262 TVTWTARIVNSCREDQFGKVIDDFNEMGRQGIKKNNFTFSGVFKACARMDDDGMSGRQVH 321 Query: 546 ACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQGLC 725 A A+KLGLE D +V CGLI +Y KCGSV DA+ F ++ ACWNA++ YV LC Sbjct: 322 
ANALKLGLESDVFVQCGLIHLYGKCGSVRDAEKAFEIVGDKRNIACWNAMLMGYVHNELC 381 Query: 726 IEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 + AIK+LY M+ AG++ QESL++++R C + Sbjct: 382 LRAIKLLYRMKEAGIKVQESLINDVRIACAT 412 >ref|XP_003612457.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355513792|gb|AES95415.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 418 Score = 244 bits (624), Expect = 4e-62 Identities = 123/271 (45%), Positives = 171/271 (63%) Frame = +3 Query: 6 LFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIVC 185 +FD SVR+ +SWA Y+ENGEY ID+F+ M + D + I C Sbjct: 147 VFDVMSVRDFHSWATLFVSYYENGEYENAIDVFVSMLC------QLDVMGFSFPPWIWSC 200 Query: 186 VLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESNEN 365 +LK C TMN LG QVHG ++K G + SSL+RFYG F LE + +FN+ S N Sbjct: 201 LLKACACTMNVPLGMQVHGCLLKLGACDHVLISSSLIRFYGRFKCLEDANMVFNRVSRHN 260 Query: 366 MVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQVH 545 + WTA+I + C+E F +A+ F++MG+ GVK++ +TFSSVLKACG++ + G CG QVH Sbjct: 261 TLTWTAKIVSSCRERHFSEALGDFKKMGRVGVKKDSFTFSSVLKACGRMQNRGSCGEQVH 320 Query: 546 ACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQGLC 725 A AIKLGL+ D YV C LI MY + G + DA+ VF ++++ NA++ Y++ GL Sbjct: 321 ADAIKLGLDSDSYVQCSLIAMYGRSGLLRDAELVFEMTRNERNVDSLNAMLMGYIQNGLY 380 Query: 726 IEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 IEA+K +Y M+AAG+QP E LL +LR CGS Sbjct: 381 IEAVKFVYQMKAAGVQPHEPLLEKLRIACGS 411 >ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Cicer arietinum] Length = 418 Score = 241 bits (616), Expect = 3e-61 Identities = 120/271 (44%), Positives = 171/271 (63%) Frame = +3 Query: 6 LFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIVC 185 +FD+ VRN +SWA+ Y+EN +Y ID+F+ M + EF L C Sbjct: 147 VFDEMPVRNFHSWAILFVAYYENSDYENAIDVFMRMLR-QLGVMEFPFL-----PWFWSC 200 Query: 186 VLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESNEN 365 +L C T+N LG QVHG + K G + SSL+RFYG F LE + +FN+ S N Sbjct: 201 
LLTACACTVNVPLGMQVHGSLTKLGACDHVLISSSLIRFYGRFKCLEDANVVFNRVSRHN 260 Query: 366 MVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQVH 545 + WTA+I + C+E F + + F+EMG+ G+K++ +TFSSVLKACG++ + G CG QVH Sbjct: 261 TLTWTAKIVSGCRERHFTQVLGDFKEMGRVGIKKDSFTFSSVLKACGRMQNYGSCGEQVH 320 Query: 546 ACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQGLC 725 A +IKLGL+ D YV C LI MY + G + DAK VF T +++++ WNA++ Y++ GL Sbjct: 321 ADSIKLGLDSDNYVQCSLIAMYGRSGLLRDAKLVFETTLNERNVDSWNAMLMGYIQNGLY 380 Query: 726 IEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 I+A+K +Y M+AAG+ P ESLL +LR CGS Sbjct: 381 IKAVKFVYQMKAAGVHPHESLLEKLRIACGS 411 >gb|AFK33630.1| unknown [Lotus japonicus] Length = 356 Score = 239 bits (610), Expect = 2e-60 Identities = 123/272 (45%), Positives = 167/272 (61%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIV 182 +LFD V++ SWA Y++N +Y E ID+FL M + EF I Sbjct: 86 QLFDAMPVKDFNSWATLFIAYYDNADYEEAIDVFLAMLH-QLGMSEFPPW-------ICA 137 Query: 183 CVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESNE 362 C LK C N LG QVHG ++K G + SSL+RFYG F ++ + +FN+ S Sbjct: 138 CFLKACACIENIPLGMQVHGWLLKLGTCDHVLLSSSLIRFYGRFTCVKDANAVFNKLSRH 197 Query: 363 NMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQV 542 N WTA+I + C+E F + + F+EMG++G+K++ YTFSSVLKACGK+ D G CG QV Sbjct: 198 NTSTWTAKIVSGCREMDFPEVFNDFKEMGRQGIKKDTYTFSSVLKACGKMMDHGRCGEQV 257 Query: 543 HACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQGL 722 HA A+KLGL D YV C LI MY + G + DAK VF T +++ WNA++ Y++ GL Sbjct: 258 HADAMKLGLASDNYVQCSLIAMYGRSGLLRDAKQVFETSRSERNVDSWNAMLMGYLENGL 317 Query: 723 CIEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 IEA+K LY M+AAGL+P ESLL ++R CGS Sbjct: 318 YIEAVKFLYQMKAAGLKPHESLLDKVRIACGS 349 >gb|ESW30051.1| hypothetical protein PHAVU_002G120500g [Phaseolus vulgaris] Length = 420 Score = 235 bits (599), Expect = 3e-59 Identities = 123/271 (45%), Positives = 162/271 (59%) Frame = +3 Query: 6 LFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIVC 185 +F+K VR+ SWA Y++N EY E +F+ M G+ L 
I C Sbjct: 151 MFEKMRVRDFNSWATLFVAYYDNAEYEEATAVFVNML------GQLGMLQ--FPPWIWAC 202 Query: 186 VLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESNEN 365 +L+ C T+N LG QVHG ++K G + SSL+ FYG F LE + +FN S N Sbjct: 203 LLRACACTLNVPLGLQVHGWLLKLGACDHVLLSSSLINFYGRFTCLEDASAVFNGVSRHN 262 Query: 366 MVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQVH 545 + WTA+I + C+E F + FREMG GVK++ +TFSSVLKACGK+ + CG QVH Sbjct: 263 TLTWTAKIVSGCRERHFSEVFGDFREMGMRGVKKDCFTFSSVLKACGKMLNQERCGEQVH 322 Query: 546 ACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQGLC 725 A AIKLGL D YV C LI MY +CG + DAK+VF ++ CWNA++ Y + G Sbjct: 323 ADAIKLGLISDHYVQCSLIAMYGRCGLLTDAKDVFEMTREERKVDCWNAMLMGYTQNGFH 382 Query: 726 IEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 IEA+K LY M+AAG+QP ESLL +LR CGS Sbjct: 383 IEAVKFLYQMQAAGMQPWESLLKKLRIACGS 413 >ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Glycine max] Length = 423 Score = 235 bits (599), Expect = 3e-59 Identities = 119/271 (43%), Positives = 165/271 (60%) Frame = +3 Query: 6 LFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIVC 185 +FDK VR+ +WA Y++N +Y E ++F+ M + + EF I C Sbjct: 154 MFDKMRVRDFNTWATLFVAYYDNTDYEEATNVFVNMLT-QLGMMEFPPW-------IWAC 205 Query: 186 VLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESNEN 365 +L+ C T+N LG QVHG ++K G + SSL+ FYG F LE + +F+ S N Sbjct: 206 LLRACACTVNVPLGMQVHGWLLKLGTCDHVLLSSSLINFYGRFTCLEDASVVFDGVSRHN 265 Query: 366 MVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQVH 545 + WTA+I + C+E F + F+EMG GVK++ +TFSSVLKACG++ + CG QVH Sbjct: 266 TLTWTAKIVSGCRERHFSEVFDDFKEMGMRGVKKDCFTFSSVLKACGRMLNQERCGEQVH 325 Query: 546 ACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQGLC 725 AIKLGL D YV C LI MY +CG + DAK VF ++ CWNA++ Y++ GL Sbjct: 326 VDAIKLGLVSDHYVQCSLIAMYGRCGLLEDAKRVFEMSQEERKVDCWNAMLMGYIQNGLY 385 Query: 726 IEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 IEA+K LY M+AAG+QP+ESLL +LR CGS Sbjct: 386 IEAVKFLYQMQAAGMQPRESLLKKLRMACGS 416 >gb|EXB70628.1| hypothetical protein 
L484_023813 [Morus notabilis] Length = 453 Score = 233 bits (593), Expect = 2e-58 Identities = 123/278 (44%), Positives = 167/278 (60%), Gaps = 1/278 (0%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAV-SGII 179 +LF + ++ SWA I N +Y E LFL+M +N + S II Sbjct: 179 DLFYRMPFKDFKSWATMIVANVNNSDYEEATSLFLKM---------LHHINMLEFPSWII 229 Query: 180 VCVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESN 359 VC+LKTCV T N ELGKQVH +K G A S L+ FYG++G LE++ +FNQ Sbjct: 230 VCLLKTCVCTRNMELGKQVHACALKLGHANSLYLASCLINFYGKYGCLESANLVFNQLPR 289 Query: 360 ENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQ 539 + + W R+ N KEE+F + + F E+G+ G+K+N FSSVLKACG++ D G+Q Sbjct: 290 HDTLTWMTRLINNSKEELFFEVLRDFNEVGKAGIKKNVLMFSSVLKACGRIHDRRKSGQQ 349 Query: 540 VHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQG 719 VHA AIKLG E D YV CGLIDMY + G + DA+ VF ++ ACWNA++ Y++ Sbjct: 350 VHANAIKLGFESDLYVQCGLIDMYGRSGLLRDAQRVFEKSSDRRNNACWNAMLGGYIRNE 409 Query: 720 LCIEAIKMLYAMRAAGLQPQESLLHELRSLCGSYEIEK 833 L +EAIK +Y M+A GLQ Q+S+L ELR CGS + K Sbjct: 410 LYVEAIKFVYQMKAVGLQLQQSMLDELRIACGSDSLRK 447 >emb|CAN69066.1| hypothetical protein VITISV_016070 [Vitis vinifera] Length = 543 Score = 229 bits (583), Expect = 2e-57 Identities = 123/279 (44%), Positives = 163/279 (58%), Gaps = 4/279 (1%) Frame = +3 Query: 6 LFDKSSV--RNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGII 179 +FDK +V +N+ SWA+ +A Y +NG Y E I LF++M + + I Sbjct: 297 MFDKMNVLNKNSISWAIMLAAYMDNGFYEEAIFLFVQMMELHST------IMLELPAWIF 350 Query: 180 VCVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESN 359 +CVLK CV TMN LGKQVHG + K Sbjct: 351 ICVLKACVHTMNLTLGKQVHGWLTK----------------------------------E 376 Query: 360 ENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQ 539 N V+WTA++ N C+ E +A+ F EMG+ GVKRN +T+SSVL+ACG++ D G CGR Sbjct: 377 RNTVIWTAKMVNKCQGEYMHEALVAFTEMGRAGVKRNEFTYSSVLRACGRMKDHGRCGRL 436 Query: 540 VHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTC--VHDKDAACWNALINSYVK 713 +HA IKLGLE D YV CGL+DMY KCG + 
+A+ VF T + + CWNA++ Y++ Sbjct: 437 IHASTIKLGLESDIYVQCGLVDMYGKCGLLVEARRVFETVSDTNKTNIVCWNAMLTGYIR 496 Query: 714 QGLCIEAIKMLYAMRAAGLQPQESLLHELRSLCGSYEIE 830 GL IEAIK LY M+AAG+QPQESLL+ELR CGS +E Sbjct: 497 HGLYIEAIKFLYQMKAAGIQPQESLLNELRIACGSTTLE 535 >ref|NP_174459.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75169166|sp|Q9C6R9.1|PPR66_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g31790 gi|12321298|gb|AAG50719.1|AC079041_12 hypothetical protein [Arabidopsis thaliana] gi|111074348|gb|ABH04547.1| At1g31790 [Arabidopsis thaliana] gi|332193272|gb|AEE31393.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 409 Score = 212 bits (539), Expect = 3e-52 Identities = 115/267 (43%), Positives = 161/267 (60%), Gaps = 2/267 (0%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIV 182 ++FD+ R+ +SWA+ G E G+Y + LF+ M + +G F S I+ Sbjct: 144 QMFDRMPHRDFHSWAIVFLGCIEMGDYEDAAFLFVSMLKHSQ-KGAFK-----IPSWILG 197 Query: 183 CVLKTCVKTMNFELGKQVHGLIIKAGCA-RIAVYIS-SLMRFYGEFGSLETSEYIFNQES 356 CVLK C +FELGKQVH L K G Y+S SL+RFYGEF LE + + +Q S Sbjct: 198 CVLKACAMIRDFELGKQVHALCHKLGFIDEEDSYLSGSLIRFYGEFRCLEDANLVLHQLS 257 Query: 357 NENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGR 536 N N V W A++ N +E F + + F EMG G+K+N FS+VLKAC + D G G+ Sbjct: 258 NANTVAWAAKVTNDYREGEFQEVIRDFIEMGNHGIKKNVSVFSNVLKACSWVSDGGRSGQ 317 Query: 537 QVHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQ 716 QVHA AIKLG E D + C LI+MY K G V DA+ VF + + +CWNA++ SY++ Sbjct: 318 QVHANAIKLGFESDCLIRCRLIEMYGKYGKVKDAEKVFKSSKDETSVSCWNAMVASYMQN 377 Query: 717 GLCIEAIKMLYAMRAAGLQPQESLLHE 797 G+ IEAIK+LY M+A G++ ++LL+E Sbjct: 378 GIYIEAIKLLYQMKATGIKAHDTLLNE 404 >ref|XP_002893686.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297339528|gb|EFH69945.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 410 Score = 207 bits (527), Expect = 7e-51 Identities = 113/266 (42%), Positives = 158/266 (59%), Gaps = 2/266 (0%) Frame = +3 Query: 6 LFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIVC 185 +FDK R+ +SWA+ G E G+Y + LF+ M + G F S I+ C Sbjct: 146 MFDKMPHRDFHSWAIVFLGCIEMGDYEDAALLFVSMLKHSQ-NGAFK-----IPSWIMGC 199 Query: 186 VLKTCVKTMNFELGKQVHGLIIKAGCA-RIAVYIS-SLMRFYGEFGSLETSEYIFNQESN 359 VLK C +FELGKQVH L K GC Y+S SL+RFYGEF LE + + +Q SN Sbjct: 200 VLKACAMIRDFELGKQVHALCHKLGCIDEEDSYLSGSLIRFYGEFRCLEDANLVLHQLSN 259 Query: 360 ENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQ 539 N V W A++ N +E F + + F EMG +++N FS+VLKAC + D G G+Q Sbjct: 260 ANTVAWAAKVTNDYREGEFQEVIRDFIEMGNHRIRKNVSVFSNVLKACTWVSDGGRSGKQ 319 Query: 540 VHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQG 719 VHA AIKLG E D + C LI+MY K G V DA+ VF + + + CWNA++ Y++ G Sbjct: 320 VHAVAIKLGFESDCLIRCRLIEMYGKYGKVKDAEKVFKSSKDETNVNCWNAMVAGYMQNG 379 Query: 720 LCIEAIKMLYAMRAAGLQPQESLLHE 797 + +EAIK+L M+A G++ Q++LL+E Sbjct: 380 IYVEAIKLLCQMKATGIKAQDTLLNE 405 >ref|XP_006303657.1| hypothetical protein CARUB_v10011695mg [Capsella rubella] gi|482572368|gb|EOA36555.1| hypothetical protein CARUB_v10011695mg [Capsella rubella] Length = 411 Score = 204 bits (518), Expect = 8e-50 Identities = 112/266 (42%), Positives = 154/266 (57%), Gaps = 2/266 (0%) Frame = +3 Query: 6 LFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIVC 185 +FDK R+ +SWA+ G E G+Y + LF+ M + G F S I+ C Sbjct: 146 MFDKMPHRDFHSWAIVFLGCIEMGDYEDAALLFVAMLKHSKNGGAFK-----IPSWIMGC 200 Query: 186 VLKTCVKTMNFELGKQVHGLIIKAGCA--RIAVYISSLMRFYGEFGSLETSEYIFNQESN 359 VLK C + LGKQVHGL K G + + SL+RFYGEF LE + + +Q SN Sbjct: 201 VLKACAMIRDLALGKQVHGLCQKLGFIGEEDSYLLGSLIRFYGEFRCLEDANLVLHQLSN 260 Query: 360 ENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQ 539 N VVW A++ N +E F + + F EMG+ GVK+N S+VLKAC + D G G+Q Sbjct: 261 ANTVVWAAKVTNDYREGEFQEVIRDFIEMGKLGVKKNVSVVSNVLKACTWVSDGGRSGQQ 320 Query: 540 
VHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQG 719 VHA AIKLG E D + C LI+MY K V DA+ VF + + +CWNA++ Y++ G Sbjct: 321 VHANAIKLGFESDCLIRCQLIEMYGKYEKVKDAEKVFKSRKDETSVSCWNAMVAGYMQNG 380 Query: 720 LCIEAIKMLYAMRAAGLQPQESLLHE 797 IEAIK+LY M+A G++ + LL+E Sbjct: 381 FYIEAIKLLYQMKATGIKADDMLLNE 406 >ref|XP_004292904.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Fragaria vesca subsp. vesca] Length = 421 Score = 200 bits (509), Expect = 9e-49 Identities = 109/273 (39%), Positives = 160/273 (58%), Gaps = 1/273 (0%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIV 182 +LFD+ +++ SWA I Y +N +Y E + LFL M + + + + I+ Sbjct: 153 QLFDEMPLKDFNSWATLIVAYAQNADYAEALRLFLSMLHLQDCHVDISEFP----AWIMA 208 Query: 183 CVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYIS-SLMRFYGEFGSLETSEYIFNQESN 359 CVL TM+ LG+Q+HG +K G A ++++ SL+ YG E ++ S Sbjct: 209 CVLDA---TMDVGLGEQLHGCCLKLGHANRDMFVATSLINLYGRLRCHEAAQRASLGLSQ 265 Query: 360 ENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQ 539 N + WTAR+ N + E F + + F+E+G+ G+ +N S VL+AC ++ D G GRQ Sbjct: 266 PNALTWTARMINNSRGERFFEVISDFKEIGRAGISKNTSMISCVLRACARMHDSGFRGRQ 325 Query: 540 VHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQG 719 VHA AIKLG++ +VHCGLIDMY + G + DAK VF T ACWNA++ +Y++ G Sbjct: 326 VHANAIKLGVDSHSFVHCGLIDMYGRNGLLRDAKLVFQTFNDTTSTACWNAMLTNYLRNG 385 Query: 720 LCIEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 L IEA+K LY M+A GLQPQE LL ++R C S Sbjct: 386 LHIEALKFLYEMQADGLQPQEYLLDQVRIACAS 418 >ref|XP_006415303.1| hypothetical protein EUTSA_v10009456mg [Eutrema salsugineum] gi|557093074|gb|ESQ33656.1| hypothetical protein EUTSA_v10009456mg [Eutrema salsugineum] Length = 400 Score = 187 bits (475), Expect = 8e-45 Identities = 109/268 (40%), Positives = 151/268 (56%), Gaps = 2/268 (0%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIV 182 ++FDK R+ +SWA+ I G E G+Y++ + LF+ M + + I+ Sbjct: 145 QMFDKMPQRDFHSWAIVILGCIEMGDYQDAVFLFVSMLKNQNRVSKIPPW-------IMG 197 Query: 183 
CVLKTCVKTMNFELGKQVHGLIIKAGCARIA-VYISS-LMRFYGEFGSLETSEYIFNQES 356 CVLK C + +LGKQVHGL K G + Y+S L+RFYGEF LE + + NQ S Sbjct: 198 CVLKACGMIRDLDLGKQVHGLCQKLGFIEVEDSYLSGCLVRFYGEFRCLEDANLVLNQLS 257 Query: 357 NENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGR 536 N N VVW A++ N +E F + + F EMG+ G+K+N FS+VLKAC + D G GR Sbjct: 258 NANTVVWAAKVTNDYREGRFQEVILDFIEMGKHGIKKNVSVFSNVLKACTWVSDGGRSGR 317 Query: 537 QVHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQ 716 VHA AIKLG E D + C LI+MY K G V DA+ VF + Sbjct: 318 GVHASAIKLGFESDCMIRCRLIEMYGKYGKVKDAEKVF----------------KNERSN 361 Query: 717 GLCIEAIKMLYAMRAAGLQPQESLLHEL 800 G +EAIK+LY M+A GLQ +++LL+E+ Sbjct: 362 GFYVEAIKLLYQMKATGLQVEDTLLNEV 389 >ref|XP_006841553.1| hypothetical protein AMTR_s00003p00175270 [Amborella trichopoda] gi|548843574|gb|ERN03228.1| hypothetical protein AMTR_s00003p00175270 [Amborella trichopoda] Length = 327 Score = 176 bits (447), Expect = 1e-41 Identities = 104/273 (38%), Positives = 152/273 (55%), Gaps = 1/273 (0%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEM-QSWERAEGEFDDLNNIAVSGII 179 ++FDK S RN +W I G + G E +DL++ M Q R + N A+ G Sbjct: 66 QVFDKMSHRNTDTWQFMITGLMDLGMNEETLDLYIRMHQEMVRMKP------NTAIQG-- 117 Query: 180 VCVLKTCVKTMNFELGKQVHGLIIKAGCARIAVYISSLMRFYGEFGSLETSEYIFNQESN 359 VL+ C + LGKQ+H IK+G ++ L+ FY E L ++ F++ Sbjct: 118 -GVLRACAFIEDVGLGKQIHAKAIKSGSSKDTYLGCCLVDFYVEMKCLVSARKAFDEICK 176 Query: 360 ENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDDGCCGRQ 539 N+V WTA I C +E F + VFREM + G + N YT+S +L A GK+G G+Q Sbjct: 177 PNVVAWTAMIVGCAREGEFHGVLEVFREMERVGKRGNCYTYSCLLGASGKMG-HVWMGKQ 235 Query: 540 VHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALINSYVKQG 719 V A IK+G+E D YV ++ MY KCG V DA+ VF+ + +K+A WNA++ Y K G Sbjct: 236 VQARVIKVGVEKDVYVGSSIVGMYGKCGFVEDARLVFDG-MREKNAVSWNAMLCGYAKNG 294 Query: 720 LCIEAIKMLYAMRAAGLQPQESLLHELRSLCGS 818 C EAIK+LY MR GL+P + +++E+ CG+ Sbjct: 295 CCDEAIKLLYEMRCKGLEPPQVMVNEVAIACGA 327 >ref|NP_001059279.1| Os07g0244400 [Oryza sativa 
Japonica Group] gi|24417179|dbj|BAC22540.1| putative pentatricopeptide repeat-containing protein [Oryza sativa Japonica Group] gi|50508329|dbj|BAD30147.1| putative pentatricopeptide repeat-containing protein [Oryza sativa Japonica Group] gi|113610815|dbj|BAF21193.1| Os07g0244400 [Oryza sativa Japonica Group] gi|125599686|gb|EAZ39262.1| hypothetical protein OsJ_23686 [Oryza sativa Japonica Group] Length = 435 Score = 160 bits (405), Expect = 1e-36 Identities = 84/277 (30%), Positives = 149/277 (53%), Gaps = 7/277 (2%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIV 182 ++FD+ V+N +WA ++ Y + + + + LF++M R + + IV Sbjct: 166 QVFDEMPVKNGITWATMVSAYSDGCFHHDALQLFVQMCHQVRG------ITGDHYTHAIV 219 Query: 183 CVLKTCVKTMNFELGKQVHGLIIKAG--CARIAVYISSLMRFYGEFGSLETSEYI----- 341 VL++C + + G+QVH ++K C + SSL++ Y + G L ++ ++ Sbjct: 220 AVLRSCARVNELQFGEQVHAFVVKKNGVCGDVG---SSLLQLYCDSGQLSSARHVLEMMR 276 Query: 342 FNQESNENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDD 521 F+ + WT+ I ++ + D A+ VFR M G+ R+ ++ SS+L C + + Sbjct: 277 FSCQEPVPEAAWTSLITAYHRDGILDDAIDVFRGMASSGIARSSFSLSSILAVCAEAKNK 336 Query: 522 GCCGRQVHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALIN 701 GC G+QVHA AIK GL+++ +V GL+ MY K G + DA F DA CWNA+ Sbjct: 337 GCYGQQVHADAIKRGLDMNQFVGSGLLHMYAKEGQLADAARAFEAIDGKPDAVCWNAMAM 396 Query: 702 SYVKQGLCIEAIKMLYAMRAAGLQPQESLLHELRSLC 812 +Y + G+ EA +++Y M+AAG+ P + ++E++ C Sbjct: 397 AYARGGMYREATRVVYQMKAAGMNPSKLTMNEVKLAC 433 >gb|EAZ03360.1| hypothetical protein OsI_25499 [Oryza sativa Indica Group] Length = 436 Score = 160 bits (405), Expect = 1e-36 Identities = 84/277 (30%), Positives = 149/277 (53%), Gaps = 7/277 (2%) Frame = +3 Query: 3 ELFDKSSVRNAYSWAVTIAGYFENGEYREVIDLFLEMQSWERAEGEFDDLNNIAVSGIIV 182 ++FD++ V+N +WA ++ Y + + + + LF +M R + + IV Sbjct: 167 QVFDETPVKNGITWATMVSAYSDGCFHHDALQLFAQMCHQVRG------ITGDHYTHAIV 220 Query: 183 CVLKTCVKTMNFELGKQVHGLIIKAG--CARIAVYISSLMRFYGEFGSLETSEYI----- 341 VL++C + + G+QVH ++K C + SSL++ Y + G L ++ ++ Sbjct: 221 
AVLRSCARVNELQFGEQVHAFVVKKNGVCGDVG---SSLLQLYCDSGQLSSARHVLEMMR 277 Query: 342 FNQESNENMVVWTARIANCCKEEMFDKAVHVFREMGQEGVKRNGYTFSSVLKACGKLGDD 521 F+ + WT+ I ++ + D A+ VFR M G+ R+ ++ SS+L C + + Sbjct: 278 FSCQEPVPEAAWTSLITAYHRDGILDDAIDVFRGMASSGIARSSFSLSSILAVCAEAKNK 337 Query: 522 GCCGRQVHACAIKLGLELDGYVHCGLIDMYRKCGSVNDAKNVFNTCVHDKDAACWNALIN 701 GC G+QVHA AIK GL+++ +V GL+ MY K G + DA F DA CWNA+ Sbjct: 338 GCYGQQVHADAIKRGLDMNQFVGSGLLHMYAKEGQLADAARAFEAIDGKPDAVCWNAMAM 397 Query: 702 SYVKQGLCIEAIKMLYAMRAAGLQPQESLLHELRSLC 812 +Y + G+ EA +++Y M+AAG+ P + ++E++ C Sbjct: 398 AYARGGMYREATRVVYQMKAAGMNPSKLTMNEVKLAC 434
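
Each alignment in the report above opens with a statistics block (Score, Expect, Identities, Positives, optional Gaps, Frame). The following is a minimal sketch of how those blocks could be pulled out of the report text with a regular expression, assuming the legacy BLASTX 2.2.x wording shown here. The names `STATS` and `parse_hit_stats` are hypothetical helpers made up for this illustration and are not part of BLAST; variant forms such as `Expect(2) =` are not handled.

```python
import re

# Pattern for one per-hit statistics block as printed in this report.
# \s+ is used between fields so the pattern works on the original
# multi-line layout as well as on whitespace-flattened text.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hit_stats(report_text):
    """Return one dict per statistics block found in report_text."""
    hits = []
    for m in STATS.finditer(report_text):
        evalue = m.group("evalue")
        # Legacy BLAST may print very small values as "e-100" (no mantissa).
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hits.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": ident,
            "align_length": length,
            "percent_identity": 100.0 * ident / length,
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps") or 0),  # the Gaps field is optional
            "frame": int(m.group("frame")),
        })
    return hits


if __name__ == "__main__":
    # Two statistics blocks copied from the report above.
    sample = (
        "Score = 283 bits (724), Expect = 1e-73 Identities = 137/266 (51%), "
        "Positives = 183/266 (68%) Frame = +3 "
        "Score = 267 bits (683), Expect = 6e-69 Identities = 134/279 (48%), "
        "Positives = 183/279 (65%), Gaps = 4/279 (1%) Frame = +3"
    )
    for hit in parse_hit_stats(sample):
        print(hit)
```

The percent identity is recomputed from the raw counts rather than read from the printed percentage, since the report only shows it as a whole number.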