BLASTX nr result
ID: Cephaelis21_contig00040992
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00040992 (553 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI22025.3| unnamed protein product [Vitis vinifera] 252 3e-65 ref|XP_002277494.1| PREDICTED: pentatricopeptide repeat-containi... 252 3e-65 ref|XP_004137032.1| PREDICTED: pentatricopeptide repeat-containi... 241 4e-62 ref|XP_003532658.1| PREDICTED: pentatricopeptide repeat-containi... 225 3e-57 ref|XP_002514539.1| pentatricopeptide repeat-containing protein,... 225 4e-57 >emb|CBI22025.3| unnamed protein product [Vitis vinifera] Length = 489 Score = 252 bits (643), Expect = 3e-65 Identities = 124/184 (67%), Positives = 149/184 (80%) Frame = +1 Query: 1 IITAGCLKDKFVRNHLLNAYSKMGQLETAVSLFDKMPKRNTMSYNILIGSFIQNGDLDTA 180 IIT+GC DKF+ NHLLN YSK GQL+TA++LF MP++N MS NILI + ++GD TA Sbjct: 78 IITSGCSSDKFISNHLLNLYSKCGQLDTAITLFGVMPRKNIMSCNILINGYFRSGDWVTA 137 Query: 181 FKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPDAFTLGSVFRGCAGL 360 K+FDEM ERN+ATWNAM+ GL+Q EFNE+GL LFS+M++LGFLPD F LGSV RGCAGL Sbjct: 138 RKMFDEMPERNVATWNAMVAGLIQFEFNEEGLGLFSRMNELGFLPDEFALGSVLRGCAGL 197 Query: 361 KDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEKVIRMMPVSSVPACNTLI 540 + L GRQVHGYV K GFE++LVV SSLAHMYM+ GSL EGE++IR MP +V A NTLI Sbjct: 198 RALVAGRQVHGYVRKCGFEFNLVVVSSLAHMYMKCGSLGEGERLIRAMPSQNVVAWNTLI 257 Query: 541 AGRA 552 AGRA Sbjct: 258 AGRA 261 Score = 75.9 bits (185), Expect = 4e-12 Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 5/182 (2%) Frame = +1 Query: 13 GCLKDKFVRNHLLNAYSKMGQLETAVSLFDKMPKRNTMSYNILIGS-----FIQNGDLDT 177 G L D+F +L + + L + + K +N+++ S +++ G L Sbjct: 179 GFLPDEFALGSVLRGCAGLRALVAGRQVHGYVRKCG-FEFNLVVVSSLAHMYMKCGSLGE 237 Query: 178 AFKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPDAFTLGSVFRGCAG 357 +L M +N+ WN +I G Q+ + E+ L ++ M GF PD T SV C+ Sbjct: 238 GERLIRAMPSQNVVAWNTLIAGRAQNGYPEEVLDQYNMMKMAGFRPDKITFVSVISSCSE 297 Query: 358 LKDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEKVIRMMPVSSVPACNTL 537 L L G+Q+H V+K+G + V SSL MY R G L KV V +++ Sbjct: 298 LATLGQGQQIHAEVIKAGASLIVSVISSLISMYSRCGCLEYSLKVFLECENGDVVCWSSM 357 Query: 538 IA 543 IA Sbjct: 358 IA 359 >ref|XP_002277494.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080 [Vitis vinifera] Length = 657 Score = 252 bits (643), Expect = 3e-65 Identities = 124/184 (67%), Positives = 149/184 (80%) Frame = +1 Query: 1 IITAGCLKDKFVRNHLLNAYSKMGQLETAVSLFDKMPKRNTMSYNILIGSFIQNGDLDTA 180 IIT+GC DKF+ NHLLN YSK GQL+TA++LF MP++N MS NILI + ++GD TA Sbjct: 78 IITSGCSSDKFISNHLLNLYSKCGQLDTAITLFGVMPRKNIMSCNILINGYFRSGDWVTA 137 Query: 181 FKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPDAFTLGSVFRGCAGL 360 K+FDEM ERN+ATWNAM+ GL+Q EFNE+GL LFS+M++LGFLPD F LGSV RGCAGL Sbjct: 138 RKMFDEMPERNVATWNAMVAGLIQFEFNEEGLGLFSRMNELGFLPDEFALGSVLRGCAGL 197 Query: 361 KDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEKVIRMMPVSSVPACNTLI 540 + L GRQVHGYV K GFE++LVV SSLAHMYM+ GSL EGE++IR MP +V A NTLI Sbjct: 198 RALVAGRQVHGYVRKCGFEFNLVVVSSLAHMYMKCGSLGEGERLIRAMPSQNVVAWNTLI 257 Query: 541 AGRA 552 AGRA Sbjct: 258 AGRA 261 Score = 75.9 bits (185), Expect = 4e-12 Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 5/182 (2%) Frame = +1 Query: 13 GCLKDKFVRNHLLNAYSKMGQLETAVSLFDKMPKRNTMSYNILIGS-----FIQNGDLDT 177 G L D+F +L + + L + + K +N+++ S +++ G L Sbjct: 179 GFLPDEFALGSVLRGCAGLRALVAGRQVHGYVRKCG-FEFNLVVVSSLAHMYMKCGSLGE 237 Query: 178 AFKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPDAFTLGSVFRGCAG 357 +L M +N+ WN +I G Q+ + E+ L ++ M GF PD T SV C+ Sbjct: 238 GERLIRAMPSQNVVAWNTLIAGRAQNGYPEEVLDQYNMMKMAGFRPDKITFVSVISSCSE 297 Query: 358 LKDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEKVIRMMPVSSVPACNTL 537 L L G+Q+H V+K+G + V SSL MY R G L KV V +++ Sbjct: 298 LATLGQGQQIHAEVIKAGASLIVSVISSLISMYSRCGCLEYSLKVFLECENGDVVCWSSM 357 Query: 538 IA 543 IA Sbjct: 358 IA 359 >ref|XP_004137032.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like [Cucumis sativus] gi|449526872|ref|XP_004170437.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like [Cucumis sativus] Length = 667 Score = 241 bits (616), Expect = 4e-62 Identities = 117/184 (63%), Positives = 147/184 (79%) Frame = +1 Query: 1 IITAGCLKDKFVRNHLLNAYSKMGQLETAVSLFDKMPKRNTMSYNILIGSFIQNGDLDTA 180 IIT+G KDKF+ NHLLN YSK+GQ ++++ LF MP+RN MS+NILI ++Q GDL++A Sbjct: 88 IITSGGSKDKFISNHLLNFYSKLGQFKSSLVLFSNMPRRNVMSFNILINGYLQLGDLESA 147 Query: 181 FKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPDAFTLGSVFRGCAGL 360 KLFDEM ERN+ATWNAMI GL Q EFN+ LSLF +M+ LGFLPD FTLGSV RGCAGL Sbjct: 148 QKLFDEMSERNIATWNAMIAGLTQFEFNKQALSLFKEMYGLGFLPDEFTLGSVLRGCAGL 207 Query: 361 KDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEKVIRMMPVSSVPACNTLI 540 + L G++VH ++K GFE VVGSSLAHMY++SGSL +GEK+I+ MP+ +V A NTLI Sbjct: 208 RSLLAGQEVHACLLKCGFELSSVVGSSLAHMYIKSGSLSDGEKLIKSMPIRTVVAWNTLI 267 Query: 541 AGRA 552 AG+A Sbjct: 268 AGKA 271 Score = 79.0 bits (193), Expect = 5e-13 Identities = 43/113 (38%), Positives = 59/113 (52%) Frame = +1 Query: 151 FIQNGDLDTAFKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPDAFTL 330 +I++G L KL M R + WN +I G Q+ E+ L+ ++ M GF PD T Sbjct: 239 YIKSGSLSDGEKLIKSMPIRTVVAWNTLIAGKAQNGCPEEVLNQYNMMKMAGFRPDKITF 298 Query: 331 GSVFRGCAGLKDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEK 489 SV C+ L L G+Q+H V+K+G L V SSL MY RSG L + K Sbjct: 299 VSVLSACSELATLGQGQQIHAEVIKAGASSVLAVVSSLISMYSRSGCLEDSIK 351 Score = 54.7 bits (130), Expect = 1e-05 Identities = 50/176 (28%), Positives = 77/176 (43%), Gaps = 9/176 (5%) Frame = +1 Query: 10 AGCLKDKFVRNHLLNAYSKMGQLETAVSLFDKMPKRNTMSYNILIGSFI----QNGDLDT 177 AG DK +L+A S++ L + ++ K S ++ S I ++G L+ Sbjct: 289 AGFRPDKITFVSVLSACSELATLGQGQQIHAEVIKAGASSVLAVVSSLISMYSRSGCLED 348 Query: 178 AFKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPDAFTLGSVFRGCA- 354 + K F + ++ W++MI H E+ L LF QM L + T S+ C+ Sbjct: 349 SIKAFVDRENFDVVLWSSMIAAYGFHGRGEEALELFHQMEDLKMEANEVTFLSLLYACSH 408 Query: 355 -GLKDLNT---GRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEKVIRMMPV 510 GLK+ T V Y +K E + V L R+G L E E +IR MPV Sbjct: 409 SGLKEKGTEYFDLMVKKYKLKPRIEHYTCVVDLLG----RAGRLEEAEGMIRSMPV 460 >ref|XP_003532658.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like [Glycine max] Length = 674 Score = 225 bits (574), Expect = 3e-57 Identities = 107/184 (58%), Positives = 144/184 (78%) Frame = +1 Query: 1 IITAGCLKDKFVRNHLLNAYSKMGQLETAVSLFDKMPKRNTMSYNILIGSFIQNGDLDTA 180 I T+GC DKF+ NHLLN YSK G+L+ AV+LFD+MP+RN MS NI+I +++ G+L++A Sbjct: 95 IFTSGCSSDKFISNHLLNLYSKFGELQAAVALFDRMPRRNIMSCNIMIKAYLGMGNLESA 154 Query: 181 FKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPDAFTLGSVFRGCAGL 360 LFDEM +RN+ATWNAM+ GL + E NE+ L LFS+M++L F+PD ++LGSV RGCA L Sbjct: 155 KNLFDEMPDRNVATWNAMVTGLTKFEMNEEALLLFSRMNELSFMPDEYSLGSVLRGCAHL 214 Query: 361 KDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEKVIRMMPVSSVPACNTLI 540 L G+QVH YV+K GFE +LVVG SLAHMYM++GS+ +GE+VI MP S+ A NTL+ Sbjct: 215 GALLAGQQVHAYVMKCGFECNLVVGCSLAHMYMKAGSMHDGERVINWMPDCSLVAWNTLM 274 Query: 541 AGRA 552 +G+A Sbjct: 275 SGKA 278 Score = 66.2 bits (160), Expect = 3e-09 Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 5/142 (3%) Frame = +1 Query: 133 NILIGS-----FIQNGDLDTAFKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMH 297 N+++G +++ G + ++ + M + +L WN ++ G Q + E L + M Sbjct: 235 NLVVGCSLAHMYMKAGSMHDGERVINWMPDCSLVAWNTLMSGKAQKGYFEGVLDQYCMMK 294 Query: 298 KLGFLPDAFTLGSVFRGCAGLKDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLR 477 GF PD T SV C+ L L G+Q+H VK+G + V SSL MY R G L+ Sbjct: 295 MAGFRPDKITFVSVISSCSELAILCQGKQIHAEAVKAGASSEVSVVSSLVSMYSRCGCLQ 354 Query: 478 EGEKVIRMMPVSSVPACNTLIA 543 + K V +++IA Sbjct: 355 DSIKTFLECKERDVVLWSSMIA 376 Score = 55.8 bits (133), Expect = 4e-06 Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 2/128 (1%) Frame = +1 Query: 139 LIGSFIQNGDLDTAFKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQMHKLGFLPD 318 L+ + + G L + K F E ER++ W++MI H E+ + LF++M + + Sbjct: 343 LVSMYSRCGCLQDSIKTFLECKERDVVLWSSMIAAYGFHGQGEEAIKLFNEMEQENLPGN 402 Query: 319 AFTLGSVFRGCA--GLKDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSLREGEKV 492 T S+ C+ GLKD G V K G + L + L + RSG L E E + Sbjct: 403 EITFLSLLYACSHCGLKDKGLGL-FDMMVKKYGLKARLQHYTCLVDLLGRSGCLEEAEAM 461 Query: 493 IRMMPVSS 516 IR MPV + Sbjct: 462 IRSMPVKA 469 >ref|XP_002514539.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223546143|gb|EEF47645.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 332 Score = 225 bits (573), Expect = 4e-57 Identities = 108/165 (65%), Positives = 137/165 (83%) Frame = +1 Query: 58 YSKMGQLETAVSLFDKMPKRNTMSYNILIGSFIQNGDLDTAFKLFDEMGERNLATWNAMI 237 YSK+G+L+TA+ LF+ MP RN MS NILI ++I +G+LD+A LFDEM ERN+ATWNA++ Sbjct: 2 YSKIGELQTALLLFNSMPMRNIMSCNILINAYIHSGELDSARNLFDEMPERNVATWNAVV 61 Query: 238 VGLVQHEFNEDGLSLFSQMHKLGFLPDAFTLGSVFRGCAGLKDLNTGRQVHGYVVKSGFE 417 GL+Q E NE+GL+LF M++LGFLPD +TLGSV RGCAGLK + GRQVH YVVK GFE Sbjct: 62 AGLIQFEINEEGLNLFRDMYELGFLPDEYTLGSVLRGCAGLKSMYAGRQVHCYVVKCGFE 121 Query: 418 WHLVVGSSLAHMYMRSGSLREGEKVIRMMPVSSVPACNTLIAGRA 552 ++LVVGSSLAHMYM+SGSL EGE+VI+ MP ++ A NTLIAG++ Sbjct: 122 FNLVVGSSLAHMYMKSGSLSEGERVIKSMPSRNIVAWNTLIAGKS 166 Score = 86.7 bits (213), Expect = 2e-15 Identities = 44/126 (34%), Positives = 69/126 (54%), Gaps = 5/126 (3%) Frame = +1 Query: 130 YNILIGS-----FIQNGDLDTAFKLFDEMGERNLATWNAMIVGLVQHEFNEDGLSLFSQM 294 +N+++GS ++++G L ++ M RN+ WN +I G Q+ +E+ L L++ M Sbjct: 122 FNLVVGSSLAHMYMKSGSLSEGERVIKSMPSRNIVAWNTLIAGKSQNGHSEEVLDLYNMM 181 Query: 295 HKLGFLPDAFTLGSVFRGCAGLKDLNTGRQVHGYVVKSGFEWHLVVGSSLAHMYMRSGSL 474 +GF PD T SV C+ L L G+Q+H V+K+G + V SSL MY R G L Sbjct: 182 RIMGFRPDKITFASVISSCSELTTLGQGQQIHAEVIKAGASSVVAVTSSLISMYSRCGCL 241 Query: 475 REGEKV 492 + KV Sbjct: 242 EDSVKV 247