BLASTX nr result
ID: Dioscorea21_contig00039022
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00039022
         (517 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi...   156   2e-36
ref|XP_002305605.1| predicted protein [Populus trichocarpa] gi|2...   125   4e-27
sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c...   123   2e-26
ref|XP_002518527.1| pentatricopeptide repeat-containing protein,...   115   3e-24
ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar...   104   8e-21

>ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like [Vitis vinifera]
          Length = 569

 Score = 156 bits (394), Expect = 2e-36
 Identities = 74/139 (53%), Positives = 100/139 (71%)
 Frame = -3

Query: 515 CGEGWWIEAGDLWNAAMEKSILLNAHCSNSLLEHYCCEGLVDSAMLLHEKIMKLGGCLDA 336
           C  G W EA DL N  +EK +L ++ C ++L+EHYC    +DS++ LHEKI K+ G LD
Sbjct: 430 CKNGRWTEADDLLNVTIEKGLLPDSFCCSALVEHYCRSRQIDSSIALHEKIKKVKGSLDV 489

Query: 335 IAYNSLLRIVLVQRKIEAASRVFDYMRERNVLSSSSYVIMISAFCREKEMRKAMELHDEM 156
             YN LL  + ++++IE A  VFD MR +N+LSS+S+ IM+S  CRE+E+RKAM+ HDEM
Sbjct: 490 ATYNVLLNGLFMEKRIEDAVSVFDCMRSQNLLSSTSFTIMVSGLCRERELRKAMKFHDEM 549

Query: 155 LKVGLKPDDSIYKHLISGF 99
           LK+GLKPD + YK LISGF
Sbjct: 550 LKMGLKPDRATYKRLISGF 568

>ref|XP_002305605.1| predicted protein [Populus trichocarpa]
 gi|222848569|gb|EEE86116.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 125 bits (314), Expect = 4e-27
 Identities = 64/134 (47%), Positives = 84/134 (62%)
 Frame = -3

Query: 500 WIEAGDLWNAAMEKSILLNAHCSNSLLEHYCCEGLVDSAMLLHEKIMKLGGCLDAIAYNS 321
           W E  DL +  +EK +L ++ C  SL+EHYC    +D A+ LH K+ KL   LD   YN
Sbjct: 429 WREVEDLLDLVLEKGLLPDSLCCCSLVEHYCSRRQIDKAVALHNKMEKLQASLDVATYNI 488

Query: 320 LLRIVLVQRKIEAASRVFDYMRERNVLSSSSYVIMISAFCREKEMRKAMELHDEMLKVGL 141
           LL  ++   +IE   RVFDYM+   +++S S+ I I   CR KEMRKAM+LHDEML +GL
Sbjct: 489 LLDGLVKNGRIEEVVRVFDYMKGLKLVNSESFTITIRGLCRAKEMRKAMKLHDEMLDMGL 548

Query: 140 KPDDSIYKHLISGF 99
           KPD + YK LI  F
Sbjct: 549 KPDKAAYKRLILEF 562

>sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170
          Length = 585

 Score = 123 bits (308), Expect = 2e-26
 Identities = 63/141 (44%), Positives = 91/141 (64%), Gaps = 2/141 (1%)
 Frame = -3

Query: 515 CGEGWWIEAGDLWNAAMEKSILLNAHCSNSLLEHYCCEGLVDSAMLLHEKIMKLGGCLDA 336
           C +  W  A  L ++ ME  +  ++     L+E YC  G ++ A++LHEKI K+ G LD
Sbjct: 444 CRKRRWKSAEKLLDSVMEMEVYFDSFACGLLMERYCRSGKLEKALVLHEKIKKMKGSLDV 503

Query: 335 IAYNSLLRIVLVQRK--IEAASRVFDYMRERNVLSSSSYVIMISAFCREKEMRKAMELHD 162
            AYN++L  +++++K  +E A  VF+YM+E N ++S S+ IMI   CR KEM+KAM  HD
Sbjct: 504 NAYNAVLDRLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHD 563

Query: 161 EMLKVGLKPDDSIYKHLISGF 99
           EML++GLKPD   YK LI GF
Sbjct: 564 EMLRLGLKPDLVTYKRLILGF 584

>ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223542372|gb|EEF43914.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 599

 Score = 115 bits (289), Expect = 3e-24
 Identities = 59/139 (42%), Positives = 85/139 (61%)
 Frame = -3

Query: 515 CGEGWWIEAGDLWNAAMEKSILLNAHCSNSLLEHYCCEGLVDSAMLLHEKIMKLGGCLDA 336
           C +  W EA +L    +EK +L +     SL++HYC     D A+ LH  + KL   LD
Sbjct: 453 CKKRRWKEAEELLYMVLEKGLLPDTLSFCSLVKHYCSSKQTDKALALHNTLEKLQASLDI 512

Query: 335 IAYNSLLRIVLVQRKIEAASRVFDYMRERNVLSSSSYVIMISAFCREKEMRKAMELHDEM 156
            AYN LL  ++ + ++E + +VFDYM+   + +S+S+ ++I   CR KE+RKAM+LHDEM
Sbjct: 513 TAYNLLLGGLVKEGRVEESIKVFDYMKGLKLANSASFTVIIRGLCRAKELRKAMKLHDEM 572

Query: 155 LKVGLKPDDSIYKHLISGF 99
           L +GLKPD   YK LI  F
Sbjct: 573 LNMGLKPDKPTYKRLILEF 591

>ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332659015|gb|AEE84415.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 551

 Score = 104 bits (259), Expect = 8e-21
 Identities = 53/103 (51%), Positives = 74/103 (71%), Gaps = 2/103 (1%)
 Frame = -3

Query: 401 GLVDSAMLLHEKIMKLGGCLDAIAYNSLLRIVLVQRK--IEAASRVFDYMRERNVLSSSS 228
           G ++ A++LHEKI K+ G LD  AYN++L  +++++K  +E A  VF+YM+E N ++S S
Sbjct: 448 GKLEKALVLHEKIKKMKGSLDVNAYNAVLDRLMMRQKEMVEEAVVVFEYMKEINSVNSKS 507

Query: 227 YVIMISAFCREKEMRKAMELHDEMLKVGLKPDDSIYKHLISGF 99
           + IMI   CR KEM+KAM  HDEML++GLKPD   YK LI GF
Sbjct: 508 FTIMIQGLCRVKEMKKAMRSHDEMLRLGLKPDLVTYKRLILGF 550