BLASTX nr result
ID: Dioscorea21_contig00022354
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00022354 (515 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002281132.1| PREDICTED: pentatricopeptide repeat-containi... 116 2e-24 ref|XP_003537906.1| PREDICTED: pentatricopeptide repeat-containi... 112 2e-23 ref|XP_002519113.1| pentatricopeptide repeat-containing protein,... 105 4e-21 ref|NP_179197.1| pentatricopeptide repeat-containing protein [Ar... 100 9e-20 ref|XP_002306075.1| predicted protein [Populus trichocarpa] gi|2... 100 2e-19 >ref|XP_002281132.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15980-like [Vitis vinifera] Length = 492 Score = 116 bits (290), Expect = 2e-24 Identities = 71/173 (41%), Positives = 99/173 (57%), Gaps = 13/173 (7%) Frame = +3 Query: 3 ISILKEHRSRSRWSQLKAILPPTGIPPSATIRILSALRNRPRLALAFFTF----SRCND- 167 +SIL+ RS+SRWS L+++ P G P+ +I+ ++N P LAL+FF + S CN Sbjct: 41 VSILRHQRSKSRWSHLQSLFPK-GFTPTEASQIVLQIKNNPHLALSFFLWCHHKSLCNHT 99 Query: 168 LLSFSAAAHIXXXXXXXXXXXXXXXXXXXXYDS--------PAIFETLTKTYRSFDSAPF 323 LLS+S HI +D P IFE+L KTY S SAPF Sbjct: 100 LLSYSTIIHILARARLKSQALGLIRTAIRVFDDSDECSSQPPKIFESLVKTYNSCGSAPF 159 Query: 324 VFDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAG 482 VFDLLI A L++K+ ++++ I ++L SRGI P +ST N+LI VS+ G DAG Sbjct: 160 VFDLLIKACLNSKRIEQSISIVKMLRSRGISPTISTCNALIWQVSRGRGCDAG 212 >ref|XP_003537906.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15980-like [Glycine max] Length = 487 Score = 112 bits (281), Expect = 2e-23 Identities = 74/177 (41%), Positives = 99/177 (55%), Gaps = 17/177 (9%) Frame = +3 Query: 3 ISILKEHRSRSRWSQLKAILPPTGIPPSATIRILSALRNRPRLALAFFTFSR----CN-D 167 +SIL HRS+SRWS L++ P GI P+ I ++N+P+LAL FF +++ CN + Sbjct: 39 VSILTHHRSKSRWSNLRSACP-NGITPAEFSEITLHIKNKPQLALRFFLWTKSKSLCNHN 97 Query: 168 LLSFSA----------AAHIXXXXXXXXXXXXXXXXXXXXYDSPAI--FETLTKTYRSFD 311 L S+S+ ++H ++S + FETL KTYR Sbjct: 98 LASYSSIIHLLARARLSSHAYDLIRTAIRASHQNDEENCRFNSRPLNLFETLVKTYRDSG 157 Query: 312 SAPFVFDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAG 482 SAPFVFDLLI A LD+KK D ++ I R+LLSRGI P +ST NSLI V K G D G Sbjct: 158 SAPFVFDLLIKACLDSKKLDPSIEIVRMLLSRGISPKVSTLNSLISRVCKSRGVDEG 214 >ref|XP_002519113.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223541776|gb|EEF43324.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 486 Score = 105 bits (262), Expect = 4e-21 Identities = 72/180 (40%), Positives = 94/180 (52%), Gaps = 19/180 (10%) Frame = +3 Query: 6 SILKEHRSRSRWSQLKAILPPTG--IPPSATIRILSALRNRPRLALAFFTFSRCN----- 164 S+L HRS+SRW+ L++++ + + P+ +I+ L++ PRLAL FF F+ N Sbjct: 40 SLLIHHRSKSRWTHLRSLILTSNKTLTPTHFSQIILLLKSNPRLALRFFHFTLRNPSFCS 99 Query: 165 -DLLSFSAAAHIXXXXXXXXXXXXXXXXXXXXYDSPAI-----------FETLTKTYRSF 308 DL S S HI + SP + FE L KTYR Sbjct: 100 HDLRSISTITHILSRARLKPQAQSIIHLA---FTSPVLVDDSNGQALKFFEILVKTYREC 156 Query: 309 DSAPFVFDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAGLG 488 DSAPFVFDLLI + L+ KK D + I RLL SRGI PL+ST N L+ VSK G AG G Sbjct: 157 DSAPFVFDLLIKSCLELKKIDDGLKIVRLLRSRGISPLISTCNFLVSWVSKCKGCYAGYG 216 >ref|NP_179197.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75267579|sp|Q9XIM8.1|PP155_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g15980 gi|5306237|gb|AAD41970.1| hypothetical protein [Arabidopsis thaliana] gi|330251359|gb|AEC06453.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 498 Score = 100 bits (250), Expect = 9e-20 Identities = 65/172 (37%), Positives = 89/172 (51%), Gaps = 12/172 (6%) Frame = +3 Query: 3 ISILKEHRSRSRWSQLKAILPPTGIPPSATIRILSALRNRPRLALAFFTFSR-------- 158 +SIL HRS+SRWS L++ L P+G PS I LRN P L+L FF F+R Sbjct: 46 VSILTHHRSKSRWSTLRS-LQPSGFTPSQFSEITLCLRNNPHLSLRFFLFTRRYSLCSHD 104 Query: 159 ---CNDLLSFSAAAHIXXXXXXXXXXXXXXXXXXXXYDSPA-IFETLTKTYRSFDSAPFV 326 C+ L+ + + + D +F +L K+Y SAPFV Sbjct: 105 THSCSTLIHILSRSRLKSHASEIIRLALRLAATDEDEDRVLKVFRSLIKSYNRCGSAPFV 164 Query: 327 FDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAG 482 FDLLI + LD+K+ D AV + R L SRGI+ +ST N+LI VS+ G+ G Sbjct: 165 FDLLIKSCLDSKEIDGAVMVMRKLRSRGINAQISTCNALITEVSRRRGASNG 216 >ref|XP_002306075.1| predicted protein [Populus trichocarpa] gi|222849039|gb|EEE86586.1| predicted protein [Populus trichocarpa] Length = 498 Score = 100 bits (248), Expect = 2e-19 Identities = 67/177 (37%), Positives = 90/177 (50%), Gaps = 15/177 (8%) Frame = +3 Query: 3 ISILKEHRSRSRWSQLKAILPPTGIPPSATIR---ILSALRNRPRLALAFFTFSRCN--- 164 IS+L HRS+SRWS L+++L T P A I L++ P LAL+FF F+ N Sbjct: 44 ISLLTHHRSKSRWSHLRSLLTTTTSTPLAPGHFSLITLKLKSNPHLALSFFHFTLHNSSL 103 Query: 165 ---DLLSFSAAAHIXXXXXXXXXXXXXXXXXXXXY------DSPAIFETLTKTYRSFDSA 317 +L S++ HI FE L K+YR DSA Sbjct: 104 CSHNLRSYATIIHILSRARLKAHAQEIIRAGLRSQILYHLLKEVRFFEVLVKSYRECDSA 163 Query: 318 PFVFDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAGLG 488 PFVFDLLI + L+ KK D ++ I ++L S+GI P +ST N+LI VS+ GS G G Sbjct: 164 PFVFDLLIKSCLELKKIDGSIEIVKMLRSKGISPSISTCNALISEVSRCKGSFVGYG 220