BLASTX nr result

ID: Dioscorea21_contig00022354 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00022354
         (515 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281132.1| PREDICTED: pentatricopeptide repeat-containi...   116   2e-24
ref|XP_003537906.1| PREDICTED: pentatricopeptide repeat-containi...   112   2e-23
ref|XP_002519113.1| pentatricopeptide repeat-containing protein,...   105   4e-21
ref|NP_179197.1| pentatricopeptide repeat-containing protein [Ar...   100   9e-20
ref|XP_002306075.1| predicted protein [Populus trichocarpa] gi|2...   100   2e-19

>ref|XP_002281132.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like [Vitis vinifera]
          Length = 492

 Score =  116 bits (290), Expect = 2e-24
 Identities = 71/173 (41%), Positives = 99/173 (57%), Gaps = 13/173 (7%)
 Frame = +3

Query: 3   ISILKEHRSRSRWSQLKAILPPTGIPPSATIRILSALRNRPRLALAFFTF----SRCND- 167
           +SIL+  RS+SRWS L+++ P  G  P+   +I+  ++N P LAL+FF +    S CN  
Sbjct: 41  VSILRHQRSKSRWSHLQSLFPK-GFTPTEASQIVLQIKNNPHLALSFFLWCHHKSLCNHT 99

Query: 168 LLSFSAAAHIXXXXXXXXXXXXXXXXXXXXYDS--------PAIFETLTKTYRSFDSAPF 323
           LLS+S   HI                    +D         P IFE+L KTY S  SAPF
Sbjct: 100 LLSYSTIIHILARARLKSQALGLIRTAIRVFDDSDECSSQPPKIFESLVKTYNSCGSAPF 159

Query: 324 VFDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAG 482
           VFDLLI A L++K+ ++++ I ++L SRGI P +ST N+LI  VS+  G DAG
Sbjct: 160 VFDLLIKACLNSKRIEQSISIVKMLRSRGISPTISTCNALIWQVSRGRGCDAG 212


>ref|XP_003537906.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like [Glycine max]
          Length = 487

 Score =  112 bits (281), Expect = 2e-23
 Identities = 74/177 (41%), Positives = 99/177 (55%), Gaps = 17/177 (9%)
 Frame = +3

Query: 3   ISILKEHRSRSRWSQLKAILPPTGIPPSATIRILSALRNRPRLALAFFTFSR----CN-D 167
           +SIL  HRS+SRWS L++  P  GI P+    I   ++N+P+LAL FF +++    CN +
Sbjct: 39  VSILTHHRSKSRWSNLRSACP-NGITPAEFSEITLHIKNKPQLALRFFLWTKSKSLCNHN 97

Query: 168 LLSFSA----------AAHIXXXXXXXXXXXXXXXXXXXXYDSPAI--FETLTKTYRSFD 311
           L S+S+          ++H                     ++S  +  FETL KTYR   
Sbjct: 98  LASYSSIIHLLARARLSSHAYDLIRTAIRASHQNDEENCRFNSRPLNLFETLVKTYRDSG 157

Query: 312 SAPFVFDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAG 482
           SAPFVFDLLI A LD+KK D ++ I R+LLSRGI P +ST NSLI  V K  G D G
Sbjct: 158 SAPFVFDLLIKACLDSKKLDPSIEIVRMLLSRGISPKVSTLNSLISRVCKSRGVDEG 214


>ref|XP_002519113.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223541776|gb|EEF43324.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 486

 Score =  105 bits (262), Expect = 4e-21
 Identities = 72/180 (40%), Positives = 94/180 (52%), Gaps = 19/180 (10%)
 Frame = +3

Query: 6   SILKEHRSRSRWSQLKAILPPTG--IPPSATIRILSALRNRPRLALAFFTFSRCN----- 164
           S+L  HRS+SRW+ L++++  +   + P+   +I+  L++ PRLAL FF F+  N     
Sbjct: 40  SLLIHHRSKSRWTHLRSLILTSNKTLTPTHFSQIILLLKSNPRLALRFFHFTLRNPSFCS 99

Query: 165 -DLLSFSAAAHIXXXXXXXXXXXXXXXXXXXXYDSPAI-----------FETLTKTYRSF 308
            DL S S   HI                    + SP +           FE L KTYR  
Sbjct: 100 HDLRSISTITHILSRARLKPQAQSIIHLA---FTSPVLVDDSNGQALKFFEILVKTYREC 156

Query: 309 DSAPFVFDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAGLG 488
           DSAPFVFDLLI + L+ KK D  + I RLL SRGI PL+ST N L+  VSK  G  AG G
Sbjct: 157 DSAPFVFDLLIKSCLELKKIDDGLKIVRLLRSRGISPLISTCNFLVSWVSKCKGCYAGYG 216


>ref|NP_179197.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75267579|sp|Q9XIM8.1|PP155_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g15980 gi|5306237|gb|AAD41970.1| hypothetical protein
           [Arabidopsis thaliana] gi|330251359|gb|AEC06453.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 498

 Score =  100 bits (250), Expect = 9e-20
 Identities = 65/172 (37%), Positives = 89/172 (51%), Gaps = 12/172 (6%)
 Frame = +3

Query: 3   ISILKEHRSRSRWSQLKAILPPTGIPPSATIRILSALRNRPRLALAFFTFSR-------- 158
           +SIL  HRS+SRWS L++ L P+G  PS    I   LRN P L+L FF F+R        
Sbjct: 46  VSILTHHRSKSRWSTLRS-LQPSGFTPSQFSEITLCLRNNPHLSLRFFLFTRRYSLCSHD 104

Query: 159 ---CNDLLSFSAAAHIXXXXXXXXXXXXXXXXXXXXYDSPA-IFETLTKTYRSFDSAPFV 326
              C+ L+   + + +                     D    +F +L K+Y    SAPFV
Sbjct: 105 THSCSTLIHILSRSRLKSHASEIIRLALRLAATDEDEDRVLKVFRSLIKSYNRCGSAPFV 164

Query: 327 FDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAG 482
           FDLLI + LD+K+ D AV + R L SRGI+  +ST N+LI  VS+  G+  G
Sbjct: 165 FDLLIKSCLDSKEIDGAVMVMRKLRSRGINAQISTCNALITEVSRRRGASNG 216


>ref|XP_002306075.1| predicted protein [Populus trichocarpa] gi|222849039|gb|EEE86586.1|
           predicted protein [Populus trichocarpa]
          Length = 498

 Score =  100 bits (248), Expect = 2e-19
 Identities = 67/177 (37%), Positives = 90/177 (50%), Gaps = 15/177 (8%)
 Frame = +3

Query: 3   ISILKEHRSRSRWSQLKAILPPTGIPPSATIR---ILSALRNRPRLALAFFTFSRCN--- 164
           IS+L  HRS+SRWS L+++L  T   P A      I   L++ P LAL+FF F+  N   
Sbjct: 44  ISLLTHHRSKSRWSHLRSLLTTTTSTPLAPGHFSLITLKLKSNPHLALSFFHFTLHNSSL 103

Query: 165 ---DLLSFSAAAHIXXXXXXXXXXXXXXXXXXXXY------DSPAIFETLTKTYRSFDSA 317
              +L S++   HI                                FE L K+YR  DSA
Sbjct: 104 CSHNLRSYATIIHILSRARLKAHAQEIIRAGLRSQILYHLLKEVRFFEVLVKSYRECDSA 163

Query: 318 PFVFDLLILANLDAKKFDRAVYITRLLLSRGIHPLLSTSNSLIRSVSKLNGSDAGLG 488
           PFVFDLLI + L+ KK D ++ I ++L S+GI P +ST N+LI  VS+  GS  G G
Sbjct: 164 PFVFDLLIKSCLELKKIDGSIEIVKMLRSKGISPSISTCNALISEVSRCKGSFVGYG 220