BLASTX nr result

ID: Dioscorea21_contig00017024

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00017024
         (1980 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002467270.1| hypothetical protein SORBIDRAFT_01g022360 [S...   219   3e-54
gb|ABB47573.1| expressed protein [Oryza sativa Japonica Group] g...   213   1e-52
ref|NP_001064562.1| Os10g0406200 [Oryza sativa Japonica Group] g...   213   1e-52
ref|XP_003573899.1| PREDICTED: uncharacterized protein LOC100841...   208   5e-51
ref|XP_003520084.1| PREDICTED: uncharacterized protein LOC100789...   204   5e-50

>ref|XP_002467270.1| hypothetical protein SORBIDRAFT_01g022360 [Sorghum bicolor]
            gi|241921124|gb|EER94268.1| hypothetical protein
            SORBIDRAFT_01g022360 [Sorghum bicolor]
          Length = 436

 Score =  219 bits (557), Expect = 3e-54
 Identities = 110/219 (50%), Positives = 141/219 (64%), Gaps = 3/219 (1%)
 Frame = +1

Query: 889  TPVTYSKSSPP--SDFKSIQAFTELHSPGEFEFGGSSRRGTGQWSSASSIELADVXXXXX 1062
            +PV  ++S+ P  S+F       EL  PG  ++G  S   +G WS+ASS++L D+     
Sbjct: 218  SPVAIARSNNPLCSEFNGTG---ELRFPGPMDYGSGSHGESGNWSAASSMDLTDLSERPE 274

Query: 1063 XXXXXXXXXN-IYEGAKCCLCERFLSQRSPWGSRRIVRNGDMPVVSVLSCWHVFHAECLE 1239
                     N + +  +C LCE+ L++RSPWGSRRIVR GD+PV  VL C HV+HAECLE
Sbjct: 275  AGQAGPLRPNNVMQKTRCDLCEKLLTKRSPWGSRRIVRTGDLPVAGVLPCSHVYHAECLE 334

Query: 1240 RTTSKTQKHDPPCPLCEKSEENVWEQWAACRLKNGVPRLKPLGEEGPSKVWTCGQVGDCV 1419
            RTT K QKHDPPCP+C+K      EQW+ CRLKNG PRL+ LG EGPS+VW+C   GDCV
Sbjct: 335  RTTPKGQKHDPPCPVCDKLAGKDTEQWSICRLKNGFPRLRSLG-EGPSRVWSCAHAGDCV 393

Query: 1420 ERALQTPKRANMLLLNRSRLKRQLSLKGSSGKDWAENSK 1536
              A+Q P+  ++ LL+RS  KR  S KG   KDWAE SK
Sbjct: 394  AGAVQIPRSNSIKLLSRSGHKRHASSKGEPSKDWAETSK 432



 Score = 65.5 bits (158), Expect = 5e-08
 Identities = 42/105 (40%), Positives = 52/105 (49%), Gaps = 12/105 (11%)
 Frame = +1

Query: 361 WDLRFRSEGRSLGSHGDS-----------SNSK-ESRNWLRGERGEFGHSHRYSPSDGVG 504
           W+ R  SEG S GSHGDS           SNSK  SR+W   ER E    HRYS S+G  
Sbjct: 55  WEYRINSEGLSYGSHGDSGVAVNYGSSLSSNSKGASRSW---ERNELPQDHRYSTSEGAI 111

Query: 505 SYIGSPSDSFLYHHLTPSSAIGVNLDDETREQASGPLSFSRLPEV 639
           SY+ SP  SF  HH+        ++D+  R   + P+    L EV
Sbjct: 112 SYLNSPDVSFQNHHIMLPMLQDSSVDEYMRVSVAEPIGALLLSEV 156


>gb|ABB47573.1| expressed protein [Oryza sativa Japonica Group]
            gi|125531860|gb|EAY78425.1| hypothetical protein
            OsI_33515 [Oryza sativa Indica Group]
          Length = 433

 Score =  213 bits (543), Expect = 1e-52
 Identities = 112/218 (51%), Positives = 142/218 (65%), Gaps = 5/218 (2%)
 Frame = +1

Query: 904  SKSSP-PSDFKSIQAFTELHSPG--EFEFGGSSRRGTGQWSSASSIELADVXXXXXXXXX 1074
            S SSP PS+FK+I    E+   G  ++ +   S   +  WS+ASS++L D+         
Sbjct: 215  STSSPMPSEFKAIG---EIRPSGLMDYAYASGSHGESANWSAASSMDLTDLSERHDAERS 271

Query: 1075 XXXXXN-IYEGAKCCLCERFLSQRSPWGSRRIVRNGDMPVVSVLSCWHVFHAECLERTTS 1251
                 N I +  +C LCER LS+RSPWGSRRIVR GD+PV  VL C HV+HAECLERTT 
Sbjct: 272  GPLRSNNIMDRTRCDLCERLLSKRSPWGSRRIVRTGDLPVAGVLPCCHVYHAECLERTTP 331

Query: 1252 KTQKHDPPCPLCEKSEENVWEQWAACRLKNGVPRLKPLGEEGPSKVWTCGQVGDCVERAL 1431
            K QKHDPPCP C++      EQW+ CRL+NG PRL+ LG EGPS+VW+C Q GDCV  A+
Sbjct: 332  KGQKHDPPCPACDRLSGKDTEQWSICRLRNGFPRLRSLG-EGPSRVWSCAQAGDCVAGAV 390

Query: 1432 QTPKRANMLLLNRSRLKR-QLSLKGSSGKDWAENSKKS 1542
            Q P+ +++ LL+RS  KR   + KG SGKDWAE S  S
Sbjct: 391  QIPRASSISLLSRSGHKRHHAASKGESGKDWAETSSSS 428



 Score =  118 bits (295), Expect = 7e-24
 Identities = 67/147 (45%), Positives = 84/147 (57%), Gaps = 13/147 (8%)
 Frame = +1

Query: 235 GSLCCVAARPHGSSTGSGDWS-MARSEPFWQNNSSFSPPLSRRWDLRFRSEGRSLGSHGD 411
           GSLCCVA+RPHG+ST S +WS + RS+P W+ N+ FSPPLSRRW+ R  SEG S GS GD
Sbjct: 2   GSLCCVASRPHGASTASREWSSIGRSDPLWRTNAGFSPPLSRRWEYRINSEGLSYGSQGD 61

Query: 412 -----------SSNSKE-SRNWLRGERGEFGHSHRYSPSDGVGSYIGSPSDSFLYHHLTP 555
                      SSNSKE SR+W R +       HRYS S+G  SY  SP  +F  HH+  
Sbjct: 62  SGAAAHYGSSLSSNSKEPSRSWERSDVPP--DHHRYSTSEGAISYFNSPDVTFQNHHIML 119

Query: 556 SSAIGVNLDDETREQASGPLSFSRLPE 636
                  +D+  R   + P+    L E
Sbjct: 120 PMLQDSGIDEYMRVSVAEPIGALLLSE 146


>ref|NP_001064562.1| Os10g0406200 [Oryza sativa Japonica Group]
            gi|15451552|gb|AAK98676.1|AC021893_10 Unknown protein
            [Oryza sativa Japonica Group]
            gi|113639171|dbj|BAF26476.1| Os10g0406200 [Oryza sativa
            Japonica Group] gi|125574736|gb|EAZ16020.1| hypothetical
            protein OsJ_31466 [Oryza sativa Japonica Group]
          Length = 498

 Score =  213 bits (543), Expect = 1e-52
 Identities = 112/218 (51%), Positives = 142/218 (65%), Gaps = 5/218 (2%)
 Frame = +1

Query: 904  SKSSP-PSDFKSIQAFTELHSPG--EFEFGGSSRRGTGQWSSASSIELADVXXXXXXXXX 1074
            S SSP PS+FK+I    E+   G  ++ +   S   +  WS+ASS++L D+         
Sbjct: 280  STSSPMPSEFKAIG---EIRPSGLMDYAYASGSHGESANWSAASSMDLTDLSERHDAERS 336

Query: 1075 XXXXXN-IYEGAKCCLCERFLSQRSPWGSRRIVRNGDMPVVSVLSCWHVFHAECLERTTS 1251
                 N I +  +C LCER LS+RSPWGSRRIVR GD+PV  VL C HV+HAECLERTT 
Sbjct: 337  GPLRSNNIMDRTRCDLCERLLSKRSPWGSRRIVRTGDLPVAGVLPCCHVYHAECLERTTP 396

Query: 1252 KTQKHDPPCPLCEKSEENVWEQWAACRLKNGVPRLKPLGEEGPSKVWTCGQVGDCVERAL 1431
            K QKHDPPCP C++      EQW+ CRL+NG PRL+ LG EGPS+VW+C Q GDCV  A+
Sbjct: 397  KGQKHDPPCPACDRLSGKDTEQWSICRLRNGFPRLRSLG-EGPSRVWSCAQAGDCVAGAV 455

Query: 1432 QTPKRANMLLLNRSRLKR-QLSLKGSSGKDWAENSKKS 1542
            Q P+ +++ LL+RS  KR   + KG SGKDWAE S  S
Sbjct: 456  QIPRASSISLLSRSGHKRHHAASKGESGKDWAETSSSS 493



 Score =  102 bits (254), Expect = 4e-19
 Identities = 60/139 (43%), Positives = 76/139 (54%), Gaps = 13/139 (9%)
 Frame = +1

Query: 259 RPHGSSTGSGDWS-MARSEPFWQNNSSFSPPLSRRWDLRFRSEGRSLGSHGD-------- 411
           RPHG+ST S +WS + RS+P W+ N+ FSPPLSRRW+ R  SEG S GS GD        
Sbjct: 75  RPHGASTASREWSSIGRSDPLWRTNAGFSPPLSRRWEYRINSEGLSYGSQGDSGAAAHYG 134

Query: 412 ---SSNSKE-SRNWLRGERGEFGHSHRYSPSDGVGSYIGSPSDSFLYHHLTPSSAIGVNL 579
              SSNSKE SR+W R +       HRYS S+G  SY  SP  +F  HH+         +
Sbjct: 135 SSLSSNSKEPSRSWERSDVPP--DHHRYSTSEGAISYFNSPDVTFQNHHIMLPMLQDSGI 192

Query: 580 DDETREQASGPLSFSRLPE 636
           D+  R   + P+    L E
Sbjct: 193 DEYMRVSVAEPIGALLLSE 211


>ref|XP_003573899.1| PREDICTED: uncharacterized protein LOC100841348 [Brachypodium
            distachyon]
          Length = 420

 Score =  208 bits (529), Expect = 5e-51
 Identities = 107/216 (49%), Positives = 135/216 (62%), Gaps = 3/216 (1%)
 Frame = +1

Query: 889  TPVTYSKSSPP--SDFKSIQAFTELHSPGEFEFGGSSRRGTGQWSSASSIELAD-VXXXX 1059
            +PV  + S+ P  S+FK I    E  SPG  ++   S   +  WS+ SS++L D      
Sbjct: 208  SPVASTNSNNPLRSEFKGIG---ERSSPGLMDYASGSHEESADWSAPSSMDLTDFTEQHV 264

Query: 1060 XXXXXXXXXXNIYEGAKCCLCERFLSQRSPWGSRRIVRNGDMPVVSVLSCWHVFHAECLE 1239
                      NI +  +C LCER LS+RSPWGSRRIVR GD+P+  VL C HV+HAECLE
Sbjct: 265  AERIAALHPINIMDKTRCDLCERLLSKRSPWGSRRIVRTGDLPIAGVLPCCHVYHAECLE 324

Query: 1240 RTTSKTQKHDPPCPLCEKSEENVWEQWAACRLKNGVPRLKPLGEEGPSKVWTCGQVGDCV 1419
            R+T K QKHDPPCP+C+K      EQW+ CRLKNG PRL+ LG EGPS+VW+C Q GDCV
Sbjct: 325  RSTPKGQKHDPPCPVCDKLAGKDTEQWSICRLKNGFPRLRSLG-EGPSRVWSCAQAGDCV 383

Query: 1420 ERALQTPKRANMLLLNRSRLKRQLSLKGSSGKDWAE 1527
              A+Q P+ + + LL RS  +R    KG SGKD  E
Sbjct: 384  AAAVQIPRPSGIALLGRSGHRRHGPSKGESGKDCTE 419



 Score =  117 bits (293), Expect = 1e-23
 Identities = 66/147 (44%), Positives = 84/147 (57%), Gaps = 13/147 (8%)
 Frame = +1

Query: 235 GSLCCVAARPHGSSTGSGDWS-MARSEPFWQNNSSFSPPLSRRWDLRFRSEGRSLGSHGD 411
           GSLCCVAARPHG+ST S +WS + RS+P W+  + +SPPLSRRW+ R  SEG S G+H D
Sbjct: 2   GSLCCVAARPHGTSTASREWSSVGRSDPLWRTTTGYSPPLSRRWEYRINSEGLSYGNHVD 61

Query: 412 -----------SSNSKE-SRNWLRGERGEFGHSHRYSPSDGVGSYIGSPSDSFLYHHLTP 555
                      SSNSK+ SR+W   ER E    HRYS S+   SY  SP  SF  HH+  
Sbjct: 62  SGVAANYGSSLSSNSKDASRSW---ERSEVQPDHRYSTSESAISYFNSPDVSFQNHHIML 118

Query: 556 SSAIGVNLDDETREQASGPLSFSRLPE 636
                 ++D+  R   + P+    L E
Sbjct: 119 PMLQDSSIDEYMRVSVAEPIGALLLSE 145


>ref|XP_003520084.1| PREDICTED: uncharacterized protein LOC100789831 [Glycine max]
          Length = 508

 Score =  204 bits (520), Expect = 5e-50
 Identities = 112/244 (45%), Positives = 143/244 (58%), Gaps = 6/244 (2%)
 Frame = +1

Query: 904  SKSSPPSDFKSIQAFTELHSPGEFEFG----GSSRRGTGQWSSASSI-ELADVXXXXXXX 1068
            SK   P  F  +    +   P   +F      +  R   +WSSASS  E AD+       
Sbjct: 260  SKPIHPMSFNDLTTTRDAFDPAVTDFTEFDTSTPLRDGHRWSSASSSQEFADITESFELE 319

Query: 1069 XXXXXXXNIYEGAKCCLCERFLSQRSPWGSRRIVRNGDMPVVSVLSCWHVFHAECLERTT 1248
                    + +G +C LCERFL+QRSPW SRRIVR+GDMP + VL C H FHAECLE+TT
Sbjct: 320  TPGRSHF-LSDGFRCGLCERFLTQRSPWSSRRIVRSGDMPTIGVLPCCHAFHAECLEQTT 378

Query: 1249 SKTQKHDPPCPLCEK-SEENVWEQWAACRLKNGVPRLKPLGEEGPSKVWTCGQVGDCVER 1425
             KTQK DPPCP+C K  EEN  +Q    RL+ G PRLK   ++GPS+ W C QVGDCVE 
Sbjct: 379  PKTQKSDPPCPVCVKLEEENSPDQRGHLRLRTGFPRLKSSRDDGPSRPWGCVQVGDCVEG 438

Query: 1426 ALQTPKRANMLLLNRSRLKRQLSLKGSSGKDWAENSKKSGCYSSKVLDGRKLGDQLTVAH 1605
            AL  P R  MLLLNR+R+K+ LSLKG+ GK++    +K+G +SS +  G     +   + 
Sbjct: 439  ALHAPPRNTMLLLNRNRIKKNLSLKGNIGKEFPGKMRKNGTFSSHLFSGSSADGEAVGSS 498

Query: 1606 SRTA 1617
              TA
Sbjct: 499  KATA 502



 Score =  108 bits (269), Expect = 7e-21
 Identities = 59/128 (46%), Positives = 75/128 (58%), Gaps = 8/128 (6%)
 Frame = +1

Query: 226 AKTGSLCCVAARPHGSSTGSGDWSMARSEPFWQNNSSFSPPLSRRWDLRFRSEGR----- 390
           AKTGSLCCVA+RPH S+ GS DWSM  +EP+W+ NSS+SPP   RWD RF+SEG      
Sbjct: 71  AKTGSLCCVASRPHESNAGSRDWSMGPNEPYWRTNSSYSPP-PTRWDFRFQSEGLPYDVN 129

Query: 391 ---SLGSHGDSSNSKESRNWLRGERGEFGHSHRYSPSDGVGSYIGSPSDSFLYHHLTPSS 561
               L     SS  KESR W+RG      +   YS SD  G ++ SPSD       TP +
Sbjct: 130 DGVQLYGSSTSSIDKESRGWVRGNH---LYDLHYSASDDTGIFLSSPSDLSQGPQWTPPA 186

Query: 562 AIGVNLDD 585
              +++D+
Sbjct: 187 IQEISIDN 194

