BLASTX nr result

ID: Dioscorea21_contig00005847 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00005847
         (1457 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002526713.1| conserved hypothetical protein [Ricinus comm...   285   2e-74
ref|XP_003520674.1| PREDICTED: uncharacterized protein LOC100783...   261   3e-67
ref|XP_003543910.1| PREDICTED: uncharacterized protein LOC100777...   259   1e-66
gb|AEL30354.1| retrotransposon gag protein [Arachis hypogaea]         254   4e-65
gb|ACY01934.1| hypothetical protein [Beta vulgaris]                   254   4e-65

>ref|XP_002526713.1| conserved hypothetical protein [Ricinus communis]
            gi|223533946|gb|EEF35670.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 496

 Score =  285 bits (729), Expect = 2e-74
 Identities = 143/292 (48%), Positives = 205/292 (70%), Gaps = 15/292 (5%)
 Frame = -3

Query: 1299 TRVEIDPSVKETGIAIVEDPKSAEKIVKESDHDGNNESHSRGQSKPPEYKPQIPYPSRLK 1120
            T V+++P+ +     +VE+ +  E                R +S   E++P +PYP+RL+
Sbjct: 149  TIVQVEPAKEGPDFKVVENERMKE---------------DRQKSPVREHQPPVPYPARLR 193

Query: 1119 QDKEDVQFRKFINIFKQLHINIPIVEALTQMPKYAKFLKELLTNKRKLEEEGTVALSGNC 940
            QDK D Q+ KF+++FKQL IN+P VEA++QM KYAKFLKE+L+N RKLE+ G V L+  C
Sbjct: 194  QDKVDKQYSKFLDLFKQLKINLPFVEAISQMAKYAKFLKEILSNNRKLEDLGQVVLNEEC 253

Query: 939  SAILEKKLPQKLKDPGSFIIPCVLEGGVEENALADSGASINVMPYSMYLKLGLDELRPTR 760
            SAIL+ KLP K +D  SF IPC++       ALAD GASIN+MP S++ KLGL E +PTR
Sbjct: 254  SAILQNKLPLKRRDLESFTIPCMIGDLSISGALADLGASINLMPTSLFAKLGLHEPKPTR 313

Query: 759  MTLQLADRSTRTPRGIVEDVIVRVDKFVFPVDFVIMDIDEDVETPLILGRPFLSTSGALL 580
            M++QLADR+ + PRGI+EDV+++VDKF+FPVDFV+MD++ +   PLILGRPFL+TS A++
Sbjct: 314  MSVQLADRTVKIPRGIIEDVLIKVDKFIFPVDFVVMDMEGESTVPLILGRPFLATSRAVI 373

Query: 579  DLKGGKLTLRVGEEEAVYTLPAAMKHSLDHDDKLYFIDETDRLISTCVQEVL 424
            D+  GKL LRV +E   + L  +M+ SLD+D+ +Y  D  D ++ + +QE+L
Sbjct: 374  DVSDGKLKLRVDDETITFDLATSMRQSLDYDNTVYSTDVIDDVVESHLQEIL 425


>ref|XP_003520674.1| PREDICTED: uncharacterized protein LOC100783109 [Glycine max]
          Length = 1771

 Score =  261 bits (667), Expect = 3e-67
 Identities = 138/333 (41%), Positives = 206/333 (61%), Gaps = 2/333 (0%)
 Frame = -3

Query: 1446 ASVQSLEHQIGQLAKENLERPSCSLPSCPEENPREHLKAIALRSGK--QVETRVEIDPSV 1273
            A++++LE Q+GQLA++  ERP+ +  +  E+NP+E  KA+  R  +  Q E +VE +   
Sbjct: 568  AAIRNLEVQMGQLAQDKAERPTRTFGANTEKNPKEECKAVLTRGQRKAQEEGKVEEEDQT 627

Query: 1272 KETGIAIVEDPKSAEKIVKESDHDGNNESHSRGQSKPPEYKPQIPYPSRLKQDKEDVQFR 1093
            +E    I ED    E+ V       + ++    + +PP     +PYP    +  ++  F+
Sbjct: 628  EEDKTEIQEDRTEVEEKVASPPKTKSQKAREARKEEPPALPQDLPYPVVPTKKNKERYFK 687

Query: 1092 KFINIFKQLHINIPIVEALTQMPKYAKFLKELLTNKRKLEEEGTVALSGNCSAILEKKLP 913
            +F+ IFK L I +P  EAL QMP Y+KF+K++LT K K  +   + + GNCSAI+++KLP
Sbjct: 688  RFLEIFKGLEITMPFGEALQQMPLYSKFMKDILTKKGKYIDNENIVVGGNCSAIIQRKLP 747

Query: 912  QKLKDPGSFIIPCVLEGGVEENALADSGASINVMPYSMYLKLGLDELRPTRMTLQLADRS 733
            +K KDPGS  IPC +       AL D GASIN+MP SM  ++G  ++ PT+MTLQLADRS
Sbjct: 748  KKFKDPGSVTIPCTIGKETVNKALIDLGASINLMPLSMCKRIGNLKIDPTKMTLQLADRS 807

Query: 732  TRTPRGIVEDVIVRVDKFVFPVDFVIMDIDEDVETPLILGRPFLSTSGALLDLKGGKLTL 553
               P G+VEDV+V+V  F FPVDFVIMDI+ED + PLILGRPF+ T+  ++D+  G L L
Sbjct: 808  ITRPYGVVEDVLVKVRHFTFPVDFVIMDIEEDADIPLILGRPFMLTANCVVDMGNGNLEL 867

Query: 552  RVGEEEAVYTLPAAMKHSLDHDDKLYFIDETDR 454
             +  ++  + L  AMK+      K + ++E D+
Sbjct: 868  SIDNQKITFDLFKAMKYP-QEGWKCFRVEEIDK 899


>ref|XP_003543910.1| PREDICTED: uncharacterized protein LOC100777299 [Glycine max]
          Length = 1658

 Score =  259 bits (662), Expect = 1e-66
 Identities = 137/333 (41%), Positives = 205/333 (61%), Gaps = 2/333 (0%)
 Frame = -3

Query: 1446 ASVQSLEHQIGQLAKENLERPSCSLPSCPEENPREHLKAIALRSGK--QVETRVEIDPSV 1273
            A++++LE Q+GQLA++  ERP+ +  +  E+NP+E  KA+  R  +  Q E +VE +   
Sbjct: 455  AAIRNLEVQMGQLAQDKAERPTRTFGANTEKNPKEECKAVLTRGQRKAQEEGKVEEEDQT 514

Query: 1272 KETGIAIVEDPKSAEKIVKESDHDGNNESHSRGQSKPPEYKPQIPYPSRLKQDKEDVQFR 1093
            +E      ED    E+ V       + ++    + +PP     +PYP    +  ++  F+
Sbjct: 515  EEDKTETQEDRTEVEEKVASPPKTKSQKAREARKEEPPALPQDLPYPVVPTKKNKERYFK 574

Query: 1092 KFINIFKQLHINIPIVEALTQMPKYAKFLKELLTNKRKLEEEGTVALSGNCSAILEKKLP 913
            +F+ IFK L I +P  EAL QMP Y+KF+K++LT K K  +   + + GNCSAI+++KLP
Sbjct: 575  RFLEIFKGLEITMPFGEALQQMPLYSKFMKDILTKKGKYIDNENIVVGGNCSAIIQRKLP 634

Query: 912  QKLKDPGSFIIPCVLEGGVEENALADSGASINVMPYSMYLKLGLDELRPTRMTLQLADRS 733
            +K KDPGS  IPC +       AL D GASIN+MP SM  ++G  ++ PT+MTLQLADRS
Sbjct: 635  KKFKDPGSVTIPCTIGKETVNKALIDLGASINLMPLSMCKRIGNLKIDPTKMTLQLADRS 694

Query: 732  TRTPRGIVEDVIVRVDKFVFPVDFVIMDIDEDVETPLILGRPFLSTSGALLDLKGGKLTL 553
               P G+VEDV+V+V  F FPVDFVIMDI+ED + PLILGRPF+ T+  ++D+  G L L
Sbjct: 695  ITRPYGVVEDVLVKVRHFTFPVDFVIMDIEEDADIPLILGRPFMLTANCVVDMGNGNLEL 754

Query: 552  RVGEEEAVYTLPAAMKHSLDHDDKLYFIDETDR 454
             +  ++  + L  AMK+      K + ++E D+
Sbjct: 755  SIDNQKITFDLFKAMKYP-QEGWKCFRVEEIDK 786


>gb|AEL30354.1| retrotransposon gag protein [Arachis hypogaea]
          Length = 920

 Score =  254 bits (649), Expect = 4e-65
 Identities = 137/310 (44%), Positives = 199/310 (64%), Gaps = 2/310 (0%)
 Frame = -3

Query: 1449 QASVQSLEHQIGQLAKENLERPSCSLPSCPEENPREHLKAIALRSGKQV--ETRVEIDPS 1276
            +AS+++LE  +GQL+K+ LER   +       NP E  KAI LRSGK    ET+V  D  
Sbjct: 521  KASIRNLEVLVGQLSKQILERSVSTFQEDTVVNPGEDCKAIQLRSGKVADSETKVNEDVV 580

Query: 1275 VKETGIAIVEDPKSAEKIVKESDHDGNNESHSRGQSKPPEYKPQIPYPSRLKQDKEDVQF 1096
             KE      E+ + A     ++    + + +     K PEYKP++PYP RL+++ +  QF
Sbjct: 581  EKEAPDEKKEEVEHAPPKRADNPFPDSLDIYPT-LPKAPEYKPKMPYPQRLQKETKKKQF 639

Query: 1095 RKFINIFKQLHINIPIVEALTQMPKYAKFLKELLTNKRKLEEEGTVALSGNCSAILEKKL 916
             KF+ IF++L INIP  E L QMP Y KF+KELL+ K++L+ + TV L+  CSA+++  L
Sbjct: 640  SKFLEIFRKLQINIPFAEVLEQMPIYVKFMKELLSKKKRLKGDETVVLTKECSAVIQNNL 699

Query: 915  PQKLKDPGSFIIPCVLEGGVEENALADSGASINVMPYSMYLKLGLDELRPTRMTLQLADR 736
            P+K+ DPGSF IPC +     E +L D GASIN+MP S+  KL + E +PT++ LQ+AD+
Sbjct: 700  PRKMPDPGSFQIPCTIGSTTFEKSLCDLGASINLMPLSVMKKLHIQEAQPTKIALQMADK 759

Query: 735  STRTPRGIVEDVIVRVDKFVFPVDFVIMDIDEDVETPLILGRPFLSTSGALLDLKGGKLT 556
            S +   G+VE+++V+V KF  P DFVI+D  ED    +ILGRPFL+T  AL+D++ G+L 
Sbjct: 760  SMKPAYGLVENILVKVGKFFLPADFVILDTGEDENASIILGRPFLATGRALIDVEVGELV 819

Query: 555  LRVGEEEAVY 526
            LRV  E+ V+
Sbjct: 820  LRVHNEQLVF 829


>gb|ACY01934.1| hypothetical protein [Beta vulgaris]
          Length = 1717

 Score =  254 bits (649), Expect = 4e-65
 Identities = 146/342 (42%), Positives = 211/342 (61%), Gaps = 5/342 (1%)
 Frame = -3

Query: 1455 NLQASVQSLEHQIGQLAKENLERPSCSLPSCPEENPREHLKAIALRSGKQVETRVEIDPS 1276
            NL    + LE Q+ QLA  +  RP  +LPS     PR+   AI LRSG    T  +  P 
Sbjct: 401  NLGTHNKMLETQLAQLASSSASRPPGALPS-QSLQPRDTANAITLRSG----THYDGPPM 455

Query: 1275 VKETGIAIVEDPKSAEKIVKESDHDGNNESHSRGQSKP---PEYKP--QIPYPSRLKQDK 1111
             K+     VE  K+A+     S  +    +++  Q+ P   P   P  ++P+P+RL ++K
Sbjct: 456  PKD---GPVESEKNADITETSSAPEATTNTNAEKQTIPENSPSNTPAIKVPFPTRLSRNK 512

Query: 1110 EDVQFRKFINIFKQLHINIPIVEALTQMPKYAKFLKELLTNKRKLEEEGTVALSGNCSAI 931
             D Q  KF+ + K L + +P  E +TQ+P YAKFLKE+LT KR      TVA +  CSA+
Sbjct: 513  LDHQLGKFMEVVKNLQVTVPFTELITQVPAYAKFLKEILTRKRAFNAVETVAFTEECSAL 572

Query: 930  LEKKLPQKLKDPGSFIIPCVLEGGVEENALADSGASINVMPYSMYLKLGLDELRPTRMTL 751
            L+ + P KLKDPGSF IPC +     + AL D GAS++VMP ++  KL + +L+ T +TL
Sbjct: 573  LQNQSPPKLKDPGSFSIPCNIGTIFIDKALCDLGASVSVMPLTVCKKLDMGDLKCTNITL 632

Query: 750  QLADRSTRTPRGIVEDVIVRVDKFVFPVDFVIMDIDEDVETPLILGRPFLSTSGALLDLK 571
            Q+ADRS + P GI+EDV VRV KF  PVDFV++D++ED + P+ILGRPFL T+GA++D+K
Sbjct: 633  QMADRSVKYPLGILEDVPVRVGKFYIPVDFVVLDMEEDTQIPIILGRPFLHTAGAVIDVK 692

Query: 570  GGKLTLRVGEEEAVYTLPAAMKHSLDHDDKLYFIDETDRLIS 445
             GKLTL VG+++  ++L  A+K  +  ++  Y ID  D L++
Sbjct: 693  NGKLTLTVGDDKVTFSLTNALKSPM-LEEACYRIDVIDVLVN 733

