BLASTX nr result

ID: Dioscorea21_contig00001781 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00001781
         (1195 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269239.1| PREDICTED: uncharacterized protein LOC100258...   338   1e-90
ref|XP_002531808.1| conserved hypothetical protein [Ricinus comm...   333   5e-89
ref|XP_002882577.1| alphavirus core protein family [Arabidopsis ...   332   9e-89
ref|NP_187476.1| uncharacterized protein [Arabidopsis thaliana] ...   330   4e-88
ref|XP_003534624.1| PREDICTED: uncharacterized protein LOC100798...   330   6e-88

>ref|XP_002269239.1| PREDICTED: uncharacterized protein LOC100258770 [Vitis vinifera]
          Length = 351

 Score =  338 bits (868), Expect = 1e-90
 Identities = 170/250 (68%), Positives = 192/250 (76%)
 Frame = +1

Query: 265  GLIGILLEGWRSRVSADPQFPFKVLMEELVGVSACVLGDMATRPNFGLNELDFVFSTVVV 444
            G++G+ L GWRSRVSADPQFPFKVLMEELVGV+ACV+GDMA+RPNFGLNELDFVFST+VV
Sbjct: 96   GVLGLFLNGWRSRVSADPQFPFKVLMEELVGVTACVIGDMASRPNFGLNELDFVFSTLVV 155

Query: 445  GSILNFVLMYXXXXXXXXXXXXXXXXXXXCPPSHMFESGPFSFPSRFGTFVYKGLTFAAV 624
            GSI+NFVLMY                   CPP HMFESG +   +RFGTFVYKG+ FA V
Sbjct: 156  GSIMNFVLMYLLAPTASSVTPNLPAIFAGCPPGHMFESGSYGVLNRFGTFVYKGVLFATV 215

Query: 625  GFIAGLAGTAISNGLIAFRKRMDPEFETPNKPPPTVLNALTWASHMGLSSNLRYQTLNGI 804
            GF AGL GTAISNGLI+ RK+MDP F TPNKPPPTVLNA+TWA HMGLSSN RYQTLNGI
Sbjct: 216  GFAAGLVGTAISNGLISMRKKMDPNFVTPNKPPPTVLNAITWAIHMGLSSNFRYQTLNGI 275

Query: 805  EFLMAKVLPPAGFKVSVVGLRCLNNVLGGASFVMLARLTGSQKVGEKREELKESLCAVVD 984
            EFL+AK LPP  FK SVV LRC NN+LGG SFV+LARLTGSQ V E +  L E+      
Sbjct: 276  EFLLAKGLPPLAFKSSVVVLRCFNNILGGMSFVLLARLTGSQSVEEGKVVLAEA------ 329

Query: 985  NADGELERRI 1014
             +D E E+ +
Sbjct: 330  GSDAEKEKLV 339


>ref|XP_002531808.1| conserved hypothetical protein [Ricinus communis]
            gi|223528542|gb|EEF30565.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 680

 Score =  333 bits (854), Expect = 5e-89
 Identities = 165/233 (70%), Positives = 183/233 (78%)
 Frame = +1

Query: 265  GLIGILLEGWRSRVSADPQFPFKVLMEELVGVSACVLGDMATRPNFGLNELDFVFSTVVV 444
            G++G+ L GWRSRV+ADPQFPFKVLMEELVGVSACVLGDMA+RPNFGLNELDFVFST+VV
Sbjct: 429  GILGLFLNGWRSRVAADPQFPFKVLMEELVGVSACVLGDMASRPNFGLNELDFVFSTLVV 488

Query: 445  GSILNFVLMYXXXXXXXXXXXXXXXXXXXCPPSHMFESGPFSFPSRFGTFVYKGLTFAAV 624
            GSILNF LMY                   CP SHMFE G F+  +R GT VYKG  FAAV
Sbjct: 489  GSILNFTLMYLLAPTASAASATLPAIFANCPASHMFEPGAFTLMNRLGTAVYKGTIFAAV 548

Query: 625  GFIAGLAGTAISNGLIAFRKRMDPEFETPNKPPPTVLNALTWASHMGLSSNLRYQTLNGI 804
            GF AGL GTA+SNGLI  RK+MDP FETPNKPPPT+LNA+TWA HMG+SSNLRYQTLNG+
Sbjct: 549  GFAAGLVGTALSNGLITMRKKMDPTFETPNKPPPTILNAVTWALHMGISSNLRYQTLNGV 608

Query: 805  EFLMAKVLPPAGFKVSVVGLRCLNNVLGGASFVMLARLTGSQKVGEKREELKE 963
            EF++ K LPP  FK SVV LRCLNNVLGG SFV+LARLTGSQ V E +  L E
Sbjct: 609  EFVLEKGLPPLAFKSSVVVLRCLNNVLGGMSFVILARLTGSQSVAEAKPVLAE 661


>ref|XP_002882577.1| alphavirus core protein family [Arabidopsis lyrata subsp. lyrata]
           gi|297328417|gb|EFH58836.1| alphavirus core protein
           family [Arabidopsis lyrata subsp. lyrata]
          Length = 333

 Score =  332 bits (852), Expect = 9e-89
 Identities = 167/237 (70%), Positives = 188/237 (79%), Gaps = 2/237 (0%)
 Frame = +1

Query: 247 EPSNSSGLIGILLEGWRSRVSADPQFPFKVLMEELVGVSACVLGDMATRPNFGLNELDFV 426
           E S+  G IG+ ++GWRSRV+ADPQFPFKVLMEE+VG+SACVLGDMA+RPNFGLNELDFV
Sbjct: 89  EESSPWGPIGLFIQGWRSRVAADPQFPFKVLMEEIVGLSACVLGDMASRPNFGLNELDFV 148

Query: 427 FSTVVVGSILNFVLMYXXXXXXXXXXXXXXXXXXX--CPPSHMFESGPFSFPSRFGTFVY 600
           FST+VVGSILNFVLMY                     CP SHMFE G F+  +RFGT VY
Sbjct: 149 FSTLVVGSILNFVLMYLLAPTAATLGSSQTLPGIFRNCPSSHMFEQGSFTVMNRFGTLVY 208

Query: 601 KGLTFAAVGFIAGLAGTAISNGLIAFRKRMDPEFETPNKPPPTVLNALTWASHMGLSSNL 780
           KG+ FA+VG  AGL GTAISNGLI  RK+MDP+FETPNKPPPTVLN+LTWA+HMG+S+N+
Sbjct: 209 KGMVFASVGLAAGLVGTAISNGLIMLRKKMDPDFETPNKPPPTVLNSLTWATHMGVSANV 268

Query: 781 RYQTLNGIEFLMAKVLPPAGFKVSVVGLRCLNNVLGGASFVMLARLTGSQKVGEKRE 951
           RYQTLNGIEFL+AKVLPP  FK  VV LRC NNV GG SFVMLARLTGSQ V EK E
Sbjct: 269 RYQTLNGIEFLLAKVLPPLVFKTGVVVLRCANNVAGGMSFVMLARLTGSQSVEEKTE 325


>ref|NP_187476.1| uncharacterized protein [Arabidopsis thaliana]
           gi|12322723|gb|AAG51347.1|AC012562_8 unknown protein;
           33915-34928 [Arabidopsis thaliana]
           gi|19698961|gb|AAL91216.1| unknown protein [Arabidopsis
           thaliana] gi|22136296|gb|AAM91226.1| unknown protein
           [Arabidopsis thaliana] gi|332641136|gb|AEE74657.1|
           uncharacterized protein [Arabidopsis thaliana]
          Length = 337

 Score =  330 bits (846), Expect = 4e-88
 Identities = 165/237 (69%), Positives = 187/237 (78%), Gaps = 2/237 (0%)
 Frame = +1

Query: 247 EPSNSSGLIGILLEGWRSRVSADPQFPFKVLMEELVGVSACVLGDMATRPNFGLNELDFV 426
           E S+  G IG+ ++GWRSRV+ADPQFPFKVLMEE+VG+SACVLGDMA+RPNFGLNELDFV
Sbjct: 93  EESSPWGPIGLFIQGWRSRVAADPQFPFKVLMEEIVGLSACVLGDMASRPNFGLNELDFV 152

Query: 427 FSTVVVGSILNFVLMYXXXXXXXXXXXXXXXXXXX--CPPSHMFESGPFSFPSRFGTFVY 600
           FST+VVGSILNFVLMY                     CP SHMFE G F+  +RFGT VY
Sbjct: 153 FSTLVVGSILNFVLMYMLAPTAATLGSSQTLPGIFRNCPSSHMFEQGSFTVMNRFGTLVY 212

Query: 601 KGLTFAAVGFIAGLAGTAISNGLIAFRKRMDPEFETPNKPPPTVLNALTWASHMGLSSNL 780
           KG+ FA+VG  AGL GTAISNGLI  RK+MDP FETPNKPPPTVLN+LTWA+HMG+S+N 
Sbjct: 213 KGMVFASVGLAAGLVGTAISNGLIMLRKKMDPSFETPNKPPPTVLNSLTWATHMGVSANA 272

Query: 781 RYQTLNGIEFLMAKVLPPAGFKVSVVGLRCLNNVLGGASFVMLARLTGSQKVGEKRE 951
           RYQTLNGIEFL+AKVLPP  FK SV+ LRC NNV GG SFV+LAR+TGSQ V EK E
Sbjct: 273 RYQTLNGIEFLLAKVLPPLVFKTSVIVLRCANNVAGGMSFVLLARMTGSQSVEEKTE 329


>ref|XP_003534624.1| PREDICTED: uncharacterized protein LOC100798978 [Glycine max]
          Length = 349

 Score =  330 bits (845), Expect = 6e-88
 Identities = 165/265 (62%), Positives = 197/265 (74%), Gaps = 1/265 (0%)
 Frame = +1

Query: 238  DKPEPSNSSGLIGILLEGWRSRVSADPQFPFKVLMEELVGVSACVLGDMATRPNFGLNEL 417
            D     +S G++G+ L GWRSRV+ADPQFPFKVLMEELVGVSACVLGDMA+RPNFGLNEL
Sbjct: 82   DGKSKDSSLGILGLFLNGWRSRVAADPQFPFKVLMEELVGVSACVLGDMASRPNFGLNEL 141

Query: 418  DFVFSTVVVGSILNFVLMYXXXXXXXXXXXXXXXXXXXCPPSHMFESGPFSFPSRFGTFV 597
            DFVFST+VVG+ILNF LMY                   CP SHMFE G FS   R GT V
Sbjct: 142  DFVFSTLVVGAILNFTLMYLLAPTMTSSASNLPALFASCPKSHMFEPGAFSLLDRLGTLV 201

Query: 598  YKGLTFAAVGFIAGLAGTAISNGLIAFRKRMDPEFETPNKPPPTVLNALTWASHMGLSSN 777
            YKG  F+ VGF AGL GT +SNGLI  RK+MDP FETPNKPPPT+LNALTWA+HMG+SSN
Sbjct: 202  YKGTIFSVVGFGAGLVGTTLSNGLIKMRKKMDPTFETPNKPPPTILNALTWAAHMGISSN 261

Query: 778  LRYQTLNGIEFLMAKVLPPAGFKVSVVGLRCLNNVLGGASFVMLARLTGSQKVGEKREEL 957
            LRYQTLNG+EF++ +VL P  FK SV+ LRC+NNVLGG SFV+LARLTG+Q VG +++E 
Sbjct: 262  LRYQTLNGVEFMLERVLNPLAFKSSVLVLRCVNNVLGGMSFVVLARLTGAQSVGGEQKEN 321

Query: 958  KESLCAVVDN-ADGELERRIDGSES 1029
            + +L A  +   + E E  +  ++S
Sbjct: 322  EVALIAEKEKVVESEREEGLQNNQS 346


Top