BLASTX nr result

ID: Dioscorea21_contig00017159 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00017159
         (1902 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAO23078.1| polyprotein [Glycine max]                              271   3e-70
emb|CAN82562.1| hypothetical protein VITISV_014148 [Vitis vinifera]   268   3e-69
emb|CAN73467.1| hypothetical protein VITISV_043900 [Vitis vinifera]   268   4e-69
ref|XP_003553022.1| PREDICTED: uncharacterized protein LOC100788...   266   2e-68
emb|CAN70471.1| hypothetical protein VITISV_013478 [Vitis vinifera]   260   8e-67

>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score =  271 bits (694), Expect = 3e-70
 Identities = 149/330 (45%), Positives = 195/330 (59%), Gaps = 10/330 (3%)
 Frame = -1

Query: 1902 RELFRLQGTKLAMSTAYHPQSDGQTEVVNRYLEDYLRSFASEQPRQWLRYLPWAEWHYNT 1723
            + LF+LQGT LAMS+AYHPQSDGQ+EV+N+ LE YLR F  E P+ W++ LPWAE+ YNT
Sbjct: 1220 QHLFKLQGTTLAMSSAYHPQSDGQSEVLNKCLEMYLRCFTYEHPKGWVKALPWAEFWYNT 1279

Query: 1722 AWHSSIRMTPYEAIFGRPPPSLLDYIQGTSRVAAVDDMLDDRSQILRTLKQNLADAQLRM 1543
            A+H S+ MTP+ A++GR PP+L          A V + L DR  +L  LK NL  AQ  M
Sbjct: 1280 AYHMSLGMTPFRALYGREPPTLTRQACSIDDPAEVREQLTDRDALLAKLKINLTRAQQVM 1339

Query: 1542 KNQADSKRKHVEFQPGDWVFLKLQPFQQRSLHGQHTHKLSKRFFGPFEVTDRIGSVAYRL 1363
            K QAD KR  V FQ GD V +KLQP++Q S   +   KLS R+FGPF+V  +IG VAY+L
Sbjct: 1340 KRQADKKRLDVSFQIGDEVLVKLQPYRQHSAVLRKNQKLSMRYFGPFKVLAKIGDVAYKL 1399

Query: 1362 ALPPTAQIHDVFHVSKLKRCIGDPTVQQLPFPELFKSEKPIFQPSAVLRSRTVLVQGKPL 1183
             LP  A+IH VFHVS+LK   G      LP P       P+ QP  +L SR ++     +
Sbjct: 1400 ELPSAARIHPVFHVSQLKPFNGTAQDPYLPLPLTVTEMGPVMQPVKILASRIIIRGHNQI 1459

Query: 1182 LQYLIQWEHGGAADATWEAADNLRKNFPAFNLEDKVV--DEGRAIVAVKDNAMVNEFESG 1009
             Q L+QWE+G   +ATWE  ++++ ++P FNLEDKVV   EG     +     VN     
Sbjct: 1460 EQILVQWENGLQDEATWEDIEDIKASYPTFNLEDKVVFKGEGNVTNGMSRGEKVNNTAES 1519

Query: 1008 DDPIGQHD--------GRANRANRRSTRDT 943
                G H+        GR  R  + S + T
Sbjct: 1520 SSERGLHNKLADFEELGRGKREKKPSWKIT 1549


>emb|CAN82562.1| hypothetical protein VITISV_014148 [Vitis vinifera]
          Length = 1384

 Score =  268 bits (686), Expect = 3e-69
 Identities = 132/283 (46%), Positives = 190/283 (67%)
 Frame = -1

Query: 1902 RELFRLQGTKLAMSTAYHPQSDGQTEVVNRYLEDYLRSFASEQPRQWLRYLPWAEWHYNT 1723
            +ELF+L GT+L MS++YHPQ+DGQ+EVVNR +E YLR +A   PR+W  +LPWAE+ YNT
Sbjct: 1084 QELFKLSGTQLKMSSSYHPQTDGQSEVVNRCVEQYLRCYAHHHPRKWSFFLPWAEFWYNT 1143

Query: 1722 AWHSSIRMTPYEAIFGRPPPSLLDYIQGTSRVAAVDDMLDDRSQILRTLKQNLADAQLRM 1543
             +H+S  MTP++A++GR PP++  Y+ GT+ + AVD  L  R+ ILR LK NL  A  RM
Sbjct: 1144 TYHASTGMTPFQALYGRLPPTIPHYLMGTTPIHAVDQNLASRNAILRQLKTNLHAATNRM 1203

Query: 1542 KNQADSKRKHVEFQPGDWVFLKLQPFQQRSLHGQHTHKLSKRFFGPFEVTDRIGSVAYRL 1363
            K  ADSKR+++E+Q GD VFLKLQP++Q+S+    + KL+ RF+GP+++  RIG VAY+L
Sbjct: 1204 KQVADSKRRNIEYQVGDMVFLKLQPYRQQSVFCXASQKLASRFYGPYQIEQRIGKVAYKL 1263

Query: 1362 ALPPTAQIHDVFHVSKLKRCIGDPTVQQLPFPELFKSEKPIFQPSAVLRSRTVLVQGKPL 1183
             LP  ++IH +FHVS LK+ +G+P    +  P      + + +P  +L +R V    +  
Sbjct: 1264 NLPEGSKIHPIFHVSLLKKKLGEPNNTTVELPLTDDEGEIVLEPEGILDTRWVKKGSRIF 1323

Query: 1182 LQYLIQWEHGGAADATWEAADNLRKNFPAFNLEDKVVDEGRAI 1054
             + L++W+     DATWE    LR  F   NLEDKV+ + R I
Sbjct: 1324 EESLVKWKRLPLDDATWEDTKMLRDRFINVNLEDKVLVQDRGI 1366


>emb|CAN73467.1| hypothetical protein VITISV_043900 [Vitis vinifera]
          Length = 1593

 Score =  268 bits (685), Expect = 4e-69
 Identities = 134/286 (46%), Positives = 187/286 (65%), Gaps = 3/286 (1%)
 Frame = -1

Query: 1902 RELFRLQGTKLAMSTAYHPQSDGQTEVVNRYLEDYLRSFASEQPRQWLRYLPWAEWHYNT 1723
            +E  +L GTKL M++AYHPQSDGQTEVVNR +E YLR F   +PR W   LPWAE+ YNT
Sbjct: 1059 QEFLKLSGTKLRMTSAYHPQSDGQTEVVNRCIEQYLRCFVHHKPRHWNSLLPWAEYWYNT 1118

Query: 1722 AWHSSIRMTPYEAIFGRPPPSLLDYIQGTSRVAAVDDMLDDRSQILRTLKQNLADAQLRM 1543
             +HSS  MTP++A++GRPPP++  Y  G+  +  +DD +  R+++L+ LK +L  A  RM
Sbjct: 1119 TYHSSTGMTPFQALYGRPPPAIPSYEIGSCPIEELDDQMTARNELLQELKAHLHAANNRM 1178

Query: 1542 KNQADSKRKHVEFQPGDWVFLKLQPFQQRSLHGQHTHKLSKRFFGPFEVTDRIGSVAYRL 1363
            K  AD KR+ V F+ GDWV+L+LQP++Q+S+  + +HKLS R++GP+E+ +RIG VAY+L
Sbjct: 1179 KQAADKKRREVNFEVGDWVYLRLQPYRQQSVFRRTSHKLSNRYYGPYEIEERIGPVAYKL 1238

Query: 1362 ALPPTAQIHDVFHVSKLKRCIGDPTVQQLPFPELFKSEKPIFQPSAVLRSRTVLVQGKPL 1183
             L P ++IH VFHVS LK+ IG+  +     P L +      QP  VL +R V       
Sbjct: 1239 KLSPGSRIHPVFHVSLLKKKIGEVAIANDELPPLTEEGVIRLQPRKVLSTRWVNKGSTSA 1298

Query: 1182 LQYLIQWEHGGAADATWEAADNLRKNFPAFNLEDK---VVDEGRAI 1054
             + L+ WE     +ATWE +  L ++FP  NLEDK    +D GR I
Sbjct: 1299 SESLVLWEGLPEEEATWEDSQQLLRSFPNLNLEDKDYWWLDLGRLI 1344


>ref|XP_003553022.1| PREDICTED: uncharacterized protein LOC100788433 [Glycine max]
          Length = 1433

 Score =  266 bits (679), Expect = 2e-68
 Identities = 137/277 (49%), Positives = 179/277 (64%), Gaps = 1/277 (0%)
 Frame = -1

Query: 1902 RELFRLQGTKLAMSTAYHPQSDGQTEVVNRYLEDYLRSFASEQPRQWLRYLPWAEWHYNT 1723
            RELFRL GT+L MSTAYHPQ+DGQTEV+NR LE YLRSF S  P  W ++L  AEW YNT
Sbjct: 1110 RELFRLSGTRLQMSTAYHPQTDGQTEVMNRVLEQYLRSFVSAHPSHWFKFLAMAEWSYNT 1169

Query: 1722 AWHSSIRMTPYEAIFGRPPPSLLDYIQGTSRVAAVDDMLDDRSQILRTLKQNLADAQLRM 1543
            + HSS   TPYE +FG+ PPS+  YI G+S   AVD +L  R  +  TL++ L  AQ  M
Sbjct: 1170 SVHSSTGFTPYEIVFGKAPPSIPHYITGSSTNEAVDSLLTSRQVLHDTLRRRLLKAQDTM 1229

Query: 1542 KNQADSKRKHVEFQPGDWVFLKLQPFQQRSLHGQHTHKLSKRFFGPFEVTDRIGSVAYRL 1363
            K+QAD+ R+ V F+ G WV+++L+P +QRS+ G    KLSKRFFGPF++ ++IG VAYRL
Sbjct: 1230 KHQADAHRRDVHFEVGQWVYVRLRPIRQRSITGTAHPKLSKRFFGPFQILEKIGPVAYRL 1289

Query: 1362 ALPPTAQIHDVFHVSKLKRCIGD-PTVQQLPFPELFKSEKPIFQPSAVLRSRTVLVQGKP 1186
             LPPTA+IH VFH S L+   G  P     P P    + +P+  P ++L S+       P
Sbjct: 1290 QLPPTAKIHPVFHCSLLRPHQGPLPMTTADPIPLTMVNNQPVLSPLSILSSKFDDSTQPP 1349

Query: 1185 LLQYLIQWEHGGAADATWEAADNLRKNFPAFNLEDKV 1075
                L+QW      D+TWE+ D L+     ++LEDKV
Sbjct: 1350 TRLVLVQWVGQAPEDSTWESWDELKAQ---YHLEDKV 1383


>emb|CAN70471.1| hypothetical protein VITISV_013478 [Vitis vinifera]
          Length = 1122

 Score =  260 bits (665), Expect = 8e-67
 Identities = 135/302 (44%), Positives = 193/302 (63%), Gaps = 7/302 (2%)
 Frame = -1

Query: 1902 RELFRLQGTKLAMSTAYHPQSDGQTEVVNRYLEDYLRSFASEQPRQWLRYLPWAEWHYNT 1723
            +E F+L GT+L MS++YHPQ+DGQ+EVVNR +E YL  +A   PR+W  +LPW E+ YNT
Sbjct: 763  QEFFKLSGTQLKMSSSYHPQTDGQSEVVNRCVEQYLCCYAHHHPRKWSFFLPWVEFWYNT 822

Query: 1722 AWHSSIRMTPYEAIFGRPPPSLLDYIQGTSRVAAVDDMLDDRSQILRTLKQNLADAQLRM 1543
             +H+S  MTP++A++GR PP++  Y+ GT+ V AVD  L  R  ILR LK NL  A  RM
Sbjct: 823  TYHTSTGMTPFQALYGRLPPNIPHYLMGTTPVHAVDQNLASRDAILRQLKTNLHVATNRM 882

Query: 1542 KNQADSKRKHVEFQPGDWVFLKLQPFQQRSLHGQHTHKLSKRFFGPFEVTDRIGSVAYRL 1363
            K  A+SKR+++E+Q GD VFLKLQP++Q+S+  + + KL+ RF+GP+++  RIG VAY+L
Sbjct: 883  KQVANSKRRNIEYQVGDMVFLKLQPYRQQSVFCRASQKLASRFYGPYQIEQRIGKVAYKL 942

Query: 1362 ALPPTAQIHDVFHVSKLKRCIGDPTVQQLPFPELFKSEKPIFQPSAVLRSRTVLVQGKPL 1183
             LP  ++IH +FHVS LK+ +G+P    +  P      + +  P  +L +R V    +  
Sbjct: 943  NLPEGSKIHPIFHVSLLKKKLGEPNNTTVELPLTNDEGEIVLXPEGILDTRWVKKGSRIF 1002

Query: 1182 LQYLIQWEHGGAADATWEAADNLRKNFPAFNLEDKV------VDEGRAIVAV-KDNAMVN 1024
             + L++W+     DATWE    L+  F   NLEDKV      +DE R    V K N  +N
Sbjct: 1003 EESLVKWKRLPLNDATWEDTKMLQDRFINVNLEDKVPVQDRGIDEPRRSQRVPKKNPRIN 1062

Query: 1023 EF 1018
            EF
Sbjct: 1063 EF 1064