BLASTX nr result

ID: Dioscorea21_contig00005055 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00005055
         (1455 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248...   395   e-107
emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera]   385   e-104
ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800...   375   e-101
ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus c...   370   e-100
ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809...   366   9e-99

>ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248390 [Vitis vinifera]
          Length = 1353

 Score =  395 bits (1014), Expect = e-107
 Identities = 218/491 (44%), Positives = 310/491 (63%), Gaps = 36/491 (7%)
 Frame = +1

Query: 1    SKPVKDRRGRK--PSSDATSVPAKTKSGWQHEDSL-DLFSAQADDEA------------- 132
            S+  +DRRGR+  PS++ ++     K+G Q+E  L +  S+  D+++             
Sbjct: 844  SRSARDRRGRRTAPSAEPSTTYRSGKNGRQYEGELAEHVSSLPDNDSRNWIQLSMAGTEG 903

Query: 133  ----------SPRARTHQSPGYESAEIAGADSVITISPLVSGSQQKLVN-NNTGVVPIAF 279
                      S   RT+  PGYE A+++G+ S++ I+P++ GS  +    +N G+VP+AF
Sbjct: 904  AESTVSGTVDSSHVRTNLIPGYEPAQMSGSSSMLPITPMLVGSDSRQRGADNHGMVPVAF 963

Query: 280  YPTGPPVPFLTMVP--VYNFPSEAGNSNGPTTQVDNDELLEHGHANSCDQNFDSGESLDQ 453
            YP GPP+PF+ M+P  VYNFP+E GNS+  T+ +D DE   + +A+  DQN DS E+LDQ
Sbjct: 964  YPMGPPIPFVAMLPFPVYNFPNEMGNSSSSTSHLDGDEEFSNSNASQSDQNLDSPENLDQ 1023

Query: 454  FEA--HLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSSNAP 627
             E   +L S     +   +EEH++DIL+SDF  H Q+L+ G+ C +TR   P +Y S  P
Sbjct: 1024 SEIFNNLNSMKGPASMEPSEEHESDILDSDFPRHLQNLREGQLCLNTRNHEPWLYPSVMP 1083

Query: 628  PVYLQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPR 804
            P+Y QG  PWD PGRP S+N NL  Q+  YGPRL+PV+ LQPG  R +GV Q + +E+PR
Sbjct: 1084 PMYFQG--PWDSPGRPLSTNMNLFAQLMGYGPRLIPVSPLQPGSNRPTGVYQHYGDEVPR 1141

Query: 805  YRRGTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSDR-SDRERSW-INSKSRNTGRSYG 975
            YR GTGTYLPNPK SFRDRQSSNTRNHRG+  +D+ D   DR+ +W INSK R +GR+ G
Sbjct: 1142 YRGGTGTYLPNPKISFRDRQSSNTRNHRGHYGYDRKDHHGDRDGNWNINSKPRFSGRAQG 1201

Query: 976  RPQAEKPNLQSDRLSATDNRSDRTWNSYRHEPI-TXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            R Q +KPN + DR ++++++SDR+W++++HEP  +                         
Sbjct: 1202 RNQVDKPNSRIDRSTSSNSQSDRSWDTFKHEPFPSYHSQNGPLSSSNSTNRGSANMAYGM 1261

Query: 1153 XPLPALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLSSMNEVQVQP 1332
             P+P +N NG SP+G  +PP VMLYP++Q +GY SP +QLEFGS+GPV  S +NEV  Q 
Sbjct: 1262 YPMPVMNPNGVSPSGTGVPPVVMLYPYDQNMGYASPTDQLEFGSLGPVHFSGINEVS-QL 1320

Query: 1333 NEGVSARGAYD 1365
            +E VS+RG  D
Sbjct: 1321 SE-VSSRGVND 1330


>emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera]
          Length = 1388

 Score =  385 bits (990), Expect = e-104
 Identities = 208/471 (44%), Positives = 297/471 (63%), Gaps = 36/471 (7%)
 Frame = +1

Query: 1    SKPVKDRRGRK--PSSDATSVPAKTKSGWQHEDSL-DLFSAQADDEA------------- 132
            S+  +DRRGR+  PS++ ++     K+G Q+E  L +  S+  D+++             
Sbjct: 813  SRSARDRRGRRTAPSAEPSTTYRSGKNGRQYEGELAEHVSSLPDNDSRNWIQLSMAGTEG 872

Query: 133  ----------SPRARTHQSPGYESAEIAGADSVITISPLVSGSQQKLVN-NNTGVVPIAF 279
                      S   RT+  PGYE A+++G+ S++ I+P++ GS  +    +N G+VP+AF
Sbjct: 873  AESTVSGTVDSSHVRTNLIPGYEPAQMSGSSSMLPITPMLVGSDSRQRGADNHGMVPVAF 932

Query: 280  YPTGPPVPFLTMVP--VYNFPSEAGNSNGPTTQVDNDELLEHGHANSCDQNFDSGESLDQ 453
            YP GPP+PF+ M+P  VYNFP+E GNS+  T+ +D DE   + +A+  DQN DS E+LDQ
Sbjct: 933  YPMGPPIPFVAMLPFPVYNFPNEMGNSSSSTSHLDGDEEFSNSNASQSDQNLDSPENLDQ 992

Query: 454  FEA--HLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSSNAP 627
             E   +L S     +   +EEH++DIL+SDF  H Q+L+ G+ C +TR   P +Y S  P
Sbjct: 993  SEIFNNLNSMKGPASMEPSEEHESDILDSDFPRHLQNLREGQLCLNTRNHEPWLYPSVMP 1052

Query: 628  PVYLQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPR 804
            P+Y QG  PWD PGRP S+N NL  Q+  YGPRL+PV+ LQPG  R +GV Q + +E+PR
Sbjct: 1053 PMYFQG--PWDSPGRPLSTNMNLFAQLMGYGPRLIPVSPLQPGSNRPTGVYQHYGDEVPR 1110

Query: 805  YRRGTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSDR-SDRERSW-INSKSRNTGRSYG 975
            YR GTGTYLPNPK SFRDRQSSNTRNHRG+  +D+ D   DR+ +W INSK R +GR+ G
Sbjct: 1111 YRGGTGTYLPNPKISFRDRQSSNTRNHRGHYGYDRKDHHGDRDGNWNINSKPRFSGRAQG 1170

Query: 976  RPQAEKPNLQSDRLSATDNRSDRTWNSYRHEPI-TXXXXXXXXXXXXXXXXXXXXXXXXX 1152
            R Q +KPN + DR ++++++SDR+W++++HEP  +                         
Sbjct: 1171 RNQVDKPNSRIDRSTSSNSQSDRSWDTFKHEPFPSYHSQNGPLSSSNSTNRGSANMAYGM 1230

Query: 1153 XPLPALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLS 1305
             P+P +N NG SP+G  +PP VMLYP++Q +GY SP +QLEFGS+GPV  S
Sbjct: 1231 YPMPVMNPNGVSPSGTGVPPVVMLYPYDQNMGYASPTDQLEFGSLGPVHFS 1281


>ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800527 [Glycine max]
          Length = 1337

 Score =  375 bits (962), Expect = e-101
 Identities = 217/490 (44%), Positives = 290/490 (59%), Gaps = 33/490 (6%)
 Frame = +1

Query: 1    SKPVKDRRGRKPSSDATSVPAKTKSGWQHEDSLDLFSAQADDE----------------- 129
            SK  ++RRGRK +S   S P   K     E S    S + DDE                 
Sbjct: 843  SKSTRERRGRKNTSSIAS-PVYAKGKNVSETS----SNRVDDENREWTPLSTMASNISER 897

Query: 130  -------ASPRARTHQSPGYESAEIAGADSVITISPLV--SGSQQKLVNNNTGVVPIAFY 282
                    S     +Q  G+E+A+ +G+DS + ISP++   GS+Q+   +N+GVVP  FY
Sbjct: 898  SIWPTSSTSMHVPRNQISGFETAQTSGSDSPLPISPVLLGPGSRQR---DNSGVVPFTFY 954

Query: 283  PTGPPVPFLTMVPVYNFPSEAGNSNGPTTQVDNDELLEHGHANSCDQNFDSGESLDQFEA 462
            PTGPPVPF+TM+P+YNFP+E+ +     T   N  L E    +   QNFDS E  +    
Sbjct: 955  PTGPPVPFVTMLPLYNFPTESSD-----TSTSNFNLEEGADNSDSSQNFDSSEGYEHPGV 1009

Query: 463  HLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSSNA--PPVY 636
               S ++    I + EHK+DILNSDF SHWQ+LQYGRFCQ++R    + Y S    PPVY
Sbjct: 1010 SSPSNSMTRVAIESSEHKSDILNSDFVSHWQNLQYGRFCQNSRLPPSMTYPSPGMVPPVY 1069

Query: 637  LQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPRYRR 813
            LQG +PWDGPGRP S N N+ +Q+ NYGPRLVPV  LQ    R + + QR+ +++PRYR 
Sbjct: 1070 LQGRYPWDGPGRPISGNMNIFSQLMNYGPRLVPVAPLQSVSNRPANIYQRYVDDMPRYRS 1129

Query: 814  GTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSDR-SDRERSW-INSKSRNTGRSYGRPQ 984
            GTGTYLPNPK S RDR S+NTR  RGN N+D+SD   DRE +W  NSK R TGR + R Q
Sbjct: 1130 GTGTYLPNPKVSARDRHSTNTR--RGNYNYDRSDHHGDREGNWNTNSKLRGTGRGHNRNQ 1187

Query: 985  AEKPNLQSDRLSATDNRSDRTWNSYRHEPITXXXXXXXXXXXXXXXXXXXXXXXXXXPLP 1164
             EKPN +++RLS++++R++R+W S+RH+                             P+P
Sbjct: 1188 NEKPNSKTERLSSSESRAERSWGSHRHD--NFIPHQNGPVGSNSLQSNPSNVAYGMYPIP 1245

Query: 1165 ALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLSSMNEVQVQPNEGV 1344
            A+N +G S  GP +P  VM YP++   GYGSP EQLEFG++GP+  S +NE+  Q NEG 
Sbjct: 1246 AMNPSGFSSNGPTMPSVVMFYPYDHNTGYGSPAEQLEFGTLGPMGFSGVNELS-QANEGT 1304

Query: 1345 SARGAY-DQR 1371
             + GA+ DQR
Sbjct: 1305 QSSGAHEDQR 1314


>ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus communis]
            gi|223537453|gb|EEF39081.1| hypothetical protein
            RCOM_0719270 [Ricinus communis]
          Length = 1334

 Score =  370 bits (949), Expect = e-100
 Identities = 219/499 (43%), Positives = 299/499 (59%), Gaps = 36/499 (7%)
 Frame = +1

Query: 1    SKPVKDRRGRKPSSDA--TSVPAKTKSGWQHEDSLDLFSAQADDEA---------SPR-- 141
            SK  +++R RK +S    ++V  K K+  +H       S Q DDE          SP   
Sbjct: 840  SKSTREKRNRKTASSTVPSAVYGKGKNVSEHS------SNQGDDETKEWNPPSTISPEII 893

Query: 142  -------------ARTHQSPGYESAEIAGADSVITISPLVSG-SQQKLVNNNTGVVPIAF 279
                            HQ PG+E+A+ +G++S+++++P++ G   ++   +++G+VP AF
Sbjct: 894  ERSIGLQSASAVHVPRHQIPGFETAQTSGSESLLSMAPVLLGPGSRQRTTDSSGLVPFAF 953

Query: 280  YPTGPPVPFLTMVPVYNFPSEAGNSNGPTTQVDNDELLEHGHANS-CDQNFDSGESLDQF 456
            YPTGPPVPF+TM+PVYNFPSEAG S   T+Q      +E G  NS   QNFDS + +DQ 
Sbjct: 954  YPTGPPVPFVTMLPVYNFPSEAGTSEASTSQFS----VEEGADNSDSGQNFDSSDGIDQS 1009

Query: 457  EAHLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSS--NAPP 630
            E    ++ +RTA I   EHK DILNSDFASHWQ+LQYGRFCQ++R   P++  S    PP
Sbjct: 1010 EVLSTNSMIRTASIEPLEHKTDILNSDFASHWQNLQYGRFCQNSRFNSPMVCPSPLMVPP 1069

Query: 631  VYLQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPRY 807
            VYLQG  PWDGPGRP  +N N+ +Q+ NYGPRL+PV  LQ    R +GV Q + +EIPRY
Sbjct: 1070 VYLQGRIPWDGPGRPLLTNMNIFSQLVNYGPRLIPVAPLQSVSNRPAGVYQHYVDEIPRY 1129

Query: 808  RRGTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSD-RSDRERSW-INSKSRNTGRSYGR 978
            R GTGTYLP+PK S RDR +SNTR  +GN ++D++D   DRE +W +N K R  GR   R
Sbjct: 1130 RSGTGTYLPSPKVSIRDRHTSNTR--KGNYSYDRNDHHGDREGNWHVNPKPRAAGRP-SR 1186

Query: 979  PQAEKPNLQSDRLSATDNRSDRTWNSY-RHEPITXXXXXXXXXXXXXXXXXXXXXXXXXX 1155
             QAEK + + DRL+A ++R+DRTW S+ RH+  +                          
Sbjct: 1187 GQAEKLSSRLDRLAANESRTDRTWGSHNRHDTFSSYQSQNGPNRQNSQSGSTMAYG---- 1242

Query: 1156 PLPALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLSSMNEVQVQPN 1335
             +  +N  G S  GP  PP +MLYP++Q  G+G+P EQLEFGS+GPV  S +NE+    N
Sbjct: 1243 -MYPVNPGGVSSNGPNFPPVLMLYPYDQSAGFGNPAEQLEFGSLGPVGFSGVNELS-HSN 1300

Query: 1336 EGVSARGAY-DQRQNAHFG 1389
            EG  + G + DQR +   G
Sbjct: 1301 EGSRSSGGFEDQRFHGSSG 1319


>ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809742 [Glycine max]
          Length = 1331

 Score =  366 bits (939), Expect = 9e-99
 Identities = 213/491 (43%), Positives = 289/491 (58%), Gaps = 34/491 (6%)
 Frame = +1

Query: 1    SKPVKDRRGRK-PSSDATSVPAKTKSGWQHEDSLDLFSAQADDE---------------- 129
            SK  ++RRGRK  +S A+ V AK K+        ++ S + DDE                
Sbjct: 837  SKSTRERRGRKNTNSMASPVYAKGKN------VSEISSNRLDDENREWTPLSTMASNIPE 890

Query: 130  --------ASPRARTHQSPGYESAEIAGADSVITISPLV--SGSQQKLVNNNTGVVPIAF 279
                     S     +Q  G+E+A+ +G+DS + I+P++   GS+Q+    N+GVVP  F
Sbjct: 891  RSNWPTSGTSMHVPRNQISGFETAQTSGSDSPLPIAPVLLGPGSRQR---ENSGVVPFTF 947

Query: 280  YPTGPPVPFLTMVPVYNFPSEAGNSNGPTTQVDNDELLEHGHANSCDQNFDSGESLDQFE 459
            YPTGPPVPF+TM+P+YNFP+E+ +     T   N  L E    +   QNFDS E  +  E
Sbjct: 948  YPTGPPVPFVTMLPLYNFPTESSD-----TSTSNFNLEEGADNSDSSQNFDSSEGYEHPE 1002

Query: 460  AHLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSS--NAPPV 633
                S ++    I + EH+ DILNSDF SHWQ+LQYGRFCQ++R    + Y S    PPV
Sbjct: 1003 VSSPSNSMTRVAIESSEHRPDILNSDFVSHWQNLQYGRFCQNSRHPPSMTYPSPVMVPPV 1062

Query: 634  YLQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPRYR 810
            YLQG +PWDGPGRP S N N+ +Q+ +YGPRLVPV  LQ    R + + QR+ +++PRYR
Sbjct: 1063 YLQGRYPWDGPGRPISGNMNIFSQLMSYGPRLVPVAPLQSVSNRPASIYQRYVDDMPRYR 1122

Query: 811  RGTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSD-RSDRERSW-INSKSRNTGRSYGRP 981
             GTGTYLPNPK S RDR S+NTR  RGN  +D+SD   DRE +W  NSK R TGR + R 
Sbjct: 1123 SGTGTYLPNPKVSARDRHSTNTR--RGNYPYDRSDHHGDREGNWNTNSKLRGTGRGHNRN 1180

Query: 982  QAEKPNLQSDRLSATDNRSDRTWNSYRHEPITXXXXXXXXXXXXXXXXXXXXXXXXXXPL 1161
            Q EKPN + +RL+ +++R++R W S+RH+  T                          P+
Sbjct: 1181 QTEKPNSKMERLATSESRAERPWGSHRHD--TFIPHQNGPVRSNSSQSNPSNVAYGMYPM 1238

Query: 1162 PALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLSSMNEVQVQPNEG 1341
            PA+N +G S  GP +P  VM YP++   GYGSP EQLEFG++G +  S +NE+  Q NEG
Sbjct: 1239 PAMNPSGVSSNGPTMPSVVMFYPYDHNTGYGSPAEQLEFGTLGSMGFSGVNELS-QANEG 1297

Query: 1342 VSARGAY-DQR 1371
              + GA+ DQR
Sbjct: 1298 SQSSGAHEDQR 1308


Top