BLASTX nr result
ID: Dioscorea21_contig00005055
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00005055 (1455 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248... 395 e-107 emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera] 385 e-104 ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800... 375 e-101 ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus c... 370 e-100 ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809... 366 9e-99 >ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248390 [Vitis vinifera] Length = 1353 Score = 395 bits (1014), Expect = e-107 Identities = 218/491 (44%), Positives = 310/491 (63%), Gaps = 36/491 (7%) Frame = +1 Query: 1 SKPVKDRRGRK--PSSDATSVPAKTKSGWQHEDSL-DLFSAQADDEA------------- 132 S+ +DRRGR+ PS++ ++ K+G Q+E L + S+ D+++ Sbjct: 844 SRSARDRRGRRTAPSAEPSTTYRSGKNGRQYEGELAEHVSSLPDNDSRNWIQLSMAGTEG 903 Query: 133 ----------SPRARTHQSPGYESAEIAGADSVITISPLVSGSQQKLVN-NNTGVVPIAF 279 S RT+ PGYE A+++G+ S++ I+P++ GS + +N G+VP+AF Sbjct: 904 AESTVSGTVDSSHVRTNLIPGYEPAQMSGSSSMLPITPMLVGSDSRQRGADNHGMVPVAF 963 Query: 280 YPTGPPVPFLTMVP--VYNFPSEAGNSNGPTTQVDNDELLEHGHANSCDQNFDSGESLDQ 453 YP GPP+PF+ M+P VYNFP+E GNS+ T+ +D DE + +A+ DQN DS E+LDQ Sbjct: 964 YPMGPPIPFVAMLPFPVYNFPNEMGNSSSSTSHLDGDEEFSNSNASQSDQNLDSPENLDQ 1023 Query: 454 FEA--HLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSSNAP 627 E +L S + +EEH++DIL+SDF H Q+L+ G+ C +TR P +Y S P Sbjct: 1024 SEIFNNLNSMKGPASMEPSEEHESDILDSDFPRHLQNLREGQLCLNTRNHEPWLYPSVMP 1083 Query: 628 PVYLQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPR 804 P+Y QG PWD PGRP S+N NL Q+ YGPRL+PV+ LQPG R +GV Q + +E+PR Sbjct: 1084 PMYFQG--PWDSPGRPLSTNMNLFAQLMGYGPRLIPVSPLQPGSNRPTGVYQHYGDEVPR 1141 Query: 805 YRRGTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSDR-SDRERSW-INSKSRNTGRSYG 975 YR GTGTYLPNPK SFRDRQSSNTRNHRG+ +D+ D DR+ +W INSK R +GR+ G Sbjct: 1142 YRGGTGTYLPNPKISFRDRQSSNTRNHRGHYGYDRKDHHGDRDGNWNINSKPRFSGRAQG 1201 Query: 976 RPQAEKPNLQSDRLSATDNRSDRTWNSYRHEPI-TXXXXXXXXXXXXXXXXXXXXXXXXX 1152 R Q +KPN + DR ++++++SDR+W++++HEP + Sbjct: 1202 RNQVDKPNSRIDRSTSSNSQSDRSWDTFKHEPFPSYHSQNGPLSSSNSTNRGSANMAYGM 1261 Query: 1153 XPLPALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLSSMNEVQVQP 1332 P+P +N NG SP+G +PP VMLYP++Q +GY SP +QLEFGS+GPV S +NEV Q Sbjct: 1262 YPMPVMNPNGVSPSGTGVPPVVMLYPYDQNMGYASPTDQLEFGSLGPVHFSGINEVS-QL 1320 Query: 1333 NEGVSARGAYD 1365 +E VS+RG D Sbjct: 1321 SE-VSSRGVND 1330 >emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera] Length = 1388 Score = 385 bits (990), Expect = e-104 Identities = 208/471 (44%), Positives = 297/471 (63%), Gaps = 36/471 (7%) Frame = +1 Query: 1 SKPVKDRRGRK--PSSDATSVPAKTKSGWQHEDSL-DLFSAQADDEA------------- 132 S+ +DRRGR+ PS++ ++ K+G Q+E L + S+ D+++ Sbjct: 813 SRSARDRRGRRTAPSAEPSTTYRSGKNGRQYEGELAEHVSSLPDNDSRNWIQLSMAGTEG 872 Query: 133 ----------SPRARTHQSPGYESAEIAGADSVITISPLVSGSQQKLVN-NNTGVVPIAF 279 S RT+ PGYE A+++G+ S++ I+P++ GS + +N G+VP+AF Sbjct: 873 AESTVSGTVDSSHVRTNLIPGYEPAQMSGSSSMLPITPMLVGSDSRQRGADNHGMVPVAF 932 Query: 280 YPTGPPVPFLTMVP--VYNFPSEAGNSNGPTTQVDNDELLEHGHANSCDQNFDSGESLDQ 453 YP GPP+PF+ M+P VYNFP+E GNS+ T+ +D DE + +A+ DQN DS E+LDQ Sbjct: 933 YPMGPPIPFVAMLPFPVYNFPNEMGNSSSSTSHLDGDEEFSNSNASQSDQNLDSPENLDQ 992 Query: 454 FEA--HLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSSNAP 627 E +L S + +EEH++DIL+SDF H Q+L+ G+ C +TR P +Y S P Sbjct: 993 SEIFNNLNSMKGPASMEPSEEHESDILDSDFPRHLQNLREGQLCLNTRNHEPWLYPSVMP 1052 Query: 628 PVYLQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPR 804 P+Y QG PWD PGRP S+N NL Q+ YGPRL+PV+ LQPG R +GV Q + +E+PR Sbjct: 1053 PMYFQG--PWDSPGRPLSTNMNLFAQLMGYGPRLIPVSPLQPGSNRPTGVYQHYGDEVPR 1110 Query: 805 YRRGTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSDR-SDRERSW-INSKSRNTGRSYG 975 YR GTGTYLPNPK SFRDRQSSNTRNHRG+ +D+ D DR+ +W INSK R +GR+ G Sbjct: 1111 YRGGTGTYLPNPKISFRDRQSSNTRNHRGHYGYDRKDHHGDRDGNWNINSKPRFSGRAQG 1170 Query: 976 RPQAEKPNLQSDRLSATDNRSDRTWNSYRHEPI-TXXXXXXXXXXXXXXXXXXXXXXXXX 1152 R Q +KPN + DR ++++++SDR+W++++HEP + Sbjct: 1171 RNQVDKPNSRIDRSTSSNSQSDRSWDTFKHEPFPSYHSQNGPLSSSNSTNRGSANMAYGM 1230 Query: 1153 XPLPALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLS 1305 P+P +N NG SP+G +PP VMLYP++Q +GY SP +QLEFGS+GPV S Sbjct: 1231 YPMPVMNPNGVSPSGTGVPPVVMLYPYDQNMGYASPTDQLEFGSLGPVHFS 1281 >ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800527 [Glycine max] Length = 1337 Score = 375 bits (962), Expect = e-101 Identities = 217/490 (44%), Positives = 290/490 (59%), Gaps = 33/490 (6%) Frame = +1 Query: 1 SKPVKDRRGRKPSSDATSVPAKTKSGWQHEDSLDLFSAQADDE----------------- 129 SK ++RRGRK +S S P K E S S + DDE Sbjct: 843 SKSTRERRGRKNTSSIAS-PVYAKGKNVSETS----SNRVDDENREWTPLSTMASNISER 897 Query: 130 -------ASPRARTHQSPGYESAEIAGADSVITISPLV--SGSQQKLVNNNTGVVPIAFY 282 S +Q G+E+A+ +G+DS + ISP++ GS+Q+ +N+GVVP FY Sbjct: 898 SIWPTSSTSMHVPRNQISGFETAQTSGSDSPLPISPVLLGPGSRQR---DNSGVVPFTFY 954 Query: 283 PTGPPVPFLTMVPVYNFPSEAGNSNGPTTQVDNDELLEHGHANSCDQNFDSGESLDQFEA 462 PTGPPVPF+TM+P+YNFP+E+ + T N L E + QNFDS E + Sbjct: 955 PTGPPVPFVTMLPLYNFPTESSD-----TSTSNFNLEEGADNSDSSQNFDSSEGYEHPGV 1009 Query: 463 HLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSSNA--PPVY 636 S ++ I + EHK+DILNSDF SHWQ+LQYGRFCQ++R + Y S PPVY Sbjct: 1010 SSPSNSMTRVAIESSEHKSDILNSDFVSHWQNLQYGRFCQNSRLPPSMTYPSPGMVPPVY 1069 Query: 637 LQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPRYRR 813 LQG +PWDGPGRP S N N+ +Q+ NYGPRLVPV LQ R + + QR+ +++PRYR Sbjct: 1070 LQGRYPWDGPGRPISGNMNIFSQLMNYGPRLVPVAPLQSVSNRPANIYQRYVDDMPRYRS 1129 Query: 814 GTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSDR-SDRERSW-INSKSRNTGRSYGRPQ 984 GTGTYLPNPK S RDR S+NTR RGN N+D+SD DRE +W NSK R TGR + R Q Sbjct: 1130 GTGTYLPNPKVSARDRHSTNTR--RGNYNYDRSDHHGDREGNWNTNSKLRGTGRGHNRNQ 1187 Query: 985 AEKPNLQSDRLSATDNRSDRTWNSYRHEPITXXXXXXXXXXXXXXXXXXXXXXXXXXPLP 1164 EKPN +++RLS++++R++R+W S+RH+ P+P Sbjct: 1188 NEKPNSKTERLSSSESRAERSWGSHRHD--NFIPHQNGPVGSNSLQSNPSNVAYGMYPIP 1245 Query: 1165 ALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLSSMNEVQVQPNEGV 1344 A+N +G S GP +P VM YP++ GYGSP EQLEFG++GP+ S +NE+ Q NEG Sbjct: 1246 AMNPSGFSSNGPTMPSVVMFYPYDHNTGYGSPAEQLEFGTLGPMGFSGVNELS-QANEGT 1304 Query: 1345 SARGAY-DQR 1371 + GA+ DQR Sbjct: 1305 QSSGAHEDQR 1314 >ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus communis] gi|223537453|gb|EEF39081.1| hypothetical protein RCOM_0719270 [Ricinus communis] Length = 1334 Score = 370 bits (949), Expect = e-100 Identities = 219/499 (43%), Positives = 299/499 (59%), Gaps = 36/499 (7%) Frame = +1 Query: 1 SKPVKDRRGRKPSSDA--TSVPAKTKSGWQHEDSLDLFSAQADDEA---------SPR-- 141 SK +++R RK +S ++V K K+ +H S Q DDE SP Sbjct: 840 SKSTREKRNRKTASSTVPSAVYGKGKNVSEHS------SNQGDDETKEWNPPSTISPEII 893 Query: 142 -------------ARTHQSPGYESAEIAGADSVITISPLVSG-SQQKLVNNNTGVVPIAF 279 HQ PG+E+A+ +G++S+++++P++ G ++ +++G+VP AF Sbjct: 894 ERSIGLQSASAVHVPRHQIPGFETAQTSGSESLLSMAPVLLGPGSRQRTTDSSGLVPFAF 953 Query: 280 YPTGPPVPFLTMVPVYNFPSEAGNSNGPTTQVDNDELLEHGHANS-CDQNFDSGESLDQF 456 YPTGPPVPF+TM+PVYNFPSEAG S T+Q +E G NS QNFDS + +DQ Sbjct: 954 YPTGPPVPFVTMLPVYNFPSEAGTSEASTSQFS----VEEGADNSDSGQNFDSSDGIDQS 1009 Query: 457 EAHLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSS--NAPP 630 E ++ +RTA I EHK DILNSDFASHWQ+LQYGRFCQ++R P++ S PP Sbjct: 1010 EVLSTNSMIRTASIEPLEHKTDILNSDFASHWQNLQYGRFCQNSRFNSPMVCPSPLMVPP 1069 Query: 631 VYLQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPRY 807 VYLQG PWDGPGRP +N N+ +Q+ NYGPRL+PV LQ R +GV Q + +EIPRY Sbjct: 1070 VYLQGRIPWDGPGRPLLTNMNIFSQLVNYGPRLIPVAPLQSVSNRPAGVYQHYVDEIPRY 1129 Query: 808 RRGTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSD-RSDRERSW-INSKSRNTGRSYGR 978 R GTGTYLP+PK S RDR +SNTR +GN ++D++D DRE +W +N K R GR R Sbjct: 1130 RSGTGTYLPSPKVSIRDRHTSNTR--KGNYSYDRNDHHGDREGNWHVNPKPRAAGRP-SR 1186 Query: 979 PQAEKPNLQSDRLSATDNRSDRTWNSY-RHEPITXXXXXXXXXXXXXXXXXXXXXXXXXX 1155 QAEK + + DRL+A ++R+DRTW S+ RH+ + Sbjct: 1187 GQAEKLSSRLDRLAANESRTDRTWGSHNRHDTFSSYQSQNGPNRQNSQSGSTMAYG---- 1242 Query: 1156 PLPALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLSSMNEVQVQPN 1335 + +N G S GP PP +MLYP++Q G+G+P EQLEFGS+GPV S +NE+ N Sbjct: 1243 -MYPVNPGGVSSNGPNFPPVLMLYPYDQSAGFGNPAEQLEFGSLGPVGFSGVNELS-HSN 1300 Query: 1336 EGVSARGAY-DQRQNAHFG 1389 EG + G + DQR + G Sbjct: 1301 EGSRSSGGFEDQRFHGSSG 1319 >ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809742 [Glycine max] Length = 1331 Score = 366 bits (939), Expect = 9e-99 Identities = 213/491 (43%), Positives = 289/491 (58%), Gaps = 34/491 (6%) Frame = +1 Query: 1 SKPVKDRRGRK-PSSDATSVPAKTKSGWQHEDSLDLFSAQADDE---------------- 129 SK ++RRGRK +S A+ V AK K+ ++ S + DDE Sbjct: 837 SKSTRERRGRKNTNSMASPVYAKGKN------VSEISSNRLDDENREWTPLSTMASNIPE 890 Query: 130 --------ASPRARTHQSPGYESAEIAGADSVITISPLV--SGSQQKLVNNNTGVVPIAF 279 S +Q G+E+A+ +G+DS + I+P++ GS+Q+ N+GVVP F Sbjct: 891 RSNWPTSGTSMHVPRNQISGFETAQTSGSDSPLPIAPVLLGPGSRQR---ENSGVVPFTF 947 Query: 280 YPTGPPVPFLTMVPVYNFPSEAGNSNGPTTQVDNDELLEHGHANSCDQNFDSGESLDQFE 459 YPTGPPVPF+TM+P+YNFP+E+ + T N L E + QNFDS E + E Sbjct: 948 YPTGPPVPFVTMLPLYNFPTESSD-----TSTSNFNLEEGADNSDSSQNFDSSEGYEHPE 1002 Query: 460 AHLGSAALRTAPILAEEHKADILNSDFASHWQSLQYGRFCQSTRQQGPLIYSS--NAPPV 633 S ++ I + EH+ DILNSDF SHWQ+LQYGRFCQ++R + Y S PPV Sbjct: 1003 VSSPSNSMTRVAIESSEHRPDILNSDFVSHWQNLQYGRFCQNSRHPPSMTYPSPVMVPPV 1062 Query: 634 YLQGHFPWDGPGRPFSSNGNLVTQIGNYGPRLVPVTALQPGPRRNSGV-QRFAEEIPRYR 810 YLQG +PWDGPGRP S N N+ +Q+ +YGPRLVPV LQ R + + QR+ +++PRYR Sbjct: 1063 YLQGRYPWDGPGRPISGNMNIFSQLMSYGPRLVPVAPLQSVSNRPASIYQRYVDDMPRYR 1122 Query: 811 RGTGTYLPNPK-SFRDRQSSNTRNHRGNQNFDQSD-RSDRERSW-INSKSRNTGRSYGRP 981 GTGTYLPNPK S RDR S+NTR RGN +D+SD DRE +W NSK R TGR + R Sbjct: 1123 SGTGTYLPNPKVSARDRHSTNTR--RGNYPYDRSDHHGDREGNWNTNSKLRGTGRGHNRN 1180 Query: 982 QAEKPNLQSDRLSATDNRSDRTWNSYRHEPITXXXXXXXXXXXXXXXXXXXXXXXXXXPL 1161 Q EKPN + +RL+ +++R++R W S+RH+ T P+ Sbjct: 1181 QTEKPNSKMERLATSESRAERPWGSHRHD--TFIPHQNGPVRSNSSQSNPSNVAYGMYPM 1238 Query: 1162 PALNANGASPTGPALPPFVMLYPFNQGVGYGSPNEQLEFGSIGPVSLSSMNEVQVQPNEG 1341 PA+N +G S GP +P VM YP++ GYGSP EQLEFG++G + S +NE+ Q NEG Sbjct: 1239 PAMNPSGVSSNGPTMPSVVMFYPYDHNTGYGSPAEQLEFGTLGSMGFSGVNELS-QANEG 1297 Query: 1342 VSARGAY-DQR 1371 + GA+ DQR Sbjct: 1298 SQSSGAHEDQR 1308