BLASTX nr result
ID: Dioscorea21_contig00006577
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00006577
         (1922 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002298629.1| predicted protein [Populus trichocarpa] gi|2...   429   e-117
emb|CBI21348.3| unnamed protein product [Vitis vinifera]              377   e-102
ref|XP_002509512.1| conserved hypothetical protein [Ricinus comm...   306   1e-80
ref|XP_002967167.1| hypothetical protein SELMODRAFT_86596 [Selag...   264   6e-68
ref|XP_001763235.1| predicted protein [Physcomitrella patens sub...   239   1e-60

>ref|XP_002298629.1| predicted protein [Populus trichocarpa]
 gi|222845887|gb|EEE83434.1| predicted protein [Populus trichocarpa]
          Length = 1156

 Score = 429 bits (1104), Expect = e-117
 Identities = 255/638 (39%), Positives = 354/638 (55%), Gaps = 57/638 (8%)
 Frame = +1

Query: 4    EYSDMTSGLWWLIPRERLPWNKDTLSIWANHSQFEGSADHLEKNARDHISKEERRLFSYV 183
            EYS+ T GL WLIPR++LPW SIW NH +++KE R+ S
Sbjct: 543  EYSETTKGLRWLIPRQKLPWKDGGTSIWPNHV---------------YLAKENLRILSLE 587

Query: 184  HSSEYRFSSTPPLVRQSHDDAVERIRFAKYGNVHQSDLVQKDIRSKLGVLSGRQNSDIAS 363
            + + + + S + +++ F Q +I G G S ++
Sbjct: 588  YHNWFNTN------LNSSSNMEDQLPF------------QTEINLNFGWRHGYSTSMKST 629

Query: 364  VASLPPLSSEEYFIYFLRGEPLSAANVVKKLENYEGWQDLEMNLFWLGVAVIGLIVMHSL 543
            L PL+S EYF YFLRGEP SA N++K+ EN +GWQDLE NLFWLGV L+++H L
Sbjct: 630  TPYGL-PLNSREYFTYFLRGEPSSATNLIKETENCKGWQDLEKNLFWLGVGGGSLLIIHVL 688

Query: 544  VLLFLRWRTGKLVHGSLSIPRFELFLVIFMLPCIGQSSAFIIRGNTNXXXXXXXXXXXXX 723
            LLFLRWRTG G S+PRFEL L+I MLPCI QSSAF+++G +
Sbjct: 689  TLLFLRWRTGAPAQGIFSVPRFELLLLILMLPCISQSSAFVMKGGSPRGIIIGALLLVVP 748

Query: 724  XXFILSTFLFLVITVFTSNFFEYREVRTLDTSHSYCKK-FSNFLSRDTIGRWFPTEGRSS 900
            IL T LFL+I +F+ +F Y+E+R + + KK +S F+ + IG+WF EG +
Sbjct: 749  GALILFTILFLIIAIFSGSFALYKEIRDIAVGDPWYKKLWSVFVGKQVIGKWFYKEGLPT 808

Query: 901  SLLLRFGILFEDRKGPPKFLSV-----------------GTGRMRVLSPDSSSEEA-TSW 1026
            SLL RFGILFE+ +GPP F+ V G GRMR +S D S+EE W
Sbjct: 809  SLLPRFGILFENLRGPPLFVIVDHCDPNTLPTWIESGQSGIGRMRAVSSDDSNEETKMPW 868

Query: 1027 -ERLLGCVRSAYIIIDLLRRVGIGILAGAYSSPSESQCAIAFTLTVVQFLYLCTIKPYIR 1203
            RL+GC RS+Y+I+DL+RR+G+GIL+GAY SP SQ +A +T++QF+YL T+KPYIR
Sbjct: 869  SRRLVGCARSSYVILDLVRRIGLGILSGAYRSPESSQSLLALAITLIQFIYLLTLKPYIR 928

Query: 1204 AGVHLVENVSLVCESVLFGLLFCSNGREVLRYKDQRIIGLVMLVLLFISFVSQLMNEWYA 1383
            VHLVE++SL+CE+ +FG S E + ++ I+G ML LLF++F+ ++NEWYA
Sbjct: 929  RRVHLVESISLLCEAGIFGF---SIATERSNHMEESILGYTMLALLFLTFIVHIVNEWYA 985

Query: 1384 LMKCILRHPQTQMPSYKAGPISVGKRLVLPLLPRRYWSRLMPGFSEPTSGF--------- 1536
            L+KC+LR Q + S+K G K LVLP LPR++WS+++P FS+P +G
Sbjct: 986  LVKCLLRLSQPRRNSFKFGLKLAAKGLVLPFLPRKHWSKVIPIFSQPKTGLSAVPPLSPE 1045

Query: 1537 ---------------------XXXXXXXXXXXLKNTSLKAKQPSFHSI-------GNDEL 1632
            ++ TS + S HS G+ L
Sbjct: 1046 SVDRRTHHGDPLSTISATVVPVLSPGSPSLDVIQETSYTTAETSLHSAQSVGEGKGSQGL 1105

Query: 1633 KSDPKNEIKKLRELARASFSGKLWKTEEGSCSYAPREQ 1746
            + K+E+KKLR+LARASFSG + S SYA R+Q
Sbjct: 1106 NLEKKSELKKLRQLARASFSGN--SKGQESTSYAFRDQ 1141

>emb|CBI21348.3| unnamed protein product [Vitis vinifera]
          Length = 491

 Score = 377 bits (967), Expect = e-102
 Identities = 216/461 (46%), Positives = 279/461 (60%), Gaps = 38/461 (8%)
 Frame = +1

Query: 454  LENYEGWQDLEMNLFWLGVAVIGLIVMHSLVLLFLRWRTGKLVHGSLSIPRFELFLVIFM 633
            +ENY+GW+DLEMNLFWLGV LI++H L+L+FLRWRTG HG LS+PRFELFL+I M
Sbjct: 1    MENYKGWEDLEMNLFWLGVGGGSLIIIHILILIFLRWRTGTSAHGILSVPRFELFLLILM 60

Query: 634  LPCIGQSSAFIIRGNTNXXXXXXXXXXXXXXXFILSTFLFLVITVFTSNFFEYREVRTLD 813
            LPCI QSSAF+IRG T I S LFL++ +F+ +F +Y+EVR
Sbjct: 61   LPCISQSSAFVIRGGTTGGIIVGALLLAIPAALIFSVCLFLIVAIFSGSFAQYKEVRHTG 120

Query: 814  T-SHSYCKK-FSNFLSRDTIGRWFPTEGRSSSLLLRFGILFEDRKGPPKFLSV------- 966
            T +C K + + R T G+WF EG S+ L RFGILFE RKGPP + V
Sbjct: 121  TKEEGWCSKLWVSIAGRSTTGKWFYREGLPSTFLQRFGILFESRKGPPLLVLVDQNDLSS 180

Query: 967  ----------GTGRMRVLSPDSSSEEA--TSWERLLGCVRSAYIIIDLLRRVGIGILAGA 1110
            G GRMR LS D S+EE +RLLGC RS+YII DLLRRV +GI++GA
Sbjct: 181  LPKWTESGQSGIGRMRALSSDDSNEETKIPMSKRLLGCARSSYIIFDLLRRVTLGIISGA 240

Query: 1111 YSSPSESQCAIAFTLTVVQFLYLCTIKPYIRAGVHLVENVSLVCESVLFGLLFCSNGREV 1290
            YSS SQ IA ++T+ QFLYL T+KPYIR GVH+ E+VSL+CE+ +FGL F G
Sbjct: 241  YSSHGSSQSLIALSITLAQFLYLFTLKPYIRRGVHIAESVSLLCEAGIFGLSFSMVGSNP 300

Query: 1291 LRYKDQRIIGLVMLVLLFISFVSQLMNEWYALMKCILRHPQTQMPSYKAGPISVGKRLVL 1470
            +R +G VML LLF++F SQL+NEWYALMKC+LR Q Q S+K G + LVL
Sbjct: 301  ---NQERTVGFVMLALLFLTFSSQLVNEWYALMKCLLRLSQPQKNSFKLGLKCAAQGLVL 357

Query: 1471 PLLPRRYWSRLMPGFSEPTSGFXXXXXXXXXXXLK--------NTSLKAKQPSFHSIGND 1626
            P LPR++W ++P S+P +G + N + + +I N
Sbjct: 358  PFLPRKHWWTIIPLSSQPKTGPAEPLSCMTATVVPVLSPGSPFNANQTIASTAADTILNG 417

Query: 1627 E---------LKSDPKNEIKKLRELARASFSGKLWKTEEGS 1722
            + +K + K+E++KLRELARASFSG K EEG+
Sbjct: 418  QRAEGKQPKGVKLESKSEMRKLRELARASFSGNP-KGEEGT 457

>ref|XP_002509512.1| conserved hypothetical protein [Ricinus communis]
 gi|223549411|gb|EEF50899.1| conserved hypothetical protein [Ricinus communis]
          Length = 1095

 Score = 306 bits (784), Expect = 1e-80
 Identities = 195/494 (39%), Positives = 269/494 (54%), Gaps = 59/494 (11%)
 Frame = +1

Query: 460  NYEGWQDLEMNLFWLGVAVIGLIVMHSLVLLFLRWRTG-KLVHGSLSIPRFELFLVIFML 636
            N W+DLEMNLF+LGV L ++H L+LLFLRWR G HG LS PRFEL L+I L
Sbjct: 608  NSREWEDLEMNLFFLGVGGGSLFMIHILILLFLRWRIGASSAHGILSFPRFELLLLILAL 667

Query: 637  PCIGQSSAFIIRGNTNXXXXXXXXXXXXXXXFILSTFLFLVIT---VFTSNFFEYREVRT 807
            PC+ Q+SAF++RG T I++ L LVI +F++ R V
Sbjct: 668  PCVSQASAFVMRGGTVGG--------------IITGALLLVIPAALIFSAVLXXXRHVDI 713

Query: 808  LDTSHSYCKKFSNFLSRDTIGRWFPTEGRSSSLLLRFGILFEDRKGPPKFLSV------- 966
            T Y K + F+ R G+WF EG SS L RFGILFEDRKGPP ++ V
Sbjct: 714  --TESWYTKLWLFFIGRPVFGKWFFGEGLPSSFLPRFGILFEDRKGPPLYVFVDQNDPST 771

Query: 967  ----------GTGRMRVLSPDSSSEEATS--WERLLGCVRSAYIIIDLLRRVGIGILAGA 1110
            G GRMR LS D S+EE + R+LGCVRS+YII+DLLRRV +GI++GA
Sbjct: 772  RLKWTGSGQTGIGRMRALSSDESNEEIKTPLARRILGCVRSSYIILDLLRRVSLGIISGA 831

Query: 1111 YSSPSESQCAIAFTLTVVQFLYLCTIKPYIRAGVHLVENVSLVCESVLFGLLFCSNGREV 1290
            SS + + A +T++QF++L +KPYIR GV +VE++SL+CE +FGL SN
Sbjct: 832  RSSQTSRKSHFALVITLLQFIFLFLLKPYIRRGVQVVESISLLCEVGIFGLSIASNHLNP 891

Query: 1291 LRYKDQRIIGLVMLVLLFISFVSQLMNEWYALMKCILRHPQTQMPSYKAGPISVGKRLVL 1470
            L + R G +ML LLF++F++Q++NEWYAL+KCIL + + S++ G K LVL
Sbjct: 892  L---EARNPGYIMLALLFLTFIAQIINEWYALIKCILGLSRPKRNSFRLGLKFAAKGLVL 948

Query: 1471 PLLPRRYWSRLMPGFSEPTSGFXXXXXXXXXXXLKNTSLKAKQP---------------- 1602
            P LPR++WS ++P S+ +G ++T+++ +P
Sbjct: 949  PFLPRKHWSGVIPNSSQMKTGLSTILPPETEFVTRDTTIENVEPYRAMTATVVPVLSPGS 1008

Query: 1603 -------------------SFHSIGNDEL-KSDPKNEIKKLRELARASFSGKLWKTEEGS 1722
            + G + K + KNE+KKLRELA+ASF+G E S
Sbjct: 1009 PSDLDVTLRTSSTPAEATLTEQRAGKGKTSKCERKNELKKLRELAKASFAGVSKSDEGRS 1068

Query: 1723 CSYAPREQRRPDET 1764
            SY +EQ +T
Sbjct: 1069 ASYKFKEQNFSPKT 1082

>ref|XP_002967167.1| hypothetical protein SELMODRAFT_86596 [Selaginella moellendorffii]
 gi|300165158|gb|EFJ31766.1| hypothetical protein SELMODRAFT_86596 [Selaginella moellendorffii]
          Length = 760

 Score = 264 bits (675), Expect = 6e-68
 Identities = 153/408 (37%), Positives = 223/408 (54%), Gaps = 22/408 (5%)
 Frame = +1

Query: 376  PPLSSEEYFIYFLRGEPLSAANVVKKLENYEGWQDLEMNLFWLGVAVIGLIVMHSLVLLF 555
            P LSS+EY ++F ++ ++ W D N+FWLGV G ++ H L+L F
Sbjct: 226  PALSSDEYRLFF------QVMYIINFFDSRYRWNDFGKNMFWLGVFGGGFVLFHLLLLYF 279

Query: 556  LRWRTGKLVHGSLSIPRFELFLVIFMLPCIGQSSAFIIRGNTNXXXXXXXXXXXXXXXFI 735
            LRWRT + G+L +PRFE+FL+ LPCI Q++AFIIRG T F
Sbjct: 280  LRWRTEATLRGALCVPRFEIFLLYVALPCICQAAAFIIRGGTTGGIIVGVLLLAVPTGFF 339

Query: 736  LSTFLFLVITVFTSNFFEYREVRTLDTSHSYCKKFSNFLSRDTIGRWFPTEGRSSSLLLR 915
            LS +FL++ V +Y+E R+ H L IG+W EG SSS + +
Sbjct: 340  LSVLMFLLVAVIWGALVQYKEYRSQAGGHVCRGVVRLLLGESHIGKWVRKEGLSSSFIPK 399

Query: 916  FGILFEDRKGPPKFLSV--------------GTGRMRVLSPDSSSEE--ATSWERLLGCV 1047
            FG+LFE+RKGPP+ + V G GRM+ ++ D S + + +L+G
Sbjct: 400  FGLLFENRKGPPRVVYVDEDYGSKWVDSEGKGIGRMKPVNSDEDSVDMSVSKAHKLIGAA 459

Query: 1048 RSAYIIIDLLRRVGIGILAGAY--SSPSESQCAIAFTLTVVQFLYLCTIKPYIRAGVHLV 1221
            R YI+ D+ RR +GI+ G + S S Q ++A +T++Q LYL KPYIR GVHLV
Sbjct: 460  RVFYIMADIARRATLGIVFGVHPGSEVSWRQLSLALAVTLIQLLYLVLFKPYIRRGVHLV 519

Query: 1222 ENVSLVCESVLFGLLFCSNGREVL----RYKDQRIIGLVMLVLLFISFVSQLMNEWYALM 1389
            E+VSL+CE +F + G +L ++R +G+ M+ LL SF+ QL+NEWYALM
Sbjct: 520  ESVSLLCELAVFSI-----GMALLPDDHSSDNRRSLGIAMVTLLLSSFMCQLINEWYALM 574

Query: 1390 KCILRHPQTQMPSYKAGPISVGKRLVLPLLPRRYWSRLMPGFSEPTSG 1533
            + +L+ Q PS+KAG +GK LV P +P+R W + + +P G
Sbjct: 575  EKLLKLSAPQEPSFKAGMRMLGKGLVFPFIPQRKWPKFITPPQQPRIG 622

>ref|XP_001763235.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685718|gb|EDQ72112.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1287

 Score = 239 bits (611), Expect = 1e-60
 Identities = 173/547 (31%), Positives = 254/547 (46%), Gaps = 47/547 (8%)
 Frame = +1

Query: 4    EYSDMTSGLWWLIPRERLPWNKDTLSIWANHSQFEGSADHLEKNARDHISKEERRLFSYV 183
            EY++ +GL W+IP + PW KD + N ++S RR V
Sbjct: 591  EYAETANGLQWIIPHVKTPWQKD-----------DNFTAGFASNVHTNLSLTIRRSLLSV 639

Query: 184  HSSEYRFSSTPPLVRQSHDDAVERIRF------AKYGNVHQS-------------DLVQK 306
            S PP V + A+ +YGN++ S D +++
Sbjct: 640  -------SHVPPDVGEFQLQALHSACMMMQQWPTQYGNLNCSLYSDRLATTHWSDDYLEE 692

Query: 307  DIRSK----LGVLSGRQNSDIASVASLPPLSSEEYFIYFL-RGEPLSAANVVKKLENYEG 471
            + K R+ P +++EEY YFL + LS+ Y G
Sbjct: 693  TLPHKQEFHTSHFHDRRRLGANGTEFGPAMTAEEYNSYFLDQTSTLSSLRADLLTSEYTG 752

Query: 472  WQDLEMNLFWLGVAVIGLIVMHSLVLLFLRWRTGKLVHGSLSIPRFELFLVIFMLPCIGQ 651
            W D N+FWL V LI++H L+LLFLRWRT +HG+LS PRFELFL++ LP I Q
Sbjct: 753  WDDFLRNIFWLAVICGSLILLHILLLLFLRWRTKTPLHGALSFPRFELFLLVLALPAIAQ 812

Query: 652  SSAFIIRGNTNXXXXXXXXXXXXXXXFILSTFLFLVITVFTSNFFEYREVRTLDTSHSYC 831
            + AFII G T ++ T + L++ +F +Y+E+R
Sbjct: 813  ACAFIISGGTKAGIAVGVILLIIPALILIMTTVLLIVGIFLGKKVQYKELRPHLQQDGKL 872

Query: 832  ------KKFSNFLSRDTIGRWFPTEGRSSSLLLRFGILFEDRKGPPKFLSV--------- 966
            K S G+W E S + + RFGILFEDRKGPP+ +S+
Sbjct: 873  PPPPTGKPLSFVTGSGYPGKWARKENTSPAFIPRFGILFEDRKGPPRLVSMIEDPNQRNE 932

Query: 967  ----GTGRMRVLSPDSSSEE--ATSWERLLGCVRSAYIIIDLLRRVGIGILAGAY--SSP 1122
            G R ++ D +E + LLG ++AY+++D+LRR+ +G+ GA+ S
Sbjct: 933  TGRSGFRRSATMNSDDEHDERVVSRSYTLLGGFQTAYVLVDMLRRILLGVFFGAFRISDE 992

Query: 1123 SESQCAIAFTLTVVQFLYLCTIKPYIRAGVHLVENVSLVCESVLFGLLFCSNGREVLRYK 1302
            S +Q ++ +T VQFLYL KP+ R V VE VSL+CE +F G Y
Sbjct: 993  SWTQVSLVLAITTVQFLYLVITKPFQRRFVQFVETVSLMCEIGIFVAAMVILGLN-RPYD 1051

Query: 1303 DQRIIGLVMLVLLFISFVSQLMNEWYALMKCILRHPQTQMPSYKAGPISVGKRLVLPLLP 1482
            IG+ MLVL +SFV Q+ NEW+AL++ +L ++ S K G + L+LPLLP
Sbjct: 1052 PHYGIGIFMLVLFVLSFVVQIANEWFALIRQLLALSNSEEISPKQGLQAFAAGLMLPLLP 1111

Query: 1483 RRYWSRL 1503
            RR W ++
Sbjct: 1112 RRLWPQV 1118
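
The per-hit statistics in the report above (bit score, E-value, identities, positives, gaps) follow a fixed textual pattern, so they can be pulled out with a regular expression. The following is a minimal sketch, assuming the legacy BLAST 2.2.x text layout shown here; the names `HSP_RE`, `parse_evalue`, and `parse_hsp_stats` are hypothetical helpers introduced for illustration and are not part of BLAST or any library.

```python
import re

# Pattern for one HSP statistics block in legacy BLAST text output.
# Whitespace is matched loosely so it works on wrapped or flattened text.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\de.+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
)


def parse_evalue(text):
    """Convert an E-value string to float.

    Legacy BLAST prints very small values without a mantissa ("e-117"),
    which float() rejects, so a leading "1" is added in that case.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hsp_stats(report):
    """Return one dict per Score/Expect/Identities block found in report."""
    stats = []
    for m in HSP_RE.finditer(report):
        ident = int(m["ident"])
        length = int(m["length"])
        stats.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": parse_evalue(m["evalue"]),
            "identities": ident,
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"]) if m["gaps"] is not None else 0,
            "align_length": length,
            # Unrounded percent identity; BLAST itself prints a whole number.
            "pct_identity": 100.0 * ident / length,
        })
    return stats
```

Passing the full report text to `parse_hsp_stats` should yield one entry per alignment, in report order, which is convenient for filtering hits by E-value or percent identity.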