BLASTX nr result
ID: Dioscorea21_contig00005575
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00005575
         (1145 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CAN80099.1| hypothetical protein VITISV_002699 [Vitis vinifera]   161   3e-37
ref|XP_002330533.1| predicted protein [Populus trichocarpa] gi|2...   142   2e-31
ref|XP_002300593.1| predicted protein [Populus trichocarpa] gi|2...   135   2e-29
ref|XP_002525604.1| conserved hypothetical protein [Ricinus comm...   132   2e-28
ref|XP_003523853.1| PREDICTED: uncharacterized protein At4g26450...   109   1e-21

>emb|CAN80099.1| hypothetical protein VITISV_002699 [Vitis vinifera]
          Length = 718

 Score = 161 bits (407), Expect = 3e-37
 Identities = 139/414 (33%), Positives = 191/414 (46%), Gaps = 68/414 (16%)
 Frame = +1

Query: 28  RKTDILLEAGRLAAEYLVFKGLLPPNSLPARWQDRS-------FQEPKDREXXXXXXXSA 186
           RK DI +EAGRLA EYL+  GLLPP++LP +WQ+ S       FQ+ ++ +       SA
Sbjct: 64  RKGDIFMEAGRLATEYLISTGLLPPSALPVKWQNGSLKKQVGDFQDGENLQLPAEGRTSA 123

Query: 187 FSRLGSLQPDH-HGRRRFDDEYNXXXXXXXXXXXXYN---RSYSSDWGRENGRNGQWTER 354
            +RLG+   D   GRRRF DEYN                 RSY SDW   NGR+G W +
Sbjct: 124 LARLGNAVSDSGSGRRRFSDEYNPTGSRNHTRGRRRMGSFRSYGSDW---NGRSGSWDK- 179

Query: 355 SGRHWDGPEGDDDFAPGYQRERRTGFDDVGSSVSRTSRDERNSRSE---SGSEIENHEVV 525
             R     EGD+D   GY  E+  G  DVGS V ++   E    S+     SE E
Sbjct: 180 --RASPDTEGDEDSTCGYPEEQLVG-KDVGSGVQKSRPSELPPISDDVGDDSEAEPENTR 237

Query: 526 DDTGSKVSSSSTRKEQLAEVDVEMSKNKELDD----DVKADNLENGDVN--IDAEKKIQQ 687
           DD  SK  SS   +E   E D E+  NK  DD    DV    +++G  N   D  +K
Sbjct: 238 DDMSSKAGSSRVGREIPLETDGEL--NKRPDDSRVLDVAPGEMKDGTSNDSDDETEKQTA 295

Query: 688 EEDVNLHADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSL------------TQRNDKH 831
            ED+ +  D+A ++D A K  ++LL+LC+FAKVPT+ RSSL            T++ +
Sbjct: 296 SEDLTIQ-DSAVDDDIAGKSGTDLLRLCNFAKVPTKTRSSLMYKALKIDLTPTTEKGNTC 354

Query: 832 DDKPMEGSSSNSHADQ-DENLEGEARNATS-----------VLTDQPMEEAVDV------ 957
           D  P+ GSS +S  +  +E+L G   + T             LT + +E+A ++
Sbjct: 355 DIGPLRGSSYSSEDNPVEESLGGALSDQTQKSQCLNLDVSRALTVESLEDAEELDSKHVV 414

Query: 958 ------------------HNEANDDPPGFETFSSAIVEEEDASFEQHIQKNDGI 1065
                               EA+  PPGF   SS I E  +    Q     +GI
Sbjct: 415 EQGKCVRSQSFPERAFMYEQEASQGPPGFGRCSSMIKERGEKRAGQQSDTMEGI 468

>ref|XP_002330533.1| predicted protein [Populus trichocarpa]
 gi|222872091|gb|EEF09222.1| predicted protein [Populus trichocarpa]
          Length = 725

 Score = 142 bits (358), Expect = 2e-31
 Identities = 117/356 (32%), Positives = 179/356 (50%), Gaps = 15/356 (4%)
 Frame = +1

Query: 28  RKTDILLEAGRLAAEYLVFKGLLPPNSLPARWQDRSFQEP--------KDREXXXXXXXS 183
           +K  IL+EAGRLAAEYLV KGLLP ++L  +WQ+ SF+          +  +       S
Sbjct: 66  QKGYILMEAGRLAAEYLVSKGLLPQSALSGKWQNGSFKRQAGDYQDFRQQEDLMQEGRTS 125

Query: 184 AFSRLGSLQPDHH-GRRRFDDEYNXXXXXXXXXXXXYNRSYSSDWGRENGRNGQWTERSG 360
           A SRLGS   D   GRRR+ D++N            + R YSS+WGRE GR+G  ++R+
Sbjct: 126 AHSRLGSGASDAGLGRRRYPDDFNLRNHVKGRRRGEHYRGYSSEWGREYGRSGSLSDRNR 185

Query: 361 RHWDGPEGDDDFAPGYQRERRTGFDDVGSSVSRTSRDERNSRSESGSEIEN----HEVVD 528
              D  E  +D   G+  E++   +DVG  + ++ +      SE  ++IE+    + + +
Sbjct: 186 MSPDTEE--NDTVSGHCEEQQVS-NDVGDGMEKSGQSGVAPESEETADIESGLSKYNYPN 242

Query: 529 DTGSKVSSSSTRKEQLAEVDVEMSKNKELDDDVKADNLENGDVNIDAE-KKIQQEEDVNL 705
           +TGSK SSSS  KE   E D E SK      +V   N +  D N D E +K    ED+ +
Sbjct: 243 ETGSKASSSSVLKE---ETDGEPSKGSGDPANVNLGNKDMKDGNYDYEIEKQIVPEDLPI 299

Query: 706 HADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSLTQRNDKHDDKPMEGSSSNSHADQDE 885
                 ++D + K  S+LL L  FA VPT+ RS+L+ R+ + D  P     +N   D  +
Sbjct: 300 Q-----QSDLSGKDESDLLTLSKFANVPTKMRSALSCRSSRVDQVP-----NNEEEDTSD 349

Query: 886 NLEGEARNATSVLTD-QPMEEAVDVHNEANDDPPGFETFSSAIVEEEDASFEQHIQ 1050
           N  G  +    V+ D      A DV+   + + P  E    A+V+  + + E+ ++
Sbjct: 350 N--GLNKGSEDVVQDGVDNVSATDVNATHDSNCPNSEIIKVAVVQPAEDADEEGLE 403

>ref|XP_002300593.1| predicted protein [Populus trichocarpa]
 gi|222847851|gb|EEE85398.1| predicted protein [Populus trichocarpa]
          Length = 732

 Score = 135 bits (340), Expect = 2e-29
 Identities = 109/327 (33%), Positives = 158/327 (48%), Gaps = 22/327 (6%)
 Frame = +1

Query: 28  RKTDILLEAGRLAAEYLVFKGLLPPNSLPARWQDRSFQEP--------KDREXXXXXXXS 183
           +K D+L+EAGRLAAEYLV KGLLP ++L  +WQ+  F+          +  +       S
Sbjct: 69  QKGDVLMEAGRLAAEYLVSKGLLPQSALSGKWQNGGFKMQAGDYQDFRQQEDLMHEGRTS 128

Query: 184 AFSRLGSLQPDHH-GRRRFDDEYNXXXXXXXXXXXXYNRSYSSDWGRENGRNGQWTERSG 360
           A SRLGS   D    RRR+ D++N            + R YS++WGRE GR+G  ++R+
Sbjct: 129 AHSRLGSGASDTGLSRRRYSDDFNSRNHVKGRRRGEHYRGYSAEWGREYGRSGPLSDRN- 187

Query: 361 RHWDGPEGDDDFAPGYQRERRTGFDDVGSSVSRTS------RDERNSRSESGSEIENHEV 522
           R     EG  D    +  E++   +DVG  + ++         E  +  ESG    NH
Sbjct: 188 RVSPDMEGQSDTVSEHYEEQQVS-NDVGDGMEKSGLSGVAPESEETADIESGLSKYNHP- 245

Query: 523 VDDTGSKVSSSSTRKEQLAEVDVEMSKNKELDDDVKADNLENGDVNIDAEKKIQ-QEEDV 699
            D+TGSK SSSS  KE   E   E SK      ++   N E  D N D E + Q   ED+
Sbjct: 246 -DETGSKASSSSVPKE---ETGGEPSKGSGDPANLNLGNGEVKDSNYDYETEKQIVPEDL 301

Query: 700 NLHADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSLTQRNDKHDDKPMEGSSSNSHADQ 879
            +   +A E D + +  S+LL L  FA VPT+ RS+L+ R+ + D  P       S
Sbjct: 302 PIQ-QSAVEGDISGRNGSDLLTLSKFANVPTKTRSALSCRSSRVDQVPNNEDDGTSGIGL 360

Query: 880 DENLEGEAR------NATSVLTDQPME 942
           ++  E   +      +A  VL + P +
Sbjct: 361 NKGSEDSVQDGMYNVSAADVLANAPRD 387

>ref|XP_002525604.1| conserved hypothetical protein [Ricinus communis]
 gi|223535040|gb|EEF36722.1| conserved hypothetical protein [Ricinus communis]
          Length = 724

 Score = 132 bits (331), Expect = 2e-28
 Identities = 111/357 (31%), Positives = 175/357 (49%), Gaps = 18/357 (5%)
 Frame = +1

Query: 28  RKTDILLEAGRLAAEYLVFKGLLPPNSLPA--RWQDRSFQEP-----KDREXXXXXXXSA 186
           RK DIL+EAGRLAAEYLV KGLLP ++L A  +WQ+ + ++      ++ E       SA
Sbjct: 69  RKGDILMEAGRLAAEYLVSKGLLPQSALSASGKWQNGNSKKQVGDDYLQEELNQDGRTSA 128

Query: 187 FSRLGSLQPDHH-GRRRFDDEYN-XXXXXXXXXXXXYNRSYSSDW-GRENGRNGQWTERS 357
            SRLGS   D   G+RR+ D++N             YNRSY+S+W GRE GR+G W +R+
Sbjct: 129 HSRLGSGASDSGIGKRRYSDDFNLRNHVKGRRRGEYYNRSYNSEWGGREYGRSGSWLDRN 188

Query: 358 GRHWDGPEGDDDFAPGYQRERRTGFDDVGSSVSRTSRDERNSRSESGSEIEN---HEVVD 528
            R     EG+ D   G+  E++ G +DV   + ++        +E  +++E+   +   D
Sbjct: 189 -RVSPDMEGEGDTISGHYDEQQAG-EDVSEGLQKSGLSGSVPDNEEAADMESFAEYTNSD 246

Query: 529 DTGSKVSSSSTRKEQL----AEVDVEMSKNKELDDDVKADNLENGDVNIDAEKKIQQEED 696
           + GSK SS  T K++      +V  +++      +++K +N ++     + EK+I  E+
Sbjct: 247 EMGSKASSLHTGKDEAVGEPGQVSDDLTNLNSGSEEMKDNNFKH-----ETEKQIAPEDL 301

Query: 697 VNLHADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSLTQRNDKHDDKPMEGSSSNSHAD 876
                  + E D   K  S+LL  C+FAKVPT+ RS+LT +  K D  P        +A+
Sbjct: 302 ATQQC--SVEGDILDKHESDLLTFCNFAKVPTKIRSALTYKVPKVDQVP--------NAE 351

Query: 877 QDENLEGEARNATSVLTDQPMEEA-VDVHNEANDDPPGFETFSSAIVEEEDASFEQH 1044
           +    +G  + +   + D  ++ A  D     ND        S ++   ED     H
Sbjct: 352 ERNVSDGAQKGSEITVQDGTLDFATADQLPNTNDLKSADPEISISVQSAEDVGESGH 408

>ref|XP_003523853.1| PREDICTED: uncharacterized protein At4g26450-like [Glycine max]
          Length = 699

 Score = 109 bits (273), Expect = 1e-21
 Identities = 91/287 (31%), Positives = 142/287 (49%), Gaps = 22/287 (7%)
 Frame = +1

Query: 34  TDILLEAGRLAAEYLVFKGLLPPNSL-PARWQDRSFQEPKDREXXXXXXXSAFSRLGSLQ 210
           +DI +EAGRLAAEYLV +G LPPN+L P +WQ+        R+       SA +RLGS
Sbjct: 36  SDIFVEAGRLAAEYLVSQGQLPPNALPPPKWQNHKTPAEGGRQ-------SALARLGSAD 88

Query: 211 PDHHGRRRFDDEYNXXXXXXXXXXXXYNRSYSSDWGRENGRNGQWTER-SGRHWDGPEG- 384
               GRR+    ++            ++RS   DWGRE  RNG W++R  G   D  +G
Sbjct: 89  ----GRRKLGG-FDEFGQKGGRRRGSFSRSNGMDWGREYRRNGSWSDRFRGGAVDVRDGE 143

Query: 385 DDDFAPG--------------YQRERRTGFDDVGSSVSRTSRDERNSRSESGSEIE---- 510
           DDD+  G              YQ+++     D  S  S ++ +E   RSE G ++
Sbjct: 144 DDDYESGGFSVRHQDEEDQHQYQQQQHQNSVDDASMKSNSNLNEFAPRSEDGGDLNDKDG 203

Query: 511 NHEVVDDTGSKVSSSSTRKEQLAEVDVEMSKNKELDD-DVKADNLENGDVNIDAEKKIQQ 687
           + E V+     V  S   K+ +++VD+E+    +L+   V    +++G    D  ++++
Sbjct: 204 DKERVNAELMGVKQSGIGKD-VSDVDMEVGVGNDLESVSVGVKEVKDGSGGDDDSERLRN 262

Query: 688 EEDVNLHADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSLTQRNDK 828
             D      +  EN S+  + ++L+ LC   KVPTR RSS+T++N K
Sbjct: 263 VSD----QWSDQENSSSGGVVADLVSLCKSVKVPTRTRSSVTRKNLK 305
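
The per-hit statistics in this report (bit score, E-value, identities over alignment length) can be pulled out mechanically. The sketch below is one possible way to do that in Python and is not part of BLAST: the names HIT_STATS and parse_hit_stats are hypothetical, and the pattern assumes the legacy text layout seen in this record ("Score = ... bits (...), Expect = ... Identities = n/m"), tolerating collapsed whitespace. Other BLAST versions or output formats may need a different pattern.

```python
import re

# Hypothetical helper, not a BLAST or Biopython API. Assumes the legacy
# "Score = ... bits (...), Expect = ... Identities = n/m" layout.
HIT_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\de.+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_hit_stats(report):
    """Return one dict per 'Score = ...' statistics line found in the report text."""
    hits = []
    for m in HIT_STATS.finditer(report):
        evalue_text = m["evalue"]
        # Legacy BLAST may print very small values as e.g. "e-100" (no mantissa).
        if evalue_text.startswith("e"):
            evalue_text = "1" + evalue_text
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue_text),
            "identities": ident,
            "align_length": length,
            "percent_identity": 100.0 * ident / length,
        })
    return hits


# Statistics line of the first hit (emb|CAN80099.1|), copied from the report.
sample = (
    "Score = 161 bits (407), Expect = 3e-37 Identities = 139/414 (33%), "
    "Positives = 191/414 (46%), Gaps = 68/414 (16%) Frame = +1"
)
stats = parse_hit_stats(sample)
```

The computed percent_identity is an unrounded float. The report itself appears to truncate rather than round: 139/414 is about 33.6% but is printed as 33%.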