BLASTX nr result
ID: Dioscorea21_contig00010870
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00010870 (2538 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002312573.1| predicted protein [Populus trichocarpa] gi|2...   212   5e-52
ref|XP_002517012.1| hypothetical protein RCOM_0908960 [Ricinus c...   182   5e-43
ref|XP_002312571.1| predicted protein [Populus trichocarpa] gi|2...   163   2e-37
ref|XP_002865410.1| zinc knuckle (CCHC-type) family protein [Ara...   130   2e-27
ref|XP_004170660.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   128   7e-27

>ref|XP_002312573.1| predicted protein [Populus trichocarpa] gi|222852393|gb|EEE89940.1| predicted protein [Populus trichocarpa] Length = 970 Score = 212 bits (539), Expect = 5e-52 Identities = 224/883 (25%), Positives = 365/883 (41%), Gaps = 59/883 (6%) Frame = -1 Query: 2520 GISSCPVSNYLKKNSGAGANANSRTDMVLITTNPLSELVWSPQKGLSLKYANSSLSEKKA 2341 G S+ + LK +SGAGANA S DM + TN LSELVWSP+KGLSLK A+ + S +K Sbjct: 19 GYSNQCIQRRLKNDSGAGANAASSVDMTFVATNALSELVWSPKKGLSLKCADGTFSNQKP 78 Query: 2340 SLLWNAESFNIMILPPQCPNVGESSKAMDTIDRNLNPVQLEINSESKNSNREAPPSPPQS 2161 SLL A P + + KA+ P + ++ SE + R+ P S Sbjct: 79 SLLRGAG-------PSDMVSGSNADKAIGKKVFMTPPEESDVRSEV--AGRDNPTKFVTS 129 Query: 2160 VAGMQPISLTLIHEQHSRSYGHMGQFGS------TSVNLDNPEKDKNEEILHSKSISRGD 1999 G+ P+ +H+ +Y + T+V L P K E+ ++K+ + Sbjct: 130 DTGLFPLLSESMHKVKIGNYEFLAATDDHKEEMKTAVGL--PFLQKMEDARNNKA----E 183 Query: 1998 EVWKNVKSAVDVMPEAFNLDNKKGPGDLKLNSAQVECEPISNFIQHFRGSIG----TRKD 1831 +++ + VD + + + KL+ AQ P S G +G T + Sbjct: 184 DIYDPINLQVDEISRTWETKFPSLSDETKLDVAQNG--PTSKEPNVRIGGVGDASHTLQT 241 Query: 1830 
NLLGLEGKAEDYSSEKHDF-VTKLPSSPRTVANEGLNSENVRSNISTRVIECA------- 1675 ++ E +D + K P + + +N+ T C Sbjct: 242 EIVSASQVCSVEECESYDTNMQKAPLGREHFESPSCMEKERENNMGTGPYICPLEKLEST 301 Query: 1674 --DNFQSLSKQGMFGREAVDLQDRNEVHLTAPAQASDEHVAELRKASLLGKSAPTEGLLN 1501 ++F++ + + + +N + + +Q DE + + ++ K +PT Sbjct: 302 AENDFKTPHSENVCAVATEIVGSQNAKEVRSSSQQDDEILPKDNDCAI--KQSPTY---- 355 Query: 1500 KSESLRSHSEGNGHINSHKNGRDNVTKDXXXXXXXXXXXXXXXXXSTRKRELAFEPESSS 1321 S + R +G S N + + ST KR+ F+P S Sbjct: 356 -SRTRRYQMKGKAKALSDGNLNERMLDMDDDSHESVESCNSVGLFSTGKRQRNFDPHSYV 414 Query: 1320 ENKRLKTQAQDKFCSGSFHKQESSFMNWISTMTNGFSRSYQEKPLNQPLPIAH------D 1159 +K +KT+ Q+ S SF K + SFMNWIS M GF +S +++ + L +A+ D Sbjct: 415 GSKSIKTKIQESPGSSSFVKHDGSFMNWISNMMKGFLKSNEDEAPSLALTLANHKHGHED 474 Query: 1158 TNKG----------SCTNIGFGSIFHSLHSPRLLIQDRAQKDLDSQRVADVLSEQDDREQ 1009 +K C +GF S+F SL+ P+ Q+ + ++Q D++ Sbjct: 475 RDKNLISCNRNQDQGCKTMGFHSLFQSLYCPKTKAQETVALNANTQTEGSKELGLDNKIC 534 Query: 1008 ASTGAGLVGSDGLDSNLQNAIGTSSKATNS-SLKGIVCHEHVKLPTGALHSNENLKQATC 832 S + D+ + + + K S S G KL + + S + + + Sbjct: 535 DSNATPIPCRMVTDNVYKRFLQPNEKLNESTSGNGTASPALTKLLSTNIASGQEISGSNS 594 Query: 831 VDEALPLNTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKALTTTLEGKAIGTIPSVLN 652 ++ N+ +++ K G N Q + EGKA T Sbjct: 595 AEKK---NSCNMATDKEKN---GTSSNSSRGKRKMNDAEQPS-----EGKATNT------ 637 Query: 651 GSSNLVSKKRGAFRESLWISRLLPKVSVSIPEPANCSHGVELSNEKHT-----KITEKSC 487 S R SLWI+RL PK S + C + + T K ++ Sbjct: 638 ------SGYRSDPLTSLWITRLSPKTSGPLSNRDLCHRRTGEALDGFTDFIRLKAQWQNH 691 Query: 486 PSLFGQKSFARGTIKAQGHSDSDG------SNGTNA-----------DGSSKSKLNCKLP 358 PS + K+ + + H D +N T D S K+N LP Sbjct: 692 PSSYQDKNIVGA--REEEHFTEDPVCMHNCANSTEVSFSINKVNGHHDEKSMCKMNSTLP 749 Query: 357 SQKLIKSEPMASVFARRLDAIKHITPAKTMNDKTSMLGTCFFCGKVGHSLKECPQLTESE 178 + SE MASVFARRLDA+ HI P+ +D + TCFFCG H +++CP++ +SE Sbjct: 750 FSRFRNSEAMASVFARRLDALMHIMPSYGTDDSSHGNLTCFFCGIKCHHVRDCPEIIDSE 809 Query: 177 LQDILRDLNSYDNTDGFLSICIRCFGFNHWAISCPFESSKIKN 49 L DILR+ NS++ + F +CIRCF NHWA++CP SS+ ++ Sbjct: 810 
LADILRNANSFNGANEFPCVCIRCFQSNHWAVACPSASSRTRH 852 >ref|XP_002517012.1| hypothetical protein RCOM_0908960 [Ricinus communis] gi|223543647|gb|EEF45175.1| hypothetical protein RCOM_0908960 [Ricinus communis] Length = 1067 Score = 182 bits (461), Expect = 5e-43 Identities = 218/882 (24%), Positives = 353/882 (40%), Gaps = 58/882 (6%) Frame = -1 Query: 2520 GISSCPVSNYLKKNSGAGANANSRTDMVLITTNPLSELVWSPQKGLSLKYANSSLSEKKA 2341 G S+ + L + GAGANA S D+ + T+PLSELVWSP KGLSL+ A+ S +KK Sbjct: 19 GYSNQCIQRNLSNDPGAGANAASTADITFVATDPLSELVWSPHKGLSLRCADGSFIDKKP 78 Query: 2340 SLLWNAESFNIMILPPQCPNVGESSKAMDTIDRNLNPVQLEINSESKNSNREAPPSPPQS 2161 SLL P VG + A + +S+ SN Sbjct: 79 SLL---------------PGVGPTYMASGS------------SSDKPISNTGKLFDNEIC 111 Query: 2160 VAGMQPISLTL-IHEQHSRSY--GHMGQFGSTSVNLDNPEKDKNEEILHSKSISRGDEVW 1990 +A + L I +S ++ ++G + LD + GD+V Sbjct: 112 IASLPACKLASEISGDNSTTFLTSNVGIMPLSGTGLDKT--------------ATGDQVV 157 Query: 1989 KNVKSAVDVMPEAFNLDNKKGPGDLKLNSAQVECEPISNFIQHFRGSIGTRKDNLLGLE- 1813 + +K+AV+ + +L N K + KL+ AQ F + + D+ LG+E Sbjct: 158 E-MKNAVNYFLQKEDLRNDKAEDETKLDVAQ----NYRTFEEPIVRATDVNDDHELGMEI 212 Query: 1812 GKAEDYSSEK--HDFVTKLPSSPRTVANEGLNSENVRSN-------ISTRVIECADNFQS 1660 D+ + K D+ K+ ++ + E +VR I I D +S Sbjct: 213 VLVSDFHTVKGREDYGIKIQNAACS-GKENEEPPSVREKERKNKMVIGRPGIFSLDKLES 271 Query: 1659 LSKQGM---FGREAVDLQDRNEVHLTAP-AQASDEHVAELRKASLLGKSAPTEGLLNKSE 1492 ++ + FG + ++++N +A + + +H + +L +PT L + Sbjct: 272 TAENDLETPFGENSCSMRNKNLASESADRVENNTQHELIPIEYALGYNQSPTSSRLQNIQ 331 Query: 1491 SLRSHSEGNGHINSHKNGRDNVTKDXXXXXXXXXXXXXXXXXSTRKRELAFEPESSSENK 1312 +G S + ++ + + ST K+ F+ + +K Sbjct: 332 R-----QGQSKALSDGDAKERMLNEEDGSHESVESCNSTELFSTGKQRWNFDQQLIVGSK 386 Query: 1311 RLKTQAQDKFCSGSFHKQESSFMNWISTMTNGFSRS----------------YQEKPLNQ 1180 R+K Q QD S S KQ+SSF+NWIS M GF +S Y + +Q Sbjct: 387 RVKRQIQDSPGSSSLGKQDSSFVNWISNMMKGFLKSSEGEAPFLSSALSNPNYGHENPSQ 446 Query: 1179 PLPIAHDTNKGSCTNIGFGSIFHSLHSPRLLIQDRAQKDLDSQRVADVLSEQDDREQAST 1000 + + +C GF S+F SL+ + Q+ +++ Q +QD++ Sbjct: 447 
DVFTCNRKEDPACDTRGFQSVFQSLYCRKTKGQETVTLNVNHQTEGSKECDQDNKI-CDL 505 Query: 999 GAGLVGSDGLDSNLQNAIGTSSKATNSSLKGIVCHEHVKLPTGALHSNENLKQATCVDEA 820 A + + N+ S++ N G H + +HS + + E+ Sbjct: 506 NAAPIACRMVTGNVYKRFLPSNEKHNEPTSGY----HAGM---TVHSRDISMSFPVIPES 558 Query: 819 LPL------NTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKALTTTLEGKAIGTIPSV 658 N+ ++ GK + G NF T+ GK +PS Sbjct: 559 NGSVSTENKNSCNLAIGKEKD---GTDSNFSHGKHK----------TSSAGKIDPELPSE 605 Query: 657 LNGSSNLVSKKRGAFRESLWISRLLPKVS----------VSIPEPANCSHGV-------- 532 + K G SLWI+R PK S S E NCS Sbjct: 606 DKTAHGFGYK--GDPLGSLWIARFSPKTSGAPFNHYPSNKSTGEAFNCSADSMGLIPQVQ 663 Query: 531 -ELSNEKHTKITEKSCPSLFGQKSFARGTIKAQGHSDSDGSNGTNADGSSKSKLNCKLPS 355 L + +I E + + + D G N D S +KLN L S Sbjct: 664 NPLGSSSEHEIVEVRNKNFQEPLPIQNYSTANRAPFDFYNVKG-NIDNDSGNKLNPILSS 722 Query: 354 QKLIKSEPMASVFARRLDAIKHITPAKTMNDKTSMLGTCFFCGKVGHSLKECPQLTESEL 175 ++ SE MASV RRLDA K+ITP+ ++ TCFFCG GH L+EC ++T++EL Sbjct: 723 ARVKTSEAMASVSPRRLDAPKYITPSDDADNSDRASMTCFFCGIKGHDLRECSEVTDTEL 782 Query: 174 QDILRDLNSYDNTDGFLSICIRCFGFNHWAISCPFESSKIKN 49 +D+LR++N Y +CIRCF NHWA++CP ++++ Sbjct: 783 EDLLRNINIYGGIKELPCVCIRCFQLNHWAVACPSTCPRVRS 824 >ref|XP_002312571.1| predicted protein [Populus trichocarpa] gi|222852391|gb|EEE89938.1| predicted protein [Populus trichocarpa] Length = 779 Score = 163 bits (412), Expect = 2e-37 Identities = 135/477 (28%), Positives = 211/477 (44%), Gaps = 39/477 (8%) Frame = -1 Query: 1362 TRKRELAFEPESSSENKRLKTQAQDKFCSGSFHKQESSFMNWISTMTNGFSRSYQEKPLN 1183 T KR+ F+P S +K +KT+ Q+ S SF K + SFMNWIS M GF +S +++ + Sbjct: 210 TGKRQRNFDPHSYVGSKSIKTKIQESPGSSSFVKHDGSFMNWISNMMKGFLKSNEDEAPS 269 Query: 1182 QPLPIAH------DTNKG----------SCTNIGFGSIFHSLHSPRLLIQDRAQKDLDSQ 1051 L +A+ D +K C +GF S+F SL+ P+ Q+ + ++Q Sbjct: 270 LALTLANHKHGHEDRDKNLISCNRNQDQGCKTMGFHSLFQSLYCPKTKAQETVALNANTQ 329 Query: 1050 RVADVLSEQDDREQASTGAGLVGSDGLDSNLQNAIGTSSKATNS-SLKGIVCHEHVKLPT 874 D++ S + D+ + + + K S S G KL + Sbjct: 330 TEGSKELGLDNKICDSNATPITCPMVTDNVYKRFLQPNEKLNESTSGNGAASPALTKLLS 389 Query: 873 
GALHSNENLKQATCVDEALPLNTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKALTTT 694 + S++ + + ++ N+ +++ K G N Q + Sbjct: 390 TNIASSQEISGSNSAEKK---NSCNMATDKEKN---GTSSNSSPGKRKMNDAEQPS---- 439 Query: 693 LEGKAIGTIPSVLNGSSNLVSKKRGAFRESLWISRLLPKVSVSIPEPANCSHGVELSNEK 514 EGKA T S R SLWI+RL PK S + C + + Sbjct: 440 -EGKATNT------------SGYRSDPLTSLWITRLSPKTSGPLSNRDLCHRRTGEALDG 486 Query: 513 HT-----KITEKSCPSLFGQKSFARGTIKAQGHSDSDG------SNGTNA---------- 397 T K ++ PS + K+ + + H D +N T Sbjct: 487 FTDFIRLKAQWQNHPSSYQDKNIVGA--REEEHFTEDPVCMHNCANSTEVSFSINKVNGH 544 Query: 396 -DGSSKSKLNCKLPSQKLIKSEPMASVFARRLDAIKHITPAKTMNDKTSMLGTCFFCGKV 220 D S K+N LP + SE MASVFARRLDA+ HI P+ +D + TCFFCG Sbjct: 545 HDEKSMCKMNSTLPFSRFRNSEAMASVFARRLDALMHIMPSYGTDDSSHGNLTCFFCGIK 604 Query: 219 GHSLKECPQLTESELQDILRDLNSYDNTDGFLSICIRCFGFNHWAISCPFESSKIKN 49 H +++CP++ +SEL DILR+ NS++ + F +CIRCF NHWA++CP SS+ ++ Sbjct: 605 CHHVRDCPEIIDSELADILRNANSFNGANEFPCVCIRCFQSNHWAVACPSASSRTRH 661 >ref|XP_002865410.1| zinc knuckle (CCHC-type) family protein [Arabidopsis lyrata subsp. lyrata] gi|297311245|gb|EFH41669.1| zinc knuckle (CCHC-type) family protein [Arabidopsis lyrata subsp. 
lyrata] Length = 759 Score = 130 bits (327), Expect = 2e-27 Identities = 121/433 (27%), Positives = 184/433 (42%), Gaps = 31/433 (7%) Frame = -1 Query: 1278 SGSFH--KQESSFMNWISTMTNGFSRSYQEKPL-----------------------NQPL 1174 SGS+ KQ+SSFMNWIS MT G + +E Q Sbjct: 222 SGSYRRPKQDSSFMNWISNMTKGIWKGNEEDDSPFAALTTTSDANGHGQVNAIVDQQQLS 281 Query: 1173 PIAHDTNKGSCTNIGFGSIFHSLHSPRLLIQDRAQKDLDSQRVADVLSEQDDREQASTGA 994 P N G C N GF S+F S++ P+ QD + D + A L E + Sbjct: 282 PCCVKENSG-CRNTGFQSLFQSIYCPKKRSQDAVEMDFPNDANATSLQELPWIPEQ---C 337 Query: 993 GLVGSDGLDSNLQNAIGTSSKATNSSLKGIVCHEHVKLPTGALHSNENLKQATCVDEALP 814 G+ D L S+ N IG ++ SS K L S+EN ++ D+ Sbjct: 338 GIAKGDDLSSS-DNDIGPVAEPNISSGKVGFNQRSETL------SSENKRE----DKEPN 386 Query: 813 LNTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKALTTTLEGKAIGTIPSVLNGSSNLV 634 ++ + +S K +E+ + G+A G + LN Sbjct: 387 ISLMSLSKSKPNEEP------------------------KICGEAGGKVSPCLNN----- 417 Query: 633 SKKRGAFRESLWISRLLPKVSVSIPEPANCSHGVELSNEKHTKITEKSCPSLFGQKSFAR 454 R + +SLWISR K + + + V S K + QK Sbjct: 418 ---RNSGLQSLWISRFSSKSPFPQKKTSETAKEVNASASDTAKTHDS-------QKMLVN 467 Query: 453 GTIKAQGHSDSDGSNGTNADGSSKSKLNCKLP---SQKLIKSEPMASVFARRLDAIKHIT 283 + S DG + KLN LP S ++ SE MAS+FARRL+A+KHI Sbjct: 468 NNVVIPSISSVDGLD----------KLNTVLPIVSSMRIESSEAMASLFARRLEAMKHII 517 Query: 282 PAKTMNDKTSMLGT---CFFCGKVGHSLKECPQLTESELQDILRDLNSYDNTDGFLSICI 112 PA ++ + CF+CGK GH L++C ++T++EL+D++++++S + + S+CI Sbjct: 518 PAGSLAENAEEEQPNLICFYCGKKGHCLQDCLEVTDTELRDLVQNISSRNGREEASSLCI 577 Query: 111 RCFGFNHWAISCP 73 RCF +HWA +CP Sbjct: 578 RCFQLSHWAATCP 590 >ref|XP_004170660.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224596 [Cucumis sativus] Length = 1004 Score = 128 bits (322), Expect = 7e-27 Identities = 127/471 (26%), Positives = 190/471 (40%), Gaps = 36/471 (7%) Frame = -1 Query: 1362 TRKRELAFEPESSSENKRLKTQAQDKFCSGSFHKQESSFMNWISTMTNGFSRSYQEKPLN 1183 T KR +FE NKR K Q + S Q+SSFM WIS M GFS S Q++ Sbjct: 376 TSKRRWSFEQRLIVGNKRAKKQDGNASGPTSNLGQDSSFMIWISNMMKGFSESIQDEAPT 435 Query: 1182 
QPL---------------PIAHDTNKGSCTNIGFGSIFHSLHSPRLLIQDRA-----QKD 1063 L PI N + IGF SIF SL++P + ++ A Q Sbjct: 436 LDLTLAKCDVEQGGPNEEPIYKKINAPGFSGIGFQSIFRSLYNPTMRGEEGAPSATCQAK 495 Query: 1062 LDSQRVADVLSEQDDREQASTGAGLVGSDGLDSNLQNAIGTSSKATNSSLKGIVCHEHVK 883 +++ + + + D G G L N T + N I + Sbjct: 496 QEAKGIEIIKNSCDLNATPIACFGESDHFGKQLLLNNENATDLISGNGPTLLIQLKNSPE 555 Query: 882 LPTGALHSNENLKQATCVDEALPLNTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKAL 703 + G+ S++ Q L +S+ + E +G K Sbjct: 556 ISCGSHQSHKTRSQGNQNSSNL------VSAAGTGEVMHSALG--------------KCK 595 Query: 702 TTTLEGKAIGTIPSVLNGSSNLVSKKRGAFRESLWISRLLPKVS--VSIPEPANCSHGVE 529 + E + +N ++ VS +SLWISR K S S PE +N + + Sbjct: 596 SNGTENVDCDQLCGKINHTTGNVSDPL----KSLWISRFAAKASGFTSNPETSNLNTKDD 651 Query: 528 LSNEKHTKITEKSCP--------------SLFGQKSFARGTIKAQGHSDSDGSNGTNADG 391 H+ CP ++ ++ T + GH + +++ Sbjct: 652 SQCSMHSP-RHMPCPQNHIDHHSMDDLDTAVSKEQHNIANTETSPGHKEFKD----HSEQ 706 Query: 390 SSKSKLNCKLPSQKLIKSEPMASVFARRLDAIKHITPAKTMNDKTSMLGTCFFCGKVGHS 211 S SK L S K+ E MASVFARRL A+KHI P+ + + TCFFCG GH+ Sbjct: 707 KSISKFKSALRSPKIRSPEAMASVFARRLGALKHIIPSDLTINVGNETVTCFFCGTKGHN 766 Query: 210 LKECPQLTESELQDILRDLNSYDNTDGFLSICIRCFGFNHWAISCPFESSK 58 L C ++TE E++D+ R++ + T CIRCF NHWAI+CP ++ Sbjct: 767 LHNCSEITEREIEDLSRNIRFCNETVDPPCSCIRCFQLNHWAIACPLAPAR 817 Score = 71.6 bits (174), Expect = 1e-09 Identities = 41/102 (40%), Positives = 58/102 (56%) Frame = -1 Query: 2502 VSNYLKKNSGAGANANSRTDMVLITTNPLSELVWSPQKGLSLKYANSSLSEKKASLLWNA 2323 + L SG GANA S D+ +TT+ LSELVWSP KGLSL+ A+SS + +K S+LW+A Sbjct: 25 IQGRLTNRSGVGANAGSMVDVKYVTTDSLSELVWSPHKGLSLRCADSSFNNRKTSILWDA 84 Query: 2322 ESFNIMILPPQCPNVGESSKAMDTIDRNLNPVQLEINSESKN 2197 + PQ + E S + + +D N + + S KN Sbjct: 85 AANKANFALPQSV-IAEKSTSNNLLD-NRTIILSQAESHLKN 124