BLASTX nr result
ID: Dioscorea21_contig00017703
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00017703
         (1195 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...   149   2e-33
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...   144   3e-32
emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulga...   138   2e-30
ref|XP_002489257.1| hypothetical protein SORBIDRAFT_0011s003210 ...   137   4e-30
gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptas...   137   5e-30

>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1378

 Score = 149 bits (375), Expect = 2e-33
 Identities = 117/402 (29%), Positives = 184/402 (45%), Gaps = 10/402 (2%)
 Frame = -3

Query: 1178 TVTPIAKSRFVLHLVITNAKKESWILSTVYNSSRLLDQRYVWYELAGITSLNL-PWILIG 1002
            TVTP L + I + W+ S +Y S ++ +W EL I + PW+L G
Sbjct: 78   TVTPYGSHSQHLTVEIRRIGDDPWLFSAIYASPDSTLRKELWRELEQIKNQYTGPWLLAG 137

Query: 1001 DFNTILSNSEFQGGSWNYYRRKSLVFSEFININNLLEVNFTGPSFTWCNNQRGAARKWAL 822
            DFN S E G + +R+ F+ +I N L+++ FTGP+ TW K A
Sbjct: 138  DFNETSSLCERNGSESSEMQRRCKDFANWIENNALIDLGFTGPAHTWSRGLSPTTFKSAR 197

Query: 821  LDRCFLNPFSSASLDNFIINHLPRVFSDHAPLLLTLT-----PRVSTGKRIFRFDNYWLD 657
            LDR N ++ +LP+ SDH P+L++ + PR+ + FRF WL+
Sbjct: 198  LDRGLANSEWKLKFTEGVVRNLPKSQSDHCPILISTSGFAPVPRII---KPFRFQAAWLN 254

Query: 656  YLGSHEAVRKAWNFSPHSNP-LHAFAHFLSRAKHNLLCWKNSGLNSIDSSIKHLEMEILA 480
            + E VRK WN P L +FA L++ W +I L I
Sbjct: 255  HQVFCEFVRKNWNADAPIVPFLKSFADKLNK-------WNKEEFYNIFRKKSELWARISG 307

Query: 479  AEEDDHLQGTFNSN--VTVLSPLYNQLAALHRQNAIKLAQRARQAWVIGGDLNSNFFHNS 306
            + L T N + + + L ++ + Q++R + GD N+ +FH S
Sbjct: 308  VQA---LLSTGRQNHLIKLEAKLRREMDIVLDDEETLWFQKSRMEAICDGDRNTRYFHLS 364

Query: 305  IRSRNHLNHINMIHSPSGDCFTNPNEIEDVFCSFFSNLWTEPSSAFVADLMQALPND-LP 129
            R N I+M+ + G+ +NP E++ + ++ +L++E S V LP D P
Sbjct: 365  TVIRRSRNRIDMLQNNDGEWISNPMEVKAMVLGYWKHLFSEDS---VQSNFCHLPRDFFP 421

Query: 128  CISELDGELLTKDITKKEVYLALKSLPSGKSPGPDGFNSEFY 3
            I+ D E + + +++ EV LALKS+ K+PGPDGF FY
Sbjct: 422  QITADDFEKMMRPLSEVEVTLALKSMKPFKAPGPDGFQPLFY 463

>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score = 144 bits (364), Expect = 3e-32
 Identities = 91/344 (26%), Positives = 159/344 (46%), Gaps = 5/344 (1%)
 Frame = -3

Query: 1019 PWILIGDFNTILSNSEFQGGSWNYYRRKSLVFSEFININNLLEVNFTGPSFTWCNNQRGA 840
            PW+ GDFN +L SE +GG + R++ +F + + +++ F G FTW NN+ G
Sbjct: 136  PWLCGGDFNLMLVASEKKGGD-GFNSREADIFRNAMEECHFMDLGFVGYEFTWTNNRGGD 194

Query: 839  ARKWALLDRCFLNPFSSASLDNFIINHLPRVFSDHAPLLLTLTPRVSTGKRI-----FRF 675
            A LDR N ++HLP+ SDH P++ ++ S R FRF
Sbjct: 195  ANIQERLDRFVANDLWKIKFPGSFVSHLPKRKSDHVPIVASVKGAQSAATRTKKSKRFRF 254

Query: 674  DNYWLDYLGSHEAVRKAWNFSPHSNPLHAFAHFLSRAKHNLLCWKNSGLNSIDSSIKHLE 495
            + WL S E V++ W + L+R + LL W + I+ +
Sbjct: 255  EAMWLREGESDEVVKETWMRGTDAGIN------LARTANKLLSWSKQKFGHVAKEIRMCQ 308

Query: 494  MEILAAEEDDHLQGTFNSNVTVLSPLYNQLAALHRQNAIKLAQRARQAWVIGGDLNSNFF 315
            ++ E + + N+ + L ++ L ++ + QR+RQ W+ GD N+ FF
Sbjct: 309  HQMKVLMESEPSE----DNIMHMRALDARMDELEKREEVYWHQRSRQDWIKSGDKNTKFF 364

Query: 314  HNSIRSRNHLNHINMIHSPSGDCFTNPNEIEDVFCSFFSNLWTEPSSAFVADLMQALPND 135
            H R N++ I + +G+ F + +++ + F +F NL+ ++ + ++ +
Sbjct: 365  HQKASHREQRNNVRRIRNEAGEWFEDEDDVTECFAHYFENLFQSGNNCEMDPILNIVK-- 422

Query: 134  LPCISELDGELLTKDITKKEVYLALKSLPSGKSPGPDGFNSEFY 3
            P I++ G L ++EV AL + K+PGPDG N+ FY
Sbjct: 423  -PQITDELGTQLDAPFRREEVSAALAQMHPNKAPGPDGMNALFY 465

>emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score = 138 bits (348), Expect = 2e-30
 Identities = 114/383 (29%), Positives = 173/383 (45%), Gaps = 6/383 (1%)
 Frame = -3

Query: 1136 VITNAKKESWILSTVYNSSRLLDQRYVWYELAGIT-SLNLPWILIGDFNTILSNSEFQGG 960
            ++T+ K S++ +YN L D+ VW ELA + S P++LIGDFN +L S+ G
Sbjct: 97   ILTSGFKCSFV--NIYNPCDLNDRAQVWLELAQLCISSESPYLLIGDFNEVLDPSD--RG 152

Query: 959  SWNYYRRKSLVFSEFININNLLEVNFTGPSFTWCNNQRGAARKWALLDRCFLNPFSSASL 780
            S F F+ + L+E+ T FTW Q + LDR F++P
Sbjct: 153  SQIVSTNGIHAFKSFVQVLELIEITPTTGKFTWFRGQSKSK-----LDRMFIHPQWLDLF 207

Query: 779  DNFIINHLPRVFSDHAPLLLTLTPRVSTGKRIFRFDNYWLDYLGSHEAVRKAWNFSPHSN 600
            I+ L R SDH P+L+ T + G R FRF + WL + G + + K W H
Sbjct: 208  PTLQISLLKRTLSDHCPILVQ-TKLKNWGPRPFRFIDAWLSHPGCLKLISKTW-LEAHDC 265

Query: 599  PLHAFAHFLSRAKHNLLCWKNSGLNSIDSSIKHLEMEILAAEEDDHLQGTFNSNVTVLSP 420
            +F+ L + K +LL W ID I+ LE +I +E D + N L
Sbjct: 266  ---SFSEKLKKVKSSLLKWNAEEFGCIDEKIQSLENKI---QEMDRIADDRNLEANELEE 319

Query: 419  LYNQLAALH---RQNAIKLAQRARQAWVIGGDLNSNFFHNSIRSRNHLNHINMIHSPSGD 249
            L ++ + AQ++R W+ GD N+ +FH R N I +
Sbjct: 320  RRKSQMDLWIWMKRKEVLWAQQSRVKWIKEGDRNTRYFHIMATMRRKKNAIESLIIEQKQ 379

Query: 248  CFTNPNEIEDVFCSFFSNLWTEPSSA--FVADLMQALPNDLPCISELDGELLTKDITKKE 75
            +P +++ S+FS L+TE S DL + +++ E+LT T+ E
Sbjct: 380  -IDSPEDLKAAAVSYFSELFTEELSPRPVFGDL------NFKQLNDSHREILTSQFTRSE 432

Query: 74   VYLALKSLPSGKSPGPDGFNSEF 6
            + A+ S KSPGPDGFN +F
Sbjct: 433  IDEAVSSCDGSKSPGPDGFNFKF 455

>ref|XP_002489257.1| hypothetical protein SORBIDRAFT_0011s003210 [Sorghum bicolor]
 gi|241947006|gb|EES20151.1| hypothetical protein SORBIDRAFT_0011s003210 [Sorghum bicolor]
          Length = 821

 Score = 137 bits (346), Expect = 4e-30
 Identities = 103/400 (25%), Positives = 178/400 (44%), Gaps = 9/400 (2%)
 Frame = -3

Query: 1175 VTPIAKSRFVLHLVIT--NAKKESWILSTVYNSSRLLDQRYVWYELAGITSLNL-PWILI 1005
            V + KSR + ++++ + K W L+ Y R +R WY L + + + PW+ +
Sbjct: 153  VAELTKSRSHIDVILSCDHLKISHWRLTGFYGEPRWERRRESWYLLRFLRAQSSDPWLCL 212

Query: 1004 GDFNTILSNSEFQGGSWNYYRRKSLVFSEFININNLLEVNFTGPSFTWCNNQRGAARKWA 825
            GDFN +L+ E G + + + F + +N L ++ F G +TW N Q G
Sbjct: 213  GDFNEVLAMEEQMGANEREMWQVT-AFQDVVNDCALTDLGFHGLPYTWDNRQEGGRNVKV 271

Query: 824  LLDRCFLNPFSSASLDNFIINHLPRVFSDHAPLLLTLT---PRVSTGKRI---FRFDNYW 663
            LDR + L + HLP SDHA LL+ + P T +R FR++N W
Sbjct: 272  RLDRALGDNKFMELLGGSEVFHLPTTESDHAGLLVEVRHQEPGAQTRRRRHKPFRYENMW 331

Query: 662  LDYLGSHEAVRKAWNFSPHSNPLHAFAHFLSRAKHNLLCWKNSGLNSIDSSIKHLEMEIL 483
            HE V + W+ P S L A+ LS + + W S+ +K L ++
Sbjct: 332  KTRGDYHEFVNRTWDPGPGSANLSTVANALSALQGSFKSWDRDIFGSVTKKVKELRAKLE 391

Query: 482  AAEEDDHLQGTFNSNVTVLSPLYNQLAALHRQNAIKLAQRARQAWVIGGDLNSNFFHNSI 303
            +G + ++++ L LA + + QR+R W+ GD + FF
Sbjct: 392  EERRHTLYRGPTDRERSIMAQLTEVLA----REEVMAKQRSRITWLREGDRKTEFFQAKA 447

Query: 302  RSRNHLNHINMIHSPSGDCFTNPNEIEDVFCSFFSNLWTEPSSAFVADLMQALPNDLPCI 123
            + R+ N I ++ G FT+ ++E + F+ L++ + + +P +
Sbjct: 448  KPRSKTNRIKLLMDVDGHVFTDQEDLERLTGDFYQRLFSAQDELLPDLVCKHVPRK---V 504

Query: 122  SELDGELLTKDITKKEVYLALKSLPSGKSPGPDGFNSEFY 3
            + + ELL +++EV AL + K+PG DGFN+ F+
Sbjct: 505  TPVMCELLGAPFSEQEVEEALFCMAPNKAPGVDGFNAGFF 544

>gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H [Medicago truncatula]
          Length = 1296

 Score = 137 bits (345), Expect = 5e-30
 Identities = 107/370 (28%), Positives = 172/370 (46%), Gaps = 6/370 (1%)
 Frame = -3

Query: 1094 VYNSSRLLDQRYVWYELAGIT-SLNLPWILIGDFNTILSNSEFQGGSWNYYRRKSLVFSE 918
            +Y S + +W L I ++ PW+LIGDFN SE +GG++++ R + FS
Sbjct: 105  IYASPNYSMRPNLWNYLVNINDTITGPWMLIGDFNETHLPSEQRGGTFHHNR--AATFSN 162

Query: 917  FININNLLEVNFTGPSFTWCNNQRGAARKWALLDRCFLNPFSSASLDNFIINHLPRVFSD 738
            F+N NLL++ TG FTW N G LDR N S + L R+ SD
Sbjct: 163  FMNNCNLLDLTTTGGRFTWHKNNNGIRILSKKLDRGMANVDWRLSFPEAFVEVLCRLHSD 222

Query: 737  HAPLLLTLTP-RVSTGKRIFRFDNYWLDYLGSHEAVRKAWNFSPHSNPLHAFAHFLSRAK 561
            H PLLL ++ G R FRF+ W+D+ V+++W+ H NP + L +
Sbjct: 223  HNPLLLRFGGLPLTRGPRPFRFEAAWIDHYDYGNVVKRSWSTHTH-NPTAS----LIKVM 277

Query: 560  HNLLCWKNSGLNSIDSSIKHLEMEILAAEEDDHLQGTFNSNVTVL-SPLYNQLAALHRQN 384
            N + + + +I +E + + +L+ + T+L L ++ + Q
Sbjct: 278  ENSIIFNHDVFGNIFQRKSRVEWRLKGVQ--SYLERVDSYRHTLLEKELQDEYNHILFQE 335

Query: 383  AIKLAQRARQAWVIGGDLNSNFFHNSIRSRNHLNHINMIHSPSGDCFTNPNEIEDVFCSF 204
            + Q++R+ WV GD N+ FFH R N I+ + P+G ++ N +++ +
Sbjct: 336  EMLWYQKSREQWVKLGDKNTAFFHAQTVIRRKWNKIHKLQLPNGISTSDSNILQEEALKY 395

Query: 203  FSNLWTE---PSSAFVADLMQALPNDLPCISELDGELLTKDITKKEVYLALKSLPSGKSP 33
            F + P S F + P + + LT ITKKEV+ AL S+ K+P
Sbjct: 396  FKKFFCGSQIPYSRFFNE------GRHPALDDTGKTSLTSPITKKEVFAALNSMKPYKAP 449

Query: 32   GPDGFNSEFY 3
            GPDGF+ F+
Sbjct: 450  GPDGFHCIFF 459