BLASTX nr result
ID: Dioscorea21_contig00024757
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00024757
         (826 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...   187   2e-45
gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...   185   1e-44
gb|AAB84340.1| putative non-LTR retroelement reverse transcripta...   184   2e-44
ref|XP_002449295.1| hypothetical protein SORBIDRAFT_05g007323 [S...   182   6e-44
ref|XP_003528143.1| PREDICTED: uncharacterized protein LOC100778...   182   8e-44

>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis
          thaliana (fragment)
          Length = 1365

 Score =  187 bits (476), Expect = 2e-45
 Identities = 107/264 (40%), Positives = 158/264 (59%), Gaps = 4/264 (1%)
 Frame = +1

Query: 1   PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
           PK  +P  ++D RPISLC+V YK+I+K+L  RLK  +  +V + QS F+P R   DNI+
Sbjct: 492 PKITSPQRMSDLRPISLCSVLYKIISKILTQRLKKHLPAIVSTTQSAFVPQRLISDNILV 551

Query: 181 AQEVAHSLET-DSSNPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNS 357
           A E+ HSL T D  +   M  K D+ KA+D +EWP +   +  + F + WI+WI +C+ S
Sbjct: 552 AHEMIHSLRTNDRISKEHMAFKTDMSKAYDRVEWPFLETMMTALGFNNKWISWIMNCVTS 611

Query: 358 ASFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGF---NR 528
            S+S++ING+       +RG+RQGDPLSP LF+L ++ L  ILNKA     I+G    ++
Sbjct: 612 VSYSVLINGQPYGHIIPTRGIRQGDPLSPALFVLCTEALIHILNKAEQAGKITGIQFQDK 671

Query: 529 SLSNNFIHLMFADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFVPSWCNKR 708
            +S N  HL+FADD LL+ KA+++     + CL+ Y  L+GQ  NL KSA       + +
Sbjct: 672 KVSVN--HLLFADDTLLMCKATKQECEELMQCLSQYGQLSGQMINLNKSAITFGKNVDIQ 729

Query: 709 LTTSISNILGINPGVFPFLYLGVP 780
           +   I +  GI+       YLG+P
Sbjct: 730 IKDWIKSRSGISLEGGTGKYLGLP 753

>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266
          [Arabidopsis thaliana]
          Length = 1142

 Score =  185 bits (469), Expect = 1e-44
 Identities = 107/264 (40%), Positives = 150/264 (56%), Gaps = 2/264 (0%)
 Frame = +1

Query: 1   PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
           PK   P  + + RPISLCNV YK+I+K+L  RLK V+  L+   QS F+ GR   DNI+
Sbjct: 277 PKTERPTRMTELRPISLCNVGYKVISKILCQRLKTVLPNLISETQSAFVDGRLISDNILI 336

Query: 181 AQEVAHSLETDSS-NPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNS 357
           AQE+ H L T+SS     M  K D+ KA+D +EW  I A LR+M F + WI+WI  C+ +
Sbjct: 337 AQEMFHGLRTNSSCKDKFMAIKTDMSKAYDQVEWNFIEALLRKMGFCEKWISWIMWCITT 396

Query: 358 ASFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGFNRSLS 537
             + ++ING+        RG+RQGDPLSP LFIL ++ L A + KA   +LI+G   +
Sbjct: 397 VQYKVLINGQPKGLIIPERGLRQGDPLSPYLFILCTEVLIANIRKAERQNLITGIKVATP 456

Query: 538 NNFI-HLMFADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFVPSWCNKRLT 714
           +  + HL+FADD L   KA++      L  L  Y +++GQ+ N  KS+          +
Sbjct: 457 SPAVSHLLFADDSLFFCKANKEQCGIILEILKQYESVSGQQINFSKSSIQFGHKVEDSIK 516

Query: 715 TSISNILGINPGVFPFLYLGVPIS 786
             I  ILGI+       YLG+P S
Sbjct: 517 ADIKLILGIHNLGGMGSYLGLPES 540

>gb|AAB84340.1| putative non-LTR retroelement reverse transcriptase
          [Arabidopsis thaliana]
          Length = 1094

 Score =  184 bits (468), Expect = 2e-44
 Identities = 105/262 (40%), Positives = 149/262 (56%), Gaps = 2/262 (0%)
 Frame = +1

Query: 1   PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
           PK   P  + D RPISLC+V YK+I+K+L+ RLK  +  +V   QS F+  R   DNII
Sbjct: 222 PKITKPARMADIRPISLCSVMYKIISKILSARLKKYLPVIVSPTQSAFVAERLVSDNIIL 281

Query: 181 AQEVAHSLETDSS-NPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNS 357
           A E+ H+L T+   +   M+ K D+ KA+D +EWP +   L  + F  +WI W+ +C++S
Sbjct: 282 AHEIVHNLRTNEKISKDFMVFKTDMSKAYDRVEWPFLKGILLALGFNSTWINWMMACVSS 341

Query: 358 ASFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGFN-RSL 534
            S+S++ING+        RG+RQGDPLSP LF+L ++ L  ILN+A     ISG
Sbjct: 342 VSYSVLINGQPFGHITPHRGLRQGDPLSPFLFVLCTEALIHILNQAEKIGKISGIQFNGT 401

Query: 535 SNNFIHLMFADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFVPSWCNKRLT 714
             +  HL+FADD LLI KAS+      + CL+ Y +++GQ  N  KSA    +  N+
Sbjct: 402 GPSVNHLLFADDTLLICKASQLECAEIMHCLSQYGHISGQMINSEKSAITFGAKVNEETK 461

Query: 715 TSISNILGINPGVFPFLYLGVP 780
             I N  GI        YLG+P
Sbjct: 462 QWIMNRSGIQTEGGTGKYLGLP 483

>ref|XP_002449295.1| hypothetical protein SORBIDRAFT_05g007323 [Sorghum bicolor]
 gi|241935138|gb|EES08283.1| hypothetical protein SORBIDRAFT_05g007323
          [Sorghum bicolor]
          Length = 531

 Score =  182 bits (463), Expect = 6e-44
 Identities = 105/269 (39%), Positives = 153/269 (56%)
 Frame = +1

Query: 1   PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
           PKK NP +VNDFRPI+L N+  KL+TKLLA+RL++VI KLV + Q GFI  R   D +
Sbjct: 177 PKKENPETVNDFRPIALMNISLKLLTKLLADRLQVVILKLVHTNQYGFIRSRAIQDCLAW 236

Query: 181 AQEVAHSLETDSSNPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNSA 360
           + E  H  +   S    ++ K+D EKAFD +E   ++  +  +  PD WI W+++ L+SA
Sbjct: 237 SYEYIHQCQ--QSRRETIILKLDFEKAFDMVEHSTMIQVMSHLGMPDRWIQWVSTILSSA 294

Query: 361 SFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGFNRSLSN 540
           S ++++NG    +F   RGVRQGDPLSPLLF+L ++ L  I+N+A+   LI
Sbjct: 295 STAVLLNGTAGKFFKCKRGVRQGDPLSPLLFVLAAELLQIIINRAMIMGLIHKPLPQDGE 354

Query: 541 NFIHLMFADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFVPSWCNKRLTTS 720
           ++  + +ADD LL  +A  R        LN +   TG K N  KS  + P   +
Sbjct: 355 DYPIVQYADDTLLFMQADARQLVFLKAILNSFSESTGLKINYSKSHMY-PINVSANKMNI 413

Query: 721 ISNILGINPGVFPFLYLGVPISPKKLKIN 807
           ++   G + G  PF YLG+P+   K KI+
Sbjct: 414 LAGTFGCDIGSMPFTYLGLPMGTTKPKID 442

>ref|XP_003528143.1| PREDICTED: uncharacterized protein LOC100778359
          [Glycine max]
          Length = 2621

 Score =  182 bits (462), Expect = 8e-44
 Identities = 100/266 (37%), Positives = 150/266 (56%), Gaps = 5/266 (1%)
 Frame = +1

Query: 1    PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
            PKK +P  +ND+RPISL    YK++ K+LA R+K V+  ++   QS FI GR    +++
Sbjct: 1164 PKKVDPQVLNDYRPISLIGCMYKIVAKILAKRIKTVLPTIINEAQSAFIEGRHLLQSVLI 1223

Query: 181  AQEVAHSLETDSSNPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNSA 360
            A EV    E   S+ P ++ K+D EKA+DS+ W  +L  L+R  F   WI+W+  CL SA
Sbjct: 1224 ANEVID--EAKRSHKPCLIFKVDYEKAYDSVSWNFLLYMLKRTGFCPKWISWMEGCLKSA 1281

Query: 361  SFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGFNRSLSN 540
            S S+++NG  +  F   RG+RQGDPL+P LF +V++ L  ++  AL+ +L  GFN + S
Sbjct: 1282 SISVLVNGSPTKEFKPQRGLRQGDPLAPFLFNIVAEALNGLMRTALAANLYKGFNIASSE 1341

Query: 541  NFIHLM-FADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFV----PSWCNK 705
              I L+ +ADD +   +AS +N +     L  +  ++G K N  KS+F        W
Sbjct: 1342 ISISLLQYADDTIFFGEASMKNVKVLKAILRTFEVVSGLKINFAKSSFGAFGRDDQWRQM 1401

Query: 706  RLTTSISNILGINPGVFPFLYLGVPI 783
              T      L  +    PF+YLG+PI
Sbjct: 1402 AAT-----YLNCSQLALPFVYLGIPI 1422
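
The percentages on the Identities / Positives / Gaps lines can be cross-checked against the raw counts. The sketch below is not part of the BLAST output: the helper name `check` and the regular expression are illustrative, and the use of integer division (truncation rather than rounding) is only an assumption inferred from the numbers in this report, for example 158/264 reported as 59%.

```python
import re

# Statistics lines copied from the five alignments in this report.
STATS = [
    "Identities = 107/264 (40%), Positives = 158/264 (59%), Gaps = 4/264 (1%)",
    "Identities = 107/264 (40%), Positives = 150/264 (56%), Gaps = 2/264 (0%)",
    "Identities = 105/262 (40%), Positives = 149/262 (56%), Gaps = 2/262 (0%)",
    "Identities = 105/269 (39%), Positives = 153/269 (56%)",
    "Identities = 100/266 (37%), Positives = 150/266 (56%), Gaps = 5/266 (1%)",
]

# Matches fields such as "Identities = 107/264 (40%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+)%\)")


def check(line):
    """Return (name, count, length, reported_pct, recomputed_pct) per field.

    The recomputed value uses integer division, which is an assumption
    inferred from this report, not documented BLAST behaviour.
    """
    rows = []
    for name, count, length, pct in FIELD.findall(line):
        count, length, pct = int(count), int(length), int(pct)
        rows.append((name, count, length, pct, 100 * count // length))
    return rows


if __name__ == "__main__":
    for line in STATS:
        for name, count, length, reported, recomputed in check(line):
            flag = "ok" if reported == recomputed else "MISMATCH"
            print(f"{name:<10} {count}/{length} reported {reported}% "
                  f"recomputed {recomputed}% {flag}")
```

Running the script prints one line per field, which makes a transcription error in any count or percentage easy to spot.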