BLASTX nr result
ID: Dioscorea21_contig00018021
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00018021
         (2915 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...   244   1e-61
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...   228   7e-57
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]             226   4e-56
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]             225   6e-56
emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulga...   224   8e-56

>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266
           [Arabidopsis thaliana]
          Length = 1142

 Score = 244 bits (622), Expect = 1e-61
 Identities = 187/688 (27%), Positives = 311/688 (45%), Gaps = 29/688 (4%)
 Frame = +3

Query: 582  KMNFPSVWIAWIESCLRSTSFSFLINGQPSPWIKSCRGVRQGDPLSSSIFILVSQNLTSI 761
            KM F WI+WI C+ + + LINGQP I RG+RQGDPLS +FIL ++ L +
Sbjct: 379  KMGFCEKWISWIMWCITTVQYKVLINGQPKGLIIPERGLRQGDPLSPYLFILCTEVLIAN 438

Query: 762  LNYALAVGMIPGYDSGLRSN-FNHLMFADDLVIISNASRKAVRAIKLALDYYKSFSGQSL 938
            + A +I G S +HL+FADD + A+++ I L Y+S SGQ +
Sbjct: 439  IRKAERQNLITGIKVATPSPAVSHLLFADDSLFFCKANKEQCGIILEILKQYESVSGQQI 498

Query: 939  NASKSSIYFPTWANTIVAASISSILHIPRASFPFKYLGVYISTRRLAKGVFDPLVDSVRL 1118
            N SKSSI F + A I IL I YLG+ S VF + D ++
Sbjct: 499  NFSKSSIQFGHKVEDSIKADIKLILGIHNLGGMGSYLGLPESLGGSKTKVFSFVRDRLQS 558

Query: 1119 RCARWTNSKLSPAAKAILINSTLLSLPIYYLSVYPIHDSVLAEINRIVRRFFWSKSSNGK 1298
            R W+ LS K ++I S +LP Y +S + + ++ +++ V +F+WS + + +
Sbjct: 559  RINGWSAKFLSKGGKEVMIKSVAATLPRYVMSCFRLPKAITSKLTSAVAKFWWSSNGDSR 618

Query: 1299 GIHSVSWSDLINSKPEGGLAIRNLFLMKHALMAKNVMKYINSEDSIWVDILHLKYGTLNF 1478
            G+H ++W L +SK +GGL RN+ AL+AK + + I + DS++ + +Y +
Sbjct: 619  GMHWMAWDKLCSSKSDGGLGFRNVDDFNSALLAKQLWRLITAPDSLFAKVFKGRYFRKS- 677

Query: 1479 WHNPIPA----GCSWFFRGLCRSALKIIPDVRLLSLNPLNTSIWHHPWCSDIPFAW-SPS 1643
            NP+ + S+ +R + + + + + + S+W+ PW IP + P+
Sbjct: 678  --NPLDSIKSYSPSYGWRSMISARSLVYKGLIKRVGSGASISVWNDPW---IPAQFPRPA 732

Query: 1644 CFNINLCSTLNFVSDL--TGHNGWDLDNLGLLFGAHMDDYMLRLGFLVDNEPNHWIWSHK 1817
            + ++ V L + N W++D L LF + L N + W
Sbjct: 733  KYGGSIVDPSLKVKSLIDSRSNFWNIDLLKELFDPEDVPLISALPIGNPNMEDTLGWHFT 792

Query: 1818 AIHGSLSASVYHHLNVHATGSDTWIG------WKQLWDLNVAPKVKHFLWLVFKGRVSTF 1979
            S YH + T IG +W + PK++HFLW + G V
Sbjct: 793  KAGNYTVKSGYHTARLDLNEGTTLIGPDLTTLKAYIWKVQCPPKLRHFLWQILSGCVPVS 852

Query: 1980 EFLSSINLGPRNACVMCNLENESIEHLLYSCHKSKEIWETVGVKLAINVVFPDG--FCSG 2153
            E L + CV C ESI H L+ CH +++IW + A +FP F +
Sbjct: 853  ENLRKRGILCDKGCVSCGASEESINHTLFQCHPARQIWALSQIPTAPG-IFPSNSIFTNL 911

Query: 2154 SWLTGTHFTGFAKSIFASTAWFIWKNRCDIIFRNL-RAPCSVIV-----------SRAFA 2297
            L +G + + W+IWK R + +F N+ + P +++ ++
Sbjct: 912  DHLFWRIPSGVDSAPYPWIIWYIWKARNEKVFENVDKDPMEILLLAVKEAQSWQEAQVEL 971

Query: 2298 HSRGYSRGSLDRRGLRFVINNSPSFNGNLLFIS*FWSEATEVGGSGFICVSTNGSYVLAG 2477
            HS + S+D R ++ +F+G FI W + + G+G+ C+S+ G G
Sbjct: 972  HSERHGSLSIDSRIRVRDVSQDTTFSGFRCFIDGSWKASDQFSGTGWFCLSSLGESPTMG 1031

Query: 2478 AIPFRAANRAV-AELLAIYFALHALIDA 2558
            A R + + E+ A+ +A+ +I A
Sbjct: 1032 AANVRRSLSPLHTEMEALLWAMKCMIGA 1059

>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis
           thaliana (fragment)
          Length = 1365

 Score = 228 bits (581), Expect = 7e-57
 Identities = 172/663 (25%), Positives = 307/663 (46%), Gaps = 34/663 (5%)
 Frame = +3

Query: 585  MNFPSVWIAWIESCLRSTSFSFLINGQPSPWIKSCRGVRQGDPLSSSIFILVSQNLTSIL 764
            + F + WI+WI +C+ S S+S LINGQP I RG+RQGDPLS ++F+L ++ L IL
Sbjct: 595  LGFNNKWISWIMNCVTSVSYSVLINGQPYGHIIPTRGIRQGDPLSPALFVLCTEALIHIL 654

Query: 765  NYALAVGMIPGYD-SGLRSNFNHLMFADDLVIISNASRKAVRAIKLALDYYKSFSGQSLN 941
            N A G I G + + NHL+FADD +++ A+++ + L Y SGQ +N
Sbjct: 655  NKAEQAGKITGIQFQDKKVSVNHLLFADDTLLMCKATKQECEELMQCLSQYGQLSGQMIN 714

Query: 942  ASKSSIYFPTWANTIVAASISSILHIPRASFPFKYLGVYISTRRLAKGVFDPLVDSVRLR 1121
            +KS+I F + + I S I KYLG+ + +F + + ++ R
Sbjct: 715  LNKSAITFGKNVDIQIKDWIKSRSGISLEGGTGKYLGLPECLSGSKRDLFGFIKEKLQSR 774

Query: 1122 CARWTNSKLSPAAKAILINSTLLSLPIYYLSVYPIHDSVLAEINRIVRRFFWSKSSNGKG 1301
            W LS K +L+ S L+LP+Y +S + + ++ ++ ++ F+W+ +
Sbjct: 775  LTGWYAKTLSQGGKEVLLKSIALALPVYVMSCFKLPKNLCQKLTTVMMDFWWNSMQQKRK 834

Query: 1302 IHSVSWSDLINSKPEGGLAIRNLFLMKHALMAKNVMKYINSEDSIWVDILHLKY-GTLNF 1478
            IH +SW L K +GG ++L AL+AK + + + S++ + +Y +F
Sbjct: 835  IHWLSWQRLTLPKDQGGFGFKDLQCFNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDF 894

Query: 1479 WHNPIPAGCSWFFRGLCRSALKIIPDVRLLSLNPLNTSIWHHPWCSDIPFAWSPSCFN-- 1652
            + S+ +R + ++ +R + N T +W W D + N
Sbjct: 895  LSATRGSRPSYAWRSILFGRELLMQGLRTVIGNGQKTFVWTDKWLHD---GSNRRPLNRR 951

Query: 1653 --INLCSTLNFVSDLTGHNGWDLDNLGLLFGAHMDDYMLRLGFLVDNEPNH-WIWSHKAI 1823
            IN+ ++ + D T N W+L+ L LF + +L+ L E + W+ SH +
Sbjct: 952  RFINVDLKVSQLIDPTSRN-WNLNMLRDLFPWKDVEIILKQRPLFFKEDSFCWLHSHNGL 1010

Query: 1824 HG------SLSASVYHHLNVHATGSDTWIG-WKQLWDLNVAPKVKHFLWLVFKGRVSTFE 1982
            + LS V+H L A + + ++W+L+ APK++ FLW G + +
Sbjct: 1011 YSVKTGYEFLSKQVHHRLYQEAKVKPSVNSLFDKIWNLHTAPKIRIFLWKALHGAIPVED 1070

Query: 1983 FLSSINLGPRNACVMCNLENESIEHLLYSCHKSKEIWETVGVKLAINVVFPDGFCSGSWL 2162
            L + + + C+MC+ ENE+I H+L+ C ++++W + A GS
Sbjct: 1071 RLRTRGIRSDDGCLMCDTENETINHILFECPLARQVWAITHLSSA-----------GSEF 1119

Query: 2163 TGTHFTGFAKSI-------------FAS--TAWFIWKNRCDIIFRNLRAPCSVIVSRAF- 2294
            + + +T ++ I F S WF+WKNR ++F + + +V +A+
Sbjct: 1120 SNSVYTNMSRLIDLTQQNDLPHHLRFVSPWILWFLWKNRNALLFEGKGSITTTLVDKAYE 1179

Query: 2295 AHSRGYSRGS---LDRRGLRFVINNSPSFNGNL-LFIS*FWSEATEVGGSGFICVSTNGS 2462
            A+ +S + D + L+ + P G L I WS+ G+ ++ + G
Sbjct: 1180 AYHEWFSAQTHMQNDEKHLK-ITKWCPPLPGELKCNIGFAWSKQHHFSGASWVVRDSQGK 1238

Query: 2463 YVL 2471
            +L
Sbjct: 1239 VLL 1241

>dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 226 bits (575), Expect = 4e-56
 Identities = 171/684 (25%), Positives = 302/684 (44%), Gaps = 15/684 (2%)
 Frame = +3

Query: 591  FPSVWIAWIESCLRSTSFSFLINGQPSPWIKSCRGVRQGDPLSSSIFILVSQNLT-SILN 767
            FP I I L+ +S + L NG P K RG+RQGDPL+ +F LV + L I
Sbjct: 602  FPRRLIDLILFSLQESSLAILWNGGRLPPFKPGRGLRQGDPLAPYLFNLVMERLAHDIQT 661

Query: 768  YALAVGMIPGYDSGLRSNFNHLMFADDLVIISNASRKAVRAIKLALDYYKSFSGQSLNAS 947
            A P + + + +HL FADDL++ AS + + LD + + SG +N S
Sbjct: 662  RVNARTWKPVHITRGGTGISHLFFADDLMLFGEASEHQAQIMFDCLDSFSNASGLKVNFS 721

Query: 948  KSSIYFPTWANTIVAASISSILHIPRASFPFKYLGVYISTRRLAKGVFDPLVDSVRLRCA 1127
            KS ++ + N + +I SIL +P A YLG+ + R+++ F+ ++D +R + +
Sbjct: 722  KSLLFCSSNVNAGLKRAIGSILQVPVAESLGTYLGIPMLKERVSRNTFNAVIDKMRTKLS 781

Query: 1128 RWTNSKLSPAAKAILINSTLLSLPIYYLSVYPIHDSVLAEINRIVRRFFWSKSSNGKGIH 1307
            W S L+ A + +L+ ++L ++P Y + V + S EI++ R F W +N + +H
Sbjct: 782  SWKASSLNMAGRRVLVQASLATVPTYTMQVMALPVSTCNEIDKTCRNFLWGHDTNTRKLH 841

Query: 1308 SVSWSDLINSKPEGGLAIRNLFLMKHALMAKNVMKYINSEDSIWVDILHLKY-GTLNFWH 1484
            SV+W+++ + EGGL +R A + K + ++ D +WV +L KY +F H
Sbjct: 842  SVNWAEICKPRNEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLWVKVLREKYVKNADFLH 901

Query: 1485 NPIPAGCSWFFRGLCRSALKIIPDVRLLSLNPLNTSIWHHPWCSDIPFAWSPSCFNINLC 1664
            + CSW +R + + + ++ N + W+ W D P A + C N
Sbjct: 902  LQSQSNCSWGWRSIMKGKDVLAGAIKWNVGNGRKINFWNDWWVGDGPLASNTDCINQPHM 961

Query: 1665 STLNFVSDLTGHNGWDLDNLGLLFGAHMDDYMLRLGFLVDNEPNHWI-WSHKAIHGSLSA 1841
            + + +T WD L + +M D + +++E ++ W H G ++
Sbjct: 962  TDIKVEDLITSQRRWDTGALHNILPTNMIDMVRATPIAINSEQEDFLSWPHSTT-GMVTV 1020

Query: 1842 SVYHHLNVHATGSDTWIGWKQLWDLNVAPKVKHFLWLVFKGRVSTFEFLSSINLGPRNAC 2021
            S + L G D W +W K+K F+W + K + L +C
Sbjct: 1021 SSAYSLIAGHDGDDRSHDW--IWRATCTEKIKLFMWKIVKNGLMVNVERKRRGLADAASC 1078

Query: 2022 VMCNLENESIEHLLYSCHKSKEIWETVGVKLAINVVFPDGFCSGSWL----TGTHFTGFA 2189
            +C E+E+++HL C ++ W++ L + SW+ + G++
Sbjct: 1079 PVCGEEDETLDHLFRRCLLAEACWDSAVPPLTFQT--SNHLHMHSWMKAACSSQQKDGYS 1136

Query: 2190 KS---IFASTAWFIWKNRCDIIFRNLRAPCSVIVSRAFAHSRGYSRGSLDRRGLR----- 2345
            + IF W +WK R ++F N S I++R+F S R GL+
Sbjct: 1137 TNWSLIFPYILWNLWKARNRLVFDNNITAPSDILNRSFMESSEARCLLAKRTGLQTAFQT 1196

Query: 2346 FVINNSPSFNGNLLFIS*FWSEATEVGGSGFICVSTNGSYVLAGAIPFRAANRAVAELLA 2525
            +V+ + P+ L + + +G + + NG +V AN +AEL
Sbjct: 1197 WVVWSPPAAGFTKLNSDGACKSHSHLASAGGLLRNENGLWVAGYTCNIGTANSFLAELWG 1256

Query: 2526 IYFALHALIDAGTWFDQVFTTSDS 2597
            + L L+ F ++ +DS
Sbjct: 1257 LREGL--LLAKNRGFTKLIAETDS 1278

>dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 225 bits (573), Expect = 6e-56
 Identities = 171/684 (25%), Positives = 301/684 (44%), Gaps = 15/684 (2%)
 Frame = +3

Query: 591  FPSVWIAWIESCLRSTSFSFLINGQPSPWIKSCRGVRQGDPLSSSIFILVSQNLT-SILN 767
            FP I I L+ +S + L NG P K RG+RQGDPL+ +F LV + L I
Sbjct: 602  FPRRLIDLILFSLQESSLAILWNGGRLPPFKPGRGLRQGDPLAPYLFNLVMERLAHDIQT 661

Query: 768  YALAVGMIPGYDSGLRSNFNHLMFADDLVIISNASRKAVRAIKLALDYYKSFSGQSLNAS 947
            A P + + + +HL FADDL++ AS + + LD + + SG +N S
Sbjct: 662  RVNARTWKPVHITRGGTGISHLFFADDLMLFGEASEHQAQIMFDCLDSFSNASGLKVNFS 721

Query: 948  KSSIYFPTWANTIVAASISSILHIPRASFPFKYLGVYISTRRLAKGVFDPLVDSVRLRCA 1127
            KS ++ + N + +I SIL +P A YLG+ + R+++ F+ ++D +R + +
Sbjct: 722  KSLLFCSSNVNAGLKRAIGSILQVPVAESLGTYLGIPMLKERVSRNTFNAVIDKMRTKLS 781

Query: 1128 RWTNSKLSPAAKAILINSTLLSLPIYYLSVYPIHDSVLAEINRIVRRFFWSKSSNGKGIH 1307
            W S L+ A + +L+ ++L ++P Y + V + S EI++ R F W +N + +H
Sbjct: 782  SWKASSLNMAGRRVLVQASLATVPTYTMQVMALPVSTCNEIDKTCRNFLWGHDTNTRKLH 841

Query: 1308 SVSWSDLINSKPEGGLAIRNLFLMKHALMAKNVMKYINSEDSIWVDILHLKY-GTLNFWH 1484
            SV+W+++ + EGGL +R A + K + ++ D +WV +L KY +F H
Sbjct: 842  SVNWAEICKPRNEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLWVKVLREKYVKNADFLH 901

Query: 1485 NPIPAGCSWFFRGLCRSALKIIPDVRLLSLNPLNTSIWHHPWCSDIPFAWSPSCFNINLC 1664
            + CSW +R + + + ++ N + W+ W D P A + C N
Sbjct: 902  LQSQSNCSWGWRSIMKGKDVLAGAIKWNVGNGRKINFWNDWWVGDGPLASNTDCINQPHM 961

Query: 1665 STLNFVSDLTGHNGWDLDNLGLLFGAHMDDYMLRLGFLVDNEPNHWI-WSHKAIHGSLSA 1841
            + + +T WD L + +M D + +++E ++ W H G ++
Sbjct: 962  TDIKVEDLITSQRRWDTGALHNILPTNMIDMVRATPIAINSEQEDFLSWPHSTT-GMVTV 1020

Query: 1842 SVYHHLNVHATGSDTWIGWKQLWDLNVAPKVKHFLWLVFKGRVSTFEFLSSINLGPRNAC 2021
            S + L G D W +W K+K F+W + K + L +C
Sbjct: 1021 SSAYSLIAGHDGDDRSHDW--IWRATCTEKIKLFMWKIVKNGLMVNVERKRRGLADAASC 1078

Query: 2022 VMCNLENESIEHLLYSCHKSKEIWETVGVKLAINVVFPDGFCSGSWL----TGTHFTGFA 2189
            +C E+E+++HL C ++ W++ L + SW+ + G+
Sbjct: 1079 PVCGEEDETLDHLFRRCLLAEACWDSAVPPLTFQT--SNHLHMHSWMKAACSSQQKDGYG 1136

Query: 2190 KS---IFASTAWFIWKNRCDIIFRNLRAPCSVIVSRAFAHSRGYSRGSLDRRGLR----- 2345
            + IF W +WK R ++F N S I++R+F S R GL+
Sbjct: 1137 TNWSLIFPYILWNLWKARNRLVFDNNITAPSDILNRSFMESSEARCLLAKRTGLQTAFQT 1196

Query: 2346 FVINNSPSFNGNLLFIS*FWSEATEVGGSGFICVSTNGSYVLAGAIPFRAANRAVAELLA 2525
            +V+ + P+ L + + +G + + NG +V AN +AEL
Sbjct: 1197 WVVWSPPAAGFTKLNSDGACKSHSHLASAGGLLRNENGLWVAGYICNIGTANSFLAELWG 1256

Query: 2526 IYFALHALIDAGTWFDQVFTTSDS 2597
            + L L+ F ++ +DS
Sbjct: 1257 LREGL--LLAKNRGFTKLIAETDS 1278

>emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1380

 Score = 224 bits (572), Expect = 8e-56
 Identities = 179/606 (29%), Positives = 282/606 (46%), Gaps = 42/606 (6%)
 Frame = +3

Query: 582  KMNFPSVWIAWIESCLRSTSFSFLINGQPSPWIKSCRGVRQGDPLSSSIFILVSQNLTSI 761
            KMNFP W WI++C+ + S S LING PSP K +G+RQGDPLS +F+LV + L +
Sbjct: 592  KMNFPIQWRQWIQTCVTTASSSVLINGSPSPPFKLQKGLRQGDPLSPFLFVLVVETLNLL 651

Query: 762  LNYALAVGMIPGYD---SGLRSNFNHLMFADDLVIISNASRKAVRAIKLALDYYKSFSGQ 932
            +N A+++G G + GL+ +HL +ADD +I ++ IK L + SG
Sbjct: 652  INKAISLGFWEGVEVSKGGLK--LSHLQYADDTLIFCAPRIDYLQNIKKVLILFHLASGL 709

Query: 933  SLNASKSSIYFPTWANTIVAASISSILHIPRASFPFKYLGVYISTRRLAKGVFDPLVDSV 1112
            +N KSS+ +N + + +S+L S PF YLG+ I ++P+++ +
Sbjct: 710  QINFHKSSLIGINVSNQWMKDATASLL-CKGGSLPFNYLGLPIGGDSSRIKTWEPILERI 768

Query: 1113 RLRCARWTNSKLSPAAKAILINSTLLSLPIYYLSVYPIHDSVLAEINRIVRRFFWSKSSN 1292
            + W LS + LI S++ SLP+Y++S++PI SV+ +IN++ R F WS
Sbjct: 769  SKKLDSWKGRLLSIGGRVTLIKSSISSLPLYFMSLFPIPRSVIEQINKLQRHFLWSGDRG 828

Query: 1293 GKGIHSVSWSDLINSKPEGGLAIRNLFLMKHALMAKNVMKYINSEDSIWVDILHLKYG-- 1466
            + + V+W + K GGL I N+F AL+ K + K+ N +W +++ KY
Sbjct: 829  KRALSQVAWKVIELPKAFGGLGIGNIFHRNLALLFKWIWKFFNDTSPLWRELIWHKYKYK 888

Query: 1467 ---TLNFWHNPIPAGCSWFFRGLCRSALKIIPDVRLLSLNPLN--------TSIWHHPWC 1613
            T+ P G W SA+ P + +++N + T WH W
Sbjct: 889  QPLTIRDLDPPRQGG-PW---QKIVSAIIKSPTAKAIAINGVRSLVGDGALTLFWHDQWL 944

Query: 1614 SDIPF-AWSPSCFNINLCSTLNFVSDLTGHNGWDLDNLGLLFGAHMD------------- 1751
            P A P + + N ++ + H WD GL +
Sbjct: 945  GPKPLKAQFPRLYLL----ATNKMAPVASHCFWD----GLAWAWSFSWARHHRARDLDEK 996

Query: 1752 DYMLRLGFLVDNEPNH---WIWS-HKAIHGSLSASVYHHLNVHATGSDTWIGWKQLWDLN 1919
            + +L L +V +P++ +WS HK+ GS S S + A K +W
Sbjct: 997  EKLLELLDMVHLDPSNQDSLVWSYHKS--GSFSTSSFTAEMAKANLPPHTDAIKGVWVGL 1054

Query: 1920 VAPKVKHFLWLVFKGRVSTFEFLSSINLGPR--NACVMCNLENESIEHLLYSCHKSKEIW 2093
            V +V+ F+W+ GR++T L+SI + P+ N CV+CN E HLL C S +W
Sbjct: 1055 VPHRVEIFVWMALLGRINTRCKLASIGIIPQSENICVLCNTSPEQHNHLLLHCPFSLSLW 1114

Query: 2094 ETVGVKLAINVVFPDGF--CSGSWLTGTHFTGFAKSIFAST----AWFIWKNRCDIIFRN 2255
            + V P+ WL+ T F K ++A+T +W IWK R IF N
Sbjct: 1115 NWWLDLWRLKWVLPETLRGLFDQWLSPIK-TPFFKKVWAATFFIISWSIWKERNSRIFEN 1173

Query: 2256 LRAPCS 2273
            +P S
Sbjct: 1174 TSSPPS 1179