BLASTX nr result
ID: Angelica23_contig00022662
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00022662
         (2669 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               319   e-102
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   319   e-100
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   322   e-99
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               323   2e-95
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   294   3e-94

>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score = 319 bits (817), Expect(2) = e-102
 Identities = 168/397 (42%), Positives = 235/397 (59%)
 Frame = +1

Query: 544  MSHFRPISCCNVIYKCISKMLANRLKLVLPDLISPFQSVFVPNRSIGDNILLSQALCRDY 723
            M +RPISCCNV+YK ISK++ANRLKL+LP I+ QS FV +R + +N+LL+ L +DY
Sbjct: 173  MRDYRPISCCNVLYKVISKIIANRLKLLLPRFIAENQSAFVKDRLLIENLLLATELVKDY 232

Query: 724  HLNDVQPRCAIKLDIHKGFDSLNWSFLFETLRRMGFPRAFTDWIKKCITSSMYSVKVNGV 903
            H + + RCAIK+DI K FDS+ WSFL TL M F F WI CIT++ +SV+VNG
Sbjct: 233  HKDSISARCAIKIDISKAFDSVQWSFLTNTLVAMNFSPTFIHWINLCITTASFSVQVNGD 292

Query: 904  LEGYFKGKSGLRQGDPLSPYLFVVAMEVLTACLNKYVVENPNFQFHWRTKEVSLHHLIFA 1083
            L GYF+ K GLRQG LSPYLFV+ M+VL+ L+K F FH + + + L HL FA
Sbjct: 293  LVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDK-AAGVRKFGFHPKCQRLGLTHLSFA 351

Query: 1084 DDVFLFCHGDERSVATLMKGVNLFSGMSGLTPNCSKXXXXXXXXXXXXXXXXXATTGFQV 1263
            DD+ + G RS+ +++ + F SGL + K A F V
Sbjct: 352  DDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAKFLFDV 411

Query: 1264 GSLPISYLGLPLITGKLHKRDCTPLVNKFCGRVELWTSRFLNFGGRLQLVKTILSGIVGY 1443
            G LP+ YLGLPL+T +L D +PL+ + R+ WT RF +F GR L+K++L I +
Sbjct: 412  GQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSICNF 471

Query: 1444 WSLYLFLPKSVLKALISIMFKFLWGGFYKPTGKCHYKVSWVEC*KPKSEGGLGLKNIIEW 1623
            W LP+ ++ + + FLW G + K K+SW KPK+EGGLGL+N+ E
Sbjct: 472  WLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKA--KISWDIVCKPKAEGGLGLRNLKEA 529

Query: 1624 NFAAIISQLWRIIQPGVRSIWVCWVRSVLLKRKAFWT 1734
            N + + +WRII S+W WV L+++K+ W+
Sbjct: 530  NDVSCLKLVWRIIS-NSNSLWTKWVAEYLIRKKSIWS 565

 Score = 82.8 bits (203), Expect(2) = e-102
 Identities = 64/252 (25%), Positives = 99/252 (39%), Gaps = 16/252 (6%)
 Frame = +2

Query: 1757 SWAVRKIFKARNRVHHLINYHPGPSSRFLFWHDPWLRKKPILSQFXXXXXXXXXXXXMAV 1936
            SW RKI K R+ G FW+D W ++ A
Sbjct: 574  SWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREAS 633

Query: 1937 VAEF-----QRNDSWALPRDLIELRSLASMVQFSLEDHVTWVG----LKPKLVTTSVIWQ 2089
            VA+ +R +L ++ E+ + + ED V W G KP T W
Sbjct: 634  VADAWTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDT-WH 692

Query: 2090 STRSTATPALWSGLAWDCFSIPKCSFILWLALKNRLLTKDRMILFGM--AVDPTCVLCGY 2263
            ++T++ W W + PK + WLA+ NRL T DRM+ + +V CVLC
Sbjct: 693  LIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCTN 752

Query: 2264 NAESVRHLFSDCPYFDRIRSASRIGLHRD-----WTLFLNGEFCAGDVKGVKRKVAGLFV 2428
            N++++ HLF C Y + +A G+ + W+ L V+ +
Sbjct: 753  NSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLT-HISTHFQDRVEGFLTRYIF 811

Query: 2429 AIAVYHTWKERN 2464
            +YH W+ERN
Sbjct: 812  QATIYHVWRERN 823

>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 1352

 Score = 319 bits (817), Expect(2) = e-100
 Identities = 162/397 (40%), Positives = 238/397 (59%)
 Frame = +1

Query: 544  MSHFRPISCCNVIYKCISKMLANRLKLVLPDLISPFQSVFVPNRSIGDNILLSQALCRDY 723
            M +RPISCCNV+YK +SK++ANRLK +LP I+P QS F+ +R + +N+LL+ L +DY
Sbjct: 673  MKDYRPISCCNVLYKIVSKLMANRLKEILPASIAPNQSAFIKDRLMMENLLLASELVKDY 732

Query: 724  HLNDVQPRCAIKLDIHKGFDSLNWSFLFETLRRMGFPRAFTDWIKKCITSSMYSVKVNGV 903
            H + R A+K+DI K FD + W FL L+ + P F WI+ CI ++ +SV+VNG
Sbjct: 733  HKESISSRSALKIDISKAFDFVQWPFLINVLKAIHLPEMFIHWIELCIGTASFSVQVNGE 792

Query: 904  LEGYFKGKSGLRQGDPLSPYLFVVAMEVLTACLNKYVVENPNFQFHWRTKEVSLHHLIFA 1083
            L G+F+ + GLRQG LSPYL+V+ M VL+ L+K VE +H R + ++L HL FA
Sbjct: 793  LSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEK-KISYHPRCRNMNLTHLCFA 851

Query: 1084 DDVFLFCHGDERSVATLMKGVNLFSGMSGLTPNCSKXXXXXXXXXXXXXXXXXATTGFQV 1263
            DD+ +F G +S+ + F+ MS L + K F++
Sbjct: 852  DDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFPFEL 911

Query: 1264 GSLPISYLGLPLITGKLHKRDCTPLVNKFCGRVELWTSRFLNFGGRLQLVKTILSGIVGY 1443
            G+LP+ YLGLPL+T ++ + D PLV K R+ WT+RFL+F GRLQL+K++LS I +
Sbjct: 912  GTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNF 971

Query: 1444 WSLYLFLPKSVLKALISIMFKFLWGGFYKPTGKCHYKVSWVEC*KPKSEGGLGLKNIIEW 1623
            W LPK+ L+ + + FLW G T K K++W E K K EGGLGLK + E
Sbjct: 972  WLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKA--KIAWSEVCKLKEEGGLGLKPLKEA 1029

Query: 1624 NFAAIISQLWRIIQPGVRSIWVCWVRSVLLKRKAFWT 1734
            N +++ +WRI+ S+WV WV L++++ FW+
Sbjct: 1030 NEVSLLKLIWRILS-ARDSLWVKWVNKHLIRKETFWS 1065

 Score = 77.0 bits (188), Expect(2) = e-100
 Identities = 66/252 (26%), Positives = 94/252 (37%), Gaps = 16/252 (6%)
 Frame = +2

Query: 1757 SWAVRKIFKARN--RVHHLINYHPGPSSRFLFWHDPWLRKKPILSQFXXXXXXXXXXXXM 1930
            SW RKI K R+ R+ H + G + F WHD W +
Sbjct: 1074 SWLWRKILKQRDKARLFHRMEVRSGTFTSF--WHDHWCPLGRLHQHMGSRGTIDLGIPNN 1131

Query: 1931 AVVAEFQRNDSWALPRDLIELRSLASMVQFSLEDHVT------WVGLKPKL---VTTSVI 2083
            A VAE R L + S ++ + +D T W + ++S
Sbjct: 1132 ATVAEVMNTHRRKRHRADF-LNQIKSQIELARQDRSTDGDRSLWKQKEDTFKSSFSSSKT 1190

Query: 2084 WQSTRSTATPALWSGLAWDCFSIPKCSFILWLALKNRLLTKDRMILFGMAVDPTCVLCGY 2263
            WQ RS + W W S PK SF+ WLA NRL T D++ + CV CG
Sbjct: 1191 WQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGE 1250

Query: 2264 NAESVRHLFSDCPYFDRIRSASRIGLHRDWTLFLNGEFCAGDVKGVKRKVAGLFV----- 2428
            E+ HLF CPY + + GL + LN + R +F
Sbjct: 1251 ELETRDHLFFSCPYSSHVWFSLTKGLLNGRNI-LNWNLITPHLLDSSRPYLHVFTLRYAF 1309

Query: 2429 AIAVYHTWKERN 2464
            +++ W+ERN
Sbjct: 1310 QASIHSLWRERN 1321

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1223

 Score = 322 bits (824), Expect(2) = e-99
 Identities = 169/396 (42%), Positives = 233/396 (58%)
 Frame = +1

Query: 544  MSHFRPISCCNVIYKCISKMLANRLKLVLPDLISPFQSVFVPNRSIGDNILLSQALCRDY 723
            M +RPISCCNV+YK ISK++ANRLKLVLP I+ QS FV +R + +N+LL+ L +DY
Sbjct: 526  MKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDY 585

Query: 724  HLNDVQPRCAIKLDIHKGFDSLNWSFLFETLRRMGFPRAFTDWIKKCITSSMYSVKVNGV 903
            H + + RCAIK+DI K FDS+ W FL +GFPR F WI CIT++ +SV+VNG
Sbjct: 586  HKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGE 645

Query: 904  LEGYFKGKSGLRQGDPLSPYLFVVAMEVLTACLNKYVVENPNFQFHWRTKEVSLHHLIFA 1083
            L GYF+ GLRQG LSPYLFV+ M+VL+ L+K +F +H + K + L HL FA
Sbjct: 646  LAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAAR-HFGYHPKCKTMGLTHLSFA 704

Query: 1084 DDVFLFCHGDERSVATLMKGVNLFSGMSGLTPNCSKXXXXXXXXXXXXXXXXXATTGFQV 1263
            DD+ + G RS+ ++K + F+ SGL + K F
Sbjct: 705  DDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSS 764

Query: 1264 GSLPISYLGLPLITGKLHKRDCTPLVNKFCGRVELWTSRFLNFGGRLQLVKTILSGIVGY 1443
            G LP+ YLGLPLIT +L DC PL+ + R+ WTSRFL++ GRL L+ ++L I +
Sbjct: 765  GQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNF 824

Query: 1444 WSLYLFLPKSVLKALISIMFKFLWGGFYKPTGKCHYKVSWVEC*KPKSEGGLGLKNIIEW 1623
            W LP+ ++ L + FLW G + K K+SW KPK EGGLGL+++ E
Sbjct: 825  WLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKA--KISWHMVCKPKDEGGLGLRSLKEA 882

Query: 1624 NFAAIISQLWRIIQPGVRSIWVCWVRSVLLKRKAFW 1731
            N + +W+I+ S+WV WV LL+ +FW
Sbjct: 883  NDVCCLKLVWKIVSHS-NSLWVKWVDQHLLRNASFW 917

 Score = 71.2 bits (173), Expect(2) = e-99
 Identities = 54/194 (27%), Positives = 75/194 (38%), Gaps = 11/194 (5%)
 Frame = +2

Query: 1757 SWAVRKIFKARNRVHHLINYHPGPSSRFLFWHDPWLRKKPILSQFXXXXXXXXXXXXMAV 1936
            SW +K+ K R L G + FW+D W +L +
Sbjct: 927  SWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMT 986

Query: 1937 VAEF--------QRNDSWALPRDLIELRSLASMVQFSLEDHVTWVGLKPKLVTTSVI--- 2083
            V E RND + + D ++ +S + + ED V W G TT
Sbjct: 987  VEEAWTNRRQRRHRNDVYNVIEDALK-KSWDTRTE--TEDKVLWRGKSDVFRTTFSTRDT 1043

Query: 2084 WQSTRSTATPALWSGLAWDCFSIPKCSFILWLALKNRLLTKDRMILFGMAVDPTCVLCGY 2263
            W TRST+ W + W + PK SF WLA RL T DRMI + + C+ C
Sbjct: 1044 WHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQG 1103

Query: 2264 NAESVRHLFSDCPY 2305
            E+ HLF C +
Sbjct: 1104 TLETRDHLFFTCSF 1117

>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score = 323 bits (829), Expect(2) = 2e-95
 Identities = 174/396 (43%), Positives = 237/396 (59%)
 Frame = +1

Query: 544  MSHFRPISCCNVIYKCISKMLANRLKLVLPDLISPFQSVFVPNRSIGDNILLSQALCRDY 723
            M +RPISCCNV+YK ISK+LANRLKL+LP I+ QS FV +R + +N+LL+ L +DY
Sbjct: 79   MKDYRPISCCNVMYKVISKILANRLKLLLPQFIAGNQSSFVKDRLLIENVLLATDLVKDY 138

Query: 724  HLNDVQPRCAIKLDIHKGFDSLNWSFLFETLRRMGFPRAFTDWIKKCITSSMYSVKVNGV 903
            H + + RCAIK+DI K DS+ WSFL TL M FP F WI+ CIT+ +SV+VNG
Sbjct: 139  HKDSISERCAIKIDISKASDSVQWSFLINTLTAMHFPEMFIHWIRLCITTPSFSVQVNGE 198

Query: 904  LEGYFKGKSGLRQGDPLSPYLFVVAMEVLTACLNKYVVENPNFQFHWRTKEVSLHHLIFA 1083
            L G+F+ GLRQG LSPYLFV+ M+VL+ L+K VV +H K + L HL FA
Sbjct: 199  LAGFFQSSRGLRQGCALSPYLFVICMDVLSKLLDK-VVGIGRIGYHPHCKRMGLTHLSFA 257

Query: 1084 DDVFLFCHGDERSVATLMKGVNLFSGMSGLTPNCSKXXXXXXXXXXXXXXXXXATTGFQV 1263
            DD+ + G RS+ +++ +LFS SGL + K F+V
Sbjct: 258  DDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSSTSRAQLHTHFPFEV 317

Query: 1264 GSLPISYLGLPLITGKLHKRDCTPLVNKFCGRVELWTSRFLNFGGRLQLVKTILSGIVGY 1443
            G LPI YLGLPL+T +L D PL+ + R+ W+SRFL+F GR L+ +I+ +
Sbjct: 318  GELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNF 377

Query: 1444 WSLYLFLPKSVLKALISIMFKFLWGGFYKPTGKCHYKVSWVEC*KPKSEGGLGLKNIIEW 1623
            W LP++ ++ + + FLW G + K K+SW + KPKSEGGLGL+++ E
Sbjct: 378  WLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKA--KISWNQVCKPKSEGGLGLRSLKEA 435

Query: 1624 NFAAIISQLWRIIQPGVRSIWVCWVRSVLLKRKAFW 1731
            N + +WRII G S+WV WV LLKR+ FW
Sbjct: 436  NDVCCLKLVWRIISHG-DSLWVKWVEHNLLKREIFW 470

 Score = 55.1 bits (131), Expect(2) = 2e-95
 Identities = 47/194 (24%), Positives = 65/194 (33%), Gaps = 16/194 (8%)
 Frame = +2

Query: 1757 SWAVRKIFKARNRVHHLINYHPGPSSRFLFWHDPWLRKKPILSQFXXXXXXXXXXXXMAV 1936
            SW +KI K R G FW D W ++ M +
Sbjct: 480  SWIWKKILKYRGVAKRFCKAEVGNGESTSFWFDDWSLLGRLID-----VAGIRGTIDMGI 534

Query: 1937 VAEFQRNDSWALPRDLIELRSLASMVQFSL------------EDHVTWVG----LKPKLV 2068
            D+W R + + + ++ L + V W G K K
Sbjct: 535  SRTMSVADAWTSRRRRHHRQEILNTIEEVLSTQHQKRTQQQQQGRVLWKGKNDIYKDKFS 594

Query: 2069 TTSVIWQSTRSTATPALWSGLAWDCFSIPKCSFILWLALKNRLLTKDRMILFGMAVDPTC 2248
            T + W R+T+ W W + PK SF LWLA +RL T RMI + C
Sbjct: 595  TKNT-WNYLRTTSNEVAWHKGVWFPHATPKYSFCLWLAAHDRLATGARMIKWNRGETGDC 653

Query: 2249 VLCGYNAESVRHLF 2290
            C E+ HLF
Sbjct: 654  TFCRQGIETRDHLF 667

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score = 294 bits (752), Expect(2) = 3e-94
 Identities = 159/399 (39%), Positives = 226/399 (56%), Gaps = 1/399 (0%)
 Frame = +1

Query: 544  MSHFRPISCCNVIYKCISKMLANRLKLVLPDLISPFQSVFVPNRSIGDNILLSQALCRDY 723
            ++ FRPISCCN IYK ISK+LA RL+ +LP ISP QS FV R + +N+LL+ L + +
Sbjct: 519  ITEFRPISCCNAIYKVISKLLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGF 578

Query: 724  HLNDVQPRCAIKLDIHKGFDSLNWSFLFETLRRMGFPRAFTDWIKKCITSSMYSVKVNGV 903
            ++ R +K+D+ K FDS+ W F+ ETL+ P F +WIK+CITS+ +S+ V+G
Sbjct: 579  GQANISSRGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGS 638

Query: 904  LEGYFKGKSGLRQGDPLSPYLFVVAMEVLTACL-NKYVVENPNFQFHWRTKEVSLHHLIF 1080
            L GYFKG GLRQGDPLSP LFV+AME+L+ L NK+ + + +H + EV + L F
Sbjct: 639  LCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKF--SDGSIGYHPKASEVRISSLAF 696

Query: 1081 ADDVFLFCHGDERSVATLMKGVNLFSGMSGLTPNCSKXXXXXXXXXXXXXXXXXATTGFQ 1260
            ADD+ +F G S+ + + F +SGL N K A GF
Sbjct: 697  ADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTLA-FGFV 755

Query: 1261 VGSLPISYLGLPLITGKLHKRDCTPLVNKFCGRVELWTSRFLNFGGRLQLVKTILSGIVG 1440
            G+ P YLGLPL+ KL + D + L++K R W ++ L+F GRLQL+ +++ V
Sbjct: 756  NGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVN 815

Query: 1441 YWSLYLFLPKSVLKALISIMFKFLWGGFYKPTGKCHYKVSWVEC*KPKSEGGLGLKNIIE 1620
            +W LPK LK + + +FLWG T + KVSW PK+EGGLGL+N
Sbjct: 816  FWLSSFILPKCCLKTIEQMCNRFLWGN--DITRRGDIKVSWQNSCLPKAEGGLGLRNFWT 873

Query: 1621 WNFAAIISQLWRIIQPGVRSIWVCWVRSVLLKRKAFWTS 1737
            WN + +W + S+WV W + L+ FW +
Sbjct: 874  WNKTLNLRLIWMLFARR-DSLWVAWNHANRLRHVNFWNA 911

 Score = 80.5 bits (197), Expect(2) = 3e-94
 Identities = 65/254 (25%), Positives = 94/254 (37%), Gaps = 18/254 (7%)
 Frame = +2

Query: 1757 SWAVRKIFKARNRVHHLINYHPGPSSRFLFWHDPWLRKKPILSQFXXXXXXXXXXXXMAV 1936
            SW + I R + G +W+D W P++ AV
Sbjct: 918  SWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAV 977

Query: 1937 VAEFQRNDSWALPRDLIELRSLASMVQFSL----------EDHVTWV--GLKPKLVTTSV 2080
            V E + W LP SLA++ L ED TW G ++ +
Sbjct: 978  VTEASSSTGWILPSARTRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKL 1037

Query: 2081 IWQSTRSTATPALWSGLAWDCFSIPKCSFILWLALKNRLLTKDRMILFGMAVDPTCVLCG 2260
            W+ R T LW+ W IPK +F W+A NRL + R + C +C
Sbjct: 1038 TWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLCCVCQ 1097

Query: 2261 YNAESVRHLFSDCPYFDRI--RSASRIG---LHRDWTLFLNGEFC-AGDVKGVKRKVAGL 2422
            E+ HLF C I + +R G + R+W + G G +K+A
Sbjct: 1098 RETETRDHLFIHCTLGSLIWQQVLARFGRSQMFREWKDIIEWMLSNQGSFSGTLKKLA-- 1155

Query: 2423 FVAIAVYHTWKERN 2464
            V A++H WKERN
Sbjct: 1156 -VQTAIFHIWKERN 1168
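
The per-HSP statistics in this report (bit score, E-value, identities, positives, gaps, frame) can be pulled out with a small script. What follows is a minimal sketch and not part of the BLAST output: the names HSP_HEADER, parse_evalue and parse_hsps are hypothetical helpers, and the regular expression only assumes the legacy 2.2.x text layout seen above. Note that this BLAST version prints 1e-102 as "e-102", which float() cannot read directly, so the sketch adds the missing leading "1".

```python
import re

# Hypothetical pattern for the legacy BLAST 2.2.x HSP header seen in this report.
# Whitespace is matched loosely so it works on wrapped or flattened text alike.
HSP_HEADER = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect(?:\(\d+\))?\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_evalue(text):
    """Convert a legacy BLAST E-value string to float ("e-102" means 1e-102)."""
    return float("1" + text) if text.startswith("e") else float(text)


def parse_hsps(report):
    """Return one dict per HSP header found in the report text."""
    hsps = []
    for m in HSP_HEADER.finditer(report):
        ident, length = int(m["ident"]), int(m["length"])
        hsps.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": parse_evalue(m["evalue"]),
            "identities": ident,
            "align_length": length,
            "identity_pct": 100.0 * ident / length,
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"]) if m["gaps"] else 0,
            "frame": int(m["frame"]),
        })
    return hsps


# The two HSP headers of the first hit (gb|AAF98181.1), copied from the report.
sample = (
    "Score = 319 bits (817), Expect(2) = e-102 "
    "Identities = 168/397 (42%), Positives = 235/397 (59%) Frame = +1 "
    "Score = 82.8 bits (203), Expect(2) = e-102 "
    "Identities = 64/252 (25%), Positives = 99/252 (39%), "
    "Gaps = 16/252 (6%) Frame = +2"
)

for hsp in parse_hsps(sample):
    print(hsp["frame"], hsp["bits"], hsp["evalue"], round(hsp["identity_pct"], 1))
```

For production use, an established parser such as Biopython is likely a safer choice than a hand-written regular expression; the sketch is only meant to make the field layout explicit.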