BLASTX nr result
ID: Angelica23_contig00009561
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00009561
         (1334 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...   155   3e-61
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   141   6e-60
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       151   1e-59
gb|AAD12028.1| putative non-LTR retroelement reverse transcripta...   156   3e-55
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   146   4e-55

>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase
           [Arabidopsis thaliana]
          Length = 1449

 Score = 155 bits (392), Expect(2) = 3e-61
 Identities = 98/272 (36%), Positives = 147/272 (54%), Gaps = 13/272 (4%)
 Frame = -3

Query: 777 YLLTRKLKKVKEALKGL-NSRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEKRLIE 601
           + T+KLK +K L+GL ++GNL V +REA Y +L + S Q N +R +E
Sbjct: 700 FRFTKKLKALKPKLRGLAKEKMGNL---VKRTREA---YLSLCQAQQSNSQ-NPSQRAME 752

Query: 600 VYKDAF-------NMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANG 442
           + +A+ ++EE +LKQ S+++WLK GD NNK F + R N I + +G
Sbjct: 753 IESEAYVRWDRIASIEEKYLKQVSKLHWLKVGDKNNKTFHRAATARAAQNSIREIQKEDG 812

Query: 441 DTQSTHEDMARVATSYYQSILGNSAEVCDFPEDLKLPSITSHQAS-----VLTNEFTSDD 277
           T +T +D+ ++Q L + KL S+ + S +LT ++ +
Sbjct: 813 STATTKDDIKNETERFFQEFLQLIPNDYEGITVEKLTSLLPYHCSPAEKDMLTASVSAKE 872

Query: 276 ILKTLKSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLV 97
           I L SM ++SPGPDG+ EFY W IIG + FFE LP+ VN + L+
Sbjct: 873 IRGALFSMPNDKSPGPDGYTSEFYKRAWDIIGAEFVLAVKSFFEKGFLPKGVNTTILALI 932

Query: 96  PKQQNASEMKHFRPISCCNVLYKCIAKMLASR 1
           PK+ A EMK +RPISCCNV+YK I+K++A+R
Sbjct: 933 PKKLEAKEMKDYRPISCCNVIYKVISKIIANR 964

 Score = 107 bits (268), Expect(2) = 3e-61
 Identities = 66/183 (36%), Positives = 93/183 (50%), Gaps = 7/183 (3%)
 Frame = -1

Query: 1331 VSCHATLLSYNKSFFIMFIYGSITVVERRALWEDLSAF--SSTILDSDWTIYGDFNTCLS 1158
            ++C L S + FF F+Y S ER+ LW DL S I D W I+GDFN L
Sbjct: 504  ITCSVKLESQEEEFFYSFVYASNFAEERKILWNDLRDHMDSPIIRDKPWIIFGDFNEILD 563

Query: 1157 IDE--KQGGNVLWTLGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLIN 984
            +DE + + T GM +F+ + SDL S TW + +P +KKLDRV++N
Sbjct: 564  MDEHSRMEDHPAVTSGMRDFQSLVNYCSFSDLASHGPLFTWCNKRDNDPIWKKLDRVMVN 623

Query: 983  PSWHSVFPNASTVFMARGLSDHCPTATSLGMLN--QIR-NKPFQFFSHLIQDLDFISKVM 813
            +W V+P + VF A G SDH +L M + Q+R NKPF+F + + +F V
Sbjct: 624  EAWKMVYPQSYNVFEAGGCSDHLRCRINLNMNSGAQVRGNKPFKFVNAVADMEEFKPLVE 683

Query: 812  EAW 804
            W
Sbjct: 684  NFW 686

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase
           [Brassica napus]
          Length = 1214

 Score = 141 bits (355), Expect(2) = 6e-60
 Identities = 86/269 (31%), Positives = 135/269 (50%), Gaps = 6/269 (2%)
 Frame = -3

Query: 789 GDPWYLLTRKLKKVKEALKGLN-SRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEK 613
           G + L++K K +K ++ N L V ++ + L QN L ++PS EK
Sbjct: 274 GSAMFTLSKKSKFLKGTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAPSSYLAGLEK 333

Query: 612 RLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQ 433
           + + EE FL QKSRV WLK GD N +F R N+I L+D G
Sbjct: 334 EAHRSWAELALAEERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRI 393

Query: 432 STHEDMARVATSYYQSILGNSAEVCDFPEDLKLPSITSHQAS-----VLTNEFTSDDILK 268
           +++ +++ + G+S+ + ++ S+T + +L E + DI
Sbjct: 394 ENTDELQTHCVDFFKELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKS 453

Query: 267 TLKSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLVPKQ 88
           ++ N+SPGPDG+ EF+ TW I+GP + + + FF S L N+ A+ +VPK+
Sbjct: 454 EFFALPSNKSPGPDGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKK 513

Query: 87  QNASEMKHFRPISCCNVLYKCIAKMLASR 1
           NA + FRPISCCN +YK I+K+LA R
Sbjct: 514 PNADRITEFRPISCCNAIYKVISKLLARR 542

 Score = 117 bits (294), Expect(2) = 6e-60
 Identities = 65/179 (36%), Positives = 92/179 (51%), Gaps = 3/179 (1%)
 Frame = -1

Query: 1331 VSCHATLLSYNKSFFIMFIYGSITVVERRALWEDLS--AFSSTILDSDWTIYGDFNTCLS 1158
            +SC L + F + F+Y RR LW +L A + T D W I GDFN L
Sbjct: 90   ISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWSELELLAANQTTSDKPWIILGDFNQSLD 149

Query: 1157 -IDEKQGGNVLWTLGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINP 981
            +D GG+ + T GM EF++ L +SDL + TWW+N NP KK+DR+L+N
Sbjct: 150  PVDASTGGSRI-TRGMEEFRECLLTSNISDLPFRGNHYTWWNNQENNPIAKKIDRILVND 208

Query: 980  SWHSVFPNASTVFMARGLSDHCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 804
            SW P + F A SDHCP+ ++ + RNKPF+ + L+ +FI K+ W
Sbjct: 209  SWLIASPLSYGSFCAMEFSDHCPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTW 267

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 151 bits (382), Expect(2) = 1e-59
 Identities = 91/269 (33%), Positives = 140/269 (52%), Gaps = 6/269 (2%)
 Frame = -3

Query: 789 GDPWYLLTRKLKKVKEALKGLNS-RVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEK 613
           G + +++KLK +K+ +K + L E+ + L+ Q+ + P+P + E
Sbjct: 276 GSSMFRVSKKLKALKKPIKDFSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASFEL 335

Query: 612 RLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQ 433
           + EE F +QKSR++W GDGN KYF R +SN I++L D NG
Sbjct: 336 EAERKWHILTAAEESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLV 395

Query: 432 STHEDMARVATSYYQSILGNSAEVCDFPEDLKLPSITSHQASV-----LTNEFTSDDILK 268
           + E + + SY+ S+LG+ + E + + S++ S L + F+++DI
Sbjct: 396 DSQEGILDLCASYFGSLLGDEVDPY-LMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRA 454

Query: 267 TLKSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLVPKQ 88
           L S+ +N+S GPDGF EF++ +W I+G VT I FF S L + NA I L+PK
Sbjct: 455 ALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKI 514

Query: 87  QNASEMKHFRPISCCNVLYKCIAKMLASR 1
           N + FRPISC N LYK IA++L R
Sbjct: 515 VNPTCTSDFRPISCLNTLYKVIARLLTDR 543

 Score = 106 bits (265), Expect(2) = 1e-59
 Identities = 56/179 (31%), Positives = 93/179 (51%), Gaps = 3/179 (1%)
 Frame = -1

Query: 1331 VSCHATLLSYNKSFFIMFIYGSITVVERRALWEDL--SAFSSTILDSDWTIYGDFNTCLS 1158
            ++C L + +Y + V R+ LW ++ S I D W + GDFN L+
Sbjct: 91   ITCEVLLPGSPSWIIVSVVYAANEVASRKELWIEIVNMVVSGIIGDRPWLVLGDFNQVLN 150

Query: 1157 IDEKQGGNVLWT-LGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINP 981
            E L + M +F+D L LSDLR + TWW+ + P KK+DR+L+N
Sbjct: 151  PQEHSNPVSLNVDINMRDFRDCLLAAELSDLRYKGNTFTWWNKSHTTPVAKKIDRILVND 210

Query: 980  SWHSVFPNASTVFMARGLSDHCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 804
            SW+++FP++ +F + SDH L + +PF+FF++L+++LDF++ V + W
Sbjct: 211  SWNALFPSSLGIFGSLDFSDHVSCGVVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNW 269

>gb|AAD12028.1| putative non-LTR retroelement reverse transcriptase
           [Arabidopsis thaliana]
          Length = 1447

 Score = 156 bits (395), Expect(2) = 3e-55
 Identities = 91/267 (34%), Positives = 146/267 (54%), Gaps = 8/267 (2%)
 Frame = -3

Query: 777 YLLTRKLKKVKEALKGL-NSRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEKRLIE 601
           + ++KLK +K L+ L R+GNL E+ + L Q ++P+P+ EE +
Sbjct: 693 FRFSKKLKSLKPLLRNLAKERLGNLVKKTREAYDTLCKKQESTLNNPTPNAMKEEVEAHD 752

Query: 600 VYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQSTHE 421
           ++ +EE FLK+KS+++WL GD NNK F + R N I+ + +G + +
Sbjct: 753 RWEHVAGLEEKFLKKKSKLHWLDGGDKNNKAFHRAVVTREAQNSISEIQCQDGSVTAKGD 812

Query: 420 DMARVATSYYQSILG------NSAEVCDFPEDLKLP-SITSHQASVLTNEFTSDDILKTL 262
           ++ A +++ L + D + L S T H+ +LT T+++I K L
Sbjct: 813 EIKAYAERFFREFLQLIPNEYEGVTMADLQDLLPFRCSETEHE--LLTRVVTAEEIKKVL 870

Query: 261 KSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLVPKQQN 82
           SM ++SPGPDGF EF+ +TW I+G I FF LP+ +N + L+PK++
Sbjct: 871 FSMPNDKSPGPDGFTSEFFKATWEILGNEFILAIQSFFAKGFLPKGINTTILALIPKKKE 930

Query: 81  ASEMKHFRPISCCNVLYKCIAKMLASR 1
           A EMK +RPISCCNV+YK I+K++A+R
Sbjct: 931 AKEMKDYRPISCCNVIYKVISKIIANR 957

 Score = 86.7 bits (213), Expect(2) = 3e-55
 Identities = 58/183 (31%), Positives = 89/183 (48%), Gaps = 7/183 (3%)
 Frame = -1

Query: 1331 VSCHATLLSYNKSFFIMFIYGSITVVERRALWEDLSAF--SSTILDSDWTIYGDFNTCLS 1158
            ++C L + + FF F+Y S +R+ LW +L S I W I+GDFN L
Sbjct: 497  ITCSVKLENRDDEFFCSFVYASNFRDDRKVLWNELQDHYDSPIIKKKPWIIFGDFNETLE 556

Query: 1157 IDE--KQGGNVLWTLGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLIN 984
            ++E K N + ++GM +F+ + L+D+ TW + + KKLDRV++N
Sbjct: 557  LEEHSKVEDNPVVSMGMRDFRSMVNYCSLTDMAHHGPLYTWSNKREHDLIAKKLDRVMVN 616

Query: 983  PSWHSVFPNASTVFMARGLSDHCPTATSL--GMLNQIRNK-PFQFFSHLIQDLDFISKVM 813
            W FP + +VF A G DH +L G + +R K PF+F + L + DF V
Sbjct: 617  DVWTQSFPQSYSVFEAGGCLDHLRGRINLNDGPGSIVRGKRPFKFVNVLTEMEDFKPTVD 676

Query: 812  EAW 804
            W
Sbjct: 677  SYW 679

>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score:
           60.13) [Arabidopsis thaliana]
          Length = 1164

 Score = 146 bits (369), Expect(2) = 4e-55
 Identities = 91/269 (33%), Positives = 137/269 (50%), Gaps = 6/269 (2%)
 Frame = -3

Query: 789 GDPWYLLTRKLKKVKEALKGLN-SRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEK 613
           G Y ++ KLK +K+ ++ + ++ E+ +AL+ Q++L +SP P E
Sbjct: 173 GSAMYRVSVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIEA 232

Query: 612 RLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQ 433
           ++ E F Q+SRVNWL+ GD N+ YF R + N I L D GD
Sbjct: 233 ETQRKWRILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRI 292

Query: 432 STHEDMARVATSYYQSILGNSAEVCDFPEDLKLPSITSHQASV-----LTNEFTSDDILK 268
           +++ Y+QS LG+ + F E + ++ S++ S L F+S+ I
Sbjct: 293 EGQQNLENHCVEYFQSNLGSEQGLPLF-EQADISNLLSYRCSPAQQVSLDTPFSSEQIKN 351

Query: 267 TLKSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLVPKQ 88
           S+ +N++ GPDGF EF+ + W IIG VT I FF S L + NA + L+PK
Sbjct: 352 AFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKI 411

Query: 87  QNASEMKHFRPISCCNVLYKCIAKMLASR 1
           NAS M FRPISC N +YK I+K+L R
Sbjct: 412 TNASSMSDFRPISCLNTVYKVISKLLTDR 440

 Score = 96.3 bits (238), Expect(2) = 4e-55
 Identities = 55/166 (33%), Positives = 84/166 (50%), Gaps = 5/166 (3%)
 Frame = -1

Query: 1286 IMFIYGSITVVERRALWEDLSAFSST--ILDSDWTIYGDFNTCLSIDE---KQGGNVLWT 1122
            + F+Y S V R+ LW ++ FS+ ++D WT+ GDFN L E G NV
Sbjct: 3    LSFVYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVDRP 62

Query: 1121 LGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINPSWHSVFPNASTVF 942
            + F++ L L+DL + TWW+ P KKLDR+L+N W + FP++ +F
Sbjct: 63   TRI--FRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSSLGLF 120

Query: 941  MARGLSDHCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 804
            SDH SL + KPF+F + L++D +F+S + W
Sbjct: 121  GEPDFSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKW 166