BLASTX nr result
ID: Angelica23_contig00009925
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00009925
         (1120 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   119   7e-42
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       127   4e-41
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   134   7e-41
ref|XP_002331338.1| predicted protein [Populus trichocarpa] gi|2...   120   1e-39
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   125   1e-37

>emb|CAA18234.1| putative protein [Arabidopsis thaliana]
 gi|7269488|emb|CAB79491.1| putative protein [Arabidopsis thaliana]
          Length = 1141

 Score = 119 bits (298), Expect(2) = 7e-42
 Identities = 75/228 (32%), Positives = 114/228 (50%), Gaps = 7/228 (3%)
 Frame = +2

Query: 5    CSYNIRGLNNKKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKF-NWHFNYNSH 181
            C +NI N + + VN ++E HVKQ KFI+A W F+ N
Sbjct: 3    CGFNIPSHRNG---FKKWFKVNRPIFGGVIEKHVKQPKD---KKFINALLPGWFFDENYG 56

Query: 182  YN--GRIWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERRLLWR 355
            ++ G+IWV +DP + +++ + Q ITC V S+ +S VYA N +R+ LWR
Sbjct: 57   FSDLGKIWVLWDPSV-EVVIVAKSLQMITCEVLFPNSRTWIVISVVYAANEDDKRKELWR 115

Query: 356  DLESVRS--LIGNTAWCLSGDFNDCLGPSESSNHANWNAG--MLEFKDASFQLGVTDLKS 523
            ++ ++ + + N W L GDFN L P E S H + N + +F++ ++DL
Sbjct: 116  EITALVASPVTFNRPWILLGDFNQVLHPHEHSRHVSLNVDRRIRDFRECLLDAELSDLVY 175

Query: 524  SGQKFTWWDSCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDH 667
            G FTWW+ P+ KK+DR LVN+ W + FP + P SDH
Sbjct: 176  KGSSFTWWNKSKTRPVAKKIDRILVNESWSNLFPSSFGLFGPPDFSDH 223

 Score = 79.0 bits (193), Expect(2) = 7e-42
 Identities = 45/141 (31%), Positives = 70/141 (49%), Gaps = 2/141 (1%)
 Frame = +1

Query: 703  KIPKPFQFFKHLIKAPGFMEAVSDAW-NTNIPGDPWLVLTSKIRRVKQAMRTLNA-NTGN 876
            K +PF+FF L+K P F+ V D W +TN+ G ++ K++ +K+ ++ + N N
Sbjct: 236  KAKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLKALKKPIKDFSRLNYSN 295

Query: 877  LHLKVSMARSELLTFQDNLPDCPSVAQLTEENRLKCNLTAALSEEEIFLKQKSRVSWLKS 1056
            L + A LL+FQ+ D PS+ E + + EE F +Q+SRV+W
Sbjct: 296  LEKRTEEAHETLLSFQNLTLDNPSLENAAHELEAQRKWQILATAEESFFRQRSRVTWFAE 355

Query: 1057 GDGNNSCFFNYCKGRWNSNKI 1119
            GDGN F R + N I
Sbjct: 356  GDGNTRYFHRMADSRKSVNTI 376

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 127 bits (319), Expect(2) = 4e-41
 Identities = 84/231 (36%), Positives = 117/231 (50%), Gaps = 9/231 (3%)
 Frame = +2

Query: 2    FCSYNIRGLNN--KKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKF-NWHF-- 166
            FC +NIRG NN ++ + ++ N ++ETHVKQ KFI+A W F
Sbjct: 6    FC-WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKD---RKFINALLPGWSFVE 61

Query: 167  NYNSHYNGRIWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERRL 346
            NY G+IWV +DP + + V++ + Q ITC V S VS VYA N R+
Sbjct: 62   NYAFSDLGKIWVMWDPSV-QVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKE 120

Query: 347  LWRDLES--VRSLIGNTAWCLSGDFNDCLGPSESSNHANWNA--GMLEFKDASFQLGVTD 514
            LW ++ + V +IG+ W + GDFN L P E SN + N M +F+D ++D
Sbjct: 121  LWIEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSD 180

Query: 515  LKSSGQKFTWWDSCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDH 667
            L+ G FTWW+ P+ KK+DR LVND W FP + SDH
Sbjct: 181  LRYKGNTFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIFGSLDFSDH 231

 Score = 68.2 bits (165), Expect(2) = 4e-41
 Identities = 42/141 (29%), Positives = 66/141 (46%), Gaps = 2/141 (1%)
 Frame = +1

Query: 703  KIPKPFQFFKHLIKAPGFMEAVSDAWNT-NIPGDPWLVLTSKIRRVKQAMRTLNA-NTGN 876
            K +PF+FF +L+K F+ V D W T N+ G ++ K++ +K+ ++ + N
Sbjct: 244  KAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRLNYSE 303

Query: 877  LHLKVSMARSELLTFQDNLPDCPSVAQLTEENRLKCNLTAALSEEEIFLKQKSRVSWLKS 1056
            L + A L+ QD P+ + E + + EE F +QKSR+SW
Sbjct: 304  LEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHILTAAEESFFRQKSRISWFAE 363

Query: 1057 GDGNNSCFFNYCKGRWNSNKI 1119
            GDGN F R +SN I
Sbjct: 364  GDGNTKYFHRMADARNSSNSI 384

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase
 [Brassica napus]
          Length = 1214

 Score = 134 bits (337), Expect(2) = 7e-41
 Identities = 81/236 (34%), Positives = 125/236 (52%), Gaps = 7/236 (2%)
 Frame = +2

Query: 8    SYNIRGLNN--KKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKFNWHF--NYN 175
            S+N+RG NN ++ R + ++ +ILET VK+ + +S+ W NY
Sbjct: 6    SWNVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARR--SLLSSFPGWKSVCNYE 63

Query: 176  SHYNGRIWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERRLLWR 355
            GRIWV +DP + TV+S + Q I+C+V+ +F V+FVYA N RR LW
Sbjct: 64   FAALGRIWVVWDPAV-EVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWS 122

Query: 356  DLE--SVRSLIGNTAWCLSGDFNDCLGPSESSNHANW-NAGMLEFKDASFQLGVTDLKSS 526
            +LE + + W + GDFN L P ++S + GM EF++ ++DL
Sbjct: 123  ELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFR 182

Query: 527  GQKFTWWDSCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDHCPLSTSLGH 694
            G +TWW++ +P+ KK+DR LVND WL + PL++ S SDHCP ++ +
Sbjct: 183  GNHYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEFSDHCPSCVNISN 238

 Score = 60.5 bits (145), Expect(2) = 7e-41
 Identities = 45/138 (32%), Positives = 60/138 (43%), Gaps = 2/138 (1%)
 Frame = +1

Query: 712  KPFQFFKHLIKAPGFMEAVSDAWNT-NIPGDPWLVLTSKIRRVKQAMRTLNA-NTGNLHL 885
            KPF+ L+ P F+E + W+ G L+ K + +K +RT N + L
Sbjct: 245  KPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREHYSGLEK 304

Query: 886  KVSMARSELLTFQDNLPDCPSVAQLTEENRLKCNLTAALSEEEIFLKQKSRVSWLKSGDG 1065
            +V A L T Q+NL PS E + EE FL QKSRV WLK GD
Sbjct: 305  RVVQAAQNLKTCQNNLLAAPSSYLAGLEKEAHRSWAELALAEERFLCQKSRVLWLKCGDS 364

Query: 1066 NNSCFFNYCKGRWNSNKI 1119
            N + F R N+I
Sbjct: 365  NTTFFHRMMTARRAINEI 382

>ref|XP_002331338.1| predicted protein [Populus trichocarpa]
 gi|222873371|gb|EEF10502.1| predicted protein [Populus trichocarpa]
          Length = 819

 Score = 120 bits (302), Expect(2) = 1e-39
 Identities = 71/222 (31%), Positives = 113/222 (50%), Gaps = 3/222 (1%)
 Frame = +2

Query: 20   RGLNN--KKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKFNWHFNYNSHYNGR 193
            RGLN+ K + +R + I+L ++ET VK + ++S+ + +++ +NY+ GR
Sbjct: 386  RGLNDPIKHSELRRLIHQERIALFGLVETRVKDKNKDNVSQLLLRSWSFLYNYDFSCRGR 445

Query: 194  IWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERRLLWRDLESVR 373
            IWV ++ + V + Q I SV +A+ F S +Y N R LW D+ S
Sbjct: 446  IWVCWNADTVKVDVFGMSDQAIHVSVTILATNISFNTSIIYGDNNASLREALWSDIVSRS 505

Query: 374  SLIGNTAWCLSGDFNDCLGPSESSNHANWNAGMLEFKDASF-QLGVTDLKSSGQKFTWWD 550
            +T W L GDFN S+ + AG ++ D + V DL+ SG +TW +
Sbjct: 506  DGWESTLWILIGDFNAIRNQSDRLGGSTTWAGTMDRLDTCIREAKVDDLRYSGMHYTWSN 565

Query: 551  SCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDHCPL 676
            C + + +KLDR LVN+ W FPL+ +P G+SDH P+
Sbjct: 566  QCPENLIMRKLDRVLVNEKWNLKFPLSEARFLPSGMSDHSPM 607

 Score = 70.1 bits (170), Expect(2) = 1e-39
 Identities = 46/145 (31%), Positives = 67/145 (46%), Gaps = 9/145 (6%)
 Frame = +1

Query: 712  KPFQFFKHLIKAPGFMEAVSDAWNTNIPGDPWLVLTSKIRRVKQAMRTLN-ANTGNLHLK 888
            KPF+FF + FM V W+ N G P L K+R++KQ ++ N A+ N+ +
Sbjct: 620  KPFRFFDMWMDHDEFMPLVKKVWDQNSRGCPMYQLCCKLRKLKQELKLFNMAHFSNISDR 679

Query: 889  VSMARSELLTFQDNLPDCPSVAQLTEENRLKC--------NLTAALSEEEIFLKQKSRVS 1044
            V A++++ Q L EN + C + + EE F KQK+R+
Sbjct: 680  VRDAKNKMDKAQQAL-------HTAHENPILCMRERDVVHKYASTVRAEESFFKQKARIQ 732

Query: 1045 WLKSGDGNNSCFFNYCKGRWNSNKI 1119
            WL GD N S F GR N NK+
Sbjct: 733  WLSLGDQNTSYFHKSVNGRQNRNKL 757

>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein
 [Arabidopsis thaliana]
          Length = 893

 Score = 125 bits (315), Expect(2) = 1e-37
 Identities = 86/243 (35%), Positives = 121/243 (49%), Gaps = 8/243 (3%)
 Frame = +2

Query: 2    FCSYNIRGLN---NKKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKF-NWHF- 166
            FC +N+RG N +++ + + FL +N ++ETHVKQ KFIS W F
Sbjct: 6    FC-WNVRGFNISSHRRGFKKWFL-LNKPLFGGLIETHVKQPKE---KKFISNLLPGWSFV 60

Query: 167  -NYNSHYNGRIWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERR 343
            NY G+IWV +DP + + VI + Q ITC + S F VS VYA N R+
Sbjct: 61   ENYEFSVLGKIWVLWDPSV-KVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRK 119

Query: 344  LLWRDLE--SVRSLIGNTAWCLSGDFNDCLGPSESSNHANWNAGMLEFKDASFQLGVTDL 517
            LW +L ++ ++ +W + GDFN L P ES+ +AN + F+ + DL
Sbjct: 120  ELWNELVQLALSPVVVGRSWIVLGDFNQILNP-ESAINANIGRKIRAFRSCLLDSDLYDL 178

Query: 518  KSSGQKFTWWDSCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDHCPLSTSLGHA 697
            G +TWW+ C PL KK+DR LVND+W FP A+ + SDH L A
Sbjct: 179  VYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEVVLDPA 238

Query: 698  AEK 706
            K
Sbjct: 239  VLK 241

 Score = 58.2 bits (139), Expect(2) = 1e-37
 Identities = 39/141 (27%), Positives = 64/141 (45%), Gaps = 2/141 (1%)
 Frame = +1

Query: 703  KIPKPFQFFKHLIKAPGFMEAVSDAW-NTNIPGDPWLVLTSKIRRVKQAMRTLNA-NTGN 876
            K +PF+FF + + P F++ + + W + N+ G ++ K++ +K + + N +
Sbjct: 241  KAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSRENYSD 300

Query: 877  LHLKVSMARSELLTFQDNLPDCPSVAQLTEENRLKCNLTAALSEEEIFLKQKSRVSWLKS 1056
            + +VS A + +L Q PSV T E EE F QKS +SWL
Sbjct: 301  IEKRVSEAHAIVLHRQRITLTNPSVVHATLELEATRKWQILAKAEESFFCQKSSISWLYE 360

Query: 1057 GDGNNSCFFNYCKGRWNSNKI 1119
            GD N + F R + N I
Sbjct: 361  GDNNTAYFHKMADMRKSINTI 381