BLASTX nr result
ID: Angelica22_contig00026465
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00026465
         (2100 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

dbj|BAD99219.1| polypeptide with a gag-like domain [Petunia x hy...   211   8e-52
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis...   190   1e-45
emb|CAN60374.1| hypothetical protein VITISV_001215 [Vitis vinifera]   173   1e-40
gb|ABD32757.1| Integrase, catalytic region [Medicago truncatula]      172   2e-40
gb|AAG10817.1|AC011808_5 Putative retroelement polyprotein [Arab...   168   6e-39

>dbj|BAD99219.1| polypeptide with a gag-like domain [Petunia x hybrida]
          Length = 463

 Score = 211 bits (536), Expect = 8e-52
 Identities = 128/405 (31%), Positives = 214/405 (52%), Gaps = 11/405 (2%)
 Frame = -1

Query: 1572 GVGSTTNAWNRNVTLNSQQDPTGVYYIHPSDTNTTQLVSIKFNGTGFSNWKRSMLLSIST 1393
            G GS++ + + N L D YY+ SD L++ F+G+ + NWKR +L+S+S
Sbjct: 14   GTGSSSGSRDENSPLKVL-DINHPYYLASSDAPGMNLINTSFDGSSYGNWKRGVLISLSA 72

Query: 1392 KNKIGFFNGTEVKPGSGTSDFKGWERCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDL 1213
            KNK+GF G KP F+ W RC+D+V +WLLN+L I++SVL+ +T +E+W +L
Sbjct: 73   KNKLGFITGAYKKPDKEDLLFEQWRRCSDMVLAWLLNSLSKEIAESVLYSQTAQELWQEL 132

Query: 1212 EDRFGYASMTQVYSIEQRLSELNQGTKSVLDFFLLRLRLCGML*VMLVHIPCCTCHKCTC 1033
            E R+G T+++ +++ L+ ++QGT V +F R+ + V+ + C C C
Sbjct: 133  EQRYGQIDGTKMFQLQRELNNVSQGTNDVAAYFNKLKRIWDQMKVLNTFMVC----SCEC 188

Query: 1032 NVTQKVH--QMQQDHKLIQFMMKLSDKYSTLRGNILMQQPLPNMSSAFRMFSQEERHQEL 859
            N K H +MQ+D +LIQF+M L++ YS +RGNILM +PLP+ + A+ + S EE + +
Sbjct: 189  NCEAKGHNAKMQEDQQLIQFLMGLNEVYSGIRGNILMMKPLPSTAQAYSIISHEETQRGI 248

Query: 858  SQIGS-QTESLAFIADKKGFDSQRNYRSGHSNAQRQNFNSSSYKAGPGTNTVGNKKCANF 682
            + + T+S AF A + +++QRN + Q N ++Y+ N
Sbjct: 249  AAGNNVSTDSAAFNASTQNWNNQRNNYDNRARNQNYNNRRNNYEGRRSNQN-------NS 301

Query: 681  FCTHCKISGHSVDRCFKIHG------YPA-NFKSSKDRKVADVFICNNADTVEXXXXXXX 523
            +CT+C+ SGH + C K++G PA N + + + N +T E
Sbjct: 302  YCTYCRTSGHVREECRKLNGRDNRNRRPAGNPNTQANAAYEGNTMQRNGNTPENTSAGSV 361

Query: 522  XXXXXXVAQYQQLMELL-NKKSLAPSGSTSHTNVDHATMLVGPFA 391
            Q +QL+++L + S++ + S S N + A VG +A
Sbjct: 362  NAQGFTKEQCEQLIQMLQSAHSISSTASCSEVN-NSAANFVGKYA 405

>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
 gi|7268152|emb|CAB78488.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
          Length = 1489

 Score = 190 bits (482), Expect = 1e-45
 Identities = 97/301 (32%), Positives = 165/301 (54%), Gaps = 1/301 (0%)
 Frame = -1

Query: 1500 YYIHPSDTNTTQLVSIKFN-GTGFSNWKRSMLLSISTKNKIGFFNGTEVKPGSGTSDFKG 1324
            YY+H +D LVS + + F +W+RS+L++++ +NK+GF NGT KP DF
Sbjct: 34   YYLHSADHAGLILVSDRLTTASDFHSWRRSILMALNVRNKLGFINGTITKPPEDHRDFGA 93

Query: 1323 WERCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDLEDRFGYASMTQVYSIEQRLSELN 1144
            W RCND+V +WL+N++D I +S+L+ T + IW++L RF +++ IEQ+LS++
Sbjct: 94   WSRCNDIVSTWLMNSVDKKIGQSLLYIATVQGIWNNLLSRFKQDDAPRIFDIEQKLSKIE 153

Query: 1143 QGTKSVLDFFLLRLRLCGML*VMLVHIPCCTCHKCTCNVTQKVHQMQQDHKLIQFMMKLS 964
            QG+ + ++ L L V +P CTC +C C+ K +QQ ++ +F+ +L+
Sbjct: 154  QGSMDISTYYTALLTLWEEH-RNYVELPVCTCGRCECDAAVKWEHLQQRSRVTKFLKELN 212

Query: 963  DKYSTLRGNILMQQPLPNMSSAFRMFSQEERHQELSQIGSQTESLAFIADKKGFDSQRNY 784
            + + R +ILM +P+P + AF M +Q+ER + + + ++ +S+AF
Sbjct: 213  EGFDQTRRHILMLKPIPTIKEAFNMVTQDERQRNVKPL-TRVDSVAF------------- 258

Query: 783  RSGHSNAQRQNFNSSSYKAGPGTNTVGNKKCANFFCTHCKISGHSVDRCFKIHGYPANFK 604
            N N + ++Y A T K CTHC GH++ +C+K+HGYP K
Sbjct: 259  ----QNTSMINEDENAYVAAYNTVRPNQKP----ICTHCGKVGHTIQKCYKVHGYPPGMK 310

Query: 603  S 601
            +
Sbjct: 311  T 311

>emb|CAN60374.1| hypothetical protein VITISV_001215 [Vitis vinifera]
          Length = 1535

 Score = 173 bits (439), Expect = 1e-40
 Identities = 108/364 (29%), Positives = 179/364 (49%), Gaps = 10/364 (2%)
 Frame = -1

Query: 1524 SQQDPTGVYYIHPSDTNTTQLVSIKFNGTGFSNWKRSMLLSISTKNKIGFFNGTEVKPGS 1345
            S+ D + Y+ H SD L+S NG +S W+R+M L+++ KNK+GF NG P
Sbjct: 34   SKSDLSNPYFTHHSDHPGLVLISKPLNGDNYSAWRRAMALALNAKNKLGFVNGIIKAPSE 93

Query: 1344 GT--SDFKGWERCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDLEDRFGYASMTQVYS 1171
            T D+ W RCND+V SW++N L+ IS SV+++ T E+W+DL +RF ++ +++
Sbjct: 94   ETHPDDYATWSRCNDMVHSWIVNTLNPEISDSVIYYATAHEVWEDLCERFSQSNAPRIFE 153

Query: 1170 IEQRLSELNQGTKSVLDFFLLRLRLCGML*VMLVHIPCCTCHKCTCNVTQKVHQMQQDHK 991
            I++ ++ Q S+ ++ L L + N Q Q +
Sbjct: 154  IQREIAYHRQEQLSISVYYTKLKSLWDEL--------------ASYNDASSGAQ-QDQQR 198

Query: 990  LIQFMMKLSDKYSTLRGNILMQQPLPNMSSAFRMFSQEERHQELSQIGSQTESLAFIADK 811
            L+QF+M L++ YS +RG IL+ PLP++ A+ QEE+ + LS + ES + A
Sbjct: 199  LMQFLMGLNESYSAIRGQILLMNPLPSIRQAYSSVCQEEKQRLLSATHTTAESNSSAAMA 258

Query: 810  KGFDSQRNYRSGHSNAQRQN--FNSS--SYKAGPGTNTVGNKKCANFFCTHCKISGHSVD 643
            + +N +G++ + R + +NSS S + G+ K CT+C GH V+
Sbjct: 259  VRSNQMKNNSAGNARSDRSDRFYNSSQDSRRFDQDKRRSGSSK-GRPQCTYCGEMGHFVE 317

Query: 642  RCFKIHGYPANFKSSKD----RKVADVFICNNADTVEXXXXXXXXXXXXXVAQYQQLMEL 475
            +C+++HGYP ++ + + F+ N AQ QQL+ L
Sbjct: 318  KCYQLHGYPPGHPKARTGSNFNRHKNTFVANQVSDGANKDGGKSVLTGITEAQLQQLLSL 377

Query: 474  LNKK 463
            LN K
Sbjct: 378  LNDK 381

 Score = 82.4 bits (202), Expect = 4e-13
 Identities = 44/93 (47%), Positives = 57/93 (61%), Gaps = 2/93 (2%)
 Frame = -2

Query: 275 VWHLRLGHVPFQKLKLIPPLSGLKNPSQELI--CQICPLAKQTRVSFPISSIKTKCPFEL 102
           +WH RLGHV +L I N S + C IC LAKQ R+ F S I ++ PF+L
Sbjct: 547 LWHSRLGHVSHSRLSFIA--KNFLNFSIQFNNDCPICLLAKQHRLPFGTSEISSEKPFDL 604

Query: 101 IHIDVRGPYKVKAHNGCNQFLTIVDDYSRFTWV 3
           IH D+ G YK + +G + FLTIVDDY+RFTW+
Sbjct: 605 IHCDIWGRYKHSSLSGAHYFLTIVDDYTRFTWI 637

>gb|ABD32757.1| Integrase, catalytic region [Medicago truncatula]
          Length = 1157

 Score = 172 bits (437), Expect = 2e-40
 Identities = 99/321 (30%), Positives = 172/321 (53%), Gaps = 6/321 (1%)
 Frame = -1

Query: 1500 YYIHPSDTNTTQLVSIKFNGTGFSNWKRSMLLSISTKNKIGFFNGTEVKPGSGTSDFKGW 1321
            YYIHPSD ++ +++ K NG+ + W RSM ++ KNK+ F +G+ P + + W
Sbjct: 15   YYIHPSDGPSSLIITPKLNGSNYLAWHRSMQRALGAKNKLVFLDGSISVPDIDDLNRQAW 74

Query: 1320 ERCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDLEDRFGYASMTQVYSIEQRLSELNQ 1141
            ERCN L+ SW++N++ +SI+++++F T WDDL++ F +V S+ ++ L Q
Sbjct: 75   ERCNHLIHSWIVNSVTESIAQTIVFHDTALSAWDDLKECFSKVDRVRVLSLRSTINNLKQ 134

Query: 1140 GTKSVLDFFLLRLRLCGML*VMLVH--IPCCTC-HKCTCNVTQKVHQMQQDHKLIQFMMK 970
            GTKSVLD+F + LC + + H IP CTC H C C + + + +++QF+
Sbjct: 135  GTKSVLDYF---IELCTLWDELNSHRPIPNCTCIHPCRCESIRLAKYYRTEDQILQFLTG 191

Query: 969  LSDKYSTLRGNILMQQPLPNMSSAFRMFSQEERHQELSQIGSQTESLAFIADKKGFDSQR 790
            L+D +S ++ IL+ PLP ++ + + QEE ++++ F DS
Sbjct: 192  LNDTFSVVKTQILLMDPLPPINKVYSLVVQEE-----------SQNIVFSTPSISDDSSI 240

Query: 789  NYRSGHSNAQRQNFNSSSYKAGPGTNTVGNKKCANFFCTHCKISGHSVDRCFKIHGYPAN 610
            + + S+A++ Y G GT++ K + FCT C H+++ C+ +G+P N
Sbjct: 241  SVNA--SDARK------FYPRGKGTSSTSANKGKDRFCTFCNRYNHTIEFCYLKYGHP-N 291

Query: 609  F---KSSKDRKVADVFICNNA 556
            F SS + V++ NN+
Sbjct: 292  FNKGNSSANNAVSEASDTNNS 312

>gb|AAG10817.1|AC011808_5 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score = 168 bits (425), Expect = 6e-39
 Identities = 86/301 (28%), Positives = 154/301 (51%)
 Frame = -1

Query: 1497 YIHPSDTNTTQLVSIKFNGTGFSNWKRSMLLSISTKNKIGFFNGTEVKPGSGTSDFKGWE 1318
            ++H +D +VS++ +G ++ W +M +++ KNKI F +G+ +P G + W
Sbjct: 55   FLHNADHPGISIVSVQLDGANYNQWSSAMKIALDAKNKIAFIDGSCPRPEEGNHLLRIWS 114

Query: 1317 RCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDLEDRFGYASMTQVYSIEQRLSELNQG 1138
            RCN +V SW+LN+++ I S+L F +IW+DL +RF ++ + + + Q++ +L QG
Sbjct: 115  RCNSMVKSWILNSVNREIYGSILSFDDAAQIWNDLHNRFHMTNLPRTFQLVQQIQDLRQG 174

Query: 1137 TKSVLDFFLLRLRLCGML*VMLVHIPCCTCHKCTCNVTQKVHQMQQDHKLIQFMMKLSDK 958
            + ++ ++ L L +PC C K TC ++I+F+ L++K
Sbjct: 175  SMNLSTYYTTLKTLRDNLDGAEASVPCHCCKKSTCESQIFAKSNVNRGRIIKFLAGLNEK 234

Query: 957  YSTLRGNILMQQPLPNMSSAFRMFSQEERHQELSQIGSQTESLAFIADKKGFDSQRNYRS 778
            YS +RG I+M++PLP+++ + + Q++ ++ S + S AF K
Sbjct: 235  YSIIRGQIIMKKPLPDLAEVYNILDQDDSQRQFS---NNVASAAFQVTK----------- 280

Query: 777  GHSNAQRQNFNSSSYKAGPGTNTVGNKKCANFFCTHCKISGHSVDRCFKIHGYPANFKSS 598
            + Q SSS PG KK C+H +GH+ +RC+K+HGYP +K
Sbjct: 281  --DDVQPGALASSSNMPQPGMLGAVQKKDKKSICSHYGYTGHTSERCYKLHGYPVGWKKG 338

Query: 597  K 595
            K
Sbjct: 339  K 339

 Score = 63.5 bits (153), Expect = 2e-07
 Identities = 29/62 (46%), Positives = 39/62 (62%)
 Frame = -2

Query: 188 LICQICPLAKQTRVSFPISSIKTKCPFELIHIDVRGPYKVKAHNGCNQFLTIVDDYSRFT 9
           L C IC AKQ ++++P PF+L+HIDV GP+ G + FLTIVDD++R T
Sbjct: 564 LHCDICQRAKQKKLTYPSRHNICLAPFDLLHIDVWGPFSEPTQEGYHYFLTIVDDHTRVT 623

Query: 8   WV 3
           WV
Sbjct: 624 WV 625