BLASTX nr result
ID: Angelica22_contig00011623
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00011623 (1751 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2... 155 3e-35 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 141 6e-31 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 141 6e-31 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 129 2e-27 gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana] 124 4e-27 >ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1| predicted protein [Populus trichocarpa] Length = 517 Score = 155 bits (392), Expect = 3e-35 Identities = 101/365 (27%), Positives = 156/365 (42%), Gaps = 4/365 (1%) Frame = +1 Query: 304 GLSEGFSTYKILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARIQLLKSTIFG 483 G EG K LG+ +L C LV +IT ++ WTC L A R+QL+ S +F Sbjct: 48 GFREGELPMKYLGVPLLSSRLKAIYCKGLVDRITSKVRHWTCRTLSYAGRVQLINSVLFS 107 Query: 484 I*WYWTAYLFLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSVGGLGPRDLD 663 I YW + LP +VI V+ +M FLW GS K++W+ CLPK GGLG + + Sbjct: 108 IQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWDQVCLPKKEGGLGIKSIK 167 Query: 664 *WNQASVCYQLW*TLQPQFSSVWLN*VCTIFLRNRIILDHEIPARTPWCFKNFLSARSIA 843 WN+ ++ +W S+W + + LR R + P W + L RS+A Sbjct: 168 EWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTPQNCSWAWGKILKLRSLA 227 Query: 844 LPKIYV*CGVKFKVPRVARPVARLHTAEVKNMADGSQFISIVESTNLEEI*NCHLNDQWA 1023 PK+ G + H + G +FI ++ N +W Sbjct: 228 WPKMKYIIGDGM---TTSLWFDNWHPHSPLADSYGERFIYDSGMAKNAKVNVLIQNSEWK 284 Query: 1024 VTSSN----HDIAFAVRHSISTTHISGHDEILWDGLQLKHVSVSAVWHSFRQSAPQPPWL 1191 ++ H I A+ S S + DE++W SV W R+ W Sbjct: 285 TPTTQAIGWHPIIEAI-PSNSNPKMGQKDELVWLDSPNHRFSVKVAWEQLRRHRQMVEWH 343 Query: 1192 GAIWHPWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNPPCLLCNAATESANHLFMQCP 1371 +W +P+ + LW A+ + T+D + +F + N C LC E NHLF +C Sbjct: 344 DIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPN-RCSLCLRNNEDHNHLFFECS 402 Query: 1372 YAQSV 1386 Y +++ Sbjct: 403 YTKAI 407 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 141 bits (355), Expect = 6e-31 Identities = 121/478 (25%), Positives = 191/478 (39%), Gaps = 46/478 (9%) Frame = +1 Query: 304 GLSEGFSTYKILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARIQLLKSTIFG 483 G G + LG+ KL +AD PL+ K++ R+ W L A R QL+ S IFG Sbjct: 614 GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFG 673 Query: 484 I*WYWTAYLFLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSVGGLGPRDLD 663 + +W + LP I K++SL FLW GS+ R K+SW CCLPKS GGLG R Sbjct: 674 LINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFG 733 Query: 664 *WNQASVCYQLW*TLQPQFSSVWLN*VCTIFLRNRIILDHEIPARTPWCFKNFLSARSIA 843 WN+ + +W L + +S+W L + PW +K L+ R +A Sbjct: 734 EWNKTLLLRLIW-VLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRPLA 792 Query: 844 LPKIYV*CG------------------VKFKVPRVARPVARLHTAEVKNMADGSQFISIV 969 I G +K+ +RP+ +A+V + DGS Sbjct: 793 EKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGS------ 846 Query: 970 ESTNLEEI*NCHLNDQWAVTSSNHDIAFAVRHSIST----THISGHDEILW--DGLQLKH 1131 W + S A ++ +++ + + D W D + + Sbjct: 847 ---------------GWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQG 891 Query: 1132 VSVSAVWHSFRQSAPQPPWLGAIWHPWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNP 1311 S + W R P W ++W +PK W A LN + T+ ++ + + V + Sbjct: 892 FSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGL-VSSA 950 Query: 1312 PCLLCNAATESANHLFMQCPYAQSV-----LQASPMTYHFLVSWNPYLQW--QFHVGQVT 1470 C LC+ TE+ +HL + C ++ V L+ P L +W L W Q + Sbjct: 951 ECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCP-RQRLLCTWAELLSWTRQSTAAAPS 1009 Query: 1471 TMRKQISYLY---------LAVVLQRMIHCNQVTRLIRLHL------MRHYASWQSEL 1599 +RK ++ L L + + C+ V RL+ L RH W+ L Sbjct: 1010 LLRKVVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILSRRHKRRWRELL 1067 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 141 bits (355), Expect = 6e-31 Identities = 121/478 (25%), Positives = 191/478 (39%), Gaps = 46/478 (9%) Frame = +1 Query: 304 GLSEGFSTYKILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARIQLLKSTIFG 483 G G + LG+ KL +AD PL+ K++ R+ W L A R QL+ S IFG Sbjct: 614 GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFG 673 Query: 484 I*WYWTAYLFLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSVGGLGPRDLD 663 + +W + LP I K++SL FLW GS+ R K+SW CCLPKS GGLG R Sbjct: 674 LINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFG 733 Query: 664 *WNQASVCYQLW*TLQPQFSSVWLN*VCTIFLRNRIILDHEIPARTPWCFKNFLSARSIA 843 WN+ + +W L + +S+W L + PW +K L+ R +A Sbjct: 734 EWNKTLLLRLIW-VLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRPLA 792 Query: 844 LPKIYV*CG------------------VKFKVPRVARPVARLHTAEVKNMADGSQFISIV 969 I G +K+ +RP+ +A+V + DGS Sbjct: 793 EKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGS------ 846 Query: 970 ESTNLEEI*NCHLNDQWAVTSSNHDIAFAVRHSIST----THISGHDEILW--DGLQLKH 1131 W + S A ++ +++ + + D W D + + Sbjct: 847 ---------------GWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQG 891 Query: 1132 VSVSAVWHSFRQSAPQPPWLGAIWHPWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNP 1311 S + W R P W ++W +PK W A LN + T+ ++ + + V + Sbjct: 892 FSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGL-VSSA 950 Query: 1312 PCLLCNAATESANHLFMQCPYAQSV-----LQASPMTYHFLVSWNPYLQW--QFHVGQVT 1470 C LC+ TE+ +HL + C ++ V L+ P L +W L W Q + Sbjct: 951 ECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCP-RQRLLCTWAELLSWTRQSTAAAPS 1009 Query: 1471 TMRKQISYLY---------LAVVLQRMIHCNQVTRLIRLHL------MRHYASWQSEL 1599 +RK ++ L L + + C+ V RL+ L RH W+ L Sbjct: 1010 LLRKVVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILSRRHKRRWRELL 1067 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 129 bits (325), Expect = 2e-27 Identities = 104/381 (27%), Positives = 164/381 (43%), Gaps = 8/381 (2%) Frame = +1 Query: 277 KTVSSV*FNGLSEGFSTYKILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARI 456 +T+SS F + G + LG+ +++ AD +PL+ + +++ WT L A R+ Sbjct: 1052 QTLSSFPF---ANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRL 1108 Query: 457 QLLKSTIFGI*WYWTAYLFLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSV 636 LL S I I +W + LP I +++ L FLW G + KI+W++ C PK Sbjct: 1109 ALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKE 1168 Query: 637 GGLGPRDLD*WNQASVCYQLW*TLQPQFSSVWLN*VCTIFLRNRIILD-HEIPARTPWCF 813 GGLG + L N+ S +W L Q S+W+ + T +R +E + W + Sbjct: 1169 GGLGIKSLAEANKVSCLKLIWRLLSTQ-PSLWVTWIWTFIIRKGTFWSANERSSLGSWMW 1227 Query: 814 KNFLSARSIA--LPKIYV*CG--VKFKVPRVARPVARLHTAEVKNMADGSQFISIVESTN 981 K L R +A + K+ V G F + L + + D + I TN Sbjct: 1228 KKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVID----LGIPLETN 1283 Query: 982 LEEI*NCHLNDQWAVTSSNHDIAFAVRHSISTTHISGHDEILWDGLQ---LKHVSVSAVW 1152 LE + H + Q N I ++ +G D LW L+ K W Sbjct: 1284 LETVLRTHQHRQHRAAIYNR-INAEIQRLQQQEREAGPDISLWRSLKNDFNKRFITKVTW 1342 Query: 1153 HSFRQSAPQPPWLGAIWHPWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNPPCLLCNA 1332 ++ R PQ W +W P+ PK + LW + N + T D + ++ C LCN Sbjct: 1343 NNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSG-QLVTCTLCNN 1401 Query: 1333 ATESANHLFMQCPYAQSVLQA 1395 A E+ +HLF C Y V +A Sbjct: 1402 AEETRDHLFFSCQYTSYVWEA 1422 >gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana] Length = 438 Score = 124 bits (310), Expect(2) = 4e-27 Identities = 101/403 (25%), Positives = 170/403 (42%), Gaps = 16/403 (3%) Frame = +1 Query: 331 KILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARIQLLKSTIFGI*WYWTAYL 510 + LG+ KL +++ PLV+KI ++N W L A R+QLL S I GI +W + Sbjct: 34 RYLGLPLMSRKLKISEFEPLVVKIKAKLNFWAVKSLSFAGRLQLLSSVISGIVVFWMSTF 93 Query: 511 FLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSVGGLGPRDLD*WNQASVCY 690 LP I +++S+ FLW G K+SW+T CLPK+ GGLG R WN A Sbjct: 94 RLPKGCIREIESMCARFLWSGGTDEHHKAKVSWSTVCLPKAEGGLGVRKFTEWNTALNLK 153 Query: 691 QLW*TLQPQFSSVWLN*VCTIFLRNRIILDHEIPART--PWCFKNFLSARSIALPKIYV* 864 +W L S+W+ L + I T W ++ L R +A ++ Sbjct: 154 LIW-LLFSNSGSLWVAWHLFHNLSTSVSNFWLIKEGTTDSWNWRCLLRLRPLASKFLFCS 212 Query: 865 CGVKFKVPRVARPVARLHTAEVKNMADGSQFISIVESTNLEEI*NCHLNDQWAVTSSNHD 1044 G A +DG + I + + ++ N ++W + S Sbjct: 213 IGNGLTASFWADSWTPFGPLLTFIGSDGPRNQRIPLCSKVADVVN---GNRWLLPSPRSS 269 Query: 1045 IAFAVRHSISTTHIS----GHDEILWDGLQLKHVSVSA--VWHSFRQSAPQPPWLGAIWH 1206 A + ++T I D LW + S+ W++ R + PW+ ++W Sbjct: 270 NALNLHAFLTTLSIPLQPLVEDSYLWKVENCSDIGFSSAHTWNALRHKEVEKPWVSSVWF 329 Query: 1207 PWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNPPCLLCNAATESANHLFMQCPYAQSV 1386 PK +W + + TK M+ + V +P C LC E+ +HL + C ++ SV Sbjct: 330 KGVTPKNAFNMWITHQDRLRTKLRMIAWGFLV-SPVCALCQVGFETRDHLMLSCDFSVSV 388 Query: 1387 LQ------ASPMTYHFLVSWNPYLQWQFHVGQV--TTMRKQIS 1491 +P+T +W+ + W + + +T+RK ++ Sbjct: 389 WALVRQRIGTPLT--IFQNWSELILWTQNRSKAAPSTLRKLVA 429 Score = 25.8 bits (55), Expect(2) = 4e-27 Identities = 10/13 (76%), Positives = 13/13 (100%) Frame = +2 Query: 320 SLPIKYLGLPLIS 358 +LPI+YLGLPL+S Sbjct: 30 NLPIRYLGLPLMS 42