BLASTX nr result
ID: Papaver32_contig00042011
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver32_contig00042011 (780 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

OMO99308.1 Integrase, catalytic core [Corchorus capsularis]           231   8e-66
OMO62984.1 Integrase, catalytic core [Corchorus capsularis]           218   1e-61
OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis tha...   207   3e-57
AAC61290.1 putative retroelement pol polyprotein [Arabidopsis th...   204   2e-56
AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease ho...   204   3e-56
AAK51235.1 polyprotein [Arabidopsis thaliana]                         202   1e-55
CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 pu...   200   7e-55
AAC35532.1 contains similarity to proteases [Arabidopsis thaliana]    199   2e-54
CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] C...   199   2e-54
AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thal...   197   4e-54
ACP30598.1 disease resistance protein [Brassica rapa subsp. peki...   197   9e-54
CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]              196   2e-53
CAN61322.1 hypothetical protein VITISV_012106 [Vitis vinifera]        193   2e-52
OMO62605.1 Integrase, catalytic core [Corchorus capsularis]           190   2e-52
AAC67200.1 putative retroelement pol polyprotein [Arabidopsis th...   191   8e-52
CAN79884.1 hypothetical protein VITISV_002539 [Vitis vinifera]        190   2e-51
KYP66503.1 Retrovirus-related Pol polyprotein from transposon TN...   189   3e-51
KYP50444.1 Retrovirus-related Pol polyprotein from transposon TN...   189   5e-51
KYP72965.1 Retrovirus-related Pol polyprotein from transposon TN...
176 6e-51 CAN77295.1 hypothetical protein VITISV_005638 [Vitis vinifera] 188 9e-51 >OMO99308.1 Integrase, catalytic core [Corchorus capsularis] Length = 1335 Score = 231 bits (589), Expect = 8e-66 Identities = 120/273 (43%), Positives = 174/273 (63%), Gaps = 16/273 (5%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V +G+GK+L IS IG+++L + F L ++L VPE++ NLLS+A+FT++N+C + Sbjct: 363 VYIGDGKSLRISHIGDSSLCIGFSKFSLTNILFVPELKENLLSIAQFTKDNNCGFFLFPW 422 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSSLVA-----------TSDI----WH 461 G+ I++L ++L G + NLY I + + + SD+ WH Sbjct: 423 GFVIKDLRTGKVLLDGPVKGNLYMIPVKAAEKVVTKQLEQKQKQALFGGNNSDVSGVTWH 482 Query: 460 NRLGHPSSKIIQQLHNHKHIIVKS-SASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVH 284 RLGHP+ KII QLH+ K I K S + +C CQ KSKRLPF + L L+H Sbjct: 483 RRLGHPAGKIISQLHSSKLISPKDVSFCNLVCEACQTGKSKRLPFGYSTRVTSNVLDLIH 542 Query: 283 CDEWGPAPVSSIAGHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLL 104 CD WGP+P ++++G+RYYILF+DD+SR+ WIYP++ ++++L CFQ FKS+ EN F K+ Sbjct: 543 CDIWGPSPTATVSGYRYYILFVDDYSRYSWIYPLKQRSDSLVCFQTFKSMVENQFGHKIK 602 Query: 103 AFQTDGASELVKGLFKKKLDQNGILLRISCPKT 5 FQ DGA ELV+G+FK+ LD +GI LRISCP T Sbjct: 603 FFQCDGAKELVEGVFKQFLDGHGISLRISCPHT 635 >OMO62984.1 Integrase, catalytic core [Corchorus capsularis] Length = 989 Score = 218 bits (556), Expect = 1e-61 Identities = 109/251 (43%), Positives = 159/251 (63%), Gaps = 16/251 (6%) Frame = -3 Query: 709 TTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHYGYEIRNLHNHELLAHGKMINNL 530 ++ F L ++L VPE++ NLLS+A+FT++N+C +G+ I++L ++L G + NL Sbjct: 3 SSKFSLTNILFVPELKENLLSIAQFTKDNNCGFFLFPWGFVIKDLRTGKVLLDGPVKGNL 62 Query: 529 YPI---------------SSTMLQFAFSSLVATSDIWHNRLGHPSSKIIQQLHNHKHIIV 395 Y I + F ++ + WH RLGHP+ KII QLH+ K I Sbjct: 63 YMIPVKAAEKVVTKQLEQKQKQVLFGGNNSDVSGVTWHRRLGHPAGKIISQLHSSKLISP 122 Query: 394 KS-SASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAGHRYYILFI 218 K S+ + +C CQ KSKRLPF + L L+HCD WGP+P ++++G+RYYILF+ Sbjct: 123 KDVSSCNLVCEACQTGKSKRLPFGYSTRVTSNVLDLIHCDIWGPSPTTTVSGYRYYILFV 
182 Query: 217 DDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGLFKKKLDQN 38 DD+SR+ WIYP++ ++++L CFQ FKS+ EN F K+ FQ DGA ELV+G+FK+ LD + Sbjct: 183 DDYSRYSWIYPLKQRSDSLACFQTFKSMVENQFGHKIKFFQCDGAKELVEGVFKQFLDGH 242 Query: 37 GILLRISCPKT 5 GI LRISCP T Sbjct: 243 GISLRISCPHT 253 >OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis thaliana] Length = 2099 Score = 207 bits (526), Expect = 3e-57 Identities = 105/260 (40%), Positives = 151/260 (58%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 VM+GNG LPI+ G+ TL + + L+DVL P + +L+SV+K TR+ C + FD Sbjct: 475 VMVGNGDFLPITHTGSTTLPSSSGILSLKDVLVCPNIGKSLVSVSKLTRDYPCSVDFDCD 534 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSSLVATS--DIWHNRLGHPSSKIIQQ 422 + + +LLA G N LY + + + +SS T+ D+WH RLGHP+ +I+Q Sbjct: 535 YVRVTDKATKKLLAQGNNFNGLYVLKDSSVHAFYSSRQQTTSEDVWHMRLGHPNQQILQL 594 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 LH +K + + S +S +C CQ KS RLPF PL +HCD WGPAP+ S+ G Sbjct: 595 LHKNKAVNI-SKSSKGICEACQYGKSSRLPFSSSCSTISKPLQKIHCDLWGPAPIKSVQG 653 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 YY +F+D++SRF W YP++ K++ + F F++L EN F K+ +FQ DG E Sbjct: 654 FSYYAIFVDNYSRFCWFYPLKFKSDFFKIFTIFQALVENQFQNKIGSFQCDGGGEFTSAR 713 Query: 61 FKKKLDQNGILLRISCPKTP 2 F L Q+GI ISCP TP Sbjct: 714 FLNHLQQHGIQQLISCPYTP 733 >AAC61290.1 putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1149 Score = 204 bits (519), Expect = 2e-56 Identities = 108/261 (41%), Positives = 154/261 (59%), Gaps = 3/261 (1%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 VM +G LPI+ IG+A L + + + L+DVL P + +LLSV+K T++ C TFD Sbjct: 343 VMASDGNFLPITHIGSANLPSTSGNLPLKDVLVCPNIAKSLLSVSKLTKDYPCSFTFDAD 402 Query: 595 GYEIRNLHNHELLAHGKMINN-LYPISSTMLQFAFSS--LVATSDIWHNRLGHPSSKIIQ 425 G +++ ++L G + LY + + Q +S+ + AT ++WH RLGHP+ +++Q Sbjct: 403 GVLVKDKATCKVLTKGSSTSEGLYKLENPKFQMFYSTRQVKATDEVWHMRLGHPNPQVLQ 462 Query: 
424 QLHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIA 245 L N K I + S S +C C+L KS RLPF PL VHCD WGPAPVSSI Sbjct: 463 LLANKKAIQINKSTSK-MCESCRLGKSSRLPFIASDFIASRPLERVHCDLWGPAPVSSIQ 521 Query: 244 GHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKG 65 G +YY++FID+ SRF W YP++ K++ F F+S ENL TK+ FQ+DG E Sbjct: 522 GFQYYVIFIDNRSRFCWFYPLKHKSDFCSLFMKFQSFVENLLQTKIGTFQSDGGGEFTSN 581 Query: 64 LFKKKLDQNGILLRISCPKTP 2 F + L ++GI ISCP TP Sbjct: 582 RFLQHLQESGIQHYISCPHTP 602 >AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease homolog from Arabidopsis thaliana BAC gb|AF080119 and is a member of the reverse transcriptase family PF|00078 [Arabidopsis thaliana] Length = 1415 Score = 204 bits (518), Expect = 3e-56 Identities = 103/260 (39%), Positives = 149/260 (57%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V++G+G LPI+ G+ T+ + L +VL VP ++ +LLSV+K + C + FD Sbjct: 351 VLVGDGTYLPITHTGSTTIKSSNGKIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDAN 410 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSS--LVATSDIWHNRLGHPSSKIIQQ 422 I +L +++ G N LY + + +S+ AT ++WH+RLGH +SK +Q Sbjct: 411 KVCIIDLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQCAATEEVWHHRLGHANSKALQH 470 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 L N K I + S + +C PCQ+ KS RLPF PL +HCD WGP+PV S G Sbjct: 471 LQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQG 530 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 +YY +F+DD+SR+ W YP+ K+E L F F+ L EN +TK+ FQ+DG E V Sbjct: 531 LKYYAIFVDDYSRYSWFYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGGEFVSNK 590 Query: 61 FKKKLDQNGILLRISCPKTP 2 K L ++GI RISCP TP Sbjct: 591 LKTHLSEHGIHHRISCPYTP 610 >AAK51235.1 polyprotein [Arabidopsis thaliana] Length = 1453 Score = 202 bits (513), Expect = 1e-55 Identities = 98/260 (37%), Positives = 156/260 (60%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V++G+G LPI+ +G+ T+++ + + L +VL P+++ 
+LLSV+K + C + FD Sbjct: 354 VLVGDGAYLPITHVGSTTISSDSGTLPLNEVLVCPDIQKSLLSVSKLCDDYPCGVYFDAN 413 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSS--LVATSDIWHNRLGHPSSKIIQQ 422 I +++ ++++ G N LY + + +S+ A+ +IWH+RLGH +S+I+QQ Sbjct: 414 KVCIIDINTQKVVSKGPRSNGLYVLENQEFVAFYSNRQCAASEEIWHHRLGHSNSRILQQ 473 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 L + K I S +C PCQ+ KS +L F+ L +HCD WGP+PV S G Sbjct: 474 LKSSKEISFNKSRMSPVCEPCQMGKSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVSKQG 533 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 +YY++F+DD+SR+ W YP++ K++ F F++L EN F+TK+ FQ+DG E L Sbjct: 534 FKYYVVFVDDYSRYSWFYPLKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGGEFTSNL 593 Query: 61 FKKKLDQNGILLRISCPKTP 2 KK L GI RISCP TP Sbjct: 594 MKKHLTDCGIQHRISCPYTP 613 >CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 putative protein [Arabidopsis thaliana] Length = 1415 Score = 200 bits (508), Expect = 7e-55 Identities = 107/260 (41%), Positives = 151/260 (58%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V++GN LPI+ IG+A L + + L+DVL P + +LLSV+K T + C I FD Sbjct: 322 VIVGNSDFLPITHIGSAVLTSNQGNLPLRDVLVCPNITKSLLSVSKLTSDYPCVIEFDSD 381 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSSLV-ATSD-IWHNRLGHPSSKIIQQ 422 G +++ +LL G N+LY + + +SS ATSD +WH RLGHP+ ++QQ Sbjct: 382 GVIVKDKLTKQLLTKGTRHNDLYLLENPKFMACYSSRQQATSDEVWHMRLGHPNQDVLQQ 441 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 L +K I++ S SH+LC CQ+ K +LPF L VHCD WGPAPV S G Sbjct: 442 LLRNKAIVI-SKTSHSLCDACQMGKICKLPFASSDFVSSRLLERVHCDLWGPAPVVSSQG 500 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 RYY++FID++SRF W YP+++K++ F F+ + EN K+ +FQ DG E + Sbjct: 501 FRYYVIFIDNYSRFTWFYPLRLKSDFFSVFLTFQKMVENQCQQKIASFQCDGGGEFISNQ 560 Query: 61 FKKKLDQNGILLRISCPKTP 2 F L + GI ISCP TP Sbjct: 561 FVSHLAECGIRQLISCPYTP 580 >AAC35532.1 contains similarity to proteases [Arabidopsis thaliana] Length = 1392 Score = 199 
bits (505), Expect = 2e-54 Identities = 104/260 (40%), Positives = 146/260 (56%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V++GNG LPI+ IG LN + L+DVL P + +LLSV+K T + C TFD Sbjct: 356 VIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSD 415 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSSLVATSD--IWHNRLGHPSSKIIQQ 422 I++ +LL G LY + Q +S+ +SD +WH RLGHP+ +++Q Sbjct: 416 SVVIKDKRTQQLLTQGNKHKGLYVLKDVPFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQH 475 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 L K I+V ++S+ +C CQ+ K RLPF PL +HCD WGPAPV+S G Sbjct: 476 LIKTKAIVVNKTSSN-MCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQG 534 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 +YY++FID++SRF W YP+++K++ F F+ L EN + K+ FQ DG E V Sbjct: 535 FQYYVIFIDNYSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGGEFVSYK 594 Query: 61 FKKKLDQNGILLRISCPKTP 2 F L GI ISCP TP Sbjct: 595 FVAHLASCGIKQLISCPHTP 614 >CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] CAB81170.1 retrotransposon like protein [Arabidopsis thaliana] Length = 1515 Score = 199 bits (505), Expect = 2e-54 Identities = 104/260 (40%), Positives = 146/260 (56%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V++GNG LPI+ IG LN + L+DVL P + +LLSV+K T + C TFD Sbjct: 353 VIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSD 412 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSSLVATSD--IWHNRLGHPSSKIIQQ 422 I++ +LL G LY + Q +S+ +SD +WH RLGHP+ +++Q Sbjct: 413 SVVIKDKRTQQLLTQGNKHKGLYVLKDVPFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQH 472 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 L K I+V ++S+ +C CQ+ K RLPF PL +HCD WGPAPV+S G Sbjct: 473 LIKTKAIVVNKTSSN-MCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQG 531 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 +YY++FID++SRF W YP+++K++ F F+ L EN + K+ FQ DG E V Sbjct: 532 
FQYYVIFIDNYSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGGEFVSYK 591 Query: 61 FKKKLDQNGILLRISCPKTP 2 F L GI ISCP TP Sbjct: 592 FVAHLASCGIKQLISCPHTP 611 >AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thaliana] Length = 1522 Score = 197 bits (502), Expect = 4e-54 Identities = 97/260 (37%), Positives = 149/260 (57%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 +M+ +G LPI+ G+ ++ + + L++VL P++ +LLSV+K T + C + FD Sbjct: 355 IMVADGNFLPITHTGSGSIASSSGKIPLKEVLVCPDIVKSLLSVSKLTSDYPCSVEFDAD 414 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSSLV--ATSDIWHNRLGHPSSKIIQQ 422 I + +LL G+ + LY + LQ +S+ A+S++WH RLGH +++++ Q Sbjct: 415 SVRINDKATKKLLVMGRNRDGLYSLEEPKLQVLYSTRQNSASSEVWHRRLGHANAEVLHQ 474 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 L + K II+ + T+C C L KS RLPF PL +HCD WGP+P SS+ G Sbjct: 475 LASSKSIIIINKVVKTVCEACHLGKSTRLPFMLSTFNASRPLERIHCDLWGPSPTSSVQG 534 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 RYY++FID +SRF W YP+++K++ F F+ L EN K+ FQ DG E + Sbjct: 535 FRYYVVFIDHYSRFTWFYPLKLKSDFFSTFVMFQKLVENQLGHKIKIFQCDGGGEFISSQ 594 Query: 61 FKKKLDQNGILLRISCPKTP 2 F K L +GI +SCP TP Sbjct: 595 FLKHLQDHGIQQNMSCPYTP 614 >ACP30598.1 disease resistance protein [Brassica rapa subsp. 
pekinensis] Length = 2301 Score = 197 bits (500), Expect = 9e-54 Identities = 102/260 (39%), Positives = 151/260 (58%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 VM+GNG+ LPI+ G A++ + + + L DVL P++ LLSV+KFT + C FD Sbjct: 360 VMVGNGEYLPITHTGAASIASSSGNLILNDVLVCPQIAKPLLSVSKFTTDYPCGFDFDAD 419 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSS--LVATSDIWHNRLGHPSSKIIQQ 422 I + ++L G+ LY I FS+ + A+ ++WH RLGHP+ I+Q+ Sbjct: 420 NVCIYDKATKKVLLQGRNTKGLYSIKEPAFHAFFSTRQVAASDEVWHQRLGHPNPHILQR 479 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 L + K + + + S +LC CQ+AKS RLPF PL +HCD WGP+PV S+ Sbjct: 480 LASIKSVFI-NKRSKSLCVSCQMAKSSRLPFSASQFVATRPLERIHCDVWGPSPVVSVQE 538 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 +YY++ ID++SR+ W+YPM+ K++ F F+SL +N FHT + FQ DG E + Sbjct: 539 FKYYVVLIDNYSRYCWMYPMKKKSDFHSIFIAFQSLVQNQFHTTIGTFQCDGGGEFISNQ 598 Query: 61 FKKKLDQNGILLRISCPKTP 2 F L +NGI +SCP TP Sbjct: 599 FLLHLQKNGIQQLLSCPHTP 618 >CAC37623.1 copia-like polyprotein [Arabidopsis thaliana] Length = 1466 Score = 196 bits (497), Expect = 2e-53 Identities = 99/260 (38%), Positives = 153/260 (58%), Gaps = 2/260 (0%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V++G+G LPI+ +G+ T+++ + L +VL P ++ +LLSV+K + C + FD Sbjct: 353 VLVGDGTYLPITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDAN 412 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSS--LVATSDIWHNRLGHPSSKIIQQ 422 I +L ++++ G N LY + ++ +S+ A+ + WH+RLGH +SKI+QQ Sbjct: 413 KVCIIDLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQCAASMETWHHRLGHSNSKILQQ 472 Query: 421 LHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAG 242 L K I V S + +C PCQ+ KS RL F+ PL VHCD WGP+PV S G Sbjct: 473 LLTRKEIQVNKSRTSPVCEPCQMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQG 532 Query: 241 HRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGL 62 +YY +F+DDFSRF W +P+++K++ + F ++ L EN TK+ FQ+DG E Sbjct: 533 
FKYYAVFVDDFSRFSWFFPLRMKSKFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNK 592 Query: 61 FKKKLDQNGILLRISCPKTP 2 K+ ++GI RISCP TP Sbjct: 593 LKEHFREHGIHHRISCPYTP 612 >CAN61322.1 hypothetical protein VITISV_012106 [Vitis vinifera] Length = 1432 Score = 193 bits (490), Expect = 2e-52 Identities = 113/269 (42%), Positives = 152/269 (56%), Gaps = 14/269 (5%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V +GNGK L IS IG+ L++ T SFRL+ V HVP + NL+SVAKF EN+ I F Sbjct: 374 VTIGNGKHLSISNIGSKQLHSHTHSFRLKKVFHVPFISANLISVAKFCSENNALIEFHSN 433 Query: 595 GYEIRNLHNHELLAHGKMINNLY--PISSTMLQFA-----------FSSLVAT-SDIWHN 458 + +++LH +LA GK+ N LY P+ S + ++ FSS V +++WHN Sbjct: 434 AFFVKDLHTKMVLAQGKLENGLYKFPVFSNLKPYSSINNASAFHSQFSSTVENKAELWHN 493 Query: 457 RLGHPSSKIIQQLHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCD 278 RLGH S I+ ++ N ++ S +CS CQLAKS RLP PL LV+ D Sbjct: 494 RLGHASFDIVSKVMNTCNVASGKYKSF-VCSDCQLAKSHRLPTQLSNFHASKPLELVYTD 552 Query: 277 EWGPAPVSSIAGHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAF 98 WGPA + S +G RY+ILF+DD+SR+ W Y +Q K +AL F+ FK EN F TK+ Sbjct: 553 IWGPASIKSTSGARYFILFVDDYSRYTWFYSLQTKDQALPIFKXFKLQMENQFDTKIKCL 612 Query: 97 QTDGASELVKGLFKKKLDQNGILLRISCP 11 Q+D E F L GI R SCP Sbjct: 613 QSDNGGEFRS--FTSFLQAVGIAHRFSCP 639 >OMO62605.1 Integrase, catalytic core [Corchorus capsularis] Length = 734 Score = 190 bits (483), Expect = 2e-52 Identities = 98/262 (37%), Positives = 150/262 (57%), Gaps = 5/262 (1%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 VM+GNG ++ IS G+ L L +VL VP+++ NL+S+++ T +N + F Sbjct: 263 VMVGNGASIDISHSGSIVLKVDDKQIVLDNVLVVPDIKKNLISISQLTTDNPFNVEFSDI 322 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSSL---VATSDIWHNRLGHPSSKIIQ 425 G++IR+ E++A GK +++LY + S+ AF S V T +WH+RLGHP ++Q Sbjct: 323 GFQIRDRRTGEVIATGKRVDDLYVLESSEKAKAFFSTRFRVVTRSVWHSRLGHPQVSVVQ 382 Query: 424 QLHNHK--HIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSS 251 L N K H K S SH +CS CQ+ K+ RL F +P + HCD WGP+PV+S Sbjct: 383 
YLDNKKLIHCSNKQSPSH-ICSSCQMGKACRLSFLSLSDFSTTPFEITHCDLWGPSPVNS 441 Query: 250 IAGHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELV 71 I +Y++FID+ +RF W +P++ K++ CF F F + FQ+DG E Sbjct: 442 IQRFHFYVIFIDECTRFTWFFPLKHKSDFTTCFIKFHKFITIQFERPIKNFQSDGGGEFD 501 Query: 70 KGLFKKKLDQNGILLRISCPKT 5 KG F+ L +GI ++SCP+T Sbjct: 502 KGEFQSYLSHHGIHHQLSCPRT 523 >AAC67200.1 putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1402 Score = 191 bits (485), Expect = 8e-52 Identities = 103/261 (39%), Positives = 144/261 (55%), Gaps = 3/261 (1%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 VM+ +G LPI+ G+ L + + + L DVL P + +LLSV+K T++ C + FD Sbjct: 361 VMVADGNFLPITHTGSTNLASSSGNVPLTDVLVCPSITKSLLSVSKLTQDYPCTVEFDSD 420 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAFSSL---VATSDIWHNRLGHPSSKIIQ 425 G I + +LL G + LY + AF S A+ ++WH RLGHP +++Q Sbjct: 421 GVRINDKATKKLLIMGSTCDGLYCLKDDSQFKAFFSTRQQSASDEVWHRRLGHPHPQVLQ 480 Query: 424 QLHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIA 245 QL I + + S +LC CQL KS RLPF PL VHCD WGP+P++S+ Sbjct: 481 QLVKTNSISINKT-SKSLCEACQLGKSTRLPFVSSSFTSNRPLERVHCDLWGPSPITSVQ 539 Query: 244 GHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKG 65 G RYY +FID +SRF WIYP+++K++ F F L EN + K+ FQ DG E V Sbjct: 540 GFRYYAVFIDHYSRFSWIYPLKLKSDFYNIFVAFHKLVENQLNHKISVFQCDGGGEFVNH 599 Query: 64 LFKKKLDQNGILLRISCPKTP 2 F + L +GI IS P TP Sbjct: 600 KFLQHLQNHGIQQHISYPHTP 620 >CAN79884.1 hypothetical protein VITISV_002539 [Vitis vinifera] Length = 1453 Score = 190 bits (483), Expect = 2e-51 Identities = 106/263 (40%), Positives = 153/263 (58%), Gaps = 5/263 (1%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V++GNG +LPI+ G TL++ +++ +L DVL VP + NLLS++K T + +TF H Sbjct: 308 VIVGNGASLPITHTG--TLSS-SSNLQLLDVLVVPRLTKNLLSISKLTSDFPLSVTFSHD 364 Query: 595 GYEIRNLHNHELLAHGKMINNLYPISSTMLQFAF----SSLVATSDIWHNRLGHPSSKII 428 + ++N +A GK LY + FA +L A+ ++WH RLGH + I+ Sbjct: 365 
NFVVQNRITGMAVAKGKRAGGLYVLERGHSAFASVLRNKNLHASFELWHARLGHVNHSIL 424 Query: 427 QQLHNHKHIIVKSSA-SHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSS 251 L+ + + S + +LCS CQLAKS RLPF L LVHCD WG APV S Sbjct: 425 SLLNKKGQLFLTSLLPTPSLCSTCQLAKSHRLPFSSNTTRSNVVLGLVHCDIWGLAPVKS 484 Query: 250 IAGHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELV 71 G YY+LFIDD+SRF W+YP+++K++ F F+ L EN + TK+ FQ+DG +E Sbjct: 485 NLGFNYYVLFIDDYSRFTWLYPLKLKSDFFDIFLQFQKLVENQYSTKIKIFQSDGGAEFT 544 Query: 70 KGLFKKKLDQNGILLRISCPKTP 2 F+ L Q GI ++SCP TP Sbjct: 545 SNRFQSHLQQFGIHHQMSCPYTP 567 >KYP66503.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1356 Score = 189 bits (481), Expect = 3e-51 Identities = 97/261 (37%), Positives = 143/261 (54%), Gaps = 3/261 (1%) Frame = -3 Query: 778 HVMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDH 599 ++++G+G +PI G L P F L +VLH P++ NL+SV KFT +N I FD Sbjct: 400 NIVVGSGHLIPIIGHGRTNLPPPHPPFLLNNVLHAPKLIKNLISVRKFTTDNWVSILFDP 459 Query: 598 YGYEIRNLHNHELLAHGKMINNLYPI---SSTMLQFAFSSLVATSDIWHNRLGHPSSKII 428 +G+ + +L L + +LYP+ S L + DIWHNRLGHP + II Sbjct: 460 FGFSVHDLQTGTKLMRCNSVGDLYPLFPPSQATLSNPSVFTTMSRDIWHNRLGHPGNAII 519 Query: 427 QQLHNHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSI 248 L ++K I + A + CS C + K +LPFY P ++H D W +P++S Sbjct: 520 NSLRSNKFIEC-NKACQSFCSSCPIGKHVKLPFYDSSSYTVLPFDIIHSDLW-TSPIAST 577 Query: 247 AGHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVK 68 +GHRYYILF+DD+S+F W +P+ K++ F F +L + F + FQ D +E V Sbjct: 578 SGHRYYILFLDDYSKFLWTFPIAKKSQVPHLFLSFHALVKTQFERSIKTFQCDNGTEYVN 637 Query: 67 GLFKKKLDQNGILLRISCPKT 5 G K+ D NG+L R+SCP T Sbjct: 638 GTLKQFFDHNGLLYRLSCPHT 658 >KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1165 Score = 189 bits (479), Expect = 5e-51 Identities = 96/257 (37%), Positives = 144/257 (56%), Gaps = 2/257 (0%) Frame = -3 Query: 769 LGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHYGY 590 +GNG L I G+++L+T S L+D+L+VP++ 
NLLS++K T +N Y+ F Sbjct: 264 VGNGANLKIIACGDSSLDTQQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVAC 323 Query: 589 EIRNLHNHELLAHGKMINNLY--PISSTMLQFAFSSLVATSDIWHNRLGHPSSKIIQQLH 416 +++ +L GK+ + LY P ST + + WH +LGHP+SK++ ++ Sbjct: 324 FVKDKLTGRILLEGKIKDGLYQLPGGSTSTNKRPHVFFSIKETWHRKLGHPNSKVLNEVM 383 Query: 415 NHKHIIVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSSIAGHR 236 +I + C CQ K+ LPF PL LVH D WGPAP+SS++G + Sbjct: 384 KLCNIEASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLDLVHSDVWGPAPISSVSGFK 443 Query: 235 YYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELVKGLFK 56 YY+LF+DD+SRF WIYP++ K++ Q F F++L EN F+ ++ Q DG E Sbjct: 444 YYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNKRIKTLQCDGGGEFKS--LS 501 Query: 55 KKLDQNGILLRISCPKT 5 K L + GI LR SCP T Sbjct: 502 KVLIKTGIQLRESCPYT 518 >KYP72965.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 261 Score = 176 bits (446), Expect = 6e-51 Identities = 94/262 (35%), Positives = 147/262 (56%), Gaps = 4/262 (1%) Frame = -3 Query: 778 HVMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDH 599 ++ +GNG +L I+ +G+ +L+ L DVL VP + NL+SV+K TR+N F Sbjct: 1 NIFVGNGTSLNITHVGSRSLSHTVP---LSDVLVVPNLTKNLVSVSKLTRDNHAKAIFVD 57 Query: 598 YGYEIRNLHNHELLAHGKMINNLYPISS---TMLQFAFSSLVATSDIWHNRLGHPSSKII 428 + I+N +LA G+ LY + +L + S A+ ++WH+RLGH + +I Sbjct: 58 DSFVIQNRKTGRVLARGRCDQGLYVMDQGPQALLTTSSSLPRASFELWHSRLGHVNFDVI 117 Query: 427 QQLHNHKHIIVKSSASHTLC-SPCQLAKSKRLPFYXXXXXXXSPLALVHCDEWGPAPVSS 251 +L+ ++ V S +C + CQ+AKSKRL FY + L L+HCD WGP+PV+S Sbjct: 118 NKLNKQGYLNVSSILPKPICCTTCQMAKSKRLVFYDNNKRASAVLDLIHCDLWGPSPVAS 177 Query: 250 IAGHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENLFHTKLLAFQTDGASELV 71 +AG+ Y+++F+DDFSRF W YP++ K++ FK EN F + FQ+D +E Sbjct: 178 VAGYSYFVIFVDDFSRFTWFYPLRHKSDFYDVLVRFKVFVENQFSRFIKVFQSDNGTEFT 237 Query: 70 KGLFKKKLDQNGILLRISCPKT 5 + +G+L R SCP T Sbjct: 238 NNKVQDLFASSGVLHRFSCPHT 259 >CAN77295.1 hypothetical protein VITISV_005638 [Vitis vinifera] Length = 1198 Score = 188 bits (477), Expect = 9e-51 Identities = 
111/277 (40%), Positives = 148/277 (53%), Gaps = 22/277 (7%) Frame = -3 Query: 775 VMLGNGKTLPISLIGNATLNTPTTSFRLQDVLHVPEMRHNLLSVAKFTRENSCYITFDHY 596 V +GNGK L IS G+ L + + SF L+ V HV + NL+SVAKF +N+ F Sbjct: 269 VTIGNGKHLSISNTGSHRLLSDSRSFHLKKVFHVHFISANLISVAKFYLDNNALFEFRSN 328 Query: 595 GYEIRNLHNHELLAHGKMINNLY--PI-------------SSTMLQFAFSSLVATSDIWH 461 + +++LH ++LA GK+ N LY P+ SST S +WH Sbjct: 329 SFFVKDLHTKKVLAQGKLENGLYRFPVLNSKKVAFVGAINSSTFYSHNSSIFDNKVKLWH 388 Query: 460 NRLGHPSSKIIQQLHNHKHI-------IVKSSASHTLCSPCQLAKSKRLPFYXXXXXXXS 302 +RLGH S+ I+ Q+ ++ V S+ T+CS CQLAKS RLP + Sbjct: 389 HRLGHASTNIVTQIMQSCNVSFEKNKNTVCSTVCSTVCSSCQLAKSHRLPTHLSLSCASK 448 Query: 301 PLALVHCDEWGPAPVSSIAGHRYYILFIDDFSRFHWIYPMQVKTEALQCFQHFKSLTENL 122 PL LVH D WGPA V S +G RY+ILF+DD+SR+ W YP+Q K +AL F+ FK EN Sbjct: 449 PLELVHTDLWGPASVKSTSGARYFILFLDDYSRYTWFYPLQTKDQALPAFKKFKLQVENQ 508 Query: 121 FHTKLLAFQTDGASELVKGLFKKKLDQNGILLRISCP 11 F K+ Q+D E FK L Q GI R SCP Sbjct: 509 FDAKIKCLQSDNGGEFRS--FKTFLQQTGIFHRFSCP 543