BLASTX nr result
ID: Alisma22_contig00024108
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00024108
         (1016 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

AAC61290.1 putative retroelement pol polyprotein [Arabidopsis th...  180   7e-47
ACP30598.1 disease resistance protein [Brassica rapa subsp. peki...  177   5e-46
AAC67200.1 putative retroelement pol polyprotein [Arabidopsis th...  177   7e-46
XP_019085816.1 PREDICTED: uncharacterized protein LOC109126581 [...  169   7e-45
AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thal...  174   8e-45
CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 pu...  171   1e-43
AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease ho...  169   4e-43
AAC35532.1 contains similarity to proteases [Arabidopsis thaliana]   168   7e-43
CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] C...  168   8e-43
OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis tha...  168   8e-43
XP_013690295.1 PREDICTED: uncharacterized protein LOC106394259 [...  166   5e-42
XP_010064888.1 PREDICTED: uncharacterized protein LOC104452050 [...  163   8e-42
AAK51235.1 polyprotein [Arabidopsis thaliana]                        159   8e-40
CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]             158   2e-39
XP_013731927.1 PREDICTED: uncharacterized protein LOC106435564 [...  155   5e-39
KYP49968.1 Retrovirus-related Pol polyprotein from transposon TN...  144   2e-37
OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]             149   3e-36
CAN68489.1 hypothetical protein VITISV_037543 [Vitis vinifera]       145   5e-35
XP_019085488.1 PREDICTED: uncharacterized protein LOC104715244 [...
143 8e-35 XP_013725106.1 PREDICTED: uncharacterized protein LOC106428899 [... 142 1e-34 >AAC61290.1 putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1149 Score = 180 bits (456), Expect = 7e-47 Identities = 108/271 (39%), Positives = 152/271 (56%), Gaps = 5/271 (1%) Frame = +1 Query: 211 ACQQQPM*QIPSTMSDFLKTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEV 390 + ++PM QI K H L+CWH +D+++Q+S E +AA +++H+ +V Sbjct: 262 SASEKPMCQICG------KRGHYALQCWHRFDDSYQHS-------EAAAAAFSALHITDV 308 Query: 391 -DPSDWHIDSGATDHIAQFLGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-V 564 D S W DS AT HI LQ + Y G +++M +G+FLPITH G + + + + Sbjct: 309 SDDSGWVPDSAATAHITNNSSRLQQMQPYLGNDTVMASDGNFLPITHIGSANLPSTSGNL 368 Query: 565 PLR--LVVPEIKRNLLSVSKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNC-SDGMYYL 735 PL+ LV P I ++LLSVSKLT PC F +L+KD +VL KG+ S+G+Y L Sbjct: 369 PLKDVLVCPNIAKSLLSVSKLTKDYPCSFTFDADGVLVKDKATCKVLTKGSSTSEGLYKL 428 Query: 736 PSPMSKVFFSSRFVKATGEVWHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKS 915 +P ++F+S+R VKAT EVWH RLGHP +L L K I +C SC L KS Sbjct: 429 ENPKFQMFYSTRQVKATDEVWHMRLGHPNPQVLQLLANKKAIQINKSTSKMCESCRLGKS 488 Query: 916 KKLPVQHSHTKAKHPFELLHCDVWGKDHVPS 1008 +LP S A P E +HCD+WG V S Sbjct: 489 SRLPFIASDFIASRPLERVHCDLWGPAPVSS 519 >ACP30598.1 disease resistance protein [Brassica rapa subsp. 
pekinensis] Length = 2301 Score = 177 bits (450), Expect = 5e-46 Identities = 99/252 (39%), Positives = 140/252 (55%), Gaps = 4/252 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K+ H ++CWH +DN++Q M AL AA+ + + +W D+GA+ HI Sbjct: 290 KSGHEAMRCWHRFDNSYQLDEMHNAL-----AAMRVSDMIDSRGGEWFPDTGASAHITNT 344 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFV----NNLT*VPLRLVVPEIKRNLLSV 612 LQN Y G++S+M+GNG++LPITHTG + NL + LV P+I + LLSV Sbjct: 345 PHHLQNAQPYMGSDSVMVGNGEYLPITHTGAASIASSSGNLILNDV-LVCPQIAKPLLSV 403 Query: 613 SKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGE 792 SK T+ PC F + I D +VL++G + G+Y + P FFS+R V A+ E Sbjct: 404 SKFTTDYPCGFDFDADNVCIYDKATKKVLLQGRNTKGLYSIKEPAFHAFFSTRQVAASDE 463 Query: 793 VWHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972 VWH+RLGHP IL L K ++ S+C SC+++KS +LP S A P E + Sbjct: 464 VWHQRLGHPNPHILQRLASIKSVFINKRSKSLCVSCQMAKSSRLPFSASQFVATRPLERI 523 Query: 973 HCDVWGKDHVPS 1008 HCDVWG V S Sbjct: 524 HCDVWGPSPVVS 535 >AAC67200.1 putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1402 Score = 177 bits (449), Expect = 7e-46 Identities = 98/252 (38%), Positives = 141/252 (55%), Gaps = 4/252 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K H LKCWH ++N++Q +P+AL AA+ + + ++W DS AT H+ Sbjct: 291 KMGHPALKCWHRFNNSYQYEELPRAL-----AAMRITDITDQHGNEWLPDSAATAHVTNS 345 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDT-FVNNLT*VPLR--LVVPEIKRNLLSVS 615 +LQ Y G++++M+ +G+FLPITHTG T ++ VPL LV P I ++LLSVS Sbjct: 346 PRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLASSSGNVPLTDVLVCPSITKSLLSVS 405 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMS-KVFFSSRFVKATGE 792 KLT PC V F + I D ++L+ G+ DG+Y L K FFS+R A+ E Sbjct: 406 KLTQDYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLYCLKDDSQFKAFFSTRQQSASDE 465 Query: 793 VWHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972 VWHRRLGHP+ +L L+ I S+C +C+L KS +LP S + P E + Sbjct: 466 
VWHRRLGHPHPQVLQQLVKTNSISINKTSKSLCEACQLGKSTRLPFVSSSFTSNRPLERV 525 Query: 973 HCDVWGKDHVPS 1008 HCD+WG + S Sbjct: 526 HCDLWGPSPITS 537 >XP_019085816.1 PREDICTED: uncharacterized protein LOC109126581 [Camelina sativa] Length = 475 Score = 169 bits (427), Expect = 7e-45 Identities = 95/253 (37%), Positives = 142/253 (56%), Gaps = 4/253 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 +T +T +KC++ +DNN+Q+ E S A A + +++ + +WH+DS AT HI Sbjct: 209 RTGYTAIKCYNHFDNNYQS--------EVPSQAFAYLRVSDENGREWHLDSAATAHITTL 260 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVS 615 LQ+ YKGT+++M+G+G +LPITH G T +++ +PL LV P++++NLLSVS Sbjct: 261 TSGLQDATSYKGTDAVMVGDGAYLPITHIGSTTISSAKGTIPLNEVLVCPDMQKNLLSVS 320 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795 KL C V F + I D+ +V+ KG G+Y L + F+S+R AT + Sbjct: 321 KLCDDYSCGVFFDSDFVYIIDLTTQKVVSKGPRKKGLYVLQNQEFVAFYSNRQCAATLDT 380 Query: 796 WHRRLGHPYKTILCHLL*YKLI-YSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972 WH RLGH IL HL K I + S IC C++ KS KL S ++ P E + Sbjct: 381 WHHRLGHSNSRILQHLRACKEIEVNKSRTSPICEPCQMRKSNKLQFFSSDSRDLQPLERV 440 Query: 973 HCDVWGKDHVPSH 1011 HCD+WG V S+ Sbjct: 441 HCDLWGPSPVVSN 453 >AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thaliana] Length = 1522 Score = 174 bits (441), Expect = 8e-45 Identities = 98/246 (39%), Positives = 137/246 (55%), Gaps = 4/246 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K H LKCWH +DN++Q+ +P AL + H +E W DS A+ H+ Sbjct: 285 KAGHHALKCWHRFDNSYQHEDLPMALATMRITDVTDHHGHE-----WIPDSAASAHVTNN 339 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGD-TFVNNLT*VPLR--LVVPEIKRNLLSVS 615 LQ Y G++SIM+ +G+FLPITHTG + ++ +PL+ LV P+I ++LLSVS Sbjct: 340 RHVLQQSQPYHGSDSIMVADGNFLPITHTGSGSIASSSGKIPLKEVLVCPDIVKSLLSVS 399 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795 KLTS PC V F ++ I D ++LV G DG+Y L P +V +S+R A+ EV Sbjct: 400 
KLTSDYPCSVEFDADSVRINDKATKKLLVMGRNRDGLYSLEEPKLQVLYSTRQNSASSEV 459 Query: 796 WHRRLGHPYKTILCHLL*YK-LIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972 WHRRLGH +L L K +I ++C +C L KS +LP S A P E + Sbjct: 460 WHRRLGHANAEVLHQLASSKSIIIINKVVKTVCEACHLGKSTRLPFMLSTFNASRPLERI 519 Query: 973 HCDVWG 990 HCD+WG Sbjct: 520 HCDLWG 525 >CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 putative protein [Arabidopsis thaliana] Length = 1415 Score = 171 bits (432), Expect = 1e-43 Identities = 96/251 (38%), Positives = 139/251 (55%), Gaps = 3/251 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K H+ KCW +D+ FQ+ E+ S A A++ +++ + W DSGAT HI Sbjct: 255 KYGHSAYKCWKRFDHAFQS--------EDFSKAFAAMRVSDQKSNPWVTDSGATSHITNS 306 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFV-NNLT*VPLR--LVVPEIKRNLLSVS 615 LQ+ Y G +S+++GN DFLPITH G + +N +PLR LV P I ++LLSVS Sbjct: 307 TSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTSNQGNLPLRDVLVCPNITKSLLSVS 366 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795 KLTS PC + F +++KD ++L KG + +Y L +P +SSR + EV Sbjct: 367 KLTSDYPCVIEFDSDGVIVKDKLTKQLLTKGTRHNDLYLLENPKFMACYSSRQQATSDEV 426 Query: 796 WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975 WH RLGHP + +L LL K I + S+C++C++ K KLP S + E +H Sbjct: 427 WHMRLGHPNQDVLQQLLRNKAIVISKTSHSLCDACQMGKICKLPFASSDFVSSRLLERVH 486 Query: 976 CDVWGKDHVPS 1008 CD+WG V S Sbjct: 487 CDLWGPAPVVS 497 >AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease homolog from Arabidopsis thaliana BAC gb|AF080119 and is a member of the reverse transcriptase family PF|00078 [Arabidopsis thaliana] Length = 1415 Score = 169 bits (428), Expect = 4e-43 Identities = 91/253 (35%), Positives = 140/253 (55%), Gaps = 4/253 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 +T HT LKC++ +DNN+Q A +++ +++ +WH DS AT H+ Sbjct: 286 RTGHTALKCYNRFDNNYQAEIQ----------AFSTLRVSDDTGKEWHPDSAATAHVTSS 335 Query: 445 
LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVS 615 LQ+ Y+G +++++G+G +LPITHTG T + + +PL LVVP I+++LLSVS Sbjct: 336 TNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNGKIPLNEVLVVPNIQKSLLSVS 395 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795 KL PC V F + I D++ +V+ G +G+Y L + +S+R AT EV Sbjct: 396 KLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQCAATEEV 455 Query: 796 WHRRLGHPYKTILCHLL*YKLI-YSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972 WH RLGH L HL K I + S +C C++ KS +LP S ++ HP + + Sbjct: 456 WHHRLGHANSKALQHLQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLISDSRVLHPLDRI 515 Query: 973 HCDVWGKDHVPSH 1011 HCD+WG V S+ Sbjct: 516 HCDLWGPSPVVSN 528 >AAC35532.1 contains similarity to proteases [Arabidopsis thaliana] Length = 1392 Score = 168 bits (426), Expect = 7e-43 Identities = 93/251 (37%), Positives = 136/251 (54%), Gaps = 3/251 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K H+ KC+ ++ N+ +P A AA+ N+ +W DS AT HI Sbjct: 286 KYGHSAFKCYTRFEENYLPEDLPNAF-----AAMRVSDQNQASSHEWLPDSAATAHITNT 340 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVN-NLT*VPLR--LVVPEIKRNLLSVS 615 LQN Y G +S+++GNGDFLPITH G +N + +PL LV P I ++LLSVS Sbjct: 341 TDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVS 400 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795 KLT PC F +++IKD + ++L +GN G+Y L + ++S+R + EV Sbjct: 401 KLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVPFQTYYSTRQQSSDDEV 460 Query: 796 WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975 WH+RLGHP K +L HL+ K I ++C +C++ K +LP S + P E +H Sbjct: 461 WHQRLGHPNKEVLQHLIKTKAIVVNKTSSNMCEACQMGKVCRLPFVASEFVSSRPLERIH 520 Query: 976 CDVWGKDHVPS 1008 CD+WG V S Sbjct: 521 CDLWGPAPVTS 531 >CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] CAB81170.1 retrotransposon like protein [Arabidopsis thaliana] Length = 1515 Score = 168 bits (426), Expect = 8e-43 Identities = 93/251 (37%), Positives = 136/251 (54%), Gaps = 3/251 (1%) Frame = +1 Query: 265 
KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K H+ KC+ ++ N+ +P A AA+ N+ +W DS AT HI Sbjct: 283 KYGHSAFKCYTRFEENYLPEDLPNAF-----AAMRVSDQNQASSHEWLPDSAATAHITNT 337 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVN-NLT*VPLR--LVVPEIKRNLLSVS 615 LQN Y G +S+++GNGDFLPITH G +N + +PL LV P I ++LLSVS Sbjct: 338 TDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVS 397 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795 KLT PC F +++IKD + ++L +GN G+Y L + ++S+R + EV Sbjct: 398 KLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVPFQTYYSTRQQSSDDEV 457 Query: 796 WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975 WH+RLGHP K +L HL+ K I ++C +C++ K +LP S + P E +H Sbjct: 458 WHQRLGHPNKEVLQHLIKTKAIVVNKTSSNMCEACQMGKVCRLPFVASEFVSSRPLERIH 517 Query: 976 CDVWGKDHVPS 1008 CD+WG V S Sbjct: 518 CDLWGPAPVTS 528 >OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis thaliana] Length = 2099 Score = 168 bits (426), Expect = 8e-43 Identities = 94/251 (37%), Positives = 138/251 (54%), Gaps = 3/251 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K H LKCWH ++N++Q +P ALT A+ + + + + W DSGAT H+ Sbjct: 405 KPGHPALKCWHRFNNSYQYEELPAALT-----AMRITDVTDHNGNKWVGDSGATAHVTNS 459 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*V-PLR--LVVPEIKRNLLSVS 615 LQ Y G++S+M+GNGDFLPITHTG T + + + + L+ LV P I ++L+SVS Sbjct: 460 THNLQQSQPYGGSDSVMVGNGDFLPITHTGSTTLPSSSGILSLKDVLVCPNIGKSLVSVS 519 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795 KLT PC V F + + D ++L +GN +G+Y L F+SSR + +V Sbjct: 520 KLTRDYPCSVDFDCDYVRVTDKATKKLLAQGNNFNGLYVLKDSSVHAFYSSRQQTTSEDV 579 Query: 796 WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975 WH RLGHP + IL L K + + IC +C+ KS +LP S + P + +H Sbjct: 580 WHMRLGHPNQQILQLLHKNKAVNISKSSKGICEACQYGKSSRLPFSSSCSTISKPLQKIH 639 Query: 976 CDVWGKDHVPS 1008 CD+WG + S Sbjct: 640 CDLWGPAPIKS 650 >XP_013690295.1 PREDICTED: uncharacterized protein LOC106394259 [Brassica napus] 
Length = 2800 Score = 166 bits (420), Expect = 5e-42 Identities = 96/246 (39%), Positives = 131/246 (53%), Gaps = 5/246 (2%) Frame = +1 Query: 286 KCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPS--DWHIDSGATDHIAQFLGTLQ 459 KC+ +D NF S P T+ A N+ PS +W+ DSG++ H+ + L Sbjct: 292 KCYKRFDVNFVVSDPPPQANVLTTVAAH----NQSTPSGAEWYPDSGSSHHVTNSVDHLD 347 Query: 460 NLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLTSM 630 Y G + +M+GNG+FLPITH G + + +PL L+ P+I ++LLSVSKLT Sbjct: 348 TAQPYAGLDQVMVGNGEFLPITHVGSASIPTQSGKIPLSDVLICPDITKSLLSVSKLTDD 407 Query: 631 LPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHRRL 810 PC F T+ +KD RVL KGN G+Y L P F+S R A+ VWH+RL Sbjct: 408 FPCEFTFDSTTVCVKDKATCRVLSKGNKIKGLYRLDVPQLLTFYSFRQQVASDGVWHKRL 467 Query: 811 GHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLHCDVWG 990 GHP +L HL K I S+C SC+L K+ +LP S ++ P E +HCDVWG Sbjct: 468 GHPNDQVLKHLSTIKAISFNKTSQSMCESCQLGKTCRLPFSSSDFRSSRPLERIHCDVWG 527 Query: 991 KDHVPS 1008 V S Sbjct: 528 PAPVVS 533 Score = 166 bits (420), Expect = 5e-42 Identities = 96/246 (39%), Positives = 131/246 (53%), Gaps = 5/246 (2%) Frame = +1 Query: 286 KCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPS--DWHIDSGATDHIAQFLGTLQ 459 KC+ +D NF S P T+ A N+ PS +W+ DSG++ H+ + L Sbjct: 1681 KCYKRFDVNFVVSDPPPQANVLTTVAAH----NQSTPSGAEWYPDSGSSHHVTNSVDHLD 1736 Query: 460 NLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLTSM 630 Y G + +M+GNG+FLPITH G + + +PL L+ P+I ++LLSVSKLT Sbjct: 1737 TAQPYAGLDQVMVGNGEFLPITHVGSASIPTQSGKIPLSDVLICPDITKSLLSVSKLTDD 1796 Query: 631 LPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHRRL 810 PC F T+ +KD RVL KGN G+Y L P F+S R A+ VWH+RL Sbjct: 1797 FPCEFTFDSTTVCVKDKATCRVLSKGNKIKGLYRLDVPQLLTFYSFRQQVASDGVWHKRL 1856 Query: 811 GHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLHCDVWG 990 GHP +L HL K I S+C SC+L K+ +LP S ++ P E +HCDVWG Sbjct: 1857 GHPNDQVLKHLSTIKAISFNKTSQSMCESCQLGKTCRLPFSSSDFRSSRPLERIHCDVWG 1916 Query: 991 KDHVPS 1008 V S Sbjct: 1917 PAPVVS 1922 >XP_010064888.1 PREDICTED: uncharacterized protein 
LOC104452050 [Eucalyptus grandis] Length = 616 Score = 163 bits (412), Expect = 8e-42 Identities = 89/251 (35%), Positives = 138/251 (54%), Gaps = 3/251 (1%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K H L CW+ +DN++Q +P LA+IHL + S+W+ D+GAT HI Sbjct: 359 KPGHDALHCWYRFDNSYQAEEIP--------TTLAAIHLKDAKGSEWYPDTGATAHITAN 410 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNN-LT*VPLR--LVVPEIKRNLLSVS 615 L N Y G +++MIG+G L +T TG+T ++ + +PL L+VP+IK+NLL VS Sbjct: 411 SSILHNSSKYTGYDTVMIGDGSHLSVTCTGNTLLHTGKSLLPLNDVLIVPDIKKNLLLVS 470 Query: 616 KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795 KLT C +F + + IKD + L+ G + G+Y + S ++ F + R + Sbjct: 471 KLTDDYHCSFVFDKFGVYIKDNWTNTTLLLGRKTKGLYQMNSKTTQAFLAQRHRAIAEDT 530 Query: 796 WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975 WH+RL H IL +L KLI +S ++C+SC+++K+ LP S + P + +H Sbjct: 531 WHQRLAHTNLNILKYLQNQKLIQCSSRMLNVCSSCQVAKAVALPFPSSESITTMPLQKIH 590 Query: 976 CDVWGKDHVPS 1008 CD+WG V S Sbjct: 591 CDIWGPSPVTS 601 >AAK51235.1 polyprotein [Arabidopsis thaliana] Length = 1453 Score = 159 bits (403), Expect = 8e-40 Identities = 94/273 (34%), Positives = 150/273 (54%), Gaps = 4/273 (1%) Frame = +1 Query: 202 NINACQQQPM*QIPSTMSDFLKTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHL 381 N N ++P+ QI +T HT LKC++ +D+N+Q+ +T+ A +S+ + Sbjct: 272 NSNNTGERPVCQICG------RTGHTALKCYNRFDHNYQSV--------DTAQAFSSLRV 317 Query: 382 NEVDPSDWHIDSGATDHIAQFLGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT* 561 ++ +W DS AT H+ LQ Y G++++++G+G +LPITH G T +++ + Sbjct: 318 SDSSGKEWVPDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSG 377 Query: 562 -VPLR--LVVPEIKRNLLSVSKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYY 732 +PL LV P+I+++LLSVSKL PC V F + I DI +V+ KG S+G+Y Sbjct: 378 TLPLNEVLVCPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYV 437 Query: 733 LPSPMSKVFFSSRFVKATGEVWHRRLGHPYKTILCHLL*YKLI-YSTSHFDSICNSCELS 909 L + F+S+R A+ E+WH RLGH IL L K I ++ S +C C++ Sbjct: 438 LENQEFVAFYSNRQCAASEEIWHHRLGHSNSRILQQLKSSKEISFNKSRMSPVCEPCQMG 
497 Query: 910 KSKKLPVQHSHTKAKHPFELLHCDVWGKDHVPS 1008 KS KL S+++ +HCD+WG V S Sbjct: 498 KSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVS 530 >CAC37623.1 copia-like polyprotein [Arabidopsis thaliana] Length = 1466 Score = 158 bits (400), Expect = 2e-39 Identities = 88/250 (35%), Positives = 139/250 (55%), Gaps = 4/250 (1%) Frame = +1 Query: 274 HTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQFLGT 453 HT +KC++ +DNN+Q+ E + A +++ +++ +W+ DS AT HI Sbjct: 289 HTAIKCYNRFDNNYQS--------EVPTQAFSALRVSDETGKEWYPDSAATAHITASTSG 340 Query: 454 LQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLT 624 LQN Y+G +++++G+G +LPITH G T +++ +PL LV P I+++LLSVSKL Sbjct: 341 LQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLC 400 Query: 625 SMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHR 804 PC V F + I D+ +V+ KG ++G+Y L + +S+R A+ E WH Sbjct: 401 DDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQCAASMETWHH 460 Query: 805 RLGHPYKTILCHLL*YKLI-YSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLHCD 981 RLGH IL LL K I + S +C C++ KS +L S +A P + +HCD Sbjct: 461 RLGHSNSKILQQLLTRKEIQVNKSRTSPVCEPCQMGKSTRLQFFSSDFRALKPLDRVHCD 520 Query: 982 VWGKDHVPSH 1011 +WG V S+ Sbjct: 521 LWGPSPVVSN 530 >XP_013731927.1 PREDICTED: uncharacterized protein LOC106435564 [Brassica napus] Length = 606 Score = 155 bits (392), Expect = 5e-39 Identities = 89/246 (36%), Positives = 135/246 (54%), Gaps = 8/246 (3%) Frame = +1 Query: 274 HTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLN---EVDPSDWHIDSGATDHIAQF 444 H+ KC++ +D ++Q + +N AL ++ L+ ++ +W+ DS A+ HI Sbjct: 350 HSAAKCYNRFDQDYQ-------VLDNLHNALTTMRLSNQEQLSGQEWYPDSAASAHITNK 402 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNL--T*VPLR--LVVPEIKRNLLSV 612 L + Y G + +++GNGDFLPITH G ++ T +PL LV PEI +NLLSV Sbjct: 403 SSQLHSSEPYIGNDQVIVGNGDFLPITHVGFIALHTPQGTRLPLDDVLVCPEITKNLLSV 462 Query: 613 SKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGE 792 SKLT PC F + +KD V+ +G +Y L +VF+S+R + E Sbjct: 463 SKLTKDYPCEFTFDSDHVFVKDKVTKAVITQGRRLKDLYMLKDARFQVFYSNRQQATSDE 522 Query: 793 
VWHRRLGHPYKTILCHLL*YKLIYST-SHFDSICNSCELSKSKKLPVQHSHTKAKHPFEL 969 VWH+RLGHP+K IL HL I S + ++C++C++ KS +LP S T P E Sbjct: 523 VWHQRLGHPHKDILQHLSRKNAIVSNKTSSKTLCDACQVGKSSRLPFLVSETVTNRPLER 582 Query: 970 LHCDVW 987 +HCD+W Sbjct: 583 IHCDLW 588 >KYP49968.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 261 Score = 144 bits (362), Expect = 2e-37 Identities = 87/253 (34%), Positives = 133/253 (52%), Gaps = 7/253 (2%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444 K H CW + Q+ +PQAL T N + + W D GA++H+ Sbjct: 19 KMGHIEKICWWVPKKPTQSDDIPQALAALTLD-------NTIAKTKWTSDIGASNHMIGK 71 Query: 445 LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFV--NNLT*VPLR--LVVPEIKRNLLSV 612 L N+ Y GTNS++IG+G LPI TGD+F+ N+T +PL L+VP + +NLLS+ Sbjct: 72 PSMLNNIQKYSGTNSVLIGDGSSLPILGTGDSFIKQRNVT-LPLHDVLLVPSLTKNLLSI 130 Query: 613 SKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGE 792 S+LT P F+ +K+ + + ++ G +Y L SP ++ +S RF + + Sbjct: 131 SQLTKQFPVNCEFSNVDFCVKERETGKPMITGRRKGDLYVL-SPSPELHYSHRFKSRSAD 189 Query: 793 VWHRRLGHPYKTILCHLL*YK---LIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPF 963 WH+RLGHP +TI LL K + ++ +C+SC+L K KLP S + F Sbjct: 190 TWHQRLGHP-QTIALQLLKNKGLIDVVGKVKYEHLCDSCQLGKLNKLPFSSSKHSSSAIF 248 Query: 964 ELLHCDVWGKDHV 1002 E +HCD+WG H+ Sbjct: 249 EKIHCDLWGPAHI 261 >OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis] Length = 1996 Score = 149 bits (376), Expect = 3e-36 Identities = 89/256 (34%), Positives = 140/256 (54%), Gaps = 11/256 (4%) Frame = +1 Query: 274 HTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQFLGT 453 HT L C++ +++ +Q+ QA+ ++ L+ + W D+ A+ H+ G Sbjct: 264 HTALDCYNRFNHAYQSEKARQAM---------AMKLDGPIDNSWFPDTAASAHMTADPGI 314 Query: 454 LQNLCIYKGTNSIMIGNGDFLPITHTGDTFV---------NNLT*VPLRLVVPEIKRNLL 606 L +L Y G + I+IG+G L I+HTG + NN+ LVVPEIK+NLL Sbjct: 315 LSSLSQYHGCDKILIGDGSLLDISHTGTMDIPVLDGNLQLNNV------LVVPEIKKNLL 368 Query: 607 SVSKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKAT 786 S +LT P F+ ++IKD + +++ 
KG+ DG+Y L + FFS+RF A+ Sbjct: 369 SAGQLTDDYPYTCEFSSAGVVIKDRETGKMIAKGSKQDGVYALGTKEKAAFFSTRFKTAS 428 Query: 787 GEVWHRRLGHPYKTILCHLL*YKLIYSTS--HFDSICNSCELSKSKKLPVQHSHTKAKHP 960 EVWH+RLGHP ++ L KLI STS + C+SC+++K+ +LP S+ P Sbjct: 429 DEVWHQRLGHPQPKVVELLKKNKLITSTSGNKVEHFCDSCQMAKACRLPFILSNEFCDTP 488 Query: 961 FELLHCDVWGKDHVPS 1008 +++HCD+WG V S Sbjct: 489 MDVIHCDLWGAAPVAS 504 >CAN68489.1 hypothetical protein VITISV_037543 [Vitis vinifera] Length = 1449 Score = 145 bits (367), Expect = 5e-35 Identities = 91/268 (33%), Positives = 138/268 (51%), Gaps = 20/268 (7%) Frame = +1 Query: 265 KTWHTTLKCWHMYDNNFQ--NSSMPQALTENTSAALASIHLNEVDPSD-----WHIDSGA 423 K HT ++C+H +D NFQ N +M T N A + PS W D+GA Sbjct: 326 KFGHTVVRCYHRFDINFQGYNPNMDTVQT-NKPNAKNQVQAMMASPSTISDEAWFFDTGA 384 Query: 424 TDHIAQFLGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*V-PLRLV--VPEIK 594 T H++Q + L ++ Y G + +++GNG L I HTG TF + + LR V VP+I Sbjct: 385 THHLSQSIDPLSDVQPYMGNDKVIVGNGKHLRILHTGTTFFPSSSKTFQLRQVLHVPDIA 444 Query: 595 RNLLSVSKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPS---PMSKVFFS 765 NL+SVS+ + + F + +KD ++L++G+ G+Y P+ P F S Sbjct: 445 TNLISVSQFCADNNTFFEFHPRFFFVKDQVTKKILLQGSLEHGLYRFPARFVPSPAAFVS 504 Query: 766 SRFVKA-------TGEVWHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKL 924 S + ++ T +WH RLGHP IL H+L I H +++C +C+ +KS KL Sbjct: 505 SSYDRSSNLSLTTTTTLWHSRLGHPADNILKHILTSCNISHQCHKNNVCCACQFAKSHKL 564 Query: 925 PVQHSHTKAKHPFELLHCDVWGKDHVPS 1008 P ++A HP LLH D+WG +PS Sbjct: 565 PFNVXVSRASHPLALLHADLWGPXSIPS 592 >XP_019085488.1 PREDICTED: uncharacterized protein LOC104715244 [Camelina sativa] Length = 584 Score = 143 bits (361), Expect = 8e-35 Identities = 83/240 (34%), Positives = 126/240 (52%), Gaps = 4/240 (1%) Frame = +1 Query: 301 YDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQFLGTLQNLCIYKG 480 +DN++Q+ PQAL A++ +++ +W DSG++ HI L Y G Sbjct: 289 FDNSYQSEDAPQAL--------AALQVSDTCGQEWVTDSGSSAHITAATTQLSTATPYNG 340 Query: 481 TNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLTSMLPCYVIF 651 + ++M+ +G LPITH G T + T +PL LV P 
++++LLSVSKL PC V F Sbjct: 341 SKTVMVADGAHLPITHVGSTTLTTSTSSLPLLDVLVYPSMQKSLLSVSKLCDDYPCGVFF 400 Query: 652 TEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHRRLGHPYKTI 831 + + D++ +V+ KG +Y L + FS+R A+ +WH+RLGH + Sbjct: 401 DANAVYVIDLQTQKVVTKGPRRKSLYMLENKEFVALFSNRQCDASDMIWHQRLGHANLQV 460 Query: 832 LCHLL*YKLIYSTSHFDS-ICNSCELSKSKKLPVQHSHTKAKHPFELLHCDVWGKDHVPS 1008 L HL K I S S +C C++ KS +LP S AK P + +HCD+WG V S Sbjct: 461 LQHLKNSKAISSNKSSTSLVCGPCQMGKSCQLPFFSSDFSAKEPIDRIHCDLWGPSPVVS 520 >XP_013725106.1 PREDICTED: uncharacterized protein LOC106428899 [Brassica napus] Length = 537 Score = 142 bits (358), Expect = 1e-34 Identities = 81/221 (36%), Positives = 121/221 (54%), Gaps = 4/221 (1%) Frame = +1 Query: 274 HTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQFLGT 453 HT LKC++ +DN +Q + PQA A++ + + +WH DSGAT H+ Sbjct: 294 HTALKCYNRFDNAYQTTQPPQAY--------AALQVADSSGKEWHPDSGATTHVTSSTNN 345 Query: 454 LQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLT 624 L Y GT+++M+ +G +LPI+H G ++N T + L LV PEI+++LLSVSKL Sbjct: 346 LHTAETYNGTDAVMVADGTYLPISHIGSVTLSNTTGNISLNDVLVCPEIQKSLLSVSKLC 405 Query: 625 SMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHR 804 PC V F + + D+ RV+ +G +G+Y L S +S+R A EVWH+ Sbjct: 406 DDYPCGVYFDASKVCVIDLITQRVVSEGPRRNGLYVLKSQELVAMYSNRQCGADAEVWHQ 465 Query: 805 RLGHPYKTILCHLL*YK-LIYSTSHFDSICNSCELSKSKKL 924 RLGH IL HL K + + S + IC C++ KS +L Sbjct: 466 RLGHSNYQILQHLKNNKEITVNKSSINPICEPCQIGKSSRL 506