BLASTX nr result

ID: Alisma22_contig00024108 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00024108
         (1016 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AAC61290.1 putative retroelement pol polyprotein [Arabidopsis th...   180   7e-47
ACP30598.1 disease resistance protein [Brassica rapa subsp. peki...   177   5e-46
AAC67200.1 putative retroelement pol polyprotein [Arabidopsis th...   177   7e-46
XP_019085816.1 PREDICTED: uncharacterized protein LOC109126581 [...   169   7e-45
AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thal...   174   8e-45
CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 pu...   171   1e-43
AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease ho...   169   4e-43
AAC35532.1 contains similarity to proteases [Arabidopsis thaliana]    168   7e-43
CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] C...   168   8e-43
OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis tha...   168   8e-43
XP_013690295.1 PREDICTED: uncharacterized protein LOC106394259 [...   166   5e-42
XP_010064888.1 PREDICTED: uncharacterized protein LOC104452050 [...   163   8e-42
AAK51235.1 polyprotein [Arabidopsis thaliana]                         159   8e-40
CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]              158   2e-39
XP_013731927.1 PREDICTED: uncharacterized protein LOC106435564 [...   155   5e-39
KYP49968.1 Retrovirus-related Pol polyprotein from transposon TN...   144   2e-37
OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]              149   3e-36
CAN68489.1 hypothetical protein VITISV_037543 [Vitis vinifera]        145   5e-35
XP_019085488.1 PREDICTED: uncharacterized protein LOC104715244 [...   143   8e-35
XP_013725106.1 PREDICTED: uncharacterized protein LOC106428899 [...   142   1e-34

>AAC61290.1 putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1149

 Score =  180 bits (456), Expect = 7e-47
 Identities = 108/271 (39%), Positives = 152/271 (56%), Gaps = 5/271 (1%)
 Frame = +1

Query: 211  ACQQQPM*QIPSTMSDFLKTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEV 390
            +  ++PM QI        K  H  L+CWH +D+++Q+S       E  +AA +++H+ +V
Sbjct: 262  SASEKPMCQICG------KRGHYALQCWHRFDDSYQHS-------EAAAAAFSALHITDV 308

Query: 391  -DPSDWHIDSGATDHIAQFLGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-V 564
             D S W  DS AT HI      LQ +  Y G +++M  +G+FLPITH G   + + +  +
Sbjct: 309  SDDSGWVPDSAATAHITNNSSRLQQMQPYLGNDTVMASDGNFLPITHIGSANLPSTSGNL 368

Query: 565  PLR--LVVPEIKRNLLSVSKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNC-SDGMYYL 735
            PL+  LV P I ++LLSVSKLT   PC   F    +L+KD    +VL KG+  S+G+Y L
Sbjct: 369  PLKDVLVCPNIAKSLLSVSKLTKDYPCSFTFDADGVLVKDKATCKVLTKGSSTSEGLYKL 428

Query: 736  PSPMSKVFFSSRFVKATGEVWHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKS 915
             +P  ++F+S+R VKAT EVWH RLGHP   +L  L   K I        +C SC L KS
Sbjct: 429  ENPKFQMFYSTRQVKATDEVWHMRLGHPNPQVLQLLANKKAIQINKSTSKMCESCRLGKS 488

Query: 916  KKLPVQHSHTKAKHPFELLHCDVWGKDHVPS 1008
             +LP   S   A  P E +HCD+WG   V S
Sbjct: 489  SRLPFIASDFIASRPLERVHCDLWGPAPVSS 519


>ACP30598.1 disease resistance protein [Brassica rapa subsp. pekinensis]
          Length = 2301

 Score =  177 bits (450), Expect = 5e-46
 Identities = 99/252 (39%), Positives = 140/252 (55%), Gaps = 4/252 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K+ H  ++CWH +DN++Q   M  AL     AA+    + +    +W  D+GA+ HI   
Sbjct: 290  KSGHEAMRCWHRFDNSYQLDEMHNAL-----AAMRVSDMIDSRGGEWFPDTGASAHITNT 344

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFV----NNLT*VPLRLVVPEIKRNLLSV 612
               LQN   Y G++S+M+GNG++LPITHTG   +     NL    + LV P+I + LLSV
Sbjct: 345  PHHLQNAQPYMGSDSVMVGNGEYLPITHTGAASIASSSGNLILNDV-LVCPQIAKPLLSV 403

Query: 613  SKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGE 792
            SK T+  PC   F    + I D    +VL++G  + G+Y +  P    FFS+R V A+ E
Sbjct: 404  SKFTTDYPCGFDFDADNVCIYDKATKKVLLQGRNTKGLYSIKEPAFHAFFSTRQVAASDE 463

Query: 793  VWHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972
            VWH+RLGHP   IL  L   K ++      S+C SC+++KS +LP   S   A  P E +
Sbjct: 464  VWHQRLGHPNPHILQRLASIKSVFINKRSKSLCVSCQMAKSSRLPFSASQFVATRPLERI 523

Query: 973  HCDVWGKDHVPS 1008
            HCDVWG   V S
Sbjct: 524  HCDVWGPSPVVS 535


>AAC67200.1 putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1402

 Score =  177 bits (449), Expect = 7e-46
 Identities = 98/252 (38%), Positives = 141/252 (55%), Gaps = 4/252 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K  H  LKCWH ++N++Q   +P+AL     AA+    + +   ++W  DS AT H+   
Sbjct: 291  KMGHPALKCWHRFNNSYQYEELPRAL-----AAMRITDITDQHGNEWLPDSAATAHVTNS 345

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDT-FVNNLT*VPLR--LVVPEIKRNLLSVS 615
              +LQ    Y G++++M+ +G+FLPITHTG T   ++   VPL   LV P I ++LLSVS
Sbjct: 346  PRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLASSSGNVPLTDVLVCPSITKSLLSVS 405

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMS-KVFFSSRFVKATGE 792
            KLT   PC V F    + I D    ++L+ G+  DG+Y L      K FFS+R   A+ E
Sbjct: 406  KLTQDYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLYCLKDDSQFKAFFSTRQQSASDE 465

Query: 793  VWHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972
            VWHRRLGHP+  +L  L+    I       S+C +C+L KS +LP   S   +  P E +
Sbjct: 466  VWHRRLGHPHPQVLQQLVKTNSISINKTSKSLCEACQLGKSTRLPFVSSSFTSNRPLERV 525

Query: 973  HCDVWGKDHVPS 1008
            HCD+WG   + S
Sbjct: 526  HCDLWGPSPITS 537


>XP_019085816.1 PREDICTED: uncharacterized protein LOC109126581 [Camelina sativa]
          Length = 475

 Score =  169 bits (427), Expect = 7e-45
 Identities = 95/253 (37%), Positives = 142/253 (56%), Gaps = 4/253 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            +T +T +KC++ +DNN+Q+        E  S A A + +++ +  +WH+DS AT HI   
Sbjct: 209  RTGYTAIKCYNHFDNNYQS--------EVPSQAFAYLRVSDENGREWHLDSAATAHITTL 260

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVS 615
               LQ+   YKGT+++M+G+G +LPITH G T +++    +PL   LV P++++NLLSVS
Sbjct: 261  TSGLQDATSYKGTDAVMVGDGAYLPITHIGSTTISSAKGTIPLNEVLVCPDMQKNLLSVS 320

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795
            KL     C V F    + I D+   +V+ KG    G+Y L +     F+S+R   AT + 
Sbjct: 321  KLCDDYSCGVFFDSDFVYIIDLTTQKVVSKGPRKKGLYVLQNQEFVAFYSNRQCAATLDT 380

Query: 796  WHRRLGHPYKTILCHLL*YKLI-YSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972
            WH RLGH    IL HL   K I  + S    IC  C++ KS KL    S ++   P E +
Sbjct: 381  WHHRLGHSNSRILQHLRACKEIEVNKSRTSPICEPCQMRKSNKLQFFSSDSRDLQPLERV 440

Query: 973  HCDVWGKDHVPSH 1011
            HCD+WG   V S+
Sbjct: 441  HCDLWGPSPVVSN 453


>AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thaliana]
          Length = 1522

 Score =  174 bits (441), Expect = 8e-45
 Identities = 98/246 (39%), Positives = 137/246 (55%), Gaps = 4/246 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K  H  LKCWH +DN++Q+  +P AL       +   H +E     W  DS A+ H+   
Sbjct: 285  KAGHHALKCWHRFDNSYQHEDLPMALATMRITDVTDHHGHE-----WIPDSAASAHVTNN 339

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGD-TFVNNLT*VPLR--LVVPEIKRNLLSVS 615
               LQ    Y G++SIM+ +G+FLPITHTG  +  ++   +PL+  LV P+I ++LLSVS
Sbjct: 340  RHVLQQSQPYHGSDSIMVADGNFLPITHTGSGSIASSSGKIPLKEVLVCPDIVKSLLSVS 399

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795
            KLTS  PC V F   ++ I D    ++LV G   DG+Y L  P  +V +S+R   A+ EV
Sbjct: 400  KLTSDYPCSVEFDADSVRINDKATKKLLVMGRNRDGLYSLEEPKLQVLYSTRQNSASSEV 459

Query: 796  WHRRLGHPYKTILCHLL*YK-LIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972
            WHRRLGH    +L  L   K +I       ++C +C L KS +LP   S   A  P E +
Sbjct: 460  WHRRLGHANAEVLHQLASSKSIIIINKVVKTVCEACHLGKSTRLPFMLSTFNASRPLERI 519

Query: 973  HCDVWG 990
            HCD+WG
Sbjct: 520  HCDLWG 525


>CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 putative protein
            [Arabidopsis thaliana]
          Length = 1415

 Score =  171 bits (432), Expect = 1e-43
 Identities = 96/251 (38%), Positives = 139/251 (55%), Gaps = 3/251 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K  H+  KCW  +D+ FQ+        E+ S A A++ +++   + W  DSGAT HI   
Sbjct: 255  KYGHSAYKCWKRFDHAFQS--------EDFSKAFAAMRVSDQKSNPWVTDSGATSHITNS 306

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFV-NNLT*VPLR--LVVPEIKRNLLSVS 615
               LQ+   Y G +S+++GN DFLPITH G   + +N   +PLR  LV P I ++LLSVS
Sbjct: 307  TSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTSNQGNLPLRDVLVCPNITKSLLSVS 366

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795
            KLTS  PC + F    +++KD    ++L KG   + +Y L +P     +SSR    + EV
Sbjct: 367  KLTSDYPCVIEFDSDGVIVKDKLTKQLLTKGTRHNDLYLLENPKFMACYSSRQQATSDEV 426

Query: 796  WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975
            WH RLGHP + +L  LL  K I  +    S+C++C++ K  KLP   S   +    E +H
Sbjct: 427  WHMRLGHPNQDVLQQLLRNKAIVISKTSHSLCDACQMGKICKLPFASSDFVSSRLLERVH 486

Query: 976  CDVWGKDHVPS 1008
            CD+WG   V S
Sbjct: 487  CDLWGPAPVVS 497


>AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078 [Arabidopsis
            thaliana]
          Length = 1415

 Score =  169 bits (428), Expect = 4e-43
 Identities = 91/253 (35%), Positives = 140/253 (55%), Gaps = 4/253 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            +T HT LKC++ +DNN+Q              A +++ +++    +WH DS AT H+   
Sbjct: 286  RTGHTALKCYNRFDNNYQAEIQ----------AFSTLRVSDDTGKEWHPDSAATAHVTSS 335

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVS 615
               LQ+   Y+G +++++G+G +LPITHTG T + +    +PL   LVVP I+++LLSVS
Sbjct: 336  TNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNGKIPLNEVLVVPNIQKSLLSVS 395

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795
            KL    PC V F    + I D++  +V+  G   +G+Y L +      +S+R   AT EV
Sbjct: 396  KLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQCAATEEV 455

Query: 796  WHRRLGHPYKTILCHLL*YKLI-YSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELL 972
            WH RLGH     L HL   K I  + S    +C  C++ KS +LP   S ++  HP + +
Sbjct: 456  WHHRLGHANSKALQHLQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLISDSRVLHPLDRI 515

Query: 973  HCDVWGKDHVPSH 1011
            HCD+WG   V S+
Sbjct: 516  HCDLWGPSPVVSN 528


>AAC35532.1 contains similarity to proteases [Arabidopsis thaliana]
          Length = 1392

 Score =  168 bits (426), Expect = 7e-43
 Identities = 93/251 (37%), Positives = 136/251 (54%), Gaps = 3/251 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K  H+  KC+  ++ N+    +P A      AA+     N+    +W  DS AT HI   
Sbjct: 286  KYGHSAFKCYTRFEENYLPEDLPNAF-----AAMRVSDQNQASSHEWLPDSAATAHITNT 340

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVN-NLT*VPLR--LVVPEIKRNLLSVS 615
               LQN   Y G +S+++GNGDFLPITH G   +N +   +PL   LV P I ++LLSVS
Sbjct: 341  TDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVS 400

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795
            KLT   PC   F   +++IKD +  ++L +GN   G+Y L     + ++S+R   +  EV
Sbjct: 401  KLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVPFQTYYSTRQQSSDDEV 460

Query: 796  WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975
            WH+RLGHP K +L HL+  K I       ++C +C++ K  +LP   S   +  P E +H
Sbjct: 461  WHQRLGHPNKEVLQHLIKTKAIVVNKTSSNMCEACQMGKVCRLPFVASEFVSSRPLERIH 520

Query: 976  CDVWGKDHVPS 1008
            CD+WG   V S
Sbjct: 521  CDLWGPAPVTS 531


>CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] CAB81170.1
            retrotransposon like protein [Arabidopsis thaliana]
          Length = 1515

 Score =  168 bits (426), Expect = 8e-43
 Identities = 93/251 (37%), Positives = 136/251 (54%), Gaps = 3/251 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K  H+  KC+  ++ N+    +P A      AA+     N+    +W  DS AT HI   
Sbjct: 283  KYGHSAFKCYTRFEENYLPEDLPNAF-----AAMRVSDQNQASSHEWLPDSAATAHITNT 337

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVN-NLT*VPLR--LVVPEIKRNLLSVS 615
               LQN   Y G +S+++GNGDFLPITH G   +N +   +PL   LV P I ++LLSVS
Sbjct: 338  TDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVS 397

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795
            KLT   PC   F   +++IKD +  ++L +GN   G+Y L     + ++S+R   +  EV
Sbjct: 398  KLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVPFQTYYSTRQQSSDDEV 457

Query: 796  WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975
            WH+RLGHP K +L HL+  K I       ++C +C++ K  +LP   S   +  P E +H
Sbjct: 458  WHQRLGHPNKEVLQHLIKTKAIVVNKTSSNMCEACQMGKVCRLPFVASEFVSSRPLERIH 517

Query: 976  CDVWGKDHVPS 1008
            CD+WG   V S
Sbjct: 518  CDLWGPAPVTS 528


>OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis thaliana]
          Length = 2099

 Score =  168 bits (426), Expect = 8e-43
 Identities = 94/251 (37%), Positives = 138/251 (54%), Gaps = 3/251 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K  H  LKCWH ++N++Q   +P ALT     A+    + + + + W  DSGAT H+   
Sbjct: 405  KPGHPALKCWHRFNNSYQYEELPAALT-----AMRITDVTDHNGNKWVGDSGATAHVTNS 459

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*V-PLR--LVVPEIKRNLLSVS 615
               LQ    Y G++S+M+GNGDFLPITHTG T + + + +  L+  LV P I ++L+SVS
Sbjct: 460  THNLQQSQPYGGSDSVMVGNGDFLPITHTGSTTLPSSSGILSLKDVLVCPNIGKSLVSVS 519

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795
            KLT   PC V F    + + D    ++L +GN  +G+Y L       F+SSR    + +V
Sbjct: 520  KLTRDYPCSVDFDCDYVRVTDKATKKLLAQGNNFNGLYVLKDSSVHAFYSSRQQTTSEDV 579

Query: 796  WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975
            WH RLGHP + IL  L   K +  +     IC +C+  KS +LP   S +    P + +H
Sbjct: 580  WHMRLGHPNQQILQLLHKNKAVNISKSSKGICEACQYGKSSRLPFSSSCSTISKPLQKIH 639

Query: 976  CDVWGKDHVPS 1008
            CD+WG   + S
Sbjct: 640  CDLWGPAPIKS 650


>XP_013690295.1 PREDICTED: uncharacterized protein LOC106394259 [Brassica napus]
          Length = 2800

 Score =  166 bits (420), Expect = 5e-42
 Identities = 96/246 (39%), Positives = 131/246 (53%), Gaps = 5/246 (2%)
 Frame = +1

Query: 286  KCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPS--DWHIDSGATDHIAQFLGTLQ 459
            KC+  +D NF  S  P      T+ A      N+  PS  +W+ DSG++ H+   +  L 
Sbjct: 292  KCYKRFDVNFVVSDPPPQANVLTTVAAH----NQSTPSGAEWYPDSGSSHHVTNSVDHLD 347

Query: 460  NLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLTSM 630
                Y G + +M+GNG+FLPITH G   +   +  +PL   L+ P+I ++LLSVSKLT  
Sbjct: 348  TAQPYAGLDQVMVGNGEFLPITHVGSASIPTQSGKIPLSDVLICPDITKSLLSVSKLTDD 407

Query: 631  LPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHRRL 810
             PC   F   T+ +KD    RVL KGN   G+Y L  P    F+S R   A+  VWH+RL
Sbjct: 408  FPCEFTFDSTTVCVKDKATCRVLSKGNKIKGLYRLDVPQLLTFYSFRQQVASDGVWHKRL 467

Query: 811  GHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLHCDVWG 990
            GHP   +L HL   K I       S+C SC+L K+ +LP   S  ++  P E +HCDVWG
Sbjct: 468  GHPNDQVLKHLSTIKAISFNKTSQSMCESCQLGKTCRLPFSSSDFRSSRPLERIHCDVWG 527

Query: 991  KDHVPS 1008
               V S
Sbjct: 528  PAPVVS 533



 Score =  166 bits (420), Expect = 5e-42
 Identities = 96/246 (39%), Positives = 131/246 (53%), Gaps = 5/246 (2%)
 Frame = +1

Query: 286  KCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPS--DWHIDSGATDHIAQFLGTLQ 459
            KC+  +D NF  S  P      T+ A      N+  PS  +W+ DSG++ H+   +  L 
Sbjct: 1681 KCYKRFDVNFVVSDPPPQANVLTTVAAH----NQSTPSGAEWYPDSGSSHHVTNSVDHLD 1736

Query: 460  NLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLTSM 630
                Y G + +M+GNG+FLPITH G   +   +  +PL   L+ P+I ++LLSVSKLT  
Sbjct: 1737 TAQPYAGLDQVMVGNGEFLPITHVGSASIPTQSGKIPLSDVLICPDITKSLLSVSKLTDD 1796

Query: 631  LPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHRRL 810
             PC   F   T+ +KD    RVL KGN   G+Y L  P    F+S R   A+  VWH+RL
Sbjct: 1797 FPCEFTFDSTTVCVKDKATCRVLSKGNKIKGLYRLDVPQLLTFYSFRQQVASDGVWHKRL 1856

Query: 811  GHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLHCDVWG 990
            GHP   +L HL   K I       S+C SC+L K+ +LP   S  ++  P E +HCDVWG
Sbjct: 1857 GHPNDQVLKHLSTIKAISFNKTSQSMCESCQLGKTCRLPFSSSDFRSSRPLERIHCDVWG 1916

Query: 991  KDHVPS 1008
               V S
Sbjct: 1917 PAPVVS 1922


>XP_010064888.1 PREDICTED: uncharacterized protein LOC104452050 [Eucalyptus grandis]
          Length = 616

 Score =  163 bits (412), Expect = 8e-42
 Identities = 89/251 (35%), Positives = 138/251 (54%), Gaps = 3/251 (1%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K  H  L CW+ +DN++Q   +P          LA+IHL +   S+W+ D+GAT HI   
Sbjct: 359  KPGHDALHCWYRFDNSYQAEEIP--------TTLAAIHLKDAKGSEWYPDTGATAHITAN 410

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNN-LT*VPLR--LVVPEIKRNLLSVS 615
               L N   Y G +++MIG+G  L +T TG+T ++   + +PL   L+VP+IK+NLL VS
Sbjct: 411  SSILHNSSKYTGYDTVMIGDGSHLSVTCTGNTLLHTGKSLLPLNDVLIVPDIKKNLLLVS 470

Query: 616  KLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEV 795
            KLT    C  +F +  + IKD   +  L+ G  + G+Y + S  ++ F + R      + 
Sbjct: 471  KLTDDYHCSFVFDKFGVYIKDNWTNTTLLLGRKTKGLYQMNSKTTQAFLAQRHRAIAEDT 530

Query: 796  WHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLH 975
            WH+RL H    IL +L   KLI  +S   ++C+SC+++K+  LP   S +    P + +H
Sbjct: 531  WHQRLAHTNLNILKYLQNQKLIQCSSRMLNVCSSCQVAKAVALPFPSSESITTMPLQKIH 590

Query: 976  CDVWGKDHVPS 1008
            CD+WG   V S
Sbjct: 591  CDIWGPSPVTS 601


>AAK51235.1 polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  159 bits (403), Expect = 8e-40
 Identities = 94/273 (34%), Positives = 150/273 (54%), Gaps = 4/273 (1%)
 Frame = +1

Query: 202  NINACQQQPM*QIPSTMSDFLKTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHL 381
            N N   ++P+ QI        +T HT LKC++ +D+N+Q+         +T+ A +S+ +
Sbjct: 272  NSNNTGERPVCQICG------RTGHTALKCYNRFDHNYQSV--------DTAQAFSSLRV 317

Query: 382  NEVDPSDWHIDSGATDHIAQFLGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT* 561
            ++    +W  DS AT H+      LQ    Y G++++++G+G +LPITH G T +++ + 
Sbjct: 318  SDSSGKEWVPDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSG 377

Query: 562  -VPLR--LVVPEIKRNLLSVSKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYY 732
             +PL   LV P+I+++LLSVSKL    PC V F    + I DI   +V+ KG  S+G+Y 
Sbjct: 378  TLPLNEVLVCPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYV 437

Query: 733  LPSPMSKVFFSSRFVKATGEVWHRRLGHPYKTILCHLL*YKLI-YSTSHFDSICNSCELS 909
            L +     F+S+R   A+ E+WH RLGH    IL  L   K I ++ S    +C  C++ 
Sbjct: 438  LENQEFVAFYSNRQCAASEEIWHHRLGHSNSRILQQLKSSKEISFNKSRMSPVCEPCQMG 497

Query: 910  KSKKLPVQHSHTKAKHPFELLHCDVWGKDHVPS 1008
            KS KL    S+++       +HCD+WG   V S
Sbjct: 498  KSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVS 530


>CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  158 bits (400), Expect = 2e-39
 Identities = 88/250 (35%), Positives = 139/250 (55%), Gaps = 4/250 (1%)
 Frame = +1

Query: 274  HTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQFLGT 453
            HT +KC++ +DNN+Q+        E  + A +++ +++    +W+ DS AT HI      
Sbjct: 289  HTAIKCYNRFDNNYQS--------EVPTQAFSALRVSDETGKEWYPDSAATAHITASTSG 340

Query: 454  LQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLT 624
            LQN   Y+G +++++G+G +LPITH G T +++    +PL   LV P I+++LLSVSKL 
Sbjct: 341  LQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLC 400

Query: 625  SMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHR 804
               PC V F    + I D+   +V+ KG  ++G+Y L +      +S+R   A+ E WH 
Sbjct: 401  DDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQCAASMETWHH 460

Query: 805  RLGHPYKTILCHLL*YKLI-YSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPFELLHCD 981
            RLGH    IL  LL  K I  + S    +C  C++ KS +L    S  +A  P + +HCD
Sbjct: 461  RLGHSNSKILQQLLTRKEIQVNKSRTSPVCEPCQMGKSTRLQFFSSDFRALKPLDRVHCD 520

Query: 982  VWGKDHVPSH 1011
            +WG   V S+
Sbjct: 521  LWGPSPVVSN 530


>XP_013731927.1 PREDICTED: uncharacterized protein LOC106435564 [Brassica napus]
          Length = 606

 Score =  155 bits (392), Expect = 5e-39
 Identities = 89/246 (36%), Positives = 135/246 (54%), Gaps = 8/246 (3%)
 Frame = +1

Query: 274  HTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLN---EVDPSDWHIDSGATDHIAQF 444
            H+  KC++ +D ++Q       + +N   AL ++ L+   ++   +W+ DS A+ HI   
Sbjct: 350  HSAAKCYNRFDQDYQ-------VLDNLHNALTTMRLSNQEQLSGQEWYPDSAASAHITNK 402

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNL--T*VPLR--LVVPEIKRNLLSV 612
               L +   Y G + +++GNGDFLPITH G   ++    T +PL   LV PEI +NLLSV
Sbjct: 403  SSQLHSSEPYIGNDQVIVGNGDFLPITHVGFIALHTPQGTRLPLDDVLVCPEITKNLLSV 462

Query: 613  SKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGE 792
            SKLT   PC   F    + +KD     V+ +G     +Y L     +VF+S+R    + E
Sbjct: 463  SKLTKDYPCEFTFDSDHVFVKDKVTKAVITQGRRLKDLYMLKDARFQVFYSNRQQATSDE 522

Query: 793  VWHRRLGHPYKTILCHLL*YKLIYST-SHFDSICNSCELSKSKKLPVQHSHTKAKHPFEL 969
            VWH+RLGHP+K IL HL     I S  +   ++C++C++ KS +LP   S T    P E 
Sbjct: 523  VWHQRLGHPHKDILQHLSRKNAIVSNKTSSKTLCDACQVGKSSRLPFLVSETVTNRPLER 582

Query: 970  LHCDVW 987
            +HCD+W
Sbjct: 583  IHCDLW 588


>KYP49968.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 261

 Score =  144 bits (362), Expect = 2e-37
 Identities = 87/253 (34%), Positives = 133/253 (52%), Gaps = 7/253 (2%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQF 444
            K  H    CW +     Q+  +PQAL   T         N +  + W  D GA++H+   
Sbjct: 19   KMGHIEKICWWVPKKPTQSDDIPQALAALTLD-------NTIAKTKWTSDIGASNHMIGK 71

Query: 445  LGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFV--NNLT*VPLR--LVVPEIKRNLLSV 612
               L N+  Y GTNS++IG+G  LPI  TGD+F+   N+T +PL   L+VP + +NLLS+
Sbjct: 72   PSMLNNIQKYSGTNSVLIGDGSSLPILGTGDSFIKQRNVT-LPLHDVLLVPSLTKNLLSI 130

Query: 613  SKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGE 792
            S+LT   P    F+     +K+ +  + ++ G     +Y L SP  ++ +S RF   + +
Sbjct: 131  SQLTKQFPVNCEFSNVDFCVKERETGKPMITGRRKGDLYVL-SPSPELHYSHRFKSRSAD 189

Query: 793  VWHRRLGHPYKTILCHLL*YK---LIYSTSHFDSICNSCELSKSKKLPVQHSHTKAKHPF 963
             WH+RLGHP +TI   LL  K    +     ++ +C+SC+L K  KLP   S   +   F
Sbjct: 190  TWHQRLGHP-QTIALQLLKNKGLIDVVGKVKYEHLCDSCQLGKLNKLPFSSSKHSSSAIF 248

Query: 964  ELLHCDVWGKDHV 1002
            E +HCD+WG  H+
Sbjct: 249  EKIHCDLWGPAHI 261


>OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]
          Length = 1996

 Score =  149 bits (376), Expect = 3e-36
 Identities = 89/256 (34%), Positives = 140/256 (54%), Gaps = 11/256 (4%)
 Frame = +1

Query: 274  HTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQFLGT 453
            HT L C++ +++ +Q+    QA+         ++ L+    + W  D+ A+ H+    G 
Sbjct: 264  HTALDCYNRFNHAYQSEKARQAM---------AMKLDGPIDNSWFPDTAASAHMTADPGI 314

Query: 454  LQNLCIYKGTNSIMIGNGDFLPITHTGDTFV---------NNLT*VPLRLVVPEIKRNLL 606
            L +L  Y G + I+IG+G  L I+HTG   +         NN+      LVVPEIK+NLL
Sbjct: 315  LSSLSQYHGCDKILIGDGSLLDISHTGTMDIPVLDGNLQLNNV------LVVPEIKKNLL 368

Query: 607  SVSKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKAT 786
            S  +LT   P    F+   ++IKD +  +++ KG+  DG+Y L +     FFS+RF  A+
Sbjct: 369  SAGQLTDDYPYTCEFSSAGVVIKDRETGKMIAKGSKQDGVYALGTKEKAAFFSTRFKTAS 428

Query: 787  GEVWHRRLGHPYKTILCHLL*YKLIYSTS--HFDSICNSCELSKSKKLPVQHSHTKAKHP 960
             EVWH+RLGHP   ++  L   KLI STS    +  C+SC+++K+ +LP   S+     P
Sbjct: 429  DEVWHQRLGHPQPKVVELLKKNKLITSTSGNKVEHFCDSCQMAKACRLPFILSNEFCDTP 488

Query: 961  FELLHCDVWGKDHVPS 1008
             +++HCD+WG   V S
Sbjct: 489  MDVIHCDLWGAAPVAS 504


>CAN68489.1 hypothetical protein VITISV_037543 [Vitis vinifera]
          Length = 1449

 Score =  145 bits (367), Expect = 5e-35
 Identities = 91/268 (33%), Positives = 138/268 (51%), Gaps = 20/268 (7%)
 Frame = +1

Query: 265  KTWHTTLKCWHMYDNNFQ--NSSMPQALTENTSAALASIHLNEVDPSD-----WHIDSGA 423
            K  HT ++C+H +D NFQ  N +M    T N   A   +      PS      W  D+GA
Sbjct: 326  KFGHTVVRCYHRFDINFQGYNPNMDTVQT-NKPNAKNQVQAMMASPSTISDEAWFFDTGA 384

Query: 424  TDHIAQFLGTLQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*V-PLRLV--VPEIK 594
            T H++Q +  L ++  Y G + +++GNG  L I HTG TF  + +    LR V  VP+I 
Sbjct: 385  THHLSQSIDPLSDVQPYMGNDKVIVGNGKHLRILHTGTTFFPSSSKTFQLRQVLHVPDIA 444

Query: 595  RNLLSVSKLTSMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPS---PMSKVFFS 765
             NL+SVS+  +    +  F  +   +KD    ++L++G+   G+Y  P+   P    F S
Sbjct: 445  TNLISVSQFCADNNTFFEFHPRFFFVKDQVTKKILLQGSLEHGLYRFPARFVPSPAAFVS 504

Query: 766  SRFVKA-------TGEVWHRRLGHPYKTILCHLL*YKLIYSTSHFDSICNSCELSKSKKL 924
            S + ++       T  +WH RLGHP   IL H+L    I    H +++C +C+ +KS KL
Sbjct: 505  SSYDRSSNLSLTTTTTLWHSRLGHPADNILKHILTSCNISHQCHKNNVCCACQFAKSHKL 564

Query: 925  PVQHSHTKAKHPFELLHCDVWGKDHVPS 1008
            P     ++A HP  LLH D+WG   +PS
Sbjct: 565  PFNVXVSRASHPLALLHADLWGPXSIPS 592


>XP_019085488.1 PREDICTED: uncharacterized protein LOC104715244 [Camelina sativa]
          Length = 584

 Score =  143 bits (361), Expect = 8e-35
 Identities = 83/240 (34%), Positives = 126/240 (52%), Gaps = 4/240 (1%)
 Frame = +1

Query: 301  YDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQFLGTLQNLCIYKG 480
            +DN++Q+   PQAL        A++ +++    +W  DSG++ HI      L     Y G
Sbjct: 289  FDNSYQSEDAPQAL--------AALQVSDTCGQEWVTDSGSSAHITAATTQLSTATPYNG 340

Query: 481  TNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLTSMLPCYVIF 651
            + ++M+ +G  LPITH G T +   T  +PL   LV P ++++LLSVSKL    PC V F
Sbjct: 341  SKTVMVADGAHLPITHVGSTTLTTSTSSLPLLDVLVYPSMQKSLLSVSKLCDDYPCGVFF 400

Query: 652  TEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHRRLGHPYKTI 831
                + + D++  +V+ KG     +Y L +      FS+R   A+  +WH+RLGH    +
Sbjct: 401  DANAVYVIDLQTQKVVTKGPRRKSLYMLENKEFVALFSNRQCDASDMIWHQRLGHANLQV 460

Query: 832  LCHLL*YKLIYSTSHFDS-ICNSCELSKSKKLPVQHSHTKAKHPFELLHCDVWGKDHVPS 1008
            L HL   K I S     S +C  C++ KS +LP   S   AK P + +HCD+WG   V S
Sbjct: 461  LQHLKNSKAISSNKSSTSLVCGPCQMGKSCQLPFFSSDFSAKEPIDRIHCDLWGPSPVVS 520


>XP_013725106.1 PREDICTED: uncharacterized protein LOC106428899 [Brassica napus]
          Length = 537

 Score =  142 bits (358), Expect = 1e-34
 Identities = 81/221 (36%), Positives = 121/221 (54%), Gaps = 4/221 (1%)
 Frame = +1

Query: 274 HTTLKCWHMYDNNFQNSSMPQALTENTSAALASIHLNEVDPSDWHIDSGATDHIAQFLGT 453
           HT LKC++ +DN +Q +  PQA         A++ + +    +WH DSGAT H+      
Sbjct: 294 HTALKCYNRFDNAYQTTQPPQAY--------AALQVADSSGKEWHPDSGATTHVTSSTNN 345

Query: 454 LQNLCIYKGTNSIMIGNGDFLPITHTGDTFVNNLT*-VPLR--LVVPEIKRNLLSVSKLT 624
           L     Y GT+++M+ +G +LPI+H G   ++N T  + L   LV PEI+++LLSVSKL 
Sbjct: 346 LHTAETYNGTDAVMVADGTYLPISHIGSVTLSNTTGNISLNDVLVCPEIQKSLLSVSKLC 405

Query: 625 SMLPCYVIFTEKTILIKDIKIHRVLVKGNCSDGMYYLPSPMSKVFFSSRFVKATGEVWHR 804
              PC V F    + + D+   RV+ +G   +G+Y L S      +S+R   A  EVWH+
Sbjct: 406 DDYPCGVYFDASKVCVIDLITQRVVSEGPRRNGLYVLKSQELVAMYSNRQCGADAEVWHQ 465

Query: 805 RLGHPYKTILCHLL*YK-LIYSTSHFDSICNSCELSKSKKL 924
           RLGH    IL HL   K +  + S  + IC  C++ KS +L
Sbjct: 466 RLGHSNYQILQHLKNNKEITVNKSSINPICEPCQIGKSSRL 506

