BLASTX nr result

ID: Mentha29_contig00025525 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00025525
         (1468 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU44333.1| hypothetical protein MIMGU_mgv1a000009mg [Mimulus...   590   e-166
gb|EPS71292.1| hypothetical protein M569_03462, partial [Genlise...   421   e-115
ref|XP_006338249.1| PREDICTED: uncharacterized protein LOC102601...   417   e-114
ref|XP_006338248.1| PREDICTED: uncharacterized protein LOC102601...   417   e-114
ref|XP_004233645.1| PREDICTED: uncharacterized protein LOC101257...   400   e-109
ref|XP_006466676.1| PREDICTED: uncharacterized protein LOC102617...   379   e-102
ref|XP_006425795.1| hypothetical protein CICLE_v10024678mg [Citr...   379   e-102
ref|XP_007203912.1| hypothetical protein PRUPE_ppa016794mg, part...   371   e-100
ref|XP_007047104.1| Vacuolar protein sorting-associated protein ...   364   6e-98
ref|XP_006598717.1| PREDICTED: uncharacterized protein LOC100527...   345   3e-92
ref|XP_007155985.1| hypothetical protein PHAVU_003G249100g [Phas...   328   4e-87
ref|XP_004301869.1| PREDICTED: uncharacterized protein LOC101304...   327   1e-86
ref|XP_006405272.1| hypothetical protein EUTSA_v10027614mg [Eutr...   319   2e-84
ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] ...   315   3e-83
ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arab...   313   1e-82
ref|XP_006293179.1| hypothetical protein CARUB_v10019496mg [Caps...   313   2e-82
emb|CAB62317.1| putative protein [Arabidopsis thaliana]               308   3e-81
gb|EMT16046.1| hypothetical protein F775_00816 [Aegilops tauschii]    297   7e-78
gb|EMS50104.1| Retrovirus-related Pol polyprotein LINE-1 [Tritic...   294   6e-77
gb|EEE53527.1| hypothetical protein OsJ_36721 [Oryza sativa Japo...   292   2e-76

>gb|EYU44333.1| hypothetical protein MIMGU_mgv1a000009mg [Mimulus guttatus]
          Length = 3157

 Score =  590 bits (1520), Expect = e-166
 Identities = 316/507 (62%), Positives = 376/507 (74%), Gaps = 20/507 (3%)
 Frame = +3

Query: 6    QYKDTIHPN-RDASASSTSVLIRS-SHVDITSR-----------QKYILKDLQYFLAIER 146
            Q+KD  HP+  DAS+SSTSV  R  SHV I+ R           Q+YILKDL+ FLA+E 
Sbjct: 1305 QHKDFDHPDLADASSSSTSVSQRGGSHVGISMRNPGQKDLYISAQRYILKDLRCFLAVEG 1364

Query: 147  PVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRR 326
            PVT +   P  S++IW+G+GSIS FDVTISL E+ M+LSA  SFSK   +  T  V+SR 
Sbjct: 1365 PVTRDRITPTYSNNIWIGTGSISGFDVTISLCEIKMVLSALGSFSKVSSNVETPKVESRH 1424

Query: 327  WSYSQESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRV 506
             SY  E G N EEM+PDGTIVAIQDVD+HMYI V+  ES YD++G +H+SLVG+RALFRV
Sbjct: 1425 LSYDHEPGGNTEEMVPDGTIVAIQDVDQHMYIAVKGAESRYDVAGAMHYSLVGERALFRV 1484

Query: 507  NYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVDV-CSDGSGYAVWEMLPFK 683
             Y KP RW+   QYFSL+SLYAKDNSGE L+L+C+PRS+FVDV CS  SG A+W ML FK
Sbjct: 1485 KYHKPSRWKSQIQYFSLISLYAKDNSGESLRLTCRPRSRFVDVSCSIDSGSALWRMLSFK 1544

Query: 684  PDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGP 863
             DAYE AI VESST +SK+ FHLVNKKNDCA+AF DG LEFV KPGNLFKWKVFDD  GP
Sbjct: 1545 RDAYEVAIEVESSTSLSKKAFHLVNKKNDCALAFNDGILEFVGKPGNLFKWKVFDDP-GP 1603

Query: 864  VCNNLSPNRFTGISNATPVSNEQGSTDAGADLQ------SESNFTDVGMLRRNENLLGIT 1025
                        +SN  PV     ST    +LQ      S+SN  ++G L  N NL GI 
Sbjct: 1604 ------------LSNRFPVEGPSSSTAISRELQTYPRDGSDSNVMEMGELVANGNLSGIV 1651

Query: 1026 INVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRFEVILYYFDGQQN 1205
            + V+K++LTIVHE+SETEE+FPLLQG I PNQ I+QIS+ K+R+M+ FEVILYYFD QQN
Sbjct: 1652 VTVDKITLTIVHELSETEEKFPLLQGSISPNQAIIQISNSKLRVMNTFEVILYYFDAQQN 1711

Query: 1206 SWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFKEVTVLLSELSLDIVLFVVGK 1385
             W EFIQPL+IC FYSQK LIQGAEN LH + S FYA  KEVTVLLSELSLDI+LFV+GK
Sbjct: 1712 KWTEFIQPLEICTFYSQKFLIQGAENSLHGLPSHFYAKIKEVTVLLSELSLDILLFVIGK 1771

Query: 1386 LNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            L+LAGPY+VKSS+VLANC +V+NQ+GL
Sbjct: 1772 LDLAGPYAVKSSMVLANCYKVENQTGL 1798


>gb|EPS71292.1| hypothetical protein M569_03462, partial [Genlisea aurea]
          Length = 730

 Score =  421 bits (1081), Expect = e-115
 Identities = 213/419 (50%), Positives = 295/419 (70%), Gaps = 3/419 (0%)
 Frame = +3

Query: 219  FDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAIQ 398
            F + ++     M++S  E+ S+AL +  T+  +    S+S+E   N+ E++PDGTIVA+Q
Sbjct: 20   FKILLTFFWGQMVISTYEALSRALTTEQTSAAEFIGRSHSEEPEGNLSEIVPDGTIVALQ 79

Query: 399  DVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKD 578
            DVD+H Y+ V   ES + I+G +H+S+VG++ALFR+ Y+K   W+   +YF+L SLYA+ 
Sbjct: 80   DVDQHTYVVVERAESRFKITGAIHYSVVGEQALFRIKYKKSRMWKSEAEYFTLTSLYAEK 139

Query: 579  NSGEPLQLSCQPRSKFVDV-CSDGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLV 755
            N GE L+L+   RS+ VD+ C+D +  A+W+MLPFK DAYE++  +E ST +S+  FHLV
Sbjct: 140  NYGESLRLNFHARSRVVDISCTDENPSALWKMLPFKSDAYENSTELEYSTSLSRGLFHLV 199

Query: 756  NKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPN--RFTGISNATPVSNE 929
            NKKND  VAF +G LEF+SKPGNLFKWKVFD    P  ++LSPN  R  G S+ +P S+E
Sbjct: 200  NKKNDFGVAFTNGILEFISKPGNLFKWKVFDGHNPPGSSSLSPNWLRVKGTSSLSPESSE 259

Query: 930  QGSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCI 1109
                        +S+ ++ G +++ E   GI I V KV++T+ HE+S +EE+FPLLQ  +
Sbjct: 260  L----------HDSHTSEKGEIKKKERPFGILIAVNKVAVTVCHELSSSEERFPLLQVSL 309

Query: 1110 MPNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPL 1289
            MPNQ IVQ+SD KVR+M+ FEV LYYF+GQQN WK FIQPL++CIFYSQ   I GAE+  
Sbjct: 310  MPNQIIVQVSDSKVRVMNSFEVNLYYFNGQQNLWKNFIQPLELCIFYSQNIFIHGAESSS 369

Query: 1290 HRVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            H     FY    EV+V L++LSLDI+LFV+GKL+LAGPY+VKSS VL NCC+V+N+SGL
Sbjct: 370  HGFSKHFYIRTGEVSVFLTQLSLDILLFVIGKLDLAGPYAVKSSAVLGNCCKVENKSGL 428


>ref|XP_006338249.1| PREDICTED: uncharacterized protein LOC102601421 isoform X2 [Solanum
            tuberosum]
          Length = 2549

 Score =  417 bits (1072), Expect = e-114
 Identities = 214/467 (45%), Positives = 307/467 (65%), Gaps = 2/467 (0%)
 Frame = +3

Query: 72   SSHVDITSRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVN 251
            SS + + + Q Y+LKDL   L +E+P+  + S P  S+D W+GSGSI   D+T++L E+ 
Sbjct: 1368 SSQISLATPQNYVLKDLNAILVVEQPLKSSGSTPLQSNDFWIGSGSIDGCDMTLTLREIQ 1427

Query: 252  MILSAAESFSKALGSGGTTNVDSR-RWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITV 428
            +IL A E+ S       T +++ +     S ES  +++EM+PDGTIV+I+DVD+HMY+ V
Sbjct: 1428 IILFAGEALSAVFSVEATKSIEQQTHQKNSGESTRSLDEMVPDGTIVSIKDVDQHMYVAV 1487

Query: 429  RDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSC 608
               ESGY++ GE+H+SLVG+RALFRV Y +  RW    QY S +SLYAKD SGEPL+L+C
Sbjct: 1488 DRAESGYNLVGEIHYSLVGERALFRVKYHQTRRWNSQVQYLSFISLYAKDESGEPLRLNC 1547

Query: 609  QPRSKFVDVCSDG-SGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAF 785
              +S FVD+ S   S +A+W  LP+K D Y+  + +++    +K  F+LVNKKNDCA AF
Sbjct: 1548 HRQSDFVDISSSSDSAWALWRALPYKHDIYDADVDLKTYLPQTKNVFYLVNKKNDCAAAF 1607

Query: 786  IDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQS 965
            ++G LE V KPG+ FK+KVF D   P  NN+      G     P          G  L  
Sbjct: 1608 VNGVLEVVRKPGHPFKFKVFRDP-SPYVNNVF---LDGCLEKEP----------GTILLH 1653

Query: 966  ESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDV 1145
            +S   +   L +  +  GIT+ V+KVSLTIV+E+S+++E+ PLLQG I   + ++QIS+ 
Sbjct: 1654 DSYIIEGKDLSQRGSSFGITVAVDKVSLTIVYELSDSKEKVPLLQGSISFTEVVIQISNT 1713

Query: 1146 KVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFK 1325
            KVR M +  V++YYFD Q++ W++ + PL+I +FY    L QG EN +  V   FYA  K
Sbjct: 1714 KVRAMSKLGVLMYYFDSQKDMWRDLMHPLEIDVFYRYTFLNQGPENIILWVPGHFYARIK 1773

Query: 1326 EVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            E+++ ++ELSLDI+LF++GKLN AGPY+VK S +LANCC+V+NQSGL
Sbjct: 1774 ELSMTITELSLDIILFIIGKLNFAGPYAVKDSTILANCCKVENQSGL 1820


>ref|XP_006338248.1| PREDICTED: uncharacterized protein LOC102601421 isoform X1 [Solanum
            tuberosum]
          Length = 3185

 Score =  417 bits (1072), Expect = e-114
 Identities = 214/467 (45%), Positives = 307/467 (65%), Gaps = 2/467 (0%)
 Frame = +3

Query: 72   SSHVDITSRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVN 251
            SS + + + Q Y+LKDL   L +E+P+  + S P  S+D W+GSGSI   D+T++L E+ 
Sbjct: 1368 SSQISLATPQNYVLKDLNAILVVEQPLKSSGSTPLQSNDFWIGSGSIDGCDMTLTLREIQ 1427

Query: 252  MILSAAESFSKALGSGGTTNVDSR-RWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITV 428
            +IL A E+ S       T +++ +     S ES  +++EM+PDGTIV+I+DVD+HMY+ V
Sbjct: 1428 IILFAGEALSAVFSVEATKSIEQQTHQKNSGESTRSLDEMVPDGTIVSIKDVDQHMYVAV 1487

Query: 429  RDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSC 608
               ESGY++ GE+H+SLVG+RALFRV Y +  RW    QY S +SLYAKD SGEPL+L+C
Sbjct: 1488 DRAESGYNLVGEIHYSLVGERALFRVKYHQTRRWNSQVQYLSFISLYAKDESGEPLRLNC 1547

Query: 609  QPRSKFVDVCSDG-SGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAF 785
              +S FVD+ S   S +A+W  LP+K D Y+  + +++    +K  F+LVNKKNDCA AF
Sbjct: 1548 HRQSDFVDISSSSDSAWALWRALPYKHDIYDADVDLKTYLPQTKNVFYLVNKKNDCAAAF 1607

Query: 786  IDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQS 965
            ++G LE V KPG+ FK+KVF D   P  NN+      G     P          G  L  
Sbjct: 1608 VNGVLEVVRKPGHPFKFKVFRDP-SPYVNNVF---LDGCLEKEP----------GTILLH 1653

Query: 966  ESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDV 1145
            +S   +   L +  +  GIT+ V+KVSLTIV+E+S+++E+ PLLQG I   + ++QIS+ 
Sbjct: 1654 DSYIIEGKDLSQRGSSFGITVAVDKVSLTIVYELSDSKEKVPLLQGSISFTEVVIQISNT 1713

Query: 1146 KVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFK 1325
            KVR M +  V++YYFD Q++ W++ + PL+I +FY    L QG EN +  V   FYA  K
Sbjct: 1714 KVRAMSKLGVLMYYFDSQKDMWRDLMHPLEIDVFYRYTFLNQGPENIILWVPGHFYARIK 1773

Query: 1326 EVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            E+++ ++ELSLDI+LF++GKLN AGPY+VK S +LANCC+V+NQSGL
Sbjct: 1774 ELSMTITELSLDIILFIIGKLNFAGPYAVKDSTILANCCKVENQSGL 1820


>ref|XP_004233645.1| PREDICTED: uncharacterized protein LOC101257436 [Solanum
            lycopersicum]
          Length = 3178

 Score =  400 bits (1029), Expect = e-109
 Identities = 209/467 (44%), Positives = 302/467 (64%), Gaps = 2/467 (0%)
 Frame = +3

Query: 72   SSHVDITSRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVN 251
            SS + + + Q Y+LKDL   L +E+P+  + S P  S+D W+G+ SI   D+T+SL E+ 
Sbjct: 1362 SSQISLATPQNYVLKDLNASLVVEQPLNSSGSTPLQSNDFWIGNCSIDGCDMTLSLREIQ 1421

Query: 252  MILSAAESFSKALGSGGTTNVDSR-RWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITV 428
            +IL A E+ S      GT +++ +     S ES  + +EM+PDGTIV+I+D+D+HMY+ V
Sbjct: 1422 IILFAGEALSAVFSVEGTKSIEQQTHQKNSGESTRSQDEMVPDGTIVSIKDIDQHMYVAV 1481

Query: 429  RDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSC 608
              VESGY++ G +H+SL G+RALFRV Y +  RW    QY S +SLYAKD  GEPL+L+C
Sbjct: 1482 DRVESGYNLVGAIHYSLFGERALFRVKYHQTRRWNSQVQYLSFISLYAKDELGEPLRLNC 1541

Query: 609  QPRSKFVDVCSDG-SGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAF 785
              +S FVD+ S   S +A+W  LP+K D Y+  + +++    +K  F+LVNKKNDCA AF
Sbjct: 1542 HRQSDFVDISSSSDSAWALWRALPYKHDIYDADVDLKTYLPQTKNVFYLVNKKNDCAAAF 1601

Query: 786  IDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQS 965
            ++G LE V KPG+ FK+KVF D   P  N++      G     P          G  L  
Sbjct: 1602 VNGFLEVVRKPGHPFKFKVFRDP-SPYVNSVF---LDGCLEREP----------GTILLH 1647

Query: 966  ESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDV 1145
            ++  ++   L +  +  GIT+ V KVSLTI +E+S+++E+ PLLQG I    + +Q+S+ 
Sbjct: 1648 DTCISEGKDLSQRGSSFGITVAVVKVSLTIDYELSDSKEKVPLLQGSISFTDSYIQVSNT 1707

Query: 1146 KVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFK 1325
            KVR M R  V+L YFD Q++ W++ + PL+I +FY    L QG EN +  V   FYA  K
Sbjct: 1708 KVRAMSRLAVLLSYFDSQKDMWRDLMHPLEIDVFYRYTFLNQGPENSILWVPGHFYARIK 1767

Query: 1326 EVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            E+++ ++ELSLDI+LF++GKLNLAGPY+VK S +LANCC+V+NQSGL
Sbjct: 1768 ELSMTITELSLDIILFIIGKLNLAGPYAVKDSTILANCCKVENQSGL 1814


>ref|XP_006466676.1| PREDICTED: uncharacterized protein LOC102617616 [Citrus sinensis]
          Length = 3197

 Score =  379 bits (974), Expect = e-102
 Identities = 208/459 (45%), Positives = 276/459 (60%), Gaps = 1/459 (0%)
 Frame = +3

Query: 93   SRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAE 272
            S Q YIL  L  FL+ E+             + WVG GSIS FDVTISL E+ MI+S   
Sbjct: 1405 SHQNYILNHLSVFLSAEK-----------LENYWVGIGSISGFDVTISLPELQMIMSTVS 1453

Query: 273  SFSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYD 452
            SF        +     R  S  QES    + M+P+G IVAIQDVD+H Y  V D E+ Y 
Sbjct: 1454 SFYGISSKEMSRKTTERHQSIKQESSNGFKAMVPNGAIVAIQDVDQHTYFAVEDGENKYT 1513

Query: 453  ISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVD 632
            ++G +H+SLVG+RALFRV Y K  RW     +FSL+SLYAK++ GEPL+L+C   S FVD
Sbjct: 1514 LAGAIHYSLVGERALFRVKYHKQKRWMSSVLWFSLISLYAKNDLGEPLRLNCHSGSCFVD 1573

Query: 633  VCS-DGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFV 809
            + S D S   +W MLP   ++Y   +  E+   + K TF+LVNKKNDCAVAFIDG  EFV
Sbjct: 1574 ISSSDDSSCTLWRMLPCDSESYRGDVDWEAQNQLVKDTFYLVNKKNDCAVAFIDGVPEFV 1633

Query: 810  SKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQSESNFTDVG 989
             KPGN FK+K F        NNL+  R   +S+    S +   T+       + + T   
Sbjct: 1634 KKPGNSFKFKEF--------NNLAVTRDLVVSDG--YSFDASGTNVSRTEHDDEDKTS-- 1681

Query: 990  MLRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRF 1169
               ++  L  I I ++KV+LT+VHE+ +T+++ PL   C+   Q  VQ    K R+M   
Sbjct: 1682 --EKSGGLPCIHIKIDKVALTVVHELLDTKDRLPLFCACVSDTQIAVQSLSTKARVMSTS 1739

Query: 1170 EVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFKEVTVLLSE 1349
              +L YFD Q+N W+E +QP++ICI+Y     IQG+E   HRV  R Y   KE  + L+E
Sbjct: 1740 RALLSYFDAQRNLWRELVQPVEICIYYRSSFQIQGSEALWHRVPLRIYCRIKEFQIFLTE 1799

Query: 1350 LSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            LSLDI+LFVVGKL+LAGPY ++SS +LANCC+V+NQSGL
Sbjct: 1800 LSLDILLFVVGKLDLAGPYLIRSSRILANCCKVENQSGL 1838


>ref|XP_006425795.1| hypothetical protein CICLE_v10024678mg [Citrus clementina]
            gi|557527785|gb|ESR39035.1| hypothetical protein
            CICLE_v10024678mg [Citrus clementina]
          Length = 3169

 Score =  379 bits (974), Expect = e-102
 Identities = 208/459 (45%), Positives = 276/459 (60%), Gaps = 1/459 (0%)
 Frame = +3

Query: 93   SRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAE 272
            S Q YIL  L  FL+ E+             + WVG GSIS FDVTISL E+ MI+S   
Sbjct: 1405 SHQNYILNHLSVFLSAEK-----------LENYWVGIGSISGFDVTISLPELQMIMSTVS 1453

Query: 273  SFSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYD 452
            SF        +     R  S  QES    + M+P+G IVAIQDVD+H Y  V D E+ Y 
Sbjct: 1454 SFYGISSKEMSRKTTERHQSIKQESSNGFKAMVPNGAIVAIQDVDQHTYFAVEDGENKYT 1513

Query: 453  ISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVD 632
            ++G +H+SLVG+RALFRV Y K  RW     +FSL+SLYAK++ GEPL+L+C   S FVD
Sbjct: 1514 LAGAIHYSLVGERALFRVKYHKQKRWMSSVLWFSLISLYAKNDLGEPLRLNCHSGSCFVD 1573

Query: 633  VCS-DGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFV 809
            + S D S   +W MLP   ++Y   +  E+   + K TF+LVNKKNDCAVAFIDG  EFV
Sbjct: 1574 ISSSDDSSCTLWRMLPCDSESYRGDVDWEAQNQLVKDTFYLVNKKNDCAVAFIDGVPEFV 1633

Query: 810  SKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQSESNFTDVG 989
             KPGN FK+K F        NNL+  R   +S+    S +   T+       + + T   
Sbjct: 1634 KKPGNSFKFKEF--------NNLAVTRDLVVSDG--YSFDASGTNVSRTEHDDEDKTS-- 1681

Query: 990  MLRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRF 1169
               ++  L  I I ++KV+LT+VHE+ +T+++ PL   C+   Q  VQ    K R+M   
Sbjct: 1682 --EKSGGLPCIHIKIDKVALTVVHELLDTKDRLPLFCACVSDTQIAVQSLSTKARVMSTS 1739

Query: 1170 EVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFKEVTVLLSE 1349
              +L YFD Q+N W+E +QP++ICI+Y     IQG+E   HRV  R Y   KE  + L+E
Sbjct: 1740 RALLSYFDAQRNLWRELVQPVEICIYYRSSFQIQGSEALWHRVPLRIYCRIKEFQIFLTE 1799

Query: 1350 LSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            LSLDI+LFVVGKL+LAGPY ++SS +LANCC+V+NQSGL
Sbjct: 1800 LSLDILLFVVGKLDLAGPYLIRSSRILANCCKVENQSGL 1838


>ref|XP_007203912.1| hypothetical protein PRUPE_ppa016794mg, partial [Prunus persica]
            gi|462399443|gb|EMJ05111.1| hypothetical protein
            PRUPE_ppa016794mg, partial [Prunus persica]
          Length = 1855

 Score =  371 bits (953), Expect = e-100
 Identities = 206/491 (41%), Positives = 296/491 (60%), Gaps = 8/491 (1%)
 Frame = +3

Query: 15   DTIHPNRDASASST-------SVLIRSSHVDITSRQKYILKDLQYFLAIERPVTGNCSNP 173
            D IHP  DAS S         SV            Q YILK     +++E+P+  +    
Sbjct: 1359 DRIHPVNDASCSRDPGPQEEFSVHNSLPEAFRPIHQNYILKQAGAVISVEKPLNDSL--- 1415

Query: 174  PCSSDIWVGSGSISRFDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRRWSYSQESGE 353
             C +++WVGSGSIS FD+TISL E+ M+LS   SFS        +  D R  S ++E   
Sbjct: 1416 -CLNEVWVGSGSISCFDITISLSEIQMLLSMISSFSGVFKEEMISEPDRRHQSSNEEFKN 1474

Query: 354  NMEEMIPDGTIVAIQDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWR 533
            + E MIP+G IVAIQDV +HMY TV   E+ +++ G VH+SLVG+RALFRV Y   GRW+
Sbjct: 1475 SSETMIPNGAIVAIQDVHQHMYFTVEGEENKFNLVGVVHYSLVGERALFRVKYHNQGRWK 1534

Query: 534  PHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVDVCS-DGSGYAVWEMLPFKPDAYEDAIG 710
                +FSL+SLYAK++ GEPL+L+ +P S FVD+ S + +G+A+W+ +  +P+  E  I 
Sbjct: 1535 SSVSWFSLISLYAKNDLGEPLRLNYRPGSGFVDLSSANDNGWALWKAISCEPENSEGDID 1594

Query: 711  VESSTCISKRTFHLVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNR 890
             E +  + +RTF+L+NKK+D AVAF+DG  EFV KPGN FK KVF +    V  ++  + 
Sbjct: 1595 WEPNIQLVQRTFYLLNKKSDSAVAFVDGIPEFVRKPGNPFKLKVFHN--ASVARDIKMDS 1652

Query: 891  FTGISNATPVSNEQGSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIVHEIS 1070
            + G ++ T + ++    D    +             R+  L  I +  +K+SLTI HE+ 
Sbjct: 1653 YPGEASGTSLQHDALRDDGNTSV-------------RSGKLPCIDVTFDKISLTIFHELV 1699

Query: 1071 ETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFY 1250
            +TE+ FPLL GCI   +  VQI   K R++     +L+YFD Q+N W+E + P+++C+FY
Sbjct: 1700 DTEDMFPLLCGCIDQTKLTVQILPSKTRVISMSTAVLHYFDAQKNLWRELLHPVEVCLFY 1759

Query: 1251 SQKSLIQGAENPLHRVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVL 1430
                 +QG++     V    +   KE+ + LSELSLDI+LFV+GKLNLAGPYSV+S+ + 
Sbjct: 1760 RSSFQLQGSQAVSPGVPVHIHCRTKELNISLSELSLDILLFVIGKLNLAGPYSVRSNKIW 1819

Query: 1431 ANCCEVDNQSG 1463
            ANCC+V N SG
Sbjct: 1820 ANCCKVVNHSG 1830


>ref|XP_007047104.1| Vacuolar protein sorting-associated protein 13C, putative [Theobroma
            cacao] gi|508699365|gb|EOX91261.1| Vacuolar protein
            sorting-associated protein 13C, putative [Theobroma
            cacao]
          Length = 3155

 Score =  364 bits (934), Expect = 6e-98
 Identities = 208/457 (45%), Positives = 280/457 (61%), Gaps = 1/457 (0%)
 Frame = +3

Query: 99   QKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAESF 278
            Q YIL  L   L +E+       +P     +WVGSGS+S FD+TISL E+ MILS   SF
Sbjct: 1373 QDYILNHLTASLLVEKAEV----SPLDPKQVWVGSGSVSGFDMTISLSELQMILSMVSSF 1428

Query: 279  SKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYDIS 458
            S   G G +     R W Y+Q+   N E  IPDG IVAIQDV +H+Y  V   E+ Y I 
Sbjct: 1429 SGLSGKGSSGEFVQRNWPYNQQDDNNFEARIPDGAIVAIQDVHQHLYFMVEGGENQYSIG 1488

Query: 459  GEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVDVC 638
            G VH+SLVG+RALFRV Y+K  +W      FSLVSL+AK+NSGEPL+L+  P S FV++ 
Sbjct: 1489 GAVHYSLVGERALFRVKYQKQ-KWNSSALLFSLVSLHAKNNSGEPLRLNSYPGSGFVELS 1547

Query: 639  S-DGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFVSK 815
            S   + +A+W +L  K + Y+  I  E      + TF+LVNKKN CAVAF D    FV K
Sbjct: 1548 STTNNSWALWSILSCKRETYDGDIDWEPYNQGLRNTFYLVNKKNGCAVAFSDTVPVFVRK 1607

Query: 816  PGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQSESNFTDVGML 995
            PGN FK+KVF D        +S  +     +  P+++     +  A    ES        
Sbjct: 1608 PGNPFKFKVFSD--------MSVAQDVVTYSTCPLNSSGTEVNQSAHEDGES-------Y 1652

Query: 996  RRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRFEV 1175
            R + NL  I I ++KV+ T+VHE+S+T ++FPLL GCI   Q  +QI   K R++   + 
Sbjct: 1653 RESRNLPCIDITIDKVAFTVVHELSDTNDRFPLLHGCINGTQLTLQILSTKARVICTSKA 1712

Query: 1176 ILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFKEVTVLLSELS 1355
            +L YFD Q NSW++F++P++ICIFY  +S  Q   NP H V    Y   KE+ + L+ELS
Sbjct: 1713 LLQYFDAQTNSWRDFLRPVEICIFY--RSCFQ---NP-HGVPVHVYCRTKELEISLTELS 1766

Query: 1356 LDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            LDI+LFV+GKLNLAGP+SV+SS++LANC +V+NQ+GL
Sbjct: 1767 LDILLFVIGKLNLAGPFSVRSSMILANCGKVENQTGL 1803


>ref|XP_006598717.1| PREDICTED: uncharacterized protein LOC100527166 isoform X1 [Glycine
            max]
          Length = 3165

 Score =  345 bits (885), Expect = 3e-92
 Identities = 195/478 (40%), Positives = 276/478 (57%), Gaps = 1/478 (0%)
 Frame = +3

Query: 36   DASASSTSVLIRSSHVDITSRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGSGSIS 215
            +AS+S   V ++ SH      Q  ILK+L+ F+++ERP  G      C    W G GS+S
Sbjct: 1361 EASSSKNIVPVQLSH------QNQILKNLRAFMSLERPDNGTMHLSRC----WFGIGSLS 1410

Query: 216  RFDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAI 395
             FD+T+S+ E+  IL    + S         N++   WS S E   ++E MIPDG IVAI
Sbjct: 1411 GFDMTLSVSEIQTILLLYSTLSGISSQNTIKNLERNHWSTSHEVDNSLEAMIPDGAIVAI 1470

Query: 396  QDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAK 575
            QDV++HMY TV   E  + + G +H+SLVG+RALF V +    RW+    +FS +SL+AK
Sbjct: 1471 QDVNQHMYFTVEGEEKNFSLGGVMHYSLVGERALFMVKHCPQRRWKSTVLWFSFISLFAK 1530

Query: 576  DNSGEPLQLSCQPRSKFVDV-CSDGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHL 752
            ++ G PL+L+ QP S FVD+ C++  G A+W + P + + Y      E+S    KRTF+L
Sbjct: 1531 NDMGVPLRLNFQPGSCFVDISCTNDGGCALWRVYPPQGENYVGITDSEASNQSMKRTFYL 1590

Query: 753  VNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQ 932
            VNKKND A+AF+DG LEFV KPG+  K+KVF+D               G+S         
Sbjct: 1591 VNKKNDSAIAFVDGALEFVRKPGSPIKFKVFNDITAAY----------GVSETASYPRMA 1640

Query: 933  GSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIM 1112
              T    D +S S         +      I I +EK+SL IVHE+S+TE  FPL+   I 
Sbjct: 1641 PQTTLRTDEESTS--------WQGGKHPCIDIRIEKISLNIVHELSDTEYLFPLICLFIN 1692

Query: 1113 PNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLH 1292
              Q I+Q    K R++     + +YFD ++N W E + P++ICIFY      Q +E   H
Sbjct: 1693 NTQLIIQTLATKSRVISTSSAVAHYFDAERNLWGELLHPVEICIFYRSNIQAQLSEYRSH 1752

Query: 1293 RVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
             V   F+   KE+ V L+E SLD++LFV+G LNL+GPYS++SS++ ANCC+V+NQSGL
Sbjct: 1753 AVPVNFFCRMKEMDVYLNENSLDVLLFVIGILNLSGPYSLRSSIIQANCCKVENQSGL 1810


>ref|XP_007155985.1| hypothetical protein PHAVU_003G249100g [Phaseolus vulgaris]
            gi|561029339|gb|ESW27979.1| hypothetical protein
            PHAVU_003G249100g [Phaseolus vulgaris]
          Length = 3168

 Score =  328 bits (841), Expect = 4e-87
 Identities = 190/480 (39%), Positives = 278/480 (57%), Gaps = 3/480 (0%)
 Frame = +3

Query: 36   DASASSTSVLIRSSHVDITSRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGSGSIS 215
            DAS+S  ++      V I S +  ILK+L+ FL++ERP  G+     C    W G GS+ 
Sbjct: 1363 DASSSKNTL-----PVQIISHENQILKNLRAFLSLERPDNGDMHLSQC----WFGIGSLL 1413

Query: 216  RFDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAI 395
             FD+T+S+ E+  I+S + S S+         ++   WS   +    +E +IPDG IVAI
Sbjct: 1414 GFDITLSISEIQTIMSMSSSLSEIASQNAIKKLERNHWSSIHDVDNCLEAVIPDGAIVAI 1473

Query: 396  QDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAK 575
            QDV++HM+ TV   E  + + G +H+SLVG+RALFRV +    RW     +FS +SL+AK
Sbjct: 1474 QDVNQHMFFTVEGEEKTFRVGGIIHYSLVGERALFRVKHCLQRRWNSTVLWFSFISLFAK 1533

Query: 576  DNSGEPLQLSCQPRSKFVDVC--SDGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFH 749
            ++ G PL+L+ +P S FVD+C  +DG G A+W   P + +     I  E +    KRTF+
Sbjct: 1534 NDMGVPLRLNFRPGSCFVDICCPNDG-GCALWSANPAQGENDVGLIDSEVNNQSFKRTFY 1592

Query: 750  LVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPV-SN 926
            LVNKKND A+AF+DG LEFV KPG+  K+K F+D              T    A+ + S 
Sbjct: 1593 LVNKKNDSAIAFVDGALEFVKKPGSPIKFKFFND-------------ITAAYGASEIASY 1639

Query: 927  EQGSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGC 1106
             + +T+       E      G          I I +EKVSL IVHE+S+TE  FPL+   
Sbjct: 1640 PRMATETTIYTDEEITSWQGG------KHPCIDIKIEKVSLNIVHELSDTEYLFPLISLL 1693

Query: 1107 IMPNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENP 1286
            +   Q  +QIS  K R++     + +YFD ++NSW E + P++IC+FY      Q +E  
Sbjct: 1694 LNSTQLNIQISAKKYRVISTSSAVAHYFDVERNSWGELLHPVEICLFYRSNIEAQLSEYR 1753

Query: 1287 LHRVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
               V   ++   KE+ V L+E SLD++LFV+GKLNL+GPYS+++S++ ANCC+V+NQSGL
Sbjct: 1754 SDAVPVNYFCRMKELDVFLNENSLDMLLFVIGKLNLSGPYSMRNSIIEANCCKVENQSGL 1813


>ref|XP_004301869.1| PREDICTED: uncharacterized protein LOC101304881 [Fragaria vesca
            subsp. vesca]
          Length = 3178

 Score =  327 bits (837), Expect = 1e-86
 Identities = 207/534 (38%), Positives = 291/534 (54%), Gaps = 47/534 (8%)
 Frame = +3

Query: 6    QYKDTIHPNRDASASST--SVLIRSSHVDITS---RQKYILKDLQY-------------- 128
            Q+ D IHP  +AS+S    S   RS+H  +      QKYILK  +               
Sbjct: 1366 QHMDEIHPVNNASSSRGPGSQEERSAHSSLHEAFRHQKYILKGQEQASSECESRQEGETV 1425

Query: 129  FLAIERPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAESFSKALGSGGTT 308
            F+++E+P           +++W+GSG+IS FD+TISL ++ M+LS   SFS   G    +
Sbjct: 1426 FISVEKPPL---------NEVWIGSGTISCFDITISLCQIKMLLSMISSFSGVFGEEVIS 1476

Query: 309  NVDSRRWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYDISGEVHHSLVGK 488
              D R WS ++E   ++E ++P+G IVAIQDV +HMY TV   E+ Y ++G  H+SLVG+
Sbjct: 1477 EPDRRHWSSNEEFKNSLETVVPNGAIVAIQDVHQHMYFTVEGKENKYSLAGAAHYSLVGE 1536

Query: 489  RALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVDVCS-DGSGYAVW 665
             ALF V Y     W+  + +FSL+SL+AK+ SGEPL+L+    S FVDV S + +  A+W
Sbjct: 1537 SALFMVKYNNQRGWKSSSLWFSLISLHAKNASGEPLRLNYSRGSDFVDVSSANDNAAALW 1596

Query: 666  EMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVF 845
              +  +P++YE  I  E    + KRTF+LVNKKND AVA +DG  EFV KPGN  K KVF
Sbjct: 1597 TTISCEPESYEGDIDWEPYNQLVKRTFYLVNKKNDSAVAIVDGIPEFVRKPGNPIKLKVF 1656

Query: 846  DD-DLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQSESNFTDVGMLRRNENLLGI 1022
             +  + P     S  R   I                A LQ  +  +D G+   +  L  I
Sbjct: 1657 HNASIAPDIKVDSYPRLESI----------------ASLQ-HNPLSDEGITSGSGKLPCI 1699

Query: 1023 TINVEKVSLTIVHEISETEEQFPLLQGCI--------------------------MPNQT 1124
             +  + +SLTI+HE+ +T++  PLL+ CI                           P  T
Sbjct: 1700 YVTFDTISLTIIHELVDTKD-VPLLRCCIGGTGQSKHELEDSKDMALLGGCSDRTKPKFT 1758

Query: 1125 IVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHS 1304
            I QI   K R++     + YYFD Q+N W+E I P++ C FY      +G     H V  
Sbjct: 1759 I-QILPSKARVISSLTAVAYYFDAQRNKWRELIHPVETCFFYRSTHSSEGVS---HGVPV 1814

Query: 1305 RFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
              +   KE+ + LSELSLDI+LF VGKLNLAGP+SV+S+ + ANCC+V+NQSGL
Sbjct: 1815 HIHCRTKELNISLSELSLDILLFTVGKLNLAGPFSVRSTKIWANCCKVENQSGL 1868


>ref|XP_006405272.1| hypothetical protein EUTSA_v10027614mg [Eutrema salsugineum]
            gi|557106410|gb|ESQ46725.1| hypothetical protein
            EUTSA_v10027614mg [Eutrema salsugineum]
          Length = 3132

 Score =  319 bits (818), Expect = 2e-84
 Identities = 188/458 (41%), Positives = 271/458 (59%), Gaps = 2/458 (0%)
 Frame = +3

Query: 99   QKYILKDLQYFL-AIERPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAES 275
            + YIL++L+    A++R  TG+     CS   W G  S+  FD+TISL E+ M+LS   S
Sbjct: 1346 KNYILEELRVSASAMKRENTGH----QCSQ-AWEGGCSVLGFDITISLSELQMVLSMLSS 1400

Query: 276  FSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYDI 455
            FS AL  GG+ +    R S+++E   + E ++PDG IVAIQD+ +HM+ TV D  +   +
Sbjct: 1401 FS-ALPGGGSADASLERPSFNREPERSFESVVPDGAIVAIQDIHQHMFFTVEDRGNKCVV 1459

Query: 456  SGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVDV 635
            +G +H+SLVG+RALFRV Y +   W   T +FSL SLYAK+N GEPL+L+    S FV+V
Sbjct: 1460 TGTLHYSLVGERALFRVTYHRYQGWSSSTLWFSLTSLYAKNNKGEPLRLNYHSSSDFVNV 1519

Query: 636  CS-DGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFVS 812
            C    +   ++     + + Y+  I  E+   + K TF+LVNKK+D AVAFID   EFV 
Sbjct: 1520 CGLHDNATTLFRASVGESENYKGDIDWETYRKLVKDTFYLVNKKSDSAVAFIDSFPEFVR 1579

Query: 813  KPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQSESNFTDVGM 992
            KPGN FK+KVF + L              I N+T V   +           ES    V  
Sbjct: 1580 KPGNPFKFKVFRESLA-------------IRNSTSVVPPE---------IHESETQSV-- 1615

Query: 993  LRRNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRFE 1172
               N +   IT+ ++ VSLTIVHE+SET ++FPL +G I   Q  +Q+   K R+M    
Sbjct: 1616 --MNSSPPSITVTIDGVSLTIVHELSETRDRFPLFRGSINITQLTLQMLSSKARVMSTSN 1673

Query: 1173 VILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFKEVTVLLSEL 1352
            +++ YFD Q N W+EFI P+++  FY      Q  +N +H+V S  Y    ++ V L+EL
Sbjct: 1674 ILVLYFDAQTNQWREFIHPVEVSAFYRSTFQTQDLKNTMHKVPSHIYCRIGKLEVYLTEL 1733

Query: 1353 SLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            SLD++LFV+ +L  AGP+SVK+S++L NCC+++N SGL
Sbjct: 1734 SLDMLLFVLEELEFAGPFSVKTSVILPNCCKIENLSGL 1771


>ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332645140|gb|AEE78661.1| uncharacterized protein
            AT3G50380 [Arabidopsis thaliana]
          Length = 3072

 Score =  315 bits (808), Expect = 3e-83
 Identities = 180/456 (39%), Positives = 269/456 (58%), Gaps = 2/456 (0%)
 Frame = +3

Query: 105  YILKDLQYFLAIE-RPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAESFS 281
            YIL+DL+   +++ R  TG+       S  W G+ S+  FD+TISL E+ M+LS    F+
Sbjct: 1341 YILEDLRVSASVKKRENTGHQF-----SQAWAGACSVLGFDITISLSELQMVLSMLSLFA 1395

Query: 282  KALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYDISG 461
               G         R  S++ ES  + E ++PDG IVAIQD+++HM++TV D  +   ++G
Sbjct: 1396 AIPGGDSAHASLERPSSFNSESERSFESVVPDGAIVAIQDINQHMFVTVEDGGNKCVVTG 1455

Query: 462  EVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVDVCS 641
             +H+SLVG+RALFRV+Y +   W   T +FSL SLYAK+N GEPL+L+    S  V+V  
Sbjct: 1456 TLHYSLVGERALFRVSYHRHQGWNSSTLWFSLTSLYAKNNKGEPLRLNYHSSSDIVNVSG 1515

Query: 642  -DGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFVSKP 818
               +   ++     + + Y+  I  E+   + K TF+LVNKK+D AVAFIDG  EFV KP
Sbjct: 1516 LYDNAPTLFRASSGESENYKGDIDWETYRKLVKDTFYLVNKKSDSAVAFIDGFPEFVRKP 1575

Query: 819  GNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQSESNFTDVGMLR 998
            GN FK+KVF + L                + TPV            + SE + ++   + 
Sbjct: 1576 GNPFKFKVFHESLAT-------------RSLTPV------------VPSEIHESETHSVM 1610

Query: 999  RNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRFEVI 1178
             + +   IT+ ++ VSLTIVHE+SET ++FPL +G +   Q  VQ+   KVRIM    ++
Sbjct: 1611 VDSSPPSITVTIDGVSLTIVHELSETRDRFPLFRGSVNITQLTVQMLSSKVRIMSTSNIL 1670

Query: 1179 LYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFKEVTVLLSELSL 1358
            + YFD Q N W+EFI P+++  FY      +   N +H+V +  Y    ++ V L+ELSL
Sbjct: 1671 VLYFDAQTNQWREFIHPVEVSAFYRSTFQTRDLNNTMHKVPTHIYCRIGKLEVFLTELSL 1730

Query: 1359 DIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            D++LF++GKL  AGP+SVK+S +L+NCC+++N SGL
Sbjct: 1731 DMLLFLLGKLEFAGPFSVKTSAILSNCCKIENLSGL 1766


>ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arabidopsis lyrata subsp.
            lyrata] gi|297323582|gb|EFH54003.1| hypothetical protein
            ARALYDRAFT_485391 [Arabidopsis lyrata subsp. lyrata]
          Length = 3074

 Score =  313 bits (802), Expect = 1e-82
 Identities = 192/496 (38%), Positives = 274/496 (55%), Gaps = 9/496 (1%)
 Frame = +3

Query: 6    QYKDTIHPNRDASASSTSVLIRSSHV------DITSRQK-YILKDLQYFLAIE-RPVTGN 161
            Q  D I      SAS     +R          D  SR K YIL+DL+   +++ R  TG+
Sbjct: 1303 QQSDVISSGDSTSASGDFNSVREFSANSNLQEDFHSRYKNYILEDLRVSASVKKRENTGH 1362

Query: 162  CSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRRWSYSQ 341
                   S  WVG  S+  FD+TISL E+ M+LS    F+   G   T     R  S+  
Sbjct: 1363 QF-----SQAWVGGCSVLGFDMTISLSELQMVLSMLSLFAALPGGESTHASLERPSSFKS 1417

Query: 342  ESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKP 521
            ES  + E ++PDG IVAIQD+++HM+ TV D      ++G +H+SLVG+RALFRV+Y + 
Sbjct: 1418 ESERSFESVVPDGAIVAIQDINQHMFFTVEDGGDKCVVTGTLHYSLVGERALFRVSYHRH 1477

Query: 522  GRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVDVCSDGSGYAVWEMLPF-KPDAYE 698
              W   T +FSL SLYAK+N GEPL+L+    S  V+V              F + + Y+
Sbjct: 1478 QGWNSSTLWFSLTSLYAKNNKGEPLRLNYHSSSDIVNVSGLYDNAPTLFRASFGESENYK 1537

Query: 699  DAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGPVCNNL 878
              I  E+   + K TF+LVNKK+D AVAFIDG  EFV KPGN FK+KVF + L       
Sbjct: 1538 GDIDWETYRKLVKDTFYLVNKKSDLAVAFIDGFPEFVRKPGNPFKFKVFRESLAT----- 1592

Query: 879  SPNRFTGISNATPVSNEQGSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIV 1058
                     N TPV            + SE + ++   +  + +   IT+ ++ VSLTI+
Sbjct: 1593 --------RNLTPV------------VPSEIHESETQSVMVDSSPPSITVTIDSVSLTII 1632

Query: 1059 HEISETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKI 1238
            HE+SET ++FPL +G +   +  VQ+   KVRIM    +++ YFD Q N W+EFI P+++
Sbjct: 1633 HELSETRDRFPLFRGSVNITELAVQMLSSKVRIMSISNILVLYFDAQTNQWREFIHPVEV 1692

Query: 1239 CIFYSQKSLIQGAENPLHRVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKS 1418
              FY          N + +V +  Y    ++ V L+ELS+D++LFV+GKL  AGP+SVK+
Sbjct: 1693 SAFYRSTFQTPDLNNTMQKVPTHIYCRIGKLDVFLTELSMDMLLFVLGKLEFAGPFSVKT 1752

Query: 1419 SLVLANCCEVDNQSGL 1466
            S +L+NCC++ N SGL
Sbjct: 1753 SAILSNCCKIKNLSGL 1768


>ref|XP_006293179.1| hypothetical protein CARUB_v10019496mg [Capsella rubella]
            gi|482561886|gb|EOA26077.1| hypothetical protein
            CARUB_v10019496mg [Capsella rubella]
          Length = 3074

 Score =  313 bits (801), Expect = 2e-82
 Identities = 194/483 (40%), Positives = 280/483 (57%), Gaps = 5/483 (1%)
 Frame = +3

Query: 33   RDASASSTSVLIRSSHVDITSR-QKYILKDLQYFLAI-ERPVTGNCSNPPCSSDIWVGSG 206
            RD SA+S      +S  +  SR +KY+L+DL+   ++ +R  TG+       S  WVGS 
Sbjct: 1324 RDFSANS------NSQEEFHSRYKKYLLEDLRVSASVTKRENTGHQF-----SQAWVGSC 1372

Query: 207  SISRFDVTISLHEVNMILSAAESFSKALGSGGT-TNVDSRRWSYSQESGENMEEMIPDGT 383
            S+  FD+TISL E+ MILS   SF+   G G T  +++ R    + ES  + E ++PDG 
Sbjct: 1373 SVLGFDITISLSELQMILSMLSSFAALPGGGSTLASLEERPSLSNSESERSFESIVPDGA 1432

Query: 384  IVAIQDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVS 563
            IVAIQD ++HM+ TV +      ++G +H+SLVG+RALFR++Y +   W   T +FSL S
Sbjct: 1433 IVAIQDTNQHMFFTVEERGDKCVVTGTLHYSLVGERALFRISYHRHQGWNSSTLWFSLTS 1492

Query: 564  LYAKDNSGEPLQLSCQPRSKFVDVCSDGSGYAVWEMLPF-KPDAYEDAIGVESSTCISKR 740
            LYAK++ GEPL+L+    S  V+V              F + + Y+  I  E+   + K 
Sbjct: 1493 LYAKNSKGEPLRLNYHSSSDSVNVSGLYDNAPTLFRASFDESENYKGDIDWETYRKMVKD 1552

Query: 741  TFHLVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPV 920
            TF+LVNKK+  AVAFIDG  EFV KPGN FK+KVF + L                N TPV
Sbjct: 1553 TFYLVNKKSASAVAFIDGFPEFVRKPGNPFKFKVFRESLTT-------------RNVTPV 1599

Query: 921  -SNEQGSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPLL 1097
             S+E   ++A + + S                  I I ++ VSLTIVHE+SET ++FPL 
Sbjct: 1600 VSSEINESEAQSVMDSFPP--------------SIAITIDGVSLTIVHELSETRDKFPLF 1645

Query: 1098 QGCIMPNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGA 1277
            +G I   Q  +Q+   K RIM    +++ YFD Q N W+EFI P+++  FY      Q  
Sbjct: 1646 RGSINITQLSIQMLSSKARIMSTSNILVLYFDAQTNQWREFIHPVEVSAFYRSTFQTQEL 1705

Query: 1278 ENPLHRVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQ 1457
            +N +H+V +  Y    ++ V ++ELSLD++LFV+GKL  AGP+SVK+S +L+NCC+V+N 
Sbjct: 1706 QNTMHKVPTHIYCRVGKLEVFVTELSLDMLLFVLGKLEFAGPFSVKTSSILSNCCKVENL 1765

Query: 1458 SGL 1466
            SGL
Sbjct: 1766 SGL 1768


>emb|CAB62317.1| putative protein [Arabidopsis thaliana]
          Length = 3071

 Score =  308 bits (790), Expect = 3e-81
 Identities = 179/456 (39%), Positives = 268/456 (58%), Gaps = 2/456 (0%)
 Frame = +3

Query: 105  YILKDLQYFLAIE-RPVTGNCSNPPCSSDIWVGSGSISRFDVTISLHEVNMILSAAESFS 281
            YIL+DL+   +++ R  TG+       S  W G+ S+  FD+TISL E+ M+LS    F+
Sbjct: 1341 YILEDLRVSASVKKRENTGHQF-----SQAWAGACSVLGFDITISLSELQMVLSMLSLFA 1395

Query: 282  KALGSGGTTNVDSRRWSYSQESGENMEEMIPDGTIVAIQDVDEHMYITVRDVESGYDISG 461
               G         R  S++ ES  + E ++PD  IVAIQD+++HM++TV D  +   ++G
Sbjct: 1396 AIPGGDSAHASLERPSSFNSESERSFESVVPDA-IVAIQDINQHMFVTVEDGGNKCVVTG 1454

Query: 462  EVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVSLYAKDNSGEPLQLSCQPRSKFVDVCS 641
             +H+SLVG+RALFRV+Y +   W   T +FSL SLYAK+N GEPL+L+    S  V+V  
Sbjct: 1455 TLHYSLVGERALFRVSYHRHQGWNSSTLWFSLTSLYAKNNKGEPLRLNYHSSSDIVNVSG 1514

Query: 642  -DGSGYAVWEMLPFKPDAYEDAIGVESSTCISKRTFHLVNKKNDCAVAFIDGTLEFVSKP 818
               +   ++     + + Y+  I  E+   + K TF+LVNKK+D AVAFIDG  EFV KP
Sbjct: 1515 LYDNAPTLFRASSGESENYKGDIDWETYRKLVKDTFYLVNKKSDSAVAFIDGFPEFVRKP 1574

Query: 819  GNLFKWKVFDDDLGPVCNNLSPNRFTGISNATPVSNEQGSTDAGADLQSESNFTDVGMLR 998
            GN FK+KVF + L                + TPV            + SE + ++   + 
Sbjct: 1575 GNPFKFKVFHESLAT-------------RSLTPV------------VPSEIHESETHSVM 1609

Query: 999  RNENLLGITINVEKVSLTIVHEISETEEQFPLLQGCIMPNQTIVQISDVKVRIMDRFEVI 1178
             + +   IT+ ++ VSLTIVHE+SET ++FPL +G +   Q  VQ+   KVRIM    ++
Sbjct: 1610 VDSSPPSITVTIDGVSLTIVHELSETRDRFPLFRGSVNITQLTVQMLSSKVRIMSTSNIL 1669

Query: 1179 LYYFDGQQNSWKEFIQPLKICIFYSQKSLIQGAENPLHRVHSRFYANFKEVTVLLSELSL 1358
            + YFD Q N W+EFI P+++  FY      +   N +H+V +  Y    ++ V L+ELSL
Sbjct: 1670 VLYFDAQTNQWREFIHPVEVSAFYRSTFQTRDLNNTMHKVPTHIYCRIGKLEVFLTELSL 1729

Query: 1359 DIVLFVVGKLNLAGPYSVKSSLVLANCCEVDNQSGL 1466
            D++LF++GKL  AGP+SVK+S +L+NCC+++N SGL
Sbjct: 1730 DMLLFLLGKLEFAGPFSVKTSAILSNCCKIENLSGL 1765


>gb|EMT16046.1| hypothetical protein F775_00816 [Aegilops tauschii]
          Length = 3081

 Score =  297 bits (761), Expect = 7e-78
 Identities = 178/489 (36%), Positives = 276/489 (56%), Gaps = 10/489 (2%)
 Frame = +3

Query: 30   NRDASASSTSVLIRSSH--VDITSRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGS 203
            +RDA ASSTS L  S+   ++ +S + YIL      L IE+      SN  C S  W G+
Sbjct: 1385 DRDAPASSTSTLESSTGNTLEFSSHKSYILSHFSTSLKIEKKQLDKDSNLMCLSGDWCGN 1444

Query: 204  GSISRFDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGT 383
            G +S  +VT+SL  + MI S    F   L S  T        +  QE  +N++  IPDG 
Sbjct: 1445 GFVSGLEVTMSLSSIEMISSLLAPFHGMLSSTATQKEIQIGDTTQQEQLDNIDCTIPDGA 1504

Query: 384  IVAIQDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVS 563
            IVAI+D+D+ MY++V+++   Y + G  H+SL G+ ALF+V + K  RWR  T Y SL+S
Sbjct: 1505 IVAIRDLDQQMYVSVKNIGMKYQVVGAYHYSLAGEHALFKVKHHK--RWRSDTPYISLLS 1562

Query: 564  LYAKDNSGEPLQLSCQPRSKFVDVCSD-GSGYAVWEMLPFKPDAYEDAIGVESSTC--IS 734
            L AK + G+ L LS    S  V++ S      ++W M P   D++ED    + ++C  IS
Sbjct: 1563 LCAKTDEGKELALSFSQGSDLVEISSFVDKPCSLWSMFPLGFDSFEDDED-DGNSCKVIS 1621

Query: 735  KRTFHLVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNAT 914
              ++HLVNKKN+  +AF+DG LEFV KPGN FK K+ D+ L           F+ ++   
Sbjct: 1622 SSSYHLVNKKNNYGIAFVDGLLEFVKKPGNPFKLKILDESL-----------FSDVARLI 1670

Query: 915  PVSNEQGSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPL 1094
             V N     ++  D++ E     +  L    +   ITI+++K+  TI HE+ +T + FPL
Sbjct: 1671 -VPNMNLDGNSYLDVEDELPSAAMDRLETVASSQHITISIDKIVFTITHEVFDTGDVFPL 1729

Query: 1095 LQGCIMPNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICI-----FYSQK 1259
            +Q CI   + + QI   K+RI+  F+V   YFD ++N W++ I P+   +     F++Q 
Sbjct: 1730 VQNCINDIRVVTQIYPSKIRILSSFKVSGQYFDARKNMWEDLISPITSYVFLRFRFFNQD 1789

Query: 1260 SLIQGAENPLHRVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANC 1439
             + + +  PL     RF+ + K+V + ++ELS+D +L++VGKL L GPY+V++S +  NC
Sbjct: 1790 PVTRRSGTPL-----RFFFHLKQVDIFINELSVDTLLYLVGKLGLMGPYAVRNSAIFPNC 1844

Query: 1440 CEVDNQSGL 1466
            C+++N S L
Sbjct: 1845 CKIENNSRL 1853


>gb|EMS50104.1| Retrovirus-related Pol polyprotein LINE-1 [Triticum urartu]
          Length = 3154

 Score =  294 bits (753), Expect = 6e-77
 Identities = 175/484 (36%), Positives = 272/484 (56%), Gaps = 5/484 (1%)
 Frame = +3

Query: 30   NRDASASSTSVLIRSSH--VDITSRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVGS 203
            +RDA ASSTS L  S+   ++ +S + YIL      L IE+      SN  C S  + G+
Sbjct: 1211 DRDAPASSTSTLESSTGHTLEFSSHKSYILSHFSTSLKIEKKQLDRDSNLMCLSGDYCGN 1270

Query: 204  GSISRFDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDGT 383
            G +S  +VTISL  + MI S    F   L S  T        +  QE  +N++  IPDG 
Sbjct: 1271 GFVSGLEVTISLSSIEMISSLLAPFHGMLSSTATQKEIQIGDTTQQEQLDNIDCTIPDGA 1330

Query: 384  IVAIQDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLVS 563
            IVAI+D+D+ MY++V+++   Y + G  H+SL G+ ALF+V + K  RW   T Y SL+S
Sbjct: 1331 IVAIRDLDQQMYVSVKNIGMKYQVVGAYHYSLAGEHALFKVKHHK--RWGSDTPYISLLS 1388

Query: 564  LYAKDNSGEPLQLSCQPRSKFVDVCSD-GSGYAVWEMLPFKPDAYEDAIGVESSTC--IS 734
            L AK + G+ L LS    S  V++ S      ++W + P   D++ED    + ++C  IS
Sbjct: 1389 LCAKTDEGKELALSFSQGSDLVEISSFVDKPCSLWSLFPLGFDSFEDDED-DGNSCKVIS 1447

Query: 735  KRTFHLVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDLGPVCNNLSPNRFTGISNAT 914
              ++HLVNKKN+  +AF+DG LEFV KPGN FK K+ D+ L           F+ ++   
Sbjct: 1448 SSSYHLVNKKNNYGIAFVDGLLEFVKKPGNPFKLKILDESL-----------FSDVARLI 1496

Query: 915  PVSNEQGSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQFPL 1094
             V N     ++  D++ E     +  L    +   ITI+++K+  TI HE+ +T + FPL
Sbjct: 1497 -VPNMNLDGNSYLDVEDELPSAVMDRLETVASSQHITISIDKIVFTITHEVFDTGDVFPL 1555

Query: 1095 LQGCIMPNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLIQG 1274
            +Q CI   + + QI   K+RI+  F+V   YFD ++N W++ I P+   +F   +   Q 
Sbjct: 1556 VQNCINDIRVVTQIYPSKIRILSSFKVSGQYFDARKNMWEDLISPITSYVFLRFRFFNQD 1615

Query: 1275 AENPLHRVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEVDN 1454
            +     R   RF+ + K+V + ++ELS+D++L++VGKL L GPY+V++S +  NCC+++N
Sbjct: 1616 SVTRRSRTPLRFFFHLKQVDIFINELSVDMLLYLVGKLGLMGPYAVRNSAIFPNCCKIEN 1675

Query: 1455 QSGL 1466
             S L
Sbjct: 1676 NSRL 1679


>gb|EEE53527.1| hypothetical protein OsJ_36721 [Oryza sativa Japonica Group]
          Length = 4290

 Score =  292 bits (748), Expect = 2e-76
 Identities = 177/486 (36%), Positives = 271/486 (55%), Gaps = 7/486 (1%)
 Frame = +3

Query: 30   NRDASASSTSVLIRSSH---VDITSRQKYILKDLQYFLAIERPVTGNCSNPPCSSDIWVG 200
            + DA +SS S +  S+    ++++S + YIL+    +L +E+      SN   SS  W G
Sbjct: 1380 DHDAPSSSNSTVESSTGNPPLELSSHKSYILRHFATYLKLEKKELNGDSNLMRSSGDWFG 1439

Query: 201  SGSISRFDVTISLHEVNMILSAAESFSKALGSGGTTNVDSRRWSYSQESGENMEEMIPDG 380
            +GS+S  +VT+SL  + MILS    F + L SG T        +  QE  +N +  IPDG
Sbjct: 1440 NGSVSGLEVTMSLSSIEMILSLFAPFHEILRSGSTQKEIQTGDTPHQELLDNRDYTIPDG 1499

Query: 381  TIVAIQDVDEHMYITVRDVESGYDISGEVHHSLVGKRALFRVNYRKPGRWRPHTQYFSLV 560
             IVAI+D+D+ MY+++++    Y + G  H+SL  + ALF+V + K   WR  T   SL+
Sbjct: 1500 AIVAIRDLDQQMYVSIKNTGKKYQVVGTYHYSLSSECALFKVKHHKG--WRSDTPCISLL 1557

Query: 561  SLYAKDNSGEPLQLSCQPRSKFVDVCSD-GSGYAVWEMLPFKPDAYEDAIGVESSTC--I 731
            SLYAK + G+ L LS    S  V+V S      ++W   P + D +ED  G +   C  I
Sbjct: 1558 SLYAKTDEGKELALSFSHGSDLVEVSSSVDKPSSLWTTSPLRFDGFEDD-GDDGKYCKII 1616

Query: 732  SKRTFHLVNKKNDCAVAFIDGTLEFVSKPGNLFKWKVFDDDL-GPVCNNLSPNRFTGISN 908
            S+ + HLVNKK++  +AF DG LEFV KPGN FK KV D+ L   V     PN    + N
Sbjct: 1617 SRSSNHLVNKKSNYGIAFNDGLLEFVRKPGNPFKVKVLDESLFSDVARPFVPN--VNLDN 1674

Query: 909  ATPVSNEQGSTDAGADLQSESNFTDVGMLRRNENLLGITINVEKVSLTIVHEISETEEQF 1088
             T +           D+++E  F     L    +   + I+++K+  TI HE+ +T   F
Sbjct: 1675 NTYL-----------DVENELPFGMGDSLETGVSSQHVIISIDKIVFTITHEVLDTGNVF 1723

Query: 1089 PLLQGCIMPNQTIVQISDVKVRIMDRFEVILYYFDGQQNSWKEFIQPLKICIFYSQKSLI 1268
            PL+Q CI   + I QI   K+RI+  F+VI++YF+ ++  W+E + P+   +F+  +   
Sbjct: 1724 PLVQNCINDTRIITQIFPSKIRILSSFKVIIHYFNARKYLWEELVSPITAYMFFRYRFFN 1783

Query: 1269 QGAENPLHRVHSRFYANFKEVTVLLSELSLDIVLFVVGKLNLAGPYSVKSSLVLANCCEV 1448
                    R+  RF+ + K+V + ++ELS+DI+L+V GKLN+ GPY+VKSS V  NCC++
Sbjct: 1784 LVPVTRCRRMPLRFFVHLKQVDIFVNELSIDILLYVAGKLNVMGPYAVKSSAVFPNCCKI 1843

Query: 1449 DNQSGL 1466
            +N S L
Sbjct: 1844 ENNSRL 1849

