BLASTX nr result

ID: Mentha23_contig00003054 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00003054
         (1916 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAK29467.1| polyprotein-like [Solanum chilense]                    127   2e-26
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...   122   8e-25
ref|XP_003625577.1| Ubiquitin-protein ligase [Medicago truncatul...   117   1e-23
ref|XP_003613324.1| Cc-nbs-lrr resistance protein [Medicago trun...   115   5e-23
ref|XP_007038204.1| Transducin/WD40 repeat-like superfamily prot...   114   2e-22
emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera]   110   2e-21
ref|XP_007028624.1| Uncharacterized protein TCM_024518 [Theobrom...   110   3e-21
emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]   108   7e-21
gb|AAP53866.2| retrotransposon protein, putative, Ty1-copia subc...   108   9e-21
gb|ABA98861.1| retrotransposon protein, putative, Ty1-copia subc...   107   1e-20
emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera]   107   2e-20
gb|ABA96825.1| retrotransposon protein, putative, Ty1-copia subc...   105   6e-20
gb|ABD96963.1| hypothetical protein [Cleome spinosa]                  105   6e-20
emb|CAA31653.1| polyprotein [Arabidopsis thaliana]                    104   2e-19
emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis]        104   2e-19
emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana]         99   5e-18
emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera]    98   2e-17
ref|XP_004165166.1| PREDICTED: uncharacterized protein LOC101225...    95   1e-16
gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidop...    95   1e-16
emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana] ...    95   1e-16

>gb|AAK29467.1| polyprotein-like [Solanum chilense]
          Length = 1328

 Score =  127 bits (318), Expect = 2e-26
 Identities = 83/265 (31%), Positives = 135/265 (50%), Gaps = 3/265 (1%)
 Frame = -1

Query: 1229 MSSMVYGLVPFDGSSD-FGMWKRKMKCILIDKRAYKAI--TLEYXXXXXXXXXXXXXDLA 1059
            MS + Y +  F+G    F MW+R+MK +LI +  +KA+    +              + A
Sbjct: 1    MSGVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKA 60

Query: 1058 ISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSIT 879
             S I L L+D V+ ++ + +SA  +W KLE LY   +L ++++L +  ++  +D   +  
Sbjct: 61   ASAIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFL 120

Query: 878  ENLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNAL 699
             +LNV N LI  +   G K  +     VLLN++P SY  + + I +G+D + L  V +AL
Sbjct: 121  SHLNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLKDVTSAL 180

Query: 698  KHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRC 519
               EK    ++       +VF    +  SY + + +   SG   +  K   + + K R C
Sbjct: 181  LLNEK----MRKKPENHGQVFITESRGRSYQRSSSNYGRSG---ARGKSKVRSKSKARNC 233

Query: 518  YNCGEIGHYVRDCPNPKRNQKGEQA 444
            YNC + GH+ RDCPNPKR  KGE +
Sbjct: 234  YNCDQPGHFKRDCPNPKRG-KGESS 257


>sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon TNT
            1-94; Includes: RecName: Full=Protease; Includes:
            RecName: Full=Reverse transcriptase; Includes: RecName:
            Full=Endonuclease gi|20045|emb|CAA32025.1| unnamed
            protein product [Nicotiana tabacum]
          Length = 1328

 Score =  122 bits (305), Expect = 8e-25
 Identities = 74/262 (28%), Positives = 134/262 (51%), Gaps = 2/262 (0%)
 Frame = -1

Query: 1229 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXD--LAI 1056
            MS + Y +  F+G + F  W+R+M+ +LI +  +K + ++                  A 
Sbjct: 1    MSGVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAA 60

Query: 1055 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 876
            S I L LSD V+ ++ + D+A+ +W +LE+LY   +L ++++L +  ++  +    +   
Sbjct: 61   SAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLS 120

Query: 875  NLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 696
            +LNVFN LI  +   G K  +     +LLN++P SY ++ + I +G+  + L  V +AL 
Sbjct: 121  HLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 180

Query: 695  HKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCY 516
              EK    ++       +     G+  SY + + +   SG   +  K   + + + R CY
Sbjct: 181  LNEK----MRKKPENQGQALITEGRGRSYQRSSNNYGRSG---ARGKSKNRSKSRVRNCY 233

Query: 515  NCGEIGHYVRDCPNPKRNQKGE 450
            NC + GH+ RDCPNP++  KGE
Sbjct: 234  NCNQPGHFKRDCPNPRKG-KGE 254


>ref|XP_003625577.1| Ubiquitin-protein ligase [Medicago truncatula]
            gi|355500592|gb|AES81795.1| Ubiquitin-protein ligase
            [Medicago truncatula]
          Length = 1405

 Score =  117 bits (294), Expect = 1e-23
 Identities = 86/265 (32%), Positives = 132/265 (49%), Gaps = 7/265 (2%)
 Frame = -1

Query: 1199 FDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDL--AISVICLGLSDC 1026
            F G +DFG+WK KM+ +LI ++  KA+  E               +  A S + L L D 
Sbjct: 10   FTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSQAEKTEMVDKARSAVVLCLGDK 69

Query: 1025 VLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNKLIK 846
            VL  V    +A  +W KLE+LY   SLA + FL +  +SF++  +K+I E L  FNK++ 
Sbjct: 70   VLREVAKEATAASIWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEFNKILD 129

Query: 845  DIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRD-KVTLDVVMNALKHKE----KD 681
            D++    +  D     +LL A+P+S+   K  + YG++  VTL+ V  AL+ KE    KD
Sbjct: 130  DLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKELTKSKD 189

Query: 680  LNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCYNCGEI 501
            L              H +G+  S  + N    G+  +KS +K  F+       C+NC ++
Sbjct: 190  LT-------------HEHGEGLSVTRGNGGGRGN-RRKSGNKSRFE-------CFNCHKM 228

Query: 500  GHYVRDCPNPKRNQKGEQANVVSAG 426
            GH+ +DCP       G  A +VS G
Sbjct: 229  GHFKKDCP----EINGNSAQIVSEG 249


>ref|XP_003613324.1| Cc-nbs-lrr resistance protein [Medicago truncatula]
            gi|355514659|gb|AES96282.1| Cc-nbs-lrr resistance protein
            [Medicago truncatula]
          Length = 1104

 Score =  115 bits (289), Expect = 5e-23
 Identities = 86/265 (32%), Positives = 130/265 (49%), Gaps = 7/265 (2%)
 Frame = -1

Query: 1199 FDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDL--AISVICLGLSDC 1026
            F G +DFG+WK KM+ +LI ++  KA+  E               +  A S + L L D 
Sbjct: 10   FTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSQAEKTEMVDKARSAVVLCLGDK 69

Query: 1025 VLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNKLIK 846
            VL  V    +A  +W KLE+LY   SLA + FL +  +SF++  +K+I E L  FNK++ 
Sbjct: 70   VLREVAKEATAASIWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEFNKILD 129

Query: 845  DIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRD-KVTLDVVMNALKHKE----KD 681
            D++    +  D     +LL A+P+S+   K  + YG++  VTL+ V  AL+ KE    KD
Sbjct: 130  DLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKELTKSKD 189

Query: 680  LNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCYNCGEI 501
            L              H  G   S  + N    G+  +KS +K  F+       C+NC ++
Sbjct: 190  LT-------------HEYGDGLSVTRGNGGGRGN-RRKSGNKSRFE-------CFNCHKM 228

Query: 500  GHYVRDCPNPKRNQKGEQANVVSAG 426
            GH+ +DCP       G  A +VS G
Sbjct: 229  GHFKKDCP----EINGNSAQIVSEG 249


>ref|XP_007038204.1| Transducin/WD40 repeat-like superfamily protein [Theobroma cacao]
            gi|508775449|gb|EOY22705.1| Transducin/WD40 repeat-like
            superfamily protein [Theobroma cacao]
          Length = 1029

 Score =  114 bits (284), Expect = 2e-22
 Identities = 80/272 (29%), Positives = 134/272 (49%), Gaps = 8/272 (2%)
 Frame = -1

Query: 1226 SSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDL--AIS 1053
            SS  Y +  F+G +DF +W+ KM+ +L+ +   KA+  +               +  A S
Sbjct: 126  SSTKYEIEKFNGRNDFSLWRVKMRALLVQQGLLKALKGKEHLPSNLSDGEKDDLMKKAHS 185

Query: 1052 VICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITEN 873
            VI L LSD VL  V + +SA  +W KLE++Y   SL +++++ +  ++ K+    S+  +
Sbjct: 186  VILLALSDEVLREVTDEESAAAVWFKLESIYMTKSLTNRLYMKQRLYTLKMSEGTSVNTH 245

Query: 872  LNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALKH 693
            ++ FN++I D+K    K  D     +LL  +P SY +    + YGRD +T + V  +L  
Sbjct: 246  IDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTMLYGRDTLTFEDVRASLNF 305

Query: 692  KE--KDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRC 519
            KE  K +  ++N +     V +                G G +K  D+   K R K + C
Sbjct: 306  KELKKKVGGIRNENQAEGLVVN---------------RGRGKEKGLDRKG-KSRAKGKTC 349

Query: 518  YNCGEIGHYVRDC----PNPKRNQKGEQANVV 435
            +NCG+ GH+ +DC     + K N+    ANVV
Sbjct: 350  WNCGQKGHFRQDCTKFKDDEKFNKSENTANVV 381


>emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera]
          Length = 950

 Score =  110 bits (275), Expect = 2e-21
 Identities = 79/261 (30%), Positives = 130/261 (49%), Gaps = 2/261 (0%)
 Frame = -1

Query: 1229 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDL--AI 1056
            MSS  + +  F+GS+DF +WK KMK +L+ ++  +AI  E               +  A 
Sbjct: 1    MSSQKFEVEKFNGSNDFTLWKLKMKALLVQQKCAQAIEGEETLPVGLTAVEKEEVVSRAH 60

Query: 1055 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 876
            S I L L+D VL  V +  +A  LW K E+ Y + SL ++++      + K+     + +
Sbjct: 61   SAILLSLADEVLREVADETTAVGLWRKFESKYQKKSLTNRLYQKRQLHTLKMSEGMQVRD 120

Query: 875  NLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 696
            +LN FN++I D+   G K  +     +LL ++P SY +    + YGR+ ++ + V +AL+
Sbjct: 121  HLNNFNRIILDLNGVGVKVEEEDQAMILLCSLPSSYENFVDTMMYGRBSISXNDVKDALQ 180

Query: 695  HKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCY 516
             KE     L+ + SG ++     G T S  +  E  NG G  KS  K          RC+
Sbjct: 181  SKE-----LQKLVSGSEEGSVETGLTVSRGRSMER-NGGGRSKSXSKSK-----AAMRCF 229

Query: 515  NCGEIGHYVRDCPNPKRNQKG 453
            +  E GH+ ++CP   + QKG
Sbjct: 230  HXKEKGHFRKNCP---QRQKG 247


>ref|XP_007028624.1| Uncharacterized protein TCM_024518 [Theobroma cacao]
            gi|508717229|gb|EOY09126.1| Uncharacterized protein
            TCM_024518 [Theobroma cacao]
          Length = 277

 Score =  110 bits (274), Expect = 3e-21
 Identities = 80/272 (29%), Positives = 131/272 (48%), Gaps = 8/272 (2%)
 Frame = -1

Query: 1226 SSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDL--AIS 1053
            SS  Y +  F+G +DF +W+ KM  +L+ +   KA+  +               +  A S
Sbjct: 4    SSTKYEIEKFNGRNDFSLWRVKMCALLVQQGLLKALKGKEHLPSNLSDSEKDDLMEKAHS 63

Query: 1052 VICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITEN 873
             I L LSD VL  V + +SA  +W KLE++Y   SL +++++ +  ++ K+    S+  +
Sbjct: 64   AILLTLSDEVLREVTDEESAAAMWFKLESIYITKSLTNRLYMKQRLYTLKMSEGTSVNTH 123

Query: 872  LNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALKH 693
            ++ FN++I D+K    K  D     +LL  +P SY +    + YGRD +T + V   L  
Sbjct: 124  IDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTMLYGRDTLTFEDVRAYLNS 183

Query: 692  KE--KDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRC 519
            KE  K +  ++N +     V +                G G +K  DK   K R K + C
Sbjct: 184  KELKKKVGGIRNENQAEGLVVN---------------RGRGKEKGLDKKG-KSRAKGKTC 227

Query: 518  YNCGEIGHYVRDC----PNPKRNQKGEQANVV 435
            +NCG+ GH+ +DC     + K N+    ANVV
Sbjct: 228  WNCGQKGHFRQDCTKFKDDEKFNKSENTANVV 259


>emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]
          Length = 894

 Score =  108 bits (271), Expect = 7e-21
 Identities = 77/265 (29%), Positives = 128/265 (48%), Gaps = 2/265 (0%)
 Frame = -1

Query: 1229 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDL--AI 1056
            M ++ + +  F G +DFG+W+ KM+ +L+ +    A+  E               L  A 
Sbjct: 1    MGTVKFDVEKFTGKNDFGLWRLKMRALLVQQGLQDALLGEKNLPXTMQEKHKIELLEKAH 60

Query: 1055 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 876
              I L L D  L  V    SA  L  KLE+LY   SLA+++      ++FK+  + SI E
Sbjct: 61   GAIILSLGDTXLREVAKAKSAAKLLLKLESLYMTKSLANRLHKXIKLYTFKMTPSMSIEE 120

Query: 875  NLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 696
            +L+ FNK+I D+K       +     +LL ++  SY ++K AI YGRD +T D V + L 
Sbjct: 121  HLDHFNKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMYGRDILTFDEVQSILH 180

Query: 695  HKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCY 516
             +E  L+  +       +  ++ GK+    +          K +N K   K + K  +C+
Sbjct: 181  ARE--LHKQEESKEELGEGLNIRGKSKKREK---------KKGNNSKSRSKSKTKKFKCF 229

Query: 515  NCGEIGHYVRDCPNPKRNQKGEQAN 441
             C + GH+ +DCP+ ++N   +  N
Sbjct: 230  ICHKEGHFKKDCPDMRQNTXKKTMN 254


>gb|AAP53866.2| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
            Japonica Group]
          Length = 415

 Score =  108 bits (270), Expect = 9e-21
 Identities = 69/264 (26%), Positives = 132/264 (50%)
 Frame = -1

Query: 1214 YGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDLAISVICLGL 1035
            + +V FDG+ +F +W+ ++K +L  +   KA+                   A + I L L
Sbjct: 9    FEVVKFDGTGNFVLWQMRLKDLLAQQGISKALQETMPEKIDADKWNEMKAQAAATIRLSL 68

Query: 1034 SDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNK 855
            SD V+  V +  S K++W+KL +L+   SL S+++L +  +  ++     + ++++VFN+
Sbjct: 69   SDSVMYQVMDEKSPKEIWDKLASLHMSKSLTSKLYLKQQLYGLQVQEESDLRKHVDVFNQ 128

Query: 854  LIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALKHKEKDLN 675
            L+ D+ +   K  D     +LL ++P SY  V + + +G+D V  + ++++L        
Sbjct: 129  LVVDLSKLDVKLDDEDKAIILLCSLPLSYEHVVTTLTHGKDTVKTEEIISSL-------- 180

Query: 674  LLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCYNCGEIGH 495
            L +++            +  S    ++H + +G  KS +KGA        RCY C E GH
Sbjct: 181  LARDLRRSKKNEATKASQGKSLLVKDKHDHEAGVSKSKEKGA--------RCYKCHEFGH 232

Query: 494  YVRDCPNPKRNQKGEQANVVSAGE 423
              R+CP  K+ +KG  A++ + G+
Sbjct: 233  IRRNCPLLKK-RKGGIASLAARGD 255


>gb|ABA98861.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
            Japonica Group]
          Length = 425

 Score =  107 bits (268), Expect = 1e-20
 Identities = 73/272 (26%), Positives = 135/272 (49%), Gaps = 3/272 (1%)
 Frame = -1

Query: 1229 MSSMV---YGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDLA 1059
            M++MV   + +V FDG+ +F +W+ ++K +L  +   KA+                   A
Sbjct: 1    MAAMVVSKFEVVKFDGTGNFILWQMRLKDLLAQQGISKALEETMPEKMDAGKWEEMKAQA 60

Query: 1058 ISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSIT 879
             + I L LSD V+  V +  + K++W+KL +LY   SL S+++L +  +  ++     + 
Sbjct: 61   AATIRLSLSDSVMYPVMDEKTPKEIWDKLASLYMSKSLTSKLYLKQQLYGLQMQEESDLR 120

Query: 878  ENLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNAL 699
            ++++VFN+L+ D+ +   K  D     +LL ++P SY  V + + +G+D V  +  +++L
Sbjct: 121  KHVDVFNQLVVDLSKLDVKLDDEDMAIILLCSLPPSYEHVVTTLMHGKDTVKTEEKISSL 180

Query: 698  KHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRC 519
                    L +++            +  S     +H + +G  KS DKGA        RC
Sbjct: 181  --------LARDLRRSNKNEAMEASQAESLLVKAKHDHEAGVSKSKDKGA--------RC 224

Query: 518  YNCGEIGHYVRDCPNPKRNQKGEQANVVSAGE 423
            Y C E GH  R+CP  K+ +KG  A++ + G+
Sbjct: 225  YKCHEFGHIRRNCPLLKK-RKGGIASLAARGD 255


>emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera]
          Length = 1208

 Score =  107 bits (267), Expect = 2e-20
 Identities = 75/265 (28%), Positives = 124/265 (46%), Gaps = 2/265 (0%)
 Frame = -1

Query: 1229 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDL--AI 1056
            M +  + +  F G +DFG+ + KM+ +L+ +    A+  E               L  A 
Sbjct: 1    MGTAKFDVEKFTGKNDFGLXRLKMRALLVQQGLQDALLGEKNLPSTMQEKQKIELLEKAH 60

Query: 1055 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 876
            S I L L D VL       SA ++W KLE+LY   SLA+++      ++FK+    SI  
Sbjct: 61   SAIILSLGDTVLREXAKAKSAAEVWLKLESLYMTKSLANRLHKKIKLYTFKMTPGMSIEX 120

Query: 875  NLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 696
            +L+ FNK+I D++       D     +LL ++  SY ++K AI YGRD +T D V + L 
Sbjct: 121  HLDHFNKIILDLENIDITISDEDKAILLLTSLDASYTNMKDAIMYGRDSLTFDEVQSILH 180

Query: 695  HKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCY 516
             +E          SG  +  ++ G++    +          K  N K   K + K  +C+
Sbjct: 181  ARELQKQEESKEESG--EGLNIRGRSEKREK----------KGKNSKSRSKSKTKKFKCF 228

Query: 515  NCGEIGHYVRDCPNPKRNQKGEQAN 441
             C + GH+ +DCP+ ++N   +  N
Sbjct: 229  ICHKEGHFKKDCPDRRQNTVKKTVN 253


>gb|ABA96825.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
            Japonica Group]
          Length = 424

 Score =  105 bits (263), Expect = 6e-20
 Identities = 67/254 (26%), Positives = 124/254 (48%)
 Frame = -1

Query: 1214 YGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXDLAISVICLGL 1035
            + +V FDG+ +F +W+ ++K +L  +   KA+                   A + I L L
Sbjct: 9    FEVVKFDGTGNFVLWQMRLKDLLAQQGISKALEETMPEKMDAGKWEEMKAQAAATIRLSL 68

Query: 1034 SDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNK 855
            SD V+  V +  ++K++W KL +LY   SL S+++L +  +  ++     + ++++VFN+
Sbjct: 69   SDSVMYQVMDEKTSKEIWVKLTSLYMSKSLTSKLYLKQQLYGLQMQEESDLRKHVDVFNQ 128

Query: 854  LIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALKHKEKDLN 675
            L+ D+ +   K  D     +LL ++P SY  V + + +G+D +  +++ + L    +DL 
Sbjct: 129  LVVDLSKLDVKLDDEDKAIILLCSLPPSYEHVVTILTHGKDTIKTEIISSLL---ARDLR 185

Query: 674  LLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCYNCGEIGH 495
              K   +          +  S     +H + +G  KS +KGA        RCY C E GH
Sbjct: 186  RSKKNEA------MEASQAESLLVKAKHDHEAGVSKSKEKGA--------RCYKCHEFGH 231

Query: 494  YVRDCPNPKRNQKG 453
              R+CP  K+ + G
Sbjct: 232  IRRNCPLLKKRKDG 245


>gb|ABD96963.1| hypothetical protein [Cleome spinosa]
          Length = 408

 Score =  105 bits (263), Expect = 6e-20
 Identities = 63/211 (29%), Positives = 111/211 (52%)
 Frame = -1

Query: 1061 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 882
            A ++I L L+D VL  V +  +A  +W KLE L+ E SL ++M+L +    F++D +++I
Sbjct: 98   ARNLIVLALADQVLRKVISERTAFGIWRKLERLHIEQSLPNRMYLMQRVSGFRMDSSRTI 157

Query: 881  TENLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNA 702
             ENL++F KL+ D+     K  + Y    LLN++P +Y  ++  +KY R  ++++ V  A
Sbjct: 158  EENLDIFQKLLSDLHSLNVKVEEEYQAVYLLNSLPPAYEQLREVLKYSRATISVEEVKAA 217

Query: 701  LKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRR 522
             + KE +L     +  G  +   V GK           +G G KK+ D+           
Sbjct: 218  ARMKELELLAQGTLTRGTGEGLVVKGKPEK--------SGGGKKKAKDQ---------VE 260

Query: 521  CYNCGEIGHYVRDCPNPKRNQKGEQANVVSA 429
            C+ CG+ GHY ++C + +  ++ E   VV++
Sbjct: 261  CWYCGKKGHYKKECRSRRAKEETEGKGVVAS 291


>emb|CAA31653.1| polyprotein [Arabidopsis thaliana]
          Length = 1291

 Score =  104 bits (259), Expect = 2e-19
 Identities = 65/218 (29%), Positives = 106/218 (48%), Gaps = 3/218 (1%)
 Frame = -1

Query: 1061 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 882
            A+++I   + D VL  +D+  SA ++WE L   Y ETSL +++++   F+SFK++ TKSI
Sbjct: 90   AMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKFYSFKMNDTKSI 149

Query: 881  TENLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNA 702
             EN+N F K++ ++       ++     + LN +   Y  +K  +KYG   ++L  V++A
Sbjct: 150  NENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKALSLKDVISA 209

Query: 701  LKHKEKDLNLLKNVHSGPDKVFHVNGKTNSY---YQHNEHANGSGDKKSNDKGAFKPRYK 531
             +  E++LN  K        V + N ++        HN+   G G  KSN          
Sbjct: 210  ARSLERELNEQKETDKNTSTVLYTNERSRPQTRNQNHNKGGQGRGRSKSNSNAKL----- 264

Query: 530  PRRCYNCGEIGHYVRDCPNPKRNQKGEQANVVSAGETT 417
               C+ C + GH  +D    KR  K E  N   AG  T
Sbjct: 265  --TCWYCKKEGHVKKDYFARKR--KLESENPGEAGVIT 298


>emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis]
          Length = 1334

 Score =  104 bits (259), Expect = 2e-19
 Identities = 70/267 (26%), Positives = 126/267 (47%), Gaps = 5/267 (1%)
 Frame = -1

Query: 1229 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLE----YXXXXXXXXXXXXXDL 1062
            MS   + +  F    DF +WK KMK +L+ +    A+  E                  + 
Sbjct: 1    MSLPRHEIEKFTIGGDFSLWKLKMKALLVHQGLESALDEEDLEASTGSGIDDKRRQIQNR 60

Query: 1061 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 882
            A S + L L D +L  +    +A  +W K+ETL  + SLA ++FL +  ++F +    +I
Sbjct: 61   AHSTLILSLGDSILREISEEKTALGIWNKVETLCMKKSLAHRLFLKKRLYTFSMREGVTI 120

Query: 881  TENLNVFNKLIKDIKQTGD-KGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMN 705
             ++++ FNK+I D++   + K  D    + LL+++P+SY      + YGR  +TL+ V  
Sbjct: 121  QDHIDTFNKIILDLEGVENVKICDEDKAFFLLSSLPKSYEGFVDTMLYGRTTLTLEDVKA 180

Query: 704  ALKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPR 525
            +L  KE   N      +G   +     K +   ++    +G   + ++ K       K R
Sbjct: 181  SLSSKEIQKNCELETSNGEGLMARTEKKKDQKNKNQGKGHGKNQETADKK------KKKR 234

Query: 524  RCYNCGEIGHYVRDCPNPKRNQKGEQA 444
            +C+ C + GHY+RDC   K+ +  E++
Sbjct: 235  KCFYCRKEGHYIRDCFEKKKKESQEKS 261


>emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana]
          Length = 560

 Score = 99.4 bits (246), Expect = 5e-18
 Identities = 61/218 (27%), Positives = 107/218 (49%), Gaps = 3/218 (1%)
 Frame = -1

Query: 1061 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 882
            A+++I   + D VL  +D+  SA ++W+ L   Y ETSL +++++   F+SFK++ +KSI
Sbjct: 78   AMNIIITHVGDAVLRKIDHCKSAAEMWKTLNKQYMETSLPNRIYVQLKFYSFKMNDSKSI 137

Query: 881  TENLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNA 702
             EN+N F K++ ++       ++     + LN +   Y  +K  +KYG   ++L  V+++
Sbjct: 138  NENVNEFLKIVAELSSLEINVVEEVRAILFLNGLSSRYSQLKHTLKYGNKALSLQDVISS 197

Query: 701  LKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQH---NEHANGSGDKKSNDKGAFKPRYK 531
             +  E++L+  K        V + N +     ++   N+   G G  KSN          
Sbjct: 198  ARSLERELDEQKETDKNTSTVLYTNERGRPLTRNQNQNKGGQGRGRSKSNSNAKL----- 252

Query: 530  PRRCYNCGEIGHYVRDCPNPKRNQKGEQANVVSAGETT 417
               C+ C + GH  +DC   KR  K E  N   AG  T
Sbjct: 253  --TCWYCKKEGHVKKDCFARKR--KLESENPGEAGVIT 286


>emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera]
          Length = 939

 Score = 97.8 bits (242), Expect = 2e-17
 Identities = 58/186 (31%), Positives = 96/186 (51%), Gaps = 2/186 (1%)
 Frame = -1

Query: 1229 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAIT--LEYXXXXXXXXXXXXXDLAI 1056
            M S+   +  F G +DF +W+ +MK IL  +    A+    E              + A 
Sbjct: 1    MGSIKSEIERFIGKNDFNVWRMRMKAILFQQGVKDALKDESELPVTMTAKEKSDIDEKAY 60

Query: 1055 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 876
             +I L L D  L       +AK +W KLE LY + SL+++++L E  + FK+   +SI +
Sbjct: 61   HLIILALGDKALREFSEETTAKGVWNKLEQLYMQNSLSNRLYLKERLYGFKMQEDRSIAD 120

Query: 875  NLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 696
            NL+ F K++ ++   G K  D     ++L ++P  Y + K  +KYGR  +TL+ V +AL+
Sbjct: 121  NLDDFAKIVLEMSNIGIKVDDEDKAVLVLKSLPGLYSNFKETMKYGRKTLTLEEVQSALR 180

Query: 695  HKEKDL 678
             KE +L
Sbjct: 181  SKELEL 186


>ref|XP_004165166.1| PREDICTED: uncharacterized protein LOC101225243 [Cucumis sativus]
          Length = 158

 Score = 95.1 bits (235), Expect = 1e-16
 Identities = 50/145 (34%), Positives = 84/145 (57%), Gaps = 2/145 (1%)
 Frame = -1

Query: 1199 FDGSSDFGMWKRKMKCILIDKRAYKAIT--LEYXXXXXXXXXXXXXDLAISVICLGLSDC 1026
            FDG  DF +WK K+K +L  ++A+KA+   LE               +A   + L +SD 
Sbjct: 11   FDGKGDFALWKAKIKALLGQQKAHKALLDPLELPTILTATQKEEIKLIAYGTLILNISDN 70

Query: 1025 VLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNKLIK 846
            ++  V   ++A  +W+KLE+LYA   L +++ L E  F++K+D +K++TENL+ F K++ 
Sbjct: 71   IIRQVLEEETAHKVWKKLESLYATKDLPNKICLREKIFTYKMDSSKTLTENLDEFKKIVS 130

Query: 845  DIKQTGDKGIDLYAPYVLLNAIPES 771
            + K   DK  D    +VLLN +P++
Sbjct: 131  NFKSLEDKLDDENEAFVLLNFLPKA 155


>gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1137

 Score = 95.1 bits (235), Expect = 1e-16
 Identities = 61/212 (28%), Positives = 108/212 (50%), Gaps = 4/212 (1%)
 Frame = -1

Query: 1061 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 882
            A+ +I + + D VL +++N  +A + W  L+ LY   SL ++++L    +++++  +K++
Sbjct: 44   AMDMIFINVGDKVLRNIENSKTAAEAWATLDKLYLVKSLPNRVYLQLKVYNYRMQDSKTL 103

Query: 881  TENLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNA 702
             EN++ F K+I D+     +  D     ++L+A+P+SY  +K  +KYGR+ + LD V++A
Sbjct: 104  EENVDEFQKMISDLNNLQIQVPDEVQAILILSALPDSYDMLKETLKYGREGIKLDDVISA 163

Query: 701  LKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRR 522
             K KE +L           +  +V GK+         A GS   KS +          + 
Sbjct: 164  AKSKELELRDSSGGSRPVGEGLYVRGKS--------QARGSDGPKSTE--------GKKV 207

Query: 521  CYNCGEIGHYVRDC----PNPKRNQKGEQANV 438
            C+ CG+ GH+ R C       K N  GE A V
Sbjct: 208  CWICGKEGHFKRQCYKWLEKNKANGAGETALV 239


>emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana]
            gi|7267743|emb|CAB78169.1| putative retrotransposon
            [Arabidopsis thaliana]
          Length = 1230

 Score = 94.7 bits (234), Expect = 1e-16
 Identities = 80/288 (27%), Positives = 130/288 (45%), Gaps = 19/288 (6%)
 Frame = -1

Query: 1229 MSSMVYGLVPFDGSSDFGMWKRKMKC------ILIDKRAYKAIT--LEYXXXXXXXXXXX 1074
            MSS    +  FDG  D+ +WK K+        + +  R  ++++  LE            
Sbjct: 1    MSSARVEMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGD 60

Query: 1073 XXDL-------AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETF 915
               L       A S I L +SD VL       +A  + E L+ LY   +L ++++L +  
Sbjct: 61   KEALMEEKRQKARSTIVLSVSDQVLRKSKKEKTAPSMLEALDKLYMSKALPNRIYLKQKL 120

Query: 914  FSFKIDVTKSITENLNVFNKLIKDIKQTGDKGIDLYAPYVLLNAIPESYGDVKSAIKYGR 735
            +S+K+    S+  N++ F +LI D++ T     D     +LL ++P+ +  +K  +KYG 
Sbjct: 121  YSYKMQENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLKDTLKYGS 180

Query: 734  DKVTLDV--VMNALKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSN 561
             + TL V  V+ A+  KE +L   K    G  +  +V  K        E    S  K+  
Sbjct: 181  GRTTLSVDEVVAAIYSKELELGSNKKSIRGQAEGLYVKDKP-------ETRGMSEQKEKG 233

Query: 560  DKGAFKPRYKP-RRCYNCGEIGHYVRDCPNP-KRNQKGEQANVVSAGE 423
            +KG  + R K  + C+ CGE GH+   CPN  K+  KG+     S GE
Sbjct: 234  NKGRSRSRSKGWKGCWICGEEGHFKTSCPNKGKQQNKGKDQASGSKGE 281

