BLASTX nr result

ID: Catharanthus23_contig00012386 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00012386
         (832 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661...   112   1e-22
ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624...    99   2e-18
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...    88   4e-15
gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha...    88   4e-15
ref|XP_006397294.1| hypothetical protein EUTSA_v10029485mg [Eutr...    86   2e-14
gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab...    80   9e-13
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t...    80   1e-12
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...    78   4e-12
gb|AAT71979.1| At5g39185 [Arabidopsis thaliana]                        77   1e-11
emb|CAN74230.1| hypothetical protein VITISV_000585 [Vitis vinifera]    76   1e-11
emb|CAN81001.1| hypothetical protein VITISV_006992 [Vitis vinifera]    76   2e-11
dbj|BAB10837.1| retroelement pol polyprotein-like [Arabidopsis t...    74   6e-11
gb|AAD15368.1| putative retroelement pol polyprotein [Arabidopsi...    74   8e-11
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...    73   1e-10
emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]    72   2e-10
ref|XP_006418883.1| hypothetical protein EUTSA_v10003063mg [Eutr...    72   3e-10
ref|XP_004240202.1| PREDICTED: uncharacterized protein LOC101267...    71   4e-10
ref|XP_004245419.1| PREDICTED: uncharacterized protein LOC101260...    70   7e-10
gb|EPS58738.1| hypothetical protein M569_16075, partial [Genlise...    69   2e-09
gb|EPS60009.1| hypothetical protein M569_14795, partial [Genlise...    69   2e-09

>ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661920 [Glycine max]
          Length = 516

 Score =  112 bits (281), Expect = 1e-22
 Identities = 80/256 (31%), Positives = 118/256 (46%), Gaps = 3/256 (1%)
 Frame = +3

Query: 60  KGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPT 239
           KG NY E                F+DGSI               +N+++ SWI NTIEP 
Sbjct: 49  KGENYDEWARAVRGSLRARRKFRFVDGSIKKPDDAAPEIDDWWTVNSMIVSWIFNTIEPK 108

Query: 240 LRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFG 419
           LRS+I Y E+ +ELW D+++ F +SNGPR  +L + LA CK  GD +  Y   LKKL   
Sbjct: 109 LRSTITYRENAQELWDDIKQRFSISNGPRIQQLKSELANCKQNGDSIVTYFGRLKKLWDE 168

Query: 420 IEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRSG 599
           +        Q+        K   I  +  ++R  + ++QFLM     GL+D  F TVRS 
Sbjct: 169 LNDF----DQIPMCTCNGCKC-GISAALNKKREEEKLHQFLM-----GLDDTQFRTVRSN 218

Query: 600 ITHEEPLPKLK---QIMARHLQGGTPSTHDSDFNSGRERKYDHGFR*RWNKPVVQTRTKS 770
           +   +PLP L    Q++ +  + G  +    +           G    W K    T ++ 
Sbjct: 219 VLSLDPLPNLNRAYQMVVQEERVGVMTRGKEERGDPIAFAVKSGRTSSWEKK-PNTGSEK 277

Query: 771 VCTHCQKQGHDVIPVF 818
            C+HC++ GHD+   F
Sbjct: 278 PCSHCKRDGHDIDSCF 293


>ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624694 isoform X1 [Citrus
           sinensis] gi|568852764|ref|XP_006480041.1| PREDICTED:
           uncharacterized protein LOC102624694 isoform X2 [Citrus
           sinensis] gi|568852766|ref|XP_006480042.1| PREDICTED:
           uncharacterized protein LOC102624694 isoform X3 [Citrus
           sinensis]
          Length = 320

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 75/221 (33%), Positives = 112/221 (50%), Gaps = 12/221 (5%)
 Frame = +3

Query: 192 MNTLVGSWILNTIEPTLRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GG 371
           +N+++ SWILNTIEPTLRS+I ++E  ++LW D++E F V NGPR ++L + LA CK  G
Sbjct: 16  VNSMIVSWILNTIEPTLRSTITHMEVAKKLWDDIKERFSVGNGPRVHQLKSELAECKQRG 75

Query: 372 DFVNVYHS*LKKL--------SFGIEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKS 527
             +  Y+  LK +         + I    G T +L            +    + +R    
Sbjct: 76  MTILSYYGKLKLIWEELANYEQYPICSCGGCTCELEA---------KLNKKCEEER---- 122

Query: 528 IYQFLMD*MIIGLNDEIFGTVRSGITHEEPLPKLKQIMARHLQGGTPSTHDSDFNSGRE- 704
           ++QFLM     GL+D I+G+VRS I   +PLP L +  +  +Q     T       G+E 
Sbjct: 123 LHQFLM-----GLDDTIYGSVRSNILSTDPLPPLNRAYSLVVQEERVQT----ITRGKEG 173

Query: 705 RKYDHGFR*RWN-KPVVQTRTKS--VCTHCQKQGHDVIPVF 818
           R     F  +   K  ++ R KS  +C HC+K GHD    F
Sbjct: 174 RGEPVAFAVQGGVKGQIEIREKSSVICKHCRKTGHDADSCF 214


>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1496

 Score = 87.8 bits (216), Expect = 4e-15
 Identities = 75/264 (28%), Positives = 119/264 (45%), Gaps = 6/264 (2%)
 Frame = +3

Query: 45  SKRTFKGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILN 224
           S    K NNYAE               GFIDGSI               +N+++  WI  
Sbjct: 35  SSVVLKENNYAEWSEELQNFLRAKQKLGFIDGSIPKPAADPELSLWIA-INSMIVGWIRT 93

Query: 225 TIEPTLRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LK 404
           +I+PT+RS++ ++    +LW +L   F V NG RK  L   +A C   G  V  Y+  L 
Sbjct: 94  SIDPTIRSTVGFVSEASQLWENLRRRFSVGNGVRKTLLKDEIAACTQDGQPVLAYYGRLI 153

Query: 405 KLSFGIEKVMGLTSQLYTIAIISYKFRNIGHSD-QRQRGGKSIYQFLMD*MIIGLNDEIF 581
           KL    E++    S          + +    SD +++R    +++FL     +GL D  F
Sbjct: 154 KL---WEELQNYKS--------GRECKCEAASDIEKEREDDRVHKFL-----LGL-DSRF 196

Query: 582 GTVRSGITHEEPLPKLKQIMARHLQGGTPSTHDSDFNSGRERKYDH----GFR*RWN-KP 746
            ++RS IT  EPLP L Q+ +R ++       + + N+ R +        GF  + +  P
Sbjct: 197 SSIRSSITDIEPLPDLYQVYSRVVR------EEQNLNASRTKDVVKTEAIGFSVQSSTTP 250

Query: 747 VVQTRTKSVCTHCQKQGHDVIPVF 818
             + ++   CTHC ++GH+V   F
Sbjct: 251 RFRDKSTLFCTHCNRKGHEVTQCF 274


>gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana]
          Length = 1468

 Score = 87.8 bits (216), Expect = 4e-15
 Identities = 67/255 (26%), Positives = 108/255 (42%), Gaps = 8/255 (3%)
 Frame = +3

Query: 60  KGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPT 239
           K NNY E               GF+DG+I               +N L+ SW+  TI+  
Sbjct: 39  KTNNYEEWACGFKTALRSRKKFGFLDGTIPQPLDGSPDLEDWLTINALLVSWMKMTIDSE 98

Query: 240 LRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFG 419
           L ++I++ +   +LW  + + F VSNGP+  ++ A LA CK  G  V  Y+  L K+   
Sbjct: 99  LLTNISHRDVARDLWEQIRKRFSVSNGPKNQKMKADLATCKQEGMTVEGYYGKLNKIWDN 158

Query: 420 IEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRSG 599
           I     L      I        N+G   ++ R    ++Q+L      GLN+  F T+RS 
Sbjct: 159 INSYRPL-----RICKCGRCICNLGTDQEKYREDDMVHQYL-----YGLNETKFHTIRSS 208

Query: 600 ITHEEPLPKLKQIMARHLQGGTPSTHDSDFNSGRERKYDHGFR*RWNKPVV--------Q 755
           +T   PLP L+++    ++      ++   N  R        + R    V+        +
Sbjct: 209 LTSRVPLPGLEEVY-NIVRQEEDMVNNRSSNEERTDVTAFAVQMRPRSEVISEKFANSEK 267

Query: 756 TRTKSVCTHCQKQGH 800
            + K +CTHC + GH
Sbjct: 268 LQNKKLCTHCNRGGH 282


>ref|XP_006397294.1| hypothetical protein EUTSA_v10029485mg [Eutrema salsugineum]
           gi|557098311|gb|ESQ38747.1| hypothetical protein
           EUTSA_v10029485mg [Eutrema salsugineum]
          Length = 196

 Score = 85.5 bits (210), Expect = 2e-14
 Identities = 57/193 (29%), Positives = 92/193 (47%)
 Frame = +3

Query: 60  KGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPT 239
           +G NY +               GFI+G++               +N+++ +WI+NTIE  
Sbjct: 19  RGENYEDWAKHVRNALRTKRKLGFIEGTLPKPTAPKELEQWEV-VNSMLVAWIMNTIESN 77

Query: 240 LRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFG 419
           L+++I+ ++  +ELW DL+  FLV NGP+  EL A +A C+  GD + VY   LK     
Sbjct: 78  LKTTISMVDEAKELWDDLKLQFLVGNGPQISELRADIANCRQNGDSIMVYFEKLK----- 132

Query: 420 IEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRSG 599
                     ++    +    R     + R +  + + +   +  + GL+ E FGTVRS 
Sbjct: 133 ----------MWDELAVYKPIRTCSCGELRAQLEEDLEEERTNTFLTGLDAERFGTVRST 182

Query: 600 ITHEEPLPKLKQI 638
           I   EPLPKL Q+
Sbjct: 183 IRSLEPLPKLTQV 195


>gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score = 80.1 bits (196), Expect = 9e-13
 Identities = 66/256 (25%), Positives = 104/256 (40%), Gaps = 4/256 (1%)
 Frame = +3

Query: 45  SKRTFKGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILN 224
           SK   +G NY E               GF DGSI                N LV SW+  
Sbjct: 38  SKPLLRGPNYDEWATNLRLALKARKKFGFADGSIPQPVETDPDFEDWTANNALVVSWMKL 97

Query: 225 TIEPTLRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LK 404
           TI+ T+ +S+++L+   ELW  +++ F V NG R   L   LA C+  G  +  Y+    
Sbjct: 98  TIDETVSTSMSHLDDSHELWTHIQKRFGVKNGQRVQRLKTELATCRQKGVAIETYY---- 153

Query: 405 KLSFGIEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFG 584
                     G  SQL+  ++  Y+        +++R    ++QFLM     GL++ ++G
Sbjct: 154 ----------GRLSQLWR-SLADYQQAKTMDDVRKEREEDKLHQFLM-----GLDESVYG 197

Query: 585 TVRSGITHEEPLPKLKQIMARHLQGGTPSTHDSDFNS----GRERKYDHGFR*RWNKPVV 752
            V+S +    PLP L++            T D +  S      ER     F  +      
Sbjct: 198 AVKSALLSRVPLPSLEEAY-------NALTQDEESKSLSRLHNERVDGVSFAVQTTSRPR 250

Query: 753 QTRTKSVCTHCQKQGH 800
            +    VC++C + GH
Sbjct: 251 DSSENRVCSNCGRVGH 266


>dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1098

 Score = 79.7 bits (195), Expect = 1e-12
 Identities = 69/269 (25%), Positives = 114/269 (42%), Gaps = 16/269 (5%)
 Frame = +3

Query: 60  KGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPT 239
           K +NY+E               GF+DG+I                + ++G WI  +I+PT
Sbjct: 32  KEDNYSEWAEELMNSLQAKQKLGFLDGTIPKPTTEPALSSWKAANSMIIG-WIRTSIDPT 90

Query: 240 LRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFG 419
           +RS++ ++   ++LW  L++ F   NG RK  L   +  CK  G  V VY+  L KL   
Sbjct: 91  IRSTVAFVSDAKDLWDSLKQRFSNGNGVRKQLLKDEILACKQDGQSVLVYYGRLTKLWEE 150

Query: 420 IEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRSG 599
           ++     TS+  T                ++R    ++QFL++       DE F  +RS 
Sbjct: 151 LQNYK--TSRTCTC--------EAAPDIAKEREDDKVHQFLLN------LDERFRPIRST 194

Query: 600 ITHEEPLPKLKQIMARHLQGGTPSTHDSDFNSGR----ERKYDHGFR*R----------- 734
           IT ++PLP L Q+ +R +        + + N+ R     +    GF  +           
Sbjct: 195 ITVQDPLPALNQVYSRVIH------EEQNLNASRIKDDIKTEAVGFTVQATPLPPTPQVA 248

Query: 735 -WNKPVVQTRTKSVCTHCQKQGHDVIPVF 818
             + P  + R+   CTH  +QGHD+   F
Sbjct: 249 AVSAPRFRDRSSLTCTHYHRQGHDITECF 277


>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score = 77.8 bits (190), Expect = 4e-12
 Identities = 63/259 (24%), Positives = 112/259 (43%), Gaps = 12/259 (4%)
 Frame = +3

Query: 63  GNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPTL 242
           G+NY E               GFI+GSI               +N+++  WI  +IEP +
Sbjct: 46  GDNYNEWSTEMLNALQAKRKTGFINGSISKPPLDNPDYENWQAVNSMIVGWIRASIEPKV 105

Query: 243 RSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKL--SF 416
           +S++ ++    +LW +L++ F V N  R +++ A LA C+  G  V  Y+  L KL   F
Sbjct: 106 KSTVTFISDAHQLWSELKQRFSVGNKVRVHQIKAQLAACRQDGQPVIDYYGRLCKLWEEF 165

Query: 417 GIEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRS 596
            I K +       T+               ++R  + I+QF     ++GL+D  FG + +
Sbjct: 166 QIYKPI-------TVCKCGLCTCGATLEPSKEREEEKIHQF-----VLGLDDSRFGGLSA 213

Query: 597 GITHEEPLPKLKQIMARHLQGGTPSTHDSDFNSGRERKYDHGFR*RWNKPVVQTRTKS-- 770
            +   +P P L +I +R ++        +      +++   GF  R ++     RT S  
Sbjct: 214 TLIAMDPFPSLGEIYSRVVR---EEQRLASVQIREQQQSAIGFLTRQSEVTADGRTDSSI 270

Query: 771 --------VCTHCQKQGHD 803
                   +C+HC + GH+
Sbjct: 271 IKSRDRSVLCSHCGRSGHE 289


>gb|AAT71979.1| At5g39185 [Arabidopsis thaliana]
          Length = 348

 Score = 76.6 bits (187), Expect = 1e-11
 Identities = 64/260 (24%), Positives = 108/260 (41%), Gaps = 8/260 (3%)
 Frame = +3

Query: 45  SKRTFKGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILN 224
           SK   +G NY E               GF DG+I                N LV SW+  
Sbjct: 37  SKPLLRGPNYDEWATNLRLALKARKKFGFADGTIPQPDETNPDFDDWIANNALVVSWMKL 96

Query: 225 TIEPTLRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LK 404
           TI  +L +S+++L+   ++W  +++ F V NG R   L   LA C+  G  +  Y+    
Sbjct: 97  TIHESLATSMSHLDDSHDMWTHIQKRFGVKNGQRIQRLKTELATCRQKGTPIETYY---- 152

Query: 405 KLSFGIEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFG 584
                     G  SQL+  ++  Y+        +++R    ++QFLM     GL++ ++G
Sbjct: 153 ----------GKLSQLWR-SLADYQQAKTMEEVRKEREEDKLHQFLM-----GLDESMYG 196

Query: 585 TVRSGITHEEPLPKLKQ---IMARHLQGGTPSTHDSDFNSGRERKYDHGFR*RWNKPVVQ 755
            V+S +    PLP L++    + +  +  + S    + N G               P  +
Sbjct: 197 AVKSALLSRVPLPSLEEAYNTLTQDEESKSLSRLHDERNDGVSFAVQ-------TTPRTR 249

Query: 756 TRTKS-----VCTHCQKQGH 800
           + TK+     VC+HC + GH
Sbjct: 250 SLTKNKDSAIVCSHCGRLGH 269


>emb|CAN74230.1| hypothetical protein VITISV_000585 [Vitis vinifera]
          Length = 334

 Score = 76.3 bits (186), Expect = 1e-11
 Identities = 64/256 (25%), Positives = 107/256 (41%), Gaps = 4/256 (1%)
 Frame = +3

Query: 51  RTFKGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTI 230
           +  +G+NY+                GF+ GSI                N +V SW+LN+I
Sbjct: 36  KVLEGDNYSTWSRAMRISLSAKDKIGFVTGSIKPPSSTDDSFPSWQRCNDMVISWLLNSI 95

Query: 231 EPTLRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKL 410
            P + SS+ Y E   E+W DL E F   N  R Y++   +   + G   ++VY++ LK  
Sbjct: 96  HPDIASSVIYAETASEIWADLRERFSQGNDSRIYQIKRDIVEHRQGQQSISVYYTKLK-- 153

Query: 411 SFGIEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTV 590
           +F  E    L+S    ++        +   D+++R    + QFLM     GLND  +  +
Sbjct: 154 AFXDE----LSSYHEVLSCSCGGLEKLKERDEKER----VMQFLM-----GLNDS-YAAI 199

Query: 591 RSGITHEEPLPKLKQIMARHLQGGTPSTHDSDFNSGRERKY----DHGFR*RWNKPVVQT 758
           R  I    PLP  + + +  LQ       +   N+G +  Y    D   +      V + 
Sbjct: 200 RGQILLMXPLPDTRXVYSLVLQ--QEKQVEVSLNNGNKNHYAMLADRDNKATSAHXVQKQ 257

Query: 759 RTKSVCTHCQKQGHDV 806
           +T   C++C +  H +
Sbjct: 258 KTPLHCSYCDRDXHSI 273


>emb|CAN81001.1| hypothetical protein VITISV_006992 [Vitis vinifera]
          Length = 1131

 Score = 75.9 bits (185), Expect = 2e-11
 Identities = 65/235 (27%), Positives = 103/235 (43%), Gaps = 10/235 (4%)
 Frame = +3

Query: 51  RTFKGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTI 230
           +  +G+NY+                GF+ GSI                N +V SW+LN+I
Sbjct: 33  KVLEGDNYSTWSRAMRISLSAKDKIGFVTGSIKPPSSTDDSFPSWQRCNDMVISWLLNSI 92

Query: 231 EPTLRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKL 410
            P + SS+ Y E   E+W DL E F   N  R Y++   +   + G   ++VY++ LK  
Sbjct: 93  HPDIASSVIYAETTSEIWADLRERFSQGNDSRIYQIKRDIVEHRQGQQSISVYYTKLK-- 150

Query: 411 SFGIEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTV 590
           +F  E    L+S    ++        +   D+++R    + QFLM     GLND  +  +
Sbjct: 151 AFWDE----LSSYHEVLSCSCGGLEKLKEMDEKER----VMQFLM-----GLNDS-YAAI 196

Query: 591 RSGITHEEPLPKLKQIMARHLQ-------GGTP---STHDSDFNSGRERKYDHGF 725
           R  I   +PLP  +++ +  LQ         TP   S  D D++S  +  Y HGF
Sbjct: 197 RGQILLMQPLPDTRRVYSLVLQQEKQVQKQKTPLHCSYCDRDYHSIEKCYYLHGF 251


>dbj|BAB10837.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1462

 Score = 73.9 bits (180), Expect = 6e-11
 Identities = 52/197 (26%), Positives = 86/197 (43%)
 Frame = +3

Query: 45  SKRTFKGNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILN 224
           SK   +G NY E               GF DG+I                N LV SW+  
Sbjct: 37  SKPLLRGPNYDEWATNLRLALKARKKFGFADGTIPQPDETNPDFDDWIANNALVVSWMKL 96

Query: 225 TIEPTLRSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LK 404
           TI  +L +S+++L+   ++W  +++ F V NG R   L   LA C+  G  +  Y+    
Sbjct: 97  TIHESLATSMSHLDDSHDMWTHIQKRFGVKNGQRIQRLKTELATCRQKGTPIETYY---- 152

Query: 405 KLSFGIEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFG 584
                     G  SQL+  ++  Y+        +++R    ++QFLM     GL++ ++G
Sbjct: 153 ----------GKLSQLWR-SLADYQQAKTMEEVRKEREEDKLHQFLM-----GLDESMYG 196

Query: 585 TVRSGITHEEPLPKLKQ 635
            V+S +    PLP L++
Sbjct: 197 AVKSALLSRVPLPSLEE 213


>gb|AAD15368.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|17065314|gb|AAL32811.1| putative retroelement pol
           polyprotein [Arabidopsis thaliana]
           gi|21387147|gb|AAM47977.1| putative retroelement pol
           polyprotein [Arabidopsis thaliana]
          Length = 411

 Score = 73.6 bits (179), Expect = 8e-11
 Identities = 71/255 (27%), Positives = 100/255 (39%), Gaps = 7/255 (2%)
 Frame = +3

Query: 63  GNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPTL 242
           G+NY                 GF+DGS+                N++V SWILN +   +
Sbjct: 86  GSNYNSWSIAMRISLDAKNKLGFVDGSLLRPSVDDSTFRIWSRCNSMVKSWILNVVNKEI 145

Query: 243 RSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFGI 422
             SI Y E   E+W DL   F V+N PRKY+L  A+   K G   ++ Y +         
Sbjct: 146 YDSILYYEDAVEMWTDLFTRFRVNNLPRKYQLEQAVMTLKQGSLNLSTYFT--------- 196

Query: 423 EKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKS--IYQFLMD*MIIGLNDEIFGTVRS 596
            K   L  QL      S K  +     +     ++  + QFLM     GLND+ F T+ S
Sbjct: 197 -KKKTLWEQLLNTKTRSVKKCDCDQVKELLEDAETSRVIQFLM-----GLNDD-FNTIMS 249

Query: 597 GITHEEPLPKLKQI--MARHLQGGTPSTHDSDFNSGRERKYDHGFR*RWNKPVVQTR--- 761
            I + +P P L +I  M    +      H S            G     N P++  +   
Sbjct: 250 QILNMKPRPGLNEIYNMLDQDESQRLVGHASKPTPSPAAFQTQGLLTEQN-PILMAQGNF 308

Query: 762 TKSVCTHCQKQGHDV 806
            K  CTHC + GH V
Sbjct: 309 KKPKCTHCNRIGHTV 323


>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score = 72.8 bits (177), Expect = 1e-10
 Identities = 61/259 (23%), Positives = 110/259 (42%), Gaps = 12/259 (4%)
 Frame = +3

Query: 63  GNNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPTL 242
           G+NY E               GFI+GSI               +N+++  WI  +IEP +
Sbjct: 46  GDNYNEWSTKMLNALQAKRKTGFINGSISKPPLDNPDYENWQAVNSMIVGWIRASIEPKV 105

Query: 243 RSSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKL--SF 416
           +S++ ++    +LW +L++ F V N    +++   LA C+  G  V  Y+  L KL   F
Sbjct: 106 KSTVTFICDAHQLWSELKQRFSVGNKVHVHQIKTQLAACRQDGQPVIDYYGRLCKLWEEF 165

Query: 417 GIEKVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRS 596
            I K +       T+               ++R  + I+QF     ++GL+D  FG + +
Sbjct: 166 QIYKPI-------TVCKCGLCTCGATLEPSKEREEEKIHQF-----VLGLDDSRFGGLSA 213

Query: 597 GITHEEPLPKLKQIMARHLQGGTPSTHDSDFNSGRERKYDHGFR*RWNKPVVQTRTKS-- 770
            +   +P P L +I +R ++        +      +++   GF  R ++     RT S  
Sbjct: 214 TLIAMDPFPSLGEIYSRVVR---EEQRLASVQIREQQQSAIGFLTRQSEVTADGRTDSSI 270

Query: 771 --------VCTHCQKQGHD 803
                   +C+HC + GH+
Sbjct: 271 IKSRDRSVLCSHCGRSGHE 289


>emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]
          Length = 1316

 Score = 72.0 bits (175), Expect = 2e-10
 Identities = 56/180 (31%), Positives = 76/180 (42%)
 Frame = +3

Query: 279 LWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFGIEKVMGLTSQLYT 458
           +W DL+E + V N PR ++L + +   K  G  V  Y++          K+ G+  +L  
Sbjct: 1   MWEDLKERYAVGNAPRVHQLRSEIVNLKQEGMTVAAYYA----------KIKGMWDELNQ 50

Query: 459 IAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRSGITHEEPLPKLKQI 638
              I         +  + R  +  +QFLM     GL+D  FGTVRS I   +PLP L +I
Sbjct: 51  YIEIPECTCGAAQAIVKSREDEKAHQFLM-----GLDDTTFGTVRSSILALDPLPTLGKI 105

Query: 639 MARHLQGGTPSTHDSDFNSGRERKYDHGFR*RWNKPVVQTRTKSVCTHCQKQGHDVIPVF 818
            A      T          G +R     F     KP  QT     CTHC K GHDV   F
Sbjct: 106 YAM----VTQEERHRSMARGADRAEITVFAAXTEKPGGQTNKSGSCTHCGKTGHDVADCF 161


>ref|XP_006418883.1| hypothetical protein EUTSA_v10003063mg [Eutrema salsugineum]
           gi|557096811|gb|ESQ37319.1| hypothetical protein
           EUTSA_v10003063mg [Eutrema salsugineum]
          Length = 197

 Score = 71.6 bits (174), Expect = 3e-10
 Identities = 60/193 (31%), Positives = 81/193 (41%)
 Frame = +3

Query: 66  NNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPTLR 245
           NNY                 GFIDG I               +N+++ +WILNTIEP LR
Sbjct: 5   NNYERWSKLMRNSLKAKNKLGFIDGVITEPEGVKELKKWGI-VNSMLVAWILNTIEPELR 63

Query: 246 SSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFGIE 425
            S++  E   +LW D+ E F V N PR YEL AA    K     V  Y++ +K +   I+
Sbjct: 64  GSVSCAETAHQLWTDIRERFSVDNEPRIYELQAAFNSYKQEKQTVQDYYAKMKLMWDAID 123

Query: 426 KVMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRSGIT 605
           +   L            K   +  +   QR  +   QFLM     GL+   FG VRS I 
Sbjct: 124 EFEPLME-----CCCGGKSCKVIKALIEQRDKQRRRQFLM-----GLDAGRFGNVRSNIL 173

Query: 606 HEEPLPKLKQIMA 644
              P P L  + +
Sbjct: 174 CMSPPPNLNAVFS 186


>ref|XP_004240202.1| PREDICTED: uncharacterized protein LOC101267997 [Solanum
           lycopersicum]
          Length = 811

 Score = 71.2 bits (173), Expect = 4e-10
 Identities = 62/240 (25%), Positives = 107/240 (44%), Gaps = 15/240 (6%)
 Frame = +3

Query: 126 GFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPTLRSSINYLEHVEELWVDLEEWF 305
           GFIDG+                 + +V SWILN++   +  S+ Y+ +  ELW +LE+ +
Sbjct: 59  GFIDGNCAKPAENSPQARQWQRCDDMVTSWILNSLTKEIADSVEYVSNSCELWKELEDRY 118

Query: 306 LVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFGIEKVMGLTSQLYTIAIISYKFR 485
             +NG + Y++   +     G   + VY++ LKKL    E++  L ++       + + +
Sbjct: 119 DQTNGAKLYQIQKKIDDLTQGTLDIIVYYTKLKKL---WEELNTLNTKSICTCTCTCRAK 175

Query: 486 NIGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRSGITHEEPLPKLKQIMARHLQ--- 656
           +  H  ++ R    + QFLM     GLN E++  +R  I    PLP   Q  +  +Q   
Sbjct: 176 DSMHKSEQDR---RLIQFLM-----GLN-EVYTVIRGNILMMSPLPSTAQAFSLLIQEEK 226

Query: 657 -------GGTPSTHDS-DFNSGRERKYDHGFR*RWNK----PVVQTRTKSVCTHCQKQGH 800
                    TP    S + N+GR  +   G+R  ++          R+  +C  C+KQGH
Sbjct: 227 QREYRPTSRTPMESVSLNANAGRGSQGGRGYRTNFSSNGELSNYNDRSTLICDFCKKQGH 286


>ref|XP_004245419.1| PREDICTED: uncharacterized protein LOC101260366 [Solanum
           lycopersicum]
          Length = 650

 Score = 70.5 bits (171), Expect = 7e-10
 Identities = 65/239 (27%), Positives = 105/239 (43%), Gaps = 15/239 (6%)
 Frame = +3

Query: 129 FIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPTLRSSINYLEHVEELWVDLEEWFL 308
           FIDGS                 + +V SWILN++   +  S+ Y+ +  ELW +LE+ + 
Sbjct: 60  FIDGSCVKPAENSPQARQWQRCDDMVTSWILNSLTKEIADSVEYVNNSCELWKELEDRYD 119

Query: 309 VSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFGIEKVMGLTSQLYTIAIISYKFRN 488
            +NG + Y++   +     G   + VY++ LKKL   +  +   T  +YT   I     +
Sbjct: 120 QTNGAKLYQIQKEIDDLTQGTLDITVYYTKLKKLWEELNTLN--TKSVYTCTCICGAKDS 177

Query: 489 IGHSDQRQRGGKSIYQFLMD*MIIGLNDEIFGTVRSGITHEEPLPKLKQIMARHLQ---- 656
           +  S+Q +R    + QFL     IGLN E++  +R  I    PLP   Q  +  +Q    
Sbjct: 178 MHKSEQDRR----LIQFL-----IGLN-EVYTVIRGNILMMSPLPSTAQAFSLLIQEEKQ 227

Query: 657 ------GGTPSTHDS-DFNSGRERKYDHGFR*RWNKPVV----QTRTKSVCTHCQKQGH 800
                   TP    S + N+GR  +   G R  ++          R+  +C  C+KQGH
Sbjct: 228 REYRPTSRTPMESISLNANAGRGSQGGRGHRTNFSSSSELNNNGNRSVLICDFCKKQGH 286


>gb|EPS58738.1| hypothetical protein M569_16075, partial [Genlisea aurea]
          Length = 156

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 43/115 (37%), Positives = 58/115 (50%)
 Frame = +3

Query: 66  NNYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPTLR 245
           +NY E               GF+DG+I               +N+L+ +WI NTIEP LR
Sbjct: 1   DNYEEWARGIRAGLRAKRKYGFLDGTIIDRPPEVSVDDWEQ-LNSLLVAWIFNTIEPNLR 59

Query: 246 SSINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKL 410
           S+I   + V+ LW DL + F +S+GPR   L   LA C+ GGD V  Y+  L KL
Sbjct: 60  STIMISDLVKPLWDDLRDRFGISHGPRLQYLKQELAKCRQGGDSVVQYYGRLTKL 114


>gb|EPS60009.1| hypothetical protein M569_14795, partial [Genlisea aurea]
          Length = 156

 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 51/159 (32%), Positives = 73/159 (45%)
 Frame = +3

Query: 69  NYAEXXXXXXXXXXXXXXXGFIDGSIXXXXXXXXXXXXXXXMNTLVGSWILNTIEPTLRS 248
           NY E               GF+DG+I               +N+++ +WI+NT+EP LR+
Sbjct: 2   NYDEWAKAMRAGLRAKKKYGFVDGTITERPPEISVDLWEQ-VNSMLVAWIINTVEPGLRT 60

Query: 249 SINYLEHVEELWVDLEEWFLVSNGPRKYELNAALAICK*GGDFVNVYHS*LKKLSFGIEK 428
           ++   + V  LW DL+E F VS+GPR  +L   LA C+ GGD V  Y   +KK       
Sbjct: 61  TVTITDLVFPLWNDLQERFCVSHGPRLTQLKIDLARCQQGGDSVVQYFGRMKKYWDEYTT 120

Query: 429 VMGLTSQLYTIAIISYKFRNIGHSDQRQRGGKSIYQFLM 545
           + GL S             N+     R+R    I+QFLM
Sbjct: 121 LDGLPS-----CNCGGCRCNLNLQLNRKRESDKIHQFLM 154


Top