BLASTX nr result

ID: Catharanthus23_contig00022210 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00022210
         (915 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]   100   2e-31
gb|AAT40486.1| putative polyprotein [Solanum demissum]                 84   2e-27
gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha...   100   2e-25
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...    87   2e-24
emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera]    87   2e-24
ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661...    91   2e-24
gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab...    86   3e-24
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...    89   4e-24
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...    85   6e-24
ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624...    77   7e-23
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...    79   2e-21
ref|XP_004243119.1| PREDICTED: uncharacterized protein LOC101247...    86   6e-21
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t...    77   1e-19
gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidop...    89   1e-19
emb|CAN78026.1| hypothetical protein VITISV_032464 [Vitis vinifera]    77   4e-18
emb|CAN80919.1| hypothetical protein VITISV_002640 [Vitis vinifera]    76   3e-17
emb|CAN74847.1| hypothetical protein VITISV_028741 [Vitis vinifera]    73   6e-17
gb|EPS69771.1| hypothetical protein M569_04993 [Genlisea aurea]        61   6e-16
gb|AAC98469.1| putative retroelement pol polyprotein [Arabidopsi...    77   3e-14
ref|XP_004234727.1| PREDICTED: uncharacterized protein LOC101248...    82   3e-13

>emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]
          Length = 1316

 Score =  100 bits (249), Expect(2) = 2e-31
 Identities = 71/215 (33%), Positives = 106/215 (49%), Gaps = 3/215 (1%)
 Frame = -3

Query: 643 TSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPH 464
           T+   +C++C  +GH++A+C+QL G+PDWW   +    RG          G GR NSY  
Sbjct: 141 TNKSGSCTHCGKTGHDVADCFQLKGYPDWWPTRQMGRGRG---------RGRGR-NSY-- 188

Query: 463 ATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSG 284
              G+G   GR H   A A  D   + Q +              G  ++         + 
Sbjct: 189 --AGRGATSGRVHYXNAVAEADTQEKGQCV--------------GHDVE--------RNI 224

Query: 283 IPGLSTEQWKSLLNILQN-QANSNRLS--CKVVITWIFYSGCSHHMTGTGDLFMNLYPVS 113
           IPGL+ + ++ L+ +L+N  +N+ +L+   K+V  WI  SG S HMTG  DLF  L    
Sbjct: 225 IPGLNDDNFQKLMALLRNGSSNAEKLTGKNKIVEEWILDSGASMHMTGRRDLFDWLRKWE 284

Query: 112 PYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8
              + LPDGTK VA+ +G V    +   +NVLY+P
Sbjct: 285 TACVGLPDGTKTVANEMGYVKLSKDLCLKNVLYVP 319



 Score = 62.8 bits (151), Expect(2) = 2e-31
 Identities = 30/68 (44%), Positives = 47/68 (69%)
 Frame = -2

Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVV 729
           I K RE+E+ +Q LMGL+D+ FG+VRS I+  +PLP +  I+ ++ +EE+H+++AR    
Sbjct: 65  IVKSREDEKAHQFLMGLDDTTFGTVRSSILALDPLPTLGKIYAMVTQEERHRSMARGADR 124

Query: 728 EEQTRAAA 705
            E T  AA
Sbjct: 125 AEITVFAA 132


>gb|AAT40486.1| putative polyprotein [Solanum demissum]
          Length = 1065

 Score = 83.6 bits (205), Expect(2) = 2e-27
 Identities = 59/213 (27%), Positives = 93/213 (43%), Gaps = 3/213 (1%)
 Frame = -3

Query: 634 KSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATR 455
           K  C+ C    +E    + LIG+P+WW   R                 GG++N   H  R
Sbjct: 218 KPPCAKCGKFNYETKKYFLLIGYPEWWGTGRE----------------GGKNNGRGHGGR 261

Query: 454 GKGHGQ-GRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIP 278
              +G  G + +G   A + N  +    +G      NA                      
Sbjct: 262 SGNYGDFGHSGTGRGVAAVANVAQATGSNGTTKKEANAWT-------------------- 301

Query: 277 GLSTEQWKSLLNILQNQ-ANSNRLSCKVV-ITWIFYSGCSHHMTGTGDLFMNLYPVSPYI 104
           GLS +QW +LL++L +   N  +L+  ++ I WI  +G SHHM+G   LF +L  V PY+
Sbjct: 302 GLSNDQWSALLSMLNSHNKNHEKLAGNILGICWIVDTGASHHMSGDAQLFNDLCDVPPYL 361

Query: 103 IRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5
           + LP+G+  +AS +  V         +VLY+P+
Sbjct: 362 VSLPNGSTTIAS-MEIVILTDKMKLHHVLYVPQ 393



 Score = 66.6 bits (161), Expect(2) = 2e-27
 Identities = 27/58 (46%), Positives = 45/58 (77%)
 Frame = -2

Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735
           + +ERE+E+V+Q  MGL+D +FG+  S+I+  +PLP +  ++ +I +EE+HQNLARA+
Sbjct: 130 LTQEREKEKVHQFSMGLDDKVFGTTHSNILSTKPLPTLNRVYAMIIQEERHQNLARAK 187


>gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana]
          Length = 1468

 Score =  100 bits (250), Expect(2) = 2e-25
 Identities = 67/222 (30%), Positives = 105/222 (47%), Gaps = 11/222 (4%)
 Frame = -3

Query: 637 HKSTCSYCRISGHEIANCYQLIGFPDWW-ERNRAKASRGSSLDRDRERTGGGRSNSYPHA 461
           +K  C++C   GH   NC+ LIG+P+WW +R R K++   S  R R R G G +   P  
Sbjct: 270 NKKLCTHCNRGGHSPENCFVLIGYPEWWGDRPRGKSNSNGSTSRGRGRFGPGFNGGQPRP 329

Query: 460 TRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAH--NDWVNAAIEGGRGLQITAPGTSANS 287
           T             + + V         M+G    ++ VN  I             S   
Sbjct: 330 T-------------YVNVV---------MTGPFPSSEHVNRVITD-----------SDRD 356

Query: 286 GIPGLSTEQWKSLLNILQ-----NQANSNRL---SCKVVITWIFYSGCSHHMTGTGDLFM 131
            + GL+ EQW+ ++ +L      N++N++     +C +  +WI  +G SHHMTG  +L  
Sbjct: 357 AVSGLTDEQWRGVVKLLNAGRSDNKSNAHETQSGTCSLFTSWILDTGASHHMTGNLELLS 416

Query: 130 NLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5
           ++  +SP +I L DG K VA   GTV  G + I ++V Y+ E
Sbjct: 417 DMRSMSPVLIILADGNKRVAVSEGTVRLGSHLILKSVFYVKE 458



 Score = 42.4 bits (98), Expect(2) = 2e-25
 Identities = 22/66 (33%), Positives = 39/66 (59%)
 Frame = -2

Query: 902 KEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVEE 723
           K RE++ V+Q L GLN++ F ++RS +    PLP ++ ++ ++ +EE   N   +   EE
Sbjct: 183 KYREDDMVHQYLYGLNETKFHTIRSSLTSRVPLPGLEEVYNIVRQEEDMVNNRSSN--EE 240

Query: 722 QTRAAA 705
           +T   A
Sbjct: 241 RTDVTA 246


>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score = 87.0 bits (214), Expect(2) = 2e-24
 Identities = 68/211 (32%), Positives = 97/211 (45%), Gaps = 5/211 (2%)
 Frame = -3

Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERT-GGGRSNSYPHATRGK 449
           CS+C  SGHE  +C+Q++GFPDWW                 ERT GGGR +S    +RG+
Sbjct: 280 CSHCGRSGHEKKDCWQIVGFPDWWT----------------ERTNGGGRGSS----SRGR 319

Query: 448 GHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSAN-SGIPGL 272
           G   GR+         +N GR                  GRG    A  T++N S  P  
Sbjct: 320 G---GRSSGS------NNSGR------------------GRGQVTAAHATTSNLSPFPEF 352

Query: 271 STEQWKSLLNILQNQAN--SNRLSCKVVI-TWIFYSGCSHHMTGTGDLFMNLYPVSPYII 101
           + +Q + +  ++QN+ N  S++LS K+ +   I  +G SHHMTG   L  N+  +    +
Sbjct: 353 TPDQLRVITQMIQNKNNGTSDKLSGKMKLGDVILDTGASHHMTGQLSLLTNIVTIPSCSV 412

Query: 100 RLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8
              DG K  A  +GT          NVLY+P
Sbjct: 413 GFADGRKTFAISMGTFKLSETVSLSNVLYVP 443



 Score = 53.1 bits (126), Expect(2) = 2e-24
 Identities = 26/70 (37%), Positives = 45/70 (64%)
 Frame = -2

Query: 905 AKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVE 726
           +KEREEE+++Q ++GL+DS FG + + +I  +P P +  I+  + +EE  Q LA  ++ E
Sbjct: 188 SKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREE--QRLASVQIRE 245

Query: 725 EQTRAAAGVT 696
           +Q  A   +T
Sbjct: 246 QQQSAIGFLT 255


>emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera]
          Length = 1157

 Score = 87.0 bits (214), Expect(2) = 2e-24
 Identities = 57/188 (30%), Positives = 84/188 (44%), Gaps = 3/188 (1%)
 Frame = -3

Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446
           CS C+  GHE+ +C+Q I +P+WW             DR R  T G          +G G
Sbjct: 218 CSNCKRKGHEVDSCFQRIAYPEWWG------------DRPRTTTSGCSGGHGRGVQQGTG 265

Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266
            G+GR  +  A+ V   G                  +GGR +       S  +GI GLS 
Sbjct: 266 GGRGRGGTARANVVQTLG-----------------TDGGRSVVTD----SNRTGISGLSD 304

Query: 265 EQWKSLLNILQNQ---ANSNRLSCKVVITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRL 95
           +QW +LL +L +    AN   +  + ++ WI  +G SHHMT T +   +L  + P  + L
Sbjct: 305 KQWTTLLTMLNSHKGGANERLIGKQNILPWIIDTGASHHMTDTYECLNDLRDIIPCPVGL 364

Query: 94  PDGTKVVA 71
           P+G K  A
Sbjct: 365 PNGAKTKA 372



 Score = 53.1 bits (126), Expect(2) = 2e-24
 Identities = 22/58 (37%), Positives = 42/58 (72%)
 Frame = -2

Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735
           + K+REEE+V+Q LMGL++  +G+VRS+I+  EPL  +  ++ +I ++E+ + + R +
Sbjct: 132 LEKKREEERVHQFLMGLDEDGYGTVRSNILSIEPLSNLNRVYAMIVQQERVRTMTRTK 189


>ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661920 [Glycine max]
          Length = 516

 Score = 90.5 bits (223), Expect(2) = 2e-24
 Identities = 65/232 (28%), Positives = 100/232 (43%), Gaps = 2/232 (0%)
 Frame = -3

Query: 697 PFAAVAKPSSMEAMLTTP-TSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGS 521
           P A   K     +    P T  +  CS+C+  GH+I +C+QL+G+PDWW           
Sbjct: 254 PIAFAVKSGRTSSWEKKPNTGSEKPCSHCKRDGHDIDSCFQLVGYPDWWG---------- 303

Query: 520 SLDRDRERTGGGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAA 341
               DR R+ G           G+G    R  SG             A+ G  N   NA 
Sbjct: 304 ----DRPRSVG--------RALGRGKHVHRPMSG-------------ALKGRGN---NAK 335

Query: 340 IEGGRGLQITAPGTSANSGI-PGLSTEQWKSLLNILQNQANSNRLSCKVVITWIFYSGCS 164
           +   + +  T      +  + PGLS++QW +LLN +  Q            +WI  +G S
Sbjct: 336 VNMTQVVDDTEVMKYEDDQVLPGLSSKQWNALLNAINTQKGGTSTRLTGENSWIIDTGAS 395

Query: 163 HHMTGTGDLFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8
           HHMT T     ++  + P  I +P+GT+  A+  G V  G   + ++VL++P
Sbjct: 396 HHMTSTLACMNDVRDIEPCPIGMPNGTRTYATKEGMVTVGDKLMLKHVLFVP 447



 Score = 49.7 bits (117), Expect(2) = 2e-24
 Identities = 20/48 (41%), Positives = 37/48 (77%)
 Frame = -2

Query: 902 KEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQ 759
           K+REEE+++Q LMGL+D+ F +VRS+++  +PLP +   + ++ +EE+
Sbjct: 193 KKREEEKLHQFLMGLDDTQFRTVRSNVLSLDPLPNLNRAYQMVVQEER 240


>gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score = 85.9 bits (211), Expect(2) = 3e-24
 Identities = 65/220 (29%), Positives = 100/220 (45%), Gaps = 9/220 (4%)
 Frame = -3

Query: 643 TSHKSTCSYCRISGHEIANCYQLIGFPDWWE-----RNRAKASRGSSLDRDRERTGGGRS 479
           +S    CS C   GH    C++LIG+P W E     +N A +SRG  L   + +   GR 
Sbjct: 252 SSENRVCSNCGRVGHLAEQCFKLIGYPPWLEEKLRLKNTASSSRGG-LSSFKGKQSHGRG 310

Query: 478 NSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGT 299
           +S  H           A SG A  V+ N                          +T+P T
Sbjct: 311 SSINHV----------ASSGMAANVVTNSS------------------------LTSPLT 336

Query: 298 SANS-GIPGLSTEQWKSLLNILQNQ---ANSNRLSCKVVITWIFYSGCSHHMTGTGDLFM 131
           S +  G+ GL+  QWK L  IL+ +   +N ++     + +WI  SG ++HMTG+     
Sbjct: 337 SDDRIGLSGLNDSQWKILQTILEERKSTSNDHQSGKYFLESWIIDSGATNHMTGSLAFLR 396

Query: 130 NLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYI 11
           N+  + P +I+LPDG    A+  G+V  G +   Q+VL++
Sbjct: 397 NVCDMPPVLIKLPDGRFTTATKQGSVQLGSSLDLQDVLFV 436



 Score = 53.5 bits (127), Expect(2) = 3e-24
 Identities = 20/56 (35%), Positives = 43/56 (76%)
 Frame = -2

Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLAR 741
           + KEREE++++Q LMGL++S++G+V+S ++   PLP ++  +  + ++E+ ++L+R
Sbjct: 175 VRKEREEDKLHQFLMGLDESVYGAVKSALLSRVPLPSLEEAYNALTQDEESKSLSR 230


>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1501

 Score = 89.0 bits (219), Expect(2) = 4e-24
 Identities = 69/217 (31%), Positives = 95/217 (43%), Gaps = 10/217 (4%)
 Frame = -3

Query: 628 TCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGK 449
           TCS C  +GHE   C+Q++GFPDWW                 ER GG  SN      RG+
Sbjct: 295 TCSNCGRTGHEKKECWQIVGFPDWWS----------------ERNGGRGSNG-----RGR 333

Query: 448 GHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGI-PGL 272
           G G+G            NGGR                  G+G  + A  TS+NS + P  
Sbjct: 334 G-GRG-----------SNGGR------------------GQGQVMAAHATSSNSSVFPEF 363

Query: 271 STEQWKSLLNILQNQANS--------NRLSCKVVITWIFY-SGCSHHMTGTGDLFMNLYP 119
           + E  + L  +++ ++NS        +RLS K  +  I   SG SHHMTGT     N+ P
Sbjct: 364 TEEHMRVLSQLVKEKSNSGSTSNNNSDRLSGKTKLGDIILDSGASHHMTGTLSSLTNVVP 423

Query: 118 VSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8
           V P  +   DG+K  A  +G +         NVL++P
Sbjct: 424 VPPCPVGFADGSKAFALSVGVLTLSNTVSLTNVLFVP 460



 Score = 50.1 bits (118), Expect(2) = 4e-24
 Identities = 25/64 (39%), Positives = 42/64 (65%)
 Frame = -2

Query: 902 KEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVEE 723
           KEREEE+++Q ++GL++S FG + + +I  +PLP +  I+  + +EE  Q LA   V E+
Sbjct: 194 KEREEEKIHQFVLGLDESRFGGLCATLINMDPLPSLGEIYSRVIREE--QRLASVHVREQ 251

Query: 722 QTRA 711
           +  A
Sbjct: 252 KEEA 255


>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score = 85.1 bits (209), Expect(2) = 6e-24
 Identities = 67/211 (31%), Positives = 96/211 (45%), Gaps = 5/211 (2%)
 Frame = -3

Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERT-GGGRSNSYPHATRGK 449
           CS+C  SGHE  +C+Q++GFPDWW                 ERT GGGR +S    +RG+
Sbjct: 280 CSHCGRSGHEKKDCWQIVGFPDWWT----------------ERTNGGGRGSS----SRGR 319

Query: 448 GHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSAN-SGIPGL 272
           G   GR+         +N GR                  GRG    A  T++N S  P  
Sbjct: 320 G---GRSSGS------NNSGR------------------GRGQVTAAHATTSNLSSFPEF 352

Query: 271 STEQWKSLLNILQNQAN--SNRLSCKVVI-TWIFYSGCSHHMTGTGDLFMNLYPVSPYII 101
           + +Q + +  ++QN+ N  S++LS K+ +   I  +G SHHMTG   L  N+  +    +
Sbjct: 353 TPDQLRVITQMIQNKNNGTSDKLSGKMKLGDVILDTGASHHMTGQLSLLTNIVTIPSCSV 412

Query: 100 RLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8
              D  K  A  +GT          NVLY+P
Sbjct: 413 GFADDRKTFAISMGTFKLSETVSLSNVLYVP 443



 Score = 53.1 bits (126), Expect(2) = 6e-24
 Identities = 26/70 (37%), Positives = 45/70 (64%)
 Frame = -2

Query: 905 AKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVE 726
           +KEREEE+++Q ++GL+DS FG + + +I  +P P +  I+  + +EE  Q LA  ++ E
Sbjct: 188 SKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREE--QRLASVQIRE 245

Query: 725 EQTRAAAGVT 696
           +Q  A   +T
Sbjct: 246 QQQSAIGFLT 255


>ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624694 isoform X1 [Citrus
           sinensis] gi|568852764|ref|XP_006480041.1| PREDICTED:
           uncharacterized protein LOC102624694 isoform X2 [Citrus
           sinensis] gi|568852766|ref|XP_006480042.1| PREDICTED:
           uncharacterized protein LOC102624694 isoform X3 [Citrus
           sinensis]
          Length = 320

 Score = 77.0 bits (188), Expect(2) = 7e-23
 Identities = 49/138 (35%), Positives = 67/138 (48%)
 Frame = -3

Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446
           C +CR +GH+  +C+QLIG+P+WW               DR RTGG          RG G
Sbjct: 200 CKHCRKTGHDADSCFQLIGYPEWW--------------GDRSRTGG----------RGAG 235

Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266
            GQG    G A      G +++A + AH   V +   G +G  + A  T    G+ GLS 
Sbjct: 236 RGQGGQRQGIAQGGKGRGSQIKA-NAAH---VTSEGSGIQGHVLDADKT----GLKGLSN 287

Query: 265 EQWKSLLNILQNQANSNR 212
           EQW  LLN+L +Q   N+
Sbjct: 288 EQWSMLLNLLNSQTEKNQ 305



 Score = 57.8 bits (138), Expect(2) = 7e-23
 Identities = 25/56 (44%), Positives = 42/56 (75%)
 Frame = -2

Query: 902 KEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735
           K+ EEE+++Q LMGL+D+I+GSVRS+I+  +PLP +   + L+ +EE+ Q + R +
Sbjct: 116 KKCEEERLHQFLMGLDDTIYGSVRSNILSTDPLPPLNRAYSLVVQEERVQTITRGK 171


>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1496

 Score = 78.6 bits (192), Expect(2) = 2e-21
 Identities = 55/225 (24%), Positives = 97/225 (43%), Gaps = 3/225 (1%)
 Frame = -3

Query: 670 SMEAMLTTPTSHKST--CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRER 497
           S+++  T     KST  C++C   GHE+  C+ + G+PDWW     + ++ S+  R    
Sbjct: 243 SVQSSTTPRFRDKSTLFCTHCNRKGHEVTQCFLVHGYPDWWLEQNPQENQPSTRGRGSNG 302

Query: 496 TGGGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQ 317
            G         ++     G+GRA++  A A          +SG  ND +   I       
Sbjct: 303 RGSSSGRGGNRSSAPTTRGRGRANNAQAAA--------PTVSGDGNDQIAQLI------- 347

Query: 316 ITAPGTSANSGIPGLSTEQWKSLLNILQNQANSNRLSCKVVIT-WIFYSGCSHHMTGTGD 140
                                SLL   +  ++S RLS    +T  +  +G SHHMTG   
Sbjct: 348 ---------------------SLLQAQRPSSSSERLSGNTCLTDGVIDTGASHHMTGDCS 386

Query: 139 LFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5
           + ++++ ++P  +  PDG    A+  GT+    ++   +VL++P+
Sbjct: 387 ILVDVFDITPSPVTKPDGKASQATKCGTLLLHDSYKLHDVLFVPD 431



 Score = 51.2 bits (121), Expect(2) = 2e-21
 Identities = 26/60 (43%), Positives = 41/60 (68%)
 Frame = -2

Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVV 729
           I KERE+++V++ L+GL DS F S+RS I   EPLP +  ++  + +EEQ+ N +R + V
Sbjct: 176 IEKEREDDRVHKFLLGL-DSRFSSIRSSITDIEPLPDLYQVYSRVVREEQNLNASRTKDV 234


>ref|XP_004243119.1| PREDICTED: uncharacterized protein LOC101247933 [Solanum
           lycopersicum]
          Length = 528

 Score = 85.9 bits (211), Expect(2) = 6e-21
 Identities = 73/224 (32%), Positives = 94/224 (41%), Gaps = 3/224 (1%)
 Frame = -3

Query: 670 SMEAMLTTPTSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTG 491
           ++E   T P  +K  C++C  +GH    C+ LIGFP             S   R RE   
Sbjct: 251 AVETQPTPPLKYK--CTHCGKNGHSAERCFILIGFP-------------SGGRRGREGGR 295

Query: 490 GGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQIT 311
           GGR        RG+G   GR  S          GR   M+ AH D   +           
Sbjct: 296 GGR--------RGQGPPSGREQSA---------GRGGGMA-AHTDSPTSPA--------V 329

Query: 310 APGTSANSGIPGLSTEQWKSLLNILQNQANSNRLSCKVVIT---WIFYSGCSHHMTGTGD 140
             G S     P LS EQ   LLN+L     S   +  V      W+  SG SHHMTG   
Sbjct: 330 TIGNSQGGNFPRLSAEQMTRLLNMLDTPTQSRNNTGTVHALSPDWLIDSGASHHMTGNFS 389

Query: 139 LFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8
              ++  V    I LPDGT+VVA+  G+V    N I +NVL++P
Sbjct: 390 SLYDIMSVPECSIGLPDGTRVVANYCGSVQISANLILKNVLFVP 433



 Score = 42.4 bits (98), Expect(2) = 6e-21
 Identities = 18/49 (36%), Positives = 32/49 (65%)
 Frame = -2

Query: 893 EEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNL 747
           EEE+ +  L+GL+D+ FG+ RS I    PL  +   + L+ +EE+H+++
Sbjct: 189 EEEKTHAFLLGLDDAQFGATRSEIFGTHPLFVLNEAYYLVSQEERHKSI 237


>dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1098

 Score = 77.4 bits (189), Expect(2) = 1e-19
 Identities = 58/213 (27%), Positives = 94/213 (44%), Gaps = 5/213 (2%)
 Frame = -3

Query: 628 TCSYCRISGHEIANCYQLIGFPDWW-ERNRAKASRGSSLDRDRERTGGGRSNSYPHATRG 452
           TC++    GH+I  C+ + G+PDWW E+N +  S G      R   G G +N    ++  
Sbjct: 262 TCTHYHRQGHDITECFLVHGYPDWWLEQNGSNGSAGRGTS-GRGNNGRGNNNRGGRSSSS 320

Query: 451 KGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGL 272
              G+GRA++                +  H                  P TS  S     
Sbjct: 321 GSRGKGRANA----------------ASTH-----------------PPPTSTPS----- 342

Query: 271 STEQWKSLLNILQNQ---ANSNRLSCKVVITWIFY-SGCSHHMTGTGDLFMNLYPVSPYI 104
           + +Q   L+++LQ Q    +S +LS K   T++   +G SHHMTG   L  N+  + P  
Sbjct: 343 NADQINQLISLLQAQNPATSSQKLSGKTFTTYVIIDTGASHHMTGDITLLTNVEDIIPSP 402

Query: 103 IRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5
           +  PDGT   A+  GT+     ++  +VL++P+
Sbjct: 403 VTKPDGTASRATKRGTLALHNAYVLPDVLFVPD 435



 Score = 46.6 bits (109), Expect(2) = 1e-19
 Identities = 23/56 (41%), Positives = 37/56 (66%)
 Frame = -2

Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLAR 741
           IAKERE+++V+Q L+ L D  F  +RS I  ++PLP +  ++  +  EEQ+ N +R
Sbjct: 168 IAKEREDDKVHQFLLNL-DERFRPIRSTITVQDPLPALNQVYSRVIHEEQNLNASR 222


>gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidopsis thaliana]
          Length = 1152

 Score = 89.0 bits (219), Expect(2) = 1e-19
 Identities = 65/246 (26%), Positives = 111/246 (45%), Gaps = 5/246 (2%)
 Frame = -3

Query: 727 RNKLELLLALPFAAVAKPSSMEAMLTTPTSHKSTCSYCRISGHEIANCYQLIGFPDWWER 548
           R+K E + A+ FA     +++ ++  T  ++   C++C  S H    C++L G P+W+  
Sbjct: 243 RSKEERVDAVGFAVQTGVNAIASV--TRVNNMGPCTHCGRSNHSADTCFKLHGVPEWYTE 300

Query: 547 NRAKASRGSSLDRDRERTGGGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSG 368
                S G          G GRS++     RG+G G G ++                   
Sbjct: 301 KYGDTSSG---------RGRGRSST----PRGRGRGHGNSYK------------------ 329

Query: 367 AHNDWVNAAIEGGRGLQITAPGTSAN--SGIPGLSTEQWKSLLNILQNQ--ANSNRLSCK 200
                           Q + P +SA+  S IPG+S E W ++ N+L+     +S +LS K
Sbjct: 330 ------------ANNAQTSHPSSSASEFSDIPGVSKEAWSAIRNLLKQDTATSSEKLSGK 377

Query: 199 V-VITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQN 23
              + ++  SG SHHMTG  DL   +Y +   ++ LP+    +A+  GT+  G N    +
Sbjct: 378 TNCVDFLIDSGASHHMTGFLDLLTEIYEIPHSVVVLPNAKHTIATKKGTLILGANMKLTH 437

Query: 22  VLYIPE 5
           VL++P+
Sbjct: 438 VLFVPD 443



 Score = 34.7 bits (78), Expect(2) = 1e-19
 Identities = 17/60 (28%), Positives = 37/60 (61%), Gaps = 3/60 (5%)
 Frame = -2

Query: 905 AKEREEEQVYQLLMGLNDSIFGSVRSHI---IQEEPLPKIKTIFVLICKEEQHQNLARAE 735
           ++ R+ E+++Q LMGL+ + FG+ R++I   +  +    + +I+  I  EE+H  + R++
Sbjct: 186 SQRRDHERIHQFLMGLDAAKFGTSRTNILGRLSRDDNISLDSIYSEIIAEERHLTITRSK 245


>emb|CAN78026.1| hypothetical protein VITISV_032464 [Vitis vinifera]
          Length = 685

 Score = 76.6 bits (187), Expect(2) = 4e-18
 Identities = 60/210 (28%), Positives = 90/210 (42%), Gaps = 3/210 (1%)
 Frame = -3

Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446
           C +C   GH+  NCY+++G+P+ W            LD+++   G GRS        G+G
Sbjct: 252 CPHCHKHGHDKNNCYEIVGYPEGW------------LDQNKADGGAGRSRQQA----GRG 295

Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266
            G  RA                          NAA         T   +S  S    L T
Sbjct: 296 RGSARA--------------------------NAASS-------TIGASSTKSSTDQLFT 322

Query: 265 -EQWKSLLNILQN-QANSNRLSCKV-VITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRL 95
            EQWK+L  ++ N Q   +RL+ K    +WI  +G +HH+TG      +   +    + L
Sbjct: 323 PEQWKALAGLIGNAQVPDDRLNGKFDTKSWIIDTGATHHVTGDLSWLFDTIALFECPVGL 382

Query: 94  PDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5
           P+G  VVA+  G+V    N   +NVLY+P+
Sbjct: 383 PNGESVVATQSGSVRLSNNITLKNVLYVPK 412



 Score = 42.0 bits (97), Expect(2) = 4e-18
 Identities = 22/71 (30%), Positives = 42/71 (59%), Gaps = 7/71 (9%)
 Frame = -2

Query: 893 EEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLAR-------AE 735
           E+E+++  LMGLN  ++  +R++I+ ++PLP +   + L+ ++E+   LA+       AE
Sbjct: 153 EQEKLHDFLMGLNTDLYAQLRTNILSQDPLPSLDRAYQLVIQDER-VRLAKAVTKDKPAE 211

Query: 734 VVEEQTRAAAG 702
           V+    R  AG
Sbjct: 212 VLGFAVRTGAG 222


>emb|CAN80919.1| hypothetical protein VITISV_002640 [Vitis vinifera]
          Length = 1450

 Score = 75.9 bits (185), Expect(2) = 3e-17
 Identities = 59/208 (28%), Positives = 90/208 (43%), Gaps = 3/208 (1%)
 Frame = -3

Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446
           C +C   GH+  NCY+++G+P+ W            LD+++   G GRS        G+G
Sbjct: 285 CPHCHKHGHDKNNCYEIVGYPEGW------------LDQNKADGGAGRSRQQA----GRG 328

Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266
            G  RA                          NAA         T   +S  S    L T
Sbjct: 329 RGSARA--------------------------NAASS-------TIGASSTKSSTDQLFT 355

Query: 265 -EQWKSLLNILQN-QANSNRLSCKV-VITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRL 95
            EQWK+L  ++ N Q  ++RL+ K    +WI  +G +HH+TG      +   +   ++ L
Sbjct: 356 PEQWKALAGLIGNAQVPNDRLNGKFDTKSWIIDTGATHHVTGDLSWLFDTIALFECLVGL 415

Query: 94  PDGTKVVASGLGTVCAG*NFIFQNVLYI 11
           P+G  VVA+  G+V    N   +NVLY+
Sbjct: 416 PNGESVVATQSGSVRLSNNITLKNVLYV 443



 Score = 40.0 bits (92), Expect(2) = 3e-17
 Identities = 19/65 (29%), Positives = 40/65 (61%)
 Frame = -2

Query: 896 REEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVEEQT 717
           RE+ +++  LMGLN  ++  +R++I+ ++PLP +   + L+  +++   LA+A V E++ 
Sbjct: 185 REQGKLHDFLMGLNTDLYAQLRTNILSQDPLPSLDRAYQLVI-QDKRVRLAKA-VTEDKP 242

Query: 716 RAAAG 702
               G
Sbjct: 243 AEVLG 247


>emb|CAN74847.1| hypothetical protein VITISV_028741 [Vitis vinifera]
          Length = 1262

 Score = 73.2 bits (178), Expect(2) = 6e-17
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 3/210 (1%)
 Frame = -3

Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446
           C +C   GH+  NCY+++G+P+ W            LD+++   G GRS       +  G
Sbjct: 284 CPHCHKHGHDKNNCYEIVGYPEGW------------LDQNKADGGAGRSR------QQAG 325

Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266
            G+G A    A + I                                 +S  S    L T
Sbjct: 326 RGRGSARXNTASSTIG-------------------------------ASSTKSSTDQLFT 354

Query: 265 -EQWKSLLNILQN-QANSNRLSCKV-VITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRL 95
            EQWK+L  ++ N Q   +RL+ K    +WI  +G +HH+TG      +   +    + L
Sbjct: 355 PEQWKALAGLIGNAQVPYDRLNGKFDTKSWIIDTGATHHVTGDLXWLFDTIALFECPVGL 414

Query: 94  PDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5
           P+G  +VA+  G+V    N   +NVLY+P+
Sbjct: 415 PNGESIVATQSGSVRLSNNITLKNVLYVPK 444



 Score = 41.6 bits (96), Expect(2) = 6e-17
 Identities = 20/65 (30%), Positives = 41/65 (63%)
 Frame = -2

Query: 896 REEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVEEQT 717
           RE+ +++  LMGLN  ++  +R++I+ ++PLP +   + L+ ++E+   LA+A V E++ 
Sbjct: 184 REQGKLHDFLMGLNTDLYAQLRTNILSQDPLPSLDRAYQLVIQDER-VRLAKA-VTEDKP 241

Query: 716 RAAAG 702
               G
Sbjct: 242 AEVLG 246


>gb|EPS69771.1| hypothetical protein M569_04993 [Genlisea aurea]
          Length = 266

 Score = 61.2 bits (147), Expect(2) = 6e-16
 Identities = 44/156 (28%), Positives = 66/156 (42%), Gaps = 11/156 (7%)
 Frame = -3

Query: 625 CSYCRISGHEIANCYQLIGFPDWW-ERNR---------AKASRGSSLDRDRERTGGGRSN 476
           CS C  SGH+   C+ L+G+P+WW +R R          +   G +      R  GG S 
Sbjct: 72  CSVCGFSGHDKDGCFVLLGYPEWWGDRPRYQFDEKGKLVQCGGGPATSSQESRNRGGISR 131

Query: 475 SYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTS 296
                 RG+G G  R++    +  +     LQA +      V+A     R L +T     
Sbjct: 132 RV--TGRGRGRGGSRSNGSREETAVAGPSTLQAHAATGGQVVSAE---SRNLTLT---NE 183

Query: 295 ANSGIPGLSTEQWKSLLNILQNQANSNRLS-CKVVI 191
               +  LS  QW+ L  IL  + +S +LS C + I
Sbjct: 184 DKQQVTSLSESQWRKLEQILARKDDSEKLSACSITI 219



 Score = 50.1 bits (118), Expect(2) = 6e-16
 Identities = 23/44 (52%), Positives = 32/44 (72%)
 Frame = -2

Query: 866 MGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735
           MGL+D+IF +V S I+ EEPLP    I+  I +EEQH+N+ R+E
Sbjct: 1   MGLDDAIFSTVCSQILAEEPLPGFNQIYNRIIREEQHRNIKRSE 44


>gb|AAC98469.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1102

 Score = 76.6 bits (187), Expect(2) = 3e-14
 Identities = 59/235 (25%), Positives = 101/235 (42%), Gaps = 2/235 (0%)
 Frame = -3

Query: 703 ALPFAAVAKPSSMEAMLTTPTSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRG 524
           A+ F+     +   A +  P     +C++C   GH++ +C+ + GFP+W+   +   SR 
Sbjct: 58  AINFSVKTPSAPQVAAVYAPKPRDRSCTHCHRQGHDVTDCFLVHGFPEWYYEQKG-GSRV 116

Query: 523 SSLDRD-RERTGGGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVN 347
           SS +R+   R     +     +++G G G+GR +S  A     NG               
Sbjct: 117 SSDNREVVSRLENKPAKREGRSSKGNGRGRGRVNSARAPLSSSNGSD------------- 163

Query: 346 AAIEGGRGLQITAPGTSANSGIPGLSTEQWKSLLNILQNQANSNRLSCKVVIT-WIFYSG 170
                    QIT                Q  SLL   + ++ S RLS    +T  I  SG
Sbjct: 164 ---------QIT----------------QLISLLQAQRPKSTSERLSGNTCLTDVIIDSG 198

Query: 169 CSHHMTGTGDLFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5
            SHHMTG   + ++++ + P  +  PDG    A+   T+    ++  Q+VL++P+
Sbjct: 199 ASHHMTGDCSILVDVFDIIPSAVTKPDGKASCATKCVTLLLSSSYKLQDVLFVPD 253



 Score = 28.9 bits (63), Expect(2) = 3e-14
 Identities = 11/37 (29%), Positives = 23/37 (62%)
 Frame = -2

Query: 845 FGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735
           F  +RS I  E+PLP    ++  + + +Q+ ++AR++
Sbjct: 15  FAPIRSKITDEDPLPSHNRVYSRVIRGQQNLDVARSK 51


>ref|XP_004234727.1| PREDICTED: uncharacterized protein LOC101248080 [Solanum
           lycopersicum]
          Length = 422

 Score = 82.0 bits (201), Expect = 3e-13
 Identities = 67/224 (29%), Positives = 92/224 (41%), Gaps = 3/224 (1%)
 Frame = -3

Query: 670 SMEAMLTTPTSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTG 491
           ++E     P  +K T  +C  +GH    C+ LIGFP+   R                  G
Sbjct: 190 TVETQPKPPLKYKFT--HCGKNGHSNERCFLLIGFPNGGRRGH----------------G 231

Query: 490 GGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQIT 311
           GGR         G+G   GR  S          GR   M+   ++  + A+         
Sbjct: 232 GGR-----RGRGGRGLPSGREQSS---------GRTGGMAAHADNPTSRAVR-------- 269

Query: 310 APGTSANSGIPGLSTEQWKSLLNILQNQANSNRLSCKVVIT---WIFYSGCSHHMTGTGD 140
             G S      GLSTE+   LLN+L     S   +  V      W+  SG SHHMTG   
Sbjct: 270 -TGNSQGGNFLGLSTEKMTRLLNMLDTPTQSGNNTGTVHALSPDWLIDSGASHHMTGNFS 328

Query: 139 LFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8
              ++ P+    I LPDGT+VVA+  G+V    N I  NVL++P
Sbjct: 329 SLYDIMPIPECSIGLPDGTRVVANYCGSVQISVNLILNNVLFVP 372

