BLASTX nr result
ID: Catharanthus23_contig00022210
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00022210 (915 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits)   Value

emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]   100   2e-31
gb|AAT40486.1| putative polyprotein [Solanum demissum]                 84   2e-27
gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha...   100   2e-25
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...    87   2e-24
emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera]    87   2e-24
ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661...    91   2e-24
gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab...    86   3e-24
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...    89   4e-24
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...    85   6e-24
ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624...    77   7e-23
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...    79   2e-21
ref|XP_004243119.1| PREDICTED: uncharacterized protein LOC101247...    86   6e-21
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t...    77   1e-19
gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidop...    89   1e-19
emb|CAN78026.1| hypothetical protein VITISV_032464 [Vitis vinifera]    77   4e-18
emb|CAN80919.1| hypothetical protein VITISV_002640 [Vitis vinifera]    76   3e-17
emb|CAN74847.1| hypothetical protein VITISV_028741 [Vitis vinifera]    73   6e-17
gb|EPS69771.1| hypothetical protein M569_04993 [Genlisea aurea]        61   6e-16
gb|AAC98469.1| putative retroelement pol polyprotein [Arabidopsi...
77 3e-14 ref|XP_004234727.1| PREDICTED: uncharacterized protein LOC101248... 82 3e-13 >emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera] Length = 1316 Score = 100 bits (249), Expect(2) = 2e-31 Identities = 71/215 (33%), Positives = 106/215 (49%), Gaps = 3/215 (1%) Frame = -3 Query: 643 TSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPH 464 T+ +C++C +GH++A+C+QL G+PDWW + RG G GR NSY Sbjct: 141 TNKSGSCTHCGKTGHDVADCFQLKGYPDWWPTRQMGRGRG---------RGRGR-NSY-- 188 Query: 463 ATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSG 284 G+G GR H A A D + Q + G ++ + Sbjct: 189 --AGRGATSGRVHYXNAVAEADTQEKGQCV--------------GHDVE--------RNI 224 Query: 283 IPGLSTEQWKSLLNILQN-QANSNRLS--CKVVITWIFYSGCSHHMTGTGDLFMNLYPVS 113 IPGL+ + ++ L+ +L+N +N+ +L+ K+V WI SG S HMTG DLF L Sbjct: 225 IPGLNDDNFQKLMALLRNGSSNAEKLTGKNKIVEEWILDSGASMHMTGRRDLFDWLRKWE 284 Query: 112 PYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8 + LPDGTK VA+ +G V + +NVLY+P Sbjct: 285 TACVGLPDGTKTVANEMGYVKLSKDLCLKNVLYVP 319 Score = 62.8 bits (151), Expect(2) = 2e-31 Identities = 30/68 (44%), Positives = 47/68 (69%) Frame = -2 Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVV 729 I K RE+E+ +Q LMGL+D+ FG+VRS I+ +PLP + I+ ++ +EE+H+++AR Sbjct: 65 IVKSREDEKAHQFLMGLDDTTFGTVRSSILALDPLPTLGKIYAMVTQEERHRSMARGADR 124 Query: 728 EEQTRAAA 705 E T AA Sbjct: 125 AEITVFAA 132 >gb|AAT40486.1| putative polyprotein [Solanum demissum] Length = 1065 Score = 83.6 bits (205), Expect(2) = 2e-27 Identities = 59/213 (27%), Positives = 93/213 (43%), Gaps = 3/213 (1%) Frame = -3 Query: 634 KSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATR 455 K C+ C +E + LIG+P+WW R GG++N H R Sbjct: 218 KPPCAKCGKFNYETKKYFLLIGYPEWWGTGRE----------------GGKNNGRGHGGR 261 Query: 454 GKGHGQ-GRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIP 278 +G G + +G A + N + +G NA Sbjct: 262 SGNYGDFGHSGTGRGVAAVANVAQATGSNGTTKKEANAWT-------------------- 301 Query: 277 GLSTEQWKSLLNILQNQ-ANSNRLSCKVV-ITWIFYSGCSHHMTGTGDLFMNLYPVSPYI 104 GLS 
+QW +LL++L + N +L+ ++ I WI +G SHHM+G LF +L V PY+ Sbjct: 302 GLSNDQWSALLSMLNSHNKNHEKLAGNILGICWIVDTGASHHMSGDAQLFNDLCDVPPYL 361 Query: 103 IRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5 + LP+G+ +AS + V +VLY+P+ Sbjct: 362 VSLPNGSTTIAS-MEIVILTDKMKLHHVLYVPQ 393 Score = 66.6 bits (161), Expect(2) = 2e-27 Identities = 27/58 (46%), Positives = 45/58 (77%) Frame = -2 Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735 + +ERE+E+V+Q MGL+D +FG+ S+I+ +PLP + ++ +I +EE+HQNLARA+ Sbjct: 130 LTQEREKEKVHQFSMGLDDKVFGTTHSNILSTKPLPTLNRVYAMIIQEERHQNLARAK 187 >gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana] Length = 1468 Score = 100 bits (250), Expect(2) = 2e-25 Identities = 67/222 (30%), Positives = 105/222 (47%), Gaps = 11/222 (4%) Frame = -3 Query: 637 HKSTCSYCRISGHEIANCYQLIGFPDWW-ERNRAKASRGSSLDRDRERTGGGRSNSYPHA 461 +K C++C GH NC+ LIG+P+WW +R R K++ S R R R G G + P Sbjct: 270 NKKLCTHCNRGGHSPENCFVLIGYPEWWGDRPRGKSNSNGSTSRGRGRFGPGFNGGQPRP 329 Query: 460 TRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAH--NDWVNAAIEGGRGLQITAPGTSANS 287 T + + V M+G ++ VN I S Sbjct: 330 T-------------YVNVV---------MTGPFPSSEHVNRVITD-----------SDRD 356 Query: 286 GIPGLSTEQWKSLLNILQ-----NQANSNRL---SCKVVITWIFYSGCSHHMTGTGDLFM 131 + GL+ EQW+ ++ +L N++N++ +C + +WI +G SHHMTG +L Sbjct: 357 AVSGLTDEQWRGVVKLLNAGRSDNKSNAHETQSGTCSLFTSWILDTGASHHMTGNLELLS 416 Query: 130 NLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5 ++ +SP +I L DG K VA GTV G + I ++V Y+ E Sbjct: 417 DMRSMSPVLIILADGNKRVAVSEGTVRLGSHLILKSVFYVKE 458 Score = 42.4 bits (98), Expect(2) = 2e-25 Identities = 22/66 (33%), Positives = 39/66 (59%) Frame = -2 Query: 902 KEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVEE 723 K RE++ V+Q L GLN++ F ++RS + PLP ++ ++ ++ +EE N + EE Sbjct: 183 KYREDDMVHQYLYGLNETKFHTIRSSLTSRVPLPGLEEVYNIVRQEEDMVNNRSSN--EE 240 Query: 722 QTRAAA 705 +T A Sbjct: 241 RTDVTA 246 >gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1413 Score = 87.0 bits (214), Expect(2) = 2e-24 Identities = 68/211 
(32%), Positives = 97/211 (45%), Gaps = 5/211 (2%) Frame = -3 Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERT-GGGRSNSYPHATRGK 449 CS+C SGHE +C+Q++GFPDWW ERT GGGR +S +RG+ Sbjct: 280 CSHCGRSGHEKKDCWQIVGFPDWWT----------------ERTNGGGRGSS----SRGR 319 Query: 448 GHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSAN-SGIPGL 272 G GR+ +N GR GRG A T++N S P Sbjct: 320 G---GRSSGS------NNSGR------------------GRGQVTAAHATTSNLSPFPEF 352 Query: 271 STEQWKSLLNILQNQAN--SNRLSCKVVI-TWIFYSGCSHHMTGTGDLFMNLYPVSPYII 101 + +Q + + ++QN+ N S++LS K+ + I +G SHHMTG L N+ + + Sbjct: 353 TPDQLRVITQMIQNKNNGTSDKLSGKMKLGDVILDTGASHHMTGQLSLLTNIVTIPSCSV 412 Query: 100 RLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8 DG K A +GT NVLY+P Sbjct: 413 GFADGRKTFAISMGTFKLSETVSLSNVLYVP 443 Score = 53.1 bits (126), Expect(2) = 2e-24 Identities = 26/70 (37%), Positives = 45/70 (64%) Frame = -2 Query: 905 AKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVE 726 +KEREEE+++Q ++GL+DS FG + + +I +P P + I+ + +EE Q LA ++ E Sbjct: 188 SKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREE--QRLASVQIRE 245 Query: 725 EQTRAAAGVT 696 +Q A +T Sbjct: 246 QQQSAIGFLT 255 >emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera] Length = 1157 Score = 87.0 bits (214), Expect(2) = 2e-24 Identities = 57/188 (30%), Positives = 84/188 (44%), Gaps = 3/188 (1%) Frame = -3 Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446 CS C+ GHE+ +C+Q I +P+WW DR R T G +G G Sbjct: 218 CSNCKRKGHEVDSCFQRIAYPEWWG------------DRPRTTTSGCSGGHGRGVQQGTG 265 Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266 G+GR + A+ V G +GGR + S +GI GLS Sbjct: 266 GGRGRGGTARANVVQTLG-----------------TDGGRSVVTD----SNRTGISGLSD 304 Query: 265 EQWKSLLNILQNQ---ANSNRLSCKVVITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRL 95 +QW +LL +L + AN + + ++ WI +G SHHMT T + +L + P + L Sbjct: 305 KQWTTLLTMLNSHKGGANERLIGKQNILPWIIDTGASHHMTDTYECLNDLRDIIPCPVGL 364 Query: 94 PDGTKVVA 71 P+G K A Sbjct: 365 PNGAKTKA 372 Score = 53.1 bits (126), Expect(2) = 2e-24 
Identities = 22/58 (37%), Positives = 42/58 (72%) Frame = -2 Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735 + K+REEE+V+Q LMGL++ +G+VRS+I+ EPL + ++ +I ++E+ + + R + Sbjct: 132 LEKKREEERVHQFLMGLDEDGYGTVRSNILSIEPLSNLNRVYAMIVQQERVRTMTRTK 189 >ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661920 [Glycine max] Length = 516 Score = 90.5 bits (223), Expect(2) = 2e-24 Identities = 65/232 (28%), Positives = 100/232 (43%), Gaps = 2/232 (0%) Frame = -3 Query: 697 PFAAVAKPSSMEAMLTTP-TSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGS 521 P A K + P T + CS+C+ GH+I +C+QL+G+PDWW Sbjct: 254 PIAFAVKSGRTSSWEKKPNTGSEKPCSHCKRDGHDIDSCFQLVGYPDWWG---------- 303 Query: 520 SLDRDRERTGGGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAA 341 DR R+ G G+G R SG A+ G N NA Sbjct: 304 ----DRPRSVG--------RALGRGKHVHRPMSG-------------ALKGRGN---NAK 335 Query: 340 IEGGRGLQITAPGTSANSGI-PGLSTEQWKSLLNILQNQANSNRLSCKVVITWIFYSGCS 164 + + + T + + PGLS++QW +LLN + Q +WI +G S Sbjct: 336 VNMTQVVDDTEVMKYEDDQVLPGLSSKQWNALLNAINTQKGGTSTRLTGENSWIIDTGAS 395 Query: 163 HHMTGTGDLFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8 HHMT T ++ + P I +P+GT+ A+ G V G + ++VL++P Sbjct: 396 HHMTSTLACMNDVRDIEPCPIGMPNGTRTYATKEGMVTVGDKLMLKHVLFVP 447 Score = 49.7 bits (117), Expect(2) = 2e-24 Identities = 20/48 (41%), Positives = 37/48 (77%) Frame = -2 Query: 902 KEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQ 759 K+REEE+++Q LMGL+D+ F +VRS+++ +PLP + + ++ +EE+ Sbjct: 193 KKREEEKLHQFLMGLDDTQFRTVRSNVLSLDPLPNLNRAYQMVVQEER 240 >gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana] Length = 1486 Score = 85.9 bits (211), Expect(2) = 3e-24 Identities = 65/220 (29%), Positives = 100/220 (45%), Gaps = 9/220 (4%) Frame = -3 Query: 643 TSHKSTCSYCRISGHEIANCYQLIGFPDWWE-----RNRAKASRGSSLDRDRERTGGGRS 479 +S CS C GH C++LIG+P W E +N A +SRG L + + GR Sbjct: 252 SSENRVCSNCGRVGHLAEQCFKLIGYPPWLEEKLRLKNTASSSRGG-LSSFKGKQSHGRG 310 Query: 478 NSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGT 299 +S H 
A SG A V+ N +T+P T Sbjct: 311 SSINHV----------ASSGMAANVVTNSS------------------------LTSPLT 336 Query: 298 SANS-GIPGLSTEQWKSLLNILQNQ---ANSNRLSCKVVITWIFYSGCSHHMTGTGDLFM 131 S + G+ GL+ QWK L IL+ + +N ++ + +WI SG ++HMTG+ Sbjct: 337 SDDRIGLSGLNDSQWKILQTILEERKSTSNDHQSGKYFLESWIIDSGATNHMTGSLAFLR 396 Query: 130 NLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYI 11 N+ + P +I+LPDG A+ G+V G + Q+VL++ Sbjct: 397 NVCDMPPVLIKLPDGRFTTATKQGSVQLGSSLDLQDVLFV 436 Score = 53.5 bits (127), Expect(2) = 3e-24 Identities = 20/56 (35%), Positives = 43/56 (76%) Frame = -2 Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLAR 741 + KEREE++++Q LMGL++S++G+V+S ++ PLP ++ + + ++E+ ++L+R Sbjct: 175 VRKEREEDKLHQFLMGLDESVYGAVKSALLSRVPLPSLEEAYNALTQDEESKSLSR 230 >gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1501 Score = 89.0 bits (219), Expect(2) = 4e-24 Identities = 69/217 (31%), Positives = 95/217 (43%), Gaps = 10/217 (4%) Frame = -3 Query: 628 TCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGK 449 TCS C +GHE C+Q++GFPDWW ER GG SN RG+ Sbjct: 295 TCSNCGRTGHEKKECWQIVGFPDWWS----------------ERNGGRGSNG-----RGR 333 Query: 448 GHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGI-PGL 272 G G+G NGGR G+G + A TS+NS + P Sbjct: 334 G-GRG-----------SNGGR------------------GQGQVMAAHATSSNSSVFPEF 363 Query: 271 STEQWKSLLNILQNQANS--------NRLSCKVVITWIFY-SGCSHHMTGTGDLFMNLYP 119 + E + L +++ ++NS +RLS K + I SG SHHMTGT N+ P Sbjct: 364 TEEHMRVLSQLVKEKSNSGSTSNNNSDRLSGKTKLGDIILDSGASHHMTGTLSSLTNVVP 423 Query: 118 VSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8 V P + DG+K A +G + NVL++P Sbjct: 424 VPPCPVGFADGSKAFALSVGVLTLSNTVSLTNVLFVP 460 Score = 50.1 bits (118), Expect(2) = 4e-24 Identities = 25/64 (39%), Positives = 42/64 (65%) Frame = -2 Query: 902 KEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVEE 723 KEREEE+++Q ++GL++S FG + + +I +PLP + I+ + +EE Q LA V E+ Sbjct: 194 KEREEEKIHQFVLGLDESRFGGLCATLINMDPLPSLGEIYSRVIREE--QRLASVHVREQ 251 Query: 722 QTRA 711 + A Sbjct: 252 KEEA 
255 >dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1491 Score = 85.1 bits (209), Expect(2) = 6e-24 Identities = 67/211 (31%), Positives = 96/211 (45%), Gaps = 5/211 (2%) Frame = -3 Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERT-GGGRSNSYPHATRGK 449 CS+C SGHE +C+Q++GFPDWW ERT GGGR +S +RG+ Sbjct: 280 CSHCGRSGHEKKDCWQIVGFPDWWT----------------ERTNGGGRGSS----SRGR 319 Query: 448 GHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSAN-SGIPGL 272 G GR+ +N GR GRG A T++N S P Sbjct: 320 G---GRSSGS------NNSGR------------------GRGQVTAAHATTSNLSSFPEF 352 Query: 271 STEQWKSLLNILQNQAN--SNRLSCKVVI-TWIFYSGCSHHMTGTGDLFMNLYPVSPYII 101 + +Q + + ++QN+ N S++LS K+ + I +G SHHMTG L N+ + + Sbjct: 353 TPDQLRVITQMIQNKNNGTSDKLSGKMKLGDVILDTGASHHMTGQLSLLTNIVTIPSCSV 412 Query: 100 RLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8 D K A +GT NVLY+P Sbjct: 413 GFADDRKTFAISMGTFKLSETVSLSNVLYVP 443 Score = 53.1 bits (126), Expect(2) = 6e-24 Identities = 26/70 (37%), Positives = 45/70 (64%) Frame = -2 Query: 905 AKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVE 726 +KEREEE+++Q ++GL+DS FG + + +I +P P + I+ + +EE Q LA ++ E Sbjct: 188 SKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREE--QRLASVQIRE 245 Query: 725 EQTRAAAGVT 696 +Q A +T Sbjct: 246 QQQSAIGFLT 255 >ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624694 isoform X1 [Citrus sinensis] gi|568852764|ref|XP_006480041.1| PREDICTED: uncharacterized protein LOC102624694 isoform X2 [Citrus sinensis] gi|568852766|ref|XP_006480042.1| PREDICTED: uncharacterized protein LOC102624694 isoform X3 [Citrus sinensis] Length = 320 Score = 77.0 bits (188), Expect(2) = 7e-23 Identities = 49/138 (35%), Positives = 67/138 (48%) Frame = -3 Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446 C +CR +GH+ +C+QLIG+P+WW DR RTGG RG G Sbjct: 200 CKHCRKTGHDADSCFQLIGYPEWW--------------GDRSRTGG----------RGAG 235 Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266 GQG G A G +++A + AH V + G 
+G + A T G+ GLS Sbjct: 236 RGQGGQRQGIAQGGKGRGSQIKA-NAAH---VTSEGSGIQGHVLDADKT----GLKGLSN 287 Query: 265 EQWKSLLNILQNQANSNR 212 EQW LLN+L +Q N+ Sbjct: 288 EQWSMLLNLLNSQTEKNQ 305 Score = 57.8 bits (138), Expect(2) = 7e-23 Identities = 25/56 (44%), Positives = 42/56 (75%) Frame = -2 Query: 902 KEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735 K+ EEE+++Q LMGL+D+I+GSVRS+I+ +PLP + + L+ +EE+ Q + R + Sbjct: 116 KKCEEERLHQFLMGLDDTIYGSVRSNILSTDPLPPLNRAYSLVVQEERVQTITRGK 171 >gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1496 Score = 78.6 bits (192), Expect(2) = 2e-21 Identities = 55/225 (24%), Positives = 97/225 (43%), Gaps = 3/225 (1%) Frame = -3 Query: 670 SMEAMLTTPTSHKST--CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRER 497 S+++ T KST C++C GHE+ C+ + G+PDWW + ++ S+ R Sbjct: 243 SVQSSTTPRFRDKSTLFCTHCNRKGHEVTQCFLVHGYPDWWLEQNPQENQPSTRGRGSNG 302 Query: 496 TGGGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQ 317 G ++ G+GRA++ A A +SG ND + I Sbjct: 303 RGSSSGRGGNRSSAPTTRGRGRANNAQAAA--------PTVSGDGNDQIAQLI------- 347 Query: 316 ITAPGTSANSGIPGLSTEQWKSLLNILQNQANSNRLSCKVVIT-WIFYSGCSHHMTGTGD 140 SLL + ++S RLS +T + +G SHHMTG Sbjct: 348 ---------------------SLLQAQRPSSSSERLSGNTCLTDGVIDTGASHHMTGDCS 386 Query: 139 LFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5 + ++++ ++P + PDG A+ GT+ ++ +VL++P+ Sbjct: 387 ILVDVFDITPSPVTKPDGKASQATKCGTLLLHDSYKLHDVLFVPD 431 Score = 51.2 bits (121), Expect(2) = 2e-21 Identities = 26/60 (43%), Positives = 41/60 (68%) Frame = -2 Query: 908 IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVV 729 I KERE+++V++ L+GL DS F S+RS I EPLP + ++ + +EEQ+ N +R + V Sbjct: 176 IEKEREDDRVHKFLLGL-DSRFSSIRSSITDIEPLPDLYQVYSRVVREEQNLNASRTKDV 234 >ref|XP_004243119.1| PREDICTED: uncharacterized protein LOC101247933 [Solanum lycopersicum] Length = 528 Score = 85.9 bits (211), Expect(2) = 6e-21 Identities = 73/224 (32%), Positives = 94/224 (41%), Gaps = 3/224 (1%) Frame = -3 Query: 670 
SMEAMLTTPTSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTG 491 ++E T P +K C++C +GH C+ LIGFP S R RE Sbjct: 251 AVETQPTPPLKYK--CTHCGKNGHSAERCFILIGFP-------------SGGRRGREGGR 295 Query: 490 GGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQIT 311 GGR RG+G GR S GR M+ AH D + Sbjct: 296 GGR--------RGQGPPSGREQSA---------GRGGGMA-AHTDSPTSPA--------V 329 Query: 310 APGTSANSGIPGLSTEQWKSLLNILQNQANSNRLSCKVVIT---WIFYSGCSHHMTGTGD 140 G S P LS EQ LLN+L S + V W+ SG SHHMTG Sbjct: 330 TIGNSQGGNFPRLSAEQMTRLLNMLDTPTQSRNNTGTVHALSPDWLIDSGASHHMTGNFS 389 Query: 139 LFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8 ++ V I LPDGT+VVA+ G+V N I +NVL++P Sbjct: 390 SLYDIMSVPECSIGLPDGTRVVANYCGSVQISANLILKNVLFVP 433 Score = 42.4 bits (98), Expect(2) = 6e-21 Identities = 18/49 (36%), Positives = 32/49 (65%) Frame = -2 Query: 893 EEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNL 747 EEE+ + L+GL+D+ FG+ RS I PL + + L+ +EE+H+++ Sbjct: 189 EEEKTHAFLLGLDDAQFGATRSEIFGTHPLFVLNEAYYLVSQEERHKSI 237 >dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1098 Score = 77.4 bits (189), Expect(2) = 1e-19 Identities = 58/213 (27%), Positives = 94/213 (44%), Gaps = 5/213 (2%) Frame = -3 Query: 628 TCSYCRISGHEIANCYQLIGFPDWW-ERNRAKASRGSSLDRDRERTGGGRSNSYPHATRG 452 TC++ GH+I C+ + G+PDWW E+N + S G R G G +N ++ Sbjct: 262 TCTHYHRQGHDITECFLVHGYPDWWLEQNGSNGSAGRGTS-GRGNNGRGNNNRGGRSSSS 320 Query: 451 KGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGL 272 G+GRA++ + H P TS S Sbjct: 321 GSRGKGRANA----------------ASTH-----------------PPPTSTPS----- 342 Query: 271 STEQWKSLLNILQNQ---ANSNRLSCKVVITWIFY-SGCSHHMTGTGDLFMNLYPVSPYI 104 + +Q L+++LQ Q +S +LS K T++ +G SHHMTG L N+ + P Sbjct: 343 NADQINQLISLLQAQNPATSSQKLSGKTFTTYVIIDTGASHHMTGDITLLTNVEDIIPSP 402 Query: 103 IRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5 + PDGT A+ GT+ ++ +VL++P+ Sbjct: 403 VTKPDGTASRATKRGTLALHNAYVLPDVLFVPD 435 Score = 46.6 bits (109), Expect(2) = 1e-19 Identities = 23/56 (41%), Positives = 37/56 (66%) Frame = -2 Query: 908 
IAKEREEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLAR 741 IAKERE+++V+Q L+ L D F +RS I ++PLP + ++ + EEQ+ N +R Sbjct: 168 IAKEREDDKVHQFLLNL-DERFRPIRSTITVQDPLPALNQVYSRVIHEEQNLNASR 222 >gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidopsis thaliana] Length = 1152 Score = 89.0 bits (219), Expect(2) = 1e-19 Identities = 65/246 (26%), Positives = 111/246 (45%), Gaps = 5/246 (2%) Frame = -3 Query: 727 RNKLELLLALPFAAVAKPSSMEAMLTTPTSHKSTCSYCRISGHEIANCYQLIGFPDWWER 548 R+K E + A+ FA +++ ++ T ++ C++C S H C++L G P+W+ Sbjct: 243 RSKEERVDAVGFAVQTGVNAIASV--TRVNNMGPCTHCGRSNHSADTCFKLHGVPEWYTE 300 Query: 547 NRAKASRGSSLDRDRERTGGGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSG 368 S G G GRS++ RG+G G G ++ Sbjct: 301 KYGDTSSG---------RGRGRSST----PRGRGRGHGNSYK------------------ 329 Query: 367 AHNDWVNAAIEGGRGLQITAPGTSAN--SGIPGLSTEQWKSLLNILQNQ--ANSNRLSCK 200 Q + P +SA+ S IPG+S E W ++ N+L+ +S +LS K Sbjct: 330 ------------ANNAQTSHPSSSASEFSDIPGVSKEAWSAIRNLLKQDTATSSEKLSGK 377 Query: 199 V-VITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQN 23 + ++ SG SHHMTG DL +Y + ++ LP+ +A+ GT+ G N + Sbjct: 378 TNCVDFLIDSGASHHMTGFLDLLTEIYEIPHSVVVLPNAKHTIATKKGTLILGANMKLTH 437 Query: 22 VLYIPE 5 VL++P+ Sbjct: 438 VLFVPD 443 Score = 34.7 bits (78), Expect(2) = 1e-19 Identities = 17/60 (28%), Positives = 37/60 (61%), Gaps = 3/60 (5%) Frame = -2 Query: 905 AKEREEEQVYQLLMGLNDSIFGSVRSHI---IQEEPLPKIKTIFVLICKEEQHQNLARAE 735 ++ R+ E+++Q LMGL+ + FG+ R++I + + + +I+ I EE+H + R++ Sbjct: 186 SQRRDHERIHQFLMGLDAAKFGTSRTNILGRLSRDDNISLDSIYSEIIAEERHLTITRSK 245 >emb|CAN78026.1| hypothetical protein VITISV_032464 [Vitis vinifera] Length = 685 Score = 76.6 bits (187), Expect(2) = 4e-18 Identities = 60/210 (28%), Positives = 90/210 (42%), Gaps = 3/210 (1%) Frame = -3 Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446 C +C GH+ NCY+++G+P+ W LD+++ G GRS G+G Sbjct: 252 CPHCHKHGHDKNNCYEIVGYPEGW------------LDQNKADGGAGRSRQQA----GRG 295 Query: 445 
HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266 G RA NAA T +S S L T Sbjct: 296 RGSARA--------------------------NAASS-------TIGASSTKSSTDQLFT 322 Query: 265 -EQWKSLLNILQN-QANSNRLSCKV-VITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRL 95 EQWK+L ++ N Q +RL+ K +WI +G +HH+TG + + + L Sbjct: 323 PEQWKALAGLIGNAQVPDDRLNGKFDTKSWIIDTGATHHVTGDLSWLFDTIALFECPVGL 382 Query: 94 PDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5 P+G VVA+ G+V N +NVLY+P+ Sbjct: 383 PNGESVVATQSGSVRLSNNITLKNVLYVPK 412 Score = 42.0 bits (97), Expect(2) = 4e-18 Identities = 22/71 (30%), Positives = 42/71 (59%), Gaps = 7/71 (9%) Frame = -2 Query: 893 EEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLAR-------AE 735 E+E+++ LMGLN ++ +R++I+ ++PLP + + L+ ++E+ LA+ AE Sbjct: 153 EQEKLHDFLMGLNTDLYAQLRTNILSQDPLPSLDRAYQLVIQDER-VRLAKAVTKDKPAE 211 Query: 734 VVEEQTRAAAG 702 V+ R AG Sbjct: 212 VLGFAVRTGAG 222 >emb|CAN80919.1| hypothetical protein VITISV_002640 [Vitis vinifera] Length = 1450 Score = 75.9 bits (185), Expect(2) = 3e-17 Identities = 59/208 (28%), Positives = 90/208 (43%), Gaps = 3/208 (1%) Frame = -3 Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446 C +C GH+ NCY+++G+P+ W LD+++ G GRS G+G Sbjct: 285 CPHCHKHGHDKNNCYEIVGYPEGW------------LDQNKADGGAGRSRQQA----GRG 328 Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266 G RA NAA T +S S L T Sbjct: 329 RGSARA--------------------------NAASS-------TIGASSTKSSTDQLFT 355 Query: 265 -EQWKSLLNILQN-QANSNRLSCKV-VITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRL 95 EQWK+L ++ N Q ++RL+ K +WI +G +HH+TG + + ++ L Sbjct: 356 PEQWKALAGLIGNAQVPNDRLNGKFDTKSWIIDTGATHHVTGDLSWLFDTIALFECLVGL 415 Query: 94 PDGTKVVASGLGTVCAG*NFIFQNVLYI 11 P+G VVA+ G+V N +NVLY+ Sbjct: 416 PNGESVVATQSGSVRLSNNITLKNVLYV 443 Score = 40.0 bits (92), Expect(2) = 3e-17 Identities = 19/65 (29%), Positives = 40/65 (61%) Frame = -2 Query: 896 REEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVEEQT 717 RE+ +++ LMGLN ++ +R++I+ ++PLP + + L+ +++ LA+A V E++ Sbjct: 185 
REQGKLHDFLMGLNTDLYAQLRTNILSQDPLPSLDRAYQLVI-QDKRVRLAKA-VTEDKP 242 Query: 716 RAAAG 702 G Sbjct: 243 AEVLG 247 >emb|CAN74847.1| hypothetical protein VITISV_028741 [Vitis vinifera] Length = 1262 Score = 73.2 bits (178), Expect(2) = 6e-17 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 3/210 (1%) Frame = -3 Query: 625 CSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTGGGRSNSYPHATRGKG 446 C +C GH+ NCY+++G+P+ W LD+++ G GRS + G Sbjct: 284 CPHCHKHGHDKNNCYEIVGYPEGW------------LDQNKADGGAGRSR------QQAG 325 Query: 445 HGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTSANSGIPGLST 266 G+G A A + I +S S L T Sbjct: 326 RGRGSARXNTASSTIG-------------------------------ASSTKSSTDQLFT 354 Query: 265 -EQWKSLLNILQN-QANSNRLSCKV-VITWIFYSGCSHHMTGTGDLFMNLYPVSPYIIRL 95 EQWK+L ++ N Q +RL+ K +WI +G +HH+TG + + + L Sbjct: 355 PEQWKALAGLIGNAQVPYDRLNGKFDTKSWIIDTGATHHVTGDLXWLFDTIALFECPVGL 414 Query: 94 PDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5 P+G +VA+ G+V N +NVLY+P+ Sbjct: 415 PNGESIVATQSGSVRLSNNITLKNVLYVPK 444 Score = 41.6 bits (96), Expect(2) = 6e-17 Identities = 20/65 (30%), Positives = 41/65 (63%) Frame = -2 Query: 896 REEEQVYQLLMGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAEVVEEQT 717 RE+ +++ LMGLN ++ +R++I+ ++PLP + + L+ ++E+ LA+A V E++ Sbjct: 184 REQGKLHDFLMGLNTDLYAQLRTNILSQDPLPSLDRAYQLVIQDER-VRLAKA-VTEDKP 241 Query: 716 RAAAG 702 G Sbjct: 242 AEVLG 246 >gb|EPS69771.1| hypothetical protein M569_04993 [Genlisea aurea] Length = 266 Score = 61.2 bits (147), Expect(2) = 6e-16 Identities = 44/156 (28%), Positives = 66/156 (42%), Gaps = 11/156 (7%) Frame = -3 Query: 625 CSYCRISGHEIANCYQLIGFPDWW-ERNR---------AKASRGSSLDRDRERTGGGRSN 476 CS C SGH+ C+ L+G+P+WW +R R + G + R GG S Sbjct: 72 CSVCGFSGHDKDGCFVLLGYPEWWGDRPRYQFDEKGKLVQCGGGPATSSQESRNRGGISR 131 Query: 475 SYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQITAPGTS 296 RG+G G R++ + + LQA + V+A R L +T Sbjct: 132 RV--TGRGRGRGGSRSNGSREETAVAGPSTLQAHAATGGQVVSAE---SRNLTLT---NE 183 Query: 295 ANSGIPGLSTEQWKSLLNILQNQANSNRLS-CKVVI 191 + LS QW+ L IL + +S +LS C + I 
Sbjct: 184 DKQQVTSLSESQWRKLEQILARKDDSEKLSACSITI 219 Score = 50.1 bits (118), Expect(2) = 6e-16 Identities = 23/44 (52%), Positives = 32/44 (72%) Frame = -2 Query: 866 MGLNDSIFGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735 MGL+D+IF +V S I+ EEPLP I+ I +EEQH+N+ R+E Sbjct: 1 MGLDDAIFSTVCSQILAEEPLPGFNQIYNRIIREEQHRNIKRSE 44 >gb|AAC98469.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1102 Score = 76.6 bits (187), Expect(2) = 3e-14 Identities = 59/235 (25%), Positives = 101/235 (42%), Gaps = 2/235 (0%) Frame = -3 Query: 703 ALPFAAVAKPSSMEAMLTTPTSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRG 524 A+ F+ + A + P +C++C GH++ +C+ + GFP+W+ + SR Sbjct: 58 AINFSVKTPSAPQVAAVYAPKPRDRSCTHCHRQGHDVTDCFLVHGFPEWYYEQKG-GSRV 116 Query: 523 SSLDRD-RERTGGGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVN 347 SS +R+ R + +++G G G+GR +S A NG Sbjct: 117 SSDNREVVSRLENKPAKREGRSSKGNGRGRGRVNSARAPLSSSNGSD------------- 163 Query: 346 AAIEGGRGLQITAPGTSANSGIPGLSTEQWKSLLNILQNQANSNRLSCKVVIT-WIFYSG 170 QIT Q SLL + ++ S RLS +T I SG Sbjct: 164 ---------QIT----------------QLISLLQAQRPKSTSERLSGNTCLTDVIIDSG 198 Query: 169 CSHHMTGTGDLFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIPE 5 SHHMTG + ++++ + P + PDG A+ T+ ++ Q+VL++P+ Sbjct: 199 ASHHMTGDCSILVDVFDIIPSAVTKPDGKASCATKCVTLLLSSSYKLQDVLFVPD 253 Score = 28.9 bits (63), Expect(2) = 3e-14 Identities = 11/37 (29%), Positives = 23/37 (62%) Frame = -2 Query: 845 FGSVRSHIIQEEPLPKIKTIFVLICKEEQHQNLARAE 735 F +RS I E+PLP ++ + + +Q+ ++AR++ Sbjct: 15 FAPIRSKITDEDPLPSHNRVYSRVIRGQQNLDVARSK 51 >ref|XP_004234727.1| PREDICTED: uncharacterized protein LOC101248080 [Solanum lycopersicum] Length = 422 Score = 82.0 bits (201), Expect = 3e-13 Identities = 67/224 (29%), Positives = 92/224 (41%), Gaps = 3/224 (1%) Frame = -3 Query: 670 SMEAMLTTPTSHKSTCSYCRISGHEIANCYQLIGFPDWWERNRAKASRGSSLDRDRERTG 491 ++E P +K T +C +GH C+ LIGFP+ R G Sbjct: 190 TVETQPKPPLKYKFT--HCGKNGHSNERCFLLIGFPNGGRRGH----------------G 231 Query: 490 GGRSNSYPHATRGKGHGQGRAHSGWADAVIDNGGRLQAMSGAHNDWVNAAIEGGRGLQIT 311 
GGR G+G GR S GR M+ ++ + A+ Sbjct: 232 GGR-----RGRGGRGLPSGREQSS---------GRTGGMAAHADNPTSRAVR-------- 269 Query: 310 APGTSANSGIPGLSTEQWKSLLNILQNQANSNRLSCKVVIT---WIFYSGCSHHMTGTGD 140 G S GLSTE+ LLN+L S + V W+ SG SHHMTG Sbjct: 270 -TGNSQGGNFLGLSTEKMTRLLNMLDTPTQSGNNTGTVHALSPDWLIDSGASHHMTGNFS 328 Query: 139 LFMNLYPVSPYIIRLPDGTKVVASGLGTVCAG*NFIFQNVLYIP 8 ++ P+ I LPDGT+VVA+ G+V N I NVL++P Sbjct: 329 SLYDIMPIPECSIGLPDGTRVVANYCGSVQISVNLILNNVLFVP 372