BLASTX nr result
ID: Forsythia21_contig00050045
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00050045 (864 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_011073037.1| PREDICTED: retrovirus-related Pol polyprotei...   148   7e-43
ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969...   114   9e-33
ref|XP_011100917.1| PREDICTED: uncharacterized protein LOC105179...    92   8e-25
gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidop...   101   3e-21
dbj|BAA96887.1| copia-like retroelement pol polyprotein [Arabido...    99   1e-20
gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposo...    95   4e-20
emb|CDX93457.1| BnaA06g05910D [Brassica napus]                         99   1e-19
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi...    94   4e-19
ref|XP_009107606.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...    91   2e-18
emb|CAN70013.1| hypothetical protein VITISV_017116 [Vitis vinifera]    91   3e-18
gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Ar...    86   1e-17
emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera]    83   2e-17
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi...    87   6e-17
gb|ABD96963.1| hypothetical protein [Cleome spinosa]                   86   8e-17
emb|CAA20201.1| putative transposable element [Arabidopsis thali...    94   2e-16
ref|XP_010692517.1| PREDICTED: uncharacterized protein LOC104905...    80   1e-15
gb|AAK29467.1| polyprotein-like [Solanum chilense]                     81   1e-15
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...    78   1e-15
emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera]    82   2e-15
emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]    89   4e-15

>ref|XP_011073037.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Sesamum indicum] Length = 472 Score = 148 bits (373), Expect(2) = 7e-43 Identities = 79/224 (35%), Positives = 125/224 (55%), Gaps = 2/224 (0%) Frame = +1 Query: 199 VLERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNA 378 +LE+ F +K+D SK++D+N+D F KL+QDI G+ +EY I+LLN+IPE + +VK A Sbjct: 99 LLEKIFRYKLDLSKNIDENLDDFTKLIQDIKLAGDKYIDEYSPIVLLNAIPESFSDVKAA 158 Query: 379 IKYGRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGEIHTMRGRTQFRQTGSP-DYQXXXXX 555 IKYGRD++ E V++ L+SKE+++K K + EI+++RGRT+F S + + Sbjct: 159 IKYGRDSINLETVVNGLKSKELDLKVNKPSQSHYEINSVRGRTRFGNFNSRYNSRSRSKT 218 Query: 556 XXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSS-SN 732 CY CG GH+IKDC K + + + D+ +S+ S Sbjct: 219 KTNRSKSRPRETNLRDDKIRDRRCYNCGTKGHYIKDCRKPRRENRDRNYDDKEKVSNVSI 278 Query: 733 SDQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVTSHK 864 G+V+V+ E N + +HEW++DSG FH++ K Sbjct: 279 ESNGEVFVVY-------EANSVSTFDMHEWLIDSGCTFHMSPFK 315 Score = 54.3 bits (129), Expect(2) = 7e-43 Identities = 27/58 (46%), Positives = 39/58 (67%) Frame = +2 Query: 26 SFTKEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 + T+E+K ++DE AYS+I+L+LSD VLRKVGK +++ S+PNKLFL Sbjct: 42 NITEEKKLENDEFAYSSIVLNLSDTVLRKVGKLESSKALWDKLEELFTEISLPNKLFL 99 >ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969411 [Erythranthe guttatus] Length = 1213 Score = 114 bits (286), Expect(2) = 9e-33 Identities = 52/111 (46%), Positives = 81/111 (72%) Frame = +1 Query: 199 VLERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNA 378 +LE+FF FK+D +KD+D+N+D F +LVQDI G+ + Y I+LLN+IP+ Y ++K+A Sbjct: 542 LLEKFFRFKLDLTKDIDENIDQFTRLVQDIKLTGDKSIDNYTPIVLLNAIPDSYNDLKSA 601 Query: 379 IKYGRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGEIHTMRGRTQFRQTGSP 531 IKYGRD ++ + VI+ L+SKEM+++ K +K GE++ +RGR Q R + P Sbjct: 602 
IKYGRDNISLDTVINGLKSKEMDLRVNKSNKSFGEVNFVRGRQQNRFSNKP 652 Score = 53.9 bits (128), Expect(2) = 9e-33 Identities = 28/56 (50%), Positives = 36/56 (64%) Frame = +2 Query: 32 TKEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 T +K + DELAYSAIIL+LSD+V+RKVG HD+ S+P+KLFL Sbjct: 487 TASKKIELDELAYSAIILNLSDSVIRKVGMHDSAKGLWEKLDELYTETSLPSKLFL 542 >ref|XP_011100917.1| PREDICTED: uncharacterized protein LOC105179028 [Sesamum indicum] Length = 188 Score = 92.0 bits (227), Expect(2) = 8e-25 Identities = 42/88 (47%), Positives = 64/88 (72%) Frame = +1 Query: 199 VLERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNA 378 +LE+ F +K+D SK +D+N+D F KL+QDI G+ +EY I+LLN+IP+ Y + K A Sbjct: 99 LLEKKFHYKLDLSKSIDENLDDFTKLIQDIKLTGDKNIDEYSPIVLLNAIPKSYSDAKAA 158 Query: 379 IKYGRDTLTPEIVIDSLRSKEMEMKAEK 462 IKYGRD++ + V++ L+SKEM++K K Sbjct: 159 IKYGRDSVNLDTVVNGLKSKEMDLKVSK 186 Score = 50.1 bits (118), Expect(2) = 8e-25 Identities = 27/63 (42%), Positives = 38/63 (60%) Frame = +2 Query: 26 SFTKEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFLC* 205 + + E+K +DE AYS+IIL+LSD VLRKVGK ++ S+P+KLFL Sbjct: 42 NISDEKKIQNDEFAYSSIILNLSDNVLRKVGKQSSSKDLWEKLEDLYTETSLPSKLFLLE 101 Query: 206 RDF 214 + F Sbjct: 102 KKF 104 >gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana] Length = 1137 Score = 101 bits (251), Expect(2) = 3e-21 Identities = 61/220 (27%), Positives = 108/220 (49%), Gaps = 1/220 (0%) Frame = +1 Query: 208 RFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIKY 387 + ++++M SK L++NVD F K++ D+ N V +E +AI++L+++P+ Y +K +KY Sbjct: 91 KVYNYRMQDSKTLEENVDEFQKMISDLNNLQIQVPDEVQAILILSALPDSYDMLKETLKY 150 Query: 388 GRDTLTPEIVIDSLRSKEMEMK-AEKHDKKSGEIHTMRGRTQFRQTGSPDYQXXXXXXXX 564 GR+ + + VI + +SKE+E++ + + GE +RG++Q R + P Sbjct: 151 GREGIKLDDVISAAKSKELELRDSSGGSRPVGEGLYVRGKSQARGSDGP----------- 199 Query: 565 XXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNSDQG 744 C+ CG GHF + C+K K G E ++ D Sbjct: 200 
------------KSTEGKKVCWICGKEGHFKRQCYKWLEKNKANGAGETALVKDDAQD-- 245 Query: 745 KVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVTSHK 864 + L+ S V+ +E K EW++D+G FH+T K Sbjct: 246 -LVGLVASEVNMSE----GKDDQEEWIMDTGCSFHMTPRK 280 Score = 28.5 bits (62), Expect(2) = 3e-21 Identities = 14/50 (28%), Positives = 22/50 (44%) Frame = +2 Query: 50 DSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 D DE A I +++ D VLR + T S+PN+++L Sbjct: 39 DQDEKAMDMIFINVGDKVLRNIENSKTAAEAWATLDKLYLVKSLPNRVYL 88 >dbj|BAA96887.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana] Length = 1140 Score = 99.4 bits (246), Expect(2) = 1e-20 Identities = 68/221 (30%), Positives = 107/221 (48%), Gaps = 4/221 (1%) Frame = +1 Query: 208 RFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIKY 387 +F+SF+M SK +D NVD F ++V ++ + V+EE +AI++LNS+P Y ++K+ +KY Sbjct: 133 KFYSFRMMTSKTIDQNVDDFLRIVAELGSLDIKVAEEVQAILILNSLPVTYDQLKHTLKY 192 Query: 388 GRDTLTPEIVIDSLRSKEMEMKAEKHDKK--SGEIHTM-RGRTQFR-QTGSPDYQXXXXX 555 G TL+ + V+ S +S E EM K + K + ++T RGR Q R Q GS Sbjct: 193 GNKTLSVKDVVSSSKSLEREMAELKENTKVVNTTLYTAERGRPQTRNQNGS-----QGNN 247 Query: 556 XXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNS 735 C+ C GH KDC K K N Sbjct: 248 QGNNQGKNQGKGKSRSNSKSRVTCWFCKKEGHVKKDCFARKKK-------------FENE 294 Query: 736 DQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVTS 858 +QG+ V+ +V + L++ + +WV+DSG +H+TS Sbjct: 295 EQGEAGVITEKLVYSEALSMHDQEAKEKWVIDSGCTYHITS 335 Score = 28.9 bits (63), Expect(2) = 1e-20 Identities = 16/51 (31%), Positives = 23/51 (45%) Frame = +2 Query: 44 KEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLF 196 K + E A + II H+SD VLRKV T +PN+++ Sbjct: 79 KIEKSEQAMNIIINHISDTVLRKVNHCKTAATLWELLNELYMETLLPNRIY 129 >gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 337 Score = 94.7 bits (234), Expect(2) = 4e-20 Identities = 59/219 (26%), Positives = 103/219 (47%), Gaps = 2/219 (0%) Frame = +1 Query: 205 ERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIK 384 +R 
KM++ + ++V +F K V D+ + + EE +A++LL S+P ++ + + + Sbjct: 104 KRLHQLKMEEGSSIKEHVSLFTKAVLDLKSVDVRIDEEDQAVMLLCSLPSSFENLVDTML 163 Query: 385 YGRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGEIHTM--RGRTQFRQTGSPDYQXXXXXX 558 +GRDTLT E V +L S+E++ K ++ + G+ + RGR + R + S + + Sbjct: 164 FGRDTLTLEEVKATLNSRELKKKITENKGEGGDPEALMARGRLEKRDSKSKNKR------ 217 Query: 559 XXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNSD 738 CY C GHF K+C + K K GK DE ++ Sbjct: 218 -------------RSKYKNEKACYYCKKEGHFRKECPERKKKNNGKYNDESDIA------ 258 Query: 739 QGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVT 855 V+ + L+++ K EW+LDSG FH+T Sbjct: 259 -----VVADGYESAEVLSISTKKHSEEWILDSGCSFHMT 292 Score = 31.6 bits (70), Expect(2) = 4e-20 Identities = 18/58 (31%), Positives = 29/58 (50%) Frame = +2 Query: 26 SFTKEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 + + ++K+D A+S IIL L D VLR+V + + S+ NKL+L Sbjct: 45 TLSDKEKKDLLSKAHSTIILSLGDEVLREVAEEKSAAGIWLKLESLYMTKSLTNKLYL 102 >emb|CDX93457.1| BnaA06g05910D [Brassica napus] Length = 1205 Score = 99.4 bits (246), Expect(2) = 1e-19 Identities = 71/233 (30%), Positives = 110/233 (47%), Gaps = 9/233 (3%) Frame = +1 Query: 193 FLVLERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVK 372 F + +RF+S+KM+ K+LD N+DVF KLV D+ + +SEE +A+ILLNS+P ++ + Sbjct: 115 FYLKQRFYSYKMEDDKNLDKNLDVFTKLVSDLASLDVELSEEDQAVILLNSLPRRFEPLV 174 Query: 373 NAIKYGRD--TLTPEIVIDSLRSKEMEMKAEKHDKKSGE-----IHTMRGRTQFRQTGSP 531 + +KYGRD T+T + + + S E++MKA+ SG RGR++ + T Sbjct: 175 HTLKYGRDQETITLKEITRAAYSIELDMKAKGSSGSSGTSGEGLYFQSRGRSEKKTTSK- 233 Query: 532 DYQXXXXXXXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEV 711 C+ CG GHF ++C K +N G + E Sbjct: 234 ---------------VKSNSRSKSRPKFKKTCWVCGVEGHFKRECPK-RNNSNGSVKAEA 277 Query: 712 NVMSSSNSDQGKVYVLINSVVDTAELNLTAKSRLHE--WVLDSGAYFHVTSHK 864 +V + + L LTA L+ WVLDSG +H+T K Sbjct: 278 SVGKAEYHE---------------PLMLTASCHLNREGWVLDSGCSYHMTFRK 315 Score = 25.0 bits (53), Expect(2) = 1e-19 Identities = 14/50 (28%), Positives = 22/50 (44%) Frame = +2 Query: 50 
DSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 + D + + + LSD VLRKV K + S+PN+ +L Sbjct: 68 EKDRRVRNLLSMSLSDMVLRKVIKSRSALEMWTALEATYQIKSLPNRFYL 117 >gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 838 Score = 94.4 bits (233), Expect(2) = 4e-19 Identities = 65/231 (28%), Positives = 112/231 (48%), Gaps = 11/231 (4%) Frame = +1 Query: 205 ERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIK 384 +R + +KM S +++NV+ F KL+ D+ N +V +E +AI+LL S+P+ + ++K+ +K Sbjct: 117 QRLYGYKMSDSMTIEENVNDFFKLISDLENVKVSVPDEDQAIVLLMSLPKQFDQLKDTLK 176 Query: 385 YGRDTLTPEIVIDSLRSKEMEMKAE-KHDKKSGEIHTM--RGRTQFRQTGSPDYQXXXXX 555 YG+ TL + + ++RSK +E+ A K K S + + RGR++ R S Sbjct: 177 YGKTTLALDEITGAIRSKVLELGASGKMLKNSSDALFVQDRGRSEKRDKSS--------- 227 Query: 556 XXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCH--KAKNKQTGKGRDEVNVMSSS 729 C+ CG GHF K C+ K KNK+ + Sbjct: 228 -------ERNKSQSRSKSREKKVCWVCGKEGHFKKQCYVWKEKNKK------------GN 268 Query: 730 NSDQGKVYVLINSVVDTAELNLTAKSRL------HEWVLDSGAYFHVTSHK 864 NS++G+ +I D A L + +S +EW++D+G FH+T + Sbjct: 269 NSEKGESSNVIGQAADAAALAVREESNADNQEVDNEWIMDTGCSFHMTPRR 319 Score = 28.5 bits (62), Expect(2) = 4e-19 Identities = 16/45 (35%), Positives = 22/45 (48%) Frame = +2 Query: 65 AYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 A S +IL L + VLRKV K T S+PN+++L Sbjct: 71 ARSTVILSLGNHVLRKVIKEKTAAGMIRVLDKLFMAKSLPNRIYL 115 >ref|XP_009107606.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103833265 [Brassica rapa] Length = 3353 Score = 90.9 bits (224), Expect(2) = 2e-18 Identities = 70/232 (30%), Positives = 112/232 (48%), Gaps = 9/232 (3%) Frame = +1 Query: 196 LVLERFFS-FKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVK 372 + L+R FS +KM++ K +++NVDVF KL+ D+ + T+++E +AI LL+ +P Y+++ Sbjct: 2101 IYLKRQFSCYKMEEDKSIEENVDVFLKLIADLESLKVTITDEEQAIQLLSGLPAAYEQLV 2160 Query: 373 NAIKY--GRDTLTPEIVIDSLRSKEMEMKA-----EKHDKKSGEIHTMRGRTQFRQTGSP 531 + ++Y GRDTLT V+ S SKE E++ +K G RGR+ R Sbjct: 2161 
HTLQYGTGRDTLTVSEVVTSAYSKEAELRQKGLLNKKKPTSEGLYVESRGRSSKRTENGN 2220 Query: 532 DYQXXXXXXXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDE- 708 + + C+ CG GH+ +DC NK G ++ Sbjct: 2221 NKK----YNRSRSRGDRSKSKGKSDKTKKGACFSCGKEGHWKRDC---PNKGHMNGSEQA 2273 Query: 709 VNVMSSSNSDQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVTSHK 864 VN + S+ + VL S+ D+ +EWVLDSG FH+T K Sbjct: 2274 VNAV----SEMRQPLVLTASIHDSR----------NEWVLDSGCTFHITPDK 2311 Score = 29.6 bits (65), Expect(2) = 2e-18 Identities = 15/54 (27%), Positives = 25/54 (46%) Frame = +2 Query: 38 EQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 ++K D D S + LSD++LRK+ T S+PN+++L Sbjct: 2050 QKKADKDLRVRSLLCTCLSDSILRKIMNEQTALGMWKSLEKLYQLKSLPNRIYL 2103 Score = 90.9 bits (224), Expect = 1e-15 Identities = 70/232 (30%), Positives = 112/232 (48%), Gaps = 9/232 (3%) Frame = +1 Query: 196 LVLERFFS-FKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVK 372 + L+R FS +KM++ K +++NVDVF KL+ D+ + T+++E +AI LL+ +P Y+++ Sbjct: 26 IYLKRQFSCYKMEEDKSIEENVDVFLKLIADLESLKVTITDEEQAIQLLSGLPAAYEQLV 85 Query: 373 NAIKY--GRDTLTPEIVIDSLRSKEMEMKA-----EKHDKKSGEIHTMRGRTQFRQTGSP 531 + ++Y GRDTLT V+ S SKE E++ +K G RGR+ R Sbjct: 86 HTLQYGTGRDTLTVSEVVTSAYSKEAELRQKGLLNKKKPTSEGLYVESRGRSSKRTENGN 145 Query: 532 DYQXXXXXXXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDE- 708 + + C+ CG GH+ +DC NK G ++ Sbjct: 146 NKK----YNRSRSRGDRSKSKGKSDKTKKGACFSCGKEGHWKRDC---PNKGHMNGSEQA 198 Query: 709 VNVMSSSNSDQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVTSHK 864 VN + S+ + VL S+ D+ +EWVLDSG FH+T K Sbjct: 199 VNAV----SEMRQPLVLTASIHDSR----------NEWVLDSGCTFHITPDK 236 >emb|CAN70013.1| hypothetical protein VITISV_017116 [Vitis vinifera] Length = 947 Score = 90.5 bits (223), Expect(2) = 3e-18 Identities = 62/215 (28%), Positives = 99/215 (46%) Frame = +1 Query: 208 RFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIKY 387 + ++FK+ ++++ D FNK++ D+ N TVS E KAI+LL S+ Y +K AI Y Sbjct: 150 KLYTFKITPGMSIEEHFDHFNKIILDLENIDITVSNEDKAILLLTSLDASYTNMKEAIMY 209 Query: 388 
GRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGEIHTMRGRTQFRQTGSPDYQXXXXXXXXX 567 GRD++T + V L +E++ + E D +SGE +RGR R+ + + Sbjct: 210 GRDSMTFDEVQSILHPRELQKQEESKD-ESGEGLNIRGRYDKREKKCKNLK--------- 259 Query: 568 XXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNSDQGK 747 C+ C GHF KDC +K+ + VN +G Sbjct: 260 --------AKSKSNTKKFKCFICHKEGHFKKDC---SDKRQNTIKKTVN--------EGD 300 Query: 748 VYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHV 852 V+++ LN+ EW+LDSG FH+ Sbjct: 301 AAVILDGYDSAKVLNVAEMDSGKEWILDSGCSFHM 335 Score = 29.3 bits (64), Expect(2) = 3e-18 Identities = 19/59 (32%), Positives = 27/59 (45%) Frame = +2 Query: 17 LFRSFTKEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKL 193 L + +++K E A SAIIL L D +LR+V K T S+ N+L Sbjct: 87 LLNTMQEKEKTKLLEKAQSAIILSLGDTMLREVAKAKPTAELWLKLESLYMTKSLANRL 145 >gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana] Length = 1356 Score = 86.3 bits (212), Expect(2) = 1e-17 Identities = 63/223 (28%), Positives = 98/223 (43%), Gaps = 4/223 (1%) Frame = +1 Query: 208 RFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIKY 387 + +SFKM + +D NVD F ++V ++ + V EE +AI++LNS+P + ++K+ +KY Sbjct: 129 KLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTLKY 188 Query: 388 GRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGE----IHTMRGRTQFRQTGSPDYQXXXXX 555 G TLT + V S +S E E+ AE D G+ T RGR R Sbjct: 189 GNKTLTVQDVTSSAKSLEREL-AEAVDLDKGQAAVLYTTERGRPLVRNN----------- 236 Query: 556 XXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNS 735 C+ C GH KDC+ K K +G Sbjct: 237 ----QKGGQGKGRSRSNSKTKVPCWYCKKEGHVKKDCYSRKKKMESEG------------ 280 Query: 736 DQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVTSHK 864 QG+ V+ +V + L++ + W+LDSG H+TS + Sbjct: 281 -QGEAGVITEKLVFSEALSVNEQMVKDLWILDSGCTSHMTSRR 322 Score = 31.6 bits (70), Expect(2) = 1e-17 Identities = 17/51 (33%), Positives = 25/51 (49%) Frame = +2 Query: 44 KEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLF 196 K + E A + II H+SD VL KV + TT S+PN+++ Sbjct: 75 KIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIY 125 
>emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera] Length = 939 Score = 82.8 bits (203), Expect(2) = 2e-17 Identities = 43/100 (43%), Positives = 65/100 (65%) Frame = +1 Query: 205 ERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIK 384 ER + FKM + + + DN+D F K+V ++ N G V +E KA+++L S+P +Y K +K Sbjct: 105 ERLYGFKMQEDRSIADNLDDFAKIVLEMSNIGIKVDDEDKAVLVLKSLPGLYSNFKETMK 164 Query: 385 YGRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGEIHTMRGR 504 YGR TLT E V +LRSKE+E+K + +GE ++RGR Sbjct: 165 YGRKTLTLEEVQSALRSKELELKKK---GSNGEGLSIRGR 201 Score = 34.7 bits (78), Expect(2) = 2e-17 Identities = 19/58 (32%), Positives = 27/58 (46%) Frame = +2 Query: 26 SFTKEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 + T ++K D DE AY IIL L D LR+ + T S+ N+L+L Sbjct: 46 TMTAKEKSDIDEKAYHLIILALGDKALREFSEETTAKGVWNKLEQLYMQNSLSNRLYL 103 >dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana] Length = 1342 Score = 86.7 bits (213), Expect(2) = 6e-17 Identities = 63/231 (27%), Positives = 104/231 (45%), Gaps = 11/231 (4%) Frame = +1 Query: 205 ERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIK 384 +R + +KM ++ +++NV+ F KL+ D+ N V +E +AI+LL S+P + ++K +K Sbjct: 125 QRLYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLK 184 Query: 385 YGRDTLTPEIVIDSLRSKEMEMKAE---KHDKKSGEIHTMRGRTQFRQTGSPDYQXXXXX 555 Y + TL E + ++RSK +E+ A + G RGR++ R G Sbjct: 185 YCKTTLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETRGKG---------- 234 Query: 556 XXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCH--KAKNKQTGKGRDEVNVMSSS 729 C+ CG GHF K C+ K +NKQ S Sbjct: 235 -------PNKNKSRSKSKGAGKTCWICGKEGHFKKQCYVWKERNKQ------------GS 275 Query: 730 NSDQGKVYVLINSVVDTAELNLT------AKSRLHEWVLDSGAYFHVTSHK 864 S++G+ + V D A L ++ A+ W+LD+G FH+T K Sbjct: 276 TSERGEASTVTARVTDAAALVVSRALLGFAEVTPDTWILDTGCSFHMTCRK 326 Score = 28.9 bits (63), Expect(2) = 6e-17 Identities = 17/45 (37%), Positives = 22/45 (48%) Frame = +2 Query: 65 AYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 A S IIL L + VLRKV K T S+PN+++L Sbjct: 79 
ARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYL 123 >gb|ABD96963.1| hypothetical protein [Cleome spinosa] Length = 408 Score = 85.9 bits (211), Expect(2) = 8e-17 Identities = 48/181 (26%), Positives = 92/181 (50%), Gaps = 1/181 (0%) Frame = +1 Query: 199 VLERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNA 378 +++R F+MD S+ +++N+D+F KL+ D+ + V EEY+A+ LLNS+P Y++++ Sbjct: 142 LMQRVSGFRMDSSRTIEENLDIFQKLLSDLHSLNVKVEEEYQAVYLLNSLPPAYEQLREV 201 Query: 379 IKYGRDTLTPEIVIDSLRSKEMEMKAE-KHDKKSGEIHTMRGRTQFRQTGSPDYQXXXXX 555 +KY R T++ E V + R KE+E+ A+ + +GE ++G+ + G + Sbjct: 202 LKYSRATISVEEVKAAARMKELELLAQGTLTRGTGEGLVVKGKPEKSGGGKKKAK----- 256 Query: 556 XXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNS 735 C+ CG GH+ K+C + K+ +G+ V + +S Sbjct: 257 -------------------DQVECWYCGKKGHYKKECRSRRAKEETEGKGVVASVQEYDS 297 Query: 736 D 738 + Sbjct: 298 E 298 Score = 29.3 bits (64), Expect(2) = 8e-17 Identities = 15/54 (27%), Positives = 27/54 (50%) Frame = +2 Query: 38 EQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 +++++ A + I+L L+D VLRKV T S+PN+++L Sbjct: 89 KERQERSRRARNLIVLALADQVLRKVISERTAFGIWRKLERLHIEQSLPNRMYL 142 >emb|CAA20201.1| putative transposable element [Arabidopsis thaliana] gi|7268932|emb|CAB79135.1| putative transposable element [Arabidopsis thaliana] Length = 1308 Score = 93.6 bits (231), Expect = 2e-16 Identities = 63/221 (28%), Positives = 102/221 (46%), Gaps = 4/221 (1%) Frame = +1 Query: 208 RFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIKY 387 R +SFKM + +D N D F ++V ++ + V EE +AI++LNS+P Y ++K+ +KY Sbjct: 131 RLYSFKMVDNLSIDQNTDEFLRIVAELGSLQIQVGEEVQAILILNSLPPSYIQLKHTLKY 190 Query: 388 GRDTLTPEIVIDSLRSKEMEMKAEKHDKK---SGEIHTM-RGRTQFRQTGSPDYQXXXXX 555 G TL+ + V+ S +S E E+ +K + S ++T RGR Q + T Sbjct: 191 GNKTLSVQDVVSSAKSLERELSEQKETIRAPASTALYTAERGRPQTKNT----------- 239 Query: 556 XXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNS 735 C+ C GH KDC+ K K +G Sbjct: 240 ------QGQGKGRGRSNSKSRLTCWFCKKEGHVKKDCYAGKRKLENEG------------ 281 
Query: 736 DQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVTS 858 QGK V+ +V + L++ + +WV+DSG +H+TS Sbjct: 282 -QGKAGVITEKLVYSEALSMYDQEAKDKWVIDSGCTYHMTS 321 >ref|XP_010692517.1| PREDICTED: uncharacterized protein LOC104905624 [Beta vulgaris subsp. vulgaris] Length = 2676 Score = 79.7 bits (195), Expect(2) = 1e-15 Identities = 62/229 (27%), Positives = 101/229 (44%), Gaps = 6/229 (2%) Frame = +1 Query: 196 LVLERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKN 375 L+ R +S ++++ L ++D F +V D+ N + +E AI LL S+P YK + Sbjct: 99 LLKSRLYSLRLEEGNSLKSHIDEFYSIVMDLQNIDVILDDEDLAIWLLCSLPHSYKHFRE 158 Query: 376 AIKYGRDTLTPEIVIDSLRSKEM-EMKAEKHDKKSGEIHTMRGRTQFRQTGSPDYQXXXX 552 I YGRD L+ + V D+L +E+ + + S + +RGR+ + YQ Sbjct: 159 TILYGRDDLSIDDVRDALNQRELIDNQLTSKSSNSSDGLFVRGRS---NDVASTYQ---- 211 Query: 553 XXXXXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTG-----KGRDEVNV 717 C C GH +C+K KNKQ+G KG++ +N Sbjct: 212 -GGNGGKNRGRSKSKKPNSNKHKTCNYCHLKGHIKSECYKLKNKQSGDSKPRKGKEPMNS 270 Query: 718 MSSSNSDQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVTSHK 864 S YV +S +++ + EW++DSG FHVT +K Sbjct: 271 ADVS-------YVDFDSESGALDVSKS------EWIMDSGCSFHVTPYK 306 Score = 31.6 bits (70), Expect(2) = 1e-15 Identities = 20/58 (34%), Positives = 30/58 (51%) Frame = +2 Query: 26 SFTKEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 + T+E+ E+ D A AI L LS+ VLR+V K D++ S+ N+L L Sbjct: 43 TMTEEKWEELDLKALLAIQLCLSNEVLREVAKEDSSAGLWLKLESLYMAKSVTNRLLL 100 >gb|AAK29467.1| polyprotein-like [Solanum chilense] Length = 1328 Score = 80.9 bits (198), Expect(2) = 1e-15 Identities = 57/221 (25%), Positives = 97/221 (43%), Gaps = 4/221 (1%) Frame = +1 Query: 205 ERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIK 384 ++ ++ MD+ + +++V N L+ + N G + EE K I+LLNS+P Y + I Sbjct: 106 KQLYTLHMDEGTNFLSHLNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTIL 165 Query: 385 YGRDTLTPEIVIDS-LRSKEMEMKAEKHDKKSGEIHTMRGRTQFRQTGSPDYQXXXXXXX 561 +G+D++ + V + L +++M K E H G++ R + Q S +Y Sbjct: 166 HGKDSIQLKDVTSALLLNEKMRKKPENH----GQVFITESRGRSYQRSSSNY-------- 213 
Query: 562 XXXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDC---HKAKNKQTGKGRDEVNVMSSSN 732 CY C GHF +DC + K + +G+ D+ N Sbjct: 214 --GRSGARGKSKVRSKSKARNCYNCDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQN 271 Query: 733 SDQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVT 855 +D V +LIN + L T EWV+D+ A +H T Sbjct: 272 NDD--VVLLINEEEECMHLAGTES----EWVVDTAASYHAT 306 Score = 30.0 bits (66), Expect(2) = 1e-15 Identities = 18/58 (31%), Positives = 27/58 (46%) Frame = +2 Query: 26 SFTKEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 S E E+ DE A SAI LHL+D V+ + ++ ++ NKL+L Sbjct: 47 SMKLEDWEELDEKAASAIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYL 104 >sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease [Nicotiana tabacum] gi|20045|emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] Length = 1328 Score = 78.2 bits (191), Expect(2) = 1e-15 Identities = 57/220 (25%), Positives = 92/220 (41%), Gaps = 3/220 (1%) Frame = +1 Query: 205 ERFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIK 384 ++ ++ M + + +++VFN L+ + N G + EE KAI+LLNS+P Y + I Sbjct: 105 KQLYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTIL 164 Query: 385 YGRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGEIHTMRGRTQFRQTGSPDYQXXXXXXXX 564 +G+ T+ + V +L E K K + G+ GR + Q S +Y Sbjct: 165 HGKTTIELKDVTSALLLNE---KMRKKPENQGQALITEGRGRSYQRSSNNY--------- 212 Query: 565 XXXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDC---HKAKNKQTGKGRDEVNVMSSSNS 735 CY C GHF +DC K K + +G+ D+ N+ Sbjct: 213 -GRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNN 271 Query: 736 DQGKVYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHVT 855 D V + IN + L+ EWV+D+ A H T Sbjct: 272 D--NVVLFINEEEECMHLS----GPESEWVVDTAASHHAT 305 Score = 32.7 bits (73), Expect(2) = 1e-15 Identities = 20/54 (37%), Positives = 25/54 (46%) Frame = +2 Query: 38 EQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKLFL 199 E D DE A SAI LHLSD V+ + DT ++ NKL+L Sbjct: 50 
EDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYL 103 >emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera] Length = 1208 Score = 82.0 bits (201), Expect(2) = 2e-15 Identities = 53/187 (28%), Positives = 91/187 (48%) Frame = +1 Query: 208 RFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIKY 387 + ++FKM ++ ++D FNK++ D+ N T+S+E KAI+LL S+ Y +K+AI Y Sbjct: 106 KLYTFKMTPGMSIEXHLDHFNKIILDLENIDITISDEDKAILLLTSLDASYTNMKDAIMY 165 Query: 388 GRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGEIHTMRGRTQFRQTGSPDYQXXXXXXXXX 567 GRD+LT + V L ++E++ K E+ ++SGE +RGR++ R+ + + Sbjct: 166 GRDSLTFDEVQSILHARELQ-KQEESKEESGEGLNIRGRSEKREKKGKNSK--------- 215 Query: 568 XXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNSDQGK 747 C+ C GHF KDC + K + + S QG Sbjct: 216 --------SRSKSKTKKFKCFICHKEGHFKKDCPDRRQNTVKKTVNRWTRVRSGYLIQGA 267 Query: 748 VYVLINS 768 ++ + S Sbjct: 268 LFTCVLS 274 Score = 28.5 bits (62), Expect(2) = 2e-15 Identities = 18/53 (33%), Positives = 26/53 (49%) Frame = +2 Query: 35 KEQKEDSDELAYSAIILHLSDAVLRKVGKHDTTXXXXXXXXXXXXXXSIPNKL 193 ++QK + E A+SAIIL L D VLR+ K + S+ N+L Sbjct: 49 EKQKIELLEKAHSAIILSLGDTVLREXAKAKSAAEVWLKLESLYMTKSLANRL 101 >emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] Length = 894 Score = 89.0 bits (219), Expect = 4e-15 Identities = 60/215 (27%), Positives = 98/215 (45%) Frame = +1 Query: 208 RFFSFKMDQSKDLDDNVDVFNKLVQDIVNCGETVSEEYKAIILLNSIPEIYKEVKNAIKY 387 + ++FKM S +++++D FNK++ D+ N VS E KAI+LL S+ Y +K AI Y Sbjct: 106 KLYTFKMTPSMSIEEHLDHFNKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMY 165 Query: 388 GRDTLTPEIVIDSLRSKEMEMKAEKHDKKSGEIHTMRGRTQFRQTGSPDYQXXXXXXXXX 567 GRD LT + V L ++E+ K E+ ++ GE +RG+++ R+ + Sbjct: 166 GRDILTFDEVQSILHARELH-KQEESKEELGEGLNIRGKSKKREKKKGN----------- 213 Query: 568 XXXXXXXXXXXXXXXXXXXCYGCGNTGHFIKDCHKAKNKQTGKGRDEVNVMSSSNSDQGK 747 C+ C GHF KDC + K +E G Sbjct: 214 -----NSKSRSKSKTKKFKCFICHKEGHFKKDCPDMRQNTXKKTMNE-----------GD 257 Query: 748 VYVLINSVVDTAELNLTAKSRLHEWVLDSGAYFHV 852 ++++ + LN+ EW+LDSG FH+ Sbjct: 258 
ATMILDGYDNAGVLNVAEVDSGKEWILDSGCSFHM 292