BLASTX nr result
ID: Forsythia23_contig00034122
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00034122 (596 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidop...  115  2e-23
ref|XP_011073037.1| PREDICTED: retrovirus-related Pol polyprotei...  111  2e-22
emb|CAN71583.1| hypothetical protein VITISV_043292 [Vitis vinifera]  109  1e-21
ref|XP_010692517.1| PREDICTED: uncharacterized protein LOC104905...  108  1e-21
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi...  107  3e-21
ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969...  107  4e-21
ref|XP_010532694.1| PREDICTED: endo-1,3;1,4-beta-D-glucanase-lik...  107  6e-21
dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri]  104  4e-20
gb|KHN28812.1| Retrovirus-related Pol polyprotein from transposo...  103  6e-20
emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera]  103  8e-20
gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposo...  102  1e-19
emb|CDX93457.1| BnaA06g05910D [Brassica napus]  101  3e-19
gb|ABD96963.1| hypothetical protein [Cleome spinosa]  101  3e-19
ref|XP_006651032.1| PREDICTED: uncharacterized protein LOC102707...  100  5e-19
emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera]  100  5e-19
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi...  99  1e-18
emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana] ...  99  1e-18
ref|XP_009107606.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 
98 3e-18 emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] 98 3e-18 ref|XP_007028624.1| Uncharacterized protein TCM_024518 [Theobrom... 97 4e-18 >gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana] Length = 1137 Score = 115 bits (288), Expect = 2e-23 Identities = 64/176 (36%), Positives = 98/176 (55%), Gaps = 2/176 (1%) Frame = +1 Query: 4 WKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDEDL 183 W LD L+L K+LPN++YL K++++RM SK LE N+ +F ++ L + + DE Sbjct: 70 WATLDKLYLVKSLPNRVYLQLKVYNYRMQDSKTLEENVDEFQKMISDLNNLQIQVPDEVQ 129 Query: 184 AVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKSGVKE--ESMYVR 357 A+++L++LPDSY ++ +KY R+ + VI A +SKELE+R +S G + E +YVR Sbjct: 130 AILILSALPDSYDMLKETLKYGREGIKLDDVISAAKSKELELR-DSSGGSRPVGEGLYVR 188 Query: 358 GRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKA 525 G+S+ + + C+ CGK GHFKR CY K KA Sbjct: 189 GKSQARGSD--------------GPKSTEGKKVCWICGKEGHFKRQCYKWLEKNKA 230 >ref|XP_011073037.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Sesamum indicum] Length = 472 Score = 111 bits (278), Expect = 2e-22 Identities = 64/214 (29%), Positives = 106/214 (49%), Gaps = 21/214 (9%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 LW L+ LF +LPNK++LLEK+F +++D SK+++ NL DF+ +++ + K+ DE Sbjct: 80 LWDKLEELFTEISLPNKLFLLEKIFRYKLDLSKNIDENLDDFTKLIQDIKLAGDKYIDEY 139 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKSGVKEESMYVRG 360 ++LLN++P+S+ DV+ AIKY RD++ V++ L+SKEL+++ S E VRG Sbjct: 140 SPIVLLNAIPESFSDVKAAIKYGRDSINLETVVNGLKSKELDLKVNKPSQSHYEINSVRG 199 Query: 361 R----------SEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDC---- 498 R + + CY+CG GH+ +DC Sbjct: 200 RTRFGNFNSRYNSRSRSKTKTNRSKSRPRETNLRDDKIRDRRCYNCGTKGHYIKDCRKPR 259 Query: 499 -------YDLKNKQKAQALEPQNNAALEAEIDSL 579 YD K K ++E + E +S+ Sbjct: 260 RENRDRNYDDKEKVSNVSIESNGEVFVVYEANSV 293 >emb|CAN71583.1| hypothetical protein VITISV_043292 [Vitis vinifera] Length = 531 Score = 109 bits 
(272), Expect = 1e-21 Identities = 53/163 (32%), Positives = 92/163 (56%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +WK L+ +++ K+LPN+IYL +K F F+M K ++ + +F+ +V L K +DED Sbjct: 28 IWKKLEEIYMLKSLPNRIYLKQKFFGFKMHEKKSIDEIIDEFTKLVVDLESLCVKIEDED 87 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKSGVKEESMYVRG 360 AV+LLNSLP + +R+ +KY++D+L+ V+ A+ +K++++R E E + VRG Sbjct: 88 QAVLLLNSLPKVFDQLRDTLKYSKDSLSLEGVVSAIHAKKMDIRVERIGTTSSEGLVVRG 147 Query: 361 RSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFK 489 R +K++ N C+HC K GH++ Sbjct: 148 RPKKRNNN---QKGNHDKSKSKSKSKSKFTRKCFHCHKEGHYR 187 >ref|XP_010692517.1| PREDICTED: uncharacterized protein LOC104905624 [Beta vulgaris subsp. vulgaris] Length = 2676 Score = 108 bits (271), Expect = 1e-21 Identities = 67/191 (35%), Positives = 97/191 (50%), Gaps = 7/191 (3%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 LW L++L++ K++ N++ L +L+S R++ L++++ +F IV L + D DDED Sbjct: 81 LWLKLESLYMAKSVTNRLLLKSRLYSLRLEEGNSLKSHIDEFYSIVMDLQNIDVILDDED 140 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKEL-EVRKESKSGVKEESMYVR 357 LA+ LL SLP SYK R I Y RD L+ V DAL +EL + + SKS + ++VR Sbjct: 141 LAIWLLCSLPHSYKHFRETILYGRDDLSIDDVRDALNQRELIDNQLTSKSSNSSDGLFVR 200 Query: 358 GRSEKKDQNY-XXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKA--- 525 GRS Y C +C GH K +CY LKNKQ Sbjct: 201 GRSNDVASTYQGGNGGKNRGRSKSKKPNSNKHKTCNYCHLKGHIKSECYKLKNKQSGDSK 260 Query: 526 --QALEPQNNA 552 + EP N+A Sbjct: 261 PRKGKEPMNSA 271 >gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 838 Score = 107 bits (268), Expect = 3e-21 Identities = 60/175 (34%), Positives = 95/175 (54%), Gaps = 3/175 (1%) Frame = +1 Query: 7 KALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDEDLA 186 + LD LF+ K+LPN+IYL ++L+ ++M S +E N++DF ++ L + DED A Sbjct: 98 RVLDKLFMAKSLPNRIYLKQRLYGYKMSDSMTIEENVNDFFKLISDLENVKVSVPDEDQA 157 Query: 187 VILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKS-GVKEESMYV--R 357 
++LL SLP + +++ +KY + TL + A+RSK LE+ K ++++V R Sbjct: 158 IVLLMSLPKQFDQLKDTLKYGKTTLALDEITGAIRSKVLELGASGKMLKNSSDALFVQDR 217 Query: 358 GRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQK 522 GRSEK+D++ C+ CGK GHFK+ CY K K K Sbjct: 218 GRSEKRDKS-------SERNKSQSRSKSREKKVCWVCGKEGHFKKQCYVWKEKNK 265 >ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969411 [Erythranthe guttatus] Length = 1213 Score = 107 bits (267), Expect = 4e-21 Identities = 49/128 (38%), Positives = 83/128 (64%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 LW+ LD L+ +LP+K++LLEK F F++D +KD++ N+ F+ +V+ + K D Sbjct: 523 LWEKLDELYTETSLPSKLFLLEKFFRFKLDLTKDIDENIDQFTRLVQDIKLTGDKSIDNY 582 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKSGVKEESMYVRG 360 ++LLN++PDSY D+++AIKY RD ++ VI+ L+SKE+++R + E +VRG Sbjct: 583 TPIVLLNAIPDSYNDLKSAIKYGRDNISLDTVINGLKSKEMDLRVNKSNKSFGEVNFVRG 642 Query: 361 RSEKKDQN 384 R + + N Sbjct: 643 RQQNRFSN 650 >ref|XP_010532694.1| PREDICTED: endo-1,3;1,4-beta-D-glucanase-like [Tarenaya hassleriana] Length = 321 Score = 107 bits (266), Expect = 6e-21 Identities = 66/190 (34%), Positives = 98/190 (51%), Gaps = 1/190 (0%) Frame = +1 Query: 4 WKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDEDL 183 W + + L K+LPNK+YL ++L F+MDA+K +E N+ F +V L++ + ED Sbjct: 128 WGSKVVVELAKSLPNKMYLKQRLIDFKMDATKSIEDNVDVFKKLVNDLSNLKIEVAKEDQ 187 Query: 184 AVILLNSLPDSYKDVRNAIKY-ARDTLTQTIVIDALRSKELEVRKESKSGVKEESMYVRG 360 +ILLNSLPD Y +++ ++Y R+T+T + SKELE+ + + E + VRG Sbjct: 188 VLILLNSLPDQYDQLKDTLRYNRRETITLDEITSVAYSKELELAAKG-TRATAEGLVVRG 246 Query: 361 RSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKAQALEP 540 RSEK++ C+ CGK GHFKRDC K + QA E Sbjct: 247 RSEKRNST-----GNKSRKKSRSKSRSKSETECWFCGKEGHFKRDCRSRKKHFEEQAKEI 301 Query: 541 QNNAALEAEI 570 AA+ I Sbjct: 302 GEAAAVTQSI 311 >dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri] Length = 605 Score = 104 bits (259), Expect = 4e-20 Identities = 63/200 (31%), Positives = 110/200 (55%), 
Gaps = 2/200 (1%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 LWK L+ L++TK+L N++YL + L+SF+M K ++ + F ++ L + + K +DED Sbjct: 102 LWKKLEELYMTKSLANRLYLKQALYSFKMIEEKAIDEQMDQFIKLILDLENIEVKIEDED 161 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKS-GVKEESMYVR 357 A++L+ +LP SY ++ + Y R+TLT V AL+SK+L R ++K+ G E++YV+ Sbjct: 162 QALLLVCALPRSYNTFKDTLLYGRETLTLKEVQAALKSKQLNTRIDNKAVGSTSEALYVK 221 Query: 358 GRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKAQALE 537 G+ E+K + C++C + GH ++C K + K + +E Sbjct: 222 GKGEEKKTH------------KERKNKSKKKVKCFYCDEEGHMCKNC-PKKERDKGKKVE 268 Query: 538 PQNNAALEAE-IDSLHVMTV 594 Q AA+ E +S V+ V Sbjct: 269 -QGEAAMACESYESADVLAV 287 >gb|KHN28812.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 257 Score = 103 bits (257), Expect = 6e-20 Identities = 59/181 (32%), Positives = 103/181 (56%), Gaps = 2/181 (1%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +W L++L++TK+L N++ L + L++F+M S+ L DF+ I+ L + + K ++ED Sbjct: 81 MWLKLESLYMTKSLANRLCLRQHLYTFKMIESRTTTEQLVDFNKIIDDLENIEVKLEEED 140 Query: 181 LAVILLNSLPDSYKDVRNAIKYARD-TLTQTIVIDALRSKELEVRKESKSGVKEESMYV- 354 A++LLNSLP S++ ++AI Y +D +T V ++R+KE++ +++SKS ES+ + Sbjct: 141 KALLLLNSLPKSFEHFKDAILYGKDQDITLEEVQTSIRTKEMQKQQDSKSEDNGESLNIS 200 Query: 355 RGRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKAQAL 534 RGR+EKK C++C K GHFK+DC D K+ + Sbjct: 201 RGRNEKKGTR----RKKSRSRSTDSKNGQKTKFKCFNCHKTGHFKKDCPDKIKKRSLDSA 256 Query: 535 E 537 + Sbjct: 257 D 257 >emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera] Length = 1208 Score = 103 bits (256), Expect = 8e-20 Identities = 58/168 (34%), Positives = 94/168 (55%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +W L++L++TK+L N+++ KL++F+M +E +L F+ I+ L + D DED Sbjct: 84 VWLKLESLYMTKSLANRLHKKIKLYTFKMTPGMSIEXHLDHFNKIILDLENIDITISDED 143 Query: 181 
LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKSGVKEESMYVRG 360 A++LL SL SY ++++AI Y RD+LT V L ++EL+ ++ESK E + +RG Sbjct: 144 KAILLLTSLDASYTNMKDAIMYGRDSLTFDEVQSILHARELQKQEESKE-ESGEGLNIRG 202 Query: 361 RSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYD 504 RSEK+++ C+ C K GHFK+DC D Sbjct: 203 RSEKREKK--------GKNSKSRSKSKTKKFKCFICHKEGHFKKDCPD 242 >gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 337 Score = 102 bits (255), Expect = 1e-19 Identities = 59/200 (29%), Positives = 105/200 (52%), Gaps = 2/200 (1%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +W L++L++TK+L NK+YL ++L +M+ ++ ++S F+ V L D + D+ED Sbjct: 83 IWLKLESLYMTKSLTNKLYLKKRLHQLKMEEGSSIKEHVSLFTKAVLDLKSVDVRIDEED 142 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKEL--EVRKESKSGVKEESMYV 354 AV+LL SLP S++++ + + + RDTLT V L S+EL ++ + G E++ Sbjct: 143 QAVMLLCSLPSSFENLVDTMLFGRDTLTLEEVKATLNSRELKKKITENKGEGGDPEALMA 202 Query: 355 RGRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKAQAL 534 RGR EK+D CY+C K GHF+++C + K K + Sbjct: 203 RGRLEKRDSK----------SKNKRRSKYKNEKACYYCKKEGHFRKECPERKKKNNGKYN 252 Query: 535 EPQNNAALEAEIDSLHVMTV 594 + + A + +S V+++ Sbjct: 253 DESDIAVVADGYESAEVLSI 272 >emb|CDX93457.1| BnaA06g05910D [Brassica napus] Length = 1205 Score = 101 bits (251), Expect = 3e-19 Identities = 65/187 (34%), Positives = 97/187 (51%), Gaps = 10/187 (5%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +W AL+ + K+LPN+ YL ++ +S++M+ K+L+ NL F+ +V LA D + +ED Sbjct: 98 MWTALEATYQIKSLPNRFYLKQRFYSYKMEDDKNLDKNLDVFTKLVSDLASLDVELSEED 157 Query: 181 LAVILLNSLPDSYKDVRNAIKYARD--TLTQTIVIDALRSKELEVR---KESKSGVKEES 345 AVILLNSLP ++ + + +KY RD T+T + A S EL+++ SG E Sbjct: 158 QAVILLNSLPRRFEPLVHTLKYGRDQETITLKEITRAAYSIELDMKAKGSSGSSGTSGEG 217 Query: 346 MYV--RGRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQ 519 +Y RGRSEKK + C+ CG GHFKR+C N Sbjct: 218 
LYFQSRGRSEKKTTS-------KVKSNSRSKSRPKFKKTCWVCGVEGHFKRECPKRNNSN 270 Query: 520 ---KAQA 531 KA+A Sbjct: 271 GSVKAEA 277 >gb|ABD96963.1| hypothetical protein [Cleome spinosa] Length = 408 Score = 101 bits (251), Expect = 3e-19 Identities = 55/177 (31%), Positives = 98/177 (55%), Gaps = 1/177 (0%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +W+ L+ L + ++LPN++YL++++ FRMD+S+ +E NL F ++ L + K ++E Sbjct: 123 IWRKLERLHIEQSLPNRMYLMQRVSGFRMDSSRTIEENLDIFQKLLSDLHSLNVKVEEEY 182 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKE-SKSGVKEESMYVR 357 AV LLNSLP +Y+ +R +KY+R T++ V A R KELE+ + + + E + V+ Sbjct: 183 QAVYLLNSLPPAYEQLREVLKYSRATISVEEVKAAARMKELELLAQGTLTRGTGEGLVVK 242 Query: 358 GRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKAQ 528 G+ EK C++CGK GH+K++C + K++ + Sbjct: 243 GKPEKS---------------GGGKKKAKDQVECWYCGKKGHYKKECRSRRAKEETE 284 >ref|XP_006651032.1| PREDICTED: uncharacterized protein LOC102707106 [Oryza brachyantha] Length = 1773 Score = 100 bits (249), Expect = 5e-19 Identities = 62/176 (35%), Positives = 95/176 (53%), Gaps = 2/176 (1%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 LW L+ + +TK L +K++L +KLF ++ + + +LS F IV L + K+D+ED Sbjct: 25 LWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSTFKEIVADLESMEIKYDEED 84 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKE--LEVRKESKSGVKEESMYV 354 L +ILL SLP SY + R+ I Y+RDTLT V DAL +KE ++ S + ES+ V Sbjct: 85 LGLILLCSLPSSYANFRDTILYSRDTLTLQEVYDALHAKEKMKKMVPSEGSNSQPESLVV 144 Query: 355 RGRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQK 522 RGR ++K+ + C +C + GH DC+ L+NK+K Sbjct: 145 RGRQQEKNTD-NKSRDKSSSGYRGRSKSRGRYKSCKYCKRGGHDISDCWKLQNKEK 199 >emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera] Length = 939 Score = 100 bits (249), Expect = 5e-19 Identities = 53/126 (42%), Positives = 80/126 (63%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +W L+ L++ +L N++YL E+L+ F+M + + NL DF+ IV +++ K 
DDED Sbjct: 84 VWNKLEQLYMQNSLSNRLYLKERLYGFKMQEDRSIADNLDDFAKIVLEMSNIGIKVDDED 143 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKSGVKEESMYVRG 360 AV++L SLP Y + + +KY R TLT V ALRSKELE++K+ +G E + +RG Sbjct: 144 KAVLVLKSLPGLYSNFKETMKYGRKTLTLEEVQSALRSKELELKKKGSNG---EGLSIRG 200 Query: 361 RSEKKD 378 R KKD Sbjct: 201 R--KKD 204 >dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana] Length = 1342 Score = 99.4 bits (246), Expect = 1e-18 Identities = 58/186 (31%), Positives = 94/186 (50%), Gaps = 3/186 (1%) Frame = +1 Query: 7 KALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDEDLA 186 K LD LF+ K+LPN+IYL ++L+ ++M + +E N++DF ++ L + DED A Sbjct: 106 KVLDQLFMAKSLPNRIYLKQRLYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQA 165 Query: 187 VILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESK-SGVKEESMYV--R 357 ++LL SLP + ++ +KY + TL + A+RSK LE+ K + ++V R Sbjct: 166 IVLLMSLPRQFDQLKETLKYCKTTLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDR 225 Query: 358 GRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKAQALE 537 GRSE + + C+ CGK GHFK+ CY K + K + Sbjct: 226 GRSETRGKG--------PNKNKSRSKSKGAGKTCWICGKEGHFKKQCYVWKERNKQGSTS 277 Query: 538 PQNNAA 555 + A+ Sbjct: 278 ERGEAS 283 >emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana] gi|7267743|emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] Length = 1230 Score = 99.4 bits (246), Expect = 1e-18 Identities = 65/195 (33%), Positives = 103/195 (52%), Gaps = 12/195 (6%) Frame = +1 Query: 7 KALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDEDLA 186 +ALD L+++K LPN+IYL +KL+S++M + +E N+ +F ++ L + + DED A Sbjct: 99 EALDKLYMSKALPNRIYLKQKLYSYKMQENLSVEGNIDEFLRLIADLENTNVLVSDEDQA 158 Query: 187 VILLNSLPDSYKDVRNAIKY--ARDTLTQTIVIDALRSKELEVRKESKS-GVKEESMYV- 354 ++LL SLP + +++ +KY R TL+ V+ A+ SKELE+ KS + E +YV Sbjct: 159 ILLLMSLPKQFDQLKDTLKYGSGRTTLSVDEVVAAIYSKELELGSNKKSIRGQAEGLYVK 218 Query: 355 -----RGRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYD---LK 510 RG SE+K++ C+ CG+ GHFK C + + Sbjct: 219 
DKPETRGMSEQKEKG----------NKGRSRSRSKGWKGCWICGEEGHFKTSCPNKGKQQ 268 Query: 511 NKQKAQALEPQNNAA 555 NK K QA + AA Sbjct: 269 NKGKDQASGSKGEAA 283 >ref|XP_009107606.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103833265 [Brassica rapa] Length = 3353 Score = 98.2 bits (243), Expect = 3e-18 Identities = 64/199 (32%), Positives = 99/199 (49%), Gaps = 12/199 (6%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +WK+L+ L+ K+LPN+IYL + ++M+ K +E N+ F ++ L DE+ Sbjct: 9 MWKSLEKLYQLKSLPNRIYLKRQFSCYKMEEDKSIEENVDVFLKLIADLESLKVTITDEE 68 Query: 181 LAVILLNSLPDSYKDVRNAIKY--ARDTLTQTIVIDALRSKELEVRKE---SKSGVKEES 345 A+ LL+ LP +Y+ + + ++Y RDTLT + V+ + SKE E+R++ +K E Sbjct: 69 QAIQLLSGLPAAYEQLVHTLQYGTGRDTLTVSEVVTSAYSKEAELRQKGLLNKKKPTSEG 128 Query: 346 MYV--RGRSEKKDQN-----YXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYD 504 +YV RGRS K+ +N Y C+ CGK GH+KRDC Sbjct: 129 LYVESRGRSSKRTENGNNKKYNRSRSRGDRSKSKGKSDKTKKGACFSCGKEGHWKRDC-- 186 Query: 505 LKNKQKAQALEPQNNAALE 561 NK E NA E Sbjct: 187 -PNKGHMNGSEQAVNAVSE 204 Score = 98.2 bits (243), Expect = 3e-18 Identities = 64/199 (32%), Positives = 99/199 (49%), Gaps = 12/199 (6%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +WK+L+ L+ K+LPN+IYL + ++M+ K +E N+ F ++ L DE+ Sbjct: 2084 MWKSLEKLYQLKSLPNRIYLKRQFSCYKMEEDKSIEENVDVFLKLIADLESLKVTITDEE 2143 Query: 181 LAVILLNSLPDSYKDVRNAIKY--ARDTLTQTIVIDALRSKELEVRKE---SKSGVKEES 345 A+ LL+ LP +Y+ + + ++Y RDTLT + V+ + SKE E+R++ +K E Sbjct: 2144 QAIQLLSGLPAAYEQLVHTLQYGTGRDTLTVSEVVTSAYSKEAELRQKGLLNKKKPTSEG 2203 Query: 346 MYV--RGRSEKKDQN-----YXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYD 504 +YV RGRS K+ +N Y C+ CGK GH+KRDC Sbjct: 2204 LYVESRGRSSKRTENGNNKKYNRSRSRGDRSKSKGKSDKTKKGACFSCGKEGHWKRDC-- 2261 Query: 505 LKNKQKAQALEPQNNAALE 561 NK E NA E Sbjct: 2262 -PNKGHMNGSEQAVNAVSE 2279 >emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] Length = 894 Score = 97.8 bits (242), Expect = 3e-18 Identities = 58/194 (29%), Positives = 
102/194 (52%) Frame = +1 Query: 13 LDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDEDLAVI 192 L++L++TK+L N+++ KL++F+M S +E +L F+ I+ L + D +ED A++ Sbjct: 88 LESLYMTKSLANRLHKXIKLYTFKMTPSMSIEEHLDHFNKIILDLKNIDIAVSNEDKAIL 147 Query: 193 LLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKSGVKEESMYVRGRSEK 372 LL SL SY +++ AI Y RD LT V L ++EL ++ESK + E + +RG+S+K Sbjct: 148 LLTSLDASYTNMKEAIMYGRDILTFDEVQSILHARELHKQEESKEEL-GEGLNIRGKSKK 206 Query: 373 KDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKAQALEPQNNA 552 +++ C+ C K GHFK+DC D++ + + + Sbjct: 207 REKK-------KGNNSKSRSKSKTKKFKCFICHKEGHFKKDCPDMRQNTXKKTMNEGDAT 259 Query: 553 ALEAEIDSLHVMTV 594 + D+ V+ V Sbjct: 260 MILDGYDNAGVLNV 273 >ref|XP_007028624.1| Uncharacterized protein TCM_024518 [Theobroma cacao] gi|508717229|gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao] Length = 277 Score = 97.4 bits (241), Expect = 4e-18 Identities = 56/188 (29%), Positives = 100/188 (53%), Gaps = 5/188 (2%) Frame = +1 Query: 1 LWKALDTLFLTKTLPNKIYLLEKLFSFRMDASKDLEANLSDFSIIVKSLAHNDKKFDDED 180 +W L+++++TK+L N++Y+ ++L++ +M + ++ +F+ ++ L + D K +DED Sbjct: 86 MWFKLESIYITKSLTNRLYMKQRLYTLKMSEGTSVNTHIDEFNRVILDLKNIDVKIEDED 145 Query: 181 LAVILLNSLPDSYKDVRNAIKYARDTLTQTIVIDALRSKELEVRKESKSGVKEES----- 345 LA+ILL LP SY++ + + Y RDTLT V L SKEL K+ G++ E+ Sbjct: 146 LALILLCYLPPSYENFVDTMLYGRDTLTFEDVRAYLNSKEL---KKKVGGIRNENQAEGL 202 Query: 346 MYVRGRSEKKDQNYXXXXXXXXXXXXXXXXXXXXXXXCYHCGKLGHFKRDCYDLKNKQKA 525 + RGR ++K + C++CG+ GHF++DC K+ +K Sbjct: 203 VVNRGRGKEKGLD-------------KKGKSRAKGKTCWNCGQKGHFRQDCTKFKDDEKF 249 Query: 526 QALEPQNN 549 E N Sbjct: 250 NKSENTAN 257
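
Note on reading the per-alignment statistics: each alignment block in this report carries a header of the form "Score = ... bits (...), Expect = ... Identities = a/b (..%), Positives = ... [, Gaps = ...] Frame = ...". The following is a minimal Python sketch, not part of the BLAST output, that pulls those fields out of flattened report text such as the above. The names HSP_RE and parse_hsps are hypothetical, the pattern is an assumption based only on the header layout visible in this report, and the sample string is copied from two of the headers above.

```python
import re

# Pattern for one alignment (HSP) header as it appears in this report.
# The "Gaps" field is optional because BLAST omits it for ungapped alignments.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hsps(text):
    """Return one dict per alignment header found in `text`."""
    hsps = []
    for m in HSP_RE.finditer(text):
        ident = int(m["ident"])
        length = int(m["length"])
        evalue = m["evalue"]
        # Legacy BLAST may print very small values as "e-100"; prefix a 1.
        if evalue[0] in "eE":
            evalue = "1" + evalue
        hsps.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "positives": int(m["pos"]),
            "align_len": length,
            "pct_identity": 100.0 * ident / length,
            "gaps": int(m["gaps"] or 0),  # 0 when the Gaps field is absent
            "frame": int(m["frame"]),
        })
    return hsps


# Two headers copied from the report above (AAC62132.1 and CAN71583.1).
sample = (
    "Score = 115 bits (288), Expect = 2e-23 Identities = 64/176 (36%), "
    "Positives = 98/176 (55%), Gaps = 2/176 (1%) Frame = +1 "
    "Score = 109 bits (272), Expect = 1e-21 Identities = 53/163 (32%), "
    "Positives = 92/163 (56%) Frame = +1"
)
hsps = parse_hsps(sample)
```

The sketch reads only the statistics line and does not associate a header with its subject identifier. For real pipelines an established parser, or BLAST's tabular or XML output, is likely a safer choice than a hand-written pattern.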