BLASTX nr result
ID: Papaver31_contig00035058
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00035058
         (2423 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_008349809.1| PREDICTED: uncharacterized protein LOC103413...   237   6e-61
ref|XP_008385055.1| PREDICTED: uncharacterized protein LOC103447...   235   2e-60
emb|CAN81355.1| hypothetical protein VITISV_039158 [Vitis vinifera]   226   1e-56
gb|AAK51235.1|AF287471_1 polyprotein [Arabidopsis thaliana]           211   4e-56
gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Ar...   209   5e-54
ref|XP_008356535.1| PREDICTED: uncharacterized protein LOC103420...   214   6e-54
gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsi...   205   6e-54
gb|AAK62788.1|AC027036_9 polyprotein, putative [Arabidopsis thal...   215   1e-53
emb|CAN68489.1| hypothetical protein VITISV_037543 [Vitis vinifera]   215   2e-53
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   213   5e-52
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi...   199   4e-51
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha...   199   1e-50
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]         207   4e-50
dbj|BAK41512.1| C-end truncated polyprotein [Arabidopsis thaliana]    201   1e-49
emb|CAN77295.1| hypothetical protein VITISV_005638 [Vitis vinifera]   204   3e-49
gb|KHN36156.1| Retrovirus-related Pol polyprotein from transposo...   192   2e-48
gb|KHN22040.1| Retrovirus-related Pol polyprotein from transposo...   192   2e-48
emb|CAB40035.1| retrotransposon like protein [Arabidopsis thalia...   199   1e-47
gb|KFK44388.1| hypothetical protein AALP_AA1G251100, partial [Ar...
196 8e-47 emb|CAN81099.1| hypothetical protein VITISV_017741 [Vitis vinifera] 192 1e-45 >ref|XP_008349809.1| PREDICTED: uncharacterized protein LOC103413096 [Malus domestica] Length = 954 Score = 237 bits (604), Expect(2) = 6e-61 Identities = 170/591 (28%), Positives = 259/591 (43%), Gaps = 12/591 (2%) Frame = -2 Query: 1798 QYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLGK 1619 +Y+ + +DG+ FL ++ + +W DQ + +N+T+S + +G Sbjct: 72 RYKLLGVIDGTDVCPSPFLPDRSINXAFE---DWYEKDQNLLIWLNSTLSEEIIPFTVGV 128 Query: 1618 SIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPV 1439 S +R+ W+ L + F A QLR ++ S+Q+G SI DYL +LK ISDSL G V Sbjct: 129 SSSRELWVKLEQRFGGISEAHIHQLRSRJQSVQKGSRSISDYLQELKEISDSLQAAGASV 188 Query: 1438 QDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLIA 1259 D DL+ L GL E+ F+ + R S +L L+ E + ++ SP+ Sbjct: 189 SDRDLIAAILHGLPDEFESFIDCIMLRLSSTSLDELHGLLLTKELSMARRKTVSSSPV-- 246 Query: 1258 DPSNSAFFVRKQQ------NTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGF 1097 S AF V+ Q + F F Sbjct: 247 PESFQAFSVQSQXPXLPTPSAFAAQNXPLXSASRFNSNRGRNTKGQFFSNRGHRGNRGNF 306 Query: 1096 QFKKGE-----FNPNLTVDYGEIPSQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFE 932 +G F N + ++ QIC H A + R P + P K A Sbjct: 307 PNNRGNRGYQGFRSNQXSSHFKVLCQICGSTSHEAIDCFDRMNPDICGRIPPAKLAAMCA 366 Query: 931 NALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDA 752 HS P++ W+ DSGATSH+TN+ A +T Sbjct: 367 Q----HSAKPSQP--------------------------WLIDSGATSHITNDVANLTSP 396 Query: 751 VEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLE 572 + G+++ +GDGK SF L NVL+VPHI HNL S F + Sbjct: 397 TPYTGEDKVYIGDGKGLSILNVGSSTLHT--SHNSFQLRNVLHVPHITHNLLSAYQFVND 454 Query: 571 NSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXXX 392 N+CS P+G +K S K+L RGP YP++SS+ + +A + + Sbjct: 455 NNCSLTLDPYGSYVKDRISGKMLLRGPVRDGFYPLQSSSNLHPLSPTALLSIKAPVTI-- 512 Query: 391 XXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNK-VFESVCTSCQLGKSHSLPFLVSPS 215 W++RLGHP S++ L ++ + L K + C+ C L K+H LPF + S Sbjct: 513 
--------WHKRLGHPSSSIFRRLLSSNNLALQGKSTVDFFCSDCALAKNHKLPFKAATS 564 Query: 214 RACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESL 62 L L+HCD+WGPA S GF+Y+++ VDD+SKY+W FP+K +S+ + Sbjct: 565 STTHSLQLLHCDLWGPASITSSSGFQYYLLIVDDYSKYSWFFPLKSKSDGI 615 Score = 28.1 bits (61), Expect(2) = 6e-61 Identities = 14/49 (28%), Positives = 23/49 (46%) Frame = -3 Query: 1920 NHNYVYRFTPLPFPNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 N + + +T L N+ + V KL SN+L W+ I L G++ Sbjct: 31 NPSXLTSYTSLTIHNIGSMVPIKLKRSNYLPWRALFGPIFRRYKLLGVI 79 >ref|XP_008385055.1| PREDICTED: uncharacterized protein LOC103447646 [Malus domestica] Length = 727 Score = 235 bits (600), Expect(2) = 2e-60 Identities = 174/617 (28%), Positives = 277/617 (44%), Gaps = 19/617 (3%) Frame = -2 Query: 1798 QYQFVRFVDGS-IEPQPQFLNH--NNVPVVYPIYLEWRTLDQFVGSCINATISPSLATEL 1628 +Y+ +DGS + P P L+ N P + W DQ + +N+T+S L Sbjct: 68 RYKLTGILDGSEVCPSPFLLDASGNTTSTPNPAFDLWYEKDQNILIWLNSTLSEDLIPFT 127 Query: 1627 LGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIG 1448 +G + +R+ WL+L + F A QLR +LH++Q+ I +Y+ +KTI D+L G Sbjct: 128 VGVTSSRELWLNLKQRFGGVSAAHIHQLRSRLHTVQKRDLIISNYIQLIKTIYDALMATG 187 Query: 1447 EPVQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSP 1268 P+ + DL++ TL+GL +Y FV ++ R S +L LIN E ++ +++ I + Sbjct: 188 APLSESDLIVVTLNGLSEDYESFVDSIMLRISSTSLDELHGLLINKELFM-NRKKKIVAS 246 Query: 1267 LIADP--SNSAFFVRKQ------------QNTFXXXXXXXXXXXXXXXXXXXXXXXXXXX 1130 +++P + +A + Q QN + Sbjct: 247 SVSEPFQAYAAQYQHSQAPLLPTPQGHPGQNLYISAPRQFNRGKGTYMGNNYRGNNNYRG 306 Query: 1129 XXXXXXXXPGFQFKKGEFNPNL-TVDYGEIPSQICNKKGHFANTFYYRYVPSMNNSPPMQ 953 F G + + T P QIC+ H A + R + P Sbjct: 307 NNYRGNSRGNFNRNSGSYTRHSGTTTSHRDPCQICHSPDHEALDCFERMNHAFAGKIPPA 366 Query: 952 KAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNN 773 K A + ++ S+SP W+ DSGATSH+TN+ Sbjct: 367 KLAAMCAHTIK--SFSPT----------------------------WLMDSGATSHITND 396 Query: 772 TAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFS 593 + + + G ++ +G+G+ +F L+NVL+VP +KHNLFS Sbjct: 397 
ISAIHSPTNYNGQDKVYIGNGQGMLIHHTGTTFLTTP--TATFRLNNVLHVPAMKHNLFS 454 Query: 592 IANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSN 413 F +N G +IK S +L R P YP + +T+ SA + S Sbjct: 455 AYQFLRDNHYKLTLDSDGSKIKDCISGMMLFRRPIKDGFYPFQ-GITPASTSPSALVCSK 513 Query: 412 SNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFES-VCTSCQLGKSHSL 236 + +W+ RLGHP S + + I S+K F S C+ C +GK+H L Sbjct: 514 A----------PLQIWHNRLGHPSSAIFRKTLNSSTIVYSDKKFTSFFCSDCAIGKNHKL 563 Query: 235 PFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNC 56 PF S S PL LVHCD+WGP P++S G+KY+++FVD+F+KY+W+FP+K +SE + Sbjct: 564 PFTTSISFVSVPLELVHCDVWGPTPTLSLSGYKYYVLFVDEFTKYSWMFPLKLKSEVYSV 623 Query: 55 FMLFKSLMENLLEFKKK 5 F+ FK +ENL+ K K Sbjct: 624 FVNFKCYVENLVGNKIK 640 Score = 27.7 bits (60), Expect(2) = 2e-60 Identities = 14/35 (40%), Positives = 19/35 (54%) Frame = -3 Query: 1878 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 ++S V TKL N+LVWK + I L G+L Sbjct: 41 HISCMVPTKLKRDNYLVWKALFAPIFRRYKLTGIL 75 >emb|CAN81355.1| hypothetical protein VITISV_039158 [Vitis vinifera] Length = 1402 Score = 226 bits (575), Expect(2) = 1e-56 Identities = 169/600 (28%), Positives = 270/600 (45%), Gaps = 5/600 (0%) Frame = -2 Query: 1789 FVRFVDG-SIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLGKSI 1613 F F+DG S+ P+ + + P ++ R D+ + S I ++++P + +++G + Sbjct: 58 FEDFIDGTSVCPEKELRPGE----INPAFVAXRRQDRTILSWIYSSLTPGIMAQIIGHNS 113 Query: 1612 ARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQD 1433 + W L KIF+ AR QL + S ++G S+ DY+ ++K +DSLA IGEPV + Sbjct: 114 SHSAWNALEKIFSSCSRARIMQLXLEFQSTKKGSMSMIDYIMKVKGAADSLAAIGEPVSE 173 Query: 1432 DDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLI--- 1262 D +M L GLGS+Y V + RE +S + S L+ EQ L+ Q + P + Sbjct: 174 QDQIMNLLGGLGSDYNAVVTAINIREDKISLEAVHSMLLAFEQRLEQQGSIEQLPAMSAN 233 Query: 1261 -ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKK 1085 A SN+ RK G + Sbjct: 234 YASXSNNRGGGRKYNG----------------------GRGPNFMMTNSNFRGRGRGXRY 271 Query: 1084 
GEFNPNLTVDYGEIPSQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFENALRFHSWS 905 G+ + Q+C K GH Y+R+ + ++ ++ N+ + Sbjct: 272 GQSGRQNSSSSERPQCQLCGKFGHTVQVCYHRFDITFQSTQNNTTGVSNSGNS----NXM 327 Query: 904 PAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQA 725 PA S N W DSGA+ H+T N A +T+A + G ++ Sbjct: 328 PA----------------MVAXSNNXADDNWYLDSGASHHLTQNVANLTNATPYTGADKV 371 Query: 724 MVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFP 545 +G+GK SF L V +VP I NL S+A F +N+ EF Sbjct: 372 TIGNGKHLTISNTXFTRLFS--NPHSFQLKKVFHVPFISANLISVAKFCSDNNALIEFHS 429 Query: 544 WGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXXXXXXXXXXLW 365 G+ +K L +++VLA+G + LY + ++ + ++N +T LW Sbjct: 430 NGFFLKDLHTKRVLAQGKLENGLYKFPVISNKKTAYVG---ITNDSTFQCSNIENKRELW 486 Query: 364 NQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVH 185 + RLGH + ++ + + K +VC+SCQL KSH LP +S A +PL LV+ Sbjct: 487 HHRLGHAATDIVTRIMHNCNVSCG-KYKATVCSSCQLAKSHRLPTHLSSFHASKPLELVY 545 Query: 184 CDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSLMENLLEFKKK 5 DIWGPA S G KYFI+FVDD+S+Y W++ ++ + ++L F FK +EN E K K Sbjct: 546 TDIWGPASVTSTSGAKYFILFVDDYSRYTWLYLLQSKDQALPIFKXFKLQVENQFEAKIK 605 Score = 25.0 bits (53), Expect(2) = 1e-56 Identities = 6/20 (30%), Positives = 16/20 (80%) Frame = -3 Query: 1854 KLDGSNFLVWKDQLSSILIS 1795 KLD +N+++W+ Q+ +++ + Sbjct: 36 KLDRTNYILWRSQIDNVIFA 55 >gb|AAK51235.1|AF287471_1 polyprotein [Arabidopsis thaliana] Length = 1453 Score = 211 bits (537), Expect(2) = 4e-56 Identities = 161/608 (26%), Positives = 267/608 (43%), Gaps = 11/608 (1%) Frame = -2 Query: 1795 YQFVRFVDGSIEPQPQFLN----HNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATEL 1628 ++ + FV+G I P P+ LN +V V P Y W DQ + S + T+S + + Sbjct: 40 HKLIGFVNGGITPPPRTLNVVTGDTSVDVANPQYESWFCTDQLIRSWLFGTLSEEVLGYV 99 Query: 1627 LGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIG 1448 +RD W+ L++ F + AR+ LR L + + ++ Y + + D+L+ IG Sbjct: 100 HNLQTSRDIWISLAENFNKSSVAREFTLRRTLQLLSKKDKTLSAYCREFIAVCDALSSIG 159 Query: 1447 
EPVQDDDLVMYTLSGLGSEYAHFVITMQN---REVPLSFAKLRSRLINHEQWLKDQENAI 1277 +PV + + L+GLG EY +Q+ + P +F + S + + L+ E ++ Sbjct: 160 KPVDESMKIFGFLNGLGREYDPITTVIQSSLSKISPPTFRDVISEVKGFDVKLQSYEESV 219 Query: 1276 YSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGF 1097 A+P + AF ++ + T G Sbjct: 220 ----TANP-HMAFNTQRSEYT----------DNYTSGNRGKGRGGYGQNRGRSGYSTRGR 264 Query: 1096 QFKKGEFNPNLTVDYGEIP-SQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFENALR 920 F + + N N T GE P QIC + GH A Y R+ + S + A Sbjct: 265 GFSQHQTNSNNT---GERPVCQICGRTGHTALKCYNRF----------DHNYQSVDTAQA 311 Query: 919 FHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFI 740 F S ++S G W+PDS AT+H+T++T + A + Sbjct: 312 FSSLRVSDS----------------------SGKEWVPDSAATAHVTSSTNNLQAASPYN 349 Query: 739 GDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCS 560 G + +VGDG +G + L+ VL P I+ +L S++ + C Sbjct: 350 GSDTVLVGDGAYLPITHVGSTTISSDSG--TLPLNEVLVCPDIQKSLLSVSKLCDDYPCG 407 Query: 559 YEFFPWGYEIKSLPSRKVLARGPNASELYPIKSS---ALRRNTALSASIMSNSNTXXXXX 389 F I + ++KV+++GP ++ LY +++ A N +AS Sbjct: 408 VYFDANKVCIIDINTQKVVSKGPRSNGLYVLENQEFVAFYSNRQCAAS------------ 455 Query: 388 XXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRA 209 +W+ RLGH S ++ L ++ I + VC CQ+GKS L F S SR Sbjct: 456 ----EEIWHHRLGHSNSRILQQLKSSKEISFNKSRMSPVCEPCQMGKSSKLQFFSSNSRE 511 Query: 208 CQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSLME 29 L +HCD+WGP+P +S GFKY++VFVDD+S+Y+W +P+K +S+ F+ F++L+E Sbjct: 512 LDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFYPLKAKSDFFAVFVAFQNLVE 571 Query: 28 NLLEFKKK 5 N K K Sbjct: 572 NQFNTKIK 579 Score = 37.7 bits (86), Expect(2) = 4e-56 Identities = 19/43 (44%), Positives = 27/43 (62%), Gaps = 3/43 (6%) Frame = -3 Query: 1893 PLPFPN---VSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 P PFP+ VS+ V+ KL+ SN+L+WK Q S+L L G + Sbjct: 4 PYPFPDNVHVSSSVTLKLNDSNYLLWKTQFESLLSCHKLIGFV 46 >gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Arabidopsis thaliana] Length = 1522 Score = 209 bits (532), Expect(2) = 5e-54 Identities = 159/606 
(26%), Positives = 267/606 (44%), Gaps = 14/606 (2%) Frame = -2 Query: 1780 FVDGSIEP--QPQFLNHNNVPVVYPI--YLEWRTLDQFVGSCINATISPSLATELLGKSI 1613 FV GSI Q + + HNNV P + W DQ V S + + + + + ++ Sbjct: 43 FVTGSISAPAQTRSVTHNNVTSEEPNPEFYTWHQTDQVVKSWLLGSFAEDILSVVVNCFT 102 Query: 1612 ARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQD 1433 + WL L+ F + +R +L+ +L ++++ ++ +L LK I D LA +G PV + Sbjct: 103 SHQVWLTLANHFNRVSSSRLFELQRRLQTLEKKDNTMEVFLKDLKHICDQLASVGSPVPE 162 Query: 1432 DDLVMYTLSGLGSEYAHFVITMQNR---EVPLSFAKLRSRLINHEQWLKDQENAIYSPLI 1262 + L+GLG EY T++N LS ++ S+L ++ L ++ + P I Sbjct: 163 KMKIFSALNGLGREYEPIKTTIENSVDSNPSLSLDEVASKLRGYDDRL---QSYVTEPTI 219 Query: 1261 ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKG 1082 + + AF V + + F + Sbjct: 220 SP--HVAFNVTHSDSGYYHNNNRGKGRSNSGSGKS------------------SFSTRGR 259 Query: 1081 EFNPNLTVDYGE------IPSQICNKKGHFANTFYYRYVPSMNNSP-PMQKAFASFENAL 923 F+ ++ G + QIC K GH A ++R+ S + PM A + Sbjct: 260 GFHQQISPTSGSQAGNSGLVCQICGKAGHHALKCWHRFDNSYQHEDLPMALATMRITDVT 319 Query: 922 RFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEF 743 H G WIPDS A++H+TNN ++ + + Sbjct: 320 DHH------------------------------GHEWIPDSAASAHVTNNRHVLQQSQPY 349 Query: 742 IGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSC 563 G + MV DG +G L VL P I +L S++ T + C Sbjct: 350 HGSDSIMVADGNFLPITHTGSGSIASSSG--KIPLKEVLVCPDIVKSLLSVSKLTSDYPC 407 Query: 562 SYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXXXXXX 383 S EF I ++K+L G N LY ++ L+ + S NS + Sbjct: 408 SVEFDADSVRINDKATKKLLVMGRNRDGLYSLEEPKLQ----VLYSTRQNSASSEV---- 459 Query: 382 XXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQ 203 W++RLGH + V++ L ++ +I + NKV ++VC +C LGKS LPF++S A + Sbjct: 460 -----WHRRLGHANAEVLHQLASSKSIIIINKVVKTVCEACHLGKSTRLPFMLSTFNASR 514 Query: 202 PLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSLMENL 23 PL +HCD+WGP+P+ S GF+Y++VF+D +S++ W +P+K +S+ + F++F+ L+EN Sbjct: 515 PLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKSDFFSTFVMFQKLVENQ 574 Query: 22 
LEFKKK 5 L K K Sbjct: 575 LGHKIK 580 Score = 32.7 bits (73), Expect(2) = 5e-54 Identities = 14/39 (35%), Positives = 22/39 (56%) Frame = -3 Query: 1890 LPFPNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 +P N+SN V+ L+ N+++WK Q S L L G + Sbjct: 6 VPPLNISNCVTVTLNQQNYILWKSQFESFLSGQGLLGFV 44 >ref|XP_008356535.1| PREDICTED: uncharacterized protein LOC103420252 [Malus domestica] Length = 1312 Score = 214 bits (544), Expect(2) = 6e-54 Identities = 172/623 (27%), Positives = 264/623 (42%), Gaps = 25/623 (4%) Frame = -2 Query: 1798 QYQFVRFVDGSIEPQPQ-FLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLG 1622 +Y+ + VDG+ EP P FL ++ P + +W DQ + N+T+S + +G Sbjct: 66 RYKLLGIVDGT-EPCPSPFLPDRSIN---PHFEQWYEKDQNLLIWFNSTLSEEIIPFTVG 121 Query: 1621 KSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEP 1442 S ARD WL L + F A QLR +L +IQ+G S+ DYL Q+K ISDSL G Sbjct: 122 VSSARDLWLKLEQRFGGVSDAHIHQLRSKLQNIQKGSQSMADYLQQIKEISDSLTAAGAS 181 Query: 1441 VQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYS--- 1271 V D DL+ TL+GL ++ F ++ R S +L L+ E ++ ++ + S Sbjct: 182 VTDRDLIAATLAGLTDDFESFTDSILLRLSSTSLDELHGLLLTKELSMERRKKSSSSEPF 241 Query: 1270 ---------PLIADPSNSAFFVRKQ-----QNTFXXXXXXXXXXXXXXXXXXXXXXXXXX 1133 PL+ P A QN+F Sbjct: 242 HAFSVQSQAPLLPTPPPHALVAPNPGASPLQNSFRYNSTRSYTRGSNRGFSRGSNRNYNR 301 Query: 1132 XXXXXXXXXPGFQFKKGEFNPNL----TVDYGEIPSQICNKKGHFANTFYYRYVPSMNN- 968 F +G +N + + QIC H A + R P ++ Sbjct: 302 GSNRGNFNSG---FNRGSYNSGFNRPASSSGHKTSCQICGSTSHEALDCFDRMNPEISGK 358 Query: 967 -SPPMQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGAT 791 SP A + A +SW + DSGAT Sbjct: 359 FSPAKLAAMCAHYTAKSSNSW--------------------------------LIDSGAT 386 Query: 790 SHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHI 611 SH+TN+ + + + G+++ +GDGK SF LHNVL+VP + Sbjct: 387 SHITNDISNIQSPTPYHGEDKVYIGDGKGLSIDHIGTSILHTPA--HSFKLHNVLHVPQM 444 Query: 610 KHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALS 431 +H+L S F +N CS G +K + + L RG +P+ S + Sbjct: 445 
QHSLLSAYQFIKDNBCSLTLDINGSSVKDRFTGRTLLRGQVKDGFFPLHGSP-------A 497 Query: 430 ASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQL-SNKVFESVCTSCQL 254 S +S+S T +W+ RLGHP S + + +T + + C C L Sbjct: 498 LSTVSHSPT-ALVSTAANVRIWHSRLGHPSSAIFRKVLSTNKVVVHGTSSLAFFCKDCAL 556 Query: 253 GKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCR 74 K+H LPF S + L L+HCD+WGP+P +S G++Y+++ VDD+SKY+W FP+K + Sbjct: 557 AKNHKLPFGSPQSVSTASLELLHCDVWGPSPVVSVSGYRYYLLIVDDYSKYSWYFPLKSK 616 Query: 73 SESLNCFMLFKSLMENLLEFKKK 5 S + F+ FKS +EN + K K Sbjct: 617 SSVFSIFVDFKSYVENAIGNKIK 639 Score = 27.7 bits (60), Expect(2) = 6e-54 Identities = 15/49 (30%), Positives = 23/49 (46%) Frame = -3 Query: 1920 NHNYVYRFTPLPFPNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 N N + L N+ + V KL SN+L W+ + IL L G++ Sbjct: 25 NLNTSQMYHSLTIQNIGSMVPIKLRRSNYLPWRALFAPILRRYKLLGIV 73 >gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1149 Score = 205 bits (522), Expect(2) = 6e-54 Identities = 158/573 (27%), Positives = 259/573 (45%), Gaps = 5/573 (0%) Frame = -2 Query: 1714 PIYLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQ 1535 P Y W DQ V +S + + ++G + + W++L+K F + +R +L+ + Sbjct: 74 PDYQAWFRSDQVV-------MSEDILSVVVGSKTSHEVWMNLAKHFNRISSSRIFELQRR 126 Query: 1534 LHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNRE 1355 LHS+ + ++ +YL LKTI D LA +G PV + + + GL EY + +++ Sbjct: 127 LHSLSKEGKTMEEYLRYLKTICDQLASVGSPVAEKMKIFAMVHGLTREYEPLITSLEGTL 186 Query: 1354 VPL---SFAKLRSRLINHEQWLKDQENAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXX 1184 S+ + RL N + L+ SP +A NTF Sbjct: 187 DAFPGPSYEDVVYRLKNFDDRLQGYTVTDVSPHLAF------------NTFRSSNRGRGG 234 Query: 1183 XXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPS-QICNKKGHFA 1007 G F++ + + +V E P QIC K+GH+A Sbjct: 235 RNNRGKGNFSTR---------------GRGFQQQFSSSSSSVSASEKPMCQICGKRGHYA 279 Query: 1006 NTFYYRYVPSMNNSPPMQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCND 827 ++R+ S +S AF+ AL S +D Sbjct: 280 LQCWHRFDDSYQHSEAAAAAFS----ALHITDVS------------------------DD 311 Query: 826 
QGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTS 647 G W+PDS AT+H+TNN++ + ++G++ M DG +G+ Sbjct: 312 SG--WVPDSAATAHITNNSSRLQQMQPYLGNDTVMASDGNFLPITHIGSANLPSTSGN-- 367 Query: 646 FDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASE-LYP 470 L +VL P+I +L S++ T + CS+ F G +K + KVL +G + SE LY Sbjct: 368 LPLKDVLVCPNIAKSLLSVSKLTKDYPCSFTFDADGVLVKDKATCKVLTKGSSTSEGLYK 427 Query: 469 IKSSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSN 290 +++ + + ++ +W+ RLGHP V+ L AIQ+ N Sbjct: 428 LENPKFQMFYSTRQVKATDE-------------VWHMRLGHPNPQVLQLLANKKAIQI-N 473 Query: 289 KVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDF 110 K +C SC+LGKS LPF+ S A +PL VHCD+WGPAP S GF+Y+++F+D+ Sbjct: 474 KSTSKMCESCRLGKSSRLPFIASDFIASRPLERVHCDLWGPAPVSSIQGFQYYVIFIDNR 533 Query: 109 SKYNWIFPMKCRSESLNCFMLFKSLMENLLEFK 11 S++ W +P+K +S+ + FM F+S +ENLL+ K Sbjct: 534 SRFCWFYPLKHKSDFCSLFMKFQSFVENLLQTK 566 Score = 36.2 bits (82), Expect(2) = 6e-54 Identities = 16/39 (41%), Positives = 22/39 (56%) Frame = -3 Query: 1890 LPFPNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 LP N+SN V+ KL N+++WK Q S L L G + Sbjct: 10 LPSLNISNCVTVKLTDRNYILWKSQFESFLSGQGLLGFV 48 >gb|AAK62788.1|AC027036_9 polyprotein, putative [Arabidopsis thaliana] gi|18265373|dbj|BAB84015.1| polyprotein [Arabidopsis thaliana] Length = 1466 Score = 215 bits (547), Expect(2) = 1e-53 Identities = 156/598 (26%), Positives = 261/598 (43%), Gaps = 6/598 (1%) Frame = -2 Query: 1801 DQYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLG 1622 D Y+ F+DGS P + + P V P Y W+ D+ + S + IS S+ + Sbjct: 43 DGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSR 102 Query: 1621 KSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEP 1442 + A W L KI+ + +QLR QL +G +I DY+ L T D LA +G+P Sbjct: 103 ATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKP 162 Query: 1441 VQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLI 1262 + D+ V L L EY + + ++ P + ++ RL+NHE + +A P+ Sbjct: 163 
MDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKILAVSSATVIPIT 222 Query: 1261 ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKG 1082 A+ + N +Q Sbjct: 223 ANAVSHRNTTTTNNNN-------------------NGNRNNRYDNRNNNNNSKPWQQSST 263 Query: 1081 EFNPNLTVDYGEIPS-QICNKKGHFAN--TFYYRYVPSMNNSPPMQKAFASFENALRFHS 911 F+PN + QIC +GH A + ++ S+N+ P F Sbjct: 264 NFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSP----------FTP 313 Query: 910 WSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDE 731 W P ++A +S N+ W+ DSGAT H+T++ ++ + G + Sbjct: 314 WQPRANLALGSP-----------YSSNN----WLLDSGATHHITSDFNNLSLHQPYTGGD 358 Query: 730 QAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEF 551 MV DG +LHN+LYVP+I NL S+ N S EF Sbjct: 359 DVMVADGSTIPISHTGSTSLSTK--SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEF 416 Query: 550 FPWGYEIKSLPSRKVLARGPNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXX 377 FP +++K L + L +G ELY PI SS + +L AS S + Sbjct: 417 FPASFQVKDLNTGVPLLQGKTKDELYEWPIASS---QPVSLFASPSSKAT---------- 463 Query: 376 XXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESV-CTSCQLGKSHSLPFLVSPSRACQP 200 W+ RLGHP +++N++ + ++ + N + + C+ C + KS+ +PF S + +P Sbjct: 464 HSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRP 523 Query: 199 LSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSLMEN 26 L ++ D+W +P +SH ++Y+++FVD F++Y W++P+K +S+ F+ FK+L+EN Sbjct: 524 LEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLEN 580 Score = 25.8 bits (55), Expect(2) = 1e-53 Identities = 13/35 (37%), Positives = 20/35 (57%) Frame = -3 Query: 1878 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 N+SN TKL +N+L+W Q+ ++ L G L Sbjct: 19 NMSNV--TKLTSTNYLMWSRQVHALFDGYELAGFL 51 >emb|CAN68489.1| hypothetical protein VITISV_037543 [Vitis vinifera] Length = 1449 Score = 215 bits (548), Expect(2) = 2e-53 Identities = 147/568 (25%), Positives = 259/568 (45%), Gaps = 5/568 (0%) Frame = -2 Query: 1714 PIYLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQ 1535 P ++ WR D+ + S I ++++P + +++G + W L F AR QLR + Sbjct: 144 
PDFVMWRRFDRMILSWIYSSLTPEIMGQIVGYQSSHAXWFALEXXFXASSRARVMQLRLE 203 Query: 1534 LHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNRE 1355 + ++G ++ +Y+ +LK+++D+LA IGEPV D D ++ L GLG++Y V ++ RE Sbjct: 204 FQTTRKGSLTMMEYILKLKSLADNLAAIGEPVTDRDQILQLLGGLGADYNSIVASLTARE 263 Query: 1354 VPLSFAKLRSRLINHEQWLKDQENAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXX 1175 ++ E+ + S +A P F ++ Sbjct: 264 ---------------DEDNSVAEDNVISANLATPQYQHFNNKRSSGQ------------- 295 Query: 1174 XXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPSQICNKKGHFANTFY 995 GF ++G Q+C K GH Y Sbjct: 296 --------------------NRQSGFNTRRGTNGGRSQSSQHRPQCQLCGKFGHTVVRCY 335 Query: 994 YRYVPSMNNSPP----MQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCND 827 +R+ + P +Q + +N ++ SP+ + +D Sbjct: 336 HRFDINFQGYNPNMDTVQTNKPNAKNQVQAMMASPS--------------------TISD 375 Query: 826 QGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTS 647 + W D+GAT H++ + ++D ++G+++ +VG+GK + Sbjct: 376 EA--WFFDTGATHHLSQSIDPLSDVQPYMGNDKVIVGNGKHLRILHTGTTFFPS--SSKT 431 Query: 646 FDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPI 467 F L VL+VP I NL S++ F +N+ +EF P + +K ++K+L +G LY Sbjct: 432 FQLRQVLHVPDIATNLISVSQFCADNNTFFEFHPRFFFVKDQVTKKILLQGSLEHGLYRF 491 Query: 466 KSSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNK 287 + + A +S S+ W+ RLGHP ++ ++ T+ +S++ Sbjct: 492 PARFVPSPAAFVSSSYDRSSNLSLTTTTTL---WHSRLGHPADNILKHILTS--CNISHQ 546 Query: 286 VFES-VCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDF 110 ++ VC +CQ KSH LPF V SRA PL+L+H D+WGP S G +YFI+FVDDF Sbjct: 547 CHKNNVCCACQFAKSHKLPFNVXVSRASHPLALLHADLWGPXSIPSTTGARYFILFVDDF 606 Query: 109 SKYNWIFPMKCRSESLNCFMLFKSLMEN 26 S+++WI+P+ + ++L+ F+ FKSL+EN Sbjct: 607 SRFSWIYPLHSKDQALSVFIKFKSLVEN 634 Score = 24.6 bits (52), Expect(2) = 2e-53 Identities = 9/34 (26%), Positives = 22/34 (64%), Gaps = 4/34 (11%) Frame = -3 Query: 1854 KLDGSNFLVWKDQLSSILIST----NLYGLLMVP 1765 KLD +N+++W+ Q+ +++ + ++ GL + P Sbjct: 100 KLDRNNYILWRTQMENVVFANGFEDHIEGLKICP 133 >gb|AAD21687.1| Strong similarity to gi|3600044 
T12H20.12 protease homolog from Arabidopsis thaliana BAC gb|AF080119 and is a member of the reverse transcriptase family PF|00078 [Arabidopsis thaliana] Length = 1415 Score = 213 bits (543), Expect = 5e-52 Identities = 165/611 (27%), Positives = 265/611 (43%), Gaps = 11/611 (1%) Frame = -2 Query: 1804 IDQYQFVRFVDGSIEPQPQFLNHNNVPVVY----PIYLEWRTLDQFVGSCINATISPSLA 1637 + + + FV+G++ Q N V P+Y W DQ V S + T+S + Sbjct: 37 LSSQKLIGFVNGAVNAPSQSRLVVNGEVTSEEPNPLYESWFCTDQLVRSWLFGTLSEEVL 96 Query: 1636 TELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLA 1457 + S +R W+ L++ F + AR+ LR L + + Y + KTI D+L+ Sbjct: 97 GHVHNLSTSRQIWVSLAENFNKSSVAREFSLRQNLQLLSKKEKPFSVYCREFKTICDALS 156 Query: 1456 EIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINH---EQWLKDQE 1286 IG+PV + + L+GLG +Y +Q+ S +KL + N E D + Sbjct: 157 SIGKPVDESMKIFGFLNGLGRDYDPITTVIQS-----SLSKLPTPTFNDVVSEVQGFDSK 211 Query: 1285 NAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1106 Y + + AF + + ++ Sbjct: 212 LQSYEEAASVTPHLAFNIERSES-----------GSPQYNPNQKGRGRSGQNKGRGGYST 260 Query: 1105 PGFQFKKGEFNPNLTVDYGEIP-SQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFEN 929 G F + + +P ++ G P QIC + GH A Y R+ NN +AF++ Sbjct: 261 RGRGFSQHQSSPQVS---GPRPVCQICGRTGHTALKCYNRFD---NNYQAEIQAFSTLRV 314 Query: 928 ALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAV 749 + +D G W PDS AT+H+T++T + A Sbjct: 315 S-------------------------------DDTGKEWHPDSAATAHVTSSTNGLQSAT 343 Query: 748 EFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLEN 569 E+ GD+ +VGDG G L+ VL VP+I+ +L S++ + Sbjct: 344 EYEGDDAVLVGDGTYLPITHTGSTTIKSSNG--KIPLNEVLVVPNIQKSLLSVSKLCDDY 401 Query: 568 SCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSS---ALRRNTALSASIMSNSNTXX 398 C F I L ++KV+ GP + LY +++ AL N +A+ Sbjct: 402 PCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQCAAT--------- 452 Query: 397 XXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSP 218 +W+ RLGH S + +L + AIQ++ VC CQ+GKS LPFL+S Sbjct: 453 -------EEVWHHRLGHANSKALQHLQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLISD 505 Query: 217 
SRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKS 38 SR PL +HCD+WGP+P +S+ G KY+ +FVDD+S+Y+W +P+ +SE L+ F+ F+ Sbjct: 506 SRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSWFYPLHNKSEFLSVFISFQK 565 Query: 37 LMENLLEFKKK 5 L+EN L K K Sbjct: 566 LVENQLNTKIK 576 >gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1402 Score = 199 bits (506), Expect(2) = 4e-51 Identities = 157/608 (25%), Positives = 253/608 (41%), Gaps = 12/608 (1%) Frame = -2 Query: 1798 QYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLGK 1619 Q V +DGS P P Y W D+ V S + + + + ++ Sbjct: 53 QTSVVSDIDGSTSASPN-----------PEYYTWFKTDRVVKSWLLGSFLEDILSVVVNC 101 Query: 1618 SIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPV 1439 + + + W+ ++ F + +R +L+ +L ++ + S+ +YL LKTI D LA +G PV Sbjct: 102 NTSHEVWISVANHFNRVSSSRLFELQRRLQNVSKRDKSMDEYLKDLKTICDQLASVGSPV 161 Query: 1438 QDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRS---RLINHEQWLKDQ-ENAIYS 1271 + + L+GLG EY T++N L L +L ++ L+ E S Sbjct: 162 TEKMKIFAALNGLGREYEPIKTTIENSMDALPGPSLEDVIPKLTGYDDRLQGYLEETAVS 221 Query: 1270 PLIA------DPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1109 P +A D SN++ + Sbjct: 222 PHVAFNITTSDDSNASGYFNAYNR---------------------GKGKSNRGRNSFSTR 260 Query: 1108 XPGFQFKKGEFNPNLTVDYG--EIPSQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASF 935 GF + N + G + QIC K GH P K + F Sbjct: 261 GRGFHQQISSTNSSSGSQSGGTSVVCQICGKMGH-----------------PALKCWHRF 303 Query: 934 ENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTD 755 N+ ++ A + + G W+PDS AT+H+TN+ + Sbjct: 304 NNSYQYEELPRALAAMRITDIT------------DQHGNEWLPDSAATAHVTNSPRSLQQ 351 Query: 754 AVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTL 575 + + G + MV DG +G+ L +VL P I +L S++ T Sbjct: 352 SQPYHGSDAVMVADGNFLPITHTGSTNLASSSGNVP--LTDVLVCPSITKSLLSVSKLTQ 409 Query: 574 ENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXX 395 + C+ EF G I ++K+L G LY +K + + S S S+ Sbjct: 410 DYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLYCLKDDS-QFKAFFSTRQQSASDEV-- 466 Query: 394 
XXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPS 215 W++RLGHP V+ L T +I + NK +S+C +CQLGKS LPF+ S Sbjct: 467 ---------WHRRLGHPHPQVLQQLVKTNSISI-NKTSKSLCEACQLGKSTRLPFVSSSF 516 Query: 214 RACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSL 35 + +PL VHCD+WGP+P S GF+Y+ VF+D +S+++WI+P+K +S+ N F+ F L Sbjct: 517 TSNRPLERVHCDLWGPSPITSVQGFRYYAVFIDHYSRFSWIYPLKLKSDFYNIFVAFHKL 576 Query: 34 MENLLEFK 11 +EN L K Sbjct: 577 VENQLNHK 584 Score = 33.1 bits (74), Expect(2) = 4e-51 Identities = 14/39 (35%), Positives = 21/39 (53%) Frame = -3 Query: 1890 LPFPNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 +P N+SN V+ L N+++WK Q S L L G + Sbjct: 6 VPSLNISNCVTVTLTAKNYILWKSQFESFLDGQGLLGFV 44 >gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana] Length = 1392 Score = 199 bits (505), Expect(2) = 1e-50 Identities = 151/571 (26%), Positives = 257/571 (45%), Gaps = 5/571 (0%) Frame = -2 Query: 1708 YLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLH 1529 +L+W +DQ V + I ++S ++G + A++ WL L++ F + RK L+ +L Sbjct: 72 FLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLG 131 Query: 1528 SIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVP 1349 + + ++ YL ++K I D L IG PV + + + L+GLG EY +++ Sbjct: 132 TCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHSLDV 191 Query: 1348 LS---FAKLRSRLINHEQWLKDQE-NAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXX 1181 F + +L + L N+ +P +A ++ ++ R N+ Sbjct: 192 YPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNS----------- 240 Query: 1180 XXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPSQICNKKGHFANT 1001 GF + G + N + + + QIC K GH A Sbjct: 241 -------RGGRYGNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFK 293 Query: 1000 FYYRYVPSMNNSPP-MQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQ 824 Y R+ N P + AFA A+R + A S Sbjct: 294 CYTRF--EENYLPEDLPNAFA----AMRVSDQNQASSHE--------------------- 326 Query: 823 GPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSF 644 W+PDS AT+H+TN T + ++ + GD+ +VG+G G + Sbjct: 327 
---WLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQG--TL 381 Query: 643 DLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIK 464 L +VL P I +L S++ T + CS+ F IK ++++L +G LY +K Sbjct: 382 PLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLK 441 Query: 463 SSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKV 284 + T S S+ + W+QRLGHP V+ +L T AI + NK Sbjct: 442 DVPFQ--TYYSTRQQSSDDEV-----------WHQRLGHPNKEVLQHLIKTKAIVV-NKT 487 Query: 283 FESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSK 104 ++C +CQ+GK LPF+ S + +PL +HCD+WGPAP S GF+Y+++F+D++S+ Sbjct: 488 SSNMCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSR 547 Query: 103 YNWIFPMKCRSESLNCFMLFKSLMENLLEFK 11 + W +P+K +S+ + F+LF+ L+EN + K Sbjct: 548 FTWFYPLKLKSDFFSVFVLFQQLVENQYQHK 578 Score = 32.0 bits (71), Expect(2) = 1e-50 Identities = 15/35 (42%), Positives = 21/35 (60%) Frame = -3 Query: 1878 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 N+S V+ KL +N+L+WK Q S L S L G + Sbjct: 11 NISQVVTLKLTPTNYLLWKTQFESYLSSHLLLGFV 45 >emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] Length = 1466 Score = 207 bits (527), Expect = 4e-50 Identities = 166/614 (27%), Positives = 275/614 (44%), Gaps = 13/614 (2%) Frame = -2 Query: 1804 IDQYQFVRFVDGSIEP--QPQFLNHNNVP--VVYPIYLEWRTLDQFVGSCINATISPSLA 1637 + + + FV+G + P Q + + +++V V P Y +W DQ V S + T+S + Sbjct: 37 LSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPNPQYEDWFCTDQLVRSWLFGTLSEEVL 96 Query: 1636 TELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLA 1457 + + +R W+ L++ F + AR+ LR L + + S+ Y K I DSL+ Sbjct: 97 GHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQLLTKKDKSLSVYCRDFKIICDSLS 156 Query: 1456 EIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINH---EQWLKDQE 1286 IG+PV++ + L+GLG EY +Q+ S +KL + N E D + Sbjct: 157 SIGKPVEESMKIFGFLNGLGREYDPITTVIQS-----SLSKLPAPTFNDVISEVQGFDSK 211 Query: 1285 NAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1106 Y ++ + AF + + Sbjct: 212 LQSYDDTVSVNPHLAFNTERSNS----------------GAPQYNSNSRGRGRSGQNRGR 255 Query: 1105 
PGFQFKKGEFNPNLTVD--YGEIP-SQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASF 935 G+ + F+ + + G+ P QIC + GH A Y R+ + + P Q AF+ Sbjct: 256 GGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNNYQSEVPTQ-AFS-- 312 Query: 934 ENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTD 755 ALR ++ G W PDS AT+H+T +T+ + + Sbjct: 313 --ALRVS---------------------------DETGKEWYPDSAATAHITASTSGLQN 343 Query: 754 AVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTL 575 A + G++ +VGDG G + L+ VL P I+ +L S++ Sbjct: 344 ATTYEGNDAVLVGDGTYLPITHVGSTTISSSKG--TIPLNEVLVCPAIQKSLLSVSKLCD 401 Query: 574 ENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSS---ALRRNTALSASIMSNSNT 404 + C F I L ++KV+++GP + LY +++S AL N +AS+ + Sbjct: 402 DYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQCAASMET---- 457 Query: 403 XXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLV 224 W+ RLGH S ++ L T IQ++ VC CQ+GKS L F Sbjct: 458 ------------WHHRLGHSNSKILQQLLTRKEIQVNKSRTSPVCEPCQMGKSTRLQFFS 505 Query: 223 SPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLF 44 S RA +PL VHCD+WGP+P +S+ GFKY+ VFVDDFS+++W FP++ +S+ ++ F+ + Sbjct: 506 SDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLRMKSKFISVFIAY 565 Query: 43 KSLMENLLEFKKKK 2 + L+EN L K K+ Sbjct: 566 QKLVENQLGTKIKE 579 >dbj|BAK41512.1| C-end truncated polyprotein [Arabidopsis thaliana] Length = 1048 Score = 201 bits (512), Expect(2) = 1e-49 Identities = 153/598 (25%), Positives = 255/598 (42%), Gaps = 6/598 (1%) Frame = -2 Query: 1801 DQYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLG 1622 D Y+ F+DGS P + + P V P Y W+ D+ + + + IS S+ + Sbjct: 43 DGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIYNAVLGAISMSVQPAVSR 102 Query: 1621 KSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEP 1442 + A W L KI+ + +QLR QL +G +I DY+ T D LA +G+P Sbjct: 103 ATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGFVTRFDQLALLGKP 162 Query: 1441 VQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLI 1262 + D+ V L L EY P + ++ RL+N E + +A P+ Sbjct: 163 
MDHDEQVERVLENLPEEYKPVKAC-----TPPTLTEIHERLLNQESKILAVSSATVIPIT 217 Query: 1261 ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKG 1082 A+ + N +Q Sbjct: 218 ANAVSHRNTTTTNNNN-------------------NGNRNNRYDNRNNNNNSKPWQQSST 258 Query: 1081 EFNPNLTVDYGEIPS-QICNKKGHFAN--TFYYRYVPSMNNSPPMQKAFASFENALRFHS 911 F+PN + QIC +GH A + ++ S+N+ P F Sbjct: 259 NFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPSSP----------FTP 308 Query: 910 WSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDE 731 W P ++A +S N+ W+ DSGAT H+T++ ++ + G + Sbjct: 309 WQPRANLALGSP-----------YSSNN----WLLDSGATHHITSDFNNLSLHQPYTGGD 353 Query: 730 QAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEF 551 MV DG +LHN+LYVP+I NL S+ N S EF Sbjct: 354 DVMVADGSTIPISHTGSTSLSTK--SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEF 411 Query: 550 FPWGYEIKSLPSRKVLARGPNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXX 377 FP +++K L + L +G ELY PI SS + +L AS S + Sbjct: 412 FPASFQVKDLNTGVPLLQGKTKDELYEWPIASS---QPVSLFASPSSKAT---------- 458 Query: 376 XXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESV-CTSCQLGKSHSLPFLVSPSRACQP 200 W+ RLGHP +++N++ + ++ + N + + C+ C + KS+ +PF S + +P Sbjct: 459 HSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRP 518 Query: 199 LSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSLMEN 26 L ++ D+W +P +SH ++Y+++FVD F++Y W++P+K +S+ F+ FK+L+EN Sbjct: 519 LEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLEN 575 Score = 25.8 bits (55), Expect(2) = 1e-49 Identities = 13/35 (37%), Positives = 20/35 (57%) Frame = -3 Query: 1878 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1774 N+SN TKL +N+L+W Q+ ++ L G L Sbjct: 19 NMSNV--TKLTSTNYLMWSRQVHALFDGYELAGFL 51 >emb|CAN77295.1| hypothetical protein VITISV_005638 [Vitis vinifera] Length = 1198 Score = 204 bits (519), Expect = 3e-49 Identities = 160/555 (28%), Positives = 245/555 (44%), Gaps = 12/555 (2%) Frame = -2 Query: 1633 ELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAE 1454 +++G + + W L K F+ AR QLR +L S ++G S+ DY+ ++K +DSLA Sbjct: 3 
QIIGHNSSHSAWNALEKTFSSSSRARIMQLRLELQSTKKGSLSMIDYIMKVKGAADSLAA 62 Query: 1453 IGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQEN--- 1283 IGEPV + D VM L GLGS+Y V + ++ +S + S L+ E L+ Q + Sbjct: 63 IGEPVSEQDQVMNLLGGLGSDYNAVVTAINIKDDKISIEVVHSMLLAFEHRLEQQSSIEQ 122 Query: 1282 -AIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1106 + S A SNS R+ Sbjct: 123 FSSISANYASSSNSRGSGRRYNG-----------------GRGQNHTPNISNYTYRGRGR 165 Query: 1105 PGFQFKKGEFNPNLTVDYGEIPS-QICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFEN 929 G + G N N + E P Q+C K GH Y+++ S +S Q + S N Sbjct: 166 GGRYGQNGRHNSNSS----EKPQCQLCGKFGHTVQICYHKFDISYQSS---QSSNTSPSN 218 Query: 928 ALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAV 749 A +S PA + S N W DSGA H+T + +T + Sbjct: 219 ASNPNS-IPAMVAS----------------SNNLAEDTWYLDSGANHHLTQSVGNLTSSS 261 Query: 748 EFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLEN 569 + G ++ +G+GK SF L V +V I NL S+A F L+N Sbjct: 262 PYTGIDKVTIGNGKHLSISNTGSHRLLSD--SRSFHLKKVFHVHFISANLISVAKFYLDN 319 Query: 568 SCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXXXX 389 + +EF + +K L ++KVLA+G + LY ++ + A S + Sbjct: 320 NALFEFRSNSFFVKDLHTKKVLAQGKLENGLYRFPVLNSKKVAFVGAINSSTFYSHNSSI 379 Query: 388 XXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNK-------VFESVCTSCQLGKSHSLPF 230 LW+ RLGH + ++ + + + V +VC+SCQL KSH LP Sbjct: 380 FDNKVKLWHHRLGHASTNIVTQIMQSCNVSFEKNKNTVCSTVCSTVCSSCQLAKSHRLPT 439 Query: 229 LVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFM 50 +S S A +PL LVH D+WGPA S G +YFI+F+DD+S+Y W +P++ + ++L F Sbjct: 440 HLSLSCASKPLELVHTDLWGPASVKSTSGARYFILFLDDYSRYTWFYPLQTKDQALPAFK 499 Query: 49 LFKSLMENLLEFKKK 5 FK +EN + K K Sbjct: 500 KFKLQVENQFDAKIK 514 >gb|KHN36156.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1417 Score = 192 bits (488), Expect(2) = 2e-48 Identities = 147/582 (25%), Positives = 252/582 (43%), Gaps = 21/582 (3%) Frame = -2 Query: 1708 YLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLH 1529 Y +W DQ + + + +T+S + 
+L A + W + K F +R QLR +L Sbjct: 54 YQQWLIKDQTLFTWLLSTLSDGVLPRVLSCRHAHEVWDKIHKYFNSVLKSRARQLRSELK 113 Query: 1528 SIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVP 1349 + ++ S+ +YL ++K+I +SL +G+ V + + V L GL E+ FV+ + +R Sbjct: 114 NTKKLSRSVNEYLLRIKSIVNSLVAVGDMVSEQEQVDSILEGLPEEFNSFVMMVYSRFDT 173 Query: 1348 LSFAKLRSRLINHEQWLKDQENAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXX 1169 + + + L+ L++ + + + PS SA + N Sbjct: 174 PTVEDVEALLL-----LQEAQFEKFKQELTSPSVSANVAHTETNA------SDSNSEHES 222 Query: 1168 XXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPSQICNKKGHFANTFYYR 989 G KG+ + G++ QIC K H A +YR Sbjct: 223 QELGTEHYNVNANRGRGRGKGRGRGRGKGQAQ-----NQGKVKCQICAKPNHDAINCWYR 277 Query: 988 YVPSMNNSPPMQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQ--GPI 815 Y P N Q + ++ S P Y DQ Sbjct: 278 YDPQAMN----QNSRGGYQVG---PSNRPQNFNPYMRPTAHLAMPQPYAMPNMDQFSNGA 330 Query: 814 WIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTG-DTSFDL 638 W PDSGA+ H+T N ++ + + G +Q ++G+G+ + L Sbjct: 331 WYPDSGASHHLTYNPNNLSYSSPYTGQDQVVMGNGQGVSIHSLGQSQFHSPNEPNVKLTL 390 Query: 637 HNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGP-NASELYPIKS 461 ++L+VP+I NL S++ F +N+ +EF P+ +K S++VL G A LY K Sbjct: 391 KDLLHVPNISKNLLSVSKFAQDNNVIFEFHPYHCFVKYQDSKQVLLEGTVGADGLYQFKP 450 Query: 460 SALRRNTALSASIMSNS-----------------NTXXXXXXXXXXXLWNQRLGHPISTV 332 N+ +++ S+S NT +W+ RLGH ++ Sbjct: 451 FKFLTNSGAASNSDSSSMSSSSQFSVFNNPVNCNNTVSVMQNGNVFQMWHLRLGHAHTSA 510 Query: 331 INNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSIS 152 + N+ I SNK CT C +GKSH L +S + +P ++HCD+WGPAP +S Sbjct: 511 VKNILNLCNIPFSNKTATLPCTFCCMGKSHRLHSPLSNTVYTKPFEVIHCDLWGPAPFVS 570 Query: 151 HLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSLMEN 26 + G+ Y+I FVD ++K+ WI+ +K +S++L F FK+L++N Sbjct: 571 YYGYSYYITFVDTYTKFTWIYFLKAKSDALKAFTQFKALIQN 612 Score = 30.8 bits (68), Expect(2) = 2e-48 Identities = 11/33 (33%), Positives = 22/33 (66%) Frame = -3 Query: 1863 VSTKLDGSNFLVWKDQLSSILISTNLYGLLMVP 1765 ++ KLD NFL+W Q++ ++ + NL+ ++ P Sbjct: 1 LTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNP 33 
>gb|KHN22040.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1417 Score = 192 bits (488), Expect(2) = 2e-48 Identities = 147/582 (25%), Positives = 252/582 (43%), Gaps = 21/582 (3%) Frame = -2 Query: 1708 YLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLH 1529 Y +W DQ + + + +T+S + +L A + W + K F +R QLR +L Sbjct: 54 YQQWLIKDQTLFTWLLSTLSDGVLPRVLSCRHAHEVWDKIHKYFNSVLKSRARQLRSELK 113 Query: 1528 SIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVP 1349 + ++ S+ +YL ++K+I +SL +G+ V + + V L GL E+ FV+ + +R Sbjct: 114 NTKKLSRSVNEYLLRIKSIVNSLVAVGDMVSEQEQVDSILEGLPEEFNSFVMMVYSRFDT 173 Query: 1348 LSFAKLRSRLINHEQWLKDQENAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXX 1169 + + + L+ L++ + + + PS SA + N Sbjct: 174 PTVEDVEALLL-----LQEAQFEKFKQELTSPSVSANVAHTETNA------SDSNSEHES 222 Query: 1168 XXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPSQICNKKGHFANTFYYR 989 G KG+ + G++ QIC K H A +YR Sbjct: 223 QELGTEHYNVNANRGRGRGKGRGRGRGKGQAQ-----NQGKVKCQICAKPNHDAINCWYR 277 Query: 988 YVPSMNNSPPMQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQ--GPI 815 Y P N Q + ++ S P Y DQ Sbjct: 278 YDPQAMN----QNSRGGYQVG---PSNRPQNFNPYMRPTAHLAMPQPYAMPNMDQFSNGA 330 Query: 814 WIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTG-DTSFDL 638 W PDSGA+ H+T N ++ + + G +Q ++G+G+ + L Sbjct: 331 WYPDSGASHHLTYNPNNLSYSSPYTGQDQVVMGNGQGVSIHSLGQSQFHSPNEPNVKLTL 390 Query: 637 HNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGP-NASELYPIKS 461 ++L+VP+I NL S++ F +N+ +EF P+ +K S++VL G A LY K Sbjct: 391 KDLLHVPNISKNLLSVSKFAQDNNVIFEFHPYHCFVKYQDSKQVLLEGTVGADGLYQFKP 450 Query: 460 SALRRNTALSASIMSNS-----------------NTXXXXXXXXXXXLWNQRLGHPISTV 332 N+ +++ S+S NT +W+ RLGH ++ Sbjct: 451 FKFLTNSGAASNSDSSSMSSSSQFSVFNNPVNCNNTVSVMQNGNVFQMWHLRLGHAHTSA 510 Query: 331 INNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSIS 152 + N+ I SNK CT C +GKSH L +S + +P ++HCD+WGPAP +S Sbjct: 511 VKNILNLCNIPFSNKTATLPCTFCCMGKSHRLHSPLSNTVYTKPFEVIHCDLWGPAPFVS 570 Query: 151 
HLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSLMEN 26 + G+ Y+I FVD ++K+ WI+ +K +S++L F FK+L++N Sbjct: 571 YYGYSYYITFVDTYTKFTWIYFLKAKSDALKAFTQFKALIQN 612 Score = 30.8 bits (68), Expect(2) = 2e-48 Identities = 11/33 (33%), Positives = 22/33 (66%) Frame = -3 Query: 1863 VSTKLDGSNFLVWKDQLSSILISTNLYGLLMVP 1765 ++ KLD NFL+W Q++ ++ + NL+ ++ P Sbjct: 1 LTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNP 33 >emb|CAB40035.1| retrotransposon like protein [Arabidopsis thaliana] gi|7267767|emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana] Length = 1515 Score = 199 bits (505), Expect = 1e-47 Identities = 151/571 (26%), Positives = 257/571 (45%), Gaps = 5/571 (0%) Frame = -2 Query: 1708 YLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLH 1529 +L+W +DQ V + I ++S ++G + A++ WL L++ F + RK L+ +L Sbjct: 69 FLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLG 128 Query: 1528 SIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVP 1349 + + ++ YL ++K I D L IG PV + + + L+GLG EY +++ Sbjct: 129 TCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHSLDV 188 Query: 1348 LS---FAKLRSRLINHEQWLKDQE-NAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXX 1181 F + +L + L N+ +P +A ++ ++ R N+ Sbjct: 189 YPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNS----------- 237 Query: 1180 XXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPSQICNKKGHFANT 1001 GF + G + N + + + QIC K GH A Sbjct: 238 -------RGGRYGNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFK 290 Query: 1000 FYYRYVPSMNNSPP-MQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQ 824 Y R+ N P + AFA A+R + A S Sbjct: 291 CYTRF--EENYLPEDLPNAFA----AMRVSDQNQASSHE--------------------- 323 Query: 823 GPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSF 644 W+PDS AT+H+TN T + ++ + GD+ +VG+G G + Sbjct: 324 ---WLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQG--TL 378 Query: 643 DLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIK 464 L +VL P I +L S++ T + CS+ F IK ++++L +G LY +K Sbjct: 379 PLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLK 438 Query: 
463 SSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKV 284 + T S S+ + W+QRLGHP V+ +L T AI + NK Sbjct: 439 DVPFQ--TYYSTRQQSSDDEV-----------WHQRLGHPNKEVLQHLIKTKAIVV-NKT 484 Query: 283 FESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSK 104 ++C +CQ+GK LPF+ S + +PL +HCD+WGPAP S GF+Y+++F+D++S+ Sbjct: 485 SSNMCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSR 544 Query: 103 YNWIFPMKCRSESLNCFMLFKSLMENLLEFK 11 + W +P+K +S+ + F+LF+ L+EN + K Sbjct: 545 FTWFYPLKLKSDFFSVFVLFQQLVENQYQHK 575 >gb|KFK44388.1| hypothetical protein AALP_AA1G251100, partial [Arabis alpina] Length = 2090 Score = 196 bits (498), Expect = 8e-47 Identities = 158/607 (26%), Positives = 263/607 (43%), Gaps = 14/607 (2%) Frame = -2 Query: 1804 IDQYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELL 1625 +D Y +DGS E L N+V V P Y W D+ + S + IS L + Sbjct: 50 LDGYALAGHLDGSKEIPAATLTTNDVVSVNPAYTLWTRQDRLIFSSLIGAISTPLQPLVS 109 Query: 1624 GKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGE 1445 + + + W L+ + + QL+ QL + +I Y+ + T D LA +G Sbjct: 110 RATSSSEIWNTLASTYAKPSRGHIRQLKTQLKQWHKETKTIDVYVQGITTRLDQLAILGA 169 Query: 1444 PVQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAI--YS 1271 + ++ + L GL EY + V ++ R+ P + +L RL+NHE L + + Sbjct: 170 AMGHEEQIDLILDGLPEEYKNVVDQVEGRDTPPTITELHERLLNHEAKLLSAMETLVPHG 229 Query: 1270 PLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQF 1091 P+ A+ + F +N P + Sbjct: 230 PVTANAAQHRNFSNNNKNQ-----------------------SRNRTTNNQWQHSPSSNW 266 Query: 1090 KKGEFNPNLTVDYGEIP----SQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFENAL 923 + G+ N G P QIC +GH A R +S SF+++ Sbjct: 267 QSGQ---NRADSQGPRPYLGRCQICGIQGHSAK----RCSKLQRHSAKRCSKLQSFQSSA 319 Query: 922 R-----FHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMT 758 + F SW P ++A +S ++ W+ DSGAT HMT++ ++ Sbjct: 320 QQQQSPFTSWQPRANLAMNSS-----------YSADN----WLLDSGATHHMTSDLHNLS 364 Query: 757 DAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFT 578 + G + + DG + D LH VLYVP ++ NL S+ Sbjct: 365 
LHQPYRGSDGVTIADGSTIPITQTGFKSFPSNSRD--LQLHKVLYVPDLQKNLISVYRLC 422 Query: 577 LENSCSYEFFPWGYEIKSLPSRKVLARGPNASELY--PIKSSALRRNTALSASIMSNSNT 404 N S EFFP +++K L + L +G +ELY PI SS+ TA +AS S + Sbjct: 423 NTNRVSVEFFPASFQVKDLSTETPLLQGRTINELYEWPISSSS---PTAFAASPSSTTTL 479 Query: 403 XXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESV-CTSCQLGKSHSLPFL 227 W+ RLGHP S + NN+ + +I +S + + + C+ C + K+H +PF Sbjct: 480 QS----------WHSRLGHPSSLIFNNIVSRFSIPISKQSSQPLSCSDCFINKTHKIPFS 529 Query: 226 VSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFML 47 S + +PL ++ D+W +P +S FKY+++FVD +++Y W++P+K +S+ + F+ Sbjct: 530 KSTITSSKPLEYIYSDVWS-SPILSLENFKYYLIFVDHYTRYTWLYPLKLKSQVKDTFIA 588 Query: 46 FKSLMEN 26 FKSL+EN Sbjct: 589 FKSLVEN 595 Score = 95.5 bits (236), Expect = 2e-16 Identities = 54/163 (33%), Positives = 92/163 (56%), Gaps = 3/163 (1%) Frame = -2 Query: 505 LARGPNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTV 332 L +G +ELY PI SS+ TA +AS S + W+ RLGHP S + Sbjct: 1507 LLQGRTINELYEWPISSSS---PTAFAASPSSTTTLQS----------WHSRLGHPSSLI 1553 Query: 331 INNLHTTGAIQLSNKVFESV-CTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSI 155 NN+ + +I +S + + + C+ C + K+H +PF S + +PL ++ D+W Sbjct: 1554 FNNIVSRFSIPISKQSSQPLSCSDCFINKTHKIPFSKSTITSSKPLEYIYSDVWS----- 1608 Query: 154 SHLGFKYFIVFVDDFSKYNWIFPMKCRSESLNCFMLFKSLMEN 26 SH+ Y+++FVD +++Y W++P+K +S+ + F+ FKSL+EN Sbjct: 1609 SHI---YYLIFVDHYTRYTWLYPLKLKSQVKDTFIAFKSLVEN 1648 >emb|CAN81099.1| hypothetical protein VITISV_017741 [Vitis vinifera] Length = 1455 Score = 192 bits (488), Expect = 1e-45 Identities = 152/560 (27%), Positives = 252/560 (45%), Gaps = 29/560 (5%) Frame = -2 Query: 1597 LHLSKIFTQQFFARKS-----QLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQD 1433 L LS+ F +Q+FA ++ Q + QL ++G +I +YL ++K DSLA +G + Sbjct: 102 LFLSQYFLEQYFASQTRAKAKQFKTQLQHTKKGGSTIDEYLAKIKVCVDSLASVGVSLST 161 Query: 1432 DDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLIADP 1253 D V L GL ++Y FV ++ R S ++ + L+ HE ++ N++ S A Sbjct: 162 KDHVESILDGLPNDYESFVTSVILRNDDFSVEEIEALLMAHESRVEKNNNSLDSSPSAHV 
221 Query: 1252 SNSA-----------FFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1106 ++S ++ Q + Sbjct: 222 ASSNAVEKGNRFKQDYYAANSQGSHSGYNGGFGRGGDFGRRGGFYGGRGFNWNYNGRSNR 281 Query: 1105 PGFQFK--KGEFN---PNLTVDYGEIPS-QICNKKGHFANTFYYRYVPSMNNSPPMQKAF 944 GF+ + KG F P + + E P+ Q+C K GH YYR+ +++ + + Sbjct: 282 GGFRGRGNKGSFQARPPWNSDNQNEKPACQLCGKIGHVVAQCYYRF----DHTFQVPQNL 337 Query: 943 ASFENALR-FHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTA 767 +S ++ R ++S+SP + W PDSGA++H+T N Sbjct: 338 SSRNSSPRAYYSFSPQVNGVIPTSEVFSDDN-------------WYPDSGASNHVTPNPE 384 Query: 766 IMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIA 587 + + EF G Q VG+G L+++L+VP I NL S++ Sbjct: 385 NLMKSAEFAGQNQVHVGNGTGLSIKHIGQSEFLSPFSSKPLLLNHLLHVPSITKNLLSVS 444 Query: 586 NFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSS--ALRRNTALSAS---- 425 F +N +EF +K ++ VL G LY SS ALR +LS S Sbjct: 445 KFAKDNKVFFEFHSDSCFVKDQVTQAVLMVGKVRDGLYAFDSSHLALRPTQSLSKSPSVV 504 Query: 424 IMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKS 245 S S+ LW++RLGHP + I N+ + + NK+ + C+SC LGK Sbjct: 505 ASSFSSKVCTTSLSSTFDLWHKRLGHPSAATIKNVLSKCNVAHINKMDSNFCSSCCLGKI 564 Query: 244 HSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSES 65 H PF +S + +PL L+H D+WGP +S+ G++Y+I FVD FS+++WIF ++ +SE+ Sbjct: 565 HRFPFSLSHTTYTKPLELIHLDLWGPTLVLSNSGYRYYIHFVDAFSRFSWIFLLRNKSEA 624 Query: 64 LNCFMLFKSLMENLLEFKKK 5 + F+ FK+ +E + K K Sbjct: 625 IKTFVNFKTQVELQFDLKIK 644