BLASTX nr result
ID: Atropa21_contig00033118
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00033118 (1700 letters)

Database: nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   272   4e-70
gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptas...   260   1e-66
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   249   3e-63
ref|XP_004252692.1| PREDICTED: uncharacterized protein LOC101261...   218   5e-54
ref|XP_004247247.1| PREDICTED: uncharacterized protein LOC101256...   215   5e-53
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   212   3e-52
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   204   9e-50
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                199   4e-48
gb|EMJ15226.1| hypothetical protein PRUPE_ppa016677mg [Prunus pe...   136   3e-46
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               191   8e-46
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   191   8e-46
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   190   2e-45
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   189   2e-45
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           189   2e-45
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   188   5e-45
gb|ABA99600.2| retrotransposon protein, putative, unclassified [...   122   9e-45
emb|CAN62743.1| hypothetical protein VITISV_033107 [Vitis vinifera]   127   2e-44
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...
185 6e-44 gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] 157 1e-35 dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal... 157 1e-35 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 272 bits (695), Expect = 4e-70 Identities = 137/308 (44%), Positives = 202/308 (65%) Frame = +1 Query: 370 TSCISLTQRNINIMKNEPVLSHAHGIELCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFY 549 T +L ++N ++ LS L ++V+ EI EAL IG+DK PG+D ++A+F+ Sbjct: 407 TRASTLMGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFF 466 Query: 550 KKSWNIIKGDLMAVVRDFFYKGKLYRPIECTFITLFPKTACPIIVKEYRPIACCSALFKI 729 KKSW IK ++ A +++FF +++RPI C +TL PK VKE+RPIACC+ ++KI Sbjct: 467 KKSWGSIKQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKI 526 Query: 730 IAKVLASRLQIIIALVTSESQSWIIPGRKIANNIILVTELVKAYSQKHIFPRCMVKIDLQ 909 I+K+L +R++ II V +E+QS IPGR IA+NI+L +EL++ Y++KH+ PRC++K+D++ Sbjct: 527 ISKMLTNRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIR 586 Query: 910 KAYDSVGWVYLKQILEALCFPVKFVEWHMECIYTVNHSITINEESTPPLI*CS*GT*TRE 1089 KAYDSV W +L+ +L FP +FV W MEC+ TV++S+ +N T P G + Sbjct: 587 KAYDSVEWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQ-ARKGLRQGD 645 Query: 1090 THFPFLVFHCHGYLSRCLVELRVTKNFKFYPKCARFEITHLCFADDLLLFARGDLASVQT 1269 PFL C YLSRCL EL+ + +F F+PKC R ITHL FADDLL+F R D +S+ Sbjct: 646 PMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDH 705 Query: 1270 LPVFPQVF 1293 + V Q F Sbjct: 706 MNVAFQKF 713 Score = 147 bits (371), Expect = 1e-32 Identities = 83/221 (37%), Positives = 126/221 (57%), Gaps = 10/221 (4%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*HTSA--LLMIY 1231 F A KGLRQG P+SPFLF++ M+ L + L + D H L + + Sbjct: 635 FQARKGLRQGDPMSPFLFALCMEY---------LSRCLEELKGSPDFNFHPKCERLNITH 685 Query: 1232 YFLQEEI*LLCK--------LYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHL 1387 +++ + C+ + F KFS ASGL A+ KS+IYF GV D T + ++ Sbjct: 686 LMFADDLLMFCRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYV 745 Query: 1388 
GFSLGELPFKYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFG 1567 LGELPF+YLG+PL++KKLT Q PL++ I R + AK LSYAGR+QL++ +L Sbjct: 746 HMQLGELPFRYLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSS 805 Query: 1568 IQAYWS*LFVIPSKELKFIDAYCRSYVWSGINTITKRALIA 1690 +Q YW+ +F + K ++ ++ CR ++W+G TK+A +A Sbjct: 806 MQNYWAHIFPLSKKVIQAVEKVCRKFLWTGKTEETKKAPVA 846 >gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago truncatula] Length = 402 Score = 260 bits (664), Expect = 1e-66 Identities = 135/301 (44%), Positives = 188/301 (62%) Frame = +1 Query: 370 TSCISLTQRNINIMKNEPVLSHAHGIELCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFY 549 +S SL + N++K P+LS LC K + E+ L S+ K PG+D Y+ HF+ Sbjct: 90 SSVDSLPMVDKNVVKRGPMLSQHQQDLLCSKFTAVEVKNVLFSMDSSKAPGIDGYNVHFF 149 Query: 550 KKSWNIIKGDLMAVVRDFFYKGKLYRPIECTFITLFPKTACPIIVKEYRPIACCSALFKI 729 K SWNII ++ + DFF G + + I CT++TL PK VK +RPIACCS ++KI Sbjct: 150 KCSWNIIGDSVIDAILDFFKTGFMPKIINCTYMTLLPKEVNVTSVKNFRPIACCSVIYKI 209 Query: 730 IAKVLASRLQIIIALVTSESQSWIIPGRKIANNIILVTELVKAYSQKHIFPRCMVKIDLQ 909 I+K+L SR+Q ++ V SE+QS + GR I +NIIL ELVK+YS+K I PRCMVKIDLQ Sbjct: 210 ISKILTSRMQGVLNSVVSENQSAFVKGRVIFDNIILSHELVKSYSRKGISPRCMVKIDLQ 269 Query: 910 KAYDSVGWVYLKQILEALCFPVKFVEWHMECIYTVNHSITINEESTPPLI*CS*GT*TRE 1089 KAY+SV W ++K ++ L F KFV W M C+ T +++ IN + T P G + Sbjct: 270 KAYNSVEWPFIKHLMLELGFSYKFVNWVMGCLTTASYTFNINGDLTRPFA-AKKGLRQGD 328 Query: 1090 THFPFLVFHCHGYLSRCLVELRVTKNFKFYPKCARFEITHLCFADDLLLFARGDLASVQT 1269 P+L C YL+ CL++LR F+F+P+C R + H+CF DDLLLF+RGD+ SV Sbjct: 329 PISPYLFVICMEYLNICLIQLRKNAAFRFHPRCKRLNLIHVCFVDDLLLFSRGDVDSVSQ 388 Query: 1270 L 1272 L Sbjct: 389 L 389 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1114 Score = 249 bits (636), Expect = 3e-63 Identities = 125/301 (41%), Positives = 190/301 (63%) Frame = +1 Query: 370 TSCISLTQRNINIMKNEPVLSHAHGIELCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFY 549 TS L ++++++ LS +L + ++ +EI +AL I D K PG+D +++ F+ Sbjct: 410 TSSSQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFF 469 Query: 550 KKSWNIIKGDLMAVVRDFFYKGKLYRPIECTFITLFPKTACPIIVKEYRPIACCSALFKI 729 KKSW +IK ++ + DFF G +++PI CT +TL PK K+YRPIACCS L+KI Sbjct: 470 KKSWLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKI 529 Query: 730 IAKVLASRLQIIIALVTSESQSWIIPGRKIANNIILVTELVKAYSQKHIFPRCMVKIDLQ 909 I+K+L RLQ +I V +Q+ IP R I +NI+L TEL++ Y+++H+ PRC++K+D++ Sbjct: 530 ISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPRCVIKVDIR 589 Query: 910 KAYDSVGWVYLKQILEALCFPVKFVEWHMECIYTVNHSITINEESTPPLI*CS*GT*TRE 1089 KAYDSV WV+L+ +L+ L FP F+ W M C+ TV++SI +N + P G + Sbjct: 590 KAYDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFD-AQKGLRQGD 648 Query: 1090 THFPFLVFHCHGYLSRCLVELRVTKNFKFYPKCARFEITHLCFADDLLLFARGDLASVQT 1269 PFL YLSRC+ + F F+PKC R ++THL FADDLL+FAR D +S+ Sbjct: 649 PLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISK 708 Query: 1270 L 1272 + Sbjct: 709 I 709 Score = 155 bits (393), Expect = 4e-35 Identities = 87/214 (40%), Positives = 123/214 (57%), Gaps = 1/214 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*-HTSALLMIYY 1234 FDA KGLRQG P+SPFLF+++M+ + N N P +K H + Sbjct: 638 FDAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLM 697 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 F + + + K+ F FSKASGLQA+I KS IYFGGV + + +G LPF Sbjct: 698 FARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPF 757 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LF 1594 +YLG+PL++KKL Q PLIDKI R A LSYAGR+QLV+ +L+ +Q YW +F Sbjct: 758 RYLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIF 817 Query: 1595 VIPSKELKFIDAYCRSYVWSGINTITKRALIA*D 1696 +P K +K ++ CR ++W+G + +A +A D Sbjct: 818 
PLPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWD 851 >ref|XP_004252692.1| PREDICTED: uncharacterized protein LOC101261795 [Solanum lycopersicum] Length = 413 Score = 218 bits (556), Expect = 5e-54 Identities = 102/197 (51%), Positives = 140/197 (71%) Frame = +1 Query: 397 NINIMKNEPVLSHAHGIELCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKG 576 N +MK PV S I+LC ++++EIY LQS G+DK PG+D Y+A F+K +W IIK Sbjct: 217 NAQVMKRGPVSSRQQRIQLCTDITEQEIYSTLQSYGNDKAPGIDGYNALFFKHTWKIIKK 276 Query: 577 DLMAVVRDFFYKGKLYRPIECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRL 756 D++ V++FF GKL++P CT ++L PK CP VKEY PIACC+ L+KII+KV+ R+ Sbjct: 277 DVIEAVKNFFTTGKLFKPFNCTLVSLIPKVQCPKTVKEYTPIACCTVLYKIISKVITRRM 336 Query: 757 QIIIALVTSESQSWIIPGRKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWV 936 +I V ESQ+ IPGRKIA+NIIL ELVK Y++K+I PR ++KIDL KAYDSV W Sbjct: 337 HDVIHDVICESQAGFIPGRKIADNIILAHELVKTYTRKNISPRIILKIDLHKAYDSVEWP 396 Query: 937 YLKQILEALCFPVKFVE 987 +L+Q++ L FP F++ Sbjct: 397 FLEQVMVGLGFPEMFIQ 413 >ref|XP_004247247.1| PREDICTED: uncharacterized protein LOC101256917 [Solanum lycopersicum] Length = 421 Score = 215 bits (547), Expect = 5e-53 Identities = 100/195 (51%), Positives = 141/195 (72%) Frame = +1 Query: 463 VSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRPIECT 642 V++E+I+ ALQSIG+DK PG+D Y+A F+K +W IIK D++ VV+ FF GKL++P CT Sbjct: 106 VTEEKIFAALQSIGNDKAPGIDGYNAFFFKYTWKIIKNDIIEVVQSFFKPGKLFKPFNCT 165 Query: 643 FITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPGRKIA 822 ++L PK P VKEYR I CC+ L+KII+KV+ +R+ +I V +SQ I GRKI+ Sbjct: 166 LVSLIPKVQSPKNVKEYRTITCCTVLYKIISKVITNRMHDVIHNVICDSQVGFILGRKIS 225 Query: 823 NNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEWHMEC 1002 NI+L ELV +Y++K+I PR M+KIDLQK YDSV W +LKQ++ L FP F +W M C Sbjct: 226 ENILLAHELVNSYTRKNISPRSMLKIDLQKVYDSVEWPFLKQVMVGLGFPDMFTQWVMHC 285 Query: 1003 IYTVNHSITINEEST 1047 + TVN++I +N ++T Sbjct: 286 VKTVNYTIVVNGQTT 300 Score = 87.4 bits (215), Expect = 2e-14 Identities = 41/69 (59%), Positives = 54/69 (78%) Frame = +2 Query: 1280 
FLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPFKYLGIPLSTKKLTIL 1459 FL+FS+ASG QAN+ KSSIY GGV + I + L + + E+PFKYLG+PLS+KKL+IL Sbjct: 333 FLEFSQASGQQANLNKSSIYCGGVQMEVRQQIVRQLHYKMEEIPFKYLGVPLSSKKLSIL 392 Query: 1460 QWTPLIDKI 1486 QW PLI+K+ Sbjct: 393 QWYPLIEKL 401 >ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum] Length = 717 Score = 212 bits (540), Expect = 3e-52 Identities = 114/215 (53%), Positives = 149/215 (69%), Gaps = 1/215 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*-HTSALLMIYY 1234 FDAAKGLRQG P+SPFLF+IAM+ +L K P L H + Sbjct: 443 FDAAKGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYHPKYAKLDVTHLCFADDLLL 502 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 F + ++ + L + F +FS+ASGLQAN+ KSSIY GGV + I Q LG+++ ELPF Sbjct: 503 FSRGDLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQMEVRQQIIQQLGYTIEELPF 562 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LF 1594 KYLG+PLS+KKL +QW PLI+K++ARI+S TAKKLSYAGR QLV+ VLFG+QA W+ LF Sbjct: 563 KYLGVPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLF 622 Query: 1595 VIPSKELKFIDAYCRSYVWSGINTITKRALIA*DK 1699 +IP+K +K I+ CRSY+WSG+ +TK+ALIA DK Sbjct: 623 IIPAKIIKLIEGLCRSYLWSGVGYVTKKALIAWDK 657 Score = 172 bits (437), Expect = 3e-40 Identities = 92/203 (45%), Positives = 134/203 (66%), Gaps = 3/203 (1%) Frame = +1 Query: 673 PIIVKEYRPIACCSALFKIIAKVLAS---RLQIIIALVTSESQSWIIPGRKIANNIILVT 843 P++ ++R C + + + I + L S +I + S+SQ+ IPGRKI +NIIL Sbjct: 313 PVLSIQHRIQLCATIIEQEIIEALKSIGNDKAPVIHTIISDSQAGFIPGRKIGDNIILAH 372 Query: 844 ELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEWHMECIYTVNHS 1023 ELVKAY++K++ PRCM+KIDL KAYDSV W +L+Q++E L FP F +W M+C+ TVN++ Sbjct: 373 ELVKAYTRKNVSPRCMLKIDLHKAYDSVEWPFLEQVMEGLGFPDLFTKWVMKCVKTVNYT 432 Query: 1024 ITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNFKFYPKCARFEI 1203 I +N ++T + G + PFL YLSR L L+ K+FK++PK A+ ++ Sbjct: 433 IVVNGQNTQRFD-AAKGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYHPKYAKLDV 491 Query: 1204 
THLCFADDLLLFARGDLASVQTL 1272 THLCFADDLLLF+RGDL S++ L Sbjct: 492 THLCFADDLLLFSRGDLNSIKAL 514 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 204 bits (519), Expect = 9e-50 Identities = 108/284 (38%), Positives = 168/284 (59%) Frame = +1 Query: 451 LCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRP 630 L ++V+ EEI + L ++ ++K PG D Y++ F+K +W++ D +A ++ FF KG L + Sbjct: 746 LTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWSLTGPDFIAAIQSFFVKGFLPKG 805 Query: 631 IECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPG 810 + T + L PK I +K+YRPI+CC+ L+K+I+K+LA+RL++++ ++QS + Sbjct: 806 LNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKILANRLKLLLPSFILQNQSAFVKE 865 Query: 811 RKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEW 990 R + N++L TELVK Y ++ + PRC +KID+ KA+DSV W +L LEAL FP F W Sbjct: 866 RLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHW 925 Query: 991 HMECIYTVNHSITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNF 1170 CI T S+ +N E S G P+L C LS + E V +N Sbjct: 926 IKLCISTATFSVQVNGE-LAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNI 984 Query: 1171 KFYPKCARFEITHLCFADDLLLFARGDLASVQTLPVFPQVFKGF 1302 ++PKC + +THLCFADDL++F G S++ + VFK F Sbjct: 985 GYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGV---INVFKEF 1025 Score = 114 bits (285), Expect = 1e-22 Identities = 67/212 (31%), Positives = 117/212 (55%), Gaps = 1/212 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*-HTSALLMIYY 1234 F +++GLRQG +SP+LF I M+ ++ + + + P + H + Sbjct: 947 FGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMV 1006 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 F+ + + F +F+ SGLQ ++ KS+IY GV+ + + F+ G+LP Sbjct: 1007 FVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPV 1066 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LF 1594 +YLG+PL TK++T ++PLI+ + +ISS TA+ LSYAGR+ L+ V+ I +W + Sbjct: 1067 RYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAY 1126 Query: 
1595 VIPSKELKFIDAYCRSYVWSGINTITKRALIA 1690 +P+ ++ I+ C +++WSG K+A IA Sbjct: 1127 RLPAGCIREIEKLCSAFLWSGPVLNPKKAKIA 1158 >gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] Length = 740 Score = 199 bits (505), Expect = 4e-48 Identities = 104/288 (36%), Positives = 164/288 (56%) Frame = +1 Query: 451 LCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRP 630 L ++V+ EE + L ++ +K PG D Y++ F+K +W+I D +A ++ FF KG L + Sbjct: 22 LTREVTSEENQKVLFAMPSNKFPGPDGYTSEFFKATWSITGQDFIAAIKSFFIKGFLPKG 81 Query: 631 IECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPG 810 + T + L PK ++++YRPI+CC+ ++K+I+K++A+RL++++ ++QS + Sbjct: 82 LNATILALIPKKDEATLMRDYRPISCCNVIYKVISKIIANRLKVMLPTFILQNQSAFVRE 141 Query: 811 RKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEW 990 R + N++L TELVK Y + I PRC +KID+ KA+DSV W +L LEAL FP F W Sbjct: 142 RLLIENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEALNFPENFCHW 201 Query: 991 HMECIYTVNHSITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNF 1170 CI T S+ +N E G P+L C LS + V +N Sbjct: 202 IKLCISTATFSVQVNGE-LAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNI 260 Query: 1171 KFYPKCARFEITHLCFADDLLLFARGDLASVQTLPVFPQVFKGFWSSS 1314 ++PKC + +THLCFADDL++F G SV+ + +FK F S Sbjct: 261 GYHPKCKKLSLTHLCFADDLMVFIDGQQRSVEGV---INIFKEFAGKS 305 Score = 119 bits (299), Expect = 3e-24 Identities = 67/211 (31%), Positives = 118/211 (55%), Gaps = 1/211 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*-HTSALLMIYY 1234 F + +GLRQG +SP+LF I M+ ++ + + + P L H + Sbjct: 223 FGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYHPKCKKLSLTHLCFADDLMV 282 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 F+ + + + F +F+ SGL ++ KS++Y GV++ + I F+ G+LP Sbjct: 283 FIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNILSAFPFASGQLPV 342 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LF 1594 +YLG+PL TK++T ++PL+DK+ ++ISS TA+ LSYAGR+ L+ V+ + +W + Sbjct: 343 RYLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAY 402 Query: 1595 
VIPSKELKFIDAYCRSYVWSGINTITKRALI 1687 +P+ +K I+ C +++WSG K+A I Sbjct: 403 RLPAGCIKEIEKLCSAFLWSGPELNPKKAKI 433 >gb|EMJ15226.1| hypothetical protein PRUPE_ppa016677mg [Prunus persica] Length = 1421 Score = 136 bits (342), Expect(2) = 3e-46 Identities = 72/188 (38%), Positives = 114/188 (60%) Frame = +1 Query: 472 EEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRPIECTFIT 651 EE+ +A+ G DK PG D +S F++ W ++KGDLM V++DFF G + TFI Sbjct: 707 EEVQKAVFDCGKDKSPGPDGFSMSFFQSCWEVVKGDLMKVMQDFFQSGIVNGVTNETFIC 766 Query: 652 LFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPGRKIANNI 831 L PK A + V +YRPI+ ++L+K+I+KVLASRL+ ++ S+SQ + R+I + + Sbjct: 767 LIPKKANSVKVTDYRPISLVTSLYKVISKVLASRLREVLGNTISQSQGAFVQKRQILDAV 826 Query: 832 ILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEWHMECIYT 1011 ++ E+V+ +K + KID +KAYD V W ++ ++ F VK+ W + C+ + Sbjct: 827 LVANEVVEEV-RKQKRKGLVFKIDFEKAYDHVEWNFVDDVMARKGFGVKWRGWIIGCLES 885 Query: 1012 VNHSITIN 1035 VN SI IN Sbjct: 886 VNFSIMIN 893 Score = 78.2 bits (191), Expect(2) = 3e-46 Identities = 56/210 (26%), Positives = 88/210 (41%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*HTSALLMIYYF 1237 F A++GLRQG P+SPFLF++ M+ S H + Sbjct: 900 FRASRGLRQGDPLSPFLFTLVMEVS------------------------HLQFADDTIFL 935 Query: 1238 LQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPFK 1417 L + L Q F SG++ N KS I + + G +G P Sbjct: 936 LDGKEEYWLNLLQLLKLFCDVSGMKINKAKSCILGINFSTEVLNNMAGSWGCEVGCWPMI 995 Query: 1418 YLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LFV 1597 YLG+PL + W P+++K+ R+ LS GR+ L+Q VL I +Y+ LF Sbjct: 996 YLGLPLGGNPRALNFWNPVMEKVEKRLQKWKRACLSKGGRLTLIQAVLSSIPSYYMSLFK 1055 Query: 1598 IPSKELKFIDAYCRSYVWSGINTITKRALI 1687 +P ++ R+++W G+ K L+ Sbjct: 1056 MPIGVAAKVEQLMRNFLWEGLEEGKKCHLV 1085 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 191 bits (485), Expect = 8e-46 Identities = 105/283 (37%), Positives = 161/283 (56%), Gaps = 1/283 (0%) Frame = +1 Query: 451 
LCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRP 630 L ++VS EEI L S+ DK PG D Y++ FYK +W+II + V+ FF KG L + Sbjct: 96 LTREVSSEEIKTVLFSMPKDKSPGPDGYTSEFYKATWDIIGQEFTLPVQSFFQKGFLPKG 155 Query: 631 IECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPG 810 I + L PK +++YRPI+CC+ L+K+I+K++A+RL++++ +E+QS + Sbjct: 156 INSIILALIPKKLAAKEMRDYRPISCCNVLYKVISKIIANRLKLLLPRFIAENQSAFVKD 215 Query: 811 RKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEW 990 R + N++L TELVK Y + I RC +KID+ KA+DSV W +L L A+ F F+ W Sbjct: 216 RLLIENLLLATELVKDYHKDSISARCAIKIDISKAFDSVQWSFLTNTLVAMNFSPTFIHW 275 Query: 991 HMECIYTVNHSITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNF 1170 CI T + S+ +N + G + P+L C LS+ L + + F Sbjct: 276 INLCITTASFSVQVNGDLV-GYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKF 334 Query: 1171 KFYPKCARFEITHLCFADDLLLFARGDLASVQ-TLPVFPQVFK 1296 F+PKC R +THL FADDL++ + G S++ L VF + K Sbjct: 335 GFHPKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCK 377 Score = 118 bits (296), Expect = 7e-24 Identities = 73/223 (32%), Positives = 121/223 (54%), Gaps = 10/223 (4%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*HTSA--LLMIY 1231 F + +GLRQG +SP+LF I MD L K+L+ V H L + + Sbjct: 297 FQSKRGLRQGCSLSPYLFVICMDV---------LSKMLDKAAGVRKFGFHPKCQRLGLTH 347 Query: 1232 YFLQEEI*LLCK--------LYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHL 1387 +++ +L + + F +F K SGL+ ++ KS++Y GV+ K I Sbjct: 348 LSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAKF 407 Query: 1388 GFSLGELPFKYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFG 1567 F +G+LP +YLG+PL TK+LT ++PL+++I RI++ T + S+AGR L++ VL+ Sbjct: 408 LFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWS 467 Query: 1568 IQAYWS*LFVIPSKELKFIDAYCRSYVWSGINTITKRALIA*D 1696 I +W F +P + ++ ID C S++WSG + +A I+ D Sbjct: 468 ICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWD 510 >gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score: 72.31) [Arabidopsis thaliana] Length = 928 Score = 191 bits (485), Expect = 8e-46 Identities 
= 101/276 (36%), Positives = 159/276 (57%), Gaps = 5/276 (1%) Frame = +1 Query: 451 LCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRP 630 L VS EEI++ + S+ +DK PG D Y+A FYK +WNII + + ++ FF KG L + Sbjct: 337 LTNHVSAEEIHKVVFSMPNDKSPGPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKG 396 Query: 631 IECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPG 810 I T + L PK +K+YRPI+CC+ L+K+I+K++A+RL++++ +QS + Sbjct: 397 INSTILALIPKKKEAKEMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKD 456 Query: 811 RKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEW 990 R + N++L TE+VK Y + + RC +KID+ KA+DSV W +L +LEA+ FP +F W Sbjct: 457 RLLIENVLLATEIVKDYHKDSVSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHW 516 Query: 991 HMECIYTVNHSITINEESTPPLI*CS*GT*TRETH-----FPFLVFHCHGYLSRCLVELR 1155 CI T + S+ +N E + RE P+L LS+ L + Sbjct: 517 ITLCITTASFSVQVNGELAGVF------SSARELRQGCSLSPYLFVISMDVLSKMLDKAV 570 Query: 1156 VTKNFKFYPKCARFEITHLCFADDLLLFARGDLASV 1263 + F ++PKC +THL FADDL++ + G + S+ Sbjct: 571 GARQFGYHPKCRAIGLTHLSFADDLMILSDGKVRSI 606 Score = 110 bits (274), Expect = 2e-21 Identities = 66/221 (29%), Positives = 118/221 (53%), Gaps = 10/221 (4%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*HTS--ALLMIY 1231 F +A+ LRQG +SP+LF I+MD L K+L+ H A+ + + Sbjct: 538 FSSARELRQGCSLSPYLFVISMDV---------LSKMLDKAVGARQFGYHPKCRAIGLTH 588 Query: 1232 YFLQEEI*LLCK--------LYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHL 1387 +++ +L + + +F+K SGL+ ++ KS++Y GV + I Q Sbjct: 589 LSFADDLMILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKF 648 Query: 1388 GFSLGELPFKYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFG 1567 F +G+LP +YLG+PL +K+LT PLI+++ +I + T++ LS+AGR+ L+ L+ Sbjct: 649 SFDVGKLPVRYLGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWS 708 Query: 1568 IQAYWS*LFVIPSKELKFIDAYCRSYVWSGINTITKRALIA 1690 I +W F +P ++ ID C +++WSG + +A ++ Sbjct: 709 ICNFWMAAFRLPRACIREIDKLCSAFLWSGTELSSNKAKVS 749 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 190 
bits (482), Expect = 2e-45 Identities = 106/294 (36%), Positives = 165/294 (56%), Gaps = 1/294 (0%) Frame = +1 Query: 430 SHAHGIELCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFY 609 S A L + V+ EEI + L + DK PG D Y++ F+K +W II + V+ FF Sbjct: 442 SDADQQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFT 501 Query: 610 KGKLYRPIECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSES 789 KG L + I T + L PK +K+YRPI+CC+ L+K+I+K++A+RL++++ + + Sbjct: 502 KGFLPKGINSTILALIPKKTEAREMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGN 561 Query: 790 QSWIIPGRKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCF 969 QS + R + N++L TELVK Y + I RC +KID+ KA+DSV W +L + L F Sbjct: 562 QSAFVKDRLLIENLLLATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGF 621 Query: 970 PVKFVEWHMECIYTVNHSITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVE 1149 P +F+ W CI T + S+ +N E S G P+L C LS+ L + Sbjct: 622 PREFIHWINICITTASFSVQVNGE-LAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDK 680 Query: 1150 LRVTKNFKFYPKCARFEITHLCFADDLLLFARGDLASVQ-TLPVFPQVFKGFWS 1308 ++F ++PKC +THL FADDL++ + G + S++ + VF + K WS Sbjct: 681 AAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAK--WS 732 Score = 117 bits (294), Expect = 1e-23 Identities = 67/212 (31%), Positives = 118/212 (55%), Gaps = 1/212 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*-HTSALLMIYY 1234 F +++GLRQG +SP+LF I MD +L + P + H S + Sbjct: 650 FQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMV 709 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 +I + ++ + F +F+K SGL+ ++ KS++Y G++ + + FS G+LP Sbjct: 710 LSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPV 769 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LF 1594 +YLG+PL TK+L+ PL++++ RI S T++ LSYAGR+ L+ VL+ I +W F Sbjct: 770 RYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAF 829 Query: 1595 VIPSKELKFIDAYCRSYVWSGINTITKRALIA 1690 +P K ++ ++ C +++WSG + +A I+ Sbjct: 830 RLPRKCIRELEKMCSAFLWSGTEMNSNKAKIS 861 >dbj|BAA97290.1| non-LTR retroelement reverse 
transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 189 bits (481), Expect = 2e-45 Identities = 104/313 (33%), Positives = 176/313 (56%) Frame = +1 Query: 325 RMIGMLLSVYGLEHKTSCISLTQRNINIMKNEPVLSHAHGIELCKKVSDEEIYEALQSIG 504 R++G + S + +E + + LT R S EL K +D+EI A +S+ Sbjct: 271 RLLGSIESPFSMEQEDMNLLLTYR----------CSQDQCSELEKSFTDDEIKAAFKSLP 320 Query: 505 DDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRPIECTFITLFPKTACPIIV 684 +K G D YS F++ +W+II +++A + +FF G+L + T + L PKT+ + Sbjct: 321 RNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTSNACTI 380 Query: 685 KEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPGRKIANNIILVTELVKAYS 864 E+RPI+C + L+K+I+K+L SRLQ +++ V SQS +PGR +A N++L TE+V Y+ Sbjct: 381 SEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYN 440 Query: 865 QKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEWHMECIYTVNHSITINEES 1044 + +I PR M+K+DL+KA+DSV W ++ L AL P +++ W +CI T + +I++N + Sbjct: 441 RLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVN-GA 499 Query: 1045 TPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNFKFYPKCARFEITHLCFAD 1224 T + G + P+L S+ L + ++PK I+HL FAD Sbjct: 500 TGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFAD 559 Query: 1225 DLLLFARGDLASV 1263 D+++F G +S+ Sbjct: 560 DVMIFFDGGSSSM 572 Score = 111 bits (277), Expect = 1e-21 Identities = 66/201 (32%), Positives = 108/201 (53%), Gaps = 1/201 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*-HTSALLMIYY 1234 F + KGLRQG P+SP+LF +AM+ +L + ++ P DL H + Sbjct: 504 FRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMI 563 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 F + + + F+ SGL+ N KS ++ G+ D ++ GF G P Sbjct: 564 FFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYGFPAGTFPI 622 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LF 1594 +YLG+PL +KL I + PL++K+ AR+ S +K LS+AGR QL+ V+FG+ +W F Sbjct: 623 RYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTF 682 Query: 1595 VIPSKELKFIDAYCRSYVWSG 1657 ++P +K I++ 
C ++W+G Sbjct: 683 LLPKGCIKKIESLCSKFLWAG 703 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 189 bits (481), Expect = 2e-45 Identities = 104/313 (33%), Positives = 176/313 (56%) Frame = +1 Query: 325 RMIGMLLSVYGLEHKTSCISLTQRNINIMKNEPVLSHAHGIELCKKVSDEEIYEALQSIG 504 R++G + S + +E + + LT R S EL K +D+EI A +S+ Sbjct: 271 RLLGSIESPFSMEQEDMNLLLTYR----------CSQDQCSELEKSFTDDEIKAAFKSLP 320 Query: 505 DDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRPIECTFITLFPKTACPIIV 684 +K G D YS F++ +W+II +++A + +FF G+L + T + L PKT+ + Sbjct: 321 RNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTSNACTI 380 Query: 685 KEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPGRKIANNIILVTELVKAYS 864 E+RPI+C + L+K+I+K+L SRLQ +++ V SQS +PGR +A N++L TE+V Y+ Sbjct: 381 SEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYN 440 Query: 865 QKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEWHMECIYTVNHSITINEES 1044 + +I PR M+K+DL+KA+DSV W ++ L AL P +++ W +CI T + +I++N + Sbjct: 441 RLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVN-GA 499 Query: 1045 TPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNFKFYPKCARFEITHLCFAD 1224 T + G + P+L S+ L + ++PK I+HL FAD Sbjct: 500 TGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFAD 559 Query: 1225 DLLLFARGDLASV 1263 D+++F G +S+ Sbjct: 560 DVMIFFDGGSSSM 572 Score = 111 bits (277), Expect = 1e-21 Identities = 66/201 (32%), Positives = 108/201 (53%), Gaps = 1/201 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*-HTSALLMIYY 1234 F + KGLRQG P+SP+LF +AM+ +L + ++ P DL H + Sbjct: 504 FRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMI 563 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 F + + + F+ SGL+ N KS ++ G+ D ++ GF G P Sbjct: 564 FFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYGFPAGTFPI 622 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LF 1594 +YLG+PL +KL I + PL++K+ AR+ S +K LS+AGR QL+ V+FG+ +W F Sbjct: 623 
RYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTF 682 Query: 1595 VIPSKELKFIDAYCRSYVWSG 1657 ++P +K I++ C ++W+G Sbjct: 683 LLPKGCIKKIESLCSKFLWAG 703 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 188 bits (478), Expect = 5e-45 Identities = 101/271 (37%), Positives = 156/271 (57%) Frame = +1 Query: 451 LCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRP 630 L + V+ EEI + + S+ DK PG D Y++ FYK SW II +++ ++ FF KG L + Sbjct: 170 LTRVVTGEEIKKVIFSMPKDKSPGPDGYTSEFYKASWEIIGDEVIIAIQSFFAKGFLPKG 229 Query: 631 IECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPG 810 + T + L PK +K+YRPI+CC+ L+K I+K+LA+RL+ I+ +QS + Sbjct: 230 VNSTILALIPKKKEAREIKDYRPISCCNVLYKAISKILANRLKRILPKFIVGNQSAFVKD 289 Query: 811 RKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEW 990 R + N++L TELVK Y + I RC +KID+ KA+DS+ W +L +L A+ FP +F+ W Sbjct: 290 RLLIENVLLATELVKDYHKDSISTRCAMKIDISKAFDSLQWSFLTHVLAAMNFPGEFIHW 349 Query: 991 HMECIYTVNHSITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNF 1170 C+ T + SI +N E + G + P+L LSR L + + F Sbjct: 350 ISLCMSTASFSIQVNGE-LAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREF 408 Query: 1171 KFYPKCARFEITHLCFADDLLLFARGDLASV 1263 ++P+C +THLCFADDL++ G + SV Sbjct: 409 GYHPRCKTLGLTHLCFADDLMILTDGKIRSV 439 Score = 108 bits (271), Expect = 5e-21 Identities = 67/224 (29%), Positives = 120/224 (53%), Gaps = 10/224 (4%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*HT--SALLMIY 1231 F +A+GLRQG +SP+LF I+MD L ++L+ + H L + + Sbjct: 371 FRSARGLRQGCSLSPYLFVISMDV---------LSRMLDKAAGAREFGYHPRCKTLGLTH 421 Query: 1232 YFLQEEI*LLCK--------LYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHL 1387 +++ +L + + +F+ GL+ + K+++Y GV+D ++ + Sbjct: 422 LCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSRY 481 Query: 1388 GFSLGELPFKYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFG 1567 F +G+LP +YLG+PL TK+LT ++PLID+I RI T++ LS+AGR+ L+ VL+ Sbjct: 482 
SFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWS 541 Query: 1568 IQAYWS*LFVIPSKELKFIDAYCRSYVWSGINTITKRALIA*DK 1699 I +W F +P + + I+ + +WSG K+A ++ D+ Sbjct: 542 ITNFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDE 585 >gb|ABA99600.2| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] Length = 1432 Score = 122 bits (306), Expect(2) = 9e-45 Identities = 71/197 (36%), Positives = 111/197 (56%), Gaps = 2/197 (1%) Frame = +1 Query: 466 SDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFY-KGKLYRPIECT 642 S+ E++EA++S+ ++K PG D Y+A FY+K W+IIKGD+M F + + Sbjct: 710 SENEVWEAIKSLPNEKSPGPDGYTALFYQKCWDIIKGDIMKAFEKFCRGNSQNLEMLNTA 769 Query: 643 FITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPGRKIA 822 ITL PK P ++K+YRPI+ + K+ AKV+A RL + + +Q+ I GR I Sbjct: 770 VITLIPKKDSPTLLKDYRPISLIHSFAKLAAKVMAQRLAPRMNELVPYTQNAFIRGRSIH 829 Query: 823 NNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEWHMEC 1002 N I V LV+ Y ++H ++K+D+ KA+D+V W +L +L+ F K+ W + Sbjct: 830 ENFIFVKGLVQQYHKQH-KEMILLKLDISKAFDTVSWCFLLDMLKWRGFGAKWRLWLVSL 888 Query: 1003 IYTVNHSITIN-EESTP 1050 T +I IN ES P Sbjct: 889 FLTAETNILINGNESNP 905 Score = 87.0 bits (214), Expect(2) = 9e-45 Identities = 59/203 (29%), Positives = 101/203 (49%), Gaps = 3/203 (1%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNS-IPN--VLDLK*HTSALLMI 1228 F A+GLRQG P+SP LF +AMD V+ +L+ P V + + ++ Sbjct: 906 FKPARGLRQGDPLSPLLFVLAMDALQAVVAQAKASGLLSEPAPRRPVPSISIYADDAVLF 965 Query: 1229 YYFLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGEL 1408 + Q+E ++ + Q F ASGL N K++I + + L ++ Sbjct: 966 FKPSQQEAKVVKAILQIF---GAASGLMTNYNKTAITPIQCSQEQLQVVADELQCNIQLF 1022 Query: 1409 PFKYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS* 1588 P YLG+PLST+K T + P++DK+ +++ K LS GR+ L++ VL + ++ Sbjct: 1023 PIIYLGLPLSTRKPTKAEVQPILDKLANKVAGWKPKLLSPDGRLCLIKSVLMALPVHFMS 1082 Query: 1589 LFVIPSKELKFIDAYCRSYVWSG 1657 + +P +K I+ CR ++W G Sbjct: 1083 VLQLPKWAIKDIERKCRGFLWKG 1105 >emb|CAN62743.1| hypothetical protein 
VITISV_033107 [Vitis vinifera] Length = 1168 Score = 127 bits (320), Expect(2) = 2e-44 Identities = 65/201 (32%), Positives = 116/201 (57%), Gaps = 2/201 (0%) Frame = +1 Query: 439 HGIELCKKVS--DEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYK 612 HG C +V + EI+ AL + DK PG D +S F++ +W K ++M + ++F Sbjct: 381 HGAAECIEVPFVENEIHSALMEMNGDKAPGPDGFSVAFWQNAWAFAKEEIMEMFKEFHEH 440 Query: 613 GKLYRPIECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQ 792 R + TF+ L PK + + ++RPI+ L+K++AKVLA+RL+ +I V S +Q Sbjct: 441 STFVRSLNNTFLVLIPKKSGVEDLGDFRPISLLGGLYKLLAKVLANRLKRVIGKVVSSAQ 500 Query: 793 SWIIPGRKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFP 972 + + GR+I + ++ E++ ++ QK + K+D++KAYDS+ W +L ++L+ + F Sbjct: 501 NAFVMGRQILDASLIANEVIDSW-QKRKEKGLICKLDIEKAYDSINWNFLMKVLQKMGFG 559 Query: 973 VKFVEWHMECIYTVNHSITIN 1035 K+V W C+ + SI +N Sbjct: 560 NKWVGWMWSCVSSAKFSILVN 580 Score = 80.5 bits (197), Expect(2) = 2e-44 Identities = 59/221 (26%), Positives = 106/221 (47%), Gaps = 8/221 (3%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*HTSALLMIYYF 1237 F +++GLRQG P+SP+LF + M+ +DVL+ + S N+ D +++L + + F Sbjct: 587 FPSSRGLRQGDPLSPYLFVMGMEI-LDVLIRRAVEGGYLSGCNIRDGS--STSLHISHLF 643 Query: 1238 LQEEI*LLCK--------LYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGF 1393 ++ + C+ L F ASGL+ N+ KS I G + + + LG Sbjct: 644 FADDTIVFCEANKDQVSHLSWILFWFEAASGLRMNLAKSEIIPVGEVEEIQE-LAAELGC 702 Query: 1394 SLGELPFKYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQ 1573 +G LP YLG+PL W + +++ R++ + +S GR+ L++ L + Sbjct: 703 RVGSLPSHYLGLPLGVPNRASSMWDGVEERVRRRLALWKRQYISKGGRITLIKSALASMP 762 Query: 1574 AYWS*LFVIPSKELKFIDAYCRSYVWSGINTITKRALIA*D 1696 Y +F +P + ++ R ++W G N K L+ D Sbjct: 763 IYQMSIFRMPKSVARRVEKIQRDFLWGGGNLGGKIHLVKWD 803 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 185 bits (469), Expect = 6e-44 Identities = 101/300 (33%), Positives = 164/300 (54%) Frame = +1 
Query: 451 LCKKVSDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRP 630 L ++++ E+ + SI +K PG D Y+ F++++W++I ++ ++ FF G L + Sbjct: 715 LVAEITEAEVMKVFFSIPLNKSPGPDGYTVEFFRETWSVIGQEVTMAIKSFFTYGFLPKG 774 Query: 631 IECTFITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPG 810 + T + L PK +K+YRPI+CC+ L+K I+K+LA+RL+ ++ + +QS I Sbjct: 775 LNSTILALIPKRTYAKEMKDYRPISCCNVLYKAISKLLANRLKCLLPEFIAPNQSAFISD 834 Query: 811 RKIANNIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEW 990 R + N++L +ELVK Y + + PRC +KIDL KA+DSV W +L L AL P KF+ W Sbjct: 835 RLLMENLLLASELVKDYHKDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHW 894 Query: 991 HMECIYTVNHSITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNF 1170 CI T + S+ +N G + P+L C LS L + V K F Sbjct: 895 INLCISTASFSVQVN------------GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRF 942 Query: 1171 KFYPKCARFEITHLCFADDLLLFARGDLASVQTLPVFPQVFKGFWSSSKYWEKQYLLWRS 1350 ++P+C +THLCFADD+++F+ G S++ + + F F + EK L S Sbjct: 943 GYHPRCRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKSTLFMAS 1002 Score = 104 bits (260), Expect = 1e-19 Identities = 66/207 (31%), Positives = 107/207 (51%), Gaps = 1/207 (0%) Frame = +2 Query: 1073 GLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVLDLK*-HTSALLMIYYFLQEE 1249 GLRQG +SP+LF I M+ +L + K P ++ H I F Sbjct: 910 GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGS 969 Query: 1250 I*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPFKYLGI 1429 L + F F+ SGL ++ KS+++ ++ T A+I F G LP +YLG+ Sbjct: 970 AHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETCASILARFPFDSGSLPVRYLGL 1029 Query: 1430 PLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LFVIPSK 1609 PL TK++T+ PL++KI +RISS + LSYAGR+QL+ V+ + +W F +P Sbjct: 1030 PLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRA 1089 Query: 1610 ELKFIDAYCRSYVWSGINTITKRALIA 1690 ++ I+ +++WSG + +A +A Sbjct: 1090 CIREIEQISAAFLWSGTDLNPHKAKVA 1116 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 157 bits (398), Expect = 1e-35 Identities = 88/266 (33%), Positives = 142/266 (53%) Frame = 
+1 Query: 466 SDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRPIECTF 645 S+EEI + L S+ +K PG D + F+ ++W I+K ++A +R+FF G L R T Sbjct: 491 SEEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATA 550 Query: 646 ITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPGRKIAN 825 ITL PK + ++RP+ACC+ ++K+I ++++ RL++ I +Q I GR + Sbjct: 551 ITLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCE 610 Query: 826 NIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEWHMECI 1005 N++L +ELV + R +++D+ KAYD+V W +L IL+AL P+ F+ W CI Sbjct: 611 NVLLASELVDNFEADGETTRGCLQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVCI 670 Query: 1006 YTVNHSITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNFKFYPK 1185 + ++SI N E G + L LS+ L + F +P Sbjct: 671 SSASYSIAFNGELI-GFFQGKKGIRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPN 729 Query: 1186 CARFEITHLCFADDLLLFARGDLASV 1263 C ITHL FADD+L+F+ G +S+ Sbjct: 730 CLAPIITHLSFADDVLVFSDGAASSI 755 Score = 84.7 bits (208), Expect = 1e-13 Identities = 55/168 (32%), Positives = 85/168 (50%), Gaps = 1/168 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVL-DLK*HTSALLMIYY 1234 F KG+RQG P+S LF + MD L L + N PN L + H S + Sbjct: 687 FQGKKGIRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLV 746 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 F + + F + SGL N K+ + G A ++ +LG + G LP Sbjct: 747 FSDGAASSIAGILTILDDFRQGSGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPV 806 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLV 1558 +YLG+PL ++K+ + PL+D+I +R +S TA+ LS+AGR+QL+ + Sbjct: 807 RYLGVPLMSQKMRRQDYQPLVDRINSRFTSWTARHLSFAGRLQLLNWI 854 >dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 910 Score = 157 bits (398), Expect = 1e-35 Identities = 88/266 (33%), Positives = 142/266 (53%) Frame = +1 Query: 466 SDEEIYEALQSIGDDKVPGVDEYSAHFYKKSWNIIKGDLMAVVRDFFYKGKLYRPIECTF 645 S+EEI + L S+ +K PG D + F+ ++W I+K ++A +R+FF G L R T Sbjct: 448 SEEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATA 507 Query: 646 
ITLFPKTACPIIVKEYRPIACCSALFKIIAKVLASRLQIIIALVTSESQSWIIPGRKIAN 825 ITL PK + ++RP+ACC+ ++K+I ++++ RL++ I +Q I GR + Sbjct: 508 ITLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCE 567 Query: 826 NIILVTELVKAYSQKHIFPRCMVKIDLQKAYDSVGWVYLKQILEALCFPVKFVEWHMECI 1005 N++L +ELV + R +++D+ KAYD+V W +L IL+AL P+ F+ W CI Sbjct: 568 NVLLASELVDNFEADGETTRGCLQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVCI 627 Query: 1006 YTVNHSITINEESTPPLI*CS*GT*TRETHFPFLVFHCHGYLSRCLVELRVTKNFKFYPK 1185 + ++SI N E G + L LS+ L + F +P Sbjct: 628 SSASYSIAFNGELI-GFFQGKKGIRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPN 686 Query: 1186 CARFEITHLCFADDLLLFARGDLASV 1263 C ITHL FADD+L+F+ G +S+ Sbjct: 687 CLAPIITHLSFADDVLVFSDGAASSI 712 Score = 110 bits (276), Expect = 1e-21 Identities = 66/212 (31%), Positives = 112/212 (52%), Gaps = 1/212 (0%) Frame = +2 Query: 1058 FDAAKGLRQGKPISPFLFSIAMDTSVDVLLN*GLPKILNSIPNVL-DLK*HTSALLMIYY 1234 F KG+RQG P+S LF + MD L L + N PN L + H S + Sbjct: 644 FQGKKGIRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLV 703 Query: 1235 FLQEEI*LLCKLYQFFLKFSKASGLQANIGKSSIYFGGVADATKATIFQHLGFSLGELPF 1414 F + + F + SGL N K+ + G A ++ +LG + G LP Sbjct: 704 FSDGAASSIAGILTILDDFRQGSGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPV 763 Query: 1415 KYLGIPLSTKKLTILQWTPLIDKIVARISSLTAKKLSYAGRVQLVQLVLFGIQAYWS*LF 1594 +YLG+PL ++K+ + PL+D+I +R +S TA+ LS+AGR+QL++ V++ +W+ +F Sbjct: 764 RYLGVPLMSQKMRRQDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINFWASVF 823 Query: 1595 VIPSKELKFIDAYCRSYVWSGINTITKRALIA 1690 + P++ L+ ++ C +++WSG + A I+ Sbjct: 824 IFPNQCLQKLEQMCNAFLWSGAPNSARGAKIS 855