BLASTX nr result
ID: Cocculus22_contig00019876
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00019876
         (2150 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   183   7e-50
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   182   9e-50
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   178   5e-49
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   184   6e-47
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   162   1e-44
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               176   1e-44
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   170   2e-43
emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678...   182   4e-43
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               169   5e-43
gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub...   175   7e-41
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]           171   1e-40
gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thali...   160   9e-40
gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CA...   145   1e-39
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       145   1e-37
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   138   5e-36
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           138   5e-36
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   131   7e-36
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                137   6e-35
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...
124 2e-30 dbj|BAB08692.1| non-LTR retroelement reverse transcriptase-like ... 139 5e-30 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 183 bits (465), Expect(2) = 7e-50 Identities = 110/423 (26%), Positives = 194/423 (45%), Gaps = 14/423 (3%) Frame = +2 Query: 374 SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553 S ++D++ + + S A R+ L+ V+ ++ +W F +P + + + + Sbjct: 508 SPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNFWMNAFRLPRECINEINRISSAL 567 Query: 554 L----NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721 L N + +SW+ +CKP++EGGLGL+ L+E + + LKL W + S +D LW+KW Sbjct: 568 LWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKWT 627 Query: 722 HSQYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898 ++ +++W H + SWI +RL+K R ++ + + T W D WS G Sbjct: 628 RMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKGP 687 Query: 899 IARDCGKNARRGSDLRRDATIDDLSKCSSLSPIVVELKDKLNEV-----QRISGDHADRL 1063 + G + R T+ + VE+ ++ E+ Q + + D + Sbjct: 688 LINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVEILNEFEEILLQKYQHRNIELEDAI 747 Query: 1064 IWRLEP---SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIAD 1234 +WR + FS K TW I+ + W +W+ H P+ S W RL D Sbjct: 748 LWRGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGD 807 Query: 1235 RLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSK 1414 R+ N G C C + +ET +HLFF + S+ IA ++ D R W + + Sbjct: 808 RMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAKNVYKD-RFSTKWSAVVNY 866 Query: 1415 MSSYDFAGTKLNTSV-KLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISN 1591 +S D ++ + + + +F +IH IW ERN RR K R ++ I +RN +S Sbjct: 867 IS--DSQPDRIQSFLSRYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQIDKTIRNQLST 924 Query: 1592 VSQ 1600 + + Sbjct: 925 IKK 927 Score = 43.5 bits (101), Expect(2) = 7e-50 Identities = 31/108 (28%), Positives = 58/108 (53%), Gaps = 3/108 (2%) Frame = +3 Query: 66 FGHNERTCLKSAGILTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRI 245 FG++ R K+ G LT++ FADD+++ ++ ++ K+L K GL + EK+ + Sbjct: 408 
FGYHPRC--KTLG-LTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTL 464 Query: 246 VASWVRGH---LFW**VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 + V H L G+ K P++YLGLP+V+++ ++ P+ Sbjct: 465 YLAGVSDHSRQLMSSRYSFGVGK--LPVRYLGLPLVTKRLTTSDYSPL 510 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 182 bits (463), Expect(2) = 9e-50 Identities = 119/423 (28%), Positives = 191/423 (45%), Gaps = 18/423 (4%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556 +L++V + + S A R+ L+ V+ ++ +W F +P +++ E + + FL Sbjct: 789 LLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLW 848 Query: 557 ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 NS +SW VCKP++EGGLGLR LKE + LKL W + S + LW+KW+ Sbjct: 849 SGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQ 908 Query: 728 QYIRSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904 +R+ ++W V Q SWI K+L+K R + +G T W D WS LG + Sbjct: 909 HLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLL 968 Query: 905 RDCGKNARRGSDLRRDATIDDL----SKCSSLSPIVVELKDKLNEVQRISGDHADRLIWR 1072 G + R T+++ + + + ++D L + + D+++WR Sbjct: 969 ERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRNDVYNVIEDALKKSWDTRTETEDKVLWR 1028 Query: 1073 LEPS---GEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLP 1243 + FS + TW + W +IW+ H P++S W A+ RL DR+ Sbjct: 1029 GKSDVFRTTFSTRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMI 1088 Query: 1244 KLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSS 1423 G+ +C C LET +HLFF F S+++VD R +F S S Sbjct: 1089 NWANGIATDCIFCQGTLETRDHLFFTCSF--------TSVIWVDLARGIFKTQYTSHWQS 1140 Query: 1424 YDFAGTKLNTS------VKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNII 1585 A T + F ATI+ +W ERN RR Q+V I +RN + Sbjct: 1141 IIEAITNSQHHRVEWFLRRYVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQL 1200 Query: 1586 SNV 1594 S++ Sbjct: 1201 SSI 1203 Score = 43.9 bits (102), Expect(2) = 9e-50 Identities = 28/109 (25%), Positives = 59/109 (54%), Gaps = 4/109 (3%) Frame = +3 Query: 
66 FGHNERTCLKSAGILTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRI 245 FG++ + K+ G LT++ FADD++V ++ +E K+ + + +GL ++ EKS + Sbjct: 687 FGYHPKC--KTMG-LTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTV 743 Query: 246 ----VASWVRGHLFW**VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 +++ R + P++YLGLP+++++ +CLP+ Sbjct: 744 YLAGLSATARNEVA---DRFPFSSGQLPVRYLGLPLITKRLSTTDCLPL 789 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 178 bits (452), Expect(2) = 5e-49 Identities = 110/421 (26%), Positives = 192/421 (45%), Gaps = 16/421 (3%) Frame = +2 Query: 374 SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553 S + +++ + + + S A R+ L+ V+ + +W F +PS+ LK+ S+ + F Sbjct: 192 SPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNFWMSAFRLPSACLKEINSICSAF 251 Query: 554 L----NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721 L + R +SW+++CKP++EGGLGLR L E ++ + LKL W V S D LW+KW Sbjct: 252 LWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWS 311 Query: 722 HSQYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898 ++ +++W + S SW+ K+++K R + + T W D WS +G Sbjct: 312 KMNLLKQESFWSLTPNSSLGSWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSGMGH 371 Query: 899 IARDCGKNARRGSDLRRDATIDDL--------SKCSSLSPIVVELKDKLNEVQRISGDHA 1054 + G+ + + R+ T+ + + L+ I L K + Sbjct: 372 LMDVTGQRGQIDLGISRNKTVAEAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLL---RE 428 Query: 1055 DRLIWRLEP---SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLK 1225 D +WR + FS K TW +++K + W +W+ H P++ W RL Sbjct: 429 DATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLS 488 Query: 1226 IADRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETL 1405 R+ N G D C C ++ET +HLFF + S IA + + R W+T+ Sbjct: 489 TGYRMQLWNNGSDVKCTFCSTSIETRDHLFFSCSYASAIWTAIAKNV-LQHRFSTDWQTI 547 Query: 1406 GSKMSSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNII 1585 + +S + S + F T+H +W ERN RR + R ++ + +RN + Sbjct: 548 VNYISETQTDRIRSFLS-RYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQIRNQL 606 Query: 1586 S 1588 S Sbjct: 607 S 607 Score = 45.4 bits (106), 
Expect(2) = 5e-49 Identities = 25/95 (26%), Positives = 55/95 (57%), Gaps = 3/95 (3%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287 LT++ FADD+++ +V ++ +++ +++GL +N EK+ + + V H + + Sbjct: 103 LTHLCFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMI 162 Query: 288 ---PLGIQKPSFPLKYLGLPIVSRKRFVNECLPIF 383 P G+ + P++YLGLP+V+++ + P+F Sbjct: 163 SRYPFGLGQ--LPVRYLGLPLVTKRLTKEDLSPLF 195 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 184 bits (466), Expect(2) = 6e-47 Identities = 106/392 (27%), Positives = 192/392 (48%), Gaps = 13/392 (3%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556 +++K+ + S A R+ L+K V+ ++ +W F +P + L++ E + + FL Sbjct: 936 LVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLW 995 Query: 557 ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 + N++ ++W VCK +EEGGLGL+ LKE + + LKL W + S +D LW+KW++ Sbjct: 996 SGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVNK 1055 Query: 728 QYIRSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904 IR +T+W V + SW+ ++++K R + +T W D W PLG + Sbjct: 1056 HLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPLGRLH 1115 Query: 905 RDCGKNARRGSDLRRDATIDDL----SKCSSLSPIVVELKDKLNEVQRISGDHADRLIWR 1072 + G + +AT+ ++ + + + ++K ++ ++ DR +W+ Sbjct: 1116 QHMGSRGTIDLGIPNNATVAEVMNTHRRKRHRADFLNQIKSQIELARQDRSTDGDRSLWK 1175 Query: 1073 LEP---SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLP 1243 + FS TW+ I+ W +W+ P++S W + RL +D++ Sbjct: 1176 QKEDTFKSSFSSSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKIC 1235 Query: 1244 KLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSS 1423 K N G +C C LET +HLFF + S + L ++GR L W + + Sbjct: 1236 KWNSGARYDCVFCGEELETRDHLFFSCPYSSHVWFSLTKGL-LNGRNILNWNLITPHL-- 1292 Query: 1424 YDFAGTKLNT-SVKLSFAATIHQIWWERNCRR 1516 D + L+ +++ +F A+IH +W ERNCRR Sbjct: 1293 LDSSRPYLHVFTLRYAFQASIHSLWRERNCRR 1324 Score = 33.1 bits (74), 
Expect(2) = 6e-47 Identities = 22/92 (23%), Positives = 47/92 (51%), Gaps = 1/92 (1%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287 LT++ FADD++VF ++ + + L ++ EKS I + + + + Sbjct: 845 LTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSIL 904 Query: 288 P-LGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 + + P+KYLGLP+++++ ++ LP+ Sbjct: 905 QQFPFELGTLPVKYLGLPLLTKRMTQSDYLPL 936 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 162 bits (411), Expect(2) = 1e-44 Identities = 108/397 (27%), Positives = 175/397 (44%), Gaps = 17/397 (4%) Frame = +2 Query: 374 SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553 S ++DK+ + S A R+ L+ V+ + +W F +P LK E + N+F Sbjct: 779 SQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRF 838 Query: 554 LNFNSRI----ISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721 L N I +SW+N C P+ EGGLGLR + L+L W + +++D LW+ W Sbjct: 839 LWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWN 898 Query: 722 HSQYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVI 901 H+ +R +W ++A SWI K ++ +R + + +G W D WS LG + Sbjct: 899 HANRLRHVNFWNAEAASHHSWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPL 958 Query: 902 ARDCGKNARRGSDLRRDATIDDLSKCS--------SLSPIVVELKDKLNEVQRISGDHA- 1054 G + + + + A + + S + + + + L+ L SGD Sbjct: 959 IEAIGASGPQLTGIHESAVVTEASSSTGWILPSARTRNASLANLRSTLLNSPAPSGDRGE 1018 Query: 1055 DRLIWRLEPSG--EFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKI 1228 D W +E S FS K TWE ++++ T W +WY IP+++ W RL + Sbjct: 1019 DTYTWYIEGSSSTSFSSKLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPV 1078 Query: 1229 ADRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLF--WET 1402 R + + CC+C ET +HLF S + + + GR +F W+ Sbjct: 1079 RARTTHWSTNRPSLCCVCQRETETRDHLFIHCTLGSLIWQQVLARF---GRSQMFREWKD 1135 Query: 1403 LGSKMSSYDFAGTKLNTSVKLSFAATIHQIWWERNCR 1513 + M S G+ T KL+ I IW ERN R Sbjct: 1136 IIEWMLSNQ--GSFSGTLKKLAVQTAIFHIWKERNSR 1170 Score = 47.0 bits (110), Expect(2) = 1e-44 
Identities = 25/82 (30%), Positives = 43/82 (52%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287 ++ + FADD+++F + S L +L + +GL MN EKS + + + + Sbjct: 691 ISSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL 750 Query: 288 PLGIQKPSFPLKYLGLPIVSRK 353 G +FP +YLGLP++ RK Sbjct: 751 AFGFVNGTFPFRYLGLPLLHRK 772 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 176 bits (446), Expect(2) = 1e-44 Identities = 121/405 (29%), Positives = 188/405 (46%), Gaps = 14/405 (3%) Frame = +2 Query: 428 FSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLNFNSRIIS----MSWEN 595 FS A R L+K V+ ++ +W F +P +++ + L + FL S + S +SW+ Sbjct: 452 FSFAGRFNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDI 511 Query: 596 VCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHSQYIRSDTYW-VSQAHM 772 VCKP+ EGGLGLR LKE + + LKL W + S + LW KW+ IR + W + Q+ Sbjct: 512 VCKPKAEGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTS 571 Query: 773 SSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIARDCGKNARRGSDLRRD 952 SWI ++++KIR ++ +G W D WS G + G + R+ Sbjct: 572 MGSWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPRE 631 Query: 953 ATIDDLSKCSSLSPIVVELKDKLNEV---QRI-SGDHADRLIWRLEP---SGEFSMKSTW 1111 A++ D S L +++ E+ QRI D D ++WR + FS + TW Sbjct: 632 ASVADAWTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTW 691 Query: 1112 EFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLPKLNI--GVDANCCLCW 1285 I+ T W +W+ H P+++L W + RL DR+ K N V NC LC Sbjct: 692 HLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCT 751 Query: 1286 NALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSSYDFAGTKLNTSVKL 1465 N +T HLFF + S +A ++ R W L + +S++ F + Sbjct: 752 NNSKTLEHLFFSCSYASTVWAALAKGIW-KTRYSTRWSHLLTHISTH-FQDRVEGFLTRY 809 Query: 1466 SFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNVSQ 1600 F ATI+ +W ERN RR ++ I RN I+ + Q Sbjct: 810 IFQATIYHVWRERNGRRHDAAPNTPATVIGWIDKQTRNQITIIRQ 854 Score = 33.1 bits (74), Expect(2) = 1e-44 Identities = 23/91 (25%), Positives = 47/91 (51%), Gaps = 9/91 (9%) Frame = +3 
Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287 LT++ FADD++V + +E ++ + +++GL ++ EKS + + V Sbjct: 345 LTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVS-------- 396 Query: 288 PLGIQK---------PSFPLKYLGLPIVSRK 353 P+ Q+ P++YLGLP+V+++ Sbjct: 397 PIIKQEIAAKFLFDVGQLPVRYLGLPLVTKR 427 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 170 bits (430), Expect(2) = 2e-43 Identities = 111/420 (26%), Positives = 194/420 (46%), Gaps = 13/420 (3%) Frame = +2 Query: 374 SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553 S +++ V + A S A R+ LL V+ ++ +W + +P+ +++ E L + F Sbjct: 1084 SPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAF 1143 Query: 554 L----NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721 L N + ++W ++C+P++EGGLG++ L E + + LKL W + S + LW+ WI Sbjct: 1144 LWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWI 1203 Query: 722 HSQYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898 + IR T+W + S SW+ K+L+K R ++ + T W D WS LG Sbjct: 1204 WTFIIRKGTFWSANERSSLGSWMWKKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGR 1263 Query: 899 IARDCGKNARRGSDLRRDATIDDLSKCSSLSPIVVELKDKLN-EVQRISGDHA----DRL 1063 + G + + ++ + + + +++N E+QR+ D Sbjct: 1264 LLDITGTRRVIDLGIPLETNLETVLRTHQHRQHRAAIYNRINAEIQRLQQQEREAGPDIS 1323 Query: 1064 IWRL---EPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIAD 1234 +WR + + F K TW ++ + W +W+P+ P++S LW RL D Sbjct: 1324 LWRSLKNDFNKRFITKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGD 1383 Query: 1235 RLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSK 1414 R+ N G C LC NA ET +HLFF + S Y+ + + W L + Sbjct: 1384 RIKAWNSGQLVTCTLCNNAEETRDHLFFSCQYTS-YVWEALTQRLLSTNYSRDWNRLFTL 1442 Query: 1415 MSSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNV 1594 + + + L + F A+I+ IW ERN RR P ++++ I VRN IS++ Sbjct: 1443 LCTSNLPRDHL-FLFRYVFQASIYHIWRERNARRHGEISSPTNRLIKLIDKTVRNRISSI 1501 Score = 35.4 bits (80), Expect(2) = 2e-43 Identities = 23/92 (25%), 
Positives = 46/92 (50%), Gaps = 1/92 (1%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWV-RGHLFW** 284 LT++ FADD++VF+ +E + ++ ++GL ++ EKS I + V Sbjct: 995 LTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTL 1054 Query: 285 VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 P++YLGLP+++++ + P+ Sbjct: 1055 SSFPFANGQLPVRYLGLPLLTKQMTTADYSPL 1086 >emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1| putative protein [Arabidopsis thaliana] Length = 473 Score = 182 bits (463), Expect = 4e-43 Identities = 108/406 (26%), Positives = 193/406 (47%), Gaps = 12/406 (2%) Frame = +2 Query: 419 ASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL----NFNSRIISMS 586 A S A R+ L+ V+ ++ +W F +P +++ + + + +L N+ ++ Sbjct: 51 ARFLSYAGRLNLISSVLWSICNFWMGAFRLPRDCIREIDKMCSAYLWSGGELNTSKAKIT 110 Query: 587 WENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHSQYIRSDTYWVSQA 766 W VCKP+EEGGLGLR LKE + LKL W + S D LW+KWI S ++ ++W + Sbjct: 111 WAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQSSLLKKVSFWAVRE 170 Query: 767 HMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIARDCGKNARRGSDL 943 + S SW+ ++++K R I T W D WS LG + G + Sbjct: 171 NTSLGSWMWRKILKFRDIARTLCKVEINNGARTSFWYDDWSDLGRLIDSAGDRGAIDLGI 230 Query: 944 RRDATIDDL----SKCSSLSPIVVELKDKLNEVQRISGDHADRLIWRLEPS---GEFSMK 1102 + AT+ + + + + ++++L DR +W+ + + FS K Sbjct: 231 NKHATVVEAWGNRRRRRHRTNFLNRVEERLILSWNSRNQAEDRALWKGKENRFRSIFSTK 290 Query: 1103 STWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLPKLNIGVDANCCLC 1282 TW I+ + W +W+ IP+H+ +W + RL DR+ N+GVDA C LC Sbjct: 291 DTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDATCILC 350 Query: 1283 WNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSSYDFAGTKLNTSVK 1462 ALE+ +HLFF F ++ +P+A ++ + + W+T+ + +S ++ + Sbjct: 351 NKALESRDHLFFSCPFATEIWEPLAKTIY-NTCFYTDWQTIINNVSR-NWPDRIAGFLAR 408 Query: 1463 LSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNVSQ 1600 TI+ +W ERN R+ +++ I ++RN + + Q Sbjct: 409 CILQVTIYTLWRERNERKHGASPNSSSRLISWIDKHIRNHLMAIKQ 454 >emb|CAB72467.1| putative protein 
[Arabidopsis thaliana] Length = 762 Score = 169 bits (429), Expect(2) = 5e-43 Identities = 92/328 (28%), Positives = 158/328 (48%), Gaps = 14/328 (4%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556 +++++ + + S A R L+ ++ + +W F +P + +++ E L + FL Sbjct: 342 LIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNFWLSAFQLPRACIQEIEKLCSSFLW 401 Query: 557 ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 N NS+ +SW VCKP+ EGGLGLR LKE + LKL W + S D LW+KW+ Sbjct: 402 SGTNLNSKKAKISWNQVCKPKSEGGLGLRSLKEANDVCCLKLVWRIISHGDSLWVKWVEH 461 Query: 728 QYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904 ++ + +W+ + + + SWI K+++K R + +G T W D WS LG + Sbjct: 462 NLLKREIFWIVKENANLGSWIWKKILKYRGVAKRFCKAEVGNGESTSFWFDDWSLLGRLI 521 Query: 905 RDCGKNARRGSDLRRDATIDDLSKCSSLSPIVVELKDKLNEV------QRISGDHADRLI 1066 G + R ++ D E+ + + EV +R R++ Sbjct: 522 DVAGIRGTIDMGISRTMSVADAWTSRRRRHHRQEILNTIEEVLSTQHQKRTQQQQQGRVL 581 Query: 1067 WRLEP---SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADR 1237 W+ + +FS K+TW +++ + W +W+PH P++S LW A+ RL R Sbjct: 582 WKGKNDIYKDKFSTKNTWNYLRTTSNEVAWHKGVWFPHATPKYSFCLWLAAHDRLATGAR 641 Query: 1238 LPKLNIGVDANCCLCWNALETNNHLFFL 1321 + K N G +C C +ET +HLFF+ Sbjct: 642 MIKWNRGETGDCTFCRQGIETRDHLFFM 669 Score = 34.3 bits (77), Expect(2) = 5e-43 Identities = 23/86 (26%), Positives = 45/86 (52%), Gaps = 4/86 (4%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRI----VASWVRGHLF 275 LT++ FADD+++ + +E ++ + +GL ++ EKS I ++S R L Sbjct: 251 LTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSSTSRAQLH 310 Query: 276 W**VPLGIQKPSFPLKYLGLPIVSRK 353 + P++YLGLP+V+++ Sbjct: 311 ---THFPFEVGELPIRYLGLPLVTKR 333 >gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. 
lyrata] Length = 441 Score = 175 bits (444), Expect = 7e-41 Identities = 116/422 (27%), Positives = 193/422 (45%), Gaps = 17/422 (4%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556 +++++ + A S A R+ L+ V+ +L +W F +P++ +K+ + L + FL Sbjct: 9 LIERIRERISCWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPNACIKEIDGLCSAFLW 68 Query: 557 ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 N + +SW +VC P+EEGGLGLR L E + LKL W + S L W++W+ Sbjct: 69 SGPELNRKKAKVSWNDVCMPKEEGGLGLRSLTEANKVCCLKLIWRLLSSSSL-WVQWLRQ 127 Query: 728 QYIRSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904 IR ++W + SW+ ++L+K R Y I W D WSPLG + Sbjct: 128 YVIRKGSFWSLRDTSTLGSWMWRKLLKYRHLASGFTQYEIRNGKGVSFWHDNWSPLGPLI 187 Query: 905 RDCGKNARRGSDLRRDATIDDL---SKCSSLSPIVVELKDKLNEVQRISG--DHADRLIW 1069 G + AT+ + + + + +++ +L E+ R G + D ++W Sbjct: 188 AISGTRGCIDMGIDIHATVAEALTHRRRRHRADHLNQMEAQLEEL-RTKGLVETEDVVLW 246 Query: 1070 -----RLEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIAD 1234 R +PS FS K TW + +K W IW+ H P++S W RL D Sbjct: 247 KGKGGRFKPS--FSTKETWADTREQKPRNEWYQGIWFSHATPKYSFITWLATKNRLSTGD 304 Query: 1235 RLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLF--WETLG 1408 R+ N GV+ +C C ET NHLFF + + + S L RH W T+ Sbjct: 305 RMMSWNAGVNLSCVFCQEQTETRNHLFFTCRYSREVWSGLTSKLLT---RHYSTDWTTIL 361 Query: 1409 SKMSSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIIS 1588 ++ +L ++ +F ++ IW ERN RR + P +++ + VRN +S Sbjct: 362 KLLTDKTLGNNRL-FLLRYAFQILVYSIWKERNSRRHGEEPLPSALLLKRLDKEVRNKLS 420 Query: 1589 NV 1594 + Sbjct: 421 TI 422 >dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] Length = 478 Score = 171 bits (433), Expect(2) = 1e-40 Identities = 112/422 (26%), Positives = 201/422 (47%), Gaps = 17/422 (4%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556 +++K+ + A S A R+ L+ V+ +L +W F +PS+ +K+ +S+ + FL Sbjct: 45 LVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLW 104 Query: 557 
---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 N++ ++W +VC P++EGGLG+R LKE + + LKL W + S L W++W+ Sbjct: 105 SGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANKVSLLKLIWRMLSSTSL-WVQWLRL 163 Query: 728 QYIRSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904 +R ++W +S SW+ K+++K R V + I T W D WS +G + Sbjct: 164 YLLRKGSFWSISGNTTLGSWMWKKILKHRALASGFVKHDIHNGSNTSFWFDNWSKIGRLI 223 Query: 905 RDCGKNARRGSDLRRDATIDDL----SKCSSLSPIVVELKDKLNEVQR---ISGDHADRL 1063 G + A++ + ++ ++D + EV+ SG+ D + Sbjct: 224 DVTGHRGCIDMGITLHASVAEAVVNHRPRRHRHDTLLRIEDVIAEVRHQGLTSGE--DTV 281 Query: 1064 IWRLEPSGE-----FSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKI 1228 W+ +G+ F+ K TW + K W +W+ H P++S+ W RL Sbjct: 282 RWK--GNGDIFKPCFNTKETWAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTT 339 Query: 1229 ADRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLG 1408 DR+ N G D++C LC + +ET +HLFF + ++ + L + WE + Sbjct: 340 GDRMLSWNAGADSSCVLCHHLVETRDHLFFTCPYSAEVWSTLTRKLLSQHFTNR-WEAI- 397 Query: 1409 SKMSSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIIS 1588 K+ + G ++ + +F T+H +W ERN RR + Q+V + VRN IS Sbjct: 398 LKLLTNKSLGHEVPFLTRYTFQLTLHSLWKERNGRRHGEVPQAAAQMVRFLDKQVRNRIS 457 Query: 1589 NV 1594 ++ Sbjct: 458 SI 459 Score = 25.0 bits (53), Expect(2) = 1e-40 Identities = 8/24 (33%), Positives = 18/24 (75%) Frame = +3 Query: 309 SFPLKYLGLPIVSRKRFVNECLPI 380 + P++YLGLP++++K ++ P+ Sbjct: 22 ALPVRYLGLPLLTKKMTTSDYGPL 45 >gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thaliana] Length = 504 Score = 160 bits (405), Expect(2) = 9e-40 Identities = 88/323 (27%), Positives = 152/323 (47%), Gaps = 12/323 (3%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556 ++D + + A S R+ L+ ++ ++ +W F +P +++ + + + +L Sbjct: 119 LIDHIKQKICSWSARFLSYTGRLNLISSILWSICNFWMGAFRLPRDCIREIDKMCSAYLW 178 Query: 557 ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 N+ ++W VCKP+EEGGLGLR LKE + LKL W + S D LW+KWI S Sbjct: 179 
SGGELNTSKAKIAWAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQS 238 Query: 728 QYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904 ++ +W + + S SW+ ++++K R I T W D WS LG + Sbjct: 239 SLLKKVFFWAVRENTSLGSWMWRKILKFRDIARTLCKVEINNGAQTSFWYDDWSDLGRLI 298 Query: 905 RDCGKNARRGSDLRRDATIDDL----SKCSSLSPIVVELKDKLNEVQRISGDHADRLIWR 1072 G + + AT+ + + + + ++++L D +W+ Sbjct: 299 ESAGDRGAIDLGINKHATVVEAWGNRRRRRHRANFLNRVEERLVLSWNSRNQAEDCALWK 358 Query: 1073 LEPS---GEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLP 1243 + + FS K TW I+ + W +W+ IP+H+ +W + RL DR+ Sbjct: 359 GKENRFRSIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMT 418 Query: 1244 KLNIGVDANCCLCWNALETNNHL 1312 N+GVDA C LC NALE+ +HL Sbjct: 419 LWNMGVDATCILCNNALESRDHL 441 Score = 32.7 bits (73), Expect(2) = 9e-40 Identities = 32/110 (29%), Positives = 57/110 (51%), Gaps = 5/110 (4%) Frame = +3 Query: 66 FGHNERTCLKSAGILTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRI 245 FG++ R K G LT++ FADD++V +V +E + + + L ++ EKS + Sbjct: 17 FGYHPRC--KQIG-LTHLSFADDLMVLSDGKVRSIEGIVDVFDTFAKCSDLKISMEKSTV 73 Query: 246 VASWVRGHLFW**VPLGIQKPSF-----PLKYLGLPIVSRKRFVNECLPI 380 + + H V I + SF P++YLGLP+V+++ + LP+ Sbjct: 74 YLAGL-SHTTRQEV---IDRFSFAVGTLPVRYLGLPLVTKQFSSTDYLPL 119 >gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CAB80742.1| AT4g02490 [Arabidopsis thaliana] Length = 657 Score = 145 bits (365), Expect(2) = 1e-39 Identities = 94/335 (28%), Positives = 150/335 (44%), Gaps = 16/335 (4%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556 ++D+++ A S A R+ LLK V+ + +W F +P+ L K E + N FL Sbjct: 330 LVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINFWASIFILPNQCLHKLEQMCNAFLW 389 Query: 557 ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 ++R +SW+ VC +E GGLGL+RL + LKL W + + LW+ W+ Sbjct: 390 SGAPNSAREAKISWDIVCSSKESGGLGLKRLSSWNKVLALKLIWLLFTASGSLWVSWVR- 448 Query: 728 QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIAR 907 W+ ++L K+R V +G + + W D W+ G + Sbjct: 449 
------------------WVWRKLCKLREVARPFVICEVGSGITARFWQDNWTGHGPLIH 490 Query: 908 DCGKNARRGSDLR-----RDATIDD---LSKCSSLSPIVVELKDKLNEVQR-ISGDHADR 1060 G + L RDA +D ++ S +P+++ LK L V + +H D Sbjct: 491 LTGLTGPQLVGLSITSVVRDAIRNDDWWIASSRSRNPVILLLKSLLPPVGNLVDCEHDDS 550 Query: 1061 LIWRLE---PSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIA 1231 +W++ PS +FS TW +Q + W +W+ + +P+H+ W A RL Sbjct: 551 YLWKVGDRVPSSKFSTADTWRALQPFSVSVSWHKAVWFTNQVPKHAFISWVTAWNRLHTR 610 Query: 1232 DRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCS 1336 DRL + V A C LC ET +HLFF F S Sbjct: 611 DRLRSWGLIVPAECVLCNLVDETRDHLFFACRFSS 645 Score = 47.4 bits (111), Expect(2) = 1e-39 Identities = 28/97 (28%), Positives = 57/97 (58%), Gaps = 3/97 (3%) Frame = +3 Query: 99 AGILTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIV---ASWVRGH 269 A ++T++ FADD+LVF +S L +L ++ +GL +N +K+ ++ ++ R Sbjct: 236 APMITHLSFADDILVFCDGSLSSLVAILDILDVFKKGSGLGINLQKTALLLDGGNFERNR 295 Query: 270 LFW**VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 + LG+ + S P++YLG+P++S+K ++ P+ Sbjct: 296 IMA--ASLGVSQGSLPVRYLGVPLMSQKMKKHDYQPL 330 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 145 bits (366), Expect(2) = 1e-37 Identities = 112/419 (26%), Positives = 180/419 (42%), Gaps = 15/419 (3%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556 +L+K+ C S A R+ L+ V+ +W F +P +K+ ESL ++FL Sbjct: 782 LLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLW 841 Query: 557 --NFN-SRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 N ++ I +SW +C P+ EGGLGLRRL E + ++L W + KD LW W H Sbjct: 842 SGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHL 901 Query: 728 QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIAR 907 ++ ++W + S SW KRL+ +R + +G L W D W+ LG + R Sbjct: 902 HHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFR 961 Query: 908 ---DCGKNARRGSDLRRDATI---DDLSKCSSLSPIVVELKDKL--NEVQRISGDHADRL 1063 D G ++ R L + A+ D S S + D L V + + DR Sbjct: 962 
IIGDIGPSSLRVPLLAKVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRY 1021 Query: 1064 IWRLEP--SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADR 1237 W + FS TWE I+ K W + IW+ +P+++ +W RL R Sbjct: 1022 EWSVNGFLCQGFSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQR 1081 Query: 1238 LPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKM 1417 L C LC A E+ +HL + F + + + + R W L S + Sbjct: 1082 LASWGHIQSDACVLCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWV 1141 Query: 1418 SSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAII-FYVRNIISN 1591 L K+ ++ +W +RN + N LR ++ ++ +RNIIS+ Sbjct: 1142 RQSSPEAPPLLR--KIVSQVVVYNLWRQRN-NLLHNSLRLAPAVIFKLVDREIRNIISS 1197 Score = 40.8 bits (94), Expect(2) = 1e-37 Identities = 25/91 (27%), Positives = 47/91 (51%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287 +++++FADDV++F L + L D +GL +N +KS + + + Sbjct: 692 ISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLESNANA 751 Query: 288 PLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 G + P++YLGLP+++RK + E P+ Sbjct: 752 AYGFPIGTLPIRYLGLPLMNRKLRIAEYEPL 782 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 138 bits (347), Expect(2) = 5e-36 Identities = 99/390 (25%), Positives = 155/390 (39%), Gaps = 14/390 (3%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559 +L+K+ L + S A R L+ V+ L +W F +P +KK ESL +KFL Sbjct: 642 LLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLW 701 Query: 560 FNS----RIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 S + +SW + C P+ EGGLG R E + L+L W + + LW +W Sbjct: 702 AGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRH 761 Query: 728 QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIAR 907 + ++W A + W K L+ +R E + +G W D W+ LG + + Sbjct: 762 HRLGHASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIK 821 Query: 908 DCGKNARRGSDLRRDATIDD----------LSKCSSLSPIVVELKDKLNEVQRISGDHAD 1057 G R + A + D LS+ + I+ L + D Sbjct: 822 
YLGDVGSRPLRIPFSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYS 881 Query: 1058 RLIWRLEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADR 1237 + ++ G FS TWE ++ ++ RW +W+ +P+H+ W RL R Sbjct: 882 WCVDDVDCQG-FSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQR 940 Query: 1238 LPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKM 1417 L + A CCLC ET +HL L F S + + L R W L S Sbjct: 941 LVSWGLVSSAECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWT 1000 Query: 1418 SSYDFAGTKLNTSVKLSFAATIHQIWWERN 1507 A L V ++ +W +RN Sbjct: 1001 RQSTAAAPSLLRKVVAQL--VVYNLWRQRN 1028 Score = 42.4 bits (98), Expect(2) = 5e-36 Identities = 25/91 (27%), Positives = 49/91 (53%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287 +++++FADDV++F S + + L D +GL +N +KS++ + + Sbjct: 552 ISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDLSERITSA 611 Query: 288 PLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 G +FP++YLGLP++ RK + + P+ Sbjct: 612 AYGFPAGTFPIRYLGLPLMCRKLRIADYGPL 642 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 138 bits (347), Expect(2) = 5e-36 Identities = 99/390 (25%), Positives = 155/390 (39%), Gaps = 14/390 (3%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559 +L+K+ L + S A R L+ V+ L +W F +P +KK ESL +KFL Sbjct: 642 LLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLW 701 Query: 560 FNS----RIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 S + +SW + C P+ EGGLG R E + L+L W + + LW +W Sbjct: 702 AGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRH 761 Query: 728 QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIAR 907 + ++W A + W K L+ +R E + +G W D W+ LG + + Sbjct: 762 HRLGHASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIK 821 Query: 908 DCGKNARRGSDLRRDATIDD----------LSKCSSLSPIVVELKDKLNEVQRISGDHAD 1057 G R + A + D LS+ + I+ L + D Sbjct: 822 YLGDVGSRPLRIPFSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYS 881 Query: 1058 
RLIWRLEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADR 1237 + ++ G FS TWE ++ ++ RW +W+ +P+H+ W RL R Sbjct: 882 WCVDDVDCQG-FSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQR 940 Query: 1238 LPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKM 1417 L + A CCLC ET +HL L F S + + L R W L S Sbjct: 941 LVSWGLVSSAECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWT 1000 Query: 1418 SSYDFAGTKLNTSVKLSFAATIHQIWWERN 1507 A L V ++ +W +RN Sbjct: 1001 RQSTAAAPSLLRKVVAQL--VVYNLWRQRN 1028 Score = 42.4 bits (98), Expect(2) = 5e-36 Identities = 25/91 (27%), Positives = 49/91 (53%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287 +++++FADDV++F S + + L D +GL +N +KS++ + + Sbjct: 552 ISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDLSERITSA 611 Query: 288 PLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 G +FP++YLGLP++ RK + + P+ Sbjct: 612 AYGFPAGTFPIRYLGLPLMCRKLRIADYGPL 642 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 131 bits (329), Expect(2) = 7e-36 Identities = 89/335 (26%), Positives = 143/335 (42%), Gaps = 16/335 (4%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559 +++K+ + S A R+ LL V+ + +W F +P +KK ESL ++FL Sbjct: 679 LIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSFILPLGCIKKIESLCSRFL- 737 Query: 560 FNSRI-----ISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIH 724 ++SRI ++W VC P+ EGG+GLRR + L++ W + S LW+ W H Sbjct: 738 WSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNRTLYLRMIWLLFSNSGSLWVAW-H 796 Query: 725 SQYI--RSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898 Q+ +S ++W SW K L+++R E + +G W D W+P G Sbjct: 797 KQHSLGKSTSFWNQPEKPHDSWNWKCLLRLRVVAERFIRCNVGNGRDASFWFDNWTPFGP 856 Query: 899 IARDCGKNARRGSDLRRDATIDDL------SKCSSLSPIVVELKDKLNEVQRIS-GDHAD 1057 + + G R + +A I D+ S S + L L + S D Sbjct: 857 LIKFLGNEGPRDLRVHLNAKISDVCTSEGWSIADPRSDQALSLHTHLTNISMPSDAQDLD 916 Query: 1058 RLIWRLEPS--GEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIA 1231 W ++ 
FS +TW ++ W +W+ P+H+ LW RL Sbjct: 917 SYDWVVDNKVCQGFSAAATWSALRPSSAPVPWARAVWFKGATPKHAFHLWTAHLDRLPTK 976 Query: 1232 DRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCS 1336 RL + +D C LC ET +HLF F + Sbjct: 977 VRLASWGMQIDTTCGLCSLHPETRDHLFLSCDFAN 1011 Score = 48.9 bits (115), Expect(2) = 7e-36 Identities = 28/91 (30%), Positives = 50/91 (54%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287 +++++FADDV++F + S L + L D +GL MN K+++ + + Sbjct: 589 ISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMA 648 Query: 288 PLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 G + S P++YLGLP++SRK + E P+ Sbjct: 649 SYGFKLGSLPVRYLGLPLMSRKLTIAEYAPL 679 >gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] Length = 740 Score = 137 bits (346), Expect(2) = 6e-35 Identities = 108/397 (27%), Positives = 171/397 (43%), Gaps = 31/397 (7%) Frame = +2 Query: 374 SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553 S +LDKV + A S A R+ L+ V+ +L +W + +P+ +K+ E L + F Sbjct: 360 SPLLDKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAF 419 Query: 554 L----NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721 L N + ++W ++CK ++EGGLG++ L E + + LKL W + S++ LW+ W+ Sbjct: 420 LWSGPELNPKKAKITWTSLCKLKQEGGLGIKSLLEANKVSCLKLIWRLVSRQSSLWVNWV 479 Query: 722 HSQYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898 + IR ++W + S SW+ K+L+K R ++ I T W D WS LG Sbjct: 480 WTYIIRKGSFWSANDRSSLGSWMWKKLLKYRDVAKSMCKVEIKSGSSTSFWYDNWSQLGQ 539 Query: 899 IARDCGKNARRGSD--LRRDATIDDLSKCSSLSPIVVELKDKLNE-----VQRISGDHAD 1057 + NARR D + AT+ + + +K+ +QR D Sbjct: 540 LVD--VTNARRTIDMGIPLAATVATVLASHRTKHHRTAIYNKIEAEIQSILQRERSGAPD 597 Query: 1058 RLIWRLEPSG---EFSMKSTWEFIQRKKHTFR-WVNLIWYPHHIPRHSLTLWKLANQRLK 1225 +WR F K TW I R HT R W +W+ ++ P++S LW + RL Sbjct: 598 IFLWRSSGDNFRQSFITKVTWHNI-RVIHTHRQWYKGVWFSYNTPKYSFLLWLAIHDRLS 656 Query: 1226 IADRLPKLNIGVDA---------------NCCLCWNALETNNHLFFLMHFCSDYMKPIAS 1360 DR+ K N G C C N + F+L F S KPI+ Sbjct: 657 
TGDRIKKWNSGQQTFSTPLSIFTLKFLRNRCIFCNNMISK----FYLTIFDS-LSKPIS- 710 Query: 1361 MLFVDGRRHLFWETLGSKMSSYDFAGTKLNTSVKLSF 1471 F + L +K F + +NT L + Sbjct: 711 ----------FIDCLTNKSHKLSFTESSINTICPLEY 737 Score = 39.3 bits (90), Expect(2) = 6e-35 Identities = 25/95 (26%), Positives = 50/95 (52%), Gaps = 4/95 (4%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWV----RGHLF 275 LT++ FADD++VFI + +E + ++ K+GL ++ EKS + + V R ++ Sbjct: 271 LTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNIL 330 Query: 276 W**VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380 P++YLGLP+++++ + P+ Sbjct: 331 ---SAFPFASGQLPVRYLGLPLLTKQMTTADYSPL 362 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 124 bits (312), Expect(2) = 2e-30 Identities = 132/522 (25%), Positives = 226/522 (43%), Gaps = 22/522 (4%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559 ++ K+ + G + S R+ LL+ + +L +Y + P +L++ L N FL Sbjct: 2902 LVAKIEERITGWENKILSPGGRITLLRSTLSSLPIYLLQVLKPPIIVLERINRLFNNFLW 2961 Query: 560 FNS----RIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 S RI SW + P EGGL +R L++V A +KL WW + LW++++ + Sbjct: 2962 GGSASSKRIHWASWGKIALPIAEGGLDIRNLEDVFKAFSMKL-WWRFRTTNSLWMQFMRA 3020 Query: 728 QYIRSD--TYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPW---SPL 892 +Y T+ + H S +W KR++ I E N+ + +G W D W PL Sbjct: 3021 KYCGGQLPTHVQPKLHDSQTW--KRMVTISSITEQNIRWRVGHGKLF-FWHDCWMGEEPL 3077 Query: 893 GVIARDCGKNARRGSDLRRDATIDDLSKCSSLSPIVVELKDKLNEVQRISGDHADRLIWR 1072 + ++ + + SD + + D S L VVE K+ I+ DR W Sbjct: 3078 VIRNQEFASSMAQVSDFFLNNSWDIEKLKSVLQQEVVEEIAKIP----INASSNDRAYWT 3133 Query: 1073 LEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLPKLN 1252 P+G+FS KS W+ + +K N IW+ S LW+L + + + ++ Sbjct: 3134 PTPNGDFSTKSAWQLSRERKVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKG 3193 Query: 1253 IGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRR----HLFWE-TLGSKM 1417 + A+ C C 
+ E+ LMH D P+A+ ++ + H+ T+ + Sbjct: 3194 FQL-ASRCRCCKSEES------LMHVMWD--NPVANQVWSYFAKVFQIHIINPCTINHII 3244 Query: 1418 SSYDFAGT-----KLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNI 1582 S++ ++G + T V L + +W ERN + +N +IV I+ + + Sbjct: 3245 SAWFYSGDYSKPGHIRTLVPLFI---LWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQL 3301 Query: 1583 ISNVSQDSVCSKEALYMARQWQINLTWKAKS-FFLISWSPPHYGWICLNVDAS--YSQFR 1753 + +A++W I L A S L+ W+ P G LNVD S Y+ Sbjct: 3302 FQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQT 3361 Query: 1754 LGFGGLLRDHLGTPLVAFAGAQDPSSVILAEITDMLEGVQAC 1879 GGLLRDH G+ + F+ + AE+ + G+ C Sbjct: 3362 AAGGGLLRDHTGSMIFGFSENFGSQDSLQAELMALHRGLLLC 3403 Score = 37.0 bits (84), Expect(2) = 2e-30 Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 5/83 (6%) Frame = +3 Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIV-----ASWVRGHL 272 ++++ FADDV++F S L+ L++ E +G +NP+KS +V AS R + Sbjct: 2810 ISHLAFADDVIIFANGSKSALQRILAFLQEYEELSGQRINPQKSCVVTHTNMASSRRQII 2869 Query: 273 FW**VPLGIQKPSFPLKYLGLPI 341 G P+ YLG P+ Sbjct: 2870 L---QATGFSHRPLPITYLGAPL 2889 Score = 117 bits (293), Expect = 2e-23 Identities = 127/521 (24%), Positives = 228/521 (43%), Gaps = 21/521 (4%) Frame = +2 Query: 380 ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559 ++ K+ + G + S R+ LL+ V+ + +Y + P ++++K E L N FL Sbjct: 1108 LISKIRDRISGWENKILSPGGRITLLRSVLSSQPMYLLQVLKPPVTVIEKIERLFNSFLW 1167 Query: 560 FNS----RIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727 +S ++ +W + P EGGL +R L++V A LKL WW + LW +++ + Sbjct: 1168 GDSCDGKKLHWTAWSKITFPVSEGGLDIRNLRDVFEAFSLKL-WWRFQTCNSLWTRFLRT 1226 Query: 728 QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGK-DLYTKVWLDPW---SPLG 895 +Y + Q + S + KR++ R N+ + IGK +L+ W D W PL Sbjct: 1227 KYCLGRIPHLVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELF--FWHDCWMGDQPLA 1284 Query: 896 VIARDCGKNARRGSDLRRDATID--DLSKCSSLSPIVVELKDKLNEVQRISGDHA--DRL 1063 + + S + + D D+ K +S P + ++E+ +I D + D Sbjct: 1285 TLFPSFHNDM---SHVHKFYNGDEWDIVKLNSYLPTSL-----VDEILQIPFDRSQEDVA 1336 Query: 1064 
IWRLEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLP 1243 W L +GEFS S WE I++++ ++ W+ S LW++ N + + R+ Sbjct: 1337 YWALTSNGEFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMK 1396 Query: 1244 KLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIAS--MLFVDGRRHLFWETLGSKM 1417 I + A+ C+C + E+ H+ + A ++V +H+ + + + Sbjct: 1397 DKGIHL-ASKCVCCRSEESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHIS-QIIWAWF 1454 Query: 1418 SSYDFAGTKLNTSVKLSFAATI-HQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNV 1594 S D+ N +++ I +W ERN K R + +I+ + +++ + Sbjct: 1455 FSGDYT---RNGHIRILIPLFICWFLWLERN----DAKHRHMGMYPNRVIWRIMKLLNQL 1507 Query: 1595 SQDSVCS----KEALYMARQWQINLTWK-AKSFFLISWSPPHYGWICLNVD-ASYSQFRL 1756 S+ K +A W K +S +ISW P G LNVD +S S Sbjct: 1508 HAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNA 1567 Query: 1757 GFGGLLRDHLGTPLVAFAGAQDPSSVILAEITDMLEGVQAC 1879 GG+LRDH G AF+ P + AE+ +L G+ C Sbjct: 1568 AGGGVLRDHTGKLAFAFSENLGPLPSLQAELHALLRGLLLC 1608 >dbj|BAB08692.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] gi|93007380|gb|ABE97193.1| hypothetical protein At5g13655 [Arabidopsis thaliana] Length = 385 Score = 139 bits (350), Expect = 5e-30 Identities = 95/358 (26%), Positives = 161/358 (44%), Gaps = 12/358 (3%) Frame = +2 Query: 557 NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHSQYI 736 + N+R ++W VC P+ EGGLGLR ++E + LKL W + S K LW+ W+ + Sbjct: 11 SLNARKTKVAWSVVCTPKSEGGLGLRAVEETNKVCMLKLIWRILSAKGSLWVDWVKKHLL 70 Query: 737 RSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIARDC 913 R + W V + SWI K+L+K R + + T W D WS LG + Sbjct: 71 RGGSLWAVKETSSRGSWIWKKLLKYRDKAKCFHKVDVRNGESTSFWYDSWSSLGCLYDKF 130 Query: 914 GKNARRGSDLRRDATIDD----LSKCSSLSPIV--VELKDKLNEVQRISGDHADRLIWRL 1075 G+ + +D+T+ + P++ VE + + + RI + D +W+ Sbjct: 131 GERGCIDMGIPKDSTLSSAIMTTRRRKHRQPLLNAVETEIQKQKQSRIVTER-DVALWKG 189 Query: 1076 EPSG---EFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLPK 1246 + G F K TW I+ + + IW+ + P+++L W + R+ +++ Sbjct: 190 KEDGFHPTFLSKETWSQIRNTQPEMQGYRGIWFSNATPKYALLTWLMVRNRIATGEKMGL 
249 Query: 1247 LNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSSY 1426 N D +C C N ET HLFF + + L +D + W+ + ++ Sbjct: 250 WNQNTDTSCIFCKNPNETREHLFFQCVYTRKVWNGLIKGLLLD-KYSDRWQDIILMLTRK 308 Query: 1427 DFAGTKLNTSVKLSFAA--TIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNV 1594 DF TK S L + +IH IW ER+ RR E++++ I +RN +S + Sbjct: 309 DFDTTK---SFILGYVLQNSIHSIWRERDDRRHGEDPSNEERLIKFIDKNIRNRLSTL 363