BLASTX nr result
ID: Cocculus23_contig00023532
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00023532 (642 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]              117   4e-24
emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678...  117   4e-24
emb|CAB72467.1| putative protein [Arabidopsis thaliana]              114   3e-23
gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thali...  114   3e-23
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...  112   1e-22
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...  110   4e-22
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...  108   1e-21
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]      107   3e-21
gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]      104   2e-20
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...  102   1e-19
ref|XP_004305958.1| PREDICTED: uncharacterized protein LOC101308...  102   1e-19
gb|ABE65398.1| hypothetical protein At1g43570 [Arabidopsis thali...  101   2e-19
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...  101   2e-19
gb|ABK28140.1| unknown [Arabidopsis thaliana]                        101   2e-19
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...  100   3e-19
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]          100   3e-19
gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub...   97   4e-18
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...
96 7e-18 dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] 96 9e-18 dbj|BAB08692.1| non-LTR retroelement reverse transcriptase-like ... 95 2e-17 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 117 bits (292), Expect = 4e-24 Identities = 68/204 (33%), Positives = 101/204 (49%), Gaps = 5/204 (2%) Frame = +1 Query: 46 ITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPH-S 222 + +PKAEGGL +R K+ N L+L+W+I N SLW K + E +++ SIW++ S Sbjct: 512 VCKPKAEGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTS 571 Query: 223 DSPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRY 402 W WRKILKIR++A +F + +GNG FW ++W G L + + + R Sbjct: 572 MGSWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPRE 631 Query: 403 NKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNE-EDMVVW---NPVDSGIFSIK 570 VA+ + T+LL E QRI ++ ED V+W N V FS + Sbjct: 632 ASVADAWTRRS---RRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTR 688 Query: 571 TAWDKVRISYPTCPWWSMVWFPQA 642 W ++ + T W VWF A Sbjct: 689 DTWHLIKATSSTVSWHKGVWFRHA 712 >emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1| putative protein [Arabidopsis thaliana] Length = 473 Score = 117 bits (292), Expect = 4e-24 Identities = 71/207 (34%), Positives = 101/207 (48%), Gaps = 8/207 (3%) Frame = +1 Query: 46 ITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHSD 225 + +PK EGGL +R K+ N L+L+W+I + +SLW+K I S LK+ S W V ++ Sbjct: 114 VCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQSSLLKKVSFWAVRENTS 173 Query: 226 -SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRY 402 W WRKILK R++A T K I NG T FW ++W +G +LI + G+ Sbjct: 174 LGSWMWRKILKFRDIARTLCKVEINNGARTSFWYDDWSDLG---RLIDSAGDRGAIDLGI 230 Query: 403 NKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNE----EDMVVWNPVDS---GIF 561 NK A ++ G TN L E + WN ED +W ++ IF Sbjct: 231 NKHATVVEAWGNRRRRRHRTNFLNRVEERLILS---WNSRNQAEDRALWKGKENRFRSIF 287 Query: 562 SIKTAWDKVRISYPTCPWWSMVWFPQA 642 S K W+ +R W+ VWF QA Sbjct: 288 STKDTWNHIRTVSNKVAWYKGVWFAQA 314 >emb|CAB72467.1| 
putative protein [Arabidopsis thaliana] Length = 762 Score = 114 bits (285), Expect = 3e-23 Identities = 60/204 (29%), Positives = 101/204 (49%), Gaps = 4/204 (1%) Frame = +1 Query: 43 EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHS 222 ++ +PK+EGGL +R K+ N L+L+W+I + +SLW+K + + LKR+ W V ++ Sbjct: 417 QVCKPKSEGGLGLRSLKEANDVCCLKLVWRIISHGDSLWVKWVEHNLLKREIFWIVKENA 476 Query: 223 D-SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 + W W+KILK R +A F K+ +GNG T FW ++W +G L + + + R Sbjct: 477 NLGSWIWKKILKYRGVAKRFCKAEVGNGESTSFWFDDWSLLGRLIDVAGIRGTIDMGISR 536 Query: 400 YNKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNEEDMVVW---NPVDSGIFSIK 570 VA+ N ++ + + +R ++ V+W N + FS K Sbjct: 537 TMSVADAWTSRRRRHHRQEILNTIEEVLSTQHQKRTQQQQQGRVLWKGKNDIYKDKFSTK 596 Query: 571 TAWDKVRISYPTCPWWSMVWFPQA 642 W+ +R + W VWFP A Sbjct: 597 NTWNYLRTTSNEVAWHKGVWFPHA 620 >gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thaliana] Length = 504 Score = 114 bits (284), Expect = 3e-23 Identities = 69/207 (33%), Positives = 100/207 (48%), Gaps = 8/207 (3%) Frame = +1 Query: 46 ITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHSD 225 + +PK EGGL +R K+ N L+L+W+I + +SLW+K I S LK+ W V ++ Sbjct: 195 VCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQSSLLKKVFFWAVRENTS 254 Query: 226 -SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRY 402 W WRKILK R++A T K I NG +T FW ++W +G +LI + G+ Sbjct: 255 LGSWMWRKILKFRDIARTLCKVEINNGAQTSFWYDDWSDLG---RLIESAGDRGAIDLGI 311 Query: 403 NKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNE----EDMVVWNPVDS---GIF 561 NK A ++ G N L E + WN ED +W ++ IF Sbjct: 312 NKHATVVEAWGNRRRRRHRANFLNRVEERLVLS---WNSRNQAEDCALWKGKENRFRSIF 368 Query: 562 SIKTAWDKVRISYPTCPWWSMVWFPQA 642 S K W+ +R W+ VWF QA Sbjct: 369 STKDTWNHIRTVSNKVAWYKGVWFAQA 395 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 112 bits (279), Expect = 1e-22 Identities = 67/205 (32%), Positives = 98/205 (47%), Gaps = 8/205 (3%) Frame = +1 Query: 43 
EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHS 222 +I +PK EGGL +R + N +L+L+W++ N +SLW+K + LK++S W++TP+S Sbjct: 269 DICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLTPNS 328 Query: 223 D-SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 W W+K+LK R A F + + NG T FW +NW MG L + ++ + R Sbjct: 329 SLGSWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISR 388 Query: 400 YNKVAEILNQEGWS--FPTSGNTNLLQIFEAC--KTIQRIPWNEEDMVVW---NPVDSGI 558 VA E WS T L EA + Q ED +W V Sbjct: 389 NKTVA-----EAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLLREDATLWRGKGDVFKTS 443 Query: 559 FSIKTAWDKVRISYPTCPWWSMVWF 633 FS K W++VR W+ VWF Sbjct: 444 FSTKDTWNQVRKKSNEVAWYKGVWF 468 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 110 bits (275), Expect = 4e-22 Identities = 66/208 (31%), Positives = 99/208 (47%), Gaps = 8/208 (3%) Frame = +1 Query: 43 EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHS 222 EI +PK EGGL ++ ++ NK L+L+W++ ++SLW+K + LK++S W++ HS Sbjct: 585 EICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHS 644 Query: 223 D-SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 W WR++LK R +A +F K + NG T FW +NW G L L + + R Sbjct: 645 TLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISR 704 Query: 400 YNKVAEILNQEGWS--FPTSGNTNLLQIFE--ACKTIQRIPWNEEDMVVW---NPVDSGI 558 + +A E WS +L FE + Q ED ++W V Sbjct: 705 HMTLA-----EAWSRRRRKRHRVEILNEFEEILLQKYQHRNIELEDAILWRGKEDVFKAR 759 Query: 559 FSIKTAWDKVRISYPTCPWWSMVWFPQA 642 FS K W+ +R S W VWF A Sbjct: 760 FSTKDTWNHIRTSSNQRAWHKGVWFAHA 787 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 108 bits (270), Expect = 1e-21 Identities = 62/203 (30%), Positives = 94/203 (46%), Gaps = 4/203 (1%) Frame = +1 Query: 46 ITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPH-S 222 + +PK EGGL +R K+ N L+L+WKI + SLW+K + + L+ S W V S Sbjct: 865 
VCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVS 924 Query: 223 DSPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRY 402 W W+K+LK R +A T K +GNG +T FW +NW +G L + + + R Sbjct: 925 QGSWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRR 984 Query: 403 NKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNEEDMVVW---NPVDSGIFSIKT 573 V E + N+++ +A K ED V+W + V FS + Sbjct: 985 MTVEEAWTNRRQRRHRNDVYNVIE--DALKKSWDTRTETEDKVLWRGKSDVFRTTFSTRD 1042 Query: 574 AWDKVRISYPTCPWWSMVWFPQA 642 W R + PW ++WF A Sbjct: 1043 TWHHTRSTSARVPWHKVIWFSHA 1065 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 107 bits (267), Expect = 3e-21 Identities = 59/200 (29%), Positives = 95/200 (47%), Gaps = 4/200 (2%) Frame = +1 Query: 55 PKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHSDSPW 234 PK+EGGL +R + NK ++L+W++ K+SLW H +L R S W V W Sbjct: 861 PKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSW 920 Query: 235 TWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRYNKVA 414 TW+++L +R LA F+ +GNG + +W +NW +G L ++I + KVA Sbjct: 921 TWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVA 980 Query: 415 EILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNEEDMVVWNPVDSGI----FSIKTAWD 582 +++GW P S + I + T+ +ED+ + +G FS W+ Sbjct: 981 SAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFSAAKTWE 1040 Query: 583 KVRISYPTCPWWSMVWFPQA 642 +R W S +WF A Sbjct: 1041 AIRPKATVKSWASSIWFKGA 1060 >gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana] Length = 438 Score = 104 bits (260), Expect = 2e-20 Identities = 66/200 (33%), Positives = 94/200 (47%), Gaps = 7/200 (3%) Frame = +1 Query: 55 PKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*--IHESYLKRDSIWTVTPHSDS 228 PKAEGGL +R + N A L+L+W + N SLW+ H + W + + Sbjct: 132 PKAEGGLGVRKFTEWNTALNLKLIWLLFSNSGSLWVAWHLFHNLSTSVSNFWLIKEGTTD 191 Query: 229 PWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRYNK 408 W WR +L++R LA+ F+ IGNG FW ++W P G L I + +K Sbjct: 192 SWNWRCLLRLRPLASKFLFCSIGNGLTASFWADSWTPFGPLLTFIGSDGPRNQRIPLCSK 251 
Query: 409 VAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNE--EDMVVW---NPVDSGIFSIKT 573 VA+++N W P+ ++N L + A T IP ED +W N D G FS Sbjct: 252 VADVVNGNRWLLPSPRSSNALNL-HAFLTTLSIPLQPLVEDSYLWKVENCSDIG-FSSAH 309 Query: 574 AWDKVRISYPTCPWWSMVWF 633 W+ +R PW S VWF Sbjct: 310 TWNALRHKEVEKPWVSSVWF 329 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 102 bits (254), Expect = 1e-19 Identities = 56/201 (27%), Positives = 101/201 (50%), Gaps = 4/201 (1%) Frame = +1 Query: 43 EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHS 222 E+ + K EGGL ++ K+ N+ LL+L+W+I ++SLW+K +++ +++++ W+V ++ Sbjct: 1011 EVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENT 1070 Query: 223 D-SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 W WRKILK R+ A F + + +G T FW ++W P+G L Q + + + Sbjct: 1071 GLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPN 1130 Query: 400 YNKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNEEDMVVWNPVD---SGIFSIK 570 VAE++N + N QI + ++ + D +W + FS Sbjct: 1131 NATVAEVMNTHRRKRHRADFLN--QIKSQIELARQDRSTDGDRSLWKQKEDTFKSSFSSS 1188 Query: 571 TAWDKVRISYPTCPWWSMVWF 633 W ++R C W+ VWF Sbjct: 1189 KTWQQIRSISLRCDWYRGVWF 1209 >ref|XP_004305958.1| PREDICTED: uncharacterized protein LOC101308407 [Fragaria vesca subsp. 
vesca] Length = 177 Score = 102 bits (253), Expect = 1e-19 Identities = 55/160 (34%), Positives = 82/160 (51%), Gaps = 1/160 (0%) Frame = +1 Query: 151 SLWIK*IHESYLKRDSIWTVTPHSDSPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNN 330 SLW I ++L+ S W V+ W WRK+LKIR+ IK +IG+G T FW + Sbjct: 14 SLWSSWIKVNFLRDKSFWMVSTPQICSWNWRKLLKIRDFIRPSIKHIIGDGKSTYFWHDY 73 Query: 331 WHPMGILEQLIPPSNRLGSALQRYNKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIP 510 WHP G L + P + S + V+ I+ E W +P S N+ +L++ A IP Sbjct: 74 WHPFGPLLPRLGPGAMINSGIPSNALVSSIVKGESWCWPLSTNSAILRV--ASNVEGLIP 131 Query: 511 WNE-EDMVVWNPVDSGIFSIKTAWDKVRISYPTCPWWSMV 627 + +D +W P SGIFS + D++ I +P W +V Sbjct: 132 NSSCKDSCIWLPSTSGIFSTASTMDQIWIHHPVVDWAKIV 171 >gb|ABE65398.1| hypothetical protein At1g43570 [Arabidopsis thaliana] Length = 348 Score = 101 bits (251), Expect = 2e-19 Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 6/205 (2%) Frame = +1 Query: 43 EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHS 222 ++ P EGGL +R K++N L+L+W++ ++ SLW + + ++R++ W + S Sbjct: 88 KVCLPMCEGGLGLRPLKEINTVCGLKLIWRLLASQTSLWGQWVQTYLIRRNNFWAIKASS 147 Query: 223 -DSPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 W WRK+L +R++A +F K I NG T FW ++W MG L ++ + + Sbjct: 148 YQGSWMWRKLLTLRDVARSFHKKEIHNGRNTSFWYDHWSSMGTLVDVLGARGCIDLGITT 207 Query: 400 YNKVAEILNQEGWSFPTSGNTNLLQI---FEACKTIQRIPWNEEDMVVWNPVD--SGIFS 564 + + S G+ L++I EA KT R + ED+ +W P FS Sbjct: 208 TTNMENVFTTRR-SRKHRGDL-LVRIEAEIEAAKT--RHQPDIEDIDLWKPSTGYKKTFS 263 Query: 565 IKTAWDKVRISYPTCPWWSMVWFPQ 639 + W +R++ P C W VWFP+ Sbjct: 264 TRETWKLIRMAEPVCEWAKGVWFPK 288 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 101 bits (251), Expect = 2e-19 Identities = 62/199 (31%), Positives = 87/199 (43%), Gaps = 3/199 (1%) Frame = +1 Query: 55 PKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHSDSPW 234 PK+EGGL R + NK LL+L+W + SLW + L S W V PW Sbjct: 721 PKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPW 780 Query: 235 
TWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRYNKVA 414 TW+ +L +R LA FIK+ +GNG FW + W +G L + + + KVA Sbjct: 781 TWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVA 840 Query: 415 EILNQEGWSFPTSGNTNLLQIFEACKTI-QRIPWNEEDMVVW--NPVDSGIFSIKTAWDK 585 + ++ GW P S + I ++ P D W + VD FS W+ Sbjct: 841 DAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWEV 900 Query: 586 VRISYPTCPWWSMVWFPQA 642 +R P W VWF A Sbjct: 901 LRPRRPVKRWAKSVWFKGA 919 >gb|ABK28140.1| unknown [Arabidopsis thaliana] Length = 349 Score = 101 bits (251), Expect = 2e-19 Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 6/205 (2%) Frame = +1 Query: 43 EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHS 222 ++ P EGGL +R K++N L+L+W++ ++ SLW + + ++R++ W + S Sbjct: 88 KVCLPMCEGGLGLRPLKEINTVCGLKLIWRLLASQTSLWGQWVQTYLIRRNNFWAIKASS 147 Query: 223 -DSPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 W WRK+L +R++A +F K I NG T FW ++W MG L ++ + + Sbjct: 148 YQGSWMWRKLLTLRDVARSFHKKEIHNGRNTSFWYDHWSSMGTLVDVLGARGCIDLGITT 207 Query: 400 YNKVAEILNQEGWSFPTSGNTNLLQI---FEACKTIQRIPWNEEDMVVWNPVD--SGIFS 564 + + S G+ L++I EA KT R + ED+ +W P FS Sbjct: 208 TTNMENVFTTRR-SRKHRGDL-LVRIEAEIEAAKT--RHQPDIEDIDLWKPSTGYKKTFS 263 Query: 565 IKTAWDKVRISYPTCPWWSMVWFPQ 639 + W +R++ P C W VWFP+ Sbjct: 264 TRETWKLIRMAEPVCEWAKGVWFPK 288 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 100 bits (250), Expect = 3e-19 Identities = 60/206 (29%), Positives = 94/206 (45%), Gaps = 6/206 (2%) Frame = +1 Query: 43 EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYL-KRDSIWTVTPH 219 ++ PKAEGG+ +R N+ L+++W + N SLW+ + L K S W Sbjct: 754 QVCLPKAEGGIGLRRFAVSNRTLYLRMIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEK 813 Query: 220 SDSPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 W W+ +L++R +A FI+ +GNG + FW +NW P G L + + + Sbjct: 814 PHDSWNWKCLLRLRVVAERFIRCNVGNGRDASFWFDNWTPFGPLIKFLGNEGPRDLRVHL 873 Query: 400 
YNKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNEEDM-----VVWNPVDSGIFS 564 K++++ EGWS + L + I +P + +D+ VV N V G FS Sbjct: 874 NAKISDVCTSEGWSIADPRSDQALSLHTHLTNIS-MPSDAQDLDSYDWVVDNKVCQG-FS 931 Query: 565 IKTAWDKVRISYPTCPWWSMVWFPQA 642 W +R S PW VWF A Sbjct: 932 AAATWSALRPSSAPVPWARAVWFKGA 957 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 100 bits (250), Expect = 3e-19 Identities = 62/199 (31%), Positives = 87/199 (43%), Gaps = 3/199 (1%) Frame = +1 Query: 55 PKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHSDSPW 234 PK+EGGL R + NK LL+L+W + SLW + L S W V PW Sbjct: 721 PKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPW 780 Query: 235 TWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRYNKVA 414 TW+ +L +R LA FIK+ +GNG FW + W +G L + + + KVA Sbjct: 781 TWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVA 840 Query: 415 EILNQEGWSFPTSGNTNLLQIFEACKTI-QRIPWNEEDMVVW--NPVDSGIFSIKTAWDK 585 + ++ GW P S + I ++ P D W + VD FS W+ Sbjct: 841 DAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWEV 900 Query: 586 VRISYPTCPWWSMVWFPQA 642 +R P W VWF A Sbjct: 901 LRPRRPVKRWARSVWFKGA 919 >gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. 
lyrata] Length = 441 Score = 97.1 bits (240), Expect = 4e-18 Identities = 58/204 (28%), Positives = 93/204 (45%), Gaps = 4/204 (1%) Frame = +1 Query: 43 EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHS 222 ++ PK EGGL +R + NK L+L+W++ + SLW++ + + +++ S W++ S Sbjct: 84 DVCMPKEEGGLGLRSLTEANKVCCLKLIWRLL-SSSSLWVQWLRQYVIRKGSFWSLRDTS 142 Query: 223 D-SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 W WRK+LK R+LA+ F + I NG FW +NW P+G L + + + Sbjct: 143 TLGSWMWRKLLKYRHLASGFTQYEIRNGKGVSFWHDNWSPLGPLIAISGTRGCIDMGIDI 202 Query: 400 YNKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNEEDMVVWNPVDSGI---FSIK 570 + VAE L + E +T + ED+V+W FS K Sbjct: 203 HATVAEALTHRRRRHRADHLNQMEAQLEELRTKGLV--ETEDVVLWKGKGGRFKPSFSTK 260 Query: 571 TAWDKVRISYPTCPWWSMVWFPQA 642 W R P W+ +WF A Sbjct: 261 ETWADTREQKPRNEWYQGIWFSHA 284 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 96.3 bits (238), Expect = 7e-18 Identities = 59/199 (29%), Positives = 88/199 (44%), Gaps = 6/199 (3%) Frame = +1 Query: 55 PKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHSDSPW 234 PKAEGGL +R NK L+L+W + ++SLW+ H + L+ + W S W Sbjct: 860 PKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSW 919 Query: 235 TWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQRYNKVA 414 W+ IL +R LA F++ +GNG +W ++W +G L + I S + + V Sbjct: 920 IWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAVVT 979 Query: 415 EILNQEGWSFPTSGNTNLLQIFEACKTIQRIPW----NEEDMVVW--NPVDSGIFSIKTA 576 E + GW P S T + T+ P ED W S FS K Sbjct: 980 EASSSTGWILP-SARTRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKLT 1038 Query: 577 WDKVRISYPTCPWWSMVWF 633 W+ +R T W + VW+ Sbjct: 1039 WECLRQRDTTKLWAAAVWY 1057 >dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] Length = 478 Score = 95.9 bits (237), Expect = 9e-18 Identities = 62/206 (30%), Positives = 100/206 (48%), Gaps = 6/206 (2%) Frame = +1 Query: 43 EITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHS 222 ++ PK EGGL IR K+ NK 
LL+L+W++ + SLW++ + L++ S W+++ ++ Sbjct: 120 DVCTPKDEGGLGIRSLKEANKVSLLKLIWRML-SSTSLWVQWLRLYLLRKGSFWSISGNT 178 Query: 223 D-SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGILEQLIPPSNRLGSALQR 399 W W+KILK R LA+ F+K I NG+ T FW +NW +G L + + + Sbjct: 179 TLGSWMWKKILKHRALASGFVKHDIHNGSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITL 238 Query: 400 YNKVAE-ILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNE-EDMVVW---NPVDSGIFS 564 + VAE ++N + LL+I + ++ ED V W + F+ Sbjct: 239 HASVAEAVVNHRP---RRHRHDTLLRIEDVIAEVRHQGLTSGEDTVRWKGNGDIFKPCFN 295 Query: 565 IKTAWDKVRISYPTCPWWSMVWFPQA 642 K W R W+ VWF A Sbjct: 296 TKETWAATREPKLKVNWYKGVWFSHA 321 >dbj|BAB08692.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] gi|93007380|gb|ABE97193.1| hypothetical protein At5g13655 [Arabidopsis thaliana] Length = 385 Score = 95.1 bits (235), Expect = 2e-17 Identities = 59/216 (27%), Positives = 96/216 (44%), Gaps = 17/216 (7%) Frame = +1 Query: 46 ITRPKAEGGLNIRIPKDMNKADLLQLLWKIACNKESLWIK*IHESYLKRDSIWTVTPHSD 225 + PK+EGGL +R ++ NK +L+L+W+I K SLW+ + + L+ S+W V S Sbjct: 24 VCTPKSEGGLGLRAVEETNKVCMLKLIWRILSAKGSLWVDWVKKHLLRGGSLWAVKETSS 83 Query: 226 -SPWTWRKILKIRNLAATFIKSVIGNGNETKFWLNNWHPMGIL----------EQLIPPS 372 W W+K+LK R+ A F K + NG T FW ++W +G L + IP Sbjct: 84 RGSWIWKKLLKYRDKAKCFHKVDVRNGESTSFWYDSWSSLGCLYDKFGERGCIDMGIPKD 143 Query: 373 NRLGSAL---QRYNKVAEILNQEGWSFPTSGNTNLLQIFEACKTIQRIPWNEEDMVVWNP 543 + L SA+ +R +LN + ++ E D+ +W Sbjct: 144 STLSSAIMTTRRRKHRQPLLNAVETEIQKQKQSRIV--------------TERDVALWKG 189 Query: 544 VDSGI---FSIKTAWDKVRISYPTCPWWSMVWFPQA 642 + G F K W ++R + P + +WF A Sbjct: 190 KEDGFHPTFLSKETWSQIRNTQPEMQGYRGIWFSNA 225