BLASTX nr result
ID: Catharanthus23_contig00011795
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00011795 (862 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   160   4e-37
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   159   1e-36
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                159   2e-36
gb|AAD15471.1| putative non-LTR retroelement reverse transcripta...   157   6e-36
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   152   2e-34
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   151   3e-34
gb|ABD96948.1| hypothetical protein [Cleome spinosa]                  145   2e-32
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   144   3e-32
gb|AAD22330.1| putative non-LTR retroelement reverse transcripta...   142   2e-31
emb|CAB10226.1| reverse transcriptase like protein [Arabidopsis ...   140   8e-31
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               139   1e-30
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   139   2e-30
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       137   5e-30
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   136   1e-29
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           136   1e-29
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               135   2e-29
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   134   3e-29
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   134   3e-29
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...
134 4e-29 dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ... 134 6e-29 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 160 bits (406), Expect = 4e-37 Identities = 97/295 (32%), Positives = 137/295 (46%), Gaps = 31/295 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + +NI LA E++RG+ RKH SP+C K+DIRK YD+V W FL LY F RF+ W Sbjct: 556 IADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSVEWSFLETLLYEFGFPSRFVGWIM 615 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 G + SPFLF +CMEYLSR L +FNF+ Sbjct: 616 ECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFH 675 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PKCE+L I+HL ADDL++F R D S+ + + F SGL + K N++ +D+ Sbjct: 676 PKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDD 735 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQDL------- 761 E + + + +G +PFRY G+ L L A PL++ ++ W Sbjct: 736 ETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGR 795 Query: 762 -----------------IFHMSAAVLDRFISLCCQFLWGG-----NYARVAWKTM 860 IF +S V+ +C +FLW G A VAW T+ Sbjct: 796 LQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATI 850 >ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max] Length = 939 Score = 159 bits (402), Expect = 1e-36 Identities = 101/279 (36%), Positives = 142/279 (50%), Gaps = 26/279 (9%) Frame = +3 Query: 75 ENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXXXX 254 +++ LA E++RG+ RKH +PKC +IDI+K YDTV WD L L L F +FI W Sbjct: 386 DHVMLAFELLRGYERKHGTPKCMLQIDIQKAYDTVHWDALEHILRELGFPDQFIKWIMIA 445 Query: 255 XXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFYPK 428 + G + SP LF++ MEYL+R L+ NFN++ K Sbjct: 446 VRSVTYVFNINGRFTRRLEARRGIRQGDPISPLLFILVMEYLNRILSQLDKIPNFNYHSK 505 Query: 429 CEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDEEE 608 CEK+KI++L ADDL++FSRGD SVQ++ + F GL VNP+K N++ S+D Sbjct: 506 
CEKMKITNLCFADDLLLFSRGDIGSVQIMLDKFNTFLRSMGLHVNPSKCNIYCGSVDINV 565 Query: 609 RNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAW------------ 752 + + GF G MPFRY GI L+ L + Y LIDK+ + W Sbjct: 566 KEQLLLISGFKEGKMPFRYLGIPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQ 625 Query: 753 --QDLI-----FHMSAAVLDRFI-----SLCCQFLWGGN 833 Q +I F M L +F+ ++C FLW GN Sbjct: 626 LIQSVIFATINFWMQCLPLPKFVIMRINAICRSFLWIGN 664 >gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] Length = 740 Score = 159 bits (401), Expect = 2e-36 Identities = 94/297 (31%), Positives = 144/297 (48%), Gaps = 33/297 (11%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + + SP+C KIDI K +D+V W FL TL AL+F + F +W Sbjct: 144 LIENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEALNFPENFCHWIK 203 Query: 249 XXXXXXXXXXXXXXXXX----SKGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFN 416 SK + G + SP+LFVICM LS ++ A +H N Sbjct: 204 LCISTATFSVQVNGELAGFFGSKRGLRQGCAL--SPYLFVICMNVLSHMIDVAAVHRNIG 261 Query: 417 FYPKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASM 596 ++PKC+KL ++HL ADDL++F G SV+ + + K F G SGL ++ K ++LA + Sbjct: 262 YHPKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGV 321 Query: 597 DEEERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQD------ 758 E RN I ++ F G +P RY G+ L + ADY+PL+DKV + +W Sbjct: 322 SELNRNNILSAFPFASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYA 381 Query: 759 ------------------LIFHMSAAVLDRFISLCCQFLWGG-----NYARVAWKTM 860 + + A + LC FLW G A++ W ++ Sbjct: 382 GRLALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSL 438 >gb|AAD15471.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1277 Score = 157 bits (396), Expect = 6e-36 Identities = 94/297 (31%), Positives = 143/297 (48%), Gaps = 33/297 (11%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + + SP+C KIDI K +D+V W FL TL AL F ++F +W Sbjct: 713 LMENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEALKFPEKFRHWIK 772 Query: 249 
XXXXXXXXXXXXXXXXX----SKGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFN 416 SK + G + SP+LFVICM LS ++ A +H N Sbjct: 773 LCISTATFSVQVNSEQAGFFGSKRGLRQGCAL--SPYLFVICMNVLSHMIDVAAVHRNIG 830 Query: 417 FYPKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASM 596 ++PKC+KL ++HL ADDL++F G SV+ + + K F G SGL ++ K ++LA + Sbjct: 831 YHPKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKDFAGKSGLHISLEKSTLYLAEV 890 Query: 597 DEEERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQD------ 758 E RN I ++ F G +P RY G L + ADY+PL+DKV + +W Sbjct: 891 SELNRNNILSAFPFASGQLPVRYLGFPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYA 950 Query: 759 ------------------LIFHMSAAVLDRFISLCCQFLWGG-----NYARVAWKTM 860 + + A + LC FLW G A++ W ++ Sbjct: 951 GRLALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSL 1007 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 152 bits (384), Expect = 2e-34 Identities = 92/295 (31%), Positives = 145/295 (49%), Gaps = 31/295 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + ++ +P+C KIDI K +D+V W FL TL AL+F + F +W Sbjct: 868 LMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIK 927 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 G + SP+LFVICM LS ++ A +H N ++ Sbjct: 928 LCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYH 987 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PKCEK+ ++HL ADDL++F G S++ + V K F G SGL+++ K ++LA + Sbjct: 988 PKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSA 1047 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAW--QDLIFHMS 776 +R +S F G +P RY G+ L + ADY+PLI+ V + +W + L + Sbjct: 1048 SDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGR 1107 Query: 777 AAVLDRFI----------------------SLCCQFLWGG-----NYARVAWKTM 860 A+L+ I LC FLW G A++AW ++ Sbjct: 1108 LALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSI 1162 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1114 Score = 151 bits (382), Expect = 3e-34 Identities = 88/254 (34%), Positives = 127/254 (50%), Gaps = 3/254 (1%) Frame = +3 Query: 75 ENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXXXX 254 +NI LA E++RG+ R+H SP+C K+DIRK YD+V W FL L L F FI W Sbjct: 561 DNILLATELIRGYNRRHVSPRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMAC 620 Query: 255 XXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFYPK 428 G + SPFLF + MEYLSR + FNF+PK Sbjct: 621 VKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPK 680 Query: 429 CEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDEEE 608 CE++K++HL ADDL++F+R D S+ + +F SGL+ + K ++ + EE Sbjct: 681 CERIKLTHLMFADDLLMFARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEE 740 Query: 609 RNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAW-QDLIFHMSAAV 785 + + I +G++PFRY G+ LA L + PLIDK++ W L+ + Sbjct: 741 AEQLADRIQMPIGSLPFRYLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQ 800 Query: 786 LDRFISLCCQFLWG 827 L + I Q WG Sbjct: 801 LVKTILYSMQNYWG 814 >gb|ABD96948.1| hypothetical protein [Cleome spinosa] Length = 539 Score = 145 bits (365), Expect = 2e-32 Identities = 94/290 (32%), Positives = 144/290 (49%), Gaps = 27/290 (9%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 M EN+ LA E++ + R +TS + KID+RK +DTVSW+F+ + + AL+ + F+ W Sbjct: 18 MVENVLLATELVHEYNRPNTSKRAMLKIDLRKAFDTVSWEFITKIMQALNLPRTFVTWVK 77 Query: 249 XXXXXXXXXXXXXXXXXS--KGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 KG + SP+LF++ ME LSR L+ + + Sbjct: 78 VCMETPKFSVSINGELAGYFKGRRGLRQGDPLSPYLFIMSMEVLSRMLDRCAAESRLSLH 137 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PKC I+HLA ADD++IF+ G+ S+ + L +F SGL +N K +FL ++ Sbjct: 138 PKCHSPVITHLAFADDIMIFTSGETRSLLEVKNTLDSFSRASGLYLNTEKTEIFLRGLNG 197 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQ--------- 755 E + + IGF G +P RY G+ L+ V L +DY PL+D+V + +W Sbjct: 198 TEASTLCAVIGFTRGYLPVRYLGVSLSPVRLTKSDYQPLLDRVKAKINSWTTRYLSYAGR 257 Query: 756 
-----DLIFHMSAA-----VLDRFIS-----LCCQFLWG-GNYARVAWKT 857 +I+ M A +L +F + LC FLWG G RV+W T Sbjct: 258 LQLVGTVIYGMVNAWGMIFMLPKFFTKQVDRLCAGFLWGAGTTHRVSWDT 307 >ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum] Length = 717 Score = 144 bits (364), Expect = 3e-32 Identities = 90/290 (31%), Positives = 136/290 (46%), Gaps = 31/290 (10%) Frame = +3 Query: 75 ENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXXXX 254 +NI LA E+++ + RK+ SP+C KID+ K YD+V W FL + + L F F W Sbjct: 366 DNIILAHELVKAYTRKNVSPRCMLKIDLHKAYDSVEWPFLEQVMEGLGFPDLFTKWVMKC 425 Query: 255 XXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFYPK 428 + G + SPFLF I MEYLSR L ++F ++PK Sbjct: 426 VKTVNYTIVVNGQNTQRFDAAKGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYHPK 485 Query: 429 CEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDEEE 608 KL ++HL ADDL++FSRGD S++ L + F SGL+ N K +++ + E Sbjct: 486 YAKLDVTHLCFADDLLLFSRGDLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQMEV 545 Query: 609 RNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAW------------ 752 R I +G+ + +PF+Y G+ L+ L + PLI+KV + +W Sbjct: 546 RQQIIQQLGYTIEELPFKYLGVPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQ 605 Query: 753 --QDLIFHMSAAVLDRFI----------SLCCQFLWGG-----NYARVAW 851 + ++F + A FI LC +LW G A +AW Sbjct: 606 LVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYLWSGVGYVTKKALIAW 655 >gb|AAD22330.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 631 Score = 142 bits (357), Expect = 2e-31 Identities = 77/224 (34%), Positives = 119/224 (53%), Gaps = 2/224 (0%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + + SP+C KIDI K +D+V W FL TL AL+F ++ +W Sbjct: 141 LMENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEALNFPEKIRHWIK 200 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 G + SP+LFVICM LS ++ A + N ++ Sbjct: 201 LCISTATFSVQVNGELAGFFGNKRGLRQGCALSPYLFVICMNVLSHMIDEAAVRRNIGYH 260 Query: 423 
PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PKC+KL ++HL DDL++F G S++ + + F G SGL ++ K ++LA + E Sbjct: 261 PKCKKLSLTHLCFVDDLMVFIDGQQRSIEGVINIFHEFAGKSGLHISLEKSTLYLAGVSE 320 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVS 734 R+ I ++ F G +P RY G+ L + ADY+PLIDK S Sbjct: 321 PNRDHILSAFSFASGQLPVRYLGLPLMTKQMTTADYSPLIDKPS 364 >emb|CAB10226.1| reverse transcriptase like protein [Arabidopsis thaliana] gi|7268153|emb|CAB78489.1| reverse transcriptase like protein [Arabidopsis thaliana] Length = 318 Score = 140 bits (352), Expect = 8e-31 Identities = 87/278 (31%), Positives = 138/278 (49%), Gaps = 14/278 (5%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + + S +C KIDI K +D+V W FL L LDF + F++W Sbjct: 7 LIENLLLATELVKDYHKDSVSERCAIKIDISKAFDSVQWTFLKNVLLTLDFPQVFVHWIM 66 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 + G + SP+LFVI M+ LS+ L+ A F ++ Sbjct: 67 LCVTTASFSVQVNGELAGYFNSSRGLRQGCSLSPYLFVIVMDVLSKKLDRAAGLRKFGYH 126 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PKC+ L ++HL+ ADD+++ + G S++ + EV +F S LK++ K ++LA + + Sbjct: 127 PKCKNLGLTHLSFADDIMVLTDGKLRSLEGIVEVFDSFAKQSCLKISMEKTTIYLAGISD 186 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQDLI--FHMS 776 R F VG +P RY G+ L L DY PL++++ + + W I F +S Sbjct: 187 TVRQEFEEQFHFEVGCLPVRYLGLPLVTKRLTSQDYNPLLEQIKRRIGTWTARICNFWLS 246 Query: 777 AAVLDR-----FISLCCQFLWGG-----NYARVAWKTM 860 A L R LC FLW G A++AW T+ Sbjct: 247 AFRLPRECIREIDKLCSAFLWSGPELSTKKAKIAWDTI 284 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 139 bits (351), Expect = 1e-30 Identities = 90/294 (30%), Positives = 143/294 (48%), Gaps = 33/294 (11%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + + S +C KIDI K +D+V W FL TL A++F FI+W Sbjct: 218 LIENLLLATELVKDYHKDSISARCAIKIDISKAFDSVQWSFLTNTLVAMNFSPTFIHWIN 277 Query: 249 
XXXXXXXXXXXXXXXXX----SKGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFN 416 SK + G + SP+LFVICM+ LS+ L+ A F Sbjct: 278 LCITTASFSVQVNGDLVGYFQSKRGLRQGCSL--SPYLFVICMDVLSKMLDKAAGVRKFG 335 Query: 417 FYPKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASM 596 F+PKC++L ++HL+ ADDL++ S G S++ + EV F SGL+++ K +++A + Sbjct: 336 FHPKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGV 395 Query: 597 DEEERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQ------- 755 + I F VG +P RY G+ L L ADY+PL++++ K + W Sbjct: 396 SPIIKQEIAAKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFA 455 Query: 756 ---DLI--------------FHMSAAVLDRFISLCCQFLWGG-----NYARVAW 851 +LI F + + LC FLW G + A+++W Sbjct: 456 GRFNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISW 509 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 139 bits (349), Expect = 2e-30 Identities = 83/290 (28%), Positives = 135/290 (46%), Gaps = 29/290 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + + SP+C KID+ K +D+V W FL TL ALD ++FI+W Sbjct: 837 LMENLLLASELVKDYHKDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHWIN 896 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFNFYPK 428 SP+LFVICM LS L+ + + F ++P+ Sbjct: 897 LCISTASFSVQVNGLRQGCS---------LSPYLFVICMNVLSAMLDKGAVEKRFGYHPR 947 Query: 429 CEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDEEE 608 C + ++HL ADD+++FS G S++ + + K F SGL ++ K +F+AS+ E Sbjct: 948 CRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSET 1007 Query: 609 RNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQDLI-------- 764 I F G++P RY G+ L + LAD PL++K+ + +W++ Sbjct: 1008 CASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQ 1067 Query: 765 ----------------FHMSAAVLDRFISLCCQFLWGG-----NYARVAW 851 F + A + + FLW G + A+VAW Sbjct: 1068 LLNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHKAKVAW 1117 >gb|AAG50806.1|AC079281_8 unknown 
protein [Arabidopsis thaliana] Length = 1213 Score = 137 bits (345), Expect = 5e-30 Identities = 90/295 (30%), Positives = 144/295 (48%), Gaps = 31/295 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA +++ G+ + SP+ K+D++K +D+V W+F+ L AL ++FINW Sbjct: 565 LAENVLLATDLVHGYNWSNISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWIS 624 Query: 249 XXXXXXXXXXXXXXXXXS--KGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 K + + SP+LFV+ ME S L+ +++ Sbjct: 625 QCISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYH 684 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PK L ISHL ADD++IF G S+ +CE L F SGLKVN K +++LA +++ Sbjct: 685 PKASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQ 744 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQD-------- 758 E N + GF +G +P RY G+ L L++A+Y PL++K++ +W + Sbjct: 745 LESN-ANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGR 803 Query: 759 -----------LIFHMSAAVL-----DRFISLCCQFLWGGNY-----ARVAWKTM 860 + F MS +L R SLC +FLW GN +V+W + Sbjct: 804 IQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAAL 858 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 136 bits (342), Expect = 1e-29 Identities = 94/292 (32%), Positives = 141/292 (48%), Gaps = 31/292 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E++ G+ R + SP+ K+D++K +D+V W+F+ L AL +R+INW Sbjct: 425 LAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIH 484 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 G + SP+LFV+ ME S+ L +++ Sbjct: 485 QCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYH 544 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PK L ISHL ADD++IF G S+ +CE L F SGLKVN K +F A +D Sbjct: 545 PKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDL 604 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQD-------- 
758 ER + + + GF G P RY G+ L L++ADY PL++K+S L +W Sbjct: 605 SER-ITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGR 663 Query: 759 -----------LIFHMSAAVL-----DRFISLCCQFLWGGNY-----ARVAW 851 + F MS +L + SLC +FLW G+ ++V+W Sbjct: 664 TQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSW 715 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 136 bits (342), Expect = 1e-29 Identities = 94/292 (32%), Positives = 141/292 (48%), Gaps = 31/292 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E++ G+ R + SP+ K+D++K +D+V W+F+ L AL +R+INW Sbjct: 425 LAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIH 484 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 G + SP+LFV+ ME S+ L +++ Sbjct: 485 QCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYH 544 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PK L ISHL ADD++IF G S+ +CE L F SGLKVN K +F A +D Sbjct: 545 PKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDL 604 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQD-------- 758 ER + + + GF G P RY G+ L L++ADY PL++K+S L +W Sbjct: 605 SER-ITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGR 663 Query: 759 -----------LIFHMSAAVL-----DRFISLCCQFLWGGNY-----ARVAW 851 + F MS +L + SLC +FLW G+ ++V+W Sbjct: 664 TQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSW 715 >emb|CAB72467.1| putative protein [Arabidopsis thaliana] Length = 762 Score = 135 bits (339), Expect = 2e-29 Identities = 84/292 (28%), Positives = 133/292 (45%), Gaps = 31/292 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA ++++ + + S +C KIDI K D+V W FL TL A+ F + FI+W Sbjct: 124 LIENVLLATDLVKDYHKDSISERCAIKIDISKASDSVQWSFLINTLTAMHFPEMFIHWIR 183 Query: 249 XXXXXXXXXXXXXXXXXS--KGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 + S + SP+LFVICM+ LS+ L+ ++ Sbjct: 184 
LCITTPSFSVQVNGELAGFFQSSRGLRQGCALSPYLFVICMDVLSKLLDKVVGIGRIGYH 243 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 P C+++ ++HL+ ADDL+I + G S++ + EV F SGLK++ K +F A + Sbjct: 244 PHCKRMGLTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSS 303 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQDLI------ 764 R + F VG +P RY G+ L L DYAPLI+++ K + +W Sbjct: 304 TSRAQLHTHFPFEVGELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGR 363 Query: 765 ------------------FHMSAAVLDRFISLCCQFLWGG-----NYARVAW 851 F + A + LC FLW G A+++W Sbjct: 364 FNLISSIIWSSCNFWLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISW 415 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 134 bits (338), Expect = 3e-29 Identities = 76/230 (33%), Positives = 120/230 (52%), Gaps = 2/230 (0%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + + S +C KIDI K +D++ W FL L A++F FI+W Sbjct: 292 LIENVLLATELVKDYHKDSISTRCAMKIDISKAFDSLQWSFLTHVLAAMNFPGEFIHWIS 351 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*G--KEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 G + SP+LFVI M+ LSR L+ A F ++ Sbjct: 352 LCMSTASFSIQVNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYH 411 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 P+C+ L ++HL ADDL+I + G SV + +VL F GLK+ K ++LA + + Sbjct: 412 PRCKTLGLTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSD 471 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAW 752 R L+++ F VG +P RY G+ L L +DY+PLID++ + + W Sbjct: 472 HSRQLMSSRYSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMW 521 >gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score: 72.31) [Arabidopsis thaliana] Length = 928 Score = 134 bits (338), Expect = 3e-29 Identities = 89/297 (29%), Positives = 135/297 (45%), Gaps = 33/297 (11%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA EI++ + + S +C KIDI K +D+V W FL L A++F F +W Sbjct: 459 
LIENVLLATEIVKDYHKDSVSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWIT 518 Query: 249 XXXXXXXXXXXXXXXXXSKGSMV*GKEIL----FSPFLFVICMEYLSRSLN*ATLHENFN 416 S +E+ SP+LFVI M+ LS+ L+ A F Sbjct: 519 LCITTASFSVQVNGELAGVFSSA--RELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFG 576 Query: 417 FYPKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASM 596 ++PKC + ++HL+ ADDL+I S G S+ + +VL F SGLK++ K ++LA + Sbjct: 577 YHPKCRAIGLTHLSFADDLMILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGV 636 Query: 597 DEEERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQD------ 758 I F VG +P RY G+ L L +D PLI+++ K + AW Sbjct: 637 QASVYQEIVQKFSFDVGKLPVRYLGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFA 696 Query: 759 ------------------LIFHMSAAVLDRFISLCCQFLWGG-----NYARVAWKTM 860 F + A + LC FLW G N A+V+W+ + Sbjct: 697 GRLNLISSTLWSICNFWMAAFRLPRACIREIDKLCSAFLWSGTELSSNKAKVSWEAI 753 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 134 bits (337), Expect = 4e-29 Identities = 94/293 (32%), Positives = 150/293 (51%), Gaps = 31/293 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 +TEN+ LA E+++G + + S + K+D+RK +D+V W F+ ETL A + RF+NW Sbjct: 564 LTENVLLATELVQGFGQANISSRGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIK 623 Query: 249 XXXXXXXXXXXXXXXXXS--KGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 KGS + SP LFVI ME LSR L + ++ Sbjct: 624 QCITSTSFSINVSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYH 683 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PK +++IS LA ADDL+IF G S++ + VL++F +SGL++N K V+ A +++ Sbjct: 684 PKASEVRISSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLED 743 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKV--------SKTL----- 743 ++ T + GF G PFRY G+ L L+ +DY+ LIDK+ +KTL Sbjct: 744 TDKE-DTLAFGFVNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGR 802 Query: 744 ------LAWQDLIFHMSAAVLDR-----FISLCCQFLWGGNYAR-----VAWK 854 + + + F +S+ +L + +C +FLWG + R V+W+ Sbjct: 803 LQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQ 855 
>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 134 bits (336), Expect = 6e-29 Identities = 82/292 (28%), Positives = 134/292 (45%), Gaps = 31/292 (10%) Frame = +3 Query: 69 MTENIHLAQEIMRGHARKHTSPKCTPKIDIRKTYDTVSWDFL*ETLYALDFLKRFINWXX 248 + EN+ LA E+++ + + S +C KIDI K +D+V W FL L F + FI+W Sbjct: 571 LIENLLLATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWIN 630 Query: 249 XXXXXXXXXXXXXXXXXS--KGSMV*GKEILFSPFLFVICMEYLSRSLN*ATLHENFNFY 422 + S + SP+LFVICM+ LS+ L+ A +F ++ Sbjct: 631 ICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYH 690 Query: 423 PKCEKLKISHLACADDLVIFSRGDYLSVQVLCEVLKAFGGVSGLKVNPTKFNVFLASMDE 602 PKC+ + ++HL+ ADDL++ S G S++ + +V F SGL+++ K V+LA + Sbjct: 691 PKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSA 750 Query: 603 EERNLITNSIGFFVGAMPFRYSGILLAGVYLKLADYAPLIDKVSKTLLAWQD-------- 758 RN + + F G +P RY G+ L L D PL+++V K + +W Sbjct: 751 TARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGR 810 Query: 759 ----------------LIFHMSAAVLDRFISLCCQFLWGG-----NYARVAW 851 F + + +C FLW G N A+++W Sbjct: 811 LNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISW 862