BLASTX nr result
ID: Cocculus22_contig00008921
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cocculus22_contig00008921 (1712 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ... 233 2e-58 gb|AAC63678.1| putative non-LTR retroelement reverse transcripta... 232 3e-58 gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] 229 2e-57 gb|AAC95175.1| putative non-LTR retroelement reverse transcripta... 224 1e-55 gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal... 219 3e-54 emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678... 219 4e-54 gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 218 9e-54 gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc... 212 4e-52 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 211 6e-52 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 211 8e-52 gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana] 211 1e-51 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 208 7e-51 gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thali... 194 8e-47 emb|CAB72467.1| putative protein [Arabidopsis thaliana] 193 2e-46 gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CA... 192 4e-46 gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,... 189 4e-45 emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga... 189 4e-45 gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub... 185 5e-44 ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298... 184 8e-44 ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom... 183 2e-43 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 233 bits (594), Expect = 2e-58 Identities = 141/456 (30%), Positives = 222/456 (48%), Gaps = 23/456 (5%) Frame = +3 Query: 90 TGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYI 269 +GQLPV++LGLP+I +LS DC PL+ +++ W ++ LSYAGRL L+ SVL S Sbjct: 764 SGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICN 823 Query: 270 FWTGAFPIPYSVCSKLESLMGSFLHGKSKLR----LISWATICRPLEEGGLGIRRIKDMN 437 FW AF +P +LE + +FL +++ ISW +C+P +EGGLG+R +K+ N Sbjct: 824 FWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEAN 883 Query: 438 KAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIW-TATIPNDVSWVYRRILKIRNQFAHH 614 KL+W I S SLWV+W+ LRN S W + SW+++++LK R Sbjct: 884 DVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTL 943 Query: 615 CFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV----REAGYWNSP 782 VGNG T FW W G LL+ G+ + I RR T++E R+ + N Sbjct: 944 SKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRNDV 1003 Query: 783 PS-SSPMVRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEW 950 + ++ +W + ED+ +W S FS W R W Sbjct: 1004 YNVIEDALKKSW-------DTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSARVPW 1056 Query: 951 TELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECP 1124 +++WF PK SF W +LPT D++ + G + C C +E+ DHLFF C Sbjct: 1057 HKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHLFFTCS 1116 Query: 1125 FSSEVWMRIKVKCWRNVQVVRGRFQESQT--------ILRSFGLNRADGVIRKLCYTVTV 1280 F+S +W V + RG F+ T + + +R + +R+ + T+ Sbjct: 1117 FTSVIW----------VDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRRYVFQATI 1166 Query: 1281 HFIWWERNMRLFNKGWRSATRLAEEIIQLVHQKVST 1388 + +W ERN R + +A++L I + + ++S+ Sbjct: 1167 YIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQLSS 1202 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 232 bits (592), Expect = 3e-58 Identities = 139/452 (30%), Positives = 224/452 (49%), Gaps = 17/452 (3%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 G+LPV++LGLP++ +L+ +D +PL+ R++ W ++ LS+AGRL L+ SVL S F Sbjct: 486 GKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNF 545 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHGKSKLR----LISWATICRPLEEGGLGIRRIKDMNK 440 W AF +P +++ + + L +L +SW IC+P +EGGLG++ +++ NK Sbjct: 546 WMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANK 605 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617 KL+W + S + SLWV+W L+ S W+ + + SW++RR+LK R C Sbjct: 606 VSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFC 665 Query: 618 FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797 V NG T FW W +G L++ G + I R T+ E W+ Sbjct: 666 KIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISRHMTLAEA-----WSRRRRKRH 720 Query: 798 MVRTAWRQFQQI-----PKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWT 953 V +F++I + ED +W FS W IR + W Sbjct: 721 RVEIL-NEFEEILLQKYQHRNIELEDAILWRGKEDVFKARFSTKDTWNHIRTSSNQRAWH 779 Query: 954 ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECPF 1127 + VWF PK SF W + ++L T D++ + G + C C + +E+ DHLFF+C + Sbjct: 780 KGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCY 839 Query: 1128 SSEVWMRIKVKCWRNVQVVRGRFQESQTI--LRSFGLNRADGVIRKLCYTVTVHFIWWER 1301 SSE+W I +NV R + S + + +R + + + V++H IW ER Sbjct: 840 SSEIWTSIA----KNVYKDRFSTKWSAVVNYISDSQPDRIQSFLSRYTFQVSIHSIWRER 895 Query: 1302 NMRLFNKGWRSATRLAEEIIQLVHQKVSTSKK 1397 N R + RSA+ L +I + + ++ST KK Sbjct: 896 NSRRHGEKSRSASNLIRQIDKTIRNQLSTIKK 927 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 229 bits (585), Expect = 2e-57 Identities = 129/418 (30%), Positives = 207/418 (49%), Gaps = 12/418 (2%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 GQLPV++LGLP++ +L+ AD +PL+ +++ W + S+AGR L+KSVL S F Sbjct: 412 GQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSICNF 471 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHGKSKL----RLISWATICRPLEEGGLGIRRIKDMNK 440 W AF +P +++ L SFL S++ ISW +C+P EGGLG+R +K+ N Sbjct: 472 WLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLKEAND 531 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617 KL+W I S+ SLW +W+ +R SIW+ + SW++R+ILKIR+ Sbjct: 532 VSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRDVAKSFS 591 Query: 618 FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797 VGNG++ FW W G L+DT G+ + IPR A++ + +S Sbjct: 592 RVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVADAWTRRSRRRHRTSLL 651 Query: 798 MVRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELVWF 968 +Q+I D ED +W + FS W I+ W + VWF Sbjct: 652 NEIEEMMAYQRIHH--SDAEDTVLWRGKNDVFKPHFSTRDTWHLIKATSSTVSWHKGVWF 709 Query: 969 YDKIPKCSFTCWRMLLSKLPTKDKLTRFGA----QSRCELCWAGVESEDHLFFECPFSSE 1136 PK + W + ++LPT D++ ++ + C LC ++ +HLFF C ++S Sbjct: 710 RHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCTNNSKTLEHLFFSCSYAST 769 Query: 1137 VWMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERNMR 1310 VW + W+ R+ T + + +R +G + + + T++ +W ERN R Sbjct: 770 VWAALAKGIWKTRYST--RWSHLLTHISTHFQDRVEGFLTRYIFQATIYHVWRERNGR 825 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 224 bits (570), Expect = 1e-55 Identities = 142/441 (32%), Positives = 216/441 (48%), Gaps = 21/441 (4%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 G LPVK+LGLP++ +++ +D PLV ++ W + LS+AGRL+L+KSVL S F Sbjct: 912 GTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNF 971 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440 W F +P + ++E + +FL +K I+W+ +C+ EEGGLG++ +K+ N+ Sbjct: 972 WLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANE 1031 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617 L KL+W I S++ SLWV+W++ +R + W+ + SW++R+ILK R++ Sbjct: 1032 VSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFH 1091 Query: 618 FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV--------REAGYW 773 V +G T FW W P G L G + IP AT+ EV A + Sbjct: 1092 RMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVAEVMNTHRRKRHRADFL 1151 Query: 774 NSPPSSSPMVRTAWRQFQQIPKLGCDEEDQFVWSPCPSGLFSVASAWEQIRHHYDVWEWT 953 N S + R R L +ED F S FS + W+QIR +W Sbjct: 1152 NQIKSQIELARQD-RSTDGDRSLWKQKEDTFKSS------FSSSKTWQQIRSISLRCDWY 1204 Query: 954 ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECPF 1127 VWF PK SF W ++L T DK+ ++ GA+ C C +E+ DHLFF CP+ Sbjct: 1205 RGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEELETRDHLFFSCPY 1264 Query: 1128 SSEVWMRIK--VKCWRNV----QVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFI 1289 SS VW + + RN+ + S+ L F L A + ++H + Sbjct: 1265 SSHVWFSLTKGLLNGRNILNWNLITPHLLDSSRPYLHVFTLRYA--------FQASIHSL 1316 Query: 1290 WWERNMRLFNKGWRSATRLAE 1352 W ERN R + A +LA+ Sbjct: 1317 WRERNCRRHGETAIPAAKLAK 1337 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 219 bits (558), Expect = 3e-54 Identities = 137/452 (30%), Positives = 218/452 (48%), Gaps = 21/452 (4%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 GQLPV++LGLP++ +L+ D +PL ++ W ++ LS+AGRL L+ SVL S+ F Sbjct: 170 GQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNF 229 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHGKSKLR----LISWATICRPLEEGGLGIRRIKDMNK 440 W AF +P + ++ S+ +FL +L +SW IC+P +EGGLG+R + + N Sbjct: 230 WMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANV 289 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV--SWVYRRILKIRNQFAHH 614 + KL+W + S+ SLWV+W L+ S W+ T PN SW+++++LK R Sbjct: 290 VSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLT-PNSSLGSWMWKKMLKYRETAKPF 348 Query: 615 CFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSS 794 V NG T FW W G L+D G+ ++ I R T+ E W++ Sbjct: 349 SRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEA-----WSNRRRRK 403 Query: 795 PM------VRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWE 947 + A Q Q L ED +W FS W Q+R + Sbjct: 404 HRTEQLNDIEAALNQKYQTRNL--LREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVA 461 Query: 948 WTELVWFYDKIPKCSFTCWRMLLSKLPT--KDKLTRFGAQSRCELCWAGVESEDHLFFEC 1121 W + VWF PK F W L ++L T + +L G+ +C C +E+ DHLFF C Sbjct: 462 WYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSIETRDHLFFSC 521 Query: 1122 PFSSEVWMRIKVKCWRNVQVVRGRFQ-ESQTILRSFGLNRADGV---IRKLCYTVTVHFI 1289 ++S +W I V++ RF + QTI+ + D + + + + +TVH + Sbjct: 522 SYASAIWTAIA------KNVLQHRFSTDWQTIVNYISETQTDRIRSFLSRYIFQLTVHTV 575 Query: 1290 WWERNMRLFNKGWRSATRLAEEIIQLVHQKVS 1385 W ERN R + R++ L + + + ++S Sbjct: 576 WKERNDRRHGEEPRTSANLISWMDKQIRNQLS 607 >emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1| putative protein [Arabidopsis thaliana] Length = 473 Score = 219 bits (557), Expect = 4e-54 Identities = 143/453 (31%), Positives = 214/453 (47%), Gaps = 19/453 (4%) Frame = +3 Query: 141 LSIADCAPLVGMFAR-KLEGWQAKILSYAGRLELVKSVLQSSYIFWTGAFPIPYSVCSKL 317 L + C +V MF+R K+ W A+ LSYAGRL L+ SVL S FW GAF +P ++ Sbjct: 30 LDVTHCN-IVTMFSRQKICSWSARFLSYAGRLNLISSVLWSICNFWMGAFRLPRDCIREI 88 Query: 318 ESLMGSFLHGKSKLRL----ISWATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIYSSKK 485 + + ++L +L I+WA +C+P EEGGLG+R +K+ N KL+W I S Sbjct: 89 DKMCSAYLWSGGELNTSKAKITWAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHAD 148 Query: 486 SLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHCFNLVGNGDATKFWLH 662 SLWV+WI S L+ S W + SW++R+ILK R+ C + NG T FW Sbjct: 149 SLWVKWIQSSLLKKVSFWAVRENTSLGSWMWRKILKFRDIARTLCKVEINNGARTSFWYD 208 Query: 663 RWHPQGMLLDTFGENCRLFTRIPRRATIKEV-----REAGYWNSPPSSSPMVRTAWRQFQ 827 W G L+D+ G+ + I + AT+ E R N + +W Sbjct: 209 DWSDLGRLIDSAGDRGAIDLGINKHATVVEAWGNRRRRRHRTNFLNRVEERLILSWNSRN 268 Query: 828 QIPKLGCDEEDQFVWSPCPS---GLFSVASAWEQIRHHYDVWEWTELVWFYDKIPKCSFT 998 Q ED+ +W + +FS W IR + W + VWF IPK +F Sbjct: 269 Q-------AEDRALWKGKENRFRSIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFC 321 Query: 999 CWRMLLSKLPTKDKLT--RFGAQSRCELCWAGVESEDHLFFECPFSSEVWMRIKVKCWRN 1172 W + ++L T D++T G + C LC +ES DHLFF CPF++E+W + + Sbjct: 322 MWLAVHNRLSTGDRMTLWNMGVDATCILCNKALESRDHLFFSCPFATEIWEPLAKTIYNT 381 Query: 1173 VQVVRGRFQESQTILRSFGLN---RADGVIRKLCYTVTVHFIWWERNMRLFNKGWRSATR 1343 + + QTI+ + N R G + + VT++ +W ERN R S++R Sbjct: 382 C-----FYTDWQTIINNVSRNWPDRIAGFLARCILQVTIYTLWRERNERKHGASPNSSSR 436 Query: 1344 LAEEIIQLVHQKVSTSKKLLSNSLDGEFLAQFW 1442 L I + + + K+ D F Q W Sbjct: 437 LISWIDKHIRNHLMAIKQSGDRRFDRGF--QVW 467 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 218 bits (554), Expect = 9e-54 Identities = 140/446 (31%), Positives = 209/446 (46%), Gaps = 16/446 (3%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 G LP+++LGLP++ KL IA+ PL+ + W K LS+AGR++L+ SV+ S F Sbjct: 758 GTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINF 817 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440 W F +P ++ESL FL ++K +SWA +C P EGGLG+RR+ + NK Sbjct: 818 WMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNK 877 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHCF 620 +L+W ++ +K SLW W H L S W SW ++R+L +R Sbjct: 878 TLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLV 937 Query: 621 NLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSPM 800 VGNG +W W G L G+ R+P A + W P S S Sbjct: 938 CKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVASAFSEDGWRLPVSRSAP 997 Query: 801 VRTAWRQF--QQIPKLGCDEEDQFVWSP----CPSGLFSVASAWEQIRHHYDVWEWTELV 962 + +P ++ D++ WS C FS A WE IR V W + Sbjct: 998 AKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQG--FSAAKTWEAIRPKATVKSWASSI 1055 Query: 963 WFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRCELCWAGVESEDHLFFECPFSSE 1136 WF +PK +F W L++L T+ +L +G C LC ES DHL C FS++ Sbjct: 1056 WFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCSFASESRDHLLLICEFSAQ 1115 Query: 1137 VWMRIKVKCWRNVQVVRGRFQESQTILRSF---GLNRADGVIRKLCYTVTVHFIWWERNM 1307 VW + +R + R R S + L S+ A ++RK+ V V+ +W +RN Sbjct: 1116 VWRLV----FRRI-CPRQRLFSSWSELLSWVRQSSPEAPPLLRKIVSQVVVYNLWRQRNN 1170 Query: 1308 RLFNKGWRSATRLAEEII-QLVHQKV 1382 L N + RLA +I +LV +++ Sbjct: 1171 LLHN-----SLRLAPAVIFKLVDREI 1191 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 212 bits (540), Expect = 4e-52 Identities = 127/445 (28%), Positives = 201/445 (45%), Gaps = 13/445 (2%) Frame = +3 Query: 87 ITGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSY 266 + G P ++LGLP++ KL +D + L+ A + W K LS+AGRL+L+ SV+ S+ Sbjct: 755 VNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTV 814 Query: 267 IFWTGAFPIPYSVCSKLESLMGSFLHGKSKLRL----ISWATICRPLEEGGLGIRRIKDM 434 FW +F +P +E + FL G R +SW C P EGGLG+R Sbjct: 815 NFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTW 874 Query: 435 NKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHH 614 NK +L+W +++ + SLWV W H+ LR+ + W A + SW+++ IL +R Sbjct: 875 NKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIWKAILGLRPLAKRF 934 Query: 615 CFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPS-- 788 VGNG +W W G L++ G + T I A + E + W P + Sbjct: 935 LRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAVVTEASSSTGWILPSART 994 Query: 789 -SSPMVRTAWRQFQQIPKLGCDEEDQFVW--SPCPSGLFSVASAWEQIRHHYDVWEWTEL 959 ++ + G ED + W S FS WE +R W Sbjct: 995 RNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKLTWECLRQRDTTKLWAAA 1054 Query: 960 VWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFGAQ--SRCELCWAGVESEDHLFFECPFSS 1133 VW+ IPK +F W L++LP + + T + S C +C E+ DHLF C S Sbjct: 1055 VWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLCCVCQRETETRDHLFIHCTLGS 1114 Query: 1134 EVWMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRA--DGVIRKLCYTVTVHFIWWERNM 1307 +W ++ + R+ F+E + I+ N+ G ++KL + IW ERN Sbjct: 1115 LIWQQVLARFGRSQM-----FREWKDIIEWMLSNQGSFSGTLKKLAVQTAIFHIWKERNS 1169 Query: 1308 RLFNKGWRSATRLAEEIIQLVHQKV 1382 RL + S T + ++I + + + Sbjct: 1170 RLHSAMSASHTAIFKQIDRSIRDSI 1194 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 211 bits (538), Expect = 6e-52 Identities = 136/442 (30%), Positives = 203/442 (45%), Gaps = 12/442 (2%) Frame = +3 Query: 24 MTLIYKITNKSWGF*CKNMKNITGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQ 203 + L +IT+ ++GF G P+++LGLP++ KL IAD PL+ + +L W Sbjct: 602 LDLSERITSAAYGF-------PAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWV 654 Query: 204 AKILSYAGRLELVKSVLQSSYIFWTGAFPIPYSVCSKLESLMGSFLHGKS----KLRLIS 371 +K LS+AGR +L+ SV+ FW F +P K+ESL FL S K +S Sbjct: 655 SKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVS 714 Query: 372 WATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATI 551 W C P EGGLG R + NK L +L+W ++ SLW QW L + S W Sbjct: 715 WVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNA 774 Query: 552 PNDVSWVYRRILKIRNQFAHHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIP 731 W ++ +L +R VGNG FW W G L+ G+ RIP Sbjct: 775 LQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIP 834 Query: 732 RRATIKEVREAGYWNSPPSSSPMVRTAWRQFQQIPKLG-CDEEDQFVWSPCPSGL----F 896 A + + + W P S S + +P D + W C + F Sbjct: 835 FSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSW--CVDDVDCQGF 892 Query: 897 SVASAWEQIRHHYDVWEWTELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRC 1070 S A WE +R V W + VWF +PK +F W L++LPT+ +L +G + + C Sbjct: 893 SAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAEC 952 Query: 1071 ELCWAGVESEDHLFFECPFSSEVWMRIKVK-CWRNVQVVRGRFQESQTILRSFGLNRADG 1247 LC E+ DHL C FSS+VW + ++ C R Q + + E + R A Sbjct: 953 CLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPR--QRLLCTWAELLSWTRQ-STAAAPS 1009 Query: 1248 VIRKLCYTVTVHFIWWERNMRL 1313 ++RK+ + V+ +W +RN+ L Sbjct: 1010 LLRKVVAQLVVYNLWRQRNLVL 1031 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 211 bits (537), Expect = 8e-52 Identities = 136/442 (30%), Positives = 202/442 (45%), Gaps = 12/442 (2%) Frame = +3 Query: 24 MTLIYKITNKSWGF*CKNMKNITGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQ 203 + L +IT+ ++GF G P+++LGLP++ KL IAD PL+ + +L W Sbjct: 602 LDLSERITSAAYGF-------PAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWV 654 Query: 204 AKILSYAGRLELVKSVLQSSYIFWTGAFPIPYSVCSKLESLMGSFLHGKS----KLRLIS 371 +K LS+AGR +L+ SV+ FW F +P K+ESL FL S K +S Sbjct: 655 SKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVS 714 Query: 372 WATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATI 551 W C P EGGLG R + NK L +L+W ++ SLW QW L + S W Sbjct: 715 WVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNA 774 Query: 552 PNDVSWVYRRILKIRNQFAHHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIP 731 W ++ +L +R VGNG FW W G L+ G+ RIP Sbjct: 775 LQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIP 834 Query: 732 RRATIKEVREAGYWNSPPSSSPMVRTAWRQFQQIPKLG-CDEEDQFVWSPCPSGL----F 896 A + + + W P S S + +P D + W C + F Sbjct: 835 FSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSW--CVDDVDCQGF 892 Query: 897 SVASAWEQIRHHYDVWEWTELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRC 1070 S A WE +R V W VWF +PK +F W L++LPT+ +L +G + + C Sbjct: 893 SAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAEC 952 Query: 1071 ELCWAGVESEDHLFFECPFSSEVWMRIKVK-CWRNVQVVRGRFQESQTILRSFGLNRADG 1247 LC E+ DHL C FSS+VW + ++ C R Q + + E + R A Sbjct: 953 CLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPR--QRLLCTWAELLSWTRQ-STAAAPS 1009 Query: 1248 VIRKLCYTVTVHFIWWERNMRL 1313 ++RK+ + V+ +W +RN+ L Sbjct: 1010 LLRKVVAQLVVYNLWRQRNLVL 1031 >gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana] Length = 438 Score = 211 bits (536), Expect = 1e-51 Identities = 136/415 (32%), Positives = 200/415 (48%), Gaps = 17/415 (4%) Frame = +3 Query: 99 LPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIFWT 278 LP+++LGLP++ KL I++ PLV KL W K LS+AGRL+L+ SV+ +FW Sbjct: 31 LPIRYLGLPLMSRKLKISEFEPLVVKIKAKLNFWAVKSLSFAGRLQLLSSVISGIVVFWM 90 Query: 279 GAFPIPYSVCSKLESLMGSFL-------HGKSKLRLISWATICRPLEEGGLGIRRIKDMN 437 F +P ++ES+ FL H K+K +SW+T+C P EGGLG+R+ + N Sbjct: 91 STFRLPKGCIREIESMCARFLWSGGTDEHHKAK---VSWSTVCLPKAEGGLGVRKFTEWN 147 Query: 438 KAGLCKLLWWIYSSKKSLWVQW--IHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAH 611 A KL+W ++S+ SLWV W H+ ++ W SW +R +L++R + Sbjct: 148 TALNLKLIWLLFSNSGSLWVAWHLFHNLSTSVSNFWLIKEGTTDSWNWRCLLRLRPLASK 207 Query: 612 HCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYW--NSPP 785 F +GNG FW W P G LL G + RIP + + +V W SP Sbjct: 208 FLFCSIGNGLTASFWADSWTPFGPLLTFIGSDGPRNQRIPLCSKVADVVNGNRWLLPSPR 267 Query: 786 SSSPMVRTAWRQFQQIPKLGCDEEDQFVW--SPCPSGLFSVASAWEQIRHHYDVWEWTEL 959 SS+ + A+ IP L ED ++W C FS A W +RH W Sbjct: 268 SSNALNLHAFLTTLSIP-LQPLVEDSYLWKVENCSDIGFSSAHTWNALRHKEVEKPWVSS 326 Query: 960 VWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRCELCWAGVESEDHLFFECPFSS 1133 VWF PK +F W +L TK ++ +G C LC G E+ DHL C FS Sbjct: 327 VWFKGVTPKNAFNMWITHQDRLRTKLRMIAWGFLVSPVCALCQVGFETRDHLMLSCDFSV 386 Query: 1134 EVWMRIKVKCWRNVQVVRGRFQE-SQTILRSFGLNR-ADGVIRKLCYTVTVHFIW 1292 VW ++ + + + FQ S+ IL + ++ A +RKL V+ +W Sbjct: 387 SVWALVRQRIGTPLTI----FQNWSELILWTQNRSKAAPSTLRKLVAQAVVYALW 437 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 208 bits (529), Expect = 7e-51 Identities = 133/443 (30%), Positives = 215/443 (48%), Gaps = 11/443 (2%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 GQLPV++LGLP++ +++ AD +PL+ K+ W A+ LSYAGRL L+ SV+ S F Sbjct: 1062 GQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANF 1121 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440 W A+ +P ++E L +FL K I+W++IC+P +EGGLGI+ + + NK Sbjct: 1122 WMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANK 1181 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617 KL+W + S++ SLWV WI + +R + W+A + + SW+++++LK R Sbjct: 1182 VSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWMWKKLLKYRELAKSMH 1241 Query: 618 FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV-REAGYWNSPPSSS 794 V NG +T FW W G LLD G + IP ++ V R + + Sbjct: 1242 KVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETNLETVLRTHQHRQHRAAIY 1301 Query: 795 PMVRTAWRQFQQIPKLGCDEEDQFVWSPCPSGL---FSVASAWEQIRHHYDVWEWTELVW 965 + ++ QQ + D +W + F W +R H W + VW Sbjct: 1302 NRINAEIQRLQQQEREA--GPDISLWRSLKNDFNKRFITKVTWNNVRTHQPQQNWYKGVW 1359 Query: 966 FYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECPFSSEV 1139 F PK SF W + ++L T D++ + G C LC E+ DHLFF C ++S V Sbjct: 1360 FPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDHLFFSCQYTSYV 1419 Query: 1140 WMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERNMRLFN 1319 W + + + R + T+L + L R + + + +++ IW ERN R Sbjct: 1420 WEALTQRL-LSTNYSRD-WNRLFTLLCTSNLPRDHLFLFRYVFQASIYHIWRERNARRHG 1477 Query: 1320 KGWRSATRLAEEIIQLVHQKVST 1388 + RL + I + V ++S+ Sbjct: 1478 EISSPTNRLIKLIDKTVRNRISS 1500 >gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thaliana] Length = 504 Score = 194 bits (494), Expect = 8e-47 Identities = 116/354 (32%), Positives = 172/354 (48%), Gaps = 15/354 (4%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 G LPV++LGLP++ + S D PL+ +K+ W A+ LSY GRL L+ S+L S F Sbjct: 95 GTLPVRYLGLPLVTKQFSSTDYLPLIDHIKQKICSWSARFLSYTGRLNLISSILWSICNF 154 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHGKSKLRL----ISWATICRPLEEGGLGIRRIKDMNK 440 W GAF +P +++ + ++L +L I+WA +C+P EEGGLG+R +K+ N Sbjct: 155 WMGAFRLPRDCIREIDKMCSAYLWSGGELNTSKAKIAWAFVCKPKEEGGLGLRSLKEAND 214 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617 KL+W I S SLWV+WI S L+ W + SW++R+ILK R+ C Sbjct: 215 VCCLKLIWRIISHADSLWVKWIQSSLLKKVFFWAVRENTSLGSWMWRKILKFRDIARTLC 274 Query: 618 FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV-----REAGYWNSP 782 + NG T FW W G L+++ G+ + I + AT+ E R N Sbjct: 275 KVEINNGAQTSFWYDDWSDLGRLIESAGDRGAIDLGINKHATVVEAWGNRRRRRHRANFL 334 Query: 783 PSSSPMVRTAWRQFQQIPKLGCDEEDQFVWSPCPS---GLFSVASAWEQIRHHYDVWEWT 953 + +W Q ED +W + +FS W IR + W Sbjct: 335 NRVEERLVLSWNSRNQ-------AEDCALWKGKENRFRSIFSTKDTWNHIRTVSNKVAWY 387 Query: 954 ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLT--RFGAQSRCELCWAGVESEDHL 1109 + VWF IPK +F W + ++L T D++T G + C LC +ES DHL Sbjct: 388 KGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDATCILCNNALESRDHL 441 >emb|CAB72467.1| putative protein [Arabidopsis thaliana] Length = 762 Score = 193 bits (490), Expect = 2e-46 Identities = 109/351 (31%), Positives = 168/351 (47%), Gaps = 10/351 (2%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 G+LP+++LGLP++ +LS D APL+ +++ W ++ LS+AGR L+ S++ SS F Sbjct: 318 GELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNF 377 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440 W AF +P + ++E L SFL SK ISW +C+P EGGLG+R +K+ N Sbjct: 378 WLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLRSLKEAND 437 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIW-TATIPNDVSWVYRRILKIRNQFAHHC 617 KL+W I S SLWV+W+ L+ W N SW++++ILK R C Sbjct: 438 VCCLKLVWRIISHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYRGVAKRFC 497 Query: 618 FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797 VGNG++T FW W G L+D G + I R ++ + + Sbjct: 498 KAEVGNGESTSFWFDDWSLLGRLIDVAGIRGTIDMGISRTMSVADAWTSRRRRHHRQEIL 557 Query: 798 MVRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELVWF 968 Q + ++ + +W + FS + W +R + W + VWF Sbjct: 558 NTIEEVLSTQHQKRTQQQQQGRVLWKGKNDIYKDKFSTKNTWNYLRTTSNEVAWHKGVWF 617 Query: 969 YDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFF 1115 PK SF W +L T ++ ++ G C C G+E+ DHLFF Sbjct: 618 PHATPKYSFCLWLAAHDRLATGARMIKWNRGETGDCTFCRQGIETRDHLFF 668 >gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CAB80742.1| AT4g02490 [Arabidopsis thaliana] Length = 657 Score = 192 bits (488), Expect = 4e-46 Identities = 111/362 (30%), Positives = 175/362 (48%), Gaps = 12/362 (3%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 G LPV++LG+P++ K+ D PLV + W A+ LS+AGRL+L+KSV+ S+ F Sbjct: 306 GSLPVRYLGVPLMSQKMKKHDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINF 365 Query: 273 WTGAFPIPYSVCSKLESLMGSFL----HGKSKLRLISWATICRPLEEGGLGIRRIKDMNK 440 W F +P KLE + +FL ++ ISW +C E GGLG++R+ NK Sbjct: 366 WASIFILPNQCLHKLEQMCNAFLWSGAPNSAREAKISWDIVCSSKESGGLGLKRLSSWNK 425 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHCF 620 KL+W ++++ SLWV W V WV+R++ K+R Sbjct: 426 VLALKLIWLLFTASGSLWVSW-------------------VRWVWRKLCKLREVARPFVI 466 Query: 621 NLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKE-VREAGYW-NSPPSSS 794 VG+G +FW W G L+ G + + +++ +R +W S S + Sbjct: 467 CEVGSGITARFWQDNWTGHGPLIHLTGLTGPQLVGLSITSVVRDAIRNDDWWIASSRSRN 526 Query: 795 PMVRTAWRQFQQIPKL-GCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELV 962 P++ + L C+ +D ++W PS FS A W ++ W + V Sbjct: 527 PVILLLKSLLPPVGNLVDCEHDDSYLWKVGDRVPSSKFSTADTWRALQPFSVSVSWHKAV 586 Query: 963 WFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRCELCWAGVESEDHLFFECPFSSE 1136 WF +++PK +F W ++L T+D+L +G + C LC E+ DHLFF C FSS Sbjct: 587 WFTNQVPKHAFISWVTAWNRLHTRDRLRSWGLIVPAECVLCNLVDETRDHLFFACRFSSR 646 Query: 1137 VW 1142 +W Sbjct: 647 IW 648 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 189 bits (479), Expect = 4e-45 Identities = 111/359 (30%), Positives = 168/359 (46%), Gaps = 12/359 (3%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 G LPV++LGLP++ KL+IA+ APL+ + W ++LS+AGR++L+ SV+ F Sbjct: 655 GSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNF 714 Query: 273 WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440 W +F +P K+ESL FL K + ++W+ +C P EGG+G+RR N+ Sbjct: 715 WISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNR 774 Query: 441 AGLCKLLWWIYSSKKSLWVQWIHSRFL-RNNSIWTATIPNDVSWVYRRILKIRNQFAHHC 617 +++W ++S+ SLWV W L ++ S W SW ++ +L++R Sbjct: 775 TLYLRMIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRLRVVAERFI 834 Query: 618 FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797 VGNG FW W P G L+ G R+ A I +V + W+ S Sbjct: 835 RCNVGNGRDASFWFDNWTPFGPLIKFLGNEGPRDLRVHLNAKISDVCTSEGWSIADPRSD 894 Query: 798 MVRTAWRQFQQIP-KLGCDEEDQFVW----SPCPSGLFSVASAWEQIRHHYDVWEWTELV 962 + I + D + W C FS A+ W +R W V Sbjct: 895 QALSLHTHLTNISMPSDAQDLDSYDWVVDNKVCQG--FSAAATWSALRPSSAPVPWARAV 952 Query: 963 WFYDKIPKCSFTCWRMLLSKLPTKDKLTRFGAQ--SRCELCWAGVESEDHLFFECPFSS 1133 WF PK +F W L +LPTK +L +G Q + C LC E+ DHLF C F++ Sbjct: 953 WFKGATPKHAFHLWTAHLDRLPTKVRLASWGMQIDTTCGLCSLHPETRDHLFLSCDFAN 1011 >emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1363 Score = 189 bits (479), Expect = 4e-45 Identities = 156/542 (28%), Positives = 234/542 (43%), Gaps = 16/542 (2%) Frame = +3 Query: 108 KHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIFWTGAF 287 K+LG ++P KL D L+ + GWQAK L+ AGR L+KSV+ S ++ + Sbjct: 752 KYLGCNILPNKLRRGDYDGLLEKVKSAINGWQAKYLNMAGRCTLIKSVVSSFPVYGMQSS 811 Query: 288 PIPYSVCSKLESLMGSFLHGKSK----LRLISWATICRPLEEGGLGIRRIKDMNKAGLCK 455 +P SV +++E FL K L +SW IC P +GGLG RR+ + N A + K Sbjct: 812 LLPVSVMNEIEKDCRKFLWNKMDKSHYLARMSWDRICSPTGKGGLGFRRLHNWNLAFMAK 871 Query: 456 LLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHCFNLVGN 635 L W I + LWV+ + +R+ S +A N S ++R I+K R +GN Sbjct: 872 LGWMIIKDETKLWVRILKARYWERGSFLSAVGKNHHSPIWRDIVKGRELLEKGLVRRIGN 931 Query: 636 GDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSPMVRTAW 815 G +T W H W G L+D G N F + + + G W++ S + Sbjct: 932 GRSTSLWYHWWVGGGPLVDVMGSNIPEFM---SHWQVSNIIKRGRWDTKKISHLLPPDIL 988 Query: 816 RQFQQIPKLGCDE-EDQFVWSPCPSGLFSVASAWEQIRHHYD----VWEWTELVWFYDKI 980 +Q ++IP E ED F W+ +G FSV SA+ I + W L W + Sbjct: 989 KQIKEIPLASMSEVEDDFTWNFEKNGTFSVKSAYYLINRREEETGGKGSWRGL-WRKNIP 1047 Query: 981 PKCSFTCWRMLLSKLPTKDKLTR--FGAQSRCELCWAGVESEDHLFFECPFSSEVWMRIK 1154 K W + + LPT L + +C C +E HLF +C +S VW+ I Sbjct: 1048 FKYKLLIWNGIHNILPTALFLAKRIHNFNPQCVACDHPIEDMIHLFRDCCVASSVWIEIL 1107 Query: 1155 VKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERNMRLFNKGWRS 1334 N Q + + + I F LN+ D + K +T IW RN +F Sbjct: 1108 KHHKPNNQNLFFNLEWEEWI--DFNLNQHDYWVTK--FTTAFWHIWCSRNKTVF------ 1157 Query: 1335 ATRLAEEIIQLVHQKVSTSKKLLSNSLDGEFLAQF--WHVNFDQLR-DPIVCQWSPPPEG 1505 + K N + +F + VN Q +V +W PP +G Sbjct: 1158 -------------ECAVNHPKFTYNRVVADFFTNIRAFQVNNTQGNGSKVVLRWKPPHQG 1204 Query: 1506 ELVLNSDGSLSA--RGCFFGGVIRNHMGEIILGYSGGTSQGSVLLLEALGMYHGLKVAKE 1679 L LN+DG+ A GGV R+ +G LG++ GS E + + GL+VA + Sbjct: 1205 FLKLNTDGAWKADWENAGIGGVFRDAVGNWELGFAKRVDAGSPEAAELMAIREGLQVAWD 1264 Query: 1680 RN 1685 N Sbjct: 1265 CN 1266 >gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. lyrata] Length = 441 Score = 185 bits (470), Expect = 5e-44 Identities = 129/440 (29%), Positives = 200/440 (45%), Gaps = 24/440 (5%) Frame = +3 Query: 141 LSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIFWTGAFPIPYSVCSKLE 320 ++ +D PL+ ++ W A+ LS+AGRL+L+ SV+ S FW AF +P + +++ Sbjct: 1 MTTSDYIPLIERIRERISCWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPNACIKEID 60 Query: 321 SLMGSFLHGKSKLR----LISWATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIYSSKKS 488 L +FL +L +SW +C P EEGGLG+R + + NK KL+W + SS S Sbjct: 61 GLCSAFLWSGPELNRKKAKVSWNDVCMPKEEGGLGLRSLTEANKVCCLKLIWRLLSS-SS 119 Query: 489 LWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHCFNLVGNGDATKFWLHR 665 LWVQW+ +R S W+ + + SW++R++LK R+ + + NG FW Sbjct: 120 LWVQWLRQYVIRKGSFWSLRDTSTLGSWMWRKLLKYRHLASGFTQYEIRNGKGVSFWHDN 179 Query: 666 WHPQGMLLDTFGENCRLFTRIPRRATIKEV-------REAGYWNSPPSSSPMVRTAWRQF 824 W P G L+ G + I AT+ E A + N + +RT Sbjct: 180 WSPLGPLIAISGTRGCIDMGIDIHATVAEALTHRRRRHRADHLNQMEAQLEELRT----- 234 Query: 825 QQIPKLGCDEEDQFVWSP-----CPSGLFSVASAWEQIRHHYDVWEWTELVWFYDKIPKC 989 K + ED +W PS FS W R EW + +WF PK Sbjct: 235 ----KGLVETEDVVLWKGKGGRFKPS--FSTKETWADTREQKPRNEWYQGIWFSHATPKY 288 Query: 990 SFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECPFSSEVWMRIKVKC 1163 SF W ++L T D++ + G C C E+ +HLFF C +S EVW + K Sbjct: 289 SFITWLATKNRLSTGDRMMSWNAGVNLSCVFCQEQTETRNHLFFTCRYSREVWSGLTSKL 348 Query: 1164 WRNVQVVRGRFQESQTIL-----RSFGLNRADGVIRKLCYTVTVHFIWWERNMRLFNKGW 1328 + R + TIL ++ G NR + + + + V+ IW ERN R + Sbjct: 349 -----LTRHYSTDWTTILKLLTDKTLGNNRL--FLLRYAFQILVYSIWKERNSRRHGEEP 401 Query: 1329 RSATRLAEEIIQLVHQKVST 1388 + L + + + V K+ST Sbjct: 402 LPSALLLKRLDKEVRNKLST 421 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. vesca] Length = 958 Score = 184 bits (468), Expect = 8e-44 Identities = 119/416 (28%), Positives = 181/416 (43%), Gaps = 8/416 (1%) Frame = +3 Query: 93 GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272 G PV++LG+P+I KL + DC+PL+ +++ W+ K+LS+AGRL+L++SVL S ++ Sbjct: 587 GTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVY 646 Query: 273 WTGAFPIPYSVCSKLESLMGSFL-----HGKSKLRLISWATICRPLEEGGLGIRRIKDMN 437 W +P V +E + FL G++ + ++W+ IC P EGGLGI+ + N Sbjct: 647 WASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATK-VAWSEICLPKCEGGLGIKDLHCWN 705 Query: 438 KAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHC 617 KA + +W + SS + W W+ L+ NS W A +P+ SW +R++LKIR Sbjct: 706 KALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRELCCSFF 765 Query: 618 FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797 N++G+G AT W WHP G L + N + E+G S S Sbjct: 766 VNIIGDGRATSLWFDNWHPLGPLTLRWSSNI--------------IGESGL-----SKSA 806 Query: 798 MVRTAWRQFQQIPKLGCDEEDQFVWSPCPSGLFSVASAWEQIRHHYDVWEWTELVWFYDK 977 M+ P+G +S +SAW +R + W LVWF Sbjct: 807 ML-------------------------TPNGFYSTSSAWNTLRPSRFIVPWYRLVWFV-- 839 Query: 978 IPKCSFTCWRMLLSKLPTKDKLTRFGAQSRCELCWAGVESEDHLFFECPFSSEVWMRIKV 1157 E+ +HLFF+C +S +W + Sbjct: 840 -------------------------------------AETHNHLFFDCAYSFGIWTHVLS 862 Query: 1158 KCWRNVQVVRGRFQESQTIL---RSFGLNRADGVIRKLCYTVTVHFIWWERNMRLF 1316 KC V + S I ++ N VI KL V+ IW ERN R F Sbjct: 863 KC----DVSKPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRRF 914 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 183 bits (465), Expect = 2e-43 Identities = 154/548 (28%), Positives = 242/548 (44%), Gaps = 18/548 (3%) Frame = +3 Query: 99 LPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIFWT 278 LP+ +LG P+ G + LV ++ GW+ KILS GR+ L++SVL S I+ Sbjct: 1629 LPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLL 1688 Query: 279 GAFPIPYSVCSKLESLMGSFLHGKS----KLRLISWATICRPLEEGGLGIRRIKDMNKAG 446 P V ++ L SFL G S ++ SWA I P+ EGGL IR + ++ +A Sbjct: 1689 QVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGLDIRSLAEVFEAF 1748 Query: 447 LCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHCFNL 626 K LWW + + SLW +++ ++ R S ++R+L H Sbjct: 1749 SMK-LWWRFRTTDSLWTRFMRMKYCRGQLPMQTQPKLHDSQTWKRMLTSSTITEQHMRWR 1807 Query: 627 VGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSPMVR 806 VG G+ FW W + L+ + E FT + + WN + + + Sbjct: 1808 VGQGNVF-FWHDCWMGEAPLISSNQE----FT--SSMVQVCDFFTNNSWNIEKLKTVLQQ 1860 Query: 807 TAWRQFQQIPKLGCDEEDQFVWSPCPSGLFSVASAWEQIRHHYDVWEWTELVWFYDKIPK 986 + +IP + +D+ W+P P+G FS SAW+ IR V +W Sbjct: 1861 EVVDEIAKIP-IDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLT 1919 Query: 987 CSFTCWRMLLSKLPTKDKLTRFGAQ--SRCELCWAGVESEDHLFFECPFSSEVW------ 1142 SF WR+L +P + K+ G Q SRC C ES H+ ++ P + +VW Sbjct: 1920 TSFFLWRLLHDWIPVELKMKSKGLQLASRCRCC-KSEESIMHVMWDNPVAMQVWNYFAKL 1978 Query: 1143 --MRIKVKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERNMRLF 1316 + I C N Q++ F G G IR L + F+W ERN Sbjct: 1979 FQILIINPCTIN-QIIGAWFYS--------GDYCKPGHIRTLVPLFILWFLWVERNDAKH 2029 Query: 1317 NKGWRSATRLAEEIIQLVHQKVSTSKKLLSNSLDGE-FLAQFWHVNF--DQLRDPIVCQW 1487 R+ +++L+ Q++S ++LL G+ +AQ W + F + L P V W Sbjct: 2030 RNLGMYPNRVVWRVLKLI-QQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSW 2088 Query: 1488 SPPPEGELVLNSDGSL-SARGCFFGGVIRNHMGEIILGYSGGTSQGSVLLLEALGMYHGL 1664 P GE LN DGS + GG++R+H GE++ G+S + L E L +Y GL Sbjct: 2089 HKPSLGEFKLNVDGSAKQSHNAAGGGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGL 2148 Query: 1665 KVAKERNV 1688 + ++ N+ Sbjct: 2149 ILCRDYNI 2156