BLASTX nr result
ID: Atropa21_contig00031455
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00031455 (2507 letters)

Database: nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...  362  e-136
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...  377  e-101
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...  247  2e-97
gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptas...  348  5e-93
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]      234  9e-90
ref|XP_004247247.1| PREDICTED: uncharacterized protein LOC101256...  298  6e-78
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]  209  8e-78
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...  211  2e-70
ref|XP_004252692.1| PREDICTED: uncharacterized protein LOC101261...  243  4e-70
ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659...  158  7e-70
ref|XP_006577697.1| PREDICTED: uncharacterized protein LOC102664...  136  1e-67
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...  255  6e-65
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]          255  6e-65
emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulga...  197  8e-64
ref|XP_004253225.1| PREDICTED: uncharacterized protein LOC101268...  185  8e-64
gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptas...  222  1e-63
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...  247  2e-62
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...  245  6e-62
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...
241 9e-61 gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,... 241 1e-60 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 362 bits (928), Expect(3) = e-136 Identities = 183/429 (42%), Positives = 264/429 (61%) Frame = +2 Query: 1220 KIQLVRQELNDVQVC*DTHDDQ*LYAREKQLKSELEKWNQVEESILKQKPRVQWLSLGDS 1399 K++ +R +L D+Q D + + K + ++L W+ +E+SIL+QK R+ WL GD+ Sbjct: 299 KVKNLRHQLQDLQSQDDFDHNDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITWLQQGDT 358 Query: 1400 NSAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQSD 1579 NS FF +++ R N I L + GR++ EV++EI+ FYK LL + A+ L G+ + Sbjct: 359 NSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRASTLMGVDLN 418 Query: 1580 VMNNDPILRRDQQLLLMAPVTKQEIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVV 1759 + L + L+ V EI AL I + KAPG DGFN FFKKSW I E+ Sbjct: 419 TVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQEIY 478 Query: 1760 AVVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGV 1939 A + + R+ + IN V L+PKVQ+ + KEFRPI+C TV+YK+IS +LT R++G+ Sbjct: 479 AGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTNRMKGI 538 Query: 1940 MDSIIDSSQAAFVSGRVITDNILLSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLE 2119 + +++ +Q+ F+ GR I DNILL+ EL+ GY RK +S RC++K+D++KAYDS+EW FLE Sbjct: 539 IGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSVEWSFLE 598 Query: 2120 QVLVALNFPSTFVQWIMMCVQSVSYSILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEY 2299 +L FPS FV WIM CV +VSYS+L+NG PT PF A+K + MEY Sbjct: 599 TLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFALCMEY 658 Query: 2300 FSRFLEQLGQNSQFHFHPKCSGLKLIQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASS 2479 SR LE+L + F+FHPKC L + L F DDLL+FCR D S++ + F+ FS AS Sbjct: 659 LSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFSHASG 718 Query: 2480 LIANLNKSS 2506 L A+ KS+ Sbjct: 719 LAASHEKSN 727 Score = 142 bits (358), Expect(3) = e-136 Identities = 85/225 (37%), Positives = 128/225 (56%), Gaps = 6/225 (2%) Frame = +1 Query: 385 NVRGFNKLHKHKEFLKTVRKEHINIIAIVEHRVHKNKATQIVKKVVPG*H*HYNYDISGK 564 NVRG N K KE + I + A++E 
RV + A+++ K+ NY S + Sbjct: 7 NVRGMNDPFKIKEIKNFLYSHKIVVCALLETRVREQNASKVQGKLGKDWKWLNNYSHSAR 66 Query: 565 ERIWLIWDSAYVDVTILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRRSLWSSL-E 741 ERIW+ W A+V+VT+ HT +Q + C I+ Q K++ A+YG HT+ R+SLWS L + Sbjct: 67 ERIWIGWRPAWVNVTLTHTQEQLMVCDIQ--DQSHKLKMVAVYGLHTIADRKSLWSGLLQ 124 Query: 742 SIEPTVLHPWLIMGDFNVVLRGEDRLNGS*VVDAEVKDFAQCLLTTGLTEMKAIGRFYTW 921 ++ P +I+GDFN V DRL G+ V DAE +DF Q LL + L E ++ +Y+W Sbjct: 125 CVQQQ--DPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLIESRSTWSYYSW 182 Query: 922 TN-----NRVLSKIDRALMNPA*VNK*PQVDVTVMDSQISDHALL 1041 +N +RVLS+ID+A +N + +V V + ISDH+ L Sbjct: 183 SNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPGISDHSPL 227 Score = 33.5 bits (75), Expect(3) = e-136 Identities = 16/55 (29%), Positives = 27/55 (49%) Frame = +3 Query: 1041 LTTLEQQQDQIARPFKFLNHLAQHNDFLLRVRDIWSRQVHGSPMERVWKKFKLMK 1205 L L + Q +PFKF+N +A+ +FL V W+ ++ +W K +K Sbjct: 228 LFNLMTGRPQGGKPFKFMNVMAEQGEFLETVEKAWNSVNGRFKLQAIWLNLKAVK 282 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1114 Score = 377 bits (967), Expect = e-101 Identities = 195/428 (45%), Positives = 272/428 (63%) Frame = +2 Query: 1220 KIQLVRQELNDVQVC*DTHDDQ*LYAREKQLKSELEKWNQVEESILKQKPRVQWLSLGDS 1399 +++ +R++L VQ + L EK L ++L KW+ ++ESILKQK R+QWLSLGDS Sbjct: 302 QVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQLRKWSTIDESILKQKSRIQWLSLGDS 361 Query: 1400 NSAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQSD 1579 NS +FF +++ R +N I L ND G L E+++EI FY+ LL + +++L I Sbjct: 362 NSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQLEAIDLH 421 Query: 1580 VMNNDPILRRDQQLLLMAPVTKQEIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVV 1759 V+ L L+ P+T QEI AL DI D KAPG DGFN VFFKKSW VI E+ Sbjct: 422 VVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVIKQEIY 481 Query: 1760 AVVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGV 1939 + + + K IN T V LIPK+ +AK++RPI+C + LYK+IS +LTKRLQ V Sbjct: 482 EGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKILTKRLQAV 541 Query: 1940 MDSIIDSSQAAFVSGRVITDNILLSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLE 2119 + ++D +Q F+ R I DNILL+ EL+ GY R+ VS RC++K+D++KAYDS+EW FLE Sbjct: 542 ITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPRCVIKVDIRKAYDSVEWVFLE 601 Query: 2120 QVLVALNFPSTFVQWIMMCVQSVSYSILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEY 2299 +L L FPS F++WIM CV++VSYSIL+NG P+ PFDA+K ++MEY Sbjct: 602 SMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPFLFALSMEY 661 Query: 2300 FSRFLEQLGQNSQFHFHPKCSGLKLIQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASS 2479 SR + + ++ +F+FHPKC +KL L F DDLL+F R D S+ I F FS+AS Sbjct: 662 LSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISKIMAAFNSFSKASG 721 Query: 2480 LIANLNKS 2503 L A++ KS Sbjct: 722 LQASIEKS 729 Score = 125 bits (314), Expect(2) = 2e-33 Identities = 75/230 (32%), Positives = 119/230 (51%), Gaps = 5/230 (2%) Frame = +1 Query: 367 MKIAT*NVRGFNKLHKHKEFLKTVRKEHINIIAIVEHRVHKNKATQIVKKVVPG*H*HYN 546 MKI T NVRG N K KE + + I++ ++ E RV + + +I KK N Sbjct: 1 MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGKIQKKFGNRWSWINN 60 Query: 547 
YDISGKERIWLIWDSAYVDVTILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRRSL 726 Y S + RIW+ W + V++ +L +Q + ++ + A+YG HT+ R+ L Sbjct: 61 YACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTIADRKVL 120 Query: 727 WSSLESIEPTVLHPWLIMGDFNVVLRGEDRLNGS*VVDAEVKDFAQCLLTTGLTEMKAIG 906 W L + P +++GD+N V +DRLNG+ V +AE D +L L E G Sbjct: 121 WEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLLEAPTTG 180 Query: 907 RFYTWTN-----NRVLSKIDRALMNPA*VNK*PQVDVTVMDSQISDHALL 1041 FY+W N +R+ S+ID++ +N A +N+ P V V ++ ISDH+ L Sbjct: 181 LFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAGISDHSPL 230 Score = 47.0 bits (110), Expect(2) = 2e-33 Identities = 24/78 (30%), Positives = 38/78 (48%) Frame = +3 Query: 1050 LEQQQDQIARPFKFLNHLAQHNDFLLRVRDIWSRQVHGSPMERVWKKFKLMKGAICGRYN 1229 L Q D+ RPFKFLN LA N F+ V++ W H M+ +W + + +K A+ Sbjct: 234 LATQHDEGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRAL----- 288 Query: 1230 **GRSLMMSRFVETHMMI 1283 +S +F + H + Sbjct: 289 ---KSFHSKKFSKAHCQV 303 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 247 bits (630), Expect(3) = 2e-97 Identities = 139/404 (34%), Positives = 214/404 (52%), Gaps = 2/404 (0%) Frame = +2 Query: 1301 EKQLKSELEKWNQVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSGR 1480 EK+ + EE L QK RV WL GDSN+ +F M R N I L++ +GR Sbjct: 332 EKEAHRSWAELALAEERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGR 391 Query: 1481 ILYTKREVEDEIIGFYKTLLRSCATELSGIQSDVMNNDPILRRDQQL--LLMAPVTKQEI 1654 + E++ + F+K L S + +S +N+ + D+ LL A V++ +I Sbjct: 392 RIENTDELQTHCVDFFKELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADI 451 Query: 1655 QAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILIP 1834 ++ + K+PG DG+ FFKK+W ++ ++A V + R+ N T V ++P Sbjct: 452 KSEFFALPSNKSPGPDGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVP 511 Query: 1835 KVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNILLS 2014 K N EFRPISC +YK+IS +L +RL+ ++ I SQ+AFV GR++T+N+LL+ Sbjct: 512 KKPNADRITEFRPISCCNAIYKVISKLLARRLENILPLWISPSQSAFVKGRLLTENVLLA 571 Query: 2015 
HELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSVSY 2194 ELV G+ + +S+R +LK+D++KA+DS+ W F+ + L A N P FV WI C+ S S+ Sbjct: 572 TELVQGFGQANISSRGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSF 631 Query: 2195 SILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGLKL 2374 SI ++G F K V+AME SR LE + +HPK S +++ Sbjct: 632 SINVSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRI 691 Query: 2375 IQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASSLIANLNKSS 2506 L F DDL++F G S+ I + F S L N KS+ Sbjct: 692 SSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSA 735 Score = 120 bits (302), Expect(3) = 2e-97 Identities = 73/223 (32%), Positives = 117/223 (52%), Gaps = 7/223 (3%) Frame = +1 Query: 385 NVRGFNKLHKHKEFLKTVRKEHINIIAIVEHRVHKNKATQIVKKVVPG*H*HYNYDISGK 564 NVRGFN + + F K + +I+E RV +++A + + PG NY+ + Sbjct: 8 NVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARRSLLSSFPGWKSVCNYEFAAL 67 Query: 565 ERIWLIWDSAYVDVTILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRRSLWSSLES 744 RIW++WD A V+VT+L +DQ + C ++LP + T +Y + RR LWS LE Sbjct: 68 GRIWVVWDPA-VEVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWSELEL 126 Query: 745 I---EPTVLHPWLIMGDFNVVLRGEDRLNGS*VVDAEVKDFAQCLLTTGLTEMKAIGRFY 915 + + T PW+I+GDFN L D G + +++F +CLLT+ ++++ G Y Sbjct: 127 LAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFRGNHY 186 Query: 916 TWTNNR----VLSKIDRALMNPA*VNK*PQVDVTVMDSQISDH 1032 TW NN+ + KIDR L+N + + P + + SDH Sbjct: 187 TWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEFSDH 229 Score = 39.7 bits (91), Expect(3) = 2e-97 Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 1/62 (1%) Frame = +3 Query: 1032 CFALTTLEQQQDQIARPFKFLNHLAQHNDFLLRVRDIWSRQVH-GSPMERVWKKFKLMKG 1208 C + + Q +PFK N L H +F+ ++R W R + GS M + KK K +KG Sbjct: 230 CPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKG 289 Query: 1209 AI 1214 I Sbjct: 290 TI 291 >gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago truncatula] Length = 402 Score = 348 bits (894), Expect = 5e-93 Identities = 180/396 (45%), Positives = 248/396 (62%) Frame = +2 Query: 1289 
LYAREKQLKSELEKWNQVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVN 1468 L EK S LEKW+ +EE I QK R W+ LGDSN+ +F A + R QN+IK L+ Sbjct: 5 LIEAEKICLSSLEKWSTIEEKIWMQKSRANWIQLGDSNTKFFHAYAKERRCQNNIKFLIT 64 Query: 1469 DSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQSDVMNNDPILRRDQQLLLMAPVTKQ 1648 + G + +++EI GFY L+ S L + +V+ P+L + QQ LL + T Sbjct: 65 EDGTRIDKHNLIKEEIRGFYLKLMGSSVDSLPMVDKNVVKRGPMLSQHQQDLLCSKFTAV 124 Query: 1649 EIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVIL 1828 E++ L + KAPG DG+NV FFK SW +I D V+ + + T + K IN T + L Sbjct: 125 EVKNVLFSMDSSKAPGIDGYNVHFFKCSWNIIGDSVIDAILDFFKTGFMPKIINCTYMTL 184 Query: 1829 IPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNIL 2008 +PK N + K FRPI+C +V+YK+IS +LT R+QGV++S++ +Q+AFV GRVI DNI+ Sbjct: 185 LPKEVNVTSVKNFRPIACCSVIYKIISKILTSRMQGVLNSVVSENQSAFVKGRVIFDNII 244 Query: 2009 LSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSV 2188 LSHELV Y RKG+S RCM+KID+QKAY+S+EW F++ +++ L F FV W+M C+ + Sbjct: 245 LSHELVKSYSRKGISPRCMVKIDLQKAYNSVEWPFIKHLMLELGFSYKFVNWVMGCLTTA 304 Query: 2189 SYSILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGL 2368 SY+ ING T PF AKK V+ MEY + L QL +N+ F FHP+C L Sbjct: 305 SYTFNINGDLTRPFAAKKGLRQGDPISPYLFVICMEYLNICLIQLRKNAAFRFHPRCKRL 364 Query: 2369 KLIQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRAS 2476 LI + FVDDLLLF RGDV S+ +F+ F LFS AS Sbjct: 365 NLIHVCFVDDLLLFSRGDVDSVSQLFEAFSLFSAAS 400 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 234 bits (596), Expect(3) = 9e-90 Identities = 137/404 (33%), Positives = 218/404 (53%), Gaps = 5/404 (1%) Frame = +2 Query: 1307 QLKSELEKWN---QVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSG 1477 +L++E KW+ EES +QK R+ W + GD N+ YF R + N I L + +G Sbjct: 334 ELEAE-RKWHILTAAEESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNG 392 Query: 1478 RILYTKREVEDEIIGFYKTLLRSCATELSGIQSDVMNNDPILRRD--QQLLLMAPVTKQE 1651 +++ ++ + D ++ +LL Q+D MN R Q L + + ++ Sbjct: 393 KLVDSQEGILDLCASYFGSLLGDEVDPYLMEQND-MNLLLSYRCSPAQVCELESTFSNED 451 Query: 1652 
IQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILI 1831 I+AAL + K+ G DGF FF SW ++ EV + + + K N TT++LI Sbjct: 452 IRAALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLI 511 Query: 1832 PKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNILL 2011 PK+ NP+ +FRPISC LYK+I+ +LT RLQ ++ +I S+Q+AF+ GR + +N+LL Sbjct: 512 PKIVNPTCTSDFRPISCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLL 571 Query: 2012 SHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSVS 2191 + +LV+GY +S R MLK+D++KA+DS+ W+F+ L AL P F+ WI C+ + + Sbjct: 572 ATDLVHGYNWSNISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPT 631 Query: 2192 YSILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGLK 2371 +++ ING F + K V+AME FS L ++ H+HPK S L Sbjct: 632 FTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLS 691 Query: 2372 LIQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASSLIANLNKS 2503 + L F DD+++F G S+ I + F+ S L N +KS Sbjct: 692 ISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKS 735 Score = 102 bits (254), Expect(3) = 9e-90 Identities = 61/202 (30%), Positives = 103/202 (50%), Gaps = 8/202 (3%) Frame = +1 Query: 385 NVRGFNKLHKHKEFLKTVRKEHINIIAIVEHRVHKNKATQIVKKVVPG*H*HYNYDISGK 564 N+RGFN + F K V+ ++E V + K + + ++PG NY S Sbjct: 9 NIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINALLPGWSFVENYAFSDL 68 Query: 565 ERIWLIWDSAYVDVTILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRRSLWSSLES 744 +IW++WD + V V ++ + Q + C + LP + + +Y + V +R+ LW + + Sbjct: 69 GKIWVMWDPS-VQVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKELWIEIVN 127 Query: 745 IEPTVL---HPWLIMGDFNVVLRGEDRLNG-S*VVDAEVKDFAQCLLTTGLTEMKAIGRF 912 + + + PWL++GDFN VL ++ N S VD ++DF CLL L++++ G Sbjct: 128 MVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSDLRYKGNT 187 Query: 913 YTWTNNR----VLSKIDRALMN 966 +TW N V KIDR L+N Sbjct: 188 FTWWNKSHTTPVAKKIDRILVN 209 Score = 45.4 bits (106), Expect(3) = 9e-90 Identities = 27/56 (48%), Positives = 34/56 (60%), Gaps = 1/56 (1%) Frame = +3 Query: 1050 LEQQQDQIARPFKFLNHLAQHNDFLLRVRDIW-SRQVHGSPMERVWKKFKLMKGAI 1214 LE+ + RPFKF N+L ++ DFL VRD W + V GS M RV 
KK K +K I Sbjct: 238 LEETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPI 293 >ref|XP_004247247.1| PREDICTED: uncharacterized protein LOC101256917 [Solanum lycopersicum] Length = 421 Score = 298 bits (764), Expect = 6e-78 Identities = 161/406 (39%), Positives = 238/406 (58%) Frame = +2 Query: 1289 LYAREKQLKSELEKWNQVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVN 1468 L +E++L +LEKW+ +EES +QK R +W+ LGD+N+ YF + ++ R HI+ +++ Sbjct: 18 LITKEEELPIKLEKWSMIEESAQRQKARAKWIQLGDANNKYFSSVIKERTQNKHIRNILS 77 Query: 1469 DSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQSDVMNNDPILRRDQQLLLMAPVTKQ 1648 GR+LY +E++DE++ FYK+L+ + A VT++ Sbjct: 78 IHGRMLYEPQEIQDEVVLFYKSLMGTSA----------------------------VTEE 109 Query: 1649 EIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVIL 1828 +I AAL I + KAPG DG+N FFK +W++I ++++ VV + ++FK N T V L Sbjct: 110 KIFAALQSIGNDKAPGIDGYNAFFFKYTWKIIKNDIIEVVQSFFKPGKLFKPFNCTLVSL 169 Query: 1829 IPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNIL 2008 IPKVQ+P KE+R I+C TVLYK+IS V+T R+ V+ ++I SQ F+ GR I++NIL Sbjct: 170 IPKVQSPKNVKEYRTITCCTVLYKIISKVITNRMHDVIHNVICDSQVGFILGRKISENIL 229 Query: 2009 LSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSV 2188 L+HELVN Y RK +S R MLKID+QK YDS+EW FL+QV+V L FP F QW+M CV++V Sbjct: 230 LAHELVNSYTRKNISPRSMLKIDLQKVYDSVEWPFLKQVMVGLGFPDMFTQWVMHCVKTV 289 Query: 2189 SYSILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGL 2368 +Y+I++NG T FDA + F+ + Sbjct: 290 NYTIVVNGQTTQRFDAARL-------------------------------FYCY------ 312 Query: 2369 KLIQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASSLIANLNKSS 2506 ++LLLF RGD+ S++ + F FS+AS ANLNKSS Sbjct: 313 --------NNLLLFSRGDLNSIKALKGCFLEFSQASGQQANLNKSS 350 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 209 bits (531), Expect(3) = 8e-78 Identities = 129/428 (30%), Positives = 219/428 (51%), Gaps = 4/428 (0%) Frame = +2 Query: 1232 VRQELNDVQVC*DTHDDQ*LYAREKQLK---SELEKWNQVEESILKQKPRVQWLSLGDSN 1402 +++ V+ C H + QL ++L K +EE KQK V+W+ G+ N Sbjct: 1141 
IKEAEKRVEECEILHQQEQTIGSRIQLNKSYAQLNKQLSMEEIFWKQKSGVKWVVEGERN 1200 Query: 1403 SAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQSDV 1582 + +F M+ + ++HI K+ G + +++ I F+ +LL++ + + + QS + Sbjct: 1201 TKFFHMRMQKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTRFQSSL 1260 Query: 1583 MNNDPILRRDQQLLLMAPVTKQEIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVA 1762 + I+ L A T QE++ A+ I A G DGF+ F+++ W++I+ ++ Sbjct: 1261 CPS--IISDTDNGFLCAEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFE 1318 Query: 1763 VVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVM 1942 V H I + + TT++LIPK + S EFRPIS TV+ K+I+ +L RL ++ Sbjct: 1319 AVKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKIL 1378 Query: 1943 DSIIDSSQAAFVSGRVITDNILLSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQ 2122 SII +Q+ FV GR+I+DNILL+ EL+ +K LK+DM KAYD L+W FL + Sbjct: 1379 PSIITENQSGFVGGRLISDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFK 1438 Query: 2123 VLVALNFPSTFVQWIMMCVQSVSYSILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEYF 2302 VL L F + ++ I C+ + +S+L+NG F +++ ++A EY Sbjct: 1439 VLQHLGFNAQWIGMIQKCISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYL 1498 Query: 2303 SRFLEQL-GQNSQFHFHPKCSGLKLIQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASS 2479 +R L L Q H+ CS L + L F DD+++F G +++ I + + + S Sbjct: 1499 ARGLNALYDQYPSLHYSSGCS-LSVSHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSG 1557 Query: 2480 LIANLNKS 2503 N KS Sbjct: 1558 QRINPQKS 1565 Score = 99.0 bits (245), Expect(3) = 8e-78 Identities = 57/197 (28%), Positives = 101/197 (51%) Frame = +1 Query: 451 INIIAIVEHRVHKNKATQIVKKVVPG*H*HYNYDISGKERIWLIWDSAYVDVTILHTNDQ 630 + I+AI+E V +KA +K+ ++ ++IWL ++ +L + Q Sbjct: 878 LKILAILEPMVDTSKAEYFRRKMG-----FEKVIVNNSQKIWLFHSVEFI-CEVLLDHPQ 931 Query: 631 FVHCMIELPAQGIKVEFTAIYGFHTVETRRSLWSSLESIEPTVLHPWLIMGDFNVVLRGE 810 +H + +P + + T +Y T R LW+ L ++ + PW++ GDFN++L+ E Sbjct: 932 CLHVRVTIPWLDLPIFTTFVYAKCTRSERTPLWNCLRNLAADMEGPWIVGGDFNIILKRE 991 Query: 811 DRLNGS*VVDAEVKDFAQCLLTTGLTEMKAIGRFYTWTNNRVLSKIDRALMNPA*VNK*P 990 +RL G+ + ++DFA LL GL + G +TWTNNR+ ++DR + N +NK P Sbjct: 992 
ERLYGADPHEGSIEDFASVLLDCGLLDGGFEGNPFTWTNNRMFQRLDRMVYNQQWINKFP 1051 Query: 991 QVDVTVMDSQISDHALL 1041 + ++ SDH L Sbjct: 1052 ITRIQHLNRDGSDHCPL 1068 Score = 33.9 bits (76), Expect(3) = 8e-78 Identities = 16/58 (27%), Positives = 28/58 (48%) Frame = +3 Query: 1032 CFALTTLEQQQDQIARPFKFLNHLAQHNDFLLRVRDIWSRQVHGSPMERVWKKFKLMK 1205 C L + ++ F+FL+ A H++F V W+ ++GS + W K K +K Sbjct: 1066 CPLLLSCSNSSEKAPSSFRFLHAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLK 1123 >gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1449 Score = 211 bits (537), Expect(3) = 2e-70 Identities = 111/325 (34%), Positives = 190/325 (58%), Gaps = 10/325 (3%) Frame = +2 Query: 1298 REKQLKSELE-KWNQV---EESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLV 1465 R +++SE +W+++ EE LKQ ++ WL +GD N+ F + R QN I+++ Sbjct: 749 RAMEIESEAYVRWDRIASIEEKYLKQVSKLHWLKVGDKNNKTFHRAATARAAQNSIREIQ 808 Query: 1466 NDSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQSDVMNN------DPILRRDQQLLL 1627 + G TK ++++E F++ L+ + GI + + + P ++ +L Sbjct: 809 KEDGSTATTKDDIKNETERFFQEFLQLIPNDYEGITVEKLTSLLPYHCSPA----EKDML 864 Query: 1628 MAPVTKQEIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAI 1807 A V+ +EI+ AL + + K+PG DG+ F+K++W++I E V V + + K + Sbjct: 865 TASVSAKEIRGALFSMPNDKSPGPDGYTSEFYKRAWDIIGAEFVLAVKSFFEKGFLPKGV 924 Query: 1808 NRTTVILIPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGR 1987 N T + LIPK K++RPISC V+YK+IS ++ RL+ V+ + I +Q+AFV R Sbjct: 925 NTTILALIPKKLEAKEMKDYRPISCCNVIYKVISKIIANRLKHVLPNFIAGNQSAFVKDR 984 Query: 1988 VITDNILLSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWI 2167 ++ +N+LL+ ELV Y + +S RC +KID+ KA+DS++W FL+ VL AL+FP FV W+ Sbjct: 985 LLIENLLLATELVKDYHKDTISGRCAIKIDISKAFDSVQWSFLKNVLSALDFPPEFVHWV 1044 Query: 2168 MMCVQSVSYSILINGHPTTPFDAKK 2242 M+CV + S+S+ +NG F + + Sbjct: 1045 MLCVTTASFSVQVNGELAGYFQSSR 1069 Score = 77.8 bits (190), Expect(3) = 2e-70 Identities = 46/190 (24%), Positives = 97/190 (51%), Gaps = 9/190 (4%) Frame = +1 Query: 430 KTVRKEHINIIAIVEHRVHKNKATQIVKKVVPG*H*HYNYDISGKERIWLIWDSAYVDVT 
609 K V +++ ++E RV + + + K+ NY+ + + R+W++W V T Sbjct: 437 KWVDEQNFQFGCLIETRVKEENSQWLGSKLFKDWSMLTNYEFNRRGRLWVVWREN-VRFT 495 Query: 610 ILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRRSLWSSL-ESIEPTVLH--PWLIM 780 + +DQ + C ++L +Q + ++ +Y + E R+ LW+ L + ++ ++ PW+I Sbjct: 496 PFYKSDQLITCSVKLESQEEEFFYSFVYASNFAEERKILWNDLRDHMDSPIIRDKPWIIF 555 Query: 781 GDFNVVLRGED--RLNGS*VVDAEVKDFAQCLLTTGLTEMKAIGRFYTWTNNR----VLS 942 GDFN +L ++ R+ V + ++DF + +++ + G +TW N R + Sbjct: 556 GDFNEILDMDEHSRMEDHPAVTSGMRDFQSLVNYCSFSDLASHGPLFTWCNKRDNDPIWK 615 Query: 943 KIDRALMNPA 972 K+DR ++N A Sbjct: 616 KLDRVMVNEA 625 Score = 28.1 bits (61), Expect(3) = 2e-70 Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 4/52 (7%) Frame = +3 Query: 1077 RPFKFLNHLAQHNDFLLRVRDIW--SRQVH--GSPMERVWKKFKLMKGAICG 1220 +PFKF+N +A +F V + W + +H S + R KK K +K + G Sbjct: 664 KPFKFVNAVADMEEFKPLVENFWRETEPIHMSTSSLFRFTKKLKALKPKLRG 715 >ref|XP_004252692.1| PREDICTED: uncharacterized protein LOC101261795 [Solanum lycopersicum] Length = 413 Score = 243 bits (621), Expect(3) = 4e-70 Identities = 127/313 (40%), Positives = 185/313 (59%) Frame = +2 Query: 1223 IQLVRQELNDVQVC*DTHDDQ*LYAREKQLKSELEKWNQVEESILKQKPRVQWLSLGDSN 1402 I+ R EL ++Q + L+ +EK L +++KW+ +EES L+QK R +W++LGD+ Sbjct: 133 IEKKRIELVELQEQLYSQASDELFTKEKDLLIKVDKWSMIEESALRQKARARWITLGDAK 192 Query: 1403 SAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQSDV 1582 + YF + ++ R + HI+ ++L I + V Sbjct: 193 NKYFSSVIKERNQKKHIR--------------------------------SKLPAINAQV 220 Query: 1583 MNNDPILRRDQQLLLMAPVTKQEIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVA 1762 M P+ R Q++ L +T+QEI + L + KAPG DG+N +FFK +W++I +V+ Sbjct: 221 MKRGPVSSRQQRIQLCTDITEQEIYSTLQSYGNDKAPGIDGYNALFFKHTWKIIKKDVIE 280 Query: 1763 VVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVM 1942 V N T ++FK N T V LIPKVQ P KE+ PI+C TVLYK+IS V+T+R+ V+ Sbjct: 281 AVKNFFTTGKLFKPFNCTLVSLIPKVQCPKTVKEYTPIACCTVLYKIISKVITRRMHDVI 340 Query: 1943 DSIIDSSQAAFVSGRVITDNILLSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQ 2122 +I SQA F+ GR I 
DNI+L+HELV Y RK +S R +LKID+ KAYDS+EW FLEQ Sbjct: 341 HDVICESQAGFIPGRKIADNIILAHELVKTYTRKNISPRIILKIDLHKAYDSVEWPFLEQ 400 Query: 2123 VLVALNFPSTFVQ 2161 V+V L FP F+Q Sbjct: 401 VMVGLGFPEMFIQ 413 Score = 37.4 bits (85), Expect(3) = 4e-70 Identities = 20/56 (35%), Positives = 31/56 (55%), Gaps = 5/56 (8%) Frame = +1 Query: 880 GLTEMKAIGRFYTWTNN-----RVLSKIDRALMNPA*VNK*PQVDVTVMDSQISDH 1032 G+TE++ G +YTWTN R+ S+IDRA N ++K + + +SDH Sbjct: 2 GITEVQWKGNYYTWTNKQISNARIASRIDRAFGNVTWMDKWGHAAIESGNPGVSDH 57 Score = 35.0 bits (79), Expect(3) = 4e-70 Identities = 20/50 (40%), Positives = 25/50 (50%), Gaps = 1/50 (2%) Frame = +3 Query: 1050 LEQQQDQIARPFKFLNHLAQHNDFLLRVRDIWSRQVHGSP-MERVWKKFK 1196 L Q QI FK N L +H FL V +W +Q HGS M+ +W K Sbjct: 64 LHQSYHQIKVSFKLFNVLIEHKSFLELVDKVW-KQKHGSEVMKEIWYNLK 112 >ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659506 [Glycine max] Length = 964 Score = 158 bits (400), Expect(3) = 7e-70 Identities = 92/294 (31%), Positives = 151/294 (51%), Gaps = 2/294 (0%) Frame = +2 Query: 1220 KIQLVRQELNDV-QVC*DTHDDQ*LYAREKQLKSELEKWNQVEESILKQKPRVQWLSLGD 1396 +++L E N V D L A + + + + E Q + ++L D Sbjct: 670 RVELAEAEYNSVLNSIKQNPQDPSLLALANRTRGQTIMLRKAESMKFAQLIKNKYLLQAD 729 Query: 1397 SNSAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQS 1576 S +F A ++ + I + + G ++ E+ + ++ A EL+ S Sbjct: 730 KCSKFFHALIKRNKHSRFIAAIRLEDGHNTSSQDEIALAFVNHFRNFFS--AHELTQTPS 787 Query: 1577 -DVMNNDPILRRDQQLLLMAPVTKQEIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDE 1753 + N P + D L+ P +KQ++ ++ +++ KAPG DGFNV+FFKK+W ++ D+ Sbjct: 788 ISICNRGPKVPTDCFAALLCPTSKQKVWNIISVMANNKAPGPDGFNVLFFKKAWNIVGDD 847 Query: 1754 VVAVVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQ 1933 + A V T +I K +N ++LIPK S FRPISC +LYK++S +L R+ Sbjct: 848 IFAAVNEFFTTGKILKQLNHAIIVLIPKHDQASQVNHFRPISCCNLLYKIVSKILANRIA 907 Query: 1934 GVMDSIIDSSQAAFVSGRVITDNILLSHELVNGYCRKGVSARCMLKIDMQKAYD 2095 V+++II +Q AF+ R + DNI L E++ Y RK S RC+LKID+ KAYD Sbjct: 908 PVLETIIGETQTAFIKNRKMMDNIFLVQEILRKYARKRPSPRCLLKIDLHKAYD 961 Score = 
114 bits (285), Expect(3) = 7e-70 Identities = 60/146 (41%), Positives = 83/146 (56%), Gaps = 1/146 (0%) Frame = +1 Query: 607 TILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRRSLWSSLESIEPTVLHPWLIMGD 786 ++L +N Q +HC I+ + + + IYG H++ RRSLW +L SI + PWL++GD Sbjct: 453 SVLESNAQLIHCAIDCKTTAKRFQVSFIYGLHSIMARRSLWINLNSINANMNCPWLLIGD 512 Query: 787 FNVVLRGEDRLNGS*VVDAEVKDFAQCLLTTGLTEMKAIGRFYTWTNNRVLSKIDRALMN 966 FN +L DR NG+ + E++DF C GL + G YTWTN+RV SK+DRAL N Sbjct: 513 FNSILSPTDRFNGAELNAYELQDFVDCYSDLGLGSINTHGPLYTWTNSRVWSKLDRALCN 572 Query: 967 PA*VNK*PQVDVTVMD-SQISDHALL 1041 A N VM+ ISDH L Sbjct: 573 QAWFNSFGNSACEVMEFISISDHTPL 598 Score = 42.4 bits (98), Expect(3) = 7e-70 Identities = 19/45 (42%), Positives = 26/45 (57%) Frame = +3 Query: 1080 PFKFLNHLAQHNDFLLRVRDIWSRQVHGSPMERVWKKFKLMKGAI 1214 PFKF N + H +FL V D W + +HG M +V KK K +K + Sbjct: 612 PFKFNNLIVDHPNFLRIVADGWKQNIHGCSMFKVCKKLKALKAPL 656 >ref|XP_006577697.1| PREDICTED: uncharacterized protein LOC102664381 [Glycine max] Length = 515 Score = 136 bits (343), Expect(3) = 1e-67 Identities = 77/223 (34%), Positives = 115/223 (51%), Gaps = 4/223 (1%) Frame = +1 Query: 385 NVRGFNKLHKHKEFLKTVRKEHINIIAIVEHRVHKNKATQIVKKVVPG*H*HYNYDISGK 564 N+RG NK+ K E ++ + II ++E RV KNKA + K+ NYD Sbjct: 6 NIRGLNKVGKTIEISSRLKSLNPTIIVLLETRVRKNKALTVRNKLNLNMKYLDNYDKHEN 65 Query: 565 ERIWLIWDSAYVDVTILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRRSLWSSLES 744 RIW IWD + V + + + Q +HC + P TAIY + ++ RR LW +E Sbjct: 66 GRIWFIWDDSKVMIKHICSTSQLIHCGVYNPNGDFLHWCTAIYALNHLDDRRKLWKDIED 125 Query: 745 IEPTVLHPWLIMGDFNVVLRGEDRLNGS*VVDAEVKDFAQCLLTTGLTEMKAIGRFYTWT 924 + PW ++GDFN VL+ EDR+ G V+++E D + + GL EM G F+TWT Sbjct: 126 LRVQQADPWCLLGDFNNVLKAEDRIGGRDVIESEYVDLREMMSRVGLYEMDTCGDFFTWT 185 Query: 925 N----NRVLSKIDRALMNPA*VNK*PQVDVTVMDSQISDHALL 1041 N N + S+IDR L N + + ++ +SDHAL+ Sbjct: 186 NKQADNTIYSRIDRFLGNLNWLQMHIDSTLKILAPSVSDHALM 228 Score = 131 bits (329), Expect(3) = 1e-67 Identities = 61/190 (32%), Positives = 107/190 (56%) Frame = +2 Query: 1298 
REKQLKSELEKWNQVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSG 1477 R K SEL + N++E++ L+QK ++ W+ GD N++YF A+++GR N I+ L+ + G Sbjct: 326 RVKDRTSELLQLNELEDNDLRQKAKINWIRQGDGNNSYFHATIKGRYKHNAIRSLIKEDG 385 Query: 1478 RILYTKREVEDEIIGFYKTLLRSCATELSGIQSDVMNNDPILRRDQQLLLMAPVTKQEIQ 1657 + + ++E+E++ FY LL S + L+G+ + N L + Q+ +L+ PV+ EI Sbjct: 386 SCITSHEDIEEEVLKFYSALLGSSESNLAGLNIPAIRNGNTLNQFQRDMLIGPVSNAEID 445 Query: 1658 AALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILIPK 1837 + + K PG DG+ V FFK +W ++ +V + + R+ K N + V LIPK Sbjct: 446 TTIKGMDVNKTPGIDGYGVGFFKDAWSIVGSDVREAILDFFLRNRLHKGFNSSVVALIPK 505 Query: 1838 VQNPSYAKEF 1867 + K+F Sbjct: 506 HKEAKMIKDF 515 Score = 39.7 bits (91), Expect(3) = 1e-67 Identities = 17/54 (31%), Positives = 30/54 (55%) Frame = +3 Query: 1053 EQQQDQIARPFKFLNHLAQHNDFLLRVRDIWSRQVHGSPMERVWKKFKLMKGAI 1214 + Q ++ FK+ N LA+ N F V+ W+ VHG+PM ++W K ++ + Sbjct: 233 KDQSSRLRGRFKYRNSLARLNGFHDEVKKNWNLGVHGNPMYKLWTKLSRLQSVL 286 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 255 bits (652), Expect = 6e-65 Identities = 144/389 (37%), Positives = 218/389 (56%), Gaps = 2/389 (0%) Frame = +2 Query: 1343 EESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEIIG 1522 EES Q+ RV W + GDSN+ YF + R + N I LV+ +G ++ +++ + D + Sbjct: 208 EESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGILDHCVT 267 Query: 1523 FYKTLLRSCATELSGIQSDVMNNDPILR--RDQQLLLMAPVTKQEIQAALNDISDLKAPG 1696 +Y+ LL S + S Q D MN R +DQ L T EI+AA + K G Sbjct: 268 YYERLLGSIESPFSMEQED-MNLLLTYRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSG 326 Query: 1697 CDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRPI 1876 DG++V FF+ +W +I EV+A + + ++ K N TT++LIPK N EFRPI Sbjct: 327 PDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTSNACTISEFRPI 386 Query: 1877 SCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNILLSHELVNGYCRKGVSA 2056 SC LYK+IS +LT RLQG++ ++I SQ+AF+ GR + +N+LL+ E+V+GY R +S Sbjct: 387 SCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYNRLNISP 446 
Query: 2057 RCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSVSYSILINGHPTTPFDA 2236 R MLK+D++KA+DS++W+F+ L AL P ++ WI C+ + S++I +NG F + Sbjct: 447 RGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGFFRS 506 Query: 2237 KKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGLKLIQLGFVDDLLLFCR 2416 K V+AME FS+ L + H+HPK L + L F DD+++F Sbjct: 507 TKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFD 566 Query: 2417 GDVGSMELIFDKFKLFSRASSLIANLNKS 2503 G SM I + F+ S L N +KS Sbjct: 567 GGSSSMHGICETLDDFADWSGLKVNKDKS 595 Score = 43.9 bits (102), Expect(2) = 4e-08 Identities = 28/56 (50%), Positives = 30/56 (53%), Gaps = 1/56 (1%) Frame = +3 Query: 1050 LEQQQDQIARPFKFLNHLAQHNDFLLRVRDIW-SRQVHGSPMERVWKKFKLMKGAI 1214 LE RPFKF N L ++ DFL V D W S V GS M RV KK K MK I Sbjct: 98 LEANGISAKRPFKFFNFLLKNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPI 153 Score = 42.7 bits (99), Expect(2) = 4e-08 Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 5/91 (5%) Frame = +1 Query: 775 IMGDFNVVLRGEDRLNG-S*VVDAEVKDFAQCLLTTGLTEMKAIGRFYTWTNNR----VL 939 ++GDFN VL ++ N S +D ++DF CL L+++ G +TW N + Sbjct: 1 MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60 Query: 940 SKIDRALMNPA*VNK*PQVDVTVMDSQISDH 1032 K+DR L N + N P + SDH Sbjct: 61 KKLDRILANDSWCNLYPSSHGLFGNLDFSDH 91 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 255 bits (652), Expect = 6e-65 Identities = 144/389 (37%), Positives = 218/389 (56%), Gaps = 2/389 (0%) Frame = +2 Query: 1343 EESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEIIG 1522 EES Q+ RV W + GDSN+ YF + R + N I LV+ +G ++ +++ + D + Sbjct: 208 EESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGILDHCVT 267 Query: 1523 FYKTLLRSCATELSGIQSDVMNNDPILR--RDQQLLLMAPVTKQEIQAALNDISDLKAPG 1696 +Y+ LL S + S Q D MN R +DQ L T EI+AA + K G Sbjct: 268 YYERLLGSIESPFSMEQED-MNLLLTYRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSG 326 Query: 1697 CDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRPI 1876 DG++V FF+ +W +I EV+A + + ++ K N TT++LIPK N EFRPI Sbjct: 327 
PDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTSNACTISEFRPI 386 Query: 1877 SCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNILLSHELVNGYCRKGVSA 2056 SC LYK+IS +LT RLQG++ ++I SQ+AF+ GR + +N+LL+ E+V+GY R +S Sbjct: 387 SCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYNRLNISP 446 Query: 2057 RCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSVSYSILINGHPTTPFDA 2236 R MLK+D++KA+DS++W+F+ L AL P ++ WI C+ + S++I +NG F + Sbjct: 447 RGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGFFRS 506 Query: 2237 KKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGLKLIQLGFVDDLLLFCR 2416 K V+AME FS+ L + H+HPK L + L F DD+++F Sbjct: 507 TKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFD 566 Query: 2417 GDVGSMELIFDKFKLFSRASSLIANLNKS 2503 G SM I + F+ S L N +KS Sbjct: 567 GGSSSMHGICETLDDFADWSGLKVNKDKS 595 Score = 43.9 bits (102), Expect(2) = 4e-08 Identities = 28/56 (50%), Positives = 30/56 (53%), Gaps = 1/56 (1%) Frame = +3 Query: 1050 LEQQQDQIARPFKFLNHLAQHNDFLLRVRDIW-SRQVHGSPMERVWKKFKLMKGAI 1214 LE RPFKF N L ++ DFL V D W S V GS M RV KK K MK I Sbjct: 98 LEANGISAKRPFKFFNFLLKNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPI 153 Score = 42.7 bits (99), Expect(2) = 4e-08 Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 5/91 (5%) Frame = +1 Query: 775 IMGDFNVVLRGEDRLNG-S*VVDAEVKDFAQCLLTTGLTEMKAIGRFYTWTNNR----VL 939 ++GDFN VL ++ N S +D ++DF CL L+++ G +TW N + Sbjct: 1 MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60 Query: 940 SKIDRALMNPA*VNK*PQVDVTVMDSQISDH 1032 K+DR L N + N P + SDH Sbjct: 61 KKLDRILANDSWCNLYPSSHGLFGNLDFSDH 91 >emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 197 bits (502), Expect(3) = 8e-64 Identities = 123/408 (30%), Positives = 205/408 (50%), Gaps = 4/408 (0%) Frame = +2 Query: 1295 AREKQLKSELEKWNQVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDS 1474 A + + EL W + +E+ Q R +W+ GD N+ YF R +N I L+ ++ Sbjct: 318 AERRSSQMELWVWLRRKEAFWAQNSRAKWIKEGDKNTKYFHTLASTRKKKNTIPALITNN 377 Query: 1475 GRILYTKREVEDEIIGFYKTLLR---SCATELSGIQSDVMNNDPILRRDQQLLLMAPVTK 1645 G ++ + E + F+K++ + S +G+Q ++ + + + L P + Sbjct: 378 G-VVSDPAGIHHEAVSFFKSIFKEDFSSRPVFNGLQFRSLSCEQVSQ------LTEPFSH 430 Query: 1646 QEIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVI 1825 +E+ A+ KAPG DG+N F K SW++I +V +V N ++ + K N + Sbjct: 431 KEVDEAVESCDPQKAPGPDGYNFRFIKDSWDIIKLDVYNIVENFWNSGSLPKGSNVAFIA 490 Query: 1826 LIPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNI 2005 LI K + P +FRPIS +YK+I+ +L +RLQ VMDS+I Q++F++GR I D Sbjct: 491 LIAKREVPEGLNDFRPISMVGCIYKIIAKLLARRLQKVMDSLIGPYQSSFIAGRQILDGA 550 Query: 2006 LLSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQS 2185 L++ EL++ CR+ +LK+D KA+DS+ W FL+ L + FP + WI C+ S Sbjct: 551 LIAGELID-TCRRKKVQLSILKLDFHKAFDSVAWSFLDWTLDKMGFPPRWRMWISSCITS 609 Query: 2186 VSYSILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFH-FHPKCS 2362 + SILING PT PF + + +E S +++ + + Sbjct: 610 AAASILINGSPTAPFKLHRGLRQGDPLSPFLFDLVVETLSLVIQKASHLGLWEGVEVTKN 669 Query: 2363 GLKLIQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASSLIANLNKSS 2506 G K+ L + DD ++FC ++ + I LF AS L N +KSS Sbjct: 670 GEKITHLQYADDTIIFCPPNLDYLLNIKKTLILFQLASGLQVNFHKSS 717 Score = 68.6 bits (166), Expect(3) = 8e-64 Identities = 53/227 (23%), Positives = 102/227 (44%), Gaps = 2/227 (0%) Frame = +1 Query: 367 MKIAT*NVRGFNKLHKHKEFLKTVRKEHINIIAIVEHRVHKNKATQIVKKVVPG*H*HYN 546 M I + N+RG N K K + + I + E ++ ++ + ++ + + Sbjct: 1 MIIISWNIRGLNARVKKSSLRKLISRHDPKFIFLQETKM-ESLNPKTIRSIWNSDDIDWL 59 Query: 547 Y--DISGKERIWLIWDSAYVDVTILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRR 720 + I + +W Y +T + + ++ ++P++ + +Y +R Sbjct: 60 
FIPSIGNSGGLLSMWKIDYFSLTSHKSENNWIALNGKIPSKNFQGVLVNVYNPCCRVSRS 119 Query: 721 SLWSSLESIEPTVLHPWLIMGDFNVVLRGEDRLNGS*VVDAEVKDFAQCLLTTGLTEMKA 900 +W+S+ P L++GDFN VL DR +G V DF + T L E+ A Sbjct: 120 KVWTSISDYWAESQSPMLMVGDFNEVLDPSDRGSGI-SSQLGVLDFKNFIQQTHLMEISA 178 Query: 901 IGRFYTWTNNRVLSKIDRALMNPA*VNK*PQVDVTVMDSQISDHALL 1041 ++TW + + SK+DR L+NP V+ P + V+++ +SDH L Sbjct: 179 SDGWFTWFSGQAKSKLDRLLVNPEWVSLFPSLQVSILRRNLSDHCPL 225 Score = 28.5 bits (62), Expect(3) = 8e-64 Identities = 12/43 (27%), Positives = 23/43 (53%) Frame = +3 Query: 1077 RPFKFLNHLAQHNDFLLRVRDIWSRQVHGSPMERVWKKFKLMK 1205 RPF+F N H L ++D+W+ G+ +++ + K +K Sbjct: 237 RPFRFQNCWLSHPGCLQIIKDVWASHTSGNLTDKLKETKKRLK 279 >ref|XP_004253225.1| PREDICTED: uncharacterized protein LOC101268668 [Solanum lycopersicum] Length = 390 Score = 185 bits (470), Expect(3) = 8e-64 Identities = 102/300 (34%), Positives = 155/300 (51%) Frame = +2 Query: 1220 KIQLVRQELNDVQVC*DTHDDQ*LYAREKQLKSELEKWNQVEESILKQKPRVQWLSLGDS 1399 KI+ R EL ++Q L ++K+L +LEKW+ +EE+ L+QK R +W++LGD+ Sbjct: 158 KIEKARSELEELQEKLYNQAQDDLVTKDKELLIQLEKWSMLEENALRQKARARWITLGDT 217 Query: 1400 NSAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEIIGFYKTLLRSCATELSGIQSD 1579 N+ YF A ++ R + HI+ +++ G++LY +E+++E + FYK+L+ S A +L I + Sbjct: 218 NNKYFSAVIKERNQKKHIRSILSLDGKMLYEPQEIQEEFVKFYKSLMGSSAGKLPAINAQ 277 Query: 1580 VMNNDPILRRDQQLLLMAPVTKQEIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVV 1759 + ND KAPG DG+N +FFK +W+++ +V+ Sbjct: 278 SIGND------------------------------KAPGIDGYNELFFKHTWKIVKKDVI 307 Query: 1760 AVVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGV 1939 TN ++FK N T V LIPKVQNP Sbjct: 308 TAATNFFTKGKLFKTFNCTLVSLIPKVQNP------------------------------ 337 Query: 1940 MDSIIDSSQAAFVSGRVITDNILLSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLE 2119 F+ GR I +NI+L+HELV Y RK +S R MLKID+Q+AYD +EW FLE Sbjct: 338 -------QTDGFIPGRKIAENIILAHELVKSYTRKNISPRSMLKIDLQQAYDLVEWSFLE 390 Score = 82.8 bits (203), Expect(3) = 8e-64 Identities = 41/95 (43%), Positives = 60/95 (63%), Gaps = 5/95 (5%) Frame = +1 Query: 
715 RRSLWSSLESIEPTVLHPWLIMGDFNVVLRGEDRLNGS*VVDAEVKDFAQCLLTTGLTEM 894 RRSLW+ L+ + +V PW+I+GDFN +L +DRL+ V E+KDF +C+ G+TE+ Sbjct: 2 RRSLWNELKMLTHSVSEPWIIIGDFNAILSPKDRLDRVLVTLNEIKDFEECVKDMGVTEI 61 Query: 895 KAIGRFYTWTNN-----RVLSKIDRALMNPA*VNK 984 G +YTWTN R+ S+IDRA N ++K Sbjct: 62 HWKGNYYTWTNKQVGAARIASRIDRAFGNDCWMDK 96 Score = 26.6 bits (57), Expect(3) = 8e-64 Identities = 11/43 (25%), Positives = 20/43 (46%) Frame = +3 Query: 1050 LEQQQDQIARPFKFLNHLAQHNDFLLRVRDIWSRQVHGSPMER 1178 L++ I FKF N +H F+ V +W ++ +E+ Sbjct: 119 LQKSYHHIRVGFKFFNVWVEHESFMEMVDTVWKQEYGSQKIEK 161 >gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H [Medicago truncatula] Length = 1296 Score = 222 bits (565), Expect(3) = 1e-63 Identities = 135/403 (33%), Positives = 212/403 (52%), Gaps = 2/403 (0%) Frame = +2 Query: 1301 EKQLKSELEKWNQVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSGR 1480 EK+L+ E EE + QK R QW+ LGD N+A+F A R N I KL +G Sbjct: 321 EKELQDEYNHILFQEEMLWYQKSREQWVKLGDKNTAFFHAQTVIRRKWNKIHKLQLPNGI 380 Query: 1481 ILYTKREVEDEIIGFYKTLLRSCATELSGIQSDVMNNDPILRRDQQLLLMAPVTKQEIQA 1660 +++E + ++K C +++ + P L + L +P+TK+E+ A Sbjct: 381 STSDSNILQEEALKYFKKFF--CGSQIPYSRFFNEGRHPALDDTGKTSLTSPITKKEVFA 438 Query: 1661 ALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILIPKV 1840 ALN + KAPG DGF+ +FFK+ W ++ D+V +V + T AI+ T + LIPK+ Sbjct: 439 ALNSMKPYKAPGPDGFHCIFFKQYWHIVGDDVFHLVRSAFLTGHFDPAISNTLIALIPKI 498 Query: 1841 QNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNILLSHE 2020 +P+ K+FRPIS LYK+I+ VL RL+ ++++I Q++F+ GR DN ++ E Sbjct: 499 DSPNTYKDFRPISLCNTLYKIITKVLVHRLRPFLNNLIGPYQSSFLPGRGTADNSIILQE 558 Query: 2021 LVNGYCR-KGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSVSYS 2197 +++ R K K+D++KA+D++ WDFL L+ FP V+ IM CV S +YS Sbjct: 559 ILHFMKRSKRKKGYVAFKLDLEKAFDNVNWDFLNSCLLDFGFPDIIVKLIMHCVSSANYS 618 Query: 2198 ILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQ-LGQNSQFHFHPKCSGLKL 2374 +L NG+ PF ++ ME S ++ + Q S H G ++ Sbjct: 619 
LLWNGNKMPPFKPTHGLRQGDPLSPYLFILCMEKLSVAIQDAVLQGSWEPIHIINDGPQI 678 Query: 2375 IQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASSLIANLNKS 2503 L F DD+LLF + ++ I + F FSRAS L N++KS Sbjct: 679 SHLLFADDVLLFTKAKSSQLQFITNLFDRFSRASGLKINISKS 721 Score = 49.7 bits (117), Expect(3) = 1e-63 Identities = 46/162 (28%), Positives = 71/162 (43%), Gaps = 8/162 (4%) Frame = +1 Query: 571 IWLIWDSAY-VDVTILHTNDQFVHCMIELPAQGIKVEF-TAIYGFHTVETRRSLWSSLES 744 +WL+ S + T+L N + +I +G + T IY R +LW+ L + Sbjct: 67 VWLLKHSTTNITSTVLDFNQYSITFII---GRGAAITTCTCIYASPNYSMRPNLWNYLVN 123 Query: 745 IEPTVLHPWLIMGDFNVV-LRGEDRLNGS*VVDAEVKDFAQCLLTTGLTEMKAIGRFYTW 921 I T+ PW+++GDFN L E R G F+ + L ++ G +TW Sbjct: 124 INDTITGPWMLIGDFNETHLPSEQR--GGTFHHNRAATFSNFMNNCNLLDLTTTGGRFTW 181 Query: 922 TNN----RVLS-KIDRALMNPA*VNK*PQVDVTVMDSQISDH 1032 N R+LS K+DR + N P+ V V+ SDH Sbjct: 182 HKNNNGIRILSKKLDRGMANVDWRLSFPEAFVEVLCRLHSDH 223 Score = 22.3 bits (46), Expect(3) = 1e-63 Identities = 17/75 (22%), Positives = 31/75 (41%), Gaps = 1/75 (1%) Frame = +3 Query: 1077 RPFKFLNHLAQHNDFLLRVRDIWSRQVHGSPMERVWKKFKLMKGAICGRYN**GRSLMMS 1256 RPF+F H D+ V+ WS H + K+M+ +I ++ G Sbjct: 240 RPFRFEAAWIDHYDYGNVVKRSWSTHTHNPTASLI----KVMENSIIFNHDVFGNIFQRK 295 Query: 1257 RFVETHMM-INNYMQ 1298 VE + + +Y++ Sbjct: 296 SRVEWRLKGVQSYLE 310 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 247 bits (631), Expect = 2e-62 Identities = 134/406 (33%), Positives = 222/406 (54%), Gaps = 2/406 (0%) Frame = +2 Query: 1295 AREKQLKSELEKWNQVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDS 1474 A E + ++ +++EE LKQK ++ W+++GD N++YF + + R +N I+++ + Sbjct: 634 AEELKAYTDWTHLSELEEGFLKQKSKLHWMNVGDGNNSYFHKAAQVRKMRNSIREIRGPN 693 Query: 1475 GRILYTKREVEDEIIGFYKTLLRSCATELSGIQSDVMNNDPILRRD--QQLLLMAPVTKQ 1648 L T E++ E F+ L + + GI + + N R Q +L VT + Sbjct: 694 AETLQTSEEIKGEAERFFNEFLNRQSGDFHGISVEDLRNLMSYRCSVTDQNILTREVTGE 753 Query: 1649 EIQAALNDISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVIL 1828 EIQ L + + K+PG DG+ FFK +W + + +A + 
+ + K +N T + L Sbjct: 754 EIQKVLFAMPNNKSPGPDGYTSEFFKATWSLTGPDFIAAIQSFFVKGFLPKGLNATILAL 813 Query: 1829 IPKVQNPSYAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNIL 2008 IPK K++RPISC VLYK+IS +L RL+ ++ S I +Q+AFV R++ +N+L Sbjct: 814 IPKKDEAIEMKDYRPISCCNVLYKVISKILANRLKLLLPSFILQNQSAFVKERLLMENVL 873 Query: 2009 LSHELVNGYCRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSV 2188 L+ ELV Y ++ V+ RC +KID+ KA+DS++W FL L ALNFP TF WI +C+ + Sbjct: 874 LATELVKDYHKESVTPRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTA 933 Query: 2189 SYSILINGHPTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGL 2368 ++S+ +NG F + + V+ M S +++ + +HPKC + Sbjct: 934 TFSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKI 993 Query: 2369 KLIQLGFVDDLLLFCRGDVGSMELIFDKFKLFSRASSLIANLNKSS 2506 L L F DDL++F G S+E + + FK F+ S L +L KS+ Sbjct: 994 GLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKST 1039 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 245 bits (626), Expect = 6e-62 Identities = 134/398 (33%), Positives = 216/398 (54%), Gaps = 5/398 (1%) Frame = +2 Query: 1328 KWNQV---EESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSGRILYTKR 1498 +W++V EE LKQK ++ W +GD N+ F + R N I++++++ G + Sbjct: 345 RWDRVAILEEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGD 404 Query: 1499 EVEDEIIGFYKTLLRSCATELSGIQ-SDVMNNDPILRRD-QQLLLMAPVTKQEIQAALND 1672 E++ E F++ L+ + G+ +++ P+ D Q L+ PVT +EI+ L Sbjct: 405 EIKAEAERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFR 464 Query: 1673 ISDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILIPKVQNPS 1852 + K+PG DG+ FFK +WE+I DE V + + K IN T + LIPK Sbjct: 465 MPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAR 524 Query: 1853 YAKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNILLSHELVNG 2032 K++RPISC VLYK+IS ++ RL+ V+ I +Q+AFV R++ +N+LL+ ELV Sbjct: 525 EMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKD 584 Query: 2033 YCRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSVSYSILING 2212 Y + +S RC +KID+ KA+DS++W 
FL V L FP F+ WI +C+ + S+S+ +NG Sbjct: 585 YHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNG 644 Query: 2213 HPTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGLKLIQLGFV 2392 F + + V+ M+ S+ L++ F +HPKC + L L F Sbjct: 645 ELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFA 704 Query: 2393 DDLLLFCRGDVGSMELIFDKFKLFSRASSLIANLNKSS 2506 DDL++ G + S+E I F F++ S L +L KS+ Sbjct: 705 DDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKST 742 Score = 96.7 bits (239), Expect(2) = 3e-18 Identities = 62/203 (30%), Positives = 105/203 (51%), Gaps = 9/203 (4%) Frame = +1 Query: 385 NVRGFNKLHKHKEFLKTVRKEHINIIAIVEHRVHKNKATQIVKKVVPG*H*HYNYDISGK 564 NVRG NK KH K + + + +VE RV ++K +Q+V K+ NY+ + + Sbjct: 7 NVRGLNKSSKHSVIKKWIEENNFQFGCLVETRVKESKVSQLVGKLFKDWSILTNYEHNRR 66 Query: 565 ERIWLIWDSAYVDVTILHTNDQFVHCMIELPAQGIKVEFTAIYGFHTVETRRSLWSSLES 744 RIW++W V ++ ++ + Q + C ++L + + + +Y + VE R+ LWS L+ Sbjct: 67 GRIWVLW-RKNVRLSPIYKSCQLLTCSVKLEDRQDEFFCSFVYASNYVEERKVLWSELKD 125 Query: 745 --IEPTVLH-PWLIMGDFNVVLRGEDRLNG--S*VVDAEVKDFAQCLLTTGLTEMKAIGR 909 P + H PW ++GDFN L + +V ++DF Q + LT+M A G Sbjct: 126 HYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPGMRDFQQVINYCSLTDMAAQGP 185 Query: 910 FYTWTNNR----VLSKIDRALMN 966 +TW N R ++ K+DR L+N Sbjct: 186 LFTWCNKREHGLIMKKLDRVLIN 208 Score = 24.3 bits (51), Expect(2) = 3e-18 Identities = 10/23 (43%), Positives = 12/23 (52%) Frame = +3 Query: 1077 RPFKFLNHLAQHNDFLLRVRDIW 1145 +PFKF+N L DF V W Sbjct: 249 KPFKFVNALTDMEDFKPMVSTYW 271 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 241 bits (616), Expect = 9e-61 Identities = 138/394 (35%), Positives = 212/394 (53%), Gaps = 5/394 (1%) Frame = +2 Query: 1337 QVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSGRILYTKREVEDEI 1516 + EES Q+ RV W+ GDSN++YF R N I +++D+G + T+ +++ Sbjct: 295 KAEESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDTQLGIKEHC 354 Query: 1517 IGFYKTLLRSCATELSGIQSDVMNNDPI-LRRDQQLLLMAPVTKQEIQAALNDISDLKAP 1693 I ++ LL IQ D P DQ+ L ++Q+I++A K Sbjct: 
355 IEYFSNLLGGEVGPPMLIQEDFDLLLPFRCSHDQKKELAMSFSRQDIKSAFFSFPSNKTS 414 Query: 1694 GCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILIPKVQNPSYAKEFRP 1873 G DGF V FFK++W VI EV V+ + + K N TT++LIPK+ N S +FRP Sbjct: 415 GPDGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTLVLIPKITNASKMNDFRP 474 Query: 1874 ISCST----VLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNILLSHELVNGYCR 2041 ISC+ LYK+I+ +LT RLQ ++ +I Q+AF+ GR + +N+LL+ ELV GY R Sbjct: 475 ISCNDFGPITLYKVIARLLTNRLQCLLSQVISPFQSAFLPGRFLAENVLLATELVQGYNR 534 Query: 2042 KGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSVSYSILINGHPT 2221 + + R MLK+D++KA+DS+ WDF+ L A+ P FV WI C+ + ++S+ +NG+ Sbjct: 535 QNIDPRGMLKVDLRKAFDSIRWDFIISALKAIGIPDRFVYWITQCISTPTFSVCVNGNTG 594 Query: 2222 TPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGLKLIQLGFVDDL 2401 F + + V+AME FS L Q H+HPK S L + L F DD+ Sbjct: 595 GFFKSTRGLRQGNPLSPFLFVLAMEVFSSLLNSRFQAGYIHYHPKTSPLSISHLMFADDI 654 Query: 2402 LLFCRGDVGSMELIFDKFKLFSRASSLIANLNKS 2503 ++F G S+ I + + F+ S L+ N K+ Sbjct: 655 MVFFDGGSSSLHGISEALEDFAFWSGLVLNREKT 688 Score = 55.1 bits (131), Expect(2) = 3e-10 Identities = 38/129 (29%), Positives = 59/129 (45%), Gaps = 8/129 (6%) Frame = +1 Query: 673 VEFTAIYGFHTVETRRSLWSSLESIEPTVL---HPWLIMGDFNVVL-RGEDRLNGS*VVD 840 V + +Y + TR+ LW L + ++ PW+++GDFN VL E S V+ Sbjct: 53 VVVSIVYAANEAITRKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHSQATSLNVN 112 Query: 841 AEVKDFAQCLLTTGLTEMKAIGRFYTWTNNR----VLSKIDRALMNPA*VNK*PQVDVTV 1008 +K F CL L ++ G +TW N V K+DR L+N + ++ P Sbjct: 113 RRMKVFRDCLFEAELCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSRFPSAYAVF 172 Query: 1009 MDSQISDHA 1035 + SDHA Sbjct: 173 GEPDFSDHA 181 Score = 38.9 bits (89), Expect(2) = 3e-10 Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 1/47 (2%) Frame = +3 Query: 1077 RPFKFLNHLAQHNDFLLRVRDIW-SRQVHGSPMERVWKKFKLMKGAI 1214 RPF+F N L Q+ DF+ V ++W S V GS M ++ KK K +K I Sbjct: 196 RPFRFYNFLLQNPDFISLVGELWYSINVVGSSMFKMSKKLKALKNPI 242 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 
241 bits (614), Expect = 1e-60 Identities = 137/396 (34%), Positives = 215/396 (54%), Gaps = 4/396 (1%) Frame = +2 Query: 1328 KWN---QVEESILKQKPRVQWLSLGDSNSAYFFASMRGRINQNHIKKLVNDSGRILYTKR 1498 KW + E S Q+ RV WL GD NS+YF R + NHI L + G + ++ Sbjct: 237 KWRILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQQ 296 Query: 1499 EVEDEIIGFYKTLLRSCATELSGIQSDVMNNDPI-LRRDQQLLLMAPVTKQEIQAALNDI 1675 +E+ + ++++ L S Q+D+ N QQ+ L P + ++I+ A + Sbjct: 297 NLENHCVEYFQSNLGSEQGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSL 356 Query: 1676 SDLKAPGCDGFNVVFFKKSWEVISDEVVAVVTNILHTKRIFKAINRTTVILIPKVQNPSY 1855 KA G DGF+ FF W +I EV + + ++ K N T ++LIPK+ N S Sbjct: 357 PRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNASS 416 Query: 1856 AKEFRPISCSTVLYKLIS*VLTKRLQGVMDSIIDSSQAAFVSGRVITDNILLSHELVNGY 2035 +FRPISC +YK+IS +LT RL+ + + I SQ+AF+ GR+ +N+LL+ ELV+GY Sbjct: 417 MSDFRPISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELVHGY 476 Query: 2036 CRKGVSARCMLKIDMQKAYDSLEWDFLEQVLVALNFPSTFVQWIMMCVQSVSYSILINGH 2215 +K ++ MLK+D++KA+DS+ WDF+ L ALN P F WI+ C+ + S+S+++NGH Sbjct: 477 NKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVILNGH 536 Query: 2216 PTTPFDAKKXXXXXXXXXXXXXVMAMEYFSRFLEQLGQNSQFHFHPKCSGLKLIQLGFVD 2395 F + K V+AME FS L+ + +HPK S L++ L F D Sbjct: 537 SAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLMFAD 596 Query: 2396 DLLLFCRGDVGSMELIFDKFKLFSRASSLIANLNKS 2503 D+++F G S+ I + + F+ S L+ N NK+ Sbjct: 597 DVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKT 632