BLASTX nr result
ID: Catharanthus22_contig00016248
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00016248
         (4392 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006582542.1| PREDICTED: uncharacterized protein LOC102668...   184   1e-58
ref|XP_006574288.1| PREDICTED: uncharacterized protein LOC102661...   182   2e-46
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       126   7e-45
ref|XP_006590131.1| PREDICTED: uncharacterized protein LOC102665...   179   7e-45
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...   124   1e-44
ref|XP_002331746.1| predicted protein [Populus trichocarpa]            94   7e-44
ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659...   140   1e-43
ref|XP_006598323.1| PREDICTED: uncharacterized protein LOC102660...   119   1e-42
ref|XP_006595271.1| PREDICTED: uncharacterized protein LOC100781...   127   1e-42
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   101   3e-42
gb|EEC79647.1| hypothetical protein OsI_20882 [Oryza sativa Indi...   103   1e-40
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   116   3e-39
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   116   3e-39
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...    89   6e-39
gb|EMJ04543.1| hypothetical protein PRUPE_ppa020282mg [Prunus pe...   105   6e-39
emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. vulga...    96   5e-36
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]               89   2e-35
gb|EMJ25392.1| hypothetical protein PRUPE_ppa017155mg, partial [...   103   9e-35
emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. vulga...
94 5e-34 gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,... 120 8e-34 >ref|XP_006582542.1| PREDICTED: uncharacterized protein LOC102668030 [Glycine max] Length = 411 Score = 184 bits (466), Expect(2) = 1e-58 Identities = 100/253 (39%), Positives = 137/253 (54%), Gaps = 2/253 (0%) Frame = +3 Query: 2418 MKIGWWNIRGFAKALKHIEVYRFLKAKNIVVFXXXXXXXXXXXXFDIMAWKFKEWKVSHN 2597 M I WNIRGF LKH + FL+ K I V +IM KF +W +HN Sbjct: 1 MIIASWNIRGFNLPLKHHAMQSFLRCKEINVMVVLETKLNKASVEEIMRRKFGDWHFTHN 60 Query: 2598 FGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPL 2777 F H A RI+ILWK + + V E+ Q I ++ C + K F VSF+YGLHSI RR L Sbjct: 61 FTSHNASRILILWKQDKIHLSVLESNAQLIHCAIDCKTTAKRFQVSFIYGLHSIMARRSL 120 Query: 2778 WDSLIRFGTSINLPWLVVGDFDIVLK--DRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSG 2951 W +L ++N PWL++GDF+ ++ DR + ++YE+ DF++C DLGL +N G Sbjct: 121 WINLNSINANMNCPWLLIGDFNSIMSPTDRFNGAEPNAYELQDFVDCYSDLGLGSINTHG 180 Query: 2952 SHYTRSNGYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPNRPRTC 3131 YT +NG WSK+D+A+CNQ W + ++ E + IS H P V T R + Sbjct: 181 PLYTWTNGRVWSKLDRALCNQAWFNSFGNSACEVMEFISISDHTPLVVTTELVVPRGNSP 240 Query: 3132 FMFFNMWADHDRF 3170 F F N DH F Sbjct: 241 FKFNNAIMDHPNF 253 Score = 73.6 bits (179), Expect(2) = 1e-58 Identities = 42/134 (31%), Positives = 65/134 (48%) Frame = +1 Query: 3175 LVKNGWNVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLLHD 3354 +V + W I +++C+ +++F +IS+R E A E N L Sbjct: 256 IVADSWKQNIHGYSMFKVCKKLKALKAPLKNLFKQEFRNISNRVELAEAEYNSVLNSLKQ 315 Query: 3355 NPMDHLLQESVKQLQQKALFHIEAEMKLCAQKLKYDFLINGDKATKLFHSLIKRNAKKNF 3534 NP D L + + + + +AE AQ +K +L+ DK +K FH+LIKRN F Sbjct: 316 NPQDPSLLALANRTRGQTIMLRKAESMKFAQLIKNKYLLQADKCSKFFHALIKRNRHSRF 375 Query: 3535 IASITREDGSLTNS 3576 IA+I EDG T+S Sbjct: 376 IAAIRLEDGHNTSS 389 >ref|XP_006574288.1| PREDICTED: uncharacterized protein LOC102661053 [Glycine max] Length = 331 Score = 182 bits (462), Expect(2) = 2e-46 Identities = 100/253 (39%), Positives = 134/253 (52%), Gaps = 2/253 (0%) Frame = +3 Query: 2418 
MKIGWWNIRGFAKALKHIEVYRFLKAKNIVVFXXXXXXXXXXXXFDIMAWKFKEWKVSHN 2597 M I WNIRGF LKH + FL+ K I V +IM KF +W +HN Sbjct: 1 MIIASWNIRGFNLPLKHHAMQNFLRCKEINVMAVLETKLNKASVEEIMRRKFSDWHFTHN 60 Query: 2598 FGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPL 2777 F H AGRI ILWK + V E+ Q I ++ C + K VSF+Y LHSI RR L Sbjct: 61 FTSHNAGRIFILWKQDKIHFSVLESNAQLIHCAINCKTNSKRLQVSFIYDLHSIMARRSL 120 Query: 2778 WDSLIRFGTSINLPWLVVGDFDIVLK--DRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSG 2951 W +L ++N PWL++GDF+ +L DR + ++YE+ DF++C DLGL +N G Sbjct: 121 WMNLNSINANMNCPWLLIGDFNSILSPTDRFNGAEPNAYELQDFVDCYSDLGLGSINTHG 180 Query: 2952 SHYTRSNGYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPNRPRTC 3131 YT +NG WSK+D+A+CNQ W + ++ E + IS H P V T R + Sbjct: 181 PLYTWTNGRVWSKLDRALCNQAWFNSFGNSACEVMEFISISDHTPLVVTTELVVPRGNSP 240 Query: 3132 FMFFNMWADHDRF 3170 F F N DH F Sbjct: 241 FKFNNAIVDHPNF 253 Score = 34.3 bits (77), Expect(2) = 2e-46 Identities = 16/64 (25%), Positives = 29/64 (45%) Frame = +1 Query: 3175 LVKNGWNVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLLHD 3354 +V +GW I +++C+ +++F++IS+R + A E N L Sbjct: 256 IVADGWKQNIHGCSMFKVCKKLKALKAPLKNLFKQEFSNISNRVKLAEAEYNSVLNSLKQ 315 Query: 3355 NPMD 3366 NP D Sbjct: 316 NPQD 319 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 126 bits (316), Expect(3) = 7e-45 Identities = 66/145 (45%), Positives = 90/145 (62%) Frame = +3 Query: 3663 PKQADFLVADFSKKEIKPTLFNIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTF 3842 P Q L + FS ++I+ LF++ KS GPD +T+ F +W+IV ++ DAI+EFF+ Sbjct: 437 PAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIKEFFSS 496 Query: 3843 GKLLKQAHHCVISLIPKNEQGTSVGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQ 4022 G LLKQ + I LIPK T DF PISC N YKVI +LL DR+ + +I AQ Sbjct: 497 GCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLYKVIARLLTDRLQRLLSGVISSAQ 556 Query: 4023 SAFVKGRSMLENFHLAQEILRGYIW 4097 SAF+ GRS+ EN LA +++ GY W Sbjct: 557 SAFLPGRSLAENVLLATDLVHGYNW 581 Score = 70.9 bits (172), Expect(3) = 7e-45 Identities = 65/253 (25%), Positives = 103/253 (40%), Gaps 
= 15/253 (5%) Frame = +3 Query: 2433 WNIRGFAKALKHIEVYRFLKAKNIVVFXXXXXXXXXXXXFDIMAWKFKEWKVSHNFGEHG 2612 WNIRGF +++KA + + W N+ Sbjct: 8 WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINALLPGWSFVENYAFSD 67 Query: 2613 AGRIVILWKPLLVSVHVQEATQQTIQTSVTCMI----SRKTFWVSFVYGLHSIAGRRPLW 2780 G+I ++W P + V V ++ Q +TC + S VS VY + +A R+ LW Sbjct: 68 LGKIWVMWDPSVQVVVVAKSLQM-----ITCEVLLPGSPSWIIVSVVYAANEVASRKELW 122 Query: 2781 DSLIRF---GTSINLPWLVVGDFDIVLKDRLRQTQVS---SYEVWDFLNCCVDLGLTDVN 2942 ++ G + PWLV+GDF+ VL + VS + DF +C + L+D+ Sbjct: 123 IEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSDLR 182 Query: 2943 YSGSHYTRSN-GYTW---SKIDKAMCNQQWLLNGLHAREEFLAPGC-ISHHLPRVDTLFD 3107 Y G+ +T N +T KID+ + N W N L + S H+ L + Sbjct: 183 YKGNTFTWWNKSHTTPVAKKIDRILVNDSW--NALFPSSLGIFGSLDFSDHVSCGVVLEE 240 Query: 3108 TPNRPRTCFMFFN 3146 T + + F FFN Sbjct: 241 TSIKAKRPFKFFN 253 Score = 35.0 bits (79), Expect(3) = 7e-45 Identities = 29/142 (20%), Positives = 61/142 (42%), Gaps = 5/142 (3%) Frame = +1 Query: 3172 SLVKNGW-NVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELK--QQQN 3342 +LV++ W + ++ + +R+ + +R +++ + R + A L Q + Sbjct: 263 NLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRLNYSELEKRTKEAHDFLIGCQDRT 322 Query: 3343 LLHDNPMDHLLQESVKQLQQKALFHI--EAEMKLCAQKLKYDFLINGDKATKLFHSLIKR 3516 L P++ + L+ + +HI AE QK + + GD TK FH + Sbjct: 323 LADPTPINASFE-----LEAERKWHILTAAEESFFRQKSRISWFAEGDGNTKYFHRMADA 377 Query: 3517 NAKKNFIASITREDGSLTNSMK 3582 N I+++ +G L +S + Sbjct: 378 RNSSNSISALYDGNGKLVDSQE 399 Score = 65.5 bits (158), Expect = 2e-07 Identities = 34/93 (36%), Positives = 52/93 (55%), Gaps = 1/93 (1%) Frame = +2 Query: 4082 EGVYLAQEILRGYKRKRTSPKYTLKIDIRRCMTPFHRSFWRRCSQ-LDFPTIFIE*IMAC 4258 E V LA +++ GY SP+ LK+D+++ F + L P FI I C Sbjct: 567 ENVLLATDLVHGYNWSNISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQC 626 Query: 4259 VSSPSYSLKVNGEIIGFFKGETGLR*GDPISPF 4357 +S+P++++ +NG GFFK GLR GDP+SP+ Sbjct: 627 ISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPY 659 >ref|XP_006590131.1| PREDICTED: uncharacterized protein LOC102665788 [Glycine max] Length = 
317 Score = 179 bits (454), Expect(2) = 7e-45 Identities = 96/253 (37%), Positives = 136/253 (53%), Gaps = 2/253 (0%) Frame = +3 Query: 2418 MKIGWWNIRGFAKALKHIEVYRFLKAKNIVVFXXXXXXXXXXXXFDIMAWKFKEWKVSHN 2597 M I WNIRGF LKH + FL+ K + V +IM KF +W +HN Sbjct: 1 MIIASWNIRGFNLPLKHHAMQSFLRCKEVNVMVVLETKLNKVSVKEIMRRKFGDWHFTHN 60 Query: 2598 FGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPL 2777 F + A I+ILWK + + + E+ I ++ C + K F VSF+YGLHSI RR L Sbjct: 61 FASYNADIILILWKQDKIHLSILESNAHLIHCAIDCKTTAKRFQVSFIYGLHSIVARRSL 120 Query: 2778 WDSLIRFGTSINLPWLVVGDFDIVLK--DRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSG 2951 W +L ++N PWL++GDF+ +L DR + ++YE+ DF++CC DLGL ++N G Sbjct: 121 WINLNSINANMNYPWLLIGDFNSILSPTDRFNGAEPNAYELQDFVDCCSDLGLGNINSHG 180 Query: 2952 SHYTRSNGYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPNRPRTC 3131 YT +NG WSK+D+A+CNQ W + ++ E + IS H V T R + Sbjct: 181 PLYTWTNGRVWSKLDRALCNQAWFNSFGNSAYEVMEFISISDHTLLVVTTELVVPRGNSP 240 Query: 3132 FMFFNMWADHDRF 3170 F F N DH F Sbjct: 241 FKFNNAIVDHPNF 253 Score = 32.0 bits (71), Expect(2) = 7e-45 Identities = 17/64 (26%), Positives = 27/64 (42%) Frame = +1 Query: 3169 SSLVKNGWNVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLL 3348 S +V +GW I +++C+ +++F +IS R E A E N L Sbjct: 254 SRIVADGWKQNIHGYSMFKVCKKLKALKAPLKNLFKQEFNNISHRVELAEAEYNSVLNSL 313 Query: 3349 HDNP 3360 NP Sbjct: 314 KQNP 317 >gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1449 Score = 124 bits (312), Expect(3) = 1e-44 Identities = 61/143 (42%), Positives = 92/143 (64%) Frame = +3 Query: 3663 PKQADFLVADFSKKEIKPTLFNIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTF 3842 P + D L A S KEI+ LF++ N+KS GPD YTS +K+AW+I+ + A++ FF Sbjct: 858 PAEKDMLTASVSAKEIRGALFSMPNDKSPGPDGYTSEFYKRAWDIIGAEFVLAVKSFFEK 917 Query: 3843 GKLLKQAHHCVISLIPKNEQGTSVGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQ 4022 G L K + +++LIPK + + D+ PISCCN+ YKVI+K++A+R+ + I Q Sbjct: 918 GFLPKGVNTTILALIPKKLEAKEMKDYRPISCCNVIYKVISKIIANRLKHVLPNFIAGNQ 977 Query: 4023 SAFVKGRSMLENFHLAQEILRGY 4091 
SAFVK R ++EN LA E+++ Y Sbjct: 978 SAFVKDRLLIENLLLATELVKDY 1000 Score = 66.6 bits (161), Expect(3) = 1e-44 Identities = 54/214 (25%), Positives = 94/214 (43%), Gaps = 13/214 (6%) Frame = +3 Query: 2571 FKEWKVSHNFGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGL 2750 FK+W + N+ + GR+ ++W+ + ++ Q I SV + F+ SFVY Sbjct: 467 FKDWSMLTNYEFNRRGRLWVVWRENVRFTPFYKS-DQLITCSVKLESQEEEFFYSFVYAS 525 Query: 2751 HSIAGRRPLWDSLIRFGTSINL---PWLVVGDFDIVLK----DRLRQTQVSSYEVWDFLN 2909 + R+ LW+ L S + PW++ GDF+ +L R+ + + DF + Sbjct: 526 NFAEERKILWNDLRDHMDSPIIRDKPWIIFGDFNEILDMDEHSRMEDHPAVTSGMRDFQS 585 Query: 2910 CCVDLGLTDVNYSGSHYT----RSNGYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISH 3077 +D+ G +T R N W K+D+ M N+ W + + F A GC H Sbjct: 586 LVNYCSFSDLASHGPLFTWCNKRDNDPIWKKLDRVMVNEAWKMVYPQSYNVFEAGGCSDH 645 Query: 3078 HLPRVDTLFDTPNRPR--TCFMFFNMWADHDRFQ 3173 R++ ++ + R F F N AD + F+ Sbjct: 646 LRCRINLNMNSGAQVRGNKPFKFVNAVADMEEFK 679 Score = 39.7 bits (91), Expect(3) = 1e-44 Identities = 36/145 (24%), Positives = 56/145 (38%), Gaps = 5/145 (3%) Frame = +1 Query: 3175 LVKNGWN----VQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQN 3342 LV+N W + + + +R + ++ ++ R A L Q Q Sbjct: 681 LVENFWRETEPIHMSTSSLFRFTKKLKALKPKLRGLAKEKMGNLVKRTREAYLSLCQAQQ 740 Query: 3343 LLHDNPMDHLLQ-ESVKQLQQKALFHIEAEMKLCAQKLKYDFLINGDKATKLFHSLIKRN 3519 NP ++ ES ++ + IE K Q K +L GDK K FH Sbjct: 741 SNSQNPSQRAMEIESEAYVRWDRIASIEE--KYLKQVSKLHWLKVGDKNNKTFHRAATAR 798 Query: 3520 AKKNFIASITREDGSLTNSMKYVKN 3594 A +N I I +EDGS + +KN Sbjct: 799 AAQNSIREIQKEDGSTATTKDDIKN 823 >ref|XP_002331746.1| predicted protein [Populus trichocarpa] Length = 503 Score = 94.0 bits (232), Expect(3) = 7e-44 Identities = 61/207 (29%), Positives = 100/207 (48%), Gaps = 10/207 (4%) Frame = +3 Query: 2574 KEWKVSHNFGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLH 2753 + W +N+ GRI + W V V+V + Q I SVT + + +F S +YG + Sbjct: 37 RSWSFLYNYDFSCRGRIWVCWNADTVKVNVFGMSDQAIHVSVTILATNISFNTSIIYGDN 96 Query: 2754 SIAGRRPLWDSLI-RFGTSINLPWLVVGDFDIVLKDRLRQTQVSSYEVW----DFLNCCV 2918 + + R LW ++ R 
+ PW+++GDF+ + R + ++ W D L+ C+ Sbjct: 97 NASLREALWSDIVSRSDGWESTPWILMGDFNAI---RNQSHRLGGSTTWAGTMDRLDTCI 153 Query: 2919 -DLGLTDVNYSGSHYTRSN----GYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHL 3083 + + D+ YSG HYT SN K+D+ + N++W LN + FL P IS H Sbjct: 154 REAKVDDLRYSGMHYTWSNQCPENLIMRKLDRVLVNEKWNLNFPLSEVRFL-PSGISDHS 212 Query: 3084 PRVDTLFDTPNRPRTCFMFFNMWADHD 3164 P V + + F FF+MW D + Sbjct: 213 PMVVKVIGNDQNIKKPFRFFDMWMDQN 239 Score = 77.4 bits (189), Expect(3) = 7e-44 Identities = 42/130 (32%), Positives = 70/130 (53%), Gaps = 9/130 (6%) Frame = +3 Query: 3582 VREEYLKFYVGLVGTKQETHEFDKMVMPK---------QADFLVADFSKKEIKPTLFNIG 3734 V+ E + ++ ++G Q ++ VM Q L ++KEIK +F++ Sbjct: 368 VKSEVIAYFHRVLGVDQMPRVLNEEVMESAINLKLSSTQQHVLAQVVTRKEIKHAMFSLK 427 Query: 3735 NEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTSV 3914 N K+ G D + + FK+ W+IV +D+ +A++ F ++LK+ + ISLIPK T + Sbjct: 428 NNKAPGLDGFNAGFFKRMWHIVGEDVINAVRSLFQTRRMLKEMNATSISLIPKVANPTRL 487 Query: 3915 GDFTPISCCN 3944 DF PISCCN Sbjct: 488 TDFRPISCCN 497 Score = 57.4 bits (137), Expect(3) = 7e-44 Identities = 37/136 (27%), Positives = 59/136 (43%), Gaps = 1/136 (0%) Frame = +1 Query: 3190 WNVQILWAC-QYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLLHDNPMD 3366 W Q C Y+LC N F++IS R + A+ E+ + Q LH + Sbjct: 235 WMDQNSGGCPMYQLCCNLKKLKQELKLFNMAHFSNISDRVKDAKNEMDKAQQALHTAHEN 294 Query: 3367 HLLQESVKQLQQKALFHIEAEMKLCAQKLKYDFLINGDKATKLFHSLIKRNAKKNFIASI 3546 +L + + K + AE QK + +L GD+ T FH + +N + S+ Sbjct: 295 PILCMRERDVVHKYASTVRAEESFFKQKARIQWLSLGDQNTSYFHKSVNGRHNRNKLLSL 354 Query: 3547 TREDGSLTNSMKYVKN 3594 TREDG + + VK+ Sbjct: 355 TREDGEVVEGHEAVKS 370 >ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659506 [Glycine max] Length = 964 Score = 130 bits (327), Expect(2) = 1e-43 Identities = 68/172 (39%), Positives = 98/172 (56%), Gaps = 2/172 (1%) Frame = +3 Query: 2661 VQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPLWDSLIRFGTSINLPWLVVGDF 2840 V E+ Q I ++ C + K F VSF+YGLHSI RR LW +L ++N PWL++GDF Sbjct: 454 
VLESNAQLIHCAIDCKTTAKRFQVSFIYGLHSIMARRSLWINLNSINANMNCPWLLIGDF 513 Query: 2841 DIVLK--DRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSGSHYTRSNGYTWSKIDKAMCNQ 3014 + +L DR ++++YE+ DF++C DLGL +N G YT +N WSK+D+A+CNQ Sbjct: 514 NSILSPTDRFNGAELNAYELQDFVDCYSDLGLGSINTHGPLYTWTNSRVWSKLDRALCNQ 573 Query: 3015 QWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPNRPRTCFMFFNMWADHDRF 3170 W + ++ E + IS H P V T R + F F N+ DH F Sbjct: 574 AWFNSFGNSACEVMEFISISDHTPLVVTTELVVPRGNSPFKFNNLIVDHPNF 625 Score = 76.6 bits (187), Expect(2) = 1e-43 Identities = 42/134 (31%), Positives = 67/134 (50%) Frame = +1 Query: 3175 LVKNGWNVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLLHD 3354 +V +GW I +++C+ +++F++IS+R E A E N + Sbjct: 628 IVADGWKQNIHGCSMFKVCKKLKALKAPLKNLFKQEFSNISNRVELAEAEYNSVLNSIKQ 687 Query: 3355 NPMDHLLQESVKQLQQKALFHIEAEMKLCAQKLKYDFLINGDKATKLFHSLIKRNAKKNF 3534 NP D L + + + + +AE AQ +K +L+ DK +K FH+LIKRN F Sbjct: 688 NPQDPSLLALANRTRGQTIMLRKAESMKFAQLIKNKYLLQADKCSKFFHALIKRNKHSRF 747 Query: 3535 IASITREDGSLTNS 3576 IA+I EDG T+S Sbjct: 748 IAAIRLEDGHNTSS 761 Score = 140 bits (353), Expect = 5e-30 Identities = 64/137 (46%), Positives = 94/137 (68%) Frame = +3 Query: 3681 LVADFSKKEIKPTLFNIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQ 3860 L+ SK+++ + + N K+ GPD + FKKAWNIV DD+ A+ EFFT GK+LKQ Sbjct: 805 LLCPTSKQKVWNIISVMANNKAPGPDGFNVLFFKKAWNIVGDDIFAAVNEFFTTGKILKQ 864 Query: 3861 AHHCVISLIPKNEQGTSVGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQSAFVKG 4040 +H +I LIPK++Q + V F PISCCN+ YK+++K+LA+R+ + +I + Q+AF+K Sbjct: 865 LNHAIIVLIPKHDQASQVNHFRPISCCNLLYKIVSKILANRIAPVLETIIGETQTAFIKN 924 Query: 4041 RSMLENFHLAQEILRGY 4091 R M++N L QEILR Y Sbjct: 925 RKMMDNIFLVQEILRKY 941 Score = 80.9 bits (198), Expect(2) = 1e-17 Identities = 39/85 (45%), Positives = 53/85 (62%), Gaps = 1/85 (1%) Frame = +1 Query: 1414 WKTPHKFHIHRTGWLVFKFDSEVDRQKILDGG-HMIYGRLLVLKNMPPLFEFGACINTML 1590 W + H +GWLVFKF+SE D ++L G + I+ R L+LK MP F+FG + + Sbjct: 201 WGVKFSYSAHESGWLVFKFESEDDLNQVLSAGPYFIFQRPLLLKVMPAFFDFGNEELSKI 260 Query: 1591 PVWVTLPGLPIDLWNERVLAKICSK 1665 PVWV L 
LP++LWN + L KI SK Sbjct: 261 PVWVKLRNLPLELWNPQALGKILSK 285 Score = 38.9 bits (89), Expect(2) = 1e-17 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 1/53 (1%) Frame = +2 Query: 1226 SLLQDNRNPSRGMSL*-KVEDHDDVVDMETDEVSDLVATMGYALVSYVAGGFP 1381 +L +DNR+PS+G + DD V +E ++ L G++L+ YVAG FP Sbjct: 137 NLFKDNRSPSKGFGMKFSPPPSDDEVLLEETDLQPLEEAWGHSLIGYVAGRFP 189 Score = 62.8 bits (151), Expect = 1e-06 Identities = 33/82 (40%), Positives = 47/82 (57%) Frame = +3 Query: 1638 KGFGQDMLKIGEPMCADAMTARM*RISYARVLAKVDIAKELITEVTIKLSQWEMRSQYVL 1817 + G+ + KIG P+ +D +TA IS+AR L +VD + ELI EV +L + Q + Sbjct: 277 QALGKILSKIGSPIRSDHLTASKGSISFARALVEVDASLELIDEVRFRLPTGKTFVQKIE 336 Query: 1818 YENLPKFCSLCHVIGHS*EMCK 1883 YEN P FC+ C + GH CK Sbjct: 337 YENRPSFCTHCKMTGHRLTNCK 358 >ref|XP_006598323.1| PREDICTED: uncharacterized protein LOC102660513 [Glycine max] Length = 543 Score = 119 bits (299), Expect(3) = 1e-42 Identities = 62/179 (34%), Positives = 106/179 (59%), Gaps = 8/179 (4%) Frame = +3 Query: 3579 EVREEYLKFYVGLVGTKQETHE-FDKMVMPK-------QADFLVADFSKKEIKPTLFNIG 3734 E+ +E ++FY L+G ++ + D +M K Q +L+ + +EI L +IG Sbjct: 362 EIEDEIMRFYGDLMGREEPNLDSVDINIMRKGCQLNFDQRKYLIGRITDEEIDKALKSIG 421 Query: 3735 NEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTSV 3914 + K+ G D Y + FK AW+I+ D DAI+EFF GK+ + + ++ LIPKN++ Sbjct: 422 DLKAPGIDGYGAKFFKDAWSIIKSDFTDAIREFFEKGKMYEPINTSLVILIPKNQEAKYA 481 Query: 3915 GDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQEILRGY 4091 D+ PISCC YKVI+K+L R+ + +++ ++Q+AFV G+ + + LA E+++GY Sbjct: 482 RDYRPISCCTTIYKVISKVLTTRLSRVIKSIVHQSQAAFVPGQKIHDQILLAYELIQGY 540 Score = 75.1 bits (183), Expect(3) = 1e-42 Identities = 54/202 (26%), Positives = 85/202 (42%), Gaps = 6/202 (2%) Frame = +3 Query: 2583 KVSHNFGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIA 2762 K N+ +H GRI + W +V + T Q I V W++ +YG + + Sbjct: 23 KYLDNYVKHNNGRIWVYWDDNIVDIQEVNCTAQLIHCKVYDATGYFMQWLTAIYGFNYLE 82 Query: 2763 GRRPLWDSLIRFGTSINLPWLVVGDFDIVLK--DRLRQTQVSSYEVWDFLNCCVDLGLTD 2936 LW L + 
PW ++GDF+ VLK DR+ V E D + GL + Sbjct: 83 QCTDLWHDLEAINKTQQGPWCLIGDFNNVLKTNDRVGGKMVCEKEYKDLRTMMDNTGLAE 142 Query: 2937 VNYSGSHYTRSN----GYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLF 3104 ++ G +YT SN +S+ID+ + N +W L+ + PG IS H Sbjct: 143 MDSKGDYYTWSNKQSENIIYSRIDRILGNTEWFSKNLNLSLTNMTPG-ISDHAMLCLRDD 201 Query: 3105 DTPNRPRTCFMFFNMWADHDRF 3170 P + + F + N + D F Sbjct: 202 SVPVKRKARFKYANCVSGMDNF 223 Score = 29.6 bits (65), Expect(3) = 1e-42 Identities = 21/106 (19%), Positives = 44/106 (41%) Frame = +1 Query: 3277 KDFAHISSRAEAARKELKQQQNLLHDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQKLK 3456 K I + + AR++L Q L + ++ + + + E E ++ Q+ K Sbjct: 260 KPLIGIKVKLQEAREKLTHAQMELTLDRLNKDKIDRTNDCTEAVIKWTEMEEQMLQQRAK 319 Query: 3457 YDFLINGDKATKLFHSLIKRNAKKNFIASITREDGSLTNSMKYVKN 3594 +L GD FH+ +K + I + DG+ + K +++ Sbjct: 320 IRWLRLGDGNNAYFHASLKAKYNQTSIKKLYMNDGNFVTTQKEIED 365 >ref|XP_006595271.1| PREDICTED: uncharacterized protein LOC100781932 [Glycine max] Length = 952 Score = 127 bits (318), Expect(2) = 1e-42 Identities = 66/172 (38%), Positives = 97/172 (56%), Gaps = 2/172 (1%) Frame = +3 Query: 2661 VQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPLWDSLIRFGTSINLPWLVVGDF 2840 V E+ + I ++ C + K F VSF+YGLHSI R+ LW ++ ++N WL++GDF Sbjct: 529 VFESNAKLIHCAIDCKTTAKRFQVSFIYGLHSIVARKSLWINMNSINANMNCLWLLIGDF 588 Query: 2841 DIVLK--DRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSGSHYTRSNGYTWSKIDKAMCNQ 3014 + +L DR + ++YE+ DF++CC DLGL +N G YT +NG WSK+D+A+CNQ Sbjct: 589 NSILSPTDRFNGAEPNAYELQDFVDCCSDLGLGSINTHGPLYTWTNGRVWSKLDRALCNQ 648 Query: 3015 QWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPNRPRTCFMFFNMWADHDRF 3170 W + ++ E + IS H P V T R + F F N DH F Sbjct: 649 VWFNSFGNSACEVMEFISISDHTPLVVTTKLVVPRGNSPFKFNNAIVDHPNF 700 Score = 76.6 bits (187), Expect(2) = 1e-42 Identities = 43/136 (31%), Positives = 67/136 (49%) Frame = +1 Query: 3169 SSLVKNGWNVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLL 3348 S +V +GW I +++C+ +++F++IS+R E A E N L Sbjct: 701 SRIVADGWKQNIHGCSMFKVCKKLKVLKASLKNLFKQEFSNISNRVELAEVEYNSVLNSL 760 Query: 3349 
HDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQKLKYDFLINGDKATKLFHSLIKRNAKK 3528 NP DH L + + + + + E AQ +K +L+ D +K FH+LIKRN Sbjct: 761 KQNPQDHSLLALANRTRGQTIMFRKVESMKFAQLIKNRYLLQVDICSKFFHALIKRNRHS 820 Query: 3529 NFIASITREDGSLTNS 3576 FIA+I EDG T+S Sbjct: 821 RFIAAIRLEDGHNTSS 836 Score = 75.9 bits (185), Expect = 2e-10 Identities = 32/68 (47%), Positives = 50/68 (73%) Frame = +3 Query: 3696 SKKEIKPTLFNIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCV 3875 SK+E+ +F + N K+ GP+ + + FKKAWNI+ DD+ +A+ EFFT K+LKQ +H + Sbjct: 885 SKQEVWNVIFVMDNNKAPGPNGFNALFFKKAWNIIGDDIFEAVNEFFTTRKILKQINHAI 944 Query: 3876 ISLIPKNE 3899 I+LIPK++ Sbjct: 945 IALIPKHD 952 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 101 bits (251), Expect(3) = 3e-42 Identities = 57/179 (31%), Positives = 97/179 (54%), Gaps = 8/179 (4%) Frame = +3 Query: 3579 EVREEYLKFYVGLVGTKQETHE-FDKMVMPKQADF-------LVADFSKKEIKPTLFNIG 3734 E++ E FY L+GT E D V+ A LV + +EI L +I Sbjct: 395 EIQNEICNFYRRLLGTSSSQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADID 454 Query: 3735 NEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTSV 3914 + K+ G D + S FKK+W ++ ++ + I +FF G + K + ++LIPK ++ Sbjct: 455 DTKAPGLDGFNSVFFKKSWLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHA 514 Query: 3915 GDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQEILRGY 4091 D+ PI+CC+ YK+I+K+L R+ ++ ++D AQ+ F+ R + +N LA E++RGY Sbjct: 515 KDYRPIACCSTLYKIISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGY 573 Score = 92.4 bits (228), Expect(3) = 3e-42 Identities = 73/258 (28%), Positives = 111/258 (43%), Gaps = 7/258 (2%) Frame = +3 Query: 2418 MKIGWWNIRGFAKALKHIEVYRFLKAKNIVVFXXXXXXXXXXXXFDIMAWKFKEWKVSHN 2597 MKI WN+RG +K EV FL ++ I + I W +N Sbjct: 1 MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGKIQKKFGNRWSWINN 60 Query: 2598 FGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPL 2777 + GRI + W V+++V T+Q I V F ++ VYGLH+IA R+ L Sbjct: 61 YACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTIADRKVL 120 Query: 2778 
WDSLIRFGTSINLPWLVVGDFDIV--LKDRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSG 2951 W+ L F + + P +++GD++ V +DRL VS E D + + L + +G Sbjct: 121 WEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLLEAPTTG 180 Query: 2952 SHYTRSN-----GYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPN 3116 Y+ +N S+IDK+ N W+ E+ G IS H P + L + Sbjct: 181 LFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAG-ISDHSPLIFNLATQHD 239 Query: 3117 RPRTCFMFFNMWADHDRF 3170 F F N AD + F Sbjct: 240 EGGRPFKFLNFLADQNGF 257 Score = 29.6 bits (65), Expect(3) = 3e-42 Identities = 25/106 (23%), Positives = 43/106 (40%) Frame = +1 Query: 3277 KDFAHISSRAEAARKELKQQQNLLHDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQKLK 3456 K F+ + E R++L Q L + + L QE K L + + + QK + Sbjct: 294 KKFSKAHCQVEELRRKLAAVQALPEVSQVSEL-QEEEKDLIAQLRKWSTIDESILKQKSR 352 Query: 3457 YDFLINGDKATKLFHSLIKRNAKKNFIASITREDGSLTNSMKYVKN 3594 +L GD +K F + IK +N I + + G ++N Sbjct: 353 IQWLSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQN 398 Score = 74.7 bits (182), Expect = 3e-10 Identities = 39/93 (41%), Positives = 56/93 (60%), Gaps = 1/93 (1%) Frame = +2 Query: 4082 EGVYLAQEILRGYKRKRTSPKYTLKIDIRRCMTPFHRSFWR-RCSQLDFPTIFIE*IMAC 4258 + + LA E++RGY R+ SP+ +K+DIR+ F +L FP++FI IMAC Sbjct: 561 DNILLATELIRGYNRRHVSPRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMAC 620 Query: 4259 VSSPSYSLKVNGEIIGFFKGETGLR*GDPISPF 4357 V + SYS+ +NG F + GLR GDP+SPF Sbjct: 621 VKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPF 653 >gb|EEC79647.1| hypothetical protein OsI_20882 [Oryza sativa Indica Group] Length = 1784 Score = 103 bits (256), Expect(4) = 1e-40 Identities = 58/172 (33%), Positives = 92/172 (53%), Gaps = 3/172 (1%) Frame = +3 Query: 3570 KFNEVREEYLKFYVG---LVGTKQETHEFDKMVMPKQADFLVADFSKKEIKPTLFNIGNE 3740 K ++ EY K L+ + T + V P + L ++F ++EI +F IG Sbjct: 389 KLEDMATEYFKEVFSADPLLDQSKVTRLIQRKVSPAMNETLCSEFKEEEISNAMFQIGPL 448 Query: 3741 KSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTSVGD 3920 K+ GPD + + +++ W + +D+ A++ FF G + + + I LIPK EQ + D Sbjct: 449 KAPGPDGFPARFYQRHWGFMKNDIVRAVKLFFDTGVMPEGVNDTAIVLIPKIEQPMELRD 508 Query: 3921 
FTPISCCNIFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQE 4076 F PIS CN+ YKV++K L +R+ I L+ QSAFV GR + +N LA E Sbjct: 509 FRPISLCNVIYKVVSKCLVNRLRPILDELVSPCQSAFVLGRMITDNAILAFE 560 Score = 56.2 bits (134), Expect(4) = 1e-40 Identities = 47/189 (24%), Positives = 80/189 (42%), Gaps = 9/189 (4%) Frame = +3 Query: 2613 AGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPLWDSLI 2792 +G + + W V V ++ ++ I V ++ + V+FVYG + R +W L Sbjct: 65 SGGLALFWDES-VFVEIKGINERYIDAYVRLSVNDPIWHVTFVYGEPRVEHRHRMWSMLN 123 Query: 2793 RFGTSINLPWLVVGDFDIVL--KDRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSGSHYTR 2966 S NLPWLV+GDF+ L + + + S ++ F + L D+ + G YT Sbjct: 124 SIKQSSNLPWLVLGDFNETLWQFEHFSKKKRSEVQMQAFRDVLQTYELHDLGFKGLPYTY 183 Query: 2967 SN-----GYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPNRP--R 3125 N ++D+ + + W ++ E L C S+H P V N+P R Sbjct: 184 DNKREGINNVRVRLDRVVADDGWRDMFRSSQVEHLISPC-SNHCPVVLKFCVDTNQPARR 242 Query: 3126 TCFMFFNMW 3152 C + W Sbjct: 243 KCLHYEIFW 251 Score = 46.2 bits (108), Expect(4) = 1e-40 Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 4/82 (4%) Frame = +2 Query: 4124 RKRTSPKYTLKIDIRRCMTPFHRSFWRRCSQ----LDFPTIFIE*IMACVSSPSYSLKVN 4291 RK S K+D+ + + R W Q L F ++ IM C+++ YS+K N Sbjct: 569 RKPESAACAYKLDLSKA---YDRVDWGFLEQSLYKLGFAHRWVRWIMVCITTVRYSVKFN 625 Query: 4292 GEIIGFFKGETGLR*GDPISPF 4357 G ++ F GLR GDP+SPF Sbjct: 626 GTLLSTFAPSRGLRQGDPLSPF 647 Score = 32.3 bits (72), Expect(4) = 1e-40 Identities = 25/99 (25%), Positives = 44/99 (44%) Frame = +1 Query: 3271 NRKDFAHISSRAEAARKELKQQQNLLHDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQK 3450 +R+ ++ E RK L + L ++ D S + L+ E L Q+ Sbjct: 292 SRRKCKNVGREIEKGRKRLAE----LIESGADSRSIRSASDNLHELLYR---EEMLWLQR 344 Query: 3451 LKYDFLINGDKATKLFHSLIKRNAKKNFIASITREDGSL 3567 + ++L GD+ T+ FHS AKKN I + +G++ Sbjct: 345 SRVNWLKEGDRNTRFFHSKAVWRAKKNRITKLKDREGTV 383 Score = 102 bits (255), Expect(3) = 6e-30 Identities = 58/172 (33%), Positives = 92/172 (53%), Gaps = 3/172 (1%) Frame = +3 Query: 3570 KFNEVREEYLKFYVG---LVGTKQETHEFDKMVMPKQADFLVADFSKKEIKPTLFNIGNE 3740 K ++ 
EY K L+ + T + V P + L ++F ++EI +F IG Sbjct: 1087 KLEDMATEYFKEVFSADPLLDQSKVTRLIQRKVSPAMNETLCSEFKEEEISNAMFQIGPL 1146 Query: 3741 KSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTSVGD 3920 K+ GPD + + +++ W + +D+ A++ FF G + + + I LIPK EQ + D Sbjct: 1147 KALGPDGFPARFYQRHWGFMKNDIVRAVKLFFDTGVMPEGVNDTAIVLIPKIEQPMELRD 1206 Query: 3921 FTPISCCNIFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQE 4076 F PIS CN+ YKV++K L +R+ I L+ QSAFV GR + +N LA E Sbjct: 1207 FRPISLCNVIYKVVSKCLVNRLRPILDELVSPCQSAFVLGRMITDNAILAFE 1258 Score = 46.2 bits (108), Expect(3) = 6e-30 Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 4/82 (4%) Frame = +2 Query: 4124 RKRTSPKYTLKIDIRRCMTPFHRSFWRRCSQ----LDFPTIFIE*IMACVSSPSYSLKVN 4291 RK S K+D+ + + R W Q L F ++ IM C+++ YS+K N Sbjct: 1267 RKPESAACAYKLDLSKA---YDRVDWGFLEQSLYKLGFAHRWVRWIMVCITTVRYSVKFN 1323 Query: 4292 GEIIGFFKGETGLR*GDPISPF 4357 G ++ F GLR GDP+SPF Sbjct: 1324 GTLLSTFAPSRGLRQGDPLSPF 1345 Score = 32.7 bits (73), Expect(3) = 6e-30 Identities = 25/99 (25%), Positives = 44/99 (44%) Frame = +1 Query: 3271 NRKDFAHISSRAEAARKELKQQQNLLHDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQK 3450 +R+ ++ E RK L + L ++ D S + L+ E L Q+ Sbjct: 990 SRRKCKNVGREIEKGRKRLAE----LIESGADSTSIRSASDNLHELLYR---EEMLWLQR 1042 Query: 3451 LKYDFLINGDKATKLFHSLIKRNAKKNFIASITREDGSL 3567 + ++L GD+ T+ FHS AKKN I + +G++ Sbjct: 1043 SRVNWLKEGDRNTRFFHSKAVWRAKKNRITKLKDREGTV 1081 >dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 116 bits (290), Expect(3) = 3e-39 Identities = 61/141 (43%), Positives = 85/141 (60%) Frame = +3 Query: 3669 QADFLVADFSKKEIKPTLFNIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGK 3848 Q + L FS +I+ F++ K+SGPD Y+S FK W +V ++ +A+QEFF G+ Sbjct: 440 QINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQ 499 Query: 3849 LLKQAHHCVISLIPKNEQGTSVGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQSA 4028 LLKQ + + LIPK + + DF PISC N YKVI KLL R+ + +I +QSA Sbjct: 500 LLKQWNATTLVLIPKITNSSKMTDFRPISCLNTLYKVIAKLLTSRLKKLLNEVISPSQSA 559 Query: 
4029 FVKGRSMLENFHLAQEILRGY 4091 F+ GR + EN LA EI+ GY Sbjct: 560 FLPGRLLSENVLLATEIVHGY 580 Score = 64.3 bits (155), Expect(3) = 3e-39 Identities = 56/204 (27%), Positives = 83/204 (40%), Gaps = 7/204 (3%) Frame = +3 Query: 2580 WKVSHNFGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSI 2759 W N+ G+I +LW P V V V + Q I + S F VS VY + Sbjct: 57 WSFVENYEFSVLGKIWVLWDP-SVKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEE 115 Query: 2760 AGRRPLWDSLIRFGTS---INLPWLVVGDFDIVLKDRLRQTQVSSYEVWDFLNCCVDLGL 2930 R+ LW+ L++ S + W+V+GDF+ +L ++ F +C +D L Sbjct: 116 GTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINANIGRKIRAFRSCLLDSDL 175 Query: 2931 TDVNYSGSHYTRSNGYT----WSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDT 3098 D+ Y GS YT N + KID+ + N W A F P H V Sbjct: 176 YDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEV-V 234 Query: 3099 LFDTPNRPRTCFMFFNMWADHDRF 3170 L + + F FFN + + F Sbjct: 235 LDPAVLKAKRPFRFFNYFLHNPDF 258 Score = 32.7 bits (73), Expect(3) = 3e-39 Identities = 27/142 (19%), Positives = 55/142 (38%), Gaps = 3/142 (2%) Frame = +1 Query: 3175 LVKNGW-NVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLLH 3351 L++ W + + + YR+ + +R++++ I R A + +Q + Sbjct: 261 LIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSRENYSDIEKRVSEAHAIVLHRQRITL 320 Query: 3352 DNPMDHLLQESVKQLQQKALFHI--EAEMKLCAQKLKYDFLINGDKATKLFHSLIKRNAK 3525 NP + + +L+ + I +AE QK +L GD T FH + Sbjct: 321 TNPS---VVHATLELEATRKWQILAKAEESFFCQKSSISWLYEGDNNTAYFHKMADMRKS 377 Query: 3526 KNFIASITREDGSLTNSMKYVK 3591 N I + + G + + +K Sbjct: 378 INTINFLIDDFGERIETQQGIK 399 Score = 67.8 bits (164), Expect = 4e-08 Identities = 38/93 (40%), Positives = 51/93 (54%), Gaps = 1/93 (1%) Frame = +2 Query: 4082 EGVYLAQEILRGYKRKRTSPKYTLKIDIRRCMTPFHRSFWRRCSQ-LDFPTIFIE*IMAC 4258 E V LA EI+ GY K S + LK+D+R+ F + L P F+ I C Sbjct: 568 ENVLLATEIVHGYNTKNISSRGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQC 627 Query: 4259 VSSPSYSLKVNGEIIGFFKGETGLR*GDPISPF 4357 +S+P +S+ VNG GFFK GLR GDP+SP+ Sbjct: 628 ISTPYFSVMVNGSSSGFFKSNKGLRQGDPLSPY 660 >emb|CAA66812.1| non-ltr retrotransposon reverse 
transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 116 bits (290), Expect(3) = 3e-39 Identities = 61/141 (43%), Positives = 85/141 (60%) Frame = +3 Query: 3669 QADFLVADFSKKEIKPTLFNIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGK 3848 Q + L FS +I+ F++ K+SGPD Y+S FK W +V ++ +A+QEFF G+ Sbjct: 440 QINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQ 499 Query: 3849 LLKQAHHCVISLIPKNEQGTSVGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQSA 4028 LLKQ + + LIPK + + DF PISC N YKVI KLL R+ + +I +QSA Sbjct: 500 LLKQWNATTLVLIPKITNSSKMTDFRPISCLNTLYKVIAKLLTSRLKKLLNEVISPSQSA 559 Query: 4029 FVKGRSMLENFHLAQEILRGY 4091 F+ GR + EN LA EI+ GY Sbjct: 560 FLPGRLLSENVLLATEIVHGY 580 Score = 64.3 bits (155), Expect(3) = 3e-39 Identities = 56/204 (27%), Positives = 83/204 (40%), Gaps = 7/204 (3%) Frame = +3 Query: 2580 WKVSHNFGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSI 2759 W N+ G+I +LW P V V V + Q I + S F VS VY + Sbjct: 57 WSFVENYEFSVLGKIWVLWDP-SVKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEE 115 Query: 2760 AGRRPLWDSLIRFGTS---INLPWLVVGDFDIVLKDRLRQTQVSSYEVWDFLNCCVDLGL 2930 R+ LW+ L++ S + W+V+GDF+ +L ++ F +C +D L Sbjct: 116 GTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINANIGRKIRAFRSCLLDSDL 175 Query: 2931 TDVNYSGSHYTRSNGYT----WSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDT 3098 D+ Y GS YT N + KID+ + N W A F P H V Sbjct: 176 YDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEV-V 234 Query: 3099 LFDTPNRPRTCFMFFNMWADHDRF 3170 L + + F FFN + + F Sbjct: 235 LDPAVLKAKRPFRFFNYFLHNPDF 258 Score = 32.7 bits (73), Expect(3) = 3e-39 Identities = 27/142 (19%), Positives = 55/142 (38%), Gaps = 3/142 (2%) Frame = +1 Query: 3175 LVKNGW-NVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLLH 3351 L++ W + + + YR+ + +R++++ I R A + +Q + Sbjct: 261 LIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSRENYSDIEKRVSEAHAIVLHRQRITL 320 Query: 3352 DNPMDHLLQESVKQLQQKALFHI--EAEMKLCAQKLKYDFLINGDKATKLFHSLIKRNAK 3525 NP + + +L+ + I +AE QK +L GD T FH + Sbjct: 321 TNPS---VVHATLELEATRKWQILAKAEESFFCQKSSISWLYEGDNNTAYFHKMADMRKS 377 Query: 3526 
KNFIASITREDGSLTNSMKYVK 3591 N I + + G + + +K Sbjct: 378 INTINFLIDDFGERIETQQGIK 399 Score = 67.8 bits (164), Expect = 4e-08 Identities = 38/93 (40%), Positives = 51/93 (54%), Gaps = 1/93 (1%) Frame = +2 Query: 4082 EGVYLAQEILRGYKRKRTSPKYTLKIDIRRCMTPFHRSFWRRCSQ-LDFPTIFIE*IMAC 4258 E V LA EI+ GY K S + LK+D+R+ F + L P F+ I C Sbjct: 568 ENVLLATEIVHGYNTKNISSRGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQC 627 Query: 4259 VSSPSYSLKVNGEIIGFFKGETGLR*GDPISPF 4357 +S+P +S+ VNG GFFK GLR GDP+SP+ Sbjct: 628 ISTPYFSVMVNGSSSGFFKSNKGLRQGDPLSPY 660 >dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 910 Score = 89.4 bits (220), Expect(3) = 6e-39 Identities = 55/182 (30%), Positives = 93/182 (51%), Gaps = 10/182 (5%) Frame = +3 Query: 3576 NEVREEYLKFYVGLVGTKQE-----THEFDKMVMPKQADFLVADF-----SKKEIKPTLF 3725 ++++ + +Y L+G E + E K ++P + D +A S++EI LF Sbjct: 398 DQIKGMLIAYYSHLLGIPSENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLF 457 Query: 3726 NIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQG 3905 ++ K+ GPD + F +AW IV + AI+EFF G L + + I+LIPK Sbjct: 458 SMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGA 517 Query: 3906 TSVGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQEILR 4085 + F P++CC YKVIT++++ R+ + Q F+KGR + EN LA E++ Sbjct: 518 DRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVD 577 Query: 4086 GY 4091 + Sbjct: 578 NF 579 Score = 87.4 bits (215), Expect(3) = 6e-39 Identities = 69/262 (26%), Positives = 118/262 (45%), Gaps = 11/262 (4%) Frame = +3 Query: 2418 MKIGWWNIRGFAKALKHIEVYRFLKAKNIVVFXXXXXXXXXXXXFDIMAWKFKEWKVSHN 2597 MK+ WNIRG + V ++ + N++V ++A W++ N Sbjct: 1 MKVFCWNIRGLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENANSVLASTLPGWRMDSN 60 Query: 2598 FGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPL 2777 + GRI I+W P +SV V + T Q + S+ ++F V+FVYG +S RR L Sbjct: 61 YCCSELGRIWIVWDPS-ISVLVFKRTDQIMFCSIKIPSLLQSFAVAFVYGRNSELDRRSL 119 Query: 2778 WDSLIRFGTSINL---PWLVVGDFDIVLKD----RLRQTQVSSYEVWDFLNCCVDLGLTD 2936 W+ ++ + L PWL++GDF+ + + Q+ ++ + D C 
D L+D Sbjct: 120 WEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLRDSQLSD 179 Query: 2937 VNYSGSHYTRSN----GYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLF 3104 + G +T SN K+D+A+ N +W A F PG S H P + + Sbjct: 180 LPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPG-DSDHAPCIILID 238 Query: 3105 DTPNRPRTCFMFFNMWADHDRF 3170 + P + F +F+ + H + Sbjct: 239 NQPPPSKKSFKYFSFLSSHPSY 260 Score = 35.4 bits (80), Expect(3) = 6e-39 Identities = 27/107 (25%), Positives = 44/107 (41%) Frame = +1 Query: 3271 NRKDFAHISSRAEAARKELKQQQNLLHDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQK 3450 NR F++I R + L+ Q L +P D L + +Q F E QK Sbjct: 296 NRLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRREHVARKQWIFFAAALE-SFFRQK 354 Query: 3451 LKYDFLINGDKATKLFHSLIKRNAKKNFIASITREDGSLTNSMKYVK 3591 + +L GD T+ FH + + N I + +DG ++ +K Sbjct: 355 SRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIK 401 Score = 66.6 bits (161), Expect = 9e-08 Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 1/91 (1%) Frame = +2 Query: 4082 EGVYLAQEILRGYKRKRTSPKYTLKIDIRRCMTPFHRSFWRRCSQ-LDFPTIFIE*IMAC 4258 E V LA E++ ++ + + L++DI + + F + LD P +FI I C Sbjct: 567 ENVLLASELVDNFEADGETTRGCLQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVC 626 Query: 4259 VSSPSYSLKVNGEIIGFFKGETGLR*GDPIS 4351 +SS SYS+ NGE+IGFF+G+ G+R GDP+S Sbjct: 627 ISSASYSIAFNGELIGFFQGKKGIRQGDPMS 657 >gb|EMJ04543.1| hypothetical protein PRUPE_ppa020282mg [Prunus persica] Length = 1496 Score = 105 bits (261), Expect(4) = 6e-39 Identities = 61/174 (35%), Positives = 98/174 (56%), Gaps = 3/174 (1%) Frame = +3 Query: 3594 YLKFYVGLVGTKQETHEFDKM---VMPKQADFLVADFSKKEIKPTLFNIGNEKSSGPDSY 3764 Y + G+ + T D + V + L+A F+ +EIK LF + K+ GPD + Sbjct: 755 YFQHLFSSTGSSEYTEVVDGVRGRVTEEMNQALLAVFTPEEIKIALFQMHPSKAPGPDGF 814 Query: 3765 TSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTSVGDFTPISCCN 3944 + + ++K W IV +D+ A+ FF GKLLK+ + ++LIPK + ++ PIS CN Sbjct: 815 SPFFYQKYWPIVGEDVVAAVLHFFKTGKLLKRINFTHVALIPKVHEPKNMMQLRPISLCN 874 Query: 3945 IFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQEILRGYIWHKK 4106 + YK+ K+L R+ +I LI QSAFV GR++ +N +A 
E+L ++ HKK Sbjct: 875 VLYKIGAKVLTTRLKAILPTLISDTQSAFVPGRAISDNSIVAFELL--HMMHKK 926 Score = 52.4 bits (124), Expect(4) = 6e-39 Identities = 54/189 (28%), Positives = 77/189 (40%), Gaps = 4/189 (2%) Frame = +3 Query: 2607 HGA-GRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHSIAGRRPLWD 2783 HGA G + ++W LV V + I T V + R + + YG A R WD Sbjct: 447 HGASGGLCLMWTEELV-VTARSFGTNHIDTEVEILGVRGKWRFTGFYGCPVTAERHRSWD 505 Query: 2784 SLIRFGTSINLPWLVVGDFDIVLKDRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSGSHYT 2963 L R G + LPWL GDF+ + LR + + + F D+ Y+G YT Sbjct: 506 LLRRLGATNYLPWLCCGDFNEI----LRADEKLAIDTCRF---------KDLGYTGPKYT 552 Query: 2964 --RSNGYTWS-KIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPNRPRTCF 3134 R+N ++D+A+ W L + L P S HLP + F Sbjct: 553 WWRNNPMEIRIRLDRALATADWCSRFLGTKVIHLNP-TKSDHLPL-----------KKLF 600 Query: 3135 MFFNMWADH 3161 F MWA+H Sbjct: 601 RFEEMWAEH 609 Score = 47.0 bits (110), Expect(4) = 6e-39 Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 1/70 (1%) Frame = +2 Query: 4151 LKIDIRRCMTPFHRSFWRRCSQ-LDFPTIFIE*IMACVSSPSYSLKVNGEIIGFFKGETG 4327 LKID+ + SF + + F +I+ IM CV++ SYS +NG +G+ + G Sbjct: 936 LKIDMSKAYDRVEWSFLEALMKGMGFAPRWIQLIMECVTTVSYSFMLNGNPVGYVIPQRG 995 Query: 4328 LR*GDPISPF 4357 LR GDP+SP+ Sbjct: 996 LRQGDPLSPY 1005 Score = 27.7 bits (60), Expect(4) = 6e-39 Identities = 25/129 (19%), Positives = 55/129 (42%), Gaps = 1/129 (0%) Frame = +1 Query: 3178 VKNGWNVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLLHDN 3357 +++GW + + ++ +F H+ ++ + R++L + L D Sbjct: 616 IQDGWQRTCRGSAPFTTTEKLKCTRHKLLGWSKCNFGHLPNQIKITREKLGE----LLDA 671 Query: 3358 PMDHLLQESVKQLQQKALFHIEAEMKLC-AQKLKYDFLINGDKATKLFHSLIKRNAKKNF 3534 P H E ++ K L + A+ ++ Q + +L GD+ +K FH ++N Sbjct: 672 PPSHHTAE-LRNALTKQLDSLMAKNEVYWRQCSRATWLKAGDRNSKFFHYKASSRRRRNT 730 Query: 3535 IASITREDG 3561 I+++ E G Sbjct: 731 ISALEDEHG 739 >emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1379 Score = 96.3 bits (238), Expect(3) = 5e-36 Identities = 57/177 (32%), Positives = 100/177 (56%), Gaps = 6/177 (3%) Frame = +3 Query: 3570 KFNEVREEYLKFYVGLVG---TKQETHE---FDKMVMPKQADFLVADFSKKEIKPTLFNI 3731 K N+++EE + F+ + T++ T E F+++ QAD L+ FS +EI + + Sbjct: 382 KPNQIKEEAVTFFKEIFTEEFTERPTLEGLQFNQLSQ-NQADSLIQPFSDEEIDYAVNSC 440 Query: 3732 GNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTS 3911 ++K+ GPD + K AW + +D+ ++EF+ KL K ++ I+LIPK + + Sbjct: 441 ASDKAPGPDGFNFKFIKNAWETIKEDVYTLVREFWATSKLPKGSNSTFITLIPKIDNPEN 500 Query: 3912 VGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQEIL 4082 DF PIS YK+I KL+A R+ + +LI QS++V+GR +L+ +A E++ Sbjct: 501 FKDFRPISMVGCVYKIIAKLMAKRIQRVMSSLIGPLQSSYVEGRQILDGALVASEVI 557 Score = 58.9 bits (141), Expect(3) = 5e-36 Identities = 61/252 (24%), Positives = 105/252 (41%), Gaps = 7/252 (2%) Frame = +3 Query: 2418 MKIGWWNIRGFAKALKHIEVYRFLKAKNIVVFXXXXXXXXXXXXFDIMA--WKFK--EWK 2585 M + WNIRG +K + + ++ K+ F +I+ WK + EW Sbjct: 1 MSVLSWNIRGLTARVKRSAIRKLIQ-KHTPDFVFVQETKMEGISLEIVKTMWKSQDVEWT 59 Query: 2586 VSHNFGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSF--VYGLHSI 2759 + G G ++ +W S+ Q I +++ SR F VY +++ Sbjct: 60 WYPSVGNSGG--LISMWNKSAFSMKSSSVNQHWI--AISGSFSRINFECILFNVYNPNTV 115 Query: 2760 AGRRPLWDSLIRFGTSINLPWLVVGDFDIVLK-DRLRQTQVSSYEVWDFLNCCVDLGLTD 2936 R +W+ ++ F + LP L++GDF+ L+ D S+ +F N + L + Sbjct: 116 GARASVWEEIVTFHKTNPLPSLLIGDFNETLEPDDRGSLLFSNIGTDNFKNFLQVMELLE 175 Query: 2937 VNYSGSHYTRSNGYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFDTPN 3116 V+ S +T G + S +D+ + N +W+ R L G +S H P + T T N Sbjct: 176 VSPSNKGFTWFRGRSKSVLDRLLLNPEWINEFPSMRLSLLQRG-LSDHCPLL-TNIHTQN 233 Query: 3117 RPRTCFMFFNMW 3152 F F N W Sbjct: 234 WGPKPFRFQNCW 245 Score = 47.0 bits (110), Expect(3) = 5e-36 Identities = 25/92 (27%), Positives = 46/92 (50%) Frame = +1 Query: 3271 NRKDFAHISSRAEAARKELKQQQNLLHDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQK 3450 NR +F HI + + E+++ + ++ +D E K+ Q ++ + AQ Sbjct: 282 
NRDEFGHIDTNIKIMEDEIQKFDTISNERELDEQEIERRKEAQSDLWMWMKRKELYWAQN 341 Query: 3451 LKYDFLINGDKATKLFHSLIKRNAKKNFIASI 3546 + +L +GD+ TK FH + ++NFIASI Sbjct: 342 SRILWLKHGDRNTKFFHMVASNKKRRNFIASI 373 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 89.4 bits (220), Expect(3) = 2e-35 Identities = 55/182 (30%), Positives = 93/182 (51%), Gaps = 10/182 (5%) Frame = +3 Query: 3576 NEVREEYLKFYVGLVGTKQE-----THEFDKMVMPKQADFLVADF-----SKKEIKPTLF 3725 ++++ + +Y L+G E + E K ++P + D +A S++EI LF Sbjct: 441 DQIKGMLIAYYSHLLGIPSENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLF 500 Query: 3726 NIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQG 3905 ++ K+ GPD + F +AW IV + AI+EFF G L + + I+LIPK Sbjct: 501 SMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGA 560 Query: 3906 TSVGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQEILR 4085 + F P++CC YKVIT++++ R+ + Q F+KGR + EN LA E++ Sbjct: 561 DRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVD 620 Query: 4086 GY 4091 + Sbjct: 621 NF 622 Score = 75.9 bits (185), Expect(3) = 2e-35 Identities = 59/216 (27%), Positives = 101/216 (46%), Gaps = 11/216 (5%) Frame = +3 Query: 2556 IMAWKFKEWKVSHNFGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVS 2735 ++A W++ N+ GRI I+W P +SV V + T Q + S+ ++F V+ Sbjct: 90 VLASTLPGWRMDSNYCCSELGRIWIVWDPS-ISVLVFKRTDQIMFCSIKIPSLLQSFAVA 148 Query: 2736 FVYGLHSIAGRRPLWDSLIRFGTSINL---PWLVVGDFDIVLKD----RLRQTQVSSYEV 2894 FVYG +S RR LW+ ++ + L PWL++GDF+ + + Q+ ++ + Sbjct: 149 FVYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGM 208 Query: 2895 WDFLNCCVDLGLTDVNYSGSHYTRSN----GYTWSKIDKAMCNQQWLLNGLHAREEFLAP 3062 D C D L+D+ G +T SN K+D+A+ N +W A F P Sbjct: 209 EDLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPP 268 Query: 3063 GCISHHLPRVDTLFDTPNRPRTCFMFFNMWADHDRF 3170 G S H P + + + P + F +F+ + H + Sbjct: 269 G-DSDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSY 303 Score = 35.4 bits (80), Expect(3) = 2e-35 Identities = 27/107 (25%), Positives = 44/107 (41%) Frame = +1 Query: 3271 
NRKDFAHISSRAEAARKELKQQQNLLHDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQK 3450 NR F++I R + L+ Q L +P D L + +Q F E QK Sbjct: 339 NRLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRREHVARKQWIFFAAALE-SFFRQK 397 Query: 3451 LKYDFLINGDKATKLFHSLIKRNAKKNFIASITREDGSLTNSMKYVK 3591 + +L GD T+ FH + + N I + +DG ++ +K Sbjct: 398 SRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIK 444 Score = 66.6 bits (161), Expect = 9e-08 Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 1/91 (1%) Frame = +2 Query: 4082 EGVYLAQEILRGYKRKRTSPKYTLKIDIRRCMTPFHRSFWRRCSQ-LDFPTIFIE*IMAC 4258 E V LA E++ ++ + + L++DI + + F + LD P +FI I C Sbjct: 610 ENVLLASELVDNFEADGETTRGCLQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVC 669 Query: 4259 VSSPSYSLKVNGEIIGFFKGETGLR*GDPIS 4351 +SS SYS+ NGE+IGFF+G+ G+R GDP+S Sbjct: 670 ISSASYSIAFNGELIGFFQGKKGIRQGDPMS 700 >gb|EMJ25392.1| hypothetical protein PRUPE_ppa017155mg, partial [Prunus persica] Length = 916 Score = 103 bits (256), Expect(4) = 9e-35 Identities = 59/174 (33%), Positives = 97/174 (55%), Gaps = 3/174 (1%) Frame = +3 Query: 3594 YLKFYVGLVGTKQETHEFDKM---VMPKQADFLVADFSKKEIKPTLFNIGNEKSSGPDSY 3764 Y + +G+ T D + V + L+A+F+ +EIK LF + K+ GPD + Sbjct: 275 YFQHLFSSIGSSDYTEVVDGVRGRVTEEMNQALLAEFTPEEIKIALFQMHPSKAPGPDDF 334 Query: 3765 TSYLFKKAWNIV*DDLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTSVGDFTPISCCN 3944 + + ++K W IV +D+ A+ FF GKLLK+ + ++LIPK + ++ PIS CN Sbjct: 335 SPFFYQKYWQIVGEDMVAAVLHFFKTGKLLKKINFTHVALIPKVHEPKNMTQLRPISLCN 394 Query: 3945 IFYKVITKLLADRMGSISLALIDKAQSAFVKGRSMLENFHLAQEILRGYIWHKK 4106 +F K+ K+LA + +I LI QSAF R++ +N +A E+L ++ HKK Sbjct: 395 VFNKIGAKVLATHLKAILPTLISDTQSAFAPDRAISDNSIVAFELL--HMMHKK 446 Score = 43.5 bits (101), Expect(4) = 9e-35 Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 8/124 (6%) Frame = +3 Query: 2814 LPWLVVGDFDIVLK--DRLRQTQVSSYEVWDFLNCCVDLGLTDVNYSGSHYT--RSNGYT 2981 LPWL GDF+ +L+ ++L + ++ F G D+ Y+G YT R+N Sbjct: 7 LPWLCCGDFNEILRADEKLGGRRRREGQMLGFRQAIDTCGFKDMGYTGPKYTWWRNNPME 66 Query: 2982 WS-KIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFD---TPNRPRTCFMFFNM 3149 ++D+ + W 
L + L P S HLP T+ + R + F F M Sbjct: 67 IRIRLDRVLATADWCSRFLGTKVIHLNP-TKSDHLPLKVTISERMLLNGRRKKLFRFEEM 125 Query: 3150 WADH 3161 WA+H Sbjct: 126 WAEH 129 Score = 42.7 bits (99), Expect(4) = 9e-35 Identities = 25/70 (35%), Positives = 39/70 (55%), Gaps = 1/70 (1%) Frame = +2 Query: 4151 LKIDIRRCMTPFHRSFWRRCSQ-LDFPTIFIE*IMACVSSPSYSLKVNGEIIGFFKGETG 4327 LKID+ + SF + + F +I+ IM V++ SYS +NG +G+ + G Sbjct: 456 LKIDMSKAYDRVEWSFLEALMKGMGFAPRWIQLIMEYVTTVSYSFMLNGNPVGYVIPQRG 515 Query: 4328 LR*GDPISPF 4357 LR GDP+SP+ Sbjct: 516 LRQGDPLSPY 525 Score = 28.5 bits (62), Expect(4) = 9e-35 Identities = 24/129 (18%), Positives = 57/129 (44%), Gaps = 1/129 (0%) Frame = +1 Query: 3178 VKNGWNVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLLHDN 3357 +++GW + + ++ +F H+ ++ + R++L + LL Sbjct: 136 IQDGWQRTCRGSAPFTTTEKLKCTRHQLLGWSKCNFGHLPNQIKITREKLGE---LLDAP 192 Query: 3358 PMDHLLQESVKQLQQKALFHIEAEMKLC-AQKLKYDFLINGDKATKLFHSLIKRNAKKNF 3534 P H ++ ++ K L + A+ ++ Q+ + +L GD+ +K FH ++N Sbjct: 193 PSHHTVE--LRNALTKQLDSLMAKNEVYWRQRSRATWLKAGDRNSKFFHYKASSCRRRNT 250 Query: 3535 IASITREDG 3561 I+++ E G Sbjct: 251 ISALEDEHG 259 >emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1383 Score = 94.0 bits (232), Expect(3) = 5e-34 Identities = 52/152 (34%), Positives = 81/152 (53%) Frame = +3 Query: 3627 KQETHEFDKMVMPKQADFLVADFSKKEIKPTLFNIGNEKSSGPDSYTSYLFKKAWNIV*D 3806 K E EF K++ P Q + L FS EI + + K+ GPD + K AW ++ Sbjct: 407 KFENLEFKKLI-PSQTNMLCEPFSLDEIDAAVASCDGNKAPGPDGFNFNFIKSAWEVIKQ 465 Query: 3807 DLCDAIQEFFTFGKLLKQAHHCVISLIPKNEQGTSVGDFTPISCCNIFYKVITKLLADRM 3986 D+ D ++ F+ G L K + I+LIPK E S D+ PIS YK+++K+LA R+ Sbjct: 466 DVYDMVRRFWNTGYLPKGCNTAFIALIPKVESPMSFKDYRPISMVGCVYKIVSKILARRL 525 Query: 3987 GSISLALIDKAQSAFVKGRSMLENFHLAQEIL 4082 + L+ QS+F+ GR +L+ +A EI+ Sbjct: 526 QRVMDHLVGTLQSSFIGGRQILDGALVAGEII 557 Score = 63.5 bits (153), Expect(3) = 5e-34 Identities = 50/199 (25%), Positives = 85/199 (42%), Gaps = 1/199 (0%) Frame = +3 Query: 2577 EWKVSHNFGEHGAGRIVILWKPLLVSVHVQEATQQTIQTSVTCMISRKTFWVSFVYGLHS 2756 EW S + G G ++ +W+ + + I + + R + +Y + Sbjct: 57 EWLFSPSVGNSGG--LISIWEKSAFQMESSHIQRNWIAIQGSIVHPRFRCLLINIYNPCN 114 Query: 2757 IAGRRPLWDSLIRFGTSINLPWLVVGDFDIVLKDRLRQTQVSSYE-VWDFLNCCVDLGLT 2933 I GR +W+ + F P L++GDF+ VL R + +SS E V DF N LGL Sbjct: 115 IEGRAVVWNDISEFCRINIFPTLIMGDFNEVLSSSERGSGLSSQEGVEDFRNFIQSLGLI 174 Query: 2934 DVNYSGSHYTRSNGYTWSKIDKAMCNQQWLLNGLHAREEFLAPGCISHHLPRVDTLFDTP 3113 D++ + +T +G S++D+ + W+ + + L +S H P + T Sbjct: 175 DISSANGRFTWFHGNRKSRLDRCLVTSDWIQQYPNLSLQIL-NRTVSDHCPILAHSPATN 233 Query: 3114 NRPRTCFMFFNMWADHDRF 3170 P+ F F N W H F Sbjct: 234 WGPKP-FRFLNCWVSHPNF 251 Score = 38.1 bits (87), Expect(3) = 5e-34 Identities = 23/101 (22%), Positives = 48/101 (47%) Frame = +1 Query: 3271 NRKDFAHISSRAEAARKELKQQQNLLHDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQK 3450 N+ +F I ++ + ++ ++ +D + +S K +Q ++ AQ Sbjct: 282 NKSEFGAIDTKIKELEDLIQHFDDIANDRTLSDSELDSRKSVQMDLWSWLKKREAYWAQV 341 Query: 3451 LKYDFLINGDKATKLFHSLIKRNAKKNFIASITREDGSLTN 3573 + +L GD+ TK FH+L +KN I+SI ++ +L + Sbjct: 342 SRSKWLKEGDRNTKFFHTLASIRRQKNSISSILIDNTNLVD 382 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis 
thaliana] Length = 1164 Score = 120 bits (302), Expect(3) = 8e-34 Identities = 65/143 (45%), Positives = 86/143 (60%) Frame = +3 Query: 3663 PKQADFLVADFSKKEIKPTLFNIGNEKSSGPDSYTSYLFKKAWNIV*DDLCDAIQEFFTF 3842 P Q L FS ++IK F++ K+SGPD ++ F W I+ ++ +AI EFFT Sbjct: 334 PAQQVSLDTPFSSEQIKNAFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTS 393 Query: 3843 GKLLKQAHHCVISLIPKNEQGTSVGDFTPISCCNIFYKVITKLLADRMGSISLALIDKAQ 4022 GKLLKQ + + LIPK +S+ DF PISC N YKVI+KLL DR+ A I +Q Sbjct: 394 GKLLKQWNATNLVLIPKITNASSMSDFRPISCLNTVYKVISKLLTDRLKDFLPAAISHSQ 453 Query: 4023 SAFVKGRSMLENFHLAQEILRGY 4091 SAF+ GR LEN LA E++ GY Sbjct: 454 SAFMPGRLFLENVLLATELVHGY 476 Score = 44.7 bits (104), Expect(3) = 8e-34 Identities = 39/159 (24%), Positives = 64/159 (40%), Gaps = 10/159 (6%) Frame = +3 Query: 2730 VSFVYGLHSIAGRRPLWDSLIRFGTS---INLPWLVVGDFDIVLKDRLRQTQVS---SYE 2891 +SFVY R+ LW+ ++ F I+ PW V+GDF+ +L T Sbjct: 3 LSFVYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVDRP 62 Query: 2892 VWDFLNCCVDLGLTDVNYSGSHYT----RSNGYTWSKIDKAMCNQQWLLNGLHAREEFLA 3059 F + LTD+++ G+ +T RS K+D+ + N +W + F Sbjct: 63 TRIFRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSSLGLFGE 122 Query: 3060 PGCISHHLPRVDTLFDTPNRPRTCFMFFNMWADHDRFQS 3176 P H + + +P R + F F N + F S Sbjct: 123 PDFSDHSSCELSLMSASP-RSKKPFRFNNFLLKDENFLS 160 Score = 29.3 bits (64), Expect(3) = 8e-34 Identities = 28/142 (19%), Positives = 54/142 (38%), Gaps = 1/142 (0%) Frame = +1 Query: 3172 SLVKNGW-NVQILWACQYRLCRXXXXXXXXXXXXNRKDFAHISSRAEAARKELKQQQNLL 3348 SL+ W + + + YR+ +R +++ I R + A L Q++L Sbjct: 160 SLICLKWFSTSVTGSAMYRVSVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVL 219 Query: 3349 HDNPMDHLLQESVKQLQQKALFHIEAEMKLCAQKLKYDFLINGDKATKLFHSLIKRNAKK 3528 +P + Q+K EAE Q+ + ++L GD + FH + Sbjct: 220 LASPCPSNAAIEA-ETQRKWRILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSL 278 Query: 3529 NFIASITREDGSLTNSMKYVKN 3594 N I ++ G + ++N Sbjct: 279 NHIHFLSDPVGDRIEGQQNLEN 300 Score = 61.6 bits (148), Expect = 3e-06 Identities = 34/93 (36%), Positives = 51/93 (54%), Gaps = 1/93 (1%) Frame = +2 Query: 
4082 EGVYLAQEILRGYKRKRTSPKYTLKIDIRRCMTPFHRSFWRRCSQ-LDFPTIFIE*IMAC 4258 E V LA E++ GY +K +P LK+D+R+ F + L+ P F I+ C Sbjct: 464 ENVLLATELVHGYNKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILEC 523 Query: 4259 VSSPSYSLKVNGEIIGFFKGETGLR*GDPISPF 4357 +S+ S+S+ +NG G F GLR GDP+SP+ Sbjct: 524 LSTASFSVILNGHSAGHFWSSKGLRQGDPMSPY 556