BLASTX nr result
ID: Glycyrrhiza35_contig00021559
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza35_contig00021559 (955 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine... 279 1e-89 KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine... 279 4e-89 GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterran... 286 1e-87 GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterran... 290 3e-85 GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum] 288 7e-85 GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium ... 270 4e-83 GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterran... 281 3e-82 GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterran... 275 4e-82 GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterran... 256 2e-76 GAU42690.1 hypothetical protein TSUD_302000 [Trifolium subterran... 261 6e-76 GAU20609.1 hypothetical protein TSUD_33450 [Trifolium subterraneum] 256 2e-75 XP_006577560.1 PREDICTED: uncharacterized protein LOC102665607 [... 232 5e-71 KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] 242 6e-71 GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterran... 241 2e-70 GAU50433.1 hypothetical protein TSUD_134890, partial [Trifolium ... 226 2e-69 GAU51623.1 hypothetical protein TSUD_414500 [Trifolium subterran... 238 9e-69 GAU42390.1 hypothetical protein TSUD_296880 [Trifolium subterran... 236 3e-67 GAU49526.1 hypothetical protein TSUD_377390 [Trifolium subterran... 237 3e-67 GAU47952.1 hypothetical protein TSUD_06860 [Trifolium subterraneum] 230 3e-66 KYP36320.1 Transposon TX1 uncharacterized [Cajanus cajan] 233 2e-65 >KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine soja] Length = 326 Score = 279 bits (714), Expect = 1e-89 Identities = 132/247 (53%), Positives = 174/247 (70%), Gaps = 1/247 (0%) Frame = -3 Query: 752 WCVAGDFNSVSSSQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRAS 573 WC+ GDFN+VS+ +E+ G S N+ D+ FN F++EM+L+D P+ G KFT++ DG A+ Sbjct: 58 WCLVGDFNAVSNREERTGRSENWGYIDMVDFNAFVNEMNLIDPPLHGNKFTYFCSDGIAA 117 Query: 572 SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 393 SRLDRFL+S+ ++ WQV Q VG RDISDH PIWL+CS +WGPKPFRFNNCWL H F Sbjct: 118 SRLDRFLVSDGIMNLWQVKGQRVGKRDISDHCPIWLECSNLNWGPKPFRFNNCWLEHDGF 177 Query: 392 KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDI 213 KSF+ E WK Q+ G K Y +WNKEVFG++DLNIENIV DMN LD Sbjct: 178 KSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNKEVFGWLDLNIENIVADMNELDRG 237 Query: 212 VGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRR 36 + G + V +KE + FW Q+ KESL++QK+R +WI EGD+NT FFH+C++ RRR+ Sbjct: 238 IEEGCNLNVVVKKKEANALFWQQLMMKESLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRK 297 Query: 35 NQILALR 15 NQIL+L+ Sbjct: 298 NQILSLQ 304 >KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine soja] Length = 362 Score = 279 bits (714), Expect = 4e-89 Identities = 132/247 (53%), Positives = 174/247 (70%), Gaps = 1/247 (0%) Frame = -3 Query: 752 WCVAGDFNSVSSSQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRAS 573 WC+ GDFN+VS+ +E+ G S N+ D+ FN F++EM+L+D P+ G KFT++ DG A+ Sbjct: 58 WCLVGDFNAVSNREERTGRSENWGYIDMVDFNAFVNEMNLIDPPLHGNKFTYFCSDGIAA 117 Query: 572 SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 393 SRLDRFL+S+ ++ WQV Q VG RDISDH PIWL+CS +WGPKPFRFNNCWL H F Sbjct: 118 SRLDRFLVSDGIMNLWQVKGQRVGKRDISDHCPIWLECSNLNWGPKPFRFNNCWLEHDGF 177 Query: 392 KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDI 213 KSF+ E WK Q+ G K Y +WNKEVFG++DLNIENIV DMN LD Sbjct: 178 KSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNKEVFGWLDLNIENIVADMNELDRG 237 Query: 212 VGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRR 36 + G + V +KE + FW Q+ KESL++QK+R +WI EGD+NT FFH+C++ RRR+ Sbjct: 238 IEEGCNLNVVVKKKEANALFWQQLMMKESLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRK 297 Query: 35 NQILALR 15 NQIL+L+ Sbjct: 298 NQILSLQ 304 >GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterraneum] Length = 721 Score = 286 bits (732), Expect = 1e-87 Identities = 136/291 (46%), Positives = 189/291 (64%), Gaps = 1/291 (0%) Frame = -3 Query: 875 RKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGA 696 R R D +VNVYSPC GK + W++L+ K+ GG WCV GDFNS+ S E++G+ Sbjct: 196 RVDREGDELNIVNVYSPCIISGKKKLWEDLLALKQSTGGGKWCVRGDFNSILHSSERKGS 255 Query: 695 SNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVA 516 S R+ + S FN+F+ EM L+D PVLGKKF+W+S DG++ SR+DRFLLS+ + K+ + Sbjct: 256 SIVSRQNESSLFNRFVEEMELIDTPVLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGIT 315 Query: 515 AQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIY 336 +W+G+RDIS H PIWL CS Y+WGPKPFR N W+ H +F FVE WK F V G K Sbjct: 316 GKWIGDRDISYHCPIWLLCSSYNWGPKPFRVINGWMEHPDFFDFVETTWKSFDVHGKK-- 373 Query: 335 AFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSR 159 EV+GF+DLNIE V D+NV+++++G D + R+ L Sbjct: 374 -----------------GEVYGFLDLNIEKTVTDINVIENLLGGDDEEIDLTRRAGLNKD 416 Query: 158 FWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6 FW Q+ KESL++QK+R +W+ EGD+N+ FFH ++ RRRRNQ++AL+ D Sbjct: 417 FWKQLIHKESLLKQKSRMRWVKEGDSNSKFFHESIKSRRRRNQLVALKDGD 467 >GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterraneum] Length = 1794 Score = 290 bits (741), Expect = 3e-85 Identities = 137/315 (43%), Positives = 194/315 (61%), Gaps = 1/315 (0%) Frame = -3 Query: 953 MIIIWDTNVLEXXXXXXXXXXXXVCARKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCK 774 ++I+W+ + +C + + F++N+YSPC GK + W +L+ K Sbjct: 748 LLIMWNAGLFNLKFSFTGDNFLGLCVECKEG--ILFIINIYSPCSLSGKRKLWSDLLEFK 805 Query: 773 RILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWW 594 + WC+ GDFN V + E++G+S R+ + F QF+ M L DVPV GKKF+W+ Sbjct: 806 QNNEQGEWCLGGDFNVVLKTGERKGSSALCRQNERLEFCQFVEAMELCDVPVAGKKFSWF 865 Query: 593 SGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNC 414 S DG + SRLDRFLLSE+ I +V QW+G+RDISDH PIWL CS +WGPKPF+ NNC Sbjct: 866 SADGTSMSRLDRFLLSEKFIDSEKVTGQWIGSRDISDHCPIWLLCSNLNWGPKPFKVNNC 925 Query: 413 WLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQD 234 WL H EFK FVE+ W V G K + WN++VFG +DLNIENIV++ Sbjct: 926 WLEHPEFKPFVEKTWNKLNVEGKKAFVIKEKLKRLKEELRRWNRDVFGILDLNIENIVRE 985 Query: 233 MNVLDDIVG-SGDSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHAC 57 +N + ++ G + V + + +FW Q+H KESLI+QK+R KW+ EGD+N+ FFHA Sbjct: 986 LNEAEGLLAIDGANSVTCDVSAINKKFWDQLHFKESLIKQKSRLKWVREGDSNSRFFHAS 1045 Query: 56 VRGRRRRNQILALRK 12 ++ RRRRNQ+ LR+ Sbjct: 1046 IKSRRRRNQLSILRR 1060 >GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum] Length = 1594 Score = 288 bits (738), Expect = 7e-85 Identities = 133/282 (47%), Positives = 188/282 (66%), Gaps = 1/282 (0%) Frame = -3 Query: 848 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 669 +++NVYSPC GK + W +L+ K WC+ GDFN V + E++G++++ R+ + Sbjct: 704 YIINVYSPCSLSGKRKLWSDLLEFKLNNEQGEWCLRGDFNVVLNVGERKGSTSSARQNER 763 Query: 668 SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489 F QF+ M L+DVPV GKKF+W+S DG A SRLDRFLLS+ I K +VA QW+GN DI Sbjct: 764 LEFCQFVEAMELIDVPVAGKKFSWFSADGNAISRLDRFLLSDNFIEKEEVAGQWIGNHDI 823 Query: 488 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309 SDH PIWL CS +WGPKPF+ NNCWL H EFK FVE+ W+ + G K + Sbjct: 824 SDHCPIWLMCSNLNWGPKPFKVNNCWLEHSEFKLFVEKTWEKLNIRGKKAFVIKEKLKRL 883 Query: 308 XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGS-GDSRVGENRKELTSRFWHQIHAKE 132 WN+EVF +DLNIE V+++N ++ +VG+ G + V ++ + +FW Q++ KE Sbjct: 884 KEELRGWNREVFSILDLNIEKTVKELNEVEGLVGNDGVNSVMGDKSGVNRKFWEQLYFKE 943 Query: 131 SLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6 S+I+QK+R KW+ EGD+NT FF A ++ RRRRNQ++ LR+ D Sbjct: 944 SMIKQKSRLKWVREGDSNTRFFQASLKNRRRRNQLVLLRRGD 985 >GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium subterraneum] Length = 557 Score = 270 bits (690), Expect = 4e-83 Identities = 126/268 (47%), Positives = 175/268 (65%), Gaps = 1/268 (0%) Frame = -3 Query: 812 GKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDISRFNQFISEMHL 633 GK + W +LI + WC+ GDFNS++ + E+RG+SN + + F Q I M L Sbjct: 3 GKRKLWHDLIEFRMNNAPGEWCLGGDFNSITKTSERRGSSNWSGNTERTEFVQIIETMEL 62 Query: 632 VDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSV 453 +D+PVLGKKFTW + D A SRLDRFLLSE LI K + QWVG+RDISDH PIWL+C+ Sbjct: 63 IDIPVLGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGITNQWVGDRDISDHYPIWLECNN 122 Query: 452 YDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNKEVF 273 +W PKPF+FNNCWL H +F FV+ W+ + G K + +WN EVF Sbjct: 123 RNWCPKPFKFNNCWLEHPDFIPFVKASWESMDIHGRKAFILKEKLKRLKESLKKWNHEVF 182 Query: 272 GFMDLNIENIVQDMNVLDDIVGSGDSR-VGENRKELTSRFWHQIHAKESLIRQKARCKWI 96 G MDLNIE V+++N +++++ +G+S + N K+ + FW Q+ KES+++QK+R KWI Sbjct: 183 GIMDLNIEKTVKELNEIEEMIANGNSHPMYPNSKKQSKMFWEQLRFKESILKQKSRTKWI 242 Query: 95 AEGDANTCFFHACVRGRRRRNQILALRK 12 EGD+NT FFHA ++GR R N+I +RK Sbjct: 243 QEGDSNTSFFHATIKGRHRSNRIAKIRK 270 >GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterraneum] Length = 1636 Score = 281 bits (718), Expect = 3e-82 Identities = 134/280 (47%), Positives = 181/280 (64%), Gaps = 1/280 (0%) Frame = -3 Query: 848 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 669 ++VN+YSPC G GDFNS++ E+RG+ R+ Sbjct: 670 YIVNIYSPCTMAG-----------------------GDFNSITKIGERRGSHGGSVYRER 706 Query: 668 SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489 F+QFI M LVD+PVLGKKFTW++ D A SRLDRFLLSE I K ++ QWVGNRDI Sbjct: 707 IEFSQFIDAMELVDIPVLGKKFTWFNSDCSAMSRLDRFLLSEGFIEKGGISNQWVGNRDI 766 Query: 488 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309 SDH PIWL+ S +WGPKPF+FNNCWL H +F FV+ W+ + G K + Sbjct: 767 SDHCPIWLESSNINWGPKPFKFNNCWLEHSDFLPFVKATWEKMNIHGKKAFIIKEKLKRL 826 Query: 308 XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGE-NRKELTSRFWHQIHAKE 132 WN+EVFG MDLNIE V+D+N +++++ +GD+++ N KEL+ +FW Q+H KE Sbjct: 827 KEALKTWNQEVFGIMDLNIEKTVKDLNEIEELIANGDNQLDSVNSKELSKKFWEQLHFKE 886 Query: 131 SLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRK 12 S+++QK+R KWI EGD+NT FFHA ++GRRRRN+I+ L+K Sbjct: 887 SILQQKSRTKWIQEGDSNTRFFHASIKGRRRRNRIVKLKK 926 >GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterraneum] Length = 862 Score = 275 bits (702), Expect = 4e-82 Identities = 128/288 (44%), Positives = 186/288 (64%), Gaps = 1/288 (0%) Frame = -3 Query: 866 RSSDVFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNN 687 R V +VN+YSPC GK + W++L+ K++ G C+ GDFN++ S E++GAS + Sbjct: 409 REGAVTHLVNIYSPCSLSGKKKLWEDLLEIKQLFTGGECCLRGDFNAILHSSERKGASAD 468 Query: 686 YRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQW 507 R+ + FN+F+ EM ++DVPVLG K +W S DG++ SRLDRF+LS+ I K+ + QW Sbjct: 469 SRQGERMMFNRFVEEMEMIDVPVLGMKVSWVSADGKSMSRLDRFILSDGFITKFGIIGQW 528 Query: 506 VGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFX 327 +GNR+I DH PIWL S +WGPKPFR N L H +F F+E CWK F + G K Y Sbjct: 529 IGNRNIFDHCPIWLYASAKNWGPKPFRAINGCLEHPDFLVFLESCWKSFDIQGTKAYVLK 588 Query: 326 XXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWH 150 +WNKEVFG +DLNI+ V+++N ++ ++G D V R+E L S FW Sbjct: 589 EKLRFLKEILKKWNKEVFGILDLNIDKTVKELNDIEKMLGDDDPDVELTRREGLNSEFWS 648 Query: 149 QIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6 Q+H KE L++QK+R + + EGD+N+ FFH ++ RRR+NQ++ L+ D Sbjct: 649 QLHFKEILLQQKSRTRRVKEGDSNSKFFHESIKRRRRKNQLVVLKDGD 696 >GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterraneum] Length = 695 Score = 256 bits (654), Expect = 2e-76 Identities = 124/282 (43%), Positives = 172/282 (60%), Gaps = 1/282 (0%) Frame = -3 Query: 854 VFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNN-YRR 678 + ++VNVYS C+ GK + W +LI K E WC+ GDFNS+S E+RG+S+ +R+ Sbjct: 88 LLYIVNVYSSCNVSGKRKLWNDLIDFKLNNEPEEWCLGGDFNSISKVGERRGSSSGAWRQ 147 Query: 677 RDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGN 498 + F QFI + +VD+P+ K FTW++ DG A SRL+ FL+SE I K ++ QWVG+ Sbjct: 148 GERIEFIQFIDALEVVDIPLKDKMFTWFNSDGSAMSRLNHFLVSEGFIEKGSLSYQWVGD 207 Query: 497 RDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXX 318 RDISDH PIWL CS +WGPKPF FNNCWL H +F FV+E W+ + G K + Sbjct: 208 RDISDHCPIWLMCSNINWGPKPFTFNNCWLEHPKFFEFVKETWENMDIRGKKAFIIKEKL 267 Query: 317 XXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKELTSRFWHQIHA 138 WN+EVFGFM+L I+ V ++N FW Q++ Sbjct: 268 KGLKEALKVWNREVFGFMELKIDKTVNELN----------------------EFWEQLNF 305 Query: 137 KESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRK 12 KESL+ QK+R KW EGD+N+ +FHA ++ RRR+NQI+ L+K Sbjct: 306 KESLLHQKSRTKWAKEGDSNSRYFHASIKSRRRKNQIVTLKK 347 >GAU42690.1 hypothetical protein TSUD_302000 [Trifolium subterraneum] Length = 1092 Score = 261 bits (667), Expect = 6e-76 Identities = 125/263 (47%), Positives = 171/263 (65%), Gaps = 1/263 (0%) Frame = -3 Query: 854 VFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRR 675 V +VNVYSPC+ GK Q W++L+ K+ + LWCV GDFN++ S E++G+S + R+ Sbjct: 237 VLHIVNVYSPCNISGKKQLWEDLLELKQRVAEGLWCVGGDFNAILHSFERQGSSTDSRKS 296 Query: 674 DISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNR 495 + FN F+ EM L+D+PVLGKKF+W+S DG++ SR+DRFLLS+ + K+ + QW+G+R Sbjct: 297 ERVLFNSFVEEMELIDIPVLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGITGQWIGDR 356 Query: 494 DISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXX 315 DISDH PIWL S WGPKPFR N WL H +F +FVE WK F V G K Y Sbjct: 357 DISDHCPIWLLFSSNIWGPKPFRVINGWLDHPDFLTFVETTWKSFAVHGKKAYILKEKFK 416 Query: 314 XXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWHQIHA 138 +WNKEV+GF+DLNIE V D+N +++++G D R+E L F Q H Sbjct: 417 LLKDSLRKWNKEVYGFLDLNIEKTVNDINDIENLLGGDDMEAELIRREGLNKDFGRQHHF 476 Query: 137 KESLIRQKARCKWIAEGDANTCF 69 KESL++QK+R +W+ E D T F Sbjct: 477 KESLLKQKSRMRWVKE-DVQTAF 498 >GAU20609.1 hypothetical protein TSUD_33450 [Trifolium subterraneum] Length = 798 Score = 256 bits (653), Expect = 2e-75 Identities = 112/220 (50%), Positives = 154/220 (70%), Gaps = 1/220 (0%) Frame = -3 Query: 668 SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489 + F QFI M L+D+PVLGKKFTW + D SRLDRFLLSE +I K + QWVG+RDI Sbjct: 11 TEFVQFIDAMELIDIPVLGKKFTWSNSDNSVMSRLDRFLLSEGIIEKGGITNQWVGDRDI 70 Query: 488 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309 SDH PIWL+C+ +WGPKPF+FNNCWL HK+F V+ W+ + G K + Sbjct: 71 SDHHPIWLECNNLNWGPKPFKFNNCWLEHKDFIPVVKATWESLNINGRKAHVLKEKMKRL 130 Query: 308 XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSR-VGENRKELTSRFWHQIHAKE 132 WNKEVFG +DLNI+ V+D+N +++++ +GD+ + N KE+ +FW Q+H KE Sbjct: 131 KEELKVWNKEVFGILDLNIDKTVKDLNEVEELIANGDNHPLHLNSKEIAKKFWEQLHFKE 190 Query: 131 SLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRK 12 S+++QK+R KWI +GD+NTC+FHA ++GRRRRN IL ++K Sbjct: 191 SILKQKSRSKWIKKGDSNTCYFHATIKGRRRRNHILKIKK 230 >XP_006577560.1 PREDICTED: uncharacterized protein LOC102665607 [Glycine max] Length = 326 Score = 232 bits (591), Expect = 5e-71 Identities = 111/210 (52%), Positives = 145/210 (69%), Gaps = 1/210 (0%) Frame = -3 Query: 641 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 462 M+L+D P+ G KFT++ DG A+SRLDRFL+S+ ++ WQ Q VG RDI DH PIWL+ Sbjct: 1 MNLIDPPLHGNKFTYFCSDGIAASRLDRFLVSDGIMNLWQEKGQRVGKRDIYDHCPIWLE 60 Query: 461 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNK 282 CS +WGPKPFRFNNCWL H +FKSF+ E WK Q+ G K Y +WNK Sbjct: 61 CSNLNWGPKPFRFNNCWLEHDDFKSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNK 120 Query: 281 EVFGFMDLNIENIVQDMNVLDDIVGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARC 105 EVFG++DLNIENIV +MN LD + G + +KE + FW Q+ KESL++QK+R Sbjct: 121 EVFGWLDLNIENIVAEMNKLDRGIEEGCNLNEVVKKKEAKALFWQQLMMKESLLKQKSRL 180 Query: 104 KWIAEGDANTCFFHACVRGRRRRNQILALR 15 +WI EGD NT FFH+C++ RRR+NQIL+L+ Sbjct: 181 RWIKEGDYNTKFFHSCLQDRRRKNQILSLQ 210 >KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] Length = 729 Score = 242 bits (618), Expect = 6e-71 Identities = 116/279 (41%), Positives = 164/279 (58%), Gaps = 2/279 (0%) Frame = -3 Query: 848 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 669 F+VN+YSPCD GK W+E+ K G WC+ GDFN+V E++G +++ Sbjct: 25 FIVNIYSPCDLRGKKNLWEEIHKIKNSYGSGRWCICGDFNTVRLKSERKGVHTRREEKEM 84 Query: 668 SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489 +NQFI ++ L+D+P+ G K+TW+ + +SR+DRFL+S+E + +W +Q RD+ Sbjct: 85 LCYNQFIEDVELIDLPLGGGKYTWFRPNRIIASRIDRFLVSQEWLTQWPHCSQKALQRDV 144 Query: 488 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309 SDH PI L DWGPKPFR NCW F FVEE WKGF V GW + Sbjct: 145 SDHRPILLKDIRLDWGPKPFRSLNCWFDDPSFLGFVEEKWKGFSVTGWGAFILKEKLKHL 204 Query: 308 XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIV--GSGDSRVGENRKELTSRFWHQIHAK 135 EWNK+ FG + IE + +++N LD IV S + R +R+ L + W ++ K Sbjct: 205 KKSIKEWNKQAFGNIHTQIEEVKRNINSLDSIVETRSLNERKVSDRRNLNVKLWDLLNKK 264 Query: 134 ESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILAL 18 ESL+ QK+R KW EGD+N+ FFH CV RR+ N+I+ L Sbjct: 265 ESLLLQKSRLKWAREGDSNSSFFHMCVNKRRKMNEIIGL 303 >GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterraneum] Length = 767 Score = 241 bits (616), Expect = 2e-70 Identities = 119/280 (42%), Positives = 163/280 (58%), Gaps = 1/280 (0%) Frame = -3 Query: 848 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 669 + VN+Y C GK + W++LI K + WC+ GDFNS++ ++ G+SN ++ Sbjct: 76 YFVNIYFACSLAGKRKLWRDLIDFKLLNTPGEWCLGGDFNSITKVSKRSGSSNGSSNKER 135 Query: 668 SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489 + F QFI M LVD+PV GKKFTW + D A SRLDRFLLSE LI K ++ QWVG RDI Sbjct: 136 TEFAQFIDAMELVDIPVFGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGISNQWVGGRDI 195 Query: 488 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309 SDH PIWL+CS +WGPKPF+FNN WL H +F FV+ W+ + G K + Sbjct: 196 SDHHPIWLECSNINWGPKPFKFNNFWLDHPDFIPFVKATWESMNIHGKKAFILKEKLKRL 255 Query: 308 XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSR-VGENRKELTSRFWHQIHAKE 132 WN+EVFG MDL+IE V+D+N +++++ +GD + N K+L+ +FW Q+H KE Sbjct: 256 KEVLKTWNREVFGIMDLDIEKTVKDLNEVEEMIANGDCHPLFSNAKDLSKKFWEQLHNKE 315 Query: 131 SLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRK 12 S RRR N+I+ LRK Sbjct: 316 S---------------------------RRRSNRIVKLRK 328 >GAU50433.1 hypothetical protein TSUD_134890, partial [Trifolium subterraneum] Length = 286 Score = 226 bits (577), Expect = 2e-69 Identities = 104/213 (48%), Positives = 141/213 (66%), Gaps = 1/213 (0%) Frame = -3 Query: 641 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 462 M +D+PVLGKKF+W+S DG+A SRLDRFLLS+ + K V QW+G+RDISDH P+WL Sbjct: 1 MEFIDIPVLGKKFSWFSPDGKAMSRLDRFLLSDGFLTKNGVTGQWIGDRDISDHCPVWLL 60 Query: 461 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNK 282 S +WGPKPFR N W+ H EF FVE WK + V G K + +WNK Sbjct: 61 SSFCNWGPKPFRVINGWINHPEFNDFVESAWKSYDVRGKKAFVLKEKLKLLRESLKKWNK 120 Query: 281 EVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWHQIHAKESLIRQKARC 105 EVFG++DLNIE IV D+N + ++ S D + L FW Q+H K+SL+++K+R Sbjct: 121 EVFGYLDLNIEKIVTDINKFEGLLSSTDGDADYLMLDGLNKEFWKQLHFKDSLLKRKSRS 180 Query: 104 KWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6 KW+ +GD+N+ +FH ++GRRRRNQ++ALR D Sbjct: 181 KWVKDGDSNSKYFHQSLKGRRRRNQLVALRDGD 213 >GAU51623.1 hypothetical protein TSUD_414500 [Trifolium subterraneum] Length = 838 Score = 238 bits (608), Expect = 9e-69 Identities = 107/213 (50%), Positives = 145/213 (68%), Gaps = 1/213 (0%) Frame = -3 Query: 641 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 462 M L+D+PV+GKKF+W+S DG+A SRLDRFLLS+ I K ++ QW+GNRDISDH P+WL Sbjct: 1 MELIDIPVIGKKFSWFSADGKAMSRLDRFLLSDNFIAKEEILGQWIGNRDISDHCPVWLI 60 Query: 461 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNK 282 CS +WGPKPF+FNNCWL H E FV W V G K + WN+ Sbjct: 61 CSNLNWGPKPFKFNNCWLKHPELSLFVTRIWVKMNVTGKKAFVIKEKLKRLKEELRGWNR 120 Query: 281 EVFGFMDLNIENIVQDMNVLDDIVG-SGDSRVGENRKELTSRFWHQIHAKESLIRQKARC 105 EVFG +DLNIEN V+++N L+ + G + + ++ + +FW Q++ KESLIRQK+R Sbjct: 121 EVFGILDLNIENTVKELNELEGLAAIDGTNSMLVDKGGINKKFWDQLNFKESLIRQKSRA 180 Query: 104 KWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6 W++EGD+NT FFHA ++ RRRRNQ++ LR+ D Sbjct: 181 NWVSEGDSNTRFFHASLKSRRRRNQMIMLRRGD 213 >GAU42390.1 hypothetical protein TSUD_296880 [Trifolium subterraneum] Length = 938 Score = 236 bits (601), Expect = 3e-67 Identities = 109/232 (46%), Positives = 155/232 (66%), Gaps = 1/232 (0%) Frame = -3 Query: 716 SQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEEL 537 S E+RG+S + + S F+ FI M ++D+PVLGKKFTW++ +G RL RFLLSE Sbjct: 344 SGERRGSSGSGCLSERSEFSLFIEAMEVIDIPVLGKKFTWFNSNGSTMRRLYRFLLSEGF 403 Query: 536 ILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQ 357 I K ++ QW+ + DISDH PIWL+CS+ +WG KP +FNNCW+ H EF V+ + Sbjct: 404 IHKGGISNQWISDHDISDHCPIWLECSILNWGHKPVKFNNCWVDHPEFLDLVKNIFAQSN 463 Query: 356 VGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGE-N 180 V G K + +WN++VFGF DL I+ V+++N ++D++ +GD + N Sbjct: 464 VRGTKTFVISEKMKRLKEALKKWNRDVFGFKDLCIDKTVRELNEVEDLIANGDVDPADLN 523 Query: 179 RKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQIL 24 KEL +FW QIH+KESL+RQK+R KWI EGD+N+ FFH+ ++GRRRRNQI+ Sbjct: 524 SKELVRKFWEQIHSKESLLRQKSRTKWIQEGDSNSRFFHSSIKGRRRRNQIV 575 >GAU49526.1 hypothetical protein TSUD_377390 [Trifolium subterraneum] Length = 1149 Score = 237 bits (605), Expect = 3e-67 Identities = 112/239 (46%), Positives = 159/239 (66%), Gaps = 1/239 (0%) Frame = -3 Query: 716 SQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEEL 537 S E+RG + + S F+ FI M ++D+P+LGKKFTW++ DG SRLDRFLLSE Sbjct: 325 SGERRGCRGSVCLSERSEFSLFIEAMEVIDIPILGKKFTWFNSDGSTMSRLDRFLLSEGF 384 Query: 536 ILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQ 357 I K ++ QW+G+RDISD+ PIWL+CS +WGPKPF+FNNCW+ H EF V+ W Sbjct: 385 IHKGGISNQWIGDRDISDYFPIWLECSNLNWGPKPFKFNNCWVDHPEFLDLVKNIWVQSN 444 Query: 356 VGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGD-SRVGEN 180 + K +WN++VFGF DL I+ ++++N ++D++ +GD V N Sbjct: 445 MKRLK------------EALKKWNRDVFGFKDLCIDKTLRELNEVEDLIANGDVDPVDLN 492 Query: 179 RKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRKYDA 3 KEL +FW QIH+KESL+R+K+R KWI EGD+N+ FF + ++GR RRNQI+ L+K +A Sbjct: 493 SKELVRKFWEQIHSKESLLRKKSRTKWIQEGDSNSHFFRSSIKGRHRRNQIVMLKKGEA 551 >GAU47952.1 hypothetical protein TSUD_06860 [Trifolium subterraneum] Length = 724 Score = 230 bits (586), Expect = 3e-66 Identities = 104/213 (48%), Positives = 146/213 (68%), Gaps = 1/213 (0%) Frame = -3 Query: 641 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 462 M L+DVPVLGKKF+W+S +G++ SR+DRFLLS+ + K+ + QW+G+RDISDH PIWL Sbjct: 1 MTLLDVPVLGKKFSWFSANGKSMSRIDRFLLSDGFVSKYGITGQWIGDRDISDHCPIWLL 60 Query: 461 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNK 282 S Y WGPKPFR N WL H +F FVE WK F V G K Y +WNK Sbjct: 61 VSSYKWGPKPFRVINGWLDHPDFLPFVESAWKSFVVHGKKAYVLKEKFRLLKERLRKWNK 120 Query: 281 EVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWHQIHAKESLIRQKARC 105 EV+G++DLNIE V ++N +++++G D V R++ L FW Q++ KESL++QK+R Sbjct: 121 EVYGYLDLNIEKTVNEINDIENMLGDDDMEVELTRRQGLNKEFWSQLYHKESLLKQKSRT 180 Query: 104 KWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6 +W+ EGD+N+ +FH ++ RRRRNQ++AL+ D Sbjct: 181 RWVKEGDSNSRYFHESIKSRRRRNQLVALKDGD 213 >KYP36320.1 Transposon TX1 uncharacterized [Cajanus cajan] Length = 1401 Score = 233 bits (593), Expect = 2e-65 Identities = 116/282 (41%), Positives = 162/282 (57%), Gaps = 5/282 (1%) Frame = -3 Query: 845 MVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDIS 666 +VNVYS C K + W ++I+ KR G LWC+ GDFN+V +E++G ++ RD+ Sbjct: 694 VVNVYSSCHLVDKRRLWGDIIMSKRGFGSCLWCIVGDFNTVRRLEERKGGFGDHGARDME 753 Query: 665 RFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDIS 486 FN FI+EM L+DVP++GK+FTW+ DG SRLDR L+SE W V RD+S Sbjct: 754 EFNSFITEMELIDVPLVGKRFTWFRSDGSIMSRLDRVLVSESWSAHWGAGFVKVIPRDVS 813 Query: 485 DHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXX 306 DH P+ L+ V +WGPKPFRFNNCWL H + V W+ G W Sbjct: 814 DHCPLILNHKVLNWGPKPFRFNNCWLSHCGIEGVVRSAWEKQVQGPWAAQRLRSKLLNVK 873 Query: 305 XXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRV-----GENRKELTSRFWHQIH 141 +WN EVFG +D I+++ ++ LD + +V +KEL + W Sbjct: 874 NALKKWNIEVFGNVDTMIKSLTNELKELD---AKNEEQVLIQSERNRQKELVAGIWSARR 930 Query: 140 AKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALR 15 K +L+ QKAR +W GD N+ +FHAC+RGR+RRNQI+AL+ Sbjct: 931 NKLTLLAQKARIRWGKYGDQNSKYFHACIRGRQRRNQIVALK 972