BLASTX nr result
ID: Glycyrrhiza36_contig00033625
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza36_contig00033625 (955 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine... 282 1e-90 KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine... 282 4e-90 GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterran... 286 2e-87 GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterran... 294 7e-87 GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum] 292 4e-86 GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterran... 285 8e-84 GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium ... 271 2e-83 GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterran... 277 4e-83 GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterran... 260 5e-78 GAU42690.1 hypothetical protein TSUD_302000 [Trifolium subterran... 262 2e-76 GAU20609.1 hypothetical protein TSUD_33450 [Trifolium subterraneum] 251 9e-74 XP_006577560.1 PREDICTED: uncharacterized protein LOC102665607 [... 234 6e-72 KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] 244 2e-71 GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterran... 242 1e-70 GAU50433.1 hypothetical protein TSUD_134890, partial [Trifolium ... 229 2e-70 GAU51623.1 hypothetical protein TSUD_414500 [Trifolium subterran... 242 5e-70 GAU42390.1 hypothetical protein TSUD_296880 [Trifolium subterran... 240 6e-69 GAU47952.1 hypothetical protein TSUD_06860 [Trifolium subterraneum] 233 1e-67 GAU49526.1 hypothetical protein TSUD_377390 [Trifolium subterran... 238 2e-67 KYP36320.1 Transposon TX1 uncharacterized [Cajanus cajan] 237 7e-67 >KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine soja] Length = 326 Score = 282 bits (721), Expect = 1e-90 Identities = 132/247 (53%), Positives = 173/247 (70%), Gaps = 1/247 (0%) Frame = +3 Query: 204 WCVAGDFNSVSRSQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRAS 383 WC+ GDFN+VS +E+ G S N+ D+ FN F++EM+L+D P+ G KFT++ DG A+ Sbjct: 58 WCLVGDFNAVSNREERTGRSENWGYIDMVDFNAFVNEMNLIDPPLHGNKFTYFCSDGIAA 117 Query: 384 SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 563 SRLDRFL+S+ ++ WQV Q VG RDISDH PIWL+CS +WGPKPFRFNNCWL H F Sbjct: 118 SRLDRFLVSDGIMNLWQVKGQRVGKRDISDHCPIWLECSNLNWGPKPFRFNNCWLEHDGF 177 Query: 564 KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNALDDI 743 KSF+ E WK Q+ G K Y WNKEVFG++DLNIENIV DMN LD Sbjct: 178 KSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNKEVFGWLDLNIENIVADMNELDRG 237 Query: 744 VGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRR 920 + G + V +KE + FW Q+ KESL++QK+R +WI EGD+NT+FFH+C++ RRR+ Sbjct: 238 IEEGCNLNVVVKKKEANALFWQQLMMKESLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRK 297 Query: 921 NQILALR 941 NQIL+L+ Sbjct: 298 NQILSLQ 304 >KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine soja] Length = 362 Score = 282 bits (721), Expect = 4e-90 Identities = 132/247 (53%), Positives = 173/247 (70%), Gaps = 1/247 (0%) Frame = +3 Query: 204 WCVAGDFNSVSRSQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRAS 383 WC+ GDFN+VS +E+ G S N+ D+ FN F++EM+L+D P+ G KFT++ DG A+ Sbjct: 58 WCLVGDFNAVSNREERTGRSENWGYIDMVDFNAFVNEMNLIDPPLHGNKFTYFCSDGIAA 117 Query: 384 SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 563 SRLDRFL+S+ ++ WQV Q VG RDISDH PIWL+CS +WGPKPFRFNNCWL H F Sbjct: 118 SRLDRFLVSDGIMNLWQVKGQRVGKRDISDHCPIWLECSNLNWGPKPFRFNNCWLEHDGF 177 Query: 564 KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNALDDI 743 KSF+ E WK Q+ G K Y WNKEVFG++DLNIENIV DMN LD Sbjct: 178 KSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNKEVFGWLDLNIENIVADMNELDRG 237 Query: 744 VGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRR 920 + G + V +KE + FW Q+ KESL++QK+R +WI EGD+NT+FFH+C++ RRR+ Sbjct: 238 IEEGCNLNVVVKKKEANALFWQQLMMKESLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRK 297 Query: 921 NQILALR 941 NQIL+L+ Sbjct: 298 NQILSLQ 304 >GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterraneum] Length = 721 Score = 286 bits (731), Expect = 2e-87 Identities = 134/291 (46%), Positives = 189/291 (64%), Gaps = 1/291 (0%) Frame = +3 Query: 81 RKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGA 260 R R D +VNVYSPC GK + W++L+ K+ GG WCV GDFNS+ S E++G+ Sbjct: 196 RVDREGDELNIVNVYSPCIISGKKKLWEDLLALKQSTGGGKWCVRGDFNSILHSSERKGS 255 Query: 261 SNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVA 440 S R+ + + FN+F+ EM L+D PVLGKKF+W+S DG++ SR+DRFLLS+ + K+ + Sbjct: 256 SIVSRQNESSLFNRFVEEMELIDTPVLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGIT 315 Query: 441 AQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIY 620 +W+G+RDIS H PIWL CS Y+WGPKPFR N W+ H +F FVE WK F V G K Sbjct: 316 GKWIGDRDISYHCPIWLLCSSYNWGPKPFRVINGWMEHPDFFDFVETTWKSFDVHGKK-- 373 Query: 621 AFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSRVGENRKE-LTSR 797 EV+GF+DLNIE V D+N +++++G D + R+ L Sbjct: 374 -----------------GEVYGFLDLNIEKTVTDINVIENLLGGDDEEIDLTRRAGLNKD 416 Query: 798 FWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKYD 950 FW Q+ KESL++QK+R +W+ EGD+N++FFH ++ RRRRNQ++AL+ D Sbjct: 417 FWKQLIHKESLLKQKSRMRWVKEGDSNSKFFHESIKSRRRRNQLVALKDGD 467 >GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterraneum] Length = 1794 Score = 294 bits (753), Expect = 7e-87 Identities = 138/315 (43%), Positives = 195/315 (61%), Gaps = 1/315 (0%) Frame = +3 Query: 3 MIIIWDTNVLEXXXXXXXXXXXXXCARKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCK 182 ++I+W+ + C + + F++N+YSPC GK + W +L+ K Sbjct: 748 LLIMWNAGLFNLKFSFTGDNFLGLCVECKEG--ILFIINIYSPCSLSGKRKLWSDLLEFK 805 Query: 183 RILGGELWCVAGDFNSVSRSQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWW 362 + WC+ GDFN V ++ E++G+S R+ + F QF+ M L DVPV GKKF+W+ Sbjct: 806 QNNEQGEWCLGGDFNVVLKTGERKGSSALCRQNERLEFCQFVEAMELCDVPVAGKKFSWF 865 Query: 363 SGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNC 542 S DG + SRLDRFLLSE+ I +V QW+G+RDISDH PIWL CS +WGPKPF+ NNC Sbjct: 866 SADGTSMSRLDRFLLSEKFIDSEKVTGQWIGSRDISDHCPIWLLCSNLNWGPKPFKVNNC 925 Query: 543 WLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQD 722 WL H EFK FVE+ W V G K + WN++VFG +DLNIENIV++ Sbjct: 926 WLEHPEFKPFVEKTWNKLNVEGKKAFVIKEKLKRLKEELRRWNRDVFGILDLNIENIVRE 985 Query: 723 MNALDDIVG-SGDSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHAC 899 +N + ++ G + V + + +FW Q+H KESLI+QK+R KW+ EGD+N+RFFHA Sbjct: 986 LNEAEGLLAIDGANSVTCDVSAINKKFWDQLHFKESLIKQKSRLKWVREGDSNSRFFHAS 1045 Query: 900 VRGRRRRNQILALRK 944 ++ RRRRNQ+ LR+ Sbjct: 1046 IKSRRRRNQLSILRR 1060 >GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum] Length = 1594 Score = 292 bits (747), Expect = 4e-86 Identities = 134/282 (47%), Positives = 188/282 (66%), Gaps = 1/282 (0%) Frame = +3 Query: 108 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNNYRRRDI 287 +++NVYSPC GK + W +L+ K WC+ GDFN V E++G++++ R+ + Sbjct: 704 YIINVYSPCSLSGKRKLWSDLLEFKLNNEQGEWCLRGDFNVVLNVGERKGSTSSARQNER 763 Query: 288 ARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 467 F QF+ M L+DVPV GKKF+W+S DG A SRLDRFLLS+ I K +VA QW+GN DI Sbjct: 764 LEFCQFVEAMELIDVPVAGKKFSWFSADGNAISRLDRFLLSDNFIEKEEVAGQWIGNHDI 823 Query: 468 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 647 SDH PIWL CS +WGPKPF+ NNCWL H EFK FVE+ W+ + G K + Sbjct: 824 SDHCPIWLMCSNLNWGPKPFKVNNCWLEHSEFKLFVEKTWEKLNIRGKKAFVIKEKLKRL 883 Query: 648 XXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGS-GDSRVGENRKELTSRFWHQIHAKE 824 WN+EVF +DLNIE V+++N ++ +VG+ G + V ++ + +FW Q++ KE Sbjct: 884 KEELRGWNREVFSILDLNIEKTVKELNEVEGLVGNDGVNSVMGDKSGVNRKFWEQLYFKE 943 Query: 825 SLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKYD 950 S+I+QK+R KW+ EGD+NTRFF A ++ RRRRNQ++ LR+ D Sbjct: 944 SMIKQKSRLKWVREGDSNTRFFQASLKNRRRRNQLVLLRRGD 985 >GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterraneum] Length = 1636 Score = 285 bits (730), Expect = 8e-84 Identities = 135/280 (48%), Positives = 183/280 (65%), Gaps = 1/280 (0%) Frame = +3 Query: 108 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNNYRRRDI 287 ++VN+YSPC G GDFNS+++ E+RG+ R+ Sbjct: 670 YIVNIYSPCTMAG-----------------------GDFNSITKIGERRGSHGGSVYRER 706 Query: 288 ARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 467 F+QFI M LVD+PVLGKKFTW++ D A SRLDRFLLSE I K ++ QWVGNRDI Sbjct: 707 IEFSQFIDAMELVDIPVLGKKFTWFNSDCSAMSRLDRFLLSEGFIEKGGISNQWVGNRDI 766 Query: 468 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 647 SDH PIWL+ S +WGPKPF+FNNCWL H +F FV+ W+ + G K + Sbjct: 767 SDHCPIWLESSNINWGPKPFKFNNCWLEHSDFLPFVKATWEKMNIHGKKAFIIKEKLKRL 826 Query: 648 XXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSRVGE-NRKELTSRFWHQIHAKE 824 WN+EVFG MDLNIE V+D+N +++++ +GD+++ N KEL+ +FW Q+H KE Sbjct: 827 KEALKTWNQEVFGIMDLNIEKTVKDLNEIEELIANGDNQLDSVNSKELSKKFWEQLHFKE 886 Query: 825 SLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRK 944 S+++QK+R KWI EGD+NTRFFHA ++GRRRRN+I+ L+K Sbjct: 887 SILQQKSRTKWIQEGDSNTRFFHASIKGRRRRNRIVKLKK 926 >GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium subterraneum] Length = 557 Score = 271 bits (692), Expect = 2e-83 Identities = 126/268 (47%), Positives = 174/268 (64%), Gaps = 1/268 (0%) Frame = +3 Query: 144 GKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNNYRRRDIARFNQFISEMHL 323 GK + W +LI + WC+ GDFNS++++ E+RG+SN + F Q I M L Sbjct: 3 GKRKLWHDLIEFRMNNAPGEWCLGGDFNSITKTSERRGSSNWSGNTERTEFVQIIETMEL 62 Query: 324 VDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSV 503 +D+PVLGKKFTW + D A SRLDRFLLSE LI K + QWVG+RDISDH PIWL+C+ Sbjct: 63 IDIPVLGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGITNQWVGDRDISDHYPIWLECNN 122 Query: 504 YDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVF 683 +W PKPF+FNNCWL H +F FV+ W+ + G K + WN EVF Sbjct: 123 RNWCPKPFKFNNCWLEHPDFIPFVKASWESMDIHGRKAFILKEKLKRLKESLKKWNHEVF 182 Query: 684 GFMDLNIENIVQDMNALDDIVGSGDSR-VGENRKELTSRFWHQIHAKESLIRQKARCKWI 860 G MDLNIE V+++N +++++ +G+S + N K+ + FW Q+ KES+++QK+R KWI Sbjct: 183 GIMDLNIEKTVKELNEIEEMIANGNSHPMYPNSKKQSKMFWEQLRFKESILKQKSRTKWI 242 Query: 861 AEGDANTRFFHACVRGRRRRNQILALRK 944 EGD+NT FFHA ++GR R N+I +RK Sbjct: 243 QEGDSNTSFFHATIKGRHRSNRIAKIRK 270 >GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterraneum] Length = 862 Score = 277 bits (709), Expect = 4e-83 Identities = 128/288 (44%), Positives = 186/288 (64%), Gaps = 1/288 (0%) Frame = +3 Query: 90 RSSDVFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNN 269 R V +VN+YSPC GK + W++L+ K++ G C+ GDFN++ S E++GAS + Sbjct: 409 REGAVTHLVNIYSPCSLSGKKKLWEDLLEIKQLFTGGECCLRGDFNAILHSSERKGASAD 468 Query: 270 YRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQW 449 R+ + FN+F+ EM ++DVPVLG K +W S DG++ SRLDRF+LS+ I K+ + QW Sbjct: 469 SRQGERMMFNRFVEEMEMIDVPVLGMKVSWVSADGKSMSRLDRFILSDGFITKFGIIGQW 528 Query: 450 VGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFX 629 +GNR+I DH PIWL S +WGPKPFR N L H +F F+E CWK F + G K Y Sbjct: 529 IGNRNIFDHCPIWLYASAKNWGPKPFRAINGCLEHPDFLVFLESCWKSFDIQGTKAYVLK 588 Query: 630 XXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSRVGENRKE-LTSRFWH 806 WNKEVFG +DLNI+ V+++N ++ ++G D V R+E L S FW Sbjct: 589 EKLRFLKEILKKWNKEVFGILDLNIDKTVKELNDIEKMLGDDDPDVELTRREGLNSEFWS 648 Query: 807 QIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKYD 950 Q+H KE L++QK+R + + EGD+N++FFH ++ RRR+NQ++ L+ D Sbjct: 649 QLHFKEILLQQKSRTRRVKEGDSNSKFFHESIKRRRRKNQLVVLKDGD 696 >GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterraneum] Length = 695 Score = 260 bits (665), Expect = 5e-78 Identities = 125/282 (44%), Positives = 174/282 (61%), Gaps = 1/282 (0%) Frame = +3 Query: 102 VFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNN-YRR 278 + ++VNVYS C+ GK + W +LI K E WC+ GDFNS+S+ E+RG+S+ +R+ Sbjct: 88 LLYIVNVYSSCNVSGKRKLWNDLIDFKLNNEPEEWCLGGDFNSISKVGERRGSSSGAWRQ 147 Query: 279 RDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGN 458 + F QFI + +VD+P+ K FTW++ DG A SRL+ FL+SE I K ++ QWVG+ Sbjct: 148 GERIEFIQFIDALEVVDIPLKDKMFTWFNSDGSAMSRLNHFLVSEGFIEKGSLSYQWVGD 207 Query: 459 RDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXX 638 RDISDH PIWL CS +WGPKPF FNNCWL H +F FV+E W+ + G K + Sbjct: 208 RDISDHCPIWLMCSNINWGPKPFTFNNCWLEHPKFFEFVKETWENMDIRGKKAFIIKEKL 267 Query: 639 XXXXXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSRVGENRKELTSRFWHQIHA 818 WN+EVFGFM+L I+ V ++N FW Q++ Sbjct: 268 KGLKEALKVWNREVFGFMELKIDKTVNELN----------------------EFWEQLNF 305 Query: 819 KESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRK 944 KESL+ QK+R KW EGD+N+R+FHA ++ RRR+NQI+ L+K Sbjct: 306 KESLLHQKSRTKWAKEGDSNSRYFHASIKSRRRKNQIVTLKK 347 >GAU42690.1 hypothetical protein TSUD_302000 [Trifolium subterraneum] Length = 1092 Score = 262 bits (670), Expect = 2e-76 Identities = 125/263 (47%), Positives = 170/263 (64%), Gaps = 1/263 (0%) Frame = +3 Query: 102 VFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNNYRRR 281 V +VNVYSPC+ GK Q W++L+ K+ + LWCV GDFN++ S E++G+S + R+ Sbjct: 237 VLHIVNVYSPCNISGKKQLWEDLLELKQRVAEGLWCVGGDFNAILHSFERQGSSTDSRKS 296 Query: 282 DIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNR 461 + FN F+ EM L+D+PVLGKKF+W+S DG++ SR+DRFLLS+ + K+ + QW+G+R Sbjct: 297 ERVLFNSFVEEMELIDIPVLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGITGQWIGDR 356 Query: 462 DISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXX 641 DISDH PIWL S WGPKPFR N WL H +F +FVE WK F V G K Y Sbjct: 357 DISDHCPIWLLFSSNIWGPKPFRVINGWLDHPDFLTFVETTWKSFAVHGKKAYILKEKFK 416 Query: 642 XXXXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSRVGENRKE-LTSRFWHQIHA 818 WNKEV+GF+DLNIE V D+N +++++G D R+E L F Q H Sbjct: 417 LLKDSLRKWNKEVYGFLDLNIEKTVNDINDIENLLGGDDMEAELIRREGLNKDFGRQHHF 476 Query: 819 KESLIRQKARCKWIAEGDANTRF 887 KESL++QK+R +W+ E D T F Sbjct: 477 KESLLKQKSRMRWVKE-DVQTAF 498 >GAU20609.1 hypothetical protein TSUD_33450 [Trifolium subterraneum] Length = 798 Score = 251 bits (641), Expect = 9e-74 Identities = 111/218 (50%), Positives = 152/218 (69%), Gaps = 1/218 (0%) Frame = +3 Query: 294 FNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISD 473 F QFI M L+D+PVLGKKFTW + D SRLDRFLLSE +I K + QWVG+RDISD Sbjct: 13 FVQFIDAMELIDIPVLGKKFTWSNSDNSVMSRLDRFLLSEGIIEKGGITNQWVGDRDISD 72 Query: 474 HSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXX 653 H PIWL+C+ +WGPKPF+FNNCWL HK+F V+ W+ + G K + Sbjct: 73 HHPIWLECNNLNWGPKPFKFNNCWLEHKDFIPVVKATWESLNINGRKAHVLKEKMKRLKE 132 Query: 654 XXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSR-VGENRKELTSRFWHQIHAKESL 830 WNKEVFG +DLNI+ V+D+N +++++ +GD+ + N KE+ +FW Q+H KES+ Sbjct: 133 ELKVWNKEVFGILDLNIDKTVKDLNEVEELIANGDNHPLHLNSKEIAKKFWEQLHFKESI 192 Query: 831 IRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRK 944 ++QK+R KWI +GD+NT +FHA ++GRRRRN IL ++K Sbjct: 193 LKQKSRSKWIKKGDSNTCYFHATIKGRRRRNHILKIKK 230 >XP_006577560.1 PREDICTED: uncharacterized protein LOC102665607 [Glycine max] Length = 326 Score = 234 bits (597), Expect = 6e-72 Identities = 111/210 (52%), Positives = 145/210 (69%), Gaps = 1/210 (0%) Frame = +3 Query: 315 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 494 M+L+D P+ G KFT++ DG A+SRLDRFL+S+ ++ WQ Q VG RDI DH PIWL+ Sbjct: 1 MNLIDPPLHGNKFTYFCSDGIAASRLDRFLVSDGIMNLWQEKGQRVGKRDIYDHCPIWLE 60 Query: 495 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNK 674 CS +WGPKPFRFNNCWL H +FKSF+ E WK Q+ G K Y WNK Sbjct: 61 CSNLNWGPKPFRFNNCWLEHDDFKSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNK 120 Query: 675 EVFGFMDLNIENIVQDMNALDDIVGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARC 851 EVFG++DLNIENIV +MN LD + G + +KE + FW Q+ KESL++QK+R Sbjct: 121 EVFGWLDLNIENIVAEMNKLDRGIEEGCNLNEVVKKKEAKALFWQQLMMKESLLKQKSRL 180 Query: 852 KWIAEGDANTRFFHACVRGRRRRNQILALR 941 +WI EGD NT+FFH+C++ RRR+NQIL+L+ Sbjct: 181 RWIKEGDYNTKFFHSCLQDRRRKNQILSLQ 210 >KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] Length = 729 Score = 244 bits (622), Expect = 2e-71 Identities = 115/279 (41%), Positives = 164/279 (58%), Gaps = 2/279 (0%) Frame = +3 Query: 108 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNNYRRRDI 287 F+VN+YSPCD GK W+E+ K G WC+ GDFN+V E++G +++ Sbjct: 25 FIVNIYSPCDLRGKKNLWEEIHKIKNSYGSGRWCICGDFNTVRLKSERKGVHTRREEKEM 84 Query: 288 ARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 467 +NQFI ++ L+D+P+ G K+TW+ + +SR+DRFL+S+E + +W +Q RD+ Sbjct: 85 LCYNQFIEDVELIDLPLGGGKYTWFRPNRIIASRIDRFLVSQEWLTQWPHCSQKALQRDV 144 Query: 468 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 647 SDH PI L DWGPKPFR NCW F FVEE WKGF V GW + Sbjct: 145 SDHRPILLKDIRLDWGPKPFRSLNCWFDDPSFLGFVEEKWKGFSVTGWGAFILKEKLKHL 204 Query: 648 XXXXXXWNKEVFGFMDLNIENIVQDMNALDDIV--GSGDSRVGENRKELTSRFWHQIHAK 821 WNK+ FG + IE + +++N+LD IV S + R +R+ L + W ++ K Sbjct: 205 KKSIKEWNKQAFGNIHTQIEEVKRNINSLDSIVETRSLNERKVSDRRNLNVKLWDLLNKK 264 Query: 822 ESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILAL 938 ESL+ QK+R KW EGD+N+ FFH CV RR+ N+I+ L Sbjct: 265 ESLLLQKSRLKWAREGDSNSSFFHMCVNKRRKMNEIIGL 303 >GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterraneum] Length = 767 Score = 242 bits (618), Expect = 1e-70 Identities = 119/280 (42%), Positives = 163/280 (58%), Gaps = 1/280 (0%) Frame = +3 Query: 108 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNNYRRRDI 287 + VN+Y C GK + W++LI K + WC+ GDFNS+++ ++ G+SN ++ Sbjct: 76 YFVNIYFACSLAGKRKLWRDLIDFKLLNTPGEWCLGGDFNSITKVSKRSGSSNGSSNKER 135 Query: 288 ARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 467 F QFI M LVD+PV GKKFTW + D A SRLDRFLLSE LI K ++ QWVG RDI Sbjct: 136 TEFAQFIDAMELVDIPVFGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGISNQWVGGRDI 195 Query: 468 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 647 SDH PIWL+CS +WGPKPF+FNN WL H +F FV+ W+ + G K + Sbjct: 196 SDHHPIWLECSNINWGPKPFKFNNFWLDHPDFIPFVKATWESMNIHGKKAFILKEKLKRL 255 Query: 648 XXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSR-VGENRKELTSRFWHQIHAKE 824 WN+EVFG MDL+IE V+D+N +++++ +GD + N K+L+ +FW Q+H KE Sbjct: 256 KEVLKTWNREVFGIMDLDIEKTVKDLNEVEEMIANGDCHPLFSNAKDLSKKFWEQLHNKE 315 Query: 825 SLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRK 944 S RRR N+I+ LRK Sbjct: 316 S---------------------------RRRSNRIVKLRK 328 >GAU50433.1 hypothetical protein TSUD_134890, partial [Trifolium subterraneum] Length = 286 Score = 229 bits (583), Expect = 2e-70 Identities = 104/213 (48%), Positives = 141/213 (66%), Gaps = 1/213 (0%) Frame = +3 Query: 315 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 494 M +D+PVLGKKF+W+S DG+A SRLDRFLLS+ + K V QW+G+RDISDH P+WL Sbjct: 1 MEFIDIPVLGKKFSWFSPDGKAMSRLDRFLLSDGFLTKNGVTGQWIGDRDISDHCPVWLL 60 Query: 495 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNK 674 S +WGPKPFR N W+ H EF FVE WK + V G K + WNK Sbjct: 61 SSFCNWGPKPFRVINGWINHPEFNDFVESAWKSYDVRGKKAFVLKEKLKLLRESLKKWNK 120 Query: 675 EVFGFMDLNIENIVQDMNALDDIVGSGDSRVGENRKE-LTSRFWHQIHAKESLIRQKARC 851 EVFG++DLNIE IV D+N + ++ S D + L FW Q+H K+SL+++K+R Sbjct: 121 EVFGYLDLNIEKIVTDINKFEGLLSSTDGDADYLMLDGLNKEFWKQLHFKDSLLKRKSRS 180 Query: 852 KWIAEGDANTRFFHACVRGRRRRNQILALRKYD 950 KW+ +GD+N+++FH ++GRRRRNQ++ALR D Sbjct: 181 KWVKDGDSNSKYFHQSLKGRRRRNQLVALRDGD 213 >GAU51623.1 hypothetical protein TSUD_414500 [Trifolium subterraneum] Length = 838 Score = 242 bits (617), Expect = 5e-70 Identities = 108/213 (50%), Positives = 146/213 (68%), Gaps = 1/213 (0%) Frame = +3 Query: 315 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 494 M L+D+PV+GKKF+W+S DG+A SRLDRFLLS+ I K ++ QW+GNRDISDH P+WL Sbjct: 1 MELIDIPVIGKKFSWFSADGKAMSRLDRFLLSDNFIAKEEILGQWIGNRDISDHCPVWLI 60 Query: 495 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNK 674 CS +WGPKPF+FNNCWL H E FV W V G K + WN+ Sbjct: 61 CSNLNWGPKPFKFNNCWLKHPELSLFVTRIWVKMNVTGKKAFVIKEKLKRLKEELRGWNR 120 Query: 675 EVFGFMDLNIENIVQDMNALDDIVG-SGDSRVGENRKELTSRFWHQIHAKESLIRQKARC 851 EVFG +DLNIEN V+++N L+ + G + + ++ + +FW Q++ KESLIRQK+R Sbjct: 121 EVFGILDLNIENTVKELNELEGLAAIDGTNSMLVDKGGINKKFWDQLNFKESLIRQKSRA 180 Query: 852 KWIAEGDANTRFFHACVRGRRRRNQILALRKYD 950 W++EGD+NTRFFHA ++ RRRRNQ++ LR+ D Sbjct: 181 NWVSEGDSNTRFFHASLKSRRRRNQMIMLRRGD 213 >GAU42390.1 hypothetical protein TSUD_296880 [Trifolium subterraneum] Length = 938 Score = 240 bits (613), Expect = 6e-69 Identities = 110/234 (47%), Positives = 157/234 (67%), Gaps = 1/234 (0%) Frame = +3 Query: 234 SRSQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSE 413 +RS E+RG+S + + + F+ FI M ++D+PVLGKKFTW++ +G RL RFLLSE Sbjct: 342 NRSGERRGSSGSGCLSERSEFSLFIEAMEVIDIPVLGKKFTWFNSNGSTMRRLYRFLLSE 401 Query: 414 ELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKG 593 I K ++ QW+ + DISDH PIWL+CS+ +WG KP +FNNCW+ H EF V+ + Sbjct: 402 GFIHKGGISNQWISDHDISDHCPIWLECSILNWGHKPVKFNNCWVDHPEFLDLVKNIFAQ 461 Query: 594 FQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSRVGE 773 V G K + WN++VFGF DL I+ V+++N ++D++ +GD + Sbjct: 462 SNVRGTKTFVISEKMKRLKEALKKWNRDVFGFKDLCIDKTVRELNEVEDLIANGDVDPAD 521 Query: 774 -NRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQIL 932 N KEL +FW QIH+KESL+RQK+R KWI EGD+N+RFFH+ ++GRRRRNQI+ Sbjct: 522 LNSKELVRKFWEQIHSKESLLRQKSRTKWIQEGDSNSRFFHSSIKGRRRRNQIV 575 >GAU47952.1 hypothetical protein TSUD_06860 [Trifolium subterraneum] Length = 724 Score = 233 bits (595), Expect = 1e-67 Identities = 105/213 (49%), Positives = 146/213 (68%), Gaps = 1/213 (0%) Frame = +3 Query: 315 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 494 M L+DVPVLGKKF+W+S +G++ SR+DRFLLS+ + K+ + QW+G+RDISDH PIWL Sbjct: 1 MTLLDVPVLGKKFSWFSANGKSMSRIDRFLLSDGFVSKYGITGQWIGDRDISDHCPIWLL 60 Query: 495 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNK 674 S Y WGPKPFR N WL H +F FVE WK F V G K Y WNK Sbjct: 61 VSSYKWGPKPFRVINGWLDHPDFLPFVESAWKSFVVHGKKAYVLKEKFRLLKERLRKWNK 120 Query: 675 EVFGFMDLNIENIVQDMNALDDIVGSGDSRVGENRKE-LTSRFWHQIHAKESLIRQKARC 851 EV+G++DLNIE V ++N +++++G D V R++ L FW Q++ KESL++QK+R Sbjct: 121 EVYGYLDLNIEKTVNEINDIENMLGDDDMEVELTRRQGLNKEFWSQLYHKESLLKQKSRT 180 Query: 852 KWIAEGDANTRFFHACVRGRRRRNQILALRKYD 950 +W+ EGD+N+R+FH ++ RRRRNQ++AL+ D Sbjct: 181 RWVKEGDSNSRYFHESIKSRRRRNQLVALKDGD 213 >GAU49526.1 hypothetical protein TSUD_377390 [Trifolium subterraneum] Length = 1149 Score = 238 bits (606), Expect = 2e-67 Identities = 111/239 (46%), Positives = 158/239 (66%), Gaps = 1/239 (0%) Frame = +3 Query: 240 SQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEEL 419 S E+RG + + + F+ FI M ++D+P+LGKKFTW++ DG SRLDRFLLSE Sbjct: 325 SGERRGCRGSVCLSERSEFSLFIEAMEVIDIPILGKKFTWFNSDGSTMSRLDRFLLSEGF 384 Query: 420 ILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQ 599 I K ++ QW+G+RDISD+ PIWL+CS +WGPKPF+FNNCW+ H EF V+ W Sbjct: 385 IHKGGISNQWIGDRDISDYFPIWLECSNLNWGPKPFKFNNCWVDHPEFLDLVKNIWVQSN 444 Query: 600 VGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGD-SRVGEN 776 + K WN++VFGF DL I+ ++++N ++D++ +GD V N Sbjct: 445 MKRLK------------EALKKWNRDVFGFKDLCIDKTLRELNEVEDLIANGDVDPVDLN 492 Query: 777 RKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKYDA 953 KEL +FW QIH+KESL+R+K+R KWI EGD+N+ FF + ++GR RRNQI+ L+K +A Sbjct: 493 SKELVRKFWEQIHSKESLLRKKSRTKWIQEGDSNSHFFRSSIKGRHRRNQIVMLKKGEA 551 >KYP36320.1 Transposon TX1 uncharacterized [Cajanus cajan] Length = 1401 Score = 237 bits (604), Expect = 7e-67 Identities = 117/282 (41%), Positives = 163/282 (57%), Gaps = 5/282 (1%) Frame = +3 Query: 111 MVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSRSQEKRGASNNYRRRDIA 290 +VNVYS C K + W ++I+ KR G LWC+ GDFN+V R +E++G ++ RD+ Sbjct: 694 VVNVYSSCHLVDKRRLWGDIIMSKRGFGSCLWCIVGDFNTVRRLEERKGGFGDHGARDME 753 Query: 291 RFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDIS 470 FN FI+EM L+DVP++GK+FTW+ DG SRLDR L+SE W V RD+S Sbjct: 754 EFNSFITEMELIDVPLVGKRFTWFRSDGSIMSRLDRVLVSESWSAHWGAGFVKVIPRDVS 813 Query: 471 DHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXX 650 DH P+ L+ V +WGPKPFRFNNCWL H + V W+ G W Sbjct: 814 DHCPLILNHKVLNWGPKPFRFNNCWLSHCGIEGVVRSAWEKQVQGPWAAQRLRSKLLNVK 873 Query: 651 XXXXXWNKEVFGFMDLNIENIVQDMNALDDIVGSGDSRV-----GENRKELTSRFWHQIH 815 WN EVFG +D I+++ ++ LD + +V +KEL + W Sbjct: 874 NALKKWNIEVFGNVDTMIKSLTNELKELD---AKNEEQVLIQSERNRQKELVAGIWSARR 930 Query: 816 AKESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALR 941 K +L+ QKAR +W GD N+++FHAC+RGR+RRNQI+AL+ Sbjct: 931 NKLTLLAQKARIRWGKYGDQNSKYFHACIRGRQRRNQIVALK 972