BLASTX nr result

ID: Glycyrrhiza35_contig00021559 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00021559
         (955 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine...   279   1e-89
KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine...   279   4e-89
GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterran...   286   1e-87
GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterran...   290   3e-85
GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum]   288   7e-85
GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium ...   270   4e-83
GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterran...   281   3e-82
GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterran...   275   4e-82
GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterran...   256   2e-76
GAU42690.1 hypothetical protein TSUD_302000 [Trifolium subterran...   261   6e-76
GAU20609.1 hypothetical protein TSUD_33450 [Trifolium subterraneum]   256   2e-75
XP_006577560.1 PREDICTED: uncharacterized protein LOC102665607 [...   232   5e-71
KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]       242   6e-71
GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterran...   241   2e-70
GAU50433.1 hypothetical protein TSUD_134890, partial [Trifolium ...   226   2e-69
GAU51623.1 hypothetical protein TSUD_414500 [Trifolium subterran...   238   9e-69
GAU42390.1 hypothetical protein TSUD_296880 [Trifolium subterran...   236   3e-67
GAU49526.1 hypothetical protein TSUD_377390 [Trifolium subterran...   237   3e-67
GAU47952.1 hypothetical protein TSUD_06860 [Trifolium subterraneum]   230   3e-66
KYP36320.1 Transposon TX1 uncharacterized [Cajanus cajan]             233   2e-65

>KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine soja]
          Length = 326

 Score =  279 bits (714), Expect = 1e-89
 Identities = 132/247 (53%), Positives = 174/247 (70%), Gaps = 1/247 (0%)
 Frame = -3

Query: 752 WCVAGDFNSVSSSQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRAS 573
           WC+ GDFN+VS+ +E+ G S N+   D+  FN F++EM+L+D P+ G KFT++  DG A+
Sbjct: 58  WCLVGDFNAVSNREERTGRSENWGYIDMVDFNAFVNEMNLIDPPLHGNKFTYFCSDGIAA 117

Query: 572 SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 393
           SRLDRFL+S+ ++  WQV  Q VG RDISDH PIWL+CS  +WGPKPFRFNNCWL H  F
Sbjct: 118 SRLDRFLVSDGIMNLWQVKGQRVGKRDISDHCPIWLECSNLNWGPKPFRFNNCWLEHDGF 177

Query: 392 KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDI 213
           KSF+ E WK  Q+ G K Y              +WNKEVFG++DLNIENIV DMN LD  
Sbjct: 178 KSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNKEVFGWLDLNIENIVADMNELDRG 237

Query: 212 VGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRR 36
           +  G +  V   +KE  + FW Q+  KESL++QK+R +WI EGD+NT FFH+C++ RRR+
Sbjct: 238 IEEGCNLNVVVKKKEANALFWQQLMMKESLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRK 297

Query: 35  NQILALR 15
           NQIL+L+
Sbjct: 298 NQILSLQ 304


>KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine soja]
          Length = 362

 Score =  279 bits (714), Expect = 4e-89
 Identities = 132/247 (53%), Positives = 174/247 (70%), Gaps = 1/247 (0%)
 Frame = -3

Query: 752 WCVAGDFNSVSSSQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRAS 573
           WC+ GDFN+VS+ +E+ G S N+   D+  FN F++EM+L+D P+ G KFT++  DG A+
Sbjct: 58  WCLVGDFNAVSNREERTGRSENWGYIDMVDFNAFVNEMNLIDPPLHGNKFTYFCSDGIAA 117

Query: 572 SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 393
           SRLDRFL+S+ ++  WQV  Q VG RDISDH PIWL+CS  +WGPKPFRFNNCWL H  F
Sbjct: 118 SRLDRFLVSDGIMNLWQVKGQRVGKRDISDHCPIWLECSNLNWGPKPFRFNNCWLEHDGF 177

Query: 392 KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDI 213
           KSF+ E WK  Q+ G K Y              +WNKEVFG++DLNIENIV DMN LD  
Sbjct: 178 KSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNKEVFGWLDLNIENIVADMNELDRG 237

Query: 212 VGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRR 36
           +  G +  V   +KE  + FW Q+  KESL++QK+R +WI EGD+NT FFH+C++ RRR+
Sbjct: 238 IEEGCNLNVVVKKKEANALFWQQLMMKESLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRK 297

Query: 35  NQILALR 15
           NQIL+L+
Sbjct: 298 NQILSLQ 304


>GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterraneum]
          Length = 721

 Score =  286 bits (732), Expect = 1e-87
 Identities = 136/291 (46%), Positives = 189/291 (64%), Gaps = 1/291 (0%)
 Frame = -3

Query: 875  RKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGA 696
            R  R  D   +VNVYSPC   GK + W++L+  K+  GG  WCV GDFNS+  S E++G+
Sbjct: 196  RVDREGDELNIVNVYSPCIISGKKKLWEDLLALKQSTGGGKWCVRGDFNSILHSSERKGS 255

Query: 695  SNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVA 516
            S   R+ + S FN+F+ EM L+D PVLGKKF+W+S DG++ SR+DRFLLS+  + K+ + 
Sbjct: 256  SIVSRQNESSLFNRFVEEMELIDTPVLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGIT 315

Query: 515  AQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIY 336
             +W+G+RDIS H PIWL CS Y+WGPKPFR  N W+ H +F  FVE  WK F V G K  
Sbjct: 316  GKWIGDRDISYHCPIWLLCSSYNWGPKPFRVINGWMEHPDFFDFVETTWKSFDVHGKK-- 373

Query: 335  AFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSR 159
                              EV+GF+DLNIE  V D+NV+++++G  D  +   R+  L   
Sbjct: 374  -----------------GEVYGFLDLNIEKTVTDINVIENLLGGDDEEIDLTRRAGLNKD 416

Query: 158  FWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6
            FW Q+  KESL++QK+R +W+ EGD+N+ FFH  ++ RRRRNQ++AL+  D
Sbjct: 417  FWKQLIHKESLLKQKSRMRWVKEGDSNSKFFHESIKSRRRRNQLVALKDGD 467


>GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterraneum]
          Length = 1794

 Score =  290 bits (741), Expect = 3e-85
 Identities = 137/315 (43%), Positives = 194/315 (61%), Gaps = 1/315 (0%)
 Frame = -3

Query: 953  MIIIWDTNVLEXXXXXXXXXXXXVCARKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCK 774
            ++I+W+  +              +C   +    + F++N+YSPC   GK + W +L+  K
Sbjct: 748  LLIMWNAGLFNLKFSFTGDNFLGLCVECKEG--ILFIINIYSPCSLSGKRKLWSDLLEFK 805

Query: 773  RILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWW 594
            +      WC+ GDFN V  + E++G+S   R+ +   F QF+  M L DVPV GKKF+W+
Sbjct: 806  QNNEQGEWCLGGDFNVVLKTGERKGSSALCRQNERLEFCQFVEAMELCDVPVAGKKFSWF 865

Query: 593  SGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNC 414
            S DG + SRLDRFLLSE+ I   +V  QW+G+RDISDH PIWL CS  +WGPKPF+ NNC
Sbjct: 866  SADGTSMSRLDRFLLSEKFIDSEKVTGQWIGSRDISDHCPIWLLCSNLNWGPKPFKVNNC 925

Query: 413  WLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQD 234
            WL H EFK FVE+ W    V G K +               WN++VFG +DLNIENIV++
Sbjct: 926  WLEHPEFKPFVEKTWNKLNVEGKKAFVIKEKLKRLKEELRRWNRDVFGILDLNIENIVRE 985

Query: 233  MNVLDDIVG-SGDSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHAC 57
            +N  + ++   G + V  +   +  +FW Q+H KESLI+QK+R KW+ EGD+N+ FFHA 
Sbjct: 986  LNEAEGLLAIDGANSVTCDVSAINKKFWDQLHFKESLIKQKSRLKWVREGDSNSRFFHAS 1045

Query: 56   VRGRRRRNQILALRK 12
            ++ RRRRNQ+  LR+
Sbjct: 1046 IKSRRRRNQLSILRR 1060


>GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum]
          Length = 1594

 Score =  288 bits (738), Expect = 7e-85
 Identities = 133/282 (47%), Positives = 188/282 (66%), Gaps = 1/282 (0%)
 Frame = -3

Query: 848  FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 669
            +++NVYSPC   GK + W +L+  K       WC+ GDFN V +  E++G++++ R+ + 
Sbjct: 704  YIINVYSPCSLSGKRKLWSDLLEFKLNNEQGEWCLRGDFNVVLNVGERKGSTSSARQNER 763

Query: 668  SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489
              F QF+  M L+DVPV GKKF+W+S DG A SRLDRFLLS+  I K +VA QW+GN DI
Sbjct: 764  LEFCQFVEAMELIDVPVAGKKFSWFSADGNAISRLDRFLLSDNFIEKEEVAGQWIGNHDI 823

Query: 488  SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309
            SDH PIWL CS  +WGPKPF+ NNCWL H EFK FVE+ W+   + G K +         
Sbjct: 824  SDHCPIWLMCSNLNWGPKPFKVNNCWLEHSEFKLFVEKTWEKLNIRGKKAFVIKEKLKRL 883

Query: 308  XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGS-GDSRVGENRKELTSRFWHQIHAKE 132
                  WN+EVF  +DLNIE  V+++N ++ +VG+ G + V  ++  +  +FW Q++ KE
Sbjct: 884  KEELRGWNREVFSILDLNIEKTVKELNEVEGLVGNDGVNSVMGDKSGVNRKFWEQLYFKE 943

Query: 131  SLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6
            S+I+QK+R KW+ EGD+NT FF A ++ RRRRNQ++ LR+ D
Sbjct: 944  SMIKQKSRLKWVREGDSNTRFFQASLKNRRRRNQLVLLRRGD 985


>GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium subterraneum]
          Length = 557

 Score =  270 bits (690), Expect = 4e-83
 Identities = 126/268 (47%), Positives = 175/268 (65%), Gaps = 1/268 (0%)
 Frame = -3

Query: 812 GKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDISRFNQFISEMHL 633
           GK + W +LI  +       WC+ GDFNS++ + E+RG+SN     + + F Q I  M L
Sbjct: 3   GKRKLWHDLIEFRMNNAPGEWCLGGDFNSITKTSERRGSSNWSGNTERTEFVQIIETMEL 62

Query: 632 VDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSV 453
           +D+PVLGKKFTW + D  A SRLDRFLLSE LI K  +  QWVG+RDISDH PIWL+C+ 
Sbjct: 63  IDIPVLGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGITNQWVGDRDISDHYPIWLECNN 122

Query: 452 YDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNKEVF 273
            +W PKPF+FNNCWL H +F  FV+  W+   + G K +              +WN EVF
Sbjct: 123 RNWCPKPFKFNNCWLEHPDFIPFVKASWESMDIHGRKAFILKEKLKRLKESLKKWNHEVF 182

Query: 272 GFMDLNIENIVQDMNVLDDIVGSGDSR-VGENRKELTSRFWHQIHAKESLIRQKARCKWI 96
           G MDLNIE  V+++N +++++ +G+S  +  N K+ +  FW Q+  KES+++QK+R KWI
Sbjct: 183 GIMDLNIEKTVKELNEIEEMIANGNSHPMYPNSKKQSKMFWEQLRFKESILKQKSRTKWI 242

Query: 95  AEGDANTCFFHACVRGRRRRNQILALRK 12
            EGD+NT FFHA ++GR R N+I  +RK
Sbjct: 243 QEGDSNTSFFHATIKGRHRSNRIAKIRK 270


>GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterraneum]
          Length = 1636

 Score =  281 bits (718), Expect = 3e-82
 Identities = 134/280 (47%), Positives = 181/280 (64%), Gaps = 1/280 (0%)
 Frame = -3

Query: 848  FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 669
            ++VN+YSPC   G                       GDFNS++   E+RG+      R+ 
Sbjct: 670  YIVNIYSPCTMAG-----------------------GDFNSITKIGERRGSHGGSVYRER 706

Query: 668  SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489
              F+QFI  M LVD+PVLGKKFTW++ D  A SRLDRFLLSE  I K  ++ QWVGNRDI
Sbjct: 707  IEFSQFIDAMELVDIPVLGKKFTWFNSDCSAMSRLDRFLLSEGFIEKGGISNQWVGNRDI 766

Query: 488  SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309
            SDH PIWL+ S  +WGPKPF+FNNCWL H +F  FV+  W+   + G K +         
Sbjct: 767  SDHCPIWLESSNINWGPKPFKFNNCWLEHSDFLPFVKATWEKMNIHGKKAFIIKEKLKRL 826

Query: 308  XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGE-NRKELTSRFWHQIHAKE 132
                  WN+EVFG MDLNIE  V+D+N +++++ +GD+++   N KEL+ +FW Q+H KE
Sbjct: 827  KEALKTWNQEVFGIMDLNIEKTVKDLNEIEELIANGDNQLDSVNSKELSKKFWEQLHFKE 886

Query: 131  SLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRK 12
            S+++QK+R KWI EGD+NT FFHA ++GRRRRN+I+ L+K
Sbjct: 887  SILQQKSRTKWIQEGDSNTRFFHASIKGRRRRNRIVKLKK 926


>GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterraneum]
          Length = 862

 Score =  275 bits (702), Expect = 4e-82
 Identities = 128/288 (44%), Positives = 186/288 (64%), Gaps = 1/288 (0%)
 Frame = -3

Query: 866  RSSDVFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNN 687
            R   V  +VN+YSPC   GK + W++L+  K++  G   C+ GDFN++  S E++GAS +
Sbjct: 409  REGAVTHLVNIYSPCSLSGKKKLWEDLLEIKQLFTGGECCLRGDFNAILHSSERKGASAD 468

Query: 686  YRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQW 507
             R+ +   FN+F+ EM ++DVPVLG K +W S DG++ SRLDRF+LS+  I K+ +  QW
Sbjct: 469  SRQGERMMFNRFVEEMEMIDVPVLGMKVSWVSADGKSMSRLDRFILSDGFITKFGIIGQW 528

Query: 506  VGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFX 327
            +GNR+I DH PIWL  S  +WGPKPFR  N  L H +F  F+E CWK F + G K Y   
Sbjct: 529  IGNRNIFDHCPIWLYASAKNWGPKPFRAINGCLEHPDFLVFLESCWKSFDIQGTKAYVLK 588

Query: 326  XXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWH 150
                       +WNKEVFG +DLNI+  V+++N ++ ++G  D  V   R+E L S FW 
Sbjct: 589  EKLRFLKEILKKWNKEVFGILDLNIDKTVKELNDIEKMLGDDDPDVELTRREGLNSEFWS 648

Query: 149  QIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6
            Q+H KE L++QK+R + + EGD+N+ FFH  ++ RRR+NQ++ L+  D
Sbjct: 649  QLHFKEILLQQKSRTRRVKEGDSNSKFFHESIKRRRRKNQLVVLKDGD 696


>GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterraneum]
          Length = 695

 Score =  256 bits (654), Expect = 2e-76
 Identities = 124/282 (43%), Positives = 172/282 (60%), Gaps = 1/282 (0%)
 Frame = -3

Query: 854 VFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNN-YRR 678
           + ++VNVYS C+  GK + W +LI  K     E WC+ GDFNS+S   E+RG+S+  +R+
Sbjct: 88  LLYIVNVYSSCNVSGKRKLWNDLIDFKLNNEPEEWCLGGDFNSISKVGERRGSSSGAWRQ 147

Query: 677 RDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGN 498
            +   F QFI  + +VD+P+  K FTW++ DG A SRL+ FL+SE  I K  ++ QWVG+
Sbjct: 148 GERIEFIQFIDALEVVDIPLKDKMFTWFNSDGSAMSRLNHFLVSEGFIEKGSLSYQWVGD 207

Query: 497 RDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXX 318
           RDISDH PIWL CS  +WGPKPF FNNCWL H +F  FV+E W+   + G K +      
Sbjct: 208 RDISDHCPIWLMCSNINWGPKPFTFNNCWLEHPKFFEFVKETWENMDIRGKKAFIIKEKL 267

Query: 317 XXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKELTSRFWHQIHA 138
                    WN+EVFGFM+L I+  V ++N                       FW Q++ 
Sbjct: 268 KGLKEALKVWNREVFGFMELKIDKTVNELN----------------------EFWEQLNF 305

Query: 137 KESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRK 12
           KESL+ QK+R KW  EGD+N+ +FHA ++ RRR+NQI+ L+K
Sbjct: 306 KESLLHQKSRTKWAKEGDSNSRYFHASIKSRRRKNQIVTLKK 347


>GAU42690.1 hypothetical protein TSUD_302000 [Trifolium subterraneum]
          Length = 1092

 Score =  261 bits (667), Expect = 6e-76
 Identities = 125/263 (47%), Positives = 171/263 (65%), Gaps = 1/263 (0%)
 Frame = -3

Query: 854  VFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRR 675
            V  +VNVYSPC+  GK Q W++L+  K+ +   LWCV GDFN++  S E++G+S + R+ 
Sbjct: 237  VLHIVNVYSPCNISGKKQLWEDLLELKQRVAEGLWCVGGDFNAILHSFERQGSSTDSRKS 296

Query: 674  DISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNR 495
            +   FN F+ EM L+D+PVLGKKF+W+S DG++ SR+DRFLLS+  + K+ +  QW+G+R
Sbjct: 297  ERVLFNSFVEEMELIDIPVLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGITGQWIGDR 356

Query: 494  DISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXX 315
            DISDH PIWL  S   WGPKPFR  N WL H +F +FVE  WK F V G K Y       
Sbjct: 357  DISDHCPIWLLFSSNIWGPKPFRVINGWLDHPDFLTFVETTWKSFAVHGKKAYILKEKFK 416

Query: 314  XXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWHQIHA 138
                   +WNKEV+GF+DLNIE  V D+N +++++G  D      R+E L   F  Q H 
Sbjct: 417  LLKDSLRKWNKEVYGFLDLNIEKTVNDINDIENLLGGDDMEAELIRREGLNKDFGRQHHF 476

Query: 137  KESLIRQKARCKWIAEGDANTCF 69
            KESL++QK+R +W+ E D  T F
Sbjct: 477  KESLLKQKSRMRWVKE-DVQTAF 498


>GAU20609.1 hypothetical protein TSUD_33450 [Trifolium subterraneum]
          Length = 798

 Score =  256 bits (653), Expect = 2e-75
 Identities = 112/220 (50%), Positives = 154/220 (70%), Gaps = 1/220 (0%)
 Frame = -3

Query: 668 SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489
           + F QFI  M L+D+PVLGKKFTW + D    SRLDRFLLSE +I K  +  QWVG+RDI
Sbjct: 11  TEFVQFIDAMELIDIPVLGKKFTWSNSDNSVMSRLDRFLLSEGIIEKGGITNQWVGDRDI 70

Query: 488 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309
           SDH PIWL+C+  +WGPKPF+FNNCWL HK+F   V+  W+   + G K +         
Sbjct: 71  SDHHPIWLECNNLNWGPKPFKFNNCWLEHKDFIPVVKATWESLNINGRKAHVLKEKMKRL 130

Query: 308 XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSR-VGENRKELTSRFWHQIHAKE 132
                 WNKEVFG +DLNI+  V+D+N +++++ +GD+  +  N KE+  +FW Q+H KE
Sbjct: 131 KEELKVWNKEVFGILDLNIDKTVKDLNEVEELIANGDNHPLHLNSKEIAKKFWEQLHFKE 190

Query: 131 SLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRK 12
           S+++QK+R KWI +GD+NTC+FHA ++GRRRRN IL ++K
Sbjct: 191 SILKQKSRSKWIKKGDSNTCYFHATIKGRRRRNHILKIKK 230


>XP_006577560.1 PREDICTED: uncharacterized protein LOC102665607 [Glycine max]
          Length = 326

 Score =  232 bits (591), Expect = 5e-71
 Identities = 111/210 (52%), Positives = 145/210 (69%), Gaps = 1/210 (0%)
 Frame = -3

Query: 641 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 462
           M+L+D P+ G KFT++  DG A+SRLDRFL+S+ ++  WQ   Q VG RDI DH PIWL+
Sbjct: 1   MNLIDPPLHGNKFTYFCSDGIAASRLDRFLVSDGIMNLWQEKGQRVGKRDIYDHCPIWLE 60

Query: 461 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNK 282
           CS  +WGPKPFRFNNCWL H +FKSF+ E WK  Q+ G K Y              +WNK
Sbjct: 61  CSNLNWGPKPFRFNNCWLEHDDFKSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNK 120

Query: 281 EVFGFMDLNIENIVQDMNVLDDIVGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARC 105
           EVFG++DLNIENIV +MN LD  +  G +      +KE  + FW Q+  KESL++QK+R 
Sbjct: 121 EVFGWLDLNIENIVAEMNKLDRGIEEGCNLNEVVKKKEAKALFWQQLMMKESLLKQKSRL 180

Query: 104 KWIAEGDANTCFFHACVRGRRRRNQILALR 15
           +WI EGD NT FFH+C++ RRR+NQIL+L+
Sbjct: 181 RWIKEGDYNTKFFHSCLQDRRRKNQILSLQ 210


>KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 729

 Score =  242 bits (618), Expect = 6e-71
 Identities = 116/279 (41%), Positives = 164/279 (58%), Gaps = 2/279 (0%)
 Frame = -3

Query: 848 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 669
           F+VN+YSPCD  GK   W+E+   K   G   WC+ GDFN+V    E++G       +++
Sbjct: 25  FIVNIYSPCDLRGKKNLWEEIHKIKNSYGSGRWCICGDFNTVRLKSERKGVHTRREEKEM 84

Query: 668 SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489
             +NQFI ++ L+D+P+ G K+TW+  +   +SR+DRFL+S+E + +W   +Q    RD+
Sbjct: 85  LCYNQFIEDVELIDLPLGGGKYTWFRPNRIIASRIDRFLVSQEWLTQWPHCSQKALQRDV 144

Query: 488 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309
           SDH PI L     DWGPKPFR  NCW     F  FVEE WKGF V GW  +         
Sbjct: 145 SDHRPILLKDIRLDWGPKPFRSLNCWFDDPSFLGFVEEKWKGFSVTGWGAFILKEKLKHL 204

Query: 308 XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIV--GSGDSRVGENRKELTSRFWHQIHAK 135
                EWNK+ FG +   IE + +++N LD IV   S + R   +R+ L  + W  ++ K
Sbjct: 205 KKSIKEWNKQAFGNIHTQIEEVKRNINSLDSIVETRSLNERKVSDRRNLNVKLWDLLNKK 264

Query: 134 ESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILAL 18
           ESL+ QK+R KW  EGD+N+ FFH CV  RR+ N+I+ L
Sbjct: 265 ESLLLQKSRLKWAREGDSNSSFFHMCVNKRRKMNEIIGL 303


>GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score =  241 bits (616), Expect = 2e-70
 Identities = 119/280 (42%), Positives = 163/280 (58%), Gaps = 1/280 (0%)
 Frame = -3

Query: 848 FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 669
           + VN+Y  C   GK + W++LI  K +     WC+ GDFNS++   ++ G+SN    ++ 
Sbjct: 76  YFVNIYFACSLAGKRKLWRDLIDFKLLNTPGEWCLGGDFNSITKVSKRSGSSNGSSNKER 135

Query: 668 SRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 489
           + F QFI  M LVD+PV GKKFTW + D  A SRLDRFLLSE LI K  ++ QWVG RDI
Sbjct: 136 TEFAQFIDAMELVDIPVFGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGISNQWVGGRDI 195

Query: 488 SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 309
           SDH PIWL+CS  +WGPKPF+FNN WL H +F  FV+  W+   + G K +         
Sbjct: 196 SDHHPIWLECSNINWGPKPFKFNNFWLDHPDFIPFVKATWESMNIHGKKAFILKEKLKRL 255

Query: 308 XXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSR-VGENRKELTSRFWHQIHAKE 132
                 WN+EVFG MDL+IE  V+D+N +++++ +GD   +  N K+L+ +FW Q+H KE
Sbjct: 256 KEVLKTWNREVFGIMDLDIEKTVKDLNEVEEMIANGDCHPLFSNAKDLSKKFWEQLHNKE 315

Query: 131 SLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRK 12
           S                           RRR N+I+ LRK
Sbjct: 316 S---------------------------RRRSNRIVKLRK 328


>GAU50433.1 hypothetical protein TSUD_134890, partial [Trifolium subterraneum]
          Length = 286

 Score =  226 bits (577), Expect = 2e-69
 Identities = 104/213 (48%), Positives = 141/213 (66%), Gaps = 1/213 (0%)
 Frame = -3

Query: 641 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 462
           M  +D+PVLGKKF+W+S DG+A SRLDRFLLS+  + K  V  QW+G+RDISDH P+WL 
Sbjct: 1   MEFIDIPVLGKKFSWFSPDGKAMSRLDRFLLSDGFLTKNGVTGQWIGDRDISDHCPVWLL 60

Query: 461 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNK 282
            S  +WGPKPFR  N W+ H EF  FVE  WK + V G K +              +WNK
Sbjct: 61  SSFCNWGPKPFRVINGWINHPEFNDFVESAWKSYDVRGKKAFVLKEKLKLLRESLKKWNK 120

Query: 281 EVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWHQIHAKESLIRQKARC 105
           EVFG++DLNIE IV D+N  + ++ S D        + L   FW Q+H K+SL+++K+R 
Sbjct: 121 EVFGYLDLNIEKIVTDINKFEGLLSSTDGDADYLMLDGLNKEFWKQLHFKDSLLKRKSRS 180

Query: 104 KWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6
           KW+ +GD+N+ +FH  ++GRRRRNQ++ALR  D
Sbjct: 181 KWVKDGDSNSKYFHQSLKGRRRRNQLVALRDGD 213


>GAU51623.1 hypothetical protein TSUD_414500 [Trifolium subterraneum]
          Length = 838

 Score =  238 bits (608), Expect = 9e-69
 Identities = 107/213 (50%), Positives = 145/213 (68%), Gaps = 1/213 (0%)
 Frame = -3

Query: 641 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 462
           M L+D+PV+GKKF+W+S DG+A SRLDRFLLS+  I K ++  QW+GNRDISDH P+WL 
Sbjct: 1   MELIDIPVIGKKFSWFSADGKAMSRLDRFLLSDNFIAKEEILGQWIGNRDISDHCPVWLI 60

Query: 461 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNK 282
           CS  +WGPKPF+FNNCWL H E   FV   W    V G K +               WN+
Sbjct: 61  CSNLNWGPKPFKFNNCWLKHPELSLFVTRIWVKMNVTGKKAFVIKEKLKRLKEELRGWNR 120

Query: 281 EVFGFMDLNIENIVQDMNVLDDIVG-SGDSRVGENRKELTSRFWHQIHAKESLIRQKARC 105
           EVFG +DLNIEN V+++N L+ +    G + +  ++  +  +FW Q++ KESLIRQK+R 
Sbjct: 121 EVFGILDLNIENTVKELNELEGLAAIDGTNSMLVDKGGINKKFWDQLNFKESLIRQKSRA 180

Query: 104 KWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6
            W++EGD+NT FFHA ++ RRRRNQ++ LR+ D
Sbjct: 181 NWVSEGDSNTRFFHASLKSRRRRNQMIMLRRGD 213


>GAU42390.1 hypothetical protein TSUD_296880 [Trifolium subterraneum]
          Length = 938

 Score =  236 bits (601), Expect = 3e-67
 Identities = 109/232 (46%), Positives = 155/232 (66%), Gaps = 1/232 (0%)
 Frame = -3

Query: 716  SQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEEL 537
            S E+RG+S +    + S F+ FI  M ++D+PVLGKKFTW++ +G    RL RFLLSE  
Sbjct: 344  SGERRGSSGSGCLSERSEFSLFIEAMEVIDIPVLGKKFTWFNSNGSTMRRLYRFLLSEGF 403

Query: 536  ILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQ 357
            I K  ++ QW+ + DISDH PIWL+CS+ +WG KP +FNNCW+ H EF   V+  +    
Sbjct: 404  IHKGGISNQWISDHDISDHCPIWLECSILNWGHKPVKFNNCWVDHPEFLDLVKNIFAQSN 463

Query: 356  VGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGE-N 180
            V G K +              +WN++VFGF DL I+  V+++N ++D++ +GD    + N
Sbjct: 464  VRGTKTFVISEKMKRLKEALKKWNRDVFGFKDLCIDKTVRELNEVEDLIANGDVDPADLN 523

Query: 179  RKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQIL 24
             KEL  +FW QIH+KESL+RQK+R KWI EGD+N+ FFH+ ++GRRRRNQI+
Sbjct: 524  SKELVRKFWEQIHSKESLLRQKSRTKWIQEGDSNSRFFHSSIKGRRRRNQIV 575


>GAU49526.1 hypothetical protein TSUD_377390 [Trifolium subterraneum]
          Length = 1149

 Score =  237 bits (605), Expect = 3e-67
 Identities = 112/239 (46%), Positives = 159/239 (66%), Gaps = 1/239 (0%)
 Frame = -3

Query: 716  SQEKRGASNNYRRRDISRFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEEL 537
            S E+RG   +    + S F+ FI  M ++D+P+LGKKFTW++ DG   SRLDRFLLSE  
Sbjct: 325  SGERRGCRGSVCLSERSEFSLFIEAMEVIDIPILGKKFTWFNSDGSTMSRLDRFLLSEGF 384

Query: 536  ILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQ 357
            I K  ++ QW+G+RDISD+ PIWL+CS  +WGPKPF+FNNCW+ H EF   V+  W    
Sbjct: 385  IHKGGISNQWIGDRDISDYFPIWLECSNLNWGPKPFKFNNCWVDHPEFLDLVKNIWVQSN 444

Query: 356  VGGWKIYAFXXXXXXXXXXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGD-SRVGEN 180
            +   K                +WN++VFGF DL I+  ++++N ++D++ +GD   V  N
Sbjct: 445  MKRLK------------EALKKWNRDVFGFKDLCIDKTLRELNEVEDLIANGDVDPVDLN 492

Query: 179  RKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALRKYDA 3
             KEL  +FW QIH+KESL+R+K+R KWI EGD+N+ FF + ++GR RRNQI+ L+K +A
Sbjct: 493  SKELVRKFWEQIHSKESLLRKKSRTKWIQEGDSNSHFFRSSIKGRHRRNQIVMLKKGEA 551


>GAU47952.1 hypothetical protein TSUD_06860 [Trifolium subterraneum]
          Length = 724

 Score =  230 bits (586), Expect = 3e-66
 Identities = 104/213 (48%), Positives = 146/213 (68%), Gaps = 1/213 (0%)
 Frame = -3

Query: 641 MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 462
           M L+DVPVLGKKF+W+S +G++ SR+DRFLLS+  + K+ +  QW+G+RDISDH PIWL 
Sbjct: 1   MTLLDVPVLGKKFSWFSANGKSMSRIDRFLLSDGFVSKYGITGQWIGDRDISDHCPIWLL 60

Query: 461 CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXEWNK 282
            S Y WGPKPFR  N WL H +F  FVE  WK F V G K Y              +WNK
Sbjct: 61  VSSYKWGPKPFRVINGWLDHPDFLPFVESAWKSFVVHGKKAYVLKEKFRLLKERLRKWNK 120

Query: 281 EVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWHQIHAKESLIRQKARC 105
           EV+G++DLNIE  V ++N +++++G  D  V   R++ L   FW Q++ KESL++QK+R 
Sbjct: 121 EVYGYLDLNIEKTVNEINDIENMLGDDDMEVELTRRQGLNKEFWSQLYHKESLLKQKSRT 180

Query: 104 KWIAEGDANTCFFHACVRGRRRRNQILALRKYD 6
           +W+ EGD+N+ +FH  ++ RRRRNQ++AL+  D
Sbjct: 181 RWVKEGDSNSRYFHESIKSRRRRNQLVALKDGD 213


>KYP36320.1 Transposon TX1 uncharacterized [Cajanus cajan]
          Length = 1401

 Score =  233 bits (593), Expect = 2e-65
 Identities = 116/282 (41%), Positives = 162/282 (57%), Gaps = 5/282 (1%)
 Frame = -3

Query: 845  MVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDIS 666
            +VNVYS C    K + W ++I+ KR  G  LWC+ GDFN+V   +E++G   ++  RD+ 
Sbjct: 694  VVNVYSSCHLVDKRRLWGDIIMSKRGFGSCLWCIVGDFNTVRRLEERKGGFGDHGARDME 753

Query: 665  RFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDIS 486
             FN FI+EM L+DVP++GK+FTW+  DG   SRLDR L+SE     W      V  RD+S
Sbjct: 754  EFNSFITEMELIDVPLVGKRFTWFRSDGSIMSRLDRVLVSESWSAHWGAGFVKVIPRDVS 813

Query: 485  DHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXX 306
            DH P+ L+  V +WGPKPFRFNNCWL H   +  V   W+    G W             
Sbjct: 814  DHCPLILNHKVLNWGPKPFRFNNCWLSHCGIEGVVRSAWEKQVQGPWAAQRLRSKLLNVK 873

Query: 305  XXXXEWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRV-----GENRKELTSRFWHQIH 141
                +WN EVFG +D  I+++  ++  LD      + +V        +KEL +  W    
Sbjct: 874  NALKKWNIEVFGNVDTMIKSLTNELKELD---AKNEEQVLIQSERNRQKELVAGIWSARR 930

Query: 140  AKESLIRQKARCKWIAEGDANTCFFHACVRGRRRRNQILALR 15
             K +L+ QKAR +W   GD N+ +FHAC+RGR+RRNQI+AL+
Sbjct: 931  NKLTLLAQKARIRWGKYGDQNSKYFHACIRGRQRRNQIVALK 972

