BLASTX nr result

ID: Glycyrrhiza35_contig00013575 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00013575
         (1387 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU15285.1 hypothetical protein TSUD_03520 [Trifolium subterraneum]   300   7e-95
GAU20748.1 hypothetical protein TSUD_231620 [Trifolium subterran...   270   7e-84
GAU44321.1 hypothetical protein TSUD_305020 [Trifolium subterran...   271   1e-82
GAU19342.1 hypothetical protein TSUD_336290 [Trifolium subterran...   286   6e-82
GAU29238.1 hypothetical protein TSUD_362280 [Trifolium subterran...   281   3e-80
KHN36156.1 Retrovirus-related Pol polyprotein from transposon TN...   281   3e-80
KHN22040.1 Retrovirus-related Pol polyprotein from transposon TN...   281   3e-80
GAU26016.1 hypothetical protein TSUD_64040 [Trifolium subterraneum]   274   2e-79
GAU30708.1 hypothetical protein TSUD_39320 [Trifolium subterraneum]   277   6e-79
GAU11134.1 hypothetical protein TSUD_197580 [Trifolium subterran...   268   7e-76
GAU37351.1 hypothetical protein TSUD_395330 [Trifolium subterran...   264   1e-74
KYP43730.1 hypothetical protein KK1_034810, partial [Cajanus cajan]   231   2e-68
KYP45646.1 hypothetical protein KK1_032760, partial [Cajanus cajan]   222   6e-67
KYP37941.1 hypothetical protein KK1_040830 [Cajanus cajan]            221   2e-65
KYP34307.1 Retrovirus-related Pol polyprotein from transposon TN...   236   4e-65
KYP46257.1 Retrovirus-related Pol polyprotein from transposon TN...   237   6e-65
KYP33001.1 hypothetical protein KK1_046197, partial [Cajanus cajan]   225   7e-65
GAU27211.1 hypothetical protein TSUD_108020 [Trifolium subterran...   227   5e-62
KHN49021.1 hypothetical protein glysoja_031232, partial [Glycine...   209   2e-60
KYP36809.1 hypothetical protein KK1_042047 [Cajanus cajan]            207   2e-60

>GAU15285.1 hypothetical protein TSUD_03520 [Trifolium subterraneum]
          Length = 392

 Score =  300 bits (769), Expect = 7e-95
 Identities = 144/210 (68%), Positives = 181/210 (86%), Gaps = 1/210 (0%)
 Frame = -1

Query: 1180 TLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRWLV 1001
            T  LKLD+ N+LLW+QQVEGVI A+KLH FVVNP IP KYASESDRELD VSE + +WLV
Sbjct: 26   TPKLKLDDGNYLLWSQQVEGVILANKLHRFVVNPQIPAKYASESDRELDRVSEAYDKWLV 85

Query: 1000 QDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKKGT 821
            QDQ+LFTWLLS+L+ES+LPR IGC+H+++VWD+IHKHF AH+KA+VRQ R+ELK+ KKGT
Sbjct: 86   QDQMLFTWLLSTLAESVLPRTIGCRHAFQVWDQIHKHFEAHLKAKVRQLRSELKNVKKGT 145

Query: 820  KSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVTDI 641
            KSI+E++LR++ I D+L++IGD I+EQD IDSILEGLPEEYN FVM+IYG  D   + DI
Sbjct: 146  KSITEFVLRVRVIADTLISIGDSISEQDQIDSILEGLPEEYNPFVMMIYGRSDSPSLYDI 205

Query: 640  EALLMVQEAQLEKFKQELTT-SATVNIARA 554
            E LL+VQE+QLEKF+QEL+T SA+ N+A +
Sbjct: 206  EGLLLVQESQLEKFRQELSTPSASANLAHS 235


>GAU20748.1 hypothetical protein TSUD_231620 [Trifolium subterraneum]
          Length = 327

 Score =  270 bits (690), Expect = 7e-84
 Identities = 128/239 (53%), Positives = 186/239 (77%), Gaps = 7/239 (2%)
 Frame = -1

Query: 1186 THTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRW 1007
            TH+L++KLDEKNFL W+QQV GVITAH LH F+VNP IP+++A+ +DR     S+E+++W
Sbjct: 34   THSLTIKLDEKNFLSWSQQVNGVITAHNLHRFIVNPEIPLQFATVADRIDGKTSDEYRKW 93

Query: 1006 LVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKK 827
            + +DQ LFTWLLS++S+ +LPRV+ CKH++EVWDKIHK+F++ +K+R+RQ ++ELK+TKK
Sbjct: 94   IFKDQTLFTWLLSTISDVVLPRVVHCKHAHEVWDKIHKYFNSVLKSRIRQLKSELKNTKK 153

Query: 826  GTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVT 647
              + +SEYLLRIKSIV+SL+A+G+ ITEQ+ +++IL+GLPEE+NSFVM++Y   D   + 
Sbjct: 154  LARPVSEYLLRIKSIVNSLIAMGEMITEQEQVEAILDGLPEEFNSFVMMVYSRFDTPTIE 213

Query: 646  DIEALLMVQEAQLEKFKQELTT-SATVNIARAPPSNR------PDSAASSQQFPNFGQY 491
             +E LLM+QEAQ EKF+QELT  S + N+A+    N        D+ + ++Q+ NF  Y
Sbjct: 214  YVEGLLMIQEAQFEKFRQELTNPSVSANVAQMESKNNQANQDGEDNESDTEQY-NFNAY 271


>GAU44321.1 hypothetical protein TSUD_305020 [Trifolium subterraneum]
          Length = 468

 Score =  271 bits (694), Expect = 1e-82
 Identities = 125/201 (62%), Positives = 168/201 (83%)
 Frame = -1

Query: 1186 THTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRW 1007
            TH+L++KLDEKNFLLW+QQ+ GVIT H LH FVVNP IP+++AS +DR    +SEE+Q+W
Sbjct: 258  THSLTIKLDEKNFLLWSQQINGVITTHNLHRFVVNPEIPLQFASVNDRLNGKISEEYQKW 317

Query: 1006 LVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKK 827
            L  DQ LFTWLLS++S+S+LPRV+ CKH++EVWDKIHKHF++ +K+R+RQ R ELK+TKK
Sbjct: 318  LFIDQTLFTWLLSTISDSILPRVLHCKHAHEVWDKIHKHFNSVLKSRIRQLRFELKNTKK 377

Query: 826  GTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVT 647
              + ISEYLLRIKSI++SL+A+G+ ++EQ+ ++ IL+GLPEE+N FVM++Y   D   V 
Sbjct: 378  LARPISEYLLRIKSIINSLIALGEAVSEQEQVNVILDGLPEEFNPFVMMVYSRYDTPTVE 437

Query: 646  DIEALLMVQEAQLEKFKQELT 584
            D+E LLM+QEAQ EKF+QELT
Sbjct: 438  DVEGLLMLQEAQFEKFRQELT 458


>GAU19342.1 hypothetical protein TSUD_336290 [Trifolium subterraneum]
          Length = 1442

 Score =  286 bits (731), Expect = 6e-82
 Identities = 128/210 (60%), Positives = 181/210 (86%), Gaps = 1/210 (0%)
 Frame = -1

Query: 1186 THTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRW 1007
            TH+L++KLDEKN+LLWNQQV GVITAH LH F+VNP IP+++AS++DR  D  S+E+++W
Sbjct: 34   THSLTIKLDEKNYLLWNQQVNGVITAHDLHRFIVNPQIPIQFASDADRVADRTSDEYRQW 93

Query: 1006 LVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKK 827
            + +DQ LFTWLLS+LS+S+LPRV+GCKH+++VWD+IHK+FH+ ++AR RQ R+ELK+TKK
Sbjct: 94   IFKDQTLFTWLLSTLSDSVLPRVLGCKHAFQVWDQIHKYFHSVLQARARQLRSELKNTKK 153

Query: 826  GTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVT 647
             ++S+ EYLLRIKSIV+SLLA+GD +++++ +D+ILEGLPEE+NSFVM++Y   D   V 
Sbjct: 154  ASRSVGEYLLRIKSIVNSLLAVGDLVSDREQVDAILEGLPEEFNSFVMMVYSRFDTPTVE 213

Query: 646  DIEALLMVQEAQLEKFKQELTT-SATVNIA 560
            D+EALL++QEAQ EKF+QEL + S + ++A
Sbjct: 214  DVEALLLLQEAQFEKFRQELASPSVSAHVA 243



 Score = 65.1 bits (157), Expect = 2e-07
 Identities = 30/43 (69%), Positives = 33/43 (76%)
 Frame = -2

Query: 297 SWYPD*GASHHLTADPQNLRQSSQFIGTDQVMMGNGQGL*ANS 169
           SWYPD GASHHLT DP NL  S+ + G DQVMMGNGQG+  NS
Sbjct: 361 SWYPDSGASHHLTYDPYNLVHSNPYTGHDQVMMGNGQGVSINS 403


>GAU29238.1 hypothetical protein TSUD_362280 [Trifolium subterraneum]
          Length = 1433

 Score =  281 bits (719), Expect = 3e-80
 Identities = 134/221 (60%), Positives = 181/221 (81%), Gaps = 1/221 (0%)
 Frame = -1

Query: 1186 THTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRW 1007
            TH+L++KLDEKNFLLW+QQV GVITAH LH FVVNP IP+++AS +DR     S+E+++W
Sbjct: 34   THSLTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPEIPLQFASVNDRIEGKTSDEYRKW 93

Query: 1006 LVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKK 827
            L +DQ LFTWLLS++S+++LPRV+ CKHS+EVWDKIHK+F++ +K+R+RQ R+ELK+TKK
Sbjct: 94   LFKDQTLFTWLLSTISDAVLPRVVHCKHSHEVWDKIHKYFNSVLKSRIRQLRSELKNTKK 153

Query: 826  GTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVT 647
              + +SEYLLRIKSIV+SL+A+G+ +TEQ+ ID+ILEGLPE++NSFVM++Y   D   V 
Sbjct: 154  LARPVSEYLLRIKSIVNSLIAMGETVTEQEQIDAILEGLPEDFNSFVMMMYSRFDTPTVE 213

Query: 646  DIEALLMVQEAQLEKFKQELTT-SATVNIARAPPSNRPDSA 527
            DIE LLM+QEAQ EKF+QELT  S + NIA+    N   ++
Sbjct: 214  DIEGLLMLQEAQFEKFRQELTNPSVSANIAQIGSKNHQSNS 254



 Score = 63.5 bits (153), Expect = 6e-07
 Identities = 30/62 (48%), Positives = 39/62 (62%)
 Frame = -2

Query: 366 PNAYLATQQPHKEPVSQEFQISQSWYPD*GASHHLTADPQNLRQSSQFIGTDQVMMGNGQ 187
           P+A+LA  Q +      +   + SWYPD GASHHLT +P NL   + + G DQV MGNGQ
Sbjct: 344 PSAHLALPQYYNPIADMDSVSNASWYPDSGASHHLTFNPNNLTYRTPYQGQDQVTMGNGQ 403

Query: 186 GL 181
           G+
Sbjct: 404 GV 405


>KHN36156.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 1417

 Score =  281 bits (718), Expect = 3e-80
 Identities = 134/217 (61%), Positives = 181/217 (83%), Gaps = 1/217 (0%)
 Frame = -1

Query: 1177 LSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRWLVQ 998
            L++KLDEKNFLLW+QQV GVITAH LH FVVNP IP+++AS  D  L + S+E+Q+WL++
Sbjct: 1    LTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFASIEDCALGINSDEYQQWLIK 60

Query: 997  DQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKKGTK 818
            DQ LFTWLLS+LS+ +LPRV+ C+H++EVWDKIHK+F++ +K+R RQ R+ELK+TKK ++
Sbjct: 61   DQTLFTWLLSTLSDGVLPRVLSCRHAHEVWDKIHKYFNSVLKSRARQLRSELKNTKKLSR 120

Query: 817  SISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVTDIE 638
            S++EYLLRIKSIV+SL+A+GD ++EQ+ +DSILEGLPEE+NSFVM++Y   D   V D+E
Sbjct: 121  SVNEYLLRIKSIVNSLVAVGDMVSEQEQVDSILEGLPEEFNSFVMMVYSRFDTPTVEDVE 180

Query: 637  ALLMVQEAQLEKFKQELTT-SATVNIARAPPSNRPDS 530
            ALL++QEAQ EKFKQELT+ S + N+A    +N  DS
Sbjct: 181  ALLLLQEAQFEKFKQELTSPSVSANVAHT-ETNASDS 216



 Score = 71.6 bits (174), Expect = 2e-09
 Identities = 34/62 (54%), Positives = 43/62 (69%)
 Frame = -2

Query: 366 PNAYLATQQPHKEPVSQEFQISQSWYPD*GASHHLTADPQNLRQSSQFIGTDQVMMGNGQ 187
           P A+LA  QP+  P   +F  + +WYPD GASHHLT +P NL  SS + G DQV+MGNGQ
Sbjct: 308 PTAHLAMPQPYAMPNMDQFS-NGAWYPDSGASHHLTYNPNNLSYSSPYTGQDQVVMGNGQ 366

Query: 186 GL 181
           G+
Sbjct: 367 GV 368


>KHN22040.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 1417

 Score =  281 bits (718), Expect = 3e-80
 Identities = 134/217 (61%), Positives = 181/217 (83%), Gaps = 1/217 (0%)
 Frame = -1

Query: 1177 LSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRWLVQ 998
            L++KLDEKNFLLW+QQV GVITAH LH FVVNP IP+++AS  D  L + S+E+Q+WL++
Sbjct: 1    LTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFASIEDCALGINSDEYQQWLIK 60

Query: 997  DQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKKGTK 818
            DQ LFTWLLS+LS+ +LPRV+ C+H++EVWDKIHK+F++ +K+R RQ R+ELK+TKK ++
Sbjct: 61   DQTLFTWLLSTLSDGVLPRVLSCRHAHEVWDKIHKYFNSVLKSRARQLRSELKNTKKLSR 120

Query: 817  SISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVTDIE 638
            S++EYLLRIKSIV+SL+A+GD ++EQ+ +DSILEGLPEE+NSFVM++Y   D   V D+E
Sbjct: 121  SVNEYLLRIKSIVNSLVAVGDMVSEQEQVDSILEGLPEEFNSFVMMVYSRFDTPTVEDVE 180

Query: 637  ALLMVQEAQLEKFKQELTT-SATVNIARAPPSNRPDS 530
            ALL++QEAQ EKFKQELT+ S + N+A    +N  DS
Sbjct: 181  ALLLLQEAQFEKFKQELTSPSVSANVAHT-ETNASDS 216



 Score = 71.6 bits (174), Expect = 2e-09
 Identities = 34/62 (54%), Positives = 43/62 (69%)
 Frame = -2

Query: 366 PNAYLATQQPHKEPVSQEFQISQSWYPD*GASHHLTADPQNLRQSSQFIGTDQVMMGNGQ 187
           P A+LA  QP+  P   +F  + +WYPD GASHHLT +P NL  SS + G DQV+MGNGQ
Sbjct: 308 PTAHLAMPQPYAMPNMDQFS-NGAWYPDSGASHHLTYNPNNLSYSSPYTGQDQVVMGNGQ 366

Query: 186 GL 181
           G+
Sbjct: 367 GV 368


>GAU26016.1 hypothetical protein TSUD_64040 [Trifolium subterraneum]
          Length = 942

 Score =  274 bits (701), Expect = 2e-79
 Identities = 128/216 (59%), Positives = 178/216 (82%), Gaps = 1/216 (0%)
 Frame = -1

Query: 1186 THTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRW 1007
            TH++++KLDEKNFLLW+QQV GVITAH LH FVVNP IP+++A+ +DR     SEE+++W
Sbjct: 34   THSITIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPNIPLQFATVNDRIEGNTSEEYRKW 93

Query: 1006 LVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKK 827
            L +DQ LFTWLLS++S+S+LPRV+ CKHS+EVWDKIHK+F++ +K+R+RQ R+ELK+TKK
Sbjct: 94   LFKDQTLFTWLLSTISDSVLPRVLHCKHSHEVWDKIHKYFNSVLKSRIRQLRSELKNTKK 153

Query: 826  GTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVT 647
              +S+SEYLLRIKSIV+SL+A+ + ++EQ+ +D+IL+GLPE++NSFVM++Y   D   V 
Sbjct: 154  LARSVSEYLLRIKSIVNSLIAMSEVVSEQEQVDAILDGLPEDFNSFVMMVYSRFDTPTVE 213

Query: 646  DIEALLMVQEAQLEKFKQELTT-SATVNIARAPPSN 542
            DIE LLM+QEAQ EKF+QEL   + + N+A+    N
Sbjct: 214  DIEGLLMLQEAQFEKFRQELANPNVSANVAQMESKN 249



 Score = 63.5 bits (153), Expect = 5e-07
 Identities = 30/62 (48%), Positives = 39/62 (62%)
 Frame = -2

Query: 366 PNAYLATQQPHKEPVSQEFQISQSWYPD*GASHHLTADPQNLRQSSQFIGTDQVMMGNGQ 187
           P+A+LA  Q +      +   + SWYPD GASHHLT +P NL   + + G DQV MGNGQ
Sbjct: 344 PSAHLALPQHYNPIADMDTFSNASWYPDSGASHHLTFNPNNLTYRTPYQGQDQVTMGNGQ 403

Query: 186 GL 181
           G+
Sbjct: 404 GV 405


>GAU30708.1 hypothetical protein TSUD_39320 [Trifolium subterraneum]
          Length = 1432

 Score =  277 bits (709), Expect = 6e-79
 Identities = 130/217 (59%), Positives = 180/217 (82%), Gaps = 1/217 (0%)
 Frame = -1

Query: 1186 THTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRW 1007
            TH+L++KLDEKNFLLW+QQV GVIT H LH FVVNP IP+++AS +DR    +S+E+++W
Sbjct: 34   THSLTIKLDEKNFLLWSQQVNGVITTHNLHRFVVNPEIPLQFASVNDRLDGKISDEYRKW 93

Query: 1006 LVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKK 827
            L +DQ LFTWLLS++S+S+LPRV+ CKHS+EVWDKIHK+F++ +K+R+RQ R+ELK+TKK
Sbjct: 94   LFKDQTLFTWLLSTISDSVLPRVLHCKHSHEVWDKIHKYFNSVLKSRIRQLRSELKNTKK 153

Query: 826  GTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVT 647
              +S+SEYLLRIKSI++SL+A+G+ I+EQ+ ID+IL+GL EE+NSFVM++Y   D   V 
Sbjct: 154  LARSVSEYLLRIKSIINSLIAMGESISEQEQIDAILDGLSEEFNSFVMMVYSRFDNPTVE 213

Query: 646  DIEALLMVQEAQLEKFKQELTT-SATVNIARAPPSNR 539
            D+E LLM+QEAQ +KF+QELT  S + N+A+    N+
Sbjct: 214  DVEGLLMLQEAQFDKFRQELTNPSVSANVAQMDSKNQ 250



 Score = 62.8 bits (151), Expect = 1e-06
 Identities = 29/62 (46%), Positives = 39/62 (62%)
 Frame = -2

Query: 366 PNAYLATQQPHKEPVSQEFQISQSWYPD*GASHHLTADPQNLRQSSQFIGTDQVMMGNGQ 187
           P+A+LA  Q +      +   + SWYPD GASHHLT +P N+   + + G DQV MGNGQ
Sbjct: 344 PSAHLALPQYYNPTAEFDTYSNASWYPDSGASHHLTFNPNNMAYRTPYQGQDQVTMGNGQ 403

Query: 186 GL 181
           G+
Sbjct: 404 GV 405


>GAU11134.1 hypothetical protein TSUD_197580 [Trifolium subterraneum]
          Length = 1234

 Score =  268 bits (684), Expect = 7e-76
 Identities = 133/261 (50%), Positives = 184/261 (70%)
 Frame = -1

Query: 1297 MASVINSPNHSRHSSHGNPFTDHIXXXXXXXXXPISFTHTLSLKLDEKNFLLWNQQVEGV 1118
            MA    SPNH+  +   N  TD               TH+L++KLDEKNFLLWNQQV GV
Sbjct: 1    MAVSNESPNHTTETL--NHTTDVNLPPTAKNSRKSGLTHSLTIKLDEKNFLLWNQQVNGV 58

Query: 1117 ITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRWLVQDQLLFTWLLSSLSESMLPRV 938
            ITAH LH FVVNP IP++Y S  DR     S+E+Q+WL +DQ LFTWLLS++S+ +LPRV
Sbjct: 59   ITAHNLHRFVVNPQIPLQYESVEDRLDGKNSDEYQQWLFKDQSLFTWLLSTISDDVLPRV 118

Query: 937  IGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKKGTKSISEYLLRIKSIVDSLLAIG 758
            + CKHSYEVW++IHKHF++ +K+R RQ R+ELK+TKK  +S++EYL+RIKSIV+SL+A+G
Sbjct: 119  LSCKHSYEVWEQIHKHFNSVLKSRSRQLRSELKNTKKMARSVNEYLIRIKSIVNSLIAVG 178

Query: 757  DPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVTDIEALLMVQEAQLEKFKQELTTS 578
            D +++++ ++++LEGLP+E++SFVM+IY       V D+EALL+++E Q EKF+QEL   
Sbjct: 179  DVVSDKEQVEAVLEGLPKEFSSFVMMIYSQFATPKVKDVEALLLLREVQFEKFRQELANP 238

Query: 577  ATVNIARAPPSNRPDSAASSQ 515
                      SN  D A  ++
Sbjct: 239  RVSANTTQVQSNFNDEAMDTE 259



 Score = 65.1 bits (157), Expect = 2e-07
 Identities = 35/76 (46%), Positives = 43/76 (56%)
 Frame = -2

Query: 411 QPPQSNETAQATGTPPNAYLATQQPHKEPVSQEFQISQSWYPD*GASHHLTADPQNLRQS 232
           +PP  N  A+     P+A+LA  Q        +   S SWYPD GASHHLT +P N    
Sbjct: 336 RPPPYNPYAR-----PSAHLAVPQYFPSIPDTDSVSSASWYPDFGASHHLTFNPNNFAHR 390

Query: 231 SQFIGTDQVMMGNGQG 184
           + + G DQVMMGNGQG
Sbjct: 391 APYQGPDQVMMGNGQG 406


>GAU37351.1 hypothetical protein TSUD_395330 [Trifolium subterraneum]
          Length = 1216

 Score =  264 bits (674), Expect = 1e-74
 Identities = 125/201 (62%), Positives = 165/201 (82%)
 Frame = -1

Query: 1186 THTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRW 1007
            TH+L++KLDE NFLLW+QQV GVITAH LH FVVNP IP+++ S  DR     S E+Q+W
Sbjct: 31   THSLTIKLDENNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFDSIEDRANLKNSVEYQKW 90

Query: 1006 LVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKK 827
            LV+DQ LFTWLLS++S+ +LPRV+ C+H++EVWD IHK+F++ +K+R RQ R ELK+TKK
Sbjct: 91   LVKDQTLFTWLLSTISDGILPRVLSCRHAHEVWDSIHKYFNSMLKSRARQLRFELKNTKK 150

Query: 826  GTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVT 647
             + S++EYLLRIKSIV+S +A+GD +T+Q+ ID+ILEGLPEE+NSFVM++Y   D   V 
Sbjct: 151  MSCSVNEYLLRIKSIVNSPVAVGDIVTKQEQIDAILEGLPEEFNSFVMMVYSRFDTPTVE 210

Query: 646  DIEALLMVQEAQLEKFKQELT 584
            DIEALL++QE Q EKFKQEL+
Sbjct: 211  DIEALLLLQEVQFEKFKQELS 231



 Score = 51.2 bits (121), Expect(2) = 7e-07
 Identities = 24/43 (55%), Positives = 28/43 (65%)
 Frame = -2

Query: 315 EFQISQSWYPD*GASHHLTADPQNLRQSSQFIGTDQVMMGNGQ 187
           E   S +WYPD GASHHLT +P NL     + G DQV+MG GQ
Sbjct: 316 ESSTSGAWYPDSGASHHLTYNPNNLSYRVPYNGYDQVLMGIGQ 358



 Score = 32.0 bits (71), Expect(2) = 7e-07
 Identities = 15/30 (50%), Positives = 20/30 (66%)
 Frame = -3

Query: 200 WVMDKASKQILLKGTLGDDGLYCFQDIHLL 111
           +V  + S QILL+G++G DGLY FQ    L
Sbjct: 378 FVKSQGSNQILLEGSVGVDGLYKFQPFKFL 407


>KYP43730.1 hypothetical protein KK1_034810, partial [Cajanus cajan]
          Length = 363

 Score =  231 bits (589), Expect = 2e-68
 Identities = 112/238 (47%), Positives = 165/238 (69%), Gaps = 4/238 (1%)
 Frame = -1

Query: 1195 ISFTHTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEF 1016
            ++F HT+S KLD KN+LLW QQV+ VI  H+LH F+VNP IP K+ + +DR++  +SE +
Sbjct: 18   LTFAHTISEKLDTKNYLLWCQQVKPVIKGHRLHHFLVNPQIPQKFLNLADRDVGRISEPY 77

Query: 1015 QRWLVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKS 836
              W  QDQLL +WL SS+S+ ML RVIGCK S+++WDKIH +FH+HM A+ RQ R EL+S
Sbjct: 78   LAWEQQDQLLLSWLQSSMSKDMLTRVIGCKTSFQLWDKIHSYFHSHMNAKARQLRNELRS 137

Query: 835  TKKGTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPL 656
            T    ++IS+Y+L+I+++VD+L AIGD ++ ++H+D ILEGLPEEY S V LI    D L
Sbjct: 138  TNLENQTISDYVLQIQTLVDTLTAIGDSVSPKEHLDIILEGLPEEYESTVSLISSRFDLL 197

Query: 655  LVTDIEALLMVQEAQLEKFKQELTTSATVNIARAPP----SNRPDSAASSQQFPNFGQ 494
             + ++E LL+  E++L+KFK+++  S  V      P    ++   + A  +  P+F Q
Sbjct: 198  SIEEVETLLLGHESRLDKFKKKVAVSLNVTTTTLEPNLSLAHPQANLAHQENRPSFSQ 255


>KYP45646.1 hypothetical protein KK1_032760, partial [Cajanus cajan]
          Length = 202

 Score =  222 bits (565), Expect = 6e-67
 Identities = 104/196 (53%), Positives = 145/196 (73%)
 Frame = -1

Query: 1195 ISFTHTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEF 1016
            ++F HT+S KLD KN+LLW QQVE VI  H+LH F+VNP IP K+ + SD++ + VSEE+
Sbjct: 5    LTFAHTISEKLDTKNYLLWCQQVEPVIKGHRLHHFLVNPQIPPKFLTISDKDENCVSEEY 64

Query: 1015 QRWLVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKS 836
              W  QDQLL +WL SS+S+ ML  VIGCK S+++WDKIH++FHAH  A+ RQ R++L+S
Sbjct: 65   LAWEQQDQLLLSWLQSSMSKDMLTHVIGCKSSFQIWDKIHEYFHAHTNAKARQLRSDLRS 124

Query: 835  TKKGTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPL 656
            T     +IS+YLLRI+S+VDSL AIGD ++ ++H+  +L+GLPEEY S V LI    D L
Sbjct: 125  TTLDNGTISDYLLRIQSLVDSLTAIGDSVSSKEHLGIVLDGLPEEYESTVSLISSRFDVL 184

Query: 655  LVTDIEALLMVQEAQL 608
             + ++E LL+  E++L
Sbjct: 185  SIEEVETLLLAHESRL 200


>KYP37941.1 hypothetical protein KK1_040830 [Cajanus cajan]
          Length = 296

 Score =  221 bits (564), Expect = 2e-65
 Identities = 114/238 (47%), Positives = 161/238 (67%), Gaps = 6/238 (2%)
 Frame = -1

Query: 1195 ISFTHTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEF 1016
            ++F HT+S KLD KN+LL  QQVE VI  H+LH F+VNP I  K+ + S+++ + VS+E+
Sbjct: 29   LTFAHTISEKLDTKNYLLGCQQVEPVIKGHRLHHFLVNPQILPKFLTVSNKDENRVSKEY 88

Query: 1015 QRWLVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKS 836
              W  QDQLL +WL SS+S+ ML RVIGCK S+++WDKIH +FHAH  A+ RQ  ++L+S
Sbjct: 89   LAWEQQDQLLLSWLQSSMSKDMLARVIGCKSSFQIWDKIHAYFHAHTNAKARQLHSDLRS 148

Query: 835  TKKGTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPL 656
            T     +IS+YLLRI+S+VDSL AIGD ++ ++H+D +LEGLP EY S V LI    D L
Sbjct: 149  TTLDNCTISDYLLRIQSLVDSLTAIGDSVSSKEHLDIVLEGLPGEYESTVSLISSRFDVL 208

Query: 655  LVTDIEALLMVQEAQLEKFKQELTTSATVNIARAPPSNRP------DSAASSQQFPNF 500
             + ++E LL+  E +LEKFK++   S  + +  +  SN P      + A    QFP+F
Sbjct: 209  SIEEVETLLLAHEFRLEKFKKKNLISVNL-LESSSGSNTPALQPQANLAHQDSQFPSF 265


>KYP34307.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1102

 Score =  236 bits (603), Expect = 4e-65
 Identities = 111/217 (51%), Positives = 158/217 (72%)
 Frame = -1

Query: 1195 ISFTHTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEF 1016
            ++F+HT+S KLD KN+LLW QQVE VI  H+LH ++VNP IP K+A+ +DR+   +SE +
Sbjct: 34   LTFSHTISEKLDTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLADRDAGRISESY 93

Query: 1015 QRWLVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKS 836
              W  QDQLL +WL SS+S+ ML RVIGCK S+++WDKIH +FH+HM A+ RQ R EL++
Sbjct: 94   LAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNAKARQLRNELRN 153

Query: 835  TKKGTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPL 656
            T     SISEY+LRI+++VD+L AIG+ ++ ++H+D ILEGLPEEY S V LI  H D L
Sbjct: 154  TSLENLSISEYVLRIQTLVDALTAIGNSVSPKEHLDIILEGLPEEYESTVSLISSHFDLL 213

Query: 655  LVTDIEALLMVQEAQLEKFKQELTTSATVNIARAPPS 545
             + ++E LL+  E++L+KFK+++  S  V      P+
Sbjct: 214  TIDEVETLLLGHESRLDKFKKKVAASINVTTTTTEPN 250


>KYP46257.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1408

 Score =  237 bits (604), Expect = 6e-65
 Identities = 116/228 (50%), Positives = 166/228 (72%)
 Frame = -1

Query: 1195 ISFTHTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEF 1016
            ++F+HT+S KLD KN+LLW QQVE VI  H+LH ++VNP IP K+A+ +DR+   +SE +
Sbjct: 34   LTFSHTISEKLDTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLADRDAGHISESY 93

Query: 1015 QRWLVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKS 836
              W  QDQLL +WL SS+S+ ML RVIGCK S+++WDKIH +FH+HM A+ RQ R EL+S
Sbjct: 94   LAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHTYFHSHMNAKARQLRNELRS 153

Query: 835  TKKGTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPL 656
            T     SISEY+LRI+++VD+L AIGD ++ ++H+D ILEGLPEEY S V LI    D L
Sbjct: 154  TTLDNLSISEYVLRIQTLVDALTAIGDSVSPKEHLDIILEGLPEEYESTVSLISSRFDLL 213

Query: 655  LVTDIEALLMVQEAQLEKFKQELTTSATVNIARAPPSNRPDSAASSQQ 512
             + ++E LL+  E++L+KFK++   +A++N+  A     PD +A++ Q
Sbjct: 214  TIDEVETLLLGHESRLDKFKKK--AAASINVTTA--VTEPDPSATNPQ 257


>KYP33001.1 hypothetical protein KK1_046197, partial [Cajanus cajan]
          Length = 470

 Score =  225 bits (574), Expect = 7e-65
 Identities = 109/217 (50%), Positives = 153/217 (70%)
 Frame = -1

Query: 1195 ISFTHTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEF 1016
            ++F+HT+S KL  KN+LLW QQVE VI  H+LH ++VNP IP K+A+ +DR+   +SE +
Sbjct: 13   LTFSHTISEKLGTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLADRDAGCISESY 72

Query: 1015 QRWLVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKS 836
              W  QDQLL +WL SS+S+ ML RVIGCK S+++WDKIH +FH+HM A+  Q R EL S
Sbjct: 73   LAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNAKACQLRNELCS 132

Query: 835  TKKGTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPL 656
            T     SISEY+LRI+++VD+L AIGD ++ ++H+D ILEGLPEEY S + LI    D L
Sbjct: 133  TSLENLSISEYVLRIQTLVDALTAIGDSVSLKEHLDIILEGLPEEYESTMSLISSRFDLL 192

Query: 655  LVTDIEALLMVQEAQLEKFKQELTTSATVNIARAPPS 545
             + ++E LL+  E++L+KFK++      V  A   P+
Sbjct: 193  TIDEVETLLLGHESRLDKFKKKAAAYINVTTATIEPN 229


>GAU27211.1 hypothetical protein TSUD_108020 [Trifolium subterraneum]
          Length = 967

 Score =  227 bits (578), Expect = 5e-62
 Identities = 122/236 (51%), Positives = 165/236 (69%), Gaps = 4/236 (1%)
 Frame = -1

Query: 1186 THTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQRW 1007
            TH+L++KLDEKNFLLW+QQV GVITAH LH FVVNP I ++YAS +DR     SEE++ W
Sbjct: 34   THSLTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPEILLQYASIADRLDGKNSEEYKTW 93

Query: 1006 LVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKK 827
            L +DQ LFTWLLS++S+ +LPRV+ CKHS+EVW+KIHK+F++ +K+R RQ R+ELK+TKK
Sbjct: 94   LFKDQSLFTWLLSTISDGVLPRVLNCKHSHEVWEKIHKYFNSVLKSRARQLRSELKNTKK 153

Query: 826  GTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVT 647
              +S+SEYLLRIKSIV+SL+A+GD    +   +     L   +   ++  +   D   V 
Sbjct: 154  SARSMSEYLLRIKSIVNSLIAMGDMDCWKQESEKERIVLLSHFVKIIVKSFTGSDNPTVE 213

Query: 646  DIEALLMVQEAQLEKFKQELTT-SATVNIAR-APPSNRP--DSAASSQQFPNFGQY 491
            DIE LL++QEAQ EKF+QEL   S + N+A+    SN P  D      + P+F  Y
Sbjct: 214  DIEGLLLLQEAQFEKFRQELANPSVSTNVAQMETQSNSPNMDLEGPPSRPPHFNPY 269


>KHN49021.1 hypothetical protein glysoja_031232, partial [Glycine soja]
          Length = 323

 Score =  209 bits (532), Expect = 2e-60
 Identities = 106/235 (45%), Positives = 158/235 (67%), Gaps = 6/235 (2%)
 Frame = -1

Query: 1192 SFTHTLSLKLDEKNFLLWNQQVEGVITAHKLHLFVVNPMIPVKYASESDRELDLVSEEFQ 1013
            SF + +S+KLD  N+L+W QQ+E V+ AH+LH F V P IP +YASE DR  ++ +  F 
Sbjct: 14   SFNYKISVKLDATNYLVWLQQIEPVLRAHRLHRFCVTPEIPPQYASEHDRLANIENPAFS 73

Query: 1012 RWLVQDQLLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKST 833
             W +QDQLL  WL SSLS ++LP VIGCKH++++W+ IH+ F +  KA+ RQ R +L++T
Sbjct: 74   NWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLRTT 133

Query: 832  KKGTKSISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLL 653
            KKG+ SISE+L +IK I DSL +IG+ ++ QD +D ILEGLP E+ S V LI   I+   
Sbjct: 134  KKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFESLVTLINSKIEWFD 193

Query: 652  VTDIEALLMVQEAQLEKFKQELTTSATVNIARAPPSNR------PDSAASSQQFP 506
            + +I ALL+  E +L+K  +    +A++N  ++ P+++      P+SA  +Q  P
Sbjct: 194  LEEIRALLLAHEQRLDK-ARITEEAASLNFTQSQPNSKIPNSVNPNSATETQIAP 247


>KYP36809.1 hypothetical protein KK1_042047 [Cajanus cajan]
          Length = 280

 Score =  207 bits (528), Expect = 2e-60
 Identities = 99/169 (58%), Positives = 140/169 (82%), Gaps = 2/169 (1%)
 Frame = -1

Query: 991 LLFTWLLSSLSESMLPRVIGCKHSYEVWDKIHKHFHAHMKARVRQFRAELKSTKKG-TKS 815
           +LF+WLLSSLSES+LPRV+GCKHSYE+WDKIHKH+++H+ A+ RQ R+ELKS+KKG ++ 
Sbjct: 1   MLFSWLLSSLSESVLPRVLGCKHSYEIWDKIHKHYYSHLHAKKRQLRSELKSSKKGPSQP 60

Query: 814 ISEYLLRIKSIVDSLLAIGDPITEQDHIDSILEGLPEEYNSFVMLIYGHIDPLLVTDIEA 635
           ISEY+LRI+ I++SL+ +GD +T+QD ID+IL+GLPE+YNSF+M+IYG  D + VTD+E+
Sbjct: 61  ISEYILRIREIINSLIVVGDLVTDQDQIDTILDGLPEDYNSFIMMIYGRSDSISVTDVES 120

Query: 634 LLMVQEAQLEKFKQELTT-SATVNIARAPPSNRPDSAASSQQFPNFGQY 491
           LL+VQEAQLEK++Q+LT+ S +VN+ + P  ++  S   SQ   N G +
Sbjct: 121 LLLVQEAQLEKYRQDLTSPSVSVNVVQGPQDSQFQS--QSQFVSNRGGF 167

