BLASTX nr result

ID: Glycyrrhiza35_contig00002445 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00002445
         (1890 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU15285.1 hypothetical protein TSUD_03520 [Trifolium subterraneum]   352   e-113
GAU26016.1 hypothetical protein TSUD_64040 [Trifolium subterraneum]   338   e-101
GAU29238.1 hypothetical protein TSUD_362280 [Trifolium subterran...   334   5e-97
KHN36156.1 Retrovirus-related Pol polyprotein from transposon TN...   334   7e-97
KHN22040.1 Retrovirus-related Pol polyprotein from transposon TN...   334   7e-97
GAU19342.1 hypothetical protein TSUD_336290 [Trifolium subterran...   332   3e-96
GAU30708.1 hypothetical protein TSUD_39320 [Trifolium subterraneum]   330   2e-95
GAU11134.1 hypothetical protein TSUD_197580 [Trifolium subterran...   316   3e-91
GAU20748.1 hypothetical protein TSUD_231620 [Trifolium subterran...   274   2e-83
GAU44321.1 hypothetical protein TSUD_305020 [Trifolium subterran...   277   1e-82
KYP36809.1 hypothetical protein KK1_042047 [Cajanus cajan]            253   5e-76
KYP36193.1 hypothetical protein KK1_042704 [Cajanus cajan]            236   2e-70
KYP61342.1 Retrovirus-related Pol polyprotein from transposon TN...   256   8e-70
KYP46257.1 Retrovirus-related Pol polyprotein from transposon TN...   256   2e-69
KYP40244.1 Retrovirus-related Pol polyprotein from transposon TN...   248   2e-69
KYP33001.1 hypothetical protein KK1_046197, partial [Cajanus cajan]   241   4e-69
KYP34307.1 Retrovirus-related Pol polyprotein from transposon TN...   253   6e-69
GAU37351.1 hypothetical protein TSUD_395330 [Trifolium subterran...   253   6e-69
KHN49021.1 hypothetical protein glysoja_031232, partial [Glycine...   219   1e-62
GAU27211.1 hypothetical protein TSUD_108020 [Trifolium subterran...   232   3e-62

>GAU15285.1 hypothetical protein TSUD_03520 [Trifolium subterraneum]
          Length = 392

 Score =  352 bits (904), Expect = e-113
 Identities = 181/377 (48%), Positives = 244/377 (64%), Gaps = 3/377 (0%)
 Frame = -1

Query: 1431 VQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRT 1252
            V+ +  + ++ + +  LKLDD N+LLW+QQVE VI A+KLHRFVVNP IP +YA+E DR 
Sbjct: 13   VESTGSSTNAASFTPKLKLDDGNYLLWSQQVEGVILANKLHRFVVNPQIPAKYASESDRE 72

Query: 1251 LDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVR 1072
            LD  +E Y +W VQDQMLFTWLLS+L++S+LPR IGC+H++QVWD++HKHF   LKAKVR
Sbjct: 73   LDRVSEAYDKWLVQDQMLFTWLLSTLAESVLPRTIGCRHAFQVWDQIHKHFEAHLKAKVR 132

Query: 1071 SLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMM 892
             LRSELK  KKGT+SI+E+VLR++ I+D+LI+IG+ ISEQDQ+D+IL+GLPE+Y PF+MM
Sbjct: 133  QLRSELKNVKKGTKSITEFVLRVRVIADTLISIGDSISEQDQIDSILEGLPEEYNPFVMM 192

Query: 891  MYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNSXXX 712
            +YGRSD PS+ D+E LLLVQE+Q +K++ ELST S S N+A   G R ++  +       
Sbjct: 193  IYGRSDSPSLYDIEGLLLVQESQLEKFRQELSTPSASANLAHSRGGRGNSGARG------ 246

Query: 711  XXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXX 532
                                 G RPTCQ+C +YGH    CW RFD+++V P P   +   
Sbjct: 247  -----RGRSTRGRGRPAASPTGNRPTCQLCGKYGHHVIDCWYRFDENFV-PAPNSSVLKS 300

Query: 531  XXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSGASHHVTADSQ 352
                                      +A  TQE+ +P     Q W+PDSGASHH+TAD+ 
Sbjct: 301  DTSGPKTNHESPQACTAN--------FAPATQELVIP-----QSWFPDSGASHHITADAS 347

Query: 351  HLTQSVPFN---GSDQV 310
            +L Q +      G DQ+
Sbjct: 348  NLAQDLSVQHVPGIDQI 364


>GAU26016.1 hypothetical protein TSUD_64040 [Trifolium subterraneum]
          Length = 942

 Score =  338 bits (868), Expect = e-101
 Identities = 179/383 (46%), Positives = 236/383 (61%), Gaps = 12/383 (3%)
 Frame = -1

Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219
            THS+++KLD+KNFLLW+QQV  VI AH LHRFVVNPNIPL++A   DR     +EEY++W
Sbjct: 34   THSITIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPNIPLQFATVNDRIEGNTSEEYRKW 93

Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039
              +DQ LFTWLLS++SDS+LPRV+ CKHS +VWDK+HK+F + LK+++R LRSELK TKK
Sbjct: 94   LFKDQTLFTWLLSTISDSVLPRVLHCKHSHEVWDKIHKYFNSVLKSRIRQLRSELKNTKK 153

Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859
              RS+SEY+LRIK+I +SLIA+ E +SEQ+QVDAILDGLPED+  F+MM+Y R D P+V 
Sbjct: 154  LARSVSEYLLRIKSIVNSLIAMSEVVSEQEQVDAILDGLPEDFNSFVMMVYSRFDTPTVE 213

Query: 858  DVESLLLVQEAQFDKYKSELSTGSVSINVAQGPG-----NRESADEQNFN----SXXXXX 706
            D+E LL++QEAQF+K++ EL+  +VS NVAQ        N+++ D ++ N    S     
Sbjct: 214  DIEGLLMLQEAQFEKFRQELANPNVSANVAQMESKNHHSNQDTEDTESVNESYGSNTYRG 273

Query: 705  XXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXX 526
                                 +  CQIC ++ HDA  CW R+D        PP       
Sbjct: 274  RGRGKGRARGRDKAPITPNAGKVQCQICAKHNHDAANCWYRYD--------PPS---SRY 322

Query: 525  XXXXXXXXXXXXXXXXXXXXAPRVYAATTQE---VQVPRMFDSQVWYPDSGASHHVTADS 355
                                 P  + A  Q    +     F +  WYPDSGASHH+T + 
Sbjct: 323  NARGYNAGSTSRQPQYNPYPRPSAHLALPQHYNPIADMDTFSNASWYPDSGASHHLTFNP 382

Query: 354  QHLTQSVPFNGSDQVLMGNGQGV 286
             +LT   P+ G DQV MGNGQGV
Sbjct: 383  NNLTYRTPYQGQDQVTMGNGQGV 405


>GAU29238.1 hypothetical protein TSUD_362280 [Trifolium subterraneum]
          Length = 1433

 Score =  334 bits (857), Expect = 5e-97
 Identities = 175/380 (46%), Positives = 232/380 (61%), Gaps = 9/380 (2%)
 Frame = -1

Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219
            THSL++KLD+KNFLLW+QQV  VI AH LHRFVVNP IPL++A+  DR     ++EY++W
Sbjct: 34   THSLTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPEIPLQFASVNDRIEGKTSDEYRKW 93

Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039
              +DQ LFTWLLS++SD++LPRV+ CKHS +VWDK+HK+F + LK+++R LRSELK TKK
Sbjct: 94   LFKDQTLFTWLLSTISDAVLPRVVHCKHSHEVWDKIHKYFNSVLKSRIRQLRSELKNTKK 153

Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859
              R +SEY+LRIK+I +SLIA+GE ++EQ+Q+DAIL+GLPED+  F+MMMY R D P+V 
Sbjct: 154  LARPVSEYLLRIKSIVNSLIAMGETVTEQEQIDAILEGLPEDFNSFVMMMYSRFDTPTVE 213

Query: 858  DVESLLLVQEAQFDKYKSELSTGSVSINVAQ-GPGNRESADE--------QNFNSXXXXX 706
            D+E LL++QEAQF+K++ EL+  SVS N+AQ G  N +S  E        +++NS     
Sbjct: 214  DIEGLLMLQEAQFEKFRQELTNPSVSANIAQIGSKNHQSNSEAEDTESGNESYNSNSYRG 273

Query: 705  XXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXX 526
                                 +  CQIC +  HDA  CW R    Y  P           
Sbjct: 274  RGRGKGRARGRGRAPNAPNTGKVQCQICGKANHDAAICWYR----YEPPSSRSNACGHNA 329

Query: 525  XXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSGASHHVTADSQHL 346
                                 P+ Y        V     +  WYPDSGASHH+T +  +L
Sbjct: 330  GSSSRPPPYNPYPRPSAHLALPQYYNPIADMDSV----SNASWYPDSGASHHLTFNPNNL 385

Query: 345  TQSVPFNGSDQVLMGNGQGV 286
            T   P+ G DQV MGNGQGV
Sbjct: 386  TYRTPYQGQDQVTMGNGQGV 405


>KHN36156.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 1417

 Score =  334 bits (856), Expect = 7e-97
 Identities = 170/378 (44%), Positives = 237/378 (62%), Gaps = 10/378 (2%)
 Frame = -1

Query: 1389 LSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRWFVQ 1210
            L++KLD+KNFLLW+QQV  VI AH LHRFVVNP IPL++A+  D  L I ++EYQ+W ++
Sbjct: 1    LTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFASIEDCALGINSDEYQQWLIK 60

Query: 1209 DQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKKGTR 1030
            DQ LFTWLLS+LSD +LPRV+ C+H+ +VWDK+HK+F + LK++ R LRSELK TKK +R
Sbjct: 61   DQTLFTWLLSTLSDGVLPRVLSCRHAHEVWDKIHKYFNSVLKSRARQLRSELKNTKKLSR 120

Query: 1029 SISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVADVE 850
            S++EY+LRIK+I +SL+A+G+ +SEQ+QVD+IL+GLPE++  F+MM+Y R D P+V DVE
Sbjct: 121  SVNEYLLRIKSIVNSLVAVGDMVSEQEQVDSILEGLPEEFNSFVMMVYSRFDTPTVEDVE 180

Query: 849  SLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFN--------SXXXXXXXXX 694
            +LLL+QEAQF+K+K EL++ SVS NVA    N   ++ ++ +        +         
Sbjct: 181  ALLLLQEAQFEKFKQELTSPSVSANVAHTETNASDSNSEHESQELGTEHYNVNANRGRGR 240

Query: 693  XXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXXXXXX 514
                             +  CQIC +  HDA  CW R+D   +  +              
Sbjct: 241  GKGRGRGRGKGQAQNQGKVKCQICAKPNHDAINCWYRYDPQAMNQN----------SRGG 290

Query: 513  XXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRM--FDSQVWYPDSGASHHVTADSQHLTQ 340
                             P  + A  Q   +P M  F +  WYPDSGASHH+T +  +L+ 
Sbjct: 291  YQVGPSNRPQNFNPYMRPTAHLAMPQPYAMPNMDQFSNGAWYPDSGASHHLTYNPNNLSY 350

Query: 339  SVPFNGSDQVLMGNGQGV 286
            S P+ G DQV+MGNGQGV
Sbjct: 351  SSPYTGQDQVVMGNGQGV 368


>KHN22040.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 1417

 Score =  334 bits (856), Expect = 7e-97
 Identities = 170/378 (44%), Positives = 237/378 (62%), Gaps = 10/378 (2%)
 Frame = -1

Query: 1389 LSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRWFVQ 1210
            L++KLD+KNFLLW+QQV  VI AH LHRFVVNP IPL++A+  D  L I ++EYQ+W ++
Sbjct: 1    LTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFASIEDCALGINSDEYQQWLIK 60

Query: 1209 DQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKKGTR 1030
            DQ LFTWLLS+LSD +LPRV+ C+H+ +VWDK+HK+F + LK++ R LRSELK TKK +R
Sbjct: 61   DQTLFTWLLSTLSDGVLPRVLSCRHAHEVWDKIHKYFNSVLKSRARQLRSELKNTKKLSR 120

Query: 1029 SISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVADVE 850
            S++EY+LRIK+I +SL+A+G+ +SEQ+QVD+IL+GLPE++  F+MM+Y R D P+V DVE
Sbjct: 121  SVNEYLLRIKSIVNSLVAVGDMVSEQEQVDSILEGLPEEFNSFVMMVYSRFDTPTVEDVE 180

Query: 849  SLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFN--------SXXXXXXXXX 694
            +LLL+QEAQF+K+K EL++ SVS NVA    N   ++ ++ +        +         
Sbjct: 181  ALLLLQEAQFEKFKQELTSPSVSANVAHTETNASDSNSEHESQELGTEHYNVNANRGRGR 240

Query: 693  XXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXXXXXX 514
                             +  CQIC +  HDA  CW R+D   +  +              
Sbjct: 241  GKGRGRGRGKGQAQNQGKVKCQICAKPNHDAINCWYRYDPQAMNQN----------SRGG 290

Query: 513  XXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRM--FDSQVWYPDSGASHHVTADSQHLTQ 340
                             P  + A  Q   +P M  F +  WYPDSGASHH+T +  +L+ 
Sbjct: 291  YQVGPSNRPQNFNPYMRPTAHLAMPQPYAMPNMDQFSNGAWYPDSGASHHLTYNPNNLSY 350

Query: 339  SVPFNGSDQVLMGNGQGV 286
            S P+ G DQV+MGNGQGV
Sbjct: 351  SSPYTGQDQVVMGNGQGV 368


>GAU19342.1 hypothetical protein TSUD_336290 [Trifolium subterraneum]
          Length = 1442

 Score =  332 bits (852), Expect = 3e-96
 Identities = 172/392 (43%), Positives = 239/392 (60%), Gaps = 9/392 (2%)
 Frame = -1

Query: 1434 PVQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDR 1255
            P      A    THSL++KLD+KN+LLWNQQV  VI AH LHRF+VNP IP+++A++ DR
Sbjct: 22   PQTSKDSAKSGLTHSLTIKLDEKNYLLWNQQVNGVITAHDLHRFIVNPQIPIQFASDADR 81

Query: 1254 TLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKV 1075
              D  ++EY++W  +DQ LFTWLLS+LSDS+LPRV+GCKH++QVWD++HK+F++ L+A+ 
Sbjct: 82   VADRTSDEYRQWIFKDQTLFTWLLSTLSDSVLPRVLGCKHAFQVWDQIHKYFHSVLQARA 141

Query: 1074 RSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIM 895
            R LRSELK TKK +RS+ EY+LRIK+I +SL+A+G+ +S+++QVDAIL+GLPE++  F+M
Sbjct: 142  RQLRSELKNTKKASRSVGEYLLRIKSIVNSLLAVGDLVSDREQVDAILEGLPEEFNSFVM 201

Query: 894  MMYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVA--QGPGNRESADE----- 736
            M+Y R D P+V DVE+LLL+QEAQF+K++ EL++ SVS +VA      +  S D+     
Sbjct: 202  MVYSRFDTPTVEDVEALLLLQEAQFEKFRQELASPSVSAHVALTDSKMSDNSVDQDSHEV 261

Query: 735  --QNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQ 562
              +++ +                             CQIC +  HDA  CW R+      
Sbjct: 262  GTEHYVAGKGRGRGKGRGKGRSRGRGSYSGGNQGTQCQICSKSSHDAVNCWYRY------ 315

Query: 561  PDPPPEIXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSG 382
              P P +                          P  + A       P       WYPDSG
Sbjct: 316  -HPSPSM----MNAPRGHAVAHSRPPPYNPPMRPSAHLALPYYTGAP---SEASWYPDSG 367

Query: 381  ASHHVTADSQHLTQSVPFNGSDQVLMGNGQGV 286
            ASHH+T D  +L  S P+ G DQV+MGNGQGV
Sbjct: 368  ASHHLTYDPYNLVHSNPYTGHDQVMMGNGQGV 399


>GAU30708.1 hypothetical protein TSUD_39320 [Trifolium subterraneum]
          Length = 1432

 Score =  330 bits (845), Expect = 2e-95
 Identities = 182/417 (43%), Positives = 245/417 (58%), Gaps = 12/417 (2%)
 Frame = -1

Query: 1434 PVQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDR 1255
            P      +    THSL++KLD+KNFLLW+QQV  VI  H LHRFVVNP IPL++A+  DR
Sbjct: 22   PTTAKDSSKSGLTHSLTIKLDEKNFLLWSQQVNGVITTHNLHRFVVNPEIPLQFASVNDR 81

Query: 1254 TLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKV 1075
                 ++EY++W  +DQ LFTWLLS++SDS+LPRV+ CKHS +VWDK+HK+F + LK+++
Sbjct: 82   LDGKISDEYRKWLFKDQTLFTWLLSTISDSVLPRVLHCKHSHEVWDKIHKYFNSVLKSRI 141

Query: 1074 RSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIM 895
            R LRSELK TKK  RS+SEY+LRIK+I +SLIA+GE ISEQ+Q+DAILDGL E++  F+M
Sbjct: 142  RQLRSELKNTKKLARSVSEYLLRIKSIINSLIAMGESISEQEQIDAILDGLSEEFNSFVM 201

Query: 894  MMYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVAQGPG-----NRESADEQN 730
            M+Y R D P+V DVE LL++QEAQFDK++ EL+  SVS NVAQ        N+E  D ++
Sbjct: 202  MVYSRFDNPTVEDVEGLLMLQEAQFDKFRQELTNPSVSANVAQMDSKNQHPNQEVEDTES 261

Query: 729  FNSXXXXXXXXXXXXXXXXXXXXXXXXGP----RPTCQICFRYGHDAFRCWNRFDQDYVQ 562
             N                               +  CQIC +  HDA  CW R++     
Sbjct: 262  GNEHYTFNTYRGKGRGRGKAKARGKAPNALNNGKVQCQICSKSNHDAANCWYRYE----- 316

Query: 561  PDPPPEIXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFD---SQVWYP 391
               PP                            P  + A  Q       FD   +  WYP
Sbjct: 317  ---PPS---SRTNGRGYNAGNTSRPPLYNPYPRPSAHLALPQYYNPTAEFDTYSNASWYP 370

Query: 390  DSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQGVQ*STSSRYFRSRWSVLFPATEI 220
            DSGASHH+T +  ++    P+ G DQV MGNGQGV  ST+S  + + ++   P+ ++
Sbjct: 371  DSGASHHLTFNPNNMAYRTPYQGQDQVTMGNGQGV--STASLGYSNFYAPNNPSVQL 425


>GAU11134.1 hypothetical protein TSUD_197580 [Trifolium subterraneum]
          Length = 1234

 Score =  316 bits (810), Expect = 3e-91
 Identities = 162/399 (40%), Positives = 231/399 (57%), Gaps = 29/399 (7%)
 Frame = -1

Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219
            THSL++KLD+KNFLLWNQQV  VI AH LHRFVVNP IPL+Y +  DR     ++EYQ+W
Sbjct: 36   THSLTIKLDEKNFLLWNQQVNGVITAHNLHRFVVNPQIPLQYESVEDRLDGKNSDEYQQW 95

Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039
              +DQ LFTWLLS++SD +LPRV+ CKHS++VW+++HKHF + LK++ R LRSELK TKK
Sbjct: 96   LFKDQSLFTWLLSTISDDVLPRVLSCKHSYEVWEQIHKHFNSVLKSRSRQLRSELKNTKK 155

Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859
              RS++EY++RIK+I +SLIA+G+ +S+++QV+A+L+GLP+++  F+MM+Y +   P V 
Sbjct: 156  MARSVNEYLIRIKSIVNSLIAVGDVVSDKEQVEAVLEGLPKEFSSFVMMIYSQFATPKVK 215

Query: 858  DVESLLLVQEAQFDKYKSELSTGSVSINVAQGPGN---------RESADEQNFNSXXXXX 706
            DVE+LLL++E QF+K++ EL+   VS N  Q   N          + +  +++N      
Sbjct: 216  DVEALLLLREVQFEKFRQELANPRVSANTTQVQSNFNDEAMDTETQESGTEHYNVSANRG 275

Query: 705  XXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPD---------- 556
                                 +  CQIC +  HDA  CW+R++    +P+          
Sbjct: 276  KGRGKGRGRGRGRASNPQNSGKVQCQICGKLNHDALNCWHRYEPQSTKPNSCGYHAPSGS 335

Query: 555  -PPPEIXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMF---------DS 406
             PPP                               YA  +  + VP+ F          S
Sbjct: 336  RPPPY----------------------------NPYARPSAHLAVPQYFPSIPDTDSVSS 367

Query: 405  QVWYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQG 289
              WYPD GASHH+T +  +     P+ G DQV+MGNGQG
Sbjct: 368  ASWYPDFGASHHLTFNPNNFAHRAPYQGPDQVMMGNGQG 406


>GAU20748.1 hypothetical protein TSUD_231620 [Trifolium subterraneum]
          Length = 327

 Score =  274 bits (700), Expect = 2e-83
 Identities = 130/237 (54%), Positives = 178/237 (75%), Gaps = 11/237 (4%)
 Frame = -1

Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219
            THSL++KLD+KNFL W+QQV  VI AH LHRF+VNP IPL++A   DR     ++EY++W
Sbjct: 34   THSLTIKLDEKNFLSWSQQVNGVITAHNLHRFIVNPEIPLQFATVADRIDGKTSDEYRKW 93

Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039
              +DQ LFTWLLS++SD +LPRV+ CKH+ +VWDK+HK+F + LK+++R L+SELK TKK
Sbjct: 94   IFKDQTLFTWLLSTISDVVLPRVVHCKHAHEVWDKIHKYFNSVLKSRIRQLKSELKNTKK 153

Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859
              R +SEY+LRIK+I +SLIA+GE I+EQ+QV+AILDGLPE++  F+MM+Y R D P++ 
Sbjct: 154  LARPVSEYLLRIKSIVNSLIAMGEMITEQEQVEAILDGLPEEFNSFVMMVYSRFDTPTIE 213

Query: 858  DVESLLLVQEAQFDKYKSELSTGSVSINVAQ-----------GPGNRESADEQNFNS 721
             VE LL++QEAQF+K++ EL+  SVS NVAQ           G  N    ++ NFN+
Sbjct: 214  YVEGLLMIQEAQFEKFRQELTNPSVSANVAQMESKNNQANQDGEDNESDTEQYNFNA 270


>GAU44321.1 hypothetical protein TSUD_305020 [Trifolium subterraneum]
          Length = 468

 Score =  277 bits (708), Expect = 1e-82
 Identities = 131/220 (59%), Positives = 169/220 (76%)
 Frame = -1

Query: 1434 PVQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDR 1255
            P           THSL++KLD+KNFLLW+QQ+  VI  H LHRFVVNP IPL++A+  DR
Sbjct: 246  PTTTKDSTKSGLTHSLTIKLDEKNFLLWSQQINGVITTHNLHRFVVNPEIPLQFASVNDR 305

Query: 1254 TLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKV 1075
                 +EEYQ+W   DQ LFTWLLS++SDSILPRV+ CKH+ +VWDK+HKHF + LK+++
Sbjct: 306  LNGKISEEYQKWLFIDQTLFTWLLSTISDSILPRVLHCKHAHEVWDKIHKHFNSVLKSRI 365

Query: 1074 RSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIM 895
            R LR ELK TKK  R ISEY+LRIK+I +SLIA+GE +SEQ+QV+ ILDGLPE++ PF+M
Sbjct: 366  RQLRFELKNTKKLARPISEYLLRIKSIINSLIALGEAVSEQEQVNVILDGLPEEFNPFVM 425

Query: 894  MMYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSIN 775
            M+Y R D P+V DVE LL++QEAQF+K++ EL+  SVS N
Sbjct: 426  MVYSRYDTPTVEDVEGLLMLQEAQFEKFRQELTNPSVSAN 465


>KYP36809.1 hypothetical protein KK1_042047 [Cajanus cajan]
          Length = 280

 Score =  253 bits (646), Expect = 5e-76
 Identities = 124/243 (51%), Positives = 164/243 (67%), Gaps = 22/243 (9%)
 Frame = -1

Query: 1203 MLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKKG-TRS 1027
            MLF+WLLSSLS+S+LPRV+GCKHS+++WDK+HKH+Y+ L AK R LRSELK++KKG ++ 
Sbjct: 1    MLFSWLLSSLSESVLPRVLGCKHSYEIWDKIHKHYYSHLHAKKRQLRSELKSSKKGPSQP 60

Query: 1026 ISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVADVES 847
            ISEY+LRI+ I +SLI +G+ +++QDQ+D ILDGLPEDY  FIMM+YGRSD  SV DVES
Sbjct: 61   ISEYILRIREIINSLIVVGDLVTDQDQIDTILDGLPEDYNSFIMMIYGRSDSISVTDVES 120

Query: 846  LLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNS------------------ 721
            LLLVQEAQ +KY+ +L++ SVS+NV QGP + +   +  F S                  
Sbjct: 121  LLLVQEAQLEKYRQDLTSPSVSVNVVQGPQDSQFQSQSQFVSNRGGFQFSTRGGRYRGGR 180

Query: 720  ---XXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPP 550
                                       G RPTCQ+C++YGHDAF CWNRFD+ ++QP  P
Sbjct: 181  YRCRGRNRCGGRYRGGRYRGRGRNRGGGQRPTCQLCYKYGHDAFHCWNRFDEAFIQPSQP 240

Query: 549  PEI 541
            P +
Sbjct: 241  PNL 243


>KYP36193.1 hypothetical protein KK1_042704 [Cajanus cajan]
          Length = 221

 Score =  236 bits (602), Expect = 2e-70
 Identities = 116/212 (54%), Positives = 151/212 (71%), Gaps = 5/212 (2%)
 Frame = -1

Query: 1203 MLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKKG-TRS 1027
            MLF+WLLSSLS+ +LPRV+GCKHS+++WDK+HKH+Y+ L AK R LRSELK++K G ++ 
Sbjct: 1    MLFSWLLSSLSEPVLPRVLGCKHSYEIWDKIHKHYYSHLHAKKRQLRSELKSSKIGPSQP 60

Query: 1026 ISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVADVES 847
            ISEY+LRI+AI +SLIA+GE ++ QDQ+D ILDGLPEDY  F+MM+YGRSD  SV DVES
Sbjct: 61   ISEYILRIRAIINSLIAVGELVTNQDQIDTILDGLPEDYNSFVMMIYGRSDSISVTDVES 120

Query: 846  LLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNS----XXXXXXXXXXXXXX 679
            LLLVQEAQ +KY+ +L++ SVS+N  QGP +     +  F S                  
Sbjct: 121  LLLVQEAQLEKYRQDLTSPSVSVNAVQGPQDSHFQSQSQFVSNRGGFQFSNRGGRYCGGR 180

Query: 678  XXXXXXXXXXGPRPTCQICFRYGHDAFRCWNR 583
                      G RPTCQ+C++YG+DAF CWNR
Sbjct: 181  YRGRGRNRGGGQRPTCQLCYKYGYDAFHCWNR 212


>KYP61342.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1358

 Score =  256 bits (655), Expect = 8e-70
 Identities = 148/380 (38%), Positives = 208/380 (54%)
 Frame = -1

Query: 1422 SSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDI 1243
            SSP    F++S++ KLDD N+L W QQ+E VI +HKL RFVVNP IP RY  + DR  DI
Sbjct: 7    SSPFSQFFSNSIAEKLDDSNYLHWRQQIEPVIKSHKLQRFVVNPQIPPRYLTDADRDSDI 66

Query: 1242 ETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLR 1063
                Y+ W VQDQML TWL S+LS SIL RVIG  HS+QVWDKVH++F+TQ KA+ R LR
Sbjct: 67   VNPAYETWEVQDQMLLTWLQSTLSKSILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLR 126

Query: 1062 SELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYG 883
            ++L++T    +S+ +++ +IK I+D L  +G P+S ++ VDA+L+GLP++Y P + ++  
Sbjct: 127  TDLRSTTLDGQSMRDFLTQIKTIADELAGVGSPVSLEEYVDAVLEGLPQEYAPVVSVIES 186

Query: 882  RSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNSXXXXXX 703
            +   P +A+VE+LLL  E++ ++++ +  + S SIN  QG             S      
Sbjct: 187  KFVTPPIAEVEALLLAHESRANRFRKQ--SFSPSINYTQG------------YSRGSVSG 232

Query: 702  XXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXXX 523
                                   CQICF+YGH A  C+ R D +Y QP            
Sbjct: 233  GHSGRRGGRGSGRGRGGRFANFQCQICFKYGHTANVCFYRADVNY-QP------------ 279

Query: 522  XXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSGASHHVTADSQHLT 343
                               A  +  A       P    S  W PDSGAS HVT + Q++ 
Sbjct: 280  -------------------AESLVLAMVANTSQPGANSS--WIPDSGASFHVTGEPQNIH 318

Query: 342  QSVPFNGSDQVLMGNGQGVQ 283
            Q   F+G DQ+ +GNGQG+Q
Sbjct: 319  QLEHFDGPDQIFIGNGQGLQ 338


>KYP46257.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1408

 Score =  256 bits (653), Expect = 2e-69
 Identities = 149/399 (37%), Positives = 215/399 (53%), Gaps = 14/399 (3%)
 Frame = -1

Query: 1440 ISPVQPSSPAPHS-----FTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLR 1276
            ++ V P +P  +S     F+H++S KLD KN+LLW QQVE VI  H+LH ++VNP IP +
Sbjct: 18   VNTVPPKNPPSNSHPSLTFSHTISEKLDTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQK 77

Query: 1275 YANEMDRTLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFY 1096
            +A   DR     +E Y  W  QDQ+L +WL SS+S  +L RVIGCK S+Q+WDK+H +F+
Sbjct: 78   FATLADRDAGHISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHTYFH 137

Query: 1095 TQLKAKVRSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPE 916
            + + AK R LR+EL++T     SISEYVLRI+ + D+L AIG+ +S ++ +D IL+GLPE
Sbjct: 138  SHMNAKARQLRNELRSTTLDNLSISEYVLRIQTLVDALTAIGDSVSPKEHLDIILEGLPE 197

Query: 915  DYGPFIMMMYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVA------QGPGN 754
            +Y   + ++  R D  ++ +VE+LLL  E++ DK+K + +  S+++  A           
Sbjct: 198  EYESTVSLISSRFDLLTIDEVETLLLGHESRLDKFKKK-AAASINVTTAVTEPDPSATNP 256

Query: 753  RESADEQNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPR---PTCQICFRYGHDAFRCWNR 583
            +     QN  S                          R     CQ+C RYGH A  C+ R
Sbjct: 257  QAHLTHQNNQSGPSHRRGGRTNSRGGRFSNWAGRGRGRFAGYQCQVCHRYGHVASACYYR 316

Query: 582  FDQDYVQPDPPPEIXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQ 403
            FD+ YV P  P E                              Y +  Q        ++ 
Sbjct: 317  FDETYV-PSSPLEAP---------------------------AYPSNNQHTNPGACNNN- 347

Query: 402  VWYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQGV 286
             WYPDSGAS+HVT  SQ++ Q  PF G DQ+ +GNGQG+
Sbjct: 348  -WYPDSGASNHVTNVSQNIHQFTPFEGPDQIHVGNGQGL 385


>KYP40244.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 720

 Score =  248 bits (634), Expect = 2e-69
 Identities = 139/387 (35%), Positives = 209/387 (54%)
 Frame = -1

Query: 1419 SPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIE 1240
            SP    F++S++ KLDD N+L W QQ++ +I +HKL RFVVNP IP RY  + DR  DI 
Sbjct: 8    SPFSQFFSNSIAEKLDDSNYLHWRQQIKPIIKSHKLQRFVVNPQIPPRYLTDADRDYDIV 67

Query: 1239 TEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRS 1060
               Y+ W VQDQML TWL S LS +IL RVIG  HS+QVWDKVH++F+TQ KA+ R LR+
Sbjct: 68   NPAYETWEVQDQMLLTWLQSMLSKTILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLRT 127

Query: 1059 ELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGR 880
            +L++T    +S+ +++ +IK I+D L  +G P+S ++ VD +L+GLP++Y P + ++  +
Sbjct: 128  DLRSTTLDGKSMRDFLTQIKNIADQLAGVGSPMSLEEYVDVVLEGLPQEYTPVVSVIESK 187

Query: 879  SDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNSXXXXXXX 700
               P +A+VE+LLL  E++ ++++ +  + S SIN  QG  +R S   ++F         
Sbjct: 188  FVTPPIAEVEALLLAHESRVNRFRKQ--SFSPSINYTQG-YSRGSISGESFRDRDGGHSG 244

Query: 699  XXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXXXX 520
                                  CQ CF+YGH A  C+ R D +Y   +            
Sbjct: 245  CRGGQGSGRGRGGRFANF---HCQNCFKYGHTANVCFYRADVNYQLVE------------ 289

Query: 519  XXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSGASHHVTADSQHLTQ 340
                                  +          +   +  W PDSGAS H+T + Q++ Q
Sbjct: 290  ----------------------FLVLAMVANTSQAGANSSWIPDSGASFHITGEPQNIHQ 327

Query: 339  SVPFNGSDQVLMGNGQGVQ*STSSRYF 259
               F+G DQ+ +GNGQG+Q + S   F
Sbjct: 328  LEHFDGLDQIFIGNGQGLQINGSGSSF 354


>KYP33001.1 hypothetical protein KK1_046197, partial [Cajanus cajan]
          Length = 470

 Score =  241 bits (615), Expect = 4e-69
 Identities = 149/410 (36%), Positives = 213/410 (51%), Gaps = 30/410 (7%)
 Frame = -1

Query: 1425 PSSPAPHS-----FTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEM 1261
            P +P P+S     F+H++S KL  KN+LLW QQVE VI  H+LH ++VNP IP ++A   
Sbjct: 2    PKNPPPNSHPSLTFSHTISEKLGTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLA 61

Query: 1260 DRTLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKA 1081
            DR     +E Y  W  QDQ+L +WL SS+S  +L RVIGCK S+Q+WDK+H +F++ + A
Sbjct: 62   DRDAGCISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNA 121

Query: 1080 KVRSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPF 901
            K   LR+EL +T     SISEYVLRI+ + D+L AIG+ +S ++ +D IL+GLPE+Y   
Sbjct: 122  KACQLRNELCSTSLENLSISEYVLRIQTLVDALTAIGDSVSLKEHLDIILEGLPEEYEST 181

Query: 900  IMMMYGRSDCPSVADVESLLLVQEAQFDKYKSE------LSTGSVSIN--VAQGPGNRES 745
            + ++  R D  ++ +VE+LLL  E++ DK+K +      ++T ++  N  V     +   
Sbjct: 182  MSLISSRFDLLTIDEVETLLLGHESRLDKFKKKAAAYINVTTATIEPNPSVTNPQAHLAH 241

Query: 744  ADEQNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYV 565
             + Q+  S                             CQ+C RY H A  C+ RFD+ YV
Sbjct: 242  QENQSGFSHRRGGHTNFRGGRFSNRAGRGRGRFAAYQCQVCHRYEHVASACYYRFDETYV 301

Query: 564  QPDPPPE------IXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAA-----TTQEVQVPR 418
             P  P E      I                          PR         T+ + Q   
Sbjct: 302  -PSSPLEAPAYHSINQHTNPGAWYNNQPASPSPHQNGILGPRPQFTPQVQFTSTQAQPQA 360

Query: 417  MFDSQV------WYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQGV 286
            M  S        WYPDSGAS+HVT  SQ++ Q   F G DQ+ +GNGQG+
Sbjct: 361  MIASSSSSSNNNWYPDSGASNHVTNVSQNIHQFTLFKGPDQIHVGNGQGL 410


>KYP34307.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1102

 Score =  253 bits (645), Expect = 6e-69
 Identities = 149/415 (35%), Positives = 217/415 (52%), Gaps = 30/415 (7%)
 Frame = -1

Query: 1440 ISPVQPSSPAPHS-----FTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLR 1276
            ++ V P +P P+S     F+H++S KLD KN+LLW QQVE VI  H+LH ++VNP IP +
Sbjct: 18   VNTVPPKNPPPNSHPSLTFSHTISEKLDTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQK 77

Query: 1275 YANEMDRTLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFY 1096
            +A   DR     +E Y  W  QDQ+L +WL SS+S  +L RVIGCK S+Q+WDK+H +F+
Sbjct: 78   FATLADRDAGRISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFH 137

Query: 1095 TQLKAKVRSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPE 916
            + + AK R LR+EL+ T     SISEYVLRI+ + D+L AIG  +S ++ +D IL+GLPE
Sbjct: 138  SHMNAKARQLRNELRNTSLENLSISEYVLRIQTLVDALTAIGNSVSPKEHLDIILEGLPE 197

Query: 915  DYGPFIMMMYGRSDCPSVADVESLLLVQEAQFDKYKSEL--------STGSVSINVAQGP 760
            +Y   + ++    D  ++ +VE+LLL  E++ DK+K ++        +T   + +V    
Sbjct: 198  EYESTVSLISSHFDLLTIDEVETLLLGHESRLDKFKKKVAASINVTTTTTEPNPSVTNPQ 257

Query: 759  GNRESADEQNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRF 580
             +    + Q+  S                             CQ+C RYGH A  C+ RF
Sbjct: 258  AHLAHQENQSGFSHRQGGRTNFRGGRFSNRAGRGRGRFAGYQCQVCHRYGHVASACYYRF 317

Query: 579  DQDYVQPDPPPEI------------XXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQ 436
            D+ YV P  P E                                       P+V   +TQ
Sbjct: 318  DETYV-PSSPLEAPAYHSINQHTNPGAWYSNQTASPSSHRNEILGPRPQFTPQVQFTSTQ 376

Query: 435  ---EVQVPRMFDSQV--WYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQGV 286
               +  +     S +  WYPDS AS+HVT  SQ++ Q  PF G DQ+ +GNGQG+
Sbjct: 377  AQPQAMIASSSSSSINNWYPDSRASNHVTNVSQNIHQFTPFEGPDQIHVGNGQGL 431


>GAU37351.1 hypothetical protein TSUD_395330 [Trifolium subterraneum]
          Length = 1216

 Score =  253 bits (647), Expect = 6e-69
 Identities = 116/213 (54%), Positives = 162/213 (76%)
 Frame = -1

Query: 1434 PVQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDR 1255
            P           THSL++KLD+ NFLLW+QQV  VI AH LHRFVVNP IPL++ +  DR
Sbjct: 19   PATTKESTRSGLTHSLTIKLDENNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFDSIEDR 78

Query: 1254 TLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKV 1075
                 + EYQ+W V+DQ LFTWLLS++SD ILPRV+ C+H+ +VWD +HK+F + LK++ 
Sbjct: 79   ANLKNSVEYQKWLVKDQTLFTWLLSTISDGILPRVLSCRHAHEVWDSIHKYFNSMLKSRA 138

Query: 1074 RSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIM 895
            R LR ELK TKK + S++EY+LRIK+I +S +A+G+ +++Q+Q+DAIL+GLPE++  F+M
Sbjct: 139  RQLRFELKNTKKMSCSVNEYLLRIKSIVNSPVAVGDIVTKQEQIDAILEGLPEEFNSFVM 198

Query: 894  MMYGRSDCPSVADVESLLLVQEAQFDKYKSELS 796
            M+Y R D P+V D+E+LLL+QE QF+K+K ELS
Sbjct: 199  MVYSRFDTPTVEDIEALLLLQEVQFEKFKQELS 231



 Score = 57.4 bits (137), Expect(2) = 7e-11
 Identities = 25/39 (64%), Positives = 30/39 (76%)
 Frame = -1

Query: 408 SQVWYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQ 292
           S  WYPDSGASHH+T +  +L+  VP+NG DQVLMG GQ
Sbjct: 320 SGAWYPDSGASHHLTYNPNNLSYRVPYNGYDQVLMGIGQ 358



 Score = 40.0 bits (92), Expect(2) = 7e-11
 Identities = 18/38 (47%), Positives = 26/38 (68%)
 Frame = -2

Query: 287 SNKVLLQGTLGPDGLYCFQPLKFLQGSSCGAQFRSQSS 174
           SN++LL+G++G DGLY FQP KFL  +   ++    SS
Sbjct: 384 SNQILLEGSVGVDGLYKFQPFKFLPINGVNSKLTQASS 421


>KHN49021.1 hypothetical protein glysoja_031232, partial [Glycine soja]
          Length = 323

 Score =  219 bits (559), Expect = 1e-62
 Identities = 116/311 (37%), Positives = 173/311 (55%), Gaps = 27/311 (8%)
 Frame = -1

Query: 1404 SFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQ 1225
            SF + +S+KLD  N+L+W QQ+E V+ AH+LHRF V P IP +YA+E DR  +IE   + 
Sbjct: 14   SFNYKISVKLDATNYLVWLQQIEPVLRAHRLHRFCVTPEIPPQYASEHDRLANIENPAFS 73

Query: 1224 RWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTT 1045
             W +QDQ+L  WL SSLS +ILP VIGCKH++Q+W+ +H+ F ++ KA+ R LR++L+TT
Sbjct: 74   NWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLRTT 133

Query: 1044 KKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPS 865
            KKG+ SISE++ +IK ISDSL +IGE +S QDQ+D IL+GLP ++   + ++  + +   
Sbjct: 134  KKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFESLVTLINSKIEWFD 193

Query: 864  VADVESLLLVQEAQFDKYKSELSTGSVSINVAQ--------------------------- 766
            + ++ +LLL  E + DK +      S++   +Q                           
Sbjct: 194  LEEIRALLLAHEQRLDKARITEEAASLNFTQSQPNSKIPNSVNPNSATETQIAPQANWTT 253

Query: 765  GPGNRESADEQNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWN 586
            G  N  + D QN N+                             CQ+C R GHDA  C++
Sbjct: 254  GNSNSGNYDSQN-NNFKNNNQSRGRGGRNGRGNRGGRGGRSTVQCQVCHRTGHDASYCYH 312

Query: 585  RFDQDYVQPDP 553
            RF+  Y    P
Sbjct: 313  RFNAAYGSNQP 323


>GAU27211.1 hypothetical protein TSUD_108020 [Trifolium subterraneum]
          Length = 967

 Score =  232 bits (592), Expect = 3e-62
 Identities = 116/211 (54%), Positives = 155/211 (73%)
 Frame = -1

Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219
            THSL++KLD+KNFLLW+QQV  VI AH LHRFVVNP I L+YA+  DR     +EEY+ W
Sbjct: 34   THSLTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPEILLQYASIADRLDGKNSEEYKTW 93

Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039
              +DQ LFTWLLS++SD +LPRV+ CKHS +VW+K+HK+F + LK++ R LRSELK TKK
Sbjct: 94   LFKDQSLFTWLLSTISDGVLPRVLNCKHSHEVWEKIHKYFNSVLKSRARQLRSELKNTKK 153

Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859
              RS+SEY+LRIK+I +SLIA+G+    + + +     L   +   I+  +  SD P+V 
Sbjct: 154  SARSMSEYLLRIKSIVNSLIAMGDMDCWKQESEKERIVLLSHFVKIIVKSFTGSDNPTVE 213

Query: 858  DVESLLLVQEAQFDKYKSELSTGSVSINVAQ 766
            D+E LLL+QEAQF+K++ EL+  SVS NVAQ
Sbjct: 214  DIEGLLLLQEAQFEKFRQELANPSVSTNVAQ 244


Top