BLASTX nr result
ID: Glycyrrhiza35_contig00002445
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza35_contig00002445 (1890 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU15285.1 hypothetical protein TSUD_03520 [Trifolium subterraneum] 352 e-113 GAU26016.1 hypothetical protein TSUD_64040 [Trifolium subterraneum] 338 e-101 GAU29238.1 hypothetical protein TSUD_362280 [Trifolium subterran... 334 5e-97 KHN36156.1 Retrovirus-related Pol polyprotein from transposon TN... 334 7e-97 KHN22040.1 Retrovirus-related Pol polyprotein from transposon TN... 334 7e-97 GAU19342.1 hypothetical protein TSUD_336290 [Trifolium subterran... 332 3e-96 GAU30708.1 hypothetical protein TSUD_39320 [Trifolium subterraneum] 330 2e-95 GAU11134.1 hypothetical protein TSUD_197580 [Trifolium subterran... 316 3e-91 GAU20748.1 hypothetical protein TSUD_231620 [Trifolium subterran... 274 2e-83 GAU44321.1 hypothetical protein TSUD_305020 [Trifolium subterran... 277 1e-82 KYP36809.1 hypothetical protein KK1_042047 [Cajanus cajan] 253 5e-76 KYP36193.1 hypothetical protein KK1_042704 [Cajanus cajan] 236 2e-70 KYP61342.1 Retrovirus-related Pol polyprotein from transposon TN... 256 8e-70 KYP46257.1 Retrovirus-related Pol polyprotein from transposon TN... 256 2e-69 KYP40244.1 Retrovirus-related Pol polyprotein from transposon TN... 248 2e-69 KYP33001.1 hypothetical protein KK1_046197, partial [Cajanus cajan] 241 4e-69 KYP34307.1 Retrovirus-related Pol polyprotein from transposon TN... 253 6e-69 GAU37351.1 hypothetical protein TSUD_395330 [Trifolium subterran... 253 6e-69 KHN49021.1 hypothetical protein glysoja_031232, partial [Glycine... 219 1e-62 GAU27211.1 hypothetical protein TSUD_108020 [Trifolium subterran... 232 3e-62 >GAU15285.1 hypothetical protein TSUD_03520 [Trifolium subterraneum] Length = 392 Score = 352 bits (904), Expect = e-113 Identities = 181/377 (48%), Positives = 244/377 (64%), Gaps = 3/377 (0%) Frame = -1 Query: 1431 VQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRT 1252 V+ + + ++ + + LKLDD N+LLW+QQVE VI A+KLHRFVVNP IP +YA+E DR Sbjct: 13 VESTGSSTNAASFTPKLKLDDGNYLLWSQQVEGVILANKLHRFVVNPQIPAKYASESDRE 72 Query: 1251 LDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVR 1072 LD +E Y +W VQDQMLFTWLLS+L++S+LPR IGC+H++QVWD++HKHF LKAKVR Sbjct: 73 LDRVSEAYDKWLVQDQMLFTWLLSTLAESVLPRTIGCRHAFQVWDQIHKHFEAHLKAKVR 132 Query: 1071 SLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMM 892 LRSELK KKGT+SI+E+VLR++ I+D+LI+IG+ ISEQDQ+D+IL+GLPE+Y PF+MM Sbjct: 133 QLRSELKNVKKGTKSITEFVLRVRVIADTLISIGDSISEQDQIDSILEGLPEEYNPFVMM 192 Query: 891 MYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNSXXX 712 +YGRSD PS+ D+E LLLVQE+Q +K++ ELST S S N+A G R ++ + Sbjct: 193 IYGRSDSPSLYDIEGLLLVQESQLEKFRQELSTPSASANLAHSRGGRGNSGARG------ 246 Query: 711 XXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXX 532 G RPTCQ+C +YGH CW RFD+++V P P + Sbjct: 247 -----RGRSTRGRGRPAASPTGNRPTCQLCGKYGHHVIDCWYRFDENFV-PAPNSSVLKS 300 Query: 531 XXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSGASHHVTADSQ 352 +A TQE+ +P Q W+PDSGASHH+TAD+ Sbjct: 301 DTSGPKTNHESPQACTAN--------FAPATQELVIP-----QSWFPDSGASHHITADAS 347 Query: 351 HLTQSVPFN---GSDQV 310 +L Q + G DQ+ Sbjct: 348 NLAQDLSVQHVPGIDQI 364 >GAU26016.1 hypothetical protein TSUD_64040 [Trifolium subterraneum] Length = 942 Score = 338 bits (868), Expect = e-101 Identities = 179/383 (46%), Positives = 236/383 (61%), Gaps = 12/383 (3%) Frame = -1 Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219 THS+++KLD+KNFLLW+QQV VI AH LHRFVVNPNIPL++A DR +EEY++W Sbjct: 34 THSITIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPNIPLQFATVNDRIEGNTSEEYRKW 93 Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039 +DQ LFTWLLS++SDS+LPRV+ CKHS +VWDK+HK+F + LK+++R LRSELK TKK Sbjct: 94 LFKDQTLFTWLLSTISDSVLPRVLHCKHSHEVWDKIHKYFNSVLKSRIRQLRSELKNTKK 153 Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859 RS+SEY+LRIK+I +SLIA+ E +SEQ+QVDAILDGLPED+ F+MM+Y R D P+V Sbjct: 154 LARSVSEYLLRIKSIVNSLIAMSEVVSEQEQVDAILDGLPEDFNSFVMMVYSRFDTPTVE 213 Query: 858 DVESLLLVQEAQFDKYKSELSTGSVSINVAQGPG-----NRESADEQNFN----SXXXXX 706 D+E LL++QEAQF+K++ EL+ +VS NVAQ N+++ D ++ N S Sbjct: 214 DIEGLLMLQEAQFEKFRQELANPNVSANVAQMESKNHHSNQDTEDTESVNESYGSNTYRG 273 Query: 705 XXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXX 526 + CQIC ++ HDA CW R+D PP Sbjct: 274 RGRGKGRARGRDKAPITPNAGKVQCQICAKHNHDAANCWYRYD--------PPS---SRY 322 Query: 525 XXXXXXXXXXXXXXXXXXXXAPRVYAATTQE---VQVPRMFDSQVWYPDSGASHHVTADS 355 P + A Q + F + WYPDSGASHH+T + Sbjct: 323 NARGYNAGSTSRQPQYNPYPRPSAHLALPQHYNPIADMDTFSNASWYPDSGASHHLTFNP 382 Query: 354 QHLTQSVPFNGSDQVLMGNGQGV 286 +LT P+ G DQV MGNGQGV Sbjct: 383 NNLTYRTPYQGQDQVTMGNGQGV 405 >GAU29238.1 hypothetical protein TSUD_362280 [Trifolium subterraneum] Length = 1433 Score = 334 bits (857), Expect = 5e-97 Identities = 175/380 (46%), Positives = 232/380 (61%), Gaps = 9/380 (2%) Frame = -1 Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219 THSL++KLD+KNFLLW+QQV VI AH LHRFVVNP IPL++A+ DR ++EY++W Sbjct: 34 THSLTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPEIPLQFASVNDRIEGKTSDEYRKW 93 Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039 +DQ LFTWLLS++SD++LPRV+ CKHS +VWDK+HK+F + LK+++R LRSELK TKK Sbjct: 94 LFKDQTLFTWLLSTISDAVLPRVVHCKHSHEVWDKIHKYFNSVLKSRIRQLRSELKNTKK 153 Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859 R +SEY+LRIK+I +SLIA+GE ++EQ+Q+DAIL+GLPED+ F+MMMY R D P+V Sbjct: 154 LARPVSEYLLRIKSIVNSLIAMGETVTEQEQIDAILEGLPEDFNSFVMMMYSRFDTPTVE 213 Query: 858 DVESLLLVQEAQFDKYKSELSTGSVSINVAQ-GPGNRESADE--------QNFNSXXXXX 706 D+E LL++QEAQF+K++ EL+ SVS N+AQ G N +S E +++NS Sbjct: 214 DIEGLLMLQEAQFEKFRQELTNPSVSANIAQIGSKNHQSNSEAEDTESGNESYNSNSYRG 273 Query: 705 XXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXX 526 + CQIC + HDA CW R Y P Sbjct: 274 RGRGKGRARGRGRAPNAPNTGKVQCQICGKANHDAAICWYR----YEPPSSRSNACGHNA 329 Query: 525 XXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSGASHHVTADSQHL 346 P+ Y V + WYPDSGASHH+T + +L Sbjct: 330 GSSSRPPPYNPYPRPSAHLALPQYYNPIADMDSV----SNASWYPDSGASHHLTFNPNNL 385 Query: 345 TQSVPFNGSDQVLMGNGQGV 286 T P+ G DQV MGNGQGV Sbjct: 386 TYRTPYQGQDQVTMGNGQGV 405 >KHN36156.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1417 Score = 334 bits (856), Expect = 7e-97 Identities = 170/378 (44%), Positives = 237/378 (62%), Gaps = 10/378 (2%) Frame = -1 Query: 1389 LSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRWFVQ 1210 L++KLD+KNFLLW+QQV VI AH LHRFVVNP IPL++A+ D L I ++EYQ+W ++ Sbjct: 1 LTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFASIEDCALGINSDEYQQWLIK 60 Query: 1209 DQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKKGTR 1030 DQ LFTWLLS+LSD +LPRV+ C+H+ +VWDK+HK+F + LK++ R LRSELK TKK +R Sbjct: 61 DQTLFTWLLSTLSDGVLPRVLSCRHAHEVWDKIHKYFNSVLKSRARQLRSELKNTKKLSR 120 Query: 1029 SISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVADVE 850 S++EY+LRIK+I +SL+A+G+ +SEQ+QVD+IL+GLPE++ F+MM+Y R D P+V DVE Sbjct: 121 SVNEYLLRIKSIVNSLVAVGDMVSEQEQVDSILEGLPEEFNSFVMMVYSRFDTPTVEDVE 180 Query: 849 SLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFN--------SXXXXXXXXX 694 +LLL+QEAQF+K+K EL++ SVS NVA N ++ ++ + + Sbjct: 181 ALLLLQEAQFEKFKQELTSPSVSANVAHTETNASDSNSEHESQELGTEHYNVNANRGRGR 240 Query: 693 XXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXXXXXX 514 + CQIC + HDA CW R+D + + Sbjct: 241 GKGRGRGRGKGQAQNQGKVKCQICAKPNHDAINCWYRYDPQAMNQN----------SRGG 290 Query: 513 XXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRM--FDSQVWYPDSGASHHVTADSQHLTQ 340 P + A Q +P M F + WYPDSGASHH+T + +L+ Sbjct: 291 YQVGPSNRPQNFNPYMRPTAHLAMPQPYAMPNMDQFSNGAWYPDSGASHHLTYNPNNLSY 350 Query: 339 SVPFNGSDQVLMGNGQGV 286 S P+ G DQV+MGNGQGV Sbjct: 351 SSPYTGQDQVVMGNGQGV 368 >KHN22040.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1417 Score = 334 bits (856), Expect = 7e-97 Identities = 170/378 (44%), Positives = 237/378 (62%), Gaps = 10/378 (2%) Frame = -1 Query: 1389 LSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRWFVQ 1210 L++KLD+KNFLLW+QQV VI AH LHRFVVNP IPL++A+ D L I ++EYQ+W ++ Sbjct: 1 LTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFASIEDCALGINSDEYQQWLIK 60 Query: 1209 DQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKKGTR 1030 DQ LFTWLLS+LSD +LPRV+ C+H+ +VWDK+HK+F + LK++ R LRSELK TKK +R Sbjct: 61 DQTLFTWLLSTLSDGVLPRVLSCRHAHEVWDKIHKYFNSVLKSRARQLRSELKNTKKLSR 120 Query: 1029 SISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVADVE 850 S++EY+LRIK+I +SL+A+G+ +SEQ+QVD+IL+GLPE++ F+MM+Y R D P+V DVE Sbjct: 121 SVNEYLLRIKSIVNSLVAVGDMVSEQEQVDSILEGLPEEFNSFVMMVYSRFDTPTVEDVE 180 Query: 849 SLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFN--------SXXXXXXXXX 694 +LLL+QEAQF+K+K EL++ SVS NVA N ++ ++ + + Sbjct: 181 ALLLLQEAQFEKFKQELTSPSVSANVAHTETNASDSNSEHESQELGTEHYNVNANRGRGR 240 Query: 693 XXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXXXXXX 514 + CQIC + HDA CW R+D + + Sbjct: 241 GKGRGRGRGKGQAQNQGKVKCQICAKPNHDAINCWYRYDPQAMNQN----------SRGG 290 Query: 513 XXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRM--FDSQVWYPDSGASHHVTADSQHLTQ 340 P + A Q +P M F + WYPDSGASHH+T + +L+ Sbjct: 291 YQVGPSNRPQNFNPYMRPTAHLAMPQPYAMPNMDQFSNGAWYPDSGASHHLTYNPNNLSY 350 Query: 339 SVPFNGSDQVLMGNGQGV 286 S P+ G DQV+MGNGQGV Sbjct: 351 SSPYTGQDQVVMGNGQGV 368 >GAU19342.1 hypothetical protein TSUD_336290 [Trifolium subterraneum] Length = 1442 Score = 332 bits (852), Expect = 3e-96 Identities = 172/392 (43%), Positives = 239/392 (60%), Gaps = 9/392 (2%) Frame = -1 Query: 1434 PVQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDR 1255 P A THSL++KLD+KN+LLWNQQV VI AH LHRF+VNP IP+++A++ DR Sbjct: 22 PQTSKDSAKSGLTHSLTIKLDEKNYLLWNQQVNGVITAHDLHRFIVNPQIPIQFASDADR 81 Query: 1254 TLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKV 1075 D ++EY++W +DQ LFTWLLS+LSDS+LPRV+GCKH++QVWD++HK+F++ L+A+ Sbjct: 82 VADRTSDEYRQWIFKDQTLFTWLLSTLSDSVLPRVLGCKHAFQVWDQIHKYFHSVLQARA 141 Query: 1074 RSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIM 895 R LRSELK TKK +RS+ EY+LRIK+I +SL+A+G+ +S+++QVDAIL+GLPE++ F+M Sbjct: 142 RQLRSELKNTKKASRSVGEYLLRIKSIVNSLLAVGDLVSDREQVDAILEGLPEEFNSFVM 201 Query: 894 MMYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVA--QGPGNRESADE----- 736 M+Y R D P+V DVE+LLL+QEAQF+K++ EL++ SVS +VA + S D+ Sbjct: 202 MVYSRFDTPTVEDVEALLLLQEAQFEKFRQELASPSVSAHVALTDSKMSDNSVDQDSHEV 261 Query: 735 --QNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQ 562 +++ + CQIC + HDA CW R+ Sbjct: 262 GTEHYVAGKGRGRGKGRGKGRSRGRGSYSGGNQGTQCQICSKSSHDAVNCWYRY------ 315 Query: 561 PDPPPEIXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSG 382 P P + P + A P WYPDSG Sbjct: 316 -HPSPSM----MNAPRGHAVAHSRPPPYNPPMRPSAHLALPYYTGAP---SEASWYPDSG 367 Query: 381 ASHHVTADSQHLTQSVPFNGSDQVLMGNGQGV 286 ASHH+T D +L S P+ G DQV+MGNGQGV Sbjct: 368 ASHHLTYDPYNLVHSNPYTGHDQVMMGNGQGV 399 >GAU30708.1 hypothetical protein TSUD_39320 [Trifolium subterraneum] Length = 1432 Score = 330 bits (845), Expect = 2e-95 Identities = 182/417 (43%), Positives = 245/417 (58%), Gaps = 12/417 (2%) Frame = -1 Query: 1434 PVQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDR 1255 P + THSL++KLD+KNFLLW+QQV VI H LHRFVVNP IPL++A+ DR Sbjct: 22 PTTAKDSSKSGLTHSLTIKLDEKNFLLWSQQVNGVITTHNLHRFVVNPEIPLQFASVNDR 81 Query: 1254 TLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKV 1075 ++EY++W +DQ LFTWLLS++SDS+LPRV+ CKHS +VWDK+HK+F + LK+++ Sbjct: 82 LDGKISDEYRKWLFKDQTLFTWLLSTISDSVLPRVLHCKHSHEVWDKIHKYFNSVLKSRI 141 Query: 1074 RSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIM 895 R LRSELK TKK RS+SEY+LRIK+I +SLIA+GE ISEQ+Q+DAILDGL E++ F+M Sbjct: 142 RQLRSELKNTKKLARSVSEYLLRIKSIINSLIAMGESISEQEQIDAILDGLSEEFNSFVM 201 Query: 894 MMYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVAQGPG-----NRESADEQN 730 M+Y R D P+V DVE LL++QEAQFDK++ EL+ SVS NVAQ N+E D ++ Sbjct: 202 MVYSRFDNPTVEDVEGLLMLQEAQFDKFRQELTNPSVSANVAQMDSKNQHPNQEVEDTES 261 Query: 729 FNSXXXXXXXXXXXXXXXXXXXXXXXXGP----RPTCQICFRYGHDAFRCWNRFDQDYVQ 562 N + CQIC + HDA CW R++ Sbjct: 262 GNEHYTFNTYRGKGRGRGKAKARGKAPNALNNGKVQCQICSKSNHDAANCWYRYE----- 316 Query: 561 PDPPPEIXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFD---SQVWYP 391 PP P + A Q FD + WYP Sbjct: 317 ---PPS---SRTNGRGYNAGNTSRPPLYNPYPRPSAHLALPQYYNPTAEFDTYSNASWYP 370 Query: 390 DSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQGVQ*STSSRYFRSRWSVLFPATEI 220 DSGASHH+T + ++ P+ G DQV MGNGQGV ST+S + + ++ P+ ++ Sbjct: 371 DSGASHHLTFNPNNMAYRTPYQGQDQVTMGNGQGV--STASLGYSNFYAPNNPSVQL 425 >GAU11134.1 hypothetical protein TSUD_197580 [Trifolium subterraneum] Length = 1234 Score = 316 bits (810), Expect = 3e-91 Identities = 162/399 (40%), Positives = 231/399 (57%), Gaps = 29/399 (7%) Frame = -1 Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219 THSL++KLD+KNFLLWNQQV VI AH LHRFVVNP IPL+Y + DR ++EYQ+W Sbjct: 36 THSLTIKLDEKNFLLWNQQVNGVITAHNLHRFVVNPQIPLQYESVEDRLDGKNSDEYQQW 95 Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039 +DQ LFTWLLS++SD +LPRV+ CKHS++VW+++HKHF + LK++ R LRSELK TKK Sbjct: 96 LFKDQSLFTWLLSTISDDVLPRVLSCKHSYEVWEQIHKHFNSVLKSRSRQLRSELKNTKK 155 Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859 RS++EY++RIK+I +SLIA+G+ +S+++QV+A+L+GLP+++ F+MM+Y + P V Sbjct: 156 MARSVNEYLIRIKSIVNSLIAVGDVVSDKEQVEAVLEGLPKEFSSFVMMIYSQFATPKVK 215 Query: 858 DVESLLLVQEAQFDKYKSELSTGSVSINVAQGPGN---------RESADEQNFNSXXXXX 706 DVE+LLL++E QF+K++ EL+ VS N Q N + + +++N Sbjct: 216 DVEALLLLREVQFEKFRQELANPRVSANTTQVQSNFNDEAMDTETQESGTEHYNVSANRG 275 Query: 705 XXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPD---------- 556 + CQIC + HDA CW+R++ +P+ Sbjct: 276 KGRGKGRGRGRGRASNPQNSGKVQCQICGKLNHDALNCWHRYEPQSTKPNSCGYHAPSGS 335 Query: 555 -PPPEIXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMF---------DS 406 PPP YA + + VP+ F S Sbjct: 336 RPPPY----------------------------NPYARPSAHLAVPQYFPSIPDTDSVSS 367 Query: 405 QVWYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQG 289 WYPD GASHH+T + + P+ G DQV+MGNGQG Sbjct: 368 ASWYPDFGASHHLTFNPNNFAHRAPYQGPDQVMMGNGQG 406 >GAU20748.1 hypothetical protein TSUD_231620 [Trifolium subterraneum] Length = 327 Score = 274 bits (700), Expect = 2e-83 Identities = 130/237 (54%), Positives = 178/237 (75%), Gaps = 11/237 (4%) Frame = -1 Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219 THSL++KLD+KNFL W+QQV VI AH LHRF+VNP IPL++A DR ++EY++W Sbjct: 34 THSLTIKLDEKNFLSWSQQVNGVITAHNLHRFIVNPEIPLQFATVADRIDGKTSDEYRKW 93 Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039 +DQ LFTWLLS++SD +LPRV+ CKH+ +VWDK+HK+F + LK+++R L+SELK TKK Sbjct: 94 IFKDQTLFTWLLSTISDVVLPRVVHCKHAHEVWDKIHKYFNSVLKSRIRQLKSELKNTKK 153 Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859 R +SEY+LRIK+I +SLIA+GE I+EQ+QV+AILDGLPE++ F+MM+Y R D P++ Sbjct: 154 LARPVSEYLLRIKSIVNSLIAMGEMITEQEQVEAILDGLPEEFNSFVMMVYSRFDTPTIE 213 Query: 858 DVESLLLVQEAQFDKYKSELSTGSVSINVAQ-----------GPGNRESADEQNFNS 721 VE LL++QEAQF+K++ EL+ SVS NVAQ G N ++ NFN+ Sbjct: 214 YVEGLLMIQEAQFEKFRQELTNPSVSANVAQMESKNNQANQDGEDNESDTEQYNFNA 270 >GAU44321.1 hypothetical protein TSUD_305020 [Trifolium subterraneum] Length = 468 Score = 277 bits (708), Expect = 1e-82 Identities = 131/220 (59%), Positives = 169/220 (76%) Frame = -1 Query: 1434 PVQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDR 1255 P THSL++KLD+KNFLLW+QQ+ VI H LHRFVVNP IPL++A+ DR Sbjct: 246 PTTTKDSTKSGLTHSLTIKLDEKNFLLWSQQINGVITTHNLHRFVVNPEIPLQFASVNDR 305 Query: 1254 TLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKV 1075 +EEYQ+W DQ LFTWLLS++SDSILPRV+ CKH+ +VWDK+HKHF + LK+++ Sbjct: 306 LNGKISEEYQKWLFIDQTLFTWLLSTISDSILPRVLHCKHAHEVWDKIHKHFNSVLKSRI 365 Query: 1074 RSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIM 895 R LR ELK TKK R ISEY+LRIK+I +SLIA+GE +SEQ+QV+ ILDGLPE++ PF+M Sbjct: 366 RQLRFELKNTKKLARPISEYLLRIKSIINSLIALGEAVSEQEQVNVILDGLPEEFNPFVM 425 Query: 894 MMYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSIN 775 M+Y R D P+V DVE LL++QEAQF+K++ EL+ SVS N Sbjct: 426 MVYSRYDTPTVEDVEGLLMLQEAQFEKFRQELTNPSVSAN 465 >KYP36809.1 hypothetical protein KK1_042047 [Cajanus cajan] Length = 280 Score = 253 bits (646), Expect = 5e-76 Identities = 124/243 (51%), Positives = 164/243 (67%), Gaps = 22/243 (9%) Frame = -1 Query: 1203 MLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKKG-TRS 1027 MLF+WLLSSLS+S+LPRV+GCKHS+++WDK+HKH+Y+ L AK R LRSELK++KKG ++ Sbjct: 1 MLFSWLLSSLSESVLPRVLGCKHSYEIWDKIHKHYYSHLHAKKRQLRSELKSSKKGPSQP 60 Query: 1026 ISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVADVES 847 ISEY+LRI+ I +SLI +G+ +++QDQ+D ILDGLPEDY FIMM+YGRSD SV DVES Sbjct: 61 ISEYILRIREIINSLIVVGDLVTDQDQIDTILDGLPEDYNSFIMMIYGRSDSISVTDVES 120 Query: 846 LLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNS------------------ 721 LLLVQEAQ +KY+ +L++ SVS+NV QGP + + + F S Sbjct: 121 LLLVQEAQLEKYRQDLTSPSVSVNVVQGPQDSQFQSQSQFVSNRGGFQFSTRGGRYRGGR 180 Query: 720 ---XXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPP 550 G RPTCQ+C++YGHDAF CWNRFD+ ++QP P Sbjct: 181 YRCRGRNRCGGRYRGGRYRGRGRNRGGGQRPTCQLCYKYGHDAFHCWNRFDEAFIQPSQP 240 Query: 549 PEI 541 P + Sbjct: 241 PNL 243 >KYP36193.1 hypothetical protein KK1_042704 [Cajanus cajan] Length = 221 Score = 236 bits (602), Expect = 2e-70 Identities = 116/212 (54%), Positives = 151/212 (71%), Gaps = 5/212 (2%) Frame = -1 Query: 1203 MLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKKG-TRS 1027 MLF+WLLSSLS+ +LPRV+GCKHS+++WDK+HKH+Y+ L AK R LRSELK++K G ++ Sbjct: 1 MLFSWLLSSLSEPVLPRVLGCKHSYEIWDKIHKHYYSHLHAKKRQLRSELKSSKIGPSQP 60 Query: 1026 ISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVADVES 847 ISEY+LRI+AI +SLIA+GE ++ QDQ+D ILDGLPEDY F+MM+YGRSD SV DVES Sbjct: 61 ISEYILRIRAIINSLIAVGELVTNQDQIDTILDGLPEDYNSFVMMIYGRSDSISVTDVES 120 Query: 846 LLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNS----XXXXXXXXXXXXXX 679 LLLVQEAQ +KY+ +L++ SVS+N QGP + + F S Sbjct: 121 LLLVQEAQLEKYRQDLTSPSVSVNAVQGPQDSHFQSQSQFVSNRGGFQFSNRGGRYCGGR 180 Query: 678 XXXXXXXXXXGPRPTCQICFRYGHDAFRCWNR 583 G RPTCQ+C++YG+DAF CWNR Sbjct: 181 YRGRGRNRGGGQRPTCQLCYKYGYDAFHCWNR 212 >KYP61342.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1358 Score = 256 bits (655), Expect = 8e-70 Identities = 148/380 (38%), Positives = 208/380 (54%) Frame = -1 Query: 1422 SSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDI 1243 SSP F++S++ KLDD N+L W QQ+E VI +HKL RFVVNP IP RY + DR DI Sbjct: 7 SSPFSQFFSNSIAEKLDDSNYLHWRQQIEPVIKSHKLQRFVVNPQIPPRYLTDADRDSDI 66 Query: 1242 ETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLR 1063 Y+ W VQDQML TWL S+LS SIL RVIG HS+QVWDKVH++F+TQ KA+ R LR Sbjct: 67 VNPAYETWEVQDQMLLTWLQSTLSKSILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLR 126 Query: 1062 SELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYG 883 ++L++T +S+ +++ +IK I+D L +G P+S ++ VDA+L+GLP++Y P + ++ Sbjct: 127 TDLRSTTLDGQSMRDFLTQIKTIADELAGVGSPVSLEEYVDAVLEGLPQEYAPVVSVIES 186 Query: 882 RSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNSXXXXXX 703 + P +A+VE+LLL E++ ++++ + + S SIN QG S Sbjct: 187 KFVTPPIAEVEALLLAHESRANRFRKQ--SFSPSINYTQG------------YSRGSVSG 232 Query: 702 XXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXXX 523 CQICF+YGH A C+ R D +Y QP Sbjct: 233 GHSGRRGGRGSGRGRGGRFANFQCQICFKYGHTANVCFYRADVNY-QP------------ 279 Query: 522 XXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSGASHHVTADSQHLT 343 A + A P S W PDSGAS HVT + Q++ Sbjct: 280 -------------------AESLVLAMVANTSQPGANSS--WIPDSGASFHVTGEPQNIH 318 Query: 342 QSVPFNGSDQVLMGNGQGVQ 283 Q F+G DQ+ +GNGQG+Q Sbjct: 319 QLEHFDGPDQIFIGNGQGLQ 338 >KYP46257.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1408 Score = 256 bits (653), Expect = 2e-69 Identities = 149/399 (37%), Positives = 215/399 (53%), Gaps = 14/399 (3%) Frame = -1 Query: 1440 ISPVQPSSPAPHS-----FTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLR 1276 ++ V P +P +S F+H++S KLD KN+LLW QQVE VI H+LH ++VNP IP + Sbjct: 18 VNTVPPKNPPSNSHPSLTFSHTISEKLDTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQK 77 Query: 1275 YANEMDRTLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFY 1096 +A DR +E Y W QDQ+L +WL SS+S +L RVIGCK S+Q+WDK+H +F+ Sbjct: 78 FATLADRDAGHISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHTYFH 137 Query: 1095 TQLKAKVRSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPE 916 + + AK R LR+EL++T SISEYVLRI+ + D+L AIG+ +S ++ +D IL+GLPE Sbjct: 138 SHMNAKARQLRNELRSTTLDNLSISEYVLRIQTLVDALTAIGDSVSPKEHLDIILEGLPE 197 Query: 915 DYGPFIMMMYGRSDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVA------QGPGN 754 +Y + ++ R D ++ +VE+LLL E++ DK+K + + S+++ A Sbjct: 198 EYESTVSLISSRFDLLTIDEVETLLLGHESRLDKFKKK-AAASINVTTAVTEPDPSATNP 256 Query: 753 RESADEQNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPR---PTCQICFRYGHDAFRCWNR 583 + QN S R CQ+C RYGH A C+ R Sbjct: 257 QAHLTHQNNQSGPSHRRGGRTNSRGGRFSNWAGRGRGRFAGYQCQVCHRYGHVASACYYR 316 Query: 582 FDQDYVQPDPPPEIXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQ 403 FD+ YV P P E Y + Q ++ Sbjct: 317 FDETYV-PSSPLEAP---------------------------AYPSNNQHTNPGACNNN- 347 Query: 402 VWYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQGV 286 WYPDSGAS+HVT SQ++ Q PF G DQ+ +GNGQG+ Sbjct: 348 -WYPDSGASNHVTNVSQNIHQFTPFEGPDQIHVGNGQGL 385 >KYP40244.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 720 Score = 248 bits (634), Expect = 2e-69 Identities = 139/387 (35%), Positives = 209/387 (54%) Frame = -1 Query: 1419 SPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIE 1240 SP F++S++ KLDD N+L W QQ++ +I +HKL RFVVNP IP RY + DR DI Sbjct: 8 SPFSQFFSNSIAEKLDDSNYLHWRQQIKPIIKSHKLQRFVVNPQIPPRYLTDADRDYDIV 67 Query: 1239 TEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRS 1060 Y+ W VQDQML TWL S LS +IL RVIG HS+QVWDKVH++F+TQ KA+ R LR+ Sbjct: 68 NPAYETWEVQDQMLLTWLQSMLSKTILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLRT 127 Query: 1059 ELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGR 880 +L++T +S+ +++ +IK I+D L +G P+S ++ VD +L+GLP++Y P + ++ + Sbjct: 128 DLRSTTLDGKSMRDFLTQIKNIADQLAGVGSPMSLEEYVDVVLEGLPQEYTPVVSVIESK 187 Query: 879 SDCPSVADVESLLLVQEAQFDKYKSELSTGSVSINVAQGPGNRESADEQNFNSXXXXXXX 700 P +A+VE+LLL E++ ++++ + + S SIN QG +R S ++F Sbjct: 188 FVTPPIAEVEALLLAHESRVNRFRKQ--SFSPSINYTQG-YSRGSISGESFRDRDGGHSG 244 Query: 699 XXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYVQPDPPPEIXXXXXXX 520 CQ CF+YGH A C+ R D +Y + Sbjct: 245 CRGGQGSGRGRGGRFANF---HCQNCFKYGHTANVCFYRADVNYQLVE------------ 289 Query: 519 XXXXXXXXXXXXXXXXXXAPRVYAATTQEVQVPRMFDSQVWYPDSGASHHVTADSQHLTQ 340 + + + W PDSGAS H+T + Q++ Q Sbjct: 290 ----------------------FLVLAMVANTSQAGANSSWIPDSGASFHITGEPQNIHQ 327 Query: 339 SVPFNGSDQVLMGNGQGVQ*STSSRYF 259 F+G DQ+ +GNGQG+Q + S F Sbjct: 328 LEHFDGLDQIFIGNGQGLQINGSGSSF 354 >KYP33001.1 hypothetical protein KK1_046197, partial [Cajanus cajan] Length = 470 Score = 241 bits (615), Expect = 4e-69 Identities = 149/410 (36%), Positives = 213/410 (51%), Gaps = 30/410 (7%) Frame = -1 Query: 1425 PSSPAPHS-----FTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEM 1261 P +P P+S F+H++S KL KN+LLW QQVE VI H+LH ++VNP IP ++A Sbjct: 2 PKNPPPNSHPSLTFSHTISEKLGTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLA 61 Query: 1260 DRTLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKA 1081 DR +E Y W QDQ+L +WL SS+S +L RVIGCK S+Q+WDK+H +F++ + A Sbjct: 62 DRDAGCISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNA 121 Query: 1080 KVRSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPF 901 K LR+EL +T SISEYVLRI+ + D+L AIG+ +S ++ +D IL+GLPE+Y Sbjct: 122 KACQLRNELCSTSLENLSISEYVLRIQTLVDALTAIGDSVSLKEHLDIILEGLPEEYEST 181 Query: 900 IMMMYGRSDCPSVADVESLLLVQEAQFDKYKSE------LSTGSVSIN--VAQGPGNRES 745 + ++ R D ++ +VE+LLL E++ DK+K + ++T ++ N V + Sbjct: 182 MSLISSRFDLLTIDEVETLLLGHESRLDKFKKKAAAYINVTTATIEPNPSVTNPQAHLAH 241 Query: 744 ADEQNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRFDQDYV 565 + Q+ S CQ+C RY H A C+ RFD+ YV Sbjct: 242 QENQSGFSHRRGGHTNFRGGRFSNRAGRGRGRFAAYQCQVCHRYEHVASACYYRFDETYV 301 Query: 564 QPDPPPE------IXXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAA-----TTQEVQVPR 418 P P E I PR T+ + Q Sbjct: 302 -PSSPLEAPAYHSINQHTNPGAWYNNQPASPSPHQNGILGPRPQFTPQVQFTSTQAQPQA 360 Query: 417 MFDSQV------WYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQGV 286 M S WYPDSGAS+HVT SQ++ Q F G DQ+ +GNGQG+ Sbjct: 361 MIASSSSSSNNNWYPDSGASNHVTNVSQNIHQFTLFKGPDQIHVGNGQGL 410 >KYP34307.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1102 Score = 253 bits (645), Expect = 6e-69 Identities = 149/415 (35%), Positives = 217/415 (52%), Gaps = 30/415 (7%) Frame = -1 Query: 1440 ISPVQPSSPAPHS-----FTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLR 1276 ++ V P +P P+S F+H++S KLD KN+LLW QQVE VI H+LH ++VNP IP + Sbjct: 18 VNTVPPKNPPPNSHPSLTFSHTISEKLDTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQK 77 Query: 1275 YANEMDRTLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFY 1096 +A DR +E Y W QDQ+L +WL SS+S +L RVIGCK S+Q+WDK+H +F+ Sbjct: 78 FATLADRDAGRISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFH 137 Query: 1095 TQLKAKVRSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPE 916 + + AK R LR+EL+ T SISEYVLRI+ + D+L AIG +S ++ +D IL+GLPE Sbjct: 138 SHMNAKARQLRNELRNTSLENLSISEYVLRIQTLVDALTAIGNSVSPKEHLDIILEGLPE 197 Query: 915 DYGPFIMMMYGRSDCPSVADVESLLLVQEAQFDKYKSEL--------STGSVSINVAQGP 760 +Y + ++ D ++ +VE+LLL E++ DK+K ++ +T + +V Sbjct: 198 EYESTVSLISSHFDLLTIDEVETLLLGHESRLDKFKKKVAASINVTTTTTEPNPSVTNPQ 257 Query: 759 GNRESADEQNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWNRF 580 + + Q+ S CQ+C RYGH A C+ RF Sbjct: 258 AHLAHQENQSGFSHRQGGRTNFRGGRFSNRAGRGRGRFAGYQCQVCHRYGHVASACYYRF 317 Query: 579 DQDYVQPDPPPEI------------XXXXXXXXXXXXXXXXXXXXXXXXXAPRVYAATTQ 436 D+ YV P P E P+V +TQ Sbjct: 318 DETYV-PSSPLEAPAYHSINQHTNPGAWYSNQTASPSSHRNEILGPRPQFTPQVQFTSTQ 376 Query: 435 ---EVQVPRMFDSQV--WYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQGV 286 + + S + WYPDS AS+HVT SQ++ Q PF G DQ+ +GNGQG+ Sbjct: 377 AQPQAMIASSSSSSINNWYPDSRASNHVTNVSQNIHQFTPFEGPDQIHVGNGQGL 431 >GAU37351.1 hypothetical protein TSUD_395330 [Trifolium subterraneum] Length = 1216 Score = 253 bits (647), Expect = 6e-69 Identities = 116/213 (54%), Positives = 162/213 (76%) Frame = -1 Query: 1434 PVQPSSPAPHSFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDR 1255 P THSL++KLD+ NFLLW+QQV VI AH LHRFVVNP IPL++ + DR Sbjct: 19 PATTKESTRSGLTHSLTIKLDENNFLLWSQQVNGVITAHNLHRFVVNPQIPLQFDSIEDR 78 Query: 1254 TLDIETEEYQRWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKV 1075 + EYQ+W V+DQ LFTWLLS++SD ILPRV+ C+H+ +VWD +HK+F + LK++ Sbjct: 79 ANLKNSVEYQKWLVKDQTLFTWLLSTISDGILPRVLSCRHAHEVWDSIHKYFNSMLKSRA 138 Query: 1074 RSLRSELKTTKKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIM 895 R LR ELK TKK + S++EY+LRIK+I +S +A+G+ +++Q+Q+DAIL+GLPE++ F+M Sbjct: 139 RQLRFELKNTKKMSCSVNEYLLRIKSIVNSPVAVGDIVTKQEQIDAILEGLPEEFNSFVM 198 Query: 894 MMYGRSDCPSVADVESLLLVQEAQFDKYKSELS 796 M+Y R D P+V D+E+LLL+QE QF+K+K ELS Sbjct: 199 MVYSRFDTPTVEDIEALLLLQEVQFEKFKQELS 231 Score = 57.4 bits (137), Expect(2) = 7e-11 Identities = 25/39 (64%), Positives = 30/39 (76%) Frame = -1 Query: 408 SQVWYPDSGASHHVTADSQHLTQSVPFNGSDQVLMGNGQ 292 S WYPDSGASHH+T + +L+ VP+NG DQVLMG GQ Sbjct: 320 SGAWYPDSGASHHLTYNPNNLSYRVPYNGYDQVLMGIGQ 358 Score = 40.0 bits (92), Expect(2) = 7e-11 Identities = 18/38 (47%), Positives = 26/38 (68%) Frame = -2 Query: 287 SNKVLLQGTLGPDGLYCFQPLKFLQGSSCGAQFRSQSS 174 SN++LL+G++G DGLY FQP KFL + ++ SS Sbjct: 384 SNQILLEGSVGVDGLYKFQPFKFLPINGVNSKLTQASS 421 >KHN49021.1 hypothetical protein glysoja_031232, partial [Glycine soja] Length = 323 Score = 219 bits (559), Expect = 1e-62 Identities = 116/311 (37%), Positives = 173/311 (55%), Gaps = 27/311 (8%) Frame = -1 Query: 1404 SFTHSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQ 1225 SF + +S+KLD N+L+W QQ+E V+ AH+LHRF V P IP +YA+E DR +IE + Sbjct: 14 SFNYKISVKLDATNYLVWLQQIEPVLRAHRLHRFCVTPEIPPQYASEHDRLANIENPAFS 73 Query: 1224 RWFVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTT 1045 W +QDQ+L WL SSLS +ILP VIGCKH++Q+W+ +H+ F ++ KA+ R LR++L+TT Sbjct: 74 NWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLRTT 133 Query: 1044 KKGTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPS 865 KKG+ SISE++ +IK ISDSL +IGE +S QDQ+D IL+GLP ++ + ++ + + Sbjct: 134 KKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFESLVTLINSKIEWFD 193 Query: 864 VADVESLLLVQEAQFDKYKSELSTGSVSINVAQ--------------------------- 766 + ++ +LLL E + DK + S++ +Q Sbjct: 194 LEEIRALLLAHEQRLDKARITEEAASLNFTQSQPNSKIPNSVNPNSATETQIAPQANWTT 253 Query: 765 GPGNRESADEQNFNSXXXXXXXXXXXXXXXXXXXXXXXXGPRPTCQICFRYGHDAFRCWN 586 G N + D QN N+ CQ+C R GHDA C++ Sbjct: 254 GNSNSGNYDSQN-NNFKNNNQSRGRGGRNGRGNRGGRGGRSTVQCQVCHRTGHDASYCYH 312 Query: 585 RFDQDYVQPDP 553 RF+ Y P Sbjct: 313 RFNAAYGSNQP 323 >GAU27211.1 hypothetical protein TSUD_108020 [Trifolium subterraneum] Length = 967 Score = 232 bits (592), Expect = 3e-62 Identities = 116/211 (54%), Positives = 155/211 (73%) Frame = -1 Query: 1398 THSLSLKLDDKNFLLWNQQVEAVIAAHKLHRFVVNPNIPLRYANEMDRTLDIETEEYQRW 1219 THSL++KLD+KNFLLW+QQV VI AH LHRFVVNP I L+YA+ DR +EEY+ W Sbjct: 34 THSLTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPEILLQYASIADRLDGKNSEEYKTW 93 Query: 1218 FVQDQMLFTWLLSSLSDSILPRVIGCKHSWQVWDKVHKHFYTQLKAKVRSLRSELKTTKK 1039 +DQ LFTWLLS++SD +LPRV+ CKHS +VW+K+HK+F + LK++ R LRSELK TKK Sbjct: 94 LFKDQSLFTWLLSTISDGVLPRVLNCKHSHEVWEKIHKYFNSVLKSRARQLRSELKNTKK 153 Query: 1038 GTRSISEYVLRIKAISDSLIAIGEPISEQDQVDAILDGLPEDYGPFIMMMYGRSDCPSVA 859 RS+SEY+LRIK+I +SLIA+G+ + + + L + I+ + SD P+V Sbjct: 154 SARSMSEYLLRIKSIVNSLIAMGDMDCWKQESEKERIVLLSHFVKIIVKSFTGSDNPTVE 213 Query: 858 DVESLLLVQEAQFDKYKSELSTGSVSINVAQ 766 D+E LLL+QEAQF+K++ EL+ SVS NVAQ Sbjct: 214 DIEGLLLLQEAQFEKFRQELANPSVSTNVAQ 244