BLASTX nr result
ID: Astragalus24_contig00022125
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00022125
         (496 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|PNY18017.1| copia-type polyprotein [Trifolium pratense]             179   8e-50
gb|PNX70198.1| cationic amino acid transporter 1-like protein, p...    160   5e-47
gb|PNX61214.1| copia-type polyprotein, partial [Trifolium pratense]    162   6e-47
dbj|GAU42259.1| hypothetical protein TSUD_327370 [Trifolium subt...    172   1e-46
gb|PNX73153.1| hypothetical protein L195_g029051 [Trifolium prat...    160   2e-46
gb|PNX68532.1| copia-type polyprotein [Trifolium pratense]             159   3e-46
dbj|GAU37611.1| hypothetical protein TSUD_365320 [Trifolium subt...    169   4e-46
dbj|GAU35215.1| hypothetical protein TSUD_204910 [Trifolium subt...    168   2e-45
gb|KYP32045.1| Retrovirus-related Pol polyprotein from transposo...    160   4e-45
dbj|GAU50483.1| hypothetical protein TSUD_409690 [Trifolium subt...    167   4e-45
gb|PNX66461.1| hypothetical protein L195_g055105, partial [Trifo...    157   6e-45
dbj|GAU46965.1| hypothetical protein TSUD_143070 [Trifolium subt...    160   7e-45
dbj|GAU39052.1| hypothetical protein TSUD_396570 [Trifolium subt...    155   1e-44
gb|KYP66912.1| Retrovirus-related Pol polyprotein from transposo...    157   2e-44
gb|KYP76820.1| Copia protein [Cajanus cajan]                           154   2e-44
gb|KYP67250.1| Copia protein [Cajanus cajan]                           153   3e-44
dbj|GAU42845.1| hypothetical protein TSUD_387380 [Trifolium subt...    164   3e-44
gb|KYP66838.1| Retrovirus-related Pol polyprotein from transposo...
152 4e-44 dbj|GAU22332.1| hypothetical protein TSUD_106600 [Trifolium subt... 159 6e-44 dbj|GAU12596.1| hypothetical protein TSUD_132060 [Trifolium subt... 159 6e-44 >gb|PNY18017.1| copia-type polyprotein [Trifolium pratense] Length = 999 Score = 179 bits (453), Expect(2) = 8e-50 Identities = 91/155 (58%), Positives = 105/155 (67%), Gaps = 27/155 (17%) Frame = +3 Query: 3 DICFAVGAVSRHM---------------------------FPIKPVQNHAELIGYSDSDW 101 DICF VG VSR M FPI E++ YSD+DW Sbjct: 792 DICFVVGLVSRFMEEPRKSHMNAARRVLRYIAGTLEFGILFPISARNAKPEIVCYSDADW 851 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGDK+DRRSTTGY F + A ISWCS+KQPVVALSSCEAEYIAG+ AACQA+W++SVL E Sbjct: 852 CGDKIDRRSTTGYFFKFMNASISWCSRKQPVVALSSCEAEYIAGSYAACQALWIESVLKE 911 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 L +D++RPIKL IDNKSAI LAKN V HGRSKHI+ Sbjct: 912 LKVDVERPIKLQIDNKSAINLAKNPVLHGRSKHIE 946 Score = 46.2 bits (108), Expect(2) = 8e-50 Identities = 21/37 (56%), Positives = 28/37 (75%) Frame = +1 Query: 385 TKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALKIE 495 T+FHFLR+QV + LEV +C T QIAD +TK+LK + Sbjct: 947 TRFHFLREQVNQGSLEVIHCATGSQIADAMTKSLKTD 983 >gb|PNX70198.1| cationic amino acid transporter 1-like protein, partial [Trifolium pratense] Length = 199 Score = 160 bits (406), Expect = 5e-47 Identities = 83/155 (53%), Positives = 105/155 (67%), Gaps = 27/155 (17%) Frame = +3 Query: 3 DICFAVGAVSR--------HMFPIKPVQNHAE-------------------LIGYSDSDW 101 D+ FAVGAVSR HM +K + + + L G+S SD Sbjct: 14 DLAFAVGAVSRFVNSPKKSHMIAVKKIMRYVKGTMNYGILLPNTLSNAVNRLEGFSYSDR 73 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGD VDRRSTTGY+F + APISWCSKKQPV+ALSSCEAEYIA A AACQ IWL+S+L + Sbjct: 74 CGDHVDRRSTTGYIFKFLDAPISWCSKKQPVIALSSCEAEYIACAFAACQGIWLESLLKD 133 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 + I++ P++L++DNKSAI LA+N +SHGRSKHI+ Sbjct: 134 IKIELTEPMQLLVDNKSAINLARNPISHGRSKHIE 168 >gb|PNX61214.1| copia-type polyprotein, partial [Trifolium pratense] Length = 239 Score = 162 bits 
(409), Expect = 6e-47 Identities = 85/157 (54%), Positives = 99/157 (63%), Gaps = 27/157 (17%) Frame = +3 Query: 3 DICFAVGAVSRHM---------------------------FPIKPVQNHAELIGYSDSDW 101 D CFA G VSR M P V E+I YSD DW Sbjct: 49 DTCFAAGLVSRFMEDPRQSHMKAAMRISRYIADTLDFGILSPKSAVNAKLEIICYSDVDW 108 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGDKVDRRS TGY F Y A ++WCS+KQPVVALSSCEA+YIAG+ AACQA+W+ SVL E Sbjct: 109 CGDKVDRRSITGYFFKYLNASVAWCSRKQPVVALSSCEAKYIAGSYAACQALWINSVLKE 168 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHIDKV 392 L I++K+PI L I N+SAI LAKN V HGRSKHI+ + Sbjct: 169 LKINVKKPITLQIXNQSAINLAKNPVLHGRSKHIEAI 205 >dbj|GAU42259.1| hypothetical protein TSUD_327370 [Trifolium subterraneum] Length = 1090 Score = 172 bits (436), Expect = 1e-46 Identities = 88/155 (56%), Positives = 103/155 (66%), Gaps = 27/155 (17%) Frame = +3 Query: 3 DICFAVGAVSRHM---------------------------FPIKPVQNHAELIGYSDSDW 101 +ICFAVG VSR M FP E++ YSD+DW Sbjct: 864 NICFAVGLVSRFMEDPRESHMKADTRILRYIAGTPYYGILFPKSAKNTKLEIVCYSDADW 923 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGDKVDRRST GY F + AP++WCS+KQPVVALSSCEAEYIAG+ AACQ +W+KSVL E Sbjct: 924 CGDKVDRRSTPGYFFKFLKAPVAWCSRKQPVVALSSCEAEYIAGSYAACQTLWMKSVLEE 983 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 L ID+K+PI L IDN+SAI LAKN V HGRSKHI+ Sbjct: 984 LKIDVKKPITLQIDNQSAINLAKNPVLHGRSKHIE 1018 >gb|PNX73153.1| hypothetical protein L195_g029051 [Trifolium pratense] Length = 238 Score = 160 bits (406), Expect = 2e-46 Identities = 73/116 (62%), Positives = 93/116 (80%) Frame = +3 Query: 39 MFPIKPVQNHAELIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEA 218 +FP ++ AEL+ YSDS+W GDK+DRRST+GYV +Y+GA ISWC+KK PV ALS+CEA Sbjct: 91 LFPTGMKKDSAELVSYSDSNWGGDKIDRRSTSGYVMLYNGALISWCTKKHPVTALSTCEA 150 Query: 219 EYIAGATAACQAIWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 EYIA + CQ IWL SV+ EL ++K+P+KL+IDNKSAI L KN +SHGRSKHI+ Sbjct: 151 EYIAETFSTCQVIWLDSVMKELKCELKKPLKLLIDNKSAISLVKNPISHGRSKHIE 206 >gb|PNX68532.1| 
copia-type polyprotein [Trifolium pratense] Length = 194 Score = 159 bits (401), Expect = 3e-46 Identities = 75/116 (64%), Positives = 90/116 (77%) Frame = +3 Query: 39 MFPIKPVQNHAELIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEA 218 +FP + L YSDSDW GD DRRST+GYV +Y+GA I+WC+KKQPV ALS+CEA Sbjct: 28 LFPTGLKDDCESLSSYSDSDWGGDATDRRSTSGYVMLYNGASIAWCTKKQPVTALSTCEA 87 Query: 219 EYIAGATAACQAIWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 EYIAG A CQ +WL SVL EL ++++P+KLMIDNKSAI LAKN VSHGRSKHI+ Sbjct: 88 EYIAGTFATCQMVWLDSVLRELKCELQKPLKLMIDNKSAINLAKNPVSHGRSKHIE 143 >dbj|GAU37611.1| hypothetical protein TSUD_365320 [Trifolium subterraneum] Length = 718 Score = 169 bits (429), Expect = 4e-46 Identities = 86/155 (55%), Positives = 107/155 (69%), Gaps = 27/155 (17%) Frame = +3 Query: 3 DICFAVGAVSR--------HMFPIKPVQNHAE-------------------LIGYSDSDW 101 D+ FAVGAVSR HM +K + + + L G+SDSDW Sbjct: 508 DLSFAVGAVSRFVESPKQSHMVAVKRILRYVQGTMEFGVLFPNNISNSVNRLEGFSDSDW 567 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGD VDRRSTTGY+F + APISWCSKKQPV+ALSSCEAEYIA A AACQ IWL+S+L + Sbjct: 568 CGDHVDRRSTTGYIFKFLHAPISWCSKKQPVIALSSCEAEYIACAFAACQGIWLESLLKD 627 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 + +++ PI+L++DNKSAI LAKN +SHGRSKHI+ Sbjct: 628 IQVELTEPIQLLVDNKSAINLAKNPISHGRSKHIE 662 >dbj|GAU35215.1| hypothetical protein TSUD_204910 [Trifolium subterraneum] Length = 1149 Score = 168 bits (425), Expect(2) = 2e-45 Identities = 85/155 (54%), Positives = 106/155 (68%), Gaps = 27/155 (17%) Frame = +3 Query: 3 DICFAVGAVSR--------HMFPIKPVQNHAE-------------------LIGYSDSDW 101 D+ FAVG VSR HM +K + + + L G+SDSDW Sbjct: 939 DLSFAVGVVSRFVESPKQSHMVAVKRILRYVQGTMEFGVLFPNNISNSVNKLEGFSDSDW 998 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGD VDRRSTTGY+F + APISWCSKKQPV+ALSSCEAEYIA A AACQ IWL+S+L + Sbjct: 999 CGDHVDRRSTTGYIFKFLHAPISWCSKKQPVIALSSCEAEYIACAFAACQGIWLESLLKD 1058 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 + +++ PI+L++DNKSAI LAKN 
+SHGRSKHI+ Sbjct: 1059 IQVELTEPIQLLVDNKSAINLAKNPISHGRSKHIE 1093 Score = 42.0 bits (97), Expect(2) = 2e-45 Identities = 18/35 (51%), Positives = 26/35 (74%) Frame = +1 Query: 385 TKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALK 489 T+FHF+R+QV + + +C T+VQ AD+LTK LK Sbjct: 1094 TRFHFIREQVNNGRIVLKHCPTEVQEADILTKGLK 1128 >gb|KYP32045.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1205 Score = 160 bits (404), Expect(2) = 4e-45 Identities = 75/116 (64%), Positives = 92/116 (79%) Frame = +3 Query: 39 MFPIKPVQNHAELIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEA 218 MFP K + ++GYSD+DWCGDK DR+STTGYVFM APISWCS+KQ VVALSSCEA Sbjct: 1035 MFPNKFSSPNHNMVGYSDADWCGDKADRKSTTGYVFMLGDAPISWCSRKQSVVALSSCEA 1094 Query: 219 EYIAGATAACQAIWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 EYIA + ACQA+WL+++L EL + + + LM+DNKSAI LAKN V+HGRSKHI+ Sbjct: 1095 EYIAASMGACQALWLETLLEELKTETEEGMLLMVDNKSAINLAKNPVAHGRSKHIE 1150 Score = 49.3 bits (116), Expect(2) = 4e-45 Identities = 21/37 (56%), Positives = 30/37 (81%) Frame = +1 Query: 385 TKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALKIE 495 T+FHFLRDQ+ K+ L++ +C ++ QIAD+LTK LK E Sbjct: 1151 TRFHFLRDQISKRKLKLEFCRSESQIADILTKPLKKE 1187 >dbj|GAU50483.1| hypothetical protein TSUD_409690 [Trifolium subterraneum] Length = 1073 Score = 167 bits (423), Expect(2) = 4e-45 Identities = 85/155 (54%), Positives = 106/155 (68%), Gaps = 27/155 (17%) Frame = +3 Query: 3 DICFAVGAVSR--------HMFPIKPVQNHAE-------------------LIGYSDSDW 101 D+ FAVGAVSR HM +K + + + L G+SDSDW Sbjct: 863 DLSFAVGAVSRFVESPKQSHMVAVKRILRYVQGTMEFGVLFPNNISNSVNRLEGFSDSDW 922 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGD VDRRSTTGY+F + APISWCSKKQPV+ALSSCEAEYIA AACQ IWL+S+L + Sbjct: 923 CGDHVDRRSTTGYIFKFLHAPISWCSKKQPVIALSSCEAEYIACDFAACQGIWLESLLKD 982 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 + +++ PI+L++DNKSAI LAKN +SHGRSKHI+ Sbjct: 983 IQVELTEPIQLLVDNKSAINLAKNPISHGRSKHIE 1017 Score = 42.0 bits (97), Expect(2) = 4e-45 Identities = 18/35 (51%), Positives 
= 26/35 (74%) Frame = +1 Query: 385 TKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALK 489 T+FHF+R+QV + + +C T+VQ AD+LTK LK Sbjct: 1018 TRFHFIREQVNNGRIVLNHCPTEVQEADILTKGLK 1052 >gb|PNX66461.1| hypothetical protein L195_g055105, partial [Trifolium pratense] Length = 254 Score = 157 bits (397), Expect = 6e-45 Identities = 75/116 (64%), Positives = 90/116 (77%) Frame = +3 Query: 39 MFPIKPVQNHAELIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEA 218 +FP EL G+SD+DWCGDKVDRRST+GY+F + AP+SWCSKKQ V+ALSSCEA Sbjct: 86 LFPYSKDSVKLELNGFSDADWCGDKVDRRSTSGYLFKFQNAPVSWCSKKQSVIALSSCEA 145 Query: 219 EYIAGATAACQAIWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 EY+AG+ AACQA WL+S+L E+ I I L IDNKSAI LAKN VSHG+SKHI+ Sbjct: 146 EYVAGSLAACQANWLQSLLNEMKIIDNITIMLKIDNKSAINLAKNPVSHGKSKHIE 201 >dbj|GAU46965.1| hypothetical protein TSUD_143070 [Trifolium subterraneum] Length = 1119 Score = 160 bits (404), Expect(2) = 7e-45 Identities = 83/155 (53%), Positives = 102/155 (65%), Gaps = 27/155 (17%) Frame = +3 Query: 3 DICFAVGAVSRHM---------------------------FPIKPVQNHAELIGYSDSDW 101 DI +AVG VSR M +P + EL G+SD+DW Sbjct: 910 DIIYAVGYVSRFMSNPLKSHLLAAKRILRYINGTIHYGVLYPYARDSSKLELNGFSDADW 969 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGDKVDRRST+GYVF + AP+SWCSKKQ V+ALSSCEAEY+AG+ AACQA WL+S+L+E Sbjct: 970 CGDKVDRRSTSGYVFKFQNAPVSWCSKKQSVIALSSCEAEYVAGSLAACQANWLQSLLSE 1029 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 + I + L IDNKSAI LAKN+VSHG+SKHI+ Sbjct: 1030 MKITNNITVMLKIDNKSAINLAKNSVSHGKSKHIE 1064 Score = 48.5 bits (114), Expect(2) = 7e-45 Identities = 23/43 (53%), Positives = 30/43 (69%) Frame = +1 Query: 361 HMDAANTLTKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALK 489 H + + T+FHFLRDQV K L + YC+T Q AD+LTKA+K Sbjct: 1057 HGKSKHIETRFHFLRDQVNKGKLSLKYCSTNDQQADILTKAMK 1099 >dbj|GAU39052.1| hypothetical protein TSUD_396570 [Trifolium subterraneum] Length = 1309 Score = 155 bits (391), Expect(2) = 1e-44 Identities = 79/155 (50%), Positives = 101/155 (65%), Gaps = 27/155 (17%) Frame = +3 Query: 3 
DICFAVGAVSRHM---------------------------FPIKPVQNHAELIGYSDSDW 101 DI +AVG+VSR M FP ++ EL G+SDSDW Sbjct: 1100 DINYAVGSVSRFMSNPKASHMVAAKRILRYLKGTKDFGLVFPTNNKESKIELEGFSDSDW 1159 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGDK DRRS +GY F + +PISW S+KQ +VALSSC+AEY+A A AACQA+WL+S+L E Sbjct: 1160 CGDKDDRRSKSGYWFRFKNSPISWSSRKQSIVALSSCDAEYVAAAQAACQAVWLESLLDE 1219 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 L I +P+KL +DNKSAI LA+N ++HGRSKHI+ Sbjct: 1220 LKIKYVKPVKLNVDNKSAISLARNPIAHGRSKHIE 1254 Score = 52.8 bits (125), Expect(2) = 1e-44 Identities = 24/37 (64%), Positives = 29/37 (78%) Frame = +1 Query: 385 TKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALKIE 495 TK+HFLRDQV K+ L V YC T VQIAD+LTK L+ + Sbjct: 1255 TKYHFLRDQVSKEKLTVEYCKTDVQIADILTKPLRAD 1291 >gb|KYP66912.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 861 Score = 157 bits (398), Expect(2) = 2e-44 Identities = 74/116 (63%), Positives = 92/116 (79%) Frame = +3 Query: 39 MFPIKPVQNHAELIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEA 218 MFP K + ++GYSD+DWCGDK DR+STTGYVFM APISWCS+KQ VV LSSCEA Sbjct: 691 MFPNKFSCPNHNMVGYSDADWCGDKADRKSTTGYVFMLGDAPISWCSRKQSVVPLSSCEA 750 Query: 219 EYIAGATAACQAIWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 EYIA + ACQA+WL+++L EL I+ + + LM+DNKS+I LAKN V+HGRSKHI+ Sbjct: 751 EYIAASMGACQALWLETLLEELKIETEEGMLLMVDNKSSINLAKNPVAHGRSKHIE 806 Score = 49.3 bits (116), Expect(2) = 2e-44 Identities = 21/37 (56%), Positives = 30/37 (81%) Frame = +1 Query: 385 TKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALKIE 495 T+FHFLRDQ+ K+ L++ +C ++ QIAD+LTK LK E Sbjct: 807 TRFHFLRDQISKRKLKLEFCRSESQIADILTKPLKKE 843 >gb|KYP76820.1| Copia protein [Cajanus cajan] Length = 189 Score = 154 bits (388), Expect(2) = 2e-44 Identities = 71/117 (60%), Positives = 89/117 (76%) Frame = +3 Query: 39 MFPIKPVQNHAELIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEA 218 +FP + +L+ YSDSDWCGDK DR+ST Y+F Y GAPISW S K+PVVALSSCEA Sbjct: 28 
LFPKGKGEIEEKLVAYSDSDWCGDKSDRKSTARYIFFYGGAPISWSSSKEPVVALSSCEA 87 Query: 219 EYIAGATAACQAIWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHIDK 389 EYIA + AACQA+WL +++ EL ++ +KL++DNKSAI LAK+ HGRSKHIDK Sbjct: 88 EYIAASEAACQAVWLDALMKELQVEHLNKVKLLVDNKSAIDLAKHTTVHGRSKHIDK 144 Score = 53.1 bits (126), Expect(2) = 2e-44 Identities = 22/38 (57%), Positives = 32/38 (84%) Frame = +1 Query: 379 TLTKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALKI 492 T T+FH+LR+QV K+ LE+ +C T++Q AD+LTKALK+ Sbjct: 152 TETRFHYLREQVSKEKLEIEHCGTEIQFADILTKALKL 189 >gb|KYP67250.1| Copia protein [Cajanus cajan] Length = 178 Score = 153 bits (386), Expect = 3e-44 Identities = 71/104 (68%), Positives = 85/104 (81%) Frame = +3 Query: 75 LIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQA 254 L+ YSDSDWCGD VDRRSTTG VF+ G+PISW SKKQ VVALS+CEAEYIA AACQA Sbjct: 41 LVAYSDSDWCGDLVDRRSTTGQVFLLSGSPISWSSKKQTVVALSTCEAEYIAACLAACQA 100 Query: 255 IWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 +WL S+L+EL + + ++L++D+KS I LAKN VSHGRSKHID Sbjct: 101 LWLSSLLSELKVSVDNGVELLVDSKSTIDLAKNPVSHGRSKHID 144 >dbj|GAU42845.1| hypothetical protein TSUD_387380 [Trifolium subterraneum] Length = 1239 Score = 164 bits (416), Expect(2) = 3e-44 Identities = 85/155 (54%), Positives = 106/155 (68%), Gaps = 27/155 (17%) Frame = +3 Query: 3 DICFAVGAVSR--------HMFPIKPVQNHAE-------------------LIGYSDSDW 101 D+ FAVGAVSR HM +K + + + L G+SDSDW Sbjct: 1029 DLSFAVGAVSRFVESPKQSHMVAVKRILRYVQGTMEFGVLFPNNISNSVNRLEGFSDSDW 1088 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGD VDRRSTTGY+F + APISWCSKKQPV+ALSSCEAEYIA A AACQ I L+S+L + Sbjct: 1089 CGDHVDRRSTTGYIFKFLHAPISWCSKKQPVIALSSCEAEYIACAFAACQGIGLESLLKD 1148 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 + +++ PI+L++DNKSAI LAKN +SHGRSKHI+ Sbjct: 1149 IQVELTEPIQLLVDNKSAINLAKNPISHGRSKHIE 1183 Score = 41.6 bits (96), Expect(2) = 3e-44 Identities = 18/35 (51%), Positives = 26/35 (74%) Frame = +1 Query: 385 TKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALK 489 T+FHF+R+QV + + +C T+VQ AD+LTK LK Sbjct: 1184 
TRFHFIREQVNNGRIILNHCPTEVQEADILTKGLK 1218 >gb|KYP66838.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1317 Score = 152 bits (385), Expect(2) = 4e-44 Identities = 70/104 (67%), Positives = 86/104 (82%) Frame = +3 Query: 75 LIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQA 254 L+ YSDSDWCGD VDRRST G VF++ G+PISW SKKQ VV LS+CEAEYIA +AACQA Sbjct: 1163 LVAYSDSDWCGDLVDRRSTMGQVFLFSGSPISWSSKKQTVVVLSTCEAEYIAACSAACQA 1222 Query: 255 IWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 +WL S+L+EL + + ++L++D+KSAI LAKN VSHGRSKHID Sbjct: 1223 LWLSSLLSELKVSVDSGVELLVDSKSAIDLAKNPVSHGRSKHID 1266 Score = 53.1 bits (126), Expect(2) = 4e-44 Identities = 22/37 (59%), Positives = 32/37 (86%) Frame = +1 Query: 385 TKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALKIE 495 TK+HFLRDQV K +++ +C T+VQ+AD++TK+LKIE Sbjct: 1267 TKYHFLRDQVSKGRIKLKHCRTEVQLADIMTKSLKIE 1303 >dbj|GAU22332.1| hypothetical protein TSUD_106600 [Trifolium subterraneum] Length = 1171 Score = 159 bits (401), Expect(2) = 6e-44 Identities = 75/116 (64%), Positives = 91/116 (78%) Frame = +3 Query: 39 MFPIKPVQNHAELIGYSDSDWCGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEA 218 +FP + EL G+SD+DWCGDKVDRRST+GYVF + AP+SWCSKKQ V+ALSSCEA Sbjct: 1001 LFPYSRDSSKLELNGFSDADWCGDKVDRRSTSGYVFKFQSAPVSWCSKKQSVIALSSCEA 1060 Query: 219 EYIAGATAACQAIWLKSVLTELGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 EY+AG+ AACQA W++S+L E+ I I L IDNKSAI LAKN VSHG+SKHI+ Sbjct: 1061 EYVAGSLAACQANWMQSLLNEMKIIDNITIMLKIDNKSAINLAKNPVSHGKSKHIE 1116 Score = 46.6 bits (109), Expect(2) = 6e-44 Identities = 22/43 (51%), Positives = 30/43 (69%) Frame = +1 Query: 361 HMDAANTLTKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALK 489 H + + T+FHFLRDQV K L + YC+T Q AD+LTK++K Sbjct: 1109 HGKSKHIETRFHFLRDQVNKGKLSLEYCSTNDQQADILTKSVK 1151 >dbj|GAU12596.1| hypothetical protein TSUD_132060 [Trifolium subterraneum] Length = 1096 Score = 159 bits (401), Expect(2) = 6e-44 Identities = 83/155 (53%), Positives = 100/155 (64%), Gaps = 27/155 (17%) Frame = +3 Query: 3 
DICFAVGAVSRHM---------------------------FPIKPVQNHAELIGYSDSDW 101 DI +AVG VSR M FP + EL G+SD+DW Sbjct: 887 DISYAVGYVSRFMSKPLKSYLLAAKRILRYINGTIHYGVLFPYSRDSSKLELNGFSDADW 946 Query: 102 CGDKVDRRSTTGYVFMYHGAPISWCSKKQPVVALSSCEAEYIAGATAACQAIWLKSVLTE 281 CGDKVDRRST+GYVF + AP+SWCSKKQ V+ LSSCEAEY+AG+ AACQA WL+S+L+E Sbjct: 947 CGDKVDRRSTSGYVFKFQNAPVSWCSKKQSVIVLSSCEAEYVAGSLAACQANWLQSLLSE 1006 Query: 282 LGIDIKRPIKLMIDNKSAIMLAKNAVSHGRSKHID 386 + I + L IDNKSAI LAKN VSHG+SKHI+ Sbjct: 1007 MKITDNITVMLKIDNKSAINLAKNHVSHGKSKHIE 1041 Score = 46.6 bits (109), Expect(2) = 6e-44 Identities = 23/43 (53%), Positives = 29/43 (67%) Frame = +1 Query: 361 HMDAANTLTKFHFLRDQVEKKLLEVCYCNTQVQIADVLTKALK 489 H + + T FHFLRDQV K L + YC+T Q AD+LTKA+K Sbjct: 1034 HGKSKHIETMFHFLRDQVNKGKLSLEYCSTNDQHADILTKAVK 1076