BLASTX nr result
ID: Astragalus22_contig00026421
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00026421 (349 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU10538.1| hypothetical protein TSUD_422910, partial [Trifo... 102 7e-24 gb|PNX69862.1| gag-pol polyprotein, partial [Trifolium pratense] 100 9e-24 gb|PNX98274.1| gag-pol polyprotein, partial [Trifolium pratense] 97 2e-23 gb|PNX98240.1| gag-pol polyprotein [Trifolium pratense] 100 2e-23 gb|KYP74527.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan] 103 2e-23 ref|XP_020232010.1| uncharacterized protein LOC109812452 [Cajanu... 103 2e-23 ref|XP_020225047.1| uncharacterized protein LOC109806929 [Cajanu... 103 3e-23 ref|XP_015953975.1| uncharacterized protein LOC107478346 [Arachi... 103 3e-23 gb|KYP51328.1| Retrovirus-related Pol polyprotein from transposo... 102 6e-23 gb|KYP43468.1| Uncharacterized protein Mb2253c family [Cajanus c... 96 9e-23 gb|PNX89467.1| gag-pol polyprotein, partial [Trifolium pratense] 97 9e-23 ref|XP_020203766.1| uncharacterized protein LOC109789265 [Cajanu... 100 1e-22 gb|KYP33369.1| Uncharacterized protein Mb2253c family, partial [... 95 1e-22 dbj|GAU10080.1| hypothetical protein TSUD_423780, partial [Trifo... 98 1e-22 dbj|GAU28888.1| hypothetical protein TSUD_293400 [Trifolium subt... 101 1e-22 gb|KYP76415.1| Uncharacterized protein Mb2253c [Cajanus cajan] 98 1e-22 dbj|GAU10833.1| hypothetical protein TSUD_425960, partial [Trifo... 96 1e-22 gb|KYP73954.1| Uncharacterized protein Mb2253c family [Cajanus c... 99 2e-22 dbj|GAU29444.1| hypothetical protein TSUD_150140 [Trifolium subt... 100 2e-22 dbj|GAU46380.1| hypothetical protein TSUD_280790 [Trifolium subt... 100 2e-22 >dbj|GAU10538.1| hypothetical protein TSUD_422910, partial [Trifolium subterraneum] Length = 312 Score = 102 bits (254), Expect = 7e-24 Identities = 52/113 (46%), Positives = 74/113 (65%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 + + TDS LV SQ+ +Y+AK+ L YL V+E ++ F FTE++HVPR N RADILSK Sbjct: 130 IKIYTDSQLVASQVLGEYQAKNDNLSEYLALVKERITKFDFTEIQHVPREHNKRADILSK 189 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTK ++S+IQE LS PS I P +N + ++ GW+ P+ +Y+ G Sbjct: 190 LASTKRKNGNKSVIQEILSHPS-IQKPTRVLDINAIGDANGWMTPVYNYLAHG 241 >gb|PNX69862.1| gag-pol polyprotein, partial [Trifolium pratense] Length = 204 Score = 99.8 bits (247), Expect = 9e-24 Identities = 55/115 (47%), Positives = 73/115 (63%), Gaps = 1/115 (0%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 + + TDS LV SQI +Y+ KD L YL ++E L+ F TEVKHVPR N RADILSK Sbjct: 74 IKIFTDSQLVASQIAGEYQTKDERLTEYLNLIKEKLTKFKHTEVKHVPREHNARADILSK 133 Query: 183 LAST-KIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTGN 344 LA T K G ++S+IQETLSKPS+ E + +S W+ P+ +++ TGN Sbjct: 134 LAXTKKKKGGNQSLIQETLSKPSIAKPXEVFLICEINADS--WMTPVFEFLNTGN 186 >gb|PNX98274.1| gag-pol polyprotein, partial [Trifolium pratense] Length = 147 Score = 97.4 bits (241), Expect = 2e-23 Identities = 53/115 (46%), Positives = 73/115 (63%), Gaps = 5/115 (4%) Frame = +3 Query: 15 TDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSKLAST 194 TDS LVVSQ+ +Y+AK+ LQ YL V+E+L+LF EVKHVPR +N RADILSKLAST Sbjct: 21 TDSQLVVSQVIGEYQAKNDHLQEYLRLVKEMLALFDHIEVKHVPRGDNTRADILSKLAST 80 Query: 195 KIPGNHRSIIQETLSKPSVILNPEAN-----FSVNVVEESQGWIAPLIDYIRTGN 344 K G ++S+IQE L +PS+ + VN +++ W+ Y+ G+ Sbjct: 81 KKKGGNKSVIQEILPRPSIEERTSSTPSVLVIDVNSIKDGTSWMTNYYMYLAHGH 135 >gb|PNX98240.1| gag-pol polyprotein [Trifolium pratense] Length = 291 Score = 100 bits (250), Expect = 2e-23 Identities = 54/115 (46%), Positives = 75/115 (65%), Gaps = 5/115 (4%) Frame = +3 Query: 15 TDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSKLAST 194 TDS LVVSQ+ +Y+AK+ LQ YL V+E+L+LF + EVKHVPR +N+RADI SKLAST Sbjct: 116 TDSQLVVSQVIGEYQAKNDHLQDYLRLVREMLALFDYIEVKHVPRGDNMRADIFSKLAST 175 Query: 195 KIPGNHRSIIQETLSKPSVILNPEANFS-----VNVVEESQGWIAPLIDYIRTGN 344 K G ++S+IQE L +PS+ + S VN +E+ W+ Y+ G+ Sbjct: 176 KKKGGNKSVIQEILPRPSIEEHTSPTLSVMAVDVNSIEDGTSWMTNYYTYLAHGH 230 >gb|KYP74527.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan] Length = 676 Score = 103 bits (257), Expect = 2e-23 Identities = 55/113 (48%), Positives = 75/113 (66%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 +S N+DS L+V Q+G Y+AKD +LQRY + +S F +KHVPR +N RAD+LSK Sbjct: 190 VSCNSDSKLMVEQLGGTYQAKDALLQRYFHTAAQQISSFDEFSIKHVPREQNTRADLLSK 249 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTK PG HR+IIQETL PS+ + +VN EE GW+ + +Y++ G Sbjct: 250 LASTKKPGQHRTIIQETLHSPSL---DDKTVNVNDSEE-LGWMTDIWNYLKDG 298 >ref|XP_020232010.1| uncharacterized protein LOC109812452 [Cajanus cajan] Length = 700 Score = 103 bits (257), Expect = 2e-23 Identities = 55/113 (48%), Positives = 75/113 (66%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 +S N+DS L+V Q+G Y+AKD +LQRY + +S F +KHVPR +N RAD+LSK Sbjct: 310 VSCNSDSKLMVEQLGGTYQAKDALLQRYFHTAAQQISSFDEFSIKHVPREQNTRADLLSK 369 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTK PG HR+IIQETL PS+ + +VN EE GW+ + +Y++ G Sbjct: 370 LASTKKPGQHRTIIQETLHSPSL---DDKTVNVNDSEE-LGWMTDIWNYLKDG 418 >ref|XP_020225047.1| uncharacterized protein LOC109806929 [Cajanus cajan] Length = 1070 Score = 103 bits (257), Expect = 3e-23 Identities = 55/113 (48%), Positives = 75/113 (66%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 +S N+DS L+V Q+G Y+AKD +LQRY + +S F +KHVPR +N RAD+LSK Sbjct: 596 VSCNSDSKLMVEQLGGTYQAKDALLQRYFHTAAQQISSFDEFSIKHVPREQNTRADLLSK 655 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTK PG HR+IIQETL PS+ + +VN EE GW+ + +Y++ G Sbjct: 656 LASTKKPGQHRTIIQETLHSPSL---DDKTVNVNDSEE-LGWMTDIWNYLKDG 704 >ref|XP_015953975.1| uncharacterized protein LOC107478346 [Arachis duranensis] Length = 689 Score = 103 bits (256), Expect = 3e-23 Identities = 52/108 (48%), Positives = 74/108 (68%) Frame = +3 Query: 15 TDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSKLAST 194 +DS +V SQI ++Y+AKDP ++RYL + E L F TE+KH+ R+ N RAD LSKLAST Sbjct: 310 SDSQVVTSQINREYQAKDPNMKRYLDKTLEHLRRFEETEIKHITRNLNSRADALSKLAST 369 Query: 195 KIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRT 338 K GN+RS+IQ+TL +PSV A + V GW+ PL++Y+++ Sbjct: 370 KPGGNNRSLIQKTLPEPSVAKTEVAQDVLEVTGPDLGWMKPLVEYLKS 417 >gb|KYP51328.1| Retrovirus-related Pol polyprotein from transposon 17.6 [Cajanus cajan] Length = 787 Score = 102 bits (254), Expect = 6e-23 Identities = 54/116 (46%), Positives = 74/116 (63%), Gaps = 3/116 (2%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 +S N+DS L+V Q+ + Y+AKD +LQRY +S F +KHVPR +N RAD+LSK Sbjct: 353 VSCNSDSKLMVDQLSRTYQAKDTLLQRYFHTASHQISSFDKFTIKHVPREQNARADLLSK 412 Query: 183 LASTKIPGNHRSIIQETLSKPSV---ILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTK PG HR+IIQETL PS+ ++N N E GW+A + +Y++ G Sbjct: 413 LASTKRPGQHRTIIQETLHSPSLDDKVINVSDN-------EDLGWMADIWNYLKEG 461 >gb|KYP43468.1| Uncharacterized protein Mb2253c family [Cajanus cajan] Length = 156 Score = 95.9 bits (237), Expect = 9e-23 Identities = 51/112 (45%), Positives = 70/112 (62%), Gaps = 3/112 (2%) Frame = +3 Query: 15 TDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSKLAST 194 +DS L+ Q+G Y+ K+P LQRY V L S F ++KHVPR+ N+RAD+LSKLAST Sbjct: 42 SDSKLITEQVGGSYQTKEPQLQRYNLMVSHLTSSFDHFQIKHVPRAHNVRADLLSKLAST 101 Query: 195 KIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQG---WIAPLIDYIRTG 341 K PG H++IIQET+S PS SV V+ + G W++ + Y+ G Sbjct: 102 KRPGQHKTIIQETISAPSY-------DSVTVLANNPGQSSWMSNIRQYLTDG 146 >gb|PNX89467.1| gag-pol polyprotein, partial [Trifolium pratense] Length = 213 Score = 97.4 bits (241), Expect = 9e-23 Identities = 53/115 (46%), Positives = 74/115 (64%), Gaps = 1/115 (0%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 + + TDS LV SQI +Y+ KD L YL ++E L+ F +EVKHVPR N RADILSK Sbjct: 45 IKIFTDSQLVASQIAGEYQTKDERLTEYLNLIKEKLTKFKQSEVKHVPREHNARADILSK 104 Query: 183 LAST-KIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTGN 344 LAST K G ++S+IQETLSKPS++ E + + W+A +++++ GN Sbjct: 105 LASTKKKKGGNQSLIQETLSKPSIVKPSEVFLICEI--NANSWMATVLEFLNKGN 157 >ref|XP_020203766.1| uncharacterized protein LOC109789265 [Cajanus cajan] Length = 390 Score = 100 bits (249), Expect = 1e-22 Identities = 50/113 (44%), Positives = 72/113 (63%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 +S N+DS L+V Q+ Y+ KD +LQRY + +S F ++HVPR +N+RAD+LSK Sbjct: 172 VSCNSDSKLMVEQLSGAYQTKDTLLQRYFHAASQQISSFDEFTIRHVPREQNVRADLLSK 231 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTK PG HR+IIQETL+ PS+ + + E QGW+ + Y++ G Sbjct: 232 LASTKRPGQHRTIIQETLNSPSL----DDKVVIANKNEDQGWMTGIWSYLKEG 280 >gb|KYP33369.1| Uncharacterized protein Mb2253c family, partial [Cajanus cajan] Length = 138 Score = 95.1 bits (235), Expect = 1e-22 Identities = 51/114 (44%), Positives = 71/114 (62%), Gaps = 1/114 (0%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 +S N+DS L+V Q+ Y+AKD +LQ+Y +S F ++HVPR +N RAD+LSK Sbjct: 21 VSCNSDSKLMVEQLSGTYQAKDTLLQQYFDIASHQISSFDEFTIQHVPREQNARADLLSK 80 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVE-ESQGWIAPLIDYIRTG 341 LA TK PG H++IIQETL PS+ N VN + E QGW+ + Y++ G Sbjct: 81 LAGTKRPGQHQTIIQETLHSPSL-----DNKVVNASDSEDQGWMTSIWSYLKEG 129 >dbj|GAU10080.1| hypothetical protein TSUD_423780, partial [Trifolium subterraneum] Length = 241 Score = 97.8 bits (242), Expect = 1e-22 Identities = 51/110 (46%), Positives = 72/110 (65%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 L +DS LV SQ+ +++AKDP L +YL QV+ L F E+ +VPR +N RAD+LSK Sbjct: 107 LRAKSDSQLVTSQVSGEFQAKDPQLIKYLEQVRSLAKHFNTFELIYVPREQNARADLLSK 166 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYI 332 LASTK PGN+R++IQET++KPS + V +V + W P+I Y+ Sbjct: 167 LASTKKPGNNRTVIQETVAKPST-----GDLEVWMVTRNDDWRTPIIQYL 211 >dbj|GAU28888.1| hypothetical protein TSUD_293400 [Trifolium subterraneum] Length = 1635 Score = 101 bits (252), Expect = 1e-22 Identities = 52/110 (47%), Positives = 74/110 (67%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 L N+DS LV SQ+ +++AKDP L +YL QV+ L F E+ +VPR +N+RAD+LSK Sbjct: 1132 LRANSDSQLVTSQVSGEFQAKDPQLIKYLEQVRSLAKHFNTFELIYVPREQNVRADLLSK 1191 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYI 332 LASTK PGN+R++IQET++KPS + V +V + W P+I Y+ Sbjct: 1192 LASTKKPGNNRTVIQETVAKPST-----GDLEVWMVTRNDDWRTPIIQYL 1236 >gb|KYP76415.1| Uncharacterized protein Mb2253c [Cajanus cajan] Length = 266 Score = 98.2 bits (243), Expect = 1e-22 Identities = 54/113 (47%), Positives = 71/113 (62%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 +S N+DS L+V Q+ Y+AKD +LQRY +S F +KHVPR +N RAD+LSK Sbjct: 106 VSCNSDSKLMVEQLSGTYQAKDTLLQRYFHTASHQISSFDEFTIKHVPREQNARADLLSK 165 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 ASTK PG HR+IIQETL PS + + N S N E GW+A + Y++ G Sbjct: 166 FASTKRPGQHRTIIQETLHSPS-LDDKVVNVSDN---EDLGWMAGIWGYLKEG 214 >dbj|GAU10833.1| hypothetical protein TSUD_425960, partial [Trifolium subterraneum] Length = 174 Score = 95.9 bits (237), Expect = 1e-22 Identities = 50/113 (44%), Positives = 72/113 (63%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 + + TDS LV SQ+ +Y+AK+ L YLT V+E ++ F E++HVPR N RADILSK Sbjct: 20 IKIYTDSQLVASQVLGEYQAKNDNLSEYLTLVKERITKFDSAEIQHVPREHNKRADILSK 79 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTK ++S+IQE LS PS I P +N + ++ W+ P+ +Y+ G Sbjct: 80 LASTKRKNGNKSVIQEILSHPS-IQKPTRVLDINAIGDANCWMTPVYNYLAHG 131 >gb|KYP73954.1| Uncharacterized protein Mb2253c family [Cajanus cajan] Length = 312 Score = 99.0 bits (245), Expect = 2e-22 Identities = 54/116 (46%), Positives = 73/116 (62%), Gaps = 3/116 (2%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 +S N+DS L+V Q+ Y+AKD +LQ YL + +S F ++HVPR +N RAD+LSK Sbjct: 106 VSCNSDSKLMVEQLSGTYQAKDVLLQWYLHMASQQISSFDEFTIQHVPREQNTRADLLSK 165 Query: 183 LASTKIPGNHRSIIQETLSKPSV---ILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTK PG HR+IIQETL PS+ I+N + E QGW+ + Y+R G Sbjct: 166 LASTKRPGQHRTIIQETLHSPSLDDKIVNTSDS-------EEQGWMTGIWSYLRAG 214 >dbj|GAU29444.1| hypothetical protein TSUD_150140 [Trifolium subterraneum] Length = 1507 Score = 100 bits (250), Expect = 2e-22 Identities = 48/111 (43%), Positives = 74/111 (66%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 L + +DS LV +Q+ +++ KDP L +YL +V+ + FT E+ +VPR +N RAD+L+K Sbjct: 1111 LKVQSDSQLVANQVSGEFQTKDPQLAKYLEKVKGMAKQFTMFELTYVPREQNARADLLAK 1170 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIR 335 LASTK PGNHR++IQETL PS+ + +V E + W +P+I Y++ Sbjct: 1171 LASTKKPGNHRTVIQETLKSPSI-----NEVEIGMVVEEEDWRSPIIRYLQ 1216 >dbj|GAU46380.1| hypothetical protein TSUD_280790 [Trifolium subterraneum] Length = 1521 Score = 100 bits (250), Expect = 2e-22 Identities = 53/113 (46%), Positives = 74/113 (65%) Frame = +3 Query: 3 LSLNTDSLLVVSQIGKQYEAKDPILQRYLTQVQELLSLFTFTEVKHVPRSENIRADILSK 182 + + TDS LV SQ+ +Y+AK+ L YLT V+E ++ F E++HVPR N RADILSK Sbjct: 1132 IKIYTDSQLVASQVLGEYQAKNDNLSEYLTLVKERITKFDSVEIQHVPREHNKRADILSK 1191 Query: 183 LASTKIPGNHRSIIQETLSKPSVILNPEANFSVNVVEESQGWIAPLIDYIRTG 341 LASTKI ++SIIQE LS PS I P +N +E++ W+ P+ +Y+ G Sbjct: 1192 LASTKINNGNKSIIQEILSHPS-IEKPTKVLGINAIEDTNCWMTPVYNYLAYG 1243