BLASTX nr result
ID: Astragalus22_contig00007724
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00007724
         (1237 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

dbj|GAU29525.1| hypothetical protein TSUD_115470 [Trifolium subt...   100   9e-19
dbj|GAU10375.1| hypothetical protein TSUD_419040, partial [Trifo...    99   2e-18
gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium prat...    99   2e-18
gb|KHN25579.1| hypothetical protein glysoja_037857, partial [Gly...    88   1e-17
gb|AAO23078.1| polyprotein [Glycine max]                               96   2e-17
gb|PNX55258.1| hypothetical protein L195_g048885, partial [Trifo...    89   3e-17
dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subt...    96   3e-17
gb|PNX71262.1| methylesterase chloroplastic-like, partial [Trifo...    89   3e-17
gb|PNX64913.1| hypothetical protein L195_g054267 [Trifolium prat...    90   5e-17
dbj|GAU45358.1| hypothetical protein TSUD_239070 [Trifolium subt...    94   6e-17
gb|KYP46164.1| hypothetical protein KK1_032268 [Cajanus cajan]         87   2e-16
dbj|GAU43714.1| hypothetical protein TSUD_179970 [Trifolium subt...    93   2e-16
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...    92   3e-16
ref|XP_006598537.1| PREDICTED: uncharacterized protein LOC100793...    91   5e-16
dbj|GAU35592.1| hypothetical protein TSUD_295280 [Trifolium subt...    92   5e-16
dbj|GAU28865.1| hypothetical protein TSUD_293170 [Trifolium subt...    91   7e-16
gb|PNX77624.1| Ty3/gypsy retrotransposon protein [Trifolium prat...    89   9e-16
dbj|GAU37387.1| hypothetical protein TSUD_22610 [Trifolium subte...    91   1e-15
gb|KYP59123.1| hypothetical protein KK1_014552 [Cajanus cajan] >...    85   1e-15
gb|KYP63732.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]    90   2e-15

>dbj|GAU29525.1| hypothetical protein TSUD_115470 [Trifolium subterraneum]
          Length = 1556

 Score = 100 bits (248), Expect = 9e-19
 Identities = 61/138 (44%), Positives = 78/138 (56%)
 Frame = -3

Query: 821  KPCEWKNIILLGHNIGGPYAMGSFPLRVFKVAALALVLMRPVSVAQVWKDMEKSTISQLK 642
            +P ++ L H G G FP+ V KV +A L P + K +SQLK
Sbjct: 1357 QPYRQSSVALRKHQKLGLRYFGPFPI-VAKVGVVAYRLGLPSTT----KIHPVFHVSQLK 1411

Query: 641  PFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIAAATWE 462
            F I PYM LPLTT+ELGP+L P A+L +R +MRG+ I QVLI W+ + A ATWE
Sbjct: 1412 LFHGPHIPPYMPLPLTTSELGPILQPEALLDSRLIMRGNTPISQVLISWEGLETADATWE 1471

Query: 461  NAEEIKKNHANFNLEDKI 408
            + E K H NFNLEDK+
Sbjct: 1472 DLVEFKLAHPNFNLEDKV 1489

>dbj|GAU10375.1| hypothetical protein TSUD_419040, partial [Trifolium subterraneum]
          Length = 708

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 48/89 (53%), Positives = 60/89 (67%)
 Frame = -3

Query: 656 ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
           ISQLK F+ + +PYM LPLTT ELGP+L P VL+ R +MR I QVLI W+
Sbjct: 597 ISQLKQFKGPITDPYMPLPLTTTELGPILQPTTVLQKRDIMRKEQVIPQVLIKWEGLSDK 656

Query: 476 AATWENAEEIKKNHANFNLEDKIHFKEEG 390
           ATWE+ ++I ++ NFNLEDKI FK EG
Sbjct: 657 EATWEDVDDISGSYPNFNLEDKIDFKGEG 685

>gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium pratense]
          Length = 869

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 57/136 (41%), Positives = 79/136 (58%)
 Frame = -3

Query: 791 LGHNIGGPYAMGSFPLRVFKVAALALVLMRPVSVAQVWKDMEKSTISQLKPFQDAVIEPY 612
           LG GP+ + + KV +A + PV K ISQLK F+ +PY
Sbjct: 694 LGMRYFGPFTI------IEKVGKVAYKVQLPVEA----KIHPVFHISQLKQFKGRATDPY 743

Query: 611 MSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIAAATWENAEEIKKNHA 432
           + LPLTT+ELGP+L P AVL+ R ++R I+QVLI W+ + ATWE+ +EI +N+
Sbjct: 744 IPLPLTTHELGPILQPIAVLQRRDIVRNEHAIQQVLIKWEGLNDTDATWEDVDEITENYP 803

Query: 431 NFNLEDKIHFKEEGSA 384
           NFNLEDK+ K +G A
Sbjct: 804 NFNLEDKVEVKGKGIA 819

>gb|KHN25579.1| hypothetical protein glysoja_037857, partial [Glycine soja]
          Length = 101

 Score = 88.2 bits (217), Expect = 1e-17
 Identities = 43/91 (47%), Positives = 59/91 (64%)
 Frame = -3

Query: 656 ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
           ISQLKPF+ + YM LPLTT + P+++P A+L+TR V +GS I QVL+ W++ A
Sbjct: 9   ISQLKPFKGVPHDQYMPLPLTTTDTSPIIAPVAILQTRSVKQGSSFIPQVLVQWENTTPA 68

Query: 476 AATWENAEEIKKNHANFNLEDKIHFKEEGSA 384
           ATWEN E+ + N NLEDK+ +G A
Sbjct: 69  EATWENFNEMLDSFPNLNLEDKVVINGDGIA 99

>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score = 95.9 bits (237), Expect = 2e-17
 Identities = 43/90 (47%), Positives = 63/90 (70%)
 Frame = -3

Query: 656  ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
            +SQLKPF +PY+ LPLT E+GPV+ P +L +R ++RG QI+Q+L+ W++
Sbjct: 1413 VSQLKPFNGTAQDPYLPLPLTVTEMGPVMQPVKILASRIIIRGHNQIEQILVQWENGLQD 1472

Query: 476  AATWENAEEIKKNHANFNLEDKIHFKEEGS 387
            ATWE+ E+IK ++ FNLEDK+ FK EG+
Sbjct: 1473 EATWEDIEDIKASYPTFNLEDKVVFKGEGN 1502

>gb|PNX55258.1| hypothetical protein L195_g048885, partial [Trifolium pratense]
          Length = 160

 Score = 89.0 bits (219), Expect = 3e-17
 Identities = 41/78 (52%), Positives = 56/78 (71%)
 Frame = -3

Query: 620 EPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIAAATWENAEEIKK 441
           EPYM LP+TT+ELGP+L P VL+ R +++G+ I QVLI W++ ATWEN +I
Sbjct: 24  EPYMPLPMTTHELGPILQPARVLQDRVILKGTESIHQVLIQWENVGENEATWENYADIIS 83

Query: 440 NHANFNLEDKIHFKEEGS 387
           ++ NFNLEDK+ FK EG+
Sbjct: 84  SYPNFNLEDKVDFKGEGN 101

>dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subterraneum]
          Length = 1451

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 43/89 (48%), Positives = 63/89 (70%)
 Frame = -3

Query: 656  ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
            +SQLKPF+ + YM LPLT + GP++ P VL+ R +M+G+ +I Q+L+ WD DIA
Sbjct: 1335 VSQLKPFKGVPQQQYMPLPLTMFDNGPMIQPVEVLQARTIMQGTQKIHQILVQWDQYDIA 1394

Query: 476  AATWENAEEIKKNHANFNLEDKIHFKEEG 390
            ATWEN ++++KN +NLEDK+ FK +G
Sbjct: 1395 EATWENVDDLQKNFPLYNLEDKVIFKGDG 1423

>gb|PNX71262.1| methylesterase chloroplastic-like, partial [Trifolium pratense]
          Length = 172

 Score = 89.0 bits (219), Expect = 3e-17
 Identities = 49/104 (47%), Positives = 65/104 (62%), Gaps = 4/104 (3%)
 Frame = -3

Query: 1007 NQFF*VYGGGVWAGCWYKTISIHIENGYRVAIINFTCFGI-SIVTKNITSLSHSLMPLPD 831
            N F V+GGG A CWYKTI++ E+GY+V+ I+ T G+ S T NITSLS + PL D
Sbjct: 11   NHFVLVHGGGFGAWCWYKTIALLEESGYKVSAIDLTGSGVHSFDTNNITSLSQYVTPLTD 70

Query: 830  FFGKPCEWKNIILLGHNIGG---PYAMGSFPLRVFKVAALALVL 708
            F K E K +IL+GH+ GG YAM FPL++ K +A +
Sbjct: 71   FLEKLPEGKKVILVGHDFGGACISYAMELFPLKISKAVFIAAAM 114

>gb|PNX64913.1| hypothetical protein L195_g054267 [Trifolium pratense]
          Length = 227

 Score = 90.1 bits (222), Expect = 5e-17
 Identities = 39/89 (43%), Positives = 64/89 (71%)
 Frame = -3

Query: 656 ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
           ISQLKPF+ + + Y+ LPLT ++ GP+++P VL+ R V++G+ + QVLI WD + ++
Sbjct: 75  ISQLKPFKGPLQDQYLPLPLTMSDSGPIINPIKVLQARTVIKGNQTVHQVLIQWDQHAVS 134

Query: 476 AATWENAEEIKKNHANFNLEDKIHFKEEG 390
           ATWE +++++ +FNLEDK++F EG
Sbjct: 135 EATWEAIDDLQQKFPSFNLEDKVNFNGEG 163

>dbj|GAU45358.1| hypothetical protein TSUD_239070 [Trifolium subterraneum]
          Length = 1227

 Score = 94.4 bits (233), Expect = 6e-17
 Identities = 52/121 (42%), Positives = 72/121 (59%), Gaps = 5/121 (4%)
 Frame = -3

Query: 737  FKVAALALVLMRPVSVAQV-WKDMEKSTISQLKPFQDAVI----EPYMSLPLTTNELGPV 573
            FKV + LV ++P V + +K ++ PF+ EPY+ LPLTT+++GP+
Sbjct: 1061 FKVGDMVLVRLQPYRQHSVNLRKNQKLSMRYFGPFKVLARGDSDEPYIPLPLTTSDIGPI 1120

Query: 572  LSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIAAATWENAEEIKKNHANFNLEDKIHFKEE 393
            L P VL TR VM+G Q+ QVLI W D A WEN ++IK N+ +NLEDK+ FKE
Sbjct: 1121 LLPNKVLDTRMVMQGKTQVPQVLIQWGDEPNADIKWENFQDIKDNYPLYNLEDKVEFKEG 1180

Query: 392  G 390
            G
Sbjct: 1181 G 1181

>gb|KYP46164.1| hypothetical protein KK1_032268 [Cajanus cajan]
          Length = 187

 Score = 87.4 bits (215), Expect = 2e-16
 Identities = 42/83 (50%), Positives = 58/83 (69%)
 Frame = -3

Query: 656 ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
           +S LK F+ + + Y+ LPLTT ELGP + P VL +R +MR S + QVLI WD D+A
Sbjct: 75  VSLLKAFKGSPSQVYLPLPLTTTELGPTVQPLQVLDSRIIMRQSQSVPQVLIQWDSLDVA 134

Query: 476 AATWENAEEIKKNHANFNLEDKI 408
           AATWE+ EI+++ +FNLEDK+
Sbjct: 135 AATWEDTVEIQESFPDFNLEDKV 157

>dbj|GAU43714.1| hypothetical protein TSUD_179970 [Trifolium subterraneum]
          Length = 1291

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 43/89 (48%), Positives = 60/89 (67%)
 Frame = -3

Query: 656  ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
            I+QLK F+ EPY+ LPLTT ++GP L P AVL +R +++G Q+ QVLI W ++ +
Sbjct: 1156 IAQLKAFKGGTDEPYIPLPLTTTDVGPALIPTAVLDSRMIIQGKTQVPQVLIQWGEDKLT 1215

Query: 476  AATWENAEEIKKNHANFNLEDKIHFKEEG 390
            WE+ +EIK N+ NLEDK+ FKE G
Sbjct: 1216 EIKWESFQEIKDNYPQLNLEDKVIFKEGG 1244

>gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1535

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 41/89 (46%), Positives = 60/89 (67%)
 Frame = -3

Query: 656  ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
            +SQLKPF+ A + Y+ LPLT E GP++ P AVL+ R +MRG+ ++ Q+L+ WD N A
Sbjct: 1390 VSQLKPFKGAASDQYLPLPLTMTETGPIMQPIAVLQARTIMRGTQRVHQILVQWDTNAEA 1449

Query: 476  AATWENAEEIKKNHANFNLEDKIHFKEEG 390
            ATWE+ ++++ NLEDK+ F EG
Sbjct: 1450 EATWEDFDDLQLKFPTLNLEDKVVFNGEG 1478

>ref|XP_006598537.1| PREDICTED: uncharacterized protein LOC100793977 [Glycine max]
          Length = 490

 Score = 90.9 bits (224), Expect = 5e-16
 Identities = 52/134 (38%), Positives = 76/134 (56%)
 Frame = -3

Query: 791 LGHNIGGPYAMGSFPLRVFKVAALALVLMRPVSVAQVWKDMEKSTISQLKPFQDAVIEPY 612
           LG GP+ + + KV A+A L P A++ +SQLK F+ E Y
Sbjct: 311 LGMRFFGPFKI------LAKVGAVAYKLELPAE-ARIHNVFH---VSQLKLFKGTPGEQY 360

Query: 611 MSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIAAATWENAEEIKKNHA 432
           + LPLTT E GP++ P +L+ R +++G+ ++ QVL+ W+ D A ATWEN E + N
Sbjct: 361 LPLPLTTTESGPIIMPSKLLQVRTLLKGNQKVPQVLVQWEGTDEAEATWENVAEFQANFP 420

Query: 431 NFNLEDKIHFKEEG 390
           NFNLEDK+ K +G
Sbjct: 421 NFNLEDKVVLKGDG 434

>dbj|GAU35592.1| hypothetical protein TSUD_295280 [Trifolium subterraneum]
          Length = 1358

 Score = 91.7 bits (226), Expect = 5e-16
 Identities = 50/119 (42%), Positives = 73/119 (61%)
 Frame = -3

Query: 758  GSFPLRVFKVAALALVLMRPVSVAQVWKDMEKSTISQLKPFQDAVIEPYMSLPLTTNELG 579
            G FP+ K+ ++A L P S A++ ISQLK F + +PY L TT LG
Sbjct: 1187 GPFPVTA-KIGSIAYKLQLP-STARIHPVFH---ISQLKKFNGSATDPYYPLSDTTTVLG 1241

Query: 578  PVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIAAATWENAEEIKKNHANFNLEDKIHF 402
            P+L P ++L+ R +++G + + QVL+ W D D + ATWE+ +EI +N+ NFNLEDKI F
Sbjct: 1242 PLLQPESILKVRTILKGPLLVPQVLVKWQDIDESLATWEDKKEILENYPNFNLEDKIVF 1300

>dbj|GAU28865.1| hypothetical protein TSUD_293170 [Trifolium subterraneum]
          Length = 824

 Score = 90.9 bits (224), Expect = 7e-16
 Identities = 43/86 (50%), Positives = 58/86 (67%)
 Frame = -3

Query: 656 ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
           ISQLKPF+ +PY+ LPLTT+ELGP+L P AVL++R ++R I QVLI W+
Sbjct: 589 ISQLKPFKGQTSDPYIPLPLTTHELGPILQPMAVLKSRNILRKDQVIPQVLIKWESLSNT 648

Query: 476 AATWENAEEIKKNHANFNLEDKIHFK 399
           TWE+ ++ +N+ FNLEDKI K
Sbjct: 649 DVTWEDVKDKAENYPTFNLEDKIDVK 674

>gb|PNX77624.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
 gb|PNY16672.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 367

 Score = 89.0 bits (219), Expect = 9e-16
 Identities = 39/89 (43%), Positives = 59/89 (66%)
 Frame = -3

Query: 656 ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
           +SQLKPF+ V E Y+ LPLT N+ GP++ P AVL+ R + +G+ ++ Q+L+ W+ N
Sbjct: 221 VSQLKPFKGNVTEHYLPLPLTMNDTGPIIQPVAVLQARTIRKGTQKVHQILVQWEQNSKD 280

Query: 476 AATWENAEEIKKNHANFNLEDKIHFKEEG 390
           AATWE+ +++ NLEDK+ F EG
Sbjct: 281 AATWEDLHDLQFKFPTLNLEDKVVFNGEG 309

>dbj|GAU37387.1| hypothetical protein TSUD_22610 [Trifolium subterraneum]
          Length = 1418

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 39/95 (41%), Positives = 61/95 (64%)
 Frame = -3

Query: 656  ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
            +SQLKPF+ + Y+ LPLT +E GP++ P AVL+ R +MRG+ ++ Q+L+ WD ++
Sbjct: 1270 VSQLKPFKGETKDQYLPLPLTMSETGPIIQPIAVLQARTIMRGTQKVHQILVQWDQLPVS 1329

Query: 476  AATWENAEEIKKNHANFNLEDKIHFKEEGSAAGNN 372
            ATWE+ + ++ NLEDK+ F EG +N
Sbjct: 1330 EATWEDLDALQNKFPTLNLEDKVSFNGEGIVMRSN 1364

>gb|KYP59123.1| hypothetical protein KK1_014552 [Cajanus cajan]
 gb|KYP59152.1| hypothetical protein KK1_014583 [Cajanus cajan]
          Length = 177

 Score = 84.7 bits (208), Expect = 1e-15
 Identities = 41/83 (49%), Positives = 57/83 (68%)
 Frame = -3

Query: 656 ISQLKPFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIA 477
           +S LK F+ + + Y+ LPLTT ELGP + P VL +R +MR S I Q+LI WD D+A
Sbjct: 34  VSLLKAFKGSPSQVYLPLPLTTTELGPTVQPLQVLDSRVIMRQSQSIPQLLIQWDSLDVA 93

Query: 476 AATWENAEEIKKNHANFNLEDKI 408
           AATWE+ EI+++ +FN EDK+
Sbjct: 94  AATWEDTAEIQESFPDFNHEDKV 116

>gb|KYP63732.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1084

 Score = 90.1 bits (222), Expect = 2e-15
 Identities = 55/145 (37%), Positives = 77/145 (53%)
 Frame = -3

Query: 821  KPCEWKNIILLGHNIGGPYAMGSFPLRVFKVAALALVLMRPVSVAQVWKDMEKSTISQLK 642
            +P ++ L H G G FP+ + K+ ++A L+ P S K +S LK
Sbjct: 885  QPYRQHSVALRKHQKLGLRYFGPFPI-IKKIGSVAYKLLLPASA----KIHSVFHVSLLK 939

Query: 641  PFQDAVIEPYMSLPLTTNELGPVLSPRAVLRTRGVMRGSVQIKQVLIVWDDNDIAAATWE 462
            + PY+ LPL TNE GPV+ P +L +R ++RG I QVLI WD D ATWE
Sbjct: 940  KCKGNHQTPYLPLPLLTNEFGPVVQPSRILDSRTIIRGDQHIAQVLIQWDGLDATQATWE 999

Query: 461  NAEEIKKNHANFNLEDKIHFKEEGS 387
            +A I K++ NF LEDK+ F G+
Sbjct: 1000 DATVIHKDYPNFYLEDKVDFYGGGN 1024