BLASTX nr result
ID: Astragalus24_contig00012207
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus24_contig00012207 (844 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNX91063.1| hypothetical protein L195_g047192, partial [Trifo... 224 7e-68 ref|XP_020208977.1| uncharacterized protein LOC109793916 [Cajanu... 222 4e-67 gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan] >... 222 4e-66 ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798... 214 9e-64 gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly... 214 7e-63 gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly... 214 7e-63 ref|XP_021671488.1| uncharacterized protein LOC110658261 [Hevea ... 211 7e-63 ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356... 221 1e-62 dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subt... 211 2e-62 ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662... 211 2e-62 ref|XP_017423564.1| PREDICTED: uncharacterized protein LOC108332... 204 1e-60 gb|KHN30273.1| hypothetical protein glysoja_042433, partial [Gly... 207 2e-60 gb|PNX93614.1| retrovirus-related Pol polyprotein from transposo... 216 6e-60 gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [... 209 7e-60 ref|XP_004497583.1| PREDICTED: uncharacterized protein LOC101505... 203 8e-60 gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium prat... 203 1e-59 gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposo... 206 2e-59 gb|PNX85368.1| hypothetical protein L195_g041436, partial [Trifo... 200 5e-59 gb|PNY08535.1| retrovirus-related Pol polyprotein from transposo... 211 2e-58 dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt... 207 7e-57 >gb|PNX91063.1| hypothetical protein L195_g047192, partial [Trifolium pratense] Length = 359 Score = 224 bits (570), Expect = 7e-68 Identities = 112/274 (40%), Positives = 158/274 (57%), Gaps = 9/274 (3%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVL+W+QR +S ++ KSI++ DK W +L+ RF QGDIF+IAD+ DDL + QGT DI Sbjct: 82 MVLSWLQRAISESISKSILWIDKASSVWTNLELRFSQGDIFRIADIQDDLTRFQQGTLDI 141 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 S+YYT+L A+W+E+DN+ P +CTC PC CGA +K ++QD I FLKGLNE+YS+V Sbjct: 142 SNYYTQLTAMWEEIDNFRPTKNCTCAIPCTCGAASDFQKYKEQDKVIKFLKGLNEQYSHV 201 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANN---------IS 513 RSQIM+++ P +SK F LV+ QER L + TP TE Q ++ ++ Sbjct: 202 RSQIMLIEPLPILSKTFSLVLVQERQLNLPTPYDPSTEKQSLAMQVQSSSFNGGGRGKSQ 261 Query: 514 XXXXXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPT 693 R+CTHCGK N + C+ +G+P G+Q KNNK Sbjct: 262 FPNKGRGRAGFNGGRGRGGLGDGDDTRVCTHCGKNNHIVQNCFVKYGYPPGFQHKNNK-- 319 Query: 694 ASANSAGTESESQTKAEAPNTSNSNISLTHDQYQ 795 AS N A + QT + S +++ +QYQ Sbjct: 320 ASVNHAANFASEQTSTQETAPSTPSLNTIQEQYQ 353 >ref|XP_020208977.1| uncharacterized protein LOC109793916 [Cajanus cajan] Length = 357 Score = 222 bits (565), Expect = 4e-67 Identities = 114/276 (41%), Positives = 164/276 (59%), Gaps = 11/276 (3%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MV++W+Q +S ++KSI++FD D W+DLK RF QGD+F++A L +DL K QG+ D+ Sbjct: 37 MVISWLQHSISEKIVKSILWFDTASDIWQDLKARFSQGDVFRVAQLQEDLYKFHQGSLDV 96 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 ++Y+T+LK +WDE+DN PLS C C+ C+CGA+ S K R+QD I FL+GLN++Y++V Sbjct: 97 TEYFTQLKEMWDEIDNLRPLSRCKCSIACSCGAVDSSYKYREQDAVIRFLRGLNDQYTHV 156 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVA-----TPTVSETE--NQVKTPRIANNISXX 519 RSQIM+MD P++SK F LV QQERHL + T ++ T + +TP + S Sbjct: 157 RSQIMLMDPLPSLSKTFSLVGQQERHLNQSAIHDDTKVLAATSFGSLPQTPTTQQHQSPQ 216 Query: 520 XXXXXXXXXXXXXXXXXXXXXXTN---RLCTHCGKTNQTIDFCYFIHGFPQGYQVK-NNK 687 T+ ++CTHCG+ N T+D CYF HGFP GYQ K Sbjct: 217 QQQFGFRRGGYSHGRGRGRGGRTHGSIKICTHCGRNNHTVDTCYFKHGFPPGYQSKGGTS 276 Query: 688 PTASANSAGTESESQTKAEAPNTSNSNISLTHDQYQ 795 + N+ T S S + P ++N N T +Q Q Sbjct: 277 ANFTVNAVETTSPS---SMVPESNNPNFGFTQEQCQ 309 >gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan] gb|KYP72745.1| hypothetical protein KK1_005345 [Cajanus cajan] Length = 445 Score = 222 bits (565), Expect = 4e-66 Identities = 114/276 (41%), Positives = 164/276 (59%), Gaps = 11/276 (3%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MV++W+Q +S ++KSI++FD D W+DLK RF QGD+F++A L +DL K QG+ D+ Sbjct: 82 MVISWLQHSISEKIVKSILWFDTASDIWQDLKARFSQGDVFRVAQLQEDLYKFHQGSLDV 141 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 ++Y+T+LK +WDE+DN PLS C C+ C+CGA+ S K R+QD I FL+GLN++Y++V Sbjct: 142 TEYFTQLKEMWDEIDNLRPLSRCKCSIACSCGAVDSSYKYREQDAVIRFLRGLNDQYTHV 201 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVA-----TPTVSETE--NQVKTPRIANNISXX 519 RSQIM+MD P++SK F LV QQERHL + T ++ T + +TP + S Sbjct: 202 RSQIMLMDPLPSLSKTFSLVGQQERHLNQSAIHDDTKVLAATSFGSLPQTPTTQQHQSPQ 261 Query: 520 XXXXXXXXXXXXXXXXXXXXXXTN---RLCTHCGKTNQTIDFCYFIHGFPQGYQVK-NNK 687 T+ ++CTHCG+ N T+D CYF HGFP GYQ K Sbjct: 262 QQQFGFRRGGYSHGRGRGRGGRTHGSIKICTHCGRNNHTVDTCYFKHGFPPGYQSKGGTS 321 Query: 688 PTASANSAGTESESQTKAEAPNTSNSNISLTHDQYQ 795 + N+ T S S + P ++N N T +Q Q Sbjct: 322 ANFTVNAVETTSPS---SMVPESNNPNFGFTQEQCQ 354 >ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798459 [Glycine max] Length = 389 Score = 214 bits (545), Expect = 9e-64 Identities = 110/270 (40%), Positives = 160/270 (59%), Gaps = 4/270 (1%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAWI R +S ++ KS+++ D WK+L+ RF Q DIF+I+DL +DL + QGT D+ Sbjct: 82 MVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQGTLDV 141 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+LK WDEL+NY P+ C C+ PC+CG I S+R R+QDY + FLKGLN+ +S+ Sbjct: 142 SDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVVRFLKGLNDRFSHS 201 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERH-LQVATPTVSE-TENQVKTPRIANNISXXXXXXX 534 +SQIMMM+ P I F LVIQQER L + +VSE T + ++ +N S Sbjct: 202 KSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNFNGKGG 261 Query: 535 XXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAG 714 NR+CTHCGKTN +D C+ G+P GY+ +K ++S++ A Sbjct: 262 YYNKGKGSSKGG------NRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 315 Query: 715 TESESQT--KAEAPNTSNSNISLTHDQYQG 798 S + + +++ S+ T + YQG Sbjct: 316 NTSNASALESTQQGSSAQSSFQFTQEMYQG 345 >gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja] Length = 484 Score = 214 bits (546), Expect = 7e-63 Identities = 111/270 (41%), Positives = 160/270 (59%), Gaps = 4/270 (1%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAWI R +S ++ KS+++ D WK+L+ RF Q DIF+I+DL +DL + QGT D+ Sbjct: 74 MVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQGTLDV 133 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+LK WDEL+NY P+ C C+ PC+CG I S+R R+QDY I FLKGLN+ +S+ Sbjct: 134 SDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLNDRFSHS 193 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERH-LQVATPTVSE-TENQVKTPRIANNISXXXXXXX 534 +SQIMMM+ P I F LVIQQER L + +VSE T + ++ +N S Sbjct: 194 KSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNFNGKGG 253 Query: 535 XXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAG 714 NR+CTHCGKTN +D C+ G+P GY+ +K ++S++ A Sbjct: 254 YYNKGKGSSKGG------NRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 307 Query: 715 TESESQT--KAEAPNTSNSNISLTHDQYQG 798 S + + +++ S+ T + YQG Sbjct: 308 NTSNASALESTQQGSSAQSSFQFTQEMYQG 337 >gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja] Length = 484 Score = 214 bits (546), Expect = 7e-63 Identities = 111/270 (41%), Positives = 160/270 (59%), Gaps = 4/270 (1%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAWI R +S ++ KS+++ D WK+L+ RF Q DIF+I+DL +DL + QGT D+ Sbjct: 74 MVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQGTLDV 133 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+LK WDEL+NY P+ C C+ PC+CG I S+R R+QDY I FLKGLN+ +S+ Sbjct: 134 SDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLNDRFSHS 193 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERH-LQVATPTVSE-TENQVKTPRIANNISXXXXXXX 534 +SQIMMM+ P I F LVIQQER L + +VSE T + ++ +N S Sbjct: 194 KSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNFNGKGG 253 Query: 535 XXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAG 714 NR+CTHCGKTN +D C+ G+P GY+ +K ++S++ A Sbjct: 254 YYNKGKGSSKGG------NRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 307 Query: 715 TESESQT--KAEAPNTSNSNISLTHDQYQG 798 S + + +++ S+ T + YQG Sbjct: 308 NTSNASALESTQQGSSAQSSFQFTQEMYQG 337 >ref|XP_021671488.1| uncharacterized protein LOC110658261 [Hevea brasiliensis] Length = 363 Score = 211 bits (537), Expect = 7e-63 Identities = 110/272 (40%), Positives = 160/272 (58%), Gaps = 7/272 (2%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVL+W+ +S ++ SI++ DK D WKDLK+ F QGDI +I DL +D+ + QG R + Sbjct: 1 MVLSWLIHSLSPSITHSILWIDKTVDVWKDLKEAFSQGDILRILDLQEDIFSIKQGDRSV 60 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 +DY+TELK LWDEL N+ P+ C+C CA GA ++K D DY I FLKGLN++Y+ Sbjct: 61 TDYFTELKILWDELLNFRPIPVCSCENSCAYGAFLKIKKYHDHDYVIRFLKGLNDQYAIA 120 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPR-IANNISXXXXXXXX 537 +SQIM++DLFP+I+KAF L++QQER L + TE +V R + +++S Sbjct: 121 KSQIMLLDLFPSINKAFSLLVQQERQL----APILATEPKVFVNRSVRSDVSNSGAKPFF 176 Query: 538 XXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAGT 717 +R CT+CGK TI+ CY HG+P GY+ + +A AN+ Sbjct: 177 APKTFTGIPKQFSSSRDDRFCTYCGKPRHTIETCYKKHGYPLGYKPRGY--SAFANNIFG 234 Query: 718 ESESQTKAEAP------NTSNSNISLTHDQYQ 795 +E ++ + P + NSNI LT +QYQ Sbjct: 235 SAEIESPTDTPISLAQGSNGNSNIGLTQEQYQ 266 >ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356267 [Lupinus angustifolius] Length = 834 Score = 221 bits (562), Expect = 1e-62 Identities = 112/274 (40%), Positives = 159/274 (58%), Gaps = 8/274 (2%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVL+WIQ V +++KSI++ D +AWKDL DRF GDIF+IA L + + QG DI Sbjct: 80 MVLSWIQHCVDESIVKSILWIDTTAEAWKDLHDRFSHGDIFRIAALQKEFYHLDQGNLDI 139 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+LK LWDE++++ P SC C TPC CGA+ S++ ++QDY I FL+GLNE++++V Sbjct: 140 SDYFTKLKTLWDEIEDFRPFPSCKCNTPCICGAMDSLKTYKEQDYVIRFLEGLNEQFAHV 199 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQV------KTPRIANNI--SX 516 +SQIM+MD P I+KAF L+IQQER Q+ P E +N+V + + NN + Sbjct: 200 KSQIMLMDPLPNITKAFALLIQQERQTQLPVPPSLEPDNRVMNVSSRQDSQYRNNSTNNS 259 Query: 517 XXXXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTA 696 NR CT+C +TN TI+ CY HG+P GYQ + Sbjct: 260 FRGRGIIPFRGRGNRAAGFGRGQNNRFCTYCERTNHTIETCYLKHGYPPGYQSTRSSKMV 319 Query: 697 SANSAGTESESQTKAEAPNTSNSNISLTHDQYQG 798 + + + S A T N++ S T +Q QG Sbjct: 320 NHTTGYSFDTSTNNEAAHQTQNNSTSFTKEQVQG 353 >dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subterraneum] Length = 404 Score = 211 bits (537), Expect = 2e-62 Identities = 106/270 (39%), Positives = 153/270 (56%), Gaps = 6/270 (2%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVL+WIQR +S ++KSI++ D WK L+ RF GDIF+IAD+ +++ + QGT DI Sbjct: 82 MVLSWIQRSISETIVKSIMWCDCAAVVWKCLERRFAHGDIFRIADILEEIARYQQGTLDI 141 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 S Y+T L LW+EL+N+ PL C+C PC CGA ++K ++QD I FLKGLNE+Y++V Sbjct: 142 SSYFTHLTTLWEELENFRPLKDCSCAIPCTCGAASDLKKYKEQDKVIKFLKGLNEQYASV 201 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNI------SXXX 522 RSQIM++D P I + F LV+QQER + + T + + Q ++ Sbjct: 202 RSQIMLLDPLPDIDRCFSLVLQQERQMLIPIITDNSVDQQASIMQVRQTSYNHGKHYTSF 261 Query: 523 XXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASA 702 NR CTHCG+ N +D C+ +HG+P GYQ KN+K S Sbjct: 262 SSTHHGGRGRGRGNHHGGRGPNNRTCTHCGRHNHIVDTCFELHGYPPGYQHKNSK---SV 318 Query: 703 NSAGTESESQTKAEAPNTSNSNISLTHDQY 792 N A T S + K N +++ I+ +QY Sbjct: 319 NVAATASNATLKEGHINLTSATINTIQEQY 348 >ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662412 [Glycine max] Length = 424 Score = 211 bits (538), Expect = 2e-62 Identities = 102/272 (37%), Positives = 160/272 (58%), Gaps = 8/272 (2%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 +VL+W+QR S + KS+++ D+ WK L++RF QGDIF++AD+ +++ + QGT DI Sbjct: 81 LVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDIFRVADIQEEVACLQQGTLDI 140 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 S Y+T+L LW+E++N+ P+ CTC PC+CGA +RK ++QD I FLKGL ++YS+V Sbjct: 141 SSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKFKEQDKVIKFLKGLGDQYSHV 200 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNISXXXXXXXXX 540 RSQIM+M P + AF L++QQER + + T S ENQ + S Sbjct: 201 RSQIMLMSPLPTLDNAFNLILQQERQFNLPSTTDSSIENQSSVNHFSQTPSRPSNNSGCG 260 Query: 541 XXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASAN----- 705 NRLCTHC +TN T++ C+ HG+P G+Q + + + +A+ Sbjct: 261 RGRGYSSGGRG-----NRLCTHCNRTNHTVETCFIKHGYPPGFQHRKSNSSGNASVVNSV 315 Query: 706 -SAGTE--SESQTKAEAPNTSNSNISLTHDQY 792 AG+ S S + + + N S++++S +QY Sbjct: 316 QDAGSAHISSSSSASTSTNGSSASLSTIQEQY 347 >ref|XP_017423564.1| PREDICTED: uncharacterized protein LOC108332768 [Vigna angularis] Length = 317 Score = 204 bits (518), Expect = 1e-60 Identities = 99/236 (41%), Positives = 142/236 (60%), Gaps = 13/236 (5%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAWI R + ++L+SI++ D+ + W+DL+D F Q D+F+++DL +++ ++ QGT I Sbjct: 84 MVLAWIHRSIDDSILQSILWIDQASEVWQDLQDHFSQADMFRVSDLQEEIFRLQQGTLTI 143 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 S Y+T+LK LWDE +NY P+ C C+ PC C AIQ+ ++ RDQDY I FLKGLNE++S+V Sbjct: 144 SQYFTQLKGLWDEFENYRPILHCKCSIPCTCEAIQAYKRYRDQDYVIRFLKGLNEQFSHV 203 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRI-------------A 501 RSQIM++D P I+K F L++QQER Q+ T +E + KT + Sbjct: 204 RSQIMLLDPLPPINKVFSLIVQQER--QMTTIERTELSSDAKTFAVNTYQYTSLGRGLAV 261 Query: 502 NNISXXXXXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGY 669 N N+LCT+CGKTN T++ CYF HGF GY Sbjct: 262 NPNQYFGYGRGRGNRGRGRATIGRGQNSFNKLCTYCGKTNHTVETCYFKHGFHLGY 317 >gb|KHN30273.1| hypothetical protein glysoja_042433, partial [Glycine soja] Length = 456 Score = 207 bits (527), Expect = 2e-60 Identities = 100/272 (36%), Positives = 160/272 (58%), Gaps = 8/272 (2%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 +VL+W+QR S + KS+++ D+ WK L++RF QGDIF++AD+ +++ ++ QGT +I Sbjct: 66 LVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDIFRVADIQEEVARLQQGTLEI 125 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 S Y+T+L LW+E++N+ P+ CTC PC+CGA +RK ++QD I FLKGL ++YS+V Sbjct: 126 SSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKFKEQDKVIKFLKGLGDQYSHV 185 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNISXXXXXXXXX 540 RSQIM+M P + AF L++QQER + + T S ENQ + S Sbjct: 186 RSQIMLMSPLPTLDNAFNLILQQERQFNLPSTTDSSIENQSSVNHFSQTPSRPSNNFGCG 245 Query: 541 XXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASAN----- 705 NRL THC +TN T++ C+ HG+P G+Q + + + +A+ Sbjct: 246 RGRGYSSGGRG-----NRLRTHCNRTNHTVETCFIKHGYPPGFQHRKSNSSGNASMVNSV 300 Query: 706 -SAGTE--SESQTKAEAPNTSNSNISLTHDQY 792 AG+ S S + + + N S++++S +QY Sbjct: 301 QDAGSAHISSSSSASTSTNGSSASLSTIQEQY 332 >gb|PNX93614.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1430 Score = 216 bits (549), Expect = 6e-60 Identities = 116/287 (40%), Positives = 167/287 (58%), Gaps = 9/287 (3%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVL+WIQR +S ++ KSII+FD WKDL+ RF GD+FKI+DL +++ ++ QG+ DI Sbjct: 82 MVLSWIQRSISPDIAKSIIWFDHASAVWKDLEFRFSHGDMFKISDLQEEILRLHQGSLDI 141 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 S YYT+LK+L +E++ Y P+ CTC PC+CGA+ M+K R+QD + FLKGLNE+YS+V Sbjct: 142 SSYYTQLKSLSEEIEIYRPVRDCTCAIPCSCGAVADMKKYREQDCVLKFLKGLNEQYSHV 201 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETE-----NQVKTPRIANNISXXXX 525 RSQIMMM+ P + K F LV+QQER+L V S+ E QV++ + S Sbjct: 202 RSQIMMMEPLPPLHKVFSLVLQQERNLPVFNTVDSQNELSAMAMQVQSTGSNSQPSKNFN 261 Query: 526 XXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASAN 705 + R CTHCG N ID C+ +GFP GYQ +K S+N Sbjct: 262 FGSGNRGRGKGRRNFGRGQHSTRYCTHCGGDNHIIDNCFVKYGFPPGYQ---SKGVQSSN 318 Query: 706 SAGTESESQTKAEAPNTSN----SNISLTHDQYQGFWTYFNKQDNNN 834 + S T +++ S+ S+++ Q+Q F F +Q +N Sbjct: 319 AKSVNLASTTNSDSSLVSSSAMASSLNELQGQFQQFLKLFQQQTESN 365 >gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense] Length = 591 Score = 209 bits (532), Expect = 7e-60 Identities = 106/270 (39%), Positives = 154/270 (57%), Gaps = 4/270 (1%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAWI R +S ++ +S+++ D WK+L+ RF QGDIF+I+DL ++L ++ QG D+ Sbjct: 80 MVLAWIHRSLSESIARSVLWIDSAAGLWKNLRTRFSQGDIFRISDLQEELYRLRQGNLDV 139 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+LK LWDEL+NY P+ C C+ C CGAI+S + R+QDY I FLKGLN+ +SN Sbjct: 140 SDYFTKLKVLWDELENYRPIPFCKCSIACTCGAIESFKVYREQDYVIRFLKGLNDRFSNT 199 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHL--QVATP-TVSETENQVKTPRIANNISXXXXXX 531 +SQIM+M+ P + F ++IQQER + + P T E T +AN+ Sbjct: 200 KSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDSSTALLANSHYRNQNGK 259 Query: 532 XXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVK-NNKPTASANS 708 NRLCT+C TN + C+ +G+P GY+ K N S Sbjct: 260 TNYYGKGKGQAPNSAPKGYNRLCTYCKGTNHIVQNCWIKYGYPPGYKNKGKNSSQPSHTV 319 Query: 709 AGTESESQTKAEAPNTSNSNISLTHDQYQG 798 A +S +Q +++ T+ LT DQY G Sbjct: 320 AAVDSSTQPDSQSSTTATPPFGLTQDQYDG 349 >ref|XP_004497583.1| PREDICTED: uncharacterized protein LOC101505117 [Cicer arietinum] Length = 355 Score = 203 bits (516), Expect = 8e-60 Identities = 98/270 (36%), Positives = 158/270 (58%), Gaps = 6/270 (2%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 M L+W+Q + ++ SI++ D + WK L+++F QGDIFKI+D+ DDL ++ QG DI Sbjct: 41 MTLSWLQCSILESIAHSILWIDNAHTVWKILENQFSQGDIFKISDIQDDLTRLQQGNLDI 100 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 +Y+T+L +LW+++D++ P C C C CG +RK ++QD I FL GLNE++SNV Sbjct: 101 INYFTKLTSLWEQIDSFRPTRDCVCAIQCTCGDTTDLRKYKNQDRVIKFLNGLNEQFSNV 160 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNI-----SXXXX 525 RSQIM+++ P++ K F LV+ QER L V + S ENQ ++ NN Sbjct: 161 RSQIMLLEPLPSLDKTFSLVLGQERQLNVQASSNSAPENQAMAMQVQNNHYNGGGRGTNN 220 Query: 526 XXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASAN 705 +NR+CTHCG+TN T++ C+ HG+P G+Q ++++ +A Sbjct: 221 SNNRGKGHNNSAFGRGPYQNSNRICTHCGRTNHTVETCFLKHGYPPGFQQRHSR---AAF 277 Query: 706 SAGTESESQTKAEAPNTS-NSNISLTHDQY 792 + T S+SQ + S ++++++ DQY Sbjct: 278 NTATASDSQDSSPVDQESTDASLTIIQDQY 307 >gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium pratense] Length = 392 Score = 203 bits (517), Expect = 1e-59 Identities = 103/276 (37%), Positives = 152/276 (55%), Gaps = 3/276 (1%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAWI R +S ++ +S+++ D WK+L+ RF QGDIF+I+D+ ++L K QGT DI Sbjct: 82 MVLAWIHRSISDSIARSVLWIDTAAGVWKNLRIRFSQGDIFRISDIQEELYKFRQGTLDI 141 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+LK LWDEL+NY P+ C C+ C CGAI S+ R QDY I FLKGLN+++S+ Sbjct: 142 SDYFTQLKVLWDELENYRPIPHCKCSIACTCGAIDSINIYRQQDYVIRFLKGLNDKFSHT 201 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANN-ISXXXXXXXX 537 +SQIM+M+ P I F ++IQQER ++ + N +N ++ Sbjct: 202 KSQIMLMNPLPDIDTVFSMLIQQER--EIGNSVIDSIVNDAPDKNSSNVFLANSSYGNFH 259 Query: 538 XXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGY--QVKNNKPTASANSA 711 +NR CTHC TN ++ C+ HG+P GY + KN+ + ANSA Sbjct: 260 GKYNSKGKGQHSGSKGSNRFCTHCQGTNHIVENCWIKHGYPIGYKGKGKNSFQSTQANSA 319 Query: 712 GTESESQTKAEAPNTSNSNISLTHDQYQGFWTYFNK 819 + +++ T +QY G F + Sbjct: 320 AVPNSPMQLDSTTSSTKPPFGFTQEQYHGILGLFQQ 355 >gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 495 Score = 206 bits (523), Expect = 2e-59 Identities = 109/262 (41%), Positives = 156/262 (59%), Gaps = 2/262 (0%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAWI R +S ++ KS+++ D WK+L+ RF DIF+I+DL +DL + QGT D+ Sbjct: 71 MVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSHSDIFRISDLQEDLYRFRQGTLDV 130 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+LK WDEL+NY P+ C C+ PC+CG I S+R R+QDY I FLKGLN+ +S+ Sbjct: 131 SDYFTQLKIYWDELENYRPIPYCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLNDRFSHS 190 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERH-LQVATPTVSE-TENQVKTPRIANNISXXXXXXX 534 +SQIMMM+ P I F LVIQQER L + +VSE T + ++ +N S Sbjct: 191 KSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNFNGKGG 250 Query: 535 XXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAG 714 NR+CTHCGKTN +D C+ G+P GY+ +K ++S++ A Sbjct: 251 YYNKGKGSSKGG------NRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 304 Query: 715 TESESQTKAEAPNTSNSNISLT 780 S + + E+ +S S+T Sbjct: 305 NTSNA-SALESTQQGSSAQSIT 325 >gb|PNX85368.1| hypothetical protein L195_g041436, partial [Trifolium pratense] Length = 337 Score = 200 bits (509), Expect = 5e-59 Identities = 98/236 (41%), Positives = 138/236 (58%), Gaps = 7/236 (2%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVL+W+QR +S ++ KSI++ DK W +L+ RF QGDIF+IAD+ DDL + QGT DI Sbjct: 82 MVLSWLQRSISESISKSILWIDKASSVWTNLELRFSQGDIFRIADIQDDLTRFQQGTLDI 141 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 S+YYT+L A+W+E+DN+ P +CTC PC CGA +K ++QD I FLKGLNE+YS+V Sbjct: 142 SNYYTQLTAMWEEIDNFRPTKNCTCAIPCTCGAASDFQKYKEQDKVIKFLKGLNEQYSHV 201 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNIS-------XX 519 RS IM+++ P +SK F +V+ QER L + TE Q ++ ++ S Sbjct: 202 RSHIMLIEPLPNLSKTFSMVLGQERQLNLPILPDPSTEKQPLAMQVQSSSSNGGGRGKSQ 261 Query: 520 XXXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNK 687 CTHCGK N + C+ +G+P G+Q KNNK Sbjct: 262 YPNKGRGRANFSGGRGGLGGGRDTGGCTHCGKNNHIVQNCFVKYGYPPGFQQKNNK 317 >gb|PNY08535.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1205 Score = 211 bits (538), Expect = 2e-58 Identities = 108/270 (40%), Positives = 154/270 (57%), Gaps = 4/270 (1%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAW+ R VS ++ +SI++ D WK+L+ RF QGDIF+I+D+ ++L + QG DI Sbjct: 80 MVLAWLHRSVSESIARSILWIDSAAGVWKNLRIRFSQGDIFRISDIQEELYRFRQGNLDI 139 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+LK LWDEL+NY P+ C C+ PC CGAI S + R+QDY I FLKGLN+ +SN Sbjct: 140 SDYFTKLKVLWDELENYRPIPLCKCSIPCTCGAIDSFKVYREQDYVIRFLKGLNDRFSNT 199 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHL--QVATP-TVSETENQVKTPRIANNISXXXXXX 531 +SQIM+M+ P + F ++IQQER + + P T E T +AN+ S Sbjct: 200 KSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDSSTALLANSHSRNQNGK 259 Query: 532 XXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVK-NNKPTASANS 708 +RLCT+C TN + C+ +G+P GY+ K N S Sbjct: 260 SNYYGKGKGQAPNSAPKGHDRLCTYCKGTNHVVQNCWIKYGYPPGYKNKGKNSSQPSHTV 319 Query: 709 AGTESESQTKAEAPNTSNSNISLTHDQYQG 798 A +S +Q +++ T+ LT DQY G Sbjct: 320 AAVDSSTQLDSQSSTTATPPFGLTQDQYDG 349 >dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum] Length = 1178 Score = 207 bits (526), Expect = 7e-57 Identities = 106/272 (38%), Positives = 156/272 (57%), Gaps = 6/272 (2%) Frame = +1 Query: 1 MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180 MVLAWI R +S ++ +S+++ D WK+L+ RF QGDIF+I+DL ++L ++ QG D+ Sbjct: 80 MVLAWIHRSLSDSIARSVLWIDSAASLWKNLRTRFSQGDIFRISDLQEELYRLRQGNLDV 139 Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360 SDY+T+L+ LWDEL+NY P+ C C+ C CGA++S + R+QDY I FLKGLN+ +SN Sbjct: 140 SDYFTKLQVLWDELENYRPIPLCKCSIACTCGAVESFKLYREQDYVIRFLKGLNDRFSNT 199 Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHL--QVATP-TVSETENQVKTPRIANNISXXXXXX 531 +SQIM+++ P + F ++IQQER + + P T E T +AN+ Sbjct: 200 KSQIMLINPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDFSTALLANSHYKNQNGK 259 Query: 532 XXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANS- 708 NRLCTHC TN + C+ +G+P GY KNN+ +S S Sbjct: 260 SNYYGKGRGQAPNSAPKGHNRLCTHCRGTNHIVQDCWIKYGYPPGY--KNNRKNSSQPSH 317 Query: 709 --AGTESESQTKAEAPNTSNSNISLTHDQYQG 798 A +S +Q ++ NT+ LT QY G Sbjct: 318 IVAAVDSSTQHDSQFSNTATPPFGLTQVQYDG 349