BLASTX nr result
ID: Astragalus23_contig00019639
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus23_contig00019639 (1694 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KYP31776.1| hypothetical protein KK1_047731 [Cajanus cajan] 167 2e-43 gb|KYP46875.1| hypothetical protein KK1_031521 [Cajanus cajan] 164 2e-41 gb|KYP33062.1| hypothetical protein KK1_046125 [Cajanus cajan] 150 3e-36 gb|KYP56386.1| hypothetical protein KK1_002624 [Cajanus cajan] 144 1e-34 gb|KYP76607.1| hypothetical protein KK1_020855 [Cajanus cajan] 143 1e-34 ref|XP_007153799.1| hypothetical protein PHAVU_003G065500g [Phas... 141 4e-33 gb|KOM31456.1| hypothetical protein LR48_Vigan01g101100 [Vigna a... 139 1e-32 dbj|GAU51860.1| hypothetical protein TSUD_416440 [Trifolium subt... 140 2e-32 gb|KYP64073.1| hypothetical protein KK1_018661 [Cajanus cajan] 136 5e-32 gb|KYP44892.1| hypothetical protein KK1_033572 [Cajanus cajan] >... 132 2e-31 gb|KYP50043.1| hypothetical protein KK1_028198 [Cajanus cajan] 134 7e-31 gb|KYP40386.1| hypothetical protein KK1_038278 [Cajanus cajan] 133 1e-30 gb|KYP75955.1| hypothetical protein KK1_020168 [Cajanus cajan] 132 7e-30 gb|KYP63063.1| hypothetical protein KK1_017628 [Cajanus cajan] 125 4e-29 gb|KHN31113.1| hypothetical protein glysoja_046590, partial [Gly... 128 8e-29 gb|KHN15637.1| hypothetical protein glysoja_031426 [Glycine soja] 124 7e-28 gb|KYP32287.1| Retrovirus-related Pol polyprotein from transposo... 125 3e-27 ref|XP_014622353.1| PREDICTED: uncharacterized protein LOC100778... 128 3e-27 ref|XP_014630540.1| PREDICTED: uncharacterized protein LOC106798... 128 4e-27 gb|KYP44107.1| hypothetical protein KK1_034423, partial [Cajanus... 122 2e-26 >gb|KYP31776.1| hypothetical protein KK1_047731 [Cajanus cajan] Length = 342 Score = 167 bits (424), Expect = 2e-43 Identities = 118/355 (33%), Positives = 188/355 (52%), Gaps = 12/355 (3%) Frame = -1 Query: 1694 EKISAQIVREFYANA-SSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDG 1518 EK + + V+EFYANA S+ ++ WVRG+ I +DRD IN L EP+ E + Sbjct: 8 EKYNEKNVKEFYANAWPIRRDSEVIKKYWVRGRWIPYDRDAINKLLGEPMV-LREGRSCS 66 Query: 1517 YTRYLKENR--FDKDEVARDLCIAGHTYQDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYP 1344 Y +++K +R F+ EVA+ LC+ G +Y+ G + +R +A++W FL +NV P Sbjct: 67 Y-QFIKSSRHGFNNLEVAKLLCLLGQSYKSNRGFARCIMRGKKTKIAKVWMTFLFANVTP 125 Query: 1343 TIHTSDLKLKKSYLVWSIMVKH-LEVDIAQIISDKILSVVQSD------NPRVLPYPALI 1185 T H SD+++ +++L+++I+ H VDIA IISD++ V S + + L + ALI Sbjct: 126 TTHVSDIRISRAHLLYTILHSHAYRVDIATIISDEMYQFVTSSPSKKAISAKPLGFLALI 185 Query: 1184 TGLCEFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQY 1005 T LC+ IPA P++ + PINA I ++C N ++ Sbjct: 186 TALCKAHGVVIPAKPLTKIRGPINATFIDKFCNNQTTK------------APTAPVPPRH 233 Query: 1004 QMLMNLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLH--HGGENSFGWP 831 + +++ + + +TQ R + A +R L LN+S YRF+LH N F WP Sbjct: 234 PPMRPVISPMEQRL----STQIR--EHFGAIHRGLDRLNESCYRFTLHQYQQDSNPFSWP 287 Query: 830 TPVQFAAQVAWPEDGPMSREGQEVAHDGVDTQIQGDTGDARGAEAEDDFME*SED 666 TP QF + +WPED P+ +E EV + V+ ++ G+ D + E E D E +ED Sbjct: 288 TPEQFTSICSWPEDRPIHQE--EVEPEVVNDKVGGN--DQQDEENEADSEEGTED 338 >gb|KYP46875.1| hypothetical protein KK1_031521 [Cajanus cajan] Length = 421 Score = 164 bits (416), Expect = 2e-41 Identities = 111/311 (35%), Positives = 149/311 (47%), Gaps = 7/311 (2%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K S +I+REFYANA + R++WVRG + + RD IN +L P + + DGY Sbjct: 103 KKYSEEIIREFYANALPLQNRDQTRKSWVRGTQVYYHRDAINDFLGNPYSLGGDGR-DGY 161 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R F DEVA LC+ G TY GKP LR +L TLA+IW FL NV+PT+ Sbjct: 162 GRLKNACSFKADEVAERLCLPGCTYTLGASGKPVKILRKNLNTLARIWQNFLYCNVFPTM 221 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173 H SDL + ++ L++SIM K VD+A IIS++I VV S + L +P LI GLC Sbjct: 222 HISDLTMPRATLLYSIMNK-TGVDVATIISNEIHRVVLSTPSPTGVSKPLGFPGLIMGLC 280 Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLM 993 RA +P+H + PPINA +IK +C N + L Sbjct: 281 RAARATVPSHLSKTIRPPINASYIKTHCKNAQQGSTSQPGSQRHGQASSSSQVASSAFLA 340 Query: 992 NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYR-FSLHHGGENSFGWPTPVQF 816 + + +AN + AL S+N S Y G F WP+P F Sbjct: 341 AHFHHIEQQNLAN--------------HLALMSINTSLYHAHQQQFYGGPLFQWPSPETF 386 Query: 815 AAQVAWPEDGP 783 Q WP D P Sbjct: 387 QQQFQWPGDSP 397 >gb|KYP33062.1| hypothetical protein KK1_046125 [Cajanus cajan] Length = 440 Score = 150 bits (379), Expect = 3e-36 Identities = 112/341 (32%), Positives = 156/341 (45%), Gaps = 7/341 (2%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K + +I++EFYANA + R +WVRG ++ + RD I YL + LD + Sbjct: 99 KKYNEEIIKEFYANAYPLQRTDQTRNSWVRGAVVSYSRDAIQQYLGSR-DVIGGDGLDEF 157 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R K + F+ D++A+ LC+ G TY G P +FLR +L T A+IW L NVY Sbjct: 158 GRLKKAHAFNADKMAKLLCLPGCTYTVGLTGNPVSFLRKNLTTTARIWQNLLYCNVYCIT 217 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173 H SDL + ++ L++SI+ K EVDI IISD+I +V S R L +P LITGLC Sbjct: 218 HISDLNMPRATLLYSILQK-TEVDIPTIISDEIHKIVLSSPSSTGVSRPLGFPGLITGLC 276 Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLM 993 F +++P + L PPINA +IK +C + + Sbjct: 277 LFSGSRLPGNLNKALRPPINAAYIKIHCKSEQHGDASQPRPPRHGQGSSSSQVPAQDFMA 336 Query: 992 NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYR-FSLHHGGENSFGWPTPVQF 816 + + +AN + AL SLN S Y G F WP+P F Sbjct: 337 AHFHHIEQQNLAN--------------HLALMSLNTSMYHAHQQQFQGGPPFQWPSPEAF 382 Query: 815 AAQVAWPEDGPMSREGQEVAHDGVDTQIQGDTGDARGAEAE 693 WP D P G+E D Q Q G A G E E Sbjct: 383 QQHFYWPGDSPHFEGGEEEQPMPEDEQEQ--EGGAGGEEEE 421 >gb|KYP56386.1| hypothetical protein KK1_002624 [Cajanus cajan] Length = 362 Score = 144 bits (363), Expect = 1e-34 Identities = 119/359 (33%), Positives = 168/359 (46%), Gaps = 13/359 (3%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K + I++EFYANA R +WVRG + + RDTIN YL P ++ LD Y Sbjct: 20 KKYNEDIIKEFYANAFPLQRLDQTRNSWVRGVTVNYARDTINEYLGSPY-SLGDDGLDEY 78 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTY-QDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R K F D++A+ LC G TY G P + LR +L T+A+I FL NVY Sbjct: 79 GRLKKARAFKADKMAKLLCFPGCTYIVGVTGNPVSILRKNLTTIARIGQNFLYCNVYSIT 138 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173 H SDL + ++ L++SI+ K VDIA IISD+I V S + L + LITGLC Sbjct: 139 HISDLNMSRATLLYSILTKD-GVDIASIISDEIHKTVLSTPSITGVSKPLGFLGLITGLC 197 Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLM 993 + +++P + L PPINA +IK + + + +Q Sbjct: 198 KATGSRLPNNLNKSLRPPINAIYIKIHYKSEQQG-----------DTSQPRSQRHWQTSS 246 Query: 992 NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLH--HGGENSFGWPTPVQ 819 + M A+ ++QK A Y AL SLN S Y H HGG F WP+ Sbjct: 247 SSQVASLAFMAAH---FHHIEQKNLANYLALMSLNTSLYHAHRHQFHGGP-PFKWPSLDT 302 Query: 818 FAAQVAWPEDGPMSREGQEVAHDGVDTQIQGDTGDAR-----GAEAEDDFME*SEDLDQ 657 F Q WP D P G E + + +G GD + G + + + +E +D DQ Sbjct: 303 FQQQFHWPGDSPNFEGGVEEEQPEQEAEQEG--GDEKEDEEGGEDKQREDVEEGDDDDQ 359 >gb|KYP76607.1| hypothetical protein KK1_020855 [Cajanus cajan] Length = 338 Score = 143 bits (361), Expect = 1e-34 Identities = 106/304 (34%), Positives = 152/304 (50%), Gaps = 8/304 (2%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K + I++EFYANA + R +WVRG M+ + RD IN L P + LD Y Sbjct: 50 KKYNEDIIKEFYANAYPLQRTDKTRNSWVRGVMVSYSRDAINECLGNPYS-LGGDDLDEY 108 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R K F+ D++A+ LC+ G TY G P +FLR +L T+A+IW FL NVY Sbjct: 109 GRIKKAQVFNADKMAKLLCLPGCTYTVGVMGNPVSFLRKNLTTIARIWQNFLYYNVYCLT 168 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVV-----QSDNPRVLPYPALITGLC 1173 H S+L + ++ L++SI+ K VDIA IISD+I V Q+ + L +P LI GLC Sbjct: 169 HISNLNMPRATLLYSILQKD-GVDIASIISDEIHKTVLSTPSQTGVSKPLGFPGLIRGLC 227 Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLM 993 F +++P + L PPINA +IK +C N + +Q Sbjct: 228 LFSGSRLPGNLNKSLRPPINASYIKIHCKNEQQG-----------DASQPRPQRHWQGYS 276 Query: 992 NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRF--SLHHGGENSFGWPTPVQ 819 + ++ M A+ ++Q+ A + AL SLN S Y HGG F WP+P Sbjct: 277 SSQAPGQEFMAAH---FHHIEQQNLANHLALMSLNTSMYHAHQQQFHGGP-LFQWPSPET 332 Query: 818 FAAQ 807 F Q Sbjct: 333 FQQQ 336 >ref|XP_007153799.1| hypothetical protein PHAVU_003G065500g [Phaseolus vulgaris] gb|ESW25793.1| hypothetical protein PHAVU_003G065500g [Phaseolus vulgaris] Length = 407 Score = 141 bits (355), Expect = 4e-33 Identities = 104/333 (31%), Positives = 159/333 (47%), Gaps = 18/333 (5%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K +V EFYANA + VRGK I +DR+TIN +L PL +++L Y Sbjct: 82 DKYDPDVVLEFYANAWPVKEGDTNLRSKVRGKWIPYDRNTINDFLGNPLQ-LDQDELCTY 140 Query: 1514 TRYLKENRF---DKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVY 1347 + F E + LCI G TY+ + GKP +R S+ TL QIW L SNV Sbjct: 141 GMLKRGTNFTSLSNTETSDLLCIPGRTYETNNNGKPLRIIRSSMTTLTQIWTSLLLSNVI 200 Query: 1346 PTIHTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVV---QSDNP---RVLPYPALI 1185 P H+SDL + K ++V+ ++ K +VD+A +ISD I V +NP + L +P+LI Sbjct: 201 PNKHSSDLSMAKCHIVFCLL-KQYDVDVATLISDSIHHFVLQQGGNNPLHRKGLGFPSLI 259 Query: 1184 TGLCEFQRAQIPAHPISVLNPPINARHIKQYC----VNPLENMXXXXXXXXXXXXXXXXX 1017 T LC Q+ + + + PPI+ + I++ C L+ Sbjct: 260 TSLCAANGIQV--NLSTRIRPPIDKKIIQRNCSEKDQQQLQRQQSQQGQDQPVEPPINQL 317 Query: 1016 XXQYQMLMNLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHHGGENSFG 837 MN+L+ + + ++ + + + A +RA+ SLN SF+ ++LHH E+ Sbjct: 318 HSPIVPEMNMLDYIKNL----ESHMKHVQMQQAANHRAMVSLNGSFHSYALHHSAESKLV 373 Query: 836 WPTPVQFAAQVAWPED----GPMSREGQEVAHD 750 WP +F V WP D S+E E HD Sbjct: 374 WPNAEEFDHLVKWPGDETVLAAQSKESHEEIHD 406 >gb|KOM31456.1| hypothetical protein LR48_Vigan01g101100 [Vigna angularis] Length = 406 Score = 139 bits (351), Expect = 1e-32 Identities = 102/321 (31%), Positives = 163/321 (50%), Gaps = 11/321 (3%) Frame = -1 Query: 1691 KISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYT 1512 K +IVREFYANA D GE+ + VRG+ + +DR +I+ +L PLP +E QL YT Sbjct: 86 KFDLEIVREFYANAYPLDGL-GEKRSKVRGRWVTYDRASISEFLGHPLP-LAEGQLCDYT 143 Query: 1511 RYLK-ENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R + + FD++EV + I+ +Y+ G P+ LR +KTLAQ++ FL SN+ P Sbjct: 144 RRRRSQEAFDEEEVVNLIFISNRSYRLGSSGDPRRILRTDMKTLAQVFMTFLLSNIVPIG 203 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPR-------VLPYPALITG 1179 H SDL + + +L+++IM + L VD+A II ++I V+ + + L +P LIT Sbjct: 204 HVSDLNVPRCHLLFNIMREDLTVDVAIIIFEEIHKFVRYEVNKNNEKRKCALGFPTLITA 263 Query: 1178 LCEFQRAQIPAHPISV-LNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQ 1002 LC+ Q ++ +SV + P I R I+++C NP E M Q Sbjct: 264 LCQAQGVEV---DLSVKIRPTITKRFIEKFCTNPAEIMPQLEQPVAAEQTPCPEQQPQLN 320 Query: 1001 MLMNLLNEQRKMMI-ANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHHGGENSFGWPTP 825 + LL + R + + T QQ + + +R L++ Y + + TP Sbjct: 321 IHHELLEQMRYLRLQMEHTCQQNI-----SIHRGQLHLHEYLY-----NNVRGPYPGMTP 370 Query: 824 VQFAAQVAWPEDGPMSREGQE 762 +F A + WP D P+ G++ Sbjct: 371 QEFLAYLQWPGDSPIFFRGRK 391 >dbj|GAU51860.1| hypothetical protein TSUD_416440 [Trifolium subterraneum] Length = 432 Score = 140 bits (352), Expect = 2e-32 Identities = 104/343 (30%), Positives = 162/343 (47%), Gaps = 8/343 (2%) Frame = -1 Query: 1691 KISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYT 1512 K++ ++VREFYANA T VRG+ I FDRDT+N +L EP +QL Y+ Sbjct: 83 KLNYEVVREFYANAIPIGQEPYNFTTVVRGRQIHFDRDTLNRFLGEP-SNLDSDQLCEYS 141 Query: 1511 RYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTIH 1335 L + +++ RD+ I G ++Q + + + ++ +L AQI + +C N+ P H Sbjct: 142 EMLVRQNWPVEDMVRDIFIEGESFQLNNQREERRAIKETLTIPAQIIHLLICYNLKPRSH 201 Query: 1334 TSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQS----DNPRVLPYPALITGLCEF 1167 + ++ L+W I+ EVDIA++I++++ SV +S D VLPYP LI GLCE Sbjct: 202 VHTAPMDRATLIWYILTGR-EVDIARVIANEMRSVAESGIKNDAKPVLPYPGLIIGLCEA 260 Query: 1166 QRAQIPAHPISVLNPPINARHIKQYC-VNPLENMXXXXXXXXXXXXXXXXXXXQYQMLMN 990 + IPA + IN ++IK+YC + ++ Q N Sbjct: 261 EHVHIPAIVSHTTDKLINDKYIKRYCKLKEVQQQPQQQPQAPQLPAAPLHPVEPQQAYPN 320 Query: 989 LLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHHG--GENSFGWPTPVQF 816 + + R T Q + +RAL+ L +S YR L G + + P F Sbjct: 321 I--DPRLQNWFYHTWDQN-----TSNHRALTVLQESMYRMQLDQGVPVNHDYQVMDPQHF 373 Query: 815 AAQVAWPEDGPMSREGQEVAHDGVDTQIQGDTGDARGAEAEDD 687 +AWP D P G E ++ G D D+ D A+A DD Sbjct: 374 QTHIAWPGDRPQFTGGAETSNVGGDND---DSIDEAAADAMDD 413 >gb|KYP64073.1| hypothetical protein KK1_018661 [Cajanus cajan] Length = 331 Score = 136 bits (342), Expect = 5e-32 Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 6/208 (2%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K + I++EFYANA R +WVRG I + RD IN YL P + LD Y Sbjct: 99 KKYNEDIIKEFYANAYPLQRHDQTRNSWVRGVTISYSRDAINEYLGNPYSLGGDG-LDEY 157 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R K F+ D++A+ LC+ G TY G P +FLR +L TLA+IW FL NVY Sbjct: 158 GRLKKARGFNADKMAKLLCLPGCTYTVGVTGNPVSFLRKNLTTLARIWQNFLYCNVYSIT 217 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173 H SDL + ++ L++SI+ K+ VDIA IISD+I V S + L +P LITGLC Sbjct: 218 HISDLNMPRATLLYSILQKN-GVDIASIISDEIHKTVLSTPSMTGVSKPLGFPGLITGLC 276 Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYC 1089 +++P + L PPIN +IK +C Sbjct: 277 LAGGSRLPNNLNKSLRPPINVAYIKIHC 304 >gb|KYP44892.1| hypothetical protein KK1_033572 [Cajanus cajan] gb|KYP44914.1| hypothetical protein KK1_033595 [Cajanus cajan] Length = 268 Score = 132 bits (333), Expect = 2e-31 Identities = 78/214 (36%), Positives = 125/214 (58%), Gaps = 10/214 (4%) Frame = -1 Query: 1694 EKISAQIVREFYANA-SSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDG 1518 EK + +IV+EFYANA + +++WVRG+ + +DRD IN L EP+ E Q Sbjct: 8 EKYNEKIVKEFYANAWPIRRDYEVIKKSWVRGRWVPYDRDAINKLLGEPMV-LREGQSCS 66 Query: 1517 YTRYLKENR--FDKDEVARDLCIAGHTYQDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYP 1344 Y +++K +R F+ EVA+ L + G +Y+ G + +R + +A++W FL +NV P Sbjct: 67 Y-QFIKSSRHGFNNPEVAKLLSLPGQSYESNKGFARRIMRGKMTKIARVWMTFLFANVTP 125 Query: 1343 TIHTSDLKLKKSYLVWSIMVKH-LEVDIAQIISDKILSVVQSD------NPRVLPYPALI 1185 T H D+++ +++L++SI+ H VDI IISD++ V S + + L +PALI Sbjct: 126 TTHVLDIRMSRAHLLYSILHSHAYRVDITAIISDEMYQFVTSSPSKKTISAKSLGFPALI 185 Query: 1184 TGLCEFQRAQIPAHPISVLNPPINARHIKQYCVN 1083 T LC+ IPA P++ + PINA I ++C N Sbjct: 186 TALCKAHGVVIPAKPLTKIRGPINATFIDKFCNN 219 >gb|KYP50043.1| hypothetical protein KK1_028198 [Cajanus cajan] Length = 361 Score = 134 bits (336), Expect = 7e-31 Identities = 82/207 (39%), Positives = 116/207 (56%), Gaps = 6/207 (2%) Frame = -1 Query: 1691 KISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYT 1512 K + I++EFYANA R +WVR ++ + RD IN YL P ++ LD Y Sbjct: 103 KYNEDIIKEFYANAFPLQKHDQTRNSWVREVIVSYARDAINEYLGSPYS-LGDDGLDEYG 161 Query: 1511 RYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTIH 1335 R K F D++ + LC+ G TY G P + LR +L T+A+IW FL NVY IH Sbjct: 162 RLKKARAFKADKMVKLLCLPGCTYTVGVTGNPVSILRKNLTTIARIWQNFLYCNVYSIIH 221 Query: 1334 TSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPRV-----LPYPALITGLCE 1170 SDL + ++ L++SI+ K VDI IISD+I VV S + L +P L+TGLC+ Sbjct: 222 ISDLNMPRATLLYSILTKD-GVDITLIISDEIHKVVLSTPSLIGVSKPLGFPGLLTGLCK 280 Query: 1169 FQRAQIPAHPISVLNPPINARHIKQYC 1089 +++P + L PPINA +IK +C Sbjct: 281 ASGSRLPNNLNKSLRPPINASYIKIHC 307 >gb|KYP40386.1| hypothetical protein KK1_038278 [Cajanus cajan] Length = 371 Score = 133 bits (335), Expect = 1e-30 Identities = 81/208 (38%), Positives = 118/208 (56%), Gaps = 6/208 (2%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K + I++EF ANA + R +WVRG M+ + RD IN YL P + LD Y Sbjct: 30 KKYNEDIIKEFNANAYPLQRTDKTRNSWVRGAMVSYSRDAINEYLGNPYSLGGDG-LDEY 88 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R K F+ D++ + LC+ G TY G P +FLR +L T A+IW FL NVY Sbjct: 89 GRIKKARAFNADKMDKLLCLPGCTYTVGVTGNPDSFLRKNLTTTARIWQNFLYCNVYCLT 148 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173 H SDL + ++ L++S++ K+ +DI IIS++I +V S + L +P LITGLC Sbjct: 149 HISDLNMPRATLLYSVLRKN-GMDITSIISNEIHKIVLSTPSLTGVSKPLGFPGLITGLC 207 Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYC 1089 F +++P + L PPINA +IK +C Sbjct: 208 LFSGSRLPGNLNKSLRPPINAAYIKIHC 235 >gb|KYP75955.1| hypothetical protein KK1_020168 [Cajanus cajan] Length = 446 Score = 132 bits (333), Expect = 7e-30 Identities = 101/335 (30%), Positives = 158/335 (47%), Gaps = 26/335 (7%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K + I+REFYAN+ +R +WVRGK I +D + IN +L Y+ + D Y Sbjct: 115 DKYNEIIIREFYANSFPVRPDSKDRISWVRGKTIAYDPEAINTFLQTE---YTIPEEDDY 171 Query: 1514 TRYLKE--NRFDKDEVARDLCIAGHTYQD-EPGKPKTFLRPSLKTLAQIWFIFLCSNVYP 1344 + +K N + V L + G YQ +P LR LK+L ++W + L SNV P Sbjct: 172 RKLMKTAMNEEMSNLVLETLSLLGSQYQTGTKNQPTHILRADLKSLVRLWQVILYSNVVP 231 Query: 1343 TIHTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVV-----QSDNPRVLPYPALITG 1179 HTSD+ + K+ L++ I+++ +VDIA +IS++I ++V +S R L +P LITG Sbjct: 232 LTHTSDITISKAKLIFCILLQK-DVDIATLISNEIHAIVLSKPSKSGTVRPLAFPGLITG 290 Query: 1178 LCEFQRAQIPAHPISVLNPPINARHIKQYCVNPLE------------NMXXXXXXXXXXX 1035 LC+ +R IP P+ + I+ + C NP E Sbjct: 291 LCKAKRVVIP-QPLVPIRRTIDHVFVNARCYNPREFPRASRRSRPPPTQSPPPVTSPPVL 349 Query: 1034 XXXXXXXXQYQMLM----NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFS 867 Q M + EQ ++ +A+ +L Q A +R +L+ FY ++ Sbjct: 350 PTGPFDLSTMQACMMQHFQHMEEQHQLEMAH-LRHVKLQQA--ANHRGQLALHSYFYNYT 406 Query: 866 LHHG--GENSFGWPTPVQFAAQVAWPEDGPMSREG 768 LH G + + WPTP QF + WP D P+ G Sbjct: 407 LHQANTGGSLYPWPTPEQFQDAILWPGDNPVFSGG 441 >gb|KYP63063.1| hypothetical protein KK1_017628 [Cajanus cajan] Length = 234 Score = 125 bits (314), Expect = 4e-29 Identities = 82/210 (39%), Positives = 113/210 (53%), Gaps = 8/210 (3%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K + I++EFYANA R WVRG I + RD IN YL P + LD Y Sbjct: 20 KKYNKDIIKEFYANAFPLQRLDQTRNYWVRGVTISYSRDAINEYLGSPYSLGGDG-LDEY 78 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTY-QDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R K F+ D++A+ LC+ TY G P + LR +L T+A+IW FL NVY Sbjct: 79 GRLKKPRAFNVDKMAKLLCLPSCTYIVGVIGNPVSILRKNLTTIARIWQNFLYCNVYSIT 138 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKI-------LSVVQSDNPRVLPYPALITG 1179 H SDL + ++ L++SI+ K VDIA IIS++I LS+ P L +P LITG Sbjct: 139 HISDLNMPRATLLYSILTKD-GVDIASIISNEIHKTVLSTLSITGVSKP--LGFPGLITG 195 Query: 1178 LCEFQRAQIPAHPISVLNPPINARHIKQYC 1089 LC +++P + L PPIN +I +C Sbjct: 196 LCMASGSRLPNNLNKSLRPPINVAYINIHC 225 >gb|KHN31113.1| hypothetical protein glysoja_046590, partial [Glycine soja] Length = 380 Score = 128 bits (322), Expect = 8e-29 Identities = 96/316 (30%), Positives = 143/316 (45%), Gaps = 13/316 (4%) Frame = -1 Query: 1691 KISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYT 1512 K IV EFYANA + + +WVRG+ I FD D ++ +L +PL + + Sbjct: 82 KFDPDIVLEFYANALPTEEGVRDMRSWVRGQWISFDADALSQFLGDPLVLEEGQECEFSQ 141 Query: 1511 RYLKENRFDKDEVARDLCIAGHTY-QDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTIH 1335 R + FD++ +A LC+ G + Q G+ +R S+ TL Q+W L SNV P+ H Sbjct: 142 RRNMADGFDEEAIAHLLCMPGQDFAQTAVGRRVRIMRTSMTTLTQMWMTLLLSNVLPSDH 201 Query: 1334 TSDLKLKKSYLVWSIMVKHLEVDIAQIISDKIL----------SVVQSDNPRVLPYPALI 1185 DL L K LV++I+ + + V +AQ+I+D I + + R L +PALI Sbjct: 202 NFDLPLPKCQLVYAILTQ-MSVHVAQLIADAIYLFAGMPPTRHPLDPDKSSRALGFPALI 260 Query: 1184 TGLCEFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQY 1005 TGLC Q +P P V+ PI I++YC P + Sbjct: 261 TGLC--QSFGVPVTPSKVIRSPITRAFIEKYC-TPRQAQGDAHQAADAPPPPHQADP--- 314 Query: 1004 QMLMNLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHH--GGENSFGWP 831 L +R + Q L ++ A +R +++ Y+ SL G F P Sbjct: 315 ---AESLGMERYL--------QHLVRQQAANHRGQVQIHECLYQLSLSQQVQGFAPFACP 363 Query: 830 TPVQFAAQVAWPEDGP 783 TP QF +VAWP D P Sbjct: 364 TPDQFRDEVAWPGDWP 379 >gb|KHN15637.1| hypothetical protein glysoja_031426 [Glycine soja] Length = 330 Score = 124 bits (312), Expect = 7e-28 Identities = 76/203 (37%), Positives = 115/203 (56%), Gaps = 1/203 (0%) Frame = -1 Query: 1688 ISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYTR 1509 I IV+EFYAN + + ++ VRG +I FD DT+N +L P+ L Y+R Sbjct: 77 IDVAIVKEFYANLYDSED-KSPKQVKVRGHLIKFDEDTLNTFLKTPVILEEGENLCAYSR 135 Query: 1508 YLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTIHT 1332 + R D E+A LCI G ++ + G P LR +L TLAQ W + SN+ PT HT Sbjct: 136 FALL-RPDPQELAAKLCIPGRGFELNADGHPLKILRKNLTTLAQTWSVLSFSNLIPTSHT 194 Query: 1331 SDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPRVLPYPALITGLCEFQRAQI 1152 SD+ L ++ L++ I+VK ++ ++ +IS +I Q D+ R L +PALIT LC+ + Q Sbjct: 195 SDVTLDRAKLIYGIIVK-MDTNVGYLISHQISITAQHDSSR-LGFPALITALCKARGVQS 252 Query: 1151 PAHPISVLNPPINARHIKQYCVN 1083 + + L+P IN +IK+ C N Sbjct: 253 DSRSLESLSPAINLAYIKKNCWN 275 >gb|KYP32287.1| Retrovirus-related Pol polyprotein from transposon opus [Cajanus cajan] Length = 481 Score = 125 bits (315), Expect = 3e-27 Identities = 80/208 (38%), Positives = 112/208 (53%), Gaps = 6/208 (2%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K I++EFYAN + R +WVRG M+ + RD IN YL P + LD Y Sbjct: 251 KKYHEDIIKEFYANVYPLQRTDKIRNSWVRGAMVSYSRDAINEYLGNPY-SLGGDGLDEY 309 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 R K F+ D++A+ LC+ G TY G P +FLR +L T +IW FL NVY Sbjct: 310 GRLKKARAFNADKMAKLLCLPGCTYTVGVTGNPVSFLRKNLTTTTRIWQNFLYCNVYCIT 369 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173 H SDL + + L++S++ K VDIA II D+I V S + L +P LITGLC Sbjct: 370 HISDLNMPRETLLYSVLQK-TGVDIAAIILDEIHKTVLSTPSLTGVSKPLGFPGLITGLC 428 Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYC 1089 F +++ ++ L PP N +IK +C Sbjct: 429 LFNGSRLLSNLNKSLWPPTNVAYIKIHC 456 >ref|XP_014622353.1| PREDICTED: uncharacterized protein LOC100778023 [Glycine max] Length = 2264 Score = 128 bits (322), Expect = 3e-27 Identities = 92/328 (28%), Positives = 149/328 (45%), Gaps = 4/328 (1%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 ++I +V+EFY+N + R VRG+++ FD DTIN +LD P+ + Y Sbjct: 1923 KRIDVALVKEFYSNLYDPED-HSPRFCRVRGQVVRFDADTINDFLDTPVILEDGEEYTAY 1981 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 TRYL + D D +A LC G + + G P LR + TLAQ W + ++ PT Sbjct: 1982 TRYLSTHP-DPDTIAATLCTPGGRFVLNADGLPWKLLRKDMTTLAQTWSVLSYYDLAPTS 2040 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPRVLPYPALITGLCEFQRA 1158 HTSD+ L ++ L++ +V +++D+ IS +I + QS R L +PALIT LC+ Q Sbjct: 2041 HTSDVNLDRARLIYG-LVSRMDMDVGSFISQQISQIAQSSTSR-LGFPALITALCDIQGV 2098 Query: 1157 QIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLMNLLNE 978 L+P IN ++K+ C NP + + Sbjct: 2099 VFDTLIFESLSPAINLAYVKKNCWNPADPSITFPGPRRTRTRASASAPEAPLPTQSPSQP 2158 Query: 977 QRKMMIANQTTQQRLD---QKLDATYRALSSLNDSFYRFSLHHGGENSFGWPTPVQFAAQ 807 ++ +T +D Q L + + + ++ +R SLH + TP + + Sbjct: 2159 SQRPRHPPASTSASMDMHGQMLRSLHVGQQLIMENMHRLSLHLQMDPPL--TTPEAYRQR 2216 Query: 806 VAWPEDGPMSREGQEVAHDGVDTQIQGD 723 VAWP D P + G+E + D + D Sbjct: 2217 VAWPGDQPSTDRGEEPSGAAEDPAVDED 2244 >ref|XP_014630540.1| PREDICTED: uncharacterized protein LOC106798466 [Glycine max] Length = 1749 Score = 128 bits (321), Expect = 4e-27 Identities = 92/328 (28%), Positives = 147/328 (44%), Gaps = 4/328 (1%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 ++I +V+EFY+N + R VRG+++ FD DTIN +LD P+ + Y Sbjct: 1408 KRIDVALVKEFYSNLYDPED-HSPRFCRVRGQVVRFDADTINDFLDTPVILEVGEEYPAY 1466 Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338 TRYL + D D +A LC G + + G P LR + TLAQ W + ++ PT Sbjct: 1467 TRYLSTHP-DPDTIAATLCTPGGRFVLNADGLPWKLLRKDMTTLAQTWSVLSYYDLAPTS 1525 Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPRVLPYPALITGLCEFQRA 1158 HTSD+ L ++ L++ +V +++D+ IS +I + QS R L +PALIT LC+ Q Sbjct: 1526 HTSDVNLDRARLIYG-LVSRMDMDVGSFISQQISQIAQSSTSR-LGFPALITALCDIQGV 1583 Query: 1157 QIPAHPISVLNPPINARHIKQYCVNPLE---NMXXXXXXXXXXXXXXXXXXXQYQMLMNL 987 L+P IN ++K+ C NP + Q Sbjct: 1584 VSDTLIFESLSPAINLAYVKKNCWNPADPSITFPGPRRTRTRASASASEAPLPTQSPSQP 1643 Query: 986 LNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHHGGENSFGWPTPVQFAAQ 807 R + + + Q L + + + ++ +R SLH + TP + + Sbjct: 1644 SQRPRHLPASTSASMDTHGQMLRSLHVGQQLIMENMHRLSLHLQMDPPL--TTPEAYRQR 1701 Query: 806 VAWPEDGPMSREGQEVAHDGVDTQIQGD 723 VAWP D P + G+E + D + D Sbjct: 1702 VAWPGDQPSTDRGEEPSGAAEDPAVDED 1729 >gb|KYP44107.1| hypothetical protein KK1_034423, partial [Cajanus cajan] Length = 384 Score = 122 bits (305), Expect = 2e-26 Identities = 109/375 (29%), Positives = 170/375 (45%), Gaps = 32/375 (8%) Frame = -1 Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515 +K + I REFYANA + +R +WVRGK I +D TIN +L Y+ + D Y Sbjct: 13 DKYNEIITREFYANAFPVRPNSKDRISWVRGKTIAYDPATINTFLQTG---YTIPEQDDY 69 Query: 1514 TRYLKENRFDKDE-----VARDLCIAGHTYQD-EPGKPKTFLRPSLKTLAQIWFIFLCSN 1353 + + R DE + L + G YQ +P LR LK+L ++W L SN Sbjct: 70 RKLM---RVAMDEEMSTLMLETLSLPGSQYQTGTKSQPTHILRADLKSLVRLWQAVLYSN 126 Query: 1352 VYPTIHTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVV-----QSDNPRVLPYPAL 1188 V+P HTSD+ + K+ L++ I+++ +VDIA +IS++I ++V +S R L +P L Sbjct: 127 VFPLTHTSDITISKAKLIFCILLQK-DVDIATLISNEIHAIVLSKPSKSGAVRPLAFPGL 185 Query: 1187 ITGLCEFQRAQIPAHPISVLNPPINARHIKQYCVNPLE------------NMXXXXXXXX 1044 ITGLC+ + IP P+ + I+ + NP E Sbjct: 186 ITGLCKAKGVVIP-QPLVPIRRTIDHVFVNACRYNPREFPRASSRSGPPPTQSTPPVTSP 244 Query: 1043 XXXXXXXXXXXQYQMLM----NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFY 876 Q M + EQ ++ +A+ + + + A +R +L+ FY Sbjct: 245 PVPPTGSFDLSTMQACMMQHFQHMEEQHQLEMAH---LRHVQLQQAANHRGQVALHSYFY 301 Query: 875 RFSLHHG--GENSFGWPTPVQFAAQVAWPEDGPMSREG---QEVAHDGVDTQIQGDTGDA 711 ++L+ G + + WPTP QF + WP D P+ G E H G QG DA Sbjct: 302 HYTLNQASTGGSLYPWPTPEQFQDVIRWPGDSPVFSGGGGESEQPHVGE----QGQRSDA 357 Query: 710 RGAEAEDDFME*SED 666 E +D E +E+ Sbjct: 358 EVEEEGNDGGEENEE 372