BLASTX nr result

ID: Astragalus23_contig00019639 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00019639
         (1694 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP31776.1| hypothetical protein KK1_047731 [Cajanus cajan]        167   2e-43
gb|KYP46875.1| hypothetical protein KK1_031521 [Cajanus cajan]        164   2e-41
gb|KYP33062.1| hypothetical protein KK1_046125 [Cajanus cajan]        150   3e-36
gb|KYP56386.1| hypothetical protein KK1_002624 [Cajanus cajan]        144   1e-34
gb|KYP76607.1| hypothetical protein KK1_020855 [Cajanus cajan]        143   1e-34
ref|XP_007153799.1| hypothetical protein PHAVU_003G065500g [Phas...   141   4e-33
gb|KOM31456.1| hypothetical protein LR48_Vigan01g101100 [Vigna a...   139   1e-32
dbj|GAU51860.1| hypothetical protein TSUD_416440 [Trifolium subt...   140   2e-32
gb|KYP64073.1| hypothetical protein KK1_018661 [Cajanus cajan]        136   5e-32
gb|KYP44892.1| hypothetical protein KK1_033572 [Cajanus cajan] >...   132   2e-31
gb|KYP50043.1| hypothetical protein KK1_028198 [Cajanus cajan]        134   7e-31
gb|KYP40386.1| hypothetical protein KK1_038278 [Cajanus cajan]        133   1e-30
gb|KYP75955.1| hypothetical protein KK1_020168 [Cajanus cajan]        132   7e-30
gb|KYP63063.1| hypothetical protein KK1_017628 [Cajanus cajan]        125   4e-29
gb|KHN31113.1| hypothetical protein glysoja_046590, partial [Gly...   128   8e-29
gb|KHN15637.1| hypothetical protein glysoja_031426 [Glycine soja]     124   7e-28
gb|KYP32287.1| Retrovirus-related Pol polyprotein from transposo...   125   3e-27
ref|XP_014622353.1| PREDICTED: uncharacterized protein LOC100778...   128   3e-27
ref|XP_014630540.1| PREDICTED: uncharacterized protein LOC106798...   128   4e-27
gb|KYP44107.1| hypothetical protein KK1_034423, partial [Cajanus...   122   2e-26

>gb|KYP31776.1| hypothetical protein KK1_047731 [Cajanus cajan]
          Length = 342

 Score =  167 bits (424), Expect = 2e-43
 Identities = 118/355 (33%), Positives = 188/355 (52%), Gaps = 12/355 (3%)
 Frame = -1

Query: 1694 EKISAQIVREFYANA-SSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDG 1518
            EK + + V+EFYANA      S+  ++ WVRG+ I +DRD IN  L EP+    E +   
Sbjct: 8    EKYNEKNVKEFYANAWPIRRDSEVIKKYWVRGRWIPYDRDAINKLLGEPMV-LREGRSCS 66

Query: 1517 YTRYLKENR--FDKDEVARDLCIAGHTYQDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYP 1344
            Y +++K +R  F+  EVA+ LC+ G +Y+   G  +  +R     +A++W  FL +NV P
Sbjct: 67   Y-QFIKSSRHGFNNLEVAKLLCLLGQSYKSNRGFARCIMRGKKTKIAKVWMTFLFANVTP 125

Query: 1343 TIHTSDLKLKKSYLVWSIMVKH-LEVDIAQIISDKILSVVQSD------NPRVLPYPALI 1185
            T H SD+++ +++L+++I+  H   VDIA IISD++   V S       + + L + ALI
Sbjct: 126  TTHVSDIRISRAHLLYTILHSHAYRVDIATIISDEMYQFVTSSPSKKAISAKPLGFLALI 185

Query: 1184 TGLCEFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQY 1005
            T LC+     IPA P++ +  PINA  I ++C N                        ++
Sbjct: 186  TALCKAHGVVIPAKPLTKIRGPINATFIDKFCNNQTTK------------APTAPVPPRH 233

Query: 1004 QMLMNLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLH--HGGENSFGWP 831
              +  +++   + +    +TQ R  +   A +R L  LN+S YRF+LH      N F WP
Sbjct: 234  PPMRPVISPMEQRL----STQIR--EHFGAIHRGLDRLNESCYRFTLHQYQQDSNPFSWP 287

Query: 830  TPVQFAAQVAWPEDGPMSREGQEVAHDGVDTQIQGDTGDARGAEAEDDFME*SED 666
            TP QF +  +WPED P+ +E  EV  + V+ ++ G+  D +  E E D  E +ED
Sbjct: 288  TPEQFTSICSWPEDRPIHQE--EVEPEVVNDKVGGN--DQQDEENEADSEEGTED 338


>gb|KYP46875.1| hypothetical protein KK1_031521 [Cajanus cajan]
          Length = 421

 Score =  164 bits (416), Expect = 2e-41
 Identities = 111/311 (35%), Positives = 149/311 (47%), Gaps = 7/311 (2%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K S +I+REFYANA    +    R++WVRG  + + RD IN +L  P     + + DGY
Sbjct: 103  KKYSEEIIREFYANALPLQNRDQTRKSWVRGTQVYYHRDAINDFLGNPYSLGGDGR-DGY 161

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
             R      F  DEVA  LC+ G TY     GKP   LR +L TLA+IW  FL  NV+PT+
Sbjct: 162  GRLKNACSFKADEVAERLCLPGCTYTLGASGKPVKILRKNLNTLARIWQNFLYCNVFPTM 221

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173
            H SDL + ++ L++SIM K   VD+A IIS++I  VV S        + L +P LI GLC
Sbjct: 222  HISDLTMPRATLLYSIMNK-TGVDVATIISNEIHRVVLSTPSPTGVSKPLGFPGLIMGLC 280

Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLM 993
               RA +P+H    + PPINA +IK +C N  +                         L 
Sbjct: 281  RAARATVPSHLSKTIRPPINASYIKTHCKNAQQGSTSQPGSQRHGQASSSSQVASSAFLA 340

Query: 992  NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYR-FSLHHGGENSFGWPTPVQF 816
               +   +  +AN              + AL S+N S Y        G   F WP+P  F
Sbjct: 341  AHFHHIEQQNLAN--------------HLALMSINTSLYHAHQQQFYGGPLFQWPSPETF 386

Query: 815  AAQVAWPEDGP 783
              Q  WP D P
Sbjct: 387  QQQFQWPGDSP 397


>gb|KYP33062.1| hypothetical protein KK1_046125 [Cajanus cajan]
          Length = 440

 Score =  150 bits (379), Expect = 3e-36
 Identities = 112/341 (32%), Positives = 156/341 (45%), Gaps = 7/341 (2%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K + +I++EFYANA     +   R +WVRG ++ + RD I  YL         + LD +
Sbjct: 99   KKYNEEIIKEFYANAYPLQRTDQTRNSWVRGAVVSYSRDAIQQYLGSR-DVIGGDGLDEF 157

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
             R  K + F+ D++A+ LC+ G TY     G P +FLR +L T A+IW   L  NVY   
Sbjct: 158  GRLKKAHAFNADKMAKLLCLPGCTYTVGLTGNPVSFLRKNLTTTARIWQNLLYCNVYCIT 217

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173
            H SDL + ++ L++SI+ K  EVDI  IISD+I  +V S        R L +P LITGLC
Sbjct: 218  HISDLNMPRATLLYSILQK-TEVDIPTIISDEIHKIVLSSPSSTGVSRPLGFPGLITGLC 276

Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLM 993
             F  +++P +    L PPINA +IK +C +                            + 
Sbjct: 277  LFSGSRLPGNLNKALRPPINAAYIKIHCKSEQHGDASQPRPPRHGQGSSSSQVPAQDFMA 336

Query: 992  NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYR-FSLHHGGENSFGWPTPVQF 816
               +   +  +AN              + AL SLN S Y        G   F WP+P  F
Sbjct: 337  AHFHHIEQQNLAN--------------HLALMSLNTSMYHAHQQQFQGGPPFQWPSPEAF 382

Query: 815  AAQVAWPEDGPMSREGQEVAHDGVDTQIQGDTGDARGAEAE 693
                 WP D P    G+E      D Q Q   G A G E E
Sbjct: 383  QQHFYWPGDSPHFEGGEEEQPMPEDEQEQ--EGGAGGEEEE 421


>gb|KYP56386.1| hypothetical protein KK1_002624 [Cajanus cajan]
          Length = 362

 Score =  144 bits (363), Expect = 1e-34
 Identities = 119/359 (33%), Positives = 168/359 (46%), Gaps = 13/359 (3%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K +  I++EFYANA         R +WVRG  + + RDTIN YL  P     ++ LD Y
Sbjct: 20   KKYNEDIIKEFYANAFPLQRLDQTRNSWVRGVTVNYARDTINEYLGSPY-SLGDDGLDEY 78

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTY-QDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
             R  K   F  D++A+ LC  G TY     G P + LR +L T+A+I   FL  NVY   
Sbjct: 79   GRLKKARAFKADKMAKLLCFPGCTYIVGVTGNPVSILRKNLTTIARIGQNFLYCNVYSIT 138

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173
            H SDL + ++ L++SI+ K   VDIA IISD+I   V S        + L +  LITGLC
Sbjct: 139  HISDLNMSRATLLYSILTKD-GVDIASIISDEIHKTVLSTPSITGVSKPLGFLGLITGLC 197

Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLM 993
            +   +++P +    L PPINA +IK +  +  +                      +Q   
Sbjct: 198  KATGSRLPNNLNKSLRPPINAIYIKIHYKSEQQG-----------DTSQPRSQRHWQTSS 246

Query: 992  NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLH--HGGENSFGWPTPVQ 819
            +        M A+      ++QK  A Y AL SLN S Y    H  HGG   F WP+   
Sbjct: 247  SSQVASLAFMAAH---FHHIEQKNLANYLALMSLNTSLYHAHRHQFHGGP-PFKWPSLDT 302

Query: 818  FAAQVAWPEDGPMSREGQEVAHDGVDTQIQGDTGDAR-----GAEAEDDFME*SEDLDQ 657
            F  Q  WP D P    G E      + + +G  GD +     G + + + +E  +D DQ
Sbjct: 303  FQQQFHWPGDSPNFEGGVEEEQPEQEAEQEG--GDEKEDEEGGEDKQREDVEEGDDDDQ 359


>gb|KYP76607.1| hypothetical protein KK1_020855 [Cajanus cajan]
          Length = 338

 Score =  143 bits (361), Expect = 1e-34
 Identities = 106/304 (34%), Positives = 152/304 (50%), Gaps = 8/304 (2%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K +  I++EFYANA     +   R +WVRG M+ + RD IN  L  P      + LD Y
Sbjct: 50   KKYNEDIIKEFYANAYPLQRTDKTRNSWVRGVMVSYSRDAINECLGNPYS-LGGDDLDEY 108

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
             R  K   F+ D++A+ LC+ G TY     G P +FLR +L T+A+IW  FL  NVY   
Sbjct: 109  GRIKKAQVFNADKMAKLLCLPGCTYTVGVMGNPVSFLRKNLTTIARIWQNFLYYNVYCLT 168

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVV-----QSDNPRVLPYPALITGLC 1173
            H S+L + ++ L++SI+ K   VDIA IISD+I   V     Q+   + L +P LI GLC
Sbjct: 169  HISNLNMPRATLLYSILQKD-GVDIASIISDEIHKTVLSTPSQTGVSKPLGFPGLIRGLC 227

Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLM 993
             F  +++P +    L PPINA +IK +C N  +                      +Q   
Sbjct: 228  LFSGSRLPGNLNKSLRPPINASYIKIHCKNEQQG-----------DASQPRPQRHWQGYS 276

Query: 992  NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRF--SLHHGGENSFGWPTPVQ 819
            +     ++ M A+      ++Q+  A + AL SLN S Y       HGG   F WP+P  
Sbjct: 277  SSQAPGQEFMAAH---FHHIEQQNLANHLALMSLNTSMYHAHQQQFHGGP-LFQWPSPET 332

Query: 818  FAAQ 807
            F  Q
Sbjct: 333  FQQQ 336


>ref|XP_007153799.1| hypothetical protein PHAVU_003G065500g [Phaseolus vulgaris]
 gb|ESW25793.1| hypothetical protein PHAVU_003G065500g [Phaseolus vulgaris]
          Length = 407

 Score =  141 bits (355), Expect = 4e-33
 Identities = 104/333 (31%), Positives = 159/333 (47%), Gaps = 18/333 (5%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K    +V EFYANA           + VRGK I +DR+TIN +L  PL    +++L  Y
Sbjct: 82   DKYDPDVVLEFYANAWPVKEGDTNLRSKVRGKWIPYDRNTINDFLGNPLQ-LDQDELCTY 140

Query: 1514 TRYLKENRF---DKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVY 1347
                +   F      E +  LCI G TY+ +  GKP   +R S+ TL QIW   L SNV 
Sbjct: 141  GMLKRGTNFTSLSNTETSDLLCIPGRTYETNNNGKPLRIIRSSMTTLTQIWTSLLLSNVI 200

Query: 1346 PTIHTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVV---QSDNP---RVLPYPALI 1185
            P  H+SDL + K ++V+ ++ K  +VD+A +ISD I   V     +NP   + L +P+LI
Sbjct: 201  PNKHSSDLSMAKCHIVFCLL-KQYDVDVATLISDSIHHFVLQQGGNNPLHRKGLGFPSLI 259

Query: 1184 TGLCEFQRAQIPAHPISVLNPPINARHIKQYC----VNPLENMXXXXXXXXXXXXXXXXX 1017
            T LC     Q+  +  + + PPI+ + I++ C       L+                   
Sbjct: 260  TSLCAANGIQV--NLSTRIRPPIDKKIIQRNCSEKDQQQLQRQQSQQGQDQPVEPPINQL 317

Query: 1016 XXQYQMLMNLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHHGGENSFG 837
                   MN+L+  + +    ++  + +  +  A +RA+ SLN SF+ ++LHH  E+   
Sbjct: 318  HSPIVPEMNMLDYIKNL----ESHMKHVQMQQAANHRAMVSLNGSFHSYALHHSAESKLV 373

Query: 836  WPTPVQFAAQVAWPED----GPMSREGQEVAHD 750
            WP   +F   V WP D       S+E  E  HD
Sbjct: 374  WPNAEEFDHLVKWPGDETVLAAQSKESHEEIHD 406


>gb|KOM31456.1| hypothetical protein LR48_Vigan01g101100 [Vigna angularis]
          Length = 406

 Score =  139 bits (351), Expect = 1e-32
 Identities = 102/321 (31%), Positives = 163/321 (50%), Gaps = 11/321 (3%)
 Frame = -1

Query: 1691 KISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYT 1512
            K   +IVREFYANA   D   GE+ + VRG+ + +DR +I+ +L  PLP  +E QL  YT
Sbjct: 86   KFDLEIVREFYANAYPLDGL-GEKRSKVRGRWVTYDRASISEFLGHPLP-LAEGQLCDYT 143

Query: 1511 RYLK-ENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
            R  + +  FD++EV   + I+  +Y+    G P+  LR  +KTLAQ++  FL SN+ P  
Sbjct: 144  RRRRSQEAFDEEEVVNLIFISNRSYRLGSSGDPRRILRTDMKTLAQVFMTFLLSNIVPIG 203

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPR-------VLPYPALITG 1179
            H SDL + + +L+++IM + L VD+A II ++I   V+ +  +        L +P LIT 
Sbjct: 204  HVSDLNVPRCHLLFNIMREDLTVDVAIIIFEEIHKFVRYEVNKNNEKRKCALGFPTLITA 263

Query: 1178 LCEFQRAQIPAHPISV-LNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQ 1002
            LC+ Q  ++    +SV + P I  R I+++C NP E M                   Q  
Sbjct: 264  LCQAQGVEV---DLSVKIRPTITKRFIEKFCTNPAEIMPQLEQPVAAEQTPCPEQQPQLN 320

Query: 1001 MLMNLLNEQRKMMI-ANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHHGGENSFGWPTP 825
            +   LL + R + +    T QQ +     + +R    L++  Y     +     +   TP
Sbjct: 321  IHHELLEQMRYLRLQMEHTCQQNI-----SIHRGQLHLHEYLY-----NNVRGPYPGMTP 370

Query: 824  VQFAAQVAWPEDGPMSREGQE 762
             +F A + WP D P+   G++
Sbjct: 371  QEFLAYLQWPGDSPIFFRGRK 391


>dbj|GAU51860.1| hypothetical protein TSUD_416440 [Trifolium subterraneum]
          Length = 432

 Score =  140 bits (352), Expect = 2e-32
 Identities = 104/343 (30%), Positives = 162/343 (47%), Gaps = 8/343 (2%)
 Frame = -1

Query: 1691 KISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYT 1512
            K++ ++VREFYANA           T VRG+ I FDRDT+N +L EP      +QL  Y+
Sbjct: 83   KLNYEVVREFYANAIPIGQEPYNFTTVVRGRQIHFDRDTLNRFLGEP-SNLDSDQLCEYS 141

Query: 1511 RYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTIH 1335
              L    +  +++ RD+ I G ++Q +   + +  ++ +L   AQI  + +C N+ P  H
Sbjct: 142  EMLVRQNWPVEDMVRDIFIEGESFQLNNQREERRAIKETLTIPAQIIHLLICYNLKPRSH 201

Query: 1334 TSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQS----DNPRVLPYPALITGLCEF 1167
                 + ++ L+W I+    EVDIA++I++++ SV +S    D   VLPYP LI GLCE 
Sbjct: 202  VHTAPMDRATLIWYILTGR-EVDIARVIANEMRSVAESGIKNDAKPVLPYPGLIIGLCEA 260

Query: 1166 QRAQIPAHPISVLNPPINARHIKQYC-VNPLENMXXXXXXXXXXXXXXXXXXXQYQMLMN 990
            +   IPA      +  IN ++IK+YC +  ++                       Q   N
Sbjct: 261  EHVHIPAIVSHTTDKLINDKYIKRYCKLKEVQQQPQQQPQAPQLPAAPLHPVEPQQAYPN 320

Query: 989  LLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHHG--GENSFGWPTPVQF 816
            +  + R       T  Q       + +RAL+ L +S YR  L  G    + +    P  F
Sbjct: 321  I--DPRLQNWFYHTWDQN-----TSNHRALTVLQESMYRMQLDQGVPVNHDYQVMDPQHF 373

Query: 815  AAQVAWPEDGPMSREGQEVAHDGVDTQIQGDTGDARGAEAEDD 687
               +AWP D P    G E ++ G D     D+ D   A+A DD
Sbjct: 374  QTHIAWPGDRPQFTGGAETSNVGGDND---DSIDEAAADAMDD 413


>gb|KYP64073.1| hypothetical protein KK1_018661 [Cajanus cajan]
          Length = 331

 Score =  136 bits (342), Expect = 5e-32
 Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 6/208 (2%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K +  I++EFYANA         R +WVRG  I + RD IN YL  P     +  LD Y
Sbjct: 99   KKYNEDIIKEFYANAYPLQRHDQTRNSWVRGVTISYSRDAINEYLGNPYSLGGDG-LDEY 157

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
             R  K   F+ D++A+ LC+ G TY     G P +FLR +L TLA+IW  FL  NVY   
Sbjct: 158  GRLKKARGFNADKMAKLLCLPGCTYTVGVTGNPVSFLRKNLTTLARIWQNFLYCNVYSIT 217

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173
            H SDL + ++ L++SI+ K+  VDIA IISD+I   V S        + L +P LITGLC
Sbjct: 218  HISDLNMPRATLLYSILQKN-GVDIASIISDEIHKTVLSTPSMTGVSKPLGFPGLITGLC 276

Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYC 1089
                +++P +    L PPIN  +IK +C
Sbjct: 277  LAGGSRLPNNLNKSLRPPINVAYIKIHC 304


>gb|KYP44892.1| hypothetical protein KK1_033572 [Cajanus cajan]
 gb|KYP44914.1| hypothetical protein KK1_033595 [Cajanus cajan]
          Length = 268

 Score =  132 bits (333), Expect = 2e-31
 Identities = 78/214 (36%), Positives = 125/214 (58%), Gaps = 10/214 (4%)
 Frame = -1

Query: 1694 EKISAQIVREFYANA-SSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDG 1518
            EK + +IV+EFYANA       +  +++WVRG+ + +DRD IN  L EP+    E Q   
Sbjct: 8    EKYNEKIVKEFYANAWPIRRDYEVIKKSWVRGRWVPYDRDAINKLLGEPMV-LREGQSCS 66

Query: 1517 YTRYLKENR--FDKDEVARDLCIAGHTYQDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYP 1344
            Y +++K +R  F+  EVA+ L + G +Y+   G  +  +R  +  +A++W  FL +NV P
Sbjct: 67   Y-QFIKSSRHGFNNPEVAKLLSLPGQSYESNKGFARRIMRGKMTKIARVWMTFLFANVTP 125

Query: 1343 TIHTSDLKLKKSYLVWSIMVKH-LEVDIAQIISDKILSVVQSD------NPRVLPYPALI 1185
            T H  D+++ +++L++SI+  H   VDI  IISD++   V S       + + L +PALI
Sbjct: 126  TTHVLDIRMSRAHLLYSILHSHAYRVDITAIISDEMYQFVTSSPSKKTISAKSLGFPALI 185

Query: 1184 TGLCEFQRAQIPAHPISVLNPPINARHIKQYCVN 1083
            T LC+     IPA P++ +  PINA  I ++C N
Sbjct: 186  TALCKAHGVVIPAKPLTKIRGPINATFIDKFCNN 219


>gb|KYP50043.1| hypothetical protein KK1_028198 [Cajanus cajan]
          Length = 361

 Score =  134 bits (336), Expect = 7e-31
 Identities = 82/207 (39%), Positives = 116/207 (56%), Gaps = 6/207 (2%)
 Frame = -1

Query: 1691 KISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYT 1512
            K +  I++EFYANA         R +WVR  ++ + RD IN YL  P     ++ LD Y 
Sbjct: 103  KYNEDIIKEFYANAFPLQKHDQTRNSWVREVIVSYARDAINEYLGSPYS-LGDDGLDEYG 161

Query: 1511 RYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTIH 1335
            R  K   F  D++ + LC+ G TY     G P + LR +L T+A+IW  FL  NVY  IH
Sbjct: 162  RLKKARAFKADKMVKLLCLPGCTYTVGVTGNPVSILRKNLTTIARIWQNFLYCNVYSIIH 221

Query: 1334 TSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPRV-----LPYPALITGLCE 1170
             SDL + ++ L++SI+ K   VDI  IISD+I  VV S    +     L +P L+TGLC+
Sbjct: 222  ISDLNMPRATLLYSILTKD-GVDITLIISDEIHKVVLSTPSLIGVSKPLGFPGLLTGLCK 280

Query: 1169 FQRAQIPAHPISVLNPPINARHIKQYC 1089
               +++P +    L PPINA +IK +C
Sbjct: 281  ASGSRLPNNLNKSLRPPINASYIKIHC 307


>gb|KYP40386.1| hypothetical protein KK1_038278 [Cajanus cajan]
          Length = 371

 Score =  133 bits (335), Expect = 1e-30
 Identities = 81/208 (38%), Positives = 118/208 (56%), Gaps = 6/208 (2%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K +  I++EF ANA     +   R +WVRG M+ + RD IN YL  P     +  LD Y
Sbjct: 30   KKYNEDIIKEFNANAYPLQRTDKTRNSWVRGAMVSYSRDAINEYLGNPYSLGGDG-LDEY 88

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
             R  K   F+ D++ + LC+ G TY     G P +FLR +L T A+IW  FL  NVY   
Sbjct: 89   GRIKKARAFNADKMDKLLCLPGCTYTVGVTGNPDSFLRKNLTTTARIWQNFLYCNVYCLT 148

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173
            H SDL + ++ L++S++ K+  +DI  IIS++I  +V S        + L +P LITGLC
Sbjct: 149  HISDLNMPRATLLYSVLRKN-GMDITSIISNEIHKIVLSTPSLTGVSKPLGFPGLITGLC 207

Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYC 1089
             F  +++P +    L PPINA +IK +C
Sbjct: 208  LFSGSRLPGNLNKSLRPPINAAYIKIHC 235


>gb|KYP75955.1| hypothetical protein KK1_020168 [Cajanus cajan]
          Length = 446

 Score =  132 bits (333), Expect = 7e-30
 Identities = 101/335 (30%), Positives = 158/335 (47%), Gaps = 26/335 (7%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K +  I+REFYAN+        +R +WVRGK I +D + IN +L      Y+  + D Y
Sbjct: 115  DKYNEIIIREFYANSFPVRPDSKDRISWVRGKTIAYDPEAINTFLQTE---YTIPEEDDY 171

Query: 1514 TRYLKE--NRFDKDEVARDLCIAGHTYQD-EPGKPKTFLRPSLKTLAQIWFIFLCSNVYP 1344
             + +K   N    + V   L + G  YQ     +P   LR  LK+L ++W + L SNV P
Sbjct: 172  RKLMKTAMNEEMSNLVLETLSLLGSQYQTGTKNQPTHILRADLKSLVRLWQVILYSNVVP 231

Query: 1343 TIHTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVV-----QSDNPRVLPYPALITG 1179
              HTSD+ + K+ L++ I+++  +VDIA +IS++I ++V     +S   R L +P LITG
Sbjct: 232  LTHTSDITISKAKLIFCILLQK-DVDIATLISNEIHAIVLSKPSKSGTVRPLAFPGLITG 290

Query: 1178 LCEFQRAQIPAHPISVLNPPINARHIKQYCVNPLE------------NMXXXXXXXXXXX 1035
            LC+ +R  IP  P+  +   I+   +   C NP E                         
Sbjct: 291  LCKAKRVVIP-QPLVPIRRTIDHVFVNARCYNPREFPRASRRSRPPPTQSPPPVTSPPVL 349

Query: 1034 XXXXXXXXQYQMLM----NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFS 867
                      Q  M      + EQ ++ +A+     +L Q   A +R   +L+  FY ++
Sbjct: 350  PTGPFDLSTMQACMMQHFQHMEEQHQLEMAH-LRHVKLQQA--ANHRGQLALHSYFYNYT 406

Query: 866  LHHG--GENSFGWPTPVQFAAQVAWPEDGPMSREG 768
            LH    G + + WPTP QF   + WP D P+   G
Sbjct: 407  LHQANTGGSLYPWPTPEQFQDAILWPGDNPVFSGG 441


>gb|KYP63063.1| hypothetical protein KK1_017628 [Cajanus cajan]
          Length = 234

 Score =  125 bits (314), Expect = 4e-29
 Identities = 82/210 (39%), Positives = 113/210 (53%), Gaps = 8/210 (3%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K +  I++EFYANA         R  WVRG  I + RD IN YL  P     +  LD Y
Sbjct: 20   KKYNKDIIKEFYANAFPLQRLDQTRNYWVRGVTISYSRDAINEYLGSPYSLGGDG-LDEY 78

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTY-QDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
             R  K   F+ D++A+ LC+   TY     G P + LR +L T+A+IW  FL  NVY   
Sbjct: 79   GRLKKPRAFNVDKMAKLLCLPSCTYIVGVIGNPVSILRKNLTTIARIWQNFLYCNVYSIT 138

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKI-------LSVVQSDNPRVLPYPALITG 1179
            H SDL + ++ L++SI+ K   VDIA IIS++I       LS+     P  L +P LITG
Sbjct: 139  HISDLNMPRATLLYSILTKD-GVDIASIISNEIHKTVLSTLSITGVSKP--LGFPGLITG 195

Query: 1178 LCEFQRAQIPAHPISVLNPPINARHIKQYC 1089
            LC    +++P +    L PPIN  +I  +C
Sbjct: 196  LCMASGSRLPNNLNKSLRPPINVAYINIHC 225


>gb|KHN31113.1| hypothetical protein glysoja_046590, partial [Glycine soja]
          Length = 380

 Score =  128 bits (322), Expect = 8e-29
 Identities = 96/316 (30%), Positives = 143/316 (45%), Gaps = 13/316 (4%)
 Frame = -1

Query: 1691 KISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYT 1512
            K    IV EFYANA   +    +  +WVRG+ I FD D ++ +L +PL      + +   
Sbjct: 82   KFDPDIVLEFYANALPTEEGVRDMRSWVRGQWISFDADALSQFLGDPLVLEEGQECEFSQ 141

Query: 1511 RYLKENRFDKDEVARDLCIAGHTY-QDEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTIH 1335
            R    + FD++ +A  LC+ G  + Q   G+    +R S+ TL Q+W   L SNV P+ H
Sbjct: 142  RRNMADGFDEEAIAHLLCMPGQDFAQTAVGRRVRIMRTSMTTLTQMWMTLLLSNVLPSDH 201

Query: 1334 TSDLKLKKSYLVWSIMVKHLEVDIAQIISDKIL----------SVVQSDNPRVLPYPALI 1185
              DL L K  LV++I+ + + V +AQ+I+D I            +    + R L +PALI
Sbjct: 202  NFDLPLPKCQLVYAILTQ-MSVHVAQLIADAIYLFAGMPPTRHPLDPDKSSRALGFPALI 260

Query: 1184 TGLCEFQRAQIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQY 1005
            TGLC  Q   +P  P  V+  PI    I++YC  P +                       
Sbjct: 261  TGLC--QSFGVPVTPSKVIRSPITRAFIEKYC-TPRQAQGDAHQAADAPPPPHQADP--- 314

Query: 1004 QMLMNLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHH--GGENSFGWP 831
                  L  +R +        Q L ++  A +R    +++  Y+ SL     G   F  P
Sbjct: 315  ---AESLGMERYL--------QHLVRQQAANHRGQVQIHECLYQLSLSQQVQGFAPFACP 363

Query: 830  TPVQFAAQVAWPEDGP 783
            TP QF  +VAWP D P
Sbjct: 364  TPDQFRDEVAWPGDWP 379


>gb|KHN15637.1| hypothetical protein glysoja_031426 [Glycine soja]
          Length = 330

 Score =  124 bits (312), Expect = 7e-28
 Identities = 76/203 (37%), Positives = 115/203 (56%), Gaps = 1/203 (0%)
 Frame = -1

Query: 1688 ISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGYTR 1509
            I   IV+EFYAN    +  +  ++  VRG +I FD DT+N +L  P+       L  Y+R
Sbjct: 77   IDVAIVKEFYANLYDSED-KSPKQVKVRGHLIKFDEDTLNTFLKTPVILEEGENLCAYSR 135

Query: 1508 YLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTIHT 1332
            +    R D  E+A  LCI G  ++ +  G P   LR +L TLAQ W +   SN+ PT HT
Sbjct: 136  FALL-RPDPQELAAKLCIPGRGFELNADGHPLKILRKNLTTLAQTWSVLSFSNLIPTSHT 194

Query: 1331 SDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPRVLPYPALITGLCEFQRAQI 1152
            SD+ L ++ L++ I+VK ++ ++  +IS +I    Q D+ R L +PALIT LC+ +  Q 
Sbjct: 195  SDVTLDRAKLIYGIIVK-MDTNVGYLISHQISITAQHDSSR-LGFPALITALCKARGVQS 252

Query: 1151 PAHPISVLNPPINARHIKQYCVN 1083
             +  +  L+P IN  +IK+ C N
Sbjct: 253  DSRSLESLSPAINLAYIKKNCWN 275


>gb|KYP32287.1| Retrovirus-related Pol polyprotein from transposon opus [Cajanus
            cajan]
          Length = 481

 Score =  125 bits (315), Expect = 3e-27
 Identities = 80/208 (38%), Positives = 112/208 (53%), Gaps = 6/208 (2%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K    I++EFYAN      +   R +WVRG M+ + RD IN YL  P      + LD Y
Sbjct: 251  KKYHEDIIKEFYANVYPLQRTDKIRNSWVRGAMVSYSRDAINEYLGNPY-SLGGDGLDEY 309

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
             R  K   F+ D++A+ LC+ G TY     G P +FLR +L T  +IW  FL  NVY   
Sbjct: 310  GRLKKARAFNADKMAKLLCLPGCTYTVGVTGNPVSFLRKNLTTTTRIWQNFLYCNVYCIT 369

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDN-----PRVLPYPALITGLC 1173
            H SDL + +  L++S++ K   VDIA II D+I   V S        + L +P LITGLC
Sbjct: 370  HISDLNMPRETLLYSVLQK-TGVDIAAIILDEIHKTVLSTPSLTGVSKPLGFPGLITGLC 428

Query: 1172 EFQRAQIPAHPISVLNPPINARHIKQYC 1089
             F  +++ ++    L PP N  +IK +C
Sbjct: 429  LFNGSRLLSNLNKSLWPPTNVAYIKIHC 456


>ref|XP_014622353.1| PREDICTED: uncharacterized protein LOC100778023 [Glycine max]
          Length = 2264

 Score =  128 bits (322), Expect = 3e-27
 Identities = 92/328 (28%), Positives = 149/328 (45%), Gaps = 4/328 (1%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            ++I   +V+EFY+N    +     R   VRG+++ FD DTIN +LD P+      +   Y
Sbjct: 1923 KRIDVALVKEFYSNLYDPED-HSPRFCRVRGQVVRFDADTINDFLDTPVILEDGEEYTAY 1981

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
            TRYL  +  D D +A  LC  G  +  +  G P   LR  + TLAQ W +    ++ PT 
Sbjct: 1982 TRYLSTHP-DPDTIAATLCTPGGRFVLNADGLPWKLLRKDMTTLAQTWSVLSYYDLAPTS 2040

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPRVLPYPALITGLCEFQRA 1158
            HTSD+ L ++ L++  +V  +++D+   IS +I  + QS   R L +PALIT LC+ Q  
Sbjct: 2041 HTSDVNLDRARLIYG-LVSRMDMDVGSFISQQISQIAQSSTSR-LGFPALITALCDIQGV 2098

Query: 1157 QIPAHPISVLNPPINARHIKQYCVNPLENMXXXXXXXXXXXXXXXXXXXQYQMLMNLLNE 978
                     L+P IN  ++K+ C NP +                           +    
Sbjct: 2099 VFDTLIFESLSPAINLAYVKKNCWNPADPSITFPGPRRTRTRASASAPEAPLPTQSPSQP 2158

Query: 977  QRKMMIANQTTQQRLD---QKLDATYRALSSLNDSFYRFSLHHGGENSFGWPTPVQFAAQ 807
             ++      +T   +D   Q L + +     + ++ +R SLH   +      TP  +  +
Sbjct: 2159 SQRPRHPPASTSASMDMHGQMLRSLHVGQQLIMENMHRLSLHLQMDPPL--TTPEAYRQR 2216

Query: 806  VAWPEDGPMSREGQEVAHDGVDTQIQGD 723
            VAWP D P +  G+E +    D  +  D
Sbjct: 2217 VAWPGDQPSTDRGEEPSGAAEDPAVDED 2244


>ref|XP_014630540.1| PREDICTED: uncharacterized protein LOC106798466 [Glycine max]
          Length = 1749

 Score =  128 bits (321), Expect = 4e-27
 Identities = 92/328 (28%), Positives = 147/328 (44%), Gaps = 4/328 (1%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            ++I   +V+EFY+N    +     R   VRG+++ FD DTIN +LD P+      +   Y
Sbjct: 1408 KRIDVALVKEFYSNLYDPED-HSPRFCRVRGQVVRFDADTINDFLDTPVILEVGEEYPAY 1466

Query: 1514 TRYLKENRFDKDEVARDLCIAGHTYQ-DEPGKPKTFLRPSLKTLAQIWFIFLCSNVYPTI 1338
            TRYL  +  D D +A  LC  G  +  +  G P   LR  + TLAQ W +    ++ PT 
Sbjct: 1467 TRYLSTHP-DPDTIAATLCTPGGRFVLNADGLPWKLLRKDMTTLAQTWSVLSYYDLAPTS 1525

Query: 1337 HTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVVQSDNPRVLPYPALITGLCEFQRA 1158
            HTSD+ L ++ L++  +V  +++D+   IS +I  + QS   R L +PALIT LC+ Q  
Sbjct: 1526 HTSDVNLDRARLIYG-LVSRMDMDVGSFISQQISQIAQSSTSR-LGFPALITALCDIQGV 1583

Query: 1157 QIPAHPISVLNPPINARHIKQYCVNPLE---NMXXXXXXXXXXXXXXXXXXXQYQMLMNL 987
                     L+P IN  ++K+ C NP +                          Q     
Sbjct: 1584 VSDTLIFESLSPAINLAYVKKNCWNPADPSITFPGPRRTRTRASASASEAPLPTQSPSQP 1643

Query: 986  LNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFYRFSLHHGGENSFGWPTPVQFAAQ 807
                R +  +   +     Q L + +     + ++ +R SLH   +      TP  +  +
Sbjct: 1644 SQRPRHLPASTSASMDTHGQMLRSLHVGQQLIMENMHRLSLHLQMDPPL--TTPEAYRQR 1701

Query: 806  VAWPEDGPMSREGQEVAHDGVDTQIQGD 723
            VAWP D P +  G+E +    D  +  D
Sbjct: 1702 VAWPGDQPSTDRGEEPSGAAEDPAVDED 1729


>gb|KYP44107.1| hypothetical protein KK1_034423, partial [Cajanus cajan]
          Length = 384

 Score =  122 bits (305), Expect = 2e-26
 Identities = 109/375 (29%), Positives = 170/375 (45%), Gaps = 32/375 (8%)
 Frame = -1

Query: 1694 EKISAQIVREFYANASSGDSSQGERETWVRGKMILFDRDTINMYLDEPLPPYSENQLDGY 1515
            +K +  I REFYANA     +  +R +WVRGK I +D  TIN +L      Y+  + D Y
Sbjct: 13   DKYNEIITREFYANAFPVRPNSKDRISWVRGKTIAYDPATINTFLQTG---YTIPEQDDY 69

Query: 1514 TRYLKENRFDKDE-----VARDLCIAGHTYQD-EPGKPKTFLRPSLKTLAQIWFIFLCSN 1353
             + +   R   DE     +   L + G  YQ     +P   LR  LK+L ++W   L SN
Sbjct: 70   RKLM---RVAMDEEMSTLMLETLSLPGSQYQTGTKSQPTHILRADLKSLVRLWQAVLYSN 126

Query: 1352 VYPTIHTSDLKLKKSYLVWSIMVKHLEVDIAQIISDKILSVV-----QSDNPRVLPYPAL 1188
            V+P  HTSD+ + K+ L++ I+++  +VDIA +IS++I ++V     +S   R L +P L
Sbjct: 127  VFPLTHTSDITISKAKLIFCILLQK-DVDIATLISNEIHAIVLSKPSKSGAVRPLAFPGL 185

Query: 1187 ITGLCEFQRAQIPAHPISVLNPPINARHIKQYCVNPLE------------NMXXXXXXXX 1044
            ITGLC+ +   IP  P+  +   I+   +     NP E                      
Sbjct: 186  ITGLCKAKGVVIP-QPLVPIRRTIDHVFVNACRYNPREFPRASSRSGPPPTQSTPPVTSP 244

Query: 1043 XXXXXXXXXXXQYQMLM----NLLNEQRKMMIANQTTQQRLDQKLDATYRALSSLNDSFY 876
                         Q  M      + EQ ++ +A+    + +  +  A +R   +L+  FY
Sbjct: 245  PVPPTGSFDLSTMQACMMQHFQHMEEQHQLEMAH---LRHVQLQQAANHRGQVALHSYFY 301

Query: 875  RFSLHHG--GENSFGWPTPVQFAAQVAWPEDGPMSREG---QEVAHDGVDTQIQGDTGDA 711
             ++L+    G + + WPTP QF   + WP D P+   G    E  H G     QG   DA
Sbjct: 302  HYTLNQASTGGSLYPWPTPEQFQDVIRWPGDSPVFSGGGGESEQPHVGE----QGQRSDA 357

Query: 710  RGAEAEDDFME*SED 666
               E  +D  E +E+
Sbjct: 358  EVEEEGNDGGEENEE 372


Top