BLASTX nr result

ID: Wisteria21_contig00012880 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Wisteria21_contig00012880
         (1736 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004500362.1| PREDICTED: uncharacterized protein LOC101503...   580   e-162
ref|XP_007146880.1| hypothetical protein PHAVU_006G078200g [Phas...   494   e-136
ref|XP_014489681.1| PREDICTED: uncharacterized protein LOC106752...   487   e-134
gb|KOM52707.1| hypothetical protein LR48_Vigan09g136600 [Vigna a...   474   e-131
gb|KHN18665.1| Telomeric repeat-binding factor 1 [Glycine soja]       300   4e-78
ref|XP_003552913.1| PREDICTED: uncharacterized protein LOC100791...   300   4e-78
gb|KRH30478.1| hypothetical protein GLYMA_11G186600 [Glycine max...   292   6e-76
ref|XP_006591259.1| PREDICTED: uncharacterized protein LOC100819...   292   6e-76
gb|KHN11870.1| Telomeric repeat-binding factor 1 [Glycine soja]       291   2e-75
ref|XP_006602062.1| PREDICTED: uncharacterized protein LOC100791...   276   5e-71
ref|XP_013460445.1| myb-like DNA-binding domain protein [Medicag...   263   3e-67
gb|KRG98141.1| hypothetical protein GLYMA_18G052700 [Glycine max]     214   3e-52
ref|XP_006602063.1| PREDICTED: uncharacterized protein LOC100791...   214   3e-52
gb|KRH30481.1| hypothetical protein GLYMA_11G186600 [Glycine max]     202   8e-49
ref|XP_002276395.2| PREDICTED: uncharacterized protein LOC100244...   169   6e-39
emb|CAN65086.1| hypothetical protein VITISV_035031 [Vitis vinifera]   169   6e-39
ref|XP_010108474.1| hypothetical protein L484_010667 [Morus nota...   151   2e-33
gb|KNA10403.1| hypothetical protein SOVF_144720 isoform C [Spina...   149   6e-33
gb|KNA10402.1| hypothetical protein SOVF_144720 isoform B [Spina...   149   6e-33
gb|KNA10401.1| hypothetical protein SOVF_144720 isoform A [Spina...   149   6e-33

>ref|XP_004500362.1| PREDICTED: uncharacterized protein LOC101503526 [Cicer arietinum]
          Length = 504

 Score =  580 bits (1494), Expect = e-162
 Identities = 326/533 (61%), Positives = 365/533 (68%), Gaps = 5/533 (0%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MDKDIS WV+EFLLRSSVPDSLIQKTLTVLP+S AD                LKASLSE 
Sbjct: 1    MDKDISNWVMEFLLRSSVPDSLIQKTLTVLPVSAADSRLKKTLLLRILQTQHLKASLSEM 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            +LHI      L   NDAVPV D+MR AYCAVAV+ TVKYL +SP+DPSGEYF+AVRRIWR
Sbjct: 61   SLHILEQLEEL-HCNDAVPVPDAMRSAYCAVAVDSTVKYLISSPEDPSGEYFSAVRRIWR 119

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
            GRV ++S+     RRSG+ SDEL +W +D+EAALWD RVSERL GLNTRRDA+ EV+R+L
Sbjct: 120  GRVVKLSALE---RRSGIFSDELIQWAEDIEAALWDVRVSERLVGLNTRRDAMNEVKRYL 176

Query: 1082 KEAWETMGPSFLDSVATMSKAK-----GVCEIASGGEPVRKRDGWLEGSGKEKRDSFGSD 918
            KEAW  MGPSFLDS+A+ SKAK     GVCEIASGGE +RKRD      GKEK    G  
Sbjct: 177  KEAWGLMGPSFLDSMASFSKAKDSRPEGVCEIASGGEMLRKRDDL----GKEKMYYVGDG 232

Query: 917  RXXXXXXXXXXXXXXXXXXXXXXXXXXXXGACHXXXXXXXXXXXXXXERVGASVGASQEA 738
                                                            RVG SV  +QE 
Sbjct: 233  DDKNDNDNVVDDSDDDVAEVSENLGDEQLEE-----------------RVGTSVDPNQEV 275

Query: 737  SGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCHRGVKISGAGEVGAAKLWSKYDPLPSA 558
             G  CDS KGDK+  I+KGNLQ KRK+S+LRTCHRGVKISGA EV    L  KYD LPSA
Sbjct: 276  GG--CDSLKGDKE--IRKGNLQPKRKYSSLRTCHRGVKISGAEEVRPTNLLCKYDSLPSA 331

Query: 557  EVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIENQSGDVYVPD 378
            EV KVRESLKSSSMEL+ALVKDPLPD LHTSE+VRS+L TKDIN GPPIENQS  V VPD
Sbjct: 332  EVKKVRESLKSSSMELKALVKDPLPDALHTSELVRSKLATKDINLGPPIENQSQRVDVPD 391

Query: 377  SDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSIDNLXXXXXXXX 198
            S+ C+ +V+YQP  A L   S  HCSN H PNLME+ SSAR  EW+DSI+NL        
Sbjct: 392  SNACKIVVLYQPKDAKLEKKSVAHCSNSHRPNLMEQASSARASEWNDSIENLPHESQPRR 451

Query: 197  XXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR 39
               +W  LEEETLRAGVKMFGEGNW TIR+FYSNIFEYRSGVDLKDKWRNM+R
Sbjct: 452  KKRRWTSLEEETLRAGVKMFGEGNWRTIRDFYSNIFEYRSGVDLKDKWRNMLR 504


>ref|XP_007146880.1| hypothetical protein PHAVU_006G078200g [Phaseolus vulgaris]
            gi|561020103|gb|ESW18874.1| hypothetical protein
            PHAVU_006G078200g [Phaseolus vulgaris]
          Length = 473

 Score =  494 bits (1271), Expect = e-136
 Identities = 287/528 (54%), Positives = 333/528 (63%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD DIS WV+EFLLRSSVPDSLIQKTL VLPLSGAD                LKASLSET
Sbjct: 1    MDSDISRWVMEFLLRSSVPDSLIQKTLIVLPLSGADSRLKKTLLLRILRTLLLKASLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I      LD  + +       RRAY AVAVECTVKYLAA+PDD  GE+  AV RIWR
Sbjct: 61   ALQILELLEDLDGASASAV----RRRAYLAVAVECTVKYLAAAPDDADGEFSGAVNRIWR 116

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
            GRV     AA E RRSGL+S EL+RW DD+EAAL D R  ERLA LN+RR+A+ EVR +L
Sbjct: 117  GRV-----AALEARRSGLVSGELARWRDDLEAALGDSRACERLADLNSRREAMNEVRAYL 171

Query: 1082 KEAWETMGPSFLDSVATMSKAKGVCEIASGGEPVRKRDGWLEGSGKEKRDSFGSDRXXXX 903
            KEAWE+MGPSFL+SVA MSK              +++D ++    K   D    +     
Sbjct: 172  KEAWESMGPSFLESVAAMSKGL-----------TKEKDDFVVSGNKRDNDHENDN----- 215

Query: 902  XXXXXXXXXXXXXXXXXXXXXXXXGACHXXXXXXXXXXXXXXERVGASVGASQEASGSVC 723
                                     AC               +++   + A+QE  G   
Sbjct: 216  ------------------------DAC--MEDVAMHDENQGKQQLEEKIDANQEVGGR-- 247

Query: 722  DSAKGDKDKAIQKGNLQLKRKHSALRTCHRGVKISGAGEVGAAKLWSKYDPLPSAEVNKV 543
                 + DK IQK NL++K KHSALR CHRGVKISG+ EV +AK WSK+D +PS E+ KV
Sbjct: 248  -DLLLESDKVIQKQNLRVKHKHSALRACHRGVKISGSEEVESAKSWSKHDSVPS-EIRKV 305

Query: 542  RESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIENQSGDVYVPDSDVCR 363
            RESLKSS+ ELRALV DPLPD L  SE+VRS+L T D N  PP ENQS  + VPD DVC+
Sbjct: 306  RESLKSSTCELRALVNDPLPDALRLSEVVRSKLATSDTNIEPPTENQSSHIDVPDPDVCQ 365

Query: 362  SIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSIDNLXXXXXXXXXXXKW 183
            SIV +QP+ A+L   S VHCSN+H PNL+ER  SART EWDDSIDN            +W
Sbjct: 366  SIVPFQPNDAHLREKSSVHCSNIHQPNLIERNRSARTIEWDDSIDNSPLARQSRRRKRRW 425

Query: 182  MPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR 39
              LEEETLRAGVKMFGEGNWATIR+FYSNIF+ RSGVDLKDKWRNMIR
Sbjct: 426  SSLEEETLRAGVKMFGEGNWATIRSFYSNIFDNRSGVDLKDKWRNMIR 473


>ref|XP_014489681.1| PREDICTED: uncharacterized protein LOC106752502 [Vigna radiata var.
            radiata] gi|951068415|ref|XP_014489682.1| PREDICTED:
            uncharacterized protein LOC106752502 [Vigna radiata var.
            radiata]
          Length = 477

 Score =  487 bits (1253), Expect = e-134
 Identities = 287/528 (54%), Positives = 334/528 (63%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD DIS WV EFLLRSSVPDSLI+K L VLPLSGAD                LKASLSET
Sbjct: 1    MDTDISRWVTEFLLRSSVPDSLIRKALIVLPLSGADSRFKKTLLLRTLRTLLLKASLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I      LD  + +V    ++RRAY AVAVECTVKYLAA+PDDP GE+  AV+RIWR
Sbjct: 61   ALQILEHLEELDGVSTSV----ALRRAYLAVAVECTVKYLAAAPDDPEGEFSGAVKRIWR 116

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
            GRV     AA E RRSGL+S EL RW DDVEAAL D R  ERLA  N+RR+A+ EVR +L
Sbjct: 117  GRV-----AALEARRSGLVSGELVRWRDDVEAALGDSRACERLADFNSRREAMKEVRAYL 171

Query: 1082 KEAWETMGPSFLDSVATMSKAKGVCEIASGGEPVRKRDGWLEGSGKEKRDSFGSDRXXXX 903
            KEA E+MGPSFL+SVA MSK          G    K D  + G G +     G+D     
Sbjct: 172  KEASESMGPSFLESVAAMSK----------GLTKEKDDFVISGDGCDSDHDNGND----- 216

Query: 902  XXXXXXXXXXXXXXXXXXXXXXXXGACHXXXXXXXXXXXXXXERVGASVGASQEASGSVC 723
                                      C               +++   + A+QE  G   
Sbjct: 217  ----------------------GDHGC--MEDVVMHDENQTKQQLEEKIEANQEVGGR-- 250

Query: 722  DSAKGDKDKAIQKGNLQLKRKHSALRTCHRGVKISGAGEVGAAKLWSKYDPLPSAEVNKV 543
                   DK IQK NL++K KHSALR CHRGVKI+G+ EV +AK  SK+D +PS+EV KV
Sbjct: 251  -DLLLQGDKVIQKRNLRVKHKHSALRACHRGVKINGSEEVESAKSLSKHDSVPSSEVKKV 309

Query: 542  RESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIENQSGDVYVPDSDVCR 363
            RESLKSSS +LRALVKDPLPD LH SE VRS+L   D N  PPIE QS DV VPD +VC+
Sbjct: 310  RESLKSSSYDLRALVKDPLPDALHLSEAVRSKLANSDTNIEPPIEKQSPDVDVPDPNVCQ 369

Query: 362  SIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSIDNLXXXXXXXXXXXKW 183
            SIV++Q + ANLG  S VH S++H PNLMER  SA+T+EWDDSIDN            +W
Sbjct: 370  SIVLFQSNDANLGKESSVHSSDLHQPNLMERNRSAQTFEWDDSIDNSPQARQTRRRKRRW 429

Query: 182  MPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR 39
              LEEETLRAGVKMFGEGNWATI++FYSNIF+ RSGVDLKDKWRNM+R
Sbjct: 430  SSLEEETLRAGVKMFGEGNWATIKSFYSNIFDNRSGVDLKDKWRNMLR 477


>gb|KOM52707.1| hypothetical protein LR48_Vigan09g136600 [Vigna angularis]
          Length = 472

 Score =  474 bits (1221), Expect = e-131
 Identities = 282/528 (53%), Positives = 336/528 (63%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD DIS WV+EFLLRSSVPDSLIQK L VLPLSGAD                LKASLSET
Sbjct: 1    MDTDISRWVMEFLLRSSVPDSLIQKALIVLPLSGADSRLKKTLLLRTLRTLLLKASLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I      LD ++ +V    ++RRAY AVAVECTVKYLAA+PDDP G +  AV+RIWR
Sbjct: 61   ALQIFEHLEELDGDSTSV----ALRRAYLAVAVECTVKYLAAAPDDPEGVFSGAVKRIWR 116

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
            GRV     AA E RRSGL+S EL++W DDVEAAL D R  ERLA  N+RR+A+ EVR +L
Sbjct: 117  GRV-----AALEARRSGLVSGELAQWRDDVEAALGDSRACERLADFNSRREAMKEVRAYL 171

Query: 1082 KEAWETMGPSFLDSVATMSKAKGVCEIASGGEPVRKRDGWLEGSGKEKRDSFGSDRXXXX 903
            KEAWE+MGPSFL+SVA MS  KG+ +         K D  + G   +     G++     
Sbjct: 172  KEAWESMGPSFLESVAAMS--KGLTK--------EKEDFVISGDRCDNDQDKGNN----- 216

Query: 902  XXXXXXXXXXXXXXXXXXXXXXXXGACHXXXXXXXXXXXXXXERVGASVGASQEASGSVC 723
                                     A H               ++   + A+QE  G  C
Sbjct: 217  ----------------GDHGCMEGVAMHDENQTKQ--------QLEEKIEANQEVGG--C 250

Query: 722  DSAKGDKDKAIQKGNLQLKRKHSALRTCHRGVKISGAGEVGAAKLWSKYDPLPSAEVNKV 543
            D      DK IQK NL++K KHSALR CHRGVKI+ + E       SK+D +PS+EV +V
Sbjct: 251  DLPL-QGDKVIQKRNLRVKHKHSALRACHRGVKINYSEEGR-----SKHDSVPSSEVKEV 304

Query: 542  RESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIENQSGDVYVPDSDVCR 363
            RESLKSSS +LRALVKDPLPD LH SE +RS+L T D N  PPIE QS +V +PD DVC+
Sbjct: 305  RESLKSSSYDLRALVKDPLPDALHLSEAIRSKLATSDTNIEPPIEKQSPNVDLPDPDVCQ 364

Query: 362  SIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSIDNLXXXXXXXXXXXKW 183
            +IV++QP+ ANLG  S V CS++H PNLMER  SA+T EW+DSIDN            +W
Sbjct: 365  TIVLFQPNDANLGKKSSVCCSDLHQPNLMERNRSAQTSEWNDSIDNSPQARQPKRRKRRW 424

Query: 182  MPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR 39
              LEEETLRAGVKMFGEGNWATI++FYSNIF+ RSGVDLKDKWRNMIR
Sbjct: 425  SSLEEETLRAGVKMFGEGNWATIKSFYSNIFDNRSGVDLKDKWRNMIR 472


>gb|KHN18665.1| Telomeric repeat-binding factor 1 [Glycine soja]
          Length = 432

 Score =  300 bits (767), Expect = 4e-78
 Identities = 157/243 (64%), Positives = 175/243 (72%), Gaps = 2/243 (0%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCH--RGVKISGAGEVGAAKL 588
           SV A+QE  GS        +DKAI K N QLK KHSA R  H  RG+KIS   EV + K 
Sbjct: 193 SVDANQEVGGS---DLTPQRDKAILKRNPQLKHKHSAFRASHKGRGIKISSPEEVESTKP 249

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           W K+DP+PSAEV K+RESLKSSS EL+ALV DPLPD LH S++VRS+L T D    PPIE
Sbjct: 250 WRKHDPVPSAEVKKIRESLKSSSSELQALVNDPLPDALHISDVVRSKLATSDTKIEPPIE 309

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSID 228
           NQ  DV V D DVC SIV +QP+  NLG  S VHCSN+H P+LMER  SART+EW+DSID
Sbjct: 310 NQHEDVEVQDPDVCLSIVPFQPNDVNLGKKSSVHCSNIHQPSLMERNRSARTFEWEDSID 369

Query: 227 NLXXXXXXXXXXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRN 48
           N            KW  LEEETLRAGVKMFGEGNWATIR+FYSNIFE RSGVDLKDKWRN
Sbjct: 370 NSQQARQPRRRKRKWSSLEEETLRAGVKMFGEGNWATIRSFYSNIFENRSGVDLKDKWRN 429

Query: 47  MIR 39
           MIR
Sbjct: 430 MIR 432



 Score =  109 bits (272), Expect = 9e-21
 Identities = 83/212 (39%), Positives = 100/212 (47%), Gaps = 2/212 (0%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            M+ DIS WV+EFLLRSSVPDSLIQKTLT LPLS A+P                +A+LSET
Sbjct: 1    MNSDISRWVMEFLLRSSVPDSLIQKTLTALPLSPAEPRLKKNLLLRTLQTLLRRATLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I      +   N A                       A SP              W 
Sbjct: 61   ALDILEDLSPISNVNTA---------------------RTAQSP--------------W- 84

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
                  S+ +    R+  LS      G+D           ERLAGLN+RRDA+ EVR +L
Sbjct: 85   ------SAPSSTSPRALTLSMAKDALGED------SKETRERLAGLNSRRDAMNEVRVYL 132

Query: 1082 KEAWETMGPSFLDSVATMSKA--KGVCEIASG 993
            KEAWE MGPSFL++VA   K   +G C+  SG
Sbjct: 133  KEAWEMMGPSFLETVAATEKKNDEGACDNGSG 164


>ref|XP_003552913.1| PREDICTED: uncharacterized protein LOC100791258 isoform X1 [Glycine
           max] gi|947048611|gb|KRG98139.1| hypothetical protein
           GLYMA_18G052700 [Glycine max]
           gi|947048612|gb|KRG98140.1| hypothetical protein
           GLYMA_18G052700 [Glycine max]
          Length = 468

 Score =  300 bits (767), Expect = 4e-78
 Identities = 157/243 (64%), Positives = 175/243 (72%), Gaps = 2/243 (0%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCH--RGVKISGAGEVGAAKL 588
           SV A+QE  GS        +DKAI K N QLK KHSA R  H  RG+KIS   EV + K 
Sbjct: 229 SVDANQEVGGS---DLTPQRDKAILKRNPQLKHKHSAFRASHKGRGIKISSPEEVESTKP 285

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           W K+DP+PSAEV K+RESLKSSS EL+ALV DPLPD LH S++VRS+L T D    PPIE
Sbjct: 286 WRKHDPVPSAEVKKIRESLKSSSSELQALVNDPLPDALHISDVVRSKLATSDTKIEPPIE 345

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSID 228
           NQ  DV V D DVC SIV +QP+  NLG  S VHCSN+H P+LMER  SART+EW+DSID
Sbjct: 346 NQHEDVEVQDPDVCLSIVPFQPNDVNLGKKSSVHCSNIHQPSLMERNRSARTFEWEDSID 405

Query: 227 NLXXXXXXXXXXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRN 48
           N            KW  LEEETLRAGVKMFGEGNWATIR+FYSNIFE RSGVDLKDKWRN
Sbjct: 406 NSQQARQPRRRKRKWSSLEEETLRAGVKMFGEGNWATIRSFYSNIFENRSGVDLKDKWRN 465

Query: 47  MIR 39
           MIR
Sbjct: 466 MIR 468



 Score =  214 bits (544), Expect = 3e-52
 Identities = 126/214 (58%), Positives = 144/214 (67%), Gaps = 4/214 (1%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            M+ DIS WV+EFLLRSSVPDSLIQKTLT LPLS A+P                +A+LSET
Sbjct: 1    MNSDISRWVMEFLLRSSVPDSLIQKTLTALPLSPAEPRLKKNLLLRTLQTLLRRATLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I           D  PV D+ RRAYCAVAVECTVKYLAA PD   GEY  AVRRIWR
Sbjct: 61   ALDIL---------EDLAPVSDAQRRAYCAVAVECTVKYLAACPDVIDGEYAGAVRRIWR 111

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALW-DPRVS-ERLAGLNTRRDALVEVRR 1089
            GRV     AA + RRSGL+S EL RW D++E AL  D R + ERLAGLN+RRDA+ EVR 
Sbjct: 112  GRV-----AALQARRSGLVSGELVRWRDEIENALGEDSRAARERLAGLNSRRDAMNEVRV 166

Query: 1088 FLKEAWETMGPSFLDSVATMSKA--KGVCEIASG 993
            +LKEAWE MGPSFL++VA   K   +G C+  SG
Sbjct: 167  YLKEAWEMMGPSFLETVAATEKKNDEGACDNGSG 200


>gb|KRH30478.1| hypothetical protein GLYMA_11G186600 [Glycine max]
           gi|947081690|gb|KRH30479.1| hypothetical protein
           GLYMA_11G186600 [Glycine max]
           gi|947081691|gb|KRH30480.1| hypothetical protein
           GLYMA_11G186600 [Glycine max]
          Length = 469

 Score =  292 bits (748), Expect = 6e-76
 Identities = 155/243 (63%), Positives = 175/243 (72%), Gaps = 2/243 (0%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCHRG--VKISGAGEVGAAKL 588
           SV A+QE  G         +DKAI K N QLK KHSA R  HRG  V+IS   EV A K 
Sbjct: 230 SVDANQEVGGF---DLSPRRDKAIPKRNSQLKHKHSAFRASHRGRGVEISSPKEVKATKS 286

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           WSK+DP+PSAEV KVRESLKSSS+ELRALVKDPLP  LH S++VRS+L T D    P IE
Sbjct: 287 WSKHDPVPSAEVKKVRESLKSSSLELRALVKDPLPHALHISDVVRSKLATSDTKTEPLIE 346

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSID 228
           NQ  DV V D DVC+SIV +QP+  NLG  S VHCSN+H P LME+  SART+EW+DS+D
Sbjct: 347 NQHEDVEVQDPDVCQSIVPFQPNDVNLGKKSFVHCSNIHQPYLMEQNISARTFEWEDSVD 406

Query: 227 NLXXXXXXXXXXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRN 48
           N            KW  LEEETLRAGVKMFGEGNWA+IR+FYSN+FE RSGVDLKDKWRN
Sbjct: 407 NSPQARQPRRRKRKWSSLEEETLRAGVKMFGEGNWASIRSFYSNVFENRSGVDLKDKWRN 466

Query: 47  MIR 39
           MIR
Sbjct: 467 MIR 469



 Score =  202 bits (514), Expect = 8e-49
 Identities = 116/202 (57%), Positives = 130/202 (64%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD DIS WV EFLLRSSVPDSLIQKTL  LPLS A P                 A+LSET
Sbjct: 1    MDSDISQWVTEFLLRSSVPDSLIQKTLAALPLSTASPRLKKTLLLRTLQTLLRTATLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I            + PV D+ RRAYCAVAVECTVKYLAA P+D  GEY  AVRRIWR
Sbjct: 61   ALDILELL------EPSAPVSDAHRRAYCAVAVECTVKYLAACPEDIDGEYAGAVRRIWR 114

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
            GRV+ + +     R S L+S EL+RW D VE A  D R  +RL GLN+RRDA+ EVR FL
Sbjct: 115  GRVSALKA-----RWSRLVSGELARWRDVVEDAFGDSRARQRLVGLNSRRDAMKEVRVFL 169

Query: 1082 KEAWETMGPSFLDSVATMSKAK 1017
            KEAW  MGPSFL++VA   K K
Sbjct: 170  KEAWGAMGPSFLETVAAKEKNK 191


>ref|XP_006591259.1| PREDICTED: uncharacterized protein LOC100819448 isoform X1 [Glycine
           max] gi|571489633|ref|XP_006591260.1| PREDICTED:
           uncharacterized protein LOC100819448 isoform X2 [Glycine
           max]
          Length = 507

 Score =  292 bits (748), Expect = 6e-76
 Identities = 155/243 (63%), Positives = 175/243 (72%), Gaps = 2/243 (0%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCHRG--VKISGAGEVGAAKL 588
           SV A+QE  G         +DKAI K N QLK KHSA R  HRG  V+IS   EV A K 
Sbjct: 268 SVDANQEVGGF---DLSPRRDKAIPKRNSQLKHKHSAFRASHRGRGVEISSPKEVKATKS 324

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           WSK+DP+PSAEV KVRESLKSSS+ELRALVKDPLP  LH S++VRS+L T D    P IE
Sbjct: 325 WSKHDPVPSAEVKKVRESLKSSSLELRALVKDPLPHALHISDVVRSKLATSDTKTEPLIE 384

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSID 228
           NQ  DV V D DVC+SIV +QP+  NLG  S VHCSN+H P LME+  SART+EW+DS+D
Sbjct: 385 NQHEDVEVQDPDVCQSIVPFQPNDVNLGKKSFVHCSNIHQPYLMEQNISARTFEWEDSVD 444

Query: 227 NLXXXXXXXXXXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRN 48
           N            KW  LEEETLRAGVKMFGEGNWA+IR+FYSN+FE RSGVDLKDKWRN
Sbjct: 445 NSPQARQPRRRKRKWSSLEEETLRAGVKMFGEGNWASIRSFYSNVFENRSGVDLKDKWRN 504

Query: 47  MIR 39
           MIR
Sbjct: 505 MIR 507



 Score =  204 bits (520), Expect = 2e-49
 Identities = 117/204 (57%), Positives = 132/204 (64%)
 Frame = -1

Query: 1628 EKMDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLS 1449
            E+MD DIS WV EFLLRSSVPDSLIQKTL  LPLS A P                 A+LS
Sbjct: 37   EEMDSDISQWVTEFLLRSSVPDSLIQKTLAALPLSTASPRLKKTLLLRTLQTLLRTATLS 96

Query: 1448 ETALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRI 1269
            ETAL I            + PV D+ RRAYCAVAVECTVKYLAA P+D  GEY  AVRRI
Sbjct: 97   ETALDILELL------EPSAPVSDAHRRAYCAVAVECTVKYLAACPEDIDGEYAGAVRRI 150

Query: 1268 WRGRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRR 1089
            WRGRV+ + +     R S L+S EL+RW D VE A  D R  +RL GLN+RRDA+ EVR 
Sbjct: 151  WRGRVSALKA-----RWSRLVSGELARWRDVVEDAFGDSRARQRLVGLNSRRDAMKEVRV 205

Query: 1088 FLKEAWETMGPSFLDSVATMSKAK 1017
            FLKEAW  MGPSFL++VA   K K
Sbjct: 206  FLKEAWGAMGPSFLETVAAKEKNK 229


>gb|KHN11870.1| Telomeric repeat-binding factor 1 [Glycine soja]
          Length = 308

 Score =  291 bits (744), Expect = 2e-75
 Identities = 154/243 (63%), Positives = 175/243 (72%), Gaps = 2/243 (0%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCHRG--VKISGAGEVGAAKL 588
           SV A+QE  G         +D+AI K N QLK KHSA R  HRG  V+IS   EV A K 
Sbjct: 69  SVDANQEVGGF---DLSPRRDEAIPKRNSQLKHKHSAFRASHRGRGVEISSPKEVKATKS 125

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           WSK+DP+PSAEV KVRESLKSSS+ELRALVKDPLP  LH S++VRS+L T D    P IE
Sbjct: 126 WSKHDPVPSAEVKKVRESLKSSSLELRALVKDPLPHALHISDVVRSKLATSDTKTEPLIE 185

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSID 228
           NQ  DV V D DVC+SIV +QP+  NLG  S VHCSN+H P LME+  SART+EW+DS+D
Sbjct: 186 NQHEDVEVQDPDVCQSIVPFQPNDVNLGKKSFVHCSNIHQPYLMEQNISARTFEWEDSVD 245

Query: 227 NLXXXXXXXXXXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRN 48
           N            KW  LEEETLRAGVKMFGEGNWA+IR+FYSN+FE RSGVDLKDKWRN
Sbjct: 246 NSPQARQPRRRKRKWSSLEEETLRAGVKMFGEGNWASIRSFYSNVFENRSGVDLKDKWRN 305

Query: 47  MIR 39
           MIR
Sbjct: 306 MIR 308


>ref|XP_006602062.1| PREDICTED: uncharacterized protein LOC100791258 isoform X2 [Glycine
           max]
          Length = 479

 Score =  276 bits (705), Expect = 5e-71
 Identities = 146/232 (62%), Positives = 164/232 (70%), Gaps = 2/232 (0%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCH--RGVKISGAGEVGAAKL 588
           SV A+QE  GS        +DKAI K N QLK KHSA R  H  RG+KIS   EV + K 
Sbjct: 229 SVDANQEVGGS---DLTPQRDKAILKRNPQLKHKHSAFRASHKGRGIKISSPEEVESTKP 285

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           W K+DP+PSAEV K+RESLKSSS EL+ALV DPLPD LH S++VRS+L T D    PPIE
Sbjct: 286 WRKHDPVPSAEVKKIRESLKSSSSELQALVNDPLPDALHISDVVRSKLATSDTKIEPPIE 345

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSID 228
           NQ  DV V D DVC SIV +QP+  NLG  S VHCSN+H P+LMER  SART+EW+DSID
Sbjct: 346 NQHEDVEVQDPDVCLSIVPFQPNDVNLGKKSSVHCSNIHQPSLMERNRSARTFEWEDSID 405

Query: 227 NLXXXXXXXXXXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGV 72
           N            KW  LEEETLRAGVKMFGEGNWATIR+FYSNIFE RSGV
Sbjct: 406 NSQQARQPRRRKRKWSSLEEETLRAGVKMFGEGNWATIRSFYSNIFENRSGV 457



 Score =  214 bits (544), Expect = 3e-52
 Identities = 126/214 (58%), Positives = 144/214 (67%), Gaps = 4/214 (1%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            M+ DIS WV+EFLLRSSVPDSLIQKTLT LPLS A+P                +A+LSET
Sbjct: 1    MNSDISRWVMEFLLRSSVPDSLIQKTLTALPLSPAEPRLKKNLLLRTLQTLLRRATLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I           D  PV D+ RRAYCAVAVECTVKYLAA PD   GEY  AVRRIWR
Sbjct: 61   ALDIL---------EDLAPVSDAQRRAYCAVAVECTVKYLAACPDVIDGEYAGAVRRIWR 111

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALW-DPRVS-ERLAGLNTRRDALVEVRR 1089
            GRV     AA + RRSGL+S EL RW D++E AL  D R + ERLAGLN+RRDA+ EVR 
Sbjct: 112  GRV-----AALQARRSGLVSGELVRWRDEIENALGEDSRAARERLAGLNSRRDAMNEVRV 166

Query: 1088 FLKEAWETMGPSFLDSVATMSKA--KGVCEIASG 993
            +LKEAWE MGPSFL++VA   K   +G C+  SG
Sbjct: 167  YLKEAWEMMGPSFLETVAATEKKNDEGACDNGSG 200


>ref|XP_013460445.1| myb-like DNA-binding domain protein [Medicago truncatula]
            gi|657393669|gb|KEH34478.1| myb-like DNA-binding domain
            protein [Medicago truncatula]
          Length = 620

 Score =  263 bits (673), Expect = 3e-67
 Identities = 143/239 (59%), Positives = 163/239 (68%), Gaps = 15/239 (6%)
 Frame = -1

Query: 710  GDKDKAIQKGNLQLKRKHSALRTCHRGVKISGAGEVGAAKLWSKYDPLPSAEVNKVRESL 531
            G +DK I+K N QLKRKHSAL+T HRGVK+SG  E       +KY+ L SA+V K+RESL
Sbjct: 382  GMRDKEIRKDNGQLKRKHSALQTRHRGVKLSGDEEARPINFSTKYENLRSADVKKLRESL 441

Query: 530  KSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIENQSGDVYVPDSDVCRSIVV 351
            KSSS+EL+ALVKDPLPD LHTSE VRS+L TKDIN  P  E QS  V V DSD C++IV 
Sbjct: 442  KSSSLELKALVKDPLPDALHTSEAVRSKLATKDINRRPASEKQSEHVDVRDSDACKTIVP 501

Query: 350  YQPSGANLGINSPVHC---------------SNVHHPNLMERKSSARTYEWDDSIDNLXX 216
            YQP+ AN      V C               SN   PNLM R S A+TYEWDDSI+NL  
Sbjct: 502  YQPNDANFAKEPSVPCSSDRPNCSNGSRPNYSNDRRPNLMRRASYAQTYEWDDSIENLPQ 561

Query: 215  XXXXXXXXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR 39
                     KW  LEEETLRAGV+MFGEGNW TI +FYS IFEYR+GVDLKDKWRNM+R
Sbjct: 562  QSLPRRKKRKWTSLEEETLRAGVRMFGEGNWRTILDFYSTIFEYRNGVDLKDKWRNMMR 620



 Score =  218 bits (555), Expect = 1e-53
 Identities = 123/232 (53%), Positives = 155/232 (66%), Gaps = 4/232 (1%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSS-VPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSE 1446
            M+++IS W++EFLLR++ VPD LIQKTLT+LPLSGAD                L AS+SE
Sbjct: 1    MNENISNWIMEFLLRNTTVPDPLIQKTLTLLPLSGADSRLKKTLLLRVLQTHILNASISE 60

Query: 1445 TALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIW 1266
             +L I      + R+ D V + ++ + AYCAVAVECTVKYL  SP+DPSGEYF+AVRRIW
Sbjct: 61   ASLQILEHLEEIYRD-DGVSISNAFQSAYCAVAVECTVKYLINSPEDPSGEYFSAVRRIW 119

Query: 1265 RGRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRF 1086
            RGR           R SGL+SD   +WG+++E ALWD RV+ERL GLNTRRDA++EV+RF
Sbjct: 120  RGR-----------RGSGLVSDGFVQWGEEIEGALWDVRVAERLVGLNTRRDAVIEVKRF 168

Query: 1085 LKEAWETMGPSFLDSVATMSKAKGVCEIASGGEPVRKRDG---WLEGSGKEK 939
            LKEAW +MG SFLD +A +SK  G+C    GG      +G    LEG GK K
Sbjct: 169  LKEAWGSMGDSFLDLIAMVSKGNGLC---PGGVCENAAEGSRRLLEGLGKAK 217


>gb|KRG98141.1| hypothetical protein GLYMA_18G052700 [Glycine max]
          Length = 416

 Score =  214 bits (544), Expect = 3e-52
 Identities = 126/214 (58%), Positives = 144/214 (67%), Gaps = 4/214 (1%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            M+ DIS WV+EFLLRSSVPDSLIQKTLT LPLS A+P                +A+LSET
Sbjct: 1    MNSDISRWVMEFLLRSSVPDSLIQKTLTALPLSPAEPRLKKNLLLRTLQTLLRRATLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I           D  PV D+ RRAYCAVAVECTVKYLAA PD   GEY  AVRRIWR
Sbjct: 61   ALDIL---------EDLAPVSDAQRRAYCAVAVECTVKYLAACPDVIDGEYAGAVRRIWR 111

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALW-DPRVS-ERLAGLNTRRDALVEVRR 1089
            GRV     AA + RRSGL+S EL RW D++E AL  D R + ERLAGLN+RRDA+ EVR 
Sbjct: 112  GRV-----AALQARRSGLVSGELVRWRDEIENALGEDSRAARERLAGLNSRRDAMNEVRV 166

Query: 1088 FLKEAWETMGPSFLDSVATMSKA--KGVCEIASG 993
            +LKEAWE MGPSFL++VA   K   +G C+  SG
Sbjct: 167  YLKEAWEMMGPSFLETVAATEKKNDEGACDNGSG 200



 Score =  197 bits (501), Expect = 2e-47
 Identities = 105/175 (60%), Positives = 122/175 (69%), Gaps = 2/175 (1%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCH--RGVKISGAGEVGAAKL 588
           SV A+QE  GS        +DKAI K N QLK KHSA R  H  RG+KIS   EV + K 
Sbjct: 229 SVDANQEVGGS---DLTPQRDKAILKRNPQLKHKHSAFRASHKGRGIKISSPEEVESTKP 285

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           W K+DP+PSAEV K+RESLKSSS EL+ALV DPLPD LH S++VRS+L T D    PPIE
Sbjct: 286 WRKHDPVPSAEVKKIRESLKSSSSELQALVNDPLPDALHISDVVRSKLATSDTKIEPPIE 345

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYEW 243
           NQ  DV V D DVC SIV +QP+  NLG  S VHCSN+H P+LMER  SART+E+
Sbjct: 346 NQHEDVEVQDPDVCLSIVPFQPNDVNLGKKSSVHCSNIHQPSLMERNRSARTFEF 400


>ref|XP_006602063.1| PREDICTED: uncharacterized protein LOC100791258 isoform X3 [Glycine
            max]
          Length = 405

 Score =  214 bits (544), Expect = 3e-52
 Identities = 126/214 (58%), Positives = 144/214 (67%), Gaps = 4/214 (1%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            M+ DIS WV+EFLLRSSVPDSLIQKTLT LPLS A+P                +A+LSET
Sbjct: 1    MNSDISRWVMEFLLRSSVPDSLIQKTLTALPLSPAEPRLKKNLLLRTLQTLLRRATLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I           D  PV D+ RRAYCAVAVECTVKYLAA PD   GEY  AVRRIWR
Sbjct: 61   ALDIL---------EDLAPVSDAQRRAYCAVAVECTVKYLAACPDVIDGEYAGAVRRIWR 111

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALW-DPRVS-ERLAGLNTRRDALVEVRR 1089
            GRV     AA + RRSGL+S EL RW D++E AL  D R + ERLAGLN+RRDA+ EVR 
Sbjct: 112  GRV-----AALQARRSGLVSGELVRWRDEIENALGEDSRAARERLAGLNSRRDAMNEVRV 166

Query: 1088 FLKEAWETMGPSFLDSVATMSKA--KGVCEIASG 993
            +LKEAWE MGPSFL++VA   K   +G C+  SG
Sbjct: 167  YLKEAWEMMGPSFLETVAATEKKNDEGACDNGSG 200



 Score =  197 bits (500), Expect = 3e-47
 Identities = 105/174 (60%), Positives = 121/174 (69%), Gaps = 2/174 (1%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCH--RGVKISGAGEVGAAKL 588
           SV A+QE  GS        +DKAI K N QLK KHSA R  H  RG+KIS   EV + K 
Sbjct: 229 SVDANQEVGGS---DLTPQRDKAILKRNPQLKHKHSAFRASHKGRGIKISSPEEVESTKP 285

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           W K+DP+PSAEV K+RESLKSSS EL+ALV DPLPD LH S++VRS+L T D    PPIE
Sbjct: 286 WRKHDPVPSAEVKKIRESLKSSSSELQALVNDPLPDALHISDVVRSKLATSDTKIEPPIE 345

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYE 246
           NQ  DV V D DVC SIV +QP+  NLG  S VHCSN+H P+LMER  SART+E
Sbjct: 346 NQHEDVEVQDPDVCLSIVPFQPNDVNLGKKSSVHCSNIHQPSLMERNRSARTFE 399


>gb|KRH30481.1| hypothetical protein GLYMA_11G186600 [Glycine max]
          Length = 406

 Score =  202 bits (514), Expect = 8e-49
 Identities = 116/202 (57%), Positives = 130/202 (64%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD DIS WV EFLLRSSVPDSLIQKTL  LPLS A P                 A+LSET
Sbjct: 1    MDSDISQWVTEFLLRSSVPDSLIQKTLAALPLSTASPRLKKTLLLRTLQTLLRTATLSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
            AL I            + PV D+ RRAYCAVAVECTVKYLAA P+D  GEY  AVRRIWR
Sbjct: 61   ALDILELL------EPSAPVSDAHRRAYCAVAVECTVKYLAACPEDIDGEYAGAVRRIWR 114

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
            GRV+ + +     R S L+S EL+RW D VE A  D R  +RL GLN+RRDA+ EVR FL
Sbjct: 115  GRVSALKA-----RWSRLVSGELARWRDVVEDAFGDSRARQRLVGLNSRRDAMKEVRVFL 169

Query: 1082 KEAWETMGPSFLDSVATMSKAK 1017
            KEAW  MGPSFL++VA   K K
Sbjct: 170  KEAWGAMGPSFLETVAAKEKNK 191



 Score =  192 bits (487), Expect = 1e-45
 Identities = 106/174 (60%), Positives = 121/174 (69%), Gaps = 2/174 (1%)
 Frame = -1

Query: 761 SVGASQEASGSVCDSAKGDKDKAIQKGNLQLKRKHSALRTCHRG--VKISGAGEVGAAKL 588
           SV A+QE  G         +DKAI K N QLK KHSA R  HRG  V+IS   EV A K 
Sbjct: 230 SVDANQEVGGF---DLSPRRDKAIPKRNSQLKHKHSAFRASHRGRGVEISSPKEVKATKS 286

Query: 587 WSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIE 408
           WSK+DP+PSAEV KVRESLKSSS+ELRALVKDPLP  LH S++VRS+L T D    P IE
Sbjct: 287 WSKHDPVPSAEVKKVRESLKSSSLELRALVKDPLPHALHISDVVRSKLATSDTKTEPLIE 346

Query: 407 NQSGDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSSARTYE 246
           NQ  DV V D DVC+SIV +QP+  NLG  S VHCSN+H P LME+  SART+E
Sbjct: 347 NQHEDVEVQDPDVCQSIVPFQPNDVNLGKKSFVHCSNIHQPYLMEQNISARTFE 400


>ref|XP_002276395.2| PREDICTED: uncharacterized protein LOC100244907 [Vitis vinifera]
            gi|297745761|emb|CBI15817.3| unnamed protein product
            [Vitis vinifera]
          Length = 479

 Score =  169 bits (429), Expect = 6e-39
 Identities = 90/200 (45%), Positives = 124/200 (62%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD+D+S W+LEF++R  + DSL+++ +++LPLS + P                  S+SET
Sbjct: 1    MDEDVSRWILEFMIRKPIGDSLVRRLISILPLSNSHPRMKKTVLLRKIESEISDGSVSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
             L +      LD   + V V DSM+ AYCAVAVECTVK+L  S     G+YF AV+RIWR
Sbjct: 61   ILELLEIIEELDYK-EGVAVLDSMKNAYCAVAVECTVKFLVGS-GGKEGKYFDAVKRIWR 118

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
            G++ +M S+A     +GL+SD+L +W DD+EAA+WD RV E +   NTR DAL  VR ++
Sbjct: 119  GKIHKMESSAT----AGLVSDQLRKWRDDIEAAVWDARVCEDILAKNTRNDALRLVRAYV 174

Query: 1082 KEAWETMGPSFLDSVATMSK 1023
             EAW  MGP FL+  A   K
Sbjct: 175  AEAWAIMGPPFLELAARAIK 194



 Score =  166 bits (420), Expect = 6e-38
 Identities = 113/281 (40%), Positives = 144/281 (51%), Gaps = 42/281 (14%)
 Frame = -1

Query: 755  GASQEASGSVCDSAKG------------DKDKAIQKGNLQLKRKHSAL--RTCHRGVKIS 618
            G     +GS C+ A              DKDK   K ++  KRKH     R    GVKI+
Sbjct: 198  GLPGAGNGSTCNQAAACSPNVATDLVVPDKDKETLKASMLPKRKHVGGHGRRSRGGVKIT 257

Query: 617  GAGEVGAAKLWSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLET 438
               EV      SKYD LPS EV++V+ +LKSSS+EL+ALVKDPLP+ L  +E V S L  
Sbjct: 258  DTEEVRGQTSGSKYDCLPSPEVDRVQAALKSSSLELQALVKDPLPEALQLAEAVISGLAK 317

Query: 437  KDINFGPPIENQS-GDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSS 261
            KD+N  P  ++Q   DV  P+  V +++V  Q + A+ G       +NV  P+LM R  +
Sbjct: 318  KDVNHEPLTKDQGIIDVAAPNPSVGKNLVADQTNEADSGHQCTTDQNNVPRPSLMARNGT 377

Query: 260  ARTYEWDDSID----------NL-----------------XXXXXXXXXXXKWMPLEEET 162
            ART EWDDSID          N+                            KW  LEE+T
Sbjct: 378  ARTCEWDDSIDASPEGLSSDTNICLPSPKRKAVSPLKKYEITKLAKRRQMKKWSILEEDT 437

Query: 161  LRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR 39
            LR GV  FG+GNW  I N Y +IFE R+ VDLKDKWRNM +
Sbjct: 438  LRTGVLKFGKGNWTLILNCYRDIFEERTQVDLKDKWRNMTK 478


>emb|CAN65086.1| hypothetical protein VITISV_035031 [Vitis vinifera]
          Length = 444

 Score =  169 bits (429), Expect = 6e-39
 Identities = 90/200 (45%), Positives = 124/200 (62%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD+D+S W+LEF++R  + DSL+++ +++LPLS + P                  S+SET
Sbjct: 1    MDEDVSRWILEFMIRKPIGDSLVRRLISILPLSNSHPRMKKTVLLRKIESEISDGSVSET 60

Query: 1442 ALHIXXXXXXLDRNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIWR 1263
             L +      LD   + V V DSM+ AYCAVAVECTVK+L  S     G+YF AV+RIWR
Sbjct: 61   ILELLEIIEELDYK-EGVAVLDSMKNAYCAVAVECTVKFLVGS-GGKEGKYFDAVKRIWR 118

Query: 1262 GRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRFL 1083
            G++ +M S+A     +GL+SD+L +W DD+EAA+WD RV E +   NTR DAL  VR ++
Sbjct: 119  GKIHKMESSAT----AGLVSDQLRKWRDDIEAAVWDARVCEDILAKNTRNDALRLVRAYV 174

Query: 1082 KEAWETMGPSFLDSVATMSK 1023
             EAW  MGP FL+  A   K
Sbjct: 175  AEAWAIMGPPFLELAARAIK 194



 Score =  122 bits (306), Expect = 1e-24
 Identities = 80/191 (41%), Positives = 105/191 (54%), Gaps = 15/191 (7%)
 Frame = -1

Query: 755 GASQEASGSVCDSAKG------------DKDKAIQKGNLQLKRKHSAL--RTCHRGVKIS 618
           G     +GS C+ A              DKDK   K ++  KRKH     R    GVKI+
Sbjct: 198 GLPGAGNGSTCNQAAACSPNVATDLVVPDKDKETLKASMLPKRKHVGGHGRRSRGGVKIT 257

Query: 617 GAGEVGAAKLWSKYDPLPSAEVNKVRESLKSSSMELRALVKDPLPDVLHTSEIVRSQLET 438
              EV      SKYD LPS EV++V+ +LKSSS+EL+ALVKDPLP+ L  +E V S L  
Sbjct: 258 DTEEVRGQTSGSKYDCLPSPEVDRVQAALKSSSLELQALVKDPLPEALQLAEAVISGLAK 317

Query: 437 KDINFGPPIENQS-GDVYVPDSDVCRSIVVYQPSGANLGINSPVHCSNVHHPNLMERKSS 261
           KD+N  P  ++Q   DV  P+  V +++V  Q + A+ G       +NV  P+LM R  +
Sbjct: 318 KDVNHEPLTKDQGIIDVAAPNPSVGKNLVADQTNEADSGHQCTTDQNNVPRPSLMARNGT 377

Query: 260 ARTYEWDDSID 228
           ART EWDDSID
Sbjct: 378 ARTCEWDDSID 388


>ref|XP_010108474.1| hypothetical protein L484_010667 [Morus notabilis]
            gi|587932488|gb|EXC19536.1| hypothetical protein
            L484_010667 [Morus notabilis]
          Length = 587

 Score =  151 bits (382), Expect = 2e-33
 Identities = 90/225 (40%), Positives = 127/225 (56%), Gaps = 2/225 (0%)
 Frame = -1

Query: 1631 MEKMDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASL 1452
            M++ DK +S W+LEFLLR+S  +   ++ L VLP+   DP                +  +
Sbjct: 1    MDEPDKTLSRWLLEFLLRNSDDEPFAKRALAVLPIPDDDPRLKKTVLLRTIEYEVSEGLV 60

Query: 1451 SETALHIXXXXXXLD--RNNDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAV 1278
            + TAL        LD  R + A    DSM+ AYCAVA++CTV+ L  +   P G++  AV
Sbjct: 61   TSTALENLELIEELDSRRGSAAAAAGDSMKAAYCAVALDCTVRVLVGNGGKPGGKFLNAV 120

Query: 1277 RRIWRGRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVE 1098
            +RIWRGRV +M  +A   R S LLS +L R  D+VEAA+WD  V  +L  +NTR+DAL+ 
Sbjct: 121  KRIWRGRVGRMEKSAA-ARESRLLSSDLRRCWDEVEAAIWDEGVCRKLMRINTRKDALML 179

Query: 1097 VRRFLKEAWETMGPSFLDSVATMSKAKGVCEIASGGEPVRKRDGW 963
            +  +LKEAW  MGPSF+   A +S  +   E  +G E   +  GW
Sbjct: 180  LGAYLKEAWALMGPSFVAWAARLSAKQRFREDGNGEE--SRGRGW 222



 Score =  123 bits (309), Expect = 5e-25
 Identities = 82/239 (34%), Positives = 119/239 (49%), Gaps = 32/239 (13%)
 Frame = -1

Query: 692  IQKGNLQLKRKHSALRTCHRG-VKISGAGEVGAAKLWSKYDPLPSAEVNKVRESLKSSSM 516
            + +  + L+ KH   +   RG V+IS   ++       +++ +P+ EVN+  E+LKSSS+
Sbjct: 303  VPREKVVLRSKHVGFQKRIRGPVRISDVEDLETDASPRRFNSIPTPEVNEAHEALKSSSL 362

Query: 515  ELRALVKDPLPDVLHTSEIVRSQLETKDINFGPPIENQ---SGDVYVPDSDVCRSIVVYQ 345
            +L+A V DPLP+ +  +E V S L TK++    P+ENQ     D   P    C   V  Q
Sbjct: 363  DLQAAVTDPLPEAVREAETVVSDLTTKNVIHEHPLENQRRTEADAANPSIHTCTEPV--Q 420

Query: 344  PSGANLGINSPVHCSNVHHPNLMERKSSARTYEWDDSIDNL------------------- 222
                NLG  S  + +NV  P+LMER ++A TYEWDDSID+                    
Sbjct: 421  SCDVNLGNPSSSYPNNVPRPSLMERNNTAHTYEWDDSIDDSPEAMVNCKDRLHLPSPKKR 480

Query: 221  ---------XXXXXXXXXXXKWMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGV 72
                                +W  LEE+TLR GV+ +G+GNW  I   YS+IFE R+ V
Sbjct: 481  ALSPLKKYEPTKLCKRRKVKRWTLLEEDTLRDGVQKYGKGNWKLILKLYSDIFEERTEV 539


>gb|KNA10403.1| hypothetical protein SOVF_144720 isoform C [Spinacia oleracea]
          Length = 720

 Score =  149 bits (377), Expect = 6e-33
 Identities = 83/197 (42%), Positives = 120/197 (60%), Gaps = 1/197 (0%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD+ ++ W++EFLLR  + + +I   L+ LPLS  D                   S+SE 
Sbjct: 1    MDEKVAKWIIEFLLRKPIDEKIINGVLSCLPLSNNDKRLKKTILLRKIESAISDGSVSEK 60

Query: 1442 ALHIXXXXXXLDRN-NDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIW 1266
             L +      L+      VP  DSM+RAYCAVAV+CTV++L  + ++ +G+YF AV R+W
Sbjct: 61   LLELLEMVDELNHGAGKTVP--DSMKRAYCAVAVDCTVRFLEENVEE-NGKYFEAVERVW 117

Query: 1265 RGRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRF 1086
            R RV ++ S         L+S+EL+RW +D+EAA+WD  V E++   NTR +AL  V+R+
Sbjct: 118  RDRVCKVES---------LVSEELNRWLEDIEAAVWDSSVCEKILMRNTRNEALRAVKRY 168

Query: 1085 LKEAWETMGPSFLDSVA 1035
            LKEAWE+MGPSFL+ VA
Sbjct: 169  LKEAWESMGPSFLELVA 185



 Score = 63.5 bits (153), Expect = 6e-07
 Identities = 40/112 (35%), Positives = 49/112 (43%), Gaps = 28/112 (25%)
 Frame = -1

Query: 284 NLMERKSSARTYEWDDSIDNLXXXXXXXXXXXK--------------------------- 186
           +LME   ++ T+EWDDSIDN+                                       
Sbjct: 609 SLMEANGTSHTFEWDDSIDNVNRGSELHLPPSPKRSKVTPLEQYEDKPKPKPQPLIKGRK 668

Query: 185 -WMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR*Q 33
            W   EE+TLR  VK  G  NW  I   + + FE RS VDLKDKWRNM+R Q
Sbjct: 669 KWSVEEEDTLRVAVKELGR-NWKLILQCHGDAFEGRSNVDLKDKWRNMLRSQ 719


>gb|KNA10402.1| hypothetical protein SOVF_144720 isoform B [Spinacia oleracea]
          Length = 719

 Score =  149 bits (377), Expect = 6e-33
 Identities = 83/197 (42%), Positives = 120/197 (60%), Gaps = 1/197 (0%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD+ ++ W++EFLLR  + + +I   L+ LPLS  D                   S+SE 
Sbjct: 1    MDEKVAKWIIEFLLRKPIDEKIINGVLSCLPLSNNDKRLKKTILLRKIESAISDGSVSEK 60

Query: 1442 ALHIXXXXXXLDRN-NDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIW 1266
             L +      L+      VP  DSM+RAYCAVAV+CTV++L  + ++ +G+YF AV R+W
Sbjct: 61   LLELLEMVDELNHGAGKTVP--DSMKRAYCAVAVDCTVRFLEENVEE-NGKYFEAVERVW 117

Query: 1265 RGRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRF 1086
            R RV ++ S         L+S+EL+RW +D+EAA+WD  V E++   NTR +AL  V+R+
Sbjct: 118  RDRVCKVES---------LVSEELNRWLEDIEAAVWDSSVCEKILMRNTRNEALRAVKRY 168

Query: 1085 LKEAWETMGPSFLDSVA 1035
            LKEAWE+MGPSFL+ VA
Sbjct: 169  LKEAWESMGPSFLELVA 185



 Score = 63.5 bits (153), Expect = 6e-07
 Identities = 40/112 (35%), Positives = 49/112 (43%), Gaps = 28/112 (25%)
 Frame = -1

Query: 284 NLMERKSSARTYEWDDSIDNLXXXXXXXXXXXK--------------------------- 186
           +LME   ++ T+EWDDSIDN+                                       
Sbjct: 608 SLMEANGTSHTFEWDDSIDNVNRGSELHLPPSPKRSKVTPLEQYEDKPKPKPQPLIKGRK 667

Query: 185 -WMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR*Q 33
            W   EE+TLR  VK  G  NW  I   + + FE RS VDLKDKWRNM+R Q
Sbjct: 668 KWSVEEEDTLRVAVKELGR-NWKLILQCHGDAFEGRSNVDLKDKWRNMLRSQ 718


>gb|KNA10401.1| hypothetical protein SOVF_144720 isoform A [Spinacia oleracea]
          Length = 704

 Score =  149 bits (377), Expect = 6e-33
 Identities = 83/197 (42%), Positives = 120/197 (60%), Gaps = 1/197 (0%)
 Frame = -1

Query: 1622 MDKDISGWVLEFLLRSSVPDSLIQKTLTVLPLSGADPXXXXXXXXXXXXXXXLKASLSET 1443
            MD+ ++ W++EFLLR  + + +I   L+ LPLS  D                   S+SE 
Sbjct: 1    MDEKVAKWIIEFLLRKPIDEKIINGVLSCLPLSNNDKRLKKTILLRKIESAISDGSVSEK 60

Query: 1442 ALHIXXXXXXLDRN-NDAVPVFDSMRRAYCAVAVECTVKYLAASPDDPSGEYFAAVRRIW 1266
             L +      L+      VP  DSM+RAYCAVAV+CTV++L  + ++ +G+YF AV R+W
Sbjct: 61   LLELLEMVDELNHGAGKTVP--DSMKRAYCAVAVDCTVRFLEENVEE-NGKYFEAVERVW 117

Query: 1265 RGRVTQMSSAAEEGRRSGLLSDELSRWGDDVEAALWDPRVSERLAGLNTRRDALVEVRRF 1086
            R RV ++ S         L+S+EL+RW +D+EAA+WD  V E++   NTR +AL  V+R+
Sbjct: 118  RDRVCKVES---------LVSEELNRWLEDIEAAVWDSSVCEKILMRNTRNEALRAVKRY 168

Query: 1085 LKEAWETMGPSFLDSVA 1035
            LKEAWE+MGPSFL+ VA
Sbjct: 169  LKEAWESMGPSFLELVA 185



 Score = 63.5 bits (153), Expect = 6e-07
 Identities = 40/112 (35%), Positives = 49/112 (43%), Gaps = 28/112 (25%)
 Frame = -1

Query: 284 NLMERKSSARTYEWDDSIDNLXXXXXXXXXXXK--------------------------- 186
           +LME   ++ T+EWDDSIDN+                                       
Sbjct: 593 SLMEANGTSHTFEWDDSIDNVNRGSELHLPPSPKRSKVTPLEQYEDKPKPKPQPLIKGRK 652

Query: 185 -WMPLEEETLRAGVKMFGEGNWATIRNFYSNIFEYRSGVDLKDKWRNMIR*Q 33
            W   EE+TLR  VK  G  NW  I   + + FE RS VDLKDKWRNM+R Q
Sbjct: 653 KWSVEEEDTLRVAVKELGR-NWKLILQCHGDAFEGRSNVDLKDKWRNMLRSQ 703

