BLASTX nr result

ID: Sinomenium21_contig00015462

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00015462
         (1831 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007047984.1| VQ motif-containing protein [Theobroma cacao...   215   6e-53
emb|CAN62228.1| hypothetical protein VITISV_008028 [Vitis vinifera]   210   2e-51
emb|CAN67718.1| hypothetical protein VITISV_002357 [Vitis vinifera]   206   2e-50
ref|XP_007018802.1| VQ motif-containing protein [Theobroma cacao...   201   1e-48
ref|XP_002301045.1| VQ motif-containing family protein [Populus ...   197   1e-47
ref|XP_002534310.1| conserved hypothetical protein [Ricinus comm...   186   2e-44
ref|XP_002307093.1| VQ motif-containing family protein [Populus ...   186   3e-44
ref|XP_002310570.2| VQ motif-containing family protein [Populus ...   177   1e-41
ref|XP_002307385.1| VQ motif-containing family protein [Populus ...   177   1e-41
ref|XP_002513906.1| conserved hypothetical protein [Ricinus comm...   174   1e-40
gb|EXB29424.1| hypothetical protein L484_022090 [Morus notabilis]     154   1e-34
ref|XP_006428174.1| hypothetical protein CICLE_v10025465mg [Citr...   145   8e-32
ref|XP_006434009.1| hypothetical protein CICLE_v10001250mg [Citr...   142   4e-31
ref|XP_006472623.1| PREDICTED: uncharacterized serine-rich prote...   141   9e-31
ref|XP_006603093.1| PREDICTED: myb-like protein A-like [Glycine ...   140   2e-30
ref|XP_007163642.1| hypothetical protein PHAVU_001G251600g [Phas...   138   1e-29
ref|XP_006826929.1| hypothetical protein AMTR_s00010p00175790 [A...   133   2e-28
ref|XP_007159640.1| hypothetical protein PHAVU_002G254700g [Phas...   132   4e-28
ref|XP_006580869.1| PREDICTED: histone-lysine N-methyltransferas...   127   2e-26
ref|XP_006591815.1| PREDICTED: uncharacterized serine-rich prote...   126   3e-26

>ref|XP_007047984.1| VQ motif-containing protein [Theobroma cacao]
            gi|508700245|gb|EOX92141.1| VQ motif-containing protein
            [Theobroma cacao]
          Length = 472

 Score =  215 bits (547), Expect = 6e-53
 Identities = 156/349 (44%), Positives = 186/349 (53%), Gaps = 18/349 (5%)
 Frame = -2

Query: 1068 QGSVRAPLVGSASSDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPA 889
            + + ++ + G+     +NN+   RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPA
Sbjct: 145  ESATKSSISGTGDQPNNNNSNMVRNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPA 204

Query: 888  PPFSAGSPFQRSRLDLFGAGSGL------TLPPSYLLRPFAHKLQQPVPSFVSHSNIGD- 730
            PPF++ SPF R+RLDLFG  S +        PP YLLRPFA K+    P FVS S     
Sbjct: 205  PPFTS-SPFPRTRLDLFGTPSTMRSTPLDPSPPHYLLRPFAQKIHP--PPFVSSSTASSS 261

Query: 729  --HXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXX 556
                                               NYQ  SELGL KQP   L +NMQ  
Sbjct: 262  FPSSSMVDAIASTPSTNITSASASNNNTTSSSTSINYQLSSELGLLKQPQNLLNINMQN- 320

Query: 555  XXXXXXXHPSMLSFQSLLQSPNPKHTSPNVPMFGLKSQGSSLTIPSH---VELGGAS--- 394
                      +L+FQSLLQ+P PK+  PN  + G K QG SL IPS+   +++G      
Sbjct: 321  ---------PILNFQSLLQAP-PKYPLPNSTILGTKLQG-SLDIPSNDSSLKMGVLEEFG 369

Query: 393  -STHHLGHGLSNLVN--SSDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXX 223
             S  H+   LS L N  SSDG   R+    N  + G G+   +H QS LR          
Sbjct: 370  LSHGHVNTNLSGLQNMVSSDGALPRNDSSTNPPSWGEGTGSQEHDQSLLR------SING 423

Query: 222  XXXXXSQRVSTSCKMNFSASSSDFHADKGSENVPSRGEGMVGSWICPSD 76
                 SQRVS     NFSASSSDFH DKG ENV +R EGMV SWIC SD
Sbjct: 424  GYNSNSQRVSNGKVSNFSASSSDFHGDKGPENVAARSEGMVESWICSSD 472


>emb|CAN62228.1| hypothetical protein VITISV_008028 [Vitis vinifera]
          Length = 422

 Score =  210 bits (534), Expect = 2e-51
 Identities = 158/341 (46%), Positives = 183/341 (53%), Gaps = 15/341 (4%)
 Frame = -2

Query: 1053 APLVGSASSDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSA 874
            A    SAS+D  N A   RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPA PF++
Sbjct: 110  ARATASASNDQTNVA---RNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAQPFTS 166

Query: 873  GSPFQRSRLDLFGAGSGLT------LPPSYLLRPFAHKLQQPVPSFVSHSNIGDHXXXXX 712
             SPF RSRLDLFG  S +        PPSYLLRPFA KLQ   P F S            
Sbjct: 167  -SPFPRSRLDLFGTASTMRSGHLDHAPPSYLLRPFAQKLQP--PPFASPPPSSSSSFSSS 223

Query: 711  XXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXH 532
                                       NYQ PS+LGL KQP   L +N+Q          
Sbjct: 224  SMVDAIASTTNITSGSASNTSSNSTSINYQLPSDLGLVKQPQNLLNMNVQN--------- 274

Query: 531  PSMLSFQSLLQSPNPKHTSPNVPMFGLKSQGSSLTIP---SHVELGGAS----STHHLGH 373
              +LS QS LQ+P  K+  PN  + G K QG SL IP   SH+++GG      S  H+  
Sbjct: 275  -PILSIQSFLQTP-LKYPHPNSAIMGSKPQG-SLEIPSTDSHIKMGGLEDFGLSHGHVNT 331

Query: 372  GLSNLVN--SSDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQR 199
             LS L N  SSD  + RS  +     +G+GS+ G+H Q                   SQR
Sbjct: 332  HLSGLPNLVSSDRTASRSDNNPPSWNDGLGSSGGNHGQ---------LGPLNGNYNNSQR 382

Query: 198  VSTSCKMNFSASSSDFHADKGSENVPSRGEGMVGSWICPSD 76
            V T+ KMN+SASSSDFH DK  ENV +R EGMV SWIC SD
Sbjct: 383  V-TNGKMNYSASSSDFHGDKVPENVSTRSEGMVESWICSSD 422


>emb|CAN67718.1| hypothetical protein VITISV_002357 [Vitis vinifera]
          Length = 449

 Score =  206 bits (525), Expect = 2e-50
 Identities = 181/461 (39%), Positives = 216/461 (46%), Gaps = 58/461 (12%)
 Frame = -2

Query: 1284 EELDSRPDSITSFFSSSG---PCSNQQQPPP----------LYDPLSTYLDVFS------ 1162
            EE +SRP+SI +F + SG     S+  QPPP          L+DP S Y+D FS      
Sbjct: 17   EEYESRPESIPAFLNPSGHFGSVSSNPQPPPFPHHQNHPPTLFDPRSNYVDAFSQSSANP 76

Query: 1161 ----------------RPPPSXXXXXXXXXXXXXXXXXXXXXXPVVQQGSVRAPLVGSAS 1030
                            R  P+                        VQ          S++
Sbjct: 77   NANSLLNLDTVWSRGLRSEPNCTDFGNLTGLSSSSTSSSGQSMLGVQGPVHENGGRASSA 136

Query: 1029 SDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQRSR 850
            S P +     R+ KKR+RASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSA SP+ R R
Sbjct: 137  SLPSDQTNVVRSSKKRTRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSA-SPYSR-R 194

Query: 849  LDLFGAGSGL------TLPPSYLLRPFAHKLQ---------QPVPSFVSHSNIGDHXXXX 715
            LDLFGAGS +       L P Y LRP  HK+Q          P PSF  +S IGD     
Sbjct: 195  LDLFGAGSSIKPGHLEPLGPLYPLRPSPHKVQPNLFVSSSSSPSPSFF-NSTIGDSIVST 253

Query: 714  XXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXX 535
                                         YQ PS+ G  KQP QN VL MQ         
Sbjct: 254  TNIATTSTNNIITTSMAAATNAINSGSNTYQLPSDPGFPKQP-QN-VLGMQN-------- 303

Query: 534  HPSMLSFQSLLQSPNP---KHTSPNVPMFGLKSQGS-SLTIPSHVELGGASSTHHLGHGL 367
               +LSFQSLLQSP     K+   +VP+FG KS  S +L +PS  ELG      H+   +
Sbjct: 304  --PILSFQSLLQSPPSHPLKYPLADVPVFGTKSPASLTLPLPSFEELGVPHG--HVNANI 359

Query: 366  SNL-VNSSDGISLRSSGDIN---WATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQR 199
            S L  +++ G S R   D N   W  +G GSN+G   Q  LR                  
Sbjct: 360  SGLPSHATSGGSRRLRTDDNGTCW-RDGAGSNEGSREQ--LRPFNGNYGDSPQV------ 410

Query: 198  VSTSCKMNFSASSSDFHADKGSENVPSRGEGMVGSWICPSD 76
              +S K+N SASSS FH +KGS+NV SRGEG V SWICPSD
Sbjct: 411  --SSFKLNCSASSSAFHPEKGSDNVSSRGEGTVDSWICPSD 449


>ref|XP_007018802.1| VQ motif-containing protein [Theobroma cacao]
            gi|508724130|gb|EOY16027.1| VQ motif-containing protein
            [Theobroma cacao]
          Length = 551

 Score =  201 bits (510), Expect = 1e-48
 Identities = 171/431 (39%), Positives = 208/431 (48%), Gaps = 48/431 (11%)
 Frame = -2

Query: 1284 EELDSRPDSITSFFSSSG---PCSN---------QQQPPPLYDPLSTYLDVFSRPPPSXX 1141
            EE DSRP+S+ +F ++SG   P SN         Q  PP  +DP S YL+ FS+  P+  
Sbjct: 78   EEYDSRPESLPAFLNASGHFSPLSNPHPSLVSHHQDHPPTFFDPSSNYLNPFSQSQPNNS 137

Query: 1140 XXXXXXXXXXXXXXXXXXXXPV------------------VQQGSVRAPLVGSASSDP-H 1018
                                 +                  + QGS   P   S  S P H
Sbjct: 138  LLNLDGGVRPRGLRSEPNCTDLGNLPGSSSSSQSMLGAQGLNQGSF--PSSSSMQSRPAH 195

Query: 1017 NNAAAG----------RNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGS 868
            +N A            +NPKKR+RASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPFS GS
Sbjct: 196  DNGARSLAQSDQTSVVKNPKKRTRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFS-GS 254

Query: 867  PFQRSRLDLFGAGSGLT------LPPSYLLRPFAHKLQQ-PVPSFVSHSNIGDHXXXXXX 709
             + R RLDLFG+GSG+       L   Y LRP A ++Q  P  S  S S + +       
Sbjct: 255  SYSR-RLDLFGSGSGMRSSHLEPLGSLYPLRPSAKRVQPTPFVSSSSPSLLNNPLVDAAN 313

Query: 708  XXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXHP 529
                                      NYQ PS+L L KQP QN+ LN+Q           
Sbjct: 314  ITNTTSNSTIPTSIAATTNAFNPTSSNYQLPSDLSLLKQP-QNM-LNLQNQSP------- 364

Query: 528  SMLSFQSLLQSPNPKHTSPNVPMFGLKSQGSSLTIPSHVELGGASSTHHLGHGLSNLVNS 349
             +LSFQS LQ P   H S N+P FG+KSQGSS  +PS  ELG   S  H+   L  L + 
Sbjct: 365  -VLSFQSFLQPPT-LHPSLNLPGFGVKSQGSS-AMPSLDELG--MSHGHVNANLGGLQSH 419

Query: 348  SDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNFS 169
                  R+  D NW  +G+G NDG+  Q HLR                QRV+ SCK+NFS
Sbjct: 420  VTPDGPRARSDSNWR-DGIGLNDGN--QDHLRPLDGNYGNDHHNS---QRVNNSCKLNFS 473

Query: 168  ASSSDFHADKG 136
            ASSSDFH DKG
Sbjct: 474  ASSSDFHHDKG 484


>ref|XP_002301045.1| VQ motif-containing family protein [Populus trichocarpa]
            gi|222842771|gb|EEE80318.1| VQ motif-containing family
            protein [Populus trichocarpa]
          Length = 423

 Score =  197 bits (501), Expect = 1e-47
 Identities = 178/443 (40%), Positives = 210/443 (47%), Gaps = 40/443 (9%)
 Frame = -2

Query: 1284 EELDSRPDSITSFFSSS----GPCS-NQQQPPPLYDPLSTYLDVFSRPPP---------- 1150
            EE DSRP+S+ +F + S    GP   + QQP  L+DP  +   VFS+  P          
Sbjct: 17   EEYDSRPESLPAFLNPSTHNFGPSLLSHQQPVTLFDPTPSLFHVFSQSQPNPIMVQSRGL 76

Query: 1149 -SXXXXXXXXXXXXXXXXXXXXXXPVVQQGSVRAPLV---------GSASSDPHNNAAAG 1000
             S                        VQ  S   P           G  SS P ++   G
Sbjct: 77   RSDPNCTDLGINLPDSLSSSQSAVLGVQGSSQALPSSKQLRSVHDDGGRSSSPSHDQTHG 136

Query: 999  --RNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQRSRLDLFGAGS 826
              RNPKKR+RASRRAPTTVLTTDTSNFR MVQEFTGIPAPPFS GSPF R RLDLFG GS
Sbjct: 137  IARNPKKRTRASRRAPTTVLTTDTSNFRQMVQEFTGIPAPPFS-GSPFTR-RLDLFGPGS 194

Query: 825  GLT---LPPSYLLRPFAHKLQQPVPSFVSHS-------NIGDHXXXXXXXXXXXXXXXXX 676
            GL    L P Y LRP A K+      F+S S       NI                    
Sbjct: 195  GLRSGHLEPLYPLRPTAQKVHHQQTPFLSSSFPSLLNNNI---VHTTNIASTSTTANNNN 251

Query: 675  XXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXHPSMLSFQSLLQ- 499
                           NYQ P ++GLHKQ  +NL LNMQ            MLS   LL  
Sbjct: 252  TISTAATSTFNPSSLNYQLPDDIGLHKQT-RNL-LNMQN----------QMLSIHPLLHP 299

Query: 498  -SPNPKHTSPNVPMFGLKSQGSSLTIPSHVELGGASSTHHLGHGLSNLVNSSDGISLRSS 322
              P P    PNVP  G  S+ +SL +PS  ELG       +GHG  N  N S   S  ++
Sbjct: 300  PPPPPPQQLPNVPGLGANSR-ASLPLPSLEELG-------MGHGYVN-ANLSGLTSHVTT 350

Query: 321  GDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNFSASSSDFHAD 142
             ++        SNDG HH  +LR                QRV+ SCK+N+S++SSDFH +
Sbjct: 351  EEMRL------SNDGSHH--NLR-------SLNGNYGNMQRVN-SCKLNYSSASSDFHHE 394

Query: 141  KGSENVPSRG-EGMVGSWICPSD 76
            KG ENV SRG EG V SWICPS+
Sbjct: 395  KGLENVSSRGTEGTVDSWICPSE 417


>ref|XP_002534310.1| conserved hypothetical protein [Ricinus communis]
            gi|223525518|gb|EEF28072.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 498

 Score =  186 bits (473), Expect = 2e-44
 Identities = 149/370 (40%), Positives = 180/370 (48%), Gaps = 39/370 (10%)
 Frame = -2

Query: 1068 QGSVRAPLVGSASSDPHNNAAAG--RNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGI 895
            +G   A   GS     +N       RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGI
Sbjct: 154  RGPGSASASGSNGHQTNNTTTTNIVRNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGI 213

Query: 894  PAPPFSAGSPFQRSRLDLFGAGSGLTL----------PPSYLLRPFAHKLQQPVPSFVSH 745
            PAPPF++ SPF RSRLDLFG  +  +L           PSYLLRPFA K+Q P  S  S 
Sbjct: 214  PAPPFTS-SPFPRSRLDLFGTAAASSLRSVVSHLEPSHPSYLLRPFAQKIQPPFLSSSSS 272

Query: 744  SNIGDHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNM 565
            S++ D                                      S+L L K P   L +NM
Sbjct: 273  SSMIDAIASSTTTSTNINSNATTNTTTNTTSN-----------SDLSLLKYPQNLLNINM 321

Query: 564  QXXXXXXXXXHPSMLSFQSLLQSPNPKHTSPNVPMFGLKSQGSSLTIPS---HVELG--- 403
                      +  ML+F SL Q P+PK++ PN  +   K Q  SL  PS   H+++G   
Sbjct: 322  H---------NSPMLNFHSLFQ-PSPKYSLPNSSILATKPQEGSLDTPSNDPHLKMGVLE 371

Query: 402  --GASSTHHLGH--GLSNLVNSSDGISLRS------------SGDINWATEGVGSNDGDH 271
              G S  H   +  GL NLV+SSD    RS            +   NW    VGSN+GDH
Sbjct: 372  EFGLSHGHVSTNLTGLHNLVSSSDTTLRRSDHNSSSSSNNNNNNSGNWGDRRVGSNEGDH 431

Query: 270  HQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNFSASSSDF-HADKGSEN----VPSRGEG 106
                LR                + V+ + K+N+SASSSDF H DKG E       +R EG
Sbjct: 432  ---LLRSINGNYNNNNSSSNTQRVVANNGKVNYSASSSDFNHGDKGPETNVVVANTRSEG 488

Query: 105  MVGSWICPSD 76
            MV SWIC SD
Sbjct: 489  MVESWICSSD 498


>ref|XP_002307093.1| VQ motif-containing family protein [Populus trichocarpa]
            gi|222856542|gb|EEE94089.1| VQ motif-containing family
            protein [Populus trichocarpa]
          Length = 510

 Score =  186 bits (472), Expect = 3e-44
 Identities = 152/370 (41%), Positives = 185/370 (50%), Gaps = 38/370 (10%)
 Frame = -2

Query: 1071 QQGSVRAPLVGSAS--SDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTG 898
            Q+ + R P  GS S  +D  +N A  RNPKKRSRASRRAPTTVL+TDT+NFRAMVQEFTG
Sbjct: 154  QESATRVPGSGSVSGTNDQVSNTAGIRNPKKRSRASRRAPTTVLSTDTTNFRAMVQEFTG 213

Query: 897  IPAPPFSAGSPFQRSRLDLFGAGSGL----------TLPPSYLLRPFAHKLQ-QPVPSFV 751
            IPAPPF++ SPF RSRLDLFG  +              PP YLL PFA K Q  P P FV
Sbjct: 214  IPAPPFTS-SPFPRSRLDLFGTAASTLRSAVSQHLDPSPPPYLLGPFAKKFQPPPPPPFV 272

Query: 750  SHSNIGDH---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQN 580
            S  +                                       NYQ PS+LGL KQP   
Sbjct: 273  SSGSAASSFSASMVDAIASTTATNINGTCTNTTISNNIPLTSINYQLPSDLGLLKQPHNL 332

Query: 579  LVLNMQXXXXXXXXXHPSMLSFQSLLQSPNPKHTSPNVP--MFGLKSQGSSLTIP---SH 415
            L LN+Q            +L+F  LLQ+P PK+  P+ P  +   K Q  SL IP   SH
Sbjct: 333  LNLNVQN----------PILNFHPLLQAP-PKYPLPDSPNILGTTKPQQGSLEIPLNVSH 381

Query: 414  VELGGAS--STHHLGH------GLSNLVNSS----DGISLRSSGDINWAT---EGVGSND 280
            +++        +H GH      GL N+V+SS    D   +R S   N  T   +G GSN+
Sbjct: 382  LKMVVLEEFGLNH-GHVNTNLSGLQNIVSSSSPSADVTLVRRSDHSNSLTNWGDGAGSNE 440

Query: 279  GDHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNFSASSSDFHADK--GSENVPSRGEG 106
             DHH    +               S +  T+ K+NF ASSSDF  D   G ENV +R EG
Sbjct: 441  VDHHHHQQQQQQGLLRSINGDYNNSTQRVTNGKVNFLASSSDFCGDHKLGQENVATRSEG 500

Query: 105  MVGSWICPSD 76
             + SWIC SD
Sbjct: 501  TMESWICSSD 510


>ref|XP_002310570.2| VQ motif-containing family protein [Populus trichocarpa]
            gi|550334197|gb|EEE91020.2| VQ motif-containing family
            protein [Populus trichocarpa]
          Length = 527

 Score =  177 bits (449), Expect = 1e-41
 Identities = 147/364 (40%), Positives = 182/364 (50%), Gaps = 41/364 (11%)
 Frame = -2

Query: 1068 QGSVRAPLVGSASSDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPA 889
            + + R P+ G+  +D  +N A  RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPA
Sbjct: 151  ESATRGPVSGT--NDQVSNTAGVRNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPA 208

Query: 888  PPFSAGSPFQRSRLDLFGAGSGL----------TLPPSYLLRPFAHKLQ---QPVPSFVS 748
            PPF++ SPF RSRLDLFG  +              PP YLLRPFA + Q    P P F S
Sbjct: 209  PPFTS-SPFPRSRLDLFGTAASTLRSAVSHHLDPSPPPYLLRPFAQRFQPPPPPAPPFAS 267

Query: 747  HSNIGDH-----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQ 583
              +                                         NYQ PS+LGL KQP  
Sbjct: 268  SGSTASSFSTSMVDAIASTTTTNINNSGACTNSTTTSNISSTSINYQLPSDLGLLKQPHH 327

Query: 582  NLVLNMQXXXXXXXXXHPSMLSFQSLLQSPN--PKHTSPNVPMFGLKSQGSSLTIP---S 418
             L +N+Q            +L+F  L Q+P+  P   S N+       QGSSL IP   S
Sbjct: 328  LLNINVQN----------PILNFHPLFQAPHKYPLPNSTNILGTTKAQQGSSLEIPSNDS 377

Query: 417  HVELG-----GASSTHHLGH--GLSNLVNSSDGIS----LRSSGD-----INWATEGVGS 286
            H+++G     G S  H   +  GL N+V+SS   S    L   GD      NW  +GVGS
Sbjct: 378  HLKMGVLEEFGMSHGHVSTNLTGLQNIVSSSSSPSADATLMRRGDHNNNLANWG-DGVGS 436

Query: 285  NDGDHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNF-SASSSDFHAD-KGSENVPSRG 112
            N G HH  H +               +QRV T+ K+NF ++SSSDF  D KG ENV +R 
Sbjct: 437  NGGGHHH-HQQQQGLLRSINGNYNNSTQRV-TNGKVNFLASSSSDFRGDNKGQENVATRS 494

Query: 111  EGMV 100
            E +V
Sbjct: 495  EEVV 498


>ref|XP_002307385.1| VQ motif-containing family protein [Populus trichocarpa]
            gi|222856834|gb|EEE94381.1| VQ motif-containing family
            protein [Populus trichocarpa]
          Length = 437

 Score =  177 bits (449), Expect = 1e-41
 Identities = 164/453 (36%), Positives = 198/453 (43%), Gaps = 50/453 (11%)
 Frame = -2

Query: 1284 EELDSRPDSITSFFSSSGP-----CSNQQQPPPLYDPLSTYLDVFSRPPPSXXXXXXXXX 1120
            EE DSRP+S+ +F ++S         +  QP  ++DP       FS+             
Sbjct: 17   EEYDSRPESLPAFLNASSQNFDPSLFSHHQPAAIFDPSPALFHAFSQSQSITNPNSSMLN 76

Query: 1119 XXXXXXXXXXXXXPVVQQG---------SVRAPLVGSASSDP----------HNNAA--- 1006
                            + G         S  APL    SS            H+N     
Sbjct: 77   LDMVHSRGLRSEHSCTRLGINLPDSLSSSQSAPLGAQGSSQALPSSMQLRSVHDNGVRSS 136

Query: 1005 --------AGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQRSR 850
                      RNPKKR+RASRRAPTTVLTTDTSNFR MVQEFTGIPAPPF+ GS F R R
Sbjct: 137  SPSDQTHGVARNPKKRTRASRRAPTTVLTTDTSNFRQMVQEFTGIPAPPFT-GSSFTR-R 194

Query: 849  LDLFGAGSGL------TLPPSYLLRPFAHKLQQPVPSFVSHSN----IGDHXXXXXXXXX 700
            LDLFG GSGL       +   Y LRP A K+       +S S+      D          
Sbjct: 195  LDLFGPGSGLRSGHLEPIGSLYPLRPSAQKVHHQQTPLLSSSSPSFFNNDIVDGTNIAST 254

Query: 699  XXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXHPSML 520
                                   NYQ  + LGLHKQP QNL LNMQ            ML
Sbjct: 255  STTANNNNTITTATTSTFNPSSVNYQLSAHLGLHKQP-QNL-LNMQN----------QML 302

Query: 519  SFQSLLQSPNPKHTS-PNVPMFGLKSQGSSLTIPSHVELGGASSTHHLGHGLSNLVN--S 349
            S   LLQ P P   S  NVP  G KSQ +S  +PS  ELG      H+   L  L +  +
Sbjct: 303  SIHPLLQPPAPPFQSLANVPGLGAKSQ-ASFPLPSFEELGMGHGDGHVNAHLGGLTSHVT 361

Query: 348  SDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNF- 172
            ++G+ L S GD +     +  N G+                       +RV+ SCK+N+ 
Sbjct: 362  TEGMRLSSDGDQDHNLRSLDGNYGN----------------------MKRVN-SCKLNYS 398

Query: 171  SASSSDFHADKGSENVPSRG-EGMVGSWICPSD 76
            SASSS FH DK  ENV SRG EG V SWICPS+
Sbjct: 399  SASSSGFHHDKVLENVSSRGAEGTVDSWICPSE 431


>ref|XP_002513906.1| conserved hypothetical protein [Ricinus communis]
            gi|223546992|gb|EEF48489.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 446

 Score =  174 bits (441), Expect = 1e-40
 Identities = 145/342 (42%), Positives = 172/342 (50%), Gaps = 20/342 (5%)
 Frame = -2

Query: 1041 GSASSDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPF 862
            G  SS         RNPKKR+RASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFS GSP+
Sbjct: 160  GRCSSPSDQTHVVTRNPKKRTRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFS-GSPY 218

Query: 861  QRSRLDLFGA-GSGL------TLPPSYLLRPFAHKL--QQPVPSFVSHSNI--------- 736
             R RLDLFG+ GSG+       +   Y L P A K+  QQ   SF S S++         
Sbjct: 219  SRCRLDLFGSVGSGMRSSHLEQMGSLYPLHPSAQKVQHQQSPFSFSSSSSLLNTNTMVDA 278

Query: 735  GDHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELG-LHKQPGQNLVLNMQX 559
             +                                 NYQ PS+LG L KQP QN+ LNMQ 
Sbjct: 279  TNIASTTTTTNDNNTITSSSIPAGTTSTFNPSSINNYQLPSDLGQLSKQP-QNM-LNMQN 336

Query: 558  XXXXXXXXHPSMLSFQSLLQSPNPKHTSPNVPMFGLKSQGSSLTIPSHVELGGASSTHHL 379
                       MLSFQSLLQ P   H+S NV   G KSQ +S+ +PS  +L G S   +L
Sbjct: 337  ----------QMLSFQSLLQPPPLHHSSLNVHGLGAKSQ-ASMPLPSLDDL-GMSHNANL 384

Query: 378  GHGLSNLVNSSDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQR 199
                S+ V +++G+ LR++               DH                        
Sbjct: 385  SGIPSHNVTTAEGMRLRNN---------------DH------------------------ 405

Query: 198  VSTSCKMNFSASSSDF-HADKGSENVPSRGEGMVGSWICPSD 76
             + SCK N+SASSSDF H DKG E VP RGEG V SWICPS+
Sbjct: 406  -NNSCKFNYSASSSDFHHHDKGLEIVPPRGEGAVDSWICPSE 446


>gb|EXB29424.1| hypothetical protein L484_022090 [Morus notabilis]
          Length = 443

 Score =  154 bits (390), Expect = 1e-34
 Identities = 128/334 (38%), Positives = 157/334 (47%), Gaps = 20/334 (5%)
 Frame = -2

Query: 1017 NNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQRSRLDLF 838
            N  AA RNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPF++ SPF R+RLDLF
Sbjct: 168  NGGAAPRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFTS-SPFPRTRLDLF 226

Query: 837  GAGSGLTLPP-------------SY-LLRPFAHKLQQPVPSFVSHSNIGDHXXXXXXXXX 700
            G+GSG+   P             SY LLRPFA K+QQ  P FV+ S              
Sbjct: 227  GSGSGIRSAPLDPHHHHPSTGTSSYNLLRPFAQKIQQTTP-FVNTS-------------- 271

Query: 699  XXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXHPSML 520
                                        S          N +LN+Q            +L
Sbjct: 272  ---------------------------ASSSSSPSTTTSNSLLNIQTN---------PVL 295

Query: 519  SFQSLLQSPNPK-----HTSPNVPMFGLKSQGSSLTIPSHVELGGASSTHHLGHGLSNLV 355
            SF SLLQ+  PK      TS +   FGL S G  + +  + +LGG  +       ++   
Sbjct: 296  SFHSLLQNAPPKFAKMGSTSASADQFGL-SHGHHVNV--NPQLGGIPNPPTT---MATTT 349

Query: 354  NSSDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMN 175
             ++ GI+       N    G   N+ +  +  L                +  VS   K+N
Sbjct: 350  ATNWGITTDHGMGSNDNNNGNNGNNSNVDEGLLLRSINGGYTANTTAASAAAVSNGHKVN 409

Query: 174  FSASSS-DFHADKGSENVPSRGEGMVGSWICPSD 76
            +SASSS DFH  K   NV +R EGMV SWIC SD
Sbjct: 410  YSASSSTDFHGSKTEINVAARSEGMVESWICSSD 443


>ref|XP_006428174.1| hypothetical protein CICLE_v10025465mg [Citrus clementina]
            gi|568819356|ref|XP_006464221.1| PREDICTED:
            uncharacterized threonine-rich GPI-anchored glycoprotein
            PJ4664.02-like [Citrus sinensis]
            gi|557530164|gb|ESR41414.1| hypothetical protein
            CICLE_v10025465mg [Citrus clementina]
          Length = 491

 Score =  145 bits (365), Expect = 8e-32
 Identities = 135/359 (37%), Positives = 165/359 (45%), Gaps = 40/359 (11%)
 Frame = -2

Query: 1032 SSDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQRS 853
            SS+ H+     RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++ S F R+
Sbjct: 179  SSNQHSTMMV-RNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTS-SHFPRT 236

Query: 852  RLDLFG------------------AGSGLTLPPS--YLLRPFAHKLQQPVPSFVSHSNIG 733
            RLDLFG                    S L  PPS   LLRPFA KL  P+P   S+++  
Sbjct: 237  RLDLFGNSSSTSSSLMMRSTIGSHLDSSLPQPPSSYNLLRPFAQKLINPIPFSTSNNS-- 294

Query: 732  DHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLV-LNMQXX 556
                                              NYQ   +L  H+   QNL  +NMQ  
Sbjct: 295  --------SIFIDAIASASTSTPNATATTTSTNINYQ---QLPSHQ---QNLFGMNMQHN 340

Query: 555  XXXXXXXHPSMLSFQSLLQSPNPKHTSPNVPMFGLK-------SQGSSLTIPSHVELGGA 397
                      +L+  SLLQ P PK+   N P+   K        QGSSL           
Sbjct: 341  N--------PILNLHSLLQVP-PKYPLANSPILETKPPPPPPPPQGSSLEELGLSHAAAV 391

Query: 396  SSTH---HLGHGLSNLVNSSDG----ISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXX 238
            S++H   +L  GL +LV+SSD      S    G    AT  VGSN+ +   + L      
Sbjct: 392  SASHLNTNLMSGLQSLVSSSDNNNSPTSWHGHGTGTGATAAVGSNENEEATAGL------ 445

Query: 237  XXXXXXXXXXSQRVSTSCKMNFSASSSDFHADKGSENV-----PSRGEGMVGSWICPSD 76
                          +      FS +SS+FH DKG E+V      +R EGMV SWIC SD
Sbjct: 446  -------------FANGKLSRFSQASSEFHRDKGQESVNVAAATTRTEGMVESWICSSD 491


>ref|XP_006434009.1| hypothetical protein CICLE_v10001250mg [Citrus clementina]
            gi|557536131|gb|ESR47249.1| hypothetical protein
            CICLE_v10001250mg [Citrus clementina]
          Length = 426

 Score =  142 bits (359), Expect = 4e-31
 Identities = 124/340 (36%), Positives = 150/340 (44%), Gaps = 22/340 (6%)
 Frame = -2

Query: 1029 SDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQRSR 850
            +D +      +NPKKR+R SRRAPTTVLTTDTSNFRAMVQEFTGIP+ PFS GS + R R
Sbjct: 146  NDQYQTNVVVKNPKKRTRTSRRAPTTVLTTDTSNFRAMVQEFTGIPSQPFSVGSSYSR-R 204

Query: 849  LDLFGAGSGLTL-------------PPSYLLRPFAHKLQQPVPSFVSHSNIGDHXXXXXX 709
            LDLFG GS +               P +Y LRP   K Q  + S  S S+          
Sbjct: 205  LDLFGPGSAIKSSGNIHNHLEPMGGPLNYHLRPATQKFQPNLFSSPSSSS---------- 254

Query: 708  XXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLH-------KQPGQNLVLNMQXXXX 550
                                      NY   SELGL+       K+P   L   MQ    
Sbjct: 255  --------SMIDAIAAAASTSHNTTSNYHVLSELGLNSNNNNNTKEPQNTLNNIMQSQNL 306

Query: 549  XXXXXHPSMLSFQSLLQSPNPKHTSPNVPMFGLKSQGSSLTIPSHVELG-GASSTHHLGH 373
                    ++SFQS+LQ  +P +          KSQ S    PS  +     + +HH  H
Sbjct: 307  NHH----PVVSFQSILQHSSPLNNPSLTFGANNKSQASHFGAPSFEDHDHHLAMSHHASH 362

Query: 372  GLSNLVNSSDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQRVS 193
                L N+ D                  S DG+   S  R                   +
Sbjct: 363  --VGLPNNQDQFR---------------SFDGNFANSSQR-------------------A 386

Query: 192  TSCKMNFSASSSDFHADKGSENVPSRG-EGMVGSWICPSD 76
            TSCK+N+SASSSDFH +K  ENV SRG EG V SWICPSD
Sbjct: 387  TSCKLNYSASSSDFHHNKNLENVSSRGTEGTVDSWICPSD 426


>ref|XP_006472623.1| PREDICTED: uncharacterized serine-rich protein C215.13-like [Citrus
            sinensis]
          Length = 429

 Score =  141 bits (356), Expect = 9e-31
 Identities = 123/339 (36%), Positives = 151/339 (44%), Gaps = 21/339 (6%)
 Frame = -2

Query: 1029 SDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQRSR 850
            +D +      +NPKKR+R SRRAPTTVLTTDTSNFRAMVQEFTGIP+ PFS GS + R R
Sbjct: 147  NDQYQTNVVVKNPKKRTRTSRRAPTTVLTTDTSNFRAMVQEFTGIPSQPFSVGSSYSR-R 205

Query: 849  LDLFGAGSGLTL-------------PPSYLLRPFAHKLQQPVPSFVSHSNIGDHXXXXXX 709
            LDLFG GS +               P +Y LRP   K Q  + S  S S+          
Sbjct: 206  LDLFGPGSAIKSSGNIHNHLEPMGGPLNYHLRPATQKFQPNLFSSPSSSS---------- 255

Query: 708  XXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLH-------KQPGQNLVLNMQXXXX 550
                                      NY   SELGL+       K+P   L   MQ    
Sbjct: 256  ------SMIDAIAAAAAASTSHNTTSNYHVLSELGLNSNNNNNTKEPQNTLNNIMQSQNL 309

Query: 549  XXXXXHPSMLSFQSLLQSPNPKHTSPNVPMFGLKSQGSSLTIPSHVELGGASSTHHLGHG 370
                    ++SFQS+LQ  +P +          KSQ S    PS  +       HHL   
Sbjct: 310  NHH----PVVSFQSILQHSSPLNNPSLTFGANNKSQASHFGAPSFED-----HDHHLA-- 358

Query: 369  LSNLVNSSDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQRVST 190
               + +    + L ++ D         S DG+   S  R                   +T
Sbjct: 359  ---MSHHGSHVGLPNNQD------QFRSFDGNFANSSQR-------------------AT 390

Query: 189  SCKMNFSASSSDFHADKGSENVPSRG-EGMVGSWICPSD 76
            SCK+N+SASSSDFH +K  ENV SRG EG V SWICPSD
Sbjct: 391  SCKLNYSASSSDFHHNKNLENVSSRGTEGTVDSWICPSD 429


>ref|XP_006603093.1| PREDICTED: myb-like protein A-like [Glycine max]
          Length = 454

 Score =  140 bits (353), Expect = 2e-30
 Identities = 123/348 (35%), Positives = 158/348 (45%), Gaps = 40/348 (11%)
 Frame = -2

Query: 999  RNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPF--SAGSPFQRSRLDLFGAGS 826
            RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF  S+ S F R+RLDLF + +
Sbjct: 176  RNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSSSSSFPRTRLDLFASSN 235

Query: 825  GL------------TLPPSYLLRPFAHKLQQPVPSFVSHSNIGDHXXXXXXXXXXXXXXX 682
             +            T  P YLLRPFAHK+Q  +PS +   +                   
Sbjct: 236  SIASSSSSSIIREQTQTPPYLLRPFAHKVQAQLPSSIPPPS------------------- 276

Query: 681  XXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXHPSMLSFQSLL 502
                                 P  L  ++Q   N+  N              +LSFQS+L
Sbjct: 277  -------------------SFPPMLNNYQQHSLNMQQN-------------PILSFQSIL 304

Query: 501  QSPNPKHTSPNVPMFGLKSQGSSLTIP------SHVELG-----GASSTHHLGH--GLSN 361
            Q           P+ G K+Q  SL IP      SH+++G     G S+ H  GH    + 
Sbjct: 305  QPQ---------PLIGSKTQQPSLEIPPSAVDSSHLKMGGLEELGLSNAHDGGHHQNFNM 355

Query: 360  LVNSSDGISLR--------SSGDINWA---TEGVGSNDGDHHQSHLRXXXXXXXXXXXXX 214
            + +SSDG   R             +WA    + + +NDG   +S                
Sbjct: 356  VSSSSDGALSRVTNSNMRGGPSSADWALSQAQRIDNNDGGVLRS--------LGGATATL 407

Query: 213  XXSQRVSTSCKMNFSASSSDFHADKGSE-NVPSRGEGMVGSWI-CPSD 76
                 VS   ++  + ++SDFH DKG E  V +R EGMV SWI C SD
Sbjct: 408  NYRSNVSDP-RVKVTNNNSDFHGDKGPECAVAARSEGMVESWINCSSD 454


>ref|XP_007163642.1| hypothetical protein PHAVU_001G251600g [Phaseolus vulgaris]
            gi|561037106|gb|ESW35636.1| hypothetical protein
            PHAVU_001G251600g [Phaseolus vulgaris]
          Length = 479

 Score =  138 bits (347), Expect = 1e-29
 Identities = 125/350 (35%), Positives = 158/350 (45%), Gaps = 24/350 (6%)
 Frame = -2

Query: 1053 APLVGSASSDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSA 874
            +P  G   +  + N    RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++
Sbjct: 163  SPRGGFEQNSGNANTNVVRNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTS 222

Query: 873  GSPFQRSRLDLF------------GAGSGLTLP--------PSYLLRPFAHKLQQPVPSF 754
             SPF R+RLDLF             A S L  P        PSYLLRPFAHK+Q   PS 
Sbjct: 223  -SPFPRTRLDLFASSNASSSVLLRSASSHLEQPSSQTHTQTPSYLLRPFAHKVQAQ-PSS 280

Query: 753  VSHSNIGDHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLV 574
            + H+N                                    +YQ+ S L +H     N +
Sbjct: 281  IPHNN--------------SFSSMLNTLASNNNSGSGSASIHYQQHS-LNMH-----NPI 320

Query: 573  LNMQXXXXXXXXXHPSMLSFQSLLQSPNPKHTSPNVPMFGLKSQGSSLTIPSHVELGGAS 394
            L++Q           S+L      Q       +P      LK  G       H  +GG  
Sbjct: 321  LSLQ---SILGNNDSSVLVGSKTQQQQPSLEITPGTVDSHLKMSGLEELGLRHAHVGG-- 375

Query: 393  STHHLGHGLSNLV-NSSDGISLRSSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXX 217
              HH  H   N+V +SSDG   R + +I+      G +  D  Q+  R            
Sbjct: 376  --HHHHHQNMNMVSSSSDGALSRVNNNISINNNMRGPSSADWAQAQ-RIGGSNDGGVLRS 432

Query: 216  XXXSQRVSTSCKMNFSASSSDFHADKGSEN--VPSRGEGMVGSWI-CPSD 76
                    T   +N+ +S SDFH +KG+ +  V +R EGMV SWI C SD
Sbjct: 433  LSGGTATGT---LNYRSSVSDFHGEKGAPDCAVAARSEGMVESWINCSSD 479


>ref|XP_006826929.1| hypothetical protein AMTR_s00010p00175790 [Amborella trichopoda]
           gi|548831358|gb|ERM94166.1| hypothetical protein
           AMTR_s00010p00175790 [Amborella trichopoda]
          Length = 326

 Score =  133 bits (335), Expect = 2e-28
 Identities = 118/325 (36%), Positives = 145/325 (44%), Gaps = 20/325 (6%)
 Frame = -2

Query: 990 KKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQR--SRLDLFGAGSGLT 817
           KKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIP PPFS+ SPFQR  +R D  G G G  
Sbjct: 47  KKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPNPPFSS-SPFQRASTRFDFIGGGGGSR 105

Query: 816 LPPS--YLLRPFAHKLQQPVPSFVSHSNIGDHXXXXXXXXXXXXXXXXXXXXXXXXXXXX 643
             P+  +LLRPF  K   P+ S  S S                                 
Sbjct: 106 SEPAPPFLLRPFPQKPSPPLSSSNSISGSSS--------------------LNIVSSNAD 145

Query: 642 XXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXHPSMLSFQSLLQSPNPKHTSPNVP 463
               NY   S       P   L + MQ          PS ++F  +L S N K  SP  P
Sbjct: 146 IVMPNYLAASS-SSQNVPVPQLPIQMQ--------GPPSFVNFHPVL-SHNAKFMSPMAP 195

Query: 462 MFGLKSQGSSLTIPSHV-------------ELGGASSTHH--LGHGLSNLVNSSDGISLR 328
           M G  +    +   S +             ++GGAS   H   G    + V+   G    
Sbjct: 196 MPGFLAGKGQIPADSRLKSGVLEGFGSDSGQIGGASGHGHGQTGGPRPDFVSGGGG---S 252

Query: 327 SSGDINWATEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNFS-ASSSDF 151
             GD+ +         GD  +                    +  ++SCK+N+S +SSSDF
Sbjct: 253 RGGDLGYG--------GDEEEEGFMRSSSSSVAANNYFGNQR--NSSCKLNYSVSSSSDF 302

Query: 150 HADKGSENVPSRGEGMVGSWICPSD 76
           H +KGSENV  RGEGMV SWIC SD
Sbjct: 303 HVEKGSENV-GRGEGMVDSWICSSD 326


>ref|XP_007159640.1| hypothetical protein PHAVU_002G254700g [Phaseolus vulgaris]
            gi|561033055|gb|ESW31634.1| hypothetical protein
            PHAVU_002G254700g [Phaseolus vulgaris]
          Length = 493

 Score =  132 bits (333), Expect = 4e-28
 Identities = 128/369 (34%), Positives = 168/369 (45%), Gaps = 48/369 (13%)
 Frame = -2

Query: 1038 SASSDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQ 859
            + +S  +NN+   RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++ SPF 
Sbjct: 165  TTNSTNNNNSNVVRNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTS-SPFP 223

Query: 858  RSRLDLFGAGSGLTL---------------PPSYLLRPFAHKLQ----QPVPSFVSHSNI 736
            R+RLDLF + +  TL               PP YLLRPFA KLQ     P P  +S++  
Sbjct: 224  RTRLDLFASAATPTLRSNLNVNVNPLDPPTPPPYLLRPFAQKLQFRSLHPFPPSLSNT-- 281

Query: 735  GDHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSE-LGLHKQPGQNLVLNMQX 559
                                                 Q  SE  GL KQP          
Sbjct: 282  -------------LSPSTNSTTNSTSINYHQQQQQQQQNLSEHFGLMKQP---------- 318

Query: 558  XXXXXXXXHPSMLSFQSLLQSPNPKHTSPNVP-MFGLKSQGSSLTIPSHVELG-----GA 397
                     PS+ ++       +PK+   N   +     Q SS  IP  +++G     G 
Sbjct: 319  ---HNFNNTPSLEAYH------HPKYPLGNSSVLVSRPQQQSSFDIPPSLKMGVFEELGL 369

Query: 396  SSTHHLGHGL----SNLVNS-SDGISLRSSGDIN-------------WA--TEGVGSNDG 277
                H+   L     N+V+S S G+   SSG+ N             W   T  + ++D 
Sbjct: 370  RPDGHVNTDLRCLHQNMVSSTSVGVGALSSGNNNNNNNLSNANPSTEWVQRTGTITNDDC 429

Query: 276  DHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNFSASSSDFHADKGSE-NVPSRGEGMV 100
            DH                     ++RVS   K+++SASSSDFH +K  + +V +R +GMV
Sbjct: 430  DHGGG----GGGGLSGTVSYSDIAERVSNG-KVHYSASSSDFHGEKVPDFSVTARSQGMV 484

Query: 99   GSWI-CPSD 76
             SWI C SD
Sbjct: 485  ESWINCSSD 493


>ref|XP_006580869.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-79
            specific-like [Glycine max]
          Length = 486

 Score =  127 bits (318), Expect = 2e-26
 Identities = 123/360 (34%), Positives = 156/360 (43%), Gaps = 45/360 (12%)
 Frame = -2

Query: 1020 HNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSAGSPFQRSRLDL 841
            +NN    RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++ S F R+RLDL
Sbjct: 164  NNNCNVVRNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSS-FPRTRLDL 222

Query: 840  FGAGSGLTL-------------PPSYLLRPFAHKLQ----QPVPSFVSHSNIGDHXXXXX 712
            F + +  TL              P YLLRPFA KLQ     P P   S++          
Sbjct: 223  FASTATPTLRSNVNVNPFDPPTQPPYLLRPFAQKLQLRSLHPFPPSFSNT---------- 272

Query: 711  XXXXXXXXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXH 532
                                         Q     GL KQP                   
Sbjct: 273  -------LPPPSTNSPTNSTSINYHQQQQQLSEHFGLAKQP------------FNFNNTT 313

Query: 531  PSMLSFQSLLQSPNPKHTSPNVP--MFGLKSQGSSLTIPSHVELGGASSTH--------H 382
            P   + ++     +PK+T  N    +     Q  SL IP ++++G               
Sbjct: 314  PDTSTLEAY---HHPKYTLGNSSSVLVSRTQQQHSLEIPPNLKMGLYEELELRHDHVNTD 370

Query: 381  LGHGLSNLVNS-SDGISLRSSGDIN-----------WA--TEGVGSNDGDHHQSHLRXXX 244
            LG    N+V+S S G+   SS + N           WA  T  + +ND DH     R   
Sbjct: 371  LGCLHQNMVSSTSVGVGALSSDNNNNLSNATNSSTEWAQRTGTITNNDCDHG----RGGG 426

Query: 243  XXXXXXXXXXXXSQRVSTSCKMNFSASSSDFHADKGSE---NVPSRGEGMVGSWI-CPSD 76
                           V T+ K+++SASSSDFH +KG +      +R +GMV SWI C SD
Sbjct: 427  ALSGTVNYNDIGEGAVVTNGKVHYSASSSDFHGEKGPDFTVTTAARTQGMVESWINCSSD 486


>ref|XP_006591815.1| PREDICTED: uncharacterized serine-rich protein C215.13-like [Glycine
            max]
          Length = 425

 Score =  126 bits (317), Expect = 3e-26
 Identities = 115/331 (34%), Positives = 147/331 (44%), Gaps = 12/331 (3%)
 Frame = -2

Query: 1032 SSDPHNNAAAGRNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPF-SAGSPFQR 856
            SS+ + N    RNPKKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF S+ S F R
Sbjct: 160  SSNTNKNMV--RNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSSSSFPR 217

Query: 855  SRLDLFGAGSGLT------LPPSYLLRPFAHKLQQPVPSFVSHSNIGDHXXXXXXXXXXX 694
            +RLDLF   +  +        PSYLLRPFAHK+Q  VPS +   +               
Sbjct: 218  TRLDLFATSNASSSSIIREQTPSYLLRPFAHKVQAQVPSSIPPPS--------------- 262

Query: 693  XXXXXXXXXXXXXXXXXXXXXNYQRPSELGLHKQPGQNLVLNMQXXXXXXXXXHPSMLSF 514
                                 ++Q       H     N +L+ Q           S+L  
Sbjct: 263  ---------------------SFQPMLNNYHHHHHQHNPILSFQ-----------SILQP 290

Query: 513  QSLLQSPNPKHTSPNVPMFGLKSQGSSLTIPSHVELGGASSTHHLGHGLSNLVNSSDGIS 334
              L+ S   +  S  +P   L  +   L    H  +   SS+     G  + VN+ +   
Sbjct: 291  HQLIGSKTQQQPSLEIPPSALGLEELGLNHAHHQNINMRSSS---SDGTLSRVNNDNNNM 347

Query: 333  LRSSGDINWA---TEGVGSNDGDHHQSHLRXXXXXXXXXXXXXXXSQRVSTSCKMNFSAS 163
               S D  WA    + + +NDG      L                 QRV  +       +
Sbjct: 348  RGPSAD--WAQAQAQRIDNNDG----GLLGSLTGATLNYRSNIVSDQRVKVT-------N 394

Query: 162  SSDFHADKGSENVPS-RGEGMVGSWI-CPSD 76
            +SDFH +KG E V + R EGMV SWI C SD
Sbjct: 395  NSDFHGEKGPECVVAVRSEGMVESWINCSSD 425

