BLASTX nr result

ID: Akebia23_contig00011625 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00011625
         (2091 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62228.1| hypothetical protein VITISV_008028 [Vitis vinifera]   273   2e-70
ref|XP_007047984.1| VQ motif-containing protein [Theobroma cacao...   266   2e-68
emb|CAN67718.1| hypothetical protein VITISV_002357 [Vitis vinifera]   252   4e-64
ref|XP_002301045.1| VQ motif-containing family protein [Populus ...   239   5e-60
ref|XP_002307385.1| VQ motif-containing family protein [Populus ...   215   7e-53
ref|XP_002307093.1| VQ motif-containing family protein [Populus ...   211   1e-51
ref|XP_007018802.1| VQ motif-containing protein [Theobroma cacao...   210   2e-51
ref|XP_002534310.1| conserved hypothetical protein [Ricinus comm...   202   6e-49
ref|XP_002513906.1| conserved hypothetical protein [Ricinus comm...   200   2e-48
ref|XP_002310570.2| VQ motif-containing family protein [Populus ...   198   7e-48
gb|EXB29424.1| hypothetical protein L484_022090 [Morus notabilis]     182   7e-43
ref|XP_006434009.1| hypothetical protein CICLE_v10001250mg [Citr...   163   3e-37
ref|XP_006472623.1| PREDICTED: uncharacterized serine-rich prote...   162   6e-37
ref|XP_006428174.1| hypothetical protein CICLE_v10025465mg [Citr...   154   2e-34
ref|XP_006826929.1| hypothetical protein AMTR_s00010p00175790 [A...   152   6e-34
ref|XP_006596337.1| PREDICTED: probable myosin light chain kinas...   149   6e-33
ref|XP_007159640.1| hypothetical protein PHAVU_002G254700g [Phas...   146   3e-32
ref|XP_006580869.1| PREDICTED: histone-lysine N-methyltransferas...   144   1e-31
ref|XP_007163642.1| hypothetical protein PHAVU_001G251600g [Phas...   143   3e-31
ref|XP_006603093.1| PREDICTED: myb-like protein A-like [Glycine ...   140   2e-30

>emb|CAN62228.1| hypothetical protein VITISV_008028 [Vitis vinifera]
          Length = 422

 Score =  273 bits (699), Expect = 2e-70
 Identities = 176/399 (44%), Positives = 215/399 (53%), Gaps = 74/399 (18%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRS-----TPTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXS---- 1139
            +FDPL++Y +  SRS      P    NLD+VW +T RS+PNCT+                
Sbjct: 25   MFDPLSNYFDPLSRSPTQLQNPNSLLNLDMVWSKTLRSDPNCTEIGGILASSSSTPPFSG 84

Query: 1138 ----------AQLDRIPFP---------NPSTSTDPN---RNTKKRSRASRRAPTTVLTT 1025
                      + L  +PFP           S S D     RN KKRSRASRRAPTTVLTT
Sbjct: 85   AQGQIRATFPSSLPSMPFPPAPENAARATASASNDQTNVARNPKKRSRASRRAPTTVLTT 144

Query: 1024 DTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXP-FAQKV 848
            DT+NFRAMVQEFTGIPA PF++SPF RSR DLF + ST+RSGH            FAQK+
Sbjct: 145  DTTNFRAMVQEFTGIPAQPFTSSPFPRSRLDLFGTASTMRSGHLDHAPPSYLLRPFAQKL 204

Query: 847  QQPPFVS-----------STMIDXXXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGLHRQ 701
            Q PPF S           S+M+D                         YQLPS+LGL +Q
Sbjct: 205  QPPPFASPPPSSSSSFSSSSMVDAIASTTNITSGSASNTSSNSTSIN-YQLPSDLGLVKQ 263

Query: 700  SSNLQNM--ENPMLTFQSLLQSP-----PNKYPFLAKSQAPSSI---DARLKVRVLEEFG 551
              NL NM  +NP+L+ QS LQ+P     PN     +K Q    I   D+ +K+  LE+FG
Sbjct: 264  PQNLLNMNVQNPILSIQSFLQTPLKYPHPNSAIMGSKPQGSLEIPSTDSHIKMGGLEDFG 323

Query: 550  TSHGHVNANVGGLPNM---------------------GNSDGDQNHLRSFNGNYGNSQRV 434
             SHGHVN ++ GLPN+                     G+S G+   L   NGNY NSQRV
Sbjct: 324  LSHGHVNTHLSGLPNLVSSDRTASRSDNNPPSWNDGLGSSGGNHGQLGPLNGNYNNSQRV 383

Query: 433  SSCKMNFSASSSDFHADKGSENVSSRGEGMVDSWICSSD 317
            ++ KMN+SASSSDFH DK  ENVS+R EGMV+SWICSSD
Sbjct: 384  TNGKMNYSASSSDFHGDKVPENVSTRSEGMVESWICSSD 422


>ref|XP_007047984.1| VQ motif-containing protein [Theobroma cacao]
            gi|508700245|gb|EOX92141.1| VQ motif-containing protein
            [Theobroma cacao]
          Length = 472

 Score =  266 bits (681), Expect = 2e-68
 Identities = 186/409 (45%), Positives = 216/409 (52%), Gaps = 84/409 (20%)
 Frame = -3

Query: 1291 LFDPLTSYLE-VFSRST-----PTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXSAQL 1130
            +FDPL++Y +   SRS      P    NLD+VW +  RSEPNCT               L
Sbjct: 65   MFDPLSNYFDHPLSRSPQLTTIPNSLLNLDVVWSKNLRSEPNCTDLGGFIASSSPTQQLL 124

Query: 1129 ------DRIPFPN---------------PSTSTDPN-------RNTKKRSRASRRAPTTV 1034
                   R  FP+                 T   PN       RN KKRSRASRRAPTTV
Sbjct: 125  TNQQAQSRATFPSMQIPQGPESATKSSISGTGDQPNNNNSNMVRNPKKRSRASRRAPTTV 184

Query: 1033 LTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXP-FA 857
            LTTDT+NFRAMVQEFTGIPAPPF++SPF R+R DLF + ST+RS              FA
Sbjct: 185  LTTDTTNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFGTPSTMRSTPLDPSPPHYLLRPFA 244

Query: 856  QKVQQPPFVSST----------MIDXXXXXXXXXXXXXXXXXXXXXXXXN---YQLPSEL 716
            QK+  PPFVSS+          M+D                            YQL SEL
Sbjct: 245  QKIHPPPFVSSSTASSSFPSSSMVDAIASTPSTNITSASASNNNTTSSSTSINYQLSSEL 304

Query: 715  GLHRQSSNLQN--MENPMLTFQSLLQSPPNKYPF---------LAKSQAPSSIDARLKVR 569
            GL +Q  NL N  M+NP+L FQSLLQ+PP KYP          L  S    S D+ LK+ 
Sbjct: 305  GLLKQPQNLLNINMQNPILNFQSLLQAPP-KYPLPNSTILGTKLQGSLDIPSNDSSLKMG 363

Query: 568  VLEEFGTSHGHVNANVGGLPNMGNSDG-----------------------DQNHLRSFNG 458
            VLEEFG SHGHVN N+ GL NM +SDG                       DQ+ LRS NG
Sbjct: 364  VLEEFGLSHGHVNTNLSGLQNMVSSDGALPRNDSSTNPPSWGEGTGSQEHDQSLLRSING 423

Query: 457  NYG-NSQRVSSCKM-NFSASSSDFHADKGSENVSSRGEGMVDSWICSSD 317
             Y  NSQRVS+ K+ NFSASSSDFH DKG ENV++R EGMV+SWICSSD
Sbjct: 424  GYNSNSQRVSNGKVSNFSASSSDFHGDKGPENVAARSEGMVESWICSSD 472


>emb|CAN67718.1| hypothetical protein VITISV_002357 [Vitis vinifera]
          Length = 449

 Score =  252 bits (644), Expect = 4e-64
 Identities = 165/395 (41%), Positives = 210/395 (53%), Gaps = 70/395 (17%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRSTPTPTPN----LDIVWPRTQRSEPNCTQXXXXXXXXXXXSAQLDR 1124
            LFDP ++Y++ FS+S+  P  N    LD VW R  RSEPNCT            ++   +
Sbjct: 58   LFDPRSNYVDAFSQSSANPNANSLLNLDTVWSRGLRSEPNCTDFGNLTGLSSSSTSSSGQ 117

Query: 1123 IPF----------------PNPSTSTDPNRNTKKRSRASRRAPTTVLTTDTSNFRAMVQE 992
                                 PS  T+  R++KKR+RASRRAPTTVLTTDTSNFRAMVQE
Sbjct: 118  SMLGVQGPVHENGGRASSASLPSDQTNVVRSSKKRTRASRRAPTTVLTTDTSNFRAMVQE 177

Query: 991  FTGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXPFA-QKVQQPPFVSS--- 824
            FTGIPAPPFSASP++R R DLF + S+++ GH            +  KVQ   FVSS   
Sbjct: 178  FTGIPAPPFSASPYSR-RLDLFGAGSSIKPGHLEPLGPLYPLRPSPHKVQPNLFVSSSSS 236

Query: 823  -----------------TMIDXXXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGLHRQSS 695
                             T I                          YQLPS+ G  +Q  
Sbjct: 237  PSPSFFNSTIGDSIVSTTNIATTSTNNIITTSMAAATNAINSGSNTYQLPSDPGFPKQPQ 296

Query: 694  NLQNMENPMLTFQSLLQSPPN---KYPF----LAKSQAPSSIDARLKVRVLEEFGTSHGH 536
            N+  M+NP+L+FQSLLQSPP+   KYP     +  +++P+S+   L +   EE G  HGH
Sbjct: 297  NVLGMQNPILSFQSLLQSPPSHPLKYPLADVPVFGTKSPASLT--LPLPSFEELGVPHGH 354

Query: 535  VNANVGGLPN----------------------MGNSDGDQNHLRSFNGNYGNSQRVSSCK 422
            VNAN+ GLP+                       G+++G +  LR FNGNYG+S +VSS K
Sbjct: 355  VNANISGLPSHATSGGSRRLRTDDNGTCWRDGAGSNEGSREQLRPFNGNYGDSPQVSSFK 414

Query: 421  MNFSASSSDFHADKGSENVSSRGEGMVDSWICSSD 317
            +N SASSS FH +KGS+NVSSRGEG VDSWIC SD
Sbjct: 415  LNCSASSSAFHPEKGSDNVSSRGEGTVDSWICPSD 449


>ref|XP_002301045.1| VQ motif-containing family protein [Populus trichocarpa]
            gi|222842771|gb|EEE80318.1| VQ motif-containing family
            protein [Populus trichocarpa]
          Length = 423

 Score =  239 bits (609), Expect = 5e-60
 Identities = 162/375 (43%), Positives = 195/375 (52%), Gaps = 50/375 (13%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQRSEPNCT---------------------- 1178
            LFDP  S   VFS+S P P     +V  R  RS+PNCT                      
Sbjct: 50   LFDPTPSLFHVFSQSQPNPI----MVQSRGLRSDPNCTDLGINLPDSLSSSQSAVLGVQG 105

Query: 1177 --QXXXXXXXXXXXSAQLDRIPFPNPSTSTDPNRNTKKRSRASRRAPTTVLTTDTSNFRA 1004
              Q                R   P+   +    RN KKR+RASRRAPTTVLTTDTSNFR 
Sbjct: 106  SSQALPSSKQLRSVHDDGGRSSSPSHDQTHGIARNPKKRTRASRRAPTTVLTTDTSNFRQ 165

Query: 1003 MVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXPFAQKV--QQPPFV 830
            MVQEFTGIPAPPFS SPFTR R DLF   S LRSGH            AQKV  QQ PF+
Sbjct: 166  MVQEFTGIPAPPFSGSPFTR-RLDLFGPGSGLRSGHLEPLYPLRPT--AQKVHHQQTPFL 222

Query: 829  SST--------------MIDXXXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGLHRQSSN 692
            SS+              +                          NYQLP ++GLH+Q+ N
Sbjct: 223  SSSFPSLLNNNIVHTTNIASTSTTANNNNTISTAATSTFNPSSLNYQLPDDIGLHKQTRN 282

Query: 691  LQNMENPMLTFQSLLQSPPNKYPFLAKSQAPSSIDAR--LKVRVLEEFGTSHGHVNANVG 518
            L NM+N ML+   LL  PP   P    +      ++R  L +  LEE G  HG+VNAN+ 
Sbjct: 283  LLNMQNQMLSIHPLLHPPPPPPPQQLPNVPGLGANSRASLPLPSLEELGMGHGYVNANLS 342

Query: 517  GLPNMG-------NSDGDQNHLRSFNGNYGNSQRVSSCKMNFSASSSDFHADKGSENVSS 359
            GL +         ++DG  ++LRS NGNYGN QRV+SCK+N+S++SSDFH +KG ENVSS
Sbjct: 343  GLTSHVTTEEMRLSNDGSHHNLRSLNGNYGNMQRVNSCKLNYSSASSDFHHEKGLENVSS 402

Query: 358  RG-EGMVDSWICSSD 317
            RG EG VDSWIC S+
Sbjct: 403  RGTEGTVDSWICPSE 417


>ref|XP_002307385.1| VQ motif-containing family protein [Populus trichocarpa]
            gi|222856834|gb|EEE94381.1| VQ motif-containing family
            protein [Populus trichocarpa]
          Length = 437

 Score =  215 bits (547), Expect = 7e-53
 Identities = 163/385 (42%), Positives = 195/385 (50%), Gaps = 60/385 (15%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRSTPTPTPN-----LDIVWPRTQRSEPNCTQXXXXXXXXXXXSAQLD 1127
            +FDP  +    FS+S     PN     LD+V  R  RSE +CT+           S    
Sbjct: 50   IFDPSPALFHAFSQSQSITNPNSSMLNLDMVHSRGLRSEHSCTRLGINLPDSLSSSQSAP 109

Query: 1126 ----------------RIPFPNPSTSTDPN-------RNTKKRSRASRRAPTTVLTTDTS 1016
                            R    N   S+ P+       RN KKR+RASRRAPTTVLTTDTS
Sbjct: 110  LGAQGSSQALPSSMQLRSVHDNGVRSSSPSDQTHGVARNPKKRTRASRRAPTTVLTTDTS 169

Query: 1015 NFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXP-FAQKV--Q 845
            NFR MVQEFTGIPAPPF+ S FTR R DLF   S LRSGH             AQKV  Q
Sbjct: 170  NFRQMVQEFTGIPAPPFTGSSFTR-RLDLFGPGSGLRSGHLEPIGSLYPLRPSAQKVHHQ 228

Query: 844  QPPFVSST--------------MIDXXXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGLH 707
            Q P +SS+              +                          NYQL + LGLH
Sbjct: 229  QTPLLSSSSPSFFNNDIVDGTNIASTSTTANNNNTITTATTSTFNPSSVNYQLSAHLGLH 288

Query: 706  RQSSNLQNMENPMLTFQSLLQSPPNKYPFLAKSQAP---SSIDARLKVRVLEEFGTSH-- 542
            +Q  NL NM+N ML+   LLQ P    PF + +  P   +   A   +   EE G  H  
Sbjct: 289  KQPQNLLNMQNQMLSIHPLLQPPAP--PFQSLANVPGLGAKSQASFPLPSFEELGMGHGD 346

Query: 541  GHVNANVGGLPN-------MGNSDGDQNH-LRSFNGNYGNSQRVSSCKMNF-SASSSDFH 389
            GHVNA++GGL +         +SDGDQ+H LRS +GNYGN +RV+SCK+N+ SASSS FH
Sbjct: 347  GHVNAHLGGLTSHVTTEGMRLSSDGDQDHNLRSLDGNYGNMKRVNSCKLNYSSASSSGFH 406

Query: 388  ADKGSENVSSRG-EGMVDSWICSSD 317
             DK  ENVSSRG EG VDSWIC S+
Sbjct: 407  HDKVLENVSSRGAEGTVDSWICPSE 431


>ref|XP_002307093.1| VQ motif-containing family protein [Populus trichocarpa]
            gi|222856542|gb|EEE94089.1| VQ motif-containing family
            protein [Populus trichocarpa]
          Length = 510

 Score =  211 bits (537), Expect = 1e-51
 Identities = 165/440 (37%), Positives = 210/440 (47%), Gaps = 116/440 (26%)
 Frame = -3

Query: 1288 FDPLTSYLEVFSRST------------PTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXX 1145
            FDP ++Y +  + S+            P    NLD+VW +  RS+PNCT           
Sbjct: 73   FDPFSNYFDPLAPSSSSSRSPLQSLTNPNSLNNLDMVWSKNLRSDPNCTDLGGFISSSLP 132

Query: 1144 XSA--------------------QLDRIPFPNPSTSTDPN-------RNTKKRSRASRRA 1046
                                      R+P     + T+         RN KKRSRASRRA
Sbjct: 133  TQQFTNQTQNRTTFQSLPSHGQESATRVPGSGSVSGTNDQVSNTAGIRNPKKRSRASRRA 192

Query: 1045 PTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLF-NSTSTLRSG----HXXXXX 881
            PTTVL+TDT+NFRAMVQEFTGIPAPPF++SPF RSR DLF  + STLRS           
Sbjct: 193  PTTVLSTDTTNFRAMVQEFTGIPAPPFTSSPFPRSRLDLFGTAASTLRSAVSQHLDPSPP 252

Query: 880  XXXXXPFAQKVQ---QPPFVSS---------TMID----XXXXXXXXXXXXXXXXXXXXX 749
                 PFA+K Q    PPFVSS         +M+D                         
Sbjct: 253  PYLLGPFAKKFQPPPPPPFVSSGSAASSFSASMVDAIASTTATNINGTCTNTTISNNIPL 312

Query: 748  XXXNYQLPSELGLHRQSSNLQNM--ENPMLTFQSLLQSPPNKYPF-------------LA 614
               NYQLPS+LGL +Q  NL N+  +NP+L F  LLQ+PP KYP                
Sbjct: 313  TSINYQLPSDLGLLKQPHNLLNLNVQNPILNFHPLLQAPP-KYPLPDSPNILGTTKPQQG 371

Query: 613  KSQAPSSIDARLKVRVLEEFGTSHGHVNANVGGLPNM----------------------- 503
              + P ++ + LK+ VLEEFG +HGHVN N+ GL N+                       
Sbjct: 372  SLEIPLNV-SHLKMVVLEEFGLNHGHVNTNLSGLQNIVSSSSPSADVTLVRRSDHSNSLT 430

Query: 502  ------GNSDGDQNH---------LRSFNGNYGNS-QRVSSCKMNFSASSSDFHADK--G 377
                  G+++ D +H         LRS NG+Y NS QRV++ K+NF ASSSDF  D   G
Sbjct: 431  NWGDGAGSNEVDHHHHQQQQQQGLLRSINGDYNNSTQRVTNGKVNFLASSSDFCGDHKLG 490

Query: 376  SENVSSRGEGMVDSWICSSD 317
             ENV++R EG ++SWICSSD
Sbjct: 491  QENVATRSEGTMESWICSSD 510


>ref|XP_007018802.1| VQ motif-containing protein [Theobroma cacao]
            gi|508724130|gb|EOY16027.1| VQ motif-containing protein
            [Theobroma cacao]
          Length = 551

 Score =  210 bits (534), Expect = 2e-51
 Identities = 158/374 (42%), Positives = 197/374 (52%), Gaps = 70/374 (18%)
 Frame = -3

Query: 1288 FDPLTSYLEVFSRSTPTPTP-NLDI-VWPRTQRSEPNCTQXXXXXXXXXXXSAQ-----L 1130
            FDP ++YL  FS+S P  +  NLD  V PR  RSEPNCT             +      L
Sbjct: 119  FDPSSNYLNPFSQSQPNNSLLNLDGGVRPRGLRSEPNCTDLGNLPGSSSSSQSMLGAQGL 178

Query: 1129 DRIPFPNPST------------------STDPNRNTKKRSRASRRAPTTVLTTDTSNFRA 1004
            ++  FP+ S+                   T   +N KKR+RASRRAPTTVLTTDT+NFRA
Sbjct: 179  NQGSFPSSSSMQSRPAHDNGARSLAQSDQTSVVKNPKKRTRASRRAPTTVLTTDTTNFRA 238

Query: 1003 MVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSGH-XXXXXXXXXXPFAQKVQQPPFVS 827
            MVQEFTGIPAPPFS S ++R R DLF S S +RS H           P A++VQ  PFVS
Sbjct: 239  MVQEFTGIPAPPFSGSSYSR-RLDLFGSGSGMRSSHLEPLGSLYPLRPSAKRVQPTPFVS 297

Query: 826  STM-------------IDXXXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGLHRQSSNLQ 686
            S+              I                         NYQLPS+L L +Q  N+ 
Sbjct: 298  SSSPSLLNNPLVDAANITNTTSNSTIPTSIAATTNAFNPTSSNYQLPSDLSLLKQPQNML 357

Query: 685  NMEN--PMLTFQSLLQSPPNKYP------FLAKSQAPSSIDARLKVRVLEEFGTSHGHVN 530
            N++N  P+L+FQS LQ PP  +P      F  KSQ  S++ +      L+E G SHGHVN
Sbjct: 358  NLQNQSPVLSFQSFLQ-PPTLHPSLNLPGFGVKSQGSSAMPS------LDELGMSHGHVN 410

Query: 529  ANVGGLPN------------------MGNSDGDQNHLRSFNGNYG----NSQRV-SSCKM 419
            AN+GGL +                  +G +DG+Q+HLR  +GNYG    NSQRV +SCK+
Sbjct: 411  ANLGGLQSHVTPDGPRARSDSNWRDGIGLNDGNQDHLRPLDGNYGNDHHNSQRVNNSCKL 470

Query: 418  NFSASSSDFHADKG 377
            NFSASSSDFH DKG
Sbjct: 471  NFSASSSDFHHDKG 484


>ref|XP_002534310.1| conserved hypothetical protein [Ricinus communis]
            gi|223525518|gb|EEF28072.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 498

 Score =  202 bits (513), Expect = 6e-49
 Identities = 170/437 (38%), Positives = 208/437 (47%), Gaps = 112/437 (25%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRSTPTPTP-------NLDIVWPRTQRSEPNCTQXXXXXXXXXXXSAQ 1133
            +FDPL++Y +  S S P P         NLD+VW +  RS+ NCT            + Q
Sbjct: 66   MFDPLSNYFDPLSSSRPPPPLTHPNSLLNLDMVWSKNLRSDTNCTDLGGFIATSSSPTQQ 125

Query: 1132 L-----DRIPFPNPS--------------------------------TSTDPNRNTKKRS 1064
                     P  NPS                                T+T+  RN KKRS
Sbjct: 126  FFTNQTQTGPTYNPSIQIPPVQETTAPSRGPGSASASGSNGHQTNNTTTTNIVRNPKKRS 185

Query: 1063 RASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFN--STSTLRS--GH 896
            RASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++SPF RSR DLF   + S+LRS   H
Sbjct: 186  RASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRSRLDLFGTAAASSLRSVVSH 245

Query: 895  -XXXXXXXXXXPFAQKVQQPPFV----SSTMIDXXXXXXXXXXXXXXXXXXXXXXXXNYQ 731
                       PFAQK+ QPPF+    SS+MID                           
Sbjct: 246  LEPSHPSYLLRPFAQKI-QPPFLSSSSSSSMIDAIASSTTTSTNINSNATTNTTTNTTSN 304

Query: 730  LPSELGLHRQSSNLQNM---ENPMLTFQSLLQ-----SPPNKYPFLAKSQAPS----SID 587
              S+L L +   NL N+    +PML F SL Q     S PN      K Q  S    S D
Sbjct: 305  --SDLSLLKYPQNLLNINMHNSPMLNFHSLFQPSPKYSLPNSSILATKPQEGSLDTPSND 362

Query: 586  ARLKVRVLEEFGTSHGHVNANVGGLPNM-------------------------------- 503
              LK+ VLEEFG SHGHV+ N+ GL N+                                
Sbjct: 363  PHLKMGVLEEFGLSHGHVSTNLTGLHNLVSSSDTTLRRSDHNSSSSSNNNNNNSGNWGDR 422

Query: 502  --GNSDGDQNHLRSFNGNY------GNSQRV--SSCKMNFSASSSDF-HADKGSEN---- 368
              G+++GD + LRS NGNY       N+QRV  ++ K+N+SASSSDF H DKG E     
Sbjct: 423  RVGSNEGD-HLLRSINGNYNNNNSSSNTQRVVANNGKVNYSASSSDFNHGDKGPETNVVV 481

Query: 367  VSSRGEGMVDSWICSSD 317
             ++R EGMV+SWICSSD
Sbjct: 482  ANTRSEGMVESWICSSD 498


>ref|XP_002513906.1| conserved hypothetical protein [Ricinus communis]
            gi|223546992|gb|EEF48489.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 446

 Score =  200 bits (509), Expect = 2e-48
 Identities = 154/390 (39%), Positives = 186/390 (47%), Gaps = 66/390 (16%)
 Frame = -3

Query: 1288 FDPLTSYLEVFSRSTPTPTPN-----LDIVWPRTQRSEPNCTQXXXXXXXXXXXSAQLDR 1124
            FDP  +    FS+S+P P  N     LD+V PR  RS+P+CT            SA    
Sbjct: 67   FDPSPNLFHAFSQSSPNPNLNSSLLNLDVVRPRGLRSDPDCTDLRSNLPGSSSSSATAPA 126

Query: 1123 I------------------PFPN---------------PSTSTDP-NRNTKKRSRASRRA 1046
                               P P+               PS  T    RN KKR+RASRRA
Sbjct: 127  AAPSGQSSVLGAQGSGQGAPLPSMQLRSVQDNGGRCSSPSDQTHVVTRNPKKRTRASRRA 186

Query: 1045 PTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNST-STLRSGHXXXXXXXXX 869
            PTTVLTTDTSNFRAMVQEFTGIPAPPFS SP++R R DLF S  S +RS H         
Sbjct: 187  PTTVLTTDTSNFRAMVQEFTGIPAPPFSGSPYSRCRLDLFGSVGSGMRSSHLEQMGSLYP 246

Query: 868  XP-FAQKVQ--QPPFVSS---------TMIDXXXXXXXXXXXXXXXXXXXXXXXXN---- 737
                AQKVQ  Q PF  S         TM+D                             
Sbjct: 247  LHPSAQKVQHQQSPFSFSSSSSLLNTNTMVDATNIASTTTTTNDNNTITSSSIPAGTTST 306

Query: 736  --------YQLPSELG-LHRQSSNLQNMENPMLTFQSLLQSPPNKYPFLAKSQAPSSIDA 584
                    YQLPS+LG L +Q  N+ NM+N ML+FQSLLQ PP  +  L      +   A
Sbjct: 307  FNPSSINNYQLPSDLGQLSKQPQNMLNMQNQMLSFQSLLQPPPLHHSSLNVHGLGAKSQA 366

Query: 583  RLKVRVLEEFGTSHGHVNANVGGLPNMGNSDGDQNHLRSFNGNYGNSQRVSSCKMNFSAS 404
             + +  L++ G SH   NAN+ G+P+   +  +   LR       N+   +SCK N+SAS
Sbjct: 367  SMPLPSLDDLGMSH---NANLSGIPSHNVTTAEGMRLR-------NNDHNNSCKFNYSAS 416

Query: 403  SSDF-HADKGSENVSSRGEGMVDSWICSSD 317
            SSDF H DKG E V  RGEG VDSWIC S+
Sbjct: 417  SSDFHHHDKGLEIVPPRGEGAVDSWICPSE 446


>ref|XP_002310570.2| VQ motif-containing family protein [Populus trichocarpa]
            gi|550334197|gb|EEE91020.2| VQ motif-containing family
            protein [Populus trichocarpa]
          Length = 527

 Score =  198 bits (504), Expect = 7e-48
 Identities = 168/432 (38%), Positives = 209/432 (48%), Gaps = 114/432 (26%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFS----RSTPTPTPN------LDIVWPRTQRSEPNCTQXXXXXXXXXXX 1142
            LFDPL++Y +  S    RS P P  N      LD+VW +  RSEPNCT            
Sbjct: 69   LFDPLSNYFDPLSSASSRSPPPPFTNPNSLLNLDMVWSKNLRSEPNCTDLGGFISSSSPT 128

Query: 1141 SAQLD-----RIPFPN------------PSTSTDPN-------RNTKKRSRASRRAPTTV 1034
                      R  F +            P + T+         RN KKRSRASRRAPTTV
Sbjct: 129  QQLFTNQTQTRTTFQSLPPHGHESATRGPVSGTNDQVSNTAGVRNPKKRSRASRRAPTTV 188

Query: 1033 LTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLF-NSTSTLRSG----HXXXXXXXXX 869
            LTTDT+NFRAMVQEFTGIPAPPF++SPF RSR DLF  + STLRS               
Sbjct: 189  LTTDTTNFRAMVQEFTGIPAPPFTSSPFPRSRLDLFGTAASTLRSAVSHHLDPSPPPYLL 248

Query: 868  XPFAQKVQ-----QPPFVSS---------TMID------XXXXXXXXXXXXXXXXXXXXX 749
             PFAQ+ Q      PPF SS         +M+D                           
Sbjct: 249  RPFAQRFQPPPPPAPPFASSGSTASSFSTSMVDAIASTTTTNINNSGACTNSTTTSNISS 308

Query: 748  XXXNYQLPSELGLHRQSSNLQ--NMENPMLTFQSLLQSPPNKYPF--------LAKSQAP 599
               NYQLPS+LGL +Q  +L   N++NP+L F  L Q+ P+KYP           K+Q  
Sbjct: 309  TSINYQLPSDLGLLKQPHHLLNINVQNPILNFHPLFQA-PHKYPLPNSTNILGTTKAQQG 367

Query: 598  SSI-----DARLKVRVLEEFGTSHGHVNANVGGLPNMGNSD------------GDQNH-- 476
            SS+     D+ LK+ VLEEFG SHGHV+ N+ GL N+ +S             GD N+  
Sbjct: 368  SSLEIPSNDSHLKMGVLEEFGMSHGHVSTNLTGLQNIVSSSSSPSADATLMRRGDHNNNL 427

Query: 475  -----------------------LRSFNGNYGNS-QRVSSCKMNF-SASSSDFHAD-KGS 374
                                   LRS NGNY NS QRV++ K+NF ++SSSDF  D KG 
Sbjct: 428  ANWGDGVGSNGGGHHHHQQQQGLLRSINGNYNNSTQRVTNGKVNFLASSSSDFRGDNKGQ 487

Query: 373  ENVSSRGEGMVD 338
            ENV++R E +V+
Sbjct: 488  ENVATRSEEVVN 499


>gb|EXB29424.1| hypothetical protein L484_022090 [Morus notabilis]
          Length = 443

 Score =  182 bits (461), Expect = 7e-43
 Identities = 146/429 (34%), Positives = 191/429 (44%), Gaps = 104/429 (24%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRST------PTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXS--- 1139
            +FDPL+++ +  S S+      P P  NLD+VWP+  RS+PN                  
Sbjct: 54   MFDPLSNFFDPVSSSSSSRSLNPNPFLNLDMVWPKPVRSDPNPNSSELVSLIPSSSQPNF 113

Query: 1138 --------------------------------AQLDRIPFPNPSTSTD-----PN----- 1085
                                            +Q+ R+P  + +   D     PN     
Sbjct: 114  FSSNNSIQLGQTAVAGASNFPAIQIAPEIQTRSQIHRLPSSSSADRNDAVNITPNGGAAP 173

Query: 1084 RNTKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLR 905
            RN KKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPF++SPF R+R DLF S S +R
Sbjct: 174  RNPKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFGSGSGIR 233

Query: 904  SG---------HXXXXXXXXXXPFAQKVQQ-PPFVSSTMIDXXXXXXXXXXXXXXXXXXX 755
            S                     PFAQK+QQ  PFV+++                      
Sbjct: 234  SAPLDPHHHHPSTGTSSYNLLRPFAQKIQQTTPFVNTSA--------------------- 272

Query: 754  XXXXXNYQLPSELGLHRQSSNLQNMENPMLTFQSLLQSPPNKYPFLAKSQAPSSIDARLK 575
                      S       +S L    NP+L+F SLLQ+ P K+  +  + A +       
Sbjct: 273  -------SSSSSPSTTTSNSLLNIQTNPVLSFHSLLQNAPPKFAKMGSTSASA------- 318

Query: 574  VRVLEEFGTSHGH---VNANVGGLPN--------------------MGNSDGDQNH---- 476
                ++FG SHGH   VN  +GG+PN                    MG++D +  +    
Sbjct: 319  ----DQFGLSHGHHVNVNPQLGGIPNPPTTMATTTATNWGITTDHGMGSNDNNNGNNGNN 374

Query: 475  --------LRSFNGNYGNSQRVSSC-------KMNFSASSS-DFHADKGSENVSSRGEGM 344
                    LRS NG Y  +   +S        K+N+SASSS DFH  K   NV++R EGM
Sbjct: 375  SNVDEGLLLRSINGGYTANTTAASAAAVSNGHKVNYSASSSTDFHGSKTEINVAARSEGM 434

Query: 343  VDSWICSSD 317
            V+SWICSSD
Sbjct: 435  VESWICSSD 443


>ref|XP_006434009.1| hypothetical protein CICLE_v10001250mg [Citrus clementina]
            gi|557536131|gb|ESR47249.1| hypothetical protein
            CICLE_v10001250mg [Citrus clementina]
          Length = 426

 Score =  163 bits (412), Expect = 3e-37
 Identities = 144/386 (37%), Positives = 177/386 (45%), Gaps = 64/386 (16%)
 Frame = -3

Query: 1282 PLTSYLEVFSRSTPTPTP----------NLDIVWPRTQRS-------EPNCTQXXXXXXX 1154
            P +SYL+  S+S     P          NLD+V  RT RS       EP+CT        
Sbjct: 61   PSSSYLQAHSQSQSQSQPQHNSNPSSFLNLDLVGSRTTRSCSLIRSSEPSCTDSSTVAHQ 120

Query: 1153 XXXXSAQLDRIP-----------FPNPSTSTDPN-RNTKKRSRASRRAPTTVLTTDTSNF 1010
                       P             N    T+   +N KKR+R SRRAPTTVLTTDTSNF
Sbjct: 121  GLINHGSFSTAPSSSHMQQQSRLLVNDQYQTNVVVKNPKKRTRTSRRAPTTVLTTDTSNF 180

Query: 1009 RAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRS------------GHXXXXXXXXXX 866
            RAMVQEFTGIP+ PFS       R DLF   S ++S            G           
Sbjct: 181  RAMVQEFTGIPSQPFSVGSSYSRRLDLFGPGSAIKSSGNIHNHLEPMGGPLNYHLRPATQ 240

Query: 865  PFAQKVQQPPFVSSTMIDXXXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGLHRQSSN-- 692
             F   +   P  SS+MID                        NY + SELGL+  ++N  
Sbjct: 241  KFQPNLFSSPSSSSSMID----------AIAAAASTSHNTTSNYHVLSELGLNSNNNNNT 290

Query: 691  ------LQNM-------ENPMLTFQSLLQ-SPPNKYPFLA-----KSQAPSSIDARLKVR 569
                  L N+        +P+++FQS+LQ S P   P L      KSQA S   A     
Sbjct: 291  KEPQNTLNNIMQSQNLNHHPVVSFQSILQHSSPLNNPSLTFGANNKSQA-SHFGAPSFED 349

Query: 568  VLEEFGTSHGHVNANVGGLPNMGNSDGDQNHLRSFNGNYGN-SQRVSSCKMNFSASSSDF 392
                   SH   +A+  GLPN      +Q+  RSF+GN+ N SQR +SCK+N+SASSSDF
Sbjct: 350  HDHHLAMSH---HASHVGLPN------NQDQFRSFDGNFANSSQRATSCKLNYSASSSDF 400

Query: 391  HADKGSENVSSRG-EGMVDSWICSSD 317
            H +K  ENVSSRG EG VDSWIC SD
Sbjct: 401  HHNKNLENVSSRGTEGTVDSWICPSD 426


>ref|XP_006472623.1| PREDICTED: uncharacterized serine-rich protein C215.13-like [Citrus
            sinensis]
          Length = 429

 Score =  162 bits (410), Expect = 6e-37
 Identities = 144/393 (36%), Positives = 179/393 (45%), Gaps = 71/393 (18%)
 Frame = -3

Query: 1282 PLTSYLEVFSRSTPTPTP------------NLDIVWPRTQRS-------EPNCTQXXXXX 1160
            P +SYL+  S+S     P            NLD+V  RT RS       EP+CT      
Sbjct: 60   PSSSYLQAHSQSQSQSQPQPQHNSNPSSFLNLDLVGSRTTRSCSVFRSSEPSCTDSSTVA 119

Query: 1159 XXXXXXSAQLDRIP-----------FPNPSTSTDPN-RNTKKRSRASRRAPTTVLTTDTS 1016
                         P             N    T+   +N KKR+R SRRAPTTVLTTDTS
Sbjct: 120  HQGLINHGSFSTAPSSSHMQQQSRLLVNDQYQTNVVVKNPKKRTRTSRRAPTTVLTTDTS 179

Query: 1015 NFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRS------------GHXXXXXXXX 872
            NFRAMVQEFTGIP+ PFS       R DLF   S ++S            G         
Sbjct: 180  NFRAMVQEFTGIPSQPFSVGSSYSRRLDLFGPGSAIKSSGNIHNHLEPMGGPLNYHLRPA 239

Query: 871  XXPFAQKVQQPPFVSSTMIDXXXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGLHRQSSN 692
               F   +   P  SS+MID                        NY + SELGL+  ++N
Sbjct: 240  TQKFQPNLFSSPSSSSSMID--------AIAAAAAASTSHNTTSNYHVLSELGLNSNNNN 291

Query: 691  --------LQNM-------ENPMLTFQSLLQ-SPPNKYPFLA-----KSQ-----APSSI 590
                    L N+        +P+++FQS+LQ S P   P L      KSQ     APS  
Sbjct: 292  NTKEPQNTLNNIMQSQNLNHHPVVSFQSILQHSSPLNNPSLTFGANNKSQASHFGAPSFE 351

Query: 589  DARLKVRVLEEFGTSHGHVNANVGGLPNMGNSDGDQNHLRSFNGNYGN-SQRVSSCKMNF 413
            D         +   +  H  ++V GLPN      +Q+  RSF+GN+ N SQR +SCK+N+
Sbjct: 352  D--------HDHHLAMSHHGSHV-GLPN------NQDQFRSFDGNFANSSQRATSCKLNY 396

Query: 412  SASSSDFHADKGSENVSSRG-EGMVDSWICSSD 317
            SASSSDFH +K  ENVSSRG EG VDSWIC SD
Sbjct: 397  SASSSDFHHNKNLENVSSRGTEGTVDSWICPSD 429


>ref|XP_006428174.1| hypothetical protein CICLE_v10025465mg [Citrus clementina]
            gi|568819356|ref|XP_006464221.1| PREDICTED:
            uncharacterized threonine-rich GPI-anchored glycoprotein
            PJ4664.02-like [Citrus sinensis]
            gi|557530164|gb|ESR41414.1| hypothetical protein
            CICLE_v10025465mg [Citrus clementina]
          Length = 491

 Score =  154 bits (389), Expect = 2e-34
 Identities = 142/412 (34%), Positives = 184/412 (44%), Gaps = 91/412 (22%)
 Frame = -3

Query: 1279 LTSYLEVFSRSTPTPTPNLDIVWPRTQRSEPNCTQXXXXXXXXXXXSAQL---------- 1130
            L+S L + +   P    NLD+VW ++ RSEPNCT            SA            
Sbjct: 86   LSSSLPLNTNHHPNSLLNLDMVWSKSLRSEPNCTDLGGLFVPSSSSSATAFPALHVSPRE 145

Query: 1129 -------DRIPF-----PNPSTSTDPN----------------RNTKKRSRASRRAPTTV 1034
                    + P      P P  + D +                RN KKRSRASRRAPTTV
Sbjct: 146  GGTESVSSKAPSFLATAPGPGLNIDQSHINLNNSSNQHSTMMVRNPKKRSRASRRAPTTV 205

Query: 1033 LTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTSTLRSG--------------- 899
            LTTDT+NFRAMVQEFTGIPAPPF++S F R+R DLF ++S+  S                
Sbjct: 206  LTTDTTNFRAMVQEFTGIPAPPFTSSHFPRTRLDLFGNSSSTSSSLMMRSTIGSHLDSSL 265

Query: 898  HXXXXXXXXXXPFAQKVQQP-PFVSS--TMIDXXXXXXXXXXXXXXXXXXXXXXXXNYQL 728
                       PFAQK+  P PF +S  + I                           QL
Sbjct: 266  PQPPSSYNLLRPFAQKLINPIPFSTSNNSSIFIDAIASASTSTPNATATTTSTNINYQQL 325

Query: 727  PSELGLHRQSSNLQNME--NPMLTFQSLLQSPPNKYPFLAKS----QAPSSIDARLKVRV 566
            PS    H+Q+    NM+  NP+L   SLLQ PP KYP LA S      P       +   
Sbjct: 326  PS----HQQNLFGMNMQHNNPILNLHSLLQVPP-KYP-LANSPILETKPPPPPPPPQGSS 379

Query: 565  LEEFGTSH------GHVNAN-VGGLPNMGNSDGDQNHLRSFNGNYGNSQRVSSCKMN--- 416
            LEE G SH       H+N N + GL ++ +S  + N   S++G+   +   ++   N   
Sbjct: 380  LEELGLSHAAAVSASHLNTNLMSGLQSLVSSSDNNNSPTSWHGHGTGTGATAAVGSNENE 439

Query: 415  --------------FSASSSDFHADKGSENV-----SSRGEGMVDSWICSSD 317
                          FS +SS+FH DKG E+V     ++R EGMV+SWICSSD
Sbjct: 440  EATAGLFANGKLSRFSQASSEFHRDKGQESVNVAAATTRTEGMVESWICSSD 491


>ref|XP_006826929.1| hypothetical protein AMTR_s00010p00175790 [Amborella trichopoda]
            gi|548831358|gb|ERM94166.1| hypothetical protein
            AMTR_s00010p00175790 [Amborella trichopoda]
          Length = 326

 Score =  152 bits (384), Expect = 6e-34
 Identities = 125/302 (41%), Positives = 149/302 (49%), Gaps = 49/302 (16%)
 Frame = -3

Query: 1075 KKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTR--SRFDLFNSTSTLRS 902
            KKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIP PPFS+SPF R  +RFD        RS
Sbjct: 47   KKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPNPPFSSSPFQRASTRFDFIGGGGGSRS 106

Query: 901  GHXXXXXXXXXXPFAQKVQQPPFVSSTMIDXXXXXXXXXXXXXXXXXXXXXXXXNYQLPS 722
                        PF QK   PP  SS  I                            +P+
Sbjct: 107  ---EPAPPFLLRPFPQK-PSPPLSSSNSISGSSSLNIVSSNADIV------------MPN 150

Query: 721  ELGLHRQSSNLQNMENPMLTFQSLLQSPP---NKYPFL---AKSQAPSS----------- 593
             L     SS+ QN+  P L  Q  +Q PP   N +P L   AK  +P +           
Sbjct: 151  YLA---ASSSSQNVPVPQLPIQ--MQGPPSFVNFHPVLSHNAKFMSPMAPMPGFLAGKGQ 205

Query: 592  --IDARLKVRVLEEFG---------TSHGH-------------VNANVGGLPNMGNSDGD 485
               D+RLK  VLE FG         + HGH                + GG    G  + +
Sbjct: 206  IPADSRLKSGVLEGFGSDSGQIGGASGHGHGQTGGPRPDFVSGGGGSRGGDLGYGGDEEE 265

Query: 484  QNHLRSFN-----GNYGNSQRVSSCKMNFS-ASSSDFHADKGSENVSSRGEGMVDSWICS 323
            +  +RS +      NY  +QR SSCK+N+S +SSSDFH +KGSENV  RGEGMVDSWICS
Sbjct: 266  EGFMRSSSSSVAANNYFGNQRNSSCKLNYSVSSSSDFHVEKGSENV-GRGEGMVDSWICS 324

Query: 322  SD 317
            SD
Sbjct: 325  SD 326


>ref|XP_006596337.1| PREDICTED: probable myosin light chain kinase DDB_G0279831-like
            [Glycine max]
          Length = 429

 Score =  149 bits (375), Expect = 6e-33
 Identities = 139/381 (36%), Positives = 171/381 (44%), Gaps = 56/381 (14%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQ--RSEPNCTQXXXXXXXXXXXS------- 1139
            LFD   SYL   S+  P    NLD     +Q  RSEP+CT            +       
Sbjct: 65   LFDLSPSYLHALSQ--PNSFLNLDTTTSSSQPRRSEPDCTLNVTSSPPPPTTTNIDQCLL 122

Query: 1138 ----------AQLDRIPFPNPSTSTDPNRNTKKRSRASRRAPTTVLTTDTSNFRAMVQEF 989
                      A+ D I F +  TS +  RN+KKR+RASRRAPTTVLTTDTSNFRAMVQEF
Sbjct: 123  GSQGGLNVDNARRDTILFESGKTS-NLGRNSKKRTRASRRAPTTVLTTDTSNFRAMVQEF 181

Query: 988  TGIPAPPFSASPFTRSRFDLFNSTSTLRSGHXXXXXXXXXXPF---AQKV-------QQP 839
            TGIPAPPFSAS     R DL   +S+LRS            P     QKV       Q P
Sbjct: 182  TGIPAPPFSASSSYSRRLDLLTGSSSLRSFSHLDTTTGPFYPLRPSPQKVHHHHHHHQNP 241

Query: 838  PFVSST------MID--XXXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGL--HRQSSN- 692
              +SS+      M+D                            QLP +LGL  H    N 
Sbjct: 242  LLLSSSSSPYNNMVDAIASTTTTNNSSSNNNNNNNNPINFQQQQLPPDLGLPYHHNPQNI 301

Query: 691  -LQNMENPMLTFQSLLQSPPNKYPFLAKSQAPSSIDARLKVRVLEEFGTSHGHVNANVGG 515
             L   ++P L F      PP  +PF   ++ PS          +E+ G SHG VN N   
Sbjct: 302  MLSMQDHPTLAFH---PPPPPLHPFGFSAKLPS----------IEDLGMSHGQVNNNNPN 348

Query: 514  LPNMGNSDGDQNHLRSFNGNYGNSQRVS-------SCKMNF---SASSSDFHADKGSEN- 368
                G+   +   LRS N + G ++ VS       SCK+NF   SAS+S  H     +N 
Sbjct: 349  FVASGHVTSEGVPLRSVNNDGGGARDVSLRSLDGGSCKLNFSVASASTSLNHEKSTLQNN 408

Query: 367  ----VSSRGEGMVDSWICSSD 317
                  +RGEG VDSWICSS+
Sbjct: 409  NNASTGTRGEGTVDSWICSSE 429


>ref|XP_007159640.1| hypothetical protein PHAVU_002G254700g [Phaseolus vulgaris]
            gi|561033055|gb|ESW31634.1| hypothetical protein
            PHAVU_002G254700g [Phaseolus vulgaris]
          Length = 493

 Score =  146 bits (369), Expect = 3e-32
 Identities = 144/436 (33%), Positives = 192/436 (44%), Gaps = 111/436 (25%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQRSEPNCT---------------------- 1178
            +FDPL+ YL+  ++S+ T   NLD++W +  RSEPN T                      
Sbjct: 67   MFDPLSGYLDPLTQSS-TSLLNLDVMWSKPGRSEPNQTTLANLIPCSSSSPSPHNQAFLS 125

Query: 1177 ----------------QXXXXXXXXXXXSAQLDRIPFPNPSTSTDPN-----RNTKKRSR 1061
                            +           +A  D+I   + + ST+ N     RN KKRSR
Sbjct: 126  SQTRGGNTGAFPTLLPESGSRGLMLSVSAANNDQIQTHSTTNSTNNNNSNVVRNPKKRSR 185

Query: 1060 ASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNS--TSTLRSG---- 899
            ASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++SPF R+R DLF S  T TLRS     
Sbjct: 186  ASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFASAATPTLRSNLNVN 245

Query: 898  ----HXXXXXXXXXXPFAQKVQ------QPPFVSSTMIDXXXXXXXXXXXXXXXXXXXXX 749
                           PFAQK+Q       PP +S+T+                       
Sbjct: 246  VNPLDPPTPPPYLLRPFAQKLQFRSLHPFPPSLSNTL----SPSTNSTTNSTSINYHQQQ 301

Query: 748  XXXNYQLPSELGLHRQSSNLQNMENPMLTFQSLLQSP-PNKYPFLAKSQAPSSID--ARL 578
                  L    GL +Q  N  N   P L      + P  N    +++ Q  SS D    L
Sbjct: 302  QQQQQNLSEHFGLMKQPHNFNN--TPSLEAYHHPKYPLGNSSVLVSRPQQQSSFDIPPSL 359

Query: 577  KVRVLEEFG-TSHGHVNAN---------------VGGLPNMGNSDGDQNHLRSFN----- 461
            K+ V EE G    GHVN +               VG L +  N+  + N+L + N     
Sbjct: 360  KMGVFEELGLRPDGHVNTDLRCLHQNMVSSTSVGVGALSSGNNN--NNNNLSNANPSTEW 417

Query: 460  ----GNYGN----------------------SQRVSSCKMNFSASSSDFHADKGSE-NVS 362
                G   N                      ++RVS+ K+++SASSSDFH +K  + +V+
Sbjct: 418  VQRTGTITNDDCDHGGGGGGGLSGTVSYSDIAERVSNGKVHYSASSSDFHGEKVPDFSVT 477

Query: 361  SRGEGMVDSWI-CSSD 317
            +R +GMV+SWI CSSD
Sbjct: 478  ARSQGMVESWINCSSD 493


>ref|XP_006580869.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-79
            specific-like [Glycine max]
          Length = 486

 Score =  144 bits (364), Expect = 1e-31
 Identities = 137/431 (31%), Positives = 184/431 (42%), Gaps = 106/431 (24%)
 Frame = -3

Query: 1291 LFDPLTSYLEVFSRSTPTPTPNLDIVWPRTQRSEPNCT---------------------- 1178
            +FDPL++YL+  ++S+ T   NLD++W +T RSE N T                      
Sbjct: 64   MFDPLSNYLDPITQSS-TSLLNLDVMWSKTGRSESNQTDLVGLIPCSSSSVPSPHNEAFV 122

Query: 1177 -----------------QXXXXXXXXXXXSAQLDRIPFPNPSTSTDPNRNTKKRSRASRR 1049
                             +           +A  D+I   N + + +  RN KKRSRASRR
Sbjct: 123  SSQTRGNNSGAFPTLPPESGSRGLMLSVSAANNDQIQTHNNNNNCNVVRNPKKRSRASRR 182

Query: 1048 APTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLFNSTS--TLRSG------HX 893
            APTTVLTTDT+NFRAMVQEFTGIPAPPF++S F R+R DLF ST+  TLRS         
Sbjct: 183  APTTVLTTDTTNFRAMVQEFTGIPAPPFTSSSFPRTRLDLFASTATPTLRSNVNVNPFDP 242

Query: 892  XXXXXXXXXPFAQKVQQ------PPFVSSTMIDXXXXXXXXXXXXXXXXXXXXXXXXNYQ 731
                     PFAQK+Q       PP  S+T+                            Q
Sbjct: 243  PTQPPYLLRPFAQKLQLRSLHPFPPSFSNTL-------PPPSTNSPTNSTSINYHQQQQQ 295

Query: 730  LPSELGLHRQSSNLQNMENPMLTFQS------LLQSPPNKYPFLAKSQAPSSIDARLKVR 569
            L    GL +Q  N  N      T ++       L +  +      + Q    I   LK+ 
Sbjct: 296  LSEHFGLAKQPFNFNNTTPDTSTLEAYHHPKYTLGNSSSVLVSRTQQQHSLEIPPNLKMG 355

Query: 568  VLEEFGTSHGHVNANVG-----------------------GLPNMGNS------------ 494
            + EE    H HVN ++G                        L N  NS            
Sbjct: 356  LYEELELRHDHVNTDLGCLHQNMVSSTSVGVGALSSDNNNNLSNATNSSTEWAQRTGTIT 415

Query: 493  DGDQNHLR-----SFNGNY---GNSQRVSSCKMNFSASSSDFHADKGSE---NVSSRGEG 347
            + D +H R     S   NY   G    V++ K+++SASSSDFH +KG +     ++R +G
Sbjct: 416  NNDCDHGRGGGALSGTVNYNDIGEGAVVTNGKVHYSASSSDFHGEKGPDFTVTTAARTQG 475

Query: 346  MVDSWI-CSSD 317
            MV+SWI CSSD
Sbjct: 476  MVESWINCSSD 486


>ref|XP_007163642.1| hypothetical protein PHAVU_001G251600g [Phaseolus vulgaris]
            gi|561037106|gb|ESW35636.1| hypothetical protein
            PHAVU_001G251600g [Phaseolus vulgaris]
          Length = 479

 Score =  143 bits (361), Expect = 3e-31
 Identities = 115/326 (35%), Positives = 160/326 (49%), Gaps = 63/326 (19%)
 Frame = -3

Query: 1105 STSTDPNRNTKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPFSASPFTRSRFDLF 926
            + +T+  RN KKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF++SPF R+R DLF
Sbjct: 174  NANTNVVRNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRTRLDLF 233

Query: 925  NSTST-----LRSGH----------XXXXXXXXXXPFAQKVQ-QPPFVS-----STMIDX 809
             S++      LRS                      PFA KVQ QP  +      S+M++ 
Sbjct: 234  ASSNASSSVLLRSASSHLEQPSSQTHTQTPSYLLRPFAHKVQAQPSSIPHNNSFSSMLNT 293

Query: 808  XXXXXXXXXXXXXXXXXXXXXXXNYQLPSELGLHRQSSNLQNMENPMLTFQSLLQSPPNK 629
                                            +H Q  +L NM NP+L+ QS+L +  + 
Sbjct: 294  LASNNNSG-------------------SGSASIHYQQHSL-NMHNPILSLQSILGNNDSS 333

Query: 628  YPFLAKSQ--------APSSIDARLKVRVLEEFGTSHGHV--------NANV------GG 515
                +K+Q         P ++D+ LK+  LEE G  H HV        N N+      G 
Sbjct: 334  VLVGSKTQQQQPSLEITPGTVDSHLKMSGLEELGLRHAHVGGHHHHHQNMNMVSSSSDGA 393

Query: 514  LPNMGNSDGDQNHLRS-FNGNYGNSQRVSSCK----------------MNFSASSSDFHA 386
            L  + N+    N++R   + ++  +QR+                    +N+ +S SDFH 
Sbjct: 394  LSRVNNNISINNNMRGPSSADWAQAQRIGGSNDGGVLRSLSGGTATGTLNYRSSVSDFHG 453

Query: 385  DKGSEN--VSSRGEGMVDSWI-CSSD 317
            +KG+ +  V++R EGMV+SWI CSSD
Sbjct: 454  EKGAPDCAVAARSEGMVESWINCSSD 479


>ref|XP_006603093.1| PREDICTED: myb-like protein A-like [Glycine max]
          Length = 454

 Score =  140 bits (353), Expect = 2e-30
 Identities = 113/310 (36%), Positives = 154/310 (49%), Gaps = 45/310 (14%)
 Frame = -3

Query: 1111 NPSTSTDPNRNTKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPF---SASPFTRS 941
            N S++    RN KKRSRASRRAPTTVLTTDT+NFRAMVQEFTGIPAPPF   S+S F R+
Sbjct: 167  NSSSTNTVVRNPKKRSRASRRAPTTVLTTDTTNFRAMVQEFTGIPAPPFTSSSSSSFPRT 226

Query: 940  RFDLFNSTSTLRSGHXXXXXXXXXXPFAQKVQQPPFVSSTMIDXXXXXXXXXXXXXXXXX 761
            R DLF S++++ S               ++ Q PP++                       
Sbjct: 227  RLDLFASSNSIASS-------SSSSIIREQTQTPPYLLRPFAHKVQAQLPSSIPPPS--- 276

Query: 760  XXXXXXXNYQLPSELGLHRQSSNLQNMENPMLTFQSLLQSPP---NKYPFLAKSQAPSSI 590
                       P  L  ++Q S L   +NP+L+FQS+LQ  P   +K    +    PS++
Sbjct: 277  ---------SFPPMLNNYQQHS-LNMQQNPILSFQSILQPQPLIGSKTQQPSLEIPPSAV 326

Query: 589  D-ARLKVRVLEEFGTSHGH----------------------VNANVGGLPNMGN------ 497
            D + LK+  LEE G S+ H                       N+N+ G P+  +      
Sbjct: 327  DSSHLKMGGLEELGLSNAHDGGHHQNFNMVSSSSDGALSRVTNSNMRGGPSSADWALSQA 386

Query: 496  ---SDGDQNHLRSFNG-----NYGNSQRVSSCKMNFSASSSDFHADKGSE-NVSSRGEGM 344
                + D   LRS  G     NY ++  VS  ++  + ++SDFH DKG E  V++R EGM
Sbjct: 387  QRIDNNDGGVLRSLGGATATLNYRSN--VSDPRVKVTNNNSDFHGDKGPECAVAARSEGM 444

Query: 343  VDSWI-CSSD 317
            V+SWI CSSD
Sbjct: 445  VESWINCSSD 454

