BLASTX nr result

ID: Akebia26_contig00013982 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00013982
         (3823 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278494.1| PREDICTED: uncharacterized protein LOC100259...   453   e-124
ref|XP_004139818.1| PREDICTED: uncharacterized protein LOC101210...   412   e-112
ref|XP_007221792.1| hypothetical protein PRUPE_ppa003948mg [Prun...   375   e-100
ref|XP_004300997.1| PREDICTED: uncharacterized protein LOC101304...   361   1e-96
emb|CBI26253.3| unnamed protein product [Vitis vinifera]              361   1e-96
ref|XP_004496723.1| PREDICTED: uncharacterized protein LOC101489...   333   4e-88
ref|XP_007034759.1| Uncharacterized protein TCM_020625 [Theobrom...   331   1e-87
ref|XP_004247355.1| PREDICTED: uncharacterized protein LOC101252...   324   2e-85
ref|XP_006360795.1| PREDICTED: uncharacterized protein LOC102579...   317   3e-83
ref|XP_007143259.1| hypothetical protein PHAVU_007G057400g [Phas...   315   8e-83
ref|XP_003556049.1| PREDICTED: uncharacterized protein LOC100805...   314   2e-82
ref|XP_002517137.1| conserved hypothetical protein [Ricinus comm...   313   4e-82
ref|XP_006826283.1| hypothetical protein AMTR_s00004p00052660 [A...   312   7e-82
ref|XP_007143258.1| hypothetical protein PHAVU_007G057400g [Phas...   301   1e-78
gb|EXB41290.1| hypothetical protein L484_004460 [Morus notabilis]     295   8e-77
ref|XP_006489387.1| PREDICTED: uncharacterized protein LOC102629...   291   2e-75
ref|XP_006406164.1| hypothetical protein EUTSA_v10020450mg [Eutr...   289   6e-75
ref|XP_003535649.1| PREDICTED: protein SUPPRESSOR OF GENE SILENC...   289   8e-75
ref|XP_006406165.1| hypothetical protein EUTSA_v10020450mg [Eutr...   284   3e-73
emb|CAN69769.1| hypothetical protein VITISV_022064 [Vitis vinifera]   279   6e-72

>ref|XP_002278494.1| PREDICTED: uncharacterized protein LOC100259596 [Vitis vinifera]
          Length = 582

 Score =  453 bits (1166), Expect = e-124
 Identities = 268/495 (54%), Positives = 313/495 (63%), Gaps = 27/495 (5%)
 Frame = +2

Query: 2420 MAGGNPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGT----NAESKPNKPSSNPKDEQ 2587
            MAGGNPK            +RK+RWESG+NP  D KSG     N+ S P  P+++PK   
Sbjct: 1    MAGGNPKASSHKPSSSSS-HRKSRWESGSNP--DKKSGDSKPPNSSSTPKTPNNDPKQAP 57

Query: 2588 SQGSGPS--KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFH 2761
            +  SG S  KP +D                    L DP                  YGFH
Sbjct: 58   ASTSGSSHPKPPAD-SVPTSAAAPVRPPVAGAPFLPDPTT--------FGPPPAPQYGFH 108

Query: 2762 MLDRRTIALADGSVRSYFALPPDYQDFPPHG-RPFDHSERFFPFGHGGREPEPGGMGFGF 2938
            ML+RRTI LADGSVRSYFAL PDYQDFPP   R  D + RF P G G   PEP G G G 
Sbjct: 109  MLERRTIVLADGSVRSYFALSPDYQDFPPPPPRAMDPAGRFLPMGPG-HGPEPVGPGLG- 166

Query: 2939 DKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDER 3118
               FP  G +SPEGFR +RD+ + RG    DYWNSLGLDGR     EGS+KRKY + DER
Sbjct: 167  --RFPLTGPMSPEGFRGERDDPYSRGRH-QDYWNSLGLDGRG--HPEGSMKRKYSEEDER 221

Query: 3119 DVR---------DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHP 3271
            D R         DEFARQRQQLL Y                DR+ YL+G  SSPF R   
Sbjct: 222  DRREDRDRRDGNDEFARQRQQLLQYGNPSLNPNGYPLGG--DRSEYLAGP-SSPFRRG-V 277

Query: 3272 MDSARRIDEIRSSKHIRVGGDY---------GVDVSLKHPDVDQQALKKAFLRFVKSLNE 3424
            MD  R  DE+RSSK++R+GG Y         G +V LKH +VDQ ALKKAF++FVK +NE
Sbjct: 278  MDPIRG-DELRSSKYMRIGGGYEGFSRQGGVGDNVGLKHHNVDQNALKKAFIQFVKLINE 336

Query: 3425 NLAQKKNYLEDGKYGSLQCLACGRASKEFPDVHGLIMHAYNSHS--ADLHVDHLGLHKAL 3598
            + +Q++ YLEDGK G L+CLACGR+SK+FPD+H L+MH YNS+S  A+L VDHLGLHKAL
Sbjct: 337  SASQRRLYLEDGKQGPLRCLACGRSSKDFPDMHALVMHTYNSNSDNANLLVDHLGLHKAL 396

Query: 3599 CVLMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGN 3778
            CVL+GWNY+  PDNSK YQ +SADEAAAN DDLIMWPP VIIHNT SG+ KDGRMEG+GN
Sbjct: 397  CVLLGWNYSMPPDNSKTYQFLSADEAAANQDDLIMWPPTVIIHNTVSGKGKDGRMEGLGN 456

Query: 3779 KVMDNKLKDLGFGGG 3823
            K MDNKL+DLGFGGG
Sbjct: 457  KAMDNKLRDLGFGGG 471


>ref|XP_004139818.1| PREDICTED: uncharacterized protein LOC101210911 [Cucumis sativus]
            gi|449492576|ref|XP_004159037.1| PREDICTED:
            uncharacterized LOC101210911 [Cucumis sativus]
          Length = 564

 Score =  412 bits (1058), Expect = e-112
 Identities = 234/483 (48%), Positives = 288/483 (59%), Gaps = 15/483 (3%)
 Frame = +2

Query: 2420 MAGG---NPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGTNAESKPNKPSSNPKDEQS 2590
            MAGG   N              +RK+RWES +N  P      +  SKP+ PSS       
Sbjct: 1    MAGGSNTNKSSQKPSSSSAAASHRKSRWESSSNNPPSLPKSDSKSSKPHHPSSK-SGISP 59

Query: 2591 QGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLD 2770
              + P  P   P                   L  P++S               YGFHML+
Sbjct: 60   NSTHPKHPTDKPLNPTPASAPLPSPGLP---LPFPDLSALGPPPPPS------YGFHMLE 110

Query: 2771 RRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFDKHF 2950
            RRTI LADGSVRSYFALP DY +F P  R  D + RF P G      E GG    FD  F
Sbjct: 111  RRTIVLADGSVRSYFALPLDYHEFTPPARSMDLAARFLPMGAAASGHEYGG----FDHRF 166

Query: 2951 PPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVRD 3130
            PPGG +SP+ FR  R+E FGRG  P D+WNS G D R   + + S+KRK+ D  E+D +D
Sbjct: 167  PPGGPMSPDEFRGAREEQFGRGR-PQDHWNSRGTDERGGPA-DSSMKRKFNDDSEKDRKD 224

Query: 3131 E---FARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEI 3301
            E    +R++QQLLH                  R ++L+GT+          D   R ++ 
Sbjct: 225  EKDDLSRRQQQLLHNGNPNGFLTGSGER----RGDFLAGTS----------DPYGRTEDT 270

Query: 3302 RSSKHIRVGGDY---------GVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLE 3454
            R SK++R GG Y         G  V+ K+ +VDQ AL+KAFL FVK++NEN  QKKNYLE
Sbjct: 271  RFSKYMRAGGSYENEGLRLGNGNSVAPKYLEVDQSALRKAFLHFVKTINENANQKKNYLE 330

Query: 3455 DGKYGSLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVP 3634
            DGK+G LQCLAC R+S++FPD+HGLIMH YNS SAD  VDHLGLHKALCVLMGWNY+K P
Sbjct: 331  DGKHGRLQCLACARSSRDFPDMHGLIMHTYNSESADSQVDHLGLHKALCVLMGWNYSKPP 390

Query: 3635 DNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGF 3814
            DNS+ Y+ +SADEAAAN +DLIMWPPLVIIHNT +G+SKDGRMEG+GNK MD+K++DLGF
Sbjct: 391  DNSRGYRFLSADEAAANQEDLIMWPPLVIIHNTITGKSKDGRMEGLGNKAMDSKIRDLGF 450

Query: 3815 GGG 3823
            GGG
Sbjct: 451  GGG 453


>ref|XP_007221792.1| hypothetical protein PRUPE_ppa003948mg [Prunus persica]
            gi|462418728|gb|EMJ22991.1| hypothetical protein
            PRUPE_ppa003948mg [Prunus persica]
          Length = 539

 Score =  375 bits (962), Expect = e-100
 Identities = 224/470 (47%), Positives = 266/470 (56%), Gaps = 21/470 (4%)
 Frame = +2

Query: 2477 NRKTRWESGNNPQPDHKSGTN----AESKPNKPSSNPKDEQSQGSGPSKPVSDPKTXXXX 2644
            NRK+RWES  NP     + T     ++ KP KP+S P  +    S PS P   P      
Sbjct: 22   NRKSRWESSPNPAAAATAITTKNNPSDPKPAKPNSGPSPKPGATSTPSHPKHPPSAPSPG 81

Query: 2645 XXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRSYFALP 2824
                            P V                YGFHML+RRT  LADGSVRSYFALP
Sbjct: 82   PAPFPFPDPAAFGPPPPPV----------------YGFHMLERRTFVLADGSVRSYFALP 125

Query: 2825 PDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFDKHFPPGGRLSPEGFRRDRDEA 3004
            PDYQ+FPP   P D S RF PFG GG                PPG               
Sbjct: 126  PDYQEFPP---PMDPSGRFLPFGPGG----------------PPGP-------------- 152

Query: 3005 FGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDG-DERDVRDEFARQRQQLLHYXXXXX 3181
                 GP DYWNSLGLDGR P   EG  KRKY +  D+RD   EF  +R Q + +     
Sbjct: 153  -----GP-DYWNSLGLDGRGPA--EGPAKRKYAEEEDQRDKAGEFGMRRPQFMQHANPNG 204

Query: 3182 XXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIRSSKHIRVGGDY-------- 3337
                        R  +L+  TSSPF R+   D  R  +E R++K++R+GG          
Sbjct: 205  FPVGPG-----SRGEFLA-ETSSPFRRE-AADQGRGGEEARANKYMRIGGGGYESAGFRL 257

Query: 3338 --------GVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACG 3493
                    G +V  KH  VDQ ALKKAFL +VK ++EN  Q+K YLEDGK G L CLAC 
Sbjct: 258  GGGGGGGGGENVVHKHVQVDQSALKKAFLNYVKLIHENTQQRKIYLEDGKNGRLHCLACA 317

Query: 3494 RASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADE 3673
            R+SK+FPD+H LIMH+YNS +ADL VDHLGLHKALCVLMGW+Y K PDNSKAYQ +SA+E
Sbjct: 318  RSSKDFPDMHSLIMHSYNSDNADLRVDHLGLHKALCVLMGWDYLKPPDNSKAYQFLSAEE 377

Query: 3674 AAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            AAAN DDLIMWPP+VIIHNT +G+SKDGRMEG+GNK MD+ ++DLGFG G
Sbjct: 378  AAANVDDLIMWPPVVIIHNTVTGKSKDGRMEGLGNKAMDSIIRDLGFGSG 427


>ref|XP_004300997.1| PREDICTED: uncharacterized protein LOC101304679 [Fragaria vesca
            subsp. vesca]
          Length = 529

 Score =  361 bits (927), Expect = 1e-96
 Identities = 228/506 (45%), Positives = 269/506 (53%), Gaps = 38/506 (7%)
 Frame = +2

Query: 2420 MAGGN-PKGXXXXXXXXXXX--NRKTRWESG------------NNPQPDHKSGTNAESKP 2554
            MAGGN PKG             NRK+RWES             N   PD K  T    KP
Sbjct: 1    MAGGNHPKGPPHKPSSSSSAASNRKSRWESSPSTNNKNNQNHRNKNPPDPKPATGPSPKP 60

Query: 2555 NKPSS--NPKDEQ--SQGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXX 2722
             K +S  NPK     S G+ P  P  DP +                    P V       
Sbjct: 61   GKTASPANPKHPPAPSPGAAPPFPFPDPSSFGPPP---------------PPV------- 98

Query: 2723 XXXXXXXXXYGFHMLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGG 2902
                     YGFH L+RRTI LADG+VRSYFALPPDYQDFPP                  
Sbjct: 99   ---------YGFHNLERRTIVLADGTVRSYFALPPDYQDFPP------------------ 131

Query: 2903 REPEPGGMGFGFDKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEG 3082
                          H  P GR  P          FG GG   DYWNSLG+DGR    +  
Sbjct: 132  -------------PHMDPSGRFLP----------FGPGGPAPDYWNSLGIDGRGGPQEGS 168

Query: 3083 SLKRKYGDGDE-RDVRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFH 3259
            S+KRK+G+ +E RD  +E A++RQQL+                     N      SSPF 
Sbjct: 169  SMKRKFGEEEEHRDKGEELAKRRQQLVQLG----------------NPNGFPAGPSSPFR 212

Query: 3260 RDHPMDSARRIDEIRSSKHIRVGGDY---------------GVDVSLKHPDVDQQALKKA 3394
            R+    S R  D+ R+SK +R GG +               G +V  K+  VDQ ALKKA
Sbjct: 213  REMGAQS-RSGDDPRASKFMRTGGGFENVGFRQSGGSGGGGGDNVGHKYLQVDQAALKKA 271

Query: 3395 FLRFVKSLNENLAQKKNYLEDGKYGSLQCLACG---RASKEFPDVHGLIMHAYNSHSADL 3565
            FL F K +NEN AQKK Y+EDGK G L CLACG   R++K+FPD+H LIMH+YN+ +AD+
Sbjct: 272  FLYFAKVINENGAQKKIYIEDGKQGRLNCLACGTTGRSAKDFPDMHSLIMHSYNTDNADI 331

Query: 3566 HVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGR 3745
             VDHLGLHKALCVLMGWNY K PDNSKAYQ +SADEAAAN DDLIMWPP+VIIHNT +G+
Sbjct: 332  RVDHLGLHKALCVLMGWNYLKPPDNSKAYQFLSADEAAANQDDLIMWPPMVIIHNTLTGK 391

Query: 3746 SKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            SKDGRMEG+GNK MD+ ++ LGFG G
Sbjct: 392  SKDGRMEGLGNKAMDSYIRALGFGSG 417


>emb|CBI26253.3| unnamed protein product [Vitis vinifera]
          Length = 507

 Score =  361 bits (927), Expect = 1e-96
 Identities = 228/485 (47%), Positives = 267/485 (55%), Gaps = 17/485 (3%)
 Frame = +2

Query: 2420 MAGGNPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGT----NAESKPNKPSSNPKDEQ 2587
            MAGGNPK            +RK+RWESG+NP  D KSG     N+ S P  P+++PK   
Sbjct: 1    MAGGNPKASSHKPSSSSS-HRKSRWESGSNP--DKKSGDSKPPNSSSTPKTPNNDPKQAP 57

Query: 2588 SQGSGPS--KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFH 2761
            +  SG S  KP +D                    L DP                  YGFH
Sbjct: 58   ASTSGSSHPKPPAD-SVPTSAAAPVRPPVAGAPFLPDPTT--------FGPPPAPQYGFH 108

Query: 2762 MLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFD 2941
            ML+RRTI LADGSVRSYFAL PDYQDFPP                    P P  M     
Sbjct: 109  MLERRTIVLADGSVRSYFALSPDYQDFPP--------------------PPPRAMD---- 144

Query: 2942 KHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERD 3121
                P GR  P           G G GP                                
Sbjct: 145  ----PAGRFLP----------MGPGHGP-------------------------------- 158

Query: 3122 VRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEI 3301
              + FARQRQQLL Y                DR+ YL+G  SSPF R   MD  R  DE+
Sbjct: 159  --EPFARQRQQLLQYGNPSLNPNGYPLGG--DRSEYLAGP-SSPFRRG-VMDPIRG-DEL 211

Query: 3302 RSSKHIRVGGDY---------GVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLE 3454
            RSSK++R+GG Y         G +V LKH +VDQ ALKKAF++FVK +NE+ +Q++ YLE
Sbjct: 212  RSSKYMRIGGGYEGFSRQGGVGDNVGLKHHNVDQNALKKAFIQFVKLINESASQRRLYLE 271

Query: 3455 DGKYGSLQCLACGRASKEFPDVHGLIMHAYNSHS--ADLHVDHLGLHKALCVLMGWNYAK 3628
            DGK G L+CLACGR+SK+FPD+H L+MH YNS+S  A+L VDHLGLHKALCVL+GWNY+ 
Sbjct: 272  DGKQGPLRCLACGRSSKDFPDMHALVMHTYNSNSDNANLLVDHLGLHKALCVLLGWNYSM 331

Query: 3629 VPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDL 3808
             PDNSK YQ +SADEAAAN DDLIMWPP VIIHNT SG+ KDGRMEG+GNK MDNKL+DL
Sbjct: 332  PPDNSKTYQFLSADEAAANQDDLIMWPPTVIIHNTVSGKGKDGRMEGLGNKAMDNKLRDL 391

Query: 3809 GFGGG 3823
            GFGGG
Sbjct: 392  GFGGG 396


>ref|XP_004496723.1| PREDICTED: uncharacterized protein LOC101489729 [Cicer arietinum]
          Length = 491

 Score =  333 bits (854), Expect = 4e-88
 Identities = 213/483 (44%), Positives = 261/483 (54%), Gaps = 15/483 (3%)
 Frame = +2

Query: 2420 MAGGN-PKGXXXXXXXXXXXNRKTRWESGNNPQPDH-KSGTNAESKPN--KPSSNPKDEQ 2587
            MAGGN PK            +RKTRWES  +  P + KS ++ +SKPN   P+SNP  + 
Sbjct: 1    MAGGNHPKSSSSS-------HRKTRWESNTSATPTNTKSPSDPKSKPNHNNPNSNPNQKP 53

Query: 2588 SQGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHML 2767
            +    P +  +D                      +P                  YGFHML
Sbjct: 54   NPNPSPKQHPNDHPALIPFQ------------FPEPG-----------PPPPPAYGFHML 90

Query: 2768 DRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFDKH 2947
            +RRTI LADGSVRSYFALPPDYQDF P  RP D                       F+  
Sbjct: 91   ERRTIILADGSVRSYFALPPDYQDFAPPPRPLDR----------------------FNMR 128

Query: 2948 FPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVR 3127
            FPP  R                     DY N +         +  S KRKYG+    + R
Sbjct: 129  FPPVVRHP-------------------DYQNPM---------EASSAKRKYGE----EGR 156

Query: 3128 DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTT-----SSPFHRDHPMDSARRI 3292
            DEFARQR+QLL                    AN + G       S P  RD  M+S    
Sbjct: 157  DEFARQREQLLRNANGF--------------ANRVPGGEFPVGPSGPLKRDM-MESI--- 198

Query: 3293 DEIRSSKHIRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGS 3472
             ++R SKH RV G   V+ + +H  V Q ALKKAFL+FV+ +N+N   KK++LEDGK G 
Sbjct: 199  -DLRPSKHSRVDGVGSVNNNARHVQVAQDALKKAFLQFVRLINDNTLLKKSFLEDGKQGR 257

Query: 3473 LQCLACG------RASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVP 3634
            LQC+ACG      R++K+F D+H LIMH YNS +ADL   HLGLHKALCVLMGWNY+K P
Sbjct: 258  LQCVACGSAGGSNRSAKDFSDMHALIMHTYNSDNADLSAGHLGLHKALCVLMGWNYSKPP 317

Query: 3635 DNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGF 3814
            DNSKAYQ +SADEA AN DDLIMWPPLVI+HNTN+G+S+DGRMEG+GNK MDNK+++LGF
Sbjct: 318  DNSKAYQFLSADEAEANQDDLIMWPPLVIVHNTNTGKSRDGRMEGLGNKWMDNKIRELGF 377

Query: 3815 GGG 3823
             GG
Sbjct: 378  AGG 380


>ref|XP_007034759.1| Uncharacterized protein TCM_020625 [Theobroma cacao]
            gi|508713788|gb|EOY05685.1| Uncharacterized protein
            TCM_020625 [Theobroma cacao]
          Length = 496

 Score =  331 bits (849), Expect = 1e-87
 Identities = 219/486 (45%), Positives = 264/486 (54%), Gaps = 18/486 (3%)
 Frame = +2

Query: 2420 MAGGNPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGTNAESKPNKPSSNPKDEQSQGS 2599
            MAG NP             +RK+RWES +             S PNK  S+ K + S  +
Sbjct: 1    MAGPNPP---KQPSSSSNNHRKSRWESSS-------------SIPNKNPSSTKPKPSPKT 44

Query: 2600 GPS-KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXX----YGFHM 2764
            GPS  P +  K+                  SDPN +                   YGFHM
Sbjct: 45   GPSPSPATQNKSQ-----------------SDPNPALPPIPFPDPAALGPPPPPAYGFHM 87

Query: 2765 LDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFDK 2944
            L+RRTI L DGSVRSYFALP DYQ+FP   RP                            
Sbjct: 88   LERRTIVLYDGSVRSYFALPSDYQEFPT--RPL--------------------------- 118

Query: 2945 HFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDV 3124
              PP     P GFR +R           DYWN        P    G  KRKYG+ +E+D+
Sbjct: 119  LVPPDFGSPPLGFRDNR-----------DYWNG-------PGEGPGLFKRKYGE-EEKDL 159

Query: 3125 R----DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRI 3292
            R    +EFARQR                      DR   L+G TSSPF          R 
Sbjct: 160  REEKKEEFARQRH------GHPNAKVYSSGPGWPDR---LAG-TSSPF----------RN 199

Query: 3293 DEIRSSKHIRVGGDY---GVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGK 3463
            +E+R++K++RVGG +    +  + KH +VDQ ALKKAFL FVK++ EN AQKKNYLEDGK
Sbjct: 200  EEMRAAKYMRVGGGFENNNLGFNNKHLEVDQNALKKAFLHFVKAVFENAAQKKNYLEDGK 259

Query: 3464 YGSLQCLACG------RASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYA 3625
             G LQCLACG      R+SK+FPD+HGLIMH Y S +ADL VDHLGLHKALCVLMGWNY+
Sbjct: 260  QGRLQCLACGRFDDKFRSSKDFPDMHGLIMHTYYSDNADLRVDHLGLHKALCVLMGWNYS 319

Query: 3626 KVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKD 3805
            K PDNSK Y+ + ADEAAAN +DLIMWPP+VI+HNT +G+SKDGRMEG+GNK MD+KL+D
Sbjct: 320  KPPDNSKVYRFLPADEAAANQEDLIMWPPVVIVHNTITGKSKDGRMEGLGNKAMDSKLRD 379

Query: 3806 LGFGGG 3823
            LGFG G
Sbjct: 380  LGFGSG 385


>ref|XP_004247355.1| PREDICTED: uncharacterized protein LOC101252627 [Solanum
            lycopersicum]
          Length = 512

 Score =  324 bits (830), Expect = 2e-85
 Identities = 203/478 (42%), Positives = 251/478 (52%), Gaps = 10/478 (2%)
 Frame = +2

Query: 2420 MAGGNPKGXXXXXXXXXXXNRKTRWES--GNNPQPDHKS---GTNAESKPNKPSSNPKDE 2584
            MAGGNP             +RK+RWES  G  P  D K+   G  A S    P S P  +
Sbjct: 1    MAGGNPPKPSSNKPAPSASHRKSRWESTTGKKPSSDPKTSVAGAGAASGSGDPKSKPSPK 60

Query: 2585 QSQGSGPS----KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXX- 2749
             +    P+    KP+S P                     DPN                  
Sbjct: 61   PTNPIQPTTPNPKPISKPSPKP-----------------DPNAHFGLPPFPFRDPPPPPL 103

Query: 2750 YGFHMLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMG 2929
            YGFHML+RRTI LADGSVRSYFALP DYQDFP   RP                   G  G
Sbjct: 104  YGFHMLERRTIVLADGSVRSYFALPHDYQDFPAFPRPDFR----------------GPPG 147

Query: 2930 FGFDKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDG 3109
             GF++ FP       +GF R+R+          D+WN LG++G      +G++KRK+GD 
Sbjct: 148  LGFERQFPD------DGFMRNRNP---------DHWNPLGVEGGRV--GDGAMKRKFGD- 189

Query: 3110 DERDVRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARR 3289
               + +D   R RQQ+L +                       G++SS   R   M+    
Sbjct: 190  ---EGKDGLDRLRQQVLEHGNAGPVPP---------------GSSSSYMGRGEEMN---- 227

Query: 3290 IDEIRSSKHIRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYG 3469
                R  K++R GG  G     KH +VDQ ALKK+FL  VK + +    K++YL DGK G
Sbjct: 228  ----RPPKYMRSGGFEGRASRTKHNEVDQSALKKSFLPMVKLIFDTANVKRSYLADGKQG 283

Query: 3470 SLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKA 3649
             LQCLAC R SK+FPD+H LIMHAYN  SAD  VDHL  HKALCVLMGWNY   PD+SK+
Sbjct: 284  RLQCLACNRTSKDFPDMHSLIMHAYNPDSADSLVDHLAFHKALCVLMGWNYLTPPDHSKS 343

Query: 3650 YQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            YQ++SADEA AN DDL++WPPLVIIHNT +G+  DGRMEG+GNK MD+ LK +GF GG
Sbjct: 344  YQMLSADEATANRDDLVLWPPLVIIHNTITGKRDDGRMEGLGNKAMDSYLKGIGFHGG 401


>ref|XP_006360795.1| PREDICTED: uncharacterized protein LOC102579696 [Solanum tuberosum]
          Length = 513

 Score =  317 bits (812), Expect = 3e-83
 Identities = 199/479 (41%), Positives = 252/479 (52%), Gaps = 11/479 (2%)
 Frame = +2

Query: 2420 MAGGNP---KGXXXXXXXXXXXNRKTRWES--GNNPQPDHKSGT-NAESKPNKPSSNPKD 2581
            MAGGNP                +RK+RWES  G  P  D K+    A S    P S P  
Sbjct: 1    MAGGNPPKPSSSKPAPSSASASHRKSRWESTTGKKPSSDPKTSVAGAASGSGDPKSKPSP 60

Query: 2582 EQSQGSGPS----KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXX 2749
            + +  + P+    KP+ +P                     DPN                 
Sbjct: 61   KTTNPNHPTTPNPKPIKNPSPKP-----------------DPNAHFGLPPFPFRDPPPPP 103

Query: 2750 -YGFHMLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGM 2926
             YGFHML+RRTI LADGSVRSYFALP DYQDFP   RP                   G  
Sbjct: 104  LYGFHMLERRTIVLADGSVRSYFALPHDYQDFPAFTRPDFR----------------GPP 147

Query: 2927 GFGFDKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGD 3106
            G GF++ FP       +GF R+R+          D+WN +G++G      +G++KRK+GD
Sbjct: 148  GLGFERQFPD------DGFMRNRNP---------DHWNPIGVEGGRV--GDGAMKRKFGD 190

Query: 3107 GDERDVRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSAR 3286
                + +D   R RQQ+L +                       G++S    R   M+   
Sbjct: 191  ----EGKDGLDRLRQQVLEHGNAGPVPP---------------GSSSLYMGRGEEMN--- 228

Query: 3287 RIDEIRSSKHIRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKY 3466
                 R +K++R GG  G     KH +VDQ ALKK+FL  VK + +    K++YL DGK 
Sbjct: 229  -----RPAKYMRSGGFEGSASRTKHNEVDQSALKKSFLLMVKLIFDTANVKRSYLADGKQ 283

Query: 3467 GSLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSK 3646
            G LQCLAC R SK+FPD+H LIMHAYNS SAD  VDHL  HKALCVLMGW+Y   PD+SK
Sbjct: 284  GRLQCLACNRTSKDFPDMHSLIMHAYNSESADSLVDHLAFHKALCVLMGWSYLTPPDHSK 343

Query: 3647 AYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            +YQ++SADEA AN DDL++WPPLVIIHNT +G+  DGRMEG+GNK MD+ LK +GF GG
Sbjct: 344  SYQMLSADEATANRDDLVLWPPLVIIHNTITGKRDDGRMEGLGNKAMDSYLKGIGFHGG 402


>ref|XP_007143259.1| hypothetical protein PHAVU_007G057400g [Phaseolus vulgaris]
            gi|561016449|gb|ESW15253.1| hypothetical protein
            PHAVU_007G057400g [Phaseolus vulgaris]
          Length = 478

 Score =  315 bits (808), Expect = 8e-83
 Identities = 201/456 (44%), Positives = 242/456 (53%), Gaps = 7/456 (1%)
 Frame = +2

Query: 2477 NRKTRWESGNNPQPDHKSGTNAESKPNKPSSNPKDEQSQGSGPSKPVSDPKTXXXXXXXX 2656
            +RK+RWE  N+  P  K  +N    P  PS +P         PS P+  P          
Sbjct: 27   HRKSRWEP-NSSSPKPKPNSNPNPSPKHPSDHPSLLPFPFPDPS-PLGPPPPPA------ 78

Query: 2657 XXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRSYFALPPDYQ 2836
                                           YGFHML+RRTI LADGSVRSYFALP DYQ
Sbjct: 79   -------------------------------YGFHMLERRTIVLADGSVRSYFALPLDYQ 107

Query: 2837 DFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFDKHFPPGGRLSPEGFRRDRDEAFGRG 3016
            DF P  RP D   RF         P P                LSP  FR          
Sbjct: 108  DFAP--RPLDFLHRF---------PPP----------------LSPGRFRL--------- 131

Query: 3017 GGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVRDEFARQRQQLLHYXXXXXXXXXX 3196
                            P    G+ KRKYGD D    RD+ ARQR+QLL            
Sbjct: 132  ----------------PDFPPGASKRKYGDDDGS--RDDLARQREQLLR----------- 162

Query: 3197 XXXXXXDRANYLSGTTSSPFHRDHPMDSARRID-----EIRSSKHIRVGGDYGVDVSLKH 3361
                    AN LS  +   F       +  + +     E+R SKH R  G    + S +H
Sbjct: 163  -------NANGLSRISGGEFSAGPSGGTPLKRELVDPPEMRPSKHSRHDG---ANFS-RH 211

Query: 3362 PDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACG--RASKEFPDVHGLIM 3535
              VDQ ALK+AF+ F K +N+N++QK++YLEDGK G L CLACG  R++K+FPD+H LIM
Sbjct: 212  SQVDQDALKRAFVNFAKLINDNVSQKRSYLEDGKQGRLHCLACGTGRSAKDFPDMHSLIM 271

Query: 3536 HAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPL 3715
            H YNS +AD  VDHLGLHKALCVLMGWNY+K PDNSKAYQ +S+DEAAAN DDLIMWPPL
Sbjct: 272  HTYNSDNADSQVDHLGLHKALCVLMGWNYSKPPDNSKAYQFLSSDEAAANQDDLIMWPPL 331

Query: 3716 VIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            VIIHNTN+G+++DGRMEG+GNK MDNK+++LGF GG
Sbjct: 332  VIIHNTNTGKNRDGRMEGLGNKTMDNKIRELGFMGG 367


>ref|XP_003556049.1| PREDICTED: uncharacterized protein LOC100805242 [Glycine max]
          Length = 475

 Score =  314 bits (805), Expect = 2e-82
 Identities = 209/477 (43%), Positives = 245/477 (51%), Gaps = 9/477 (1%)
 Frame = +2

Query: 2420 MAGGN-PKGXXXXXXXXXXXNRKTRWESGNNPQP----DHKSGTNAESKPNKPSSNPKDE 2584
            M GGN PK            +RK+RWE  ++       D KS T    KPN P+SNP   
Sbjct: 1    MVGGNHPKSSHHKKPPPSASHRKSRWEPNSSSSAKSPADPKSSTAPSPKPN-PNSNPNPS 59

Query: 2585 QSQGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHM 2764
                  P  P  DP                   L  P                  YGFHM
Sbjct: 60   PKHLPFPF-PFPDPAPAP---------------LGTP--------------PPPAYGFHM 89

Query: 2765 LDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFDK 2944
            L+RRTI LADGSVRSYFALP DYQDF P  RP D   RF         P P         
Sbjct: 90   LERRTIVLADGSVRSYFALPSDYQDFAP--RPLDLPPRF---------PPP--------- 129

Query: 2945 HFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDV 3124
                   LSP  FR              DY ++         +   + KRKYGD D    
Sbjct: 130  -------LSPGRFRLP------------DYSHA---------AAAAAAKRKYGD-DNGGP 160

Query: 3125 RDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIR 3304
            RD+ ARQR+QLL                    AN LS    S               E+R
Sbjct: 161  RDDLARQREQLLR------------------NANGLSREQFS-----------AGPSELR 191

Query: 3305 SSKHIRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCL 3484
             SKH R+ G      S +H  VDQ ALKKAF  F K ++EN +QK+ YLEDGK G L CL
Sbjct: 192  PSKHSRLDGSN----STRHSQVDQDALKKAFCNFAKLISENASQKRTYLEDGKQGRLHCL 247

Query: 3485 AC----GRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAY 3652
             C    GR++K+FPD+H LIMH YN  +AD  +DHLGLHKALCVLM WNY+K PDNSKAY
Sbjct: 248  VCGTGTGRSAKDFPDMHALIMHTYNPDNADSRIDHLGLHKALCVLMRWNYSKPPDNSKAY 307

Query: 3653 QLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            Q + ADEAAAN DDLIMWPPLVIIHNTN+G+++DGRMEG+GNK+MDNK+++LGF GG
Sbjct: 308  QFLPADEAAANQDDLIMWPPLVIIHNTNTGKNRDGRMEGLGNKMMDNKIRELGFVGG 364


>ref|XP_002517137.1| conserved hypothetical protein [Ricinus communis]
            gi|223543772|gb|EEF45300.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 505

 Score =  313 bits (802), Expect = 4e-82
 Identities = 193/452 (42%), Positives = 242/452 (53%), Gaps = 3/452 (0%)
 Frame = +2

Query: 2477 NRKTRWESGNNPQPDHKSGTNAESKPNKPSSNPKDEQSQGSGPSKPVSDPKTXXXXXXXX 2656
            +RK+RWES +   P   S +N ++K    +SNP  +    +  +     P T        
Sbjct: 20   HRKSRWESSSTNNPTSDSKSNHQTKQPPSNSNPSPKPLTNNNNNTNNRTPATPSNSSLPP 79

Query: 2657 XXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRSYFALPPDYQ 2836
                    +   P                  YGFHML+RRTIALADGSVRSYFALPPDYQ
Sbjct: 80   GSTLPFHDLAPPP-------PPVPPPPPPPTYGFHMLERRTIALADGSVRSYFALPPDYQ 132

Query: 2837 DFPPHGRPFDHSERFFPFGHGGREPE-PGGMGFGFDKHFPPGGRLSPEGFR-RDRDEAFG 3010
            DFP   RP     RF P G     P+ PGG        FPP   +SP+G   RD ++   
Sbjct: 133  DFPL--RP---PLRFPPLGPN---PDFPGG------PRFPP---MSPQGLGFRDHNQN-- 173

Query: 3011 RGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVRDEFARQRQQLLHYXXXXXXXX 3190
                                      KRK+G G E      F+R                
Sbjct: 174  --------------------------KRKFGGGGE------FSRYGNN----------NN 191

Query: 3191 XXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIRSSKHIRVG-GDYGVDVSLKHPD 3367
                    D+    + T+SSPF R          D+ R++KH+R G  D  ++ + KHP+
Sbjct: 192  ITNGSYHPDQLMAGTSTSSSPFRRSFG-------DDFRAAKHMRFGDNDLNINNN-KHPE 243

Query: 3368 VDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACGRASKEFPDVHGLIMHAYN 3547
            VD   L KAFL F K +NE  A +K YLE+GK G L CL CGR+SK+FPD H L+MH YN
Sbjct: 244  VDHIKLNKAFLHFTKLINETEADRKRYLENGKQGRLMCLVCGRSSKDFPDTHALVMHTYN 303

Query: 3548 SHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIH 3727
            S +ADL VDHLGLHKALC+LMGWNY+K PDN+K YQL+ AD AA N DDL+MWPP+VIIH
Sbjct: 304  SDNADLRVDHLGLHKALCILMGWNYSKPPDNAKVYQLLPADVAATNQDDLVMWPPMVIIH 363

Query: 3728 NTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            NT +G+ KDGR+EG+GNK MDNK++DLGF GG
Sbjct: 364  NTVTGKGKDGRIEGLGNKAMDNKIRDLGFSGG 395


>ref|XP_006826283.1| hypothetical protein AMTR_s00004p00052660 [Amborella trichopoda]
            gi|548830597|gb|ERM93520.1| hypothetical protein
            AMTR_s00004p00052660 [Amborella trichopoda]
          Length = 575

 Score =  312 bits (800), Expect = 7e-82
 Identities = 214/493 (43%), Positives = 257/493 (52%), Gaps = 44/493 (8%)
 Frame = +2

Query: 2477 NRKTRWESGNNP----QPDHKSGTNAESKPNKPSSNPKDEQSQGSGPSKPVSDPKTXXXX 2644
            +RK+RW++  +P    Q D K     E +   PS  PK   +    P  PV +P      
Sbjct: 17   HRKSRWDNSKSPADGPQSDRKKAPAREEEG--PSPKPKPNLNANPNPPPPVPEPSFPVPP 74

Query: 2645 XXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRSYFALP 2824
                          ++PN+                YGFHML+RRTI LADGSVRSYFALP
Sbjct: 75   T-------------NEPNIG---------------YGFHMLERRTIVLADGSVRSYFALP 106

Query: 2825 PDYQ-DFPP---HGRPFDHS----ERFFPFGHGGREP------EPGGMGFGFDKHFPPGG 2962
            PD   DFP    H  P D +    ER  P    G +P          MG  FD H P   
Sbjct: 107  PDPNPDFPNLDLHRFPPDRATLGLERRGPIEPEGFDPGFPRRESNLSMGRAFDFHGPLEN 166

Query: 2963 -RLSPEGFRRDRDEAFGRGG--GPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDV--- 3124
             R  PE FR   +   G     GP +      L G      E S+KRKY + + R++   
Sbjct: 167  LRGPPENFRGPPENLRGPENLRGPPE-----NLHG----PHENSIKRKYVEEEGRELGFS 217

Query: 3125 -----------RDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYL--SGTTSS----- 3250
                        DE +R R QLL Y                +  + L  SG  S      
Sbjct: 218  REAGPFPGHLQSDELSRHRHQLLQYGNPNPMFDGFQASRLPESGSPLPESGRVSEDMRSL 277

Query: 3251 --PFHRDHPMDSARRIDEIRSSKHIRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNE 3424
              P + D  + SA+      S+K+ R        V  + PDV+Q AL+KAFLRFVK+LNE
Sbjct: 278  KLPRYDDKRVGSAKA-----SAKNAR---PCEAVVLKRLPDVNQDALQKAFLRFVKTLNE 329

Query: 3425 NLAQKKNYLEDGKYGSLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCV 3604
            N +QKKNYLEDGK GSL CL CGR SKEF DVH LIMHAY+  + D+  DHL  HKALCV
Sbjct: 330  NPSQKKNYLEDGKSGSLHCLVCGRNSKEFSDVHSLIMHAYHMQNVDVRTDHLAFHKALCV 389

Query: 3605 LMGWNYAKVPDNSKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKV 3784
            LMGWNYAKVP+NSKAYQ  S DEA AN +D I+WPP+VIIHNTN GR KDGR+EGMGNK 
Sbjct: 390  LMGWNYAKVPENSKAYQTFSTDEATANKEDHIIWPPIVIIHNTNYGRRKDGRIEGMGNKE 449

Query: 3785 MDNKLKDLGFGGG 3823
            MD KLK+LGFGGG
Sbjct: 450  MDTKLKELGFGGG 462


>ref|XP_007143258.1| hypothetical protein PHAVU_007G057400g [Phaseolus vulgaris]
            gi|561016448|gb|ESW15252.1| hypothetical protein
            PHAVU_007G057400g [Phaseolus vulgaris]
          Length = 396

 Score =  301 bits (772), Expect = 1e-78
 Identities = 180/361 (49%), Positives = 215/361 (59%), Gaps = 7/361 (1%)
 Frame = +2

Query: 2762 MLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFD 2941
            ML+RRTI LADGSVRSYFALP DYQDF P  RP D   RF         P P        
Sbjct: 1    MLERRTIVLADGSVRSYFALPLDYQDFAP--RPLDFLHRF---------PPP-------- 41

Query: 2942 KHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERD 3121
                    LSP  FR                          P    G+ KRKYGD D   
Sbjct: 42   --------LSPGRFRL-------------------------PDFPPGASKRKYGDDDGS- 67

Query: 3122 VRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRID-- 3295
             RD+ ARQR+QLL                    AN LS  +   F       +  + +  
Sbjct: 68   -RDDLARQREQLLR------------------NANGLSRISGGEFSAGPSGGTPLKRELV 108

Query: 3296 ---EIRSSKHIRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKY 3466
               E+R SKH R  G    + S +H  VDQ ALK+AF+ F K +N+N++QK++YLEDGK 
Sbjct: 109  DPPEMRPSKHSRHDG---ANFS-RHSQVDQDALKRAFVNFAKLINDNVSQKRSYLEDGKQ 164

Query: 3467 GSLQCLACG--RASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDN 3640
            G L CLACG  R++K+FPD+H LIMH YNS +AD  VDHLGLHKALCVLMGWNY+K PDN
Sbjct: 165  GRLHCLACGTGRSAKDFPDMHSLIMHTYNSDNADSQVDHLGLHKALCVLMGWNYSKPPDN 224

Query: 3641 SKAYQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGG 3820
            SKAYQ +S+DEAAAN DDLIMWPPLVIIHNTN+G+++DGRMEG+GNK MDNK+++LGF G
Sbjct: 225  SKAYQFLSSDEAAANQDDLIMWPPLVIIHNTNTGKNRDGRMEGLGNKTMDNKIRELGFMG 284

Query: 3821 G 3823
            G
Sbjct: 285  G 285


>gb|EXB41290.1| hypothetical protein L484_004460 [Morus notabilis]
          Length = 523

 Score =  295 bits (756), Expect = 8e-77
 Identities = 173/358 (48%), Positives = 213/358 (59%)
 Frame = +2

Query: 2750 YGFHMLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMG 2929
            YGFHML+RRTI LADGSVRSYFALPPDYQDFPP       + RFFP              
Sbjct: 98   YGFHMLERRTIVLADGSVRSYFALPPDYQDFPPP------AARFFP-------------- 137

Query: 2930 FGFDKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDG 3109
                     GG +SP G  R +D           YWNSLGLDG          KRK+ D 
Sbjct: 138  ---------GGPVSPVGPNRHQD-----------YWNSLGLDG--------PAKRKFPDE 169

Query: 3110 DERDVRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARR 3289
            ++ D R                              RA+  + T     + ++  ++   
Sbjct: 170  EDTDQR------------------------RYGEDSRASKYTRTVGGFDNGNNNNNNNVG 205

Query: 3290 IDEIRSSKHIRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYG 3469
            + +   S     GGDY  +   KH DVDQ  LKKAFLRFVK LNEN  ++K Y E+GK  
Sbjct: 206  LRQGSGSG----GGDY--NPGHKHLDVDQIELKKAFLRFVKILNENAKERKIYFENGK-- 257

Query: 3470 SLQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKA 3649
             LQC+ACGR+SK+FPD   LI H+YN  + DL VDHLGLHKALCVLMGWNY++ PDNS+A
Sbjct: 258  RLQCVACGRSSKDFPDTPSLITHSYNYDNDDLRVDHLGLHKALCVLMGWNYSRPPDNSRA 317

Query: 3650 YQLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            YQ +SADEAAAN DDLI+WPP+VIIHNT +G++K+GRMEG+GNK+MD +++DLGF GG
Sbjct: 318  YQFLSADEAAANQDDLILWPPMVIIHNTLTGKNKEGRMEGLGNKLMDARIRDLGFHGG 375


>ref|XP_006489387.1| PREDICTED: uncharacterized protein LOC102629231 [Citrus sinensis]
          Length = 470

 Score =  291 bits (745), Expect = 2e-75
 Identities = 198/477 (41%), Positives = 242/477 (50%), Gaps = 9/477 (1%)
 Frame = +2

Query: 2420 MAGGN-PKGXXXXXXXXXXXN--RKTRWESGNNPQPDHKSGTNAES-KPNKPSSNPKDEQ 2587
            MAGGN PK            +  RK+RWES  NP  D K   +     P +P S P    
Sbjct: 1    MAGGNHPKSSSHKPPPSSALSSYRKSRWESPKNPPSDQKPKPSPNKHSPAQPKSLPAPTH 60

Query: 2588 SQGS--GPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFH 2761
               S  GP  P S+P                                         YGFH
Sbjct: 61   PSFSSHGPPLPYSEPPPPPPA-----------------------------------YGFH 85

Query: 2762 MLDRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFD 2941
            ML+RRTI LADGSVRSYFALPPDY DF P      H+    P                 +
Sbjct: 86   MLERRTIVLADGSVRSYFALPPDY-DFTPR-----HNSLLRP-----------------E 122

Query: 2942 KHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERD 3121
             HF P       GFR                      D R+ I+  G +KRK+G  +E++
Sbjct: 123  FHFSP----EAAGFR----------------------DRREYINGPGPMKRKFGVDEEKE 156

Query: 3122 VRDEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEI 3301
            ++   +R                        DR   L GT+    H D         +E 
Sbjct: 157  LQHLMSRANSS-------------------RDR---LVGTSG---HFD---------EET 182

Query: 3302 RSSKHIRVG-GDYGVDV-SLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYL-EDGKYGS 3472
            R++K++R   G  G  V   K+ +VD   LKK FL FVK +NEN+A +K+YL EDGK G 
Sbjct: 183  RAAKYMRTTPGAVGPSVVKHKYDEVDHAMLKKVFLHFVKVINENVALRKSYLVEDGKQGR 242

Query: 3473 LQCLACGRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAY 3652
            LQC+AC R+SK+F D+HGLIMH YNS +ADL VDHLGLHKALCVLMGWNY+K PDNSKAY
Sbjct: 243  LQCIACRRSSKDFSDMHGLIMHTYNSDNADLRVDHLGLHKALCVLMGWNYSKPPDNSKAY 302

Query: 3653 QLVSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            + +  DEAAAN DDLIMWPP+VIIHNT +G+ KDGRMEG+GNK MD  ++DLGFG G
Sbjct: 303  KFLPPDEAAANQDDLIMWPPVVIIHNTLTGKGKDGRMEGLGNKAMDKTIRDLGFGTG 359


>ref|XP_006406164.1| hypothetical protein EUTSA_v10020450mg [Eutrema salsugineum]
            gi|557107310|gb|ESQ47617.1| hypothetical protein
            EUTSA_v10020450mg [Eutrema salsugineum]
          Length = 545

 Score =  289 bits (740), Expect = 6e-75
 Identities = 186/466 (39%), Positives = 236/466 (50%), Gaps = 17/466 (3%)
 Frame = +2

Query: 2477 NRKTRWESGNNPQPDHKSGTNAESKPNKPSSN---------PKDEQSQGSGPSKPVSDPK 2629
            +RK+RW S NN     K+  N  +  NKP +          PK   S    P+   S P 
Sbjct: 30   DRKSRWASSNNDGGSSKNNINNNNNSNKPMTGGQKVADNKLPKPNPSPKLAPTPSQSYPN 89

Query: 2630 TXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRS 2809
                               S    +               YGFHML+RRTI L DGSVRS
Sbjct: 90   HPNPAGPSSRPAPGSAFPASQ--FAFPDSSAALGAPPAPTYGFHMLERRTIVLVDGSVRS 147

Query: 2810 YFALPPDYQDFPP-HGRPFDHSERFFPFGHGGREPEPGGMGFGFDKHFPPGGRLSPEGFR 2986
            YFALPP+Y+DFPP   R  D +   F             MG  F + FPP     PE FR
Sbjct: 148  YFALPPNYRDFPPSQSRLADPAANRF-------------MGPEFSR-FPP---FHPEEFR 190

Query: 2987 RDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYG-----DGDERDVRDEFARQRQ 3151
              R             W+            EGS+KRK+      D  ERD R E  RQR 
Sbjct: 191  DQRQ-----------LWDR----------PEGSMKRKFPGEEEIDRRERDERGEMLRQRH 229

Query: 3152 QLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIRSSKHIRVGG 3331
            Q +HY                     L   TSSPF RD   D+       R++KH+R+G 
Sbjct: 230  QFMHYGNPNDQS--------------LMARTSSPFTRDVGEDA-------RAAKHMRIGS 268

Query: 3332 DYGVD--VSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACGRASK 3505
                +   +L +  VDQ ALKK+FL +VK + E+ ++KKNYLE+G  G LQCL CGR+ K
Sbjct: 269  SRHENGGQALNYLQVDQVALKKSFLGYVKRIYEDPSEKKNYLENGSTGPLQCLVCGRSPK 328

Query: 3506 EFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEAAAN 3685
            +  D HGL+MH Y    A   V HLGLHKALCVLMGWN++K PDNSKAYQ + A+ AA N
Sbjct: 329  DVQDTHGLVMHTYYYDDASSRVHHLGLHKALCVLMGWNFSKAPDNSKAYQNLPAEVAAIN 388

Query: 3686 NDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
             D LI+WPP +I+HNT++G+ KDGRMEG+G+K MDN++++L   GG
Sbjct: 389  QDQLIIWPPHIIVHNTSTGKGKDGRMEGLGSKRMDNRIRELKLTGG 434


>ref|XP_003535649.1| PREDICTED: protein SUPPRESSOR OF GENE SILENCING 3-like [Glycine max]
          Length = 460

 Score =  289 bits (739), Expect = 8e-75
 Identities = 197/475 (41%), Positives = 233/475 (49%), Gaps = 7/475 (1%)
 Frame = +2

Query: 2420 MAGGN-PKGXXXXXXXXXXXNRKTRWE---SGNNPQPDHKSGTNAESKPNKPSSNPKDEQ 2587
            MAGGN PK            +RK+RWE   S  N   D KS ++    P KP SN     
Sbjct: 1    MAGGNHPKSSHHNKPPPSASHRKSRWEPNSSSANSPADPKSKSSTAPSP-KPKSNTNPNP 59

Query: 2588 SQGSGPSKPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHML 2767
            S    P  P  DP                   L  P                  YGFHML
Sbjct: 60   SPKHLPF-PFPDPAP-----------------LGPP--------------PPPAYGFHML 87

Query: 2768 DRRTIALADGSVRSYFALPPDYQDFPPHGRPFDHSERFFPFGHGGREPEPGGMGFGFDKH 2947
            +RRTI LADGSVRSYFALPPDYQDF P  RP D   RF                      
Sbjct: 88   ERRTIVLADGSVRSYFALPPDYQDFAP--RPLDLPPRFC--------------------- 124

Query: 2948 FPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDERDVR 3127
             P     +    R+  D+     GGP D                                
Sbjct: 125  LPDYSYTAAAAKRKYGDD----DGGPRD-------------------------------- 148

Query: 3128 DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEI-R 3304
             + ARQR+QLL                    AN +S    S    D       R+D +  
Sbjct: 149  -DLARQREQLLR------------------NANGISREQFSAGPSDLRPSKHSRLDGLSN 189

Query: 3305 SSKHIRVGGDYGVDVSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCL 3484
            S++H +V               DQ ALKK+F  F K +NEN++QK+  LEDGK G L CL
Sbjct: 190  STRHSQV---------------DQDALKKSFCNFSKLINENVSQKRTCLEDGKQGRLHCL 234

Query: 3485 AC--GRASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQL 3658
            AC  GR++K+FPD+H LIMH YN  +AD  VDHLGLHKALCVLMGWNY+K PDNSKAYQ 
Sbjct: 235  ACGTGRSAKDFPDMHALIMHTYNPDNADSRVDHLGLHKALCVLMGWNYSKPPDNSKAYQF 294

Query: 3659 VSADEAAANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            + ADEAAAN DDLIMWPPLVIIHNTN+G+++DGRMEG+GNK MDNK+++LGF GG
Sbjct: 295  LPADEAAANQDDLIMWPPLVIIHNTNTGKNRDGRMEGLGNKTMDNKIRELGFVGG 349


>ref|XP_006406165.1| hypothetical protein EUTSA_v10020450mg [Eutrema salsugineum]
            gi|557107311|gb|ESQ47618.1| hypothetical protein
            EUTSA_v10020450mg [Eutrema salsugineum]
          Length = 548

 Score =  284 bits (726), Expect = 3e-73
 Identities = 186/469 (39%), Positives = 236/469 (50%), Gaps = 20/469 (4%)
 Frame = +2

Query: 2477 NRKTRWESGNNPQPDHKSGTNAESKPNKPSSN---------PKDEQSQGSGPSKPVSDPK 2629
            +RK+RW S NN     K+  N  +  NKP +          PK   S    P+   S P 
Sbjct: 30   DRKSRWASSNNDGGSSKNNINNNNNSNKPMTGGQKVADNKLPKPNPSPKLAPTPSQSYPN 89

Query: 2630 TXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFHMLDRRTIALADGSVRS 2809
                               S    +               YGFHML+RRTI L DGSVRS
Sbjct: 90   HPNPAGPSSRPAPGSAFPASQ--FAFPDSSAALGAPPAPTYGFHMLERRTIVLVDGSVRS 147

Query: 2810 YFALPPDYQDFPP-HGRPFDHSERFFPFGHGGREPEPGGMGFGFDKHFPPGGRLSPEGFR 2986
            YFALPP+Y+DFPP   R  D +   F             MG  F + FPP     PE FR
Sbjct: 148  YFALPPNYRDFPPSQSRLADPAANRF-------------MGPEFSR-FPP---FHPEEFR 190

Query: 2987 RDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYG-----DGDERDVRDEFARQRQ 3151
              R             W+            EGS+KRK+      D  ERD R E  RQR 
Sbjct: 191  DQRQ-----------LWDR----------PEGSMKRKFPGEEEIDRRERDERGEMLRQRH 229

Query: 3152 QLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHPMDSARRIDEIRSSKHIRVGG 3331
            Q +HY                     L   TSSPF RD   D+       R++KH+R+G 
Sbjct: 230  QFMHYGNPNDQS--------------LMARTSSPFTRDVGEDA-------RAAKHMRIGS 268

Query: 3332 DYGVD--VSLKHPDVDQQALKKAFLRFVKSLNENLAQKKNYLEDGKYGSLQCLACGR--- 3496
                +   +L +  VDQ ALKK+FL +VK + E+ ++KKNYLE+G  G LQCL CGR   
Sbjct: 269  SRHENGGQALNYLQVDQVALKKSFLGYVKRIYEDPSEKKNYLENGSTGPLQCLVCGRFDR 328

Query: 3497 ASKEFPDVHGLIMHAYNSHSADLHVDHLGLHKALCVLMGWNYAKVPDNSKAYQLVSADEA 3676
            + K+  D HGL+MH Y    A   V HLGLHKALCVLMGWN++K PDNSKAYQ + A+ A
Sbjct: 329  SPKDVQDTHGLVMHTYYYDDASSRVHHLGLHKALCVLMGWNFSKAPDNSKAYQNLPAEVA 388

Query: 3677 AANNDDLIMWPPLVIIHNTNSGRSKDGRMEGMGNKVMDNKLKDLGFGGG 3823
            A N D LI+WPP +I+HNT++G+ KDGRMEG+G+K MDN++++L   GG
Sbjct: 389  AINQDQLIIWPPHIIVHNTSTGKGKDGRMEGLGSKRMDNRIRELKLTGG 437


>emb|CAN69769.1| hypothetical protein VITISV_022064 [Vitis vinifera]
          Length = 400

 Score =  279 bits (714), Expect = 6e-72
 Identities = 187/397 (47%), Positives = 222/397 (55%), Gaps = 25/397 (6%)
 Frame = +2

Query: 2420 MAGGNPKGXXXXXXXXXXXNRKTRWESGNNPQPDHKSGT----NAESKPNKPSSNPKDEQ 2587
            MAGGNPK            +RK+RWESG+NP  D KSG     N+ S P  P+++PK   
Sbjct: 1    MAGGNPKASSHKPSSSSS-HRKSRWESGSNP--DKKSGDSKPPNSSSTPKTPNNDPKQAP 57

Query: 2588 SQGSGPS--KPVSDPKTXXXXXXXXXXXXXXXXILSDPNVSXXXXXXXXXXXXXXXYGFH 2761
            +  SG S  KP +D                    L DP                  YGFH
Sbjct: 58   ASTSGSSHPKPPAD-SVPTSAAAPVRPPVAGAPFLPDPTT--------FGPPPTPQYGFH 108

Query: 2762 MLDRRTIALADGSVRSYFALPPDYQDFPPHG-RPFDHSERFFPFGHGGREPEPGGMGFGF 2938
            ML+RRTI LADGSVRSYFAL PDYQDFPP   R  D + RF P G G   PEP G G G 
Sbjct: 109  MLERRTIVLADGSVRSYFALSPDYQDFPPPPPRAMDPAGRFLPMGPG-HGPEPVGPGLG- 166

Query: 2939 DKHFPPGGRLSPEGFRRDRDEAFGRGGGPHDYWNSLGLDGRDPISQEGSLKRKYGDGDER 3118
               FP  G +SPEGFR +RD+ + RG    DYWNSLGLDGR     EGS+KRKY + DER
Sbjct: 167  --RFPXTGPMSPEGFRGERDDPYSRGRH-QDYWNSLGLDGRG--HPEGSMKRKYSEEDER 221

Query: 3119 DVR---------DEFARQRQQLLHYXXXXXXXXXXXXXXXXDRANYLSGTTSSPFHRDHP 3271
            D R         DEFARQRQQLL Y                DR+ YL+G  SSPF R   
Sbjct: 222  DRREDRDRRDGNDEFARQRQQLLQYGNPSLNPNGYPLGG--DRSEYLAGP-SSPFRRG-V 277

Query: 3272 MDSARRIDEIRSSKHIRVGGDY---------GVDVSLKHPDVDQQALKKAFLRFVKSLNE 3424
            MD  R  DE+RSSK++R+GG Y         G +V LKH +VDQ ALKKAF++FVK +NE
Sbjct: 278  MDPIRG-DELRSSKYMRIGGGYEGFSRQGGVGDNVGLKHHNVDQNALKKAFIQFVKLINE 336

Query: 3425 NLAQKKNYLEDGKYGSLQCLACGRASKEFPDVHGLIM 3535
            + +Q++ YLEDGK G L+CLACGR  K  P V  L++
Sbjct: 337  SASQRRLYLEDGKQGPLRCLACGRFGKNGPLVPSLLL 373


Top