BLASTX nr result

ID: Akebia23_contig00002447 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00002447
         (1608 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282784.1| PREDICTED: uncharacterized protein LOC100245...   249   2e-63
ref|XP_006360776.1| PREDICTED: uncharacterized protein LOC102602...   236   2e-59
ref|XP_004247572.1| PREDICTED: uncharacterized protein LOC101247...   235   4e-59
ref|XP_007223281.1| hypothetical protein PRUPE_ppa009460mg [Prun...   232   3e-58
ref|XP_002314686.2| senescence-associated family protein [Populu...   232   4e-58
ref|XP_006379838.1| hypothetical protein POPTR_0008s15440g [Popu...   223   2e-55
ref|XP_004139816.1| PREDICTED: uncharacterized protein LOC101210...   220   1e-54
ref|XP_002517116.1| conserved hypothetical protein [Ricinus comm...   208   5e-51
ref|XP_004297111.1| PREDICTED: uncharacterized protein LOC101295...   204   9e-50
ref|XP_004297110.1| PREDICTED: uncharacterized protein LOC101295...   204   9e-50
ref|XP_007034793.1| NAD(P)H-quinone oxidoreductase subunit H, pu...   203   2e-49
ref|XP_003632307.1| PREDICTED: uncharacterized protein LOC100855...   201   1e-48
ref|XP_006489415.1| PREDICTED: uncharacterized protein LOC102612...   196   3e-47
ref|XP_006419965.1| hypothetical protein CICLE_v10005538mg [Citr...   196   3e-47
gb|EYU41472.1| hypothetical protein MIMGU_mgv1a010847mg [Mimulus...   194   1e-46
ref|XP_007050557.1| Uncharacterized protein isoform 2 [Theobroma...   193   2e-46
ref|XP_007050556.1| Uncharacterized protein isoform 1 [Theobroma...   193   2e-46
ref|XP_006379837.1| hypothetical protein POPTR_0008s15440g [Popu...   192   4e-46
ref|XP_003556054.1| PREDICTED: uncharacterized protein LOC100807...   186   3e-44
ref|XP_002524763.1| conserved hypothetical protein [Ricinus comm...   186   3e-44

>ref|XP_002282784.1| PREDICTED: uncharacterized protein LOC100245904 [Vitis vinifera]
          Length = 307

 Score =  249 bits (637), Expect = 2e-63
 Identities = 152/278 (54%), Positives = 182/278 (65%), Gaps = 20/278 (7%)
 Frame = +2

Query: 833  MLRKRSRAVTSNKRTLMAD---------KHIRP-ISSFFNSPRFFSGFSAIGSPETESMM 982
            MLRKRSRA TS K+ LMAD         K+ RP  SSFF+SPR F+GFS+    ETE+MM
Sbjct: 1    MLRKRSRASTS-KQALMADCGSLPSPTDKYRRPPSSSFFSSPRLFTGFSSKVFSETETMM 59

Query: 983  SPTSILDTKPFSVLGN--VKPSLDFRPNHPFEK----KLDSRGVGVGLGIVDALNDQETN 1144
            SPTSILD+KPFS   N     +   +P+ P  +    KLDSR +G  LGIVDAL   E++
Sbjct: 60   SPTSILDSKPFSGFRNPFTNTTSTIKPSEPEPRRHWDKLDSRMIG--LGIVDALTHDESD 117

Query: 1145 KKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXXXXXXDFGIKPRNSQLGLFSPCST-- 1318
             K SK +SRMVLFGSQLKIQI                  DFGIK RNSQLG FSPC +  
Sbjct: 118  PKLSKPESRMVLFGSQLKIQIPPLPSSVLSPAESPKSPADFGIKTRNSQLGPFSPCLSQS 177

Query: 1319 --KKSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYTCVIARGPNPRTTHIYGDCIVE 1492
              KKS FGS+NSGLE PNSP++FTGCLSA+EMELSEDYTCVI+ GPNPRTTHI+ +CIV 
Sbjct: 178  PAKKSGFGSANSGLEAPNSPRIFTGCLSATEMELSEDYTCVISHGPNPRTTHIFDNCIV- 236

Query: 1493 NCCGGSGFSLPKKDNCFSVDQSSYLNENFLSFCYNCKK 1606
                  GFS  +K+N           E+FL+FC++C+K
Sbjct: 237  ------GFSASRKEN-------DVFPESFLNFCHSCRK 261


>ref|XP_006360776.1| PREDICTED: uncharacterized protein LOC102602108 [Solanum tuberosum]
          Length = 302

 Score =  236 bits (603), Expect = 2e-59
 Identities = 135/244 (55%), Positives = 164/244 (67%), Gaps = 8/244 (3%)
 Frame = +2

Query: 899  RPISSFFNSPRFFSGFSAIGSPETESMMSPTSILDTKPFSVLGNV---KPSLDFRPNHPF 1069
            R  SSFF SPR F+ FSA G P+TES+MSPTSILD+KPF+VL N     P      +   
Sbjct: 15   RKTSSFFGSPRLFTCFSAKGFPDTESIMSPTSILDSKPFTVLRNPFWSDPKSPKPESRVH 74

Query: 1070 EKKLDSRGVGVGLGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXX 1249
             +KLDS+GVG  LG+VDAL D++++ K   S SRMV+ GSQLKIQI              
Sbjct: 75   FQKLDSKGVG--LGLVDALIDEKSDSKEMNSISRMVVLGSQLKIQIPTLPPSFNYPTDSP 132

Query: 1250 XXXXDFGIKPRNSQLGLFSP----CSTKKSPFGSSNSGLETPNSPQVFTGCLSASEMELS 1417
                DFGIK RNSQLG FSP       KKSPFGSSNS ++ PNSP  F+  LSA+EMELS
Sbjct: 133  PSPGDFGIKTRNSQLGSFSPGFSPSPVKKSPFGSSNSNIDIPNSPGAFSS-LSAAEMELS 191

Query: 1418 EDYTCVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDN-CFSVDQSSYLNENFLSFCY 1594
            E+YTCVI+ GPNPRTTHI+ DCI+E+CCG   +S  +K+N  F     SY +E+FLSFC+
Sbjct: 192  EEYTCVISYGPNPRTTHIFDDCILESCCGVVKYSASRKENETFPNPPMSYPSESFLSFCH 251

Query: 1595 NCKK 1606
            NCKK
Sbjct: 252  NCKK 255


>ref|XP_004247572.1| PREDICTED: uncharacterized protein LOC101247367 [Solanum
            lycopersicum]
          Length = 298

 Score =  235 bits (600), Expect = 4e-59
 Identities = 132/240 (55%), Positives = 164/240 (68%), Gaps = 4/240 (1%)
 Frame = +2

Query: 899  RPISSFFNSPRFFSGFSAIGSPETESMMSPTSILDTKPFSVLGNV---KPSLDFRPNHPF 1069
            R  SSFF SPR F+ F+A G P+TES+MSPTSILD+KPF+VL N    +P      +   
Sbjct: 15   RKTSSFFGSPRLFTCFAAKGFPDTESIMSPTSILDSKPFTVLRNPFWSEPKSPKPESRVH 74

Query: 1070 EKKLDSRGVGVGLGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXX 1249
             +KLDS+GVG  LG+VDAL D++++ K   S SRMV+ GSQLKIQI              
Sbjct: 75   FQKLDSKGVG--LGLVDALIDEKSDSKEMNSVSRMVVLGSQLKIQIPTLPPTFNYPTDSP 132

Query: 1250 XXXXDFGIKPRNSQLGLFSPCSTKKSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYT 1429
                DFGIK RNSQLG  SP   KKSPFGSSNS ++ PNSP  F+  LSA+EMELSE+YT
Sbjct: 133  PSPGDFGIKTRNSQLGSLSP--VKKSPFGSSNSNIDIPNSPGAFSS-LSAAEMELSEEYT 189

Query: 1430 CVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDN-CFSVDQSSYLNENFLSFCYNCKK 1606
            CVI+ GPNPRTTHI+ DCI+E+CCG   +S  +K+N  F+     Y +E+FLSFC+NCKK
Sbjct: 190  CVISHGPNPRTTHIFDDCILESCCGVVKYSASRKENETFTSPPMCYPSESFLSFCHNCKK 249


>ref|XP_007223281.1| hypothetical protein PRUPE_ppa009460mg [Prunus persica]
            gi|462420217|gb|EMJ24480.1| hypothetical protein
            PRUPE_ppa009460mg [Prunus persica]
          Length = 291

 Score =  232 bits (592), Expect = 3e-58
 Identities = 136/251 (54%), Positives = 164/251 (65%), Gaps = 10/251 (3%)
 Frame = +2

Query: 884  ADKHIRPISSFFNSPRFFSGFSAIGSPETESMMSPTSILDTKPFSVLGNVKPSLDFRPNH 1063
            ADK+ +P SSFF SPR F+ F++ G  ET+++MSPTSIL+TKPF  L N   S    P  
Sbjct: 11   ADKYTKPTSSFFTSPRLFTSFTSKGYSETDAVMSPTSILETKPFFGLRNPFWSESNTPRT 70

Query: 1064 PFEK------KLDSRGVGVGLGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQIXXXXXX 1225
            P  +      KLD +G+G  L IVDALND  +N K SK +SRMV+FGSQLKIQI      
Sbjct: 71   PEPETKRPWDKLDPKGIG--LAIVDALNDDGSNPKPSKPESRMVIFGSQLKIQIPHLQPS 128

Query: 1226 XXXXXXXXXXXXDFGIKPRNSQLGLFSPCST----KKSPFGSSNSGLETPNSPQVFTGCL 1393
                        DF I+ +NSQLG FS  S+    K SPF S+NSGLET NS +VFT CL
Sbjct: 129  VLSPSDSPKSAADFSIRTKNSQLGSFSSVSSESPAKNSPFKSANSGLETMNSARVFTSCL 188

Query: 1394 SASEMELSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDNCFSVDQSSYLNE 1573
            S SEMELSEDYTCVI+ GPNP+TTHI+ +CIVE+  G   FS   K     V+ SSYL+E
Sbjct: 189  SVSEMELSEDYTCVISHGPNPKTTHIFDNCIVESSEGVPEFSPGGK-----VNGSSYLSE 243

Query: 1574 NFLSFCYNCKK 1606
            +FLSFC NCKK
Sbjct: 244  SFLSFCDNCKK 254


>ref|XP_002314686.2| senescence-associated family protein [Populus trichocarpa]
            gi|550329454|gb|EEF00857.2| senescence-associated family
            protein [Populus trichocarpa]
          Length = 329

 Score =  232 bits (591), Expect = 4e-58
 Identities = 144/282 (51%), Positives = 175/282 (62%), Gaps = 24/282 (8%)
 Frame = +2

Query: 833  MLRKRSRAVTSNKRTLMA---------DKHIRPISSFFNSPRFFSGFSAIGSPET-ESMM 982
            M++KRSR  TS K+ LM+         DK  +P S     P+  +G +     ET E++M
Sbjct: 1    MMKKRSRTATS-KQALMSQHSSIPSPTDKFRKPTSF----PKLLTGLTFKNFSETAEAIM 55

Query: 983  SPTSILDTKPFSVLGNV-------KPSLDFRPNHPFEKKLDSRGVGVGLGIVDALNDQET 1141
            SPTSILD+KPFS L N         P            KLDS+G+G  LGIVDAL+D+ET
Sbjct: 56   SPTSILDSKPFSGLKNPFWHDACPSPKTPEPDTRRHWDKLDSKGIG--LGIVDALDDEET 113

Query: 1142 NKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXXXXXX-DFGIKPRNSQLGLFS---- 1306
            +   SK +SRMVLFGSQLKIQI                   DFGIK RNSQ G FS    
Sbjct: 114  DSNLSKPESRMVLFGSQLKIQIPPLPPPFLSPTDQSPKLNGDFGIKTRNSQFGSFSSGLS 173

Query: 1307 PCSTKKSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYTCVIARGPNPRTTHIYGDCI 1486
            P   KKS FGS+NSG++TPNSP+VFTGCLSASEMELSEDYTCVI  GP P+TTHI+ +CI
Sbjct: 174  PSPVKKSLFGSANSGMDTPNSPRVFTGCLSASEMELSEDYTCVITHGPVPKTTHIFDNCI 233

Query: 1487 VENCCGGSGFSLP--KKDNCFSVDQSSYLNENFLSFCYNCKK 1606
            VE+CCG  GFS    K +N F  D  +Y +++FLSFC +CKK
Sbjct: 234  VESCCGAVGFSASSRKDNNRFLGDGLTYRSDSFLSFCSSCKK 275


>ref|XP_006379838.1| hypothetical protein POPTR_0008s15440g [Populus trichocarpa]
            gi|550333138|gb|ERP57635.1| hypothetical protein
            POPTR_0008s15440g [Populus trichocarpa]
          Length = 301

 Score =  223 bits (567), Expect = 2e-55
 Identities = 138/254 (54%), Positives = 162/254 (63%), Gaps = 15/254 (5%)
 Frame = +2

Query: 887  DKHIRPISSFFNSPRFFSGFSAIGSPET-ESMMSPTSILDTKPFSVLGN-VKPSLDFRPN 1060
            DK  +P S     P+  + F+     ET E++MSPTSILD+KPFS L N   P  +  P 
Sbjct: 12   DKFRKPTSF----PKLLTAFTFKNFSETSEAIMSPTSILDSKPFSGLKNPFWPDPNPSPK 67

Query: 1061 HPFEK------KLDSRGVGVGLGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQIXXXXX 1222
             P  +      KLDS+G+G  LGIVDAL+D++T+   SK +SR VLFGSQLKIQI     
Sbjct: 68   TPEPETRRHWDKLDSKGIG--LGIVDALDDEKTDSNLSKPESRTVLFGSQLKIQIPPFPP 125

Query: 1223 XXXXXXXXXXXXX-DFGIKPRNSQLGLFS----PCSTKKSPFGSSNSGLETPNSPQVFTG 1387
                          +FGIK RNSQLG FS    P   KKS FGS+NSG+ETPNSP+VF G
Sbjct: 126  SFLSTTDQSPKSPGEFGIKTRNSQLGSFSSGYSPSPVKKSLFGSANSGMETPNSPRVFAG 185

Query: 1388 CLSASEMELSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGF--SLPKKDNCFSVDQSS 1561
            CLSASEMELSEDYTCVI  GP PRTTHI+ +CIVE+CCG  GF  SL K +N F  D SS
Sbjct: 186  CLSASEMELSEDYTCVITHGPVPRTTHIFDNCIVESCCGVVGFSTSLKKDNNRFLGDGSS 245

Query: 1562 YLNENFLSFCYNCK 1603
            Y   NFLSFC  CK
Sbjct: 246  YPPNNFLSFCSACK 259


>ref|XP_004139816.1| PREDICTED: uncharacterized protein LOC101210425 [Cucumis sativus]
            gi|449492592|ref|XP_004159042.1| PREDICTED:
            uncharacterized LOC101210425 [Cucumis sativus]
          Length = 294

 Score =  220 bits (561), Expect = 1e-54
 Identities = 131/257 (50%), Positives = 162/257 (63%), Gaps = 8/257 (3%)
 Frame = +2

Query: 860  TSNKRTLMADKHIRPISSFFNSPRFFSGFSAIGSPETESMMSPTSILDTKPFSVLGNVKP 1039
            TSN R L         SSFF SPR F+  S+ G  ETE++MSPTSIL+  PF  L N   
Sbjct: 17   TSNSRKLS--------SSFFGSPRLFTSSSSKGLSETEAVMSPTSILE--PFLGLRN--- 63

Query: 1040 SLDFRPNHPFEKKLDSR----GVGVGLGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQI 1207
            S     N P  +  +S+      G+GL IVD L ++ ++ K SK D+RMV+ GSQLKIQI
Sbjct: 64   SFWGESNSPRTQLTESKRPWDSKGIGLAIVDGLTEENSDPKPSKPDTRMVVLGSQLKIQI 123

Query: 1208 XXXXXXXXXXXXXXXXXXDFGIKPRNSQLGLFSPCST----KKSPFGSSNSGLETPNSPQ 1375
                              +FGIK RNS LG  SP S+    KKS FGSS+SG ETPNSP 
Sbjct: 124  PPLPPFVSPTDDSPVSPIEFGIKTRNSHLGSLSPVSSLSPAKKSAFGSSSSGQETPNSPL 183

Query: 1376 VFTGCLSASEMELSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDNCFSVDQ 1555
            VFTGCLSA E+E SEDYTCVI+ GPNP+TTHI+GDC++E+ CG   +S  +K+N F  D+
Sbjct: 184  VFTGCLSAGEIEQSEDYTCVISHGPNPKTTHIFGDCVIESGCG--VYSPVRKENGFFRDR 241

Query: 1556 SSYLNENFLSFCYNCKK 1606
            +S+  ENFLSFC NCKK
Sbjct: 242  TSFSPENFLSFCNNCKK 258


>ref|XP_002517116.1| conserved hypothetical protein [Ricinus communis]
            gi|223543751|gb|EEF45279.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 319

 Score =  208 bits (530), Expect = 5e-51
 Identities = 140/281 (49%), Positives = 169/281 (60%), Gaps = 23/281 (8%)
 Frame = +2

Query: 833  MLRKRSRAVTSNKRTLMAD---------KHIRPISSFFNSPRFFSGFSAIGSPET-ESMM 982
            M+RKRSR V+S+K+ LMAD         K+ +P S     PR F+GFS     ET ES+M
Sbjct: 1    MIRKRSR-VSSSKQVLMADYSSILSPTEKYRKPTSF----PRLFTGFSFKNFSETTESVM 55

Query: 983  SPTSILDTKPFSVLGN-------VKPSLDFRPNHPFEKKLDSRGVGVGLGIVDALN-DQE 1138
            SPTSILD+KPFS   N       + P            KLDS+G+G  L IVDALN D +
Sbjct: 56   SPTSILDSKPFSGFRNPFLPDQNLTPKTQESDTKRTWDKLDSKGIG--LAIVDALNYDDK 113

Query: 1139 TNKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXXXXXXDFGIKPRNSQLGL----FS 1306
            T+   SK +SRMVLFGSQLKIQ+                  DFGIK R+SQLG      S
Sbjct: 114  TDSNLSKPESRMVLFGSQLKIQVPPLPVSPTDQSPKSPA--DFGIKTRHSQLGSSSSGLS 171

Query: 1307 PCSTKKSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYTCVIARGPNPRTTHIYGDCI 1486
                KKS  GS+NS ++T +SP VF G LSA EME SEDYTCVI+ GPNP+TTHI+ D I
Sbjct: 172  HSPVKKSVCGSANSSIDTSSSPGVFNGSLSAIEMEQSEDYTCVISYGPNPKTTHIFDDYI 231

Query: 1487 VENCCGGSGFSLPK-KDNCFSVDQSSYLNENFLSFCYNCKK 1606
            VE+CC    FS  + + N F  D SSY ++NFLSFCY CKK
Sbjct: 232  VESCCDVVEFSTSRTQTNGFLGDGSSYPSDNFLSFCYACKK 272


>ref|XP_004297111.1| PREDICTED: uncharacterized protein LOC101295037 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 285

 Score =  204 bits (519), Expect = 9e-50
 Identities = 124/246 (50%), Positives = 154/246 (62%), Gaps = 11/246 (4%)
 Frame = +2

Query: 899  RPISSFFNSPRFFSGFSAIGSPETESMMSPTSILDTKPFSVLGNVKPSLDFRPNHPFEK- 1075
            RP  SFF++PR F+ F++    ETE++MSPTSIL+TKPF  L N   S    P  P  + 
Sbjct: 11   RPTPSFFSTPRLFTSFTSKPFSETEAVMSPTSILETKPFFGLRNPFWSESNTPKTPEPET 70

Query: 1076 -----KLDSRGVGVGLGIVDALNDQE-TNKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXX 1237
                 KLDS+G G  L IVDAL D + ++ K SK  +RMV+FGSQLKIQI          
Sbjct: 71   KRHWDKLDSKGTG--LAIVDALIDDDGSDPKPSKPQTRMVIFGSQLKIQIPPLPSSVLPS 128

Query: 1238 XXXXXXXXDFGIKPRNSQLGLFSP----CSTKKSPFGSSNSGLETPNSPQVFTGCLSASE 1405
                     FGI+  NS LG  S        KKSPFG+ +SG ETP SPQVF GCLS SE
Sbjct: 129  SESPKLEAVFGIERTNSHLGPLSSGLSQSPAKKSPFGTVSSGNETPVSPQVFKGCLSVSE 188

Query: 1406 MELSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDNCFSVDQSSYLNENFLS 1585
            MELSEDYTCVI+ GPNP+TTHI+ + +VE+C G    S  +K+N     +SSY +E+FLS
Sbjct: 189  MELSEDYTCVISHGPNPKTTHIFDNRVVESCDGVPQLSPTRKEN-----KSSYPSESFLS 243

Query: 1586 FCYNCK 1603
            FCY+CK
Sbjct: 244  FCYHCK 249


>ref|XP_004297110.1| PREDICTED: uncharacterized protein LOC101295037 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 327

 Score =  204 bits (519), Expect = 9e-50
 Identities = 124/246 (50%), Positives = 154/246 (62%), Gaps = 11/246 (4%)
 Frame = +2

Query: 899  RPISSFFNSPRFFSGFSAIGSPETESMMSPTSILDTKPFSVLGNVKPSLDFRPNHPFEK- 1075
            RP  SFF++PR F+ F++    ETE++MSPTSIL+TKPF  L N   S    P  P  + 
Sbjct: 11   RPTPSFFSTPRLFTSFTSKPFSETEAVMSPTSILETKPFFGLRNPFWSESNTPKTPEPET 70

Query: 1076 -----KLDSRGVGVGLGIVDALNDQE-TNKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXX 1237
                 KLDS+G G  L IVDAL D + ++ K SK  +RMV+FGSQLKIQI          
Sbjct: 71   KRHWDKLDSKGTG--LAIVDALIDDDGSDPKPSKPQTRMVIFGSQLKIQIPPLPSSVLPS 128

Query: 1238 XXXXXXXXDFGIKPRNSQLGLFSP----CSTKKSPFGSSNSGLETPNSPQVFTGCLSASE 1405
                     FGI+  NS LG  S        KKSPFG+ +SG ETP SPQVF GCLS SE
Sbjct: 129  SESPKLEAVFGIERTNSHLGPLSSGLSQSPAKKSPFGTVSSGNETPVSPQVFKGCLSVSE 188

Query: 1406 MELSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDNCFSVDQSSYLNENFLS 1585
            MELSEDYTCVI+ GPNP+TTHI+ + +VE+C G    S  +K+N     +SSY +E+FLS
Sbjct: 189  MELSEDYTCVISHGPNPKTTHIFDNRVVESCDGVPQLSPTRKEN-----KSSYPSESFLS 243

Query: 1586 FCYNCK 1603
            FCY+CK
Sbjct: 244  FCYHCK 249


>ref|XP_007034793.1| NAD(P)H-quinone oxidoreductase subunit H, putative isoform 1
            [Theobroma cacao] gi|590658236|ref|XP_007034794.1|
            NAD(P)H-quinone oxidoreductase subunit H, putative
            isoform 1 [Theobroma cacao] gi|508713822|gb|EOY05719.1|
            NAD(P)H-quinone oxidoreductase subunit H, putative
            isoform 1 [Theobroma cacao] gi|508713823|gb|EOY05720.1|
            NAD(P)H-quinone oxidoreductase subunit H, putative
            isoform 1 [Theobroma cacao]
          Length = 289

 Score =  203 bits (516), Expect = 2e-49
 Identities = 122/231 (52%), Positives = 147/231 (63%), Gaps = 4/231 (1%)
 Frame = +2

Query: 926  PRFFSGFSAIG-SPETESMMSPTSILDTKPFSVLGNV---KPSLDFRPNHPFEKKLDSRG 1093
            PR F+GF+    S  TE +MSPTSILD+KPFS   N    + S+   P      KL+++G
Sbjct: 24   PRLFTGFTLKAFSDNTEVVMSPTSILDSKPFSAFRNPFWSESSIPKTPEPETRHKLETKG 83

Query: 1094 VGVGLGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXXXXXXDFGI 1273
            VG  LGIVDAL D +++   SKS    VLFGSQL+IQI                  +FGI
Sbjct: 84   VG--LGIVDALKDDDSDSNLSKS----VLFGSQLRIQIPSLPPVFSPAESPRTPP-EFGI 136

Query: 1274 KPRNSQLGLFSPCSTKKSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYTCVIARGPN 1453
            K RNSQL  FS      SP   S   +ET NSP VF G LSA+EMELSEDYTCVI+ GPN
Sbjct: 137  KTRNSQLSSFSS-GMSPSPVRKS---IETLNSPGVFAGSLSATEMELSEDYTCVISHGPN 192

Query: 1454 PRTTHIYGDCIVENCCGGSGFSLPKKDNCFSVDQSSYLNENFLSFCYNCKK 1606
            PRTTHI+ +CIVE+CCG  GFS  K++N F  D+SSY +E+FLSFCY CKK
Sbjct: 193  PRTTHIFDNCIVESCCGVVGFSSLKRENGFLADRSSYQSESFLSFCYTCKK 243


>ref|XP_003632307.1| PREDICTED: uncharacterized protein LOC100855273 [Vitis vinifera]
            gi|296085215|emb|CBI28710.3| unnamed protein product
            [Vitis vinifera]
          Length = 293

 Score =  201 bits (510), Expect = 1e-48
 Identities = 126/275 (45%), Positives = 156/275 (56%), Gaps = 17/275 (6%)
 Frame = +2

Query: 833  MLRKRSRAVTSNKRTLMADKH---------IRPISSFFNSPRFFSGFSAIGSPETESMMS 985
            MLR RSRAV S K+ +M D            +PIS    SP+ F GF +   PE E ++S
Sbjct: 1    MLRNRSRAVAS-KQAIMGDHSSLPSPTENLTKPISFLLGSPKIFRGFISKCLPEAEDIIS 59

Query: 986  PTSILDTKPFSVLGN--------VKPSLDFRPNHPFEKKLDSRGVGVGLGIVDALNDQET 1141
            PTSI DTKPFS  GN          P         +E  LDS G+GV L   D +N +  
Sbjct: 60   PTSIFDTKPFS--GNPFEYEKTQASPGTFSETKRSWEN-LDSIGIGVALIDSDPINGEGA 116

Query: 1142 NKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXXXXXXDFGIKPRNSQLGLFSPCSTK 1321
            N+ FSK +SRMVLFGSQLK+QI                  DFGIK RNSQL   SP    
Sbjct: 117  NENFSKPNSRMVLFGSQLKVQIPHLQPSALSPAESPKSPADFGIKTRNSQLASLSP---- 172

Query: 1322 KSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYTCVIARGPNPRTTHIYGDCIVENCC 1501
               FGS NSG++T +SP++FTG      MELSE+YTCVI+ GPNPRTTHI+ +CIVE+CC
Sbjct: 173  ---FGSLNSGIQTKDSPRIFTG------MELSEEYTCVISHGPNPRTTHIFDNCIVESCC 223

Query: 1502 GGSGFSLPKKDNCFSVDQSSYLNENFLSFCYNCKK 1606
            G S  +L + + C   +  +   ENFLS C+ CKK
Sbjct: 224  GVS--ALSQNNYCTFPENPNSPPENFLSCCHTCKK 256


>ref|XP_006489415.1| PREDICTED: uncharacterized protein LOC102612013 [Citrus sinensis]
          Length = 295

 Score =  196 bits (498), Expect = 3e-47
 Identities = 126/257 (49%), Positives = 160/257 (62%), Gaps = 16/257 (6%)
 Frame = +2

Query: 884  ADKHIRPISSFFNSPRFFSGFSAI-GSPETE-SMMSPTSILD-TKPFSVLGN-----VKP 1039
            +DK  R  +SF   PR F+G + + G  ETE S+MSPTSILD +KPFS+L N     +  
Sbjct: 12   SDKINRKPTSF---PRLFTGLTTLKGFAETEVSVMSPTSILDISKPFSILKNPFWSELTN 68

Query: 1040 SLDFRPNHPFEK---KLDSRGVGVG-LGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQI 1207
            +    P  P  +   KL+S+G G+G LGIVD L D+  + K  K+++RMVLFGSQLKIQI
Sbjct: 69   NTQHSPKTPEPETRHKLESKG-GIGCLGIVDVLKDEIQDPKKPKTETRMVLFGSQLKIQI 127

Query: 1208 XXXXXXXXXXXXXXXXXXDFGIKPRNSQLGLFSPCSTKKSPFGSSNSGLETPNSPQVFTG 1387
                              +FGIK RN                GSS S +   NSPQVFTG
Sbjct: 128  PPLVSSVLSPQDSPKSPAEFGIKTRNQ--------------LGSSFSSVTPSNSPQVFTG 173

Query: 1388 CLSASEMELSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDN----CFSVDQ 1555
            CLSA+EMELSEDYTCVI+ GPNP+TTHI+ +CIVE+CCG +GFS  +K++      S D+
Sbjct: 174  CLSATEMELSEDYTCVISHGPNPKTTHIFDNCIVESCCGVAGFSSLRKESKEFMSKSDDR 233

Query: 1556 SSYLNENFLSFCYNCKK 1606
             SY +E+FLSFCYNCKK
Sbjct: 234  FSYPSESFLSFCYNCKK 250


>ref|XP_006419965.1| hypothetical protein CICLE_v10005538mg [Citrus clementina]
            gi|567853605|ref|XP_006419966.1| hypothetical protein
            CICLE_v10005538mg [Citrus clementina]
            gi|557521838|gb|ESR33205.1| hypothetical protein
            CICLE_v10005538mg [Citrus clementina]
            gi|557521839|gb|ESR33206.1| hypothetical protein
            CICLE_v10005538mg [Citrus clementina]
          Length = 295

 Score =  196 bits (498), Expect = 3e-47
 Identities = 126/257 (49%), Positives = 160/257 (62%), Gaps = 16/257 (6%)
 Frame = +2

Query: 884  ADKHIRPISSFFNSPRFFSGFSAI-GSPETE-SMMSPTSILD-TKPFSVLGN-----VKP 1039
            +DK  R  +SF   PR F+G + + G  ETE S+MSPTSILD +KPFS+L N     +  
Sbjct: 12   SDKINRKPTSF---PRLFTGLTTLKGFAETEVSVMSPTSILDISKPFSILKNPFWSELTN 68

Query: 1040 SLDFRPNHPFEK---KLDSRGVGVG-LGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQI 1207
            +    P  P  +   KL+S+G G+G LGIVD L D+  + K  K+++RMVLFGSQLKIQI
Sbjct: 69   NTQHSPKTPEPETRHKLESKG-GIGCLGIVDVLKDEIQDPKKPKTETRMVLFGSQLKIQI 127

Query: 1208 XXXXXXXXXXXXXXXXXXDFGIKPRNSQLGLFSPCSTKKSPFGSSNSGLETPNSPQVFTG 1387
                              +FGIK RN                GSS S +   NSPQVFTG
Sbjct: 128  PPLVSSVLSPQDSPKSPAEFGIKTRNQ--------------LGSSFSSVTPSNSPQVFTG 173

Query: 1388 CLSASEMELSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDN----CFSVDQ 1555
            CLSA+EMELSEDYTCVI+ GPNP+TTHI+ +CIVE+CCG +GFS  +K++      S D+
Sbjct: 174  CLSATEMELSEDYTCVISHGPNPKTTHIFDNCIVESCCGVAGFSSLRKESKEFMSKSDDR 233

Query: 1556 SSYLNENFLSFCYNCKK 1606
             SY +E+FLSFCYNCKK
Sbjct: 234  FSYPSESFLSFCYNCKK 250


>gb|EYU41472.1| hypothetical protein MIMGU_mgv1a010847mg [Mimulus guttatus]
          Length = 300

 Score =  194 bits (492), Expect = 1e-46
 Identities = 130/248 (52%), Positives = 152/248 (61%), Gaps = 15/248 (6%)
 Frame = +2

Query: 908  SSFFNSPRFFSGFSAIGSP-ETESMM--SPTSILDTKPFSVLGNVKPSLDF------RPN 1060
            SSFF SPR FS      SP ETES +  SPTSILD+KPFS L N  PS            
Sbjct: 19   SSFFTSPRLFSR-----SPNETESSITSSPTSILDSKPFSCL-NKNPSAPLSKTPKPEAK 72

Query: 1061 HPFEKKLDSRGVGVGLGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXX 1240
            + +  KLD+R V   LG+VDAL D++     SK ++RMVLFGSQLKIQ+           
Sbjct: 73   NLYWDKLDTRKVS--LGLVDALIDEKPCPNSSKPNTRMVLFGSQLKIQVPPLPPSVFSPN 130

Query: 1241 XXXXXXXDFGIKPRNSQLGLFSPCS---TKKSPFGSSNSGLETPNSPQVFTGCLSASEME 1411
                   DFGIK RNS +  F PCS    KKSPF SSNSGL T          LSASE+E
Sbjct: 131  ESPKSPGDFGIKTRNSHV--FDPCSPSPVKKSPFSSSNSGLLTS---------LSASEIE 179

Query: 1412 LSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGFSLPKKDNCFSVDQS---SYLNENFL 1582
            LSEDYTCVI+ GPNPRTTHI+ DCIVE+CCG   FS  +K+N  S+  +   SY +E+FL
Sbjct: 180  LSEDYTCVISYGPNPRTTHIFEDCIVESCCGVVKFSESRKEN-VSIPHNRSMSYPSESFL 238

Query: 1583 SFCYNCKK 1606
            SFCYNCKK
Sbjct: 239  SFCYNCKK 246


>ref|XP_007050557.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508702818|gb|EOX94714.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 285

 Score =  193 bits (491), Expect = 2e-46
 Identities = 125/279 (44%), Positives = 160/279 (57%), Gaps = 21/279 (7%)
 Frame = +2

Query: 833  MLRKRSRAVTSNKRTLMAD---------KHIRPISSFFNSPRFFSGFSAIGSPETESMMS 985
            MLR RSRAVTS K+ LMAD          + RPI SFF SPRF   F+  G P+TE++ S
Sbjct: 1    MLRNRSRAVTS-KQALMADHSSQSTPAQNYTRPIPSFFGSPRF-KAFTTKGLPDTEAVKS 58

Query: 986  PTSILDTKPFSVLGNV--------KPSLDFRPNHPFE---KKLDSRGVGVGLGIVDALND 1132
            PTSILD KP    G+         K    F PN+  +   +KLDS+G+G  L IVD LND
Sbjct: 59   PTSILDNKPLFPFGSPFGFDINQPKSPRVFSPNNKQQHLPEKLDSKGIG--LAIVDTLND 116

Query: 1133 QET-NKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXXXXXXDFGIKPRNSQLGLFSP 1309
                +K  S++ ++MVLFG++L++QI                   FGIK RNS L     
Sbjct: 117  TPIEDKSSSETSNKMVLFGAKLRVQIPPLPSSLRSPTTSPISPTYFGIKNRNSHLS---- 172

Query: 1310 CSTKKSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYTCVIARGPNPRTTHIYGDCIV 1489
                 SPFGS +S +   +SP+VFTGCL   EMELSEDYTCVI+ GPNP+TTHI+ +C+V
Sbjct: 173  -----SPFGSPDSDIHVKDSPRVFTGCLPVREMELSEDYTCVISHGPNPKTTHIFDNCVV 227

Query: 1490 ENCCGGSGFSLPKKDNCFSVDQSSYLNENFLSFCYNCKK 1606
            E+ C     +LP        D+     E+FLSFC+ CKK
Sbjct: 228  ESYC-----TLP--------DKPKSAPESFLSFCHTCKK 253


>ref|XP_007050556.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508702817|gb|EOX94713.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 287

 Score =  193 bits (491), Expect = 2e-46
 Identities = 125/279 (44%), Positives = 160/279 (57%), Gaps = 21/279 (7%)
 Frame = +2

Query: 833  MLRKRSRAVTSNKRTLMAD---------KHIRPISSFFNSPRFFSGFSAIGSPETESMMS 985
            MLR RSRAVTS K+ LMAD          + RPI SFF SPRF   F+  G P+TE++ S
Sbjct: 1    MLRNRSRAVTS-KQALMADHSSQSTPAQNYTRPIPSFFGSPRF-KAFTTKGLPDTEAVKS 58

Query: 986  PTSILDTKPFSVLGNV--------KPSLDFRPNHPFE---KKLDSRGVGVGLGIVDALND 1132
            PTSILD KP    G+         K    F PN+  +   +KLDS+G+G  L IVD LND
Sbjct: 59   PTSILDNKPLFPFGSPFGFDINQPKSPRVFSPNNKQQHLPEKLDSKGIG--LAIVDTLND 116

Query: 1133 QET-NKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXXXXXXDFGIKPRNSQLGLFSP 1309
                +K  S++ ++MVLFG++L++QI                   FGIK RNS L     
Sbjct: 117  TPIEDKSSSETSNKMVLFGAKLRVQIPPLPSSLRSPTTSPISPTYFGIKNRNSHLS---- 172

Query: 1310 CSTKKSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYTCVIARGPNPRTTHIYGDCIV 1489
                 SPFGS +S +   +SP+VFTGCL   EMELSEDYTCVI+ GPNP+TTHI+ +C+V
Sbjct: 173  -----SPFGSPDSDIHVKDSPRVFTGCLPVREMELSEDYTCVISHGPNPKTTHIFDNCVV 227

Query: 1490 ENCCGGSGFSLPKKDNCFSVDQSSYLNENFLSFCYNCKK 1606
            E+ C     +LP        D+     E+FLSFC+ CKK
Sbjct: 228  ESYC-----TLP--------DKPKSAPESFLSFCHTCKK 253


>ref|XP_006379837.1| hypothetical protein POPTR_0008s15440g [Populus trichocarpa]
            gi|550333137|gb|ERP57634.1| hypothetical protein
            POPTR_0008s15440g [Populus trichocarpa]
          Length = 270

 Score =  192 bits (488), Expect = 4e-46
 Identities = 124/252 (49%), Positives = 149/252 (59%), Gaps = 13/252 (5%)
 Frame = +2

Query: 887  DKHIRPISSFFNSPRFFSGFSAIGSPET-ESMMSPTSILDTKPFSVLGN-VKPSLDFRPN 1060
            DK  +P S     P+  + F+     ET E++MSPTSILD+KPFS L N   P  +  P 
Sbjct: 12   DKFRKPTSF----PKLLTAFTFKNFSETSEAIMSPTSILDSKPFSGLKNPFWPDPNPSPK 67

Query: 1061 HPFEK------KLDSRGVGVGLGIVDALNDQETNKKFSKSDSRMVLFGSQLKIQIXXXXX 1222
             P  +      KLDS+G+G  LGIVDAL+D++T+   SK +SR VLFGSQLKIQI     
Sbjct: 68   TPEPETRRHWDKLDSKGIG--LGIVDALDDEKTDSNLSKPESRTVLFGSQLKIQIPP--- 122

Query: 1223 XXXXXXXXXXXXXDFGIKPRNSQLGLFSPC---STKKSPFGSSNSGLETPNSPQVFTGCL 1393
                                      F P    +T +SP   +NSG+ETPNSP+VF GCL
Sbjct: 123  --------------------------FPPSFLSTTDQSPKSPANSGMETPNSPRVFAGCL 156

Query: 1394 SASEMELSEDYTCVIARGPNPRTTHIYGDCIVENCCGGSGF--SLPKKDNCFSVDQSSYL 1567
            SASEMELSEDYTCVI  GP PRTTHI+ +CIVE+CCG  GF  SL K +N F  D SSY 
Sbjct: 157  SASEMELSEDYTCVITHGPVPRTTHIFDNCIVESCCGVVGFSTSLKKDNNRFLGDGSSYP 216

Query: 1568 NENFLSFCYNCK 1603
              NFLSFC  CK
Sbjct: 217  PNNFLSFCSACK 228


>ref|XP_003556054.1| PREDICTED: uncharacterized protein LOC100807906 isoform X1 [Glycine
            max] gi|571567339|ref|XP_006606057.1| PREDICTED:
            uncharacterized protein LOC100807906 isoform X2 [Glycine
            max] gi|571567346|ref|XP_006606058.1| PREDICTED:
            uncharacterized protein LOC100807906 isoform X3 [Glycine
            max]
          Length = 269

 Score =  186 bits (472), Expect = 3e-44
 Identities = 117/244 (47%), Positives = 150/244 (61%), Gaps = 11/244 (4%)
 Frame = +2

Query: 908  SSFFNSPRFFSGFSAIGSPETESMMSPTSILDTKPFSVLGNVKPSLDFRPNHPFEK---- 1075
            SSFF+SPR F+ F+  G  ETE+MMSPTS LD+KPFS   N   S    P  P  +    
Sbjct: 14   SSFFSSPRLFTNFTPKGFHETETMMSPTSTLDSKPFSGFKNPFWSETNSPRTPVSEHKRY 73

Query: 1076 --KLDSRGVGVGLGIVDALNDQETNKKF-SKSDSRMVLFGSQLKIQIXXXXXXXXXXXXX 1246
              KLDS+  G+GLG+VDAL D+E + +  SKS+SRMV+FGSQLKIQI             
Sbjct: 74   WDKLDSK--GIGLGLVDALVDEEKHGEVSSKSESRMVVFGSQLKIQIPP----------- 120

Query: 1247 XXXXXDFGIKPRNSQLGLFSPCSTKK--SPFGSSNSGLETPNSPQVFTGCLSASEMELSE 1420
                               SP  + K  +  G+S+SG+   NS +VF GCLSASEMELSE
Sbjct: 121  ------------------LSPSESSKFVAEKGNSSSGVADANSQRVFMGCLSASEMELSE 162

Query: 1421 DYTCVIARGPNPRTTHIYGDCIVENCCG--GSGFSLPKKDNCFSVDQSSYLNENFLSFCY 1594
            DYT VI+RGPNPRTTHI+ +CI+E+ C   G   S  K++ CF +DQ+SY + +FLS C+
Sbjct: 163  DYTRVISRGPNPRTTHIFDNCIIESSCFELGCSASSVKENGCF-LDQTSYHSRSFLSVCF 221

Query: 1595 NCKK 1606
            +CKK
Sbjct: 222  HCKK 225


>ref|XP_002524763.1| conserved hypothetical protein [Ricinus communis]
            gi|223535947|gb|EEF37606.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 279

 Score =  186 bits (471), Expect = 3e-44
 Identities = 118/275 (42%), Positives = 151/275 (54%), Gaps = 17/275 (6%)
 Frame = +2

Query: 833  MLRKRSRAVTSNKRTLMAD---------KHIRPISSFFNSPRFFSGFSAIGSPETESMMS 985
            MLR RSRAVTS K+ LM D          H +PI SFF SPRF  GF+   SPE E ++S
Sbjct: 1    MLRNRSRAVTS-KQALMTDHSSHSPSTQNHTKPIPSFFGSPRF-KGFTFKRSPEAEPVIS 58

Query: 986  PTSILDTKPFSVLGNVKPSLDFRPNHPFEK--------KLDSRGVGVGLGIVDALNDQET 1141
            PTSIL+  PFS   N       +P  P           KLDS+G+ V L   +  N+Q  
Sbjct: 59   PTSILE--PFSSFKNPFCHDTNQPKSPRVSSENKYSWDKLDSKGIAVALIDEEKPNEQNN 116

Query: 1142 NKKFSKSDSRMVLFGSQLKIQIXXXXXXXXXXXXXXXXXXDFGIKPRNSQLGLFSPCSTK 1321
            +KK SK  ++MVL+G++L++QI                  DFGIK RN+QL         
Sbjct: 117  SKKISKPSNKMVLYGTKLRVQIPPPANFMFSAADSPISPGDFGIKTRNAQLS-------- 168

Query: 1322 KSPFGSSNSGLETPNSPQVFTGCLSASEMELSEDYTCVIARGPNPRTTHIYGDCIVENCC 1501
                  S SG++T  SP VFTGC+  SE+ELSEDYTCVI+ GPNP+TTH +G+C++EN C
Sbjct: 169  -----GSGSGIQTKESPGVFTGCVPMSELELSEDYTCVISYGPNPKTTHKFGNCVLENYC 223

Query: 1502 GGSGFSLPKKDNCFSVDQSSYLNENFLSFCYNCKK 1606
              S             D+S+    NFLSFC+ CKK
Sbjct: 224  SLS-------------DKSNSAPNNFLSFCHKCKK 245


Top