BLASTX nr result

ID: Akebia27_contig00001950 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00001950
         (1151 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002512882.1| conserved hypothetical protein [Ricinus comm...   339   1e-90
ref|XP_007216996.1| hypothetical protein PRUPE_ppa002575mg [Prun...   327   4e-87
emb|CBI38174.3| unnamed protein product [Vitis vinifera]              320   5e-85
ref|XP_002263726.1| PREDICTED: uncharacterized protein At2g33490...   320   5e-85
ref|XP_006438830.1| hypothetical protein CICLE_v10030944mg [Citr...   319   2e-84
gb|EXB40356.1| Uncharacterized protein L484_017498 [Morus notabi...   318   4e-84
ref|XP_006483028.1| PREDICTED: uncharacterized protein At2g33490...   317   5e-84
ref|XP_007221971.1| hypothetical protein PRUPE_ppa002745mg [Prun...   315   2e-83
ref|XP_002275111.2| PREDICTED: uncharacterized protein At2g33490...   314   4e-83
emb|CBI21307.3| unnamed protein product [Vitis vinifera]              314   4e-83
ref|XP_007031960.1| Hydroxyproline-rich glycoprotein family prot...   312   1e-82
ref|XP_004304547.1| PREDICTED: uncharacterized protein At2g33490...   312   2e-82
ref|XP_007043861.1| Hydroxyproline-rich glycoprotein family prot...   311   4e-82
gb|EXC29160.1| hypothetical protein L484_005672 [Morus notabilis]     308   2e-81
ref|XP_007161663.1| hypothetical protein PHAVU_001G088100g [Phas...   308   3e-81
ref|XP_007161662.1| hypothetical protein PHAVU_001G088100g [Phas...   308   3e-81
ref|XP_006373131.1| hypothetical protein POPTR_0017s09000g [Popu...   308   4e-81
ref|XP_006447002.1| hypothetical protein CICLE_v10014551mg [Citr...   307   5e-81
ref|XP_006604167.1| PREDICTED: uncharacterized protein At2g33490...   307   6e-81
ref|XP_002512634.1| conserved hypothetical protein [Ricinus comm...   306   1e-80

>ref|XP_002512882.1| conserved hypothetical protein [Ricinus communis]
            gi|223547893|gb|EEF49385.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 656

 Score =  339 bits (869), Expect = 1e-90
 Identities = 177/244 (72%), Positives = 194/244 (79%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MKS L KLRGF LHK+D K+K DL PSA LDELAQA EDMQ++RNCYD            
Sbjct: 1    MKSPLGKLRGFKLHKSDTKDKRDLLPSAQLDELAQAAEDMQDMRNCYDSLLSAAAATANS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   L+EMG+CLLEKTA +DDE+SGR LLMLGKVQFELQKLVD YRSHIF TIT PS
Sbjct: 61   AYEFSESLQEMGSCLLEKTALHDDEQSGRVLLMLGKVQFELQKLVDSYRSHIFLTITNPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNELR VEDMKRQCDEK        AQ +EKGKSK+ KGESF+ ++L+ AHDEYDEE
Sbjct: 121  ESLLNELRTVEDMKRQCDEKRNVYEYMVAQQKEKGKSKSGKGESFTLQELRTAHDEYDEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT  VFRLKSLKQGQSRSLLTQAARHHAAQLNFF KGLKSLEAVD HV++V E+QHIDYQ
Sbjct: 181  ATLCVFRLKSLKQGQSRSLLTQAARHHAAQLNFFRKGLKSLEAVDDHVKIVAEQQHIDYQ 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>ref|XP_007216996.1| hypothetical protein PRUPE_ppa002575mg [Prunus persica]
            gi|462413146|gb|EMJ18195.1| hypothetical protein
            PRUPE_ppa002575mg [Prunus persica]
          Length = 657

 Score =  327 bits (839), Expect = 4e-87
 Identities = 170/244 (69%), Positives = 189/244 (77%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MKSSL  LR F LH+NDAK+K D+QP A +DELAQA +DMQ++RNCYD            
Sbjct: 1    MKSSLGMLRRFELHRNDAKDKRDVQPLAQVDELAQAAQDMQDMRNCYDGLLSAAAATANS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   LREMG CLLEKTA +DDEESG+  LMLGKVQFELQKLVD YRSHIF TIT PS
Sbjct: 61   AYEFSESLREMGACLLEKTALHDDEESGKVFLMLGKVQFELQKLVDSYRSHIFLTITNPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNELR VE+MKRQCDEK        AQ +EKG+SK  KGE FS +QLQ AHDEYDEE
Sbjct: 121  ESLLNELRTVEEMKRQCDEKRDVYEYMVAQQKEKGRSKRGKGEHFSLQQLQVAHDEYDEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT  VFRLKSLKQGQ+RSLLTQA RHHAAQLNFF KGLKSLEAV+ HV+ +TE+QHI+YQ
Sbjct: 181  ATLCVFRLKSLKQGQARSLLTQATRHHAAQLNFFRKGLKSLEAVEPHVRFITEEQHIEYQ 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>emb|CBI38174.3| unnamed protein product [Vitis vinifera]
          Length = 651

 Score =  320 bits (821), Expect = 5e-85
 Identities = 164/244 (67%), Positives = 192/244 (78%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MK+SL+KLRGFALH+ D +E+  +QP A LDEL QA +DMQ++RNCYD            
Sbjct: 1    MKTSLRKLRGFALHRQDVRERRVVQPLAQLDELVQATQDMQDMRNCYDTLLSAAAATANS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   L+E+G CLLEKTA N+DEESG+ LL LGKVQ+ELQKLVD YRSHIFQTI TPS
Sbjct: 61   AYEFSESLKELGGCLLEKTALNEDEESGKVLLKLGKVQYELQKLVDSYRSHIFQTIVTPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXX-TAQREKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLL ELR VE+MKRQCDEK        T QREKG+S++ +GE+FSS Q+QAA DEYDEE
Sbjct: 121  ESLLKELRTVEEMKRQCDEKRDVYEYMITRQREKGRSRSGRGETFSSHQVQAACDEYDEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT FVFRLKSLKQGQSRSLLTQA+RHHAAQL+FF K LKSLE ++ HV+LVTE+QHIDY+
Sbjct: 181  ATLFVFRLKSLKQGQSRSLLTQASRHHAAQLSFFRKALKSLEVIEPHVKLVTEQQHIDYK 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>ref|XP_002263726.1| PREDICTED: uncharacterized protein At2g33490-like [Vitis vinifera]
          Length = 653

 Score =  320 bits (821), Expect = 5e-85
 Identities = 164/244 (67%), Positives = 192/244 (78%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MK+SL+KLRGFALH+ D +E+  +QP A LDEL QA +DMQ++RNCYD            
Sbjct: 1    MKTSLRKLRGFALHRQDVRERRVVQPLAQLDELVQATQDMQDMRNCYDTLLSAAAATANS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   L+E+G CLLEKTA N+DEESG+ LL LGKVQ+ELQKLVD YRSHIFQTI TPS
Sbjct: 61   AYEFSESLKELGGCLLEKTALNEDEESGKVLLKLGKVQYELQKLVDSYRSHIFQTIVTPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXX-TAQREKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLL ELR VE+MKRQCDEK        T QREKG+S++ +GE+FSS Q+QAA DEYDEE
Sbjct: 121  ESLLKELRTVEEMKRQCDEKRDVYEYMITRQREKGRSRSGRGETFSSHQVQAACDEYDEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT FVFRLKSLKQGQSRSLLTQA+RHHAAQL+FF K LKSLE ++ HV+LVTE+QHIDY+
Sbjct: 181  ATLFVFRLKSLKQGQSRSLLTQASRHHAAQLSFFRKALKSLEVIEPHVKLVTEQQHIDYK 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>ref|XP_006438830.1| hypothetical protein CICLE_v10030944mg [Citrus clementina]
            gi|557541026|gb|ESR52070.1| hypothetical protein
            CICLE_v10030944mg [Citrus clementina]
          Length = 633

 Score =  319 bits (817), Expect = 2e-84
 Identities = 166/245 (67%), Positives = 191/245 (77%), Gaps = 2/245 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKN-DAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXX 559
            MK+SL++ RGF LHK+ D+K++ DL+P A LDELAQA +DMQ++R CYD           
Sbjct: 1    MKTSLRRWRGFTLHKHGDSKDRRDLRPLAQLDELAQASQDMQDMRGCYDSLLSAAAATAN 60

Query: 560  XXXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTP 739
                    L+E+G CLLEKTA ND+EESG+ LLMLGKVQFELQKLVD YRSHIFQTIT P
Sbjct: 61   SAYEFSESLQELGACLLEKTALNDNEESGKVLLMLGKVQFELQKLVDAYRSHIFQTITIP 120

Query: 740  SESLLNELRNVEDMKRQCDEKXXXXXXXTA-QREKGKSKTTKGESFSSEQLQAAHDEYDE 916
            SESLLNEL+ VE+MKRQCDEK          QREKG+SK  KGE+FS +QLQ AHDEYD+
Sbjct: 121  SESLLNELQTVEEMKRQCDEKRNVCEYMLMRQREKGRSKNGKGETFSLQQLQEAHDEYDQ 180

Query: 917  EATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDY 1096
            EAT FVFRLKSLKQGQSRSLLTQAARHHAAQL+F  K LKSLEAVD HV++V E+QHIDY
Sbjct: 181  EATLFVFRLKSLKQGQSRSLLTQAARHHAAQLSFVKKALKSLEAVDPHVKMVAEQQHIDY 240

Query: 1097 QFSGL 1111
            QF GL
Sbjct: 241  QFRGL 245


>gb|EXB40356.1| Uncharacterized protein L484_017498 [Morus notabilis]
          Length = 589

 Score =  318 bits (814), Expect = 4e-84
 Identities = 166/245 (67%), Positives = 190/245 (77%), Gaps = 2/245 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKND-AKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXX 559
            MK+SL+KLRGF LHK   AK++ DL+P A LDELAQA  DMQ++R+CYD           
Sbjct: 1    MKTSLRKLRGFTLHKQGGAKDRKDLRPLAQLDELAQAYRDMQDMRDCYDGLLASAAATAN 60

Query: 560  XXXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTP 739
                    LREMG CLLEKTA NDDEESG+ LLMLGK Q+E+QK+VD YRSHIFQTIT P
Sbjct: 61   SAYEFSESLREMGACLLEKTALNDDEESGKVLLMLGKAQYEIQKIVDSYRSHIFQTITVP 120

Query: 740  SESLLNELRNVEDMKRQCDEKXXXXXXXTA-QREKGKSKTTKGESFSSEQLQAAHDEYDE 916
            SESLLNELR VE+MKRQCDEK        + QREKG+S++ KGESF+ +QLQ A DEYDE
Sbjct: 121  SESLLNELRTVEEMKRQCDEKRDVYEYIKSRQREKGRSRSGKGESFTVQQLQMARDEYDE 180

Query: 917  EATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDY 1096
            EAT FVFRLKSLKQGQSRSLLTQAARHHAAQL FF K L+SLEAV+ HV+LV+E+QHIDY
Sbjct: 181  EATLFVFRLKSLKQGQSRSLLTQAARHHAAQLCFFKKALRSLEAVEPHVKLVSEQQHIDY 240

Query: 1097 QFSGL 1111
             F GL
Sbjct: 241  HFDGL 245


>ref|XP_006483028.1| PREDICTED: uncharacterized protein At2g33490-like isoform X1 [Citrus
            sinensis] gi|568858996|ref|XP_006483029.1| PREDICTED:
            uncharacterized protein At2g33490-like isoform X2 [Citrus
            sinensis]
          Length = 633

 Score =  317 bits (813), Expect = 5e-84
 Identities = 165/245 (67%), Positives = 191/245 (77%), Gaps = 2/245 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKN-DAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXX 559
            MK+SL++ RGF LHK+ D+K++ DL+P A LDELAQA +DMQ++R CYD           
Sbjct: 1    MKTSLRRWRGFTLHKHGDSKDRRDLRPLAQLDELAQASQDMQDMRGCYDSLLSAAAATAN 60

Query: 560  XXXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTP 739
                    L+E+G CLLEKTA ND+EESG+ LLMLGKVQFELQKLVD YRSHIFQTIT P
Sbjct: 61   SAYEFSESLQELGACLLEKTALNDNEESGKVLLMLGKVQFELQKLVDAYRSHIFQTITIP 120

Query: 740  SESLLNELRNVEDMKRQCDEKXXXXXXXTA-QREKGKSKTTKGESFSSEQLQAAHDEYDE 916
            SESLLNEL+ VE+MK+QCDEK          QREKG+SK  KGE+FS +QLQ AHDEYD+
Sbjct: 121  SESLLNELQTVEEMKQQCDEKRNVCEYMLMRQREKGRSKNGKGETFSLQQLQEAHDEYDQ 180

Query: 917  EATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDY 1096
            EAT FVFRLKSLKQGQSRSLLTQAARHHAAQL+F  K LKSLEAVD HV++V E+QHIDY
Sbjct: 181  EATLFVFRLKSLKQGQSRSLLTQAARHHAAQLSFVKKALKSLEAVDPHVKMVAEQQHIDY 240

Query: 1097 QFSGL 1111
            QF GL
Sbjct: 241  QFRGL 245


>ref|XP_007221971.1| hypothetical protein PRUPE_ppa002745mg [Prunus persica]
            gi|462418907|gb|EMJ23170.1| hypothetical protein
            PRUPE_ppa002745mg [Prunus persica]
          Length = 638

 Score =  315 bits (807), Expect = 2e-83
 Identities = 161/244 (65%), Positives = 191/244 (78%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MK+SL+KLRGFALHK+DAK++ DL+P   LDELAQA + MQ++R+CYD            
Sbjct: 1    MKTSLRKLRGFALHKHDAKDRRDLRPLPQLDELAQAAQGMQDMRDCYDSLLSAAAATANS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   LREMG+CLL+KTA NDDEESGR LL LGK+QFEL KLVD YRSHIFQTI  PS
Sbjct: 61   AYEFSESLREMGSCLLQKTALNDDEESGRVLLKLGKLQFELHKLVDSYRSHIFQTIAVPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTA-QREKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNEL+ VE+MKRQCDEK          Q+EKG+S+  KGESFS +Q+Q A +EYDEE
Sbjct: 121  ESLLNELQTVEEMKRQCDEKRDVYEYMIKRQKEKGRSRGGKGESFSVQQIQLAREEYDEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT F+FRLKSLKQGQSRSLLTQAARHHAAQL FF K L+S+E+V+ HV+LVTE+QHIDY+
Sbjct: 181  ATLFIFRLKSLKQGQSRSLLTQAARHHAAQLCFFKKALRSVESVEPHVKLVTEQQHIDYE 240

Query: 1100 FSGL 1111
            F+GL
Sbjct: 241  FNGL 244


>ref|XP_002275111.2| PREDICTED: uncharacterized protein At2g33490-like [Vitis vinifera]
          Length = 700

 Score =  314 bits (805), Expect = 4e-83
 Identities = 169/242 (69%), Positives = 190/242 (78%), Gaps = 2/242 (0%)
 Frame = +2

Query: 392  SLKKLRGFALHKNDA-KEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXXXX 568
            SL KLR FAL KNDA KEK D Q SAH+DELAQA +DMQE+RNCYD              
Sbjct: 3    SLGKLRKFALPKNDASKEKRDAQLSAHVDELAQASQDMQEMRNCYDSLLSAAAATTNSAY 62

Query: 569  XXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPSES 748
                 L EMG+CL+EKT+ NDDEESG+ LLM+GKVQF+LQKLVD YRSHI QTIT PSES
Sbjct: 63   EFSVSLGEMGSCLVEKTSINDDEESGKVLLMMGKVQFDLQKLVDSYRSHIIQTITNPSES 122

Query: 749  LLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEEAT 925
            LLNELR VE+MKRQCDEK        AQ REKG+SK+ KGES   +QL AAHDE+++EAT
Sbjct: 123  LLNELRTVEEMKRQCDEKRNVYEYMKAQQREKGRSKSGKGESL--QQLTAAHDEFNDEAT 180

Query: 926  YFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQFS 1105
              VFRLKSLKQGQSRSLLTQAARHHAAQLNFF KGLKSLEAV+QH+++V E+QHIDYQFS
Sbjct: 181  LCVFRLKSLKQGQSRSLLTQAARHHAAQLNFFRKGLKSLEAVEQHLRVVAERQHIDYQFS 240

Query: 1106 GL 1111
            GL
Sbjct: 241  GL 242


>emb|CBI21307.3| unnamed protein product [Vitis vinifera]
          Length = 643

 Score =  314 bits (805), Expect = 4e-83
 Identities = 169/242 (69%), Positives = 190/242 (78%), Gaps = 2/242 (0%)
 Frame = +2

Query: 392  SLKKLRGFALHKNDA-KEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXXXX 568
            SL KLR FAL KNDA KEK D Q SAH+DELAQA +DMQE+RNCYD              
Sbjct: 3    SLGKLRKFALPKNDASKEKRDAQLSAHVDELAQASQDMQEMRNCYDSLLSAAAATTNSAY 62

Query: 569  XXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPSES 748
                 L EMG+CL+EKT+ NDDEESG+ LLM+GKVQF+LQKLVD YRSHI QTIT PSES
Sbjct: 63   EFSVSLGEMGSCLVEKTSINDDEESGKVLLMMGKVQFDLQKLVDSYRSHIIQTITNPSES 122

Query: 749  LLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEEAT 925
            LLNELR VE+MKRQCDEK        AQ REKG+SK+ KGES   +QL AAHDE+++EAT
Sbjct: 123  LLNELRTVEEMKRQCDEKRNVYEYMKAQQREKGRSKSGKGESL--QQLTAAHDEFNDEAT 180

Query: 926  YFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQFS 1105
              VFRLKSLKQGQSRSLLTQAARHHAAQLNFF KGLKSLEAV+QH+++V E+QHIDYQFS
Sbjct: 181  LCVFRLKSLKQGQSRSLLTQAARHHAAQLNFFRKGLKSLEAVEQHLRVVAERQHIDYQFS 240

Query: 1106 GL 1111
            GL
Sbjct: 241  GL 242


>ref|XP_007031960.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508710989|gb|EOY02886.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 649

 Score =  312 bits (800), Expect = 1e-82
 Identities = 165/244 (67%), Positives = 184/244 (75%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MKSS  KLR FALHKNDAK+K+D+  SAHLDELAQA +DMQ++RNCYD            
Sbjct: 1    MKSSFDKLRRFALHKNDAKDKLDVLSSAHLDELAQAAQDMQDMRNCYDSLLSAAAATANS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   L+EMG+CL EK    DDEES R LLMLG +QFELQKLVD YR+HI  TIT PS
Sbjct: 61   AYEFSESLQEMGSCLREKRVLPDDEESSRILLMLGNLQFELQKLVDNYRAHILLTITNPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXX-TAQREKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNELR VEDMKRQCDEK        T Q+EKG+ K  KGE+ + +QLQ A DEYDE 
Sbjct: 121  ESLLNELRTVEDMKRQCDEKRNVYEYMVTQQKEKGRLKGGKGETLTLQQLQTARDEYDEV 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT  VFRLKSLKQGQSRSLLTQAARHHAAQLNFF KGLKSLEA++ HV+ VTE+QHIDYQ
Sbjct: 181  ATLCVFRLKSLKQGQSRSLLTQAARHHAAQLNFFRKGLKSLEAIEPHVRQVTEQQHIDYQ 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>ref|XP_004304547.1| PREDICTED: uncharacterized protein At2g33490-like [Fragaria vesca
            subsp. vesca]
          Length = 626

 Score =  312 bits (799), Expect = 2e-82
 Identities = 161/244 (65%), Positives = 186/244 (76%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            M+SS  +LR F LHK+ +K++ + QPSAH+DELAQA +DMQ +RNCYD            
Sbjct: 1    MRSSFDRLRRFELHKSGSKDQRNFQPSAHVDELAQAAQDMQGMRNCYDSLLSAAAATANS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   LREMG+CL EKT  +DDE  GR  LMLG VQFELQKLVD YRSHI  T+T PS
Sbjct: 61   AYEFSESLREMGSCLQEKTVLHDDEHGGRVFLMLGNVQFELQKLVDSYRSHIVSTVTNPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNELR VE+MKRQCDEK        AQ +EKG+SK  KGESF+ +QLQAA D+YDE+
Sbjct: 121  ESLLNELRTVEEMKRQCDEKREVYEYMVAQQKEKGRSKRGKGESFTLQQLQAARDQYDED 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT  VFRLKSLKQGQ+RSLLTQAARHHAAQLNFF KGLKSLEAV+ HV+LVTE+QHI+YQ
Sbjct: 181  ATLCVFRLKSLKQGQARSLLTQAARHHAAQLNFFRKGLKSLEAVEPHVRLVTEEQHIEYQ 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>ref|XP_007043861.1| Hydroxyproline-rich glycoprotein family protein, putative [Theobroma
            cacao] gi|508707796|gb|EOX99692.1| Hydroxyproline-rich
            glycoprotein family protein, putative [Theobroma cacao]
          Length = 654

 Score =  311 bits (796), Expect = 4e-82
 Identities = 161/246 (65%), Positives = 190/246 (77%), Gaps = 3/246 (1%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKN--DAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXX 556
            MK+SL++LRGFALHK   + K++ DL+P A LDELAQA +DM+++R+CYD          
Sbjct: 1    MKTSLRRLRGFALHKRGGETKDRRDLRPLAQLDELAQASQDMEDMRDCYDSLLSAAAATA 60

Query: 557  XXXXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITT 736
                     LRE+G CLL KTA NDDEE G+ LLMLGKVQFELQK VD YRSH+F+TIT+
Sbjct: 61   NSAYEFSVSLRELGACLLAKTALNDDEECGKVLLMLGKVQFELQKHVDSYRSHLFKTITS 120

Query: 737  PSESLLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYD 913
            PS+SLLNELR VE+MKRQCDEK         + +EKG+SK+ K E+FS +QLQ AHDEYD
Sbjct: 121  PSDSLLNELRIVEEMKRQCDEKRNVYEYMAMRLKEKGRSKSGKVENFSMQQLQVAHDEYD 180

Query: 914  EEATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHID 1093
            EEAT FVFRLKSLKQGQSRSLLTQAARHHAAQL+FF K LKSLE V+ HVQ +TE+QHID
Sbjct: 181  EEATLFVFRLKSLKQGQSRSLLTQAARHHAAQLSFFKKALKSLEEVEPHVQKITEQQHID 240

Query: 1094 YQFSGL 1111
            Y FSGL
Sbjct: 241  YHFSGL 246


>gb|EXC29160.1| hypothetical protein L484_005672 [Morus notabilis]
          Length = 646

 Score =  308 bits (790), Expect = 2e-81
 Identities = 169/243 (69%), Positives = 184/243 (75%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MKS L KLR F  +K D+K+K DLQ SA LDELAQA +DMQ++RNCYD            
Sbjct: 1    MKSPLGKLRKF--YKTDSKDKRDLQSSAQLDELAQAAKDMQDMRNCYDSLLSAAAATANS 58

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   LREMG CLLEKTA NDDEESGR LLMLGKVQFELQKLVD YR+HIF TIT PS
Sbjct: 59   AYEFSESLREMGDCLLEKTALNDDEESGRVLLMLGKVQFELQKLVDSYRAHIFLTITNPS 118

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTAQREKGKSKTTKGESFSSEQLQAAHDEYDEEA 922
            ESLLNELR VED  R   E          Q+EKGK K+ K ESFSS+QL+AAHD YDEEA
Sbjct: 119  ESLLNELRTVEDFFRTVYE-----YMVAQQKEKGKPKSGKNESFSSQQLRAAHDAYDEEA 173

Query: 923  TYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQF 1102
            T  VFRLKSLKQGQSRSLLTQAARHHAAQLNFF KGLKSLEAV+ HV+LVTE+QHI+YQF
Sbjct: 174  TLCVFRLKSLKQGQSRSLLTQAARHHAAQLNFFRKGLKSLEAVEPHVRLVTEQQHIEYQF 233

Query: 1103 SGL 1111
            SGL
Sbjct: 234  SGL 236


>ref|XP_007161663.1| hypothetical protein PHAVU_001G088100g [Phaseolus vulgaris]
            gi|561035127|gb|ESW33657.1| hypothetical protein
            PHAVU_001G088100g [Phaseolus vulgaris]
          Length = 625

 Score =  308 bits (789), Expect = 3e-81
 Identities = 162/244 (66%), Positives = 184/244 (75%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MKSSL KL+  ALHK  +K+K D  P+    ELA A +DMQ++R+CYD            
Sbjct: 1    MKSSLSKLKKIALHKTVSKDKRDFHPTVKFHELALAAKDMQDMRDCYDSLLSAAAATQNS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   L+EMGTCLLEKTA NDDEESG+ L MLG VQ +LQKLVD YRSHI  TIT PS
Sbjct: 61   AYEFAESLQEMGTCLLEKTALNDDEESGKVLGMLGSVQLDLQKLVDSYRSHIVLTITNPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNELR VEDMKRQCDEK       +AQ +EKGKSK+ KGES + +QLQAAHD+Y+EE
Sbjct: 121  ESLLNELRTVEDMKRQCDEKREVYEYMSAQQKEKGKSKSGKGESITLQQLQAAHDDYEEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT   FRLKSLKQGQSRSLLTQAARHHAAQLNFF KGLKSLEAV+ HV++V E+QHIDYQ
Sbjct: 181  ATLCAFRLKSLKQGQSRSLLTQAARHHAAQLNFFRKGLKSLEAVEPHVRMVAERQHIDYQ 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>ref|XP_007161662.1| hypothetical protein PHAVU_001G088100g [Phaseolus vulgaris]
            gi|561035126|gb|ESW33656.1| hypothetical protein
            PHAVU_001G088100g [Phaseolus vulgaris]
          Length = 622

 Score =  308 bits (789), Expect = 3e-81
 Identities = 162/244 (66%), Positives = 184/244 (75%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MKSSL KL+  ALHK  +K+K D  P+    ELA A +DMQ++R+CYD            
Sbjct: 1    MKSSLSKLKKIALHKTVSKDKRDFHPTVKFHELALAAKDMQDMRDCYDSLLSAAAATQNS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   L+EMGTCLLEKTA NDDEESG+ L MLG VQ +LQKLVD YRSHI  TIT PS
Sbjct: 61   AYEFAESLQEMGTCLLEKTALNDDEESGKVLGMLGSVQLDLQKLVDSYRSHIVLTITNPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNELR VEDMKRQCDEK       +AQ +EKGKSK+ KGES + +QLQAAHD+Y+EE
Sbjct: 121  ESLLNELRTVEDMKRQCDEKREVYEYMSAQQKEKGKSKSGKGESITLQQLQAAHDDYEEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT   FRLKSLKQGQSRSLLTQAARHHAAQLNFF KGLKSLEAV+ HV++V E+QHIDYQ
Sbjct: 181  ATLCAFRLKSLKQGQSRSLLTQAARHHAAQLNFFRKGLKSLEAVEPHVRMVAERQHIDYQ 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>ref|XP_006373131.1| hypothetical protein POPTR_0017s09000g [Populus trichocarpa]
            gi|550319837|gb|ERP50928.1| hypothetical protein
            POPTR_0017s09000g [Populus trichocarpa]
          Length = 625

 Score =  308 bits (788), Expect = 4e-81
 Identities = 161/238 (67%), Positives = 181/238 (76%), Gaps = 1/238 (0%)
 Frame = +2

Query: 401  KLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXXXXXXXX 580
            KLRGF L +++ KEK+DL P A LDELAQA  DMQ++RNCYD                  
Sbjct: 4    KLRGFGLKRSETKEKIDLLPPAQLDELAQAARDMQDMRNCYDSLLFAAAATANSAYEFSE 63

Query: 581  XLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPSESLLNE 760
             LREMG+CLLEKTA +DDEESG+ LLMLG VQFELQKLVD YRSHIF TIT PSESLLNE
Sbjct: 64   SLREMGSCLLEKTALHDDEESGKVLLMLGNVQFELQKLVDSYRSHIFLTITNPSESLLNE 123

Query: 761  LRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEEATYFVF 937
            LR VEDMKRQCDEK        AQ ++KG+SK  K ES + +QL++A +EYDEEAT  VF
Sbjct: 124  LRTVEDMKRQCDEKRNVYEYMVAQQKDKGRSKGGKDESTTLQQLRSAREEYDEEATLCVF 183

Query: 938  RLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQFSGL 1111
            RLKSLKQGQSRSLLTQ ARHHAAQLNFF KGLKSLE V+ HV+L+TE QHIDY FSGL
Sbjct: 184  RLKSLKQGQSRSLLTQVARHHAAQLNFFQKGLKSLETVEPHVRLITEHQHIDYHFSGL 241


>ref|XP_006447002.1| hypothetical protein CICLE_v10014551mg [Citrus clementina]
            gi|568829044|ref|XP_006468842.1| PREDICTED:
            uncharacterized protein At2g33490-like [Citrus sinensis]
            gi|557549613|gb|ESR60242.1| hypothetical protein
            CICLE_v10014551mg [Citrus clementina]
          Length = 650

 Score =  307 bits (787), Expect = 5e-81
 Identities = 167/246 (67%), Positives = 189/246 (76%), Gaps = 3/246 (1%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSA-HLDELAQAEEDMQEIRNCYDXXXXXXXXXXX 559
            MKSSL KLR FALHK+D K+K+D  PS+  +D+L QA +DMQ +RNCYD           
Sbjct: 1    MKSSLSKLRRFALHKSDTKDKIDFLPSSSQVDDLDQAAQDMQVMRNCYDSLLSAAAATAN 60

Query: 560  XXXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTP 739
                    L+EMG+CL+EKT+ +DDEES + LLMLG+VQFELQKLVD YRS+IF TIT P
Sbjct: 61   SAYEFSESLQEMGSCLMEKTSLHDDEESRKVLLMLGEVQFELQKLVDNYRSNIFLTITNP 120

Query: 740  SESLLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFS-SEQLQAAHDEYD 913
            SESLLNELR VEDMKRQCDEK        AQ REKGKSK+ KGES S  +QLQAA+DEY+
Sbjct: 121  SESLLNELRTVEDMKRQCDEKRNVCEYVMAQQREKGKSKSGKGESVSLQQQLQAANDEYE 180

Query: 914  EEATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHID 1093
            EEA   VFRLKSLKQGQ RSLLTQAARHHAAQLNFF KG KSLEAVD HV+LV E+QHID
Sbjct: 181  EEARLCVFRLKSLKQGQYRSLLTQAARHHAAQLNFFRKGFKSLEAVDTHVRLVAERQHID 240

Query: 1094 YQFSGL 1111
            YQFSGL
Sbjct: 241  YQFSGL 246


>ref|XP_006604167.1| PREDICTED: uncharacterized protein At2g33490-like isoform X2 [Glycine
            max]
          Length = 623

 Score =  307 bits (786), Expect = 6e-81
 Identities = 162/244 (66%), Positives = 183/244 (75%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MKSSL KL+  ALHK  +K+K D  P+   DELA A +DMQ++R+CYD            
Sbjct: 1    MKSSLSKLKKIALHKTVSKDKKDFPPTVKFDELALAAKDMQDMRDCYDSLLAAAAATQNS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                   L++MGTCLLEKTA NDDEESG+ L MLG VQ ELQKLVD YRSHI  TIT PS
Sbjct: 61   AHEFAESLQDMGTCLLEKTALNDDEESGKVLGMLGSVQLELQKLVDSYRSHIVLTITNPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTAQ-REKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNELR VEDMKRQCDEK        AQ +EKGKSK+ KGESF+ +QLQAAH EY+EE
Sbjct: 121  ESLLNELRTVEDMKRQCDEKRNVYEYMIAQQKEKGKSKSGKGESFTLQQLQAAHAEYEEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT   FRLKSLKQGQSRSLLTQAARHHAAQLNFF KGLKSLEAV+ HV+++  +QHIDYQ
Sbjct: 181  ATLCAFRLKSLKQGQSRSLLTQAARHHAAQLNFFRKGLKSLEAVEPHVRMIAVRQHIDYQ 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244


>ref|XP_002512634.1| conserved hypothetical protein [Ricinus communis]
            gi|223548595|gb|EEF50086.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 681

 Score =  306 bits (783), Expect = 1e-80
 Identities = 158/244 (64%), Positives = 186/244 (76%), Gaps = 1/244 (0%)
 Frame = +2

Query: 383  MKSSLKKLRGFALHKNDAKEKMDLQPSAHLDELAQAEEDMQEIRNCYDXXXXXXXXXXXX 562
            MK+S KKLR FAL   + + + D++P A LDELAQA +DM++++ CYD            
Sbjct: 1    MKTSFKKLREFALRHGEHEVRKDVRPLAPLDELAQASQDMEDMKECYDSFLSAAAATANS 60

Query: 563  XXXXXXXLREMGTCLLEKTASNDDEESGRALLMLGKVQFELQKLVDGYRSHIFQTITTPS 742
                    REMG+CLL++TA NDDEESG+ LLMLGKVQFELQKL D YRSH+F+TIT PS
Sbjct: 61   SFEISEAWREMGSCLLQRTALNDDEESGKVLLMLGKVQFELQKLFDTYRSHLFRTITVPS 120

Query: 743  ESLLNELRNVEDMKRQCDEKXXXXXXXTA-QREKGKSKTTKGESFSSEQLQAAHDEYDEE 919
            ESLLNELR VE+MKRQCDEK          QREKG+ +  KGE+FS +QLQAAHDEYDEE
Sbjct: 121  ESLLNELRTVEEMKRQCDEKRNIYEYMIMRQREKGRGRNGKGETFSMQQLQAAHDEYDEE 180

Query: 920  ATYFVFRLKSLKQGQSRSLLTQAARHHAAQLNFFSKGLKSLEAVDQHVQLVTEKQHIDYQ 1099
            AT FVFRLKSLKQGQSRSLLTQAARH+AAQL+FF K LK LEA++ HV+LVTE+QHIDY 
Sbjct: 181  ATLFVFRLKSLKQGQSRSLLTQAARHYAAQLSFFKKALKCLEALEPHVKLVTEQQHIDYH 240

Query: 1100 FSGL 1111
            FSGL
Sbjct: 241  FSGL 244

