BLASTX nr result

ID: Akebia23_contig00020427 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00020427
         (592 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006379679.1| hypothetical protein POPTR_0008s09230g [Popu...   110   2e-22
emb|CBI27248.3| unnamed protein product [Vitis vinifera]              103   4e-20
ref|XP_002316103.2| hypothetical protein POPTR_0010s16940g [Popu...   100   3e-19
emb|CAN65039.1| hypothetical protein VITISV_009459 [Vitis vinifera]    97   3e-18
ref|XP_007045957.1| T-box transcription factor TBX5, putative is...    91   2e-16
ref|XP_006438780.1| hypothetical protein CICLE_v10030574mg [Citr...    91   2e-16
ref|XP_002512124.1| hypothetical protein RCOM_1621800 [Ricinus c...    91   3e-16
ref|XP_007225410.1| hypothetical protein PRUPE_ppa000582mg [Prun...    90   4e-16
gb|EXB65066.1| hypothetical protein L484_004242 [Morus notabilis]      89   1e-15
ref|XP_006483072.1| PREDICTED: uncharacterized protein LOC102619...    88   2e-15
ref|XP_007037462.1| Uncharacterized protein isoform 6 [Theobroma...    84   4e-14
ref|XP_007037461.1| Uncharacterized protein isoform 5 [Theobroma...    84   4e-14
ref|XP_007037460.1| Uncharacterized protein isoform 4 [Theobroma...    84   4e-14
ref|XP_007037459.1| Uncharacterized protein isoform 3 [Theobroma...    84   4e-14
ref|XP_007037458.1| Uncharacterized protein isoform 2 [Theobroma...    84   4e-14
ref|XP_007037457.1| Uncharacterized protein isoform 1 [Theobroma...    84   4e-14
gb|EXC11014.1| hypothetical protein L484_015234 [Morus notabilis]      82   8e-14
ref|XP_006583955.1| PREDICTED: uncharacterized protein LOC102665...    82   1e-13
ref|XP_002514707.1| hypothetical protein RCOM_1470750 [Ricinus c...    76   6e-12
ref|XP_007158617.1| hypothetical protein PHAVU_002G167700g [Phas...    70   3e-10

>ref|XP_006379679.1| hypothetical protein POPTR_0008s09230g [Populus trichocarpa]
            gi|550332708|gb|ERP57476.1| hypothetical protein
            POPTR_0008s09230g [Populus trichocarpa]
          Length = 1044

 Score =  110 bits (276), Expect = 2e-22
 Identities = 75/208 (36%), Positives = 109/208 (52%), Gaps = 29/208 (13%)
 Frame = -3

Query: 575  SNKKILGFTILDKPHIXXXXXXXXXXXXSLQYPYEVEEIQNRGKVGILNIDLSRDPALPD 396
            S +KILGF I +KPHI             +  P   EE++N  K  + +I+L  DPA+PD
Sbjct: 647  SCRKILGFPIFEKPHIPKNESSSFTSSS-VALPRLSEEVENSKKNKVFDINLPCDPAVPD 705

Query: 395  LDGQLSSENLIIDKGPDKDRSGSLCHINLNSCANE-------------------IDLEVP 273
            L  Q + E +++ K P    +   C I+LNSC N+                   IDLE P
Sbjct: 706  LAQQTAEEIVVVAKEPATKVANFRCQIDLNSCINDDETSLMPSVPVFSAKIVVGIDLEAP 765

Query: 272  VAPYAMEGIVRVGD--------SSGNQLETP-DELVKVAAEAIVTMSSLRVHNHSDNASC 120
              P   E I+   +        S+ +++E P DEL+++AA+AIV +SS    NH D+A+C
Sbjct: 766  AVPEIEENIISTEEKGHEAALQSTEHRVEIPTDELIRIAAKAIVAISSTSCQNHLDDATC 825

Query: 119  HPLEAS-TDSLHWFAEMVTSNIADPKSE 39
            +  EAS TD LHWF E+V+S   D +S+
Sbjct: 826  NLREASMTDPLHWFVEIVSSCGEDLESK 853


>emb|CBI27248.3| unnamed protein product [Vitis vinifera]
          Length = 891

 Score =  103 bits (257), Expect = 4e-20
 Identities = 80/240 (33%), Positives = 114/240 (47%), Gaps = 46/240 (19%)
 Frame = -3

Query: 590  VGDCWSNKKILGFTILDKPHIXXXXXXXXXXXXS-LQYPYEVEEIQNRGKVGILNIDLSR 414
            + DC  N+KILGF + +KPH+            + L Y  E ++I+N  K   L+I+L  
Sbjct: 528  ISDCPRNRKILGFPVFEKPHVSNNESYSLTSPSASLLYSSEGQDIENNWKNRALDINLPC 587

Query: 413  DPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSC------------------ANEI 288
            D A+PDL  Q  +E LII+KG   + +    HI+LNSC                  A EI
Sbjct: 588  DLAVPDLGKQTPAEVLIIEKGAHSNVACVRSHIDLNSCITEDDASMTPVPSTNVKIALEI 647

Query: 287  DLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAAEAIVTMSSLRVH 144
            DLE PV P   E ++   +S G Q ++P            DE  ++AAEAIV +SS    
Sbjct: 648  DLEAPVVPETEEDVLSGLESIGKQHDSPVQSLPHKDDGLLDEFARIAAEAIVAISS---- 703

Query: 143  NHSDNASCHPLEAST----------DSLHWFAEMVTSNIADPKSE-----AGKDRSDHDE 9
                + +C  LE+ T           SLHWF E+++S   D  S+      GKD  D++E
Sbjct: 704  ----SGNCSDLESPTHYLSEAPLKDSSLHWFVEVISSCADDLDSKFGSVLRGKDYVDNEE 759


>ref|XP_002316103.2| hypothetical protein POPTR_0010s16940g [Populus trichocarpa]
            gi|550329984|gb|EEF02274.2| hypothetical protein
            POPTR_0010s16940g [Populus trichocarpa]
          Length = 1114

 Score =  100 bits (249), Expect = 3e-19
 Identities = 77/225 (34%), Positives = 112/225 (49%), Gaps = 34/225 (15%)
 Frame = -3

Query: 575  SNKKILGFTILDKPHIXXXXXXXXXXXXSLQYPYEVEEIQNRGKVGILNIDLSRDPALPD 396
            S +KILGF I +KP I             L  P   EE+++  K  +L+I+L  DPA+PD
Sbjct: 681  SCRKILGFPIFEKPRIPKTEFSSFPSSS-LALPQLSEEVEDSKKNMVLDINLPCDPAVPD 739

Query: 395  LDGQLSSENLIIDKGPDKDRSGSLCHINLNSCANE-------------------IDLEVP 273
            L  Q + E  ++ K  D   +    HI+LNSC ++                   IDLE P
Sbjct: 740  LAQQTAEEVAVVAKEADTKVANFRFHIDLNSCISDDETSMLSSVPGSSAKVVAGIDLEAP 799

Query: 272  VAPYAMEGIVRVGD--------SSGNQLET-PDELVKVAAEAIVTMSSLRVHNHSDNASC 120
              P + E      +        S+ ++ E+  DEL+++AA+AIV +SS    NH D+A+C
Sbjct: 800  AVPESEENTFSREEKAHELPLQSTEHKAESLTDELIRIAADAIVAISSSGYQNHLDDATC 859

Query: 119  HPLEAS-TDSLHWFAEMVTSNIADPKSE-----AGKDRSDHDEFS 3
            +P E S TD LHWF E+V+S   D +S+       KD  D+ E S
Sbjct: 860  NPPEVSMTDPLHWFVEIVSSCGEDLESKFDAVLRAKDGEDNMETS 904


>emb|CAN65039.1| hypothetical protein VITISV_009459 [Vitis vinifera]
          Length = 1250

 Score = 97.1 bits (240), Expect = 3e-18
 Identities = 72/219 (32%), Positives = 104/219 (47%), Gaps = 41/219 (18%)
 Frame = -3

Query: 590  VGDCWSNKKILGFTILDKPHIXXXXXXXXXXXXS-LQYPYEVEEIQNRGKVGILNIDLSR 414
            + DC  N+KILGF + +KPH+            + L Y  E ++I+N  K   L+I+L  
Sbjct: 783  ISDCPRNRKILGFPVFEKPHVSNNESYSLTSPSASLLYSSEGQDIENNWKNRALDINLPC 842

Query: 413  DPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSC------------------ANEI 288
            D A+PDL  Q  +E LII+KG   + +    HI+LNSC                  A EI
Sbjct: 843  DLAVPDLGKQTPAEVLIIEKGAHSNVACVRSHIDLNSCITEDDASMTPVPSTNVKIALEI 902

Query: 287  DLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAAEAIVTMSSLRVH 144
            DLE PV P   E ++   +S G Q ++P            DE  ++AAEAIV +SS    
Sbjct: 903  DLEAPVVPETEEDVLSGLESIGKQHDSPVQSLPHKDDGLLDEFARIAAEAIVAISS---- 958

Query: 143  NHSDNASCHPLEAST----------DSLHWFAEMVTSNI 57
                + +C  LE+ T           SLHWF E++ + +
Sbjct: 959  ----SGNCSDLESPTHYLSEAPLKDSSLHWFVEIMRNPV 993


>ref|XP_007045957.1| T-box transcription factor TBX5, putative isoform 1 [Theobroma cacao]
            gi|590699564|ref|XP_007045958.1| T-box transcription
            factor TBX5, putative isoform 1 [Theobroma cacao]
            gi|508709892|gb|EOY01789.1| T-box transcription factor
            TBX5, putative isoform 1 [Theobroma cacao]
            gi|508709893|gb|EOY01790.1| T-box transcription factor
            TBX5, putative isoform 1 [Theobroma cacao]
          Length = 1084

 Score = 91.3 bits (225), Expect = 2e-16
 Identities = 75/233 (32%), Positives = 109/233 (46%), Gaps = 37/233 (15%)
 Frame = -3

Query: 590  VGDCWSNKKILGFTILDKPHIXXXXXXXXXXXXSLQYPYEVEEIQNRGKVGILNIDLSRD 411
            + +C  NKKILG  I DKP++            S+  P E  E +N+G+  +L+I+L  D
Sbjct: 680  ISECLHNKKILGIPIFDKPYVSKNESSYTSPYVSVPQPSE-GEAENKGRNRLLDINLPCD 738

Query: 410  PALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSCANE-------------------I 288
              +PD+   + +E+   +K PD   S     I+LNSC  E                   I
Sbjct: 739  VNVPDVSQDVVAEDSATEKEPDTKLSSFRHQIDLNSCVTEDEASFVASVPITCVKMTGGI 798

Query: 287  DLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAAEAIVTMSSLRVH 144
            DLE P+ P   E ++   +      E P            DEL+K AAEAIV +SS   +
Sbjct: 799  DLEAPLVP-EPEDVIHGEELLEKARELPLQSAQSKDDFLQDELIKSAAEAIVAISSSGEY 857

Query: 143  NHSDNASCHPLEAS-TDSLHWFAEMVTSNIADPKSE-----AGKDRSDHDEFS 3
            +H D+ + +  E S TD L+WF E ++S   D +S+      GKD  D DE S
Sbjct: 858  SHFDDVNRYSSETSMTDPLNWFVETISSFGEDLESKFEALLRGKD-GDRDESS 909


>ref|XP_006438780.1| hypothetical protein CICLE_v10030574mg [Citrus clementina]
            gi|557540976|gb|ESR52020.1| hypothetical protein
            CICLE_v10030574mg [Citrus clementina]
          Length = 1080

 Score = 90.9 bits (224), Expect = 2e-16
 Identities = 75/230 (32%), Positives = 110/230 (47%), Gaps = 36/230 (15%)
 Frame = -3

Query: 584  DCWSNKKILGFTILDKPHIXXXXXXXXXXXXSLQYPYEVEEIQNRGKVGILNIDLSRDPA 405
            D  S++KILGF  L+KPHI                P    E++   K  +L+I+L  D A
Sbjct: 678  DFLSSRKILGFPFLEKPHISANESSSLTSPSVSVPPTSEVEVEENKKNRVLDINLPFDAA 737

Query: 404  LPDLDGQLSSENLI-IDKGPDKDRSGSLCHINLNSCANE------------------IDL 282
            +PDL  Q ++E L+ I+K  D   +G    I+LNSC +E                  IDL
Sbjct: 738  VPDLSQQGATEALVLIEKKSDVRVAGFRHEIDLNSCVSEDEASFTPAAPSSNVKTSGIDL 797

Query: 281  EVPVAPYAMEGIVRVGDSSGNQLETP-----------DELVKVAAEAIVTMSSLRVHNHS 135
            E P+ P   E ++   +S    L+ P           D++ + AAEAIV +SS       
Sbjct: 798  EAPIVPETEEMVISGEESPEKALKVPLQQRKTELVHDDDVARAAAEAIVWISSSASQIRL 857

Query: 134  DNASCHPLEAS-TDSLHWFAEMVTSNIAD--PKSEA---GKDRSDHDEFS 3
            D+A+C+  EAS  D L+WF E+++S   D   K +A   GKD  D+ + S
Sbjct: 858  DDATCNSSEASIKDPLNWFVEIISSCGDDIMRKFDAALRGKDGEDNGDSS 907


>ref|XP_002512124.1| hypothetical protein RCOM_1621800 [Ricinus communis]
            gi|223549304|gb|EEF50793.1| hypothetical protein
            RCOM_1621800 [Ricinus communis]
          Length = 1085

 Score = 90.5 bits (223), Expect = 3e-16
 Identities = 67/216 (31%), Positives = 99/216 (45%), Gaps = 32/216 (14%)
 Frame = -3

Query: 590  VGDCWSNKKILGFTILDKPHIXXXXXXXXXXXXSLQYPYEVEEIQNRGKVGILNIDLSRD 411
            + D  S +KILGF I +KPHI             +      E+I+N  K  +L+I+L  D
Sbjct: 683  ISDTSSCRKILGFPIFEKPHISKVESSSLTSPS-VSLSQPTEDIENNRKSRVLDINLPCD 741

Query: 410  PALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSCANE-------------------I 288
            P +PD   +  +E ++ +K  +K  +    HI+LNS   E                   I
Sbjct: 742  PPVPDFGQETPAELVLTEKETEKRVASVRHHIDLNSSITEDEASLIPSVPGSTVKIISGI 801

Query: 287  DLEVPVAPYAMEGIVRVGD------------SSGNQLETPDELVKVAAEAIVTMSSLRVH 144
            DLEVP  P   E ++   +            S      +PDE  ++AAEAIV +S     
Sbjct: 802  DLEVPALPETEEDVIPGEECLEKAHGVSSQLSESKAESSPDEFARIAAEAIVAISITGYR 861

Query: 143  NHSDNASCHPLEAS-TDSLHWFAEMVTSNIADPKSE 39
            +H D+   +P EAS TD LHWF E+ +S   D +S+
Sbjct: 862  SHQDDDVGNPSEASMTDPLHWFVEIASSFGEDLESK 897


>ref|XP_007225410.1| hypothetical protein PRUPE_ppa000582mg [Prunus persica]
            gi|462422346|gb|EMJ26609.1| hypothetical protein
            PRUPE_ppa000582mg [Prunus persica]
          Length = 1088

 Score = 90.1 bits (222), Expect = 4e-16
 Identities = 71/231 (30%), Positives = 108/231 (46%), Gaps = 37/231 (16%)
 Frame = -3

Query: 590  VGDCWSNKKILGFTILDKPHIXXXXXXXXXXXXSLQYPYEVEEIQNRGKVGILNIDLSRD 411
            +GD    +K+LGF I +K HI                       +N  +   L+I+L  D
Sbjct: 682  LGDIPCKRKLLGFPIFEKSHISKNESSSLTSPSVSISHQSERGGENTRRNRELDINLPCD 741

Query: 410  PALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSCAN-------------------EI 288
            P+ P+L  +  +E +++++G D   +    +I+LNSC +                   EI
Sbjct: 742  PSAPELARKNVAEIVVVEEGRDTKVASFRHYIDLNSCISDDEVSLKPSVPSTSVKITVEI 801

Query: 287  DLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAAEAIVTMSSLRVH 144
            DLE P+ P   + ++    S+  Q E              DELV+VAAEAIV++SS   H
Sbjct: 802  DLEAPIVPETDDDVIPGETSAEKQKEISLALPQHTAEPPQDELVRVAAEAIVSISSSGPH 861

Query: 143  NHSDNASCHPLEA-STDSLHWFAEMVTSNIADPKSE-----AGKDRSDHDE 9
            NH + +SC P EA STD L WF E+ +   +D +S+      GKD  D +E
Sbjct: 862  NHMNESSCDPPEASSTDPLVWFVEIASICGSDLESKFDTVLRGKDGEDKEE 912


>gb|EXB65066.1| hypothetical protein L484_004242 [Morus notabilis]
          Length = 1075

 Score = 88.6 bits (218), Expect = 1e-15
 Identities = 70/212 (33%), Positives = 97/212 (45%), Gaps = 38/212 (17%)
 Frame = -3

Query: 584  DCWSNKKILGFTILDKPHIXXXXXXXXXXXXSLQYPYEVEEIQNRGKVGILNIDLSRDPA 405
            D  SNKK+LGF I +K  I             L  P E + +    +V  L+I+L  DPA
Sbjct: 675  DIPSNKKLLGFAIFEKTRISKNESS-------LPQPSESKVVNKCNRV--LDINLPCDPA 725

Query: 404  LPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSCAN--------------------EID 285
             PDL  Q  +E ++++KG +   +G   HI+LNSC +                    EID
Sbjct: 726  APDLVQQNEAEIMVVEKGTESKSAGFRHHIDLNSCLSDDEEESLKLPAPIARLRITAEID 785

Query: 284  LEVPVAPYAMEGIVRVGDSSGNQLET------------PDELVKVAAEAIVTMSSLRVHN 141
            LE P  P   + ++    S+  Q+E              DE + VAAEAIV +SS   HN
Sbjct: 786  LEAPAVPETEDDVILGEASALEQIEAHVKSLERNVEVLQDEFMMVAAEAIVAISSSSCHN 845

Query: 140  HSDNASCHPLEAST------DSLHWFAEMVTS 63
            H  + SCH  E  +      D L WF E+V+S
Sbjct: 846  HV-HESCHSSETPSKESSLEDPLAWFVEIVSS 876


>ref|XP_006483072.1| PREDICTED: uncharacterized protein LOC102619816 [Citrus sinensis]
          Length = 1080

 Score = 88.2 bits (217), Expect = 2e-15
 Identities = 75/230 (32%), Positives = 109/230 (47%), Gaps = 36/230 (15%)
 Frame = -3

Query: 584  DCWSNKKILGFTILDKPHIXXXXXXXXXXXXSLQYPYEVEEIQNRGKVGILNIDLSRDPA 405
            D  S+ KILGF  L+KPHI                P    E++   K  +L+I+L  D A
Sbjct: 678  DFSSSGKILGFPFLEKPHISANESSSLTSPSVSVPPTSEVEVEENKKNRVLDINLPFDAA 737

Query: 404  LPDLDGQLSSENLI-IDKGPDKDRSGSLCHINLNSCANE------------------IDL 282
            +PDL  Q ++E L+ I+K  D   +G    I+LNSC +E                  IDL
Sbjct: 738  VPDLSQQGATEALVLIEKKSDVRVAGFRHEIDLNSCVSEDEASFTPAAPSSNVKTSGIDL 797

Query: 281  EVPVAPYAMEGIVRVGDSSGNQLETP-----------DELVKVAAEAIVTMSSLRVHNHS 135
            E P+ P   E ++   +S    L+ P           D++ + AAEAIV +SS       
Sbjct: 798  EAPIVPETEEMVISGEESPEKALKVPLQQRKTELVHDDDVSRAAAEAIVWISSSASQIRL 857

Query: 134  DNASCHPLEAS-TDSLHWFAEMVTSNIAD--PKSEA---GKDRSDHDEFS 3
            D+A+C+  EAS  D L+WF E+++S   D   K +A   GKD  D+ + S
Sbjct: 858  DDATCNSSEASIKDPLNWFVEIISSCGDDIMRKFDAALRGKDGEDNGDSS 907


>ref|XP_007037462.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508774707|gb|EOY21963.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 954

 Score = 83.6 bits (205), Expect = 4e-14
 Identities = 63/184 (34%), Positives = 93/184 (50%), Gaps = 30/184 (16%)
 Frame = -3

Query: 467  EEIQNRGKVGILNIDLSRDPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSC---- 300
            E+I+++ K  + +++L  D  +P    QL+   L     P          I+LNSC    
Sbjct: 600  EDIKDKEKDRLPDMNLEVDH-VPFRGKQLAVAELFSKSKPCGKHPTFGVLIDLNSCLSLD 658

Query: 299  --------ANEIDLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAA 180
                    +NEIDLE P +P   E     G+S  NQLETP            + LV++AA
Sbjct: 659  ASPLIPSHSNEIDLEPPASPENKERSPPRGESDENQLETPLVSSGQEDGDLQEALVRIAA 718

Query: 179  EAIVTMSSLRVHNHSDNASCHPLEAS-TDSLHWFAEMVTSNIADPKSEAG-----KDRSD 18
            EAIV++SS  +    ++ SC P +AS  +SL+WFA + +S + DP SE G     KD  D
Sbjct: 719  EAIVSISSSEIQTCKESTSCEPFKASWNNSLYWFARVASSVVDDPGSEFGVNVGVKDHGD 778

Query: 17   HDEF 6
            H+E+
Sbjct: 779  HEEY 782


>ref|XP_007037461.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508774706|gb|EOY21962.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 999

 Score = 83.6 bits (205), Expect = 4e-14
 Identities = 63/184 (34%), Positives = 93/184 (50%), Gaps = 30/184 (16%)
 Frame = -3

Query: 467  EEIQNRGKVGILNIDLSRDPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSC---- 300
            E+I+++ K  + +++L  D  +P    QL+   L     P          I+LNSC    
Sbjct: 645  EDIKDKEKDRLPDMNLEVDH-VPFRGKQLAVAELFSKSKPCGKHPTFGVLIDLNSCLSLD 703

Query: 299  --------ANEIDLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAA 180
                    +NEIDLE P +P   E     G+S  NQLETP            + LV++AA
Sbjct: 704  ASPLIPSHSNEIDLEPPASPENKERSPPRGESDENQLETPLVSSGQEDGDLQEALVRIAA 763

Query: 179  EAIVTMSSLRVHNHSDNASCHPLEAS-TDSLHWFAEMVTSNIADPKSEAG-----KDRSD 18
            EAIV++SS  +    ++ SC P +AS  +SL+WFA + +S + DP SE G     KD  D
Sbjct: 764  EAIVSISSSEIQTCKESTSCEPFKASWNNSLYWFARVASSVVDDPGSEFGVNVGVKDHGD 823

Query: 17   HDEF 6
            H+E+
Sbjct: 824  HEEY 827


>ref|XP_007037460.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508774705|gb|EOY21961.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 1016

 Score = 83.6 bits (205), Expect = 4e-14
 Identities = 63/184 (34%), Positives = 93/184 (50%), Gaps = 30/184 (16%)
 Frame = -3

Query: 467  EEIQNRGKVGILNIDLSRDPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSC---- 300
            E+I+++ K  + +++L  D  +P    QL+   L     P          I+LNSC    
Sbjct: 662  EDIKDKEKDRLPDMNLEVDH-VPFRGKQLAVAELFSKSKPCGKHPTFGVLIDLNSCLSLD 720

Query: 299  --------ANEIDLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAA 180
                    +NEIDLE P +P   E     G+S  NQLETP            + LV++AA
Sbjct: 721  ASPLIPSHSNEIDLEPPASPENKERSPPRGESDENQLETPLVSSGQEDGDLQEALVRIAA 780

Query: 179  EAIVTMSSLRVHNHSDNASCHPLEAS-TDSLHWFAEMVTSNIADPKSEAG-----KDRSD 18
            EAIV++SS  +    ++ SC P +AS  +SL+WFA + +S + DP SE G     KD  D
Sbjct: 781  EAIVSISSSEIQTCKESTSCEPFKASWNNSLYWFARVASSVVDDPGSEFGVNVGVKDHGD 840

Query: 17   HDEF 6
            H+E+
Sbjct: 841  HEEY 844


>ref|XP_007037459.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508774704|gb|EOY21960.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 928

 Score = 83.6 bits (205), Expect = 4e-14
 Identities = 63/184 (34%), Positives = 93/184 (50%), Gaps = 30/184 (16%)
 Frame = -3

Query: 467  EEIQNRGKVGILNIDLSRDPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSC---- 300
            E+I+++ K  + +++L  D  +P    QL+   L     P          I+LNSC    
Sbjct: 574  EDIKDKEKDRLPDMNLEVDH-VPFRGKQLAVAELFSKSKPCGKHPTFGVLIDLNSCLSLD 632

Query: 299  --------ANEIDLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAA 180
                    +NEIDLE P +P   E     G+S  NQLETP            + LV++AA
Sbjct: 633  ASPLIPSHSNEIDLEPPASPENKERSPPRGESDENQLETPLVSSGQEDGDLQEALVRIAA 692

Query: 179  EAIVTMSSLRVHNHSDNASCHPLEAS-TDSLHWFAEMVTSNIADPKSEAG-----KDRSD 18
            EAIV++SS  +    ++ SC P +AS  +SL+WFA + +S + DP SE G     KD  D
Sbjct: 693  EAIVSISSSEIQTCKESTSCEPFKASWNNSLYWFARVASSVVDDPGSEFGVNVGVKDHGD 752

Query: 17   HDEF 6
            H+E+
Sbjct: 753  HEEY 756


>ref|XP_007037458.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508774703|gb|EOY21959.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 990

 Score = 83.6 bits (205), Expect = 4e-14
 Identities = 63/184 (34%), Positives = 93/184 (50%), Gaps = 30/184 (16%)
 Frame = -3

Query: 467  EEIQNRGKVGILNIDLSRDPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSC---- 300
            E+I+++ K  + +++L  D  +P    QL+   L     P          I+LNSC    
Sbjct: 636  EDIKDKEKDRLPDMNLEVDH-VPFRGKQLAVAELFSKSKPCGKHPTFGVLIDLNSCLSLD 694

Query: 299  --------ANEIDLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAA 180
                    +NEIDLE P +P   E     G+S  NQLETP            + LV++AA
Sbjct: 695  ASPLIPSHSNEIDLEPPASPENKERSPPRGESDENQLETPLVSSGQEDGDLQEALVRIAA 754

Query: 179  EAIVTMSSLRVHNHSDNASCHPLEAS-TDSLHWFAEMVTSNIADPKSEAG-----KDRSD 18
            EAIV++SS  +    ++ SC P +AS  +SL+WFA + +S + DP SE G     KD  D
Sbjct: 755  EAIVSISSSEIQTCKESTSCEPFKASWNNSLYWFARVASSVVDDPGSEFGVNVGVKDHGD 814

Query: 17   HDEF 6
            H+E+
Sbjct: 815  HEEY 818


>ref|XP_007037457.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508774702|gb|EOY21958.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1025

 Score = 83.6 bits (205), Expect = 4e-14
 Identities = 63/184 (34%), Positives = 93/184 (50%), Gaps = 30/184 (16%)
 Frame = -3

Query: 467  EEIQNRGKVGILNIDLSRDPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSC---- 300
            E+I+++ K  + +++L  D  +P    QL+   L     P          I+LNSC    
Sbjct: 671  EDIKDKEKDRLPDMNLEVDH-VPFRGKQLAVAELFSKSKPCGKHPTFGVLIDLNSCLSLD 729

Query: 299  --------ANEIDLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAA 180
                    +NEIDLE P +P   E     G+S  NQLETP            + LV++AA
Sbjct: 730  ASPLIPSHSNEIDLEPPASPENKERSPPRGESDENQLETPLVSSGQEDGDLQEALVRIAA 789

Query: 179  EAIVTMSSLRVHNHSDNASCHPLEAS-TDSLHWFAEMVTSNIADPKSEAG-----KDRSD 18
            EAIV++SS  +    ++ SC P +AS  +SL+WFA + +S + DP SE G     KD  D
Sbjct: 790  EAIVSISSSEIQTCKESTSCEPFKASWNNSLYWFARVASSVVDDPGSEFGVNVGVKDHGD 849

Query: 17   HDEF 6
            H+E+
Sbjct: 850  HEEY 853


>gb|EXC11014.1| hypothetical protein L484_015234 [Morus notabilis]
          Length = 972

 Score = 82.4 bits (202), Expect = 8e-14
 Identities = 55/171 (32%), Positives = 92/171 (53%), Gaps = 25/171 (14%)
 Frame = -3

Query: 470  VEEIQNRGKVGILNIDLSRDPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSCANE 291
            ++++++ GK  +++++L+ D A  + + +L+++  + + G ++  S   C I+LNS  NE
Sbjct: 615  LQDVKSSGKDSVIDLNLACDSA-SETEIELTADEHVGENGVNRKHSSFGCLIDLNSSINE 673

Query: 290  ------------IDLEVPVAPYAMEGIVRVGDSSGNQLETP------------DELVKVA 183
                        IDL+ P +P   E     G+S  NQ ETP            DEL K+A
Sbjct: 674  ARFTQKSSLLAEIDLDAPASPENKESSPPRGESDENQAETPVLLLGQEGADQQDELAKIA 733

Query: 182  AEAIVTMSSLRVHNHSDNASCHPLEAST-DSLHWFAEMVTSNIADPKSEAG 33
            AEA++++SS +        S   LE S  DSLHWFA +V+S  ++P+SE G
Sbjct: 734  AEALISISSFKSSTSLQKPSFERLEVSLLDSLHWFAGVVSSVASNPESEFG 784


>ref|XP_006583955.1| PREDICTED: uncharacterized protein LOC102665797 isoform X1 [Glycine
            max] gi|571467486|ref|XP_006583956.1| PREDICTED:
            uncharacterized protein LOC102665797 isoform X2 [Glycine
            max]
          Length = 1080

 Score = 81.6 bits (200), Expect = 1e-13
 Identities = 70/234 (29%), Positives = 103/234 (44%), Gaps = 38/234 (16%)
 Frame = -3

Query: 590  VGDCWSNKKILGFTILDKPHIXXXXXXXXXXXXS-LQYPYEVEEIQNRGKVGILNIDLSR 414
            V D  S +KILG  I D  HI              +  P +VE ++N  +  IL+I+L  
Sbjct: 675  VSDSSSKRKILGVPIFDISHISAKESSSFTSSSVSVPNPSDVELVENNQRKHILDINLPC 734

Query: 413  DPALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSCANE------------------I 288
            D ++P+ D Q  ++ ++ + G    ++ S   I+LN C NE                  I
Sbjct: 735  DASVPEFDEQAVAQVIVCETGSSTTKANSRKQIDLNLCMNEDEAFVTNIPATNLETKAEI 794

Query: 287  DLEVPVAPYAMEGIVRVGDSSGNQLETP---------------DELVKVAAEAIVTMSSL 153
            DLEVP  P A E  +        +LETP               DEL++ AAEAIV +SS 
Sbjct: 795  DLEVPAVPEAEEDAI----PEEKKLETPLVSPLGPQDTVEKLQDELMRHAAEAIVVLSSS 850

Query: 152  RVHNHSDNASCHPLEASTDSLHWFAEMVTSNIADPKSEAG----KDRSDHDEFS 3
                  D  S        DSL WF ++V+S + D + ++     KD  D++E S
Sbjct: 851  CCQQVDDVISSPSEGPVVDSLSWFVDIVSSCVDDLQKKSDNSREKDGEDNEESS 904


>ref|XP_002514707.1| hypothetical protein RCOM_1470750 [Ricinus communis]
            gi|223546311|gb|EEF47813.1| hypothetical protein
            RCOM_1470750 [Ricinus communis]
          Length = 925

 Score = 76.3 bits (186), Expect = 6e-12
 Identities = 54/164 (32%), Positives = 84/164 (51%), Gaps = 30/164 (18%)
 Frame = -3

Query: 407  ALPDLDGQLSSENLIIDKGPDKDRSGSLCHINLNSCANE------------IDLEVPVAP 264
            ++PD   QL++  L++ K   K  SG    ++LNS  +E            +DL+ P +P
Sbjct: 589  SVPDSGEQLTANELVLGKKLGKKSSGFGFQVDLNSYIHEDGSLLLPSVPSILDLQAPKSP 648

Query: 263  YAMEGIVRVGDSSGNQLETP------------DELVKVAAEAIVTMSSLRVHNHSDNASC 120
               EG    G+S  NQ ETP            ++LV +AAEAIV++S     N ++N + 
Sbjct: 649  ENEEGSPPRGESDENQHETPCILSEQENGDLLEDLVTIAAEAIVSISLSEPQNETENETF 708

Query: 119  HPLEASTD-SLHWFAEMVTSNIADPKSEAG-----KDRSDHDEF 6
               EA+   SLHWFA++ +S + DP+SE G     ++  D DE+
Sbjct: 709  RQPEAAESVSLHWFAKLASSIVDDPESEFGVVLSCRNPDDQDEY 752


>ref|XP_007158617.1| hypothetical protein PHAVU_002G167700g [Phaseolus vulgaris]
            gi|561032032|gb|ESW30611.1| hypothetical protein
            PHAVU_002G167700g [Phaseolus vulgaris]
          Length = 1078

 Score = 70.5 bits (171), Expect = 3e-10
 Identities = 64/227 (28%), Positives = 96/227 (42%), Gaps = 36/227 (15%)
 Frame = -3

Query: 575  SNKKILGFTILDKPHIXXXXXXXXXXXXSL-QYPYEVEEIQNRGKVGILNIDLSRDPALP 399
            S +KILG  I   PHI             L     +VE ++N  +  IL+I+L  D ++P
Sbjct: 678  SKRKILGVPIFGIPHISSKESSSFTFPSVLVPISSDVELVENNQRKHILDINLPCDASVP 737

Query: 398  DLDGQLSSENLIIDKGPDKDRSGSLCHINLNSCANE-------------------IDLEV 276
            + D Q  +E ++ +      ++ S   I+LN   +E                   IDLE 
Sbjct: 738  EFDEQAVTEVIVCETRSSTTKANSRNQIDLNLSMDEEDEAFLTNIPATSLETKVEIDLEA 797

Query: 275  PVAPYAMEGIVRVGDSSGNQLETP------------DELVKVAAEAIVTMSSLRVHNHSD 132
            P  P   +  +       N+LETP            DEL++ AAEAIV +SS       D
Sbjct: 798  PAIPETEDNAI----PEENKLETPSVSPQGTVEKLQDELMRYAAEAIVVLSSSCSQQVDD 853

Query: 131  NASCHPLEASTDSLHWFAEMVTSNIADPK----SEAGKDRSDHDEFS 3
              S        D L WF ++V+S + D +    +  GKD  D++E S
Sbjct: 854  VISSPSESPVVDPLSWFVDIVSSCVDDLQKKIDNRRGKDGEDNEECS 900

