BLASTX nr result

ID: Akebia26_contig00025336

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00025336
         (2273 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006854566.1| hypothetical protein AMTR_s00030p00103480 [A...   144   1e-31
ref|XP_007148443.1| hypothetical protein PHAVU_006G209200g [Phas...   129   8e-27
ref|XP_006487482.1| PREDICTED: uncharacterized protein LOC102618...   117   2e-23
ref|XP_006423726.1| hypothetical protein CICLE_v10028677mg [Citr...   115   1e-22
ref|XP_004230610.1| PREDICTED: uncharacterized protein LOC101262...   112   6e-22
ref|XP_002523767.1| conserved hypothetical protein [Ricinus comm...   112   7e-22
ref|XP_002321383.1| hypothetical protein POPTR_0015s01060g [Popu...   110   2e-21
ref|XP_007043041.1| Uncharacterized protein isoform 1 [Theobroma...   106   4e-20
ref|XP_003593131.1| hypothetical protein MTR_2g008130 [Medicago ...   100   2e-18
ref|XP_007204312.1| hypothetical protein PRUPE_ppb021745mg [Prun...    90   4e-15
gb|AEJ72552.1| hypothetical protein [Malus domestica]                  87   3e-14
gb|EPS68024.1| hypothetical protein M569_06755, partial [Genlise...    81   2e-12
ref|XP_002318448.2| hypothetical protein POPTR_0012s02720g [Popu...    70   4e-09
ref|NP_001174784.1| Os06g0468300 [Oryza sativa Japonica Group] g...    68   2e-08
ref|XP_006656073.1| PREDICTED: uncharacterized protein LOC102716...    66   6e-08
gb|ABR16126.1| unknown [Picea sitchensis]                              62   1e-06

>ref|XP_006854566.1| hypothetical protein AMTR_s00030p00103480 [Amborella trichopoda]
            gi|548858252|gb|ERN16033.1| hypothetical protein
            AMTR_s00030p00103480 [Amborella trichopoda]
          Length = 736

 Score =  144 bits (364), Expect = 1e-31
 Identities = 181/681 (26%), Positives = 265/681 (38%), Gaps = 156/681 (22%)
 Frame = -2

Query: 2272 SNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVG--ISCVCCDRRVHGD 2099
            S +GKDKL TL S+WR+VLLC NC + V S  NCSYCF+ + + G  ++C  CD RVH  
Sbjct: 82   SGKGKDKLHTLASEWRVVLLCKNCLNAVNSGTNCSYCFSALENSGCVLNCRKCDHRVHQG 141

Query: 2098 CVSKYR--------GLGLC----------------SKSDSFTC----------------- 2042
            C SK+R        G  LC                +KSDSF                   
Sbjct: 142  CASKHRGSLLQCSSGSFLCVDCWVPKSRLNFGCGSNKSDSFGTQDSKSLLRFGETKVFGD 201

Query: 2041 IDCWVPKSLNGVPWGRNPNGS--------------------------------SKIVSGN 1958
             D    KS++   +    +GS                                 K VS +
Sbjct: 202  CDSKAEKSVSSASFPETNSGSVDKTMVSVAIKPLDKENPCIDGESELNKYQDAEKHVSDS 261

Query: 1957 CSVKISRAS-------SLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKN 1799
             S K SR S       SL+++  K+ANS A   +                       A+N
Sbjct: 262  VSEKASRFSFNGNCCRSLEEIV-KEANSAAARAMTIAASAKENALRKAMVARNAASAARN 320

Query: 1798 ALDLVAAAVREDSQLKDSRLSGG-LAADD------------------------------T 1712
            AL+ +A   +E+++ K+S  S   L  DD                               
Sbjct: 321  ALNFLAILEQEENEAKESLQSNASLMGDDGNSNIADRAEKSNGIHLKAGSLPESHEVADE 380

Query: 1711 KLAFLLHRTINSSPRISKNLGSMD--------LGN---------LVAPKLRKGNGYLLDR 1583
            +LA  LHR +NSSPRIS+  G+ +        L N         +   K    NG+  + 
Sbjct: 381  ELALRLHRAMNSSPRISRRRGAPNGIQLKECKLSNSTKCEFNCMVTTKKQNCSNGFGNEE 440

Query: 1582 ----------------------QSDHGSHSVHGELEVCTNNTM---LENPDKVVSEPSVR 1478
                                  +++ GS SV G L +CT + +   L++PD   +EPSV 
Sbjct: 441  FRRNERRFRRDSEVIGQSTSILKTESGSQSVCGNLHLCTEDKIDGTLDHPD---AEPSVG 497

Query: 1477 IGSLDHSSSMGLGVLEPKMKVYTRESHKVKNFK-KNGEDAFMGNFEKGLEHCYRKQEVLE 1301
             G+L+ ++S+G+ V E K +   R+   +        E+   G  +     C    +V  
Sbjct: 498  NGALELANSIGMAVEEFKKR---RDDEAINGVSFHEDEEKKEGTMQGAFRSCRADGKV-- 552

Query: 1300 HKVSSNSGGTHCQFPCDEDSSTPEKKKCHGSDMYLRKYSKRRTSIKSTLDQSSSHVNECK 1121
              +  N G        + ++S          D    K  K  T +K   +Q+S       
Sbjct: 553  -DMKQNGG-------MNMENSLKNGLLIVDGDNSGVKDMKPETPVKE--EQASCSNKAMN 602

Query: 1120 ENGDDSVMGNFDRGLQASLIGLSNGNGIVELPVKEQVSCYLKQEGLAPRVSSNNRGTQCQ 941
             +G+DS   + D G ++S       NG     V +     +K  G   ++S  N      
Sbjct: 603  SSGEDS---SLDTGFESSQKWKGGENGGSSSNVSK-----VKPFGYRAKLSKFNCAQ--S 652

Query: 940  SACDEDTSIPERKRCHGPDMYLKTYSKRHTSLKVILHQKTKVLFEDSPLESQASTPGLSS 761
             A + D   P++KR   P    K   KRH+S+KVIL +KTK L ED PLES+A T  L  
Sbjct: 653  QAREGDPLKPQKKRSILPHPDSKRPIKRHSSMKVILDRKTKSLAEDFPLESKALTNALPL 712

Query: 760  LQLNCSNVCRTFSDASFQSSS 698
            LQ NC+   +  SD+S  S S
Sbjct: 713  LQRNCAKAPKKLSDSSHGSPS 733


>ref|XP_007148443.1| hypothetical protein PHAVU_006G209200g [Phaseolus vulgaris]
            gi|561021666|gb|ESW20437.1| hypothetical protein
            PHAVU_006G209200g [Phaseolus vulgaris]
          Length = 439

 Score =  129 bits (323), Expect = 8e-27
 Identities = 116/403 (28%), Positives = 172/403 (42%), Gaps = 33/403 (8%)
 Frame = -2

Query: 2263 GKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGISCVCCDRRVHGDCVSKY 2084
            GK++L+TL S+WR+VLLC  CF  V+S++ CSYCF+ +S     C  C   VH  C  KY
Sbjct: 57   GKNRLRTLYSEWRVVLLCKKCFVSVESSQICSYCFSGMSLESYRCNQCQHSVHKTCFLKY 116

Query: 2083 RGL---GLCSKSDSFT-CIDCWVPKSLN----------GVPWGR--NPNGSSKIVSGNCS 1952
            +        S    F+ C+DCW+PK L           G   GR     GSS+++ G   
Sbjct: 117  KNAPPWSYASMGSEFSVCVDCWIPKHLEISRRRKRRVMGDENGRIILEKGSSRVLPGG-- 174

Query: 1951 VKISRASSLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAAV 1772
               + A S++D+  +DA      K+                       AKNAL LVA   
Sbjct: 175  ---NLARSMEDL-VEDAKREVGEKVEAAARAREGAVKKALVARRAVEIAKNALSLVANG- 229

Query: 1771 REDSQLKDSRLSGGLAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYL 1592
             E S     +       D ++L F LH   N+ PRISK+   ++   L APK    +   
Sbjct: 230  EESSLNPPPKREAFKVLDGSELTFELHPEFNTLPRISKSCCLLNTSFLDAPKRLSPSVDS 289

Query: 1591 LDRQSDHGSHSVHGELEVCTNNTMLENPDKVVSEPSVRIGSLDHSSSMGLGVL---EPKM 1421
              + S+  +     + EV  +N +L +  K + EP V +G+LD  SS GL +L      M
Sbjct: 290  SCKTSNSRNADYRDKHEVSCDNKLLADSCKSLCEPLVSVGTLDSGSSTGLNLLCMGRSGM 349

Query: 1420 KVYTRESHKV---------KNFKKNGE----DAFMGNFEKGLEHCYRKQEVLEHKVSSNS 1280
            +  +++  +          +  +K GE    D  +   E       RKQ           
Sbjct: 350  ETGSKDGERTAESDGEGIGEELQKEGEGSCSDRIINLSEDSCMELDRKQ----------- 398

Query: 1279 GGTHCQFPCDEDSSTPEKKKCHGS-DMYLRKYSKRRTSIKSTL 1154
                       DS+    K+C+G  D Y  KYS+R  S+KS +
Sbjct: 399  ----------ADSALHRVKRCNGQPDRYFLKYSRRNCSLKSKI 431


>ref|XP_006487482.1| PREDICTED: uncharacterized protein LOC102618081 isoform X1 [Citrus
            sinensis] gi|568868391|ref|XP_006487483.1| PREDICTED:
            uncharacterized protein LOC102618081 isoform X2 [Citrus
            sinensis]
          Length = 373

 Score =  117 bits (294), Expect = 2e-23
 Identities = 100/386 (25%), Positives = 163/386 (42%), Gaps = 1/386 (0%)
 Frame = -2

Query: 2263 GKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGISCVCCDRRVHGDCVSKY 2084
            G DK+Q L S+WRIVLLC  C   ++S+K CSYC+ E  +  ++C  C R VH +C  K 
Sbjct: 36   GNDKIQILYSEWRIVLLCCKCLDRIESSKICSYCYKETIEDFLTCSQCKRSVHRNCFLKC 95

Query: 2083 RGLGLCSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISRASSLDDVAAKD 1904
            + +   S  +S  C+DCWVPKSL      R      KI + +  + IS         ++ 
Sbjct: 96   KAIDSMSSLESLICVDCWVPKSL---VKRRELLTCRKICNSSADLGISN--------SRV 144

Query: 1903 ANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAAVREDSQLKDSRLSGGLA 1724
            +N      +                         NALDL     RE+      +++   +
Sbjct: 145  SNGGGSCAVVERKIVFALMATEMIGRKPFVPKKSNALDL--EVKREEGGEIHKKVA---S 199

Query: 1723 ADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYLLDRQSDHGSHSVHGEL 1544
             DD +LAF LHR++NSSPRISKNL  ++  +   PK ++ +G L+   S  GS       
Sbjct: 200  DDDAELAFQLHRSMNSSPRISKNLCVVNSSDSHVPKKQECDGVLILGGSGSGS------- 252

Query: 1543 EVCTNNTMLENPDKVVSEPSVRIGSLDHSSSMGLGVLEPKMKVYTRESHK-VKNFKKNGE 1367
              C++N +  + D+  +    R        S        K+ V  ++  +    ++K G 
Sbjct: 253  --CSSNALKSSGDETSTNFDSRPSYDKRCESASY-----KLAVCNKQPDRFFFKYRKRGS 305

Query: 1366 DAFMGNFEKGLEHCYRKQEVLEHKVSSNSGGTHCQFPCDEDSSTPEKKKCHGSDMYLRKY 1187
              F+  + +        + VL++K                            SD++L KY
Sbjct: 306  RRFLLKYRR---RSSSSKPVLDNK----------------------------SDIFLLKY 334

Query: 1186 SKRRTSIKSTLDQSSSHVNECKENGD 1109
             +RR++    +  + S +  C +  D
Sbjct: 335  RRRRSAGSKPVPDNKSDIEICNQKPD 360


>ref|XP_006423726.1| hypothetical protein CICLE_v10028677mg [Citrus clementina]
            gi|567862146|ref|XP_006423727.1| hypothetical protein
            CICLE_v10028677mg [Citrus clementina]
            gi|557525660|gb|ESR36966.1| hypothetical protein
            CICLE_v10028677mg [Citrus clementina]
            gi|557525661|gb|ESR36967.1| hypothetical protein
            CICLE_v10028677mg [Citrus clementina]
          Length = 373

 Score =  115 bits (287), Expect = 1e-22
 Identities = 81/255 (31%), Positives = 122/255 (47%), Gaps = 1/255 (0%)
 Frame = -2

Query: 2263 GKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGISCVCCDRRVHGDCVSKY 2084
            G DK+Q L S+WRIVLLC  C   ++S+K CSYC+ E  +  ++C  C R VH +C  K 
Sbjct: 36   GNDKIQILYSEWRIVLLCCKCLDRIESSKICSYCYKETIEDFLTCSQCKRSVHRNCFLKC 95

Query: 2083 RGLGLCSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISRASSLDDVAAKD 1904
            + +   S  +S  C+DCWVPKSL      R      KI + +  + IS         ++ 
Sbjct: 96   KAIDSMSSLESLICVDCWVPKSL---VKRRELLTCRKICNSSADLGISN--------SRV 144

Query: 1903 ANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAAVREDSQLKDSRLSGGLA 1724
            +N      +                         NALDL    V+ D   +   +   +A
Sbjct: 145  SNGGGSCAVVERKIVFALMASEMIGRKPFVPKKSNALDL---EVKRD---EGGEIHKKVA 198

Query: 1723 A-DDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYLLDRQSDHGSHSVHGE 1547
            + DD +LAF LHR++NSSPRISKNL  ++  +   PK ++ +G L+   S  GS      
Sbjct: 199  SDDDAELAFQLHRSMNSSPRISKNLCVVNSSDSHVPKKQECDGVLILGGSGSGS------ 252

Query: 1546 LEVCTNNTMLENPDK 1502
               C++N +  + D+
Sbjct: 253  ---CSSNALKSSGDE 264


>ref|XP_004230610.1| PREDICTED: uncharacterized protein LOC101262666 [Solanum
            lycopersicum]
          Length = 488

 Score =  112 bits (281), Expect = 6e-22
 Identities = 118/409 (28%), Positives = 173/409 (42%), Gaps = 40/409 (9%)
 Frame = -2

Query: 2257 DKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGIS-CVCCDRRVHGDCVSKYR 2081
            D+L TLDS WRIVLLC NC   V S + C YCF    D   S C  C R+VH DCVS+Y 
Sbjct: 93   DRLLTLDSFWRIVLLCKNCIRCVDSGQTCPYCFKNTDDTDCSKCRSCKRQVHKDCVSRYG 152

Query: 2080 G---LGLCSKSDS--FTCIDCWVP----KSLNGVPWGRNPNGSSKIVSGN--CSVKISRA 1934
                   CS+ +   F CIDCWVP    KS+      +    + +  S +   S KI++ 
Sbjct: 153  NSAPWSFCSREEGGLFVCIDCWVPNFFKKSIGDCRKIQKDVLNIQHCSSDFKSSEKIAKH 212

Query: 1933 SSLDD------VAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAAV 1772
            ++L+       V  K  NST +  +                       AKN + L  +A+
Sbjct: 213  ANLEGLRKEVVVGLKAKNSTLQKAV----------------------VAKNPMGLAKSAL 250

Query: 1771 REDSQLKDSRLSGGLAA---DDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGN 1601
              +S +K  +  G + +   +D +LAF LHR++NSSPRISK LG  +   +  P+++   
Sbjct: 251  --ESVVKKGKSKGKVVSKDVNDAQLAFQLHRSMNSSPRISKTLGPKNSSYVGGPEIQTLP 308

Query: 1600 GYLLDRQSDHGSHSVHGELEVCTNNT-----------MLENPDKVVSEPSVRI-----GS 1469
                +R   +      G++   +  T           + E  D+  SE S R+       
Sbjct: 309  SSTGERLKVYFRTKYRGKVGPTSPETPPSVMVYSRARLKEKVDQTTSETSPRVTVYSRRR 368

Query: 1468 LDHSSSMGLGVLEPKMKVYTRESHKVKNFKKNGE-DAFMGNFEKG--LEHCYRKQEVLEH 1298
            L            P + VY+R   K K  + + E    +   E G  ++    K E+L +
Sbjct: 369  LKEEVGKASSDASPCLLVYSRTRFKEKVCQTDSEAPPCVTTNECGSCVDSACSKAELLTY 428

Query: 1297 KVSSNSGGTHCQFPCDEDSSTPEKKKCHGSDMYLRKYSKRRTSIKSTLD 1151
            K +     T     CDE       K     D YL KYS+R+   K   D
Sbjct: 429  KRNKLKRKT-----CDE-------KVVFTEDRYLLKYSRRKRCWKPGSD 465


>ref|XP_002523767.1| conserved hypothetical protein [Ricinus communis]
            gi|223536979|gb|EEF38616.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  112 bits (280), Expect = 7e-22
 Identities = 106/374 (28%), Positives = 157/374 (41%), Gaps = 12/374 (3%)
 Frame = -2

Query: 2254 KLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGISCVC----CDRRVHGDCVSK 2087
            +LQTL S+WRIVLLC  CF  V+S   C+YCF ++S    SC+     C R +H  C S 
Sbjct: 57   RLQTLYSEWRIVLLCKICFFRVESCHICAYCFKDLSSSDNSCLFRCPQCKRIIHRTCFSN 116

Query: 2086 YRGLGLCSKSDSFT-CIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISRASSLDDVAA 1910
            Y      S S  F+ C+DCWVPKS+      R     +K    NC     + SSL+DV  
Sbjct: 117  YSNFAPWSFSSKFSVCVDCWVPKSIA----SRRACFRTKKSKSNC-----KYSSLEDV-V 166

Query: 1909 KDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAAVREDSQLKDSRLSGG 1730
            +DA+   + K+                         NA DLV+   R+D+ + +      
Sbjct: 167  RDADFDVQRKVEAAAKARELVVEKALAARKAAQLVHNAFDLVSE--RDDNGIAN------ 218

Query: 1729 LAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYLLDRQSDHGSHSVHG 1550
               DD +LA  LH  +NSSPRI  NL S+D                    S   S  V G
Sbjct: 219  --VDDVQLALHLHLALNSSPRILSNLCSLD--------------------SAGSSPLVRG 256

Query: 1549 ELEVCTNNTMLENPDKVVSEPS--VRIGSLD---HSSSMGLGVLEPKMKVYTRESHKVKN 1385
             +    N++   N  K  + PS  VR+   D   H  S G   ++  +   +R   K  +
Sbjct: 257  RVCRKLNHS---NGGKPAAGPSVPVRVSGYDSSLHMDSFGSNGIDENL---SRRDAKDSD 310

Query: 1384 FK-KNGEDAFMGNFEKGLEHCYRKQEVLEHKVSSNSGGTHCQFPCDEDSSTPEKKKCHGS 1208
             + K GE +          H  R+ +           G   ++       T   ++C+  
Sbjct: 311  IRLKEGEGSCFDKVMNSKAHSCRQGDGFIVLADERCNGKPDRYSIKYTRRTSADERCNRK 370

Query: 1207 -DMYLRKYSKRRTS 1169
             ++YLRKY++R ++
Sbjct: 371  PEVYLRKYARRTSA 384


>ref|XP_002321383.1| hypothetical protein POPTR_0015s01060g [Populus trichocarpa]
            gi|222868379|gb|EEF05510.1| hypothetical protein
            POPTR_0015s01060g [Populus trichocarpa]
          Length = 497

 Score =  110 bits (276), Expect = 2e-21
 Identities = 119/404 (29%), Positives = 172/404 (42%), Gaps = 16/404 (3%)
 Frame = -2

Query: 2254 KLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEIS--DVGISCVCCDRRVHGDCVSKYR 2081
            +L+ L S+WRI+LLC  CF+ V+S+K CSYCF + S     + C  C R VH  C +K +
Sbjct: 129  RLEILYSEWRIILLCTKCFNLVESSKICSYCFRKFSVKTKCLRCCQCKRVVHKSCFAKRK 188

Query: 2080 GLGLCSKS---DS---FTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISRASSLDD 1919
             +   S S   DS     CIDCWVPKS+  +  G+    S +  +G          SL+D
Sbjct: 189  NVAPWSYSCYGDSGGFSVCIDCWVPKSV-AIKRGKVCGVSKRNDTG------VLGRSLED 241

Query: 1918 VAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAAVREDSQLKDSRL 1739
            V  KDA  T + K+                       A+ ALDLVA    E  +  +  +
Sbjct: 242  V-VKDAACTVQEKVESAVRARELAVRKALEARKAADVARKALDLVAN--NEGGKENNDNV 298

Query: 1738 SGGLAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYLLDRQSDHGSHS 1559
                  DD +LAF LHR +NSSPRIS NL  ++   L    + +GNG +  R S+  +  
Sbjct: 299  ------DDIELAFQLHRAMNSSPRISSNLCLVNSSCLGVTMIGEGNGEMRIRNSELRNLG 352

Query: 1558 VHGELEVCTNNTMLENPDKVVSEPSVRIGSLDHSSSMGLGVLEPKMKVYTRESHKVKNFK 1379
              G+L            D  +S+ SV +G    S+    GV+ P  K             
Sbjct: 353  AFGKL------------DGFMSK-SVDVGR-RKSNGNDDGVIRPDAK------------- 385

Query: 1378 KNGEDAFMGNFEKGLEHCYRKQEVLEHKVSSNSGGTHCQFPCD-------EDSSTPEKKK 1220
               +D  +G          ++QE        NS G  C    D        +S  P+ K 
Sbjct: 386  ---KDRNVG---------MQQQEQSFFNKLINSRGNDCSVNSDFQSYREGNESLVPDDKG 433

Query: 1219 C-HGSDMYLRKYSKRRTSIKSTLDQSSSHVNECKENGDDSVMGN 1091
            C    D YL KYS++R   K    +    +  C+   D+ ++ N
Sbjct: 434  CKRKHDRYLLKYSRKRVLFK--YSRRKVMLKYCRRKLDERLIPN 475


>ref|XP_007043041.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590688771|ref|XP_007043042.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508706976|gb|EOX98872.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508706977|gb|EOX98873.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 442

 Score =  106 bits (265), Expect = 4e-20
 Identities = 74/211 (35%), Positives = 101/211 (47%), Gaps = 8/211 (3%)
 Frame = -2

Query: 2263 GKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGISCVCCDRRVHGDCVSKY 2084
            GK+++QTL S+WRIVLLC+ C+H V S++ CSYCF E S+   SC  C R +H  C    
Sbjct: 61   GKNRIQTLYSEWRIVLLCSRCYHRVDSSEICSYCFKEASEDCFSCGQCKRSLHKTCFLNC 120

Query: 2083 RGL-----GLCSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISR---ASS 1928
            + +      +C  S+   CIDCWVPK +         N  +K    N S+  +R    + 
Sbjct: 121  KSVPPWSFSICG-SEFTVCIDCWVPKQIARKRGNFRHNKKAK----NSSILDNRDGGGAK 175

Query: 1927 LDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAAVREDSQLKD 1748
            L +   KDAN     K+                       AK A++L + A+ E      
Sbjct: 176  LLESVVKDANYAMGKKV-------EAAVKAREMAVKKAIVAKRAVELASNALEE------ 222

Query: 1747 SRLSGGLAADDTKLAFLLHRTINSSPRISKN 1655
                     DD +LAF LHR +NSSPRISKN
Sbjct: 223  --------YDDAELAFRLHRAMNSSPRISKN 245


>ref|XP_003593131.1| hypothetical protein MTR_2g008130 [Medicago truncatula]
            gi|355482179|gb|AES63382.1| hypothetical protein
            MTR_2g008130 [Medicago truncatula]
          Length = 420

 Score =  100 bits (250), Expect = 2e-18
 Identities = 80/237 (33%), Positives = 110/237 (46%), Gaps = 21/237 (8%)
 Frame = -2

Query: 2263 GKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGISCVCCDRRVHGDCVSKY 2084
            GK+KLQTL S+WR+VLLC  CF  V+S++ CSYCF+E S   + CV C   VH +C  K 
Sbjct: 53   GKNKLQTLYSEWRVVLLCKKCFSCVKSSQICSYCFSESSSDSLRCVKCKHSVHKNCFLKN 112

Query: 2083 RGLG----LCSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISRAS----- 1931
            + +      C  S+   C+DCWVPK +  +   R      K+ SG   VK  R       
Sbjct: 113  KNVAPWSYSCVGSEFSVCVDCWVPKHVE-ISRRRTIRSLRKVKSG-VIVKKGRVDLVKES 170

Query: 1930 -----------SLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLV 1784
                       S++DV  KDA   A+ K+                       A   L++ 
Sbjct: 171  SRVLKGGNLTRSMEDV-VKDAKQKAKKKVEAAAMARRVASKKAVAARRAVELANKTLNI- 228

Query: 1783 AAAVREDSQLK-DSRLSGGLAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPK 1616
             AA RE+  L   S++        + LAF L   +N+SP ISK+   +D  NL APK
Sbjct: 229  -AANREEGTLNLPSKMDPVKVVGCSCLAFDL--CLNNSPMISKSRCLLDTNNLDAPK 282


>ref|XP_007204312.1| hypothetical protein PRUPE_ppb021745mg [Prunus persica]
            gi|462399843|gb|EMJ05511.1| hypothetical protein
            PRUPE_ppb021745mg [Prunus persica]
          Length = 353

 Score = 90.1 bits (222), Expect = 4e-15
 Identities = 53/135 (39%), Positives = 70/135 (51%), Gaps = 10/135 (7%)
 Frame = -2

Query: 2260 KDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCF-TEISDVGISCVCCDRRVHGDCVSKY 2084
            K KL  L S+WRIVLLC  CF  V+S++ CSYC+ T  S     C+ C R+VH  C S+Y
Sbjct: 42   KSKLHVLYSEWRIVLLCKKCFSRVESSELCSYCYSTSSSQESFFCLQCHRKVHRHCDSEY 101

Query: 2083 RGLGL----CSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISRASS---- 1928
            R + L    CS  +   C DCW+P+SL  V W R  + S    +G   V +    S    
Sbjct: 102  RSVALLSDSCSAMEFSVCADCWIPESL--VKWKRVVSSSKSRRTGKRRVGLGLGKSRVLA 159

Query: 1927 -LDDVAAKDANSTAE 1886
             +DD    DA  + E
Sbjct: 160  MVDDREIDDAFGSEE 174


>gb|AEJ72552.1| hypothetical protein [Malus domestica]
          Length = 588

 Score = 87.0 bits (214), Expect = 3e-14
 Identities = 46/107 (42%), Positives = 63/107 (58%), Gaps = 11/107 (10%)
 Frame = -2

Query: 2260 KDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEIS---DVGISCVCCDRRVHGDCVS 2090
            K KLQ L S+WR+VLLC  C   V+S++ CSYCF   S   +   +C  C+RRVH  C S
Sbjct: 39   KSKLQILYSEWRVVLLCKKCLTRVESSELCSYCFAATSPSQEDSFTCCQCNRRVHRRCDS 98

Query: 2089 KYRGLGLCSKS-----DSFTCIDCWVPKSL---NGVPWGRNPNGSSK 1973
            +YRG+ L S++     ++  C DCW+P+SL    GV   +N   S K
Sbjct: 99   EYRGIALLSQNSCLAVEAEVCADCWLPESLARWRGVVRSQNARRSGK 145


>gb|EPS68024.1| hypothetical protein M569_06755, partial [Genlisea aurea]
          Length = 113

 Score = 80.9 bits (198), Expect = 2e-12
 Identities = 44/93 (47%), Positives = 53/93 (56%), Gaps = 11/93 (11%)
 Frame = -2

Query: 2260 KDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGIS-----CVCCDRRVHGDC 2096
            +D+LQ LDS WRIVLLC  C H +     C YCF +I   GIS     CV C RR+H DC
Sbjct: 23   RDRLQPLDSVWRIVLLCRKCRHNLDIGHVCPYCFEKI---GISLDLCTCVICRRRIHKDC 79

Query: 2095 VSKY------RGLGLCSKSDSFTCIDCWVPKSL 2015
            + KY      R LG   +    TCIDCW+P+ L
Sbjct: 80   IRKYGRFTPWRFLG--GEVGFSTCIDCWIPQLL 110


>ref|XP_002318448.2| hypothetical protein POPTR_0012s02720g [Populus trichocarpa]
            gi|550326239|gb|EEE96668.2| hypothetical protein
            POPTR_0012s02720g [Populus trichocarpa]
          Length = 311

 Score = 70.1 bits (170), Expect = 4e-09
 Identities = 83/309 (26%), Positives = 121/309 (39%), Gaps = 7/309 (2%)
 Frame = -2

Query: 2080 GLGLCSKSDSFTCIDCWVPKSLNGVPWG--RNPNGSSKIVSGNCSVKISRASSLDDVAAK 1907
            GL + S        +CWVP S+     G  R+   +S  V G               + +
Sbjct: 30   GLRISSHKRLEILYNCWVPNSVASKRGGVCRDSKRNSGRVLGR--------------SLE 75

Query: 1906 DANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVA--AAVREDSQLKDSRLSG 1733
            DAN   + K+                       A+ ALD+VA    V+E++ +       
Sbjct: 76   DANCVVQEKVEAAVRARDLAVRKALEERNAADVARKALDMVANNGVVKENNDV------- 128

Query: 1732 GLAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYLLDRQSDHGSHSVH 1553
                DD +LAF LHR INSSPRIS NL  ++   L   +  +GNG    R SD  +    
Sbjct: 129  ----DDFELAFRLHRAINSSPRISSNLCMVNSSCLGVARRGEGNGQTRIRNSDFRNPIAC 184

Query: 1552 GELEVCTNNTMLENPDKVVSEPSVRIGSLDHSSSMGLGVLEPKMKVYTRESHKVKNFKKN 1373
            G+L            D  +S+      S+D       G+ + K++   ++        K 
Sbjct: 185  GKL------------DDFLSK------SVDVECRKSNGIGDGKIRPNAKKDGNAGKCSKM 226

Query: 1372 GEDAFMGNF--EKGLEHCYRKQEVLEHKVSSNSGGTHCQFPCDEDSSTPEKKKCHG-SDM 1202
            GE +F       +G +H            S NSG     F    +S TP+ K C G SD 
Sbjct: 227  GEQSFFSKLIDSRGNDH------------SVNSGSQ--SFRERNESMTPDDKSCKGKSDR 272

Query: 1201 YLRKYSKRR 1175
            YL KYS+R+
Sbjct: 273  YLLKYSRRK 281


>ref|NP_001174784.1| Os06g0468300 [Oryza sativa Japonica Group]
            gi|54290641|dbj|BAD62212.1| unknown protein [Oryza sativa
            Japonica Group] gi|125555297|gb|EAZ00903.1| hypothetical
            protein OsI_22931 [Oryza sativa Indica Group]
            gi|222635557|gb|EEE65689.1| hypothetical protein
            OsJ_21309 [Oryza sativa Japonica Group]
            gi|255677039|dbj|BAH93512.1| Os06g0468300 [Oryza sativa
            Japonica Group]
          Length = 383

 Score = 67.8 bits (164), Expect = 2e-08
 Identities = 74/271 (27%), Positives = 114/271 (42%), Gaps = 12/271 (4%)
 Frame = -2

Query: 2251 LQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFT----EISDVGISCVCCDRRVHGDCVSKY 2084
            ++ L S WR+VLLC  C   V+SA  CSYC +       D  ++C CC+R VH  C++  
Sbjct: 73   VRPLRSLWRVVLLCTECLSLVRSAAVCSYCLSLDNLPPEDSSVTCRCCNRCVHPYCIAGE 132

Query: 2083 RGLGLCSKSD--SFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISRASSLDDVAA 1910
                L    D  +F C+DC         P G+N   SS     +    ++R     D+ A
Sbjct: 133  HRAALIQPIDVENFICVDCCPTVK----PGGKNGGASSV----HMLQAVAREPRKGDIVA 184

Query: 1909 KDANSTA----ELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALD-LVAAAVREDSQLKDS 1745
            +   +      E+K+                        K A + LV+AA    SQ    
Sbjct: 185  ESKENAVRKAMEMKL----------------------AFKRAKEALVSAAGGRGSQ---R 219

Query: 1744 RLSGGLAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYLLDRQSDHGS 1565
             + G     D +LA  LH  +N S R S+  G+   G+  + +  KG+      +S  G 
Sbjct: 220  TVGGKPDLPDEELALQLHLAMNGSQRFSR-AGNTSGGD--SAEQCKGH------KSVIGG 270

Query: 1564 HSVHGELEVCTNNTMLE-NPDKVVSEPSVRI 1475
             + +G+ E+C  N M + + D+   EP  RI
Sbjct: 271  KNFYGDQELCVTNMMDQLDDDEAGVEPLCRI 301


>ref|XP_006656073.1| PREDICTED: uncharacterized protein LOC102716222 isoform X1 [Oryza
            brachyantha]
          Length = 392

 Score = 66.2 bits (160), Expect = 6e-08
 Identities = 83/303 (27%), Positives = 122/303 (40%), Gaps = 15/303 (4%)
 Frame = -2

Query: 2251 LQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFT----EISDVGISCVCCDRRVHGDCVSKY 2084
            ++ L S WRIVLLC  C + V+SA  CSYC +       D  ++C  C+R VH  C+S  
Sbjct: 73   VRPLRSLWRIVLLCTECLYLVRSAAVCSYCLSLDNLPPEDCSVTCRFCNRCVHHYCISGE 132

Query: 2083 RGLGLCSKSD--SFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNCSVKISRASSLDDVAA 1910
                L    D  +F C+DC       G   G  P    + V+         A + D+   
Sbjct: 133  HRTSLVQPIDVENFVCVDCCPTVKPGGKQGGVAPVHMLQAVAREPRKGEIVAEAKDNAVR 192

Query: 1909 KDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAAVREDSQLKDSRLSGG 1730
            K      E+K+                          A + V  A+   +    S+ + G
Sbjct: 193  K----AMEVKL--------------------------ASNRVKEALAPAAAGGGSQRTAG 222

Query: 1729 LAAD--DTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYLLDRQSDHGSHSV 1556
               D  D +LA  LH  +N S RIS+   +    + V  K  K         +      V
Sbjct: 223  CNPDLPDEELALQLHLAMNGSHRISRAGNTSGGDSAVQGKCHK---------TMVCGKKV 273

Query: 1555 HGELEVCTNNTMLENPDKVVS--EPSVRIG-----SLDHSSSMGLGVLEPKMKVYTRESH 1397
            +G+ E+C  N M++  D V +  EP  RIG      LD S ++ L  LE  +  + +ES 
Sbjct: 274  YGDQELCVTN-MMDQLDDVETGVEPLCRIGRPARRRLDPSVTIVL-ALECVVGKHVKESM 331

Query: 1396 KVK 1388
            KVK
Sbjct: 332  KVK 334


>gb|ABR16126.1| unknown [Picea sitchensis]
          Length = 756

 Score = 61.6 bits (148), Expect = 1e-06
 Identities = 37/98 (37%), Positives = 50/98 (51%), Gaps = 7/98 (7%)
 Frame = -2

Query: 2266 RGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISD------VGISCVCCDRRVH 2105
            RG   L+++ +  R V +C +C   + S   C  CF +ISD      V +SC  C  RVH
Sbjct: 59   RGGPTLESVKNGKRFVSVCKSCNCLLNSGGICCCCFRKISDDKGLLVVALSCCKCRHRVH 118

Query: 2104 GDCVSKYRG-LGLCSKSDSFTCIDCWVPKSLNGVPWGR 1994
             DCVSK  G   +CS S SF C+DC   K +     G+
Sbjct: 119  CDCVSKNIGEEDVCSDSKSFVCVDCSPLKGIRDACGGK 156

