BLASTX nr result

ID: Akebia23_contig00004506 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00004506
         (2429 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006854566.1| hypothetical protein AMTR_s00030p00103480 [A...   158   1e-35
ref|XP_007148443.1| hypothetical protein PHAVU_006G209200g [Phas...   143   4e-31
ref|XP_004230610.1| PREDICTED: uncharacterized protein LOC101262...   134   2e-28
ref|XP_006487482.1| PREDICTED: uncharacterized protein LOC102618...   134   3e-28
ref|XP_006423726.1| hypothetical protein CICLE_v10028677mg [Citr...   131   2e-27
ref|XP_002523767.1| conserved hypothetical protein [Ricinus comm...   124   3e-25
ref|XP_007043041.1| Uncharacterized protein isoform 1 [Theobroma...   121   1e-24
ref|XP_002321383.1| hypothetical protein POPTR_0015s01060g [Popu...   120   3e-24
ref|XP_003593131.1| hypothetical protein MTR_2g008130 [Medicago ...   115   9e-23
ref|XP_007204312.1| hypothetical protein PRUPE_ppb021745mg [Prun...   103   4e-19
gb|EPS68024.1| hypothetical protein M569_06755, partial [Genlise...   100   5e-18
gb|AEJ72552.1| hypothetical protein [Malus domestica]                  98   2e-17
ref|NP_001174784.1| Os06g0468300 [Oryza sativa Japonica Group] g...    76   8e-11
ref|XP_006656073.1| PREDICTED: uncharacterized protein LOC102716...    72   2e-09
gb|ABR16126.1| unknown [Picea sitchensis]                              71   2e-09
ref|XP_002318448.2| hypothetical protein POPTR_0012s02720g [Popu...    70   5e-09
ref|XP_003571048.1| PREDICTED: uncharacterized protein LOC100843...    62   1e-06
ref|XP_002438424.1| hypothetical protein SORBIDRAFT_10g018040 [S...    62   1e-06

>ref|XP_006854566.1| hypothetical protein AMTR_s00030p00103480 [Amborella trichopoda]
            gi|548858252|gb|ERN16033.1| hypothetical protein
            AMTR_s00030p00103480 [Amborella trichopoda]
          Length = 736

 Score =  158 bits (399), Expect = 1e-35
 Identities = 199/743 (26%), Positives = 285/743 (38%), Gaps = 171/743 (23%)
 Frame = -2

Query: 2269 SLPPRKRLLASLDQTXXXXXXXXXXXXXXXLRN---------------PNLSSCHVCCSR 2135
            SLPPRKRLLA L Q                  +               P +S CH  CS 
Sbjct: 21   SLPPRKRLLAGLKQNGWVDLDHLVEESRSSTSSAKSMEIGNPNASKELPRISECH-SCSY 79

Query: 2134 ITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVG--ISCVCCDRRVH 1961
            + S +GKDKL TL S+WR+VLLC NC + V S  NCSYCF+ + + G  ++C  CD RVH
Sbjct: 80   LVSGKGKDKLHTLASEWRVVLLCKNCLNAVNSGTNCSYCFSALENSGCVLNCRKCDHRVH 139

Query: 1960 GDCVSKYR--------GLGLC----------------SKSDSFTC--------------- 1898
              C SK+R        G  LC                +KSDSF                 
Sbjct: 140  QGCASKHRGSLLQCSSGSFLCVDCWVPKSRLNFGCGSNKSDSFGTQDSKSLLRFGETKVF 199

Query: 1897 --IDCWVPKSLNGVPWGRNPNGS--------------------------------SKIVS 1820
               D    KS++   +    +GS                                 K VS
Sbjct: 200  GDCDSKAEKSVSSASFPETNSGSVDKTMVSVAIKPLDKENPCIDGESELNKYQDAEKHVS 259

Query: 1819 GNCSVKISRAS-------SLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXA 1661
             + S K SR S       SL+++  K+ANS A   +                       A
Sbjct: 260  DSVSEKASRFSFNGNCCRSLEEIV-KEANSAAARAMTIAASAKENALRKAMVARNAASAA 318

Query: 1660 KNALDLVAAAVREDSQLKDSRLSGG-LAADD----------------------------- 1571
            +NAL+ +A   +E+++ K+S  S   L  DD                             
Sbjct: 319  RNALNFLAILEQEENEAKESLQSNASLMGDDGNSNIADRAEKSNGIHLKAGSLPESHEVA 378

Query: 1570 -TKLAFLLHRTINSSPRISKNLGSMD--------LGN---------LVAPKLRKGNGYLL 1445
              +LA  LHR +NSSPRIS+  G+ +        L N         +   K    NG+  
Sbjct: 379  DEELALRLHRAMNSSPRISRRRGAPNGIQLKECKLSNSTKCEFNCMVTTKKQNCSNGFGN 438

Query: 1444 DR----------------------QSDHGSHSVHGELEVCTNNTM---LENPDKVVSEPS 1340
            +                       +++ GS SV G L +CT + +   L++PD   +EPS
Sbjct: 439  EEFRRNERRFRRDSEVIGQSTSILKTESGSQSVCGNLHLCTEDKIDGTLDHPD---AEPS 495

Query: 1339 VRIGSLDHSSSMGLGVLEPKMKVYTRESHKVKNFK-KNGEDAFMGNFEKGLEHCYRKQEV 1163
            V  G+L+ ++S+G+ V E K +   R+   +        E+   G  +     C    +V
Sbjct: 496  VGNGALELANSIGMAVEEFKKR---RDDEAINGVSFHEDEEKKEGTMQGAFRSCRADGKV 552

Query: 1162 LEHKVSSNSGGTHCQFPCDEDSSTPEKKKCHGSDMYLRKYSKRRTSIKSTLDQSSSHVNE 983
                +  N G        + ++S          D    K  K  T +K   +Q+S     
Sbjct: 553  ---DMKQNGG-------MNMENSLKNGLLIVDGDNSGVKDMKPETPVKE--EQASCSNKA 600

Query: 982  CKENGDDSVMGNFDRGLQASLIRLSNGNGIVELPVKEQVSCYLKQEGLVPKVSSNNRGTQ 803
               +G+DS   + D G ++S       NG     V +     +K  G   K+S  N    
Sbjct: 601  MNSSGEDS---SLDTGFESSQKWKGGENGGSSSNVSK-----VKPFGYRAKLSKFNCAQ- 651

Query: 802  CQSACDEDTSIPERKRCHGLDMYLKTYSKRHTSLKVILHQKTKVLFEDSPLESQASTPGL 623
               A + D   P++KR        K   KRH+S+KVIL +KTK L ED PLES+A T  L
Sbjct: 652  -SQAREGDPLKPQKKRSILPHPDSKRPIKRHSSMKVILDRKTKSLAEDFPLESKALTNAL 710

Query: 622  SSLQLNCSNVCRTFSDASFQSSS 554
              LQ NC+   +  SD+S  S S
Sbjct: 711  PLLQRNCAKAPKKLSDSSHGSPS 733


>ref|XP_007148443.1| hypothetical protein PHAVU_006G209200g [Phaseolus vulgaris]
            gi|561021666|gb|ESW20437.1| hypothetical protein
            PHAVU_006G209200g [Phaseolus vulgaris]
          Length = 439

 Score =  143 bits (360), Expect = 4e-31
 Identities = 122/420 (29%), Positives = 181/420 (43%), Gaps = 33/420 (7%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGI 1991
            PNL+ CH C  ++    GK++L+TL S+WR+VLLC  CF  V+S++ CSYCF+ +S    
Sbjct: 40   PNLTECHACGFKVDVCSGKNRLRTLYSEWRVVLLCKKCFVSVESSQICSYCFSGMSLESY 99

Query: 1990 SCVCCDRRVHGDCVSKYRGL---GLCSKSDSFT-CIDCWVPKSLN----------GVPWG 1853
             C  C   VH  C  KY+        S    F+ C+DCW+PK L           G   G
Sbjct: 100  RCNQCQHSVHKTCFLKYKNAPPWSYASMGSEFSVCVDCWIPKHLEISRRRKRRVMGDENG 159

Query: 1852 R--NPNGSSKIVSGNCSVKISRASSLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXX 1679
            R     GSS+++ G      + A S++D+  +DA      K+                  
Sbjct: 160  RIILEKGSSRVLPGG-----NLARSMEDL-VEDAKREVGEKVEAAARAREGAVKKALVAR 213

Query: 1678 XXXXXAKNALDLVAAAVREDSQLKDSRLSGGLAADDTKLAFLLHRTINSSPRISKNLGSM 1499
                 AKNAL LVA    E S     +       D ++L F LH   N+ PRISK+   +
Sbjct: 214  RAVEIAKNALSLVANG-EESSLNPPPKREAFKVLDGSELTFELHPEFNTLPRISKSCCLL 272

Query: 1498 DLGNLVAPKLRKGNGYLLDRQSDHGSHSVHGELEVCTNNTMLENPDKVVSEPSVRIGSLD 1319
            +   L APK    +     + S+  +     + EV  +N +L +  K + EP V +G+LD
Sbjct: 273  NTSFLDAPKRLSPSVDSSCKTSNSRNADYRDKHEVSCDNKLLADSCKSLCEPLVSVGTLD 332

Query: 1318 HSSSMGLGVL---EPKMKVYTRESHKV---------KNFKKNGE----DAFMGNFEKGLE 1187
              SS GL +L      M+  +++  +          +  +K GE    D  +   E    
Sbjct: 333  SGSSTGLNLLCMGRSGMETGSKDGERTAESDGEGIGEELQKEGEGSCSDRIINLSEDSCM 392

Query: 1186 HCYRKQEVLEHKVSSNSGGTHCQFPCDEDSSTPEKKKCHGS-DMYLRKYSKRRTSIKSTL 1010
               RKQ                      DS+    K+C+G  D Y  KYS+R  S+KS +
Sbjct: 393  ELDRKQ---------------------ADSALHRVKRCNGQPDRYFLKYSRRNCSLKSKI 431


>ref|XP_004230610.1| PREDICTED: uncharacterized protein LOC101262666 [Solanum
            lycopersicum]
          Length = 488

 Score =  134 bits (337), Expect = 2e-28
 Identities = 135/471 (28%), Positives = 193/471 (40%), Gaps = 40/471 (8%)
 Frame = -2

Query: 2299 PPPPSTDVQPSLPPRKRLLASLDQTXXXXXXXXXXXXXXXLRNPNLSSCHVCCSRITSNR 2120
            PPPP+T      P     + S + T                  PNLS CH C  RI    
Sbjct: 31   PPPPATATSSKTPLNCVPIQSTNSTSSSSSAFDQFSKRVTRDLPNLSDCHGCGVRINHTD 90

Query: 2119 GKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGIS-CVCCDRRVHGDCVSK 1943
              D+L TLDS WRIVLLC NC   V S + C YCF    D   S C  C R+VH DCVS+
Sbjct: 91   PDDRLLTLDSFWRIVLLCKNCIRCVDSGQTCPYCFKNTDDTDCSKCRSCKRQVHKDCVSR 150

Query: 1942 YRG---LGLCSKSDS--FTCIDCWVP----KSLNGVPWGRNPNGSSKIVSGN--CSVKIS 1796
            Y        CS+ +   F CIDCWVP    KS+      +    + +  S +   S KI+
Sbjct: 151  YGNSAPWSFCSREEGGLFVCIDCWVPNFFKKSIGDCRKIQKDVLNIQHCSSDFKSSEKIA 210

Query: 1795 RASSLDD------VAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAA 1634
            + ++L+       V  K  NST +  +                       AKN + L  +
Sbjct: 211  KHANLEGLRKEVVVGLKAKNSTLQKAV----------------------VAKNPMGLAKS 248

Query: 1633 AVREDSQLKDSRLSGGLAA---DDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRK 1463
            A+  +S +K  +  G + +   +D +LAF LHR++NSSPRISK LG  +   +  P+++ 
Sbjct: 249  AL--ESVVKKGKSKGKVVSKDVNDAQLAFQLHRSMNSSPRISKTLGPKNSSYVGGPEIQT 306

Query: 1462 GNGYLLDRQSDHGSHSVHGELEVCTNNT-----------MLENPDKVVSEPSVRI----- 1331
                  +R   +      G++   +  T           + E  D+  SE S R+     
Sbjct: 307  LPSSTGERLKVYFRTKYRGKVGPTSPETPPSVMVYSRARLKEKVDQTTSETSPRVTVYSR 366

Query: 1330 GSLDHSSSMGLGVLEPKMKVYTRESHKVKNFKKNGE-DAFMGNFEKG--LEHCYRKQEVL 1160
              L            P + VY+R   K K  + + E    +   E G  ++    K E+L
Sbjct: 367  RRLKEEVGKASSDASPCLLVYSRTRFKEKVCQTDSEAPPCVTTNECGSCVDSACSKAELL 426

Query: 1159 EHKVSSNSGGTHCQFPCDEDSSTPEKKKCHGSDMYLRKYSKRRTSIKSTLD 1007
             +K +     T     CDE       K     D YL KYS+R+   K   D
Sbjct: 427  TYKRNKLKRKT-----CDE-------KVVFTEDRYLLKYSRRKRCWKPGSD 465


>ref|XP_006487482.1| PREDICTED: uncharacterized protein LOC102618081 isoform X1 [Citrus
            sinensis] gi|568868391|ref|XP_006487483.1| PREDICTED:
            uncharacterized protein LOC102618081 isoform X2 [Citrus
            sinensis]
          Length = 373

 Score =  134 bits (336), Expect = 3e-28
 Identities = 109/403 (27%), Positives = 172/403 (42%), Gaps = 1/403 (0%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGI 1991
            PNLS C  C  RI S  G DK+Q L S+WRIVLLC  C   ++S+K CSYC+ E  +  +
Sbjct: 19   PNLSECQACGFRIDSCTGNDKIQILYSEWRIVLLCCKCLDRIESSKICSYCYKETIEDFL 78

Query: 1990 SCVCCDRRVHGDCVSKYRGLGLCSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNC 1811
            +C  C R VH +C  K + +   S  +S  C+DCWVPKSL      R      KI + + 
Sbjct: 79   TCSQCKRSVHRNCFLKCKAIDSMSSLESLICVDCWVPKSL---VKRRELLTCRKICNSSA 135

Query: 1810 SVKISRASSLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAA 1631
             + IS         ++ +N      +                         NALDL    
Sbjct: 136  DLGISN--------SRVSNGGGSCAVVERKIVFALMATEMIGRKPFVPKKSNALDL--EV 185

Query: 1630 VREDSQLKDSRLSGGLAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGY 1451
             RE+      +++   + DD +LAF LHR++NSSPRISKNL  ++  +   PK ++ +G 
Sbjct: 186  KREEGGEIHKKVA---SDDDAELAFQLHRSMNSSPRISKNLCVVNSSDSHVPKKQECDGV 242

Query: 1450 LLDRQSDHGSHSVHGELEVCTNNTMLENPDKVVSEPSVRIGSLDHSSSMGLGVLEPKMKV 1271
            L+   S  GS         C++N +  + D+  +    R        S        K+ V
Sbjct: 243  LILGGSGSGS---------CSSNALKSSGDETSTNFDSRPSYDKRCESASY-----KLAV 288

Query: 1270 YTRESHK-VKNFKKNGEDAFMGNFEKGLEHCYRKQEVLEHKVSSNSGGTHCQFPCDEDSS 1094
              ++  +    ++K G   F+  + +        + VL++K                   
Sbjct: 289  CNKQPDRFFFKYRKRGSRRFLLKYRR---RSSSSKPVLDNK------------------- 326

Query: 1093 TPEKKKCHGSDMYLRKYSKRRTSIKSTLDQSSSHVNECKENGD 965
                     SD++L KY +RR++    +  + S +  C +  D
Sbjct: 327  ---------SDIFLLKYRRRRSAGSKPVPDNKSDIEICNQKPD 360


>ref|XP_006423726.1| hypothetical protein CICLE_v10028677mg [Citrus clementina]
            gi|567862146|ref|XP_006423727.1| hypothetical protein
            CICLE_v10028677mg [Citrus clementina]
            gi|557525660|gb|ESR36966.1| hypothetical protein
            CICLE_v10028677mg [Citrus clementina]
            gi|557525661|gb|ESR36967.1| hypothetical protein
            CICLE_v10028677mg [Citrus clementina]
          Length = 373

 Score =  131 bits (329), Expect = 2e-27
 Identities = 90/272 (33%), Positives = 131/272 (48%), Gaps = 1/272 (0%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGI 1991
            PNLS C  C  RI S  G DK+Q L S+WRIVLLC  C   ++S+K CSYC+ E  +  +
Sbjct: 19   PNLSECQACGFRIDSCTGNDKIQILYSEWRIVLLCCKCLDRIESSKICSYCYKETIEDFL 78

Query: 1990 SCVCCDRRVHGDCVSKYRGLGLCSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKIVSGNC 1811
            +C  C R VH +C  K + +   S  +S  C+DCWVPKSL      R      KI + + 
Sbjct: 79   TCSQCKRSVHRNCFLKCKAIDSMSSLESLICVDCWVPKSL---VKRRELLTCRKICNSSA 135

Query: 1810 SVKISRASSLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVAAA 1631
             + IS         ++ +N      +                         NALDL    
Sbjct: 136  DLGISN--------SRVSNGGGSCAVVERKIVFALMASEMIGRKPFVPKKSNALDL---E 184

Query: 1630 VREDSQLKDSRLSGGLAA-DDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNG 1454
            V+ D   +   +   +A+ DD +LAF LHR++NSSPRISKNL  ++  +   PK ++ +G
Sbjct: 185  VKRD---EGGEIHKKVASDDDAELAFQLHRSMNSSPRISKNLCVVNSSDSHVPKKQECDG 241

Query: 1453 YLLDRQSDHGSHSVHGELEVCTNNTMLENPDK 1358
             L+   S  GS         C++N +  + D+
Sbjct: 242  VLILGGSGSGS---------CSSNALKSSGDE 264


>ref|XP_002523767.1| conserved hypothetical protein [Ricinus communis]
            gi|223536979|gb|EEF38616.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  124 bits (310), Expect = 3e-25
 Identities = 116/401 (28%), Positives = 170/401 (42%), Gaps = 19/401 (4%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITS-NRGKD------KLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFT 2012
            PNLS CH C  R+   + GK+      +LQTL S+WRIVLLC  CF  V+S   C+YCF 
Sbjct: 30   PNLSECHSCGFRVDCCSNGKNNDSSSGRLQTLYSEWRIVLLCKICFFRVESCHICAYCFK 89

Query: 2011 EISDVGISCVC----CDRRVHGDCVSKYRGLGLCSKSDSFT-CIDCWVPKSLNGVPWGRN 1847
            ++S    SC+     C R +H  C S Y      S S  F+ C+DCWVPKS+      R 
Sbjct: 90   DLSSSDNSCLFRCPQCKRIIHRTCFSNYSNFAPWSFSSKFSVCVDCWVPKSIA----SRR 145

Query: 1846 PNGSSKIVSGNCSVKISRASSLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXX 1667
                +K    NC     + SSL+DV  +DA+   + K+                      
Sbjct: 146  ACFRTKKSKSNC-----KYSSLEDV-VRDADFDVQRKVEAAAKARELVVEKALAARKAAQ 199

Query: 1666 XAKNALDLVAAAVREDSQLKDSRLSGGLAADDTKLAFLLHRTINSSPRISKNLGSMDLGN 1487
               NA DLV+   R+D+ + +         DD +LA  LH  +NSSPRI  NL S+D   
Sbjct: 200  LVHNAFDLVSE--RDDNGIAN--------VDDVQLALHLHLALNSSPRILSNLCSLD--- 246

Query: 1486 LVAPKLRKGNGYLLDRQSDHGSHSVHGELEVCTNNTMLENPDKVVSEPS--VRIGSLD-- 1319
                             S   S  V G +    N++   N  K  + PS  VR+   D  
Sbjct: 247  -----------------SAGSSPLVRGRVCRKLNHS---NGGKPAAGPSVPVRVSGYDSS 286

Query: 1318 -HSSSMGLGVLEPKMKVYTRESHKVKNFK-KNGEDAFMGNFEKGLEHCYRKQEVLEHKVS 1145
             H  S G   ++  +   +R   K  + + K GE +          H  R+ +       
Sbjct: 287  LHMDSFGSNGIDENL---SRRDAKDSDIRLKEGEGSCFDKVMNSKAHSCRQGDGFIVLAD 343

Query: 1144 SNSGGTHCQFPCDEDSSTPEKKKCHGS-DMYLRKYSKRRTS 1025
                G   ++       T   ++C+   ++YLRKY++R ++
Sbjct: 344  ERCNGKPDRYSIKYTRRTSADERCNRKPEVYLRKYARRTSA 384


>ref|XP_007043041.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590688771|ref|XP_007043042.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508706976|gb|EOX98872.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508706977|gb|EOX98873.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 442

 Score =  121 bits (304), Expect = 1e-24
 Identities = 81/228 (35%), Positives = 110/228 (48%), Gaps = 8/228 (3%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGI 1991
            PNL+ C  C SR  +  GK+++QTL S+WRIVLLC+ C+H V S++ CSYCF E S+   
Sbjct: 44   PNLTECQACGSRTDTANGKNRIQTLYSEWRIVLLCSRCYHRVDSSEICSYCFKEASEDCF 103

Query: 1990 SCVCCDRRVHGDCVSKYRGL-----GLCSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKI 1826
            SC  C R +H  C    + +      +C  S+   CIDCWVPK +         N  +K 
Sbjct: 104  SCGQCKRSLHKTCFLNCKSVPPWSFSICG-SEFTVCIDCWVPKQIARKRGNFRHNKKAK- 161

Query: 1825 VSGNCSVKISR---ASSLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKN 1655
               N S+  +R    + L +   KDAN     K+                       AK 
Sbjct: 162  ---NSSILDNRDGGGAKLLESVVKDANYAMGKKV-------EAAVKAREMAVKKAIVAKR 211

Query: 1654 ALDLVAAAVREDSQLKDSRLSGGLAADDTKLAFLLHRTINSSPRISKN 1511
            A++L + A+ E               DD +LAF LHR +NSSPRISKN
Sbjct: 212  AVELASNALEE--------------YDDAELAFRLHRAMNSSPRISKN 245


>ref|XP_002321383.1| hypothetical protein POPTR_0015s01060g [Populus trichocarpa]
            gi|222868379|gb|EEF05510.1| hypothetical protein
            POPTR_0015s01060g [Populus trichocarpa]
          Length = 497

 Score =  120 bits (301), Expect = 3e-24
 Identities = 126/424 (29%), Positives = 182/424 (42%), Gaps = 16/424 (3%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEIS--DV 1997
            PNL+ C  C  R  S++   +L+ L S+WRI+LLC  CF+ V+S+K CSYCF + S    
Sbjct: 112  PNLTECQSCGLRTPSHK---RLEILYSEWRIILLCTKCFNLVESSKICSYCFRKFSVKTK 168

Query: 1996 GISCVCCDRRVHGDCVSKYRGLGLCSKS---DS---FTCIDCWVPKSLNGVPWGRNPNGS 1835
             + C  C R VH  C +K + +   S S   DS     CIDCWVPKS+  +  G+    S
Sbjct: 169  CLRCCQCKRVVHKSCFAKRKNVAPWSYSCYGDSGGFSVCIDCWVPKSV-AIKRGKVCGVS 227

Query: 1834 SKIVSGNCSVKISRASSLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKN 1655
             +  +G          SL+DV  KDA  T + K+                       A+ 
Sbjct: 228  KRNDTG------VLGRSLEDV-VKDAACTVQEKVESAVRARELAVRKALEARKAADVARK 280

Query: 1654 ALDLVAAAVREDSQLKDSRLSGGLAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAP 1475
            ALDLVA    E  +  +  +      DD +LAF LHR +NSSPRIS NL  ++   L   
Sbjct: 281  ALDLVAN--NEGGKENNDNV------DDIELAFQLHRAMNSSPRISSNLCLVNSSCLGVT 332

Query: 1474 KLRKGNGYLLDRQSDHGSHSVHGELEVCTNNTMLENPDKVVSEPSVRIGSLDHSSSMGLG 1295
             + +GNG +  R S+  +    G+L            D  +S+ SV +G    S+    G
Sbjct: 333  MIGEGNGEMRIRNSELRNLGAFGKL------------DGFMSK-SVDVGR-RKSNGNDDG 378

Query: 1294 VLEPKMKVYTRESHKVKNFKKNGEDAFMGNFEKGLEHCYRKQEVLEHKVSSNSGGTHCQF 1115
            V+ P  K                +D  +G          ++QE        NS G  C  
Sbjct: 379  VIRPDAK----------------KDRNVG---------MQQQEQSFFNKLINSRGNDCSV 413

Query: 1114 PCD-------EDSSTPEKKKC-HGSDMYLRKYSKRRTSIKSTLDQSSSHVNECKENGDDS 959
              D        +S  P+ K C    D YL KYS++R   K    +    +  C+   D+ 
Sbjct: 414  NSDFQSYREGNESLVPDDKGCKRKHDRYLLKYSRKRVLFK--YSRRKVMLKYCRRKLDER 471

Query: 958  VMGN 947
            ++ N
Sbjct: 472  LIPN 475


>ref|XP_003593131.1| hypothetical protein MTR_2g008130 [Medicago truncatula]
            gi|355482179|gb|AES63382.1| hypothetical protein
            MTR_2g008130 [Medicago truncatula]
          Length = 420

 Score =  115 bits (288), Expect = 9e-23
 Identities = 87/254 (34%), Positives = 119/254 (46%), Gaps = 21/254 (8%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGI 1991
            PNL+ CH C  +I    GK+KLQTL S+WR+VLLC  CF  V+S++ CSYCF+E S   +
Sbjct: 36   PNLTECHACGFKIDVCTGKNKLQTLYSEWRVVLLCKKCFSCVKSSQICSYCFSESSSDSL 95

Query: 1990 SCVCCDRRVHGDCVSKYRGLG----LCSKSDSFTCIDCWVPKSLNGVPWGRNPNGSSKIV 1823
             CV C   VH +C  K + +      C  S+   C+DCWVPK +  +   R      K+ 
Sbjct: 96   RCVKCKHSVHKNCFLKNKNVAPWSYSCVGSEFSVCVDCWVPKHVE-ISRRRTIRSLRKVK 154

Query: 1822 SGNCSVKISRAS----------------SLDDVAAKDANSTAELKIXXXXXXXXXXXXXX 1691
            SG   VK  R                  S++DV  KDA   A+ K+              
Sbjct: 155  SG-VIVKKGRVDLVKESSRVLKGGNLTRSMEDV-VKDAKQKAKKKVEAAAMARRVASKKA 212

Query: 1690 XXXXXXXXXAKNALDLVAAAVREDSQLK-DSRLSGGLAADDTKLAFLLHRTINSSPRISK 1514
                     A   L++  AA RE+  L   S++        + LAF L   +N+SP ISK
Sbjct: 213  VAARRAVELANKTLNI--AANREEGTLNLPSKMDPVKVVGCSCLAFDL--CLNNSPMISK 268

Query: 1513 NLGSMDLGNLVAPK 1472
            +   +D  NL APK
Sbjct: 269  SRCLLDTNNLDAPK 282


>ref|XP_007204312.1| hypothetical protein PRUPE_ppb021745mg [Prunus persica]
            gi|462399843|gb|EMJ05511.1| hypothetical protein
            PRUPE_ppb021745mg [Prunus persica]
          Length = 353

 Score =  103 bits (257), Expect = 4e-19
 Identities = 62/155 (40%), Positives = 80/155 (51%), Gaps = 12/155 (7%)
 Frame = -2

Query: 2170 PNLSSCHVCCSR--ITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCF-TEISD 2000
            PNLS CH C  R  I +   K KL  L S+WRIVLLC  CF  V+S++ CSYC+ T  S 
Sbjct: 22   PNLSECHSCHLRVDIANANAKSKLHVLYSEWRIVLLCKKCFSRVESSELCSYCYSTSSSQ 81

Query: 1999 VGISCVCCDRRVHGDCVSKYRGLGL----CSKSDSFTCIDCWVPKSLNGVPWGRNPNGSS 1832
                C+ C R+VH  C S+YR + L    CS  +   C DCW+P+SL  V W R  + S 
Sbjct: 82   ESFFCLQCHRKVHRHCDSEYRSVALLSDSCSAMEFSVCADCWIPESL--VKWKRVVSSSK 139

Query: 1831 KIVSGNCSVKISRASS-----LDDVAAKDANSTAE 1742
               +G   V +    S     +DD    DA  + E
Sbjct: 140  SRRTGKRRVGLGLGKSRVLAMVDDREIDDAFGSEE 174


>gb|EPS68024.1| hypothetical protein M569_06755, partial [Genlisea aurea]
          Length = 113

 Score = 99.8 bits (247), Expect = 5e-18
 Identities = 53/111 (47%), Positives = 62/111 (55%), Gaps = 11/111 (9%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISDVGI 1991
            PN S CH C SRI     +D+LQ LDS WRIVLLC  C H +     C YCF +I   GI
Sbjct: 5    PNFSDCHCCGSRINHTNPRDRLQPLDSVWRIVLLCRKCRHNLDIGHVCPYCFEKI---GI 61

Query: 1990 S-----CVCCDRRVHGDCVSKY------RGLGLCSKSDSFTCIDCWVPKSL 1871
            S     CV C RR+H DC+ KY      R LG   +    TCIDCW+P+ L
Sbjct: 62   SLDLCTCVICRRRIHKDCIRKYGRFTPWRFLG--GEVGFSTCIDCWIPQLL 110


>gb|AEJ72552.1| hypothetical protein [Malus domestica]
          Length = 588

 Score = 98.2 bits (243), Expect = 2e-17
 Identities = 54/127 (42%), Positives = 72/127 (56%), Gaps = 13/127 (10%)
 Frame = -2

Query: 2170 PNLSSCHVCCSR--ITSNRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEIS-- 2003
            PNL  CH C  R  I +   K KLQ L S+WR+VLLC  C   V+S++ CSYCF   S  
Sbjct: 19   PNLLECHCCHLRVDIANASAKSKLQILYSEWRVVLLCKKCLTRVESSELCSYCFAATSPS 78

Query: 2002 -DVGISCVCCDRRVHGDCVSKYRGLGLCSKS-----DSFTCIDCWVPKSL---NGVPWGR 1850
             +   +C  C+RRVH  C S+YRG+ L S++     ++  C DCW+P+SL    GV   +
Sbjct: 79   QEDSFTCCQCNRRVHRRCDSEYRGIALLSQNSCLAVEAEVCADCWLPESLARWRGVVRSQ 138

Query: 1849 NPNGSSK 1829
            N   S K
Sbjct: 139  NARRSGK 145


>ref|NP_001174784.1| Os06g0468300 [Oryza sativa Japonica Group]
            gi|54290641|dbj|BAD62212.1| unknown protein [Oryza sativa
            Japonica Group] gi|125555297|gb|EAZ00903.1| hypothetical
            protein OsI_22931 [Oryza sativa Indica Group]
            gi|222635557|gb|EEE65689.1| hypothetical protein
            OsJ_21309 [Oryza sativa Japonica Group]
            gi|255677039|dbj|BAH93512.1| Os06g0468300 [Oryza sativa
            Japonica Group]
          Length = 383

 Score = 75.9 bits (185), Expect = 8e-11
 Identities = 81/295 (27%), Positives = 123/295 (41%), Gaps = 15/295 (5%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITS---NRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFT---- 2012
            PNL+ CH C  R         +  ++ L S WR+VLLC  C   V+SA  CSYC +    
Sbjct: 49   PNLNLCHCCGVRFPPAPPGAKRRPVRPLRSLWRVVLLCTECLSLVRSAAVCSYCLSLDNL 108

Query: 2011 EISDVGISCVCCDRRVHGDCVSKYRGLGLCSKSD--SFTCIDCWVPKSLNGVPWGRNPNG 1838
               D  ++C CC+R VH  C++      L    D  +F C+DC         P G+N   
Sbjct: 109  PPEDSSVTCRCCNRCVHPYCIAGEHRAALIQPIDVENFICVDCCPTVK----PGGKNGGA 164

Query: 1837 SSKIVSGNCSVKISRASSLDDVAAKDANSTA----ELKIXXXXXXXXXXXXXXXXXXXXX 1670
            SS     +    ++R     D+ A+   +      E+K+                     
Sbjct: 165  SSV----HMLQAVAREPRKGDIVAESKENAVRKAMEMKL--------------------- 199

Query: 1669 XXAKNALD-LVAAAVREDSQLKDSRLSGGLAADDTKLAFLLHRTINSSPRISKNLGSMDL 1493
               K A + LV+AA    SQ     + G     D +LA  LH  +N S R S+  G+   
Sbjct: 200  -AFKRAKEALVSAAGGRGSQ---RTVGGKPDLPDEELALQLHLAMNGSQRFSR-AGNTSG 254

Query: 1492 GNLVAPKLRKGNGYLLDRQSDHGSHSVHGELEVCTNNTMLE-NPDKVVSEPSVRI 1331
            G+  + +  KG+      +S  G  + +G+ E+C  N M + + D+   EP  RI
Sbjct: 255  GD--SAEQCKGH------KSVIGGKNFYGDQELCVTNMMDQLDDDEAGVEPLCRI 301


>ref|XP_006656073.1| PREDICTED: uncharacterized protein LOC102716222 isoform X1 [Oryza
            brachyantha]
          Length = 392

 Score = 71.6 bits (174), Expect = 2e-09
 Identities = 88/327 (26%), Positives = 131/327 (40%), Gaps = 18/327 (5%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITS---NRGKDKLQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFT---- 2012
            P+++ CH C  R         +  ++ L S WRIVLLC  C + V+SA  CSYC +    
Sbjct: 49   PDINLCHCCGVRFPPPPPGAKRRPVRPLRSLWRIVLLCTECLYLVRSAAVCSYCLSLDNL 108

Query: 2011 EISDVGISCVCCDRRVHGDCVSKYRGLGLCSKSD--SFTCIDCWVPKSLNGVPWGRNPNG 1838
               D  ++C  C+R VH  C+S      L    D  +F C+DC       G   G  P  
Sbjct: 109  PPEDCSVTCRFCNRCVHHYCISGEHRTSLVQPIDVENFVCVDCCPTVKPGGKQGGVAPVH 168

Query: 1837 SSKIVSGNCSVKISRASSLDDVAAKDANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAK 1658
              + V+         A + D+   K      E+K+                         
Sbjct: 169  MLQAVAREPRKGEIVAEAKDNAVRK----AMEVKL------------------------- 199

Query: 1657 NALDLVAAAVREDSQLKDSRLSGGLAAD--DTKLAFLLHRTINSSPRISKNLGSMDLGNL 1484
             A + V  A+   +    S+ + G   D  D +LA  LH  +N S RIS+   +    + 
Sbjct: 200  -ASNRVKEALAPAAAGGGSQRTAGCNPDLPDEELALQLHLAMNGSHRISRAGNTSGGDSA 258

Query: 1483 VAPKLRKGNGYLLDRQSDHGSHSVHGELEVCTNNTMLENPDKVVS--EPSVRIG-----S 1325
            V  K  K         +      V+G+ E+C  N M++  D V +  EP  RIG      
Sbjct: 259  VQGKCHK---------TMVCGKKVYGDQELCVTN-MMDQLDDVETGVEPLCRIGRPARRR 308

Query: 1324 LDHSSSMGLGVLEPKMKVYTRESHKVK 1244
            LD S ++ L  LE  +  + +ES KVK
Sbjct: 309  LDPSVTIVL-ALECVVGKHVKESMKVK 334


>gb|ABR16126.1| unknown [Picea sitchensis]
          Length = 756

 Score = 71.2 bits (173), Expect = 2e-09
 Identities = 53/153 (34%), Positives = 67/153 (43%), Gaps = 8/153 (5%)
 Frame = -2

Query: 2284 TDVQPSLPPRKRLLASLDQTXXXXXXXXXXXXXXXLRNPNLS-SCHVCCSRITSNRGKDK 2108
            +D    LPPRKRLLA L Q                    N      VC S     RG   
Sbjct: 6    SDAGAFLPPRKRLLAGLKQNGWFCSDSEKNSENRKPEKANGEVDSPVCVS--CGARGGPT 63

Query: 2107 LQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFTEISD------VGISCVCCDRRVHGDCVS 1946
            L+++ +  R V +C +C   + S   C  CF +ISD      V +SC  C  RVH DCVS
Sbjct: 64   LESVKNGKRFVSVCKSCNCLLNSGGICCCCFRKISDDKGLLVVALSCCKCRHRVHCDCVS 123

Query: 1945 KYRG-LGLCSKSDSFTCIDCWVPKSLNGVPWGR 1850
            K  G   +CS S SF C+DC   K +     G+
Sbjct: 124  KNIGEEDVCSDSKSFVCVDCSPLKGIRDACGGK 156


>ref|XP_002318448.2| hypothetical protein POPTR_0012s02720g [Populus trichocarpa]
            gi|550326239|gb|EEE96668.2| hypothetical protein
            POPTR_0012s02720g [Populus trichocarpa]
          Length = 311

 Score = 70.1 bits (170), Expect = 5e-09
 Identities = 83/309 (26%), Positives = 121/309 (39%), Gaps = 7/309 (2%)
 Frame = -2

Query: 1936 GLGLCSKSDSFTCIDCWVPKSLNGVPWG--RNPNGSSKIVSGNCSVKISRASSLDDVAAK 1763
            GL + S        +CWVP S+     G  R+   +S  V G               + +
Sbjct: 30   GLRISSHKRLEILYNCWVPNSVASKRGGVCRDSKRNSGRVLGR--------------SLE 75

Query: 1762 DANSTAELKIXXXXXXXXXXXXXXXXXXXXXXXAKNALDLVA--AAVREDSQLKDSRLSG 1589
            DAN   + K+                       A+ ALD+VA    V+E++ +       
Sbjct: 76   DANCVVQEKVEAAVRARDLAVRKALEERNAADVARKALDMVANNGVVKENNDV------- 128

Query: 1588 GLAADDTKLAFLLHRTINSSPRISKNLGSMDLGNLVAPKLRKGNGYLLDRQSDHGSHSVH 1409
                DD +LAF LHR INSSPRIS NL  ++   L   +  +GNG    R SD  +    
Sbjct: 129  ----DDFELAFRLHRAINSSPRISSNLCMVNSSCLGVARRGEGNGQTRIRNSDFRNPIAC 184

Query: 1408 GELEVCTNNTMLENPDKVVSEPSVRIGSLDHSSSMGLGVLEPKMKVYTRESHKVKNFKKN 1229
            G+L            D  +S+      S+D       G+ + K++   ++        K 
Sbjct: 185  GKL------------DDFLSK------SVDVECRKSNGIGDGKIRPNAKKDGNAGKCSKM 226

Query: 1228 GEDAFMGNF--EKGLEHCYRKQEVLEHKVSSNSGGTHCQFPCDEDSSTPEKKKCHG-SDM 1058
            GE +F       +G +H            S NSG     F    +S TP+ K C G SD 
Sbjct: 227  GEQSFFSKLIDSRGNDH------------SVNSGSQ--SFRERNESMTPDDKSCKGKSDR 272

Query: 1057 YLRKYSKRR 1031
            YL KYS+R+
Sbjct: 273  YLLKYSRRK 281


>ref|XP_003571048.1| PREDICTED: uncharacterized protein LOC100843170 [Brachypodium
            distachyon]
          Length = 380

 Score = 62.0 bits (149), Expect = 1e-06
 Identities = 43/151 (28%), Positives = 62/151 (41%), Gaps = 11/151 (7%)
 Frame = -2

Query: 2308 ESAP--PPPSTDVQPSLPPRKRLLASLDQTXXXXXXXXXXXXXXXLRNPNLSSCHVCCSR 2135
            +SAP  P P+       PP + L     +                   P+L+ C  C +R
Sbjct: 3    QSAPSSPTPAPKADHPSPPSRLLSKHRPRRRAAPPRQTPPPPAPTRGQPDLNLCRCCGAR 62

Query: 2134 ITSNRGKDK---LQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFT----EISDVGISCVCC 1976
                    K   ++ L S WRIVLLC+ C   ++SA  CSYC +       D  ++C  C
Sbjct: 63   FPPPPPGAKPRPVRALRSVWRIVLLCSECLPLIRSAVVCSYCLSLDNLPPEDSSVTCRSC 122

Query: 1975 DRRVHGDCVSKYRGLGLCSKSD--SFTCIDC 1889
            +R VH  C+       L    D  +F C+DC
Sbjct: 123  NRCVHRHCIPSEHRTALIQPVDLENFVCVDC 153


>ref|XP_002438424.1| hypothetical protein SORBIDRAFT_10g018040 [Sorghum bicolor]
            gi|241916647|gb|EER89791.1| hypothetical protein
            SORBIDRAFT_10g018040 [Sorghum bicolor]
          Length = 383

 Score = 62.0 bits (149), Expect = 1e-06
 Identities = 38/105 (36%), Positives = 50/105 (47%), Gaps = 11/105 (10%)
 Frame = -2

Query: 2170 PNLSSCHVCCSRITS----NRGKDK-LQTLDSQWRIVLLCNNCFHGVQSAKNCSYCFT-- 2012
            P+LS CH C  R  +     R K + ++ L S WR+VLLC  C   V+SA  CSYC +  
Sbjct: 51   PDLSLCHCCGVRFPTPQPGTRPKRRPVRPLSSLWRVVLLCTECLSLVRSAAVCSYCLSLD 110

Query: 2011 --EISDVGISCVCCDRRVHGDCVSKYRGLGLCSKSD--SFTCIDC 1889
                 D  + C  C R VH  C+S      +    D   F C+DC
Sbjct: 111  NLPPEDSAVVCRHCKRCVHRSCISAEHRTTVIQPVDVEDFLCVDC 155

