BLASTX nr result

ID: Akebia23_contig00019762 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00019762
         (965 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN63610.1| hypothetical protein VITISV_000282 [Vitis vinifera]   207   5e-51
ref|XP_002279964.1| PREDICTED: uncharacterized protein LOC100255...   206   1e-50
ref|XP_007048820.1| Uncharacterized protein isoform 1 [Theobroma...   179   1e-42
ref|XP_002520376.1| conserved hypothetical protein [Ricinus comm...   150   7e-34
ref|XP_004306278.1| PREDICTED: uncharacterized protein LOC101297...   146   1e-32
gb|EXC08238.1| hypothetical protein L484_012692 [Morus notabilis]     143   1e-31
emb|CBI31771.3| unnamed protein product [Vitis vinifera]              136   6e-30
ref|XP_007217065.1| hypothetical protein PRUPE_ppa001053mg [Prun...   126   1e-26
ref|XP_006484829.1| PREDICTED: uncharacterized protein LOC102621...   118   3e-24
ref|XP_006437208.1| hypothetical protein CICLE_v10030533mg [Citr...   117   5e-24
ref|XP_007048821.1| Uncharacterized protein isoform 2 [Theobroma...   102   2e-19
ref|XP_006852688.1| hypothetical protein AMTR_s00021p00252680 [A...    89   3e-15
ref|XP_006586065.1| PREDICTED: uncharacterized protein LOC102665...    80   1e-12
ref|XP_006586066.1| PREDICTED: uncharacterized protein LOC102665...    77   8e-12
ref|XP_007141375.1| hypothetical protein PHAVU_008G1904000g, par...    69   2e-09
ref|XP_006575669.1| PREDICTED: uncharacterized protein LOC102669...    61   8e-07
ref|XP_004491380.1| PREDICTED: uncharacterized protein LOC101497...    59   2e-06
gb|EEE62684.1| hypothetical protein OsJ_17487 [Oryza sativa Japo...    58   5e-06
gb|EEC78687.1| hypothetical protein OsI_18829 [Oryza sativa Indi...    58   5e-06
ref|NP_001054890.1| Os05g0203900 [Oryza sativa Japonica Group] g...    58   5e-06

>emb|CAN63610.1| hypothetical protein VITISV_000282 [Vitis vinifera]
          Length = 1138

 Score =  207 bits (527), Expect = 5e-51
 Identities = 122/312 (39%), Positives = 178/312 (57%), Gaps = 23/312 (7%)
 Frame = +3

Query: 33   QHMFSNSSQT----------SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKE 182
            QHM  NS++           SAF+FPF   D  E+ +P  F N S++LPPWL+ A +QK+
Sbjct: 827  QHMHLNSAELRYNQGLHATKSAFEFPFMHPDYREHGQPSWFPNPSKSLPPWLIHAAQQKK 886

Query: 183  KSLALSQLHYDTSAEHYPFSVSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXX 362
             S+A S  + D   +H+  +VS  NF  +P    +P++SYP+ P  S +  Q        
Sbjct: 887  TSIASSLPYSDLDGKHHSCTVSQTNFITVPSVQQSPVLSYPYCPMKSQSQIQSSLGHSFV 946

Query: 363  XXXXXX------KCITSNTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRK 524
                        +  +S+   RNRIKVKD+MKSK   +K+ D+ K  +KR     ++S K
Sbjct: 947  HSPLIPVLPGFKQTSSSHVNYRNRIKVKDRMKSKSFFVKDSDYSKNTKKRPAAEANESPK 1006

Query: 525  VTRKPYLELQGDSNDVTGLGRREEFNVCIHYNTGSSEFDANKNNGVEI-------QKDGL 683
              +   LE++ +S+ VTGL     ++     N  + E +++++    I       QKD L
Sbjct: 1007 PPKLMTLEMREESSTVTGLNTVGNYSSEXQLNPVALELNSDRDQASSIGFTPSETQKDEL 1066

Query: 684  RTTSGVNFSKVDYVARSGPIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAMNHAGEVS 863
              + G++ +K+D V RSGP+KLSAGAKHILKPS+N+D D+SRPTHSTIPF A+  +G VS
Sbjct: 1067 ANSPGIDAAKLDGVTRSGPVKLSAGAKHILKPSQNMDHDSSRPTHSTIPFAAVTDSGRVS 1126

Query: 864  DFQKKSANIYRF 899
              QKK+A IYRF
Sbjct: 1127 GPQKKTAKIYRF 1138


>ref|XP_002279964.1| PREDICTED: uncharacterized protein LOC100255858 [Vitis vinifera]
          Length = 1000

 Score =  206 bits (523), Expect = 1e-50
 Identities = 124/312 (39%), Positives = 176/312 (56%), Gaps = 23/312 (7%)
 Frame = +3

Query: 33   QHMFSNSSQT----------SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKE 182
            QHM  NS++           SAF+FPF   D  E+ +P  F N S++LPPWL+ A +QK+
Sbjct: 689  QHMHLNSAELRYNQGLHATKSAFEFPFMHPDYREHGQPSWFPNPSKSLPPWLIHAAQQKK 748

Query: 183  KSLALSQLHYDTSAEHYPFSVSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXX 362
             S+A S  + D   +H+  +VS  NF  +P    +P++SYP+ P  S +  Q        
Sbjct: 749  TSIASSLPYSDLDGKHHSCTVSQTNFITVPSVQQSPVLSYPYCPMKSQSQIQSSLGHSFV 808

Query: 363  XXXXXX------KCITSNTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRK 524
                        +  +S+   RNRIKVKD+MKSK   +K+ D+ K  +KR     ++S K
Sbjct: 809  HSPLIPVLPGFKQTSSSHVNYRNRIKVKDRMKSKSFFVKDSDYSKNTKKRPAAEANESPK 868

Query: 525  VTRKPYLELQGDSNDVTGLGR------REEFN-VCIHYNTGSSEFDANKNNGVEIQKDGL 683
              +   LE++ +S+ VTGL         E+ N V +  N+   +  +      E QKD L
Sbjct: 869  PPKLMTLEMREESSTVTGLNTVGNYSSEEQLNPVALELNSDRDQASSIGFTPSETQKDEL 928

Query: 684  RTTSGVNFSKVDYVARSGPIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAMNHAGEVS 863
              + G++ SK+D V RSGP+KLSAGAKHILKPS+N+D D+SRPTHSTIPF A+  +  VS
Sbjct: 929  ANSPGIDASKLDGVTRSGPVKLSAGAKHILKPSQNMDHDSSRPTHSTIPFAAVTDSDRVS 988

Query: 864  DFQKKSANIYRF 899
              QKK+A IYRF
Sbjct: 989  GPQKKTAKIYRF 1000


>ref|XP_007048820.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508701081|gb|EOX92977.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1273

 Score =  179 bits (454), Expect = 1e-42
 Identities = 113/306 (36%), Positives = 158/306 (51%), Gaps = 17/306 (5%)
 Frame = +3

Query: 33   QHMFSNSSQT---SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQ 203
            +H +  + Q    S+F FPF   D GE+V+P  F+ SS++L PWLL AT+Q +     SQ
Sbjct: 970  EHKYKQNLQNAVKSSFNFPFLHPDQGEHVQPSWFRGSSKSLIPWLLQATQQVKAPCTPSQ 1029

Query: 204  LHYDTSAEHYPFSVSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXXXXXXXXK 383
               D     +P ++   +F   P+  H P++SY HNP  S +H +               
Sbjct: 1030 PFPDEGGRRHPHTMQ-TSFLTNPLVPHLPIVSYDHNPMISHSHMESPVGQPYIAHSPLIP 1088

Query: 384  CITS-------NTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPY 542
             +         N   RNRIK KD+MK K + +++ D  +K RKR    +D   K  + P 
Sbjct: 1089 ALPGIKPSSPVNMSHRNRIKFKDRMKLKSVGIQDPDICRKTRKRPRAKEDCPMKPIKIPS 1148

Query: 543  LELQGDSNDVTGLGRREEFNVCIHYNTGSSEFDANKNNGV-------EIQKDGLRTTSGV 701
            L +Q  S   T    RE F   I  N GS E D  ++          E + +G   ++ +
Sbjct: 1149 LGIQDKSRAATR-STRENFFDDIQCNMGSLEIDPYRDEAGLVGWIPNEPRCNGFGASAVI 1207

Query: 702  NFSKVDYVARSGPIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAMNHAGEVSDFQKKS 881
            + SK+D V R GPIKL AG KHILKPS+N+DQDNSR  HSTIPF ++   G + + QKKS
Sbjct: 1208 DSSKIDGVTRPGPIKLGAGVKHILKPSQNVDQDNSRLIHSTIPFASVTDCGNILETQKKS 1267

Query: 882  ANIYRF 899
              IYRF
Sbjct: 1268 TKIYRF 1273


>ref|XP_002520376.1| conserved hypothetical protein [Ricinus communis]
            gi|223540423|gb|EEF41992.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 883

 Score =  150 bits (379), Expect = 7e-34
 Identities = 103/298 (34%), Positives = 156/298 (52%), Gaps = 15/298 (5%)
 Frame = +3

Query: 51   SSQTSAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHY-DTSAE 227
            +++ SAF FPF   D  E+ +   F  SS+NLPPW + A+  + K+  ++  ++ D    
Sbjct: 590  NARKSAFDFPFLHPDYNEHNQSSSFPTSSKNLPPWSVHASPLQVKTGDMASKNFSDVGCT 649

Query: 228  HYPFSVSGANFSAIPIPHHTPLISYPHNP-------NTSLNHFQXXXXXXXXXXXXXXKC 386
            H+P   SG NF   P+ +H+ ++S PH+         +SL                    
Sbjct: 650  HHPSCTSGTNFLT-PL-YHSSVVSDPHSSVISGPPLRSSLGSVLFIEPPGFPFSTGVHSN 707

Query: 387  ITSNTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDSN 566
             + +   R++I ++++MKS  + +K  DH +K +KR   I   S + T+ P L +Q D +
Sbjct: 708  SSIDMSYRDKIIIQERMKSNSLGVKVPDHCQKIKKRPAAISSGSLRPTKMPNL-MQEDLS 766

Query: 567  DVTGLGRREEFNVCIHYNTGSSEFDANKNNGV-------EIQKDGLRTTSGVNFSKVDYV 725
             VT L  RE  +  I  N  + E  +  + G+       E QK+GL T+S  +FSK D +
Sbjct: 767  AVTEL-TRETSSSEIWQNIAAYEARSKGDKGIGLGCCSYEAQKNGLGTSSDSDFSKADTL 825

Query: 726  ARSGPIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAMNHAGEVSDFQKKSANIYRF 899
             + GP+KL+AGAKHILKPS+ +D DNSR  HSTIPF          D Q+KS  IYRF
Sbjct: 826  TKPGPMKLTAGAKHILKPSQKMDHDNSRLIHSTIPFPGATDCESFLDSQRKSTMIYRF 883


>ref|XP_004306278.1| PREDICTED: uncharacterized protein LOC101297951 [Fragaria vesca
            subsp. vesca]
          Length = 1218

 Score =  146 bits (369), Expect = 1e-32
 Identities = 100/287 (34%), Positives = 151/287 (52%), Gaps = 8/287 (2%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQN--LPPWLLDATKQKEKSLALSQLHYDTSAEHYP 236
            SAF FPF + +  E V+P  FQNSS    LPPWLL AT ++ + +   Q+  +T+++H  
Sbjct: 937  SAFDFPFLNPECQENVRPSWFQNSSSKAGLPPWLLHATLERNRPITAPQVSPNTASKHLH 996

Query: 237  FSVSGANFSAIPIPHHTPLISYPHNPNTS-LNHFQXXXXXXXXXXXXXXKCITSNTRRRN 413
              +   N    P  H++  IS+      S +                      ++   RN
Sbjct: 997  HIIPRKNICNAPYLHYSSEISHSQEVKRSPVPQNAVQPQGVQVIPEENPSSAMTDMSYRN 1056

Query: 414  RIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDSNDVTGLGRRE 593
            R+ +KD+MK K I +K+L   K+ ++       DS  +     LE+Q  S+ V GL    
Sbjct: 1057 RMDLKDRMKPKNIGIKDLYPCKRIKRSAV----DSTNLPNIIDLEVQEKSSVVAGLSSNG 1112

Query: 594  EFNVC-IHYNTGSSEFDAN----KNNGVEIQKDGLRTTSGVNFSKVDYVARSGPIKLSAG 758
            +FN+  +  N  + + +++    K++    QKDG  +      SK+D V RSGPIKLSAG
Sbjct: 1113 DFNIDEMQSNFRALDLESSRKQVKDSECITQKDGFESLC-TETSKMDDVGRSGPIKLSAG 1171

Query: 759  AKHILKPSENLDQDNSRPTHSTIPFIAMNHAGEVSDFQKKSANIYRF 899
            AKHI+KP+ N+DQDN RP HSTIPF+A+ +     + QK+S  IY+F
Sbjct: 1172 AKHIIKPTPNVDQDNCRPIHSTIPFVAVTNDYTGPEPQKRSTKIYKF 1218


>gb|EXC08238.1| hypothetical protein L484_012692 [Morus notabilis]
          Length = 1240

 Score =  143 bits (360), Expect = 1e-31
 Identities = 112/320 (35%), Positives = 150/320 (46%), Gaps = 27/320 (8%)
 Frame = +3

Query: 21   QNRCQHMFSNSSQTS-----------AFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDA 167
            QN C+H     ++ S            F+FPF + D    V+   F+NS ++LPPWLL A
Sbjct: 925  QNSCEHGHWRPAELSHRQNLPHFTDPGFEFPFLNPDSRVNVQSSWFENS-KSLPPWLLHA 983

Query: 168  TKQKEKSLALSQLHYDTSAEHYPFSVSGANFSAIP-IPHHTPLISYPHNP-------NTS 323
             +Q +  +  SQ     +++++    S  N    P I H     SYP  P       N+S
Sbjct: 984  KQQGKTPMISSQQGPIAASKNHQHISSRTNILNRPSIYHSAEACSYPCGPGTLHSQMNSS 1043

Query: 324  LNHFQXXXXXXXXXXXXXXKCITSNTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFET 503
            L                       NT  RNR+KVK+++KSK   +K+L   KK  KR  T
Sbjct: 1044 LGSATIVIPPLGPIISRVKPASAMNTGYRNRMKVKERLKSKAFGVKDLYPCKKTNKRLVT 1103

Query: 504  IKDDSRKVTRKPYLELQGDSNDVTGLGRREEFNVCIHYNTGSSEFDANKNNGVEIQKDGL 683
               D  K TR   LE Q      + L R    N+           D + N   +I  +  
Sbjct: 1104 KSLDLVKPTRILNLEKQ---EKFSALARCSAQNLYSEMQRDIVGDDLHSNRVKDIGLECQ 1160

Query: 684  RTTS--------GVNFSKVDYVARSGPIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIA 839
            RT +        G   S+VD +ARSGPIKLSAGAKHILKP++N+D DN  P HSTIPF A
Sbjct: 1161 RTETQDFGIGIAGNESSRVDIMARSGPIKLSAGAKHILKPNQNMDLDNFMPIHSTIPFAA 1220

Query: 840  MNHAGEVSDFQKKSANIYRF 899
              +   V + QKK+A IYRF
Sbjct: 1221 ATNVSMVPESQKKAAKIYRF 1240


>emb|CBI31771.3| unnamed protein product [Vitis vinifera]
          Length = 1244

 Score =  136 bits (342), Expect(2) = 6e-30
 Identities = 89/262 (33%), Positives = 135/262 (51%), Gaps = 23/262 (8%)
 Frame = +3

Query: 33   QHMFSNSSQT----------SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKE 182
            QHM  NS++           SAF+FPF   D  E+ +P  F N S++LPPWL+ A +QK+
Sbjct: 913  QHMHLNSAELRYNQGLHATKSAFEFPFMHPDYREHGQPSWFPNPSKSLPPWLIHAAQQKK 972

Query: 183  KSLALSQLHYDTSAEHYPFSVSGANFSAIPIPHHTPLISYPHNPNTSLNHFQ------XX 344
             S+A S  + D   +H+  +VS  NF  +P    +P++SYP+ P  S +  Q        
Sbjct: 973  TSIASSLPYSDLDGKHHSCTVSQTNFITVPSVQQSPVLSYPYCPMKSQSQIQSSLGHSFV 1032

Query: 345  XXXXXXXXXXXXKCITSNTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRK 524
                        +  +S+   RNRIKVKD+MKSK   +K+ D+ K  +KR     ++S K
Sbjct: 1033 HSPLIPVLPGFKQTSSSHVNYRNRIKVKDRMKSKSFFVKDSDYSKNTKKRPAAEANESPK 1092

Query: 525  VTRKPYLELQGDSNDVTGL------GRREEFN-VCIHYNTGSSEFDANKNNGVEIQKDGL 683
              +   LE++ +S+ VTGL         E+ N V +  N+   +  +      E QKD L
Sbjct: 1093 PPKLMTLEMREESSTVTGLNTVGNYSSEEQLNPVALELNSDRDQASSIGFTPSETQKDEL 1152

Query: 684  RTTSGVNFSKVDYVARSGPIKL 749
              + G++ SK+D V RSGP+KL
Sbjct: 1153 ANSPGIDASKLDGVTRSGPVKL 1174



 Score = 22.3 bits (46), Expect(2) = 6e-30
 Identities = 8/15 (53%), Positives = 11/15 (73%)
 Frame = +1

Query: 853  VKFQTFRRNQQTFTG 897
            ++FQ  R+ QQ FTG
Sbjct: 1174 LRFQDLRKKQQRFTG 1188


>ref|XP_007217065.1| hypothetical protein PRUPE_ppa001053mg [Prunus persica]
            gi|462413215|gb|EMJ18264.1| hypothetical protein
            PRUPE_ppa001053mg [Prunus persica]
          Length = 923

 Score =  126 bits (316), Expect = 1e-26
 Identities = 95/286 (33%), Positives = 136/286 (47%), Gaps = 7/286 (2%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHYDTSAEHYPFS 242
            SAF FPF + +  E V+   FQ+SS+ LPPWLL AT Q +     SQ   D   +++   
Sbjct: 651  SAFGFPFLNPECRENVQSPWFQSSSKGLPPWLLHATLQGKPPNTASQSFPDVGRKNHHHI 710

Query: 243  VSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXXXXXXXXKCITSNTRRRNRIK 422
            +  ++    P  HH+   SYP N  T  +                    T   ++     
Sbjct: 711  MPRSDIFTAPFMHHSSEFSYPCNLMTYHSQVMSSPSPATTFLPPHAPANTGGNQKA---- 766

Query: 423  VKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDSNDVTGLGRREEFN 602
                     I++   +  KK  KR      DS        LE+Q   + V G  R   F+
Sbjct: 767  ------MSAINMGYRNRTKKT-KRLAVKAVDSTIPPNTFNLEMQEKLSAVAGSSRGNFFS 819

Query: 603  VCIHYNTGSSEFDANKNNGV-------EIQKDGLRTTSGVNFSKVDYVARSGPIKLSAGA 761
              +   + + + D+++           EIQ+DG  +  G+  SKVD + +SGPIKL AGA
Sbjct: 820  E-MQSTSRALDVDSSRTKASDLGCSLHEIQEDGFGSF-GIESSKVDGMVKSGPIKLCAGA 877

Query: 762  KHILKPSENLDQDNSRPTHSTIPFIAMNHAGEVSDFQKKSANIYRF 899
            KHILKP++N+DQD SRP HSTIPF+A+ +     + QKKS  IYRF
Sbjct: 878  KHILKPTQNVDQDISRPIHSTIPFVAVPNGCREPEPQKKSTKIYRF 923


>ref|XP_006484829.1| PREDICTED: uncharacterized protein LOC102621698 [Citrus sinensis]
          Length = 1276

 Score =  118 bits (296), Expect = 3e-24
 Identities = 101/333 (30%), Positives = 143/333 (42%), Gaps = 54/333 (16%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHYDTSAEHYPFS 242
            SAF FPF   D  E V+   F++S++ L PWL  AT Q +  L   Q   D    + P S
Sbjct: 947  SAFNFPFLHPDCSEPVQQSWFRSSTERLLPWL-HATPQVKTPLVPCQTFGDIDGRYPPHS 1005

Query: 243  VSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXXXXXXXXKCITS-------NT 401
                N  A P  H  P +  P +   S +H Q                 +        + 
Sbjct: 1006 TE-MNLLATPSVHR-PNVVCPSHAIISQSHMQSSVGPASVVHPSSVPAFSDIRPTSSIDM 1063

Query: 402  RRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDSNDVTGL 581
               NR  V D+M+SK   +  LDH +K +KR  + ++D  K T+K  L +  D + VT L
Sbjct: 1064 SYGNRNMVIDRMQSKGCGIIGLDHCQKMKKRHASKENDRTKPTKKLNLGIPEDLSVVTEL 1123

Query: 582  GRRE---EFNVCIHYNTGSSEFDA--------------------------NKNNGV---- 662
             R +       C   +  +S  D                           N+ NG     
Sbjct: 1124 TREDYQRNNQFCARASQCNSNGDQAGSSLCGPLEKQEDASATARLARHNLNQENGQFELY 1183

Query: 663  --------------EIQKDGLRTTSGVNFSKVDYVARSGPIKLSAGAKHILKPSENLDQD 800
                          + Q  G+  ++G++ S +D + RSGP+KLSAGAKHI KPS+N+D D
Sbjct: 1184 SDSAEASAVGCVSNDSQSIGVGASAGIDSSNLDSMGRSGPVKLSAGAKHIFKPSQNIDLD 1243

Query: 801  NSRPTHSTIPFIAMNHAGEVSDFQKKSANIYRF 899
            +SR  H TIPF AM  +    + QKK+  IYRF
Sbjct: 1244 DSRLVHLTIPFAAMPDSNIFLESQKKTTKIYRF 1276


>ref|XP_006437208.1| hypothetical protein CICLE_v10030533mg [Citrus clementina]
            gi|557539404|gb|ESR50448.1| hypothetical protein
            CICLE_v10030533mg [Citrus clementina]
          Length = 1260

 Score =  117 bits (294), Expect = 5e-24
 Identities = 101/333 (30%), Positives = 143/333 (42%), Gaps = 54/333 (16%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHYDTSAEHYPFS 242
            SAF FPF   D  E V+   F++S++ L PWL  AT Q +  L   Q   D    + P S
Sbjct: 931  SAFNFPFLHPDCSEPVQQSWFRSSTERLLPWL-HATPQVKTPLVPCQTFGDIDGRYPPHS 989

Query: 243  VSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXXXXXXXXKCITS-------NT 401
                N  A P  H  P +  P +   S +H Q                 +        + 
Sbjct: 990  TE-MNLLATPSVHR-PNVVCPSHAIISQSHMQSSVGPASVVHPSSVPAFSDIRPTSSIDM 1047

Query: 402  RRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDSNDVTGL 581
               NR  V D+M+SK   +  LDH +K +KR  + ++D  K T+K  L +  D + VT L
Sbjct: 1048 SYGNRNMVIDRMQSKGCGIIGLDHCQKMKKRRASKENDRTKPTKKLNLGIPEDLSVVTEL 1107

Query: 582  GRRE---EFNVCIHYNTGSSEFDA--------------------------NKNNGV---- 662
             R +       C   +  +S  D                           N+ NG     
Sbjct: 1108 TREDYQRNNQFCARASQCNSNGDRAGSSLCGPLEKQEDASATARLARHYLNQENGQFELY 1167

Query: 663  --------------EIQKDGLRTTSGVNFSKVDYVARSGPIKLSAGAKHILKPSENLDQD 800
                          + Q  G+  ++G++ S +D + RSGP+KLSAGAKHI KPS+N+D D
Sbjct: 1168 SDSAEASAVGCVSNDSQSIGVGASAGIDSSNLDSMGRSGPVKLSAGAKHIFKPSQNIDLD 1227

Query: 801  NSRPTHSTIPFIAMNHAGEVSDFQKKSANIYRF 899
            +SR  H TIPF AM  +    + QKK+  IYRF
Sbjct: 1228 DSRLVHLTIPFAAMPDSNIFLESQKKTTKIYRF 1260


>ref|XP_007048821.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508701082|gb|EOX92978.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1214

 Score =  102 bits (254), Expect = 2e-19
 Identities = 70/213 (32%), Positives = 100/213 (46%), Gaps = 10/213 (4%)
 Frame = +3

Query: 33   QHMFSNSSQT---SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQ 203
            +H +  + Q    S+F FPF   D GE+V+P  F+ SS++L PWLL AT+Q +     SQ
Sbjct: 970  EHKYKQNLQNAVKSSFNFPFLHPDQGEHVQPSWFRGSSKSLIPWLLQATQQVKAPCTPSQ 1029

Query: 204  LHYDTSAEHYPFSVSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXXXXXXXXK 383
               D     +P ++   +F   P+  H P++SY HNP  S +H +               
Sbjct: 1030 PFPDEGGRRHPHTMQ-TSFLTNPLVPHLPIVSYDHNPMISHSHMESPVGQPYIAHSPLIP 1088

Query: 384  CITS-------NTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPY 542
             +         N   RNRIK KD+MK K + +++ D  +K RKR    +D   K  + P 
Sbjct: 1089 ALPGIKPSSPVNMSHRNRIKFKDRMKLKSVGIQDPDICRKTRKRPRAKEDCPMKPIKIPS 1148

Query: 543  LELQGDSNDVTGLGRREEFNVCIHYNTGSSEFD 641
            L +Q  S   T    RE F   I  N GS E D
Sbjct: 1149 LGIQDKSRAAT-RSTRENFFDDIQCNMGSLEID 1180


>ref|XP_006852688.1| hypothetical protein AMTR_s00021p00252680 [Amborella trichopoda]
            gi|548856299|gb|ERN14155.1| hypothetical protein
            AMTR_s00021p00252680 [Amborella trichopoda]
          Length = 931

 Score = 88.6 bits (218), Expect = 3e-15
 Identities = 82/292 (28%), Positives = 128/292 (43%), Gaps = 14/292 (4%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRF-QNSSQNLPPWLLDATKQKEKSLALSQ-------LHYDT 218
            S  +F F ++D     +P     + +  +P WL+ ATKQ    ++ +          Y T
Sbjct: 670  SGSRFHFLNKDSQNRTQPTWTPSHPAPTMPEWLMKATKQNPAFISSTSQPCSIPIFGYRT 729

Query: 219  SAEHYPFSVSGANFSAIPIPHHTPLIS-----YPHNPNTSLNHFQXXXXXXXXXXXXXXK 383
                Y F+ S  NF     P H P +S     YP   N+  N  Q               
Sbjct: 730  FPAPY-FAYSDTNF-----PTHKPNVSHSSLHYPLAQNSQFNPLQFQGCSGFKS------ 777

Query: 384  CITSNTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGD- 560
              TS T   + IK+K+   +   + K+ D  KK++KR      +  K  +K  +++Q + 
Sbjct: 778  --TSETNATS-IKIKETTNNAISNFKDFDQTKKSKKRPLENDIERSKHMKKQTIQVQEEG 834

Query: 561  SNDVTGLGRREEFNVCIHYNTGSSEFDANKNNGVEIQKDGLRTTSGVNFSKVDYVARSGP 740
            +  +TG     +  +   ++TG +E               LR    +N  +    +R+GP
Sbjct: 835  AKPLTGSKNSCKHRMLERFSTGENE-------------KALRIGGQMNCFQP---SRTGP 878

Query: 741  IKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAMNHAGEVSDFQKKSANIYR 896
            +KLSAGAKHILKP ++   D+SRPTH TIPF+         + Q K+A IYR
Sbjct: 879  VKLSAGAKHILKPRQSAYHDHSRPTHCTIPFVVEAGPRRDLETQNKTAKIYR 930


>ref|XP_006586065.1| PREDICTED: uncharacterized protein LOC102665055 isoform X1 [Glycine
            max]
          Length = 1348

 Score = 80.1 bits (196), Expect = 1e-12
 Identities = 84/292 (28%), Positives = 121/292 (41%), Gaps = 15/292 (5%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHYDTSAEHYPFS 242
            SAF+FPF      E+ K   F+ S  +L PW L +T ++    + S     TS  H  + 
Sbjct: 1077 SAFEFPFLHPIVEEHAKSSWFKRSYSSLAPWPLSSTHERPPGSSSS-----TSFPHNTWR 1131

Query: 243  VSGANFSAIPIPHHTPLISYP------HNPN------TSLNHFQXXXXXXXXXXXXXXKC 386
                N S I   +H+  + YP      H P       TS  H                K 
Sbjct: 1132 ----NDSTISSVNHSSDVLYPPTSVATHCPRKTIRYPTSTAHPHPHSRSPYASVSPIIKP 1187

Query: 387  ITS-NTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDS 563
             ++ ++  RN  KV D++K   +  +       +RKR  T   D  K  + P +E++   
Sbjct: 1188 PSAIDSGCRNLFKVTDRVKFDDMSARVHHPCTNSRKRPSTSFVDPAKPNKLPNIEVK--- 1244

Query: 564  NDVTGLGRREEFNVCIHYNTGSSEFDANKNN-GVEIQKDGLRTTS-GVNFSKVDYVARSG 737
             ++TG+         +  NT   E D      G   Q +       G +   +D V  SG
Sbjct: 1245 KNLTGMA--------VQQNTREVELDLQVGAIGKCCQNEAQNLNPRGFDSFNLDGVVISG 1296

Query: 738  PIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAMNHAGEVSDFQKKSANIY 893
            P+KLS GAKHILKPS+N  QDNS P HS IP       G+  + Q     +Y
Sbjct: 1297 PVKLSPGAKHILKPSQN--QDNSTPVHSAIPIAVTTDCGKDLELQGNLTKMY 1346


>ref|XP_006586066.1| PREDICTED: uncharacterized protein LOC102665055 isoform X2 [Glycine
            max]
          Length = 1334

 Score = 77.4 bits (189), Expect = 8e-12
 Identities = 82/279 (29%), Positives = 116/279 (41%), Gaps = 15/279 (5%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHYDTSAEHYPFS 242
            SAF+FPF      E+ K   F+ S  +L PW L +T ++    + S     TS  H  + 
Sbjct: 1077 SAFEFPFLHPIVEEHAKSSWFKRSYSSLAPWPLSSTHERPPGSSSS-----TSFPHNTWR 1131

Query: 243  VSGANFSAIPIPHHTPLISYP------HNPN------TSLNHFQXXXXXXXXXXXXXXKC 386
                N S I   +H+  + YP      H P       TS  H                K 
Sbjct: 1132 ----NDSTISSVNHSSDVLYPPTSVATHCPRKTIRYPTSTAHPHPHSRSPYASVSPIIKP 1187

Query: 387  ITS-NTRRRNRIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDS 563
             ++ ++  RN  KV D++K   +  +       +RKR  T   D  K  + P +E++   
Sbjct: 1188 PSAIDSGCRNLFKVTDRVKFDDMSARVHHPCTNSRKRPSTSFVDPAKPNKLPNIEVK--- 1244

Query: 564  NDVTGLGRREEFNVCIHYNTGSSEFDANKNN-GVEIQKDGLRTTS-GVNFSKVDYVARSG 737
             ++TG+         +  NT   E D      G   Q +       G +   +D V  SG
Sbjct: 1245 KNLTGMA--------VQQNTREVELDLQVGAIGKCCQNEAQNLNPRGFDSFNLDGVVISG 1296

Query: 738  PIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAMNHAG 854
            P+KLS GAKHILKPS+N  QDNS P HS IP       G
Sbjct: 1297 PVKLSPGAKHILKPSQN--QDNSTPVHSAIPIAVTTDCG 1333


>ref|XP_007141375.1| hypothetical protein PHAVU_008G1904000g, partial [Phaseolus vulgaris]
            gi|561014508|gb|ESW13369.1| hypothetical protein
            PHAVU_008G1904000g, partial [Phaseolus vulgaris]
          Length = 958

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 75/252 (29%), Positives = 106/252 (42%), Gaps = 12/252 (4%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHYDTSAEHYPFS 242
            SAF FPF      E  K   FQ   ++ P WL  +T +       SQ    TS++ +P +
Sbjct: 715  SAFGFPFLQSTVNEQAKTSWFQRPYRSSPSWLSSSTDEMLPG-TFSQQFSGTSSQSFPQN 773

Query: 243  VSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXXXXXXXXKCITS---NTRRRN 413
            + G N     + +   L  +P N  TSL   Q                +T    N+  RN
Sbjct: 774  LWGNNLITQSVDYSAEL-RFPSNHLTSLRPMQTAPLSPASVVQPLHVPVTPSTINSGNRN 832

Query: 414  RIKVKDKMKSKYIHLKNLDHVKKARKRFETIK--DDSRKVTRKPYLELQGDSNDVTGLGR 587
               V D++K    H         ARKR   +   DDSRK T+ P +++Q +   VT L  
Sbjct: 833  INNVADRLKFDEHH-----PCTNARKRPTAVANLDDSRKHTKLPNIQVQENFCRVTRL-T 886

Query: 588  REEFNVCIHYNTGSSEFDANKNNG------VEIQKDGLRTTSGVNFSKVDYVARSGPIKL 749
             E+ +V +  NT + E D    +        E Q         VN  K+D +  SGP++L
Sbjct: 887  GEKSSVELQRNTRAPELDPQMGSARSRCCQHEAQNLNPSRYPAVNSFKLDGMVTSGPVRL 946

Query: 750  -SAGAKHILKPS 782
             S  AKHILK S
Sbjct: 947  GSKRAKHILKSS 958


>ref|XP_006575669.1| PREDICTED: uncharacterized protein LOC102669110 isoform X1 [Glycine
            max]
          Length = 1376

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 66/250 (26%), Positives = 106/250 (42%), Gaps = 10/250 (4%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHYDTSAEHYPFS 242
            SAF FPF      E  K   F +  ++ P WL  +T +        Q     S +++P +
Sbjct: 1137 SAFGFPFLQPTVNEQAKTSWFHSPYRSSPSWLSSSTDEMLPGTFSRQFSASIS-QNFPQN 1195

Query: 243  VSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXXXXXXXXKCITS---NTRRRN 413
            + G NF+  P  +H+  + +P N  TSL   Q                +T    N+  RN
Sbjct: 1196 LWGNNFTT-PFVNHSAEVRFPSNHLTSLGPMQTTPLSPSSIVHPLHFPVTPSTINSGNRN 1254

Query: 414  RIKVKDKMKSKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDSNDVTGLGRRE 593
              KV D++K    HL    + +K       + D     +R P +++Q + + +T L   E
Sbjct: 1255 INKVADRLKFDEHHL--CANTRKNPAAAANLDD-----SRLPNIQVQENLSRMTRL-PEE 1306

Query: 594  EFNVCIHYNTGSSEFDANKNNG------VEIQKDGLRTTSGVNFSKVDYVARSGPIKLS- 752
            + +V +  NT + E D +  +        E Q     +   VN  K+D +  SGP++L  
Sbjct: 1307 KSSVELQRNTRALELDPHMGSARSRCCQHEAQNQNPGSYPAVNSFKLDSMVTSGPVRLGP 1366

Query: 753  AGAKHILKPS 782
              AKHIL  S
Sbjct: 1367 KRAKHILNSS 1376


>ref|XP_004491380.1| PREDICTED: uncharacterized protein LOC101497686 [Cicer arietinum]
          Length = 1343

 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 70/255 (27%), Positives = 101/255 (39%), Gaps = 15/255 (5%)
 Frame = +3

Query: 63   SAFQFPFSSQDGGEYVKPLRFQNSSQNLPPWLLDATKQKEKSLALSQLHYDTSAEHYPFS 242
            SAF FPF      E  K   FQ   +++P WL  +T     +   SQ       + +P +
Sbjct: 1096 SAFGFPFLQPADNELAKTYWFQGPYRSVPSWLSSST-DDTLTRTYSQQLSGVGNQSFPQN 1154

Query: 243  VSGANFSAIPIPHHTPLISYPHNPNTSLNHFQXXXXXXXXXXXXXXKCITSNTRR---RN 413
              G NF++  + H   ++ YP NP TS    +                +T +T     RN
Sbjct: 1155 RWGNNFTSPSVTHSADVL-YPSNPLTSRGPMKTTLLCPASIVQPPQVSVTPSTMNSGCRN 1213

Query: 414  RIKVKDKMK-SKYIHLKNLDHVKKARKRFETIKDDSRKVTRKPYLELQGDSNDVTGLGRR 590
              K  D +     + +K+       RKR     DDSRK  +   +E+  + + VT L  R
Sbjct: 1214 INKAVDGVNLDDNMVVKDYHPCTNIRKRPADNLDDSRKPIKISNIEVNENLSHVTRL-TR 1272

Query: 591  EEFNVCIHYNTGSSEFDANKNNGVEIQKDGLRTTSGVNFSKVDYVA----------RSGP 740
            E  +V +  N  +   D      VEI +         N S   Y A          RSGP
Sbjct: 1273 ENSSVELQRNKRAVIIDPQ----VEIARSRCCQNVAQNLSSTSYPAVDSFNLDGTIRSGP 1328

Query: 741  IKLS-AGAKHILKPS 782
            ++L    AKHILK S
Sbjct: 1329 VRLGPKRAKHILKSS 1343


>gb|EEE62684.1| hypothetical protein OsJ_17487 [Oryza sativa Japonica Group]
          Length = 784

 Score = 58.2 bits (139), Expect = 5e-06
 Identities = 28/73 (38%), Positives = 41/73 (56%)
 Frame = +3

Query: 663 EIQKDGLRTTSGVNFSKVDYVARSGPIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAM 842
           +I  D    +SGV      Y++RSGP+KL  GAKH+L+P ++ D  N  P +S +PF  +
Sbjct: 713 KIDVDNSNISSGVR-----YISRSGPVKLRPGAKHVLEPRQDTDDGNYPPMYSCVPFFVI 767

Query: 843 NHAGEVSDFQKKS 881
              G +   Q KS
Sbjct: 768 RRGGNILSGQTKS 780


>gb|EEC78687.1| hypothetical protein OsI_18829 [Oryza sativa Indica Group]
          Length = 784

 Score = 58.2 bits (139), Expect = 5e-06
 Identities = 28/73 (38%), Positives = 41/73 (56%)
 Frame = +3

Query: 663 EIQKDGLRTTSGVNFSKVDYVARSGPIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAM 842
           +I  D    +SGV      Y++RSGP+KL  GAKH+L+P ++ D  N  P +S +PF  +
Sbjct: 713 KIDVDNSNISSGVR-----YISRSGPVKLRPGAKHVLEPRQDTDDGNYPPMYSCVPFFVI 767

Query: 843 NHAGEVSDFQKKS 881
              G +   Q KS
Sbjct: 768 RRGGNILSGQTKS 780


>ref|NP_001054890.1| Os05g0203900 [Oryza sativa Japonica Group]
           gi|50878340|gb|AAT85115.1| unknown protein [Oryza sativa
           Japonica Group] gi|113578441|dbj|BAF16804.1|
           Os05g0203900 [Oryza sativa Japonica Group]
           gi|215697470|dbj|BAG91464.1| unnamed protein product
           [Oryza sativa Japonica Group]
          Length = 755

 Score = 58.2 bits (139), Expect = 5e-06
 Identities = 28/73 (38%), Positives = 41/73 (56%)
 Frame = +3

Query: 663 EIQKDGLRTTSGVNFSKVDYVARSGPIKLSAGAKHILKPSENLDQDNSRPTHSTIPFIAM 842
           +I  D    +SGV      Y++RSGP+KL  GAKH+L+P ++ D  N  P +S +PF  +
Sbjct: 684 KIDVDNSNISSGVR-----YISRSGPVKLRPGAKHVLEPRQDTDDGNYPPMYSCVPFFVI 738

Query: 843 NHAGEVSDFQKKS 881
              G +   Q KS
Sbjct: 739 RRGGNILSGQTKS 751

