BLASTX nr result

ID: Akebia23_contig00038114 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00038114
         (1256 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268782.2| PREDICTED: uncharacterized protein LOC100263...   313   8e-83
emb|CAN82741.1| hypothetical protein VITISV_026165 [Vitis vinifera]   313   8e-83
ref|XP_004155679.1| PREDICTED: uncharacterized LOC101218930 [Cuc...   307   7e-81
ref|XP_004142553.1| PREDICTED: uncharacterized protein LOC101218...   304   5e-80
ref|XP_007208065.1| hypothetical protein PRUPE_ppa001825mg [Prun...   299   2e-78
gb|EXC21757.1| hypothetical protein L484_006471 [Morus notabilis]     287   6e-75
ref|NP_001189804.1| uncharacterized protein [Arabidopsis thalian...   277   6e-72
ref|NP_187066.1| uncharacterized protein [Arabidopsis thaliana] ...   277   6e-72
ref|XP_006848026.1| hypothetical protein AMTR_s00029p00178860 [A...   271   4e-70
ref|XP_002525479.1| conserved hypothetical protein [Ricinus comm...   269   2e-69
ref|XP_006371138.1| hypothetical protein POPTR_0019s04490g [Popu...   267   8e-69
ref|XP_007030254.1| U11/U12 small nuclear ribonucleoprotein 48 k...   262   3e-67
ref|XP_004302118.1| PREDICTED: uncharacterized protein LOC101300...   259   1e-66
ref|XP_006297086.1| hypothetical protein CARUB_v10013089mg [Caps...   259   2e-66
ref|XP_002884430.1| hypothetical protein ARALYDRAFT_477678 [Arab...   256   1e-65
ref|XP_006408216.1| hypothetical protein EUTSA_v10020148mg [Eutr...   252   3e-64
ref|XP_006443313.1| hypothetical protein CICLE_v10019009mg [Citr...   234   4e-59
ref|XP_004237502.1| PREDICTED: uncharacterized protein LOC101244...   220   9e-55
ref|XP_006340483.1| PREDICTED: uncharacterized protein LOC102582...   219   2e-54
ref|XP_003535384.1| PREDICTED: uncharacterized protein LOC100803...   213   1e-52

>ref|XP_002268782.2| PREDICTED: uncharacterized protein LOC100263926 [Vitis vinifera]
          Length = 725

 Score =  313 bits (803), Expect = 8e-83
 Identities = 181/385 (47%), Positives = 232/385 (60%), Gaps = 3/385 (0%)
 Frame = -3

Query: 1191 MNPPNPYFNPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLSTF 1012
            MNPP P     S  F P +   PN N                          NPDL ST 
Sbjct: 1    MNPPPPSLRHHSFTFLPQN---PNTNMLNTVTP-------------------NPDLTSTL 38

Query: 1011 SILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLCCPSSP 832
            S LK +I  +   + S  NLLH       +     CPFD RH+MPPE LF H L CPSS 
Sbjct: 39   SALKALIHQSEAAVTSPHNLLHH-----CSAALSPCPFDPRHRMPPEFLFRHHLRCPSSH 93

Query: 831  -RVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFESNFFYRDC 655
               +D  IL+SL YP +L+S+      N F+Q L +S+++LCFS+D +GDF SNFFYRDC
Sbjct: 94   FPPLDPSILQSLRYPRTLQSQSP----NSFLQPLRDSNSELCFSLDQFGDFGSNFFYRDC 149

Query: 654  PGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRV--FPKECFVILPSDLLALKSE 481
            PGVV   E D   RT TLPG+LS +CANF+     + R+    +EC  +LPS+L   + E
Sbjct: 150  PGVV---ELDRLHRTLTLPGLLSVECANFVGVG-DDGRIGGASRECVRLLPSELWEFRRE 205

Query: 480  IELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLC 301
            I LWND+P SYS  VLRV LC +MV+E D LKW+I+NSP YG++ID  MRDHIF+L +L 
Sbjct: 206  IGLWNDFPSSYSYAVLRVVLCAEMVKEGDFLKWVIANSPWYGVVIDVAMRDHIFVLFRLV 265

Query: 300  LKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEANGK 121
            LKAI +EA       +S D      ++N  ++  +CP L + + WLASQ+SVLYGEANGK
Sbjct: 266  LKAIVREA-------ISWDVKGKGLEMNSKTMSLECPNLVQAMMWLASQISVLYGEANGK 318

Query: 120  FFSVNMLKNCILNVASSLLLFPLEQ 46
            FF++NMLK C+ NVAS L+LF LE+
Sbjct: 319  FFAINMLKQCLFNVASGLVLFALEE 343


>emb|CAN82741.1| hypothetical protein VITISV_026165 [Vitis vinifera]
          Length = 772

 Score =  313 bits (803), Expect = 8e-83
 Identities = 181/385 (47%), Positives = 232/385 (60%), Gaps = 3/385 (0%)
 Frame = -3

Query: 1191 MNPPNPYFNPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLSTF 1012
            MNPP P     S  F P +   PN N                          NPDL ST 
Sbjct: 1    MNPPPPSLRHHSFTFLPQN---PNTNMLNTVTP-------------------NPDLTSTL 38

Query: 1011 SILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLCCPSSP 832
            S LK +I  +   + S  NLLH       +     CPFD RH+MPPE LF H L CPSS 
Sbjct: 39   SALKALIHQSEXAVTSPHNLLHH-----CSAALSPCPFDPRHRMPPEFLFRHHLRCPSSH 93

Query: 831  -RVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFESNFFYRDC 655
               +D  IL+SL YP +L+S+      N F+Q L +S+++LCFS+D +GDF SNFFYRDC
Sbjct: 94   FPPLDPSILQSLRYPRTLQSQSP----NSFLQPLRDSNSELCFSLDQFGDFGSNFFYRDC 149

Query: 654  PGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRV--FPKECFVILPSDLLALKSE 481
            PGVV   E D   RT TLPG+LS +CANF+     + R+    +EC  +LPS+L   + E
Sbjct: 150  PGVV---ELDRLHRTLTLPGLLSVECANFVGVG-DDGRIGGASRECVRLLPSELWEFRRE 205

Query: 480  IELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLC 301
            I LWND+P SYS  VLRV LC +MV+E D LKW+I+NSP YG++ID  MRDHIF+L +L 
Sbjct: 206  IGLWNDFPSSYSYAVLRVVLCAEMVKEGDFLKWVIANSPWYGVVIDVAMRDHIFVLFRLV 265

Query: 300  LKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEANGK 121
            LKAI +EA       +S D      ++N  ++  +CP L + + WLASQ+SVLYGEANGK
Sbjct: 266  LKAIVREA-------ISWDVKGKGLEMNSKTMSLECPNLVQAMMWLASQISVLYGEANGK 318

Query: 120  FFSVNMLKNCILNVASSLLLFPLEQ 46
            FF++NMLK C+ NVAS L+LF LE+
Sbjct: 319  FFAINMLKQCLFNVASGLVLFALEE 343


>ref|XP_004155679.1| PREDICTED: uncharacterized LOC101218930 [Cucumis sativus]
          Length = 637

 Score =  307 bits (786), Expect = 7e-81
 Identities = 183/398 (45%), Positives = 228/398 (57%), Gaps = 5/398 (1%)
 Frame = -3

Query: 1194 AMNPPNPYFNPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLST 1015
            A+NP  P+  P    FP      PNPN                            DL S+
Sbjct: 3    AINPSLPF--PPYQTFPNFLPPNPNPNSHIHDSSHSQSQHPPL------------DLSSS 48

Query: 1014 FSILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLCCPSS 835
            FS L ++I  AN+T+ S+S L  S+     +     C FD RH++PP SLF HSL CPS+
Sbjct: 49   FSSLNNLIHFANQTLQSLSYLTPSDFAN--HSHLLHCHFDRRHRVPPHSLFRHSLLCPSA 106

Query: 834  --PRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFESNFFYR 661
              P +    + +SL YP +L S  +L  ENRF Q LP+SDADLCFS+ DY D  SNFFY 
Sbjct: 107  SLPPIDPTQLFQSLLYPQTLHSSRQLVNENRFSQVLPDSDADLCFSLTDYSDATSNFFYV 166

Query: 660  DCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLALKSE 481
            DCPGVV  S  D   + FTLP VL+  CANF+ +   E          ILPSDL  L+SE
Sbjct: 167  DCPGVVALSNLDEMSKVFTLPRVLAVHCANFVGNDHFEMN-STLNGIRILPSDLWNLRSE 225

Query: 480  IELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLC 301
            +E+WNDYP  YS  VLR  L  +M   S  + WII NSP YG++ID  +RDHIFLL +LC
Sbjct: 226  VEIWNDYPSKYSFVVLRSILGSEMALNSHLMTWIIENSPRYGVVIDVALRDHIFLLFRLC 285

Query: 300  LKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEANGK 121
              AI +EA   +  L   +  EG+      +  FKCPIL +V+ WLASQLSVLYGE NG 
Sbjct: 286  FMAIYKEALGFQVALEKGNGMEGESG----NSCFKCPILIQVLMWLASQLSVLYGETNGN 341

Query: 120  FFSVNMLKNCILNVASSLLLFPLEQKKTEA---SEGSN 16
            FF+VNML+ CIL+ AS LLL   EQK TE+    EGS+
Sbjct: 342  FFAVNMLRQCILDAASGLLLLQSEQKSTESLTLGEGSH 379


>ref|XP_004142553.1| PREDICTED: uncharacterized protein LOC101218930 [Cucumis sativus]
          Length = 548

 Score =  304 bits (779), Expect = 5e-80
 Identities = 184/398 (46%), Positives = 229/398 (57%), Gaps = 5/398 (1%)
 Frame = -3

Query: 1194 AMNPPNPYFNPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLST 1015
            A+NP  P+  P    FP      PNPN                            DL S+
Sbjct: 3    AINPSLPF--PPYQTFPNFLPPNPNPNSHIHDSSHSQSQHPPL------------DLSSS 48

Query: 1014 FSILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLCCPSS 835
            FS L ++I  AN+T+ S+S L  S+     +     C FD RH++PP SLF HSL CPS+
Sbjct: 49   FSSLNNLIHFANQTLQSLSYLTPSDFAN--HSHLLHCHFDRRHRVPPHSLFRHSLLCPSA 106

Query: 834  PRV-ID-LGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFESNFFYR 661
              + ID   + +SL YP +L S  +L  ENRF Q LP+SDADLCFS+ DY D  SNFFY 
Sbjct: 107  SLLPIDPTQLFQSLLYPQTLHSSRQLVNENRFSQVLPDSDADLCFSLTDYSDATSNFFYV 166

Query: 660  DCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLALKSE 481
            DCPGVV  S  D   + FTLP VL+  CANF+ +   E          ILPSDL  L+SE
Sbjct: 167  DCPGVVALSNLDEMSKVFTLPRVLAVHCANFVGNDHFEMN-STLNGIRILPSDLWNLRSE 225

Query: 480  IELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLC 301
            +E+WNDYP  YS  VLR  L  +M   S  + WII NSP YG++ID  +RDHIFLL +LC
Sbjct: 226  VEIWNDYPSKYSFVVLRSILGSEMALNSHLMTWIIENSPRYGVVIDVALRDHIFLLFRLC 285

Query: 300  LKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEANGK 121
              AI +EA   +  L   +  EG+      +  FKCPIL +V+ WLASQLSVLYGE NG 
Sbjct: 286  FMAIYKEALGFQVALEKGNGMEGESG----NSCFKCPILIQVLMWLASQLSVLYGETNGN 341

Query: 120  FFSVNMLKNCILNVASSLLLFPLEQKKTEA---SEGSN 16
            FF+VNML+ CIL+ AS LLL   EQK TE+    EGS+
Sbjct: 342  FFAVNMLRQCILDAASGLLLLQSEQKSTESLTLGEGSH 379


>ref|XP_007208065.1| hypothetical protein PRUPE_ppa001825mg [Prunus persica]
            gi|462403707|gb|EMJ09264.1| hypothetical protein
            PRUPE_ppa001825mg [Prunus persica]
          Length = 760

 Score =  299 bits (765), Expect = 2e-78
 Identities = 178/387 (45%), Positives = 227/387 (58%), Gaps = 3/387 (0%)
 Frame = -3

Query: 1185 PPNPYFNPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLSTFSI 1006
            PP  + +PS T  P  SN  PNPNF                          PDL +T S 
Sbjct: 4    PPAQFAHPSFTLIP--SNPNPNPNFFHSQPQNTQPVISTPPLPP-------PDLSTTISS 54

Query: 1005 LKDVIDLANKTINSISNLL--HSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLCCPSSP 832
            L  ++  + +T++S+S LL   + N   P      CPF+  H++ P SLF HSL CPS P
Sbjct: 55   LDSLVRDSYQTLDSLSALLPLQNPNYDNPQSSLIPCPFNPHHRVHPHSLFSHSLHCPSHP 114

Query: 831  RVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDY-GDFESNFFYRDC 655
                   L  L+YP +LKS D+ + E  F+Q+L  S+ADL  S++ Y  DF SNFFY DC
Sbjct: 115  HP-----LPHLNYPKTLKSSDQSQTEKSFLQTLHGSEADLRLSLEHYYADFGSNFFYSDC 169

Query: 654  PGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLALKSEIE 475
            PGVV  S  D   R FTLP +LS +CANFI    RE   F KE   ILPS+L A+K+E+E
Sbjct: 170  PGVVNFSGLDGVNRMFTLPLILSVECANFIGRGEREIMDFEKEWCRILPSELWAIKTEVE 229

Query: 474  LWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLCLK 295
             WN++P++YS  VL   L L +V+E D   WII+NSP YGI+ID  MRDHIFLL +LCLK
Sbjct: 230  GWNEFPFTYSYRVLCAILGLGVVKEYDVGTWIIANSPQYGIVIDVAMRDHIFLLSRLCLK 289

Query: 294  AISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEANGKFF 115
            AI +EA       LS+   EGD    P S  F+CP L + + WLASQLS+LYG  NGK F
Sbjct: 290  AILREA-------LSK-VKEGD----PESTHFECPTLVQALMWLASQLSILYGAQNGKLF 337

Query: 114  SVNMLKNCILNVASSLLLFPLEQKKTE 34
             +N+LK C+L+ A   L FPLEQ+ TE
Sbjct: 338  VINVLKKCLLDAALGSLTFPLEQQVTE 364


>gb|EXC21757.1| hypothetical protein L484_006471 [Morus notabilis]
          Length = 763

 Score =  287 bits (735), Expect = 6e-75
 Identities = 167/386 (43%), Positives = 225/386 (58%), Gaps = 3/386 (0%)
 Frame = -3

Query: 1182 PNPYFNPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLSTFSIL 1003
            P P+  PS  + PP     PNPN                            D  +T S L
Sbjct: 5    PTPFSQPSFHFLPP----NPNPNSVSLNAELQNPQPQNLTPQPL-------DFSATLSSL 53

Query: 1002 KDVIDLANKTINSISNLL--HSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLCCPSSPR 829
              +I  + +T+ ++ +LL   + N+   NG    CPF+S+H M P SLF H L C SSP 
Sbjct: 54   NGLIHHSEQTLRALFSLLPLQNPNQAHSNG-VVPCPFNSQHLMHPSSLFSHFLHCSSSPC 112

Query: 828  VIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDD-YGDFESNFFYRDCP 652
             I   +L  L+Y  +L S D  + E  F+Q+L  SD++LCFS+DD Y  F  NFFY DC 
Sbjct: 113  PIQFDLLPQLNYTETLNSSDSSKAERGFLQTLHGSDSELCFSLDDFYSQFGFNFFYNDCH 172

Query: 651  GVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLALKSEIEL 472
            GVV  S  D   RTFTLP  LS +CANF+ ++  E + F ++   ILPS+L A+++EIE 
Sbjct: 173  GVVNLSALDGISRTFTLPVFLSVECANFVSNNEEERKSFERKNRKILPSELWAIRAEIEA 232

Query: 471  WNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLCLKA 292
            WN+YP  YS  VL   L L  +   D  +W+I+NSP YG++ID  MRDHIFLL +LCLKA
Sbjct: 233  WNEYPNVYSYRVLYAILGLDFISVCDLARWVIANSPQYGVVIDTAMRDHIFLLCRLCLKA 292

Query: 291  ISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEANGKFFS 112
            I +EA    N + + +S +    LN  S+ F CPIL + + WLASQLS+LYGE NGKFF+
Sbjct: 293  ILKEAL---NLVGNCNSVK---ILN--SMNFSCPILVQALMWLASQLSILYGEMNGKFFA 344

Query: 111  VNMLKNCILNVASSLLLFPLEQKKTE 34
            +N+LK C+L+ AS L+ F LE+  TE
Sbjct: 345  LNILKQCVLDAASGLVFFSLEKSVTE 370


>ref|NP_001189804.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332640525|gb|AEE74046.1| uncharacterized protein
            AT3G04160 [Arabidopsis thaliana]
          Length = 714

 Score =  277 bits (709), Expect = 6e-72
 Identities = 158/342 (46%), Positives = 216/342 (63%), Gaps = 8/342 (2%)
 Frame = -3

Query: 1029 DLLSTFSILKDVIDLANKTINSISNLL---HSENRPKP-NGDFCSCPFDSRHQMPPESLF 862
            +L  T S LK ++    +T++S+S  L   HS    K  NG F  CPFDS H MPPE+LF
Sbjct: 56   ELSGTLSSLKSLLSECQRTLDSLSQNLALDHSSLLQKDENGCFVRCPFDSNHFMPPEALF 115

Query: 861  HHSLCCPSSPRVIDLGILESLH-YPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGD 685
             HSL CP++  +I L  LES   Y N+L+   EL+  N         D DLC S+DD  D
Sbjct: 116  LHSLRCPNTLDLIHL--LESFSSYRNTLELPCELQLNN--------GDGDLCISLDDLAD 165

Query: 684  FESNFFYRDCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPS 505
            F SNFFYRDCPG V  SE D  KRT TLP VLS +C++F+ S  +  ++   +C  +LPS
Sbjct: 166  FGSNFFYRDCPGAVKFSELDGKKRTLTLPHVLSVECSDFVGSDEKVKKIVLDKCLGVLPS 225

Query: 504  DLLALKSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDH 325
            DL A+K+EI+ W D+P SYS +VL   +  ++VE S   KWI+ NS  YG++ID  MRDH
Sbjct: 226  DLCAMKNEIDQWRDFPSSYSSSVLSSIVGSKVVEISALRKWILVNSTRYGVIIDTFMRDH 285

Query: 324  IFLLLKLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLR---FKCPILFEVVTWLASQ 154
            IFLL +LCLK+  +EA     F +  D+ +  G+   MS +   F+CP+  +V++WLASQ
Sbjct: 286  IFLLFRLCLKSAVKEAC---GFRMESDATD-VGEQKIMSCKSSTFECPVFIQVLSWLASQ 341

Query: 153  LSVLYGEANGKFFSVNMLKNCILNVASSLLLFPLEQKKTEAS 28
            L+VLYGE NGKFF+++M K CI+  AS ++LF LE  +++ S
Sbjct: 342  LAVLYGEGNGKFFALDMFKQCIVESASQVMLFRLEGTRSKCS 383


>ref|NP_187066.1| uncharacterized protein [Arabidopsis thaliana]
            gi|6721169|gb|AAF26797.1|AC016829_21 hypothetical protein
            [Arabidopsis thaliana] gi|332640524|gb|AEE74045.1|
            uncharacterized protein AT3G04160 [Arabidopsis thaliana]
          Length = 712

 Score =  277 bits (709), Expect = 6e-72
 Identities = 158/342 (46%), Positives = 216/342 (63%), Gaps = 8/342 (2%)
 Frame = -3

Query: 1029 DLLSTFSILKDVIDLANKTINSISNLL---HSENRPKP-NGDFCSCPFDSRHQMPPESLF 862
            +L  T S LK ++    +T++S+S  L   HS    K  NG F  CPFDS H MPPE+LF
Sbjct: 56   ELSGTLSSLKSLLSECQRTLDSLSQNLALDHSSLLQKDENGCFVRCPFDSNHFMPPEALF 115

Query: 861  HHSLCCPSSPRVIDLGILESLH-YPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGD 685
             HSL CP++  +I L  LES   Y N+L+   EL+  N         D DLC S+DD  D
Sbjct: 116  LHSLRCPNTLDLIHL--LESFSSYRNTLELPCELQLNN--------GDGDLCISLDDLAD 165

Query: 684  FESNFFYRDCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPS 505
            F SNFFYRDCPG V  SE D  KRT TLP VLS +C++F+ S  +  ++   +C  +LPS
Sbjct: 166  FGSNFFYRDCPGAVKFSELDGKKRTLTLPHVLSVECSDFVGSDEKVKKIVLDKCLGVLPS 225

Query: 504  DLLALKSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDH 325
            DL A+K+EI+ W D+P SYS +VL   +  ++VE S   KWI+ NS  YG++ID  MRDH
Sbjct: 226  DLCAMKNEIDQWRDFPSSYSSSVLSSIVGSKVVEISALRKWILVNSTRYGVIIDTFMRDH 285

Query: 324  IFLLLKLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLR---FKCPILFEVVTWLASQ 154
            IFLL +LCLK+  +EA     F +  D+ +  G+   MS +   F+CP+  +V++WLASQ
Sbjct: 286  IFLLFRLCLKSAVKEAC---GFRMESDATD-VGEQKIMSCKSSTFECPVFIQVLSWLASQ 341

Query: 153  LSVLYGEANGKFFSVNMLKNCILNVASSLLLFPLEQKKTEAS 28
            L+VLYGE NGKFF+++M K CI+  AS ++LF LE  +++ S
Sbjct: 342  LAVLYGEGNGKFFALDMFKQCIVESASQVMLFRLEGTRSKCS 383


>ref|XP_006848026.1| hypothetical protein AMTR_s00029p00178860 [Amborella trichopoda]
            gi|548851331|gb|ERN09607.1| hypothetical protein
            AMTR_s00029p00178860 [Amborella trichopoda]
          Length = 799

 Score =  271 bits (693), Expect = 4e-70
 Identities = 148/326 (45%), Positives = 200/326 (61%), Gaps = 1/326 (0%)
 Frame = -3

Query: 1026 LLSTFSILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLC 847
            L S+ S  KD +   +  +N    L+  E          +CPF+S H+MP + LF HSL 
Sbjct: 53   LKSSISNAKDALKRVSGFLNLDQTLIRHE--------LSTCPFNSNHRMPSQRLFRHSLT 104

Query: 846  CPSSPRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFESNFF 667
            C SSP  + +  L +L YPNSLKSE EL+ E +    +   ++DLCFS+DD G F +NFF
Sbjct: 105  CNSSPGALGVDNLGNLKYPNSLKSEKELKSEVQLFHEIHGVESDLCFSLDDGGGFSANFF 164

Query: 666  YRDCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICS-SIRETRVFPKECFVILPSDLLAL 490
            YRDCPGVV SSE + TK+TFTLP +LS +C N   +   +    FP     +LPS+L  +
Sbjct: 165  YRDCPGVVSSSEPE-TKKTFTLPSILSRECVNLAGNCDFKPHGEFP----WLLPSELWYM 219

Query: 489  KSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLL 310
            + E   WNDYP  YS   L+V  CL MV + + +KW++ NSP YG +ID  M +HIFLLL
Sbjct: 220  RKESGGWNDYPLCYSYASLKVSSCLSMVSKPEMVKWVLKNSPFYGSVIDNPMGEHIFLLL 279

Query: 309  KLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEA 130
            KLC KAIS+EA  S     +RD  +   D+  +S  FKCP+L E ++WL S LSVLYG  
Sbjct: 280  KLCFKAISREASSSLELHQNRDERDKGFDIKTLS--FKCPVLAESLSWLGSHLSVLYGHN 337

Query: 129  NGKFFSVNMLKNCILNVASSLLLFPL 52
            NGK F++++LK  +  + S L+LFPL
Sbjct: 338  NGKVFAIHVLKESLFIMGSRLVLFPL 363


>ref|XP_002525479.1| conserved hypothetical protein [Ricinus communis]
            gi|223535292|gb|EEF36969.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 722

 Score =  269 bits (687), Expect = 2e-69
 Identities = 156/389 (40%), Positives = 221/389 (56%), Gaps = 3/389 (0%)
 Frame = -3

Query: 1167 NPSSTYFPPI--SNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLSTFSILKDV 994
            NPSS  +P    ++ YP PNF                           DL +T S L ++
Sbjct: 2    NPSSAPYPDYFQNSNYPIPNFVFHSLPQPPPPHIPTITPTTPIL----DLSTTLSSLANL 57

Query: 993  IDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLCCPSSPRVIDLG 814
            + L+ +T NS+S+L+    +P  N  F SCP++  H MPPESLF HSL CPS      + 
Sbjct: 58   LSLSQQTRNSLSSLI----KPNKNVKFISCPYNPNHLMPPESLFLHSLRCPSPSFQDPIS 113

Query: 813  ILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDD-YGDFESNFFYRDCPGVVCS 637
            ++ SLHYP +L S++     N   ++    +A+LC S+D  Y +F SNFFY+DCPG V  
Sbjct: 114  LVNSLHYPKTLNSQNP---SNPLFKN--SDNAELCLSLDGFYNEFSSNFFYKDCPGAVQF 168

Query: 636  SEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLALKSEIELWNDYP 457
            S+ DS+ +TF LP VLS +CANF+     + + F    F ILPSDL  +K E+E W DYP
Sbjct: 169  SDLDSSSKTFLLPAVLSVECANFVARIEEDIKGFDINEFRILPSDLWVIKREVESWADYP 228

Query: 456  YSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLCLKAISQEA 277
              YS  V    L L +++ SD  +WII NSP YG++ID +MRDHI +L +LCL AI +EA
Sbjct: 229  SMYSYAVFCAILRLNVIKGSDLRRWIIFNSPRYGVVIDVYMRDHISVLFRLCLNAIRREA 288

Query: 276  YRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEANGKFFSVNMLK 97
            +   +F+           +N  +  F CP+L +V  W+  QLSVLYGE N K F++++ +
Sbjct: 289  F---SFM--------GHQMNVKTSSFNCPVLSQVFMWIVPQLSVLYGERNAKCFAIHIFR 337

Query: 96   NCILNVASSLLLFPLEQKKTEASEGSNAS 10
             CIL+V++  +LFPLE    E S   N +
Sbjct: 338  QCILDVSNG-MLFPLEANVKEISTELNGN 365


>ref|XP_006371138.1| hypothetical protein POPTR_0019s04490g [Populus trichocarpa]
            gi|550316777|gb|ERP48935.1| hypothetical protein
            POPTR_0019s04490g [Populus trichocarpa]
          Length = 723

 Score =  267 bits (682), Expect = 8e-69
 Identities = 159/379 (41%), Positives = 222/379 (58%), Gaps = 6/379 (1%)
 Frame = -3

Query: 1191 MNPPNPYFNPSSTYFP-PISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNP---DL 1024
            MNP  P+  P+   FP P  N+ PNPNF                        +     DL
Sbjct: 1    MNPYTPH--PNHLPFPYPSQNSNPNPNFLLHPFLPSQPPSKPPQVPPPTTTTTTTPILDL 58

Query: 1023 LSTFSILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLCC 844
             +T S L +++ L ++T+ S+S  + + ++P+ N +F  CPF+  H MPPESLF HSL C
Sbjct: 59   STTLSTLTNLLSLTHQTLTSLSPQI-TLSKPQ-NANFIPCPFNRHHLMPPESLFLHSLNC 116

Query: 843  PSSPRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPE-SDADLCFSIDDY-GDFESNF 670
            P           + LHYPN+L  +D   K++ F QS+ + ++ +LCFS+D Y   F S+F
Sbjct: 117  PVPLFQNPSSPFDYLHYPNTLNPQDP-HKDSNFSQSIQDPNETELCFSLDSYYNQFSSHF 175

Query: 669  FYRDCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLAL 490
             Y DCPG V  ++ DS+KR FTLPGVL  +C NF  S   E   F K  F +LPS+L A+
Sbjct: 176  SYNDCPGAVNLNDLDSSKRIFTLPGVLLIECVNFGVSGESERDGFDKNGFRVLPSELWAI 235

Query: 489  KSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLL 310
            + EIE W DYP  YS +V    L L +++ SD   WII+NSP YG++ID +MRDHI +L 
Sbjct: 236  RREIEGWIDYPSVYSYSVFCSILRLDLIKGSDLRSWIIANSPRYGVVIDVYMRDHICVLF 295

Query: 309  KLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGEA 130
            +LCLKAI +E   S +            ++N  SL  KCPIL +V+TW+ASQLSVLYGE 
Sbjct: 296  RLCLKAIRKEGLSSVSC-----------EMNVKSL--KCPILVQVLTWIASQLSVLYGEV 342

Query: 129  NGKFFSVNMLKNCILNVAS 73
            N K F++++LK C+L+ A+
Sbjct: 343  NAKCFAIHVLKQCLLDAAN 361


>ref|XP_007030254.1| U11/U12 small nuclear ribonucleoprotein 48 kDa protein, putative
            isoform 1 [Theobroma cacao]
            gi|590641526|ref|XP_007030255.1| U11/U12 small nuclear
            ribonucleoprotein 48 kDa protein, putative isoform 1
            [Theobroma cacao] gi|590641529|ref|XP_007030256.1|
            U11/U12 small nuclear ribonucleoprotein 48 kDa protein,
            putative isoform 1 [Theobroma cacao]
            gi|590641533|ref|XP_007030257.1| U11/U12 small nuclear
            ribonucleoprotein 48 kDa protein, putative isoform 1
            [Theobroma cacao] gi|508718859|gb|EOY10756.1| U11/U12
            small nuclear ribonucleoprotein 48 kDa protein, putative
            isoform 1 [Theobroma cacao] gi|508718860|gb|EOY10757.1|
            U11/U12 small nuclear ribonucleoprotein 48 kDa protein,
            putative isoform 1 [Theobroma cacao]
            gi|508718861|gb|EOY10758.1| U11/U12 small nuclear
            ribonucleoprotein 48 kDa protein, putative isoform 1
            [Theobroma cacao] gi|508718862|gb|EOY10759.1| U11/U12
            small nuclear ribonucleoprotein 48 kDa protein, putative
            isoform 1 [Theobroma cacao]
          Length = 740

 Score =  262 bits (669), Expect = 3e-67
 Identities = 149/334 (44%), Positives = 210/334 (62%), Gaps = 2/334 (0%)
 Frame = -3

Query: 1026 LLSTFSILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLC 847
            L +T S L  ++ L+++T+NS S L  S N   PN     CPF+  H + PESLF HSL 
Sbjct: 36   LSTTLSSLTALLSLSHQTLNSHSTLTKSLN---PN--LIPCPFNPNHLLAPESLFSHSLR 90

Query: 846  CPSSPRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDY-GDFESNF 670
            CPS P+ +DL      +Y N+L     L  ++   Q +  S+  LC S+D+Y  DF SNF
Sbjct: 91   CPS-PQNLDL---YPPNYRNTLIPPSNLHAQDTHFQGIQCSE--LCLSLDEYFADFGSNF 144

Query: 669  FYRDCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLAL 490
            F +DCP  V   + D++K+TFTLPG LS +C NF   + RE  V  ++   +L S L  +
Sbjct: 145  FCKDCPAAVNLFDIDNSKKTFTLPGFLSVECVNFEGFNEREGVVSEEKGLRVLASGLWEI 204

Query: 489  KSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLL 310
            + E+E W DYP SYS NV+   L  +MV+ S+  KWI++NSP YG++ID  M DHI +L+
Sbjct: 205  RREVERWGDYPGSYSFNVICAILGSKMVKGSNLRKWIVANSPRYGVMIDGCMGDHIVVLV 264

Query: 309  KLCLKAISQEAYRSRNFLLS-RDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYGE 133
            +LCLKA+ +EA       +   ++ E + D+N     F+CPIL +V+ WL SQLSVLYG+
Sbjct: 265  RLCLKAVVREAVGLMEVEMGYGEAKEKEWDVNLQMRMFECPILLQVLVWLGSQLSVLYGD 324

Query: 132  ANGKFFSVNMLKNCILNVASSLLLFPLEQKKTEA 31
             NGKFF++NM+K C+L  AS LLLFPLE+K T++
Sbjct: 325  VNGKFFAINMIKQCVLEGASLLLLFPLEEKVTDS 358


>ref|XP_004302118.1| PREDICTED: uncharacterized protein LOC101300357 [Fragaria vesca
            subsp. vesca]
          Length = 731

 Score =  259 bits (663), Expect = 1e-66
 Identities = 154/349 (44%), Positives = 203/349 (58%), Gaps = 5/349 (1%)
 Frame = -3

Query: 1032 PDLLSTFSILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHS 853
            PDL +  S L  +I  + + ++S+S LL   ++   +G   SCP +  H++ P SLF HS
Sbjct: 35   PDLSTAISSLSSLIRDSARILDSLSALLPLRSQGDSDG-LVSCPVNPHHRLHPHSLFSHS 93

Query: 852  LCCPSSPRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDY-GDFES 676
            L CP         ++  LHYP +L+S D+ +    F QS      DLC S++ Y  +F  
Sbjct: 94   LRCPRPLH----HLIPPLHYPKTLESTDQSQSGESFTQS-----GDLCLSLEHYYAEFGC 144

Query: 675  NFFYRDCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLL 496
            N FYRDCPGVV SS  D   +TFTLP VLSA+CANF    + E     K C   LPS+  
Sbjct: 145  NLFYRDCPGVVNSSALDGFDKTFTLPSVLSAECANFSGKEVGEMMDCDKVCSKFLPSESW 204

Query: 495  ALKSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFL 316
            A+K+E+  WN+YP  YS  VLR  L L ++ E D   W+I+NSP YGI+ID  M DHI L
Sbjct: 205  AVKNEVLRWNEYPPMYSSCVLRAVLGLGVLRECDLAIWVIANSPKYGIVIDVPMGDHIVL 264

Query: 315  LLKLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYG 136
            L+ LCL+AI +EA    N    RDS  G          ++CP L E + WLASQLS LYG
Sbjct: 265  LITLCLRAIVREALGKVN---DRDSESG---------YYECPALVEALVWLASQLSKLYG 312

Query: 135  EANGKFFSVNMLKNCILNVASSLLLFPLEQKKTE---ASEGS-NASKEG 1
            E NGK F++N LK+C+L+ A    +FPL+QK+TE     EGS N   EG
Sbjct: 313  ELNGKLFAINTLKHCVLDAALGSFVFPLKQKETEFHGLEEGSLNLDAEG 361


>ref|XP_006297086.1| hypothetical protein CARUB_v10013089mg [Capsella rubella]
            gi|482565795|gb|EOA29984.1| hypothetical protein
            CARUB_v10013089mg [Capsella rubella]
          Length = 703

 Score =  259 bits (661), Expect = 2e-66
 Identities = 156/388 (40%), Positives = 217/388 (55%), Gaps = 8/388 (2%)
 Frame = -3

Query: 1167 NPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLSTFSILKDVID 988
            NP+  +F       PNPNF                           +L  T + L+ ++ 
Sbjct: 12   NPNPNFFHHYPPPNPNPNFFFRPPPPPLQNPNTYSIAPSPPPIR--ELSGTITSLQSLLS 69

Query: 987  LANKTINSISNLL---HSENRPKP-NGDFCSCPFDSRHQMPPESLFHHSLCCPSSPRVID 820
               +T++S+S  L   HS    K  NG F  CPFDS H MPPE+LF HSL CP+   +  
Sbjct: 70   ECQRTLDSLSQNLALDHSYLLQKGGNGGFVRCPFDSNHFMPPEALFLHSLRCPNPLDLTH 129

Query: 819  L-GILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFESNFFYRDCPGVV 643
            L G   S  Y N+L+   +++  N           DLC S+D+  DF +NFFY+DCPG V
Sbjct: 130  LLGSFSS--YRNTLELPSQVQLSN--------DAGDLCVSLDELADFGTNFFYKDCPGAV 179

Query: 642  CSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLALKSEIELWND 463
              SE D  K T TLP +LS +C++   +  +E          ILPSDL A+KSEI  W D
Sbjct: 180  NFSELDGIKPTLTLPNILSLECSDLQVADEKENN----SMLGILPSDLCAIKSEINQWRD 235

Query: 462  YPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLCLKAISQ 283
            YP SYS +VL   L  + +E S+   WI+ NS  YG++ID +MRDHIFLL +LCLK++ +
Sbjct: 236  YPNSYSYSVLSAMLGSKAIETSELNSWILVNSTRYGVIIDTYMRDHIFLLFRLCLKSVVK 295

Query: 282  EAYRSRNFLLSRDSNEGDGDLNPMSLR---FKCPILFEVVTWLASQLSVLYGEANGKFFS 112
            EA     F++  D+N G G+   MS +   F+CP+L  V++WLASQL+VLYGE NGKFF+
Sbjct: 296  EAC---GFMMEPDAN-GVGEQQIMSCKSRIFECPVLVRVLSWLASQLAVLYGEGNGKFFA 351

Query: 111  VNMLKNCILNVASSLLLFPLEQKKTEAS 28
            ++M K CI+  AS ++LF  E+   ++S
Sbjct: 352  LDMFKQCIVESASQIMLFRSERSTPQSS 379


>ref|XP_002884430.1| hypothetical protein ARALYDRAFT_477678 [Arabidopsis lyrata subsp.
            lyrata] gi|297330270|gb|EFH60689.1| hypothetical protein
            ARALYDRAFT_477678 [Arabidopsis lyrata subsp. lyrata]
          Length = 704

 Score =  256 bits (654), Expect = 1e-65
 Identities = 158/391 (40%), Positives = 217/391 (55%), Gaps = 9/391 (2%)
 Frame = -3

Query: 1173 YFNPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLSTFSILKDV 994
            Y +P+  +F  +    PNPN                          + +L  T + L+ +
Sbjct: 10   YQHPNPNFFHHVPPPNPNPNIFFRPPPPHLQNPNNYSIAPPSPPPIH-ELSGTLTSLQSL 68

Query: 993  IDLANKTINSISNLL---HSENRPKP-NGDFCSCPFDSRHQMPPESLFHHSLCCPSSPRV 826
            +    +T++S+S  L   HS    K  NG F  CPFDS H MPPE+LF HSL CP+    
Sbjct: 69   LSECQRTLDSLSQNLALDHSSLLQKDENGGFVRCPFDSNHLMPPEALFLHSLRCPNP--- 125

Query: 825  IDLG-ILESLH-YPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFESNFFYRDCP 652
            +DL  IL S   Y N+L+   EL+  N         + DLC S+DD  DF  NFFYRDCP
Sbjct: 126  LDLTHILGSFSCYRNTLELPCELQLNN---------NGDLCVSLDDLADFGRNFFYRDCP 176

Query: 651  GVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLLALKSEIEL 472
            G V  SE D  K T TLP VLS +C +F+ S  +E      +   ILPSDL A+KSEI  
Sbjct: 177  GAVNFSELDGKKPTLTLPNVLSVECNDFVVSDEKEKGSMLDKWLGILPSDLCAIKSEINQ 236

Query: 471  WNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFLLLKLCLKA 292
            W D+P SYS +VL   +  + +  SD   WI+  S  YG++ID  MRDH+FLL +LCLK+
Sbjct: 237  WRDFPSSYSYSVLSSIVGSKAIATSDLRTWILVKSTRYGVIIDTFMRDHVFLLFRLCLKS 296

Query: 291  ISQEAYRSRNFLLSRDSNEGDGDLNPMSLR---FKCPILFEVVTWLASQLSVLYGEANGK 121
              +EA R    L+  D+N   G+   MS +   F+CP+L +V++WLASQL+VLYGE NGK
Sbjct: 297  AVKEACR----LIESDAN-AVGEKQIMSCKSRTFECPVLIQVLSWLASQLAVLYGEGNGK 351

Query: 120  FFSVNMLKNCILNVASSLLLFPLEQKKTEAS 28
            +F+++M K CI+  A  ++LF  E  + + S
Sbjct: 352  YFALDMFKQCIVESAFRVMLFQSEGTRPKCS 382


>ref|XP_006408216.1| hypothetical protein EUTSA_v10020148mg [Eutrema salsugineum]
            gi|557109362|gb|ESQ49669.1| hypothetical protein
            EUTSA_v10020148mg [Eutrema salsugineum]
          Length = 733

 Score =  252 bits (643), Expect = 3e-64
 Identities = 143/341 (41%), Positives = 205/341 (60%), Gaps = 7/341 (2%)
 Frame = -3

Query: 1029 DLLSTFSILKDVIDLANKTINSISNLLHSEN----RPKPNGDFCSCPFDSRHQMPPESLF 862
            +L  T S L+ ++    +T+ S+S  L  ++    +   NG F  CPFD  H MPPE+LF
Sbjct: 54   ELSGTLSSLQSLLSECQRTLASLSENLALDHSSLLQRDDNGGFVRCPFDPNHLMPPEALF 113

Query: 861  HHSLCCPSSPRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDF 682
             HSL CP+      L +   L   +S ++  EL  E +    L   D DLCF +DD  DF
Sbjct: 114  LHSLRCPNP-----LDLTHLLGSFSSYRTTLELPCEPQ----LNNGDGDLCFCLDDLTDF 164

Query: 681  ESNFFYRDCPGVVCSSEQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFVILPSD 502
             SNFFY DCPG V  SE D  KRT TLP VLS +C++F+ S  +E     ++   +LPS 
Sbjct: 165  GSNFFYNDCPGAVNFSELDGKKRTLTLPSVLSVECSDFVGSDEKEKMSVLEKRLGVLPSG 224

Query: 501  LLALKSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHI 322
            L A+K+EI+ W D+P SYS +VL   L  + +E S+   WI+ NS  YG++ID +MRDH+
Sbjct: 225  LCAIKNEIDQWRDFPTSYSFSVLSSILGSEAIETSELSSWILVNSTRYGVIIDTYMRDHV 284

Query: 321  FLLLKLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLR---FKCPILFEVVTWLASQL 151
            FLL +L LKA+ +EA     F++  D+N   G+   MS +   F+C +L  V++W ASQL
Sbjct: 285  FLLFRLSLKAVVKEAC---GFMIESDAN-AVGEQQIMSSKTRTFECAVLVRVLSWFASQL 340

Query: 150  SVLYGEANGKFFSVNMLKNCILNVASSLLLFPLEQKKTEAS 28
            +VLYGE +GKFF+++M K CI+  AS ++LF  E  + ++S
Sbjct: 341  AVLYGEGSGKFFALDMFKQCIVESASQIMLFRSEITRPKSS 381


>ref|XP_006443313.1| hypothetical protein CICLE_v10019009mg [Citrus clementina]
            gi|568850668|ref|XP_006479024.1| PREDICTED:
            uncharacterized protein LOC102620724 [Citrus sinensis]
            gi|557545575|gb|ESR56553.1| hypothetical protein
            CICLE_v10019009mg [Citrus clementina]
          Length = 738

 Score =  234 bits (598), Expect = 4e-59
 Identities = 146/346 (42%), Positives = 206/346 (59%), Gaps = 13/346 (3%)
 Frame = -3

Query: 1029 DLLSTFSILKDVIDLANKTINSISNLLHSENRPKPNGD-FCSCPFDSRHQMPPESLFHHS 853
            DL +T S L  +I   ++T+ + S LL     PKP  D    CP++ +H MPPESLF H+
Sbjct: 31   DLSTTLSSLNALISFCHQTLQNYSFLL-----PKPQNDNLLPCPYNPQHLMPPESLFLHT 85

Query: 852  LCCPSSPRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDY-GDFES 676
            L CP     +DL   +  +Y N+L S   L ++N  + ++ +   +LCFS+DDY  +  S
Sbjct: 86   LHCPFP---LDL---DPPNYRNTLHSSSLLNQQNAPL-TIQDHIQELCFSLDDYLSNVRS 138

Query: 675  -NFFYRDCPGVVCSSEQDST----KRTFTLPGVLSAQCANFICSSIRETRV----FPKEC 523
             +FFY+DCP  V  S+  ++    K+T  LPG+L  +CAN +C S  E +     F +  
Sbjct: 139  VSFFYQDCPAAVALSDFHASTSISKKTLALPGILCMECANVVCLSDGEAKKNAEGFGEVG 198

Query: 522  FVILPSDLLALKSEIELWNDYPYS--YSINVLRVFLCLQMVEESDSLKWIISNSPCYGIL 349
              +L SDL  ++ E+E W DY +   YS NV    L L+ V  SD  KW++ NSP +G++
Sbjct: 199  LRVLCSDLWFIRREVESWRDYEHMSMYSFNVFCAILGLRTVNVSDLSKWVLVNSPRFGVV 258

Query: 348  IDAHMRDHIFLLLKLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVT 169
            ID +MRDHI +L+ LCLKA+  EA     FL    S E +  L  M+L  KCP+L +V+ 
Sbjct: 259  IDVYMRDHISVLVGLCLKAVISEAL---GFLELVKSQELERGLKSMNL--KCPVLKQVLM 313

Query: 168  WLASQLSVLYGEANGKFFSVNMLKNCILNVASSLLLFPLEQKKTEA 31
            WLASQLSVLYG+ +GK F++ + K CIL  AS LLLFPLEQ  TE+
Sbjct: 314  WLASQLSVLYGQVSGKIFAIEIFKQCILESASGLLLFPLEQSLTES 359


>ref|XP_004237502.1| PREDICTED: uncharacterized protein LOC101244071 [Solanum
            lycopersicum]
          Length = 719

 Score =  220 bits (561), Expect = 9e-55
 Identities = 147/405 (36%), Positives = 218/405 (53%), Gaps = 11/405 (2%)
 Frame = -3

Query: 1185 PPNPY-FNPSSTYFPPISNAYPNPNFXXXXXXXXXXXXXXXXXXXXXXXXSNPDLLSTFS 1009
            PP P    P+S + PP ++ +P P                             DL S  S
Sbjct: 7    PPLPLPLPPASAFPPPPASTFPRP-------------PPSYFHHPTPTSALAYDLPSALS 53

Query: 1008 ILKDVIDLANKTINSISNLLHSEN-RPKPNGDFCSCPFDSRHQMPPESLFHHSLCCP--- 841
             L  +++L++ T+NS+S+LL      P P+     CPF+S H++P  SLF HSL CP   
Sbjct: 54   SLTSLLNLSSTTLNSLSSLLPIPLVAPSPSPALIPCPFNSNHRLPLSSLFSHSLHCPPIS 113

Query: 840  SSPRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFES-NFFY 664
            SS       +++ L YP++L         N F   L ES +DLCFS++ Y DFE+  F Y
Sbjct: 114  SSSADYIQTLIQHLKYPHTL------HYSNPFTLPLLESQSDLCFSLETYLDFENPTFCY 167

Query: 663  RDCPGVVCSS--EQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFV-ILPSDLLA 493
             +CPGVV      +++     TLP VLS++CANF     +    FPKE    +LPS++ A
Sbjct: 168  SNCPGVVSFPIRGENANPPMLTLPAVLSSECANFG----QNLMGFPKEIVSQLLPSEVYA 223

Query: 492  LKSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSP-CYGILIDAHMRDHIFL 316
            +++E + WN++P+ YS +VLR  L L M        W+++NS   Y +++D  MRDH+ +
Sbjct: 224  IRNETDHWNEFPFMYSYHVLRAILGLGMSSVECLSTWVVANSARYYSVVLDLAMRDHVLV 283

Query: 315  LLKLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLR-FKCPILFEVVTWLASQLSVLY 139
            L KLCLKAI +E+       L+     G+ + + +S R FKCP+L +V+ WL +QLSVLY
Sbjct: 284  LFKLCLKAIVRESID-----LASTFCNGEAEESVLSNRSFKCPVLVQVLVWLGTQLSVLY 338

Query: 138  GEANGKFFSVNMLKNCILNVASSLLLFPLEQKKTEASEGSNASKE 4
            GE NGK F++NMLK  I + A S  +F    + T+   G +  +E
Sbjct: 339  GEMNGKLFAINMLKQSICDCAFSSCMF---NESTDMKSGEDNLQE 380


>ref|XP_006340483.1| PREDICTED: uncharacterized protein LOC102582686 isoform X1 [Solanum
            tuberosum]
          Length = 721

 Score =  219 bits (558), Expect = 2e-54
 Identities = 141/352 (40%), Positives = 203/352 (57%), Gaps = 10/352 (2%)
 Frame = -3

Query: 1029 DLLSTFSILKDVIDLANKTINSISNLLHSEN-RPKPNGDFCSCPFDSRHQMPPESLFHHS 853
            DL    S L  +++L++ T+NS+S+LL      P P+     CPF+  H++P  SLF HS
Sbjct: 52   DLPGALSSLTSLLNLSSTTLNSLSSLLPIPLVPPSPSPALIPCPFNPNHRLPLSSLFSHS 111

Query: 852  LCCP---SSPRVIDLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDF 682
            L CP   SS       +++ L YP++L S       N F   L ES +DLCFS++ Y DF
Sbjct: 112  LHCPPISSSSADYIQTLIQHLKYPHTLHSS------NPFTLPLLESQSDLCFSLETYLDF 165

Query: 681  ES-NFFYRDCPGVVCSS--EQDSTKRTFTLPGVLSAQCANFICSSIRETRVFPKECFV-I 514
            E+  F Y +CPGVV      +++     TL  VLS++CANF     +    FPKE    +
Sbjct: 166  ENPTFCYSNCPGVVSFPIRGENANPPMLTLLAVLSSECANFG----QNLMGFPKEIVSQL 221

Query: 513  LPSDLLALKSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSP-CYGILIDAH 337
            LPS++ A+++E + WN++P+ YS  VLR  L L M        W+++NS   Y +++D  
Sbjct: 222  LPSEVYAIRNETDHWNEFPFMYSYRVLRAILGLGMSSVECLSTWVVANSARYYSVVLDLA 281

Query: 336  MRDHIFLLLKLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLR-FKCPILFEVVTWLA 160
            MRDHI +L KLCLKAI +E+    N L S   N G+ + + +S R FKCP+L +V  WL 
Sbjct: 282  MRDHILVLFKLCLKAIVRES----NDLASTFCN-GEAEESVLSNRSFKCPVLVQVFVWLG 336

Query: 159  SQLSVLYGEANGKFFSVNMLKNCILNVASSLLLFPLEQKKTEASEGSNASKE 4
            +QLSVLYGE NGK F++NMLK CI + A S  +F    + T+   G +  +E
Sbjct: 337  TQLSVLYGEMNGKLFAINMLKQCICDCAFSSCMF---NESTDMKSGDDNLQE 385


>ref|XP_003535384.1| PREDICTED: uncharacterized protein LOC100803944 isoform X1 [Glycine
            max] gi|571483372|ref|XP_006589217.1| PREDICTED:
            uncharacterized protein LOC100803944 isoform X2 [Glycine
            max] gi|571483374|ref|XP_006589218.1| PREDICTED:
            uncharacterized protein LOC100803944 isoform X3 [Glycine
            max]
          Length = 687

 Score =  213 bits (543), Expect = 1e-52
 Identities = 139/344 (40%), Positives = 182/344 (52%), Gaps = 3/344 (0%)
 Frame = -3

Query: 1026 LLSTFSILKDVIDLANKTINSISNLLHSENRPKPNGDFCSCPFDSRHQMPPESLFHHSLC 847
            L ST + L ++I L+N  ++       + + P  N +   CPF+  H +PP SLF H L 
Sbjct: 39   LSSTLTSLNNLITLSNHVLSL------TPSPPTLNSNLIQCPFNPHHLLPPPSLFLHHLR 92

Query: 846  CPSSPRVI-DLGILESLHYPNSLKSEDELRKENRFVQSLPESDADLCFSIDDYGDFESNF 670
            CPSSPR + DL    SL YP +L +                S +D    +  Y D  SNF
Sbjct: 93   CPSSPRPLPDLNPSPSLTYPKTLHN----------------SPSD---QLSFYLDSLSNF 133

Query: 669  FYRDCPGVVCSSEQDSTKRT--FTLPGVLSAQCANFICSSIRETRVFPKECFVILPSDLL 496
            FYRD P VV  S  DS  RT   TLP  LS QCA+    SI E+  F      ILPS   
Sbjct: 134  FYRDSPAVVAFSHADSLTRTASLTLPSFLSLQCADTYTHSIPESASFHAP---ILPSQYF 190

Query: 495  ALKSEIELWNDYPYSYSINVLRVFLCLQMVEESDSLKWIISNSPCYGILIDAHMRDHIFL 316
            ++  E++ WND+P +YS +VLR  L L +  + D   W+I+NSP YG++ID  M+ HIFL
Sbjct: 191  SIARELDCWNDFPATYSSSVLRAILGLGIANDRDLTDWMIANSPRYGVVIDTSMQHHIFL 250

Query: 315  LLKLCLKAISQEAYRSRNFLLSRDSNEGDGDLNPMSLRFKCPILFEVVTWLASQLSVLYG 136
            L  +CLK+I +EA  S +              N  SL   CP+  + +TWLASQ+S+LYG
Sbjct: 251  LCCMCLKSILREASVSVD--------------NQNSL-VDCPVTNQALTWLASQVSILYG 295

Query: 135  EANGKFFSVNMLKNCILNVASSLLLFPLEQKKTEASEGSNASKE 4
             ANGK F +N +K CIL  AS LLLFPL        E  N   E
Sbjct: 296  AANGKAFVLNFVKKCILVGASVLLLFPLGDNAASKQESQNLGTE 339

