BLASTX nr result

ID: Akebia24_contig00011857 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00011857
         (1478 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   417   e-114
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   417   e-114
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   404   e-110
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   400   e-108
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   398   e-108
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   398   e-108
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   382   e-103
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   382   e-103
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   380   e-103
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   376   e-101
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   373   e-100
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   373   e-100
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   363   1e-97
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     355   3e-95
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   355   3e-95
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   355   3e-95
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   341   4e-91
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   335   4e-89
ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210...   332   2e-88
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   332   2e-88

>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  417 bits (1073), Expect = e-114
 Identities = 225/380 (59%), Positives = 264/380 (69%), Gaps = 3/380 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPS+TQSPA  +SLT+LS N YSP GP S+FAIGPYAHETQLVSPPVFSTF
Sbjct: 92   SPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPVFSTF 151

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
             TEPSTAPFTPPPESV LTTPSSPEVPFAQLLTSSLD++ R +G ++K S S+YEFQ YQ
Sbjct: 152  PTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQ 211

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            LYP SP+GHLISP   IS SGT SPFPDR               E PKLL     STR+W
Sbjct: 212  LYPESPVGHLISP---ISNSGTSSPFPDRR-----------PIVEAPKLLGFEHFSTRRW 257

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G + GS S  PD A P ++ + ++ENQISEVASLANSE+G+QN E VIDHRVSFEL  E+
Sbjct: 258  GSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGED 317

Query: 718  ASNCVEKEPVASVVAITVSLLD-TTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTS 894
             + CVEK+PVAS   +  +L D      +  ERDG+++  E   E+CVGE  K  SEK S
Sbjct: 318  VAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKAS 377

Query: 895  GDGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPT-LGSDWWANEKVVT 1071
             +G +E+                   G +KEF FDNTKG  S KP  +GS+WW NEKVV 
Sbjct: 378  AEGEEEQCHKKHPPIRH---------GSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVG 428

Query: 1072 KETGPHNNWAFFPMMQPGVS 1131
            K TGP  NW FFP++QPG+S
Sbjct: 429  KGTGPQTNWTFFPLLQPGIS 448


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  417 bits (1073), Expect = e-114
 Identities = 225/380 (59%), Positives = 264/380 (69%), Gaps = 3/380 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPS+TQSPA  +SLT+LS N YSP GP S+FAIGPYAHETQLVSPPVFSTF
Sbjct: 29   SPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPVFSTF 88

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
             TEPSTAPFTPPPESV LTTPSSPEVPFAQLLTSSLD++ R +G ++K S S+YEFQ YQ
Sbjct: 89   PTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQ 148

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            LYP SP+GHLISP   IS SGT SPFPDR               E PKLL     STR+W
Sbjct: 149  LYPESPVGHLISP---ISNSGTSSPFPDRR-----------PIVEAPKLLGFEHFSTRRW 194

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G + GS S  PD A P ++ + ++ENQISEVASLANSE+G+QN E VIDHRVSFEL  E+
Sbjct: 195  GSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGED 254

Query: 718  ASNCVEKEPVASVVAITVSLLD-TTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTS 894
             + CVEK+PVAS   +  +L D      +  ERDG+++  E   E+CVGE  K  SEK S
Sbjct: 255  VAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKAS 314

Query: 895  GDGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPT-LGSDWWANEKVVT 1071
             +G +E+                   G +KEF FDNTKG  S KP  +GS+WW NEKVV 
Sbjct: 315  AEGEEEQCHKKHPPIRH---------GSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVG 365

Query: 1072 KETGPHNNWAFFPMMQPGVS 1131
            K TGP  NW FFP++QPG+S
Sbjct: 366  KGTGPQTNWTFFPLLQPGIS 385


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  404 bits (1039), Expect = e-110
 Identities = 224/379 (59%), Positives = 257/379 (67%), Gaps = 2/379 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSATQSPA   SLT+   ++YSP GP SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 99   SPASFLQSEPPSATQSPAGFFSLTA---SMYSPSGPTSIFAIGPYAHETQLVSPPVFSTF 155

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL    D + R     ++F  SHYEFQSYQ
Sbjct: 156  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL----DPHFRNGEGGQRFPLSHYEFQSYQ 211

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            LYPGSP+G LISP S IS SGT SPFPD EF++ GHHFLEF  G+PPKLLN+  LSTR W
Sbjct: 212  LYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDW 271

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G + GS S  PD A+ T+    +++ Q  EV     S N  +NN+  I+HRVSFEL+ EE
Sbjct: 272  GSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEE 331

Query: 718  ASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYC-VGETSKNMSEKTS 894
               CVEK+PVA   A++ SL DT  A    +   V      +S  C VGETS + +EK  
Sbjct: 332  VIRCVEKKPVALAEAVSTSLEDTEKAQSKEDPSKV-----VSSSICPVGETSNDAAEKAV 386

Query: 895  GDGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTK 1074
             DG + +                 TLG VKEF FDN  GG S   ++GSDWWANEKV  K
Sbjct: 387  ADGEEAQLHPKQRS---------ITLGSVKEFNFDNPDGGDSGN-SIGSDWWANEKVDAK 436

Query: 1075 ETGPHNNWAFFPMMQPGVS 1131
            E GP  NW+FFPMMQPGVS
Sbjct: 437  ENGPTKNWSFFPMMQPGVS 455


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  400 bits (1027), Expect = e-108
 Identities = 224/397 (56%), Positives = 260/397 (65%), Gaps = 20/397 (5%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPG-PNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSATQSP+ ++SLTS+++N+YSPG P SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 98   SPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTF 157

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESVHLTTPSSPEVPFAQL     D N+R      +F  S YEFQSYQ
Sbjct: 158  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQL----FDPNNRNGEAGHRFLLSQYEFQSYQ 213

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSG-HHFLEFCAGEPPKLLNIGGLSTRK 534
            LYPGSP+GHLISP S IS SGT SPFPDR+F  SG   FLEF AG PPKLL +  LS  +
Sbjct: 214  LYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKLSNHE 273

Query: 535  WGPQQGSLSPMP------------------DVAQPTNQVNSIVENQISEVASLANSENGN 660
            WG + GS S  P                  DV  P +  +S+++ QIS+VAS + S++G 
Sbjct: 274  WGSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGC 333

Query: 661  QNNEAVIDHRVSFELTREEASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEET 840
             NNE ++DHRVSFELT E+   CVEK+  A V A++ SL +        E D  ++E   
Sbjct: 334  PNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQN----PATVEIDENSREVVV 389

Query: 841  ASEYCVGETSKNMSEKTSGDGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTS 1020
             SE  VGET+ N  EK   D N E+                 TLG  KEF FDN  GG S
Sbjct: 390  DSEGRVGETANNPPEKAPEDANGEEGQPHHKQRS-------ITLGSAKEFNFDNADGGHS 442

Query: 1021 DKPTLGSDWWANEKVVTKETGPHNNWAFFPMMQPGVS 1131
            DKP + SDWWANEKVV KE G   NW+ F MMQP VS
Sbjct: 443  DKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 479


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  398 bits (1023), Expect = e-108
 Identities = 223/410 (54%), Positives = 264/410 (64%), Gaps = 33/410 (8%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSATQSPA ++SLTSLS N YSP GP SIFAIGPYAHETQLV+PPVFS  
Sbjct: 96   SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSAL 155

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESV LTTPSSPEVPFAQLLTSSL++  R +G ++KF  SHYEFQSYQ
Sbjct: 156  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQ 215

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            +YPGSP G+LISPGS IS SGT SPFPDR         LEF  GE PKLL     +TRKW
Sbjct: 216  IYPGSPGGNLISPGSAISNSGTSSPFPDR------RPILEFRMGEAPKLLGFENFTTRKW 269

Query: 538  GPQ--QGSLSP------------------------------MPDVAQPTNQVNSIVENQI 621
            G +   GSL+P                               PD   P ++   +V +QI
Sbjct: 270  GSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQI 329

Query: 622  SEVASLANSENGNQNNEAVIDHRVSFELTREEASNCVEKEPVASVVAITVSLLDTTVAAV 801
            SEVA LAN  NG +N+E ++DHRVSFEL+ E+ + C+E + +    A++    D  VA  
Sbjct: 330  SEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKD-LVAEG 388

Query: 802  AAERDGVAKEEETASEYCVGETSKNMSEKTSGDGNDEKXXXXXXXXXXXXXXXLTTLGLV 981
              ERDG+ K+ E++ E  + ETS    EK SG+  +E                  TLG +
Sbjct: 389  RKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRS---------VTLGSI 439

Query: 982  KEFKFDNTKGGTSDKPTLGSDWWANEKVVTKETGPHNNWAFFPMMQPGVS 1131
            KEF FDNTKG  SDKPT+ S+WWANEKV  KE  P N+W FFPM+QP VS
Sbjct: 440  KEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  398 bits (1023), Expect = e-108
 Identities = 223/410 (54%), Positives = 264/410 (64%), Gaps = 33/410 (8%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSATQSPA ++SLTSLS N YSP GP SIFAIGPYAHETQLV+PPVFS  
Sbjct: 92   SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSAL 151

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESV LTTPSSPEVPFAQLLTSSL++  R +G ++KF  SHYEFQSYQ
Sbjct: 152  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQ 211

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            +YPGSP G+LISPGS IS SGT SPFPDR         LEF  GE PKLL     +TRKW
Sbjct: 212  IYPGSPGGNLISPGSAISNSGTSSPFPDR------RPILEFRMGEAPKLLGFENFTTRKW 265

Query: 538  GPQ--QGSLSP------------------------------MPDVAQPTNQVNSIVENQI 621
            G +   GSL+P                               PD   P ++   +V +QI
Sbjct: 266  GSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQI 325

Query: 622  SEVASLANSENGNQNNEAVIDHRVSFELTREEASNCVEKEPVASVVAITVSLLDTTVAAV 801
            SEVA LAN  NG +N+E ++DHRVSFEL+ E+ + C+E + +    A++    D  VA  
Sbjct: 326  SEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKD-LVAEG 384

Query: 802  AAERDGVAKEEETASEYCVGETSKNMSEKTSGDGNDEKXXXXXXXXXXXXXXXLTTLGLV 981
              ERDG+ K+ E++ E  + ETS    EK SG+  +E                  TLG +
Sbjct: 385  RKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRS---------VTLGSI 435

Query: 982  KEFKFDNTKGGTSDKPTLGSDWWANEKVVTKETGPHNNWAFFPMMQPGVS 1131
            KEF FDNTKG  SDKPT+ S+WWANEKV  KE  P N+W FFPM+QP VS
Sbjct: 436  KEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  382 bits (981), Expect = e-103
 Identities = 220/428 (51%), Positives = 258/428 (60%), Gaps = 51/428 (11%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPS+TQSPA ++SLTSLS+N YSP GP SIFAIGPYAHETQLV+PPVFS F
Sbjct: 98   SPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQLVTPPVFSAF 157

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESV LTTPSSPEVPFAQLLTSSL++  R +G ++KFS SHYEFQSY 
Sbjct: 158  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSLSHYEFQSYH 217

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            LYPGSP G +ISPGS IS SGT SPFPDR      H  LEF  GE PKLL     STRKW
Sbjct: 218  LYPGSPGGQIISPGSAISNSGTSSPFPDR------HPMLEFRMGEAPKLLGFEHFSTRKW 271

Query: 538  GPQ--QGSLSP------------------------------------------------M 567
            G +   GSL+P                                                 
Sbjct: 272  GSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGLRSRLGSGTLT 331

Query: 568  PDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREEASNCVEKEPV 747
            PD   P +Q+  ++ENQISEVASL NSENG++  E V+ HRVSFEL+ EE + C+E + V
Sbjct: 332  PDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEEVARCLEIKSV 391

Query: 748  ASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSGDGNDEKXXXX 927
            AS         D T+       D +A   E   +   GE S  M EK S +  ++     
Sbjct: 392  ASTRTFPEYPQD-TMPEDPVRGDRLAMNGERCLQN--GEASSEMPEKNSEETEEDHVYRK 448

Query: 928  XXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKETGPHNNWAFF 1107
                         TLG +KEF FDN+KG  SDKP + S+WWANE +  KE  P N+W FF
Sbjct: 449  HRS---------ITLGSIKEFNFDNSKGEVSDKPAISSEWWANETIAGKEARPANSWTFF 499

Query: 1108 PMMQPGVS 1131
            P++QP VS
Sbjct: 500  PLLQPEVS 507


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  382 bits (980), Expect = e-103
 Identities = 212/380 (55%), Positives = 256/380 (67%), Gaps = 3/380 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPG-PNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSATQSPA +VSL S+S N+YSPG P+SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 100  SPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTF 159

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL  SL    R     +KF  S+YEFQSY 
Sbjct: 160  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSL----RFGEQGQKFPFSYYEFQSYH 215

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            L+PGSP+G+LISP S IS SGT SPFPD EF+++G  F +F  G+PPKLLN+  LS R+W
Sbjct: 216  LHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREW 275

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G +QGS +  PD  + T +       QISEVA   +SENG + ++ ++DHRVSFELT E+
Sbjct: 276  GSRQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQ-IVDHRVSFELTTED 334

Query: 718  ASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSG 897
               CVEK+P     A++ SL + T      E++  + E E     C GE + +   KT  
Sbjct: 335  VVRCVEKKPTTLAEAVSESLQNGT----TVEKEESSGEAENVHHSCAGEAANDEPLKTPV 390

Query: 898  DGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKE 1077
            D  +                   TLG  KEF FD+   G S +PT+ SDWWANEKVV K+
Sbjct: 391  DVEEAPRHQKQQS---------ITLGSTKEFNFDSA-DGDSHEPTIASDWWANEKVVGKD 440

Query: 1078 TGPHNNWAFFPMMQ--PGVS 1131
            +G   NWAFFP++Q  PGVS
Sbjct: 441  SGAIKNWAFFPVIQPAPGVS 460


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  380 bits (977), Expect = e-103
 Identities = 212/380 (55%), Positives = 255/380 (67%), Gaps = 3/380 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPG-PNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSATQSPA +VSL S+S N+YSPG P+SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 100  SPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTF 159

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL  SL    R     +KF  S+YEFQSY 
Sbjct: 160  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSL----RFGEQGQKFPFSYYEFQSYH 215

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            L+PGSP+G+LISP S IS SGT SPFPD EF+++G  F +F  G+PPKLLN+  LS R+W
Sbjct: 216  LHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREW 275

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G +QGS +  PD    T +       QISEVA   +SENG + ++ ++DHRVSFELT E+
Sbjct: 276  GSRQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQ-IVDHRVSFELTTED 334

Query: 718  ASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSG 897
               CVEK+P     A++ SL + T      E++  + E E     C GE + +   KT  
Sbjct: 335  VVRCVEKKPTTLAEAVSESLQNGT----TVEKEESSGEAENVHHSCAGEAANDEPLKTPV 390

Query: 898  DGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKE 1077
            D  +                   TLG  KEF FD+   G S +PT+ SDWWANEKVV K+
Sbjct: 391  DVEEAPRHQKQQS---------ITLGSTKEFNFDSA-DGDSHEPTIASDWWANEKVVGKD 440

Query: 1078 TGPHNNWAFFPMMQ--PGVS 1131
            +G   NWAFFP++Q  PGVS
Sbjct: 441  SGAIKNWAFFPVIQPAPGVS 460


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  376 bits (965), Expect = e-101
 Identities = 218/426 (51%), Positives = 260/426 (61%), Gaps = 49/426 (11%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPG-PNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSATQSPA ++SLTSLS N YSPG P SIFAIGPYAHETQLV+PP FS F
Sbjct: 105  SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAFSAF 164

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESV LTTPSSPEVPFAQLLTSSL++  R +G ++KF+ SHYEFQSY 
Sbjct: 165  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQSYP 224

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
            LYPGSP G LISPGSVIS SGT SPFPDR      +  LEF  GE PKLL     +TRKW
Sbjct: 225  LYPGSPGGQLISPGSVISNSGTSSPFPDR------YPILEFRMGEAPKLLGFEHFTTRKW 278

Query: 538  GPQQGS--LSP----------------------------------------------MPD 573
            G + GS  ++P                                               PD
Sbjct: 279  GSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPD 338

Query: 574  VAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREEASNCVEKEPVAS 753
               P ++    +ENQISEVASLANSENG++ +E ++DHRVSFEL+ EE + C+E + +AS
Sbjct: 339  AVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKSLAS 398

Query: 754  VVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSGDGNDEKXXXXXX 933
              A +    D    ++A ++    K   T      GETS    EK SG+  +E       
Sbjct: 399  CRAFSECPPD----SMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEEHCYRKHR 454

Query: 934  XXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKETGPHNNWAFFPM 1113
                       TLG +KEF FDN+K    DKP++ S+WWANE +  KE  P NNW FFP+
Sbjct: 455  S---------ITLGSIKEFNFDNSK-EVPDKPSINSEWWANETIAGKEARPANNWTFFPL 504

Query: 1114 MQPGVS 1131
            +QP VS
Sbjct: 505  LQPEVS 510


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  373 bits (957), Expect = e-100
 Identities = 209/378 (55%), Positives = 246/378 (65%), Gaps = 1/378 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPGPNSIFAIGPYAHETQLVSPPVFSTFT 180
            SPASFLQS PPSA QSP    SL   S+++YSPGP+SIFAIGPYAHETQLVSPPVFSTFT
Sbjct: 63   SPASFLQSEPPSAMQSPGFNFSL---SASMYSPGPSSIFAIGPYAHETQLVSPPVFSTFT 119

Query: 181  TEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQL 360
            TEPSTAPFTPP ESVHLT PSSPEVPFAQLL    D N R     +++  SHYEFQSYQ 
Sbjct: 120  TEPSTAPFTPPAESVHLTRPSSPEVPFAQLL----DSNFRFGEGGQRYPLSHYEFQSYQW 175

Query: 361  YPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKWG 540
            YPGSP+G LISP S IS SGT SPF D EF+S GHHFLEF  GE PK+LN+  L TR WG
Sbjct: 176  YPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWG 235

Query: 541  PQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREEA 720
             +  S S  PD A+ T+     ++    E    A S +  +N+ A I HRVSFEL+ EE 
Sbjct: 236  SRLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEV 295

Query: 721  SNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYC-VGETSKNMSEKTSG 897
              CVEK+PVA   A++ SL     +A  AER+    +E ++S  C V +TS + SEK  G
Sbjct: 296  VRCVEKKPVALAEAVSTSL----QSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVG 351

Query: 898  DGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKE 1077
               +E                  TLG  KEF FDN  GG S   ++ +DWWANEKVV KE
Sbjct: 352  GDAEE-------LSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKE 404

Query: 1078 TGPHNNWAFFPMMQPGVS 1131
             G   NW+FFPM+QPG+S
Sbjct: 405  NGESKNWSFFPMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  373 bits (957), Expect = e-100
 Identities = 209/378 (55%), Positives = 246/378 (65%), Gaps = 1/378 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPGPNSIFAIGPYAHETQLVSPPVFSTFT 180
            SPASFLQS PPSA QSP    SL   S+++YSPGP+SIFAIGPYAHETQLVSPPVFSTFT
Sbjct: 100  SPASFLQSEPPSAMQSPGFNFSL---SASMYSPGPSSIFAIGPYAHETQLVSPPVFSTFT 156

Query: 181  TEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQL 360
            TEPSTAPFTPP ESVHLT PSSPEVPFAQLL    D N R     +++  SHYEFQSYQ 
Sbjct: 157  TEPSTAPFTPPAESVHLTRPSSPEVPFAQLL----DSNFRFGEGGQRYPLSHYEFQSYQW 212

Query: 361  YPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKWG 540
            YPGSP+G LISP S IS SGT SPF D EF+S GHHFLEF  GE PK+LN+  L TR WG
Sbjct: 213  YPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWG 272

Query: 541  PQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREEA 720
             +  S S  PD A+ T+     ++    E    A S +  +N+ A I HRVSFEL+ EE 
Sbjct: 273  SRLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEV 332

Query: 721  SNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYC-VGETSKNMSEKTSG 897
              CVEK+PVA   A++ SL     +A  AER+    +E ++S  C V +TS + SEK  G
Sbjct: 333  VRCVEKKPVALAEAVSTSL----QSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVG 388

Query: 898  DGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKE 1077
               +E                  TLG  KEF FDN  GG S   ++ +DWWANEKVV KE
Sbjct: 389  GDAEE-------LSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKE 441

Query: 1078 TGPHNNWAFFPMMQPGVS 1131
             G   NW+FFPM+QPG+S
Sbjct: 442  NGESKNWSFFPMIQPGMS 459


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  363 bits (932), Expect = 1e-97
 Identities = 205/379 (54%), Positives = 246/379 (64%), Gaps = 3/379 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPGPNSIFAIGPYAHETQLVSPPVFSTFT 180
            SPASFL S PPSATQSPA +VSLTS+S+++YSPGP SIFAIGPYAHETQLVSPPVFSTFT
Sbjct: 99   SPASFLPSEPPSATQSPAGLVSLTSISASMYSPGPASIFAIGPYAHETQLVSPPVFSTFT 158

Query: 181  TEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQL 360
            TEPSTAPFTPPPESVHLTTPSSPEVPFAQLL  +L          ++F  SHYEFQSYQL
Sbjct: 159  TEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGV----QRFPISHYEFQSYQL 214

Query: 361  YPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKWG 540
            +PGSP+G LISP S IS SGT SPF D EF++S  HF EF  G+PPKLLN+   S+ +WG
Sbjct: 215  HPGSPVGQLISPSSGISGSGTSSPFRDGEFAAS-LHFPEFRMGDPPKLLNLDKHSSCEWG 273

Query: 541  PQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGN-QNNEAVIDHRVSFELTREE 717
               GS +  PD  + T +   ++++QISE+ S  + +N   QN++   +HRVSFELT EE
Sbjct: 274  SHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEE 333

Query: 718  ASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEY-C-VGETSKNMSEKT 891
                +E E      A++ SL       + A R+    + +   +Y C VGETS    EK 
Sbjct: 334  VVRSLEMETATPSEAVSGSL------QIEATRESEEHDTKVVDDYECRVGETSNERPEKA 387

Query: 892  SGDGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVT 1071
              D   +                  TLG  KEF FDN  GG + KP L SDWWAN+KV  
Sbjct: 388  LADREGKPQHHKHQS---------ITLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAG 438

Query: 1072 KETGPHNNWAFFPMMQPGV 1128
            K  G   NW+FFPMMQPGV
Sbjct: 439  KGGGVPRNWSFFPMMQPGV 457


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  355 bits (911), Expect = 3e-95
 Identities = 205/378 (54%), Positives = 233/378 (61%), Gaps = 1/378 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPG-PNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSATQSPA ++SLTS+S+++YSPG P SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 101  SPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPVFSTF 160

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESVHLTTPSSPEVPFAQL    LD N       ++F   H EFQSY 
Sbjct: 161  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQL----LDPNIHNGEPGQRFPIFHNEFQSYY 216

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
              PGSPIG LISP S IS SGT SPFPD EF++ G HFLEF  G+PPKLLN+  LS   W
Sbjct: 217  FQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGDPPKLLNLDKLSKFDW 276

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G +QGS S  PD  +P +           EVA         +N E V D RVSF+++ E+
Sbjct: 277  GSRQGSGSLTPDSVKPISTF---------EVAPHLKPNGRCRNAENVADRRVSFDVSTED 327

Query: 718  ASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSG 897
                VEK+ V    A+  SL DTT+     E     K EE   E  VGETS    +K   
Sbjct: 328  VIRYVEKKTVPLAEAMLTSLKDTTMGQ-REENSDSNKVEEIGCENRVGETSNEEPDKAPT 386

Query: 898  DGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKE 1077
             G +                   TLG  KEF FDN   G   K    SDWWAN+KV  KE
Sbjct: 387  SGEE---------VLQHQKHRSITLGSSKEFNFDNADAGDLHKSDSVSDWWANQKVAGKE 437

Query: 1078 TGPHNNWAFFPMMQPGVS 1131
              P  NW+FFPM+QPGVS
Sbjct: 438  GAPSQNWSFFPMIQPGVS 455


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  355 bits (911), Expect = 3e-95
 Identities = 205/378 (54%), Positives = 246/378 (65%), Gaps = 1/378 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASF QS PPS TQSPA +VSLTS+S+++YSP GP SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 98   SPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTF 157

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESVHLTTPSSPEVPFAQ L  SL      NGD+    P  ++FQSYQ
Sbjct: 158  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSL-----RNGDTGLRFP--FDFQSYQ 210

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
             +PGSP+G LISP S IS SGT SPFPD EF+  G HF EF  GEPPKLLN+  LST +W
Sbjct: 211  FHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEW 270

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G  QGS +  P+  +     N ++  Q S+V S   S NG++N + V++HRVSFELT E+
Sbjct: 271  GSYQGSGALTPESVR-RGSPNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRVSFELTAED 328

Query: 718  ASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSG 897
            AS CVE++P  S+  +   + + T     A+ +  + E   + E  VG TS +  E  S 
Sbjct: 329  ASRCVEEKPAFSIKTVPEYVENGT----QAKEEKNSGESIQSFECRVGVTSNDSPEMAST 384

Query: 898  DGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKE 1077
            DG                     TLG VKEF FDN   G S KP+  S+WWAN  V+ KE
Sbjct: 385  DGE---------AAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKE 434

Query: 1078 TGPHNNWAFFPMMQPGVS 1131
                 NW+FFPM+Q GVS
Sbjct: 435  GETTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  355 bits (911), Expect = 3e-95
 Identities = 205/378 (54%), Positives = 246/378 (65%), Gaps = 1/378 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASF QS PPS TQSPA +VSLTS+S+++YSP GP SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 99   SPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTF 158

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESVHLTTPSSPEVPFAQ L  SL      NGD+    P  ++FQSYQ
Sbjct: 159  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSL-----RNGDTGLRFP--FDFQSYQ 211

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
             +PGSP+G LISP S IS SGT SPFPD EF+  G HF EF  GEPPKLLN+  LST +W
Sbjct: 212  FHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEW 271

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G  QGS +  P+  +     N ++  Q S+V S   S NG++N + V++HRVSFELT E+
Sbjct: 272  GSYQGSGALTPESVR-RGSPNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRVSFELTAED 329

Query: 718  ASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSG 897
            AS CVE++P  S+  +   + + T     A+ +  + E   + E  VG TS +  E  S 
Sbjct: 330  ASRCVEEKPAFSIKTVPEYVENGT----QAKEEKNSGESIQSFECRVGVTSNDSPEMAST 385

Query: 898  DGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKE 1077
            DG                     TLG VKEF FDN   G S KP+  S+WWAN  V+ KE
Sbjct: 386  DGE---------AAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKE 435

Query: 1078 TGPHNNWAFFPMMQPGVS 1131
                 NW+FFPM+Q GVS
Sbjct: 436  GETTKNWSFFPMVQSGVS 453


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  341 bits (875), Expect = 4e-91
 Identities = 203/406 (50%), Positives = 244/406 (60%), Gaps = 29/406 (7%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPGPN-SIFAIGPYAHETQLVSPPVFSTF 177
            SPASFL S PPSATQSPA ++SL +LS N YSPG   SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 92   SPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPVFSTF 151

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTA FTPPPE VH+TTP SPEVPFAQLLTSSL +N R +G + KF  S YEF  YQ
Sbjct: 152  TTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ 211

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
              PGSP  +LISPGSV+S SGT SPFP +         +EF  GEPPK L     STRKW
Sbjct: 212  -DPGSPGSNLISPGSVVSNSGTSSPFPGK------CPIIEFRKGEPPKFLGYEHFSTRKW 264

Query: 538  GPQQGS----------------LSP------------MPDVAQPTNQVNSIVENQISEVA 633
            G + GS                L+P             P+  +P ++ + ++ENQISEVA
Sbjct: 265  GSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVA 324

Query: 634  SLANSENGNQNNEAVIDHRVSFELTREEASNCVEKEPVASVVAITVSLLDTTVAAVAAER 813
            SLANS+NG++  EAVIDHRVSFELT E+  +C EKEPV S    T+  +D +    +  R
Sbjct: 325  SLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLP-MDVSNLLASEMR 383

Query: 814  DGVAKEEETASEYCVGETSKNMSEKTSGDGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFK 993
             G +  E         E +     K S  G DE                  T G  K+F 
Sbjct: 384  SGSSMAE---------EKTYGSPRKASESGEDE----------CHRKHRNITFGSSKDFD 424

Query: 994  FDNTKGGTSDKPTLGSDWWANEKVVTKETGPHNNWAFFPMMQPGVS 1131
            FDN K    +K ++  +WW ++K   KE+G  NNW FFP++QPGVS
Sbjct: 425  FDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  335 bits (858), Expect = 4e-89
 Identities = 200/406 (49%), Positives = 241/406 (59%), Gaps = 29/406 (7%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSPGPN-SIFAIGPYAHETQLVSPPVFSTF 177
            SPASFL S PPSATQSPA ++SL SLS N YSPG   SIFAIGPYAHETQLVSPPVFSTF
Sbjct: 92   SPASFLPSDPPSATQSPAGLLSLKSLSINAYSPGGTASIFAIGPYAHETQLVSPPVFSTF 151

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTA FTPPPE VH+TTP SPEVPFAQLLTSSL +N R +G + KF  S YEF  YQ
Sbjct: 152  TTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ 211

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
              PGSP  +LISPGSV+S SGT SPFP +         +EF  GEPPK L     STRKW
Sbjct: 212  -DPGSPGSNLISPGSVVSNSGTSSPFPGK------CPIIEFRKGEPPKFLGYEHFSTRKW 264

Query: 538  GPQ--QGSLSP--------------------------MPDVAQPTNQVNSIVENQISEVA 633
            G +   GSL+P                           P+  +P ++ + ++E QISEVA
Sbjct: 265  GSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLEYQISEVA 324

Query: 634  SLANSENGNQNNEAVIDHRVSFELTREEASNCVEKEPVASVVAITVSLLDTTVAAVAAER 813
            SLANS+NG++  E VIDHRVSFELT E+  +C EKEPV S    T+ +  + + A   + 
Sbjct: 325  SLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVMSHSQQTLPMDVSNLLANEMKS 384

Query: 814  DGVAKEEETASEYCVGETSKNMSEKTSGDGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFK 993
                 EE+T               K S  G D+                  T G  K+F 
Sbjct: 385  GSSMAEEKTYGS----------PRKASESGEDQ----------CHRKHRNITFGSSKDFD 424

Query: 994  FDNTKGGTSDKPTLGSDWWANEKVVTKETGPHNNWAFFPMMQPGVS 1131
            FDN K    +K ++  +WW ++K   KE+G  NNW FFP++QPGVS
Sbjct: 425  FDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470


>ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus]
          Length = 497

 Score =  332 bits (852), Expect = 2e-88
 Identities = 198/426 (46%), Positives = 250/426 (58%), Gaps = 49/426 (11%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS P S TQSPA ++SLT+LS N YSP GP SIFAIGPY ++TQLVSPPVFS F
Sbjct: 94   SPASFLQSEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAF 153

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAP TPPPESV LTTPSSPEVPFA+LLTSSL   +++ G ++KF+ SH +FQ YQ
Sbjct: 154  TTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQ 213

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
             YPGSP  HLISPGSVIS SGT SPFPD+      H  LEF   + PKLL +   +TRKW
Sbjct: 214  PYPGSPGAHLISPGSVISNSGTSSPFPDK------HPILEFRMADAPKLLGLEHFTTRKW 267

Query: 538  GPQQGSLSPMPDVAQPTNQVNS-------------------------------------- 603
              + GS S  PD     +++ S                                      
Sbjct: 268  ISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPD 327

Query: 604  ----------IVENQISEVASLANSENGNQNNEAVIDHRVSFELTREEASNCVEKEPVAS 753
                      +++NQISEVASLANSE G QN+  V +HRVSFELT E+ + C+  + + S
Sbjct: 328  GLGHGLQDSPLLDNQISEVASLANSETGCQND--VTNHRVSFELTGEDVARCLANKSLTS 385

Query: 754  VVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSGDGNDEKXXXXXX 933
            +   + S   T+ +     ++  ++E ET   + +  ++    EKT G+ +         
Sbjct: 386  IRTESESPKQTSTSNQNENKES-SREAETCEFFDIKTSA--APEKTPGEDDQ-------- 434

Query: 934  XXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKETGPHNNWAFFPM 1113
                       TLG  KEF FD TKG   +  ++G++WWANEKV  KE  P NNW FFP+
Sbjct: 435  ---CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPL 491

Query: 1114 MQPGVS 1131
            +QPGVS
Sbjct: 492  LQPGVS 497


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  332 bits (852), Expect = 2e-88
 Identities = 193/377 (51%), Positives = 243/377 (64%), Gaps = 1/377 (0%)
 Frame = +1

Query: 1    SPASFLQSGPPSATQSPAEMVSLTSLSSNVYSP-GPNSIFAIGPYAHETQLVSPPVFSTF 177
            SPASFLQS PPSA+QSPA ++SLTS+S+++YSP GP SIFAIGPYAHETQLVSPP FSTF
Sbjct: 103  SPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTF 162

Query: 178  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLTSSLDKNHRTNGDSKKFSPSHYEFQSYQ 357
            TTEPSTAPFTPPPESV LTTPSSPEVPFAQL    L+ ++R      +F  S+YEFQSYQ
Sbjct: 163  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQL----LEPSNRNGEAGLRFPFSNYEFQSYQ 218

Query: 358  LYPGSPIGHLISPGSVISVSGTCSPFPDREFSSSGHHFLEFCAGEPPKLLNIGGLSTRKW 537
             YPGSP+G LISP S IS SGT SPFPD EF+++G  FLEF    PPKLLN+  LS  + 
Sbjct: 219  FYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHEC 278

Query: 538  GPQQGSLSPMPDVAQPTNQVNSIVENQISEVASLANSENGNQNNEAVIDHRVSFELTREE 717
            G +QGS +  PD  + T+  +  ++ Q S++AS  +S+N N++++ V D RVSF+L+ E+
Sbjct: 279  GSRQGSGTLTPDAVRATS-CSFPLDRQCSDIASNRHSDNENKDDQ-VADLRVSFDLSAED 336

Query: 718  ASNCVEKEPVASVVAITVSLLDTTVAAVAAERDGVAKEEETASEYCVGETSKNMSEKTSG 897
            A    E +P + V  +  S+ +     +AAE+   + E     E  VGETS  + E+ S 
Sbjct: 337  ALRYAEPKPASPVKIMPESMKN----EIAAEKVQKSSEIRHNFECRVGETSNGILEQAST 392

Query: 898  DGNDEKXXXXXXXXXXXXXXXLTTLGLVKEFKFDNTKGGTSDKPTLGSDWWANEKVVTKE 1077
             G                     TLG  KEF FDN  G    KP+ G DWW N   V KE
Sbjct: 393  GGE---------KTPRHQKHRTLTLGTFKEFNFDNADG--VPKPSAGPDWWDNGSDVGKE 441

Query: 1078 TGPHNNWAFFPMMQPGV 1128
                 NW+FFP+MQP +
Sbjct: 442  DFTAKNWSFFPVMQPSI 458

