BLASTX nr result

ID: Cornus23_contig00012941 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00012941
         (2166 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   611   e-172
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   582   e-163
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   576   e-161
ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648...   561   e-156
ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166...   557   e-155
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   549   e-153
gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]   548   e-153
ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765...   548   e-153
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   545   e-152
gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium r...   543   e-151
ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236...   541   e-150
gb|KHF98668.1| hypothetical protein F383_11887 [Gossypium arboreum]   539   e-150
ref|XP_012486505.1| PREDICTED: uncharacterized protein LOC105800...   538   e-150
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   538   e-150
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   538   e-150
ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116...   538   e-149
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   536   e-149
emb|CDP05166.1| unnamed protein product [Coffea canephora]            533   e-148
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   533   e-148
gb|KDO51973.1| hypothetical protein CISIN_1g010808mg [Citrus sin...   532   e-148

>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
            gi|731424007|ref|XP_010662699.1| PREDICTED:
            uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  611 bits (1575), Expect = e-172
 Identities = 319/456 (69%), Positives = 355/456 (77%), Gaps = 4/456 (0%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            M +V+NS              ESRVQPTTVQKRRWGSC SLYWCFGS +HSKRIGHAVL 
Sbjct: 1    MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEP VPGA AP S+NLN ST+IV PFI       SFLQSDPPS+TQSPAG LSLT+LSV 
Sbjct: 61   PEPMVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVN 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            AYSP GPAS+FAIGPYA+ETQLVSPPVFST  TEPSTA FTPPPE VQLTTPSSP+VPFA
Sbjct: 121  AYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLLTSSL R+RRNSGTNQK SLS YEFQPYQLYPESPV HLISP   ISNSGTSSPF D+
Sbjct: 181  QLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDR 237

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWGSRLGSGSLTPNGGEPASRDSLLLESQISEVASLA 1002
            RPI+     EAPK+ G+EHF  R+WGSRLGSGSLTP+G  PASRDS LLE+QISEVASLA
Sbjct: 238  RPIV-----EAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLA 292

Query: 1001 NSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDRLSSLETVSESLRDIASEG--TTERDA 828
            NSE+GSQN E VIDHRVSFEL G++V  C+EK  ++S ETV  +L+DI  EG    ERD 
Sbjct: 293  NSESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDG 352

Query: 827  TAKSTENCSECRVGEALDEMQGKASGDAEEELCHRKHRSITLGSINEFNFDNTKGEVLDK 648
             ++STENC E  VGEAL     KAS + EEE CH+KH  I  GSI EFNFDNTKGEV  K
Sbjct: 353  ISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAK 412

Query: 647  AP-ISSEWWSNEKVLGKELTPQNDWSFYPMLQPGLS 543
               I SEWW NEKV+GK   PQ +W+F+P+LQPG+S
Sbjct: 413  PNIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  582 bits (1500), Expect = e-163
 Identities = 299/463 (64%), Positives = 342/463 (73%), Gaps = 33/463 (7%)
 Frame = -2

Query: 1832 SRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAPVSDNLNHSTTI 1653
            SRVQPTTVQK+RWGSCW LYWCFGS K+SKRIGHAVL PEP VPGA+   ++N+++ T I
Sbjct: 23   SRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGI 82

Query: 1652 V-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYAYETQL 1476
            + PFI       SFLQSDPPS TQSPAGLLSLTSLSV AYSP GPASIFAIGPYA+ETQL
Sbjct: 83   ILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQL 142

Query: 1475 VSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTNQKFSL 1296
            V+PPVFS LTTEPSTA FTPPPE VQLTTPSSP+VPFAQLLTSSL R RRNSG NQKF L
Sbjct: 143  VTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGL 202

Query: 1295 SQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGYEHFYV 1116
            S YEFQ YQ+YP SP  +LISPGSAISNSGTSSPF D+RPI+E RMGEAPK+ G+E+F  
Sbjct: 203  SHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFENFTT 262

Query: 1115 RKWGSRLGSGSLTPNG-GE-------------------------------PASRDSLLLE 1032
            RKWGSRLGSGSLTP+G G+                               PASRD  L+ 
Sbjct: 263  RKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVG 322

Query: 1031 SQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDRLSSLETVSESLRDIAS 852
            SQISEVA LAN  NG +NDE ++DHRVSFEL+G++V  C+E   L     VSE  +D+ +
Sbjct: 323  SQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVA 382

Query: 851  EGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRKHRSITLGSINEFNFDN 672
            EG  ERD   K  E+  E  + E  +E   KASG+AEEE  ++KHRS+TLGSI EFNFDN
Sbjct: 383  EGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDN 442

Query: 671  TKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGLS 543
            TKGE  DK  I SEWW+NEKV GKE  P N W+F+PMLQP +S
Sbjct: 443  TKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  576 bits (1485), Expect = e-161
 Identities = 299/467 (64%), Positives = 342/467 (73%), Gaps = 37/467 (7%)
 Frame = -2

Query: 1832 SRVQPTTVQ----KRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAPVSDNLNH 1665
            SRVQPTTVQ    K+RWGSCW LYWCFGS K+SKRIGHAVL PEP VPGA+   ++N+++
Sbjct: 23   SRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSN 82

Query: 1664 STTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYAY 1488
             T I+ PFI       SFLQSDPPS TQSPAGLLSLTSLSV AYSP GPASIFAIGPYA+
Sbjct: 83   PTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAH 142

Query: 1487 ETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTNQ 1308
            ETQLV+PPVFS LTTEPSTA FTPPPE VQLTTPSSP+VPFAQLLTSSL R RRNSG NQ
Sbjct: 143  ETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQ 202

Query: 1307 KFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGYE 1128
            KF LS YEFQ YQ+YP SP  +LISPGSAISNSGTSSPF D+RPI+E RMGEAPK+ G+E
Sbjct: 203  KFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFE 262

Query: 1127 HFYVRKWGSRLGSGSLTPNG-GE-------------------------------PASRDS 1044
            +F  RKWGSRLGSGSLTP+G G+                               PASRD 
Sbjct: 263  NFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDG 322

Query: 1043 LLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDRLSSLETVSESLR 864
             L+ SQISEVA LAN  NG +NDE ++DHRVSFEL+G++V  C+E   L     VSE  +
Sbjct: 323  FLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPK 382

Query: 863  DIASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRKHRSITLGSINEF 684
            D+ +EG  ERD   K  E+  E  + E  +E   KASG+AEEE  ++KHRS+TLGSI EF
Sbjct: 383  DLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEF 442

Query: 683  NFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGLS 543
            NFDNTKGE  DK  I SEWW+NEKV GKE  P N W+F+PMLQP +S
Sbjct: 443  NFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648441 [Jatropha curcas]
            gi|643706116|gb|KDP22248.1| hypothetical protein
            JCGZ_26079 [Jatropha curcas]
          Length = 498

 Score =  561 bits (1445), Expect = e-156
 Identities = 297/501 (59%), Positives = 342/501 (68%), Gaps = 49/501 (9%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            M +V+NS              ESRVQPT VQKRRWG CWSLYWCFGS K+SKRIGHAVL 
Sbjct: 1    MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFGSHKNSKRIGHAVLV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTI-VPFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEP VP A    ++N  HST   VPFI       SFLQSDPPS TQSPAGLLSLT+LSV 
Sbjct: 61   PEPEVPQAVVTSAENQTHSTAAAVPFIAPPSSPASFLQSDPPSVTQSPAGLLSLTALSVS 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            AYSPGGPASIFAIGPYA+ETQLV+PPVFS  TTEPSTA FTPPPE VQLTTPSSP+VPFA
Sbjct: 121  AYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLLTSSL R RRNSG NQKF+LS YEFQ Y LYP SP   LISPGS ISNSGTSSPF D+
Sbjct: 181  QLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPFPDR 240

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWGSRLGSGSLTPNG---------------------- 1068
             P++E RMGEAPK+ G+EHF  RKWGSRLGSG+LTP+G                      
Sbjct: 241  HPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGLGSR 300

Query: 1067 --------------------------GEPASRDSLLLESQISEVASLANSENGSQNDEIV 966
                                        PAS+D LLLE+QISEVASLANSEN S+NDE +
Sbjct: 301  LGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKNDENI 360

Query: 965  IDHRVSFELTGDNVPTCMEKDRLSSLETVSESLRDIASEGTTERDATAKSTENCSECRVG 786
            +DHRVSFEL+G+ V  C+E   ++S  T SE  +D  +E     +    ++ +C    +G
Sbjct: 361  VDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDC--LHIG 418

Query: 785  EALDEMQGKASGDAEEELCHRKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVL 606
            E  +E   K SG+ EEE C+RKHRSITLGSI EFNFDN+K EV DK  ISSEWW+NE + 
Sbjct: 419  ETSNETPEKPSGETEEEPCYRKHRSITLGSIKEFNFDNSK-EVPDKPTISSEWWANETIA 477

Query: 605  GKELTPQNDWSFYPMLQPGLS 543
            GKE  P N+W+F+P+LQP +S
Sbjct: 478  GKEARPANNWTFFPLLQPEVS 498


>ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166690 [Sesamum indicum]
          Length = 479

 Score =  557 bits (1435), Expect = e-155
 Identities = 292/481 (60%), Positives = 340/481 (70%), Gaps = 29/481 (6%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            MS+VHNS              ESRVQP+TVQKRRWGSCWS+YWCFGS K SKRIGHAVL 
Sbjct: 1    MSSVHNSVETVNAAATAIVTAESRVQPSTVQKRRWGSCWSIYWCFGSHKQSKRIGHAVLV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
             EP   G AAP+S+N N S+TIV PFI       SFLQSDPPS TQSPAGL+SL SLSV 
Sbjct: 61   SEPAAAGVAAPISENRNQSSTIVLPFIAPPSSPASFLQSDPPSATQSPAGLISLASLSVH 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            A SPGG A IF IGPYA+ETQLVSPPVFST TTEPSTASFTPPPEPVQ+TTPSSP+VPFA
Sbjct: 121  ANSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTPSSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLL+SSLAR RRN GTN K+SLSQYEFQPYQ YP SP  H+ SPGSA+S SGTSSPF DK
Sbjct: 181  QLLSSSLARNRRNCGTNLKYSLSQYEFQPYQ-YPGSPGGHIKSPGSALSTSGTSSPFPDK 239

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWG----------------------------SRLGSG 1086
             PI+E RMGEAPK  GYEHF   KWG                            SRLGSG
Sbjct: 240  HPIMEFRMGEAPKFLGYEHFPNYKWGSRVGSGSLTPNGWGSRLGSGALTPNGGLSRLGSG 299

Query: 1085 SLTPNGGEPASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEK 906
            +LTPNGGEP SRD  LLE+QI EVASLANS+  SQND+ V+DHRVSFEL G+++PTC+  
Sbjct: 300  TLTPNGGEPPSRDGNLLENQIYEVASLANSDRKSQNDDAVVDHRVSFELFGEDIPTCVVT 359

Query: 905  DRLSSLETVSESLRDIASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCH 726
            +   S +  S       +EGT  +D T K+ ++C E   GE  +E+  +   D E    H
Sbjct: 360  ESAPSHKNASGYPGVATAEGTNNKDLTTKNADSCREHNDGETTNEVP-EIPLDGEGGELH 418

Query: 725  RKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGL 546
            +K R+++LGS  +FNF+N KGE+ +K+ I+ EWW+NEKV+ KEL P+N WSF+PMLQ G 
Sbjct: 419  QKQRTVSLGSSKDFNFNNAKGEIPEKSSINCEWWTNEKVVRKELGPRNSWSFFPMLQSGA 478

Query: 545  S 543
            S
Sbjct: 479  S 479


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
            gi|462415503|gb|EMJ20240.1| hypothetical protein
            PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  549 bits (1414), Expect = e-153
 Identities = 293/478 (61%), Positives = 332/478 (69%), Gaps = 49/478 (10%)
 Frame = -2

Query: 1832 SRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAPVSDNLNHSTTI 1653
            +R QPTTV KRRWGSCWSLYWCFG  K+ KRIGHAVL PEP VPGAA    DN   ST I
Sbjct: 23   ARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLVPEPVVPGAAVSAIDNQTTSTAI 81

Query: 1652 V-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYAYETQL 1476
            V PFI       SFL SDPPS TQSPAG LSL SLS  AYSPGGPASIF+IGPYAYETQL
Sbjct: 82   VVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSANAYSPGGPASIFSIGPYAYETQL 141

Query: 1475 VSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTNQKFSL 1296
            VSPPVFST  TEPSTA FTPPPE VQLTTPSSP+VPFAQLLTSSL R RRNSGTNQKF+L
Sbjct: 142  VSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRNRRNSGTNQKFAL 201

Query: 1295 SQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGYEHFYV 1116
            S YEFQPYQ YP SP  +LISPGSA+SNSGTSSPF D+ P++E RMGEAPK+FG++HF  
Sbjct: 202  SHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDRHPVLEFRMGEAPKLFGFDHFTT 261

Query: 1115 RKWGSR----------------LGSGSLTPNGGE-------------------------- 1062
            RKWGSR                LGSGSLTP+G E                          
Sbjct: 262  RKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSRLGSGCVTPNGAGIGSRLGSGCL 321

Query: 1061 ------PASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDR 900
                  PASRDS LLE+QISEVASLANSE+G Q  E V DHRVSFELTG++V  C+    
Sbjct: 322  TPDGPGPASRDSFLLENQISEVASLANSESGCQTVETVFDHRVSFELTGEDVACCLANKA 381

Query: 899  LSSLETVSESLRDIASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRK 720
            ++S  T S S + IASE  +ERDA +  + N  E  V E+   +    SG+ E++  +RK
Sbjct: 382  VASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVEESSSRIPENVSGEGEDQ-GYRK 440

Query: 719  HRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGL 546
            HRSITLGS  +FNFDNTK EV +K  I SEWW+N+ V  KE  P NDW+F+P+LQPG+
Sbjct: 441  HRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNVAAKESKPCNDWTFFPILQPGV 498


>gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]
          Length = 465

 Score =  548 bits (1413), Expect = e-153
 Identities = 285/465 (61%), Positives = 331/465 (71%), Gaps = 35/465 (7%)
 Frame = -2

Query: 1832 SRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAPVSDNLNHSTTI 1653
            SRVQPTTVQK+RWGSCWS YWCFGS K SKRIGHAVL PEP VPGA+   ++N ++ T I
Sbjct: 23   SRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGASVSTAENASNPTGI 82

Query: 1652 V-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYAYETQL 1476
            V PFI       SFLQSDPPS TQSPAGLLSLT+LSV AYSP GPASIF+IGPYA+ETQL
Sbjct: 83   VMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFSIGPYAHETQL 142

Query: 1475 VSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTNQKFSL 1296
            V+PPVFS LTTEPSTA FTPPPE VQLTTPSSP+VPFAQLLTSSL R RRNSG NQKF L
Sbjct: 143  VTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGL 202

Query: 1295 SQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGYEHFYV 1116
            S YEFQ YQ+YP SP  +LISPGS ISNSGTSSPF D+RPI+E RMGEAPK  G+EHF  
Sbjct: 203  SHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTT 262

Query: 1115 RKWGSRLGSGSLTPNG-GE-------------------------------PASRDSLLLE 1032
            RKWGSRLGSGSLTP+G G+                               PASRD   +E
Sbjct: 263  RKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIE 322

Query: 1031 SQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDRLSSLETVS--ESLRDI 858
            SQ SEVA L+N  NG +NDEI++DHRVSFEL+G++V  C++   L S  T+   E  +D+
Sbjct: 323  SQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPKDL 382

Query: 857  ASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRKHRSITLGSINEFNF 678
             ++G  E+D                       K SG+AEE+ C++KHRS+TLGSI EFNF
Sbjct: 383  VAQGRIEKDE----------------------KVSGEAEEDHCYQKHRSVTLGSIKEFNF 420

Query: 677  DNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGLS 543
            DN KGE  +K  + SEWW+NEKV GKE  P N+W+F+PMLQP +S
Sbjct: 421  DNRKGEASEKPTVRSEWWANEKVAGKEARPGNNWTFFPMLQPEVS 465


>ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765522 [Gossypium raimondii]
            gi|763785675|gb|KJB52746.1| hypothetical protein
            B456_008G275500 [Gossypium raimondii]
          Length = 465

 Score =  548 bits (1412), Expect = e-153
 Identities = 286/465 (61%), Positives = 329/465 (70%), Gaps = 35/465 (7%)
 Frame = -2

Query: 1832 SRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAPVSDNLNHSTTI 1653
            SRVQPTTVQK+RWGSCWS YWCFGS K SKRIGHAVL PEP VPGA    ++N ++ T I
Sbjct: 23   SRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGALVSTAENASNPTGI 82

Query: 1652 V-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYAYETQL 1476
            V PFI       SFLQSDPPS TQSPAGLLSLT+LSV AYSP GPASIFAIGPYA+ETQL
Sbjct: 83   VMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFAIGPYAHETQL 142

Query: 1475 VSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTNQKFSL 1296
            V+PPVFS LTTEPSTA FTPPPE VQLTTPSSP+VPFAQLLTSSL R RRNSG NQKF L
Sbjct: 143  VTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGL 202

Query: 1295 SQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGYEHFYV 1116
            S YEFQ YQ+YP SP  +LISPGS ISNSGTSSPF D+RPI+E RMGEAPK  G+EHF  
Sbjct: 203  SHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTT 262

Query: 1115 RKWGSRLGSGSLTPNG-GE-------------------------------PASRDSLLLE 1032
            RKWGSRLGSGSLTP+G G+                               PASRD   +E
Sbjct: 263  RKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIE 322

Query: 1031 SQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDRLSSLETVS--ESLRDI 858
            SQ SEVA L+N  NG +NDEI++DHRVSFEL+G++V  C++   L S  T+   E   D+
Sbjct: 323  SQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPNDL 382

Query: 857  ASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRKHRSITLGSINEFNF 678
             ++G  E+D                       K SG+AEE+ C++KHRS+TLGSI EFNF
Sbjct: 383  VAQGRIEKDE----------------------KVSGEAEEDHCYQKHRSVTLGSIKEFNF 420

Query: 677  DNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGLS 543
            DN KGE  +K  + SEWW+NEKV GKE  P N+W+F+PMLQP +S
Sbjct: 421  DNRKGEASEKPTVRSEWWANEKVAGKEARPGNNWTFFPMLQPEVS 465


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  545 bits (1404), Expect = e-152
 Identities = 294/479 (61%), Positives = 332/479 (69%), Gaps = 49/479 (10%)
 Frame = -2

Query: 1832 SRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAPVSDNLNHSTTI 1653
            SRVQPTTVQKRRWG CWSLYWCFGS K +KRIGHAVLAPEP V GA    ++N + ST I
Sbjct: 37   SRVQPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTAI 95

Query: 1652 -VPFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYAYETQL 1476
             VPFI       SFLQSDPPS TQSPAGLLSLTSLSV AYSPGGPASIFAIGPYA+ETQL
Sbjct: 96   TVPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQL 155

Query: 1475 VSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTNQKFSL 1296
            V+PP FS  TTEPSTA FTPPPE VQLTTPSSP+VPFAQLLTSSL R RRNSGTNQKF+L
Sbjct: 156  VTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFAL 215

Query: 1295 SQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGYEHFYV 1116
            S YEFQ Y LYP SP   LISPGS ISNSGTSSPF D+ PI+E RMGEAPK+ G+EHF  
Sbjct: 216  SHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTT 275

Query: 1115 RKWGSRLGS----------------GSLTPNG-GE------------------------- 1062
            RKWGSRLGS                G++TP+G G+                         
Sbjct: 276  RKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSL 335

Query: 1061 ------PASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDR 900
                  PASRD   LE+QISEVASLANSENGS+ DE ++DHRVSFEL+G+ V  C+E   
Sbjct: 336  TPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKS 395

Query: 899  LSSLETVSESLRDIASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRK 720
            L+S    SE   D  +E   +      + EN      GE   E   K SG+ EEE C+RK
Sbjct: 396  LASCRAFSECPPDSMAEDQIKSGKMLMTDENLP---TGETSGETPEKPSGEMEEEHCYRK 452

Query: 719  HRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGLS 543
            HRSITLGSI EFNFDN+K EV DK  I+SEWW+NE + GKE  P N+W+F+P+LQP +S
Sbjct: 453  HRSITLGSIKEFNFDNSK-EVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium raimondii]
          Length = 464

 Score =  543 bits (1398), Expect = e-151
 Identities = 286/465 (61%), Positives = 328/465 (70%), Gaps = 35/465 (7%)
 Frame = -2

Query: 1832 SRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAPVSDNLNHSTTI 1653
            SRVQPTTVQKR WGSCWS YWCFGS K SKRIGHAVL PEP VPGA    ++N ++ T I
Sbjct: 23   SRVQPTTVQKR-WGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGALVSTAENASNPTGI 81

Query: 1652 V-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYAYETQL 1476
            V PFI       SFLQSDPPS TQSPAGLLSLT+LSV AYSP GPASIFAIGPYA+ETQL
Sbjct: 82   VMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFAIGPYAHETQL 141

Query: 1475 VSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTNQKFSL 1296
            V+PPVFS LTTEPSTA FTPPPE VQLTTPSSP+VPFAQLLTSSL R RRNSG NQKF L
Sbjct: 142  VTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGL 201

Query: 1295 SQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGYEHFYV 1116
            S YEFQ YQ+YP SP  +LISPGS ISNSGTSSPF D+RPI+E RMGEAPK  G+EHF  
Sbjct: 202  SHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTT 261

Query: 1115 RKWGSRLGSGSLTPNG-GE-------------------------------PASRDSLLLE 1032
            RKWGSRLGSGSLTP+G G+                               PASRD   +E
Sbjct: 262  RKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIE 321

Query: 1031 SQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDRLSSLETVS--ESLRDI 858
            SQ SEVA L+N  NG +NDEI++DHRVSFEL+G++V  C++   L S  T+   E   D+
Sbjct: 322  SQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPNDL 381

Query: 857  ASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRKHRSITLGSINEFNF 678
             ++G  E+D                       K SG+AEE+ C++KHRS+TLGSI EFNF
Sbjct: 382  VAQGRIEKDE----------------------KVSGEAEEDHCYQKHRSVTLGSIKEFNF 419

Query: 677  DNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGLS 543
            DN KGE  +K  + SEWW+NEKV GKE  P N+W+F+PMLQP +S
Sbjct: 420  DNRKGEASEKPTVRSEWWANEKVAGKEARPGNNWTFFPMLQPEVS 464


>ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236433 isoform X1 [Nicotiana
            sylvestris]
          Length = 470

 Score =  541 bits (1393), Expect = e-150
 Identities = 292/483 (60%), Positives = 335/483 (69%), Gaps = 31/483 (6%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            MS+V N+              ESRVQP++VQKRRWGSCWSLYWCFGS KHSKRIGHAVL 
Sbjct: 1    MSSVQNTVDTVNAAATAIVTAESRVQPSSVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEP  PG A PV++N N S TIV PFI       SFL SDPPS TQSPAGLLSL S S+ 
Sbjct: 61   PEPAAPGPAVPVTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSIN 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            AYSPGG ASIFAIGPYA+ETQLVSPPVFST TTEPSTA+FTPPPEPV +TTP SP+VPFA
Sbjct: 121  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLLTSSLAR RR SG+N KF LSQYEF PYQ  P SP S+LISPGS +SNSGTSSPF  K
Sbjct: 181  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWGSRLGSGSL-------------------------- 1080
             PIIE R GE PK  GYEHF  RKWGSR+GSGSL                          
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1079 --TPNGGEPASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEK 906
              TPNGGEP SRD  LLE+QISEVASLANS+NGS+  E VIDHRVSFELTG++VP+C EK
Sbjct: 300  TVTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREK 359

Query: 905  DRLSSLETVSESLRDIASEGTTERDATAKSTE--NCSECRVGEALDEMQGKASGDAEEEL 732
            + + S            S+ T   D  A S +    S   V E  D +  KAS   +++ 
Sbjct: 360  EPVMS-----------HSQQTLPMDVPAPSNKEMRSSSSIVEEKTDGLPEKASERGDDQ- 407

Query: 731  CHRKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQP 552
            CHRKHR+IT GS  +F+FDN K EVL+K  +  EWW+++K  GKE + QN+W+F+P+LQP
Sbjct: 408  CHRKHRNITFGSSKDFDFDNVKIEVLEKHSVDCEWWTSDKATGKESSIQNNWTFFPVLQP 467

Query: 551  GLS 543
            G+S
Sbjct: 468  GVS 470


>gb|KHF98668.1| hypothetical protein F383_11887 [Gossypium arboreum]
          Length = 489

 Score =  539 bits (1389), Expect = e-150
 Identities = 280/466 (60%), Positives = 332/466 (71%), Gaps = 40/466 (8%)
 Frame = -2

Query: 1832 SRVQPTTVQ-----------KRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAP 1686
            SRVQP+TVQ           K+RWGSCWS YWCFGS + SKRIGHAVL PE  VPG A  
Sbjct: 23   SRVQPSTVQAIQFAELVSVKKKRWGSCWSFYWCFGSHRSSKRIGHAVLVPEALVPGVAVV 82

Query: 1685 VSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIF 1509
             + N ++ T I+ PFI       SFL SDPPS TQSPAGLLSL SLSV AYSP GPASIF
Sbjct: 83   AAQNASNPTGILLPFIAPPSSPASFLLSDPPSATQSPAGLLSLASLSVNAYSPRGPASIF 142

Query: 1508 AIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTR 1329
            AIGPYA+ETQLV+PPVFS L TEPSTA FTPPPE VQ+TTPSSP+VPFA+LLTSSL R +
Sbjct: 143  AIGPYAHETQLVTPPVFSALATEPSTAPFTPPPESVQVTTPSSPEVPFAKLLTSSLERAQ 202

Query: 1328 RNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEA 1149
            RNSG NQKF LS YEFQ +Q+YP SP  +LISPGS ISNSGTSSPF D+RPI+ELR+ EA
Sbjct: 203  RNSGINQKFGLSHYEFQSHQIYPVSPGGNLISPGSVISNSGTSSPFPDRRPILELRIAEA 262

Query: 1148 PKIFGYEHFYVRKWGSRLGSGSLTPNG-GE---------------------------PAS 1053
            PKI G+EHF   KWGSRLGSGSLTP+G G+                           P+S
Sbjct: 263  PKILGFEHFTTGKWGSRLGSGSLTPDGLGQGPRLGSGCMTPDGMGLDSGSWTPDGLPPSS 322

Query: 1052 RDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDRLSSLETVSE 873
            RD  +LESQISEVA  +N+ENG +NDE ++DHRVSFEL+G+++   ++     S  T SE
Sbjct: 323  RDIFVLESQISEVALFSNTENGPKNDETIVDHRVSFELSGEDIARYLDSKSFISNRTKSE 382

Query: 872  SLRDIASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRKHRSITLGSI 693
              +D+ +EG  +RD   K  E+  +    E  +E   KASG++EEE C++KHRS+TLGSI
Sbjct: 383  CPKDLVAEGRIDRDGMKKDLESSCKLFSRETSNETVEKASGESEEEHCYQKHRSVTLGSI 442

Query: 692  NEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQ 555
             EFNFD+TKGE  DK  I SEWW+NEKV GKE+ P N+WSF+PMLQ
Sbjct: 443  KEFNFDSTKGEASDKPSIRSEWWANEKVAGKEVKPGNNWSFFPMLQ 488


>ref|XP_012486505.1| PREDICTED: uncharacterized protein LOC105800123 [Gossypium raimondii]
            gi|763770082|gb|KJB37297.1| hypothetical protein
            B456_006G198500 [Gossypium raimondii]
            gi|763770083|gb|KJB37298.1| hypothetical protein
            B456_006G198500 [Gossypium raimondii]
          Length = 478

 Score =  538 bits (1387), Expect = e-150
 Identities = 277/455 (60%), Positives = 327/455 (71%), Gaps = 29/455 (6%)
 Frame = -2

Query: 1832 SRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLAPEPTVPGAAAPVSDNLNHSTTI 1653
            SRVQP+TVQK+RWGSCWS YWCFGS + SKRIGHAVL PE  VPG A   + N ++ T I
Sbjct: 23   SRVQPSTVQKKRWGSCWSFYWCFGSHRSSKRIGHAVLVPEAVVPGVAVVAAQNASNPTGI 82

Query: 1652 V-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYAYETQL 1476
            + PFI       SFLQSDP S TQSPAGLLSL SLSV AYSP GPASIFAIGPYA+ETQL
Sbjct: 83   LLPFIAPPSSPASFLQSDPSSATQSPAGLLSLASLSVNAYSPRGPASIFAIGPYAHETQL 142

Query: 1475 VSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTNQKFSL 1296
            V+PPVFS L TEPSTA FTPPPE VQ+TTPSSP+VPFA+LLTSSL R +RNSG NQKF L
Sbjct: 143  VTPPVFSALATEPSTAPFTPPPESVQVTTPSSPEVPFAKLLTSSLERAQRNSGINQKFGL 202

Query: 1295 SQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGYEHFYV 1116
            S YEFQ +Q+ P SP  +LISPGS ISNSGTSSPF D+RPI+ELR  EAPKI G+EHF  
Sbjct: 203  SHYEFQSHQICPVSPGGNLISPGSVISNSGTSSPFPDRRPILELRKAEAPKILGFEHFTT 262

Query: 1115 RKWGSRLGSGSLTPNG-GE---------------------------PASRDSLLLESQIS 1020
             KWGSRLGSGSLTP+G G+                           P+SRD  +LESQIS
Sbjct: 263  SKWGSRLGSGSLTPDGLGQSPTLGSGCMTPDGMGLDSGSWTPDGLPPSSRDGFVLESQIS 322

Query: 1019 EVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEKDRLSSLETVSESLRDIASEGTT 840
            EVA  +N+ENG +NDE ++DHRVSFEL+G++V   ++     S  T+SE  +D+ + G  
Sbjct: 323  EVALFSNTENGPKNDETIVDHRVSFELSGEDVARYLDSKSFISNRTMSECPKDLVAGGRI 382

Query: 839  ERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCHRKHRSITLGSINEFNFDNTKGE 660
             RD   K  E+  +    E  +E   KASG++EEE C++KHRS+TLGSI EFNFD+ KGE
Sbjct: 383  YRDGMTKDLESSCKLFSRETSNETVEKASGESEEEHCYQKHRSVTLGSIKEFNFDSAKGE 442

Query: 659  VLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQ 555
              D   I SEWW+NEKV GKE+ P N+WSF+PMLQ
Sbjct: 443  ASDNPSIRSEWWANEKVAGKEVKPGNNWSFFPMLQ 477


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 isoform X2 [Solanum
            lycopersicum]
          Length = 470

 Score =  538 bits (1387), Expect = e-150
 Identities = 295/486 (60%), Positives = 336/486 (69%), Gaps = 34/486 (6%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            MS+V N+              ESRVQP+TVQKRRWGSCWSLYWCFGS KHSKRIGHAVL 
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEP  PG A PV++N NHS TIV PFI       SFL SDPPS TQSPAGLLSL +LS+ 
Sbjct: 61   PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSIN 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            AYSPGG ASIFAIGPYA+ETQLVSPPVFST TTEPSTA+FTPPPEPV +TTP SP+VPFA
Sbjct: 121  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLLTSSLAR RR SG+N KF LSQYEF PYQ  P SP S+LISPGS +SNSGTSSPF  K
Sbjct: 181  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWG----------------------------SRLGSG 1086
             PIIE R GE PK  GYEHF  RKWG                            SRLGSG
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1085 SLTPNGGEPASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEK 906
            ++TPNGGEP SRDS LLE+QISEVASLANS+NGS+  E VIDHRVSFELT ++VP+C EK
Sbjct: 300  TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREK 359

Query: 905  DRLSSLETVSESLRDIASEGTTERDATAKSTENCSECRVGEALDEMQ-----GKASGDAE 741
            + + S            S+ T   D    S    SE R G ++ E +      KAS   E
Sbjct: 360  EPVMS-----------HSQPTLPMDV---SNLLASEMRSGSSMAEEKTYGSPRKASESGE 405

Query: 740  EELCHRKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPM 561
            +E CHRKHR+IT GS  +F+FDN K EVL+K  I  EWW+++K   KE   QN+W+F+P+
Sbjct: 406  DE-CHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPV 464

Query: 560  LQPGLS 543
            LQPG+S
Sbjct: 465  LQPGVS 470


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  538 bits (1386), Expect = e-150
 Identities = 292/485 (60%), Positives = 336/485 (69%), Gaps = 33/485 (6%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            MS+V N+              ESRVQP+TVQKRRWGSCWSLYWCFGS KHSKRIGHAVL 
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEP  PG A PV++N NHS TIV PFI       SFL SDPPS TQSPAGLLSL SLS+ 
Sbjct: 61   PEPAAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSIN 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            AYSPGG ASIFAIGPYA+ETQLVSPPVFST TTEPSTA+FTPPPE V +TTP SP+VPFA
Sbjct: 121  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLLTSSLAR RR SG+N KF LSQYEF PYQ  P SP S+LISPGS +SNSGTSSPF  K
Sbjct: 181  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWGSRLGSGSL-------------------------- 1080
             PIIE R GE PK  GYEHF  RKWGSR+GSGSL                          
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1079 --TPNGGEPASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEK 906
              TPNGGEP SRDS LLE QISEVASLANS+NGS+  E VIDHRVSFELTG++VP+C EK
Sbjct: 300  TVTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREK 359

Query: 905  DRLSSLETVSESLRDIASEGTTERDATAKSTENCSECRVGEALDEMQ----GKASGDAEE 738
            + + S            S+ T   D    S    +E + G ++ E +     + + ++ E
Sbjct: 360  EPVMS-----------HSQQTLPMDV---SNLLANEMKSGSSMAEEKTYGSPRKASESGE 405

Query: 737  ELCHRKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPML 558
            + CHRKHR+IT GS  +F+FDN K EVL+K  I  EWW+++K  GKE   QN+W+F+P+L
Sbjct: 406  DQCHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVL 465

Query: 557  QPGLS 543
            QPG+S
Sbjct: 466  QPGVS 470


>ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116142 [Nicotiana
            tomentosiformis]
          Length = 470

 Score =  538 bits (1385), Expect = e-149
 Identities = 288/483 (59%), Positives = 334/483 (69%), Gaps = 31/483 (6%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            MS+V N+              ESRVQP+++QK+RWGSCWSLYWCFGS KHSKRIGHA+L 
Sbjct: 1    MSSVQNTVDTVNAAATAIITAESRVQPSSIQKKRWGSCWSLYWCFGSYKHSKRIGHAILV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEP  PG A PV++N N S TIV PFI       SFL SDPPS TQSPAGLLSL S S+ 
Sbjct: 61   PEPAAPGPAVPVTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSIN 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            AYSPGG ASIFAIGPYA+ETQLVSPPVFST TTEPSTA+FTPPPEPV +TTP SP+VPFA
Sbjct: 121  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLLTSSLAR RR SG+N KF LSQYEF PYQ  P SP S LISPGS +SNSGTSSPF  K
Sbjct: 181  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSSLISPGSVVSNSGTSSPFPGK 239

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWGSRLGSGSL-------------------------- 1080
             PIIE R GE PK  GYEHF  RKWGSR+GSGSL                          
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1079 --TPNGGEPASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEK 906
              TPNGGEP SRD  LLE+QISEVASLANS+NGS+  E VIDHRVSFELTG++VP+C EK
Sbjct: 300  TVTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREK 359

Query: 905  DRLSSLETVSESLRDIASEGTTERDATAKSTE--NCSECRVGEALDEMQGKASGDAEEEL 732
            + + S            S+ T   D  A S +    S   V E  D +  KAS   +++ 
Sbjct: 360  EPVMS-----------HSQQTLPMDVPAPSNKEMRSSSSNVEEKTDGLPEKASERGDDQ- 407

Query: 731  CHRKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQP 552
            CHRKHR+IT GS  +F+FDN K EVL++  +  EWW+++K  GKE + QN+W+F+P+LQP
Sbjct: 408  CHRKHRNITFGSSKDFDFDNVKIEVLEEDSVDCEWWTSDKATGKESSIQNNWTFFPVLQP 467

Query: 551  GLS 543
            G+S
Sbjct: 468  GVS 470


>ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina]
            gi|557523850|gb|ESR35217.1| hypothetical protein
            CICLE_v10004813mg [Citrus clementina]
          Length = 500

 Score =  536 bits (1380), Expect = e-149
 Identities = 288/497 (57%), Positives = 330/497 (66%), Gaps = 49/497 (9%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            MS+VH+S              ESR++P  +QKRRWGSCWSLYWCFGS K SKRI HAVL 
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEP V GAAAP ++   HST IV PFI       SFLQSDPPS TQSPAGLLSL SLSV 
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVN 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            AYSPGGPAS+FAIGPYA+ETQLV+PPVFS  TTEPSTA  TPPPE VQLTTPSSP+VPFA
Sbjct: 121  AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLLTSSL R RRNSGTNQK SLS Y +QPYQLYP SP   LISPGS +S SGTSSPF D+
Sbjct: 181  QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWGSRLGS----------------------------- 1089
             PI++     APK+ G+EHF  RKWGSRLGS                             
Sbjct: 241  HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300

Query: 1088 -------------------GSLTPNGGEPASRDSLLLESQISEVASLANSENGSQNDEIV 966
                               GSLTP+G  P SRD  + E+QISEVASLANS+NG+++DE +
Sbjct: 301  LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360

Query: 965  IDHRVSFELTGDNVPTCMEKDRLSSLETVSESLRDIASEGTTERDATAKSTENCSECRVG 786
            IDHRVSFEL+G+ V  C+     +S   V E  +DI  EG   RD     +EN  E    
Sbjct: 361  IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPE 420

Query: 785  EALDEMQGKASGDAEEELCHRKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVL 606
            E+ + M  K   D EEE C+RKHRSITLGSI EFNFDNT+GEV +K  I+SEWW+NE V 
Sbjct: 421  ESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENV- 479

Query: 605  GKELTPQNDWSFYPMLQ 555
            GKE  P N+W+F+PMLQ
Sbjct: 480  GKESKPSNNWTFFPMLQ 496


>emb|CDP05166.1| unnamed protein product [Coffea canephora]
          Length = 452

 Score =  533 bits (1373), Expect = e-148
 Identities = 286/481 (59%), Positives = 327/481 (67%), Gaps = 29/481 (6%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            MS+VHNS              ESRVQP TVQKRRWGSCWS YWCFGS K+SKRIG+AVL 
Sbjct: 1    MSSVHNSVETVNAAATAIVTAESRVQPPTVQKRRWGSCWSFYWCFGSVKNSKRIGNAVLV 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEPTVPG+A PV DNLNHS TIV PFI       SFLQSDPPS TQSPA  L L S SV 
Sbjct: 61   PEPTVPGSAVPVPDNLNHSATIVIPFIAPPSSPASFLQSDPPSATQSPAKFLPLASFSVN 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
             YSP G ASIFAIGPYA+ETQLVSPPVFS  TTEPSTASFTPPPEPVQLTTPSSP+VPFA
Sbjct: 121  TYSPSGAASIFAIGPYAHETQLVSPPVFSAFTTEPSTASFTPPPEPVQLTTPSSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLL SSL   RR+SGT+ KF LSQYEFQPYQ  P SP SHLISPGSAISNSGTSSPF +K
Sbjct: 181  QLLVSSLTHNRRHSGTSIKFPLSQYEFQPYQC-PGSPGSHLISPGSAISNSGTSSPFPEK 239

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWG----------------------------SRLGSG 1086
            RPIIE R+GEAPK  GYE  + RKWG                            SRLGSG
Sbjct: 240  RPIIEFRIGEAPKFLGYE-LFTRKWGSRVGSGSLTPNGWGSRLGSGSLTPNGGISRLGSG 298

Query: 1085 SLTPNGGEPASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVPTCMEK 906
            +LTPNGGEPA+RDS LLE+QISEVASLANS+NG+ N+E ++DHRVSFELT ++VP C+E+
Sbjct: 299  TLTPNGGEPAARDSYLLENQISEVASLANSDNGTHNEEGLMDHRVSFELTAEHVPNCVEE 358

Query: 905  DRLSSLETVSESLRDIASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAEEELCH 726
            +                           K    C +C  G+++  +  KA    E + C 
Sbjct: 359  E--------------------------MKGQNFCEDC-TGDSIHNITRKALDGQEGKQCL 391

Query: 725  RKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPMLQPGL 546
            + +R+ +LGS  +FNFDN K E  DK+ I  EWW+NE    KEL  +N W+F+PMLQPG+
Sbjct: 392  KNNRTFSLGSSKDFNFDNMKQESPDKSTIDCEWWTNETAAAKELGSKNKWTFFPMLQPGV 451

Query: 545  S 543
            S
Sbjct: 452  S 452


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  533 bits (1372), Expect = e-148
 Identities = 289/486 (59%), Positives = 331/486 (68%), Gaps = 56/486 (11%)
 Frame = -2

Query: 1832 SRVQPTT--VQKRRWGSCWSLYWCFGSC---KHSKRIGHAVLAPEPTVPGAAAPVSDNLN 1668
            SRVQP++  VQKRRWG CWSLYWCFGS    K+SKRIGHAVL PEP VPGA +  ++N  
Sbjct: 24   SRVQPSSSSVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQT 83

Query: 1667 HSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVKAYSPGGPASIFAIGPYA 1491
             ST I+ PFI       SFLQSDPPS+TQSPAGLLSLTSLS  AYSP GPASIFAIGPYA
Sbjct: 84   QSTPILLPFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYA 143

Query: 1490 YETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFAQLLTSSLARTRRNSGTN 1311
            +ETQLV+PPVFS  TTEPSTA FTPPPE VQLTTPSSP+VPFAQLLTSSL R RRNSG N
Sbjct: 144  HETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPN 203

Query: 1310 QKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADKRPIIELRMGEAPKIFGY 1131
            QKFSLS YEFQ Y LYP SP   +ISPGSAISNSGTSSPF D+ P++E RMGEAPK+ G+
Sbjct: 204  QKFSLSHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLGF 263

Query: 1130 EHFYVRKWGSRLGSGS-------------------LTPNG-------------------- 1068
            EHF  RKWGSRLGSGS                   +TP+G                    
Sbjct: 264  EHFSTRKWGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGLRS 323

Query: 1067 -----------GEPASRDSLLLESQISEVASLANSENGSQNDEIVIDHRVSFELTGDNVP 921
                         PAS+   LLE+QISEVASL NSENGS+ +E V+ HRVSFEL+G+ V 
Sbjct: 324  RLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEEVA 383

Query: 920  TCMEKDRLSSLETVSESLRDIASEGTTERDATAKSTENCSECRVGEALDEMQGKASGDAE 741
             C+E   ++S  T  E  +D   E     D  A + E C   + GEA  EM  K S + E
Sbjct: 384  RCLEIKSVASTRTFPEYPQDTMPEDPVRGDRLAMNGERC--LQNGEASSEMPEKNSEETE 441

Query: 740  EELCHRKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVLGKELTPQNDWSFYPM 561
            E+  +RKHRSITLGSI EFNFDN+KGEV DK  ISSEWW+NE + GKE  P N W+F+P+
Sbjct: 442  EDHVYRKHRSITLGSIKEFNFDNSKGEVSDKPAISSEWWANETIAGKEARPANSWTFFPL 501

Query: 560  LQPGLS 543
            LQP +S
Sbjct: 502  LQPEVS 507


>gb|KDO51973.1| hypothetical protein CISIN_1g010808mg [Citrus sinensis]
          Length = 500

 Score =  532 bits (1371), Expect = e-148
 Identities = 287/497 (57%), Positives = 329/497 (66%), Gaps = 49/497 (9%)
 Frame = -2

Query: 1898 MSTVHNSXXXXXXXXXXXXXXESRVQPTTVQKRRWGSCWSLYWCFGSCKHSKRIGHAVLA 1719
            MS+VH+S              ESR++P  +QKRRWGSCWSLYWCFGS K SKRI HAVL 
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60

Query: 1718 PEPTVPGAAAPVSDNLNHSTTIV-PFIXXXXXXXSFLQSDPPSTTQSPAGLLSLTSLSVK 1542
            PEP V GAAAP ++   HST IV PFI       SFLQSDP S TQSPAGLL L SLSV 
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLCLNSLSVN 120

Query: 1541 AYSPGGPASIFAIGPYAYETQLVSPPVFSTLTTEPSTASFTPPPEPVQLTTPSSPDVPFA 1362
            AYSPGGPAS+FAIGPYA+ETQLV+PPVFS  TTEPSTA  TPPPE VQLTTPSSP+VPFA
Sbjct: 121  AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180

Query: 1361 QLLTSSLARTRRNSGTNQKFSLSQYEFQPYQLYPESPVSHLISPGSAISNSGTSSPFADK 1182
            QLLTSSL R RRNSGTNQK SLS Y +QPYQLYP SP   LISPGS +S SGTSSPF D+
Sbjct: 181  QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240

Query: 1181 RPIIELRMGEAPKIFGYEHFYVRKWGSRLGS----------------------------- 1089
            RPI++     APK+ G+EHF  RKWGSRLGS                             
Sbjct: 241  RPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300

Query: 1088 -------------------GSLTPNGGEPASRDSLLLESQISEVASLANSENGSQNDEIV 966
                               GSLTP+G  P SRD  + E+QISEVASLANS+NG+++DE +
Sbjct: 301  LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360

Query: 965  IDHRVSFELTGDNVPTCMEKDRLSSLETVSESLRDIASEGTTERDATAKSTENCSECRVG 786
            IDHRVSFEL+G+ V  C+     +S   V E  +DI  EG   RD     +EN  E    
Sbjct: 361  IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPE 420

Query: 785  EALDEMQGKASGDAEEELCHRKHRSITLGSINEFNFDNTKGEVLDKAPISSEWWSNEKVL 606
            E+ + M  K   D EEE C+RKHRSITLGSI EFNFDNT+GEV +K  I+SEWW+NE V 
Sbjct: 421  ESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENV- 479

Query: 605  GKELTPQNDWSFYPMLQ 555
            GKE  P N+W+F+PMLQ
Sbjct: 480  GKESKPSNNWTFFPMLQ 496


Top