BLASTX nr result

ID: Aconitum23_contig00014121 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Aconitum23_contig00014121
         (1506 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007044415.1| CW14 protein isoform 1 [Theobroma cacao] gi|...   607   e-170
ref|XP_007044416.1| CW14 protein isoform 2 [Theobroma cacao] gi|...   603   e-170
ref|XP_010272173.1| PREDICTED: uncharacterized protein LOC104608...   602   e-169
ref|XP_002315697.2| hypothetical protein POPTR_0010s04930g [Popu...   587   e-164
ref|XP_011008099.1| PREDICTED: uncharacterized protein LOC105113...   586   e-164
ref|XP_011008098.1| PREDICTED: uncharacterized protein LOC105113...   586   e-164
ref|XP_008221712.1| PREDICTED: uncharacterized protein LOC103321...   586   e-164
ref|XP_010255733.1| PREDICTED: uncharacterized protein LOC104596...   583   e-163
ref|XP_006483281.1| PREDICTED: uncharacterized protein LOC102624...   583   e-163
ref|XP_006438539.1| hypothetical protein CICLE_v10031173mg [Citr...   583   e-163
gb|KDO82688.1| hypothetical protein CISIN_1g009225mg [Citrus sin...   582   e-163
gb|KDO82687.1| hypothetical protein CISIN_1g009225mg [Citrus sin...   582   e-163
ref|XP_007223080.1| hypothetical protein PRUPE_ppa003760mg [Prun...   580   e-162
ref|XP_007223079.1| hypothetical protein PRUPE_ppa003760mg [Prun...   580   e-162
ref|XP_002520174.1| conserved hypothetical protein [Ricinus comm...   578   e-162
ref|XP_012458817.1| PREDICTED: uncharacterized protein LOC105779...   577   e-162
ref|XP_012091987.1| PREDICTED: uncharacterized protein LOC105649...   576   e-161
ref|XP_012091985.1| PREDICTED: uncharacterized protein LOC105649...   576   e-161
ref|XP_012091986.1| PREDICTED: uncharacterized protein LOC105649...   576   e-161
gb|KJB77107.1| hypothetical protein B456_012G120400 [Gossypium r...   572   e-160

>ref|XP_007044415.1| CW14 protein isoform 1 [Theobroma cacao] gi|508708350|gb|EOY00247.1|
            CW14 protein isoform 1 [Theobroma cacao]
          Length = 541

 Score =  607 bits (1564), Expect = e-170
 Identities = 297/425 (69%), Positives = 341/425 (80%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            N GEH++ V      +Q    G    GNSA +SV E  RN+N++V +S D + QSK+DG 
Sbjct: 114  NCGEHSSLV------DQMQKPGDLSAGNSACNSVGEVTRNSNSQVLNSEDVNSQSKSDGP 167

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
             N AK P               + EG+L+NCGILP+ CLPCLASTVPS++KRRSLS  PP
Sbjct: 168  SNKAKQPVFLDDIASSVDEGSGKEEGLLDNCGILPSNCLPCLASTVPSIEKRRSLSSSPP 227

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
            S RKK ALKL FKWREGH N++L S+KM+LQRP AGSQVP+C IE KMFDCW  I+P TF
Sbjct: 228  SARKKNALKLPFKWREGHPNATLFSSKMLLQRPKAGSQVPVCPIEKKMFDCWSHIEPGTF 287

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVRGENY RDKKK+FA N AAYYPFGVDVFLSPRKIDHIA+FV+LP+ +  GKLP IL+V
Sbjct: 288  KVRGENYFRDKKKDFAPNHAAYYPFGVDVFLSPRKIDHIARFVELPVVSQSGKLPSILVV 347

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQ+PLYPA  FQ+E DGEG++FVLYF+LS+SY KEL PHF EN+RR+I DEVEKVKGFP
Sbjct: 348  NVQIPLYPAALFQSETDGEGMSFVLYFKLSDSYLKELPPHFQENIRRLIVDEVEKVKGFP 407

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DTI+P R+RLKILGRV NVEDL +SA ERKLMHAYNEKP LSRPQHEFYLGENY EIDI
Sbjct: 408  VDTIVPFRERLKILGRVANVEDLHMSAAERKLMHAYNEKPFLSRPQHEFYLGENYFEIDI 467

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGF+ F DRLKLCILDVGLTIQGNKPEELPEQILCC+RL+GIDY NYHQL
Sbjct: 468  DMHRFSYISRKGFDAFLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLSGIDYMNYHQL 527

Query: 1361 ATSQD 1375
              SQ+
Sbjct: 528  GLSQE 532


>ref|XP_007044416.1| CW14 protein isoform 2 [Theobroma cacao] gi|508708351|gb|EOY00248.1|
            CW14 protein isoform 2 [Theobroma cacao]
          Length = 511

 Score =  603 bits (1556), Expect = e-170
 Identities = 292/410 (71%), Positives = 334/410 (81%)
 Frame = +2

Query: 146  EQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGSFNDAKHPXXXXXXXX 325
            +Q    G    GNSA +SV E  RN+N++V +S D + QSK+DG  N AK P        
Sbjct: 93   DQMQKPGDLSAGNSACNSVGEVTRNSNSQVLNSEDVNSQSKSDGPSNKAKQPVFLDDIAS 152

Query: 326  XXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPPSTRKKAALKLSFKWR 505
                   + EG+L+NCGILP+ CLPCLASTVPS++KRRSLS  PPS RKK ALKL FKWR
Sbjct: 153  SVDEGSGKEEGLLDNCGILPSNCLPCLASTVPSIEKRRSLSSSPPSARKKNALKLPFKWR 212

Query: 506  EGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTFKVRGENYMRDKKKEF 685
            EGH N++L S+KM+LQRP AGSQVP+C IE KMFDCW  I+P TFKVRGENY RDKKK+F
Sbjct: 213  EGHPNATLFSSKMLLQRPKAGSQVPVCPIEKKMFDCWSHIEPGTFKVRGENYFRDKKKDF 272

Query: 686  ASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIVNVQVPLYPATFFQNE 865
            A N AAYYPFGVDVFLSPRKIDHIA+FV+LP+ +  GKLP IL+VNVQ+PLYPA  FQ+E
Sbjct: 273  APNHAAYYPFGVDVFLSPRKIDHIARFVELPVVSQSGKLPSILVVNVQIPLYPAALFQSE 332

Query: 866  FDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFPMDTILPVRDRLKILG 1045
             DGEG++FVLYF+LS+SY KEL PHF EN+RR+I DEVEKVKGFP+DTI+P R+RLKILG
Sbjct: 333  TDGEGMSFVLYFKLSDSYLKELPPHFQENIRRLIVDEVEKVKGFPVDTIVPFRERLKILG 392

Query: 1046 RVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDIDMHRFGYISRKGFEV 1225
            RV NVEDL +SA ERKLMHAYNEKP LSRPQHEFYLGENY EIDIDMHRF YISRKGF+ 
Sbjct: 393  RVANVEDLHMSAAERKLMHAYNEKPFLSRPQHEFYLGENYFEIDIDMHRFSYISRKGFDA 452

Query: 1226 FHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQLATSQD 1375
            F DRLKLCILDVGLTIQGNKPEELPEQILCC+RL+GIDY NYHQL  SQ+
Sbjct: 453  FLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLSGIDYMNYHQLGLSQE 502


>ref|XP_010272173.1| PREDICTED: uncharacterized protein LOC104608027 [Nelumbo nucifera]
          Length = 523

 Score =  602 bits (1551), Expect = e-169
 Identities = 299/427 (70%), Positives = 338/427 (79%), Gaps = 2/427 (0%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NH + N N PCIP                 R+S+SEA +N N+R  HS+ AD Q K+DG 
Sbjct: 112  NHEDCNGNFPCIP----------------PRNSLSEATKNENSRTLHSNSADSQLKSDGP 155

Query: 281  FNDAKHPXXXXXXXXXXXXEIAERE--GVLENCGILPNACLPCLASTVPSLDKRRSLSPG 454
             N+ K P            EI  R   G+L++CGILPN CLPCLASTV S +KRRSLSPG
Sbjct: 156  INEGKRPVSIDEVSSISVDEITGRGEGGILDHCGILPNTCLPCLASTV-SDEKRRSLSPG 214

Query: 455  PPSTRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPS 634
            PPSTR+KA+LKLSFKWREGH N +L S+KM+LQRP AGSQVP C I+ KM DCW  +DPS
Sbjct: 215  PPSTRRKASLKLSFKWREGHGNPTLLSSKMLLQRPRAGSQVPFCPIDKKMSDCWSNVDPS 274

Query: 635  TFKVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPIL 814
            TFKVRGENY RDKKKEFA N AAYYPFGVDVFLSPRKIDHIA+FVDLP T A GKLPPIL
Sbjct: 275  TFKVRGENYFRDKKKEFAPNHAAYYPFGVDVFLSPRKIDHIARFVDLPSTTASGKLPPIL 334

Query: 815  IVNVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKG 994
            +VN+QVPLYPAT FQNE DGEG++FVLYFRLSE+YS+EL PHF EN+RR+IDDEVEK+KG
Sbjct: 335  VVNIQVPLYPATLFQNETDGEGMSFVLYFRLSENYSRELTPHFQENIRRLIDDEVEKIKG 394

Query: 995  FPMDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEI 1174
            FP+DTI+P R+RLKILGR+ NVEDL L A ERKLMHAYNEKPVLSRPQHEFY GENY EI
Sbjct: 395  FPVDTIVPFRERLKILGRMVNVEDLHLGATERKLMHAYNEKPVLSRPQHEFYSGENYFEI 454

Query: 1175 DIDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYH 1354
            D+DMHRF YISRKGFE F DRLKLCILD GLTIQGNK E+LPEQILCCVRLN I+Y +Y+
Sbjct: 455  DLDMHRFSYISRKGFEQFQDRLKLCILDFGLTIQGNKAEDLPEQILCCVRLNEINYTSYN 514

Query: 1355 QLATSQD 1375
            QL   Q+
Sbjct: 515  QLGLGQE 521


>ref|XP_002315697.2| hypothetical protein POPTR_0010s04930g [Populus trichocarpa]
            gi|550329086|gb|EEF01868.2| hypothetical protein
            POPTR_0010s04930g [Populus trichocarpa]
          Length = 551

 Score =  587 bits (1512), Expect = e-164
 Identities = 286/425 (67%), Positives = 329/425 (77%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NHG+ N N+      +Q    G    GNS   SVSEA   TN  VF+    D  SK+DG 
Sbjct: 122  NHGDCNVNMQHSSFTDQMQKAGDLSAGNSTHDSVSEATEQTNIHVFNLDHVDSVSKSDGP 181

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
             N+ K P            E A  EG+L+NCGILP  CLPCLASTVP ++KRRSLS  PP
Sbjct: 182  SNEVKQPVFLDEITSAD--ENAGEEGLLDNCGILPGNCLPCLASTVPPVEKRRSLSSSPP 239

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
            S RKK ALKL FKW+EG+++++L S+KMIL RPIAGSQVP C +E KM DCW  I+P +F
Sbjct: 240  SARKKGALKLPFKWKEGNSSNTLFSSKMILHRPIAGSQVPFCPMEKKMLDCWSHIEPCSF 299

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVRG++Y RDKKKEFA NC+AYYPFGVDVFLSPRK+DHIA+FVDLPI N+ G  P IL+V
Sbjct: 300  KVRGQSYFRDKKKEFAPNCSAYYPFGVDVFLSPRKVDHIARFVDLPIINSAGNFPTILVV 359

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQVPLYPA  FQ+E DGEG NFVLYF+LS+SYSKEL  HF E++RR+IDDEVEKVKGFP
Sbjct: 360  NVQVPLYPAAIFQSESDGEGTNFVLYFKLSDSYSKELPTHFQESIRRLIDDEVEKVKGFP 419

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DTI   R+RLKILGRV NVEDL LSA ERKLM AYNEKPVLSRPQHEFYLG+NY EIDI
Sbjct: 420  VDTIASFRERLKILGRVVNVEDLHLSAAERKLMQAYNEKPVLSRPQHEFYLGDNYFEIDI 479

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGF+ F DRLK+C+LD+GLTIQGNK EELPEQILCC+RLNGIDY  YHQL
Sbjct: 480  DMHRFSYISRKGFQAFLDRLKICVLDIGLTIQGNKVEELPEQILCCIRLNGIDYMKYHQL 539

Query: 1361 ATSQD 1375
              +Q+
Sbjct: 540  GLNQE 544


>ref|XP_011008099.1| PREDICTED: uncharacterized protein LOC105113572 isoform X2 [Populus
            euphratica]
          Length = 551

 Score =  586 bits (1511), Expect = e-164
 Identities = 283/425 (66%), Positives = 329/425 (77%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NHG+ N N+      +Q H  G    GNS   SVSEA + TN  +F+    D  SK+DG 
Sbjct: 122  NHGDCNVNMQHSSFTDQMHKAGDLSAGNSTHDSVSEATKQTNIHIFNLDHVDSVSKSDGP 181

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
             N+ K P            E A  EG+L+NCGILP  CLPCLASTVP ++KRRSLS  PP
Sbjct: 182  SNEVKQPVFLDEITSAD--ENAGEEGLLDNCGILPGNCLPCLASTVPPVEKRRSLSSSPP 239

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
            S RKK ALKL FKW+EG++ ++L S+KMIL RPIAGSQVP C +E KM DCW  I+P +F
Sbjct: 240  SARKKGALKLPFKWKEGNSTNALFSSKMILHRPIAGSQVPFCPMEKKMLDCWSHIEPCSF 299

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVRG++Y RDKKKEFA +C+AYYPFGVDVFLSPRK+DHIA+FVDLPI N+ G  PPIL+V
Sbjct: 300  KVRGQSYFRDKKKEFAPDCSAYYPFGVDVFLSPRKVDHIARFVDLPIINSAGNFPPILVV 359

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQVPLYPA  FQ+E DGEG NFVLYF+LS+SY KEL  HF E++RR+IDDEVEKVKGFP
Sbjct: 360  NVQVPLYPAAIFQSESDGEGTNFVLYFKLSDSYLKELPAHFQESIRRLIDDEVEKVKGFP 419

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DTI P R+RLKILGRV NVEDL LSA E+ LM  YNEKPVLSRPQHEFYLG+NY EIDI
Sbjct: 420  VDTIAPFRERLKILGRVVNVEDLHLSAAEKNLMQDYNEKPVLSRPQHEFYLGDNYFEIDI 479

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGF+ F DRLK+C+LD+GLTIQGNK EELPEQILCC+RLNGIDY  YHQL
Sbjct: 480  DMHRFSYISRKGFQAFLDRLKICVLDIGLTIQGNKVEELPEQILCCIRLNGIDYMKYHQL 539

Query: 1361 ATSQD 1375
              +Q+
Sbjct: 540  GLNQE 544


>ref|XP_011008098.1| PREDICTED: uncharacterized protein LOC105113572 isoform X1 [Populus
            euphratica]
          Length = 552

 Score =  586 bits (1511), Expect = e-164
 Identities = 283/425 (66%), Positives = 329/425 (77%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NHG+ N N+      +Q H  G    GNS   SVSEA + TN  +F+    D  SK+DG 
Sbjct: 123  NHGDCNVNMQHSSFTDQMHKAGDLSAGNSTHDSVSEATKQTNIHIFNLDHVDSVSKSDGP 182

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
             N+ K P            E A  EG+L+NCGILP  CLPCLASTVP ++KRRSLS  PP
Sbjct: 183  SNEVKQPVFLDEITSAD--ENAGEEGLLDNCGILPGNCLPCLASTVPPVEKRRSLSSSPP 240

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
            S RKK ALKL FKW+EG++ ++L S+KMIL RPIAGSQVP C +E KM DCW  I+P +F
Sbjct: 241  SARKKGALKLPFKWKEGNSTNALFSSKMILHRPIAGSQVPFCPMEKKMLDCWSHIEPCSF 300

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVRG++Y RDKKKEFA +C+AYYPFGVDVFLSPRK+DHIA+FVDLPI N+ G  PPIL+V
Sbjct: 301  KVRGQSYFRDKKKEFAPDCSAYYPFGVDVFLSPRKVDHIARFVDLPIINSAGNFPPILVV 360

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQVPLYPA  FQ+E DGEG NFVLYF+LS+SY KEL  HF E++RR+IDDEVEKVKGFP
Sbjct: 361  NVQVPLYPAAIFQSESDGEGTNFVLYFKLSDSYLKELPAHFQESIRRLIDDEVEKVKGFP 420

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DTI P R+RLKILGRV NVEDL LSA E+ LM  YNEKPVLSRPQHEFYLG+NY EIDI
Sbjct: 421  VDTIAPFRERLKILGRVVNVEDLHLSAAEKNLMQDYNEKPVLSRPQHEFYLGDNYFEIDI 480

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGF+ F DRLK+C+LD+GLTIQGNK EELPEQILCC+RLNGIDY  YHQL
Sbjct: 481  DMHRFSYISRKGFQAFLDRLKICVLDIGLTIQGNKVEELPEQILCCIRLNGIDYMKYHQL 540

Query: 1361 ATSQD 1375
              +Q+
Sbjct: 541  GLNQE 545


>ref|XP_008221712.1| PREDICTED: uncharacterized protein LOC103321659 [Prunus mume]
          Length = 550

 Score =  586 bits (1510), Expect = e-164
 Identities = 293/428 (68%), Positives = 337/428 (78%), Gaps = 1/428 (0%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSK-NDG 277
            N GE+N+N       ++ H  G     NSA +SV+  ++ +N ++ + +D D Q+K ND 
Sbjct: 124  NCGEYNDNGLHTSSTDRMHKPGDLSTENSANNSVTVVSQRSNVQIMNVNDVDTQTKFNDH 183

Query: 278  SFNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGP 457
            S  +A  P              A+ EG+L+NCGILP+ CLPCLASTVPS++KRRSL   P
Sbjct: 184  SVIEANEPVFLDEISSSVDETSAKEEGMLDNCGILPSTCLPCLASTVPSVEKRRSLISSP 243

Query: 458  PSTRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPST 637
            PS RKKAALKL FKW+E   N+SL S+KM+LQRPIAGSQVP C IE KMFD W  I+P+T
Sbjct: 244  PSARKKAALKLPFKWKE-QANASLFSSKMLLQRPIAGSQVPFCPIEKKMFDSWSHIEPNT 302

Query: 638  FKVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILI 817
            FKVRG NY RDKKKEFA + AAYYPFG+DVFLS RKIDHIA+FV+LPI N+ G LP IL+
Sbjct: 303  FKVRGPNYFRDKKKEFAPSYAAYYPFGLDVFLSQRKIDHIARFVELPIVNSSGDLPAILV 362

Query: 818  VNVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGF 997
            VNVQVPLYPA  FQ E DGEG+NFVLYF+LS+ YSKEL P+F EN+RR+I DEVEKVKGF
Sbjct: 363  VNVQVPLYPAAIFQGETDGEGMNFVLYFKLSDIYSKELPPNFQENIRRLIGDEVEKVKGF 422

Query: 998  PMDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEID 1177
            P+DTI P R+RLKILGRV NVEDL LSAPERKLM AYNEKPVLSRPQHEFYLGENYLEID
Sbjct: 423  PVDTIAPFRERLKILGRVVNVEDLHLSAPERKLMQAYNEKPVLSRPQHEFYLGENYLEID 482

Query: 1178 IDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQ 1357
            +DMHRF YISRKGFE F DRLKLCILDVGLTIQGNKPEELPEQILCC+RLNGIDY NYHQ
Sbjct: 483  LDMHRFSYISRKGFEAFLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLNGIDYMNYHQ 542

Query: 1358 LATSQDAL 1381
            L  +QD L
Sbjct: 543  LGLTQDPL 550


>ref|XP_010255733.1| PREDICTED: uncharacterized protein LOC104596338 isoform X1 [Nelumbo
            nucifera]
          Length = 522

 Score =  583 bits (1503), Expect = e-163
 Identities = 292/429 (68%), Positives = 331/429 (77%), Gaps = 2/429 (0%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NH   N NV CIP                   + SE A+  N+R+ HS++AD Q K+DG 
Sbjct: 111  NHEHFNGNVSCIP-----------------SQTGSEVAKKENSRIVHSNNADSQLKSDGL 153

Query: 281  FNDAKHPXXXXXXXXXXXXEIAERE--GVLENCGILPNACLPCLASTVPSLDKRRSLSPG 454
             N+ K P            EI  R   G+L++CGILPN CLPCLASTV S++KRRSLSPG
Sbjct: 154  LNEGKRPASSDGISSISVDEITARGEGGMLDHCGILPNTCLPCLASTVCSVEKRRSLSPG 213

Query: 455  PPSTRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPS 634
            PPSTR+KA+LKLSFKWREGH N +L S+KM+LQRPIAGSQVP C I+ KM DCW  ++P 
Sbjct: 214  PPSTRRKASLKLSFKWREGHGNPTLLSSKMLLQRPIAGSQVPFCPIDKKMSDCWSNLEPC 273

Query: 635  TFKVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPIL 814
            TFKVRGENY RDKKKEFA N AAY PFGVDVFLSPRKIDHIA+FVDLP  N+ GKLPPIL
Sbjct: 274  TFKVRGENYFRDKKKEFAPNHAAYCPFGVDVFLSPRKIDHIARFVDLPSINSSGKLPPIL 333

Query: 815  IVNVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKG 994
            +VNVQVPLYPAT FQ+E DG+G++FVLYFRL E+ SKEL  HF EN+RR+IDDEVEKVKG
Sbjct: 334  VVNVQVPLYPATLFQSETDGKGMSFVLYFRLLENCSKELPLHFQENIRRLIDDEVEKVKG 393

Query: 995  FPMDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEI 1174
            FP+DTI+P R+RLKILGRV NVEDL L A E+KLM AYNEKPVLSRPQHEFYLGENY EI
Sbjct: 394  FPVDTIIPFRERLKILGRVVNVEDLHLGATEKKLMQAYNEKPVLSRPQHEFYLGENYFEI 453

Query: 1175 DIDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYH 1354
            D+DMHRF YISRKGFE F DRLKLC LD GLTIQ NK EELPEQILCC+RLN IDY NYH
Sbjct: 454  DLDMHRFSYISRKGFEQFQDRLKLCTLDFGLTIQANKAEELPEQILCCIRLNEIDYTNYH 513

Query: 1355 QLATSQDAL 1381
            QL  + + L
Sbjct: 514  QLTLNLEPL 522


>ref|XP_006483281.1| PREDICTED: uncharacterized protein LOC102624792 isoform X2 [Citrus
            sinensis]
          Length = 539

 Score =  583 bits (1502), Expect = e-163
 Identities = 281/427 (65%), Positives = 335/427 (78%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NH  HN N+ C  + +Q    G    GNSA +SVS+  +N+++RV +S +   QSK+DG 
Sbjct: 113  NHRGHNVNIQCTSLTDQLQRPGGLSAGNSAHNSVSDVGKNSSSRVANSENVHSQSKSDGP 172

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
              + K P               + EG+L+NCGI+P+ CLPCLASTVPS++KRRS S  PP
Sbjct: 173  SYEGKQPVFLDEISSSVDEGSGKDEGLLDNCGIIPSNCLPCLASTVPSVEKRRSGSSSPP 232

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
               KK A KLSFKW+EGH N++L S+KM+L RPIAG+QVP C IE KM D W  I+P+TF
Sbjct: 233  RPFKKTASKLSFKWKEGHANATLVSSKMLLSRPIAGAQVPFCPIEKKMLDSWSQIEPNTF 292

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVRG NY+RDKKKEFA NCAAYYPFGVDVFLS RKIDHIA+FV+LP  ++  KLPP+L+V
Sbjct: 293  KVRGVNYLRDKKKEFAHNCAAYYPFGVDVFLSQRKIDHIARFVELPAISSHAKLPPMLVV 352

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQ+PLYP   FQ+E DGEG++ VLYF+L+ESY+KEL  HF E++RRIIDDEVEKVKGFP
Sbjct: 353  NVQIPLYPTAIFQSETDGEGISIVLYFKLNESYAKELPLHFQESIRRIIDDEVEKVKGFP 412

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DTI+P R+RLKILGRV NVEDL LSA ERKL+ AYNEKPVLSRPQHEFYLGENYLEIDI
Sbjct: 413  VDTIVPFRERLKILGRVVNVEDLHLSAAERKLLQAYNEKPVLSRPQHEFYLGENYLEIDI 472

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGFE F DRLK+CILDVGLTIQGNK +ELPEQILCC+R+NGIDY NYHQL
Sbjct: 473  DMHRFSYISRKGFEAFLDRLKICILDVGLTIQGNKVDELPEQILCCIRINGIDYMNYHQL 532

Query: 1361 ATSQDAL 1381
               ++ L
Sbjct: 533  GHKEETL 539


>ref|XP_006438539.1| hypothetical protein CICLE_v10031173mg [Citrus clementina]
            gi|568859507|ref|XP_006483280.1| PREDICTED:
            uncharacterized protein LOC102624792 isoform X1 [Citrus
            sinensis] gi|557540735|gb|ESR51779.1| hypothetical
            protein CICLE_v10031173mg [Citrus clementina]
          Length = 540

 Score =  583 bits (1502), Expect = e-163
 Identities = 281/427 (65%), Positives = 335/427 (78%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NH  HN N+ C  + +Q    G    GNSA +SVS+  +N+++RV +S +   QSK+DG 
Sbjct: 114  NHRGHNVNIQCTSLTDQLQRPGGLSAGNSAHNSVSDVGKNSSSRVANSENVHSQSKSDGP 173

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
              + K P               + EG+L+NCGI+P+ CLPCLASTVPS++KRRS S  PP
Sbjct: 174  SYEGKQPVFLDEISSSVDEGSGKDEGLLDNCGIIPSNCLPCLASTVPSVEKRRSGSSSPP 233

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
               KK A KLSFKW+EGH N++L S+KM+L RPIAG+QVP C IE KM D W  I+P+TF
Sbjct: 234  RPFKKTASKLSFKWKEGHANATLVSSKMLLSRPIAGAQVPFCPIEKKMLDSWSQIEPNTF 293

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVRG NY+RDKKKEFA NCAAYYPFGVDVFLS RKIDHIA+FV+LP  ++  KLPP+L+V
Sbjct: 294  KVRGVNYLRDKKKEFAHNCAAYYPFGVDVFLSQRKIDHIARFVELPAISSHAKLPPMLVV 353

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQ+PLYP   FQ+E DGEG++ VLYF+L+ESY+KEL  HF E++RRIIDDEVEKVKGFP
Sbjct: 354  NVQIPLYPTAIFQSETDGEGISIVLYFKLNESYAKELPLHFQESIRRIIDDEVEKVKGFP 413

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DTI+P R+RLKILGRV NVEDL LSA ERKL+ AYNEKPVLSRPQHEFYLGENYLEIDI
Sbjct: 414  VDTIVPFRERLKILGRVVNVEDLHLSAAERKLLQAYNEKPVLSRPQHEFYLGENYLEIDI 473

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGFE F DRLK+CILDVGLTIQGNK +ELPEQILCC+R+NGIDY NYHQL
Sbjct: 474  DMHRFSYISRKGFEAFLDRLKICILDVGLTIQGNKVDELPEQILCCIRINGIDYMNYHQL 533

Query: 1361 ATSQDAL 1381
               ++ L
Sbjct: 534  GHKEETL 540


>gb|KDO82688.1| hypothetical protein CISIN_1g009225mg [Citrus sinensis]
          Length = 539

 Score =  582 bits (1501), Expect = e-163
 Identities = 280/427 (65%), Positives = 335/427 (78%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NH  HN N+ C  + +Q    G    GNSA +SVS+  +N+++RV +S +   QSK+DG 
Sbjct: 113  NHRGHNVNIQCTSLTDQLQRPGGLSAGNSAHNSVSDVGKNSSSRVANSENVHSQSKSDGP 172

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
              + K P               + EG+L+NCGI+P+ CLPCLASTVPS++KRRS S  PP
Sbjct: 173  SYEGKQPVFLDEISSSVDEGSGKDEGLLDNCGIIPSNCLPCLASTVPSVEKRRSGSSSPP 232

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
               KK A KLSFKW+EGH N++L S+KM+L RPI+G+QVP C IE KM D W  I+P+TF
Sbjct: 233  RPFKKTASKLSFKWKEGHANATLVSSKMLLSRPISGAQVPFCPIEKKMLDSWSQIEPNTF 292

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVRG NY+RDKKKEFA NCAAYYPFGVDVFLS RKIDHIA+FV+LP  ++  KLPP+L+V
Sbjct: 293  KVRGVNYLRDKKKEFAHNCAAYYPFGVDVFLSQRKIDHIARFVELPAISSHAKLPPMLVV 352

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQ+PLYP   FQ+E DGEG++ VLYF+L+ESY+KEL  HF E++RRIIDDEVEKVKGFP
Sbjct: 353  NVQIPLYPTAIFQSEIDGEGISIVLYFKLNESYAKELPLHFQESIRRIIDDEVEKVKGFP 412

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DTI+P R+RLKILGRV NVEDL LSA ERKL+ AYNEKPVLSRPQHEFYLGENYLEIDI
Sbjct: 413  VDTIVPFRERLKILGRVVNVEDLHLSAAERKLLQAYNEKPVLSRPQHEFYLGENYLEIDI 472

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGFE F DRLK+CILDVGLTIQGNK +ELPEQILCC+R+NGIDY NYHQL
Sbjct: 473  DMHRFSYISRKGFEAFLDRLKICILDVGLTIQGNKVDELPEQILCCIRINGIDYMNYHQL 532

Query: 1361 ATSQDAL 1381
               ++ L
Sbjct: 533  GHKEETL 539


>gb|KDO82687.1| hypothetical protein CISIN_1g009225mg [Citrus sinensis]
          Length = 540

 Score =  582 bits (1501), Expect = e-163
 Identities = 280/427 (65%), Positives = 335/427 (78%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            NH  HN N+ C  + +Q    G    GNSA +SVS+  +N+++RV +S +   QSK+DG 
Sbjct: 114  NHRGHNVNIQCTSLTDQLQRPGGLSAGNSAHNSVSDVGKNSSSRVANSENVHSQSKSDGP 173

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
              + K P               + EG+L+NCGI+P+ CLPCLASTVPS++KRRS S  PP
Sbjct: 174  SYEGKQPVFLDEISSSVDEGSGKDEGLLDNCGIIPSNCLPCLASTVPSVEKRRSGSSSPP 233

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
               KK A KLSFKW+EGH N++L S+KM+L RPI+G+QVP C IE KM D W  I+P+TF
Sbjct: 234  RPFKKTASKLSFKWKEGHANATLVSSKMLLSRPISGAQVPFCPIEKKMLDSWSQIEPNTF 293

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVRG NY+RDKKKEFA NCAAYYPFGVDVFLS RKIDHIA+FV+LP  ++  KLPP+L+V
Sbjct: 294  KVRGVNYLRDKKKEFAHNCAAYYPFGVDVFLSQRKIDHIARFVELPAISSHAKLPPMLVV 353

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQ+PLYP   FQ+E DGEG++ VLYF+L+ESY+KEL  HF E++RRIIDDEVEKVKGFP
Sbjct: 354  NVQIPLYPTAIFQSEIDGEGISIVLYFKLNESYAKELPLHFQESIRRIIDDEVEKVKGFP 413

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DTI+P R+RLKILGRV NVEDL LSA ERKL+ AYNEKPVLSRPQHEFYLGENYLEIDI
Sbjct: 414  VDTIVPFRERLKILGRVVNVEDLHLSAAERKLLQAYNEKPVLSRPQHEFYLGENYLEIDI 473

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGFE F DRLK+CILDVGLTIQGNK +ELPEQILCC+R+NGIDY NYHQL
Sbjct: 474  DMHRFSYISRKGFEAFLDRLKICILDVGLTIQGNKVDELPEQILCCIRINGIDYMNYHQL 533

Query: 1361 ATSQDAL 1381
               ++ L
Sbjct: 534  GHKEETL 540


>ref|XP_007223080.1| hypothetical protein PRUPE_ppa003760mg [Prunus persica]
            gi|462420016|gb|EMJ24279.1| hypothetical protein
            PRUPE_ppa003760mg [Prunus persica]
          Length = 547

 Score =  580 bits (1494), Expect = e-162
 Identities = 290/428 (67%), Positives = 335/428 (78%), Gaps = 1/428 (0%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSK-NDG 277
            N GE+N+N       ++ H  G     NSA +SV+  ++ +N ++ + +D D Q+K ND 
Sbjct: 121  NCGEYNDNGLHTSSTDRMHKPGDLSTENSASNSVTVVSQRSNVQIMNVNDVDTQTKFNDH 180

Query: 278  SFNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGP 457
            S  +A  P              A+ EG+L+NCGILP+ CLPCLASTVPS++KRRSL   P
Sbjct: 181  SVIEANEPVFLDEISSSVDETSAKEEGILDNCGILPSTCLPCLASTVPSVEKRRSLISSP 240

Query: 458  PSTRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPST 637
            PS RKKAALKL FKW+E   N++L S+K +LQRPIAGSQVP C IE KMFD W  I+P+T
Sbjct: 241  PSARKKAALKLPFKWKE-QANATLFSSKKLLQRPIAGSQVPFCPIEKKMFDSWSHIEPNT 299

Query: 638  FKVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILI 817
            FKVRG NY RDKKKEFA + AAYYPFG+DVFLS RKIDHIA+FV+LPI N+ G LP IL+
Sbjct: 300  FKVRGPNYFRDKKKEFAPSYAAYYPFGLDVFLSQRKIDHIARFVELPIVNSSGDLPAILV 359

Query: 818  VNVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGF 997
            VNVQVPLYPA  FQ E DGEG+NFVLYF+LS+ YSKEL  +F EN+RR+I DEVEKVKGF
Sbjct: 360  VNVQVPLYPAAIFQGETDGEGMNFVLYFKLSDIYSKELPSNFQENIRRLIGDEVEKVKGF 419

Query: 998  PMDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEID 1177
            P+DTI P R+RLKILGRV NVEDL LSAPERKLM AYNEKPVLSRPQHEFYLGENYLEID
Sbjct: 420  PVDTIAPFRERLKILGRVVNVEDLHLSAPERKLMQAYNEKPVLSRPQHEFYLGENYLEID 479

Query: 1178 IDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQ 1357
            +DMHRF YISRKGFE F DRLKLCILDVGLTIQGNKPEELPEQILCC+RLNGIDY NYHQ
Sbjct: 480  LDMHRFSYISRKGFEAFLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLNGIDYMNYHQ 539

Query: 1358 LATSQDAL 1381
            L  +QD L
Sbjct: 540  LGLTQDPL 547


>ref|XP_007223079.1| hypothetical protein PRUPE_ppa003760mg [Prunus persica]
            gi|462420015|gb|EMJ24278.1| hypothetical protein
            PRUPE_ppa003760mg [Prunus persica]
          Length = 550

 Score =  580 bits (1494), Expect = e-162
 Identities = 290/428 (67%), Positives = 335/428 (78%), Gaps = 1/428 (0%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSK-NDG 277
            N GE+N+N       ++ H  G     NSA +SV+  ++ +N ++ + +D D Q+K ND 
Sbjct: 124  NCGEYNDNGLHTSSTDRMHKPGDLSTENSASNSVTVVSQRSNVQIMNVNDVDTQTKFNDH 183

Query: 278  SFNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGP 457
            S  +A  P              A+ EG+L+NCGILP+ CLPCLASTVPS++KRRSL   P
Sbjct: 184  SVIEANEPVFLDEISSSVDETSAKEEGILDNCGILPSTCLPCLASTVPSVEKRRSLISSP 243

Query: 458  PSTRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPST 637
            PS RKKAALKL FKW+E   N++L S+K +LQRPIAGSQVP C IE KMFD W  I+P+T
Sbjct: 244  PSARKKAALKLPFKWKE-QANATLFSSKKLLQRPIAGSQVPFCPIEKKMFDSWSHIEPNT 302

Query: 638  FKVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILI 817
            FKVRG NY RDKKKEFA + AAYYPFG+DVFLS RKIDHIA+FV+LPI N+ G LP IL+
Sbjct: 303  FKVRGPNYFRDKKKEFAPSYAAYYPFGLDVFLSQRKIDHIARFVELPIVNSSGDLPAILV 362

Query: 818  VNVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGF 997
            VNVQVPLYPA  FQ E DGEG+NFVLYF+LS+ YSKEL  +F EN+RR+I DEVEKVKGF
Sbjct: 363  VNVQVPLYPAAIFQGETDGEGMNFVLYFKLSDIYSKELPSNFQENIRRLIGDEVEKVKGF 422

Query: 998  PMDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEID 1177
            P+DTI P R+RLKILGRV NVEDL LSAPERKLM AYNEKPVLSRPQHEFYLGENYLEID
Sbjct: 423  PVDTIAPFRERLKILGRVVNVEDLHLSAPERKLMQAYNEKPVLSRPQHEFYLGENYLEID 482

Query: 1178 IDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQ 1357
            +DMHRF YISRKGFE F DRLKLCILDVGLTIQGNKPEELPEQILCC+RLNGIDY NYHQ
Sbjct: 483  LDMHRFSYISRKGFEAFLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLNGIDYMNYHQ 542

Query: 1358 LATSQDAL 1381
            L  +QD L
Sbjct: 543  LGLTQDPL 550


>ref|XP_002520174.1| conserved hypothetical protein [Ricinus communis]
            gi|223540666|gb|EEF42229.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 512

 Score =  578 bits (1489), Expect = e-162
 Identities = 288/411 (70%), Positives = 323/411 (78%)
 Frame = +2

Query: 143  NEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGSFNDAKHPXXXXXXX 322
            ++Q    G    GNSAR+SVSEA                 SK DG  N+AK P       
Sbjct: 110  HDQMKKAGDLSAGNSARNSVSEAP---------------VSKFDGPSNEAKQPVFLDEIA 154

Query: 323  XXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPPSTRKKAALKLSFKW 502
                    + EG+LENCGILP  CLPCLASTV  ++KRRSLS  PPS RKKAALKLSFKW
Sbjct: 155  SSADENAGKEEGLLENCGILPGNCLPCLASTVSQVEKRRSLSSSPPSARKKAALKLSFKW 214

Query: 503  REGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTFKVRGENYMRDKKKE 682
            +EGH N+SL S+K ILQRPIAGSQVP C ++ KM DCW  I+P +FKVRG+NY+RDKKKE
Sbjct: 215  KEGHANNSLFSSKPILQRPIAGSQVPFCPMDKKMLDCWSHIEPGSFKVRGQNYLRDKKKE 274

Query: 683  FASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIVNVQVPLYPATFFQN 862
            FA   AAYYPFGVDVFLSPRKIDHIA+FV+LP+ N+ GKLP IL+VNVQ+PLY A  FQ+
Sbjct: 275  FAPAHAAYYPFGVDVFLSPRKIDHIARFVELPVINSSGKLPTILVVNVQIPLYTAALFQS 334

Query: 863  EFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFPMDTILPVRDRLKIL 1042
            E DGEG+NFVLYF+LSESYSKEL  HF E++RRIIDDEVEKVKGFP+DTI+P R+RLKIL
Sbjct: 335  EVDGEGMNFVLYFKLSESYSKELPAHFQESIRRIIDDEVEKVKGFPVDTIVPYRERLKIL 394

Query: 1043 GRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDIDMHRFGYISRKGFE 1222
            GRV NV+DL LS+ ERKLM AYNEKPVLSRPQHEFYLGENY EIDIDMHRF YISRKGFE
Sbjct: 395  GRVVNVDDLHLSSAERKLMQAYNEKPVLSRPQHEFYLGENYFEIDIDMHRFSYISRKGFE 454

Query: 1223 VFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQLATSQD 1375
             F DRLK+CILDVGLTIQGNK EELPEQILCCVRLNGIDY NYHQL  +QD
Sbjct: 455  AFLDRLKICILDVGLTIQGNKAEELPEQILCCVRLNGIDYMNYHQLGLNQD 505


>ref|XP_012458817.1| PREDICTED: uncharacterized protein LOC105779560 isoform X1 [Gossypium
            raimondii] gi|763810208|gb|KJB77110.1| hypothetical
            protein B456_012G120400 [Gossypium raimondii]
          Length = 533

 Score =  577 bits (1487), Expect = e-162
 Identities = 285/425 (67%), Positives = 330/425 (77%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            N+GEH++ V      +Q    G    GNSA +S SEAAR +N+++  S D + Q K DG+
Sbjct: 107  NYGEHSSLV------DQMQKPGGLSTGNSACNSASEAARISNSQILCSKDVNPQLKYDGA 160

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
             N+ K P               +  G+L+NCGILP+ CLPCLASTV S++KRRSLS  PP
Sbjct: 161  SNEVKQPVFLDDIASSAGEGPGKEVGLLDNCGILPSNCLPCLASTVSSVEKRRSLSSSPP 220

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
            S RKK ALKL FKW+EGH N++L S+K +LQRP AGSQVP C  E +MFDCW  I+P TF
Sbjct: 221  SARKKNALKLPFKWKEGHPNAALFSSKRLLQRPKAGSQVPFCPTEKRMFDCWSHIEPGTF 280

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVR ENY RDKKK+FA N AAYYPFGVDVFLSPRKIDHIA+FV+LP+    GKLP IL+V
Sbjct: 281  KVRSENYFRDKKKDFAHNHAAYYPFGVDVFLSPRKIDHIARFVELPVVGHSGKLPSILVV 340

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFP 1000
            NVQ+PLYP   F +E DGEG+NFVLYF+LS+SY KEL PHF EN+RRIIDD VEKVKGFP
Sbjct: 341  NVQIPLYPPALFHSEIDGEGMNFVLYFKLSDSYLKELPPHFQENIRRIIDDGVEKVKGFP 400

Query: 1001 MDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEIDI 1180
            +DT +P R+RLKILGRV NVEDL +SA ERKLM AYNEKPVLSRPQHEFY GENY EIDI
Sbjct: 401  VDTNVPFRERLKILGRVANVEDLHMSAAERKLMQAYNEKPVLSRPQHEFYSGENYFEIDI 460

Query: 1181 DMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQL 1360
            DMHRF YISRKGF+ F DRLK CILDVGLTIQGNKPEELPEQILCCVRL+GIDY NYHQL
Sbjct: 461  DMHRFSYISRKGFDAFLDRLKFCILDVGLTIQGNKPEELPEQILCCVRLSGIDYMNYHQL 520

Query: 1361 ATSQD 1375
            + +Q+
Sbjct: 521  SLNQE 525


>ref|XP_012091987.1| PREDICTED: uncharacterized protein LOC105649804 isoform X3 [Jatropha
            curcas]
          Length = 488

 Score =  576 bits (1484), Expect = e-161
 Identities = 287/426 (67%), Positives = 331/426 (77%), Gaps = 2/426 (0%)
 Frame = +2

Query: 104  HGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGSF 283
            HG+H          +     G    GNSAR+SVSEAAR+ N +VF+S  AD   K++G  
Sbjct: 60   HGDHTIGFQYTSSGDHMKKAGDSSAGNSARNSVSEAARHPNNQVFNSDYADSLPKSEGP- 118

Query: 284  NDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPPS 463
                 P               + EG+L+NCGILP  CLPCLASTVP ++KRRSLS  PPS
Sbjct: 119  ---SQPVFLDEIASSVDENGGKGEGLLDNCGILPANCLPCLASTVPPVEKRRSLSSSPPS 175

Query: 464  TRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTFK 643
             RKKAALKLSFKW+EGH N++L S+K ILQRPIAGSQVP C I+ KM DCW  I+PS+FK
Sbjct: 176  ARKKAALKLSFKWKEGHPNNALFSSKPILQRPIAGSQVPFCPIDKKMLDCWSHIEPSSFK 235

Query: 644  VRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIVN 823
            VRG+NY RDKKKEFA N AAYYPFGVDVFLSPRK+DHIA+FV+LP  N+ GKLP IL+VN
Sbjct: 236  VRGQNYFRDKKKEFAPNYAAYYPFGVDVFLSPRKVDHIARFVELPAVNSSGKLPNILVVN 295

Query: 824  VQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFPM 1003
            VQ+PLY A FFQ+E DGEG++FVLYF+LSESYSKE+   F E++RR+IDDEVEKVKGFP+
Sbjct: 296  VQIPLYNAAFFQSEIDGEGMSFVLYFKLSESYSKEVPTLFQESIRRLIDDEVEKVKGFPV 355

Query: 1004 DTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLG--ENYLEID 1177
            DTI+P R+RLKILGRV N+EDL LSA ERKLM AYNEKPVLSRPQHEFYLG  E Y EID
Sbjct: 356  DTIVPFRERLKILGRVVNIEDLHLSAAERKLMQAYNEKPVLSRPQHEFYLGERETYFEID 415

Query: 1178 IDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQ 1357
            IDMHRF YISRKGFE F DRLK+C+LDVGLTIQGNK EELPEQ+LCCVRLNGIDY NY Q
Sbjct: 416  IDMHRFSYISRKGFEAFLDRLKICVLDVGLTIQGNKVEELPEQVLCCVRLNGIDYMNYRQ 475

Query: 1358 LATSQD 1375
            L  +Q+
Sbjct: 476  LGLNQE 481


>ref|XP_012091985.1| PREDICTED: uncharacterized protein LOC105649804 isoform X1 [Jatropha
            curcas]
          Length = 553

 Score =  576 bits (1484), Expect = e-161
 Identities = 287/426 (67%), Positives = 331/426 (77%), Gaps = 2/426 (0%)
 Frame = +2

Query: 104  HGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGSF 283
            HG+H          +     G    GNSAR+SVSEAAR+ N +VF+S  AD   K++G  
Sbjct: 125  HGDHTIGFQYTSSGDHMKKAGDSSAGNSARNSVSEAARHPNNQVFNSDYADSLPKSEGP- 183

Query: 284  NDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPPS 463
                 P               + EG+L+NCGILP  CLPCLASTVP ++KRRSLS  PPS
Sbjct: 184  ---SQPVFLDEIASSVDENGGKGEGLLDNCGILPANCLPCLASTVPPVEKRRSLSSSPPS 240

Query: 464  TRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTFK 643
             RKKAALKLSFKW+EGH N++L S+K ILQRPIAGSQVP C I+ KM DCW  I+PS+FK
Sbjct: 241  ARKKAALKLSFKWKEGHPNNALFSSKPILQRPIAGSQVPFCPIDKKMLDCWSHIEPSSFK 300

Query: 644  VRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIVN 823
            VRG+NY RDKKKEFA N AAYYPFGVDVFLSPRK+DHIA+FV+LP  N+ GKLP IL+VN
Sbjct: 301  VRGQNYFRDKKKEFAPNYAAYYPFGVDVFLSPRKVDHIARFVELPAVNSSGKLPNILVVN 360

Query: 824  VQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFPM 1003
            VQ+PLY A FFQ+E DGEG++FVLYF+LSESYSKE+   F E++RR+IDDEVEKVKGFP+
Sbjct: 361  VQIPLYNAAFFQSEIDGEGMSFVLYFKLSESYSKEVPTLFQESIRRLIDDEVEKVKGFPV 420

Query: 1004 DTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLG--ENYLEID 1177
            DTI+P R+RLKILGRV N+EDL LSA ERKLM AYNEKPVLSRPQHEFYLG  E Y EID
Sbjct: 421  DTIVPFRERLKILGRVVNIEDLHLSAAERKLMQAYNEKPVLSRPQHEFYLGERETYFEID 480

Query: 1178 IDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQ 1357
            IDMHRF YISRKGFE F DRLK+C+LDVGLTIQGNK EELPEQ+LCCVRLNGIDY NY Q
Sbjct: 481  IDMHRFSYISRKGFEAFLDRLKICVLDVGLTIQGNKVEELPEQVLCCVRLNGIDYMNYRQ 540

Query: 1358 LATSQD 1375
            L  +Q+
Sbjct: 541  LGLNQE 546


>ref|XP_012091986.1| PREDICTED: uncharacterized protein LOC105649804 isoform X2 [Jatropha
            curcas] gi|643704194|gb|KDP21258.1| hypothetical protein
            JCGZ_21729 [Jatropha curcas]
          Length = 552

 Score =  576 bits (1484), Expect = e-161
 Identities = 287/426 (67%), Positives = 331/426 (77%), Gaps = 2/426 (0%)
 Frame = +2

Query: 104  HGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGSF 283
            HG+H          +     G    GNSAR+SVSEAAR+ N +VF+S  AD   K++G  
Sbjct: 124  HGDHTIGFQYTSSGDHMKKAGDSSAGNSARNSVSEAARHPNNQVFNSDYADSLPKSEGP- 182

Query: 284  NDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPPS 463
                 P               + EG+L+NCGILP  CLPCLASTVP ++KRRSLS  PPS
Sbjct: 183  ---SQPVFLDEIASSVDENGGKGEGLLDNCGILPANCLPCLASTVPPVEKRRSLSSSPPS 239

Query: 464  TRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTFK 643
             RKKAALKLSFKW+EGH N++L S+K ILQRPIAGSQVP C I+ KM DCW  I+PS+FK
Sbjct: 240  ARKKAALKLSFKWKEGHPNNALFSSKPILQRPIAGSQVPFCPIDKKMLDCWSHIEPSSFK 299

Query: 644  VRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIVN 823
            VRG+NY RDKKKEFA N AAYYPFGVDVFLSPRK+DHIA+FV+LP  N+ GKLP IL+VN
Sbjct: 300  VRGQNYFRDKKKEFAPNYAAYYPFGVDVFLSPRKVDHIARFVELPAVNSSGKLPNILVVN 359

Query: 824  VQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLRRIIDDEVEKVKGFPM 1003
            VQ+PLY A FFQ+E DGEG++FVLYF+LSESYSKE+   F E++RR+IDDEVEKVKGFP+
Sbjct: 360  VQIPLYNAAFFQSEIDGEGMSFVLYFKLSESYSKEVPTLFQESIRRLIDDEVEKVKGFPV 419

Query: 1004 DTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLG--ENYLEID 1177
            DTI+P R+RLKILGRV N+EDL LSA ERKLM AYNEKPVLSRPQHEFYLG  E Y EID
Sbjct: 420  DTIVPFRERLKILGRVVNIEDLHLSAAERKLMQAYNEKPVLSRPQHEFYLGERETYFEID 479

Query: 1178 IDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQ 1357
            IDMHRF YISRKGFE F DRLK+C+LDVGLTIQGNK EELPEQ+LCCVRLNGIDY NY Q
Sbjct: 480  IDMHRFSYISRKGFEAFLDRLKICVLDVGLTIQGNKVEELPEQVLCCVRLNGIDYMNYRQ 539

Query: 1358 LATSQD 1375
            L  +Q+
Sbjct: 540  LGLNQE 545


>gb|KJB77107.1| hypothetical protein B456_012G120400 [Gossypium raimondii]
          Length = 534

 Score =  572 bits (1475), Expect = e-160
 Identities = 285/426 (66%), Positives = 330/426 (77%), Gaps = 1/426 (0%)
 Frame = +2

Query: 101  NHGEHNNNVPCIPINEQRHNVGQQPVGNSARSSVSEAARNTNTRVFHSSDADLQSKNDGS 280
            N+GEH++ V      +Q    G    GNSA +S SEAAR +N+++  S D + Q K DG+
Sbjct: 107  NYGEHSSLV------DQMQKPGGLSTGNSACNSASEAARISNSQILCSKDVNPQLKYDGA 160

Query: 281  FNDAKHPXXXXXXXXXXXXEIAEREGVLENCGILPNACLPCLASTVPSLDKRRSLSPGPP 460
             N+ K P               +  G+L+NCGILP+ CLPCLASTV S++KRRSLS  PP
Sbjct: 161  SNEVKQPVFLDDIASSAGEGPGKEVGLLDNCGILPSNCLPCLASTVSSVEKRRSLSSSPP 220

Query: 461  STRKKAALKLSFKWREGHTNSSLSSTKMILQRPIAGSQVPICRIENKMFDCWLPIDPSTF 640
            S RKK ALKL FKW+EGH N++L S+K +LQRP AGSQVP C  E +MFDCW  I+P TF
Sbjct: 221  SARKKNALKLPFKWKEGHPNAALFSSKRLLQRPKAGSQVPFCPTEKRMFDCWSHIEPGTF 280

Query: 641  KVRGENYMRDKKKEFASNCAAYYPFGVDVFLSPRKIDHIAKFVDLPITNAIGKLPPILIV 820
            KVR ENY RDKKK+FA N AAYYPFGVDVFLSPRKIDHIA+FV+LP+    GKLP IL+V
Sbjct: 281  KVRSENYFRDKKKDFAHNHAAYYPFGVDVFLSPRKIDHIARFVELPVVGHSGKLPSILVV 340

Query: 821  NVQVPLYPATFFQNEFDGEGVNFVLYFRLSESYSKELEPHFLENLR-RIIDDEVEKVKGF 997
            NVQ+PLYP   F +E DGEG+NFVLYF+LS+SY KEL PHF EN+R RIIDD VEKVKGF
Sbjct: 341  NVQIPLYPPALFHSEIDGEGMNFVLYFKLSDSYLKELPPHFQENIRVRIIDDGVEKVKGF 400

Query: 998  PMDTILPVRDRLKILGRVENVEDLGLSAPERKLMHAYNEKPVLSRPQHEFYLGENYLEID 1177
            P+DT +P R+RLKILGRV NVEDL +SA ERKLM AYNEKPVLSRPQHEFY GENY EID
Sbjct: 401  PVDTNVPFRERLKILGRVANVEDLHMSAAERKLMQAYNEKPVLSRPQHEFYSGENYFEID 460

Query: 1178 IDMHRFGYISRKGFEVFHDRLKLCILDVGLTIQGNKPEELPEQILCCVRLNGIDYKNYHQ 1357
            IDMHRF YISRKGF+ F DRLK CILDVGLTIQGNKPEELPEQILCCVRL+GIDY NYHQ
Sbjct: 461  IDMHRFSYISRKGFDAFLDRLKFCILDVGLTIQGNKPEELPEQILCCVRLSGIDYMNYHQ 520

Query: 1358 LATSQD 1375
            L+ +Q+
Sbjct: 521  LSLNQE 526

