BLASTX nr result

ID: Rheum21_contig00022761 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00022761
         (667 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274305.1| PREDICTED: UPF0496 protein At3g19330 [Vitis ...   142   1e-31
emb|CAN67266.1| hypothetical protein VITISV_028729 [Vitis vinifera]   141   2e-31
ref|XP_002512821.1| conserved hypothetical protein [Ricinus comm...   120   3e-25
ref|XP_006446909.1| hypothetical protein CICLE_v10015537mg [Citr...   119   8e-25
ref|XP_006468920.1| PREDICTED: UPF0496 protein At3g19330-like [C...   118   2e-24
gb|EXB41414.1| hypothetical protein L484_007564 [Morus notabilis]     116   7e-24
ref|XP_006378820.1| hypothetical protein POPTR_0010s24650g [Popu...   110   5e-22
ref|XP_006378819.1| hypothetical protein POPTR_0010s24650g [Popu...   110   5e-22
ref|XP_006354112.1| PREDICTED: UPF0496 protein At3g19330-like is...   103   6e-20
ref|XP_006468919.1| PREDICTED: UPF0496 protein At3g19330-like is...   102   1e-19
ref|XP_006468917.1| PREDICTED: UPF0496 protein At3g19330-like is...   102   1e-19
ref|XP_004228688.1| PREDICTED: UPF0496 protein At3g19330-like is...   100   3e-19
ref|XP_002512820.1| conserved hypothetical protein [Ricinus comm...    99   9e-19
gb|EOY02976.1| Uncharacterized protein isoform 2 [Theobroma cacao]     99   2e-18
gb|EOY02975.1| Uncharacterized protein isoform 1 [Theobroma cacao]     99   2e-18
ref|XP_006446910.1| hypothetical protein CICLE_v10015613mg [Citr...    98   3e-18
ref|XP_006408570.1| hypothetical protein EUTSA_v10021033mg [Eutr...    94   5e-17
ref|XP_006299705.1| hypothetical protein CARUB_v10015896mg [Caps...    93   8e-17
ref|XP_002883173.1| hypothetical protein ARALYDRAFT_479449 [Arab...    92   1e-16
ref|NP_188556.2| uncharacterized protein [Arabidopsis thaliana] ...    89   1e-15

>ref|XP_002274305.1| PREDICTED: UPF0496 protein At3g19330 [Vitis vinifera]
           gi|297745643|emb|CBI40808.3| unnamed protein product
           [Vitis vinifera]
          Length = 376

 Score =  142 bits (357), Expect = 1e-31
 Identities = 86/209 (41%), Positives = 117/209 (55%), Gaps = 10/209 (4%)
 Frame = -3

Query: 611 VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
           +A V+ P+ E VQ  + HA SN  T+ +  YF+H+E    L L L R  HH    Y P++
Sbjct: 90  LAHVLRPDRECVQDALRHARSNTLTRLVSDYFDHSENTSQLCLLLHRSVHHAHSLYSPLH 149

Query: 431 NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNGC 252
           +L+  L  D DSLTQ QCN A D+FL F S ++PFPC  SHN  DM  CF  LKEQ++G 
Sbjct: 150 DLLDILPLDSDSLTQSQCNQAFDVFLQFDSLDNPFPCPDSHNFRDMRRCFSQLKEQLDGH 209

Query: 251 LDRSQPR--------ACTSIRFIDSTVKLXXXXXXXXXXXXXALLTGSS--FPPAPKLSQ 102
           + +S+ +        A ++  FI + V +             AL+   S  F P P+LS+
Sbjct: 210 IRKSRSKIRLICRATAGSAFCFIGTAVGVAISAVAIATHTLVALIAPLSTVFLP-PRLSK 268

Query: 101 KDLAHIALLDVAARSAYTLHNELQTIDRL 15
           K+LAH A LD AAR  Y L ++L TID L
Sbjct: 269 KELAHGAQLDAAARGTYVLCHDLGTIDSL 297


>emb|CAN67266.1| hypothetical protein VITISV_028729 [Vitis vinifera]
          Length = 996

 Score =  141 bits (355), Expect = 2e-31
 Identities = 85/209 (40%), Positives = 117/209 (55%), Gaps = 10/209 (4%)
 Frame = -3

Query: 611  VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
            +A V+ P+ E VQ  + HA SN  T+ +  YF+H+E    L L L R  HH    Y P++
Sbjct: 710  LAHVLRPDRECVQDALRHARSNTLTRLVSDYFDHSENTSQLCLLLHRSVHHAHSLYSPLH 769

Query: 431  NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNGC 252
            +L+  L  D DSLTQ QC+ A D+FL F S ++PFPC  SHN  DM  CF  LKEQ++G 
Sbjct: 770  DLLDILPLDSDSLTQSQCBQAFDVFLQFDSLDNPFPCPDSHNFRDMRRCFSQLKEQLDGH 829

Query: 251  LDRSQPR--------ACTSIRFIDSTVKLXXXXXXXXXXXXXALLTGSS--FPPAPKLSQ 102
            + +S+ +        A ++  FI + V +             AL+   S  F P P+LS+
Sbjct: 830  IRKSRSKIRLICRATAGSAFCFIGTAVGVAISAVAIATHTLVALIAPLSTVFLP-PRLSK 888

Query: 101  KDLAHIALLDVAARSAYTLHNELQTIDRL 15
            K+LAH A LD AAR  Y L ++L TID L
Sbjct: 889  KELAHGAQLDAAARGTYVLCHDLGTIDSL 917


>ref|XP_002512821.1| conserved hypothetical protein [Ricinus communis]
           gi|223547832|gb|EEF49324.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 407

 Score =  120 bits (302), Expect = 3e-25
 Identities = 84/243 (34%), Positives = 123/243 (50%), Gaps = 26/243 (10%)
 Frame = -3

Query: 665 QSKIHAVEEWENG------------DVNGFV-ATVIEPNPEDVQKIIEHATSNPFTQQLL 525
           +S+IH  +E ENG            DV  FV A V+ PN + V+  + HA  N  T+ + 
Sbjct: 89  RSRIHH-QEIENGEQIESSLNIDVEDVRQFVLAQVLHPNRQCVEDALRHAKPNTLTRLVS 147

Query: 524 TYFNHTEQICHLLLNLKRWTHHTRCRYIPINNLIKDLKGDPDSLTQEQCNSASDIFLHFQ 345
            YF+H+E    L L L R     R  Y PI NL++ L  + DSLTQ QC+ A +IF+ F 
Sbjct: 148 NYFDHSESTTDLCLLLHRSVFRARDIYSPIRNLLEVLPVEMDSLTQSQCDYAYEIFMQFD 207

Query: 344 SKESPFPCRGSHNLNDMHSCFFHLKEQVNGCLDRSQPRA--------CTSIRFIDSTVKL 189
             ++PFPC  SH    +H  F  L +Q++  L +S+ +          +++ FI S V +
Sbjct: 208 RCDNPFPCPFSHEFEGIHRSFSELSQQLDHRLRKSRSKVHLVRRATLASALCFIGSAVAI 267

Query: 188 XXXXXXXXXXXXXALL-----TGSSFPPAPKLSQKDLAHIALLDVAARSAYTLHNELQTI 24
                        A++       +S P    L++K+LAH+  LD AAR  Y L+NEL T+
Sbjct: 268 TLTALAITGHALVAIVACPFCAVTSLP--SNLTKKELAHVKQLDAAARGTYVLNNELDTV 325

Query: 23  DRL 15
           DRL
Sbjct: 326 DRL 328


>ref|XP_006446909.1| hypothetical protein CICLE_v10015537mg [Citrus clementina]
           gi|557549520|gb|ESR60149.1| hypothetical protein
           CICLE_v10015537mg [Citrus clementina]
          Length = 391

 Score =  119 bits (298), Expect = 8e-25
 Identities = 73/217 (33%), Positives = 109/217 (50%), Gaps = 14/217 (6%)
 Frame = -3

Query: 611 VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
           V+ V++PN E V + + HA  N  T+ + TYF+H+E   +L L L +  +  R  Y  + 
Sbjct: 107 VSQVLQPNRECVDEALRHARPNTLTRLVSTYFDHSENTTNLCLLLHQSIYRARELYAALY 166

Query: 431 NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNGC 252
            L      D  SL+Q QC+ A ++FL F S ++PFPC  SHN ++M  CF  LK+Q    
Sbjct: 167 ELFDIFPSDHHSLSQLQCDKAFEVFLQFDSIDNPFPCPNSHNFHEMRRCFSELKQQ---- 222

Query: 251 LDRSQPRACTSIRF-----------IDSTVKLXXXXXXXXXXXXXALLTGSSFPPA---P 114
           LDR   ++ + +RF           +  T                  +  + F  A   P
Sbjct: 223 LDRKLRKSHSRVRFFSRATSGSTLCVIGTAVAVTIAAAAVATHALVAIVAAPFCTAYFSP 282

Query: 113 KLSQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
            L++K LAH+A LD A +  Y L+N+L TIDRL  +L
Sbjct: 283 GLAKKQLAHVAQLDAAKKGIYVLNNDLDTIDRLVARL 319


>ref|XP_006468920.1| PREDICTED: UPF0496 protein At3g19330-like [Citrus sinensis]
          Length = 391

 Score =  118 bits (295), Expect = 2e-24
 Identities = 73/217 (33%), Positives = 108/217 (49%), Gaps = 14/217 (6%)
 Frame = -3

Query: 611 VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
           V+ V++PN E V + + HA  N  T+ + TYF+H+E   +L L L +  +  R  Y  + 
Sbjct: 107 VSQVLQPNRECVDEALRHARPNTLTRLVSTYFDHSENTTNLCLLLHQSIYRARELYAALY 166

Query: 431 NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNGC 252
            L      D  SL+Q QC+ A ++FL F S ++PFPC  SHN ++M  CF  LK+Q    
Sbjct: 167 ELFDIFPSDHHSLSQLQCDKAFEVFLQFDSIDNPFPCPNSHNFHEMRRCFSELKQQ---- 222

Query: 251 LDRSQPRACTSIRF-----------IDSTVKLXXXXXXXXXXXXXALLTGSSFPPA---P 114
           LDR   ++ + +RF           +  T                  +  + F  A   P
Sbjct: 223 LDRKLRKSHSRVRFFSRATSGSTLCVIGTAVAVTIAAAAVATHALVAIVAAPFCTAYFSP 282

Query: 113 KLSQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
            L +K LAH+A LD A +  Y L+N+L TIDRL  +L
Sbjct: 283 GLVKKQLAHVAQLDAAKKGIYVLNNDLDTIDRLVARL 319


>gb|EXB41414.1| hypothetical protein L484_007564 [Morus notabilis]
          Length = 364

 Score =  116 bits (290), Expect = 7e-24
 Identities = 75/224 (33%), Positives = 113/224 (50%), Gaps = 7/224 (3%)
 Frame = -3

Query: 653 HAVEEWENGDVNGFVATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLK 474
           H  ++ + G V   +  V++PN E V+  + HA  N  T+ +  YF+H+E   HL L L 
Sbjct: 60  HHHQQQQEGQVL-LLERVLQPNRECVEAALRHAKPNALTRLVSAYFDHSENTTHLYLLLH 118

Query: 473 RWTHHTRCRYIPINNLIKDLKGDPD-SLTQEQCNSASDIFLHFQSKESPFPCRGSHNLND 297
           R     R  Y P++ L+  L  D   SL+Q QC+ A ++FL F + E+PFP   S N + 
Sbjct: 119 RCLFQARALYAPLHALLGLLPSDDSPSLSQSQCDRAFNVFLQFDTAENPFPSPDSQNFDH 178

Query: 296 MHSCFFHLKEQVNGCL--DRSQPRACTSIRFIDSTVKLXXXXXXXXXXXXXALLTGSSF- 126
           +H CF  LK+Q++  L   RS  R             +             AL+ G S  
Sbjct: 179 IHGCFSQLKQQLDHRLRNSRSAVRLFCRASAGSYVCSIAITAITVSVRTLAALVAGPSCA 238

Query: 125 ---PPAPKLSQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
              PP P+L  K++A +A LD AA+  Y L+++L TI+RL  +L
Sbjct: 239 ALPPPPPRLVHKEVAQMAQLDAAAKGTYVLNSDLHTIERLVARL 282


>ref|XP_006378820.1| hypothetical protein POPTR_0010s24650g [Populus trichocarpa]
           gi|550330534|gb|ERP56617.1| hypothetical protein
           POPTR_0010s24650g [Populus trichocarpa]
          Length = 386

 Score =  110 bits (274), Expect = 5e-22
 Identities = 69/226 (30%), Positives = 117/226 (51%), Gaps = 12/226 (5%)
 Frame = -3

Query: 644 EEWENGDVNGFVAT-VIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRW 468
           + +   D N  + T V++PN E V++ +  A  N  T+ + TYF ++E    L + L++ 
Sbjct: 86  DHYNEEDANRLLMTQVLQPNRECVEEALRDAKPNTLTRLVSTYFVNSENTSQLCILLQQS 145

Query: 467 THHTRCRYIPINNLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHS 288
            +  R  Y P++ L+  +  D +SL+Q QC+ A D+FL F    +PFPC  SHN N+M  
Sbjct: 146 VYRARALYGPLHKLLDVIPTDSESLSQSQCDCAFDVFLQFDRVGNPFPCPESHNFNEMQQ 205

Query: 287 CFFHLKEQVNGCLDRSQPRA--------CTSIRFIDSTVKLXXXXXXXXXXXXXALLTG- 135
           CF  LK+Q+   + +S+ R          +++  I S V +             A+    
Sbjct: 206 CFSQLKQQLERRIRKSRSRIHLVRRATFGSALCVIGSVVAVVVSAVGIASHAFVAIAATP 265

Query: 134 -SSFPPAP-KLSQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
             + P  P +L++K+LA +  LD A+R  Y L+N+L T++R   +L
Sbjct: 266 ICTLPCLPRRLTKKELARVEQLDSASRGTYVLNNQLATLERRVARL 311


>ref|XP_006378819.1| hypothetical protein POPTR_0010s24650g [Populus trichocarpa]
           gi|550330533|gb|ERP56616.1| hypothetical protein
           POPTR_0010s24650g [Populus trichocarpa]
          Length = 317

 Score =  110 bits (274), Expect = 5e-22
 Identities = 69/226 (30%), Positives = 117/226 (51%), Gaps = 12/226 (5%)
 Frame = -3

Query: 644 EEWENGDVNGFVAT-VIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRW 468
           + +   D N  + T V++PN E V++ +  A  N  T+ + TYF ++E    L + L++ 
Sbjct: 17  DHYNEEDANRLLMTQVLQPNRECVEEALRDAKPNTLTRLVSTYFVNSENTSQLCILLQQS 76

Query: 467 THHTRCRYIPINNLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHS 288
            +  R  Y P++ L+  +  D +SL+Q QC+ A D+FL F    +PFPC  SHN N+M  
Sbjct: 77  VYRARALYGPLHKLLDVIPTDSESLSQSQCDCAFDVFLQFDRVGNPFPCPESHNFNEMQQ 136

Query: 287 CFFHLKEQVNGCLDRSQPRA--------CTSIRFIDSTVKLXXXXXXXXXXXXXALLTG- 135
           CF  LK+Q+   + +S+ R          +++  I S V +             A+    
Sbjct: 137 CFSQLKQQLERRIRKSRSRIHLVRRATFGSALCVIGSVVAVVVSAVGIASHAFVAIAATP 196

Query: 134 -SSFPPAP-KLSQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
             + P  P +L++K+LA +  LD A+R  Y L+N+L T++R   +L
Sbjct: 197 ICTLPCLPRRLTKKELARVEQLDSASRGTYVLNNQLATLERRVARL 242


>ref|XP_006354112.1| PREDICTED: UPF0496 protein At3g19330-like isoform X1 [Solanum
           tuberosum] gi|565375181|ref|XP_006354113.1| PREDICTED:
           UPF0496 protein At3g19330-like isoform X2 [Solanum
           tuberosum]
          Length = 376

 Score =  103 bits (256), Expect = 6e-20
 Identities = 65/216 (30%), Positives = 101/216 (46%), Gaps = 16/216 (7%)
 Frame = -3

Query: 602 VIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPINNLI 423
           V+EPN E VQ+ + H       Q +  Y N +EQ   L ++  +     R  Y PI  L+
Sbjct: 88  VLEPNHECVQEALLHIKPEALNQLIAKYLNDSEQTTRLCISFSQSVKQARILYAPICRLL 147

Query: 422 K----DLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNG 255
                D++    SL+Q QCN A DIFL F S  +PFP   +H+ NDM  C+F LK +++ 
Sbjct: 148 DVLPLDMESAGHSLSQAQCNWAFDIFLQFDSLNNPFPIHDTHSFNDMRHCYFQLKRELDL 207

Query: 254 CLDRSQPR------------ACTSIRFIDSTVKLXXXXXXXXXXXXXALLTGSSFPPAPK 111
            L +S+ +             C     I   +               A +  +  P   K
Sbjct: 208 LLHKSRSKVQLLRHATKGSVVCLVAATIGVVITAAVIASHALVTLVAAPICAACVP--SK 265

Query: 110 LSQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
           +++K+L H+  LDVA +  + LHN L+T++ L  +L
Sbjct: 266 MAKKELVHLVQLDVATKGIFFLHNHLETVNCLVGRL 301


>ref|XP_006468919.1| PREDICTED: UPF0496 protein At3g19330-like isoform X3 [Citrus
           sinensis]
          Length = 377

 Score =  102 bits (253), Expect = 1e-19
 Identities = 67/215 (31%), Positives = 109/215 (50%), Gaps = 12/215 (5%)
 Frame = -3

Query: 611 VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
           ++ V+ PN E V++ +     N  +    T F+H+E+  +L L L +     R  Y P+ 
Sbjct: 90  LSQVLRPNRESVKEALRLVKVNSLSDLFSTSFDHSEKTTNLCLQLLKSLFCIRTLYAPVC 149

Query: 431 NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNGC 252
            L+ +   D  S+TQ QC++A ++FL F S  +PF    S   ++MH CF  LK++++  
Sbjct: 150 ELLDNFPLDHHSVTQSQCDNAFEVFLQFDSLHNPFHSPDSRKFHEMHRCFSDLKQKLDKN 209

Query: 251 LDRSQPRAC--------TSIRFIDSTVKLXXXXXXXXXXXXXALLTG----SSFPPAPKL 108
           L +S+ R C        +S+  I + V +             A+  G    + FP A  L
Sbjct: 210 LQKSRSRVCFLQYATAGSSVCIIGTAVGVTIATVGVATHAIFAIFAGPLCTACFPCA--L 267

Query: 107 SQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
           ++K+LA+ A LD A + AY L+  L TIDRL  +L
Sbjct: 268 TKKELANAAQLDAARKGAYVLNKCLDTIDRLVARL 302


>ref|XP_006468917.1| PREDICTED: UPF0496 protein At3g19330-like isoform X1 [Citrus
           sinensis] gi|568829203|ref|XP_006468918.1| PREDICTED:
           UPF0496 protein At3g19330-like isoform X2 [Citrus
           sinensis]
          Length = 379

 Score =  102 bits (253), Expect = 1e-19
 Identities = 67/215 (31%), Positives = 109/215 (50%), Gaps = 12/215 (5%)
 Frame = -3

Query: 611 VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
           ++ V+ PN E V++ +     N  +    T F+H+E+  +L L L +     R  Y P+ 
Sbjct: 92  LSQVLRPNRESVKEALRLVKVNSLSDLFSTSFDHSEKTTNLCLQLLKSLFCIRTLYAPVC 151

Query: 431 NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNGC 252
            L+ +   D  S+TQ QC++A ++FL F S  +PF    S   ++MH CF  LK++++  
Sbjct: 152 ELLDNFPLDHHSVTQSQCDNAFEVFLQFDSLHNPFHSPDSRKFHEMHRCFSDLKQKLDKN 211

Query: 251 LDRSQPRAC--------TSIRFIDSTVKLXXXXXXXXXXXXXALLTG----SSFPPAPKL 108
           L +S+ R C        +S+  I + V +             A+  G    + FP A  L
Sbjct: 212 LQKSRSRVCFLQYATAGSSVCIIGTAVGVTIATVGVATHAIFAIFAGPLCTACFPCA--L 269

Query: 107 SQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
           ++K+LA+ A LD A + AY L+  L TIDRL  +L
Sbjct: 270 TKKELANAAQLDAARKGAYVLNKCLDTIDRLVARL 304


>ref|XP_004228688.1| PREDICTED: UPF0496 protein At3g19330-like isoform 1 [Solanum
           lycopersicum] gi|460365600|ref|XP_004228689.1|
           PREDICTED: UPF0496 protein At3g19330-like isoform 2
           [Solanum lycopersicum]
          Length = 376

 Score =  100 bits (250), Expect = 3e-19
 Identities = 61/214 (28%), Positives = 107/214 (50%), Gaps = 14/214 (6%)
 Frame = -3

Query: 602 VIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPINNLI 423
           +++PN E V++ + H       Q +  Y + +EQ   L ++  +  +  R  Y PI  L+
Sbjct: 88  ILKPNHECVEEALLHIKPEALNQLIAKYLDDSEQTARLCISFSQSVNQARRLYAPICRLL 147

Query: 422 ----KDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNG 255
               ++++    SL+Q QCN A DIFL F S  +PFP   +HN NDM  C+  LK +++ 
Sbjct: 148 DVLPQEMESTGHSLSQAQCNWAFDIFLQFDSLNNPFPIHDTHNFNDMRHCYLELKRELDL 207

Query: 254 CLDRSQPRA--------CTSIRFIDSTVKLXXXXXXXXXXXXXALLTGSSFPPA--PKLS 105
            L +S+ +          + +  + +TV +             AL+   +       K++
Sbjct: 208 LLQKSRSKVQLLRHSTKGSVVCLVAATVGVVITAVVIASHAFVALVAAPACTACIPSKMA 267

Query: 104 QKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
           +K+L H+A LDVA +  + LHN L+T++ L  +L
Sbjct: 268 KKELVHLAQLDVATKGIFFLHNHLETVNCLVGRL 301


>ref|XP_002512820.1| conserved hypothetical protein [Ricinus communis]
           gi|223547831|gb|EEF49323.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 389

 Score = 99.4 bits (246), Expect = 9e-19
 Identities = 67/217 (30%), Positives = 106/217 (48%), Gaps = 14/217 (6%)
 Frame = -3

Query: 611 VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
           +A V+ PN + V+  + HA  N  T+ +  +F+H+E    L L L+R     R  Y PI+
Sbjct: 101 LARVLHPNRQCVEDALRHAKPNTVTRLVSNFFDHSESATDLCLLLRRSVFRARAIYSPIH 160

Query: 431 NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNGC 252
           NL++ L  + +SLTQ  C++A ++ + F   +SPFP   SHN   +   F  L++Q++  
Sbjct: 161 NLLEKLPIELESLTQSHCDNAHEMLVQFNRCDSPFPFPDSHNFQGVRHSFSELRQQLDNR 220

Query: 251 LDRSQPR---------ACTSIRFIDSTV-----KLXXXXXXXXXXXXXALLTGSSFPPAP 114
             RS+ R         AC ++ F+ S V      L                  ++FP   
Sbjct: 221 RLRSRSRVHFVRPAAVAC-ALCFVGSAVTIIFSALAITGHALFAVAACPFCAAANFP--R 277

Query: 113 KLSQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
            L++K+LAH+  L+ AAR  Y L   L TI  L  +L
Sbjct: 278 NLTKKELAHVEQLNAAARGTYMLDEHLTTIGPLVTRL 314


>gb|EOY02976.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 380

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 70/234 (29%), Positives = 110/234 (47%), Gaps = 17/234 (7%)
 Frame = -3

Query: 665 QSKIHAVEEWENGDVNGFVAT-VIEPNPEDVQKIIEHATSNP---FTQQLLTYFNHTEQI 498
           + ++   +  ENGD +  + + +++PN + V + + H  +NP    T+ + TYF H+E I
Sbjct: 70  EERVQVEDVLENGDTHHLILSQMLQPNRDCVAQALRH--TNPKATLTRLVSTYFEHSENI 127

Query: 497 CHLLLNLKRWTHHTRCRYIPINNLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCR 318
             L L L +     R  Y PI  L++    + +S++Q QCN A D+F  F S ++PFP  
Sbjct: 128 TDLCLLLCQCVDRARTLYSPITELLQVFPYELNSISQAQCNWAFDVFQQFYSLDNPFPRP 187

Query: 317 GSHNLNDMHSCFFHLKEQVNGCLDRSQPRA------------CTSIRFIDSTVKLXXXXX 174
            SHN N+M   F  LKEQ++  +++S  R             C     +   V       
Sbjct: 188 DSHNFNEMRCSFSQLKEQLDHRINKSHSRVRFLHRATTGSAICLIGTVVGVVVSAVVIST 247

Query: 173 XXXXXXXXALLTGSSFPPAP-KLSQKDLAHIALLDVAARSAYTLHNELQTIDRL 15
                    + T       P  L +K LAH+A LDVA +     + +L TIDRL
Sbjct: 248 NALASIVGLVATPLCLVYVPTDLRRKQLAHMAQLDVAKKGTSVHNYDLDTIDRL 301


>gb|EOY02975.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 381

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 70/234 (29%), Positives = 110/234 (47%), Gaps = 17/234 (7%)
 Frame = -3

Query: 665 QSKIHAVEEWENGDVNGFVAT-VIEPNPEDVQKIIEHATSNP---FTQQLLTYFNHTEQI 498
           + ++   +  ENGD +  + + +++PN + V + + H  +NP    T+ + TYF H+E I
Sbjct: 71  EERVQVEDVLENGDTHHLILSQMLQPNRDCVAQALRH--TNPKATLTRLVSTYFEHSENI 128

Query: 497 CHLLLNLKRWTHHTRCRYIPINNLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCR 318
             L L L +     R  Y PI  L++    + +S++Q QCN A D+F  F S ++PFP  
Sbjct: 129 TDLCLLLCQCVDRARTLYSPITELLQVFPYELNSISQAQCNWAFDVFQQFYSLDNPFPRP 188

Query: 317 GSHNLNDMHSCFFHLKEQVNGCLDRSQPRA------------CTSIRFIDSTVKLXXXXX 174
            SHN N+M   F  LKEQ++  +++S  R             C     +   V       
Sbjct: 189 DSHNFNEMRCSFSQLKEQLDHRINKSHSRVRFLHRATTGSAICLIGTVVGVVVSAVVIST 248

Query: 173 XXXXXXXXALLTGSSFPPAP-KLSQKDLAHIALLDVAARSAYTLHNELQTIDRL 15
                    + T       P  L +K LAH+A LDVA +     + +L TIDRL
Sbjct: 249 NALASIVGLVATPLCLVYVPTDLRRKQLAHMAQLDVAKKGTSVHNYDLDTIDRL 302


>ref|XP_006446910.1| hypothetical protein CICLE_v10015613mg [Citrus clementina]
           gi|557549521|gb|ESR60150.1| hypothetical protein
           CICLE_v10015613mg [Citrus clementina]
          Length = 379

 Score = 97.8 bits (242), Expect = 3e-18
 Identities = 65/215 (30%), Positives = 108/215 (50%), Gaps = 12/215 (5%)
 Frame = -3

Query: 611 VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
           ++ ++ PN E V++ +     N  +    T F+H+E+  +L L L +     R  Y P+ 
Sbjct: 92  LSQLLLPNRESVKEALRLVKVNSLSDLFSTSFDHSEKTTNLCLQLLKSLFCIRTLYAPVC 151

Query: 431 NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVNGC 252
            L+ +   D  S+TQ QC++A ++FL F S  +PF    S   ++ H CF  LK++++  
Sbjct: 152 ELLDNFPLDHHSVTQSQCDNAFEVFLQFDSLHNPFHSPDSRKFHETHRCFSDLKQKLDKN 211

Query: 251 LDRSQPRAC--------TSIRFIDSTVKLXXXXXXXXXXXXXALLTG----SSFPPAPKL 108
           L +S+ R C        +S+  I + V +             A+  G    + FP A  L
Sbjct: 212 LQKSRSRVCFLRYATAGSSVCIIGTAVGVTIATVGVATHVIFAIFAGPLCTACFPRA--L 269

Query: 107 SQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
           ++K+LA+ A LD A + AY L+  L TIDRL  +L
Sbjct: 270 TKKELANAAQLDAARKGAYVLNKCLDTIDRLVARL 304


>ref|XP_006408570.1| hypothetical protein EUTSA_v10021033mg [Eutrema salsugineum]
           gi|557109716|gb|ESQ50023.1| hypothetical protein
           EUTSA_v10021033mg [Eutrema salsugineum]
          Length = 353

 Score = 93.6 bits (231), Expect = 5e-17
 Identities = 62/210 (29%), Positives = 99/210 (47%), Gaps = 7/210 (3%)
 Frame = -3

Query: 611 VATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCRYIPIN 432
           ++ V++PN E VQ+ + HA S   T  +  YF H+E    L LNL +  H  R  Y P+ 
Sbjct: 94  LSQVLQPNKECVQESLAHAKSTTLTHLISAYFQHSEDATRLCLNLYQSVHSARHLYTPLF 153

Query: 431 NLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCR-GSHNLNDMHSCFFHLKEQVNG 255
           +L   L  D  ++ +  CN A D+FL   + ++PF     SH+      CF  LK +++ 
Sbjct: 154 DLFHILPSDSHAIDESLCNLAFDVFLKLDTFDNPFSSSPESHSFRGTQLCFSQLKHKLDA 213

Query: 254 CLDRSQPRACTSIRFIDSTVKLXXXXXXXXXXXXXALLTGSSFPP------APKLSQKDL 93
            L +S+ R    +R +                       GS+F P         L +K+L
Sbjct: 214 RLRKSRSR----VRLLHHAT------------------AGSAFCPLCSPYLPHSLKKKEL 251

Query: 92  AHIALLDVAARSAYTLHNELQTIDRLAYQL 3
            +I+ L+ AA+  + L+ +L TIDRL  +L
Sbjct: 252 TNISQLNAAAKGTFVLNKDLDTIDRLVSRL 281


>ref|XP_006299705.1| hypothetical protein CARUB_v10015896mg [Capsella rubella]
           gi|482568414|gb|EOA32603.1| hypothetical protein
           CARUB_v10015896mg [Capsella rubella]
          Length = 352

 Score = 92.8 bits (229), Expect = 8e-17
 Identities = 63/217 (29%), Positives = 105/217 (48%), Gaps = 13/217 (5%)
 Frame = -3

Query: 614 FVATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCR-YIP 438
           F++  ++PN E VQ+ +  A     TQ + TYF H+E      LNL +  H  RC  Y P
Sbjct: 68  FLSQELKPNKESVQEALRDAKQTSLTQLVSTYFQHSENATRFCLNLYQNVHSARCHLYTP 127

Query: 437 INNLIKDLKGDPDSLTQEQCNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKEQVN 258
           ++ L     GDP ++ +  CN A D+FL   + E+PFP   SH+  D   C   LK++++
Sbjct: 128 LSEL---FHGDP-AIDEFFCNLAFDVFLKLDTFENPFPSPDSHSFRDTKLCLNQLKDKLD 183

Query: 257 GCLDRSQPR--------ACTSIRFIDSTVKLXXXXXXXXXXXXXALLTGSSFPPAPKL-- 108
             L +S  R          +++  + + V +              L+  +    +P L  
Sbjct: 184 TRLHKSNSRVRILHHATVGSALCLVTAVVAVAGSAAVIAYHALPTLVVVAGPLCSPYLPH 243

Query: 107 --SQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
              +K+L +I+ L+ AA+  + L+ +L TIDRL  +L
Sbjct: 244 SFKKKELTNISQLNAAAKGTFALNTDLDTIDRLVSRL 280


>ref|XP_002883173.1| hypothetical protein ARALYDRAFT_479449 [Arabidopsis lyrata subsp.
           lyrata] gi|297329013|gb|EFH59432.1| hypothetical protein
           ARALYDRAFT_479449 [Arabidopsis lyrata subsp. lyrata]
          Length = 349

 Score = 92.4 bits (228), Expect = 1e-16
 Identities = 67/230 (29%), Positives = 103/230 (44%), Gaps = 9/230 (3%)
 Frame = -3

Query: 665 QSKIHAV-------EEWENGDVNGFVATVIEPNPEDVQKIIEHATSNPFTQQLLTYFNHT 507
           +S++H V        ++   D+   ++ V++PN E VQ+ I H      T  + TYF H+
Sbjct: 64  RSRVHVVVDPTQHHHQYIQPDIELLISQVLQPNKECVQEAIRHFKQTTLTHLVSTYFQHS 123

Query: 506 EQICHLLLNLKRWTHHTRCR-YIPINNLIKDLKGDPDSLTQEQ-CNSASDIFLHFQSKES 333
           E    L LNL +  H  R   Y P+ +L     GD  +   E  CN A D+FL   + E+
Sbjct: 124 ENATRLCLNLYQNVHSARHHLYTPLLDLFNSFPGDTHAAIDESLCNLAFDVFLKLDTFEN 183

Query: 332 PFPCRGSHNLNDMHSCFFHLKEQVNGCLDRSQPRACTSIRFIDSTVKLXXXXXXXXXXXX 153
           PF    SH+  D   CF  LK      LDR   ++ + +R I                  
Sbjct: 184 PFSSPESHSFQDTQLCFSQLKNN----LDRRLRKSRSRVRLIHHAT-------------- 225

Query: 152 XALLTGSSFPPAPKLSQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
              L     P + K  +K+L +I  L+ A++  + L+ +L TIDRL  +L
Sbjct: 226 AGPLCSPYLPHSFK--RKELTNICQLNAASKGTFVLNKDLDTIDRLVSRL 273


>ref|NP_188556.2| uncharacterized protein [Arabidopsis thaliana]
           gi|75273581|sp|Q9LJK4.1|U496L_ARATH RecName:
           Full=UPF0496 protein At3g19250
           gi|9294627|dbj|BAB02966.1| unnamed protein product
           [Arabidopsis thaliana] gi|91806441|gb|ABE65948.1|
           hypothetical protein At3g19250 [Arabidopsis thaliana]
           gi|332642692|gb|AEE76213.1| uncharacterized protein
           AT3G19250 [Arabidopsis thaliana]
          Length = 360

 Score = 89.0 bits (219), Expect = 1e-15
 Identities = 63/220 (28%), Positives = 103/220 (46%), Gaps = 16/220 (7%)
 Frame = -3

Query: 614 FVATVIEPNPEDVQKIIE--HATSNPFTQQLLTYFNHTEQICHLLLNLKRWTHHTRCR-Y 444
           F++  + PN E VQ+ +   HA     T  + TYF H+E      LNL +  H  RC  Y
Sbjct: 69  FLSQELRPNNESVQEALSLRHAKQTTLTNLVSTYFQHSEDATRFCLNLYQNVHSARCHLY 128

Query: 443 IPINNLIKDLKGDPDSLTQEQ-CNSASDIFLHFQSKESPFPCRGSHNLNDMHSCFFHLKE 267
            P+ +L      D  S   E  CN A D+FL   + E+PF    SH+  D   CF+ L +
Sbjct: 129 TPLLDLFNIFPRDSHSAIDESFCNLAFDVFLKLDTFENPFASPESHSFQDTQLCFYQLAD 188

Query: 266 QVNGCLDRSQPR--------ACTSIRFIDSTVKLXXXXXXXXXXXXXALLTGSSFPPAPK 111
           +++  + +S+ R        A +++  + + V +              +L  +     P 
Sbjct: 189 KLDTRIRKSKSRVRLLHHATAGSALCLVTAVVVVAASAAFIAYHALPTILVVAGPLCTPY 248

Query: 110 L----SQKDLAHIALLDVAARSAYTLHNELQTIDRLAYQL 3
           L     +K+L++I  L+VAA+  + L+ +L TIDRL  +L
Sbjct: 249 LPHSFKKKELSNIFQLNVAAKGTFALNKDLDTIDRLVSRL 288

