BLASTX nr result

ID: Cinnamomum25_contig00019699 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum25_contig00019699
         (1203 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006372766.1| hypothetical protein POPTR_0017s04850g [Popu...   162   4e-37
ref|XP_010919192.1| PREDICTED: uncharacterized protein LOC105043...   155   6e-35
ref|XP_010919199.1| PREDICTED: uncharacterized protein LOC105043...   152   6e-34
ref|XP_010657010.1| PREDICTED: uncharacterized protein LOC100242...   149   4e-33
ref|XP_010260342.1| PREDICTED: uncharacterized protein LOC104599...   149   4e-33
ref|XP_010260341.1| PREDICTED: uncharacterized protein LOC104599...   149   4e-33
emb|CBI21908.3| unnamed protein product [Vitis vinifera]              149   4e-33
ref|XP_007221034.1| hypothetical protein PRUPE_ppa015217mg, part...   148   7e-33
ref|XP_008232052.1| PREDICTED: uncharacterized protein LOC103331...   146   3e-32
emb|CDP15879.1| unnamed protein product [Coffea canephora]            145   5e-32
ref|XP_012477223.1| PREDICTED: uncharacterized protein LOC105792...   144   1e-31
ref|XP_007043708.1| Uncharacterized protein isoform 2 [Theobroma...   144   2e-31
ref|XP_006852570.2| PREDICTED: uncharacterized protein LOC184422...   142   4e-31
ref|XP_011043605.1| PREDICTED: uncharacterized protein LOC105139...   142   5e-31
ref|XP_006592445.1| PREDICTED: uncharacterized protein LOC102660...   141   1e-30
ref|XP_004287059.1| PREDICTED: uncharacterized protein LOC101291...   140   2e-30
ref|XP_003526252.1| PREDICTED: uncharacterized protein LOC100790...   140   2e-30
ref|XP_002517843.1| conserved hypothetical protein [Ricinus comm...   138   7e-30
ref|XP_010093966.1| hypothetical protein L484_010532 [Morus nota...   136   4e-29
ref|XP_006469074.1| PREDICTED: lisH domain-containing protein C1...   133   3e-28

>ref|XP_006372766.1| hypothetical protein POPTR_0017s04850g [Populus trichocarpa]
            gi|550319414|gb|ERP50563.1| hypothetical protein
            POPTR_0017s04850g [Populus trichocarpa]
          Length = 474

 Score =  162 bits (411), Expect = 4e-37
 Identities = 142/410 (34%), Positives = 200/410 (48%), Gaps = 15/410 (3%)
 Frame = -2

Query: 1196 KPQKQKNPEKTLN------PKSHLLPSNWDRYDDDEV--FG----NEGSDKSLVDA--RL 1059
            KP   +NP KT +      P+   LPSNWDRY+DDE   FG    N   D S   +    
Sbjct: 22   KPHPNQNPSKTPSTGNNQKPQKSKLPSNWDRYEDDEEDEFGVNLENPSGDNSKKPSFKDY 81

Query: 1058 AQGIAAPKSKGADFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNS 879
              G+A PKSKGADF  L+    ++A+S+ +  +               GV  +L+V+G S
Sbjct: 82   GDGLALPKSKGADFKYLL----DEAKSKPHQVDDFPFLEGFLAEESMHGVGPLLAVRGES 137

Query: 878  LLSCSMDDNFIVDDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQL 699
            +LS   DDNF+V+D  TSS+EASFLSL+LHALA QL KVDVSERLFIE DLL  ELG   
Sbjct: 138  ILSWIGDDNFVVEDETTSSHEASFLSLNLHALAEQLAKVDVSERLFIEADLLPTELGSN- 196

Query: 698  KEMSSSRSNYGETSHEGSE-NSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDN 522
               +SS   + +    GSE +SN G +  +T     TH+ E T++ + EL  +    ++ 
Sbjct: 197  ---TSSSQEFDQMQTTGSEASSNHGPNRKQT-----THDKE-TKTISGEL--TFEDFSEK 245

Query: 521  SQEVHDEKPFPPFTWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGAT 342
            ++ V+ +          G  +P +  + L        +K  L L+   +  + T     +
Sbjct: 246  NKAVNQDAEIFVSGLTIGNSDPISFIQGLD-------VKDNLNLNQHGKSNQRTAME--S 296

Query: 341  PAETELDMLLNTMNEKETKTLGFKAATAEAELDMLLDSFGETKLLDSVDISKEQSSNFPR 162
            PA+     +        ++   F+AA AE+ELDMLLDS  E KLLDS   S   S   P 
Sbjct: 297  PAQFYASSV-----APNSRLPTFEAAAAESELDMLLDSLSEAKLLDS---SGFGSGTLPV 348

Query: 161  AQXXXXXXXXXXXXXXXXXKAVREVPDSFNSPSMTNALDSSIDDLLARTS 12
            ++                 +  R  P S  +      LD+ +DDLL  TS
Sbjct: 349  SE---------KEAAVPLPQLTRNAPGSAKTTPTAATLDNVLDDLLEETS 389


>ref|XP_010919192.1| PREDICTED: uncharacterized protein LOC105043365 isoform X1 [Elaeis
            guineensis]
          Length = 506

 Score =  155 bits (392), Expect = 6e-35
 Identities = 131/391 (33%), Positives = 183/391 (46%), Gaps = 9/391 (2%)
 Frame = -2

Query: 1154 KSHLLPSNWDRYDDDEVFGNEGSD-KSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARS 978
            +S  LPSNWDRYDDD    + G   +S   A+ A G   PKSKGADF  L+ QA+   + 
Sbjct: 58   RSRDLPSNWDRYDDDGDGDDSGDGAESSAGAKRADGEIRPKSKGADFRFLVEQARSQPQD 117

Query: 977  RINPDEXXXXXXXXXXXXFY-QGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLS 801
              +P               Y QG+SSMLSV+G SLLS   DDNFIVDD  TSS E + LS
Sbjct: 118  HRDPGTSQSAFSLDELPSDYIQGISSMLSVRGESLLSWCADDNFIVDDDSTSSCEVNLLS 177

Query: 800  LDLHALAAQLEKVDVSERLFIENDLLADELG-EQLKEMSSSRSNYGETSHEGSENSNLGY 624
            +DLHALAAQL K+ +S+RLFIE DLL +EL  ++LK       +   T  E  ++ + G 
Sbjct: 178  MDLHALAAQLSKLKLSQRLFIEEDLLPEELHIDELKVNQIFEQSETPTMSEHKDSLSQGR 237

Query: 623  DHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPFTWQSGKPNPATDS 444
             HG +        +EK+     +   S +      + V ++      T Q+ K + + DS
Sbjct: 238  FHGNSE-------LEKSVDGQIDHWNSCNVHGITREAVVEKCQTQSPTGQATKFDLSNDS 290

Query: 443  ELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTMNEKETKTLGFKAA 264
                 +  REV        + ++L K T+                  + K+ +T  F+AA
Sbjct: 291  IPAGLSGRREVQ------GSVSQLSKHTVA-----------------DLKQNRTSRFEAA 327

Query: 263  TAEAELDMLLDSFGETKLLDS------VDISKEQSSNFPRAQXXXXXXXXXXXXXXXXXK 102
             AE ELD+L  SF ET+L  S       D S   ++ F  +                   
Sbjct: 328  AAEEELDVLFSSFSETRLSSSHSDGITNDASTSHNATFNSSVHMSPPSVG---------- 377

Query: 101  AVREVPDSFNSPSMTNALDSSIDDLLARTSI 9
                  D  +S +   +L  +IDDLLA TS+
Sbjct: 378  -----QDLNSSGNAGTSLADAIDDLLAETSL 403


>ref|XP_010919199.1| PREDICTED: uncharacterized protein LOC105043365 isoform X2 [Elaeis
            guineensis]
          Length = 497

 Score =  152 bits (383), Expect = 6e-34
 Identities = 131/390 (33%), Positives = 182/390 (46%), Gaps = 8/390 (2%)
 Frame = -2

Query: 1154 KSHLLPSNWDRYDDDEVFGNEGSD-KSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARS 978
            +S  LPSNWDRYDDD    + G   +S   A+ A G   PKSKGADF  L+ QA+   + 
Sbjct: 58   RSRDLPSNWDRYDDDGDGDDSGDGAESSAGAKRADGEIRPKSKGADFRFLVEQARSQPQD 117

Query: 977  RINPDEXXXXXXXXXXXXFY-QGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLS 801
              +P               Y QG+SSMLSV+G SLLS   DDNFIVDD  TSS E + LS
Sbjct: 118  HRDPGTSQSAFSLDELPSDYIQGISSMLSVRGESLLSWCADDNFIVDDDSTSSCEVNLLS 177

Query: 800  LDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENSNLGYD 621
            +DLHALAAQL K+ +S+RLFIE DLL +EL   + E S +      T  E  ++ + G  
Sbjct: 178  MDLHALAAQLSKLKLSQRLFIEEDLLPEEL---IFEQSET-----PTMSEHKDSLSQGRF 229

Query: 620  HGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPFTWQSGKPNPATDSE 441
            HG +        +EK+     +   S +      + V ++      T Q+ K + + DS 
Sbjct: 230  HGNS-------ELEKSVDGQIDHWNSCNVHGITREAVVEKCQTQSPTGQATKFDLSNDSI 282

Query: 440  LLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTMNEKETKTLGFKAAT 261
                +  REV        + ++L K T+                  + K+ +T  F+AA 
Sbjct: 283  PAGLSGRREV------QGSVSQLSKHTV-----------------ADLKQNRTSRFEAAA 319

Query: 260  AEAELDMLLDSFGETKLLDS------VDISKEQSSNFPRAQXXXXXXXXXXXXXXXXXKA 99
            AE ELD+L  SF ET+L  S       D S   ++ F  +                    
Sbjct: 320  AEEELDVLFSSFSETRLSSSHSDGITNDASTSHNATFNSSVHMSPPSVG----------- 368

Query: 98   VREVPDSFNSPSMTNALDSSIDDLLARTSI 9
                 D  +S +   +L  +IDDLLA TS+
Sbjct: 369  ----QDLNSSGNAGTSLADAIDDLLAETSL 394


>ref|XP_010657010.1| PREDICTED: uncharacterized protein LOC100242390 [Vitis vinifera]
            gi|731408881|ref|XP_010657011.1| PREDICTED:
            uncharacterized protein LOC100242390 [Vitis vinifera]
          Length = 429

 Score =  149 bits (376), Expect = 4e-33
 Identities = 115/322 (35%), Positives = 165/322 (51%), Gaps = 17/322 (5%)
 Frame = -2

Query: 1184 QKNPEKTLNPKSHL------LPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGA 1023
            +K P K +  K H       LPSNWDRY+++   G+EG   S+     A  +  PKSKGA
Sbjct: 41   KKQPGKQIREKPHQSMGLSRLPSNWDRYEEEFDSGSEGP--SINSTNQANDVIVPKSKGA 98

Query: 1022 DFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIV 843
            D+ +LI++A   +RS  NP              F QGV S+LSV+G  +LS   D+NFIV
Sbjct: 99   DYGELISEAISQSRS--NPYFDSFASLDDVVPDFNQGVGSLLSVRGQGILSWIGDNNFIV 156

Query: 842  DDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADEL----GEQLKEMSSSRS 675
            +D  T+S+EA FLSL+LH+LA QL KVD+S+RLF+E DLL+ EL     E +K  S+  +
Sbjct: 157  EDRATTSHEAPFLSLNLHSLAEQLTKVDLSQRLFVEEDLLSPELMSVSSEGVKVSSNQEA 216

Query: 674  NYGETSHEGS-----ENSNLGYDHGETSYAVGTHNMEK-TRSNNNELLPSPSTSNDNSQE 513
            N  + + EG+     E++   +   +         M   T    N ++ SP+ S  +  +
Sbjct: 217  NQMQRTSEGAKIIVDESAVRSFPEKDKIVDKNKEVMSSDTTRIRNPVISSPNQSAKSENQ 276

Query: 512  VHDEKPFPPFTWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAE 333
            V D+        Q G+     D EL  Q          +   + A+ EK      A  AE
Sbjct: 277  VKDKAK------QFGRAAQTRDLELAAQ----------INKVSVADPEKKQSVFEAAAAE 320

Query: 332  TELDMLLNTMNE-KETKTLGFK 270
             ELDMLL++ NE  +  +LGFK
Sbjct: 321  AELDMLLDSFNETNKFDSLGFK 342


>ref|XP_010260342.1| PREDICTED: uncharacterized protein LOC104599483 isoform X2 [Nelumbo
            nucifera]
          Length = 428

 Score =  149 bits (376), Expect = 4e-33
 Identities = 128/396 (32%), Positives = 184/396 (46%), Gaps = 13/396 (3%)
 Frame = -2

Query: 1196 KPQKQKNPEKT--LNPKSHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGA 1023
            KP  ++  EK       S  LPSNWDRY+++   G+E  D SL        +  PKSKGA
Sbjct: 41   KPSAKQTREKNRQFRGSSTALPSNWDRYEEEYDSGSE--DPSLGGTSRTSDVVVPKSKGA 98

Query: 1022 DFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIV 843
            DF  LI++A+   +S  +               F QGVS++LS +G ++LS   +DNF V
Sbjct: 99   DFRYLISEAQSQLQSPSDLSLESFDSFGGFLPGFNQGVSTVLSARGKNILSWIGNDNFAV 158

Query: 842  DDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADEL-GEQLKEMSSSRSNYG 666
            +D+ET+S +ASFLS+DLHALA QL KVDVS+RLFI+  LL  E+  E L++      ++ 
Sbjct: 159  EDNETAS-QASFLSMDLHALAEQLAKVDVSQRLFIDAYLLPPEMHSEGLQKSKCQDYDHT 217

Query: 665  ETSHEGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPP 486
            E +HE SE  +   D  E     G+ N E    N  ++ P+       ++ VH      P
Sbjct: 218  EATHE-SEADDHYLDKMEFH---GSANGEDIMGNRPDISPA------TTENVHSVPALLP 267

Query: 485  FTWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNT 306
                        DS  + Q    + +    Q + ++ ++                     
Sbjct: 268  EGSMLVNLAKGGDSTQVGQTCPTKFMNSMEQPNRSSSVDL-------------------- 307

Query: 305  MNEKETKTLGFKAATAEAELDMLLDSFGETKLL----------DSVDISKEQSSNFPRAQ 156
               KE K   F+AA AEAELDMLLDSFGETKL            S   S++Q S FP+  
Sbjct: 308  ---KENKPSRFEAAAAEAELDMLLDSFGETKLFYSGFPVVKQEPSHVSSQQQVSGFPQPS 364

Query: 155  XXXXXXXXXXXXXXXXXKAVREVPDSFNSPSMTNAL 48
                              A+ E+ ++ N  +  NA+
Sbjct: 365  VQAPDASKNASGAFDLDNAIDELRETSNPTNQNNAM 400


>ref|XP_010260341.1| PREDICTED: uncharacterized protein LOC104599483 isoform X1 [Nelumbo
            nucifera]
          Length = 437

 Score =  149 bits (376), Expect = 4e-33
 Identities = 128/396 (32%), Positives = 184/396 (46%), Gaps = 13/396 (3%)
 Frame = -2

Query: 1196 KPQKQKNPEKT--LNPKSHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGA 1023
            KP  ++  EK       S  LPSNWDRY+++   G+E  D SL        +  PKSKGA
Sbjct: 41   KPSAKQTREKNRQFRGSSTALPSNWDRYEEEYDSGSE--DPSLGGTSRTSDVVVPKSKGA 98

Query: 1022 DFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIV 843
            DF  LI++A+   +S  +               F QGVS++LS +G ++LS   +DNF V
Sbjct: 99   DFRYLISEAQSQLQSPSDLSLESFDSFGGFLPGFNQGVSTVLSARGKNILSWIGNDNFAV 158

Query: 842  DDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADEL-GEQLKEMSSSRSNYG 666
            +D+ET+S +ASFLS+DLHALA QL KVDVS+RLFI+  LL  E+  E L++      ++ 
Sbjct: 159  EDNETAS-QASFLSMDLHALAEQLAKVDVSQRLFIDAYLLPPEMHSEGLQKSKCQDYDHT 217

Query: 665  ETSHEGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPP 486
            E +HE SE  +   D  E     G+ N E    N  ++ P+       ++ VH      P
Sbjct: 218  EATHE-SEADDHYLDKMEFH---GSANGEDIMGNRPDISPA------TTENVHSVPALLP 267

Query: 485  FTWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNT 306
                        DS  + Q    + +    Q + ++ ++                     
Sbjct: 268  EGSMLVNLAKGGDSTQVGQTCPTKFMNSMEQPNRSSSVDL-------------------- 307

Query: 305  MNEKETKTLGFKAATAEAELDMLLDSFGETKLL----------DSVDISKEQSSNFPRAQ 156
               KE K   F+AA AEAELDMLLDSFGETKL            S   S++Q S FP+  
Sbjct: 308  ---KENKPSRFEAAAAEAELDMLLDSFGETKLFYSGFPVVKQEPSHVSSQQQVSGFPQPS 364

Query: 155  XXXXXXXXXXXXXXXXXKAVREVPDSFNSPSMTNAL 48
                              A+ E+ ++ N  +  NA+
Sbjct: 365  VQAPDASKNASGAFDLDNAIDELRETSNPTNQNNAM 400


>emb|CBI21908.3| unnamed protein product [Vitis vinifera]
          Length = 453

 Score =  149 bits (376), Expect = 4e-33
 Identities = 115/322 (35%), Positives = 165/322 (51%), Gaps = 17/322 (5%)
 Frame = -2

Query: 1184 QKNPEKTLNPKSHL------LPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGA 1023
            +K P K +  K H       LPSNWDRY+++   G+EG   S+     A  +  PKSKGA
Sbjct: 65   KKQPGKQIREKPHQSMGLSRLPSNWDRYEEEFDSGSEGP--SINSTNQANDVIVPKSKGA 122

Query: 1022 DFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIV 843
            D+ +LI++A   +RS  NP              F QGV S+LSV+G  +LS   D+NFIV
Sbjct: 123  DYGELISEAISQSRS--NPYFDSFASLDDVVPDFNQGVGSLLSVRGQGILSWIGDNNFIV 180

Query: 842  DDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADEL----GEQLKEMSSSRS 675
            +D  T+S+EA FLSL+LH+LA QL KVD+S+RLF+E DLL+ EL     E +K  S+  +
Sbjct: 181  EDRATTSHEAPFLSLNLHSLAEQLTKVDLSQRLFVEEDLLSPELMSVSSEGVKVSSNQEA 240

Query: 674  NYGETSHEGS-----ENSNLGYDHGETSYAVGTHNMEK-TRSNNNELLPSPSTSNDNSQE 513
            N  + + EG+     E++   +   +         M   T    N ++ SP+ S  +  +
Sbjct: 241  NQMQRTSEGAKIIVDESAVRSFPEKDKIVDKNKEVMSSDTTRIRNPVISSPNQSAKSENQ 300

Query: 512  VHDEKPFPPFTWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAE 333
            V D+        Q G+     D EL  Q          +   + A+ EK      A  AE
Sbjct: 301  VKDKAK------QFGRAAQTRDLELAAQ----------INKVSVADPEKKQSVFEAAAAE 344

Query: 332  TELDMLLNTMNE-KETKTLGFK 270
             ELDMLL++ NE  +  +LGFK
Sbjct: 345  AELDMLLDSFNETNKFDSLGFK 366


>ref|XP_007221034.1| hypothetical protein PRUPE_ppa015217mg, partial [Prunus persica]
            gi|462417496|gb|EMJ22233.1| hypothetical protein
            PRUPE_ppa015217mg, partial [Prunus persica]
          Length = 383

 Score =  148 bits (374), Expect = 7e-33
 Identities = 129/400 (32%), Positives = 186/400 (46%), Gaps = 4/400 (1%)
 Frame = -2

Query: 1196 KPQKQKNPEKTLNPK--SHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGA 1023
            KP  ++  EKT NP   +  LP+NWDRY+++   G+E  + +      A  +A P SKGA
Sbjct: 45   KPLGKQVKEKT-NPTHGASALPTNWDRYEEEFEAGSE--EPASDGLNRAPDVAVPMSKGA 101

Query: 1022 DFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIV 843
            D+  LIA+A+  +   I  D               +G+ SMLSV+G S+LS   DDNF+V
Sbjct: 102  DYRHLIAEAQAQSELTIYSDPFPSLDNVLPGDWN-EGIGSMLSVRGESILSRIGDDNFVV 160

Query: 842  DDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGE 663
            +D   + +E SFLSL+LHALA QLEK+ + ERLF+E +LL  EL  + +E + S+S    
Sbjct: 161  EDKTAAHHEVSFLSLNLHALAEQLEKIALPERLFVEAELLPPELHVEGQEATCSQS---- 216

Query: 662  TSHEGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPF 483
                             +     T N E TR      +P  S S       HD +     
Sbjct: 217  -----------------SDPMQATCNEEATRG-----MPEESISEKVQVADHDIEITMSG 254

Query: 482  TWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTM 303
            +  SG P               +++ P L          + ++    P++       + +
Sbjct: 255  STGSGHP---------------DLILPNLG-------SVSAIQGNIDPSKLGKSDYQSKL 292

Query: 302  NEKETK--TLGFKAATAEAELDMLLDSFGETKLLDSVDISKEQSSNFPRAQXXXXXXXXX 129
            +E ET+     F+A+TAEAELDMLLDSFGETK+ DS   S  ++ +   A          
Sbjct: 293  SESETQFSVKSFEASTAEAELDMLLDSFGETKINDSSGFSSVKTVSVQEAAFMAPLQLP- 351

Query: 128  XXXXXXXXKAVREVPDSFNSPSMTNALDSSIDDLLARTSI 9
                       R+ PDS  S  MT   D  +DDL+  TSI
Sbjct: 352  -----------RKAPDS--SVLMTANFDDELDDLINETSI 378


>ref|XP_008232052.1| PREDICTED: uncharacterized protein LOC103331215 [Prunus mume]
          Length = 420

 Score =  146 bits (368), Expect = 3e-32
 Identities = 128/398 (32%), Positives = 190/398 (47%), Gaps = 2/398 (0%)
 Frame = -2

Query: 1196 KPQKQKNPEKTLNPK--SHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGA 1023
            KP  ++  EKT NP   +  LP+NWDRY+++   G+E  + +      A  +A P SKGA
Sbjct: 45   KPLGKQVKEKT-NPTHGASALPTNWDRYEEEFEAGSE--EPAGDGLNRAPDVAVPMSKGA 101

Query: 1022 DFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIV 843
            D+  LIA+A+  +   I  D               +G+ SMLSV+G S+LS   DDNF+V
Sbjct: 102  DYRHLIAEAQAQSELTIYSDPFPSLDNVLPGDWN-EGIGSMLSVRGESILSRIGDDNFVV 160

Query: 842  DDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGE 663
            +D   +  E SFLSL+LHALA QLEK+ + ERLFIE +LL  EL  + ++++ S+S+   
Sbjct: 161  EDKTAAHQEVSFLSLNLHALAEQLEKIALPERLFIEAELLPPELHVEGQDVTCSQSS--- 217

Query: 662  TSHEGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPF 483
                              +    T N E T+      +P  S S+      HD +     
Sbjct: 218  ------------------NRMQATCNEEATQG-----MPHESISDKVQVADHDIEITISG 254

Query: 482  TWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTM 303
            +  SG P+P     +L    +  V++  +     ++L K+  +S  +  ET+  +     
Sbjct: 255  STGSGHPDP-----ILPNLGSVSVIQGNIN---PSKLGKSDYQSKLSECETQFSVK---- 302

Query: 302  NEKETKTLGFKAATAEAELDMLLDSFGETKLLDSVDISKEQSSNFPRAQXXXXXXXXXXX 123
                     F+A+TAEAELDMLLDSFGETK+ DS   S  ++ +   A            
Sbjct: 303  --------SFEASTAEAELDMLLDSFGETKINDSSGFSSVKTVSGQEAPFMAPLQLP--- 351

Query: 122  XXXXXXKAVREVPDSFNSPSMTNALDSSIDDLLARTSI 9
                     R+ PDS  S   T   D  +DDL+  TS+
Sbjct: 352  ---------RKAPDS--SVLATANFDDELDDLINETSV 378


>emb|CDP15879.1| unnamed protein product [Coffea canephora]
          Length = 410

 Score =  145 bits (367), Expect = 5e-32
 Identities = 126/392 (32%), Positives = 183/392 (46%)
 Frame = -2

Query: 1187 KQKNPEKTLNPKSHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKL 1008
            KQ   +   +  S  LP+NWDRY+++  +G+   D   V    A  +  PKSKGAD++ L
Sbjct: 45   KQARDKPYQSQSSKALPTNWDRYEEE--YGSGSEDSPQVSTGQASDVVVPKSKGADYAYL 102

Query: 1007 IAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSET 828
            I++AK  A+S+ N               F QG+ S+LSV+G  LLS   +D F  DD  T
Sbjct: 103  ISEAK--AQSQANSSSESFSLFDDFLDGFNQGLGSLLSVRGEHLLSRISNDVFPFDDKGT 160

Query: 827  SSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEG 648
            SS+EASFLSL+LH+LA QL K +++ERLFIE DLL  E+  +L   +++  N  E    G
Sbjct: 161  SSHEASFLSLNLHSLAEQLSKANLAERLFIEPDLLPPEMCTELD--ANNEKNPDELQATG 218

Query: 647  SENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPFTWQSG 468
            S  +      G+ S  +       ++ N N LL     S+++S+          F+  + 
Sbjct: 219  STEATESEFAGQPSSII-------SKENRNILLSQEYMSSNSSR-------VSQFSVPTS 264

Query: 467  KPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTMNEKET 288
              + A D + + ++ + + L   + +D+++E      R  A  AE ELD           
Sbjct: 265  TDHRADDLKEISRSTSVK-LTSGVSIDSSSEKPS---RFEAAKAEAELD----------- 309

Query: 287  KTLGFKAATAEAELDMLLDSFGETKLLDSVDISKEQSSNFPRAQXXXXXXXXXXXXXXXX 108
                           MLLDSFGETK  DS      + S F                    
Sbjct: 310  ---------------MLLDSFGETKFFDS------KGSTFQSVSVAAQH----------- 337

Query: 107  XKAVREVPDSFNSPSMTNALDSSIDDLLARTS 12
               VRE PD+  S  M  ALD S+DD+L  TS
Sbjct: 338  ---VREGPDATYSGRMDAALDDSLDDILKDTS 366


>ref|XP_012477223.1| PREDICTED: uncharacterized protein LOC105792917 [Gossypium raimondii]
            gi|763759850|gb|KJB27181.1| hypothetical protein
            B456_004G282700 [Gossypium raimondii]
          Length = 415

 Score =  144 bits (364), Expect = 1e-31
 Identities = 123/406 (30%), Positives = 184/406 (45%), Gaps = 9/406 (2%)
 Frame = -2

Query: 1202 LNKPQKQKNPE-KTLNPKSH------LLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIA 1044
            +N+P   K    K +  K+H       LPSNW+RY+++   G+E       D      + 
Sbjct: 35   VNEPSNSKKQTIKQIKEKAHQAQRISALPSNWNRYEEEFDSGSE-------DPTQTPDVI 87

Query: 1043 APKSKGADFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFY-QGVSSMLSVKGNSLLSC 867
             PKSKGADF  L+++A+   ++  NP               + Q V SML+V+G  +LS 
Sbjct: 88   VPKSKGADFRHLLSEAQSQLQA--NPYSNNIPSLDDVFPGDFNQFVGSMLAVRGEGILSW 145

Query: 866  SMDDNFIVDDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMS 687
            + +DNF+VDDS T++ EASFLSL+L ALA QLEKVD+S+RLFIE DLL  +L        
Sbjct: 146  TGNDNFVVDDSTTATPEASFLSLNLQALAEQLEKVDLSKRLFIEEDLLPPDL-------- 197

Query: 686  SSRSNYGETSHEGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVH 507
                                               E+++  N++         D  Q   
Sbjct: 198  ---------------------------------RSERSKVKNDQ-------EPDQMQAAP 217

Query: 506  DEKPFPPFTWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRS-GATPAET 330
            D K     T +   PN    S+      A + +     LD  AE++  ++ S  +  +E+
Sbjct: 218  DRKEAAKIT-EGSTPNDLPGSK------AIDAILSNSGLDLMAEVQSVSISSQNSESSES 270

Query: 329  ELDMLLNTMNEKETKTLGFKAATAEAELDMLLDSFGETKLLDSVDISKEQSSNFPRAQXX 150
                 LN       K   F+AA AEA+LDMLL+SF ETKLLD+ ++S E+ S+    +  
Sbjct: 271  RAPDNLNFTTASNKKVPKFEAAAAEAKLDMLLNSFNETKLLDTSNLSSEKPSSIGSLKAS 330

Query: 149  XXXXXXXXXXXXXXXKAVREVPDSFNSPSMTNALDSSIDDLLARTS 12
                              R + DS  + ++ +  +  +DDLL  TS
Sbjct: 331  NLDSLLDDLLQETSTTVNRGI-DSSKTAAVNSTSEDLLDDLLQETS 375


>ref|XP_007043708.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590691174|ref|XP_007043709.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508707643|gb|EOX99539.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508707644|gb|EOX99540.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 465

 Score =  144 bits (362), Expect = 2e-31
 Identities = 131/402 (32%), Positives = 192/402 (47%), Gaps = 11/402 (2%)
 Frame = -2

Query: 1184 QKNPEKTLNPKSH------LLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGA 1023
            +K   K +  K+H       LPSNWD Y+++   G+E  D+S         +  PKSKGA
Sbjct: 41   KKQTGKQIREKTHQAQRVSALPSNWDHYEEEFDSGSE--DQSGDSTSQVPDVVLPKSKGA 98

Query: 1022 DFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIV 843
            DF  LIA+A+    S    D               Q V  MLSV+G  +LS   +DNF+V
Sbjct: 99   DFHHLIAEAQSQLESNPYTDSLCSSDDILPGDFN-QFVGIMLSVRGEGILSLIQNDNFVV 157

Query: 842  DDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADEL-GEQLKEMSSSRSNYG 666
            +D  T+++ ASFLSL+LHALA QLEKV++SERLFIE DLL+ EL  E  K  S+  S+  
Sbjct: 158  EDRTTATHAASFLSLNLHALAEQLEKVNLSERLFIEEDLLSPELHAEGSKANSNQESDQM 217

Query: 665  ETSHEGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVH-DEKPFP 489
            +T+ EG   + +                        EL     T ND++ +V+   K   
Sbjct: 218  QTTSEGKAAAQI----------------------TEEL-----TLNDSTDKVNIAAKNVE 250

Query: 488  PFTWQSGKPNPATDSELLHQ--NAAREVLKPKLQLDATAELEKTTLRSGATPAETELDML 315
              ++ SG  + + D+ L ++  ++  EV       D  +     + +S A  + T  +  
Sbjct: 251  HISFSSG--SKSVDATLSNEGLDSVDEVYS-----DFISSQRDKSGKSRALESSTHDNSN 303

Query: 314  LNTMNEKETKTLGFKAATAEAELDMLLDSFGETKLLDSVDISKEQSSNFPRAQXXXXXXX 135
              ++  K+  T    AA A  ELDMLL+SF ETKLLDS  +  ++SSN            
Sbjct: 304  SASVPNKKVSTFEAVAAEA--ELDMLLNSFSETKLLDSSGLKTQKSSN-----------D 350

Query: 134  XXXXXXXXXXKAVREVPDSFN-SPSMTNALDSSIDDLLARTS 12
                      +  R+  DS N S  + +++D  +DDLL  TS
Sbjct: 351  YYTEGSPSLAQLARKGDDSSNKSAGVNSSVDDLLDDLLKETS 392


>ref|XP_006852570.2| PREDICTED: uncharacterized protein LOC18442285 [Amborella trichopoda]
          Length = 398

 Score =  142 bits (359), Expect = 4e-31
 Identities = 111/340 (32%), Positives = 162/340 (47%), Gaps = 3/340 (0%)
 Frame = -2

Query: 1193 PQKQKNPEKTLNPKSHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFS 1014
            P  +K  ++TL      LPSNWDRYDD +  G +  D +  +  +      PKSKGAD++
Sbjct: 29   PSTKKQSDQTLTRHDSRLPSNWDRYDDIDFSGAQPEDPNQENVNVG-----PKSKGADYA 83

Query: 1013 KLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDS 834
             L++ AK ++ S ++ D               QG   MLS KG SLLS +  DNFIVDD 
Sbjct: 84   YLLSLAKSESLSLLSFDSVIPDLI--------QGAGPMLSFKGKSLLSWNSYDNFIVDDE 135

Query: 833  ETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSH 654
            E  + EASFLS+DLH LA +L  +++S+R+FIE DLL +EL    ++ S++         
Sbjct: 136  EHLNQEASFLSIDLHKLATKLANINLSKRIFIEEDLLPEELCGTERQGSTTLGIEHVKRA 195

Query: 653  EGSENSNLGYD---HGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPF 483
             G +  N+G      G +      H+      N+ +L+ S S   ++  E + +  F   
Sbjct: 196  LGKDGGNVGSSVMFQGNSDLGTKKHS-----QNHQDLVTSIS---EDYLENYSQPTFSGI 247

Query: 482  TWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTM 303
                   + A D  L      +E    K Q +              TP    LD      
Sbjct: 248  -------DVAVDHFLRGSELPQEPKPNKTQEE-------------QTPRGVALD------ 281

Query: 302  NEKETKTLGFKAATAEAELDMLLDSFGETKLLDSVDISKE 183
                 K  GF+AA AE ELD LLD+FGET+ LD+  I+++
Sbjct: 282  GTNSGKNKGFEAAAAEVELDFLLDTFGETRRLDNFSIAED 321


>ref|XP_011043605.1| PREDICTED: uncharacterized protein LOC105139018 isoform X1 [Populus
            euphratica]
          Length = 474

 Score =  142 bits (358), Expect = 5e-31
 Identities = 148/433 (34%), Positives = 198/433 (45%), Gaps = 38/433 (8%)
 Frame = -2

Query: 1196 KPQKQKNPEKTLNPKSHLLPSNWDRYDDDEV--FG----NEGSDKSLVDA--RLAQGIAA 1041
            KP K  +      P+   LPSNWDRY DDE   FG    N   D S   +      G+A 
Sbjct: 28   KPSKTPSTGNNQKPQKSKLPSNWDRYGDDEEDEFGVNLENPSGDNSKKPSFKDYGDGLAL 87

Query: 1040 PKSKGADFSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSM 861
            PKSKGADF  L+    ++A+S+ +  +               GV  +L+V+G S+LS   
Sbjct: 88   PKSKGADFRYLL----DEAKSKPHQVDDFPFLEHFLAEESMHGVGPLLAVRGESILSWIG 143

Query: 860  DDNFIVDDSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSS 681
            DDNF+V+D  TSS+EASFLSL+LHALA QL KVDVSERLFIE DLL  ELG      +SS
Sbjct: 144  DDNFVVEDETTSSHEASFLSLNLHALAEQLAKVDVSERLFIEADLLPTELGSN----TSS 199

Query: 680  RSNYGETSHEGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDE 501
               + +    GSE    G +HG       TH+ E T++ + EL      S  N     D 
Sbjct: 200  SQEFDQMQTTGSE---AGGNHGPNRKQT-THDKE-TKTISGELT-FEDLSEKNKAVNQDA 253

Query: 500  KPF---------PPFTWQSG---KPNPATDSE-LLHQNAAREVLKPKLQLDATAELEKTT 360
            + F          P ++  G   K N   +     +Q+AA E      QL A +    + 
Sbjct: 254  EIFVSGLTIGNSDPISFIQGLDVKDNLNLNQHGKFNQSAAME---SPAQLYACSVAPSSR 310

Query: 359  LRS-GATPAETELDMLLNTMNE-KETKTLGFKAAT---AEAELDMLLDSF-----GETK- 213
            L +  A  AE+ELDMLL++++E K   + GF + T   +E E  + L        G  K 
Sbjct: 311  LPAFEAAAAESELDMLLDSLSETKLLDSSGFGSGTLPVSEKEAAVPLPQLTRNAPGSAKT 370

Query: 212  -----LLDSV-DISKEQSSNFPRAQXXXXXXXXXXXXXXXXXKAVREVPDSFNSPSMTNA 51
                  LD+V D   E+SS+   A                     R    S  + S    
Sbjct: 371  TPTAATLDNVLDDLLEESSDLQEA-------------AAPLPLLARNAHGSLKTTSTAAT 417

Query: 50   LDSSIDDLLARTS 12
            LD  +DDL   TS
Sbjct: 418  LDDVLDDLFEETS 430


>ref|XP_006592445.1| PREDICTED: uncharacterized protein LOC102660628 [Glycine max]
            gi|734309998|gb|KHM99717.1| hypothetical protein
            glysoja_037281 [Glycine soja]
          Length = 429

 Score =  141 bits (355), Expect = 1e-30
 Identities = 125/391 (31%), Positives = 182/391 (46%), Gaps = 10/391 (2%)
 Frame = -2

Query: 1154 KSH--LLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDAR 981
            KSH   LPSNWDRY+D+E    E  D     A     +  PK+KGADF  L+A+A+  A 
Sbjct: 59   KSHRSALPSNWDRYEDEE----EELDSGSGIASKTVDVVLPKTKGADFRHLVAEAQSQAE 114

Query: 980  SRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLS 801
            + +   E            F  G+SSML V+G  ++S   DDNF+VDD  T + EASFLS
Sbjct: 115  TSL---EGFPAFDDLLPGEFGVGLSSMLVVRGEGIVSWVGDDNFVVDDKTTGNPEASFLS 171

Query: 800  LDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENSNLGYD 621
            L+LHALA    KVD+S+RLFIE+DLL  EL  +   +SS+  +    + E SE +N    
Sbjct: 172  LNLHALAESFAKVDLSKRLFIESDLLPTELCVEELAVSSNEEHKELKTKEDSELAN---- 227

Query: 620  HGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPFTWQSGKPNPATDSE 441
                        M K   + ++L     TS+ +S   H    FP              + 
Sbjct: 228  -----------RMSK-ELDLDDLAADQFTSSSSSSSSHAVSTFP------------LSNN 263

Query: 440  LLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTMNEKETKTLGFKAAT 261
            + H         P   ++A A+    + ++ A    ++   L +T + +  +   F AA 
Sbjct: 264  VFH--------IPVNYVNAEAQQTSCSSKNKAFVPCSDAS-LHSTEDARGKQYSAFGAAD 314

Query: 260  AEAELDMLLDSFGETKLLD--------SVDISKEQSSNFPRAQXXXXXXXXXXXXXXXXX 105
             E ELDMLLDS  ETK+LD        S+ +S   SS +P+                   
Sbjct: 315  VEKELDMLLDSLSETKILDSSGFKSYTSIPVSLGVSSVYPQVS----------------- 357

Query: 104  KAVREVPDSFNSPSMTNALDSSIDDLLARTS 12
               ++ P    + S+T +LD ++D+LL  TS
Sbjct: 358  ---KKDPVPSKTASITASLDDALDELLEETS 385


>ref|XP_004287059.1| PREDICTED: uncharacterized protein LOC101291364 [Fragaria vesca
            subsp. vesca]
          Length = 381

 Score =  140 bits (353), Expect = 2e-30
 Identities = 117/332 (35%), Positives = 163/332 (49%), Gaps = 3/332 (0%)
 Frame = -2

Query: 1190 QKQKNPEKTLNPKSHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSK 1011
            QK K+  K        +P+NWDRYD++   G++ +         A  I  PKSKGAD++ 
Sbjct: 26   QKAKDGAKPNKASGKQIPTNWDRYDEELDSGSQDA---------ASDIVLPKSKGADYTH 76

Query: 1010 LIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSE 831
            LIA+A+  + S+ + D               +G+ SMLS +G S+LS   DDNF+VDD  
Sbjct: 77   LIAEAQSQSLSQFDDDVLSVEWN--------KGIMSMLSARGESILSWIGDDNFVVDDKT 128

Query: 830  TSSY-EASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSH 654
             +++ E SFLSL+LH+LA QLEKVD+SERLFIE DLL  EL  +  E +SS+S       
Sbjct: 129  AAAHHEVSFLSLNLHSLAEQLEKVDLSERLFIEADLLPPELNLEGLESTSSQS------- 181

Query: 653  EGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPFTWQ 474
                             A GT   +  R     ++P  S S +   +++           
Sbjct: 182  --------------ADQAQGTFVNKGAR-----VIPEASISGEFPDKINVADQDIEIMLS 222

Query: 473  SGKPNPATDSELLHQNAAREVLKPKLQLDA-TAELEKTTLRSGATP-AETELDMLLNTMN 300
            S     + DS+ L  N     LK   Q+D   ++L K+T +S   P A+  +  L     
Sbjct: 223  S-----SPDSDCLDSNLGSISLK---QIDVDPSKLGKSTRQSSMKPFADIPIKNLAT--- 271

Query: 299  EKETKTLGFKAATAEAELDMLLDSFGETKLLD 204
                    F+AATAE ELDMLLDSF ETK  D
Sbjct: 272  --------FEAATAEEELDMLLDSFSETKRND 295


>ref|XP_003526252.1| PREDICTED: uncharacterized protein LOC100790093 [Glycine max]
            gi|734334005|gb|KHN07784.1| hypothetical protein
            glysoja_033870 [Glycine soja]
          Length = 433

 Score =  140 bits (352), Expect = 2e-30
 Identities = 122/404 (30%), Positives = 182/404 (45%), Gaps = 8/404 (1%)
 Frame = -2

Query: 1190 QKQKNPEKTLNPKSHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSK 1011
            ++Q + EK        LPSNWDRY+D+E    E  D     A     +  PKSKGADF  
Sbjct: 53   KQQVSEEKKKKSHHSALPSNWDRYEDEE----EELDSGSGIASKTVDVVLPKSKGADFRH 108

Query: 1010 LIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSE 831
            L+A+A+  A + +   E            F  G+SSML V+G  ++S + DDNF+V+D  
Sbjct: 109  LVAEAQSLAETSL---EGFPAFNDLLPGEFGVGLSSMLVVRGEGIVSWAGDDNFVVEDKT 165

Query: 830  TSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHE 651
              + EASFLSL+LHALA    KVD+++RLFIE DLL  EL  +   MSSS  +    + +
Sbjct: 166  NGNLEASFLSLNLHALAESFAKVDLAKRLFIEADLLPTELCVEESAMSSSEEHEELKTKD 225

Query: 650  GSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPFTWQS 471
             SE +N   +  +                 ++L      S+ +S   H    FP      
Sbjct: 226  ESELANRMSEELDV----------------DDLAADQFISSSSSSSSHAASTFP------ 263

Query: 470  GKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTMNEKE 291
                            + +   P   +DA A+   ++ ++ A    ++   L +T + + 
Sbjct: 264  ---------------LSNDFRIPVNYVDAEAQQTSSSGKNKAFVLSSDAS-LHSTEDTRG 307

Query: 290  TKTLGFKAATAEAELDMLLDSFGETKLLD--------SVDISKEQSSNFPRAQXXXXXXX 135
                 F+AA AE ELDMLLDSFGET +LD        S+ +S   +S +P          
Sbjct: 308  KPYSTFEAADAEKELDMLLDSFGETNILDSSGFKSNTSIPVSSGVASVYP---------- 357

Query: 134  XXXXXXXXXXKAVREVPDSFNSPSMTNALDSSIDDLLARTSIVS 3
                          + P    +  +T +LD  +DDLL  TS ++
Sbjct: 358  ---------PHISNKDPVPSKTAPITASLDDVLDDLLEGTSTLT 392


>ref|XP_002517843.1| conserved hypothetical protein [Ricinus communis]
            gi|223542825|gb|EEF44361.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 434

 Score =  138 bits (348), Expect = 7e-30
 Identities = 129/382 (33%), Positives = 179/382 (46%), Gaps = 2/382 (0%)
 Frame = -2

Query: 1142 LPSNWDRYDDDEVFGNEGSDKSLVDA-RLAQGIAAPKSKGADFSKLIAQAKEDARSRINP 966
            LPSN DRY+++    + GS   L D+   A  I  PKSKGAD+  LIA+A+   +S    
Sbjct: 60   LPSNCDRYEEEF---DSGSGDPLGDSINNASDIILPKSKGADYRHLIAEAQSQCQSGSYL 116

Query: 965  DEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHA 786
            D                GV  MLSV+G  +LS + DDNF+V+D    S EA FLSL+L A
Sbjct: 117  DMFPSLEDILPADFKL-GVGPMLSVRGEGILSWTGDDNFVVEDESAVSPEAHFLSLNLSA 175

Query: 785  LAAQLEKVDVSERLFIENDLLADEL-GEQLKEMSSSRSNYGETSHEGSENSNLGYDHGET 609
            LA QL KVD+SERLF+E D+L  EL G   K  SS  S   +TS E   NS +     E 
Sbjct: 176  LAEQLLKVDISERLFMEADILPPELSGHGAKATSSLESEQKQTS-EMKVNSTVS----EE 230

Query: 608  SYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPFTWQSGKPNPATDSELLHQ 429
                      +    ++E++ S S     S  +   + F       G  + +  S     
Sbjct: 231  LILKDLSEKNEFAKQSSEVMSSESILTGQSDPISLNQEFDMINKTEGDFSASRHSSSCEN 290

Query: 428  NAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTMNEKETKTLGFKAATAEAE 249
             A          +++ AE+      SG++ A+ +             K   F+A  AEAE
Sbjct: 291  RA----------MESPAEI------SGSSIADPK------------KKPYMFEATAAEAE 322

Query: 248  LDMLLDSFGETKLLDSVDISKEQSSNFPRAQXXXXXXXXXXXXXXXXXKAVREVPDSFNS 69
            LDMLLDSF ETK LDS   S   S+ FP ++                 + +R  P S + 
Sbjct: 323  LDMLLDSFNETKFLDS---SGFTSAAFPLSK---------KEAPRALPQLIRNTPSS-SK 369

Query: 68   PSMTNALDSSIDDLLARTSIVS 3
             S++  LD ++DDLL +TS +S
Sbjct: 370  TSISATLDDALDDLLEQTSNLS 391


>ref|XP_010093966.1| hypothetical protein L484_010532 [Morus notabilis]
            gi|587865403|gb|EXB54953.1| hypothetical protein
            L484_010532 [Morus notabilis]
          Length = 423

 Score =  136 bits (342), Expect = 4e-29
 Identities = 126/399 (31%), Positives = 175/399 (43%), Gaps = 4/399 (1%)
 Frame = -2

Query: 1196 KPQKQKNPEKTLNPKSH-LLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGAD 1020
            KP  +++ EK L P+    LPSNWDRY+ +   G+E    S    +    +  PKSKGAD
Sbjct: 44   KPSGKQDKEKPLQPRGKSALPSNWDRYEQETDSGSEEPSGSGAIQKQNPDVVLPKSKGAD 103

Query: 1019 FSKLIAQAKEDARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVD 840
            +  LIA+A+  + + +   +            F   V SMLSV+G  +L+ S DDNFIV+
Sbjct: 104  YRHLIAEAQSQSHAYL---DSFPSVDDVLAGEFSLAVGSMLSVRGEGILAWSADDNFIVN 160

Query: 839  DSETSSYEASFLSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMS-SSRSNYGE 663
            D  T+  EA+FLSL+LHALA QLEK+D++ RLFIE DLL  EL  ++ E S + + N   
Sbjct: 161  DKSTTHPEAAFLSLNLHALAEQLEKIDLAHRLFIEADLLPPELHVEVSETSRTQKCNQMP 220

Query: 662  TSHEGSENSNLGYDHGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPF 483
             +++    S L  +         T N     ++ +   P PS S   S  V         
Sbjct: 221  ATNDVEAVSKLPEEL--------TFNEVSLSASPSGGHPDPSLSIRGSSSV--------- 263

Query: 482  TWQSGKPNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTM 303
                G  N    S+  H++ A      +  +D  A+  K      A  AE EL       
Sbjct: 264  --SQGVSNVNRVSQYDHKSNAPHFAVAQSSVDTFADPGKKRPEFEAVAAEAEL------- 314

Query: 302  NEKETKTLGFKAATAEAELDMLLDSFGETKLLDSVDISKEQSSNFPRAQXXXXXXXXXXX 123
                               DMLLDSF E K+ DS  +S   +                  
Sbjct: 315  -------------------DMLLDSFSEIKIPDSSGLSSADT----------------LP 339

Query: 122  XXXXXXKAVREVP-DSFNSPSMTNA-LDSSIDDLLARTS 12
                   AV + P    NS  +TNA LD  +DDLL  TS
Sbjct: 340  VHEEASAAVFQPPRKDPNSSVLTNANLDDDLDDLLKETS 378


>ref|XP_006469074.1| PREDICTED: lisH domain-containing protein C1711.05-like isoform X1
            [Citrus sinensis]
          Length = 456

 Score =  133 bits (334), Expect = 3e-28
 Identities = 129/392 (32%), Positives = 172/392 (43%), Gaps = 15/392 (3%)
 Frame = -2

Query: 1142 LPSNWDRYDDDEVFGNEGSDKSLVDARL-AQGIAAPKSKGADFSKLIAQAKEDARSR--- 975
            LPSNWDRY+D       GSD    D    A     PKSKGAD+  LIA+A+  + S+   
Sbjct: 61   LPSNWDRYED-------GSDMDSEDTTSQASDFVVPKSKGADYRHLIAEAQSQSLSQSRS 113

Query: 974  INPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLD 795
            ++  +            F  G+  MLSV+G  +LS   DDNF+V+D  T+  EASFLSL+
Sbjct: 114  LSYSDTFPLLDDVMPGGFAPGMGPMLSVRGEGILSWVGDDNFVVEDKTTAFQEASFLSLN 173

Query: 794  LHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRS-NYGETSHEGSENSNLGYD- 621
            L+ALA  L KVD+S+RLF+E DLL  ELG +    SS++     +T HE   +  +  D 
Sbjct: 174  LNALAEHLAKVDLSQRLFVEADLLPSELGTEGSIASSNQEPGLMQTEHESEADVGISRDI 233

Query: 620  --------HGETSYAVGTHNMEKTRSNNNELLPSPSTSNDNSQEVHDEKPFPPFTWQSGK 465
                     GE     G H + K  +N +E     ST      ++ D K       ++  
Sbjct: 234  DIASKDFPEGEEEEESGAHKV-KAAANISE--DKASTDFREKVKIVDTKSTSVVGHKNVD 290

Query: 464  PNPATDSELLHQNAAREVLKPKLQLDATAELEKTTLRSGATPAETELDMLLNTMNEKETK 285
               +     L      +V  P  Q D          R G   A  E     N  +   +K
Sbjct: 291  AIFSNQRSALVNQTKNDV--PSSQYD----------RFGQDKA-LEPPAQFNENSVSVSK 337

Query: 284  TLGFKAAT-AEAELDMLLDSFGETKLLDSVDISKEQSSNFPRAQXXXXXXXXXXXXXXXX 108
             L    AT AEAELDMLLDSF +T        S   SS F  +                 
Sbjct: 338  NLPTFEATAAEAELDMLLDSFNDT------GFSYSSSSKFSNSSVSQQTSSTAPPQLS-- 389

Query: 107  XKAVREVPDSFNSPSMTNALDSSIDDLLARTS 12
                R+ PD   S S+T + D  +DDLL  TS
Sbjct: 390  ----RKGPDLSKSASVTASFDDVLDDLLEETS 417


Top