BLASTX nr result

ID: Atropa21_contig00011135 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00011135
         (1427 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006363538.1| PREDICTED: uncharacterized protein DDB_G0271...   497   e-138
ref|XP_006363539.1| PREDICTED: uncharacterized protein DDB_G0271...   493   e-136
ref|XP_006363537.1| PREDICTED: uncharacterized protein DDB_G0271...   493   e-136
ref|XP_004237230.1| PREDICTED: uncharacterized protein LOC101245...   488   e-135
ref|XP_004238731.1| PREDICTED: uncharacterized protein LOC101245...   290   1e-75
ref|XP_006357278.1| PREDICTED: micronuclear linker histone polyp...   254   5e-65
ref|XP_002267713.2| PREDICTED: uncharacterized protein LOC100258...   186   2e-44
ref|XP_002520203.1| conserved hypothetical protein [Ricinus comm...   144   9e-32
ref|XP_006483756.1| PREDICTED: muscle M-line assembly protein un...   142   3e-31
ref|XP_006438506.1| hypothetical protein CICLE_v10031371mg [Citr...   142   3e-31
gb|EMJ25426.1| hypothetical protein PRUPE_ppa018071mg, partial [...   140   1e-30
gb|EOY00300.1| Uncharacterized protein isoform 3 [Theobroma caca...   137   9e-30
gb|EOY00298.1| Uncharacterized protein isoform 1 [Theobroma caca...   137   9e-30
ref|XP_004504495.1| PREDICTED: uncharacterized protein LOC101508...   137   1e-29
gb|EOY00304.1| Uncharacterized protein isoform 7 [Theobroma cacao]    135   6e-29
gb|EXB82666.1| hypothetical protein L484_027847 [Morus notabilis]     134   9e-29
ref|XP_006584485.1| PREDICTED: uncharacterized protein LOC100306...   134   9e-29
ref|XP_006584484.1| PREDICTED: uncharacterized protein LOC100306...   134   9e-29
ref|XP_002311790.2| hypothetical protein POPTR_0008s19710g, part...   133   2e-28
ref|XP_003517869.1| PREDICTED: neurofilament medium polypeptide ...   133   2e-28

>ref|XP_006363538.1| PREDICTED: uncharacterized protein DDB_G0271670-like isoform X2
            [Solanum tuberosum]
          Length = 454

 Score =  497 bits (1280), Expect = e-138
 Identities = 287/424 (67%), Positives = 314/424 (74%), Gaps = 12/424 (2%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            EA+KCKT GSVAQKKAYFEAHY                     K   E +  +  LDEPH
Sbjct: 42   EADKCKTSGSVAQKKAYFEAHYKKIATQ---------------KMELEKMEQVESLDEPH 86

Query: 181  FQDHSENTQVSNSHFGLSDGEDEKTRDDVNNSDHVDEPKDDVFVNVKAKEGPILETSDNG 360
             QD SE+T V ++    + GE+E TR D+NNSD VD   + + V +K KEG IL   D+G
Sbjct: 87   IQDRSESTHVFDTDRCATQGEEEMTRADMNNSDSVDMEVNSLLV-LKDKEGEIL---DHG 142

Query: 361  EVPKVEEQSS-ERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTARKVXVHPTTEDR 537
            EVP VE+  S E GSQDN KEI Q D+E          TPK NLKNTARKV  HPTTEDR
Sbjct: 143  EVPNVEQHKSCEIGSQDNLKEISQVDNEAKSSSAKKSKTPKSNLKNTARKV--HPTTEDR 200

Query: 538  ISAGTKKKSASPVTKSSRISTPT--PKPVSKNISSSQQSVTKVNGLSYQRSSNAPVAQSN 711
            ISAGTKKK ASPVTKSSRISTPT  P P SK ISSSQ SV KVNG+SYQRSSNAPVAQ N
Sbjct: 201  ISAGTKKKLASPVTKSSRISTPTSKPPPASKVISSSQTSVKKVNGVSYQRSSNAPVAQGN 260

Query: 712  KVLSRSVNSPSESSIKKLNGSTLQRSKNSST---------SMHMSLSLGPPNSTASTTTM 864
            K+LSRS+ SPS+SSIKKLNGSTLQRSKNSST         S+HMSLSLGPPNSTAST TM
Sbjct: 261  KLLSRSLISPSQSSIKKLNGSTLQRSKNSSTLENKRIAPTSLHMSLSLGPPNSTASTNTM 320

Query: 865  RKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKVFPKGSEQKISASPTPRK 1044
            RKSLIMERMG+KDIVKRAFKAFQ+SFNQGK EVDTRYS SKKV PKGSEQKISASPTP+K
Sbjct: 321  RKSLIMERMGDKDIVKRAFKAFQSSFNQGKPEVDTRYSGSKKVLPKGSEQKISASPTPKK 380

Query: 1045 EIDRLRKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIERKKANTVRPAGT*TDILTDKLKE 1224
            E++RLRKTSD V+ QKCQSGTRSNSLSS APKDA IERKK N+VRPAG   D   DKLKE
Sbjct: 381  EVERLRKTSDTVMTQKCQSGTRSNSLSSRAPKDAVIERKKVNSVRPAGMSIDRSIDKLKE 440

Query: 1225 KITK 1236
             I K
Sbjct: 441  DIIK 444


>ref|XP_006363539.1| PREDICTED: uncharacterized protein DDB_G0271670-like isoform X3
            [Solanum tuberosum]
          Length = 451

 Score =  493 bits (1268), Expect = e-136
 Identities = 287/425 (67%), Positives = 314/425 (73%), Gaps = 13/425 (3%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            EA+KCKT GSVAQKKAYFEAHY                     K   E +  +  LDEPH
Sbjct: 38   EADKCKTSGSVAQKKAYFEAHYKKIATQ---------------KMELEKMEQVESLDEPH 82

Query: 181  FQDHSENTQVSNSHFGLSDGEDEKTRDDVNNSDHVDEPKDDVFVNVKAKEGPILETSDNG 360
             QD SE+T V ++    + GE+E TR D+NNSD VD   + + V +K KEG IL   D+G
Sbjct: 83   IQDRSESTHVFDTDRCATQGEEEMTRADMNNSDSVDMEVNSLLV-LKDKEGEIL---DHG 138

Query: 361  EVPKVEEQSS-ERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTARKVXVHPTTEDR 537
            EVP VE+  S E GSQDN KEI Q D+E          TPK NLKNTARKV  HPTTEDR
Sbjct: 139  EVPNVEQHKSCEIGSQDNLKEISQVDNEAKSSSAKKSKTPKSNLKNTARKV--HPTTEDR 196

Query: 538  ISAGTKKKSASPVTKSSRISTPT--PKPVSKNISSSQQSVTKVNGLSYQRSSNAPVAQSN 711
            ISAGTKKK ASPVTKSSRISTPT  P P SK ISSSQ SV KVNG+SYQRSSNAPVAQ N
Sbjct: 197  ISAGTKKKLASPVTKSSRISTPTSKPPPASKVISSSQTSVKKVNGVSYQRSSNAPVAQGN 256

Query: 712  KVLSRSVNSPSESSIKKLNGSTLQRSKNSST---------SMHMSLSLGPPNSTASTTTM 864
            K+LSRS+ SPS+SSIKKLNGSTLQRSKNSST         S+HMSLSLGPPNSTAST TM
Sbjct: 257  KLLSRSLISPSQSSIKKLNGSTLQRSKNSSTLENKRIAPTSLHMSLSLGPPNSTASTNTM 316

Query: 865  RKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKVFPKGSEQKISASPTPRK 1044
            RKSLIMERMG+KDIVKRAFKAFQ+SFNQGK EVDTRYS SKKV PKGSEQKISASPTP+K
Sbjct: 317  RKSLIMERMGDKDIVKRAFKAFQSSFNQGKPEVDTRYSGSKKVLPKGSEQKISASPTPKK 376

Query: 1045 EIDRLRKTSDKVIAQKCQSGTRSNSLSS-GAPKDAGIERKKANTVRPAGT*TDILTDKLK 1221
            E++RLRKTSD V+ QKCQSGTRSNSLSS  APKDA IERKK N+VRPAG   D   DKLK
Sbjct: 377  EVERLRKTSDTVMTQKCQSGTRSNSLSSRRAPKDAVIERKKVNSVRPAGMSIDRSIDKLK 436

Query: 1222 EKITK 1236
            E I K
Sbjct: 437  EDIIK 441


>ref|XP_006363537.1| PREDICTED: uncharacterized protein DDB_G0271670-like isoform X1
            [Solanum tuberosum]
          Length = 455

 Score =  493 bits (1268), Expect = e-136
 Identities = 287/425 (67%), Positives = 314/425 (73%), Gaps = 13/425 (3%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            EA+KCKT GSVAQKKAYFEAHY                     K   E +  +  LDEPH
Sbjct: 42   EADKCKTSGSVAQKKAYFEAHYKKIATQ---------------KMELEKMEQVESLDEPH 86

Query: 181  FQDHSENTQVSNSHFGLSDGEDEKTRDDVNNSDHVDEPKDDVFVNVKAKEGPILETSDNG 360
             QD SE+T V ++    + GE+E TR D+NNSD VD   + + V +K KEG IL   D+G
Sbjct: 87   IQDRSESTHVFDTDRCATQGEEEMTRADMNNSDSVDMEVNSLLV-LKDKEGEIL---DHG 142

Query: 361  EVPKVEEQSS-ERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTARKVXVHPTTEDR 537
            EVP VE+  S E GSQDN KEI Q D+E          TPK NLKNTARKV  HPTTEDR
Sbjct: 143  EVPNVEQHKSCEIGSQDNLKEISQVDNEAKSSSAKKSKTPKSNLKNTARKV--HPTTEDR 200

Query: 538  ISAGTKKKSASPVTKSSRISTPT--PKPVSKNISSSQQSVTKVNGLSYQRSSNAPVAQSN 711
            ISAGTKKK ASPVTKSSRISTPT  P P SK ISSSQ SV KVNG+SYQRSSNAPVAQ N
Sbjct: 201  ISAGTKKKLASPVTKSSRISTPTSKPPPASKVISSSQTSVKKVNGVSYQRSSNAPVAQGN 260

Query: 712  KVLSRSVNSPSESSIKKLNGSTLQRSKNSST---------SMHMSLSLGPPNSTASTTTM 864
            K+LSRS+ SPS+SSIKKLNGSTLQRSKNSST         S+HMSLSLGPPNSTAST TM
Sbjct: 261  KLLSRSLISPSQSSIKKLNGSTLQRSKNSSTLENKRIAPTSLHMSLSLGPPNSTASTNTM 320

Query: 865  RKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKVFPKGSEQKISASPTPRK 1044
            RKSLIMERMG+KDIVKRAFKAFQ+SFNQGK EVDTRYS SKKV PKGSEQKISASPTP+K
Sbjct: 321  RKSLIMERMGDKDIVKRAFKAFQSSFNQGKPEVDTRYSGSKKVLPKGSEQKISASPTPKK 380

Query: 1045 EIDRLRKTSDKVIAQKCQSGTRSNSLSS-GAPKDAGIERKKANTVRPAGT*TDILTDKLK 1221
            E++RLRKTSD V+ QKCQSGTRSNSLSS  APKDA IERKK N+VRPAG   D   DKLK
Sbjct: 381  EVERLRKTSDTVMTQKCQSGTRSNSLSSRRAPKDAVIERKKVNSVRPAGMSIDRSIDKLK 440

Query: 1222 EKITK 1236
            E I K
Sbjct: 441  EDIIK 445


>ref|XP_004237230.1| PREDICTED: uncharacterized protein LOC101245640 [Solanum
            lycopersicum]
          Length = 460

 Score =  488 bits (1257), Expect = e-135
 Identities = 282/424 (66%), Positives = 312/424 (73%), Gaps = 12/424 (2%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E +KCKT GSVAQKKAYFEAHY                     K   + +  +  LDEPH
Sbjct: 38   EVDKCKTSGSVAQKKAYFEAHYK--------------------KIAAQKMEQVESLDEPH 77

Query: 181  FQDHSENTQVSNSHFGLSDGEDEKTRDDVNNSDHVDEPKDDVFVNVKAKEGPILETSDNG 360
             QD +E+TQV ++H     G +E TR DVNNSD     K +  + +  KEG ILET DNG
Sbjct: 78   IQDRNESTQVFDTH-----GVEETTRADVNNSDM----KVNSLLVLIDKEGEILETGDNG 128

Query: 361  EVPKVEE-QSSERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTARKVXVHPTTEDR 537
            EV  +E+ +S E GSQD+ KEI Q D+E          TPK NLKNTARKV  HPTTEDR
Sbjct: 129  EVSNLEKHESCEIGSQDDHKEISQVDNEAKISSAKKSKTPKSNLKNTARKV--HPTTEDR 186

Query: 538  ISAGTKKKSASPVTKSSRISTPT--PKPVSKNISSSQQSVTKVNGLSYQRSSNAPVAQSN 711
            ISAGTKKK ASPVTKSSRISTPT  P P SK ISSSQ SV KVNG+SYQRSSN+PVAQSN
Sbjct: 187  ISAGTKKKLASPVTKSSRISTPTSKPTPASKVISSSQTSVKKVNGVSYQRSSNSPVAQSN 246

Query: 712  KVLSRSVNSPSESSIKKLNGSTLQRSKNSST---------SMHMSLSLGPPNSTASTTTM 864
            K+LSRS+ SPS+SSIKKLN STLQRSKNSST         S+HMSLSLGPPNSTAST TM
Sbjct: 247  KLLSRSLISPSQSSIKKLNSSTLQRSKNSSTLENKRIAPTSLHMSLSLGPPNSTASTNTM 306

Query: 865  RKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKVFPKGSEQKISASPTPRK 1044
            RKSLIM+RMG+KDIVKRAFKAFQ+SFNQGK EVDTRYS SKKV PKGSE+KISASPTP+K
Sbjct: 307  RKSLIMDRMGDKDIVKRAFKAFQSSFNQGKPEVDTRYSGSKKVLPKGSEKKISASPTPKK 366

Query: 1045 EIDRLRKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIERKKANTVRPAGT*TDILTDKLKE 1224
            E++RLRKTSD VI QKCQSGTRSNSLSS APKDA IERKK NTVRPAG   D   DKLKE
Sbjct: 367  EVERLRKTSDAVITQKCQSGTRSNSLSSRAPKDAVIERKKVNTVRPAGMSIDRSIDKLKE 426

Query: 1225 KITK 1236
             I K
Sbjct: 427  DIIK 430


>ref|XP_004238731.1| PREDICTED: uncharacterized protein LOC101245760 [Solanum
            lycopersicum]
          Length = 602

 Score =  290 bits (741), Expect = 1e-75
 Identities = 219/549 (39%), Positives = 271/549 (49%), Gaps = 137/549 (24%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESL-PTILDLDEP 177
            E EKC TPGSVAQKKAYFEAHY                  ++ +Q+++ + P   ++ EP
Sbjct: 55   EVEKCSTPGSVAQKKAYFEAHYKRIAAKKLEQLE------EETRQVEQEMEPLSPEVTEP 108

Query: 178  HFQDHSENTQVSNSHFGLSDGE----DEKTRDDVN--NSDHV------------------ 285
               D +EN   S+  F  S+GE    DE+    VN  NSD V                  
Sbjct: 109  KSGDVTENGN-SDGDFSSSNGESSSVDEQQMSVVNLKNSDAVDEPKEDITVGVECDNLLV 167

Query: 286  -----------DEPKDDVFVNV-------------------------------------K 321
                       DE KDD  V++                                     K
Sbjct: 168  TEAKELTISGIDESKDDTSVDIECFSPLVTEAKEGTISGIDESNEDISVDLECDSLVVTK 227

Query: 322  AKEGPILETSDNGEVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTA 501
             KE  IL T D G + K EE++ E   QD+  E  QA++E          TP  N+K+  
Sbjct: 228  TKEETILGTCDQGVLNKAEERNLENVCQDSVVETPQANTEAQKASLKKSKTPNANVKHVP 287

Query: 502  RKVXVHPTTEDRISAGTKKKSASPVTKSSRISTPTPK--PVSKNISSSQQSVTKVNGLSY 675
            RKV    T + R+S GTKKK  SPV KSSRISTPT K  P S  I+ SQ SV KV G+S 
Sbjct: 288  RKVY---TPDARVSVGTKKKLTSPVAKSSRISTPTSKQVPTSMVITPSQPSVKKVTGMST 344

Query: 676  QRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGST-------------------------- 777
            QRS+  P+AQ  K++  S  SPS+SS KKLNG+T                          
Sbjct: 345  QRSNTTPLAQRKKLVPGSFVSPSQSSNKKLNGATPSQSSNKKLNGASPSQSSNKNLNGAT 404

Query: 778  --------------------------LQRSKNSS---------TSMHMSLSLGPPNSTAS 852
                                      LQRS NS          TS+HMSLSL  PNSTAS
Sbjct: 405  PCQSSNKKLNGATSSRSSSKTLNGAALQRSVNSPVLEDKRRVPTSLHMSLSLSSPNSTAS 464

Query: 853  TTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKVFPKGSEQKISASP 1032
            T TMR+SL ME MG+KDIVKRAFKAFQNS++QG+   D  Y    +V  K SEQKIS S 
Sbjct: 465  TNTMRRSLFMETMGDKDIVKRAFKAFQNSYSQGRSVGDMTYDIQDQVSSKESEQKISTSS 524

Query: 1033 TPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIERKKANTVRPA-GT*TDILT 1209
            T +K+ +RLRKT DKVI  K QSGTRS S SSGAPKDAG+E+K+ N++R +  +  D  T
Sbjct: 525  T-QKDSERLRKTPDKVITLKGQSGTRSASSSSGAPKDAGVEKKRVNSIRASTSSRIDRST 583

Query: 1210 DKLKEKITK 1236
            DK KE++TK
Sbjct: 584  DKWKEEVTK 592


>ref|XP_006357278.1| PREDICTED: micronuclear linker histone polyprotein-like [Solanum
            tuberosum]
          Length = 587

 Score =  254 bits (650), Expect = 5e-65
 Identities = 203/545 (37%), Positives = 250/545 (45%), Gaps = 137/545 (25%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESL-PTILDLDEP 177
            E EKC TPGSVAQKKAYFEAHY                  ++ +Q+++ + P   ++ EP
Sbjct: 55   EVEKCSTPGSVAQKKAYFEAHYKRIAAKKLEQLE------EETRQVEQKMEPLCPEVAEP 108

Query: 178  HFQDHSENTQVSNSHFGLSDGE----DEKTRD--DVNNSDHVD----------------- 288
               D +EN   S+  F  S GE    DE+     ++ NSD VD                 
Sbjct: 109  KSGDVTENG-TSDGDFSSSKGERSSVDEQQMSVVELKNSDAVDEPKEDITVDVECDNLLV 167

Query: 289  ------------EPKDDVFVNV-------------------------------------K 321
                        E KDD+ V++                                     K
Sbjct: 168  TKAKELTISGIDESKDDISVDIECVSPFATEAKEGTVLGIDESNEDISVDVECDNLVVTK 227

Query: 322  AKEGPILETSDNGEVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTA 501
             KE  IL T D GE  KVEE++ E+G Q +  E  QA++E          TP  N+KN  
Sbjct: 228  TKEETILGTCDQGEFHKVEERNPEKGCQVSVVETPQANTEAQKASLKKSKTPNANVKNVP 287

Query: 502  RKVXVHPTTEDRISAGTKKKSASPVTKSSRISTPTPK--PVSKNISSSQQSVTKVNGLSY 675
            RK     T E R+S GTKKK  SPV KSSRISTPT K  P SK ++ SQ S  KVNG+S 
Sbjct: 288  RKAY---TPEARVSVGTKKKLTSPVAKSSRISTPTSKQAPTSKVVTPSQPSAKKVNGMST 344

Query: 676  QRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGSTL------------------------- 780
            QRS+N P+ Q  K++  S  SPS+SS KKLNG+TL                         
Sbjct: 345  QRSNNTPLVQHKKLVPGSFVSPSQSSNKKLNGATLSQSSNKKMNGATPSQSSNKKLNGAT 404

Query: 781  ---------------------------QRSKNSS---------TSMHMSLSLGPPNSTAS 852
                                       QRS NS          TS+HMSL L  PNSTAS
Sbjct: 405  PSQSSNKKLNGAMSSQSSNKTLNGAALQRSVNSPVLEDKRVVPTSLHMSLRLSSPNSTAS 464

Query: 853  TTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKVFPKGSEQKISASP 1032
            T TMRKSL ME MG+KDIVKRAFKAFQNSF+QG+   D  Y    +V  KGSEQKIS S 
Sbjct: 465  TNTMRKSLFMETMGDKDIVKRAFKAFQNSFSQGRSAGDMTYDVQDQVSSKGSEQKISLSS 524

Query: 1033 TPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIERKKANTVRPA-GT*TDILT 1209
            T +KE +R                         APKDAG+E+K+ N++R + G   D  T
Sbjct: 525  T-QKESER-------------------------APKDAGVEKKRVNSIRASTGLRIDRST 558

Query: 1210 DKLKE 1224
            DK KE
Sbjct: 559  DKWKE 563


>ref|XP_002267713.2| PREDICTED: uncharacterized protein LOC100258808 [Vitis vinifera]
            gi|296086485|emb|CBI32074.3| unnamed protein product
            [Vitis vinifera]
          Length = 513

 Score =  186 bits (472), Expect = 2e-44
 Identities = 140/391 (35%), Positives = 194/391 (49%), Gaps = 27/391 (6%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVAQKKAYFEAHY                  D  KQ+       L  D+P+
Sbjct: 56   EVEKCSTPGSVAQKKAYFEAHYKKIAARKAELL-------DLEKQMGTDP---LGSDDPN 105

Query: 181  FQDHSENTQVSNSHFGLSDGED--EKTRDDVN-----NSDHVDEPKDD---VFVNVKAKE 330
              D   NT  +N+ F +S+G+   E    D N      + HVDEP +      + ++ + 
Sbjct: 106  CGDQIRNTDGNNTEFDVSNGQSSAEGVDQDTNLISVVTTTHVDEPSESNEGAPITIECQS 165

Query: 331  GPILETS-------------DNGEVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXX 471
              + E               D  E   ++E++S  GSQ+  +     D+           
Sbjct: 166  SSVEEAEEELDSKQGTPKLKDGEETVSIKEEASPMGSQNVMELPPSLDNGTGNTPRIKKE 225

Query: 472  TPKPNLKNTARKVXVHPTTEDRISAGTKKKSASPVTKSSRISTPT---PKPVSKNISSSQ 642
             PK +     +K+ +    ++R +A   KK+ SP+ KS +IS P    P P SK ISSSQ
Sbjct: 226  RPKLDPPKETKKITL--ANKERKTASVMKKAVSPIAKSPQISKPRDSKPTPTSKMISSSQ 283

Query: 643  QSVTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGSTLQRSKNSSTSMHMSL 822
             S+ K NG S  ++ N    +  K   RS   PS    KK+          + TS+H SL
Sbjct: 284  PSIKKANGSSLPKNKNPSAGEIKKPSPRS-KIPSAGEWKKV----------APTSLHKSL 332

Query: 823  SLGPPNS-TASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKVFP 999
            SLGPP+S +AS TT RKSLIME+MG+KDIV+RAFK FQNSFNQ K   + R S  K+V  
Sbjct: 333  SLGPPHSDSASLTTTRKSLIMEKMGDKDIVRRAFKTFQNSFNQLKPSSEVRSSVPKQVSA 392

Query: 1000 KGSEQKISASPTPRKEIDRLRKTSDKVIAQK 1092
            K +E ++S S T +++ +R  K    V+ QK
Sbjct: 393  KSTEPRVSTSITTQRDKERPLKAG--VVDQK 421


>ref|XP_002520203.1| conserved hypothetical protein [Ricinus communis]
            gi|223540695|gb|EEF42258.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 556

 Score =  144 bits (363), Expect = 9e-32
 Identities = 136/433 (31%), Positives = 191/433 (44%), Gaps = 45/433 (10%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVA KKAYFEAHY                   Q KQ++      L  ++ +
Sbjct: 55   EVEKCATPGSVAMKKAYFEAHYKKIAAKKAEQLG-------QEKQMEHKP---LGSNDQN 104

Query: 181  FQDHSENTQVSNSHFGLSDGE--DEKTRDDVN-----NSDHVDEPKDDVFVNVKAK---- 327
              D        +S F   + +   E TR ++      +S  V+EP +D  +N++A+    
Sbjct: 105  GGDPIGKANGIDSEFDTFNTQTSSEGTRQEIKLDSELDSGLVNEPYEDGAINLEAQGLSV 164

Query: 328  -----------EGPILETSDNGEVPKVEEQSSERGSQDNRKEI-RQADSEGXXXXXXXXX 471
                       +GP L   +  E P V E  +        K++ ++ D E          
Sbjct: 165  EQAEEELCSRIDGPSLNKPE--ETPFVREAETIPMESQAMKDLPKKLDKEAESIPIVKER 222

Query: 472  TPKPNLKNTARKVX-------------VHPTTEDRISAGTKKKSASPVTKSSRISTPTPK 612
              K N +   +KV                P ++ R  A  KKK ASPV KS+++STP   
Sbjct: 223  NAKINQRKEPQKVNNFAIEIIDSYKETTSPMSKVRDMARIKKKPASPVAKSTQLSTPKVT 282

Query: 613  ---PVSKNISSSQQSVTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGSTLQ 783
               P S  +S+ Q S  K    S  +S +  VA +NKV  +S                  
Sbjct: 283  KTGPTSGVLSTPQSSTKKATVSSLPKSKSPSVAGNNKVAPKS------------------ 324

Query: 784  RSKNSSTSMHMSLSLGPPNS------TASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFN 945
                    +HMSLS+  PNS       A TTT RKS IME+M +K+IVKRAFK FQN++N
Sbjct: 325  --------LHMSLSMDTPNSDPAPLAAAPTTTARKSFIMEKMKDKEIVKRAFKTFQNNYN 376

Query: 946  QGKLEVDTRYSESKKVFPKGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSGTRSNSLS 1125
            Q K   D R   +K+V  KG+E K+S+S TPRKE       S K ++   ++   + S S
Sbjct: 377  QLKSSADERSLVAKQVPTKGTEVKVSSSMTPRKE----NAGSFKAVSMDKKTAKAAPS-S 431

Query: 1126 SGAPKDAGIERKK 1164
             G   D   ER+K
Sbjct: 432  FGLKSDERTERRK 444


>ref|XP_006483756.1| PREDICTED: muscle M-line assembly protein unc-89-like [Citrus
            sinensis]
          Length = 484

 Score =  142 bits (359), Expect = 3e-31
 Identities = 133/423 (31%), Positives = 189/423 (44%), Gaps = 34/423 (8%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVA+K AYFEAHY                  + + ++D    T  DL   +
Sbjct: 59   EVEKCATPGSVAKKAAYFEAHYKKIAARKAELLDQEKQMDNDSSRLDNQ--TCGDLMADN 116

Query: 181  FQDHSENTQVSNSHFGLSDGEDEKTRDDVNNSDH----------VDEPKDDVFVNVKAKE 330
             ++ SE           SD  D +  DD+   +           VD+P  D  + V+ + 
Sbjct: 117  CKNKSE-----------SDISDHQRSDDIVYPETSLVNEVRGMPVDQPGGDAAIKVECQS 165

Query: 331  GPILETSDNGEVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTARKV 510
             P         V +V+E+ S   S  + K                  T K +++N++ ++
Sbjct: 166  SP---------VERVKEEKSRLESPTSNKPEEAV-----------VVTVKEDVENSSMRM 205

Query: 511  XV---------HPTT---EDRISAGTKKKS--ASPVTKSSRISTPTPKPVSKNISSSQQS 648
             +          P T   E+ +     K S   +PV K   IS    KP S    SS   
Sbjct: 206  VIVKELQEKEMEPATNVKEENVKLDHPKNSHKIAPVNKEKNISKIKKKPASPAAKSSP-- 263

Query: 649  VTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGSTLQRSKNSST-------- 804
            +TK + ++  +S +    + +K    S  S S SS K  NGS+L RSKN S         
Sbjct: 264  ITKASRIA--KSPHLSTPKVSKPTPMSTLSSSRSSTKIGNGSSLPRSKNLSAGESKKVAP 321

Query: 805  -SMHMSLSLGPPNST-ASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYS 978
             S+H+SLSLGP +S   S TT RKSLIME+MG+KDIVKRAFK FQN++NQ K   + R  
Sbjct: 322  KSLHISLSLGPSSSDPVSLTTTRKSLIMEKMGDKDIVKRAFKTFQNNYNQLKSSKEERSP 381

Query: 979  ESKKVFPKGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIER 1158
              K+V  KG+E ++  S TPRKE                         ++G+ K AG+E+
Sbjct: 382  APKQVTAKGAEPRV-PSLTPRKE-------------------------NAGSFKAAGVEK 415

Query: 1159 KKA 1167
            K A
Sbjct: 416  KSA 418


>ref|XP_006438506.1| hypothetical protein CICLE_v10031371mg [Citrus clementina]
            gi|557540702|gb|ESR51746.1| hypothetical protein
            CICLE_v10031371mg [Citrus clementina]
          Length = 484

 Score =  142 bits (359), Expect = 3e-31
 Identities = 133/423 (31%), Positives = 189/423 (44%), Gaps = 34/423 (8%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVA+K AYFEAHY                  + + ++D    T  DL   +
Sbjct: 59   EVEKCATPGSVAKKAAYFEAHYKKIAARKAELLDQEKQMDNDSSRLDNQ--TCGDLMADN 116

Query: 181  FQDHSENTQVSNSHFGLSDGEDEKTRDDVNNSDH----------VDEPKDDVFVNVKAKE 330
             ++ SE           SD  D +  DD+   +           VD+P  D  + V+ + 
Sbjct: 117  CKNKSE-----------SDISDHQRSDDIVYPETSLVNEVRGMPVDQPGGDAAIKVECQS 165

Query: 331  GPILETSDNGEVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTARKV 510
             P         V +V+E+ S   S  + K                  T K +++N++ ++
Sbjct: 166  SP---------VERVKEEKSRLESPTSNKPEEAV-----------VVTVKEDVENSSMRM 205

Query: 511  XV---------HPTT---EDRISAGTKKKS--ASPVTKSSRISTPTPKPVSKNISSSQQS 648
             +          P T   E+ +     K S   +PV K   IS    KP S    SS   
Sbjct: 206  VIVKELQEKEMEPATNVKEENVKLDHPKNSHKIAPVNKEKNISKIKKKPASPAAKSSP-- 263

Query: 649  VTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGSTLQRSKNSST-------- 804
            +TK + ++  +S +    + +K    S  S S SS K  NGS+L RSKN S         
Sbjct: 264  ITKASRIA--KSPHLSTPKVSKPTPMSTLSSSRSSTKIGNGSSLPRSKNLSAGESKKVAP 321

Query: 805  -SMHMSLSLGPPNST-ASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYS 978
             S+H+SLSLGP +S   S TT RKSLIME+MG+KDIVKRAFK FQN++NQ K   + R  
Sbjct: 322  KSLHISLSLGPSSSDPVSLTTTRKSLIMEKMGDKDIVKRAFKTFQNNYNQLKSSKEERSP 381

Query: 979  ESKKVFPKGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIER 1158
              K+V  KG+E ++  S TPRKE                         ++G+ K AG+E+
Sbjct: 382  APKQVTAKGAEPRV-PSLTPRKE-------------------------NAGSFKAAGVEK 415

Query: 1159 KKA 1167
            K A
Sbjct: 416  KSA 418


>gb|EMJ25426.1| hypothetical protein PRUPE_ppa018071mg, partial [Prunus persica]
          Length = 479

 Score =  140 bits (353), Expect = 1e-30
 Identities = 131/409 (32%), Positives = 188/409 (45%), Gaps = 41/409 (10%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDL---D 171
            E EKC TPGSVAQK+AYFEAHY                  +Q KQ+ +      D    D
Sbjct: 44   EVEKCATPGSVAQKRAYFEAHYKKIAARKAEELL------EQEKQMQDDPFRSDDQKGGD 97

Query: 172  EPHFQDHSENTQVSNSHFGLSDGEDEKTRDDVNNSDHVDEPKDDVFVNVKAK----EGPI 339
            +     H E   ++NS         E   D+   S HVD+ K+D  + ++ +    EG  
Sbjct: 98   QIDCGAHFE-IDLTNSQSTTQANYQETNFDNDTFSTHVDDLKEDDVITIECQSSLTEGEK 156

Query: 340  LETSDNGEVPKV----------EEQSSERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNL 489
             ET      P +          E ++    SQ  ++  +  D+E           P+ +L
Sbjct: 157  EETDSVTASPNLNNPEELVLEKEAENVPAVSQGIQEIPKSLDNEMGKAPEVKEEKPRLHL 216

Query: 490  KNTARKVXVHPTTEDRISAGTKKKSASPVTKSSRISTP---------TPKPVSKNISSSQ 642
            +  ++KV    + E  + A  KKK    +TK+ + STP         TP+ VSK IS+S 
Sbjct: 217  QKGSQKVTTGVSKERNV-ANVKKKPIPQITKTPQKSTPRMSKPISTSTPR-VSKPISTST 274

Query: 643  QSVTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPS-----ESSIKKLNGSTLQRSKNSST- 804
              V+K    S  R S  P++ S    S+S+++ +      SS+KK N S+L RSKN S  
Sbjct: 275  PRVSKPISTSTPRVSK-PISTSTPRASKSISTSTATPAPRSSVKKGNTSSLPRSKNPSIE 333

Query: 805  --------SMHMSLSLGPPNS-TASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKL 957
                    S+HMS SL P  S +AS TT RKS IME MG+KDIV+RAFK FQN++NQ K 
Sbjct: 334  DTKKVPPKSLHMSPSLDPAKSDSASPTTARKSFIMENMGDKDIVRRAFKTFQNNYNQPKS 393

Query: 958  EVDTRYSESKKVFPKGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSG 1104
              + + S   +  P     +       RKE     K ++++  Q    G
Sbjct: 394  SSEEKSSTPTQAAPSSFGLRNDERADKRKEAKSNPKEAERLHFQPKSKG 442


>gb|EOY00300.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508708406|gb|EOY00303.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 530

 Score =  137 bits (346), Expect = 9e-30
 Identities = 124/451 (27%), Positives = 187/451 (41%), Gaps = 43/451 (9%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDES----------- 147
            E EKC TPGSVA+KKAYFE HY                   +    D+            
Sbjct: 55   EVEKCATPGSVAKKKAYFEEHYKKIAARKAELQAQEKPMESKPFNSDDQNCGDLVGKSNG 114

Query: 148  ----------LPTILDLDEPHFQDHSENTQVS-----NSHFGLSDGEDEKTRDDVNNSDH 282
                         + ++ + HF +H+E  +++     +S  G+ +  D +    V     
Sbjct: 115  QCSNEGDKQETNWLSEVSDTHFDEHNEEPEIAIKSQNSSAEGVKEKIDSRVESQVIEKIE 174

Query: 283  VDEPKDDVFVNVKAKEGPILETS-----DNGEVPKVEEQSSERGSQDNRKEIRQADSEGX 447
                 ++      A E P L  S     D   + K   ++  +GSQD ++  + ++ +  
Sbjct: 175  SRVESEEKEEMDSAVESPKLIESEETAPDEAVLVKEAVETLPKGSQDEKELPQNSEKD-- 232

Query: 448  XXXXXXXXTPKPNLKNT-----ARKVXVHPTTEDRISAGTKKKSASPVTKSSRISTP--- 603
                    TPK   KN      A+   + P  ++R     KKK ASPVTK+ + STP   
Sbjct: 233  -----IKDTPKFKHKNLKLGHLAKSDKITPANKERNETRIKKKPASPVTKTPQFSTPKAS 287

Query: 604  --TPKPVSKNISSSQQSVTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGST 777
              T  P + + S +       +  S  ++    + +S KV+ RS                
Sbjct: 288  KPTSTPTTPSASRTPSKTKTTSSYSLPKTKIPSMGESKKVVPRS---------------- 331

Query: 778  LQRSKNSSTSMHMSLSLGPPNS-TASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGK 954
                      +HMSLSLGP  S  AS    RKSLIME+MG+KDIVKRAFK FQ++++Q K
Sbjct: 332  ----------LHMSLSLGPSGSGLASLPATRKSLIMEKMGDKDIVKRAFKTFQSNYHQLK 381

Query: 955  LEVDTRYSESKKVFPKGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSGA 1134
                 +Y+ SK+V  KG E ++S   TP+KE                         + G+
Sbjct: 382  PSSQEQYAASKQVPAKGREARVSTLMTPQKE-------------------------NGGS 416

Query: 1135 PKDAGIERKKANTVRP-AGT*TDILTDKLKE 1224
            P+ +G+E+K A       G  TD   D+ KE
Sbjct: 417  PRASGMEKKNAKAAPSYFGLKTDEWEDRRKE 447


>gb|EOY00298.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508708402|gb|EOY00299.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508708404|gb|EOY00301.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508708405|gb|EOY00302.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 517

 Score =  137 bits (346), Expect = 9e-30
 Identities = 124/451 (27%), Positives = 187/451 (41%), Gaps = 43/451 (9%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDES----------- 147
            E EKC TPGSVA+KKAYFE HY                   +    D+            
Sbjct: 55   EVEKCATPGSVAKKKAYFEEHYKKIAARKAELQAQEKPMESKPFNSDDQNCGDLVGKSNG 114

Query: 148  ----------LPTILDLDEPHFQDHSENTQVS-----NSHFGLSDGEDEKTRDDVNNSDH 282
                         + ++ + HF +H+E  +++     +S  G+ +  D +    V     
Sbjct: 115  QCSNEGDKQETNWLSEVSDTHFDEHNEEPEIAIKSQNSSAEGVKEKIDSRVESQVIEKIE 174

Query: 283  VDEPKDDVFVNVKAKEGPILETS-----DNGEVPKVEEQSSERGSQDNRKEIRQADSEGX 447
                 ++      A E P L  S     D   + K   ++  +GSQD ++  + ++ +  
Sbjct: 175  SRVESEEKEEMDSAVESPKLIESEETAPDEAVLVKEAVETLPKGSQDEKELPQNSEKD-- 232

Query: 448  XXXXXXXXTPKPNLKNT-----ARKVXVHPTTEDRISAGTKKKSASPVTKSSRISTP--- 603
                    TPK   KN      A+   + P  ++R     KKK ASPVTK+ + STP   
Sbjct: 233  -----IKDTPKFKHKNLKLGHLAKSDKITPANKERNETRIKKKPASPVTKTPQFSTPKAS 287

Query: 604  --TPKPVSKNISSSQQSVTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGST 777
              T  P + + S +       +  S  ++    + +S KV+ RS                
Sbjct: 288  KPTSTPTTPSASRTPSKTKTTSSYSLPKTKIPSMGESKKVVPRS---------------- 331

Query: 778  LQRSKNSSTSMHMSLSLGPPNS-TASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGK 954
                      +HMSLSLGP  S  AS    RKSLIME+MG+KDIVKRAFK FQ++++Q K
Sbjct: 332  ----------LHMSLSLGPSGSGLASLPATRKSLIMEKMGDKDIVKRAFKTFQSNYHQLK 381

Query: 955  LEVDTRYSESKKVFPKGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSGA 1134
                 +Y+ SK+V  KG E ++S   TP+KE                         + G+
Sbjct: 382  PSSQEQYAASKQVPAKGREARVSTLMTPQKE-------------------------NGGS 416

Query: 1135 PKDAGIERKKANTVRP-AGT*TDILTDKLKE 1224
            P+ +G+E+K A       G  TD   D+ KE
Sbjct: 417  PRASGMEKKNAKAAPSYFGLKTDEWEDRRKE 447


>ref|XP_004504495.1| PREDICTED: uncharacterized protein LOC101508782, partial [Cicer
            arietinum]
          Length = 362

 Score =  137 bits (344), Expect = 1e-29
 Identities = 118/366 (32%), Positives = 167/366 (45%), Gaps = 14/366 (3%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVAQKKAYFEAHY                   Q KQ +    +    D+  
Sbjct: 41   EVEKCATPGSVAQKKAYFEAHYKKIAARKAELLA-------QEKQTEND--SFRSEDQNG 91

Query: 181  FQDHSENTQVSNSHFGLSDGEDEKTRDDVNN----------SDHVDEPKDDVFVNVKAKE 330
                  NT  ++S FG+S+   + T + V            + HVD+ K++  V++   +
Sbjct: 92   IDLSGRNTCETDSDFGISNNTQDTTDECVTQETSSAVGEIGTSHVDDLKEEGTVSIDYNQ 151

Query: 331  GPILETSDNG-EVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTARK 507
             P +E  +   E  +VEE+  +     N  ++   + E                 N A+ 
Sbjct: 152  SPSVEVDNKELEASQVEEKDVKLDHHPNEPKVISVNREN----------------NVAK- 194

Query: 508  VXVHPTTEDRISAGTKKKSASPVTKSSRISTPTPKPVSKNISSSQQSVTKVNGLSYQRSS 687
                          TKKKS  P +K S+ STP          +S+ ++T +  L+   S+
Sbjct: 195  --------------TKKKSVLPKSKVSQSSTPR---------TSRPTLTPIKTLASAPST 231

Query: 688  NAPVAQSNKVLSRSVNSPSESSIKKLNGSTLQRSKNSSTSMHMSLSLGPPN-STASTTTM 864
                        +  NS S    K++     +  K ++ S+HMS+SLGP N      TTM
Sbjct: 232  ------------KKANSSSLPK-KQIASGVAENKKVANRSLHMSMSLGPSNPDPVPHTTM 278

Query: 865  RKSLIMERMGEKDIVKRAFKAFQNSFNQGKL--EVDTRYSESKKVFPKGSEQKISASPTP 1038
            RKSLIME+MG+KDIVKRAFK FQN FNQ K   EVD R S +K+V  +G+  K+  S   
Sbjct: 279  RKSLIMEQMGDKDIVKRAFKTFQNKFNQPKASGEVD-RSSVTKQVSSRGTASKVPTSTAL 337

Query: 1039 RKEIDR 1056
            RKE  R
Sbjct: 338  RKENGR 343


>gb|EOY00304.1| Uncharacterized protein isoform 7 [Theobroma cacao]
          Length = 518

 Score =  135 bits (339), Expect = 6e-29
 Identities = 124/452 (27%), Positives = 187/452 (41%), Gaps = 44/452 (9%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDES----------- 147
            E EKC TPGSVA+KKAYFE HY                   +    D+            
Sbjct: 55   EVEKCATPGSVAKKKAYFEEHYKKIAARKAELQAQEKPMESKPFNSDDQNCGDLVGKSNG 114

Query: 148  ----------LPTILDLDEPHFQDHSENTQVS-----NSHFGLSDGEDEKTRDDVNNSDH 282
                         + ++ + HF +H+E  +++     +S  G+ +  D +    V     
Sbjct: 115  QCSNEGDKQETNWLSEVSDTHFDEHNEEPEIAIKSQNSSAEGVKEKIDSRVESQVIEKIE 174

Query: 283  VDEPKDDVFVNVKAKEGPILETS-----DNGEVPKVEEQSSERGSQDNRKEIRQADSEGX 447
                 ++      A E P L  S     D   + K   ++  +GSQD ++  + ++ +  
Sbjct: 175  SRVESEEKEEMDSAVESPKLIESEETAPDEAVLVKEAVETLPKGSQDEKELPQNSEKD-- 232

Query: 448  XXXXXXXXTPKPNLKNT-----ARKVXVHPTTEDRISAGTKKKSASPVTKSSRISTP--- 603
                    TPK   KN      A+   + P  ++R     KKK ASPVTK+ + STP   
Sbjct: 233  -----IKDTPKFKHKNLKLGHLAKSDKITPANKERNETRIKKKPASPVTKTPQFSTPKAS 287

Query: 604  --TPKPVSKNISSSQQSVTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGST 777
              T  P + + S +       +  S  ++    + +S KV+ RS                
Sbjct: 288  KPTSTPTTPSASRTPSKTKTTSSYSLPKTKIPSMGESKKVVPRS---------------- 331

Query: 778  LQRSKNSSTSMHMSLSLGPPNS-TASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGK 954
                      +HMSLSLGP  S  AS    RKSLIME+MG+KDIVKRAFK FQ++++Q K
Sbjct: 332  ----------LHMSLSLGPSGSGLASLPATRKSLIMEKMGDKDIVKRAFKTFQSNYHQLK 381

Query: 955  LEVDTRYSESKKVFP-KGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSG 1131
                 +Y+ SK+  P KG E ++S   TP+KE                         + G
Sbjct: 382  PSSQEQYAASKQQVPAKGREARVSTLMTPQKE-------------------------NGG 416

Query: 1132 APKDAGIERKKANTVRP-AGT*TDILTDKLKE 1224
            +P+ +G+E+K A       G  TD   D+ KE
Sbjct: 417  SPRASGMEKKNAKAAPSYFGLKTDEWEDRRKE 448


>gb|EXB82666.1| hypothetical protein L484_027847 [Morus notabilis]
          Length = 504

 Score =  134 bits (337), Expect = 9e-29
 Identities = 123/395 (31%), Positives = 186/395 (47%), Gaps = 36/395 (9%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVAQKKAYFEAHY                   Q  Q D       + D+P+
Sbjct: 42   EVEKCATPGSVAQKKAYFEAHYKKIAAKKAELLEQEK----QQAQNDSMRSEDNEEDDPN 97

Query: 181  FQDHSENTQVSNSHFGLSDG----EDEKTRDDVNNSDHVDEPK-----------DDVFVN 315
              D   NT   ++   +S+     E+E  ++ + +++ +   K           ++  ++
Sbjct: 98   GGDLIRNTNSKDARIDVSEDQISVEEEVKKEPILSNEKMSGEKINDLKLGVVISEECQIS 157

Query: 316  VKAKEGPILETSDNGEVPKVEE-----QSSERGSQDNRKEIRQADS-EGXXXXXXXXXTP 477
            V  +EG +     + ++ K E+     +  E  S D++ ++   +S +            
Sbjct: 158  VVEREGELDTRVASPKLGKAEQDDIFVKEVEAISIDSQPKMEAPESLKSELVYDSKVKEE 217

Query: 478  KPNLKNTARKVXVHPTTEDRISAGTKKKSASPVTKSSRISTPTPKPVSKNISSSQQSVTK 657
            K  L +  +   V    ++R  A  KKK  S +T++ + S  TP+ VSK +  S      
Sbjct: 218  KVKLVDQNQPQKVTAVDKERTVAKAKKKPVSQLTRTPKSSNSTPR-VSKPVQIS------ 270

Query: 658  VNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGSTLQRSKNSSTSMHMSLSLGPP 837
                    S  +P +QS+   ++  N+ ++S  +  N S+ +  K  S S+HMSLSLGP 
Sbjct: 271  --------SRVSPASQSS---TKKSNTITQSLQRNKNPSSGETKKVVSKSLHMSLSLGPR 319

Query: 838  NSTA-------STTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESK--K 990
            N  +       + TT RKSL ME+MG+KDIVKRAFKAFQN+FNQ +   D   S  K  +
Sbjct: 320  NLNSPANLDLPAITTPRKSLFMEKMGDKDIVKRAFKAFQNNFNQARSYGDDGSSLQKQVQ 379

Query: 991  VFPKGSEQKISASPTPRKE------IDRLRKTSDK 1077
            V  K  E K+S + TPRKE       DRL K S K
Sbjct: 380  VTTKRPEPKVSTTITPRKENVGSLKTDRLDKRSVK 414


>ref|XP_006584485.1| PREDICTED: uncharacterized protein LOC100306130 isoform X2 [Glycine
            max] gi|571468881|ref|XP_006584486.1| PREDICTED:
            uncharacterized protein LOC100306130 isoform X3 [Glycine
            max] gi|571468883|ref|XP_006584487.1| PREDICTED:
            uncharacterized protein LOC100306130 isoform X4 [Glycine
            max]
          Length = 481

 Score =  134 bits (337), Expect = 9e-29
 Identities = 137/438 (31%), Positives = 195/438 (44%), Gaps = 30/438 (6%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVAQKKAYFEAHY                   QAKQ+++  P        +
Sbjct: 41   EVEKCATPGSVAQKKAYFEAHYKNIAARKAELLA-------QAKQMEKDSPRS---QRQN 90

Query: 181  FQDHSENTQVSNSHFGLSD--GEDEKTRDDVNNSDH-----VDEPKDDVFVNVK------ 321
             +D S NT  +++   +S   G  E  + + N+        V    +DV V++       
Sbjct: 91   GEDLSCNTCGTDAECDMSSTQGSSEGVKQETNSIGEIVRTDVSNLMEDVAVSIDYQGSSV 150

Query: 322  --AKEGPILET-------SDNGEVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXXT 474
               KE   LE+         + EV  VE+  S+  S +   E  +  S            
Sbjct: 151  EGEKENEELESRLGSSQIDKHEEVVCVEQGGSKEESPNTEAEDVKEISHNVNNE------ 204

Query: 475  PKPNLKNTARKVXV-HPTTEDRISAGTKKKSASPVTKSSRISTPTPKPVSKNISSSQQSV 651
            P    +N A+ V + HP    +++   ++ +A+   K S +ST  PK       +SQ S 
Sbjct: 205  PAKTSENEAKYVTLDHPKVSKKVTPVNRESNATKAKKKSMLSTSKPK-------ASQFST 257

Query: 652  TKVNGLSYQRSSNAPVAQSNKVLSRSVN-----SPSESSIKKLNGSTLQRSKNSSTSMHM 816
             +         S+ P +   K L+ + +     SPS S  +K+N ST +  K  + S+HM
Sbjct: 258  PR---------SSKPTSTPTKTLASASSTKRGISPSISG-RKIN-STSENRKVPNKSLHM 306

Query: 817  SLSLGPPN-STASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKV 993
            SLSL P     AS TTMRKSLIME+MG+KDIVKRAFK FQN+FNQ K   + +    +KV
Sbjct: 307  SLSLAPSQPDPASHTTMRKSLIMEKMGDKDIVKRAFKTFQNNFNQPKTSGENKSLVKEKV 366

Query: 994  FPKGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIERKKANT 1173
              K +E +   S T RKE                            +PK   ++R+  N 
Sbjct: 367  PSKVTESRNPTSITLRKE-------------------------DGQSPKVDSMDRRSVNA 401

Query: 1174 VRPA-GT*TDILTDKLKE 1224
            VR A G   D+  +K KE
Sbjct: 402  VRTAFGLKGDVKAEKGKE 419


>ref|XP_006584484.1| PREDICTED: uncharacterized protein LOC100306130 isoform X1 [Glycine
            max]
          Length = 482

 Score =  134 bits (337), Expect = 9e-29
 Identities = 137/438 (31%), Positives = 195/438 (44%), Gaps = 30/438 (6%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVAQKKAYFEAHY                   QAKQ+++  P        +
Sbjct: 42   EVEKCATPGSVAQKKAYFEAHYKNIAARKAELLA-------QAKQMEKDSPRS---QRQN 91

Query: 181  FQDHSENTQVSNSHFGLSD--GEDEKTRDDVNNSDH-----VDEPKDDVFVNVK------ 321
             +D S NT  +++   +S   G  E  + + N+        V    +DV V++       
Sbjct: 92   GEDLSCNTCGTDAECDMSSTQGSSEGVKQETNSIGEIVRTDVSNLMEDVAVSIDYQGSSV 151

Query: 322  --AKEGPILET-------SDNGEVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXXT 474
               KE   LE+         + EV  VE+  S+  S +   E  +  S            
Sbjct: 152  EGEKENEELESRLGSSQIDKHEEVVCVEQGGSKEESPNTEAEDVKEISHNVNNE------ 205

Query: 475  PKPNLKNTARKVXV-HPTTEDRISAGTKKKSASPVTKSSRISTPTPKPVSKNISSSQQSV 651
            P    +N A+ V + HP    +++   ++ +A+   K S +ST  PK       +SQ S 
Sbjct: 206  PAKTSENEAKYVTLDHPKVSKKVTPVNRESNATKAKKKSMLSTSKPK-------ASQFST 258

Query: 652  TKVNGLSYQRSSNAPVAQSNKVLSRSVN-----SPSESSIKKLNGSTLQRSKNSSTSMHM 816
             +         S+ P +   K L+ + +     SPS S  +K+N ST +  K  + S+HM
Sbjct: 259  PR---------SSKPTSTPTKTLASASSTKRGISPSISG-RKIN-STSENRKVPNKSLHM 307

Query: 817  SLSLGPPN-STASTTTMRKSLIMERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKV 993
            SLSL P     AS TTMRKSLIME+MG+KDIVKRAFK FQN+FNQ K   + +    +KV
Sbjct: 308  SLSLAPSQPDPASHTTMRKSLIMEKMGDKDIVKRAFKTFQNNFNQPKTSGENKSLVKEKV 367

Query: 994  FPKGSEQKISASPTPRKEIDRLRKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIERKKANT 1173
              K +E +   S T RKE                            +PK   ++R+  N 
Sbjct: 368  PSKVTESRNPTSITLRKE-------------------------DGQSPKVDSMDRRSVNA 402

Query: 1174 VRPA-GT*TDILTDKLKE 1224
            VR A G   D+  +K KE
Sbjct: 403  VRTAFGLKGDVKAEKGKE 420


>ref|XP_002311790.2| hypothetical protein POPTR_0008s19710g, partial [Populus trichocarpa]
            gi|550333484|gb|EEE89157.2| hypothetical protein
            POPTR_0008s19710g, partial [Populus trichocarpa]
          Length = 421

 Score =  133 bits (334), Expect = 2e-28
 Identities = 128/403 (31%), Positives = 176/403 (43%), Gaps = 46/403 (11%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQID-----ESLPTILD 165
            E EKC +PGSVA+KKAYFEAHY                  DQ KQ++     E+   I D
Sbjct: 55   EVEKCASPGSVAEKKAYFEAHYKKIAARKAELF-------DQEKQMEHESSMENNHNIGD 107

Query: 166  LDEPHFQDHSENTQVSNSHFGLSDGEDEKTRDDVNNSDHVDEPKDDVFVNVKAK------ 327
            L   + Q  S +  VSN          E   D+  +  HVDEP +D  ++V  +      
Sbjct: 108  LTGKNGQTDS-SFDVSNGQTSAEGIWHESKLDNERDGGHVDEPYEDAAIDVHGQASLSGL 166

Query: 328  --------------EGPILETSDNG----EVPKVEE----QSSERGSQDNRKEIRQADSE 441
                           G + E  +N     E  K+EE    +  E+G QD R+  + ++ E
Sbjct: 167  YEDAANDVQSQASSNGRVKEELENKLDSPESTKLEELALIKEEEKGYQDTRELPKNSEKE 226

Query: 442  GXXXXXXXXXTPKPNLKNTARKVXVHPTTEDRISAGTKKKSASPVTKSSRISTPTPKPVS 621
                        K + +  + K+   P ++ R  A  KKK    VTK  +ISTP    VS
Sbjct: 227  KESILMIKEEKVKFDHQRGSSKII--PLSKVRDIARAKKKPEPLVTKQPQISTPK---VS 281

Query: 622  KNISSSQQSVTKVNGLSYQRSSNAPVAQSNKVLSRSVNSPSESSIKKLNGSTLQRSKN-- 795
            K + +S                             S  S S+SS KK+NGS L RSKN  
Sbjct: 282  KRVPTS-----------------------------SSLSASQSSTKKMNGSLLPRSKNPP 312

Query: 796  -------SSTSMHMSLSLGPPNSTASTT-TMRKSLIMERMGEKDIVKRAFKAFQNSFNQG 951
                   +S S+H+SL++ P NS      T RKS I E+MG+KDIVKRAFK FQN+F+Q 
Sbjct: 313  AGENKKVTSKSLHLSLTMDPSNSEPDPLITTRKSFIREKMGDKDIVKRAFKTFQNNFSQL 372

Query: 952  KLEVDTRYSESKKVFPKGSEQKISAS---PTPRKEIDRLRKTS 1071
            K   + R    K+   +   +K+  S   PTP     R +K S
Sbjct: 373  KSSAEERAIREKQEEKEEEIKKLRHSNFKPTPMPGFYRAQKAS 415


>ref|XP_003517869.1| PREDICTED: neurofilament medium polypeptide isoform X1 [Glycine max]
            gi|571434004|ref|XP_006573072.1| PREDICTED: neurofilament
            medium polypeptide isoform X2 [Glycine max]
            gi|571434006|ref|XP_006573073.1| PREDICTED: neurofilament
            medium polypeptide isoform X3 [Glycine max]
            gi|571434008|ref|XP_006573074.1| PREDICTED: neurofilament
            medium polypeptide isoform X4 [Glycine max]
          Length = 490

 Score =  133 bits (334), Expect = 2e-28
 Identities = 126/416 (30%), Positives = 176/416 (42%), Gaps = 7/416 (1%)
 Frame = +1

Query: 1    EAEKCKTPGSVAQKKAYFEAHYXXXXXXXXXXXXXXXXXXDQAKQIDESLPTILDLDEPH 180
            E EKC TPGSVAQKKAYFEAHY                    +   +E     L  +   
Sbjct: 57   EVEKCATPGSVAQKKAYFEAHYKKVAARKAELLAQEKQREKDSFGSEEHSGIDLSGNTDA 116

Query: 181  FQDHSENTQVSN---SHFGLSDGEDEKTRDDVNNSDH---VDEPKDDVFVNVKAKEGPIL 342
              D S NTQ S+    H   S GE  KT   VN S+    V        V V+ KE   L
Sbjct: 117  EHDISNNTQGSSEGVEHETSSAGEIHKTH--VNESEEEFAVSRDYQSSSVQVENKE---L 171

Query: 343  ETSDNGEVPKVEEQSSERGSQDNRKEIRQADSEGXXXXXXXXXTPKPNLKNTARKVXVHP 522
            E+  +      E ++  +   ++     +A+            T K + +   + V ++ 
Sbjct: 172  ESRSHSSYQIDEPENVCKKQVESPNNNIEAEDVKEISHVVYKETGKAS-EGEVKDVKLNH 230

Query: 523  TTEDRISAGTKKKSASPVTKSSRISTPTPKPVSKNISSSQQSVTKVNGLSYQRSSNAPVA 702
              E ++ + +K  +A+   K S + T    P+S   SS   S T    ++   +S+    
Sbjct: 231  PKESKVKSVSKGSNAARTKKKSMLPTSKASPISTPKSSKPASTTPTKTVT--PASSTRKG 288

Query: 703  QSNKVLSRSVNSPSESSIKKLNGSTLQRSKNSSTSMHMSLSLGPPN-STASTTTMRKSLI 879
             S  +  R + S  ES             K ++  +HMSLSL P N   A  +TMR+SLI
Sbjct: 289  SSPSLTRRQITSSGES------------RKFANKPLHMSLSLAPSNPDPAPQSTMRRSLI 336

Query: 880  MERMGEKDIVKRAFKAFQNSFNQGKLEVDTRYSESKKVFPKGSEQKISASPTPRKEIDRL 1059
            ME MG+KDIVKRAFK FQNSFNQ K  V+ +    K+V  +G+  K+  S T RKE  R 
Sbjct: 337  MENMGDKDIVKRAFKTFQNSFNQPKTSVEDKSLIKKQVPSRGTVSKVPTSTTLRKENGRP 396

Query: 1060 RKTSDKVIAQKCQSGTRSNSLSSGAPKDAGIERKKANTVRPAGT*TDILTDKLKEK 1227
             K  +   +      T        A K     RK        G     L  K+KE+
Sbjct: 397  TKVENLYQSGNAVRTTLGPKRDIRAEKGKESSRKIEEKSNTKGVERTRLQSKVKEE 452

