BLASTX nr result

ID: Paeonia22_contig00011196 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00011196
         (1351 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853...   207   9e-51
ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222...   184   6e-44
ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293...   172   3e-40
gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis]     170   1e-39
ref|XP_007155691.1| hypothetical protein PHAVU_003G223000g, part...   166   2e-38
ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492...   164   1e-37
ref|XP_007209536.1| hypothetical protein PRUPE_ppa010718mg [Prun...   163   2e-37
ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago ...   161   5e-37
ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citr...   157   1e-35
ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618...   155   4e-35
ref|XP_007039763.1| Uncharacterized protein isoform 1 [Theobroma...   150   2e-33
ref|XP_007039766.1| Uncharacterized protein isoform 4 [Theobroma...   148   6e-33
ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812...   144   1e-31
ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferas...   139   3e-30
ref|XP_002531462.1| conserved hypothetical protein [Ricinus comm...   130   1e-27
ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604...   120   2e-24
ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263...   120   2e-24
ref|XP_006440253.1| hypothetical protein CICLE_v10022000mg [Citr...   115   3e-23
ref|XP_007052276.1| Uncharacterized protein TCM_005685 [Theobrom...    97   2e-17
ref|XP_006448443.1| hypothetical protein CICLE_v10016320mg [Citr...    94   1e-16

>ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853295 [Vitis vinifera]
            gi|296085701|emb|CBI29500.3| unnamed protein product
            [Vitis vinifera]
          Length = 240

 Score =  207 bits (527), Expect = 9e-51
 Identities = 128/251 (50%), Positives = 150/251 (59%), Gaps = 5/251 (1%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXXX 1039
            MATAPVKSQPLHNF LSF  WG KNQMNNHRCR+ VDA    SP    +           
Sbjct: 1    MATAPVKSQPLHNFPLSFLKWG-KNQMNNHRCRKPVDALRE-SPPDGRKNESEPDSDGGS 58

Query: 1038 XXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQ--IMEKESAEVDDDERGKAYEVDE 865
                   NRK P+GSRT R+R A +  + +EK+QK   ++E+E  EVD+ E        E
Sbjct: 59   KNESDSENRKLPLGSRTARSRHAVASPSPVEKAQKNQALVEREGGEVDEGE-------GE 111

Query: 864  ASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANLPKSLRLRG---SQG 694
             S  KPWNLRPR+                  LQE VP V   + N PKSLRLRG   S  
Sbjct: 112  ESVQKPWNLRPRKAVSKSPIEIGVAPKNGE-LQEAVPGVPHSE-NQPKSLRLRGFAESHS 169

Query: 693  TEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGLWLVGFTAD 514
            +EKKEK KFWISLS+EEIEEDIFVMTGS          + VQKQ+DNVFPGLWLVG T D
Sbjct: 170  SEKKEKRKFWISLSREEIEEDIFVMTGSKPARRPKKRAKNVQKQLDNVFPGLWLVGVTPD 229

Query: 513  AYRVLEAPIKK 481
            +YR+ +AP K+
Sbjct: 230  SYRLPDAPAKR 240


>ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222282 [Cucumis sativus]
            gi|449488652|ref|XP_004158130.1| PREDICTED:
            uncharacterized LOC101222282 [Cucumis sativus]
          Length = 246

 Score =  184 bits (468), Expect = 6e-44
 Identities = 114/256 (44%), Positives = 144/256 (56%), Gaps = 10/256 (3%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMN-NHRCRRSVDAPP-PPSPHLDHRXXXXXXXXX 1045
            MAT PVKSQPLHNF+L F  WG KNQ N NHR RR++       SP +DH          
Sbjct: 1    MATGPVKSQPLHNFALPFLKWGGKNQTNSNHRIRRAIGGGGGDSSPAVDHSEPESEAD-- 58

Query: 1044 XXXXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAYEVD- 868
                    +  +  VGSRT RNR+AFSPC+L +K  K    +   EV  +++ +  EV+ 
Sbjct: 59   --------SKPQLRVGSRTVRNRLAFSPCSLGDKFAKHSEGEVGDEVVKEQKREGEEVEG 110

Query: 867  EASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVV----QLGDANLPKSLRLRG- 703
            E    KPWNLRPR+                 E+   V       Q G+   PKSLRLRG 
Sbjct: 111  EEIVQKPWNLRPRKGTSLRGYGDLKNGGDLQEMDGAVSSAAGASQQGENPQPKSLRLRGF 170

Query: 702  --SQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGLWLV 529
              S   EKK+K KFWI+LS++EIEEDIF+MTGS          + VQKQ+D VFPGLWLV
Sbjct: 171  TESHRIEKKDKRKFWIALSRDEIEEDIFIMTGSRPSRRPKKRPKNVQKQLDTVFPGLWLV 230

Query: 528  GFTADAYRVLEAPIKK 481
            G TAD+YR+ ++P K+
Sbjct: 231  GVTADSYRLADSPAKR 246


>ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293977 [Fragaria vesca
            subsp. vesca]
          Length = 239

 Score =  172 bits (436), Expect = 3e-40
 Identities = 114/264 (43%), Positives = 140/264 (53%), Gaps = 18/264 (6%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMN-NHRCRRSVDAPPPPSPHLDHRXXXXXXXXXX 1042
            MATAPVK  PLHNF LSF  WG+KN  N NHR RR V A P PS   D            
Sbjct: 1    MATAPVKP-PLHNFPLSFLKWGSKNHTNTNHRYRRPVSAEPEPSADDDR----------- 48

Query: 1041 XXXXXXENNRKTP-----VGSRTTRNRVAFSPCN-LLEKSQKQIMEKESAEVDDDERGKA 880
                   N+ ++P     VGSRT R+R + + C+  L +  ++  E+   +VDDD +  A
Sbjct: 49   -------NDSESPPQHHRVGSRTARHRFSLASCSEKLPQRNEKASEESDDDVDDDAKAAA 101

Query: 879  YEV----DEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANLPKSLR 712
                   +EA   KPWNLRPRR                  + E     Q  +   PKS+R
Sbjct: 102  VAAVAAAEEAEVQKPWNLRPRRAPVTKANNNTGGE-----VHEAEGTKQ-SEQPAPKSMR 155

Query: 711  LRGSQGT-------EKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDN 553
            LRG           +KKEK KFWI+LSK+EIEEDIF+MTGS          + VQKQ+DN
Sbjct: 156  LRGLAAAAEGPSMEKKKEKRKFWIALSKDEIEEDIFIMTGSRPARRPKKRPKNVQKQLDN 215

Query: 552  VFPGLWLVGFTADAYRVLEAPIKK 481
             FPGLWLVGFTADAYR  ++P KK
Sbjct: 216  CFPGLWLVGFTADAYRGSDSPTKK 239


>gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis]
          Length = 268

 Score =  170 bits (431), Expect = 1e-39
 Identities = 118/259 (45%), Positives = 142/259 (54%), Gaps = 13/259 (5%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGN-KNQMN-NHRCRRSVDAPPPPSPHLDHRXXXXXXXXX 1045
            MATAPVKS PLHNF L F  WG  KN  + +HRCRR++ A    SP  DH          
Sbjct: 1    MATAPVKS-PLHNFPLPFLKWGGGKNHASGSHRCRRTISADS--SPVADH---CDAAEQE 54

Query: 1044 XXXXXXXENNRKTPVGSRTTRNRVA--FSPCNLL--EKSQKQIMEKESAEVDDDERGKAY 877
                   E NR   VGSRT RNR A  F+ C+L+  +K   ++   E  E DD E   A 
Sbjct: 55   RNESSEAEPNRFHRVGSRTVRNRFAAPFASCSLVSEKKESDEVAAGEGKEGDDREVEAAA 114

Query: 876  EVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANL----PKSLRL 709
              +E    KPWNLRPR+                 E +  V        NL    PKS+RL
Sbjct: 115  GEEEMMVQKPWNLRPRKALFSKAATNGAKSGELPEQENAVAGGGHQSENLNQQPPKSMRL 174

Query: 708  RG---SQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGL 538
            RG   SQ + +KEK KFWI+LS+EEIEEDIFVMTGS          + VQKQ+D VFPGL
Sbjct: 175  RGLSESQQSSEKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQLDAVFPGL 234

Query: 537  WLVGFTADAYRVLEAPIKK 481
            WLVG TADAYR+++AP K+
Sbjct: 235  WLVGITADAYRIVDAPAKE 253


>ref|XP_007155691.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris]
            gi|593785303|ref|XP_007155692.1| hypothetical protein
            PHAVU_003G223000g, partial [Phaseolus vulgaris]
            gi|561029045|gb|ESW27685.1| hypothetical protein
            PHAVU_003G223000g, partial [Phaseolus vulgaris]
            gi|561029046|gb|ESW27686.1| hypothetical protein
            PHAVU_003G223000g, partial [Phaseolus vulgaris]
          Length = 306

 Score =  166 bits (420), Expect = 2e-38
 Identities = 114/270 (42%), Positives = 137/270 (50%), Gaps = 20/270 (7%)
 Frame = -2

Query: 1230 IRAHMATAP----VKSQPLHNFSLSFPTWG-----NKNQMNNHRCRRSVDAPPPPSPHLD 1078
            +R  MATAP    VKSQPLHNF+L F  WG     + N  ++HRCRR      P S   D
Sbjct: 53   LRFSMATAPAQPPVKSQPLHNFALPFLKWGASGKNHTNAAHHHRCRR------PSSLSSD 106

Query: 1077 HRXXXXXXXXXXXXXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESA---E 907
            H                  ++R   VGSRTTRNR A   C+L          +  +   E
Sbjct: 107  HASEPDSDP----------DSRPHRVGSRTTRNRFALPTCSLKPLPPPPEPPQPPSCNDE 156

Query: 906  VDDDERGKAYEVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVP-----VVQL 742
             DD+   +  E  E +  KPWNLRPR+                      V      V   
Sbjct: 157  TDDEAAKRDIEDAEEAVQKPWNLRPRKPALPKSALEIGTGPSRNHANNGVGEFHDGVSHH 216

Query: 741  GDANLPKSLRLRG---SQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTV 571
            G+   PKSLRLRG   +Q  EKKEK KFWI+LS+EEIEEDIFVMTGS          + V
Sbjct: 217  GENPAPKSLRLRGFADTQCAEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKRPKNV 276

Query: 570  QKQVDNVFPGLWLVGFTADAYRVLEAPIKK 481
            QKQ+D+VFPGLWLVG TADAYRV + P K+
Sbjct: 277  QKQMDSVFPGLWLVGITADAYRVPDTPTKR 306


>ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492028 [Cicer arietinum]
          Length = 242

 Score =  164 bits (414), Expect = 1e-37
 Identities = 114/258 (44%), Positives = 132/258 (51%), Gaps = 13/258 (5%)
 Frame = -2

Query: 1215 ATAPVKSQPLHNFSLSFPTWGN--KNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXX 1042
            A APVKSQPLHNFSL F  WG   KN  N++  +RS   P   SP  D            
Sbjct: 4    APAPVKSQPLHNFSLPFLKWGGTGKNHTNSNNHQRSRRPPDHASPEPDSEP--------- 54

Query: 1041 XXXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAYE-VDE 865
                   ++R   +GSRT RNR      +    S +      + E DDD   +  E  DE
Sbjct: 55   -------DSRPHRLGSRTARNRFGLPSSS---SSHRHATVSSNHETDDDAGDRKREGEDE 104

Query: 864  ASAH----KPWNLRPRRXXXXXXXXXXXXXXXXXELQ--ETVPVVQL-GDANLPKSLRLR 706
            A A     KPWNLRPR+                      E V  V   GD   PKSLRLR
Sbjct: 105  AGAEEIVQKPWNLRPRKPMIPRGAFEIGAGGSRNNHNGGELVEAVNNNGDNPTPKSLRLR 164

Query: 705  G---SQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGLW 535
            G   +  TEKKEK KFWI+LSKEEIEEDIFVMTGS          + VQKQ+D+VFPGLW
Sbjct: 165  GFADTSCTEKKEKRKFWIALSKEEIEEDIFVMTGSRPNRRPRKRPKNVQKQMDSVFPGLW 224

Query: 534  LVGFTADAYRVLEAPIKK 481
            LVG TADAYRV + P K+
Sbjct: 225  LVGITADAYRVADTPTKR 242


>ref|XP_007209536.1| hypothetical protein PRUPE_ppa010718mg [Prunus persica]
            gi|462405271|gb|EMJ10735.1| hypothetical protein
            PRUPE_ppa010718mg [Prunus persica]
          Length = 238

 Score =  163 bits (412), Expect = 2e-37
 Identities = 108/252 (42%), Positives = 129/252 (51%), Gaps = 7/252 (2%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQM---NNHRCRRSVDAPPPPSPHLDHRXXXXXXXX 1048
            MATAPVK  PLHNF L+F  WG KN     NNHR RR V A P   P  +          
Sbjct: 1    MATAPVKP-PLHNFPLAFLKWGAKNNSTTNNNHRYRRPVSAEPASEPDSESERTHY---- 55

Query: 1047 XXXXXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAYEVD 868
                      N      SR +R+R +  PC      +++  E+ES + + +E  KA  V 
Sbjct: 56   ----------NNSRVGSSRASRHRYSLIPC--AGDKRRRSEERESDQEEGEEADKAEVV- 102

Query: 867  EASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANLPKSLRLRG----S 700
                HKPWNLRPRR                 EL+   P     +   PKS+RLRG     
Sbjct: 103  ----HKPWNLRPRRAPATTSFSKGGANGEPHELESPNP--NQSELQQPKSMRLRGLAAEG 156

Query: 699  QGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGLWLVGFT 520
            Q  EKKE  KFWI+LSKEEIEEDIFVMTGS          + VQKQ+D  FPGLWLVG T
Sbjct: 157  QNVEKKENRKFWIALSKEEIEEDIFVMTGSRPARRPKKRPKNVQKQLDITFPGLWLVGVT 216

Query: 519  ADAYRVLEAPIK 484
            ADAY+V ++P K
Sbjct: 217  ADAYKVADSPSK 228


>ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago truncatula]
            gi|355509729|gb|AES90871.1| hypothetical protein
            MTR_4g100570 [Medicago truncatula]
          Length = 243

 Score =  161 bits (408), Expect = 5e-37
 Identities = 116/271 (42%), Positives = 130/271 (47%), Gaps = 25/271 (9%)
 Frame = -2

Query: 1218 MATAP--VKSQPLHNFSLSFPTWG-----NKNQMNNHRCRRSVDAPPPPSPHLDHRXXXX 1060
            MAT P  VKSQPLHNFSL F  WG     N N  N+HR RR  D    P    D R    
Sbjct: 1    MATTPASVKSQPLHNFSLPFLKWGGTGKNNTNATNHHRSRRPPDHASEPDSEPDSRPHR- 59

Query: 1059 XXXXXXXXXXXXENNRKTPVGSRTTRNRVAFS-----------PCNLLEKSQKQIMEKES 913
                               +GSRT RNR  F+           P +  E        K  
Sbjct: 60   -------------------LGSRTARNRFGFASSSSQRQAPPTPSSNNETDDNAGDRKRD 100

Query: 912  AEVDDDERGKAYEVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXE----LQETVPVVQ 745
            AE D +  G A E+      KPWNLRPR+                      LQE V    
Sbjct: 101  AEDDAEAGGGAEEI----VQKPWNLRPRKPMIPRGGFEIGAGGSRNNNGGELQEGVN--- 153

Query: 744  LGDANLPKSLRLRGSQGT---EKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRT 574
             G+   PKSLRLRG   T   EKKEK KFWI+LSK+EIEEDIFVMTGS          + 
Sbjct: 154  -GENPAPKSLRLRGFADTNCGEKKEKRKFWIALSKDEIEEDIFVMTGSRPNRRPRKRAKN 212

Query: 573  VQKQVDNVFPGLWLVGFTADAYRVLEAPIKK 481
            VQKQ+DNVFPGLWLVG TADAYRV + P K+
Sbjct: 213  VQKQMDNVFPGLWLVGITADAYRVADTPTKR 243


>ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citrus clementina]
            gi|557542514|gb|ESR53492.1| hypothetical protein
            CICLE_v10022000mg [Citrus clementina]
          Length = 216

 Score =  157 bits (397), Expect = 1e-35
 Identities = 108/260 (41%), Positives = 134/260 (51%), Gaps = 14/260 (5%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXXX 1039
            M TAP+KSQPLHNFSLSF  WG  +   NH   R+   PPP  P                
Sbjct: 1    MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHNRTRT---PPPTEPDTTD------------ 45

Query: 1038 XXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAYEVDEAS 859
                        VGSR++R +    P +   K Q+  +E+   +  D E     E +E  
Sbjct: 46   ----DSTRHHRVVGSRSSRAQRLSFPSST-SKPQQDAVERPQRQTADTE-----EEEEDE 95

Query: 858  AHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQL----GDANL----PKSLRLR- 706
              +PWNLRPR+                  +QET+  V +    GD N     PKS RLR 
Sbjct: 96   VGRPWNLRPRK------------------VQETLVDVAVFQNRGDNNANTKAPKSTRLRE 137

Query: 705  -----GSQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPG 541
                 GS G +KKEK KFW++LS+EEIEEDIF+MTGS          + VQKQ+DNVFPG
Sbjct: 138  MVESRGSNG-DKKEKNKFWVTLSREEIEEDIFIMTGSRPARRPRKRPKNVQKQLDNVFPG 196

Query: 540  LWLVGFTADAYRVLEAPIKK 481
            LWLVG TADAYRV +AP+KK
Sbjct: 197  LWLVGLTADAYRVSDAPMKK 216


>ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618144 isoform X1 [Citrus
            sinensis]
          Length = 216

 Score =  155 bits (392), Expect = 4e-35
 Identities = 106/260 (40%), Positives = 132/260 (50%), Gaps = 14/260 (5%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXXX 1039
            M TAP+KSQPLHNFSLSF  WG  +   NH   R+   PPP  P                
Sbjct: 1    MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHNRTRT---PPPTEPDTTD------------ 45

Query: 1038 XXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAYEVDEAS 859
                        VGSR++R +    PC+   K  +   ++   +  D E     E +E  
Sbjct: 46   ----DSTRHHRVVGSRSSRAQRLSFPCST-SKPHQDAGDRSQRQTADTE-----EEEEDE 95

Query: 858  AHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQL----GDANL----PKSLRLR- 706
              +PWNLRPR+                  +QET+  V +    GD N     PKS RLR 
Sbjct: 96   VGRPWNLRPRK------------------VQETLVDVAVFQNRGDNNANTKAPKSTRLRE 137

Query: 705  -----GSQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPG 541
                 GS G +KKEK KFW++LS+EEIEEDIF+MTGS          + VQKQ+DNVFPG
Sbjct: 138  MVESRGSNG-DKKEKNKFWVTLSREEIEEDIFIMTGSRPARRPRKRPKNVQKQLDNVFPG 196

Query: 540  LWLVGFTADAYRVLEAPIKK 481
            LWLVG T DAYRV +AP+KK
Sbjct: 197  LWLVGLTVDAYRVSDAPMKK 216


>ref|XP_007039763.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590676536|ref|XP_007039764.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590676539|ref|XP_007039765.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590676547|ref|XP_007039767.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508777008|gb|EOY24264.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508777009|gb|EOY24265.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508777010|gb|EOY24266.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508777012|gb|EOY24268.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 223

 Score =  150 bits (378), Expect = 2e-33
 Identities = 102/257 (39%), Positives = 129/257 (50%), Gaps = 11/257 (4%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXXX 1039
            MATAPVKSQPLHNF+  F  WG            S D    P    DH            
Sbjct: 1    MATAPVKSQPLHNFNFPFLKWGTHG--GGGSSTSSADHRRSPESDSDHDRL--------- 49

Query: 1038 XXXXXENNRKTPVGSRTTR-NRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAY----- 877
                    R T VGSR+TR  R++F P     K  KQ   ++  +  +++  K +     
Sbjct: 50   --------RPTRVGSRSTRIQRLSFLPP---PKPIKQSHGEDEEQQQEEQPLKPHKNEAE 98

Query: 876  -EVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANLPKSLRLRGS 700
             E +E +  +PWNLRPR+                     T  + ++ +   PKS+RLRG 
Sbjct: 99   EEEEEETVQRPWNLRPRKVVVETTAVV------------TTAMEKVSETAAPKSMRLRGL 146

Query: 699  QGT----EKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGLWL 532
                   EKKEK KFWI+LS+EEIEEDIFVMTGS          + +QKQ+D VFPGLWL
Sbjct: 147  AENGGIVEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPKKRPKNIQKQLDAVFPGLWL 206

Query: 531  VGFTADAYRVLEAPIKK 481
            VG TADAYRV +AP+KK
Sbjct: 207  VGTTADAYRVADAPVKK 223


>ref|XP_007039766.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508777011|gb|EOY24267.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 227

 Score =  148 bits (373), Expect = 6e-33
 Identities = 101/256 (39%), Positives = 128/256 (50%), Gaps = 11/256 (4%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXXX 1039
            MATAPVKSQPLHNF+  F  WG            S D    P    DH            
Sbjct: 1    MATAPVKSQPLHNFNFPFLKWGTHG--GGGSSTSSADHRRSPESDSDHDRL--------- 49

Query: 1038 XXXXXENNRKTPVGSRTTR-NRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAY----- 877
                    R T VGSR+TR  R++F P     K  KQ   ++  +  +++  K +     
Sbjct: 50   --------RPTRVGSRSTRIQRLSFLPP---PKPIKQSHGEDEEQQQEEQPLKPHKNEAE 98

Query: 876  -EVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANLPKSLRLRGS 700
             E +E +  +PWNLRPR+                     T  + ++ +   PKS+RLRG 
Sbjct: 99   EEEEEETVQRPWNLRPRKVVVETTAVV------------TTAMEKVSETAAPKSMRLRGL 146

Query: 699  QGT----EKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGLWL 532
                   EKKEK KFWI+LS+EEIEEDIFVMTGS          + +QKQ+D VFPGLWL
Sbjct: 147  AENGGIVEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPKKRPKNIQKQLDAVFPGLWL 206

Query: 531  VGFTADAYRVLEAPIK 484
            VG TADAYRV +AP+K
Sbjct: 207  VGTTADAYRVADAPVK 222


>ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812835 isoform X1 [Glycine
            max] gi|571536516|ref|XP_006600845.1| PREDICTED:
            uncharacterized protein LOC100812835 isoform X2 [Glycine
            max]
          Length = 237

 Score =  144 bits (362), Expect = 1e-31
 Identities = 106/265 (40%), Positives = 123/265 (46%), Gaps = 20/265 (7%)
 Frame = -2

Query: 1215 ATAPVKSQPLHNFSLSFPTWGNKNQMN-------NHRCRRSVD-APPPPSPHLDHRXXXX 1060
            A APVKSQPLHNF+L F  WG   + N       +HR RR  D A  P S   D R    
Sbjct: 6    AHAPVKSQPLHNFALPFLKWGASGKNNTTTTAAHHHRFRRPSDHASEPDSSDPDSRPHR- 64

Query: 1059 XXXXXXXXXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKA 880
                               +GSRT RNR +  P         Q+ E E  + DD      
Sbjct: 65   -------------------LGSRTARNRFSL-PLKPPPPPPPQLHEAEHDDADD------ 98

Query: 879  YEVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQL-------GDAN--L 727
                  +  KPWNLRPR+                          +        GD N   
Sbjct: 99   ------AVQKPWNLRPRKPALLPKAALEIGTGPSRNHHHATNNGEFHDGGGGGGDNNNPA 152

Query: 726  PKSLRLRGSQGTE---KKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVD 556
            PKSLRLRG   T    KKEK KFWI+LS+EEIEEDIFVMTGS          + VQKQ+D
Sbjct: 153  PKSLRLRGFSDTPCSVKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQMD 212

Query: 555  NVFPGLWLVGFTADAYRVLEAPIKK 481
            +VFPGLWLVG TADAYRV + P K+
Sbjct: 213  SVFPGLWLVGITADAYRVADTPAKR 237


>ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferase 2E-like [Glycine max]
          Length = 241

 Score =  139 bits (350), Expect = 3e-30
 Identities = 105/264 (39%), Positives = 122/264 (46%), Gaps = 19/264 (7%)
 Frame = -2

Query: 1215 ATAPVKSQPLHNFSLSFPTWG------NKNQMNNHRCRRSVD-APPPPSPHLDHRXXXXX 1057
            A APVKSQPLHNF+L F  WG        N  ++HR RR  D A  P S   D R     
Sbjct: 12   APAPVKSQPLHNFALPFLKWGASGKNNTTNAAHHHRFRRPSDHASEPDSSDPDSRPHR-- 69

Query: 1056 XXXXXXXXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAY 877
                              +GSRT RNR +      L               DDD      
Sbjct: 70   ------------------LGSRTARNRFS------LPLKPPPPPPPPQPPHDDDA----- 100

Query: 876  EVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXE--------LQETVPVVQLGDAN-LP 724
               + S  KPW LRPR+                                 +  GD N  P
Sbjct: 101  ---DDSVQKPWKLRPRKPALLPNKTALEIGTGPSRNHHHHHHHATNNGEFLDGGDNNPAP 157

Query: 723  KSLRLRG---SQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDN 553
            KSLRLRG   +Q +EKKEK KFWI+LS+EEIEEDIFVMTGS          + VQKQ+D+
Sbjct: 158  KSLRLRGFSDTQCSEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDS 217

Query: 552  VFPGLWLVGFTADAYRVLEAPIKK 481
            VFPGLWLVG TADAYRV + P K+
Sbjct: 218  VFPGLWLVGITADAYRVADTPTKR 241


>ref|XP_002531462.1| conserved hypothetical protein [Ricinus communis]
            gi|223528916|gb|EEF30912.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 265

 Score =  130 bits (327), Expect = 1e-27
 Identities = 99/284 (34%), Positives = 142/284 (50%), Gaps = 38/284 (13%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWG--------NKNQMNNHRCRRSVDAPPPPSPHLDHRXXX 1063
            MATAPVK Q LHNF +S   WG        + N  ++H  R S       S +       
Sbjct: 1    MATAPVKPQQLHNFPISLK-WGQTTTTTTISANHQHHHHNRSS------SSNNQRLATPV 53

Query: 1062 XXXXXXXXXXXXXENNRKTPVGSRTTR-NRVAFSPCN-LLEKSQKQIMEKESAE------ 907
                            R   VGSR+ R +R +F+ C+ LL K++ +I +K  A       
Sbjct: 54   HESETESDPDQSQSTIRHPRVGSRSARVHRYSFASCSTLLPKAKTEIPQKPEATEKPQQK 113

Query: 906  ----VDDDERGKAYEVDEA-SAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVV-- 748
                ++++ + +A E++E  S+ +PW LRPR+                   +ET  ++  
Sbjct: 114  NLAVLENNNKNEAEEIEEEDSSSRPWKLRPRKGILTGSS------------KETATLLGN 161

Query: 747  QLGDANLPKSLRLRG-----SQGT----------EKKEKAKFWISLSKEEIEEDIFVMTG 613
            +  D+  PKS+RLRG     S G           EKKEK KFW++LS+EEIEED+FV+TG
Sbjct: 162  EQRDSTTPKSMRLRGLVDSTSSGLGVGLGNGVSLEKKEKRKFWVALSREEIEEDVFVLTG 221

Query: 612  SXXXXXXXXXXRTVQKQVDNVFPGLWLVGFTADAYRVLEAPIKK 481
            S          + VQK +D+VFPGLWLVG TAD+YRV + P+K+
Sbjct: 222  SRPARRPKKRPKNVQKILDSVFPGLWLVGTTADSYRVADPPVKR 265


>ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604791 [Solanum tuberosum]
          Length = 220

 Score =  120 bits (300), Expect = 2e-24
 Identities = 88/250 (35%), Positives = 118/250 (47%), Gaps = 11/250 (4%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMN-NHRCRRSVDAP-----PPPSPHLDHRXXXXX 1057
            MA APVKSQPLH FSL    WGNK+  N NHR RR    P     PP +  +D       
Sbjct: 1    MAAAPVKSQPLHYFSLPQLKWGNKSHTNANHRFRRRDSPPSNGDNPPQTADVD------- 53

Query: 1056 XXXXXXXXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAY 877
                        ++ K    S    +    S     ++ +K++ E+E  E    E G+  
Sbjct: 54   ---------GGSDSEKVQPRSEAEADPNGVSSLQGEDEHEKEVKEEEEEEEVGCEEGEV- 103

Query: 876  EVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANLPKSLRLRGSQ 697
                    K WNLRPRR                    + V +      ++ +S RL+ + 
Sbjct: 104  --------KLWNLRPRRGVTKVETASL----------KNVEMRVESSNHMQRSQRLKDNA 145

Query: 696  -----GTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGLWL 532
                 G+ KK K K WISLS+EEIEED++ MTGS          +T+QKQ+DNVFPGL+L
Sbjct: 146  DGNGVGSGKKGKKKLWISLSREEIEEDVYSMTGSRPARRPKKRSKTIQKQLDNVFPGLYL 205

Query: 531  VGFTADAYRV 502
            VG TAD++RV
Sbjct: 206  VGLTADSFRV 215


>ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263341 isoform 1 [Solanum
            lycopersicum] gi|460400536|ref|XP_004245790.1| PREDICTED:
            uncharacterized protein LOC101263341 isoform 2 [Solanum
            lycopersicum]
          Length = 219

 Score =  120 bits (300), Expect = 2e-24
 Identities = 90/245 (36%), Positives = 120/245 (48%), Gaps = 6/245 (2%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMN-NHRCRRSVDAPPPPSPHLDHRXXXXXXXXXX 1042
            MATAPVKSQPLH FSL    WGNK+  N NHR RR  D+PP    +              
Sbjct: 1    MATAPVKSQPLHYFSLPQLKWGNKSNTNANHRFRRR-DSPPSNGDN----------PTQT 49

Query: 1041 XXXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAYEVDEA 862
                   ++ K    S    +    S     E+ ++++ E+E  EV  +E     EV   
Sbjct: 50   ADVDGGSDSEKVQPRSEAEADPNGVSSLQGREEHEEKVKEEEEEEVGCEEG----EV--- 102

Query: 861  SAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANLPKSLRLRGSQ----- 697
               K WNLRPRR                    + V +      ++ +S RL+ +      
Sbjct: 103  ---KLWNLRPRRGVTKVETTSL----------KNVEMRVESSNHMQRSQRLKDNADGNGV 149

Query: 696  GTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPGLWLVGFTA 517
            G+ KK K K WISLS+EEIEED++ MTGS          +T+QKQ+DNVFPGL+LVG TA
Sbjct: 150  GSGKKGKKKLWISLSREEIEEDVYSMTGSRPARRPKKRSKTIQKQLDNVFPGLYLVGVTA 209

Query: 516  DAYRV 502
            D++RV
Sbjct: 210  DSFRV 214


>ref|XP_006440253.1| hypothetical protein CICLE_v10022000mg [Citrus clementina]
            gi|557542515|gb|ESR53493.1| hypothetical protein
            CICLE_v10022000mg [Citrus clementina]
          Length = 236

 Score =  115 bits (289), Expect = 3e-23
 Identities = 92/249 (36%), Positives = 118/249 (47%), Gaps = 17/249 (6%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXXX 1039
            M TAP+KSQPLHNFSLSF  WG  +   NH   R+   PPP  P                
Sbjct: 1    MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHNRTRT---PPPTEPDTTD------------ 45

Query: 1038 XXXXXENNRKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKESAEVDDDERGKAYEVDEAS 859
                        VGSR++R +    P +   K Q+  +E+   +  D E     E +E  
Sbjct: 46   ----DSTRHHRVVGSRSSRAQRLSFPSST-SKPQQDAVERPQRQTADTE-----EEEEDE 95

Query: 858  AHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQL----GDANL----PKSLRLR- 706
              +PWNLRPR+                  +QET+  V +    GD N     PKS RLR 
Sbjct: 96   VGRPWNLRPRK------------------VQETLVDVAVFQNRGDNNANTKAPKSTRLRE 137

Query: 705  -----GSQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVF-- 547
                 GS G +KKEK KFW++LS+EEIEEDIF+MTGS          + VQKQ+D  +  
Sbjct: 138  MVESRGSNG-DKKEKNKFWVTLSREEIEEDIFIMTGSRPARRPRKRPKNVQKQLDVRYFC 196

Query: 546  -PGLWLVGF 523
             PG +LV F
Sbjct: 197  SPGFFLVLF 205


>ref|XP_007052276.1| Uncharacterized protein TCM_005685 [Theobroma cacao]
            gi|508704537|gb|EOX96433.1| Uncharacterized protein
            TCM_005685 [Theobroma cacao]
          Length = 287

 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 87/283 (30%), Positives = 113/283 (39%), Gaps = 39/283 (13%)
 Frame = -2

Query: 1215 ATAPVKSQPLHNFSLSFPTWGNKNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXXXX 1036
            +++ +KS PLHNF L    W   N  NNHR R+  D+        D              
Sbjct: 24   SSSTLKSHPLHNFQLHDLKWA-MNHSNNHRLRKLSDSSHKSPQRGDS------------D 70

Query: 1035 XXXXENNRKTPVGSRTTRNRVAF--SPCNLLEKSQKQIMEKESAEVDD---------DER 889
                +N +  PV     +N  +   S  +  EKS+K+++      VD+         D R
Sbjct: 71   SDSDDNRKGNPVREAAPKNGASSGSSADHRSEKSEKKVINGSDVLVDNNSEKKATPSDGR 130

Query: 888  GKAY------------------------EVDEASAHKPWNLRPRRXXXXXXXXXXXXXXX 781
             K Y                        E  E    K WNLRPR+               
Sbjct: 131  SKIYIRFRTKNQKPADEVADAGDQNLDAEYVEELVPKTWNLRPRKPITKPRNQNGAAPRI 190

Query: 780  XXELQETVPVVQLGDANLPKSLRLRG---SQGTEKKEKAK-FWISLSKEEIEEDIFVMTG 613
                 E          + P+S R R     +  EKKEK K F ISLS+EEI++DIF MTG
Sbjct: 191  GASAHEN-------KIHRPESTRSRNVTEPKAAEKKEKKKKFSISLSREEIDDDIFAMTG 243

Query: 612  SXXXXXXXXXXRTVQKQVDNVFPGLWLVGFTADAYRVLEAPIK 484
            S          + VQKQ+D VFPGLWL   T D YRV +AP K
Sbjct: 244  SKPSRRPKKRAKNVQKQLDCVFPGLWLSSITPDCYRVSDAPAK 286


>ref|XP_006448443.1| hypothetical protein CICLE_v10016320mg [Citrus clementina]
            gi|568828779|ref|XP_006468717.1| PREDICTED:
            uncharacterized protein LOC102623112 [Citrus sinensis]
            gi|557551054|gb|ESR61683.1| hypothetical protein
            CICLE_v10016320mg [Citrus clementina]
          Length = 259

 Score = 94.0 bits (232), Expect = 1e-16
 Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 14/255 (5%)
 Frame = -2

Query: 1218 MATAPVKSQPLHNFSLSFPTWGNKNQMNNHRCRRSVDAPPPPSPHLDHRXXXXXXXXXXX 1039
            M TA   S+PLHNF+L +  WG++  +   RC +  D+ P  SP +DHR           
Sbjct: 10   MGTARSSSKPLHNFTLPYLKWGHQRHL---RCMKVEDSSPS-SPSVDHRPRRRSREFEFD 65

Query: 1038 XXXXXENN-----------RKTPVGSRTTRNRVAFSPCNLLEKSQKQIMEKE--SAEVDD 898
                 +              +   G    R ++        +K + QI+ +E    E+ +
Sbjct: 66   DSGRHKGKVLLKSNGVDGGAEEEEGIDAVREKIMNDLKTAADKMKDQILREEVYEEEIME 125

Query: 897  DERGKAYEVDEASAHKPWNLRPRRXXXXXXXXXXXXXXXXXELQETVPVVQLGDANL-PK 721
            D    A  V EA+  +PWNLR RR                  ++E  P     +  +  K
Sbjct: 126  DTHVAAAAV-EAAEVRPWNLRTRRAACKAPNKGFR-------IEEKKPNSSPVNTEIGAK 177

Query: 720  SLRLRGSQGTEKKEKAKFWISLSKEEIEEDIFVMTGSXXXXXXXXXXRTVQKQVDNVFPG 541
            S +L+  +   KKE+ KF +SL ++EIEED   M G           R VQKQ+D++FPG
Sbjct: 178  SPKLQRGEKEIKKERPKFAVSLMRKEIEEDFMEMVGHRPPRRPKKRPRIVQKQMDSLFPG 237

Query: 540  LWLVGFTADAYRVLE 496
            LWL   T D+Y+V E
Sbjct: 238  LWLTEVTVDSYKVPE 252

