BLASTX nr result

ID: Akebia23_contig00013901 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00013901
         (1563 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266542.1| PREDICTED: uncharacterized protein LOC100257...   237   1e-59
ref|XP_007201961.1| hypothetical protein PRUPE_ppa004686mg [Prun...   198   5e-48
ref|XP_006482076.1| PREDICTED: uncharacterized protein DDB_G0283...   194   7e-47
ref|XP_006482075.1| PREDICTED: uncharacterized protein DDB_G0283...   194   7e-47
ref|XP_006430550.1| hypothetical protein CICLE_v10011438mg [Citr...   193   2e-46
ref|XP_006430548.1| hypothetical protein CICLE_v10011438mg [Citr...   193   2e-46
ref|XP_006838218.1| hypothetical protein AMTR_s00106p00155270 [A...   190   1e-45
ref|XP_007028352.1| Uncharacterized protein isoform 4 [Theobroma...   187   9e-45
ref|XP_007028351.1| Uncharacterized protein isoform 3 [Theobroma...   185   4e-44
ref|XP_007028350.1| Uncharacterized protein isoform 2, partial [...   185   4e-44
ref|XP_007028349.1| Uncharacterized protein isoform 1 [Theobroma...   185   4e-44
ref|XP_004144330.1| PREDICTED: uncharacterized protein LOC101218...   171   1e-39
ref|XP_002307979.2| hypothetical protein POPTR_0006s03830g [Popu...   166   3e-38
ref|XP_006575187.1| PREDICTED: protein starmaker-like isoform X6...   158   7e-36
gb|EXC34985.1| hypothetical protein L484_014712 [Morus notabilis]     157   2e-35
ref|XP_006575186.1| PREDICTED: protein starmaker-like isoform X5...   157   2e-35
ref|XP_006575183.1| PREDICTED: protein starmaker-like isoform X2...   156   2e-35
ref|XP_003519025.1| PREDICTED: protein starmaker-like isoform X1...   156   3e-35
gb|EYU18618.1| hypothetical protein MIMGU_mgv1a007462mg [Mimulus...   147   2e-32
ref|XP_006589006.1| PREDICTED: arginine/serine-rich coiled-coil ...   144   1e-31

>ref|XP_002266542.1| PREDICTED: uncharacterized protein LOC100257160 [Vitis vinifera]
            gi|297739954|emb|CBI30136.3| unnamed protein product
            [Vitis vinifera]
          Length = 510

 Score =  237 bits (604), Expect = 1e-59
 Identities = 171/468 (36%), Positives = 225/468 (48%), Gaps = 64/468 (13%)
 Frame = +1

Query: 58   MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD +L   P D ADA+ +FRKP+NDA NRKYRRR                 H+ + SP F
Sbjct: 1    MDSSLKSPPRDKADAKTAFRKPTNDATNRKYRRRSPTSGSSSSGGSPI---HEHNSSPIF 57

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             +++  K+SD  +RR+  GREL+ +                +RQ                
Sbjct: 58   SKEDSEKVSDRRQRRKGDGRELDRDAGRSQYRKTADSYRHSDRQSSRSSRGHYRYDDHVR 117

Query: 415  XXXXA-DGGER-RYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCD 585
                A D G+R  +              +SD  RQESE+ R RDY +  DKY+RDK D  
Sbjct: 118  QEKHAADEGDRDHHNLSSRSGRESRVGNYSDHVRQESEHSRTRDYFRGTDKYSRDKHDNA 177

Query: 586  GH----------------------------GRRRLINSNLDEVKIGEE-RHNSXXXXXXX 678
            G+                              RR  NSN ++ K GE+ +H         
Sbjct: 178  GYRSKDKEKETSSLEHQKYKDKDLSSDRAGSGRRHTNSNFEDSKAGEQDKHLRDGDGPDE 237

Query: 679  XXXXXXSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVME 858
                   LGDYK+D   S +ESRGH  DST+ R++G     E  K+  KE+DGQK+   E
Sbjct: 238  RKDYRRGLGDYKSDRSISHEESRGHRNDSTSGRDSGGYRSKEVHKNEPKEVDGQKQPKDE 297

Query: 859  RRKH----------------------------EDNEFLVKKPKLCNADEGTG-GKIISKF 951
            ++K+                            E+ E   KKPKL + ++ T  GK +S+F
Sbjct: 298  KKKYDEWKTDRHKDRYNRESREQFEDKTVVASENQESAAKKPKLVSLEKSTDYGKDVSRF 357

Query: 952  -TCAAD-ETPSSSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLV 1125
             T  AD +  SSSK  Q+I DKV PE A  + +E  +  DLN          ELVN+NLV
Sbjct: 358  STAVADMKQSSSSKLAQDIADKVTPEHAFLNNSEVAN--DLNAAKIAAMKAAELVNRNLV 415

Query: 1126 GGGYMSTDQKKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            G GYMS DQKKK+LWG+KK++ AEES HHWD  LFSD+ERQEKFNKLM
Sbjct: 416  GVGYMSADQKKKLLWGSKKSTTAEESGHHWDTALFSDRERQEKFNKLM 463


>ref|XP_007201961.1| hypothetical protein PRUPE_ppa004686mg [Prunus persica]
            gi|462397492|gb|EMJ03160.1| hypothetical protein
            PRUPE_ppa004686mg [Prunus persica]
          Length = 496

 Score =  198 bits (504), Expect = 5e-48
 Identities = 146/453 (32%), Positives = 209/453 (46%), Gaps = 61/453 (13%)
 Frame = +1

Query: 94   DARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPR 273
            DA+ +FRKP+ DAANRKYRRR                 H+ + SP+  R++P K+S+   
Sbjct: 13   DAKTAFRKPATDAANRKYRRRSPVGGSSPSDGSPM---HEHNCSPKNSREDPGKVSEYQT 69

Query: 274  RRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXADGGERRYQ 453
            RR + GRELE +                +RQ                    AD  ++ YQ
Sbjct: 70   RRRDDGRELERDSNRRYYGRSSDSYRHSDRQSSRSLHGYYKHDDCIKHDKHADEEDKNYQ 129

Query: 454  XXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGH-------------- 591
                          S  +      + R+Y +++DKY+RDK D  G+              
Sbjct: 130  KLSSRSGR-----ESRGSAYYDHIKSREYSRNLDKYSRDKYDGSGYRNKDKDRESSFPEN 184

Query: 592  ---------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXXXXSLGDYKNDHI 726
                           GRR   + + +E++   +RH               + GDY ++ I
Sbjct: 185  QKYKDKDSSSQRVGSGRR---HGHFEEMERERDRHALDRDVQDEKKDYRRNSGDYISERI 241

Query: 727  SSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF------- 885
             S++ES+G   DS + R+ G++ + E  KS  KELD    +  +R+K++D E        
Sbjct: 242  FSYEESKGQRSDSISRRDEGKHRMKEGYKSELKELDDDNVSKEQRKKYDDKETSWGNRIT 301

Query: 886  --------------------LVKKPKLCNADEGTGG-KIISKFTCAAD-ETPSSSKQVQE 999
                                  K+PKL ++++G  G K +SKFT  AD    SSSKQVQE
Sbjct: 302  RETSERSADKHYIKSENQESTAKRPKLFSSEKGIDGRKDVSKFTTTADGRESSSSKQVQE 361

Query: 1000 IVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGG---YMSTDQKKKILW 1170
              D++  E  Q  AN+A +A D+N          ELVN+NL+G G    M+ DQKKK+LW
Sbjct: 362  --DEMTTEKTQ--ANDAEAANDINAAKVAALKAAELVNRNLIGAGPVGCMTADQKKKLLW 417

Query: 1171 GNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            GNKK++ AEE  H WD  LFSD+ERQEKFNKLM
Sbjct: 418  GNKKSTTAEEVGHRWDSTLFSDRERQEKFNKLM 450


>ref|XP_006482076.1| PREDICTED: uncharacterized protein DDB_G0283697-like isoform X2
            [Citrus sinensis]
          Length = 482

 Score =  194 bits (494), Expect = 7e-47
 Identities = 147/448 (32%), Positives = 195/448 (43%), Gaps = 49/448 (10%)
 Frame = +1

Query: 73   SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPA 252
            S  PD  D + SFRKPSNDAANR+YRRR                  D + SP + RD+P+
Sbjct: 4    SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSPKC---DHNASPIYSRDDPS 60

Query: 253  KISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXAD 432
            K+ +  +RR++  REL+ +                +RQ                     +
Sbjct: 61   KVPEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHEN 120

Query: 433  GGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRR--- 600
              +R YQ              S   R+  ++ R +DY    ++ + DK D  GHG +   
Sbjct: 121  DEDRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSHDKYDVIGHGSKDKE 169

Query: 601  -------RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDY 711
                   R  N + D                  ++  + H               S GD+
Sbjct: 170  KESSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDH 229

Query: 712  KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL- 888
            +ND   ++DESRGH   S++ R+ G   L E  +S  KELDGQK    E++KH D+E   
Sbjct: 230  RNDRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETYR 289

Query: 889  ---------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIV 1005
                                  KK +  N D+G           AA    SSS Q Q+I 
Sbjct: 290  DRDRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIG 344

Query: 1006 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKT 1185
            D      AQS AN+AV A DL+          ELVNKNLVGG YMSTDQKKK+LWGNKK+
Sbjct: 345  DT--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKS 401

Query: 1186 SAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            +  EES   WD  L  DQ+RQEKFNKLM
Sbjct: 402  TPVEESARRWDTALIGDQDRQEKFNKLM 429


>ref|XP_006482075.1| PREDICTED: uncharacterized protein DDB_G0283697-like isoform X1
            [Citrus sinensis]
          Length = 538

 Score =  194 bits (494), Expect = 7e-47
 Identities = 147/448 (32%), Positives = 195/448 (43%), Gaps = 49/448 (10%)
 Frame = +1

Query: 73   SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPA 252
            S  PD  D + SFRKPSNDAANR+YRRR                  D + SP + RD+P+
Sbjct: 4    SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSPKC---DHNASPIYSRDDPS 60

Query: 253  KISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXAD 432
            K+ +  +RR++  REL+ +                +RQ                     +
Sbjct: 61   KVPEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHEN 120

Query: 433  GGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRR--- 600
              +R YQ              S   R+  ++ R +DY    ++ + DK D  GHG +   
Sbjct: 121  DEDRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSHDKYDVIGHGSKDKE 169

Query: 601  -------RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDY 711
                   R  N + D                  ++  + H               S GD+
Sbjct: 170  KESSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDH 229

Query: 712  KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL- 888
            +ND   ++DESRGH   S++ R+ G   L E  +S  KELDGQK    E++KH D+E   
Sbjct: 230  RNDRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETYR 289

Query: 889  ---------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIV 1005
                                  KK +  N D+G           AA    SSS Q Q+I 
Sbjct: 290  DRDRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIG 344

Query: 1006 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKT 1185
            D      AQS AN+AV A DL+          ELVNKNLVGG YMSTDQKKK+LWGNKK+
Sbjct: 345  DT--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKS 401

Query: 1186 SAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            +  EES   WD  L  DQ+RQEKFNKLM
Sbjct: 402  TPVEESARRWDTALIGDQDRQEKFNKLM 429


>ref|XP_006430550.1| hypothetical protein CICLE_v10011438mg [Citrus clementina]
            gi|557532607|gb|ESR43790.1| hypothetical protein
            CICLE_v10011438mg [Citrus clementina]
          Length = 538

 Score =  193 bits (490), Expect = 2e-46
 Identities = 146/448 (32%), Positives = 195/448 (43%), Gaps = 49/448 (10%)
 Frame = +1

Query: 73   SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPA 252
            S  PD  D + SFRKPSNDAANR+YRRR                  D + SP + RD+P+
Sbjct: 4    SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSP---KRDHNASPIYSRDDPS 60

Query: 253  KISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXAD 432
             + +  +RR++  REL+ +                +RQ                     +
Sbjct: 61   NVPEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHEN 120

Query: 433  GGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRR--- 600
              +R YQ              S   R+  ++ R +DY    ++ +RDK D  GHG +   
Sbjct: 121  DEDRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSRDKYDVIGHGSKDKE 169

Query: 601  -------RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDY 711
                   R  N + D                  ++  + H               S GD+
Sbjct: 170  KESSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDH 229

Query: 712  KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL- 888
            +ND   ++DESRGH   S++ R+ G   L E  +S  KELDGQK    E++KH D+E   
Sbjct: 230  RNDRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETNR 289

Query: 889  ---------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIV 1005
                                  KK +  N D+G           AA    SSS Q Q+I 
Sbjct: 290  DRDRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIG 344

Query: 1006 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKT 1185
            D      AQS AN+AV A DL+          ELVNKNLVGG YMSTDQKKK+LWGNKK+
Sbjct: 345  DT--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKS 401

Query: 1186 SAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            +  EES   WD  L  D++RQEKFNKLM
Sbjct: 402  TPVEESARRWDTALIGDRDRQEKFNKLM 429


>ref|XP_006430548.1| hypothetical protein CICLE_v10011438mg [Citrus clementina]
            gi|567875919|ref|XP_006430549.1| hypothetical protein
            CICLE_v10011438mg [Citrus clementina]
            gi|557532605|gb|ESR43788.1| hypothetical protein
            CICLE_v10011438mg [Citrus clementina]
            gi|557532606|gb|ESR43789.1| hypothetical protein
            CICLE_v10011438mg [Citrus clementina]
          Length = 482

 Score =  193 bits (490), Expect = 2e-46
 Identities = 146/448 (32%), Positives = 195/448 (43%), Gaps = 49/448 (10%)
 Frame = +1

Query: 73   SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPA 252
            S  PD  D + SFRKPSNDAANR+YRRR                  D + SP + RD+P+
Sbjct: 4    SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSP---KRDHNASPIYSRDDPS 60

Query: 253  KISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXAD 432
             + +  +RR++  REL+ +                +RQ                     +
Sbjct: 61   NVPEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHEN 120

Query: 433  GGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRR--- 600
              +R YQ              S   R+  ++ R +DY    ++ +RDK D  GHG +   
Sbjct: 121  DEDRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSRDKYDVIGHGSKDKE 169

Query: 601  -------RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDY 711
                   R  N + D                  ++  + H               S GD+
Sbjct: 170  KESSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDH 229

Query: 712  KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL- 888
            +ND   ++DESRGH   S++ R+ G   L E  +S  KELDGQK    E++KH D+E   
Sbjct: 230  RNDRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETNR 289

Query: 889  ---------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIV 1005
                                  KK +  N D+G           AA    SSS Q Q+I 
Sbjct: 290  DRDRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIG 344

Query: 1006 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKT 1185
            D      AQS AN+AV A DL+          ELVNKNLVGG YMSTDQKKK+LWGNKK+
Sbjct: 345  DT--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKS 401

Query: 1186 SAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            +  EES   WD  L  D++RQEKFNKLM
Sbjct: 402  TPVEESARRWDTALIGDRDRQEKFNKLM 429


>ref|XP_006838218.1| hypothetical protein AMTR_s00106p00155270 [Amborella trichopoda]
            gi|548840676|gb|ERN00787.1| hypothetical protein
            AMTR_s00106p00155270 [Amborella trichopoda]
          Length = 532

 Score =  190 bits (483), Expect = 1e-45
 Identities = 153/488 (31%), Positives = 206/488 (42%), Gaps = 84/488 (17%)
 Frame = +1

Query: 58   MDPNL-SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD  L S SPD  + +PSFRKPSNDA  RKYR+R                 H  S SP  
Sbjct: 1    MDSGLVSYSPDPVEPKPSFRKPSNDAFQRKYRKRSPTSGSASPLSSGSP-QHSHSYSPNI 59

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
              +   K+++D R R +  RE+E +                +                  
Sbjct: 60   SMEEAGKVTNDQRTRMDEEREVERDSSHHRSGKGSDSYG--KGSDVYGDNDRHSRGITQG 117

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQ-----ESEYERRDYQQHVDKYNRDKPD 579
                 D  + + Q              S  TR       +EYE+RD      + NR  PD
Sbjct: 118  YRRHDDSSKHQSQHRREVEERSSQRYSSRITRDLEGSSHAEYEKRDRDSDNFRDNRRNPD 177

Query: 580  -------CDGHGRRRL---------------INSNLDEVKIGE-ERHNSXXXXXXXXXXX 690
                    D  GRR+                 N+N++  K+GE ER+             
Sbjct: 178  KPPRDRKIDDEGRRKERDSATQGRYRDIDKPANTNMEREKMGERERYRDRGEGRDDYRDY 237

Query: 691  XXSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKH 870
              SLGD + D +SS++ SRG+ +DS + R++G     E  +SS++E +    + ++RR+ 
Sbjct: 238  RKSLGDTRRDRVSSYEGSRGYARDSASGRDSGSRHSREIHRSSNRESERHIEDKVQRRRG 297

Query: 871  EDNE-------------------------------------------------FLVKKPK 903
            +D                                                    + KK K
Sbjct: 298  DDESDRYKNKDSYNRESDDHSRGYSRSSSDYRDRSFRNGRSEDKNVHAVDDEASVGKKCK 357

Query: 904  LCNADEGTGGKI-----ISKFTCAADETPSSS-KQVQEIVDKVIPEPAQSSANEAVSACD 1065
            L +AD+ +G            TC AD+  S S KQ+QE V K   EP QSSANEA  A D
Sbjct: 358  LFDADKSSGDATDRHLPSKSSTCVADDKSSLSLKQLQEPVPKETLEPVQSSANEAKIAQD 417

Query: 1066 LNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAEESVHHWDMPLFSDQER 1245
            LN           +VN+NLVGG Y+STD+KKK+LWGNKKTSAAEES   WD  +FSD+ER
Sbjct: 418  LNAAKVAAMKAAGIVNRNLVGGSYLSTDEKKKLLWGNKKTSAAEESGTRWDTAMFSDRER 477

Query: 1246 QEKFNKLM 1269
            QEKFNKLM
Sbjct: 478  QEKFNKLM 485


>ref|XP_007028352.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508716957|gb|EOY08854.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 462

 Score =  187 bits (476), Expect = 9e-45
 Identities = 150/463 (32%), Positives = 209/463 (45%), Gaps = 55/463 (11%)
 Frame = +1

Query: 58   MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD NL  SP D +DA+ +FRK SNDA+NR+YRR                   DRS SP  
Sbjct: 1    MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSP--QRDRSVSPIL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             RD+ AK +D    R+  GREL+ +                +RQ                
Sbjct: 59   SRDDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVR 116

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGH 591
                AD G +  +            THSD  RQES+  R +DY ++ DKY+RD+ D  GH
Sbjct: 117  HDKFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGH 176

Query: 592  ---------------------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXX 690
                                       G  R   S+  E ++  +R              
Sbjct: 177  RIRDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDY 235

Query: 691  XXSLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERR 864
              S GD K D+  S++ESRGH  DS++ RE  N +    E  KS  KE+DGQK    ER 
Sbjct: 236  HRSSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERM 294

Query: 865  KHEDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPS 978
            KH++ E  ++K +                  ++ +  K +  F+ +        ADE  S
Sbjct: 295  KHDEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRS 354

Query: 979  SSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGY--MSTDQ 1152
            S +Q +E   +V     Q+  N+     D+N          ELVN+NL+G G+  M+T+Q
Sbjct: 355  SLEQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQ 412

Query: 1153 KKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLMVSFL 1281
            KKK+LWG+KK++ AEES H WD  LF D+ERQEKFNKLMV+ +
Sbjct: 413  KKKLLWGSKKSTPAEESGHRWDTALFGDRERQEKFNKLMVALV 455


>ref|XP_007028351.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508716956|gb|EOY08853.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 464

 Score =  185 bits (470), Expect = 4e-44
 Identities = 149/459 (32%), Positives = 206/459 (44%), Gaps = 55/459 (11%)
 Frame = +1

Query: 58   MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD NL  SP D +DA+ +FRK SNDA+NR+YRR                   DRS SP  
Sbjct: 1    MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSP--QRDRSVSPIL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             RD+ AK +D    R+  GREL+ +                +RQ                
Sbjct: 59   SRDDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVR 116

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGH 591
                AD G +  +            THSD  RQES+  R +DY ++ DKY+RD+ D  GH
Sbjct: 117  HDKFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGH 176

Query: 592  ---------------------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXX 690
                                       G  R   S+  E ++  +R              
Sbjct: 177  RIRDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDY 235

Query: 691  XXSLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERR 864
              S GD K D+  S++ESRGH  DS++ RE  N +    E  KS  KE+DGQK    ER 
Sbjct: 236  HRSSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERM 294

Query: 865  KHEDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPS 978
            KH++ E  ++K +                  ++ +  K +  F+ +        ADE  S
Sbjct: 295  KHDEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRS 354

Query: 979  SSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGY--MSTDQ 1152
            S +Q +E   +V     Q+  N+     D+N          ELVN+NL+G G+  M+T+Q
Sbjct: 355  SLEQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQ 412

Query: 1153 KKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            KKK+LWG+KK++ AEES H WD  LF D+ERQEKFNKLM
Sbjct: 413  KKKLLWGSKKSTPAEESGHRWDTALFGDRERQEKFNKLM 451


>ref|XP_007028350.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508716955|gb|EOY08852.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 473

 Score =  185 bits (470), Expect = 4e-44
 Identities = 149/459 (32%), Positives = 206/459 (44%), Gaps = 55/459 (11%)
 Frame = +1

Query: 58   MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD NL  SP D +DA+ +FRK SNDA+NR+YRR                   DRS SP  
Sbjct: 1    MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSP--QRDRSVSPIL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             RD+ AK +D    R+  GREL+ +                +RQ                
Sbjct: 59   SRDDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVR 116

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGH 591
                AD G +  +            THSD  RQES+  R +DY ++ DKY+RD+ D  GH
Sbjct: 117  HDKFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGH 176

Query: 592  ---------------------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXX 690
                                       G  R   S+  E ++  +R              
Sbjct: 177  RIRDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDY 235

Query: 691  XXSLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERR 864
              S GD K D+  S++ESRGH  DS++ RE  N +    E  KS  KE+DGQK    ER 
Sbjct: 236  HRSSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERM 294

Query: 865  KHEDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPS 978
            KH++ E  ++K +                  ++ +  K +  F+ +        ADE  S
Sbjct: 295  KHDEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRS 354

Query: 979  SSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGY--MSTDQ 1152
            S +Q +E   +V     Q+  N+     D+N          ELVN+NL+G G+  M+T+Q
Sbjct: 355  SLEQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQ 412

Query: 1153 KKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            KKK+LWG+KK++ AEES H WD  LF D+ERQEKFNKLM
Sbjct: 413  KKKLLWGSKKSTPAEESGHRWDTALFGDRERQEKFNKLM 451


>ref|XP_007028349.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590634353|ref|XP_007028353.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508716954|gb|EOY08851.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508716958|gb|EOY08855.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 504

 Score =  185 bits (470), Expect = 4e-44
 Identities = 149/459 (32%), Positives = 206/459 (44%), Gaps = 55/459 (11%)
 Frame = +1

Query: 58   MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD NL  SP D +DA+ +FRK SNDA+NR+YRR                   DRS SP  
Sbjct: 1    MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSP--QRDRSVSPIL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             RD+ AK +D    R+  GREL+ +                +RQ                
Sbjct: 59   SRDDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVR 116

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGH 591
                AD G +  +            THSD  RQES+  R +DY ++ DKY+RD+ D  GH
Sbjct: 117  HDKFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGH 176

Query: 592  ---------------------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXX 690
                                       G  R   S+  E ++  +R              
Sbjct: 177  RIRDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDY 235

Query: 691  XXSLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERR 864
              S GD K D+  S++ESRGH  DS++ RE  N +    E  KS  KE+DGQK    ER 
Sbjct: 236  HRSSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERM 294

Query: 865  KHEDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPS 978
            KH++ E  ++K +                  ++ +  K +  F+ +        ADE  S
Sbjct: 295  KHDEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRS 354

Query: 979  SSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGY--MSTDQ 1152
            S +Q +E   +V     Q+  N+     D+N          ELVN+NL+G G+  M+T+Q
Sbjct: 355  SLEQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQ 412

Query: 1153 KKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            KKK+LWG+KK++ AEES H WD  LF D+ERQEKFNKLM
Sbjct: 413  KKKLLWGSKKSTPAEESGHRWDTALFGDRERQEKFNKLM 451


>ref|XP_004144330.1| PREDICTED: uncharacterized protein LOC101218861 [Cucumis sativus]
          Length = 472

 Score =  171 bits (432), Expect = 1e-39
 Identities = 137/430 (31%), Positives = 192/430 (44%), Gaps = 40/430 (9%)
 Frame = +1

Query: 100  RPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPRRR 279
            +  FRKPS++ A RKYRRR                  DRS SP+  RD+ +K S+   RR
Sbjct: 7    KAEFRKPSSETAGRKYRRRSSVSGSSSDESP----KRDRSSSPKLLRDDASKHSERKPRR 62

Query: 280  ENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXADGGERRYQXX 459
            +   R+L  +                +R+                    AD  ER Y+  
Sbjct: 63   KEDERDLNKDSRNHHSRSSDSYRYS-DRKSSRSLHGYSRHDDYVRHDKYADE-ERDYERL 120

Query: 460  XXXXXXXXXXT-HSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRRRLINSNLDEVK 633
                      + H D TR+ESE+ R R+Y + V+K +RDK D  GH R R  +S  +   
Sbjct: 121  SSRSNRESKGSAHYDHTRRESEHSRSREYFRDVEKGSRDKYDASGH-RSRDGDSLSERHG 179

Query: 634  IGEERHNSXXXXXXXXXXXXXS-----------LGDYKNDHISSFDESRGHGKDSTAARE 780
             G  RH S                          GDYKN+ + S D+ RG+  DS   R+
Sbjct: 180  SGSRRHASFEEMEKHRNARDRDGQDEKRDNIKHSGDYKNERVLSHDDGRGNRYDSLLGRD 239

Query: 781  NGRNGLTETRKSSSKELDGQKRNVMERRKHEDNE-------------------------- 882
              ++   +  K+  K+LD +K +  E RKH+  E                          
Sbjct: 240  ESKHRTKDINKNDRKDLDDEKSS-KEERKHDARETHWDKVQGKESKGKYDGKGVFVDENQ 298

Query: 883  -FLVKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIVDKVIPEPAQSSANEAVSA 1059
                KKPKL ++     GK ++    A +   S+SK+ Q+    +     Q  + ++  A
Sbjct: 299  GLPAKKPKLFSS-----GKEVNHEEDADENQSSTSKKEQDGKMSL----GQGQSGDSDFA 349

Query: 1060 CDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAEESVHHWDMPLFSDQ 1239
             D +          ELVNKNLVGGGYM+TDQKKK+LWG+KK++A EES H WD  LF+D+
Sbjct: 350  ADFSAAKVAAMKAAELVNKNLVGGGYMTTDQKKKLLWGSKKSTAVEESAHQWDTALFNDR 409

Query: 1240 ERQEKFNKLM 1269
            ERQEKFNKLM
Sbjct: 410  ERQEKFNKLM 419


>ref|XP_002307979.2| hypothetical protein POPTR_0006s03830g [Populus trichocarpa]
            gi|550335404|gb|EEE91502.2| hypothetical protein
            POPTR_0006s03830g [Populus trichocarpa]
          Length = 473

 Score =  166 bits (419), Expect = 3e-38
 Identities = 138/454 (30%), Positives = 201/454 (44%), Gaps = 48/454 (10%)
 Frame = +1

Query: 52   GFMDPNLSLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPE 231
            G   P L    +  + + +FRKPSND ANRKYRR                   D+S SP 
Sbjct: 4    GIQSPQL----ENTETKATFRKPSNDMANRKYRRHSPMNGSSLSDGSP---KRDQSSSPV 56

Query: 232  FHRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXX 411
              RD+PAK S   +RR+   +EL+ +                +R                
Sbjct: 57   VQRDDPAKAS---QRRKGEEKELDRDSGRSRYEKNGESYRHSDRYSSRSSHGYSRNDDYS 113

Query: 412  XXXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDG 588
                  D G+R +Q            +HS    ++ E  R RDY ++ +KY+RD+ D  G
Sbjct: 114  RHDRRVDDGDRHHQVV----------SHSGRESKDGERGRSRDYARNSEKYSRDRHDGSG 163

Query: 589  HGR----------RRLINSNLDEVKIGEER--------------HNSXXXXXXXXXXXXX 696
            H            ++L + +    ++G  R              H               
Sbjct: 164  HRNMDKERELSEHQKLKDKDFSPDRVGSGRKYTSIVSEEKDRDWHRRDRDGRDEKRDYHR 223

Query: 697  SLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHE- 873
            S GD+K+D  S ++++RG+  DS+     GR+ L E+ K+  KEL+G K    E++KH+ 
Sbjct: 224  SSGDHKSDRSSYYEDTRGYRNDSS-----GRDRLRESYKNDPKELNGLK----EKKKHDN 274

Query: 874  -----DNEFLVKKPKLCNADEGTGG--------KIISKFTCAAD---------ETPSSSK 987
                 D +   K P   N D+   G        K    F+ + D         +  SSS 
Sbjct: 275  WETSRDKDRYSKAPGEKNDDKSAFGSEKPESPAKKPKLFSSSKDPDYSGDVNQKQSSSSM 334

Query: 988  QVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKIL 1167
              QE+ +KV     Q+ AN + +A DL+          ELVNKNLVG G+MST+QKKK+L
Sbjct: 335  LAQEVDNKV--NVGQAHANTSEAANDLDAAKVAAMKAAELVNKNLVGVGFMSTEQKKKLL 392

Query: 1168 WGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            WG+KK++A EE+   WD  +F D+ERQEKFNKLM
Sbjct: 393  WGSKKSAAPEETGRRWDTVMFGDRERQEKFNKLM 426


>ref|XP_006575187.1| PREDICTED: protein starmaker-like isoform X6 [Glycine max]
          Length = 438

 Score =  158 bits (399), Expect = 7e-36
 Identities = 132/445 (29%), Positives = 191/445 (42%), Gaps = 40/445 (8%)
 Frame = +1

Query: 58   MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD N    P   +D + +FRKPS DAANR YRRR                 H  S SP  
Sbjct: 2    MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP---RHGHSSSPNL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             R+N A++S   R+ ++  RE + +                +RQ                
Sbjct: 59   VRENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY-- 114

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGHG 594
                    E RY+            T  D  R ES+   ++YQ+ V+KY+ DK D   H 
Sbjct: 115  ------ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHR 166

Query: 595  RRRL-------------INSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLG 705
             +               ++S+ D+           ++  E H+              S G
Sbjct: 167  SKEKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSG 226

Query: 706  DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF 885
            DY++D    + ESR    +S   R+ G++ L E  KS  KE + Q     E+RKH+D E 
Sbjct: 227  DYRSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTET 286

Query: 886  LVKKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVI 1017
               K        + C   D+ + GK +  F           ADE+ +SS ++     K  
Sbjct: 287  GKGKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKAD 345

Query: 1018 PEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAE 1197
               A++S  +  +  DL+          ELVN+NLVG G ++TDQKKK+LWG K+++  E
Sbjct: 346  VRAAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTE 403

Query: 1198 ESVHHWDMPLFSDQERQEKFNKLMV 1272
            ES H WD  +FSD+ERQEKFNKLMV
Sbjct: 404  ESGHRWDTAMFSDRERQEKFNKLMV 428


>gb|EXC34985.1| hypothetical protein L484_014712 [Morus notabilis]
          Length = 491

 Score =  157 bits (396), Expect = 2e-35
 Identities = 137/461 (29%), Positives = 192/461 (41%), Gaps = 57/461 (12%)
 Frame = +1

Query: 58   MDPNL-SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD NL S + D  D +P+FRKP+ DA NRKYRR                   +RS SP+ 
Sbjct: 1    MDSNLQSPNQDNVDVKPAFRKPTTDATNRKYRRHSPVSGSQSDGSP----ERERSASPKL 56

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
              ++P ++ +   RR++ G+E++ +                +RQ                
Sbjct: 57   TGEDPRRVHESQSRRKDDGKEVDRDSYRSHYGRGSDSYRHSDRQFSRSSHRYSRHDDYSK 116

Query: 415  XXXXADGGERRYQXXXXXXXXXXXX-THSDCTRQESEYERRDYQQHVDKYNRDKPDCD-- 585
                AD  ER ++             TH D ++       RD+ +   KY+RD+ D    
Sbjct: 117  HDKHADDEERNHRRLSSRSGWESKGGTHIDHSKL------RDHLRDGGKYSRDRYDSYLY 170

Query: 586  ------------GHGRRRLINSNLDEVKIGE----------ERHNSXXXXXXXXXXXXXS 699
                         H +    +S+ D+ K G+          ER                S
Sbjct: 171  NSKDRERETSSLEHHKYNDRDSSFDKAKSGKRHPHPEDVERERRGMEKDGQDDKRDFRRS 230

Query: 700  LGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHED- 876
             GDY+ D     +E +GH  D  +     RN   E  K+ +KE+DGQ      ++K++D 
Sbjct: 231  SGDYRGDR----EEVKGHSIDFYS-----RNRAKECYKNEAKEIDGQCLTKEGKKKYDDV 281

Query: 877  ---------------------------NEFLVKKPKLCNADEGTGGKIISKFTCAADETP 975
                                        EFL K+ K         GK +SKF+  AD   
Sbjct: 282  ETNRSNDQYIREPAEQSGEKSVIGSENQEFLSKRQKFSLDKYTDAGKKVSKFSTVADVKE 341

Query: 976  SSSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGG---GYMST 1146
            SS +Q  +   K+     +   N +  A DLN          E VNKNLVGG   G+M+ 
Sbjct: 342  SSPQQPPD--HKLTA--GEDQVNVSNFANDLNAAKVAAMKAAESVNKNLVGGVGTGFMTA 397

Query: 1147 DQKKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            DQKKK+LWGNKKT+ AEES H WD  LFSD+ERQEKFNKLM
Sbjct: 398  DQKKKLLWGNKKTTIAEESGHRWDSTLFSDRERQEKFNKLM 438


>ref|XP_006575186.1| PREDICTED: protein starmaker-like isoform X5 [Glycine max]
          Length = 440

 Score =  157 bits (396), Expect = 2e-35
 Identities = 132/451 (29%), Positives = 192/451 (42%), Gaps = 40/451 (8%)
 Frame = +1

Query: 58   MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD N    P   +D + +FRKPS DAANR YRRR                 H  S SP  
Sbjct: 2    MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP---RHGHSSSPNL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             R+N A++S   R+ ++  RE + +                +RQ                
Sbjct: 59   VRENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY-- 114

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGHG 594
                    E RY+            T  D  R ES+   ++YQ+ V+KY+ DK D   H 
Sbjct: 115  ------ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHR 166

Query: 595  RRRL-------------INSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLG 705
             +               ++S+ D+           ++  E H+              S G
Sbjct: 167  SKEKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSG 226

Query: 706  DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF 885
            DY++D    + ESR    +S   R+ G++ L E  KS  KE + Q     E+RKH+D E 
Sbjct: 227  DYRSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTET 286

Query: 886  LVKKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVI 1017
               K        + C   D+ + GK +  F           ADE+ +SS ++     K  
Sbjct: 287  GKGKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKAD 345

Query: 1018 PEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAE 1197
               A++S  +  +  DL+          ELVN+NLVG G ++TDQKKK+LWG K+++  E
Sbjct: 346  VRAAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTE 403

Query: 1198 ESVHHWDMPLFSDQERQEKFNKLMVSFLCHP 1290
            ES H WD  +FSD+ERQEKFNKLM   +  P
Sbjct: 404  ESGHRWDTAMFSDRERQEKFNKLMSEVVLVP 434


>ref|XP_006575183.1| PREDICTED: protein starmaker-like isoform X2 [Glycine max]
            gi|571440534|ref|XP_006575184.1| PREDICTED: protein
            starmaker-like isoform X3 [Glycine max]
            gi|571440536|ref|XP_006575185.1| PREDICTED: protein
            starmaker-like isoform X4 [Glycine max]
          Length = 480

 Score =  156 bits (395), Expect = 2e-35
 Identities = 131/444 (29%), Positives = 190/444 (42%), Gaps = 40/444 (9%)
 Frame = +1

Query: 58   MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD N    P   +D + +FRKPS DAANR YRRR                 H  S SP  
Sbjct: 2    MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP---RHGHSSSPNL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             R+N A++S   R+ ++  RE + +                +RQ                
Sbjct: 59   VRENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY-- 114

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGHG 594
                    E RY+            T  D  R ES+   ++YQ+ V+KY+ DK D   H 
Sbjct: 115  ------ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHR 166

Query: 595  RRRL-------------INSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLG 705
             +               ++S+ D+           ++  E H+              S G
Sbjct: 167  SKEKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSG 226

Query: 706  DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF 885
            DY++D    + ESR    +S   R+ G++ L E  KS  KE + Q     E+RKH+D E 
Sbjct: 227  DYRSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTET 286

Query: 886  LVKKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVI 1017
               K        + C   D+ + GK +  F           ADE+ +SS ++     K  
Sbjct: 287  GKGKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKAD 345

Query: 1018 PEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAE 1197
               A++S  +  +  DL+          ELVN+NLVG G ++TDQKKK+LWG K+++  E
Sbjct: 346  VRAAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTE 403

Query: 1198 ESVHHWDMPLFSDQERQEKFNKLM 1269
            ES H WD  +FSD+ERQEKFNKLM
Sbjct: 404  ESGHRWDTAMFSDRERQEKFNKLM 427


>ref|XP_003519025.1| PREDICTED: protein starmaker-like isoform X1 [Glycine max]
          Length = 479

 Score =  156 bits (394), Expect = 3e-35
 Identities = 132/444 (29%), Positives = 188/444 (42%), Gaps = 40/444 (9%)
 Frame = +1

Query: 58   MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD N    P   +D + +FRKPS DAANR YRRR                 H  S SP  
Sbjct: 2    MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP---RHGHSSSPNL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
             R+N A++S   R+ ++  RE + +                +RQ                
Sbjct: 59   VRENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY-- 114

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGHG 594
                    E RY+            T  D  R ES+   ++YQ+ V+KY+ DK D   H 
Sbjct: 115  ------ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHR 166

Query: 595  RRRL-------------INSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLG 705
             +               ++S+ D+           ++  E H+              S G
Sbjct: 167  SKEKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSG 226

Query: 706  DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF 885
            DY++D    + ESR    +S   R+ G++ L E  KS  KE + Q     E+RKH+D E 
Sbjct: 227  DYRSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTET 286

Query: 886  LVKKP-------KLCNA-DEGTGGKIISKFTCAADET--------PSSSKQVQEIVDKVI 1017
               K        + C   D+ + GK +  F    D+          SSSK   E   K  
Sbjct: 287  GKGKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDDESKTSSSKLSHE--SKAD 344

Query: 1018 PEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAE 1197
               A++S  +  +  DL+          ELVN+NLVG G ++TDQKKK+LWG K+++  E
Sbjct: 345  VRAAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTE 402

Query: 1198 ESVHHWDMPLFSDQERQEKFNKLM 1269
            ES H WD  +FSD+ERQEKFNKLM
Sbjct: 403  ESGHRWDTAMFSDRERQEKFNKLM 426


>gb|EYU18618.1| hypothetical protein MIMGU_mgv1a007462mg [Mimulus guttatus]
          Length = 406

 Score =  147 bits (370), Expect = 2e-32
 Identities = 124/393 (31%), Positives = 169/393 (43%), Gaps = 1/393 (0%)
 Frame = +1

Query: 94   DARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPR 273
            D++  FRKPSNDAA+RKYRRR                + DRS SP   + +  +++DD R
Sbjct: 10   DSKAEFRKPSNDAASRKYRRRSPAGGSSSSSDGSL--HRDRSSSPLPRKKDSIRVADDNR 67

Query: 274  RRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXADGGERRYQ 453
            + E+G R L                    R                      D  +R Y 
Sbjct: 68   KTEDG-RNLSGRSGESYKYTDRHSSKNYPRHDEHSRRDRH-----------VDDYDRGYS 115

Query: 454  XXXXXXXXXXXXTHS-DCTRQESEYERRDYQQHVDKYNRDKPDCDGHGRRRLINSNLDEV 630
                          + D +R + E+  RDY + +D ++  K D        L+N + D+ 
Sbjct: 116  KSSYRSNRDQRDNGNFDHSRSDKEHRSRDYIKDIDTHSHAKSD-------GLVNRSRDKE 168

Query: 631  KIGEERHNSXXXXXXXXXXXXXSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETR 810
            K   ER  S             SLGD                 DS++ ++   + L ET 
Sbjct: 169  KY--ERAGSGRGDQYVKTDRRKSLGDQS---------------DSSSRKDTSGHRLKETS 211

Query: 811  KSSSKELDGQKRNVMERRKHEDNEFLVKKPKLCNADEGTGGKIISKFTCAADETPSSSKQ 990
                KEL+ +K    E+RK  DN  + K+     A E +  K I KFT    + P  S  
Sbjct: 212  WREGKELNAEKYVNDEKRKF-DNRSIYKEEGNGEAKEHSDDKSI-KFTETVTKKPKFSS- 268

Query: 991  VQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILW 1170
                +D   P    +S    V+  D++          ELVNKNLVG GYMSTDQKKK+LW
Sbjct: 269  ----LDSKAPVTDGTSEQPYVTDSDIDAAKIAAMKAAELVNKNLVGTGYMSTDQKKKLLW 324

Query: 1171 GNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            G+KK++A EES H WD   F D+ERQEKFNKLM
Sbjct: 325  GSKKSTATEESAHRWDTITFGDRERQEKFNKLM 357


>ref|XP_006589006.1| PREDICTED: arginine/serine-rich coiled-coil protein 2-like isoform X6
            [Glycine max]
          Length = 447

 Score =  144 bits (362), Expect = 1e-31
 Identities = 133/448 (29%), Positives = 184/448 (41%), Gaps = 40/448 (8%)
 Frame = +1

Query: 49   LGFMDPNLS-LSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPS 225
            L  MD NL  L P  +D + SFRKPS DAANR Y+ R                 H  S S
Sbjct: 19   LSMMDSNLPFLPPSNSDTKNSFRKPSGDAANRNYQHRSPVDRSPSPDAS----RHGHSSS 74

Query: 226  PEFHRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXX 405
            P   R+N A++S   R+ ++  RE + +                +RQ             
Sbjct: 75   PNPVRENSARVSHHSRKYDD--REHDQQYGRNHYGRSSDSLRHSDRQSFKSSFGHSRYDK 132

Query: 406  XXXXXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCD 585
                       E RY+            +  D  R+ES+   ++YQ  VDKY+ DK D  
Sbjct: 133  Y--------ANEDRYRERLLSRSGHE--SRDDHVREESDSRPKNYQCSVDKYSHDKYDRS 182

Query: 586  GHG---RRRLINSNLDEVK--------------------IGEERHNSXXXXXXXXXXXXX 696
             H    +RR   S   + K                    +  E H+              
Sbjct: 183  DHRSKEKRRDTYSEHQKYKDMDSSYEKSASSKRHALYDEVEREGHSRDWDGQNERRDSRR 242

Query: 697  SLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHED 876
            S GDY++D             +S   R++G+  L E  KS  KE + Q     E+RKH+D
Sbjct: 243  SSGDYRSDQRD----------ESGPQRDSGKFSLKEAYKSEQKESNDQNLPWEEKRKHDD 292

Query: 877  NEFLVKKP--------KLCNADEGTGGKIISKFTCA--------ADETPSSSKQVQEIVD 1008
             E    K         +    D+ + GK +  F           ADE+ +SS  +     
Sbjct: 293  TEIRKGKDWKTRKAGEQCAIEDKESSGKKLKLFDPDKDDNYRKDADESKTSSSNLSH-KS 351

Query: 1009 KVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTS 1188
            K      +SS  +  +  DL+          ELVN+NLVG G ++TDQKKK+LWG KK++
Sbjct: 352  KEDLWAVKSSGFDGDN--DLDAAKIAAMRAAELVNRNLVGPGCLTTDQKKKLLWGGKKST 409

Query: 1189 AAEESVHHWDMPLFSDQERQEKFNKLMV 1272
              EES H WD  +FSD+ERQEKFNKLMV
Sbjct: 410  PTEESGHRWDTGMFSDRERQEKFNKLMV 437

