BLASTX nr result

ID: Forsythia21_contig00020207 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00020207
         (2777 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011092382.1| PREDICTED: uncharacterized protein LOC105172...   626   e-176
ref|XP_011087795.1| PREDICTED: uncharacterized protein LOC105169...   623   e-175
ref|XP_012828376.1| PREDICTED: uncharacterized protein LOC105949...   507   e-140
ref|XP_012828377.1| PREDICTED: uncharacterized protein LOC105949...   493   e-136
ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241...   488   e-134
emb|CDO97516.1| unnamed protein product [Coffea canephora]            485   e-134
ref|XP_010277688.1| PREDICTED: uncharacterized protein YMR317W-l...   456   e-125
ref|XP_010245092.1| PREDICTED: uncharacterized protein LOC104588...   454   e-124
ref|XP_010245093.1| PREDICTED: uncharacterized protein LOC104588...   446   e-122
ref|XP_007041567.1| Uncharacterized protein isoform 1 [Theobroma...   429   e-117
ref|XP_007041568.1| Uncharacterized protein isoform 2 [Theobroma...   427   e-116
gb|EYU18535.1| hypothetical protein MIMGU_mgv1a006469mg [Erythra...   388   e-104
ref|XP_012467689.1| PREDICTED: uncharacterized protein LOC105786...   385   e-104
ref|XP_012839759.1| PREDICTED: uncharacterized protein LOC105960...   369   1e-98
ref|XP_012078152.1| PREDICTED: mediator of RNA polymerase II tra...   367   4e-98
ref|XP_012078151.1| PREDICTED: mediator of RNA polymerase II tra...   361   2e-96
ref|XP_010664264.1| PREDICTED: mediator of RNA polymerase II tra...   360   3e-96
ref|XP_012065652.1| PREDICTED: uncharacterized protein LOC105628...   357   4e-95
ref|XP_008338816.1| PREDICTED: uncharacterized serine-rich prote...   353   4e-94
ref|XP_011655200.1| PREDICTED: mediator of RNA polymerase II tra...   352   1e-93

>ref|XP_011092382.1| PREDICTED: uncharacterized protein LOC105172576 [Sesamum indicum]
          Length = 616

 Score =  626 bits (1614), Expect = e-176
 Identities = 345/589 (58%), Positives = 401/589 (68%), Gaps = 15/589 (2%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLHSDDNAGLKLARNRSSVNSDGHDLGRSLGXXXXX 1553
            MERSEPTL+PEWL+++G+L GG    HSD+    KLARN+S VNS+GHD  RS       
Sbjct: 1    MERSEPTLIPEWLRSAGSLNGGGSISHSDEQTTTKLARNKSLVNSNGHDSARSFSSDRTT 60

Query: 1552 XXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQDFSHPLGK 1373
                          HL+S++SF RNH DRDWEKD  DSRDK+KS  GDR  +DFS  +G 
Sbjct: 61   SSYFRRSSSSNGSGHLRSHSSFGRNHHDRDWEKDACDSRDKDKSVLGDRWHRDFSDAMGN 120

Query: 1372 ILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGSPIGSVNKAAF 1193
             L S FERDGLRRSQSMISGKRG+ W KK+  D          GL +KGSPIG VNK  F
Sbjct: 121  TLLSKFERDGLRRSQSMISGKRGDTWHKKVGTDLNIASGNNTNGLPSKGSPIGGVNKTTF 180

Query: 1192 EKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALAEVPVLAGSN 1013
            E+DFPSLGAEER+A P++GRVPSP +S+  Q+LP+GT  +I GEKW SALAEVPVL G+N
Sbjct: 181  ERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPTIIRGEKWRSALAEVPVLVGNN 240

Query: 1012 GTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQRLEELAIKQS 833
             TG+           A+VAL +TT LNMAEAVA GP R   TPQLS GTQRLEELAIKQS
Sbjct: 241  VTGISSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSIGTQRLEELAIKQS 300

Query: 832  KQLIPVTPSLPKTLVSNSTDKQKIKVGHQQH-----LPVTNSSRGAAVKSDVSKISSVGK 668
            +QLIPVTPS+PK L + S DKQK KVG QQH     L    S RG  VK+DVSK S+VGK
Sbjct: 301  RQLIPVTPSMPKPLAACSADKQKTKVGQQQHVVTSSLAANQSPRGGPVKADVSKTSNVGK 360

Query: 667  LQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPNNLVLPTVEG 488
            L VLKPVRE+NG +P VKENLSPT GSK                  R LPNN   P  + 
Sbjct: 361  LHVLKPVREKNGTTPVVKENLSPTSGSK-LVSSPLAAPSLSGSAATRVLPNN---PVADR 416

Query: 487  KPVLTALEKRPTSQAQSRNDFFNLVRKKSMGNSSSVSDPGM----PV------SLSVSDK 338
            KPV T LEKRPTSQAQSRNDFFN VRKKSM NS+SV+D  +    PV      S S SDK
Sbjct: 417  KPVWTVLEKRPTSQAQSRNDFFNSVRKKSMANSTSVADAAIANSSPVDTAPAASPSFSDK 476

Query: 337  LGETQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIERQKCLSNGKKHPISDH 158
            L ET++  AP T Q R+A S V+ SG +LS    D  C GD  + Q  +SNGKK+  SD 
Sbjct: 477  LTETEIVVAPNT-QDRNASSGVNLSGENLSGTRSDTACNGDVCDAQNYVSNGKKNHTSDP 535

Query: 157  LFSEEEEAAFLRSMGWEENADEGGLTEEEISDFYRDVHKYINSKPALNI 11
            +FSEEEEAAFLRS+GWEENADEGGLT+EEIS F+RDV KY++SKP+L I
Sbjct: 536  IFSEEEEAAFLRSLGWEENADEGGLTDEEISAFFRDVTKYVDSKPSLKI 584


>ref|XP_011087795.1| PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum]
          Length = 624

 Score =  623 bits (1606), Expect = e-175
 Identities = 341/595 (57%), Positives = 400/595 (67%), Gaps = 18/595 (3%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLHSDDNAGLKLARNRSSVNSDGHDLGRSLGXXXXX 1553
            MERSEPTLVPEWLKN+GNLTG     HSDD+A  ++ARN+S VNS+GH+ GRS       
Sbjct: 1    MERSEPTLVPEWLKNTGNLTGAGSISHSDDHAASRVARNKSFVNSNGHEFGRSSSSERTT 60

Query: 1552 XXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQDFSHPLGK 1373
                          + +SY+SF R+ RDRDWEKD YDSRD++KS   D    DFS PLG 
Sbjct: 61   SSYFRRSSSSNSSGNFRSYSSFGRSQRDRDWEKDVYDSRDQDKSVLADHWHWDFSDPLGN 120

Query: 1372 ILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGSPIGS-VNKAA 1196
             L S +ERDGLRRSQSM+SGKRG+ WPKK++ D          GL+ +GSP+G    KA 
Sbjct: 121  SLLSKYERDGLRRSQSMVSGKRGDTWPKKVVTDLSSASGKNANGLLYRGSPVGGRAKKAT 180

Query: 1195 FEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALAEVPVLAGS 1016
            FEKDFPSLGA+ER+  P++GRVPSP LST  Q+LPVGTS +I GEKWTSALAEVPVL GS
Sbjct: 181  FEKDFPSLGADERAVVPEVGRVPSPGLSTAIQSLPVGTSGLIVGEKWTSALAEVPVLVGS 240

Query: 1015 NGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQRLEELAIKQ 836
            NGT +           A+VAL +TT LNMAEAVA GP R   TPQLS GTQRLEELAIKQ
Sbjct: 241  NGTALSSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSVGTQRLEELAIKQ 300

Query: 835  SKQLIPVTPSLPKTLVSNSTDKQKIKVGHQQH-----LPVTNSSRGAAVKSDVSKISSVG 671
            S+QLIPVTPS+PK LV  S+DK K KVG QQH     LP+ +S RG AVK DV+K S+VG
Sbjct: 301  SRQLIPVTPSMPKALVLTSSDKPKGKVGQQQHSISSSLPLNHSPRGGAVKGDVAKASNVG 360

Query: 670  KLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPNNLVLPTVE 491
            KLQVLKPVRE+NG++P VK+NLSPT  SK                  R LPNN V    +
Sbjct: 361  KLQVLKPVREKNGVTPVVKDNLSPTSSSKVVTSTLAVSPSVSGSAATRGLPNNGV---HD 417

Query: 490  GKPVLTALEKRPTSQAQSRNDFFNLVRKKSMGNS------------SSVSDPGMPVSLSV 347
             KP LT LEKRPTSQAQSRNDFFNLVRKKSM NS            SSV D G  +S S 
Sbjct: 418  RKPSLTVLEKRPTSQAQSRNDFFNLVRKKSMPNSSSAVADSAMANCSSVLDTGTAISPSF 477

Query: 346  SDKLGETQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIERQKCLSNGKKHPI 167
            SDK  E  +  +  T +A D     S S   LSEE+ DLT  GDA + Q  + NGKK+P 
Sbjct: 478  SDKDVEIDILPSSNTPKAADVPLSNSLSADRLSEEKGDLTSNGDACDAQNYVRNGKKYPS 537

Query: 166  SDHLFSEEEEAAFLRSMGWEENADEGGLTEEEISDFYRDVHKYINSKPALNISPG 2
            SD + SEEEEAAFLRS+GW+EN+DEG LT+EEI+ FYRD+ KYI+S P+  I  G
Sbjct: 538  SDPIISEEEEAAFLRSLGWDENSDEGALTDEEINAFYRDLTKYIDSNPSFRILQG 592


>ref|XP_012828376.1| PREDICTED: uncharacterized protein LOC105949617 isoform X1
            [Erythranthe guttatus]
          Length = 575

 Score =  507 bits (1306), Expect = e-140
 Identities = 303/590 (51%), Positives = 371/590 (62%), Gaps = 13/590 (2%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLHSDDNAGLKLARNRSSVNSDGHDLGRSLGXXXXX 1553
            M+RSEP+LVP+WLKNSG+ TGG      D++   ++ARN+S VN++G+D GR+ G     
Sbjct: 1    MDRSEPSLVPQWLKNSGSSTGGG-----DNHPASRVARNKSFVNTNGNDFGRASGSAKTT 55

Query: 1552 XXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFG-DRRRQDFSHPLG 1376
                            +SY+SF RN RDRDWEKDTY+SRDKE+   G DR R + S  LG
Sbjct: 56   SSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHRYESSELLG 115

Query: 1375 KILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXG-LVNKGSPIGSVNKA 1199
                S +ERDGLRRS SMISGK GE WPKK++ +             + KGSP+G  NKA
Sbjct: 116  NPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGSPVGVANKA 175

Query: 1198 AFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALAEVPVLAG 1019
             FE+DFPSLG ++R+  P++GRV SP LS+  Q+LP+G+SA I GE+WTSALAEVP+L  
Sbjct: 176  TFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSALAEVPMLVV 235

Query: 1018 SNGT-GVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQRLEELAI 842
            SNGT  +           A+V +S+TT LNMAEAVA GP R    PQLS GTQRLEELAI
Sbjct: 236  SNGTASLSVQQAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLGTQRLEELAI 295

Query: 841  KQSKQLIPVTPSLPKTLVSNSTDKQKIKVG-HQQH-----LPVTNSSRGA-AVKSDVSKI 683
            KQS+QLIPVTP++PKTLV +S+DKQK KVG  QQH     LP+  S RGA   K D SK 
Sbjct: 296  KQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAPPSKPDFSKA 355

Query: 682  SSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPNNLVL 503
            S+VGKL VLKPVRE+NG++P+VK+ LSPTG  K                       N  L
Sbjct: 356  SNVGKLHVLKPVREKNGVTPSVKDKLSPTGSGKAV---------------------NSTL 394

Query: 502  PTVEG--KPVL-TALEKRPTSQAQSRNDFFNLVRKKSMGNSSSVSDPGMPVSLSVSDKLG 332
            P      KP+L TALEKRPT+QAQSRNDFF  +R+KS+ NSSS S+ G  +S        
Sbjct: 395  PASPSAVKPLLTTALEKRPTTQAQSRNDFFKRMREKSVSNSSSASETGTAIS-------P 447

Query: 331  ETQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIERQKCLSNGKKHPISDHLF 152
            E     A   A    A  P       L EE+   T     ++    +SNGKK+  S+ + 
Sbjct: 448  EKHAKVAVVPAAITGAVEP-------LPEEKAVRTTCNGGVQH---ISNGKKYN-SEPII 496

Query: 151  SEEEEAAFLRSMGWEENADEGGLTEEEISDFYRDVHKYINSKPALNISPG 2
            SEEEEA FLRSMGW+EN DEGGLTEEEIS FYRD  KYINSKP+L I  G
Sbjct: 497  SEEEEAKFLRSMGWDENDDEGGLTEEEISAFYRDFTKYINSKPSLRILQG 546


>ref|XP_012828377.1| PREDICTED: uncharacterized protein LOC105949617 isoform X2
            [Erythranthe guttatus]
          Length = 550

 Score =  493 bits (1268), Expect = e-136
 Identities = 297/588 (50%), Positives = 365/588 (62%), Gaps = 13/588 (2%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLHSDDNAGLKLARNRSSVNSDGHDLGRSLGXXXXX 1553
            M+RSEP+LVP+WLKNSG+ TGG      D++   ++ARN+S VN++G+D GR+ G     
Sbjct: 1    MDRSEPSLVPQWLKNSGSSTGGG-----DNHPASRVARNKSFVNTNGNDFGRASGSAKTT 55

Query: 1552 XXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFG-DRRRQDFSHPLG 1376
                            +SY+SF RN RDRDWEKDTY+SRDKE+   G DR R + S  LG
Sbjct: 56   SSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHRYESSELLG 115

Query: 1375 KILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXG-LVNKGSPIGSVNKA 1199
                S +ERDGLRRS SMISGK GE WPKK++ +             + KGSP+G  NKA
Sbjct: 116  NPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGSPVGVANKA 175

Query: 1198 AFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALAEVPVLAG 1019
             FE+DFPSLG ++R+  P++GRV SP LS+  Q+LP+G+SA I GE+WTSALAEVP+L  
Sbjct: 176  TFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSALAEVPMLVV 235

Query: 1018 SNGT-GVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQRLEELAI 842
            SNGT  +           A+V +S+TT LNMAEAVA GP R    PQLS GTQRLEELAI
Sbjct: 236  SNGTASLSVQQAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLGTQRLEELAI 295

Query: 841  KQSKQLIPVTPSLPKTLVSNSTDKQKIKVG-HQQH-----LPVTNSSRGA-AVKSDVSKI 683
            KQS+QLIPVTP++PKTLV +S+DKQK KVG  QQH     LP+  S RGA   K D SK 
Sbjct: 296  KQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAPPSKPDFSKA 355

Query: 682  SSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPNNLVL 503
            S+VGKL VLKPVRE+NG++P+VK+ LSPTG  K                       N  L
Sbjct: 356  SNVGKLHVLKPVREKNGVTPSVKDKLSPTGSGKAV---------------------NSTL 394

Query: 502  PTVEG--KPVL-TALEKRPTSQAQSRNDFFNLVRKKSMGNSSSVSDPGMPVSLSVSDKLG 332
            P      KP+L TALEKRPT+QAQSRNDFF  +R+KS+ NSSS S+ G  +S        
Sbjct: 395  PASPSAVKPLLTTALEKRPTTQAQSRNDFFKRMREKSVSNSSSASETGTAIS-------P 447

Query: 331  ETQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIERQKCLSNGKKHPISDHLF 152
            E     A   A    A  P       L EE+   T     ++    +SNGKK+  S+ + 
Sbjct: 448  EKHAKVAVVPAAITGAVEP-------LPEEKAVRTTCNGGVQH---ISNGKKYN-SEPII 496

Query: 151  SEEEEAAFLRSMGWEENADEGGLTEEEISDFYRDVHKYINSKPALNIS 8
            SEEEEA FLRSMGW+EN DEGGLTEEEIS FYRD  K     P L+ S
Sbjct: 497  SEEEEAKFLRSMGWDENDDEGGLTEEEISAFYRDFTKIGGISPGLSSS 544


>ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera]
          Length = 665

 Score =  488 bits (1256), Expect = e-134
 Identities = 295/634 (46%), Positives = 379/634 (59%), Gaps = 60/634 (9%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLH--------SDDNAGLKLARNRSSVNSDGHDLGR 1577
            M+++EP LVPEWLK+SG++TGG  T H        SDD A LK AR +  VNS+ HD GR
Sbjct: 1    MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPAR-KLMVNSNDHDTGR 59

Query: 1576 SLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQ 1397
            S                     H +S++SF R +R+R+WEKD +D RDK+KS   D R +
Sbjct: 60   SSNLERTTSSYFRRSSSSNGSGHPRSFSSFGRTNREREWEKDIHDYRDKDKSVLSDHRHR 119

Query: 1396 DFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXG---LVNKG 1226
            D+S PLG IL    ERD LRRSQSMI+GKRG+ WP+K+  D               +  G
Sbjct: 120  DYSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASG 179

Query: 1225 SPIGSVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSA 1046
                SV KAAF+++FPSLGAE++  APDIGRV SP L++  Q+LP+G + +I G+ WTSA
Sbjct: 180  IVTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSA 239

Query: 1045 LAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPA--TPQLSA 872
            LAEVPV+ GSN TGV            +VA STT+ LNMAE +  GP R  A  TPQLS 
Sbjct: 240  LAEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARANATPQLSV 299

Query: 871  GTQRLEELAIKQSKQLIPVTPSLPKTLVSNSTDKQKIKVGHQQHLPVTNSSRGAAVKSDV 692
            GTQRLEELA+KQS+QLIP+TPS+PKTLV + +DK K K+G Q    V +S RG   +SDV
Sbjct: 300  GTQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQPLHLVNHSQRGGPARSDV 359

Query: 691  SKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPNN 512
            +K S+VGKL VLKP RERNG+SP  K++LSPT GS+                  R   NN
Sbjct: 360  TKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPLAVTPSAAGSASLRSPRNN 419

Query: 511  LVLPTVEGKP--VLTALEKRPTSQAQSRNDFFNLVRKKSMGN-SSSVSDPGMPVSLSVSD 341
              L + E +P  VLT++EKRPTSQAQSRNDFFNL+RKKS  N  S+V + G  VS SVS+
Sbjct: 420  PTLASAERRPSVVLTSVEKRPTSQAQSRNDFFNLMRKKSSTNPPSAVPESGPAVSSSVSE 479

Query: 340  KLGE--TQVSSAPTTAQARDAQSPVSSS--------------------GGHLSEEEDDLT 227
            K  E  T+V +AP T + RD  S  +S                     G   ++ +D++ 
Sbjct: 480  KSDELITEVVTAPVTPKGRDILSSDNSGLDWSNENRGDKTENGNNEACGVSQNDRDDEID 539

Query: 226  CI-GDAIE--------------------RQKCLSNGKKHPISDH-LFSEEEEAAFLRSMG 113
             + GDA +                     QK L NG+KH   D  L+ +EEEAAFLRS+G
Sbjct: 540  NVNGDACDVSQRDQGDEVHDGNGDACDVSQKFLDNGEKHSSPDEVLYPDEEEAAFLRSLG 599

Query: 112  WEENADEGGLTEEEISDFYRDVHKYINSKPALNI 11
            WEEN ++ GLTEEEI+ FY++  K    KP+ N+
Sbjct: 600  WEENGEDEGLTEEEINAFYKECMKL---KPSSNL 630


>emb|CDO97516.1| unnamed protein product [Coffea canephora]
          Length = 599

 Score =  485 bits (1249), Expect = e-134
 Identities = 281/584 (48%), Positives = 358/584 (61%), Gaps = 13/584 (2%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLH----SDDNAGLKLARNRSSVNSDGHDLGRSLGX 1565
            MERSEP+LVPEWLK+SG+ TG   T H    SDD+A  KLARN+SSVN + H++GRS   
Sbjct: 1    MERSEPSLVPEWLKSSGSATGSGTTSHPLSPSDDHAVSKLARNKSSVNHNDHEIGRSSVS 60

Query: 1564 XXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQDFSH 1385
                               +QSY+SF RNHR RDW+KD Y+ RD++    G  + +D+  
Sbjct: 61   DRTSASYFRRSSSSNGSGQMQSYSSFGRNHRGRDWDKDLYEPRDRDNLVVGGHKHRDYLD 120

Query: 1384 PLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXG---LVNKGSPIG 1214
            P       +FE+DGLRRSQSM+S KR E WPK+ + D              L++KG  +G
Sbjct: 121  PPVNNFPGNFEKDGLRRSQSMVSRKRNEIWPKRSIADSNSASRNKSTDGNSLLDKGDSVG 180

Query: 1213 SVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALAEV 1034
            +V+K  FE+DFPSLG+EER A  ++GRVPSP L+T    LP+  SA+I+G+KWTSALAEV
Sbjct: 181  TVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAIHGLPISASAIIAGDKWTSALAEV 240

Query: 1033 PVLAGSNGTGV-XXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQRL 857
            P + G  GTG+            A++  ST+  LNMAE VA G PR  A P++++GTQRL
Sbjct: 241  PAIVGGGGTGLSPGRQASLPSSPASLPSSTSAGLNMAETVAQG-PRVQAAPKITSGTQRL 299

Query: 856  EELAIKQSKQLIPVTPSLPKTLVSNSTDKQKIKVGHQQHLPVTN-----SSRGAAVKSDV 692
            EELAI+QS+QLIP+TPS+PK  + NS+DK K K G  QH PV++     S RG  VK+D 
Sbjct: 300  EELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAGQPQH-PVSSPLLSPSLRGGPVKTDA 358

Query: 691  SKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPNN 512
            SK S+ GKL VLKP RERNG+S A K+ LSPT  ++                 +R    N
Sbjct: 359  SKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTRAATSGIAVATSVTGLATSRGPAIN 418

Query: 511  LVLPTVEGKPVLTALEKRPTSQAQSRNDFFNLVRKKSMGNSSSVSDPGMPVSLSVSDKLG 332
             V P  E K  L  LEK+P+SQAQSRNDFFNL+RKKSM +SSSV+D G  VS S  D+ G
Sbjct: 419  PVSPGAERKHALPMLEKKPSSQAQSRNDFFNLMRKKSMPSSSSVADAGSAVSASTLDEPG 478

Query: 331  ETQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIERQKCLSNGKKHPISDHLF 152
            E +V  AP   +  D  S    +G      E+DL  I                  S  LF
Sbjct: 479  ELEVIPAPVIHEDEDVPSLDRLNG--CQHTENDLFGIQSR---------------SLPLF 521

Query: 151  SEEEEAAFLRSMGWEENADEGGLTEEEISDFYRDVHKYINSKPA 20
            SEEEEAAFL  +GW+ENADE GLTEEEI+ F+RD+ KY+NSKP+
Sbjct: 522  SEEEEAAFLHQLGWQENADEDGLTEEEINAFFRDLSKYMNSKPS 565


>ref|XP_010277688.1| PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera]
            gi|720070295|ref|XP_010277689.1| PREDICTED:
            uncharacterized protein YMR317W-like [Nelumbo nucifera]
          Length = 655

 Score =  456 bits (1173), Expect = e-125
 Identities = 290/626 (46%), Positives = 370/626 (59%), Gaps = 49/626 (7%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGG--------SPTLHSDDNAGLKLARNRSSVNSDGHDLGR 1577
            M + EPTLVPEWLK +G++TGG        S + HSDD+A     RNR ++++  +D  R
Sbjct: 1    MAKGEPTLVPEWLKGTGSITGGGNTTHHFASSSTHSDDHAVALTTRNRLTMSTGDYDTPR 60

Query: 1576 SLGXXXXXXXXXXXXXXXXXXTHL---------QSYNSFSRNHRDRDWEKDTYDSRDKEK 1424
            S                    + +         +SY+SF+R+HRDRDWEKDT D RDKEK
Sbjct: 61   SSAFLDRTSSAYFRRSSSSNGSMMHDKETSTYSRSYSSFTRSHRDRDWEKDTLDYRDKEK 120

Query: 1423 SDFGDRRRQDFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILID--XXXXXXXX 1250
            S  GD R +D+S PL  IL+S  E+D LRRSQSMISGKRGE W +++  D          
Sbjct: 121  SILGDHRDRDYSDPLASILTSRXEKDTLRRSQSMISGKRGEGWSRRVAADTNNGNNNHNN 180

Query: 1249 XXGLVNKGSPIGSVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMI 1070
              GL+  GS + S+ KAAFE+DFPSLGAEE+  A DIGRV SP LS+  Q+LP+G+SA+I
Sbjct: 181  GNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPIGSSAVI 240

Query: 1069 SGEKWTSALAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPA 890
             G+ WTSALAEVPV+ G+N  G             + A +++T LNMAE +A  P RT  
Sbjct: 241  GGDGWTSALAEVPVIIGNNSIGPSSVQQATPASSTSGAPNSSTGLNMAETLAQAPSRTRI 300

Query: 889  TPQLSAGTQRLEELAIKQSKQLIPVTPSLPKTLVSNSTDKQK-------------IKVGH 749
            +PQLS  TQRLEELAIKQS+QLIP+TPS+PKT   NS++K K              K   
Sbjct: 301  SPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEKAKPKAVVRTGEMGISAKTSQ 360

Query: 748  QQHLP----VTNSSRGAAVKSDVSKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKX 581
            QQ LP    V +S RG  V+SDV K S  GKL VLK  RE+NGISP+ K+ LSPT  SK 
Sbjct: 361  QQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPREKNGISPSAKDGLSPTNASKV 420

Query: 580  XXXXXXXXXXXXXXXXARPLPNNLVLPTVEGKPVL------TALEKRP-TSQAQSRNDFF 422
                             R  PNN  LP  E K V       +A+EKRP TSQ QSRNDFF
Sbjct: 421  VNNSLVLAPLAAYAPPMRS-PNNSKLPN-ERKSVASSLTHGSAVEKRPTTSQVQSRNDFF 478

Query: 421  NLVRKKSMGN-SSSVSDPGMPVSLSVSDKLGETQ--VSSAPTTAQARDAQSPVSSSGGHL 251
            NL+RKK+ GN +S+V DP    S S+ +K  E    V +AP + Q+ DA S   S     
Sbjct: 479  NLMRKKTSGNLASAVPDPSPTASSSLLEKSSEPTEVVPTAPVSPQSSDAPSSEPSGLDWS 538

Query: 250  SEEEDDLTCIGDAIER-QKCLSNGKKHPISD-HLFSEEEEAAFLRSMGWEENA-DEGGLT 80
            +E   DL   GD  E  Q+  +NG+K   +D  ++ +EEEAAFLRS+GW+ENA +E GLT
Sbjct: 539  TENGGDLVSNGDVSEESQRFSNNGEKRSTADAFVYPDEEEAAFLRSLGWDENAGEEEGLT 598

Query: 79   EEEISDFYRDVHKYINSKPALNISPG 2
            EEEIS FYR+   Y+  +P+  +  G
Sbjct: 599  EEEISAFYRE---YMKVRPSSRLCQG 621


>ref|XP_010245092.1| PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo
            nucifera]
          Length = 645

 Score =  454 bits (1167), Expect = e-124
 Identities = 287/617 (46%), Positives = 367/617 (59%), Gaps = 40/617 (6%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLH--------SDDNAGLKLARNRSSVNSDGHDLGR 1577
            M +SEPTLVPEWLK +G +TG   T H        SDDNA     RNRSS++   +D  R
Sbjct: 1    MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDDNAVALPTRNRSSLSIGDYDTPR 60

Query: 1576 SLG---------XXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEK 1424
            S                             ++ +SY++F+R+HRDRDWEKD  D RDKE+
Sbjct: 61   SSAFSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKER 120

Query: 1423 SDFGDRRRQDFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILID--XXXXXXXX 1250
            S  GD R  DFS PL  IL+S  E+D LRRSQSM+SGKRGE WP+K+  D          
Sbjct: 121  SVPGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAADLNNGNINQNT 180

Query: 1249 XXGLVNKGSPIGSVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMI 1070
              GL+  GS + S+ KAAFE+DFPSLGAEE+   PDIGRV SP LS+  Q+LP+G+SA+I
Sbjct: 181  SNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGSSALI 240

Query: 1069 SGEKWTSALAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPA 890
             G+ WTSALAEVP++ G+NGTG+           A+ A +++T LNMAE +A  P R   
Sbjct: 241  GGDGWTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPSRARI 300

Query: 889  TPQLSAGTQRLEELAIKQSKQLIPVTPSLPKTLVSNSTDKQKIKVG------------HQ 746
            +PQLS  TQRLEELAIKQS+QLIP+TPS+PKT V NS +K K K+              Q
Sbjct: 301  SPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATKTIQQ 360

Query: 745  QHLPVTNSSRGAAVKSDVSKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXX 566
            Q L   +S RGA ++SDVSK S  GKL VLK  RE+NGISP  K+  SPT  SK      
Sbjct: 361  QQL---SSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANNPL 417

Query: 565  XXXXXXXXXXXARPLPNNLVLPTVEGKPVL---TALEKRP-TSQAQSRNDFFNLVRKKSM 398
                         P  + L          L   +++EKRP TSQ QSRNDFFNL+RKK+ 
Sbjct: 418  ALAPSAAFTPLKSPNNSKLSNERKSAAASLMHGSSVEKRPTTSQVQSRNDFFNLMRKKTS 477

Query: 397  GN-SSSVSDPGMPVSLSVSDKLGE-TQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTC 224
            GN SS+  DP   VS S+ DK  E T + +AP + Q+ DA SP  S     +E   +   
Sbjct: 478  GNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQSSDAPSPDPSCLDWSTENGSETIS 537

Query: 223  IGDAIER-QKCLSNGKKHPISD-HLFSEEEEAAFLRSMGWEENA-DEGGLTEEEISDFYR 53
             G+A E  Q+ L+NG+KH   D  ++ +EEEAAFLRS+GW+ENA +E GLTEEEIS FY+
Sbjct: 538  NGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWDENAGEEEGLTEEEISAFYK 597

Query: 52   DVHKYINSKPALNISPG 2
            +   Y+  +P+  +  G
Sbjct: 598  E---YMKLRPSSKLCRG 611


>ref|XP_010245093.1| PREDICTED: uncharacterized protein LOC104588732 isoform X2 [Nelumbo
            nucifera]
          Length = 616

 Score =  446 bits (1147), Expect = e-122
 Identities = 277/600 (46%), Positives = 360/600 (60%), Gaps = 23/600 (3%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLHSDDNAGLKLARNRSSVNSDGHDLGRSLGXXXXX 1553
            M +SEPTLVPEWLK +G +TG   T H   ++ L+  R  S+ +        S+      
Sbjct: 1    MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDRTSSAYSRRSSSSNGSI------ 54

Query: 1552 XXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQDFSHPLGK 1373
                         ++ +SY++F+R+HRDRDWEKD  D RDKE+S  GD R  DFS PL  
Sbjct: 55   ------VHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKERSVPGDHRDLDFSDPLVS 108

Query: 1372 ILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXG--LVNKGSPIGSVNKA 1199
            IL+S  E+D LRRSQSM+SGKRGE WP+K+  D             L+  GS + S+ KA
Sbjct: 109  ILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAADLNNGNINQNTSNGLLVGGSIVSSIQKA 168

Query: 1198 AFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALAEVPVLAG 1019
            AFE+DFPSLGAEE+   PDIGRV SP LS+  Q+LP+G+SA+I G+ WTSALAEVP++ G
Sbjct: 169  AFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGSSALIGGDGWTSALAEVPMIIG 228

Query: 1018 SNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQRLEELAIK 839
            +NGTG+           A+ A +++T LNMAE +A  P R   +PQLS  TQRLEELAIK
Sbjct: 229  NNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPSRARISPQLSVETQRLEELAIK 288

Query: 838  QSKQLIPVTPSLPKTLVSNSTDKQKIKVG------------HQQHLPVTNSSRGAAVKSD 695
            QS+QLIP+TPS+PKT V NS +K K K+              QQ L   +S RGA ++SD
Sbjct: 289  QSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATKTIQQQQL---SSLRGAPMRSD 345

Query: 694  VSKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPN 515
            VSK S  GKL VLK  RE+NGISP  K+  SPT  SK                   P  +
Sbjct: 346  VSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANNPLALAPSAAFTPLKSPNNS 405

Query: 514  NLVLPTVEGKPVL---TALEKRP-TSQAQSRNDFFNLVRKKSMGN-SSSVSDPGMPVSLS 350
             L          L   +++EKRP TSQ QSRNDFFNL+RKK+ GN SS+  DP   VS S
Sbjct: 406  KLSNERKSAAASLMHGSSVEKRPTTSQVQSRNDFFNLMRKKTSGNLSSAAPDPSPVVSSS 465

Query: 349  VSDKLGE-TQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIER-QKCLSNGKK 176
            + DK  E T + +AP + Q+ DA SP  S     +E   +    G+A E  Q+ L+NG+K
Sbjct: 466  LLDKSTEQTALPAAPVSPQSSDAPSPDPSCLDWSTENGSETISNGNASEESQRFLNNGEK 525

Query: 175  HPISD-HLFSEEEEAAFLRSMGWEENA-DEGGLTEEEISDFYRDVHKYINSKPALNISPG 2
            H   D  ++ +EEEAAFLRS+GW+ENA +E GLTEEEIS FY++   Y+  +P+  +  G
Sbjct: 526  HSSPDAFVYPDEEEAAFLRSLGWDENAGEEEGLTEEEISAFYKE---YMKLRPSSKLCRG 582


>ref|XP_007041567.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508705502|gb|EOX97398.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 625

 Score =  429 bits (1102), Expect = e-117
 Identities = 270/596 (45%), Positives = 353/596 (59%), Gaps = 21/596 (3%)
 Frame = -2

Query: 1735 VMERSEPTLVPEWLKNSGNLTGG--------SPTLHSDDNAGLKLARNRSSVNSDGHDLG 1580
            VMERSEP+LVPEWLK+ G++TG         S +LHSD+++ L+  RN+ SV  D HD+G
Sbjct: 5    VMERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGD-HDVG 63

Query: 1579 RSLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRR 1400
             +                     HL+SY+SF++ HRDRDW+KD     D+EKS   D R 
Sbjct: 64   GTSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRN 123

Query: 1399 QDFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGSP 1220
            ++FS  L  +L S FE+D L RSQS I+GKR + WPKK+  D                S 
Sbjct: 124  RNFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSG 182

Query: 1219 IGSV--NKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSA 1046
            + +   NK+ FE++FP LGAEER  A +IGRV SP LST  Q+LPVGTSA+   + WTSA
Sbjct: 183  VSTTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSA 242

Query: 1045 LAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGT 866
            LA++P   GS+GTGV           A++A +T T LNMAE +  GP R    P L+ GT
Sbjct: 243  LADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGT 302

Query: 865  QRLEELAIKQSKQLIP-VTPSLPKTLVSNSTDKQKIKVGHQQHLPVT-NSSRGAAVKSDV 692
            QRLEELAIKQS+QL+P VT S PK LV + ++K K KVG QQH  ++ N +RG   +SD 
Sbjct: 303  QRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSRSDS 362

Query: 691  SKISSVGKLQVLKPVRERNGISPAVKENLSPT-GGSKXXXXXXXXXXXXXXXXXARPLPN 515
             K+S+ G+L++LKP RE NG+S   K+NLSPT G SK                  R   N
Sbjct: 363  LKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSVTPSASASAPFRSSGN 422

Query: 514  NLVLPTVEGK--PVLTALEKRPTSQAQSRNDFFNLVRKKSMGNS-SSVSDPGMPVSLSVS 344
            +    T E    P    +EKRPT+QAQSRNDFFNL++KKS  NS SSV+D G   S SVS
Sbjct: 423  SPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRGPAASPSVS 482

Query: 343  DKLGE--TQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIE-RQKCLSNGKKH 173
            +K  E  T+ +S   T Q     S   S     ++   ++T  GDA    Q+C SNG +H
Sbjct: 483  EKSDELGTEDASTSVTLQGGSVPSSEISIADLPTDNRSEITHNGDAYSGSQQCSSNGDRH 542

Query: 172  PISD-HLFSEEEEAAFLRSMGWEENA-DEGGLTEEEISDFYRDVHKYINSKPALNI 11
               D  L+ +EEEAAFLRS+GWEENA D+ GLTEEEIS F+ +   ++  KP+  +
Sbjct: 543  ARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPSAKL 595


>ref|XP_007041568.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508705503|gb|EOX97399.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 620

 Score =  427 bits (1098), Expect = e-116
 Identities = 269/595 (45%), Positives = 352/595 (59%), Gaps = 21/595 (3%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGG--------SPTLHSDDNAGLKLARNRSSVNSDGHDLGR 1577
            MERSEP+LVPEWLK+ G++TG         S +LHSD+++ L+  RN+ SV  D HD+G 
Sbjct: 1    MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGD-HDVGG 59

Query: 1576 SLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQ 1397
            +                     HL+SY+SF++ HRDRDW+KD     D+EKS   D R +
Sbjct: 60   TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 119

Query: 1396 DFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGSPI 1217
            +FS  L  +L S FE+D L RSQS I+GKR + WPKK+  D                S +
Sbjct: 120  NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGV 178

Query: 1216 GSV--NKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSAL 1043
             +   NK+ FE++FP LGAEER  A +IGRV SP LST  Q+LPVGTSA+   + WTSAL
Sbjct: 179  STTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSAL 238

Query: 1042 AEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQ 863
            A++P   GS+GTGV           A++A +T T LNMAE +  GP R    P L+ GTQ
Sbjct: 239  ADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGTQ 298

Query: 862  RLEELAIKQSKQLIP-VTPSLPKTLVSNSTDKQKIKVGHQQHLPVT-NSSRGAAVKSDVS 689
            RLEELAIKQS+QL+P VT S PK LV + ++K K KVG QQH  ++ N +RG   +SD  
Sbjct: 299  RLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSRSDSL 358

Query: 688  KISSVGKLQVLKPVRERNGISPAVKENLSPT-GGSKXXXXXXXXXXXXXXXXXARPLPNN 512
            K+S+ G+L++LKP RE NG+S   K+NLSPT G SK                  R   N+
Sbjct: 359  KVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSVTPSASASAPFRSSGNS 418

Query: 511  LVLPTVEGK--PVLTALEKRPTSQAQSRNDFFNLVRKKSMGNS-SSVSDPGMPVSLSVSD 341
                T E    P    +EKRPT+QAQSRNDFFNL++KKS  NS SSV+D G   S SVS+
Sbjct: 419  PSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRGPAASPSVSE 478

Query: 340  KLGE--TQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIE-RQKCLSNGKKHP 170
            K  E  T+ +S   T Q     S   S     ++   ++T  GDA    Q+C SNG +H 
Sbjct: 479  KSDELGTEDASTSVTLQGGSVPSSEISIADLPTDNRSEITHNGDAYSGSQQCSSNGDRHA 538

Query: 169  ISD-HLFSEEEEAAFLRSMGWEENA-DEGGLTEEEISDFYRDVHKYINSKPALNI 11
              D  L+ +EEEAAFLRS+GWEENA D+ GLTEEEIS F+ +   ++  KP+  +
Sbjct: 539  RPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPSAKL 590


>gb|EYU18535.1| hypothetical protein MIMGU_mgv1a006469mg [Erythranthe guttata]
          Length = 443

 Score =  388 bits (996), Expect = e-104
 Identities = 236/453 (52%), Positives = 285/453 (62%), Gaps = 12/453 (2%)
 Frame = -2

Query: 1324 MISGKRGENWPKKILIDXXXXXXXXXXG-LVNKGSPIGSVNKAAFEKDFPSLGAEERSAA 1148
            MISGK GE WPKK++ +             + KGSP+G  NKA FE+DFPSLG ++R+  
Sbjct: 1    MISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGSPVGVANKATFERDFPSLGTDDRAVV 60

Query: 1147 PDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALAEVPVLAGSNGT-GVXXXXXXXXXX 971
            P++GRV SP LS+  Q+LP+G+SA I GE+WTSALAEVP+L  SNGT  +          
Sbjct: 61   PEVGRVASPGLSSALQSLPIGSSASIGGERWTSALAEVPMLVVSNGTASLSVQQAAPSST 120

Query: 970  XATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQRLEELAIKQSKQLIPVTPSLPKTL 791
             A+V +S+TT LNMAEAVA GP R    PQLS GTQRLEELAIKQS+QLIPVTP++PKTL
Sbjct: 121  TASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLGTQRLEELAIKQSRQLIPVTPTMPKTL 180

Query: 790  VSNSTDKQKIKVG-HQQH-----LPVTNSSRGA-AVKSDVSKISSVGKLQVLKPVRERNG 632
            V +S+DKQK KVG  QQH     LP+  S RGA   K D SK S+VGKL VLKPVRE+NG
Sbjct: 181  VLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAPPSKPDFSKASNVGKLHVLKPVREKNG 240

Query: 631  ISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPNNLVLPTVEG--KPVL-TALEK 461
            ++P+VK+ LSPTG  K                       N  LP      KP+L TALEK
Sbjct: 241  VTPSVKDKLSPTGSGKAV---------------------NSTLPASPSAVKPLLTTALEK 279

Query: 460  RPTSQAQSRNDFFNLVRKKSMGNSSSVSDPGMPVSLSVSDKLGETQVSSAPTTAQARDAQ 281
            RPT+QAQSRNDFF  +R+KS+ NSSS S+ G  +S        E     A   A    A 
Sbjct: 280  RPTTQAQSRNDFFKRMREKSVSNSSSASETGTAIS-------PEKHAKVAVVPAAITGAV 332

Query: 280  SPVSSSGGHLSEEEDDLTCIGDAIERQKCLSNGKKHPISDHLFSEEEEAAFLRSMGWEEN 101
             P       L EE+   T     ++    +SNGKK+  S+ + SEEEEA FLRSMGW+EN
Sbjct: 333  EP-------LPEEKAVRTTCNGGVQH---ISNGKKYN-SEPIISEEEEAKFLRSMGWDEN 381

Query: 100  ADEGGLTEEEISDFYRDVHKYINSKPALNISPG 2
             DEGGLTEEEIS FYRD  KYINSKP+L I  G
Sbjct: 382  DDEGGLTEEEISAFYRDFTKYINSKPSLRILQG 414


>ref|XP_012467689.1| PREDICTED: uncharacterized protein LOC105786006 [Gossypium raimondii]
            gi|823135857|ref|XP_012467690.1| PREDICTED:
            uncharacterized protein LOC105786006 [Gossypium
            raimondii] gi|763748559|gb|KJB15998.1| hypothetical
            protein B456_002G207700 [Gossypium raimondii]
            gi|763748560|gb|KJB15999.1| hypothetical protein
            B456_002G207700 [Gossypium raimondii]
          Length = 629

 Score =  385 bits (990), Expect = e-104
 Identities = 253/598 (42%), Positives = 342/598 (57%), Gaps = 23/598 (3%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGG----------SPTLHSDDNAGLKLARNRSSVNSDGHDL 1583
            MERSEP+LVPEWLK SG+LTG           S + HSD+++ ++ ARN+ SV+SDG D+
Sbjct: 1    MERSEPSLVPEWLKCSGSLTGSGNSNNQFTSSSSSSHSDNHSAVRHARNKLSVDSDG-DI 59

Query: 1582 GRSLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRR 1403
            GR+                    +   SY++F + HR+RDWEK +    D++ +   D+R
Sbjct: 60   GRTSVLDRASSAYFRRSSSSKGASDSWSYSNFGKGHRERDWEKVSNGYHDRKNAVLSDQR 119

Query: 1402 RQDFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGS 1223
             ++ S  L  +L S FE+D LRRSQS+ +GK  + WP+K   +              K +
Sbjct: 120  NRNHSDSLDNLLPSMFEKDVLRRSQSLKTGKHSDTWPRKATNESSGTSKSHHSSGNGKLT 179

Query: 1222 PIGSV-NKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSA 1046
             + +V NK+AFE+DFPSLGAE R    +IGR+ SP L+   Q+LPVGTS ++  +  TSA
Sbjct: 180  TVAAVGNKSAFERDFPSLGAEVRQVGSEIGRILSPGLTNPVQSLPVGTSPVLGSDGRTSA 239

Query: 1045 LAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGT 866
            LA++PV  G++G GV                +  T LNMAEAVA GP R    P L+  T
Sbjct: 240  LADIPVGVGNSGRGVAVASQNVPAGSTP---TMVTGLNMAEAVAQGPSRARTPPLLNVET 296

Query: 865  QRLEELAIKQSKQLIP-VTPSLPKTLVSNSTDKQKIKVGHQQHLPVT-NSSRGAAVKSDV 692
            QRLEELAIKQS+QLIP VT S PKTLV + ++K + KVG Q H  ++  S+RG   +SD 
Sbjct: 297  QRLEELAIKQSRQLIPLVTVSTPKTLVVSPSEKSRPKVGQQLHPSLSFGSTRGGTSRSDS 356

Query: 691  SKISSVGKLQVLKPVRERNGISP-AVKENLSPTGGS-KXXXXXXXXXXXXXXXXXARPLP 518
             K+S+  +L +LKP RE NG+S    ++NLSPT GS K                  R   
Sbjct: 357  QKVSNESRLLILKPSRESNGVSSITTRDNLSPTNGSNKFANSPINITPSAAASVPFRSSG 416

Query: 517  NNLVLPTVEGK--PVLTALEKRPTSQAQSRNDFFNLVRKKSMGNS-SSVSDPGMPVSLSV 347
            N+  L T E    PV   +EKR T+QAQSRNDFFNL++KKS  NS SSV D G  VS  V
Sbjct: 417  NSPRLATAERNQTPVRMTMEKRATAQAQSRNDFFNLLKKKSTSNSASSVLDSGSAVSPPV 476

Query: 346  SDKLGETQVSSAPTTAQARDAQSPVSS--SGGHLSEEEDDLTCIGDA-IERQKCLSNGKK 176
            S+K  E     + T+   +D   P S        ++   ++   GDA  E Q   SNG +
Sbjct: 477  SEKSDELGTEDSSTSVTLQDGGVPSSEILIADLPADNRSEVALNGDAYAESQHGSSNGDE 536

Query: 175  HPISD-HLFSEEEEAAFLRSMGWEENA-DEGGLTEEEISDFYRDVHKYINSKPALNIS 8
            H   D +L+ +EEE AFLRS+GWEENA D+ GLTEEEIS F+    +Y+  KP+  +S
Sbjct: 537  HSRPDAYLYPDEEEVAFLRSLGWEENAEDDDGLTEEEISTFF---EQYMKLKPSAKVS 591


>ref|XP_012839759.1| PREDICTED: uncharacterized protein LOC105960131 [Erythranthe
            guttatus]
          Length = 436

 Score =  369 bits (946), Expect = 1e-98
 Identities = 231/482 (47%), Positives = 280/482 (58%), Gaps = 7/482 (1%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLHSDDNAGLKLARNRSSVNSDGHDLGRSLGXXXXX 1553
            MERSEPTLVPEWL+N G+L GG    HSD     KL RN+S VNS+G+D GRSL      
Sbjct: 1    MERSEPTLVPEWLRNPGSLNGGGSASHSDGKNASKLVRNKSFVNSNGNDFGRSLSSDRTT 60

Query: 1552 XXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQDFSHPLGK 1373
                          + +S+ SF R       + DTYDSR+K+KS  G+RR  +FS   G 
Sbjct: 61   SSYFRRSSSNNGSGNSRSHTSFGRK------QHDTYDSREKDKSVLGNRR--NFSDSFGN 112

Query: 1372 -ILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGSPIGSVNKAA 1196
              LSS FER+GLR SQS+ S K  + W +K+  +           L+ K SPIG VNK  
Sbjct: 113  NTLSSKFEREGLRHSQSIDSAKHADTWHRKVTTNSGRNNTDG---LLTKNSPIGEVNKKT 169

Query: 1195 FEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALAEVPVLAGS 1016
            F++DFPSLG E+R+       +PSP LS+  Q+LP  TS++I+GEKWTSALAEVPV  GS
Sbjct: 170  FKRDFPSLGTEDRTV------IPSPGLSSPIQSLPSCTSSLINGEKWTSALAEVPVSVGS 223

Query: 1015 NGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQRLEELAIKQ 836
            +G G+               L+  +  +MAEAV  GP R    PQLS GTQRLEELAIK+
Sbjct: 224  HGNGILSVQE----------LAPLSSASMAEAVVQGPSRVQTAPQLSMGTQRLEELAIKK 273

Query: 835  SKQLIPVTPSLPKTLVSNSTDKQKIKVGHQQH-----LPVTNSSRGAAVKSDVSKIS-SV 674
            SKQLIPVTPS PKTLV NSTDK K K     H     LPV  S RG   K+D SK S +V
Sbjct: 274  SKQLIPVTPSTPKTLVLNSTDKHKTKASQHNHPISSSLPVNQSPRGGPTKADFSKASTTV 333

Query: 673  GKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXXXXXARPLPNNLVLPTV 494
            GKL VLKP+RE NG+   VK+N S +G SK                  R  PNN ++P  
Sbjct: 334  GKLHVLKPMREINGV---VKDNSSASGSSK-------LTSSSTPAAPTRGPPNNHLVP-- 381

Query: 493  EGKPVLTALEKRPTSQAQSRNDFFNLVRKKSMGNSSSVSDPGMPVSLSVSDKLGETQVSS 314
            + KPV+T LEKRPTSQAQSRNDFFN VRKKSM           P   S S+KL +   + 
Sbjct: 382  DHKPVITVLEKRPTSQAQSRNDFFNTVRKKSM---------AFPSPSSSSEKLSDLVAAV 432

Query: 313  AP 308
             P
Sbjct: 433  EP 434


>ref|XP_012078152.1| PREDICTED: mediator of RNA polymerase II transcription subunit 1
            isoform X2 [Jatropha curcas]
          Length = 599

 Score =  367 bits (941), Expect = 4e-98
 Identities = 261/613 (42%), Positives = 330/613 (53%), Gaps = 36/613 (5%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLH--------SDDNAGLKLARNRSSVNSDGHDLGR 1577
            MERSEPTLVPEWL++SG+++GG  ++H        SD ++     RNR+S      D  R
Sbjct: 1    MERSEPTLVPEWLRSSGSVSGGGSSVHHFASSSSLSDVSSSAHHTRNRNSKGLTDFDSPR 60

Query: 1576 SLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQ 1397
            S                     H  +Y+SFSR+HRD+D E      RDKE+ +F D   +
Sbjct: 61   SAFLDRTSSSNSRRSSINGSAKH--AYSSFSRSHRDKDRE------RDKERLNFVDHWDR 112

Query: 1396 DFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXG---LVNKG 1226
            D   PLG ILSS  E+D LRRS SM+S K+GE  P++  +D              L++ G
Sbjct: 113  DGPDPLGSILSSRSEKDTLRRSHSMVSRKQGEVLPRRFAVDLKNGSSGNHTNGNGLLSGG 172

Query: 1225 SPIGSVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSA 1046
                ++ KA FEKDFPSLG EER   P+IGRV SPSLST  QNLPVG+SA+I GE WTSA
Sbjct: 173  IVGSNIQKAVFEKDFPSLGCEERQGVPEIGRVSSPSLSTAVQNLPVGSSALIGGEGWTSA 232

Query: 1045 LAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGT 866
            LAEVP L G++ TG            A+   S    LNMAEA+   P RT   PQLS  T
Sbjct: 233  LAEVPALIGNSSTG-SLSSVQSVAASASACPSVMAGLNMAEALTQAPSRTRTAPQLSVQT 291

Query: 865  QRLEELAIKQSKQLIPVTPSLPKTLVSNSTDKQKIKV------------GHQQH---LPV 731
            QRLEELAIKQS+QLIPVTPS+PK+ V NS+DK K K               QQ    L  
Sbjct: 292  QRLEELAIKQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRSGEMNMAAKSMQQQSSALHP 351

Query: 730  TNSSRGAAVKSDVSKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXX 551
            TN S G  VK+D  K S  GKL VLKP  E NG+SP+ K+  SPT               
Sbjct: 352  TNQSLGIHVKTDAPKTSH-GKLFVLKPGWE-NGVSPSPKDIASPTNNVSRAANSQLAAPA 409

Query: 550  XXXXXXARPLPNNLVLPTVEGKPVLTA-------LEKRPTSQAQSRNDFFNLVRKKSMGN 392
                   R  PNN  L +   +    +       +EKRP SQ QSRNDFFNL++KK+  +
Sbjct: 410  SVTSVPLRS-PNNAKLSSSGERKSANSNMISAFNVEKRPLSQTQSRNDFFNLLKKKTSNS 468

Query: 391  SSSVSDPGMPVSLSVSDKLGET--QVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIG 218
            S ++ D    VS   S+K  E   +V SAPT+ QA    + ++S+GG   E         
Sbjct: 469  SPALPDSSSVVSSPTSEKSCEVNKEVVSAPTSPQAIKDGAELTSNGGTHEE--------- 519

Query: 217  DAIERQKCLSNGKKHPISDHLFSEEEEAAFLRSMGWEENADEG-GLTEEEISDFYRDVHK 41
              ++R                FSEEE AAFLRS+GWEEN+ E  GLTEEEI+ FY++   
Sbjct: 520  --VQR----------------FSEEE-AAFLRSLGWEENSGEDEGLTEEEINAFYQE--- 557

Query: 40   YINSKPALNISPG 2
            Y+  KP+L +  G
Sbjct: 558  YMKKKPSLKVCRG 570


>ref|XP_012078151.1| PREDICTED: mediator of RNA polymerase II transcription subunit 1
            isoform X1 [Jatropha curcas] gi|643723136|gb|KDP32741.1|
            hypothetical protein JCGZ_12033 [Jatropha curcas]
          Length = 603

 Score =  361 bits (926), Expect = 2e-96
 Identities = 261/617 (42%), Positives = 330/617 (53%), Gaps = 40/617 (6%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLH--------SDDNAGLKLARNRSSVNSDGHDLGR 1577
            MERSEPTLVPEWL++SG+++GG  ++H        SD ++     RNR+S      D  R
Sbjct: 1    MERSEPTLVPEWLRSSGSVSGGGSSVHHFASSSSLSDVSSSAHHTRNRNSKGLTDFDSPR 60

Query: 1576 SLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQ 1397
            S                     H  +Y+SFSR+HRD+D E      RDKE+ +F D   +
Sbjct: 61   SAFLDRTSSSNSRRSSINGSAKH--AYSSFSRSHRDKDRE------RDKERLNFVDHWDR 112

Query: 1396 DFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXG---LVNKG 1226
            D   PLG ILSS  E+D LRRS SM+S K+GE  P++  +D              L++ G
Sbjct: 113  DGPDPLGSILSSRSEKDTLRRSHSMVSRKQGEVLPRRFAVDLKNGSSGNHTNGNGLLSGG 172

Query: 1225 SPIGSVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSA 1046
                ++ KA FEKDFPSLG EER   P+IGRV SPSLST  QNLPVG+SA+I GE WTSA
Sbjct: 173  IVGSNIQKAVFEKDFPSLGCEERQGVPEIGRVSSPSLSTAVQNLPVGSSALIGGEGWTSA 232

Query: 1045 LAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQ----L 878
            LAEVP L G++ TG            A+   S    LNMAEA+   P RT   PQ    L
Sbjct: 233  LAEVPALIGNSSTG-SLSSVQSVAASASACPSVMAGLNMAEALTQAPSRTRTAPQVTEQL 291

Query: 877  SAGTQRLEELAIKQSKQLIPVTPSLPKTLVSNSTDKQKIKV------------GHQQH-- 740
            S  TQRLEELAIKQS+QLIPVTPS+PK+ V NS+DK K K               QQ   
Sbjct: 292  SVQTQRLEELAIKQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRSGEMNMAAKSMQQQSS 351

Query: 739  -LPVTNSSRGAAVKSDVSKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXX 563
             L  TN S G  VK+D  K S  GKL VLKP  E NG+SP+ K+  SPT           
Sbjct: 352  ALHPTNQSLGIHVKTDAPKTSH-GKLFVLKPGWE-NGVSPSPKDIASPTNNVSRAANSQL 409

Query: 562  XXXXXXXXXXARPLPNNLVLPTVEGKPVLTA-------LEKRPTSQAQSRNDFFNLVRKK 404
                       R  PNN  L +   +    +       +EKRP SQ QSRNDFFNL++KK
Sbjct: 410  AAPASVTSVPLRS-PNNAKLSSSGERKSANSNMISAFNVEKRPLSQTQSRNDFFNLLKKK 468

Query: 403  SMGNSSSVSDPGMPVSLSVSDKLGET--QVSSAPTTAQARDAQSPVSSSGGHLSEEEDDL 230
            +  +S ++ D    VS   S+K  E   +V SAPT+ QA    + ++S+GG   E     
Sbjct: 469  TSNSSPALPDSSSVVSSPTSEKSCEVNKEVVSAPTSPQAIKDGAELTSNGGTHEE----- 523

Query: 229  TCIGDAIERQKCLSNGKKHPISDHLFSEEEEAAFLRSMGWEENADEG-GLTEEEISDFYR 53
                  ++R                FSEEE AAFLRS+GWEEN+ E  GLTEEEI+ FY+
Sbjct: 524  ------VQR----------------FSEEE-AAFLRSLGWEENSGEDEGLTEEEINAFYQ 560

Query: 52   DVHKYINSKPALNISPG 2
            +   Y+  KP+L +  G
Sbjct: 561  E---YMKKKPSLKVCRG 574


>ref|XP_010664264.1| PREDICTED: mediator of RNA polymerase II transcription subunit 1
            [Vitis vinifera]
          Length = 616

 Score =  360 bits (925), Expect = 3e-96
 Identities = 252/618 (40%), Positives = 337/618 (54%), Gaps = 41/618 (6%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLH------SDDNAGLKLARNRSSVNSDGHD----- 1586
            MERSEPTLVPEWL+++G++TGG  + H      S  +   +  RNRSS N+  ++     
Sbjct: 1    MERSEPTLVPEWLRSTGSVTGGGNSAHHFATSSSHTDISPRSTRNRSSKNTSDYESPRSA 60

Query: 1585 -LGRSLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGD 1409
             L R+                    ++ ++Y+SFSR+HRD+D +      R+K++    D
Sbjct: 61   FLDRTSSSNSRRNLVSNGFPKHDKESNARAYSSFSRSHRDKDRD------REKDRLVIED 114

Query: 1408 RRRQDFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXG---L 1238
            +     SHPL  IL +  E+D LRRS S++S K+ +  P+++  D              +
Sbjct: 115  QWDHGSSHPLANILINRVEKDVLRRSHSVVSRKQVDVLPRRVASDSRNGDSNKHNNVNGM 174

Query: 1237 VNKGSPIGSVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEK 1058
            V+  S IG ++KA F+KDFPSLG E     PDIGRVPSP LS   Q+LP+G S++I GE 
Sbjct: 175  VSGASIIGGIHKAVFDKDFPSLGTE-----PDIGRVPSPGLSMAVQSLPIGNSSLIGGEG 229

Query: 1057 WTSALAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQL 878
            WTSALAEVP++ GSN TG            A+   STT  LNMAEA+A  P R   TPQL
Sbjct: 230  WTSALAEVPMITGSNSTGSSSVQQTVVSAPASGLPSTTAGLNMAEALAQAPSRARTTPQL 289

Query: 877  SAGTQRLEELAIKQSKQLIPVTPSLPKTLVSNSTDKQK-------------IKVGHQQ-- 743
            S  TQRLEELAIKQS+QLIPVTPS+PK+ V NS+DK K              K G QQ  
Sbjct: 290  SVNTQRLEELAIKQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRTSDMIAASKTGQQQPS 349

Query: 742  --HLPVTNSSRGAAVKSDVSKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXX 569
              HL   N S    V+SD    +S GK  VLKP RE NG SP  ++  SPT  +      
Sbjct: 350  SSHL--ANHSLRGHVRSD-PPTTSHGKFLVLKPARE-NGASPTSRDVSSPTNNASSRVAS 405

Query: 568  XXXXXXXXXXXXARPLPNNLVLPTVEGKPVLTAL------EKRPT-SQAQSRNDFFNLVR 410
                            PN   L T+E K    +L      EKRP+ SQAQSR+DFFNL+R
Sbjct: 406  IQLGVAHSVASAPSISPNYPKLSTMERKAAALSLNSGPTAEKRPSFSQAQSRHDFFNLMR 465

Query: 409  KKSMGNSSSVSDPGMPVSLSVSDKLGETQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDL 230
            KK+  NSS+V     P   ++S    E++VSSAP  + A +    V+ +GG+  EE +  
Sbjct: 466  KKTSVNSSAVLPDSGP---AISSSNTESEVSSAPVKSHAIENGGQVTGNGGNTCEEVESP 522

Query: 229  TCIGDAIERQKCLSNGKKH-PISDHLFSEEEEAAFLRSMGWEENA-DEGGLTEEEISDFY 56
                           G+KH   +  +  +EEEAAFLRS+GWEE+A D+ GLTEEEI+ FY
Sbjct: 523  AV-------------GEKHLGTNASICPDEEEAAFLRSLGWEESAGDDEGLTEEEINAFY 569

Query: 55   RDVHKYINSKPALNISPG 2
            ++   Y+  KP+L +  G
Sbjct: 570  QE---YMKLKPSLKLQQG 584


>ref|XP_012065652.1| PREDICTED: uncharacterized protein LOC105628780 isoform X2 [Jatropha
            curcas]
          Length = 607

 Score =  357 bits (915), Expect = 4e-95
 Identities = 246/592 (41%), Positives = 320/592 (54%), Gaps = 18/592 (3%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLHSDDNAGL--------KLARNRSSVNSDGHDLGR 1577
            M+RSEP LVPEWLK+ GN+  G    H   +A L        K ++N+SS++   HD  R
Sbjct: 1    MDRSEPALVPEWLKSGGNVPNGGNPSHFSASASLPFDYHPVSKHSQNKSSLSGIDHDTRR 60

Query: 1576 SLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQ 1397
                                  HL+S +S  R+HRDRDWE D     DKEK    D R  
Sbjct: 61   LSILERTTSAYFRQGSSSNGSVHLRSTSSLGRSHRDRDWE-DVSGYCDKEKLVSDDNRHH 119

Query: 1396 DFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGS-- 1223
            +   P G I  S  ++D LR SQS+I+GK+ + W KK+  D               G   
Sbjct: 120  EHLDPSGNIFPSKLDKDKLRLSQSIITGKQDDTWSKKVAGDLINPQKNKHSNSNGSGILA 179

Query: 1222 --PIGSVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTS 1049
               +G+VN  AFE+DFPSLGAEER     IGRVPSP LST  Q    GTSA+   E W S
Sbjct: 180  RVGVGAVNDTAFEQDFPSLGAEERQVG--IGRVPSPGLSTAIQT---GTSAIGGSENWKS 234

Query: 1048 ALAEVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAG 869
            ALAEVPV+ G++  G+           ATV  + T  L MAEA+A GPPR    PQ +AG
Sbjct: 235  ALAEVPVVMGNSNLGLVSAQQAVPATTATVVPNVTMGLKMAEALAQGPPRARTPPQSTAG 294

Query: 868  TQRLEELAIKQSKQLIPVTPSLPKTLVSNSTDKQKIKVGHQQHLPVTNSSRGAAVKSDVS 689
             QR EELAI+QSK LIP+TPS PKTLV + ++K K K+G  Q     N SRGAA +SD +
Sbjct: 295  IQRSEELAIRQSK-LIPMTPSTPKTLVVSPSEKTKSKIGSVQ---FGNHSRGAA-RSDAA 349

Query: 688  KISSVGKLQVLKPVRERNGISPAVKENLSPTG--GSKXXXXXXXXXXXXXXXXXARPLPN 515
            K+S+  +LQVLKP RE NGIS AVK+  +P G  G                   +   PN
Sbjct: 350  KVSNESRLQVLKPSRELNGISSAVKDISNPNGSKGQNNSLGIAPLAIGSVPLRSSGNSPN 409

Query: 514  NLVLPTVEGKPVLTALEKRPTSQAQSRNDFFNLVRKKSMGNSSSVSDPGMPV-SLSVSDK 338
            +              +EKRPT Q QSRNDFFN ++KKS  +S+SV+    P+ S S+S+ 
Sbjct: 410  HASAECHSFAFRRPTMEKRPTLQVQSRNDFFNHLKKKSSIHSTSVASESSPILSSSISEM 469

Query: 337  LGET-QVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAIERQKCLSNGKKHPISD 161
             GE+ +V +AP + Q  D+ S V+S      ++   +   GD          G+K   SD
Sbjct: 470  SGESAKVVTAPVSDQGGDSSSSVASLS---CDDSGKMVYNGDTCSGPLQFDKGEKDSCSD 526

Query: 160  HLFS-EEEEAAFLRSMGWEENADEG-GLTEEEISDFYRDVHKYINSKPALNI 11
             + + +EEEAAFLRS+GW+ENA E  GLTEEEI  FY +   Y   +P+L +
Sbjct: 527  VIPNPDEEEAAFLRSLGWDENAGEDEGLTEEEIRAFYEE---YTKLRPSLKL 575


>ref|XP_008338816.1| PREDICTED: uncharacterized serine-rich protein C215.13 [Malus
            domestica]
          Length = 610

 Score =  353 bits (906), Expect = 4e-94
 Identities = 252/605 (41%), Positives = 328/605 (54%), Gaps = 33/605 (5%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGG--------SPTLHSDDNAGLKLARNRSSVNSDGHDLGR 1577
            MERSEPTLVPEWL+++G++TGG        S + HSD ++     RNR+S +    D  R
Sbjct: 1    MERSEPTLVPEWLRSTGSVTGGGSSAHHFASSSSHSDVSSLANHLRNRTSKSITDFDTPR 60

Query: 1576 SLGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQ 1397
            S                     H  +Y+SF+R+HRD+D EK+      KE+S+FGD   +
Sbjct: 61   SAFLDRSSSSNSRRSSGNGSAKH--AYSSFNRSHRDKDREKE------KERSNFGDHWDR 112

Query: 1396 DFSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGSPI 1217
            D S PLG I +S  E+D LRRSQSM+S K+ E  P++  ID                S +
Sbjct: 113  DSSDPLGNIFTSRVEKDTLRRSQSMVSRKQTEXLPRRAAIDSKSSGNSNHHNGNGLLSGV 172

Query: 1216 G-SVNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMISGEKWTSALA 1040
            G  + K  F+KDFPSLG EER AAPDIGRVPSP   T  Q+LPVG+SA+I GE WTSALA
Sbjct: 173  GVGIQKVVFDKDFPSLGTEERPAAPDIGRVPSPGFXTAVQSLPVGSSALIGGEGWTSALA 232

Query: 1039 EVP-VLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQ 863
            EVP  + GS+ +G             + A +  + LNMAEA++  P +    PQLS  TQ
Sbjct: 233  EVPSTIIGSSSSGSFPVQPTVAATSXSGASTAMSGLNMAEALSQXPAKARTVPQLSIKTQ 292

Query: 862  RLEELAIKQSKQLIPVTPSLPKTLVSNSTDKQK-------------IKVGHQQHLPVTNS 722
            RLEELAIKQS+QLIPVTPS PK  V +S+DK K             +KVG QQ   + N 
Sbjct: 293  RLEELAIKQSRQLIPVTPSXPKPSVLSSSDKSKPKAAARPGETNAPVKVGQQQPSQLHNQ 352

Query: 721  S-RGAAVKSDVSKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXX 545
            S RG +VKSD  K S   K  VLKPV E NG+S + K+  SPT  +              
Sbjct: 353  SLRGGSVKSDAPKTS---KFLVLKPVWE-NGVSSSPKDVTSPTSNASRAANSPLAVAPPV 408

Query: 544  XXXXARPLPNNLVLPTVEGKPVL------TALEKRPT-SQAQSRNDFFNLVRKKSMGNSS 386
                 R  PN+  L +VE K         + LEKRP+ SQ QSRNDFF  ++ K++ NS+
Sbjct: 409  ASAPLRS-PNHQKLSSVERKVAALDLKSGSTLEKRPSLSQVQSRNDFFKRLKNKTLMNST 467

Query: 385  -SVSDPGMPVSLSVSDKLGETQVSSAPTTAQARDAQSPVSSSGGHLSEEEDDLTCIGDAI 209
             ++ D    +S    +K GE       T     D  SP +   G L    DD      + 
Sbjct: 468  ITLPDSAPIISSPTMEKSGEI------TRELFSDPASPHTIENGALVTGNDD-----SSE 516

Query: 208  ERQKCLSNGKKHPISDHLFSEEEEAAFLRSMGWEENA-DEGGLTEEEISDFYRDVHKYIN 32
            + QK    G     S  ++ +EEEA FLRS+GWEEN+ D+GGLTEEEI+ FY    +Y+ 
Sbjct: 517  DVQKFSDTGP----SAAVYPDEEEARFLRSLGWEENSGDDGGLTEEEINAFY---DQYMK 569

Query: 31   SKPAL 17
            S P+L
Sbjct: 570  SXPSL 574


>ref|XP_011655200.1| PREDICTED: mediator of RNA polymerase II transcription subunit 1
            isoform X2 [Cucumis sativus]
          Length = 612

 Score =  352 bits (902), Expect = 1e-93
 Identities = 253/606 (41%), Positives = 324/606 (53%), Gaps = 32/606 (5%)
 Frame = -2

Query: 1732 MERSEPTLVPEWLKNSGNLTGGSPTLHS-------DDNAGLKLARNRSSVNSDGHDLGRS 1574
            MERSEPTLVPEWL+++G++ GG    H         D   L  +RNR S  +   D  RS
Sbjct: 1    MERSEPTLVPEWLRSTGSVAGGGNPNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDSSRS 60

Query: 1573 LGXXXXXXXXXXXXXXXXXXTHLQSYNSFSRNHRDRDWEKDTYDSRDKEKSDFGDRRRQD 1394
                                 H  +Y+SF+R HRD+D EK+      K++ +FGD   +D
Sbjct: 61   SFLDRTSSSNSRRSSSNGSSKH--AYSSFNRGHRDKDREKE------KDRLNFGDNWDRD 112

Query: 1393 FSHPLGKILSSSFERDGLRRSQSMISGKRGENWPKKILIDXXXXXXXXXXGLVNKGSPIG 1214
               PLGKILS+  ++D LRRS SM+S K+GE + +++  +            +  G+ +G
Sbjct: 113  AHDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRVGTELKSHNSSNG---ILSGTSVG 169

Query: 1213 S-VNKAAFEKDFPSLGAEERSAAPDIGRVPSPSLSTVTQNLPVGTSAMI-SGEKWTSALA 1040
            S + KA FEKDFPSLG+EE+  A +IGRV SP LS+  Q+LP+G SA+I  GE WTSALA
Sbjct: 170  SSIQKAVFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALA 229

Query: 1039 EVPVLAGSNGTGVXXXXXXXXXXXATVALSTTTVLNMAEAVAHGPPRTPATPQLSAGTQR 860
            EVP + GS  TG                LS T  LNMAEA+   P R  A PQLS  TQR
Sbjct: 230  EVPSMIGST-TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARAAPQLSVKTQR 288

Query: 859  LEELAIKQSKQLIPVTPSLPKTLVSNSTDKQK-------------IKVGHQQHLPV-TNS 722
            LEELAIKQS+QLIPVTPS+PK +V +S+DK K             IK G  Q L V  N 
Sbjct: 289  LEELAIKQSRQLIPVTPSMPKAMVLSSSDKSKPKLASRTGELNATIKGGQPQPLLVHANQ 348

Query: 721  SRGAAVKSDVSKISSVGKLQVLKPVRERNGISPAVKENLSPTGGSKXXXXXXXXXXXXXX 542
            SR   VK D  K SS GK  VLKPVRE NG+S A K+  SPT  +               
Sbjct: 349  SRVGHVKPDAQK-SSHGKFLVLKPVRE-NGVSLAAKDVSSPTSNANSMAANSQFALAPSV 406

Query: 541  XXXARPLPNNLVLPTVEGK------PVLTALEKRPT-SQAQSRNDFFNLVRKKSMGNSSS 383
                   PNN+ + ++E K         T LEKRP+ SQ QSRNDFF L++KK+  NSS+
Sbjct: 407  PHAPLRSPNNINVSSMERKIASLDLKTGTTLEKRPSLSQVQSRNDFFKLIKKKTSMNSSA 466

Query: 382  VSDPGMPVSLSVSDKLGETQVSSAPTTAQARDAQS-PVSSSGGHLSEEEDDLTCIGDAIE 206
            V         S S        S    TA  R  ++  V +  G+ SEE       G+  E
Sbjct: 467  VLSDSCSSVKSPSIGQSNELTSEEMGTASPRVIENGAVENRNGNSSEEVQVSRDSGEKTE 526

Query: 205  RQKCLSNGKKHPISDHLFSEEEEAAFLRSMGWEENADEG-GLTEEEISDFYRDVHKYINS 29
                      H  ++ L  +EEEAAFLRS+GW+E+  E  GLTEEEI+ FYR+   Y+N 
Sbjct: 527  ---------SHVAAESL--DEEEAAFLRSLGWDESCGEDEGLTEEEINSFYRE---YVNL 572

Query: 28   KPALNI 11
            KP+L I
Sbjct: 573  KPSLKI 578

