BLASTX nr result

ID: Mentha29_contig00029150 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00029150
         (1403 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40658.1| hypothetical protein MIMGU_mgv1a008176mg [Mimulus...   451   e-124
ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hy...   389   e-105
ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   386   e-104
ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific 5-hy...   384   e-104
ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago ...   381   e-103
ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog i...   381   e-103
ref|XP_007049611.1| Uncharacterized protein TCM_002685 [Theobrom...   380   e-103
ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Popu...   377   e-102
ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   377   e-102
ref|XP_007140735.1| hypothetical protein PHAVU_008G137400g [Phas...   374   e-101
ref|XP_007199067.1| hypothetical protein PRUPE_ppa018685mg [Prun...   370   e-100
ref|XP_002527247.1| conserved hypothetical protein [Ricinus comm...   364   4e-98
emb|CAN82703.1| hypothetical protein VITISV_026469 [Vitis vinifera]   358   4e-96
ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana] ...   338   4e-90
ref|XP_002880802.1| hypothetical protein ARALYDRAFT_481505 [Arab...   336   2e-89
gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]     334   5e-89
ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 prot...   334   5e-89
gb|EPS73813.1| hypothetical protein M569_00942, partial [Genlise...   332   2e-88
ref|XP_004290142.1| PREDICTED: UPF0361 protein C3orf37 homolog i...   330   1e-87
ref|XP_006403078.1| hypothetical protein EUTSA_v10003450mg [Eutr...   329   2e-87

>gb|EYU40658.1| hypothetical protein MIMGU_mgv1a008176mg [Mimulus guttatus]
          Length = 382

 Score =  451 bits (1161), Expect = e-124
 Identities = 238/406 (58%), Positives = 280/406 (68%), Gaps = 20/406 (4%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DDF RACHL+ RPVRH NMDRYRPS+NVAPGFNVP              
Sbjct: 1    MCGRARCTLRSDDFRRACHLDGRPVRHQNMDRYRPSHNVAPGFNVPVVRRDDEGDGGGAV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGL+PSFTKKT+KIDHFRMFNARSESIREKASFRRLLPKNRCLVS EGFYEWKK
Sbjct: 61   L-HCMKWGLIPSFTKKTEKIDHFRMFNARSESIREKASFRRLLPKNRCLVSVEGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGS+KQPYYIH KDGRPLVFAAL+DSW+N++GEILYTF           LEWLHDRMP I
Sbjct: 120  DGSRKQPYYIHFKDGRPLVFAALFDSWENAEGEILYTF-TICTTSSSSSLEWLHDRMPVI 178

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            L +KEST+CWLNDSSLSN DKILKPYE+ DLAWYPVT AMGK+SFDGP+CIKE++    E
Sbjct: 179  LRNKESTDCWLNDSSLSNFDKILKPYEDEDLAWYPVTSAMGKLSFDGPECIKEVKT---E 235

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNRLISKTTTMKDEAS 952
            ++ TISQFFSKK A  S++   +K+ +KE  E   A ++++E      +  T        
Sbjct: 236  ESKTISQFFSKKVANASQKPNLEKSPVKELAEASEAISVKEEHESQPTLDSTRL------ 289

Query: 953  VMEEPKKDEMVESTEQKSVKEEPHSQEES-----LKQIDESDTKNAD------------- 1078
                  KDE +E+ EQKSV+EEP   ++      +K+ D  +T N               
Sbjct: 290  ------KDEDIENYEQKSVQEEPEISQDDCPKLIIKKDDAENTSNISSIEKQYTGEMLRA 343

Query: 1079 HVKPLAK--EKLHISPVKKRRKGAIDKPRKGADDKQPTLFSYFGKN 1210
            H KP AK  EK ++ P +KR K A DK       +QPTLFSYFG++
Sbjct: 344  HAKPFAKENEKQNVGPARKRSKTANDK-------QQPTLFSYFGRS 382


>ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like [Glycine
            max]
          Length = 382

 Score =  389 bits (998), Expect = e-105
 Identities = 216/398 (54%), Positives = 259/398 (65%), Gaps = 12/398 (3%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD  RACH ++ P R L++DRYRP+YNV+PGF+VP              
Sbjct: 1    MCGRARCTLRADDVPRACHRSTSPTRTLHIDRYRPAYNVSPGFDVPVVRRDDASGGEGYV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
               CMKWGL+PSFTKKT+K DH+RMFNARSESI EKASFRRLLPK+RCLV+ EGFYEWKK
Sbjct: 61   LQ-CMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGSKKQPYYIH KDGRPLVFAALYDSW+NS+GE LYTF           L+WLHDRMP I
Sbjct: 120  DGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTF-TIVTTSSSSALQWLHDRMPVI 178

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LGSKEST+ WL+ SS S+   ++KPYEE+DL WYPVT AMGK SFDGP+CIKEIQVK  +
Sbjct: 179  LGSKESTDIWLS-SSASSFKSVMKPYEESDLVWYPVTSAMGKASFDGPECIKEIQVK-AQ 236

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIA-PNIEKEEPQNRLISKTTTMKDEA 949
              T+IS FFSKK        E+K T    +PE+  + P + K E    L     T  ++ 
Sbjct: 237  GNTSISMFFSKK------GDESKDT----KPEQKASCPEVVKTEHTEDLTESKDTKPEQK 286

Query: 950  SVMEEPKKDEMVESTEQKSVKEE----------PHSQEESLKQID-ESDTKNADHVKPLA 1096
            +   E  K E  E   +++  EE           HSQ  S+  I  E +T +A   KP  
Sbjct: 287  TSSHEFVKTEPTEDLRERAKTEEGGNDLKFHGSSHSQNVSMLPIKREYETFSAADSKPAL 346

Query: 1097 KEKLHISPVKKRRKGAIDKPRKGADDKQPTLFSYFGKN 1210
                 ISP   ++K    +  K A+DKQPTLFSYFGK+
Sbjct: 347  ANHDQISPNPAKKK----EKAKTANDKQPTLFSYFGKS 380


>ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [Vitis vinifera]
            gi|296090568|emb|CBI40918.3| unnamed protein product
            [Vitis vinifera]
          Length = 392

 Score =  386 bits (992), Expect = e-104
 Identities = 209/403 (51%), Positives = 266/403 (66%), Gaps = 17/403 (4%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLRPD+ +RAC+LN+ P +++ MDRYRPSYNV+PG N+P              
Sbjct: 1    MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEEAI 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGLVPSFTKK++K DH++MFNARSES+ EKASFRRL+PKNRCLV+ EGFYEWKK
Sbjct: 61   V-HCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVAVEGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGSKKQPYYIHLKDGRPLVFAAL+DSW NS+GEILYT            L+WLHDRMP I
Sbjct: 120  DGSKKQPYYIHLKDGRPLVFAALFDSWANSEGEILYT-CTILTTSSSSALQWLHDRMPVI 178

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG KEST+ WLN SS S  + +LKPYE+ DL WYPVT AMGK SF+GP+CIKEIQ+K E+
Sbjct: 179  LGDKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIKEIQLKNEQ 238

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEP--QNRLISKTTTMK-- 940
            +   IS+FFS K   K+E+       +  EP +   P   KEEP  +N     ++T+K  
Sbjct: 239  R--PISKFFSTK-GIKNEQG------LSNEPVKSNLPQSLKEEPAIENSTGLPSSTVKGD 289

Query: 941  -DEASVMEEPKKDEMVESTEQKSVKEEPHSQEES-LKQIDESDTKNADHVK--PLAKEKL 1108
             D       P+++    +   KS+K+EP +++++ L    + D+K  +     P+ ++  
Sbjct: 290  HDSTCSRSIPQEESTWFTNLPKSLKQEPETEDKTGLPFPGDHDSKCDEEATKLPIKRDFE 349

Query: 1109 HISPVKKRRKGAIDKP---------RKGADDKQPTLFSYFGKN 1210
              S   K     ++KP          K A DKQPTLFSYFGK+
Sbjct: 350  EFSADSKPNTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGKS 392


>ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like isoform X1
            [Citrus sinensis]
          Length = 398

 Score =  384 bits (986), Expect = e-104
 Identities = 215/410 (52%), Positives = 265/410 (64%), Gaps = 25/410 (6%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD  RACH    P R LNMDRYRPSYNVAPG+N+P              
Sbjct: 1    MCGRARCTLRADDLPRACHRTGSPARTLNMDRYRPSYNVAPGWNLPVVRRDDDGEGFVL- 59

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGL+PSFTKK +K D ++MFNARSES+ EKASFRRLLPK+RCL + EGFYEWKK
Sbjct: 60   --HCMKWGLIPSFTKKNEKPDFYKMFNARSESVTEKASFRRLLPKSRCLAAVEGFYEWKK 117

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGSKKQPYY+H KDGRPLVFAALYD+W++S+GEILYTF           L+WLHDRMP I
Sbjct: 118  DGSKKQPYYVHFKDGRPLVFAALYDTWQSSEGEILYTF-TILTTSSSAALQWLHDRMPVI 176

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG KES++ WLN SS S  D ILKPYEE+DL WYPVTP MGK+SF+GP+CIKEI +K E 
Sbjct: 177  LGDKESSDAWLNGSSSSKYDTILKPYEESDLVWYPVTPVMGKLSFNGPECIKEIPLKTEG 236

Query: 773  KTTTISQFFSKKQACKSEEQE-NKKTLIKEEPEEYIAPNIE-------KEEPQNRLISKT 928
            K   IS FF KK+  K +E + ++K+   E  +  +   ++       KEEP + L  K 
Sbjct: 237  K-NPISNFFLKKEIKKEQESKMDEKSSFDESVKTNLPKRMKGEPIKEIKEEPVSGLEEKY 295

Query: 929  T-----------TMKDEASVMEEPKKDEMVE--STEQKSVKEEPHSQEESLKQIDESDTK 1069
            +           ++KDEA   ++ +    VE    + KSV     S E++ K++ + D K
Sbjct: 296  SFDTTAQTNLPKSVKDEAVTADDIRTQSSVEKGDPDTKSV-ASVLSDEDTKKELQKRDYK 354

Query: 1070 N--ADHVKPL--AKEKLHISPVKKRRKGAIDKPRKGADDKQPTLFSYFGK 1207
               AD  KP+     KL  SP+K  RKG +    K A +KQPTLFSY+ K
Sbjct: 355  EFLADS-KPVIDGNNKLETSPLK--RKGNV----KDAGEKQPTLFSYYSK 397


>ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago truncatula]
            gi|355497798|gb|AES79001.1| hypothetical protein
            MTR_7g052250 [Medicago truncatula]
          Length = 354

 Score =  381 bits (979), Expect = e-103
 Identities = 211/391 (53%), Positives = 251/391 (64%), Gaps = 6/391 (1%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RC+LR DD  RACH  + P R L++DRYRPS NV+PGFN+P              
Sbjct: 1    MCGRTRCSLRADDVPRACHRTTAPSRLLHIDRYRPSNNVSPGFNIPVVRREDNASAESDG 60

Query: 233  XS-HCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWK 409
               HCMKWGL+PSFTKKTDK DH++MFNARSESI EKASFRRLLPKNRCLV+ EGFYEWK
Sbjct: 61   HVVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAVEGFYEWK 120

Query: 410  KDGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPA 589
            KDGSKKQPYYIH KDGRPLVFAALYDSW+NS+GEILYTF            +WLHDRMP 
Sbjct: 121  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTF-TIVTTSSSSAFKWLHDRMPV 179

Query: 590  ILGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKME 769
            ILG K++T+ WL  SS S+   ++KPYEE+DL WYPVTPAMGK SFDGP+CIKEIQ+K  
Sbjct: 180  ILGDKDTTDTWL--SSASSFKSVMKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQIK-T 236

Query: 770  EKTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNRLIS----KTTTM 937
            E    IS+FFSKK+A                        +E  +P+++++S    KT   
Sbjct: 237  EGYIPISKFFSKKEA-----------------------EVEDTKPEHKILSHEPVKTEQT 273

Query: 938  KDEASVMEEPKKDEMVESTEQKSVKEEPHSQEESLKQIDESDTKNADHVKPLA-KEKLHI 1114
            KD   V EE K +E    T+ KS    P           E D  ++D    LA  +++  
Sbjct: 274  KD---VSEEAKTEE--GDTDLKSSGISPSQNVNRFAIKREYDAISSDSKPSLANNDQVSA 328

Query: 1115 SPVKKRRKGAIDKPRKGADDKQPTLFSYFGK 1207
            +P KK+ K       K ADDKQPTLFSYFGK
Sbjct: 329  NPAKKKEKA------KTADDKQPTLFSYFGK 353


>ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 1 [Fragaria vesca
            subsp. vesca]
          Length = 366

 Score =  381 bits (978), Expect = e-103
 Identities = 207/391 (52%), Positives = 251/391 (64%), Gaps = 5/391 (1%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD SRAC+ N  PVR +NMDRY+P YNV+PG N+P              
Sbjct: 1    MCGRARCTLRADDISRACYRNHGPVRSVNMDRYQPRYNVSPGANLPVVRRGDGADGEDGV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGL+PSFTKKT+K DH+RMFNARSESI EKASFRRL+PK+RC+V+ EGFYEWKK
Sbjct: 61   VLHCMKWGLIPSFTKKTEKPDHYRMFNARSESICEKASFRRLVPKSRCVVAVEGFYEWKK 120

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGSKKQPYY+H KDGRPL+FAALYDSW+NS+GE LYTF           L WLHDRMP +
Sbjct: 121  DGSKKQPYYVHFKDGRPLLFAALYDSWENSEGEKLYTF-TIITTSSSSALGWLHDRMPVV 179

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG KES + WL+ SS SN DK+LKPYE  DL WYPVTPAMGK+SFDGP+C  EI++K  +
Sbjct: 180  LGDKESVDTWLDGSSASNFDKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSNEIKLK-TD 238

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNR--LISKTTTMKDE 946
             T +I++FFS K   K EE   K T + +   +   P    EEP+ +   +  ++T+K  
Sbjct: 239  GTNSITKFFSTK-GTKKEEINPKDTSLHDSSVKTEFPESLNEEPETKEEKVQPSSTVK-- 295

Query: 947  ASVMEEPKKDEMVESTEQKSVKEEPHSQEESLKQIDESDTKNADHVKPLAKE---KLHIS 1117
                E+ K    + S E  S ++     EE L          AD  KPL  E   K   S
Sbjct: 296  ---CEDSKSSVSILSQEDASKEQTKRDYEEFL----------ADS-KPLPNESDKKSSAS 341

Query: 1118 PVKKRRKGAIDKPRKGADDKQPTLFSYFGKN 1210
            P KK+         K + DKQPTLFSYF K+
Sbjct: 342  PAKKKVN------LKTSHDKQPTLFSYFRKS 366


>ref|XP_007049611.1| Uncharacterized protein TCM_002685 [Theobroma cacao]
            gi|508701872|gb|EOX93768.1| Uncharacterized protein
            TCM_002685 [Theobroma cacao]
          Length = 360

 Score =  380 bits (977), Expect = e-103
 Identities = 201/387 (51%), Positives = 251/387 (64%), Gaps = 2/387 (0%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD  RA H N  PVRH++MDRYRPSYNV PG N+P              
Sbjct: 1    MCGRARCTLRADDIPRASHRNDGPVRHVHMDRYRPSYNVGPGMNLPVVRRDDGSNGDGGV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGL+PSFTKKTDK D ++MFNARSES+ EKASFRRLLPK+RCLV+ EGFYEWKK
Sbjct: 61   VLHCMKWGLIPSFTKKTDKPDFYKMFNARSESVCEKASFRRLLPKSRCLVAVEGFYEWKK 120

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGSKKQPYYIH KDGRPLVFAALYD W+NS+GE LYTF           L WLHDRMP I
Sbjct: 121  DGSKKQPYYIHFKDGRPLVFAALYDCWENSEGEKLYTFTILTTASSSAFL-WLHDRMPVI 179

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG KEST+ WLN    + +D +LKPYE  DL WYPVT A+GK+SF+GP+C+KE+ +K +E
Sbjct: 180  LGDKESTDTWLNG---TKIDTLLKPYENPDLVWYPVTSAIGKLSFEGPECVKEVPLKTQE 236

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKE--EPQNRLISKTTTMKDE 946
            K   IS+FFS ++  + +E   +K+L  E  +  +  N+++E   P+++ I    + +D 
Sbjct: 237  K-NPISKFFSTREVKREQESNMEKSLCDESVQTNLLKNLKEEPNSPEDKEIPSLASKEDN 295

Query: 947  ASVMEEPKKDEMVESTEQKSVKEEPHSQEESLKQIDESDTKNADHVKPLAKEKLHISPVK 1126
             S     K   +V + E     +     EE       +DTK        AK+++ +SP  
Sbjct: 296  DS-----KSSVLVPTCEDVRKCQTKRDYEEF-----SADTKP-------AKDEIEVSPA- 337

Query: 1127 KRRKGAIDKPRKGADDKQPTLFSYFGK 1207
             R+KG I    KG   KQPTLF+YFGK
Sbjct: 338  -RKKGNI----KGVAGKQPTLFAYFGK 359


>ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa]
            gi|222844806|gb|EEE82353.1| hypothetical protein
            POPTR_0002s25190g [Populus trichocarpa]
          Length = 367

 Score =  377 bits (969), Expect = e-102
 Identities = 206/393 (52%), Positives = 248/393 (63%), Gaps = 8/393 (2%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD  RACH N+  VR +NMDRYRPSYN +PG N+               
Sbjct: 1    MCGRARCTLRADDIPRACHRNTATVRSVNMDRYRPSYNASPGSNLAVVRRDDAASGDGAS 60

Query: 233  XS-----HCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGF 397
                   HCMKWGL+P FTKK++K D ++MFNARSES+ EKASFRRL+PK+RCLV+ EGF
Sbjct: 61   GGDGYAIHCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCLVAVEGF 120

Query: 398  YEWKKDGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHD 577
            YEWKKDGSKKQPYYIH KDGRPLVFAALYDSW+NS+GEILYTF           ++WLH+
Sbjct: 121  YEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTF-TIVTTAASSAIQWLHE 179

Query: 578  RMPAILGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQ 757
            RMP ILG KE+T+ WL+ SS S  D +LKPYE +DL WYPVTPAMGK SFDGP+CIKEI 
Sbjct: 180  RMPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPECIKEIH 239

Query: 758  VKMEEKTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNRLISKTTTM 937
            +KMEEK  TIS+FFS+K+    +E+ N        PEE       K EP+          
Sbjct: 240  LKMEEK-GTISKFFSRKE---FKEESN--------PEESTHGKSLKLEPK---------- 277

Query: 938  KDEASVMEEPKKDEMVES-TEQKSVKEEPHSQEESLKQIDESDTKNADHVKPLAKEKLHI 1114
                SV EE + +E +E+    K+V  +  S+ E+     E+  K     + L   KL  
Sbjct: 278  ----SVKEENESEEKLETPCSAKTVDYDLKSELETFSHEGETKCKTKRDREELVDSKLKT 333

Query: 1115 SPVKKRRKGAIDKPR--KGADDKQPTLFSYFGK 1207
              + K R     K    K  DDKQPTL SYFGK
Sbjct: 334  DEIVKPRASPAKKKANLKSVDDKQPTLLSYFGK 366


>ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cicer arietinum]
          Length = 375

 Score =  377 bits (967), Expect = e-102
 Identities = 206/389 (52%), Positives = 248/389 (63%), Gaps = 1/389 (0%)
 Frame = +2

Query: 44   EQSMCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXX 223
            E  MCGRGRCTLRPDD   ACH  + P R L++DRYRPS+NV+PGF++P           
Sbjct: 18   EDEMCGRGRCTLRPDDIPTACHRTTAPTRLLHVDRYRPSHNVSPGFHMPVVRREDASESE 77

Query: 224  XXXXSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYE 403
                 HCMKWGL+PSFTKKT+K DH+RMFNARSESI EKASFRRLLPKNRCLV+ EGFYE
Sbjct: 78   GHVL-HCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKNRCLVAVEGFYE 136

Query: 404  WKKDGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRM 583
            WKKDGSKKQPYYIH KDGRPLVFAALYDSW+NS+GE LYTF           L+WLHDRM
Sbjct: 137  WKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTF-TIVTTSSSSTLQWLHDRM 195

Query: 584  PAILGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVK 763
            P IL  K+ST+ WLN  S S+   +LKPYEE DLAWYPVTPAMGK SFDGP+CIKEIQVK
Sbjct: 196  PVILSDKDSTDTWLN--SASSFKSVLKPYEECDLAWYPVTPAMGKPSFDGPECIKEIQVK 253

Query: 764  MEEKTTTISQFFSKKQACKSEEQENKKTL-IKEEPEEYIAPNIEKEEPQNRLISKTTTMK 940
              E    IS+FFS+K     + +   K L +  EP                 +    T K
Sbjct: 254  -AEGNIPISKFFSRKGGEGEDTKSGHKILSLCHEP-----------------VKTEQTTK 295

Query: 941  DEASVMEEPKKDEMVESTEQKSVKEEPHSQEESLKQIDESDTKNADHVKPLAKEKLHISP 1120
            D    + E  K E  ES  + S     +  + ++K+  ++ + ++     +  + +   P
Sbjct: 296  D----LSEGAKTEEGESDLKSSGSSPQNVTKFTVKREYDAISSDSKPSLGINDQVIANPP 351

Query: 1121 VKKRRKGAIDKPRKGADDKQPTLFSYFGK 1207
             KK+ K       K ADDKQPTLFS+FGK
Sbjct: 352  TKKKEKA------KNADDKQPTLFSFFGK 374


>ref|XP_007140735.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris]
            gi|561013868|gb|ESW12729.1| hypothetical protein
            PHAVU_008G137400g [Phaseolus vulgaris]
          Length = 353

 Score =  374 bits (961), Expect = e-101
 Identities = 207/390 (53%), Positives = 248/390 (63%), Gaps = 5/390 (1%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD  RACH +  P R L+MDRYRP+YNV+PG N+P              
Sbjct: 1    MCGRTRCTLRSDDVPRACHRSDAPTRTLHMDRYRPAYNVSPGSNMPVVRREEASDSGGYV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              H MKWGL+PSFTKKT+K DH++MFNARSESI EKASFRRLLPK+RCLV+ EGFYEWKK
Sbjct: 61   L-HSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGSKKQPYYIH KDGR LVFAALYDSW+NS+GE L+TF           L+WLHDRMP I
Sbjct: 120  DGSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSSSSA-LQWLHDRMPVI 178

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LGSKEST+ WL+ SS S+   ++KPYEE+DL WYPVT AMGK SFDGP+CIKEIQVK E 
Sbjct: 179  LGSKESTDTWLS-SSASSFKSVMKPYEESDLVWYPVTSAMGKTSFDGPECIKEIQVKAEG 237

Query: 773  KTTTISQFFSKKQA----CKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNRLISKTTTMK 940
             T+ IS FFSKK A     K E++ +    +K EP E +    + EE  N L    ++  
Sbjct: 238  NTS-ISMFFSKKGAESKDTKPEQKLSSHEFVKTEPTEDLIEGAKAEEGDNDLKFSGSSHS 296

Query: 941  DEASVMEEPKKDEMVESTEQKSVKEEPHSQEESLKQIDESDTKNADHVKPLAK-EKLHIS 1117
              AS +   +                            E +T +AD    LA  +++  +
Sbjct: 297  KNASTLPIKR----------------------------EYETFSADSKPALANHDQISSN 328

Query: 1118 PVKKRRKGAIDKPRKGADDKQPTLFSYFGK 1207
            P KK+ K       K A+DKQPTLFSYFGK
Sbjct: 329  PAKKKEK------TKTANDKQPTLFSYFGK 352


>ref|XP_007199067.1| hypothetical protein PRUPE_ppa018685mg [Prunus persica]
            gi|462394467|gb|EMJ00266.1| hypothetical protein
            PRUPE_ppa018685mg [Prunus persica]
          Length = 363

 Score =  370 bits (951), Expect = e-100
 Identities = 200/389 (51%), Positives = 250/389 (64%), Gaps = 3/389 (0%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD  RACH +  PVR +NMDR+RP +N +PG N+P              
Sbjct: 1    MCGRARCTLRADDIPRACHRSHGPVRTVNMDRFRPLFNASPGSNLPVVRREDGGDGDGVV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGL+PSFTKKT+K DH++MFNARSESI EKASFRRL+PKNRCL++ EGFYEWKK
Sbjct: 61   V-HCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNRCLIAVEGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGSKKQPYY+H  DGRPL+FAALYD W+NS+GE LYTF           L WLHDRMP I
Sbjct: 120  DGSKKQPYYVHFNDGRPLLFAALYDFWENSEGEKLYTF-TIITTSSSSALGWLHDRMPVI 178

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG K ST+ WL+ SS SN D +LKPYE  DL WYPVT AMGK+SFDGP+CI EIQ+K  E
Sbjct: 179  LGDKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTQAMGKVSFDGPECINEIQLK-TE 237

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNRLISKTTTMKDEAS 952
               +I++FF  K   K EE   K T   +   +   P   KEEP+ +   KT        
Sbjct: 238  GNNSITKFFMSK-GTKKEELNPKDTSFYDSSVKNDLPKSVKEEPEGK--EKT-------- 286

Query: 953  VMEEPKKDEMVES-TEQKSVKEEPHSQEESLKQIDESDTKNADHVKPLAKE--KLHISPV 1123
              E+P   E  E+ ++ +++ +E  S+ ++ +  +E    +    KP+A E  ++  SP 
Sbjct: 287  --EQPASTEKCENDSKGQTISQEGVSKGQTKRDYEEFSADS----KPVAYETSEMSASPA 340

Query: 1124 KKRRKGAIDKPRKGADDKQPTLFSYFGKN 1210
            KK+         K + DKQPTLFSYFGK+
Sbjct: 341  KKKVN------PKSSVDKQPTLFSYFGKS 363


>ref|XP_002527247.1| conserved hypothetical protein [Ricinus communis]
            gi|223533340|gb|EEF35091.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 409

 Score =  364 bits (935), Expect = 4e-98
 Identities = 200/416 (48%), Positives = 252/416 (60%), Gaps = 31/416 (7%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD  RACH  + PVR +NMDR+RPSYNV+PG N+P              
Sbjct: 1    MCGRARCTLRADDIPRACHRTTGPVRSVNMDRWRPSYNVSPGSNMPVVCREGDGSDGGDG 60

Query: 233  XS-HCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWK 409
                CM WGL+PSFTKKT+K D ++MFNARSES+ EKASFRRLLPK+RCLV+ EGFYEWK
Sbjct: 61   FFVQCMTWGLIPSFTKKTEKPDFYKMFNARSESVGEKASFRRLLPKSRCLVAAEGFYEWK 120

Query: 410  KDGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPA 589
            KDGSKKQPYYIH KDGRPLVFAALYDSW+NS+GEILYTF           LEWLHDRMP 
Sbjct: 121  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTF-TILTTSSSSALEWLHDRMPV 179

Query: 590  ILGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKME 769
            ILG KEST+ WLN SS S  D +L+ YE +DL W PVTPAMGK SFDGP+C+KEI VK E
Sbjct: 180  ILGDKESTDTWLNGSSSSKYDVVLESYESSDLVWCPVTPAMGKSSFDGPECVKEIHVKTE 239

Query: 770  EKTTTISQFFSKKQACKSEEQENKKTL----IKEEPEEYIAPNIEKEE-----PQNRLIS 922
             K +TIS+FFS+K+    +E  ++++     +K +  E +    E EE     P N++  
Sbjct: 240  SK-STISKFFSRKEIKGEQELNSRESTFDKSVKMDLPESVKEEYESEEKLDIPPSNQIND 298

Query: 923  K------TTTMKDEASVMEEPKKDE---MVESTEQKSVKEEPHSQEESLKQIDESDTKNA 1075
            +      +T   ++ +  + P  DE    +   ++   +   H    ++ ++   D    
Sbjct: 299  QDLKSNVSTIPCEDETKCQIPDHDETKCQIPDHDETKCQIPDHDLISNVSKLPHEDATLG 358

Query: 1076 D------------HVKPLAKEKLHISPVKKRRKGAIDKPRKGADDKQPTLFSYFGK 1207
                          + P   EKL  +P +K+         K   DKQPTL SYF K
Sbjct: 359  QPKRHHEEALIDRELNPDGNEKLRRNPARKKAN------LKSGGDKQPTLLSYFRK 408


>emb|CAN82703.1| hypothetical protein VITISV_026469 [Vitis vinifera]
          Length = 370

 Score =  358 bits (918), Expect = 4e-96
 Identities = 198/403 (49%), Positives = 254/403 (63%), Gaps = 17/403 (4%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLRPD+ +RAC+LN+ P +++ MDRYRPSYNV+PG N+P              
Sbjct: 1    MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEEAI 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGLVPSFTKK++K DH++MFNARSES+ EKASFRRL+PKNRCLV+ EGFYEWKK
Sbjct: 61   V-HCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVAVEGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DGSKKQPYYIHLKDGRPLVFAAL+DSW NS+                       DRMP I
Sbjct: 120  DGSKKQPYYIHLKDGRPLVFAALFDSWANSE-----------------------DRMPVI 156

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG KEST+ WLN SS S  + +LKPYE+ DL WYPVT AMGK SF+GP+CIKEIQ+K E+
Sbjct: 157  LGDKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIKEIQLKNEQ 216

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEP--QNRLISKTTTMK-- 940
            +   IS+FFS K   K+E+       +  EP +   P   KEEP  +N     ++ +K  
Sbjct: 217  R--PISKFFSTK-GIKNEQG------LSNEPVKSNLPQSMKEEPAIENSTGLPSSAVKGD 267

Query: 941  -DEASVMEEPKKDEMVESTEQKSVKEEPHSQEES-LKQIDESDTKNADHVK--PLAKEKL 1108
             D       P+++    +   KS+K+EP +++++ L    + D+K  +     P+ ++  
Sbjct: 268  HDSTCSRSVPQEESTWFTNLPKSLKQEPETEDKTGLPFPGDHDSKCDEEATKLPIKRDFE 327

Query: 1109 HISPVKKRRKGAIDKP---------RKGADDKQPTLFSYFGKN 1210
              S   K     ++KP          K A DKQPTLFSYFGK+
Sbjct: 328  EFSADSKPNTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGKS 370


>ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana]
            gi|26449484|dbj|BAC41868.1| unknown protein [Arabidopsis
            thaliana] gi|29028900|gb|AAO64829.1| At2g26470
            [Arabidopsis thaliana] gi|330252748|gb|AEC07842.1|
            uncharacterized protein AT2G26470 [Arabidopsis thaliana]
          Length = 487

 Score =  338 bits (866), Expect = 4e-90
 Identities = 174/330 (52%), Positives = 218/330 (66%), Gaps = 1/330 (0%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLRPDD  RA H ++ P R L++DRYRPSYNVAPG  +P              
Sbjct: 1    MCGRTRCTLRPDDVPRASHRHTVPTRFLHLDRYRPSYNVAPGSYIPVLRRDNEEVVGDGV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGLVPSFTKKTDK D F+MFNARSES+ EKASFRRLLPKNRCLV+ +GFYEWKK
Sbjct: 61   VVHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVAVDGFYEWKK 120

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            +GSKKQPYYIH +DGRPLVFAAL+D+W+NS GE LYTF           L+WLHDRMP I
Sbjct: 121  EGSKKQPYYIHFEDGRPLVFAALFDTWQNSGGETLYTF-TILTTASSSALQWLHDRMPVI 179

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG K+S + WL+D S + L  +L PYE++DL WYPVT A+GK +FDGP+CI++I +K  +
Sbjct: 180  LGDKDSIDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQQIPLKTSQ 239

Query: 773  KTTTISQFFSKKQACKSE-EQENKKTLIKEEPEEYIAPNIEKEEPQNRLISKTTTMKDEA 949
              + IS+FFS KQ    E ++E K T      +  I  +++KE    +     +  K E 
Sbjct: 240  -NSLISKFFSTKQPKTDEGDKETKST------DANIIVDLKKEPTAEKDTFSDSIKKIEE 292

Query: 950  SVMEEPKKDEMVESTEQKSVKEEPHSQEES 1039
               E+   +       Q+ VK EP  ++ S
Sbjct: 293  LDGEKDMSNVAKNLEFQEIVKAEPFVEDNS 322


>ref|XP_002880802.1| hypothetical protein ARALYDRAFT_481505 [Arabidopsis lyrata subsp.
            lyrata] gi|297326641|gb|EFH57061.1| hypothetical protein
            ARALYDRAFT_481505 [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  336 bits (861), Expect = 2e-89
 Identities = 172/340 (50%), Positives = 219/340 (64%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLRPDD  RA H ++ P R L++DRYRPSYN+APG  +P              
Sbjct: 1    MCGRTRCTLRPDDIQRASHRHTVPTRSLHLDRYRPSYNIAPGSYIPVLRRENEVVGDGVV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGLVP FTKKTDK D F+MFNARSES+ EKASFRRLLPKNRCLV+ +GFYEWKK
Sbjct: 61   V-HCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVAVDGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            +GSKKQPYYIH +DGRPLVFAAL+DSW+NS GE LYTF           L+WLHDRMP I
Sbjct: 120  EGSKKQPYYIHFEDGRPLVFAALFDSWQNSGGETLYTF-TILTTTSSSPLQWLHDRMPVI 178

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG K+S + WL+D S + L  +L PYE++DL WYPVT A+GK +FDGP+CI++I +K  +
Sbjct: 179  LGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTTAIGKPTFDGPECIQQIPLKASQ 238

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNRLISKTTTMKDEAS 952
              + IS+FFS+K     +E ++    I  + +E       +E   +  + K   +  E  
Sbjct: 239  -NSLISKFFSRKTEEGDKETKSTDANISVDLKEEPMVGGYEEATFSDSVKKIEELGGEKD 297

Query: 953  VMEEPKKDEMVESTEQKSVKEEPHSQEESLKQIDESDTKN 1072
            ++ E K         Q+ VK EP +++ S         KN
Sbjct: 298  ILNEAKNIGF-----QEIVKAEPFTEDNSAVASHPEPVKN 332


>gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]
          Length = 469

 Score =  334 bits (857), Expect = 5e-89
 Identities = 195/407 (47%), Positives = 243/407 (59%), Gaps = 23/407 (5%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD  RACH N+  VR +NMDRYRPSYNV+PG N+P              
Sbjct: 1    MCGRARCTLRADDVPRACHRNNGSVRTVNMDRYRPSYNVSPGSNIPVVRREDGSDGEGFV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGL+PSFTKKTDK DH++MFNARSESI EK SFRRL+PK+RCLV+ EGFYEWKK
Sbjct: 61   V-HCMKWGLIPSFTKKTDKPDHYKMFNARSESIGEKVSFRRLIPKSRCLVAVEGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKN--------SDGEILYTFXXXXXXXXXXXLEW 568
            DGSKKQPYYIH KDGRPLVFAALYDSW+N          GEILYTF           L W
Sbjct: 120  DGSKKQPYYIHFKDGRPLVFAALYDSWENYLVTAIVIPAGEILYTF-TILTISSSSALGW 178

Query: 569  LHDRMPAILGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIK 748
            LHDRMP I G KES++ WL  SS S +  +LKPYE+ DL WYPVTPAMGK SFDGP+CI 
Sbjct: 179  LHDRMPVIFGDKESSDAWLTGSS-SKVGALLKPYEDPDLVWYPVTPAMGKPSFDGPECI- 236

Query: 749  EIQVKMEEKTTTISQFFS----KKQACKSEEQENKKT----LIKEEPEEYI--APNIEKE 898
            E+++K  +    IS+FFS    KK+A  + E+ + K      ++E+PE      P    E
Sbjct: 237  EMKLK-ADGNIPISKFFSAKGTKKEADLNPEESSSKVDSAKCLEEKPESKANRGPFSSTE 295

Query: 899  EPQNRLISKTTTMKDEASVMEEPKKDEMVESTEQKSVKEEPHSQEESLKQIDESDTKNAD 1078
            + +    S  ++     +   + K+D    S + KS  +E        K++ +S  +   
Sbjct: 296  KGEADSKSSVSSFSQGGAEKCQIKRDHEKLSADSKSNTDE-------TKKLFDSPGRKKV 348

Query: 1079 HVKPLAKEKLHISPVKK-----RRKGAIDKPRKGADDKQPTLFSYFG 1204
             +K     K    P K+      ++G     R G D K P++   FG
Sbjct: 349  KLKSAGDYKQPTRPPKEVAVYNPQRGNTWGRRNGKDQKTPSINCAFG 395


>ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 protein C3orf37 homolog,
            partial [Cucumis sativus]
          Length = 344

 Score =  334 bits (857), Expect = 5e-89
 Identities = 169/325 (52%), Positives = 212/325 (65%), Gaps = 7/325 (2%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLR DD +RACH    PVR LNMDR+RP +N +PG ++P              
Sbjct: 1    MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
               CMKWGL+PSFT+K +K ++F+MFNARSESI EKASF RL+PK RCLV+ EGFYEWKK
Sbjct: 61   LQ-CMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKK 119

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            DG KKQPYYIH KDG+PL  AALYD W+N +GE+LYTF           L+WLHDRMP I
Sbjct: 120  DGXKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTF-TILTTSSSPALKWLHDRMPVI 178

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            LG KE  + WLNDSS S  D +LKPYE  DL WYPVTP+MGK SFDGPDCIKEIQ+K  +
Sbjct: 179  LGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK-ND 237

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNRLISKTTTMKD--- 943
             +  IS+FFS K+  K      +KT      +   +P++E+ + +    + +   KD   
Sbjct: 238  GSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLA 297

Query: 944  ----EASVMEEPKKDEMVESTEQKS 1006
                + S+  + K+D    S++ KS
Sbjct: 298  KCSSDTSLTYQIKRDREDISSDLKS 322


>gb|EPS73813.1| hypothetical protein M569_00942, partial [Genlisea aurea]
          Length = 297

 Score =  332 bits (851), Expect = 2e-88
 Identities = 157/253 (62%), Positives = 186/253 (73%)
 Frame = +2

Query: 53  MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
           MCGR RCT+R D F+RACHL +RP+RH+NMDRY+PSYNVAPGF++P              
Sbjct: 1   MCGRARCTMRADGFARACHLGNRPLRHINMDRYQPSYNVAPGFSLPVVHRDGEKENGVAV 60

Query: 233 XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              CMKWGL+PSF  K DKIDHF+MFNAR+ESI+EKASFRRL+P  RCLV  EGFYEWKK
Sbjct: 61  --QCMKWGLIPSFANKNDKIDHFKMFNARAESIQEKASFRRLIPNKRCLVCVEGFYEWKK 118

Query: 413 DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
           DG+KKQPYYIH  DG PLV AAL+DSWK+S  ++++TF           LEWLHDRMP I
Sbjct: 119 DGTKKQPYYIHFSDGSPLVLAALFDSWKSSSQDVMFTF-TIITTSSSTSLEWLHDRMPVI 177

Query: 593 LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
           LG++ES  CW ND   S   ++LKPYE  +LAWYPVTPAMGK+SFDGPDCI+E+      
Sbjct: 178 LGNQESIHCWFNDGMPSL--QLLKPYEGKNLAWYPVTPAMGKVSFDGPDCIREV---TSS 232

Query: 773 KTTTISQFFSKKQ 811
               ISQFFSKK+
Sbjct: 233 HVKPISQFFSKKE 245


>ref|XP_004290142.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 2 [Fragaria vesca
            subsp. vesca]
          Length = 348

 Score =  330 bits (845), Expect = 1e-87
 Identities = 187/373 (50%), Positives = 229/373 (61%), Gaps = 16/373 (4%)
 Frame = +2

Query: 140  MDRYRPSYNVAPGFNVPXXXXXXXXXXXXXXXSHCMKWGLVPSFTKKTDKIDHFRMFNAR 319
            MDRY+P YNV+PG N+P                HCMKWGL+PSFTKKT+K DH+RMFNAR
Sbjct: 1    MDRYQPRYNVSPGANLPVVRRGDGADGEDGVVLHCMKWGLIPSFTKKTEKPDHYRMFNAR 60

Query: 320  SESIREKASFRRLLPKNRCLVSFEGFYEWKKDGSKKQPYYIHLKDGRPLVFAALYDSWKN 499
            SESI EKASFRRL+PK+RC+V+ EGFYEWKKDGSKKQPYY+H KDGRPL+FAALYDSW+N
Sbjct: 61   SESICEKASFRRLVPKSRCVVAVEGFYEWKKDGSKKQPYYVHFKDGRPLLFAALYDSWEN 120

Query: 500  SD-----------GEILYTFXXXXXXXXXXXLEWLHDRMPAILGSKESTECWLNDSSLSN 646
            S+           GE LYTF           L WLHDRMP +LG KES + WL+ SS SN
Sbjct: 121  SEGTNVYTECETAGEKLYTF-TIITTSSSSALGWLHDRMPVVLGDKESVDTWLDGSSASN 179

Query: 647  LDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEEKTTTISQFFSKKQACKSE 826
             DK+LKPYE  DL WYPVTPAMGK+SFDGP+C  EI++K  + T +I++FFS K   K E
Sbjct: 180  FDKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSNEIKLK-TDGTNSITKFFSTK-GTKKE 237

Query: 827  EQENKKTLIKEEPEEYIAPNIEKEEPQNR--LISKTTTMKDEASVMEEPKKDEMVESTEQ 1000
            E   K T + +   +   P    EEP+ +   +  ++T+K      E+ K    + S E 
Sbjct: 238  EINPKDTSLHDSSVKTEFPESLNEEPETKEEKVQPSSTVK-----CEDSKSSVSILSQED 292

Query: 1001 KSVKEEPHSQEESLKQIDESDTKNADHVKPLAKE---KLHISPVKKRRKGAIDKPRKGAD 1171
             S ++     EE L          AD  KPL  E   K   SP KK+         K + 
Sbjct: 293  ASKEQTKRDYEEFL----------ADS-KPLPNESDKKSSASPAKKKVN------LKTSH 335

Query: 1172 DKQPTLFSYFGKN 1210
            DKQPTLFSYF K+
Sbjct: 336  DKQPTLFSYFRKS 348


>ref|XP_006403078.1| hypothetical protein EUTSA_v10003450mg [Eutrema salsugineum]
            gi|557104185|gb|ESQ44531.1| hypothetical protein
            EUTSA_v10003450mg [Eutrema salsugineum]
          Length = 480

 Score =  329 bits (843), Expect = 2e-87
 Identities = 180/357 (50%), Positives = 214/357 (59%), Gaps = 8/357 (2%)
 Frame = +2

Query: 53   MCGRGRCTLRPDDFSRACHLNSRPVRHLNMDRYRPSYNVAPGFNVPXXXXXXXXXXXXXX 232
            MCGR RCTLRPDD  RA H +  P R L++DRYRPSYNVAPG  +P              
Sbjct: 1    MCGRARCTLRPDDVPRASHRHGVPARFLHLDRYRPSYNVAPGTYMPVLRRDNDGIAV--- 57

Query: 233  XSHCMKWGLVPSFTKKTDKIDHFRMFNARSESIREKASFRRLLPKNRCLVSFEGFYEWKK 412
              HCMKWGLVPSFTKKTDK D F+MFNARSES+ EKASFRRLLPKNRCLV+ +GFYEWKK
Sbjct: 58   --HCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVAVDGFYEWKK 115

Query: 413  DGSKKQPYYIHLKDGRPLVFAALYDSWKNSDGEILYTFXXXXXXXXXXXLEWLHDRMPAI 592
            +GSKKQPYYIH  D RPLVFAAL+DSW+NS GE L TF           L+WLHDRMP I
Sbjct: 116  EGSKKQPYYIHFNDRRPLVFAALFDSWQNSGGETLDTF-TILTTTSSSALDWLHDRMPVI 174

Query: 593  LGSKESTECWLNDSSLSNLDKILKPYEETDLAWYPVTPAMGKISFDGPDCIKEIQVKMEE 772
            L  KES + WL+  S SNL  +L PYE +DL WYPVT A+GK+ FDGP+CI++I +K  +
Sbjct: 175  LNDKESVDTWLDGPSTSNLKPLLVPYENSDLVWYPVTSAIGKLCFDGPECIQQIPLKASQ 234

Query: 773  KTTTISQFFSKKQACKSEEQENKKTLIKEEPEEYIAPNIEKEEPQNRLISKTTTMKDEAS 952
              + IS+FFS K     E     K+   + P +       K E  +         K E  
Sbjct: 235  -NSLISKFFSAKHPNTDEGDRETKSTDADTPVD--LKEKPKVEGYDEAFFSNCNKKSEEL 291

Query: 953  VMEEPKKDEMVESTEQKSVKEEPHSQE--------ESLKQIDESDTKNADHVKPLAK 1099
              E  K +E      Q   K EP  ++        ES+K   E DTK       L+K
Sbjct: 292  DEEIDKSNEAKNLGFQNIAKAEPLMEDNSAVVLRLESVKNEVEEDTKGKSIKTALSK 348


Top