BLASTX nr result

ID: Akebia27_contig00017574 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00017574
         (1371 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog i...   449   e-123
ref|XP_007049611.1| Uncharacterized protein TCM_002685 [Theobrom...   446   e-122
ref|XP_007199067.1| hypothetical protein PRUPE_ppa018685mg [Prun...   446   e-122
ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   443   e-122
ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific 5-hy...   439   e-120
ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago ...   439   e-120
ref|XP_007140735.1| hypothetical protein PHAVU_008G137400g [Phas...   435   e-119
ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Popu...   432   e-118
gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]     429   e-117
ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hy...   427   e-117
ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   424   e-116
ref|XP_002527247.1| conserved hypothetical protein [Ricinus comm...   411   e-112
ref|XP_004290142.1| PREDICTED: UPF0361 protein C3orf37 homolog i...   405   e-110
ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 prot...   402   e-109
gb|EYU40658.1| hypothetical protein MIMGU_mgv1a008176mg [Mimulus...   397   e-108
emb|CAN82703.1| hypothetical protein VITISV_026469 [Vitis vinifera]   394   e-107
ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [A...   391   e-106
ref|XP_004165094.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   382   e-103
ref|XP_007140736.1| hypothetical protein PHAVU_008G137400g [Phas...   376   e-101
ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana] ...   366   1e-98

>ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 1 [Fragaria vesca
            subsp. vesca]
          Length = 366

 Score =  449 bits (1155), Expect = e-123
 Identities = 225/367 (61%), Positives = 275/367 (74%), Gaps = 4/367 (1%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGS-GAQGT 196
            MCGRARC+LRA+DI RAC  ++G  R++ MD Y+PR+NVSPG+++PVVRR +G+ G  G 
Sbjct: 1    MCGRARCTLRADDISRACYRNHGPVRSVNMDRYQPRYNVSPGANLPVVRRGDGADGEDGV 60

Query: 197  VLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKK 376
            VLHCMKWGLIPSFTKKT+KPDHY+MFNARSESICEKASFRRLVP +RC+VAVEG+YEWKK
Sbjct: 61   VLHCMKWGLIPSFTKKTEKPDHYRMFNARSESICEKASFRRLVPKSRCVVAVEGFYEWKK 120

Query: 377  DGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVIL 556
            DGS+KQPYY+H KD +PL+FAALYDSW NSEGE LYTFTI+TTS SSAL WLHDRMPV+L
Sbjct: 121  DGSKKQPYYVHFKDGRPLLFAALYDSWENSEGEKLYTFTIITTSSSSALGWLHDRMPVVL 180

Query: 557  GNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEK 736
            G+K S+D WL+GS +   + +LKPYE  DLVWYPVTPAMGK SFDGP+C  EI LKT+  
Sbjct: 181  GDKESVDTWLDGSSASNFDKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSNEIKLKTDGT 240

Query: 737  NLISKFFSKKRTSNEQEIIPQIENSSEESAPTK--PSKNLKXXXXXXXLPIDSEEGNENP 910
            N I+KFFS K T  E EI P+  +  + S  T+   S N +       +   S    E+ 
Sbjct: 241  NSITKFFSTKGTKKE-EINPKDTSLHDSSVKTEFPESLNEEPETKEEKVQPSSTVKCEDS 299

Query: 911  KFIISPVLNEEAEKCGAKREYKELTSDVKPF-NHNFKMQGSSPVKKKANLKNAGEKQPTL 1087
            K  +S +  E+A K   KR+Y+E  +D KP  N + K   +SP KKK NLK + +KQPTL
Sbjct: 300  KSSVSILSQEDASKEQTKRDYEEFLADSKPLPNESDKKSSASPAKKKVNLKTSHDKQPTL 359

Query: 1088 FSYFGKS 1108
            FSYF KS
Sbjct: 360  FSYFRKS 366


>ref|XP_007049611.1| Uncharacterized protein TCM_002685 [Theobroma cacao]
            gi|508701872|gb|EOX93768.1| Uncharacterized protein
            TCM_002685 [Theobroma cacao]
          Length = 360

 Score =  446 bits (1147), Expect = e-122
 Identities = 225/368 (61%), Positives = 275/368 (74%), Gaps = 6/368 (1%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGS-GAQGT 196
            MCGRARC+LRA+DIPRA   ++G  R++ MD YRP +NV PG ++PVVRR++GS G  G 
Sbjct: 1    MCGRARCTLRADDIPRASHRNDGPVRHVHMDRYRPSYNVGPGMNLPVVRRDDGSNGDGGV 60

Query: 197  VLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKK 376
            VLHCMKWGLIPSFTKKTDKPD YKMFNARSES+CEKASFRRL+P +RCLVAVEG+YEWKK
Sbjct: 61   VLHCMKWGLIPSFTKKTDKPDFYKMFNARSESVCEKASFRRLLPKSRCLVAVEGFYEWKK 120

Query: 377  DGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVIL 556
            DGS+KQPYYIH KD +PLVFAALYD W NSEGE LYTFTILTT+ SSA  WLHDRMPVIL
Sbjct: 121  DGSKKQPYYIHFKDGRPLVFAALYDCWENSEGEKLYTFTILTTASSSAFLWLHDRMPVIL 180

Query: 557  GNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEK 736
            G+K S D WLNG+   K +++LKPYE+ DLVWYPVT A+GK SF+GP+C+KE+ LKT+EK
Sbjct: 181  GDKESTDTWLNGT---KIDTLLKPYENPDLVWYPVTSAIGKLSFEGPECVKEVPLKTQEK 237

Query: 737  NLISKFFSKKRTSNEQEIIPQIENS-SEESAPTKPSKNLK---XXXXXXXLP-IDSEEGN 901
            N ISKFFS +    EQE    +E S  +ES  T   KNLK          +P + S+E N
Sbjct: 238  NPISKFFSTREVKREQE--SNMEKSLCDESVQTNLLKNLKEEPNSPEDKEIPSLASKEDN 295

Query: 902  ENPKFIISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQP 1081
            ++   ++ P   E+  KC  KR+Y+E ++D KP     ++   SP +KK N+K    KQP
Sbjct: 296  DSKSSVLVPTC-EDVRKCQTKRDYEEFSADTKPAKDEIEV---SPARKKGNIKGVAGKQP 351

Query: 1082 TLFSYFGK 1105
            TLF+YFGK
Sbjct: 352  TLFAYFGK 359


>ref|XP_007199067.1| hypothetical protein PRUPE_ppa018685mg [Prunus persica]
            gi|462394467|gb|EMJ00266.1| hypothetical protein
            PRUPE_ppa018685mg [Prunus persica]
          Length = 363

 Score =  446 bits (1147), Expect = e-122
 Identities = 220/366 (60%), Positives = 265/366 (72%), Gaps = 3/366 (0%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGRARC+LRA+DIPRAC   +G  R + MD +RP FN SPGS++PVVRRE+G    G V
Sbjct: 1    MCGRARCTLRADDIPRACHRSHGPVRTVNMDRFRPLFNASPGSNLPVVRREDGGDGDGVV 60

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            +HCMKWGLIPSFTKKT+KPDHYKMFNARSESICEKASFRRL+P NRCL+AVEG+YEWKKD
Sbjct: 61   VHCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNRCLIAVEGFYEWKKD 120

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
            GS+KQPYY+H  D +PL+FAALYD W NSEGE LYTFTI+TTS SSAL WLHDRMPVILG
Sbjct: 121  GSKKQPYYVHFNDGRPLLFAALYDFWENSEGEKLYTFTIITTSSSSALGWLHDRMPVILG 180

Query: 560  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
            +K S D+WL+GS +   +S+LKPYE  DLVWYPVT AMGK SFDGP+CI EI LKTE  N
Sbjct: 181  DKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTQAMGKVSFDGPECINEIQLKTEGNN 240

Query: 740  LISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLK---XXXXXXXLPIDSEEGNENP 910
             I+KFF  K T  E E+ P+  +  + S      K++K           P  +E+   + 
Sbjct: 241  SITKFFMSKGTKKE-ELNPKDTSFYDSSVKNDLPKSVKEEPEGKEKTEQPASTEKCENDS 299

Query: 911  KFIISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLF 1090
            K     +  E   K   KR+Y+E ++D KP  +      +SP KKK N K++ +KQPTLF
Sbjct: 300  KG--QTISQEGVSKGQTKRDYEEFSADSKPVAYETSEMSASPAKKKVNPKSSVDKQPTLF 357

Query: 1091 SYFGKS 1108
            SYFGKS
Sbjct: 358  SYFGKS 363


>ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [Vitis vinifera]
            gi|296090568|emb|CBI40918.3| unnamed protein product
            [Vitis vinifera]
          Length = 392

 Score =  443 bits (1140), Expect = e-122
 Identities = 231/397 (58%), Positives = 281/397 (70%), Gaps = 34/397 (8%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGRARC+LR ++I RAC ++   ++N+QMD YRP +NVSPG+++PVVRR  G+  +  +
Sbjct: 1    MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEEAI 60

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            +HCMKWGL+PSFTKK++KPDHYKMFNARSES+CEKASFRRLVP NRCLVAVEG+YEWKKD
Sbjct: 61   VHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVAVEGFYEWKKD 120

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
            GS+KQPYYIHLKD +PLVFAAL+DSW NSEGE+LYT TILTTS SSALQWLHDRMPVILG
Sbjct: 121  GSKKQPYYIHLKDGRPLVFAALFDSWANSEGEILYTCTILTTSSSSALQWLHDRMPVILG 180

Query: 560  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
            +K S DAWLNGS S + N++LKPYED DLVWYPVT AMGKPSF+GP+CIKEI LK E++ 
Sbjct: 181  DKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIKEIQLKNEQRP 240

Query: 740  LISKFFSKKRTSNEQEII---------------PQIENSSEESAPTKPSKNLKXXXXXXX 874
             ISKFFS K   NEQ +                P IENS+    P+   K          
Sbjct: 241  -ISKFFSTKGIKNEQGLSNEPVKSNLPQSLKEEPAIENST--GLPSSTVKGDHDSTCSRS 297

Query: 875  LP-------------IDSEEGNENPKFIISP-----VLNEEAEKCGAKREYKELTSDVKP 1000
            +P             +  E   E+   +  P       +EEA K   KR+++E ++D KP
Sbjct: 298  IPQEESTWFTNLPKSLKQEPETEDKTGLPFPGDHDSKCDEEATKLPIKRDFEEFSADSKP 357

Query: 1001 FNHNFKMQGSSPVKKKANL-KNAGEKQPTLFSYFGKS 1108
              +   ++  SPV KK  L KNAG+KQPTLFSYFGKS
Sbjct: 358  --NTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGKS 392


>ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like isoform X1
            [Citrus sinensis]
          Length = 398

 Score =  439 bits (1130), Expect = e-120
 Identities = 225/399 (56%), Positives = 275/399 (68%), Gaps = 37/399 (9%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGRARC+LRA+D+PRAC      +R L MD YRP +NV+PG ++PVVRR++    +G V
Sbjct: 1    MCGRARCTLRADDLPRACHRTGSPARTLNMDRYRPSYNVAPGWNLPVVRRDDDG--EGFV 58

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            LHCMKWGLIPSFTKK +KPD YKMFNARSES+ EKASFRRL+P +RCL AVEG+YEWKKD
Sbjct: 59   LHCMKWGLIPSFTKKNEKPDFYKMFNARSESVTEKASFRRLLPKSRCLAAVEGFYEWKKD 118

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
            GS+KQPYY+H KD +PLVFAALYD+W +SEGE+LYTFTILTTS S+ALQWLHDRMPVILG
Sbjct: 119  GSKKQPYYVHFKDGRPLVFAALYDTWQSSEGEILYTFTILTTSSSAALQWLHDRMPVILG 178

Query: 560  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
            +K S DAWLNGS S K ++ILKPYE+SDLVWYPVTP MGK SF+GP+CIKEI LKTE KN
Sbjct: 179  DKESSDAWLNGSSSSKYDTILKPYEESDLVWYPVTPVMGKLSFNGPECIKEIPLKTEGKN 238

Query: 740  LISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXXL--PIDS-------- 889
             IS FF KK    EQE     ++S +ES  T   K +K          P+          
Sbjct: 239  PISNFFLKKEIKKEQESKMDEKSSFDESVKTNLPKRMKGEPIKEIKEEPVSGLEEKYSFD 298

Query: 890  ---------------------------EEGNENPKFIISPVLNEEAEKCGAKREYKELTS 988
                                       E+G+ + K + S + +E+ +K   KR+YKE  +
Sbjct: 299  TTAQTNLPKSVKDEAVTADDIRTQSSVEKGDPDTKSVASVLSDEDTKKELQKRDYKEFLA 358

Query: 989  DVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLFSYFGK 1105
            D KP         +SP+K+K N+K+AGEKQPTLFSY+ K
Sbjct: 359  DSKPVIDGNNKLETSPLKRKGNVKDAGEKQPTLFSYYSK 397


>ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago truncatula]
            gi|355497798|gb|AES79001.1| hypothetical protein
            MTR_7g052250 [Medicago truncatula]
          Length = 354

 Score =  439 bits (1130), Expect = e-120
 Identities = 228/365 (62%), Positives = 270/365 (73%), Gaps = 3/365 (0%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQ--G 193
            MCGR RCSLRA+D+PRAC      SR L +D YRP  NVSPG +IPVVRRE+ + A+  G
Sbjct: 1    MCGRTRCSLRADDVPRACHRTTAPSRLLHIDRYRPSNNVSPGFNIPVVRREDNASAESDG 60

Query: 194  TVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWK 373
             V+HCMKWGLIPSFTKKTDKPDHYKMFNARSESI EKASFRRL+P NRCLVAVEG+YEWK
Sbjct: 61   HVVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAVEGFYEWK 120

Query: 374  KDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVI 553
            KDGS+KQPYYIH KD +PLVFAALYDSW NSEGE+LYTFTI+TTS SSA +WLHDRMPVI
Sbjct: 121  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTSSSSAFKWLHDRMPVI 180

Query: 554  LGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEE 733
            LG+K + D WL+ + S K  S++KPYE+SDLVWYPVTPAMGKPSFDGP+CIKEI +KTE 
Sbjct: 181  LGDKDTTDTWLSSASSFK--SVMKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQIKTEG 238

Query: 734  KNLISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXXLPIDSEEGNENPK 913
               ISKFFSKK    E +  P+ +  S E   T+ +K++            +EEG+ + K
Sbjct: 239  YIPISKFFSKKEAEVE-DTKPEHKILSHEPVKTEQTKDVSE-------EAKTEEGDTDLK 290

Query: 914  FI-ISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLF 1090
               ISP  ++   +   KREY  ++SD KP   N     ++P KKK   K A +KQPTLF
Sbjct: 291  SSGISP--SQNVNRFAIKREYDAISSDSKPSLANNDQVSANPAKKKEKAKTADDKQPTLF 348

Query: 1091 SYFGK 1105
            SYFGK
Sbjct: 349  SYFGK 353


>ref|XP_007140735.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris]
            gi|561013868|gb|ESW12729.1| hypothetical protein
            PHAVU_008G137400g [Phaseolus vulgaris]
          Length = 353

 Score =  435 bits (1119), Expect = e-119
 Identities = 223/362 (61%), Positives = 267/362 (73%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGR RC+LR++D+PRAC   +  +R L MD YRP +NVSPGS++PVVRREE S + G V
Sbjct: 1    MCGRTRCTLRSDDVPRACHRSDAPTRTLHMDRYRPAYNVSPGSNMPVVRREEASDSGGYV 60

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            LH MKWGLIPSFTKKT+KPDHYKMFNARSESI EKASFRRL+P +RCLVAVEG+YEWKKD
Sbjct: 61   LHSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEWKKD 120

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
            GS+KQPYYIH KD + LVFAALYDSW NSEGE L+TFTI+TTS SSALQWLHDRMPVILG
Sbjct: 121  GSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSSSSALQWLHDRMPVILG 180

Query: 560  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
            +K S D WL+ S S    S++KPYE+SDLVWYPVT AMGK SFDGP+CIKEI +K E   
Sbjct: 181  SKESTDTWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKTSFDGPECIKEIQVKAEGNT 239

Query: 740  LISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXXLPIDSEEGNENPKFI 919
             IS FFSKK  +  ++  P+ + SS E   T+P+++L            +EEG+ + KF 
Sbjct: 240  SISMFFSKK-GAESKDTKPEQKLSSHEFVKTEPTEDLIEG-------AKAEEGDNDLKFS 291

Query: 920  ISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLFSYF 1099
             S   ++ A     KREY+  ++D KP   N     S+P KKK   K A +KQPTLFSYF
Sbjct: 292  GSS-HSKNASTLPIKREYETFSADSKPALANHDQISSNPAKKKEKTKTANDKQPTLFSYF 350

Query: 1100 GK 1105
            GK
Sbjct: 351  GK 352


>ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa]
            gi|222844806|gb|EEE82353.1| hypothetical protein
            POPTR_0002s25190g [Populus trichocarpa]
          Length = 367

 Score =  432 bits (1110), Expect = e-118
 Identities = 216/368 (58%), Positives = 262/368 (71%), Gaps = 6/368 (1%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEG------S 181
            MCGRARC+LRA+DIPRAC  +  + R++ MD YRP +N SPGS++ VVRR++       S
Sbjct: 1    MCGRARCTLRADDIPRACHRNTATVRSVNMDRYRPSYNASPGSNLAVVRRDDAASGDGAS 60

Query: 182  GAQGTVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGY 361
            G  G  +HCMKWGLIP FTKK++KPD YKMFNARSES+ EKASFRRL+P +RCLVAVEG+
Sbjct: 61   GGDGYAIHCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCLVAVEGF 120

Query: 362  YEWKKDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDR 541
            YEWKKDGS+KQPYYIH KD +PLVFAALYDSW NSEGE+LYTFTI+TT+ SSA+QWLH+R
Sbjct: 121  YEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAIQWLHER 180

Query: 542  MPVILGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHL 721
            MPVILG+K + D WL+ S + K +++LKPYE SDLVWYPVTPAMGKPSFDGP+CIKEIHL
Sbjct: 181  MPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPECIKEIHL 240

Query: 722  KTEEKNLISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXXLPIDSEEGN 901
            K EEK  ISKFFS+K    E          S +  P K  K           P  ++  +
Sbjct: 241  KMEEKGTISKFFSRKEFKEESNPEESTHGKSLKLEP-KSVKEENESEEKLETPCSAKTVD 299

Query: 902  ENPKFIISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQP 1081
             + K  +    +E   KC  KR+ +EL  D K          +SP KKKANLK+  +KQP
Sbjct: 300  YDLKSELETFSHEGETKCKTKRDREELV-DSKLKTDEIVKPRASPAKKKANLKSVDDKQP 358

Query: 1082 TLFSYFGK 1105
            TL SYFGK
Sbjct: 359  TLLSYFGK 366


>gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]
          Length = 469

 Score =  429 bits (1102), Expect = e-117
 Identities = 223/367 (60%), Positives = 263/367 (71%), Gaps = 12/367 (3%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGRARC+LRA+D+PRAC  +NGS R + MD YRP +NVSPGS+IPVVRRE+GS  +G V
Sbjct: 1    MCGRARCTLRADDVPRACHRNNGSVRTVNMDRYRPSYNVSPGSNIPVVRREDGSDGEGFV 60

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            +HCMKWGLIPSFTKKTDKPDHYKMFNARSESI EK SFRRL+P +RCLVAVEG+YEWKKD
Sbjct: 61   VHCMKWGLIPSFTKKTDKPDHYKMFNARSESIGEKVSFRRLIPKSRCLVAVEGFYEWKKD 120

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGN--------SEGEMLYTFTILTTSCSSALQWLH 535
            GS+KQPYYIH KD +PLVFAALYDSW N          GE+LYTFTILT S SSAL WLH
Sbjct: 121  GSKKQPYYIHFKDGRPLVFAALYDSWENYLVTAIVIPAGEILYTFTILTISSSSALGWLH 180

Query: 536  DRMPVILGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEI 715
            DRMPVI G+K S DAWL GS S K  ++LKPYED DLVWYPVTPAMGKPSFDGP+CI E+
Sbjct: 181  DRMPVIFGDKESSDAWLTGS-SSKVGALLKPYEDPDLVWYPVTPAMGKPSFDGPECI-EM 238

Query: 716  HLKTEEKNLISKFFSKKRTSNEQEIIPQIENSSEESA---PTKPSKNLKXXXXXXXLPID 886
             LK +    ISKFFS K T  E ++ P+  +S  +SA     KP                
Sbjct: 239  KLKADGNIPISKFFSAKGTKKEADLNPEESSSKVDSAKCLEEKPESKANRGPFS-----S 293

Query: 887  SEEGNENPKFIISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSPVKKKANLKNA 1066
            +E+G  + K  +S      AEKC  KR++++L++D K      K    SP +KK  LK+A
Sbjct: 294  TEKGEADSKSSVSSFSQGGAEKCQIKRDHEKLSADSKSNTDETKKLFDSPGRKKVKLKSA 353

Query: 1067 GE-KQPT 1084
            G+ KQPT
Sbjct: 354  GDYKQPT 360


>ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like [Glycine
            max]
          Length = 382

 Score =  427 bits (1099), Expect = e-117
 Identities = 224/392 (57%), Positives = 271/392 (69%), Gaps = 26/392 (6%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGRARC+LRA+D+PRAC      +R L +D YRP +NVSPG  +PVVRR++ SG +G V
Sbjct: 1    MCGRARCTLRADDVPRACHRSTSPTRTLHIDRYRPAYNVSPGFDVPVVRRDDASGGEGYV 60

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            L CMKWGLIPSFTKKT+KPDHY+MFNARSESI EKASFRRL+P +RCLVAVEG+YEWKKD
Sbjct: 61   LQCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEWKKD 120

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
            GS+KQPYYIH KD +PLVFAALYDSW NSEGE LYTFTI+TTS SSALQWLHDRMPVILG
Sbjct: 121  GSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSALQWLHDRMPVILG 180

Query: 560  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
            +K S D WL+ S S    S++KPYE+SDLVWYPVT AMGK SFDGP+CIKEI +K +   
Sbjct: 181  SKESTDIWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKASFDGPECIKEIQVKAQGNT 239

Query: 740  LISKFFSKK-------------------RTSNEQEII------PQIENSSEESAPTKPSK 844
             IS FFSKK                   +T + +++       P+ + SS E   T+P++
Sbjct: 240  SISMFFSKKGDESKDTKPEQKASCPEVVKTEHTEDLTESKDTKPEQKTSSHEFVKTEPTE 299

Query: 845  NLKXXXXXXXLPIDSEEGNENPKFIISPVLNEEAEKCGAKREYKELT-SDVKPFNHNFKM 1021
            +L+           +EEG  + KF  S   ++       KREY+  + +D KP   N   
Sbjct: 300  DLRER-------AKTEEGGNDLKFHGSS-HSQNVSMLPIKREYETFSAADSKPALANHDQ 351

Query: 1022 QGSSPVKKKANLKNAGEKQPTLFSYFGKS*NH 1117
               +P KKK   K A +KQPTLFSYFGKS NH
Sbjct: 352  ISPNPAKKKEKAKTANDKQPTLFSYFGKS-NH 382


>ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cicer arietinum]
          Length = 375

 Score =  424 bits (1089), Expect = e-116
 Identities = 223/366 (60%), Positives = 259/366 (70%), Gaps = 2/366 (0%)
 Frame = +2

Query: 14   ERMCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQG 193
            + MCGR RC+LR +DIP AC      +R L +D YRP  NVSPG H+PVVRRE+ S ++G
Sbjct: 19   DEMCGRGRCTLRPDDIPTACHRTTAPTRLLHVDRYRPSHNVSPGFHMPVVRREDASESEG 78

Query: 194  TVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWK 373
             VLHCMKWGLIPSFTKKT+KPDHY+MFNARSESI EKASFRRL+P NRCLVAVEG+YEWK
Sbjct: 79   HVLHCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKNRCLVAVEGFYEWK 138

Query: 374  KDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVI 553
            KDGS+KQPYYIH KD +PLVFAALYDSW NSEGE LYTFTI+TTS SS LQWLHDRMPVI
Sbjct: 139  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSTLQWLHDRMPVI 198

Query: 554  LGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEE 733
            L +K S D WLN + S K  S+LKPYE+ DL WYPVTPAMGKPSFDGP+CIKEI +K E 
Sbjct: 199  LSDKDSTDTWLNSASSFK--SVLKPYEECDLAWYPVTPAMGKPSFDGPECIKEIQVKAEG 256

Query: 734  KNLISKFFSKKRTSNEQ-EIIPQIENSSEESAPTKPSKNLKXXXXXXXLPIDSEEGNENP 910
               ISKFFS+K    E  +   +I +   E  P K  +  K           +EEG  + 
Sbjct: 257  NIPISKFFSRKGGEGEDTKSGHKILSLCHE--PVKTEQTTKDLSEG----AKTEEGESDL 310

Query: 911  KFIISPVLNEEAEKCGAKREYKELTSDVKP-FNHNFKMQGSSPVKKKANLKNAGEKQPTL 1087
            K   S   N    K   KREY  ++SD KP    N ++  + P KKK   KNA +KQPTL
Sbjct: 311  KSSGSSPQN--VTKFTVKREYDAISSDSKPSLGINDQVIANPPTKKKEKAKNADDKQPTL 368

Query: 1088 FSYFGK 1105
            FS+FGK
Sbjct: 369  FSFFGK 374


>ref|XP_002527247.1| conserved hypothetical protein [Ricinus communis]
            gi|223533340|gb|EEF35091.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 409

 Score =  411 bits (1056), Expect = e-112
 Identities = 221/408 (54%), Positives = 268/408 (65%), Gaps = 46/408 (11%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRRE-EGS-GAQG 193
            MCGRARC+LRA+DIPRAC    G  R++ MD +RP +NVSPGS++PVV RE +GS G  G
Sbjct: 1    MCGRARCTLRADDIPRACHRTTGPVRSVNMDRWRPSYNVSPGSNMPVVCREGDGSDGGDG 60

Query: 194  TVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWK 373
              + CM WGLIPSFTKKT+KPD YKMFNARSES+ EKASFRRL+P +RCLVA EG+YEWK
Sbjct: 61   FFVQCMTWGLIPSFTKKTEKPDFYKMFNARSESVGEKASFRRLLPKSRCLVAAEGFYEWK 120

Query: 374  KDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVI 553
            KDGS+KQPYYIH KD +PLVFAALYDSW NSEGE+LYTFTILTTS SSAL+WLHDRMPVI
Sbjct: 121  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTILTTSSSSALEWLHDRMPVI 180

Query: 554  LGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEE 733
            LG+K S D WLNGS S K + +L+ YE SDLVW PVTPAMGK SFDGP+C+KEIH+KTE 
Sbjct: 181  LGDKESTDTWLNGSSSSKYDVVLESYESSDLVWCPVTPAMGKSSFDGPECVKEIHVKTES 240

Query: 734  KNLISKFFSKKRTSNEQEI--------------IPQI---ENSSEESAPTKPSKNLK--- 853
            K+ ISKFFS+K    EQE+              +P+    E  SEE     PS  +    
Sbjct: 241  KSTISKFFSRKEIKGEQELNSRESTFDKSVKMDLPESVKEEYESEEKLDIPPSNQINDQD 300

Query: 854  XXXXXXXLPIDSEEGNENPKF------------------------IISPVLNEEAEKCGA 961
                   +P + E   + P                           +S + +E+A     
Sbjct: 301  LKSNVSTIPCEDETKCQIPDHDETKCQIPDHDETKCQIPDHDLISNVSKLPHEDATLGQP 360

Query: 962  KREYKELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLFSYFGK 1105
            KR ++E   D +      +    +P +KKANLK+ G+KQPTL SYF K
Sbjct: 361  KRHHEEALIDRELNPDGNEKLRRNPARKKANLKSGGDKQPTLLSYFRK 408


>ref|XP_004290142.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 2 [Fragaria vesca
            subsp. vesca]
          Length = 348

 Score =  405 bits (1041), Expect = e-110
 Identities = 208/349 (59%), Positives = 252/349 (72%), Gaps = 15/349 (4%)
 Frame = +2

Query: 107  MDGYRPRFNVSPGSHIPVVRREEGS-GAQGTVLHCMKWGLIPSFTKKTDKPDHYKMFNAR 283
            MD Y+PR+NVSPG+++PVVRR +G+ G  G VLHCMKWGLIPSFTKKT+KPDHY+MFNAR
Sbjct: 1    MDRYQPRYNVSPGANLPVVRRGDGADGEDGVVLHCMKWGLIPSFTKKTEKPDHYRMFNAR 60

Query: 284  SESICEKASFRRLVPNNRCLVAVEGYYEWKKDGSRKQPYYIHLKDDQPLVFAALYDSWGN 463
            SESICEKASFRRLVP +RC+VAVEG+YEWKKDGS+KQPYY+H KD +PL+FAALYDSW N
Sbjct: 61   SESICEKASFRRLVPKSRCVVAVEGFYEWKKDGSKKQPYYVHFKDGRPLLFAALYDSWEN 120

Query: 464  SE-----------GEMLYTFTILTTSCSSALQWLHDRMPVILGNKSSIDAWLNGSFSPKS 610
            SE           GE LYTFTI+TTS SSAL WLHDRMPV+LG+K S+D WL+GS +   
Sbjct: 121  SEGTNVYTECETAGEKLYTFTIITTSSSSALGWLHDRMPVVLGDKESVDTWLDGSSASNF 180

Query: 611  NSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKNLISKFFSKKRTSNEQEI 790
            + +LKPYE  DLVWYPVTPAMGK SFDGP+C  EI LKT+  N I+KFFS K T  E EI
Sbjct: 181  DKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSNEIKLKTDGTNSITKFFSTKGTKKE-EI 239

Query: 791  IPQIENSSEESAPTK--PSKNLKXXXXXXXLPIDSEEGNENPKFIISPVLNEEAEKCGAK 964
             P+  +  + S  T+   S N +       +   S    E+ K  +S +  E+A K   K
Sbjct: 240  NPKDTSLHDSSVKTEFPESLNEEPETKEEKVQPSSTVKCEDSKSSVSILSQEDASKEQTK 299

Query: 965  REYKELTSDVKPF-NHNFKMQGSSPVKKKANLKNAGEKQPTLFSYFGKS 1108
            R+Y+E  +D KP  N + K   +SP KKK NLK + +KQPTLFSYF KS
Sbjct: 300  RDYEEFLADSKPLPNESDKKSSASPAKKKVNLKTSHDKQPTLFSYFRKS 348


>ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 protein C3orf37 homolog,
            partial [Cucumis sativus]
          Length = 344

 Score =  402 bits (1033), Expect = e-109
 Identities = 205/348 (58%), Positives = 247/348 (70%), Gaps = 1/348 (0%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGRARC+LRA+DI RAC    G  R+L MD +RP FN SPGS +PVVRR++ S   G V
Sbjct: 1    MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            L CMKWGLIPSFT+K +KP+++KMFNARSESI EKASF RLVP  RCLVAVEG+YEWKKD
Sbjct: 61   LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
            G +KQPYYIH KD QPL  AALYD W N EGE+LYTFTILTTS S AL+WLHDRMPVILG
Sbjct: 121  GXKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180

Query: 560  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
            +K  +D WLN S S K +S+LKPYE  DLVWYPVTP+MGKPSFDGPDCIKEI LK +  N
Sbjct: 181  DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240

Query: 740  LISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXXLPIDSEEGNENPKFI 919
            LISKFFS K T  E   + Q +  S  S   + S +L+           SEE     K  
Sbjct: 241  LISKFFSAKETKKEYS-VSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEES----KDC 295

Query: 920  ISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSP-VKKKANLK 1060
            ++   ++ +     KR+ ++++SD+K    ++   GSSP ++KK NLK
Sbjct: 296  LAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLK 343


>gb|EYU40658.1| hypothetical protein MIMGU_mgv1a008176mg [Mimulus guttatus]
          Length = 382

 Score =  397 bits (1021), Expect = e-108
 Identities = 211/385 (54%), Positives = 260/385 (67%), Gaps = 22/385 (5%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGRARC+LR++D  RAC +D    R+  MD YRP  NV+PG ++PVVRR++     G V
Sbjct: 1    MCGRARCTLRSDDFRRACHLDGRPVRHQNMDRYRPSHNVAPGFNVPVVRRDDEGDGGGAV 60

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            LHCMKWGLIPSFTKKT+K DH++MFNARSESI EKASFRRL+P NRCLV+VEG+YEWKKD
Sbjct: 61   LHCMKWGLIPSFTKKTEKIDHFRMFNARSESIREKASFRRLLPKNRCLVSVEGFYEWKKD 120

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
            GSRKQPYYIH KD +PLVFAAL+DSW N+EGE+LYTFTI TTS SS+L+WLHDRMPVIL 
Sbjct: 121  GSRKQPYYIHFKDGRPLVFAALFDSWENAEGEILYTFTICTTSSSSSLEWLHDRMPVILR 180

Query: 560  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
            NK S D WLN S     + ILKPYED DL WYPVT AMGK SFDGP+CIKE+  KTEE  
Sbjct: 181  NKESTDCWLNDSSLSNFDKILKPYEDEDLAWYPVTSAMGKLSFDGPECIKEV--KTEESK 238

Query: 740  LISKFFSKK------RTSNEQEIIPQIENSSE--------ESAPTKPSKNLKXXXXXXXL 877
             IS+FFSKK      + + E+  + ++  +SE        ES PT  S  LK        
Sbjct: 239  TISQFFSKKVANASQKPNLEKSPVKELAEASEAISVKEEHESQPTLDSTRLKDEDIENYE 298

Query: 878  PIDSEE-----GNENPKFIISPVLNEEAEKCGA-KREY--KELTSDVKPFNHNFKMQGSS 1033
                +E      ++ PK II     E      + +++Y  + L +  KPF    + Q   
Sbjct: 299  QKSVQEEPEISQDDCPKLIIKKDDAENTSNISSIEKQYTGEMLRAHAKPFAKENEKQNVG 358

Query: 1034 PVKKKANLKNAGEKQPTLFSYFGKS 1108
            P +K++   N  ++QPTLFSYFG+S
Sbjct: 359  PARKRSKTAN-DKQQPTLFSYFGRS 382


>emb|CAN82703.1| hypothetical protein VITISV_026469 [Vitis vinifera]
          Length = 370

 Score =  394 bits (1011), Expect = e-107
 Identities = 212/395 (53%), Positives = 262/395 (66%), Gaps = 32/395 (8%)
 Frame = +2

Query: 20   MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
            MCGRARC+LR ++I RAC ++   ++N+QMD YRP +NVSPG+++PVVRR  G+  +  +
Sbjct: 1    MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEEAI 60

Query: 200  LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
            +HCMKWGL+PSFTKK++KPDHYKMFNARSES+CEKASFRRLVP NRCLVAVEG+YEWKKD
Sbjct: 61   VHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVAVEGFYEWKKD 120

Query: 380  GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
            GS+KQPYYIHLKD +PLVFAAL+DSW NSE                      DRMPVILG
Sbjct: 121  GSKKQPYYIHLKDGRPLVFAALFDSWANSE----------------------DRMPVILG 158

Query: 560  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
            +K S DAWLNGS S + N++LKPYED DLVWYPVT AMGKPSF+GP+CIKEI LK E++ 
Sbjct: 159  DKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIKEIQLKNEQRP 218

Query: 740  LISKFFSKKRTSNEQEII---------------PQIENSS---------EESAPTKPSKN 847
             ISKFFS K   NEQ +                P IENS+         +  +    S  
Sbjct: 219  -ISKFFSTKGIKNEQGLSNEPVKSNLPQSMKEEPAIENSTGLPSSAVKGDHDSTCSRSVP 277

Query: 848  LKXXXXXXXLP--IDSEEGNENPKFIISP-----VLNEEAEKCGAKREYKELTSDVKPFN 1006
             +       LP  +  E   E+   +  P       +EEA K   KR+++E ++D KP  
Sbjct: 278  QEESTWFTNLPKSLKQEPETEDKTGLPFPGDHDSKCDEEATKLPIKRDFEEFSADSKP-- 335

Query: 1007 HNFKMQGSSPVKKKANL-KNAGEKQPTLFSYFGKS 1108
            +   ++  SPV KK  L KNAG+KQPTLFSYFGKS
Sbjct: 336  NTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGKS 370


>ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [Amborella trichopoda]
            gi|548853962|gb|ERN11922.1| hypothetical protein
            AMTR_s00020p00243160 [Amborella trichopoda]
          Length = 413

 Score =  391 bits (1005), Expect = e-106
 Identities = 210/378 (55%), Positives = 256/378 (67%), Gaps = 13/378 (3%)
 Frame = +2

Query: 14   ERMCGRARCSLR-AEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQ 190
            ++MCGRARC+L   ED+PRACG  N +   L    YR  +N++PG+++PV+R+E+ S   
Sbjct: 38   KKMCGRARCTLNPVEDVPRACGF-NANLPTLHTQRYRLSYNIAPGAYLPVLRKEQES-KH 95

Query: 191  GTVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEW 370
            G V+HCMKWGL+PSFTKKT+KPDH+KMFNARSESI EKASFRRLVPN RCLV VEG+YEW
Sbjct: 96   GYVVHCMKWGLVPSFTKKTEKPDHFKMFNARSESIQEKASFRRLVPNKRCLVVVEGFYEW 155

Query: 371  KKDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPV 550
            KKDGS+KQPYY+H +D + LVFA LYD+W NSEGE LYTFTILTT CSSAL WLHDRMPV
Sbjct: 156  KKDGSKKQPYYLHFRDGRALVFAGLYDTWENSEGEGLYTFTILTTRCSSALDWLHDRMPV 215

Query: 551  ILGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTE 730
            ILGNK +IDAWLN + SPK +S+L+PYE SDLVWYPVTPAMGK  F GP+CIKEI LK+E
Sbjct: 216  ILGNKEAIDAWLNITPSPKVDSLLQPYEGSDLVWYPVTPAMGKIFFAGPECIKEIQLKSE 275

Query: 731  EKNLISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLK--XXXXXXXLPIDSEEGNE 904
             KN ISK F +     +    P I  ++E+S      +N +          PID  +   
Sbjct: 276  NKNTISKLFMQSHNKKQPISEPSIRKAAEDSTHGHTFENSQEPSNTNEDWEPIDDFKVCI 335

Query: 905  NPKFIISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSPVKKKANL-----KN-- 1063
              K   SP   EE EK   KR+ ++L  D K      K    S  +++  +     KN  
Sbjct: 336  GIKREASPGNAEETEKRRTKRDIEQLLVDPKKETIVGKENPISGEERQGYMDRGSHKNGM 395

Query: 1064 ---AGEKQPTLFSYFGKS 1108
                G KQ  LFSYFGKS
Sbjct: 396  PRITGGKQANLFSYFGKS 413


>ref|XP_004165094.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cucumis sativus]
          Length = 267

 Score =  382 bits (982), Expect = e-103
 Identities = 182/251 (72%), Positives = 204/251 (81%)
 Frame = +2

Query: 20  MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 199
           MCGRARC+LRA+DI RAC    G  R+L MD +RP FN SPGS +PVVRR++ S   G V
Sbjct: 1   MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 200 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 379
           L CMKWGLIPSFT+K +KP+++KMFNARSESI EKASF RLVP  RCLVAVEG+YEWKKD
Sbjct: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 380 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 559
           GS+KQPYYIH KD QPL  AALYD W N EGE+LYTFTILTTS S AL+WLHDRMPVILG
Sbjct: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180

Query: 560 NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 739
           +K  +D WLN S S K +S+LKPYE  DLVWYPVTP+MGKPSFDGPDCIKEI LK +  N
Sbjct: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240

Query: 740 LISKFFSKKRT 772
           LISKFFS K T
Sbjct: 241 LISKFFSAKET 251


>ref|XP_007140736.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris]
            gi|561013869|gb|ESW12730.1| hypothetical protein
            PHAVU_008G137400g [Phaseolus vulgaris]
          Length = 309

 Score =  376 bits (966), Expect = e-101
 Identities = 197/318 (61%), Positives = 233/318 (73%)
 Frame = +2

Query: 152  IPVVRREEGSGAQGTVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPN 331
            +PVVRREE S + G VLH MKWGLIPSFTKKT+KPDHYKMFNARSESI EKASFRRL+P 
Sbjct: 1    MPVVRREEASDSGGYVLHSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPK 60

Query: 332  NRCLVAVEGYYEWKKDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSC 511
            +RCLVAVEG+YEWKKDGS+KQPYYIH KD + LVFAALYDSW NSEGE L+TFTI+TTS 
Sbjct: 61   SRCLVAVEGFYEWKKDGSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSS 120

Query: 512  SSALQWLHDRMPVILGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFD 691
            SSALQWLHDRMPVILG+K S D WL+ S S    S++KPYE+SDLVWYPVT AMGK SFD
Sbjct: 121  SSALQWLHDRMPVILGSKESTDTWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKTSFD 179

Query: 692  GPDCIKEIHLKTEEKNLISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXX 871
            GP+CIKEI +K E    IS FFSKK  +  ++  P+ + SS E   T+P+++L       
Sbjct: 180  GPECIKEIQVKAEGNTSISMFFSKK-GAESKDTKPEQKLSSHEFVKTEPTEDLIEG---- 234

Query: 872  XLPIDSEEGNENPKFIISPVLNEEAEKCGAKREYKELTSDVKPFNHNFKMQGSSPVKKKA 1051
                 +EEG+ + KF  S   ++ A     KREY+  ++D KP   N     S+P KKK 
Sbjct: 235  ---AKAEEGDNDLKFSGSS-HSKNASTLPIKREYETFSADSKPALANHDQISSNPAKKKE 290

Query: 1052 NLKNAGEKQPTLFSYFGK 1105
              K A +KQPTLFSYFGK
Sbjct: 291  KTKTANDKQPTLFSYFGK 308


>ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana]
           gi|26449484|dbj|BAC41868.1| unknown protein [Arabidopsis
           thaliana] gi|29028900|gb|AAO64829.1| At2g26470
           [Arabidopsis thaliana] gi|330252748|gb|AEC07842.1|
           uncharacterized protein AT2G26470 [Arabidopsis thaliana]
          Length = 487

 Score =  366 bits (940), Expect = 1e-98
 Identities = 170/256 (66%), Positives = 208/256 (81%), Gaps = 1/256 (0%)
 Frame = +2

Query: 20  MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRRE-EGSGAQGT 196
           MCGR RC+LR +D+PRA       +R L +D YRP +NV+PGS+IPV+RR+ E     G 
Sbjct: 1   MCGRTRCTLRPDDVPRASHRHTVPTRFLHLDRYRPSYNVAPGSYIPVLRRDNEEVVGDGV 60

Query: 197 VLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKK 376
           V+HCMKWGL+PSFTKKTDKPD +KMFNARSES+ EKASFRRL+P NRCLVAV+G+YEWKK
Sbjct: 61  VVHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVAVDGFYEWKK 120

Query: 377 DGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVIL 556
           +GS+KQPYYIH +D +PLVFAAL+D+W NS GE LYTFTILTT+ SSALQWLHDRMPVIL
Sbjct: 121 EGSKKQPYYIHFEDGRPLVFAALFDTWQNSGGETLYTFTILTTASSSALQWLHDRMPVIL 180

Query: 557 GNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEK 736
           G+K SID WL+   + K   +L PYE SDLVWYPVT A+GKP+FDGP+CI++I LKT + 
Sbjct: 181 GDKDSIDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQQIPLKTSQN 240

Query: 737 NLISKFFSKKRTSNEQ 784
           +LISKFFS K+   ++
Sbjct: 241 SLISKFFSTKQPKTDE 256


Top