BLASTX nr result

ID: Akebia22_contig00007847 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00007847
         (1466 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog i...   451   e-124
ref|XP_007049611.1| Uncharacterized protein TCM_002685 [Theobrom...   447   e-123
ref|XP_007199067.1| hypothetical protein PRUPE_ppa018685mg [Prun...   447   e-123
ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   445   e-122
ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago ...   441   e-121
ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific 5-hy...   438   e-120
ref|XP_007140735.1| hypothetical protein PHAVU_008G137400g [Phas...   437   e-120
ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Popu...   433   e-119
gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]     430   e-118
ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hy...   429   e-117
ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   425   e-116
ref|XP_002527247.1| conserved hypothetical protein [Ricinus comm...   412   e-112
ref|XP_004290142.1| PREDICTED: UPF0361 protein C3orf37 homolog i...   407   e-111
ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 prot...   404   e-110
gb|EYU40658.1| hypothetical protein MIMGU_mgv1a008176mg [Mimulus...   399   e-108
emb|CAN82703.1| hypothetical protein VITISV_026469 [Vitis vinifera]   395   e-107
ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [A...   393   e-106
ref|XP_004165094.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   382   e-103
ref|XP_007140736.1| hypothetical protein PHAVU_008G137400g [Phas...   378   e-102
ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana] ...   366   1e-98

>ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 1 [Fragaria vesca
            subsp. vesca]
          Length = 366

 Score =  451 bits (1159), Expect = e-124
 Identities = 226/367 (61%), Positives = 276/367 (75%), Gaps = 4/367 (1%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGS-GAQGT 1271
            MCGRARC+LRA+DI RAC  ++G  R++ MD Y+PR+NVSPG+++PVVRR +G+ G  G 
Sbjct: 1    MCGRARCTLRADDISRACYRNHGPVRSVNMDRYQPRYNVSPGANLPVVRRGDGADGEDGV 60

Query: 1270 VLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKK 1091
            VLHCMKWGLIPSFTKKT+KPDHY+MFNARSESICEKASFRRLVP +RC+VAVEG+YEWKK
Sbjct: 61   VLHCMKWGLIPSFTKKTEKPDHYRMFNARSESICEKASFRRLVPKSRCVVAVEGFYEWKK 120

Query: 1090 DGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVIL 911
            DGS+KQPYY+H KD +PL+FAALYDSW NSEGE LYTFTI+TTS SSAL WLHDRMPV+L
Sbjct: 121  DGSKKQPYYVHFKDGRPLLFAALYDSWENSEGEKLYTFTIITTSSSSALGWLHDRMPVVL 180

Query: 910  GNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEK 731
            G+K S+D WL+GS +   + +LKPYE  DLVWYPVTPAMGK SFDGP+C  EI LKT+  
Sbjct: 181  GDKESVDTWLDGSSASNFDKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSNEIKLKTDGT 240

Query: 730  NLISKFFSKKRTSNEQEIIPQIENSSEESAPTK--PSKNLKXXXXXXELPIDSEEGNENP 557
            N I+KFFS K T  E EI P+  +  + S  T+   S N +      ++   S    E+ 
Sbjct: 241  NSITKFFSTKGTKKE-EINPKDTSLHDSSVKTEFPESLNEEPETKEEKVQPSSTVKCEDS 299

Query: 556  KFIISPVLNEEAEKCGAKREYEELTSDVKPF-NHNFKMQGSSPVKKKANLKNAGEKQPTL 380
            K  +S +  E+A K   KR+YEE  +D KP  N + K   +SP KKK NLK + +KQPTL
Sbjct: 300  KSSVSILSQEDASKEQTKRDYEEFLADSKPLPNESDKKSSASPAKKKVNLKTSHDKQPTL 359

Query: 379  FSYFGKS 359
            FSYF KS
Sbjct: 360  FSYFRKS 366


>ref|XP_007049611.1| Uncharacterized protein TCM_002685 [Theobroma cacao]
            gi|508701872|gb|EOX93768.1| Uncharacterized protein
            TCM_002685 [Theobroma cacao]
          Length = 360

 Score =  447 bits (1151), Expect = e-123
 Identities = 227/368 (61%), Positives = 276/368 (75%), Gaps = 6/368 (1%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGS-GAQGT 1271
            MCGRARC+LRA+DIPRA   ++G  R++ MD YRP +NV PG ++PVVRR++GS G  G 
Sbjct: 1    MCGRARCTLRADDIPRASHRNDGPVRHVHMDRYRPSYNVGPGMNLPVVRRDDGSNGDGGV 60

Query: 1270 VLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKK 1091
            VLHCMKWGLIPSFTKKTDKPD YKMFNARSES+CEKASFRRL+P +RCLVAVEG+YEWKK
Sbjct: 61   VLHCMKWGLIPSFTKKTDKPDFYKMFNARSESVCEKASFRRLLPKSRCLVAVEGFYEWKK 120

Query: 1090 DGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVIL 911
            DGS+KQPYYIH KD +PLVFAALYD W NSEGE LYTFTILTT+ SSA  WLHDRMPVIL
Sbjct: 121  DGSKKQPYYIHFKDGRPLVFAALYDCWENSEGEKLYTFTILTTASSSAFLWLHDRMPVIL 180

Query: 910  GNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEK 731
            G+K S D WLNG+   K +++LKPYE+ DLVWYPVT A+GK SF+GP+C+KE+ LKT+EK
Sbjct: 181  GDKESTDTWLNGT---KIDTLLKPYENPDLVWYPVTSAIGKLSFEGPECVKEVPLKTQEK 237

Query: 730  NLISKFFSKKRTSNEQEIIPQIENS-SEESAPTKPSKNLK---XXXXXXELP-IDSEEGN 566
            N ISKFFS +    EQE    +E S  +ES  T   KNLK         E+P + S+E N
Sbjct: 238  NPISKFFSTREVKREQE--SNMEKSLCDESVQTNLLKNLKEEPNSPEDKEIPSLASKEDN 295

Query: 565  ENPKFIISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQP 386
            ++   ++ P   E+  KC  KR+YEE ++D KP     ++   SP +KK N+K    KQP
Sbjct: 296  DSKSSVLVPTC-EDVRKCQTKRDYEEFSADTKPAKDEIEV---SPARKKGNIKGVAGKQP 351

Query: 385  TLFSYFGK 362
            TLF+YFGK
Sbjct: 352  TLFAYFGK 359


>ref|XP_007199067.1| hypothetical protein PRUPE_ppa018685mg [Prunus persica]
            gi|462394467|gb|EMJ00266.1| hypothetical protein
            PRUPE_ppa018685mg [Prunus persica]
          Length = 363

 Score =  447 bits (1151), Expect = e-123
 Identities = 222/366 (60%), Positives = 266/366 (72%), Gaps = 3/366 (0%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LRA+DIPRAC   +G  R + MD +RP FN SPGS++PVVRRE+G    G V
Sbjct: 1    MCGRARCTLRADDIPRACHRSHGPVRTVNMDRFRPLFNASPGSNLPVVRREDGGDGDGVV 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            +HCMKWGLIPSFTKKT+KPDHYKMFNARSESICEKASFRRL+P NRCL+AVEG+YEWKKD
Sbjct: 61   VHCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNRCLIAVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            GS+KQPYY+H  D +PL+FAALYD W NSEGE LYTFTI+TTS SSAL WLHDRMPVILG
Sbjct: 121  GSKKQPYYVHFNDGRPLLFAALYDFWENSEGEKLYTFTIITTSSSSALGWLHDRMPVILG 180

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            +K S D+WL+GS +   +S+LKPYE  DLVWYPVT AMGK SFDGP+CI EI LKTE  N
Sbjct: 181  DKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTQAMGKVSFDGPECINEIQLKTEGNN 240

Query: 727  LISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLK---XXXXXXELPIDSEEGNENP 557
             I+KFF  K T  E E+ P+  +  + S      K++K         E P  +E+   + 
Sbjct: 241  SITKFFMSKGTKKE-ELNPKDTSFYDSSVKNDLPKSVKEEPEGKEKTEQPASTEKCENDS 299

Query: 556  KFIISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLF 377
            K     +  E   K   KR+YEE ++D KP  +      +SP KKK N K++ +KQPTLF
Sbjct: 300  KG--QTISQEGVSKGQTKRDYEEFSADSKPVAYETSEMSASPAKKKVNPKSSVDKQPTLF 357

Query: 376  SYFGKS 359
            SYFGKS
Sbjct: 358  SYFGKS 363


>ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [Vitis vinifera]
            gi|296090568|emb|CBI40918.3| unnamed protein product
            [Vitis vinifera]
          Length = 392

 Score =  445 bits (1144), Expect = e-122
 Identities = 232/397 (58%), Positives = 281/397 (70%), Gaps = 34/397 (8%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LR ++I RAC ++   ++N+QMD YRP +NVSPG+++PVVRR  G+  +  +
Sbjct: 1    MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEEAI 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            +HCMKWGL+PSFTKK++KPDHYKMFNARSES+CEKASFRRLVP NRCLVAVEG+YEWKKD
Sbjct: 61   VHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVAVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            GS+KQPYYIHLKD +PLVFAAL+DSW NSEGE+LYT TILTTS SSALQWLHDRMPVILG
Sbjct: 121  GSKKQPYYIHLKDGRPLVFAALFDSWANSEGEILYTCTILTTSSSSALQWLHDRMPVILG 180

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            +K S DAWLNGS S + N++LKPYED DLVWYPVT AMGKPSF+GP+CIKEI LK E++ 
Sbjct: 181  DKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIKEIQLKNEQRP 240

Query: 727  LISKFFSKKRTSNEQEII---------------PQIENSSEESAPTKPSKNLKXXXXXXE 593
             ISKFFS K   NEQ +                P IENS+    P+   K          
Sbjct: 241  -ISKFFSTKGIKNEQGLSNEPVKSNLPQSLKEEPAIENST--GLPSSTVKGDHDSTCSRS 297

Query: 592  LP-------------IDSEEGNENPKFIISP-----VLNEEAEKCGAKREYEELTSDVKP 467
            +P             +  E   E+   +  P       +EEA K   KR++EE ++D KP
Sbjct: 298  IPQEESTWFTNLPKSLKQEPETEDKTGLPFPGDHDSKCDEEATKLPIKRDFEEFSADSKP 357

Query: 466  FNHNFKMQGSSPVKKKANL-KNAGEKQPTLFSYFGKS 359
              +   ++  SPV KK  L KNAG+KQPTLFSYFGKS
Sbjct: 358  --NTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGKS 392


>ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago truncatula]
            gi|355497798|gb|AES79001.1| hypothetical protein
            MTR_7g052250 [Medicago truncatula]
          Length = 354

 Score =  441 bits (1133), Expect = e-121
 Identities = 228/365 (62%), Positives = 271/365 (74%), Gaps = 3/365 (0%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQ--G 1274
            MCGR RCSLRA+D+PRAC      SR L +D YRP  NVSPG +IPVVRRE+ + A+  G
Sbjct: 1    MCGRTRCSLRADDVPRACHRTTAPSRLLHIDRYRPSNNVSPGFNIPVVRREDNASAESDG 60

Query: 1273 TVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWK 1094
             V+HCMKWGLIPSFTKKTDKPDHYKMFNARSESI EKASFRRL+P NRCLVAVEG+YEWK
Sbjct: 61   HVVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAVEGFYEWK 120

Query: 1093 KDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVI 914
            KDGS+KQPYYIH KD +PLVFAALYDSW NSEGE+LYTFTI+TTS SSA +WLHDRMPVI
Sbjct: 121  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTSSSSAFKWLHDRMPVI 180

Query: 913  LGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEE 734
            LG+K + D WL+ + S K  S++KPYE+SDLVWYPVTPAMGKPSFDGP+CIKEI +KTE 
Sbjct: 181  LGDKDTTDTWLSSASSFK--SVMKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQIKTEG 238

Query: 733  KNLISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXELPIDSEEGNENPK 554
               ISKFFSKK    E +  P+ +  S E   T+ +K++            +EEG+ + K
Sbjct: 239  YIPISKFFSKKEAEVE-DTKPEHKILSHEPVKTEQTKDVSE-------EAKTEEGDTDLK 290

Query: 553  FI-ISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLF 377
               ISP  ++   +   KREY+ ++SD KP   N     ++P KKK   K A +KQPTLF
Sbjct: 291  SSGISP--SQNVNRFAIKREYDAISSDSKPSLANNDQVSANPAKKKEKAKTADDKQPTLF 348

Query: 376  SYFGK 362
            SYFGK
Sbjct: 349  SYFGK 353


>ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like isoform X1
            [Citrus sinensis]
          Length = 398

 Score =  438 bits (1126), Expect = e-120
 Identities = 224/399 (56%), Positives = 275/399 (68%), Gaps = 37/399 (9%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LRA+D+PRAC      +R L MD YRP +NV+PG ++PVVRR++    +G V
Sbjct: 1    MCGRARCTLRADDLPRACHRTGSPARTLNMDRYRPSYNVAPGWNLPVVRRDDDG--EGFV 58

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            LHCMKWGLIPSFTKK +KPD YKMFNARSES+ EKASFRRL+P +RCL AVEG+YEWKKD
Sbjct: 59   LHCMKWGLIPSFTKKNEKPDFYKMFNARSESVTEKASFRRLLPKSRCLAAVEGFYEWKKD 118

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            GS+KQPYY+H KD +PLVFAALYD+W +SEGE+LYTFTILTTS S+ALQWLHDRMPVILG
Sbjct: 119  GSKKQPYYVHFKDGRPLVFAALYDTWQSSEGEILYTFTILTTSSSAALQWLHDRMPVILG 178

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            +K S DAWLNGS S K ++ILKPYE+SDLVWYPVTP MGK SF+GP+CIKEI LKTE KN
Sbjct: 179  DKESSDAWLNGSSSSKYDTILKPYEESDLVWYPVTPVMGKLSFNGPECIKEIPLKTEGKN 238

Query: 727  LISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXEL--PIDS-------- 578
             IS FF KK    EQE     ++S +ES  T   K +K          P+          
Sbjct: 239  PISNFFLKKEIKKEQESKMDEKSSFDESVKTNLPKRMKGEPIKEIKEEPVSGLEEKYSFD 298

Query: 577  ---------------------------EEGNENPKFIISPVLNEEAEKCGAKREYEELTS 479
                                       E+G+ + K + S + +E+ +K   KR+Y+E  +
Sbjct: 299  TTAQTNLPKSVKDEAVTADDIRTQSSVEKGDPDTKSVASVLSDEDTKKELQKRDYKEFLA 358

Query: 478  DVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLFSYFGK 362
            D KP         +SP+K+K N+K+AGEKQPTLFSY+ K
Sbjct: 359  DSKPVIDGNNKLETSPLKRKGNVKDAGEKQPTLFSYYSK 397


>ref|XP_007140735.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris]
            gi|561013868|gb|ESW12729.1| hypothetical protein
            PHAVU_008G137400g [Phaseolus vulgaris]
          Length = 353

 Score =  437 bits (1123), Expect = e-120
 Identities = 224/362 (61%), Positives = 267/362 (73%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGR RC+LR++D+PRAC   +  +R L MD YRP +NVSPGS++PVVRREE S + G V
Sbjct: 1    MCGRTRCTLRSDDVPRACHRSDAPTRTLHMDRYRPAYNVSPGSNMPVVRREEASDSGGYV 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            LH MKWGLIPSFTKKT+KPDHYKMFNARSESI EKASFRRL+P +RCLVAVEG+YEWKKD
Sbjct: 61   LHSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            GS+KQPYYIH KD + LVFAALYDSW NSEGE L+TFTI+TTS SSALQWLHDRMPVILG
Sbjct: 121  GSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSSSSALQWLHDRMPVILG 180

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            +K S D WL+ S S    S++KPYE+SDLVWYPVT AMGK SFDGP+CIKEI +K E   
Sbjct: 181  SKESTDTWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKTSFDGPECIKEIQVKAEGNT 239

Query: 727  LISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXELPIDSEEGNENPKFI 548
             IS FFSKK  +  ++  P+ + SS E   T+P+++L            +EEG+ + KF 
Sbjct: 240  SISMFFSKK-GAESKDTKPEQKLSSHEFVKTEPTEDLIEG-------AKAEEGDNDLKFS 291

Query: 547  ISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLFSYF 368
             S   ++ A     KREYE  ++D KP   N     S+P KKK   K A +KQPTLFSYF
Sbjct: 292  GSS-HSKNASTLPIKREYETFSADSKPALANHDQISSNPAKKKEKTKTANDKQPTLFSYF 350

Query: 367  GK 362
            GK
Sbjct: 351  GK 352


>ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa]
            gi|222844806|gb|EEE82353.1| hypothetical protein
            POPTR_0002s25190g [Populus trichocarpa]
          Length = 367

 Score =  433 bits (1114), Expect = e-119
 Identities = 218/368 (59%), Positives = 263/368 (71%), Gaps = 6/368 (1%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEG------S 1286
            MCGRARC+LRA+DIPRAC  +  + R++ MD YRP +N SPGS++ VVRR++       S
Sbjct: 1    MCGRARCTLRADDIPRACHRNTATVRSVNMDRYRPSYNASPGSNLAVVRRDDAASGDGAS 60

Query: 1285 GAQGTVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGY 1106
            G  G  +HCMKWGLIP FTKK++KPD YKMFNARSES+ EKASFRRL+P +RCLVAVEG+
Sbjct: 61   GGDGYAIHCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCLVAVEGF 120

Query: 1105 YEWKKDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDR 926
            YEWKKDGS+KQPYYIH KD +PLVFAALYDSW NSEGE+LYTFTI+TT+ SSA+QWLH+R
Sbjct: 121  YEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAIQWLHER 180

Query: 925  MPVILGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHL 746
            MPVILG+K + D WL+ S + K +++LKPYE SDLVWYPVTPAMGKPSFDGP+CIKEIHL
Sbjct: 181  MPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPECIKEIHL 240

Query: 745  KTEEKNLISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXELPIDSEEGN 566
            K EEK  ISKFFS+K    E          S +  P K  K         E P  ++  +
Sbjct: 241  KMEEKGTISKFFSRKEFKEESNPEESTHGKSLKLEP-KSVKEENESEEKLETPCSAKTVD 299

Query: 565  ENPKFIISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQP 386
             + K  +    +E   KC  KR+ EEL  D K          +SP KKKANLK+  +KQP
Sbjct: 300  YDLKSELETFSHEGETKCKTKRDREELV-DSKLKTDEIVKPRASPAKKKANLKSVDDKQP 358

Query: 385  TLFSYFGK 362
            TL SYFGK
Sbjct: 359  TLLSYFGK 366


>gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]
          Length = 469

 Score =  430 bits (1106), Expect = e-118
 Identities = 224/367 (61%), Positives = 263/367 (71%), Gaps = 12/367 (3%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LRA+D+PRAC  +NGS R + MD YRP +NVSPGS+IPVVRRE+GS  +G V
Sbjct: 1    MCGRARCTLRADDVPRACHRNNGSVRTVNMDRYRPSYNVSPGSNIPVVRREDGSDGEGFV 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            +HCMKWGLIPSFTKKTDKPDHYKMFNARSESI EK SFRRL+P +RCLVAVEG+YEWKKD
Sbjct: 61   VHCMKWGLIPSFTKKTDKPDHYKMFNARSESIGEKVSFRRLIPKSRCLVAVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGN--------SEGEMLYTFTILTTSCSSALQWLH 932
            GS+KQPYYIH KD +PLVFAALYDSW N          GE+LYTFTILT S SSAL WLH
Sbjct: 121  GSKKQPYYIHFKDGRPLVFAALYDSWENYLVTAIVIPAGEILYTFTILTISSSSALGWLH 180

Query: 931  DRMPVILGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEI 752
            DRMPVI G+K S DAWL GS S K  ++LKPYED DLVWYPVTPAMGKPSFDGP+CI E+
Sbjct: 181  DRMPVIFGDKESSDAWLTGS-SSKVGALLKPYEDPDLVWYPVTPAMGKPSFDGPECI-EM 238

Query: 751  HLKTEEKNLISKFFSKKRTSNEQEIIPQIENSSEESA---PTKPSKNLKXXXXXXELPID 581
             LK +    ISKFFS K T  E ++ P+  +S  +SA     KP                
Sbjct: 239  KLKADGNIPISKFFSAKGTKKEADLNPEESSSKVDSAKCLEEKPESKANRGPFS-----S 293

Query: 580  SEEGNENPKFIISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSPVKKKANLKNA 401
            +E+G  + K  +S      AEKC  KR++E+L++D K      K    SP +KK  LK+A
Sbjct: 294  TEKGEADSKSSVSSFSQGGAEKCQIKRDHEKLSADSKSNTDETKKLFDSPGRKKVKLKSA 353

Query: 400  GE-KQPT 383
            G+ KQPT
Sbjct: 354  GDYKQPT 360


>ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like [Glycine
            max]
          Length = 382

 Score =  429 bits (1103), Expect = e-117
 Identities = 225/392 (57%), Positives = 271/392 (69%), Gaps = 26/392 (6%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LRA+D+PRAC      +R L +D YRP +NVSPG  +PVVRR++ SG +G V
Sbjct: 1    MCGRARCTLRADDVPRACHRSTSPTRTLHIDRYRPAYNVSPGFDVPVVRRDDASGGEGYV 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            L CMKWGLIPSFTKKT+KPDHY+MFNARSESI EKASFRRL+P +RCLVAVEG+YEWKKD
Sbjct: 61   LQCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            GS+KQPYYIH KD +PLVFAALYDSW NSEGE LYTFTI+TTS SSALQWLHDRMPVILG
Sbjct: 121  GSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSALQWLHDRMPVILG 180

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            +K S D WL+ S S    S++KPYE+SDLVWYPVT AMGK SFDGP+CIKEI +K +   
Sbjct: 181  SKESTDIWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKASFDGPECIKEIQVKAQGNT 239

Query: 727  LISKFFSKK-------------------RTSNEQEII------PQIENSSEESAPTKPSK 623
             IS FFSKK                   +T + +++       P+ + SS E   T+P++
Sbjct: 240  SISMFFSKKGDESKDTKPEQKASCPEVVKTEHTEDLTESKDTKPEQKTSSHEFVKTEPTE 299

Query: 622  NLKXXXXXXELPIDSEEGNENPKFIISPVLNEEAEKCGAKREYEELT-SDVKPFNHNFKM 446
            +L+           +EEG  + KF  S   ++       KREYE  + +D KP   N   
Sbjct: 300  DLRER-------AKTEEGGNDLKFHGSS-HSQNVSMLPIKREYETFSAADSKPALANHDQ 351

Query: 445  QGSSPVKKKANLKNAGEKQPTLFSYFGKS*NH 350
               +P KKK   K A +KQPTLFSYFGKS NH
Sbjct: 352  ISPNPAKKKEKAKTANDKQPTLFSYFGKS-NH 382


>ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cicer arietinum]
          Length = 375

 Score =  425 bits (1092), Expect = e-116
 Identities = 223/366 (60%), Positives = 260/366 (71%), Gaps = 2/366 (0%)
 Frame = -2

Query: 1453 ERMCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQG 1274
            + MCGR RC+LR +DIP AC      +R L +D YRP  NVSPG H+PVVRRE+ S ++G
Sbjct: 19   DEMCGRGRCTLRPDDIPTACHRTTAPTRLLHVDRYRPSHNVSPGFHMPVVRREDASESEG 78

Query: 1273 TVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWK 1094
             VLHCMKWGLIPSFTKKT+KPDHY+MFNARSESI EKASFRRL+P NRCLVAVEG+YEWK
Sbjct: 79   HVLHCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKNRCLVAVEGFYEWK 138

Query: 1093 KDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVI 914
            KDGS+KQPYYIH KD +PLVFAALYDSW NSEGE LYTFTI+TTS SS LQWLHDRMPVI
Sbjct: 139  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSTLQWLHDRMPVI 198

Query: 913  LGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEE 734
            L +K S D WLN + S K  S+LKPYE+ DL WYPVTPAMGKPSFDGP+CIKEI +K E 
Sbjct: 199  LSDKDSTDTWLNSASSFK--SVLKPYEECDLAWYPVTPAMGKPSFDGPECIKEIQVKAEG 256

Query: 733  KNLISKFFSKKRTSNEQ-EIIPQIENSSEESAPTKPSKNLKXXXXXXELPIDSEEGNENP 557
               ISKFFS+K    E  +   +I +   E  P K  +  K           +EEG  + 
Sbjct: 257  NIPISKFFSRKGGEGEDTKSGHKILSLCHE--PVKTEQTTKDLSEG----AKTEEGESDL 310

Query: 556  KFIISPVLNEEAEKCGAKREYEELTSDVKP-FNHNFKMQGSSPVKKKANLKNAGEKQPTL 380
            K   S   N    K   KREY+ ++SD KP    N ++  + P KKK   KNA +KQPTL
Sbjct: 311  KSSGSSPQN--VTKFTVKREYDAISSDSKPSLGINDQVIANPPTKKKEKAKNADDKQPTL 368

Query: 379  FSYFGK 362
            FS+FGK
Sbjct: 369  FSFFGK 374


>ref|XP_002527247.1| conserved hypothetical protein [Ricinus communis]
            gi|223533340|gb|EEF35091.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 409

 Score =  412 bits (1060), Expect = e-112
 Identities = 222/408 (54%), Positives = 268/408 (65%), Gaps = 46/408 (11%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRRE-EGS-GAQG 1274
            MCGRARC+LRA+DIPRAC    G  R++ MD +RP +NVSPGS++PVV RE +GS G  G
Sbjct: 1    MCGRARCTLRADDIPRACHRTTGPVRSVNMDRWRPSYNVSPGSNMPVVCREGDGSDGGDG 60

Query: 1273 TVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWK 1094
              + CM WGLIPSFTKKT+KPD YKMFNARSES+ EKASFRRL+P +RCLVA EG+YEWK
Sbjct: 61   FFVQCMTWGLIPSFTKKTEKPDFYKMFNARSESVGEKASFRRLLPKSRCLVAAEGFYEWK 120

Query: 1093 KDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVI 914
            KDGS+KQPYYIH KD +PLVFAALYDSW NSEGE+LYTFTILTTS SSAL+WLHDRMPVI
Sbjct: 121  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTILTTSSSSALEWLHDRMPVI 180

Query: 913  LGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEE 734
            LG+K S D WLNGS S K + +L+ YE SDLVW PVTPAMGK SFDGP+C+KEIH+KTE 
Sbjct: 181  LGDKESTDTWLNGSSSSKYDVVLESYESSDLVWCPVTPAMGKSSFDGPECVKEIHVKTES 240

Query: 733  KNLISKFFSKKRTSNEQEI--------------IPQI---ENSSEESAPTKPSKNLK--- 614
            K+ ISKFFS+K    EQE+              +P+    E  SEE     PS  +    
Sbjct: 241  KSTISKFFSRKEIKGEQELNSRESTFDKSVKMDLPESVKEEYESEEKLDIPPSNQINDQD 300

Query: 613  XXXXXXELPIDSEEGNENPKF------------------------IISPVLNEEAEKCGA 506
                   +P + E   + P                           +S + +E+A     
Sbjct: 301  LKSNVSTIPCEDETKCQIPDHDETKCQIPDHDETKCQIPDHDLISNVSKLPHEDATLGQP 360

Query: 505  KREYEELTSDVKPFNHNFKMQGSSPVKKKANLKNAGEKQPTLFSYFGK 362
            KR +EE   D +      +    +P +KKANLK+ G+KQPTL SYF K
Sbjct: 361  KRHHEEALIDRELNPDGNEKLRRNPARKKANLKSGGDKQPTLLSYFRK 408


>ref|XP_004290142.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 2 [Fragaria vesca
            subsp. vesca]
          Length = 348

 Score =  407 bits (1045), Expect = e-111
 Identities = 209/349 (59%), Positives = 253/349 (72%), Gaps = 15/349 (4%)
 Frame = -2

Query: 1360 MDGYRPRFNVSPGSHIPVVRREEGS-GAQGTVLHCMKWGLIPSFTKKTDKPDHYKMFNAR 1184
            MD Y+PR+NVSPG+++PVVRR +G+ G  G VLHCMKWGLIPSFTKKT+KPDHY+MFNAR
Sbjct: 1    MDRYQPRYNVSPGANLPVVRRGDGADGEDGVVLHCMKWGLIPSFTKKTEKPDHYRMFNAR 60

Query: 1183 SESICEKASFRRLVPNNRCLVAVEGYYEWKKDGSRKQPYYIHLKDDQPLVFAALYDSWGN 1004
            SESICEKASFRRLVP +RC+VAVEG+YEWKKDGS+KQPYY+H KD +PL+FAALYDSW N
Sbjct: 61   SESICEKASFRRLVPKSRCVVAVEGFYEWKKDGSKKQPYYVHFKDGRPLLFAALYDSWEN 120

Query: 1003 SE-----------GEMLYTFTILTTSCSSALQWLHDRMPVILGNKSSIDAWLNGSFSPKS 857
            SE           GE LYTFTI+TTS SSAL WLHDRMPV+LG+K S+D WL+GS +   
Sbjct: 121  SEGTNVYTECETAGEKLYTFTIITTSSSSALGWLHDRMPVVLGDKESVDTWLDGSSASNF 180

Query: 856  NSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKNLISKFFSKKRTSNEQEI 677
            + +LKPYE  DLVWYPVTPAMGK SFDGP+C  EI LKT+  N I+KFFS K T  E EI
Sbjct: 181  DKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSNEIKLKTDGTNSITKFFSTKGTKKE-EI 239

Query: 676  IPQIENSSEESAPTK--PSKNLKXXXXXXELPIDSEEGNENPKFIISPVLNEEAEKCGAK 503
             P+  +  + S  T+   S N +      ++   S    E+ K  +S +  E+A K   K
Sbjct: 240  NPKDTSLHDSSVKTEFPESLNEEPETKEEKVQPSSTVKCEDSKSSVSILSQEDASKEQTK 299

Query: 502  REYEELTSDVKPF-NHNFKMQGSSPVKKKANLKNAGEKQPTLFSYFGKS 359
            R+YEE  +D KP  N + K   +SP KKK NLK + +KQPTLFSYF KS
Sbjct: 300  RDYEEFLADSKPLPNESDKKSSASPAKKKVNLKTSHDKQPTLFSYFRKS 348


>ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 protein C3orf37 homolog,
            partial [Cucumis sativus]
          Length = 344

 Score =  404 bits (1037), Expect = e-110
 Identities = 206/348 (59%), Positives = 247/348 (70%), Gaps = 1/348 (0%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LRA+DI RAC    G  R+L MD +RP FN SPGS +PVVRR++ S   G V
Sbjct: 1    MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            L CMKWGLIPSFT+K +KP+++KMFNARSESI EKASF RLVP  RCLVAVEG+YEWKKD
Sbjct: 61   LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            G +KQPYYIH KD QPL  AALYD W N EGE+LYTFTILTTS S AL+WLHDRMPVILG
Sbjct: 121  GXKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            +K  +D WLN S S K +S+LKPYE  DLVWYPVTP+MGKPSFDGPDCIKEI LK +  N
Sbjct: 181  DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240

Query: 727  LISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXXELPIDSEEGNENPKFI 548
            LISKFFS K T  E   + Q +  S  S   + S +L+           SEE     K  
Sbjct: 241  LISKFFSAKETKKEYS-VSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEES----KDC 295

Query: 547  ISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSP-VKKKANLK 407
            ++   ++ +     KR+ E+++SD+K    ++   GSSP ++KK NLK
Sbjct: 296  LAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLK 343


>gb|EYU40658.1| hypothetical protein MIMGU_mgv1a008176mg [Mimulus guttatus]
          Length = 382

 Score =  399 bits (1025), Expect = e-108
 Identities = 212/385 (55%), Positives = 260/385 (67%), Gaps = 22/385 (5%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LR++D  RAC +D    R+  MD YRP  NV+PG ++PVVRR++     G V
Sbjct: 1    MCGRARCTLRSDDFRRACHLDGRPVRHQNMDRYRPSHNVAPGFNVPVVRRDDEGDGGGAV 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            LHCMKWGLIPSFTKKT+K DH++MFNARSESI EKASFRRL+P NRCLV+VEG+YEWKKD
Sbjct: 61   LHCMKWGLIPSFTKKTEKIDHFRMFNARSESIREKASFRRLLPKNRCLVSVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            GSRKQPYYIH KD +PLVFAAL+DSW N+EGE+LYTFTI TTS SS+L+WLHDRMPVIL 
Sbjct: 121  GSRKQPYYIHFKDGRPLVFAALFDSWENAEGEILYTFTICTTSSSSSLEWLHDRMPVILR 180

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            NK S D WLN S     + ILKPYED DL WYPVT AMGK SFDGP+CIKE+  KTEE  
Sbjct: 181  NKESTDCWLNDSSLSNFDKILKPYEDEDLAWYPVTSAMGKLSFDGPECIKEV--KTEESK 238

Query: 727  LISKFFSKK------RTSNEQEIIPQIENSSE--------ESAPTKPSKNLKXXXXXXEL 590
             IS+FFSKK      + + E+  + ++  +SE        ES PT  S  LK        
Sbjct: 239  TISQFFSKKVANASQKPNLEKSPVKELAEASEAISVKEEHESQPTLDSTRLKDEDIENYE 298

Query: 589  PIDSEE-----GNENPKFIISPVLNEEAEKCGA-KREY--EELTSDVKPFNHNFKMQGSS 434
                +E      ++ PK II     E      + +++Y  E L +  KPF    + Q   
Sbjct: 299  QKSVQEEPEISQDDCPKLIIKKDDAENTSNISSIEKQYTGEMLRAHAKPFAKENEKQNVG 358

Query: 433  PVKKKANLKNAGEKQPTLFSYFGKS 359
            P +K++   N  ++QPTLFSYFG+S
Sbjct: 359  PARKRSKTAN-DKQQPTLFSYFGRS 382


>emb|CAN82703.1| hypothetical protein VITISV_026469 [Vitis vinifera]
          Length = 370

 Score =  395 bits (1015), Expect = e-107
 Identities = 213/395 (53%), Positives = 262/395 (66%), Gaps = 32/395 (8%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LR ++I RAC ++   ++N+QMD YRP +NVSPG+++PVVRR  G+  +  +
Sbjct: 1    MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEEAI 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            +HCMKWGL+PSFTKK++KPDHYKMFNARSES+CEKASFRRLVP NRCLVAVEG+YEWKKD
Sbjct: 61   VHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVAVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            GS+KQPYYIHLKD +PLVFAAL+DSW NSE                      DRMPVILG
Sbjct: 121  GSKKQPYYIHLKDGRPLVFAALFDSWANSE----------------------DRMPVILG 158

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            +K S DAWLNGS S + N++LKPYED DLVWYPVT AMGKPSF+GP+CIKEI LK E++ 
Sbjct: 159  DKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIKEIQLKNEQRP 218

Query: 727  LISKFFSKKRTSNEQEII---------------PQIENSS---------EESAPTKPSKN 620
             ISKFFS K   NEQ +                P IENS+         +  +    S  
Sbjct: 219  -ISKFFSTKGIKNEQGLSNEPVKSNLPQSMKEEPAIENSTGLPSSAVKGDHDSTCSRSVP 277

Query: 619  LKXXXXXXELP--IDSEEGNENPKFIISP-----VLNEEAEKCGAKREYEELTSDVKPFN 461
             +       LP  +  E   E+   +  P       +EEA K   KR++EE ++D KP  
Sbjct: 278  QEESTWFTNLPKSLKQEPETEDKTGLPFPGDHDSKCDEEATKLPIKRDFEEFSADSKP-- 335

Query: 460  HNFKMQGSSPVKKKANL-KNAGEKQPTLFSYFGKS 359
            +   ++  SPV KK  L KNAG+KQPTLFSYFGKS
Sbjct: 336  NTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGKS 370


>ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [Amborella trichopoda]
            gi|548853962|gb|ERN11922.1| hypothetical protein
            AMTR_s00020p00243160 [Amborella trichopoda]
          Length = 413

 Score =  393 bits (1009), Expect = e-106
 Identities = 211/378 (55%), Positives = 256/378 (67%), Gaps = 13/378 (3%)
 Frame = -2

Query: 1453 ERMCGRARCSLR-AEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQ 1277
            ++MCGRARC+L   ED+PRACG  N +   L    YR  +N++PG+++PV+R+E+ S   
Sbjct: 38   KKMCGRARCTLNPVEDVPRACGF-NANLPTLHTQRYRLSYNIAPGAYLPVLRKEQES-KH 95

Query: 1276 GTVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEW 1097
            G V+HCMKWGL+PSFTKKT+KPDH+KMFNARSESI EKASFRRLVPN RCLV VEG+YEW
Sbjct: 96   GYVVHCMKWGLVPSFTKKTEKPDHFKMFNARSESIQEKASFRRLVPNKRCLVVVEGFYEW 155

Query: 1096 KKDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPV 917
            KKDGS+KQPYY+H +D + LVFA LYD+W NSEGE LYTFTILTT CSSAL WLHDRMPV
Sbjct: 156  KKDGSKKQPYYLHFRDGRALVFAGLYDTWENSEGEGLYTFTILTTRCSSALDWLHDRMPV 215

Query: 916  ILGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTE 737
            ILGNK +IDAWLN + SPK +S+L+PYE SDLVWYPVTPAMGK  F GP+CIKEI LK+E
Sbjct: 216  ILGNKEAIDAWLNITPSPKVDSLLQPYEGSDLVWYPVTPAMGKIFFAGPECIKEIQLKSE 275

Query: 736  EKNLISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLK--XXXXXXELPIDSEEGNE 563
             KN ISK F +     +    P I  ++E+S      +N +          PID  +   
Sbjct: 276  NKNTISKLFMQSHNKKQPISEPSIRKAAEDSTHGHTFENSQEPSNTNEDWEPIDDFKVCI 335

Query: 562  NPKFIISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSPVKKKANL-----KN-- 404
              K   SP   EE EK   KR+ E+L  D K      K    S  +++  +     KN  
Sbjct: 336  GIKREASPGNAEETEKRRTKRDIEQLLVDPKKETIVGKENPISGEERQGYMDRGSHKNGM 395

Query: 403  ---AGEKQPTLFSYFGKS 359
                G KQ  LFSYFGKS
Sbjct: 396  PRITGGKQANLFSYFGKS 413


>ref|XP_004165094.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cucumis sativus]
          Length = 267

 Score =  382 bits (982), Expect = e-103
 Identities = 182/251 (72%), Positives = 204/251 (81%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRREEGSGAQGTV 1268
            MCGRARC+LRA+DI RAC    G  R+L MD +RP FN SPGS +PVVRR++ S   G V
Sbjct: 1    MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 1267 LHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKKD 1088
            L CMKWGLIPSFT+K +KP+++KMFNARSESI EKASF RLVP  RCLVAVEG+YEWKKD
Sbjct: 61   LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 1087 GSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVILG 908
            GS+KQPYYIH KD QPL  AALYD W N EGE+LYTFTILTTS S AL+WLHDRMPVILG
Sbjct: 121  GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180

Query: 907  NKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEKN 728
            +K  +D WLN S S K +S+LKPYE  DLVWYPVTP+MGKPSFDGPDCIKEI LK +  N
Sbjct: 181  DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240

Query: 727  LISKFFSKKRT 695
            LISKFFS K T
Sbjct: 241  LISKFFSAKET 251


>ref|XP_007140736.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris]
            gi|561013869|gb|ESW12730.1| hypothetical protein
            PHAVU_008G137400g [Phaseolus vulgaris]
          Length = 309

 Score =  378 bits (970), Expect = e-102
 Identities = 198/318 (62%), Positives = 233/318 (73%)
 Frame = -2

Query: 1315 IPVVRREEGSGAQGTVLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPN 1136
            +PVVRREE S + G VLH MKWGLIPSFTKKT+KPDHYKMFNARSESI EKASFRRL+P 
Sbjct: 1    MPVVRREEASDSGGYVLHSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPK 60

Query: 1135 NRCLVAVEGYYEWKKDGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSC 956
            +RCLVAVEG+YEWKKDGS+KQPYYIH KD + LVFAALYDSW NSEGE L+TFTI+TTS 
Sbjct: 61   SRCLVAVEGFYEWKKDGSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSS 120

Query: 955  SSALQWLHDRMPVILGNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFD 776
            SSALQWLHDRMPVILG+K S D WL+ S S    S++KPYE+SDLVWYPVT AMGK SFD
Sbjct: 121  SSALQWLHDRMPVILGSKESTDTWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKTSFD 179

Query: 775  GPDCIKEIHLKTEEKNLISKFFSKKRTSNEQEIIPQIENSSEESAPTKPSKNLKXXXXXX 596
            GP+CIKEI +K E    IS FFSKK  +  ++  P+ + SS E   T+P+++L       
Sbjct: 180  GPECIKEIQVKAEGNTSISMFFSKK-GAESKDTKPEQKLSSHEFVKTEPTEDLIEG---- 234

Query: 595  ELPIDSEEGNENPKFIISPVLNEEAEKCGAKREYEELTSDVKPFNHNFKMQGSSPVKKKA 416
                 +EEG+ + KF  S   ++ A     KREYE  ++D KP   N     S+P KKK 
Sbjct: 235  ---AKAEEGDNDLKFSGSS-HSKNASTLPIKREYETFSADSKPALANHDQISSNPAKKKE 290

Query: 415  NLKNAGEKQPTLFSYFGK 362
              K A +KQPTLFSYFGK
Sbjct: 291  KTKTANDKQPTLFSYFGK 308


>ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana]
            gi|26449484|dbj|BAC41868.1| unknown protein [Arabidopsis
            thaliana] gi|29028900|gb|AAO64829.1| At2g26470
            [Arabidopsis thaliana] gi|330252748|gb|AEC07842.1|
            uncharacterized protein AT2G26470 [Arabidopsis thaliana]
          Length = 487

 Score =  366 bits (940), Expect = 1e-98
 Identities = 170/256 (66%), Positives = 208/256 (81%), Gaps = 1/256 (0%)
 Frame = -2

Query: 1447 MCGRARCSLRAEDIPRACGIDNGSSRNLQMDGYRPRFNVSPGSHIPVVRRE-EGSGAQGT 1271
            MCGR RC+LR +D+PRA       +R L +D YRP +NV+PGS+IPV+RR+ E     G 
Sbjct: 1    MCGRTRCTLRPDDVPRASHRHTVPTRFLHLDRYRPSYNVAPGSYIPVLRRDNEEVVGDGV 60

Query: 1270 VLHCMKWGLIPSFTKKTDKPDHYKMFNARSESICEKASFRRLVPNNRCLVAVEGYYEWKK 1091
            V+HCMKWGL+PSFTKKTDKPD +KMFNARSES+ EKASFRRL+P NRCLVAV+G+YEWKK
Sbjct: 61   VVHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVAVDGFYEWKK 120

Query: 1090 DGSRKQPYYIHLKDDQPLVFAALYDSWGNSEGEMLYTFTILTTSCSSALQWLHDRMPVIL 911
            +GS+KQPYYIH +D +PLVFAAL+D+W NS GE LYTFTILTT+ SSALQWLHDRMPVIL
Sbjct: 121  EGSKKQPYYIHFEDGRPLVFAALFDTWQNSGGETLYTFTILTTASSSALQWLHDRMPVIL 180

Query: 910  GNKSSIDAWLNGSFSPKSNSILKPYEDSDLVWYPVTPAMGKPSFDGPDCIKEIHLKTEEK 731
            G+K SID WL+   + K   +L PYE SDLVWYPVT A+GKP+FDGP+CI++I LKT + 
Sbjct: 181  GDKDSIDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQQIPLKTSQN 240

Query: 730  NLISKFFSKKRTSNEQ 683
            +LISKFFS K+   ++
Sbjct: 241  SLISKFFSTKQPKTDE 256

