BLASTX nr result

ID: Catharanthus22_contig00015139 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00015139
         (1483 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Popu...   379   e-102
ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   376   e-101
ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog i...   375   e-101
ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   371   e-100
gb|EMJ00266.1| hypothetical protein PRUPE_ppa018685mg [Prunus pe...   370   e-100
gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]     369   1e-99
gb|EOX93768.1| Uncharacterized protein TCM_002685 [Theobroma cacao]   369   2e-99
gb|ESW12729.1| hypothetical protein PHAVU_008G137400g [Phaseolus...   366   2e-98
ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago ...   365   3e-98
ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hy...   365   3e-98
ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific 5-hy...   363   1e-97
ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana] ...   351   5e-94
ref|XP_006293960.1| hypothetical protein CARUB_v10022949mg [Caps...   350   7e-94
ref|XP_006293959.1| hypothetical protein CARUB_v10022949mg [Caps...   350   7e-94
ref|XP_002880802.1| hypothetical protein ARALYDRAFT_481505 [Arab...   349   2e-93
ref|XP_006294385.1| hypothetical protein CARUB_v10023401mg, part...   345   2e-92
ref|XP_006403078.1| hypothetical protein EUTSA_v10003450mg [Eutr...   344   6e-92
ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 prot...   344   6e-92
ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [A...   342   2e-91
ref|XP_004165094.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   342   2e-91

>ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa]
            gi|222844806|gb|EEE82353.1| hypothetical protein
            POPTR_0002s25190g [Populus trichocarpa]
          Length = 367

 Score =  379 bits (973), Expect = e-102
 Identities = 204/407 (50%), Positives = 247/407 (60%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RA H N   VR V+MDRYRPSYN SPG NL V+RR          
Sbjct: 1    MCGRARCTLRADDIPRACHRNTATVRSVNMDRYRPSYNASPGSNLAVVRRDDAASGDGAS 60

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                      +HCMKWGLIP FTKK+EKPD YKMFNARSES+ EKASFRRL+P +RCLV 
Sbjct: 61   GGDGY----AIHCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCLVA 116

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+NS+GE LYTFTI        + W
Sbjct: 117  VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAIQW 176

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LH+RMPVI G+KE+ + WL+   ++ FDT+LKPYE  DL WYPVTPAMGKPSFDGPECIK
Sbjct: 177  LHERMPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPECIK 236

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANEDPKDSQA 443
            EI LK  E  TIS+FFS+K    +  S P   +T  ++ ++ P   +E+  +E+  ++  
Sbjct: 237  EIHLKMEEKGTISKFFSRKEFKEE--SNP-EESTHGKSLKLEPKSVKEENESEEKLETPC 293

Query: 442  ITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKRLKDES 263
                             H+G      KR+ EEL    K   +E  K   SP +K+     
Sbjct: 294  SAKTVDYDLKSELETFSHEGETKCKTKRDREEL-VDSKLKTDEIVKPRASPAKKKAN--- 349

Query: 262  GXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSYFGK 122
                          S+D+                  KQPTLLSYFGK
Sbjct: 350  ------------LKSVDD------------------KQPTLLSYFGK 366


>ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cicer arietinum]
          Length = 375

 Score =  376 bits (965), Expect = e-101
 Identities = 198/358 (55%), Positives = 241/358 (67%), Gaps = 1/358 (0%)
 Frame = -1

Query: 1348 EEMCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXX 1169
            +EMCGR RCTLRPDD   A H      R + +DRYRPS+NVSPGF++PV+RR        
Sbjct: 19   DEMCGRGRCTLRPDDIPTACHRTTAPTRLLHVDRYRPSHNVSPGFHMPVVRREDASESEG 78

Query: 1168 XXXXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCL 989
                       VLHCMKWGLIPSFTKKTEKPDHY+MFNARSES+ EKASFRRL+P NRCL
Sbjct: 79   H----------VLHCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKNRCL 128

Query: 988  VEVEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXL 809
            V VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+NS+GE LYTFTI        L
Sbjct: 129  VAVEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSTL 188

Query: 808  AWLHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPEC 629
             WLHDRMPVI  +K+S + WLN+  +++F ++LKPYEE DLAWYPVTPAMGKPSFDGPEC
Sbjct: 189  QWLHDRMPVILSDKDSTDTWLNS--ASSFKSVLKPYEECDLAWYPVTPAMGKPSFDGPEC 246

Query: 628  IKEIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPP-KTEEDTANEDPKD 452
            IKEIQ+KA  N  IS+FFS+KG     G    +++  +  +  H P KTE+ T     KD
Sbjct: 247  IKEIQVKAEGNIPISKFFSRKG-----GEGEDTKSGHKILSLCHEPVKTEQTT-----KD 296

Query: 451  SQAITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKR 278
                           +  S  Q     ++KREY+ +S+  K S    D+   +PP K+
Sbjct: 297  LSEGAKTEEGESDLKSSGSSPQNVTKFTVKREYDAISSDSKPSLGINDQVIANPPTKK 354


>ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 1 [Fragaria vesca
            subsp. vesca]
          Length = 366

 Score =  375 bits (962), Expect = e-101
 Identities = 192/356 (53%), Positives = 237/356 (66%), Gaps = 1/356 (0%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RA + N   VR V+MDRY+P YNVSPG NLPV+RR          
Sbjct: 1    MCGRARCTLRADDISRACYRNHGPVRSVNMDRYQPRYNVSPGANLPVVRRGDGADGEDG- 59

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VVLHCMKWGLIPSFTKKTEKPDHY+MFNARSES+ EKASFRRLVP +RC+V 
Sbjct: 60   --------VVLHCMKWGLIPSFTKKTEKPDHYRMFNARSESICEKASFRRLVPKSRCVVA 111

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYY+HF+D RP++FAAL+DSW+NS+GE LYTFTI        L W
Sbjct: 112  VEGFYEWKKDGSKKQPYYVHFKDGRPLLFAALYDSWENSEGEKLYTFTIITTSSSSALGW 171

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPV+ G+KES++ WL+   ++NFD +LKPYE  DL WYPVTPAMGK SFDGPEC  
Sbjct: 172  LHDRMPVVLGDKESVDTWLDGSSASNFDKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSN 231

Query: 622  EIQLKANENRTISEFFSKKGAGRQP-GSKPYSRNTTEEATEIHPPKTEEDTANEDPKDSQ 446
            EI+LK +   +I++FFS KG  ++    K  S + +   TE  P    E+   ++ K   
Sbjct: 232  EIKLKTDGTNSITKFFSTKGTKKEEINPKDTSLHDSSVKTEF-PESLNEEPETKEEKVQP 290

Query: 445  AITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKR 278
            + T          +  S  + A+    KR+YEE     K    E+DK+  + P K+
Sbjct: 291  SSTVKCEDSKSSVSILS-QEDASKEQTKRDYEEFLADSKPLPNESDKKSSASPAKK 345


>ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [Vitis vinifera]
            gi|296090568|emb|CBI40918.3| unnamed protein product
            [Vitis vinifera]
          Length = 392

 Score =  371 bits (952), Expect = e-100
 Identities = 185/292 (63%), Positives = 212/292 (72%), Gaps = 2/292 (0%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLRPD+  RA +LN    +++ MDRYRPSYNVSPG NLPV+RR          
Sbjct: 1    MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEE-- 58

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                     ++HCMKWGL+PSFTKK+EKPDHYKMFNARSES+ EKASFRRLVP NRCLV 
Sbjct: 59   --------AIVHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYYIH +D RP+VFAALFDSW NS+GE LYT TI        L W
Sbjct: 111  VEGFYEWKKDGSKKQPYYIHLKDGRPLVFAALFDSWANSEGEILYTCTILTTSSSSALQW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+KES + WLN   S+ F+T+LKPYE+ DL WYPVT AMGKPSF+GPECIK
Sbjct: 171  LHDRMPVILGDKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIK 230

Query: 622  EIQLKANENRTISEFFSKKGAGRQPG--SKPYSRNTTEEATEIHPPKTEEDT 473
            EIQLK NE R IS+FFS KG   + G  ++P   N  +   E   P  E  T
Sbjct: 231  EIQLK-NEQRPISKFFSTKGIKNEQGLSNEPVKSNLPQSLKE--EPAIENST 279


>gb|EMJ00266.1| hypothetical protein PRUPE_ppa018685mg [Prunus persica]
          Length = 363

 Score =  370 bits (951), Expect = e-100
 Identities = 197/407 (48%), Positives = 242/407 (59%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RA H +   VR V+MDR+RP +N SPG NLPV+RR          
Sbjct: 1    MCGRARCTLRADDIPRACHRSHGPVRTVNMDRFRPLFNASPGSNLPVVRREDGGDGDG-- 58

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VV+HCMKWGLIPSFTKKTEKPDHYKMFNARSES+ EKASFRRL+P NRCL+ 
Sbjct: 59   --------VVVHCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNRCLIA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYY+HF D RP++FAAL+D W+NS+GE LYTFTI        L W
Sbjct: 111  VEGFYEWKKDGSKKQPYYVHFNDGRPLLFAALYDFWENSEGEKLYTFTIITTSSSSALGW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+K S + WL+   ++NFD++LKPYE  DL WYPVT AMGK SFDGPECI 
Sbjct: 171  LHDRMPVILGDKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTQAMGKVSFDGPECIN 230

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANEDPKDSQA 443
            EIQLK   N +I++FF  KG  ++  +   +           P   +E+   ++  +  A
Sbjct: 231  EIQLKTEGNNSITKFFMSKGTKKEELNPKDTSFYDSSVKNDLPKSVKEEPEGKEKTEQPA 290

Query: 442  ITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKRLKDES 263
             T                +G +    KR+YEE S   K    E  +   SP +K++  +S
Sbjct: 291  STEKCENDSKGQTIS--QEGVSKGQTKRDYEEFSADSKPVAYETSEMSASPAKKKVNPKS 348

Query: 262  GXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSYFGK 122
                          S+D+                   QPTL SYFGK
Sbjct: 349  --------------SVDK-------------------QPTLFSYFGK 362


>gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]
          Length = 469

 Score =  369 bits (948), Expect = 1e-99
 Identities = 200/369 (54%), Positives = 240/369 (65%), Gaps = 9/369 (2%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RA H N   VR V+MDRYRPSYNVSPG N+PV+RR          
Sbjct: 1    MCGRARCTLRADDVPRACHRNNGSVRTVNMDRYRPSYNVSPGSNIPVVRREDGSDGEGF- 59

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                     V+HCMKWGLIPSFTKKT+KPDHYKMFNARSES+ EK SFRRL+P +RCLV 
Sbjct: 60   ---------VVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIGEKVSFRRLIPKSRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKN--------SKGEALYTFTIXXX 827
            VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+N          GE LYTFTI   
Sbjct: 111  VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWENYLVTAIVIPAGEILYTFTILTI 170

Query: 826  XXXXXLAWLHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPS 647
                 L WLHDRMPVIFG+KES + WL    S+    +LKPYE+ DL WYPVTPAMGKPS
Sbjct: 171  SSSSALGWLHDRMPVIFGDKESSDAWLTG-SSSKVGALLKPYEDPDLVWYPVTPAMGKPS 229

Query: 646  FDGPECIKEIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTAN 467
            FDGPECI E++LKA+ N  IS+FFS KG  ++    P   ++  ++ +    K  E  AN
Sbjct: 230  FDGPECI-EMKLKADGNIPISKFFSAKGTKKEADLNPEESSSKVDSAKCLEEK-PESKAN 287

Query: 466  EDPKDSQAITXXXXXXXXXXNFESFHQGAA-NISMKREYEELSTKMKHSDEEADKQHVSP 290
              P  S              +  SF QG A    +KR++E+LS   K + +E  K   SP
Sbjct: 288  RGPFSS----TEKGEADSKSSVSSFSQGGAEKCQIKRDHEKLSADSKSNTDETKKLFDSP 343

Query: 289  PQKRLKDES 263
             +K++K +S
Sbjct: 344  GRKKVKLKS 352


>gb|EOX93768.1| Uncharacterized protein TCM_002685 [Theobroma cacao]
          Length = 360

 Score =  369 bits (946), Expect = 2e-99
 Identities = 191/356 (53%), Positives = 232/356 (65%), Gaps = 1/356 (0%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RASH N   VRHV MDRYRPSYNV PG NLPV+RR          
Sbjct: 1    MCGRARCTLRADDIPRASHRNDGPVRHVHMDRYRPSYNVGPGMNLPVVRRDDGSNGDGG- 59

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VVLHCMKWGLIPSFTKKT+KPD YKMFNARSES+ EKASFRRL+P +RCLV 
Sbjct: 60   --------VVLHCMKWGLIPSFTKKTDKPDFYKMFNARSESVCEKASFRRLLPKSRCLVA 111

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+D W+NS+GE LYTFTI          W
Sbjct: 112  VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDCWENSEGEKLYTFTILTTASSSAFLW 171

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+KES + WLN    T  DT+LKPYE  DL WYPVT A+GK SF+GPEC+K
Sbjct: 172  LHDRMPVILGDKESTDTWLN---GTKIDTLLKPYENPDLVWYPVTSAIGKLSFEGPECVK 228

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKT-EEDTANEDPKDSQ 446
            E+ LK  E   IS+FFS +   R+  S    ++  +E+ + +  K  +E+  + + K+  
Sbjct: 229  EVPLKTQEKNPISKFFSTREVKREQESN-MEKSLCDESVQTNLLKNLKEEPNSPEDKEIP 287

Query: 445  AITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKR 278
            ++                 +       KR+YEE S   K + +E +   VSP +K+
Sbjct: 288  SLASKEDNDSKSSVLVPTCEDVRKCQTKRDYEEFSADTKPAKDEIE---VSPARKK 340


>gb|ESW12729.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris]
          Length = 353

 Score =  366 bits (939), Expect = 2e-98
 Identities = 202/407 (49%), Positives = 248/407 (60%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGR RCTLR DD  RA H +    R + MDRYRP+YNVSPG N+PV+RR          
Sbjct: 1    MCGRTRCTLRSDDVPRACHRSDAPTRTLHMDRYRPAYNVSPGSNMPVVRREEASDSGGY- 59

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                     VLH MKWGLIPSFTKKTEKPDHYKMFNARSES+ EKASFRRL+P +RCLV 
Sbjct: 60   ---------VLHSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPKSRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYYIHF+D R +VFAAL+DSW+NS+GE L+TFTI        L W
Sbjct: 111  VEGFYEWKKDGSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSSSSALQW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+KES + WL++  S +F +++KPYEE DL WYPVT AMGK SFDGPECIK
Sbjct: 171  LHDRMPVILGSKESTDTWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKTSFDGPECIK 229

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANEDPKDSQA 443
            EIQ+KA  N +IS FFSKKGA     +KP  + ++ E  +  P +   + A  +  D+  
Sbjct: 230  EIQVKAEGNTSISMFFSKKGA-ESKDTKPEQKLSSHEFVKTEPTEDLIEGAKAEEGDND- 287

Query: 442  ITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKRLKDES 263
                        +  S  + A+ + +KREYE  S   K +    D+   +P +K+ K ++
Sbjct: 288  ---------LKFSGSSHSKNASTLPIKREYETFSADSKPALANHDQISSNPAKKKEKTKT 338

Query: 262  GXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSYFGK 122
                                                KQPTL SYFGK
Sbjct: 339  A---------------------------------NDKQPTLFSYFGK 352


>ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago truncatula]
            gi|355497798|gb|AES79001.1| hypothetical protein
            MTR_7g052250 [Medicago truncatula]
          Length = 354

 Score =  365 bits (937), Expect = 3e-98
 Identities = 202/408 (49%), Positives = 249/408 (61%), Gaps = 1/408 (0%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGR RC+LR DD  RA H      R + +DRYRPS NVSPGFN+PV+RR          
Sbjct: 1    MCGRTRCSLRADDVPRACHRTTAPSRLLHIDRYRPSNNVSPGFNIPVVRREDNASAESDG 60

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                     V+HCMKWGLIPSFTKKT+KPDHYKMFNARSES+ EKASFRRL+P NRCLV 
Sbjct: 61   H--------VVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIDEKASFRRLLPKNRCLVA 112

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+NS+GE LYTFTI          W
Sbjct: 113  VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTSSSSAFKW 172

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+K++ + WL++  +++F +++KPYEE DL WYPVTPAMGKPSFDGPECIK
Sbjct: 173  LHDRMPVILGDKDTTDTWLSS--ASSFKSVMKPYEESDLVWYPVTPAMGKPSFDGPECIK 230

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEE-DTANEDPKDSQ 446
            EIQ+K      IS+FFSKK A     +KP  +  + E     P KTE+    +E+ K  +
Sbjct: 231  EIQIKTEGYIPISKFFSKKEA-EVEDTKPEHKILSHE-----PVKTEQTKDVSEEAKTEE 284

Query: 445  AITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKRLKDE 266
              T             S  Q     ++KREY+ +S+  K S    D+   +P +K+ K +
Sbjct: 285  GDTDLKSSGI------SPSQNVNRFAIKREYDAISSDSKPSLANNDQVSANPAKKKEKAK 338

Query: 265  SGXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSYFGK 122
            +                                    KQPTL SYFGK
Sbjct: 339  TA---------------------------------DDKQPTLFSYFGK 353


>ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like [Glycine
            max]
          Length = 382

 Score =  365 bits (936), Expect = 3e-98
 Identities = 207/423 (48%), Positives = 253/423 (59%), Gaps = 16/423 (3%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RA H +    R + +DRYRP+YNVSPGF++PV+RR          
Sbjct: 1    MCGRARCTLRADDVPRACHRSTSPTRTLHIDRYRPAYNVSPGFDVPVVRRDDASGGEGY- 59

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                     VL CMKWGLIPSFTKKTEKPDHY+MFNARSES+ EKASFRRL+P +RCLV 
Sbjct: 60   ---------VLQCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+NS+GE LYTFTI        L W
Sbjct: 111  VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSALQW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+KES ++WL++  S +F +++KPYEE DL WYPVT AMGK SFDGPECIK
Sbjct: 171  LHDRMPVILGSKESTDIWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKASFDGPECIK 229

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNT---------TEEATEIHPPKTEEDTA 470
            EIQ+KA  N +IS FFSKKG      +KP  + +         TE+ TE    K E+ T+
Sbjct: 230  EIQVKAQGNTSISMFFSKKG-DESKDTKPEQKASCPEVVKTEHTEDLTESKDTKPEQKTS 288

Query: 469  N------EDPKDSQAITXXXXXXXXXXNFESFH-QGAANISMKREYEELSTKMKHSDEEA 311
            +      E  +D +                S H Q  + + +KREYE  S         A
Sbjct: 289  SHEFVKTEPTEDLRERAKTEEGGNDLKFHGSSHSQNVSMLPIKREYETFSA-ADSKPALA 347

Query: 310  DKQHVSPPQKRLKDESGXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSY 131
            +   +SP   + K+++                                    KQPTL SY
Sbjct: 348  NHDQISPNPAKKKEKA-------------------------------KTANDKQPTLFSY 376

Query: 130  FGK 122
            FGK
Sbjct: 377  FGK 379


>ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like isoform X1
            [Citrus sinensis]
          Length = 398

 Score =  363 bits (932), Expect = 1e-97
 Identities = 176/285 (61%), Positives = 207/285 (72%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RA H  G   R ++MDRYRPSYNV+PG+NLPV+RR          
Sbjct: 1    MCGRARCTLRADDLPRACHRTGSPARTLNMDRYRPSYNVAPGWNLPVVRRDDDGEGF--- 57

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                     VLHCMKWGLIPSFTKK EKPD YKMFNARSES+ EKASFRRL+P +RCL  
Sbjct: 58   ---------VLHCMKWGLIPSFTKKNEKPDFYKMFNARSESVTEKASFRRLLPKSRCLAA 108

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYY+HF+D RP+VFAAL+D+W++S+GE LYTFTI        L W
Sbjct: 109  VEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDTWQSSEGEILYTFTILTTSSSAALQW 168

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+KES + WLN   S+ +DTILKPYEE DL WYPVTP MGK SF+GPECIK
Sbjct: 169  LHDRMPVILGDKESSDAWLNGSSSSKYDTILKPYEESDLVWYPVTPVMGKLSFNGPECIK 228

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPK 488
            EI LK      IS FF KK   ++  SK   +++ +E+ + + PK
Sbjct: 229  EIPLKTEGKNPISNFFLKKEIKKEQESKMDEKSSFDESVKTNLPK 273


>ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana]
            gi|26449484|dbj|BAC41868.1| unknown protein [Arabidopsis
            thaliana] gi|29028900|gb|AAO64829.1| At2g26470
            [Arabidopsis thaliana] gi|330252748|gb|AEC07842.1|
            uncharacterized protein AT2G26470 [Arabidopsis thaliana]
          Length = 487

 Score =  351 bits (900), Expect = 5e-94
 Identities = 170/297 (57%), Positives = 208/297 (70%), Gaps = 1/297 (0%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGR RCTLRPDD  RASH +    R + +DRYRPSYNV+PG  +PV+RR          
Sbjct: 1    MCGRTRCTLRPDDVPRASHRHTVPTRFLHLDRYRPSYNVAPGSYIPVLRRDNEEVVGDG- 59

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VV+HCMKWGL+PSFTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV 
Sbjct: 60   --------VVVHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 111

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFD+W+NS GE LYTFTI        L W
Sbjct: 112  VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDTWQNSGGETLYTFTILTTASSSALQW 171

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+K+SI+ WL+ P +T    +L PYE+ DL WYPVT A+GKP+FDGPECI+
Sbjct: 172  LHDRMPVILGDKDSIDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQ 231

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEI-HPPKTEEDTANEDPK 455
            +I LK ++N  IS+FFS K      G K           ++   P  E+DT ++  K
Sbjct: 232  QIPLKTSQNSLISKFFSTKQPKTDEGDKETKSTDANIIVDLKKEPTAEKDTFSDSIK 288


>ref|XP_006293960.1| hypothetical protein CARUB_v10022949mg [Capsella rubella]
            gi|482562668|gb|EOA26858.1| hypothetical protein
            CARUB_v10022949mg [Capsella rubella]
          Length = 540

 Score =  350 bits (899), Expect = 7e-94
 Identities = 170/294 (57%), Positives = 207/294 (70%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGR RCTLRPDD  RASH +G + R +  DRYRPSYNV+PG  +PV+RR          
Sbjct: 1    MCGRTRCTLRPDDVPRASHRHGVQTRFLHTDRYRPSYNVAPGSYMPVLRRDNEVVGDG-- 58

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VV+HCMKWGL+P FTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV 
Sbjct: 59   --------VVVHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFDSW+NS GE LYTFTI        L W
Sbjct: 111  VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDSWQNSGGETLYTFTILTTASSSSLHW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+K+S++ WL+ P +T    +L PYE+ DL WYPVT A+GKP+FDGPECI+
Sbjct: 171  LHDRMPVILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQ 230

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANED 461
            +I LKA++N  IS+FFS K                        PKT+++TA+ D
Sbjct: 231  QITLKASQNSLISKFFSTK-----------------------HPKTDKETASTD 261


>ref|XP_006293959.1| hypothetical protein CARUB_v10022949mg [Capsella rubella]
            gi|482562667|gb|EOA26857.1| hypothetical protein
            CARUB_v10022949mg [Capsella rubella]
          Length = 521

 Score =  350 bits (899), Expect = 7e-94
 Identities = 170/294 (57%), Positives = 207/294 (70%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGR RCTLRPDD  RASH +G + R +  DRYRPSYNV+PG  +PV+RR          
Sbjct: 1    MCGRTRCTLRPDDVPRASHRHGVQTRFLHTDRYRPSYNVAPGSYMPVLRRDNEVVGDG-- 58

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VV+HCMKWGL+P FTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV 
Sbjct: 59   --------VVVHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFDSW+NS GE LYTFTI        L W
Sbjct: 111  VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDSWQNSGGETLYTFTILTTASSSSLHW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+K+S++ WL+ P +T    +L PYE+ DL WYPVT A+GKP+FDGPECI+
Sbjct: 171  LHDRMPVILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQ 230

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANED 461
            +I LKA++N  IS+FFS K                        PKT+++TA+ D
Sbjct: 231  QITLKASQNSLISKFFSTK-----------------------HPKTDKETASTD 261


>ref|XP_002880802.1| hypothetical protein ARALYDRAFT_481505 [Arabidopsis lyrata subsp.
            lyrata] gi|297326641|gb|EFH57061.1| hypothetical protein
            ARALYDRAFT_481505 [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  349 bits (895), Expect = 2e-93
 Identities = 165/280 (58%), Positives = 203/280 (72%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGR RCTLRPDD  RASH +    R + +DRYRPSYN++PG  +PV+RR          
Sbjct: 1    MCGRTRCTLRPDDIQRASHRHTVPTRSLHLDRYRPSYNIAPGSYIPVLRRENEVVGDG-- 58

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VV+HCMKWGL+P FTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV 
Sbjct: 59   --------VVVHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFDSW+NS GE LYTFTI        L W
Sbjct: 111  VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDSWQNSGGETLYTFTILTTTSSSPLQW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+K+S++ WL+ P +T    +L PYE+ DL WYPVT A+GKP+FDGPECI+
Sbjct: 171  LHDRMPVILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTTAIGKPTFDGPECIQ 230

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATE 503
            +I LKA++N  IS+FFS+K       +K    N + +  E
Sbjct: 231  QIPLKASQNSLISKFFSRKTEEGDKETKSTDANISVDLKE 270


>ref|XP_006294385.1| hypothetical protein CARUB_v10023401mg, partial [Capsella rubella]
            gi|482563093|gb|EOA27283.1| hypothetical protein
            CARUB_v10023401mg, partial [Capsella rubella]
          Length = 389

 Score =  345 bits (886), Expect = 2e-92
 Identities = 168/294 (57%), Positives = 205/294 (69%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGR  CTLRPDD  RASH +G + R +  DRYRPSYNV+PG  +PV+RR          
Sbjct: 1    MCGRTCCTLRPDDVPRASHRHGVQTRFLHTDRYRPSYNVAPGSYMPVLRRDNEVVGDG-- 58

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VV+HCMKWGL+P FTKKT+KPD +KMFNARSES+ EK+SFRRL+P NRCLV 
Sbjct: 59   --------VVVHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKSSFRRLLPKNRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFDSW NS GE LYTFTI        L W
Sbjct: 111  VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDSWPNSGGETLYTFTILTAASSSALHW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+K+S++ WL+ P +T    +L PYE+ DL WYPVT A+GKP+FDGPECI+
Sbjct: 171  LHDRMPVILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQ 230

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANED 461
            +I LKA++N  IS+FFS K                        PKT+++TA+ D
Sbjct: 231  QITLKASQNSLISKFFSTK-----------------------HPKTDKETASTD 261


>ref|XP_006403078.1| hypothetical protein EUTSA_v10003450mg [Eutrema salsugineum]
            gi|557104185|gb|ESQ44531.1| hypothetical protein
            EUTSA_v10003450mg [Eutrema salsugineum]
          Length = 480

 Score =  344 bits (882), Expect = 6e-92
 Identities = 166/288 (57%), Positives = 201/288 (69%), Gaps = 1/288 (0%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLRPDD  RASH +G   R + +DRYRPSYNV+PG  +PV+RR          
Sbjct: 1    MCGRARCTLRPDDVPRASHRHGVPARFLHLDRYRPSYNVAPGTYMPVLRRDNDG------ 54

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    + +HCMKWGL+PSFTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV 
Sbjct: 55   --------IAVHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 106

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            V+GFYEWKK+GSK+QPYYIHF D RP+VFAALFDSW+NS GE L TFTI        L W
Sbjct: 107  VDGFYEWKKEGSKKQPYYIHFNDRRPLVFAALFDSWQNSGGETLDTFTILTTTSSSALDW 166

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI  +KES++ WL+ P ++N   +L PYE  DL WYPVT A+GK  FDGPECI+
Sbjct: 167  LHDRMPVILNDKESVDTWLDGPSTSNLKPLLVPYENSDLVWYPVTSAIGKLCFDGPECIQ 226

Query: 622  EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEI-HPPKTE 482
            +I LKA++N  IS+FFS K      G +       +   ++   PK E
Sbjct: 227  QIPLKASQNSLISKFFSAKHPNTDEGDRETKSTDADTPVDLKEKPKVE 274


>ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 protein C3orf37 homolog,
            partial [Cucumis sativus]
          Length = 344

 Score =  344 bits (882), Expect = 6e-92
 Identities = 177/358 (49%), Positives = 225/358 (62%), Gaps = 7/358 (1%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RA H  G  VR ++MDR+RP +N SPG +LPV+RR          
Sbjct: 1    MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGG-- 58

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VVL CMKWGLIPSFT+K EKP+++KMFNARSES+ EKASF RLVP  RCLV 
Sbjct: 59   --------VVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDG K+QPYYIHF+D +P+  AAL+D W+N +GE LYTFTI        L W
Sbjct: 111  VEGFYEWKKDGXKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+KE ++MWLN   S+ +D++LKPYE  DL WYPVTP+MGKPSFDGP+CIK
Sbjct: 171  LHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIK 230

Query: 622  EIQLKANENRTISEFFSKKGAGRQPG-------SKPYSRNTTEEATEIHPPKTEEDTANE 464
            EIQLK + +  IS+FFS K   ++         S    +     + E H  +     ++E
Sbjct: 231  EIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSE 290

Query: 463  DPKDSQAITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSP 290
            + KD  A              +     +    +KR+ E++S+ +K   ++  K   SP
Sbjct: 291  ESKDCLA--------------KCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSP 334


>ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [Amborella trichopoda]
            gi|548853962|gb|ERN11922.1| hypothetical protein
            AMTR_s00020p00243160 [Amborella trichopoda]
          Length = 413

 Score =  342 bits (878), Expect = 2e-91
 Identities = 169/296 (57%), Positives = 208/296 (70%), Gaps = 1/296 (0%)
 Frame = -1

Query: 1351 REEMCGRARCTLRP-DDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXX 1175
            R++MCGRARCTL P +D  RA   N + +  +   RYR SYN++PG  LPV+R+      
Sbjct: 37   RKKMCGRARCTLNPVEDVPRACGFNAN-LPTLHTQRYRLSYNIAPGAYLPVLRKEQESKH 95

Query: 1174 XXXXXXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNR 995
                         V+HCMKWGL+PSFTKKTEKPDH+KMFNARSES++EKASFRRLVP  R
Sbjct: 96   GY-----------VVHCMKWGLVPSFTKKTEKPDHFKMFNARSESIQEKASFRRLVPNKR 144

Query: 994  CLVEVEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXX 815
            CLV VEGFYEWKKDGSK+QPYY+HF+D R +VFA L+D+W+NS+GE LYTFTI       
Sbjct: 145  CLVVVEGFYEWKKDGSKKQPYYLHFRDGRALVFAGLYDTWENSEGEGLYTFTILTTRCSS 204

Query: 814  XLAWLHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGP 635
             L WLHDRMPVI GNKE+I+ WLN  PS   D++L+PYE  DL WYPVTPAMGK  F GP
Sbjct: 205  ALDWLHDRMPVILGNKEAIDAWLNITPSPKVDSLLQPYEGSDLVWYPVTPAMGKIFFAGP 264

Query: 634  ECIKEIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTAN 467
            ECIKEIQLK+    TIS+ F +    +QP S+P  R   E++T  H  +  ++ +N
Sbjct: 265  ECIKEIQLKSENKNTISKLFMQSHNKKQPISEPSIRKAAEDSTHGHTFENSQEPSN 320


>ref|XP_004165094.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cucumis sativus]
          Length = 267

 Score =  342 bits (877), Expect = 2e-91
 Identities = 164/263 (62%), Positives = 194/263 (73%)
 Frame = -1

Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163
            MCGRARCTLR DD  RA H  G  VR ++MDR+RP +N SPG +LPV+RR          
Sbjct: 1    MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGG-- 58

Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983
                    VVL CMKWGLIPSFT+K EKP+++KMFNARSES+ EKASF RLVP  RCLV 
Sbjct: 59   --------VVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVA 110

Query: 982  VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803
            VEGFYEWKKDGSK+QPYYIHF+D +P+  AAL+D W+N +GE LYTFTI        L W
Sbjct: 111  VEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKW 170

Query: 802  LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623
            LHDRMPVI G+KE ++MWLN   S+ +D++LKPYE  DL WYPVTP+MGKPSFDGP+CIK
Sbjct: 171  LHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIK 230

Query: 622  EIQLKANENRTISEFFSKKGAGR 554
            EIQLK + +  IS+FFS K   R
Sbjct: 231  EIQLKNDGSNLISKFFSAKETKR 253


Top