BLASTX nr result

ID: Catharanthus23_contig00004564 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00004564
         (1801 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277652.2| PREDICTED: UPF0505 protein C16orf62 homolog ...   593   e-166
emb|CBI26668.3| unnamed protein product [Vitis vinifera]              593   e-166
ref|XP_006365948.1| PREDICTED: UPF0505 protein C16orf62 homolog ...   580   e-163
ref|XP_006452424.1| hypothetical protein CICLE_v10007388mg [Citr...   580   e-163
ref|XP_006365949.1| PREDICTED: UPF0505 protein C16orf62 homolog ...   575   e-161
gb|EOY12279.1| Uncharacterized protein isoform 2 [Theobroma cacao]    572   e-160
gb|EOY12278.1| Uncharacterized protein isoform 1 [Theobroma cacao]    570   e-160
ref|XP_004251467.1| PREDICTED: UPF0505 protein C16orf62 homolog ...   567   e-159
gb|EOY12280.1| Uncharacterized protein isoform 3 [Theobroma cacao]    562   e-157
ref|XP_006365950.1| PREDICTED: UPF0505 protein C16orf62 homolog ...   551   e-154
ref|XP_002529445.1| esophageal cancer associated protein, putati...   538   e-150
ref|XP_003545120.1| PREDICTED: UPF0505 protein-like isoform X1 [...   517   e-144
ref|XP_006595724.1| PREDICTED: UPF0505 protein-like isoform X2 [...   512   e-142
gb|EXB66322.1| hypothetical protein L484_008062 [Morus notabilis]     499   e-138
gb|ESW14309.1| hypothetical protein PHAVU_008G270200g [Phaseolus...   494   e-137
gb|ESW14308.1| hypothetical protein PHAVU_008G270200g [Phaseolus...   494   e-137
ref|NP_175488.2| uncharacterized protein [Arabidopsis thaliana] ...   493   e-136
ref|XP_006306720.1| hypothetical protein CARUB_v10008246mg [Caps...   490   e-136
ref|XP_006393113.1| hypothetical protein EUTSA_v10011218mg [Eutr...   488   e-135
gb|AAG50781.1|AC079027_4 hypothetical protein [Arabidopsis thali...   485   e-134

>ref|XP_002277652.2| PREDICTED: UPF0505 protein C16orf62 homolog [Vitis vinifera]
          Length = 920

 Score =  593 bits (1528), Expect = e-166
 Identities = 311/585 (53%), Positives = 419/585 (71%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y I+ IND+K L+MR    ++     SS ++RLL+SLMEPTIEY+MK +FKD + Q QV 
Sbjct: 322  YLISCINDIKILLMRMISEKEATHGNSSANKRLLVSLMEPTIEYIMKCIFKDAS-QRQVG 380

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            DILV LGLG+++S+LFG+ P  S+I+HHLLKELP  V+ S+A  ILHLI    D SFDQ 
Sbjct: 381  DILVKLGLGRNESELFGKFPFVSIILHHLLKELPTEVVSSNATEILHLIESCNDYSFDQC 440

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNY+LLG RL E  SQ+    A+++KVIQV + ++ LDEY+KV+D+YVDIVLQNQ+ D +
Sbjct: 441  LNYRLLGFRLGERGSQMDMINAIIDKVIQVVAQFNCLDEYLKVVDSYVDIVLQNQM-DNY 499

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ ILE + +  CN+ IDES L SLQS F+KLL HF+NL DI +LNHF++ILD+M+G++R
Sbjct: 500  LDAILEGVSKRACNKEIDESELGSLQSIFSKLLAHFNNLEDIFALNHFVEILDVMYGSSR 559

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            N+IN+QIL IATRN  + DP  I  L E+SQ+LHDG+D  N+K +++Q  A+LISRFV +
Sbjct: 560  NIINMQILNIATRNGYIHDPATIQLLLEISQSLHDGIDLFNMKDNDNQQPARLISRFVQM 619

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + E +L+FLV+CRGAF ++ +LKETLVHS N LA+KAM+    H SFVKSC+ FS
Sbjct: 620  VDYGIEMEHHLTFLVECRGAFSNIEELKETLVHSCNCLAIKAMKEAKKHISFVKSCIAFS 679

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSI A  +Q NLY+ETAEVA + GLVSH+DGLIDS + C Q  DL DG     D D
Sbjct: 680  EVTIPSISACPKQLNLYLETAEVALVCGLVSHSDGLIDSALGCLQTLDLMDGFQILIDVD 739

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
              LS + KLCSL+++VPGN EQG   +P+S+  LV+SQS +T K++ R            
Sbjct: 740  GILSLIRKLCSLLVMVPGNPEQGAAFIPKSILSLVSSQSWITPKMRARILCAIISLSATL 799

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   ++   ++L N  +FFGD TYLQ+L+SLS  +++++ +++ +EP Q  RG +AL
Sbjct: 800  SQNKLPYNVDNIEILGNDLLFFGDSTYLQDLVSLSEFVLEELCNVIQQEPSQAARGSMAL 859

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFL 45
            EACNCIASSFK S +   ICS L+E A+  L S+N YL+ST   L
Sbjct: 860  EACNCIASSFKVSPEISPICSKLMETAQLCLSSNNKYLQSTMKLL 904


>emb|CBI26668.3| unnamed protein product [Vitis vinifera]
          Length = 810

 Score =  593 bits (1528), Expect = e-166
 Identities = 311/585 (53%), Positives = 419/585 (71%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y I+ IND+K L+MR    ++     SS ++RLL+SLMEPTIEY+MK +FKD + Q QV 
Sbjct: 212  YLISCINDIKILLMRMISEKEATHGNSSANKRLLVSLMEPTIEYIMKCIFKDAS-QRQVG 270

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            DILV LGLG+++S+LFG+ P  S+I+HHLLKELP  V+ S+A  ILHLI    D SFDQ 
Sbjct: 271  DILVKLGLGRNESELFGKFPFVSIILHHLLKELPTEVVSSNATEILHLIESCNDYSFDQC 330

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNY+LLG RL E  SQ+    A+++KVIQV + ++ LDEY+KV+D+YVDIVLQNQ+ D +
Sbjct: 331  LNYRLLGFRLGERGSQMDMINAIIDKVIQVVAQFNCLDEYLKVVDSYVDIVLQNQM-DNY 389

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ ILE + +  CN+ IDES L SLQS F+KLL HF+NL DI +LNHF++ILD+M+G++R
Sbjct: 390  LDAILEGVSKRACNKEIDESELGSLQSIFSKLLAHFNNLEDIFALNHFVEILDVMYGSSR 449

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            N+IN+QIL IATRN  + DP  I  L E+SQ+LHDG+D  N+K +++Q  A+LISRFV +
Sbjct: 450  NIINMQILNIATRNGYIHDPATIQLLLEISQSLHDGIDLFNMKDNDNQQPARLISRFVQM 509

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + E +L+FLV+CRGAF ++ +LKETLVHS N LA+KAM+    H SFVKSC+ FS
Sbjct: 510  VDYGIEMEHHLTFLVECRGAFSNIEELKETLVHSCNCLAIKAMKEAKKHISFVKSCIAFS 569

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSI A  +Q NLY+ETAEVA + GLVSH+DGLIDS + C Q  DL DG     D D
Sbjct: 570  EVTIPSISACPKQLNLYLETAEVALVCGLVSHSDGLIDSALGCLQTLDLMDGFQILIDVD 629

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
              LS + KLCSL+++VPGN EQG   +P+S+  LV+SQS +T K++ R            
Sbjct: 630  GILSLIRKLCSLLVMVPGNPEQGAAFIPKSILSLVSSQSWITPKMRARILCAIISLSATL 689

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   ++   ++L N  +FFGD TYLQ+L+SLS  +++++ +++ +EP Q  RG +AL
Sbjct: 690  SQNKLPYNVDNIEILGNDLLFFGDSTYLQDLVSLSEFVLEELCNVIQQEPSQAARGSMAL 749

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFL 45
            EACNCIASSFK S +   ICS L+E A+  L S+N YL+ST   L
Sbjct: 750  EACNCIASSFKVSPEISPICSKLMETAQLCLSSNNKYLQSTMKLL 794


>ref|XP_006365948.1| PREDICTED: UPF0505 protein C16orf62 homolog isoform X1 [Solanum
            tuberosum]
          Length = 923

 Score =  580 bits (1494), Expect = e-163
 Identities = 292/587 (49%), Positives = 405/587 (68%), Gaps = 2/587 (0%)
 Frame = -3

Query: 1793 IAGINDLKNLIMRATFPQQTR--DDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            I  +ND+K L+M               SG     L LMEP IEYVMK +FK+  E  Q+ 
Sbjct: 329  IISMNDMKTLLMNGAHVASAEKPSGALSGTRSSKLGLMEPAIEYVMKCLFKESCEHLQIG 388

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            DIL+GLGL ++QS+LFG   C S+++HHLL+ELPI ++CS+A++ILHLI CS D SFDQ 
Sbjct: 389  DILMGLGLARNQSELFGNSSCVSLVLHHLLRELPIRIVCSNALDILHLIECSNDYSFDQC 448

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNYKLLGLRLCE++S V+E   V+ KVIQV S ++ LDEY+ V+DA+VDI LQ  + D++
Sbjct: 449  LNYKLLGLRLCENISHVNEVNLVMKKVIQVVSQFNSLDEYLNVIDAHVDIALQKHM-DSY 507

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ IL+ IFE   ++ I E+ L+SLQS   KLL HFDNL  IL LNHF  IL +M G++R
Sbjct: 508  LDSILDGIFERTLDDEIGENELSSLQSILLKLLNHFDNLEHILRLNHFNQILSMMQGSSR 567

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
             ++N++IL IATR +C++DPT I FLFEVS++LHD +D   IK  E+ HSA L+SRF+++
Sbjct: 568  TIVNMRILSIATRYSCVRDPTTIQFLFEVSRSLHDSIDLSTIKEKENNHSAHLVSRFIHM 627

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++  + +R+L FLV CRGAFGSMS++KE +VHSSN L VKA +   +   FVKSC+  S
Sbjct: 628  VDYDSEVKRHLDFLVQCRGAFGSMSEVKEMIVHSSNLLVVKATRNDISDVIFVKSCIACS 687

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSIP+H +Q NLY+ETAEVA + GLVSH+DGL+DS + C  N DL +G+    D D
Sbjct: 688  EVTIPSIPSHLKQLNLYLETAEVALMAGLVSHSDGLVDSALRCLHNVDLFEGSRIPKDID 747

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
             + S++CK CSL++++PGN E+G  ++PR++F +++S S M   ++ +            
Sbjct: 748  GFQSTLCKFCSLIVMIPGNIERGVTSIPRNMFSILSSLSWMLPSMKAKVLCALILTVAAL 807

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H +  +V+ N ++F+ D  YLQEL S S V++Q +ID V +EP+Q  RG +AL
Sbjct: 808  SQNNLLYHAIHDEVMGNDSLFYCDQQYLQELFSFSTVLLQSLIDTVLQEPIQAARGNLAL 867

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLDS 39
            +ACN +ASSF+  +    ICS LVE AK SL S+N YL+ST  FL++
Sbjct: 868  DACNAVASSFEVCQGASEICSKLVETAKLSLSSNNKYLQSTIKFLNN 914


>ref|XP_006452424.1| hypothetical protein CICLE_v10007388mg [Citrus clementina]
            gi|557555650|gb|ESR65664.1| hypothetical protein
            CICLE_v10007388mg [Citrus clementina]
          Length = 921

 Score =  580 bits (1494), Expect = e-163
 Identities = 297/584 (50%), Positives = 414/584 (70%)
 Frame = -3

Query: 1793 IAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVRDI 1614
            I  IND+K L+ R    ++     S  + RLL+SLMEPTIEY+MK +FKD + Q QV  +
Sbjct: 328  ITSINDIKILLTRVLSTKEAAHGKSVDNRRLLVSLMEPTIEYIMKCIFKDAS-QRQVGTV 386

Query: 1613 LVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQFLN 1434
            L+ LGLG++Q +LFG  PC SV++HHLLKELP  ++ S A+ ILHLI  S D S+DQ LN
Sbjct: 387  LMELGLGRNQVELFGSNPCVSVVLHHLLKELPTEIVGSYAVEILHLIEYSNDKSYDQCLN 446

Query: 1433 YKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTHLN 1254
            Y+LLG RLCE    +    A V+++IQV +L   LD+++KV+D YVDI+LQNQ+ D HLN
Sbjct: 447  YRLLGFRLCERRPTLDILNAAVDRIIQVVTLLDELDDFLKVVDPYVDIILQNQM-DNHLN 505

Query: 1253 MILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTRNM 1074
             ILE I E  C + I ++ +  LQS   K+L+HF +L D+ +L HF++ILD+M+G++R  
Sbjct: 506  TILEGISERACKKEIVDNDVVGLQSILMKILSHFKDLEDVFALGHFLEILDVMYGSSRIS 565

Query: 1073 INIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNIAE 894
            I++QIL +ATRN C+ DPT +  LFE+ QALHDG+DF+N K D++Q +A+LISRFV + +
Sbjct: 566  IDMQILNMATRNGCINDPTTVQLLFEICQALHDGIDFVNSKGDDYQ-AARLISRFVLMVD 624

Query: 893  FGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFSEV 714
            +G + ER+L+FLV+CRGAFGS+++LKETLVHSSN LA KA++ G  H SFVKSC+ FSEV
Sbjct: 625  YGAEMERHLTFLVECRGAFGSINELKETLVHSSNHLATKALKDGRKHLSFVKSCIAFSEV 684

Query: 713  TIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHDAY 534
            TIPSI  H RQ NLYIET+EVA L GL+SH+DGL+DS I C Q+ DL +G+LT  D D  
Sbjct: 685  TIPSISDHIRQLNLYIETSEVALLAGLISHSDGLVDSAISCLQSVDLINGSLTPVDVDGM 744

Query: 533  LSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXXXX 354
            ++S+ KLCSL+++VPGN E GF +  +S+  L+ SQS +T+K++IR              
Sbjct: 745  VTSIQKLCSLLVIVPGNPELGFTHTLKSILSLITSQSWITSKIKIR-ISCAIVSLSATLS 803

Query: 353  XXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIALEA 174
                 +    ++LSN  +F+GD +Y+QELLS S  ++Q +++I+ +EP    RG +ALEA
Sbjct: 804  QNKLPYNADLEILSNDLLFYGDSSYVQELLSFSEHVLQNLVEIIEQEPSGAARGSMALEA 863

Query: 173  CNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLD 42
            CNCIA+SFK +     +CS L+E AKS+L +++ YL+ST   LD
Sbjct: 864  CNCIAASFKINHNIQPVCSKLIETAKSNLSTNDAYLQSTIKVLD 907


>ref|XP_006365949.1| PREDICTED: UPF0505 protein C16orf62 homolog isoform X2 [Solanum
            tuberosum]
          Length = 922

 Score =  575 bits (1482), Expect = e-161
 Identities = 292/587 (49%), Positives = 405/587 (68%), Gaps = 2/587 (0%)
 Frame = -3

Query: 1793 IAGINDLKNLIMRATFPQQTR--DDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            I  +ND+K L+M               SG     L LMEP IEYVMK +FK+  E  Q+ 
Sbjct: 329  IISMNDMKTLLMNGAHVASAEKPSGALSGTRSSKLGLMEPAIEYVMKCLFKESCE-LQIG 387

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            DIL+GLGL ++QS+LFG   C S+++HHLL+ELPI ++CS+A++ILHLI CS D SFDQ 
Sbjct: 388  DILMGLGLARNQSELFGNSSCVSLVLHHLLRELPIRIVCSNALDILHLIECSNDYSFDQC 447

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNYKLLGLRLCE++S V+E   V+ KVIQV S ++ LDEY+ V+DA+VDI LQ  + D++
Sbjct: 448  LNYKLLGLRLCENISHVNEVNLVMKKVIQVVSQFNSLDEYLNVIDAHVDIALQKHM-DSY 506

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ IL+ IFE   ++ I E+ L+SLQS   KLL HFDNL  IL LNHF  IL +M G++R
Sbjct: 507  LDSILDGIFERTLDDEIGENELSSLQSILLKLLNHFDNLEHILRLNHFNQILSMMQGSSR 566

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
             ++N++IL IATR +C++DPT I FLFEVS++LHD +D   IK  E+ HSA L+SRF+++
Sbjct: 567  TIVNMRILSIATRYSCVRDPTTIQFLFEVSRSLHDSIDLSTIKEKENNHSAHLVSRFIHM 626

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++  + +R+L FLV CRGAFGSMS++KE +VHSSN L VKA +   +   FVKSC+  S
Sbjct: 627  VDYDSEVKRHLDFLVQCRGAFGSMSEVKEMIVHSSNLLVVKATRNDISDVIFVKSCIACS 686

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSIP+H +Q NLY+ETAEVA + GLVSH+DGL+DS + C  N DL +G+    D D
Sbjct: 687  EVTIPSIPSHLKQLNLYLETAEVALMAGLVSHSDGLVDSALRCLHNVDLFEGSRIPKDID 746

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
             + S++CK CSL++++PGN E+G  ++PR++F +++S S M   ++ +            
Sbjct: 747  GFQSTLCKFCSLIVMIPGNIERGVTSIPRNMFSILSSLSWMLPSMKAKVLCALILTVAAL 806

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H +  +V+ N ++F+ D  YLQEL S S V++Q +ID V +EP+Q  RG +AL
Sbjct: 807  SQNNLLYHAIHDEVMGNDSLFYCDQQYLQELFSFSTVLLQSLIDTVLQEPIQAARGNLAL 866

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLDS 39
            +ACN +ASSF+  +    ICS LVE AK SL S+N YL+ST  FL++
Sbjct: 867  DACNAVASSFEVCQGASEICSKLVETAKLSLSSNNKYLQSTIKFLNN 913


>gb|EOY12279.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 922

 Score =  572 bits (1473), Expect = e-160
 Identities = 295/584 (50%), Positives = 408/584 (69%)
 Frame = -3

Query: 1793 IAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVRDI 1614
            I  +ND+K +  R +  ++T     +  +R L+ LMEP IE++MK +F D + + QV  +
Sbjct: 329  ITCVNDIKLVFTRISSAKETAHGCFADSKRSLVGLMEPAIEFIMKCIFNDASLR-QVGQV 387

Query: 1613 LVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQFLN 1434
            LV LGLG+ Q +LFG  PC S+++HHLLKELP  V+ S A++ILHLI CS D S+DQ LN
Sbjct: 388  LVELGLGRSQEELFGGSPCVSIVLHHLLKELPTDVVSSHAVDILHLIKCSNDYSYDQCLN 447

Query: 1433 YKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTHLN 1254
            Y+LLGLRLCE +S++    AVVN+V+QV S Y  LDEY+KV++AY+DI+LQNQ+ D  L 
Sbjct: 448  YRLLGLRLCEQISEIGTVDAVVNEVMQVVSQYG-LDEYLKVVEAYLDILLQNQM-DGQLK 505

Query: 1253 MILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTRNM 1074
             ILE I +L C + I E  LA LQS   KLL+HF +L ++ SLNHF+ ILDLMHG++R++
Sbjct: 506  TILEGILKLACGKVIAEDELAGLQSILVKLLSHFKDLENVFSLNHFLQILDLMHGSSRSI 565

Query: 1073 INIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNIAE 894
            +++ IL +ATRN  ++DPT I  LFE+SQALHD  D  N+K D++Q  A+LIS FV + +
Sbjct: 566  VSMHILDMATRNGYVRDPTTIQLLFEISQALHDDTDLANMKNDDNQQQARLISLFVRMVD 625

Query: 893  FGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFSEV 714
             G ++E +L+FLV+CRGAFGS+ +LKE LVHSSN LA KA++ G  H SFVKSC+ FSEV
Sbjct: 626  HGAEYEGHLAFLVECRGAFGSIIELKEFLVHSSNCLATKALKDGKTHLSFVKSCIAFSEV 685

Query: 713  TIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHDAY 534
            TIPSI  H +Q +LY+ETAEVA LGGLVSH DGLIDS I C Q+FD  +G+  + D D  
Sbjct: 686  TIPSILGHIKQLHLYLETAEVALLGGLVSHCDGLIDSAISCLQSFDWMEGSRVAVDSDRI 745

Query: 533  LSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXXXX 354
            LS + KLCSL+++VPGN E G +++P+S+  L++SQS  + +++ R              
Sbjct: 746  LSFIRKLCSLLVMVPGNPEVGILHIPKSILSLIHSQS-WSPRMKARIFCAIVSLSATLSQ 804

Query: 353  XXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIALEA 174
                 H +  ++L N  +FFGD +Y+ ELLSL+  ++Q ++ ++ +EP Q  RG ++LEA
Sbjct: 805  GRLPYHAVHPEILGNDLLFFGDSSYVHELLSLTESVLQNLVGLIEQEPSQAARGSMSLEA 864

Query: 173  CNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLD 42
            CNCIASSFK +E    ICS L+E AK  L  ++ YL ST +FLD
Sbjct: 865  CNCIASSFKLNEHVLPICSKLIETAKLCLSPNDKYLMSTISFLD 908


>gb|EOY12278.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 920

 Score =  570 bits (1468), Expect = e-160
 Identities = 294/584 (50%), Positives = 407/584 (69%)
 Frame = -3

Query: 1793 IAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVRDI 1614
            I  +ND+K +  R +  ++T     +  +R L+ LMEP IE++MK +F   N+ + V  +
Sbjct: 329  ITCVNDIKLVFTRISSAKETAHGCFADSKRSLVGLMEPAIEFIMKCIF---NDASLVGQV 385

Query: 1613 LVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQFLN 1434
            LV LGLG+ Q +LFG  PC S+++HHLLKELP  V+ S A++ILHLI CS D S+DQ LN
Sbjct: 386  LVELGLGRSQEELFGGSPCVSIVLHHLLKELPTDVVSSHAVDILHLIKCSNDYSYDQCLN 445

Query: 1433 YKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTHLN 1254
            Y+LLGLRLCE +S++    AVVN+V+QV S Y  LDEY+KV++AY+DI+LQNQ+ D  L 
Sbjct: 446  YRLLGLRLCEQISEIGTVDAVVNEVMQVVSQYG-LDEYLKVVEAYLDILLQNQM-DGQLK 503

Query: 1253 MILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTRNM 1074
             ILE I +L C + I E  LA LQS   KLL+HF +L ++ SLNHF+ ILDLMHG++R++
Sbjct: 504  TILEGILKLACGKVIAEDELAGLQSILVKLLSHFKDLENVFSLNHFLQILDLMHGSSRSI 563

Query: 1073 INIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNIAE 894
            +++ IL +ATRN  ++DPT I  LFE+SQALHD  D  N+K D++Q  A+LIS FV + +
Sbjct: 564  VSMHILDMATRNGYVRDPTTIQLLFEISQALHDDTDLANMKNDDNQQQARLISLFVRMVD 623

Query: 893  FGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFSEV 714
             G ++E +L+FLV+CRGAFGS+ +LKE LVHSSN LA KA++ G  H SFVKSC+ FSEV
Sbjct: 624  HGAEYEGHLAFLVECRGAFGSIIELKEFLVHSSNCLATKALKDGKTHLSFVKSCIAFSEV 683

Query: 713  TIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHDAY 534
            TIPSI  H +Q +LY+ETAEVA LGGLVSH DGLIDS I C Q+FD  +G+  + D D  
Sbjct: 684  TIPSILGHIKQLHLYLETAEVALLGGLVSHCDGLIDSAISCLQSFDWMEGSRVAVDSDRI 743

Query: 533  LSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXXXX 354
            LS + KLCSL+++VPGN E G +++P+S+  L++SQS  + +++ R              
Sbjct: 744  LSFIRKLCSLLVMVPGNPEVGILHIPKSILSLIHSQS-WSPRMKARIFCAIVSLSATLSQ 802

Query: 353  XXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIALEA 174
                 H +  ++L N  +FFGD +Y+ ELLSL+  ++Q ++ ++ +EP Q  RG ++LEA
Sbjct: 803  GRLPYHAVHPEILGNDLLFFGDSSYVHELLSLTESVLQNLVGLIEQEPSQAARGSMSLEA 862

Query: 173  CNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLD 42
            CNCIASSFK +E    ICS L+E AK  L  ++ YL ST +FLD
Sbjct: 863  CNCIASSFKLNEHVLPICSKLIETAKLCLSPNDKYLMSTISFLD 906


>ref|XP_004251467.1| PREDICTED: UPF0505 protein C16orf62 homolog [Solanum lycopersicum]
          Length = 917

 Score =  567 bits (1462), Expect = e-159
 Identities = 290/590 (49%), Positives = 404/590 (68%), Gaps = 2/590 (0%)
 Frame = -3

Query: 1793 IAGINDLKNLIMRATFPQQTR--DDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            I  +ND+K L+M       T+      SG     L LMEP IEYVMK +FK+  E  Q+ 
Sbjct: 323  IISMNDMKILLMNGAHVLSTKKPSGALSGTRSSKLGLMEPAIEYVMKCLFKESCELLQIG 382

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            DIL+GLGL ++QS+LFG   C S+++HHLL+ELPI ++CS+A++ILHLI CS D SFDQ 
Sbjct: 383  DILMGLGLARNQSELFGNSSCVSLVLHHLLRELPIRIVCSNALDILHLIECSNDYSFDQC 442

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNYKLLGLRLCE++S V+E   V+ KVIQV S ++ LDEY+ V+DA+VDI LQ  + +++
Sbjct: 443  LNYKLLGLRLCENISHVNEVNLVMKKVIQVVSQFNSLDEYLNVVDAHVDIALQKHM-NSY 501

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ IL+ IFE   ++ I E+ L+SLQS   K+L HFDNL +IL LNHF  IL +M G++R
Sbjct: 502  LDSILDGIFERTLDDEIGENELSSLQSILLKILNHFDNLENILRLNHFNQILSVMQGSSR 561

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
             ++N QIL IATRN+C++DPT I FLFEVS++LHD ++   IK  E+ HSA L+SRF+++
Sbjct: 562  TIVNTQILSIATRNSCIRDPTTIQFLFEVSRSLHDSINLSTIKEKENNHSAHLVSRFIHM 621

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++  + E +L FLV CRGAFGSMS++KE +VHSSN L VKA +   +   FVKSC+  S
Sbjct: 622  VDYDSEVELHLDFLVQCRGAFGSMSEVKEMIVHSSNLLVVKATRNDISDVIFVKSCIACS 681

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTI SIP+H +Q NLY+ETAEVA + GLVS++DGL+DS + C  N DL +G+    D D
Sbjct: 682  EVTISSIPSHLKQLNLYLETAEVALMAGLVSNSDGLVDSALRCLHNVDLFEGSRMPKDID 741

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
             + S++CK CSL++++PGN E+G  ++PR++F +++S S M   ++ +            
Sbjct: 742  GFQSTLCKFCSLIVMIPGNIERGVTSIPRNMFSILSSLSWMLPSMKAKMLCALILTVAAL 801

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H    +V+ N ++F+ D  YLQEL S S V++Q +ID V +EP+Q  RG +AL
Sbjct: 802  SQNNLLYHATHDEVMGNDSLFYCDQQYLQELSSFSAVLLQSLIDTVVQEPIQAARGNLAL 861

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLDSTSL 30
            +ACN IASSF+  +      S LVE AK SL S+N YL+ST  FL++  L
Sbjct: 862  DACNAIASSFEVCQGASDFSSKLVETAKLSLSSNNKYLQSTIEFLNNRGL 911


>gb|EOY12280.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 895

 Score =  562 bits (1448), Expect = e-157
 Identities = 295/587 (50%), Positives = 403/587 (68%), Gaps = 3/587 (0%)
 Frame = -3

Query: 1793 IAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVRDI 1614
            I  +ND+K +  R +  ++T     +  +R L+ LMEP IE++MK +F D + + QV  +
Sbjct: 329  ITCVNDIKLVFTRISSAKETAHGCFADSKRSLVGLMEPAIEFIMKCIFNDASLR-QVGQV 387

Query: 1613 LVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQFLN 1434
            LV LGLG+ Q +LFG  PC S+++HHLLKELP  V+ S A++ILHLI CS D S+DQ LN
Sbjct: 388  LVELGLGRSQEELFGGSPCVSIVLHHLLKELPTDVVSSHAVDILHLIKCSNDYSYDQCLN 447

Query: 1433 YKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTHLN 1254
            Y+LLGLRLCE +S++    AVVN+V+QV S Y  LDEY+KV++AY+DI+LQNQ+ D  L 
Sbjct: 448  YRLLGLRLCEQISEIGTVDAVVNEVMQVVSQYG-LDEYLKVVEAYLDILLQNQM-DGQLK 505

Query: 1253 MILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTRNM 1074
             ILE I +L C + I E  LA LQS   KLL+HF +L ++ SLNHF+ ILDLMHG++R++
Sbjct: 506  TILEGILKLACGKVIAEDELAGLQSILVKLLSHFKDLENVFSLNHFLQILDLMHGSSRSI 565

Query: 1073 INIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNIAE 894
            +++ IL +ATRN  ++DPT I  LFE+SQALHD  D  N+K D++Q  A+LIS FV + +
Sbjct: 566  VSMHILDMATRNGYVRDPTTIQLLFEISQALHDDTDLANMKNDDNQQQARLISLFVRMVD 625

Query: 893  FGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFSEV 714
             G ++E +L+FLV+CRGAFGS+ +LKE LVHSSN LA KA++ G  H SFVKSC+ FSEV
Sbjct: 626  HGAEYEGHLAFLVECRGAFGSIIELKEFLVHSSNCLATKALKDGKTHLSFVKSCIAFSEV 685

Query: 713  TIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHDAY 534
            TIPSI  H +Q +LY+ETAEVA LGGLVSH DGLIDS I C Q+FD  +G+  + D D  
Sbjct: 686  TIPSILGHIKQLHLYLETAEVALLGGLVSHCDGLIDSAISCLQSFDWMEGSRVAVDSDRI 745

Query: 533  LSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQS---RMTAKLQIRGXXXXXXXXXX 363
            LS + KLCSL+++VPGN E G +++P+S+  L++SQS   RM                  
Sbjct: 746  LSFIRKLCSLLVMVPGNPEVGILHIPKSILSLIHSQSWSPRM------------------ 787

Query: 362  XXXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIA 183
                         ++L N  +FFGD +Y+ ELLSL+  ++Q ++ ++ +EP Q  RG ++
Sbjct: 788  -------------KILGNDLLFFGDSSYVHELLSLTESVLQNLVGLIEQEPSQAARGSMS 834

Query: 182  LEACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLD 42
            LEACNCIASSFK +E    ICS L+E AK  L  ++ YL ST +FLD
Sbjct: 835  LEACNCIASSFKLNEHVLPICSKLIETAKLCLSPNDKYLMSTISFLD 881


>ref|XP_006365950.1| PREDICTED: UPF0505 protein C16orf62 homolog isoform X3 [Solanum
            tuberosum]
          Length = 878

 Score =  551 bits (1421), Expect = e-154
 Identities = 274/551 (49%), Positives = 382/551 (69%), Gaps = 2/551 (0%)
 Frame = -3

Query: 1793 IAGINDLKNLIMRATFPQQTR--DDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            I  +ND+K L+M               SG     L LMEP IEYVMK +FK+  E  Q+ 
Sbjct: 329  IISMNDMKTLLMNGAHVASAEKPSGALSGTRSSKLGLMEPAIEYVMKCLFKESCEHLQIG 388

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            DIL+GLGL ++QS+LFG   C S+++HHLL+ELPI ++CS+A++ILHLI CS D SFDQ 
Sbjct: 389  DILMGLGLARNQSELFGNSSCVSLVLHHLLRELPIRIVCSNALDILHLIECSNDYSFDQC 448

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNYKLLGLRLCE++S V+E   V+ KVIQV S ++ LDEY+ V+DA+VDI LQ  + D++
Sbjct: 449  LNYKLLGLRLCENISHVNEVNLVMKKVIQVVSQFNSLDEYLNVIDAHVDIALQKHM-DSY 507

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ IL+ IFE   ++ I E+ L+SLQS   KLL HFDNL  IL LNHF  IL +M G++R
Sbjct: 508  LDSILDGIFERTLDDEIGENELSSLQSILLKLLNHFDNLEHILRLNHFNQILSMMQGSSR 567

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
             ++N++IL IATR +C++DPT I FLFEVS++LHD +D   IK  E+ HSA L+SRF+++
Sbjct: 568  TIVNMRILSIATRYSCVRDPTTIQFLFEVSRSLHDSIDLSTIKEKENNHSAHLVSRFIHM 627

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++  + +R+L FLV CRGAFGSMS++KE +VHSSN L VKA +   +   FVKSC+  S
Sbjct: 628  VDYDSEVKRHLDFLVQCRGAFGSMSEVKEMIVHSSNLLVVKATRNDISDVIFVKSCIACS 687

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSIP+H +Q NLY+ETAEVA + GLVSH+DGL+DS + C  N DL +G+    D D
Sbjct: 688  EVTIPSIPSHLKQLNLYLETAEVALMAGLVSHSDGLVDSALRCLHNVDLFEGSRIPKDID 747

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
             + S++CK CSL++++PGN E+G  ++PR++F +++S S M   ++ +            
Sbjct: 748  GFQSTLCKFCSLIVMIPGNIERGVTSIPRNMFSILSSLSWMLPSMKAKVLCALILTVAAL 807

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H +  +V+ N ++F+ D  YLQEL S S V++Q +ID V +EP+Q  RG +AL
Sbjct: 808  SQNNLLYHAIHDEVMGNDSLFYCDQQYLQELFSFSTVLLQSLIDTVLQEPIQAARGNLAL 867

Query: 179  EACNCIASSFK 147
            +ACN +ASSF+
Sbjct: 868  DACNAVASSFE 878


>ref|XP_002529445.1| esophageal cancer associated protein, putative [Ricinus communis]
            gi|223531061|gb|EEF32911.1| esophageal cancer associated
            protein, putative [Ricinus communis]
          Length = 925

 Score =  538 bits (1387), Expect = e-150
 Identities = 279/586 (47%), Positives = 404/586 (68%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y I  +ND+K L+      +   D   +G  RLL+SL+EP IEY+MK +F++ + Q+QV 
Sbjct: 335  YLITCVNDIKILLGDLLSTKGPPDKQFAGKIRLLVSLIEPAIEYIMKCIFENAS-QSQVH 393

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
             +LV +GLG++        PC S+++H+LLKELP  VI S+A++ILHLI  S D SFDQ+
Sbjct: 394  SVLVEIGLGRNF-------PCVSIVLHNLLKELPTEVISSNAVDILHLIKGSNDYSFDQY 446

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LN++LLG RL ES SQ+    +V+++VIQ  + Y +LDEY+KV+DAYV+IVLQNQ+ D +
Sbjct: 447  LNFRLLGFRLAESRSQMDIINSVMDEVIQAIAEYDKLDEYLKVVDAYVEIVLQNQM-DNY 505

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            LN++LE ++   C++   E     LQS   KLL+H  +LN++LSL HF+DILD+M+G++R
Sbjct: 506  LNILLEGLYTRACSKEAVEDEQGCLQSIMLKLLSHLKDLNNVLSLKHFLDILDVMYGSSR 565

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            + I++ IL +ATR   + DP+ I  LFE+SQ+LHDG+DF ++K D++Q  A LI RFV +
Sbjct: 566  SFIDMHILNMATRYGQIHDPSTIQLLFEISQSLHDGIDFASMKDDDNQQPAHLICRFVQM 625

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + E++L+FLV+CRGAFGS+++LKETLVHSSN LA KA++ G  H + VKSCL FS
Sbjct: 626  VDYGAEMEQHLTFLVECRGAFGSVNELKETLVHSSNYLATKALKDGKKHLTLVKSCLAFS 685

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSI A  RQ NLY+ETAEVA LGGL+SH+DGLI S I C +N D A G+ T +D D
Sbjct: 686  EVTIPSIAAQVRQLNLYLETAEVALLGGLISHSDGLIISAISCLENVDFAGGSQTPTDVD 745

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
              LSS+ KLCSL+++VPGN +QG  N+P S+  L+ S+S MT +++ +            
Sbjct: 746  GILSSIRKLCSLLVMVPGNSDQGVTNIPSSIVSLICSRSWMTPRMKTKFFCAIILLLATL 805

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H+   ++L N  ++FGD +Y+ EL+S+S  ++  ++  +  EP +  RG +AL
Sbjct: 806  SQNKLPYHVCNSEILGNDLLYFGDSSYVHELVSMSESVLWNLVKFIELEPSKAARGSLAL 865

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLD 42
            EACNCIA SFK SE    +C  L+E A+  L +++ +L+ST  +LD
Sbjct: 866  EACNCIALSFKVSEDILQVCWKLIETAELCLSTNDRFLQSTIKYLD 911


>ref|XP_003545120.1| PREDICTED: UPF0505 protein-like isoform X1 [Glycine max]
          Length = 913

 Score =  517 bits (1331), Expect = e-144
 Identities = 278/589 (47%), Positives = 395/589 (67%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y +  +ND++ ++M+     +        +++L +SLMEPTIEY+MK +F  L+ Q QV 
Sbjct: 319  YLVTCVNDIRVVLMQILSANERTHKNVKLNKKLQVSLMEPTIEYIMKCIFTGLS-QRQVN 377

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            ++L   GL K+Q DL G   C S+I+HHLLKELPI V+ S+ + ILHLI  S D SFDQ 
Sbjct: 378  EVLSEFGLMKNQQDL-GSVSCVSIILHHLLKELPIEVVSSNVVQILHLIEFSKDNSFDQH 436

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            +NY+LLG RL E  S V    AV++KVIQV +LY  LDEY+KV+DAY D++LQNQ+ D H
Sbjct: 437  MNYRLLGFRLYERKSPVDIVDAVLDKVIQVIALYDSLDEYLKVVDAYTDLILQNQM-DNH 495

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L +ILE I +   N+G+ E  + SLQS   KLL+HF +L D+ SL+ F +ILD+M+G ++
Sbjct: 496  LKIILEGISKRTWNKGVTEDEMPSLQSLVVKLLSHFKHLEDVFSLDQFPEILDVMYGKSQ 555

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            +++ + IL +ATRN  + DPT I  LFE+S ALH+ ++F+N+K D+ Q +   I+RFV++
Sbjct: 556  DVVFLHILNMATRNGRISDPTSIQLLFEISLALHNNIEFMNMKDDDGQVACS-IARFVHM 614

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + E +L+FLVDCRGAFG +++LKETLVHSSN LA++A++    H +FVKSC+TFS
Sbjct: 615  VDYGTEMEHHLAFLVDCRGAFGRLNELKETLVHSSNSLAIQALKCAKKHLNFVKSCVTFS 674

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSI AH RQ++L++ETAEVAFLGGLVSH+DGLIDS I C    D+ DG  T +D +
Sbjct: 675  EVTIPSISAH-RQFDLFLETAEVAFLGGLVSHSDGLIDSAISCLHTLDIIDGFRTPTDVE 733

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
              +SS+ KLC  +I+VPG         P S+F L++S+S    K++ +            
Sbjct: 734  GLVSSIRKLCGFLIMVPGTLSLPVTYFPNSLFTLISSRSWFEPKMRAQIFSAIILLLTTL 793

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H    Q+  N  +++GD +Y QEL+SLS ++++ ++  V +EP Q  RG +AL
Sbjct: 794  SQKRLPYH-ANSQIPGNDMLYYGDSSYNQELVSLSKLVLENLLSAVQQEPSQAARGIMAL 852

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLDSTS 33
            EACNCIASSF  S +  + C TLVE AKS L + + YL+ST   L+  S
Sbjct: 853  EACNCIASSFMLSNELLSSCLTLVETAKSCLSAKDRYLQSTIQLLNKQS 901


>ref|XP_006595724.1| PREDICTED: UPF0505 protein-like isoform X2 [Glycine max]
          Length = 914

 Score =  512 bits (1319), Expect = e-142
 Identities = 278/590 (47%), Positives = 395/590 (66%), Gaps = 1/590 (0%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y +  +ND++ ++M+     +        +++L +SLMEPTIEY+MK +F  L+ Q QV 
Sbjct: 319  YLVTCVNDIRVVLMQILSANERTHKNVKLNKKLQVSLMEPTIEYIMKCIFTGLS-QRQVN 377

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            ++L   GL K+Q DL G   C S+I+HHLLKELPI V+ S+ + ILHLI  S D SFDQ 
Sbjct: 378  EVLSEFGLMKNQQDL-GSVSCVSIILHHLLKELPIEVVSSNVVQILHLIEFSKDNSFDQH 436

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            +NY+LLG RL E  S V    AV++KVIQV +LY  LDEY+KV+DAY D++LQNQ+ D H
Sbjct: 437  MNYRLLGFRLYERKSPVDIVDAVLDKVIQVIALYDSLDEYLKVVDAYTDLILQNQM-DNH 495

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L +ILE I +   N+G+ E  + SLQS   KLL+HF +L D+ SL+ F +ILD+M+G ++
Sbjct: 496  LKIILEGISKRTWNKGVTEDEMPSLQSLVVKLLSHFKHLEDVFSLDQFPEILDVMYGKSQ 555

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            +++ + IL +ATRN  + DPT I  LFE+S ALH+ ++F+N+K D+ Q +   I+RFV++
Sbjct: 556  DVVFLHILNMATRNGRISDPTSIQLLFEISLALHNNIEFMNMKDDDGQVACS-IARFVHM 614

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + E +L+FLVDCRGAFG +++LKETLVHSSN LA++A++    H +FVKSC+TFS
Sbjct: 615  VDYGTEMEHHLAFLVDCRGAFGRLNELKETLVHSSNSLAIQALKCAKKHLNFVKSCVTFS 674

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSI AH RQ++L++ETAEVAFLGGLVSH+DGLIDS I C    D+ DG  T +D +
Sbjct: 675  EVTIPSISAH-RQFDLFLETAEVAFLGGLVSHSDGLIDSAISCLHTLDIIDGFRTPTDVE 733

Query: 539  AYLSSVCKLCSLVILVPG-NFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXX 363
              +SS+ KLC  +I+VPG          P S+F L++S+S    K++ +           
Sbjct: 734  GLVSSIRKLCGFLIMVPGCTLSLPVTYFPNSLFTLISSRSWFEPKMRAQIFSAIILLLTT 793

Query: 362  XXXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIA 183
                    H    Q+  N  +++GD +Y QEL+SLS ++++ ++  V +EP Q  RG +A
Sbjct: 794  LSQKRLPYH-ANSQIPGNDMLYYGDSSYNQELVSLSKLVLENLLSAVQQEPSQAARGIMA 852

Query: 182  LEACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLDSTS 33
            LEACNCIASSF  S +  + C TLVE AKS L + + YL+ST   L+  S
Sbjct: 853  LEACNCIASSFMLSNELLSSCLTLVETAKSCLSAKDRYLQSTIQLLNKQS 902


>gb|EXB66322.1| hypothetical protein L484_008062 [Morus notabilis]
          Length = 949

 Score =  499 bits (1286), Expect = e-138
 Identities = 271/586 (46%), Positives = 374/586 (63%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y +  +ND+K L+ R    +         + RLL+SLMEPTIE+ MK +FKD + Q QV 
Sbjct: 383  YLVRSVNDIKMLLSRIIPAKGAVVRNIKDNNRLLVSLMEPTIEFSMKCMFKDAS-QRQVG 441

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
             IL+ LGLG+++ +LFG  PC SV++HHLLKELP  V  SSA+ ILH+I CS D SF+Q 
Sbjct: 442  KILMELGLGRNEEELFGTFPCVSVVLHHLLKELPTEVFSSSAVKILHVIECSNDNSFNQV 501

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
             N                               Y   DEY+KV+DA+VDI+L+NQ+ D H
Sbjct: 502  ANQ------------------------------YENFDEYLKVVDAFVDIILENQM-DCH 530

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            LN+ILE I    C+ G  E   ASLQS   KLL+H + + D+++LNHF++ILD+++G++R
Sbjct: 531  LNIILEGISRRACSTGTAEDEQASLQSILVKLLSHHNRIEDVVALNHFLEILDILYGSSR 590

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
             ++N+ IL +ATRN  + DPT I  LFE+SQAL+D +DF+N+K  ++Q   +LISRFVN+
Sbjct: 591  TIVNMHILNMATRNGYICDPTTIQLLFEISQALYDAIDFVNVKDADNQ-PGRLISRFVNM 649

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + ER+L+FLV+CRGAFG +  LKE L+HSSN LAVKA++ GS H SF+KSC+ F 
Sbjct: 650  VDYGVEMERHLTFLVECRGAFGGIDGLKEILIHSSNFLAVKALKDGSKHHSFIKSCIAFG 709

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVT+PSI +   Q NLY+ETAEVA LGGLVSH++GL++S I C Q+ D  DG+    D D
Sbjct: 710  EVTLPSISSQISQLNLYLETAEVALLGGLVSHSEGLLNSAISCLQSLDRMDGSKVPKDVD 769

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
              LS V KLCSL++++PGN E G      ++  LVNSQS    K++ +            
Sbjct: 770  WILSLVRKLCSLLVMIPGNTELGATYFLNTILVLVNSQSWAKPKMRAKAFCSIISLSATL 829

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                    +  G+V  N  +++GD++YL EL S S +++Q +ID + +EP    RG +AL
Sbjct: 830  SQNKLPYRVDHGKVPGNDYLYYGDLSYLHELASFSKLVLQHLIDSIQQEPSLAARGSLAL 889

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLD 42
            EACNCIASSF  S +   ICS L+E AKS L + + YL  T  FLD
Sbjct: 890  EACNCIASSFAPSPEISLICSKLMETAKSCLSTRDRYLHLTFKFLD 935


>gb|ESW14309.1| hypothetical protein PHAVU_008G270200g [Phaseolus vulgaris]
          Length = 779

 Score =  494 bits (1273), Expect = e-137
 Identities = 265/586 (45%), Positives = 387/586 (66%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y +  +ND++ ++++     +        + +L +SLMEPTIEY+MK VF  L  QTQV 
Sbjct: 198  YLVTCVNDIRVILIQILSANERSHKNVKLNIKLQVSLMEPTIEYIMKCVFNGLT-QTQVN 256

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            ++L  LGL K+Q +L G   C S+I+HHLLKELPI V+ S+ ++ILHLI  S D SF Q 
Sbjct: 257  EVLSELGLMKNQQEL-GSVSCVSIILHHLLKELPIEVVNSNVVHILHLIEFSKDNSFGQH 315

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            +NY+LLG R+ E  S V     V++KVIQV +LY  LDEY+KV+DAY D++LQN++ D H
Sbjct: 316  MNYRLLGFRMHERKSPVHIVNDVLDKVIQVIALYDSLDEYLKVVDAYTDLILQNKM-DNH 374

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            LN ILE I     N+ + E  + SLQS   KLL+HF +L D+  L  F +ILD+++G ++
Sbjct: 375  LNAILEGISNRAWNKTVTEDEMLSLQSLIVKLLSHFKHLEDVFCLVQFPEILDVLYGKSQ 434

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            +++ + IL + TRN+ + DPT I  LFE++Q LHD ++F+N+K D+ Q  A+ ISRFV++
Sbjct: 435  DVVFLHILNMVTRNDHISDPTSIQLLFEIAQTLHDNIEFMNVKDDDGQ-VARSISRFVHM 493

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + E+ L+FLV+CRGAFG  ++LKETLVHS N LA++A++    + SF KSC+TFS
Sbjct: 494  VDYGAEMEQQLAFLVNCRGAFGRFNELKETLVHSCNSLAIQALKCAKKNLSFFKSCVTFS 553

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPS+ AH RQ++L++ETAEVAFLGGLVSH+DGLIDS I C    D+ DG  T +  +
Sbjct: 554  EVTIPSVSAH-RQFDLFLETAEVAFLGGLVSHSDGLIDSAITCLHTLDIIDGFRTPTGVE 612

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
              +SS+ KLC  +I+VPG F       P ++F L++S+S    K++ +            
Sbjct: 613  GLVSSIRKLCGFLIMVPGTFSLPVTYFPNNLFTLISSRSCFEPKMRTQ-IFSAIILLLTT 671

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   +    Q+L N  +++GD +Y QEL+SLS ++++ ++  V +EP Q  RG +AL
Sbjct: 672  LSQKRLPYRANTQILGNDMLYYGDSSYNQELVSLSKLVLENLLSAVQQEPSQAARGILAL 731

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLD 42
            E CNCIASSF  + +   +C TL+E AKS L + + YL+ST   L+
Sbjct: 732  EVCNCIASSFMLNSELSPVCLTLIETAKSCLSAQDRYLQSTIQLLN 777


>gb|ESW14308.1| hypothetical protein PHAVU_008G270200g [Phaseolus vulgaris]
          Length = 900

 Score =  494 bits (1273), Expect = e-137
 Identities = 265/586 (45%), Positives = 387/586 (66%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y +  +ND++ ++++     +        + +L +SLMEPTIEY+MK VF  L  QTQV 
Sbjct: 319  YLVTCVNDIRVILIQILSANERSHKNVKLNIKLQVSLMEPTIEYIMKCVFNGLT-QTQVN 377

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
            ++L  LGL K+Q +L G   C S+I+HHLLKELPI V+ S+ ++ILHLI  S D SF Q 
Sbjct: 378  EVLSELGLMKNQQEL-GSVSCVSIILHHLLKELPIEVVNSNVVHILHLIEFSKDNSFGQH 436

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            +NY+LLG R+ E  S V     V++KVIQV +LY  LDEY+KV+DAY D++LQN++ D H
Sbjct: 437  MNYRLLGFRMHERKSPVHIVNDVLDKVIQVIALYDSLDEYLKVVDAYTDLILQNKM-DNH 495

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            LN ILE I     N+ + E  + SLQS   KLL+HF +L D+  L  F +ILD+++G ++
Sbjct: 496  LNAILEGISNRAWNKTVTEDEMLSLQSLIVKLLSHFKHLEDVFCLVQFPEILDVLYGKSQ 555

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            +++ + IL + TRN+ + DPT I  LFE++Q LHD ++F+N+K D+ Q  A+ ISRFV++
Sbjct: 556  DVVFLHILNMVTRNDHISDPTSIQLLFEIAQTLHDNIEFMNVKDDDGQ-VARSISRFVHM 614

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + E+ L+FLV+CRGAFG  ++LKETLVHS N LA++A++    + SF KSC+TFS
Sbjct: 615  VDYGAEMEQQLAFLVNCRGAFGRFNELKETLVHSCNSLAIQALKCAKKNLSFFKSCVTFS 674

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPS+ AH RQ++L++ETAEVAFLGGLVSH+DGLIDS I C    D+ DG  T +  +
Sbjct: 675  EVTIPSVSAH-RQFDLFLETAEVAFLGGLVSHSDGLIDSAITCLHTLDIIDGFRTPTGVE 733

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
              +SS+ KLC  +I+VPG F       P ++F L++S+S    K++ +            
Sbjct: 734  GLVSSIRKLCGFLIMVPGTFSLPVTYFPNNLFTLISSRSCFEPKMRTQ-IFSAIILLLTT 792

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   +    Q+L N  +++GD +Y QEL+SLS ++++ ++  V +EP Q  RG +AL
Sbjct: 793  LSQKRLPYRANTQILGNDMLYYGDSSYNQELVSLSKLVLENLLSAVQQEPSQAARGILAL 852

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFLD 42
            E CNCIASSF  + +   +C TL+E AKS L + + YL+ST   L+
Sbjct: 853  EVCNCIASSFMLNSELSPVCLTLIETAKSCLSAQDRYLQSTIQLLN 898


>ref|NP_175488.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332194463|gb|AEE32584.1| uncharacterized protein
            AT1G50730 [Arabidopsis thaliana]
          Length = 923

 Score =  493 bits (1268), Expect = e-136
 Identities = 254/585 (43%), Positives = 386/585 (65%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y I  I D+++++      ++    ++  D++LL SL+EP IEY+MK +F    ++  V 
Sbjct: 336  YLIKCIKDIEDVLAPVLVDKEGYSYITD-DKKLLFSLVEPAIEYIMKCLFLTGRQENNVL 394

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
             IL  LG G+++          S+++H+LLKELP  ++ S AM IL +I CS DCSF Q 
Sbjct: 395  GILEELGFGRNKFQSSYNSSHVSILLHYLLKELPSELVSSLAMEILDMIRCSNDCSFSQV 454

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNY+LLG RL E  SQ     +++++VIQ  S Y  L +Y++++DAYVD++LQN++++ H
Sbjct: 455  LNYRLLGNRLSEGKSQEGFLSSLIDEVIQAASQYQSLYDYLRIMDAYVDLMLQNKMEN-H 513

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ +L+ I  L  ++ + E   ASLQS   KLL+HF+NL ++L LNHFI+ILDLM GT++
Sbjct: 514  LDALLDDIVSLARDKFLSEEEQASLQSIILKLLSHFENLQEVLPLNHFIEILDLMSGTSK 573

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            + +N+ +L + TRN C+ D T +  LFEVSQAL+D  DF+NIK D+++ ++ LISRFV +
Sbjct: 574  SSVNMHLLNMGTRNGCICDSTTVQLLFEVSQALYDATDFVNIKDDDNRQTSHLISRFVEM 633

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + ER+L FL +CR AF  + +LKETLV SSN LAVKA++ G  H +FVKSCL FS
Sbjct: 634  VDYGAEMERHLLFLAECREAFNGIHELKETLVRSSNTLAVKALKAGKKHINFVKSCLAFS 693

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSI + T+  NLY+ETAEVA LGGL+SH+D L+ S +   +N  L DG L S D D
Sbjct: 694  EVTIPSISSPTKHLNLYLETAEVALLGGLISHSDELVMSAVEYLENVVLTDG-LKSIDID 752

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
            +  S +CKLCSL++++PGN E+G M + +S+F    S S  T +++++            
Sbjct: 753  SMASVICKLCSLLVMIPGNPEKGVMEILKSIFSATRSSSWATLRVKVKIFCAIMSLLSTL 812

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H    +++ N  +FFGD +Y QEL+S + +++ +++D + +E  Q +RG +AL
Sbjct: 813  SQDNLPYHSANPEIIGNELLFFGDSSYKQELVSCTQLVLSELLDAIEQESSQISRGNMAL 872

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFL 45
            EACNCI+S+   +EK   +C  L+E AK  LG+++ Y++ST+  L
Sbjct: 873  EACNCISSALVMNEKVKELCLRLLETAKGCLGANDRYIESTKKSL 917


>ref|XP_006306720.1| hypothetical protein CARUB_v10008246mg [Capsella rubella]
            gi|482575431|gb|EOA39618.1| hypothetical protein
            CARUB_v10008246mg [Capsella rubella]
          Length = 917

 Score =  490 bits (1261), Expect = e-136
 Identities = 254/585 (43%), Positives = 381/585 (65%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y I  I D+++++      ++    ++  D++L  SL+EP IEY+MK +F    ++  V 
Sbjct: 334  YLIKCIKDIEDVLAPILVDKEGYSYITD-DKKLFFSLIEPAIEYIMKCLFLTGRQEKNVL 392

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
             +L  LG G+ +          S+++H+LLKELP  ++ S AM IL +I CS DCSF Q 
Sbjct: 393  GMLEELGFGRKKLHSSYNPSHMSILLHYLLKELPSELVSSLAMEILDMIKCSNDCSFSQV 452

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNYKLLG RL E  SQ     +++N+VIQ  S Y  L +Y++++DAYVD+ LQN++++ H
Sbjct: 453  LNYKLLGTRLSEGKSQDGFLSSLINEVIQAASQYQSLYDYLRIIDAYVDLTLQNKMEN-H 511

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ +L+ I  L C++ + E   ASLQS   KLL+HF+NL ++LSLNHFI+ILDLM GT++
Sbjct: 512  LDALLDDIVRLSCDKFLTEEEQASLQSIILKLLSHFENLQEVLSLNHFIEILDLMSGTSK 571

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            + +N+ +L + TRN C+ D T +  LFEVSQAL+D  DF+ IK D+++ ++ LISRFV +
Sbjct: 572  SSVNMHLLNMGTRNGCISDSTTVQLLFEVSQALYDATDFVTIKDDDNRQTSHLISRFVEM 631

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + ER+L FL +CR AF  + +LKETLV SSN LAVKA++ G  H +FVKSCL FS
Sbjct: 632  VDYGAEMERHLMFLAECREAFNGIHELKETLVRSSNTLAVKALKAGKKHINFVKSCLAFS 691

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPS+   T+  NLY+ETAEVA LGGL+SH+DGL+ S +   +N    DG L S D D
Sbjct: 692  EVTIPSVSTPTKHLNLYLETAEVALLGGLISHSDGLVMSAVEYLENVAGTDG-LRSIDVD 750

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
            +  S VCKLCSL+++VPGN E+  M + +S+F    S S    +L+++            
Sbjct: 751  SMASVVCKLCSLLVMVPGNPEKDVMEILQSIFSATCSSSWAMQRLKVKLFCAIISLSSTL 810

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H    +++ N  +FFGD +Y QEL+S + +++ ++++ + +E  Q  RG +AL
Sbjct: 811  SQDNLPYHCANPEIIGNDLLFFGDSSYKQELVSFTQLVLGELLNAIEKESSQIVRGNLAL 870

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDNIYLKSTRNFL 45
            EACNCI+S+   +EK   +C  L+E AK  LG+++ Y++ST+ +L
Sbjct: 871  EACNCISSALVMNEKVSQVCLRLLETAKGCLGANDRYMESTKKYL 915


>ref|XP_006393113.1| hypothetical protein EUTSA_v10011218mg [Eutrema salsugineum]
            gi|557089691|gb|ESQ30399.1| hypothetical protein
            EUTSA_v10011218mg [Eutrema salsugineum]
          Length = 919

 Score =  488 bits (1255), Expect = e-135
 Identities = 250/556 (44%), Positives = 365/556 (65%)
 Frame = -3

Query: 1712 DERLLLSLMEPTIEYVMKIVFKDLNEQTQVRDILVGLGLGKDQSDLFGERPCPSVIIHHL 1533
            D++LL SLMEP IEY++K +     ++  V  +L  LG G+++S         S+++H+L
Sbjct: 364  DKKLLFSLMEPPIEYIVKCLLLSGRQENNVLGMLEELGFGRNKSHSSTNSSRVSILLHYL 423

Query: 1532 LKELPIGVICSSAMNILHLIGCSADCSFDQFLNYKLLGLRLCESVSQVSEAIAVVNKVIQ 1353
            LKELP  ++ S A  ILH+I  S DCSF Q LNY+LLG RLCE         +++N+VIQ
Sbjct: 424  LKELPSELVSSKATEILHMIKYSNDCSFSQILNYRLLGNRLCEGRDHPGFLSSLINEVIQ 483

Query: 1352 VTSLYHRLDEYMKVLDAYVDIVLQNQLQDTHLNMILEKIFELVCNEGIDESALASLQSFF 1173
            V S Y  L +Y++++DAYVD++LQN++++ HL+ +L+ I  L  ++ + E   ASLQS F
Sbjct: 484  VASQYQTLYDYLRIMDAYVDLLLQNKMEN-HLDALLDDIATLARDKFLSEEEQASLQSIF 542

Query: 1172 TKLLTHFDNLNDILSLNHFIDILDLMHGTTRNMINIQILRIATRNNCLQDPTIIHFLFEV 993
             KLL+HF++L ++L LNHFI+ILDLM GT++  +N+ +L + TRN C+ DPT +  LFEV
Sbjct: 543  LKLLSHFEDLQEVLPLNHFIEILDLMSGTSKISVNMHLLNMGTRNGCISDPTTVQLLFEV 602

Query: 992  SQALHDGVDFLNIKADEHQHSAQLISRFVNIAEFGQDFERNLSFLVDCRGAFGSMSQLKE 813
            SQAL+D  DFLNIK D++  +A LIS FV + ++G + ER+L FL +CR AF  + +LKE
Sbjct: 603  SQALYDATDFLNIKDDDNLQTAHLISHFVEMVDYGAEMERHLMFLAECREAFNGIHELKE 662

Query: 812  TLVHSSNRLAVKAMQYGSNHASFVKSCLTFSEVTIPSIPAHTRQWNLYIETAEVAFLGGL 633
            TLV SSN LAVKA++ G  H +F+KSCL FSEVTIPS+   T+  NLY+ETAEVA LGGL
Sbjct: 663  TLVRSSNTLAVKALKAGKKHTNFIKSCLAFSEVTIPSVSTPTKLLNLYLETAEVALLGGL 722

Query: 632  VSHADGLIDSVIWCFQNFDLADGTLTSSDHDAYLSSVCKLCSLVILVPGNFEQGFMNVPR 453
            +SH+DGL+ S +   +N +  DG L S D D+  S VCKLCSL+++VPGN E+G M + +
Sbjct: 723  ISHSDGLVMSAVESLENIEATDG-LKSIDGDSIASVVCKLCSLLVIVPGNPEKGVMEILK 781

Query: 452  SVFGLVNSQSRMTAKLQIRGXXXXXXXXXXXXXXXXXSHMLPGQVLSNSNVFFGDMTYLQ 273
             +F    S S    +L+++                        +++ N  +FFGD +Y  
Sbjct: 782  RIFSATCSSSWAMPRLKVKIFCAIISLSSTLSQEKLPYRSANPEIIGNDVLFFGDTSYKN 841

Query: 272  ELLSLSGVIVQKIIDIVSEEPLQTTRGYIALEACNCIASSFKGSEKTWAICSTLVEIAKS 93
            EL+S + ++V +++D + +E  Q  RG IALEACNCI+S+   +EK   +C  L+E A+ 
Sbjct: 842  ELVSWTQLVVGELVDAIEQESSQIARGNIALEACNCISSALVMNEKVSQLCLRLLETAEG 901

Query: 92   SLGSDNIYLKSTRNFL 45
             LG+ + YL+ST+  L
Sbjct: 902  CLGAKDRYLESTKKSL 917


>gb|AAG50781.1|AC079027_4 hypothetical protein [Arabidopsis thaliana]
          Length = 1013

 Score =  485 bits (1248), Expect = e-134
 Identities = 250/575 (43%), Positives = 379/575 (65%)
 Frame = -3

Query: 1799 YNIAGINDLKNLIMRATFPQQTRDDVSSGDERLLLSLMEPTIEYVMKIVFKDLNEQTQVR 1620
            Y I  I D+++++      ++    ++  D++LL SL+EP IEY+MK +F    ++  V 
Sbjct: 336  YLIKCIKDIEDVLAPVLVDKEGYSYITD-DKKLLFSLVEPAIEYIMKCLFLTGRQENNVL 394

Query: 1619 DILVGLGLGKDQSDLFGERPCPSVIIHHLLKELPIGVICSSAMNILHLIGCSADCSFDQF 1440
             IL  LG G+++          S+++H+LLKELP  ++ S AM IL +I CS DCSF Q 
Sbjct: 395  GILEELGFGRNKFQSSYNSSHVSILLHYLLKELPSELVSSLAMEILDMIRCSNDCSFSQV 454

Query: 1439 LNYKLLGLRLCESVSQVSEAIAVVNKVIQVTSLYHRLDEYMKVLDAYVDIVLQNQLQDTH 1260
            LNY+LLG RL E  SQ     +++++VIQ  S Y  L +Y++++DAYVD++LQN++++ H
Sbjct: 455  LNYRLLGNRLSEGKSQEGFLSSLIDEVIQAASQYQSLYDYLRIMDAYVDLMLQNKMEN-H 513

Query: 1259 LNMILEKIFELVCNEGIDESALASLQSFFTKLLTHFDNLNDILSLNHFIDILDLMHGTTR 1080
            L+ +L+ I  L  ++ + E   ASLQS   KLL+HF+NL ++L LNHFI+ILDLM GT++
Sbjct: 514  LDALLDDIVSLARDKFLSEEEQASLQSIILKLLSHFENLQEVLPLNHFIEILDLMSGTSK 573

Query: 1079 NMINIQILRIATRNNCLQDPTIIHFLFEVSQALHDGVDFLNIKADEHQHSAQLISRFVNI 900
            + +N+ +L + TRN C+ D T +  LFEVSQAL+D  DF+NIK D+++ ++ LISRFV +
Sbjct: 574  SSVNMHLLNMGTRNGCICDSTTVQLLFEVSQALYDATDFVNIKDDDNRQTSHLISRFVEM 633

Query: 899  AEFGQDFERNLSFLVDCRGAFGSMSQLKETLVHSSNRLAVKAMQYGSNHASFVKSCLTFS 720
             ++G + ER+L FL +CR AF  + +LKETLV SSN LAVKA++ G  H +FVKSCL FS
Sbjct: 634  VDYGAEMERHLLFLAECREAFNGIHELKETLVRSSNTLAVKALKAGKKHINFVKSCLAFS 693

Query: 719  EVTIPSIPAHTRQWNLYIETAEVAFLGGLVSHADGLIDSVIWCFQNFDLADGTLTSSDHD 540
            EVTIPSI + T+  NLY+ETAEVA LGGL+SH+D L+ S +   +N  L DG L S D D
Sbjct: 694  EVTIPSISSPTKHLNLYLETAEVALLGGLISHSDELVMSAVEYLENVVLTDG-LKSIDID 752

Query: 539  AYLSSVCKLCSLVILVPGNFEQGFMNVPRSVFGLVNSQSRMTAKLQIRGXXXXXXXXXXX 360
            +  S +CKLCSL++++PGN E+G M + +S+F    S S  T +++++            
Sbjct: 753  SMASVICKLCSLLVMIPGNPEKGVMEILKSIFSATRSSSWATLRVKVKIFCAIMSLLSTL 812

Query: 359  XXXXXXSHMLPGQVLSNSNVFFGDMTYLQELLSLSGVIVQKIIDIVSEEPLQTTRGYIAL 180
                   H    +++ N  +FFGD +Y QEL+S + +++ +++D + +E  Q +RG +AL
Sbjct: 813  SQDNLPYHSANPEIIGNELLFFGDSSYKQELVSCTQLVLSELLDAIEQESSQISRGNMAL 872

Query: 179  EACNCIASSFKGSEKTWAICSTLVEIAKSSLGSDN 75
            EACNCI+S+   +EK   +C  L+E AK  LG+++
Sbjct: 873  EACNCISSALVMNEKVKELCLRLLETAKGCLGAND 907

