BLASTX nr result

ID: Akebia24_contig00004732 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00004732
         (2366 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278525.2| PREDICTED: uncharacterized protein LOC100243...   993   0.0  
emb|CBI20602.3| unnamed protein product [Vitis vinifera]              993   0.0  
ref|XP_007012218.1| Uncharacterized protein isoform 2 [Theobroma...   991   0.0  
ref|XP_007012217.1| Uncharacterized protein isoform 1 [Theobroma...   991   0.0  
ref|XP_002516490.1| conserved hypothetical protein [Ricinus comm...   978   0.0  
ref|XP_006826763.1| hypothetical protein AMTR_s00136p00081990 [A...   969   0.0  
ref|XP_006475982.1| PREDICTED: uncharacterized protein LOC102616...   963   0.0  
ref|XP_006475981.1| PREDICTED: uncharacterized protein LOC102616...   963   0.0  
ref|XP_006450754.1| hypothetical protein CICLE_v100072501mg, par...   963   0.0  
ref|XP_007225467.1| hypothetical protein PRUPE_ppa000219mg [Prun...   957   0.0  
ref|XP_007012964.1| Uncharacterized protein isoform 4 [Theobroma...   957   0.0  
ref|XP_007012963.1| Uncharacterized protein isoform 3 [Theobroma...   957   0.0  
ref|XP_007012962.1| Uncharacterized protein isoform 2 [Theobroma...   957   0.0  
ref|XP_007012961.1| Uncharacterized protein isoform 1 [Theobroma...   957   0.0  
gb|EXB75637.1| hypothetical protein L484_026114 [Morus notabilis]     950   0.0  
ref|XP_002308587.2| hypothetical protein POPTR_0006s25110g [Popu...   949   0.0  
ref|XP_006581468.1| PREDICTED: uncharacterized protein LOC100804...   948   0.0  
ref|XP_003523758.1| PREDICTED: uncharacterized protein LOC100783...   945   0.0  
ref|XP_004501087.1| PREDICTED: uncharacterized protein LOC101498...   940   0.0  
ref|XP_002324157.1| hypothetical protein POPTR_0018s04760g [Popu...   937   0.0  

>ref|XP_002278525.2| PREDICTED: uncharacterized protein LOC100243932 [Vitis vinifera]
          Length = 1416

 Score =  993 bits (2568), Expect = 0.0
 Identities = 502/777 (64%), Positives = 563/777 (72%), Gaps = 3/777 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKA+KM G+G+ISA            R+S+D++SRHD+P+I VHGG S+GCPEN+GA
Sbjct: 264  SIYIKAYKMTGSGRISACGGNGFGGGGGGRISVDVFSRHDDPKIFVHGGSSFGCPENSGA 323

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGTFYD V RSL V+N+N ST TDTLLL+FP+QPLWTNVYVRDHAKA VPLLWSRVQVQG
Sbjct: 324  AGTFYDAVPRSLIVSNNNRSTDTDTLLLEFPYQPLWTNVYVRDHAKATVPLLWSRVQVQG 383

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL+CG VL FGLAHY  SEFEL+AEELLMSDS I+VYGALRMS+KM LM NSK++ID 
Sbjct: 384  QISLYCGGVLSFGLAHYALSEFELLAEELLMSDSIIKVYGALRMSVKMFLMWNSKLLIDG 443

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA VATSLLEASNL+ LKESSVI SNA                  IEAQRLVLSLFYS
Sbjct: 444  GGDANVATSLLEASNLVVLKESSVIHSNANLGVHGQGLLNLSGPGDWIEAQRLVLSLFYS 503

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PLENATTD +TP+LYCE QDCP ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 504  IHVGPGSVLRGPLENATTDAVTPRLYCELQDCPTELLHPPEDCNVNSSLSFTLQICRVED 563

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            ITV+GLIKGSVVHFHRART+ VQSSG IS S                             
Sbjct: 564  ITVQGLIKGSVVHFHRARTIAVQSSGKISTSRMGCTGGVGRGKFLSSGLGSGGGHGGKGG 623

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXX--TAGGGIIVMGSLEHSLTSLSIY 1295
                  S  EGG++YGN DLPCE               TAGGG+IVMGSLEH L+SLSI 
Sbjct: 624  DGCYKGSCVEGGISYGNADLPCELGSGSGSGNDTLDGSTAGGGVIVMGSLEHPLSSLSIE 683

Query: 1296 GSLRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXX 1475
            GS++ADGES  ++ R   Y +              T+LLFL +L +G+ AVL        
Sbjct: 684  GSVKADGESSRESTRNNYYSMNNGSNVNPGGGSGGTILLFLRSLALGEAAVLSSIGGHGS 743

Query: 1476 XXXXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACP 1655
                      RIHFHWSDIPTG+ YQP+ASV+G+I++RGGL  DQ   G+NGT+TGKACP
Sbjct: 744  LHGGGGGGGGRIHFHWSDIPTGDVYQPIASVKGSIHSRGGLARDQSGMGENGTVTGKACP 803

Query: 1656 EGLYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISE 1835
             GLYGIFCEECP GT+KNV+GSD +LC  CP  ELP RAIY SVRGGIA TPCPY+CIS+
Sbjct: 804  RGLYGIFCEECPAGTYKNVTGSDRSLCRHCPYHELPRRAIYISVRGGIAETPCPYKCISD 863

Query: 1836 RYHKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQH 2015
            RYH PHCYT LEELIYTFGGPW                    RMKFV  DE PGPAPTQH
Sbjct: 864  RYHMPHCYTALEELIYTFGGPWLFCLLLLGVLILLALVLSVARMKFVGVDESPGPAPTQH 923

Query: 2016 GSQINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVY 2195
            GSQI+HSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPH+PPEQ+ EIVY
Sbjct: 924  GSQIDHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHTPPEQIKEIVY 983

Query: 2196 EDAFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            E AFN FVDEINA+AAYQWWEGS+HS+LSI+AYP AWS              EFVRS
Sbjct: 984  EGAFNGFVDEINAIAAYQWWEGSMHSILSILAYPLAWSWQQWRRRKKLQQLREFVRS 1040


>emb|CBI20602.3| unnamed protein product [Vitis vinifera]
          Length = 1439

 Score =  993 bits (2568), Expect = 0.0
 Identities = 502/777 (64%), Positives = 563/777 (72%), Gaps = 3/777 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKA+KM G+G+ISA            R+S+D++SRHD+P+I VHGG S+GCPEN+GA
Sbjct: 264  SIYIKAYKMTGSGRISACGGNGFGGGGGGRISVDVFSRHDDPKIFVHGGSSFGCPENSGA 323

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGTFYD V RSL V+N+N ST TDTLLL+FP+QPLWTNVYVRDHAKA VPLLWSRVQVQG
Sbjct: 324  AGTFYDAVPRSLIVSNNNRSTDTDTLLLEFPYQPLWTNVYVRDHAKATVPLLWSRVQVQG 383

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL+CG VL FGLAHY  SEFEL+AEELLMSDS I+VYGALRMS+KM LM NSK++ID 
Sbjct: 384  QISLYCGGVLSFGLAHYALSEFELLAEELLMSDSIIKVYGALRMSVKMFLMWNSKLLIDG 443

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA VATSLLEASNL+ LKESSVI SNA                  IEAQRLVLSLFYS
Sbjct: 444  GGDANVATSLLEASNLVVLKESSVIHSNANLGVHGQGLLNLSGPGDWIEAQRLVLSLFYS 503

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PLENATTD +TP+LYCE QDCP ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 504  IHVGPGSVLRGPLENATTDAVTPRLYCELQDCPTELLHPPEDCNVNSSLSFTLQICRVED 563

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            ITV+GLIKGSVVHFHRART+ VQSSG IS S                             
Sbjct: 564  ITVQGLIKGSVVHFHRARTIAVQSSGKISTSRMGCTGGVGRGKFLSSGLGSGGGHGGKGG 623

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXX--TAGGGIIVMGSLEHSLTSLSIY 1295
                  S  EGG++YGN DLPCE               TAGGG+IVMGSLEH L+SLSI 
Sbjct: 624  DGCYKGSCVEGGISYGNADLPCELGSGSGSGNDTLDGSTAGGGVIVMGSLEHPLSSLSIE 683

Query: 1296 GSLRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXX 1475
            GS++ADGES  ++ R   Y +              T+LLFL +L +G+ AVL        
Sbjct: 684  GSVKADGESSRESTRNNYYSMNNGSNVNPGGGSGGTILLFLRSLALGEAAVLSSIGGHGS 743

Query: 1476 XXXXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACP 1655
                      RIHFHWSDIPTG+ YQP+ASV+G+I++RGGL  DQ   G+NGT+TGKACP
Sbjct: 744  LHGGGGGGGGRIHFHWSDIPTGDVYQPIASVKGSIHSRGGLARDQSGMGENGTVTGKACP 803

Query: 1656 EGLYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISE 1835
             GLYGIFCEECP GT+KNV+GSD +LC  CP  ELP RAIY SVRGGIA TPCPY+CIS+
Sbjct: 804  RGLYGIFCEECPAGTYKNVTGSDRSLCRHCPYHELPRRAIYISVRGGIAETPCPYKCISD 863

Query: 1836 RYHKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQH 2015
            RYH PHCYT LEELIYTFGGPW                    RMKFV  DE PGPAPTQH
Sbjct: 864  RYHMPHCYTALEELIYTFGGPWLFCLLLLGVLILLALVLSVARMKFVGVDESPGPAPTQH 923

Query: 2016 GSQINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVY 2195
            GSQI+HSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPH+PPEQ+ EIVY
Sbjct: 924  GSQIDHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHTPPEQIKEIVY 983

Query: 2196 EDAFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            E AFN FVDEINA+AAYQWWEGS+HS+LSI+AYP AWS              EFVRS
Sbjct: 984  EGAFNGFVDEINAIAAYQWWEGSMHSILSILAYPLAWSWQQWRRRKKLQQLREFVRS 1040


>ref|XP_007012218.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508782581|gb|EOY29837.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1297

 Score =  991 bits (2561), Expect = 0.0
 Identities = 489/775 (63%), Positives = 568/775 (73%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKAHKM G+G+ISA            RVS+D++SRHD P+I VHGG S+GCP+N GA
Sbjct: 267  SIYIKAHKMTGSGRISACGGNGFAGGGGGRVSVDVFSRHDEPKIYVHGGISHGCPDNAGA 326

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGTFYD V RSLTVNNHN+ST T+TLLL+FP+QPLWTNVY+R+HA+A VPLLWSRVQVQG
Sbjct: 327  AGTFYDAVPRSLTVNNHNMSTDTETLLLEFPYQPLWTNVYIRNHARATVPLLWSRVQVQG 386

Query: 405  QLSLFC-GVLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL C GVL FGLAHY SSEFEL+AEELLMSDS ++VYGALRM++K+ LM NS+M+ID 
Sbjct: 387  QISLLCSGVLSFGLAHYASSEFELLAEELLMSDSVLKVYGALRMTVKIFLMWNSEMLIDG 446

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G+DA VATS LEASNL+ LKESSVI SNA                  I+AQRLVLSLFYS
Sbjct: 447  GEDATVATSWLEASNLVVLKESSVIHSNANLGVHGQGLLNLSGPGDKIQAQRLVLSLFYS 506

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PLENA++D +TPKLYCE QDCP+ELLHPPEDCNVNSSL+FTLQICRVED
Sbjct: 507  IHVGPGSVLRGPLENASSDAVTPKLYCELQDCPIELLHPPEDCNVNSSLAFTLQICRVED 566

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            ITVEGLIKGSVVHFHRART+ VQSSG ISAS                             
Sbjct: 567  ITVEGLIKGSVVHFHRARTISVQSSGIISASGMGCTGGVGKGNFLDNGIGSGGGHGGKGG 626

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  S+ EGG++YGN +LPCE              AGGG+IVMGS+EH L+SLS+ G+
Sbjct: 627  LGCYNGSYVEGGISYGNSELPCELGSGSGNESSSDSAAGGGVIVMGSVEHPLSSLSVEGA 686

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            LRADGES+ +   +Q+Y ++             TVLLFLHTLT+G++A+L          
Sbjct: 687  LRADGESFEETVWQQEYSVSNDSSIAPGGGSGGTVLLFLHTLTLGESALLSSVGGYGSPK 746

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ YQP+ASV+G+IY RGG G  +   G+NGT+TGKACP+G
Sbjct: 747  GGGGGGGGRIHFHWSDIPTGDVYQPIASVKGSIYARGGFGGGESGGGENGTVTGKACPKG 806

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYG FC +CP GT+KNVSGSD +LC+ CP SELPHRAIY +VRGGIA TPCPY CIS+RY
Sbjct: 807  LYGTFCMQCPVGTYKNVSGSDSSLCYPCPASELPHRAIYIAVRGGIAETPCPYECISDRY 866

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H P CYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 867  HMPQCYTALEELIYTFGGPWLFCLLLVGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 926

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNR EES+SHVHRMYFMGPNTFSEPWHLPH+PPE++ EIVYE 
Sbjct: 927  QIDHSFPFLESLNEVLETNRVEESRSHVHRMYFMGPNTFSEPWHLPHTPPEEIKEIVYEG 986

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEIN++AAYQWWEG+I+++LSI+ YP AWS              EFVRS
Sbjct: 987  AFNTFVDEINSIAAYQWWEGAIYTILSILVYPLAWSWQQCRRRMKLQRLREFVRS 1041


>ref|XP_007012217.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782580|gb|EOY29836.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1452

 Score =  991 bits (2561), Expect = 0.0
 Identities = 489/775 (63%), Positives = 568/775 (73%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKAHKM G+G+ISA            RVS+D++SRHD P+I VHGG S+GCP+N GA
Sbjct: 267  SIYIKAHKMTGSGRISACGGNGFAGGGGGRVSVDVFSRHDEPKIYVHGGISHGCPDNAGA 326

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGTFYD V RSLTVNNHN+ST T+TLLL+FP+QPLWTNVY+R+HA+A VPLLWSRVQVQG
Sbjct: 327  AGTFYDAVPRSLTVNNHNMSTDTETLLLEFPYQPLWTNVYIRNHARATVPLLWSRVQVQG 386

Query: 405  QLSLFC-GVLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL C GVL FGLAHY SSEFEL+AEELLMSDS ++VYGALRM++K+ LM NS+M+ID 
Sbjct: 387  QISLLCSGVLSFGLAHYASSEFELLAEELLMSDSVLKVYGALRMTVKIFLMWNSEMLIDG 446

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G+DA VATS LEASNL+ LKESSVI SNA                  I+AQRLVLSLFYS
Sbjct: 447  GEDATVATSWLEASNLVVLKESSVIHSNANLGVHGQGLLNLSGPGDKIQAQRLVLSLFYS 506

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PLENA++D +TPKLYCE QDCP+ELLHPPEDCNVNSSL+FTLQICRVED
Sbjct: 507  IHVGPGSVLRGPLENASSDAVTPKLYCELQDCPIELLHPPEDCNVNSSLAFTLQICRVED 566

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            ITVEGLIKGSVVHFHRART+ VQSSG ISAS                             
Sbjct: 567  ITVEGLIKGSVVHFHRARTISVQSSGIISASGMGCTGGVGKGNFLDNGIGSGGGHGGKGG 626

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  S+ EGG++YGN +LPCE              AGGG+IVMGS+EH L+SLS+ G+
Sbjct: 627  LGCYNGSYVEGGISYGNSELPCELGSGSGNESSSDSAAGGGVIVMGSVEHPLSSLSVEGA 686

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            LRADGES+ +   +Q+Y ++             TVLLFLHTLT+G++A+L          
Sbjct: 687  LRADGESFEETVWQQEYSVSNDSSIAPGGGSGGTVLLFLHTLTLGESALLSSVGGYGSPK 746

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ YQP+ASV+G+IY RGG G  +   G+NGT+TGKACP+G
Sbjct: 747  GGGGGGGGRIHFHWSDIPTGDVYQPIASVKGSIYARGGFGGGESGGGENGTVTGKACPKG 806

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYG FC +CP GT+KNVSGSD +LC+ CP SELPHRAIY +VRGGIA TPCPY CIS+RY
Sbjct: 807  LYGTFCMQCPVGTYKNVSGSDSSLCYPCPASELPHRAIYIAVRGGIAETPCPYECISDRY 866

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H P CYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 867  HMPQCYTALEELIYTFGGPWLFCLLLVGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 926

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNR EES+SHVHRMYFMGPNTFSEPWHLPH+PPE++ EIVYE 
Sbjct: 927  QIDHSFPFLESLNEVLETNRVEESRSHVHRMYFMGPNTFSEPWHLPHTPPEEIKEIVYEG 986

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEIN++AAYQWWEG+I+++LSI+ YP AWS              EFVRS
Sbjct: 987  AFNTFVDEINSIAAYQWWEGAIYTILSILVYPLAWSWQQCRRRMKLQRLREFVRS 1041


>ref|XP_002516490.1| conserved hypothetical protein [Ricinus communis]
            gi|223544310|gb|EEF45831.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1426

 Score =  978 bits (2527), Expect = 0.0
 Identities = 488/775 (62%), Positives = 562/775 (72%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SI+IKA+KM G+G+ISA            RVS+DI+SRHD+P+I VHGG S+GCPEN GA
Sbjct: 272  SIFIKAYKMTGSGRISACGGSGFAGGGGGRVSVDIFSRHDDPQIFVHGGSSFGCPENAGA 331

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V RSL V+NHN+ST T+TLLLDFP+QPLWTNVYVR+HA+A VPLLWSRVQVQG
Sbjct: 332  AGTLYDAVPRSLIVSNHNMSTDTETLLLDFPYQPLWTNVYVRNHARATVPLLWSRVQVQG 391

Query: 405  QLSLFC-GVLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL C GVL FGLAHY SSEFEL+AEELLMSDS I+VYGALRM++K+ LM NSKMI+D 
Sbjct: 392  QISLLCHGVLSFGLAHYASSEFELLAEELLMSDSVIKVYGALRMTVKIFLMWNSKMIVDG 451

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G+D  V TS LEASNLI LKESSVI SNA                  IEAQRLVLSLFYS
Sbjct: 452  GEDTTVTTSWLEASNLIVLKESSVIQSNANLGVHGQGLLNLSGPGDSIEAQRLVLSLFYS 511

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PL+NAT+D +TP+LYCE QDCP+ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 512  IHVGPGSVLRGPLQNATSDAVTPRLYCELQDCPIELLHPPEDCNVNSSLSFTLQICRVED 571

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            ITVEGLIKGSVVHFHRARTV V SSG ISAS                             
Sbjct: 572  ITVEGLIKGSVVHFHRARTVSVLSSGRISASGMGCTGGVGRGHVLENGIGSGGGHGGKGG 631

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  S  EGG++YGN +LPCE             TAGGGIIVMGSL+H L+SLS+ GS
Sbjct: 632  LGCYNGSCIEGGMSYGNVELPCELGSGSGDESSAGSTAGGGIIVMGSLDHPLSSLSVEGS 691

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            +RADGES+ Q  +     +              T+L+FLHTL + ++AVL          
Sbjct: 692  VRADGESFQQTVKLGKLTVKNDTTGGPGGGSGGTILMFLHTLDLSESAVLSSGGGYGSQN 751

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ YQP+ASV+G+I   GG G D+G AG+NGT+TGKACP+G
Sbjct: 752  GAGGGGGGRIHFHWSDIPTGDVYQPIASVKGSILFGGGTGRDEGCAGENGTVTGKACPKG 811

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            L+G+FCEECP GTFKNV+GS+ +LCH CP +ELPHRA+Y +VRGGIA TPCPY+CIS+R+
Sbjct: 812  LFGVFCEECPAGTFKNVTGSERSLCHPCPANELPHRAVYVAVRGGIAETPCPYKCISDRF 871

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 872  HMPHCYTALEELIYTFGGPWLFCLLLVALLILLALVLSVARMKFVGVDELPGPAPTQHGS 931

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNRAEESQ+HVHRMYFMGPNTFSEPWHLPH+PPEQ+ EIVYE 
Sbjct: 932  QIDHSFPFLESLNEVLETNRAEESQNHVHRMYFMGPNTFSEPWHLPHTPPEQIKEIVYES 991

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            A+N FVDEINA+ AYQWWEG+++S+LS + YP AWS              EFVRS
Sbjct: 992  AYNSFVDEINAITAYQWWEGAMYSILSALLYPLAWSWQQWRRRIKLQKLREFVRS 1046


>ref|XP_006826763.1| hypothetical protein AMTR_s00136p00081990 [Amborella trichopoda]
            gi|548831183|gb|ERM94000.1| hypothetical protein
            AMTR_s00136p00081990 [Amborella trichopoda]
          Length = 1454

 Score =  969 bits (2505), Expect = 0.0
 Identities = 492/775 (63%), Positives = 554/775 (71%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SI IK+ KM G+GKISAS           RV+I +YSRHD+PEILVHGG S GCPEN GA
Sbjct: 282  SIMIKSDKMKGSGKISASGGNGWAGGGGGRVAIHVYSRHDDPEILVHGGMSRGCPENAGA 341

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD + R+L V+N+N++TQTDTLLLDFP+QPLWTNVYV++ AK VVPLLWSRVQVQG
Sbjct: 342  AGTLYDCLPRTLFVSNNNMTTQTDTLLLDFPNQPLWTNVYVKNLAKVVVPLLWSRVQVQG 401

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            QLSL  G  L FGL HYP SEFELMAEELLMSDS I+VYGALRMS+KMLLM NSKM+ID 
Sbjct: 402  QLSLLHGGSLSFGLTHYPFSEFELMAEELLMSDSVIKVYGALRMSVKMLLMWNSKMLIDG 461

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G D++VATSLLEASNL+ L+ESS+I SN+                  IEAQRL+LSLFY+
Sbjct: 462  GGDSIVATSLLEASNLVVLRESSIIHSNSNLGVHGQGLLNLSGPGDRIEAQRLILSLFYN 521

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PL+NATTDD+TP LYC  QDCP ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 522  IHVGPGSVLRGPLKNATTDDVTPHLYCTSQDCPFELLHPPEDCNVNSSLSFTLQICRVED 581

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I+VEGLI+GSVVHFHRARTVVV S+G I AS                             
Sbjct: 582  ISVEGLIEGSVVHFHRARTVVVHSTGIIDASGLGCKGGVGRGNVLSNGLSGGGGHGGQGG 641

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  S+ EGG  YGNP LPCE             TAGGGIIVMGSLEHSL+SLS+ GS
Sbjct: 642  AGYYNHSYVEGGTVYGNPALPCELGSGSGNESLAGSTAGGGIIVMGSLEHSLSSLSVGGS 701

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            LRADGES+   A  QD+ L              T+LLFL TLT+G+ A++          
Sbjct: 702  LRADGESFQLPAGNQDFGLGFGFNGGPGGGSGGTILLFLRTLTLGEDAMISSVGGYGSHT 761

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    R+HF WSDIPTG+EY PLASV+G I  RGG G D G  G+NGT+TGK CP G
Sbjct: 762  GGGGGGGGRVHFDWSDIPTGDEYIPLASVKGGIRARGGTGKDGGLRGNNGTVTGKECPRG 821

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            L+GIFCEECP GTFKNV+GS+  LC  CP  +LPHRAIY +VRGG++  PCPY+CISERY
Sbjct: 822  LFGIFCEECPAGTFKNVTGSNEALCRPCPPEQLPHRAIYINVRGGVSGPPCPYKCISERY 881

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEELIYTFGGPW                    RMKFV  D+LPGPAPTQHGS
Sbjct: 882  HMPHCYTPLEELIYTFGGPWLFGLLLSGLLVLLALVLSVARMKFVGTDDLPGPAPTQHGS 941

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTF EPWHLPHSPPEQ+ EIVYED
Sbjct: 942  QIDHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFREPWHLPHSPPEQIMEIVYED 1001

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEIN L AYQWWEGS++S+LS++AYPFAWS              EFVRS
Sbjct: 1002 AFNRFVDEINVLDAYQWWEGSVYSILSVLAYPFAWSWQQWRRRKKLQRLREFVRS 1056


>ref|XP_006475982.1| PREDICTED: uncharacterized protein LOC102616975 isoform X2 [Citrus
            sinensis]
          Length = 1428

 Score =  963 bits (2490), Expect = 0.0
 Identities = 482/775 (62%), Positives = 558/775 (72%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIY+ A+KM G+G ISA            RVS+DI+SRHD P+I VHGG S+ CP+N G 
Sbjct: 243  SIYLIAYKMTGSGLISACGGNGYAGGGGGRVSVDIFSRHDEPKIFVHGGNSFACPDNAGG 302

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V R+LTV+N+N+ST T+TLLL+FP+QPLWTNVYV++ A+A VPLLWSRVQVQG
Sbjct: 303  AGTLYDAVPRTLTVSNYNMSTDTETLLLEFPNQPLWTNVYVQNCARATVPLLWSRVQVQG 362

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL CG VL FGLAHY +SEFEL+AEELLMSDS I+VYGALRM++K+ LM NS+M++D 
Sbjct: 363  QISLSCGGVLSFGLAHYATSEFELLAEELLMSDSVIKVYGALRMTVKIFLMWNSEMLVDG 422

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA VATSLLEASNLI LKE S+I SNA                  IEAQRLVL+LFYS
Sbjct: 423  GGDATVATSLLEASNLIVLKEFSIIHSNANLEVHGQGLLNLSGPGDRIEAQRLVLALFYS 482

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL++PLENATTD +TP+LYCE QDCP+ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 483  IHVGPGSVLRSPLENATTDAVTPRLYCEIQDCPVELLHPPEDCNVNSSLSFTLQICRVED 542

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I V+GL++GSVVHFHRART+ VQSSGAISAS                             
Sbjct: 543  IVVDGLVEGSVVHFHRARTISVQSSGAISASGMGCTGGVGRGKVIGNGVGSGGGHGGKGG 602

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  S  EGG++YGN +LPCE             TAGGGIIVMGS EH L+SLS+ GS
Sbjct: 603  LGCFNDSCVEGGISYGNANLPCELGSGSGNDTSGNSTAGGGIIVMGSFEHPLSSLSVEGS 662

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            ++ADG+S+   + K++Y +              T+LLFLHTL +GD+AVL          
Sbjct: 663  VKADGQSFEDLSTKKNYVVRNGSIGGAGGGSGGTILLFLHTLDIGDSAVLSSVGGYGSHM 722

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ YQP+ASV G+I   GGLG  +   G+NGT TGKACP+G
Sbjct: 723  GGGGGGGGRIHFHWSDIPTGDVYQPIASVRGSIRIGGGLGGHELGGGENGTTTGKACPKG 782

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GT+KNV+GSD +LCHQCP  E PHRA+Y SVRGGIA TPCPYRCISERY
Sbjct: 783  LYGIFCEECPVGTYKNVTGSDKSLCHQCPPQEFPHRAVYISVRGGIAETPCPYRCISERY 842

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 843  HMPHCYTALEELIYTFGGPWLFCLLLVGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 902

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNRAEES SHVHRMYFMGPNTFS+PWHLPH+PPEQ+ EIVYE 
Sbjct: 903  QIDHSFPFLESLNEVLETNRAEESHSHVHRMYFMGPNTFSQPWHLPHTPPEQIKEIVYEG 962

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEINA+A Y WWEG+I+S+L+I+AYP AWS              E+VRS
Sbjct: 963  AFNSFVDEINAIATYHWWEGAIYSILAILAYPLAWSWQQWRRRMKLQRLREYVRS 1017


>ref|XP_006475981.1| PREDICTED: uncharacterized protein LOC102616975 isoform X1 [Citrus
            sinensis]
          Length = 1458

 Score =  963 bits (2490), Expect = 0.0
 Identities = 482/775 (62%), Positives = 558/775 (72%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIY+ A+KM G+G ISA            RVS+DI+SRHD P+I VHGG S+ CP+N G 
Sbjct: 273  SIYLIAYKMTGSGLISACGGNGYAGGGGGRVSVDIFSRHDEPKIFVHGGNSFACPDNAGG 332

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V R+LTV+N+N+ST T+TLLL+FP+QPLWTNVYV++ A+A VPLLWSRVQVQG
Sbjct: 333  AGTLYDAVPRTLTVSNYNMSTDTETLLLEFPNQPLWTNVYVQNCARATVPLLWSRVQVQG 392

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL CG VL FGLAHY +SEFEL+AEELLMSDS I+VYGALRM++K+ LM NS+M++D 
Sbjct: 393  QISLSCGGVLSFGLAHYATSEFELLAEELLMSDSVIKVYGALRMTVKIFLMWNSEMLVDG 452

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA VATSLLEASNLI LKE S+I SNA                  IEAQRLVL+LFYS
Sbjct: 453  GGDATVATSLLEASNLIVLKEFSIIHSNANLEVHGQGLLNLSGPGDRIEAQRLVLALFYS 512

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL++PLENATTD +TP+LYCE QDCP+ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 513  IHVGPGSVLRSPLENATTDAVTPRLYCEIQDCPVELLHPPEDCNVNSSLSFTLQICRVED 572

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I V+GL++GSVVHFHRART+ VQSSGAISAS                             
Sbjct: 573  IVVDGLVEGSVVHFHRARTISVQSSGAISASGMGCTGGVGRGKVIGNGVGSGGGHGGKGG 632

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  S  EGG++YGN +LPCE             TAGGGIIVMGS EH L+SLS+ GS
Sbjct: 633  LGCFNDSCVEGGISYGNANLPCELGSGSGNDTSGNSTAGGGIIVMGSFEHPLSSLSVEGS 692

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            ++ADG+S+   + K++Y +              T+LLFLHTL +GD+AVL          
Sbjct: 693  VKADGQSFEDLSTKKNYVVRNGSIGGAGGGSGGTILLFLHTLDIGDSAVLSSVGGYGSHM 752

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ YQP+ASV G+I   GGLG  +   G+NGT TGKACP+G
Sbjct: 753  GGGGGGGGRIHFHWSDIPTGDVYQPIASVRGSIRIGGGLGGHELGGGENGTTTGKACPKG 812

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GT+KNV+GSD +LCHQCP  E PHRA+Y SVRGGIA TPCPYRCISERY
Sbjct: 813  LYGIFCEECPVGTYKNVTGSDKSLCHQCPPQEFPHRAVYISVRGGIAETPCPYRCISERY 872

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 873  HMPHCYTALEELIYTFGGPWLFCLLLVGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 932

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNRAEES SHVHRMYFMGPNTFS+PWHLPH+PPEQ+ EIVYE 
Sbjct: 933  QIDHSFPFLESLNEVLETNRAEESHSHVHRMYFMGPNTFSQPWHLPHTPPEQIKEIVYEG 992

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEINA+A Y WWEG+I+S+L+I+AYP AWS              E+VRS
Sbjct: 993  AFNSFVDEINAIATYHWWEGAIYSILAILAYPLAWSWQQWRRRMKLQRLREYVRS 1047


>ref|XP_006450754.1| hypothetical protein CICLE_v100072501mg, partial [Citrus clementina]
            gi|557553980|gb|ESR63994.1| hypothetical protein
            CICLE_v100072501mg, partial [Citrus clementina]
          Length = 1330

 Score =  963 bits (2490), Expect = 0.0
 Identities = 482/775 (62%), Positives = 558/775 (72%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIY+ A+KM G+G ISA            RVS+DI+SRHD P+I VHGG S+ CP+N G 
Sbjct: 273  SIYLIAYKMTGSGLISACGGNGYAGGGGGRVSVDIFSRHDEPKIFVHGGNSFACPDNAGG 332

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V R+LTV+N+N+ST T+TLLL+FP+QPLWTNVYV++ A+A VPLLWSRVQVQG
Sbjct: 333  AGTLYDAVPRTLTVSNYNMSTDTETLLLEFPNQPLWTNVYVQNCARATVPLLWSRVQVQG 392

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL CG VL FGLAHY +SEFEL+AEELLMSDS I+VYGALRM++K+ LM NS+M++D 
Sbjct: 393  QISLSCGGVLSFGLAHYATSEFELLAEELLMSDSVIKVYGALRMTVKIFLMWNSEMLVDG 452

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA VATSLLEASNLI LKE S+I SNA                  IEAQRLVL+LFYS
Sbjct: 453  GGDATVATSLLEASNLIVLKEFSIIHSNANLEVHGQGLLNLSGPGDRIEAQRLVLALFYS 512

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL++PLENATTD +TP+LYCE QDCP+ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 513  IHVGPGSVLRSPLENATTDAVTPRLYCEIQDCPVELLHPPEDCNVNSSLSFTLQICRVED 572

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I V+GL++GSVVHFHRART+ VQSSGAISAS                             
Sbjct: 573  IVVDGLVEGSVVHFHRARTISVQSSGAISASGMGCTGGVGRGKVIGNGVGSGGGHGGKGG 632

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  S  EGG++YGN +LPCE             TAGGGIIVMGS EH L+SLS+ GS
Sbjct: 633  LGCFNDSCVEGGISYGNANLPCELGSGSGNDTSGNSTAGGGIIVMGSFEHPLSSLSVEGS 692

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            ++ADG+S+   + K++Y +              T+LLFLHTL +GD+AVL          
Sbjct: 693  VKADGQSFEDLSTKKNYVVRNGSIGGAGGGSGGTILLFLHTLDIGDSAVLSSVGGYGSHM 752

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ YQP+ASV G+I   GGLG  +   G+NGT TGKACP+G
Sbjct: 753  GGGGGGGGRIHFHWSDIPTGDVYQPIASVRGSIRIGGGLGGHELGGGENGTTTGKACPKG 812

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GT+KNV+GSD +LCHQCP  E PHRA+Y SVRGGIA TPCPYRCISERY
Sbjct: 813  LYGIFCEECPVGTYKNVTGSDKSLCHQCPPQEFPHRAVYISVRGGIAETPCPYRCISERY 872

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 873  HMPHCYTALEELIYTFGGPWLFCLLLVGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 932

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNRAEES SHVHRMYFMGPNTFS+PWHLPH+PPEQ+ EIVYE 
Sbjct: 933  QIDHSFPFLESLNEVLETNRAEESHSHVHRMYFMGPNTFSQPWHLPHTPPEQIKEIVYEG 992

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEINA+A Y WWEG+I+S+L+I+AYP AWS              E+VRS
Sbjct: 993  AFNSFVDEINAIATYHWWEGAIYSILAILAYPLAWSWQQWRRRMKLQRLREYVRS 1047


>ref|XP_007225467.1| hypothetical protein PRUPE_ppa000219mg [Prunus persica]
            gi|462422403|gb|EMJ26666.1| hypothetical protein
            PRUPE_ppa000219mg [Prunus persica]
          Length = 1446

 Score =  957 bits (2475), Expect = 0.0
 Identities = 478/775 (61%), Positives = 547/775 (70%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SI+IKA KM GNG+ISA            RVS+D++SRHD+P+I VHGG SY CPEN GA
Sbjct: 268  SIHIKARKMTGNGRISACGGNGYAGGGGGRVSVDVFSRHDDPKIFVHGGGSYACPENAGA 327

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V RSL VNNHN ST T+TLLL+FP  PLWTNVY+ + A+A VPLLWSRVQVQG
Sbjct: 328  AGTLYDAVPRSLFVNNHNKSTDTETLLLEFPFHPLWTNVYIENKARATVPLLWSRVQVQG 387

Query: 405  QLSLFC-GVLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL   GVL FGL HY SSEFEL+AEELLMSDS I+VYGALRMS+KM LM NSKM+ID 
Sbjct: 388  QISLLSDGVLSFGLPHYASSEFELLAEELLMSDSVIKVYGALRMSVKMFLMWNSKMLIDG 447

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G +  V TSLLEASNL+ L+ESSVI SNA                  I+AQRLVLSLFYS
Sbjct: 448  GGEEAVETSLLEASNLVVLRESSVIHSNANLGVHGQGLLNLSGPGDWIQAQRLVLSLFYS 507

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PLENATTD +TPKLYCE++DCP ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 508  IHVGPGSVLRGPLENATTDSLTPKLYCENKDCPSELLHPPEDCNVNSSLSFTLQICRVED 567

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I +EGL+KGSVVHFHRART+ +QSSGAISAS                             
Sbjct: 568  IIIEGLVKGSVVHFHRARTIAIQSSGAISASGMGCTGGIGSGNILSNGSGSGGGHGGKGG 627

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  S  EGG++YGN +LPCE             TAGGGIIVMGS EH L+SLS+ GS
Sbjct: 628  IACYNGSCVEGGISYGNEELPCELGSGSGNDISAGSTAGGGIIVMGSSEHPLSSLSVEGS 687

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            +  DGES+ +   K+ + L              ++LLFL TL +G++A+L          
Sbjct: 688  MTTDGESFERTTLKEKFPLVDSLSGGPGGGSGGSILLFLRTLALGESAILSSVGGYSSSI 747

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ YQP+ASVEG+I + GG G DQG AG++GT+TGK CP+G
Sbjct: 748  GGGGGGGGRIHFHWSDIPTGDVYQPIASVEGSILSGGGEGRDQGGAGEDGTVTGKDCPKG 807

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYG FCEECP GT+KNV GSD  LCH CP  ELP RAIY SVRGG+A  PCP++CIS+RY
Sbjct: 808  LYGTFCEECPAGTYKNVIGSDRALCHHCPADELPLRAIYISVRGGVAEAPCPFKCISDRY 867

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 868  HMPHCYTALEELIYTFGGPWLFGLLLIGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 927

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTF +PWHLPH+PPEQV EIVYE 
Sbjct: 928  QIDHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFGKPWHLPHTPPEQVKEIVYEG 987

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
             FN FVDEIN++A YQWWEG+++S+LS++AYP AWS              EFVRS
Sbjct: 988  PFNTFVDEINSIATYQWWEGAMYSILSVLAYPLAWSWQHWRRRLKLQRLREFVRS 1042


>ref|XP_007012964.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508783327|gb|EOY30583.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 1158

 Score =  957 bits (2474), Expect = 0.0
 Identities = 475/775 (61%), Positives = 556/775 (71%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKAH+M G+G+ISAS           R+SID++SRHD+ E  +HGG S+GC  N GA
Sbjct: 268  SIYIKAHRMTGSGRISASGGNGFAGGGGGRISIDVFSRHDDTEFFIHGGTSFGCKGNAGA 327

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT+YD V RSL V+NHN+ST TDTLL++FP QPLWTNVY+RDHAKA VPL WSRVQV+G
Sbjct: 328  AGTYYDAVPRSLIVSNHNMSTSTDTLLMEFPKQPLWTNVYIRDHAKASVPLFWSRVQVRG 387

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+ L CG VL FGLAHY SSEFELMAEELLMSDS +++YGALRMS+KM LM NSKM+ID 
Sbjct: 388  QIHLSCGAVLSFGLAHYASSEFELMAEELLMSDSIVKIYGALRMSVKMHLMWNSKMLIDG 447

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA+VATSLLEASNL+ L+ESSVI SNA                  IEAQRL+LSLF+S
Sbjct: 448  GADAIVATSLLEASNLVVLRESSVIQSNANLGVHGQGFLNLSGPGDMIEAQRLILSLFFS 507

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            I+VG GS+L+ PLENA+ +DMTP+LYCE QDCPMEL+HPPEDCNVNSSLSFTLQICRVED
Sbjct: 508  INVGSGSILRGPLENASNNDMTPRLYCELQDCPMELVHPPEDCNVNSSLSFTLQICRVED 567

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I +EG+I GSVVHFH  R+++V SSG I+ S                             
Sbjct: 568  IVIEGVITGSVVHFHWVRSIIVHSSGEITTSALGCTGGVGRGKVLNNGLGGGGGHGGKGG 627

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  SF EGGV+YG+ DLPCE             TAGGGIIVMGSLEH L+SL++YGS
Sbjct: 628  EGYFDGSFIEGGVSYGDADLPCELGSGSGNDSLAGTTAGGGIIVMGSLEHLLSSLTVYGS 687

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            LRADGES+G+  RKQ +  +             T+LLF+HT+ +GD++V+          
Sbjct: 688  LRADGESFGEAIRKQAH--STISNIGPGGGSGGTILLFVHTIVLGDSSVISTAGGHGSPS 745

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    R+HFHWSDIPTG+EY P+ASV+G+I TRGG G  QG+ G+NGTITGKACP+G
Sbjct: 746  GGGGGGGGRVHFHWSDIPTGDEYLPIASVKGSIITRGGSGRAQGHTGENGTITGKACPKG 805

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GTFKNVSGSD  LC  CP ++LP RA+Y +VRGG+  +PCPY+CISERY
Sbjct: 806  LYGIFCEECPVGTFKNVSGSDRVLCLDCPSNKLPSRALYVNVRGGVTESPCPYKCISERY 865

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEEL+YTFGGPW                    RMK+V  DELP   P + GS
Sbjct: 866  HMPHCYTALEELVYTFGGPWLFGLILLGLLVLLALVLSVARMKYVGGDELPALVPARRGS 925

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            +I+HSFPFLESLNEVLETNR EESQ+HVHRMYFMGPNTF+EPWHLPHSPPEQV EIVYED
Sbjct: 926  RIDHSFPFLESLNEVLETNRTEESQTHVHRMYFMGPNTFTEPWHLPHSPPEQVIEIVYED 985

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEIN LAAYQWWEGSI+S+LSI+AYP AWS              EFVRS
Sbjct: 986  AFNRFVDEINGLAAYQWWEGSIYSILSILAYPLAWSWLQQCRKNKLQQLREFVRS 1040


>ref|XP_007012963.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508783326|gb|EOY30582.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1433

 Score =  957 bits (2474), Expect = 0.0
 Identities = 475/775 (61%), Positives = 556/775 (71%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKAH+M G+G+ISAS           R+SID++SRHD+ E  +HGG S+GC  N GA
Sbjct: 268  SIYIKAHRMTGSGRISASGGNGFAGGGGGRISIDVFSRHDDTEFFIHGGTSFGCKGNAGA 327

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT+YD V RSL V+NHN+ST TDTLL++FP QPLWTNVY+RDHAKA VPL WSRVQV+G
Sbjct: 328  AGTYYDAVPRSLIVSNHNMSTSTDTLLMEFPKQPLWTNVYIRDHAKASVPLFWSRVQVRG 387

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+ L CG VL FGLAHY SSEFELMAEELLMSDS +++YGALRMS+KM LM NSKM+ID 
Sbjct: 388  QIHLSCGAVLSFGLAHYASSEFELMAEELLMSDSIVKIYGALRMSVKMHLMWNSKMLIDG 447

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA+VATSLLEASNL+ L+ESSVI SNA                  IEAQRL+LSLF+S
Sbjct: 448  GADAIVATSLLEASNLVVLRESSVIQSNANLGVHGQGFLNLSGPGDMIEAQRLILSLFFS 507

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            I+VG GS+L+ PLENA+ +DMTP+LYCE QDCPMEL+HPPEDCNVNSSLSFTLQICRVED
Sbjct: 508  INVGSGSILRGPLENASNNDMTPRLYCELQDCPMELVHPPEDCNVNSSLSFTLQICRVED 567

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I +EG+I GSVVHFH  R+++V SSG I+ S                             
Sbjct: 568  IVIEGVITGSVVHFHWVRSIIVHSSGEITTSALGCTGGVGRGKVLNNGLGGGGGHGGKGG 627

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  SF EGGV+YG+ DLPCE             TAGGGIIVMGSLEH L+SL++YGS
Sbjct: 628  EGYFDGSFIEGGVSYGDADLPCELGSGSGNDSLAGTTAGGGIIVMGSLEHLLSSLTVYGS 687

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            LRADGES+G+  RKQ +  +             T+LLF+HT+ +GD++V+          
Sbjct: 688  LRADGESFGEAIRKQAH--STISNIGPGGGSGGTILLFVHTIVLGDSSVISTAGGHGSPS 745

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    R+HFHWSDIPTG+EY P+ASV+G+I TRGG G  QG+ G+NGTITGKACP+G
Sbjct: 746  GGGGGGGGRVHFHWSDIPTGDEYLPIASVKGSIITRGGSGRAQGHTGENGTITGKACPKG 805

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GTFKNVSGSD  LC  CP ++LP RA+Y +VRGG+  +PCPY+CISERY
Sbjct: 806  LYGIFCEECPVGTFKNVSGSDRVLCLDCPSNKLPSRALYVNVRGGVTESPCPYKCISERY 865

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEEL+YTFGGPW                    RMK+V  DELP   P + GS
Sbjct: 866  HMPHCYTALEELVYTFGGPWLFGLILLGLLVLLALVLSVARMKYVGGDELPALVPARRGS 925

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            +I+HSFPFLESLNEVLETNR EESQ+HVHRMYFMGPNTF+EPWHLPHSPPEQV EIVYED
Sbjct: 926  RIDHSFPFLESLNEVLETNRTEESQTHVHRMYFMGPNTFTEPWHLPHSPPEQVIEIVYED 985

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEIN LAAYQWWEGSI+S+LSI+AYP AWS              EFVRS
Sbjct: 986  AFNRFVDEINGLAAYQWWEGSIYSILSILAYPLAWSWLQQCRKNKLQQLREFVRS 1040


>ref|XP_007012962.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508783325|gb|EOY30581.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1434

 Score =  957 bits (2474), Expect = 0.0
 Identities = 475/775 (61%), Positives = 556/775 (71%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKAH+M G+G+ISAS           R+SID++SRHD+ E  +HGG S+GC  N GA
Sbjct: 268  SIYIKAHRMTGSGRISASGGNGFAGGGGGRISIDVFSRHDDTEFFIHGGTSFGCKGNAGA 327

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT+YD V RSL V+NHN+ST TDTLL++FP QPLWTNVY+RDHAKA VPL WSRVQV+G
Sbjct: 328  AGTYYDAVPRSLIVSNHNMSTSTDTLLMEFPKQPLWTNVYIRDHAKASVPLFWSRVQVRG 387

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+ L CG VL FGLAHY SSEFELMAEELLMSDS +++YGALRMS+KM LM NSKM+ID 
Sbjct: 388  QIHLSCGAVLSFGLAHYASSEFELMAEELLMSDSIVKIYGALRMSVKMHLMWNSKMLIDG 447

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA+VATSLLEASNL+ L+ESSVI SNA                  IEAQRL+LSLF+S
Sbjct: 448  GADAIVATSLLEASNLVVLRESSVIQSNANLGVHGQGFLNLSGPGDMIEAQRLILSLFFS 507

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            I+VG GS+L+ PLENA+ +DMTP+LYCE QDCPMEL+HPPEDCNVNSSLSFTLQICRVED
Sbjct: 508  INVGSGSILRGPLENASNNDMTPRLYCELQDCPMELVHPPEDCNVNSSLSFTLQICRVED 567

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I +EG+I GSVVHFH  R+++V SSG I+ S                             
Sbjct: 568  IVIEGVITGSVVHFHWVRSIIVHSSGEITTSALGCTGGVGRGKVLNNGLGGGGGHGGKGG 627

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  SF EGGV+YG+ DLPCE             TAGGGIIVMGSLEH L+SL++YGS
Sbjct: 628  EGYFDGSFIEGGVSYGDADLPCELGSGSGNDSLAGTTAGGGIIVMGSLEHLLSSLTVYGS 687

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            LRADGES+G+  RKQ +  +             T+LLF+HT+ +GD++V+          
Sbjct: 688  LRADGESFGEAIRKQAH--STISNIGPGGGSGGTILLFVHTIVLGDSSVISTAGGHGSPS 745

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    R+HFHWSDIPTG+EY P+ASV+G+I TRGG G  QG+ G+NGTITGKACP+G
Sbjct: 746  GGGGGGGGRVHFHWSDIPTGDEYLPIASVKGSIITRGGSGRAQGHTGENGTITGKACPKG 805

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GTFKNVSGSD  LC  CP ++LP RA+Y +VRGG+  +PCPY+CISERY
Sbjct: 806  LYGIFCEECPVGTFKNVSGSDRVLCLDCPSNKLPSRALYVNVRGGVTESPCPYKCISERY 865

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEEL+YTFGGPW                    RMK+V  DELP   P + GS
Sbjct: 866  HMPHCYTALEELVYTFGGPWLFGLILLGLLVLLALVLSVARMKYVGGDELPALVPARRGS 925

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            +I+HSFPFLESLNEVLETNR EESQ+HVHRMYFMGPNTF+EPWHLPHSPPEQV EIVYED
Sbjct: 926  RIDHSFPFLESLNEVLETNRTEESQTHVHRMYFMGPNTFTEPWHLPHSPPEQVIEIVYED 985

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEIN LAAYQWWEGSI+S+LSI+AYP AWS              EFVRS
Sbjct: 986  AFNRFVDEINGLAAYQWWEGSIYSILSILAYPLAWSWLQQCRKNKLQQLREFVRS 1040


>ref|XP_007012961.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508783324|gb|EOY30580.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1445

 Score =  957 bits (2474), Expect = 0.0
 Identities = 475/775 (61%), Positives = 556/775 (71%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKAH+M G+G+ISAS           R+SID++SRHD+ E  +HGG S+GC  N GA
Sbjct: 268  SIYIKAHRMTGSGRISASGGNGFAGGGGGRISIDVFSRHDDTEFFIHGGTSFGCKGNAGA 327

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT+YD V RSL V+NHN+ST TDTLL++FP QPLWTNVY+RDHAKA VPL WSRVQV+G
Sbjct: 328  AGTYYDAVPRSLIVSNHNMSTSTDTLLMEFPKQPLWTNVYIRDHAKASVPLFWSRVQVRG 387

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+ L CG VL FGLAHY SSEFELMAEELLMSDS +++YGALRMS+KM LM NSKM+ID 
Sbjct: 388  QIHLSCGAVLSFGLAHYASSEFELMAEELLMSDSIVKIYGALRMSVKMHLMWNSKMLIDG 447

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G DA+VATSLLEASNL+ L+ESSVI SNA                  IEAQRL+LSLF+S
Sbjct: 448  GADAIVATSLLEASNLVVLRESSVIQSNANLGVHGQGFLNLSGPGDMIEAQRLILSLFFS 507

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            I+VG GS+L+ PLENA+ +DMTP+LYCE QDCPMEL+HPPEDCNVNSSLSFTLQICRVED
Sbjct: 508  INVGSGSILRGPLENASNNDMTPRLYCELQDCPMELVHPPEDCNVNSSLSFTLQICRVED 567

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I +EG+I GSVVHFH  R+++V SSG I+ S                             
Sbjct: 568  IVIEGVITGSVVHFHWVRSIIVHSSGEITTSALGCTGGVGRGKVLNNGLGGGGGHGGKGG 627

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  SF EGGV+YG+ DLPCE             TAGGGIIVMGSLEH L+SL++YGS
Sbjct: 628  EGYFDGSFIEGGVSYGDADLPCELGSGSGNDSLAGTTAGGGIIVMGSLEHLLSSLTVYGS 687

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            LRADGES+G+  RKQ +  +             T+LLF+HT+ +GD++V+          
Sbjct: 688  LRADGESFGEAIRKQAH--STISNIGPGGGSGGTILLFVHTIVLGDSSVISTAGGHGSPS 745

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    R+HFHWSDIPTG+EY P+ASV+G+I TRGG G  QG+ G+NGTITGKACP+G
Sbjct: 746  GGGGGGGGRVHFHWSDIPTGDEYLPIASVKGSIITRGGSGRAQGHTGENGTITGKACPKG 805

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GTFKNVSGSD  LC  CP ++LP RA+Y +VRGG+  +PCPY+CISERY
Sbjct: 806  LYGIFCEECPVGTFKNVSGSDRVLCLDCPSNKLPSRALYVNVRGGVTESPCPYKCISERY 865

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEEL+YTFGGPW                    RMK+V  DELP   P + GS
Sbjct: 866  HMPHCYTALEELVYTFGGPWLFGLILLGLLVLLALVLSVARMKYVGGDELPALVPARRGS 925

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            +I+HSFPFLESLNEVLETNR EESQ+HVHRMYFMGPNTF+EPWHLPHSPPEQV EIVYED
Sbjct: 926  RIDHSFPFLESLNEVLETNRTEESQTHVHRMYFMGPNTFTEPWHLPHSPPEQVIEIVYED 985

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEIN LAAYQWWEGSI+S+LSI+AYP AWS              EFVRS
Sbjct: 986  AFNRFVDEINGLAAYQWWEGSIYSILSILAYPLAWSWLQQCRKNKLQQLREFVRS 1040


>gb|EXB75637.1| hypothetical protein L484_026114 [Morus notabilis]
          Length = 1448

 Score =  950 bits (2456), Expect = 0.0
 Identities = 476/775 (61%), Positives = 544/775 (70%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKA+KM G+G+ISA            RVS+D++SRHD P I VHGG SY CPEN GA
Sbjct: 265  SIYIKAYKMTGSGRISACGGNGYAGGGGGRVSVDVFSRHDEPGIFVHGGSSYTCPENAGA 324

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V RSL ++NHN ST T+TLLLDFP+QPLWTNVYVR+ A A VPLLWSRVQVQG
Sbjct: 325  AGTLYDAVPRSLIIDNHNKSTDTETLLLDFPNQPLWTNVYVRNSAHATVPLLWSRVQVQG 384

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL  G VL FGL HY SSEFEL+AEELLMSDS +RVYGALRMS+KM LM NSKM+ID 
Sbjct: 385  QISLLSGGVLSFGLQHYASSEFELLAEELLMSDSEMRVYGALRMSVKMFLMWNSKMLIDG 444

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G D  VATSLLEASNL+ LKESSVI SNA                  IEAQRLVLSLFYS
Sbjct: 445  GGDMNVATSLLEASNLVVLKESSVIHSNANLGVHGQGLLNLSGPGDMIEAQRLVLSLFYS 504

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IH+GPGS L+ PLENA+TD +TPKLYCE QDCP ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 505  IHLGPGSALRGPLENASTDSVTPKLYCESQDCPFELLHPPEDCNVNSSLSFTLQICRVED 564

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            ITVEGL+KGSV+HFHRART+ V SSG+ISAS                             
Sbjct: 565  ITVEGLVKGSVIHFHRARTIAVHSSGSISASRMGCTGGIGRGSVLSNGIWSGGGHGGRGG 624

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  +   GG++YGN DLPCE             T+GGGIIVMGS+EH L +LSI GS
Sbjct: 625  RGCYDGTCIRGGISYGNADLPCELGSGSGNDSSAGSTSGGGIIVMGSMEHPLFTLSIEGS 684

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            + ADGES    +RK  Y +              T+L+FLH + +GD+A L          
Sbjct: 685  VEADGESSEGTSRKGKYAVVDGLIGGPGGGSGGTILMFLHIIALGDSATLSSIGGYGSPN 744

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIP G+ YQ +ASV+G+I   GG+   +G +G+NGT+TGKACP+G
Sbjct: 745  GVGGGGGGRIHFHWSDIPIGDVYQSIASVKGSINAGGGVSKGEGCSGENGTVTGKACPKG 804

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GT+KNVSGS+  LC  CP   LP+RA+YT VRGG+A TPCPY+C+S+RY
Sbjct: 805  LYGIFCEECPVGTYKNVSGSERDLCRPCPAEALPNRAVYTYVRGGVAETPCPYKCVSDRY 864

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 865  HMPHCYTALEELIYTFGGPWLFGLLLVALLILLALVLSVARMKFVGVDELPGPAPTQHGS 924

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNR EESQSHVHRMYFMGPNTFS+PWHLPHSPP+Q+ EIVYE 
Sbjct: 925  QIDHSFPFLESLNEVLETNRVEESQSHVHRMYFMGPNTFSDPWHLPHSPPDQIKEIVYEV 984

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVD+INA+AAYQWWEG+++S+LS+  YP AWS              EFVRS
Sbjct: 985  AFNTFVDDINAIAAYQWWEGAVYSILSVFVYPLAWSWQQWRRRLKLQRLREFVRS 1039


>ref|XP_002308587.2| hypothetical protein POPTR_0006s25110g [Populus trichocarpa]
            gi|550337045|gb|EEE92110.2| hypothetical protein
            POPTR_0006s25110g [Populus trichocarpa]
          Length = 1412

 Score =  949 bits (2452), Expect = 0.0
 Identities = 480/775 (61%), Positives = 550/775 (70%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SI +KA+KM G G+ISA            RVS+DI+SRHD+P+I VHGG S+GCPEN G 
Sbjct: 274  SILLKAYKMTGGGRISACGGNGFAGGGGGRVSVDIFSRHDDPQIFVHGGNSFGCPENAGG 333

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V RSLTV+NHN+ST TDTLLL+FP+QPLWTNVYVR+HA+A VPLLWSRVQVQG
Sbjct: 334  AGTLYDAVARSLTVSNHNMSTDTDTLLLEFPYQPLWTNVYVRNHARATVPLLWSRVQVQG 393

Query: 405  QLSLFC-GVLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+SL C GVL FGLAHY SSEFEL AEELLMSDS   VYGALRMS+KM LM NSKMIID 
Sbjct: 394  QISLLCSGVLSFGLAHYASSEFELFAEELLMSDS---VYGALRMSVKMFLMWNSKMIIDG 450

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G+D  VATSLLEASNL+ LKESSVI SNA                  IEAQRLVLSLFYS
Sbjct: 451  GEDVTVATSLLEASNLVVLKESSVIHSNANLGVHGQGLLNLSGSGNWIEAQRLVLSLFYS 510

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHV PGSVL+ P+ENAT+D +TP+L+C+ ++CP EL HPPEDCNVNSSLSFTLQICRVED
Sbjct: 511  IHVAPGSVLRGPVENATSDAITPRLHCQLEECPAELFHPPEDCNVNSSLSFTLQICRVED 570

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            ITVEGLI+GSVVHF++AR + V SSG ISAS                             
Sbjct: 571  ITVEGLIEGSVVHFNQARAISVPSSGTISASGMGCTGGVGRGNGLSNGIGSGGGHGGKGG 630

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  +  +GGV+YG+ +LPCE             TAGGGIIVMGSLEH L+SLS+ GS
Sbjct: 631  SACYNDNCVDGGVSYGDAELPCELGSGSGQENSSGSTAGGGIIVMGSLEHPLSSLSVEGS 690

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            +R DGES+    R Q   +              T+LLFLHTL +G+ AVL          
Sbjct: 691  VRVDGESFKGITRDQ-LVVMKGTAGGPGGGSGGTILLFLHTLDLGEHAVLSSVGGYGSPK 749

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    R+HFHWSDIPTG+ YQP+A V G+I+T GGLG D G+AG+NGT+TGKACP+G
Sbjct: 750  GGGGGGGGRVHFHWSDIPTGDMYQPIARVNGSIHTWGGLGRDDGHAGENGTVTGKACPKG 809

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYGIFCEECP GT+KNV+GS   LCH CP  +LP RA Y +VRGGIA TPCPY+C+SER+
Sbjct: 810  LYGIFCEECPVGTYKNVTGSSRVLCHSCPADDLPRRAAYIAVRGGIAETPCPYKCVSERF 869

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H PHCYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 870  HMPHCYTALEELIYTFGGPWLFCLLLLGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 929

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNRAEESQSHVHRMYFMG NTFSEPWHLPH+PPEQ+ EIVYE 
Sbjct: 930  QIDHSFPFLESLNEVLETNRAEESQSHVHRMYFMGRNTFSEPWHLPHTPPEQIKEIVYEG 989

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEIN +AAYQWWEG+I+ ++S++AYP AWS              EFVRS
Sbjct: 990  AFNTFVDEINGIAAYQWWEGAIYILVSVLAYPLAWSWQQWRRRIKLQRLREFVRS 1044


>ref|XP_006581468.1| PREDICTED: uncharacterized protein LOC100804207 [Glycine max]
          Length = 1447

 Score =  948 bits (2451), Expect = 0.0
 Identities = 478/775 (61%), Positives = 547/775 (70%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKA++M GNG ISA            RVS+D++SRHD P+I VHGG+S GCPEN GA
Sbjct: 266  SIYIKAYRMTGNGIISACGGNGFAGGGGGRVSVDVFSRHDEPKIYVHGGKSLGCPENAGA 325

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V RSL V+N+N++T T+TLLL+FP+QPLWTNVYVR+ A+A VPLLWSRVQVQG
Sbjct: 326  AGTLYDAVPRSLIVDNYNMTTDTETLLLEFPNQPLWTNVYVRNKARATVPLLWSRVQVQG 385

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+S+  G VL FGL HY +SEFEL+AEELLMSDS ++VYGALRMS+KM LM NSKM+ID 
Sbjct: 386  QISILQGGVLSFGLRHYATSEFELLAEELLMSDSVMKVYGALRMSVKMFLMWNSKMLIDG 445

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G+D  VATSLLEASNLI L+ +SVI SNA                  IEAQRLVLSLFYS
Sbjct: 446  GEDVTVATSLLEASNLIVLRGASVIHSNANLGVHGQGLLNLSGPGDWIEAQRLVLSLFYS 505

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PLENATTDD+TPKLYC ++DCP ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 506  IHVGPGSVLRGPLENATTDDVTPKLYCNNEDCPYELLHPPEDCNVNSSLSFTLQICRVED 565

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I VEGLIKGSVVHFHRART+ V+SSG ISAS                             
Sbjct: 566  ILVEGLIKGSVVHFHRARTISVESSGTISASGMGCTGGLGRGNTLTNGIGSGGGHGGTGG 625

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  +  EGG +YGN  LPCE             TAGGGIIV+GSLEH L+SLSI GS
Sbjct: 626  DAFYNDNHVEGGRSYGNATLPCELGSGSGIGNSTGSTAGGGIIVVGSLEHPLSSLSIQGS 685

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            + ADG ++    R + + +              T+L+FLH L +G +AVL          
Sbjct: 686  VNADGGNFEPQIRNEKFAIFDNFTGGPGGGSGGTILMFLHMLNIGQSAVLSSMGGYSSSN 745

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ Y P+ASVEG+I   GG G  QG +G NGTITGKACP+G
Sbjct: 746  GSGGGGGGRIHFHWSDIPTGDVYLPIASVEGDIQIWGGKGKGQGGSGANGTITGKACPKG 805

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYG FCEECP GT+KNV+GSD +LCH CP++ELPHRA+Y SVRGGI  TPCPY+C S+RY
Sbjct: 806  LYGTFCEECPAGTYKNVTGSDKSLCHSCPVNELPHRAVYISVRGGITETPCPYQCASDRY 865

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
              P CYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 866  LMPDCYTALEELIYTFGGPWLFGLFLIGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 925

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNR EESQSHVHRMYFMGPNTFSEPWHLPH+P EQ+ ++VYE 
Sbjct: 926  QIDHSFPFLESLNEVLETNRVEESQSHVHRMYFMGPNTFSEPWHLPHTPSEQIKDVVYES 985

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
             FN FVDEINA+AAYQWWEG+IHSVLS++AYP AWS              EFVRS
Sbjct: 986  EFNTFVDEINAIAAYQWWEGAIHSVLSVLAYPLAWSWQQWRRRLKLQRLREFVRS 1040


>ref|XP_003523758.1| PREDICTED: uncharacterized protein LOC100783686 [Glycine max]
          Length = 1447

 Score =  945 bits (2442), Expect = 0.0
 Identities = 475/775 (61%), Positives = 548/775 (70%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIKA++M GNG ISA            RVS+D++SRHD P+I VHGG+S GCPEN GA
Sbjct: 265  SIYIKAYRMTGNGIISACGGNGFAGGGGGRVSVDVFSRHDEPKIYVHGGKSLGCPENAGA 324

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V RSL V+N N++T T+TLLL+FP+QPLWTNVYVR+ A+A VPLLWSRVQVQG
Sbjct: 325  AGTLYDAVPRSLIVDNFNMTTDTETLLLEFPNQPLWTNVYVRNKARATVPLLWSRVQVQG 384

Query: 405  QLSLFCG-VLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+S+  G VL FGL HY +SEFEL+AEELLMSDS ++VYGALRMS+KM LM NSKM+ID 
Sbjct: 385  QISILQGGVLSFGLRHYATSEFELLAEELLMSDSVMKVYGALRMSVKMFLMWNSKMLIDG 444

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G+D  VATSLLEASNLI L+ +SVI SNA                  IEAQRLVLSLFYS
Sbjct: 445  GEDITVATSLLEASNLIVLRGASVIHSNANLGVHGQGLLNLSGPGDWIEAQRLVLSLFYS 504

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PLENATTDD+TPKLYC+ +DCP ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 505  IHVGPGSVLRGPLENATTDDVTPKLYCDKEDCPYELLHPPEDCNVNSSLSFTLQICRVED 564

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            I VEGLIKGSVVHFHRART+ V+SSG ISAS                             
Sbjct: 565  ILVEGLIKGSVVHFHRARTISVESSGTISASGMGCTGGLGHGNTLSNGIGSGGGHGGTGG 624

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                  +  +GG +YG+  LPCE             TAGGGIIV+GSLEH L+SLSI G 
Sbjct: 625  EAFYNDNHVKGGCSYGSATLPCELGSGSGNGNSTGTTAGGGIIVVGSLEHPLSSLSIQGY 684

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            ++A+G ++    R + + +              T+L+FLH LT+G +AVL          
Sbjct: 685  VKANGGNFEPQIRNEKFAIFDNFTGGPGGGSGGTILMFLHMLTIGKSAVLSSMGGYSSSN 744

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHWSDIPTG+ Y P+ASV+G+I   GG G  QG +G NGTITGKACP+G
Sbjct: 745  GSGGGGGGRIHFHWSDIPTGDVYLPIASVKGDIQIWGGKGKGQGGSGANGTITGKACPKG 804

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYG FCEECP GT+KNV+GSD +LCH CP++ELPHRA Y SVRGGI  TPCPY+C+S+RY
Sbjct: 805  LYGTFCEECPAGTYKNVTGSDKSLCHSCPVNELPHRAAYISVRGGITETPCPYQCVSDRY 864

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H P CYT LEELIY FGGPW                    RMKFV  DELPGPAPTQHGS
Sbjct: 865  HMPDCYTALEELIYRFGGPWLFGLFLMGLLILLALVLSVARMKFVGVDELPGPAPTQHGS 924

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNR EESQSHVHRMYFMGPNTFSEPWHLPH+P EQ+ ++VYE 
Sbjct: 925  QIDHSFPFLESLNEVLETNRVEESQSHVHRMYFMGPNTFSEPWHLPHTPSEQIKDVVYES 984

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
             FN FVDEINA+AAYQWWEG+IHSVLS++AYPFAWS              EFVRS
Sbjct: 985  EFNTFVDEINAIAAYQWWEGAIHSVLSVLAYPFAWSWQQWRRRLKLQRLREFVRS 1039


>ref|XP_004501087.1| PREDICTED: uncharacterized protein LOC101498285 [Cicer arietinum]
          Length = 1454

 Score =  940 bits (2429), Expect = 0.0
 Identities = 471/775 (60%), Positives = 549/775 (70%), Gaps = 1/775 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SIYIK ++M G+G I+A            R+S+D++SRHD P+I VHGGRS+ CPEN GA
Sbjct: 273  SIYIKGYRMIGSGMITACGGNGFAGGGGGRISVDVFSRHDEPKIYVHGGRSFACPENAGA 332

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQVQG 404
            AGT YD V RSL V+N N++T T+TLLL+FP+QPLWTNVYVR+ A+A VPLLWSRVQVQG
Sbjct: 333  AGTLYDAVPRSLIVDNFNMTTDTETLLLEFPYQPLWTNVYVRNKARATVPLLWSRVQVQG 392

Query: 405  QLSLF-CGVLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIIDV 581
            Q+S+   GVL FGL HY +SEFEL+AEELLMSDS ++VYGALRMS+KM LM NSKM+ID 
Sbjct: 393  QISILEGGVLSFGLPHYATSEFELLAEELLMSDSEMKVYGALRMSVKMFLMWNSKMLIDG 452

Query: 582  GDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFYS 761
            G+D  +ATSLLEASNLI L+ SSVI SNA                  IEAQRLVLSLFYS
Sbjct: 453  GEDITLATSLLEASNLIVLRGSSVIHSNANLGVHGQGLLNLSGPGDWIEAQRLVLSLFYS 512

Query: 762  IHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVED 941
            IHVGPGSVL+ PLENATTDD+TPKLYC ++DCP ELLHPPEDCNVNSSLSFTLQICRVED
Sbjct: 513  IHVGPGSVLRGPLENATTDDVTPKLYCNNKDCPYELLHPPEDCNVNSSLSFTLQICRVED 572

Query: 942  ITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1121
            + VEGLIKGSVVHFHRART+ ++SSG ISAS                             
Sbjct: 573  VLVEGLIKGSVVHFHRARTISIESSGTISASGMGCTGGLGHGHVLSNGIGSGGGYGGNGG 632

Query: 1122 XXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYGS 1301
                     EGG++YG PDLPCE             TAGGGIIV+GSL+H L+SLSI GS
Sbjct: 633  KACSNDYCVEGGISYGTPDLPCELGSGSGNDNSTGTTAGGGIIVIGSLDHPLSSLSIKGS 692

Query: 1302 LRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXXX 1481
            + ADGE++    R++ + +              TVLLFLHTL +G++A+L          
Sbjct: 693  VNADGENFDPAIRREKFLIFDNFTGGPGGGSGGTVLLFLHTLAIGESAILSSIGGYSGIS 752

Query: 1482 XXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPEG 1661
                    RIHFHW DIPTG+ YQP+ASV+G I + GG+G   G +G NGTI+GKACP+G
Sbjct: 753  GGGGGGGGRIHFHWFDIPTGDVYQPIASVKGVIQSGGGMGKGLGGSGANGTISGKACPKG 812

Query: 1662 LYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISERY 1841
            LYG FCEECP GT+KNV+GSD +LC  CP++ELPHRA+Y SVRGGI   PCPY+CIS+RY
Sbjct: 813  LYGTFCEECPAGTYKNVTGSDRSLCQVCPVNELPHRAVYISVRGGITEAPCPYQCISDRY 872

Query: 1842 HKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHGS 2021
            H P CYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHG 
Sbjct: 873  HMPDCYTALEELIYTFGGPWLFGLFLTGLLILLALVLSVARMKFVGVDELPGPAPTQHGC 932

Query: 2022 QINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYED 2201
            QI+HSFPFLESLNEVLETNR EESQSHVHRMYF+GPNTFSEPWHLPH+P EQ+ +IVYE 
Sbjct: 933  QIDHSFPFLESLNEVLETNRVEESQSHVHRMYFIGPNTFSEPWHLPHTPSEQIHDIVYES 992

Query: 2202 AFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
            AFN FVDEINA+AAYQWWEG+I+S LSI+AYP AWS              EFVRS
Sbjct: 993  AFNTFVDEINAIAAYQWWEGAIYSSLSILAYPLAWSWQQCRRRLKLQRLREFVRS 1047


>ref|XP_002324157.1| hypothetical protein POPTR_0018s04760g [Populus trichocarpa]
            gi|222865591|gb|EEF02722.1| hypothetical protein
            POPTR_0018s04760g [Populus trichocarpa]
          Length = 1416

 Score =  937 bits (2421), Expect = 0.0
 Identities = 477/776 (61%), Positives = 548/776 (70%), Gaps = 2/776 (0%)
 Frame = +3

Query: 45   SIYIKAHKMNGNGKISASXXXXXXXXXXXRVSIDIYSRHDNPEILVHGGRSYGCPENNGA 224
            SI++KA+KM G G ISA            RVS+DI+SRHD+P+I VHGG S GCP+N G 
Sbjct: 270  SIHLKAYKMTGGGSISACGGNGFAGGGGGRVSVDIFSRHDDPQIFVHGGNSLGCPKNAGG 329

Query: 225  AGTFYDTVLRSLTVNNHNVSTQTDTLLLDFPHQPLWTNVYVRDHAKAVVPLLWSRVQV-Q 401
            AGT YD V RSLTV+NHN+ST TDTLLL+FP+QPLWTNVYVR+H +A VPL WSRVQV Q
Sbjct: 330  AGTLYDAVARSLTVSNHNMSTDTDTLLLEFPYQPLWTNVYVRNHGRATVPLFWSRVQVVQ 389

Query: 402  GQLSLFC-GVLVFGLAHYPSSEFELMAEELLMSDSTIRVYGALRMSIKMLLMLNSKMIID 578
            GQ+SL C GVL FGLAHY SSEFEL+AEELLMSDS I+VYGALRMS+KM LM NS+M+ID
Sbjct: 390  GQISLLCSGVLSFGLAHYASSEFELLAEELLMSDSVIKVYGALRMSVKMFLMWNSQMLID 449

Query: 579  VGDDALVATSLLEASNLIALKESSVILSNAXXXXXXXXXXXXXXXXXXIEAQRLVLSLFY 758
             G+DA V TSLLEASNL+ LKESSVI SNA                  IEAQRLVLSLFY
Sbjct: 450  GGEDATVGTSLLEASNLVVLKESSVIHSNANLGVHGQGLLNLSGPGNWIEAQRLVLSLFY 509

Query: 759  SIHVGPGSVLQAPLENATTDDMTPKLYCEHQDCPMELLHPPEDCNVNSSLSFTLQICRVE 938
            SIHV PGSVL+ P+ENAT+D +TP+L+C+ ++CP ELLHPPEDCNVNSSLSFTLQ     
Sbjct: 510  SIHVAPGSVLRGPVENATSDAITPRLHCQLEECPSELLHPPEDCNVNSSLSFTLQ----- 564

Query: 939  DITVEGLIKGSVVHFHRARTVVVQSSGAISASXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1118
            DITVEGLI+GSVVHFHRART+ V SSG ISAS                            
Sbjct: 565  DITVEGLIEGSVVHFHRARTIYVPSSGTISASGMGCTGGVGRGNVLSNGVGSGGGHGGKG 624

Query: 1119 XXXXXXXSFAEGGVAYGNPDLPCEXXXXXXXXXXXXXTAGGGIIVMGSLEHSLTSLSIYG 1298
                      EGGV+YGN +LPCE             TAGGGIIVMGSLEH L+SLS+ G
Sbjct: 625  GSACYNDRCIEGGVSYGNAELPCELGSGSGEEMSAGSTAGGGIIVMGSLEHPLSSLSVDG 684

Query: 1299 SLRADGESYGQNARKQDYELTXXXXXXXXXXXXXTVLLFLHTLTVGDTAVLXXXXXXXXX 1478
            S+RADGES+    R Q   +              T+LLFLHTL +G  AVL         
Sbjct: 685  SVRADGESFKGITRDQ-LVVMNGTGGGPGGGSGGTILLFLHTLDLGGYAVLSSVGGYGSP 743

Query: 1479 XXXXXXXXXRIHFHWSDIPTGNEYQPLASVEGNIYTRGGLGMDQGNAGDNGTITGKACPE 1658
                     R+HFHWSDIPTG+ YQP+A V G+I+T GGLG D+G+AG+NGT++GKACP+
Sbjct: 744  KGGGGGGGGRVHFHWSDIPTGDVYQPIARVNGSIHTWGGLGRDEGHAGENGTVSGKACPK 803

Query: 1659 GLYGIFCEECPTGTFKNVSGSDITLCHQCPLSELPHRAIYTSVRGGIARTPCPYRCISER 1838
            GLYGIFCEECP GT+KNV+GSD  LC  CP  ++PHRA Y +VRGGIA TPCPY+C+S+R
Sbjct: 804  GLYGIFCEECPAGTYKNVTGSDRALCRPCPADDIPHRAAYVTVRGGIAETPCPYKCVSDR 863

Query: 1839 YHKPHCYTTLEELIYTFGGPWXXXXXXXXXXXXXXXXXXXXRMKFVENDELPGPAPTQHG 2018
            +H PHCYT LEELIYTFGGPW                    RMKFV  DELPGPAPTQHG
Sbjct: 864  FHMPHCYTALEELIYTFGGPWLFGLLLLGLLILLALVLSVARMKFVGVDELPGPAPTQHG 923

Query: 2019 SQINHSFPFLESLNEVLETNRAEESQSHVHRMYFMGPNTFSEPWHLPHSPPEQVTEIVYE 2198
            SQI+HSFPFLESLNEVLETNRAEESQSHVHRMYFMG NTFSEP HLPH+PPEQ+ EIVYE
Sbjct: 924  SQIDHSFPFLESLNEVLETNRAEESQSHVHRMYFMGRNTFSEPCHLPHTPPEQIKEIVYE 983

Query: 2199 DAFNIFVDEINALAAYQWWEGSIHSVLSIVAYPFAWSXXXXXXXXXXXXXXEFVRS 2366
             AFN FVDEIN +AAYQWWEG+I+S+LS++AYP AWS              EFVRS
Sbjct: 984  GAFNTFVDEINGIAAYQWWEGAIYSILSVLAYPLAWSWQQWRRRIKLQRLREFVRS 1039

