BLASTX nr result

ID: Akebia22_contig00003115 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00003115
         (2250 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prun...   599   e-168
gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]     593   e-167
ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cuc...   577   e-161
ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203...   575   e-161
ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family prot...   572   e-160
ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family prot...   572   e-160
ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated fact...   570   e-160
ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated fact...   566   e-158
ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phas...   566   e-158
ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304...   565   e-158
ref|XP_007225143.1| hypothetical protein PRUPE_ppa002485mg [Prun...   557   e-156
ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-...   556   e-155
ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated fact...   555   e-155
ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated fact...   555   e-155
ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citr...   555   e-155
ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated fact...   554   e-155
ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254...   548   e-153
ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citr...   533   e-148
ref|XP_007024309.1| Hydroxyproline-rich glycoprotein family prot...   528   e-147
ref|XP_002303312.2| hydroxyproline-rich glycoprotein [Populus tr...   525   e-146

>ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prunus persica]
            gi|462413813|gb|EMJ18862.1| hypothetical protein
            PRUPE_ppa002145mg [Prunus persica]
          Length = 709

 Score =  599 bits (1544), Expect = e-168
 Identities = 320/533 (60%), Positives = 376/533 (70%), Gaps = 4/533 (0%)
 Frame = -2

Query: 1847 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXX 1671
            +R SH +  PRD + SG RE GH  HG+P KQ    VP MP   A   P  +        
Sbjct: 185  DRGSHEKGAPRDVSVSGRREHGHLNHGVPQKQHKPPVPSMPVKKANGPPGRVETEEERRL 244

Query: 1670 XXXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPF 1491
                                  LK+SQN+V+ KT ++SSG KGHGSI GSR+GE++ATPF
Sbjct: 245  RKKREFEKQRQEEKHRQQ----LKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRATPF 299

Query: 1490 LSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPK 1311
            LSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+SL   K+QYTKYTITSLEK +KPK
Sbjct: 300  LSGERTENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYKPK 359

Query: 1310 LFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKG 1131
            LFVEPD+G+PLDLLD+ VYN  +   P              ATP+K +GIRRKERPTDKG
Sbjct: 360  LFVEPDLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDVAATPVKNNGIRRKERPTDKG 419

Query: 1130 VSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEAC 951
            V+WLVKTQYISPLS ++A+ SLTEKQAKELRE              E+QI++IEASFEAC
Sbjct: 420  VAWLVKTQYISPLSMDSARQSLTEKQAKELREMKGGRNILDNLNDRERQIKDIEASFEAC 479

Query: 950  KSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHES 771
            KSRPVHAT+  L PVEILPLLPDF+RY+DQFVLAAFDG PTADSEIYSKLD+S  D +ES
Sbjct: 480  KSRPVHATNKNLYPVEILPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAYES 539

Query: 770  HAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDAD 591
             AIMKS+   G++ A PEKFLAYMVPSP+EL KD YDE ED+ Y+WVREY +DVRGDD  
Sbjct: 540  RAIMKSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDDVH 599

Query: 590  DPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVE 411
            DPTTYLV+FDEEEARY PLPTKL+LRKKR KEG++++EVE +                +E
Sbjct: 600  DPTTYLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAAIE 659

Query: 410  LKESGDYVS---SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            LK+SGDY     SN K  R  +ED LE P    K+AR QD+D YSGAEDD+SD
Sbjct: 660  LKDSGDYSRGSVSNLKTRRFDVEDTLERP---RKIARHQDIDEYSGAEDDLSD 709


>gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]
          Length = 697

 Score =  593 bits (1530), Expect = e-167
 Identities = 317/544 (58%), Positives = 379/544 (69%), Gaps = 6/544 (1%)
 Frame = -2

Query: 1877 NQAEESRFHDNRRSHNREGPRDANGSGWRESGHSKH-GLPPKQKGSAVPMPSANAPRVPN 1701
            +Q +E+  H   +  +R   ++  GSG RE G+S H G   KQ     P+PS    +  N
Sbjct: 160  SQGKENVHHRGLQERDRGVSKEVAGSGRREHGYSNHHGTHHKQH--KYPVPSVPVKK-SN 216

Query: 1700 AIAXXXXXXXXXXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGS 1521
                                          Q LKESQ++ + KT I+S+  KGHGSI GS
Sbjct: 217  GPMGRVETEEERRLRKKREFEKQKQEEKHRQHLKESQHSALQKTQILSAA-KGHGSIAGS 275

Query: 1520 RVGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTI 1341
            R+GE++AT FLSGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL+S+   K+QY+KYTI
Sbjct: 276  RMGERRATSFLSGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMSMKREKDQYSKYTI 335

Query: 1340 TSLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGI 1161
            TSLEK +KPKLFVEPD+G+PL+LLD+ VYN  +   P               TP+K+DGI
Sbjct: 336  TSLEKTYKPKLFVEPDLGIPLNLLDLSVYNPPSVRPPLDPEDEELLRDDEAVTPVKKDGI 395

Query: 1160 RRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQI 981
            +RKERPTDKGV+WLVKTQYISPLS E+ K SLTEKQAKELRE              ++QI
Sbjct: 396  KRKERPTDKGVAWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNDRDRQI 455

Query: 980  QEIEASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 801
            +EI+ASFEACKSRPVHAT+  L PVE+LPLLPDFDRYDDQFVLAAFD  PTADSE+YSK+
Sbjct: 456  KEIQASFEACKSRPVHATNKSLYPVEVLPLLPDFDRYDDQFVLAAFDSAPTADSEVYSKM 515

Query: 800  DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 621
            D+S+RD HES A++KS+   GS+   PEKFLAYMVPSPDEL KD+YDEHED+ Y+WVREY
Sbjct: 516  DQSIRDAHESQAVLKSYKVTGSDPGNPEKFLAYMVPSPDELSKDIYDEHEDVSYSWVREY 575

Query: 620  QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 441
             WDVRGDDADDPTTYLV+FDE EARYLPLPTKL+LRKKR KEGRS +EVE +        
Sbjct: 576  HWDVRGDDADDPTTYLVSFDETEARYLPLPTKLVLRKKRAKEGRSGDEVEHFPVPARVTV 635

Query: 440  XXXXXXXXVELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAE 276
                    VELK++  Y +     SN KRG S +EDGLE     HKVAR +D+D YSGAE
Sbjct: 636  RRRPTVSVVELKDAEVYSNPRGSLSNFKRGGSDVEDGLER---SHKVARQEDVDEYSGAE 692

Query: 275  DDMS 264
            DD+S
Sbjct: 693  DDLS 696


>ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  577 bits (1486), Expect = e-161
 Identities = 313/534 (58%), Positives = 374/534 (70%), Gaps = 5/534 (0%)
 Frame = -2

Query: 1847 NRRSHNRE--GPRDAN-GSGWRE-SGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXX 1680
            N  +H R+   P+D + G   RE S H KH     QK S  PMP    P+  N  +    
Sbjct: 189  NMGAHERDKGAPKDPSYGRRDRENSNHDKH-----QKHSGPPMP----PKKANGPSGRME 239

Query: 1679 XXXXXXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKA 1500
                                     LKESQNT++ KT ++S+G K HGSIVGSR+GE+KA
Sbjct: 240  TDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKA 298

Query: 1499 TPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMH 1320
            TPFLSGERIENRLKKPTTFLCKLKFRNELPD +AQPKL+SL   K+ YT+YTITSLEK +
Sbjct: 299  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTY 358

Query: 1319 KPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDG-IRRKERP 1143
            KP+L+VEPD+G+PLDLLD+ VYN ++   P               TP+K+DG I+RKERP
Sbjct: 359  KPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERP 418

Query: 1142 TDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEAS 963
            TDKGV+WLVKTQYISPLS E+AK SLTEKQAKELRE              E+QI+EIE S
Sbjct: 419  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETS 478

Query: 962  FEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRD 783
            FEACKSRP+HAT+  L PVE+LPLLPDFDRYDD FV+ AFD  PTADSE ++KLD+S+RD
Sbjct: 479  FEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD 538

Query: 782  DHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRG 603
             HES AIMKS++  GS+ +KPEKFLAYMVPSPDEL KD+YDE ED+ Y+WVREY WDVRG
Sbjct: 539  AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG 598

Query: 602  DDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXX 423
            D+ DDPTTYLV+FD+ EARY+PLPTKL+LRKKR KEGRS++EVE +              
Sbjct: 599  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTV 658

Query: 422  XXVELKESGDYVSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
              +E+K+ G Y  SNSKRG S IEDG+      HK  R QDMD +SGAED+MSD
Sbjct: 659  ATLEVKDPGIY--SNSKRG-SDIEDGIGR---SHKHDRNQDMDQFSGAEDEMSD 706


>ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  575 bits (1482), Expect = e-161
 Identities = 313/534 (58%), Positives = 374/534 (70%), Gaps = 5/534 (0%)
 Frame = -2

Query: 1847 NRRSHNREG--PRDAN-GSGWRE-SGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXX 1680
            N  +H R+   P+D + G   RE S H KH     QK S  PMP    P+  N  +    
Sbjct: 189  NMGAHERDKGVPKDPSYGRRDRENSNHDKH-----QKHSGPPMP----PKKANGPSGRME 239

Query: 1679 XXXXXXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKA 1500
                                     LKESQNT++ KT ++S+G K HGSIVGSR+GE+KA
Sbjct: 240  TDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKA 298

Query: 1499 TPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMH 1320
            TPFLSGERIENRLKKPTTFLCKLKFRNELPD +AQPKL+SL   K+ YT+YTITSLEK +
Sbjct: 299  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTY 358

Query: 1319 KPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDG-IRRKERP 1143
            KP+L+VEPD+G+PLDLLD+ VYN ++   P               TP+K+DG I+RKERP
Sbjct: 359  KPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERP 418

Query: 1142 TDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEAS 963
            TDKGV+WLVKTQYISPLS E+AK SLTEKQAKELRE              E+QI+EIEAS
Sbjct: 419  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEAS 478

Query: 962  FEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRD 783
            FEACKSRP+HAT+  L PVE+LPLLPDFDRYDD FV+ AFD  PTADSE ++KLD+S+RD
Sbjct: 479  FEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD 538

Query: 782  DHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRG 603
             HES AIMKS++   S+ +KPEKFLAYMVPSPDEL KD+YDE ED+ Y+WVREY WDVRG
Sbjct: 539  AHESQAIMKSYMATSSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG 598

Query: 602  DDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXX 423
            D+ DDPTTYLV+FD+ EARY+PLPTKL+LRKKR KEGRS++EVE +              
Sbjct: 599  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTV 658

Query: 422  XXVELKESGDYVSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
              +E+K+ G Y  SNSKRG S IEDG+      HK  R QDMD +SGAED+MSD
Sbjct: 659  ATLEVKDPGIY--SNSKRG-SDIEDGIGR---SHKHDRHQDMDQFSGAEDEMSD 706


>ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508779674|gb|EOY26930.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 562

 Score =  572 bits (1475), Expect = e-160
 Identities = 313/534 (58%), Positives = 367/534 (68%), Gaps = 5/534 (0%)
 Frame = -2

Query: 1847 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXX 1668
            N RS    G RD  GSG RE GHS H    + +   +P       + PN  A        
Sbjct: 45   NERSQG--GNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAGRVETEEE 97

Query: 1667 XXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 1488
                               Q +KESQ     KT +M SG KGHGS+VGSR+G+++ATPFL
Sbjct: 98   RRLRKKREFEKQRQEEKHRQQMKESQ-----KTQMMPSG-KGHGSMVGSRMGDRRATPFL 151

Query: 1487 SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 1308
            SGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL++L   K+++TKYTITSLEKM+KPKL
Sbjct: 152  SGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLEKMYKPKL 211

Query: 1307 FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGV 1128
            FVEPD+G+PLDLLD+ VYN  +                   TPIK+DGIRRKERPTDKGV
Sbjct: 212  FVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGV 271

Query: 1127 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACK 948
            SWLVKTQYISPLS E+ K SLTEKQAKELRE              E+QI+EIEASFEA K
Sbjct: 272  SWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIEASFEASK 331

Query: 947  SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 768
             RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SVRD+HES 
Sbjct: 332  LRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESR 391

Query: 767  AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 588
            AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDVRGDDA+D
Sbjct: 392  AIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDAND 451

Query: 587  PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVEL 408
            PTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +                +EL
Sbjct: 452  PTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIEL 511

Query: 407  KESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            KE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S+
Sbjct: 512  KEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLSE 562


>ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508779673|gb|EOY26929.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 685

 Score =  572 bits (1475), Expect = e-160
 Identities = 313/534 (58%), Positives = 367/534 (68%), Gaps = 5/534 (0%)
 Frame = -2

Query: 1847 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXX 1668
            N RS    G RD  GSG RE GHS H    + +   +P       + PN  A        
Sbjct: 168  NERSQG--GNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAGRVETEEE 220

Query: 1667 XXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 1488
                               Q +KESQ     KT +M SG KGHGS+VGSR+G+++ATPFL
Sbjct: 221  RRLRKKREFEKQRQEEKHRQQMKESQ-----KTQMMPSG-KGHGSMVGSRMGDRRATPFL 274

Query: 1487 SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 1308
            SGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL++L   K+++TKYTITSLEKM+KPKL
Sbjct: 275  SGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLEKMYKPKL 334

Query: 1307 FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGV 1128
            FVEPD+G+PLDLLD+ VYN  +                   TPIK+DGIRRKERPTDKGV
Sbjct: 335  FVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGV 394

Query: 1127 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACK 948
            SWLVKTQYISPLS E+ K SLTEKQAKELRE              E+QI+EIEASFEA K
Sbjct: 395  SWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIEASFEASK 454

Query: 947  SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 768
             RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SVRD+HES 
Sbjct: 455  LRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESR 514

Query: 767  AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 588
            AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDVRGDDA+D
Sbjct: 515  AIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDAND 574

Query: 587  PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVEL 408
            PTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +                +EL
Sbjct: 575  PTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIEL 634

Query: 407  KESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            KE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S+
Sbjct: 635  KEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLSE 685


>ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated factor 1 homolog [Vitis
            vinifera]
          Length = 589

 Score =  570 bits (1470), Expect = e-160
 Identities = 308/535 (57%), Positives = 373/535 (69%), Gaps = 9/535 (1%)
 Frame = -2

Query: 1838 SHNRE--GPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXX 1665
            SH R+   P+D  G+G RE GHS  G  P  K    P+P A   +  N            
Sbjct: 64   SHGRDKGAPKDLRGAGRREPGHSNQG--PSGKQQKPPVPPAPVKK-SNGPPGRVETEEER 120

Query: 1664 XXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVG-SRVGEKKATPFL 1488
                                LKESQNTV+ KT ++SSG KGHGS+VG SR+GE++ TPFL
Sbjct: 121  RLRKKREFEKQRQEEKQKHQLKESQNTVLQKTQMLSSG-KGHGSVVGGSRMGERRTTPFL 179

Query: 1487 SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 1308
            SG+RIENRL+KPTTFLCKLKFRNELPDPTAQPKL++L T K+++TKYTITSLEKMHKP+L
Sbjct: 180  SGDRIENRLRKPTTFLCKLKFRNELPDPTAQPKLMALKTDKDRFTKYTITSLEKMHKPQL 239

Query: 1307 FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGV 1128
            FVEPD+G+PLDLLD+ VYN  +   P               TP+K++GI++KERPTDKGV
Sbjct: 240  FVEPDLGIPLDLLDLSVYNPPSVRRPLDPEDEELLRDDESVTPVKKEGIKKKERPTDKGV 299

Query: 1127 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACK 948
            SWLVKTQYISPLSTE+ K SLTEKQAKELRET             E++IQ IEA+F A K
Sbjct: 300  SWLVKTQYISPLSTESTKQSLTEKQAKELRETKGGRNILENFNSRERKIQNIEAAFAASK 359

Query: 947  SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 768
              PVH+T+  L+PVEILPLLPDF RYDD FV+A+FD  PTADSEIYSKLD++VRD HES 
Sbjct: 360  ITPVHSTNKSLKPVEILPLLPDFARYDDSFVVASFDSAPTADSEIYSKLDKTVRDSHESQ 419

Query: 767  AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 588
            AI+KS++  GS+ +KPEKFLAYM PSPDEL KD+YDE+ED  Y+WVREY WDVRGDDADD
Sbjct: 420  AILKSYMATGSDPSKPEKFLAYMAPSPDELSKDIYDENEDTSYSWVREYHWDVRGDDADD 479

Query: 587  PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVEL 408
            PTTYLV+F++ +ARYLPLPTKL+LRKKR KEGRS++EVE +                +EL
Sbjct: 480  PTTYLVSFNKTDARYLPLPTKLLLRKKRAKEGRSSDEVEHFPVPSKVTVRQRPNVAAIEL 539

Query: 407  KESGDYVSSNSKRGRSA------IEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            K+  + V S+SKRG S+      +EDGL      +K  + Q MD  SGAED+MSD
Sbjct: 540  KD--EEVYSSSKRGVSSSKRGVDMEDGLGR---SYKGVQDQHMDQSSGAEDEMSD 589


>ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Glycine max] gi|571464391|ref|XP_006583049.1| PREDICTED:
            RNA polymerase II-associated factor 1 homolog isoform X2
            [Glycine max]
          Length = 659

 Score =  566 bits (1459), Expect = e-158
 Identities = 301/522 (57%), Positives = 360/522 (68%), Gaps = 3/522 (0%)
 Frame = -2

Query: 1817 RDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXXXXXXXXXXX 1638
            ++ + SG RE  HS HG+  KQ     P+P     ++ N                     
Sbjct: 145  KEPSTSGRREYEHSNHGIAHKQHKQQPPVP---VKKMNNGPPGRAETDEEKRLRKKREFE 201

Query: 1637 XXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLK 1458
                     Q LKESQNTV+ KTH++SSG KGHG I GSR+GE+++TP L  ER+ENRLK
Sbjct: 202  KQRQEEKHRQQLKESQNTVLQKTHMLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRLK 260

Query: 1457 KPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPL 1278
            KPTTFLCKLKFRNELPDP+AQPKL++    K+QY KYTITSLEKM+KPKLFVEPD+G+PL
Sbjct: 261  KPTTFLCKLKFRNELPDPSAQPKLMASKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPL 320

Query: 1277 DLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGVSWLVKTQYIS 1098
            DLLD+ VYN  +   P               TPIK+DGI+RKERPTDKGV+WLVKTQYIS
Sbjct: 321  DLLDLSVYNPPSVRPPLAPEDKELLRDDEAVTPIKKDGIKRKERPTDKGVAWLVKTQYIS 380

Query: 1097 PLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACKSRPVHATSDK 918
            PLS E+ K SLTEKQAKELRE              E+QI+EIEASFEA KS PVHAT+  
Sbjct: 381  PLSMESTKQSLTEKQAKELREMKGGRGILDNLNSRERQIREIEASFEAAKSDPVHATNKD 440

Query: 917  LQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAG 738
            L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+++K+D+SVRD  ES A+MKS++   
Sbjct: 441  LYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMHAKMDKSVRDAFESKAVMKSYVATS 500

Query: 737  SEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDE 558
            S+ A PEKFLAYMVP+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDP T+LV FDE
Sbjct: 501  SDPANPEKFLAYMVPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPATFLVAFDE 560

Query: 557  EEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVELKESGDYVSS- 381
             EARYLPLPTKL+LRKKR KEGRS +EVEQ                 +E K+SG Y SS 
Sbjct: 561  SEARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSK 620

Query: 380  --NSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
              +SKRG   ++DGLE    +H+ A  QD    SGAED MSD
Sbjct: 621  GNSSKRGGLEMDDGLE---DQHRGAPHQDNYQSSGAEDYMSD 659


>ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phaseolus vulgaris]
            gi|561008678|gb|ESW07627.1| hypothetical protein
            PHAVU_010G145300g [Phaseolus vulgaris]
          Length = 661

 Score =  566 bits (1458), Expect = e-158
 Identities = 303/531 (57%), Positives = 361/531 (67%), Gaps = 5/531 (0%)
 Frame = -2

Query: 1838 SHNREGPR--DANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXX 1665
            +HN E  R  D + SG RE   S HG+  KQ     P+P+      P             
Sbjct: 139  THNNEERRFKDPSTSGRREYDPSNHGIGHKQHKHQPPVPAKKVNGPPGRAETEEEKRLRK 198

Query: 1664 XXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLS 1485
                                LKESQNTV+ KTH++SSG KGHG + GSR+GE+++TP LS
Sbjct: 199  KREFEKQRQEEKHRQQ----LKESQNTVLQKTHLLSSG-KGHGLVAGSRMGERRSTPLLS 253

Query: 1484 GERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLF 1305
             ER+ENRLKKPTTFLCKLKFRNELPDP+AQPKL++    K+QY KYTITSLEKM+KPKLF
Sbjct: 254  AERVENRLKKPTTFLCKLKFRNELPDPSAQPKLMAFKKDKDQYAKYTITSLEKMYKPKLF 313

Query: 1304 VEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGVS 1125
            VEPD+G+PLDLLD+ VYN  +   P              ATPIK+DGI+RKERPTDKGV+
Sbjct: 314  VEPDLGIPLDLLDLSVYNPPSVRPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVA 373

Query: 1124 WLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACKS 945
            WLVKTQYISPLS E+ K SLTEKQAKELRE              E+QI+EIEASFEA KS
Sbjct: 374  WLVKTQYISPLSMESTKQSLTEKQAKELREMKGGRGVLDNLNSRERQIREIEASFEAAKS 433

Query: 944  RPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHA 765
             PVHAT+  L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+Y+KLD+SVRD  ES A
Sbjct: 434  DPVHATNKDLYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKLDKSVRDAFESKA 493

Query: 764  IMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDP 585
            +MKS++   S+ A PEKFLAYM P+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDP
Sbjct: 494  VMKSYVATSSDPANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDP 553

Query: 584  TTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVELK 405
            TT+ V FD+ EARYLPLPTKL+LRKKR KEGRS EE+EQ                 +E K
Sbjct: 554  TTFFVAFDDSEARYLPLPTKLVLRKKRAKEGRSGEEIEQCPVPSRVTVRRRSSVAAIERK 613

Query: 404  ESGDYVSS---NSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            ++G Y SS   +SKR R  ++DGLE     H+ A  QD    SGAED MS+
Sbjct: 614  DTGVYTSSRGNSSKRSRLEMDDGLE---HHHRGAPHQDNYQSSGAEDYMSE 661


>ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304396 [Fragaria vesca
            subsp. vesca]
          Length = 693

 Score =  565 bits (1455), Expect = e-158
 Identities = 312/543 (57%), Positives = 365/543 (67%), Gaps = 5/543 (0%)
 Frame = -2

Query: 1874 QAEESRFHDNRRSHNREGPRDANGSGWRESGHSKH-GLPPKQKGSAVPMPSANAPRVPNA 1698
            ++ ES F  ++  H++   +D   S  RE GHS H G+PPK K      P     +  N 
Sbjct: 163  KSRESGF--DKGPHDKGASKDVGASAKREHGHSNHHGVPPKHK------PPVPLVKKSNG 214

Query: 1697 IAXXXXXXXXXXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSR 1518
                                         Q  KESQN+V+ KTH+MSSG KGHGSI GSR
Sbjct: 215  APGRVETEEERRLRKKREFEKQRQEEKHRQQAKESQNSVLQKTHLMSSG-KGHGSIAGSR 273

Query: 1517 VGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTIT 1338
            +GE++ TPFLSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+S+    +QYTKYTIT
Sbjct: 274  MGERRTTPFLSGERAENRLKKPTTFVCKLKFRNELPDPSAQPKLMSMKKDPDQYTKYTIT 333

Query: 1337 SLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTTGTPH-TXXXXXXXXXXXXATPIKQDGI 1161
            SLEK +KPKLFVEPD+G+PLDLLD+ VYN      P                TP+K+DGI
Sbjct: 334  SLEKNYKPKLFVEPDLGIPLDLLDLSVYNPPPGPRPPLAPEDEELLRDDVAVTPVKKDGI 393

Query: 1160 RRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQI 981
            RRKERPTDKGV+WLVKTQYISPLS ++AK SLTEKQAKELRE              E+QI
Sbjct: 394  RRKERPTDKGVAWLVKTQYISPLSMDSAKQSLTEKQAKELREMKGGRNLLDNLNDRERQI 453

Query: 980  QEIEASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 801
            +EIEASFEACKSRPVHAT+  L PVE+LPLLP  +RY+DQFVLA FDG PTADSEIYSKL
Sbjct: 454  KEIEASFEACKSRPVHATNKNLYPVEVLPLLPXHNRYEDQFVLAGFDGAPTADSEIYSKL 513

Query: 800  DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 621
            D+S  D  ES AIMKS+   G++ A P+KFLAYMVPSP+EL KD YDE EDI Y+WVREY
Sbjct: 514  DQSDHDLCESRAIMKSYKVTGADPANPDKFLAYMVPSPNELSKDPYDESEDISYSWVREY 573

Query: 620  QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 441
            Q+DVRGDD DD TTYLV+FDE+ ARY PLP KL+LRKKR KEGRS +EVE +        
Sbjct: 574  QYDVRGDDVDDLTTYLVSFDEDAARYAPLPAKLVLRKKRAKEGRSTDEVEHFPAPSRVTV 633

Query: 440  XXXXXXXXVELKESGDY---VSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDD 270
                    +ELK++GDY     SN KR     ED LE P    K  R QD+D YSGAEDD
Sbjct: 634  RRRSTVSAIELKDAGDYSRGALSNLKRRGFDNEDALERP---QKRGRHQDVDEYSGAEDD 690

Query: 269  MSD 261
            +SD
Sbjct: 691  LSD 693


>ref|XP_007225143.1| hypothetical protein PRUPE_ppa002485mg [Prunus persica]
            gi|462422079|gb|EMJ26342.1| hypothetical protein
            PRUPE_ppa002485mg [Prunus persica]
          Length = 668

 Score =  557 bits (1436), Expect = e-156
 Identities = 306/535 (57%), Positives = 362/535 (67%), Gaps = 6/535 (1%)
 Frame = -2

Query: 1847 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP---MPSANAPRVPNAIAXXXXX 1677
            +R SH +   R+ + SG  E GH  HG+P KQ    VP   +  AN P  P  +      
Sbjct: 160  DRGSHEKVASREVSVSGRGEHGHLNHGVPQKQHKPPVPSMQVKKANGP--PGRVETEEER 217

Query: 1676 XXXXXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKAT 1497
                                    LK+SQN+V+ KT ++SSG KGHGSI GSR+GE++AT
Sbjct: 218  RLRKKREFEKQRQEEKHRQQ----LKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRAT 272

Query: 1496 PFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHK 1317
            PFLSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+SL   K+QYTKYTITSLEK +K
Sbjct: 273  PFLSGERTENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYK 332

Query: 1316 PKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTD 1137
            PKLFVEPD+G+PLDLLD+ VYN  +   P              ATP+K++GI+RKERPTD
Sbjct: 333  PKLFVEPDLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDVAATPVKKNGIKRKERPTD 392

Query: 1136 KGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFE 957
            KGV+WL                SLTEKQAKELRE              E+QI+EIEASFE
Sbjct: 393  KGVAWL----------------SLTEKQAKELREMKGGRNILDNLNDRERQIKEIEASFE 436

Query: 956  ACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDH 777
            ACKSRPVHAT+  L PVE+LPLLPDF+RY+DQFVLAAFDG PTADSEIYSKLD+S  D +
Sbjct: 437  ACKSRPVHATNKDLYPVEVLPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAY 496

Query: 776  ESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDD 597
            ES AIMKS+   G++ A PEKFLAYMVPSP+EL KD YDE ED+ Y+WVREY +DVRGDD
Sbjct: 497  ESRAIMKSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDD 556

Query: 596  ADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXX 417
              DPTTYLV+FDEEEARY PLPTKL+LRKKR KEG++++EVE +                
Sbjct: 557  VHDPTTYLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAA 616

Query: 416  VELKESGDYVS---SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            +ELK+SGDY     SN K  R  IED LE P    K+AR QD+D YSGAEDD+SD
Sbjct: 617  IELKDSGDYSRGSVSNLKTRRFDIEDTLERP---RKIARHQDIDEYSGAEDDLSD 668


>ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-like isoform X1 [Glycine
            max] gi|571472317|ref|XP_006585570.1| PREDICTED:
            bromodomain-containing protein 4-like isoform X2 [Glycine
            max]
          Length = 666

 Score =  556 bits (1432), Expect = e-155
 Identities = 305/543 (56%), Positives = 362/543 (66%), Gaps = 4/543 (0%)
 Frame = -2

Query: 1877 NQAEESRFHDNRRSHNREGPRDANGSGWRESGHSKHGLPPKQ-KGSAVPMPSANAPRVPN 1701
            N  EE RF            ++ + SG RE  HS HG+  KQ K    P+P     ++ N
Sbjct: 144  NNNEERRF------------KEPSKSGRREYEHSNHGIAHKQHKQQQPPLP---VKKMNN 188

Query: 1700 AIAXXXXXXXXXXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGS 1521
                                          Q LKESQNTV+ KTH++SSG KGHG I GS
Sbjct: 189  GPPGRAETDEEKRLRKKREFEKQRQEEKHRQQLKESQNTVLQKTHLLSSG-KGHGMIAGS 247

Query: 1520 RVGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTI 1341
            R+GE+++TP L  ER+ENRLKKPTTFLCKLKFRNELPDP+AQPKL+S    K+QY KYTI
Sbjct: 248  RMGERRSTPLLGAERVENRLKKPTTFLCKLKFRNELPDPSAQPKLMSFKKDKDQYAKYTI 307

Query: 1340 TSLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGI 1161
            TSLEKM+KPKLFVEPD+G+PLDLLD+ VYN      P              ATPIK+DGI
Sbjct: 308  TSLEKMYKPKLFVEPDLGIPLDLLDLSVYNPPRVRPPLAPEDEELLRDDEAATPIKKDGI 367

Query: 1160 RRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQI 981
            +RKERPTDKGV+WLVKTQYISPLS E+ K SLTEKQAKELRE               +QI
Sbjct: 368  KRKERPTDKGVAWLVKTQYISPLSMESTKQSLTEKQAKELREMKGRGILDNLNSRE-RQI 426

Query: 980  QEIEASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 801
            +EI+ASFEA KS PVHAT+  L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+Y+K+
Sbjct: 427  REIQASFEAAKSDPVHATNKDLYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKM 486

Query: 800  DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 621
            ++SVRD  ES A+MKS++  G + A PEKFLAYM P+P EL KD+YDE+ED+ Y+W+REY
Sbjct: 487  NKSVRDAFESKAVMKSYVATGLDPANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREY 546

Query: 620  QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 441
             WDVRGDDADDPTT+LV FDE EARYLPLPTKL+LRKKR KEGRS +EVEQ         
Sbjct: 547  HWDVRGDDADDPTTFLVAFDESEARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTV 606

Query: 440  XXXXXXXXVELKESGDYVSSNS---KRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDD 270
                    +E K+SG Y SS     KR    ++DGLE    +H+ A  QD    SGAED 
Sbjct: 607  RRRSSVAAIERKDSGVYTSSKGNSFKRVGLEMDDGLE---DQHRGAPHQDNYQSSGAEDY 663

Query: 269  MSD 261
            MSD
Sbjct: 664  MSD 666


>ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Citrus sinensis]
          Length = 576

 Score =  555 bits (1430), Expect = e-155
 Identities = 283/453 (62%), Positives = 340/453 (75%), Gaps = 5/453 (1%)
 Frame = -2

Query: 1604 LKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 1425
            +KESQN VM K+ +++SG  GHGS+ GSR+G+++A P LSGERIENRLKKPTTFLCKLKF
Sbjct: 127  MKESQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKF 186

Query: 1424 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSN 1245
            RNELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+ VYN  
Sbjct: 187  RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 246

Query: 1244 TTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSL 1065
            +   P               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+A+ SL
Sbjct: 247  SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 306

Query: 1064 TEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLP 885
            TEKQAKELRE              E+QI+EIEASFEACK RP+HAT+  LQPVEILPLLP
Sbjct: 307  TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 366

Query: 884  DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 705
            DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLA
Sbjct: 367  DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 426

Query: 704  YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 525
            YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK
Sbjct: 427  YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 486

Query: 524  LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVELKESGDYV-----SSNSKRGRS 360
            L LRKKR  EGRSN+EVE +                +ELKE G Y      SS+SK GR 
Sbjct: 487  LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRV 546

Query: 359  AIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
              ++ LE     H  +R QD    SGAEDDM D
Sbjct: 547  DSQEDLER---SHNGSRHQDPYQSSGAEDDMYD 576


>ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Solanum
            tuberosum]
          Length = 700

 Score =  555 bits (1430), Expect = e-155
 Identities = 297/537 (55%), Positives = 358/537 (66%), Gaps = 8/537 (1%)
 Frame = -2

Query: 1847 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXX 1671
            ++R+ +R        SGWRESGH  H    KQ G +VP MP   +    NA +       
Sbjct: 171  DQRNESRPSAEKRRESGWRESGHGNHTARSKQPGHSVPPMPVKKS----NAPSGRVETEE 226

Query: 1670 XXXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPF 1491
                                Q LKESQN V+ KT +++SG KGHGSI  S + +++  P 
Sbjct: 227  ERRLRKKREIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTAPL 286

Query: 1490 LSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPK 1311
            LSGER ENRLKKPTTFLCKLKFRNELPDPTAQPKLL+L    +++TKY+ITSLEKMHKP+
Sbjct: 287  LSGERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQ 346

Query: 1310 LFVEPDIGVPLDLLDICVYNS-NTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDK 1134
            L+VEPD+G+PLDLLD+ VYN       P               TPIK+DGI++KERPTDK
Sbjct: 347  LYVEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDK 406

Query: 1133 GVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEA 954
            GVSWLVKTQYISPLSTE+AK SLTEKQAKELRET             ++QIQEIEASFEA
Sbjct: 407  GVSWLVKTQYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEA 466

Query: 953  CKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHE 774
            CKSRP+HAT+ +LQPV++ PL PDFDRY D FVLA +D  PTADSE Y+KLD++VRD  E
Sbjct: 467  CKSRPIHATNRRLQPVKVQPLYPDFDRYKDPFVLANYDSAPTADSETYNKLDKTVRDACE 526

Query: 773  SHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDA 594
            S A+MKSF+   S+  KP+KFLAYMVP+P+EL KD+YDE+EDI Y+WVREY WDVRGDDA
Sbjct: 527  SQAVMKSFVATSSDADKPDKFLAYMVPAPNELSKDMYDENEDISYSWVREYHWDVRGDDA 586

Query: 593  DDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXV 414
            DDP TY+V F E EARY+PLPTKL+LRKKR +EG+SNEEVE +                +
Sbjct: 587  DDPNTYVVAFGETEARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAI 646

Query: 413  ELKESGDYVS------SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            ELKE G Y +      S+SKR R + ED +     +H      D D  SG E  MSD
Sbjct: 647  ELKEEGGYTTALKGNVSSSKRSRISHEDDVG---EQHNNMHDDDQDQSSGGEYYMSD 700


>ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528867|gb|ESR40117.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 677

 Score =  555 bits (1430), Expect = e-155
 Identities = 283/453 (62%), Positives = 340/453 (75%), Gaps = 5/453 (1%)
 Frame = -2

Query: 1604 LKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 1425
            +KESQN VM K+ +++SG  GHGS+VGSR+G+++A P LSGER ENRLKKPTTFLCKLKF
Sbjct: 228  MKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKF 287

Query: 1424 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSN 1245
            RNELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+ VYN  
Sbjct: 288  RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 347

Query: 1244 TTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSL 1065
            +   P               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+A+ SL
Sbjct: 348  SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 407

Query: 1064 TEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLP 885
            TEKQAKELRE              E+QI+EIEASFEACK RP+HAT+  LQPVEILPLLP
Sbjct: 408  TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 467

Query: 884  DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 705
            DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLA
Sbjct: 468  DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 527

Query: 704  YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 525
            YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK
Sbjct: 528  YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 587

Query: 524  LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVELKESGDYV-----SSNSKRGRS 360
            L LRKKR  EGRSN+EVE +                +ELKE G Y      SS+SK GR 
Sbjct: 588  LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRV 647

Query: 359  AIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
              ++ LE     H  +R QD    SGAEDDM D
Sbjct: 648  DSQEDLER---SHNGSRQQDPYQSSGAEDDMYD 677


>ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X2
            [Citrus sinensis]
          Length = 570

 Score =  554 bits (1428), Expect = e-155
 Identities = 282/448 (62%), Positives = 339/448 (75%)
 Frame = -2

Query: 1604 LKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 1425
            +KESQN VM K+ +++SG  GHGS+ GSR+G+++A P LSGERIENRLKKPTTFLCKLKF
Sbjct: 127  MKESQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKF 186

Query: 1424 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSN 1245
            RNELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+ VYN  
Sbjct: 187  RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 246

Query: 1244 TTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSL 1065
            +   P               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+A+ SL
Sbjct: 247  SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 306

Query: 1064 TEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLP 885
            TEKQAKELRE              E+QI+EIEASFEACK RP+HAT+  LQPVEILPLLP
Sbjct: 307  TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 366

Query: 884  DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 705
            DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLA
Sbjct: 367  DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 426

Query: 704  YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 525
            YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK
Sbjct: 427  YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 486

Query: 524  LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVELKESGDYVSSNSKRGRSAIEDG 345
            L LRKKR  EGRSN+EVE +                +ELKE G   SS+SK GR   ++ 
Sbjct: 487  LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGGN-SSSSKMGRVDSQED 545

Query: 344  LETPVPRHKVARVQDMDHYSGAEDDMSD 261
            LE     H  +R QD    SGAEDDM D
Sbjct: 546  LER---SHNGSRHQDPYQSSGAEDDMYD 570


>ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254885 [Solanum
            lycopersicum]
          Length = 698

 Score =  548 bits (1412), Expect = e-153
 Identities = 294/536 (54%), Positives = 354/536 (66%), Gaps = 8/536 (1%)
 Frame = -2

Query: 1844 RRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXXX 1668
            +R+ +R        SGWRES H  H    KQ   +VP +P   +    NA +        
Sbjct: 170  QRNESRHSVEKRRESGWRESRHGNHTARSKQPDHSVPPLPMKKS----NAHSGRVETEEE 225

Query: 1667 XXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 1488
                               Q LKESQN V+ KT +++SG KGHGSI  S + +++ TP L
Sbjct: 226  RRSRKKREIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTTPLL 285

Query: 1487 SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 1308
            SGER ENRLKKPTTFLCKLKFRNELPDPTAQPKLL+L    +++TKY+ITSLEKMHKP+L
Sbjct: 286  SGERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQL 345

Query: 1307 FVEPDIGVPLDLLDICVYNS-NTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKG 1131
             VEPD+G+PLDLLD+ VYN       P               TPIK+DGI++KERPTDKG
Sbjct: 346  HVEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKG 405

Query: 1130 VSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEAC 951
            VSWLVKTQYISPLSTE+AK SLTEKQAKELRET             ++QIQEIEASFEAC
Sbjct: 406  VSWLVKTQYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEAC 465

Query: 950  KSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHES 771
            KSRP+HA++ +LQP+++ PL PDFDRY D FVLA +D  PTADSE YSKLD++VRD  ES
Sbjct: 466  KSRPIHASNRRLQPIKVQPLYPDFDRYKDPFVLANYDSAPTADSETYSKLDKTVRDACES 525

Query: 770  HAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDAD 591
             A+MKSF+   S+  KP+KFLAYMVP+P+EL KD+YDE EDI Y+WVREY WDVRGDDAD
Sbjct: 526  QAVMKSFVATSSDADKPDKFLAYMVPAPNELSKDIYDESEDISYSWVREYHWDVRGDDAD 585

Query: 590  DPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVE 411
            DP TY+V F E EARY+PLPTKL+LRKKR +EG+SNEEVE +                +E
Sbjct: 586  DPNTYVVAFGEREARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIE 645

Query: 410  LKESGDYVS------SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            LKE G Y +      S+SKR R + ED +     +H      D D  SG E  MSD
Sbjct: 646  LKEEGGYTTALKGNVSSSKRSRISHEDDVG---EQHNNMHDDDQDQSSGGEYYMSD 698


>ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528868|gb|ESR40118.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 632

 Score =  533 bits (1372), Expect = e-148
 Identities = 262/403 (65%), Positives = 315/403 (78%)
 Frame = -2

Query: 1604 LKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 1425
            +KESQN VM K+ +++SG  GHGS+VGSR+G+++A P LSGER ENRLKKPTTFLCKLKF
Sbjct: 228  MKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKF 287

Query: 1424 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSN 1245
            RNELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+ VYN  
Sbjct: 288  RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 347

Query: 1244 TTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSL 1065
            +   P               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+A+ SL
Sbjct: 348  SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 407

Query: 1064 TEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLP 885
            TEKQAKELRE              E+QI+EIEASFEACK RP+HAT+  LQPVEILPLLP
Sbjct: 408  TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 467

Query: 884  DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 705
            DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLA
Sbjct: 468  DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 527

Query: 704  YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 525
            YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK
Sbjct: 528  YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 587

Query: 524  LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVELKESG 396
            L LRKKR  EGRSN+EVE +                +ELKE G
Sbjct: 588  LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQG 630


>ref|XP_007024309.1| Hydroxyproline-rich glycoprotein family protein isoform 3 [Theobroma
            cacao] gi|508779675|gb|EOY26931.1| Hydroxyproline-rich
            glycoprotein family protein isoform 3 [Theobroma cacao]
          Length = 662

 Score =  528 bits (1361), Expect = e-147
 Identities = 299/534 (55%), Positives = 347/534 (64%), Gaps = 5/534 (0%)
 Frame = -2

Query: 1847 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXX 1668
            N RS    G RD  GSG RE GHS H    + +   +P       + PN  A        
Sbjct: 168  NERSQG--GNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAGRVETEEE 220

Query: 1667 XXXXXXXXXXXXXXXXXXXQLLKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 1488
                               Q +KESQ     KT +M SG KGHGS+VGSR+G+++ATPFL
Sbjct: 221  RRLRKKREFEKQRQEEKHRQQMKESQ-----KTQMMPSG-KGHGSMVGSRMGDRRATPFL 274

Query: 1487 SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 1308
            SGERIENRLKKPTTFLCKLKF                       TKYTITSLEKM+KPKL
Sbjct: 275  SGERIENRLKKPTTFLCKLKF-----------------------TKYTITSLEKMYKPKL 311

Query: 1307 FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGV 1128
            FVEPD+G+PLDLLD+ VYN  +                   TPIK+DGIRRKERPTDKGV
Sbjct: 312  FVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGV 371

Query: 1127 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACK 948
            SWLVKTQYISPLS E+ K SLTEKQAKELRE              E+QI+EIEASFEA K
Sbjct: 372  SWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIEASFEASK 431

Query: 947  SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 768
             RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SVRD+HES 
Sbjct: 432  LRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESR 491

Query: 767  AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 588
            AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDVRGDDA+D
Sbjct: 492  AIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDAND 551

Query: 587  PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVEL 408
            PTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +                +EL
Sbjct: 552  PTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIEL 611

Query: 407  KESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            KE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S+
Sbjct: 612  KEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLSE 662


>ref|XP_002303312.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550342419|gb|EEE78291.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 569

 Score =  525 bits (1353), Expect = e-146
 Identities = 272/451 (60%), Positives = 332/451 (73%), Gaps = 3/451 (0%)
 Frame = -2

Query: 1604 LKESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 1425
            LKESQN+ + K H++SS  KGHGSIVGSR+G++ ATP L GER ENRLKKPTTF+CKLKF
Sbjct: 123  LKESQNSALLKNHVISS-QKGHGSIVGSRLGDRVATPLLGGERAENRLKKPTTFMCKLKF 181

Query: 1424 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSN 1245
            RNELPDP+AQPKL+ L   K+++TKYTITSLEKM+KP+L+VEPD+G+PLDLLD+ VYN  
Sbjct: 182  RNELPDPSAQPKLMPLKREKDRFTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPP 241

Query: 1244 TTGTPHTXXXXXXXXXXXXATPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSL 1065
            +                   TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+AK+SL
Sbjct: 242  SVRPLLAPEDEELLHDDESVTPVKRDGIKRKERPTDKGVSWLVKTQYISPLSMESAKLSL 301

Query: 1064 TEKQAKELRETXXXXXXXXXXXXXEKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLP 885
            TEKQAKELRE              E+QI+EI+ASF + K  PVHAT+  L+PVEILPLLP
Sbjct: 302  TEKQAKELREMKGGCKLLDNLNKRERQIKEIQASFASNKLPPVHATNKNLKPVEILPLLP 361

Query: 884  DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 705
            DFDRY D+FV  AFDG PTAD+E Y K D S RD +ES AIMK+ + +GS+ A PEKFLA
Sbjct: 362  DFDRYGDKFVTVAFDGAPTADAENYRKFDPSDRDAYESWAIMKACVASGSDPANPEKFLA 421

Query: 704  YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 525
            Y VPSPDEL KD+YDE+EDILY+W+REY WDVRGDD DDP+T+LV+FDE EARYLPLPTK
Sbjct: 422  YTVPSPDELSKDMYDENEDILYSWIREYHWDVRGDDVDDPSTFLVSFDEAEARYLPLPTK 481

Query: 524  LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXVELKESGDYVSS---NSKRGRSAI 354
            + LRKKR +EGRS +E+E +                +E ++SG   +S   NS+  R   
Sbjct: 482  ISLRKKRAREGRSGDEIEHFPIPSRVTVRKRAVAATIEQRDSGAISNSRGNNSRMERFED 541

Query: 353  EDGLETPVPRHKVARVQDMDHYSGAEDDMSD 261
            EDGL       +VA  +D+ H SGAED+MS+
Sbjct: 542  EDGLGR---LQRVALDEDLHHSSGAEDEMSE 569

