BLASTX nr result

ID: Akebia25_contig00003040 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00003040
         (1943 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prun...   597   e-168
gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]     593   e-167
ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cuc...   571   e-160
ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203...   569   e-159
ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family prot...   567   e-159
ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family prot...   567   e-159
ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated fact...   566   e-159
ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phas...   565   e-158
ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated fact...   564   e-158
ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304...   563   e-157
ref|XP_007225143.1| hypothetical protein PRUPE_ppa002485mg [Prun...   553   e-154
ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-...   552   e-154
ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated fact...   547   e-153
ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citr...   547   e-153
ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated fact...   546   e-152
ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated fact...   544   e-152
ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254...   537   e-150
ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citr...   525   e-146
ref|XP_007024309.1| Hydroxyproline-rich glycoprotein family prot...   523   e-145
ref|XP_002303312.2| hydroxyproline-rich glycoprotein [Populus tr...   521   e-145

>ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prunus persica]
            gi|462413813|gb|EMJ18862.1| hypothetical protein
            PRUPE_ppa002145mg [Prunus persica]
          Length = 709

 Score =  597 bits (1539), Expect = e-168
 Identities = 318/533 (59%), Positives = 369/533 (69%), Gaps = 4/533 (0%)
 Frame = +2

Query: 338  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXX 514
            +R SH +  PRD + SG RE GH  HG+P KQ    VP MP   A   P  +        
Sbjct: 185  DRGSHEKGAPRDVSVSGRREHGHLNHGVPQKQHKPPVPSMPVKKANGPPGRVETEEERRL 244

Query: 515  XXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPF 694
                                  LK+SQN+VL  T ++SSG KGHG I GSR+GE++ATPF
Sbjct: 245  RKKREFEKQRQEEKHRQQ----LKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRATPF 299

Query: 695  LSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPK 874
            LSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+SL   K+QYTKYTITSLEK +KPK
Sbjct: 300  LSGERTENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYKPK 359

Query: 875  LFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKG 1054
            LFVEPD+G+PLDLLD+SVYN  +  PP               TP+K +GIRRKERPTDKG
Sbjct: 360  LFVEPDLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDVAATPVKNNGIRRKERPTDKG 419

Query: 1055 VSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEAC 1234
            V+WLVKTQYISPLS D+A+ SLTEKQAKELRE +GG                  ASFEAC
Sbjct: 420  VAWLVKTQYISPLSMDSARQSLTEKQAKELREMKGGRNILDNLNDRERQIKDIEASFEAC 479

Query: 1235 KSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHES 1414
            KSRPVHAT+  L PVEILPLLPDF+RY+DQFVLAAFDG PTADSEIYSKLD+S  D +ES
Sbjct: 480  KSRPVHATNKNLYPVEILPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAYES 539

Query: 1415 HAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDAD 1594
             AIMKS+   G++ A PEKFLAYMVPSP+EL KD YDE ED+ Y+WVREY +DVRGDD  
Sbjct: 540  RAIMKSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDDVH 599

Query: 1595 DPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXE 1774
            DPTTYLV+FDEEEARY PLPTKL+LRKKR KEG++++EVE +                 E
Sbjct: 600  DPTTYLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAAIE 659

Query: 1775 LKESGDYVS---SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
            LK+SGDY     SN K  R  +ED LE P    K+AR QD+D YSGAEDD+SD
Sbjct: 660  LKDSGDYSRGSVSNLKTRRFDVEDTLERP---RKIARHQDIDEYSGAEDDLSD 709


>gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]
          Length = 697

 Score =  593 bits (1529), Expect = e-167
 Identities = 314/544 (57%), Positives = 373/544 (68%), Gaps = 6/544 (1%)
 Frame = +2

Query: 308  NQAKESRFHDNRRSHNREGPRDANGSGWRESGHSKH-GLPPKQKGSAVPMPSANAPRVPN 484
            +Q KE+  H   +  +R   ++  GSG RE G+S H G   KQ     P+PS    +   
Sbjct: 160  SQGKENVHHRGLQERDRGVSKEVAGSGRREHGYSNHHGTHHKQH--KYPVPSVPVKKSNG 217

Query: 485  AIAXXXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGS 664
             +                            HL KESQ++ L  T I+S+  KGHG I GS
Sbjct: 218  PMGRVETEEERRLRKKREFEKQKQEEKHRQHL-KESQHSALQKTQILSAA-KGHGSIAGS 275

Query: 665  RVGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTI 844
            R+GE++AT FLSGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL+S+   K+QY+KYTI
Sbjct: 276  RMGERRATSFLSGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMSMKREKDQYSKYTI 335

Query: 845  TSLEKMHKPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGI 1024
            TSLEK +KPKLFVEPD+G+PL+LLD+SVYN  +  PP               TP+K+DGI
Sbjct: 336  TSLEKTYKPKLFVEPDLGIPLNLLDLSVYNPPSVRPPLDPEDEELLRDDEAVTPVKKDGI 395

Query: 1025 RRKERPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXX 1204
            +RKERPTDKGV+WLVKTQYISPLS ++ K SLTEKQAKELRE +GG              
Sbjct: 396  KRKERPTDKGVAWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNDRDRQI 455

Query: 1205 XXXXASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 1384
                ASFEACKSRPVHAT+  L PVE+LPLLPDFDRYDDQFVLAAFD  PTADSE+YSK+
Sbjct: 456  KEIQASFEACKSRPVHATNKSLYPVEVLPLLPDFDRYDDQFVLAAFDSAPTADSEVYSKM 515

Query: 1385 DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 1564
            D+S+RD HES A++KS+   GS+   PEKFLAYMVPSPDEL KD+YDEHED+ Y+WVREY
Sbjct: 516  DQSIRDAHESQAVLKSYKVTGSDPGNPEKFLAYMVPSPDELSKDIYDEHEDVSYSWVREY 575

Query: 1565 QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 1744
             WDVRGDDADDPTTYLV+FDE EARYLPLPTKL+LRKKR KEGRS +EVE +        
Sbjct: 576  HWDVRGDDADDPTTYLVSFDETEARYLPLPTKLVLRKKRAKEGRSGDEVEHFPVPARVTV 635

Query: 1745 XXXXXXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAE 1909
                     ELK++  Y +     SN KRG S +EDGLE     HKVAR +D+D YSGAE
Sbjct: 636  RRRPTVSVVELKDAEVYSNPRGSLSNFKRGGSDVEDGLER---SHKVARQEDVDEYSGAE 692

Query: 1910 DDMS 1921
            DD+S
Sbjct: 693  DDLS 696


>ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  571 bits (1471), Expect = e-160
 Identities = 309/534 (57%), Positives = 368/534 (68%), Gaps = 5/534 (0%)
 Frame = +2

Query: 338  NRRSHNRE--GPRDAN-GSGWRE-SGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXX 505
            N  +H R+   P+D + G   RE S H KH     QK S  PMP    P+  N  +    
Sbjct: 189  NMGAHERDKGAPKDPSYGRRDRENSNHDKH-----QKHSGPPMP----PKKANGPSGRME 239

Query: 506  XXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKA 685
                                   H LKESQNT+L  T ++S+G K HG IVGSR+GE+KA
Sbjct: 240  TDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKA 298

Query: 686  TPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMH 865
            TPFLSGERIENRLKKPTTFLCKLKFRNELPD +AQPKL+SL   K+ YT+YTITSLEK +
Sbjct: 299  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTY 358

Query: 866  KPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDG-IRRKERP 1042
            KP+L+VEPD+G+PLDLLD+SVYN ++   P               TP+K+DG I+RKERP
Sbjct: 359  KPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERP 418

Query: 1043 TDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXAS 1222
            TDKGV+WLVKTQYISPLS ++AK SLTEKQAKELRE +GG                   S
Sbjct: 419  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETS 478

Query: 1223 FEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRD 1402
            FEACKSRP+HAT+  L PVE+LPLLPDFDRYDD FV+ AFD  PTADSE ++KLD+S+RD
Sbjct: 479  FEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD 538

Query: 1403 DHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRG 1582
             HES AIMKS++  GS+ +KPEKFLAYMVPSPDEL KD+YDE ED+ Y+WVREY WDVRG
Sbjct: 539  AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG 598

Query: 1583 DDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXX 1762
            D+ DDPTTYLV+FD+ EARY+PLPTKL+LRKKR KEGRS++EVE +              
Sbjct: 599  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTV 658

Query: 1763 XXXELKESGDYVSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
               E+K+ G Y  SNSKRG S IEDG+      HK  R QDMD +SGAED+MSD
Sbjct: 659  ATLEVKDPGIY--SNSKRG-SDIEDGIGR---SHKHDRNQDMDQFSGAEDEMSD 706


>ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  569 bits (1467), Expect = e-159
 Identities = 309/534 (57%), Positives = 368/534 (68%), Gaps = 5/534 (0%)
 Frame = +2

Query: 338  NRRSHNREG--PRDAN-GSGWRE-SGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXX 505
            N  +H R+   P+D + G   RE S H KH     QK S  PMP    P+  N  +    
Sbjct: 189  NMGAHERDKGVPKDPSYGRRDRENSNHDKH-----QKHSGPPMP----PKKANGPSGRME 239

Query: 506  XXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKA 685
                                   H LKESQNT+L  T ++S+G K HG IVGSR+GE+KA
Sbjct: 240  TDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKA 298

Query: 686  TPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMH 865
            TPFLSGERIENRLKKPTTFLCKLKFRNELPD +AQPKL+SL   K+ YT+YTITSLEK +
Sbjct: 299  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTY 358

Query: 866  KPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDG-IRRKERP 1042
            KP+L+VEPD+G+PLDLLD+SVYN ++   P               TP+K+DG I+RKERP
Sbjct: 359  KPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERP 418

Query: 1043 TDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXAS 1222
            TDKGV+WLVKTQYISPLS ++AK SLTEKQAKELRE +GG                  AS
Sbjct: 419  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEAS 478

Query: 1223 FEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRD 1402
            FEACKSRP+HAT+  L PVE+LPLLPDFDRYDD FV+ AFD  PTADSE ++KLD+S+RD
Sbjct: 479  FEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD 538

Query: 1403 DHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRG 1582
             HES AIMKS++   S+ +KPEKFLAYMVPSPDEL KD+YDE ED+ Y+WVREY WDVRG
Sbjct: 539  AHESQAIMKSYMATSSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG 598

Query: 1583 DDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXX 1762
            D+ DDPTTYLV+FD+ EARY+PLPTKL+LRKKR KEGRS++EVE +              
Sbjct: 599  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTV 658

Query: 1763 XXXELKESGDYVSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
               E+K+ G Y  SNSKRG S IEDG+      HK  R QDMD +SGAED+MSD
Sbjct: 659  ATLEVKDPGIY--SNSKRG-SDIEDGIGR---SHKHDRHQDMDQFSGAEDEMSD 706


>ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508779674|gb|EOY26930.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 562

 Score =  567 bits (1461), Expect = e-159
 Identities = 307/541 (56%), Positives = 362/541 (66%), Gaps = 5/541 (0%)
 Frame = +2

Query: 317  KESRFHDNRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAX 496
            KES         ++ G RD  GSG RE GHS H    + +   +P       + PN  A 
Sbjct: 36   KESVGDKGLNERSQGGNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAG 90

Query: 497  XXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGE 676
                                        +KESQ T      +M SG KGHG +VGSR+G+
Sbjct: 91   RVETEEERRLRKKREFEKQRQEEKHRQQMKESQKT-----QMMPSG-KGHGSMVGSRMGD 144

Query: 677  KKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLE 856
            ++ATPFLSGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL++L   K+++TKYTITSLE
Sbjct: 145  RRATPFLSGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLE 204

Query: 857  KMHKPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKE 1036
            KM+KPKLFVEPD+G+PLDLLD+SVYN  +  P                TPIK+DGIRRKE
Sbjct: 205  KMYKPKLFVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKE 264

Query: 1037 RPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXX 1216
            RPTDKGVSWLVKTQYISPLS ++ K SLTEKQAKELRE +GG                  
Sbjct: 265  RPTDKGVSWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIE 324

Query: 1217 ASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSV 1396
            ASFEA K RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SV
Sbjct: 325  ASFEASKLRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSV 384

Query: 1397 RDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDV 1576
            RD+HES AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDV
Sbjct: 385  RDEHESRAIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDV 444

Query: 1577 RGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXX 1756
            RGDDA+DPTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +            
Sbjct: 445  RGDDANDPTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRS 504

Query: 1757 XXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMS 1921
                 ELKE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S
Sbjct: 505  TVAAIELKEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLS 561

Query: 1922 D 1924
            +
Sbjct: 562  E 562


>ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508779673|gb|EOY26929.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 685

 Score =  567 bits (1461), Expect = e-159
 Identities = 307/541 (56%), Positives = 362/541 (66%), Gaps = 5/541 (0%)
 Frame = +2

Query: 317  KESRFHDNRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAX 496
            KES         ++ G RD  GSG RE GHS H    + +   +P       + PN  A 
Sbjct: 159  KESVGDKGLNERSQGGNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAG 213

Query: 497  XXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGE 676
                                        +KESQ T      +M SG KGHG +VGSR+G+
Sbjct: 214  RVETEEERRLRKKREFEKQRQEEKHRQQMKESQKT-----QMMPSG-KGHGSMVGSRMGD 267

Query: 677  KKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLE 856
            ++ATPFLSGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL++L   K+++TKYTITSLE
Sbjct: 268  RRATPFLSGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLE 327

Query: 857  KMHKPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKE 1036
            KM+KPKLFVEPD+G+PLDLLD+SVYN  +  P                TPIK+DGIRRKE
Sbjct: 328  KMYKPKLFVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKE 387

Query: 1037 RPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXX 1216
            RPTDKGVSWLVKTQYISPLS ++ K SLTEKQAKELRE +GG                  
Sbjct: 388  RPTDKGVSWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIE 447

Query: 1217 ASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSV 1396
            ASFEA K RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SV
Sbjct: 448  ASFEASKLRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSV 507

Query: 1397 RDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDV 1576
            RD+HES AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDV
Sbjct: 508  RDEHESRAIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDV 567

Query: 1577 RGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXX 1756
            RGDDA+DPTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +            
Sbjct: 568  RGDDANDPTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRS 627

Query: 1757 XXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMS 1921
                 ELKE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S
Sbjct: 628  TVAAIELKEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLS 684

Query: 1922 D 1924
            +
Sbjct: 685  E 685


>ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated factor 1 homolog [Vitis
            vinifera]
          Length = 589

 Score =  567 bits (1460), Expect = e-159
 Identities = 305/535 (57%), Positives = 368/535 (68%), Gaps = 9/535 (1%)
 Frame = +2

Query: 347  SHNRE--GPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXX 520
            SH R+   P+D  G+G RE GHS  G  P  K    P+P A   +  N            
Sbjct: 64   SHGRDKGAPKDLRGAGRREPGHSNQG--PSGKQQKPPVPPAPVKK-SNGPPGRVETEEER 120

Query: 521  XXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVG-SRVGEKKATPFL 697
                              H LKESQNTVL  T ++SSG KGHG +VG SR+GE++ TPFL
Sbjct: 121  RLRKKREFEKQRQEEKQKHQLKESQNTVLQKTQMLSSG-KGHGSVVGGSRMGERRTTPFL 179

Query: 698  SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 877
            SG+RIENRL+KPTTFLCKLKFRNELPDPTAQPKL++L T K+++TKYTITSLEKMHKP+L
Sbjct: 180  SGDRIENRLRKPTTFLCKLKFRNELPDPTAQPKLMALKTDKDRFTKYTITSLEKMHKPQL 239

Query: 878  FVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGV 1057
            FVEPD+G+PLDLLD+SVYN  +   P               TP+K++GI++KERPTDKGV
Sbjct: 240  FVEPDLGIPLDLLDLSVYNPPSVRRPLDPEDEELLRDDESVTPVKKEGIKKKERPTDKGV 299

Query: 1058 SWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACK 1237
            SWLVKTQYISPLST++ K SLTEKQAKELRET+GG                  A+F A K
Sbjct: 300  SWLVKTQYISPLSTESTKQSLTEKQAKELRETKGGRNILENFNSRERKIQNIEAAFAASK 359

Query: 1238 SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 1417
              PVH+T+  L+PVEILPLLPDF RYDD FV+A+FD  PTADSEIYSKLD++VRD HES 
Sbjct: 360  ITPVHSTNKSLKPVEILPLLPDFARYDDSFVVASFDSAPTADSEIYSKLDKTVRDSHESQ 419

Query: 1418 AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 1597
            AI+KS++  GS+ +KPEKFLAYM PSPDEL KD+YDE+ED  Y+WVREY WDVRGDDADD
Sbjct: 420  AILKSYMATGSDPSKPEKFLAYMAPSPDELSKDIYDENEDTSYSWVREYHWDVRGDDADD 479

Query: 1598 PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXEL 1777
            PTTYLV+F++ +ARYLPLPTKL+LRKKR KEGRS++EVE +                 EL
Sbjct: 480  PTTYLVSFNKTDARYLPLPTKLLLRKKRAKEGRSSDEVEHFPVPSKVTVRQRPNVAAIEL 539

Query: 1778 KESGDYVSSNSKRGRSA------IEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
            K+  + V S+SKRG S+      +EDGL      +K  + Q MD  SGAED+MSD
Sbjct: 540  KD--EEVYSSSKRGVSSSKRGVDMEDGLGR---SYKGVQDQHMDQSSGAEDEMSD 589


>ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phaseolus vulgaris]
            gi|561008678|gb|ESW07627.1| hypothetical protein
            PHAVU_010G145300g [Phaseolus vulgaris]
          Length = 661

 Score =  565 bits (1455), Expect = e-158
 Identities = 300/531 (56%), Positives = 356/531 (67%), Gaps = 5/531 (0%)
 Frame = +2

Query: 347  SHNREGPR--DANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXX 520
            +HN E  R  D + SG RE   S HG+  KQ     P+P+      P             
Sbjct: 139  THNNEERRFKDPSTSGRREYDPSNHGIGHKQHKHQPPVPAKKVNGPPGRAETEEEKRLRK 198

Query: 521  XXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLS 700
                                LKESQNTVL  TH++SSG KGHGL+ GSR+GE+++TP LS
Sbjct: 199  KREFEKQRQEEKHRQQ----LKESQNTVLQKTHLLSSG-KGHGLVAGSRMGERRSTPLLS 253

Query: 701  GERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLF 880
             ER+ENRLKKPTTFLCKLKFRNELPDP+AQPKL++    K+QY KYTITSLEKM+KPKLF
Sbjct: 254  AERVENRLKKPTTFLCKLKFRNELPDPSAQPKLMAFKKDKDQYAKYTITSLEKMYKPKLF 313

Query: 881  VEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVS 1060
            VEPD+G+PLDLLD+SVYN  +  PP               TPIK+DGI+RKERPTDKGV+
Sbjct: 314  VEPDLGIPLDLLDLSVYNPPSVRPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVA 373

Query: 1061 WLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKS 1240
            WLVKTQYISPLS ++ K SLTEKQAKELRE +GG                  ASFEA KS
Sbjct: 374  WLVKTQYISPLSMESTKQSLTEKQAKELREMKGGRGVLDNLNSRERQIREIEASFEAAKS 433

Query: 1241 RPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHA 1420
             PVHAT+  L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+Y+KLD+SVRD  ES A
Sbjct: 434  DPVHATNKDLYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKLDKSVRDAFESKA 493

Query: 1421 IMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDP 1600
            +MKS++   S+ A PEKFLAYM P+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDP
Sbjct: 494  VMKSYVATSSDPANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDP 553

Query: 1601 TTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELK 1780
            TT+ V FD+ EARYLPLPTKL+LRKKR KEGRS EE+EQ                  E K
Sbjct: 554  TTFFVAFDDSEARYLPLPTKLVLRKKRAKEGRSGEEIEQCPVPSRVTVRRRSSVAAIERK 613

Query: 1781 ESGDYVSS---NSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
            ++G Y SS   +SKR R  ++DGLE     H+ A  QD    SGAED MS+
Sbjct: 614  DTGVYTSSRGNSSKRSRLEMDDGLE---HHHRGAPHQDNYQSSGAEDYMSE 661


>ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Glycine max] gi|571464391|ref|XP_006583049.1| PREDICTED:
            RNA polymerase II-associated factor 1 homolog isoform X2
            [Glycine max]
          Length = 659

 Score =  564 bits (1454), Expect = e-158
 Identities = 297/522 (56%), Positives = 355/522 (68%), Gaps = 3/522 (0%)
 Frame = +2

Query: 368  RDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXXXXXXXXXXX 547
            ++ + SG RE  HS HG+  KQ     P+P     ++ N                     
Sbjct: 145  KEPSTSGRREYEHSNHGIAHKQHKQQPPVP---VKKMNNGPPGRAETDEEKRLRKKREFE 201

Query: 548  XXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLK 727
                       LKESQNTVL  TH++SSG KGHG+I GSR+GE+++TP L  ER+ENRLK
Sbjct: 202  KQRQEEKHRQQLKESQNTVLQKTHMLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRLK 260

Query: 728  KPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPL 907
            KPTTFLCKLKFRNELPDP+AQPKL++    K+QY KYTITSLEKM+KPKLFVEPD+G+PL
Sbjct: 261  KPTTFLCKLKFRNELPDPSAQPKLMASKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPL 320

Query: 908  DLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYIS 1087
            DLLD+SVYN  +  PP               TPIK+DGI+RKERPTDKGV+WLVKTQYIS
Sbjct: 321  DLLDLSVYNPPSVRPPLAPEDKELLRDDEAVTPIKKDGIKRKERPTDKGVAWLVKTQYIS 380

Query: 1088 PLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDK 1267
            PLS ++ K SLTEKQAKELRE +GG                  ASFEA KS PVHAT+  
Sbjct: 381  PLSMESTKQSLTEKQAKELREMKGGRGILDNLNSRERQIREIEASFEAAKSDPVHATNKD 440

Query: 1268 LQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAG 1447
            L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+++K+D+SVRD  ES A+MKS++   
Sbjct: 441  LYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMHAKMDKSVRDAFESKAVMKSYVATS 500

Query: 1448 SEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDE 1627
            S+ A PEKFLAYMVP+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDP T+LV FDE
Sbjct: 501  SDPANPEKFLAYMVPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPATFLVAFDE 560

Query: 1628 EEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSS- 1804
             EARYLPLPTKL+LRKKR KEGRS +EVEQ                  E K+SG Y SS 
Sbjct: 561  SEARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSK 620

Query: 1805 --NSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
              +SKRG   ++DGLE    +H+ A  QD    SGAED MSD
Sbjct: 621  GNSSKRGGLEMDDGLE---DQHRGAPHQDNYQSSGAEDYMSD 659


>ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304396 [Fragaria vesca
            subsp. vesca]
          Length = 693

 Score =  563 bits (1450), Expect = e-157
 Identities = 309/543 (56%), Positives = 359/543 (66%), Gaps = 5/543 (0%)
 Frame = +2

Query: 311  QAKESRFHDNRRSHNREGPRDANGSGWRESGHSKH-GLPPKQKGSAVPMPSANAPRVPNA 487
            +++ES F  ++  H++   +D   S  RE GHS H G+PPK K      P     +  N 
Sbjct: 163  KSRESGF--DKGPHDKGASKDVGASAKREHGHSNHHGVPPKHK------PPVPLVKKSNG 214

Query: 488  IAXXXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSR 667
                                            KESQN+VL  TH+MSSG KGHG I GSR
Sbjct: 215  APGRVETEEERRLRKKREFEKQRQEEKHRQQAKESQNSVLQKTHLMSSG-KGHGSIAGSR 273

Query: 668  VGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTIT 847
            +GE++ TPFLSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+S+    +QYTKYTIT
Sbjct: 274  MGERRTTPFLSGERAENRLKKPTTFVCKLKFRNELPDPSAQPKLMSMKKDPDQYTKYTIT 333

Query: 848  SLEKMHKPKLFVEPDIGVPLDLLDISVYNSNT-TGPPHTXXXXXXXXXXXXXTPIKQDGI 1024
            SLEK +KPKLFVEPD+G+PLDLLD+SVYN      PP               TP+K+DGI
Sbjct: 334  SLEKNYKPKLFVEPDLGIPLDLLDLSVYNPPPGPRPPLAPEDEELLRDDVAVTPVKKDGI 393

Query: 1025 RRKERPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXX 1204
            RRKERPTDKGV+WLVKTQYISPLS D+AK SLTEKQAKELRE +GG              
Sbjct: 394  RRKERPTDKGVAWLVKTQYISPLSMDSAKQSLTEKQAKELREMKGGRNLLDNLNDRERQI 453

Query: 1205 XXXXASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 1384
                ASFEACKSRPVHAT+  L PVE+LPLLP  +RY+DQFVLA FDG PTADSEIYSKL
Sbjct: 454  KEIEASFEACKSRPVHATNKNLYPVEVLPLLPXHNRYEDQFVLAGFDGAPTADSEIYSKL 513

Query: 1385 DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 1564
            D+S  D  ES AIMKS+   G++ A P+KFLAYMVPSP+EL KD YDE EDI Y+WVREY
Sbjct: 514  DQSDHDLCESRAIMKSYKVTGADPANPDKFLAYMVPSPNELSKDPYDESEDISYSWVREY 573

Query: 1565 QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 1744
            Q+DVRGDD DD TTYLV+FDE+ ARY PLP KL+LRKKR KEGRS +EVE +        
Sbjct: 574  QYDVRGDDVDDLTTYLVSFDEDAARYAPLPAKLVLRKKRAKEGRSTDEVEHFPAPSRVTV 633

Query: 1745 XXXXXXXXXELKESGDY---VSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDD 1915
                     ELK++GDY     SN KR     ED LE P    K  R QD+D YSGAEDD
Sbjct: 634  RRRSTVSAIELKDAGDYSRGALSNLKRRGFDNEDALERP---QKRGRHQDVDEYSGAEDD 690

Query: 1916 MSD 1924
            +SD
Sbjct: 691  LSD 693


>ref|XP_007225143.1| hypothetical protein PRUPE_ppa002485mg [Prunus persica]
            gi|462422079|gb|EMJ26342.1| hypothetical protein
            PRUPE_ppa002485mg [Prunus persica]
          Length = 668

 Score =  553 bits (1424), Expect = e-154
 Identities = 302/535 (56%), Positives = 355/535 (66%), Gaps = 6/535 (1%)
 Frame = +2

Query: 338  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP---MPSANAPRVPNAIAXXXXX 508
            +R SH +   R+ + SG  E GH  HG+P KQ    VP   +  AN P  P  +      
Sbjct: 160  DRGSHEKVASREVSVSGRGEHGHLNHGVPQKQHKPPVPSMQVKKANGP--PGRVETEEER 217

Query: 509  XXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKAT 688
                                    LK+SQN+VL  T ++SSG KGHG I GSR+GE++AT
Sbjct: 218  RLRKKREFEKQRQEEKHRQQ----LKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRAT 272

Query: 689  PFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHK 868
            PFLSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+SL   K+QYTKYTITSLEK +K
Sbjct: 273  PFLSGERTENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYK 332

Query: 869  PKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTD 1048
            PKLFVEPD+G+PLDLLD+SVYN  +  PP               TP+K++GI+RKERPTD
Sbjct: 333  PKLFVEPDLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDVAATPVKKNGIKRKERPTD 392

Query: 1049 KGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFE 1228
            KGV+WL                SLTEKQAKELRE +GG                  ASFE
Sbjct: 393  KGVAWL----------------SLTEKQAKELREMKGGRNILDNLNDRERQIKEIEASFE 436

Query: 1229 ACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDH 1408
            ACKSRPVHAT+  L PVE+LPLLPDF+RY+DQFVLAAFDG PTADSEIYSKLD+S  D +
Sbjct: 437  ACKSRPVHATNKDLYPVEVLPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAY 496

Query: 1409 ESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDD 1588
            ES AIMKS+   G++ A PEKFLAYMVPSP+EL KD YDE ED+ Y+WVREY +DVRGDD
Sbjct: 497  ESRAIMKSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDD 556

Query: 1589 ADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXX 1768
              DPTTYLV+FDEEEARY PLPTKL+LRKKR KEG++++EVE +                
Sbjct: 557  VHDPTTYLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAA 616

Query: 1769 XELKESGDYVS---SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
             ELK+SGDY     SN K  R  IED LE P    K+AR QD+D YSGAEDD+SD
Sbjct: 617  IELKDSGDYSRGSVSNLKTRRFDIEDTLERP---RKIARHQDIDEYSGAEDDLSD 668


>ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-like isoform X1 [Glycine
            max] gi|571472317|ref|XP_006585570.1| PREDICTED:
            bromodomain-containing protein 4-like isoform X2 [Glycine
            max]
          Length = 666

 Score =  552 bits (1422), Expect = e-154
 Identities = 296/523 (56%), Positives = 351/523 (67%), Gaps = 4/523 (0%)
 Frame = +2

Query: 368  RDANGSGWRESGHSKHGLPPKQ-KGSAVPMPSANAPRVPNAIAXXXXXXXXXXXXXXXXX 544
            ++ + SG RE  HS HG+  KQ K    P+P     ++ N                    
Sbjct: 152  KEPSKSGRREYEHSNHGIAHKQHKQQQPPLP---VKKMNNGPPGRAETDEEKRLRKKREF 208

Query: 545  XXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRL 724
                        LKESQNTVL  TH++SSG KGHG+I GSR+GE+++TP L  ER+ENRL
Sbjct: 209  EKQRQEEKHRQQLKESQNTVLQKTHLLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRL 267

Query: 725  KKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVP 904
            KKPTTFLCKLKFRNELPDP+AQPKL+S    K+QY KYTITSLEKM+KPKLFVEPD+G+P
Sbjct: 268  KKPTTFLCKLKFRNELPDPSAQPKLMSFKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIP 327

Query: 905  LDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYI 1084
            LDLLD+SVYN     PP               TPIK+DGI+RKERPTDKGV+WLVKTQYI
Sbjct: 328  LDLLDLSVYNPPRVRPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVAWLVKTQYI 387

Query: 1085 SPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSD 1264
            SPLS ++ K SLTEKQAKELRE +G                   ASFEA KS PVHAT+ 
Sbjct: 388  SPLSMESTKQSLTEKQAKELREMKG-RGILDNLNSRERQIREIQASFEAAKSDPVHATNK 446

Query: 1265 KLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGA 1444
             L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+Y+K+++SVRD  ES A+MKS++  
Sbjct: 447  DLYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKMNKSVRDAFESKAVMKSYVAT 506

Query: 1445 GSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFD 1624
            G + A PEKFLAYM P+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDPTT+LV FD
Sbjct: 507  GLDPANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPTTFLVAFD 566

Query: 1625 EEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSS 1804
            E EARYLPLPTKL+LRKKR KEGRS +EVEQ                  E K+SG Y SS
Sbjct: 567  ESEARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSS 626

Query: 1805 NS---KRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
                 KR    ++DGLE    +H+ A  QD    SGAED MSD
Sbjct: 627  KGNSFKRVGLEMDDGLE---DQHRGAPHQDNYQSSGAEDYMSD 666


>ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Citrus sinensis]
          Length = 576

 Score =  547 bits (1410), Expect = e-153
 Identities = 277/453 (61%), Positives = 334/453 (73%), Gaps = 5/453 (1%)
 Frame = +2

Query: 581  LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760
            +KESQN V+  + +++SG  GHG + GSR+G+++A P LSGERIENRLKKPTTFLCKLKF
Sbjct: 127  MKESQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKF 186

Query: 761  RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940
            RNELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+SVYN  
Sbjct: 187  RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 246

Query: 941  TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120
            +  PP               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++A+ SL
Sbjct: 247  SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 306

Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300
            TEKQAKELRE +GG                  ASFEACK RP+HAT+  LQPVEILPLLP
Sbjct: 307  TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 366

Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480
            DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLA
Sbjct: 367  DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 426

Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660
            YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK
Sbjct: 427  YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 486

Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYV-----SSNSKRGRS 1825
            L LRKKR  EGRSN+EVE +                 ELKE G Y      SS+SK GR 
Sbjct: 487  LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRV 546

Query: 1826 AIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
              ++ LE     H  +R QD    SGAEDDM D
Sbjct: 547  DSQEDLER---SHNGSRHQDPYQSSGAEDDMYD 576


>ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528867|gb|ESR40117.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 677

 Score =  547 bits (1410), Expect = e-153
 Identities = 277/453 (61%), Positives = 334/453 (73%), Gaps = 5/453 (1%)
 Frame = +2

Query: 581  LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760
            +KESQN V+  + +++SG  GHG +VGSR+G+++A P LSGER ENRLKKPTTFLCKLKF
Sbjct: 228  MKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKF 287

Query: 761  RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940
            RNELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+SVYN  
Sbjct: 288  RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 347

Query: 941  TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120
            +  PP               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++A+ SL
Sbjct: 348  SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 407

Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300
            TEKQAKELRE +GG                  ASFEACK RP+HAT+  LQPVEILPLLP
Sbjct: 408  TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 467

Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480
            DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLA
Sbjct: 468  DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 527

Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660
            YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK
Sbjct: 528  YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 587

Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYV-----SSNSKRGRS 1825
            L LRKKR  EGRSN+EVE +                 ELKE G Y      SS+SK GR 
Sbjct: 588  LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRV 647

Query: 1826 AIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
              ++ LE     H  +R QD    SGAEDDM D
Sbjct: 648  DSQEDLER---SHNGSRQQDPYQSSGAEDDMYD 677


>ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X2
            [Citrus sinensis]
          Length = 570

 Score =  546 bits (1408), Expect = e-152
 Identities = 276/448 (61%), Positives = 333/448 (74%)
 Frame = +2

Query: 581  LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760
            +KESQN V+  + +++SG  GHG + GSR+G+++A P LSGERIENRLKKPTTFLCKLKF
Sbjct: 127  MKESQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKF 186

Query: 761  RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940
            RNELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+SVYN  
Sbjct: 187  RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 246

Query: 941  TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120
            +  PP               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++A+ SL
Sbjct: 247  SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 306

Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300
            TEKQAKELRE +GG                  ASFEACK RP+HAT+  LQPVEILPLLP
Sbjct: 307  TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 366

Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480
            DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLA
Sbjct: 367  DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 426

Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660
            YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK
Sbjct: 427  YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 486

Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSSNSKRGRSAIEDG 1840
            L LRKKR  EGRSN+EVE +                 ELKE G   SS+SK GR   ++ 
Sbjct: 487  LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGGN-SSSSKMGRVDSQED 545

Query: 1841 LETPVPRHKVARVQDMDHYSGAEDDMSD 1924
            LE     H  +R QD    SGAEDDM D
Sbjct: 546  LER---SHNGSRHQDPYQSSGAEDDMYD 570


>ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Solanum
            tuberosum]
          Length = 700

 Score =  544 bits (1402), Expect = e-152
 Identities = 291/537 (54%), Positives = 350/537 (65%), Gaps = 8/537 (1%)
 Frame = +2

Query: 338  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXX 514
            ++R+ +R        SGWRESGH  H    KQ G +VP MP   +    NA +       
Sbjct: 171  DQRNESRPSAEKRRESGWRESGHGNHTARSKQPGHSVPPMPVKKS----NAPSGRVETEE 226

Query: 515  XXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPF 694
                                  LKESQN VL  T +++SG KGHG I  S + +++  P 
Sbjct: 227  ERRLRKKREIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTAPL 286

Query: 695  LSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPK 874
            LSGER ENRLKKPTTFLCKLKFRNELPDPTAQPKLL+L    +++TKY+ITSLEKMHKP+
Sbjct: 287  LSGERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQ 346

Query: 875  LFVEPDIGVPLDLLDISVYNS-NTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDK 1051
            L+VEPD+G+PLDLLD+SVYN       P               TPIK+DGI++KERPTDK
Sbjct: 347  LYVEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDK 406

Query: 1052 GVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEA 1231
            GVSWLVKTQYISPLST++AK SLTEKQAKELRET+GG                  ASFEA
Sbjct: 407  GVSWLVKTQYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEA 466

Query: 1232 CKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHE 1411
            CKSRP+HAT+ +LQPV++ PL PDFDRY D FVLA +D  PTADSE Y+KLD++VRD  E
Sbjct: 467  CKSRPIHATNRRLQPVKVQPLYPDFDRYKDPFVLANYDSAPTADSETYNKLDKTVRDACE 526

Query: 1412 SHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDA 1591
            S A+MKSF+   S+  KP+KFLAYMVP+P+EL KD+YDE+EDI Y+WVREY WDVRGDDA
Sbjct: 527  SQAVMKSFVATSSDADKPDKFLAYMVPAPNELSKDMYDENEDISYSWVREYHWDVRGDDA 586

Query: 1592 DDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXX 1771
            DDP TY+V F E EARY+PLPTKL+LRKKR +EG+SNEEVE +                 
Sbjct: 587  DDPNTYVVAFGETEARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAI 646

Query: 1772 ELKESGDYVS------SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
            ELKE G Y +      S+SKR R + ED +     +H      D D  SG E  MSD
Sbjct: 647  ELKEEGGYTTALKGNVSSSKRSRISHEDDVG---EQHNNMHDDDQDQSSGGEYYMSD 700


>ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254885 [Solanum
            lycopersicum]
          Length = 698

 Score =  537 bits (1384), Expect = e-150
 Identities = 288/536 (53%), Positives = 346/536 (64%), Gaps = 8/536 (1%)
 Frame = +2

Query: 341  RRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXXX 517
            +R+ +R        SGWRES H  H    KQ   +VP +P   +    NA +        
Sbjct: 170  QRNESRHSVEKRRESGWRESRHGNHTARSKQPDHSVPPLPMKKS----NAHSGRVETEEE 225

Query: 518  XXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFL 697
                                 LKESQN VL  T +++SG KGHG I  S + +++ TP L
Sbjct: 226  RRSRKKREIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTTPLL 285

Query: 698  SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 877
            SGER ENRLKKPTTFLCKLKFRNELPDPTAQPKLL+L    +++TKY+ITSLEKMHKP+L
Sbjct: 286  SGERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQL 345

Query: 878  FVEPDIGVPLDLLDISVYNS-NTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKG 1054
             VEPD+G+PLDLLD+SVYN       P               TPIK+DGI++KERPTDKG
Sbjct: 346  HVEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKG 405

Query: 1055 VSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEAC 1234
            VSWLVKTQYISPLST++AK SLTEKQAKELRET+GG                  ASFEAC
Sbjct: 406  VSWLVKTQYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEAC 465

Query: 1235 KSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHES 1414
            KSRP+HA++ +LQP+++ PL PDFDRY D FVLA +D  PTADSE YSKLD++VRD  ES
Sbjct: 466  KSRPIHASNRRLQPIKVQPLYPDFDRYKDPFVLANYDSAPTADSETYSKLDKTVRDACES 525

Query: 1415 HAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDAD 1594
             A+MKSF+   S+  KP+KFLAYMVP+P+EL KD+YDE EDI Y+WVREY WDVRGDDAD
Sbjct: 526  QAVMKSFVATSSDADKPDKFLAYMVPAPNELSKDIYDESEDISYSWVREYHWDVRGDDAD 585

Query: 1595 DPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXE 1774
            DP TY+V F E EARY+PLPTKL+LRKKR +EG+SNEEVE +                 E
Sbjct: 586  DPNTYVVAFGEREARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIE 645

Query: 1775 LKESGDYVS------SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
            LKE G Y +      S+SKR R + ED +     +H      D D  SG E  MSD
Sbjct: 646  LKEEGGYTTALKGNVSSSKRSRISHEDDVG---EQHNNMHDDDQDQSSGGEYYMSD 698


>ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528868|gb|ESR40118.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 632

 Score =  525 bits (1352), Expect = e-146
 Identities = 256/403 (63%), Positives = 309/403 (76%)
 Frame = +2

Query: 581  LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760
            +KESQN V+  + +++SG  GHG +VGSR+G+++A P LSGER ENRLKKPTTFLCKLKF
Sbjct: 228  MKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKF 287

Query: 761  RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940
            RNELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+SVYN  
Sbjct: 288  RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 347

Query: 941  TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120
            +  PP               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++A+ SL
Sbjct: 348  SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 407

Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300
            TEKQAKELRE +GG                  ASFEACK RP+HAT+  LQPVEILPLLP
Sbjct: 408  TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 467

Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480
            DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLA
Sbjct: 468  DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 527

Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660
            YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK
Sbjct: 528  YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 587

Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESG 1789
            L LRKKR  EGRSN+EVE +                 ELKE G
Sbjct: 588  LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQG 630


>ref|XP_007024309.1| Hydroxyproline-rich glycoprotein family protein isoform 3 [Theobroma
            cacao] gi|508779675|gb|EOY26931.1| Hydroxyproline-rich
            glycoprotein family protein isoform 3 [Theobroma cacao]
          Length = 662

 Score =  523 bits (1347), Expect = e-145
 Identities = 293/541 (54%), Positives = 342/541 (63%), Gaps = 5/541 (0%)
 Frame = +2

Query: 317  KESRFHDNRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAX 496
            KES         ++ G RD  GSG RE GHS H    + +   +P       + PN  A 
Sbjct: 159  KESVGDKGLNERSQGGNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAG 213

Query: 497  XXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGE 676
                                        +KESQ T      +M SG KGHG +VGSR+G+
Sbjct: 214  RVETEEERRLRKKREFEKQRQEEKHRQQMKESQKT-----QMMPSG-KGHGSMVGSRMGD 267

Query: 677  KKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLE 856
            ++ATPFLSGERIENRLKKPTTFLCKLKF                       TKYTITSLE
Sbjct: 268  RRATPFLSGERIENRLKKPTTFLCKLKF-----------------------TKYTITSLE 304

Query: 857  KMHKPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKE 1036
            KM+KPKLFVEPD+G+PLDLLD+SVYN  +  P                TPIK+DGIRRKE
Sbjct: 305  KMYKPKLFVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKE 364

Query: 1037 RPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXX 1216
            RPTDKGVSWLVKTQYISPLS ++ K SLTEKQAKELRE +GG                  
Sbjct: 365  RPTDKGVSWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIE 424

Query: 1217 ASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSV 1396
            ASFEA K RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SV
Sbjct: 425  ASFEASKLRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSV 484

Query: 1397 RDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDV 1576
            RD+HES AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDV
Sbjct: 485  RDEHESRAIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDV 544

Query: 1577 RGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXX 1756
            RGDDA+DPTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +            
Sbjct: 545  RGDDANDPTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRS 604

Query: 1757 XXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMS 1921
                 ELKE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S
Sbjct: 605  TVAAIELKEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLS 661

Query: 1922 D 1924
            +
Sbjct: 662  E 662


>ref|XP_002303312.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550342419|gb|EEE78291.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 569

 Score =  521 bits (1341), Expect = e-145
 Identities = 269/451 (59%), Positives = 326/451 (72%), Gaps = 3/451 (0%)
 Frame = +2

Query: 581  LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760
            LKESQN+ L   H++SS  KGHG IVGSR+G++ ATP L GER ENRLKKPTTF+CKLKF
Sbjct: 123  LKESQNSALLKNHVISS-QKGHGSIVGSRLGDRVATPLLGGERAENRLKKPTTFMCKLKF 181

Query: 761  RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940
            RNELPDP+AQPKL+ L   K+++TKYTITSLEKM+KP+L+VEPD+G+PLDLLD+SVYN  
Sbjct: 182  RNELPDPSAQPKLMPLKREKDRFTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPP 241

Query: 941  TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120
            +  P                TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++AK+SL
Sbjct: 242  SVRPLLAPEDEELLHDDESVTPVKRDGIKRKERPTDKGVSWLVKTQYISPLSMESAKLSL 301

Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300
            TEKQAKELRE +GG                  ASF + K  PVHAT+  L+PVEILPLLP
Sbjct: 302  TEKQAKELREMKGGCKLLDNLNKRERQIKEIQASFASNKLPPVHATNKNLKPVEILPLLP 361

Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480
            DFDRY D+FV  AFDG PTAD+E Y K D S RD +ES AIMK+ + +GS+ A PEKFLA
Sbjct: 362  DFDRYGDKFVTVAFDGAPTADAENYRKFDPSDRDAYESWAIMKACVASGSDPANPEKFLA 421

Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660
            Y VPSPDEL KD+YDE+EDILY+W+REY WDVRGDD DDP+T+LV+FDE EARYLPLPTK
Sbjct: 422  YTVPSPDELSKDMYDENEDILYSWIREYHWDVRGDDVDDPSTFLVSFDEAEARYLPLPTK 481

Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSS---NSKRGRSAI 1831
            + LRKKR +EGRS +E+E +                 E ++SG   +S   NS+  R   
Sbjct: 482  ISLRKKRAREGRSGDEIEHFPIPSRVTVRKRAVAATIEQRDSGAISNSRGNNSRMERFED 541

Query: 1832 EDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924
            EDGL       +VA  +D+ H SGAED+MS+
Sbjct: 542  EDGLGR---LQRVALDEDLHHSSGAEDEMSE 569

