BLASTX nr result

ID: Akebia23_contig00004594 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00004594
         (2271 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prun...   596   e-167
gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]     591   e-166
ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cuc...   573   e-160
ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family prot...   572   e-160
ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family prot...   572   e-160
ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203...   571   e-160
ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated fact...   568   e-159
ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304...   567   e-159
ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated fact...   566   e-158
ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phas...   563   e-158
ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-...   556   e-155
ref|XP_007225143.1| hypothetical protein PRUPE_ppa002485mg [Prun...   553   e-154
ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated fact...   553   e-154
ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated fact...   552   e-154
ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citr...   552   e-154
ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated fact...   551   e-154
ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254...   544   e-152
ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citr...   530   e-147
ref|XP_007024309.1| Hydroxyproline-rich glycoprotein family prot...   528   e-147
emb|CBI36059.3| unnamed protein product [Vitis vinifera]              522   e-145

>ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prunus persica]
            gi|462413813|gb|EMJ18862.1| hypothetical protein
            PRUPE_ppa002145mg [Prunus persica]
          Length = 709

 Score =  596 bits (1537), Expect = e-167
 Identities = 317/533 (59%), Positives = 372/533 (69%), Gaps = 4/533 (0%)
 Frame = +2

Query: 404  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPLGNASRVPNGLAXXXXXXX 580
            +R SH +  PRD + SG RE GH  HG+P KQ    VP MP+  A    NG         
Sbjct: 185  DRGSHEKGAPRDVSVSGRREHGHLNHGVPQKQHKPPVPSMPVKKA----NGPPGRVETEE 240

Query: 581  XXXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPF 760
                                    +SQN+V+ KT ++SSG KGHGSI GSR+GE++ATPF
Sbjct: 241  ERRLRKKREFEKQRQEEKHRQQLKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRATPF 299

Query: 761  LSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPK 940
            LSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+SL   K+QYTKYTITSLEK +KPK
Sbjct: 300  LSGERTENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYKPK 359

Query: 941  LFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKG 1120
            LFVEPD+G+PLDLLD+ VYN  +   P               TP+K +GIRRKERPTDKG
Sbjct: 360  LFVEPDLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDVAATPVKNNGIRRKERPTDKG 419

Query: 1121 VSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEAC 1300
            V+WLVKTQYISPLS ++A+ SLTEKQAKELRE               +QI++IEASFEAC
Sbjct: 420  VAWLVKTQYISPLSMDSARQSLTEKQAKELREMKGGRNILDNLNDRERQIKDIEASFEAC 479

Query: 1301 KSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHES 1480
            KSRPVHAT+  L PVEILPLLPDF+RY+DQFVLAAFDG PTADSEIYSKLD+S  D +ES
Sbjct: 480  KSRPVHATNKNLYPVEILPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAYES 539

Query: 1481 HAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDAD 1660
             AIMKS+   G++ A PEKFLAYMVPSP+EL KD YDE ED+ Y+WVREY +DVRGDD  
Sbjct: 540  RAIMKSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDDVH 599

Query: 1661 DPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXE 1840
            DPTTYLV+FDEEEARY PLPTKL+LRKKR KEG++++EVE +                 E
Sbjct: 600  DPTTYLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAAIE 659

Query: 1841 LKESGDYVS---SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            LK+SGDY     SN K  R  +ED LE P    K+AR QD+D YSGAEDD+SD
Sbjct: 660  LKDSGDYSRGSVSNLKTRRFDVEDTLERP---RKIARHQDIDEYSGAEDDLSD 709


>gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]
          Length = 697

 Score =  591 bits (1524), Expect = e-166
 Identities = 314/544 (57%), Positives = 373/544 (68%), Gaps = 6/544 (1%)
 Frame = +2

Query: 374  NQAEESRFHDNRRSHNREGPRDANGSGWRESGHSKH-GLPPKQKGSAVPMPLGNASRVPN 550
            +Q +E+  H   +  +R   ++  GSG RE G+S H G   KQ    VP      S  P 
Sbjct: 160  SQGKENVHHRGLQERDRGVSKEVAGSGRREHGYSNHHGTHHKQHKYPVPSVPVKKSNGPM 219

Query: 551  GLAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGS 730
            G                                 ESQ++ + KT I+S+  KGHGSI GS
Sbjct: 220  GRVETEEERRLRKKREFEKQKQEEKHRQHLK---ESQHSALQKTQILSAA-KGHGSIAGS 275

Query: 731  RVGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTI 910
            R+GE++AT FLSGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL+S+   K+QY+KYTI
Sbjct: 276  RMGERRATSFLSGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMSMKREKDQYSKYTI 335

Query: 911  TSLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGI 1090
            TSLEK +KPKLFVEPD+G+PL+LLD+ VYN  +   P               TP+K+DGI
Sbjct: 336  TSLEKTYKPKLFVEPDLGIPLNLLDLSVYNPPSVRPPLDPEDEELLRDDEAVTPVKKDGI 395

Query: 1091 RRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQI 1270
            +RKERPTDKGV+WLVKTQYISPLS E+ K SLTEKQAKELRE               +QI
Sbjct: 396  KRKERPTDKGVAWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNDRDRQI 455

Query: 1271 QEIEASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 1450
            +EI+ASFEACKSRPVHAT+  L PVE+LPLLPDFDRYDDQFVLAAFD  PTADSE+YSK+
Sbjct: 456  KEIQASFEACKSRPVHATNKSLYPVEVLPLLPDFDRYDDQFVLAAFDSAPTADSEVYSKM 515

Query: 1451 DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 1630
            D+S+RD HES A++KS+   GS+   PEKFLAYMVPSPDEL KD+YDEHED+ Y+WVREY
Sbjct: 516  DQSIRDAHESQAVLKSYKVTGSDPGNPEKFLAYMVPSPDELSKDIYDEHEDVSYSWVREY 575

Query: 1631 QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 1810
             WDVRGDDADDPTTYLV+FDE EARYLPLPTKL+LRKKR KEGRS +EVE +        
Sbjct: 576  HWDVRGDDADDPTTYLVSFDETEARYLPLPTKLVLRKKRAKEGRSGDEVEHFPVPARVTV 635

Query: 1811 XXXXXXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAE 1975
                     ELK++  Y +     SN KRG S +EDGLE     HKVAR +D+D YSGAE
Sbjct: 636  RRRPTVSVVELKDAEVYSNPRGSLSNFKRGGSDVEDGLER---SHKVARQEDVDEYSGAE 692

Query: 1976 DDMS 1987
            DD+S
Sbjct: 693  DDLS 696


>ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  573 bits (1476), Expect = e-160
 Identities = 311/534 (58%), Positives = 370/534 (69%), Gaps = 5/534 (0%)
 Frame = +2

Query: 404  NRRSHNRE--GPRDAN-GSGWRE-SGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXX 571
            N  +H R+   P+D + G   RE S H KH     QK S  PMP   A    NG +    
Sbjct: 189  NMGAHERDKGAPKDPSYGRRDRENSNHDKH-----QKHSGPPMPPKKA----NGPSGRME 239

Query: 572  XXXXXXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKA 751
                                       ESQNT++ KT ++S+G K HGSIVGSR+GE+KA
Sbjct: 240  TDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKA 298

Query: 752  TPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMH 931
            TPFLSGERIENRLKKPTTFLCKLKFRNELPD +AQPKL+SL   K+ YT+YTITSLEK +
Sbjct: 299  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTY 358

Query: 932  KPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDG-IRRKERP 1108
            KP+L+VEPD+G+PLDLLD+ VYN ++   P               TP+K+DG I+RKERP
Sbjct: 359  KPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERP 418

Query: 1109 TDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEAS 1288
            TDKGV+WLVKTQYISPLS E+AK SLTEKQAKELRE               +QI+EIE S
Sbjct: 419  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETS 478

Query: 1289 FEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRD 1468
            FEACKSRP+HAT+  L PVE+LPLLPDFDRYDD FV+ AFD  PTADSE ++KLD+S+RD
Sbjct: 479  FEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD 538

Query: 1469 DHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRG 1648
             HES AIMKS++  GS+ +KPEKFLAYMVPSPDEL KD+YDE ED+ Y+WVREY WDVRG
Sbjct: 539  AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG 598

Query: 1649 DDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXX 1828
            D+ DDPTTYLV+FD+ EARY+PLPTKL+LRKKR KEGRS++EVE +              
Sbjct: 599  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTV 658

Query: 1829 XXXELKESGDYVSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
               E+K+ G Y  SNSKRG S IEDG+      HK  R QDMD +SGAED+MSD
Sbjct: 659  ATLEVKDPGIY--SNSKRG-SDIEDGIGR---SHKHDRNQDMDQFSGAEDEMSD 706


>ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508779674|gb|EOY26930.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 562

 Score =  572 bits (1473), Expect = e-160
 Identities = 311/534 (58%), Positives = 363/534 (67%), Gaps = 5/534 (0%)
 Frame = +2

Query: 404  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXX 583
            N RS    G RD  GSG RE GHS H    + +   +P       + PNG A        
Sbjct: 45   NERSQG--GNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAGRVETEEE 97

Query: 584  XXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 763
                                   ESQ     KT +M SG KGHGS+VGSR+G+++ATPFL
Sbjct: 98   RRLRKKREFEKQRQEEKHRQQMKESQ-----KTQMMPSG-KGHGSMVGSRMGDRRATPFL 151

Query: 764  SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 943
            SGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL++L   K+++TKYTITSLEKM+KPKL
Sbjct: 152  SGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLEKMYKPKL 211

Query: 944  FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGV 1123
            FVEPD+G+PLDLLD+ VYN  +                   TPIK+DGIRRKERPTDKGV
Sbjct: 212  FVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGV 271

Query: 1124 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACK 1303
            SWLVKTQYISPLS E+ K SLTEKQAKELRE               +QI+EIEASFEA K
Sbjct: 272  SWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIEASFEASK 331

Query: 1304 SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 1483
             RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SVRD+HES 
Sbjct: 332  LRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESR 391

Query: 1484 AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 1663
            AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDVRGDDA+D
Sbjct: 392  AIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDAND 451

Query: 1664 PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXEL 1843
            PTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +                 EL
Sbjct: 452  PTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIEL 511

Query: 1844 KESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            KE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S+
Sbjct: 512  KEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLSE 562


>ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508779673|gb|EOY26929.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 685

 Score =  572 bits (1473), Expect = e-160
 Identities = 311/534 (58%), Positives = 363/534 (67%), Gaps = 5/534 (0%)
 Frame = +2

Query: 404  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXX 583
            N RS    G RD  GSG RE GHS H    + +   +P       + PNG A        
Sbjct: 168  NERSQG--GNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAGRVETEEE 220

Query: 584  XXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 763
                                   ESQ     KT +M SG KGHGS+VGSR+G+++ATPFL
Sbjct: 221  RRLRKKREFEKQRQEEKHRQQMKESQ-----KTQMMPSG-KGHGSMVGSRMGDRRATPFL 274

Query: 764  SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 943
            SGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL++L   K+++TKYTITSLEKM+KPKL
Sbjct: 275  SGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLEKMYKPKL 334

Query: 944  FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGV 1123
            FVEPD+G+PLDLLD+ VYN  +                   TPIK+DGIRRKERPTDKGV
Sbjct: 335  FVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGV 394

Query: 1124 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACK 1303
            SWLVKTQYISPLS E+ K SLTEKQAKELRE               +QI+EIEASFEA K
Sbjct: 395  SWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIEASFEASK 454

Query: 1304 SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 1483
             RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SVRD+HES 
Sbjct: 455  LRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESR 514

Query: 1484 AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 1663
            AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDVRGDDA+D
Sbjct: 515  AIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDAND 574

Query: 1664 PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXEL 1843
            PTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +                 EL
Sbjct: 575  PTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIEL 634

Query: 1844 KESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            KE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S+
Sbjct: 635  KEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLSE 685


>ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  571 bits (1472), Expect = e-160
 Identities = 311/534 (58%), Positives = 370/534 (69%), Gaps = 5/534 (0%)
 Frame = +2

Query: 404  NRRSHNREG--PRDAN-GSGWRE-SGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXX 571
            N  +H R+   P+D + G   RE S H KH     QK S  PMP   A    NG +    
Sbjct: 189  NMGAHERDKGVPKDPSYGRRDRENSNHDKH-----QKHSGPPMPPKKA----NGPSGRME 239

Query: 572  XXXXXXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKA 751
                                       ESQNT++ KT ++S+G K HGSIVGSR+GE+KA
Sbjct: 240  TDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKA 298

Query: 752  TPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMH 931
            TPFLSGERIENRLKKPTTFLCKLKFRNELPD +AQPKL+SL   K+ YT+YTITSLEK +
Sbjct: 299  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTY 358

Query: 932  KPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDG-IRRKERP 1108
            KP+L+VEPD+G+PLDLLD+ VYN ++   P               TP+K+DG I+RKERP
Sbjct: 359  KPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERP 418

Query: 1109 TDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEAS 1288
            TDKGV+WLVKTQYISPLS E+AK SLTEKQAKELRE               +QI+EIEAS
Sbjct: 419  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEAS 478

Query: 1289 FEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRD 1468
            FEACKSRP+HAT+  L PVE+LPLLPDFDRYDD FV+ AFD  PTADSE ++KLD+S+RD
Sbjct: 479  FEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD 538

Query: 1469 DHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRG 1648
             HES AIMKS++   S+ +KPEKFLAYMVPSPDEL KD+YDE ED+ Y+WVREY WDVRG
Sbjct: 539  AHESQAIMKSYMATSSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG 598

Query: 1649 DDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXX 1828
            D+ DDPTTYLV+FD+ EARY+PLPTKL+LRKKR KEGRS++EVE +              
Sbjct: 599  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTV 658

Query: 1829 XXXELKESGDYVSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
               E+K+ G Y  SNSKRG S IEDG+      HK  R QDMD +SGAED+MSD
Sbjct: 659  ATLEVKDPGIY--SNSKRG-SDIEDGIGR---SHKHDRHQDMDQFSGAEDEMSD 706


>ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated factor 1 homolog [Vitis
            vinifera]
          Length = 589

 Score =  568 bits (1463), Expect = e-159
 Identities = 306/535 (57%), Positives = 369/535 (68%), Gaps = 9/535 (1%)
 Frame = +2

Query: 413  SHNRE--GPRDANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXXX 586
            SH R+   P+D  G+G RE GHS  G   KQ+   VP      S  P G           
Sbjct: 64   SHGRDKGAPKDLRGAGRREPGHSNQGPSGKQQKPPVPPAPVKKSNGPPGRVETEEERRLR 123

Query: 587  XXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVG-SRVGEKKATPFL 763
                                  ESQNTV+ KT ++SSG KGHGS+VG SR+GE++ TPFL
Sbjct: 124  KKREFEKQRQEEKQKHQLK---ESQNTVLQKTQMLSSG-KGHGSVVGGSRMGERRTTPFL 179

Query: 764  SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 943
            SG+RIENRL+KPTTFLCKLKFRNELPDPTAQPKL++L T K+++TKYTITSLEKMHKP+L
Sbjct: 180  SGDRIENRLRKPTTFLCKLKFRNELPDPTAQPKLMALKTDKDRFTKYTITSLEKMHKPQL 239

Query: 944  FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGV 1123
            FVEPD+G+PLDLLD+ VYN  +   P               TP+K++GI++KERPTDKGV
Sbjct: 240  FVEPDLGIPLDLLDLSVYNPPSVRRPLDPEDEELLRDDESVTPVKKEGIKKKERPTDKGV 299

Query: 1124 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACK 1303
            SWLVKTQYISPLSTE+ K SLTEKQAKELRET              ++IQ IEA+F A K
Sbjct: 300  SWLVKTQYISPLSTESTKQSLTEKQAKELRETKGGRNILENFNSRERKIQNIEAAFAASK 359

Query: 1304 SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 1483
              PVH+T+  L+PVEILPLLPDF RYDD FV+A+FD  PTADSEIYSKLD++VRD HES 
Sbjct: 360  ITPVHSTNKSLKPVEILPLLPDFARYDDSFVVASFDSAPTADSEIYSKLDKTVRDSHESQ 419

Query: 1484 AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 1663
            AI+KS++  GS+ +KPEKFLAYM PSPDEL KD+YDE+ED  Y+WVREY WDVRGDDADD
Sbjct: 420  AILKSYMATGSDPSKPEKFLAYMAPSPDELSKDIYDENEDTSYSWVREYHWDVRGDDADD 479

Query: 1664 PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXEL 1843
            PTTYLV+F++ +ARYLPLPTKL+LRKKR KEGRS++EVE +                 EL
Sbjct: 480  PTTYLVSFNKTDARYLPLPTKLLLRKKRAKEGRSSDEVEHFPVPSKVTVRQRPNVAAIEL 539

Query: 1844 KESGDYVSSNSKRGRSA------IEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            K+  + V S+SKRG S+      +EDGL      +K  + Q MD  SGAED+MSD
Sbjct: 540  KD--EEVYSSSKRGVSSSKRGVDMEDGLGR---SYKGVQDQHMDQSSGAEDEMSD 589


>ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304396 [Fragaria vesca
            subsp. vesca]
          Length = 693

 Score =  567 bits (1462), Expect = e-159
 Identities = 313/543 (57%), Positives = 365/543 (67%), Gaps = 5/543 (0%)
 Frame = +2

Query: 377  QAEESRFHDNRRSHNREGPRDANGSGWRESGHSKH-GLPPKQKGSAVPMPLGNASRVPNG 553
            ++ ES F  ++  H++   +D   S  RE GHS H G+PPK K    P+PL   S   NG
Sbjct: 163  KSRESGF--DKGPHDKGASKDVGASAKREHGHSNHHGVPPKHKP---PVPLVKKS---NG 214

Query: 554  LAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSR 733
                                             ESQN+V+ KTH+MSSG KGHGSI GSR
Sbjct: 215  APGRVETEEERRLRKKREFEKQRQEEKHRQQAKESQNSVLQKTHLMSSG-KGHGSIAGSR 273

Query: 734  VGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTIT 913
            +GE++ TPFLSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+S+    +QYTKYTIT
Sbjct: 274  MGERRTTPFLSGERAENRLKKPTTFVCKLKFRNELPDPSAQPKLMSMKKDPDQYTKYTIT 333

Query: 914  SLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTTGTPH-TXXXXXXXXXXXXXTPIKQDGI 1090
            SLEK +KPKLFVEPD+G+PLDLLD+ VYN      P                TP+K+DGI
Sbjct: 334  SLEKNYKPKLFVEPDLGIPLDLLDLSVYNPPPGPRPPLAPEDEELLRDDVAVTPVKKDGI 393

Query: 1091 RRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQI 1270
            RRKERPTDKGV+WLVKTQYISPLS ++AK SLTEKQAKELRE               +QI
Sbjct: 394  RRKERPTDKGVAWLVKTQYISPLSMDSAKQSLTEKQAKELREMKGGRNLLDNLNDRERQI 453

Query: 1271 QEIEASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 1450
            +EIEASFEACKSRPVHAT+  L PVE+LPLLP  +RY+DQFVLA FDG PTADSEIYSKL
Sbjct: 454  KEIEASFEACKSRPVHATNKNLYPVEVLPLLPXHNRYEDQFVLAGFDGAPTADSEIYSKL 513

Query: 1451 DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 1630
            D+S  D  ES AIMKS+   G++ A P+KFLAYMVPSP+EL KD YDE EDI Y+WVREY
Sbjct: 514  DQSDHDLCESRAIMKSYKVTGADPANPDKFLAYMVPSPNELSKDPYDESEDISYSWVREY 573

Query: 1631 QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 1810
            Q+DVRGDD DD TTYLV+FDE+ ARY PLP KL+LRKKR KEGRS +EVE +        
Sbjct: 574  QYDVRGDDVDDLTTYLVSFDEDAARYAPLPAKLVLRKKRAKEGRSTDEVEHFPAPSRVTV 633

Query: 1811 XXXXXXXXXELKESGDY---VSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDD 1981
                     ELK++GDY     SN KR     ED LE P    K  R QD+D YSGAEDD
Sbjct: 634  RRRSTVSAIELKDAGDYSRGALSNLKRRGFDNEDALERP---QKRGRHQDVDEYSGAEDD 690

Query: 1982 MSD 1990
            +SD
Sbjct: 691  LSD 693


>ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Glycine max] gi|571464391|ref|XP_006583049.1| PREDICTED:
            RNA polymerase II-associated factor 1 homolog isoform X2
            [Glycine max]
          Length = 659

 Score =  566 bits (1459), Expect = e-158
 Identities = 299/522 (57%), Positives = 357/522 (68%), Gaps = 3/522 (0%)
 Frame = +2

Query: 434  RDANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXXXXXXXXXXXX 613
            ++ + SG RE  HS HG+  KQ     P+P+   +  P G A                  
Sbjct: 145  KEPSTSGRREYEHSNHGIAHKQHKQQPPVPVKKMNNGPPGRAETDEEKRLRKKREFEKQR 204

Query: 614  XXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLK 793
                         ESQNTV+ KTH++SSG KGHG I GSR+GE+++TP L  ER+ENRLK
Sbjct: 205  QEEKHRQQLK---ESQNTVLQKTHMLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRLK 260

Query: 794  KPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPL 973
            KPTTFLCKLKFRNELPDP+AQPKL++    K+QY KYTITSLEKM+KPKLFVEPD+G+PL
Sbjct: 261  KPTTFLCKLKFRNELPDPSAQPKLMASKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPL 320

Query: 974  DLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYIS 1153
            DLLD+ VYN  +   P               TPIK+DGI+RKERPTDKGV+WLVKTQYIS
Sbjct: 321  DLLDLSVYNPPSVRPPLAPEDKELLRDDEAVTPIKKDGIKRKERPTDKGVAWLVKTQYIS 380

Query: 1154 PLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACKSRPVHATSDK 1333
            PLS E+ K SLTEKQAKELRE               +QI+EIEASFEA KS PVHAT+  
Sbjct: 381  PLSMESTKQSLTEKQAKELREMKGGRGILDNLNSRERQIREIEASFEAAKSDPVHATNKD 440

Query: 1334 LQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAG 1513
            L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+++K+D+SVRD  ES A+MKS++   
Sbjct: 441  LYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMHAKMDKSVRDAFESKAVMKSYVATS 500

Query: 1514 SEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDE 1693
            S+ A PEKFLAYMVP+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDP T+LV FDE
Sbjct: 501  SDPANPEKFLAYMVPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPATFLVAFDE 560

Query: 1694 EEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSS- 1870
             EARYLPLPTKL+LRKKR KEGRS +EVEQ                  E K+SG Y SS 
Sbjct: 561  SEARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSK 620

Query: 1871 --NSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
              +SKRG   ++DGLE    +H+ A  QD    SGAED MSD
Sbjct: 621  GNSSKRGGLEMDDGLE---DQHRGAPHQDNYQSSGAEDYMSD 659


>ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phaseolus vulgaris]
            gi|561008678|gb|ESW07627.1| hypothetical protein
            PHAVU_010G145300g [Phaseolus vulgaris]
          Length = 661

 Score =  563 bits (1452), Expect = e-158
 Identities = 300/531 (56%), Positives = 358/531 (67%), Gaps = 5/531 (0%)
 Frame = +2

Query: 413  SHNREGPR--DANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXXX 586
            +HN E  R  D + SG RE   S HG+  KQ     P+P    ++  NG           
Sbjct: 139  THNNEERRFKDPSTSGRREYDPSNHGIGHKQHKHQPPVP----AKKVNGPPGRAETEEEK 194

Query: 587  XXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLS 766
                                  ESQNTV+ KTH++SSG KGHG + GSR+GE+++TP LS
Sbjct: 195  RLRKKREFEKQRQEEKHRQQLKESQNTVLQKTHLLSSG-KGHGLVAGSRMGERRSTPLLS 253

Query: 767  GERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLF 946
             ER+ENRLKKPTTFLCKLKFRNELPDP+AQPKL++    K+QY KYTITSLEKM+KPKLF
Sbjct: 254  AERVENRLKKPTTFLCKLKFRNELPDPSAQPKLMAFKKDKDQYAKYTITSLEKMYKPKLF 313

Query: 947  VEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVS 1126
            VEPD+G+PLDLLD+ VYN  +   P               TPIK+DGI+RKERPTDKGV+
Sbjct: 314  VEPDLGIPLDLLDLSVYNPPSVRPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVA 373

Query: 1127 WLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACKS 1306
            WLVKTQYISPLS E+ K SLTEKQAKELRE               +QI+EIEASFEA KS
Sbjct: 374  WLVKTQYISPLSMESTKQSLTEKQAKELREMKGGRGVLDNLNSRERQIREIEASFEAAKS 433

Query: 1307 RPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHA 1486
             PVHAT+  L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+Y+KLD+SVRD  ES A
Sbjct: 434  DPVHATNKDLYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKLDKSVRDAFESKA 493

Query: 1487 IMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDP 1666
            +MKS++   S+ A PEKFLAYM P+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDP
Sbjct: 494  VMKSYVATSSDPANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDP 553

Query: 1667 TTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELK 1846
            TT+ V FD+ EARYLPLPTKL+LRKKR KEGRS EE+EQ                  E K
Sbjct: 554  TTFFVAFDDSEARYLPLPTKLVLRKKRAKEGRSGEEIEQCPVPSRVTVRRRSSVAAIERK 613

Query: 1847 ESGDYVSS---NSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            ++G Y SS   +SKR R  ++DGLE     H+ A  QD    SGAED MS+
Sbjct: 614  DTGVYTSSRGNSSKRSRLEMDDGLE---HHHRGAPHQDNYQSSGAEDYMSE 661


>ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-like isoform X1 [Glycine
            max] gi|571472317|ref|XP_006585570.1| PREDICTED:
            bromodomain-containing protein 4-like isoform X2 [Glycine
            max]
          Length = 666

 Score =  556 bits (1432), Expect = e-155
 Identities = 303/543 (55%), Positives = 359/543 (66%), Gaps = 4/543 (0%)
 Frame = +2

Query: 374  NQAEESRFHDNRRSHNREGPRDANGSGWRESGHSKHGLPPKQ-KGSAVPMPLGNASRVPN 550
            N  EE RF            ++ + SG RE  HS HG+  KQ K    P+P+   +  P 
Sbjct: 144  NNNEERRF------------KEPSKSGRREYEHSNHGIAHKQHKQQQPPLPVKKMNNGPP 191

Query: 551  GLAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGS 730
            G A                               ESQNTV+ KTH++SSG KGHG I GS
Sbjct: 192  GRAETDEEKRLRKKREFEKQRQEEKHRQQLK---ESQNTVLQKTHLLSSG-KGHGMIAGS 247

Query: 731  RVGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTI 910
            R+GE+++TP L  ER+ENRLKKPTTFLCKLKFRNELPDP+AQPKL+S    K+QY KYTI
Sbjct: 248  RMGERRSTPLLGAERVENRLKKPTTFLCKLKFRNELPDPSAQPKLMSFKKDKDQYAKYTI 307

Query: 911  TSLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGI 1090
            TSLEKM+KPKLFVEPD+G+PLDLLD+ VYN      P               TPIK+DGI
Sbjct: 308  TSLEKMYKPKLFVEPDLGIPLDLLDLSVYNPPRVRPPLAPEDEELLRDDEAATPIKKDGI 367

Query: 1091 RRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQI 1270
            +RKERPTDKGV+WLVKTQYISPLS E+ K SLTEKQAKELRE               +QI
Sbjct: 368  KRKERPTDKGVAWLVKTQYISPLSMESTKQSLTEKQAKELREMKGRGILDNLNSRE-RQI 426

Query: 1271 QEIEASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 1450
            +EI+ASFEA KS PVHAT+  L PVE++PLLPDFDRYDDQFV+AAFD  PTADSE+Y+K+
Sbjct: 427  REIQASFEAAKSDPVHATNKDLYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKM 486

Query: 1451 DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 1630
            ++SVRD  ES A+MKS++  G + A PEKFLAYM P+P EL KD+YDE+ED+ Y+W+REY
Sbjct: 487  NKSVRDAFESKAVMKSYVATGLDPANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREY 546

Query: 1631 QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 1810
             WDVRGDDADDPTT+LV FDE EARYLPLPTKL+LRKKR KEGRS +EVEQ         
Sbjct: 547  HWDVRGDDADDPTTFLVAFDESEARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTV 606

Query: 1811 XXXXXXXXXELKESGDYVSSNS---KRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDD 1981
                     E K+SG Y SS     KR    ++DGLE    +H+ A  QD    SGAED 
Sbjct: 607  RRRSSVAAIERKDSGVYTSSKGNSFKRVGLEMDDGLE---DQHRGAPHQDNYQSSGAEDY 663

Query: 1982 MSD 1990
            MSD
Sbjct: 664  MSD 666


>ref|XP_007225143.1| hypothetical protein PRUPE_ppa002485mg [Prunus persica]
            gi|462422079|gb|EMJ26342.1| hypothetical protein
            PRUPE_ppa002485mg [Prunus persica]
          Length = 668

 Score =  553 bits (1426), Expect = e-154
 Identities = 300/532 (56%), Positives = 354/532 (66%), Gaps = 3/532 (0%)
 Frame = +2

Query: 404  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXX 583
            +R SH +   R+ + SG  E GH  HG+P KQ    VP       +  NG          
Sbjct: 160  DRGSHEKVASREVSVSGRGEHGHLNHGVPQKQHKPPVP---SMQVKKANGPPGRVETEEE 216

Query: 584  XXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 763
                                   +SQN+V+ KT ++SSG KGHGSI GSR+GE++ATPFL
Sbjct: 217  RRLRKKREFEKQRQEEKHRQQLKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRATPFL 275

Query: 764  SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 943
            SGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+SL   K+QYTKYTITSLEK +KPKL
Sbjct: 276  SGERTENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYKPKL 335

Query: 944  FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGV 1123
            FVEPD+G+PLDLLD+ VYN  +   P               TP+K++GI+RKERPTDKGV
Sbjct: 336  FVEPDLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDVAATPVKKNGIKRKERPTDKGV 395

Query: 1124 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACK 1303
            +WL                SLTEKQAKELRE               +QI+EIEASFEACK
Sbjct: 396  AWL----------------SLTEKQAKELREMKGGRNILDNLNDRERQIKEIEASFEACK 439

Query: 1304 SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 1483
            SRPVHAT+  L PVE+LPLLPDF+RY+DQFVLAAFDG PTADSEIYSKLD+S  D +ES 
Sbjct: 440  SRPVHATNKDLYPVEVLPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAYESR 499

Query: 1484 AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 1663
            AIMKS+   G++ A PEKFLAYMVPSP+EL KD YDE ED+ Y+WVREY +DVRGDD  D
Sbjct: 500  AIMKSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDDVHD 559

Query: 1664 PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXEL 1843
            PTTYLV+FDEEEARY PLPTKL+LRKKR KEG++++EVE +                 EL
Sbjct: 560  PTTYLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAAIEL 619

Query: 1844 KESGDYVS---SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            K+SGDY     SN K  R  IED LE P    K+AR QD+D YSGAEDD+SD
Sbjct: 620  KDSGDYSRGSVSNLKTRRFDIEDTLERP---RKIARHQDIDEYSGAEDDLSD 668


>ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Solanum
            tuberosum]
          Length = 700

 Score =  553 bits (1424), Expect = e-154
 Identities = 293/536 (54%), Positives = 351/536 (65%), Gaps = 7/536 (1%)
 Frame = +2

Query: 404  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXX 583
            ++R+ +R        SGWRESGH  H    KQ G +VP      S  P+G          
Sbjct: 171  DQRNESRPSAEKRRESGWRESGHGNHTARSKQPGHSVPPMPVKKSNAPSGRVETEEERRL 230

Query: 584  XXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 763
                                   ESQN V+ KT +++SG KGHGSI  S + +++  P L
Sbjct: 231  RKKREIEKQRHEEKNRQHLK---ESQNKVLQKTQMLTSGTKGHGSISASHMADRRTAPLL 287

Query: 764  SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 943
            SGER ENRLKKPTTFLCKLKFRNELPDPTAQPKLL+L    +++TKY+ITSLEKMHKP+L
Sbjct: 288  SGERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQL 347

Query: 944  FVEPDIGVPLDLLDICVYNS-NTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKG 1120
            +VEPD+G+PLDLLD+ VYN       P               TPIK+DGI++KERPTDKG
Sbjct: 348  YVEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKG 407

Query: 1121 VSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEAC 1300
            VSWLVKTQYISPLSTE+AK SLTEKQAKELRET              +QIQEIEASFEAC
Sbjct: 408  VSWLVKTQYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEAC 467

Query: 1301 KSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHES 1480
            KSRP+HAT+ +LQPV++ PL PDFDRY D FVLA +D  PTADSE Y+KLD++VRD  ES
Sbjct: 468  KSRPIHATNRRLQPVKVQPLYPDFDRYKDPFVLANYDSAPTADSETYNKLDKTVRDACES 527

Query: 1481 HAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDAD 1660
             A+MKSF+   S+  KP+KFLAYMVP+P+EL KD+YDE+EDI Y+WVREY WDVRGDDAD
Sbjct: 528  QAVMKSFVATSSDADKPDKFLAYMVPAPNELSKDMYDENEDISYSWVREYHWDVRGDDAD 587

Query: 1661 DPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXE 1840
            DP TY+V F E EARY+PLPTKL+LRKKR +EG+SNEEVE +                 E
Sbjct: 588  DPNTYVVAFGETEARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIE 647

Query: 1841 LKESGDYVS------SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            LKE G Y +      S+SKR R + ED +     +H      D D  SG E  MSD
Sbjct: 648  LKEEGGYTTALKGNVSSSKRSRISHEDDVG---EQHNNMHDDDQDQSSGGEYYMSD 700


>ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Citrus sinensis]
          Length = 576

 Score =  552 bits (1423), Expect = e-154
 Identities = 281/451 (62%), Positives = 336/451 (74%), Gaps = 5/451 (1%)
 Frame = +2

Query: 653  ESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKFRN 832
            ESQN VM K+ +++SG  GHGS+ GSR+G+++A P LSGERIENRLKKPTTFLCKLKFRN
Sbjct: 129  ESQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKFRN 188

Query: 833  ELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTT 1012
            ELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+ VYN  + 
Sbjct: 189  ELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSV 248

Query: 1013 GTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTE 1192
              P               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+A+ SLTE
Sbjct: 249  RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 308

Query: 1193 KQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLPDF 1372
            KQAKELRE               +QI+EIEASFEACK RP+HAT+  LQPVEILPLLPDF
Sbjct: 309  KQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDF 368

Query: 1373 DRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYM 1552
            +RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLAYM
Sbjct: 369  ERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYM 428

Query: 1553 VPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLI 1732
            VPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTKL 
Sbjct: 429  VPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLN 488

Query: 1733 LRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYV-----SSNSKRGRSAI 1897
            LRKKR  EGRSN+EVE +                 ELKE G Y      SS+SK GR   
Sbjct: 489  LRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDS 548

Query: 1898 EDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            ++ LE     H  +R QD    SGAEDDM D
Sbjct: 549  QEDLER---SHNGSRHQDPYQSSGAEDDMYD 576


>ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528867|gb|ESR40117.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 677

 Score =  552 bits (1423), Expect = e-154
 Identities = 281/451 (62%), Positives = 336/451 (74%), Gaps = 5/451 (1%)
 Frame = +2

Query: 653  ESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKFRN 832
            ESQN VM K+ +++SG  GHGS+VGSR+G+++A P LSGER ENRLKKPTTFLCKLKFRN
Sbjct: 230  ESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRN 289

Query: 833  ELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTT 1012
            ELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+ VYN  + 
Sbjct: 290  ELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSV 349

Query: 1013 GTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTE 1192
              P               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+A+ SLTE
Sbjct: 350  RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 409

Query: 1193 KQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLPDF 1372
            KQAKELRE               +QI+EIEASFEACK RP+HAT+  LQPVEILPLLPDF
Sbjct: 410  KQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDF 469

Query: 1373 DRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYM 1552
            +RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLAYM
Sbjct: 470  ERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYM 529

Query: 1553 VPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLI 1732
            VPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTKL 
Sbjct: 530  VPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLN 589

Query: 1733 LRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYV-----SSNSKRGRSAI 1897
            LRKKR  EGRSN+EVE +                 ELKE G Y      SS+SK GR   
Sbjct: 590  LRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDS 649

Query: 1898 EDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            ++ LE     H  +R QD    SGAEDDM D
Sbjct: 650  QEDLER---SHNGSRQQDPYQSSGAEDDMYD 677


>ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X2
            [Citrus sinensis]
          Length = 570

 Score =  551 bits (1421), Expect = e-154
 Identities = 280/446 (62%), Positives = 335/446 (75%)
 Frame = +2

Query: 653  ESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKFRN 832
            ESQN VM K+ +++SG  GHGS+ GSR+G+++A P LSGERIENRLKKPTTFLCKLKFRN
Sbjct: 129  ESQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKFRN 188

Query: 833  ELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTT 1012
            ELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+ VYN  + 
Sbjct: 189  ELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSV 248

Query: 1013 GTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTE 1192
              P               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+A+ SLTE
Sbjct: 249  RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 308

Query: 1193 KQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLPDF 1372
            KQAKELRE               +QI+EIEASFEACK RP+HAT+  LQPVEILPLLPDF
Sbjct: 309  KQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDF 368

Query: 1373 DRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYM 1552
            +RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLAYM
Sbjct: 369  ERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYM 428

Query: 1553 VPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLI 1732
            VPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTKL 
Sbjct: 429  VPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLN 488

Query: 1733 LRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSSNSKRGRSAIEDGLE 1912
            LRKKR  EGRSN+EVE +                 ELKE G   SS+SK GR   ++ LE
Sbjct: 489  LRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGGN-SSSSKMGRVDSQEDLE 547

Query: 1913 TPVPRHKVARVQDMDHYSGAEDDMSD 1990
                 H  +R QD    SGAEDDM D
Sbjct: 548  R---SHNGSRHQDPYQSSGAEDDMYD 570


>ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254885 [Solanum
            lycopersicum]
          Length = 698

 Score =  544 bits (1401), Expect = e-152
 Identities = 291/535 (54%), Positives = 348/535 (65%), Gaps = 7/535 (1%)
 Frame = +2

Query: 407  RRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXXX 586
            +R+ +R        SGWRES H  H    KQ   +VP PL    +  N  +         
Sbjct: 170  QRNESRHSVEKRRESGWRESRHGNHTARSKQPDHSVP-PL--PMKKSNAHSGRVETEEER 226

Query: 587  XXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLS 766
                                  ESQN V+ KT +++SG KGHGSI  S + +++ TP LS
Sbjct: 227  RSRKKREIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTTPLLS 286

Query: 767  GERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLF 946
            GER ENRLKKPTTFLCKLKFRNELPDPTAQPKLL+L    +++TKY+ITSLEKMHKP+L 
Sbjct: 287  GERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQLH 346

Query: 947  VEPDIGVPLDLLDICVYNS-NTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGV 1123
            VEPD+G+PLDLLD+ VYN       P               TPIK+DGI++KERPTDKGV
Sbjct: 347  VEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKGV 406

Query: 1124 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACK 1303
            SWLVKTQYISPLSTE+AK SLTEKQAKELRET              +QIQEIEASFEACK
Sbjct: 407  SWLVKTQYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEACK 466

Query: 1304 SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 1483
            SRP+HA++ +LQP+++ PL PDFDRY D FVLA +D  PTADSE YSKLD++VRD  ES 
Sbjct: 467  SRPIHASNRRLQPIKVQPLYPDFDRYKDPFVLANYDSAPTADSETYSKLDKTVRDACESQ 526

Query: 1484 AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 1663
            A+MKSF+   S+  KP+KFLAYMVP+P+EL KD+YDE EDI Y+WVREY WDVRGDDADD
Sbjct: 527  AVMKSFVATSSDADKPDKFLAYMVPAPNELSKDIYDESEDISYSWVREYHWDVRGDDADD 586

Query: 1664 PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXEL 1843
            P TY+V F E EARY+PLPTKL+LRKKR +EG+SNEEVE +                 EL
Sbjct: 587  PNTYVVAFGEREARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIEL 646

Query: 1844 KESGDYVS------SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            KE G Y +      S+SKR R + ED +     +H      D D  SG E  MSD
Sbjct: 647  KEEGGYTTALKGNVSSSKRSRISHEDDVG---EQHNNMHDDDQDQSSGGEYYMSD 698


>ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528868|gb|ESR40118.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 632

 Score =  530 bits (1365), Expect = e-147
 Identities = 260/401 (64%), Positives = 311/401 (77%)
 Frame = +2

Query: 653  ESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKFRN 832
            ESQN VM K+ +++SG  GHGS+VGSR+G+++A P LSGER ENRLKKPTTFLCKLKFRN
Sbjct: 230  ESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRN 289

Query: 833  ELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTT 1012
            ELP+P+AQPKL++L   K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+ VYN  + 
Sbjct: 290  ELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSV 349

Query: 1013 GTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTEAAKMSLTE 1192
              P               TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS E+A+ SLTE
Sbjct: 350  RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 409

Query: 1193 KQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACKSRPVHATSDKLQPVEILPLLPDF 1372
            KQAKELRE               +QI+EIEASFEACK RP+HAT+  LQPVEILPLLPDF
Sbjct: 410  KQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDF 469

Query: 1373 DRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYM 1552
            +RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++  GS+ A PEKFLAYM
Sbjct: 470  ERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYM 529

Query: 1553 VPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLI 1732
            VPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTKL 
Sbjct: 530  VPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLN 589

Query: 1733 LRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESG 1855
            LRKKR  EGRSN+EVE +                 ELKE G
Sbjct: 590  LRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQG 630


>ref|XP_007024309.1| Hydroxyproline-rich glycoprotein family protein isoform 3 [Theobroma
            cacao] gi|508779675|gb|EOY26931.1| Hydroxyproline-rich
            glycoprotein family protein isoform 3 [Theobroma cacao]
          Length = 662

 Score =  528 bits (1359), Expect = e-147
 Identities = 297/534 (55%), Positives = 343/534 (64%), Gaps = 5/534 (0%)
 Frame = +2

Query: 404  NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPLGNASRVPNGLAXXXXXXXX 583
            N RS    G RD  GSG RE GHS H    + +   +P       + PNG A        
Sbjct: 168  NERSQG--GNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAGRVETEEE 220

Query: 584  XXXXXXXXXXXXXXXXXXXXXXXESQNTVMHKTHIMSSGMKGHGSIVGSRVGEKKATPFL 763
                                   ESQ     KT +M SG KGHGS+VGSR+G+++ATPFL
Sbjct: 221  RRLRKKREFEKQRQEEKHRQQMKESQ-----KTQMMPSG-KGHGSMVGSRMGDRRATPFL 274

Query: 764  SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 943
            SGERIENRLKKPTTFLCKLKF                       TKYTITSLEKM+KPKL
Sbjct: 275  SGERIENRLKKPTTFLCKLKF-----------------------TKYTITSLEKMYKPKL 311

Query: 944  FVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGV 1123
            FVEPD+G+PLDLLD+ VYN  +                   TPIK+DGIRRKERPTDKGV
Sbjct: 312  FVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGV 371

Query: 1124 SWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQEIEASFEACK 1303
            SWLVKTQYISPLS E+ K SLTEKQAKELRE               +QI+EIEASFEA K
Sbjct: 372  SWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIEASFEASK 431

Query: 1304 SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 1483
             RPVHAT+  L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SVRD+HES 
Sbjct: 432  LRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESR 491

Query: 1484 AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 1663
            AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDVRGDDA+D
Sbjct: 492  AIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDAND 551

Query: 1664 PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXEL 1843
            PTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E +                 EL
Sbjct: 552  PTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIEL 611

Query: 1844 KESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1990
            KE   Y S     S+SK GR   EDGL      HK+AR  D+D YSGAEDD+S+
Sbjct: 612  KEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLSE 662


>emb|CBI36059.3| unnamed protein product [Vitis vinifera]
          Length = 420

 Score =  522 bits (1345), Expect = e-145
 Identities = 266/425 (62%), Positives = 321/425 (75%), Gaps = 6/425 (1%)
 Frame = +2

Query: 734  VGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTIT 913
            +GE++ TPFLSG+RIENRL+KPTTFLCKLKFRNELPDPTAQPKL++L T K+++TKYTIT
Sbjct: 1    MGERRTTPFLSGDRIENRLRKPTTFLCKLKFRNELPDPTAQPKLMALKTDKDRFTKYTIT 60

Query: 914  SLEKMHKPKLFVEPDIGVPLDLLDICVYNSNTTGTPHTXXXXXXXXXXXXXTPIKQDGIR 1093
            SLEKMHKP+LFVEPD+G+PLDLLD+ VYN  +   P               TP+K++GI+
Sbjct: 61   SLEKMHKPQLFVEPDLGIPLDLLDLSVYNPPSVRRPLDPEDEELLRDDESVTPVKKEGIK 120

Query: 1094 RKERPTDKGVSWLVKTQYISPLSTEAAKMSLTEKQAKELRETXXXXXXXXXXXXXXKQIQ 1273
            +KERPTDKGVSWLVKTQYISPLSTE+ K SLTEKQAKELRET              ++IQ
Sbjct: 121  KKERPTDKGVSWLVKTQYISPLSTESTKQSLTEKQAKELRETKGGRNILENFNSRERKIQ 180

Query: 1274 EIEASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLD 1453
             IEA+F A K  PVH+T+  L+PVEILPLLPDF RYDD FV+A+FD  PTADSEIYSKLD
Sbjct: 181  NIEAAFAASKITPVHSTNKSLKPVEILPLLPDFARYDDSFVVASFDSAPTADSEIYSKLD 240

Query: 1454 RSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQ 1633
            ++VRD HES AI+KS++  GS+ +KPEKFLAYM PSPDEL KD+YDE+ED  Y+WVREY 
Sbjct: 241  KTVRDSHESQAILKSYMATGSDPSKPEKFLAYMAPSPDELSKDIYDENEDTSYSWVREYH 300

Query: 1634 WDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXX 1813
            WDVRGDDADDPTTYLV+F++ +ARYLPLPTKL+LRKKR KEGRS++EVE +         
Sbjct: 301  WDVRGDDADDPTTYLVSFNKTDARYLPLPTKLLLRKKRAKEGRSSDEVEHFPVPSKVTVR 360

Query: 1814 XXXXXXXXELKESGDYVSSNSKRGRSA------IEDGLETPVPRHKVARVQDMDHYSGAE 1975
                    ELK+  + V S+SKRG S+      +EDGL      +K  + Q MD  SGAE
Sbjct: 361  QRPNVAAIELKD--EEVYSSSKRGVSSSKRGVDMEDGLGR---SYKGVQDQHMDQSSGAE 415

Query: 1976 DDMSD 1990
            D+MSD
Sbjct: 416  DEMSD 420

