BLASTX nr result

ID: Catharanthus22_contig00006665 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00006665
         (3119 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated fact...   684   0.0  
ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254...   675   0.0  
gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]     619   e-174
ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated fact...   617   e-173
ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203...   614   e-173
ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cuc...   609   e-171
ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated fact...   599   e-168
ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citr...   596   e-167
ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated fact...   596   e-167
gb|ESW07627.1| hypothetical protein PHAVU_010G145300g [Phaseolus...   595   e-167
ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-...   594   e-167
ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated fact...   593   e-166
gb|EMJ18862.1| hypothetical protein PRUPE_ppa002145mg [Prunus pe...   592   e-166
ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citr...   588   e-165
ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304...   581   e-163
gb|EOY26930.1| Hydroxyproline-rich glycoprotein family protein i...   577   e-161
gb|EOY26929.1| Hydroxyproline-rich glycoprotein family protein i...   577   e-161
ref|XP_002515964.1| conserved hypothetical protein [Ricinus comm...   571   e-160
gb|EMJ26342.1| hypothetical protein PRUPE_ppa002485mg [Prunus pe...   551   e-154
ref|XP_002303312.2| hydroxyproline-rich glycoprotein [Populus tr...   550   e-153

>ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Solanum
            tuberosum]
          Length = 700

 Score =  684 bits (1765), Expect = 0.0
 Identities = 348/529 (65%), Positives = 400/529 (75%), Gaps = 12/529 (2%)
 Frame = +2

Query: 1232 EDRGQSKDGRDSGWRE--------HAKQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXX 1387
            E R  ++  R+SGWRE         +KQ   SVPP+PVKKSN P GRVET+         
Sbjct: 175  ESRPSAEKRRESGWRESGHGNHTARSKQPGHSVPPMPVKKSNAPSGRVETEEERRLRKKR 234

Query: 1388 XXXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIEN 1567
                      +R H+KESQN+VLQKTQML+SG KGHGSI+ SHM DRRTAPLLSGER EN
Sbjct: 235  EIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTAPLLSGERTEN 294

Query: 1568 RLKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXX 1747
            RLKKPTTFLCKLKFRNELPD TAQPKL+ LRRD +RFTKY+ITSLEK+HKPQLYVE    
Sbjct: 295  RLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQLYVEPDLG 354

Query: 1748 XXXXXXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKT 1927
                    SVYNPPKG                NPITPIKKDGIK+K+RPTDKGVSWLVKT
Sbjct: 355  IPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKGVSWLVKT 414

Query: 1928 QYISPLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHA 2095
            QYISPLS ES KQSLTEKQ +E R     R++LENLN R+ QIQEIEASF+ACK+RP+HA
Sbjct: 415  QYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEACKSRPIHA 474

Query: 2096 TNRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSF 2275
            TNR+LQPV++ PL+PDFDRY D FV+A +DSAPTA+SETY+KL+K VRD  ESQA+MKSF
Sbjct: 475  TNRRLQPVKVQPLYPDFDRYKDPFVLANYDSAPTADSETYNKLDKTVRDACESQAVMKSF 534

Query: 2276 MATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLV 2455
            +ATSSD  KPDKFLAYMVP+PNELSKD+YDE+ED+SYSW+REYHWDVRGDDADDP TY+V
Sbjct: 535  VATSSDADKPDKFLAYMVPAPNELSKDMYDENEDISYSWVREYHWDVRGDDADDPNTYVV 594

Query: 2456 TFGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXX 2635
             FG+TEA YMPLPTKL+LRKKRAREGKS EEVEHFPVPSR+TVR+R T A IE K+    
Sbjct: 595  AFGETEARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIELKEEGGY 654

Query: 2636 XXXXXXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                            + ED++G  EQH ++HDD +QDQSSG EY MSD
Sbjct: 655  TTALKGNVSSSKRSRISHEDDVG--EQHNNMHDD-DQDQSSGGEYYMSD 700


>ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254885 [Solanum
            lycopersicum]
          Length = 698

 Score =  675 bits (1741), Expect = 0.0
 Identities = 344/529 (65%), Positives = 396/529 (74%), Gaps = 12/529 (2%)
 Frame = +2

Query: 1232 EDRGQSKDGRDSGWRE--------HAKQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXX 1387
            E R   +  R+SGWRE         +KQ   SVPP+P+KKSN   GRVET+         
Sbjct: 173  ESRHSVEKRRESGWRESRHGNHTARSKQPDHSVPPLPMKKSNAHSGRVETEEERRSRKKR 232

Query: 1388 XXXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIEN 1567
                      +R H+KESQN+VLQKTQML+SG KGHGSI+ SHM DRRT PLLSGER EN
Sbjct: 233  EIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTTPLLSGERTEN 292

Query: 1568 RLKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXX 1747
            RLKKPTTFLCKLKFRNELPD TAQPKL+ LRRD +RFTKY+ITSLEK+HKPQL+VE    
Sbjct: 293  RLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQLHVEPDLG 352

Query: 1748 XXXXXXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKT 1927
                    SVYNPPKG                NPITPIKKDGIK+K+RPTDKGVSWLVKT
Sbjct: 353  IPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKGVSWLVKT 412

Query: 1928 QYISPLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHA 2095
            QYISPLS ES KQSLTEKQ +E R     R++LENLN R+ QIQEIEASF+ACK+RP+HA
Sbjct: 413  QYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEACKSRPIHA 472

Query: 2096 TNRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSF 2275
            +NR+LQP+++ PL+PDFDRY D FV+A +DSAPTA+SETYSKL+K VRD  ESQA+MKSF
Sbjct: 473  SNRRLQPIKVQPLYPDFDRYKDPFVLANYDSAPTADSETYSKLDKTVRDACESQAVMKSF 532

Query: 2276 MATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLV 2455
            +ATSSD  KPDKFLAYMVP+PNELSKDIYDESED+SYSW+REYHWDVRGDDADDP TY+V
Sbjct: 533  VATSSDADKPDKFLAYMVPAPNELSKDIYDESEDISYSWVREYHWDVRGDDADDPNTYVV 592

Query: 2456 TFGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXX 2635
             FG+ EA YMPLPTKL+LRKKRAREGKS EEVEHFPVPSR+TVR+R T A IE K+    
Sbjct: 593  AFGEREARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIELKEEGGY 652

Query: 2636 XXXXXXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                            + ED++G  EQH ++HDD +QDQSSG EY MSD
Sbjct: 653  TTALKGNVSSSKRSRISHEDDVG--EQHNNMHDD-DQDQSSGGEYYMSD 698


>gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]
          Length = 697

 Score =  619 bits (1596), Expect = e-174
 Identities = 319/528 (60%), Positives = 382/528 (72%), Gaps = 13/528 (2%)
 Frame = +2

Query: 1235 DRGQSKDGRDSGWREHA---------KQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXX 1387
            DRG SK+   SG REH          KQHK  VP +PVKKSNGP GRVET+         
Sbjct: 175  DRGVSKEVAGSGRREHGYSNHHGTHHKQHKYPVPSVPVKKSNGPMGRVETEEERRLRKKR 234

Query: 1388 XXXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIEN 1567
                      HR H+KESQ+  LQKTQ+LS+ AKGHGSI GS MG+RR    LSGERIEN
Sbjct: 235  EFEKQKQEEKHRQHLKESQHSALQKTQILSA-AKGHGSIAGSRMGERRATSFLSGERIEN 293

Query: 1568 RLKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXX 1747
            RLKKPTTFLCKLKFRNELPD +AQPKLM ++R+K++++KY ITSLEK +KP+L+VE    
Sbjct: 294  RLKKPTTFLCKLKFRNELPDPSAQPKLMSMKREKDQYSKYTITSLEKTYKPKLFVEPDLG 353

Query: 1748 XXXXXXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKT 1927
                    SVYNPP  +                 +TP+KKDGIKRK+RPTDKGV+WLVKT
Sbjct: 354  IPLNLLDLSVYNPPSVR-PPLDPEDEELLRDDEAVTPVKKDGIKRKERPTDKGVAWLVKT 412

Query: 1928 QYISPLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHA 2095
            QYISPLSMESTKQSLTEKQ +E R     R++LENLN R+ QI+EI+ASF+ACK+RPVHA
Sbjct: 413  QYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNDRDRQIKEIQASFEACKSRPVHA 472

Query: 2096 TNRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSF 2275
            TN+ L PVE+LPL PDFDRYDDQFV+A FDSAPTA+SE YSK+++ +RD  ESQA++KS+
Sbjct: 473  TNKSLYPVEVLPLLPDFDRYDDQFVLAAFDSAPTADSEVYSKMDQSIRDAHESQAVLKSY 532

Query: 2276 MATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLV 2455
              T SD G P+KFLAYMVPSP+ELSKDIYDE EDVSYSW+REYHWDVRGDDADDPTTYLV
Sbjct: 533  KVTGSDPGNPEKFLAYMVPSPDELSKDIYDEHEDVSYSWVREYHWDVRGDDADDPTTYLV 592

Query: 2456 TFGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXX 2635
            +F +TEA Y+PLPTKL+LRKKRA+EG+S +EVEHFPVP+R+TVRRR TV+ +E KD    
Sbjct: 593  SFDETEARYLPLPTKLVLRKKRAKEGRSGDEVEHFPVPARVTVRRRPTVSVVELKDAEVY 652

Query: 2636 XXXXXXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMS 2779
                           D E+   G +  HK    + + D+ SGAE D+S
Sbjct: 653  SNPRGSLSNFKRGGSDVED---GLERSHKVARQE-DVDEYSGAEDDLS 696


>ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated factor 1 homolog [Vitis
            vinifera]
          Length = 589

 Score =  617 bits (1590), Expect = e-173
 Identities = 329/534 (61%), Positives = 383/534 (71%), Gaps = 13/534 (2%)
 Frame = +2

Query: 1220 HGFAEDRGQSKDGRDSGWRE--------HAKQHKASVPPIPVKKSNGPPGRVETDXXXXX 1375
            HG   D+G  KD R +G RE          KQ K  VPP PVKKSNGPPGRVET+     
Sbjct: 65   HG--RDKGAPKDLRGAGRREPGHSNQGPSGKQQKPPVPPAPVKKSNGPPGRVETEEERRL 122

Query: 1376 XXXXXXXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITG-SHMGDRRTAPLLSG 1552
                           +H +KESQN VLQKTQMLSSG KGHGS+ G S MG+RRT P LSG
Sbjct: 123  RKKREFEKQRQEEKQKHQLKESQNTVLQKTQMLSSG-KGHGSVVGGSRMGERRTTPFLSG 181

Query: 1553 ERIENRLKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYV 1732
            +RIENRL+KPTTFLCKLKFRNELPD TAQPKLM L+ DK+RFTKY ITSLEK+HKPQL+V
Sbjct: 182  DRIENRLRKPTTFLCKLKFRNELPDPTAQPKLMALKTDKDRFTKYTITSLEKMHKPQLFV 241

Query: 1733 EXXXXXXXXXXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVS 1912
            E            SVYNPP  +               + +TP+KK+GIK+K+RPTDKGVS
Sbjct: 242  EPDLGIPLDLLDLSVYNPPSVRRPLDPEDEELLRDDES-VTPVKKEGIKKKERPTDKGVS 300

Query: 1913 WLVKTQYISPLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKA 2080
            WLVKTQYISPLS ESTKQSLTEKQ +E R     R++LEN NSRE +IQ IEA+F A K 
Sbjct: 301  WLVKTQYISPLSTESTKQSLTEKQAKELRETKGGRNILENFNSRERKIQNIEAAFAASKI 360

Query: 2081 RPVHATNRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQA 2260
             PVH+TN+ L+PVEILPL PDF RYDD FVVA+FDSAPTA+SE YSKL+K VRD+ ESQA
Sbjct: 361  TPVHSTNKSLKPVEILPLLPDFARYDDSFVVASFDSAPTADSEIYSKLDKTVRDSHESQA 420

Query: 2261 IMKSFMATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDP 2440
            I+KS+MAT SD  KP+KFLAYM PSP+ELSKDIYDE+ED SYSW+REYHWDVRGDDADDP
Sbjct: 421  ILKSYMATGSDPSKPEKFLAYMAPSPDELSKDIYDENEDTSYSWVREYHWDVRGDDADDP 480

Query: 2441 TTYLVTFGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESK 2620
            TTYLV+F  T+A Y+PLPTKL+LRKKRA+EG+S++EVEHFPVPS++TVR+R  VA IE K
Sbjct: 481  TTYLVSFNKTDARYLPLPTKLLLRKKRAKEGRSSDEVEHFPVPSKVTVRQRPNVAAIELK 540

Query: 2621 DLXXXXXXXXXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
            D                  +D E+   G    +K +  D   DQSSGAE +MSD
Sbjct: 541  D-EEVYSSSKRGVSSSKRGVDMED---GLGRSYKGV-QDQHMDQSSGAEDEMSD 589


>ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  614 bits (1584), Expect = e-173
 Identities = 324/525 (61%), Positives = 379/525 (72%), Gaps = 9/525 (1%)
 Frame = +2

Query: 1235 DRGQSKDG----RDSGWREHAKQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXXXXXXX 1402
            D+G  KD     RD     H K  K S PP+P KK+NGP GR+ETD              
Sbjct: 196  DKGVPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQ 255

Query: 1403 XXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKP 1582
                 HRHH+KESQN +LQKTQMLS+G K HGSI GS MG+R+  P LSGERIENRLKKP
Sbjct: 256  RQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKATPFLSGERIENRLKKP 314

Query: 1583 TTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXX 1762
            TTFLCKLKFRNELPD +AQPKLM LR++K+ +T+Y ITSLEK +KPQLYVE         
Sbjct: 315  TTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDL 374

Query: 1763 XXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDG-IKRKDRPTDKGVSWLVKTQYIS 1939
               SVYNP   +                  TP+KKDG IKRK+RPTDKGV+WLVKTQYIS
Sbjct: 375  LDLSVYNPSSVRMPLAPEDEELLRDDVLK-TPVKKDGGIKRKERPTDKGVAWLVKTQYIS 433

Query: 1940 PLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHATNRK 2107
            PLS+ES KQSLTEKQ +E R     R++LENLN+RE QI+EIEASF+ACK+RP+HATN+ 
Sbjct: 434  PLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPIHATNKN 493

Query: 2108 LQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATS 2287
            L PVE+LPL PDFDRYDD FVV  FDSAPTA+SET++KL++ +RD  ESQAIMKS+MATS
Sbjct: 494  LYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATS 553

Query: 2288 SDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGD 2467
            SD  KP+KFLAYMVPSP+ELSKDIYDE EDVSYSW+REYHWDVRGD+ DDPTTYLV+F D
Sbjct: 554  SDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDD 613

Query: 2468 TEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXX 2647
             EA Y+PLPTKL+LRKKRA+EG+S++EVEHFP P+R+TVRRR TVAT+E KD        
Sbjct: 614  AEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSK 673

Query: 2648 XXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                          ED +G   +H D H DM  DQ SGAE +MSD
Sbjct: 674  RGSDI---------EDGIGRSHKH-DRHQDM--DQFSGAEDEMSD 706


>ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  609 bits (1570), Expect = e-171
 Identities = 321/525 (61%), Positives = 377/525 (71%), Gaps = 9/525 (1%)
 Frame = +2

Query: 1235 DRGQSKDG----RDSGWREHAKQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXXXXXXX 1402
            D+G  KD     RD     H K  K S PP+P KK+NGP GR+ETD              
Sbjct: 196  DKGAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQ 255

Query: 1403 XXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKP 1582
                 HRHH+KESQN +LQKTQMLS+G K HGSI GS MG+R+  P LSGERIENRLKKP
Sbjct: 256  RQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKATPFLSGERIENRLKKP 314

Query: 1583 TTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXX 1762
            TTFLCKLKFRNELPD +AQPKLM LR++K+ +T+Y ITSLEK +KPQLYVE         
Sbjct: 315  TTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDL 374

Query: 1763 XXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDG-IKRKDRPTDKGVSWLVKTQYIS 1939
               SVYNP   +                  TP+KKDG IKRK+RPTDKGV+WLVKTQYIS
Sbjct: 375  LDLSVYNPSSVRMPLAPEDEELLRDDVLK-TPVKKDGGIKRKERPTDKGVAWLVKTQYIS 433

Query: 1940 PLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHATNRK 2107
            PLS+ES KQSLTEKQ +E R     R++LENLN+RE QI+EIE SF+ACK+RP+HATN+ 
Sbjct: 434  PLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATNKN 493

Query: 2108 LQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATS 2287
            L PVE+LPL PDFDRYDD FVV  FDSAPTA+SET++KL++ +RD  ESQAIMKS+MAT 
Sbjct: 494  LYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATG 553

Query: 2288 SDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGD 2467
            SD  KP+KFLAYMVPSP+ELSKDIYDE EDVSYSW+REYHWDVRGD+ DDPTTYLV+F D
Sbjct: 554  SDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDD 613

Query: 2468 TEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXX 2647
             EA Y+PLPTKL+LRKKRA+EG+S++EVEHFP P+R+TVRRR TVAT+E KD        
Sbjct: 614  AEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSK 673

Query: 2648 XXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                          ED +G   +H D + DM  DQ SGAE +MSD
Sbjct: 674  RGSDI---------EDGIGRSHKH-DRNQDM--DQFSGAEDEMSD 706


>ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Citrus sinensis]
          Length = 576

 Score =  599 bits (1544), Expect = e-168
 Identities = 311/505 (61%), Positives = 365/505 (72%), Gaps = 4/505 (0%)
 Frame = +2

Query: 1280 HAKQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXXXXXXXXXXXXHRHHIKESQNRVLQ 1459
            H+KQH+  VPP  VKK NG  GRVET+                   HR  +KESQN V+Q
Sbjct: 77   HSKQHRPPVPPPGVKKVNGGSGRVETEEERRVRKKREYEKHRQEEKHRLQMKESQNVVMQ 136

Query: 1460 KTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKPTTFLCKLKFRNELPDQTAQ 1639
            K+QM++SG  GHGS+ GS MGDRR APLLSGERIENRLKKPTTFLCKLKFRNELP+ +AQ
Sbjct: 137  KSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKFRNELPEPSAQ 196

Query: 1640 PKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXXXXXSVYNPPKGKNXXXXXX 1819
            PKLM L++DK+RFT+Y  +SLEK +KPQL+VE            SVYNPP  +       
Sbjct: 197  PKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVR-PPLDPE 255

Query: 1820 XXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYISPLSMESTKQSLTEKQLRESR 1999
                      +TP+KKDGIKRK+RPTDKGVSWLVKTQYISPLSMES +QSLTEKQ +E R
Sbjct: 256  DEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELR 315

Query: 2000 S----RDLLENLNSREGQIQEIEASFKACKARPVHATNRKLQPVEILPLFPDFDRYDDQF 2167
                 R +LENLN RE QI+EIEASF+ACK RP+HATN+ LQPVEILPL PDF+RYDDQF
Sbjct: 316  EMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQF 375

Query: 2168 VVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATSSDTGKPDKFLAYMVPSPNEL 2347
            V ATFD APTA+SE YSK++K VRD  ES+AIMKS++AT SD+  P+KFLAYMVPS NEL
Sbjct: 376  VAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNEL 435

Query: 2348 SKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGDTEAHYMPLPTKLILRKKRAR 2527
            SKD+YDE+EDVS+SW+REYHWDVRGDDADDPTTYLV+F D EA Y+PLPTKL LRKKRA 
Sbjct: 436  SKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAI 495

Query: 2528 EGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXXXXXXXXXXXXLDTEEDEMGE 2707
            EG+S +EVEHFP+PS + VRRR+ V  IE K+                  +D++ED   E
Sbjct: 496  EGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQEDL--E 553

Query: 2708 DEQHKDLHDDMEQDQSSGAEYDMSD 2782
               +   H D    QSSGAE DM D
Sbjct: 554  RSHNGSRHQD--PYQSSGAEDDMYD 576


>ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528867|gb|ESR40117.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 677

 Score =  596 bits (1537), Expect = e-167
 Identities = 309/507 (60%), Positives = 365/507 (71%), Gaps = 6/507 (1%)
 Frame = +2

Query: 1280 HAKQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXXXXXXXXXXXXHRHHIKESQNRVLQ 1459
            H+KQH+  VPP  VKK NG  GRVET+                   HR  +KESQN V+Q
Sbjct: 178  HSKQHRPPVPPPGVKKVNGGSGRVETEEERRIRKKREYEKHRQEEKHRLQMKESQNVVMQ 237

Query: 1460 KTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKPTTFLCKLKFRNELPDQTAQ 1639
            K+QM++SG  GHGS+ GS MGDRR APLLSGER ENRLKKPTTFLCKLKFRNELP+ +AQ
Sbjct: 238  KSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQ 297

Query: 1640 PKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXXXXXSVYNPPKGKNXXXXXX 1819
            PKLM L++DK+RFT+Y  +SLEK +KPQL+VE            SVYNPP  +       
Sbjct: 298  PKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVR-PPLDPE 356

Query: 1820 XXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYISPLSMESTKQSLTEKQLRESR 1999
                      +TP+KKDGIKRK+RPTDKGVSWLVKTQYISPLSMES +QSLTEKQ +E R
Sbjct: 357  DEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELR 416

Query: 2000 S----RDLLENLNSREGQIQEIEASFKACKARPVHATNRKLQPVEILPLFPDFDRYDDQF 2167
                 R +LENLN RE QI+EIEASF+ACK RP+HATN+ LQPVEILPL PDF+RYDDQF
Sbjct: 417  EMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQF 476

Query: 2168 VVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATSSDTGKPDKFLAYMVPSPNEL 2347
            V ATFD APTA+SE YSK++K VRD  ES+AIMKS++AT SD+  P+KFLAYMVPS NEL
Sbjct: 477  VAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNEL 536

Query: 2348 SKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGDTEAHYMPLPTKLILRKKRAR 2527
            SKD+YDE+EDVS+SW+REYHWDVRGDDADDPTTYLV+F D EA Y+PLPTKL LRKKRA 
Sbjct: 537  SKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAI 596

Query: 2528 EGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXXXXXXXXXXXXLDTEEDEMGE 2707
            EG+S +EVEHFP+PS + VRRR+ V  IE K+                  +D++ED    
Sbjct: 597  EGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQED---- 652

Query: 2708 DEQHKDLHDDMEQD--QSSGAEYDMSD 2782
                +  +   +QD  QSSGAE DM D
Sbjct: 653  --LERSHNGSRQQDPYQSSGAEDDMYD 677


>ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X2
            [Citrus sinensis]
          Length = 570

 Score =  596 bits (1536), Expect = e-167
 Identities = 311/511 (60%), Positives = 367/511 (71%), Gaps = 10/511 (1%)
 Frame = +2

Query: 1280 HAKQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXXXXXXXXXXXXHRHHIKESQNRVLQ 1459
            H+KQH+  VPP  VKK NG  GRVET+                   HR  +KESQN V+Q
Sbjct: 77   HSKQHRPPVPPPGVKKVNGGSGRVETEEERRVRKKREYEKHRQEEKHRLQMKESQNVVMQ 136

Query: 1460 KTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKPTTFLCKLKFRNELPDQTAQ 1639
            K+QM++SG  GHGS+ GS MGDRR APLLSGERIENRLKKPTTFLCKLKFRNELP+ +AQ
Sbjct: 137  KSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKFRNELPEPSAQ 196

Query: 1640 PKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXXXXXSVYNPPKGKNXXXXXX 1819
            PKLM L++DK+RFT+Y  +SLEK +KPQL+VE            SVYNPP  +       
Sbjct: 197  PKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVR-PPLDPE 255

Query: 1820 XXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYISPLSMESTKQSLTEKQLRESR 1999
                      +TP+KKDGIKRK+RPTDKGVSWLVKTQYISPLSMES +QSLTEKQ +E R
Sbjct: 256  DEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELR 315

Query: 2000 S----RDLLENLNSREGQIQEIEASFKACKARPVHATNRKLQPVEILPLFPDFDRYDDQF 2167
                 R +LENLN RE QI+EIEASF+ACK RP+HATN+ LQPVEILPL PDF+RYDDQF
Sbjct: 316  EMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQF 375

Query: 2168 VVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATSSDTGKPDKFLAYMVPSPNEL 2347
            V ATFD APTA+SE YSK++K VRD  ES+AIMKS++AT SD+  P+KFLAYMVPS NEL
Sbjct: 376  VAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNEL 435

Query: 2348 SKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGDTEAHYMPLPTKLILRKKRAR 2527
            SKD+YDE+EDVS+SW+REYHWDVRGDDADDPTTYLV+F D EA Y+PLPTKL LRKKRA 
Sbjct: 436  SKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAI 495

Query: 2528 EGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXXXXXXXXXXXXLDTEEDEMGE 2707
            EG+S +EVEHFP+PS + VRRR+ V  IE K+                   ++   +MG 
Sbjct: 496  EGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGG----------------NSSSSKMGR 539

Query: 2708 DEQHKDL----HDDMEQD--QSSGAEYDMSD 2782
             +  +DL    +    QD  QSSGAE DM D
Sbjct: 540  VDSQEDLERSHNGSRHQDPYQSSGAEDDMYD 570


>gb|ESW07627.1| hypothetical protein PHAVU_010G145300g [Phaseolus vulgaris]
          Length = 661

 Score =  595 bits (1533), Expect = e-167
 Identities = 313/523 (59%), Positives = 369/523 (70%), Gaps = 12/523 (2%)
 Frame = +2

Query: 1250 KDGRDSGWREHA--------KQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXXXXXXXX 1405
            KD   SG RE+         KQHK   PP+P KK NGPPGR ET+               
Sbjct: 148  KDPSTSGRREYDPSNHGIGHKQHKHQ-PPVPAKKVNGPPGRAETEEEKRLRKKREFEKQR 206

Query: 1406 XXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKPT 1585
                HR  +KESQN VLQKT +LSSG KGHG + GS MG+RR+ PLLS ER+ENRLKKPT
Sbjct: 207  QEEKHRQQLKESQNTVLQKTHLLSSG-KGHGLVAGSRMGERRSTPLLSAERVENRLKKPT 265

Query: 1586 TFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXXX 1765
            TFLCKLKFRNELPD +AQPKLM  ++DK+++ KY ITSLEK++KP+L+VE          
Sbjct: 266  TFLCKLKFRNELPDPSAQPKLMAFKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDLL 325

Query: 1766 XXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYISPL 1945
              SVYNPP  +                  TPIKKDGIKRK+RPTDKGV+WLVKTQYISPL
Sbjct: 326  DLSVYNPPSVR-PPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVAWLVKTQYISPL 384

Query: 1946 SMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHATNRKLQ 2113
            SMESTKQSLTEKQ +E R     R +L+NLNSRE QI+EIEASF+A K+ PVHATN+ L 
Sbjct: 385  SMESTKQSLTEKQAKELREMKGGRGVLDNLNSRERQIREIEASFEAAKSDPVHATNKDLY 444

Query: 2114 PVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATSSD 2293
            PVE++PL PDFDRYDDQFVVA FD+APTA+SE Y+KL+K VRD FES+A+MKS++ATSSD
Sbjct: 445  PVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKLDKSVRDAFESKAVMKSYVATSSD 504

Query: 2294 TGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGDTE 2473
               P+KFLAYM P+P ELSKDIYDE+EDVSYSWIREYHWDVRGDDADDPTT+ V F D+E
Sbjct: 505  PANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPTTFFVAFDDSE 564

Query: 2474 AHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXXXX 2653
            A Y+PLPTKL+LRKKRA+EG+S EE+E  PVPSR+TVRRRS+VA IE KD          
Sbjct: 565  ARYLPLPTKLVLRKKRAKEGRSGEEIEQCPVPSRVTVRRRSSVAAIERKDTGVYTSSRGN 624

Query: 2654 XXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                    +D   +       H+D +      QSSGAE  MS+
Sbjct: 625  SSKRSRLEMDDGLEHHHRGAPHQDNY------QSSGAEDYMSE 661


>ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-like isoform X1 [Glycine
            max] gi|571472317|ref|XP_006585570.1| PREDICTED:
            bromodomain-containing protein 4-like isoform X2 [Glycine
            max]
          Length = 666

 Score =  594 bits (1532), Expect = e-167
 Identities = 311/523 (59%), Positives = 371/523 (70%), Gaps = 12/523 (2%)
 Frame = +2

Query: 1250 KDGRDSGWREHA--------KQHKASVPPIPVKK-SNGPPGRVETDXXXXXXXXXXXXXX 1402
            K+   SG RE+         KQHK   PP+PVKK +NGPPGR ETD              
Sbjct: 152  KEPSKSGRREYEHSNHGIAHKQHKQQQPPLPVKKMNNGPPGRAETDEEKRLRKKREFEKQ 211

Query: 1403 XXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKP 1582
                 HR  +KESQN VLQKT +LSSG KGHG I GS MG+RR+ PLL  ER+ENRLKKP
Sbjct: 212  RQEEKHRQQLKESQNTVLQKTHLLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRLKKP 270

Query: 1583 TTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXX 1762
            TTFLCKLKFRNELPD +AQPKLM  ++DK+++ KY ITSLEK++KP+L+VE         
Sbjct: 271  TTFLCKLKFRNELPDPSAQPKLMSFKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDL 330

Query: 1763 XXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYISP 1942
               SVYNPP+ +                  TPIKKDGIKRK+RPTDKGV+WLVKTQYISP
Sbjct: 331  LDLSVYNPPRVR-PPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVAWLVKTQYISP 389

Query: 1943 LSMESTKQSLTEKQ---LRESRSRDLLENLNSREGQIQEIEASFKACKARPVHATNRKLQ 2113
            LSMESTKQSLTEKQ   LRE + R +L+NLNSRE QI+EI+ASF+A K+ PVHATN+ L 
Sbjct: 390  LSMESTKQSLTEKQAKELREMKGRGILDNLNSRERQIREIQASFEAAKSDPVHATNKDLY 449

Query: 2114 PVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATSSD 2293
            PVE++PL PDFDRYDDQFVVA FD+APTA+SE Y+K+ K VRD FES+A+MKS++AT  D
Sbjct: 450  PVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKMNKSVRDAFESKAVMKSYVATGLD 509

Query: 2294 TGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGDTE 2473
               P+KFLAYM P+P ELSKDIYDE+EDVSYSWIREYHWDVRGDDADDPTT+LV F ++E
Sbjct: 510  PANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPTTFLVAFDESE 569

Query: 2474 AHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXXXX 2653
            A Y+PLPTKL+LRKKRA+EG+S +EVE  PVP+R+TVRRRS+VA IE KD          
Sbjct: 570  ARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSKGN 629

Query: 2654 XXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                    +D   ++      H+D +      QSSGAE  MSD
Sbjct: 630  SFKRVGLEMDDGLEDQHRGAPHQDNY------QSSGAEDYMSD 666


>ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Glycine max] gi|571464391|ref|XP_006583049.1| PREDICTED:
            RNA polymerase II-associated factor 1 homolog isoform X2
            [Glycine max]
          Length = 659

 Score =  593 bits (1528), Expect = e-166
 Identities = 313/524 (59%), Positives = 373/524 (71%), Gaps = 13/524 (2%)
 Frame = +2

Query: 1250 KDGRDSGWREHA--------KQHKASVPPIPVKK-SNGPPGRVETDXXXXXXXXXXXXXX 1402
            K+   SG RE+         KQHK   PP+PVKK +NGPPGR ETD              
Sbjct: 145  KEPSTSGRREYEHSNHGIAHKQHKQQ-PPVPVKKMNNGPPGRAETDEEKRLRKKREFEKQ 203

Query: 1403 XXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKP 1582
                 HR  +KESQN VLQKT MLSSG KGHG I GS MG+RR+ PLL  ER+ENRLKKP
Sbjct: 204  RQEEKHRQQLKESQNTVLQKTHMLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRLKKP 262

Query: 1583 TTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXX 1762
            TTFLCKLKFRNELPD +AQPKLM  ++DK+++ KY ITSLEK++KP+L+VE         
Sbjct: 263  TTFLCKLKFRNELPDPSAQPKLMASKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDL 322

Query: 1763 XXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYISP 1942
               SVYNPP  +                 +TPIKKDGIKRK+RPTDKGV+WLVKTQYISP
Sbjct: 323  LDLSVYNPPSVR-PPLAPEDKELLRDDEAVTPIKKDGIKRKERPTDKGVAWLVKTQYISP 381

Query: 1943 LSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHATNRKL 2110
            LSMESTKQSLTEKQ +E R     R +L+NLNSRE QI+EIEASF+A K+ PVHATN+ L
Sbjct: 382  LSMESTKQSLTEKQAKELREMKGGRGILDNLNSRERQIREIEASFEAAKSDPVHATNKDL 441

Query: 2111 QPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATSS 2290
             PVE++PL PDFDRYDDQFVVA FD+APTA+SE ++K++K VRD FES+A+MKS++ATSS
Sbjct: 442  YPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMHAKMDKSVRDAFESKAVMKSYVATSS 501

Query: 2291 DTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGDT 2470
            D   P+KFLAYMVP+P ELSKDIYDE+EDVSYSWIREYHWDVRGDDADDP T+LV F ++
Sbjct: 502  DPANPEKFLAYMVPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPATFLVAFDES 561

Query: 2471 EAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXXX 2650
            EA Y+PLPTKL+LRKKRA+EG+S +EVE  PVP+R+TVRRRS+VA IE KD         
Sbjct: 562  EARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSKG 621

Query: 2651 XXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                     +D   ++      H+D +      QSSGAE  MSD
Sbjct: 622  NSSKRGGLEMDDGLEDQHRGAPHQDNY------QSSGAEDYMSD 659


>gb|EMJ18862.1| hypothetical protein PRUPE_ppa002145mg [Prunus persica]
          Length = 709

 Score =  592 bits (1525), Expect = e-166
 Identities = 306/530 (57%), Positives = 375/530 (70%), Gaps = 12/530 (2%)
 Frame = +2

Query: 1229 AEDRGQSKDGRDSGWREHA--------KQHKASVPPIPVKKSNGPPGRVETDXXXXXXXX 1384
            + ++G  +D   SG REH         KQHK  VP +PVKK+NGPPGRVET+        
Sbjct: 188  SHEKGAPRDVSVSGRREHGHLNHGVPQKQHKPPVPSMPVKKANGPPGRVETEEERRLRKK 247

Query: 1385 XXXXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIE 1564
                       HR  +K+SQN VLQKTQMLSSG KGHGSI GS MG+RR  P LSGER E
Sbjct: 248  REFEKQRQEEKHRQQLKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRATPFLSGERTE 306

Query: 1565 NRLKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXX 1744
            NRLKKPTTF+CKLKFRNELPD +AQPKLM L++DK+++TKY ITSLEK +KP+L+VE   
Sbjct: 307  NRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYKPKLFVEPDL 366

Query: 1745 XXXXXXXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVK 1924
                     SVYNPP  +                  TP+K +GI+RK+RPTDKGV+WLVK
Sbjct: 367  GIPLDLLDLSVYNPPSVRPPLALEDEELLRDDV-AATPVKNNGIRRKERPTDKGVAWLVK 425

Query: 1925 TQYISPLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVH 2092
            TQYISPLSM+S +QSLTEKQ +E R     R++L+NLN RE QI++IEASF+ACK+RPVH
Sbjct: 426  TQYISPLSMDSARQSLTEKQAKELREMKGGRNILDNLNDRERQIKDIEASFEACKSRPVH 485

Query: 2093 ATNRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKS 2272
            ATN+ L PVEILPL PDF+RY+DQFV+A FD APTA+SE YSKL++   D +ES+AIMKS
Sbjct: 486  ATNKNLYPVEILPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAYESRAIMKS 545

Query: 2273 FMATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYL 2452
            +  T +D   P+KFLAYMVPSPNELSKD YDESEDVSYSW+REYH+DVRGDD  DPTTYL
Sbjct: 546  YKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDDVHDPTTYL 605

Query: 2453 VTFGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXX 2632
            V+F + EA Y PLPTKL+LRKKR++EGK+++EVEHFP PSR+TVR+RSTVA IE KD   
Sbjct: 606  VSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAAIELKDSGD 665

Query: 2633 XXXXXXXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                           ++   +   +  +H+D+      D+ SGAE D+SD
Sbjct: 666  YSRGSVSNLKTRRFDVEDTLERPRKIARHQDI------DEYSGAEDDLSD 709


>ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528868|gb|ESR40118.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 632

 Score =  588 bits (1516), Expect = e-165
 Identities = 295/452 (65%), Positives = 345/452 (76%), Gaps = 4/452 (0%)
 Frame = +2

Query: 1280 HAKQHKASVPPIPVKKSNGPPGRVETDXXXXXXXXXXXXXXXXXXXHRHHIKESQNRVLQ 1459
            H+KQH+  VPP  VKK NG  GRVET+                   HR  +KESQN V+Q
Sbjct: 178  HSKQHRPPVPPPGVKKVNGGSGRVETEEERRIRKKREYEKHRQEEKHRLQMKESQNVVMQ 237

Query: 1460 KTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKKPTTFLCKLKFRNELPDQTAQ 1639
            K+QM++SG  GHGS+ GS MGDRR APLLSGER ENRLKKPTTFLCKLKFRNELP+ +AQ
Sbjct: 238  KSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQ 297

Query: 1640 PKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXXXXXXSVYNPPKGKNXXXXXX 1819
            PKLM L++DK+RFT+Y  +SLEK +KPQL+VE            SVYNPP  +       
Sbjct: 298  PKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVR-PPLDPE 356

Query: 1820 XXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYISPLSMESTKQSLTEKQLRESR 1999
                      +TP+KKDGIKRK+RPTDKGVSWLVKTQYISPLSMES +QSLTEKQ +E R
Sbjct: 357  DEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELR 416

Query: 2000 S----RDLLENLNSREGQIQEIEASFKACKARPVHATNRKLQPVEILPLFPDFDRYDDQF 2167
                 R +LENLN RE QI+EIEASF+ACK RP+HATN+ LQPVEILPL PDF+RYDDQF
Sbjct: 417  EMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQF 476

Query: 2168 VVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATSSDTGKPDKFLAYMVPSPNEL 2347
            V ATFD APTA+SE YSK++K VRD  ES+AIMKS++AT SD+  P+KFLAYMVPS NEL
Sbjct: 477  VAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNEL 536

Query: 2348 SKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGDTEAHYMPLPTKLILRKKRAR 2527
            SKD+YDE+EDVS+SW+REYHWDVRGDDADDPTTYLV+F D EA Y+PLPTKL LRKKRA 
Sbjct: 537  SKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAI 596

Query: 2528 EGKSTEEVEHFPVPSRLTVRRRSTVATIESKD 2623
            EG+S +EVEHFP+PS + VRRR+ V  IE K+
Sbjct: 597  EGRSNDEVEHFPIPSSIAVRRRANVTAIELKE 628


>ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304396 [Fragaria vesca
            subsp. vesca]
          Length = 693

 Score =  581 bits (1497), Expect = e-163
 Identities = 310/536 (57%), Positives = 370/536 (69%), Gaps = 13/536 (2%)
 Frame = +2

Query: 1214 REHGFAE---DRGQSKDGRDSGWREHAKQHKASVPP-----IP-VKKSNGPPGRVETDXX 1366
            RE GF +   D+G SKD   S  REH   +   VPP     +P VKKSNG PGRVET+  
Sbjct: 165  RESGFDKGPHDKGASKDVGASAKREHGHSNHHGVPPKHKPPVPLVKKSNGAPGRVETEEE 224

Query: 1367 XXXXXXXXXXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLL 1546
                             HR   KESQN VLQKT ++SSG KGHGSI GS MG+RRT P L
Sbjct: 225  RRLRKKREFEKQRQEEKHRQQAKESQNSVLQKTHLMSSG-KGHGSIAGSRMGERRTTPFL 283

Query: 1547 SGERIENRLKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQL 1726
            SGER ENRLKKPTTF+CKLKFRNELPD +AQPKLM +++D +++TKY ITSLEK +KP+L
Sbjct: 284  SGERAENRLKKPTTFVCKLKFRNELPDPSAQPKLMSMKKDPDQYTKYTITSLEKNYKPKL 343

Query: 1727 YVEXXXXXXXXXXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKG 1906
            +VE            SVYNPP G                  +TP+KKDGI+RK+RPTDKG
Sbjct: 344  FVEPDLGIPLDLLDLSVYNPPPGPRPPLAPEDEELLRDDVAVTPVKKDGIRRKERPTDKG 403

Query: 1907 VSWLVKTQYISPLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKAC 2074
            V+WLVKTQYISPLSM+S KQSLTEKQ +E R     R+LL+NLN RE QI+EIEASF+AC
Sbjct: 404  VAWLVKTQYISPLSMDSAKQSLTEKQAKELREMKGGRNLLDNLNDRERQIKEIEASFEAC 463

Query: 2075 KARPVHATNRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFES 2254
            K+RPVHATN+ L PVE+LPL P  +RY+DQFV+A FD APTA+SE YSKL++   D  ES
Sbjct: 464  KSRPVHATNKNLYPVEVLPLLPXHNRYEDQFVLAGFDGAPTADSEIYSKLDQSDHDLCES 523

Query: 2255 QAIMKSFMATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDAD 2434
            +AIMKS+  T +D   PDKFLAYMVPSPNELSKD YDESED+SYSW+REY +DVRGDD D
Sbjct: 524  RAIMKSYKVTGADPANPDKFLAYMVPSPNELSKDPYDESEDISYSWVREYQYDVRGDDVD 583

Query: 2435 DPTTYLVTFGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIE 2614
            D TTYLV+F +  A Y PLP KL+LRKKRA+EG+ST+EVEHFP PSR+TVRRRSTV+ IE
Sbjct: 584  DLTTYLVSFDEDAARYAPLPAKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRSTVSAIE 643

Query: 2615 SKDLXXXXXXXXXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
             KD                     + ++  E  Q +  H D+  D+ SGAE D+SD
Sbjct: 644  LKDAGDYSRGALSNLKRRGF----DNEDALERPQKRGRHQDV--DEYSGAEDDLSD 693


>gb|EOY26930.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 562

 Score =  577 bits (1487), Expect = e-161
 Identities = 312/525 (59%), Positives = 371/525 (70%), Gaps = 10/525 (1%)
 Frame = +2

Query: 1238 RGQSKDGRDSGWREHA-KQHKASV---PPI--PVKKSNGPPGRVETDXXXXXXXXXXXXX 1399
            +G ++D   SG REH    H A V    P+  PVKK NGP GRVET+             
Sbjct: 49   QGGNRDFLGSGRREHGHSNHAAGVRDQKPMMPPVKKPNGPAGRVETEEERRLRKKREFEK 108

Query: 1400 XXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKK 1579
                  HR  +KESQ     KTQM+ SG KGHGS+ GS MGDRR  P LSGERIENRLKK
Sbjct: 109  QRQEEKHRQQMKESQ-----KTQMMPSG-KGHGSMVGSRMGDRRATPFLSGERIENRLKK 162

Query: 1580 PTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXX 1759
            PTTFLCKLKFRNELPD +AQPKLM L++DK+RFTKY ITSLEK++KP+L+VE        
Sbjct: 163  PTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLEKMYKPKLFVEPDLGIPLD 222

Query: 1760 XXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYIS 1939
                SVYNPP  +                 +TPIKKDGI+RK+RPTDKGVSWLVKTQYIS
Sbjct: 223  LLDLSVYNPPSVR-PSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGVSWLVKTQYIS 281

Query: 1940 PLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHATNRK 2107
            PLSMESTKQSLTEKQ +E R     R++LENLN+RE QI+EIEASF+A K RPVHATN+ 
Sbjct: 282  PLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIEASFEASKLRPVHATNKN 341

Query: 2108 LQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATS 2287
            L+PVE++PL PDFDRY+DQFV+  FD APTA+SE +SKL+  VRD  ES+AIMKS++A S
Sbjct: 342  LEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESRAIMKSYLAAS 401

Query: 2288 SDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGD 2467
            SD   P+KFLAYMVPS +ELSK +YDE EDVSYSW+REY+WDVRGDDA+DPTTYLV+F +
Sbjct: 402  SDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDANDPTTYLVSFDE 461

Query: 2468 TEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXX 2647
             EA Y+PLPTKL LRKKRAREG++ +E+EHFP+P+R+TVRRRSTVA IE K+        
Sbjct: 462  GEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIELKEPEVYTSSR 521

Query: 2648 XXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                      LD  ED +G   +    HD    DQ SGAE D+S+
Sbjct: 522  GGMSSSKIGRLDA-EDGLGRSHKLARHHD---VDQYSGAEDDLSE 562


>gb|EOY26929.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 685

 Score =  577 bits (1487), Expect = e-161
 Identities = 312/525 (59%), Positives = 371/525 (70%), Gaps = 10/525 (1%)
 Frame = +2

Query: 1238 RGQSKDGRDSGWREHA-KQHKASV---PPI--PVKKSNGPPGRVETDXXXXXXXXXXXXX 1399
            +G ++D   SG REH    H A V    P+  PVKK NGP GRVET+             
Sbjct: 172  QGGNRDFLGSGRREHGHSNHAAGVRDQKPMMPPVKKPNGPAGRVETEEERRLRKKREFEK 231

Query: 1400 XXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENRLKK 1579
                  HR  +KESQ     KTQM+ SG KGHGS+ GS MGDRR  P LSGERIENRLKK
Sbjct: 232  QRQEEKHRQQMKESQ-----KTQMMPSG-KGHGSMVGSRMGDRRATPFLSGERIENRLKK 285

Query: 1580 PTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXXXXX 1759
            PTTFLCKLKFRNELPD +AQPKLM L++DK+RFTKY ITSLEK++KP+L+VE        
Sbjct: 286  PTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLEKMYKPKLFVEPDLGIPLD 345

Query: 1760 XXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQYIS 1939
                SVYNPP  +                 +TPIKKDGI+RK+RPTDKGVSWLVKTQYIS
Sbjct: 346  LLDLSVYNPPSVR-PSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGVSWLVKTQYIS 404

Query: 1940 PLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARPVHATNRK 2107
            PLSMESTKQSLTEKQ +E R     R++LENLN+RE QI+EIEASF+A K RPVHATN+ 
Sbjct: 405  PLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIEASFEASKLRPVHATNKN 464

Query: 2108 LQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFMATS 2287
            L+PVE++PL PDFDRY+DQFV+  FD APTA+SE +SKL+  VRD  ES+AIMKS++A S
Sbjct: 465  LEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESRAIMKSYLAAS 524

Query: 2288 SDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVTFGD 2467
            SD   P+KFLAYMVPS +ELSK +YDE EDVSYSW+REY+WDVRGDDA+DPTTYLV+F +
Sbjct: 525  SDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDANDPTTYLVSFDE 584

Query: 2468 TEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXXXXX 2647
             EA Y+PLPTKL LRKKRAREG++ +E+EHFP+P+R+TVRRRSTVA IE K+        
Sbjct: 585  GEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIELKEPEVYTSSR 644

Query: 2648 XXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                      LD  ED +G   +    HD    DQ SGAE D+S+
Sbjct: 645  GGMSSSKIGRLDA-EDGLGRSHKLARHHD---VDQYSGAEDDLSE 685


>ref|XP_002515964.1| conserved hypothetical protein [Ricinus communis]
            gi|223544869|gb|EEF46384.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 672

 Score =  571 bits (1471), Expect = e-160
 Identities = 299/535 (55%), Positives = 369/535 (68%), Gaps = 12/535 (2%)
 Frame = +2

Query: 1214 REHGFAEDRGQSKDGRDSGWREHA------KQHKASVPPIPVKKSNGPP-GRVETDXXXX 1372
            +E G   D+G S++ R+ G   H       +QH+   PP   KK +GPP GRVET+    
Sbjct: 143  KESGGERDKGMSRERRELGNSNHGDVSRHEQQHRPPAPPPGGKKVSGPPSGRVETEEERR 202

Query: 1373 XXXXXXXXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSG 1552
                           HR  +KESQN +LQKTQMLS+  KGHGSI GS MGDRR  PLL G
Sbjct: 203  LRKKREFEKHRQEEKHRQQVKESQNSILQKTQMLSA-QKGHGSIVGSRMGDRRAPPLLGG 261

Query: 1553 ERIENRLKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYV 1732
            ERIENRLKKPTTFLCKLKFRNELPD +AQPKLM ++RDK+RFTKY ITSLEK++KPQL+V
Sbjct: 262  ERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMTMKRDKDRFTKYTITSLEKMYKPQLFV 321

Query: 1733 EXXXXXXXXXXXXSVYN-PPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGV 1909
            E            SVYN PP  +                 +TP+K++G+K K+RPTDKGV
Sbjct: 322  EPDLGIPLDLLDLSVYNRPPASERPPLDPEDEELLRDDEAVTPVKREGLKIKERPTDKGV 381

Query: 1910 SWLVKTQYISPLSMESTKQSLTEKQLRESRSR----DLLENLNSREGQIQEIEASFKACK 2077
            SWLVKTQYIS LS +STKQS+TEKQ +E R R    +LL+NLN+RE QI+EIEASF+ACK
Sbjct: 382  SWLVKTQYISSLSTDSTKQSMTEKQAKELRERKGGHNLLKNLNNRESQIKEIEASFEACK 441

Query: 2078 ARPVHATNRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQ 2257
              PVHATN+ L+PVEILPL PDFDRY+D+FV   FD+APTA+SE YSKL+  VR+  ES+
Sbjct: 442  LTPVHATNKNLKPVEILPLIPDFDRYEDKFVTVAFDNAPTADSEIYSKLDSSVREACESR 501

Query: 2258 AIMKSFMATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADD 2437
            A+MK+ +AT SD   P+KFLAYM PSPNELSKD+YDE+ED+SY+W+REYHWDV+GD  +D
Sbjct: 502  AVMKACVATGSDPANPEKFLAYMAPSPNELSKDMYDENEDISYNWVREYHWDVQGDGGND 561

Query: 2438 PTTYLVTFGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIES 2617
            PTT+LV+F +  A Y+PLPTK+ LRKKRAREG+S +EVEHFP PS +TVRRR T A  E 
Sbjct: 562  PTTFLVSFDEDAARYVPLPTKINLRKKRAREGRSGDEVEHFPAPSSVTVRRRPTAAAREL 621

Query: 2618 KDLXXXXXXXXXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
            +D                  + T +D+ G    H+   DD + D SS AE D+S+
Sbjct: 622  RD---SAGASSSRGNILDSRMGTGDDDDGLGRVHRVARDD-DLDHSSEAEDDLSE 672


>gb|EMJ26342.1| hypothetical protein PRUPE_ppa002485mg [Prunus persica]
          Length = 668

 Score =  551 bits (1421), Expect = e-154
 Identities = 295/532 (55%), Positives = 358/532 (67%), Gaps = 8/532 (1%)
 Frame = +2

Query: 1211 RREHGFAEDRGQSKDGR-DSGWREHA---KQHKASVPPIPVKKSNGPPGRVETDXXXXXX 1378
            R  H     R  S  GR + G   H    KQHK  VP + VKK+NGPPGRVET+      
Sbjct: 161  RGSHEKVASREVSVSGRGEHGHLNHGVPQKQHKPPVPSMQVKKANGPPGRVETEEERRLR 220

Query: 1379 XXXXXXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGER 1558
                         HR  +K+SQN VLQKTQMLSSG KGHGSI GS MG+RR  P LSGER
Sbjct: 221  KKREFEKQRQEEKHRQQLKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRATPFLSGER 279

Query: 1559 IENRLKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEX 1738
             ENRLKKPTTF+CKLKFRNELPD +AQPKLM L++DK+++TKY ITSLEK +KP+L+VE 
Sbjct: 280  TENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYKPKLFVEP 339

Query: 1739 XXXXXXXXXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWL 1918
                       SVYNPP  +                  TP+KK+GIKRK+RPTDKGV+WL
Sbjct: 340  DLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDV-AATPVKKNGIKRKERPTDKGVAWL 398

Query: 1919 VKTQYISPLSMESTKQSLTEKQLRESRS----RDLLENLNSREGQIQEIEASFKACKARP 2086
                            SLTEKQ +E R     R++L+NLN RE QI+EIEASF+ACK+RP
Sbjct: 399  ----------------SLTEKQAKELREMKGGRNILDNLNDRERQIKEIEASFEACKSRP 442

Query: 2087 VHATNRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIM 2266
            VHATN+ L PVE+LPL PDF+RY+DQFV+A FD APTA+SE YSKL++   D +ES+AIM
Sbjct: 443  VHATNKDLYPVEVLPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAYESRAIM 502

Query: 2267 KSFMATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTT 2446
            KS+  T +D   P+KFLAYMVPSPNELSKD YDESEDVSYSW+REYH+DVRGDD  DPTT
Sbjct: 503  KSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDDVHDPTT 562

Query: 2447 YLVTFGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDL 2626
            YLV+F + EA Y PLPTKL+LRKKR++EGK+++EVEHFP PSR+TVR+RSTVA IE KD 
Sbjct: 563  YLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAAIELKDS 622

Query: 2627 XXXXXXXXXXXXXXXXXLDTEEDEMGEDEQHKDLHDDMEQDQSSGAEYDMSD 2782
                             ++   +   +  +H+D+      D+ SGAE D+SD
Sbjct: 623  GDYSRGSVSNLKTRRFDIEDTLERPRKIARHQDI------DEYSGAEDDLSD 668


>ref|XP_002303312.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550342419|gb|EEE78291.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 569

 Score =  550 bits (1416), Expect = e-153
 Identities = 292/529 (55%), Positives = 359/529 (67%), Gaps = 6/529 (1%)
 Frame = +2

Query: 1214 REHGFAEDRGQSKDGRDSGWREHAK-QHKASVPPIPVKKSNGPPGRVETDXXXXXXXXXX 1390
            ++  F  D+G S++ R+     H K Q + S  P+ VKK+NG PGRVET+          
Sbjct: 50   KDSAFERDKGVSQEKREHDHPNHGKHQQQQSQLPLVVKKANGHPGRVETEEERRLRKKRE 109

Query: 1391 XXXXXXXXXHRHHIKESQNRVLQKTQMLSSGAKGHGSITGSHMGDRRTAPLLSGERIENR 1570
                      R  +KESQN  L K  ++SS  KGHGSI GS +GDR   PLL GER ENR
Sbjct: 110  FEKQRQEENRRQQLKESQNSALLKNHVISS-QKGHGSIVGSRLGDRVATPLLGGERAENR 168

Query: 1571 LKKPTTFLCKLKFRNELPDQTAQPKLMMLRRDKERFTKYAITSLEKLHKPQLYVEXXXXX 1750
            LKKPTTF+CKLKFRNELPD +AQPKLM L+R+K+RFTKY ITSLEK++KPQLYVE     
Sbjct: 169  LKKPTTFMCKLKFRNELPDPSAQPKLMPLKREKDRFTKYTITSLEKMYKPQLYVEPDLGI 228

Query: 1751 XXXXXXXSVYNPPKGKNXXXXXXXXXXXXXXNPITPIKKDGIKRKDRPTDKGVSWLVKTQ 1930
                   SVYNPP  +               + +TP+K+DGIKRK+RPTDKGVSWLVKTQ
Sbjct: 229  PLDLLDLSVYNPPSVRPLLAPEDEELLHDDES-VTPVKRDGIKRKERPTDKGVSWLVKTQ 287

Query: 1931 YISPLSMESTKQSLTEKQLRESRSRD----LLENLNSREGQIQEIEASFKACKARPVHAT 2098
            YISPLSMES K SLTEKQ +E R       LL+NLN RE QI+EI+ASF + K  PVHAT
Sbjct: 288  YISPLSMESAKLSLTEKQAKELREMKGGCKLLDNLNKRERQIKEIQASFASNKLPPVHAT 347

Query: 2099 NRKLQPVEILPLFPDFDRYDDQFVVATFDSAPTAESETYSKLEKHVRDTFESQAIMKSFM 2278
            N+ L+PVEILPL PDFDRY D+FV   FD APTA++E Y K +   RD +ES AIMK+ +
Sbjct: 348  NKNLKPVEILPLLPDFDRYGDKFVTVAFDGAPTADAENYRKFDPSDRDAYESWAIMKACV 407

Query: 2279 ATSSDTGKPDKFLAYMVPSPNELSKDIYDESEDVSYSWIREYHWDVRGDDADDPTTYLVT 2458
            A+ SD   P+KFLAY VPSP+ELSKD+YDE+ED+ YSWIREYHWDVRGDD DDP+T+LV+
Sbjct: 408  ASGSDPANPEKFLAYTVPSPDELSKDMYDENEDILYSWIREYHWDVRGDDVDDPSTFLVS 467

Query: 2459 FGDTEAHYMPLPTKLILRKKRAREGKSTEEVEHFPVPSRLTVRRRSTVATIESKDLXXXX 2638
            F + EA Y+PLPTK+ LRKKRAREG+S +E+EHFP+PSR+TVR+R+  ATIE +D     
Sbjct: 468  FDEAEARYLPLPTKISLRKKRAREGRSGDEIEHFPIPSRVTVRKRAVAATIEQRD----- 522

Query: 2639 XXXXXXXXXXXXXLDTEEDEMGEDE-QHKDLHDDMEQDQSSGAEYDMSD 2782
                         ++  EDE G    Q   L +D+    SSGAE +MS+
Sbjct: 523  SGAISNSRGNNSRMERFEDEDGLGRLQRVALDEDLH--HSSGAEDEMSE 569

