BLASTX nr result

ID: Mentha28_contig00000577 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00000577
         (4080 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254...   548   e-153
ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated fact...   544   e-151
ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated fact...   519   e-144
ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-...   517   e-143
ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated fact...   516   e-143
ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citr...   516   e-143
ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citr...   513   e-142
ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prun...   513   e-142
gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]     512   e-142
ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304...   511   e-142
ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated fact...   510   e-141
gb|EYU45832.1| hypothetical protein MIMGU_mgv1a008530mg [Mimulus...   508   e-141
gb|EYU45831.1| hypothetical protein MIMGU_mgv1a007756mg [Mimulus...   507   e-140
ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phas...   506   e-140
ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated fact...   506   e-140
ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203...   503   e-139
ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family prot...   501   e-138
ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family prot...   501   e-138
ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cuc...   500   e-138
ref|XP_004510311.1| PREDICTED: RNA polymerase II-associated fact...   480   e-132

>ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254885 [Solanum
            lycopersicum]
          Length = 698

 Score =  548 bits (1413), Expect = e-153
 Identities = 276/403 (68%), Positives = 316/403 (78%), Gaps = 1/403 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQNKVLQKTQML SGTKGHGSIS SHM DRR+TPLLS +R ENRLKKPTTFLCKLKFRNE
Sbjct: 250  SQNKVLQKTQMLTSGTKGHGSISASHMADRRTTPLLSGERTENRLKKPTTFLCKLKFRNE 309

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPDP+A+ KLL+L+RDPDR+ KY ITSLEK  KPQLHVE            SVYNPPKG 
Sbjct: 310  LPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQLHVEPDLGIPLDLLDLSVYNPPKGV 369

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TPIK DGIKKKERPTDKGVSWLVKTQYISPLS ES KQSLTE
Sbjct: 370  KIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKGVSWLVKTQYISPLSTESAKQSLTE 429

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE++G R +LENLN R+R+I++I ASFEACKS+P+HA+NR LQP ++ PL PDF
Sbjct: 430  KQAKELRETKGGRNILENLNKRDRQIQEIEASFEACKSRPIHASNRRLQPIKVQPLYPDF 489

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            DRY+D F++A +DSAPT DSE Y KL    RD  E +A+MKS+ +TS+D++K D FLAYM
Sbjct: 490  DRYKDPFVLANYDSAPTADSETYSKLDKTVRDACESQAVMKSFVATSSDADKPDKFLAYM 549

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VP+ +E+ KDIYDESEDISYSWVREYH+DVRGDD +DP TY+V+FGE EARY+PLP KL+
Sbjct: 550  VPAPNELSKDIYDESEDISYSWVREYHWDVRGDDADDPNTYVVAFGEREARYMPLPTKLV 609

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDY 2433
            LRKKRAREGKS EEVE F VP  VTVR+R   A   L EE  Y
Sbjct: 610  LRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIELKEEGGY 652


>ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Solanum
            tuberosum]
          Length = 700

 Score =  544 bits (1402), Expect = e-151
 Identities = 273/403 (67%), Positives = 315/403 (78%), Gaps = 1/403 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQNKVLQKTQML SGTKGHGSIS SHM DRR+ PLLS +R ENRLKKPTTFLCKLKFRNE
Sbjct: 252  SQNKVLQKTQMLTSGTKGHGSISASHMADRRTAPLLSGERTENRLKKPTTFLCKLKFRNE 311

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPDP+A+ KLL+L+RDPDR+ KY ITSLEK  KPQL+VE            SVYNPPKG 
Sbjct: 312  LPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQLYVEPDLGIPLDLLDLSVYNPPKGV 371

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TPIK DGIKKKERPTDKGVSWLVKTQYISPLS ES KQSLTE
Sbjct: 372  KIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKGVSWLVKTQYISPLSTESAKQSLTE 431

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE++G R +LENLN R+R+I++I ASFEACKS+P+HA NR LQP ++ PL PDF
Sbjct: 432  KQAKELRETKGGRNILENLNKRDRQIQEIEASFEACKSRPIHATNRRLQPVKVQPLYPDF 491

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            DRY+D F++A +DSAPT DSE Y KL    RD  E +A+MKS+ +TS+D++K D FLAYM
Sbjct: 492  DRYKDPFVLANYDSAPTADSETYNKLDKTVRDACESQAVMKSFVATSSDADKPDKFLAYM 551

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VP+ +E+ KD+YDE+EDISYSWVREYH+DVRGDD +DP TY+V+FGETEARY+PLP KL+
Sbjct: 552  VPAPNELSKDMYDENEDISYSWVREYHWDVRGDDADDPNTYVVAFGETEARYMPLPTKLV 611

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDY 2433
            LRKKRAREGKS EEVE F VP  VTVR+R   A   L EE  Y
Sbjct: 612  LRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIELKEEGGY 654


>ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Citrus sinensis]
          Length = 576

 Score =  519 bits (1337), Expect = e-144
 Identities = 259/407 (63%), Positives = 313/407 (76%), Gaps = 1/407 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN V+QK+QM+ASG  GHGS++GS MGDRR+ PLLS +RIENRLKKPTTFLCKLKFRNE
Sbjct: 130  SQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKFRNE 189

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LP+PSA+ KL++LK+D DR+ +Y  +SLEKN+KPQLHVE            SVYNPP   
Sbjct: 190  LPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPS-V 248

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TP+K DGIK+KERPTDKGVSWLVKTQYISPLS+ES +QSLTE
Sbjct: 249  RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 308

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R +LENLN RER+IK+I ASFEACK +P+HA N++LQP  ILPLLPDF
Sbjct: 309  KQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDF 368

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            +RY+D F+ ATFD APT DSEIY K+  + RD  E +AIMKSY +T +DS   + FLAYM
Sbjct: 369  ERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYM 428

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VPS++E+ KD+YDE+ED+S+SWVREYH+DVRGDD +DPTTYLVSF + EARY+PLP KL 
Sbjct: 429  VPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLN 488

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            LRKKRA EG+S +EVE F +P S+ VRRR +V    L E+  Y  SK
Sbjct: 489  LRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSK 535


>ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-like isoform X1 [Glycine
            max] gi|571472317|ref|XP_006585570.1| PREDICTED:
            bromodomain-containing protein 4-like isoform X2 [Glycine
            max]
          Length = 666

 Score =  517 bits (1331), Expect = e-143
 Identities = 272/478 (56%), Positives = 329/478 (68%), Gaps = 6/478 (1%)
 Frame = +1

Query: 1030 PSRDSRQP------NLASSSRKEQKPPLSSKRPGPGPGATGXXXXXXXXXXXXXXXXXXX 1191
            PS+  R+        +A    K+Q+PPL  K+   GP   G                   
Sbjct: 154  PSKSGRREYEHSNHGIAHKQHKQQQPPLPVKKMNNGP--PGRAETDEEKRLRKKREFEKQ 211

Query: 1192 XXXXXXXXXXXXSQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKP 1371
                        SQN VLQKT +L+SG KGHG I+GS MG+RRSTPLL  +R+ENRLKKP
Sbjct: 212  RQEEKHRQQLKESQNTVLQKTHLLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRLKKP 270

Query: 1372 TTFLCKLKFRNELPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXX 1551
            TTFLCKLKFRNELPDPSA+ KL+S K+D D+Y KY ITSLEK +KP+L VE         
Sbjct: 271  TTFLCKLKFRNELPDPSAQPKLMSFKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDL 330

Query: 1552 XXXSVYNPPKGXXXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISP 1731
               SVYNPP+                    TPIK DGIK+KERPTDKGV+WLVKTQYISP
Sbjct: 331  LDLSVYNPPR-VRPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVAWLVKTQYISP 389

Query: 1732 LSIESTKQSLTEKQAKELRESRGRILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQ 1911
            LS+ESTKQSLTEKQAKELRE +GR +L+NLNSRER+I++I ASFEA KS PVHA N+ L 
Sbjct: 390  LSMESTKQSLTEKQAKELREMKGRGILDNLNSRERQIREIQASFEAAKSDPVHATNKDLY 449

Query: 1912 PRRILPLLPDFDRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTD 2091
            P  ++PLLPDFDRY+D F+VA FD+APT DSE+Y K++ + RD FE KA+MKSY +T  D
Sbjct: 450  PVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKMNKSVRDAFESKAVMKSYVATGLD 509

Query: 2092 SEKQDGFLAYMVPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETE 2271
                + FLAYM P+  E+ KDIYDE+ED+SYSW+REYH+DVRGDD +DPTT+LV+F E+E
Sbjct: 510  PANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPTTFLVAFDESE 569

Query: 2272 ARYLPLPQKLILRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            ARYLPLP KL+LRKKRA+EG+S +EVEQ  VP  VTVRRR+ VA     +   Y  SK
Sbjct: 570  ARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSK 627


>ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X2
            [Citrus sinensis]
          Length = 570

 Score =  516 bits (1329), Expect = e-143
 Identities = 256/400 (64%), Positives = 310/400 (77%), Gaps = 1/400 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN V+QK+QM+ASG  GHGS++GS MGDRR+ PLLS +RIENRLKKPTTFLCKLKFRNE
Sbjct: 130  SQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKFRNE 189

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LP+PSA+ KL++LK+D DR+ +Y  +SLEKN+KPQLHVE            SVYNPP   
Sbjct: 190  LPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPS-V 248

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TP+K DGIK+KERPTDKGVSWLVKTQYISPLS+ES +QSLTE
Sbjct: 249  RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 308

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R +LENLN RER+IK+I ASFEACK +P+HA N++LQP  ILPLLPDF
Sbjct: 309  KQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDF 368

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            +RY+D F+ ATFD APT DSEIY K+  + RD  E +AIMKSY +T +DS   + FLAYM
Sbjct: 369  ERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYM 428

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VPS++E+ KD+YDE+ED+S+SWVREYH+DVRGDD +DPTTYLVSF + EARY+PLP KL 
Sbjct: 429  VPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLN 488

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEE 2424
            LRKKRA EG+S +EVE F +P S+ VRRR +V    L E+
Sbjct: 489  LRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQ 528


>ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528867|gb|ESR40117.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 677

 Score =  516 bits (1329), Expect = e-143
 Identities = 258/407 (63%), Positives = 311/407 (76%), Gaps = 1/407 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN V+QK+QM+ASG  GHGS+ GS MGDRR+ PLLS +R ENRLKKPTTFLCKLKFRNE
Sbjct: 231  SQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNE 290

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LP+PSA+ KL++LK+D DR+ +Y  +SLEKN+KPQLHVE            SVYNPP   
Sbjct: 291  LPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPS-V 349

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TP+K DGIK+KERPTDKGVSWLVKTQYISPLS+ES +QSLTE
Sbjct: 350  RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 409

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R +LENLN RER+IK+I ASFEACK +P+HA N++LQP  ILPLLPDF
Sbjct: 410  KQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDF 469

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            +RY+D F+ ATFD APT DSEIY K+  + RD  E +AIMKSY +T +DS   + FLAYM
Sbjct: 470  ERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYM 529

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VPS++E+ KD+YDE+ED+S+SWVREYH+DVRGDD +DPTTYLVSF + EARY+PLP KL 
Sbjct: 530  VPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLN 589

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            LRKKRA EG+S +EVE F +P S+ VRRR +V    L E+  Y  SK
Sbjct: 590  LRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSK 636


>ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citrus clementina]
            gi|557528868|gb|ESR40118.1| hypothetical protein
            CICLE_v10025066mg [Citrus clementina]
          Length = 632

 Score =  513 bits (1321), Expect = e-142
 Identities = 255/400 (63%), Positives = 308/400 (77%), Gaps = 1/400 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN V+QK+QM+ASG  GHGS+ GS MGDRR+ PLLS +R ENRLKKPTTFLCKLKFRNE
Sbjct: 231  SQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNE 290

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LP+PSA+ KL++LK+D DR+ +Y  +SLEKN+KPQLHVE            SVYNPP   
Sbjct: 291  LPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPS-V 349

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TP+K DGIK+KERPTDKGVSWLVKTQYISPLS+ES +QSLTE
Sbjct: 350  RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 409

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R +LENLN RER+IK+I ASFEACK +P+HA N++LQP  ILPLLPDF
Sbjct: 410  KQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDF 469

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            +RY+D F+ ATFD APT DSEIY K+  + RD  E +AIMKSY +T +DS   + FLAYM
Sbjct: 470  ERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYM 529

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VPS++E+ KD+YDE+ED+S+SWVREYH+DVRGDD +DPTTYLVSF + EARY+PLP KL 
Sbjct: 530  VPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLN 589

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEE 2424
            LRKKRA EG+S +EVE F +P S+ VRRR +V    L E+
Sbjct: 590  LRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQ 629


>ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prunus persica]
            gi|462413813|gb|EMJ18862.1| hypothetical protein
            PRUPE_ppa002145mg [Prunus persica]
          Length = 709

 Score =  513 bits (1321), Expect = e-142
 Identities = 261/403 (64%), Positives = 307/403 (76%), Gaps = 1/403 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN VLQKTQML+SG KGHGSI+GS MG+RR+TP LS +R ENRLKKPTTF+CKLKFRNE
Sbjct: 266  SQNSVLQKTQMLSSG-KGHGSIAGSRMGERRATPFLSGERTENRLKKPTTFVCKLKFRNE 324

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPDPSA+ KL+SLK+D D+Y KY ITSLEK +KP+L VE            SVYNPP   
Sbjct: 325  LPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYKPKLFVEPDLGIPLDLLDLSVYNPPS-V 383

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TP+K +GI++KERPTDKGV+WLVKTQYISPLS++S +QSLTE
Sbjct: 384  RPPLALEDEELLRDDVAATPVKNNGIRRKERPTDKGVAWLVKTQYISPLSMDSARQSLTE 443

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R +L+NLN RER+IKDI ASFEACKS+PVHA N++L P  ILPLLPDF
Sbjct: 444  KQAKELREMKGGRNILDNLNDRERQIKDIEASFEACKSRPVHATNKNLYPVEILPLLPDF 503

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            +RYED F++A FD APT DSEIY KL  +  D +E +AIMKSY  T  D    + FLAYM
Sbjct: 504  ERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAYESRAIMKSYKVTGADPANPEKFLAYM 563

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VPS +E+ KD YDESED+SYSWVREYHYDVRGDDV DPTTYLVSF E EARY PLP KL+
Sbjct: 564  VPSPNELSKDPYDESEDVSYSWVREYHYDVRGDDVHDPTTYLVSFDEEEARYAPLPTKLV 623

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDY 2433
            LRKKR++EGK+++EVE F  P  VTVR+R+ VA   L +  DY
Sbjct: 624  LRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAAIELKDSGDY 666


>gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]
          Length = 697

 Score =  512 bits (1319), Expect = e-142
 Identities = 261/403 (64%), Positives = 308/403 (76%), Gaps = 1/403 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQ+  LQKTQ+L S  KGHGSI+GS MG+RR+T  LS +RIENRLKKPTTFLCKLKFRNE
Sbjct: 252  SQHSALQKTQIL-SAAKGHGSIAGSRMGERRATSFLSGERIENRLKKPTTFLCKLKFRNE 310

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPDPSA+ KL+S+KR+ D+Y KY ITSLEK +KP+L VE            SVYNPP   
Sbjct: 311  LPDPSAQPKLMSMKREKDQYSKYTITSLEKTYKPKLFVEPDLGIPLNLLDLSVYNPPS-V 369

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TP+K DGIK+KERPTDKGV+WLVKTQYISPLS+ESTKQSLTE
Sbjct: 370  RPPLDPEDEELLRDDEAVTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSMESTKQSLTE 429

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R +LENLN R+R+IK+I ASFEACKS+PVHA N+ L P  +LPLLPDF
Sbjct: 430  KQAKELRELKGGRNILENLNDRDRQIKEIQASFEACKSRPVHATNKSLYPVEVLPLLPDF 489

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            DRY+D F++A FDSAPT DSE+Y K+  + RD  E +A++KSY  T +D    + FLAYM
Sbjct: 490  DRYDDQFVLAAFDSAPTADSEVYSKMDQSIRDAHESQAVLKSYKVTGSDPGNPEKFLAYM 549

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VPS DE+ KDIYDE ED+SYSWVREYH+DVRGDD +DPTTYLVSF ETEARYLPLP KL+
Sbjct: 550  VPSPDELSKDIYDEHEDVSYSWVREYHWDVRGDDADDPTTYLVSFDETEARYLPLPTKLV 609

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDY 2433
            LRKKRA+EG+S +EVE F VP  VTVRRR  V++  L + E Y
Sbjct: 610  LRKKRAKEGRSGDEVEHFPVPARVTVRRRPTVSVVELKDAEVY 652


>ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304396 [Fragaria vesca
            subsp. vesca]
          Length = 693

 Score =  511 bits (1317), Expect = e-142
 Identities = 261/403 (64%), Positives = 305/403 (75%), Gaps = 1/403 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN VLQKT +++SG KGHGSI+GS MG+RR+TP LS +R ENRLKKPTTF+CKLKFRNE
Sbjct: 249  SQNSVLQKTHLMSSG-KGHGSIAGSRMGERRTTPFLSGERAENRLKKPTTFVCKLKFRNE 307

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPDPSA+ KL+S+K+DPD+Y KY ITSLEKN+KP+L VE            SVYNPP G 
Sbjct: 308  LPDPSAQPKLMSMKKDPDQYTKYTITSLEKNYKPKLFVEPDLGIPLDLLDLSVYNPPPGP 367

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TP+K DGI++KERPTDKGV+WLVKTQYISPLS++S KQSLTE
Sbjct: 368  RPPLAPEDEELLRDDVAVTPVKKDGIRRKERPTDKGVAWLVKTQYISPLSMDSAKQSLTE 427

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R LL+NLN RER+IK+I ASFEACKS+PVHA N++L P  +LPLLP  
Sbjct: 428  KQAKELREMKGGRNLLDNLNDRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLLPXH 487

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            +RYED F++A FD APT DSEIY KL  +D D  E +AIMKSY  T  D    D FLAYM
Sbjct: 488  NRYEDQFVLAGFDGAPTADSEIYSKLDQSDHDLCESRAIMKSYKVTGADPANPDKFLAYM 547

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VPS +E+ KD YDESEDISYSWVREY YDVRGDDV+D TTYLVSF E  ARY PLP KL+
Sbjct: 548  VPSPNELSKDPYDESEDISYSWVREYQYDVRGDDVDDLTTYLVSFDEDAARYAPLPAKLV 607

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDY 2433
            LRKKRA+EG+ST+EVE F  P  VTVRRR+ V+   L +  DY
Sbjct: 608  LRKKRAKEGRSTDEVEHFPAPSRVTVRRRSTVSAIELKDAGDY 650


>ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated factor 1 homolog [Vitis
            vinifera]
          Length = 589

 Score =  510 bits (1314), Expect = e-141
 Identities = 275/461 (59%), Positives = 321/461 (69%), Gaps = 2/461 (0%)
 Frame = +1

Query: 1069 SRKEQKPPLSSKRPGPGPGATGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSQNKVLQ 1248
            S K+QKPP+         G  G                               SQN VLQ
Sbjct: 91   SGKQQKPPVPPAPVKKSNGPPGRVETEEERRLRKKREFEKQRQEEKQKHQLKESQNTVLQ 150

Query: 1249 KTQMLASGTKGHGSI-SGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNELPDPSA 1425
            KTQML+SG KGHGS+  GS MG+RR+TP LS DRIENRL+KPTTFLCKLKFRNELPDP+A
Sbjct: 151  KTQMLSSG-KGHGSVVGGSRMGERRTTPFLSGDRIENRLRKPTTFLCKLKFRNELPDPTA 209

Query: 1426 KMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGXXXXXXX 1605
            + KL++LK D DR+ KY ITSLEK  KPQL VE            SVYNPP         
Sbjct: 210  QPKLMALKTDKDRFTKYTITSLEKMHKPQLFVEPDLGIPLDLLDLSVYNPPS-VRRPLDP 268

Query: 1606 XXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTEKQAKEL 1785
                        TP+K +GIKKKERPTDKGVSWLVKTQYISPLS ESTKQSLTEKQAKEL
Sbjct: 269  EDEELLRDDESVTPVKKEGIKKKERPTDKGVSWLVKTQYISPLSTESTKQSLTEKQAKEL 328

Query: 1786 RESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDFDRYEDN 1962
            RE++G R +LEN NSRERKI++I A+F A K  PVH+ N+ L+P  ILPLLPDF RY+D+
Sbjct: 329  RETKGGRNILENFNSRERKIQNIEAAFAASKITPVHSTNKSLKPVEILPLLPDFARYDDS 388

Query: 1963 FLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYMVPSIDE 2142
            F+VA+FDSAPT DSEIY KL    RD  E +AI+KSY +T +D  K + FLAYM PS DE
Sbjct: 389  FVVASFDSAPTADSEIYSKLDKTVRDSHESQAILKSYMATGSDPSKPEKFLAYMAPSPDE 448

Query: 2143 IEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLILRKKRA 2322
            + KDIYDE+ED SYSWVREYH+DVRGDD +DPTTYLVSF +T+ARYLPLP KL+LRKKRA
Sbjct: 449  LSKDIYDENEDTSYSWVREYHWDVRGDDADDPTTYLVSFNKTDARYLPLPTKLLLRKKRA 508

Query: 2323 REGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            +EG+S++EVE F VP  VTVR+R +VA   L +EE Y  SK
Sbjct: 509  KEGRSSDEVEHFPVPSKVTVRQRPNVAAIELKDEEVYSSSK 549


>gb|EYU45832.1| hypothetical protein MIMGU_mgv1a008530mg [Mimulus guttatus]
          Length = 370

 Score =  508 bits (1308), Expect = e-141
 Identities = 260/363 (71%), Positives = 291/363 (80%)
 Frame = -3

Query: 4075 KEFLPREYQDALTLSEGRVTGIFSPIGAAFQQVYVALEGFEERLGLDPNDPLIPVVLVFG 3896
            K  LPRE+QD+L  SEG    +  P+G AFQQVY+ L GFEE LGLDPNDPLIP VL  G
Sbjct: 14   KVILPREFQDSLASSEGHFGDVLRPVGTAFQQVYIVLVGFEESLGLDPNDPLIPFVLFVG 73

Query: 3895 VSAILWGSYRLFKYSGYAGDISPESAMELLRGNDSAVLIDIRPENLRDRDGIPDLRRRAR 3716
            VSA LWGSYR+ KYSGY+GD+SP+S MELLRGN+S VLIDIRPENLR +DGIPDLRR AR
Sbjct: 74   VSATLWGSYRVLKYSGYSGDLSPQSTMELLRGNESVVLIDIRPENLRGKDGIPDLRRSAR 133

Query: 3715 PRYASIALPEVDSSIKKLLKGGRDVEDSLLAVVIRDLKIVEDRSKVLVMDADGTRSKSVA 3536
             RYAS+ LPEVD S+KKLLKGGRD+EDSLLA VIRDLKIVEDRSKVLVMDADGTRSK VA
Sbjct: 134  SRYASVTLPEVDGSVKKLLKGGRDIEDSLLATVIRDLKIVEDRSKVLVMDADGTRSKGVA 193

Query: 3535 RSLKKLGAKRPYQVLGGFQSWVNEGCRVKELKPETTFTXXXXXXXXXXXXIKPTPLKVIX 3356
            RSL+KLG KRPYQV GGF+SW+ EG RVKELKPETT T            IKPTPLKV+ 
Sbjct: 194  RSLRKLGTKRPYQVEGGFRSWLKEGMRVKELKPETTLTILNEEAEAILEEIKPTPLKVVG 253

Query: 3355 XXXXXXXXGYSLLEWETTLQFIAVIGIGQTIFRRIASYQGAQDFQQDLRFLLAPVSLGGQ 3176
                     YSLL+WE TLQFI VIG+GQTIFRR+ASYQGA DF QD+R LLAPV LGG+
Sbjct: 254  FGVGVLAAAYSLLDWERTLQFIGVIGLGQTIFRRVASYQGADDFNQDVRVLLAPVKLGGE 313

Query: 3175 AISWAAGKLETNRNGLPTSPSSSDVQSRVLQAAAKLESQPAESTETQDLPQVSGNEEVNI 2996
            AISWAAGKLETNRNGLPT+PSS DVQSRVLQAAAK ESQP+++++       +  +EVNI
Sbjct: 314  AISWAAGKLETNRNGLPTAPSSVDVQSRVLQAAAKHESQPSDASDQ------TQQDEVNI 367

Query: 2995 SEA 2987
            SEA
Sbjct: 368  SEA 370


>gb|EYU45831.1| hypothetical protein MIMGU_mgv1a007756mg [Mimulus guttatus]
          Length = 396

 Score =  507 bits (1306), Expect = e-140
 Identities = 250/348 (71%), Positives = 285/348 (81%)
 Frame = +1

Query: 1387 KLKFRNELPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSV 1566
            K +FRNELPDPSAK KLL +KRDPDRYCKYQITSLEKNWKPQL+VE            SV
Sbjct: 9    KFRFRNELPDPSAKAKLLVMKRDPDRYCKYQITSLEKNWKPQLYVEPDLGIPLDLLDLSV 68

Query: 1567 YNPPKGXXXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIES 1746
            YNPPKG                   TPIKTDGIK KERPTDKGVSWLVKTQYISPLS++S
Sbjct: 69   YNPPKGERIPLDPEDEELLRDDDPITPIKTDGIKAKERPTDKGVSWLVKTQYISPLSMDS 128

Query: 1747 TKQSLTEKQAKELRESRGRILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRIL 1926
             K SLTEKQAKELRESRGR LLE LNSRERKI+DI ASFEA KSKPVHA NR L+P+R+L
Sbjct: 129  AKHSLTEKQAKELRESRGRNLLEKLNSRERKIQDITASFEASKSKPVHAVNRQLEPKRVL 188

Query: 1927 PLLPDFDRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQD 2106
            PL PDFDR++D F+VA FD+APT DSE+YRKL+A DRDE+E KAIM+SY S+S+D  K D
Sbjct: 189  PLFPDFDRFDDQFVVANFDNAPTADSEVYRKLNAVDRDEYEHKAIMRSYGSSSSDPNKSD 248

Query: 2107 GFLAYMVPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLP 2286
             FLAYMVPS+DEIEKDIYDE+ED+SYSWVREY++D+R D+V+DPTTYLVSF E++A+YLP
Sbjct: 249  KFLAYMVPSVDEIEKDIYDENEDVSYSWVREYNWDMRSDNVDDPTTYLVSFDESKAKYLP 308

Query: 2287 LPQKLILRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEED 2430
            LP KLILRKKRAR+GKS +EVEQF VPRSVTVRRRT V++  L +EED
Sbjct: 309  LPTKLILRKKRARDGKSGDEVEQFPVPRSVTVRRRTSVSVVELRDEED 356


>ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phaseolus vulgaris]
            gi|561008678|gb|ESW07627.1| hypothetical protein
            PHAVU_010G145300g [Phaseolus vulgaris]
          Length = 661

 Score =  506 bits (1303), Expect = e-140
 Identities = 255/393 (64%), Positives = 306/393 (77%), Gaps = 1/393 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN VLQKT +L+SG KGHG ++GS MG+RRSTPLLS +R+ENRLKKPTTFLCKLKFRNE
Sbjct: 218  SQNTVLQKTHLLSSG-KGHGLVAGSRMGERRSTPLLSAERVENRLKKPTTFLCKLKFRNE 276

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPDPSA+ KL++ K+D D+Y KY ITSLEK +KP+L VE            SVYNPP   
Sbjct: 277  LPDPSAQPKLMAFKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDLLDLSVYNPPS-V 335

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TPIK DGIK+KERPTDKGV+WLVKTQYISPLS+ESTKQSLTE
Sbjct: 336  RPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVAWLVKTQYISPLSMESTKQSLTE 395

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R +L+NLNSRER+I++I ASFEA KS PVHA N+ L P  ++PLLPDF
Sbjct: 396  KQAKELREMKGGRGVLDNLNSRERQIREIEASFEAAKSDPVHATNKDLYPVEVMPLLPDF 455

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            DRY+D F+VA FD+APT DSE+Y KL  + RD FE KA+MKSY +TS+D    + FLAYM
Sbjct: 456  DRYDDQFVVAAFDNAPTADSEMYAKLDKSVRDAFESKAVMKSYVATSSDPANPEKFLAYM 515

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
             P+  E+ KDIYDE+ED+SYSW+REYH+DVRGDD +DPTT+ V+F ++EARYLPLP KL+
Sbjct: 516  APAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPTTFFVAFDDSEARYLPLPTKLV 575

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVA 2403
            LRKKRA+EG+S EE+EQ  VP  VTVRRR+ VA
Sbjct: 576  LRKKRAKEGRSGEEIEQCPVPSRVTVRRRSSVA 608


>ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1
            [Glycine max] gi|571464391|ref|XP_006583049.1| PREDICTED:
            RNA polymerase II-associated factor 1 homolog isoform X2
            [Glycine max]
          Length = 659

 Score =  506 bits (1303), Expect = e-140
 Identities = 259/407 (63%), Positives = 310/407 (76%), Gaps = 1/407 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN VLQKT ML+SG KGHG I+GS MG+RRSTPLL  +R+ENRLKKPTTFLCKLKFRNE
Sbjct: 216  SQNTVLQKTHMLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRLKKPTTFLCKLKFRNE 274

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPDPSA+ KL++ K+D D+Y KY ITSLEK +KP+L VE            SVYNPP   
Sbjct: 275  LPDPSAQPKLMASKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDLLDLSVYNPPS-V 333

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTE 1767
                              TPIK DGIK+KERPTDKGV+WLVKTQYISPLS+ESTKQSLTE
Sbjct: 334  RPPLAPEDKELLRDDEAVTPIKKDGIKRKERPTDKGVAWLVKTQYISPLSMESTKQSLTE 393

Query: 1768 KQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDF 1944
            KQAKELRE +G R +L+NLNSRER+I++I ASFEA KS PVHA N+ L P  ++PLLPDF
Sbjct: 394  KQAKELREMKGGRGILDNLNSRERQIREIEASFEAAKSDPVHATNKDLYPVEVMPLLPDF 453

Query: 1945 DRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYM 2124
            DRY+D F+VA FD+APT DSE++ K+  + RD FE KA+MKSY +TS+D    + FLAYM
Sbjct: 454  DRYDDQFVVAAFDNAPTADSEMHAKMDKSVRDAFESKAVMKSYVATSSDPANPEKFLAYM 513

Query: 2125 VPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLI 2304
            VP+  E+ KDIYDE+ED+SYSW+REYH+DVRGDD +DP T+LV+F E+EARYLPLP KL+
Sbjct: 514  VPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPATFLVAFDESEARYLPLPTKLV 573

Query: 2305 LRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            LRKKRA+EG+S +EVEQ  VP  VTVRRR+ VA     +   Y  SK
Sbjct: 574  LRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSK 620


>ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  503 bits (1295), Expect = e-139
 Identities = 260/408 (63%), Positives = 310/408 (75%), Gaps = 2/408 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN +LQKTQML++G K HGSI GS MG+R++TP LS +RIENRLKKPTTFLCKLKFRNE
Sbjct: 268  SQNTILQKTQMLSTG-KVHGSIVGSRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNE 326

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPD SA+ KL+SL+++ D Y +Y ITSLEK +KPQL+VE            SVYNP    
Sbjct: 327  LPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP-SSV 385

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDG-IKKKERPTDKGVSWLVKTQYISPLSIESTKQSLT 1764
                              TP+K DG IK+KERPTDKGV+WLVKTQYISPLSIES KQSLT
Sbjct: 386  RMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLT 445

Query: 1765 EKQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPD 1941
            EKQAKELRE +G R +LENLN+RER+IK+I ASFEACKS+P+HA N++L P  +LPLLPD
Sbjct: 446  EKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPIHATNKNLYPVEVLPLLPD 505

Query: 1942 FDRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAY 2121
            FDRY+D F+V  FDSAPT DSE + KL  + RD  E +AIMKSY +TS+D  K + FLAY
Sbjct: 506  FDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATSSDPSKPEKFLAY 565

Query: 2122 MVPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKL 2301
            MVPS DE+ KDIYDE ED+SYSWVREYH+DVRGD+V+DPTTYLVSF + EARY+PLP KL
Sbjct: 566  MVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKL 625

Query: 2302 ILRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            +LRKKRA+EG+S++EVE F  P  VTVRRR  VA   + +   Y  SK
Sbjct: 626  VLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSK 673


>ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508779674|gb|EOY26930.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 562

 Score =  501 bits (1290), Expect = e-138
 Identities = 258/406 (63%), Positives = 306/406 (75%), Gaps = 1/406 (0%)
 Frame = +1

Query: 1231 QNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNEL 1410
            Q K  QKTQM+ SG KGHGS+ GS MGDRR+TP LS +RIENRLKKPTTFLCKLKFRNEL
Sbjct: 118  QMKESQKTQMMPSG-KGHGSMVGSRMGDRRATPFLSGERIENRLKKPTTFLCKLKFRNEL 176

Query: 1411 PDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGXX 1590
            PDPSA+ KL++LK+D DR+ KY ITSLEK +KP+L VE            SVYNPP    
Sbjct: 177  PDPSAQPKLMALKKDKDRFTKYTITSLEKMYKPKLFVEPDLGIPLDLLDLSVYNPPS-VR 235

Query: 1591 XXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTEK 1770
                             TPIK DGI++KERPTDKGVSWLVKTQYISPLS+ESTKQSLTEK
Sbjct: 236  PSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGVSWLVKTQYISPLSMESTKQSLTEK 295

Query: 1771 QAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDFD 1947
            QAKELRE +G R +LENLN+RER+IK+I ASFEA K +PVHA N++L+P  ++PLLPDFD
Sbjct: 296  QAKELRELKGGRNILENLNNRERQIKEIEASFEASKLRPVHATNKNLEPVEVMPLLPDFD 355

Query: 1948 RYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYMV 2127
            RY D F++  FD APT DSEI+ KL  + RDE E +AIMKSY + S+D    + FLAYMV
Sbjct: 356  RYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESRAIMKSYLAASSDPANPEKFLAYMV 415

Query: 2128 PSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLIL 2307
            PS+DE+ K +YDE ED+SYSWVREY++DVRGDD  DPTTYLVSF E EARY+PLP KL L
Sbjct: 416  PSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDANDPTTYLVSFDEGEARYVPLPTKLNL 475

Query: 2308 RKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            RKKRAREG++ +E+E F +P  +TVRRR+ VA   L E E Y  S+
Sbjct: 476  RKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIELKEPEVYTSSR 521


>ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508779673|gb|EOY26929.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 685

 Score =  501 bits (1290), Expect = e-138
 Identities = 258/406 (63%), Positives = 306/406 (75%), Gaps = 1/406 (0%)
 Frame = +1

Query: 1231 QNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNEL 1410
            Q K  QKTQM+ SG KGHGS+ GS MGDRR+TP LS +RIENRLKKPTTFLCKLKFRNEL
Sbjct: 241  QMKESQKTQMMPSG-KGHGSMVGSRMGDRRATPFLSGERIENRLKKPTTFLCKLKFRNEL 299

Query: 1411 PDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGXX 1590
            PDPSA+ KL++LK+D DR+ KY ITSLEK +KP+L VE            SVYNPP    
Sbjct: 300  PDPSAQPKLMALKKDKDRFTKYTITSLEKMYKPKLFVEPDLGIPLDLLDLSVYNPPS-VR 358

Query: 1591 XXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSLTEK 1770
                             TPIK DGI++KERPTDKGVSWLVKTQYISPLS+ESTKQSLTEK
Sbjct: 359  PSLAPEDAELLHDDEAVTPIKKDGIRRKERPTDKGVSWLVKTQYISPLSMESTKQSLTEK 418

Query: 1771 QAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPDFD 1947
            QAKELRE +G R +LENLN+RER+IK+I ASFEA K +PVHA N++L+P  ++PLLPDFD
Sbjct: 419  QAKELRELKGGRNILENLNNRERQIKEIEASFEASKLRPVHATNKNLEPVEVMPLLPDFD 478

Query: 1948 RYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAYMV 2127
            RY D F++  FD APT DSEI+ KL  + RDE E +AIMKSY + S+D    + FLAYMV
Sbjct: 479  RYNDQFVMVAFDGAPTADSEIFSKLDDSVRDEHESRAIMKSYLAASSDPANPEKFLAYMV 538

Query: 2128 PSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKLIL 2307
            PS+DE+ K +YDE ED+SYSWVREY++DVRGDD  DPTTYLVSF E EARY+PLP KL L
Sbjct: 539  PSLDELSKGMYDEHEDVSYSWVREYNWDVRGDDANDPTTYLVSFDEGEARYVPLPTKLNL 598

Query: 2308 RKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            RKKRAREG++ +E+E F +P  +TVRRR+ VA   L E E Y  S+
Sbjct: 599  RKKRAREGRTGDEIEHFPIPARITVRRRSTVAAIELKEPEVYTSSR 644


>ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cucumis sativus]
          Length = 706

 Score =  500 bits (1287), Expect = e-138
 Identities = 258/408 (63%), Positives = 308/408 (75%), Gaps = 2/408 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKGHGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFRNE 1407
            SQN +LQKTQML++G K HGSI GS MG+R++TP LS +RIENRLKKPTTFLCKLKFRNE
Sbjct: 268  SQNTILQKTQMLSTG-KVHGSIVGSRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNE 326

Query: 1408 LPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPKGX 1587
            LPD SA+ KL+SL+++ D Y +Y ITSLEK +KPQL+VE            SVYNP    
Sbjct: 327  LPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP-SSV 385

Query: 1588 XXXXXXXXXXXXXXXXXXTPIKTDG-IKKKERPTDKGVSWLVKTQYISPLSIESTKQSLT 1764
                              TP+K DG IK+KERPTDKGV+WLVKTQYISPLSIES KQSLT
Sbjct: 386  RMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLT 445

Query: 1765 EKQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLPD 1941
            EKQAKELRE +G R +LENLN+RER+IK+I  SFEACKS+P+HA N++L P  +LPLLPD
Sbjct: 446  EKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATNKNLYPVEVLPLLPD 505

Query: 1942 FDRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLAY 2121
            FDRY+D F+V  FDSAPT DSE + KL  + RD  E +AIMKSY +T +D  K + FLAY
Sbjct: 506  FDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAY 565

Query: 2122 MVPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQKL 2301
            MVPS DE+ KDIYDE ED+SYSWVREYH+DVRGD+V+DPTTYLVSF + EARY+PLP KL
Sbjct: 566  MVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKL 625

Query: 2302 ILRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            +LRKKRA+EG+S++EVE F  P  VTVRRR  VA   + +   Y  SK
Sbjct: 626  VLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSK 673


>ref|XP_004510311.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Cicer
            arietinum]
          Length = 657

 Score =  480 bits (1236), Expect = e-132
 Identities = 246/409 (60%), Positives = 301/409 (73%), Gaps = 3/409 (0%)
 Frame = +1

Query: 1228 SQNKVLQKTQMLASGTKG--HGSISGSHMGDRRSTPLLSNDRIENRLKKPTTFLCKLKFR 1401
            SQN VLQKTQM++SG  G  HGSI+GS MG+RR+ PLLS++R+ENRLKKPTTFLCKL+FR
Sbjct: 214  SQNTVLQKTQMVSSGGTGKVHGSIAGSRMGERRNAPLLSSERVENRLKKPTTFLCKLRFR 273

Query: 1402 NELPDPSAKMKLLSLKRDPDRYCKYQITSLEKNWKPQLHVEXXXXXXXXXXXXSVYNPPK 1581
            NELPDP+A+ KL++ K+D D+Y KY ITSLEK +KP+L VE            SVYNPP 
Sbjct: 274  NELPDPTAQPKLMAFKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDLLDLSVYNPPS 333

Query: 1582 GXXXXXXXXXXXXXXXXXXXTPIKTDGIKKKERPTDKGVSWLVKTQYISPLSIESTKQSL 1761
                                TP+K DGIK+KERPTDKGV+WLVKTQYISPLS+ESTKQSL
Sbjct: 334  -VRPPLAPEDEDLLRDDEAVTPMKKDGIKRKERPTDKGVAWLVKTQYISPLSMESTKQSL 392

Query: 1762 TEKQAKELRESRG-RILLENLNSRERKIKDIVASFEACKSKPVHAANRHLQPRRILPLLP 1938
            TEKQAKELRE +G R LLENLN+R  K       FEA KS+ VHA  + L P   +P LP
Sbjct: 393  TEKQAKELRERKGGRNLLENLNNRYGKXXXXXXXFEAAKSQAVHATKKDLYPVEFMPFLP 452

Query: 1939 DFDRYEDNFLVATFDSAPTVDSEIYRKLSAADRDEFEQKAIMKSYASTSTDSEKQDGFLA 2118
            DFDRY+D F+VA FD+APT+DSE++ KL  + RD  E +A+MKSY +TS+D    + FLA
Sbjct: 453  DFDRYDDQFVVAAFDNAPTIDSEMFSKLGKSVRDISESRAVMKSYVATSSDPANPEKFLA 512

Query: 2119 YMVPSIDEIEKDIYDESEDISYSWVREYHYDVRGDDVEDPTTYLVSFGETEARYLPLPQK 2298
            YM P+  E+ KDIYDE+E+++YSWVREYH+DVRGDD  DPTT++VSF E+EARYLPLP K
Sbjct: 513  YMAPAPGELSKDIYDENEEVTYSWVREYHWDVRGDDAHDPTTFVVSFDESEARYLPLPTK 572

Query: 2299 LILRKKRAREGKSTEEVEQFAVPRSVTVRRRTDVAINVLTEEEDYVPSK 2445
            L+LRKKRA+EG+S +EVEQF +P  VTVRRR+ VA     + E Y   K
Sbjct: 573  LVLRKKRAKEGRSGDEVEQFPIPARVTVRRRSSVAAIERKDSEVYTSLK 621

