BLASTX nr result
ID: Akebia25_contig00003040
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00003040 (1943 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score  E
Sequences producing significant alignments:                           (bits) Value

ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prun...  597  e-168
gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis]  593  e-167
ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cuc...  571  e-160
ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203...  569  e-159
ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family prot...  567  e-159
ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family prot...  567  e-159
ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated fact...  566  e-159
ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phas...  565  e-158
ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated fact...  564  e-158
ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304...  563  e-157
ref|XP_007225143.1| hypothetical protein PRUPE_ppa002485mg [Prun...  553  e-154
ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-...  552  e-154
ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated fact...  547  e-153
ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citr...  547  e-153
ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated fact...  546  e-152
ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated fact...  544  e-152
ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254...  537  e-150
ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citr...  525  e-146
ref|XP_007024309.1| Hydroxyproline-rich glycoprotein family prot...
523 e-145 ref|XP_002303312.2| hydroxyproline-rich glycoprotein [Populus tr... 521 e-145 >ref|XP_007217663.1| hypothetical protein PRUPE_ppa002145mg [Prunus persica] gi|462413813|gb|EMJ18862.1| hypothetical protein PRUPE_ppa002145mg [Prunus persica] Length = 709 Score = 597 bits (1539), Expect = e-168 Identities = 318/533 (59%), Positives = 369/533 (69%), Gaps = 4/533 (0%) Frame = +2 Query: 338 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXX 514 +R SH + PRD + SG RE GH HG+P KQ VP MP A P + Sbjct: 185 DRGSHEKGAPRDVSVSGRREHGHLNHGVPQKQHKPPVPSMPVKKANGPPGRVETEEERRL 244 Query: 515 XXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPF 694 LK+SQN+VL T ++SSG KGHG I GSR+GE++ATPF Sbjct: 245 RKKREFEKQRQEEKHRQQ----LKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRATPF 299 Query: 695 LSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPK 874 LSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+SL K+QYTKYTITSLEK +KPK Sbjct: 300 LSGERTENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYKPK 359 Query: 875 LFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKG 1054 LFVEPD+G+PLDLLD+SVYN + PP TP+K +GIRRKERPTDKG Sbjct: 360 LFVEPDLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDVAATPVKNNGIRRKERPTDKG 419 Query: 1055 VSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEAC 1234 V+WLVKTQYISPLS D+A+ SLTEKQAKELRE +GG ASFEAC Sbjct: 420 VAWLVKTQYISPLSMDSARQSLTEKQAKELREMKGGRNILDNLNDRERQIKDIEASFEAC 479 Query: 1235 KSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHES 1414 KSRPVHAT+ L PVEILPLLPDF+RY+DQFVLAAFDG PTADSEIYSKLD+S D +ES Sbjct: 480 KSRPVHATNKNLYPVEILPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAYES 539 Query: 1415 HAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDAD 1594 AIMKS+ G++ A PEKFLAYMVPSP+EL KD YDE ED+ Y+WVREY +DVRGDD Sbjct: 540 RAIMKSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDDVH 599 Query: 1595 DPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXE 1774 DPTTYLV+FDEEEARY PLPTKL+LRKKR KEG++++EVE + E Sbjct: 600 DPTTYLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAAIE 659 
Query: 1775 LKESGDYVS---SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 LK+SGDY SN K R +ED LE P K+AR QD+D YSGAEDD+SD Sbjct: 660 LKDSGDYSRGSVSNLKTRRFDVEDTLERP---RKIARHQDIDEYSGAEDDLSD 709 >gb|EXB74581.1| hypothetical protein L484_026278 [Morus notabilis] Length = 697 Score = 593 bits (1529), Expect = e-167 Identities = 314/544 (57%), Positives = 373/544 (68%), Gaps = 6/544 (1%) Frame = +2 Query: 308 NQAKESRFHDNRRSHNREGPRDANGSGWRESGHSKH-GLPPKQKGSAVPMPSANAPRVPN 484 +Q KE+ H + +R ++ GSG RE G+S H G KQ P+PS + Sbjct: 160 SQGKENVHHRGLQERDRGVSKEVAGSGRREHGYSNHHGTHHKQH--KYPVPSVPVKKSNG 217 Query: 485 AIAXXXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGS 664 + HL KESQ++ L T I+S+ KGHG I GS Sbjct: 218 PMGRVETEEERRLRKKREFEKQKQEEKHRQHL-KESQHSALQKTQILSAA-KGHGSIAGS 275 Query: 665 RVGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTI 844 R+GE++AT FLSGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL+S+ K+QY+KYTI Sbjct: 276 RMGERRATSFLSGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMSMKREKDQYSKYTI 335 Query: 845 TSLEKMHKPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGI 1024 TSLEK +KPKLFVEPD+G+PL+LLD+SVYN + PP TP+K+DGI Sbjct: 336 TSLEKTYKPKLFVEPDLGIPLNLLDLSVYNPPSVRPPLDPEDEELLRDDEAVTPVKKDGI 395 Query: 1025 RRKERPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXX 1204 +RKERPTDKGV+WLVKTQYISPLS ++ K SLTEKQAKELRE +GG Sbjct: 396 KRKERPTDKGVAWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNDRDRQI 455 Query: 1205 XXXXASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 1384 ASFEACKSRPVHAT+ L PVE+LPLLPDFDRYDDQFVLAAFD PTADSE+YSK+ Sbjct: 456 KEIQASFEACKSRPVHATNKSLYPVEVLPLLPDFDRYDDQFVLAAFDSAPTADSEVYSKM 515 Query: 1385 DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 1564 D+S+RD HES A++KS+ GS+ PEKFLAYMVPSPDEL KD+YDEHED+ Y+WVREY Sbjct: 516 DQSIRDAHESQAVLKSYKVTGSDPGNPEKFLAYMVPSPDELSKDIYDEHEDVSYSWVREY 575 Query: 1565 QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 1744 WDVRGDDADDPTTYLV+FDE EARYLPLPTKL+LRKKR KEGRS +EVE + Sbjct: 576 HWDVRGDDADDPTTYLVSFDETEARYLPLPTKLVLRKKRAKEGRSGDEVEHFPVPARVTV 
635 Query: 1745 XXXXXXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAE 1909 ELK++ Y + SN KRG S +EDGLE HKVAR +D+D YSGAE Sbjct: 636 RRRPTVSVVELKDAEVYSNPRGSLSNFKRGGSDVEDGLER---SHKVARQEDVDEYSGAE 692 Query: 1910 DDMS 1921 DD+S Sbjct: 693 DDLS 696 >ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cucumis sativus] Length = 706 Score = 571 bits (1471), Expect = e-160 Identities = 309/534 (57%), Positives = 368/534 (68%), Gaps = 5/534 (0%) Frame = +2 Query: 338 NRRSHNRE--GPRDAN-GSGWRE-SGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXX 505 N +H R+ P+D + G RE S H KH QK S PMP P+ N + Sbjct: 189 NMGAHERDKGAPKDPSYGRRDRENSNHDKH-----QKHSGPPMP----PKKANGPSGRME 239 Query: 506 XXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKA 685 H LKESQNT+L T ++S+G K HG IVGSR+GE+KA Sbjct: 240 TDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKA 298 Query: 686 TPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMH 865 TPFLSGERIENRLKKPTTFLCKLKFRNELPD +AQPKL+SL K+ YT+YTITSLEK + Sbjct: 299 TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTY 358 Query: 866 KPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDG-IRRKERP 1042 KP+L+VEPD+G+PLDLLD+SVYN ++ P TP+K+DG I+RKERP Sbjct: 359 KPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERP 418 Query: 1043 TDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXAS 1222 TDKGV+WLVKTQYISPLS ++AK SLTEKQAKELRE +GG S Sbjct: 419 TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETS 478 Query: 1223 FEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRD 1402 FEACKSRP+HAT+ L PVE+LPLLPDFDRYDD FV+ AFD PTADSE ++KLD+S+RD Sbjct: 479 FEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD 538 Query: 1403 DHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRG 1582 HES AIMKS++ GS+ +KPEKFLAYMVPSPDEL KD+YDE ED+ Y+WVREY WDVRG Sbjct: 539 AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG 598 Query: 1583 DDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXX 1762 D+ DDPTTYLV+FD+ EARY+PLPTKL+LRKKR 
KEGRS++EVE + Sbjct: 599 DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTV 658 Query: 1763 XXXELKESGDYVSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 E+K+ G Y SNSKRG S IEDG+ HK R QDMD +SGAED+MSD Sbjct: 659 ATLEVKDPGIY--SNSKRG-SDIEDGIGR---SHKHDRNQDMDQFSGAEDEMSD 706 >ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203806 [Cucumis sativus] Length = 706 Score = 569 bits (1467), Expect = e-159 Identities = 309/534 (57%), Positives = 368/534 (68%), Gaps = 5/534 (0%) Frame = +2 Query: 338 NRRSHNREG--PRDAN-GSGWRE-SGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXX 505 N +H R+ P+D + G RE S H KH QK S PMP P+ N + Sbjct: 189 NMGAHERDKGVPKDPSYGRRDRENSNHDKH-----QKHSGPPMP----PKKANGPSGRME 239 Query: 506 XXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKA 685 H LKESQNT+L T ++S+G K HG IVGSR+GE+KA Sbjct: 240 TDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTG-KVHGSIVGSRMGERKA 298 Query: 686 TPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMH 865 TPFLSGERIENRLKKPTTFLCKLKFRNELPD +AQPKL+SL K+ YT+YTITSLEK + Sbjct: 299 TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTY 358 Query: 866 KPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDG-IRRKERP 1042 KP+L+VEPD+G+PLDLLD+SVYN ++ P TP+K+DG I+RKERP Sbjct: 359 KPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERP 418 Query: 1043 TDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXAS 1222 TDKGV+WLVKTQYISPLS ++AK SLTEKQAKELRE +GG AS Sbjct: 419 TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEAS 478 Query: 1223 FEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRD 1402 FEACKSRP+HAT+ L PVE+LPLLPDFDRYDD FV+ AFD PTADSE ++KLD+S+RD Sbjct: 479 FEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD 538 Query: 1403 DHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRG 1582 HES AIMKS++ S+ +KPEKFLAYMVPSPDEL KD+YDE ED+ Y+WVREY WDVRG Sbjct: 539 AHESQAIMKSYMATSSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG 598 Query: 1583 DDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXX 1762 D+ 
DDPTTYLV+FD+ EARY+PLPTKL+LRKKR KEGRS++EVE + Sbjct: 599 DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTV 658 Query: 1763 XXXELKESGDYVSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 E+K+ G Y SNSKRG S IEDG+ HK R QDMD +SGAED+MSD Sbjct: 659 ATLEVKDPGIY--SNSKRG-SDIEDGIGR---SHKHDRHQDMDQFSGAEDEMSD 706 >ref|XP_007024308.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508779674|gb|EOY26930.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 562 Score = 567 bits (1461), Expect = e-159 Identities = 307/541 (56%), Positives = 362/541 (66%), Gaps = 5/541 (0%) Frame = +2 Query: 317 KESRFHDNRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAX 496 KES ++ G RD GSG RE GHS H + + +P + PN A Sbjct: 36 KESVGDKGLNERSQGGNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAG 90 Query: 497 XXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGE 676 +KESQ T +M SG KGHG +VGSR+G+ Sbjct: 91 RVETEEERRLRKKREFEKQRQEEKHRQQMKESQKT-----QMMPSG-KGHGSMVGSRMGD 144 Query: 677 KKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLE 856 ++ATPFLSGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL++L K+++TKYTITSLE Sbjct: 145 RRATPFLSGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLE 204 Query: 857 KMHKPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKE 1036 KM+KPKLFVEPD+G+PLDLLD+SVYN + P TPIK+DGIRRKE Sbjct: 205 KMYKPKLFVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKE 264 Query: 1037 RPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXX 1216 RPTDKGVSWLVKTQYISPLS ++ K SLTEKQAKELRE +GG Sbjct: 265 RPTDKGVSWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIE 324 Query: 1217 ASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSV 1396 ASFEA K RPVHAT+ L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SV Sbjct: 325 ASFEASKLRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSV 384 Query: 1397 RDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDV 1576 RD+HES AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDV Sbjct: 385 
RDEHESRAIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDV 444 Query: 1577 RGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXX 1756 RGDDA+DPTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E + Sbjct: 445 RGDDANDPTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRS 504 Query: 1757 XXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMS 1921 ELKE Y S S+SK GR EDGL HK+AR D+D YSGAEDD+S Sbjct: 505 TVAAIELKEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLS 561 Query: 1922 D 1924 + Sbjct: 562 E 562 >ref|XP_007024307.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508779673|gb|EOY26929.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 685 Score = 567 bits (1461), Expect = e-159 Identities = 307/541 (56%), Positives = 362/541 (66%), Gaps = 5/541 (0%) Frame = +2 Query: 317 KESRFHDNRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAX 496 KES ++ G RD GSG RE GHS H + + +P + PN A Sbjct: 159 KESVGDKGLNERSQGGNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAG 213 Query: 497 XXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGE 676 +KESQ T +M SG KGHG +VGSR+G+ Sbjct: 214 RVETEEERRLRKKREFEKQRQEEKHRQQMKESQKT-----QMMPSG-KGHGSMVGSRMGD 267 Query: 677 KKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLE 856 ++ATPFLSGERIENRLKKPTTFLCKLKFRNELPDP+AQPKL++L K+++TKYTITSLE Sbjct: 268 RRATPFLSGERIENRLKKPTTFLCKLKFRNELPDPSAQPKLMALKKDKDRFTKYTITSLE 327 Query: 857 KMHKPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKE 1036 KM+KPKLFVEPD+G+PLDLLD+SVYN + P TPIK+DGIRRKE Sbjct: 328 KMYKPKLFVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKE 387 Query: 1037 RPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXX 1216 RPTDKGVSWLVKTQYISPLS ++ K SLTEKQAKELRE +GG Sbjct: 388 RPTDKGVSWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIE 447 Query: 1217 ASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSV 1396 ASFEA K RPVHAT+ L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SV Sbjct: 448 
ASFEASKLRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSV 507 Query: 1397 RDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDV 1576 RD+HES AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDV Sbjct: 508 RDEHESRAIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDV 567 Query: 1577 RGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXX 1756 RGDDA+DPTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E + Sbjct: 568 RGDDANDPTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRS 627 Query: 1757 XXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMS 1921 ELKE Y S S+SK GR EDGL HK+AR D+D YSGAEDD+S Sbjct: 628 TVAAIELKEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLS 684 Query: 1922 D 1924 + Sbjct: 685 E 685 >ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated factor 1 homolog [Vitis vinifera] Length = 589 Score = 567 bits (1460), Expect = e-159 Identities = 305/535 (57%), Positives = 368/535 (68%), Gaps = 9/535 (1%) Frame = +2 Query: 347 SHNRE--GPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXX 520 SH R+ P+D G+G RE GHS G P K P+P A + N Sbjct: 64 SHGRDKGAPKDLRGAGRREPGHSNQG--PSGKQQKPPVPPAPVKK-SNGPPGRVETEEER 120 Query: 521 XXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVG-SRVGEKKATPFL 697 H LKESQNTVL T ++SSG KGHG +VG SR+GE++ TPFL Sbjct: 121 RLRKKREFEKQRQEEKQKHQLKESQNTVLQKTQMLSSG-KGHGSVVGGSRMGERRTTPFL 179 Query: 698 SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 877 SG+RIENRL+KPTTFLCKLKFRNELPDPTAQPKL++L T K+++TKYTITSLEKMHKP+L Sbjct: 180 SGDRIENRLRKPTTFLCKLKFRNELPDPTAQPKLMALKTDKDRFTKYTITSLEKMHKPQL 239 Query: 878 FVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGV 1057 FVEPD+G+PLDLLD+SVYN + P TP+K++GI++KERPTDKGV Sbjct: 240 FVEPDLGIPLDLLDLSVYNPPSVRRPLDPEDEELLRDDESVTPVKKEGIKKKERPTDKGV 299 Query: 1058 SWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACK 1237 SWLVKTQYISPLST++ K SLTEKQAKELRET+GG A+F A K Sbjct: 300 SWLVKTQYISPLSTESTKQSLTEKQAKELRETKGGRNILENFNSRERKIQNIEAAFAASK 359 Query: 1238 
SRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESH 1417 PVH+T+ L+PVEILPLLPDF RYDD FV+A+FD PTADSEIYSKLD++VRD HES Sbjct: 360 ITPVHSTNKSLKPVEILPLLPDFARYDDSFVVASFDSAPTADSEIYSKLDKTVRDSHESQ 419 Query: 1418 AIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADD 1597 AI+KS++ GS+ +KPEKFLAYM PSPDEL KD+YDE+ED Y+WVREY WDVRGDDADD Sbjct: 420 AILKSYMATGSDPSKPEKFLAYMAPSPDELSKDIYDENEDTSYSWVREYHWDVRGDDADD 479 Query: 1598 PTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXEL 1777 PTTYLV+F++ +ARYLPLPTKL+LRKKR KEGRS++EVE + EL Sbjct: 480 PTTYLVSFNKTDARYLPLPTKLLLRKKRAKEGRSSDEVEHFPVPSKVTVRQRPNVAAIEL 539 Query: 1778 KESGDYVSSNSKRGRSA------IEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 K+ + V S+SKRG S+ +EDGL +K + Q MD SGAED+MSD Sbjct: 540 KD--EEVYSSSKRGVSSSKRGVDMEDGLGR---SYKGVQDQHMDQSSGAEDEMSD 589 >ref|XP_007135633.1| hypothetical protein PHAVU_010G145300g [Phaseolus vulgaris] gi|561008678|gb|ESW07627.1| hypothetical protein PHAVU_010G145300g [Phaseolus vulgaris] Length = 661 Score = 565 bits (1455), Expect = e-158 Identities = 300/531 (56%), Positives = 356/531 (67%), Gaps = 5/531 (0%) Frame = +2 Query: 347 SHNREGPR--DANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXX 520 +HN E R D + SG RE S HG+ KQ P+P+ P Sbjct: 139 THNNEERRFKDPSTSGRREYDPSNHGIGHKQHKHQPPVPAKKVNGPPGRAETEEEKRLRK 198 Query: 521 XXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLS 700 LKESQNTVL TH++SSG KGHGL+ GSR+GE+++TP LS Sbjct: 199 KREFEKQRQEEKHRQQ----LKESQNTVLQKTHLLSSG-KGHGLVAGSRMGERRSTPLLS 253 Query: 701 GERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLF 880 ER+ENRLKKPTTFLCKLKFRNELPDP+AQPKL++ K+QY KYTITSLEKM+KPKLF Sbjct: 254 AERVENRLKKPTTFLCKLKFRNELPDPSAQPKLMAFKKDKDQYAKYTITSLEKMYKPKLF 313 Query: 881 VEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVS 1060 VEPD+G+PLDLLD+SVYN + PP TPIK+DGI+RKERPTDKGV+ Sbjct: 314 VEPDLGIPLDLLDLSVYNPPSVRPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVA 373 Query: 1061 WLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKS 1240 WLVKTQYISPLS ++ K SLTEKQAKELRE 
+GG ASFEA KS Sbjct: 374 WLVKTQYISPLSMESTKQSLTEKQAKELREMKGGRGVLDNLNSRERQIREIEASFEAAKS 433 Query: 1241 RPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHA 1420 PVHAT+ L PVE++PLLPDFDRYDDQFV+AAFD PTADSE+Y+KLD+SVRD ES A Sbjct: 434 DPVHATNKDLYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKLDKSVRDAFESKA 493 Query: 1421 IMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDP 1600 +MKS++ S+ A PEKFLAYM P+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDP Sbjct: 494 VMKSYVATSSDPANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDP 553 Query: 1601 TTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELK 1780 TT+ V FD+ EARYLPLPTKL+LRKKR KEGRS EE+EQ E K Sbjct: 554 TTFFVAFDDSEARYLPLPTKLVLRKKRAKEGRSGEEIEQCPVPSRVTVRRRSSVAAIERK 613 Query: 1781 ESGDYVSS---NSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 ++G Y SS +SKR R ++DGLE H+ A QD SGAED MS+ Sbjct: 614 DTGVYTSSRGNSSKRSRLEMDDGLE---HHHRGAPHQDNYQSSGAEDYMSE 661 >ref|XP_006583048.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1 [Glycine max] gi|571464391|ref|XP_006583049.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X2 [Glycine max] Length = 659 Score = 564 bits (1454), Expect = e-158 Identities = 297/522 (56%), Positives = 355/522 (68%), Gaps = 3/522 (0%) Frame = +2 Query: 368 RDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAXXXXXXXXXXXXXXXXXX 547 ++ + SG RE HS HG+ KQ P+P ++ N Sbjct: 145 KEPSTSGRREYEHSNHGIAHKQHKQQPPVP---VKKMNNGPPGRAETDEEKRLRKKREFE 201 Query: 548 XXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLK 727 LKESQNTVL TH++SSG KGHG+I GSR+GE+++TP L ER+ENRLK Sbjct: 202 KQRQEEKHRQQLKESQNTVLQKTHMLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRLK 260 Query: 728 KPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPL 907 KPTTFLCKLKFRNELPDP+AQPKL++ K+QY KYTITSLEKM+KPKLFVEPD+G+PL Sbjct: 261 KPTTFLCKLKFRNELPDPSAQPKLMASKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPL 320 Query: 908 DLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYIS 1087 DLLD+SVYN + PP TPIK+DGI+RKERPTDKGV+WLVKTQYIS Sbjct: 321 
DLLDLSVYNPPSVRPPLAPEDKELLRDDEAVTPIKKDGIKRKERPTDKGVAWLVKTQYIS 380 Query: 1088 PLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDK 1267 PLS ++ K SLTEKQAKELRE +GG ASFEA KS PVHAT+ Sbjct: 381 PLSMESTKQSLTEKQAKELREMKGGRGILDNLNSRERQIREIEASFEAAKSDPVHATNKD 440 Query: 1268 LQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAG 1447 L PVE++PLLPDFDRYDDQFV+AAFD PTADSE+++K+D+SVRD ES A+MKS++ Sbjct: 441 LYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMHAKMDKSVRDAFESKAVMKSYVATS 500 Query: 1448 SEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDE 1627 S+ A PEKFLAYMVP+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDP T+LV FDE Sbjct: 501 SDPANPEKFLAYMVPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPATFLVAFDE 560 Query: 1628 EEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSS- 1804 EARYLPLPTKL+LRKKR KEGRS +EVEQ E K+SG Y SS Sbjct: 561 SEARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSK 620 Query: 1805 --NSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 +SKRG ++DGLE +H+ A QD SGAED MSD Sbjct: 621 GNSSKRGGLEMDDGLE---DQHRGAPHQDNYQSSGAEDYMSD 659 >ref|XP_004302858.1| PREDICTED: uncharacterized protein LOC101304396 [Fragaria vesca subsp. 
vesca] Length = 693 Score = 563 bits (1450), Expect = e-157 Identities = 309/543 (56%), Positives = 359/543 (66%), Gaps = 5/543 (0%) Frame = +2 Query: 311 QAKESRFHDNRRSHNREGPRDANGSGWRESGHSKH-GLPPKQKGSAVPMPSANAPRVPNA 487 +++ES F ++ H++ +D S RE GHS H G+PPK K P + N Sbjct: 163 KSRESGF--DKGPHDKGASKDVGASAKREHGHSNHHGVPPKHK------PPVPLVKKSNG 214 Query: 488 IAXXXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSR 667 KESQN+VL TH+MSSG KGHG I GSR Sbjct: 215 APGRVETEEERRLRKKREFEKQRQEEKHRQQAKESQNSVLQKTHLMSSG-KGHGSIAGSR 273 Query: 668 VGEKKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTIT 847 +GE++ TPFLSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+S+ +QYTKYTIT Sbjct: 274 MGERRTTPFLSGERAENRLKKPTTFVCKLKFRNELPDPSAQPKLMSMKKDPDQYTKYTIT 333 Query: 848 SLEKMHKPKLFVEPDIGVPLDLLDISVYNSNT-TGPPHTXXXXXXXXXXXXXTPIKQDGI 1024 SLEK +KPKLFVEPD+G+PLDLLD+SVYN PP TP+K+DGI Sbjct: 334 SLEKNYKPKLFVEPDLGIPLDLLDLSVYNPPPGPRPPLAPEDEELLRDDVAVTPVKKDGI 393 Query: 1025 RRKERPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXX 1204 RRKERPTDKGV+WLVKTQYISPLS D+AK SLTEKQAKELRE +GG Sbjct: 394 RRKERPTDKGVAWLVKTQYISPLSMDSAKQSLTEKQAKELREMKGGRNLLDNLNDRERQI 453 Query: 1205 XXXXASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKL 1384 ASFEACKSRPVHAT+ L PVE+LPLLP +RY+DQFVLA FDG PTADSEIYSKL Sbjct: 454 KEIEASFEACKSRPVHATNKNLYPVEVLPLLPXHNRYEDQFVLAGFDGAPTADSEIYSKL 513 Query: 1385 DRSVRDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREY 1564 D+S D ES AIMKS+ G++ A P+KFLAYMVPSP+EL KD YDE EDI Y+WVREY Sbjct: 514 DQSDHDLCESRAIMKSYKVTGADPANPDKFLAYMVPSPNELSKDPYDESEDISYSWVREY 573 Query: 1565 QWDVRGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXX 1744 Q+DVRGDD DD TTYLV+FDE+ ARY PLP KL+LRKKR KEGRS +EVE + Sbjct: 574 QYDVRGDDVDDLTTYLVSFDEDAARYAPLPAKLVLRKKRAKEGRSTDEVEHFPAPSRVTV 633 Query: 1745 XXXXXXXXXELKESGDY---VSSNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDD 1915 ELK++GDY SN KR ED LE P K R QD+D YSGAEDD Sbjct: 634 RRRSTVSAIELKDAGDYSRGALSNLKRRGFDNEDALERP---QKRGRHQDVDEYSGAEDD 690 Query: 1916 MSD 1924 +SD Sbjct: 691 LSD 693 
>ref|XP_007225143.1| hypothetical protein PRUPE_ppa002485mg [Prunus persica] gi|462422079|gb|EMJ26342.1| hypothetical protein PRUPE_ppa002485mg [Prunus persica] Length = 668 Score = 553 bits (1424), Expect = e-154 Identities = 302/535 (56%), Positives = 355/535 (66%), Gaps = 6/535 (1%) Frame = +2 Query: 338 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP---MPSANAPRVPNAIAXXXXX 508 +R SH + R+ + SG E GH HG+P KQ VP + AN P P + Sbjct: 160 DRGSHEKVASREVSVSGRGEHGHLNHGVPQKQHKPPVPSMQVKKANGP--PGRVETEEER 217 Query: 509 XXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKAT 688 LK+SQN+VL T ++SSG KGHG I GSR+GE++AT Sbjct: 218 RLRKKREFEKQRQEEKHRQQ----LKDSQNSVLQKTQMLSSG-KGHGSIAGSRMGERRAT 272 Query: 689 PFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHK 868 PFLSGER ENRLKKPTTF+CKLKFRNELPDP+AQPKL+SL K+QYTKYTITSLEK +K Sbjct: 273 PFLSGERTENRLKKPTTFVCKLKFRNELPDPSAQPKLMSLKKDKDQYTKYTITSLEKTYK 332 Query: 869 PKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTD 1048 PKLFVEPD+G+PLDLLD+SVYN + PP TP+K++GI+RKERPTD Sbjct: 333 PKLFVEPDLGIPLDLLDLSVYNPPSVRPPLALEDEELLRDDVAATPVKKNGIKRKERPTD 392 Query: 1049 KGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFE 1228 KGV+WL SLTEKQAKELRE +GG ASFE Sbjct: 393 KGVAWL----------------SLTEKQAKELREMKGGRNILDNLNDRERQIKEIEASFE 436 Query: 1229 ACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDH 1408 ACKSRPVHAT+ L PVE+LPLLPDF+RY+DQFVLAAFDG PTADSEIYSKLD+S D + Sbjct: 437 ACKSRPVHATNKDLYPVEVLPLLPDFERYEDQFVLAAFDGAPTADSEIYSKLDQSGHDAY 496 Query: 1409 ESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDD 1588 ES AIMKS+ G++ A PEKFLAYMVPSP+EL KD YDE ED+ Y+WVREY +DVRGDD Sbjct: 497 ESRAIMKSYKVTGADPANPEKFLAYMVPSPNELSKDPYDESEDVSYSWVREYHYDVRGDD 556 Query: 1589 ADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXX 1768 DPTTYLV+FDEEEARY PLPTKL+LRKKR KEG++++EVE + Sbjct: 557 VHDPTTYLVSFDEEEARYAPLPTKLVLRKKRSKEGKTSDEVEHFPAPSRVTVRQRSTVAA 616 Query: 1769 XELKESGDYVS---SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 ELK+SGDY SN K R IED LE P K+AR 
QD+D YSGAEDD+SD Sbjct: 617 IELKDSGDYSRGSVSNLKTRRFDIEDTLERP---RKIARHQDIDEYSGAEDDLSD 668 >ref|XP_003531647.1| PREDICTED: bromodomain-containing protein 4-like isoform X1 [Glycine max] gi|571472317|ref|XP_006585570.1| PREDICTED: bromodomain-containing protein 4-like isoform X2 [Glycine max] Length = 666 Score = 552 bits (1422), Expect = e-154 Identities = 296/523 (56%), Positives = 351/523 (67%), Gaps = 4/523 (0%) Frame = +2 Query: 368 RDANGSGWRESGHSKHGLPPKQ-KGSAVPMPSANAPRVPNAIAXXXXXXXXXXXXXXXXX 544 ++ + SG RE HS HG+ KQ K P+P ++ N Sbjct: 152 KEPSKSGRREYEHSNHGIAHKQHKQQQPPLP---VKKMNNGPPGRAETDEEKRLRKKREF 208 Query: 545 XXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRL 724 LKESQNTVL TH++SSG KGHG+I GSR+GE+++TP L ER+ENRL Sbjct: 209 EKQRQEEKHRQQLKESQNTVLQKTHLLSSG-KGHGMIAGSRMGERRSTPLLGAERVENRL 267 Query: 725 KKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVP 904 KKPTTFLCKLKFRNELPDP+AQPKL+S K+QY KYTITSLEKM+KPKLFVEPD+G+P Sbjct: 268 KKPTTFLCKLKFRNELPDPSAQPKLMSFKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIP 327 Query: 905 LDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYI 1084 LDLLD+SVYN PP TPIK+DGI+RKERPTDKGV+WLVKTQYI Sbjct: 328 LDLLDLSVYNPPRVRPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVAWLVKTQYI 387 Query: 1085 SPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSD 1264 SPLS ++ K SLTEKQAKELRE +G ASFEA KS PVHAT+ Sbjct: 388 SPLSMESTKQSLTEKQAKELREMKG-RGILDNLNSRERQIREIQASFEAAKSDPVHATNK 446 Query: 1265 KLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGA 1444 L PVE++PLLPDFDRYDDQFV+AAFD PTADSE+Y+K+++SVRD ES A+MKS++ Sbjct: 447 DLYPVEVMPLLPDFDRYDDQFVVAAFDNAPTADSEMYAKMNKSVRDAFESKAVMKSYVAT 506 Query: 1445 GSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFD 1624 G + A PEKFLAYM P+P EL KD+YDE+ED+ Y+W+REY WDVRGDDADDPTT+LV FD Sbjct: 507 GLDPANPEKFLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPTTFLVAFD 566 Query: 1625 EEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSS 1804 E EARYLPLPTKL+LRKKR KEGRS +EVEQ E K+SG Y SS Sbjct: 567 
ESEARYLPLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSS 626 Query: 1805 NS---KRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 KR ++DGLE +H+ A QD SGAED MSD Sbjct: 627 KGNSFKRVGLEMDDGLE---DQHRGAPHQDNYQSSGAEDYMSD 666 >ref|XP_006465692.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X1 [Citrus sinensis] Length = 576 Score = 547 bits (1410), Expect = e-153 Identities = 277/453 (61%), Positives = 334/453 (73%), Gaps = 5/453 (1%) Frame = +2 Query: 581 LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760 +KESQN V+ + +++SG GHG + GSR+G+++A P LSGERIENRLKKPTTFLCKLKF Sbjct: 127 MKESQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKF 186 Query: 761 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940 RNELP+P+AQPKL++L K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+SVYN Sbjct: 187 RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 246 Query: 941 TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120 + PP TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++A+ SL Sbjct: 247 SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 306 Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300 TEKQAKELRE +GG ASFEACK RP+HAT+ LQPVEILPLLP Sbjct: 307 TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 366 Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480 DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++ GS+ A PEKFLA Sbjct: 367 DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 426 Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660 YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK Sbjct: 427 YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 486 Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYV-----SSNSKRGRS 1825 L LRKKR EGRSN+EVE + ELKE G Y SS+SK GR Sbjct: 487 LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRV 546 Query: 1826 AIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 ++ LE H +R QD SGAEDDM D Sbjct: 547 
DSQEDLER---SHNGSRHQDPYQSSGAEDDMYD 576 >ref|XP_006426877.1| hypothetical protein CICLE_v10025066mg [Citrus clementina] gi|557528867|gb|ESR40117.1| hypothetical protein CICLE_v10025066mg [Citrus clementina] Length = 677 Score = 547 bits (1410), Expect = e-153 Identities = 277/453 (61%), Positives = 334/453 (73%), Gaps = 5/453 (1%) Frame = +2 Query: 581 LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760 +KESQN V+ + +++SG GHG +VGSR+G+++A P LSGER ENRLKKPTTFLCKLKF Sbjct: 228 MKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKF 287 Query: 761 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940 RNELP+P+AQPKL++L K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+SVYN Sbjct: 288 RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 347 Query: 941 TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120 + PP TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++A+ SL Sbjct: 348 SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 407 Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300 TEKQAKELRE +GG ASFEACK RP+HAT+ LQPVEILPLLP Sbjct: 408 TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 467 Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480 DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++ GS+ A PEKFLA Sbjct: 468 DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 527 Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660 YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK Sbjct: 528 YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 587 Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYV-----SSNSKRGRS 1825 L LRKKR EGRSN+EVE + ELKE G Y SS+SK GR Sbjct: 588 LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRV 647 Query: 1826 AIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 ++ LE H +R QD SGAEDDM D Sbjct: 648 DSQEDLER---SHNGSRQQDPYQSSGAEDDMYD 677 >ref|XP_006465693.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform X2 [Citrus 
sinensis] Length = 570 Score = 546 bits (1408), Expect = e-152 Identities = 276/448 (61%), Positives = 333/448 (74%) Frame = +2 Query: 581 LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760 +KESQN V+ + +++SG GHG + GSR+G+++A P LSGERIENRLKKPTTFLCKLKF Sbjct: 127 MKESQNVVMQKSQMVASGKGGHGSMAGSRMGDRRAAPLLSGERIENRLKKPTTFLCKLKF 186 Query: 761 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940 RNELP+P+AQPKL++L K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+SVYN Sbjct: 187 RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 246 Query: 941 TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120 + PP TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++A+ SL Sbjct: 247 SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 306 Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300 TEKQAKELRE +GG ASFEACK RP+HAT+ LQPVEILPLLP Sbjct: 307 TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 366 Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480 DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++ GS+ A PEKFLA Sbjct: 367 DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 426 Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660 YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK Sbjct: 427 YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 486 Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSSNSKRGRSAIEDG 1840 L LRKKR EGRSN+EVE + ELKE G SS+SK GR ++ Sbjct: 487 LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGGN-SSSSKMGRVDSQED 545 Query: 1841 LETPVPRHKVARVQDMDHYSGAEDDMSD 1924 LE H +R QD SGAEDDM D Sbjct: 546 LER---SHNGSRHQDPYQSSGAEDDMYD 570 >ref|XP_006343037.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Solanum tuberosum] Length = 700 Score = 544 bits (1402), Expect = e-152 Identities = 291/537 (54%), Positives = 350/537 (65%), Gaps = 8/537 (1%) Frame = +2 Query: 338 NRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXX 514 ++R+ +R 
SGWRESGH H KQ G +VP MP + NA + Sbjct: 171 DQRNESRPSAEKRRESGWRESGHGNHTARSKQPGHSVPPMPVKKS----NAPSGRVETEE 226 Query: 515 XXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPF 694 LKESQN VL T +++SG KGHG I S + +++ P Sbjct: 227 ERRLRKKREIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTAPL 286 Query: 695 LSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPK 874 LSGER ENRLKKPTTFLCKLKFRNELPDPTAQPKLL+L +++TKY+ITSLEKMHKP+ Sbjct: 287 LSGERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQ 346 Query: 875 LFVEPDIGVPLDLLDISVYNS-NTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDK 1051 L+VEPD+G+PLDLLD+SVYN P TPIK+DGI++KERPTDK Sbjct: 347 LYVEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDK 406 Query: 1052 GVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEA 1231 GVSWLVKTQYISPLST++AK SLTEKQAKELRET+GG ASFEA Sbjct: 407 GVSWLVKTQYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEA 466 Query: 1232 CKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHE 1411 CKSRP+HAT+ +LQPV++ PL PDFDRY D FVLA +D PTADSE Y+KLD++VRD E Sbjct: 467 CKSRPIHATNRRLQPVKVQPLYPDFDRYKDPFVLANYDSAPTADSETYNKLDKTVRDACE 526 Query: 1412 SHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDA 1591 S A+MKSF+ S+ KP+KFLAYMVP+P+EL KD+YDE+EDI Y+WVREY WDVRGDDA Sbjct: 527 SQAVMKSFVATSSDADKPDKFLAYMVPAPNELSKDMYDENEDISYSWVREYHWDVRGDDA 586 Query: 1592 DDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXX 1771 DDP TY+V F E EARY+PLPTKL+LRKKR +EG+SNEEVE + Sbjct: 587 DDPNTYVVAFGETEARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAI 646 Query: 1772 ELKESGDYVS------SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 ELKE G Y + S+SKR R + ED + +H D D SG E MSD Sbjct: 647 ELKEEGGYTTALKGNVSSSKRSRISHEDDVG---EQHNNMHDDDQDQSSGGEYYMSD 700 >ref|XP_004235642.1| PREDICTED: uncharacterized protein LOC101254885 [Solanum lycopersicum] Length = 698 Score = 537 bits (1384), Expect = e-150 Identities = 288/536 (53%), Positives = 346/536 (64%), Gaps = 8/536 (1%) Frame = +2 Query: 341 
RRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVP-MPSANAPRVPNAIAXXXXXXXX 517 +R+ +R SGWRES H H KQ +VP +P + NA + Sbjct: 170 QRNESRHSVEKRRESGWRESRHGNHTARSKQPDHSVPPLPMKKS----NAHSGRVETEEE 225 Query: 518 XXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFL 697 LKESQN VL T +++SG KGHG I S + +++ TP L Sbjct: 226 RRSRKKREIEKQRHEEKNRQHLKESQNKVLQKTQMLTSGTKGHGSISASHMADRRTTPLL 285 Query: 698 SGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKL 877 SGER ENRLKKPTTFLCKLKFRNELPDPTAQPKLL+L +++TKY+ITSLEKMHKP+L Sbjct: 286 SGERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFTKYSITSLEKMHKPQL 345 Query: 878 FVEPDIGVPLDLLDISVYNS-NTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKG 1054 VEPD+G+PLDLLD+SVYN P TPIK+DGI++KERPTDKG Sbjct: 346 HVEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDNPITPIKKDGIKKKERPTDKG 405 Query: 1055 VSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEAC 1234 VSWLVKTQYISPLST++AK SLTEKQAKELRET+GG ASFEAC Sbjct: 406 VSWLVKTQYISPLSTESAKQSLTEKQAKELRETKGGRNILENLNKRDRQIQEIEASFEAC 465 Query: 1235 KSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHES 1414 KSRP+HA++ +LQP+++ PL PDFDRY D FVLA +D PTADSE YSKLD++VRD ES Sbjct: 466 KSRPIHASNRRLQPIKVQPLYPDFDRYKDPFVLANYDSAPTADSETYSKLDKTVRDACES 525 Query: 1415 HAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDAD 1594 A+MKSF+ S+ KP+KFLAYMVP+P+EL KD+YDE EDI Y+WVREY WDVRGDDAD Sbjct: 526 QAVMKSFVATSSDADKPDKFLAYMVPAPNELSKDIYDESEDISYSWVREYHWDVRGDDAD 585 Query: 1595 DPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXE 1774 DP TY+V F E EARY+PLPTKL+LRKKR +EG+SNEEVE + E Sbjct: 586 DPNTYVVAFGEREARYMPLPTKLVLRKKRAREGKSNEEVEHFPVPSRVTVRKRPTAAAIE 645 Query: 1775 LKESGDYVS------SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 LKE G Y + S+SKR R + ED + +H D D SG E MSD Sbjct: 646 LKEEGGYTTALKGNVSSSKRSRISHEDDVG---EQHNNMHDDDQDQSSGGEYYMSD 698 >ref|XP_006426878.1| hypothetical protein CICLE_v10025066mg [Citrus clementina] gi|557528868|gb|ESR40118.1| hypothetical protein CICLE_v10025066mg [Citrus clementina] Length = 632 Score = 525 bits (1352), Expect = e-146 
Identities = 256/403 (63%), Positives = 309/403 (76%) Frame = +2 Query: 581 LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760 +KESQN V+ + +++SG GHG +VGSR+G+++A P LSGER ENRLKKPTTFLCKLKF Sbjct: 228 MKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKF 287 Query: 761 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940 RNELP+P+AQPKL++L K+++T+YT +SLEK +KP+L VEPD+G+PLDLLD+SVYN Sbjct: 288 RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPP 347 Query: 941 TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120 + PP TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++A+ SL Sbjct: 348 SVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSL 407 Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300 TEKQAKELRE +GG ASFEACK RP+HAT+ LQPVEILPLLP Sbjct: 408 TEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLP 467 Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480 DF+RYDDQFV A FDG PTADSEIYSK+D+SVRD HES AIMKS++ GS+ A PEKFLA Sbjct: 468 DFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 527 Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660 YMVPS +EL KD+YDE+ED+ ++WVREY WDVRGDDADDPTTYLV+FD++EARY+PLPTK Sbjct: 528 YMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTK 587 Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESG 1789 L LRKKR EGRSN+EVE + ELKE G Sbjct: 588 LNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQG 630 >ref|XP_007024309.1| Hydroxyproline-rich glycoprotein family protein isoform 3 [Theobroma cacao] gi|508779675|gb|EOY26931.1| Hydroxyproline-rich glycoprotein family protein isoform 3 [Theobroma cacao] Length = 662 Score = 523 bits (1347), Expect = e-145 Identities = 293/541 (54%), Positives = 342/541 (63%), Gaps = 5/541 (0%) Frame = +2 Query: 317 KESRFHDNRRSHNREGPRDANGSGWRESGHSKHGLPPKQKGSAVPMPSANAPRVPNAIAX 496 KES ++ G RD GSG RE GHS H + + +P + PN A Sbjct: 159 KESVGDKGLNERSQGGNRDFLGSGRREHGHSNHAAGVRDQKPMMP-----PVKKPNGPAG 213 Query: 497 
XXXXXXXXXXXXXXXXXXXXXXXXXXHLLKESQNTVLHNTHIMSSGMKGHGLIVGSRVGE 676 +KESQ T +M SG KGHG +VGSR+G+ Sbjct: 214 RVETEEERRLRKKREFEKQRQEEKHRQQMKESQKT-----QMMPSG-KGHGSMVGSRMGD 267 Query: 677 KKATPFLSGERIENRLKKPTTFLCKLKFRNELPDPTAQPKLLSLNTYKEQYTKYTITSLE 856 ++ATPFLSGERIENRLKKPTTFLCKLKF TKYTITSLE Sbjct: 268 RRATPFLSGERIENRLKKPTTFLCKLKF-----------------------TKYTITSLE 304 Query: 857 KMHKPKLFVEPDIGVPLDLLDISVYNSNTTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKE 1036 KM+KPKLFVEPD+G+PLDLLD+SVYN + P TPIK+DGIRRKE Sbjct: 305 KMYKPKLFVEPDLGIPLDLLDLSVYNPPSVRPSLAPEDAELLHDDEAVTPIKKDGIRRKE 364 Query: 1037 RPTDKGVSWLVKTQYISPLSTDAAKMSLTEKQAKELRETRGGXXXXXXXXXXXXXXXXXX 1216 RPTDKGVSWLVKTQYISPLS ++ K SLTEKQAKELRE +GG Sbjct: 365 RPTDKGVSWLVKTQYISPLSMESTKQSLTEKQAKELRELKGGRNILENLNNRERQIKEIE 424 Query: 1217 ASFEACKSRPVHATSDKLQPVEILPLLPDFDRYDDQFVLAAFDGDPTADSEIYSKLDRSV 1396 ASFEA K RPVHAT+ L+PVE++PLLPDFDRY+DQFV+ AFDG PTADSEI+SKLD SV Sbjct: 425 ASFEASKLRPVHATNKNLEPVEVMPLLPDFDRYNDQFVMVAFDGAPTADSEIFSKLDDSV 484 Query: 1397 RDDHESHAIMKSFIGAGSEQAKPEKFLAYMVPSPDELWKDVYDEHEDILYTWVREYQWDV 1576 RD+HES AIMKS++ A S+ A PEKFLAYMVPS DEL K +YDEHED+ Y+WVREY WDV Sbjct: 485 RDEHESRAIMKSYLAASSDPANPEKFLAYMVPSLDELSKGMYDEHEDVSYSWVREYNWDV 544 Query: 1577 RGDDADDPTTYLVTFDEEEARYLPLPTKLILRKKRPKEGRSNEEVEQYXXXXXXXXXXXX 1756 RGDDA+DPTTYLV+FDE EARY+PLPTKL LRKKR +EGR+ +E+E + Sbjct: 545 RGDDANDPTTYLVSFDEGEARYVPLPTKLNLRKKRAREGRTGDEIEHFPIPARITVRRRS 604 Query: 1757 XXXXXELKESGDYVS-----SNSKRGRSAIEDGLETPVPRHKVARVQDMDHYSGAEDDMS 1921 ELKE Y S S+SK GR EDGL HK+AR D+D YSGAEDD+S Sbjct: 605 TVAAIELKEPEVYTSSRGGMSSSKIGRLDAEDGLGR---SHKLARHHDVDQYSGAEDDLS 661 Query: 1922 D 1924 + Sbjct: 662 E 662 >ref|XP_002303312.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550342419|gb|EEE78291.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 569 Score = 521 bits (1341), Expect = e-145 Identities = 269/451 (59%), Positives = 326/451 (72%), Gaps = 3/451 (0%) Frame = +2 Query: 581 LKESQNTVLHNTHIMSSGMKGHGLIVGSRVGEKKATPFLSGERIENRLKKPTTFLCKLKF 760 LKESQN+ 
L H++SS KGHG IVGSR+G++ ATP L GER ENRLKKPTTF+CKLKF Sbjct: 123 LKESQNSALLKNHVISS-QKGHGSIVGSRLGDRVATPLLGGERAENRLKKPTTFMCKLKF 181 Query: 761 RNELPDPTAQPKLLSLNTYKEQYTKYTITSLEKMHKPKLFVEPDIGVPLDLLDISVYNSN 940 RNELPDP+AQPKL+ L K+++TKYTITSLEKM+KP+L+VEPD+G+PLDLLD+SVYN Sbjct: 182 RNELPDPSAQPKLMPLKREKDRFTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPP 241 Query: 941 TTGPPHTXXXXXXXXXXXXXTPIKQDGIRRKERPTDKGVSWLVKTQYISPLSTDAAKMSL 1120 + P TP+K+DGI+RKERPTDKGVSWLVKTQYISPLS ++AK+SL Sbjct: 242 SVRPLLAPEDEELLHDDESVTPVKRDGIKRKERPTDKGVSWLVKTQYISPLSMESAKLSL 301 Query: 1121 TEKQAKELRETRGGXXXXXXXXXXXXXXXXXXASFEACKSRPVHATSDKLQPVEILPLLP 1300 TEKQAKELRE +GG ASF + K PVHAT+ L+PVEILPLLP Sbjct: 302 TEKQAKELREMKGGCKLLDNLNKRERQIKEIQASFASNKLPPVHATNKNLKPVEILPLLP 361 Query: 1301 DFDRYDDQFVLAAFDGDPTADSEIYSKLDRSVRDDHESHAIMKSFIGAGSEQAKPEKFLA 1480 DFDRY D+FV AFDG PTAD+E Y K D S RD +ES AIMK+ + +GS+ A PEKFLA Sbjct: 362 DFDRYGDKFVTVAFDGAPTADAENYRKFDPSDRDAYESWAIMKACVASGSDPANPEKFLA 421 Query: 1481 YMVPSPDELWKDVYDEHEDILYTWVREYQWDVRGDDADDPTTYLVTFDEEEARYLPLPTK 1660 Y VPSPDEL KD+YDE+EDILY+W+REY WDVRGDD DDP+T+LV+FDE EARYLPLPTK Sbjct: 422 YTVPSPDELSKDMYDENEDILYSWIREYHWDVRGDDVDDPSTFLVSFDEAEARYLPLPTK 481 Query: 1661 LILRKKRPKEGRSNEEVEQYXXXXXXXXXXXXXXXXXELKESGDYVSS---NSKRGRSAI 1831 + LRKKR +EGRS +E+E + E ++SG +S NS+ R Sbjct: 482 ISLRKKRAREGRSGDEIEHFPIPSRVTVRKRAVAATIEQRDSGAISNSRGNNSRMERFED 541 Query: 1832 EDGLETPVPRHKVARVQDMDHYSGAEDDMSD 1924 EDGL +VA +D+ H SGAED+MS+ Sbjct: 542 EDGLGR---LQRVALDEDLHHSSGAEDEMSE 569