BLASTX nr result
ID: Paeonia23_contig00005582
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Paeonia23_contig00005582 (4857 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI16022.3| unnamed protein product [Vitis vinifera] 856 0.0 ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma... 623 e-175 ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma... 623 e-175 ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citr... 622 e-175 ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II tra... 620 e-174 ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma... 614 e-172 emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera] 575 e-161 ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Popu... 554 e-154 ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prun... 543 e-151 ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus c... 541 e-150 ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Popu... 532 e-148 ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314... 499 e-138 ref|XP_007016231.1| Uncharacterized protein isoform 1 [Theobroma... 498 e-137 ref|XP_007016236.1| Uncharacterized protein isoform 6 [Theobroma... 496 e-137 ref|XP_007016235.1| Uncharacterized protein isoform 5 [Theobroma... 496 e-137 ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214... 466 e-128 ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205... 466 e-128 gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis] 458 e-125 ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227... 396 e-107 ref|XP_006345324.1| PREDICTED: trithorax group protein osa-like ... 393 e-106 >emb|CBI16022.3| unnamed protein product [Vitis vinifera] Length = 1669 Score = 856 bits (2211), Expect = 0.0 Identities = 555/1235 (44%), Positives = 668/1235 (54%), Gaps = 138/1235 (11%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGA-PQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPP 4414 HAVTG Q+ LG Q PMH H Q+ F QQ QMRP Sbjct: 507 HAVTGHHSFPQPRPQQQMPLGGMQQQPMHMH--------------PQAQFPQQSPQMRPS 552 Query: 4413 QSHAPIASQHQSTLPSPGQVPTI-PLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXX 4237 Q+HA + Q + LP PGQ + P QQ+P+HP+ QQ GH V+ Sbjct: 553 QAHAQ-SQQQSALLPLPGQAQNVLPPQQLPVHPH-QQAGHPVHQRAAMQPIQQSLPHQFV 610 Query: 4236 XXXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNV 4057 G+ QNQLHQQG + QP M S LRPQ P Q V Sbjct: 611 QQPPL-------GTGQNQLHQQGSFMQPPTPTMQSQLRPQAPPQSWQQHSHAYPQPQQKV 663 Query: 4056 AL-----SQSQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 A+ Q + GR +P G Q QP PQS +G +GA Q++ G NQ Sbjct: 664 AMLHGMQPQLPQNVGRPGMPNQGVQPQPFPQSQAGLSGAVQLRPMHLGPNQ--------- 714 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQ-QSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRT 3724 P GQ +++QS Q G +K T E D LS+K V +E ES S++T Sbjct: 715 PSANQTLGQ--------HLEQSAHPQPGLNVKQTTFEKPDDDLSKKGVGGQEGESFSEKT 766 Query: 3723 AKSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERKHIGEDED-----NN---KVSDS 3568 A+ D N TSG+ +++VE ++SE D KS+D+++K GEDED NN ++ +S Sbjct: 767 AREDANGVAATSGIESNTVE---IKSETDMKSMDEKQKTTGEDEDTISRINNSAKEIPES 823 Query: 3567 LRTLGTDPNSHSMENGEPVIKQIVEEEVTDSISEPSSGGKFV-----ENKDQKDVPHNDL 3403 +R LG+DP + E+GEPVIKQ+V+EEV S E S GGK + + KD+ VP + Sbjct: 824 MRALGSDPMQQASEDGEPVIKQMVKEEVIKSTVERSPGGKSIGIVVEDQKDELSVPPKQV 883 Query: 3402 KQVENSSLEGKEIQG----------QVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLS 3253 +QVE+S L+ KEIQ QVEI+ E GKL+KD+ +A GV+ ++RG + Sbjct: 884 EQVEHSLLQDKEIQNGLLMKNPPIQQVEILDEMGGKLQKDSGDASGVMQLFTATNRGTEA 943 Query: 3252 VHPSSAPVP--------------------------------------------EHRGHPP 3205 V P AP+P E+RG PP Sbjct: 944 VPP--APIPDSSAQNATPRGSVSVSERKMLNQPGNQERNLLQAPTMPQGPSNDEYRGFPP 1001 Query: 3204 PGQLHGRGFVQPSHPVPL-----HQRPP-----------ALPSG---LPP----QHGQAS 3094 P Q+ GRGFV HPVP+ HQ PP A PS +PP + Sbjct: 1002 PSQVQGRGFVPLPHPVPILDGGRHQPPPMQYGPTVQQRPAAPSSGQAMPPPGLVHNAPVP 1061 Query: 3093 GLPLTQLRPQGPGHFPQSGQPLNPPDHFQ-PPGGILGPGST-SFGRGPSHFGPPQRNFES 2920 G P TQL+PQ G P Q H + PPGGILGPGS SFGRG SHF PPQR+FE Sbjct: 1062 GQPSTQLQPQALGLLPHPAQQSRGSFHHEIPPGGILGPGSAASFGRGLSHFAPPQRSFEP 1121 Query: 2919 QSAGPLGHYHQGHVPPSHIAPPR-SQGEPVGGP---------FDAHGGLMARAPPHGPEV 2770 S GHY+QGH PSH P R SQGE +G P FD+HGG+M RAPPHGP+ Sbjct: 1122 PSVVSQGHYNQGHGLPSHAGPSRISQGELIGRPPLGPLPAGSFDSHGGMMVRAPPHGPDG 1181 Query: 2769 QMGPQRFNPMEAEIFPNPRPSFLDGRQPDLHLQAT-----------------RMNAPPXX 2641 Q P NP+E+EIF NPRP++ DGRQ D H+ + RMN Sbjct: 1182 QQRP--VNPVESEIFSNPRPNYFDGRQSDSHIPGSSERGPFGQPSGVQSNMMRMNGGLGI 1239 Query: 2640 XXXXXXXLRDERFKPPFDERPHSFPVEPGRR---RGEFEEDLKQFPRPSHLDPEAAPKFG 2470 L+DERFK S P EPGRR G+F EDLKQF R SHLD + PKFG Sbjct: 1240 ESSLPVGLQDERFK--------SLP-EPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFG 1290 Query: 2469 SYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGT 2290 +YFSSSRP++RG QGF MDAA G LDKAP GFNYD+G K SA + SRF PP HPGG Sbjct: 1291 NYFSSSRPLDRGSQGFVMDAAQGLLDKAPLGFNYDSGFK--SSAGTGTSRFFPPPHPGGD 1348 Query: 2289 PTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXX 2110 GER+R V +EDNV R D R HP+F G VP YGRH MDG PRSP REF Sbjct: 1349 ------GERSRAVGFHEDNVGRSDMARTHPNFLGSVPEYGRHHMDGLNPRSPTREFSGIP 1402 Query: 2109 XXXXXXXXXXXXXXPD--DIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILPSHLRR 1936 D DI + FNLPSD E+RFP+LPSHLRR Sbjct: 1403 HRGFGGLSGVPGRQSDLDDIDGRESRRFGEGSKTFNLPSD-------ESRFPVLPSHLRR 1455 Query: 1935 GEPERNVNMPMGEHIS--PGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPA 1762 GE E + M + I+ P P H R GDL GQDILPSHL+RGE+ G RN+PG LRFGEP Sbjct: 1456 GELEGPGELVMADPIASRPAPHHLRGGDLIGQDILPSHLQRGEHFGSRNIPGQLRFGEPV 1515 Query: 1761 GFGAFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGG 1582 F AF H RMGEL+GPGNFP LS GE FG NK HPR GEPGFRS+YSL GYPND G Sbjct: 1516 -FDAFLGHPRMGELSGPGNFPSRLSAGESFGGSNKSGHPRIGEPGFRSTYSLHGYPNDHG 1574 Query: 1581 FHL-GDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQ 1405 F GDMES DN RKRK SM WCRIC +DCETV+GLD+HSQTREHQ+MAMD+VLSIKQQ Sbjct: 1575 FRPPGDMESFDNSRKRKPLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMDIVLSIKQQ 1634 Query: 1404 NGKKQKLTSNDHSSVEDASKSRNAIAIFEGRRNKP 1300 N KKQKLTS DHS+ ED+SKS+ + G KP Sbjct: 1635 NAKKQKLTSKDHSTPEDSSKSKKGVLRGGGISIKP 1669 >ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma cacao] gi|508786600|gb|EOY33856.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 975 Score = 623 bits (1607), Expect = e-175 Identities = 464/1150 (40%), Positives = 561/1150 (48%), Gaps = 54/1150 (4%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 HAVTG Q+ L PQHPMH H Q H Q QH AQMQ+ + QQP QMRPPQ Sbjct: 17 HAVTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQ-QHPAQMQNSYPQQPPQMRPPQ 75 Query: 4410 SHAPIASQHQ-STLPSPGQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 H I++Q Q LPSPG + LQQ+ LH + QP V Sbjct: 76 PHVAISNQQQPGLLPSPGSM----LQQVHLH--SHQPALPVQQRPVMHPAASPMSQPYVQ 129 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVA 4054 G VQ Q+ QQGP+ Q QQ S RP GP QNVA Sbjct: 130 QQPLSTQPV--GLVQPQMLQQGPFVQ-QQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVA 186 Query: 4053 LSQ------SQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 S S N GR M P G QSQP P SA+G VK GANQ S+ QN Sbjct: 187 GSHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAAGTP----VKPVHLGANQP--SSYQNN 240 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRTA 3721 RTNNQ SG + MSE GD ++KNV+ +E +SSS TA Sbjct: 241 VFRTNNQ------------------SG-VTSQPMSEVPGDHGTDKNVAEQEADSSSPGTA 281 Query: 3720 KSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERK-HIGEDEDNNKVS-----DSLRT 3559 + + N + S L AD E+ + E D KS+D++ +G+D + +S +S RT Sbjct: 282 RKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKETPESRRT 341 Query: 3558 LGTDPNSHSMENGEPVIKQIVEEEVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSL 3379 +GTD H +PV K +V E + + +G VE KD P SL Sbjct: 342 VGTDLEQHR----DPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGP----------SL 387 Query: 3378 EGKEIQGQVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPG 3199 + +Q + ++ EQ+GK++KD + P + GF RG PP Sbjct: 388 KTPPLQ-EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGF-------------RGIPPSS 433 Query: 3198 QLHGRGFVQPSHPVPL------------------HQRPP------ALPSGLPPQHGQASG 3091 Q+ G++ PSH VP QRP A P GLP H Q G Sbjct: 434 QVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGLP-SHAQTPG 492 Query: 3090 LPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGSTSFGRGPSHFGPPQRNFESQSA 2911 LP Q RPQGPG Q L PP++ PPG SFGR PS++GP Sbjct: 493 LPPNQFRPQGPG------QALVPPENL-PPG--------SFGRDPSNYGPQ--------- 528 Query: 2910 GPLGHYHQGHVPPSHIAPPR-SQGEPVGG---------PFDAHGGLMARAPPHGPEVQMG 2761 G Y+QG PPS PR SQGEP+ G FD+HG AP +GPE Sbjct: 529 ---GPYNQG--PPSLSGAPRISQGEPLVGLSYGTPPLTAFDSHG-----APLYGPESHSV 578 Query: 2760 PQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFDER 2581 N ++ D RQ D + LR ER KP DE Sbjct: 579 QHSANMVDYHA---------DNRQLDPRASGLDSTST--------FSLRGERLKPVQDEC 621 Query: 2580 PHSFPVEPGRR--RGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMDAA 2407 + FP++ G R RG+FEEDLK FPRPSHLD E PKFGSY SSSRP++RGP GFGMD Sbjct: 622 SNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMG 681 Query: 2406 PGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNEDNVS 2227 P A +K PHGF+ FDP S PSRFLPPYHP ++ GE RPV L +D + Sbjct: 682 PRAQEKEPHGFS------FDPMIGSGPSRFLPPYHP------DDTGE--RPVGLPKDTLG 727 Query: 2226 RPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDIXXX 2047 R PDF G VP YGRHRMDG RSP RE+ Sbjct: 728 R-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGH-------------- 766 Query: 2046 XXXXXXXXXRPFNLPSDQIGNSFQ--ENRFPILPSHLRRGEPERNVNMPMGEHISPGPQH 1873 P D+I + +RFP LP HL RG E + M +H Sbjct: 767 --------------PGDEIDGRERRFSDRFPGLPGHLHRGGFESSDRM---------EEH 803 Query: 1872 FRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQH 1693 R+ D+ QD P++ RRGE++G N+PGHLR GEP GFG F SH R+GE GPGNF Sbjct: 804 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNF--- 860 Query: 1692 LSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHLGDMESLDNPRKRKSASMGWC 1513 HPR GEPGFRSS+SLQ +PNDGG + G M+S +N RKRK SMGWC Sbjct: 861 -------------RHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWC 907 Query: 1512 RICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKKQKLTSNDHSSVEDASKSRNA 1333 RICK+DCETVEGLDLHSQTREHQKMAMDMV++IK QN KKQKLTS+DHS D SKS+N Sbjct: 908 RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIK-QNAKKQKLTSSDHSIRNDTSKSKN- 965 Query: 1332 IAIFEGRRNK 1303 FEGR NK Sbjct: 966 -VKFEGRVNK 974 >ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590588563|ref|XP_007016233.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590588573|ref|XP_007016234.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786595|gb|EOY33851.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786596|gb|EOY33852.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786597|gb|EOY33853.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1408 Score = 623 bits (1607), Expect = e-175 Identities = 464/1150 (40%), Positives = 561/1150 (48%), Gaps = 54/1150 (4%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 HAVTG Q+ L PQHPMH H Q H Q QH AQMQ+ + QQP QMRPPQ Sbjct: 450 HAVTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQ-QHPAQMQNSYPQQPPQMRPPQ 508 Query: 4410 SHAPIASQHQ-STLPSPGQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 H I++Q Q LPSPG + LQQ+ LH + QP V Sbjct: 509 PHVAISNQQQPGLLPSPGSM----LQQVHLH--SHQPALPVQQRPVMHPAASPMSQPYVQ 562 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVA 4054 G VQ Q+ QQGP+ Q QQ S RP GP QNVA Sbjct: 563 QQPLSTQPV--GLVQPQMLQQGPFVQ-QQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVA 619 Query: 4053 LSQ------SQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 S S N GR M P G QSQP P SA+G VK GANQ S+ QN Sbjct: 620 GSHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAAGTP----VKPVHLGANQP--SSYQNN 673 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRTA 3721 RTNNQ SG + MSE GD ++KNV+ +E +SSS TA Sbjct: 674 VFRTNNQ------------------SG-VTSQPMSEVPGDHGTDKNVAEQEADSSSPGTA 714 Query: 3720 KSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERK-HIGEDEDNNKVS-----DSLRT 3559 + + N + S L AD E+ + E D KS+D++ +G+D + +S +S RT Sbjct: 715 RKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKETPESRRT 774 Query: 3558 LGTDPNSHSMENGEPVIKQIVEEEVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSL 3379 +GTD H +PV K +V E + + +G VE KD P SL Sbjct: 775 VGTDLEQHR----DPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGP----------SL 820 Query: 3378 EGKEIQGQVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPG 3199 + +Q + ++ EQ+GK++KD + P + GF RG PP Sbjct: 821 KTPPLQ-EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGF-------------RGIPPSS 866 Query: 3198 QLHGRGFVQPSHPVPL------------------HQRPP------ALPSGLPPQHGQASG 3091 Q+ G++ PSH VP QRP A P GLP H Q G Sbjct: 867 QVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGLP-SHAQTPG 925 Query: 3090 LPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGSTSFGRGPSHFGPPQRNFESQSA 2911 LP Q RPQGPG Q L PP++ PPG SFGR PS++GP Sbjct: 926 LPPNQFRPQGPG------QALVPPENL-PPG--------SFGRDPSNYGPQ--------- 961 Query: 2910 GPLGHYHQGHVPPSHIAPPR-SQGEPVGG---------PFDAHGGLMARAPPHGPEVQMG 2761 G Y+QG PPS PR SQGEP+ G FD+HG AP +GPE Sbjct: 962 ---GPYNQG--PPSLSGAPRISQGEPLVGLSYGTPPLTAFDSHG-----APLYGPESHSV 1011 Query: 2760 PQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFDER 2581 N ++ D RQ D + LR ER KP DE Sbjct: 1012 QHSANMVDYHA---------DNRQLDPRASGLDSTST--------FSLRGERLKPVQDEC 1054 Query: 2580 PHSFPVEPGRR--RGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMDAA 2407 + FP++ G R RG+FEEDLK FPRPSHLD E PKFGSY SSSRP++RGP GFGMD Sbjct: 1055 SNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMG 1114 Query: 2406 PGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNEDNVS 2227 P A +K PHGF+ FDP S PSRFLPPYHP ++ GE RPV L +D + Sbjct: 1115 PRAQEKEPHGFS------FDPMIGSGPSRFLPPYHP------DDTGE--RPVGLPKDTLG 1160 Query: 2226 RPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDIXXX 2047 R PDF G VP YGRHRMDG RSP RE+ Sbjct: 1161 R-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGH-------------- 1199 Query: 2046 XXXXXXXXXRPFNLPSDQIGNSFQ--ENRFPILPSHLRRGEPERNVNMPMGEHISPGPQH 1873 P D+I + +RFP LP HL RG E + M +H Sbjct: 1200 --------------PGDEIDGRERRFSDRFPGLPGHLHRGGFESSDRM---------EEH 1236 Query: 1872 FRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQH 1693 R+ D+ QD P++ RRGE++G N+PGHLR GEP GFG F SH R+GE GPGNF Sbjct: 1237 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNF--- 1293 Query: 1692 LSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHLGDMESLDNPRKRKSASMGWC 1513 HPR GEPGFRSS+SLQ +PNDGG + G M+S +N RKRK SMGWC Sbjct: 1294 -------------RHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWC 1340 Query: 1512 RICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKKQKLTSNDHSSVEDASKSRNA 1333 RICK+DCETVEGLDLHSQTREHQKMAMDMV++IK QN KKQKLTS+DHS D SKS+N Sbjct: 1341 RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIK-QNAKKQKLTSSDHSIRNDTSKSKN- 1398 Query: 1332 IAIFEGRRNK 1303 FEGR NK Sbjct: 1399 -VKFEGRVNK 1407 >ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citrus clementina] gi|557526921|gb|ESR38227.1| hypothetical protein CICLE_v10027683mg [Citrus clementina] Length = 1392 Score = 622 bits (1603), Expect = e-175 Identities = 465/1110 (41%), Positives = 560/1110 (50%), Gaps = 34/1110 (3%) Frame = -1 Query: 4530 GAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQSHAPIASQHQST-LPSPGQV 4354 G QHPM+ H PH Q +QMQ+ F QQ MRP QSHA I++Q ST LP GQV Sbjct: 442 GPLQHPMYVH----PHTGAQ--SQMQNQFPQQTPSMRPAQSHATISNQPLSTGLPPLGQV 495 Query: 4353 PTI-PLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXXXXXXXXXXXPSGSVQNQLH 4177 I P QQ+P+ P+A QPG V+ SG H Sbjct: 496 ANIPPAQQLPVRPHAPQPGVPVS-----QHPVMQPVQQPMPYQYVQQHLPFSGQ-----H 545 Query: 4176 QQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVAL-----SQSQNHFGRSMIP 4012 QQGP+ QPQ LRPQ P QNVA+ S + G+ + P Sbjct: 546 QQGPFVQPQ-------LRPQRPPQSLQLHPPAYSQPLQNVAVINGMQSHQPRNLGQPLTP 598 Query: 4011 --GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNYPLRTNNQGQPVFEQQPGYMQ 3838 G +Q QSA+ + V+ GANQ S+NQ+ T+NQ Q Sbjct: 599 NYGVHAQSYQQSAT----SLHVRPAQLGANQS--SSNQSNLFWTSNQVQ----------L 642 Query: 3837 QSVQQSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRTAKSDLNNPVITSGLRADSVEE 3661 S QQ+G K MSE ++++ K RE ESSS++TAK+D + T G A +V Sbjct: 643 SSEQQAGATSKPEMSEK--NEVAVKIAHEREAESSSEKTAKTDNFD---TPGPEAAAVGM 697 Query: 3660 KNLESEVDTKSIDDERKHIGEDEDNNKVSDSLRTLGTDPNSHSMENGEPVIKQIVEEEVT 3481 K +SE D K+ DE K ED+ N V S + TD SH EN +P I ++V+EEV Sbjct: 698 KVPKSETDVKAAVDEIKTEVEDK-TNVVDTSSKEFVTDRESHIAENVQP-INKMVKEEVI 755 Query: 3480 DSISEPSSGGKFVENKDQKDVPHNDLKQVENSSLEGKEIQGQVEIIGEQSGKLKKDAVNA 3301 +++ G K N D K H+ K+V+ L Q GEQS K++K Sbjct: 756 ENV----EGQKDSANVDIKQEEHSVSKEVQEEPLLKTSTMQQGTQFGEQSEKVQK----- 806 Query: 3300 EGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPGQLHGRGFVQPSHPV----PLHQRPPA 3133 E V A G+ P + PP GQ GFVQ + + L QR PA Sbjct: 807 EQKVPQAQGAQ------GPGAV--------PPAGQAQAGGFVQSAPSLYGSSTLQQR-PA 851 Query: 3132 LPSGLPPQHGQASGLPLTQLRPQGPGHFPQSGQPLN-PPDHFQ---PPGGILGPG-STSF 2968 PS + Q P PG PQ+ P P F+ PPGGI G + SF Sbjct: 852 APS-------------IFQAPP--PGAVPQTQAPTQFRPPMFKAEVPPGGIPVSGPAASF 896 Query: 2967 GRGPSHFGPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRSQGEPVGGPFDAHGGLMARAP 2788 GRGP H GP Q +FE P G Y+ GH+ PS + P + P+ G FD+H G M P Sbjct: 897 GRGPGHNGPHQHSFEPPLVAPQGPYNLGHLHPSPVGGPPQRSVPLSG-FDSHVGTMV-GP 954 Query: 2787 PHGPEVQMG-PQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAP-----------PX 2644 +GP M Q NPMEAE+F RP ++DGR+ D H ++ +P Sbjct: 955 AYGPGGPMDLKQPSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQRSPLGPPSGTRSNMMR 1014 Query: 2643 XXXXXXXXLRDERFKPPFDERPHSFPVEPGRR---RGEFEEDLKQFPRPSHLDPEAAPKF 2473 LRDERFK D R + FPV+P R RGEFEEDLKQF RPSHLD E PK Sbjct: 1015 MNGGPGSELRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKL 1074 Query: 2472 GSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGG 2293 GS+F SRP +RGP G+GMD P ++ G +YD GLK DP +SAPSRFLP YH Sbjct: 1075 GSHFLPSRPFDRGPHGYGMDMGPRPFER---GLSYDPGLKLDPMGASAPSRFLPAYH--- 1128 Query: 2292 TPTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXX 2113 +D R DS+ HPDF P YGR M G +PRS REF Sbjct: 1129 -----------------DDAAGRSDSSHAHPDFPRPGRAYGRRHMGGLSPRSSFREF--- 1168 Query: 2112 XXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILPSHLRRG 1933 P + R F D IGNSF ++RFP+LPSHLRRG Sbjct: 1169 ---------CGFGGLPGSLGGSRSVREDIGGREFRRFGDPIGNSFHDSRFPVLPSHLRRG 1219 Query: 1932 EPERNVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFG 1753 E E PG RTGDL GQ+ LPSHLRRGE LGP N LR GE G G Sbjct: 1220 EFE-----------GPG----RTGDLIGQEFLPSHLRRGEPLGPHN----LRLGETVGLG 1260 Query: 1752 AFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHL 1573 FP ARM EL GPGNFP PR GEPGFRSS+S QG+PNDGGF+ Sbjct: 1261 GFPGPARMEELGGPGNFPP----------------PRLGEPGFRSSFSHQGFPNDGGFYT 1304 Query: 1572 GDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKK 1393 GDMES+DN RKRK SMGWCRICKVDCETV+GLDLHSQTREHQKMAMDMVLSIK QN KK Sbjct: 1305 GDMESIDNSRKRKPPSMGWCRICKVDCETVDGLDLHSQTREHQKMAMDMVLSIK-QNAKK 1363 Query: 1392 QKLTSNDHSSVEDASKSRNAIAIFEGRRNK 1303 QKLTS D S +DA+KSRN F+GR K Sbjct: 1364 QKLTSGDRCSTDDANKSRN--VNFDGRGKK 1391 >ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X1 [Citrus sinensis] gi|568870502|ref|XP_006488441.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X2 [Citrus sinensis] gi|568870504|ref|XP_006488442.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X3 [Citrus sinensis] gi|568870506|ref|XP_006488443.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X4 [Citrus sinensis] Length = 1392 Score = 620 bits (1599), Expect = e-174 Identities = 465/1110 (41%), Positives = 559/1110 (50%), Gaps = 34/1110 (3%) Frame = -1 Query: 4530 GAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQSHAPIASQHQST-LPSPGQV 4354 G QHPM+ H PH Q +QMQ+ F QQ MRP QSHA I++Q ST LP GQV Sbjct: 442 GPLQHPMYVH----PHTGAQ--SQMQNQFPQQTPSMRPAQSHATISNQPLSTGLPPLGQV 495 Query: 4353 PTI-PLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXXXXXXXXXXXPSGSVQNQLH 4177 I P QQ+P+ P+A QPG V+ SG H Sbjct: 496 ANIPPAQQLPVRPHAPQPGVPVS-----QHPVMQPVQQPMPYQYVQQHLPFSGQ-----H 545 Query: 4176 QQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVAL-----SQSQNHFGRSMIP 4012 QQGP+ QPQ LRPQ P QNVA+ S + G+ + P Sbjct: 546 QQGPFVQPQ-------LRPQRPPQSLQLHPPAYSQPLQNVAVINGMQSHQPRNLGQPLTP 598 Query: 4011 --GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNYPLRTNNQGQPVFEQQPGYMQ 3838 G +Q QSA+ + V+ GANQ S+NQ+ T+NQ Q Sbjct: 599 NYGVHAQSYQQSAT----SLHVRPAQLGANQS--SSNQSNLSWTSNQVQ----------L 642 Query: 3837 QSVQQSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRTAKSDLNNPVITSGLRADSVEE 3661 S QQ+G K MSE ++++ K RE ESSS++TAK+D + T G A +V Sbjct: 643 SSEQQAGATSKPEMSEK--NEVAVKIAHEREAESSSEKTAKTDNFD---TPGPEAAAVGM 697 Query: 3660 KNLESEVDTKSIDDERKHIGEDEDNNKVSDSLRTLGTDPNSHSMENGEPVIKQIVEEEVT 3481 K +SE D K+ DE K ED+ N V S + TD SH EN +P I ++V+EEV Sbjct: 698 KVPKSETDVKAAVDEIKTEVEDK-TNVVDTSSKEFVTDRESHIAENVQP-INKMVKEEVI 755 Query: 3480 DSISEPSSGGKFVENKDQKDVPHNDLKQVENSSLEGKEIQGQVEIIGEQSGKLKKDAVNA 3301 +++ G K N D K H+ K+V+ L Q GEQS K++K Sbjct: 756 ENV----EGQKDSANVDIKQEEHSVSKEVQEEPLLKTSTMQQGTQFGEQSEKVQK----- 806 Query: 3300 EGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPGQLHGRGFVQPSHPV----PLHQRPPA 3133 E V A G+ P + PP GQ GFVQ + + L QR PA Sbjct: 807 EQKVPQAQGAQ------GPGAV--------PPAGQAQAGGFVQSAPSLYGSSTLQQR-PA 851 Query: 3132 LPSGLPPQHGQASGLPLTQLRPQGPGHFPQSGQPLN-PPDHFQ---PPGGILGPG-STSF 2968 PS + Q P PG PQ+ P P F+ PPGGI G + SF Sbjct: 852 APS-------------IFQAPP--PGAVPQTQAPTQFRPPMFKAEVPPGGIPVSGPAASF 896 Query: 2967 GRGPSHFGPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRSQGEPVGGPFDAHGGLMARAP 2788 GRGP H GP Q +FE P G Y+ GH PS + P + P+ G FD+H G M P Sbjct: 897 GRGPGHNGPHQHSFEPPLVAPQGPYNLGHPHPSPVGGPPQRSVPLSG-FDSHVGTMV-GP 954 Query: 2787 PHGPEVQMG-PQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAP-----------PX 2644 +GP M Q NPMEAE+F RP ++DGR+ D H ++ +P Sbjct: 955 AYGPGGPMDLKQPSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQRSPLGPPSGTRSNMMR 1014 Query: 2643 XXXXXXXXLRDERFKPPFDERPHSFPVEPGRR---RGEFEEDLKQFPRPSHLDPEAAPKF 2473 LRDERFK D R + FPV+P R RGEFEEDLKQF RPSHLD E PK Sbjct: 1015 MNGGPGSELRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKL 1074 Query: 2472 GSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGG 2293 GS+F SRP +RGP G+GMD P ++ G +YD GLK DP +SAPSRFLP YH Sbjct: 1075 GSHFLPSRPFDRGPHGYGMDMGPRPFER---GLSYDPGLKLDPMGASAPSRFLPAYH--- 1128 Query: 2292 TPTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXX 2113 +D R DS+ HPDF P YGR M G +PRS REF Sbjct: 1129 -----------------DDAAGRSDSSHAHPDFPRPGRAYGRRHMGGLSPRSSFREF--- 1168 Query: 2112 XXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILPSHLRRG 1933 P + R F D IGNSF ++RFP+LPSHLRRG Sbjct: 1169 ---------CGFGGLPGSLGGSRSVREDIGGREFRRFGDPIGNSFHDSRFPVLPSHLRRG 1219 Query: 1932 EPERNVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFG 1753 E E PG RTGDL GQ+ LPSHLRRGE LGP N LR GE G G Sbjct: 1220 EFE-----------GPG----RTGDLIGQEFLPSHLRRGEPLGPHN----LRLGETVGLG 1260 Query: 1752 AFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHL 1573 FP ARM EL GPGNFP PR GEPGFRSS+S QG+PNDGGF+ Sbjct: 1261 GFPGPARMEELGGPGNFPP----------------PRLGEPGFRSSFSRQGFPNDGGFYT 1304 Query: 1572 GDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKK 1393 GDMES+DN RKRK SMGWCRICKVDCETV+GLDLHSQTREHQKMAMDMVLSIK QN KK Sbjct: 1305 GDMESIDNSRKRKPPSMGWCRICKVDCETVDGLDLHSQTREHQKMAMDMVLSIK-QNAKK 1363 Query: 1392 QKLTSNDHSSVEDASKSRNAIAIFEGRRNK 1303 QKLTS D S +DA+KSRN F+GR K Sbjct: 1364 QKLTSGDRCSTDDANKSRN--VNFDGRGKK 1391 >ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma cacao] gi|508786601|gb|EOY33857.1| Uncharacterized protein isoform 8 [Theobroma cacao] Length = 972 Score = 614 bits (1583), Expect = e-172 Identities = 462/1150 (40%), Positives = 558/1150 (48%), Gaps = 54/1150 (4%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 HAVTG Q+ L PQHPMH H Q H Q QH AQMQ+ + QQP QMRPPQ Sbjct: 17 HAVTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQ-QHPAQMQNSYPQQPPQMRPPQ 75 Query: 4410 SHAPIASQHQ-STLPSPGQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 H I++Q Q LPSPG + LQQ+ LH + QP V Sbjct: 76 PHVAISNQQQPGLLPSPGSM----LQQVHLH--SHQPALPVQQRPVMHPAASPMSQPYVQ 129 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVA 4054 G VQ Q+ QQGP+ Q QQ S RP GP QNVA Sbjct: 130 QQPLSTQPV--GLVQPQMLQQGPFVQ-QQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVA 186 Query: 4053 LSQ------SQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 S S N GR M P G QSQP P SA+G VK GANQ S+ QN Sbjct: 187 GSHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAAGTP----VKPVHLGANQP--SSYQNN 240 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRTA 3721 RTNNQ SG + MSE GD ++KNV+ +E +SSS TA Sbjct: 241 VFRTNNQ------------------SG-VTSQPMSEVPGDHGTDKNVAEQEADSSSPGTA 281 Query: 3720 KSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERK-HIGEDEDNNKVS-----DSLRT 3559 + + N + S L AD E+ + E D KS+D++ +G+D + +S +S RT Sbjct: 282 RKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKETPESRRT 341 Query: 3558 LGTDPNSHSMENGEPVIKQIVEEEVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSL 3379 +GTD H +PV K +V E + + +G VE KD P SL Sbjct: 342 VGTDLEQHR----DPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGP----------SL 387 Query: 3378 EGKEIQGQVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPG 3199 + +Q + ++ EQ+GK++KD + P + GF RG PP Sbjct: 388 KTPPLQ-EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGF-------------RGIPPSS 433 Query: 3198 QLHGRGFVQPSHPVPL------------------HQRPP------ALPSGLPPQHGQASG 3091 Q+ G++ PSH VP QRP A P GLP H Q G Sbjct: 434 QVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGLP-SHAQTPG 492 Query: 3090 LPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGSTSFGRGPSHFGPPQRNFESQSA 2911 LP Q RPQGPG Q L PP++ PPG SFGR PS++GP Sbjct: 493 LPPNQFRPQGPG------QALVPPENL-PPG--------SFGRDPSNYGPQ--------- 528 Query: 2910 GPLGHYHQGHVPPSHIAPPR-SQGEPVGG---------PFDAHGGLMARAPPHGPEVQMG 2761 G Y+QG PPS PR SQGEP+ G FD+HG AP +GPE Sbjct: 529 ---GPYNQG--PPSLSGAPRISQGEPLVGLSYGTPPLTAFDSHG-----APLYGPESHSV 578 Query: 2760 PQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFDER 2581 N ++ D RQ D + LR ER KP DE Sbjct: 579 QHSANMVDYHA---------DNRQLDPRASGLDSTST--------FSLRGERLKPVQDEC 621 Query: 2580 PHSFPVEPGRR--RGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMDAA 2407 + FP++ G R RG+FEEDLK FPRPSHLD E PKFGSY SSSRP++RGP GFGMD Sbjct: 622 SNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMG 681 Query: 2406 PGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNEDNVS 2227 P A +K PHGF+ FDP S PSRFLPPYHP ++ GE RPV L +D + Sbjct: 682 PRAQEKEPHGFS------FDPMIGSGPSRFLPPYHP------DDTGE--RPVGLPKDTLG 727 Query: 2226 RPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDIXXX 2047 R PDF G VP YGRHRMDG RSP RE+ Sbjct: 728 R-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGH-------------- 766 Query: 2046 XXXXXXXXXRPFNLPSDQIGNSFQ--ENRFPILPSHLRRGEPERNVNMPMGEHISPGPQH 1873 P D+I + +RFP LP HL RG E + M +H Sbjct: 767 --------------PGDEIDGRERRFSDRFPGLPGHLHRGGFESSDRM---------EEH 803 Query: 1872 FRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQH 1693 R+ D+ QD P++ RRGE++G N+PGHLR GEP GFG F SH R+GE GPGNF Sbjct: 804 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNF--- 860 Query: 1692 LSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHLGDMESLDNPRKRKSASMGWC 1513 HPR GEPGFRSS+SLQ +PNDGG + G M+S +N RKRK SMGWC Sbjct: 861 -------------RHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWC 907 Query: 1512 RICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKKQKLTSNDHSSVEDASKSRNA 1333 RICK+DCETVEGLDLHSQTREHQKMAMDMV++IK QN KKQKL DHS D SKS+N Sbjct: 908 RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIK-QNAKKQKL---DHSIRNDTSKSKN- 962 Query: 1332 IAIFEGRRNK 1303 FEGR NK Sbjct: 963 -VKFEGRVNK 971 >emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera] Length = 1131 Score = 575 bits (1482), Expect = e-161 Identities = 450/1209 (37%), Positives = 562/1209 (46%), Gaps = 112/1209 (9%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGA-PQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPP 4414 HAVTG Q+ LG Q PMH H Q+ F QQ QMRP Sbjct: 78 HAVTGHHSFPQPRPQQQMPLGGMQQQPMHMH--------------PQAQFPQQSPQMRPS 123 Query: 4413 QSHAPIASQHQSTLPSPGQVPTI-PLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXX 4237 Q+HA + Q + LP PGQ + P QQ+P+HP+ QQ GH V+ Sbjct: 124 QAHAQ-SQQQSALLPLPGQAQNVLPPQQLPVHPH-QQAGHPVHQRAAMQPIQQSLPHQXV 181 Query: 4236 XXXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNV 4057 G+ QNQLHQQG + QP M S LRPQ P Q V Sbjct: 182 QQPPL-------GTGQNQLHQQGSFMQPPTPTMQSQLRPQAPPQSWQQHSHAYPQPQQKV 234 Query: 4056 AL-----SQSQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 A+ Q + GR +P G Q QP PQS +G +GA Q++ G NQ Sbjct: 235 AMLHGMQPQLPQNVGRPGMPNQGVQPQPFPQSQAGLSGAVQLRPMHLGPNQ--------- 285 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQ-QSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRT 3724 P GQ +++QS Q G +K T E D LS+K V +E ES S++T Sbjct: 286 PSANQTLGQ--------HLEQSAHPQPGLNVKQTTFEKPDDDLSKKGVGGQEGESFSEKT 337 Query: 3723 AKSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERKHIGEDED-----NN---KVSDS 3568 A+ D N TSG+ +++VE ++SE D KS+D+++K GEDED NN ++ +S Sbjct: 338 AREDANGVAATSGIESNTVE---IKSETDMKSMDEKQKTTGEDEDTISRINNSAKEIPES 394 Query: 3567 LRTLGTDPNSHSMENGEPVIKQIVEEEVTDSISEPSSGGKFV-----ENKDQKDVPHNDL 3403 +R LG+DP + E+GEPVIKQ+V+EEV S E S GGK + + KD+ VP + Sbjct: 395 MRALGSDPMQQASEDGEPVIKQMVKEEVIKSTVERSPGGKSIGIVVEDQKDELSVPPKQV 454 Query: 3402 KQVENSSLEGKEIQG----------QVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLS 3253 +QVE+S L+ KEIQ QVEI+ E GKL+KD+ +A GV+ ++RG + Sbjct: 455 EQVEHSLLQDKEIQNGLLMKNPPIQQVEILDEMGGKLQKDSGDASGVMQLFTATNRGTEA 514 Query: 3252 VHPSSAPVP--------------------------------------------EHRGHPP 3205 V P AP+P E+RG PP Sbjct: 515 VPP--APIPDSSAQNATPRGSVSVSERKMLNQPGNQERNLLQAPTMPQGPSNDEYRGFPP 572 Query: 3204 PGQLHGRGFVQPSHPVPL-----HQRPP-----------ALPSG---LPP----QHGQAS 3094 P Q+ GRGFV HPVP+ HQ PP A PS +PP + Sbjct: 573 PSQVQGRGFVPLPHPVPILDGGRHQPPPMQYGPTVQQRPAAPSSGQAMPPPGLVHNAPVP 632 Query: 3093 GLPLTQLRPQGPGHFPQSGQPLNPPDHFQ-PPGGILGPGST-SFGRGPSHFGPPQRNFES 2920 G P TQL+PQ G P Q H + PPGGILGPGS SFGRG SHF PPQR+FE Sbjct: 633 GQPSTQLQPQALGLLPHPAQQSRGSFHHEIPPGGILGPGSAASFGRGLSHFAPPQRSFEP 692 Query: 2919 QSAGPLGHYHQGHVPPSHIAPPR-SQGEPVGGPFDAHGGLMARAPPHGPEVQMGPQRFNP 2743 S GHY+QGH PSH P R SQGE +G PP GP Sbjct: 693 PSVVSQGHYNQGHGLPSHAGPSRISQGELIG------------RPPLGPL---------- 730 Query: 2742 MEAEIFPNPRPSFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFDERPHSFPV 2563 P SF D H + APP P +RP V Sbjct: 731 --------PAGSF------DSH-GGMMVRAPPHG--------------PDGQQRP----V 757 Query: 2562 EPGRRRGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMDAAPGALDKAP 2383 P E ++ PRP++ D + S+ S ERGP G P Sbjct: 758 NP------VESEIFSNPRPNYFDGRQSD---SHIPGSS--ERGPFG-----QPSGXQSNM 801 Query: 2382 HGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPT-----LNEAGERARPVRLNEDNVSRPD 2218 N G++ RF PG + + + +R L+ D V + Sbjct: 802 MRMNGGLGIESSLPVGLQDERFKSLPEPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFG 861 Query: 2217 S--TRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDIXXXX 2044 + + P G G+ G ++P+ DDI Sbjct: 862 NYFSSSRPLDRGS-QGFVMDAAQGLLDKAPLGFNYDSGFKSSAGTGTSRQSDLDDIDGRE 920 Query: 2043 XXXXXXXXRPFNLPSDQIGNSFQENRFPILPSHLRRGEPERNVNMPMGEHISPGPQHFRT 1864 + FNLPSD E+RFP+LPSHLRR Sbjct: 921 SRRFGEGYQTFNLPSD-------ESRFPVLPSHLRR------------------------ 949 Query: 1863 GDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQHLST 1684 DILPSHL+RGE+ G RN+PG LRFGEP F AF H RMGEL+GPGNFP LS Sbjct: 950 ------DILPSHLQRGEHFGSRNIPGQLRFGEPV-FDAFLGHPRMGELSGPGNFPSRLSA 1002 Query: 1683 GEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHL-GDMESLDNPRKRKSASMGWCRI 1507 GE FG NK HPR GEPGFRS+YSL GYPND GF GDMES DN RKRK SM WCRI Sbjct: 1003 GESFGGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNSRKRKPLSMAWCRI 1062 Query: 1506 CKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKKQKLTSNDHSSVEDASKSRNAIA 1327 C +DCETV+GLD+HSQTREHQ+MAMD+VLSIKQQN KKQKLTS DHS+ ED+SKS+ + Sbjct: 1063 CNIDCETVDGLDMHSQTREHQQMAMDIVLSIKQQNAKKQKLTSKDHSTPEDSSKSKKGVL 1122 Query: 1326 IFEGRRNKP 1300 G KP Sbjct: 1123 RGGGISIKP 1131 >ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa] gi|550331020|gb|ERP56830.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa] Length = 1315 Score = 554 bits (1427), Expect = e-154 Identities = 431/1119 (38%), Positives = 519/1119 (46%), Gaps = 23/1119 (2%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 +AVTG Q+ GA +H Q P Q QMQS F QQ PQ Sbjct: 412 NAVTGHHSYQQPQIHQQMQTGALKHS-----QGGPQPHSQQPVQMQSQFPQQSSLWPQPQ 466 Query: 4410 SHAPIAS-QHQSTLPSPGQVPTIP--LQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXX 4240 HA + + Q LPS GQVP IP LQQ P+H +A QPG V Sbjct: 467 YHAAVQNLQQPGLLPSQGQVPNIPPALQQ-PIHSHAHQPGLPVQQRPGMQPTPQPMHQQY 525 Query: 4239 XXXXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQN 4060 G+V NQ HQQGPY Q QQL + LRPQG QN Sbjct: 526 AQHQQPFSGQPW-GAVHNQAHQQGPYVQQQQLHPLTQLRPQGLPQSFQQPSHAYPHPQQN 584 Query: 4059 VALSQSQN-HFGRSMI--PGAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNYPLR 3889 V L + H +S+ PG +Q PQSASG QV++ GANQ + L+ Sbjct: 585 VLLPHGAHPHQAKSLAVGPGLPAQSYPQSASGM----QVRSIQIGANQQSGNI-----LK 635 Query: 3888 TNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFREESSSQRTAKSDL 3709 TNNQ + +QQ G S Q+ G I K E S+Q+T K +L Sbjct: 636 TNNQVELSSDQQSGV--SSRQRQGDIEKGAEGEL----------------SAQKTIKKEL 677 Query: 3708 NNPVITSGLRADSVEEKNLESEVDTKSIDDERKHIGEDEDNNKVSDSLRTLGTDPNSHSM 3529 N+ + +GL AD+ E K ++SE D K +DD+ K GE +D P S + Sbjct: 678 ND--LDAGLAADASEMKTIKSESDLKQVDDKNKPTGEAKDV-------------PESLAA 722 Query: 3528 ENGEPVIKQIVEEEVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSLEGKE---IQG 3358 NGE IKQ V+EE D E Q DV + D ++VE S E K+ ++ Sbjct: 723 ANGESSIKQ-VKEEHRDGADE------------QNDVSNADHEKVELSVSEHKDGPLLET 769 Query: 3357 QVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPGQLHGRGF 3178 + EQ KL+KD P + S GF PP G + + Sbjct: 770 APSHLEEQIMKLQKDKT-------PTSQSFGGF----------------PPNGHVQSQSV 806 Query: 3177 VQPSH----PVPLHQRPPALPSGLPPQHGQASGLPLTQLRPQGPGHFPQSGQPLNPPDHF 3010 P+P+H P A Q RP GP S PL PP H Sbjct: 807 SAVDQGKLEPLPIHHGPSA-----------------AQQRPVGPSLVQAS--PLGPPHHM 847 Query: 3009 QPPG------GILGPGSTSFGRGPSHFGPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRS 2848 Q PG G LGPG PSH+GPPQ G Y PPS Sbjct: 848 QLPGHPPTQHGRLGPGHV-----PSHYGPPQ-----------GAYPHAPAPPS------- 884 Query: 2847 QGEPVGGPFDAHGGLMARAPPHGPEVQMGPQRFNPMEAEIFPNPRPSFLDGRQPDLHLQA 2668 QGE R P H EA +F N RP + DGRQ + Sbjct: 885 QGE--------------RTPSH------------VHEATMFANQRPKYPDGRQ-GTYSNV 917 Query: 2667 TRMNAPPXXXXXXXXXLRDERFKPPFDERPHSFPVEPGRR---RGEFEEDLKQFPRPSHL 2497 MN +RF DE + FP P +GEFEEDLK FPRPSHL Sbjct: 918 VGMNGAQGP--------NSDRFSSLPDEHLNPFPRGPAHHNVHQGEFEEDLKHFPRPSHL 969 Query: 2496 DPEAAPKFGSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRF 2317 D E PK S+F SSRP++RGP+GFG+D AP LDK HGFNYD+GL +P SAP RF Sbjct: 970 DTEPVPKSSSHFPSSRPLDRGPRGFGVDGAPRPLDKGSHGFNYDSGLNMEPLGGSAPPRF 1029 Query: 2316 LPPYHPGGTPTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGP-VPGYGRHRMDGSAPR 2140 PPYH ++A + ++ R D R P F GP +PGY MD APR Sbjct: 1030 FPPYHHDKALHPSDAEVS---LGYHDSLAGRSDFARTRPGFLGPPIPGYDHRHMDNLAPR 1086 Query: 2139 SPVREFXXXXXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFP 1960 SPVR++ DDI D+ +S +++RFP Sbjct: 1087 SPVRDYPGMPTRRFGALPGL-----DDIDGRDPHRF----------GDKFSSSLRDSRFP 1131 Query: 1959 ILPSHLRRGEPERNVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHL 1780 + PSHLRRGE E N+ MGEH+S GDL G D P+HLRRGE+LGPRNLP HL Sbjct: 1132 VFPSHLRRGELEGPGNLHMGEHLS--------GDLMGHDGRPAHLRRGEHLGPRNLPSHL 1183 Query: 1779 RFGEPAGFGAFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQG 1600 GEP FGAFP HARMGELAGPGNF H + GEPGFRSS+ Sbjct: 1184 WVGEPGNFGAFPGHARMGELAGPGNFYHH----------------QLGEPGFRSSF---- 1223 Query: 1599 YPNDGGFHLGDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVL 1420 GG + GD++ DN RKRK SMGWCRICKVDCETVE LDLHSQTREHQKMA+DMV+ Sbjct: 1224 ----GGNYAGDLQFFDNSRKRK-PSMGWCRICKVDCETVEALDLHSQTREHQKMALDMVV 1278 Query: 1419 SIKQQNGKKQKLTSNDHSSVEDASKSRNAIAIFEGRRNK 1303 +IK QN KK K T HSS+ED SKSRN A FEGR NK Sbjct: 1279 TIK-QNAKKHKSTPCHHSSLEDKSKSRN--ASFEGRGNK 1314 >ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica] gi|462400592|gb|EMJ06149.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica] Length = 1334 Score = 543 bits (1400), Expect = e-151 Identities = 440/1139 (38%), Positives = 532/1139 (46%), Gaps = 55/1139 (4%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQ-HPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPP 4414 HAVTG + GAPQ H MH +PH Q Q Q+QS F QQP MRPP Sbjct: 435 HAVTGNHLYLQPHLHQPVQSGAPQQHTMHLQSHGMPHSQSQTPVQIQSQFPQQPPLMRPP 494 Query: 4413 QSHAPIASQHQSTLPSPGQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 SH + PN QQP L + Sbjct: 495 PSHTTV-------------------------PNQQQPALLPSP----------------- 512 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSH------------------------- 4129 G +QN + QQ P+HS+ Sbjct: 513 -----------GQIQN-------INPAQQQPVHSYGHPPGNTVHQRPHMQAVQQPIPQQY 554 Query: 4128 --------------LRPQGPTXXXXXXXXXXXXXXQNVALSQSQNHF-----GRSMIP-- 4012 LRPQG + QNV LSQ H GR M+P Sbjct: 555 FHHQPFVQQQPPTQLRPQGQSHSFPQHIHASTQSQQNVTLSQGIQHTQSNLGGRPMMPIH 614 Query: 4011 GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNYPLRTNNQGQPVFEQQPGYMQQS 3832 G QSQ Q T G MH A + STNQN +RTNN G QS Sbjct: 615 GVQSQTYAQ-----TAGGVYMRPMHPA-ANLSSTNQNNMVRTNNLG------------QS 656 Query: 3831 VQQSGPIIKSTMSESQGDQLSEKNVSFREESSSQRTAKSDLNNPVITSGLRADSVEEKNL 3652 SGP T SE Q +Q S E S+Q+ AK +++ S + AD+ E K Sbjct: 657 GANSGP----TTSERQAEQES--------EFSAQQNAKKVVHDVGTASAVVADA-EVKTA 703 Query: 3651 ESEVDTKSIDDERKHIGEDEDNNKVSDSLRTLGTDPNSHSMENGEPVIKQIVEEEVTDSI 3472 +SE D KSID+E K GED+ + S P+ H++ENGE V K I++EE D Sbjct: 704 KSETDMKSIDNENKPTGEDKTIQGDTSSKEI----PDIHALENGESVSKSILKEEGVDG- 758 Query: 3471 SEPSSGGKFVENKDQKDVPHNDLKQVENSSLEGKEIQGQVEIIGEQSGKLKKDAVNAEGV 3292 D +V +D+KQ E + +E Q + EQ L+KD A G Sbjct: 759 -----------TLDHSNVSISDMKQRELKEIPSEEAQ----LREEQGWMLQKD---ASGD 800 Query: 3291 VLPANGSDRGFLSVHPSSAPVPEHRGHPPPGQLHGRGFVQPSHPVPLHQRPPA-----LP 3127 P G+D G +V +SAP+ + H P HG P L QRP A +P Sbjct: 801 PQPFIGTDEGSQAV-STSAPISDQGKHLPH---HG--------PTTLPQRPGAPLLLQVP 848 Query: 3126 SGLPPQHGQASGLPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGSTSFGRGPSHF 2947 G PP H Q G LRP GP H P GQP + +HFQP GG LG G++S GR S + Sbjct: 849 PG-PPCHTQGPG---HHLRPPGPAHVP--GQPFHSSEHFQPHGGNLGFGASS-GRA-SQY 900 Query: 2946 GPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRSQGEPVGGPFDAHGGLMARAPPHGPEVQ 2767 G PQ + E QS P G Y++GH+P PP S FD+HGG+M+RA P G Sbjct: 901 G-PQGSIELQSVTPHGPYNEGHLP----LPPTS-------AFDSHGGMMSRAAPIG---- 944 Query: 2766 MGPQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFD 2587 +PS +H RMN P RDERFK Sbjct: 945 -----------------QPS-------GIHPNMLRMNGTPGLDSSSTHGPRDERFKAFPG 980 Query: 2586 ERPHSFPVEPGRR---RGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGM 2416 ER + FPV+P R R EFE+DLKQFPRPS+LD E KFG+Y SSRP Sbjct: 981 ERLNPFPVDPTRHVIDRVEFEDDLKQFPRPSYLDSEPVAKFGNY--SSRP---------- 1028 Query: 2415 DAAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNED 2236 D+APHGF YD+G DP A +APSRFL PY GG+ N+AG+ Sbjct: 1029 ------FDRAPHGFKYDSGPHTDPLAGTAPSRFLSPYRLGGSVHGNDAGD---------- 1072 Query: 2235 NVSRPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDI 2056 R + T HPDF GR +DG APRSPVR++ PDD Sbjct: 1073 -FGRMEPTHGHPDF------VGRRLVDGLAPRSPVRDY------PGLPPHGFRGFGPDDF 1119 Query: 2055 XXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILPSHLRRGEPERNVNMPMGEHISPGPQ 1876 R F+ D +GN F E RF LP H RRGE E N+ M +H Sbjct: 1120 ----------DGREFHRFGDPLGNQFHEGRFSNLPGHFRRGEFEGPGNLRMVDH------ 1163 Query: 1875 HFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQ 1696 R D GQD P HLRRG++LGP NL EP GFG+ H+ MG++AGPGNF Sbjct: 1164 --RRNDFIGQDGHPGHLRRGDHLGPHNL------REPLGFGS--RHSHMGDMAGPGNF-- 1211 Query: 1695 HLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHLGDMESLDNPRKRKSASMGW 1516 EPF GN+P+HPR GEPGFRSS+SLQ +PNDG + GD+ES D+ RKRK ASMGW Sbjct: 1212 -----EPFR-GNRPNHPRLGEPGFRSSFSLQRFPNDGTY-TGDLESFDHSRKRKPASMGW 1264 Query: 1515 CRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKKQKLTSNDHSSVEDASKSR 1339 CRICKVDCETVEGLDLHSQTREHQKMAMDMV SIK QN KKQKLTS D S +EDA+KS+ Sbjct: 1265 CRICKVDCETVEGLDLHSQTREHQKMAMDMVRSIK-QNAKKQKLTSGDQSLLEDANKSK 1322 >ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus communis] gi|223540292|gb|EEF41863.1| hypothetical protein RCOM_0731250 [Ricinus communis] Length = 1329 Score = 541 bits (1393), Expect = e-150 Identities = 412/1116 (36%), Positives = 521/1116 (46%), Gaps = 20/1116 (1%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 HAVTG QL LG QHP+H + Q P QPQ F QQ +RPPQ Sbjct: 426 HAVTGHHSYPQPQPQQQLQLGGLQHPVH-YAQGGP--QPQ--------FPQQSPLLRPPQ 474 Query: 4410 SHAPIASQHQS-TLPSPGQVPTIP-LQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXX 4237 SH P+ + QS LPSPGQVP +P QQ P+ +AQQPG V+ Sbjct: 475 SHVPVQNPQQSGLLPSPGQVPNVPPAQQQPVQAHAQQPGLPVHQLPVMQSVQQPIHQQYV 534 Query: 4236 XXXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNV 4057 G VQNQ+HQQG Y Q Q L HS LRPQGP+ Sbjct: 535 QQQPPFPGQAL-GPVQNQVHQQGAYMQ-QHLHGHSQLRPQGPSHAYTQPLQNVPLPHGTQ 592 Query: 4056 ALSQSQNHFGRSMIPGAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNYPLRTNNQ 3877 A Q+QN GR G + P P S+ G QV+ GA+Q R NNQ Sbjct: 593 A-HQAQNLGGRPPY-GVPTYPHPHSSVGM----QVRPMQVGADQ-----QSGNAFRANNQ 641 Query: 3876 GQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFREESSSQRTAKSDLNNPV 3697 MQ S +Q I S QGD + EK S +SSSQ+ + D N+ Sbjct: 642 -----------MQLSSEQPSGAISRPTSNRQGDDIIEK--SSEADSSSQKNVRRDPNDLD 688 Query: 3696 ITSGLRADSVEEKNLESEVDTKSIDDERKHIGE-DEDNNKVSDSLRTLGTDPNSHSMENG 3520 + SGL +D + K + SE + K +DD+ K I E E+ K +D + + N Sbjct: 689 VASGLGSDVSDLKTVISESNLKPVDDDNKSINEVKEEPKKGNDDQKDISNTDN------- 741 Query: 3519 EPVIKQIVEEEVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSLEGKEIQGQVEIIG 3340 D+ + G ++N+ + H L+ S G+ + Q Sbjct: 742 -------------DAEDKGVKDGPVMKNRPLPEAEH--LEDQSMKSQRGRNVTPQ----- 781 Query: 3339 EQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPGQLHGRGFVQPSHP 3160 G + V EG+ P++ S P+ E PP V P P Sbjct: 782 HSGGFILHGQVQGEGLAQPSH------------SIPIAEQGKQQPP--------VIPHGP 821 Query: 3159 VPLHQRP--PALPSGLPP---QHGQASGLPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGG 2995 L QRP +L + PP HGQ G P ++RP GPGH P + + G Sbjct: 822 SALQQRPIGSSLLTAPPPGSLHHGQIPGHPSARVRPLGPGHIPHGPEVSS--------AG 873 Query: 2994 ILGPGSTSF-GRGPSHFGPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRSQGEPVGGPFD 2818 + G GST GRG SH+G G Y QGH PS Sbjct: 874 MTGLGSTPITGRGGSHYGLQ------------GTYTQGHALPS----------------- 904 Query: 2817 AHGGLMARAPPHGPEVQMGPQRFNPMEAEIFPNPRPSFLDGRQPD-------LHLQATRM 2659 A P+G + M F N RP++ DG++ D +H A RM Sbjct: 905 -----QADRTPYGHDTDM------------FANQRPNYTDGKRLDPLGQQSGMHSNAMRM 947 Query: 2658 NAPPXXXXXXXXXLRDERFKPPFDERPHSFPVEPGRR---RGEFEEDLKQFPRPSHLDPE 2488 N P LRD+RF+P DE + FP +P +R R EFEEDLK F RPS LD + Sbjct: 948 NGAPGMDSSSALGLRDDRFRPFSDEYMNPFPKDPSQRIVDRREFEEDLKHFSRPSDLDTQ 1007 Query: 2487 AAPKFGSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPP 2308 + KFG+ FSSSRP++RGP LDK HG NYD+G+K + PSRF PP Sbjct: 1008 STTKFGANFSSSRPLDRGP-----------LDKGLHGPNYDSGMKLESLGGPPPSRFFPP 1056 Query: 2307 YHPGGTPTLNEAGERARPVRLNEDNVSR-PDSTRKHPDFNGPVPGYGRHRMDGSAPRSPV 2131 YH G N+ ER+ + +++ + R PDS R HP+F GP Y R DG APRSP Sbjct: 1057 YHHDGLMHPNDIAERS--IGFHDNTLGRQPDSVRAHPEFFGPGRRYDRRHRDGMAPRSPG 1114 Query: 2130 REFXXXXXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILP 1951 R++ DDI S + G+SF +RFP+LP Sbjct: 1115 RDY-----PGVSSRGFGAIPGLDDID--------------GRESRRFGDSFHGSRFPVLP 1155 Query: 1950 SHLRRGEPERNVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFG 1771 SH+R GE E GP QD +H RRGE+LG N+ R G Sbjct: 1156 SHMRMGEFE-------------GP---------SQDGFSNHFRRGEHLGHHNMRN--RLG 1191 Query: 1770 EPAGFGAFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPN 1591 EP GFGAFP A MG+L+G GNF +PR GEPGFRSS+S +G+P Sbjct: 1192 EPIGFGAFPGPAGMGDLSGTGNF----------------FNPRLGEPGFRSSFSFKGFPG 1235 Query: 1590 DGGFHLGDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIK 1411 DGG + G++ES DN R+RKS+SMGWCRICKVDCETVEGLDLHSQTREHQK AMDMV++IK Sbjct: 1236 DGGIYAGELESFDNSRRRKSSSMGWCRICKVDCETVEGLDLHSQTREHQKRAMDMVVTIK 1295 Query: 1410 QQNGKKQKLTSNDHSSVEDASKSRNAIAIFEGRRNK 1303 QN KKQKL +NDHSSV+DASKS+N EGR NK Sbjct: 1296 -QNAKKQKLANNDHSSVDDASKSKN--TSIEGRGNK 1328 >ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa] gi|222845587|gb|EEE83134.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa] Length = 1327 Score = 532 bits (1371), Expect = e-148 Identities = 422/1119 (37%), Positives = 512/1119 (45%), Gaps = 23/1119 (2%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 HAVTG Q+ LGAPQHP + P Q Q QMQS F QQP + PPQ Sbjct: 418 HAVTGHHSYLQPQIHQQMPLGAPQHP-----RGGPQSQSQQPVQMQSQFIQQPPLLPPPQ 472 Query: 4410 SHAPIASQHQ-STLPSPGQVPTIP-LQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXX 4237 SHA + Q LPSP QVP+IP QQ P+H +A QPG V Sbjct: 473 SHAAFQNPQQPGLLPSPVQVPSIPPAQQQPVHSHADQPGLPVQQRPVMQPIVQPMNQQYV 532 Query: 4236 XXXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNV 4057 G+V NQ+H QG Y Q + L P GP QNV Sbjct: 533 QHQQPFPGQPW-GAVHNQMHHQGLYGQQHP---QTQLHPHGPVQSFQQPSHAYPHPQQNV 588 Query: 4056 ALSQSQN-HFGRSMIPGAQSQP----LPQSASGPTGAGQVKTTMHGANQHQLSTNQNYPL 3892 L + + H +S+ G P QS T Q + GANQ + L Sbjct: 589 PLPRGAHPHQAQSLAVGTGVSPHGVLSVQSYPQSTAVMQARPVQIGANQQSGNI-----L 643 Query: 3891 RTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFREESSSQRTAKSD 3712 +TNNQ ++ S +Q + +SE QGD EK ESS+ T K + Sbjct: 644 KTNNQ-----------VEFSSEQQAWVASRPISERQGD--IEKGAE--GESSAHNTIKKE 688 Query: 3711 LNNPVITSGLRADSVEEKNLESEVDTKSIDDERKHIGEDEDNNKVSDSLRTLGTDPNSHS 3532 LN + +GL A + E K ++SE D K +DDE K GE +D P + + Sbjct: 689 LNE--LDAGLGASASEMKTIKSESDLKQVDDENKPTGEAKDI-------------PGAPA 733 Query: 3531 MENGEPVIKQIVEE--EVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSLE------ 3376 NGEP IKQ+ E+ +VTD QKD+ + D K+VE S E Sbjct: 734 AANGEPSIKQVKEDHRDVTDK---------------QKDISNADQKKVELSLSEYMDGKD 778 Query: 3375 GKEIQGQVEIIGEQSGKLKKDAV-NAEGVV-LPANGSDRGFLSVHPSSAPVPEHRGHPPP 3202 G ++ + EQS K +KD ++G P NG H S PV Sbjct: 779 GLSLETAPSHLEEQSKKSQKDKTPTSQGFGGFPPNG--------HMQSQPVSV----VDQ 826 Query: 3201 GQLHGRGFVQPSHPVPLHQRPPALPSGLPPQHGQASGLPLTQLRPQGPGHFPQSGQPLNP 3022 G+LH P+P+HQ P AL Q RP GP P P Sbjct: 827 GKLH---------PLPIHQGPAAL-----------------QQRPVGPSWLQA---PHGP 857 Query: 3021 PDHFQPPGGILGPGSTSFGRGPSHFG--PPQRNFESQSAGPLGHYHQGHVPPSHIAPPRS 2848 P H Q PG PSH G PP GH+P SH PP+ Sbjct: 858 PHHMQLPG-----------HPPSHHGRLPP-----------------GHMP-SHYGPPQ- 887 Query: 2847 QGEPVGGPFDAHGGLMARAPPHGPEVQMGPQRFNPMEAEIFPNPRPSFLDGRQPDLHLQA 2668 GP+ H P Q E +F N RPS+ GRQ L Sbjct: 888 ------GPYT-----------HAPTSQGERTSSYVHETSMFGNQRPSYPGGRQGILSNAV 930 Query: 2667 TRMNAPPXXXXXXXXXLRDERFKPPFDERPHSFPVEPGRR---RGEFEEDLKQFPRPSHL 2497 A +RF+ DE + FP +P RR +GEFEEDLK F PS L Sbjct: 931 GTNGAQDP---------NSDRFRSFPDEHLNPFPHDPARRNAHQGEFEEDLKHFTAPSCL 981 Query: 2496 DPEAAPKFGSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRF 2317 D + PK G +FSSSRP++RGP GFG+D AP LDK HG NYD+GL +P SAP RF Sbjct: 982 DTKPVPKSGGHFSSSRPLDRGPHGFGVDGAPKHLDKGSHGLNYDSGLNVEPLGGSAPPRF 1041 Query: 2316 LPPYHPGGTPTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGP-VPGYGRHRMDGSAPR 2140 PP H T +EA + +++ R D R P GP +PGY MD APR Sbjct: 1042 FPPIHHDRTLHRSEA---EGSLGFHDNLAGRTDFARTRPGLLGPPMPGYDHRDMDNLAPR 1098 Query: 2139 SPVREFXXXXXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFP 1960 SP R++ DDI SD I +S ++RFP Sbjct: 1099 SPGRDYPGMSMQRFGALPGL-----DDIDGRAPQRS----------SDPITSSLHDSRFP 1143 Query: 1959 ILPSHLRRGEPERNVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHL 1780 + PSHLRRGE N MGEH+S GDL G D P+HLRRGE LGPRN P HL Sbjct: 1144 LFPSHLRRGELNGPGNFHMGEHLS--------GDLMGHDGWPAHLRRGERLGPRNPPSHL 1195 Query: 1779 RFGEPAGFGAFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQG 1600 R GE GFG+FP HARMGELAGPGN H + GEPGFRSS+ Sbjct: 1196 RLGERGGFGSFPGHARMGELAGPGNL----------------YHQQLGEPGFRSSF---- 1235 Query: 1599 YPNDGGFHLGDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVL 1420 GG + GD++ +N RKRKS SMGWCRICKVDCET EGLDLHSQTREHQKMAMDMV+ Sbjct: 1236 ----GGSYAGDLQYSENSRKRKS-SMGWCRICKVDCETFEGLDLHSQTREHQKMAMDMVV 1290 Query: 1419 SIKQQNGKKQKLTSNDHSSVEDASKSRNAIAIFEGRRNK 1303 +IK QN KK K +DHSS+ED SK RN A FEGR NK Sbjct: 1291 TIK-QNVKKHKSAPSDHSSLEDTSKLRN--ASFEGRGNK 1326 >ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314450 [Fragaria vesca subsp. vesca] Length = 1316 Score = 499 bits (1286), Expect = e-138 Identities = 406/1094 (37%), Positives = 506/1094 (46%), Gaps = 27/1094 (2%) Frame = -1 Query: 4539 LSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQSHAPIASQHQSTL-PSP 4363 LS Q +H Q P+ Q Q+ Q Q F QP +RPP I +Q Q+ L PSP Sbjct: 421 LSAAPQQRTVHLQSQGAPNSQSQNHVQTQIQFPLQPPLLRPPPFQTTIPNQPQTALLPSP 480 Query: 4362 GQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXXXXXXXXXXXPSGSVQNQ 4183 + QQ P+H AQQPG VQ Sbjct: 481 SMISA---QQPPVHSFAQQPG------------------------IPPLQRPLIQPVQQL 513 Query: 4182 LHQQGPYSQP--QQLPMH-SHLRPQGPTXXXXXXXXXXXXXXQNVALSQSQNHFGRSMIP 4012 QQ +QP QQ P S LRPQG + QNV LSQ H S + Sbjct: 514 NPQQYFQNQPYVQQTPATLSQLRPQGQSHSFPQHIRASNQSQQNVVLSQGMQHIQPSNLV 573 Query: 4011 GAQSQP----LPQSASGPTGAGQVKTTMHGANQHQLSTNQNYPLRTNNQGQPVFEQQPGY 3844 G P LPQ + G G + M+ HQ S+NQN RTNNQ QP +P Sbjct: 574 GRPMMPSHGVLPQPYAQTVG-GVLPRPMYPPLNHQ-SSNQNNIGRTNNQVQPGANSRP-- 629 Query: 3843 MQQSVQQSGPIIKSTMSESQGDQLSEKNVSFREESSSQRTAKSDLNNPVITSGLRADSVE 3664 TM+ E ++ +AK+ + ++S + ADS E Sbjct: 630 --------------TMTTRPA------------EKEAELSAKNGAQDVGVSSAVVADS-E 662 Query: 3663 EKNLESEVDTKSIDDERKHIGED---EDNNKVSDSLRTLGTDPNSHSMENGEPVIKQIVE 3493 K ++SEVD KS DD K ED + ++ +S LG NGE K ++ Sbjct: 663 AKTVKSEVDIKSTDDGNKPSSEDRSYQGTKEIPESKGMLGA--------NGESESKPTLK 714 Query: 3492 EEVTDSISEPSSGGKFVE--NKDQKDVPHNDLKQVENSSLEGKEIQGQVEIIGEQSGKLK 3319 EE DS E S GK E + KD P + +K E+ + +E Q + G + KL+ Sbjct: 715 EEGVDSTLEDLSNGKLGELVAEGAKDAPSSGMKLGEHKEMPPEEAQ----LHGVKDKKLQ 770 Query: 3318 KDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPGQLHGRGFVQPSHP--VPLHQ 3145 K + ++ G +V SSAP+ GQ+ G +QPSHP L Q Sbjct: 771 K----------VVSSTEEGSQTVSISSAPI---------GQVQAGGLMQPSHPGSAILQQ 811 Query: 3144 RPPA-----LPSGLPPQHGQASGLPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPG 2980 +P A +PS PP H SG PL +RPQGPGH P G P + +HFQ P G LG Sbjct: 812 KPGAPPLLQVPSSGPPHHILGSGQPLAHVRPQGPGHVP--GHPSHLSEHFQSPRGNLGFA 869 Query: 2979 STSFGRGPSHFGPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRSQGEPVGGPFDAHGGLM 2800 ++S +A G Y+Q H PP AP P FD+HGG+M Sbjct: 870 ASS-----------------ANASQHGPYNQSHAPPHSGAPRGPPFAPPPSAFDSHGGIM 912 Query: 2799 ARAPPHGPEVQMGPQRFNPMEAEIFPNPRPSF-----LDGRQPDLHLQATRMNAPPXXXX 2635 ARA P+G E QMG Q RP+F G+ + RMN P Sbjct: 913 ARAAPYGHEGQMGLQ-------------RPAFQMEQGATGQPSGIISNMLRMNGNPGFES 959 Query: 2634 XXXXXLRDERFKPPFDERPHSFPVEPGR--RRGEFEEDLKQFPRPSHLDPEAAPKFGSYF 2461 LRDERFK D R + FP +P R R FE+DLKQFPRPS LD E PK G+Y Sbjct: 960 SSTLGLRDERFKALPDGRLNPFPGDPTRVISRVGFEDDLKQFPRPSFLDSEPLPKLGNY- 1018 Query: 2460 SSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTL 2281 SSR A D+ P G NYD L DP+A SAP RFL PY G Sbjct: 1019 -SSR----------------AFDRRPFGVNYDTRLNIDPAAGSAP-RFLSPYGHAGLIHA 1060 Query: 2280 NEAGERARPVRLNEDNVSRPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXX 2101 N+ T HPDF GR MDG A RSP+R++ Sbjct: 1061 ND--------------------TIGHPDFG------GRRLMDGLARRSPIRDY------- 1087 Query: 2100 XXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILPSHLRRGEPER 1921 PDD R F+ D +G F +NRFP H RRGE E Sbjct: 1088 PGIPSRFRGFGPDDF----------DGREFHRFGDPLGREFHDNRFP--NQHFRRGEFEG 1135 Query: 1920 NVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPS 1741 NM + + + DL GQD HL+RGE+LGP NLPGHL E GFG P Sbjct: 1136 PGNMRVDDRM--------RNDLIGQDGHLGHLQRGEHLGPHNLPGHLHMREHVGFGVHPR 1187 Query: 1740 HARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHLGDME 1561 H AGPG+F E F +GN+ +HPR GEPGFRSS+SL+ +PNDG + G++E Sbjct: 1188 H------AGPGSF-------ESF-IGNRANHPRLGEPGFRSSFSLKRFPNDGTY-AGELE 1232 Query: 1560 SLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKKQKLT 1381 S D+ RKRK ASMGWCRICKV+CETVEGLD+HSQTREHQ+MAM+MV IK QN KKQKLT Sbjct: 1233 SFDHSRKRKPASMGWCRICKVNCETVEGLDVHSQTREHQRMAMEMVQIIK-QNAKKQKLT 1291 Query: 1380 SNDHSSVEDASKSR 1339 S D SS+EDA+KS+ Sbjct: 1292 SGDQSSIEDANKSK 1305 >ref|XP_007016231.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508786594|gb|EOY33850.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1326 Score = 498 bits (1282), Expect = e-137 Identities = 398/1061 (37%), Positives = 488/1061 (45%), Gaps = 54/1061 (5%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 HAVTG Q+ L PQHPMH H Q H Q QH AQMQ+ + QQP QMRPPQ Sbjct: 450 HAVTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQ-QHPAQMQNSYPQQPPQMRPPQ 508 Query: 4410 SHAPIASQHQ-STLPSPGQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 H I++Q Q LPSPG + LQQ+ LH + QP V Sbjct: 509 PHVAISNQQQPGLLPSPGSM----LQQVHLH--SHQPALPVQQRPVMHPAASPMSQPYVQ 562 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVA 4054 G VQ Q+ QQGP+ Q QQ S RP GP QNVA Sbjct: 563 QQPLSTQPV--GLVQPQMLQQGPFVQ-QQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVA 619 Query: 4053 LSQ------SQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 S S N GR M P G QSQP P SA+G VK GANQ S+ QN Sbjct: 620 GSHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAAGTP----VKPVHLGANQP--SSYQNN 673 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRTA 3721 RTNNQ SG + MSE GD ++KNV+ +E +SSS TA Sbjct: 674 VFRTNNQ------------------SG-VTSQPMSEVPGDHGTDKNVAEQEADSSSPGTA 714 Query: 3720 KSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERK-HIGEDEDNNKVS-----DSLRT 3559 + + N + S L AD E+ + E D KS+D++ +G+D + +S +S RT Sbjct: 715 RKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKETPESRRT 774 Query: 3558 LGTDPNSHSMENGEPVIKQIVEEEVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSL 3379 +GTD H +PV K +V E + + +G VE KD P SL Sbjct: 775 VGTDLEQHR----DPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGP----------SL 820 Query: 3378 EGKEIQGQVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPG 3199 + +Q + ++ EQ+GK++KD + P + GF RG PP Sbjct: 821 KTPPLQ-EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGF-------------RGIPPSS 866 Query: 3198 QLHGRGFVQPSHPVPL------------------HQRPP------ALPSGLPPQHGQASG 3091 Q+ G++ PSH VP QRP A P GLP H Q G Sbjct: 867 QVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGLP-SHAQTPG 925 Query: 3090 LPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGSTSFGRGPSHFGPPQRNFESQSA 2911 LP Q RPQGPG Q L PP++ PPG SFGR PS++GP Sbjct: 926 LPPNQFRPQGPG------QALVPPENL-PPG--------SFGRDPSNYGPQ--------- 961 Query: 2910 GPLGHYHQGHVPPSHIAPPR-SQGEPVGG---------PFDAHGGLMARAPPHGPEVQMG 2761 G Y+QG PPS PR SQGEP+ G FD+HG AP +GPE Sbjct: 962 ---GPYNQG--PPSLSGAPRISQGEPLVGLSYGTPPLTAFDSHG-----APLYGPESHSV 1011 Query: 2760 PQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFDER 2581 N ++ D RQ D + LR ER KP DE Sbjct: 1012 QHSANMVDYHA---------DNRQLDPRASGLDSTST--------FSLRGERLKPVQDEC 1054 Query: 2580 PHSFPVEPGRR--RGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMDAA 2407 + FP++ G R RG+FEEDLK FPRPSHLD E PKFGSY SSSRP++RGP GFGMD Sbjct: 1055 SNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMG 1114 Query: 2406 PGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNEDNVS 2227 P A +K PHGF+ FDP S PSRFLPPYHP ++ GE RPV L +D + Sbjct: 1115 PRAQEKEPHGFS------FDPMIGSGPSRFLPPYHP------DDTGE--RPVGLPKDTLG 1160 Query: 2226 RPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDIXXX 2047 R PDF G VP YGRHRMDG RSP RE+ Sbjct: 1161 R-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGH-------------- 1199 Query: 2046 XXXXXXXXXRPFNLPSDQIGNSFQ--ENRFPILPSHLRRGEPERNVNMPMGEHISPGPQH 1873 P D+I + +RFP LP HL RG E + M +H Sbjct: 1200 --------------PGDEIDGRERRFSDRFPGLPGHLHRGGFESSDRM---------EEH 1236 Query: 1872 FRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQH 1693 R+ D+ QD P++ RRGE++G N+PGHLR GEP GFG F SH R+GE GPGNF Sbjct: 1237 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNF--- 1293 Query: 1692 LSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHLG 1570 HPR GEPGFRSS+SLQ +PNDGG + G Sbjct: 1294 -------------RHPRLGEPGFRSSFSLQEFPNDGGIYTG 1321 >ref|XP_007016236.1| Uncharacterized protein isoform 6 [Theobroma cacao] gi|508786599|gb|EOY33855.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 1345 Score = 496 bits (1277), Expect = e-137 Identities = 397/1059 (37%), Positives = 487/1059 (45%), Gaps = 54/1059 (5%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 HAVTG Q+ L PQHPMH H Q H Q QH AQMQ+ + QQP QMRPPQ Sbjct: 450 HAVTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQ-QHPAQMQNSYPQQPPQMRPPQ 508 Query: 4410 SHAPIASQHQ-STLPSPGQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 H I++Q Q LPSPG + LQQ+ LH + QP V Sbjct: 509 PHVAISNQQQPGLLPSPGSM----LQQVHLH--SHQPALPVQQRPVMHPAASPMSQPYVQ 562 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVA 4054 G VQ Q+ QQGP+ Q QQ S RP GP QNVA Sbjct: 563 QQPLSTQPV--GLVQPQMLQQGPFVQ-QQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVA 619 Query: 4053 LSQ------SQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 S S N GR M P G QSQP P SA+G VK GANQ S+ QN Sbjct: 620 GSHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAAGTP----VKPVHLGANQP--SSYQNN 673 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRTA 3721 RTNNQ SG + MSE GD ++KNV+ +E +SSS TA Sbjct: 674 VFRTNNQ------------------SG-VTSQPMSEVPGDHGTDKNVAEQEADSSSPGTA 714 Query: 3720 KSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERK-HIGEDEDNNKVS-----DSLRT 3559 + + N + S L AD E+ + E D KS+D++ +G+D + +S +S RT Sbjct: 715 RKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKETPESRRT 774 Query: 3558 LGTDPNSHSMENGEPVIKQIVEEEVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSL 3379 +GTD H +PV K +V E + + +G VE KD P SL Sbjct: 775 VGTDLEQHR----DPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGP----------SL 820 Query: 3378 EGKEIQGQVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPG 3199 + +Q + ++ EQ+GK++KD + P + GF RG PP Sbjct: 821 KTPPLQ-EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGF-------------RGIPPSS 866 Query: 3198 QLHGRGFVQPSHPVPL------------------HQRPP------ALPSGLPPQHGQASG 3091 Q+ G++ PSH VP QRP A P GLP H Q G Sbjct: 867 QVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGLP-SHAQTPG 925 Query: 3090 LPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGSTSFGRGPSHFGPPQRNFESQSA 2911 LP Q RPQGPG Q L PP++ PPG SFGR PS++GP Sbjct: 926 LPPNQFRPQGPG------QALVPPENL-PPG--------SFGRDPSNYGPQ--------- 961 Query: 2910 GPLGHYHQGHVPPSHIAPPR-SQGEPVGG---------PFDAHGGLMARAPPHGPEVQMG 2761 G Y+QG PPS PR SQGEP+ G FD+HG AP +GPE Sbjct: 962 ---GPYNQG--PPSLSGAPRISQGEPLVGLSYGTPPLTAFDSHG-----APLYGPESHSV 1011 Query: 2760 PQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFDER 2581 N ++ D RQ D + LR ER KP DE Sbjct: 1012 QHSANMVDYHA---------DNRQLDPRASGLDSTST--------FSLRGERLKPVQDEC 1054 Query: 2580 PHSFPVEPGRR--RGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMDAA 2407 + FP++ G R RG+FEEDLK FPRPSHLD E PKFGSY SSSRP++RGP GFGMD Sbjct: 1055 SNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMG 1114 Query: 2406 PGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNEDNVS 2227 P A +K PHGF+ FDP S PSRFLPPYHP ++ GE RPV L +D + Sbjct: 1115 PRAQEKEPHGFS------FDPMIGSGPSRFLPPYHP------DDTGE--RPVGLPKDTLG 1160 Query: 2226 RPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDIXXX 2047 R PDF G VP YGRHRMDG RSP RE+ Sbjct: 1161 R-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGH-------------- 1199 Query: 2046 XXXXXXXXXRPFNLPSDQIGNSFQ--ENRFPILPSHLRRGEPERNVNMPMGEHISPGPQH 1873 P D+I + +RFP LP HL RG E + M +H Sbjct: 1200 --------------PGDEIDGRERRFSDRFPGLPGHLHRGGFESSDRM---------EEH 1236 Query: 1872 FRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQH 1693 R+ D+ QD P++ RRGE++G N+PGHLR GEP GFG F SH R+GE GPGNF Sbjct: 1237 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNF--- 1293 Query: 1692 LSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFH 1576 HPR GEPGFRSS+SLQ +PNDGG + Sbjct: 1294 -------------RHPRLGEPGFRSSFSLQEFPNDGGIY 1319 >ref|XP_007016235.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508786598|gb|EOY33854.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 1358 Score = 496 bits (1277), Expect = e-137 Identities = 397/1059 (37%), Positives = 487/1059 (45%), Gaps = 54/1059 (5%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 HAVTG Q+ L PQHPMH H Q H Q QH AQMQ+ + QQP QMRPPQ Sbjct: 450 HAVTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQ-QHPAQMQNSYPQQPPQMRPPQ 508 Query: 4410 SHAPIASQHQ-STLPSPGQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 H I++Q Q LPSPG + LQQ+ LH + QP V Sbjct: 509 PHVAISNQQQPGLLPSPGSM----LQQVHLH--SHQPALPVQQRPVMHPAASPMSQPYVQ 562 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVA 4054 G VQ Q+ QQGP+ Q QQ S RP GP QNVA Sbjct: 563 QQPLSTQPV--GLVQPQMLQQGPFVQ-QQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVA 619 Query: 4053 LSQ------SQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 S S N GR M P G QSQP P SA+G VK GANQ S+ QN Sbjct: 620 GSHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAAGTP----VKPVHLGANQP--SSYQNN 673 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFRE-ESSSQRTA 3721 RTNNQ SG + MSE GD ++KNV+ +E +SSS TA Sbjct: 674 VFRTNNQ------------------SG-VTSQPMSEVPGDHGTDKNVAEQEADSSSPGTA 714 Query: 3720 KSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERK-HIGEDEDNNKVS-----DSLRT 3559 + + N + S L AD E+ + E D KS+D++ +G+D + +S +S RT Sbjct: 715 RKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKETPESRRT 774 Query: 3558 LGTDPNSHSMENGEPVIKQIVEEEVTDSISEPSSGGKFVENKDQKDVPHNDLKQVENSSL 3379 +GTD H +PV K +V E + + +G VE KD P SL Sbjct: 775 VGTDLEQHR----DPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGP----------SL 820 Query: 3378 EGKEIQGQVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPG 3199 + +Q + ++ EQ+GK++KD + P + GF RG PP Sbjct: 821 KTPPLQ-EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGF-------------RGIPPSS 866 Query: 3198 QLHGRGFVQPSHPVPL------------------HQRPP------ALPSGLPPQHGQASG 3091 Q+ G++ PSH VP QRP A P GLP H Q G Sbjct: 867 QVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGLP-SHAQTPG 925 Query: 3090 LPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGSTSFGRGPSHFGPPQRNFESQSA 2911 LP Q RPQGPG Q L PP++ PPG SFGR PS++GP Sbjct: 926 LPPNQFRPQGPG------QALVPPENL-PPG--------SFGRDPSNYGPQ--------- 961 Query: 2910 GPLGHYHQGHVPPSHIAPPR-SQGEPVGG---------PFDAHGGLMARAPPHGPEVQMG 2761 G Y+QG PPS PR SQGEP+ G FD+HG AP +GPE Sbjct: 962 ---GPYNQG--PPSLSGAPRISQGEPLVGLSYGTPPLTAFDSHG-----APLYGPESHSV 1011 Query: 2760 PQRFNPMEAEIFPNPRPSFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFDER 2581 N ++ D RQ D + LR ER KP DE Sbjct: 1012 QHSANMVDYHA---------DNRQLDPRASGLDSTST--------FSLRGERLKPVQDEC 1054 Query: 2580 PHSFPVEPGRR--RGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMDAA 2407 + FP++ G R RG+FEEDLK FPRPSHLD E PKFGSY SSSRP++RGP GFGMD Sbjct: 1055 SNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMG 1114 Query: 2406 PGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNEDNVS 2227 P A +K PHGF+ FDP S PSRFLPPYHP ++ GE RPV L +D + Sbjct: 1115 PRAQEKEPHGFS------FDPMIGSGPSRFLPPYHP------DDTGE--RPVGLPKDTLG 1160 Query: 2226 RPDSTRKHPDFNGPVPGYGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDIXXX 2047 R PDF G VP YGRHRMDG RSP RE+ Sbjct: 1161 R-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGH-------------- 1199 Query: 2046 XXXXXXXXXRPFNLPSDQIGNSFQ--ENRFPILPSHLRRGEPERNVNMPMGEHISPGPQH 1873 P D+I + +RFP LP HL RG E + M +H Sbjct: 1200 --------------PGDEIDGRERRFSDRFPGLPGHLHRGGFESSDRM---------EEH 1236 Query: 1872 FRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQH 1693 R+ D+ QD P++ RRGE++G N+PGHLR GEP GFG F SH R+GE GPGNF Sbjct: 1237 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNF--- 1293 Query: 1692 LSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFH 1576 HPR GEPGFRSS+SLQ +PNDGG + Sbjct: 1294 -------------RHPRLGEPGFRSSFSLQEFPNDGGIY 1319 >ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214768 [Cucumis sativus] Length = 1177 Score = 466 bits (1199), Expect = e-128 Identities = 406/1163 (34%), Positives = 531/1163 (45%), Gaps = 80/1163 (6%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 +A TG Q+ LG PQ+ + + Q HQQ Q QMQS Q P MRP Q Sbjct: 134 YASTGYPSYPQPQHHQQMQLGVPQN-VPSAPQGGAHQQSQPLVQMQSQLPQPP-PMRPSQ 191 Query: 4410 SHAPIASQHQSTLPSPGQVPTIP-LQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 Q LPS QV + QQ+ +H +AQQPG Sbjct: 192 PPLYQNQQQPPILPSSNQVQNVSSAQQLHIHSHAQQPG----------------GPGQAA 235 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHS-HLRPQ----GPTXXXXXXXXXXXXX 4069 Q +HQ + Q Q H H+ PQ GP Sbjct: 236 NQRPVMQLVQQSQSQQVVHQHQHFGQQGQFIQHQLHMTPQMRLPGPPNSLSQHNHAYAHL 295 Query: 4068 XQNVAL------SQSQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLS 3913 N L + SQ+ GR ++P GAQS P QS G V+ GANQ Sbjct: 296 QHNANLPHGMQHNPSQSSEGRPLVPNQGAQSIPYSQSMVGVP----VRAIQPGANQ---- 347 Query: 3912 TNQNYPLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFREES-S 3736 P +Q P + + S Q P + G++ EK RE S Sbjct: 348 --------------PTIKQGPTFGKNSNQVQLP-------DGFGERKLEKGPDGRESGLS 386 Query: 3735 SQRTAKSDLNNPVITSGLRADSVEEKNLESEVDTK--SIDDERKHIGEDEDNNKVSDSLR 3562 SQ+ AK N+ ++S + ++ E K +SE D + D+ H + + ++ Sbjct: 387 SQKDAKRAANHLDVSSTMGTNAGELKIDKSEADKGRYAFGDKSIHFDTSTERTPQNGAM- 445 Query: 3561 TLGTDPNSHSMENGEPVIKQI---VEEEVTDSISEPSSGGKFVENK--DQKDVPHNDLKQ 3397 D N H ++G+ KQ+ V+ E + + SS K E DQKD+ + K+ Sbjct: 446 ----DSNLHVGDSGKT--KQVELKVKVEAAEGTFDHSSNDKLGEVSILDQKDLG-TEPKK 498 Query: 3396 VENSSLEGKEIQGQVEIIG-------EQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSS 3238 E+ +E K Q + +I EQS +++ D G P++G++ +S Sbjct: 499 KEDLVIENKGNQEEFKISSQDTELREEQSKRMQNDT---SGTPHPSSGTNESQQGATTTS 555 Query: 3237 APVPEHRG----------HPPPGQLHGRGFVQPSHPVPL-----HQRPP----------- 3136 + + G +PP G SHP L HQ PP Sbjct: 556 SLILGSPGMLNQHGYQDKNPPQTGGTQIGAAVTSHPASLVAHTRHQTPPSSYVSSALQHG 615 Query: 3135 -ALPS--GLPP---QHGQASGLPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGS- 2977 A PS G PP Q S P Q+RP+ PG GQP NP + F GGI GS Sbjct: 616 VAAPSLPGPPPGPYHQAQFSNNPSMQVRPRAPGLVAHPGQPFNPSESFHL-GGIPESGSA 674 Query: 2976 TSFGRGPSHFGPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRSQGEPVGGPFDAH--GGL 2803 +SFGRG +GP Q +S G Y S S G+PVG F + G Sbjct: 675 SSFGRGLGQYGPQQAL--ERSIGSQATYSLSQPSASQGGSKMSLGDPVGAHFRSKLPGAF 732 Query: 2802 MARAPPHGPEVQMGPQR-FNPMEAEIFPNPRPSFLDGRQPDL------HL-----QATRM 2659 +R H PE Q+G QR +P+EAEIF N RP LD P HL + Sbjct: 733 DSRGLLHAPEAQIGVQRPIHPLEAEIFSNQRPR-LDSHLPGTMEHHPPHLTGIPPNVLPL 791 Query: 2658 NAPPXXXXXXXXXLRDERFKPPFDERPHSFPVEPGRR---RGEFEEDLKQFPRPSHLDPE 2488 N P LRDERFK +E+ +SFP++P RR + + E+ L+QFPRPSHL+ E Sbjct: 792 NGAPGPDSSSKLGLRDERFKLLHEEQLNSFPLDPARRPINQTDAEDILRQFPRPSHLESE 851 Query: 2487 AAPKFGSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPP 2308 A + G+Y S RP +RG HG N+D GL D +A+S R LPP Sbjct: 852 LAQRIGNY--SLRPFDRGV----------------HGQNFDTGLTIDGAAAS---RVLPP 890 Query: 2307 YHPGGTPTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGPVPG-YGRHRMDGSAPRSPV 2131 H GG +A RP+ ED+ + D +R H DF P PG YGR +DG PRSP+ Sbjct: 891 RHIGGALYPTDA---ERPIAFYEDSTGQADRSRGHSDF--PAPGSYGRRFVDGFGPRSPL 945 Query: 2130 REFXXXXXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILP 1951 E+ D P + D + SF+E+RFPI Sbjct: 946 HEYHGRGFGGRGFTGVEEIDGQD--------------FPHHF-GDPL--SFRESRFPIFR 988 Query: 1950 SHLRRGEPERNVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFG 1771 SHL+RG+ E + N M EH+ RTGDL GQD + GPR+LPGHLR G Sbjct: 989 SHLQRGDFESSGNFRMSEHL-------RTGDLIGQD---------RHFGPRSLPGHLRLG 1032 Query: 1770 EPAGFGAFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPN 1591 E FG+ P H+R+G+L+ GNF EPFG G++P++PR GEPGFRSS+S QG + Sbjct: 1033 ELTAFGSHPGHSRIGDLSVLGNF-------EPFGGGHRPNNPRLGEPGFRSSFSRQGLVD 1085 Query: 1590 DGGFHLGDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIK 1411 DG F GD+ES DN RKRK SMGWCRICKVDCETVEGL+LHSQTREHQKMAMDMV SIK Sbjct: 1086 DGRFFAGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIK 1145 Query: 1410 QQNGKKQKLTSNDHSSVEDASKS 1342 QN KK K+T NDHSS + SK+ Sbjct: 1146 -QNAKKHKVTPNDHSSEDGKSKN 1167 >ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205914 [Cucumis sativus] Length = 1434 Score = 466 bits (1199), Expect = e-128 Identities = 406/1163 (34%), Positives = 531/1163 (45%), Gaps = 80/1163 (6%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFSQQPVQMRPPQ 4411 +A TG Q+ LG PQ+ + + Q HQQ Q QMQS Q P MRP Q Sbjct: 391 YASTGYPSYPQPQHHQQMQLGVPQN-VPSAPQGGAHQQSQPLVQMQSQLPQPP-PMRPSQ 448 Query: 4410 SHAPIASQHQSTLPSPGQVPTIP-LQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 Q LPS QV + QQ+ +H +AQQPG Sbjct: 449 PPLYQNQQQPPILPSSNQVQNVSSAQQLHIHSHAQQPG----------------GPGQAA 492 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHS-HLRPQ----GPTXXXXXXXXXXXXX 4069 Q +HQ + Q Q H H+ PQ GP Sbjct: 493 NQRPVMQLVQQSQSQQVVHQHQHFGQQGQFIQHQLHMTPQMRLPGPPNSLSQHNHAYAHL 552 Query: 4068 XQNVAL------SQSQNHFGRSMIP--GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLS 3913 N L + SQ+ GR ++P GAQS P QS G V+ GANQ Sbjct: 553 QHNANLPHGMQHNPSQSSEGRPLVPNQGAQSIPYSQSMVGVP----VRAIQPGANQ---- 604 Query: 3912 TNQNYPLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFREES-S 3736 P +Q P + + S Q P + G++ EK RE S Sbjct: 605 --------------PTIKQGPTFGKNSNQVQLP-------DGFGERKLEKGPDGRESGLS 643 Query: 3735 SQRTAKSDLNNPVITSGLRADSVEEKNLESEVDTK--SIDDERKHIGEDEDNNKVSDSLR 3562 SQ+ AK N+ ++S + ++ E K +SE D + D+ H + + ++ Sbjct: 644 SQKDAKRAANHLDVSSTMGTNAGELKIDKSEADKGRYAFGDKSIHFDTSTERTPQNGAM- 702 Query: 3561 TLGTDPNSHSMENGEPVIKQI---VEEEVTDSISEPSSGGKFVENK--DQKDVPHNDLKQ 3397 D N H ++G+ KQ+ V+ E + + SS K E DQKD+ + K+ Sbjct: 703 ----DSNLHVGDSGKT--KQVELKVKVEAAEGTFDHSSNDKLGEVSILDQKDLG-TEPKK 755 Query: 3396 VENSSLEGKEIQGQVEIIG-------EQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSS 3238 E+ +E K Q + +I EQS +++ D G P++G++ +S Sbjct: 756 KEDLVIENKGNQEEFKISSQDTELREEQSKRMQNDT---SGTPHPSSGTNESQQGATTTS 812 Query: 3237 APVPEHRG----------HPPPGQLHGRGFVQPSHPVPL-----HQRPP----------- 3136 + + G +PP G SHP L HQ PP Sbjct: 813 SLILGSPGMLNQHGYQDKNPPQTGGTQIGAAVTSHPASLVAHTRHQTPPSSYVSSALQHG 872 Query: 3135 -ALPS--GLPP---QHGQASGLPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGS- 2977 A PS G PP Q S P Q+RP+ PG GQP NP + F GGI GS Sbjct: 873 VAAPSLPGPPPGPYHQAQFSNNPSMQVRPRAPGLVAHPGQPFNPSESFHL-GGIPESGSA 931 Query: 2976 TSFGRGPSHFGPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRSQGEPVGGPFDAH--GGL 2803 +SFGRG +GP Q +S G Y S S G+PVG F + G Sbjct: 932 SSFGRGLGQYGPQQAL--ERSIGSQATYSLSQPSASQGGSKMSLGDPVGAHFRSKLPGAF 989 Query: 2802 MARAPPHGPEVQMGPQR-FNPMEAEIFPNPRPSFLDGRQPDL------HL-----QATRM 2659 +R H PE Q+G QR +P+EAEIF N RP LD P HL + Sbjct: 990 DSRGLLHAPEAQIGVQRPIHPLEAEIFSNQRPR-LDSHLPGTMEHHPPHLTGIPPNVLPL 1048 Query: 2658 NAPPXXXXXXXXXLRDERFKPPFDERPHSFPVEPGRR---RGEFEEDLKQFPRPSHLDPE 2488 N P LRDERFK +E+ +SFP++P RR + + E+ L+QFPRPSHL+ E Sbjct: 1049 NGAPGPDSSSKLGLRDERFKLLHEEQLNSFPLDPARRPINQTDAEDILRQFPRPSHLESE 1108 Query: 2487 AAPKFGSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPP 2308 A + G+Y S RP +RG HG N+D GL D +A+S R LPP Sbjct: 1109 LAQRIGNY--SLRPFDRGV----------------HGQNFDTGLTIDGAAAS---RVLPP 1147 Query: 2307 YHPGGTPTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGPVPG-YGRHRMDGSAPRSPV 2131 H GG +A RP+ ED+ + D +R H DF P PG YGR +DG PRSP+ Sbjct: 1148 RHIGGALYPTDA---ERPIAFYEDSTGQADRSRGHSDF--PAPGSYGRRFVDGFGPRSPL 1202 Query: 2130 REFXXXXXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILP 1951 E+ D P + D + SF+E+RFPI Sbjct: 1203 HEYHGRGFGGRGFTGVEEIDGQD--------------FPHHF-GDPL--SFRESRFPIFR 1245 Query: 1950 SHLRRGEPERNVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFG 1771 SHL+RG+ E + N M EH+ RTGDL GQD + GPR+LPGHLR G Sbjct: 1246 SHLQRGDFESSGNFRMSEHL-------RTGDLIGQD---------RHFGPRSLPGHLRLG 1289 Query: 1770 EPAGFGAFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPN 1591 E FG+ P H+R+G+L+ GNF EPFG G++P++PR GEPGFRSS+S QG + Sbjct: 1290 ELTAFGSHPGHSRIGDLSVLGNF-------EPFGGGHRPNNPRLGEPGFRSSFSRQGLVD 1342 Query: 1590 DGGFHLGDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIK 1411 DG F GD+ES DN RKRK SMGWCRICKVDCETVEGL+LHSQTREHQKMAMDMV SIK Sbjct: 1343 DGRFFAGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIK 1402 Query: 1410 QQNGKKQKLTSNDHSSVEDASKS 1342 QN KK K+T NDHSS + SK+ Sbjct: 1403 -QNAKKHKVTPNDHSSEDGKSKN 1424 >gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis] Length = 1320 Score = 458 bits (1179), Expect = e-125 Identities = 397/1118 (35%), Positives = 491/1118 (43%), Gaps = 46/1118 (4%) Frame = -1 Query: 4521 QHPMHTHLQNVPHQQPQ--HSAQMQ----SHFSQQPVQMRPPQSHAPIASQHQSTL-PSP 4363 QHP H H PQ + Q+Q F +QP+ MRPP A I +Q Q L PSP Sbjct: 407 QHPS-AHAVTGHHSFPQLNNDPQVQIGGPQQFPKQPL-MRPPHPQATIPNQQQPVLLPSP 464 Query: 4362 GQVPTIPL--QQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXXXXXXXXXXXPSGSVQ 4189 GQV P QQ H Q PG VQ Sbjct: 465 GQVQNNPSVQQQSVQHSYFQPPGQ------------------------PEYQRPIMQPVQ 500 Query: 4188 NQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVALSQSQNHFGRSMIP- 4012 QQ Y QPQ LPM S RP GP+ A +S N GR +P Sbjct: 501 QTFPQQH-YQQPQ-LPMPSQFRPTGPSHLFPPQTHAYPQPPMQHA--KSPNVAGRPSMPQ 556 Query: 4011 GAQSQPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNYPLRTNNQGQPVFEQQPGYMQQS 3832 G Q+ P Q A G ++ T G NQ + NQN L+TNNQ + S Sbjct: 557 GVQAPPFTQYAGGV-----IRPTYPGTNQQ--ANNQNNILKTNNQMK----------LPS 599 Query: 3831 VQQSGPIIKSTMSESQGDQLSEKNVSFREE-SSSQRTAKSDLNNPVITSGLRADSVEEKN 3655 + SG +TMS QG+Q K + +E +SS +T K NN L A+ E K Sbjct: 600 EEHSGANSTATMSIRQGNQDFVKGSAQQEVVASSHKTVKVGTNNSDSVLDLLANVGEVKT 659 Query: 3654 LESEVDTKSIDDERKHIGEDEDNNKVSDSLRTLGTDPNSHSMENGEPVIKQIVEEEVTDS 3475 +S+ D KS D PV+K +++EE +S Sbjct: 660 EKSKTDLKSTD-----------------------------------PVVKPMMKEEDVES 684 Query: 3474 ISEPSSGGKF--VENKDQKDVPHNDLKQVENSSLEGKEIQGQVEIIGEQSGKLKKDAVNA 3301 + SS GK V +D+KDV + ++++NS++E K++ G ++ + + Sbjct: 685 TLKNSSNGKSGKVVAEDKKDVLKVEPEKMKNSTVEDKDVGGSLQKKSPLQAVERHEGQGG 744 Query: 3300 EGVVLPANGSDRGFLSVHPSSAPV---PEHRGH---PPPGQLHGRGFVQPSHPVPLHQRP 3139 + V A+GSDR V SA + P G P + +G P P PL Q P Sbjct: 745 DSVKDAASGSDRASKVVPTPSAQILRSPASGGEVKSPYSRSVQVQGHQLPGPP-PLSQVP 803 Query: 3138 PALPSGLPPQHGQASGLPLTQLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGST-SFGR 2962 P P PP Q G T RPQ PG D PPG I PGS FGR Sbjct: 804 P--PG--PPHKTQEFGASQTHCRPQVPG------------DPLHPPGSI--PGSAIPFGR 845 Query: 2961 GPSHFGPPQRNFESQSAGPLGHYHQGHVPPSHIAPPRSQGEPVGGP---------FDAHG 2809 GP+ +GP Q++ E QS P Y+ G + SQGEP G F++HG Sbjct: 846 GPNQYGPNQQSSELQSLAPQRPYNPGPFGAFRL----SQGEPTGAESSGVLQPRAFNSHG 901 Query: 2808 GLMARAPPHGPEVQMGPQRFNPMEAEIFPNPRPSFLDGRQPD-----------------L 2680 G+MAR PHGPE+ F N RP F+D R PD + Sbjct: 902 GMMARPTPHGPEM--------------FSNQRPDFMDSRGPDPHFAGSLEHGAHSQSFGI 947 Query: 2679 HLQATRMNAPPXXXXXXXXXLRDERFKPPFDERPHSFPVEPGRRRGEFEEDLKQFPRPSH 2500 H TRMN RDERF P FP P R EFE+DLKQFPRP Sbjct: 948 HPNMTRMNDSHGFDSLSTLGPRDERFNP--------FPAGPNPR-AEFEDDLKQFPRP-- 996 Query: 2499 LDPEAAPKFGSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKFDPSASSAPSR 2320 D+ HG Y GLK D S PSR Sbjct: 997 ----------------------------------FDRGLHGLKYHTGLKMDSGVGSVPSR 1022 Query: 2319 FLPPYHPGGTPTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGPVPGYGRHRMDGSAPR 2140 L PY+ GG N+ G+R R D R D TR H DF GP GY R RMD A R Sbjct: 1023 SLSPYNGGGA---NDGGDRLGWHR--GDAFGRMDPTRGHLDFLGPGLGYDRRRMDSLASR 1077 Query: 2139 SPVREFXXXXXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQIGNSFQENRFP 1960 SP+RE DDI F P D +SF E+RF Sbjct: 1078 SPIREHPGISLRGFVGPGP------DDIHGRELRR-------FGEPFD---SSFHESRFS 1121 Query: 1959 ILPSHLRRGEPERNVNMPMGEHISPGPQHFRTGDLTGQDILPSHLRRGENLGPRNLPGHL 1780 +LP HLRRGE E NM MG+H+ DL G+D L LR GE++G + GH Sbjct: 1122 MLPGHLRRGEFEGPRNMGMGDHLR--------NDLIGRDGLSGPLRWGEHMG--DFHGHF 1171 Query: 1779 RFGEPAGFGAFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQG 1600 GEP GFGA HAR+ E+ GPG+F + FG G+ PS P GEPGFRS +S G Sbjct: 1172 HLGEPVGFGAHSRHARIREIGGPGSF-------DSFGRGDGPSFPHLGEPGFRSRFSSHG 1224 Query: 1599 YPNDGGFHLGDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVL 1420 +P G D+ + D RKRK +MGWCRICKVDCETVEGL+LHSQTREHQKMAMDMV+ Sbjct: 1225 FPTGDGIFTEDL-AFDKSRKRKLPTMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVV 1283 Query: 1419 SIKQQNGKKQKLTSNDHSSVEDASKSRNAIAIFEGRRN 1306 +IK QN KKQKLT D SS+ DAS+ R+A G+ N Sbjct: 1284 AIK-QNAKKQKLTFGDQSSLGDASQPRSAGTEGHGKDN 1320 >ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227701 [Cucumis sativus] Length = 538 Score = 396 bits (1017), Expect = e-107 Identities = 262/598 (43%), Positives = 325/598 (54%), Gaps = 19/598 (3%) Frame = -1 Query: 3078 QLRPQGPGHFPQSGQPLNPPDHFQPPGGILGPGS-TSFGRGPSHFGPPQRNFESQSAGPL 2902 Q+RP+ PG GQP NP + F GGI GS +SFGRG +GP Q +S G Sbjct: 2 QVRPRAPGLVAHPGQPFNPSESFHL-GGIPESGSASSFGRGLGQYGPQQAL--ERSIGSQ 58 Query: 2901 GHYHQGHVPPSHIAPPRSQGEPVGGPFDAH--GGLMARAPPHGPEVQMGPQR-FNPMEAE 2731 Y S S G+PVG F + G +R H PE Q+G QR +P+EAE Sbjct: 59 ATYSLSQPSASQGGSKMSLGDPVGAHFRSKLPGAFDSRGLLHAPEAQIGVQRPIHPLEAE 118 Query: 2730 IFPNPRPSFLDGRQPDL------HL-----QATRMNAPPXXXXXXXXXLRDERFKPPFDE 2584 IF N RP LD P HL +N P LRDERFK +E Sbjct: 119 IFSNQRPR-LDSHLPGTMEHHPPHLTGIPPNVLPLNGAPGPDSSSKLGLRDERFKLLHEE 177 Query: 2583 RPHSFPVEPGRR---RGEFEEDLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMD 2413 + +SFP++P RR + + E+ L+QFPRPSHL+ E A + G+Y S RP +RG Sbjct: 178 QLNSFPLDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY--SLRPFDRGV------ 229 Query: 2412 AAPGALDKAPHGFNYDAGLKFDPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNEDN 2233 HG N+D GL D +A+S R LPP H GG +A RP+ ED+ Sbjct: 230 ----------HGQNFDTGLTIDGAAAS---RVLPPRHIGGALYPTDA---ERPIAFYEDS 273 Query: 2232 VSRPDSTRKHPDFNGPVPG-YGRHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDI 2056 + D +R H DF P PG YGR +DG PRSP+ E+ D Sbjct: 274 TGQADRSRGHSDF--PAPGSYGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQD-- 329 Query: 2055 XXXXXXXXXXXXRPFNLPSDQIGNSFQENRFPILPSHLRRGEPERNVNMPMGEHISPGPQ 1876 P + D + SF+E+RFPI SHL+RG+ E + N M EH+ Sbjct: 330 ------------FPHHF-GDPL--SFRESRFPIFRSHLQRGDFESSGNFRMSEHL----- 369 Query: 1875 HFRTGDLTGQDILPSHLRRGENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQ 1696 RTGDL GQD + GPR+LPGHLR GE FG+ P H+R+G+L+ GNF Sbjct: 370 --RTGDLIGQD---------RHFGPRSLPGHLRLGELTAFGSHPGHSRIGDLSVLGNF-- 416 Query: 1695 HLSTGEPFGVGNKPSHPRFGEPGFRSSYSLQGYPNDGGFHLGDMESLDNPRKRKSASMGW 1516 EPFG G++P++PR GEPGFRSS+S QG +DG F GD+ES DN RKRK SMGW Sbjct: 417 -----EPFGGGHRPNNPRLGEPGFRSSFSRQGLVDDGRFFAGDVESFDNSRKRKPISMGW 471 Query: 1515 CRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQQNGKKQKLTSNDHSSVEDASKS 1342 CRICKVDCETVEGL+LHSQTREHQKMAMDMV SIK QN KK K+T NDHSS + SK+ Sbjct: 472 CRICKVDCETVEGLELHSQTREHQKMAMDMVQSIK-QNAKKHKVTPNDHSSEDGKSKN 528 >ref|XP_006345324.1| PREDICTED: trithorax group protein osa-like isoform X1 [Solanum tuberosum] Length = 1049 Score = 393 bits (1010), Expect = e-106 Identities = 351/1129 (31%), Positives = 463/1129 (41%), Gaps = 35/1129 (3%) Frame = -1 Query: 4590 HAVTGXXXXXXXXXXXQLSLGAPQHPMHTHLQNVPHQQPQHSAQMQSHFS-QQPVQMRPP 4414 +AV+G Q+++G Q P PH + +Q+H QP MRPP Sbjct: 139 NAVSGFHSYPQTQITQQVAIGMSQQP-----PMYPHPTSGSTPLVQTHGQVPQPPLMRPP 193 Query: 4413 QSHAPIASQHQSTLPSPGQVPTIPLQQIPLHPNAQQPGHLVNXXXXXXXXXXXXXXXXXX 4234 I +Q +P+ GQVP Q L+ AQQ GH + Sbjct: 194 LGL--IGNQQPGLVPTQGQVPA----QSQLYATAQQAGHSIQQHPVRPNQQPMSQQYSQH 247 Query: 4233 XXXXXXXXXPSGSVQNQLHQQGPYSQPQQLPMHSHLRPQGPTXXXXXXXXXXXXXXQNVA 4054 G +Q HQQG ++ Q P+ S RPQG QN Sbjct: 248 HTFP-------GPFPSQSHQQGHFTHQQ--PLQSQFRPQGLPNVVPQSLHAYIQPQQNAT 298 Query: 4053 L------SQSQNHFGRSMIPGAQS--QPLPQSASGPTGAGQVKTTMHGANQHQLSTNQNY 3898 L QSQ + GR PG Q+ Q + Q+ G QV+ +Q Q+ N +Y Sbjct: 299 LPPPPQPQQSQTYIGR---PGMQNHVQSISQAHGGYNTTAQVRPVQPALSQPQI--NPSY 353 Query: 3897 PLRTNNQGQPVFEQQPGYMQQSVQQSGPIIKSTMSESQGDQLSEKNVSFREESSS-QRTA 3721 T+N+ +S+ Q K ES+GD L +K E Q A Sbjct: 354 GSYTSNE------------HESMDQK----KRLALESKGDLLPDKTSGRPEVGVPYQDNA 397 Query: 3720 KSDLNNPVITSGLRADSVEEKNLESEVDTKSIDDERKHIGEDEDNNKVSDSLRTLGTDPN 3541 + DLN+ + KSIDDE + + +D + Sbjct: 398 QKDLNS--------------------LPAKSIDDEYR---------------QRASSDID 422 Query: 3540 SHSMENGEPVIKQIVEEEVTDSISEPSSGGKFVE-----NKDQKDVPHNDLKQVENSSLE 3376 H ++ E + K+ V+EE ++ P S K + +KD D +L Q Sbjct: 423 VHKGDSDELMDKRTVKEEENENFLMPKSASKSADATVKPDKDACDDAPKELDQTLEKHES 482 Query: 3375 GKEIQGQVEIIGEQSGKLKKDAVNAEGVVLPANGSDRGFLSVHPSSAPVPEHRGHPPPGQ 3196 G ++ + SG+ D+ GV Q Sbjct: 483 SDAADGSIKKLN--SGRDSHDSTIDRGVF------------------------------Q 510 Query: 3195 LHGRGFVQPSHPVPLHQRP-------PALPSGLPPQHGQASGLPLTQLRPQGPGHFPQSG 3037 +G G P + QRP P P+G H Q G P T + P G PQ+G Sbjct: 511 QYGHGMPPPKYGPSAQQRPVGPMIISPVQPAG-SASHAQLPGYPPTAMMPSGD--VPQAG 567 Query: 3036 QPLNPPDHFQ---------PPGGILGPGS-TSFGRGPSHFGPPQRNFESQSAGPLGHYHQ 2887 QPLN DH P GGI GPGS T+F RG HF PP Sbjct: 568 QPLNSLDHHPQFLKQPSSAPLGGIPGPGSITTFARGHGHFLPP----------------- 610 Query: 2886 GHVPPSHIAPPRSQGEPVGGPFDAHGGLMARAPPHGPEVQMGPQR-FNPMEAEIFPNPRP 2710 G P E + G + RAP G E+ G Q NP EAE+F N R Sbjct: 611 GEFP-----------EGITG--------IGRAPLSGAEIPSGTQHSVNPAEAEMFQNQRV 651 Query: 2709 SFLDGRQPDLHLQATRMNAPPXXXXXXXXXLRDERFKPPFDERPHSFPVEPGRRRGEFEE 2530 + +G QP+ + P RD+R K P E Sbjct: 652 NRFEGNQPN-PFSSGSFEKVPFGQPRSMESARDKRLKAPMGE------------------ 692 Query: 2529 DLKQFPRPSHLDPEAAPKFGSYFSSSRPIERGPQGFGMDAAPGALDKAPHGFNYDAGLKF 2350 HL P P+ D DK P G YD+G KF Sbjct: 693 ---------HLSPLPVPR--------------------DQGSWPHDKPPRGLGYDSGSKF 723 Query: 2349 DPSASSAPSRFLPPYHPGGTPTLNEAGERARPVRLNEDNVSRPDSTRKHPDFNGPVPGYG 2170 + S P+R LPP+HP G+ ++GER P+ ++D+ R S G+G Sbjct: 724 EASTGVPPNRLLPPHHPPGSMHFKDSGEREAPLGPHDDDRKRGGS------------GFG 771 Query: 2169 RHRMDGSAPRSPVREFXXXXXXXXXXXXXXXXXXPDDIXXXXXXXXXXXXRPFNLPSDQI 1990 H MD + R+P E DDI FNLPS+ Sbjct: 772 VHHMDYLSARNPDGELFNIPPRGFVSHSGF-----DDIGGREPRQFIEGPGHFNLPSNLA 826 Query: 1989 GNSFQENRFPILPSHLRRGEPERNVNMPMGEHISPGP--QHFRTGDLTGQDILPSHLRRG 1816 G + RF LP H E + ++ GEH + G +H ++GDL G+D +PSHL Sbjct: 827 GGLYSNGRFQALPGHPHGVETDGLGDLRGGEHTTFGRPYKHVQSGDLFGKD-MPSHLHHD 885 Query: 1815 ENLGPRNLPGHLRFGEPAGFGAFPSHARMGELAGPGNFPQHLSTGEPFGVGNKPSHPRFG 1636 E+L P LP HLRF +PAGFG+F HA MGEL+G G+ P GE G NKP PRFG Sbjct: 886 ESLDPPKLPSHLRFDKPAGFGSFAGHAYMGELSGFGDIP---GFGESIG-RNKPGMPRFG 941 Query: 1635 EPGFRSSYSLQGYPNDGGFHLGDMESLDNPRKRKSASMGWCRICKVDCETVEGLDLHSQT 1456 EPGFRS Y + YPN G + GD++S D PRKRK SMGWCRICKVDCETVEGLD+HSQT Sbjct: 942 EPGFRSRYPVPAYPNHG-LYAGDVDSFDRPRKRKPTSMGWCRICKVDCETVEGLDMHSQT 1000 Query: 1455 REHQKMAMDMVLSIKQQNGKKQKLTSNDHSSVEDASKSRNAIAIFEGRR 1309 REHQ MAMDMV SIK+QN KKQK T +D +SVE+ ++R A+ GR+ Sbjct: 1001 REHQDMAMDMVRSIKEQNRKKQK-TFSDRASVEEKGRTRKAVFESRGRK 1048