BLASTX nr result
ID: Forsythia22_contig00022761
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00022761 (1489 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011087795.1| PREDICTED: uncharacterized protein LOC105169...  464 e-127
ref|XP_011092382.1| PREDICTED: uncharacterized protein LOC105172...  461 e-127
ref|XP_012828377.1| PREDICTED: uncharacterized protein LOC105949...  397 e-108
ref|XP_012828376.1| PREDICTED: uncharacterized protein LOC105949...  397 e-108
emb|CDO97516.1| unnamed protein product [Coffea canephora]           383 e-103
ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241...  381 e-103
ref|XP_012839759.1| PREDICTED: uncharacterized protein LOC105960...  340 2e-90
ref|XP_010277688.1| PREDICTED: uncharacterized protein YMR317W-l...  332 5e-88
ref|XP_010245092.1| PREDICTED: uncharacterized protein LOC104588...  330 2e-87
ref|XP_010245093.1| PREDICTED: uncharacterized protein LOC104588...  323 3e-85
gb|EYU35430.1| hypothetical protein MIMGU_mgv1a0188591mg, partia...  317 2e-83
ref|XP_007041568.1| Uncharacterized protein isoform 2 [Theobroma...  309 3e-81
ref|XP_007041567.1| Uncharacterized protein isoform 1 [Theobroma...  309 3e-81
gb|EYU18535.1| hypothetical protein MIMGU_mgv1a006469mg [Erythra...  301 9e-79
ref|XP_007225552.1| hypothetical protein PRUPE_ppa002972m2g, par...  285 5e-74
ref|XP_012467689.1| PREDICTED: uncharacterized protein LOC105786...  281 7e-73
ref|XP_012078152.1| PREDICTED: mediator of RNA polymerase II tra...  275 5e-71
ref|XP_002513834.1| conserved hypothetical protein [Ricinus comm...
274 1e-70 ref|XP_008233924.1| PREDICTED: cell wall protein AWA1 [Prunus mume] 273 3e-70 ref|XP_012078151.1| PREDICTED: mediator of RNA polymerase II tra... 272 4e-70 >ref|XP_011087795.1| PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum] Length = 624 Score = 464 bits (1193), Expect = e-127 Identities = 275/468 (58%), Positives = 314/468 (67%), Gaps = 7/468 (1%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 MERSEPTLVPEWLKN S +ARNKSF++SNGH+ GRSS+S+ T Sbjct: 1 MERSEPTLVPEWLKNTGNLTGAGSISHSDDHAASRVARNKSFVNSNGHEFGRSSSSERTT 60 Query: 1206 SSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHRDFSDVFEN 1027 SSYF RSS+SN SG RSYSSFG D+ K V D+ H DFSD N Sbjct: 61 SSYFRRSSSSNSSGNFRSYSSFGRSQRDRDWEKDVYDSRDQDKSVLADHWHWDFSDPLGN 120 Query: 1026 IFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSAS-RNNKGLLTEGSPVGS-ANKAA 853 +K+ERDGLRRSQSM+SGKRG+T PKKVVTDL SAS +N GLL GSPVG A KA Sbjct: 121 SLLSKYERDGLRRSQSMVSGKRGDTWPKKVVTDLSSASGKNANGLLYRGSPVGGRAKKAT 180 Query: 852 FEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALAEVPVLVGS 673 FEKDFPSLGA+ER PEVGRVPSP LSTAIQSLP+G+S +I GEKWTSALAEVPVLVGS Sbjct: 181 FEKDFPSLGADERAVVPEVGRVPSPGLSTAIQSLPVGTSGLIVGEKWTSALAEVPVLVGS 240 Query: 672 DXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPRLEELAIKQ 493 + LNMAE VAQ P+ QTT Q S GT RLEELAIKQ Sbjct: 241 NGTALSSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSVGTQRLEELAIKQ 300 Query: 492 SRQLIPVTQSMPKTLVLNSSDK---KVG-QQHTLASSLPVSHSTRGVPEKSDLSKTSNVG 325 SRQLIPVT SMPK LVL SSDK KVG QQH+++SSLP++HS RG K D++K SNVG Sbjct: 301 SRQLIPVTPSMPKALVLTSSDKPKGKVGQQQHSISSSLPLNHSPRGGAVKGDVAKASNVG 360 Query: 324 KLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXXXXXXPNIPVLHGA 145 KL VLKPVRE+NGV+P KDN SPTS S+++ S L V N +H Sbjct: 361 KLQVLKPVREKNGVTPVVKDNLSPTSSSKVVTSTLAVSPSVSGSAATRGLPNN--GVH-- 416 Query: 144 NRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLAN-SSPVPDRSMANPS 4 +RKP LTV EK+ TSQAQSR++FFNLVRKKS+ N SS V D +MAN S Sbjct: 417 DRKPSLTVLEKRPTSQAQSRNDFFNLVRKKSMPNSSSAVADSAMANCS 464 >ref|XP_011092382.1| PREDICTED: 
uncharacterized protein LOC105172576 [Sesamum indicum] Length = 616 Score = 461 bits (1185), Expect = e-127 Identities = 268/467 (57%), Positives = 312/467 (66%), Gaps = 5/467 (1%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 MERSEPTL+PEWL++ + LARNKS ++SNGHDS RS +SD T Sbjct: 1 MERSEPTLIPEWLRSAGSLNGGGSISHSDEQTTTKLARNKSLVNSNGHDSARSFSSDRTT 60 Query: 1206 SSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHRDFSDVFEN 1027 SSYF RSS+SN SG LRS+SSFG DK K V GD HRDFSD N Sbjct: 61 SSYFRRSSSSNGSGHLRSHSSFGRNHHDRDWEKDACDSRDKDKSVLGDRWHRDFSDAMGN 120 Query: 1026 IFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNN-KGLLTEGSPVGSANKAAF 850 +KFERDGLRRSQSMISGKRG+T KKV TDL AS NN GL ++GSP+G NK F Sbjct: 121 TLLSKFERDGLRRSQSMISGKRGDTWHKKVGTDLNIASGNNTNGLPSKGSPIGGVNKTTF 180 Query: 849 EKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALAEVPVLVGSD 670 E+DFPSLGAEER A PEVGRVPSP +S+A+QSLPIG+ ++I GEKW SALAEVPVLVG++ Sbjct: 181 ERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPTIIRGEKWRSALAEVPVLVGNN 240 Query: 669 XXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPRLEELAIKQS 490 LNMAE VAQ P+ QTT Q S GT RLEELAIKQS Sbjct: 241 VTGISSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSIGTQRLEELAIKQS 300 Query: 489 RQLIPVTQSMPKTLVLNSSDK---KVG-QQHTLASSLPVSHSTRGVPEKSDLSKTSNVGK 322 RQLIPVT SMPK L S+DK KVG QQH + SSL + S RG P K+D+SKTSNVGK Sbjct: 301 RQLIPVTPSMPKPLAACSADKQKTKVGQQQHVVTSSLAANQSPRGGPVKADVSKTSNVGK 360 Query: 321 LHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXXXXXXPNIPVLHGAN 142 LHVLKPVRE+NG +P K+N SPTSGS++++SPL PN PV A+ Sbjct: 361 LHVLKPVREKNGTTPVVKENLSPTSGSKLVSSPLAA--PSLSGSAATRVLPNNPV---AD 415 Query: 141 RKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANSSPVPDRSMANPSP 1 RKPV TV EK+ TSQAQSR++FFN VRKKS+ANS+ V D ++AN SP Sbjct: 416 RKPVWTVLEKRPTSQAQSRNDFFNSVRKKSMANSTSVADAAIANSSP 462 >ref|XP_012828377.1| PREDICTED: uncharacterized protein LOC105949617 isoform X2 [Erythranthe guttatus] Length = 550 Score = 397 bits (1021), Expect = e-108 Identities = 243/468 (51%), Positives = 290/468 (61%), Gaps = 10/468 (2%) Frame 
= -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 M+RSEP+LVP+WLKN +ARNKSF+++NG+D GR+S S T Sbjct: 1 MDRSEPSLVPQWLKNSGSSTGGGDNHPAS-----RVARNKSFVNTNGNDFGRASGSAKTT 55 Query: 1206 SSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHR-DFSDVFE 1030 SSYF RSS+SN SG +SYSSFG DK + V G RHR + S++ Sbjct: 56 SSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHRYESSELLG 115 Query: 1029 NIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSAS--RNNKGLLTEGSPVGSANKA 856 N +K+ERDGLRRS SMISGK GET PKKVVT+ S S N G L +GSPVG ANKA Sbjct: 116 NPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGSPVGVANKA 175 Query: 855 AFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALAEVPVLVG 676 FE+DFPSLG ++R PEVGRV SP LS+A+QSLPIGSS+ IGGE+WTSALAEVP+LV Sbjct: 176 TFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSALAEVPMLVV 235 Query: 675 SD-XXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPRLEELAI 499 S+ LNMAE VAQ PT QT Q S GT RLEELAI Sbjct: 236 SNGTASLSVQQAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLGTQRLEELAI 295 Query: 498 KQSRQLIPVTQSMPKTLVLNSSDK---KVG--QQHTLASSLPVSHSTRGV-PEKSDLSKT 337 KQSRQLIPVT +MPKTLVL+SSDK KVG QQH SSLP++ S RG P K D SK Sbjct: 296 KQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAPPSKPDFSKA 355 Query: 336 SNVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXXXXXXPNIPV 157 SNVGKLHVLKPVRE+NGV+PS KD SPT + +NS L P Sbjct: 356 SNVGKLHVLKPVREKNGVTPSVKDKLSPTGSGKAVNSTL-------------------PA 396 Query: 156 LHGANRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANSSPVPDRSMA 13 A + + T EK+ T+QAQSR++FF +R+KS++NSS + A Sbjct: 397 SPSAVKPLLTTALEKRPTTQAQSRNDFFKRMREKSVSNSSSASETGTA 444 >ref|XP_012828376.1| PREDICTED: uncharacterized protein LOC105949617 isoform X1 [Erythranthe guttatus] Length = 575 Score = 397 bits (1021), Expect = e-108 Identities = 243/468 (51%), Positives = 290/468 (61%), Gaps = 10/468 (2%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 M+RSEP+LVP+WLKN +ARNKSF+++NG+D GR+S S T Sbjct: 1 MDRSEPSLVPQWLKNSGSSTGGGDNHPAS-----RVARNKSFVNTNGNDFGRASGSAKTT 55 Query: 
1206 SSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHR-DFSDVFE 1030 SSYF RSS+SN SG +SYSSFG DK + V G RHR + S++ Sbjct: 56 SSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHRYESSELLG 115 Query: 1029 NIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSAS--RNNKGLLTEGSPVGSANKA 856 N +K+ERDGLRRS SMISGK GET PKKVVT+ S S N G L +GSPVG ANKA Sbjct: 116 NPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGSPVGVANKA 175 Query: 855 AFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALAEVPVLVG 676 FE+DFPSLG ++R PEVGRV SP LS+A+QSLPIGSS+ IGGE+WTSALAEVP+LV Sbjct: 176 TFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSALAEVPMLVV 235 Query: 675 SD-XXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPRLEELAI 499 S+ LNMAE VAQ PT QT Q S GT RLEELAI Sbjct: 236 SNGTASLSVQQAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLGTQRLEELAI 295 Query: 498 KQSRQLIPVTQSMPKTLVLNSSDK---KVG--QQHTLASSLPVSHSTRGV-PEKSDLSKT 337 KQSRQLIPVT +MPKTLVL+SSDK KVG QQH SSLP++ S RG P K D SK Sbjct: 296 KQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAPPSKPDFSKA 355 Query: 336 SNVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXXXXXXPNIPV 157 SNVGKLHVLKPVRE+NGV+PS KD SPT + +NS L P Sbjct: 356 SNVGKLHVLKPVREKNGVTPSVKDKLSPTGSGKAVNSTL-------------------PA 396 Query: 156 LHGANRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANSSPVPDRSMA 13 A + + T EK+ T+QAQSR++FF +R+KS++NSS + A Sbjct: 397 SPSAVKPLLTTALEKRPTTQAQSRNDFFKRMREKSVSNSSSASETGTA 444 >emb|CDO97516.1| unnamed protein product [Coffea canephora] Length = 599 Score = 383 bits (983), Expect = e-103 Identities = 236/471 (50%), Positives = 289/471 (61%), Gaps = 13/471 (2%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSN----LARNKSFMHSNGHDSGRSSAS 1219 MERSEP+LVPEWLK+ + LARNKS ++ N H+ GRSS S Sbjct: 1 MERSEPSLVPEWLKSSGSATGSGTTSHPLSPSDDHAVSKLARNKSSVNHNDHEIGRSSVS 60 Query: 1218 DGPTSSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHRDFSD 1039 D ++SYF RSS+SN SG+++SYSSFG D+ V G ++HRD+ D Sbjct: 61 DRTSASYFRRSSSSNGSGQMQSYSSFGRNHRGRDWDKDLYEPRDRDNLVVGGHKHRDYLD 120 Query: 1038 
VFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNNK----GLLTEGSPVG 871 N FP FE+DGLRRSQSM+S KR E PK+ + D SASRN LL +G VG Sbjct: 121 PPVNNFPGNFEKDGLRRSQSMVSRKRNEIWPKRSIADSNSASRNKSTDGNSLLDKGDSVG 180 Query: 870 SANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALAEV 691 + +K FE+DFPSLG+EER AT EVGRVPSP L+TAI LPI +S++I G+KWTSALAEV Sbjct: 181 TVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAIHGLPISASAIIAGDKWTSALAEV 240 Query: 690 PVLV-GSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPRL 514 P +V G GLNMAETVAQ P +Q + ++GT RL Sbjct: 241 PAIVGGGGTGLSPGRQASLPSSPASLPSSTSAGLNMAETVAQGPR-VQAAPKITSGTQRL 299 Query: 513 EELAIKQSRQLIPVTQSMPKTLVLNSSDK---KVGQ-QHTLASSLPVSHSTRGVPEKSDL 346 EELAI+QSRQLIP+T SMPK +LNSSDK K GQ QH ++S L +S S RG P K+D Sbjct: 300 EELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAGQPQHPVSSPL-LSPSLRGGPVKTDA 358 Query: 345 SKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXXXXXXPN 166 SKTSN GKL VLKP RERNGVS ++KD SPTS +R S + V N Sbjct: 359 SKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTRAATSGIAVATSVTGLATSRGPAIN 418 Query: 165 IPVLHGANRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANSSPVPDRSMA 13 PV GA RK L + EKK +SQAQSR++FFNL+RKKS+ +SS V D A Sbjct: 419 -PVSPGAERKHALPMLEKKPSSQAQSRNDFFNLMRKKSMPSSSSVADAGSA 468 >ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera] Length = 665 Score = 381 bits (978), Expect = e-103 Identities = 233/477 (48%), Positives = 287/477 (60%), Gaps = 19/477 (3%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNL-------ARNKSFMHSNGHDSGRS 1228 M+++EP LVPEWLK+ K ++SN HD+GRS Sbjct: 1 MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPARKLMVNSNDHDTGRS 60 Query: 1227 SASDGPTSSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHRD 1048 S + TSSYF RSS+SN SG RS+SSFG DK K V D+RHRD Sbjct: 61 SNLERTTSSYFRRSSSSNGSGHPRSFSSFGRTNREREWEKDIHDYRDKDKSVLSDHRHRD 120 Query: 1047 FSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASR----NNKGLLTEGS 880 +SD NI P + ERD LRRSQSMI+GKRG+ P+KV D+ + ++ N G L G Sbjct: 121 YSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASGI 180 Query: 879 
PVGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSAL 700 S KAAF+++FPSLGAE++ P++GRV SP L++AIQSLPIG++ VIGG+ WTSAL Sbjct: 181 VTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSAL 240 Query: 699 AEVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQ--TTTQSSAG 526 AEVPV++GS+ GLNMAET+ Q P + T Q S G Sbjct: 241 AEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARANATPQLSVG 300 Query: 525 TPRLEELAIKQSRQLIPVTQSMPKTLVLNSSDK---KVGQQHTLASSLPVSHSTRGVPEK 355 T RLEELA+KQSRQLIP+T SMPKTLV + SDK K+G Q V+HS RG P + Sbjct: 301 TQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQPLHL----VNHSQRGGPAR 356 Query: 354 SDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXXXXX 175 SD++KTSNVGKLHVLKP RERNGVSP+AKD+ SPT GSR+ NSPL V Sbjct: 357 SDVTKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPLAVTPSAAGSASLRSP 416 Query: 174 XPNIPVLHGANRKP--VLTVSEKKSTSQAQSRSEFFNLVRKKSLAN-SSPVPDRSMA 13 N P L A R+P VLT EK+ TSQAQSR++FFNL+RKKS N S VP+ A Sbjct: 417 RNN-PTLASAERRPSVVLTSVEKRPTSQAQSRNDFFNLMRKKSSTNPPSAVPESGPA 472 >ref|XP_012839759.1| PREDICTED: uncharacterized protein LOC105960131 [Erythranthe guttatus] Length = 436 Score = 340 bits (872), Expect = 2e-90 Identities = 222/457 (48%), Positives = 270/457 (59%), Gaps = 6/457 (1%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 MERSEPTLVPEWL+N S L RNKSF++SNG+D GRS +SD T Sbjct: 1 MERSEPTLVPEWLRNPGSLNGGGSASHSDGKNASKLVRNKSFVNSNGNDFGRSLSSDRTT 60 Query: 1206 SSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHRDFSDVF-E 1030 SSYF RSS++N SG RS++SFG K K V G+ R+FSD F Sbjct: 61 SSYFRRSSSNNGSGNSRSHTSFGRKQHDTYDSRE------KDKSVLGN--RRNFSDSFGN 112 Query: 1029 NIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNNKGLLTEGSPVGSANKAAF 850 N +KFER+GLR SQS+ S K +T +KV T+ S N GLLT+ SP+G NK F Sbjct: 113 NTLSSKFEREGLRHSQSIDSAKHADTWHRKVTTN--SGRNNTDGLLTKNSPIGEVNKKTF 170 Query: 849 EKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALAEVPVLVGSD 670 ++DFPSLG E+R +PSP LS+ IQSLP +SS+I GEKWTSALAEVPV VGS Sbjct: 171 KRDFPSLGTEDRTV------IPSPGLSSPIQSLPSCTSSLINGEKWTSALAEVPVSVGSH 
224 Query: 669 XXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPRLEELAIKQS 490 +MAE V Q P+ +QT Q S GT RLEELAIK+S Sbjct: 225 GNGILSVQELAPLSSA----------SMAEAVVQGPSRVQTAPQLSMGTQRLEELAIKKS 274 Query: 489 RQLIPVTQSMPKTLVLNSSDK---KVGQ-QHTLASSLPVSHSTRGVPEKSDLSKTS-NVG 325 +QLIPVT S PKTLVLNS+DK K Q H ++SSLPV+ S RG P K+D SK S VG Sbjct: 275 KQLIPVTPSTPKTLVLNSTDKHKTKASQHNHPISSSLPVNQSPRGGPTKADFSKASTTVG 334 Query: 324 KLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXXXXXXPNIPVLHGA 145 KLHVLKP+RE NGV KDN S + S++ +S +P Sbjct: 335 KLHVLKPMREINGV---VKDNSSASGSSKLTSSSTPAAPTRGPPNNHL-----VP----- 381 Query: 144 NRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANSSP 34 + KPV+TV EK+ TSQAQSR++FFN VRKKS+A SP Sbjct: 382 DHKPVITVLEKRPTSQAQSRNDFFNTVRKKSMAFPSP 418 >ref|XP_010277688.1| PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera] gi|720070295|ref|XP_010277689.1| PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera] Length = 655 Score = 332 bits (851), Expect = 5e-88 Identities = 218/503 (43%), Positives = 283/503 (56%), Gaps = 47/503 (9%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSN--------LARNKSFMHSNGHDSGR 1231 M + EPTLVPEWLK ++ RN+ M + +D+ R Sbjct: 1 MAKGEPTLVPEWLKGTGSITGGGNTTHHFASSSTHSDDHAVALTTRNRLTMSTGDYDTPR 60 Query: 1230 SSAS-DGPTSSYFHRSSNSN--------ISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAK 1078 SSA D +S+YF RSS+SN S RSYSSF DK K Sbjct: 61 SSAFLDRTSSAYFRRSSSSNGSMMHDKETSTYSRSYSSFTRSHRDRDWEKDTLDYRDKEK 120 Query: 1077 PVFGDYRHRDFSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNNK- 901 + GD+R RD+SD +I ++ E+D LRRSQSMISGKRGE ++V D + + N+ Sbjct: 121 SILGDHRDRDYSDPLASILTSRXEKDTLRRSQSMISGKRGEGWSRRVAADTNNGNNNHNN 180 Query: 900 --GLLTEGSPVGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVI 727 GLL GS V S KAAFE+DFPSLGAEE+ ++GRV SP LS+++QSLPIGSS+VI Sbjct: 181 GNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPIGSSAVI 240 Query: 726 GGEKWTSALAEVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQT 547 GG+ WTSALAEVPV++G++ GLNMAET+AQ P+ + Sbjct: 241 
GGDGWTSALAEVPVIIGNNSIGPSSVQQATPASSTSGAPNSSTGLNMAETLAQAPSRTRI 300 Query: 546 TTQSSAGTPRLEELAIKQSRQLIPVTQSMPKTLVLNSSDK----------------KVGQ 415 + Q S T RLEELAIKQSRQLIP+T SMPKT LNSS+K K Q Sbjct: 301 SPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEKAKPKAVVRTGEMGISAKTSQ 360 Query: 414 QHTLASSLPVSHSTRGVPEKSDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRI 235 Q L SS V+HS RG P +SD+ KTS+ GKL VLK RE+NG+SPSAKD SPT+ S++ Sbjct: 361 QQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPREKNGISPSAKDGLSPTNASKV 420 Query: 234 LNSPLVVXXXXXXXXXXXXXXPN-IP---------VLHGANRKPVLTVSEKKSTSQAQSR 85 +N+ LV+ + +P + HG+ V ++ +TSQ QSR Sbjct: 421 VNNSLVLAPLAAYAPPMRSPNNSKLPNERKSVASSLTHGS------AVEKRPTTSQVQSR 474 Query: 84 SEFFNLVRKKSLAN-SSPVPDRS 19 ++FFNL+RKK+ N +S VPD S Sbjct: 475 NDFFNLMRKKTSGNLASAVPDPS 497 >ref|XP_010245092.1| PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo nucifera] Length = 645 Score = 330 bits (845), Expect = 2e-87 Identities = 214/494 (43%), Positives = 280/494 (56%), Gaps = 38/494 (7%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNL--------ARNKSFMHSNGHDSGR 1231 M +SEPTLVPEWLK RN+S + +D+ R Sbjct: 1 MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDDNAVALPTRNRSSLSIGDYDTPR 60 Query: 1230 SSA-SDGPTSSYFHRSSNSN--------ISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAK 1078 SSA SD +S+Y RSS+SN I RSYS+F DK + Sbjct: 61 SSAFSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKER 120 Query: 1077 PVFGDYRHRDFSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRN--- 907 V GD+R DFSD +I ++ E+D LRRSQSM+SGKRGE P+KV DL + + N Sbjct: 121 SVPGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAADLNNGNINQNT 180 Query: 906 NKGLLTEGSPVGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVI 727 + GLL GS V S KAAFE+DFPSLGAEE+P TP++GRV SP LS+A+QSLP+GSS++I Sbjct: 181 SNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGSSALI 240 Query: 726 GGEKWTSALAEVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQT 547 GG+ WTSALAEVP+++G++ GLNMAET+AQ P+ + Sbjct: 241 GGDGWTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPSRARI 300 Query: 546 
TTQSSAGTPRLEELAIKQSRQLIPVTQSMPKTLVLNSSDK-------KVGQQH-TLASSL 391 + Q S T RLEELAIKQSRQLIP+T SMPKT VLNS +K + G+ + T Sbjct: 301 SPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATKTIQQ 360 Query: 390 PVSHSTRGVPEKSDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVX 211 S RG P +SD+SKTS+ GKL VLK RE+NG+SP AKD SPT+ S++ N+PL + Sbjct: 361 QQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANNPLALA 420 Query: 210 XXXXXXXXXXXXXPNI---------PVLHGANRKPVLTVSEKKSTSQAQSRSEFFNLVRK 58 + ++HG+ +V ++ +TSQ QSR++FFNL+RK Sbjct: 421 PSAAFTPLKSPNNSKLSNERKSAAASLMHGS------SVEKRPTTSQVQSRNDFFNLMRK 474 Query: 57 KSLAN-SSPVPDRS 19 K+ N SS PD S Sbjct: 475 KTSGNLSSAAPDPS 488 >ref|XP_010245093.1| PREDICTED: uncharacterized protein LOC104588732 isoform X2 [Nelumbo nucifera] Length = 616 Score = 323 bits (827), Expect = 3e-85 Identities = 210/485 (43%), Positives = 274/485 (56%), Gaps = 29/485 (5%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 M +SEPTLVPEWLK + S H H + S SD + Sbjct: 1 MAKSEPTLVPEWLKG-----------------TGGITGAGSTTH---HFASSSLQSDRTS 40 Query: 1206 SSYFHRSSNSN--------ISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHR 1051 S+Y RSS+SN I RSYS+F DK + V GD+R Sbjct: 41 SAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKERSVPGDHRDL 100 Query: 1050 DFSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRN---NKGLLTEGS 880 DFSD +I ++ E+D LRRSQSM+SGKRGE P+KV DL + + N + GLL GS Sbjct: 101 DFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAADLNNGNINQNTSNGLLVGGS 160 Query: 879 PVGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSAL 700 V S KAAFE+DFPSLGAEE+P TP++GRV SP LS+A+QSLP+GSS++IGG+ WTSAL Sbjct: 161 IVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGSSALIGGDGWTSAL 220 Query: 699 AEVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTP 520 AEVP+++G++ GLNMAET+AQ P+ + + Q S T Sbjct: 221 AEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPSRARISPQLSVETQ 280 Query: 519 RLEELAIKQSRQLIPVTQSMPKTLVLNSSDK-------KVGQQH-TLASSLPVSHSTRGV 364 RLEELAIKQSRQLIP+T SMPKT VLNS +K + G+ + T S RG Sbjct: 281 
RLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATKTIQQQQLSSLRGA 340 Query: 363 PEKSDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXX 184 P +SD+SKTS+ GKL VLK RE+NG+SP AKD SPT+ S++ N+PL + Sbjct: 341 PMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANNPLALAPSAAFTPLK 400 Query: 183 XXXXPNI---------PVLHGANRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLAN-SSP 34 + ++HG+ +V ++ +TSQ QSR++FFNL+RKK+ N SS Sbjct: 401 SPNNSKLSNERKSAAASLMHGS------SVEKRPTTSQVQSRNDFFNLMRKKTSGNLSSA 454 Query: 33 VPDRS 19 PD S Sbjct: 455 APDPS 459 >gb|EYU35430.1| hypothetical protein MIMGU_mgv1a0188591mg, partial [Erythranthe guttata] Length = 399 Score = 317 bits (812), Expect = 2e-83 Identities = 207/422 (49%), Positives = 254/422 (60%), Gaps = 6/422 (1%) Frame = -2 Query: 1281 LARNKSFMHSNGHDSGRSSASDGPTSSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXX 1102 L RNKSF++SNG+D GRS +SD TSSYF RSS++N SG RS++SFG Sbjct: 7 LVRNKSFVNSNGNDFGRSLSSDRTTSSYFRRSSSNNGSGNSRSHTSFGRKQHDTYDSRE- 65 Query: 1101 XXXXDKAKPVFGDYRHRDFSDVF-ENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDL 925 K K V G+ R+FSD F N +KFER+GLR SQS+ S K +T +KV T+ Sbjct: 66 -----KDKSVLGN--RRNFSDSFGNNTLSSKFEREGLRHSQSIDSAKHADTWHRKVTTN- 117 Query: 924 RSASRNNKGLLTEGSPVGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPI 745 S N GLLT+ SP+G NK F++DFPSLG E+R +PSP LS+ IQSLP Sbjct: 118 -SGRNNTDGLLTKNSPIGEVNKKTFKRDFPSLGTEDRTV------IPSPGLSSPIQSLPS 170 Query: 744 GSSSVIGGEKWTSALAEVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQC 565 +SS+I GEKWTSALAEVPV VGS +MAE V Q Sbjct: 171 CTSSLINGEKWTSALAEVPVSVGSHGNGILSVQELAPLSSA----------SMAEAVVQG 220 Query: 564 PTLIQTTTQSSAGTPRLEELAIKQSRQLIPVTQSMPKTLVLNSSDK---KVGQ-QHTLAS 397 P+ +QT Q S GT RLEELAIK+S+QLIPVT S PKTLVLNS+DK K Q H ++S Sbjct: 221 PSRVQTAPQLSMGTQRLEELAIKKSKQLIPVTPSTPKTLVLNSTDKHKTKASQHNHPISS 280 Query: 396 SLPVSHSTRGVPEKSDLSKTS-NVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPL 220 SLPV+ S RG P K+D SK S VGKLHVLKP+RE NGV KDN S + S++ +S Sbjct: 281 SLPVNQSPRGGPTKADFSKASTTVGKLHVLKPMREINGV---VKDNSSASGSSKLTSSST 337 Query: 219 VVXXXXXXXXXXXXXXPNIPVLHGANRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANS 40 +P 
+ KPV+TV EK+ TSQAQSR++FFN VRKKS+A Sbjct: 338 PAAPTRGPPNNHL-----VP-----DHKPVITVLEKRPTSQAQSRNDFFNTVRKKSMAFP 387 Query: 39 SP 34 SP Sbjct: 388 SP 389 >ref|XP_007041568.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508705503|gb|EOX97399.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 620 Score = 309 bits (792), Expect = 3e-81 Identities = 214/482 (44%), Positives = 275/482 (57%), Gaps = 21/482 (4%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNL--------ARNKSFMHSNGHDSGR 1231 MERSEP+LVPEWLK+ + RNK + + HD G Sbjct: 1 MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSV-AGDHDVGG 59 Query: 1230 SSASDGPTSSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHR 1051 +S D TS+YF RSS+SN S LRSYSSF D+ K V D+R+R Sbjct: 60 TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 119 Query: 1050 DFSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNN----KGLLTEG 883 +FSD +N+ P+ FE+D L RSQS I+GKR +T PKKV +D +++++N GLL+ G Sbjct: 120 NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLS-G 177 Query: 882 SPVGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSA 703 NK+ FE++FP LGAEER E+GRV SP LSTA QSLP+G+S++ G + WTSA Sbjct: 178 VSTTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSA 237 Query: 702 LAEVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGT 523 LA++P VGS GLNMAET+ Q P+ +T + GT Sbjct: 238 LADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGT 297 Query: 522 PRLEELAIKQSRQLIP-VTQSMPKTLVLNSSDK---KVGQQHTLASSLPVSHSTRGVPEK 355 RLEELAIKQSRQL+P VT S PK LV++ S+K KVGQQ + SL + TRG + Sbjct: 298 QRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSL---NYTRGGTSR 354 Query: 354 SDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSG-SRILNSPLVVXXXXXXXXXXXX 178 SD K SN G+L +LKP RE NGVS KDN SPT+G S+++NSPL V Sbjct: 355 SDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSV-TPSASASAPFR 413 Query: 177 XXPNIPVLHGA--NRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANS-SPVPDRS-MAN 10 N P A N+ P EK+ T+QAQSR++FFNL++KKS NS S V DR A+ Sbjct: 414 
SSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRGPAAS 473 Query: 9 PS 4 PS Sbjct: 474 PS 475 >ref|XP_007041567.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508705502|gb|EOX97398.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 625 Score = 309 bits (792), Expect = 3e-81 Identities = 214/482 (44%), Positives = 275/482 (57%), Gaps = 21/482 (4%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNL--------ARNKSFMHSNGHDSGR 1231 MERSEP+LVPEWLK+ + RNK + + HD G Sbjct: 6 MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSV-AGDHDVGG 64 Query: 1230 SSASDGPTSSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHR 1051 +S D TS+YF RSS+SN S LRSYSSF D+ K V D+R+R Sbjct: 65 TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 124 Query: 1050 DFSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNN----KGLLTEG 883 +FSD +N+ P+ FE+D L RSQS I+GKR +T PKKV +D +++++N GLL+ G Sbjct: 125 NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLS-G 182 Query: 882 SPVGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSA 703 NK+ FE++FP LGAEER E+GRV SP LSTA QSLP+G+S++ G + WTSA Sbjct: 183 VSTTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSA 242 Query: 702 LAEVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGT 523 LA++P VGS GLNMAET+ Q P+ +T + GT Sbjct: 243 LADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGT 302 Query: 522 PRLEELAIKQSRQLIP-VTQSMPKTLVLNSSDK---KVGQQHTLASSLPVSHSTRGVPEK 355 RLEELAIKQSRQL+P VT S PK LV++ S+K KVGQQ + SL + TRG + Sbjct: 303 QRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSL---NYTRGGTSR 359 Query: 354 SDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSG-SRILNSPLVVXXXXXXXXXXXX 178 SD K SN G+L +LKP RE NGVS KDN SPT+G S+++NSPL V Sbjct: 360 SDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSV-TPSASASAPFR 418 Query: 177 XXPNIPVLHGA--NRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANS-SPVPDRS-MAN 10 N P A N+ P EK+ T+QAQSR++FFNL++KKS NS S V DR A+ Sbjct: 419 SSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRGPAAS 478 Query: 9 PS 4 PS Sbjct: 479 PS 480 
>gb|EYU18535.1| hypothetical protein MIMGU_mgv1a006469mg [Erythranthe guttata] Length = 443 Score = 301 bits (771), Expect = 9e-79 Identities = 181/331 (54%), Positives = 211/331 (63%), Gaps = 9/331 (2%) Frame = -2 Query: 978 MISGKRGETEPKKVVTDLRSAS--RNNKGLLTEGSPVGSANKAAFEKDFPSLGAEERPAT 805 MISGK GET PKKVVT+ S S N G L +GSPVG ANKA FE+DFPSLG ++R Sbjct: 1 MISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGSPVGVANKATFERDFPSLGTDDRAVV 60 Query: 804 PEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALAEVPVLVGSD-XXXXXXXXXXXXXX 628 PEVGRV SP LS+A+QSLPIGSS+ IGGE+WTSALAEVP+LV S+ Sbjct: 61 PEVGRVASPGLSSALQSLPIGSSASIGGERWTSALAEVPMLVVSNGTASLSVQQAAPSST 120 Query: 627 XXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPRLEELAIKQSRQLIPVTQSMPKTL 448 LNMAE VAQ PT QT Q S GT RLEELAIKQSRQLIPVT +MPKTL Sbjct: 121 TASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLGTQRLEELAIKQSRQLIPVTPTMPKTL 180 Query: 447 VLNSSDK---KVG--QQHTLASSLPVSHSTRGV-PEKSDLSKTSNVGKLHVLKPVRERNG 286 VL+SSDK KVG QQH SSLP++ S RG P K D SK SNVGKLHVLKPVRE+NG Sbjct: 181 VLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAPPSKPDFSKASNVGKLHVLKPVREKNG 240 Query: 285 VSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXXXXXXPNIPVLHGANRKPVLTVSEKKS 106 V+PS KD SPT + +NS L P A + + T EK+ Sbjct: 241 VTPSVKDKLSPTGSGKAVNSTL-------------------PASPSAVKPLLTTALEKRP 281 Query: 105 TSQAQSRSEFFNLVRKKSLANSSPVPDRSMA 13 T+QAQSR++FF +R+KS++NSS + A Sbjct: 282 TTQAQSRNDFFKRMREKSVSNSSSASETGTA 312 >ref|XP_007225552.1| hypothetical protein PRUPE_ppa002972m2g, partial [Prunus persica] gi|462422488|gb|EMJ26751.1| hypothetical protein PRUPE_ppa002972m2g, partial [Prunus persica] Length = 571 Score = 285 bits (730), Expect = 5e-74 Identities = 215/500 (43%), Positives = 267/500 (53%), Gaps = 38/500 (7%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNL--------ARNKSFMHSNGHDSGR 1231 MERSEPTLVPEWL++ S+ RN++ + D+ R Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSAHHFASSSSHSDVTSLAHHLRNRTSKSISDFDTPR 60 Query: 1230 SSASDGPTSSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHR 1051 S+ +SS R S+SN S + +YSSF +K + +GD+ R Sbjct: 61 
SAFLLDRSSSSNSRRSSSNGSAK-HAYSSFN------RSHRDKDRDKEKERLNYGDHWDR 113 Query: 1050 DFSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSAS---RNNKGLLTEGS 880 D SD NIF ++ E+D LRRSQSM++ K+ E P++ V D +S++ N GLL S Sbjct: 114 DCSDPLGNIFTSRVEKDTLRRSQSMVARKQSELLPRRAVIDSKSSNSNHNNGNGLL---S 170 Query: 879 PVG-SANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSA 703 VG S K F+KDFPSLG EERPA P++GRVPSP STA+QSLP+GSS++IGGE WTSA Sbjct: 171 GVGVSIQKVVFDKDFPSLGTEERPAVPDIGRVPSPGFSTAVQSLPVGSSALIGGEGWTSA 230 Query: 702 LAEVP-VLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAG 526 LAEVP ++ S GLNMAE +AQ P +T Q S Sbjct: 231 LAEVPSTIIASSSSGSFPVQPTVAATSGSGTSTAMAGLNMAEALAQAPARARTAPQLSIK 290 Query: 525 TPRLEELAIKQSRQLIPVTQSMPKTLVLNSSDK----------------KVGQQHTLASS 394 T RLEELAIKQSRQLIPVT SMPK VLNSSDK K GQQ + Sbjct: 291 TQRLEELAIKQSRQLIPVTPSMPKASVLNSSDKSKPKTAARTGEMNVPAKGGQQQQPSQL 350 Query: 393 LPVSHSTRGVPEKSDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPT-SGSRILNSPLV 217 + S RG P KSD KTS+ GK VLKPV E NGVS S KD SPT + SR+ NSPLV Sbjct: 351 HHANQSLRGGPVKSDPPKTSH-GKFLVLKPVWE-NGVSSSPKDVTSPTNNASRVANSPLV 408 Query: 216 VXXXXXXXXXXXXXXPNIPVLHGANRKPVL-------TVSEKKSTSQAQSRSEFFNLVRK 58 V PN P L RK T+ ++ S SQ QSR++FFNL++K Sbjct: 409 V---APAVASAPLRSPNNPKLSPVERKVAALDLKSGSTLEKRPSLSQVQSRNDFFNLLKK 465 Query: 57 KSLANSS-PVPDRSMANPSP 1 K+ NSS +PD SP Sbjct: 466 KTSMNSSITLPDSGPIISSP 485 >ref|XP_012467689.1| PREDICTED: uncharacterized protein LOC105786006 [Gossypium raimondii] gi|823135857|ref|XP_012467690.1| PREDICTED: uncharacterized protein LOC105786006 [Gossypium raimondii] gi|763748559|gb|KJB15998.1| hypothetical protein B456_002G207700 [Gossypium raimondii] gi|763748560|gb|KJB15999.1| hypothetical protein B456_002G207700 [Gossypium raimondii] Length = 629 Score = 281 bits (720), Expect = 7e-73 Identities = 205/485 (42%), Positives = 268/485 (55%), Gaps = 23/485 (4%) Frame = -2 Query: 1386 MERSEPTLVPEWLK----------NXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDS 1237 MERSEP+LVPEWLK + ARNK + S+G D Sbjct: 1 
MERSEPSLVPEWLKCSGSLTGSGNSNNQFTSSSSSSHSDNHSAVRHARNKLSVDSDG-DI 59 Query: 1236 GRSSASDGPTSSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYR 1057 GR+S D +S+YF RSS+S + SYS+FG D+ V D R Sbjct: 60 GRTSVLDRASSAYFRRSSSSKGASDSWSYSNFGKGHRERDWEKVSNGYHDRKNAVLSDQR 119 Query: 1056 HRDFSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNN----KGLLT 889 +R+ SD +N+ P+ FE+D LRRSQS+ +GK +T P+K + S+++ G LT Sbjct: 120 NRNHSDSLDNLLPSMFEKDVLRRSQSLKTGKHSDTWPRKATNESSGTSKSHHSSGNGKLT 179 Query: 888 EGSPVGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWT 709 + VG NK+AFE+DFPSLGAE R E+GR+ SP L+ +QSLP+G+S V+G + T Sbjct: 180 TVAAVG--NKSAFERDFPSLGAEVRQVGSEIGRILSPGLTNPVQSLPVGTSPVLGSDGRT 237 Query: 708 SALAEVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSA 529 SALA++PV VG+ GLNMAE VAQ P+ +T + Sbjct: 238 SALADIPVGVGNS---GRGVAVASQNVPAGSTPTMVTGLNMAEAVAQGPSRARTPPLLNV 294 Query: 528 GTPRLEELAIKQSRQLIP-VTQSMPKTLVLNSSDK---KVGQQHTLASSLPVSHSTRGVP 361 T RLEELAIKQSRQLIP VT S PKTLV++ S+K KVGQQ L SL STRG Sbjct: 295 ETQRLEELAIKQSRQLIPLVTVSTPKTLVVSPSEKSRPKVGQQ--LHPSLSFG-STRGGT 351 Query: 360 EKSDLSKTSNVGKLHVLKPVRERNGVSP-SAKDNFSPTSGS-RILNSPLVVXXXXXXXXX 187 +SD K SN +L +LKP RE NGVS + +DN SPT+GS + NSP+ + Sbjct: 352 SRSDSQKVSNESRLLILKPSRESNGVSSITTRDNLSPTNGSNKFANSPINI-TPSAAASV 410 Query: 186 XXXXXPNIPVLHGA--NRKPVLTVSEKKSTSQAQSRSEFFNLVRKKSLANS-SPVPDRSM 16 N P L A N+ PV EK++T+QAQSR++FFNL++KKS +NS S V D Sbjct: 411 PFRSSGNSPRLATAERNQTPVRMTMEKRATAQAQSRNDFFNLLKKKSTSNSASSVLDSGS 470 Query: 15 ANPSP 1 A P Sbjct: 471 AVSPP 475 >ref|XP_012078152.1| PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X2 [Jatropha curcas] Length = 599 Score = 275 bits (704), Expect = 5e-71 Identities = 199/491 (40%), Positives = 260/491 (52%), Gaps = 29/491 (5%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 MERSEPTLVPEWL++ S + S H+ +S + D P Sbjct: 1 MERSEPTLVPEWLRSSGSVSGGGSSVHHFASSSSLSDVSSSAHHTRNRNSKGLTDFDSPR 60 Query: 1206 SSYFHRSSNSN-----ISGRLR-SYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHRDF 
1045 S++ R+S+SN I+G + +YSSF K + F D+ RD Sbjct: 61 SAFLDRTSSSNSRRSSINGSAKHAYSSFSRSHRDKDRERD------KERLNFVDHWDRDG 114 Query: 1044 SDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNN----KGLLTEGSP 877 D +I ++ E+D LRRS SM+S K+GE P++ DL++ S N GLL+ G Sbjct: 115 PDPLGSILSSRSEKDTLRRSHSMVSRKQGEVLPRRFAVDLKNGSSGNHTNGNGLLSGGIV 174 Query: 876 VGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALA 697 + KA FEKDFPSLG EER PE+GRV SP LSTA+Q+LP+GSS++IGGE WTSALA Sbjct: 175 GSNIQKAVFEKDFPSLGCEERQGVPEIGRVSSPSLSTAVQNLPVGSSALIGGEGWTSALA 234 Query: 696 EVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPR 517 EVP L+G+ GLNMAE + Q P+ +T Q S T R Sbjct: 235 EVPALIGNS-STGSLSSVQSVAASASACPSVMAGLNMAEALTQAPSRTRTAPQLSVQTQR 293 Query: 516 LEELAIKQSRQLIPVTQSMPKTLVLNSSDK-------KVGQQHTLA-------SSLPVSH 379 LEELAIKQSRQLIPVT SMPK+ VLNSSDK + G+ + A S+L ++ Sbjct: 294 LEELAIKQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRSGEMNMAAKSMQQQSSALHPTN 353 Query: 378 STRGVPEKSDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSG-SRILNSPLVVXXXX 202 + G+ K+D KTS+ GKL VLKP E NGVSPS KD SPT+ SR NS L Sbjct: 354 QSLGIHVKTDAPKTSH-GKLFVLKPGWE-NGVSPSPKDIASPTNNVSRAANSQLAAPASV 411 Query: 201 XXXXXXXXXXPNIP---VLHGANRKPVLTVS-EKKSTSQAQSRSEFFNLVRKKSLANSSP 34 + AN + + EK+ SQ QSR++FFNL++KK+ +S Sbjct: 412 TSVPLRSPNNAKLSSSGERKSANSNMISAFNVEKRPLSQTQSRNDFFNLLKKKTSNSSPA 471 Query: 33 VPDRSMANPSP 1 +PD S SP Sbjct: 472 LPDSSSVVSSP 482 >ref|XP_002513834.1| conserved hypothetical protein [Ricinus communis] gi|223546920|gb|EEF48417.1| conserved hypothetical protein [Ricinus communis] Length = 596 Score = 274 bits (701), Expect = 1e-70 Identities = 198/489 (40%), Positives = 252/489 (51%), Gaps = 27/489 (5%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 MERSEPTLVPEWL++ + + S HS +S +S D P Sbjct: 1 MERSEPTLVPEWLRSSGSVPGGGSSAHHFASSSPHSDVSSSVHHSRSRNSKSTSDFDSPR 60 Query: 1206 SSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHRDFSDVFEN 1027 S++ R+S+SN + S+ DK + FG++ D SD + Sbjct: 61 
SAFLDRTSSSNSRRSSSNGSAKHAYSSFSRSHRDKDRERDKERLNFGNHWDNDASDPLGS 120 Query: 1026 IFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNN----KGLLTEGSPVGSANK 859 I ++ E+D LRRS SM+S K GE P++ DLR+ S +N GL++ G S K Sbjct: 121 IL-SRNEKDALRRSHSMVSRKLGEVLPRRFAADLRNGSNSNHVNGNGLISGGGVGNSIPK 179 Query: 858 AAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALAEVPVLV 679 A FEKDFPSLG+EER P++GRV SP LSTA+QSLP+ SS++IGGE WTSALAEVP ++ Sbjct: 180 AVFEKDFPSLGSEERQGAPDIGRVSSPGLSTAVQSLPVSSSALIGGEGWTSALAEVPAII 239 Query: 678 GSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTPRLEELAI 499 G++ GLNMAE + Q PT +T Q S T RLEELAI Sbjct: 240 GNN-SSGSSSSVQTVATSASGAPSTVAGLNMAEALTQAPTRTRTAPQLSVQTQRLEELAI 298 Query: 498 KQSRQLIPVTQSMPKTLVLNSSDKKVGQ---------------QHTLASSLPVSHSTRGV 364 KQSRQLIPVT SMPK+ VLNSSDK + Q +S V+ S G Sbjct: 299 KQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRSSEMNMAPKNLQQQPSSLHAVTQSLAGG 358 Query: 363 PEKSDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSGSRILNSPLVVXXXXXXXXXX 184 KSD SK S+ GKL VLKP E NG SPS KD +P + R NS L Sbjct: 359 HVKSDASKASH-GKLFVLKPGWE-NGASPSPKDIANPNNAGRAANSQLAA---APSVPSA 413 Query: 183 XXXXPNIPVLHGANRKPV-------LTVSEKKSTSQAQSRSEFFNLVRKKSLANSS-PVP 28 PN P L RK V ++ SQ QSR +FFNL++KK+L NSS + Sbjct: 414 PLRSPNNPKLSAGERKSASLNLISGFNVEKRPLLSQTQSRHDFFNLLKKKTLKNSSTALT 473 Query: 27 DRSMANPSP 1 D + A SP Sbjct: 474 DSASAISSP 482 >ref|XP_008233924.1| PREDICTED: cell wall protein AWA1 [Prunus mume] Length = 612 Score = 273 bits (698), Expect = 3e-70 Identities = 210/498 (42%), Positives = 260/498 (52%), Gaps = 36/498 (7%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNL--------ARNKSFMHSNGHDSGR 1231 MERSEPTLVPEWL++ S+ RN++ + D+ R Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSAHHFASSSSHSDVTSLAHHLRNRASKSISDFDTPR 60 Query: 1230 SSASDGPTSSYFHRSSNSNISGRLRSYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHR 1051 S+ +SS R S+SN S + +YSSF +K + +GD+ R Sbjct: 61 SAFLLDRSSSSNSRRSSSNGSAK-HAYSSFN------RSHRDKDRDKEKERLNYGDHWDR 113 Query: 1050 DFSDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRS--ASRNNKGLLTEGSP 877 D SD NIF ++ E+D LRRSQSM++ K+ E P++ V D +S ++ NN 
L G Sbjct: 114 DCSDPLGNIFTSRVEKDTLRRSQSMVARKQSELLPRRAVIDSKSSNSNHNNGNGLLSGVG 173 Query: 876 VGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALA 697 VG K F+KDFPSLG EERPA P++GRVPSP QSLP+GSS++IGGE WTSALA Sbjct: 174 VG-IQKVVFDKDFPSLGTEERPAVPDIGRVPSP----GFQSLPVGSSALIGGEGWTSALA 228 Query: 696 EVP-VLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCPTLIQTTTQSSAGTP 520 EVP ++ S GLNMAE +AQ P +T Q S T Sbjct: 229 EVPSTIIASSSSGSFPVQPTVAATSASGTSTAMAGLNMAEALAQAPARARTAPQLSIKTQ 288 Query: 519 RLEELAIKQSRQLIPVTQSMPKTLVLNSSDK----------------KVGQQHTLASSLP 388 RLEELAIKQSRQLIPVT SMPK VLNSSDK K GQQ + Sbjct: 289 RLEELAIKQSRQLIPVTPSMPKASVLNSSDKSKPKTAARTGEMNVPAKGGQQQQPSQLHH 348 Query: 387 VSHSTRGVPEKSDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPT-SGSRILNSPLVVX 211 + S RG P KSD KTS+ GK VLKPV E NGVS S KD SPT + SR NSPLVV Sbjct: 349 ANQSLRGGPVKSDPPKTSH-GKFLVLKPVWE-NGVSSSPKDVTSPTNNASRAANSPLVV- 405 Query: 210 XXXXXXXXXXXXXPNIPVLHGANRKPVL-------TVSEKKSTSQAQSRSEFFNLVRKKS 52 PN P L RK T+ ++ S SQ QSR++FFNL++KK+ Sbjct: 406 --APAVASAPLRSPNNPKLSPVERKVAALDLKSGSTLEKRPSLSQVQSRNDFFNLLKKKT 463 Query: 51 LANSS-PVPDRSMANPSP 1 NSS +PD SP Sbjct: 464 SMNSSITLPDSGPIISSP 481 >ref|XP_012078151.1| PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X1 [Jatropha curcas] gi|643723136|gb|KDP32741.1| hypothetical protein JCGZ_12033 [Jatropha curcas] Length = 603 Score = 272 bits (696), Expect = 4e-70 Identities = 201/495 (40%), Positives = 260/495 (52%), Gaps = 33/495 (6%) Frame = -2 Query: 1386 MERSEPTLVPEWLKNXXXXXXXXXXXXXXXXXXSNLARNKSFMHSNGHDSGRSSASDGPT 1207 MERSEPTLVPEWL++ S + S H+ +S + D P Sbjct: 1 MERSEPTLVPEWLRSSGSVSGGGSSVHHFASSSSLSDVSSSAHHTRNRNSKGLTDFDSPR 60 Query: 1206 SSYFHRSSNSN-----ISGRLR-SYSSFGXXXXXXXXXXXXXXXXDKAKPVFGDYRHRDF 1045 S++ R+S+SN I+G + +YSSF K + F D+ RD Sbjct: 61 SAFLDRTSSSNSRRSSINGSAKHAYSSFSRSHRDKDRERD------KERLNFVDHWDRDG 114 Query: 1044 SDVFENIFPNKFERDGLRRSQSMISGKRGETEPKKVVTDLRSASRNN----KGLLTEGSP 877 D +I ++ E+D LRRS SM+S K+GE P++ DL++ S N GLL+ G Sbjct: 115 
PDPLGSILSSRSEKDTLRRSHSMVSRKQGEVLPRRFAVDLKNGSSGNHTNGNGLLSGGIV 174 Query: 876 VGSANKAAFEKDFPSLGAEERPATPEVGRVPSPVLSTAIQSLPIGSSSVIGGEKWTSALA 697 + KA FEKDFPSLG EER PE+GRV SP LSTA+Q+LP+GSS++IGGE WTSALA Sbjct: 175 GSNIQKAVFEKDFPSLGCEERQGVPEIGRVSSPSLSTAVQNLPVGSSALIGGEGWTSALA 234 Query: 696 EVPVLVGSDXXXXXXXXXXXXXXXXXXXXXXXXGLNMAETVAQCP----TLIQTTTQSSA 529 EVP L+G+ GLNMAE + Q P T Q T Q S Sbjct: 235 EVPALIGNS-STGSLSSVQSVAASASACPSVMAGLNMAEALTQAPSRTRTAPQVTEQLSV 293 Query: 528 GTPRLEELAIKQSRQLIPVTQSMPKTLVLNSSDK-------KVGQQHTLA-------SSL 391 T RLEELAIKQSRQLIPVT SMPK+ VLNSSDK + G+ + A S+L Sbjct: 294 QTQRLEELAIKQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRSGEMNMAAKSMQQQSSAL 353 Query: 390 PVSHSTRGVPEKSDLSKTSNVGKLHVLKPVRERNGVSPSAKDNFSPTSG-SRILNSPLVV 214 ++ + G+ K+D KTS+ GKL VLKP E NGVSPS KD SPT+ SR NS L Sbjct: 354 HPTNQSLGIHVKTDAPKTSH-GKLFVLKPGWE-NGVSPSPKDIASPTNNVSRAANSQLAA 411 Query: 213 XXXXXXXXXXXXXXPNIP---VLHGANRKPVLTVS-EKKSTSQAQSRSEFFNLVRKKSLA 46 + AN + + EK+ SQ QSR++FFNL++KK+ Sbjct: 412 PASVTSVPLRSPNNAKLSSSGERKSANSNMISAFNVEKRPLSQTQSRNDFFNLLKKKTSN 471 Query: 45 NSSPVPDRSMANPSP 1 +S +PD S SP Sbjct: 472 SSPALPDSSSVVSSP 486