BLASTX nr result
ID: Forsythia23_contig00005188
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia23_contig00005188 (830 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU19721.1| hypothetical protein MIMGU_mgv1a011346mg [Erythra... 362 2e-97 ref|XP_012858818.1| PREDICTED: uncharacterized protein LOC105977... 358 3e-96 ref|XP_011084600.1| PREDICTED: uncharacterized protein LOC105166... 352 1e-94 ref|XP_009791066.1| PREDICTED: uncharacterized protein LOC104238... 352 2e-94 ref|XP_009624097.1| PREDICTED: uncharacterized protein LOC104115... 346 9e-93 ref|XP_006343655.1| PREDICTED: uncharacterized protein LOC102589... 341 4e-91 ref|XP_004242589.1| PREDICTED: uncharacterized protein LOC101266... 338 3e-90 emb|CDP04958.1| unnamed protein product [Coffea canephora] 335 3e-89 ref|XP_002278704.1| PREDICTED: uncharacterized protein LOC100247... 325 2e-86 ref|XP_010034968.1| PREDICTED: uncharacterized protein LOC104424... 320 9e-85 ref|XP_010269910.1| PREDICTED: uncharacterized protein LOC104606... 319 1e-84 ref|XP_007029500.1| Uncharacterized protein TCM_025373 [Theobrom... 307 5e-81 emb|CBI26577.3| unnamed protein product [Vitis vinifera] 306 1e-80 ref|XP_012458959.1| PREDICTED: uncharacterized protein LOC105779... 305 3e-80 gb|KHG14627.1| Replicase polyprotein 1ab [Gossypium arboreum] 301 3e-79 gb|EPS71910.1| hypothetical protein M569_02850, partial [Genlise... 301 4e-79 ref|XP_012090950.1| PREDICTED: uncharacterized protein LOC105649... 297 5e-78 gb|KHF99621.1| hypothetical protein F383_17009 [Gossypium arboreum] 291 3e-76 ref|XP_012476851.1| PREDICTED: uncharacterized protein LOC105792... 291 5e-76 ref|XP_010110458.1| hypothetical protein L484_001857 [Morus nota... 290 1e-75 >gb|EYU19721.1| hypothetical protein MIMGU_mgv1a011346mg [Erythranthe guttata] Length = 285 Score = 362 bits (929), Expect = 2e-97 Identities = 180/267 (67%), Positives = 205/267 (76%), Gaps = 17/267 (6%) Frame = -2 Query: 754 YLPFAFCKPASLFVVMASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSD 575 +L +F P SLF MASSLPWHP++ +K Q+ YR PN +P+R A +IRAF RSD Sbjct: 25 FLCKSFGSPFSLF--MASSLPWHPVLTSKPQKHSFYR----PNIVPVRRAATIRAFGRSD 78 Query: 574 LDGFAKRVASGEAWRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRA 395 +DGFA+RV SGE WR WR ANDGFE +YET+KTAERI+RRY VS RFS V SAADRA Sbjct: 79 IDGFAQRVRSGELWRSAWRSANDGFELFVYETRKTAERIDRRYDVSGRFSVVAQSAADRA 138 Query: 394 REFDREFGITQRWRTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF------------- 254 RE DR+F ITQRWRT SLDF+RN PRYR+Q NDFLDTPLGRSFAT+F Sbjct: 139 RELDRDFEITQRWRTFSLDFTRNLPRYRKQLNDFLDTPLGRSFATLFFLWFALSGWLFRF 198 Query: 253 ----XWVLPFAGPLLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDF 86 WVLPFAGPLLIGA+ANNLVIKGECPACR+ FVGYKSQTVRCASCGN VWQP+ DF Sbjct: 199 LIFATWVLPFAGPLLIGAIANNLVIKGECPACRKPFVGYKSQTVRCASCGNTVWQPKNDF 258 Query: 85 FSKGSRGTTSSPKKSQPDVIDVEFEEK 5 FS+G RG+ +S KSQPD+IDVEFEEK Sbjct: 259 FSRGDRGSNTSSSKSQPDIIDVEFEEK 285 >ref|XP_012858818.1| PREDICTED: uncharacterized protein LOC105977962 [Erythranthe guttatus] Length = 248 Score = 358 bits (918), Expect = 3e-96 Identities = 174/252 (69%), Positives = 197/252 (78%), Gaps = 17/252 (6%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEAWR 530 MASSLPWHP++ +K Q+ YR PN +P+R A +IRAF RSD+DGFA+RV SGE WR Sbjct: 1 MASSLPWHPVLTSKPQKHSFYR----PNIVPVRRAATIRAFGRSDIDGFAQRVRSGELWR 56 Query: 529 DVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWRT 350 WR ANDGFE +YET+KTAERI+RRY VS RFS V SAADRARE DR+F ITQRWRT Sbjct: 57 SAWRSANDGFELFVYETRKTAERIDRRYDVSGRFSVVAQSAADRARELDRDFEITQRWRT 116 Query: 349 LSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPLL 221 SLDF+RN PRYR+Q NDFLDTPLGRSFAT+F WVLPFAGPLL Sbjct: 117 FSLDFTRNLPRYRKQLNDFLDTPLGRSFATLFFLWFALSGWLFRFLIFATWVLPFAGPLL 176 Query: 220 IGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSPKKS 41 IGA+ANNLVIKGECPACR+ FVGYKSQTVRCASCGN VWQP+ DFFS+G RG+ +S KS Sbjct: 177 IGAIANNLVIKGECPACRKPFVGYKSQTVRCASCGNTVWQPKNDFFSRGDRGSNTSSSKS 236 Query: 40 QPDVIDVEFEEK 5 QPD+IDVEFEEK Sbjct: 237 QPDIIDVEFEEK 248 >ref|XP_011084600.1| PREDICTED: uncharacterized protein LOC105166810 [Sesamum indicum] Length = 241 Score = 352 bits (904), Expect = 1e-94 Identities = 182/254 (71%), Positives = 197/254 (77%), Gaps = 19/254 (7%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQR--FHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEA 536 MASSL WHPL+P K QR FH PN + SIRAFRRSDLDGFA+RVASGE Sbjct: 1 MASSLQWHPLLPAKPQRRLFH------PPNSV------SIRAFRRSDLDGFAQRVASGEL 48 Query: 535 WRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRW 356 WRD WR ANDGFE +YET+KTAERI+RRYAVS R S V SA DRARE DREF ITQRW Sbjct: 49 WRDAWRRANDGFELFVYETRKTAERIDRRYAVSRRLSTVAQSATDRARELDREFEITQRW 108 Query: 355 RTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGP 227 R+ +LDFSRNWPRYRRQ +DFLDTPLGRSFAT+F WVLPFAGP Sbjct: 109 RSFTLDFSRNWPRYRRQLSDFLDTPLGRSFATLFFLWFALSGWLFRVLIFATWVLPFAGP 168 Query: 226 LLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSPK 47 LLIGA+ANNLVIKGECPAC+R FVG KSQTV+CASCGNIVWQPQGDFFS+G RGTTS+ Sbjct: 169 LLIGALANNLVIKGECPACKRPFVGNKSQTVQCASCGNIVWQPQGDFFSRGGRGTTST-S 227 Query: 46 KSQPDVIDVEFEEK 5 KSQPDVIDVEFEEK Sbjct: 228 KSQPDVIDVEFEEK 241 >ref|XP_009791066.1| PREDICTED: uncharacterized protein LOC104238417 [Nicotiana sylvestris] Length = 249 Score = 352 bits (902), Expect = 2e-94 Identities = 176/253 (69%), Positives = 197/253 (77%), Gaps = 18/253 (7%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEAWR 530 MA LPWHPL P+KTQR R + + +PIR + AFRRSD DGFA+RV SGEAWR Sbjct: 1 MAICLPWHPLQPSKTQRLFRIRNPV--HVVPIRQVAYVHAFRRSDFDGFARRVKSGEAWR 58 Query: 529 DVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWRT 350 + WR ANDGFEQ ++ETKKTAERI+RRYAVS R S V SAADRARE DR+F ITQ+WRT Sbjct: 59 EAWRSANDGFEQFVFETKKTAERIDRRYAVSRRLSAVAQSAADRAREIDRDFEITQKWRT 118 Query: 349 LSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPLL 221 SLDFSRNWPRYRRQ +DF+DTPLGRS ATIF WVLPFAGPLL Sbjct: 119 FSLDFSRNWPRYRRQLSDFMDTPLGRSAATIFFLWFALSGWLFRILIIATWVLPFAGPLL 178 Query: 220 IGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQ-GDFFSKGSRGTTSSPKK 44 IGAVANNLVIKG+CPACRRQF+G K+ TVRCASCGNIVWQP+ GDFFS+GSRGTT S K Sbjct: 179 IGAVANNLVIKGQCPACRRQFIGNKNSTVRCASCGNIVWQPKGGDFFSRGSRGTTRS--K 236 Query: 43 SQPDVIDVEFEEK 5 S+PD+IDVEFEEK Sbjct: 237 SEPDIIDVEFEEK 249 >ref|XP_009624097.1| PREDICTED: uncharacterized protein LOC104115216 [Nicotiana tomentosiformis] Length = 246 Score = 346 bits (888), Expect = 9e-93 Identities = 175/253 (69%), Positives = 196/253 (77%), Gaps = 18/253 (7%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEAWR 530 MA LPWHPL P+KTQ R + + +PIRH I AFRRSD DGFA+RV SGEAWR Sbjct: 1 MAICLPWHPLQPSKTQHLFRIRNPV--HVVPIRH---IHAFRRSDFDGFARRVKSGEAWR 55 Query: 529 DVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWRT 350 D WR ANDGFEQ ++ETKKTAERI+RRYAVS R S V SAADRARE DR+F IT +WRT Sbjct: 56 DAWRSANDGFEQFVFETKKTAERIDRRYAVSRRLSAVAQSAADRARELDRDFEITHKWRT 115 Query: 349 LSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPLL 221 SLDF+RNWPRYR+Q +DF+DTPLGRS ATIF WVLPFAGPLL Sbjct: 116 FSLDFTRNWPRYRKQLSDFMDTPLGRSAATIFFLWFALSGWLFRILIIATWVLPFAGPLL 175 Query: 220 IGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQ-GDFFSKGSRGTTSSPKK 44 IGAVANNLVIKG+CPACRRQF+G K+ TVRCASCGNIVWQP+ GDFFS+GSRGTT S K Sbjct: 176 IGAVANNLVIKGQCPACRRQFIGNKNSTVRCASCGNIVWQPKGGDFFSRGSRGTTRS--K 233 Query: 43 SQPDVIDVEFEEK 5 S+PD+IDVEFEEK Sbjct: 234 SEPDIIDVEFEEK 246 >ref|XP_006343655.1| PREDICTED: uncharacterized protein LOC102589298 [Solanum tuberosum] Length = 250 Score = 341 bits (874), Expect = 4e-91 Identities = 170/253 (67%), Positives = 194/253 (76%), Gaps = 18/253 (7%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEAWR 530 MA+SLPWHPL PTKT RF +T +PIR + AFRRSD DGFAKRV SGEAW+ Sbjct: 1 MATSLPWHPLNPTKTLRFTS--RTTTKVVLPIRQVTYVHAFRRSDFDGFAKRVRSGEAWK 58 Query: 529 DVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWRT 350 DVWR ANDGFEQ LYE KKTAER++RRY VS + S+V SAADRARE DREF IT++WRT Sbjct: 59 DVWRNANDGFEQFLYEAKKTAERVDRRYDVSRKVSDVAQSAADRAREIDREFEITRKWRT 118 Query: 349 LSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPLL 221 SLDF N PRYR+Q NDF+DTPLGRS ATIF WVLPFAGPLL Sbjct: 119 FSLDFRSNLPRYRKQLNDFMDTPLGRSAATIFFLWFALSGWLFRILIIATWVLPFAGPLL 178 Query: 220 IGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQG-DFFSKGSRGTTSSPKK 44 IG VANNLV+KG+CP+CRRQF+G K+ TVRCA+CGN+VWQP+G DFFS+GSRG S+P K Sbjct: 179 IGVVANNLVVKGQCPSCRRQFIGNKNSTVRCANCGNVVWQPKGDDFFSRGSRG-GSTPSK 237 Query: 43 SQPDVIDVEFEEK 5 SQPD+IDVEFEEK Sbjct: 238 SQPDIIDVEFEEK 250 >ref|XP_004242589.1| PREDICTED: uncharacterized protein LOC101266837 [Solanum lycopersicum] Length = 248 Score = 338 bits (866), Expect = 3e-90 Identities = 171/253 (67%), Positives = 196/253 (77%), Gaps = 18/253 (7%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEAWR 530 MA SLPWHPL PTKT F + +T+K N +PIR + AFRRSD DGFAKRV SGEAW+ Sbjct: 1 MAISLPWHPLNPTKTLSFAN--RTVKVN-LPIRQVSYVHAFRRSDFDGFAKRVRSGEAWK 57 Query: 529 DVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWRT 350 DVWR ANDGFEQ LYE+KKTAERI+RRY VS + S+V SAADRARE DR+F IT++WRT Sbjct: 58 DVWRNANDGFEQFLYESKKTAERIDRRYDVSRKVSDVAQSAADRAREIDRDFEITRKWRT 117 Query: 349 LSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPLL 221 SLDF N PRYR+Q NDF+DTPLGRS TIF WVLPFAGPLL Sbjct: 118 FSLDFRSNLPRYRKQLNDFMDTPLGRSAVTIFFLWFALSGWLFRILIIATWVLPFAGPLL 177 Query: 220 IGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQ-GDFFSKGSRGTTSSPKK 44 IG VANNLVIKG+CP+CRRQF+G K+ TVRCA+CGN+VWQP+ GDFFS+GSRG T+S K Sbjct: 178 IGVVANNLVIKGQCPSCRRQFIGNKNSTVRCANCGNVVWQPKGGDFFSRGSRGGTTS--K 235 Query: 43 SQPDVIDVEFEEK 5 SQPD+IDVEFEEK Sbjct: 236 SQPDIIDVEFEEK 248 >emb|CDP04958.1| unnamed protein product [Coffea canephora] Length = 250 Score = 335 bits (858), Expect = 3e-89 Identities = 170/253 (67%), Positives = 193/253 (76%), Gaps = 18/253 (7%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIR-HAESIRAFRRSDLDGFAKRVASGEAW 533 M ++LPW L+P+K + FIP+ I+AFRRSD D FA+R+ SGEAW Sbjct: 1 MGATLPWQHLVPSKPIGLCPAQN--HKQFIPLHCQVSKIQAFRRSDFDVFARRITSGEAW 58 Query: 532 RDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWR 353 RD WR ANDGFEQ+LYETKKTAER++R+Y+VS R S VT SAADRARE DREF +TQRWR Sbjct: 59 RDAWRRANDGFEQLLYETKKTAERLDRQYSVSRRLSAVTRSAADRAREIDREFELTQRWR 118 Query: 352 TLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPL 224 T SLDFSRNWPR R+QF DFLDTPLG+SF TIF WVLPFAGPL Sbjct: 119 TFSLDFSRNWPRNRKQFIDFLDTPLGKSFTTIFFLWFALSGWLFRFLIFATWVLPFAGPL 178 Query: 223 LIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSPKK 44 LIGAVANNLVIKG CPACRRQF GYK+QTVRCASCGNIVWQPQGDFFS+GS+G++SS K Sbjct: 179 LIGAVANNLVIKGACPACRRQFAGYKNQTVRCASCGNIVWQPQGDFFSRGSQGSSSS-SK 237 Query: 43 SQPDVIDVEFEEK 5 S+ DVIDVEFEEK Sbjct: 238 SEHDVIDVEFEEK 250 >ref|XP_002278704.1| PREDICTED: uncharacterized protein LOC100247606 [Vitis vinifera] Length = 244 Score = 325 bits (834), Expect = 2e-86 Identities = 165/254 (64%), Positives = 186/254 (73%), Gaps = 19/254 (7%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHA--ESIRAFRRSDLDGFAKRVASGEA 536 M +SLPWHPL +K Q TL+ P+RH +RAFRRSD DGFAKR+ASG+A Sbjct: 1 MTTSLPWHPLFSSKPQ-------TLRRFAAPVRHRLPMPVRAFRRSDFDGFAKRMASGDA 53 Query: 535 WRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRW 356 WRD WR ANDGFE +++E KKTAERINR+YAVS RFSE SA D ARE DREF I +RW Sbjct: 54 WRDAWRSANDGFELLIFEAKKTAERINRQYAVSRRFSEAVGSAGDWAREVDREFEIGRRW 113 Query: 355 RTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGP 227 RT++LDF RNWPRYR+Q NDFLDTPLGRSFATIF WVLPFAGP Sbjct: 114 RTVTLDFGRNWPRYRKQLNDFLDTPLGRSFATIFFLWFALSGWLFRFLIFATWVLPFAGP 173 Query: 226 LLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSPK 47 LLIG ANN VIKG CPACRRQF+GYK+Q VRCA CGNIVWQP+GD S+GSRGT P Sbjct: 174 LLIGTFANNFVIKGNCPACRRQFIGYKNQIVRCAGCGNIVWQPKGD-VSRGSRGT--PPS 230 Query: 46 KSQPDVIDVEFEEK 5 SQ ++IDVEFEEK Sbjct: 231 SSQSEIIDVEFEEK 244 >ref|XP_010034968.1| PREDICTED: uncharacterized protein LOC104424300 [Eucalyptus grandis] gi|629079766|gb|KCW46211.1| hypothetical protein EUGRSUZ_K00108 [Eucalyptus grandis] Length = 250 Score = 320 bits (819), Expect = 9e-85 Identities = 159/259 (61%), Positives = 190/259 (73%), Gaps = 22/259 (8%) Frame = -2 Query: 715 VVMASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEA 536 +V +SLPW+P +P + +R ++P +P R A + AFRRSDL+ FA+RVASGEA Sbjct: 1 MVTTTSLPWNPALPARPRR-------VRPARLPTRAAPPVLAFRRSDLNHFAQRVASGEA 53 Query: 535 WRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRW 356 WRD WR AND FE ++E +KTAERI+RRY+VS R V SA+DRARE DREF I QRW Sbjct: 54 WRDAWRSANDRFELFIFEARKTAERIDRRYSVSRRLGAVAQSASDRAREIDREFEIGQRW 113 Query: 355 RTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGP 227 RT +LDFSRNWPRYRR+ NDF++TPLGR FATIF W+LPFAGP Sbjct: 114 RTFTLDFSRNWPRYRREINDFMETPLGRGFATIFFLWFALSGWLFRCLIFATWILPFAGP 173 Query: 226 LLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSK-----GSRGT 62 LLIG VANNL+IKG CPAC+RQFVGYK+Q VRCA+CGNIVWQP+GDFFS+ G R Sbjct: 174 LLIGTVANNLIIKGACPACKRQFVGYKNQIVRCANCGNIVWQPKGDFFSRDGFPGGGRRN 233 Query: 61 TSSPKKSQPDVIDVEFEEK 5 TSS KS+PD+IDVEFEEK Sbjct: 234 TSS--KSEPDIIDVEFEEK 250 >ref|XP_010269910.1| PREDICTED: uncharacterized protein LOC104606424 [Nelumbo nucifera] Length = 244 Score = 319 bits (818), Expect = 1e-84 Identities = 157/252 (62%), Positives = 183/252 (72%), Gaps = 17/252 (6%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEAWR 530 MA+SLPWHPL+ R+ L R +++AFRRSD DGFAKRV SGEAWR Sbjct: 1 MATSLPWHPLLSASKSHLSIRRRALST-----RQVVTVQAFRRSDFDGFAKRVTSGEAWR 55 Query: 529 DVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWRT 350 D WR ANDGFEQ L++ KKTAER++R+Y+VS R + V SA DRARE DRE I +RWR+ Sbjct: 56 DAWRSANDGFEQFLFDAKKTAERLDRQYSVSRRLNAVMQSATDRAREIDRELEIGRRWRS 115 Query: 349 LSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPLL 221 +LDFSRNWPRYRRQ NDFLDTPLGRSFAT+F WVLPFA PLL Sbjct: 116 FTLDFSRNWPRYRRQLNDFLDTPLGRSFATVFFLWFALSGWLFRVLIFATWVLPFAAPLL 175 Query: 220 IGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSPKKS 41 IG VANN VI+G CPAC+R+F+GYK+Q VRCASCGNIVWQP+GDFFS GS T+S+ S Sbjct: 176 IGTVANNFVIEGACPACKRRFMGYKNQVVRCASCGNIVWQPKGDFFSGGSNNTSST---S 232 Query: 40 QPDVIDVEFEEK 5 PD+IDVEFEEK Sbjct: 233 GPDIIDVEFEEK 244 >ref|XP_007029500.1| Uncharacterized protein TCM_025373 [Theobroma cacao] gi|508718105|gb|EOY10002.1| Uncharacterized protein TCM_025373 [Theobroma cacao] Length = 242 Score = 307 bits (787), Expect = 5e-81 Identities = 156/256 (60%), Positives = 185/256 (72%), Gaps = 21/256 (8%) Frame = -2 Query: 709 MASSLP---WHPLIPTKTQRFHHYRKTLKPNFIPIRHAES-IRAFRRSDLDGFAKRVASG 542 MA++LP WH + K + +P+R I++FRRSD D FA+R+ASG Sbjct: 1 MATALPSTAWHTI------------KLQPQSPLPLRRRPLLIQSFRRSDFDTFARRMASG 48 Query: 541 EAWRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQ 362 EAW+D WR ANDGFEQ ++E KKTAER++RRY+VS R S V SA DRARE DRE I Sbjct: 49 EAWKDAWRTANDGFEQFVFEAKKTAERLDRRYSVSRRVSSVVRSATDRAREIDRELEIGL 108 Query: 361 RWRTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFA 233 RWRT ++DFSRNWP YR+Q NDFLDTPLGRSFATIF W+LPFA Sbjct: 109 RWRTFTMDFSRNWPTYRKQLNDFLDTPLGRSFATIFFLWFALSGWLFRFLILATWILPFA 168 Query: 232 GPLLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSS 53 GPLLIGAVANNLVIKG CPAC+RQFVGYK+Q +RCASCGNIVWQP+GDFF + SRGT S Sbjct: 169 GPLLIGAVANNLVIKGACPACKRQFVGYKNQIIRCASCGNIVWQPEGDFFRRDSRGTNS- 227 Query: 52 PKKSQPDVIDVEFEEK 5 +KS+P++IDVEFEEK Sbjct: 228 -RKSEPEIIDVEFEEK 242 >emb|CBI26577.3| unnamed protein product [Vitis vinifera] Length = 214 Score = 306 bits (783), Expect = 1e-80 Identities = 150/215 (69%), Positives = 167/215 (77%), Gaps = 17/215 (7%) Frame = -2 Query: 598 IRAFRRSDLDGFAKRVASGEAWRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEV 419 +RAFRRSD DGFAKR+ASG+AWRD WR ANDGFE +++E KKTAERINR+YAVS RFSE Sbjct: 3 VRAFRRSDFDGFAKRMASGDAWRDAWRSANDGFELLIFEAKKTAERINRQYAVSRRFSEA 62 Query: 418 TNSAADRAREFDREFGITQRWRTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF----- 254 SA D ARE DREF I +RWRT++LDF RNWPRYR+Q NDFLDTPLGRSFATIF Sbjct: 63 VGSAGDWAREVDREFEIGRRWRTVTLDFGRNWPRYRKQLNDFLDTPLGRSFATIFFLWFA 122 Query: 253 ------------XWVLPFAGPLLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNI 110 WVLPFAGPLLIG ANN VIKG CPACRRQF+GYK+Q VRCA CGNI Sbjct: 123 LSGWLFRFLIFATWVLPFAGPLLIGTFANNFVIKGNCPACRRQFIGYKNQIVRCAGCGNI 182 Query: 109 VWQPQGDFFSKGSRGTTSSPKKSQPDVIDVEFEEK 5 VWQP+GD S+GSRGT P SQ ++IDVEFEEK Sbjct: 183 VWQPKGD-VSRGSRGT--PPSSSQSEIIDVEFEEK 214 >ref|XP_012458959.1| PREDICTED: uncharacterized protein LOC105779655 [Gossypium raimondii] gi|763808281|gb|KJB75183.1| hypothetical protein B456_012G029700 [Gossypium raimondii] Length = 242 Score = 305 bits (780), Expect = 3e-80 Identities = 156/255 (61%), Positives = 183/255 (71%), Gaps = 20/255 (7%) Frame = -2 Query: 709 MASSLP---WHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGE 539 MA++LP WH TK Q +P + R +++FRRSD D F +R+ASGE Sbjct: 1 MATALPSTVWHA---TKFQP--------QPPLLQRRRPLLVQSFRRSDFDTFTRRMASGE 49 Query: 538 AWRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQR 359 A +D WR ANDGFEQ ++E KKTAER++R+Y+VS R S SAADRARE DREF I R Sbjct: 50 ALKDAWRTANDGFEQFVFEAKKTAERLDRQYSVSRRLSSAAQSAADRAREIDREFEIGLR 109 Query: 358 WRTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAG 230 WRT S+DFSRNWPRYR+Q NDFLDTPLGRSFATIF W+LPFAG Sbjct: 110 WRTFSMDFSRNWPRYRKQLNDFLDTPLGRSFATIFFLWFALSGWMFRCLIFATWILPFAG 169 Query: 229 PLLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSP 50 PLLIG VANNLVIKG CPAC+RQFVGYK+Q VRC SCGNIVWQP+GDFF + S+GT S Sbjct: 170 PLLIGTVANNLVIKGACPACKRQFVGYKNQIVRCVSCGNIVWQPEGDFFRRDSKGTNS-- 227 Query: 49 KKSQPDVIDVEFEEK 5 +KS+PD+IDVEFEEK Sbjct: 228 RKSEPDIIDVEFEEK 242 >gb|KHG14627.1| Replicase polyprotein 1ab [Gossypium arboreum] Length = 242 Score = 301 bits (771), Expect = 3e-79 Identities = 145/215 (67%), Positives = 167/215 (77%), Gaps = 17/215 (7%) Frame = -2 Query: 598 IRAFRRSDLDGFAKRVASGEAWRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEV 419 +++FRRSD D F +R+ASGEA +D WR ANDGFEQ ++E KKTAER++ +Y+VS R S Sbjct: 30 VQSFRRSDFDTFTRRMASGEALKDAWRTANDGFEQFVFEAKKTAERLDHQYSVSRRLSSA 89 Query: 418 TNSAADRAREFDREFGITQRWRTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF----- 254 SAADRARE DREF I RWRT S+DFSRNWPRYR+Q NDFLDTPLGRSFATIF Sbjct: 90 AQSAADRAREIDREFEIGLRWRTFSMDFSRNWPRYRKQLNDFLDTPLGRSFATIFFLWFA 149 Query: 253 ------------XWVLPFAGPLLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNI 110 W+LPFAGPLLIG VANNLVIKG CPAC+RQFVGYK+Q VRC SCGNI Sbjct: 150 LSGWMFRCLIFATWILPFAGPLLIGTVANNLVIKGACPACKRQFVGYKNQIVRCVSCGNI 209 Query: 109 VWQPQGDFFSKGSRGTTSSPKKSQPDVIDVEFEEK 5 VWQP+GDFF + SRGT S +KS+PD+IDVEFEEK Sbjct: 210 VWQPEGDFFRRDSRGTNS--RKSEPDIIDVEFEEK 242 >gb|EPS71910.1| hypothetical protein M569_02850, partial [Genlisea aurea] Length = 247 Score = 301 bits (770), Expect = 4e-79 Identities = 153/251 (60%), Positives = 178/251 (70%), Gaps = 17/251 (6%) Frame = -2 Query: 706 ASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEAWRD 527 A++ P PL+ TKT L+ N IR A SI+AFRRSD+DGFAKRVASGE WR+ Sbjct: 9 ATNHPLRPLLHTKTGI------RLRSNSAVIRRASSIQAFRRSDIDGFAKRVASGELWRE 62 Query: 526 VWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWRTL 347 WR ANDGFE LYET+KTAER++RRY VS R S SA+DRARE DR+F +T+RWRT Sbjct: 63 AWRKANDGFELFLYETRKTAERLDRRYEVSRRLSAAAQSASDRARELDRDFELTRRWRTF 122 Query: 346 SLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPLLI 218 SLDF RN P YRRQ NDFLDTPLGRSFAT+F W+LPFAGPLLI Sbjct: 123 SLDFGRNLPMYRRQINDFLDTPLGRSFATLFLLWFTLSGWLFRFLIFATWILPFAGPLLI 182 Query: 217 GAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSPKKSQ 38 G++ANNLVIKGECPAC++QF+GYK+QTVRC +CGN VWQP + S S SQ Sbjct: 183 GSLANNLVIKGECPACKKQFIGYKNQTVRCTTCGNTVWQP------RSSSNRPPSGSNSQ 236 Query: 37 PDVIDVEFEEK 5 PD+IDVEFEEK Sbjct: 237 PDIIDVEFEEK 247 >ref|XP_012090950.1| PREDICTED: uncharacterized protein LOC105649037 [Jatropha curcas] gi|643705189|gb|KDP21806.1| hypothetical protein JCGZ_00593 [Jatropha curcas] Length = 234 Score = 297 bits (761), Expect = 5e-78 Identities = 150/252 (59%), Positives = 173/252 (68%), Gaps = 17/252 (6%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGEAWR 530 MA+++PW P R H +P RHA ++RAF+R D D FA R Sbjct: 1 MATTIPWRK--PPLLTRLHR-------RSVPYRHAATVRAFQRGDFDRFA---------R 42 Query: 529 DVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWRT 350 D WR ANDGFEQ ++E KK AERI+RRY+VS R + V SAADRARE DRE + RWRT Sbjct: 43 DAWRSANDGFEQFVFEAKKAAERIDRRYSVSRRITAVAQSAADRAREIDRELELGIRWRT 102 Query: 349 LSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPLL 221 S+DF+RNWPRYRRQ +DFLDTPLGRSFATIF WVLPFA PLL Sbjct: 103 FSMDFARNWPRYRRQLSDFLDTPLGRSFATIFFLWFALSGWLFRFLIFATWVLPFAAPLL 162 Query: 220 IGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSPKKS 41 IGAVANNLVIKG CPAC+RQFVGYKSQ +RCA CGNIVWQP+GDFFS+G +G +S KS Sbjct: 163 IGAVANNLVIKGNCPACKRQFVGYKSQVIRCAGCGNIVWQPEGDFFSRGGKGRGTSSSKS 222 Query: 40 QPDVIDVEFEEK 5 D+IDVEFEEK Sbjct: 223 NSDIIDVEFEEK 234 >gb|KHF99621.1| hypothetical protein F383_17009 [Gossypium arboreum] Length = 242 Score = 291 bits (746), Expect = 3e-76 Identities = 140/215 (65%), Positives = 165/215 (76%), Gaps = 17/215 (7%) Frame = -2 Query: 598 IRAFRRSDLDGFAKRVASGEAWRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEV 419 +++FRRSD D FA+RVASGEA +D WR AND FEQ ++E KKTAER+NR+Y+VS R S V Sbjct: 30 VQSFRRSDFDTFARRVASGEALKDAWRTANDRFEQFVFEAKKTAERLNRQYSVSRRLSSV 89 Query: 418 TNSAADRAREFDREFGITQRWRTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF----- 254 SA DRARE DRE I RWRT ++DF RNWPRYR+Q NDFL+TPLGRSFA IF Sbjct: 90 VQSATDRARELDRELEIGIRWRTFTMDFRRNWPRYRKQLNDFLETPLGRSFAMIFFLWFA 149 Query: 253 ------------XWVLPFAGPLLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNI 110 W+LPFAGPLLIG +AN+LVIKG CPACR+QF GYK+QT+ CASCGNI Sbjct: 150 LSGWLFRFLIFAMWILPFAGPLLIGTIANSLVIKGACPACRKQFAGYKNQTIHCASCGNI 209 Query: 109 VWQPQGDFFSKGSRGTTSSPKKSQPDVIDVEFEEK 5 VWQP+GDFF +GSR T S KKS+P++IDVEFEEK Sbjct: 210 VWQPEGDFFGRGSRRTGS--KKSEPEIIDVEFEEK 242 >ref|XP_012476851.1| PREDICTED: uncharacterized protein LOC105792695 [Gossypium raimondii] gi|823153996|ref|XP_012476852.1| PREDICTED: uncharacterized protein LOC105792695 [Gossypium raimondii] gi|823153998|ref|XP_012476853.1| PREDICTED: uncharacterized protein LOC105792695 [Gossypium raimondii] gi|823154000|ref|XP_012476854.1| PREDICTED: uncharacterized protein LOC105792695 [Gossypium raimondii] gi|823154002|ref|XP_012476856.1| PREDICTED: uncharacterized protein LOC105792695 [Gossypium raimondii] gi|763759397|gb|KJB26728.1| hypothetical protein B456_004G260300 [Gossypium raimondii] gi|763759398|gb|KJB26729.1| hypothetical protein B456_004G260300 [Gossypium raimondii] gi|763759399|gb|KJB26730.1| hypothetical protein B456_004G260300 [Gossypium raimondii] Length = 242 Score = 291 bits (744), Expect = 5e-76 Identities = 146/255 (57%), Positives = 180/255 (70%), Gaps = 20/255 (7%) Frame = -2 Query: 709 MASSLP---WHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRRSDLDGFAKRVASGE 539 M+++LP WH + + +R+ L +++FRRSD D FA+RVASGE Sbjct: 1 MSTALPSTAWHSMTLQRRPPLPQWRRPL-----------IVQSFRRSDFDTFARRVASGE 49 Query: 538 AWRDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQR 359 A +D WR AND FEQ ++E KKTAER++R+Y+VS R S V SA DRARE DREF I R Sbjct: 50 ALKDAWRTANDRFEQFVFEAKKTAERLDRQYSVSRRISSVVQSATDRARELDREFEIGIR 109 Query: 358 WRTLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAG 230 WRT ++DF RNWPRYR+Q NDFL+TPLGRSFATIF W+LPFAG Sbjct: 110 WRTSTMDFRRNWPRYRKQLNDFLETPLGRSFATIFFLWFALSGWLFRFLIFAMWILPFAG 169 Query: 229 PLLIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSP 50 PLLIG +AN+LVIKG CPACR+QF GYK+Q +RCASCGNIVWQP+GDFF + SR T S Sbjct: 170 PLLIGTIANSLVIKGACPACRKQFAGYKNQMIRCASCGNIVWQPEGDFFGRDSRRTGS-- 227 Query: 49 KKSQPDVIDVEFEEK 5 +KS+P++IDVEFEEK Sbjct: 228 RKSEPEIIDVEFEEK 242 >ref|XP_010110458.1| hypothetical protein L484_001857 [Morus notabilis] gi|587939822|gb|EXC26456.1| hypothetical protein L484_001857 [Morus notabilis] Length = 244 Score = 290 bits (741), Expect = 1e-75 Identities = 147/253 (58%), Positives = 174/253 (68%), Gaps = 18/253 (7%) Frame = -2 Query: 709 MASSLPWHPLIPTKTQRFHHYRKTLKPNFIPIRHAESIRAFRR-SDLDGFAKRVASGEAW 533 MA++L W+P + F+ + AFRR SD DGFAKR+ASGEAW Sbjct: 1 MATTLRWNPQAVALPATARQRKNRPSSAFV-------VLAFRRRSDFDGFAKRMASGEAW 53 Query: 532 RDVWRGANDGFEQVLYETKKTAERINRRYAVSERFSEVTNSAADRAREFDREFGITQRWR 353 RD WRGANDGFE+ L+E +KTAER++R+Y+VS R S SAA RARE DR+ I RWR Sbjct: 54 RDAWRGANDGFERFLFEARKTAERLDRQYSVSHRLSSAARSAAARAREIDRDLEIGSRWR 113 Query: 352 TLSLDFSRNWPRYRRQFNDFLDTPLGRSFATIF-----------------XWVLPFAGPL 224 L++DFSRNWPRYR+Q DFL+TPLGRSFATIF WVLPFAGPL Sbjct: 114 ALTMDFSRNWPRYRKQLADFLETPLGRSFATIFFLWFALSGWLFRFLIFGLWVLPFAGPL 173 Query: 223 LIGAVANNLVIKGECPACRRQFVGYKSQTVRCASCGNIVWQPQGDFFSKGSRGTTSSPKK 44 L+G ANNLVIKG CPAC RQFVG K+Q +RCA CGN VWQP+GD F++G RGT SS K Sbjct: 174 LVGTFANNLVIKGSCPACNRQFVGSKTQMIRCAGCGNTVWQPKGDSFTRGGRGTGSS--K 231 Query: 43 SQPDVIDVEFEEK 5 S P++IDVEFEEK Sbjct: 232 SSPEIIDVEFEEK 244