BLASTX nr result
ID: Perilla23_contig00021974
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Perilla23_contig00021974 (812 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011088659.1| PREDICTED: uncharacterized protein LOC105169... 394 e-107 ref|XP_011088658.1| PREDICTED: uncharacterized protein LOC105169... 394 e-107 ref|XP_011088657.1| PREDICTED: uncharacterized protein LOC105169... 394 e-107 ref|XP_012837224.1| PREDICTED: uncharacterized protein LOC105957... 360 9e-97 ref|XP_010646386.1| PREDICTED: uncharacterized protein LOC100258... 330 6e-88 ref|XP_010646379.1| PREDICTED: uncharacterized protein LOC100258... 330 6e-88 emb|CBI37806.3| unnamed protein product [Vitis vinifera] 330 6e-88 ref|XP_002278562.1| PREDICTED: uncharacterized protein LOC100258... 330 6e-88 ref|XP_002312932.2| hypothetical protein POPTR_0009s14190g [Popu... 328 4e-87 ref|XP_011045505.1| PREDICTED: uncharacterized protein LOC105140... 326 1e-86 ref|XP_007041718.1| RNA polymerase II-associated protein 1, puta... 321 4e-85 ref|XP_012467614.1| PREDICTED: uncharacterized protein LOC105785... 320 6e-85 gb|KHF97960.1| RNA polymerase II-associated 1 [Gossypium arboreu... 319 1e-84 ref|XP_009347860.1| PREDICTED: uncharacterized protein LOC103939... 318 2e-84 ref|XP_006364516.1| PREDICTED: uncharacterized protein LOC102599... 318 2e-84 gb|KJB15887.1| hypothetical protein B456_002G201600 [Gossypium r... 315 2e-83 ref|XP_004231458.1| PREDICTED: uncharacterized protein LOC101256... 315 2e-83 emb|CDP17654.1| unnamed protein product [Coffea canephora] 315 3e-83 ref|XP_012074496.1| PREDICTED: uncharacterized protein LOC105635... 314 5e-83 emb|CAN83259.1| hypothetical protein VITISV_032134 [Vitis vinifera] 311 4e-82 >ref|XP_011088659.1| PREDICTED: uncharacterized protein LOC105169823 isoform X3 [Sesamum indicum] Length = 1509 Score = 394 bits (1012), Expect = e-107 Identities = 194/268 (72%), Positives = 229/268 (85%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ + L SEFCSPVKCVPVVWKLHAMSV LLSGMG+LE+ KSRDVYETLQNVYGE LD + Sbjct: 1226 AITSVLGSEFCSPVKCVPVVWKLHAMSVILLSGMGILENGKSRDVYETLQNVYGETLDGR 1285 Query: 632 RCSDVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRSIEAS 453 ++HGN V SL+F+++IHENYSTFIETLVEQFAAESYGD++FGRQVA+YLHRS+EAS Sbjct: 1286 EVVNLHGNLSVESLQFESEIHENYSTFIETLVEQFAAESYGDILFGRQVAIYLHRSVEAS 1345 Query: 452 VRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDKAANR 273 VRL+TWN LSNAR LELLPPL +C +A GYL+P+ED+E IL+AY KSW SGALDKAA R Sbjct: 1346 VRLSTWNALSNARALELLPPLAECCIQAKGYLKPIEDDERILDAYVKSWVSGALDKAAKR 1405 Query: 272 SSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMVRLICYKKPDI 93 SS+AFSLVLHHLSSFIF NV+ D ++LR+KLAKSLLRDYSRKQQHEGM+V+LICY+KP++ Sbjct: 1406 SSMAFSLVLHHLSSFIFSNVAGDMLALRSKLAKSLLRDYSRKQQHEGMLVKLICYEKPNM 1465 Query: 92 DXXXXXXXSLPMSEIEKRLQLLREICVG 9 S++EKRLQLL+EIC G Sbjct: 1466 SLQ-------TWSQVEKRLQLLKEICEG 1486 >ref|XP_011088658.1| PREDICTED: uncharacterized protein LOC105169823 isoform X2 [Sesamum indicum] Length = 1611 Score = 394 bits (1012), Expect = e-107 Identities = 194/268 (72%), Positives = 229/268 (85%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ + L SEFCSPVKCVPVVWKLHAMSV LLSGMG+LE+ KSRDVYETLQNVYGE LD + Sbjct: 1328 AITSVLGSEFCSPVKCVPVVWKLHAMSVILLSGMGILENGKSRDVYETLQNVYGETLDGR 1387 Query: 632 RCSDVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRSIEAS 453 ++HGN V SL+F+++IHENYSTFIETLVEQFAAESYGD++FGRQVA+YLHRS+EAS Sbjct: 1388 EVVNLHGNLSVESLQFESEIHENYSTFIETLVEQFAAESYGDILFGRQVAIYLHRSVEAS 1447 Query: 452 VRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDKAANR 273 VRL+TWN LSNAR LELLPPL +C +A GYL+P+ED+E IL+AY KSW SGALDKAA R Sbjct: 1448 VRLSTWNALSNARALELLPPLAECCIQAKGYLKPIEDDERILDAYVKSWVSGALDKAAKR 1507 Query: 272 SSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMVRLICYKKPDI 93 SS+AFSLVLHHLSSFIF NV+ D ++LR+KLAKSLLRDYSRKQQHEGM+V+LICY+KP++ Sbjct: 1508 SSMAFSLVLHHLSSFIFSNVAGDMLALRSKLAKSLLRDYSRKQQHEGMLVKLICYEKPNM 1567 Query: 92 DXXXXXXXSLPMSEIEKRLQLLREICVG 9 S++EKRLQLL+EIC G Sbjct: 1568 SLQ-------TWSQVEKRLQLLKEICEG 1588 >ref|XP_011088657.1| PREDICTED: uncharacterized protein LOC105169823 isoform X1 [Sesamum indicum] Length = 1612 Score = 394 bits (1012), Expect = e-107 Identities = 194/268 (72%), Positives = 229/268 (85%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ + L SEFCSPVKCVPVVWKLHAMSV LLSGMG+LE+ KSRDVYETLQNVYGE LD + Sbjct: 1329 AITSVLGSEFCSPVKCVPVVWKLHAMSVILLSGMGILENGKSRDVYETLQNVYGETLDGR 1388 Query: 632 RCSDVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRSIEAS 453 ++HGN V SL+F+++IHENYSTFIETLVEQFAAESYGD++FGRQVA+YLHRS+EAS Sbjct: 1389 EVVNLHGNLSVESLQFESEIHENYSTFIETLVEQFAAESYGDILFGRQVAIYLHRSVEAS 1448 Query: 452 VRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDKAANR 273 VRL+TWN LSNAR LELLPPL +C +A GYL+P+ED+E IL+AY KSW SGALDKAA R Sbjct: 1449 VRLSTWNALSNARALELLPPLAECCIQAKGYLKPIEDDERILDAYVKSWVSGALDKAAKR 1508 Query: 272 SSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMVRLICYKKPDI 93 SS+AFSLVLHHLSSFIF NV+ D ++LR+KLAKSLLRDYSRKQQHEGM+V+LICY+KP++ Sbjct: 1509 SSMAFSLVLHHLSSFIFSNVAGDMLALRSKLAKSLLRDYSRKQQHEGMLVKLICYEKPNM 1568 Query: 92 DXXXXXXXSLPMSEIEKRLQLLREICVG 9 S++EKRLQLL+EIC G Sbjct: 1569 SLQ-------TWSQVEKRLQLLKEICEG 1589 >ref|XP_012837224.1| PREDICTED: uncharacterized protein LOC105957806 [Erythranthe guttatus] gi|604333647|gb|EYU37998.1| hypothetical protein MIMGU_mgv1a000182mg [Erythranthe guttata] Length = 1485 Score = 360 bits (923), Expect = 9e-97 Identities = 187/270 (69%), Positives = 221/270 (81%), Gaps = 2/270 (0%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+P L+SEF SPV+CV VVWKLHA+SV LLSGMGVLEDEKSRDVYETLQN+YG+++DEK Sbjct: 1217 AIPASLTSEFFSPVECVTVVWKLHAISVVLLSGMGVLEDEKSRDVYETLQNIYGKIIDEK 1276 Query: 632 RCSDVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRSIEAS 453 ++H SL+F+++IH+NY TFIETLVEQFAAESYGDV+FGRQ+AMYLHRS+EAS Sbjct: 1277 ---ELH-----KSLQFESEIHKNYPTFIETLVEQFAAESYGDVLFGRQIAMYLHRSVEAS 1328 Query: 452 VRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDKAANR 273 VRLA WN LSNAR LELLP L KCF+KA+GYLEP+ED+E ILEAY KSW GALD+AA R Sbjct: 1329 VRLAAWNGLSNARALELLPTLDKCFSKAEGYLEPIEDDEKILEAYVKSWVGGALDRAAKR 1388 Query: 272 SSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRK-QQHEGMMVRLIC-YKKP 99 +S++FSLVLHHLS FIF +V D +SLRNKL KSLLRDYSRK QQHEGM+V+L+C Y K Sbjct: 1389 NSMSFSLVLHHLSWFIFGDVVGDMLSLRNKLVKSLLRDYSRKQQQHEGMLVKLVCYYNKS 1448 Query: 98 DIDXXXXXXXSLPMSEIEKRLQLLREICVG 9 D D EIE+RLQLL++IC G Sbjct: 1449 DRDY-----------EIERRLQLLKQICDG 1467 >ref|XP_010646386.1| PREDICTED: uncharacterized protein LOC100258889 isoform X3 [Vitis vinifera] Length = 1524 Score = 330 bits (847), Expect = 6e-88 Identities = 174/280 (62%), Positives = 212/280 (75%), Gaps = 10/280 (3%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ +FLSS+ SPV+ VPV+WKLH++SVTLL GM VLE++KSRDVYE LQ +YG++LDE Sbjct: 1228 AMSSFLSSDVPSPVRSVPVIWKLHSLSVTLLDGMSVLEEKKSRDVYEALQELYGQLLDES 1287 Query: 632 RCSDVHGNT----------GVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVA 483 R VH +T + L+F++DIHE+YSTFIETLVEQFAA SYGD+I+GRQVA Sbjct: 1288 R---VHRSTKPTPETGEKNSIEFLRFQSDIHESYSTFIETLVEQFAAISYGDLIYGRQVA 1344 Query: 482 MYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWA 303 +YLHRS+EA VRLA WN LSNARVLELLPPL KC A A+GYLEPVE+NE ILEAY KSW Sbjct: 1345 IYLHRSVEAPVRLAAWNALSNARVLELLPPLEKCSADAEGYLEPVENNEGILEAYVKSWV 1404 Query: 302 SGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMV 123 +GALD+AA R SV F+LVLHHLSS IF++ + +SLRNKLAKSLLRDYSRK+QHEG+M+ Sbjct: 1405 TGALDRAATRGSVTFTLVLHHLSSVIFEDDADVKLSLRNKLAKSLLRDYSRKRQHEGLML 1464 Query: 122 RLICYKKPDIDXXXXXXXSLPMSEIEKRLQLLREICVGCA 3 +L+ Y K + E EKR + L E C G A Sbjct: 1465 QLLRYNK---QFASPQPEWMKEGETEKRFRFLTEACEGNA 1501 >ref|XP_010646379.1| PREDICTED: uncharacterized protein LOC100258889 isoform X1 [Vitis vinifera] Length = 1608 Score = 330 bits (847), Expect = 6e-88 Identities = 174/280 (62%), Positives = 212/280 (75%), Gaps = 10/280 (3%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ +FLSS+ SPV+ VPV+WKLH++SVTLL GM VLE++KSRDVYE LQ +YG++LDE Sbjct: 1312 AMSSFLSSDVPSPVRSVPVIWKLHSLSVTLLDGMSVLEEKKSRDVYEALQELYGQLLDES 1371 Query: 632 RCSDVHGNT----------GVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVA 483 R VH +T + L+F++DIHE+YSTFIETLVEQFAA SYGD+I+GRQVA Sbjct: 1372 R---VHRSTKPTPETGEKNSIEFLRFQSDIHESYSTFIETLVEQFAAISYGDLIYGRQVA 1428 Query: 482 MYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWA 303 +YLHRS+EA VRLA WN LSNARVLELLPPL KC A A+GYLEPVE+NE ILEAY KSW Sbjct: 1429 IYLHRSVEAPVRLAAWNALSNARVLELLPPLEKCSADAEGYLEPVENNEGILEAYVKSWV 1488 Query: 302 SGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMV 123 +GALD+AA R SV F+LVLHHLSS IF++ + +SLRNKLAKSLLRDYSRK+QHEG+M+ Sbjct: 1489 TGALDRAATRGSVTFTLVLHHLSSVIFEDDADVKLSLRNKLAKSLLRDYSRKRQHEGLML 1548 Query: 122 RLICYKKPDIDXXXXXXXSLPMSEIEKRLQLLREICVGCA 3 +L+ Y K + E EKR + L E C G A Sbjct: 1549 QLLRYNK---QFASPQPEWMKEGETEKRFRFLTEACEGNA 1585 >emb|CBI37806.3| unnamed protein product [Vitis vinifera] Length = 1505 Score = 330 bits (847), Expect = 6e-88 Identities = 174/280 (62%), Positives = 212/280 (75%), Gaps = 10/280 (3%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ +FLSS+ SPV+ VPV+WKLH++SVTLL GM VLE++KSRDVYE LQ +YG++LDE Sbjct: 1215 AMSSFLSSDVPSPVRSVPVIWKLHSLSVTLLDGMSVLEEKKSRDVYEALQELYGQLLDES 1274 Query: 632 RCSDVHGNT----------GVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVA 483 R VH +T + L+F++DIHE+YSTFIETLVEQFAA SYGD+I+GRQVA Sbjct: 1275 R---VHRSTKPTPETGEKNSIEFLRFQSDIHESYSTFIETLVEQFAAISYGDLIYGRQVA 1331 Query: 482 MYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWA 303 +YLHRS+EA VRLA WN LSNARVLELLPPL KC A A+GYLEPVE+NE ILEAY KSW Sbjct: 1332 IYLHRSVEAPVRLAAWNALSNARVLELLPPLEKCSADAEGYLEPVENNEGILEAYVKSWV 1391 Query: 302 SGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMV 123 +GALD+AA R SV F+LVLHHLSS IF++ + +SLRNKLAKSLLRDYSRK+QHEG+M+ Sbjct: 1392 TGALDRAATRGSVTFTLVLHHLSSVIFEDDADVKLSLRNKLAKSLLRDYSRKRQHEGLML 1451 Query: 122 RLICYKKPDIDXXXXXXXSLPMSEIEKRLQLLREICVGCA 3 +L+ Y K + E EKR + L E C G A Sbjct: 1452 QLLRYNK---QFASPQPEWMKEGETEKRFRFLTEACEGNA 1488 >ref|XP_002278562.1| PREDICTED: uncharacterized protein LOC100258889 isoform X2 [Vitis vinifera] Length = 1602 Score = 330 bits (847), Expect = 6e-88 Identities = 174/280 (62%), Positives = 212/280 (75%), Gaps = 10/280 (3%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ +FLSS+ SPV+ VPV+WKLH++SVTLL GM VLE++KSRDVYE LQ +YG++LDE Sbjct: 1312 AMSSFLSSDVPSPVRSVPVIWKLHSLSVTLLDGMSVLEEKKSRDVYEALQELYGQLLDES 1371 Query: 632 RCSDVHGNT----------GVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVA 483 R VH +T + L+F++DIHE+YSTFIETLVEQFAA SYGD+I+GRQVA Sbjct: 1372 R---VHRSTKPTPETGEKNSIEFLRFQSDIHESYSTFIETLVEQFAAISYGDLIYGRQVA 1428 Query: 482 MYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWA 303 +YLHRS+EA VRLA WN LSNARVLELLPPL KC A A+GYLEPVE+NE ILEAY KSW Sbjct: 1429 IYLHRSVEAPVRLAAWNALSNARVLELLPPLEKCSADAEGYLEPVENNEGILEAYVKSWV 1488 Query: 302 SGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMV 123 +GALD+AA R SV F+LVLHHLSS IF++ + +SLRNKLAKSLLRDYSRK+QHEG+M+ Sbjct: 1489 TGALDRAATRGSVTFTLVLHHLSSVIFEDDADVKLSLRNKLAKSLLRDYSRKRQHEGLML 1548 Query: 122 RLICYKKPDIDXXXXXXXSLPMSEIEKRLQLLREICVGCA 3 +L+ Y K + E EKR + L E C G A Sbjct: 1549 QLLRYNK---QFASPQPEWMKEGETEKRFRFLTEACEGNA 1585 >ref|XP_002312932.2| hypothetical protein POPTR_0009s14190g [Populus trichocarpa] gi|550331699|gb|EEE86887.2| hypothetical protein POPTR_0009s14190g [Populus trichocarpa] Length = 1530 Score = 328 bits (840), Expect = 4e-87 Identities = 166/267 (62%), Positives = 209/267 (78%), Gaps = 2/267 (0%) Frame = -1 Query: 809 LPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEKR 630 + +FL ++ SPV+ P++WKLH++SV LLSGMGVLED+KSRDVYE LQN+YG++LDE R Sbjct: 1249 MSSFLPTDAPSPVRFTPLIWKLHSLSVMLLSGMGVLEDDKSRDVYEALQNLYGQLLDESR 1308 Query: 629 CSDVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRSIEASV 450 + L+F+++IHE+YSTF+ETLVEQFA+ SYGD+IFGRQVA+YLHR E V Sbjct: 1309 ----------SFLRFQSEIHESYSTFLETLVEQFASISYGDIIFGRQVAVYLHRCTETPV 1358 Query: 449 RLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDKAANRS 270 RLA WN L+NA VLE+LPPL KCFA+A+GYLEPVEDNE ILEAY K+W SGALD+AA R Sbjct: 1359 RLAAWNGLANAHVLEILPPLEKCFAEAEGYLEPVEDNEGILEAYVKAWVSGALDRAATRG 1418 Query: 269 SVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMVRLICYKKPDID 90 S+AF+LVLHHLSSFIF + D ++LRNKLAKSLLRDYS+KQ+HEG+M+ L+CY K Sbjct: 1419 SMAFTLVLHHLSSFIFLFHANDKITLRNKLAKSLLRDYSKKQRHEGIMLELVCYYKLS-S 1477 Query: 89 XXXXXXXSLPM--SEIEKRLQLLREIC 15 LP+ S+IEKR ++L E C Sbjct: 1478 RLPEKQEGLPLQASDIEKRFEVLVEAC 1504 >ref|XP_011045505.1| PREDICTED: uncharacterized protein LOC105140391 [Populus euphratica] gi|743792825|ref|XP_011045512.1| PREDICTED: uncharacterized protein LOC105140391 [Populus euphratica] gi|743792828|ref|XP_011045519.1| PREDICTED: uncharacterized protein LOC105140391 [Populus euphratica] gi|743792831|ref|XP_011045525.1| PREDICTED: uncharacterized protein LOC105140391 [Populus euphratica] Length = 1581 Score = 326 bits (835), Expect = 1e-86 Identities = 170/282 (60%), Positives = 213/282 (75%), Gaps = 17/282 (6%) Frame = -1 Query: 809 LPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEKR 630 + +FL ++ SPV+ P++WKLH++SV LLSGMGVLED+KSRDVYE LQN+YG++LDE R Sbjct: 1274 MSSFLPTDAPSPVRFTPLIWKLHSLSVILLSGMGVLEDDKSRDVYEALQNLYGQLLDESR 1333 Query: 629 CS-----------DVHGNTGVAS----LKFKTDIHENYSTFIETLVEQFAAESYGDVIFG 495 +V TG S L+F+++IHE+YSTF+ETLVEQFA+ SYGD+IFG Sbjct: 1334 SVRSAEHFLEDNVNVLPETGKKSASEFLRFQSEIHESYSTFLETLVEQFASISYGDIIFG 1393 Query: 494 RQVAMYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYG 315 RQVA+YLHR E VRLA WN L+NARVLE+LPPL KCFA+A+GYLEPVEDNE ILEAY Sbjct: 1394 RQVAVYLHRCTETPVRLAAWNGLTNARVLEILPPLEKCFAEAEGYLEPVEDNEGILEAYV 1453 Query: 314 KSWASGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHE 135 K+W SGALD+AA R S+AF+LVLHHLSSFIF + D ++LRNKLAKSLLRDYS+KQ+HE Sbjct: 1454 KAWVSGALDRAATRGSMAFTLVLHHLSSFIFLFHANDKITLRNKLAKSLLRDYSKKQRHE 1513 Query: 134 GMMVRLICYKKPDIDXXXXXXXSLPM--SEIEKRLQLLREIC 15 G+M+ L+ Y K LP+ S+IEKR ++L E C Sbjct: 1514 GIMLELVRYYKLSSRLPEMQEGGLPLQASDIEKRFEVLVEAC 1555 >ref|XP_007041718.1| RNA polymerase II-associated protein 1, putative [Theobroma cacao] gi|508705653|gb|EOX97549.1| RNA polymerase II-associated protein 1, putative [Theobroma cacao] Length = 1625 Score = 321 bits (822), Expect = 4e-85 Identities = 162/285 (56%), Positives = 212/285 (74%), Gaps = 17/285 (5%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDE- 636 A+ TF+S + SPV+ VP++WKLH++S+ LL GM VLE+EKSRDVYE+LQ ++G++LD+ Sbjct: 1326 AMSTFISKDVASPVQSVPLIWKLHSLSIILLIGMAVLEEEKSRDVYESLQEIFGQLLDKT 1385 Query: 635 --KRCSDVHGNTGVASL------------KFKTDIHENYSTFIETLVEQFAAESYGDVIF 498 KR + N ++ L +F+T+IHE+YSTFI+TLVEQ+AA S+GD+I+ Sbjct: 1386 RSKRRPETILNMSISLLPETGKKYDGEFLRFQTEIHESYSTFIDTLVEQYAAVSFGDLIY 1445 Query: 497 GRQVAMYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAY 318 GRQVA+YLHR +EA VRLA WN LSN+RVLELLPPL KC +A+GYLEPVE+NE ILEAY Sbjct: 1446 GRQVAVYLHRCVEAPVRLAAWNALSNSRVLELLPPLQKCLGEAEGYLEPVEENEGILEAY 1505 Query: 317 GKSWASGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQH 138 KSW SGALD+AA R S+AF+LVLHHLSSF+F + ++ + LRNKL KSLLRDYSRK+QH Sbjct: 1506 AKSWVSGALDRAATRGSIAFTLVLHHLSSFVFNSHKSEKLLLRNKLVKSLLRDYSRKKQH 1565 Query: 137 EGMMVRLICYKKPDIDXXXXXXXSLPM--SEIEKRLQLLREICVG 9 EGMM+ I KP L + S +E+RL++L+E C G Sbjct: 1566 EGMMLEFIQNTKPSAILLAEKREGLSLQRSNVEERLEILKEACEG 1610 >ref|XP_012467614.1| PREDICTED: uncharacterized protein LOC105785948 [Gossypium raimondii] gi|763748447|gb|KJB15886.1| hypothetical protein B456_002G201600 [Gossypium raimondii] Length = 1616 Score = 320 bits (821), Expect = 6e-85 Identities = 165/285 (57%), Positives = 208/285 (72%), Gaps = 17/285 (5%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 AL TFLS++ SP++ VPV+WKLH++S+ LL GM VLEDEK+RDVYE+LQ +YG++LDE Sbjct: 1314 ALSTFLSADVVSPIRSVPVIWKLHSLSIILLIGMAVLEDEKTRDVYESLQELYGQLLDEI 1373 Query: 632 RCS---------------DVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIF 498 R + V L+F+++IHE+YSTFI+TLVEQ+AA S+GD+ + Sbjct: 1374 RSKGRSQTISNMSTSLTPETENKINVEFLRFQSEIHESYSTFIDTLVEQYAAVSFGDLTY 1433 Query: 497 GRQVAMYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAY 318 GRQVA+YLHR +EA VRLA WN LSN+ VLELLPPL KC A+A+GYLEPVE+NE+ILEAY Sbjct: 1434 GRQVAIYLHRCVEAPVRLAAWNALSNSHVLELLPPLQKCLAEAEGYLEPVEENEAILEAY 1493 Query: 317 GKSWASGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQH 138 KSW SGALDKAA R SVAF+LVLHHLS+F+F + + LRNKL KSLLRDY+RK+QH Sbjct: 1494 VKSWVSGALDKAATRGSVAFTLVLHHLSTFVFISHKSYKPLLRNKLVKSLLRDYARKKQH 1553 Query: 137 EGMMVRLICYKKPDIDXXXXXXXSLPM--SEIEKRLQLLREICVG 9 EGMM++ I Y KP L M S +E RL+ L+E C G Sbjct: 1554 EGMMLQFIEYTKPSSVTKAEKEEGLTMESSNVEGRLERLKEACEG 1598 >gb|KHF97960.1| RNA polymerase II-associated 1 [Gossypium arboreum] gi|728815575|gb|KHG01884.1| RNA polymerase II-associated 1 [Gossypium arboreum] Length = 1616 Score = 319 bits (818), Expect = 1e-84 Identities = 165/285 (57%), Positives = 206/285 (72%), Gaps = 17/285 (5%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 AL TFLS++ SP+ VPV+WKLH++S+ LL GM VLEDEK+RDVYE+LQ +YG++LDE Sbjct: 1314 ALSTFLSADVVSPIWSVPVIWKLHSLSIILLIGMAVLEDEKTRDVYESLQELYGQLLDEI 1373 Query: 632 RCS---------------DVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIF 498 R + V L+F+++IHE+YSTFI+TLVEQ+AA S+GD+ + Sbjct: 1374 RSKGRSQTISNMSTSLTPETENKINVEFLRFQSEIHESYSTFIDTLVEQYAAVSFGDLTY 1433 Query: 497 GRQVAMYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAY 318 GRQVA+YLHR +EA VRLA WN LSN+ VLELLPPL KC +A+GYLEPVE+NE+ILEAY Sbjct: 1434 GRQVAIYLHRCVEAPVRLAAWNALSNSHVLELLPPLQKCLGEAEGYLEPVEENEAILEAY 1493 Query: 317 GKSWASGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQH 138 KSW SGALDKAA R SVAF+LVLHHLSSF+F + +D LRNKL KSLLRD +RK+QH Sbjct: 1494 VKSWVSGALDKAATRGSVAFTLVLHHLSSFVFSSHKSDKPLLRNKLVKSLLRDNARKKQH 1553 Query: 137 EGMMVRLICYKKPDIDXXXXXXXSLPM--SEIEKRLQLLREICVG 9 EGMM++ I Y KP L M S +E RL+ L+E C G Sbjct: 1554 EGMMLQFIEYMKPSSVTKAEKEEGLTMESSNVEGRLERLKEACEG 1598 >ref|XP_009347860.1| PREDICTED: uncharacterized protein LOC103939489 [Pyrus x bretschneideri] Length = 1543 Score = 318 bits (816), Expect = 2e-84 Identities = 160/271 (59%), Positives = 210/271 (77%), Gaps = 5/271 (1%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 AL +FL S+ SPVK V +VWKLH++SV LL GMGV+E+EKSR V+E LQ++YG +L + Sbjct: 1251 ALSSFLPSDIPSPVKSVSLVWKLHSLSVILLVGMGVVEEEKSRVVFEALQDLYGNLLHQS 1310 Query: 632 RCSDV---HGN-TGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRS 465 R S++ H N + L F++++HE+YS FIETLV+QF+A SYGD+I+GRQVA+YLHR Sbjct: 1311 RLSNLMPEHRNENNLEVLAFQSEVHESYSVFIETLVDQFSAISYGDLIYGRQVAVYLHRC 1370 Query: 464 IEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDK 285 +EA VRLA WNTL+N+RVLELLPPL KCF A+GYLEP EDN ILEAY KSW SGALD+ Sbjct: 1371 VEAPVRLAAWNTLTNSRVLELLPPLEKCFTDAEGYLEPAEDNPDILEAYVKSWTSGALDR 1430 Query: 284 AANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMVRLICYK 105 AA+R S+A++LV+HHLS+FIF + + D + LRNKL++SLLRD+S KQQHE MM+ LI Y Sbjct: 1431 AASRGSIAYTLVIHHLSAFIFSSYTGDKLLLRNKLSRSLLRDFSLKQQHEAMMLNLIQYN 1490 Query: 104 KPDIDXXXXXXXSLPM-SEIEKRLQLLREIC 15 K I +P+ +++EKRL+LL+E C Sbjct: 1491 KASISHETKREDGVPVGNDVEKRLELLKETC 1521 >ref|XP_006364516.1| PREDICTED: uncharacterized protein LOC102599570 [Solanum tuberosum] Length = 1559 Score = 318 bits (816), Expect = 2e-84 Identities = 161/269 (59%), Positives = 197/269 (73%), Gaps = 2/269 (0%) Frame = -1 Query: 809 LPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEKR 630 + TFL +E +PV+ VPVVWKLHA+S TLLSGM + E++ SRD+Y+ LQ+VYG++LD + Sbjct: 1274 MSTFLPAELQTPVRNVPVVWKLHALSATLLSGMSIFEEDNSRDLYKALQDVYGQLLDREE 1333 Query: 629 CSDVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRSIEASV 450 SLKFKTDIHENYSTFI+ LVEQFAA SYGD+IFGRQV +YLH+ +EA V Sbjct: 1334 ------KVNAKSLKFKTDIHENYSTFIDNLVEQFAAVSYGDMIFGRQVGVYLHQFVEAPV 1387 Query: 449 RLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDKAANRS 270 RLA WN LSNA LELLPPL KC A GYLEPVED+E ILEAY KSW SGALDKAA R Sbjct: 1388 RLAAWNALSNACALELLPPLEKCIAATYGYLEPVEDDERILEAYCKSWVSGALDKAARRG 1447 Query: 269 SVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMVRLICYKKPDID 90 S +F+L LHHLSSFIF+ S + + LRNKL KSLLRDYSRK+QHE + + L+ Y++PD Sbjct: 1448 SASFTLALHHLSSFIFQICSGNMIPLRNKLVKSLLRDYSRKKQHEVLFINLLEYQRPDTR 1507 Query: 89 XXXXXXXSLPMS--EIEKRLQLLREICVG 9 +P+ ++ RLQ+L E C G Sbjct: 1508 SEPFHKECMPLQSCDVVNRLQILNEACEG 1536 >gb|KJB15887.1| hypothetical protein B456_002G201600 [Gossypium raimondii] Length = 1615 Score = 315 bits (807), Expect = 2e-83 Identities = 165/285 (57%), Positives = 207/285 (72%), Gaps = 17/285 (5%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 AL TFLS++ SP++ VPV+WKLH++S+ LL GM VLEDEK+RDVYE+LQ +YG++LDE Sbjct: 1314 ALSTFLSADVVSPIRSVPVIWKLHSLSIILLIGMAVLEDEKTRDVYESLQELYGQLLDEI 1373 Query: 632 RCS---------------DVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIF 498 R + V L+F+++IHE+YSTFI+TLVEQ+AA S+GD+ + Sbjct: 1374 RSKGRSQTISNMSTSLTPETENKINVEFLRFQSEIHESYSTFIDTLVEQYAAVSFGDLTY 1433 Query: 497 GRQVAMYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAY 318 GRQVA+YLHR +EA VRLA WN LSN+ VLELLPPL KC A+A+GYLEPVE NE+ILEAY Sbjct: 1434 GRQVAIYLHRCVEAPVRLAAWNALSNSHVLELLPPLQKCLAEAEGYLEPVE-NEAILEAY 1492 Query: 317 GKSWASGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQH 138 KSW SGALDKAA R SVAF+LVLHHLS+F+F + + LRNKL KSLLRDY+RK+QH Sbjct: 1493 VKSWVSGALDKAATRGSVAFTLVLHHLSTFVFISHKSYKPLLRNKLVKSLLRDYARKKQH 1552 Query: 137 EGMMVRLICYKKPDIDXXXXXXXSLPM--SEIEKRLQLLREICVG 9 EGMM++ I Y KP L M S +E RL+ L+E C G Sbjct: 1553 EGMMLQFIEYTKPSSVTKAEKEEGLTMESSNVEGRLERLKEACEG 1597 >ref|XP_004231458.1| PREDICTED: uncharacterized protein LOC101256927 [Solanum lycopersicum] Length = 1556 Score = 315 bits (807), Expect = 2e-83 Identities = 156/269 (57%), Positives = 197/269 (73%), Gaps = 2/269 (0%) Frame = -1 Query: 809 LPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEKR 630 + T L +E +PV+ VP+VWKLHA+S TLLSGM + E++ SRD+Y+ LQ++YG++LD + Sbjct: 1271 MSTSLPAELQTPVRNVPIVWKLHALSATLLSGMSIFEEDNSRDLYKALQDIYGQLLDREE 1330 Query: 629 CSDVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRSIEASV 450 SLKFKTDIHENYSTFI+ LVEQFAA SYGD+IFGRQV +YLH+ +EA V Sbjct: 1331 ------KVNAKSLKFKTDIHENYSTFIDNLVEQFAAVSYGDMIFGRQVGVYLHQFVEAPV 1384 Query: 449 RLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDKAANRS 270 RLA WN LSNA LELLPPL KC A +GY EPVED+E +LEAY KSW SGALDKAA R Sbjct: 1385 RLAAWNALSNACALELLPPLEKCIAATNGYFEPVEDDERMLEAYCKSWVSGALDKAARRG 1444 Query: 269 SVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMVRLICYKKPDID 90 S +F+L LHHLSSFIF++ S + + LRNKL KSLLRDYSRK+QHE + + L+ Y++PD Sbjct: 1445 SASFTLALHHLSSFIFQSCSGNMIPLRNKLVKSLLRDYSRKKQHEVLFINLLEYQRPDTR 1504 Query: 89 XXXXXXXSLPMS--EIEKRLQLLREICVG 9 +P+ + RLQ+L+E C G Sbjct: 1505 PEPFHKGCMPLQSCNVVNRLQILKEACEG 1533 >emb|CDP17654.1| unnamed protein product [Coffea canephora] Length = 1525 Score = 315 bits (806), Expect = 3e-83 Identities = 160/268 (59%), Positives = 200/268 (74%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A FLS+E S V V V WKLHA+SV L+ G GVLEDEKSRDVY+TLQ+VYG+ +D++ Sbjct: 1236 ATSAFLSTESYSSVHNVSVTWKLHALSVILIDGTGVLEDEKSRDVYQTLQSVYGQTVDKR 1295 Query: 632 RCSDVHGNTGVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVAMYLHRSIEAS 453 R S+ L+F+ +I+E+YSTF+E LVEQFAA SYGD++FGRQ+A+YLHR +EA Sbjct: 1296 RLSEAGDKINGGLLQFQLEINESYSTFLEMLVEQFAAVSYGDLVFGRQIAVYLHRWVEAP 1355 Query: 452 VRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWASGALDKAANR 273 VRLATWN LSNA LELLPPL +CFA+ADGYLEPVED+E +LEAY KSW SG LDKAA R Sbjct: 1356 VRLATWNALSNAHALELLPPLEQCFAEADGYLEPVEDDEKLLEAYVKSWVSGVLDKAATR 1415 Query: 272 SSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHEGMMVRLICYKKPDI 93 S ++ LVLHHL+SFIF N D +SLRN+L KSLLRD+SRK H+GMM+ L+ Y+KP Sbjct: 1416 RSSSYILVLHHLTSFIFGNGIGDKLSLRNQLVKSLLRDFSRKVNHQGMMMNLLQYEKP-- 1473 Query: 92 DXXXXXXXSLPMSEIEKRLQLLREICVG 9 + ++EKRL +LR+ C G Sbjct: 1474 -TTGSKRGLVEAWQVEKRLVVLRDACGG 1500 >ref|XP_012074496.1| PREDICTED: uncharacterized protein LOC105635957 [Jatropha curcas] gi|643727630|gb|KDP36000.1| hypothetical protein JCGZ_08395 [Jatropha curcas] Length = 1639 Score = 314 bits (804), Expect = 5e-83 Identities = 168/281 (59%), Positives = 209/281 (74%), Gaps = 15/281 (5%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ TFLSS+ SP++ VP+VWKLH++SV LL GM VL+D +SRDVYE LQ++YG++LDE Sbjct: 1345 AMSTFLSSDVHSPIRYVPLVWKLHSLSVILLVGMDVLDDNRSRDVYEALQDIYGQLLDEA 1404 Query: 632 RC--SDVH---GNTGVAS----------LKFKTDIHENYSTFIETLVEQFAAESYGDVIF 498 R S VH GN + S LKF+++I E+YSTF+ETLVEQF+A SYGD IF Sbjct: 1405 RYTKSAVHILDGNVNLLSETEKRNMPYFLKFQSEIQESYSTFLETLVEQFSAVSYGDFIF 1464 Query: 497 GRQVAMYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAY 318 GRQVA+YLHRS E++VRL+ WN LSNARVLE+LPPL KC A+A+GYLEP+EDNE+ILEAY Sbjct: 1465 GRQVAVYLHRSTESAVRLSAWNLLSNARVLEILPPLDKCIAEAEGYLEPIEDNEAILEAY 1524 Query: 317 GKSWASGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQH 138 KSW SGALD++A R S+A+SLVLHHLS FIF D +SLRNKL KSLLRDYS+KQ+ Sbjct: 1525 MKSWVSGALDRSAVRGSMAYSLVLHHLSFFIFFVGCHDKISLRNKLVKSLLRDYSQKQKR 1584 Query: 137 EGMMVRLICYKKPDIDXXXXXXXSLPMSEIEKRLQLLREIC 15 EGMM+ L+ Y KP + IEKR ++L E C Sbjct: 1585 EGMMLDLVQYPKPHPYN----------NNIEKRFEVLAEAC 1615 >emb|CAN83259.1| hypothetical protein VITISV_032134 [Vitis vinifera] Length = 1444 Score = 311 bits (797), Expect = 4e-82 Identities = 160/236 (67%), Positives = 191/236 (80%), Gaps = 10/236 (4%) Frame = -1 Query: 812 ALPTFLSSEFCSPVKCVPVVWKLHAMSVTLLSGMGVLEDEKSRDVYETLQNVYGEVLDEK 633 A+ +FLSS+ SPV+ VPV+WKLH++SVTLL GM VLE+ KSRDVYE LQ +YG++LDE Sbjct: 1188 AMSSFLSSDVPSPVRSVPVIWKLHSLSVTLLDGMSVLEEXKSRDVYEALQELYGQLLDES 1247 Query: 632 RCSDVHGNT----------GVASLKFKTDIHENYSTFIETLVEQFAAESYGDVIFGRQVA 483 R VH +T + L+F++DIHE+YSTFIETLVEQFAA SYGD+I+GRQVA Sbjct: 1248 R---VHRSTKPXPETGEKNSIEFLRFQSDIHESYSTFIETLVEQFAAISYGDLIYGRQVA 1304 Query: 482 MYLHRSIEASVRLATWNTLSNARVLELLPPLHKCFAKADGYLEPVEDNESILEAYGKSWA 303 +YLHRS+EA VRLA WN LSNARVLELLPPL KC A A+GYLEPVE+NE ILEAY KSW Sbjct: 1305 IYLHRSVEAPVRLAAWNALSNARVLELLPPLEKCSADAEGYLEPVENNEGILEAYVKSWV 1364 Query: 302 SGALDKAANRSSVAFSLVLHHLSSFIFKNVSADAVSLRNKLAKSLLRDYSRKQQHE 135 +GALD+AA R SV F+LVLHHLSS IF++ + +SLRNKLAKSLLRDYSRK+QHE Sbjct: 1365 TGALDRAATRGSVTFTLVLHHLSSVIFEDDADVKLSLRNKLAKSLLRDYSRKRQHE 1420