BLASTX nr result
ID: Cinnamomum25_contig00006029
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum25_contig00006029 (798 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002280625.1| PREDICTED: putative RNA polymerase II subuni...   212   2e-52
emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   211   3e-52
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   204   5e-50
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...   202   3e-49
ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobr...   202   3e-49
ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c...   202   3e-49
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...   202   3e-49
ref|XP_012072543.1| PREDICTED: putative RNA polymerase II subuni...   201   4e-49
ref|XP_009389521.1| PREDICTED: putative RNA polymerase II subuni...   194   4e-47
ref|XP_012479689.1| PREDICTED: putative RNA polymerase II subuni...   193   1e-46
ref|XP_012479683.1| PREDICTED: putative RNA polymerase II subuni...   193   1e-46
gb|KHG00854.1| hypothetical protein F383_23706 [Gossypium arboreum]   189   1e-45
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...   187   5e-45
ref|XP_007145767.1| hypothetical protein PHAVU_007G265900g [Phas...   185   3e-44
gb|KDO45360.1| hypothetical protein CISIN_1g0087651mg [Citrus si...   184   7e-44
gb|KDO45358.1| hypothetical protein CISIN_1g0087651mg, partial [...   184   7e-44
ref|XP_011087531.1| PREDICTED: putative RNA polymerase II subuni...   182   2e-43
ref|XP_011087530.1| PREDICTED: putative RNA polymerase II subuni...
182 2e-43 ref|XP_011087529.1| PREDICTED: putative RNA polymerase II subuni... 182 2e-43 ref|XP_010921353.1| PREDICTED: putative RNA polymerase II subuni... 182 2e-43 >ref|XP_002280625.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Vitis vinifera] gi|731415977|ref|XP_010659731.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Vitis vinifera] gi|731415979|ref|XP_010659732.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 212 bits (540), Expect = 2e-52 Identities = 129/259 (49%), Positives = 163/259 (62%), Gaps = 10/259 (3%) Frame = -2 Query: 749 IGDQLSASEASMGPQKNDSKPKANRKSKG-------KDIVEKAGKQSETQSRSALSKGPQ 591 IGDQLS E S P +NDS+ K R+SKG KD A S + G + Sbjct: 255 IGDQLSMLEKSAPPIQNDSESKL-RESKGRRSRVIFKDEFSTAEVPSVPSQSGSELNGVK 313 Query: 590 GEDSVAAAVVKQNG-TQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKS 414 G++ Q G T+LKS LK SG K ++RSVTWADEK ++ D+ + ++ E K Sbjct: 314 GKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADEKM-DSADSRDFCKVRELEVKK 372 Query: 413 ESIKNXXXXXXXXXXS-LRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILP-PV 240 E + LRFA AEACAIALSQAAEAVASGE D DA +EA I+ILP P Sbjct: 373 EDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDAVSEARIIILPHPR 432 Query: 239 EVDEGNSSELMDVSEPDCQSVKWPRKPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMW 60 ++DEG S + D+ EP+ +KWP KP + +D+FD +DSW+DTPPEGFSLTLSPFATMW Sbjct: 433 DMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSLTLSPFATMW 492 Query: 59 TALFGWVSASSLAYIYGRD 3 ALF W+++SS+AYIYGRD Sbjct: 493 MALFAWITSSSIAYIYGRD 511 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 211 bits (538), Expect = 3e-52 Identities = 128/259 (49%), Positives = 163/259 (62%), Gaps = 10/259 (3%) Frame = -2 Query: 749 IGDQLSASEASMGPQKNDSKPKANRKSKG-------KDIVEKAGKQSETQSRSALSKGPQ 591 IGDQLS E S P +NDS+ K R+SKG KD A S + G + Sbjct: 255 
IGDQLSMLEKSAPPIQNDSESKL-RESKGRRSRVIFKDEFSTAEVPSVPSQSGSELNGVK 313 Query: 590 GEDSVAAAVVKQNG-TQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKS 414 G++ Q G T+ KS+LK SG K + RSVTWADEK ++ D+ + ++ E K Sbjct: 314 GKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADEKM-DSADSRDFCKVRELEVKK 372 Query: 413 ESIKNXXXXXXXXXXS-LRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILP-PV 240 E + LRFA AEACA+ALSQAAEAVASGE D DA +EAGI+ILP P Sbjct: 373 EDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDAVSEAGIIILPHPR 432 Query: 239 EVDEGNSSELMDVSEPDCQSVKWPRKPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMW 60 ++DEG S + D+ EP+ +KWP KP + +D+FD +DSW+DTPPEGFSLTLSPFATMW Sbjct: 433 DMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSLTLSPFATMW 492 Query: 59 TALFGWVSASSLAYIYGRD 3 ALF W+++SS+AYIYGRD Sbjct: 493 MALFAWITSSSIAYIYGRD 511 >ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 204 bits (519), Expect = 5e-50 Identities = 125/289 (43%), Positives = 165/289 (57%), Gaps = 30/289 (10%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASE--------------------------ASMGPQKNDSKPKAN 678 ++T F ST+I D+ S S+ A + + KA+ Sbjct: 212 SDTDFTSTIITNDEYSISKGPSGLTSTASDIKLQAQTGKGHEGLNAQLSSLRKQDSIKAS 271 Query: 677 RKSKG--KDIVEKAGKQSETQSRSALSKGPQGEDSVAAAVVKQNGTQLKSALKSSGVKPL 504 RKSKG K+ V K + S+ + S A N + LK +LKSSG K Sbjct: 272 RKSKGRRKEKVIKEQLNFQDLPSSSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRS 331 Query: 503 SRSVTWADEKKAENIDAGNLFNGQKTEEKSESIK-NXXXXXXXXXXSLRFALAEACAIAL 327 +RSVTWADE+ +N + NL Q+ E+ +ES + + LRF AEACA+AL Sbjct: 332 NRSVTWADER-VDNAGSRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVAL 390 Query: 326 SQAAEAVASGECDAEDAATEAGIVILPPVE-VDEGNSSELMDVSEPDCQSVKWPRKPVLL 150 SQAAEAVASG+ D A +EAGI++LPP + + +G + E D+ E + S+KWP KP + Sbjct: 391 SQAAEAVASGDADVNKAMSEAGIIVLPPSQDLGQGGNVEKNDMIEQESASLKWPTKPGIP 450 Query: 149 DTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 +DLFD EDSW+D PPEGFSLTLSPFATMW ALF WV++SSLAYIYGRD Sbjct: 451 QSDLFDPEDSWYDAPPEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRD 
499 >ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao] gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] Length = 708 Score = 202 bits (513), Expect = 3e-49 Identities = 124/290 (42%), Positives = 168/290 (57%), Gaps = 25/290 (8%) Frame = -2 Query: 797 EEVVVGNETAFVSTVIIGDQLSASEASMGPQKN----------------DSKPK------ 684 +E V NE F S +I+ D+ + S+ G +++ DS+ K Sbjct: 302 KEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGS 361 Query: 683 -ANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAAVVKQNGTQLKSALKSSGVKP 507 + + K IVE ++ QS S +++ A V + T LKS+LKS+G K Sbjct: 362 SSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKK 421 Query: 506 LSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIA 330 L+R VTWAD+KKA+N GNL ++ E K +S + LRF AEACA+A Sbjct: 422 LNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMA 481 Query: 329 LSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDCQSVKWPRKPVL 153 LS+AAEAVASG+ D DA E G++ILP + EVD+ E D+ EP+ VKWP+KP + Sbjct: 482 LSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGI 541 Query: 152 LDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 +D+F+ EDSW D PPEGFSLTLS FATMW ALF W+++SSLAYIYGRD Sbjct: 542 PHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRD 591 >ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao] gi|508787292|gb|EOY34548.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao] Length = 607 Score = 202 bits (513), Expect = 3e-49 Identities = 124/290 (42%), Positives = 168/290 (57%), Gaps = 25/290 (8%) Frame = -2 Query: 797 EEVVVGNETAFVSTVIIGDQLSASEASMGPQKN----------------DSKPK------ 684 +E V NE F S +I+ D+ + S+ G +++ DS+ K Sbjct: 248 KEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGS 307 Query: 683 -ANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAAVVKQNGTQLKSALKSSGVKP 507 + + K IVE ++ QS S +++ A V + T LKS+LKS+G K Sbjct: 308 SSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKK 367 Query: 506 
LSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIA 330 L+R VTWAD+KKA+N GNL ++ E K +S + LRF AEACA+A Sbjct: 368 LNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMA 427 Query: 329 LSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDCQSVKWPRKPVL 153 LS+AAEAVASG+ D DA E G++ILP + EVD+ E D+ EP+ VKWP+KP + Sbjct: 428 LSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGI 487 Query: 152 LDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 +D+F+ EDSW D PPEGFSLTLS FATMW ALF W+++SSLAYIYGRD Sbjct: 488 PHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRD 537 >ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao] gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] Length = 679 Score = 202 bits (513), Expect = 3e-49 Identities = 124/290 (42%), Positives = 168/290 (57%), Gaps = 25/290 (8%) Frame = -2 Query: 797 EEVVVGNETAFVSTVIIGDQLSASEASMGPQKN----------------DSKPK------ 684 +E V NE F S +I+ D+ + S+ G +++ DS+ K Sbjct: 302 KEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGS 361 Query: 683 -ANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAAVVKQNGTQLKSALKSSGVKP 507 + + K IVE ++ QS S +++ A V + T LKS+LKS+G K Sbjct: 362 SSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKK 421 Query: 506 LSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIA 330 L+R VTWAD+KKA+N GNL ++ E K +S + LRF AEACA+A Sbjct: 422 LNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMA 481 Query: 329 LSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDCQSVKWPRKPVL 153 LS+AAEAVASG+ D DA E G++ILP + EVD+ E D+ EP+ VKWP+KP + Sbjct: 482 LSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGI 541 Query: 152 LDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 +D+F+ EDSW D PPEGFSLTLS FATMW ALF W+++SSLAYIYGRD Sbjct: 542 PHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRD 591 >ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] Length = 739 Score = 
202 bits (513), Expect = 3e-49 Identities = 124/290 (42%), Positives = 168/290 (57%), Gaps = 25/290 (8%) Frame = -2 Query: 797 EEVVVGNETAFVSTVIIGDQLSASEASMGPQKN----------------DSKPK------ 684 +E V NE F S +I+ D+ + S+ G +++ DS+ K Sbjct: 302 KEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGS 361 Query: 683 -ANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAAVVKQNGTQLKSALKSSGVKP 507 + + K IVE ++ QS S +++ A V + T LKS+LKS+G K Sbjct: 362 SSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKK 421 Query: 506 LSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIA 330 L+R VTWAD+KKA+N GNL ++ E K +S + LRF AEACA+A Sbjct: 422 LNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMA 481 Query: 329 LSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDCQSVKWPRKPVL 153 LS+AAEAVASG+ D DA E G++ILP + EVD+ E D+ EP+ VKWP+KP + Sbjct: 482 LSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGI 541 Query: 152 LDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 +D+F+ EDSW D PPEGFSLTLS FATMW ALF W+++SSLAYIYGRD Sbjct: 542 PHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRD 591 >ref|XP_012072543.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Jatropha curcas] gi|802599693|ref|XP_012072544.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Jatropha curcas] gi|802599695|ref|XP_012072546.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Jatropha curcas] gi|643730423|gb|KDP37902.1| hypothetical protein JCGZ_05341 [Jatropha curcas] Length = 654 Score = 201 bits (511), Expect = 4e-49 Identities = 122/297 (41%), Positives = 167/297 (56%), Gaps = 38/297 (12%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASEASMGP---------------------QKNDSKP------KA 681 N+ F+ST+I D+ S S+A G + S P K Sbjct: 213 NDMDFMSTIITKDEYSISKAPSGSISTGSDMKLQEQRGKETHKGSEAQSSSPGKHAFVKT 272 Query: 680 NRKSKG---KDIVEKAGKQSE-------TQSRSALSKGPQGEDSVAAAVVKQNGTQLKSA 531 +RKSKG K I+++ + +Q+ S+++ E S A + + LK + Sbjct: 273 
SRKSKGGRSKQIIKEELSDKDLLSASNYSQTGSSMNNAEPEEKSGAKQAANLSESMLKPS 332 Query: 530 LKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXXXXXXXSLRFAL 351 LK SG K SVTWADEK +N + NL ++ E+ ++ LRF Sbjct: 333 LKPSGAKKSVHSVTWADEK-FDNAKSRNLCEVREMEDTKSGLEILDSLENNNDNMLRFES 391 Query: 350 AEACAIALSQAAEAVASGECDAEDAATEAGIVILP-PVEVDEGNSSELMDVSEPDCQSVK 174 AEACAIALSQAAEAVASG+ D DA +EAG+++LP P + G+S+++ D+ E + S+K Sbjct: 392 AEACAIALSQAAEAVASGDADVNDAMSEAGVIVLPQPHHLAPGDSTDIADMLERESASLK 451 Query: 173 WPRKPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 WP KP + +DLFD EDSW+D PPEGFSL LSPFATMW ALF WV++SSLA+IYGRD Sbjct: 452 WPAKPAVEQSDLFDSEDSWYDAPPEGFSLMLSPFATMWMALFAWVTSSSLAFIYGRD 508 >ref|XP_009389521.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Musa acuminata subsp. malaccensis] Length = 668 Score = 194 bits (494), Expect = 4e-47 Identities = 124/264 (46%), Positives = 158/264 (59%), Gaps = 5/264 (1%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASEASMGPQKNDSKPKANRKSKGKDIVEKAGKQSETQSRSALSK 600 ++ F+ST+I+G+Q+ ++ + PK + S + +K SE S + + Sbjct: 275 HKVEFMSTIIVGEQVPPGSSAAA----QNTPKLDYTST-TFVGDKESLISELDSGIHM-E 328 Query: 599 GPQGEDSVAAAVVKQ----NGTQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQ 432 G VA K+ G+ LKS+LK+S K RSV WADE++ N+ + Sbjct: 329 STTGSQKVAYEFEKKVSMDKGSVLKSSLKTSRSKNAGRSVKWADERE-------NMAQEE 381 Query: 431 KTEEKSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVI 252 + ++ S K SLRFA AEACA AL+QAAEAVASG +A DAA+EAGIVI Sbjct: 382 RKDDLKSSTK-PEESQVEDDSSLRFASAEACAAALTQAAEAVASGIAEAGDAASEAGIVI 440 Query: 251 LP-PVEVDEGNSSELMDVSEPDCQSVKWPRKPVLLDTDLFDCEDSWHDTPPEGFSLTLSP 75 LP P VDEG+ E D E D VKWP+K VLLDTD+FD EDSWHDTPPEGF L LS Sbjct: 441 LPQPKRVDEGDVEEDEDTFEFDRGYVKWPKKTVLLDTDMFDVEDSWHDTPPEGFDLKLSS 500 Query: 74 FATMWTALFGWVSASSLAYIYGRD 3 FATMW ALFGW++ SSLAYIYG D Sbjct: 501 FATMWMALFGWITCSSLAYIYGCD 524 >ref|XP_012479689.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Gossypium raimondii] Length = 695 Score = 193 bits (490), 
Expect = 1e-46 Identities = 123/293 (41%), Positives = 167/293 (56%), Gaps = 34/293 (11%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASEASMGPQKNDSKPKANRKSKG----KDIVEKAGKQSETQSRS 612 NE F S +I+ D+ + S+ G ++ S K +K++G KD EK + ++S S Sbjct: 259 NEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKL-KKTEGQGVCKDFEEKCMR---SESSS 314 Query: 611 ALSK--------------GPQGEDSVAAAVVKQ---------NGTQLKSALKSSGVKPLS 501 AL+K G D++ A K+ +G LKS+LKS+G K L+ Sbjct: 315 ALTKEDSGIVEMPSTKCVDQSGLDTINAEAEKETHSDKAVASSGVVLKSSLKSAGAKKLN 374 Query: 500 RSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXXXXXXXS--LRFALAEACAIAL 327 RSVTWAD+K + G+L ++ + + +N LRFA AEACA+AL Sbjct: 375 RSVTWADKKNVDGARKGSLCEVKEMDAQKGDSENLGRAEDGDDDDNMLRFASAEACAMAL 434 Query: 326 SQAAEAVASGECDAEDAATEAGIVILP-PVEVDEGNSSELMDV----SEPDCQSVKWPRK 162 S+AA AVASG+ D DA +EAG++IL P+E D+ E +D EP+ VKWP K Sbjct: 435 SEAAAAVASGDSDVNDAVSEAGLIILAHPLEADKEEKVENIDTLEAEPEPEEGPVKWPTK 494 Query: 161 PVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 P + +D FD EDSW D PPEGFSLTLS FATMW ALF W+++SSLAYIYGRD Sbjct: 495 PGIPRSDFFDPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRD 547 >ref|XP_012479683.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|823159708|ref|XP_012479685.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|823159710|ref|XP_012479686.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|823159712|ref|XP_012479687.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|823159714|ref|XP_012479688.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|763764410|gb|KJB31664.1| hypothetical protein B456_005G200700 [Gossypium raimondii] gi|763764411|gb|KJB31665.1| hypothetical protein B456_005G200700 [Gossypium raimondii] gi|763764412|gb|KJB31666.1| hypothetical 
protein B456_005G200700 [Gossypium raimondii] gi|763764413|gb|KJB31667.1| hypothetical protein B456_005G200700 [Gossypium raimondii] gi|763764414|gb|KJB31668.1| hypothetical protein B456_005G200700 [Gossypium raimondii] Length = 708 Score = 193 bits (490), Expect = 1e-46 Identities = 123/293 (41%), Positives = 167/293 (56%), Gaps = 34/293 (11%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASEASMGPQKNDSKPKANRKSKG----KDIVEKAGKQSETQSRS 612 NE F S +I+ D+ + S+ G ++ S K +K++G KD EK + ++S S Sbjct: 272 NEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKL-KKTEGQGVCKDFEEKCMR---SESSS 327 Query: 611 ALSK--------------GPQGEDSVAAAVVKQ---------NGTQLKSALKSSGVKPLS 501 AL+K G D++ A K+ +G LKS+LKS+G K L+ Sbjct: 328 ALTKEDSGIVEMPSTKCVDQSGLDTINAEAEKETHSDKAVASSGVVLKSSLKSAGAKKLN 387 Query: 500 RSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXXXXXXXS--LRFALAEACAIAL 327 RSVTWAD+K + G+L ++ + + +N LRFA AEACA+AL Sbjct: 388 RSVTWADKKNVDGARKGSLCEVKEMDAQKGDSENLGRAEDGDDDDNMLRFASAEACAMAL 447 Query: 326 SQAAEAVASGECDAEDAATEAGIVILP-PVEVDEGNSSELMDV----SEPDCQSVKWPRK 162 S+AA AVASG+ D DA +EAG++IL P+E D+ E +D EP+ VKWP K Sbjct: 448 SEAAAAVASGDSDVNDAVSEAGLIILAHPLEADKEEKVENIDTLEAEPEPEEGPVKWPTK 507 Query: 161 PVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 P + +D FD EDSW D PPEGFSLTLS FATMW ALF W+++SSLAYIYGRD Sbjct: 508 PGIPRSDFFDPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRD 560 >gb|KHG00854.1| hypothetical protein F383_23706 [Gossypium arboreum] Length = 729 Score = 189 bits (481), Expect = 1e-45 Identities = 123/294 (41%), Positives = 167/294 (56%), Gaps = 35/294 (11%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASEASMGPQKNDSKPKANR-KSKG--KDIVEKAGKQSETQSRSA 609 NE F S +I+ D+ + S+ G ++ S K + + KG KD EK + ++S SA Sbjct: 259 NEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKLEKTEGKGVCKDFEEKCMR---SESSSA 315 Query: 608 LSK--------------GPQGEDSVAAAVVKQ---------NGTQLKSALKSSGVKPLSR 498 L+K G D++ A K+ +G LKS+LK +G K L+R Sbjct: 316 LTKEDSGIVQMPSTKCVDQSGLDTINAEAEKETHSDKAMASSGVVLKSSLKPAGAKKLNR 375 Query: 497 SVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXXXXXXXS--LRFALAEACAIALS 324 
SVTWAD+K ++ G+L ++ + + +N LRFA AEACA+ALS Sbjct: 376 SVTWADKKNVDSARKGSLCEVKEMDAQKGDSENIGRAEDGDADDKMLRFASAEACAMALS 435 Query: 323 QAAEA--VASGECDAEDAATEAGIVILP-PVEVDEGNSSELMDVSEPDCQS----VKWPR 165 +AA A VASG+ D DA +EAG++ILP P+E D+ E +D E D + VKWP Sbjct: 436 KAAAAAAVASGDSDVNDAVSEAGLIILPHPLEADKEEKVENIDTLEADPEPEEGPVKWPT 495 Query: 164 KPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 KP + +D FD EDSW D PPEGFSLTLS FATMW ALF W+++SSLAYIYGRD Sbjct: 496 KPGIPRSDFFDPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRD 549 >ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] Length = 703 Score = 187 bits (476), Expect = 5e-45 Identities = 119/289 (41%), Positives = 160/289 (55%), Gaps = 24/289 (8%) Frame = -2 Query: 797 EEVVVGNETAFVSTVIIGDQLSASEASMGPQKN----------------DSKPK------ 684 +E V NE F S +I+ D+ + S+ G +++ DS+ K Sbjct: 302 KEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGS 361 Query: 683 -ANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAAVVKQNGTQLKSALKSSGVKP 507 + + K IVE ++ QS S +++ A V + T LKS+LKS+G K Sbjct: 362 SSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKK 421 Query: 506 LSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIA 330 L+R VTWAD+KKA+N GNL ++ E K +S + LRF AEACA+A Sbjct: 422 LNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMA 481 Query: 329 LSQAAEAVASGECDAEDAATEAGIVILPPVEVDEGNSSELMDVSEPDCQSVKWPRKPVLL 150 LS+AAEAVASG+ D DA E VD+ E D+ EP+ VKWP+KP + Sbjct: 482 LSKAAEAVASGDSDVTDAVCE----------VDKEEPMEDGDMLEPETAPVKWPKKPGIP 531 Query: 149 DTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 +D+F+ EDSW D PPEGFSLTLS FATMW ALF W+++SSLAYIYGRD Sbjct: 532 HSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRD 580 >ref|XP_007145767.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris] gi|561018957|gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris] Length = 706 Score = 185 bits (469), Expect = 3e-44 Identities = 
118/253 (46%), Positives = 150/253 (59%), Gaps = 8/253 (3%) Frame = -2 Query: 737 LSASEASMGPQKNDSKPKANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAA--V 564 +S SE +KN+S K+ V+ G+ S S D+V V Sbjct: 317 VSISERHYDVEKNNSARKS---------VQLKGETSRVTVNGDASTSNFDPDNVKEKFQV 367 Query: 563 VKQNG---TQLKSALKSSGVKPLSRSVTWADEK--KAENIDAGNLFNGQKTEEKSESIKN 399 K G T+LKS+LKS+G K LSR+VTWADEK A N D + ++SES+ N Sbjct: 368 EKVGGLCETKLKSSLKSAGEKKLSRTVTWADEKINGAGNKDLCEVKEFGDIIKESESVGN 427 Query: 398 XXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILP-PVEVDEGN 222 LR A AEACAIALSQA+EAVASG+ DA DA +EAGI+ILP P + E Sbjct: 428 EDVANNEDM--LRQASAEACAIALSQASEAVASGDSDATDAVSEAGIIILPQPHDAVEEG 485 Query: 221 SSELMDVSEPDCQSVKWPRKPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGW 42 + E D+ + D ++KWPRKP + D D F+ +DSW D PPEGFSLTLSPFA MW A+F W Sbjct: 486 TMEDADILQNDSVTLKWPRKPGISDIDFFESDDSWFDAPPEGFSLTLSPFANMWNAIFSW 545 Query: 41 VSASSLAYIYGRD 3 +++ SLAYIYGRD Sbjct: 546 MTSYSLAYIYGRD 558 >gb|KDO45360.1| hypothetical protein CISIN_1g0087651mg [Citrus sinensis] Length = 469 Score = 184 bits (466), Expect = 7e-44 Identities = 119/294 (40%), Positives = 158/294 (53%), Gaps = 35/294 (11%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASEASMGPQKNDSKPKANRKSKGKD----------------IVE 648 NE F S ++ D+ S S+ G K +K K + D I + Sbjct: 122 NEMDFTSVIMTNDEYSISKPHCGSTKTITKTKFEETKENADGENLEDQCAALGSLALIKD 181 Query: 647 KAGKQSETQSRSALS--KGPQG---------------EDSVAAAVVKQNGTQL-KSALKS 522 + ++S+T ++ LS K P E + A +G + KS+LKS Sbjct: 182 DSCRKSKTVVKAELSAQKVPSASVLPLTGSNISTVDAEREIQVAKESISGVSMPKSSLKS 241 Query: 521 SGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXXXXXXXSLRFALAEA 342 SG K + SVTWADEK + + +LF + + LRFA AEA Sbjct: 242 SGSKKVGLSVTWADEK-IDGCGSRDLFEVRDMGDDGND--------NNADDMLRFASAEA 292 Query: 341 CAIALSQAAEAVASGECDAEDAATEAGIVILP-PVEVDEGNSSELMDVSEPDCQSVKWPR 165 CA+ALS+ AEAV SG+ D DA +EAG++ILP P + EG S E DV EP+ +KWP Sbjct: 293 CAMALSRVAEAVMSGDSDVADAVSEAGVIILPSPRDGHEGESMEDPDVLEPEAALLKWPS 352 Query: 164 KPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 KP + ++LFD EDSW+D 
PPEGFSLTLSPFATMW A+F W+S+SSLAYIYGRD Sbjct: 353 KPGIPRSELFDPEDSWYDEPPEGFSLTLSPFATMWMAIFAWISSSSLAYIYGRD 406 >gb|KDO45358.1| hypothetical protein CISIN_1g0087651mg, partial [Citrus sinensis] Length = 520 Score = 184 bits (466), Expect = 7e-44 Identities = 119/294 (40%), Positives = 158/294 (53%), Gaps = 35/294 (11%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASEASMGPQKNDSKPKANRKSKGKD----------------IVE 648 NE F S ++ D+ S S+ G K +K K + D I + Sbjct: 122 NEMDFTSVIMTNDEYSISKPHCGSTKTITKTKFEETKENADGENLEDQCAALGSLALIKD 181 Query: 647 KAGKQSETQSRSALS--KGPQG---------------EDSVAAAVVKQNGTQL-KSALKS 522 + ++S+T ++ LS K P E + A +G + KS+LKS Sbjct: 182 DSCRKSKTVVKAELSAQKVPSASVLPLTGSNISTVDAEREIQVAKESISGVSMPKSSLKS 241 Query: 521 SGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXXXXXXXSLRFALAEA 342 SG K + SVTWADEK + + +LF + + LRFA AEA Sbjct: 242 SGSKKVGLSVTWADEK-IDGCGSRDLFEVRDMGDDGND--------NNADDMLRFASAEA 292 Query: 341 CAIALSQAAEAVASGECDAEDAATEAGIVILP-PVEVDEGNSSELMDVSEPDCQSVKWPR 165 CA+ALS+ AEAV SG+ D DA +EAG++ILP P + EG S E DV EP+ +KWP Sbjct: 293 CAMALSRVAEAVMSGDSDVADAVSEAGVIILPSPRDGHEGESMEDPDVLEPEAALLKWPS 352 Query: 164 KPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGWVSASSLAYIYGRD 3 KP + ++LFD EDSW+D PPEGFSLTLSPFATMW A+F W+S+SSLAYIYGRD Sbjct: 353 KPGIPRSELFDPEDSWYDEPPEGFSLTLSPFATMWMAIFAWISSSSLAYIYGRD 406 >ref|XP_011087531.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X3 [Sesamum indicum] gi|747080559|ref|XP_011087533.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X3 [Sesamum indicum] Length = 655 Score = 182 bits (463), Expect = 2e-43 Identities = 109/253 (43%), Positives = 154/253 (60%), Gaps = 5/253 (1%) Frame = -2 Query: 746 GDQLSASEASMGPQKNDSKPKANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAA 567 G+Q+ +A P N + K+ + K K + + K S ++ + S+ ++ Sbjct: 260 GNQMEKPDA---PLPNVQETKSKKSDKHKHVTKTDDKLSILEAAAGPSQNDLTKEENGHR 316 Query: 566 VVKQ---NGTQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKN 399 + K+ T LKS+LK+S K +RSVTWAD K + D 
NL ++ ++ K + + Sbjct: 317 LGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLCEFREVKDGKGALVTS 374 Query: 398 XXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGN 222 S R A AEACA ALSQAAEAVA+G+ D DA +EAG++ILPP EVDE Sbjct: 375 HSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEAGVIILPPPHEVDEAK 434 Query: 221 SSELMDVSEPDCQSVKWPRKPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGW 42 E+ DV++ D +KWP KP + DLFD EDSW+D+PPEGFSLTLSPF+TM+ ALF W Sbjct: 435 HEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSLTLSPFSTMFMALFAW 494 Query: 41 VSASSLAYIYGRD 3 +++SSLAYIYG++ Sbjct: 495 ITSSSLAYIYGKE 507 >ref|XP_011087530.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Sesamum indicum] Length = 687 Score = 182 bits (463), Expect = 2e-43 Identities = 109/253 (43%), Positives = 154/253 (60%), Gaps = 5/253 (1%) Frame = -2 Query: 746 GDQLSASEASMGPQKNDSKPKANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAA 567 G+Q+ +A P N + K+ + K K + + K S ++ + S+ ++ Sbjct: 292 GNQMEKPDA---PLPNVQETKSKKSDKHKHVTKTDDKLSILEAAAGPSQNDLTKEENGHR 348 Query: 566 VVKQ---NGTQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKN 399 + K+ T LKS+LK+S K +RSVTWAD K + D NL ++ ++ K + + Sbjct: 349 LGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLCEFREVKDGKGALVTS 406 Query: 398 XXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGN 222 S R A AEACA ALSQAAEAVA+G+ D DA +EAG++ILPP EVDE Sbjct: 407 HSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEAGVIILPPPHEVDEAK 466 Query: 221 SSELMDVSEPDCQSVKWPRKPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGW 42 E+ DV++ D +KWP KP + DLFD EDSW+D+PPEGFSLTLSPF+TM+ ALF W Sbjct: 467 HEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSLTLSPFSTMFMALFAW 526 Query: 41 VSASSLAYIYGRD 3 +++SSLAYIYG++ Sbjct: 527 ITSSSLAYIYGKE 539 >ref|XP_011087529.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Sesamum indicum] Length = 699 Score = 182 bits (463), Expect = 2e-43 Identities = 109/253 (43%), Positives = 154/253 (60%), Gaps = 5/253 (1%) Frame = -2 Query: 746 
GDQLSASEASMGPQKNDSKPKANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAA 567 G+Q+ +A P N + K+ + K K + + K S ++ + S+ ++ Sbjct: 304 GNQMEKPDA---PLPNVQETKSKKSDKHKHVTKTDDKLSILEAAAGPSQNDLTKEENGHR 360 Query: 566 VVKQ---NGTQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKN 399 + K+ T LKS+LK+S K +RSVTWAD K + D NL ++ ++ K + + Sbjct: 361 LGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLCEFREVKDGKGALVTS 418 Query: 398 XXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGN 222 S R A AEACA ALSQAAEAVA+G+ D DA +EAG++ILPP EVDE Sbjct: 419 HSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEAGVIILPPPHEVDEAK 478 Query: 221 SSELMDVSEPDCQSVKWPRKPVLLDTDLFDCEDSWHDTPPEGFSLTLSPFATMWTALFGW 42 E+ DV++ D +KWP KP + DLFD EDSW+D+PPEGFSLTLSPF+TM+ ALF W Sbjct: 479 HEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSLTLSPFSTMFMALFAW 538 Query: 41 VSASSLAYIYGRD 3 +++SSLAYIYG++ Sbjct: 539 ITSSSLAYIYGKE 551 >ref|XP_010921353.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Elaeis guineensis] Length = 664 Score = 182 bits (463), Expect = 2e-43 Identities = 114/270 (42%), Positives = 156/270 (57%), Gaps = 11/270 (4%) Frame = -2 Query: 779 NETAFVSTVIIGDQLSASEASM----------GPQKNDSKPKANRKSKGKDIVEKAGKQS 630 N+ F S +++GD+ S S P D K + ++ + ++ Sbjct: 276 NQMDFKSVIVMGDEAQTSSVSTKNHSEQFDFTSPMIIDQPSKTSFVELDNNLNNEVHLEN 335 Query: 629 ETQSRSALSKGPQGEDSVAAAVVKQNGTQLKSALKSSGVKPLSRSVTWADEKKAENIDAG 450 E +S K + +D V +++ T LKS+LK++G K ++V WAD +K + + Sbjct: 336 ELESLEIAQK--ELKDRVK---MEKKETALKSSLKAAGSKVGRQTVKWADMEKDKAPE-- 388 Query: 449 NLFNGQKTEEKSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAAT 270 ++ + +I SLRFA AEACA AL+QAAE+VASG +A DA + Sbjct: 389 -----ERKDGPEGNISTGALHGDDDGSSLRFASAEACAAALTQAAESVASGLSEAGDAVS 443 Query: 269 EAGIVILP-PVEVDEGNSSELMDVSEPDCQSVKWPRKPVLLDTDLFDCEDSWHDTPPEGF 93 EAGIVILP P V EG++ D E D VKWP+K VLLDTD+F+ EDSWHDTPPEGF Sbjct: 444 EAGIVILPQPQHVKEGDAEADEDTFEFDRGFVKWPQKTVLLDTDMFEVEDSWHDTPPEGF 503 Query: 92 SLTLSPFATMWTALFGWVSASSLAYIYGRD 3 SLTLS FATMW ALFGW++ SSLAYIYG++ Sbjct: 504 
SLTLSSFATMWMALFGWITCSSLAYIYGQN 533