BLASTX nr result
ID: Mentha28_contig00025756
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00025756 (949 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU34894.1| hypothetical protein MIMGU_mgv1a005743mg [Mimulus...   383   e-104
gb|EYU34893.1| hypothetical protein MIMGU_mgv1a005743mg [Mimulus...   360   5e-97
ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated prot...   301   2e-79
ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated prot...   295   2e-77
ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated prot...   291   3e-76
ref|XP_006339935.1| PREDICTED: RNA polymerase II-associated prot...   290   4e-76
ref|XP_007031159.1| Tetratricopeptide repeat (TPR)-like superfam...   286   1e-74
emb|CBI39598.3| unnamed protein product [Vitis vinifera]              285   1e-74
ref|XP_007031161.1| Tetratricopeptide repeat (TPR)-like superfam...   285   2e-74
ref|XP_007031158.1| Tetratricopeptide repeat-like superfamily pr...   285   2e-74
ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated prot...   285   2e-74
ref|XP_006339934.1| PREDICTED: RNA polymerase II-associated prot...   283   9e-74
ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated prot...   276   9e-72
ref|XP_006382393.1| hypothetical protein POPTR_0005s01710g [Popu...   275   1e-71
ref|XP_006392426.1| hypothetical protein EUTSA_v10023436mg [Eutr...   265   3e-68
ref|XP_004144746.1| PREDICTED: RNA polymerase II-associated prot...   262   1e-67
ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citr...   260   6e-67
ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated prot...   257   4e-66
ref|NP_176039.2| carboxylate clamp-tetratricopeptide repeat prot...
257 4e-66 ref|NP_001185250.1| carboxylate clamp-tetratricopeptide repeat p... 257 4e-66 >gb|EYU34894.1| hypothetical protein MIMGU_mgv1a005743mg [Mimulus guttatus] Length = 472 Score = 383 bits (984), Expect = e-104 Identities = 201/302 (66%), Positives = 239/302 (79%), Gaps = 2/302 (0%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKL-DNPSQSSYNDSS 222 MAK P KHSRD +++EGLLNNLQDWELSFKEKDKK+R+D +G+ KL D Q S N+ Sbjct: 1 MAKTPSKHSRDLPQELEGLLNNLQDWELSFKEKDKKMRSDAVGKAKLKDQLGQPSGNNVG 60 Query: 223 RLSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKEL 402 R D+ +V ++T +KT S N +TV+Q+DHLK+YE LS+L+S V S E VDANSEKEL Sbjct: 61 RPYDSAKVHRKQTMEKTPSYENPSTVKQYDHLKEYEALSRLSSAVPSTESFVDANSEKEL 120 Query: 403 GNEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRY 582 GNEFFKQKKF EAIDCYSRSIALSP+AVAYANRAMAYIK++RFQEAENDCTEALNLDDRY Sbjct: 121 GNEFFKQKKFTEAIDCYSRSIALSPSAVAYANRAMAYIKIRRFQEAENDCTEALNLDDRY 180 Query: 583 IKAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGAL 762 IKAYSRRSTARKELGKLK+S+DD EFALRLEPQNQ+IKKQ AE K+LLEK I+KK SGAL Sbjct: 181 IKAYSRRSTARKELGKLKESMDDAEFALRLEPQNQEIKKQFAESKSLLEKAILKKVSGAL 240 Query: 763 AGPLEGAHKGKVEADKKKYALNSPAGRVAKFVEDDSKAN-IIEVSNQESVEHKGSTILSN 939 G +EGA GK E DK +GRV + +E++ K N EV+ S+E K +++ SN Sbjct: 241 TGSVEGAQPGKSEVDKNNNVQKPLSGRVTEILEENPKGNKSREVTADTSMEIKQNSVRSN 300 Query: 940 GS 945 GS Sbjct: 301 GS 302 >gb|EYU34893.1| hypothetical protein MIMGU_mgv1a005743mg [Mimulus guttatus] Length = 462 Score = 360 bits (924), Expect = 5e-97 Identities = 194/302 (64%), Positives = 229/302 (75%), Gaps = 2/302 (0%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKL-DNPSQSSYNDSS 222 MAK P KHSRD QDWELSFKEKDKK+R+D +G+ KL D Q S N+ Sbjct: 1 MAKTPSKHSRDLP----------QDWELSFKEKDKKMRSDAVGKAKLKDQLGQPSGNNVG 50 Query: 223 RLSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKEL 402 R D+ +V ++T +KT S N +TV+Q+DHLK+YE LS+L+S V S E VDANSEKEL Sbjct: 51 RPYDSAKVHRKQTMEKTPSYENPSTVKQYDHLKEYEALSRLSSAVPSTESFVDANSEKEL 110 Query: 
403 GNEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRY 582 GNEFFKQKKF EAIDCYSRSIALSP+AVAYANRAMAYIK++RFQEAENDCTEALNLDDRY Sbjct: 111 GNEFFKQKKFTEAIDCYSRSIALSPSAVAYANRAMAYIKIRRFQEAENDCTEALNLDDRY 170 Query: 583 IKAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGAL 762 IKAYSRRSTARKELGKLK+S+DD EFALRLEPQNQ+IKKQ AE K+LLEK I+KK SGAL Sbjct: 171 IKAYSRRSTARKELGKLKESMDDAEFALRLEPQNQEIKKQFAESKSLLEKAILKKVSGAL 230 Query: 763 AGPLEGAHKGKVEADKKKYALNSPAGRVAKFVEDDSKAN-IIEVSNQESVEHKGSTILSN 939 G +EGA GK E DK +GRV + +E++ K N EV+ S+E K +++ SN Sbjct: 231 TGSVEGAQPGKSEVDKNNNVQKPLSGRVTEILEENPKGNKSREVTADTSMEIKQNSVRSN 290 Query: 940 GS 945 GS Sbjct: 291 GS 292 >ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X1 [Solanum tuberosum] Length = 468 Score = 301 bits (772), Expect = 2e-79 Identities = 165/291 (56%), Positives = 212/291 (72%), Gaps = 2/291 (0%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MAKVP KHSRDQ +D++GLLNNLQDWELS K KDKK+++ G+ L + ++ +S Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETL----REDWSRTSE 56 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 L +P+V+G + TS + +++ K Y +S L+S + S+E ++ANSEKELG Sbjct: 57 LLTSPQVNGTRVGKSTSIR---SAAGPYNYSKNYNPISHLSSELISEESNINANSEKELG 113 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NE FKQKKFNEAIDCYSRSIALSPTAV+YANRAMAY+K+KRFQEAENDCTEALNLDDRYI Sbjct: 114 NECFKQKKFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYI 173 Query: 586 KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALA 765 KAYSRRST+RKELGKLK+SI+D EFALRLEPQN +IKKQ+ E KAL EKEI K+ SGA Sbjct: 174 KAYSRRSTSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKEIRKRVSGATD 233 Query: 766 GPLEGAHK-GK-VEADKKKYALNSPAGRVAKFVEDDSKANIIEVSNQESVE 912 + A K GK +++ +++S + ++A+ +K N +V VE Sbjct: 234 VSAQRAQKSGKTIKSGPVIQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVE 284 >ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X2 [Solanum 
tuberosum] Length = 467 Score = 295 bits (755), Expect = 2e-77 Identities = 164/291 (56%), Positives = 211/291 (72%), Gaps = 2/291 (0%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MAKVP KHSRDQ +D++GLLNNLQDWELS K KDKK+++ G+ L + ++ +S Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETL----REDWSRTSE 56 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 L +P+V+G + TS + +++ K Y +S L+S + S+E ++ANSEKELG Sbjct: 57 LLTSPQVNGTRVGKSTSIR---SAAGPYNYSKNYNPISHLSSELISEESNINANSEKELG 113 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NE FKQKKFNEAIDCYSRSIALSPTAV+YANRAMAY+K+KRFQEAENDCTEALNLDDRYI Sbjct: 114 NECFKQKKFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYI 173 Query: 586 KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALA 765 KAYSRRST+RKELGKLK+SI+D EFALRLEPQN +IKKQ+ E KAL EK I K+ SGA Sbjct: 174 KAYSRRSTSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEK-IRKRVSGATD 232 Query: 766 GPLEGAHK-GK-VEADKKKYALNSPAGRVAKFVEDDSKANIIEVSNQESVE 912 + A K GK +++ +++S + ++A+ +K N +V VE Sbjct: 233 VSAQRAQKSGKTIKSGPVIQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVE 283 >ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated protein 3-like [Solanum lycopersicum] Length = 470 Score = 291 bits (744), Expect = 3e-76 Identities = 159/291 (54%), Positives = 208/291 (71%), Gaps = 2/291 (0%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MA+VP HSRDQ +D++GL NNLQDWEL+ K KDKK+++ G+ L + ++ +S Sbjct: 1 MARVPSNHSRDQFQDMQGLFNNLQDWELALKGKDKKMKSQAGGKETL----KEDWSRTSE 56 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 +P+ +G + K++S N + + K Y +S L+S + S+E ++ANSEKELG Sbjct: 57 PLTSPQANGTQQVGKSTSI--RNAAGPYSYSKNYNPISHLSSELISEESNINANSEKELG 114 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NE FKQKKFNEAIDCYSRSIALSPTAV+YANRAMAY+K+KRFQEAENDCTEALNLDDRYI Sbjct: 115 NECFKQKKFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYI 174 Query: 586 
KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALA 765 KAYSRRST+RKELGKLK+SI+D EFAL LEP+N +IKKQ+ E KAL EKEI+K+ SGA Sbjct: 175 KAYSRRSTSRKELGKLKESIEDAEFALWLEPRNPEIKKQYGEVKALYEKEILKRVSGATD 234 Query: 766 GPLEGAHK-GK-VEADKKKYALNSPAGRVAKFVEDDSKANIIEVSNQESVE 912 +G K GK ++ +++S + +VA+ +K N +V VE Sbjct: 235 VSAQGPQKSGKTIKIGPVIQSVSSSSQKVAEVRTIPAKENNRDVLGTAKVE 285 >ref|XP_006339935.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X4 [Solanum tuberosum] Length = 419 Score = 290 bits (743), Expect = 4e-76 Identities = 150/237 (63%), Positives = 184/237 (77%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MAKVP KHSRDQ +D++GLLNNLQDWELS K KDKK+++ G+ L + ++ +S Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETL----REDWSRTSE 56 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 L +P+V+G + TS + +++ K Y +S L+S + S+E ++ANSEKELG Sbjct: 57 LLTSPQVNGTRVGKSTSIR---SAAGPYNYSKNYNPISHLSSELISEESNINANSEKELG 113 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NE FKQKKFNEAIDCYSRSIALSPTAV+YANRAMAY+K+KRFQEAENDCTEALNLDDRYI Sbjct: 114 NECFKQKKFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYI 173 Query: 586 KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASG 756 KAYSRRST+RKELGKLK+SI+D EFALRLEPQN +IKKQ+ E KAL EKE + G Sbjct: 174 KAYSRRSTSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKENNRDVPG 230 >ref|XP_007031159.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 [Theobroma cacao] gi|508719764|gb|EOY11661.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 [Theobroma cacao] Length = 422 Score = 286 bits (731), Expect = 1e-74 Identities = 151/257 (58%), Positives = 190/257 (73%), Gaps = 1/257 (0%) Frame = +1 Query: 64 KHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSRLSDNPR 243 KHSRDQ D +G LNNLQDWELS KEKDK ++ S +D +L N + Sbjct: 4 KHSRDQALDFQGFLNNLQDWELSLKEKDKIMK--------------SQASDKEQLKTNEK 49 Query: 244 
VDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELGNEFFKQ 423 GR T + +T + RQ+D+L+ Y++ + L+S+ ++E + DA SEKELGNE+FKQ Sbjct: 50 --GRPTGKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQ 107 Query: 424 KKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYIKAYSRR 603 KKF EAIDCYSRSI LSPTAVA+ANRAMAY+K+K+FQEAE+DCTEALNLDDRYIKAYSRR Sbjct: 108 KKFKEAIDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRR 167 Query: 604 STARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALAGPLEGA 783 +TARKELGKLK+SI+D EFALRLEP NQ+IKKQHAE K+L EKEI++KASG L ++ A Sbjct: 168 ATARKELGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEA 227 Query: 784 HK-GKVEADKKKYALNS 831 + GK E + ++S Sbjct: 228 QEVGKSETKENGLGMHS 244 >emb|CBI39598.3| unnamed protein product [Vitis vinifera] Length = 1097 Score = 285 bits (730), Expect = 1e-74 Identities = 158/303 (52%), Positives = 212/303 (69%), Gaps = 8/303 (2%) Frame = +1 Query: 40 STMAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPS-QSSYND 216 S + P KH+RDQ D +G L +LQDWELS KEKDKK++A + D P+ + + Sbjct: 623 SMATRFPSKHARDQALDFQGFLTDLQDWELSLKEKDKKMKAQAEEK---DVPTARGNVKH 679 Query: 217 SSRLSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEK 396 SS+LS +P V R + ++ + RQH++ + ++ +S+++S+ ++E + DA SEK Sbjct: 680 SSKLSSSPGVSLRLGQSRSDT-------RQHEYSRNHDAISRISSSFMTEESLPDAASEK 732 Query: 397 ELGNEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDD 576 ELGNE+FKQ+KF EAIDCYSRSIAL PTAVAYANRAMAYIK+KRF+EAE+DC EALNLDD Sbjct: 733 ELGNEYFKQRKFKEAIDCYSRSIALLPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDD 792 Query: 577 RYIKAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASG 756 RYIKAYSRR+TARKELGK K++ +D EFALRLEPQNQ+IKKQ+AE K+L EKEI++KASG Sbjct: 793 RYIKAYSRRATARKELGKFKEATEDAEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASG 852 Query: 757 ALAGPLEGAHK-GK----VEADKK--KYALNSPAGRVAKFVEDDSKANIIEVSNQESVEH 915 AL ++G K GK V AD + + +S G ++D ++ E E+ Sbjct: 853 ALKSSVQGLQKVGKSVVEVNADTQGVRSISSSSQGAGEAAIQDRFMVPANTSTSMEETEN 912 Query: 916 KGS 924 KG+ Sbjct: 913 KGT 915 >ref|XP_007031161.1| Tetratricopeptide repeat 
(TPR)-like superfamily protein, putative isoform 4 [Theobroma cacao] gi|508719766|gb|EOY11663.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 4 [Theobroma cacao] Length = 421 Score = 285 bits (728), Expect = 2e-74 Identities = 149/257 (57%), Positives = 188/257 (73%), Gaps = 1/257 (0%) Frame = +1 Query: 64 KHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSRLSDNPR 243 KHSRDQ D +G LNNLQDWELS KEKDK +++ + +L N Sbjct: 4 KHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLTNEK--------------- 48 Query: 244 VDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELGNEFFKQ 423 GR T + +T + RQ+D+L+ Y++ + L+S+ ++E + DA SEKELGNE+FKQ Sbjct: 49 --GRPTGKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQ 106 Query: 424 KKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYIKAYSRR 603 KKF EAIDCYSRSI LSPTAVA+ANRAMAY+K+K+FQEAE+DCTEALNLDDRYIKAYSRR Sbjct: 107 KKFKEAIDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRR 166 Query: 604 STARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALAGPLEGA 783 +TARKELGKLK+SI+D EFALRLEP NQ+IKKQHAE K+L EKEI++KASG L ++ A Sbjct: 167 ATARKELGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEA 226 Query: 784 HK-GKVEADKKKYALNS 831 + GK E + ++S Sbjct: 227 QEVGKSETKENGLGMHS 243 >ref|XP_007031158.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508719763|gb|EOY11660.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] Length = 468 Score = 285 bits (728), Expect = 2e-74 Identities = 149/257 (57%), Positives = 188/257 (73%), Gaps = 1/257 (0%) Frame = +1 Query: 64 KHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSRLSDNPR 243 KHSRDQ D +G LNNLQDWELS KEKDK +++ + +L N Sbjct: 4 KHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLTNEK--------------- 48 Query: 244 VDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELGNEFFKQ 423 GR T + +T + RQ+D+L+ Y++ + L+S+ ++E + DA SEKELGNE+FKQ Sbjct: 49 --GRPTGKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQ 106 Query: 424 
KKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYIKAYSRR 603 KKF EAIDCYSRSI LSPTAVA+ANRAMAY+K+K+FQEAE+DCTEALNLDDRYIKAYSRR Sbjct: 107 KKFKEAIDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRR 166 Query: 604 STARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALAGPLEGA 783 +TARKELGKLK+SI+D EFALRLEP NQ+IKKQHAE K+L EKEI++KASG L ++ A Sbjct: 167 ATARKELGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEA 226 Query: 784 HK-GKVEADKKKYALNS 831 + GK E + ++S Sbjct: 227 QEVGKSETKENGLGMHS 243 >ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated protein 3-like [Vitis vinifera] Length = 474 Score = 285 bits (728), Expect = 2e-74 Identities = 157/299 (52%), Positives = 211/299 (70%), Gaps = 8/299 (2%) Frame = +1 Query: 52 KVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPS-QSSYNDSSRL 228 + P KH+RDQ D +G L +LQDWELS KEKDKK++A + D P+ + + SS+L Sbjct: 4 RFPSKHARDQALDFQGFLTDLQDWELSLKEKDKKMKAQAEEK---DVPTARGNVKHSSKL 60 Query: 229 SDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELGN 408 S +P V R + ++ + RQH++ + ++ +S+++S+ ++E + DA SEKELGN Sbjct: 61 SSSPGVSLRLGQSRSDT-------RQHEYSRNHDAISRISSSFMTEESLPDAASEKELGN 113 Query: 409 EFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYIK 588 E+FKQ+KF EAIDCYSRSIAL PTAVAYANRAMAYIK+KRF+EAE+DC EALNLDDRYIK Sbjct: 114 EYFKQRKFKEAIDCYSRSIALLPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDDRYIK 173 Query: 589 AYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALAG 768 AYSRR+TARKELGK K++ +D EFALRLEPQNQ+IKKQ+AE K+L EKEI++KASGAL Sbjct: 174 AYSRRATARKELGKFKEATEDAEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASGALKS 233 Query: 769 PLEGAHK-GK----VEADKK--KYALNSPAGRVAKFVEDDSKANIIEVSNQESVEHKGS 924 ++G K GK V AD + + +S G ++D ++ E E+KG+ Sbjct: 234 SVQGLQKVGKSVVEVNADTQGVRSISSSSQGAGEAAIQDRFMVPANTSTSMEETENKGT 292 >ref|XP_006339934.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X3 [Solanum tuberosum] Length = 461 Score = 283 bits (723), Expect = 9e-74 Identities = 159/291 (54%), Positives = 206/291 (70%), Gaps = 2/291 (0%) Frame = +1 Query: 46 
MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MAKVP KHSRDQ +D++GLLNNLQDWELS K KDKK+++ G+ L + ++ +S Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETL----REDWSRTSE 56 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 L +P+V+G + TS + +++ K Y +S L+S + S+E ++ANSEKEL Sbjct: 57 LLTSPQVNGTRVGKSTSIR---SAAGPYNYSKNYNPISHLSSELISEESNINANSEKEL- 112 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 KKFNEAIDCYSRSIALSPTAV+YANRAMAY+K+KRFQEAENDCTEALNLDDRYI Sbjct: 113 ------KKFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYI 166 Query: 586 KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALA 765 KAYSRRST+RKELGKLK+SI+D EFALRLEPQN +IKKQ+ E KAL EKEI K+ SGA Sbjct: 167 KAYSRRSTSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKEIRKRVSGATD 226 Query: 766 GPLEGAHK-GK-VEADKKKYALNSPAGRVAKFVEDDSKANIIEVSNQESVE 912 + A K GK +++ +++S + ++A+ +K N +V VE Sbjct: 227 VSAQRAQKSGKTIKSGPVIQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVE 277 >ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated protein 3-like [Fragaria vesca subsp. 
vesca] Length = 407 Score = 276 bits (706), Expect = 9e-72 Identities = 148/252 (58%), Positives = 184/252 (73%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MA+ P KH RDQ D +G L++LQDWELS K+KDKK+R P Q Sbjct: 1 MARAPSKHGRDQALDFQGFLSDLQDWELSLKDKDKKMR-----------PQQ-------- 41 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 P + K+RD +S+Y+TN YE ++ ++S+ +S++G+ DA SEK+LG Sbjct: 42 ----PNKEAPKSRDFGTSSYSTN----------YEPMNTVSSSFTSEDGLPDAASEKDLG 87 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NE+FKQKKF EAIDCYSRSIAL+PTAVA+ANRAM+YIK+KRFQEAENDCTEALNLDDRYI Sbjct: 88 NEYFKQKKFKEAIDCYSRSIALTPTAVAFANRAMSYIKIKRFQEAENDCTEALNLDDRYI 147 Query: 586 KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALA 765 KAYSRR+TARKELGKLK+SI+D EFALRLEP NQ+IKKQ+AE K+L EK I++K SGA+ Sbjct: 148 KAYSRRATARKELGKLKESIEDAEFALRLEPHNQEIKKQYAEAKSLYEKGILQKVSGAI- 206 Query: 766 GPLEGAHKGKVE 801 + K KVE Sbjct: 207 -KISEQDKQKVE 217 >ref|XP_006382393.1| hypothetical protein POPTR_0005s01710g [Populus trichocarpa] gi|550337753|gb|ERP60190.1| hypothetical protein POPTR_0005s01710g [Populus trichocarpa] Length = 402 Score = 275 bits (704), Expect = 1e-71 Identities = 148/263 (56%), Positives = 195/263 (74%), Gaps = 2/263 (0%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MA+VP KH RDQ D +G LN+LQDWEL K+ DKK++ SR Sbjct: 1 MARVPGKHGRDQALDFQGFLNDLQDWEL-LKDTDKKMKK------------------KSR 41 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVR-QHDHLKKYEELSQLASTVSSKEGVVDANSEKEL 402 SD + ++ KTS+A ++ + Q+++ + + +++L+S+ ++ E VDA +EKEL Sbjct: 42 ASDVKIGEDGRSEGKTSAADSSRSGSGQYEYSRNFGAINRLSSSFTTDEITVDATTEKEL 101 Query: 403 GNEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRY 582 GNE+FKQKKFNEAI+CYSRSIALSPTAVAYANRAMAY+K+KRF+EAE+DCTEALNLDDRY Sbjct: 102 GNEYFKQKKFNEAIECYSRSIALSPTAVAYANRAMAYLKIKRFREAEDDCTEALNLDDRY 161 Query: 583 IKAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGAL 762 
IKAYSRR+TARKELGKLK+SI+D EFAL+LEP NQ+IKKQ+AE K+L EKEI++KASG L Sbjct: 162 IKAYSRRATARKELGKLKESIEDSEFALKLEPNNQEIKKQYAEVKSLYEKEILQKASGTL 221 Query: 763 AGPLEGAHK-GKVEADKKKYALN 828 L+G + G+ EA +A++ Sbjct: 222 RSSLQGTQQGGRSEASVNGHAVH 244 >ref|XP_006392426.1| hypothetical protein EUTSA_v10023436mg [Eutrema salsugineum] gi|557088932|gb|ESQ29712.1| hypothetical protein EUTSA_v10023436mg [Eutrema salsugineum] Length = 473 Score = 265 bits (676), Expect = 3e-68 Identities = 154/311 (49%), Positives = 201/311 (64%), Gaps = 14/311 (4%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MA+ P KH RDQ +D +G LN+LQDWELS K+KDKK++ + L NP Sbjct: 1 MARSPSKHGRDQTQDFQGFLNDLQDWELSLKDKDKKIKQN------LSNP---------- 44 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 TS + + Q+D +KKY +S L+S+ + E +DANSEKE G Sbjct: 45 ---------------TSEKFRPSGSGQYDFVKKYGPMSGLSSSFADDESPLDANSEKEQG 89 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NE+FKQKKFNEAIDCYSRSIALSP AVA+ANRAMAY+K+KR++EAE DCTEALNLDDRY Sbjct: 90 NEYFKQKKFNEAIDCYSRSIALSPNAVAFANRAMAYLKIKRYREAEIDCTEALNLDDRYT 149 Query: 586 KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGALA 765 KAYSRR+TARK LG +K++++D EFALRLEPQ+Q+++KQ+A+ K+LLEKEI++KASGA+ Sbjct: 150 KAYSRRATARKALGMVKEAMEDAEFALRLEPQSQELQKQYADIKSLLEKEIIEKASGAMQ 209 Query: 766 GPLEGAHKGKVEADKKKYALNS----PAGRVAK----FVEDDSKAN------IIEVSNQE 903 + K DKK N+ P VAK V S++N +IE + E Sbjct: 210 STAQELLK-TAGLDKKAKIPNTDTKKPVTLVAKTNGDMVRPVSRSNESSGKKLIESVDPE 268 Query: 904 SVEHKGSTILS 936 +GS +S Sbjct: 269 EKHKEGSKKIS 279 >ref|XP_004144746.1| PREDICTED: RNA polymerase II-associated protein 3-like [Cucumis sativus] gi|449517788|ref|XP_004165926.1| PREDICTED: RNA polymerase II-associated protein 3-like [Cucumis sativus] Length = 458 Score = 262 bits (670), Expect = 1e-67 Identities = 136/238 (57%), Positives = 172/238 (72%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MA KH RDQL D +G 
LN+LQDWE+SFK KDKKL+ +G+ K D Sbjct: 1 MADSSAKHGRDQLLDFQGFLNDLQDWEVSFKGKDKKLKPQAIGKEKED------------ 48 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 R+ +K S+A D++K+Y+ +++L+ ++ VDA SEKE G Sbjct: 49 ---------RRQTEKASAA---------DYMKQYDAVNRLSRNFQTEGSFVDAASEKEQG 90 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NE+FKQKKF EAIDCYSRSIALSPTAVA+ANRAMAY+K++RFQEAE+DCTEALNLDDRYI Sbjct: 91 NEYFKQKKFKEAIDCYSRSIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYI 150 Query: 586 KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGA 759 KAYSRR+TARKELGK K++++D EFA RLEP NQ+IKKQHA+ +A + K I++KASGA Sbjct: 151 KAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQHADLRAFVGKAILEKASGA 208 >ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citrus clementina] gi|557535662|gb|ESR46780.1| hypothetical protein CICLE_v10003914mg [Citrus clementina] Length = 977 Score = 260 bits (664), Expect = 6e-67 Identities = 141/243 (58%), Positives = 179/243 (73%) Frame = +1 Query: 34 FRSTMAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYN 213 +++ K+PP H+RDQ D +G LN+LQDW+LS EKDKK++ + Sbjct: 533 YKAINEKMPP-HNRDQALDFQGFLNDLQDWDLSLHEKDKKMK----------------HK 575 Query: 214 DSSRLSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSE 393 SS+ DN K+ +K S + N+ + + Y+ +S+++S++ ++E DA SE Sbjct: 576 ASSK--DNLVSSSLKSGEKPSPSGNS-------YSRNYDPVSRISSSLMNEESTPDATSE 626 Query: 394 KELGNEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLD 573 KELGNE FKQKKF EAIDCYSRSIALSPTAVAYANRAMAY+K++RFQEAE+DCTEALNLD Sbjct: 627 KELGNECFKQKKFKEAIDCYSRSIALSPTAVAYANRAMAYLKLRRFQEAEDDCTEALNLD 686 Query: 574 DRYIKAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKAS 753 DRYIKAYSRR+TARKELGKLK+SI+D EFALRLEPQNQ+IKKQ AE K+L EKE+ +KAS Sbjct: 687 DRYIKAYSRRATARKELGKLKESIEDSEFALRLEPQNQEIKKQLAEVKSLYEKEVFQKAS 746 Query: 754 GAL 762 L Sbjct: 747 KTL 749 >ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated protein 3-like [Citrus sinensis] Length = 438 Score = 257 bits (657), Expect = 4e-66 
Identities = 137/233 (58%), Positives = 168/233 (72%) Frame = +1 Query: 64 KHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSRLSDNPR 243 KH+RDQ D +G LN+LQDW+LS EKDKK++ + L + S S Sbjct: 3 KHNRDQALDFQGFLNDLQDWDLSLNEKDKKMKHKASSKDNLVSSSLKS------------ 50 Query: 244 VDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELGNEFFKQ 423 K + ++Y+ N Y+ +S ++S++ ++E DA SEKELGNE FKQ Sbjct: 51 ---AKKPSPSGNSYSRN----------YDPVSHISSSLMNEESTPDATSEKELGNECFKQ 97 Query: 424 KKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYIKAYSRR 603 KKF EAIDCYSRSIALSPTAVAYANRAMAY+K++RFQEAE+DCTEALNLDDRYIKAYSRR Sbjct: 98 KKFKEAIDCYSRSIALSPTAVAYANRAMAYLKLRRFQEAEDDCTEALNLDDRYIKAYSRR 157 Query: 604 STARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGAL 762 +TARKELGKLK+SI+D EFALRLEPQNQ+IKKQ AE K+L EKE+ +KAS L Sbjct: 158 ATARKELGKLKESIEDSEFALRLEPQNQEIKKQLAEVKSLYEKEVFQKASKTL 210 >ref|NP_176039.2| carboxylate clamp-tetratricopeptide repeat protein [Arabidopsis thaliana] gi|53828529|gb|AAU94374.1| At1g56440 [Arabidopsis thaliana] gi|59958350|gb|AAX12885.1| At1g56440 [Arabidopsis thaliana] gi|110743110|dbj|BAE99447.1| hypothetical protein [Arabidopsis thaliana] gi|332195274|gb|AEE33395.1| carboxylate clamp-tetratricopeptide repeat [Arabidopsis thaliana] Length = 476 Score = 257 bits (657), Expect = 4e-66 Identities = 134/239 (56%), Positives = 175/239 (73%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MA+ P KH RDQ +D +G N+LQDWELS K+KDKK++ P+ SS Sbjct: 1 MARSPSKHGRDQTQDFQGFFNDLQDWELSLKDKDKKIK---------QQPANSS------ 45 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 NP +S + + ++D KKY + L+S++ E ++D++SEKE G Sbjct: 46 ---NP----------SSETFRPSGSGKYDFAKKYRSIRDLSSSLIG-ESLLDSSSEKEQG 91 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NEFFKQKKFNEAIDCYSRSIALSP AV YANRAMAY+K+KR++EAE DCTEALNLDDRYI Sbjct: 92 NEFFKQKKFNEAIDCYSRSIALSPNAVTYANRAMAYLKIKRYREAEVDCTEALNLDDRYI 151 Query: 586 
KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGAL 762 KAYSRR+TARKELG +K++ +D EFALRLEP++Q++KKQ+A+ K+LLEKEI++KA+GA+ Sbjct: 152 KAYSRRATARKELGMIKEAKEDAEFALRLEPESQELKKQYADIKSLLEKEIIEKATGAM 210 >ref|NP_001185250.1| carboxylate clamp-tetratricopeptide repeat protein [Arabidopsis thaliana] gi|332195275|gb|AEE33396.1| carboxylate clamp-tetratricopeptide repeat [Arabidopsis thaliana] Length = 494 Score = 257 bits (657), Expect = 4e-66 Identities = 134/239 (56%), Positives = 175/239 (73%) Frame = +1 Query: 46 MAKVPPKHSRDQLRDVEGLLNNLQDWELSFKEKDKKLRADTLGRGKLDNPSQSSYNDSSR 225 MA+ P KH RDQ +D +G N+LQDWELS K+KDKK++ P+ SS Sbjct: 1 MARSPSKHGRDQTQDFQGFFNDLQDWELSLKDKDKKIK---------QQPANSS------ 45 Query: 226 LSDNPRVDGRKTRDKTSSAYNTNTVRQHDHLKKYEELSQLASTVSSKEGVVDANSEKELG 405 NP +S + + ++D KKY + L+S++ E ++D++SEKE G Sbjct: 46 ---NP----------SSETFRPSGSGKYDFAKKYRSIRDLSSSLIG-ESLLDSSSEKEQG 91 Query: 406 NEFFKQKKFNEAIDCYSRSIALSPTAVAYANRAMAYIKVKRFQEAENDCTEALNLDDRYI 585 NEFFKQKKFNEAIDCYSRSIALSP AV YANRAMAY+K+KR++EAE DCTEALNLDDRYI Sbjct: 92 NEFFKQKKFNEAIDCYSRSIALSPNAVTYANRAMAYLKIKRYREAEVDCTEALNLDDRYI 151 Query: 586 KAYSRRSTARKELGKLKQSIDDVEFALRLEPQNQDIKKQHAECKALLEKEIMKKASGAL 762 KAYSRR+TARKELG +K++ +D EFALRLEP++Q++KKQ+A+ K+LLEKEI++KA+GA+ Sbjct: 152 KAYSRRATARKELGMIKEAKEDAEFALRLEPESQELKKQYADIKSLLEKEIIEKATGAM 210
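
Note on reading the summary values in this report programmatically. The following is a minimal Python sketch, not part of BLAST itself; the names parse_evalue, parse_hit_row and percent_identity are hypothetical helpers introduced here for illustration. It assumes one summary row per line in the "description, bit score, E-value" layout used above, and it assumes the legacy convention seen in this report where the smallest E-values are printed without a mantissa (for example "e-104"). The identity percentage is floored rather than rounded, which is an assumption chosen because it reproduces the percentages printed in this report (165/291 is shown as 56%).

```python
import re


def parse_evalue(text: str) -> float:
    """Convert an E-value string from the report into a float.

    Values such as "e-104" have no mantissa and are rejected by float(),
    so a leading "1" is added first (assumption: a bare exponent means 1e-N).
    """
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


# One summary row: free-text description, integer bit score, E-value.
# The description is matched lazily so the last two fields are taken
# from the right-hand end of the line.
HIT_ROW = re.compile(
    r"^(?P<desc>\S.*?)\s+(?P<bits>\d+)\s+"
    r"(?P<evalue>\d*\.?\d*e-\d+|\d+\.?\d*)\s*$"
)


def parse_hit_row(line: str):
    """Split one summary row into (description, bit score, E-value)."""
    match = HIT_ROW.match(line)
    if match is None:
        raise ValueError(f"not a hit summary row: {line!r}")
    return (
        match.group("desc"),
        int(match.group("bits")),
        parse_evalue(match.group("evalue")),
    )


def percent_identity(identities: str) -> int:
    """Turn an "Identities" fraction such as "201/302" into a whole percent.

    Integer (floor) division is an assumption that matches the
    percentages shown in this report.
    """
    matched, aligned = identities.split("/")
    return (100 * int(matched)) // int(aligned)
```

For example, parse_hit_row can be applied to the first summary row of this report, and percent_identity to the "Identities" field of any alignment header; rows whose description has been truncated with "..." are handled the same way as complete ones.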