BLASTX nr result
ID: Cocculus22_contig00014861
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cocculus22_contig00014861 (1404 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI39598.3| unnamed protein product [Vitis vinifera] 478 e-132 ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated prot... 476 e-132 ref|XP_007031161.1| Tetratricopeptide repeat (TPR)-like superfam... 466 e-128 ref|XP_007031159.1| Tetratricopeptide repeat (TPR)-like superfam... 465 e-128 ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated prot... 464 e-128 ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated prot... 461 e-127 ref|XP_007031158.1| Tetratricopeptide repeat-like superfamily pr... 456 e-126 ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citr... 454 e-125 ref|XP_007205290.1| hypothetical protein PRUPE_ppa006661mg [Prun... 454 e-125 ref|XP_004144746.1| PREDICTED: RNA polymerase II-associated prot... 443 e-122 gb|EXB53029.1| RNA polymerase II-associated protein 3 [Morus not... 439 e-120 ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated prot... 431 e-118 ref|NP_176039.2| carboxylate clamp-tetratricopeptide repeat prot... 430 e-118 ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated prot... 429 e-117 ref|XP_007145004.1| hypothetical protein PHAVU_007G201600g [Phas... 426 e-117 ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated prot... 426 e-116 ref|XP_006339935.1| PREDICTED: RNA polymerase II-associated prot... 426 e-116 ref|XP_006588434.1| PREDICTED: uncharacterized protein LOC100784... 418 e-114 ref|XP_004495650.1| PREDICTED: RNA polymerase II-associated prot... 417 e-114 ref|XP_007031162.1| Tetratricopeptide repeat (TPR)-like superfam... 416 e-113 >emb|CBI39598.3| unnamed protein product [Vitis vinifera] Length = 1097 Score = 478 bits (1229), Expect = e-132 Identities = 267/474 (56%), Positives = 326/474 (68%), Gaps = 48/474 (10%) Frame = +2 Query: 50 SKSMA-RVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEK--------KKT 202 S SMA R P+KH RDQ LDFQGFL DLQDWELSLKEKDKK KAQ+ E+ K + Sbjct: 621 SVSMATRFPSKHARDQALDFQGFLTDLQDWELSLKEKDKKMKAQAEEKDVPTARGNVKHS 680 Query: 203 EIISEAKGVARKAPSV-------DYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFK 361 +S + GV+ + +YSR+ +I ISS F EES PDAA+EKE GNEYFK Sbjct: 681 SKLSSSPGVSLRLGQSRSDTRQHEYSRNHDAISRISSSFMTEESLPDAASEKELGNEYFK 740 Query: 362 QKKFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSR 541 Q+KFKEAIDCYSRSIAL PTAVA+ANRAMAY+K+KRF EAE DC EALNLDDRY KAYSR Sbjct: 741 QRKFKEAIDCYSRSIALLPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDDRYIKAYSR 800 Query: 542 RATARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVG 721 RATARKELGK KE+ ED+EFALRLEPQNQE+KKQY + K+LY+KE+L KAS +K S G Sbjct: 801 RATARKELGKFKEATEDAEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASGALKSSVQG 860 Query: 722 EQSVG--------STTG-KAVSIKEMGSGS----------TNAKRKIGEQELD-----QS 829 Q VG T G +++S G+G N + E E Sbjct: 861 LQKVGKSVVEVNADTQGVRSISSSSQGAGEAAIQDRFMVPANTSTSMEETENKGTGNRSK 920 Query: 830 QDGQFNQVTQN--------GHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEF 985 ++G QN H ++E+K S+Q+LA RAAS+A EAAKNI AP +AY+F Sbjct: 921 ENGYLENAVQNSGLEDVMSNHKTGQREMKSSLQELASRAASRAMVEAAKNITAPNSAYQF 980 Query: 986 ELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVEL 1165 E+SW+GL GD ALQA LKAI P+ LP++F++AL+AP+LIDI+KCIATFFV E +LAV+ Sbjct: 981 EVSWRGLLGDHALQASYLKAISPNALPQIFKNALSAPILIDIIKCIATFFVTEMDLAVKF 1040 Query: 1166 LDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 1327 LDN+TK+SRFDMI MCLS DK D+ K W+EVF ++A P A+TL KLR +YC Sbjct: 1041 LDNLTKISRFDMIIMCLSSTDKTDLLKIWDEVFCNKATPSGYADTLGKLRPRYC 1094 >ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated protein 3-like [Vitis vinifera] Length = 474 Score = 476 bits (1226), Expect = e-132 Identities = 263/468 (56%), Positives = 322/468 (68%), Gaps = 47/468 (10%) Frame = +2 Query: 65 RVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEK--------KKTEIISEA 220 R P+KH RDQ LDFQGFL DLQDWELSLKEKDKK KAQ+ E+ K + +S + Sbjct: 4 RFPSKHARDQALDFQGFLTDLQDWELSLKEKDKKMKAQAEEKDVPTARGNVKHSSKLSSS 63 Query: 221 KGVARKAPSV-------DYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKE 379 GV+ + +YSR+ +I ISS F EES PDAA+EKE GNEYFKQ+KFKE Sbjct: 64 PGVSLRLGQSRSDTRQHEYSRNHDAISRISSSFMTEESLPDAASEKELGNEYFKQRKFKE 123 Query: 380 AIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARK 559 AIDCYSRSIAL PTAVA+ANRAMAY+K+KRF EAE DC EALNLDDRY KAYSRRATARK Sbjct: 124 AIDCYSRSIALLPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDDRYIKAYSRRATARK 183 Query: 560 ELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVG- 736 ELGK KE+ ED+EFALRLEPQNQE+KKQY + K+LY+KE+L KAS +K S G Q VG Sbjct: 184 ELGKFKEATEDAEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASGALKSSVQGLQKVGK 243 Query: 737 -------STTG-KAVSIKEMGSGS----------TNAKRKIGEQELD-----QSQDGQFN 847 T G +++S G+G N + E E ++G Sbjct: 244 SVVEVNADTQGVRSISSSSQGAGEAAIQDRFMVPANTSTSMEETENKGTGNRSKENGYLE 303 Query: 848 QVTQN--------GHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKG 1003 QN H ++E+K S+Q+LA RAAS+A EAAKNI AP +AY+FE+SW+G Sbjct: 304 NAVQNSGLEDVMSNHKTGQREMKSSLQELASRAASRAMVEAAKNITAPNSAYQFEVSWRG 363 Query: 1004 LSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTK 1183 L GD ALQA LKAI P+ LP++F++AL+AP+LIDI+KCIATFFV E +LAV+ LDN+TK Sbjct: 364 LLGDHALQASYLKAISPNALPQIFKNALSAPILIDIIKCIATFFVTEMDLAVKFLDNLTK 423 Query: 1184 VSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 1327 +SRFDMI MCLS DK D+ K W+EVF ++A P A+TL KLR +YC Sbjct: 424 ISRFDMIIMCLSSTDKTDLLKIWDEVFCNKATPSGYADTLGKLRPRYC 471 >ref|XP_007031161.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 4 [Theobroma cacao] gi|508719766|gb|EOY11663.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 4 [Theobroma cacao] Length = 421 Score = 466 bits (1198), Expect = e-128 Identities = 245/425 (57%), Positives = 307/425 (72%), Gaps = 7/425 (1%) Frame = +2 Query: 74 NKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPS-- 247 +KH+RDQ LDFQGFLN+LQDWELSLKEKDK K+Q++++++ T G + S Sbjct: 3 SKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLTNEKGRPTGKSSLIDSST 62 Query: 248 -----VDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIAL 412 DY ++ +SS F EE+ PDAA+EKE GNEYFKQKKFKEAIDCYSRSI L Sbjct: 63 TSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDCYSRSIGL 122 Query: 413 SPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFED 592 SPTAVA ANRAMAYLK+K+F+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES ED Sbjct: 123 SPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIED 182 Query: 593 SEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEM 772 +EFALRLEP NQE+KKQ+ + K+LY+KE+L KAS +++KS Q VG + KE Sbjct: 183 TEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS-----ETKEN 237 Query: 773 GSGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQDLALRAASQAKTEAAK 952 G G +A + Q Q T+ + K E+K S+Q+LA AA++A EAAK Sbjct: 238 GLGMHSASNSTQRTGVATVQGYQ----TKKNNRTRKPELKASVQELASLAATRAMAEAAK 293 Query: 953 NIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATF 1132 NI P TAY+FE+SW+ LSGDRALQA LLK PS LP++F++AL+A +L+DI+KC+ATF Sbjct: 294 NISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKCVATF 353 Query: 1133 FVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKL 1312 F EE +LA++ L+N+TKV RFDM+ MCLS +KAD+ K W++VF +EA PIE AE L L Sbjct: 354 FREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEILDNL 413 Query: 1313 RVKYC 1327 R YC Sbjct: 414 RSVYC 418 >ref|XP_007031159.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 [Theobroma cacao] gi|508719764|gb|EOY11661.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 [Theobroma cacao] Length = 422 Score = 465 bits (1196), Expect = e-128 Identities = 245/429 (57%), Positives = 311/429 (72%), Gaps = 11/429 (2%) Frame = +2 Query: 74 NKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKK-----------KTEIISEA 220 +KH+RDQ LDFQGFLN+LQDWELSLKEKDK K+Q++++++ K+ +I + Sbjct: 3 SKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLKTNEKGRPTGKSSLIDSS 62 Query: 221 KGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSR 400 +R+ DY ++ +SS F EE+ PDAA+EKE GNEYFKQKKFKEAIDCYSR Sbjct: 63 TTSSRQ---YDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDCYSR 119 Query: 401 SIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKE 580 SI LSPTAVA ANRAMAYLK+K+F+EAE DCTEALNLDDRY KAYSRRATARKELGKLKE Sbjct: 120 SIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKE 179 Query: 581 SFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVS 760 S ED+EFALRLEP NQE+KKQ+ + K+LY+KE+L KAS +++KS Q VG + Sbjct: 180 SIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKS-----E 234 Query: 761 IKEMGSGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQDLALRAASQAKT 940 KE G G +A + Q Q T+ + K E+K S+Q+LA AA++A Sbjct: 235 TKENGLGMHSASNSTQRTGVATVQGYQ----TKKNNRTRKPELKASVQELASLAATRAMA 290 Query: 941 EAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKC 1120 EAAKNI P TAY+FE+SW+ LSGDRALQA LLK PS LP++F++AL+A +L+DI+KC Sbjct: 291 EAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKC 350 Query: 1121 IATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAET 1300 +ATFF EE +LA++ L+N+TKV RFDM+ MCLS +KAD+ K W++VF +EA PIE AE Sbjct: 351 VATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEI 410 Query: 1301 LSKLRVKYC 1327 L LR YC Sbjct: 411 LDNLRSVYC 419 >ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated protein 3-like [Fragaria vesca subsp. vesca] Length = 407 Score = 464 bits (1195), Expect = e-128 Identities = 248/427 (58%), Positives = 310/427 (72%), Gaps = 4/427 (0%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQS-NEEKKKTEIISEAKGVAR 235 MAR P+KH RDQ LDFQGFL+DLQDWELSLK+KDKK + Q N+E K+ R Sbjct: 1 MARAPSKHGRDQALDFQGFLSDLQDWELSLKDKDKKMRPQQPNKEAPKS----------R 50 Query: 236 KAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALS 415 + YS + + + +SS F +E+ PDAA+EK+ GNEYFKQKKFKEAIDCYSRSIAL+ Sbjct: 51 DFGTSSYSTNYEPMNTVSSSFTSEDGLPDAASEKDLGNEYFKQKKFKEAIDCYSRSIALT 110 Query: 416 PTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDS 595 PTAVAFANRAM+Y+K+KRF+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES ED+ Sbjct: 111 PTAVAFANRAMSYIKIKRFQEAENDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDA 170 Query: 596 EFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSV--GSTTGKAVSIKE 769 EFALRLEP NQE+KKQY + K+LY+K +L K S +K S +Q V TT SI+ Sbjct: 171 EFALRLEPHNQEIKKQYAEAKSLYEKGILQKVSGAIKISEQDKQKVEKSGTTVNGHSIQP 230 Query: 770 MGSGSTNAK-RKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQDLALRAASQAKTEA 946 + S + + +G+ ++ NG KQ K S+Q+LA RAAS+AK A Sbjct: 231 VSSTTQRTETTAVGDHT---------KKINTNG----KQASKLSVQELASRAASRAKALA 277 Query: 947 AKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIA 1126 A+NI P +AY+FE SW+GLSGDRALQA+LLKAI PS LP++F++ALT +L+DI+KC+ Sbjct: 278 AENITPPSSAYQFEASWRGLSGDRALQAKLLKAISPSALPQIFKNALTVHILVDILKCVT 337 Query: 1127 TFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLS 1306 TFF++E +LAV +L+N+TKV RFD + M LS DKAD+ K W+EVF +EA PIE AE L Sbjct: 338 TFFIDEMDLAVSVLENLTKVPRFDTLIMFLSSNDKADLAKIWDEVFYNEATPIEFAEKLD 397 Query: 1307 KLRVKYC 1327 LR KYC Sbjct: 398 NLRAKYC 404 >ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated protein 3-like [Citrus sinensis] Length = 438 Score = 461 bits (1187), Expect = e-127 Identities = 248/435 (57%), Positives = 305/435 (70%), Gaps = 18/435 (4%) Frame = +2 Query: 77 KHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPSVD- 253 KHNRDQ LDFQGFLNDLQDW+LSL EKDKK K +++ K + S K + +PS + Sbjct: 3 KHNRDQALDFQGFLNDLQDWDLSLNEKDKKMKHKASS--KDNLVSSSLKSAKKPSPSGNS 60 Query: 254 YSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSPTAVAF 433 YSR+ + ISS EES+PDA +EKE GNE FKQKKFKEAIDCYSRSIALSPTAVA+ Sbjct: 61 YSRNYDPVSHISSSLMNEESTPDATSEKELGNECFKQKKFKEAIDCYSRSIALSPTAVAY 120 Query: 434 ANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFALRL 613 ANRAMAYLKL+RF+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES EDSEFALRL Sbjct: 121 ANRAMAYLKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFALRL 180 Query: 614 EPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEMGSG---- 781 EPQNQE+KKQ + K+LY+KE+ KAS+ ++K V +AV +G Sbjct: 181 EPQNQEIKKQLAEVKSLYEKEVFQKASKTLEKYGKSGMKVNGHEVRAVRNTTQKTGVAEI 240 Query: 782 -------STNAKRKIGEQELDQSQDGQFNQVT------QNGHGITKQEIKPSIQDLALRA 922 T K E + + +DG T + H K + S+Q+LA RA Sbjct: 241 QDLTISKKTENKNLRDESKTEGQRDGSGANATHISGLDKRNHRTKKAVLDASVQELATRA 300 Query: 923 ASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLL 1102 S+A EAAKNI PK+AYEFE+SW+G +GD ALQARLLKAI P+ LP++F++AL+A +L Sbjct: 301 TSRAVAEAAKNITPPKSAYEFEVSWRGFAGDHALQARLLKAISPNALPQIFKNALSASIL 360 Query: 1103 IDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVP 1282 IDIVK +ATFF E +LA++ L+ +T V RFD++ MCLS+ADKAD+ K W+E F +E+ P Sbjct: 361 IDIVKVVATFFTGEVDLAIKYLEYLTMVPRFDLVIMCLSLADKADLRKVWDETFCNESTP 420 Query: 1283 IECAETLSKLRVKYC 1327 IE AE L LR KYC Sbjct: 421 IEYAEILDNLRSKYC 435 >ref|XP_007031158.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508719763|gb|EOY11660.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] Length = 468 Score = 456 bits (1174), Expect = e-126 Identities = 248/464 (53%), Positives = 313/464 (67%), Gaps = 46/464 (9%) Frame = +2 Query: 74 NKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPS-- 247 +KH+RDQ LDFQGFLN+LQDWELSLKEKDK K+Q++++++ T G + S Sbjct: 3 SKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLTNEKGRPTGKSSLIDSST 62 Query: 248 -----VDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIAL 412 DY ++ +SS F EE+ PDAA+EKE GNEYFKQKKFKEAIDCYSRSI L Sbjct: 63 TSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDCYSRSIGL 122 Query: 413 SPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFED 592 SPTAVA ANRAMAYLK+K+F+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES ED Sbjct: 123 SPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIED 182 Query: 593 SEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEM 772 +EFALRLEP NQE+KKQ+ + K+LY+KE+L KAS +++KS Q VG + K + M Sbjct: 183 TEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKSETKENGL-GM 241 Query: 773 GSGSTNAKR---------KIGEQELDQSQDGQFNQVTQNGHG------------------ 871 S S + +R + E D+ + + VT G G Sbjct: 242 HSASNSTQRTGVATVQGYQTKVSEYDKQKKPEKGSVTSEGIGDRNTLAGSRKDGTQLDSG 301 Query: 872 ------------ITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGD 1015 K E+K S+Q+LA AA++A EAAKNI P TAY+FE+SW+ LSGD Sbjct: 302 IVGLESIKKNNRTRKPELKASVQELASLAATRAMAEAAKNISPPNTAYQFEVSWRALSGD 361 Query: 1016 RALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRF 1195 RALQA LLK PS LP++F++AL+A +L+DI+KC+ATFF EE +LA++ L+N+TKV RF Sbjct: 362 RALQAHLLKVTSPSALPQIFKNALSASMLVDIIKCVATFFREEVDLAIKYLENLTKVPRF 421 Query: 1196 DMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 1327 DM+ MCLS +KAD+ K W++VF +EA PIE AE L LR YC Sbjct: 422 DMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEILDNLRSVYC 465 >ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citrus clementina] gi|557535662|gb|ESR46780.1| hypothetical protein CICLE_v10003914mg [Citrus clementina] Length = 977 Score = 454 bits (1169), Expect = e-125 Identities = 248/434 (57%), Positives = 305/434 (70%), Gaps = 18/434 (4%) Frame = +2 Query: 80 HNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPSVD-Y 256 HNRDQ LDFQGFLNDLQDW+LSL EKDKK K +++ K + S K + +PS + Y Sbjct: 543 HNRDQALDFQGFLNDLQDWDLSLHEKDKKMKHKASS--KDNLVSSSLKSGEKPSPSGNSY 600 Query: 257 SRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 436 SR+ + ISS EES+PDA +EKE GNE FKQKKFKEAIDCYSRSIALSPTAVA+A Sbjct: 601 SRNYDPVSRISSSLMNEESTPDATSEKELGNECFKQKKFKEAIDCYSRSIALSPTAVAYA 660 Query: 437 NRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFALRLE 616 NRAMAYLKL+RF+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES EDSEFALRLE Sbjct: 661 NRAMAYLKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFALRLE 720 Query: 617 PQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAV--SIKEMG----- 775 PQNQE+KKQ + K+LY+KE+ KAS+ ++K V +AV +I++ G Sbjct: 721 PQNQEIKKQLAEVKSLYEKEVFQKASKTLEKYGKSGMKVNGHEVRAVRNTIQKTGVAEIQ 780 Query: 776 ----SGSTNAKRKIGEQELDQSQDGQFNQVT------QNGHGITKQEIKPSIQDLALRAA 925 S T K E + + +DG T + H K + S+Q+LA RA Sbjct: 781 DLTISKKTENKNLRDESKTEGQRDGSGANATHISGLDKRNHRTKKAVLDASVQELATRAT 840 Query: 926 SQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLI 1105 S+A EAAKNI PK+AYEFE+SW+G +GD ALQARLLKAI P+ LP++F++AL+A +LI Sbjct: 841 SRAVAEAAKNITPPKSAYEFEVSWRGFAGDHALQARLLKAISPNALPQIFKNALSASILI 900 Query: 1106 DIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPI 1285 DIVK +A FF E +LA++ L+ +T V RFD + MCLS+ADKAD+ K W+E F +E PI Sbjct: 901 DIVKVVAMFFPGEVDLAIKYLEYLTMVPRFDFVIMCLSLADKADLRKVWDETFCNELTPI 960 Query: 1286 ECAETLSKLRVKYC 1327 E AE L LR KYC Sbjct: 961 EYAEILDNLRSKYC 974 >ref|XP_007205290.1| hypothetical protein PRUPE_ppa006661mg [Prunus persica] gi|462400932|gb|EMJ06489.1| hypothetical protein PRUPE_ppa006661mg [Prunus persica] Length = 401 Score = 454 bits (1168), Expect = e-125 Identities = 245/424 (57%), Positives = 300/424 (70%), Gaps = 1/424 (0%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQ-SNEEKKKTEIISEAKGVAR 235 MAR PNKH RDQ LD WELSLK+KDKK + + S++EK KT + + G Sbjct: 1 MARAPNKHGRDQALD----------WELSLKDKDKKMRPKDSHQEKLKTRDLGTSSG--- 47 Query: 236 KAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALS 415 + DYSR+L SI +SS F +E+S PDAA+EKE GNEYFKQKKF+EAIDCYSRSIALS Sbjct: 48 ---NYDYSRNLDSINTMSSSFISEDSLPDAASEKELGNEYFKQKKFREAIDCYSRSIALS 104 Query: 416 PTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDS 595 P+AVA+ANRAMAY+K+K F+EAE DCTEALNLDDRY KAYSRRATARKELGKLKES ED+ Sbjct: 105 PSAVAYANRAMAYIKIKSFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDA 164 Query: 596 EFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEMG 775 EFALRLEPQNQE+KKQYT+ K+LYDK +L KAS K S + VG + K G Sbjct: 165 EFALRLEPQNQEIKKQYTEAKSLYDKTILQKASGAQKNSVQEMRKVGK-----LDTKVNG 219 Query: 776 SGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKN 955 A E+ QD T+ + E+K S+Q+LA RAAS+ K AA+ Sbjct: 220 QSIQPASSSAQITEMTAVQDH-----TKRNNTTRNPEVKASVQELASRAASRVKAVAAEK 274 Query: 956 IKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFF 1135 IK P +AY+FE+SW+G SGD A Q LLKAI PS LP++F++ALT P+L+DI+KC+ATFF Sbjct: 275 IKPPNSAYQFEVSWRGFSGDNARQTSLLKAISPSALPQIFKNALTVPILLDIIKCVATFF 334 Query: 1136 VEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLR 1315 VEE +LAV L+N+T+V RFD + M LS +D AD+ K W+EVF +EA PIE AE L LR Sbjct: 335 VEEMDLAVNYLENLTRVPRFDTLIMFLSSSDNADLVKIWDEVFDNEATPIEYAEKLDNLR 394 Query: 1316 VKYC 1327 KYC Sbjct: 395 TKYC 398 >ref|XP_004144746.1| PREDICTED: RNA polymerase II-associated protein 3-like [Cucumis sativus] gi|449517788|ref|XP_004165926.1| PREDICTED: RNA polymerase II-associated protein 3-like [Cucumis sativus] Length = 458 Score = 443 bits (1140), Expect = e-122 Identities = 246/460 (53%), Positives = 306/460 (66%), Gaps = 38/460 (8%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARK 238 MA KH RDQ LDFQGFLNDLQDWE+S K KDKK K Q+ ++K E + K Sbjct: 1 MADSSAKHGRDQLLDFQGFLNDLQDWEVSFKGKDKKLKPQAIGKEK------EDRRQTEK 54 Query: 239 APSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSP 418 A + DY + ++ +S F E S DAA+EKE GNEYFKQKKFKEAIDCYSRSIALSP Sbjct: 55 ASAADYMKQYDAVNRLSRNFQTEGSFVDAASEKEQGNEYFKQKKFKEAIDCYSRSIALSP 114 Query: 419 TAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSE 598 TAVAFANRAMAYLK++RF+EAE DCTEALNLDDRY KAYSRRATARKELGK KE+ ED+E Sbjct: 115 TAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAE 174 Query: 599 FALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKA-------V 757 FA RLEP NQE+KKQ+ D +A K +L KAS + S ++++ + A V Sbjct: 175 FAQRLEPNNQEIKKQHADLRAFVGKAILEKASGASRSSTKNKKTLKKSDSDAKIQDIPPV 234 Query: 758 SIKEMGSGSTNAKRKIGEQ----------ELDQSQDGQ------FNQVTQNG-------- 865 S +G A+ ++ E L++S+D +V NG Sbjct: 235 SSSTSRTGLLAARERVEENGGGNAVKTSARLEESEDTSSGAEITSKKVATNGFHKDSSSY 294 Query: 866 -------HGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRAL 1024 H KQE+K S+ +LA +AAS++ EAAKNI AP TAY+FE+SW+G SGD+AL Sbjct: 295 LSALERDHLPRKQELKASVYELASQAASRSMVEAAKNIIAPTTAYQFEVSWRGFSGDQAL 354 Query: 1025 QARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMI 1204 QARLLK I P+ LP++F+DALTAP+LIDIVKC+ATFF+EE LA+ L+N+ V RF ++ Sbjct: 355 QARLLKTISPAKLPQIFKDALTAPILIDIVKCVATFFIEEPALAISFLENLVNVPRFSIL 414 Query: 1205 SMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKY 1324 MCLS ++K D+ K W+EVF EAVPIE AE L LR KY Sbjct: 415 MMCLSSSEKFDLLKIWDEVFCDEAVPIEYAEMLDSLRSKY 454 >gb|EXB53029.1| RNA polymerase II-associated protein 3 [Morus notabilis] Length = 450 Score = 439 bits (1130), Expect = e-120 Identities = 245/458 (53%), Positives = 308/458 (67%), Gaps = 35/458 (7%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSL--KEKDKKFKAQSNEEK---KKTEIISEAK 223 MAR P KH RD+ L FQGFLNDLQDWE SL K+KDKK KAQ++++ ++ I EA Sbjct: 1 MARAPTKHGRDEALAFQGFLNDLQDWEFSLEDKDKDKKMKAQASDKGISVSSSKKIGEA- 59 Query: 224 GVARKAPS-------------VDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQ 364 G RKA DYSR +I +SS +E+S DAA+EKE GNEYFKQ Sbjct: 60 GKDRKAAGKSSTFEYLSSSMPYDYSRKYDAINQVSSSSISEDSYTDAASEKELGNEYFKQ 119 Query: 365 KKFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKR-----------------FEEAEVDC 493 KKFKEAIDCYSRSIALS TAVA+ANRAMAYLKLKR F+EAE DC Sbjct: 120 KKFKEAIDCYSRSIALSSTAVAYANRAMAYLKLKRQLLPYLIFFCKSIFLIRFQEAEGDC 179 Query: 494 TEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDK 673 TEALN+DDRY KAYSRRATARKELGKLKE ED+EFALRLEP NQE+KKQY++ K+L +K Sbjct: 180 TEALNMDDRYIKAYSRRATARKELGKLKECIEDAEFALRLEPNNQEIKKQYSEAKSLCEK 239 Query: 674 ELLAKASEMVKKSRVGEQSVGSTTGKAVSIKEMGSGSTNAKRKIGEQELDQSQDGQFNQV 853 +L KAS ++ + Q + K ++ G + Q+ + + + ++ Sbjct: 240 VILQKASVALENT---VQKMQKAEKKDTKVQNNGIQPVES----ATQKTEAAVAEDYTKI 292 Query: 854 TQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQAR 1033 Q KQE K S+Q+LA RAAS+A AKNI++P +AY+FE+SW+GLSGDRALQA Sbjct: 293 NQTAK---KQEPKASVQELASRAASRAMNGTAKNIRSPTSAYQFEVSWRGLSGDRALQAS 349 Query: 1034 LLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMC 1213 LLK + P LP++F+++LT P+L+DIVKCIATFF+EE ++ V L+N+TKV RFD++ MC Sbjct: 350 LLKTVSPGALPQIFKNSLTVPILVDIVKCIATFFIEEMDVTVTFLENLTKVPRFDILVMC 409 Query: 1214 LSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 1327 L+ D+AD+ K W EVF EA PIE AE L LR KYC Sbjct: 410 LTSKDRADLVKIWNEVFCKEATPIEHAEKLDNLRSKYC 447 >ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X1 [Solanum tuberosum] Length = 468 Score = 431 bits (1107), Expect = e-118 Identities = 238/467 (50%), Positives = 309/467 (66%), Gaps = 45/467 (9%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEI---------- 208 MA+VP+KH+RDQ D QG LN+LQDWELSLK KDKK K+Q++ ++ E Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60 Query: 209 -------ISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQK 367 + ++ + A +YS++ I +SS+ +EES+ +A +EKE GNE FKQK Sbjct: 61 PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120 Query: 368 KFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRA 547 KF EAIDCYSRSIALSPTAV++ANRAMAYLK+KRF+EAE DCTEALNLDDRY KAYSRR+ Sbjct: 121 KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180 Query: 548 TARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQ 727 T+RKELGKLKES ED+EFALRLEPQN E+KKQY + KALY+KE+ + S S Q Sbjct: 181 TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKEIRKRVSGATDVSAQRAQ 240 Query: 728 SVGSTTGKAVSIKEMGSGSTNAKR--KIGEQELDQSQDG---------QFNQ-------- 850 G T I+ + S S I +E ++ G Q N Sbjct: 241 KSGKTIKSGPVIQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVEDTHMQINNKDSDASPT 300 Query: 851 ---------VTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKG 1003 + H I+KQE++ S+Q+LA RAA AKTEAAKNI AP +AY+FE+SW+G Sbjct: 301 VPTLNPAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWRG 360 Query: 1004 LSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTK 1183 LSGDR LQ +LLK P+ LP++F++AL+AP+L+DIV+C+ATFF+E+ LA+ L+++TK Sbjct: 361 LSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCVATFFIEDMNLAIRYLEDLTK 420 Query: 1184 VSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKY 1324 V RFDMI MCLS DK+++ K WEE+F EA E + TL LRV Y Sbjct: 421 VPRFDMIIMCLSSTDKSELLKIWEEIFCKEAE--EHSATLGALRVPY 465 >ref|NP_176039.2| carboxylate clamp-tetratricopeptide repeat protein [Arabidopsis thaliana] gi|53828529|gb|AAU94374.1| At1g56440 [Arabidopsis thaliana] gi|59958350|gb|AAX12885.1| At1g56440 [Arabidopsis thaliana] gi|110743110|dbj|BAE99447.1| hypothetical protein [Arabidopsis thaliana] gi|332195274|gb|AEE33395.1| carboxylate clamp-tetratricopeptide repeat [Arabidopsis thaliana] Length = 476 Score = 430 bits (1105), Expect = e-118 Identities = 233/478 (48%), Positives = 316/478 (66%), Gaps = 55/478 (11%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARK 238 MAR P+KH RDQ DFQGF NDLQDWELSLK+KDKK K Q + G + Sbjct: 1 MARSPSKHGRDQTQDFQGFFNDLQDWELSLKDKDKKIKQQPANSSNPSSETFRPSGSGK- 59 Query: 239 APSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSP 418 D+++ +SI D+SS E S D+++EKE GNE+FKQKKF EAIDCYSRSIALSP Sbjct: 60 ---YDFAKKYRSIRDLSSSLIGE-SLLDSSSEKEQGNEFFKQKKFNEAIDCYSRSIALSP 115 Query: 419 TAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSE 598 AV +ANRAMAYLK+KR+ EAEVDCTEALNLDDRY KAYSRRATARKELG +KE+ ED+E Sbjct: 116 NAVTYANRAMAYLKIKRYREAEVDCTEALNLDDRYIKAYSRRATARKELGMIKEAKEDAE 175 Query: 599 FALRLEPQNQELKKQYTDTKALYDKELLAKAS--------EMVKKSRVGEQ--------- 727 FALRLEP++QELKKQY D K+L +KE++ KA+ E++K S + ++ Sbjct: 176 FALRLEPESQELKKQYADIKSLLEKEIIEKATGAMQSTAQELLKTSGLDKKIQKPKTEMT 235 Query: 728 ----SVGSTTGKAVSIKEMGSGSTNAKRKIGE-QELDQSQDGQF-----------NQVTQ 859 ++ + T + + +GS ++ K+ I Q ++S++G +VT Sbjct: 236 SKPVTLVAKTNRDIVQPVLGSNESSGKKLIENIQPEEKSKEGSMKIPAITEILDSKKVTP 295 Query: 860 NGHGITKQ----------------------EIKPSIQDLALRAASQAKTEAAKNIKAPKT 973 K+ E+KPS+Q+LA AAS A TEA+KNIK PK+ Sbjct: 296 GSQSYEKEAKPSDRNGTQPSGPENQVSKQLELKPSVQELAAHAASLAMTEASKNIKTPKS 355 Query: 974 AYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETEL 1153 AYEFE SW+ SGD AL+++LLK PS+LP++F++ALT+P+L+DI+KC+A+FF E+ +L Sbjct: 356 AYEFENSWRSFSGDSALRSQLLKVTTPSSLPQIFKNALTSPVLVDIIKCVASFFTEDMDL 415 Query: 1154 AVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYC 1327 AV+ ++N+TKV RF+M+ MCL+ +K ++ K WE+VF ++A P+E AE L KLR +YC Sbjct: 416 AVKYIENLTKVPRFNMLVMCLTSTEKNELLKIWEDVFCNKATPMEYAEVLDKLRSRYC 473 >ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated protein 3-like [Solanum lycopersicum] Length = 470 Score = 429 bits (1103), Expect = e-117 Identities = 235/468 (50%), Positives = 309/468 (66%), Gaps = 46/468 (9%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQS------------------N 184 MARVP+ H+RDQ D QG N+LQDWEL+LK KDKK K+Q+ + Sbjct: 1 MARVPSNHSRDQFQDMQGLFNNLQDWELALKGKDKKMKSQAGGKETLKEDWSRTSEPLTS 60 Query: 185 EEKKKTEIISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQ 364 + T+ + ++ + A YS++ I +SS+ +EES+ +A +EKE GNE FKQ Sbjct: 61 PQANGTQQVGKSTSIRNAAGPYSYSKNYNPISHLSSELISEESNINANSEKELGNECFKQ 120 Query: 365 KKFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRR 544 KKF EAIDCYSRSIALSPTAV++ANRAMAYLK+KRF+EAE DCTEALNLDDRY KAYSRR Sbjct: 121 KKFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRR 180 Query: 545 ATARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGE 724 +T+RKELGKLKES ED+EFAL LEP+N E+KKQY + KALY+KE+L + S S G Sbjct: 181 STSRKELGKLKESIEDAEFALWLEPRNPEIKKQYGEVKALYEKEILKRVSGATDVSAQGP 240 Query: 725 QSVGSTTGKAVSIKEMGSGS------------TNAKRKIGEQELDQSQDGQFNQ------ 850 Q G T I+ + S S N + +G +++ + N+ Sbjct: 241 QKSGKTIKIGPVIQSVSSSSQKVAEVRTIPAKENNRDVLGTAKVEDTHMQISNKDSDASP 300 Query: 851 ----------VTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWK 1000 + H I+KQE++ S+Q+LA RAA AKTEAAKNI AP +AY+FE+SW+ Sbjct: 301 TVPTLNLAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWR 360 Query: 1001 GLSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVT 1180 GLSGDR LQ +LLK P+ LP++F++AL+AP+L+DIV+CIATFF+E+ LA+ L+++T Sbjct: 361 GLSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCIATFFIEDMNLAIRYLEDLT 420 Query: 1181 KVSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKY 1324 KV RFDMI MCLS ADK+++ K WEE+F V E + TL LRV Y Sbjct: 421 KVPRFDMIIMCLSSADKSELLKIWEEIFCK--VAEEHSATLGALRVSY 466 >ref|XP_007145004.1| hypothetical protein PHAVU_007G201600g [Phaseolus vulgaris] gi|561018194|gb|ESW16998.1| hypothetical protein PHAVU_007G201600g [Phaseolus vulgaris] Length = 465 Score = 426 bits (1096), Expect = e-117 Identities = 238/464 (51%), Positives = 306/464 (65%), Gaps = 52/464 (11%) Frame = +2 Query: 98 LDFQGFLNDLQDWELSLKEKDKKFKAQSNEE--KKKTEIISEAKGV--ARKAPSVDYSRS 265 +DFQGFLNDLQDWELS K+K + K+Q + K + ++ + GV A KA ++ + R+ Sbjct: 1 MDFQGFLNDLQDWELSRKDKTQTLKSQKENQFTKASSSRLTGSVGVEKASKADAISFDRA 60 Query: 266 LKSIG--DISS---------KFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIAL 412 S G D+S F E PDAA+EK+ GNE+FKQKKFKEA DCYSRSIAL Sbjct: 61 RNSQGLYDLSKINDPLNRLHGSFVPEDVPDAASEKDLGNEFFKQKKFKEARDCYSRSIAL 120 Query: 413 SPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFED 592 SPTAVA+ANRAMA +KL+RF+EAE DCTEAL+LDDRY KAYSRRATARKELGK+KES ED Sbjct: 121 SPTAVAYANRAMANIKLRRFQEAEDDCTEALDLDDRYIKAYSRRATARKELGKIKESMED 180 Query: 593 SEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQSVGSTTGKA------ 754 +EFALRLEP NQE+KKQY D K+LY+K++L KAS ++++ G VG + K Sbjct: 181 AEFALRLEPNNQEIKKQYADAKSLYEKDILHKASGALRRTVQGTNKVGKSDEKVNGGSIH 240 Query: 755 -------------VSIKEMGSGSTNAKRKIGEQELD----------QSQDGQ-------- 841 V+ K++ K + +E+D Q+Q G Sbjct: 241 PISHGAQKSGPAEVNHKKVNEQQVPIKESLVTEEVDSRDTITRKRPQAQGGDDSKKSLSA 300 Query: 842 FNQVTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRA 1021 N + Q H I K E K S+Q LA RAAS+A EAAKNI P TAYEFE+SW+ LSGD A Sbjct: 301 SNSLEQRNHRIIKPEFKASVQQLASRAASRAMAEAAKNITPPTTAYEFEVSWRALSGDLA 360 Query: 1022 LQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDM 1201 LQARLLKAI P LPK+F++AL++ +L+DI+KC+++FF E+ +L V ++++ KV RFDM Sbjct: 361 LQARLLKAISPRELPKIFKNALSSTILVDIIKCLSSFFTEDMDLVVSYMEHLIKVPRFDM 420 Query: 1202 ISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYCSG 1333 I +CLS +K DI K W+EVF S+A PIE AE L LR K+C G Sbjct: 421 IVLCLSSTNKDDIRKIWDEVFRSKATPIEYAEILDNLRSKFCLG 464 >ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X2 [Solanum tuberosum] Length = 467 Score = 426 bits (1095), Expect = e-116 Identities = 236/467 (50%), Positives = 313/467 (67%), Gaps = 45/467 (9%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEI---------- 208 MA+VP+KH+RDQ D QG LN+LQDWELSLK KDKK K+Q++ ++ E Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60 Query: 209 -------ISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQK 367 + ++ + A +YS++ I +SS+ +EES+ +A +EKE GNE FKQK Sbjct: 61 PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120 Query: 368 KFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRA 547 KF EAIDCYSRSIALSPTAV++ANRAMAYLK+KRF+EAE DCTEALNLDDRY KAYSRR+ Sbjct: 121 KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180 Query: 548 TARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDK-------------ELLAK 688 T+RKELGKLKES ED+EFALRLEPQN E+KKQY + KALY+K + K Sbjct: 181 TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKIRKRVSGATDVSAQRAQK 240 Query: 689 ASEMVKKSRVGEQSVGSTTGKAVSIKEMGSGSTN------AKRKIGEQELDQSQDGQFNQ 850 + + +K V QSV S++ K + + + N AK + +++ Sbjct: 241 SGKTIKSGPV-IQSVSSSSQKMAEVWTIPAKENNRDVPGTAKVEDTHMQINNKDSDASPT 299 Query: 851 V---------TQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKG 1003 V + H I+KQE++ S+Q+LA RAA AKTEAAKNI AP +AY+FE+SW+G Sbjct: 300 VPTLNPAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVSWRG 359 Query: 1004 LSGDRALQARLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTK 1183 LSGDR LQ +LLK P+ LP++F++AL+AP+L+DIV+C+ATFF+E+ LA+ L+++TK Sbjct: 360 LSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCVATFFIEDMNLAIRYLEDLTK 419 Query: 1184 VSRFDMISMCLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKY 1324 V RFDMI MCLS DK+++ K WEE+F EA E + TL LRV Y Sbjct: 420 VPRFDMIIMCLSSTDKSELLKIWEEIFCKEAE--EHSATLGALRVPY 464 >ref|XP_006339935.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X4 [Solanum tuberosum] Length = 419 Score = 426 bits (1094), Expect = e-116 Identities = 229/439 (52%), Positives = 298/439 (67%), Gaps = 17/439 (3%) Frame = +2 Query: 59 MARVPNKHNRDQNLDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEI---------- 208 MA+VP+KH+RDQ D QG LN+LQDWELSLK KDKK K+Q++ ++ E Sbjct: 1 MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60 Query: 209 -------ISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQK 367 + ++ + A +YS++ I +SS+ +EES+ +A +EKE GNE FKQK Sbjct: 61 PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120 Query: 368 KFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRA 547 KF EAIDCYSRSIALSPTAV++ANRAMAYLK+KRF+EAE DCTEALNLDDRY KAYSRR+ Sbjct: 121 KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180 Query: 548 TARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMVKKSRVGEQ 727 T+RKELGKLKES ED+EFALRLEPQN E+KKQY + KALY+KE K Q Sbjct: 181 TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKENNRDVPGTAKVEDTHMQ 240 Query: 728 SVGSTTGKAVSIKEMGSGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQEIKPSIQD 907 + + ++ + AK+ H I+KQE++ S+Q+ Sbjct: 241 INNKDSDASPTVPTLNPAFGTAKKT---------------------HKISKQELEESVQE 279 Query: 908 LALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTLPKLFRDAL 1087 LA RAA AKTEAAKNI AP +AY+FE+SW+GLSGDR LQ +LLK P+ LP++F++AL Sbjct: 280 LAARAAGLAKTEAAKNIAAPNSAYQFEVSWRGLSGDRNLQTQLLKVTSPAMLPRIFKNAL 339 Query: 1088 TAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIGKTWEEVFS 1267 +AP+L+DIV+C+ATFF+E+ LA+ L+++TKV RFDMI MCLS DK+++ K WEE+F Sbjct: 340 SAPMLMDIVRCVATFFIEDMNLAIRYLEDLTKVPRFDMIIMCLSSTDKSELLKIWEEIFC 399 Query: 1268 SEAVPIECAETLSKLRVKY 1324 EA E + TL LRV Y Sbjct: 400 KEAE--EHSATLGALRVPY 416 >ref|XP_006588434.1| PREDICTED: uncharacterized protein LOC100784528 isoform X1 [Glycine max] Length = 459 Score = 418 bits (1075), Expect = e-114 Identities = 236/461 (51%), Positives = 301/461 (65%), Gaps = 49/461 (10%) Frame = +2 Query: 98 LDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKGVARKAPSVDYSRSLKSI 277 +DFQGFLNDLQDWELS K+K + K ++ + + E A K ++ + R+ S Sbjct: 1 MDFQGFLNDLQDWELSRKDKTRAQKENASSSQLTGSVGVEK---ASKGDTISFDRARNSP 57 Query: 278 GD-----ISSKF------FAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSPTA 424 G I+ F F E PDA +EK+ GNE+FKQKKFKEA DCYSRSIALSPTA Sbjct: 58 GQYDLSRINDPFNRVHSSFVPEDVPDAVSEKDLGNEFFKQKKFKEARDCYSRSIALSPTA 117 Query: 425 VAFANRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFA 604 VA+ANRAMA +KL+RF+EAE DCTEALNLDDRY KAYSRRATARKELGK+KES +D+ FA Sbjct: 118 VAYANRAMANIKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKIKESMDDAAFA 177 Query: 605 LRLEPQNQELKKQYTDTKALYDKELLAKAS-------EMVKKSRVGEQSV--GSTTGKAV 757 LRLEP NQE+KKQY D K+LY+K++L KAS + +KS+ E+ + GS + Sbjct: 178 LRLEPNNQEIKKQYADAKSLYEKDILQKASGALRSTVQGTQKSQKSEEKINGGSIQPISH 237 Query: 758 SIKEMGSGSTNAKRKIGEQEL---------------------DQSQDGQ--------FNQ 850 S ++ G N +K EQ++ QSQ G N Sbjct: 238 STQKSGLAEVNHHKKDNEQQILVKESLLTEDVDSRETKARSRPQSQGGDGSKEGLSASNS 297 Query: 851 VTQNGHGITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQA 1030 + Q H ITK E+K S+Q LA RAAS+ EAAKN+ P TAY+FE+SW+ SGD ALQA Sbjct: 298 LEQRNHSITKLEMKASVQQLASRAASRVVAEAAKNVTPPTTAYQFEVSWRAFSGDLALQA 357 Query: 1031 RLLKAIPPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISM 1210 RLLKAI P LPK+F++AL++ +LI+I+KC+A+FF E+ +L V L+++TKV RFD+I M Sbjct: 358 RLLKAISPHELPKIFKNALSSAILIEIIKCLASFFTEDMDLVVSYLEHLTKVPRFDVIVM 417 Query: 1211 CLSMADKADIGKTWEEVFSSEAVPIECAETLSKLRVKYCSG 1333 CLS +K DI K W+EVFSSEA PIE AE L LR K+ G Sbjct: 418 CLSSTNKDDIRKIWDEVFSSEATPIEYAEILDNLRSKFGLG 458 >ref|XP_004495650.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X4 [Cicer arietinum] Length = 454 Score = 417 bits (1073), Expect = e-114 Identities = 231/455 (50%), Positives = 300/455 (65%), Gaps = 43/455 (9%) Frame = +2 Query: 98 LDFQGFLNDLQDWELSLKEKDKKFKAQSNEEKKKTEIISEAKG-------VARKAPSVDY 256 +DFQGFLNDLQDWE+S K K K K+ + + + ++G + A D+ Sbjct: 1 MDFQGFLNDLQDWEISTKNKAPKTKSHKENSGRSVGVENGSRGDTISFDHAKKSAAQYDF 60 Query: 257 SRSLKSIGDISSKFFAEESSPDAATEKEHGNEYFKQKKFKEAIDCYSRSIALSPTAVAFA 436 SR+ + ++S F A E PDAA+EK+ GNE+FKQKKFKEAIDCYSRSIALSPTAVA+A Sbjct: 61 SRNNDLLSRVTSSF-ASEDVPDAASEKDLGNEFFKQKKFKEAIDCYSRSIALSPTAVAYA 119 Query: 437 NRAMAYLKLKRFEEAEVDCTEALNLDDRYTKAYSRRATARKELGKLKESFEDSEFALRLE 616 NRAMA +KL+RF+EAE DCTEALNLDDRY KAYSRRATARKELGK KES ED+EFALRLE Sbjct: 120 NRAMARIKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKNKESMEDAEFALRLE 179 Query: 617 PQNQELKKQYTDTKALYDKELLAKASEMVKKS--RVGEQSV---GSTTGKAVSIKEMGSG 781 P NQE+KKQY D K+LY+KE++ K S+ ++ + ++G+ GS++ ++VS SG Sbjct: 180 PNNQEVKKQYADAKSLYEKEIVHKTSKALRNTVQKLGKSETKVNGSSSIQSVSHDTQKSG 239 Query: 782 ST-------------------------NAKRKIGEQELDQSQDGQ------FNQVTQNGH 868 S N K G + Q+ +G N + Q H Sbjct: 240 SAEVHHRTKGNECQIPAIESVLMEEIDNKDTKSGSRTQGQAGNGSKEGYSASNSLEQRNH 299 Query: 869 GITKQEIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAI 1048 K E+K S+Q LA +AAS+A +AAKNI P TAY+FE+SW+G +GD ALQA LLKA+ Sbjct: 300 RTRKPEMKASVQQLASQAASRAMADAAKNITPPTTAYQFEVSWRGFAGDCALQACLLKAM 359 Query: 1049 PPSTLPKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMAD 1228 P LPK+F++AL++ LLI+I+KC+A+FF E+ +L V +DN+TKV RFD+I MCL A Sbjct: 360 SPHELPKIFKNALSSTLLIEIIKCVASFFAEDVDLVVSYMDNLTKVPRFDVIVMCLPSAA 419 Query: 1229 KADIGKTWEEVFSSEAVPIECAETLSKLRVKYCSG 1333 K D+ K W EVF SEA P+E AE L LR K+ G Sbjct: 420 KDDLRKIWNEVFCSEATPMEYAEILGSLRSKFYLG 454 >ref|XP_007031162.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 5 [Theobroma cacao] gi|508719767|gb|EOY11664.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 5 [Theobroma cacao] Length = 389 Score = 416 bits (1068), Expect = e-113 Identities = 219/388 (56%), Positives = 276/388 (71%) Frame = +2 Query: 164 KFKAQSNEEKKKTEIISEAKGVARKAPSVDYSRSLKSIGDISSKFFAEESSPDAATEKEH 343 K + ++NE+ + T S + DY ++ +SS F EE+ PDAA+EKE Sbjct: 8 KEQLKTNEKGRPTGKSSLIDSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKEL 67 Query: 344 GNEYFKQKKFKEAIDCYSRSIALSPTAVAFANRAMAYLKLKRFEEAEVDCTEALNLDDRY 523 GNEYFKQKKFKEAIDCYSRSI LSPTAVA ANRAMAYLK+K+F+EAE DCTEALNLDDRY Sbjct: 68 GNEYFKQKKFKEAIDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRY 127 Query: 524 TKAYSRRATARKELGKLKESFEDSEFALRLEPQNQELKKQYTDTKALYDKELLAKASEMV 703 KAYSRRATARKELGKLKES ED+EFALRLEP NQE+KKQ+ + K+LY+KE+L KAS ++ Sbjct: 128 IKAYSRRATARKELGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVL 187 Query: 704 KKSRVGEQSVGSTTGKAVSIKEMGSGSTNAKRKIGEQELDQSQDGQFNQVTQNGHGITKQ 883 +KS Q VG + KE G G +A + Q Q T+ + K Sbjct: 188 RKSMQEAQEVGKS-----ETKENGLGMHSASNSTQRTGVATVQGYQ----TKKNNRTRKP 238 Query: 884 EIKPSIQDLALRAASQAKTEAAKNIKAPKTAYEFELSWKGLSGDRALQARLLKAIPPSTL 1063 E+K S+Q+LA AA++A EAAKNI P TAY+FE+SW+ LSGDRALQA LLK PS L Sbjct: 239 ELKASVQELASLAATRAMAEAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSAL 298 Query: 1064 PKLFRDALTAPLLIDIVKCIATFFVEETELAVELLDNVTKVSRFDMISMCLSMADKADIG 1243 P++F++AL+A +L+DI+KC+ATFF EE +LA++ L+N+TKV RFDM+ MCLS +KAD+ Sbjct: 299 PQIFKNALSASMLVDIIKCVATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLL 358 Query: 1244 KTWEEVFSSEAVPIECAETLSKLRVKYC 1327 K W++VF +EA PIE AE L LR YC Sbjct: 359 KVWDDVFCNEATPIEWAEILDNLRSVYC 386