BLASTX nr result
ID: Cornus23_contig00007071
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00007071
         (1192 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                         (bits) Value

gb|AIG55302.1| gag-pol, partial [Camellia sinensis]                   565   e-158
emb|CAN59997.1| hypothetical protein VITISV_020888 [Vitis vinifera]   498   e-138
emb|CAA73042.1| polyprotein [Ananas comosus]                          494   e-137
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]   494   e-137
ref|XP_008812481.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   488   e-135
emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera]   482   e-133
ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [The...   481   e-133
ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...   476   e-131
ref|XP_007010875.1| DNA/RNA polymerases superfamily protein, put...   473   e-130
ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [The...   473   e-130
ref|XP_012575125.1| PREDICTED: uncharacterized protein LOC101508...   471   e-130
ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobrom...   471   e-130
ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417...   468   e-129
ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [The...   466   e-128
ref|XP_010668427.1| PREDICTED: uncharacterized protein LOC104885...   464   e-128
ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [The...   461   e-127
ref|XP_007049973.1| DNA/RNA polymerases superfamily protein [The...   461   e-127
ref|XP_007213082.1| hypothetical protein PRUPE_ppa021229mg [Prun...   459   e-126
ref|XP_010695935.1| PREDICTED: uncharacterized protein LOC104908...   458   e-126
gb|AEV42258.1| hypothetical protein [Beta vulgaris]                   458   e-126

>gb|AIG55302.1| gag-pol, partial [Camellia sinensis]
          Length = 923

 Score = 565 bits (1456), Expect = e-158
 Identities = 268/395 (67%), Positives = 324/395 (82%)
 Frame = +1

Query: 4    EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183
            +W +PK VFEIR+FLGLAGYYR+F++DFSRLA+P+TRLTRKGVKFVW + CE++FQELK
Sbjct: 213  DWAQPKNVFEIRNFLGLAGYYRQFVKDFSRLASPLTRLTRKGVKFVWSETCEKSFQELKV 272

Query: 184  RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363
            RLT+AP+LIIPERG  Y ++CDASREGLGCVLMQ+ KVVAYGSRQLK HE+NYPTHDLEL
Sbjct: 273  RLTTAPVLIIPERGLGYAVYCDASREGLGCVLMQEGKVVAYGSRQLKIHEKNYPTHDLEL 332

Query: 364  AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543
             AV+FALK+WRHYLYGE+FEVFSDHKS KYLF+Q+DLNLRQR W+E +EDYDF+L  HPG
Sbjct: 333  TAVIFALKIWRHYLYGEKFEVFSDHKSFKYLFTQRDLNLRQRWWMEFIEDYDFELHCHPG 392

Query: 544  KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723
            KANVVADALSRK   ++S +  +AI EW ++    E  L   E V+ A+  LFS+VAQPT
Sbjct: 393  KANVVADALSRK---TISDVACIAIREWEMLGALGEFDLLLGESVEAAA--LFSVVAQPT 447

Query: 724  LMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVPLPCREAVLQAFH 903
            L+  V+ AQ  D+++ +LRE++  G  E G TV  +  +RYR RL VP  CRE VL  FH
Sbjct: 448  LVTRVLEAQRGDLEIESLREKISSGKVEKGLTVYPEQSVRYRDRLFVPESCREEVLGEFH 507

Query: 904  CSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPLPVS 1083
             SR AVHPGGTKMY ++ R +WW+G+K DVA FV+KCLTCQQVKAEHQRPAGL QPLP++
Sbjct: 508  HSRLAVHPGGTKMYQDLGRQFWWRGMKRDVAVFVSKCLTCQQVKAEHQRPAGLLQPLPIA 567

Query: 1084 EWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188
            EWKWEHITMDFV GLPRT++  DAIWV+VDRLTK+
Sbjct: 568  EWKWEHITMDFVVGLPRTQRGSDAIWVVVDRLTKS 602

>emb|CAN59997.1| hypothetical protein VITISV_020888 [Vitis vinifera] Length = 893 Score = 498 bits (1283), Expect = e-138 Identities = 240/397 (60%), Positives = 301/397 (75%), Gaps = 2/397 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 EW RP VFE+RSFLGL GYYRRF+++FSR+AAPMTRLTRKGVKF W++ CE AFQELKR Sbjct: 213 
EWQRPTNVFEVRSFLGLVGYYRRFVENFSRIAAPMTRLTRKGVKFDWNEECENAFQELKR 272 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 +LT+ P+L P G + I+CD S GLGCVLMQ KVVAY SRQLK HERNY THDLEL Sbjct: 273 KLTTTPVLTAPISGELFTIYCDVSTVGLGCVLMQQGKVVAYASRQLKQHERNYLTHDLEL 332 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 AAVVFALK WRHYLYGE+FEV+SDHKSLKY+F+QKDLN RQRRW+E +EDYDF L YHPG Sbjct: 333 AAVVFALKTWRHYLYGEKFEVYSDHKSLKYIFTQKDLNSRQRRWMETLEDYDFALHYHPG 392 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KANVVADALSRK+ G LS+L V+EDF EL L ++ P L+S++A+P Sbjct: 393 KANVVADALSRKNVGQLSSLELREFEMHAVIEDF-ELCL----GLEGHGPCLYSILARPM 447 Query: 724 LMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQA 897 ++ ++ AQ D L ++ +L G+ ++ W++ DG + ++GRLCVP + R +L Sbjct: 448 VIQRIVEAQVHDEFLEKVKAQLVAGEIDENWSMYEDGSVWFKGRLCVPKDVGLRNELLAD 507 Query: 898 FHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPLP 1077 H +++ +HPG TKMY ++KR +W G+K D+A+FVA C CQQVKAEHQRPAGL QPLP Sbjct: 508 AHKAKYTIHPGNTKMYQDLKRQFWCNGMKRDIAQFVANCQICQQVKAEHQRPAGLLQPLP 567 Query: 1078 VSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 + EWKW++ITMDFV LPRTR + + +WVIVDRLTK+ Sbjct: 568 IPEWKWDNITMDFVIRLPRTRSKKNGVWVIVDRLTKS 604 >emb|CAA73042.1| polyprotein [Ananas comosus] Length = 871 Score = 494 bits (1273), Expect = e-137 Identities = 244/398 (61%), Positives = 308/398 (77%), Gaps = 3/398 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 +W R +V EIRSFLGLAGYYRRF++ F++L+ P+TRLT KGVKF+W+D CER+FQELK+ Sbjct: 237 DWPRLTSVTEIRSFLGLAGYYRRFVERFAKLSTPLTRLTHKGVKFIWNDACERSFQELKQ 296 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 RLT+APIL +P G YV++ DAS GLGCVLMQD+KV+AY SRQLK +E+NYPTHDLEL Sbjct: 297 RLTTAPILTLPVAGAGYVVYSDASLNGLGCVLMQDDKVIAYASRQLKEYEKNYPTHDLEL 356 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 AAVVFALK+WRHYLYGER EV++DHKSLKYLF+QK+LNLRQRRW+EL++DYD + YHPG Sbjct: 357 
AAVVFALKLWRHYLYGERCEVYTDHKSLKYLFTQKELNLRQRRWLELLKDYDLTILYHPG 416 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPV-LFSLVAQP 720 KANVVADALSRKS +L+ V + + R++E L L E V +P+ L +LV QP Sbjct: 417 KANVVADALSRKSMENLAMHV---VTQPRLIEQMKRLEL---EIVTPDTPMRLMTLVVQP 470 Query: 721 TLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVPLP--CREAVLQ 894 TL+ + Q DV+L+ ++ ++ G D +T+ DG +R+RGR+CVP +E +LQ Sbjct: 471 TLLDRIKEKQASDVELQKIKGKMVDGCTGD-FTLDGDGLMRFRGRICVPADSGIKEDILQ 529 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H + +A+HPGGTKMY ++K YWW G+K DV FVAKCLTCQQVKAEH+ PAG Q L Sbjct: 530 EAHRAPYAIHPGGTKMYKDLKLLYWWPGIKKDVGEFVAKCLTCQQVKAEHRVPAGKLQSL 589 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 P+ WKWE ITMDFVTGLPR++ HDAIWVIVDRLTK+ Sbjct: 590 PIPVWKWEKITMDFVTGLPRSQAGHDAIWVIVDRLTKS 627 >emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera] Length = 1573 Score = 494 bits (1272), Expect = e-137 Identities = 238/397 (59%), Positives = 301/397 (75%), Gaps = 2/397 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 EW RP VFE+RSFLGLAGYYRRF++DFSR+AAPMT+LTRK VKF W++ CE AFQELK+ Sbjct: 907 EWQRPTNVFEVRSFLGLAGYYRRFVEDFSRIAAPMTQLTRKWVKFDWNEECENAFQELKQ 966 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 +LT+AP+L P G ++I+CDAS GLGCVLMQ KVVAY SRQLK HERNY HDLEL Sbjct: 967 KLTTAPVLTAPISGELFMIYCDASTVGLGCVLMQQGKVVAYASRQLKQHERNYLAHDLEL 1026 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 AA+VFALK W HYLYGE+FEV+SDHKSLKY+F+QKDLN RQRRW+E +EDYDF L YHPG Sbjct: 1027 AAMVFALKTWIHYLYGEKFEVYSDHKSLKYIFTQKDLNSRQRRWMETLEDYDFALHYHPG 1086 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KANVVADALSRKS G L +L + V+EDF EL L + P L+S+ A+P Sbjct: 1087 KANVVADALSRKSYGQLFSLGLREFEMYAVIEDF-ELCLV----QEGRGPCLYSISARPM 1141 Query: 724 LMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQA 897 ++ ++ AQ D L ++ +L G+ ++ W++ DG +R++GRLCVP + R +L Sbjct: 
1142 VIQRIVEAQVHDEFLEKVKAQLVAGEIDENWSMYEDGSVRFKGRLCVPKDVELRNELLAD 1201 Query: 898 FHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPLP 1077 H +++ +HPG TKMY ++KR + W G+K D+A+FVA C CQQVKAEHQRPA L QPLP Sbjct: 1202 AHRAKYTIHPGNTKMYQDLKRQFXWSGMKRDIAQFVANCQICQQVKAEHQRPAELLQPLP 1261 Query: 1078 VSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 + +WKW++ITMDFV GLPRTR + + +WVIVDRLTK+ Sbjct: 1262 IPKWKWDNITMDFVIGLPRTRSKKNGVWVIVDRLTKS 1298 >ref|XP_008812481.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103723366 [Phoenix dactylifera] Length = 1246 Score = 488 bits (1255), Expect = e-135 Identities = 237/399 (59%), Positives = 306/399 (76%), Gaps = 3/399 (0%) Frame = +1 Query: 1 MEWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELK 180 ++W RP V EIRSFLGLAGYYRRF++ FSR+A P+TRLT+K KFVW + CE++FQELK Sbjct: 579 VDWPRPTNVTEIRSFLGLAGYYRRFVEGFSRIATPLTRLTQKRAKFVWSEDCEQSFQELK 638 Query: 181 RRLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLE 360 +RL SAPIL +P ++I+ DAS++GLGCVLMQ++KVVAY SRQLKP+E+NYPTHDLE Sbjct: 639 QRLVSAPILTLPTSTGGFIIYSDASKKGLGCVLMQNDKVVAYASRQLKPYEQNYPTHDLE 698 Query: 361 LAAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHP 540 LAAVVFALK+W HYLYGE EVF+DHKSLKY+F+QK+LN+RQRRW+EL++DYD ++YHP Sbjct: 699 LAAVVFALKIWGHYLYGEPCEVFTDHKSLKYIFTQKELNMRQRRWLELLKDYDLSIKYHP 758 Query: 541 GKANVVADALSRKSR-GSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQ 717 KANVVADALSRKS GS+S L + ++++DF + + DA +L SL+ Q Sbjct: 759 EKANVVADALSRKSAVGSISLLTT----QKQILKDFEMMQI--DVITKDAGSMLTSLLVQ 812 Query: 718 PTLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVL 891 PTL+ + AQ D L LR +E+G + + + DG LR+ RLCVP + +L Sbjct: 813 PTLIERIKTAQQTDAHLCRLRNDVERGLRPE-LRIHPDGTLRFGCRLCVPKDADLKREIL 871 Query: 892 QAFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQP 1071 + H SRF++HPG TKMY +++ H+WW G+K ++A FVA+CL CQQVKAEHQRPAGL +P Sbjct: 872 EEAHQSRFSIHPGSTKMYTDLREHFWWNGMKREIAGFVARCLVCQQVKAEHQRPAGLLEP 931 Query: 1072 LPVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 L + EWKWEHITMDFV 
GLPRT +R+DA+WVIVDRLTK+ Sbjct: 932 LEIPEWKWEHITMDFVIGLPRTVRRNDAVWVIVDRLTKS 970 >emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera] Length = 1387 Score = 482 bits (1240), Expect = e-133 Identities = 239/398 (60%), Positives = 296/398 (74%), Gaps = 2/398 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 E RP VFE+RSFLGL GYYRRF++DFSR+AAPMTRLTRKGVKF ++ CE AFQELKR Sbjct: 686 EXQRPTNVFEVRSFLGLVGYYRRFVEDFSRIAAPMTRLTRKGVKFDLNEECENAFQELKR 745 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 +LT AP+L P G + I+CDAS GLGCVLMQ +KVVAY SRQLK HERNYPTHDLEL Sbjct: 746 KLTIAPVLTAPISGELFTIYCDASTVGLGCVLMQQDKVVAYASRQLKQHERNYPTHDLEL 805 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 A VVFALK WRHYLYGE+FEV+SDHKSLKY+F+QKDLN RQRRW+E +EDYDF L YHPG Sbjct: 806 AVVVFALKTWRHYLYGEKFEVYSDHKSLKYIFTQKDLNSRQRRWMETLEDYDFALHYHPG 865 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KANVVADALSRKS G LS+L V+EDF EL L ++ P L+S+ A+P Sbjct: 866 KANVVADALSRKSVGQLSSLELREFEMHTVIEDF-ELCL----GLEGHGPCLYSISARPX 920 Query: 724 LMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQA 897 ++ ++ AQ D L ++ +L G+ ++ W++ DG +R++GRLCVP + R +L Sbjct: 921 VIQRIVEAQVHDEFLEKVKTQLVAGEIDENWSMYEDGSVRFKGRLCVPKDVELRNELLAD 980 Query: 898 FHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPLP 1077 H +++ +HPG TK+ G+K D+A+FVA C CQQVKAEHQRPAGL QPLP Sbjct: 981 AHRAKYTIHPGNTKI-----------GMKKDIAQFVANCQICQQVKAEHQRPAGLLQPLP 1029 Query: 1078 VSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKTT 1191 + EWKW++ITMDFV GLPRTR + + +W+IVDRLTK+T Sbjct: 1030 IPEWKWDNITMDFVIGLPRTRSKKNGVWMIVDRLTKST 1067 >ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508708185|gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1515 Score = 481 bits (1239), Expect = e-133 Identities = 234/398 (58%), Positives = 298/398 (74%), Gaps = 2/398 (0%) Frame = +1 Query: 1 
MEWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELK 180 ++W++PKTV EIRSFLGLAGYYRRF+Q FS +AAP+TRLTRKGVKFVWDD CE FQELK Sbjct: 796 LQWEQPKTVTEIRSFLGLAGYYRRFVQGFSLVAAPLTRLTRKGVKFVWDDVCENRFQELK 855 Query: 181 RRLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLE 360 RLTSAP+L +P G+ ++++ DAS+ GLGCVLMQDEKVVAY SRQLK HE NYPTHDLE Sbjct: 856 NRLTSAPVLTLPVNGKGFIVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLE 915 Query: 361 LAAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHP 540 LAAVVFALK+WRHYLYGE +F+DHKSLKYL +QK+LNLRQRRW+EL++DYD + YH Sbjct: 916 LAAVVFALKIWRHYLYGEHCRIFTDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHL 975 Query: 541 GKANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQP 720 GKANVVADALSRKS SL+AL + + + + LG+ D + +L + + +P Sbjct: 976 GKANVVADALSRKSSSSLAALQSC---YFPALIEMKSLGVQLRNGEDGS--LLANFIVRP 1030 Query: 721 TLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQ 894 +L++++ Q D +LR ++L G + + D L ++ R+CVP R+A+++ Sbjct: 1031 SLLNQIKDIQRSDDELRKEIQKLTDGGVSE-FRFGEDNVLMFKDRVCVPEGNQLRQAIME 1089 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H S +A+HPG TKMY ++ +YWW G+K DVA F+AKCL CQQVKAEHQR Q L Sbjct: 1090 EAHSSAYALHPGSTKMYRTIRENYWWPGMKRDVAEFIAKCLVCQQVKAEHQRLVDTLQSL 1149 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 PV EWKWEH+TMDF+ GLPRT++ DAIWVIVDRLTK+ Sbjct: 1150 PVPEWKWEHVTMDFILGLPRTQRGKDAIWVIVDRLTKS 1187 >ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508708318|gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 476 bits (1225), Expect = e-131 Identities = 231/398 (58%), Positives = 296/398 (74%), Gaps = 2/398 (0%) Frame = +1 Query: 1 MEWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELK 180 ++W++P+ V EIRSFLGLAGYYRRF+Q FS +AAP+TRLTRK VK+ WDD CE FQELK Sbjct: 809 LQWEQPRMVTEIRSFLGLAGYYRRFVQGFSLIAAPLTRLTRKEVKYEWDDVCENRFQELK 868 Query: 181 RRLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLE 360 RLTS +L +P G+++V++ DAS+ 
GLGCVLMQDEKV+AY SRQLK HE NYPTHDLE Sbjct: 869 NRLTSTLVLTLPVSGKEFVVYSDASKLGLGCVLMQDEKVIAYASRQLKKHETNYPTHDLE 928 Query: 361 LAAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHP 540 LA VVFALK+WRHYLYGER +F DHKSLKYL +QK+LNLRQR+W+EL++DYD + YHP Sbjct: 929 LATVVFALKIWRHYLYGERCRIFYDHKSLKYLLTQKELNLRQRQWLELIKDYDLVIDYHP 988 Query: 541 GKANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQP 720 KANVVADALSRKS SL+ L + + ++ + LG+ + D +L S V +P Sbjct: 989 RKANVVADALSRKSSSSLATLRS---SYFSMLLEMKSLGIQLNNGED--GTLLASFVVRP 1043 Query: 721 TLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVPL--PCREAVLQ 894 +L++++ Q D L+ ++L+ G + + +S DG L R R+CVP R A+L+ Sbjct: 1044 SLLNQIRELQKSDDWLKQEVQKLQDGKASE-FRLSDDGTLMLRDRICVPKDDQLRRAILE 1102 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H S +A+HPG TKMY +K YWW G++ D+A FVAKCLTCQQ+KAEHQ+P+G QPL Sbjct: 1103 EAHYSAYALHPGSTKMYRTIKESYWWPGMERDIAEFVAKCLTCQQIKAEHQKPSGTLQPL 1162 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 + EWKWEH+TMDFV GLPRT+ DAIWVIVDRLTK+ Sbjct: 1163 SIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKS 1200 >ref|XP_007010875.1| DNA/RNA polymerases superfamily protein, putative [Theobroma cacao] gi|508727788|gb|EOY19685.1| DNA/RNA polymerases superfamily protein, putative [Theobroma cacao] Length = 1347 Score = 473 bits (1217), Expect = e-130 Identities = 233/392 (59%), Positives = 293/392 (74%), Gaps = 2/392 (0%) Frame = +1 Query: 1 MEWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELK 180 ++W++PKTV EIRSFLGLAGYYRRF+Q FS +AAP+TRLTRKGVKFV DD CE FQELK Sbjct: 650 LQWEQPKTVTEIRSFLGLAGYYRRFVQGFSLIAAPLTRLTRKGVKFVCDDVCENRFQELK 709 Query: 181 RRLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLE 360 RLTSAP+L +P G+ +V++ DAS+ GLGCVLMQDEKVVAY SRQLK HE NYPTHDLE Sbjct: 710 NRLTSAPVLTLPVNGKGFVVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLE 769 Query: 361 LAAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHP 540 LAAVVFALK+WRHYLYGE +F+DHKSLKYL +QK+LNLRQRRW+EL++DYD + YHP Sbjct: 770 
LAAVVFALKIWRHYLYGEHCRIFTDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHP 829 Query: 541 GKANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQP 720 GKANVVADALSRKS SL+AL + + + + LG+ D + VL + + +P Sbjct: 830 GKANVVADALSRKSSSSLAALQSC---YFSALIEMKSLGVQLRNGEDGS--VLANFIVRP 884 Query: 721 TLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQ 894 +L++++ Q D +LR ++L G + + D L +R R+CVP R+ +++ Sbjct: 885 SLLNQIKDIQRSDDELRKEIQKLTDGGVSE-FRFGEDNVLMFRDRVCVPEGNQLRQTIME 943 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H S +A++PG TKMY ++ +YWW G+K DVA FVAKCL CQQVKAEHQRP G FQ L Sbjct: 944 EAHSSAYALNPGSTKMYRTIRENYWWPGMKRDVAEFVAKCLVCQQVKAEHQRPVGTFQSL 1003 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIV 1170 PV EWKWEH+TMDFV GLPRT++ DAI+ IV Sbjct: 1004 PVLEWKWEHVTMDFVLGLPRTQRGKDAIYEIV 1035 >ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508779254|gb|EOY26510.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1290 Score = 473 bits (1216), Expect = e-130 Identities = 231/398 (58%), Positives = 294/398 (73%), Gaps = 2/398 (0%) Frame = +1 Query: 1 MEWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELK 180 ++W++P+TV EIRSFLGL GYYRRF+Q FS +AAP+TRLTRKGVKF WDD CE FQELK Sbjct: 598 LQWEQPRTVTEIRSFLGLVGYYRRFVQRFSLIAAPLTRLTRKGVKFEWDDVCENRFQELK 657 Query: 181 RRLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLE 360 RLTSAPIL + +++V++ DA + GLGCVLMQDEKV+AY SRQL HE NY THDLE Sbjct: 658 NRLTSAPILTLSVSEKEFVVYSDAPKLGLGCVLMQDEKVIAYASRQLMKHETNYLTHDLE 717 Query: 361 LAAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHP 540 LAAVVFALK+WRHYLYGER +F DHKSLKYL +QK+LNLRQRRW+EL++DYD + YHP Sbjct: 718 LAAVVFALKIWRHYLYGERCRIFFDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHP 777 Query: 541 GKANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQP 720 GKANVV DALSRKS SL+ L + + ++ + LG+ + D +L S V +P Sbjct: 778 GKANVVTDALSRKSSSSLATLRS---SYFPMLLEMKSLGIQLNNGED--GTLLASFVVRP 832 Query: 721 
TLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVPL--PCREAVLQ 894 +L++++ Q D L+ ++L+ G+ + + +S DG L R R+CVP R A+L+ Sbjct: 833 SLLNQIRELQKFDDWLKQEVQKLQDGEASE-FRLSDDGTLMLRDRICVPKDDQLRRAILE 891 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H S +A+HPG TKMY +K YWW G+K D+A FVAKCL CQQ+KAEHQ+ +G QPL Sbjct: 892 EAHSSAYALHPGSTKMYQTIKESYWWPGMKRDIAEFVAKCLICQQIKAEHQKSSGTLQPL 951 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 P+ EWKWEH+TMDFV GLPRT+ DAIWVI+ RLTK+ Sbjct: 952 PIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIMGRLTKS 989 >ref|XP_012575125.1| PREDICTED: uncharacterized protein LOC101508115 [Cicer arietinum] Length = 1870 Score = 471 bits (1212), Expect = e-130 Identities = 236/398 (59%), Positives = 298/398 (74%), Gaps = 2/398 (0%) Frame = +1 Query: 1 MEWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELK 180 +EW PK+V EIRSFLGLAGYYRRFI+ FSRLA P+T+LTRKG FVWD CE +FQELK Sbjct: 793 LEWKAPKSVTEIRSFLGLAGYYRRFIEGFSRLALPLTKLTRKGELFVWDTHCENSFQELK 852 Query: 181 RRLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLE 360 +RLTSAPIL++P+ +V++CDA GLG VLMQD KVVAY SRQLK HERNYPTHDLE Sbjct: 853 KRLTSAPILVLPDLSEPFVVYCDACGSGLGGVLMQDGKVVAYASRQLKIHERNYPTHDLE 912 Query: 361 LAAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHP 540 LAAVVF LK+WRHYLYG RFEVFSDHKSLKYLF QK+LN+RQRRW+E ++D+DF+L+YHP Sbjct: 913 LAAVVFVLKMWRHYLYGSRFEVFSDHKSLKYLFDQKELNMRQRRWMEFLKDFDFELKYHP 972 Query: 541 GKANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQP 720 GKANVVADALSRK+ S+SAL+ + ++E F +L L EV S L L Sbjct: 973 GKANVVADALSRKTL-SVSALM---VKHSELLEQFRDLSLVC--EVTPKSIKLGMLKVTS 1026 Query: 721 TLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQ 894 L+ E+ +Q D+ L + ++QG + D + + DG LR++ R+CVP R+ +L+ Sbjct: 1027 GLLEEIEKSQKLDIYLLDKLQSIDQGREPD-FKIGVDGILRFKERICVPDVEELRKMILE 1085 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H S ++HPG TKMY ++K+ +WW +K DVA FV CLTCQ+ K EHQ+P+GL QPL Sbjct: 1086 
EGHRSCLSIHPGATKMYKDLKKIFWWPKMKRDVAEFVYACLTCQKSKVEHQKPSGLMQPL 1145 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 + EWKW+ I+MDFV GLPRT +R+D+IWVIVDRLTK+ Sbjct: 1146 SIPEWKWDSISMDFVVGLPRTPKRYDSIWVIVDRLTKS 1183 >ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobroma cacao] gi|508727367|gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao] Length = 860 Score = 471 bits (1211), Expect = e-130 Identities = 231/398 (58%), Positives = 292/398 (73%), Gaps = 3/398 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 +W RP +V EIRSF+GLAGYYRRF++DFS++ AP+T+LTRK KF W D CE +F++LK Sbjct: 177 KWPRPTSVTEIRSFVGLAGYYRRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKA 236 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 LT+AP+L +P+ R Y +FCDAS GLGCVLMQ KV+AY SRQLK HE+NYP HDLE+ Sbjct: 237 CLTTAPVLSLPQGTRGYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEM 296 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 AA+VFALK+WRHYLYGE E++ DHKSLKY+F Q+DLNLRQRRW+EL++DYD + YHPG Sbjct: 297 AAIVFALKIWRHYLYGETCEIYMDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPG 356 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KANVVADALSRKS GSL+ + R + ++G+ EV + S +L +P Sbjct: 357 KANVVADALSRKSMGSLAHISIGRRSLVREIHSLGDIGVRL--EVAETSALLAHFRVRPI 414 Query: 724 LMHEVIVAQGQD-VDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQ 894 LM ++ AQ +D ++AL + QG + +T DG LRY RL VP R +L+ Sbjct: 415 LMDKIKEAQSKDEFVIKALED--PQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRREILE 472 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H + + VHPG TKMY ++K YWW+GLK DVA FV+KCL CQQVKAEHQ+PAGL QPL Sbjct: 473 EAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPL 532 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 PV EWKWEHI MDFVTGLPRT +D+IW++VDRLTK+ Sbjct: 533 PVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKS 570 >ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417177 [Eucalyptus grandis] Length = 1753 Score = 
468 bits (1203), Expect = e-129 Identities = 225/398 (56%), Positives = 299/398 (75%), Gaps = 2/398 (0%) Frame = +1 Query: 1 MEWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELK 180 + W RP TV EIRSFLGLAGYYRRF++ FSRLA+PMTRL +K KFVW D CE +FQELK Sbjct: 786 INWPRPTTVTEIRSFLGLAGYYRRFVEGFSRLASPMTRLLKKEEKFVWTDKCENSFQELK 845 Query: 181 RRLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLE 360 +LT+AP+L IP + I+ DAS +GLGCVLMQ +VVAY SRQL+ HE NYPTHDLE Sbjct: 846 HKLTTAPVLTIPSGPGGFEIYSDASFKGLGCVLMQHGRVVAYASRQLRLHELNYPTHDLE 905 Query: 361 LAAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHP 540 LAA++FALK+WRHYL GERF++F+DH+SLKYLFSQK+LN+RQRRW+EL++DYD ++ YHP Sbjct: 906 LAAIIFALKIWRHYLCGERFQIFTDHQSLKYLFSQKELNMRQRRWMELLKDYDCEILYHP 965 Query: 541 GKANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQP 720 GKAN VADALSRK S++ + + EW ++E + F EV S ++ +L +P Sbjct: 966 GKANKVADALSRK-----SSVAQMVLKEWGLIERARDSDFKF--EVGHLSNLVATLRIEP 1018 Query: 721 TLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQ 894 + ++ Q D D++ + + + + D + +S DG LR++GRL VP + RE +L Sbjct: 1019 EVQVKIRTLQQMDSDVQKILQEDAEKRKAD-FQISEDGTLRFQGRLVVPDDVELREEILS 1077 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H S +++HPG TKMY N+++HYWW G+K D+A+ VAKCLTCQQVKA+H +P GL +PL Sbjct: 1078 EAHRSNYSIHPGSTKMYQNLRQHYWWCGMKADIAKHVAKCLTCQQVKAQHCKPGGLLRPL 1137 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 + EWKWEHITMDFVTGLPR+++ +D+IWV+VDRLTK+ Sbjct: 1138 EIPEWKWEHITMDFVTGLPRSQRGNDSIWVVVDRLTKS 1175 >ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508716781|gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 666 Score = 466 bits (1198), Expect = e-128 Identities = 228/398 (57%), Positives = 292/398 (73%), Gaps = 3/398 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 +W RP +V EIRSF+GLAGYYRRF++DFS++ AP+T+LTRK KF W D CE +F++LK Sbjct: 60 
KWPRPTSVTEIRSFVGLAGYYRRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKA 119 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 LT+AP+L +P+ Y +FCDAS GLGCVLMQ KV+AY SRQLK HE+NYP H+LE+ Sbjct: 120 CLTTAPVLSLPQGTGGYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHNLEM 179 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 AA+VFALK+WRHYLYGE E+++DHKSLKY+F Q+DLNLRQRRW+EL++DYD + YHPG Sbjct: 180 AAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPG 239 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KANVVADALSRKS GSL+ + R + ++G+ EV + + +L +P Sbjct: 240 KANVVADALSRKSMGSLAHISIGRRSLVREIHSLGDIGVRL--EVAETNALLAHFRVRPI 297 Query: 724 LMHEVIVAQGQD-VDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQ 894 LM ++ AQ +D ++AL + QG + +T DG LRY RL VP R +L+ Sbjct: 298 LMDKIKEAQSKDEFVIKALED--PQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRRKILE 355 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H + + VHPG TKMY ++K YWW+GLK DVA FV+KCL CQQVKAEHQ+PAGL QPL Sbjct: 356 EAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPL 415 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 PV EWKWEHI MDFVTGLPRT +D+IW++VDRLTK+ Sbjct: 416 PVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKS 453 >ref|XP_010668427.1| PREDICTED: uncharacterized protein LOC104885432, partial [Beta vulgaris subsp. 
vulgaris] Length = 1134 Score = 464 bits (1193), Expect = e-128 Identities = 224/397 (56%), Positives = 299/397 (75%), Gaps = 2/397 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 EW PK V E+RSFLGLAGYYRRF+++FS++A P+T L RK +F W++ CE AF ELKR Sbjct: 604 EWPAPKNVSEVRSFLGLAGYYRRFVKNFSKIALPITSLIRKNSRFQWNEKCEAAFLELKR 663 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 RLTSAPIL +P + I+ DAS+EGLGCVLMQ KV+AY SRQL+PHE+NYP HDLEL Sbjct: 664 RLTSAPILTLPSGTEGFEIYSDASQEGLGCVLMQHGKVIAYASRQLRPHEKNYPVHDLEL 723 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 AAVVFALK+WRHYLYG +VF+DHKSLKY+F+QKD+N+RQRRW+EL++DYD D++YHPG Sbjct: 724 AAVVFALKLWRHYLYGVSCKVFTDHKSLKYIFTQKDMNMRQRRWLELLKDYDIDIQYHPG 783 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KAN VADALSR+ R LS L A+ + +E F EL L S +++ + +L QP Sbjct: 784 KANKVADALSRRPRSELSFLSALPDELSKEIELF-ELALVRSGDIEG---TINALTVQPD 839 Query: 724 LMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQA 897 L E+ Q QD L+ ++E+++ G+ ++ + DG +R RGR CVP R+ +L+ Sbjct: 840 LYSEIREKQSQDAFLQGIKEKIKNGETQE-FAQYEDGSIRMRGRWCVPEDQDLRQRILKE 898 Query: 898 FHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPLP 1077 H S ++VHPG KM ++K+++WWKGLK +VAR+VA+CLTCQ+VK E Q+ GL QPLP Sbjct: 899 AHSSPYSVHPGRDKMVRDLKKYFWWKGLKKEVARYVARCLTCQKVKFERQKAPGLLQPLP 958 Query: 1078 VSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 + EWKW+ ++MDFV+GLPR+++ +D+IWVIVDRLTKT Sbjct: 959 IPEWKWDSVSMDFVSGLPRSKKGNDSIWVIVDRLTKT 995 >ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508774222|gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 878 Score = 461 bits (1187), Expect = e-127 Identities = 227/394 (57%), Positives = 288/394 (73%), Gaps = 3/394 (0%) Frame = +1 Query: 16 PKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKRRLTS 195 P +V EIRSF+GLAGYYRRF++DFS++ AP+T+LTRK KF W D CE +F++LK LT+ Sbjct: 347 
PTSVTEIRSFVGLAGYYRRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTT 406 Query: 196 APILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLELAAVV 375 AP+L +P+ Y +FCDAS GLGCVLMQ KV+AY SRQLK HE+NYP HDLE+AA+V Sbjct: 407 APVLSLPQGTGGYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIV 466 Query: 376 FALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPGKANV 555 FALK+WRHYLYGE E+++DHKSLKY+F Q+DLNLRQRRW+EL++DYD + YHPGKANV Sbjct: 467 FALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANV 526 Query: 556 VADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPTLMHE 735 VADALSRKS GSL+ + R + ++G+ EV + + +L +P LM Sbjct: 527 VADALSRKSMGSLAHIFIGRRSLVREIHSLGDIGVRL--EVAETNALLAHFRVRPILMDR 584 Query: 736 VIVAQGQD-VDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQAFHC 906 + AQ +D ++AL + QG + +T DG LRY RL VP R +L+ H Sbjct: 585 IKEAQSKDEFVIKALED--PQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRREILEEAHM 642 Query: 907 SRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPLPVSE 1086 + + VHPG TKMY ++K YWW+GLK DVA FV+KCL CQQVKAEHQ+PAGL QPLPV E Sbjct: 643 AAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPE 702 Query: 1087 WKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 WKWEHI MDFVTGLPRT +D+IW++VDRLTK+ Sbjct: 703 WKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKS 736 >ref|XP_007049973.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508702234|gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1401 Score = 461 bits (1185), Expect = e-127 Identities = 225/398 (56%), Positives = 289/398 (72%), Gaps = 3/398 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 +W RP +V EIRSF+GLAGYYRRF++DFS++ AP+T+LTRK KF W D CE +F++LK Sbjct: 731 KWPRPTSVTEIRSFVGLAGYYRRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKA 790 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 LT+AP+L +P+ Y +FCDASR GLGCVLMQ KV+AY SRQLK HE+NYP HDLE+ Sbjct: 791 CLTTAPVLSLPQGTGGYTVFCDASRVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEM 850 Query: 364 
AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 A +VFALK+WRHYLYGE E+++DHKSLKY+F Q+DLNLRQRRW+EL++DYD + YHPG Sbjct: 851 ATIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPG 910 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KANVVAD LSRKS GSL+ + R + ++G+ EV + + +L +P Sbjct: 911 KANVVADVLSRKSMGSLAHISIGRRSLVREIHSLGDIGVRL--EVAETNALLAHFRVRPI 968 Query: 724 LMHEVIVAQGQD-VDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCV--PLPCREAVLQ 894 LM ++ AQ +D ++AL + QG + +T DG LRY RL V R +L+ Sbjct: 969 LMDKIKEAQSKDEFVIKALED--PQGRKGKMFTKGTDGVLRYGTRLYVLDGDGLRREILE 1026 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H + + VHPG TKMY ++K YWW+GLK DVA FV+KCL CQQVKAEHQ+PAGL QPL Sbjct: 1027 EAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPL 1086 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 PV +WKWEHI MDFVTG PRT +D+IW++VDRLTK+ Sbjct: 1087 PVPKWKWEHIAMDFVTGFPRTSGGYDSIWIVVDRLTKS 1124 >ref|XP_007213082.1| hypothetical protein PRUPE_ppa021229mg [Prunus persica] gi|462408947|gb|EMJ14281.1| hypothetical protein PRUPE_ppa021229mg [Prunus persica] Length = 1194 Score = 459 bits (1180), Expect = e-126 Identities = 226/399 (56%), Positives = 290/399 (72%), Gaps = 3/399 (0%) Frame = +1 Query: 1 MEWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELK 180 + W RP +V EIRSFLGLAGYYRRF++ FS +AAP+T LTRKGVKFVW D CE +F ELK Sbjct: 484 VNWLRPTSVTEIRSFLGLAGYYRRFVEGFSTIAAPLTYLTRKGVKFVWSDKCEESFIELK 543 Query: 181 RRLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLE 360 RLT+AP+L +P+ ++VI+ DAS++GLGCVLMQ +V+AY SRQLK HE NYP HDLE Sbjct: 544 TRLTTAPVLALPDDSGNFVIYSDASQQGLGCVLMQHGRVIAYASRQLKKHELNYPVHDLE 603 Query: 361 LAAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHP 540 LAAVVFALK+WRHYLYGE ++F+DHKSLKYLF+QK+LNLRQRRW+EL++DYD + +HP Sbjct: 604 LAAVVFALKIWRHYLYGETCQIFTDHKSLKYLFTQKELNLRQRRWLELIKDYDCTIEHHP 663 Query: 541 GKANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAEL-GLYFSEEVDDASPVLFSLVAQ 717 G+ANVVADALSRKS GS++ L R + E+ L +VD+ +L 
+L + Sbjct: 664 GRANVVADALSRKSSGSIAYL------RGRYLPLMVEMRKLRIGLDVDNQGALLATLHVR 717 Query: 718 PTLMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVL 891 P L+ ++ AQ QD + LR + GD+ D +V DG L RL VP + +L Sbjct: 718 PVLVERILAAQSQDPLICTLRVEVANGDRTD-CSVRNDGALMVGNRLYVPNDEALKREIL 776 Query: 892 QAFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQP 1071 + H S FA+HPG TKMY ++ HYWW +K +A +V +CL CQQVKAE Q+P+GL QP Sbjct: 777 EEAHESAFAMHPGSTKMYHTLREHYWWPFMKKQIAEYVRRCLICQQVKAERQKPSGLLQP 836 Query: 1072 LPVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 LP+ EWKWE ITMDFV LP+T+ +HD +WVIVDRLTK+ Sbjct: 837 LPIPEWKWERITMDFVFKLPQTQSKHDGVWVIVDRLTKS 875 >ref|XP_010695935.1| PREDICTED: uncharacterized protein LOC104908519 [Beta vulgaris subsp. vulgaris] Length = 1273 Score = 458 bits (1179), Expect = e-126 Identities = 224/397 (56%), Positives = 296/397 (74%), Gaps = 2/397 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 EW PK V E+RSFLGLAGYYRRF+++FS++A P+T L RK +F W++ CE AF ELKR Sbjct: 614 EWPAPKNVSEVRSFLGLAGYYRRFVKNFSKIALPITSLIRKNSRFQWNEKCEAAFLELKR 673 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 RLTSAPIL +P + I+ DAS+EGLGCVLMQ KV+AY SRQL+PHE+NYP HDLEL Sbjct: 674 RLTSAPILTLPSGTEGFEIYSDASQEGLGCVLMQHGKVIAYASRQLRPHEKNYPVHDLEL 733 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 AAVVFALK+W+HYLY +VF+DHKSLKY+F+QKD+N+RQRRW+EL++DYD D++YHPG Sbjct: 734 AAVVFALKLWQHYLYAVSCKVFTDHKSLKYIFTQKDMNMRQRRWLELLKDYDIDIQYHPG 793 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KAN VADALSR+ R LS L A+ + +E F EL L S E+ + +L QP Sbjct: 794 KANKVADALSRRPRSELSFLSAMPDELSKEIELF-ELVLVRSGEI---GGTINALTVQPD 849 Query: 724 LMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVP--LPCREAVLQA 897 L E+ Q QD L+ ++E+++ G+ ++ + DG +R RGR CVP R+ VL+ Sbjct: 850 LYSEIREKQSQDAFLQGVKEKIKNGETQE-FAQCEDGSIRLRGRWCVPEDQNLRQRVLKE 908 Query: 898 FHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPLP 1077 H S ++VHPG KM 
++K+++WW+GLK DVAR+VA+CLTCQ+VK E + GL QPLP Sbjct: 909 AHSSPYSVHPGRDKMVRDLKKYFWWRGLKKDVARYVARCLTCQKVKFERHKAPGLLQPLP 968 Query: 1078 VSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 + EWKW+ ++MDFV+GLPR+R+ +D+IWVIVDRLTKT Sbjct: 969 IPEWKWDSVSMDFVSGLPRSRKGNDSIWVIVDRLTKT 1005 >gb|AEV42258.1| hypothetical protein [Beta vulgaris] Length = 1553 Score = 458 bits (1179), Expect = e-126 Identities = 218/398 (54%), Positives = 290/398 (72%), Gaps = 3/398 (0%) Frame = +1 Query: 4 EWDRPKTVFEIRSFLGLAGYYRRFIQDFSRLAAPMTRLTRKGVKFVWDDCCERAFQELKR 183 EW PK V +IRSFLGLAGYYRRF++DFS++A PMT L +K +F W++ E+AFQ LK Sbjct: 812 EWPTPKNVTDIRSFLGLAGYYRRFVKDFSKIAKPMTNLMKKDCRFTWNEDSEKAFQTLKE 871 Query: 184 RLTSAPILIIPERGRDYVIFCDASREGLGCVLMQDEKVVAYGSRQLKPHERNYPTHDLEL 363 RLTSAP+L +P Y ++ DAS+ GLGCVLMQ+ KV+AY SRQLKP+E NYPTHDLEL Sbjct: 872 RLTSAPVLTLPNGNEGYDVYSDASKNGLGCVLMQNGKVIAYASRQLKPYEVNYPTHDLEL 931 Query: 364 AAVVFALKVWRHYLYGERFEVFSDHKSLKYLFSQKDLNLRQRRWVELMEDYDFDLRYHPG 543 AA+VFALK+WRHYLYG +F+DHKSLKY+F+QKDLN+RQRRW+EL++DYD D++YH G Sbjct: 932 AAIVFALKIWRHYLYGVTCRIFTDHKSLKYIFTQKDLNMRQRRWLELIKDYDLDIQYHEG 991 Query: 544 KANVVADALSRKSRGSLSALVAVAIHEWRVMEDFAELGLYFSEEVDDASPVLFSLVAQPT 723 KANVVADALSRKS SL+ LV ++ E+F+ L + E + +L +L +P Sbjct: 992 KANVVADALSRKSSHSLNTLVVAD----KLCEEFSRLQIEVVHE-GEVERLLSALTIEPN 1046 Query: 724 LMHEVIVAQGQDVDLRALRERLEQGDQEDGWTVSADGGLRYRGRLCVPLPCRE---AVLQ 894 + E+ +Q DV L ++ +L++G E G+ + DG +RY+GR CVP C E ++ Sbjct: 1047 FLEEIRASQPGDVKLERVKAKLKEGKAE-GFAIHEDGSIRYKGRWCVPQKCEELKQKIMS 1105 Query: 895 AFHCSRFAVHPGGTKMYMNVKRHYWWKGLKGDVARFVAKCLTCQQVKAEHQRPAGLFQPL 1074 H + + VHPGG K+Y ++K+ +WW G+K VA FV+KCLTCQ+VK+EH+RP G QPL Sbjct: 1106 EGHNTTYYVHPGGDKLYKDLKKMFWWPGMKRAVAEFVSKCLTCQKVKSEHKRPQGKIQPL 1165 Query: 1075 PVSEWKWEHITMDFVTGLPRTRQRHDAIWVIVDRLTKT 1188 + WKW+ I+MDFV LPR+R ++ IWVIVDRLTKT Sbjct: 1166 DIPTWKWDSISMDFVVALPRSRGGNNTIWVIVDRLTKT 1203