BLASTX nr result
ID: Sinomenium22_contig00027065
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00027065
         (1670 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223...  169   8e-60
ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac...  156   3e-58
ref|XP_007220740.1| hypothetical protein PRUPE_ppa023598mg [Prun...  144   1e-56
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                164   4e-56
gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                134   6e-55
ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300...  123   6e-48
ref|XP_007221295.1| hypothetical protein PRUPE_ppa024499mg, part...  126   2e-45
emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera]  107   5e-43
ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutr...  149   9e-41
ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, part...  156   2e-40
ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prun...  160   2e-40
ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom...  156   4e-40
emb|CAN69233.1| hypothetical protein VITISV_003380 [Vitis vinifera]  108   5e-40
ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prun...  157   1e-39
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...  155   4e-35
ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobrom...  155   5e-35
ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom...  140   6e-35
ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial ...  150   2e-33
ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobrom...
147 1e-32 ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [The... 130 1e-32 >ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223713 [Cucumis sativus] Length = 645 Score = 169 bits (427), Expect(2) = 8e-60 Identities = 102/285 (35%), Positives = 150/285 (52%) Frame = +1 Query: 622 QRLCLVP*WQENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLN 801 QR+ + P ++ ++ +F++ TI G+VC +ID++ +N ++K++V L ++ H Sbjct: 274 QRVLITPKEEKKQQRHCLFKARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPT 333 Query: 802 PYKLAWLKKGSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHG 981 YK+ W++K E VS TV I N YKD+++CDV+ MDV HLLL RPWQYD +H Sbjct: 334 SYKIGWVRKEGEATVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHK 393 Query: 982 GCNNTYSFQFGGTKIMLLPSRNKGSPKPSNQQ*LSHSSYVRVGNGVRHGVCSSKHAYGTR 1161 G NTY Q G K++LLP K + G+ K + T Sbjct: 394 GRENTYELQLMGRKVVLLPITRKN----------------------KEGLRGEKQLFTTV 431 Query: 1162 GVYHS*AGEGSDR*VFLHVSRGLANRVSTLAIFKIRT*CQGATLSN*SHYRMSPKEYEEL 1341 + D + L GL + GA+L N +HYRMSP+EY+ L Sbjct: 432 SGKNMLKEREQDL-LGLEEPEGLPPLRDIQHHIDL---IPGASLPNLAHYRMSPQEYKTL 487 Query: 1342 HRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN*AINR 1476 H +E+L+ K H+ LS VPALLT +K+GS RMC+D+ AINR Sbjct: 488 HDHIEELLKKGHIKPSLSPCAVPALLTLKKDGSWRMCVDSRAINR 532 Score = 90.9 bits (224), Expect(2) = 8e-60 Identities = 43/63 (68%), Positives = 50/63 (79%) Frame = +2 Query: 1460 IEPLTGVVVVSNLDLKSGYHQIRIRSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRV 1639 ++ L + S +DLKSGYHQIRIR GD+WK TFK EGLFE +VMPFGLSNAP TFMR+ Sbjct: 548 LDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTKEGLFEWMVMPFGLSNAPNTFMRL 607 Query: 1640 MNQ 1648 MNQ Sbjct: 608 MNQ 610 >ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao] gi|508704828|gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao] Length = 794 Score = 156 bits (395), Expect(4) = 3e-58 Identities = 76/137 (55%), Positives = 95/137 (69%) Frame = +1 Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831 E+WL++ IF + T GKVC+ +IDS CENVI+ +V+KL +QT H +PYKL WL+KG Sbjct: 
371 ESWLRHNIFHTRCTSQGKVCNVIIDSGSCENVIANYMVKKLKLQTEVHPHPYKLQWLRKG 430 Query: 832 SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011 +EV V+ + V F IGNKY+DEV CDV+ MD HLLL RPWQYDR H G NTYSF Sbjct: 431 NEVKVTKRCCVQFSIGNKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHHDGYKNTYSFIK 490 Query: 1012 GGTKIMLLPSRNKGSPK 1062 G KIML P + + PK Sbjct: 491 DGAKIMLTPLKPEDCPK 507 Score = 60.1 bits (144), Expect(4) = 3e-58 Identities = 31/65 (47%), Positives = 46/65 (70%) Frame = +1 Query: 1282 GATLSN*SHYRMSPKEYEELHRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN 1461 G+ + N YRMSP+E++EL QV++L+ K V E +S VPALL +K+G+ RMCID+ Sbjct: 584 GSIIPNKPAYRMSPQEHKELQHQVKQLLEKGLVRESVSPCAVPALLVPKKDGTWRMCIDS 643 Query: 1462 *AINR 1476 A+N+ Sbjct: 644 RAVNK 648 Score = 48.9 bits (115), Expect(4) = 3e-58 Identities = 46/162 (28%), Positives = 74/162 (45%), Gaps = 15/162 (9%) Frame = +2 Query: 179 ETEEQRVSHYIGGLHL*IQNVLNMLGLVTVSKV*K*SFQYEKQLIHQGSSAFDNAGPSTN 358 E EEQ V+ Y+GGL++ I +V+ + ++ V + + + EKQ + + S + ST+ Sbjct: 201 EPEEQTVARYLGGLNVGIADVVQLQPYWNLNDVIRLALKVEKQQLRKSSMSSSRQKDSTS 260 Query: 359 ----QQ*ASTSTPKLGQPQQL-------TRSGGC---CFLCCDLGHRQSKCRKKR-*GLF 493 Q A+ PK+ + + TR+ CF C GH S C +R L Sbjct: 261 NRGRQSSATIPPPKVNSSKTINHKETTSTRAPNVNKKCFKCQGFGHIASDCPNRRIISLI 320 Query: 494 VEEVTEDDKAVIV*SELDFDTSDDLAGDEELVEGDTSPMLVV 619 EEV E+ V EL+ ++++ E V D LVV Sbjct: 321 EEEVMEEPSLEEVDDELEIFNNEEI----EEVSADHGEALVV 358 Score = 30.4 bits (67), Expect(4) = 3e-58 Identities = 13/51 (25%), Positives = 29/51 (56%) Frame = +2 Query: 1124 MVYVLVSMPMEPEASTIPKQVRVLIDEFFYMFPEDLPTEFLPLQSSKSGLD 1276 ++Y+L+ +S + K V+ +I+EF + PE++P P++ + +D Sbjct: 530 LLYLLLVCEENEVSSPLSKDVKPIIEEFCDVVPEEIPHGLPPMRDIQHAID 580 >ref|XP_007220740.1| hypothetical protein PRUPE_ppa023598mg [Prunus persica] gi|462417202|gb|EMJ21939.1| hypothetical protein PRUPE_ppa023598mg [Prunus persica] Length = 1457 Score = 144 bits (363), Expect(3) = 1e-56 Identities = 103/299 (34%), Positives = 155/299 (51%), Gaps = 14/299 (4%) Frame = +1 Query: 622 
QRLCLVP*WQENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLN 801 QR+ L P +E ++ I +S +I KVC ++D+ CEN +SK++V+ L + T H+ Sbjct: 384 QRVLLAP--KEEGQRHSICRSLCSIKNKVCDVIVDNGSCENFVSKKLVEHLQLSTEPHVR 441 Query: 802 PYKLAWLKKGSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHG 981 PY L W+KKG V V+ +V IG Y D+VLCDV+ MD H+LL + WQ+D + Sbjct: 442 PYSLGWVKKGPSVRVAETYSVPLSIGKHYIDDVLCDVIDMDACHILLGQLWQFDVDATYK 501 Query: 982 GCNNTYSFQFGGTKIMLL---PSRNKGSPKPSNQQ*LSHSSYVRVGNGVRHGVCSSKH-- 1146 G +N F + KI + PS+ PK + L+ S + N V V +++ Sbjct: 502 GRDNVILFSWNNRKIAMATTKPSKQSVEPKTRSSSFLTLISSEQELNKV---VKEAEYFC 558 Query: 1147 AYGTRGVYHS*AGEG---SDR*VFLH-----VSRGLANRVSTLAIFKIR-T*CQGATLSN 1299 +G+ GE D L +S L N + ++ + R GA L N Sbjct: 559 PLVLKGLLKLGRGESDIPQDVQKILSQFQELLSEKLPNELPSMRDIQHRIDLVPGANLPN 618 Query: 1300 *SHYRMSPKEYEELHRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN*AINR 1476 HYRMSPKE + L Q+E+L+ K + E LS VP LL +K+ + RMC+D+ AIN+ Sbjct: 619 LPHYRMSPKENDILREQIEELLQKGFIRESLSPCAVPVLLVPKKDKTWRMCVDSRAINK 677 Score = 90.9 bits (224), Expect(3) = 1e-56 Identities = 43/63 (68%), Positives = 51/63 (80%) Frame = +2 Query: 1460 IEPLTGVVVVSNLDLKSGYHQIRIRSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRV 1639 ++ L+G V S +DL+SGYHQIRIR GD+WK FK +GLFE LVMPFGLSNAP TFMR+ Sbjct: 693 LDVLSGSRVFSKIDLRSGYHQIRIRPGDEWKTAFKSKDGLFEWLVMPFGLSNAPSTFMRL 752 Query: 1640 MNQ 1648 MNQ Sbjct: 753 MNQ 755 Score = 35.0 bits (79), Expect(3) = 1e-56 Identities = 29/102 (28%), Positives = 42/102 (41%) Frame = +2 Query: 314 HQGSSAFDNAGPSTNQQ*ASTSTPKLGQPQQLTRSGGCCFLCCDLGHRQSKCRKKR*GLF 493 ++ SS N G S NQ + P+ C+ C GHR + C + F Sbjct: 298 NESSSRTFNRGQSRNQSQNPYAKPRTD----------ICYRCQKPGHRSNVCPEWTQANF 347 Query: 494 VEEVTEDDKAVIV*SELDFDTSDDLAGDEELVEGDTSPMLVV 619 +EEV ED+ E D DD AG E +E +++V Sbjct: 348 IEEVDEDE-------EKDEVGEDDYAGAEFAIEERMERIILV 382 >gb|ADP20179.1| gag-pol polyprotein [Silene latifolia] Length = 1475 Score = 164 bits (415), Expect(2) = 4e-56 Identities = 109/286 (38%), Positives = 154/286 (53%), Gaps = 19/286 (6%) Frame = +1 Query: 673 
IFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKGSEVFVSH 852 IF+S TI G+VC+ +ID C NV S +++KL + T H +PYKL WL KG+EV V Sbjct: 388 IFRSRCTIKGRVCNLIIDGGSCTNVASSTLIEKLSLPTQDHPSPYKLRWLNKGAEVRVDK 447 Query: 853 QATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQFGGTKIML 1032 Q V+F IG Y DE LCDV+ MD HLLL RPW++DR +H G +NTY+F+F K++L Sbjct: 448 QCLVTFSIGKNYSDEALCDVLPMDACHLLLGRPWEFDRDSVHHGRDNTYTFKFRSRKVIL 507 Query: 1033 L---PSRNKGSP----KPSNQQ*LSHSSYVRV---GNGVRHGVCSSKHAYGTRGVYHS*A 1182 P +P +PS + L + + + G+ + + + +G Sbjct: 508 TPLPPVLKHTTPPSMLEPSKEVLLINEAEMLQELKGDEDVYALIAKDVVFGQNVSLPKEV 567 Query: 1183 GE--GSDR*VF-------LHVSRGLANRVSTLAIFKIRT*CQGATLSN*SHYRMSPKEYE 1335 E S VF L RG+ +++ + GATL N + YR PK + Sbjct: 568 QELLQSYEDVFPNELPSGLPPLRGIEHQIDFIP---------GATLPNKAAYRSDPKATQ 618 Query: 1336 ELHRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN*AIN 1473 EL +Q+ +LV+K V E LS VPALL +K+GS RMC D+ AIN Sbjct: 619 ELQQQIGELVSKGFVRESLSPCSVPALLVPKKDGSWRMCTDSRAIN 664 Score = 83.2 bits (204), Expect(2) = 4e-56 Identities = 36/63 (57%), Positives = 48/63 (76%) Frame = +2 Query: 1460 IEPLTGVVVVSNLDLKSGYHQIRIRSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRV 1639 ++ L+G + S +DL+ GYHQ+RI+ GD+WK FK GL+E LVMPFGLSNAP TFMR+ Sbjct: 681 LDELSGAQLFSKIDLRQGYHQVRIKEGDEWKTAFKTKHGLYEWLVMPFGLSNAPSTFMRL 740 Query: 1640 MNQ 1648 M + Sbjct: 741 MTE 743 >gb|ADP20178.1| gag-pol polyprotein [Silene latifolia] Length = 1518 Score = 134 bits (338), Expect(3) = 6e-55 Identities = 69/137 (50%), Positives = 87/137 (63%), Gaps = 5/137 (3%) Frame = +1 Query: 664 QNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKGSEVF 843 ++ IF+S T+ G+VC+ +I+ C NV S +V KLG+ T +H NPYKL WL K S V Sbjct: 396 RSMIFRSRCTVQGRVCNLIINGGSCTNVASTTMVSKLGLPTQEHPNPYKLRWLSKDSGVR 455 Query: 844 VSHQATVSFKIGNKYKDEVLCDVVC-MDVFHLLLWRPWQYDRHVIHGGCNNTYSFQFGGT 1020 V Q +SF IG YKDEVLCDVV MD HLLL RPW+YDR+ H G +N Y F+ G Sbjct: 456 VDKQCIISFSIGKMYKDEVLCDVVVPMDACHLLLGRPWEYDRNTTHQGKDNVYIFKHQGK 515 Query: 1021 KIMLLP----SRNKGSP 1059 K+ L P R+ GSP Sbjct: 516 KVTLTPLPPNQRDYGSP 532 
Score = 93.2 bits (230), Expect(3) = 6e-55 Identities = 70/218 (32%), Positives = 102/218 (46%), Gaps = 29/218 (13%) Frame = +2 Query: 1082 FLTQAMFE*EMELGM-VYVLVSMPMEPEASTI-PKQVRVLIDEFFYMFPEDLPTEFLPLQ 1255 FL++A E+ V +L+S + E +T+ P V LI F +FP++LP+ PL+ Sbjct: 543 FLSEAAMIKEIRQAQPVLMLLSREVNQEENTVVPTAVAPLIQRFQEVFPDELPSGLPPLR 602 Query: 1256 SSKSGLDV-------RGPLYRID--------HIIE*VL------RSMRSCT------DRS 1354 + +D+ P YR D H IE ++ S+ C + Sbjct: 603 GIEHHIDLVPGSVLPNKPAYRCDPNATKELQHQIEELMAKGFVRESLSPCAVPALLVPKK 662 Query: 1355 RSLWPRSMFESA*AREXXXXXXXXXXMDHGVCALTIEPLTGVVVVSNLDLKSGYHQIRIR 1534 W A +D ++ L+G + S +DL+ GYHQ+RIR Sbjct: 663 DGTWRMCTDSRAINNITVKYRFPIPRLDD-----MLDELSGASIFSKIDLRQGYHQVRIR 717 Query: 1535 SGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRVMNQ 1648 GD+WK FK GL+E LVMPFGLSNAP TFMR+M + Sbjct: 718 EGDEWKTAFKTKHGLYEWLVMPFGLSNAPSTFMRLMTE 755 Score = 36.6 bits (83), Expect(3) = 6e-55 Identities = 47/173 (27%), Positives = 69/173 (39%), Gaps = 18/173 (10%) Frame = +2 Query: 179 ETEEQRVSHYIGGLHL*IQNVLNMLGLVTVSKV*K*SFQYEKQ------------LIHQG 322 E EQR++ ++ GL I + M L + V S + EK + Sbjct: 213 EKSEQRIARFLEGLDKNIAAEVRMQPLWSYDDVVNLSLRVEKMGKTKPVATRPKPVFRPY 272 Query: 323 SSAFDNAGPSTNQQ*A-----STSTPKLGQPQQLTRSGGCCFLCCDLGHRQSKCRKKR*G 487 SS N P T Q + PK+ P L+R CF C GH + C R Sbjct: 273 SSVKINDPPKTTPQSTVDKGKAPMNPKINPP--LSRDKIKCFQCQGFGHFRKDCPSAR-T 329 Query: 488 LFVEEVTEDDKAVIV*SELDFDTSDDLAGDEELVEGDTSP-MLVVHSDFA*SL 643 L EV E ++ +V +++ + L +E E +TSP +V H D SL Sbjct: 330 LTAIEVAEWEREGLV----EYEEDEALVLEEVESEKETSPDQIVAHPDTGHSL 378 >ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300012 [Fragaria vesca subsp. 
vesca] Length = 1034 Score = 123 bits (308), Expect(2) = 6e-48 Identities = 73/154 (47%), Positives = 93/154 (60%), Gaps = 3/154 (1%) Frame = +1 Query: 619 TQRLCLVP*WQENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHL 798 TQRL L QEN ++ IF+S+ TI K S +IDS CEN +SK+VV+ + T+KH Sbjct: 430 TQRL-LCSTKQENQ-RHSIFRSTCTIKEKPMSLIIDSGSCENFVSKKVVEHFNLLTMKHR 487 Query: 799 NPYKLAWLKKGSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIH 978 PY + W+KKG EV ++ VS IG Y+DEV CDVV MD H+LL +PWQ+D + IH Sbjct: 488 APYAIGWIKKGLEVRITETCKVSISIGKFYQDEVECDVVDMDASHVLLGKPWQHDVNTIH 547 Query: 979 GGCNNTYSFQFGGTKIMLLPS---RNKGSPKPSN 1071 G NT SF + I L P N SPK SN Sbjct: 548 NGRENTVSFIWEKHHITLKPKTKPTNLVSPKESN 581 Score = 97.1 bits (240), Expect(2) = 6e-48 Identities = 71/193 (36%), Positives = 96/193 (49%), Gaps = 30/193 (15%) Frame = +2 Query: 1160 EASTIPKQVRVLIDEFFYMFPEDLPTEFLPLQSSKSGLDVRG-------PLYRID----- 1303 E IPK+V+ L+ +F + +DLP E P++ + +D+ P YR+ Sbjct: 618 EEKKIPKEVQQLLQDFEELLADDLPNELPPMRDIQHQIDLVSGASLPNLPHYRMSPKENE 677 Query: 1304 ---HIIE*VLR------SMRSCT---------DRSRSLWPRSMFESA*AREXXXXXXXXX 1429 IE +LR SM C DRS W + A + Sbjct: 678 ILKEKIEELLRKGHIRESMSPCAVPVLLVPKKDRS---WRMCVDSRAINKITIKYRFPIP 734 Query: 1430 XMDHGVCALTIEPLTGVVVVSNLDLKSGYHQIRIRSGDKWKPTFKMLEGLFE*LVMPFGL 1609 ++ ++ L G VV S +DL+SGYHQIRI+ GD+WK FK +GL+E LVMPFGL Sbjct: 735 QLED-----MLDVLGGSVVFSKIDLRSGYHQIRIKLGDEWKTAFKSKDGLYEWLVMPFGL 789 Query: 1610 SNAPITFMRVMNQ 1648 SNAP TFMRVMNQ Sbjct: 790 SNAPSTFMRVMNQ 802 >ref|XP_007221295.1| hypothetical protein PRUPE_ppa024499mg, partial [Prunus persica] gi|462417929|gb|EMJ22494.1| hypothetical protein PRUPE_ppa024499mg, partial [Prunus persica] Length = 1364 Score = 126 bits (317), Expect(2) = 2e-45 Identities = 96/288 (33%), Positives = 145/288 (50%), Gaps = 13/288 (4%) Frame = +1 Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831 ++W + IF + + C +IDS NVISK V +L ++ H +P+ +AW+ K Sbjct: 359 DSWKRTSIFHTYVPCNNQTCKLVIDSGSTMNVISKSAVTRLNLKPEPHPHPFHVAWVDK- 417 Query: 832 
SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011 +++ V+ + VS K+G +D + D++ M+V H+LL RPW YD V + G NTY+FQ Sbjct: 418 TKLPVTERCLVSLKLGTCDED-IYLDLLPMNVAHVLLGRPWLYDHCVQNCGRENTYTFQH 476 Query: 1012 GGTKIMLLPSRNKGSPKPSNQQ*LSHSSYVRVGNGVRHGVC------------SSKHAYG 1155 G I L P+ P +N ++ SS + GN H + S+ +Y Sbjct: 477 EGKSITLRPANPAIKPTKTN---ITTSSPSQTGNVSGHQLALLSYGEFEKEKISAAPSYQ 533 Query: 1156 TRGVYHS*AGEGSDR*VFLHVSRGLANRVSTLA-IFKIRT*CQGATLSN*SHYRMSPKEY 1332 H E SD V L L N + + I G+ L N HYRM+ E Sbjct: 534 QPEPLHQLLNEFSD--VMLD---DLPNELPPMRDIQHAIDLVPGSQLLNLPHYRMNSSER 588 Query: 1333 EELHRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN*AINR 1476 EL+ Q++ L+ K + LSS VP LLT +K+GS RMC+D+ AIN+ Sbjct: 589 AELNTQIQGLLDKGFIRHSLSSCAVPVLLTPKKDGSWRMCVDSRAINK 636 Score = 84.7 bits (208), Expect(2) = 2e-45 Identities = 40/61 (65%), Positives = 47/61 (77%) Frame = +2 Query: 1460 IEPLTGVVVVSNLDLKSGYHQIRIRSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRV 1639 +E L G S +DL+SGYHQIRIR GD+WK FK +GL+E LVMPFG+SNAP TFMRV Sbjct: 652 LEELAGSKWFSKIDLRSGYHQIRIREGDEWKTAFKTPDGLYEWLVMPFGMSNAPSTFMRV 711 Query: 1640 M 1642 M Sbjct: 712 M 712 >emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera] Length = 1521 Score = 107 bits (266), Expect(2) = 5e-43 Identities = 52/142 (36%), Positives = 87/142 (61%) Frame = +1 Query: 649 QENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKK 828 +E+W + IFQ+ + G++C+ +ID N+ S+E+V+KL ++T +H NP+++AW+ Sbjct: 401 EEDWRRISIFQTRISCHGRLCTMIIDGGSSLNIASQELVEKLNLKTERHPNPFRVAWVND 460 Query: 829 GSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQ 1008 S + VS + V+F G +++ V C+V+ + V H+LL RPW +DR V H G NTY+ Sbjct: 461 TS-IPVSFRCLVTFLFGKDFEESVWCEVLPIKVSHILLGRPWLFDRKVQHDGYENTYALI 519 Query: 1009 FGGTKIMLLPSRNKGSPKPSNQ 1074 G K +L P + K SN+ Sbjct: 520 HNGRKKILRPMKEVPPIKKSNE 541 Score = 96.7 bits (239), Expect(2) = 5e-43 Identities = 67/219 (30%), Positives = 106/219 (48%), Gaps = 31/219 (14%) Frame = +2 Query: 1085 LTQAMFE*EM-ELGMVYVLVSMPMEP---EASTIPKQVRVLIDEFFYMFPEDLPTEFLPL 
1252 LT FE E E +++ L++ +E + P R ++D+F ++P +LP E P+ Sbjct: 549 LTMCQFENESKETXVIFALMARKVEEFKEQDKEYPANARKILDDFSDLWPVELPNELPPM 608 Query: 1253 QSSKSGLDV-------RGPLYR------------IDHIIE*--VLRSMRSC------TDR 1351 + + +D+ P YR +D ++ + S+ C T + Sbjct: 609 RDIQHAIDLIPGASLPNLPAYRMNPTEHAELKRQVDELLTKGFIRESLSPCGVPALLTPK 668 Query: 1352 SRSLWPRSMFESA*AREXXXXXXXXXXMDHGVCALTIEPLTGVVVVSNLDLKSGYHQIRI 1531 W + A + +D ++ + G V+ S +DL+SGYHQIRI Sbjct: 669 KDGSWRMCVDSRAINKITIKYRFPIPRLDD-----MLDMMVGSVIFSKIDLRSGYHQIRI 723 Query: 1532 RSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRVMNQ 1648 R GD+WK +FK +GL+E LVMPFGL+NAP TFMR+M Q Sbjct: 724 RPGDEWKTSFKTKDGLYEWLVMPFGLTNAPSTFMRIMTQ 762 >ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum] gi|557089351|gb|ESQ30059.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum] Length = 382 Score = 149 bits (377), Expect(3) = 9e-41 Identities = 71/146 (48%), Positives = 99/146 (67%), Gaps = 1/146 (0%) Frame = +1 Query: 622 QRLCLVP*-WQENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHL 798 +R+CL P ++E WL+ IF+S+ TI GK+C+ +IDS NV+S+ V+KLG++ H Sbjct: 194 RRICLAPVGYEEPWLRTNIFRSTCTIKGKLCNLVIDSGSSRNVVSETAVKKLGLKREDHP 253 Query: 799 NPYKLAWLKKGSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIH 978 PY LAW+ +G++V ++H+A VSF IG YKD + CD+ MDV HL+L RPWQ+DR H Sbjct: 254 APYALAWITEGTDVKITHRALVSFSIGAFYKDTIYCDIAPMDVSHLILGRPWQFDRDTCH 313 Query: 979 GGCNNTYSFQFGGTKIMLLPSRNKGS 1056 G NTYSF F KI+LLP+ S Sbjct: 314 NGKKNTYSFVFENRKIVLLPNPEPAS 339 Score = 43.9 bits (102), Expect(3) = 9e-41 Identities = 43/179 (24%), Positives = 81/179 (45%), Gaps = 24/179 (13%) Frame = +2 Query: 155 VISENNNIETEEQRVSHYIGGLHL*IQNVLNMLGLVTVSKV*K*SFQYEKQLIHQGSSAF 334 +++ N +T+ Q VS +IGGL +QN L TV++ + + +E Q + S++ Sbjct: 25 LLTRNELNDTQIQLVSRFIGGLRPQLQNSLTQFDPSTVAEAHRRALAFETQ--SKAGSSW 82 Query: 335 DNAG------PSTNQQ*ASTSTPKLGQPQQLTRSGGC----------------CFLCCDL 448 N+G T+ + +S +P++ + Q R+ C+ C + Sbjct: 83 TNSGNWRPRLTGTDTENSSHDSPEVSKSQTAPRNSTTLDESTLRRSTRPPALKCYSCGEP 142 Query: 449 
GHRQSKC-RKKR*GLFVEEVTEDDKAVIV*SELDFDTSDDLAGDEELVEGDT-SPMLVV 619 GHRQ+ C ++R GL +E+ + D D +E L GD+ +P+L++ Sbjct: 143 GHRQTACPNQQRRGLLLEDTEGVYNSA--------DEEDTGIYEETLTSGDSNAPVLML 193 Score = 23.1 bits (48), Expect(3) = 9e-41 Identities = 10/23 (43%), Positives = 15/23 (65%) Frame = +3 Query: 84 VYQYLQNLRPRFRSVDD*TTQFY 152 ++ LQNLR R+VD+ +FY Sbjct: 1 MFTRLQNLRQGSRTVDEYAEEFY 23 >ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema salsugineum] gi|557103259|gb|ESQ43622.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema salsugineum] Length = 367 Score = 156 bits (395), Expect(2) = 2e-40 Identities = 77/158 (48%), Positives = 102/158 (64%), Gaps = 1/158 (0%) Frame = +1 Query: 622 QRLCLVP*-WQENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHL 798 + +CL P +E WL+ IFQS+ TI GKVC F++DS C NVI+++ +KLG++ H Sbjct: 203 RHVCLAPVVLEEPWLRTNIFQSTCTIKGKVCRFVVDSGSCRNVIAEDAARKLGLKREDHP 262 Query: 799 NPYKLAWLKKGSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIH 978 PYKL WLK+G E+ + H+ VSF IG+ YKD++ CDV MDV HLLL PWQYDR V+H Sbjct: 263 APYKLTWLKQGVEIRIEHRCLVSFSIGSHYKDKIYCDVALMDVSHLLLGTPWQYDRSVMH 322 Query: 979 GGCNNTYSFQFGGTKIMLLPSRNKGSPKPSNQQ*LSHS 1092 G N+YSF F KI+L S +P S SH+ Sbjct: 323 DGRRNSYSFIFENRKIVLFSSPQPPAPSTSCVSQNSHN 360 Score = 38.5 bits (88), Expect(2) = 2e-40 Identities = 48/173 (27%), Positives = 74/173 (42%), Gaps = 26/173 (15%) Frame = +2 Query: 179 ETEEQRVSHYIGGLHL*IQNVLNMLGLVTVSKV*K*SFQYEKQLI------HQGSSAFDN 340 ++E Q VS +I GL +Q+ + TVS+ + + +E+Q + G S Sbjct: 34 DSEVQLVSRFISGLRPQLQSAMAQFDPDTVSEAHRRAVAFEQQFKSSVTGWNSGFSRSRM 93 Query: 341 AGPSTNQ-----------Q*ASTSTP----KLGQPQQLTRSGGC----CFLCCDLGHRQS 463 G +T++ A+TS G L RS CF C + GH Q+ Sbjct: 94 TGTATSEGSHGQAHKKDTTEATTSNTLPVANSGTEPTLRRSSQPNALRCFACGEPGHLQT 153 Query: 464 KCRKK-R*GLFVEEVTEDDKAVIV*SELDFDTSDDLAGDEELVEGDTSPMLVV 619 C K+ R GLF +E D +E +FD+ E+ GDTSP L++ Sbjct: 154 ACPKQTRRGLFGDETKWDKDDAADDNEDEFDSE----VPEDHHHGDTSPSLML 202 >ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prunus 
persica] gi|462402874|gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica] Length = 1493 Score = 160 bits (405), Expect(2) = 2e-40 Identities = 102/286 (35%), Positives = 154/286 (53%), Gaps = 1/286 (0%) Frame = +1 Query: 622 QRLCLVP*WQENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLN 801 QR+ L P +E ++ IF+S +I KVC ++D+ CEN +SK++V+ L + T H++ Sbjct: 420 QRVLLAP--KEEGQRHNIFRSLCSIKNKVCDVIVDNGSCENFVSKKLVEYLQLSTEPHVS 477 Query: 802 PYKLAWLKKGSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHG 981 PY L W+KKG V V+ V IG Y+D+VLCDV+ MD H+LL RPWQ+D Sbjct: 478 PYSLGWVKKGPSVRVAETCRVPLSIGKHYRDDVLCDVIDMDACHILLGRPWQFDVDATFK 537 Query: 982 GCNNTYSFQFGGTKIMLLPSRNKGSPKPSNQQ*LSHSSYVRVGNGVRHGVCSSKHAYGTR 1161 G +N F + KI + + +PS +Q L SS++ + + + + K A G Sbjct: 538 GRDNVILFSWNNRKIAM------ATTQPSRKQELRSSSFLTLISNEQELNEAVKEAEGEG 591 Query: 1162 GVYHS*AGEGSDR*VFLHVSRGLANRVSTLAIFKIR-T*CQGATLSN*SHYRMSPKEYEE 1338 + S L S L N + + + R GA+L N HYRMSPKE + Sbjct: 592 DIPQDVQQILSQFQELL--SENLPNELPPMRDIQHRIDLVHGASLPNLPHYRMSPKENDI 649 Query: 1339 LHRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN*AINR 1476 L Q+E+L+ K + E LS VP LL +K+ + RMC+D+ A+N+ Sbjct: 650 LREQIEELLRKGFIRESLSPCAVPVLLVPKKDKTWRMCVDSRAVNK 695 Score = 34.3 bits (77), Expect(2) = 2e-40 Identities = 41/187 (21%), Positives = 70/187 (37%), Gaps = 42/187 (22%) Frame = +2 Query: 158 ISENNNI-ETEEQRVSHYIGGLHL*IQNVLNMLGLVTVSKV*K*SFQYE----------- 301 ++E N++ ET+ Q+V+ Y GL IQ + M + T+ + + + E Sbjct: 230 LAERNHLTETDNQKVARYNNGLKSSIQEKIGMQNIWTLQEAINMALKAELLEKEKRQPNF 289 Query: 302 ------------------------KQLIHQGSSAFDNAGPSTNQQ*ASTSTPKLGQPQQL 409 +Q G + G + N S+ GQP+ Sbjct: 290 RRNKTEASDYTAGASSGAGDKEKAQQQNSGGMTKPATVGQNKNFNEGSSRNYNRGQPRNQ 349 Query: 410 TRSG------GCCFLCCDLGHRQSKCRKKR*GLFVEEVTEDDKAVIV*SELDFDTSDDLA 571 +++ C+ C GHR + C +++ F+EE ED+ E D +D A Sbjct: 350 SQNPYAKPMTDICYRCQKPGHRSNVCPERKQANFIEEADEDE-------EKDEVGENDYA 402 Query: 572 GDEELVE 592 G E VE Sbjct: 403 GAEFAVE 409 Score = 91.7 bits (226), Expect = 9e-16 Identities = 66/186 (35%), 
Positives = 93/186 (50%), Gaps = 27/186 (14%) Frame = +2 Query: 1172 IPKQVRVLIDEFFYMFPEDLPTEFLPLQSSKSGLD-VRG------PLYRID--------H 1306 IP+ V+ ++ +F + E+LP E P++ + +D V G P YR+ Sbjct: 593 IPQDVQQILSQFQELLSENLPNELPPMRDIQHRIDLVHGASLPNLPHYRMSPKENDILRE 652 Query: 1307 IIE*VLR------SMRSCT------DRSRSLWPRSMFESA*AREXXXXXXXXXXMDHGVC 1450 IE +LR S+ C + W + A + ++ Sbjct: 653 QIEELLRKGFIRESLSPCAVPVLLVPKKDKTWRMCVDSRAVNKIKVKYRFSIPRLED--- 709 Query: 1451 ALTIEPLTGVVVVSNLDLKSGYHQIRIRSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITF 1630 ++ L+G V S +DL+SGYHQIRIR GD+WK FK +GLFE LVMPFGLSNAP TF Sbjct: 710 --ILDVLSGSKVFSKIDLRSGYHQIRIRPGDEWKTAFKSKDGLFEWLVMPFGLSNAPSTF 767 Query: 1631 MRVMNQ 1648 MR+MNQ Sbjct: 768 MRLMNQ 773 >ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao] gi|508718388|gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao] Length = 505 Score = 156 bits (395), Expect(2) = 4e-40 Identities = 75/137 (54%), Positives = 95/137 (69%) Frame = +1 Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831 E+WL++ IF + T GKVC+ +IDS CENVI+ +V+KL +QT H +PYKL WL+KG Sbjct: 220 ESWLRHNIFYTRCTSQGKVCNVIIDSGSCENVIANYMVEKLKLQTEVHPHPYKLQWLRKG 279 Query: 832 SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011 +EV V+ + V F IGNKY+DEV CD++ MD HLLL RPWQYDR H G NTYSF Sbjct: 280 NEVKVTKRCCVQFSIGNKYEDEVWCDIIPMDACHLLLGRPWQYDRRAHHDGYKNTYSFIK 339 Query: 1012 GGTKIMLLPSRNKGSPK 1062 G KIML P + + PK Sbjct: 340 DGAKIMLTPLKPENRPK 356 Score = 37.4 bits (85), Expect(2) = 4e-40 Identities = 42/163 (25%), Positives = 68/163 (41%), Gaps = 16/163 (9%) Frame = +2 Query: 179 ETEEQRVSHYIGGLHL*IQNVLNMLGLVTVSKV*K*SFQYEKQLIHQGSSAFDNAGPSTN 358 E EEQ V+ Y+GGL++ I +V+ + ++ V + + + EKQ + S + S + Sbjct: 50 EPEEQTVARYLGGLNVEIADVVQLQPYWNLNDVIRLALKVEKQRSRKRSMSSSRQQESIS 109 Query: 359 QQ*ASTST----PKLGQPQ---------QLTRSGGC---CFLCCDLGHRQSKCRKKR*GL 490 + +S PK+ + TR+ CF C GH C +R Sbjct: 110 NDESQSSVTIPPPKVNSSKTASSNDKETTFTRASNVNKKCFKCQGFGHIAFDCPNRR--- 166 Query: 491 
FVEEVTEDDKAVIV*SELDFDTSDDLAGDEELVEGDTSPMLVV 619 + V E+D A E +D DD + E V D L+V Sbjct: 167 IISLVEEEDYANWEKLEPVYDEYDD--EEIEEVSADHGEALIV 207 >emb|CAN69233.1| hypothetical protein VITISV_003380 [Vitis vinifera] Length = 1292 Score = 108 bits (269), Expect(2) = 5e-40 Identities = 51/142 (35%), Positives = 87/142 (61%) Frame = +1 Query: 649 QENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKK 828 +E+W + IFQ+ + G++C+ +ID N+ S+E+V+KL ++T +H NP+++AW+ Sbjct: 319 EEDWRRTSIFQTRISCQGRLCTMIIDGGSSLNIASQELVEKLNLKTERHPNPFRVAWVND 378 Query: 829 GSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQ 1008 S + VS + V+F G +++ V C+V+ + V H+LL RPW +DR V H G NTY+ Sbjct: 379 TS-IPVSFRCLVTFLFGKDFEESVWCEVLPIKVSHILLGRPWLFDRXVQHDGYENTYALI 437 Query: 1009 FGGTKIMLLPSRNKGSPKPSNQ 1074 G K +L P + K S++ Sbjct: 438 HNGCKTILRPMKEVSPIKKSDE 459 Score = 85.5 bits (210), Expect(2) = 5e-40 Identities = 64/219 (29%), Positives = 104/219 (47%), Gaps = 31/219 (14%) Frame = +2 Query: 1085 LTQAMFE*EM-ELGMVYVLVSMPMEP---EASTIPKQVRVLIDEFFYMFPEDLPTEFLPL 1252 L+ FE E E +++ L++ +E + P VR ++D+F +P +LP + P+ Sbjct: 467 LSMCQFENESKETKVIFALMARKVEESKEQDKEYPANVRKILDDFSDFWPTELPNQLPPM 526 Query: 1253 QSSKSGLDV-------RGPLYR------------IDHII-E*VLRSMRS-------CTDR 1351 + + +D+ P YR +D ++ + +R S T + Sbjct: 527 RDVQHAIDLIPGASLPNLPAYRMNPTEHAELKRQVDELLTKGFIRESLSPYGVPALLTPK 586 Query: 1352 SRSLWPRSMFESA*AREXXXXXXXXXXMDHGVCALTIEPLTGVVVVSNLDLKSGYHQIRI 1531 W + A + +D ++ + V+ S +DL+SGYHQIRI Sbjct: 587 KDGSWRMCVDSRAMNKITIKYRFPIPRLDD-----MLDMMVRSVIFSKIDLRSGYHQIRI 641 Query: 1532 RSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRVMNQ 1648 R GD+WK +FK +GL+E LVM FGL+NAP TFMR+M Q Sbjct: 642 RPGDEWKTSFKTKDGLYEWLVMLFGLTNAPSTFMRIMTQ 680 >ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica] gi|462405925|gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica] Length = 1485 Score = 157 bits (397), Expect(2) = 1e-39 Identities = 103/287 (35%), Positives = 154/287 (53%), Gaps = 2/287 (0%) Frame = +1 Query: 622 
QRLCLVP*WQENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLN 801 QR+ L P +E ++ IF+S +I KVC ++D+ CEN +SK++V+ L + T H++ Sbjct: 409 QRVLLAP--REEGQRHSIFRSLCSIKNKVCDVIVDNGSCENFVSKKLVEYLQLSTEPHVS 466 Query: 802 PYKLAWLKKGSEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHG 981 PY L W+KKG V V+ V IG Y+DEVLCDV+ MD H+LL RPWQ+D Sbjct: 467 PYSLGWVKKGPSVRVAETCRVPLSIGKHYRDEVLCDVIDMDACHILLGRPWQFDVDATFK 526 Query: 982 GCNNTYSFQFGGTKIMLLPSRNKGSPKPSNQQ*LSHSSYVRVGNGVRHGVCSSKHAYGTR 1161 G +N F + KI + ++ KPS + SS++ + + + + K A G Sbjct: 527 GRDNVILFSWNNRKIAMTTTQ---PSKPSVEVKTRSSSFLTLISNEQELNEAVKEAEGEG 583 Query: 1162 GVYHS*AGEGSDR*VFLHV-SRGLANRVSTLAIFKIR-T*CQGATLSN*SHYRMSPKEYE 1335 + S F + S L N + + + R GA+L N HYRMSPKE + Sbjct: 584 DIPQDVQQILSQ---FQELFSENLPNELPPMRDIQHRIDLVPGASLQNLPHYRMSPKEND 640 Query: 1336 ELHRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN*AINR 1476 L Q+E+L+ K + E LS VP LL +K+ + RMC+D+ AIN+ Sbjct: 641 ILREQIEELLRKGFIRESLSPCAVPVLLVPKKDKTWRMCVDSRAINK 687 Score = 34.7 bits (78), Expect(2) = 1e-39 Identities = 41/187 (21%), Positives = 70/187 (37%), Gaps = 42/187 (22%) Frame = +2 Query: 158 ISENNNI-ETEEQRVSHYIGGLHL*IQNVLNMLGLVTVSKV*K*SFQYE----------- 301 ++E N++ ET+ Q+V+ Y GL + IQ + M + T+ + + + E Sbjct: 219 LAERNHLTETDNQKVARYNNGLKISIQEKIGMQNIWTLQEAINMALKAELLEKEKRQPNF 278 Query: 302 ------------------------KQLIHQGSSAFDNAGPSTNQQ*ASTSTPKLGQPQQL 409 +Q G + G + N S+ GQP+ Sbjct: 279 RRNTTEASDYTAGASSGAGDKGKAQQQSSGGMTKPTTVGQNKNFNEGSSRNYNRGQPRNQ 338 Query: 410 TRS------GGCCFLCCDLGHRQSKCRKKR*GLFVEEVTEDDKAVIV*SELDFDTSDDLA 571 +++ C+ C GHR + C + + F+EE ED+ E D +D A Sbjct: 339 SQNLYAKPMTDICYRCQKPGHRSNVCPELKQANFIEEADEDE-------ENDEVGENDYA 391 Query: 572 GDEELVE 592 G E VE Sbjct: 392 GAEFAVE 398 Score = 91.7 bits (226), Expect = 9e-16 Identities = 64/186 (34%), Positives = 92/186 (49%), Gaps = 27/186 (14%) Frame = +2 Query: 1172 IPKQVRVLIDEFFYMFPEDLPTEFLPLQSSKSGLDV-------RGPLYRID--------H 1306 IP+ V+ ++ +F +F E+LP E P++ + +D+ P YR+ Sbjct: 585 
IPQDVQQILSQFQELFSENLPNELPPMRDIQHRIDLVPGASLQNLPHYRMSPKENDILRE 644 Query: 1307 IIE*VLR------SMRSCT------DRSRSLWPRSMFESA*AREXXXXXXXXXXMDHGVC 1450 IE +LR S+ C + W + A + ++ Sbjct: 645 QIEELLRKGFIRESLSPCAVPVLLVPKKDKTWRMCVDSRAINKITVKYRFPIPRLED--- 701 Query: 1451 ALTIEPLTGVVVVSNLDLKSGYHQIRIRSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITF 1630 ++ L+G V S +DL+SGYHQIRIR GD+WK FK +GLFE LVMPFGLSN P TF Sbjct: 702 --MLDVLSGSKVFSKIDLRSGYHQIRIRPGDEWKTAFKSKDGLFEWLVMPFGLSNTPSTF 759 Query: 1631 MRVMNQ 1648 MR+MNQ Sbjct: 760 MRLMNQ 765 >ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508703673|gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1452 Score = 155 bits (393), Expect = 4e-35 Identities = 106/286 (37%), Positives = 146/286 (51%), Gaps = 11/286 (3%) Frame = +1 Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831 E+W + IF++ GKVC +ID EN+ISKE V KL + T KH PYK+ WLKKG Sbjct: 318 EDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKG 377 Query: 832 SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011 EV V+ Q V F +G+ DE LCDVV MDV H+L+ RPW YD ++H NTYSF Sbjct: 378 HEVPVTTQCLVKFTMGDNSDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYK 437 Query: 1012 GGTKIMLLPSRNKGSPKPSNQQ*LSHSSYVRVGNGVRHG-VCSSKHAYGTRGVYHS*AGE 1188 + L P R + + K +N + + Y+ N G +A T+ + + Sbjct: 438 NNKRYTLYPLREE-TKKSANHKISKITRYLSAENFEAEGSEMGIMYALVTKHLKSDQMSK 496 Query: 1189 G----SDR*VFLHVSRGLANRVSTLAIFKIRT------*CQGATLSN*SHYRMSPKEYEE 1338 ++ L L N ++ +R+ GA L N YRM P + E Sbjct: 497 SPQYPTEIQQLLKEFGELFNEDLPKSLPPLRSIQHAIDLVPGAALPNLPAYRMPPMQRAE 556 Query: 1339 LHRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN*AINR 1476 + RQVE+L K V E S PALL +K+GS RMC+D+ AIN+ Sbjct: 557 VQRQVEELFEKGLVRESKSPCACPALLAPKKDGSWRMCVDSRAINK 602 Score = 99.0 bits (245), Expect = 6e-18 Identities = 74/221 (33%), Positives = 109/221 (49%), Gaps = 32/221 (14%) Frame = +2 Query: 1076 SNFLTQAMFE*E-MELGMVYVLVSMPMEPEAST----IPKQVRVLIDEFFYMFPEDLPTE 1240 + +L+ FE E E+G++Y LV+ ++ + + P +++ 
L+ EF +F EDLP
Sbjct: 463 TRYLSAENFEAEGSEMGIMYALVTKHLKSDQMSKSPQYPTEIQQLLKEFGELFNEDLPKS 522

Query: 1241 FLPLQSSKSGLDV-------RGPLYR------------IDHIIE*VL----RSMRSC--- 1342
PL+S + +D+ P YR ++ + E L +S +C
Sbjct: 523 LPPLRSIQHAIDLVPGAALPNLPAYRMPPMQRAEVQRQVEELFEKGLVRESKSPCACPAL 582

Query: 1343 -TDRSRSLWPRSMFESA*AREXXXXXXXXXXMDHGVCALTIEPLTGVVVVSNLDLKSGYH 1519
+ W + A + +D ++ L G V S +DLKSGYH
Sbjct: 583 LAPKKDGSWRMCVDSRAINKITIKYRFPIPRLDE-----MLDQLVGSRVFSKIDLKSGYH 637

Query: 1520 QIRIRSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRVM 1642
QIR+R GD+WK FK +GLFE LVMPFGLSNAP TFMRVM
Sbjct: 638 QIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAPSTFMRVM 678

>ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobroma cacao]
gi|508727408|gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao]
Length = 1306

Score = 155 bits (392), Expect = 5e-35
Identities = 106/286 (37%), Positives = 148/286 (51%), Gaps = 11/286 (3%)
Frame = +1

Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831
E+W + IF++ GKVC +ID EN+ISKE V KL + T KH PYK+ WLKKG
Sbjct: 319 EDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKG 378

Query: 832 SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011
EV V+ Q V F +G+ DE LCDVV MDV H+L+ RPW YD ++H NTYSF
Sbjct: 379 HEVPVTTQYLVKFTMGDNLDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTEPNTYSFYN 438

Query: 1012 GGTKIMLLPSRNKGSPKPSNQQ*LSHSSYVRVGNGVRHG-VCSSKHAYGTRGVYHS*AGE 1188
+ P + + + K +N + + Y+ V N G +A T+ + G+
Sbjct: 439 DNKRYTSYPLKEE-TKKSANSKINKITGYLSVENFEAEGSEMGIMYALVTKHLKSDQMGK 497

Query: 1189 G----SDR*VFLHVSRGLANRVSTLAIFKIRT------*CQGATLSN*SHYRMSPKEYEE 1338
++ L L N ++ +R+ GA L N YRM P + E
Sbjct: 498 SPQYPTEIQQLLKEFGELFNEDLPKSLPPLRSIQHAIDLVPGAALPNLPAYRMPPMQRVE 557

Query: 1339 LHRQVEKLVAKVHV*ECLSS*VVPALLTQEKNGSRRMCIDN*AINR 1476
+ RQVE+L+ K V E S PALL +K+GS RMC+D+ AIN+
Sbjct: 558 VQRQVEELLEKGLVRESKSPCACPALLAPKKDGSWRMCVDSRAINK 603

Score = 96.7 bits (239), Expect = 3e-17
Identities = 76/228 (33%), Positives = 112/228 (49%), Gaps = 34/228 (14%)
Frame = +2

Query: 1061 NPPIN--SNFLTQAMFE*E-MELGMVYVLVSMPMEPE----ASTIPKQVRVLIDEFFYMF 1219
N IN + +L+ FE E E+G++Y LV+ ++ + + P +++ L+ EF +F
Sbjct: 457 NSKINKITGYLSVENFEAEGSEMGIMYALVTKHLKSDQMGKSPQYPTEIQQLLKEFGELF 516

Query: 1220 PEDLPTEFLPLQSSKSGLDV-------RGPLYR------------IDHIIE*VL----RS 1330
EDLP PL+S + +D+ P YR ++ ++E L +S
Sbjct: 517 NEDLPKSLPPLRSIQHAIDLVPGAALPNLPAYRMPPMQRVEVQRQVEELLEKGLVRESKS 576

Query: 1331 MRSC----TDRSRSLWPRSMFESA*AREXXXXXXXXXXMDHGVCALTIEPLTGVVVVSNL 1498
+C + W + A + +D ++ L G V S +
Sbjct: 577 PCACPALLAPKKDGSWRMCVDSRAINKITIKYRFPIPRLDE-----MLDQLVGSRVFSKI 631

Query: 1499 DLKSGYHQIRIRSGDKWKPTFKMLEGLFE*LVMPFGLSNAPITFMRVM 1642
DLKS YHQIR+R GD+WK FK +GLFE LVMPFGLSNAP TFMRVM
Sbjct: 632 DLKSEYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAPSTFMRVM 679

>ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
gi|508726763|gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
Length = 625

Score = 140 bits (352), Expect(2) = 6e-35
Identities = 69/137 (50%), Positives = 91/137 (66%)
Frame = +1

Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831
E+ L++ IF + T G VC+ +IDS CENV++ +V+KL + T H +PYKL WL+KG
Sbjct: 340 ESCLRHNIFYTRCTSQGNVCNVIIDSGSCENVVANYMVEKLKLPTEVHPHPYKLQWLRKG 399

Query: 832 SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011
+EV V+ + + F I NKY+DEV CDV+ MD HLLL RPWQYDR + G NTYSF
Sbjct: 400 NEVKVTKRCCIQFFIRNKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHYDGYKNTYSFIK 459

Query: 1012 GGTKIMLLPSRNKGSPK 1062
G KIML P + + PK
Sbjct: 460 DGVKIMLTPLKPEDRPK 476

Score = 36.6 bits (83), Expect(2) = 6e-35
Identities = 39/153 (25%), Positives = 67/153 (43%), Gaps = 20/153 (13%)
Frame = +2

Query: 179 ETEEQRVSHYIGGLHL*IQNVLNMLGLVTVSKV*K*SFQYEKQLIHQGSSAFDNAGPSTN 358
E EEQ ++ Y+GGL++ I +V+ + ++ V + + + EKQ + S + S +
Sbjct: 170 EPEEQTLARYLGGLNVEIADVVQLQPYWNLNDVIRLTLKVEKQQSRKRSMSSSRQQESIS 229

Query: 359 QQ*ASTST----PKLGQPQ---------QLTRSGGC---CFLCCDLGHRQSKCRKKR*GL 490
+ +S PK+ + TR+ CF C GH S C +R
Sbjct: 230 NDESQSSVTIPPPKVNSSKTASSNDKETTFTRASNVNKKCFKCQRFGHIASDCPSRRIIS 289

Query: 491 FVEEVTED----DKAVIV*SELDFDTSDDLAGD 577
VEE ED +K V E D + ++++ D
Sbjct: 290 LVEE--EDYVNWEKLEPVYDEYDDEEIEEVSAD 320

>ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao]
gi|508712364|gb|EOY04261.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao]
Length = 215

Score = 150 bits (378), Expect = 2e-33
Identities = 74/137 (54%), Positives = 93/137 (67%)
Frame = +1

Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831
E+WL++ IF + T GKVC+ +IDS CENVI+ +V+KL +QT +PYKL WL+KG
Sbjct: 47 ESWLRHNIFHARCTSQGKVCNVIIDSGSCENVIANYMVEKLKLQTEVLPHPYKLQWLRKG 106

Query: 832 SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011
+EV V+ V F IGNKY+DEV CDV+ MD LLL RPWQYDR H G NTYSF
Sbjct: 107 NEVKVTKHCCVQFSIGNKYEDEVWCDVIPMDACQLLLGRPWQYDRRAHHDGYKNTYSFIK 166

Query: 1012 GGTKIMLLPSRNKGSPK 1062
G KIML P +++ PK
Sbjct: 167 DGAKIMLTPLKSEDYPK 183

>ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobroma cacao]
gi|508716797|gb|EOY08694.1| Uncharacterized protein TCM_023754 [Theobroma cacao]
Length = 440

Score = 147 bits (372), Expect = 1e-32
Identities = 72/137 (52%), Positives = 93/137 (67%)
Frame = +1

Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831
E+WL++ IF + YT GKVC+ +IDS CENVI+ +V+KL + T H +PYKL WL+KG
Sbjct: 155 ESWLRHNIFYTRYTSQGKVCNVIIDSGSCENVIANYMVEKLKLPTEVHPHPYKLQWLRKG 214

Query: 832 SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011
+EV V+ + V F IG+KY+DEV CDV+ MD HLLL RPWQYDR + G N SF
Sbjct: 215 NEVKVTKRCCVQFSIGSKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHYDGYKNISSFIK 274

Query: 1012 GGTKIMLLPSRNKGSPK 1062
G KIML P + + PK
Sbjct: 275 DGVKIMLTPLKPEDRPK 291

>ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
gi|508702148|gb|EOX94044.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
Length = 546

Score = 130 bits (328), Expect(2) = 1e-32
Identities = 65/141 (46%), Positives = 83/141 (58%)
Frame = +1

Query: 652 ENWLQNYIFQSSYTIAGKVCSFMIDSNYCENVISKEVVQKLGIQTVKHLNPYKLAWLKKG 831
E+W + IF++ GKVC +ID EN+ISKE V KL + T KH PYK+ WLKKG
Sbjct: 327 EDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKG 386

Query: 832 SEVFVSHQATVSFKIGNKYKDEVLCDVVCMDVFHLLLWRPWQYDRHVIHGGCNNTYSFQF 1011
EV V+ Q V F +GN DE LCDVV MDV H+L+ RPW YD ++H NTYSF
Sbjct: 387 HEVPVTTQCLVKFTMGNNLDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYK 446

Query: 1012 GGTKIMLLPSRNKGSPKPSNQ 1074
+ L P R + +N+
Sbjct: 447 NNKRYTLYPLREETKKSANNK 467

Score = 37.7 bits (86), Expect(2) = 1e-32
Identities = 22/73 (30%), Positives = 41/73 (56%), Gaps = 5/73 (6%)
Frame = +2

Query: 1076 SNFLTQAMFE*E-MELGMVYVLVSMPMEPEAST----IPKQVRVLIDEFFYMFPEDLPTE 1240
+ +L+ FE E E+G+ Y LV+ ++ + + P +++ L+ EF +F EDLP
Sbjct: 472 TGYLSAENFEAEGSEMGITYALVTKHLKSDQMSKSPQYPTEIQQLLKEFGELFNEDLPKS 531

Query: 1241 FLPLQSSKSGLDV 1279
PL+S + +D+
Sbjct: 532 LPPLRSIQHAIDL 544