BLASTX nr result
ID: Sinomenium21_contig00020262
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00020262 (855 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

ref|XP_007027902.1| Retrotransposon-like protein [Theobroma caca...  176  8e-42
ref|XP_007099662.1| Gag protease polyprotein-like protein [Theob...  175  2e-41
ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [The...  174  5e-41
emb|CAN83518.1| hypothetical protein VITISV_035077 [Vitis vinifera]  173  7e-41
gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]  173  9e-41
ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502...  172  1e-40
ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...  172  1e-40
ref|XP_007027952.1| DNA/RNA polymerases superfamily protein [The...  172  2e-40
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...  169  2e-39
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...  168  2e-39
ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203...  168  3e-39
emb|CAN62233.1| hypothetical protein VITISV_010121 [Vitis vinifera]  167  4e-39
emb|CBI17376.3| unnamed protein product [Vitis vinifera]  167  5e-39
ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208...  166  1e-38
emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera]  166  1e-38
ref|XP_007010278.1| Uncharacterized protein TCM_043787 [Theobrom...  165  2e-38
gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]  165  2e-38
ref|XP_007049973.1| DNA/RNA polymerases superfamily protein [The...  164  4e-38
ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prun...
164 4e-38 ref|XP_007032220.1| Retrotransposon protein, putative [Theobroma... 163 7e-38 >ref|XP_007027902.1| Retrotransposon-like protein [Theobroma cacao] gi|508716507|gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao] Length = 654 Score = 176 bits (447), Expect = 8e-42 Identities = 111/326 (34%), Positives = 161/326 (49%), Gaps = 49/326 (15%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNA------------PMSTSGSVSQFPPRE---------------- 738 CY CGQPGH R +CP A P S++ SV+ RE Sbjct: 238 CYGCGQPGHIRRDCPMAHQSPDSARGSTQPASSAPSVAVSSGREVSGSRGRGAGTSSQGK 297 Query: 737 ---------------RMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSM 603 R+ ALT+ + + + ++ G++++C L D G+T SFIS Sbjct: 298 PSGSGHQSSIGRGQARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCF 357 Query: 602 VDKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLG 423 +LG V+ L++ST + E C+V++K + V+L VL+ L+FD +LG Sbjct: 358 ASRLGRGRVRREEQLVVSTPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILG 417 Query: 422 MDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQP------ITRGKMATLQEIEILA 261 M+WL+ HA VDC+HK V F P P F G + P I+ ++ I LA Sbjct: 418 MNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNLISVISARRLLRQGCIGYLA 477 Query: 260 GLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRM 81 + + K ++ + VV+E+ +VFP +LPGLPP RE++F I+LIP T+PISIPPYRM Sbjct: 478 VVKDSQAKIGDVT--QVSVVKEFVDVFPEELPGLPPEREVEFCIDLIPDTRPISIPPYRM 535 Query: 80 APXXXXXXXXXXXXXXXKGFIHPSMS 3 AP KGFI PS+S Sbjct: 536 APAELKELKDQLEDLLDKGFIRPSVS 561 >ref|XP_007099662.1| Gag protease polyprotein-like protein [Theobroma cacao] gi|508728474|gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma cacao] Length = 665 Score = 175 bits (444), Expect = 2e-41 Identities = 114/325 (35%), Positives = 162/325 (49%), Gaps = 48/325 (14%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNAPMSTSGSVSQ-------------FPPRE--------------- 738 C+QCGQ GH RS CP +T + S PPR+ Sbjct: 308 CFQCGQTGHIRSNCPRLGRATVVASSSPARTDIQRRDSSGLPPRQGVAIRSGVESNTPAH 367 Query: 737 -----------RMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFIS---GSMV 600 R+ A+TE + + + 
G I++ + Y LID+GS S++S S+ Sbjct: 368 PPSRPQTRTSTRVFAVTEDEAQVRPGAVTGTISLFDKDAYVLIDSGSDRSYVSTTFASIA 427 Query: 599 DKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGM 420 D+ L P++ +++ T G+++ N C C V++ E DL LE L+FD +LGM Sbjct: 428 DR-NLSPLEEE--IVIHTPLGEKLVRNSCYRDCGVRVGEEEFRGDLIPLEILDFDLILGM 484 Query: 419 DWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQP------ITRGKMATLQEIEILAG 258 DWL A+ A+VDCF K VV + + V+ G+ P I K+ LA Sbjct: 485 DWLTAHRANVDCFRKEVVLRNSKGAEIVFVGKRRVLPSCVISAIKASKLVQKGYSTYLAY 544 Query: 257 LIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMA 78 +I + E KL+D+P+V E+P+VFP DLPGLPP RE++F I+L+ GT PISIPPYRMA Sbjct: 545 VI--DTSKREPKLEDVPIVSEFPDVFPDDLPGLPPDRELEFPIDLLSGTAPISIPPYRMA 602 Query: 77 PXXXXXXXXXXXXXXXKGFIHPSMS 3 P KGFI PS+S Sbjct: 603 PAELKELKVQLQELVDKGFIRPSIS 627 >ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508708185|gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1515 Score = 174 bits (440), Expect = 5e-41 Identities = 113/325 (34%), Positives = 162/325 (49%), Gaps = 48/325 (14%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNAPMSTSGSVSQ-------------FPPRE--------------- 738 C+QCGQ GH RS CP +T + S PPR+ Sbjct: 292 CFQCGQTGHIRSNCPRLGRATVVASSSPARTDIQRRDSSGLPPRQGVAIPSGVESNTPAH 351 Query: 737 -----------RMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFISG---SMV 600 R+ A+TE + + + G +++ + Y LID+GS S++S S+V Sbjct: 352 PPSRPQTRTSTRVFAVTEDEAQVRPGAVTGTMSLFDKDAYVLIDSGSDRSYVSTTFVSIV 411 Query: 599 DKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGM 420 D+ L P++ +++ T G+++ N C C V++ E DL LE L+FD +LGM Sbjct: 412 DR-NLSPLEEE--IVIHTPLGEKLVRNSCYRDCGVRVGEEEFRGDLIPLEILDFDLILGM 468 Query: 419 DWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQP------ITRGKMATLQEIEILAG 258 DWL A+ A+VDCF K +V + V+ G+ P I K+ LA Sbjct: 469 DWLTAHRANVDCFRKEIVLRNSEGAEIVFVGKRRVLPSCVISAIKASKLVQKGYSTYLAY 528 Query: 257 LIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMA 78 +I + E KL+D+ +V E+P+VFP DLPGLPP RE++F I+L+PGT PISIPPYRMA Sbjct: 529 
VI--DTSKGEPKLEDVSIVSEFPDVFPDDLPGLPPDRELEFPIDLLPGTAPISIPPYRMA 586 Query: 77 PXXXXXXXXXXXXXXXKGFIHPSMS 3 P KGFI PS+S Sbjct: 587 PTELKELKVQLQELVDKGFIRPSIS 611 >emb|CAN83518.1| hypothetical protein VITISV_035077 [Vitis vinifera] Length = 1194 Score = 173 bits (439), Expect = 7e-41 Identities = 102/250 (40%), Positives = 140/250 (56%), Gaps = 5/250 (2%) Frame = -2 Query: 737 RMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSMVDKLGLIPVKATNLL 558 R+ A+T D + T +++ G + I LID GST SF+S S LG+ L Sbjct: 186 RVFAMTHRDAQTTFDVVIGTLQIHTLFARALIDPGSTHSFVSVSFAGLLGMSIDNMDFDL 245 Query: 557 ILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFH 378 ++ GD V N CIV I EM VDL +L+ +FD +LGM+WLA+YHA +DCF Sbjct: 246 FVAIPLGDSVVVNKILRDCIVMIGYREMTVDLVLLDLQDFDVILGMNWLASYHASIDCFG 305 Query: 377 KRVVFQIPRYPVFVYSGRTTTQPITRGKMATLQEIEILAG-----LIPEEDKTNEIKLDD 213 K V F IP P F + G+ +P+ ++ LQ +L L ++ N +KL+D Sbjct: 306 KIVTFNIPSRPDFGFEGKHVDKPLHM--ISALQASSLLRKGCQGFLAYVMNEENNLKLED 363 Query: 212 IPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMAPXXXXXXXXXXXXXX 33 IP+VR+YP+VFP DLPGLPP +E++F I++ GT PIS PYRMAP Sbjct: 364 IPIVRDYPDVFPDDLPGLPPEKEVEFTIDVALGTTPISKAPYRMAPLELKELKIQLQELL 423 Query: 32 XKGFIHPSMS 3 KGFI PS+S Sbjct: 424 DKGFIRPSVS 433 >gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. 
melo] Length = 871 Score = 173 bits (438), Expect = 9e-41 Identities = 102/284 (35%), Positives = 146/284 (51%), Gaps = 7/284 (2%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNAPMSTSGSVSQFPPRE-RMQALTEPDIEATKNLIEGMINICGRV 657 C++C Q GH CP P + + P + R+ A + E ++ G + + G Sbjct: 578 CFKCRQEGHTADRCPLRPTGIAQNQGAGAPLQGRVFATNRTEAEKAGTVVTGTLPVLGHY 637 Query: 656 MYTLIDAGSTISFISGSMVDKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYE 477 L D+GS+ SFIS + V L ++L +ST +G+ + + C ++I G+ Sbjct: 638 ALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHV 697 Query: 476 MPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSG------RTTT 315 + V L VL+ L+FD +LGMDWLAA HA +DC K V F P F + G Sbjct: 698 IEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVI 757 Query: 314 QPITRGKMATLQEIEILAGLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDF 135 I K+ + ILA ++ + + ++ L PVVR+YP+VFP +LPGLPP RE++F Sbjct: 758 SAIRASKLLSQGTWGILASVV--DTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEF 815 Query: 134 IIELIPGTKPISIPPYRMAPXXXXXXXXXXXXXXXKGFIHPSMS 3 IEL PGT PIS PYRMAP KGFI PS+S Sbjct: 816 AIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS 859 >ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502180 [Cicer arietinum] Length = 1235 Score = 172 bits (437), Expect = 1e-40 Identities = 109/328 (33%), Positives = 152/328 (46%), Gaps = 50/328 (15%) Frame = -2 Query: 836 ICYQCGQPGHRRSECPNAPMSTSGSVSQFP------------------------------ 747 +CYQ GQ GH R +CP S S + P Sbjct: 468 VCYQYGQIGHIRRDCPVDTTHPSSSYASTPTALASSQTHSASVLQGGNSYVRGSGTFQQR 527 Query: 746 ----------PRERMQA----LTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFISG 609 P ER QA LT D + ++ G+++IC R + L D G+T SF+S Sbjct: 528 GRGFGGRGQIPAERGQAQVFALTRQDAQTCNAVVTGILSICSRDAHVLFDLGATHSFVSS 587 Query: 608 SMVDKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDAL 429 +LG L+++T G + C + I G VDL V++ ++FD + Sbjct: 588 WFATRLGKCSSSLEEPLVVATPVGGNLLAKSVYRCCDITIDGKVFSVDLVVIDLIDFDVI 647 Query: 428 LGMDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQPITRGKMATLQEIEILAG--- 258 LGMDWLA +HA +DC K V F+IP VF + G P ++ L +++ Sbjct: 648 
LGMDWLAFHHATLDCHDKVVKFEIPGQSVFSFQGERCWVP--HNQILALAASKLMRRGCQ 705 Query: 257 ---LIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPY 87 + + + E KL+ IP+ E+P+VFP +LPGLPP REI+F I+L+P T PISIPPY Sbjct: 706 AYIALVRDTQVAEEKLEKIPIACEFPDVFPEELPGLPPDREIEFSIDLVPNTHPISIPPY 765 Query: 86 RMAPXXXXXXXXXXXXXXXKGFIHPSMS 3 RMAP KGFI PS S Sbjct: 766 RMAPAKLKELREQLQDLLDKGFIRPSSS 793 >ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508708318|gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 172 bits (436), Expect = 1e-40 Identities = 111/325 (34%), Positives = 159/325 (48%), Gaps = 48/325 (14%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNAPMSTSGSVSQ-------------FPPRE--------------- 738 C+QCGQ GH RS CP +T + S PPR+ Sbjct: 305 CFQCGQTGHIRSNCPQLGRATVAASSPTARTDIQRRDSSGLPPRQGVAIRSGVESNTPSH 364 Query: 737 -----------RMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFIS---GSMV 600 R+ A+TE + + G +++ + Y LID+GS S++S S+ Sbjct: 365 SPSRPQTCTATRVFAVTEDEARVRPGAVTGTMSLFDKDAYVLIDSGSDRSYVSTTFASIT 424 Query: 599 DKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGM 420 D+ L P++ +++ T G+++ N C C V++ E DL LE L+FD +LGM Sbjct: 425 DR-NLSPLEEE--IVVHTPLGEQLIRNTCYRDCGVRVGEEEFRGDLIPLEILDFDLILGM 481 Query: 419 DWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQP------ITRGKMATLQEIEILAG 258 DWL + A++DCF K VV + V+ G P I K+ LA Sbjct: 482 DWLTTHRANLDCFRKEVVLRNSEGAEIVFVGERRVLPSCVISAIKASKLVQKGYPTYLAY 541 Query: 257 LIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMA 78 +I + E KL+D+P+V E+P+VFP DLPG+PP RE++F I+L+PGT PISIPPYRMA Sbjct: 542 VI--DTSKGEPKLEDVPIVSEFPDVFPDDLPGIPPNRELEFPIDLLPGTAPISIPPYRMA 599 Query: 77 PXXXXXXXXXXXXXXXKGFIHPSMS 3 P KGFI PS+S Sbjct: 600 PAELKELKAQLQDLVDKGFIRPSIS 624 >ref|XP_007027952.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508716557|gb|EOY08454.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1400 Score = 172 bits (435), Expect = 2e-40 Identities = 108/326 (33%), Positives = 159/326 
(48%), Gaps = 49/326 (15%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNAPMST-------------------------------SGSVSQFP 747 CY CGQPGH R +CP A S +G+ SQ Sbjct: 335 CYGCGQPGHIRRDCPMAHQSPDSARGSTQPASSAPSVAVSSGQEVSGSRGRGAGTSSQGR 394 Query: 746 P------------RERMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSM 603 P + R+ ALT+ + + + ++ ++++C L D G+T SFIS Sbjct: 395 PSGSGHQSSIGRGQARVFALTQQEAQTSNAVVSSILSVCNMNARVLFDPGATHSFISPCF 454 Query: 602 VDKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLG 423 +LG V+ L++ST + E C+V++K + V+L VL+ L+FD +LG Sbjct: 455 ASRLGRGRVRREEQLVVSTPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILG 514 Query: 422 MDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQP------ITRGKMATLQEIEILA 261 M+WL+ HA VDC+HK V F P P F G + P I+ ++ I LA Sbjct: 515 MNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNLISVISARRLLRQGCIGYLA 574 Query: 260 GLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRM 81 + + K ++ + VV+E+ +VFP +LPGLPP RE++F I+LIP T+PISIPPYRM Sbjct: 575 VVKDSQAKIGDVT--QVSVVKEFMDVFPDELPGLPPEREVEFCIDLIPDTRPISIPPYRM 632 Query: 80 APXXXXXXXXXXXXXXXKGFIHPSMS 3 AP KGFI PS++ Sbjct: 633 APAELKELKDQLEDLLDKGFIRPSLN 658 >gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1515 Score = 169 bits (427), Expect = 2e-39 Identities = 106/306 (34%), Positives = 159/306 (51%), Gaps = 29/306 (9%) Frame = -2 Query: 833 CYQCGQPGHRRSECP--NAPMSTSGSVSQF----PP---------------RERMQALTE 717 C++CGQ GH ECP N + GS +Q PP R+ A+T Sbjct: 394 CFKCGQEGHFVKECPKNNQGSGSLGSRTQSSSVAPPDRMTPRGATSSTGGGANRLYAITS 453 Query: 716 P-DIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSMVDKLGLIPVKATNLLILSTAA 540 + E + N++ MI + +Y L+D G+++SF++ + +K ++P + +ST Sbjct: 454 RHEQENSPNVVTAMIKVFAFYVYALLDPGASLSFVTPYVANKFDVLPERLCEPFCVSTPV 513 Query: 539 GDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQ 360 G+ + C V I VDL L+ ++FD +LGMDWL A +A +DC + V FQ Sbjct: 514 GESILAERVYRDCPVSINHKSTMVDLIELDMVDFDVILGMDWLHACYASIDCRTRVVKFQ 573 Query: 359 IPRYPVFVYSGRTTTQPITRGKMATLQEIEILA--GLIPEEDKTNEIKLD-----DIPVV 201 P P+ +S ++ + +G+ + + L G I + 
N+ ++ +P+V Sbjct: 574 FPSEPILEWS---SSSAVPKGRFISYLKARKLVSKGCIYHLARVNDSSVEIPYFQSVPIV 630 Query: 200 REYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMAPXXXXXXXXXXXXXXXKGF 21 RE+PEVFP DLPG+PP REIDF I+LIP T+PISIPPYRMAP KGF Sbjct: 631 REFPEVFPNDLPGIPPEREIDFGIDLIPDTRPISIPPYRMAP----AELKELKDLLEKGF 686 Query: 20 IHPSMS 3 I PS+S Sbjct: 687 IRPSVS 692 >gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum] Length = 1602 Score = 168 bits (426), Expect = 2e-39 Identities = 106/306 (34%), Positives = 159/306 (51%), Gaps = 29/306 (9%) Frame = -2 Query: 833 CYQCGQPGHRRSECP--NAPMSTSGSVSQF----PP---------------RERMQALTE 717 C++CGQ GH ECP N + GS +Q PP R+ A+T Sbjct: 400 CFKCGQEGHFVKECPKNNQGSGSLGSRTQSSSVAPPDRMTPRGATSSTGGGANRLYAITS 459 Query: 716 P-DIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSMVDKLGLIPVKATNLLILSTAA 540 + E + N++ MI + +Y L+D G+++SF++ + +K ++P + +ST Sbjct: 460 RHEQENSPNVVTAMIKVFAFYVYALLDPGASLSFVTPYVANKFDVLPERLCEPFCVSTPV 519 Query: 539 GDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQ 360 G+ + C V I VDL L+ ++FD +LGMDWL A +A +DC + V FQ Sbjct: 520 GESILAERVYRDCPVSINHKSTMVDLIELDMVDFDVILGMDWLHACYASIDCRTRVVKFQ 579 Query: 359 IPRYPVFVYSGRTTTQPITRGKMATLQEIEILA--GLIPEEDKTNEIKLD-----DIPVV 201 P P+ +S ++ + +G+ + + L G I + N+ ++ +P+V Sbjct: 580 FPSEPILEWS---SSSAVPKGRFISYLKARKLVSKGCIYHLARVNDSSVEIPYFQSVPIV 636 Query: 200 REYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMAPXXXXXXXXXXXXXXXKGF 21 RE+PEVFP DLPG+PP REIDF I+LIP T+PISIPPYRMAP KGF Sbjct: 637 REFPEVFPDDLPGIPPEREIDFGIDLIPDTRPISIPPYRMAP----AELKELKDLLEKGF 692 Query: 20 IHPSMS 3 I PS+S Sbjct: 693 IRPSVS 698 >ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203289 [Cucumis sativus] Length = 655 Score = 168 bits (425), Expect = 3e-39 Identities = 102/262 (38%), Positives = 136/262 (51%), Gaps = 8/262 (3%) Frame = -2 Query: 836 ICYQCGQPGHRRSECP-NAPMSTSGSVSQFPP-RERMQALTEPDIEATKNLIEGMINICG 663 +CY+C Q GH CP + + S S + PP R + A + E ++ G + + G Sbjct: 365 VCYKCKQEGHMADRCPLRSTGAGSSSQGERPPQRGTIFATNRSEAEKAGTVVTGTLPVLG 424 
Query: 662 RVMYTLIDAGSTISFISGSMVDKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKG 483 TL D+GS+ SFIS V L +L +ST +G+ + + C ++I G Sbjct: 425 HFALTLFDSGSSHSFISSLFVTHACLEVEPLDYVLSVSTPSGEIMLSKEKIKACEIEIAG 484 Query: 482 YEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTT---- 315 + V L VL+ +FD +LGMDWLA HA +DC K VVF P F + G T Sbjct: 485 RVLDVTLLVLDMRDFDVILGMDWLATNHASIDCSRKEVVFSPPTASSFKFKGVGTVVLPK 544 Query: 314 --QPITRGKMATLQEIEILAGLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREI 141 + K+ ILA ++ + + E L PVVREYP+VFP DLPGLPP REI Sbjct: 545 VISAMKASKLLNQGTWSILASVV--DTREGETSLTSEPVVREYPDVFPEDLPGLPPHREI 602 Query: 140 DFIIELIPGTKPISIPPYRMAP 75 DF IEL P T PIS PYRMAP Sbjct: 603 DFAIELEPDTTPISRAPYRMAP 624 >emb|CAN62233.1| hypothetical protein VITISV_010121 [Vitis vinifera] Length = 1797 Score = 167 bits (424), Expect = 4e-39 Identities = 97/246 (39%), Positives = 137/246 (55%), Gaps = 9/246 (3%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNAPMSTSGSVSQFPPRERMQ--------ALTEPDIEATKNLIEGM 678 C+ CG+ GH +CP +G + +++ + A+T D +AT +++ G Sbjct: 734 CFCCGEQGHLIRDCPENRKFITGKPKEENKKDKQKPKAQGWVFAMTHRDAQATSDVVTGT 793 Query: 677 INICGRVMYTLIDAGSTISFISGSMVDKLGLIPVKATNL-LILSTAAGDRVYPNLCCERC 501 + I LID GST SF+S S LGL PV + + LI++T GD V + C Sbjct: 794 LRIHTLFARVLIDLGSTHSFVSVSFAGLLGL-PVASMDFDLIVATPVGDSVVASRMLRNC 852 Query: 500 IVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRT 321 IV I EMP+DL +L+ +FD +LGMDWLA+YHA VDCF KRV F IP P F + G+ Sbjct: 853 IVMIGYREMPIDLVLLDLQDFDVILGMDWLASYHASVDCFEKRVTFSIPGQPKFSFEGKH 912 Query: 320 TTQPITRGKMATLQEIEILAGLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREI 141 +P+ ++TL+ +L DIP+VREYP+VF DLPGLPP RE+ Sbjct: 913 VDRPLR--MISTLRASSLLK--------------KDIPIVREYPDVFLEDLPGLPPEREM 956 Query: 140 DFIIEL 123 +F I+L Sbjct: 957 EFTIDL 962 Score = 136 bits (343), Expect = 9e-30 Identities = 78/171 (45%), Positives = 104/171 (60%), Gaps = 4/171 (2%) Frame = -2 Query: 647 LIDAGSTISFISGSMVDKLGLIPVKATNL-LILSTAAGDRVYPNLCCERCIVKIKGYEMP 471 LID GST SF+S S LGL PV + + LI++T GD V + CIV I EM Sbjct: 98 
LIDPGSTHSFVSVSFAGLLGL-PVASMDFDLIVATPVGDFVVASRMLRNCIVMIGYREML 156 Query: 470 VDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQP---ITR 300 VDL +L+ +FD +LGMDWL +YHA +DCF KRV F IP P F + G+ +P I+ Sbjct: 157 VDLVLLDLQDFDVILGMDWLTSYHASIDCFEKRVTFSIPGQPKFSFEGKHVDRPLRMISA 216 Query: 299 GKMATLQEIEILAGLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRR 147 + ++L + L +++KL+DIP+VREYP+VF DLPGLPP R Sbjct: 217 LRASSLLKKGCQGFLASVMSNESDLKLEDIPIVREYPDVFLEDLPGLPPER 267 >emb|CBI17376.3| unnamed protein product [Vitis vinifera] Length = 1567 Score = 167 bits (423), Expect = 5e-39 Identities = 106/296 (35%), Positives = 157/296 (53%), Gaps = 13/296 (4%) Frame = -2 Query: 851 PRPQLICYQCGQ--PGHR--RSECPNAPMSTSGSVSQFPPRE---RMQALTEPDIEATKN 693 P+ QL YQ Q P + R+ + S+ GS ++ R+ R+ ALT + E Sbjct: 357 PQLQLPYYQMPQLPPAAQGTRTTTTSQTRSSQGSNARGRGRQAAGRVFALTPTEPEEDAL 416 Query: 692 LIEGMINICGRVMYTLIDAGSTISFISGSMVDKLGLIPVKATNLLILSTAAGDRVYPNLC 513 L+EGMI + + L D G+T SFIS S + LGL + NLL++ + G + Sbjct: 417 LVEGMILVYSTWVRVLFDTGATHSFISASCANALGLKSERVENLLLIESPMGTNSRVDRI 476 Query: 512 CERCIVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIPR-YPVFV 336 C+ C++ + + VDLR+L+ +D +LGMDWLA Y A +DC +R++F +P + V Sbjct: 477 CKGCVITLADRALNVDLRILDMTGYDVILGMDWLAVYRAVIDCHRRRIIFCLPEGFEVCF 536 Query: 335 YSGRTTTQPITRGK-----MATLQEIEILAGLIPEEDKTNEIKLDDIPVVREYPEVFPTD 171 G+ + P ++ + I LA L +E +I +IP+VR++ +VFP + Sbjct: 537 VGGKCVSLPFSQSDPCYQYVLRKGSINFLACLRGKEKAQKDI--TEIPMVRKFQDVFPDE 594 Query: 170 LPGLPPRREIDFIIELIPGTKPISIPPYRMAPXXXXXXXXXXXXXXXKGFIHPSMS 3 LPGLPP RE DF IE+ PGT PIS+ PYRMAP +GFI PS S Sbjct: 595 LPGLPPHREFDFSIEVYPGTDPISVSPYRMAPLELKELKTQLDELLGRGFIRPSTS 650 >ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208523, partial [Cucumis sativus] Length = 804 Score = 166 bits (420), Expect = 1e-38 Identities = 107/288 (37%), Positives = 144/288 (50%), Gaps = 10/288 (3%) Frame = -2 Query: 836 ICYQCGQPGHRRSEC----PNAPMSTSGSVSQFPPRERMQALTEPDIEATKNLIEGMINI 669 +CY+C Q GH C A S+ G+ P R + A + + E ++ G + + Sbjct: 364 
VCYKCKQEGHMADRCRLRSTGAGQSSQGAGP--PQRGTIFATSRSEAEKAGTVVTGTLPV 421 Query: 668 CGRVMYTLIDAGSTISFISGSMVDKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKI 489 G TL D+GS+ SFIS V L +L +ST +G+ + + C ++I Sbjct: 422 LGHFALTLFDSGSSHSFISSLFVTHACLEVKPLDYVLSVSTPSGEIMLSKEKIKACKIEI 481 Query: 488 KGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTT-- 315 G + V L VL+ +FD +LGMD LA HA +DC K VVF P F + G T Sbjct: 482 AGRVLDVTLLVLDIRDFDVILGMDLLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVL 541 Query: 314 ----QPITRGKMATLQEIEILAGLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRR 147 + K+ + ILA ++ + + +E L PVVREYP+VFP DLPGLPP R Sbjct: 542 PKVISAMKASKLLSQGTWSILASVV--DTREDETSLTSEPVVREYPDVFPEDLPGLPPHR 599 Query: 146 EIDFIIELIPGTKPISIPPYRMAPXXXXXXXXXXXXXXXKGFIHPSMS 3 EIDF IEL P T PIS PYRMAP KGFI PS+S Sbjct: 600 EIDFAIELEPDTTPISRAPYRMAPAELKELKVQLQELLDKGFIQPSVS 647 >emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera] Length = 1495 Score = 166 bits (419), Expect = 1e-38 Identities = 95/255 (37%), Positives = 137/255 (53%), Gaps = 6/255 (2%) Frame = -2 Query: 749 PPRERMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSMVDKLGLIPVKA 570 P R+ ALT + L+EGMI + + L D G+T SFIS S + LGL + Sbjct: 414 PAAGRVFALTPTEPXEDALLVEGMILVYSTWVRVLFDTGATHSFISASCANALGLKSERV 473 Query: 569 TNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHV 390 NLL++ + G + C+ C++ + + VDLR+L+ +D +LGMDWLA Y A + Sbjct: 474 ENLLLIESPMGTNSRVDRICKGCVITLADRALNVDLRILDMTGYDVILGMDWLAVYRAVI 533 Query: 389 DCFHKRVVFQIPR-YPVFVYSGRTTTQPITRGK-----MATLQEIEILAGLIPEEDKTNE 228 DC +R++F +P + V G+ + P ++ + I LA L +E + Sbjct: 534 DCHRRRIIFCLPEGFEVCFVGGKCVSLPFSQSDPCYQYVLRKGSINFLACLRGKEKAQKD 593 Query: 227 IKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMAPXXXXXXXXX 48 I +IPVVR++ +VFP +LPGLPP RE DF IE+ PGT PIS+ PYRMAP Sbjct: 594 I--TEIPVVRKFQDVFPDELPGLPPHREFDFSIEVYPGTDPISVSPYRMAPLELKELKTQ 651 Query: 47 XXXXXXKGFIHPSMS 3 +GFI PS S Sbjct: 652 LDELLGRGFIRPSTS 666 >ref|XP_007010278.1| Uncharacterized protein TCM_043787 [Theobroma cacao] gi|508727191|gb|EOY19088.1| Uncharacterized 
protein TCM_043787 [Theobroma cacao] Length = 649 Score = 165 bits (418), Expect = 2e-38 Identities = 103/292 (35%), Positives = 156/292 (53%), Gaps = 9/292 (3%) Frame = -2 Query: 851 PRPQLICYQCGQPGHRRSECPNAPMSTSGSVSQFPPRERMQALTEPDIEATKNLIEGMIN 672 P Q + + G + + P+ P + + + R+ A+TE + + + G+++ Sbjct: 106 PPRQGVAIRSGVESNTPAHPPSRPQTRTST--------RVFAVTEDEAQVRPRAVTGIMS 157 Query: 671 ICGRVMYTLIDAGSTISFIS---GSMVDKLGLIPVKATNLLILSTAAGDRVYPNLCCERC 501 + + Y LID+GS S++S S+ D+ L P++ +++ T G+++ N C C Sbjct: 158 LFDKDAYVLIDSGSDRSYVSTTFASIADR-NLSPLEEE--IVIHTPLGEKLVRNSCYRDC 214 Query: 500 IVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRT 321 V++ E DL LE L+FD +LGMDWL A+ A+VDCF K VV + V+ G+ Sbjct: 215 GVRVGEEEFRGDLIPLEILDFDLILGMDWLTAHRANVDCFRKEVVLRNSEGAEIVFVGKR 274 Query: 320 TTQP------ITRGKMATLQEIEILAGLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGL 159 P I K+ LA +I + E KL+D+P+V E+P+VFP DLPGL Sbjct: 275 RVLPSCVISAIKASKLVQKGYPTYLAYVI--DTSKGEPKLEDVPIVSEFPDVFPDDLPGL 332 Query: 158 PPRREIDFIIELIPGTKPISIPPYRMAPXXXXXXXXXXXXXXXKGFIHPSMS 3 PP RE++F I+L+PGT PISIPPYRMAP KGFI PS+S Sbjct: 333 PPDRELEFPIDLLPGTAPISIPPYRMAPAELKELKVQLQELVDKGFIRPSIS 384 >gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum] Length = 1554 Score = 165 bits (417), Expect = 2e-38 Identities = 103/304 (33%), Positives = 153/304 (50%), Gaps = 27/304 (8%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNAPMSTSGSVSQFP---------PR----------ERMQALTEP- 714 C++CGQ GH ECP G+ +Q PR R+ A+T Sbjct: 477 CFKCGQNGHFMRECPKNRQGNGGNRAQSSSVVPLDMTAPRGATSSTGGGANRLYAITSRH 536 Query: 713 DIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSMVDKLGLIPVKATNLLILSTAAGD 534 + E + N++ MI + +Y L+D G ++SF++ + +K ++P + +ST G+ Sbjct: 537 EPENSPNVVTRMIKVFAFDVYALLDPGVSLSFVTLYVANKFDVLPERLCEPFCVSTPVGE 596 Query: 533 RVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIP 354 + C I DL L+ ++FD +LGM+WL A +A +DC + V FQ P Sbjct: 597 SILAERVYRDCPDSINHKSTMADLVELDMVDFDVILGMNWLHACYASLDCRTRVVKFQFP 656 Query: 353 RYPVFVYSGRTTTQPITRGKMATLQEIEILAG------LIPEEDKTNEIK-LDDIPVVRE 195 PVF +S ++ + 
+G+ + + L L+ D + EI +P+VRE Sbjct: 657 NEPVFEWS---SSSAVPKGRFISYLKARKLVSKGCIYHLVRVHDSSVEIPHFQSVPIVRE 713 Query: 194 YPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMAPXXXXXXXXXXXXXXXKGFIH 15 +P+VFP DLPG+PP REIDF I+LIP T PISIPPYRMAP KGFI Sbjct: 714 FPKVFPDDLPGIPPEREIDFGIDLIPDTHPISIPPYRMAPSELKELKEQLKDLLDKGFIR 773 Query: 14 PSMS 3 PS+S Sbjct: 774 PSVS 777 >ref|XP_007049973.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508702234|gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1401 Score = 164 bits (415), Expect = 4e-38 Identities = 93/249 (37%), Positives = 138/249 (55%), Gaps = 6/249 (2%) Frame = -2 Query: 737 RMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSMVDKLGLIPVKATNLL 558 R+ ALT+ + + + ++ G++++C L D G+T SFIS +LG V+ L Sbjct: 433 RVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCFASRLGRGRVRREEQL 492 Query: 557 ILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLGMDWLAAYHAHVDCFH 378 ++ST + E C+V++K + V+L VL+ L+FD +LGMDWL+ HA VDC+H Sbjct: 493 MVSTPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMDWLSPCHASVDCYH 552 Query: 377 KRVVFQIPRYPVFVYSGRTTTQP------ITRGKMATLQEIEILAGLIPEEDKTNEIKLD 216 K V F P P+F G + P I+ ++ I LA + + K ++ Sbjct: 553 KLVRFDFPGEPLFSIQGDRSNAPTNLISVISARRLLRQGCIGYLAVVKDSQAKIGDVT-- 610 Query: 215 DIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRMAPXXXXXXXXXXXXX 36 + VV+E+ +VFP +LPGLPP RE++F I+LIP T+PISIPPYRMAP Sbjct: 611 QVSVVKEFVDVFPEELPGLPPEREVEFCIDLIPDTRPISIPPYRMAPAELKELKDQLEDL 670 Query: 35 XXKGFIHPS 9 KGFI PS Sbjct: 671 LDKGFIRPS 679 >ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica] gi|462395665|gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica] Length = 1493 Score = 164 bits (415), Expect = 4e-38 Identities = 97/281 (34%), Positives = 144/281 (51%), Gaps = 11/281 (3%) Frame = -2 Query: 812 GHRRSECPNAPMSTSGSVSQFPP-----RERMQALTEPDIEATKNLIEGMINICGRVMYT 648 G S+ P++ SG S+ P + R+ ++T+ + AT ++I GMI I G + Sbjct: 319 GSSSSKAPSSSRGRSGRQSRGQPGRSTTQARVFSMTQQEAYATPDVITGMIPIFGYLARV 378 Query: 647 
LIDAGSTISFISGSMVDKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPV 468 LID G+T SF++ + + + + P T +S G+ +Y + C V++ + Sbjct: 379 LIDPGATHSFVAHNFIPYISIRPTPITGSFSISLPTGEVLYADRVFRNCFVQVDDAWLEA 438 Query: 467 DLRVLEFLEFDALLGMDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQP------I 306 +L L+ ++ D +LGMDWL +HA VDCF K V + P P + G P I Sbjct: 439 NLIPLDLVDLDIILGMDWLEKHHASVDCFRKEVTLRSPGQPKVTFRGERRVLPTCLISAI 498 Query: 305 TRGKMATLQEIEILAGLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIE 126 T K+ LA +I + T + L+DIPVV E+P +FP DLPGLPP+REI+F I+ Sbjct: 499 TAKKLLKKGYEGYLAHIIDTREIT--LNLEDIPVVCEFPNIFPDDLPGLPPKREIEFTID 556 Query: 125 LIPGTKPISIPPYRMAPXXXXXXXXXXXXXXXKGFIHPSMS 3 +PGT PI PYRMAP FI PS+S Sbjct: 557 FLPGTNPIYQTPYRMAPAELRELKIQLQELVDLRFIRPSVS 597 >ref|XP_007032220.1| Retrotransposon protein, putative [Theobroma cacao] gi|508711249|gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao] Length = 1480 Score = 163 bits (413), Expect = 7e-38 Identities = 101/302 (33%), Positives = 149/302 (49%), Gaps = 49/302 (16%) Frame = -2 Query: 833 CYQCGQPGHRRSECPNAPMST-------------------------------SGSVSQFP 747 CY CGQPGH +CP A S +G+ SQ Sbjct: 386 CYGCGQPGHIMKDCPMAHQSPDSARGSTQPASSAPSVAVSSGLEVSGSRGRGAGTSSQGR 445 Query: 746 P------------RERMQALTEPDIEATKNLIEGMINICGRVMYTLIDAGSTISFISGSM 603 P + R+ ALT+ + + + ++ G++++C L D G+T SFIS Sbjct: 446 PSRSGHQSSIGRGQARVFALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCF 505 Query: 602 VDKLGLIPVKATNLLILSTAAGDRVYPNLCCERCIVKIKGYEMPVDLRVLEFLEFDALLG 423 +LG V+ L++ST + E C+V++K + V+L VL+ L+FD +LG Sbjct: 506 ASRLGRGRVRREEQLVVSTLLKEIFMAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILG 565 Query: 422 MDWLAAYHAHVDCFHKRVVFQIPRYPVFVYSGRTTTQP------ITRGKMATLQEIEILA 261 MDWL+ HA VDC+HK V F P P F G + P I+ ++ I LA Sbjct: 566 MDWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDMSNAPTNLISVISARRLLRQGCIGYLA 625 Query: 260 GLIPEEDKTNEIKLDDIPVVREYPEVFPTDLPGLPPRREIDFIIELIPGTKPISIPPYRM 81 + + K ++ + VV+E+ +VFP +L G PP REI+F I+LIP T+P+SIPPYRM Sbjct: 626 VVKDSQAKIGDV--TQVSVVKEFVDVFPEELSGFPPEREIEFCIDLIPDTRPMSIPPYRM 683 Query: 80 AP 75 AP 
Sbjct: 684 AP 685