BLASTX nr result
ID: Panax24_contig00040278
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Panax24_contig00040278
         (680 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

EOX94092.1 Gag protease polyprotein [Theobroma cacao]                  322   e-108
EOX93994.1 DNA/RNA polymerases superfamily protein [Theobroma ca...    332   e-106
XP_017698858.1 PREDICTED: uncharacterized protein LOC108511389, ...    328   e-101
EOY14138.1 Uncharacterized protein TCM_033423 [Theobroma cacao]        320   e-101
XP_017216862.1 PREDICTED: uncharacterized protein LOC108194427 [...    330   e-100
XP_017224826.1 PREDICTED: uncharacterized protein LOC108201051 [...    323   1e-99
XP_017224824.1 PREDICTED: uncharacterized protein LOC108201049 [...    323   1e-99
EOY00215.1 DNA/RNA polymerases superfamily protein [Theobroma ca...    325   2e-99
XP_012073065.1 PREDICTED: uncharacterized protein LOC105634770 [...    325   4e-99
XP_015944834.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized p...    321   7e-99
XP_007221234.1 hypothetical protein PRUPE_ppb019121mg [Prunus pe...    306   8e-99
KYP40737.1 Retrotransposable element Tf2 [Cajanus cajan]               306   8e-99
KYP44365.1 Retrotransposable element Tf2, partial [Cajanus cajan]      301   9e-99
GAU18217.1 hypothetical protein TSUD_26810 [Trifolium subterraneum]    320   9e-99
XP_017224825.1 PREDICTED: uncharacterized protein LOC108201050 [...    322   1e-98
KYP31581.1 Retrotransposable element Tf2 [Cajanus cajan]               306   1e-98
EOY08653.1 DNA/RNA polymerases superfamily protein [Theobroma ca...    319   1e-98
GAU46468.1 hypothetical protein TSUD_402350 [Trifolium subterran...
319 2e-98 KYP76963.1 Retrotransposable element Tf2 [Cajanus cajan] 309 2e-98 EOY26510.1 DNA/RNA polymerases superfamily protein [Theobroma ca... 320 4e-98 >EOX94092.1 Gag protease polyprotein [Theobroma cacao] Length = 269 Score = 322 bits (825), Expect = e-108 Identities = 145/206 (70%), Positives = 174/206 (84%) Frame = -3 Query: 618 LAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGKCLTCQQVKIEHQRASGLLQ 439 + EAHSS +++HPGSTKMYR +K+N+WW GMKRDVAE+V KCL CQQVK EHQR +G LQ Sbjct: 1 MEEAHSSAYALHPGSTKMYRTIKENYWWPGMKRDVAEFVAKCLVCQQVKAEHQRPAGTLQ 60 Query: 438 QLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLAHFLAIHEGYSVDRLAHIFQ 259 L +P WKWE +TMDFV GLP+T R NDAIWV+VDRLTK AHFLA+H YS+++LA ++ Sbjct: 61 SLPVPEWKWEHVTMDFVLGLPRTQRGNDAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYI 120 Query: 258 QEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFSTSFHPQTDGQSERTIQTLED 79 EIVRLHG PVSIVS+RDPRFTSRFW FQ A GT+L FST+FHPQTDGQSERTIQTLED Sbjct: 121 DEIVRLHGVPVSIVSDRDPRFTSRFWLKFQEALGTKLKFSTAFHPQTDGQSERTIQTLED 180 Query: 78 MLRSCALEWSGNWDDYLSLVEFSYNN 1 MLR+C +++ G+WD +L LVEF+YNN Sbjct: 181 MLRACVIDFIGSWDRHLPLVEFAYNN 206 >EOX93994.1 DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 811 Score = 332 bits (851), Expect = e-106 Identities = 149/225 (66%), Positives = 185/225 (82%) Frame = -3 Query: 675 IWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGK 496 + L +R+CV D +R +L EAHSS +++HPGSTKMYR +K+++WW GMKRD+A++V K Sbjct: 483 LMLRDRICVPKDDQLRRAILEEAHSSAYALHPGSTKMYRTIKESYWWPGMKRDIAKFVAK 542 Query: 495 CLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLA 316 CLTCQQ+K EHQ++SG LQ L IP WKWE +TMDFV GLP+T DAIWV+VDRLTK A Sbjct: 543 CLTCQQIKAEHQKSSGTLQPLPIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKSA 602 Query: 315 HFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFST 136 HFLAIH YS++RLA ++ E+VRLHG P+SIVS+RDPRFTSRFW FQ A GT+L FST Sbjct: 603 HFLAIHSTYSIERLARLYIDEVVRLHGVPISIVSDRDPRFTSRFWPKFQEALGTKLRFST 662 Query: 135 SFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 SFHPQTDGQSERTIQTLEDMLR+C +++ G+WD +L LVEF+YNN Sbjct: 663 
SFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNN 707 >XP_017698858.1 PREDICTED: uncharacterized protein LOC108511389, partial [Phoenix dactylifera] Length = 1231 Score = 328 bits (840), Expect = e-101 Identities = 150/222 (67%), Positives = 181/222 (81%) Frame = -3 Query: 666 GERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGKCLT 487 G RLCV D +++ +L EAH S FSIHPGSTKMYRDL+++FWWNGMKR++AE+V +CL Sbjct: 795 GSRLCVPKDADLKKEILEEAHQSYFSIHPGSTKMYRDLREHFWWNGMKREIAEFVARCLV 854 Query: 486 CQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLAHFL 307 CQQVK EHQR +GLL+ L+IP WKWE ITMDFV GLPKT +KNDA+WV+VDRLTK AHFL Sbjct: 855 CQQVKAEHQRPAGLLEPLEIPEWKWEHITMDFVIGLPKTVKKNDAVWVIVDRLTKSAHFL 914 Query: 306 AIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFSTSFH 127 G S+DRLA + EIVRLHG PVSIVS+RDPRF SRFW+ FQ A GT L ST++H Sbjct: 915 PFRVGTSLDRLAQRYIDEIVRLHGVPVSIVSDRDPRFVSRFWRSFQDAMGTELRLSTAYH 974 Query: 126 PQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 PQTDGQSERTIQTLEDMLR+C ++ WD++++LVEF+YNN Sbjct: 975 PQTDGQSERTIQTLEDMLRTCTVDLGDCWDNHIALVEFAYNN 1016 >EOY14138.1 Uncharacterized protein TCM_033423 [Theobroma cacao] Length = 809 Score = 320 bits (819), Expect = e-101 Identities = 145/222 (65%), Positives = 179/222 (80%) Frame = -3 Query: 678 VIWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVG 499 + L +R+CV D +R +L EAHSS +++HPGSTKMYR +K+++WW GMKRD+AE+V Sbjct: 586 IFMLRDRICVPKDDQLRRAILEEAHSSAYALHPGSTKMYRTIKESYWWPGMKRDIAEFVA 645 Query: 498 KCLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKL 319 KCLTCQQ+K EHQ+ SG LQ L IP WKWE +TMDFV GLP+T DAIWV+VDRLTK Sbjct: 646 KCLTCQQIKAEHQKPSGTLQPLLIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKS 705 Query: 318 AHFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFS 139 AHFLAIH YS++RLA ++ EIVRLHG PVSIVS+RDPRFTSRFW F A GT+L FS Sbjct: 706 AHFLAIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDPRFTSRFWPKFHEALGTKLRFS 765 Query: 138 TSFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEF 13 T+FHPQTDGQSERTIQTLEDMLR+C +++ G+WD +L LV++ Sbjct: 766 
TAFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVQY 807 >XP_017216862.1 PREDICTED: uncharacterized protein LOC108194427 [Daucus carota subsp. sativus] Length = 1810 Score = 330 bits (845), Expect = e-100 Identities = 147/225 (65%), Positives = 184/225 (81%) Frame = -3 Query: 675 IWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGK 496 + LG R+CV +D +R +L EAH++P+++HPG+TKMY +K ++WW+GMKRDVAE+ K Sbjct: 1362 LMLGNRICVPNDEDLRREILDEAHNAPYAMHPGATKMYNTMKSHYWWSGMKRDVAEFTAK 1421 Query: 495 CLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLA 316 CLTCQQVK+EHQ +G L L IP WKWE ITMDFVT LPKT + NDAIW++VDRLTK A Sbjct: 1422 CLTCQQVKVEHQAPAGKLHPLSIPEWKWEKITMDFVTNLPKTRKGNDAIWIIVDRLTKSA 1481 Query: 315 HFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFST 136 HFL I G ++D LA + EIVRLHG P+SIVS+RDPRFTSRFWK Q A GTRLNFST Sbjct: 1482 HFLPIRWGCTLDHLAQRYVNEIVRLHGVPISIVSDRDPRFTSRFWKSLQEAMGTRLNFST 1541 Query: 135 SFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 +FHPQTDGQSERTIQTLE+MLR+C +E+ G+WD+Y++L+EF+YNN Sbjct: 1542 AFHPQTDGQSERTIQTLEEMLRACVIEFKGSWDEYIALMEFAYNN 1586 >XP_017224826.1 PREDICTED: uncharacterized protein LOC108201051 [Daucus carota subsp. 
sativus] Length = 1262 Score = 323 bits (829), Expect = 1e-99 Identities = 152/239 (63%), Positives = 186/239 (77%), Gaps = 16/239 (6%) Frame = -3 Query: 669 LGERLCVLSD--------------PI--IREVVLAEAHSSPFSIHPGSTKMYRDLKKNFW 538 +GE LC D P+ +++ VL+EAHSS +SIHPGSTKMYRDLK+N+W Sbjct: 711 VGEELCTQKDEQGILRFSSRIWIPPVQELKDEVLSEAHSSAYSIHPGSTKMYRDLKENYW 770 Query: 537 WNGMKRDVAEYVGKCLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKN 358 W MKR++AE+V KC TCQ+VK EHQR SGLLQ L+IP WKWE I MDF+ GLP+T + Sbjct: 771 WPDMKREIAEWVNKCYTCQRVKAEHQRPSGLLQPLEIPEWKWEHIAMDFIVGLPRTRANH 830 Query: 357 DAIWVVVDRLTKLAHFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWK 178 DAIWV+VDRLTK AHFL I+E +S+D+L H++ +EIV HG PVSIVS+RDPRF SRFWK Sbjct: 831 DAIWVIVDRLTKSAHFLPINERFSLDKLVHMYLKEIVVRHGVPVSIVSDRDPRFNSRFWK 890 Query: 177 GFQRAWGTRLNFSTSFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 FQ GTRLN ST++HPQTDGQSERTIQT+EDMLR CA+++ GNWD++L LVEFSYNN Sbjct: 891 SFQECLGTRLNMSTAYHPQTDGQSERTIQTIEDMLRVCAIDFKGNWDEHLPLVEFSYNN 949 >XP_017224824.1 PREDICTED: uncharacterized protein LOC108201049 [Daucus carota subsp. 
sativus] Length = 1268 Score = 323 bits (829), Expect = 1e-99 Identities = 152/239 (63%), Positives = 186/239 (77%), Gaps = 16/239 (6%) Frame = -3 Query: 669 LGERLCVLSD--------------PI--IREVVLAEAHSSPFSIHPGSTKMYRDLKKNFW 538 +GE LC D P+ +++ VL+EAHSS +SIHPGSTKMYRDLK+N+W Sbjct: 711 VGEELCTQKDEQGILRFSSRIWIPPVQELKDEVLSEAHSSAYSIHPGSTKMYRDLKENYW 770 Query: 537 WNGMKRDVAEYVGKCLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKN 358 W MKR++AE+V KC TCQ+VK EHQR SGLLQ L+IP WKWE I MDF+ GLP+T + Sbjct: 771 WPDMKREIAEWVNKCYTCQRVKAEHQRPSGLLQPLEIPEWKWEHIAMDFIVGLPRTRANH 830 Query: 357 DAIWVVVDRLTKLAHFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWK 178 DAIWV+VDRLTK AHFL I+E +S+D+L H++ +EIV HG PVSIVS+RDPRF SRFWK Sbjct: 831 DAIWVIVDRLTKSAHFLPINERFSLDKLVHMYLKEIVVRHGVPVSIVSDRDPRFNSRFWK 890 Query: 177 GFQRAWGTRLNFSTSFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 FQ GTRLN ST++HPQTDGQSERTIQT+EDMLR CA+++ GNWD++L LVEFSYNN Sbjct: 891 SFQECLGTRLNMSTAYHPQTDGQSERTIQTIEDMLRVCAIDFKGNWDEHLPLVEFSYNN 949 >EOY00215.1 DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 325 bits (833), Expect = 2e-99 Identities = 148/225 (65%), Positives = 182/225 (80%) Frame = -3 Query: 675 IWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGK 496 + L +R+CV D +R +L EAH S +++HPGSTKMYR +K+++WW GM+RD+AE+V K Sbjct: 1082 LMLRDRICVPKDDQLRRAILEEAHYSAYALHPGSTKMYRTIKESYWWPGMERDIAEFVAK 1141 Query: 495 CLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLA 316 CLTCQQ+K EHQ+ SG LQ L IP WKWE +TMDFV GLP+T DAIWV+VDRLTK A Sbjct: 1142 CLTCQQIKAEHQKPSGTLQPLSIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKSA 1201 Query: 315 HFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFST 136 HFLAIH YS++RLA ++ EIVRLHG PVSIVS+RD RFTSRFW FQ A GT+L FST Sbjct: 1202 HFLAIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDLRFTSRFWPKFQEALGTKLRFST 1261 Query: 135 SFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 +FHPQTDGQSERTIQTLEDMLR+C +++ G+WD +L LVEF+YNN Sbjct: 1262 AFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNN 1306 >XP_012073065.1 PREDICTED: 
uncharacterized protein LOC105634770 [Jatropha curcas] Length = 1963 Score = 325 bits (833), Expect = 4e-99 Identities = 148/221 (66%), Positives = 181/221 (81%) Frame = -3 Query: 663 ERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGKCLTC 484 +RL V SD +R ++L EAH SPF++HPG+TKMYRDL +N+WW GMK+D+AE+V KCLTC Sbjct: 311 DRLYVPSDLDLRHLILKEAHDSPFAMHPGATKMYRDLTRNYWWTGMKKDIAEFVAKCLTC 370 Query: 483 QQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLAHFLA 304 QQVK EHQ +GL L IP WKWE +TMDF+ GLP T +K+DA+WV+VDRLTK AHFL Sbjct: 371 QQVKAEHQVPAGLHHPLQIPEWKWERVTMDFLMGLPLTQKKHDAVWVIVDRLTKSAHFLP 430 Query: 303 IHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFSTSFHP 124 I YS+++LA ++ EIVRLHG PVSIVS+RDPRFTSRFW Q+A GTRLNFST+FHP Sbjct: 431 IRSNYSLEKLAEMYIGEIVRLHGVPVSIVSDRDPRFTSRFWASLQKALGTRLNFSTAFHP 490 Query: 123 QTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 QTDGQSER IQ LEDMLR+C LE+ G+WD+YL L+EF+YNN Sbjct: 491 QTDGQSERIIQILEDMLRACVLEFEGSWDNYLPLIEFAYNN 531 >XP_015944834.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC107469968 [Arachis duranensis] Length = 1201 Score = 321 bits (822), Expect = 7e-99 Identities = 150/220 (68%), Positives = 179/220 (81%) Frame = -3 Query: 660 RLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGKCLTCQ 481 R+CV S +R+ +LAEAH S FS+HPG TKMY+DLK+ FWW G+K+DVA+YV KCLTCQ Sbjct: 737 RICVPSSGDLRQRILAEAHQSRFSMHPGVTKMYQDLKQMFWWPGLKKDVADYVSKCLTCQ 796 Query: 480 QVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLAHFLAI 301 +VK+EHQ+ SG LQ L+IP WKWE ITMDFV GLP+T +DAIWV+VD LTK AHFL I Sbjct: 797 KVKVEHQKPSGTLQPLEIPQWKWEQITMDFVMGLPRTSTGHDAIWVIVDMLTKSAHFLPI 856 Query: 300 HEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFSTSFHPQ 121 Y+++RLA I+ QEIVRLHG P SIVS+RDPRFTSRFW FQ+A GT L+ ST++HPQ Sbjct: 857 RVDYTLERLARIYIQEIVRLHGIPSSIVSDRDPRFTSRFWGAFQKALGTELHMSTAYHPQ 916 Query: 120 TDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 TDGQSERTIQTLEDMLRSC ++ G+WD YL LVEF YNN Sbjct: 917 TDGQSERTIQTLEDMLRSCVMDNQGSWDKYLPLVEFVYNN 956 >XP_007221234.1 hypothetical protein 
PRUPE_ppb019121mg [Prunus persica] Length = 552 Score = 306 bits (785), Expect = 8e-99 Identities = 142/225 (63%), Positives = 178/225 (79%) Frame = -3 Query: 675 IWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGK 496 + +G RL V +D ++ +L EAH S F++HPGSTKMY L++++WW MK+++AEYV + Sbjct: 115 LMVGNRLYVPNDEALKREILEEAHESAFAMHPGSTKMYHTLREHYWWPFMKKEIAEYVRR 174 Query: 495 CLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLA 316 CL CQQVK E Q+ SGLLQ L IP WKWE ITMDFV LP+T K+D +WV+VDRLTK A Sbjct: 175 CLICQQVKAERQKPSGLLQPLPIPEWKWERITMDFVFKLPRTQSKHDGVWVIVDRLTKSA 234 Query: 315 HFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFST 136 HFL + YS+++LA IF EIVRLHG PVSIVS+RDPRFTSRFW A+GT+L FST Sbjct: 235 HFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDPRFTSRFWTKLNEAFGTQLQFST 294 Query: 135 SFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 +FHPQTDGQSERTIQTLEDMLR+CAL++ G+WD+ L L+EF+YNN Sbjct: 295 AFHPQTDGQSERTIQTLEDMLRACALQFRGDWDEKLPLMEFAYNN 339 >KYP40737.1 Retrotransposable element Tf2 [Cajanus cajan] Length = 540 Score = 306 bits (784), Expect = 8e-99 Identities = 135/220 (61%), Positives = 179/220 (81%) Frame = -3 Query: 660 RLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGKCLTCQ 481 R+C+ D +R VVL E H S SIHPG TKMY+D+KK+FWW GMK+++AE+V CLTCQ Sbjct: 84 RICLPCDSELRRVVLEEGHMSRLSIHPGMTKMYQDIKKSFWWPGMKKEIAEFVAACLTCQ 143 Query: 480 QVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLAHFLAI 301 + KIEHQR G+LQ ++IP WKW+ +TMDFV GLP+T R +D+IWV+VDRLTK AHFL + Sbjct: 144 KAKIEHQRPGGVLQLMEIPEWKWDSVTMDFVVGLPRTTRNSDSIWVIVDRLTKCAHFLPV 203 Query: 300 HEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFSTSFHPQ 121 ++ +S++RLA ++ +EIVRLHG P SI+S+RDPRFTSRFW+ +A GTRL S+++HPQ Sbjct: 204 NKRWSLERLAQLYIREIVRLHGVPSSIISDRDPRFTSRFWQTLHQALGTRLRLSSAYHPQ 263 Query: 120 TDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 TDGQSERTIQ+LED+LR+C L+ G+W++ L LVEF+YNN Sbjct: 264 TDGQSERTIQSLEDLLRACVLDHLGSWEEVLPLVEFTYNN 303 >KYP44365.1 Retrotransposable element Tf2, partial [Cajanus cajan] Length = 402 Score = 301 bits 
(772), Expect = 9e-99 Identities = 132/220 (60%), Positives = 176/220 (80%) Frame = -3 Query: 660 RLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGKCLTCQ 481 R+C+ D +R VL E H+S SIHP TKMY+DLKKNFWW+ MKR++AEYV CLTCQ Sbjct: 90 RICLPQDAELRRAVLEEGHTSRLSIHPCMTKMYQDLKKNFWWSSMKREIAEYVAACLTCQ 149 Query: 480 QVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLAHFLAI 301 + K+EHQ+ SGL+QQ++IP WKW+ ITMDF+ G PK+ R +DAIWV+VDRLTK A+FL + Sbjct: 150 KAKVEHQKPSGLMQQIEIPEWKWDSITMDFIVGFPKSARNSDAIWVIVDRLTKCANFLPV 209 Query: 300 HEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFSTSFHPQ 121 + +S+++L ++ +EIVRLHG P SI+S+RDPRFTSRFW+ A GT+L S+++HPQ Sbjct: 210 NIKWSLEKLTQLYVKEIVRLHGVPSSIISDRDPRFTSRFWQSLHEALGTKLKLSSAYHPQ 269 Query: 120 TDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 TDGQSERTIQ+LED+LR+C L+ G+W++ L LVEF+YNN Sbjct: 270 TDGQSERTIQSLEDLLRACVLDHLGSWEEVLPLVEFTYNN 309 >GAU18217.1 hypothetical protein TSUD_26810 [Trifolium subterraneum] Length = 1171 Score = 320 bits (820), Expect = 9e-99 Identities = 147/226 (65%), Positives = 177/226 (78%) Frame = -3 Query: 678 VIWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVG 499 VI G R+CV +D ++ ++L EAH S FSIHPGSTKMY DLKKN+WW MK ++AE+V Sbjct: 723 VIQFGNRICVPNDADLKRLILEEAHKSGFSIHPGSTKMYHDLKKNYWWPNMKTEIAEFVS 782 Query: 498 KCLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKL 319 +C+ CQQVKIEHQ+ +G LQ L+IP WKWE ITMDFVTGLP + D+IWV+VDRLTK Sbjct: 783 RCIVCQQVKIEHQKPAGPLQPLEIPEWKWEHITMDFVTGLPHNQKGEDSIWVIVDRLTKS 842 Query: 318 AHFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFS 139 AHF+A+ Y V R A IF +EIV+LHG P+SIVS+RDP FTS FW+ FQ+A GTRL S Sbjct: 843 AHFIAVKSTYKVSRYAEIFLEEIVKLHGVPLSIVSDRDPTFTSHFWRAFQKAMGTRLRIS 902 Query: 138 TSFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 TS HPQTDGQSERTIQTLEDMLR+C LE GNW +L L+EF+YNN Sbjct: 903 TSNHPQTDGQSERTIQTLEDMLRACVLEDGGNWSKHLHLIEFAYNN 948 >XP_017224825.1 PREDICTED: uncharacterized protein LOC108201050 [Daucus carota subsp. 
sativus] Length = 1393 Score = 322 bits (826), Expect = 1e-98 Identities = 152/239 (63%), Positives = 185/239 (77%), Gaps = 16/239 (6%) Frame = -3 Query: 669 LGERLCVLSD--------------PI--IREVVLAEAHSSPFSIHPGSTKMYRDLKKNFW 538 +GE LC D P+ +++ VL EAHSS +SIHPGSTKMYRDLK+N+W Sbjct: 760 VGEELCTQKDDQGILRFSSRIWIPPVQELKDEVLNEAHSSAYSIHPGSTKMYRDLKENYW 819 Query: 537 WNGMKRDVAEYVGKCLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKN 358 W MKR++AE+V KC TCQ+VK EHQR SGLLQ L+IP WKWE I MDF+ GLP+T + Sbjct: 820 WPDMKREIAEWVSKCYTCQRVKAEHQRPSGLLQPLEIPEWKWEHIAMDFIVGLPRTRANH 879 Query: 357 DAIWVVVDRLTKLAHFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWK 178 DAIWV+VDRLTK AHFL I+E +S+D+L H++ +EIV HG PVSIVS+RDPRF SRFWK Sbjct: 880 DAIWVIVDRLTKSAHFLPINERFSLDKLVHMYLKEIVVRHGVPVSIVSDRDPRFNSRFWK 939 Query: 177 GFQRAWGTRLNFSTSFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 FQ GTRLN ST++HPQTDGQSERTIQT+EDMLR CA+++ GNWD++L LVEFSYNN Sbjct: 940 SFQECLGTRLNMSTAYHPQTDGQSERTIQTIEDMLRVCAIDFKGNWDEHLPLVEFSYNN 998 >KYP31581.1 Retrotransposable element Tf2 [Cajanus cajan] Length = 539 Score = 306 bits (783), Expect = 1e-98 Identities = 139/226 (61%), Positives = 178/226 (78%) Frame = -3 Query: 678 VIWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVG 499 V+ +R+CV SDP +R ++L E H S S HPG+TKMY+DL+K FWW MK+D+AE+V Sbjct: 86 VLRFKDRVCVPSDPTLRRLILEEGHRSKLSFHPGATKMYQDLRKIFWWPRMKKDIAEFVS 145 Query: 498 KCLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKL 319 CL CQ+ KIEHQ+ SGLLQ L IP WKW+ I+MDFV LP+T R +D+IWV+VDRLTK Sbjct: 146 ACLVCQKAKIEHQKPSGLLQPLSIPEWKWDSISMDFVVALPRTRRGHDSIWVIVDRLTKS 205 Query: 318 AHFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFS 139 AHFL I+ YS++RLA ++ EIVRLHG P SIVS+RDPRFTSRFW+ QRA GT+L S Sbjct: 206 AHFLPINIRYSLERLAGLYIDEIVRLHGIPSSIVSDRDPRFTSRFWESLQRALGTQLRLS 265 Query: 138 TSFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 +++HPQTDGQ+ERTIQ+LED+LR+C L+ G+WD L L+EF+YNN Sbjct: 266 SAYHPQTDGQTERTIQSLEDLLRACVLDQGGSWDSLLPLIEFTYNN 311 >EOY08653.1 DNA/RNA polymerases superfamily protein 
[Theobroma cacao] Length = 1110 Score = 319 bits (817), Expect = 1e-98 Identities = 145/225 (64%), Positives = 181/225 (80%) Frame = -3 Query: 675 IWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGK 496 + L +R+CVL D +R +L EAHSS +++H STKMYR +K+++WW GMKRD+AE+V K Sbjct: 777 LMLRDRICVLKDDQLRRAILEEAHSSAYALHLESTKMYRTIKESYWWPGMKRDIAEFVAK 836 Query: 495 CLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLA 316 CLTCQQ+K EHQ+ SG LQ L IP WKWE +TMDFV GL +T DAIWV+VDRLTK A Sbjct: 837 CLTCQQIKAEHQKLSGTLQPLPIPEWKWEHVTMDFVLGLLRTQSGKDAIWVIVDRLTKSA 896 Query: 315 HFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFST 136 HFLAIH YS+++L ++ EIVRL+G P+SIVS+RDPRFTSRFW FQ A GT+L FST Sbjct: 897 HFLAIHNTYSIEKLVKLYIDEIVRLYGVPISIVSDRDPRFTSRFWSKFQEALGTKLRFST 956 Query: 135 SFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 +FHPQTDGQSERTIQTLEDMLR+C +++ G+WD +L LVEF+YNN Sbjct: 957 AFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNN 1001 >GAU46468.1 hypothetical protein TSUD_402350 [Trifolium subterraneum] Length = 1151 Score = 319 bits (817), Expect = 2e-98 Identities = 146/226 (64%), Positives = 177/226 (78%) Frame = -3 Query: 678 VIWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVG 499 VI G R+CV +D ++ +L EAH S FSIHPGSTKMY DLKKN+WW MK ++AE+V Sbjct: 705 VIQFGNRICVPNDADLKRSILEEAHKSGFSIHPGSTKMYHDLKKNYWWPNMKTEIAEFVS 764 Query: 498 KCLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKL 319 +C+ CQQVKIEHQ+ +G LQ L+IP WKWE ITMDFVTGLP+ + D+IWV+VDRLTK Sbjct: 765 RCIVCQQVKIEHQKPAGPLQPLEIPEWKWEHITMDFVTGLPRNQKGEDSIWVIVDRLTKS 824 Query: 318 AHFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFS 139 AHF+A+ Y V R A IF +EIV+LHG PVSIVS+RDP FTS FW+ FQ+A GTR+ + Sbjct: 825 AHFIAVKSTYKVSRYAEIFLEEIVKLHGVPVSIVSDRDPAFTSHFWRAFQKAMGTRIRMN 884 Query: 138 TSFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 TS HPQTDGQSERTIQTLEDMLR+C LE GNW +L L+EF+YNN Sbjct: 885 TSNHPQTDGQSERTIQTLEDMLRACILEDGGNWSKHLHLIEFAYNN 930 >KYP76963.1 Retrotransposable element Tf2 [Cajanus cajan] Length = 673 Score = 309 
bits (791), Expect = 2e-98 Identities = 133/221 (60%), Positives = 181/221 (81%) Frame = -3 Query: 663 ERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGKCLTC 484 +R+C+ D +R VL E H S SIHPG TKMY+DLKK FWW+GMKR++AEYV CLTC Sbjct: 208 DRICLPQDAELRRAVLEEGHKSRLSIHPGMTKMYQDLKKTFWWSGMKREIAEYVAACLTC 267 Query: 483 QQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLAHFLA 304 Q+ K+EHQ+ SGL+QQ++IP WKW++ITM+F+ GLP++ R +DAIWV+VDRLTK AHFL+ Sbjct: 268 QKAKVEHQKPSGLMQQIEIPEWKWDNITMNFIVGLPRSARNSDAIWVIVDRLTKCAHFLS 327 Query: 303 IHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFSTSFHP 124 ++ +S+++L ++ +EIVRLHG P SI+S+RDPRFTSRFW+ +A GT+L S+++HP Sbjct: 328 VNIKWSLEKLTQLYMKEIVRLHGVPSSIISDRDPRFTSRFWQSLHQALGTKLKLSSAYHP 387 Query: 123 QTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 QTDGQSERTIQ+LED+LR+C L+ G+W++ L LVEF+YNN Sbjct: 388 QTDGQSERTIQSLEDLLRACVLDHLGSWEEVLPLVEFTYNN 428 >EOY26510.1 DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1290 Score = 320 bits (819), Expect = 4e-98 Identities = 144/225 (64%), Positives = 181/225 (80%) Frame = -3 Query: 675 IWLGERLCVLSDPIIREVVLAEAHSSPFSIHPGSTKMYRDLKKNFWWNGMKRDVAEYVGK 496 + L +R+CV D +R +L EAHSS +++HPGSTKMY+ +K+++WW GMKRD+AE+V K Sbjct: 871 LMLRDRICVPKDDQLRRAILEEAHSSAYALHPGSTKMYQTIKESYWWPGMKRDIAEFVAK 930 Query: 495 CLTCQQVKIEHQRASGLLQQLDIPVWKWEDITMDFVTGLPKTFRKNDAIWVVVDRLTKLA 316 CL CQQ+K EHQ++SG LQ L IP WKWE +TMDFV GLP+T DAIWV++ RLTK A Sbjct: 931 CLICQQIKAEHQKSSGTLQPLPIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIMGRLTKSA 990 Query: 315 HFLAIHEGYSVDRLAHIFQQEIVRLHGTPVSIVSNRDPRFTSRFWKGFQRAWGTRLNFST 136 HFLAIH YS++RLA ++ E+VRLHG PVSIVS+RDPRFTSRFW FQ A GT+L FST Sbjct: 991 HFLAIHSTYSIERLARLYIDEVVRLHGVPVSIVSDRDPRFTSRFWPKFQEALGTKLRFST 1050 Query: 135 SFHPQTDGQSERTIQTLEDMLRSCALEWSGNWDDYLSLVEFSYNN 1 +FHPQ DGQSERTIQTLEDMLR+C +++ +WD +L LVEF+YNN Sbjct: 1051 AFHPQIDGQSERTIQTLEDMLRACVIDFIRSWDRHLPLVEFAYNN 1095