BLASTX nr result
ID: Astragalus22_contig00035276
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00035276 (529 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score  E
Sequences producing significant alignments:                        (bits) Value

gb|KHN36591.1| Retrovirus-related Pol polyprotein from transposo...  144  8e-38
gb|KHN39047.1| Retrovirus-related Pol polyprotein from transposo...  140  3e-37
gb|KHN02838.1| Retrovirus-related Pol polyprotein from transposo...  140  3e-37
gb|PNX90684.1| retrovirus-related Pol polyprotein from transposo...  135  5e-35
gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinen...  138  1e-34
gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]  136  9e-34
emb|CAB75932.1| putative protein [Arabidopsis thaliana]              134  3e-33
ref|XP_024200681.1| uncharacterized protein LOC112204028 [Rosa c...  128  6e-33
gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinen...  133  8e-33
gb|PNX95763.1| retrotransposon-related protein, partial [Trifoli...  130  7e-32
gb|PNX94522.1| copia-type polyprotein [Trifolium pratense]           129  2e-31
gb|PNX67946.1| copia-type polyprotein, partial [Trifolium pratense]  120  5e-31
dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subte...  126  8e-31
dbj|GAU50131.1| hypothetical protein TSUD_192520 [Trifolium subt...  126  3e-30
dbj|GAU11414.1| hypothetical protein TSUD_344050 [Trifolium subt...  126  3e-30
dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subt...  126  3e-30
gb|PNX61211.1| copia-type polyprotein, partial [Trifolium pratense]  116  5e-30
dbj|GAU46070.1| hypothetical protein TSUD_180060 [Trifolium subt...  121  1e-29
gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]  124  1e-29
dbj|GAU39634.1| hypothetical protein TSUD_397220 [Trifolium subt...  121  2e-29

>gb|KHN36591.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 430 Score = 144 bits (362), Expect = 8e-38 Identities = 74/160 (46%), Positives = 93/160 (58%) Frame = +2 Query: 47 NKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIES 226 NK +IEC+ C KLGHYQYECP E+ ANYVE +E Sbjct: 234 NKAVIECFKCHKLGHYQYECPDWEKNANYVE--------------------------LEK 267 Query: 227 TRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESF 406 +D E LLM+ + + K +VW+LDSGCSNHM+GN+ WF +LDESF Sbjct: 268 EKD-------------EELLLMSYVELEQDKMEEVWFLDSGCSNHMTGNKEWFSELDESF 314 Query: 407 SVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 S TVKLGNNTRM V GKG +++++N Q IS VYY+PEL Sbjct: 315 SQTVKLGNNTRMVVVGKGIIRMQVNGFTQAISGVYYVPEL 354 >gb|KHN39047.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 342 Score = 140 bits (353), Expect = 3e-37 Identities = 72/159 (45%), Positives = 92/159 (57%) Frame = +2 Query: 50 KELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIEST 229 + +IEC+ C KLGHYQYECP E+ ANYVE +E Sbjct: 206 RAVIECFKCHKLGHYQYECPDWEKNANYVE--------------------------LEKE 239 Query: 230 RDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESFS 409 +D E LLM+ + + K +VW+LDSGCSNHM+GN+ WF +LDESFS Sbjct: 240 KD-------------EELLLMSYVELEQDKMEEVWFLDSGCSNHMTGNKEWFSELDESFS 286 Query: 410 VTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 TVKLGNNTRM V GKG +++++N Q IS VYY+PEL Sbjct: 287 QTVKLGNNTRMVVVGKGIIRMQVNGFTQAISGVYYVPEL 325 >gb|KHN02838.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 342 Score = 140 bits (353), Expect = 3e-37 Identities = 72/159 (45%), 
Positives = 92/159 (57%) Frame = +2 Query: 50 KELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIEST 229 + +IEC+ C KLGHYQYECP E+ ANYVE +E Sbjct: 206 RAVIECFKCHKLGHYQYECPDWEKNANYVE--------------------------LEKE 239 Query: 230 RDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESFS 409 +D E LLM+ + + K +VW+LDSGCSNHM+GN+ WF +LDESFS Sbjct: 240 KD-------------EELLLMSYVELEQDKMEEVWFLDSGCSNHMTGNKEWFSELDESFS 286 Query: 410 VTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 TVKLGNNTRM V GKG +++++N Q IS VYY+PEL Sbjct: 287 QTVKLGNNTRMVVVGKGIIRMQVNGFTQAISGVYYVPEL 325 >gb|PNX90684.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 372 Score = 135 bits (340), Expect = 5e-35 Identities = 68/162 (41%), Positives = 95/162 (58%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRI 220 Q N+ELIECY C KLGH+QYECP Sbjct: 104 QFNRELIECYKCHKLGHFQYECP------------------------------------- 126 Query: 221 ESTRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDE 400 D E +A +++ N +E LLMA+ + + K ++W+LDSGCSNHM+GN+ WF D+DE Sbjct: 127 ----DWERNAHYAELNESEEILLMAHAEHE-EKSVELWFLDSGCSNHMTGNKKWFTDIDE 181 Query: 401 SFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 + +VKLGNN +M V G+G+VKL +N I+Q+I++VYY+PEL Sbjct: 182 QYQQSVKLGNNFKMAVVGRGNVKLHVNGIMQVITNVYYVPEL 223 >gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1302 Score = 138 bits (348), Expect = 1e-34 Identities = 71/162 (43%), Positives = 91/162 (56%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRI 220 Q NK L+ECY C KLGH+QYECP E+ ANY Sbjct: 241 QFNKALVECYKCHKLGHFQYECPEWEKGANY----------------------------- 271 Query: 221 ESTRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDE 400 ++ + E LLMA + SK +VW+LDSGCSNHMSGN+ WF DL+E Sbjct: 272 ------------AELDEKEEMLLMAYVELNNSKMEEVWFLDSGCSNHMSGNKKWFIDLNE 319 Query: 401 SFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 F +VKLGNN++M V GKG+V+L+ N + Q+ +DVYYIPEL Sbjct: 320 
QFRQSVKLGNNSKMAVMGKGNVRLQANGVTQVFTDVYYIPEL 361 >gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense] Length = 1062 Score = 136 bits (342), Expect = 9e-34 Identities = 73/160 (45%), Positives = 96/160 (60%) Frame = +2 Query: 47 NKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIES 226 NK+ IECY+C KLGH+Q +CP+ +E+ANY E + EG Sbjct: 267 NKDNIECYHCHKLGHFQSDCPAWDEKANYAE-----------FDEG-------------- 301 Query: 227 TRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESF 406 E LLMA++ EK S D KVW+LDSGC NHM G ++WF +LDE F Sbjct: 302 ----------------EEMLLMAHS-EKGSYDKKVWFLDSGCRNHMCGTKDWFFNLDEQF 344 Query: 407 SVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 ++VKLG+N+RM V GKG+VKL+I I Q+I++VYYIPEL Sbjct: 345 RISVKLGDNSRMMVVGKGNVKLRIGGITQVITNVYYIPEL 384 >emb|CAB75932.1| putative protein [Arabidopsis thaliana] Length = 1339 Score = 134 bits (338), Expect = 3e-33 Identities = 66/160 (41%), Positives = 92/160 (57%) Frame = +2 Query: 47 NKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIES 226 N+ ++ECY C LGH+QYECP E+ ANY Sbjct: 246 NRAIVECYKCHNLGHFQYECPEWEKNANYA------------------------------ 275 Query: 227 TRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESF 406 ++E + E LLMA ++ + +VW+LDSGCSNHM+G++ WF +L+E F Sbjct: 276 --ELEEE---------EELLLMAYVEQNQANRDEVWFLDSGCSNHMTGSKEWFSELEEGF 324 Query: 407 SVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 + TVKLGN+TRM V GKGSVK+K+N + Q+I +VYY+PEL Sbjct: 325 NRTVKLGNDTRMSVVGKGSVKVKVNGVTQVIPEVYYVPEL 364 >ref|XP_024200681.1| uncharacterized protein LOC112204028 [Rosa chinensis] Length = 297 Score = 128 bits (321), Expect = 6e-33 Identities = 68/160 (42%), Positives = 88/160 (55%) Frame = +2 Query: 47 NKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIES 226 NK IECY C KLGH QYEC E+ AN Sbjct: 169 NKATIECYKCHKLGHLQYECLDWEKGAN-------------------------------- 196 Query: 227 TRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESF 406 +++ + E LLM+ + SK +VW+LDSGCSNHMSGN++WF DL+E F Sbjct: 197 
---------YAEFDEEEEMLLMSYVELHNSKREEVWFLDSGCSNHMSGNKSWFLDLNEKF 247 Query: 407 SVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 +VKLGNN+RM V GKG++KL++ I Q+ S+VYYIPEL Sbjct: 248 RHSVKLGNNSRMAVMGKGNIKLQVGGITQVFSEVYYIPEL 287 >gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1316 Score = 133 bits (335), Expect = 8e-33 Identities = 70/162 (43%), Positives = 92/162 (56%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRI 220 Q +K L+ECY C KLGH+Q+E P E+ ANY Sbjct: 243 QFDKALVECYKCHKLGHFQHEFPEWEKGANY----------------------------- 273 Query: 221 ESTRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDE 400 A+H ++ E LLMA + SK +VW+LDSGCSNHMSGN+ WF DL+E Sbjct: 274 ---------AEHDEN---EEMLLMAYVELNNSKMEEVWFLDSGCSNHMSGNKQWFTDLNE 321 Query: 401 SFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 F +VKLGNN++M V GKG+V+L+ N + Q+ +DVYYIPEL Sbjct: 322 QFRQSVKLGNNSKMAVMGKGNVRLQANGVTQVFTDVYYIPEL 363 >gb|PNX95763.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1327 Score = 130 bits (328), Expect = 7e-32 Identities = 69/159 (43%), Positives = 87/159 (54%) Frame = +2 Query: 50 KELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIEST 229 K IEC+ C K GH+QYECP E AN+ E Sbjct: 245 KAAIECFKCHKKGHFQYECPDWEREANFAE------------------------------ 274 Query: 230 RDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESFS 409 +H D LLMA +EK ++W+LDSGCSNHMSGN+ WF DL+E F Sbjct: 275 ------LEHEDE-----LLLMAYVEEKKETREEIWFLDSGCSNHMSGNKEWFTDLNEDFK 323 Query: 410 VTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 TVKLGN++R+ V G GSV+L +N IVQ+I++VYYIPEL Sbjct: 324 QTVKLGNDSRIAVTGIGSVRLWVNGIVQVITNVYYIPEL 362 >gb|PNX94522.1| copia-type polyprotein [Trifolium pratense] Length = 1172 Score = 129 bits (324), Expect = 2e-31 Identities = 67/163 (41%), Positives = 90/163 (55%), Gaps = 1/163 (0%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRI 220 ++NKE IECY C KLGH+QYECP++ + ANY ++ Sbjct: 235 
RINKETIECYKCHKLGHFQYECPNVGDYANYADN-------------------------- 268 Query: 221 ESTRDVETDADHSDSNVAEAYLLMA-NTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLD 397 E LLMA + + S ++WYLDSGC NHM G + WFHDLD Sbjct: 269 ------------------EEVLLMAFDKSHQESTKKQIWYLDSGCINHMCGVKEWFHDLD 310 Query: 398 ESFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 +F TV+L +N++M V GKG+VKL++N Q+I+DVYYIPEL Sbjct: 311 MNFKETVRLRDNSQMSVVGKGNVKLQLNGFTQIITDVYYIPEL 353 >gb|PNX67946.1| copia-type polyprotein, partial [Trifolium pratense] Length = 197 Score = 120 bits (301), Expect = 5e-31 Identities = 63/163 (38%), Positives = 92/163 (56%), Gaps = 1/163 (0%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEE-RANYVEDGXXXXXXXMAYMEGNDTGGELNVPR 217 ++NKE +ECY C KLGHYQ +CPS +E ANY E D G E+ Sbjct: 46 KINKESVECYKCHKLGHYQSDCPSWDEDNANYAEF---------------DEGQEI---- 86 Query: 218 IESTRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLD 397 + A M N + +S+ ++W+LDSGCSNHM GN+NW D D Sbjct: 87 -----------------LLMAQNTMVNESQNSSEKLELWFLDSGCSNHMVGNKNWLFDYD 129 Query: 398 ESFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 +SF +VKLG++++M V+GKG++KL I Q++++VY++P L Sbjct: 130 DSFKDSVKLGDDSKMSVEGKGNLKLYIEGFTQILTNVYFLPGL 172 >dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subterraneum] Length = 538 Score = 126 bits (317), Expect = 8e-31 Identities = 68/160 (42%), Positives = 90/160 (56%) Frame = +2 Query: 47 NKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIES 226 +KEL+ECY C KLGHYQ ECP+ E ANY E Sbjct: 237 SKELVECYKCHKLGHYQNECPTWGENANYAE----------------------------- 267 Query: 227 TRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESF 406 N E LLMA T+ + K+ ++W+LDSGCSNHM GN++W ++ DE++ Sbjct: 268 ------------FNDEEEMLLMAKTNCEEMKE-EIWFLDSGCSNHMIGNKDWMYEFDETY 314 Query: 407 SVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 +VKLG++++MQV GKG+VKL IN V +IS VYYIP L Sbjct: 315 RDSVKLGDDSKMQVMGKGNVKLSINGRVHVISSVYYIPGL 354 >dbj|GAU50131.1| hypothetical protein TSUD_192520 [Trifolium subterraneum] Length = 1197 Score = 126 bits (316), Expect = 3e-30 
Identities = 68/160 (42%), Positives = 90/160 (56%) Frame = +2 Query: 47 NKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIES 226 +KEL+ECY C KLGHYQ ECP+ E ANY E Sbjct: 220 SKELVECYKCHKLGHYQNECPTRGENANYAE----------------------------- 250 Query: 227 TRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESF 406 N E LLMA T+ + K+ ++W+LDSGCSNHM GN++W ++ DE++ Sbjct: 251 ------------FNDEEEMLLMAKTNCEEVKE-EIWFLDSGCSNHMIGNKDWMYEFDETY 297 Query: 407 SVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 +VKLG++++MQV GKG+VKL IN V +IS VYYIP L Sbjct: 298 RDSVKLGDDSKMQVMGKGNVKLSINGRVHVISSVYYIPGL 337 >dbj|GAU11414.1| hypothetical protein TSUD_344050 [Trifolium subterraneum] Length = 1198 Score = 126 bits (316), Expect = 3e-30 Identities = 69/161 (42%), Positives = 88/161 (54%) Frame = +2 Query: 47 NKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIES 226 NKEL+ECY C KLGHYQ+ECP + ANY E ++ +T Sbjct: 244 NKELVECYKCHKLGHYQHECPDWND-ANYAE-----------FVNEEET----------- 280 Query: 227 TRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDESF 406 LLMA+TD + K WYLDSGCSNHM GN++W D D SF Sbjct: 281 -------------------LLMASTDHGNAIREKTWYLDSGCSNHMIGNKDWLFDFDPSF 321 Query: 407 SVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPELS 529 +V+LGN+ RM V GKG+VKL IN + +IS+VY+IP L+ Sbjct: 322 KDSVRLGNDARMSVMGKGNVKLFINGKIHVISNVYFIPGLN 362 >dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subterraneum] Length = 1322 Score = 126 bits (316), Expect = 3e-30 Identities = 68/162 (41%), Positives = 89/162 (54%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRI 220 +VNKE IECY C KLGHYQ+ECP+ EE+ Sbjct: 234 RVNKENIECYRCHKLGHYQHECPTWEEK-------------------------------- 261 Query: 221 ESTRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDE 400 DA+++ + E LLMA +T +VW+LDSGCSNHM G R W D D+ Sbjct: 262 --------DANYAAYDSHEEILLMAKHGIETDARDEVWFLDSGCSNHMVGTREWLFDFDD 313 Query: 401 SFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 + +VKLG+++RMQ+ GKG++KL I I Q+ISDVYYIP L Sbjct: 314 
NIRESVKLGDDSRMQILGKGNLKLCIGGITQVISDVYYIPGL 355 >gb|PNX61211.1| copia-type polyprotein, partial [Trifolium pratense] Length = 154 Score = 116 bits (291), Expect = 5e-30 Identities = 59/162 (36%), Positives = 91/162 (56%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRI 220 +++KE +EC+ C +LGHY+ ECP+ EE Sbjct: 3 RISKETVECFKCHQLGHYKNECPTWEE--------------------------------- 29 Query: 221 ESTRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDE 400 A +++++ E +LLM D + + + WYLDSGCSNHM GN+NW ++ DE Sbjct: 30 ---------AKYAEADYEEEFLLMTGKDCEEFETER-WYLDSGCSNHMVGNKNWLYEFDE 79 Query: 401 SFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 ++ TVKLG++++M V GKG++KL+IN V +I++VYYIP L Sbjct: 80 NYRDTVKLGDDSKMNVVGKGNIKLRINGRVHVITEVYYIPGL 121 >dbj|GAU46070.1| hypothetical protein TSUD_180060 [Trifolium subterraneum] Length = 389 Score = 121 bits (304), Expect = 1e-29 Identities = 66/162 (40%), Positives = 87/162 (53%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRI 220 ++NKE ++CY C K GHYQ ECP + ANYVE Sbjct: 238 RINKESVQCYKCHKFGHYQNECPEWDN-ANYVE--------------------------- 269 Query: 221 ESTRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDE 400 T D E LLMA++ + ++WYLDSGCSNHMSG + W HD D+ Sbjct: 270 --THD------------NEEMLLMADSTSINNPREEIWYLDSGCSNHMSGTKEWMHDFDD 315 Query: 401 SFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 SF+ +VKLGN+++M V GKG+VKL I + +I+ VYYIP L Sbjct: 316 SFTESVKLGNDSKMAVMGKGNVKLMIEGRIHVITVVYYIPGL 357 >gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense] Length = 886 Score = 124 bits (311), Expect = 1e-29 Identities = 64/162 (39%), Positives = 91/162 (56%) Frame = +2 Query: 41 QVNKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRI 220 Q++KE IEC+ C KLGHYQ ECP+ E+ Sbjct: 238 QISKENIECFRCHKLGHYQSECPNWED--------------------------------- 264 Query: 221 ESTRDVETDADHSDSNVAEAYLLMANTDEKTSKDPKVWYLDSGCSNHMSGNRNWFHDLDE 400 +A+ ++ + E LLMA ++++ VWYLDSGCSNHM GN+ W D D+ Sbjct: 265 
-------ANANFAEFDDKEEILLMAQGTDESNNKKVVWYLDSGCSNHMVGNKEWLFDFDD 317 Query: 401 SFSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 SF +VKLG+++RM V GKG++KL IN +VQ+I+DVY++P L Sbjct: 318 SFRESVKLGDDSRMAVMGKGNLKLNINGMVQVITDVYFLPGL 359 >dbj|GAU39634.1| hypothetical protein TSUD_397220 [Trifolium subterraneum] Length = 417 Score = 121 bits (304), Expect = 2e-29 Identities = 65/161 (40%), Positives = 91/161 (56%), Gaps = 1/161 (0%) Frame = +2 Query: 47 NKELIECYNCRKLGHYQYECPSLEERANYVEDGXXXXXXXMAYMEGNDTGGELNVPRIES 226 NKE +EC+ C KLGHYQ ECP+ EGND Sbjct: 146 NKENVECFKCHKLGHYQSECPN---------------------WEGND------------ 172 Query: 227 TRDVETDADHSDSNVAEAYLLMANTDEKTSK-DPKVWYLDSGCSNHMSGNRNWFHDLDES 403 A++++ N E LLM TD K + + W+LDSGCSNHM G+++W + DE+ Sbjct: 173 -------ANYAEFNQYEEVLLMTKTDSNQHKYENETWFLDSGCSNHMVGHKDWMFEFDET 225 Query: 404 FSVTVKLGNNTRMQVKGKGSVKLKINEIVQMISDVYYIPEL 526 +S VKLG+++RM VKGKG++KL IN +V +IS+VY++P L Sbjct: 226 YSDYVKLGDDSRMAVKGKGNIKLCINGVVHVISNVYFVPGL 266