BLASTX nr result
ID: Astragalus23_contig00019362
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00019362
         (1015 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposo...  306   1e-96
gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposo...  246   5e-74
gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]           257   1e-73
dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subte...  242   5e-72
dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subte...  251   1e-71
dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subt...  249   3e-71
gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense]  246   3e-71
gb|PNX74620.1| putative LRR receptor-like protein kinase, partia...  245   5e-71
gb|PNY01730.1| copia-type polyprotein, partial [Trifolium pratense]  245   1e-70
gb|KYP38784.1| Retrovirus-related Pol polyprotein from transposo...  238   2e-70
gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]  244   2e-70
gb|PNX65055.1| retrotransposon-related protein, partial [Trifoli...  229   3e-70
gb|PNX83704.1| copia-type polyprotein, partial [Trifolium pratense]  239   3e-70
gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense]  241   4e-70
gb|PNX74679.1| copia-type polyprotein, partial [Trifolium pratense]  240   5e-70
dbj|GAU37106.1| hypothetical protein TSUD_278930 [Trifolium subt...
244 8e-70 gb|PNX99782.1| copia-type polyprotein [Trifolium pratense] 243 1e-69 dbj|GAU36022.1| hypothetical protein TSUD_211600 [Trifolium subt... 245 2e-69 gb|PNX90684.1| retrovirus-related Pol polyprotein from transposo... 230 3e-69 emb|CBI37296.3| unnamed protein product, partial [Vitis vinifera] 245 3e-69 >gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 582 Score = 306 bits (785), Expect = 1e-96 Identities = 143/210 (68%), Positives = 172/210 (81%), Gaps = 4/210 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCSTHMCG K+WF LD FRE VKLGDGR L+V+G+GNVKL ++GRIQ+ITGVYYIP Sbjct: 294 DSGCSTHMCGVKRWFIDLDEQFREVVKLGDGRTLSVMGRGNVKLCVEGRIQIITGVYYIP 353 Query: 835 NLR----SIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQQK LK+ F+DDKC+V HKEKGL+MTS+MA+NRMF +KA PV + C Sbjct: 354 NLMNNLLSIGQLQQKKLKIIFDDDKCRVYHKEKGLLMTSYMANNRMFPIKAKPVITEAAC 413 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 + ++ ++T++WH+RYGHLSY GL LN K MV+GLPELKE KVC EC +GKQHRD +S Sbjct: 414 IQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVKGLPELKEMDKVCPECAVGKQHRDAIS 473 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNK 398 KQSTWRA+R+LELIHSDICGP P SNSN+ Sbjct: 474 KQSTWRATRRLELIHSDICGPSTPTSNSNR 503 >gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 508 Score = 246 bits (628), Expect = 5e-74 Identities = 115/174 (66%), Positives = 142/174 (81%), Gaps = 4/174 (2%) Frame = -1 Query: 907 IGKGNVKLRIKGRIQVITGVYYIPNLR----SIGQLQQKNLKVEFEDDKCKVIHKEKGLI 740 +G+GNVKL ++GRIQ+ITGVYYIPNL SIGQLQQK LK+ F+DDKC+V HKEKGL+ Sbjct: 1 MGRGNVKLCVEGRIQIITGVYYIPNLMNNLLSIGQLQQKKLKIIFDDDKCRVYHKEKGLL 60 Query: 739 MTSFMASNRMFAVKATPVKIDTNCLHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGL 560 MTS+MA+NRMF +KA V + C+ ++ ++T++WH+RYGHLSY GL LN K MV+GL Sbjct: 61 MTSYMANNRMFPIKAKLVITEAACIQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVKGL 120 Query: 559 PELKETTKVCAECQIGKQHRDTMSKQSTWRASRKLELIHSDICGPINPASNSNK 
398 PELKE KVC EC +GKQHRD +SKQSTWRA+R+LELIHSDICGP P SNSN+ Sbjct: 121 PELKEMDKVCPECAVGKQHRDAISKQSTWRATRRLELIHSDICGPSTPTSNSNR 174 >gb|PNX95204.1| copia-type polyprotein [Trifolium pratense] Length = 1328 Score = 257 bits (657), Expect = 1e-73 Identities = 128/208 (61%), Positives = 157/208 (75%), Gaps = 4/208 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G K+W D SFRESVKLGD K++V+GKG +KL I G QVI+ VYY+P Sbjct: 301 DSGCSNHMIGNKEWLFDFDDSFRESVKLGDDSKMHVMGKGKLKLYIGGITQVISEVYYLP 360 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQQKNL + F++D CKV H+E+GLIM++ M++NRMF +KAT + C Sbjct: 361 GLKNNLLSIGQLQQKNLTIVFKNDICKVFHEERGLIMSTPMSANRMFVIKATVLV--PMC 418 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L +E +QLWH+RYGHLSYKGL+ L KEMVRGLP LKE + VC++C GKQHR+ + Sbjct: 419 LQTTNEIDSQLWHKRYGHLSYKGLNTLVKKEMVRGLPALKEASDVCSDCLFGKQHREVIP 478 Query: 487 KQSTWRASRKLELIHSDICGPINPASNS 404 K+ WRA+ KLELIHSDICGPINPASNS Sbjct: 479 KKVNWRATHKLELIHSDICGPINPASNS 506 >dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subterraneum] Length = 538 Score = 242 bits (617), Expect = 5e-72 Identities = 115/211 (54%), Positives = 151/211 (71%), Gaps = 4/211 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G K W D ++R+SVKLGD K+ V+GKGNVKL I GR+ VI+ VYYIP Sbjct: 293 DSGCSNHMIGNKDWMYEFDETYRDSVKLGDDSKMQVMGKGNVKLSINGRVHVISSVYYIP 352 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQ+QQKN+ + F +D CK H EKGL+ ++ M++NRM+ +KA + + C Sbjct: 353 GLKTNLLSIGQIQQKNVTIVFNEDTCKAYHDEKGLLFSTHMSANRMYVIKA--LVVTPRC 410 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L A ++ +QLWH RYGHLS KGL+ L K+MV+GLP LK+ ++ CA+C GKQHR+ + Sbjct: 411 LQAAKKDVSQLWHNRYGHLSIKGLNTLTNKDMVKGLPALKDLSEKCADCLTGKQHREKIP 470 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNKS 395 KQ+ WRA+ L+L+HSDICGPINP SN S Sbjct: 471 
KQAKWRAT--LKLVHSDICGPINPTSNGGNS 499 >dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subterraneum] Length = 1172 Score = 251 bits (640), Expect = 1e-71 Identities = 125/233 (53%), Positives = 157/233 (67%), Gaps = 8/233 (3%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G K W D +F++SVKLGD ++ V+GKGN+KL I+G +Q++T VYY+P Sbjct: 282 DSGCSNHMVGNKSWLFDYDDTFKDSVKLGDDSRMAVVGKGNLKLHIEGYVQILTNVYYLP 341 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQQKNL + F++D CKV H EKGLIMTS M+ NRM+ +KA V + C Sbjct: 342 GLKNNLLSIGQLQQKNLTIIFKNDTCKVYHDEKGLIMTSHMSMNRMYVIKAPVVIPQSQC 401 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 I + LWH+RYGHLS+KG++ L K MV GLP+LKE T+ C C GKQ R+ + Sbjct: 402 FKITKNDEDTLWHKRYGHLSFKGINVLVQKNMVIGLPKLKEPTEKCTHCMKGKQQRENVP 461 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNK----SDVGDHARFTHGCLHEAKA 341 K+S WRAS KLELIHSDICGPINP SN K + D +R T C K+ Sbjct: 462 KKSHWRASHKLELIHSDICGPINPESNGKKRYFITFTDDMSRKTWTCFISEKS 514 >dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subterraneum] Length = 1103 Score = 249 bits (637), Expect = 3e-71 Identities = 120/210 (57%), Positives = 157/210 (74%), Gaps = 4/210 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HMCG K+WF LD SFRESV+LGD +++V+GKGNVKL++ G +Q+ITGVY+IP Sbjct: 297 DSGCSNHMCGIKEWFHDLDESFRESVRLGDDSQMSVMGKGNVKLQMNGIVQIITGVYFIP 356 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL S+GQLQ+KNL +++ CKV H++KG+IM S MASN++F + A ++ C Sbjct: 357 KLKNNLLSLGQLQEKNLTFVIKNNWCKVYHRDKGMIMCSQMASNKLFPIMAEAKQV---C 413 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L E+ TQLWH RYGHL+ KGL L K MV GLP+ +E+ VCA+C GKQHR+++ Sbjct: 414 LQANIEDITQLWHCRYGHLNIKGLQNLQQKNMVIGLPKFEESNHVCADCLRGKQHRESIP 473 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNK 398 K S W++S++LELIHSDICGPI P SNSNK Sbjct: 474 KTSNWKSSKRLELIHSDICGPITPVSNSNK 
503 >gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense] Length = 803 Score = 246 bits (627), Expect = 3e-71 Identities = 127/240 (52%), Positives = 163/240 (67%), Gaps = 8/240 (3%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM GTK W LD +FRESVKLG+ K+ V+GKGN++L I GRI +IT VYY+P Sbjct: 59 DSGCSNHMIGTKDWLFDLDETFRESVKLGNDSKMAVMGKGNLRLDIGGRIIIITDVYYLP 118 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQQK L + F+++ C++ H++KGLI+++ M NRM+ V+AT I NC Sbjct: 119 GLGNNLLSIGQLQQKGLTIVFKNNVCQLFHEDKGLILSTEMTMNRMYIVRATV--IIPNC 176 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L + E T+LWH+RY HLS KGL LN K MV+GLPEL++T + C +C GKQHR+ M Sbjct: 177 LQVTKAEETELWHKRYAHLSIKGLRVLNKKHMVKGLPELRDTEEKCTDCLSGKQHRENMP 236 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNK----SDVGDHARFTHGCLHEAKACL*LVVK 320 KQ+ WRAS LELIHSDICGPI P SN + D +R T + + K+C V K Sbjct: 237 KQANWRASEILELIHSDICGPITPKSNGGNRYFLTFTDDFSRKTWTYIIQEKSCALSVFK 296 >gb|PNX74620.1| putative LRR receptor-like protein kinase, partial [Trifolium pratense] Length = 814 Score = 245 bits (626), Expect = 5e-71 Identities = 119/207 (57%), Positives = 153/207 (73%), Gaps = 4/207 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGC+ HM G K W LD SFRESVKLG+ K+ V+GK NVKL I+GRI VIT VYY+P Sbjct: 21 DSGCNNHMVGNKDWLFELDESFRESVKLGNDSKMAVMGKCNVKLNIEGRIHVITDVYYLP 80 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQQK + + F+++ C++ H+EKGLI+++ M +N+M+ + A PV I NC Sbjct: 81 GLSNNLLSIGQLQQKGITIIFKNNTCQLFHEEKGLIISTAMTTNKMYIINA-PV-ITPNC 138 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L + +E T LWH+RY HLS KGL L K MV+GLPELK+ + C++C GKQHRD + Sbjct: 139 LQMTKDEETDLWHKRYAHLSLKGLKVLTGKNMVKGLPELKDNEEKCSDCLSGKQHRDNIP 198 Query: 487 KQSTWRASRKLELIHSDICGPINPASN 407 KQ+ WRAS+KLEL+HSDICGP+NP SN Sbjct: 199 KQTNWRASQKLELVHSDICGPLNPKSN 225 
>gb|PNY01730.1| copia-type polyprotein, partial [Trifolium pratense] Length = 861 Score = 245 bits (625), Expect = 1e-70 Identities = 117/207 (56%), Positives = 150/207 (72%), Gaps = 4/207 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G K WF D ++R+SVKLGD ++NV+GKGNVKL I GR+ ++T VY+IP Sbjct: 293 DSGCSYHMAGNKDWFYDFDENYRDSVKLGDDSRMNVMGKGNVKLSINGRVHILTDVYFIP 352 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQ+QQKN + F++D CKV H+EKGL+ ++M++NRM+ VKA + C Sbjct: 353 GLKTNLLSIGQIQQKNTTIVFKNDICKVYHREKGLLFATYMSTNRMYVVKAEVIA--PRC 410 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L + +QLWH RY HLS KGL+ L K+MV+GLP LKE + C C +GKQH+D + Sbjct: 411 LQASKIVNSQLWHNRYCHLSIKGLNILVIKDMVKGLPALKELDENCVNCLVGKQHKDAIP 470 Query: 487 KQSTWRASRKLELIHSDICGPINPASN 407 KQ+TWRAS KLEL+HSDICGPINP SN Sbjct: 471 KQATWRASLKLELVHSDICGPINPKSN 497 >gb|KYP38784.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 560 Score = 238 bits (608), Expect = 2e-70 Identities = 112/210 (53%), Positives = 146/210 (69%), Gaps = 4/210 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HMCG K F L+ FR+ VKLG+ K++V+GKGNV+L++ G V+T V+Y+P Sbjct: 271 DSGCSNHMCGDKALFYNLNEDFRQIVKLGNNSKMSVLGKGNVRLKVNGFTHVVTEVFYVP 330 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQ+K L + ++ CK+ H EKGL++ + M +NRMF + A P+ C Sbjct: 331 ELKNNLLSIGQLQEKGLSILIQNGSCKIYHPEKGLVIQTEMTTNRMFVLTAVPMPQKPTC 390 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 LH +E LWH RYGHLS+KGL L K+MV GLP LK T VC C IGKQHRD + Sbjct: 391 LHTTVQEIAHLWHCRYGHLSHKGLRTLQYKKMVHGLPPLKSPTDVCTVCMIGKQHRDPIP 450 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNK 398 K++ WRA++KL+LIH+DICGPI+P SNS K Sbjct: 451 KKANWRATQKLQLIHADICGPISPTSNSKK 480 >gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense] Length = 886 Score 
= 244 bits (624), Expect = 2e-70 Identities = 119/207 (57%), Positives = 153/207 (73%), Gaps = 4/207 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G K+W D SFRESVKLGD ++ V+GKGN+KL I G +QVIT VY++P Sbjct: 298 DSGCSNHMVGNKEWLFDFDDSFRESVKLGDDSRMAVMGKGNLKLNINGMVQVITDVYFLP 357 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQQKN+ + FE D+CKV H + GLI+TS M++NRMF ++A+ I C Sbjct: 358 GLKNNLLSIGQLQQKNVTIIFEKDQCKVFHDKWGLIITSDMSANRMFIIQASI--ISPMC 415 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L I+ + + LWH RY HLS+KGL+ L K+MV+GLP L+ET +VC++C GKQ R+ + Sbjct: 416 LKISKDSQSHLWHCRYAHLSFKGLNTLVKKDMVKGLPTLQETDEVCSDCATGKQSREAIP 475 Query: 487 KQSTWRASRKLELIHSDICGPINPASN 407 K + WRAS KL+L+HSDICGPINPASN Sbjct: 476 KSNNWRASEKLQLVHSDICGPINPASN 502 >gb|PNX65055.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 281 Score = 229 bits (585), Expect = 3e-70 Identities = 115/207 (55%), Positives = 144/207 (69%), Gaps = 4/207 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGC+ HM G K W D SF++SVKLG+ K++V+GKGNVKL I G+I VIT VYY+P Sbjct: 75 DSGCNNHMIGQKDWLYDFDSSFKDSVKLGNDTKMSVMGKGNVKLFINGKIHVITNVYYLP 134 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL S+GQLQ+K + V F+D+ CK H E GLI T+ M +NRMF + A PV I C Sbjct: 135 GLTTNLLSVGQLQEKKVTVVFKDNMCKGYHDENGLIFTTQMTANRMFLISA-PV-IMPMC 192 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 + + +E TQLWH RYGHLS GL L EMV GLPEL++ C +C GKQ R+ + Sbjct: 193 MQFSKQERTQLWHNRYGHLSVNGLKLLTKLEMVNGLPELEDMEGRCTDCLSGKQQREAIP 252 Query: 487 KQSTWRASRKLELIHSDICGPINPASN 407 KQ+ WRA+ KL+LIHSDICGPINP+SN Sbjct: 253 KQAKWRATEKLQLIHSDICGPINPSSN 279 >gb|PNX83704.1| copia-type polyprotein, partial [Trifolium pratense] Length = 598 Score = 239 bits (609), Expect = 3e-70 Identities = 121/212 (57%), Positives = 152/212 (71%), Gaps = 6/212 (2%) 
Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G K W DG+F++SVKLGD K+ V GKGN+KL I+G QV+T VYY+P Sbjct: 315 DSGCSNHMVGNKNWLFEYDGTFKDSVKLGDDSKMAVEGKGNLKLHIEGFTQVLTNVYYLP 374 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQQKNL V F++D CKV H+EKGLIM + M NRMF +KA PV I +C Sbjct: 375 GLKNNLLSIGQLQQKNLTVIFKNDTCKVFHEEKGLIMATHMTMNRMFVIKA-PV-IVPHC 432 Query: 667 LHIA--DEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDT 494 ++ + + + LWH+RYGHLS+KG++ L K MV GLP+LK+ + C+ C GKQ R Sbjct: 433 MNTSSNNHDNANLWHQRYGHLSFKGMNVLAHKGMVIGLPKLKQPDEECSNCMKGKQQRKN 492 Query: 493 MSKQSTWRASRKLELIHSDICGPINPASNSNK 398 + K+S+WRAS KLEL+HSDICGPINP SN K Sbjct: 493 VPKKSSWRASTKLELVHSDICGPINPESNGRK 524 >gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense] Length = 736 Score = 241 bits (616), Expect = 4e-70 Identities = 120/210 (57%), Positives = 149/210 (70%), Gaps = 4/210 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G W D +F++SVKLGD K+ V GKGN+KL IKG +Q +T VYY+P Sbjct: 70 DSGCSNHMVGNNNWLFDYDDTFKDSVKLGDDSKMVVEGKGNLKLYIKGFVQTLTSVYYLP 129 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQQKNL + F+DD CKV H EKGLIM + M NRMF +KA PV I C Sbjct: 130 GLKNNLLSIGQLQQKNLTIIFKDDTCKVFHDEKGLIMATTMTFNRMFVIKA-PV-IVPQC 187 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 + I+ + + LWH+RYGHLS+KG++ L K+MV GLP+LKE+ + C C GKQ + + Sbjct: 188 MKISGVDDSTLWHQRYGHLSFKGMNVLTQKQMVIGLPKLKESNEKCTNCMKGKQQKQSAP 247 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNK 398 K+S+WRAS KLEL+HSDICGPINP SN K Sbjct: 248 KKSSWRASTKLELVHSDICGPINPESNGKK 277 >gb|PNX74679.1| copia-type polyprotein, partial [Trifolium pratense] Length = 694 Score = 240 bits (613), Expect = 5e-70 Identities = 119/220 (54%), Positives = 155/220 (70%), Gaps = 4/220 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G 
K+W DGSF ESVKLG+ K+ ++GKGNVKL+I G++ VIT VYYIP Sbjct: 284 DSGCSNHMTGKKEWLHDFDGSFTESVKLGNDSKMAIMGKGNVKLKIAGKVHVITDVYYIP 343 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL S+ QLQQKNL V F++D C+VIH EKGLI+T+ +++NRM+ + A PV I N Sbjct: 344 GLSSNLLSVDQLQQKNLTVVFKNDLCQVIHNEKGLILTTQISANRMYMIFA-PV-ILPNY 401 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 + ++ + LWH +Y HL++KGL LN K MV+GLPELK+ C +C GKQHRD++ Sbjct: 402 MQMSKTDEYHLWHNKYAHLNFKGLKVLNKKHMVKGLPELKDIEDKCGDCLSGKQHRDSIP 461 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNKSDVGDHARFT 368 K+STWRAS +LEL+H+DICGPI P SN G + T Sbjct: 462 KKSTWRASTRLELVHTDICGPIKPESNGGNISQGIKRKLT 501 >dbj|GAU37106.1| hypothetical protein TSUD_278930 [Trifolium subterraneum] Length = 1013 Score = 244 bits (624), Expect = 8e-70 Identities = 114/207 (55%), Positives = 148/207 (71%), Gaps = 4/207 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G K W DGS+R+SVKLGD K+NV+GKGN+K I G+I VIT V+YIP Sbjct: 296 DSGCSNHMVGNKDWLYEFDGSYRDSVKLGDDSKMNVMGKGNIKFSIAGKIHVITNVFYIP 355 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQ+QQK++ + F++D CK+ H +KGL+ T+ M+ NRM+ V AT + NC Sbjct: 356 GLKSNLLSIGQIQQKHVTIVFKNDLCKIYHDDKGLLFTTHMSPNRMYVVNATVIV--PNC 413 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L + ++ +QLWH+RYGHLS KGL+ L K+MV GLP L + K C +C GKQH D M Sbjct: 414 LQVTKKDWSQLWHQRYGHLSIKGLNTLAKKDMVNGLPVLDDLDKHCGDCLTGKQHMDAMP 473 Query: 487 KQSTWRASRKLELIHSDICGPINPASN 407 K++ WRA KLEL+HSD+CGP+NP SN Sbjct: 474 KRAVWRAKLKLELVHSDLCGPLNPTSN 500 >gb|PNX99782.1| copia-type polyprotein [Trifolium pratense] Length = 912 Score = 243 bits (620), Expect = 1e-69 Identities = 115/207 (55%), Positives = 151/207 (72%), Gaps = 4/207 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM GTK+WF D FRESVKLG+ K+ V+G+GNVKL + G+I VIT VYY+P Sbjct: 294 
DSGCSNHMTGTKEWFFDFDDKFRESVKLGNDSKMTVMGRGNVKLNMDGKIHVITNVYYLP 353 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL S+GQLQQ+ L F+++ C++ H+EKGLI+T+ M N+M+ VKA+ + NC Sbjct: 354 GLSNNLLSVGQLQQRGLTTVFKNNMCQLFHEEKGLILTTKMTFNKMYIVKASMIL--PNC 411 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L E T LWH+RY HLS++GL L K+MV+GLP LKE+ C +C +GKQHR ++ Sbjct: 412 LQATALEETTLWHQRYAHLSFQGLKTLITKQMVKGLPNLKESGDKCTDCLVGKQHRSSIP 471 Query: 487 KQSTWRASRKLELIHSDICGPINPASN 407 K++ WRAS+KLEL+HSDICGPINP SN Sbjct: 472 KEANWRASKKLELVHSDICGPINPQSN 498 >dbj|GAU36022.1| hypothetical protein TSUD_211600 [Trifolium subterraneum] Length = 1423 Score = 245 bits (626), Expect = 2e-69 Identities = 119/220 (54%), Positives = 153/220 (69%), Gaps = 4/220 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G K W LD S+R++VKLGD K+NV+GKGNVKL I G+I VITGV+YIP Sbjct: 298 DSGCSNHMVGNKDWLYELDESYRDTVKLGDDSKMNVMGKGNVKLSIDGKIHVITGVFYIP 357 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQ+QQKN+ + F++D CK+ H +KGL+ T+ M+ NRM+ V A + Sbjct: 358 GLKSNLLSIGQIQQKNVTIVFKNDICKIYHDDKGLLFTTQMSPNRMYVVNANVIM--PKR 415 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 L + ++ +QLWH RYGHLS KGL+ L KEMV+GLP L+E + C +C GKQHRD + Sbjct: 416 LQVTKKDWSQLWHNRYGHLSIKGLNTLARKEMVKGLPPLEELNEHCVDCLTGKQHRDAIP 475 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNKSDVGDHARFT 368 KQ+ WRA KLEL+H DICGP+NP SN S+ G + T Sbjct: 476 KQAVWRAKLKLELVHLDICGPLNPISNGGNSNQGIKRQLT 515 >gb|PNX90684.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 372 Score = 230 bits (586), Expect = 3e-69 Identities = 106/210 (50%), Positives = 150/210 (71%), Gaps = 4/210 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HM G KKWF+ +D +++SVKLG+ K+ V+G+GNVKL + G +QVIT VYY+P Sbjct: 162 
DSGCSNHMTGNKKWFTDIDEQYQQSVKLGNNFKMAVVGRGNVKLHVNGIMQVITNVYYVP 221 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQL +K + V ++ +C+ H ++GL + + M++NRMF ATP+ ++C Sbjct: 222 ELKNNLISIGQLIEKGVSVLIQNGECRCYHSKEGLFLQTKMSANRMFVFHATPMSQFSSC 281 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 A E+ T LWH RYGHL+Y GL L +K+MV GLPE++ +K+C +C +GKQ R + Sbjct: 282 FKTASEDETHLWHCRYGHLNYNGLKTLQSKKMVTGLPEIRTPSKLCNDCVMGKQQRKPIP 341 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNK 398 K+S WRA+ KL+LIHSDICGPI P+++S K Sbjct: 342 KKSLWRATHKLQLIHSDICGPITPSTSSGK 371 >emb|CBI37296.3| unnamed protein product, partial [Vitis vinifera] Length = 3048 Score = 245 bits (625), Expect = 3e-69 Identities = 112/210 (53%), Positives = 152/210 (72%), Gaps = 4/210 (1%) Frame = -1 Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836 DSGCS HMCG K +FS DG+FR+SVKLG+ ++V+GKGNV+L++ Q+ITGV+Y+P Sbjct: 323 DSGCSNHMCGKKDYFSDFDGTFRDSVKLGNNTSMSVLGKGNVRLKVNEMTQIITGVFYVP 382 Query: 835 ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668 NL SIGQLQ+K L + F+ KCKV H +KGLIM + M+SNRMF + A I + C Sbjct: 383 ELKNNLLSIGQLQEKGLTILFQHGKCKVFHSQKGLIMDTKMSSNRMFMLYALSQPISSTC 442 Query: 667 LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488 + E+ QLWH RYGHLS++GL L ++MV GLP+ + +K+C +C +GKQHR ++ Sbjct: 443 FNTVTEDILQLWHCRYGHLSFQGLKTLQQRKMVNGLPQFQPPSKLCKDCLVGKQHRSSIP 502 Query: 487 KQSTWRASRKLELIHSDICGPINPASNSNK 398 K+S WRA+ L+L+H+DICGPINP SNS K Sbjct: 503 KKSNWRAAEILQLVHADICGPINPISNSKK 532