BLASTX nr result
ID: Astragalus22_contig00025015
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00025015 (1989 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNX77624.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 422 e-140 gb|PNX89231.1| Ty3/gypsy retrotransposon protein, partial [Trifo... 419 e-138 gb|KYP53324.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] 429 e-137 gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium prat... 430 e-136 gb|AAO23078.1| polyprotein [Glycine max] 441 e-135 gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 430 e-132 gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] 429 e-132 gb|KYP53386.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] 414 e-132 gb|PNY16670.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 413 e-131 gb|PNY00079.1| hypothetical protein L195_g023353, partial [Trifo... 409 e-131 gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 429 e-130 gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifo... 419 e-128 dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subt... 421 e-128 gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo... 419 e-127 gb|PNY14541.1| hypothetical protein L195_g011223 [Trifolium prat... 402 e-127 dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt... 417 e-126 dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte... 416 e-125 dbj|GAU47333.1| hypothetical protein TSUD_101210 [Trifolium subt... 405 e-125 dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt... 415 e-125 gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifo... 403 e-125 >gb|PNX77624.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] gb|PNY16672.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 367 Score = 422 bits (1086), Expect = e-140 Identities = 202/330 (61%), Positives = 252/330 (76%), Gaps = 1/330 (0%) Frame = +2 Query: 14 MSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLE 193 M IVKLHG+PKSIVSDRD+VFTSSFWQ LFKL GT+LAMSSAYHPQSDGQ+E LNK LE Sbjct: 1 MHNIVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLNKVLE 60 Query: 194 MYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPP 373 ++LRCFTF+NPK+W K ++WAEYWYN++F TS+GMTPFKALYGRDPP L + DPP Sbjct: 61 LFLRCFTFNNPKSWSKVISWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEAQEIDPP 120 Query: 374 EVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLAL 553 + +++ ER+ MK QADK R+DV+F++G++VLV+LQPYRQ S+AL Sbjct: 121 TLQEELKERDKLLQQLKSNLEKAQQYMKHQADKHRKDVKFQVGEMVLVRLQPYRQQSVAL 180 Query: 554 RKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPL 733 RKN KLGMRYFGPFEI+A +G VAYKLKLP AKIH VFHVSQLK F+G+ +E YLPLPL Sbjct: 181 RKNQKLGMRYFGPFEILACVGNVAYKLKLPDNAKIHPVFHVSQLKPFKGNVTEHYLPLPL 240 Query: 734 TSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLE 913 T + GPI+ P AVLQ RTI+ G V Q+L+ WE N++D ATWE++ + + +PT NLE Sbjct: 241 TMNDTGPIIQPVAVLQARTIRKGTQKVHQILVQWEQNSKDAATWEDLHDLQFKFPTLNLE 300 Query: 914 DKIQFKGEGIVMNESAAEV-ENEMPRRSMR 1000 DK+ F GEGIVM + ++ EN+ +S R Sbjct: 301 DKVVFNGEGIVMRPNTTKILENDDSAKSQR 330 >gb|PNX89231.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense] Length = 408 Score = 419 bits (1076), Expect = e-138 Identities = 212/372 (56%), Positives = 259/372 (69%), Gaps = 27/372 (7%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GT+LAMSSAYHPQSDGQ+E LN Sbjct: 36 AEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLN 95 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCFTF+NPK+W+K ++WAEYWYN++F TS+GMTPFKALYGR+PP LV+ Sbjct: 96 KVLELFLRCFTFNNPKSWFKVISWAEYWYNTAFQTSIGMTPFKALYGREPPYLVKYEAHE 155 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +D P + ++ R+ MK QADK R+DV+F+IG+LVLV+LQPYRQ Sbjct: 156 NDSPALQDELRGRDKILQQLKSNLERAQQYMKHQADKHRKDVKFQIGELVLVRLQPYRQQ 215 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPFEII +G VAYKLKLP AKIH VFHVSQLK F+G + E YL Sbjct: 216 SVALRKNQKLGMRYFGPFEIIDCVGKVAYKLKLPDNAKIHPVFHVSQLKPFKGTTDEQYL 275 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT + GPI+ AVLQ RTI G V Q+L+ WE +++ ATWEN+ + + +P Sbjct: 276 PLPLTMADTGPIIQSAAVLQARTIMQGSQKVHQILVQWEQQSKEEATWENLHDLQLKFPA 335 Query: 902 FNLEDKIQFKGEGIVM----------NESAA---------------EVENEM--PRRSMR 1000 NLEDK+ FKGEGIVM N SAA E N++ PRR R Sbjct: 336 LNLEDKVVFKGEGIVMRPNVTQLLEENMSAAQSHGDPQNALEMETVEENNKLMGPRRGQR 395 Query: 1001 ARKASVKLNDYC 1036 ARK+ +C Sbjct: 396 ARKSHSMWKTFC 407 >gb|KYP53324.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] Length = 751 Score = 429 bits (1102), Expect = e-137 Identities = 203/316 (64%), Positives = 245/316 (77%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE F IVKLHG+PKSIVSDRD+VFTS+FWQHLFKL GTTLAMSSAYHPQ+DGQSE LN Sbjct: 380 AEAFTLHIVKLHGLPKSIVSDRDKVFTSNFWQHLFKLQGTTLAMSSAYHPQTDGQSEVLN 439 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 KCLEM+LRCFTFDNPK+W K L WAEYWYN+SFHTSLGMTPFKALYGRDPP L R + Sbjct: 440 KCLEMFLRCFTFDNPKSWSKGLTWAEYWYNTSFHTSLGMTPFKALYGRDPPTLTRYQRSP 499 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +DP +V Q+T+R+ RMK QADK+R D+QF++GD VLVKLQPYRQH Sbjct: 500 TDPRDVQDQLTKRDQLLDQLKCNLTKAQQRMKHQADKKRSDMQFQVGDQVLVKLQPYRQH 559 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ LRK+ KL MRYFGPF+++ ++G VAYKL+LP TA+IH VFH+SQLK F+G S+ PY+ Sbjct: 560 SVVLRKHQKLSMRYFGPFKVLGKVGVVAYKLELPETARIHPVFHISQLKPFKGISNAPYM 619 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT++ELGP L P A+L RTI G L+ QVL+ W+ ++ +WE+V K H+P Sbjct: 620 PLPLTTSELGPFLQPVAILHARTILQGSKLLSQVLVQWDPSSNVPNSWEDVTFIKTHFPY 679 Query: 902 FNLEDKIQFKGEGIVM 949 NLEDK+ KGEG VM Sbjct: 680 LNLEDKVVLKGEGNVM 695 >gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium pratense] Length = 869 Score = 430 bits (1106), Expect = e-136 Identities = 213/363 (58%), Positives = 268/363 (73%), Gaps = 19/363 (5%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE+FM +VKLHG+PKSIVSDRD+VF S FW+ LF+L GTTL+MSSAYHPQ+DGQSE+LN Sbjct: 505 AEVFMKTVVKLHGLPKSIVSDRDKVFISKFWKELFQLQGTTLSMSSAYHPQTDGQSEALN 564 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 KCLEMYLRC TF NPK+W+KAL WAEYWYN+++H SLGMTPF+ALYGR PP LVR + Sbjct: 565 KCLEMYLRCLTFQNPKSWFKALDWAEYWYNTAYHNSLGMTPFQALYGRTPPTLVRYTHSP 624 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +D +V +Q+ ER+ MK QADK RRD QFE+G+ VLVKLQPYRQ+ Sbjct: 625 TDTLDVQQQLMERDRLIATLKDNLKRAQQIMKNQADKHRRDAQFEVGEQVLVKLQPYRQN 684 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPF II ++G VAYK++LP AKIH VFH+SQLK+F+G +++PY+ Sbjct: 685 SVALRKNQKLGMRYFGPFTIIEKVGKVAYKVQLPVEAKIHPVFHISQLKQFKGRATDPYI 744 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT+ ELGPIL P AVLQ+R I + ++QVLI WE N+ ATWE+V+E ++YP Sbjct: 745 PLPLTTHELGPILQPIAVLQRRDIVRNEHAIQQVLIKWEGLNDTDATWEDVDEITENYPN 804 Query: 902 FNLEDKIQFKGEGIVMNESAAE-------VENE----------MP--RRSMRARKASVKL 1024 FNLEDK++ KG+GI M E + +ENE MP R+ +R R S+KL Sbjct: 805 FNLEDKVEVKGKGIAMEEPRQQKGQVSKILENEGATKSVSAPQMPGMRKGVRPRAPSIKL 864 Query: 1025 NDY 1033 D+ Sbjct: 865 RDF 867 >gb|AAO23078.1| polyprotein [Glycine max] Length = 1552 Score = 441 bits (1134), Expect = e-135 Identities = 213/344 (61%), Positives = 256/344 (74%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FMS IVKLHG+P+SIVSDRDRVFTS+FWQHLFKL GTTLAMSSAYHPQSDGQSE LN Sbjct: 1189 AEAFMSHIVKLHGIPRSIVSDRDRVFTSTFWQHLFKLQGTTLAMSSAYHPQSDGQSEVLN 1248 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 KCLEMYLRCFT+++PK W KAL WAE+WYN+++H SLGMTPF+ALYGR+PP L R + Sbjct: 1249 KCLEMYLRCFTYEHPKGWVKALPWAEFWYNTAYHMSLGMTPFRALYGREPPTLTRQACSI 1308 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 DP EV +Q+T+R+A MK QADK+R DV F+IGD VLVKLQPYRQH Sbjct: 1309 DDPAEVREQLTDRDALLAKLKINLTRAQQVMKRQADKKRLDVSFQIGDEVLVKLQPYRQH 1368 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S LRKN KL MRYFGPF+++A+IG VAYKL+LPS A+IH VFHVSQLK F G + +PYL Sbjct: 1369 SAVLRKNQKLSMRYFGPFKVLAKIGDVAYKLELPSAARIHPVFHVSQLKPFNGTAQDPYL 1428 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT TE+GP++ P +L R I G +EQ+L+ WE+ +D ATWE++E+ K YPT Sbjct: 1429 PLPLTVTEMGPVMQPVKILASRIIIRGHNQIEQILVQWENGLQDEATWEDIEDIKASYPT 1488 Query: 902 FNLEDKIQFKGEGIVMNESAAEVENEMPRRSMRARKASVKLNDY 1033 FNLEDK+ FKGEG V N + + S R KL D+ Sbjct: 1489 FNLEDKVVFKGEGNVTNGMSRGEKVNNTAESSSERGLHNKLADF 1532 >gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1258 Score = 430 bits (1106), Expect = e-132 Identities = 216/365 (59%), Positives = 263/365 (72%), Gaps = 19/365 (5%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE M IVKLHG+PKSIVSDRD+VFTSSFWQ LFKL GTTLAMSSAYHPQSDGQSE LN Sbjct: 894 AEAVMDNIVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTTLAMSSAYHPQSDGQSEVLN 953 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCFTFDNPK+W KAL+W+E+WYN++F TS+GMTPFKALYGRDPP L+R Sbjct: 954 KTLELFLRCFTFDNPKSWCKALSWSEFWYNTAFQTSIGMTPFKALYGRDPPALIRYETQA 1013 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +DPP + +++ ER+ MK QADK R DV+ ++GDLVLVKLQPYRQ Sbjct: 1014 NDPPTLQEKLMERDRIIQQLKLNLEKAQQYMKKQADKHRVDVKLQVGDLVLVKLQPYRQQ 1073 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPFE+IA++G VAYKLKLP AKIH VFHVSQLK F+GD+ E Y+ Sbjct: 1074 SVALRKNQKLGMRYFGPFEVIAKVGEVAYKLKLPEHAKIHPVFHVSQLKPFKGDNQEQYM 1133 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPL+ T+ GP++ P +VL RTI G ++QVLI W+ + ATWE+V+ + +P Sbjct: 1134 PLPLSMTDTGPMIQPVSVLATRTIIRGAQRIQQVLIQWDQYSTAEATWEDVDALQSKFPA 1193 Query: 902 FNLEDKIQFKGEGIVMN----------ESAAEVENEM---------PRRSMRARKASVKL 1024 FNLEDK+ F G+GIVM+ ESA E N+M PRR R RK S +L Sbjct: 1194 FNLEDKVAFIGDGIVMSPMEENILQEGESAKEGLNDMHERNSVMMGPRRGKRVRKTSKRL 1253 Query: 1025 NDYCV 1039 Y + Sbjct: 1254 EGYAL 1258 >gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] Length = 1210 Score = 429 bits (1102), Expect = e-132 Identities = 203/316 (64%), Positives = 245/316 (77%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE F IVKLHG+PKSIVSDRD+VFTS+FWQHLFKL GTTLAMSSAYHPQ+DGQSE LN Sbjct: 839 AEAFTLHIVKLHGLPKSIVSDRDKVFTSNFWQHLFKLQGTTLAMSSAYHPQTDGQSEVLN 898 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 KCLEM+LRCFTFDNPK+W K L WAEYWYN+SFHTSLGMTPFKALYGRDPP L R + Sbjct: 899 KCLEMFLRCFTFDNPKSWSKGLTWAEYWYNTSFHTSLGMTPFKALYGRDPPTLTRYQRSP 958 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +DP +V Q+T+R+ RMK QADK+R D+QF++GD VLVKLQPYRQH Sbjct: 959 TDPRDVQDQLTKRDQLLDQLKCNLTKAQQRMKHQADKKRSDMQFQVGDQVLVKLQPYRQH 1018 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ LRK+ KL MRYFGPF+++ ++G VAYKL+LP TA+IH VFH+SQLK F+G S+ PY+ Sbjct: 1019 SVVLRKHQKLSMRYFGPFKVLGKVGVVAYKLELPETARIHPVFHISQLKPFKGISNAPYM 1078 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT++ELGP L P A+L RTI G L+ QVL+ W+ ++ +WE+V K H+P Sbjct: 1079 PLPLTTSELGPFLQPVAILHARTILQGSKLLSQVLVQWDPSSNVPNSWEDVTFIKTHFPY 1138 Query: 902 FNLEDKIQFKGEGIVM 949 NLEDK+ KGEG VM Sbjct: 1139 LNLEDKVVLKGEGNVM 1154 >gb|KYP53386.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] Length = 673 Score = 414 bits (1064), Expect = e-132 Identities = 199/322 (61%), Positives = 243/322 (75%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE F IVKLHG+PKSIVSDRD+VFTS+FWQHLFKL GTTLAMSS YH Q+DGQS++LN Sbjct: 302 AEAFTLHIVKLHGLPKSIVSDRDKVFTSTFWQHLFKLHGTTLAMSSTYHLQTDGQSKALN 361 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 KCLEM+L CFTF+NPK+W K L WAEYWYN+SFHTSLGM+PFKALYGRDPP L R + Sbjct: 362 KCLEMFLSCFTFENPKSWSKGLTWAEYWYNTSFHTSLGMSPFKALYGRDPPTLTRYQRSP 421 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 + P +V Q+TER+ MK QADK+R D+QF++GD VLVKLQPYRQH Sbjct: 422 AYPSDVQDQLTERDQLLDQLKCNLTKAQQHMKHQADKKRFDMQFQVGDQVLVKLQPYRQH 481 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ LRK+ KL MRYFGPF++I ++G VAYKL+LP TA+IH VFH+SQLK F+G S+EPY+ Sbjct: 482 SVVLRKHQKLSMRYFGPFKVIGKVGVVAYKLELPETARIHPVFHISQLKPFKGVSNEPYM 541 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT++ELGPIL P A+L RTI G L QVL+ W+ + +WE+V K H+P Sbjct: 542 PLPLTTSELGPILQPVAILHARTILQGSKLHSQVLVQWDPSINVPNSWEDVTFIKTHFPH 601 Query: 902 FNLEDKIQFKGEGIVMNESAAE 967 +LEDK+ KGEG VM S + Sbjct: 602 IDLEDKVVLKGEGNVMKMSVGQ 623 >gb|PNY16670.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 752 Score = 413 bits (1061), Expect = e-131 Identities = 203/356 (57%), Positives = 248/356 (69%), Gaps = 12/356 (3%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM I KLHG+PKSIVSDRD+VFTS FWQ+LFK GT+LAMS+AYHPQ+DGQSE+LN Sbjct: 384 AEAFMLNIAKLHGIPKSIVSDRDKVFTSGFWQNLFKRQGTSLAMSTAYHPQTDGQSEALN 443 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 KCLEMYLRCFTF NPK WYK L AEYWYN++FH S GMTPF+ALYGR+ P L+R + Sbjct: 444 KCLEMYLRCFTFQNPKGWYKILPMAEYWYNTTFHNSAGMTPFRALYGREAPTLIRYVAQT 503 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 SDPP V +Q+ +R+ MK QA K+R+DV+F++GD VLV+LQPYRQH Sbjct: 504 SDPPSVKEQLIQRDVIMDQLKQNLMRAQHVMKHQAGKKRKDVEFKLGDKVLVRLQPYRQH 563 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S ALRKN KL MRYFGPFE+IA+IG VAYKL LP +AKIH+VFHV+QLK+F+G + +PYL Sbjct: 564 SAALRKNQKLSMRYFGPFEVIAKIGTVAYKLDLPPSAKIHSVFHVAQLKEFKGSNDDPYL 623 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT+TE+GP L P VL R + Q+LI W + A WE+ E K +YP Sbjct: 624 PLPLTTTEVGPTLYPTQVLDSRMVMQASVANPQILIQWGNEANAEAKWEDYNEIKNNYPE 683 Query: 902 FNLEDKIQFKGEGIVM-----NESAAEVENEM-------PRRSMRARKASVKLNDY 1033 NLEDK++FKG GIVM N + +M R+ R R + KL Y Sbjct: 684 LNLEDKVEFKGGGIVMKGIMGNRGKSSDSTQMITTNEDGIRKGSRKRVTNTKLKGY 739 >gb|PNY00079.1| hypothetical protein L195_g023353, partial [Trifolium pratense] Length = 641 Score = 409 bits (1050), Expect = e-131 Identities = 197/309 (63%), Positives = 243/309 (78%), Gaps = 1/309 (0%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM +VKLHG+PKSI+SDRD+VF S FW+ LF L GTTL+MSSAYHPQ+DGQSE+LN Sbjct: 334 AEAFMKTVVKLHGLPKSIISDRDKVFISKFWKELFSLQGTTLSMSSAYHPQTDGQSEALN 393 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 KCLEMYLRC TF NPKTW+KAL WAEYWYN++ HTSLGMTPF+ALYGR PP LVR + Sbjct: 394 KCLEMYLRCLTFQNPKTWFKALDWAEYWYNTAHHTSLGMTPFQALYGRAPPTLVRYNHSP 453 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 SD V +Q+ ER+ MK QADK RR+VQFE+G+ VLVKLQPYRQ+ Sbjct: 454 SDTVTVQQQLMERDVLITTLKDNLNRAQQVMKAQADKHRREVQFEVGEHVLVKLQPYRQN 513 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPF II +IG VAYKL+LP+ AKIH VFH+SQLK+F+G + +PY+ Sbjct: 514 SVALRKNQKLGMRYFGPFTIIEKIGKVAYKLQLPAEAKIHPVFHISQLKQFKGQAFDPYI 573 Query: 722 PLPLTSTELGPILLPRAVLQKR-TIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYP 898 PLPLT+TELGP+L P AVL++R ++NG+ ++ QVLI W+ N ATWE+ + ++YP Sbjct: 574 PLPLTTTELGPVLQPVAVLKRRDRLRNGE-VISQVLIKWQGLNNTDATWEDAADIVENYP 632 Query: 899 TFNLEDKIQ 925 TFNLEDK++ Sbjct: 633 TFNLEDKVE 641 >gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1535 Score = 429 bits (1102), Expect = e-130 Identities = 209/326 (64%), Positives = 247/326 (75%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GT+LAMSSAYHPQSDGQ+E LN Sbjct: 1166 AESFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLN 1225 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCFTF NPK+W K L+WAEYWYN++F TS+GMTPFKALYGRDPP L + Sbjct: 1226 KALELFLRCFTFHNPKSWSKVLSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEAQV 1285 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +DPP + +++ ER+ MK QADK R+DV F++GDLVLVKLQPYRQ Sbjct: 1286 TDPPALQEELMERDKILQQLKSNLDRAQQYMKKQADKHRKDVTFQVGDLVLVKLQPYRQQ 1345 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPFEIIA IGAVAYKLKLP AKIH VFHVSQLK F+G +S+ YL Sbjct: 1346 SVALRKNQKLGMRYFGPFEIIACIGAVAYKLKLPDNAKIHPVFHVSQLKPFKGAASDQYL 1405 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT TE GPI+ P AVLQ RTI G V Q+L+ W+ N E ATWE+ ++ + +PT Sbjct: 1406 PLPLTMTETGPIMQPIAVLQARTIMRGTQRVHQILVQWDTNAEAEATWEDFDDLQLKFPT 1465 Query: 902 FNLEDKIQFKGEGIVMNESAAEVENE 979 NLEDK+ F GEGIVM + + E Sbjct: 1466 LNLEDKVVFNGEGIVMRPNTTNLLEE 1491 >gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense] Length = 1240 Score = 419 bits (1077), Expect = e-128 Identities = 199/316 (62%), Positives = 243/316 (76%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM +VKLHG+PKSIVSDRD+VFTS+FWQHLFKL GTTLAM+SAYHPQSDGQ+E LN Sbjct: 894 AEAFMHNVVKLHGMPKSIVSDRDKVFTSTFWQHLFKLQGTTLAMTSAYHPQSDGQTEVLN 953 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE+YLRCF+F+NPK+W+K L+W+E+WYN++F TS+GMTPFKALYGRDPP L R + Sbjct: 954 KGLELYLRCFSFNNPKSWFKMLSWSEFWYNTAFQTSIGMTPFKALYGRDPPYLTRYVAQA 1013 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 SDPP + +++ ER+ MK QADK R D+ +IGDLVLVKLQPYRQH Sbjct: 1014 SDPPTLQEELMERDKILQQLKDNLIRAQQYMKKQADKHRSDISLKIGDLVLVKLQPYRQH 1073 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLG+RYFGPFEIIAR+G VAYKLKLP AKIH VFHVSQLK F+G + E YL Sbjct: 1074 SVALRKNQKLGLRYFGPFEIIARVGEVAYKLKLPDDAKIHPVFHVSQLKPFKGVADEQYL 1133 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT T++GP + P VLQ RT+ G + QVLI W+ ATWE++ ++ +P+ Sbjct: 1134 PLPLTMTDIGPSIQPIDVLQVRTVIRGSQQIHQVLIQWDQYPAAQATWEDITTIQEKFPS 1193 Query: 902 FNLEDKIQFKGEGIVM 949 NLEDK+ F G+GIVM Sbjct: 1194 LNLEDKVAFNGDGIVM 1209 >dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subterraneum] Length = 1451 Score = 421 bits (1081), Expect = e-128 Identities = 198/323 (61%), Positives = 250/323 (77%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE+FM+ IVKLHG+PKSIVSDRD+VFTSSFWQHLFKL GT+LAMSSAYHPQSDGQ+E LN Sbjct: 1111 AEVFMNNIVKLHGMPKSIVSDRDKVFTSSFWQHLFKLQGTSLAMSSAYHPQSDGQTEVLN 1170 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCFTF+NPK+WYKALAW+E+WYN++ HTS+GMTPFKALYGR+PP L R + + Sbjct: 1171 KGLELFLRCFTFNNPKSWYKALAWSEFWYNTALHTSIGMTPFKALYGREPPTLTRYEVQD 1230 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +DPP + +++ ER+ MK QADK R + +F +GD+VLVKLQPYRQ Sbjct: 1231 NDPPALQEELMERDRILQQLKSNLERAQQYMKKQADKHRVEFKFHLGDMVLVKLQPYRQQ 1290 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPFEIIA +G VAYKLKLP AKIH VFHVSQLK F+G + Y+ Sbjct: 1291 SVALRKNQKLGMRYFGPFEIIACVGKVAYKLKLPDHAKIHLVFHVSQLKPFKGVPQQQYM 1350 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT + GP++ P VLQ RTI G + Q+L+ W+ + ATWENV++ ++++P Sbjct: 1351 PLPLTMFDNGPMIQPVEVLQARTIMQGTQKIHQILVQWDQYDIAEATWENVDDLQKNFPL 1410 Query: 902 FNLEDKIQFKGEGIVMNESAAEV 970 +NLEDK+ FKG+GIVM ++ Sbjct: 1411 YNLEDKVIFKGDGIVMRPKGEDI 1433 >gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense] Length = 1502 Score = 419 bits (1076), Expect = e-127 Identities = 197/323 (60%), Positives = 244/323 (75%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM IVKLHG+PKSIVSDRD+VFTS+FWQHLFK+ GT+LAMSSAYHPQ+DGQ+E LN Sbjct: 1169 AEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQHLFKMQGTSLAMSSAYHPQTDGQTEVLN 1228 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCFTF NPK+W+K ++WAEYWYN++F TS+GMTPFKALYGRDPP L + + Sbjct: 1229 KTLELFLRCFTFHNPKSWFKVMSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEVQV 1288 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 DPP + +++ ER+ MK QADK RR+V F++GDLVLVKLQPY+Q Sbjct: 1289 DDPPALREELMERDQILQQLKTNLERAQQYMKQQADKHRREVSFKVGDLVLVKLQPYKQQ 1348 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPFE+IA +G VAYKL+LP AKIH VFHVSQLK F G S E YL Sbjct: 1349 SVALRKNQKLGMRYFGPFEVIACVGKVAYKLQLPENAKIHPVFHVSQLKPFHGTSQEQYL 1408 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT ++ GPI P +LQ RTI G V Q+ I W+ N+ + A+WE+++E + +P Sbjct: 1409 PLPLTMSDTGPIFQPATILQARTIVRGNKKVHQLQIQWDLNSPEEASWEDLDELQNKFPN 1468 Query: 902 FNLEDKIQFKGEGIVMNESAAEV 970 NLEDK+ FKGEGIVM + + Sbjct: 1469 INLEDKVVFKGEGIVMRPNNTNI 1491 >gb|PNY14541.1| hypothetical protein L195_g011223 [Trifolium pratense] Length = 763 Score = 402 bits (1033), Expect = e-127 Identities = 193/316 (61%), Positives = 235/316 (74%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE F+ IVKLHG+ KSIVSDRD+VFTS+FWQ LFKL GT+L MSSAYHPQSDGQ+E LN Sbjct: 391 AEAFIHNIVKLHGMSKSIVSDRDKVFTSNFWQQLFKLQGTSLTMSSAYHPQSDGQTEVLN 450 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCF+F+NPK+WYK L+WAEYWYN++F TS+GMTPFKALYGR+PP L + Sbjct: 451 KGLELFLRCFSFNNPKSWYKVLSWAEYWYNTTFQTSIGMTPFKALYGREPPSLTKYEAHA 510 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 D P + +++ ER+ MK QADK R +V ++G+LVLVKLQPYRQ Sbjct: 511 DDSPTIQEELMERDKILQQLKTNLDRAQQYMKKQADKNRTEVNLQVGELVLVKLQPYRQQ 570 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPFEIIARIG VAYKLKLP AKIH VFHVSQLK F+G + + YL Sbjct: 571 SVALRKNQKLGMRYFGPFEIIARIGKVAYKLKLPDNAKIHPVFHVSQLKPFKGTTQDQYL 630 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT +E+GPI+ P ++L RTI V Q+LI W+ TWE+ ++ + +PT Sbjct: 631 PLPLTMSEVGPIIQPVSILDARTIVRESQKVHQILIQWDQTTPAETTWEDFDDLQNKFPT 690 Query: 902 FNLEDKIQFKGEGIVM 949 NLEDKI F GEGIVM Sbjct: 691 LNLEDKIVFNGEGIVM 706 >dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum] Length = 1512 Score = 417 bits (1072), Expect = e-126 Identities = 203/323 (62%), Positives = 243/323 (75%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GT+LAMSSAYHPQSDGQSE LN Sbjct: 1166 AEAFMHHIVKLHGMPKSIVSDRDKVFTSNFWQQLFKLQGTSLAMSSAYHPQSDGQSEVLN 1225 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 + LE++LRCFTF+NPK WYKAL+W+E+WYN++F TS+GMTPFKALYGRDPP LVR Sbjct: 1226 RTLELFLRCFTFNNPKAWYKALSWSEFWYNTAFQTSIGMTPFKALYGRDPPTLVRYEAQA 1285 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 DPP + +++ R+ MK QADK RRD++ ++GDLVLVKLQPYRQ Sbjct: 1286 GDPPALQEELMGRDKLLQQLKSNLERAQQYMKRQADKHRRDIKLQVGDLVLVKLQPYRQQ 1345 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 SLALRKN KLGMRYFGPFEI+A++G VAYKLKLP AKIH VFH+SQLK F+G S + L Sbjct: 1346 SLALRKNQKLGMRYFGPFEILAKVGEVAYKLKLPDHAKIHPVFHISQLKPFKGISQDQSL 1405 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT ++ GP++ P AVL RTI G V QVLI W+ E ATWE V + +P Sbjct: 1406 PLPLTMSDTGPLIQPIAVLAARTILKGIQKVHQVLIQWDQYPEAEATWEEVTNLQSKFPY 1465 Query: 902 FNLEDKIQFKGEGIVMNESAAEV 970 FNLEDK+ FKG+GIVM+ +V Sbjct: 1466 FNLEDKVVFKGDGIVMSPKEGKV 1488 >dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum] Length = 1500 Score = 416 bits (1068), Expect = e-125 Identities = 208/370 (56%), Positives = 256/370 (69%), Gaps = 26/370 (7%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM IVKLHG+PKSIVSDRDRVFTS+FWQHLFKL GT+LAMSSAYHPQSDGQ+E LN Sbjct: 1131 AETFMHNIVKLHGMPKSIVSDRDRVFTSTFWQHLFKLQGTSLAMSSAYHPQSDGQTEVLN 1190 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCFTF+NPK+W+K L+W+EYWYN+SF TS+GMTPF+ALYGR PP L + + Sbjct: 1191 KALELFLRCFTFNNPKSWFKVLSWSEYWYNTSFQTSIGMTPFQALYGRLPPYLTKYVPQE 1250 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +DPP + ++ ER+ MK QADK RRD+ +++GD VL+KLQPYRQH Sbjct: 1251 NDPPTLQAELIERDNLLQQLKTNLERAQQYMKKQADKHRRDISYQVGDFVLIKLQPYRQH 1310 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGMRYFGPFEIIA +G +AYKL LP AKIH VFHVSQLK F+G + + Y+ Sbjct: 1311 SVALRKNQKLGMRYFGPFEIIACVGTIAYKLNLPENAKIHPVFHVSQLKPFKGTTQDQYM 1370 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT +E GPI+ P AVLQ RTIQ G V QV I W+ E A+WE++++ K +PT Sbjct: 1371 PLPLTMSETGPIIQPIAVLQARTIQRGMQKVHQVQIQWDQTAE--ASWEDLDDLKNKFPT 1428 Query: 902 FNLEDKIQFKGEGIVM-------------------------NESAAEVENEM-PRRSMRA 1003 NLEDK+ +G IVM S AE++ ++ PRR R Sbjct: 1429 LNLEDKVVVEGGSIVMKPNINNILEAKVPANSIGDPQNMYDGNSVAEIKEDLGPRRGKRV 1488 Query: 1004 RKASVKLNDY 1033 RK DY Sbjct: 1489 RKTHGIWKDY 1498 >dbj|GAU47333.1| hypothetical protein TSUD_101210 [Trifolium subterraneum] Length = 1017 Score = 405 bits (1042), Expect = e-125 Identities = 197/311 (63%), Positives = 235/311 (75%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GTTLAMSS+YHPQSDGQ+E LN Sbjct: 704 AEAFMDNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTTLAMSSSYHPQSDGQTEVLN 763 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCF+F+NPK+W K L+W+E+WYN++F TS+GMTPFKALYGRDPP L R + Sbjct: 764 KGLELFLRCFSFNNPKSWSKMLSWSEFWYNTAFQTSIGMTPFKALYGRDPPYLTRYVAQE 823 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +DPP + +++ ER MK QADK R D+ +GDLVLVKLQPYRQH Sbjct: 824 NDPPALQEELMERGRILQQLKNNLIRAQQYMKKQADKHRSDITLNVGDLVLVKLQPYRQH 883 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLG+RYFGPFEIIAR+G VAYKL+LP AKIH VFHVSQLK F+G + E YL Sbjct: 884 SVALRKNKKLGLRYFGPFEIIARVGDVAYKLQLPKNAKIHPVFHVSQLKPFKGVAQEQYL 943 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT TE+GPI+ P VLQ RTI G V QVLI W+ + ATWE+V K +P+ Sbjct: 944 PLPLTMTEIGPIVQPIDVLQARTIIQGLQKVHQVLIQWDQYSAAEATWEDVTTVKDKFPS 1003 Query: 902 FNLEDKIQFKG 934 NLEDK+ F G Sbjct: 1004 LNLEDKVSFYG 1014 >dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum] Length = 1531 Score = 415 bits (1067), Expect = e-125 Identities = 208/363 (57%), Positives = 257/363 (70%), Gaps = 19/363 (5%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM IVKLHG+PKSIVSDRD+VFTSSFWQ LFKL GT+LAMSSAYHPQSDGQSE LN Sbjct: 1167 AEAFMDNIVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTSLAMSSAYHPQSDGQSEVLN 1226 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCFTF+NPK+W KALAW+E+WYN++F TS+GMTPFKALYGRDPP ++R I Sbjct: 1227 KTLELFLRCFTFENPKSWCKALAWSEFWYNTAFQTSIGMTPFKALYGRDPPAIIRYEIQA 1286 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 SD P + +++ ER+ MK QADK R DV+ ++GD VLVKLQPYRQ Sbjct: 1287 SDSPTLQEKLMERDRIIQQLKLNLEKAQQYMKKQADKHRVDVKLQVGDWVLVKLQPYRQQ 1346 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S+ALRKN KLGM+YFGPFE+IA++G VAYKLKLP AKIH VFHVSQLK F+GD+ E Y+ Sbjct: 1347 SVALRKNQKLGMKYFGPFEVIAKVGEVAYKLKLPDHAKIHPVFHVSQLKPFKGDNQEQYM 1406 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPL+ T++GP++ P AVL RTI ++QVLI W+ ATWE++ ++ +PT Sbjct: 1407 PLPLSMTDIGPMIQPVAVLATRTIIRCAQRIQQVLIQWDQYPIAEATWEDMVALQRKFPT 1466 Query: 902 FNLEDKIQFKGEGIVMNESAAEVENE-------------------MPRRSMRARKASVKL 1024 FNLEDK+ F G+GIVM+ + + E PRR R R S +L Sbjct: 1467 FNLEDKVAFIGDGIVMSPNEENILEEGDSSNVGPPDKHEGNYVMMGPRRGKRMRNISKRL 1526 Query: 1025 NDY 1033 Y Sbjct: 1527 EGY 1529 >gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense] Length = 937 Score = 403 bits (1035), Expect = e-125 Identities = 194/312 (62%), Positives = 234/312 (75%) Frame = +2 Query: 2 AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181 AE FM IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GT+LAMSSAYHPQSDGQ+E LN Sbjct: 626 AEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLN 685 Query: 182 KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361 K LE++LRCF+F NPK+WYK L+WAEYWYN++F TS+GMTPFKALYGRDPP L + Sbjct: 686 KGLELFLRCFSFHNPKSWYKVLSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEAQV 745 Query: 362 SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541 +D P + +++ ER+ MK QADK R +V ++GDLVLVKLQPYRQ Sbjct: 746 TDSPALQEELMERDKILQQLKINLERAQQYMKKQADKHRSEVNLQVGDLVLVKLQPYRQQ 805 Query: 542 SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721 S++LRKN KLGMRYFGPFEIIAR+G VAYKLKLP AKIH VFHVSQLK F+G + + YL Sbjct: 806 SVSLRKNQKLGMRYFGPFEIIARVGNVAYKLKLPDNAKIHPVFHVSQLKPFKGIAQDQYL 865 Query: 722 PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901 PLPLT +E GPI+ P A L+ RTI G V Q+L+ W+ ATWE+++ + +PT Sbjct: 866 PLPLTMSETGPIIQPIAALEARTIMRGMQKVHQILVQWDQMPVTEATWEDLDVLQDKFPT 925 Query: 902 FNLEDKIQFKGE 937 NLEDKI F GE Sbjct: 926 LNLEDKIAFNGE 937