BLASTX nr result
ID: Astragalus24_contig00023419
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus24_contig00023419 (1031 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799... 298 5e-93 gb|PNX55196.1| hypothetical protein L195_g048823, partial [Trifo... 272 7e-86 dbj|GAU45812.1| hypothetical protein TSUD_115000 [Trifolium subt... 291 1e-85 dbj|GAU10615.1| hypothetical protein TSUD_418280, partial [Trifo... 279 2e-85 gb|PNX88023.1| hypothetical protein L195_g044123, partial [Trifo... 270 3e-84 gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo... 283 2e-82 gb|AAO23078.1| polyprotein [Glycine max] 282 3e-82 gb|PNX66565.1| hypothetical protein L195_g055159, partial [Trifo... 258 3e-82 gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense] 277 3e-81 dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subt... 279 3e-81 gb|PNX94483.1| retrotransposon-related protein, partial [Trifoli... 278 7e-81 gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 276 3e-80 dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt... 274 2e-79 gb|PNY16671.1| retrotransposon-related protein, partial [Trifoli... 273 4e-79 dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subt... 267 5e-77 dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte... 267 5e-77 gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] 265 2e-76 gb|PNX61186.1| hypothetical protein L195_g052324, partial [Trifo... 241 4e-76 dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt... 262 4e-75 dbj|GAU25507.1| hypothetical protein TSUD_279910 [Trifolium subt... 256 5e-73 >ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799639 [Glycine max] Length = 600 Score = 298 bits (763), Expect = 5e-93 Identities = 154/266 (57%), Positives = 198/266 (74%), Gaps = 10/266 (3%) Frame = -3 Query: 768 MADNTRMKEIYADLKKNAEAIELAATTND----------ARFSKIEANQAIADEKLTRIT 619 MA+NTRMKE+ +D+K+NAE+IE ND A S+ EA Q + K ++I Sbjct: 1 MAENTRMKELSSDIKRNAESIE--KMYNDFHEKIDRLEIANASRFEAMQTNTESKFSQIN 58 Query: 618 EALDALLNRDTRQVHFHGGPSSTQSGFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYNT 439 ALD LLN+ + HG +S++ FQV IKL+FPRF+GKNVL+WIF+AEQFF YY T Sbjct: 59 NALDMLLNQSPHKSS-HGVGNSSKQPFQVRNIKLEFPRFDGKNVLEWIFRAEQFFDYYGT 117 Query: 438 SDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALELDFGPSLYDCPRGALFKLK 259 D DR+TIASV+LD+DVVPWFQM+QR+ PF SW E TRALELDFGPS+Y+CPR LFKL Sbjct: 118 PDPDRLTIASVHLDKDVVPWFQMMQRSHPFHSWVEFTRALELDFGPSIYECPRATLFKLS 177 Query: 258 QTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAKV 79 QT +V YYL+FT+L+N+VYGLSNDA++DCFISGL +IRRDVM+HTP ++VK V+LAKV Sbjct: 178 QTGTVADYYLQFTSLANKVYGLSNDALIDCFISGLIPEIRRDVMIHTPISMVKVVSLAKV 237 Query: 78 YEEKHNAHTASLNKTQTSNTRAHFNP 1 YEEK+ + T+ +K+ SN+ H P Sbjct: 238 YEEKYTS-TSKPHKSTPSNSYNHRAP 262 >gb|PNX55196.1| hypothetical protein L195_g048823, partial [Trifolium pratense] Length = 348 Score = 272 bits (695), Expect = 7e-86 Identities = 136/235 (57%), Positives = 177/235 (75%) Frame = -3 Query: 726 KKNAEAIELAATTNDARFSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPSSTQ 547 K+N E + L R ++EA A EK ++ ALD L+ + R+ + HG + + Sbjct: 14 KRNKEEMMLFQAEILERMERLEAGTA---EKFDKVYAALDVLIEQSPRKQN-HGAGLNNR 69 Query: 546 SGFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMV 367 FQV +KL+FPRF+G NV +WIF+AEQFF YY+T D DR+TI+SV+LD+DVVPW+QMV Sbjct: 70 PPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYDTPDLDRLTISSVHLDKDVVPWYQMV 129 Query: 366 QRTTPFQSWQELTRALELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSN 187 QR+ PF SW E TRALELDFGPS+YDCPR LFKL QT +V YYL+FT+L+NRVYGLSN Sbjct: 130 QRSHPFTSWIEFTRALELDFGPSVYDCPRATLFKLTQTGTVAEYYLKFTSLANRVYGLSN 189 Query: 186 DAVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAKVYEEKHNAHTASLNKTQTSN 22 DA++DCF+SGLN++IRRDV++HTP +IVKAV+LAKVYEEK+ A + KT T+N Sbjct: 190 DALIDCFVSGLNNEIRRDVLIHTPSSIVKAVSLAKVYEEKY-ASNPNHKKTNTTN 243 >dbj|GAU45812.1| hypothetical protein TSUD_115000 [Trifolium subterraneum] Length = 1289 Score = 291 bits (744), Expect = 1e-85 Identities = 145/245 (59%), Positives = 183/245 (74%), Gaps = 11/245 (4%) Frame = -3 Query: 768 MADNTRMKEIYADLKKNAEAI-----------ELAATTNDARFSKIEANQAIADEKLTRI 622 MA+N+RMKE+ ++KKN + E AT +D RF +E + DE L R+ Sbjct: 1 MAENSRMKELSDEVKKNTADMKKLYDETQIQFEHFATVSDNRFKTMEDRHSSTDENLDRM 60 Query: 621 TEALDALLNRDTRQVHFHGGPSSTQSGFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYN 442 E+L LL R T Q HG +S ++ FQV +KLDFPRF+GKNV++WIF+AEQFF YY+ Sbjct: 61 NESLSMLL-RKTSQNSSHGATNSYKAPFQVRNVKLDFPRFDGKNVMEWIFRAEQFFDYYD 119 Query: 441 TSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALELDFGPSLYDCPRGALFKL 262 T D DR+TI +V+LD+DVVPWFQM+QRT PF SW E TRALELDFGPS+YDCPR +LFKL Sbjct: 120 TPDKDRLTITAVHLDQDVVPWFQMIQRTNPFNSWVEFTRALELDFGPSIYDCPRASLFKL 179 Query: 261 KQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAK 82 Q+ +V+ YY++FTAL+N VYGLS DA+VDCFISG+N +IRRDVM+HTP IVK V+LAK Sbjct: 180 NQSGTVSDYYIQFTALANMVYGLSIDALVDCFISGINPEIRRDVMIHTPITIVKDVSLAK 239 Query: 81 VYEEK 67 VYEEK Sbjct: 240 VYEEK 244 >dbj|GAU10615.1| hypothetical protein TSUD_418280, partial [Trifolium subterraneum] Length = 602 Score = 279 bits (713), Expect = 2e-85 Identities = 147/272 (54%), Positives = 186/272 (68%), Gaps = 17/272 (6%) Frame = -3 Query: 768 MADNTRMKEIYADLKKNAEAIELAATTNDARFSKIEAN---QAIADEKLTRITEALDALL 598 MADNT MKEIYA+LKKN EAIE +TT + ++E Q + +++ R E A L Sbjct: 1 MADNTHMKEIYAELKKNTEAIETVSTTLTGQIDRLELGGNAQLLRMKEIQRSNETQFAKL 60 Query: 597 NRDTRQV---------HFHGGPSS----TQSGFQVSKIKLDFPRFNGKNVLDWIFKAEQF 457 N + Q+ HGG +S QS FQV +KLDFPRF+GKNV+DWIFK EQF Sbjct: 61 NANIAQLLQRFSPGQSSSHGGRNSGNDQPQSSFQVRYVKLDFPRFDGKNVMDWIFKDEQF 120 Query: 456 FTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALELDFGPSLYDCPRG 277 F YY T D++R+ IA V+LD DVVPW+QM+Q+T PF +W LTRALELDFGPS YDCPR Sbjct: 121 FDYYATPDSERLLIALVHLDHDVVPWYQMIQKTNPFLTWSALTRALELDFGPSAYDCPRA 180 Query: 276 ALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKA 97 LFKL+Q+ SVN YY++FT+L NRV GLS DA++DCFISGL D+I RDV P N+ KA Sbjct: 181 TLFKLQQSGSVNDYYMQFTSLVNRVDGLSLDAILDCFISGLQDEINRDVKAMEPRNLSKA 240 Query: 96 VALAKVYEEKHNAHTASLNKTQTSN-TRAHFN 4 V LAK++EEK+ A+ ++ T N T + FN Sbjct: 241 VPLAKLFEEKYTANKTKISSTPAKNYTPSSFN 272 >gb|PNX88023.1| hypothetical protein L195_g044123, partial [Trifolium pratense] Length = 400 Score = 270 bits (689), Expect = 3e-84 Identities = 129/216 (59%), Positives = 170/216 (78%) Frame = -3 Query: 669 KIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPSSTQSGFQVSKIKLDFPRFNGKN 490 ++E +A +D K ++ A+D L+N+ ++ H HG T++ FQV +KL+FPRF G N Sbjct: 26 RMERLEASSDAKFDKLYAAMDVLINQSPKKQH-HG----TRAPFQVRNVKLEFPRFEGTN 80 Query: 489 VLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALELD 310 V +WIF+AEQFF YY+T D DR+TIASV+LD+DVVPW+QMVQRT PFQSW E TRALEL Sbjct: 81 VHEWIFRAEQFFEYYDTPDLDRLTIASVHLDKDVVPWYQMVQRTHPFQSWIEFTRALELS 140 Query: 309 FGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRRDV 130 FGPS+YDCPR LFKL QT +V YYL+FT L+NRVYGLSNDA++DCF+SGL+D+IRRDV Sbjct: 141 FGPSVYDCPRATLFKLNQTGTVAEYYLKFTTLANRVYGLSNDALIDCFVSGLHDEIRRDV 200 Query: 129 MLHTPPNIVKAVALAKVYEEKHNAHTASLNKTQTSN 22 ++HTP ++VKA +LAK+YEEK+ + T + K T+N Sbjct: 201 LIHTPSSLVKAFSLAKIYEEKYTS-TTNQKKLNTTN 235 >gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense] Length = 1502 Score = 283 bits (724), Expect = 2e-82 Identities = 138/243 (56%), Positives = 183/243 (75%), Gaps = 4/243 (1%) Frame = -3 Query: 741 IYADLKKNAEAIELAATTNDARFSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGG 562 I A+LK+NAE + L F ++E ++A EK +I A+D L+++ + H G Sbjct: 9 IEAELKRNAEDMRLYQAE---MFERLERSEAANKEKFDKIFTAIDILIDQSPSKHHHGAG 65 Query: 561 PSSTQSGFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVP 382 ++ + FQV +KL+FPRF+G NV +WIF+AEQFF YY+T D+DR+TI+SV+LD+DVVP Sbjct: 66 LNNNRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYDTPDSDRLTISSVHLDKDVVP 125 Query: 381 WFQMVQRTTPFQSWQELTRALELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRV 202 W+QMVQR PF SW E TRALELDFGPS+YDCPR LFKL QT +V YYL+FT+L+NRV Sbjct: 126 WYQMVQRLRPFTSWVEFTRALELDFGPSVYDCPRATLFKLSQTGTVAEYYLQFTSLANRV 185 Query: 201 YGLSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAKVYEEKH----NAHTASLNKT 34 YGLSNDA+VDCF+SGLN+ IRRDV++HTPP++VKAV+LAKVYEEK+ N A++N Sbjct: 186 YGLSNDAMVDCFVSGLNNQIRRDVLIHTPPSLVKAVSLAKVYEEKYADAMNTQKATINNH 245 Query: 33 QTS 25 T+ Sbjct: 246 STN 248 >gb|AAO23078.1| polyprotein [Glycine max] Length = 1552 Score = 282 bits (722), Expect = 3e-82 Identities = 147/273 (53%), Positives = 188/273 (68%), Gaps = 21/273 (7%) Frame = -3 Query: 768 MADNTRMKEIYADLKKNAEAI-----------ELAATTNDARFSKIEANQAIADEKLTRI 622 MADNTRMKE+YA+LKKNA+AI E TN A+ KIE Q+ D + +++ Sbjct: 1 MADNTRMKEVYAELKKNADAITRVSDDLQNHIERLEATNHAQMEKIEVMQSTNDSQFSQL 60 Query: 621 TEALDALLNR-DTRQVHFHGGPSSTQ----SGFQVSKIKLDFPRFNGKNVLDWIFKAEQF 457 + +L R + HG +S + S FQV +KLDFPRF+GKNV+DWIFKAEQF Sbjct: 61 NAVMSQVLQRLQNIPMSSHGASNSQKEQQRSSFQVRSVKLDFPRFDGKNVMDWIFKAEQF 120 Query: 456 FTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALELDFGPSLYDCPRG 277 F YY T DADR+ IASV+LD+DVVPW+QM+Q+T PF SWQ TRALELDFGPS YDCPR Sbjct: 121 FDYYATPDADRLIIASVHLDQDVVPWYQMLQKTEPFSSWQAFTRALELDFGPSAYDCPRA 180 Query: 276 ALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKA 97 LFKL Q+ +VN YY++FTAL NRV GLS +A++DCF+SGL ++I RDV P + KA Sbjct: 181 TLFKLNQSATVNEYYMQFTALVNRVDGLSAEAILDCFVSGLQEEISRDVKAMEPRTLTKA 240 Query: 96 VALAKVYEEKHNAHT-----ASLNKTQTSNTRA 13 VALAK++EEK+ + ++L + TSNT A Sbjct: 241 VALAKLFEEKYTSPPKTKTFSNLARNFTSNTSA 273 >gb|PNX66565.1| hypothetical protein L195_g055159, partial [Trifolium pratense] Length = 225 Score = 258 bits (659), Expect = 3e-82 Identities = 125/200 (62%), Positives = 162/200 (81%), Gaps = 1/200 (0%) Frame = -3 Query: 675 FSKIEANQAIADEKLTRITEALDALLNRD-TRQVHFHGGPSSTQSGFQVSKIKLDFPRFN 499 F ++E ++A EK RI ALD L+++ T+Q H G P+ + FQV +KL+FPRF+ Sbjct: 28 FERLERSEAENAEKFARIFTALDILIDQTPTKQNHGAGLPN--RPPFQVRNVKLEFPRFD 85 Query: 498 GKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRAL 319 G NV +WIF+AEQFF YY+T D DR+TIASV+LD+DVVPW+QMVQRT PF SW E TRAL Sbjct: 86 GSNVHEWIFRAEQFFDYYDTPDPDRLTIASVHLDKDVVPWYQMVQRTHPFTSWIEFTRAL 145 Query: 318 ELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIR 139 ELDFGPS+Y+CPR LFKL Q+ +V YYL+FT+L+NRVYGLSNDA++DCF+SGLN++IR Sbjct: 146 ELDFGPSIYECPRATLFKLTQSGTVAEYYLQFTSLANRVYGLSNDALIDCFVSGLNNEIR 205 Query: 138 RDVMLHTPPNIVKAVALAKV 79 RDV++HTP ++VKAV+LAKV Sbjct: 206 RDVLIHTPSSLVKAVSLAKV 225 >gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense] Length = 1084 Score = 277 bits (708), Expect = 3e-81 Identities = 135/218 (61%), Positives = 172/218 (78%) Frame = -3 Query: 675 FSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPSSTQSGFQVSKIKLDFPRFNG 496 F ++E ++A D+K RI ALD L+++ T HG + + FQV +KL+FPRF+G Sbjct: 28 FERLERSEAANDDKFNRIFAALDILIDQ-TPSKQNHGAGLNNRLPFQVRNVKLEFPRFDG 86 Query: 495 KNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALE 316 NV +WIF+AEQFF YY T D DR+TIASV+LD+DVVPW+QMVQRTTPFQSW + TRALE Sbjct: 87 TNVHEWIFRAEQFFDYYETPDPDRLTIASVHLDKDVVPWYQMVQRTTPFQSWIDFTRALE 146 Query: 315 LDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRR 136 LD+GPS+Y+CPR LFKL QT +V YYL+FT+L+NRVYGLSNDA+VDCFISGL D+IRR Sbjct: 147 LDYGPSIYECPRATLFKLTQTGTVAEYYLKFTSLANRVYGLSNDAMVDCFISGLTDEIRR 206 Query: 135 DVMLHTPPNIVKAVALAKVYEEKHNAHTASLNKTQTSN 22 DV++HTP +IVKAV+LAKVYEEK+ +T L +T +N Sbjct: 207 DVLIHTPTSIVKAVSLAKVYEEKYTTNT-KLPQTYQNN 243 >dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subterraneum] Length = 1451 Score = 279 bits (714), Expect = 3e-81 Identities = 138/242 (57%), Positives = 179/242 (73%), Gaps = 2/242 (0%) Frame = -3 Query: 723 KNAEAIELAATTNDARFSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPSSTQS 544 +N E ++L T F ++E ++A K +I ALD L+++ + H G + + Sbjct: 2 RNTEEMKLLQTEI---FERLERSKAENGAKFNKIFAALDILIDQTPSKQHHGAGLTHNRP 58 Query: 543 GFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQ 364 FQV +KL+FPRF+GKNV +WIF+AEQFF YY+T D DR+TIASV+LD+DVVPW+QM+Q Sbjct: 59 PFQVRNVKLEFPRFDGKNVHEWIFRAEQFFEYYDTPDLDRLTIASVHLDKDVVPWYQMMQ 118 Query: 363 RTTPFQSWQELTRALELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSND 184 RT PF SW ELTRALEL FGPS+YDCPR LFKL QT SV YYL+FT+L+NRVYGLSND Sbjct: 119 RTHPFMSWIELTRALELGFGPSIYDCPRATLFKLNQTGSVADYYLQFTSLANRVYGLSND 178 Query: 183 AVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAKVYEEKH--NAHTASLNKTQTSNTRAH 10 A+VDCF+SGLN++IRRDV++HTPP++VKAV+LAKVYEEK+ N T N T + + Sbjct: 179 ALVDCFVSGLNNEIRRDVLIHTPPSLVKAVSLAKVYEEKYASNLKTQKFNNTNYATNKPF 238 Query: 9 FN 4 N Sbjct: 239 TN 240 >gb|PNX94483.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1287 Score = 278 bits (710), Expect = 7e-81 Identities = 138/238 (57%), Positives = 181/238 (76%) Frame = -3 Query: 735 ADLKKNAEAIELAATTNDARFSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPS 556 +++K+N E +L R ++ EA K +I ALD L+++ + H G + Sbjct: 11 SEIKRNTEETKLLQAEIFERLARSEAENGA---KFDKIFAALDILIDQTPSKHHHGAGLN 67 Query: 555 STQSGFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWF 376 + + FQV +KL+FPRF+GKNV +WIF+AEQFF YY+T D DR+TIASV+LD+DVVPW+ Sbjct: 68 NNRPPFQVRNVKLEFPRFDGKNVHEWIFRAEQFFEYYDTPDPDRLTIASVHLDKDVVPWY 127 Query: 375 QMVQRTTPFQSWQELTRALELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYG 196 QM+QRT PF SW ELTRALELDFGPS+Y+CPR LFKL QT SV YYL+FT+L+NRVYG Sbjct: 128 QMMQRTHPFMSWIELTRALELDFGPSIYECPRATLFKLNQTGSVADYYLQFTSLANRVYG 187 Query: 195 LSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAKVYEEKHNAHTASLNKTQTSN 22 LSNDA+VDCF+SGL+++IRRDV++HTPP++VKAV+LAKVYEEK+ AS K+Q SN Sbjct: 188 LSNDALVDCFVSGLSNEIRRDVLIHTPPSLVKAVSLAKVYEEKY----ASNLKSQKSN 241 >gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1535 Score = 276 bits (707), Expect = 3e-80 Identities = 141/256 (55%), Positives = 187/256 (73%) Frame = -3 Query: 777 ITTMADNTRMKEIYADLKKNAEAIELAATTNDARFSKIEANQAIADEKLTRITEALDALL 598 + + DNT EI K E ++L R ++E N D K ++ ALD L+ Sbjct: 1 MANLNDNTTSAEI-----KRNEEMQLFQAELLERMERLETN---TDSKFDKVYAALDVLI 52 Query: 597 NRDTRQVHFHGGPSSTQSGFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYNTSDADRIT 418 + ++Q HG SS++ FQV +KL+FPRF+G NV +WIF+AEQFF YY+T D DR+T Sbjct: 53 TQ-SQQRPPHGAGSSSRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDTPDLDRLT 111 Query: 417 IASVNLDEDVVPWFQMVQRTTPFQSWQELTRALELDFGPSLYDCPRGALFKLKQTTSVNA 238 IASV+LD+DVVPW+QMVQRT PF SW E TRALELDFGPS+YDCPR LFKLKQT +V Sbjct: 112 IASVHLDKDVVPWYQMVQRTHPFTSWIEFTRALELDFGPSVYDCPRATLFKLKQTGTVAE 171 Query: 237 YYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAKVYEEKHNA 58 YYL+FT+L+NRVYGLSNDA++DCF+SGLND+IRRDV++HTP ++VKA++LAKVYEEK++ Sbjct: 172 YYLQFTSLANRVYGLSNDALIDCFVSGLNDEIRRDVLIHTPISLVKAMSLAKVYEEKYSY 231 Query: 57 HTASLNKTQTSNTRAH 10 + NK Q + + ++ Sbjct: 232 N----NKNQKNYSNSY 243 >dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum] Length = 1512 Score = 274 bits (701), Expect = 2e-79 Identities = 134/218 (61%), Positives = 171/218 (78%) Frame = -3 Query: 675 FSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPSSTQSGFQVSKIKLDFPRFNG 496 F ++E ++A EK +I ALD L+++ T HG + ++ FQV +KL+FPRF+G Sbjct: 28 FERLERSEAANAEKFAKIFTALDILIDQ-TPSKQNHGIGLNNRTPFQVRNVKLEFPRFDG 86 Query: 495 KNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALE 316 NV +WIF+AEQFF YY+T D DR+TIASV+LD+DVVPW+QMVQRTTPFQSW + TRALE Sbjct: 87 NNVHEWIFRAEQFFDYYDTPDLDRLTIASVHLDKDVVPWYQMVQRTTPFQSWMDFTRALE 146 Query: 315 LDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRR 136 LDFGPS+Y+CPR LFKL QT +V YYL+FT+L+NRVYGLSNDA++DCFISGL+ DIRR Sbjct: 147 LDFGPSIYECPRATLFKLNQTGTVAEYYLQFTSLANRVYGLSNDALIDCFISGLSADIRR 206 Query: 135 DVMLHTPPNIVKAVALAKVYEEKHNAHTASLNKTQTSN 22 DV++HTP +IVKAV+LAKVYEEK+ T KT +N Sbjct: 207 DVLIHTPNSIVKAVSLAKVYEEKYTT-TLKPQKTYQNN 243 >gb|PNY16671.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1284 Score = 273 bits (697), Expect = 4e-79 Identities = 139/242 (57%), Positives = 183/242 (75%), Gaps = 2/242 (0%) Frame = -3 Query: 723 KNAEAIELAATTNDARFSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPSSTQS 544 KN I++ R ++EA+ A +KL EA+D L+++ + + + G +S + Sbjct: 4 KNDNEIQIFQAEILERMERLEASSAARIDKLY---EAVDLLISQSSPKQPYGAG-TSNKP 59 Query: 543 GFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQ 364 FQV +KL+FPRF+GKNV +WIF+AEQFF YY+T D DR+TIASV+LD+DVVPW+QMVQ Sbjct: 60 PFQVRNVKLEFPRFDGKNVHEWIFRAEQFFEYYDTPDLDRLTIASVHLDKDVVPWYQMVQ 119 Query: 363 RTTPFQSWQELTRALELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSND 184 RT PFQSW E TRALELDFGPS+Y+CPR LFKL Q+ +V YYL+FT L+NRVYGLS+D Sbjct: 120 RTHPFQSWIEFTRALELDFGPSVYECPRATLFKLNQSGTVAEYYLKFTTLANRVYGLSSD 179 Query: 183 AVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAKVYEEKHNAHTASLNKTQTSN--TRAH 10 A++DCFISGLN+DIRRDVM+HTPP++VKA +LAKVYEEK+ ++T + K T+N T Sbjct: 180 ALIDCFISGLNNDIRRDVMIHTPPSLVKAFSLAKVYEEKYTSNT-NQKKFNTTNYATNKP 238 Query: 9 FN 4 FN Sbjct: 239 FN 240 >dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subterraneum] Length = 1479 Score = 267 bits (683), Expect = 5e-77 Identities = 132/218 (60%), Positives = 173/218 (79%) Frame = -3 Query: 675 FSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPSSTQSGFQVSKIKLDFPRFNG 496 F ++E + K +I ALD L+++ T H HG S ++ FQV +KL+FPRF+G Sbjct: 28 FERMERLELANASKFDKIFSALDVLIDQ-TPSKHRHGIGLS-KAPFQVRNVKLEFPRFDG 85 Query: 495 KNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALE 316 NV +WIF+AEQFF YY+T D DR+TI+SV+LD+DVVPW+QM+QRT PF SW ELTRALE Sbjct: 86 SNVHEWIFRAEQFFDYYDTPDHDRLTISSVHLDKDVVPWYQMMQRTHPFTSWIELTRALE 145 Query: 315 LDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRR 136 LDFGPS+YDCPR LFKLKQ+ SV+ YY++FT+L+NRVYGLSNDA++DCF+SGL+D+IRR Sbjct: 146 LDFGPSIYDCPRATLFKLKQSGSVSEYYMKFTSLANRVYGLSNDALIDCFVSGLSDEIRR 205 Query: 135 DVMLHTPPNIVKAVALAKVYEEKHNAHTASLNKTQTSN 22 DV++HTP ++VKAV+LAKVYEEK+ A +K+QT N Sbjct: 206 DVLIHTPSSLVKAVSLAKVYEEKY----AMNSKSQTRN 239 >dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum] Length = 1500 Score = 267 bits (683), Expect = 5e-77 Identities = 129/208 (62%), Positives = 168/208 (80%), Gaps = 1/208 (0%) Frame = -3 Query: 678 RFSKIEANQAIADEKLTRITEALDALLNRDT-RQVHFHGGPSSTQSGFQVSKIKLDFPRF 502 R ++E N + K +I ALD L+++ + +Q H HG SS++ FQV +KL+FPRF Sbjct: 30 RMERLEMNN---ESKFDKIHTALDLLISQSSPKQTHGHG--SSSRPPFQVRNVKLEFPRF 84 Query: 501 NGKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRA 322 +G NV +WIF+AEQFF YY+T D DR+TI+SV+LD+DVVPW+QM+QRT PF SW E TRA Sbjct: 85 DGTNVHEWIFRAEQFFEYYDTPDLDRLTISSVHLDKDVVPWYQMLQRTHPFTSWIEFTRA 144 Query: 321 LELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDI 142 LELDFGPS+YDCPR LFKL QT +V YYL+FT+L+NRVYGLSNDA++DCF+SGL DDI Sbjct: 145 LELDFGPSVYDCPRATLFKLAQTGTVAEYYLQFTSLANRVYGLSNDALIDCFVSGLKDDI 204 Query: 141 RRDVMLHTPPNIVKAVALAKVYEEKHNA 58 RRDV+LHTP ++VKA++LAKVYEEK+++ Sbjct: 205 RRDVVLHTPISLVKAMSLAKVYEEKYSS 232 >gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] Length = 1210 Score = 265 bits (677), Expect = 2e-76 Identities = 131/255 (51%), Positives = 181/255 (70%), Gaps = 12/255 (4%) Frame = -3 Query: 768 MADNTRMKEIYADLKKNAEAIEL-----------AATTNDARFSKIEANQAIADEKLTRI 622 MAD+TR+K++ AD+K+ ++ +E + N +RF ++E D+K +I Sbjct: 1 MADHTRLKDLQADVKQTSDKLEQYYSDLQAQIAKLESVNSSRFERLENVIQANDDKFNQI 60 Query: 621 TEALDALLNRDTR-QVHFHGGPSSTQSGFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYY 445 + AL+ LL ++ Q FHG +S + FQV +KLDFPRF+G +VLDWIFKAEQFF YY Sbjct: 61 SVALETLLQHNSSSQGSFHGSSNSFKPPFQVRNVKLDFPRFDGNHVLDWIFKAEQFFDYY 120 Query: 444 NTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALELDFGPSLYDCPRGALFK 265 TS+ DR++IASV+LD DVVPWFQM+QR++PF SW T ALEL FGP+ Y+CPR +LFK Sbjct: 121 ATSEVDRLSIASVHLDNDVVPWFQMMQRSSPFHSWHAFTEALELAFGPTAYECPRASLFK 180 Query: 264 LKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKAVALA 85 L QT SV YY F AL+NRV G+ N+A++DCF+SGL D++RRDV+ +PP++VKAVALA Sbjct: 181 LNQTDSVAEYYKAFFALANRVSGIDNEALLDCFLSGLKDELRRDVVALSPPSLVKAVALA 240 Query: 84 KVYEEKHNAHTASLN 40 K++E K+ +A N Sbjct: 241 KLFEAKYTPSSAPRN 255 >gb|PNX61186.1| hypothetical protein L195_g052324, partial [Trifolium pratense] Length = 192 Score = 241 bits (616), Expect = 4e-76 Identities = 116/196 (59%), Positives = 152/196 (77%) Frame = -3 Query: 669 KIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPSSTQSGFQVSKIKLDFPRFNGKN 490 ++EA A K ++ ALD L+ + + ++ ++ FQV +KL+FPRF+G N Sbjct: 3 RLEAGNA---SKFDKVYAALDVLIEQTPSK---QNQGANNRAPFQVRNVKLEFPRFDGTN 56 Query: 489 VLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQELTRALELD 310 V +WIF+AEQFF YY+T D DR+TI+SV+LD+DVVPW+QMVQR+ PF SW E TRALELD Sbjct: 57 VHEWIFRAEQFFDYYDTPDIDRLTISSVHLDKDVVPWYQMVQRSHPFTSWIEFTRALELD 116 Query: 309 FGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGLNDDIRRDV 130 FGPS+YDCPR LFKL QT +V YYL+FT+L+NRVYGLSNDA++DCF+SGL +IRRDV Sbjct: 117 FGPSIYDCPRATLFKLTQTGTVAEYYLQFTSLANRVYGLSNDALIDCFVSGLTTEIRRDV 176 Query: 129 MLHTPPNIVKAVALAK 82 ++HTP +IVKAV+LAK Sbjct: 177 LIHTPTSIVKAVSLAK 192 >dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum] Length = 1531 Score = 262 bits (669), Expect = 4e-75 Identities = 132/232 (56%), Positives = 172/232 (74%), Gaps = 8/232 (3%) Frame = -3 Query: 675 FSKIEANQAIADEKLTRITEALDALLNRDTR------QVHFHGGPSSTQSGFQVSKIKLD 514 F ++E ++A K +I ALD L+++ ++H H P FQV +KL+ Sbjct: 28 FERLERSEAANTGKFEKIFAALDILIDQTPSKHQQGAELHHHRAP------FQVRNVKLE 81 Query: 513 FPRFNGKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWFQMVQRTTPFQSWQE 334 FPRF+G NV +WIF+AEQFF YY+T D DR+TIA+V+LD+DVVPW+QM+QR+ PFQSW + Sbjct: 82 FPRFDGTNVHEWIFRAEQFFEYYDTPDLDRLTIAAVHLDKDVVPWYQMMQRSHPFQSWID 141 Query: 333 LTRALELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYGLSNDAVVDCFISGL 154 TRALELDFGPS+YDCPR LFKL QT +V Y+++FT+L+NRVYGLSNDA+VDCFISGL Sbjct: 142 FTRALELDFGPSIYDCPRATLFKLVQTGTVAEYFVQFTSLANRVYGLSNDALVDCFISGL 201 Query: 153 NDDIRRDVMLHTPPNIVKAVALAKVYEEKHNAHTASLNK--TQTSNTRAHFN 4 N DIRRDV++HTP ++VKAV+LAKVYEEK+ T K TQT +T +N Sbjct: 202 NPDIRRDVLIHTPSSLVKAVSLAKVYEEKYTT-TMKPQKPYTQTYSTNKPYN 252 >dbj|GAU25507.1| hypothetical protein TSUD_279910 [Trifolium subterraneum] Length = 1389 Score = 256 bits (653), Expect = 5e-73 Identities = 131/239 (54%), Positives = 169/239 (70%) Frame = -3 Query: 735 ADLKKNAEAIELAATTNDARFSKIEANQAIADEKLTRITEALDALLNRDTRQVHFHGGPS 556 A ++ E+ T + + + E + K +I EALD LL + T HG Sbjct: 2 AQTNASSSQTEIKRNTEEMQLFQQEILELGNASKFDKIHEALDILL-KQTPPKQTHGAGL 60 Query: 555 STQSGFQVSKIKLDFPRFNGKNVLDWIFKAEQFFTYYNTSDADRITIASVNLDEDVVPWF 376 + FQV +KL+FPRF+G V +WIF+AEQFF YY+T D DR+TI+SV+LD+DVVPW+ Sbjct: 61 HNRPPFQVRNVKLEFPRFDGTKVHEWIFRAEQFFEYYDTPDLDRLTISSVHLDKDVVPWY 120 Query: 375 QMVQRTTPFQSWQELTRALELDFGPSLYDCPRGALFKLKQTTSVNAYYLEFTALSNRVYG 196 QMVQR+ PF SW E TRALELDFGPS+Y+CPR LFKL QT +V YYL+FT+L+NRVYG Sbjct: 121 QMVQRSHPFTSWIEFTRALELDFGPSVYECPRATLFKLAQTGTVAEYYLQFTSLANRVYG 180 Query: 195 LSNDAVVDCFISGLNDDIRRDVMLHTPPNIVKAVALAKVYEEKHNAHTASLNKTQTSNT 19 LS A++DCFISGL+++IRRDVM+HTP ++VKAV+LAKVYEEK+ S NK Q NT Sbjct: 181 LSTYAMIDCFISGLSNEIRRDVMIHTPNSLVKAVSLAKVYEEKY----TSSNKPQRINT 235