BLASTX nr result
ID: Astragalus22_contig00016555
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00016555
         (814 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|PNX77860.1| retrovirus-related Pol polyprotein from transposo...  288   2e-90
dbj|GAU22921.1| hypothetical protein TSUD_326940 [Trifolium subt...  278   2e-82
gb|PNX71411.1| retrovirus-related Pol polyprotein from transposo...  267   6e-82
dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subt...  276   3e-81
gb|PNX76620.1| hypothetical protein L195_g032574 [Trifolium prat...  258   6e-81
gb|KYP51705.1| hypothetical protein KK1_026473 [Cajanus cajan]       254   7e-81
gb|PNX72611.1| peptide transporter PTR2 [Trifolium pratense]         269   7e-81
gb|KYP35344.1| hypothetical protein KK1_043625 [Cajanus cajan]       249   6e-79
dbj|GAU41109.1| hypothetical protein TSUD_139780 [Trifolium subt...  251   1e-78
gb|PNX79728.1| hypothetical protein L195_g035716 [Trifolium prat...  246   9e-78
dbj|GAU43894.1| hypothetical protein TSUD_399420 [Trifolium subt...  263   3e-77
dbj|GAU45259.1| hypothetical protein TSUD_291430 [Trifolium subt...  243   5e-75
gb|PNX59756.1| retrovirus-related Pol polyprotein from transposo...  239   7e-75
gb|PNY05212.1| flavonol sulfotransferase-like protein [Trifolium...  241   4e-74
dbj|GAU49830.1| hypothetical protein TSUD_293850 [Trifolium subt...  241   5e-74
gb|PNY13856.1| hypothetical protein L195_g010524 [Trifolium prat...  241   1e-73
gb|PNX80244.1| hypothetical protein L195_g036241 [Trifolium prat...  238   2e-73
gb|PNX93130.1| retrovirus-related Pol polyprotein from transposo...  238   2e-73
gb|PNX62201.1| retrovirus-related Pol polyprotein from transposo...  234   3e-73
dbj|GAU47169.1| hypothetical protein TSUD_28920 [Trifolium subte...  252   3e-73

>gb|PNX77860.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]
          Length = 581

 Score = 288 bits (737), Expect = 2e-90
 Identities = 140/259 (54%), Positives = 191/259 (73%), Gaps = 8/259 (3%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV +SI QSIV+LENAIDVWNELKER S+GD +RISEL+ EIY+ KQG SV+E
Sbjct: 95  HSWIMNSVEDSIAQSIVYLENAIDVWNELKERFSRGDFIRISELQVEIYSLKQGSRSVSE 154

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ LKVLWEELEAY P P C CPRKCVC TGI ++ ++L + IRFLTGLND + +++
Sbjct: 155 FFTALKVLWEELEAYLPVPVCNCPRKCVCVTGIGNARSQHDLLRAIRFLTGLNDTYDLVR 214

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQF------PQSDESKILAGNVDARNGRTKPRGGYTS 524
           SQIL+M+PLP +NKIFS+VIQ+ERQF ++SK+L DAR G+ + +G Y +
Sbjct: 215 SQILLMDPLPAINKIFSMVIQYERQFAPVNIGSDLEDSKVLVNASDARRGQGRGKGSYGN 274

Query: 525 GYNSRNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEE-EKTV 701
           GY S+ +VC++CGK H VD CYKKH PP + RN++ + + AP++ E+ T
Sbjct: 275 GYGSK--KRVCTYCGKDNHIVDNCYKKHGFPPGFGRNNATNSVNTEDSAPANNEDVGNTK 332

Query: 702 QMQT-GITQEKYDKLINML 755
           +++ G+T+ +Y+KL+N+L
Sbjct: 333 DIESFGLTKAQYEKLVNLL 351

>dbj|GAU22921.1| hypothetical protein TSUD_326940 [Trifolium subterraneum]
          Length = 1122

 Score = 278 bits (710), Expect = 2e-82
 Identities = 142/269 (52%), Positives = 189/269 (70%), Gaps = 18/269 (6%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV ESI +SIV+L+NAIDVWNELKER S+GD +RISEL+ EIY+ KQG +V+E
Sbjct: 96  HSWIMNSVEESIAKSIVYLDNAIDVWNELKERFSRGDFIRISELQVEIYSLKQGSRTVSE 155

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ LK+LWEELEAY P P C CP KC+C TG+ ++ + L VIRFLTGLND F +++
Sbjct: 156 FFTALKILWEELEAYLPVPVCNCPHKCMCATGVGNARHQHSLLHVIRFLTGLNDTFDLVR 215

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQFPQ------SDESKILAGNVDARNGRTKPRG---- 512
           SQIL+M+PLP +NKIFS+VIQHERQF DESK + D+R + RG
Sbjct: 216 SQILLMDPLPSINKIFSMVIQHERQFVAINGDLLVDESKAIVNASDSRRSYGRGRGYSSS 275

Query: 513 -GYTSGYNSRNGSK--VCSFCGKTGHTVDTCYKKHEVPPHWQRNS--SNAAS---ADTNE 668
           G SG+++ +GSK +C+FCGK H VD CY+K+ PPH+ RN+ SN A+ N+
Sbjct: 276 HGRGSGFSTNSGSKKRICTFCGKDNHIVDNCYRKYGFPPHYGRNAEVSNVDCEDIAENND 335

Query: 669 APSDKEEEKTVQMQTGITQEKYDKLINML 755
           A S K EK + G+T+ +Y++L+N+L
Sbjct: 336 AHSLKSTEKGTE-SFGLTKAQYERLVNLL 363

>gb|PNX71411.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]
          Length = 629

 Score = 267 bits (683), Expect = 6e-82
 Identities = 133/258 (51%), Positives = 178/258 (68%), Gaps = 8/258 (3%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           SWIMNSV ESI QSIV+L+NAIDVWNELKER S+GD +RISEL+ EI KQ SV+E+
Sbjct: 40  SWIMNSVEESIAQSIVYLDNAIDVWNELKERFSRGDFIRISELQVEINGLKQDSRSVSEF 99

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           F+ LKVLWEELEAY P P C CPRKCVC TG+ ++ ++L + IRFLTGLND F +++S
Sbjct: 100 FTALKVLWEELEAYLPVPVCNCPRKCVCVTGVGNARSQHDLLRAIRFLTGLNDTFDLVRS 159

Query: 366 QILIMNPLPKLNKIFSLVIQHERQF------PQSDESKILAGNVDARNGRTKPRGGYTSG 527
           QI +M+PLP +NKIFS+VIQ+ERQF D+SK+L D R + + +G Y +G
Sbjct: 160 QISLMDPLPAINKIFSMVIQYERQFAPVNIGSDLDDSKVLVNASDTRRSQGRGKGSYGNG 219

Query: 528 YNSRNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQ- 704
           Y S+ +VC++CGK H VD YKKH PP + RN+S + AP + E+ +
Sbjct: 220 YGSK--KRVCTYCGKDNHIVDNYYKKHGFPPSYGRNNSTNNVNTEDSAPVNNEDIGNTKD 277

Query: 705 -MQTGITQEKYDKLINML 755
           G+T+ +++KL+N+L
Sbjct: 278 NESFGLTKAQHEKLVNLL 295

>dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subterraneum]
          Length = 1512

 Score = 276 bits (706), Expect = 3e-81
 Identities = 136/264 (51%), Positives = 181/264 (68%), Gaps = 13/264 (4%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV ESI QSIVFL+NA+DVW ELKER S GD +RISEL+ EIY KQG SV+E
Sbjct: 95  HSWIMNSVEESIAQSIVFLDNALDVWIELKERFSHGDFIRISELQVEIYGLKQGNRSVSE 154

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ L++LWEE E Y P P C CPRKCVC TG+S ++ ++L + IRFLTGLNDNF M++
Sbjct: 155 FFTALRILWEEFEIYLPAPVCNCPRKCVCVTGVSNARTQHDLLRTIRFLTGLNDNFDMVR 214

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQFP------QSDESKILAGNVDARNGRTKPRGGYTS 524
           SQIL+M+PLP +NK+FS+VIQHERQF ++SK+ D+R + + R G+ S
Sbjct: 215 SQILLMDPLPPINKVFSMVIQHERQFTPLQAVLDVEDSKVSVNASDSRRSQGRGRSGFNS 274

Query: 525 GYNS------RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKE 686
           YNS N KVC++CGK H V+ CYKKH PPH+ R S+ A +A+ E + +
Sbjct: 275 QYNSGFNPQYNNKKKVCTYCGKENHVVENCYKKHGFPPHYGRGST-ANNANAGELMDNDD 333

Query: 687 EEKTVQMQT-GITQEKYDKLINML 755
           T + T+ +Y++L+N+L
Sbjct: 334 ARSTRGSDSFSFTKAQYEQLVNLL 357

>gb|PNX76620.1| hypothetical protein L195_g032574 [Trifolium pratense]
          Length = 398

 Score = 258 bits (659), Expect = 6e-81
 Identities = 138/262 (52%), Positives = 186/262 (70%), Gaps = 11/262 (4%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV+ESI QSIVF+ENAIDVWN+LKER SQGDLVRI+EL+ EIY+ +Q SVTE
Sbjct: 89  HSWILNSVSESIAQSIVFMENAIDVWNDLKERFSQGDLVRIAELQQEIYSLRQDSRSVTE 148

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +FS LK+LWEELE Y P P C C KC C + ++ +++L VIRFLTGLND+F M+K
Sbjct: 149 FFSALKILWEELELYLPIPTCTCRVKCNC-DAMRRARANHQLMYVIRFLTGLNDHFDMVK 207

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+++PLP LNKIFS+VIQHERQ F S+ SK L ++A N R P G +S NS
Sbjct: 208 SQILLLDPLPSLNKIFSMVIQHERQGNFTPSEHSKAL---INAANFR--PPGSTSSSKNS 262

Query: 537 RN----GSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNS-SNAASADTNEAPSD----KEE 689
           R+ G +VC+FCGK H +D CY+KH +PPH Q+ S ++ A+A+ N+ S+ E
Sbjct: 263 RSNSSTGKRVCTFCGKDNHIIDNCYQKHGLPPHLQKKSQAHNAAAEGNDCDSNSIAASEP 322

Query: 690 EKTVQMQTGITQEKYDKLINML 755
           + +TQ+++++LI ++
Sbjct: 323 QAASSSSAPMTQDQWERLIALI 344

>gb|KYP51705.1| hypothetical protein KK1_026473 [Cajanus cajan]
          Length = 278

 Score = 254 bits (648), Expect = 7e-81
 Identities = 126/232 (54%), Positives = 171/232 (73%), Gaps = 3/232 (1%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV +SIGQSI+FLEN +DVWN+LKER SQGDL+RISEL+ EIY KQG L VTE
Sbjct: 52  HSWIVNSVVKSIGQSIIFLENVVDVWNDLKERFSQGDLIRISELQQEIYGIKQGSLFVTE 111

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           ++SELK+LWEELE Y P P C CP KC C + ++Q + L+ IRFLTGLN+NF+++K
Sbjct: 112 FYSELKILWEELETYMPIPCCACPVKCTC-VAMRNARQFHTLNHFIRFLTGLNENFSVVK 170

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+M+ +P +N+IF +VIQHERQ F +DESK L +D + R++ RG G+
Sbjct: 171 SQILLMDLVPSMNQIFYMVIQHERQGNFIVNDESKALINAIDYK--RSQGRG---KGFAQ 225

Query: 537 RNG-SKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEE 689
           +G K+C++ GKTGHT++TCY+KH PPH+Q+ +S+ + +E KE+
Sbjct: 226 NSGPKKICTYYGKTGHTIETCYRKHGFPPHFQKGNSSMVNNACSETTDLKED 277

>gb|PNX72611.1| peptide transporter PTR2 [Trifolium pratense]
          Length = 845

 Score = 269 bits (688), Expect = 7e-81
 Identities = 132/259 (50%), Positives = 186/259 (71%), Gaps = 8/259 (3%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMN V ESI QSI++LENAIDVWNELKER S GD +RISEL+ EI+A KQG SV+E
Sbjct: 4   HSWIMNFVEESIAQSIIYLENAIDVWNELKERFSHGDFIRISELQIEIHALKQGNRSVSE 63

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ LK+LWEELEAY P P C CPRKCVC TGIS + ++L + IRFLTGLNDNF M++
Sbjct: 64  FFTALKILWEELEAYLPTPVCNCPRKCVCATGISNVKTQHDLLRKIRFLTGLNDNFDMVR 123

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQF-PQS-----DESKILAGNVDARNGRTKPRGGYTS 524
           SQIL+M+PLP +NK+FS ++QHERQF P + ++SK+L D R + + +GG+ +
Sbjct: 124 SQILLMDPLPPINKVFSSILQHERQFVPHNAGLDVEDSKVLVNASDNRRSQGRGKGGF-N 182

Query: 525 GYNSRNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQ 704
           G + K C++CGK H ++ C+KKH PP++ RN+++A T+++ D ++ K+++
Sbjct: 183 GQSGPFKKKYCTYCGKDNHVIENCFKKHGFPPNFGRNNASANHFGTDDS-MDNDDIKSLK 241

Query: 705 MQT--GITQEKYDKLINML 755
           T+ +Y+ L+N+L
Sbjct: 242 ASEPFTFTKSQYEHLVNLL 260

>gb|KYP35344.1| hypothetical protein KK1_043625 [Cajanus cajan]
          Length = 287

 Score = 249 bits (636), Expect = 6e-79
 Identities = 125/201 (62%), Positives = 157/201 (78%), Gaps = 3/201 (1%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV ESIGQSI+FLENA+DVWN+LKER SQGDL RISEL+ EIY KQG LSVTE
Sbjct: 93  HSWIVNSVVESIGQSIIFLENAVDVWNDLKERFSQGDLTRISELQQEIYGLKQGSLSVTE 152

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           ++SELK+LWEELE Y P P C CP KC C + ++Q + L+ VIRFLTGLN+NF+++K
Sbjct: 153 FYSELKILWEELETYMPIPSCACPVKCTC-AAMRNARQFHTLNHVIRFLTGLNENFSVVK 211

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+M+PLP +N+IFS+VIQHERQ F +DESK L VD + R++ RG G+
Sbjct: 212 SQILLMDPLPSMNRIFSMVIQHERQGNFIFNDESKALINAVDYK--RSQGRG---KGFAQ 266

Query: 537 RNG-SKVCSFCGKTGHTVDTC 596
           +G K+C++CGKTGHTV+TC
Sbjct: 267 NSGPKKICTYCGKTGHTVETC 287

>dbj|GAU41109.1| hypothetical protein TSUD_139780 [Trifolium subterraneum]
          Length = 356

 Score = 251 bits (640), Expect = 1e-78
 Identities = 131/256 (51%), Positives = 175/256 (68%), Gaps = 5/256 (1%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV+ESI QSIVF+E+A+D WN+LK+R SQGDLVRISEL EIYA KQ VTE
Sbjct: 89  HSWILNSVSESIAQSIVFIEHAVDAWNDLKDRFSQGDLVRISELMQEIYAFKQDSKFVTE 148

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +FSE KVLWEELE Y P P C C +C C + + H L IRFLTGLN+NF M+K
Sbjct: 149 FFSEFKVLWEELEIYMPIPNCVCRSRCSCDSMLKARSNH-ALLHAIRFLTGLNENFGMVK 207

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+++PLP ++KIFS+V+Q ERQ F DESK+L VD++ G++ S
Sbjct: 208 SQILLLDPLPPMSKIFSMVLQFERQSGFGLHDESKVLVNVVDSKKPSYFASKGHSQPSTS 267

Query: 537 RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQ---RNSSNAASADTNEAPSDKEEEKTVQM 707
           + G++ C++C KT HTV+ C+KKH PPH Q R +S+ A +D S++ E +
Sbjct: 268 K-GNRFCTYCHKTNHTVNECFKKHGFPPHMQKSNRTNSSQAGSDNVHNASERGESSSANS 326

Query: 708 QTGITQEKYDKLINML 755
           Q+ ITQ++Y++L+ ML
Sbjct: 327 QS-ITQDQYEQLMTML 341

>gb|PNX79728.1| hypothetical protein L195_g035716 [Trifolium pratense]
          Length = 272

 Score = 246 bits (627), Expect = 9e-78
 Identities = 126/255 (49%), Positives = 173/255 (67%), Gaps = 8/255 (3%)
 Frame = +3

Query: 15  MNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEYFSE 194
           MNSV ES+ QSIVFL+NA+DVW ELKER S D +RISEL+ EIY+ KQG SV E+F+
Sbjct: 1   MNSVEESVAQSIVFLDNALDVWTELKERFSYCDFIRISELQVEIYSLKQGNPSVYEFFTA 60

Query: 195 LKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKSQIL 374
           LKVLW+ELEAY P P C CPRKC+C TG+ ++ ++L + IRFLTGLNDNF ++SQ+L
Sbjct: 61  LKVLWKELEAYLPAPVCNCPRKCMCVTGVRKARIQHDLLETIRFLTGLNDNFDTVRSQVL 120

Query: 375 IMNPLPKLNKIFSLVIQHERQFPQS------DESKILAGNVDARNGRTKPRGGYTSGYNS 536
           +M PLP +NK+FS+VIQ+ERQF + ++SK+ D+R +P G S +N
Sbjct: 121 LMGPLPPINKVFSMVIQYERQFVATHAGLDIEDSKVSINASDSR----RPLGCGRSSFNP 176

Query: 537 R-NGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRN-SSNAASADTNEAPSDKEEEKTVQMQ 710
           + N K C++CGK H V+ CYKKH PP++ RN ++N +A+ + D K
Sbjct: 177 QFNKKKYCTYCGKDNHVVENCYKKHGFPPNFGRNINANNVNAEDSMDNDDARSTKGTDSF 236

Query: 711 TGITQEKYDKLINML 755
           T T+ +Y+KL+N+L
Sbjct: 237 T-FTKSQYEKLVNLL 250

>dbj|GAU43894.1| hypothetical protein TSUD_399420 [Trifolium subterraneum]
          Length = 1098

 Score = 263 bits (672), Expect = 3e-77
 Identities = 128/268 (47%), Positives = 188/268 (70%), Gaps = 17/268 (6%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HS IMNSV ESI QSI FL+N +DVWNELKER SQGD +RISEL+CEI+ KQ SV+E
Sbjct: 119 HSGIMNSVDESIAQSIAFLDNVVDVWNELKERFSQGDYIRISELQCEIFGMKQESRSVSE 178

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ LK+LWEEL++Y P P C C +C+C TG+S ++ +++ + IRFLTGLN+NF ++
Sbjct: 179 FFTALKILWEELDSYLPAPVCSCLMRCICNTGVSNAKHQHKIMRSIRFLTGLNENFDPVR 238

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQFPQS--DESKILAGNVDAR------NGRTKPRGGY 518
           +QIL+MNPLP +N+IFS+V+QHERQ+ + D+SK+L + DAR +G + +G
Sbjct: 239 AQILLMNPLPTINRIFSMVLQHERQYNSTHFDDSKVLVNSHDARKPKGRCHGSSSSQGNR 298

Query: 519 TSGYNSRN---GSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNS-SNAASADTNEAPSDKE 686
           ++ Y + N +K CS+CGKT H V+ CY+KH PPH+ RNS +N AS + + + +
Sbjct: 299 SNSYGANNYGAKNKECSYCGKTNHIVENCYRKHGFPPHYGRNSHANNASLEHVDERENMD 358

Query: 687 EEKTVQ-----MQTGITQEKYDKLINML 755
           + K+V+ G T+E+Y++L+ ++
Sbjct: 359 DNKSVRGNNNNTDFGFTKEQYNQLMTLI 386

>dbj|GAU45259.1| hypothetical protein TSUD_291430 [Trifolium subterraneum]
          Length = 387

 Score = 243 bits (619), Expect = 5e-75
 Identities = 134/261 (51%), Positives = 173/261 (66%), Gaps = 10/261 (3%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HS I+NSV+ESI QSIVF+EN IDVWN+LKE+ SQGDLVRI+EL+ EIY+ +Q SVTE
Sbjct: 89  HSLILNSVSESIAQSIVFMENVIDVWNDLKEQFSQGDLVRIAELQQEIYSLRQESRSVTE 148

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +FS LK+LWEELE Y P P C C KC C S H L VIRFLTGLN++F ++K
Sbjct: 149 FFSALKILWEELELYLPIPMCTCRVKCNCEAMRSARNNH-NLMYVIRFLTGLNEHFDVVK 207

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYT--SGY 530
           SQIL+M+PLP LNKIFS+VIQHERQ F S++S+ L ++A N +K G S Y
Sbjct: 208 SQILLMDPLPTLNKIFSMVIQHERQGNFTPSEDSQAL---INAANSNSKGYGSKNPKSSY 264

Query: 531 NSRNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQR----NSSNAA--SADTNEAPSDKEEE 692
           S + +VC+FCGK H VD CYKKH +PPH Q+ + NAA N P
Sbjct: 265 ASSSVKRVCTFCGKDNHIVDNCYKKHGLPPHLQKRVQSQAHNAAIDGGKCNTDPIPASNS 324

Query: 693 KTVQMQTGITQEKYDKLINML 755
           ++ T +TQ ++++LI ++
Sbjct: 325 QSASGSTPMTQAQWERLIALV 345

>gb|PNX59756.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]
          Length = 309

 Score = 239 bits (611), Expect = 7e-75
 Identities = 123/252 (48%), Positives = 167/252 (66%), Gaps = 2/252 (0%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           SWI+NS++ SI QS+VF+ENAID+WN+L+ER SQGDL+RISEL+ EIY+ KQ SVT++
Sbjct: 41  SWILNSISPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQQEIYSLKQDNRSVTDF 100

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           FSELK LWEELE Y P P C C ++C C S + H L +RFLTGLN+NF+ ++S
Sbjct: 101 FSELKTLWEELELYLPIPSCTCRQRCACEAMRSARKNHL-LLHTVRFLTGLNENFSTVRS 159

Query: 366 QILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNSR 539
           QILIM PLP +NK+FSLVIQHERQ F + D+SKIL + KP +S+
Sbjct: 160 QILIMEPLPPINKVFSLVIQHERQGNFAEVDDSKILVNAAKS----AKPSS------SSK 209

Query: 540 NGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQMQTGI 719
           + ++ CS+CGK H V+ C+KK+ VPPH ++ S SA A + +
Sbjct: 210 SSTRNCSYCGKDNHVVENCFKKNGVPPHMKKFS----SAHNVAAEGGSVDSNVASTPPSL 265

Query: 720 TQEKYDKLINML 755
           +Q++YDKL+ +L
Sbjct: 266 SQDQYDKLMTLL 277

>gb|PNY05212.1| flavonol sulfotransferase-like protein [Trifolium pratense]
          Length = 417

 Score = 241 bits (615), Expect = 4e-74
 Identities = 131/253 (51%), Positives = 167/253 (66%), Gaps = 2/253 (0%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV+ESI QSI+F+ENAIDVWN+LK R SQGDLVRISEL+ EIY+ +Q SVTE
Sbjct: 161 HSWILNSVSESIAQSIMFMENAIDVWNDLKGRFSQGDLVRISELQQEIYSLRQESRSVTE 220

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +FS LKVLWEE E Y P P C C KC C S H L VIRFLTGLND+F ++K
Sbjct: 221 FFSALKVLWEEFEIYLPIPMCTCRVKCSCEAMRSAHNNH-NLMYVIRFLTGLNDHFDVVK 279

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQILIM+PLP L KIFS++IQHERQ F S++SK L ++A N +T + S Y S
Sbjct: 280 SQILIMDPLPPLYKIFSMLIQHERQGNFAPSEDSKAL---INAANSKTSGSKNFKSSYGS 336

Query: 537 RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQMQTG 716
           + +VC+FCGK H +D CYKKH +W R+ + A E KT
Sbjct: 337 SSVKRVCTFCGKDNHIIDNCYKKHGYSCNWGRDCDGDSVA--------ASEPKTAG-SAP 387

Query: 717 ITQEKYDKLINML 755
           +TQ+++++LI ++
Sbjct: 388 MTQDQWERLIALI 400

>dbj|GAU49830.1| hypothetical protein TSUD_293850 [Trifolium subterraneum]
          Length = 410

 Score = 241 bits (614), Expect = 5e-74
 Identities = 125/253 (49%), Positives = 170/253 (67%), Gaps = 2/253 (0%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV+ESI QSIVF+ENAIDVWN+LKER SQ DL+RI+EL+ E++A +Q SVTE
Sbjct: 86  HSWIMNSVSESIAQSIVFMENAIDVWNDLKERFSQADLIRIAELQQELHALQQDSRSVTE 145

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           ++S+LK++WEELE Y P P C C +C C S H L +IRFLTGLN++FA++K
Sbjct: 146 FYSDLKLIWEELEIYLPMPNCSCRNRCTCEAMRSARANH-ALLYIIRFLTGLNEHFAVVK 204

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+M+PLP +NK+FSLV+QH+RQ F S++SK L K +G + S
Sbjct: 205 SQILLMDPLPPMNKVFSLVLQHQRQSNFSPSEDSKALL-------NAAKSKGSFP----S 253

Query: 537 RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQMQTG 716
           +N ++C+FCGK H V C+KK+ +PPH+++NS A+ E EE+
Sbjct: 254 KNPVRICTFCGKDNHIVANCFKKYGLPPHFRKNS----QANNAEIEGGNEEQIAADNSNI 309

Query: 717 ITQEKYDKLINML 755
           ITQE+ +LI +L
Sbjct: 310 ITQEQALQLITLL 322

>gb|PNY13856.1| hypothetical protein L195_g010524 [Trifolium pratense]
          Length = 448

 Score = 241 bits (614), Expect = 1e-73
 Identities = 125/256 (48%), Positives = 174/256 (67%), Gaps = 5/256 (1%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV+ESI QSIVF+ENAIDVWN+LKER SQ DL+RI+EL+ E++A KQ +V E
Sbjct: 86  HSWIMNSVSESIAQSIVFIENAIDVWNDLKERFSQADLIRIAELQQELHALKQDSHTVNE 145

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           ++S+LK++WEELE Y P P C C C C S H L VI FLTGLN++F+++K
Sbjct: 146 FYSDLKLIWEELEIYLPMPNCSCRNCCTCEAMRSARANH-TLLYVICFLTGLNEHFSVVK 204

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+M+PLP + K+ SLV+QHERQ F SD+S++L +R +S
Sbjct: 205 SQILLMDPLPPMTKVVSLVLQHERQSHFSTSDDSRVLLNAAKSRGSS-----------SS 253

Query: 537 RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSS-NAASADTNEAPSDKEEEKTVQMQT 713
           R+G++VC+FCGK H VD C+KKH +PPH+++NS N A+ + + E ++Q+
Sbjct: 254 RSGNRVCTFCGKDNHIVDNCFKKHGLPPHFRKNSQVNNAAIEGGIEDHNASEVTNAELQS 313

Query: 714 G--ITQEKYDKLINML 755
           G ITQ++ +LI++L
Sbjct: 314 GPPITQDQALQLISLL 329

>gb|PNX80244.1| hypothetical protein L195_g036241 [Trifolium pratense]
          Length = 362

 Score = 238 bits (606), Expect = 2e-73
 Identities = 126/260 (48%), Positives = 173/260 (66%), Gaps = 10/260 (3%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           +WI++SV+ SI QS+VF+ENAID+WN+L+ER SQGDL+RISEL+ E YA KQ SVT++
Sbjct: 86  AWILSSVSPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQQEAYALKQDSKSVTDF 145

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           +++LKV+WEELE Y P P C CPR+C C S + H L IRFLTGLN NF+ +KS
Sbjct: 146 YTDLKVIWEELELYLPIPSCTCPRRCTCEAMRSARRNH-SLLHTIRFLTGLNANFSTVKS 204

Query: 366 QILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNSR 539
           QILIM+PLP +NK+FSLV+QHERQ +SD+S IL + T GY S
Sbjct: 205 QILIMDPLPPINKVFSLVLQHERQGISHESDDSTILVNAARS----TPSSSGYKQSTQSS 260

Query: 540 NGSK---VCSFCGKTGHTVDTCYKKHEVPPHWQR--NSSNAASAD---TNEAPSDKEEEK 695
           +GSK C++CG H V+ C+KK+ VPPH ++ +++NAAS + N A +
Sbjct: 261 SGSKPPRKCTYCGMNNHFVENCFKKNGVPPHMKKFASANNAASEEGITNNNAATSSTNSP 320

Query: 696 TVQMQTGITQEKYDKLINML 755
           I+Q++YDKL+++L
Sbjct: 321 AA--SPSISQDQYDKLMSLL 338

>gb|PNX93130.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]
          Length = 369

 Score = 238 bits (606), Expect = 2e-73
 Identities = 126/260 (48%), Positives = 173/260 (66%), Gaps = 10/260 (3%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           +WI++SV+ SI QS+VF+ENAID+WN+L+ER SQGDL+RISEL+ E YA KQ SVT++
Sbjct: 86  AWILSSVSPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQQEAYALKQDSKSVTDF 145

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           +++LKV+WEELE Y P P C CPR+C C S + H L IRFLTGLN NF+ +KS
Sbjct: 146 YTDLKVIWEELELYLPIPSCTCPRRCTCEAMRSARRNH-SLLHTIRFLTGLNANFSTVKS 204

Query: 366 QILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNSR 539
           QILIM+PLP +NK+FSLV+QHERQ +SD+S IL + T GY S
Sbjct: 205 QILIMDPLPPINKVFSLVLQHERQGISHESDDSTILVNAARS----TPSSSGYKQSTQSS 260

Query: 540 NGSK---VCSFCGKTGHTVDTCYKKHEVPPHWQR--NSSNAASAD---TNEAPSDKEEEK 695
           +GSK C++CG H V+ C+KK+ VPPH ++ +++NAAS + N A +
Sbjct: 261 SGSKPPRKCTYCGMNNHFVENCFKKNGVPPHMKKFASANNAASEEGITNNNAATSSTNSP 320

Query: 696 TVQMQTGITQEKYDKLINML 755
           I+Q++YDKL+++L
Sbjct: 321 AA--SPSISQDQYDKLMSLL 338

>gb|PNX62201.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]
          Length = 268

 Score = 234 bits (597), Expect = 3e-73
 Identities = 120/236 (50%), Positives = 162/236 (68%), Gaps = 7/236 (2%)
 Frame = +3

Query: 63  NAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEYFSELKVLWEELEAYWPEPK 242
           NAIDVWNELKER S GD +RISEL+ EI+ KQG SV+E+F+ LK LWEELEAY P P
Sbjct: 35  NAIDVWNELKERFSHGDFIRISELQIEIHRLKQGNRSVSEFFTVLKTLWEELEAYLPTPV 94

Query: 243 CGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKSQILIMNPLPKLNKIFSLVI 422
           C CPRKCVC TGI ++ ++L + IRFLTGLND+F M++SQIL+M+PLP +NK+FS+VI
Sbjct: 95  CNCPRKCVCATGIINARSQHDLLRKIRFLTGLNDSFDMVRSQILLMDPLPLMNKVFSMVI 154

Query: 423 QHERQFP------QSDESKILAGNVDARNGRTKPRGGYTSGYNSRNGSKVCSFCGKTGHT 584
           QHERQF +++SKI D+R + + RGG+ G S + K C+FCGK H
Sbjct: 155 QHERQFVPHITGLDTEDSKISINASDSRRSQGRGRGGF-HGQFSSSKKKYCTFCGKDSHV 213

Query: 585 VDTCYKKHEVPPHWQRNSS-NAASADTNEAPSDKEEEKTVQMQTGITQEKYDKLIN 749
           V+ YKKH PP++ RN+S N A+A+ + D + K + T T+ +YD +++
Sbjct: 214 VENFYKKHGFPPNYGRNTSGNNANAEDSLDTDDSKSTKGNEAFT-FTKSRYDNILS 268

>dbj|GAU47169.1| hypothetical protein TSUD_28920 [Trifolium subterraneum]
          Length = 1086

 Score = 252 bits (643), Expect = 3e-73
 Identities = 134/258 (51%), Positives = 176/258 (68%), Gaps = 8/258 (3%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           SWIMN V+ESI QSIVF+ENA+D WN+LK+R SQGDLVRISEL EIYA +Q SVTE+
Sbjct: 98  SWIMNFVSESIAQSIVFMENAMDAWNDLKDRFSQGDLVRISELMQEIYALQQDSKSVTEF 157

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           +S+LK+LWEELE Y P P C C +C C IS H L IRFLTGLNDNFAM+KS
Sbjct: 158 YSDLKILWEELEIYMPIPNCTCRSRCNCEAMISARSNH-TLLYAIRFLTGLNDNFAMVKS 216

Query: 366 QILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNSR 539
           QIL+++PLP + K+FS+V+Q ERQ F S+ESK+L VD++ + P +S +
Sbjct: 217 QILLLDPLPSMTKMFSMVLQFERQRNFGTSEESKVLVNAVDSKK-PSYPNSRGSSQPATS 275

Query: 540 NGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSN----AASADTNEAPSDKEE--EKTV 701
           GSK C++C +T HTV+ C+KKH PPH QRN SN AS ++NEA S + + +
Sbjct: 276 KGSKFCTYCHRTNHTVNDCFKKHGYPPHMQRNHSNRAAYMASGESNEANSAASDHGQSSQ 335

Query: 702 QMQTGITQEKYDKLINML 755
           IT ++Y +L+++L
Sbjct: 336 AATPSITPDQYQQLMSLL 353