BLASTX nr result
ID: Cheilocostus21_contig00051066
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00051066
         (1303 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|KZV44098.1| hypothetical protein F511_10769 [Dorcoceras hygro...   106   1e-20
gb|KZV19421.1| hypothetical protein F511_08762 [Dorcoceras hygro...   101   5e-19
gb|KZV50652.1| peroxidase 64 [Dorcoceras hygrometricum]               100   7e-19
ref|XP_004499522.1| PREDICTED: uncharacterized protein LOC101509...    99   3e-18
gb|PNX89396.1| pentatricopeptide repeat-containing protein, part...    90   6e-16
ref|XP_014622499.1| PREDICTED: uncharacterized protein LOC106795...    91   7e-16
dbj|GAU37691.1| hypothetical protein TSUD_164940 [Trifolium subt...    91   9e-16
dbj|GAU13119.1| hypothetical protein TSUD_174190 [Trifolium subt...    91   1e-15
gb|PNX98841.1| retrotransposon-related protein [Trifolium pratense]    90   2e-15
gb|PNY03339.1| retrotransposon-related protein, partial [Trifoli...    89   5e-15
gb|PNX99328.1| retrotransposon-related protein [Trifolium pratense]    89   6e-15
gb|PNX92330.1| retrotransposon-related protein [Trifolium pratense]    89   7e-15
gb|PNY15768.1| retrotransposon-related protein, partial [Trifoli...    89   7e-15
dbj|GAU42300.1| hypothetical protein TSUD_136860 [Trifolium subt...    88   9e-15
dbj|GAU14123.1| hypothetical protein TSUD_169480 [Trifolium subt...    87   2e-14
gb|PNX92424.1| retrotransposon-related protein [Trifolium pratense]    87   2e-14
gb|PNY13662.1| retrotransposon-related protein [Trifolium pratense]    87   2e-14
ref|XP_014632080.1| PREDICTED: uncharacterized protein LOC106798...    86   3e-14
gb|PNY16937.1| retrotransposon-related protein [Trifolium pratense]    86   4e-14
gb|PNY17068.1| retrotransposon-related protein [Trifolium pratense]    85   4e-14

>gb|KZV44098.1| hypothetical protein F511_10769 [Dorcoceras hygrometricum]
          Length = 1119

 Score = 106 bits (264), Expect = 1e-20
 Identities = 83/304 (27%), Positives = 144/304 (47%), Gaps = 11/304 (3%)
 Frame = -1

Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701
           ++ E+ F+ +D + W++ AE YF I G +++ +A + MEG H F+ + R N
Sbjct: 2   RKFEIPAFNGTDPIAWLSKAEQYFEIHGTPTYHRLRIAHICMEGTAVHWFQWARSRNPNW 61

Query: 700 T*EILKEKMI*H*SEFPAI-AYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*YMGF 524
           E E++I S A +E L +L+Q +V E+I A++ ++S + LP+ +G+
Sbjct: 62  NWERFAEELINRYSGRKATNPFESLASLKQEDRSVEEYIEAFEVLLSQVGDLPELQCLGY 121

Query: 523 ILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEG-------MMXX 365
           L +R R+R +P + +AM LAR IE E+T KR T +G +
Sbjct: 122 FQSGLREELRLRLRTHVPRTINRAMDLARSIEEELT--CSTKRHTSRDGYTSQRWDVRKR 179

Query: 364 XXXXXXXXXXXXSFPRT*HGQIHLQWPF---*TVVFLS*RKNSGANSNIMTQLKRREGGP 194
           R Q++ +W + T +N+ A S +M++ P
Sbjct: 180 PVETVVGRVPAAEQYRPGRAQLNDRWGYKVDPTQPLERYEENTNAGSRLMSR-------P 232

Query: 193 SHISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYHSMHRYENKSFR 14
           +S GN+ S + R R + +H +Y++++EK LC+ C + YH HR NKS +
Sbjct: 233 VATNSTTTGNR----SNNGRFREGKIFSHQEYLSRKEKGLCYRCGEPYHPQHRCANKSLK 288

Query: 13  LMLL 2
           + L
Sbjct: 289 VAFL 292

>gb|KZV19421.1| hypothetical protein F511_08762 [Dorcoceras hygrometricum]
          Length = 1594

 Score = 101 bits (251), Expect = 5e-19
 Identities = 85/304 (27%), Positives = 145/304 (47%), Gaps = 11/304 (3%)
 Frame = -1

Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701
           KR E+ F+ D V W++ AE YF I G +++ +A + MEG H F+ + R N
Sbjct: 102
KRFEIPAFNGVDPVGWLSKAEQYFEIHGTPLYHRLKIAHICMEGTAVHWFQWARSRNKNW 161 Query: 700 T*EILKEKMI*H*SEFPAI-AYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*YMGF 524 + E E+++ S A +E L +L+Q +V E+I ++ +++ + LP+ MG+ Sbjct: 162 SWERFAEELVNRYSGRKATNPFELLASLKQEGRSVDEYIEEFEVLIAQVGDLPELQCMGY 221 Query: 523 ILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVT---------KGVDLKRRTGAEGMM 371 L L +R R+R P + +AM LAR +E E+T +G ++RR G + Sbjct: 222 FLSGLREELRLRLRTHGPRTITRAMDLARSVEEELTWLTGRSVSREGNVIQRREGRQ--- 278 Query: 370 XXXXXXXXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMT-QLKRREGGP 194 + R G+ QW V R +N+N T Q R+ Sbjct: 279 ---RWTDNLLGRAYTVERPKMGR--AQWTERPVD----RNEDSSNTNSKTIQSTFRQINT 329 Query: 193 SHISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYHSMHRYENKSFR 14 + + GN + R R + ++H +Y+N+REK LC+ C +LY+ +H+ NKS + Sbjct: 330 NSTPTVGRGN-------NGRFREGRILSHQEYLNRREKGLCYRCGELYNPLHKCANKSLK 382 Query: 13 LMLL 2 + +L Sbjct: 383 VAVL 386 >gb|KZV50652.1| peroxidase 64 [Dorcoceras hygrometricum] Length = 914 Score = 100 bits (249), Expect = 7e-19 Identities = 79/302 (26%), Positives = 138/302 (45%), Gaps = 9/302 (2%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 ++ E+ F+ +D + W+ AE YF I G +++ +A + MEG H F+ + R N Sbjct: 97 RKFEIPSFNGTDPIAWLGKAEQYFEIHGTPSYHRLRIAHICMEGSAVHWFQWARTRNPNW 156 Query: 700 T*EILKEKMI*H*SEFPAI-AYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*YMGF 524 + E++I S A+ +E L +L+Q +V E+I +++ +++ + LP+ +G+ Sbjct: 157 NWDRFAEELINRYSGRKAVNPFETLASLKQEDRSVEEYIESFEILLAQVGDLPEPQCLGY 216 Query: 523 ILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGA--EGMMXXXXXXX 350 L L +R R+R+ +P + +AM LAR +E E++ TG+ EG Sbjct: 217 FLSGLREELRLRLRSHVPRTITRAMDLARSVEEELSWSPARHSNTGSRWEGRPRSTDTLM 276 Query: 349 XXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRRE----GGPSHIS 182 + + PF S + N L R E P+ IS Sbjct: 277 GRGPIGDRY--------RVGRPF------SNERLGYKTDNFQPPLGRHEANNTSAPTTIS 322 Query: 181 SKEVGNQHSPPS--ISDRTRGSQTMTH*DYINKREKRLCFHCDQLYHSMHRYENKSFRLM 8 N + + R R + +H +Y+N+REK LC+ C + YH +H+ NKS R+ Sbjct: 323 
RPLASNATTSNNKMPGGRFREGKIYSHQEYLNRREKGLCYRCGEAYHPLHKCTNKSLRVA 382 Query: 7 LL 2 L Sbjct: 383 FL 384 >ref|XP_004499522.1| PREDICTED: uncharacterized protein LOC101509306 [Cicer arietinum] Length = 1006 Score = 98.6 bits (244), Expect = 3e-18 Identities = 92/324 (28%), Positives = 142/324 (43%), Gaps = 20/324 (6%) Frame = -1 Query: 913 KDSYRRG*TIYKRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH* 734 +DS R K++E+ FD D V WI AE YF + G E KV +A +SMEG T H Sbjct: 110 EDSVREYKMAVKKVELPSFDGDDHVAWITRAETYFEVQGTLEEVKVRLAKLSMEGATIHW 169 Query: 733 FR*MKKRILNLT*EILKEKMI----*H*SEFPAIAYECLVALEQGHMTVMEHIAAYKEIV 566 F +++ NL LK +I S+ P +E + L+Q TV E+I ++ + Sbjct: 170 FNLLRETKDNLNWAKLKRALIERYGGRQSDNP---FEEMKDLQQTG-TVDEYITTFEYVS 225 Query: 565 S*ITKLPKQ*YMGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREV----------T 416 S + +LP++ Y+G+ + L +IR +VR L P +QAM +AR +E E+ Sbjct: 226 SQVARLPEEQYLGYFMGGLKNHIRLKVRTLNPQTRLQAMKIARDVETELHGSLISIGGSV 285 Query: 415 KGVDLKRRTGAEGM------MXXXXXXXXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*R 254 G+ + +G G+ F G +H + S + Sbjct: 286 GGLRSWKGSGPSGLGPNGKKGSGFHYNPGSTRSGSGFNNNLTGSVHSKTS-------STQ 338 Query: 253 KNSGANSNIMTQLKRREGGPSHISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRL 74 S ANSNI R GN + R RG + + + + + +R + L Sbjct: 339 SASNANSNISYNTARS------------GNDEGRRISNGRNRGLKHLPYSELMERRTRGL 386 Query: 73 CFHCDQLYHSMHRYENKSFRLMLL 2 CF C + YH H+ K RLM+L Sbjct: 387 CFRCGEKYHP-HQCAEKQLRLMIL 409 >gb|PNX89396.1| pentatricopeptide repeat-containing protein, partial [Trifolium pratense] Length = 407 Score = 90.1 bits (222), Expect = 6e-16 Identities = 90/305 (29%), Positives = 144/305 (47%), Gaps = 12/305 (3%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ FD D V WI AE YF++ + +V +A +SMEG T H F + + +L Sbjct: 90 KKVKLPLFDGEDPVAWITRAEIYFDVQNTIDEMRVKLARLSMEGPTIHWFNLLMETEDDL 149 Query: 700 T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 + E LK +I E P +E L L+Q TV E + +++ + S + +LP++ Y Sbjct: 150 
SWEKLKRALIARYGGRRLENP---FEELSTLKQKG-TVEEFVESFELLSSQVGRLPEEQY 205 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKG--VDLKRRTGAEGMMXXXX 359 +G+ + L IR RVR L P N M+ M +A+ +E E+ +G D +RR G G Sbjct: 206 LGYFMSGLKPQIRRRVRTLNPRNRMEMMRIAKDVEGELKEGDDDDTERRFGKRG----GA 261 Query: 358 XXXXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNS-----GANSNIMTQLKRREGGP 194 SF Q Q T S NS G+NSN T Sbjct: 262 ERLGQRDWAGSFKNRSGSQPRDQ----TRSIGSYSNNSKTASYGSNSNSNT--------- 308 Query: 193 SHISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSF 17 + +SS + +S ++R RG ++ + + + K LCF C YH ++H+ KS Sbjct: 309 TAVSSARKHDGNSRSGATERWRGVRSFNNEETEERWRKGLCFKCGGKYHPTLHKCPEKSL 368 Query: 16 RLMLL 2 R+++L Sbjct: 369 RVLIL 373 >ref|XP_014622499.1| PREDICTED: uncharacterized protein LOC106795996 [Glycine max] Length = 463 Score = 90.5 bits (223), Expect = 7e-16 Identities = 80/303 (26%), Positives = 141/303 (46%), Gaps = 10/303 (3%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ F+ DLV WI E YF++ + +V +A +SMEG T H F + + L Sbjct: 55 KKVKLPLFEGDDLVAWITRVEIYFDVQNTTDKMRVKLARLSMEGSTIHWFNLLMETEDEL 114 Query: 700 T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 + E LK +I E P +E L + Q +V E + A++ + S + +LP++ Y Sbjct: 115 SWEKLKRALIARYGGRRLENP---FEELSTIRQKG-SVEEFVEAFELLSSQVGRLPEEQY 170 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTK----GVDLKRRTGAEGMMXX 365 +G+ + L IR RVR L P N MQ M +A+ +E E+ + G + G + M Sbjct: 171 LGYFMSGLKPQIRRRVRTLNPLNRMQMMRIAKDVEEELKEEDEDGERSDSKKGVQDRMGR 230 Query: 364 XXXXXXXXXXXXSF-PRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSH 188 PR +L W T +K + SN ++ L S Sbjct: 231 NNWAGSFQKSRSGLNPRDPIRSPNLGWSNPT------QKTGSSGSNTISTL-------SL 277 Query: 187 ISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRL 11 S+ G + +S++ +G +++ + + +R K LCF C YH ++H+ ++ R+ Sbjct: 278 ASTGRKGETDTRTGVSEKWKGVRSIRNNEMAERRAKGLCFKCGGKYHPTLHKCPERALRV 337 Query: 10 MLL 2 ++L Sbjct: 338 LIL 340 >dbj|GAU37691.1| hypothetical protein TSUD_164940 [Trifolium subterraneum] Length 
= 1542 Score = 91.3 bits (225), Expect = 9e-16 Identities = 78/303 (25%), Positives = 132/303 (43%), Gaps = 10/303 (3%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 KR+E+ FD D WI+ AE YF + G KV +A + MEG T H F + + + +L Sbjct: 104 KRVELPSFDGDDPAGWISRAEVYFRVQGTTPEVKVSLAQLCMEGSTIHFFNSIVREVPDL 163 Query: 700 T*EILKEKMI-*H*SEFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*YMGF 524 T E LKE ++ + YE L L+Q TV ++I ++ +++ I KLP++ + G+ Sbjct: 164 TWEGLKEALLERYGGHGEGDVYEQLTELKQ-EGTVEDYITEFEYLIAQIPKLPEKQFQGY 222 Query: 523 ILKTLNYYIRERVRALLPTNVMQAMTL---ARIIEREVTKGVDLKRRTGAEGMMXXXXXX 353 L L IR +VR++ M M L R +E+E+ G G+ Sbjct: 223 FLHGLKTEIRGKVRSMAAMGEMSRMKLFQVTRAVEKEIKGGSGSNHHRGS---------- 272 Query: 352 XXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSHISSKE 173 R +G LS R + + +KR G + + Sbjct: 273 -----------RVGNG-------------LS-RHGPSRSGSDWVMVKREGGNNGSVKNGT 307 Query: 172 VGNQHSPPSISDRTRGSQ------TMTH*DYINKREKRLCFHCDQLYHSMHRYENKSFRL 11 G + + DR RG +++ + + +++K LCF C +H MH+ K R+ Sbjct: 308 NGPRSEKQAQGDRRRGGHRDRGFTQLSYNELMERKQKGLCFKCGGPFHPMHQCPEKQLRV 367 Query: 10 MLL 2 +++ Sbjct: 368 LVI 370 >dbj|GAU13119.1| hypothetical protein TSUD_174190 [Trifolium subterraneum] Length = 1550 Score = 90.9 bits (224), Expect = 1e-15 Identities = 81/303 (26%), Positives = 141/303 (46%), Gaps = 10/303 (3%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ FD D V WI AE YF++ + ++ +A +SM+G T H F + + L Sbjct: 80 KKVKLPEFDGEDPVAWITRAEIYFDVQQTPDAMRIKLARLSMDGPTIHWFNLLLETEDEL 139 Query: 700 T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 + E LK+++I E P +E L L+QG +V E++ A++ + S +T+LP++ Y Sbjct: 140 SWEKLKKELIARYGGRRLENP---FEELSTLKQGG-SVKEYVEAFELLSSQVTRLPEEQY 195 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVD-----LKRRTGAEGMMX 368 +G+ + L IR RVR P +Q M +A+ +E E+ + D ++ G E M Sbjct: 196 LGYFMSGLKPPIRRRVRTFNPRTRLQMMRVAKDVEDELKEDDDDQGGYFSKKGGKERM-- 253 Query: 367 
XXXXXXXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSH 188 PR H W T + ++ + S++ T K+ E S Sbjct: 254 ---GRNNWAHVLNKGPRDTTRSGHTGWAQSTQHSGTIASSNNSTSSLSTTAKKGETSGS- 309 Query: 187 ISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRL 11 DR +G +M + + +R K LCF C +H ++H+ +S RL Sbjct: 310 ---------------MDRWKGFHSMHNNEIAERRAKGLCFKCGGKFHPTLHKCPERSLRL 354 Query: 10 MLL 2 ++L Sbjct: 355 LVL 357 >gb|PNX98841.1| retrotransposon-related protein [Trifolium pratense] Length = 786 Score = 89.7 bits (221), Expect = 2e-15 Identities = 81/298 (27%), Positives = 138/298 (46%), Gaps = 5/298 (1%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ FD D V WI AE YF++ + +V +A +SMEG T H F + + +L Sbjct: 92 KKVKLPLFDGEDPVAWITRAEIYFDVQNTVDEMRVKLARLSMEGSTIHWFNLLMETEDDL 151 Query: 700 T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 + E LK +I E P +E L L+Q TV E + +++ + S + +LP++ Y Sbjct: 152 SWEKLKRALIARYGGRRLENP---FEELSTLKQ-RGTVEEFVESFELLSSQVGRLPEEQY 207 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXXXXX 353 +G+ + L IR RVR L P N M+ M +A+ +E E+ D++ + Sbjct: 208 LGYFMSGLKPQIRRRVRTLNPRNRMEMMRIAKDVEGELKDDDDVEPKENTR--------- 258 Query: 352 XXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSHISSKE 173 S+P + G S ANS++ + + EG K+ Sbjct: 259 --VSNTGGSYPNSKTGS----------------NGSNANSSMSSTARNSEG-------KQ 293 Query: 172 VGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRLMLL 2 + SDR RG ++ + + +R K LCF C +H +MH+ +S R+++L Sbjct: 294 LSGG------SDRWRGFRSFQNSEVEERRSKGLCFKCGGKWHPTMHKCPERSLRVLIL 345 >gb|PNY03339.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1048 Score = 89.0 bits (219), Expect = 5e-15 Identities = 79/302 (26%), Positives = 140/302 (46%), Gaps = 9/302 (2%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++E+ F+ D WIA AE YFN+ KV +A M+G T H F+ + + L Sbjct: 56 KKVELPMFNGDDPAGWIARAEVYFNVQNTRPEIKVNLAQXCMDGPTIHFFKGLLEENETL 115 Query: 700 
T*EILKEKMI*H*SEFPAIA----YECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 T E LK+ ++ + ++ +E L AL+Q T+ E+I ++ +V+ + KLP Y Sbjct: 116 TWENLKDALL---ERYGGVSDGNVFEQLSALQQ-EGTIEEYIEDFERLVAQLPKLPNDQY 171 Query: 532 MGFILKTLNYYIRERVRALL---PTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXX 362 +G+ + L IR +VR+++ P + + M +AR +ERE+ +G + G + Sbjct: 172 LGYFVHGLKDKIRGKVRSMIAMGPMSRAKLMNVARAVERELEEGRRDWTSRRSHGSLSRH 231 Query: 361 XXXXXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIM--TQLKRREGGPSH 188 T + H W +N+ N ++ + GGP Sbjct: 232 VFGNKVV--------TQNEGRHGDWGM--------ARNNKDNRDLPGGNVGGKSIGGPRG 275 Query: 187 ISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYHSMHRYENKSFRLM 8 + S H+ + + R R + M+ + ++R+K LCF C YH H+ +K+ R+M Sbjct: 276 MFSVGQNKTHTTNTGNWRDRNVRNMSSQEIADRRQKGLCFKCGGPYHPRHQCPDKNLRVM 335 Query: 7 LL 2 +L Sbjct: 336 VL 337 >gb|PNX99328.1| retrotransposon-related protein [Trifolium pratense] Length = 959 Score = 88.6 bits (218), Expect = 6e-15 Identities = 76/302 (25%), Positives = 142/302 (47%), Gaps = 9/302 (2%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ FD D V WI AE YF++ A+ +V +A +SMEG T H F +++ +L Sbjct: 87 KKVKLPVFDGDDPVAWITRAEIYFDVQNTADEMRVKLARLSMEGPTIHWFNLLRETEDDL 146 Query: 700 T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 + E LK+ +I E P +E L L+Q +V +++ A++ + S + +LP++ Y Sbjct: 147 SWEKLKKALIARYGGRRLENP---FEELATLKQTG-SVEDYVEAFELLSSQVGRLPEEQY 202 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKG-VDLKRRTGAEGMMXXXXX 356 +G+ + L IR RVR L P + MQ M +A+ +E E+ + D R +G+ Sbjct: 203 LGYFMSGLKPQIRRRVRTLNPLSRMQMMRIAKDVEEELKEADEDEGRGVSRKGVQDRGNK 262 Query: 355 XXXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSHISSK 176 + +G++ N P S+ Sbjct: 263 SNWAGSFS-------------------------KGQNGSHPNRSFNSGWSNPSPKSHSAM 297 Query: 175 EVGNQHSPPSISDRT---RGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRLM 8 N ++ + + RT +G +++ + + + +R K LCF C + +H +MH+ K+ R++ Sbjct: 298 SKSNSNTSMATTGRTGPWKGVRSVQNSEIVERRAKGLCFKCGERWHPTMHKCPEKALRVL 357 Query: 7 LL 2 +L Sbjct: 358 
IL 359 >gb|PNX92330.1| retrotransposon-related protein [Trifolium pratense] Length = 1568 Score = 88.6 bits (218), Expect = 7e-15 Identities = 84/300 (28%), Positives = 139/300 (46%), Gaps = 7/300 (2%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ FD D V WI AE YF++ + +V +A +SMEG T H F +++ +L Sbjct: 104 KKVKLPVFDGEDPVAWITRAEIYFDVQNTMDEMRVKLARLSMEGATIHWFNLLRETEDDL 163 Query: 700 T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 T E LK +I E P +E L L+Q +V E++ A++ + S + +LP++ Y Sbjct: 164 TWEKLKRALIARYGGRRLENP---FEELATLKQSG-SVEEYVEAFELLSSQVGRLPEEQY 219 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXXXXX 353 +G+ + L IR RVR L P + MQ M +A+ +E E+ K VD G Sbjct: 220 LGYFMSGLKPQIRRRVRTLNPLSRMQMMRIAKDVEDEL-KEVDEDEGRGMSKKGVQDRGI 278 Query: 352 XXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNI--MTQLKRREGGPSHISS 179 + R +G N ANS + + Q G S+ SS Sbjct: 279 KNEWAGSMNKGR--YGP---------------NPNRPANSGLSNLNQKLGSTGSNSNSSS 321 Query: 178 KEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRLMLL 2 S P +G ++ + + + +R K LCF C + +H +MH+ K+ R+++L Sbjct: 322 SMASTGRSGP-----WKGVRSFQNNEIVERRAKGLCFKCGERWHPTMHKCPEKALRVLIL 376 >gb|PNY15768.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1583 Score = 88.6 bits (218), Expect = 7e-15 Identities = 81/303 (26%), Positives = 130/303 (42%), Gaps = 10/303 (3%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++E+ F+ D WI+ AE YF + G KV +A + MEG T H F + NL Sbjct: 104 KKVELPAFNGEDPAGWISRAEVYFRVQGTTPEVKVNLAQLCMEGSTIHFFNSLVGEEENL 163 Query: 700 T*EILKEKMI-*H*SEFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*YMGF 524 T + LKE ++ + YE L L+Q TV E+I ++ +++ I KLP++ + G+ Sbjct: 164 TWDRLKESLLERYGGHGEGDVYEQLTELKQ-EGTVEEYITEFEYLIAQIPKLPEKQFRGY 222 Query: 523 ILKTLNYYIRERVRALLPTNVM---QAMTLARIIEREVTKGVDLKRRTGAEGMMXXXXXX 353 + L IR +VR+L M + + + R +E+EV G K G+ Sbjct: 223 FIHGLKTEIRGKVRSLAAMGEMNRTKLLQITRAVEKEVRGGNGSKHNHGSRFGSGSHRSG 282 Query: 
352 XXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNI------MTQLKRREGGPS 191 + W +V N GANS + Q + R GP Sbjct: 283 SYGP-----------NRNRSDW---VMVRRGASNNGGANSGSGEFKGQLAQGETRRNGP- 327 Query: 190 HISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYHSMHRYENKSFRL 11 R RG + +T+ + +N+R+K LCF C +H H+ K RL Sbjct: 328 ------------------RDRGFKHLTYGELMNRRQKGLCFTCGGPFHPRHQCPEKHLRL 369 Query: 10 MLL 2 +++ Sbjct: 370 LVV 372 >dbj|GAU42300.1| hypothetical protein TSUD_136860 [Trifolium subterraneum] Length = 1523 Score = 88.2 bits (217), Expect = 9e-15 Identities = 78/294 (26%), Positives = 128/294 (43%), Gaps = 1/294 (0%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 KR+E+ FD D WI+ AE YF + KV +A + MEG T H F + + +L Sbjct: 86 KRVELPPFDGEDPAGWISRAEVYFRVQNTMPAVKVSLAQLCMEGSTIHFFNSLLREKEDL 145 Query: 700 T*EILKEKMI*-H*SEFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*YMGF 524 + E LKE ++ + YE L L+Q TV ++I ++ +++ I KLP++ + G+ Sbjct: 146 SWEELKEALLERYGGHGEGDVYEQLTELKQ-EGTVEDYITDFEYLIAQIPKLPEKQFQGY 204 Query: 523 ILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXXXXXXXX 344 L L IR +VR+L+ M L +VT+ V+ K G G Sbjct: 205 FLHGLKVEIRGKVRSLIAMGEMSRSKLM-----QVTRAVE-KEIQGKNGSGSNHNRGSKP 258 Query: 343 XXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSHISSKEVGN 164 F + + W + K SG N + R G S K+ Sbjct: 259 SSGFQRFGPNGPNRNNSDW------VMVRNKESGGNGGV------RSGSSGLKSDKQA-- 304 Query: 163 QHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYHSMHRYENKSFRLMLL 2 Q R RG +++ + + +++K LCF C +H MH+ K R++++ Sbjct: 305 QGDRKRSGPRDRGFNHLSYNELMERKQKGLCFKCGGPFHPMHQCPEKQLRVLIV 358 >dbj|GAU14123.1| hypothetical protein TSUD_169480 [Trifolium subterraneum] Length = 1349 Score = 87.4 bits (215), Expect = 2e-14 Identities = 82/314 (26%), Positives = 142/314 (45%), Gaps = 21/314 (6%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ F+ D V WI AE YF++ + +V ++ +SMEG T H F + + NL Sbjct: 72 KKVKLPVFEGEDPVAWITRAEIYFDVQNTPDDLRVKLSRLSMEGSTIHWFNLLMETEDNL 131 Query: 700 
T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 + E LK+ +I E P +E L L Q +V + A++ + S + +LP++ Y Sbjct: 132 SWEKLKKALIARYGGRRLENP---FEELSTLRQTS-SVENFVEAFELLSSQVGRLPEEQY 187 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXXXXX 353 +G+ + L +IR RVR L P MQ M +A+ +E E+ + D R+G Sbjct: 188 LGYFMSGLKPHIRRRVRTLNPVTRMQMMRIAKDVEDELNEEEDDGVRSGVRKSSGDRSG- 246 Query: 352 XXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNI--MTQLKRREGGPSHISS 179 +W LS + SG N N +T+ G + + Sbjct: 247 ------------------RSEW-----AGLSFKSRSGYNPNSKEITRFANSSGSGPNYKT 283 Query: 178 KEVGNQHSPPS--------------ISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-S 44 VG+ S S S+ +G +++++ + + +R K LCF C YH + Sbjct: 284 GSVGSSPSSNSSLFSSARKENDRRASSEIGKGVRSISNDEIMERRAKGLCFKCGGKYHPT 343 Query: 43 MHRYENKSFRLMLL 2 +H+ KS R+++L Sbjct: 344 LHKCPEKSLRVLIL 357 >gb|PNX92424.1| retrotransposon-related protein [Trifolium pratense] Length = 1554 Score = 87.4 bits (215), Expect = 2e-14 Identities = 77/301 (25%), Positives = 131/301 (43%), Gaps = 8/301 (2%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 KR+E+ FD D WI+ AE YF + KV ++ + MEG T H F + K +L Sbjct: 98 KRVELPSFDGDDPAGWISRAEVYFRVQNTTPAIKVSLSQLCMEGPTIHFFNSLLKENEDL 157 Query: 700 T*EILKEKMI-*H*SEFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*YMGF 524 + + LKE ++ + YE L L Q TV E+I ++ +++ I KLP++ + G+ Sbjct: 158 SWDELKEALLERYGGHGEGDVYEQLTELRQ-EGTVEEYITDFEYLIAQIPKLPEKQFQGY 216 Query: 523 ILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXXXXXXXX 344 L L IR +VR+L+ M L + + R V K + K TG Sbjct: 217 FLHGLKLEIRGKVRSLVALGEMSRTKLMQ-VTRAVEKEIQGKSGTGLN------------ 263 Query: 343 XXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRRE-GGPSHISSKEVG 167 G H + G+N N ++ +E GG + S G Sbjct: 264 -----------RGPKHTNG----------SQKFGSNRNDWIFVRNKEAGGSGGVKSNNNG 302 Query: 166 NQHSPPSISD------RTRGSQTMTH*DYINKREKRLCFHCDQLYHSMHRYENKSFRLML 5 ++ + D R RG +++ + + +++K LCF C +H MH+ K ++++ Sbjct: 303 LRNEKNAQGDKRRSGSRDRGFNHLSYNELMERKQKGLCFKCGGPFHPMHQCPEKQLKVLI 362 Query: 4 L 
2 + Sbjct: 363 V 363 >gb|PNY13662.1| retrotransposon-related protein [Trifolium pratense] Length = 1562 Score = 87.4 bits (215), Expect = 2e-14 Identities = 83/300 (27%), Positives = 137/300 (45%), Gaps = 7/300 (2%) Frame = -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ FD D V WI AE YF++ + +V +A +SMEG T H F + + +L Sbjct: 82 KKVKLPLFDGEDPVAWITRAEIYFDVQNTVDEMRVKLARLSMEGSTIHWFNLLMETEDDL 141 Query: 700 T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 + E LK +I E P +E L L+Q TV E + +++ + S + +LP++ Y Sbjct: 142 SWEKLKRALIARYGGRRLENP---FEELSTLKQ-RGTVEEFVESFELLSSQVGRLPEEQY 197 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREV--TKGVDLKRRTGAEGMMXXXX 359 +G+ + L IR RVR L P N M+ M +A+ +E E+ D +RR + Sbjct: 198 LGYFMSGLKPQIRRRVRTLNPRNRMEMMRIAKDVEGELKDDDDDDAERRLDKKNHFERMG 257 Query: 358 XXXXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSHISS 179 R +G + LS S NS + + S + Sbjct: 258 LRDWAGSV-----RNKNGSQSKD-----TMRLSNAGGSYPNSKMGSTASNASSSMSSTAR 307 Query: 178 KEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRLMLL 2 G Q S SDR +G ++ + + +R K LCF C +H +MH+ +S R+++L Sbjct: 308 NSEGKQRS--GASDRWKGVRSYQNSEVEERRSKGLCFKCGGKWHPTMHKCPERSLRVLIL 365 >ref|XP_014632080.1| PREDICTED: uncharacterized protein LOC106798941 [Glycine max] Length = 998 Score = 86.3 bits (212), Expect = 3e-14 Identities = 81/303 (26%), Positives = 136/303 (44%), Gaps = 8/303 (2%) Frame = -1 Query: 886 IYKRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRIL 707 + K++++ F+ D V WI AE YF++ + +V ++ +SMEG T H F +++ Sbjct: 77 VEKKVKLPVFEGEDPVAWITRAEIYFDVQNTPDEMRVKLSRLSMEGSTIHWFNLLRETED 136 Query: 706 NLT*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ 539 +L+ EILK +I E P +E L L Q TV E + A++ + S + +LP++ Sbjct: 137 DLSWEILKRALIARYGGRRLENP---FEELSTLRQTG-TVEEFVEAFELLSSQVGRLPEE 192 Query: 538 *YMGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXXX 359 Y+G+ + L IR RVR L P N MQ M +A+ +E E+ + D R + M Sbjct: 193 
QYLGYFMSGLKQPIRWRVRTLNPQNRMQVMRMAKDVEEELKEEDDEGDRYYGKNRMGRSE 252 Query: 358 XXXXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGA---NSNIMTQLKRREGGPSH 188 R IH + R NSG + + S Sbjct: 253 PNRLLSK-----SRNGSNPIHKDFT---------RSNSGGYAPSQKTGSTGSNTNPTSSM 298 Query: 187 ISSKEVGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRL 11 S+ G S+R +G + + + +R K LCF C YH ++H+ ++ R+ Sbjct: 299 NSTGRKGENDRRTLSSERWKGVRGVQSDEIAERRAKGLCFKCGGKYHPTLHKCPERAMRV 358 Query: 10 MLL 2 ++L Sbjct: 359 LIL 361 >gb|PNY16937.1| retrotransposon-related protein [Trifolium pratense] Length = 1550 Score = 86.3 bits (212), Expect = 4e-14 Identities = 94/345 (27%), Positives = 160/345 (46%), Gaps = 9/345 (2%) Frame = -1 Query: 1009 VLHQSRTLGLILQESMLKQAGRRH-SYRG*FK*KDSYRRG*TIY-KRMEVSGFDRSDLVE 836 ++ Q + +IL E + KQAG+R S G DS + + K++++ F+ D V Sbjct: 31 LVRQMQQQSVILAE-LSKQAGKRAPSPEGETSVGDSSQSESRLAGKKVKLPLFEGEDPVA 89 Query: 835 WIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNLT*EILKEKMI*H*S- 659 WI AE YF++ G + +V +A +SMEG T H F + + +L+ E LK+ +I Sbjct: 90 WITRAEIYFDVQGTLDDMRVKLARLSMEGSTIHWFNLLMETEDDLSWEKLKKALIARYGG 149 Query: 658 ---EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*YMGFILKTLNYYIRER 488 E P +E L L+Q TV E + +++ + S + +LP++ Y+G+ + L IR R Sbjct: 150 RRLENP---FEELATLKQSG-TVEEFVESFELLSSQVGRLPEEQYLGYFMSGLKPQIRRR 205 Query: 487 VRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXXXXXXXXXXXXXSFPRT*H 308 V+ L P + M+ M +A+ +E E+ + ++RR +G + + Sbjct: 206 VQTLNPRSRMEMMRIAKDVEGELKEEDHVERRYVRKGSYGLGQRDWAGSLNNKNGSQLKD 265 Query: 307 GQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSHISS--KEVGNQHSPPSISDR 134 Q S NS + S +SS K+ G Q S +R Sbjct: 266 SNRLFQ-----------AGGSNPNSKTGSTGSNTNSNASLLSSARKKDGGQRS----GER 310 Query: 133 TRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRLMLL 2 RG ++ + +R K LCF C YH ++H+ KS R+++L Sbjct: 311 WRGVRSFHSEEMEERRVKGLCFKCGGKYHPTLHKCPEKSLRVLIL 355 >gb|PNY17068.1| retrotransposon-related protein [Trifolium pratense] Length = 463 Score = 85.1 bits (209), Expect = 4e-14 Identities = 82/298 (27%), Positives = 138/298 (46%), Gaps = 5/298 (1%) Frame 
= -1 Query: 880 KRMEVSGFDRSDLVEWIAHAE*YFNI*GVAEVYKVPMAFVSMEGKTYH*FR*MKKRILNL 701 K++++ F+ D V WI AE YF++ E KV +A +SMEG T H F M + +L Sbjct: 68 KKVKLPMFEGEDPVAWITRAEIYFDVQNTTEEMKVKLARLSMEGPTIHWFNLMLETEDDL 127 Query: 700 T*EILKEKMI*H*S----EFPAIAYECLVALEQGHMTVMEHIAAYKEIVS*ITKLPKQ*Y 533 + LK+ +I E P +E L AL+Q +V +++ A++ + S + +LP++ Y Sbjct: 128 SWIKLKKALIARYGGRRLENP---FEELSALKQTG-SVEDYVEAFELLSSQVGRLPEEQY 183 Query: 532 MGFILKTLNYYIRERVRALLPTNVMQAMTLARIIEREVTKGVDLKRRTGAEGMMXXXXXX 353 +G+ + L IR RVR L P++ +Q M +A+ +E E+ + D R G Sbjct: 184 LGYFMSGLKAQIRRRVRTLNPSSRLQMMRIAKDVEEELKEEDDENDR--RFGKKLGGERM 241 Query: 352 XXXXXXXXSFPRT*HGQIHLQWPF*TVVFLS*RKNSGANSNIMTQLKRREGGPSHISSKE 173 + R GQI +K S N + T S S Sbjct: 242 GRSDWFGPNSNRNGSGQI--------------QKESKPNPSQKTG-SHNSNLSSSSSLAS 286 Query: 172 VGNQHSPPSISDRTRGSQTMTH*DYINKREKRLCFHCDQLYH-SMHRYENKSFRLMLL 2 G + S S+ +G +++ + + +R K LCF C +H + H+ KS R+++L Sbjct: 287 TGRKMDNDSRSNSWKGIRSIHSDEVVERRAKGLCFKCGGKWHPTQHKCPEKSIRVLIL 344
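
Each alignment in the report above opens with a ">" subject line followed by Length, Score, Expect and Identities fields. The following is a minimal sketch, not part of the BLAST distribution, of how those fields could be pulled out of the report text with a regular expression. It assumes the report is held in a Python string and tolerates collapsed or wrapped whitespace; the names `HIT_HEADER`, `parse_hit_headers` and `SAMPLE` are illustrative only.

```python
import re

# Hypothetical helper pattern: matches one alignment header of a legacy
# BLAST 2.2.x text report. Whitespace is matched loosely (\s+ / \s*) because
# the text may have been flattened or re-wrapped.
HIT_HEADER = re.compile(
    r">(?P<subject>\S+)\s+(?P<desc>.*?)\s+Length\s*=\s*(?P<length>\d+)\s+"
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<aln>\d+)",
    re.DOTALL,
)


def parse_hit_headers(report):
    """Return one dict per alignment header found in `report`."""
    hits = []
    for m in HIT_HEADER.finditer(report):
        evalue_text = m.group("evalue")
        # Legacy BLAST can print E-values such as "e-100" with no mantissa.
        if evalue_text.startswith("e"):
            evalue_text = "1" + evalue_text
        ident = int(m.group("ident"))
        aln = int(m.group("aln"))
        hits.append({
            "subject": m.group("subject"),
            "description": m.group("desc"),
            "subject_length": int(m.group("length")),
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue_text),
            # Percent identity recomputed from the identities fraction.
            "pct_identity": 100.0 * ident / aln,
        })
    return hits


# First alignment header of this report, copied as it appears in the text.
SAMPLE = (
    ">gb|KZV44098.1| hypothetical protein F511_10769 [Dorcoceras hygrometricum] "
    "Length = 1119 Score = 106 bits (264), Expect = 1e-20 "
    "Identities = 83/304 (27%), Positives = 144/304 (47%), Gaps = 11/304 (3%) "
    "Frame = -1"
)

if __name__ == "__main__":
    for hit in parse_hit_headers(SAMPLE):
        print(hit["subject"], hit["bits"], hit["evalue"],
              round(hit["pct_identity"], 1))
```

Recomputing percent identity from the fraction (83/304 here) keeps one more digit than the rounded value printed in the report, which can matter when filtering hits near a threshold.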