BLASTX nr result
ID: Rehmannia32_contig00010932
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia32_contig00010932
         (900 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

gb|PNX80752.1| ribonuclease H, partial [Trifolium pratense]          338  e-108
gb|PNX92710.1| ribonuclease H [Trifolium pratense]                   339  e-105
gb|PNY15174.1| ribonuclease H, partial [Trifolium pratense]          340  e-104
dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subte...  337  e-102
ref|XP_023909336.1| uncharacterized protein LOC112020997 [Quercu...  335  e-102
gb|PNY15111.1| ribonuclease H [Trifolium pratense]                   333  e-101
ref|XP_023894138.1| uncharacterized protein LOC112006071 [Quercu...  325  e-101
gb|KMS97068.1| hypothetical protein BVRB_7g179290 [Beta vulgaris...  323  e-101
gb|PNY14301.1| ribonuclease H [Trifolium pratense]                   330  e-101
gb|PNX95041.1| ribonuclease H, partial [Trifolium pratense]          330  e-100
gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]          321  e-99
ref|XP_023913142.1| uncharacterized protein LOC112024740 [Quercu...  326  2e-99
gb|PNY01158.1| ribonuclease H, partial [Trifolium pratense]          323  2e-99
ref|XP_023874626.1| uncharacterized protein LOC111987155 [Quercu...  324  6e-99
gb|POF03084.1| line-1 retrotransposable element orf2 protein [Qu...  314  5e-98
ref|XP_010682492.1| PREDICTED: uncharacterized protein LOC104897...  322  8e-98
ref|XP_023881891.1| uncharacterized protein LOC111994244 [Quercu...  321  1e-97
ref|XP_023919081.1| uncharacterized protein LOC112030645 [Quercu...
314 2e-97 dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subte... 323 3e-97 gb|PNX95452.1| ribonuclease H, partial [Trifolium pratense] 307 4e-97 >gb|PNX80752.1| ribonuclease H, partial [Trifolium pratense] Length = 696 Score = 338 bits (866), Expect = e-108 Identities = 162/298 (54%), Positives = 208/298 (69%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LFFQKFW V D + VL +LNN P ++N T+I LIPK+K+P KDYRP Sbjct: 8 GPDGLPALFFQKFWHIVGRDVQNLVLEILNNGRSPKDINKTFIALIPKLKNPLAPKDYRP 67 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+ +++TKVIANR+K ILPDII E QSAF+ GRLITDNA++A E F Sbjct: 68 ISLCNVVMKIVTKVIANRIKPILPDIIDEEQSAFVQGRLITDNALIAMECFHWMKKKKKG 127 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDR+EW F++A L+ +G N V+LI+ C+STVSY IL+NG P Sbjct: 128 KKGTMALKLDMSKAYDRIEWNFVKATLKSMGFPSNVVDLILNCISTVSYQILINGQPSKS 187 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 FNP RGLRQGDPLSPYLF++CA+ S L+ G IHG ++ R AP +SHL FADDS+ Sbjct: 188 FNPERGLRQGDPLSPYLFILCADVLSGLMKRKAVTGGIHGIKIARQAPKISHLLFADDSL 247 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA++ E D + ++ Y++ASGQ +NL+KSE+S S NV + + NRMGV+ V Sbjct: 248 LFARASSTEADVILSVLAEYQQASGQVVNLDKSEVSFSQNVRNEEKEMIRNRMGVKTV 305 >gb|PNX92710.1| ribonuclease H [Trifolium pratense] Length = 1052 Score = 339 bits (870), Expect = e-105 Identities = 161/298 (54%), Positives = 210/298 (70%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LF+QK+W V + + L +LN DP ++N T++VLIPK K+P + KD+RP Sbjct: 422 GPDGLPALFYQKYWHIVGVEVQNLALSILNQNGDPRDINKTFLVLIPKGKNPTSPKDFRP 481 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+ +++TK IANRLKL LPD+I QSAF+ GRLITDNA++A E F Sbjct: 482 ISLCNVVMKIVTKTIANRLKLTLPDVIDIEQSAFVQGRLITDNALIAMECFHWLKKKRKG 541 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 
542 LDMSKAYDR+EWPF+Q +L +G VELIMRC+S+VSY IL+NG P Sbjct: 542 KKGVMALKLDMSKAYDRIEWPFVQHVLASMGYPVRVVELIMRCISSVSYQILINGQPSPS 601 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F P RGLRQGDPLSPYLF++CA+ S L ++A R +IHG +V R+AP +SHLFFADDS+ Sbjct: 602 FQPERGLRQGDPLSPYLFILCADVLSGLFHKAAREKEIHGIKVARSAPQLSHLFFADDSL 661 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA +HE K+ I++ Y++ASGQ +NL+KSE S S NV +N +CN MG + V Sbjct: 662 LFTRANSHEATKILSILQVYQQASGQVVNLDKSEASFSRNVQNEDKNMICNMMGAKAV 719 >gb|PNY15174.1| ribonuclease H, partial [Trifolium pratense] Length = 1289 Score = 340 bits (872), Expect = e-104 Identities = 157/298 (52%), Positives = 209/298 (70%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P FFQK+W + D S +L +LNN P ++N T++ LIPK K+P T KDYRP Sbjct: 919 GPDGLPAFFFQKYWSIIGEDVQSLILNILNNNRPPGDINKTFLTLIPKNKNPTTPKDYRP 978 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+ +++TK IANR+K ILP+II E QSAF+ GRLITDNA++A E F Sbjct: 979 ISLCNVLMKIVTKCIANRIKPILPEIIDEEQSAFVQGRLITDNALIAMECFHWLKKKKKG 1038 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDM+KAYDR+EWPF++A L +G V+LIMRC+STVSY IL+NG P + Sbjct: 1039 RKGMMALKLDMAKAYDRIEWPFVKATLISMGFPTKMVDLIMRCISTVSYQILINGQPSKR 1098 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F RGLRQGDPLSPYLF++CA+ FS LL++ IHG ++ R AP +SHLFFADDS+ Sbjct: 1099 FTAERGLRQGDPLSPYLFILCADVFSGLLHQKVAANSIHGLKIARQAPQISHLFFADDSL 1158 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA HE + + +I+ Y++ASGQ +N++KSE+S S N+ + +CN+MGV+ V Sbjct: 1159 LFTRANQHEAESIMEILSTYQKASGQMVNMDKSEVSFSRNMLNEEKEMICNKMGVKTV 1216 >dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subterraneum] Length = 1475 Score = 337 bits (865), Expect = e-102 Identities = 160/298 (53%), Positives = 209/298 (70%) Frame = +3 Query: 3 
GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LFFQK+W V + + L +LN DP ++N T++VLIPK K+P + KD+RP Sbjct: 555 GPDGLPALFFQKYWHIVGVEVQNLALSILNQNGDPRDINKTFLVLIPKGKNPTSPKDFRP 614 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+ +++TK IANRLK+ LPD+I QSAF+ GRLITDNA++A E F Sbjct: 615 ISLCNVVMKIVTKTIANRLKITLPDVIDIEQSAFVQGRLITDNALIAMECFHWLKKKRKG 674 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDR+EWPF+Q +L +G VELIMRC+S+VSY IL+NG P Sbjct: 675 KKGVMALKLDMSKAYDRIEWPFVQHVLTSMGYPVKVVELIMRCISSVSYQILINGQPSPS 734 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F P RGLRQGDPLSPYLF++CA+ S L ++A R +IHG +V R+AP +SHLFFADDS+ Sbjct: 735 FRPERGLRQGDPLSPYLFILCADVLSGLFHKAAREKEIHGIKVARSAPQLSHLFFADDSL 794 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA +HE K+ I++ Y++ASGQ +NL+KSE S S NV + +CN MG + V Sbjct: 795 LFTRANSHEATKILSILQVYQQASGQVVNLDKSEASFSRNVQNEDKTMICNMMGAKAV 852 >ref|XP_023909336.1| uncharacterized protein LOC112020997 [Quercus suber] Length = 1369 Score = 335 bits (858), Expect = e-102 Identities = 158/295 (53%), Positives = 212/295 (71%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDGMPP+FF+ +W V D +S L VLN+ + P N+NHT+I LIPK KSPET KD+RP Sbjct: 455 GPDGMPPIFFKHYWNTVGPDVLSATLSVLNSGIIPPNINHTFISLIPKTKSPETAKDFRP 514 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNVI+++I+K IANRLK LP +IS++QSAF+ RLITDN ++AFE Sbjct: 515 ISLCNVIYKLISKTIANRLKKCLPKLISDSQSAFLSNRLITDNILIAFETLHHLKNKRKG 574 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDRVEW FL+ +++KLG R W++LI C+STVS+SIL+NG+P Sbjct: 575 KTGYMALKLDMSKAYDRVEWTFLENLMDKLGFARKWIDLIKSCISTVSFSILINGAPYGL 634 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 +P RGLRQGDPLSPYLFL+CAEG AL+ +A NG I G + R P V+HL FADDS+ Sbjct: 635 
IHPQRGLRQGDPLSPYLFLLCAEGLHALIKQAATNGTISGVSLCREGPRVTHLLFADDSL 694 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGV 887 L C+A + E + V ++++ YE ASGQ+IN +K++L S+N ++ +RN + + +GV Sbjct: 695 LLCKANSRECNSVLELLEKYERASGQRINRDKTQLFFSSNTNQQTRNSIKSSLGV 749 >gb|PNY15111.1| ribonuclease H [Trifolium pratense] Length = 1334 Score = 333 bits (854), Expect = e-101 Identities = 158/298 (53%), Positives = 206/298 (69%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LFFQK+W V D VL +LNN P ++N T+I LIPK+KSP+ KDYRP Sbjct: 422 GPDGLPALFFQKYWHIVGRDVQRLVLQILNNDRSPEDINRTFIALIPKVKSPQAPKDYRP 481 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+ +++TKVIANR+K ILPDI+ E QSAF+ GRLITDNA++A E F Sbjct: 482 ISLCNVVMKIVTKVIANRIKPILPDIVDEEQSAFVQGRLITDNALIAMECFHWMKKKKRG 541 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDR+EW F++A L +G V+LIM+C+ TVSY IL+NG P Sbjct: 542 KKGTMALKLDMSKAYDRIEWTFVKATLNSMGFPCKLVDLIMKCICTVSYQILINGQPSKL 601 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F P RGLRQGDPLSPYLF++CA+ S L+ + G +HG ++ R AP +SHLFFADDS+ Sbjct: 602 FTPERGLRQGDPLSPYLFILCADVLSGLVKKQAETGSMHGIQIARQAPKISHLFFADDSL 661 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA+ E + ++ Y++ASGQ +NL+KSE+S S NV ++ + NRMGV+ V Sbjct: 662 LFARASAAEAGVILNVLAEYQKASGQVVNLDKSEVSFSQNVRNEDKDMIRNRMGVKTV 719 >ref|XP_023894138.1| uncharacterized protein LOC112006071 [Quercus suber] Length = 910 Score = 325 bits (833), Expect = e-101 Identities = 154/298 (51%), Positives = 210/298 (70%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDGMPPLF+QKFW V+ D I TVL LN+ P+ LNHT+I LIPK K+PE ++RP Sbjct: 119 GPDGMPPLFYQKFWHLVRGDVIQTVLLYLNSDSLPNPLNHTFITLIPKTKNPERVSEFRP 178 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLC+V++++ +KV+ANRLK +LP IISE+QSAFI GRLITDN +VA+E Sbjct: 179 
ISLCHVLYKIFSKVLANRLKRVLPHIISEHQSAFIKGRLITDNILVAYETLHYMKNHNSG 238 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDRV+W FL+ ++ ++G + WV L+M C++TV+YSIL+NG P+ Sbjct: 239 KSGFMALKLDMSKAYDRVKWSFLKNVMLQMGFDVCWVTLVMECITTVTYSILINGEPMGD 298 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 P RG+RQGDPLSPYLFL+C+EG ++ A NGDI G + RN P ++HLFFADDS+ Sbjct: 299 IKPSRGIRQGDPLSPYLFLVCSEGLHRMIQRAACNGDIKGVSICRNGPKLTHLFFADDSL 358 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LFCRAT E KV +I+ YE SGQ++N EK+ L S + + ++++ + +GV ++ Sbjct: 359 LFCRATIQECRKVMEILATYERVSGQRLNREKTALFFSKSTALEHQSQIMDELGVSKL 416 >gb|KMS97068.1| hypothetical protein BVRB_7g179290 [Beta vulgaris subsp. vulgaris] Length = 849 Score = 323 bits (829), Expect = e-101 Identities = 165/298 (55%), Positives = 204/298 (68%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 G DGM LF+QKFW V D ++ V + D S LN T IVLIPK + P+ ++RP Sbjct: 318 GVDGMHALFYQKFWHIVGADVVAFVKSWWKGEEDISMLNKTCIVLIPKCQKPQQMTEFRP 377 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV++++I+K++ANRLK+ LPD+IS +QSAF+PGRLITDNA+VAFEIF Sbjct: 378 ISLCNVLYKIISKLMANRLKIWLPDLISHHQSAFVPGRLITDNALVAFEIFHRMKRRGDG 437 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDRVEW FL+ ++ K+G W++ IM C+STVSY LNG+ Sbjct: 438 KAGTMAFKLDMSKAYDRVEWSFLEKVMAKMGFCYGWIQRIMICLSTVSYCFKLNGNIEGN 497 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 P RGLRQGDPLSPYLFL+CAE FS +L +A RNG+IHG +V R AP VSHLFFADDSI Sbjct: 498 IIPSRGLRQGDPLSPYLFLLCAEAFSTMLAQAARNGEIHGAQVCRTAPRVSHLFFADDSI 557 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RAT E K+ II YE ASGQ+IN KSE+S S NV + R + N GV +V Sbjct: 558 LFSRATLQECSKIADIISVYERASGQKINFNKSEVSFSKNVDDTRRLAIRNMFGVGEV 615 >gb|PNY14301.1| ribonuclease H [Trifolium pratense] Length = 1196 Score = 330 bits (845), Expect = e-101 Identities 
= 156/298 (52%), Positives = 210/298 (70%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LF+QK+W V D VL +LN+ P +N T++VLIPK K+P + KD+RP Sbjct: 281 GPDGIPALFYQKYWHIVGGDIQQMVLNILNHNGQPHEINKTFLVLIPKGKNPCSPKDFRP 340 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+ +++TKVIANRLK LPD++ QSAF+ GRLI+DNA++A E F Sbjct: 341 ISLCNVVMKIVTKVIANRLKYTLPDVVDIEQSAFVQGRLISDNALIAMECFHWLKKKKQG 400 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDR+EW F+ +L +G ++ +VELIMRC+++VSY IL+NG P Sbjct: 401 KKGMMALKLDMSKAYDRIEWSFVNHVLTSMGYSQKFVELIMRCITSVSYQILINGQPSTT 460 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F P RGLRQGDPLSPYLF++CA+ S LL++A + +HG +V R+AP +SHLFFADDS+ Sbjct: 461 FFPERGLRQGDPLSPYLFILCADVLSGLLHKASVSKKLHGIKVARSAPQLSHLFFADDSL 520 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA + E + +I+K Y+ ASGQ +NL+KSE S S NV + +N +CN MGV+ V Sbjct: 521 LFSRANSDEAHTIMKILKTYQNASGQVVNLDKSEASFSRNVPSIDKNSICNMMGVKAV 578 >gb|PNX95041.1| ribonuclease H, partial [Trifolium pratense] Length = 1348 Score = 330 bits (847), Expect = e-100 Identities = 158/298 (53%), Positives = 204/298 (68%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LF+QK+W FV D S VL +LNN P +N T+IVLIPK K+P+T KDYRP Sbjct: 536 GPDGLPALFYQKYWHFVGKDVKSLVLDILNNHSSPEAINGTFIVLIPKGKNPKTPKDYRP 595 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+ +++TK +ANRLK+I+PD+I QS F+ GRLITDN ++A E F Sbjct: 596 ISLCNVVMKIVTKTLANRLKVIMPDVIDVEQSGFVQGRLITDNGLIAMECFHWLKKKRKG 655 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDR+EW F+Q +L K+G N V +IM+C+STVSY IL+NG P Sbjct: 656 KKGMMAVKLDMSKAYDRIEWAFVQTVLVKMGFPDNIVSVIMKCISTVSYQILINGQPSIS 715 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F P RGLRQGDP+SPYLF++CA S LL++ ++ IHG +V R AP +SHL FADDS+ Sbjct: 
716 FTPDRGLRQGDPISPYLFILCANVLSGLLHKEVQSKRIHGIKVARRAPQISHLQFADDSL 775 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA HE + + + AY+ ASGQ +N++KSE S S NV E +CN MG + V Sbjct: 776 LFARANQHEANVILSTLAAYQRASGQVVNMDKSEASFSRNVLEADSQFICNMMGAKTV 833 >gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense] Length = 894 Score = 321 bits (823), Expect = e-99 Identities = 148/298 (49%), Positives = 204/298 (68%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LFFQK+W V D VL +LN+ P +LN T++ LIPK K+P + +RP Sbjct: 43 GPDGIPALFFQKYWHIVGKDTCLKVLSILNDNGTPESLNRTFVALIPKCKNPGSPNHFRP 102 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNVI +++TK IANR+K ILP+++ E QSAF+ GRLITDNA++A E F Sbjct: 103 ISLCNVIMKIVTKCIANRMKCILPEVVDEEQSAFVKGRLITDNALIAMECFHWMKKKTKG 162 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDM+KAYDR+EWPF+++ L+ G + LIM C+STVSY +L+NG P Sbjct: 163 KKGMMALKLDMAKAYDRMEWPFIRSTLQATGFPPTMINLIMNCISTVSYQLLVNGQPSRS 222 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F P RG+RQGDPLSPY+F++CA S +L++ RN IHG +V RNAP ++HL FADDS+ Sbjct: 223 FKPERGIRQGDPLSPYVFILCANVLSGMLHKGARNNKIHGLQVARNAPKITHLLFADDSL 282 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA E + + I++ Y+ ASGQ ++LEKSE+S S N+ + +N +CN++ V+ V Sbjct: 283 LFARANQKEAEVIINILQTYQTASGQLVSLEKSEVSFSRNLPTIEKNMICNKIDVKAV 340 >ref|XP_023913142.1| uncharacterized protein LOC112024740 [Quercus suber] Length = 1194 Score = 326 bits (835), Expect = 2e-99 Identities = 158/298 (53%), Positives = 205/298 (68%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDGM LF+Q+FW V +D S VL LN+ +N+T+IVLIPK+KSPE D+RP Sbjct: 299 GPDGMNALFYQRFWHIVGNDVSSAVLDFLNSGTMLPEINYTHIVLIPKVKSPEKMTDFRP 358 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNVI+++I+KV+ANRLK ILP +IS QSAF+PGRLITDN ++A+E Sbjct: 359 
ISLCNVIYKIISKVLANRLKTILPQLISPTQSAFVPGRLITDNVLLAYETLHAMHGRKKG 418 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LD+SKAYDRVEW FL+ M+ +LG W+ +M CVST S+S+ +NG Sbjct: 419 KTRALALKLDVSKAYDRVEWDFLKGMMIRLGFPEEWINRVMSCVSTPSFSVHINGRAFGN 478 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 P RGLRQGDPLSPYLFL+CAEGF++LL++AE G +HG ++ R AP +S+L FADDS+ Sbjct: 479 ITPSRGLRQGDPLSPYLFLLCAEGFTSLLSKAESEGRLHGVQICRRAPCISNLLFADDSL 538 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 +FCRA EV +T +K Y EASGQ IN EKS S+N SE R ++ +GVR+V Sbjct: 539 IFCRANQEEVQVITNTLKLYAEASGQCINFEKSSAYFSSNTSEGQRQQIKQALGVREV 596 >gb|PNY01158.1| ribonuclease H, partial [Trifolium pratense] Length = 1068 Score = 323 bits (829), Expect = 2e-99 Identities = 148/298 (49%), Positives = 206/298 (69%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LFFQ++W V +D I LG+LN P +N T+I LIPK K+P K +RP Sbjct: 414 GPDGLPALFFQEYWHIVGNDIIDLSLGILNEGKSPEVINKTFIALIPKCKNPSAPKQFRP 473 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV +++TK IANR+K ILPD++ E QSAF+ GRLITDNA++A E F Sbjct: 474 ISLCNVTMKIVTKTIANRIKSILPDVVDEEQSAFVKGRLITDNALIAMECFHWLKKKVKG 533 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDM+KAYDR+EW F++ +L +G +++LIM C+STVSY IL+NG P Sbjct: 534 KRGTMALKLDMAKAYDRMEWAFIRGVLHNMGFPERFLQLIMSCISTVSYQILINGQPSSS 593 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F+P RG+RQGDPLSPY+F++CA S LL++ ++ IHG +V RNAP ++HL FADDS+ Sbjct: 594 FSPNRGIRQGDPLSPYIFILCANVLSGLLHKEAQSQAIHGIQVARNAPKITHLLFADDSL 653 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA +E + + Q++ Y+ ASGQ ++ EKSE+S S NV ++ +N +CN++ V+ V Sbjct: 654 LFARANQNEANTIIQVLNKYQLASGQMVSYEKSEVSFSRNVPDIEKNIICNKIEVKAV 711 >ref|XP_023874626.1| uncharacterized protein LOC111987155 [Quercus suber] Length = 1179 Score = 324 bits (831), Expect = 6e-99 
Identities = 155/283 (54%), Positives = 202/283 (71%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDGM P+F+QK+W+ V D I VL VLN+ V P +N TYI LIPK+ SP+ ++RP Sbjct: 255 GPDGMSPIFYQKYWEIVDPDVIECVLAVLNSGVLPCGINETYICLIPKVHSPQKITEFRP 314 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV++++I+KV+ANRLK +L ++I E+QSAF+PGR I DN +VAFE Sbjct: 315 ISLCNVVYKIISKVLANRLKGVLKEVIDESQSAFVPGRSIIDNVLVAFETMHYIGQRKKG 374 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDRVEW +L+A+L KLG + W+ L+M CVSTV+YS+L+NG P K Sbjct: 375 KEALMAVKLDMSKAYDRVEWSYLEAILRKLGFHEKWIALMMMCVSTVTYSVLVNGEPKGK 434 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 P RGLRQGDP+SPYLFL+CAEG SA+L + E G++ G V+R AP VSHL FADDSI Sbjct: 435 IVPSRGLRQGDPISPYLFLLCAEGLSAMLKKEECLGNLKGVAVSRGAPRVSHLLFADDSI 494 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSE 851 +FCRA+ + D+V QI++ YE SGQ++N EK+ L S N E Sbjct: 495 IFCRASVEDCDRVIQILEDYETDSGQKLNKEKTSLFFSKNTKE 537 >gb|POF03084.1| line-1 retrotransposable element orf2 protein [Quercus suber] Length = 786 Score = 314 bits (805), Expect = 5e-98 Identities = 149/298 (50%), Positives = 203/298 (68%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDGMPPLFFQ FW V D ++VL LN+ P LNHT+I LIPKI++P DYRP Sbjct: 76 GPDGMPPLFFQHFWGVVDADVTNSVLSWLNSGTIPHPLNHTFITLIPKIQNPVNVSDYRP 135 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV++++ +KV+ANRLK ILP IISE+QSAF RLI+DN +VA+E Sbjct: 136 ISLCNVLYKIFSKVLANRLKRILPSIISEHQSAFTKNRLISDNILVAYESLHYMNNMRTG 195 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDRVEW +L+ ++ KLG W+ LIM C+ TV+YSIL+NG P Sbjct: 196 KTGYMAVKLDMSKAYDRVEWVYLENVMRKLGFCERWIGLIMVCIKTVTYSILVNGEPQGL 255 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 P RG+RQGDPLSP+LFL+C EG L+ + R G++ G ++RN P ++HL FADDS+ Sbjct: 256 
IQPTRGIRQGDPLSPFLFLLCTEGLHGLIQHSVRMGELKGLSISRNGPKLTHLLFADDSL 315 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LFCR+T E K+ +I++ YEE SGQ++N K+ + S + SE ++ E+ +G++++ Sbjct: 316 LFCRSTIEECRKILEILEIYEEGSGQKVNKNKTAIFFSQSTSEATKVEIKEALGLQEI 373 >ref|XP_010682492.1| PREDICTED: uncharacterized protein LOC104897331 [Beta vulgaris subsp. vulgaris] Length = 1212 Score = 322 bits (824), Expect = 8e-98 Identities = 167/298 (56%), Positives = 206/298 (69%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 G DGM LF+QKFW V D I V +++VD +LN T I LIPK ++P D+RP Sbjct: 299 GVDGMHALFYQKFWSVVGDDVIDFVQQWWDSRVDLQSLNATCITLIPKCQNPIQMGDFRP 358 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+++VI+KV+ANRL++ILPD+IS QSAF+PGRLITDNAM+A+EIF Sbjct: 359 ISLCNVLYKVISKVMANRLEVILPDLISPYQSAFVPGRLITDNAMIAYEIFHYMKRSGDS 418 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDRVEW FL+ ++ K+G +WV IM C+S+VSY+ LNG Sbjct: 419 KTGSMAFKLDMSKAYDRVEWSFLEQVMRKMGFCDSWVRRIMVCLSSVSYAFKLNGKVTGN 478 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 P RGLRQGDPLSPYLFL+CAE FS LL +A +G IHG RV R+AP +SHLFFADDSI Sbjct: 479 IIPSRGLRQGDPLSPYLFLLCAEAFSTLLAKASDDGRIHGARVCRSAPRISHLFFADDSI 538 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RAT E V II YE ASGQ+IN KSE+S S NV + R E+ + +GVR+V Sbjct: 539 LFTRATLQECSVVADIISVYERASGQKINFNKSEVSFSKNVDDSRRVEIRSMLGVREV 596 >ref|XP_023881891.1| uncharacterized protein LOC111994244 [Quercus suber] Length = 1198 Score = 321 bits (822), Expect = 1e-97 Identities = 151/297 (50%), Positives = 206/297 (69%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDGM +FFQK+W V +D + VL VLN+ + +N T I L+PKIK+P D+RP Sbjct: 287 GPDGMSAIFFQKYWNIVGNDIVCMVLDVLNSNMSMVEINKTNITLVPKIKNPTKMSDFRP 346 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV++++I+KV+ANRLK ILP IISENQSAF+ 
GRLITDN +VAFE+ Sbjct: 347 ISLCNVVYKLISKVLANRLKNILPQIISENQSAFLSGRLITDNVLVAFELMHYLEHKKEG 406 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDRVEW F++ ++EK+G + W++L+M C+++VSYSIL+NG Sbjct: 407 KEGFAAIKLDMSKAYDRVEWGFIKQVMEKMGFHEKWIKLVMHCITSVSYSILVNGGAYGS 466 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 P RGLRQGDP+SPY+FL+CA+GFS+LLN+ R I G + R P ++HLFFADDS+ Sbjct: 467 ITPTRGLRQGDPISPYIFLLCADGFSSLLNDVARKLRISGVSICRGCPKITHLFFADDSL 526 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQ 893 LFC+A + E + I++ YE+ASGQ+IN++KS + S N + R E+ +G Q Sbjct: 527 LFCKANSQECQTLIDILQLYEDASGQKINVDKSSVFFSNNTPDEKRCEVLRMLGHMQ 583 >ref|XP_023919081.1| uncharacterized protein LOC112030645 [Quercus suber] Length = 838 Score = 314 bits (804), Expect = 2e-97 Identities = 148/298 (49%), Positives = 203/298 (68%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDGMPPLFFQ FW V+ D ++VL LN+ P LNHT+I LIPKI++P DYRP Sbjct: 76 GPDGMPPLFFQHFWGVVEDDVTNSVLSWLNSGTIPHPLNHTFITLIPKIQNPVNVSDYRP 135 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV++++ +KV+ANRLK ILP IISE+QSAF RLI+DN +VA+E Sbjct: 136 ISLCNVLYKIFSKVLANRLKRILPSIISEHQSAFTKNRLISDNILVAYESLHYMNNMRTG 195 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDMSKAYDRVEW +L+ ++ KLG W+ LIM C+ TV+YSIL+NG P Sbjct: 196 KTGYMAVKLDMSKAYDRVEWVYLENVMRKLGFCERWIGLIMVCIKTVTYSILVNGEPQGL 255 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 P RG+RQGDPLSP+LFL+C EG L+ + R G++ G ++RN P ++HL FADDS+ Sbjct: 256 IQPTRGIRQGDPLSPFLFLLCTEGLHGLIQHSVRMGELRGLSISRNGPKLTHLLFADDSL 315 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LFCR+T E K+ +I++ YEE SGQ++N K+ + S + E ++ E+ +G++++ Sbjct: 316 LFCRSTIEECRKILEILEIYEEGSGQKVNKNKTAIFFSQSTPEATKVEIKEALGLQEI 373 >dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subterraneum] Length = 1626 Score = 323 bits 
(828), Expect = 3e-97 Identities = 146/298 (48%), Positives = 207/298 (69%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LFFQK+W V D VL +LN+ P +LN T++ LIPK K+P + +RP Sbjct: 716 GPDGLPALFFQKYWYIVGKDICEKVLSILNDNCSPESLNRTFVALIPKCKNPGSPNHFRP 775 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV+ +++TK +ANR+K ILP+++ E QSAF+ GRLITDNA++A E F Sbjct: 776 ISLCNVMMKMVTKCVANRMKCILPEVVDEEQSAFVKGRLITDNALIAMECFHWLKKKTKG 835 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 LDM+KAYDR+EWPF+++ L+ +G V LIM C+STVSY +L+NG P Sbjct: 836 KKGMMALKLDMAKAYDRMEWPFIRSALQAIGFPLTMVNLIMNCISTVSYQLLVNGQPSRS 895 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F P RG+RQGDPLSPY+F++CA S+++++ RN +IHG +V RNAP ++HL FADDS+ Sbjct: 896 FKPERGIRQGDPLSPYVFILCANVLSSMIHKGARNNEIHGIQVARNAPKITHLLFADDSL 955 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 LF RA E + + I++ Y+ ASGQ ++LEKSE+S S N+ + +N +CN++ V+ V Sbjct: 956 LFARANQKESEVILNILQTYQTASGQLVSLEKSEVSFSRNLPTIEKNMICNKIDVKAV 1013 >gb|PNX95452.1| ribonuclease H, partial [Trifolium pratense] Length = 614 Score = 307 bits (787), Expect = 4e-97 Identities = 154/298 (51%), Positives = 196/298 (65%) Frame = +3 Query: 3 GPDGMPPLFFQKFWKFVKHDFISTVLGVLNNKVDPSNLNHTYIVLIPKIKSPETTKDYRP 182 GPDG+P LF+ +W V D L VLN+K NHTYI LIPK K+P D+RP Sbjct: 134 GPDGLPALFYHTYWDIVGSDVTDAALHVLNHKGSSKPYNHTYICLIPKKKNPTHPSDFRP 193 Query: 183 ISLCNVIFRVITKVIANRLKLILPDIISENQSAFIPGRLITDNAMVAFEIFXXXXXXXXX 362 ISLCNV ++ITK IANRLK ILPD+IS NQSAF+PGRLITDN +VA+E+F Sbjct: 194 ISLCNVTLKIITKTIANRLKSILPDVISLNQSAFVPGRLITDNTLVAYEVFHHFNQSNSR 253 Query: 363 XXXXXXXXLDMSKAYDRVEWPFLQAMLEKLGMNRNWVELIMRCVSTVSYSILLNGSPLDK 542 DM+KAYDR+EW FL+ LE +G + IM CV+ V++SIL+NG+P Sbjct: 254 KGFMGIKT-DMAKAYDRMEWNFLKTTLESMGFPHQLTDTIMECVTNVTFSILINGTPSQP 312 Query: 543 FNPGRGLRQGDPLSPYLFLICAEGFSALLNEAERNGDIHGFRVTRNAPSVSHLFFADDSI 722 F+P RGLRQGDPLSPYLF+ICA S L+++A+ IHG RV AP VSHL 
FADDS+ Sbjct: 313 FSPQRGLRQGDPLSPYLFIICANVLSGLISKAQSMKKIHGIRVAHGAPEVSHLLFADDSL 372 Query: 723 LFCRATTHEVDKVTQIIKAYEEASGQQINLEKSELSASANVSEVSRNELCNRMGVRQV 896 FCRAT E + II Y+EASGQ +N++KSE+ S + S+ EL N + +++V Sbjct: 373 FFCRATKEEAKVIKNIINEYQEASGQLVNIDKSEIMFSKYTPQNSKTELHNILPMKRV 430