BLASTX nr result
ID: Astragalus23_contig00012787
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00012787 (1849 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|KYP51324.1| Putative ribonuclease H protein At1g65750 family,...   77   2e-34
gb|PNX59423.1| ribonuclease H, partial [Trifolium pratense]          134   1e-31
gb|KYP50883.1| LINE-1 reverse transcriptase isogeny, partial [Ca...  131   2e-31
gb|PNX92765.1| ribonuclease H [Trifolium pratense]                   131   2e-31
gb|PNY06444.1| ribonuclease H [Trifolium pratense]                    71   3e-31
ref|XP_015939884.1| uncharacterized protein LOC107465419 [Arachi...  139   9e-31
gb|KYP56191.1| LINE-1 reverse transcriptase isogeny, partial [Ca...  126   3e-30
dbj|GAU49954.1| hypothetical protein TSUD_180180 [Trifolium subt...  138   3e-30
dbj|GAU50504.1| hypothetical protein TSUD_409790 [Trifolium subt...  137   5e-30
gb|PNX71724.1| ribonuclease H, partial [Trifolium pratense]          135   9e-30
ref|XP_020999538.1| uncharacterized protein LOC110281546 [Arachi...  131   4e-29
gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptas...  134   5e-29
gb|KYP64034.1| Retrovirus-related Pol polyprotein LINE-1, partia...  129   6e-29
ref|XP_020981606.1| uncharacterized protein LOC110273076 [Arachi...  134   8e-29
gb|PNX92520.1| ribonuclease H, partial [Trifolium pratense]          133   8e-29
gb|PNX59952.1| ribonuclease H, partial [Trifolium pratense]          124   9e-29
gb|PNX92714.1| ribonuclease H [Trifolium pratense]                   133   1e-28
ref|XP_021666740.1| uncharacterized protein LOC110654914 [Hevea ...
132 1e-28 dbj|GAU41508.1| hypothetical protein TSUD_302460 [Trifolium subt... 132 2e-28 dbj|GAU37589.1| hypothetical protein TSUD_365100 [Trifolium subt... 131 2e-28 >gb|KYP51324.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 483 Score = 76.6 bits (187), Expect(5) = 2e-34 Identities = 52/138 (37%), Positives = 71/138 (51%), Gaps = 1/138 (0%) Frame = +1 Query: 1234 GTQLLSR*TPPHIGYLKLHVDGSCS-NGIIGSEGLVRDD*GHWLGGFSSNDDQGDVLLDE 1410 GT + +PP +LKL++DGSCS NG +G+ GLVR+ G WL FSSN+ QGD L E Sbjct: 314 GTHSKPKWSPPLAPFLKLNIDGSCSRNGQMGTGGLVRNQQGEWLTVFSSNEGQGDAPLAE 373 Query: 1411 LFVICHGLSLLLYIGKTRAM*IXXXXXXXXXXXXETSHASHAYASLLMKITSSMDCFQDL 1590 L + +GL + G H +A ++++I M+ Sbjct: 374 LLALRNGLEVAWGCGYREIKCECDALDVVNVVMGLLDLNFHPHARVVLQIRMLMNRAWSC 433 Query: 1591 ALLHVFREANTSADWLAK 1644 L HV REAN+SAD LAK Sbjct: 434 HLAHVPREANSSADILAK 451 Score = 71.6 bits (174), Expect(5) = 2e-34 Identities = 39/115 (33%), Positives = 61/115 (53%) Frame = +3 Query: 342 LFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLKESFKY 521 L GKL W + +D L V +L H YL+ ++ I S IWR+I++A ++K+ F + Sbjct: 4 LLGKLAWKLCINEDCLWVQLLKHKYLQGKSMFTIKARPGDSCIWRNIVQAFVSIKDGFYF 63 Query: 522 RIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDACSASPWNYDSFMTSVP 686 ++G ++SF F D G G L + V ++ SD L KD WN T++P Sbjct: 64 KLGAGDTSFWFDDWLGFGPLSNRVGLIHPSDILLNVKDVILNDNWNLSRLQTNLP 118 Score = 31.2 bits (69), Expect(5) = 2e-34 Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 4/54 (7%) Frame = +2 Query: 935 NMRGSSCCSYCNFWMMEYILHCLCDCPYGMKIWKHFCF----CFAGTNVYHY*E 1084 ++ S+ C C + ILHCL DC + ++WK F F TNV + E Sbjct: 206 HLAASASCPCC-LTRDKDILHCLRDCHHAREVWKRLGFSTLPAFNATNVVSWIE 258 Score = 28.9 bits (63), Expect(5) = 2e-34 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 1/46 (2%) Frame = +1 Query: 751 DKWVWGAATFGLY*TN*TYQWLIAKHRTW-GDDDWSWIWKLMTDKK 885 D W W GLY Y++L+ + T G W +WKL +K Sbjct: 138 DNWCWRPNGTGLYTAATAYKFLLCDNITGVGRGRWKTLWKLSIPEK 183 Score = 28.9 bits (63), Expect(5) = 2e-34 Identities = 10/30 
(33%), Positives = 14/30 (46%) Frame = +3 Query: 1092 AWVRKMASSYSIRLFLAFLWWLWCRRNNMV 1181 +W+ M LF WW+W RNN + Sbjct: 255 SWIESMIYGDHSTLFSVAAWWIWKWRNNYI 284 >gb|PNX59423.1| ribonuclease H, partial [Trifolium pratense] Length = 277 Score = 134 bits (337), Expect = 1e-31 Identities = 68/129 (52%), Positives = 91/129 (70%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 K+DLEKA D+++++FL++ L FGFP I LIM CV +SNLS+L NG K +FK S GL Sbjct: 14 KLDLEKAFDNVNWDFLKTCLHDFGFPDDIIRLIMHCVTSSNLSLLWNGNKMPSFKPSHGL 73 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGKLVW 362 QGD SP+LF+LCMEKL+++I D V+ G W P+++ +GP LSHL F DDV LF K Sbjct: 74 RQGDPLSPYLFILCMEKLSIAINDAVQHGAWTPINISDNGPRLSHLFFADDVLLFTKAKN 133 Query: 363 SILHYKDKL 389 S L + + L Sbjct: 134 SQLRFINDL 142 >gb|KYP50883.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan] Length = 191 Score = 131 bits (329), Expect = 2e-31 Identities = 67/117 (57%), Positives = 84/117 (71%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDLEKA D +++ FL TL FGFP+ I LIM V +++LSIL NG+K ++F RGL Sbjct: 70 KIDLEKAYDRVNWEFLRCTLHDFGFPLKIINLIMWGVTSASLSILWNGSKLHSFTPHRGL 129 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LF+L MEKLALSIQ V+ VW+P+HV + GP + HLLF DDV LF K Sbjct: 130 RQGDPLSPYLFILYMEKLALSIQQLVDNNVWQPIHVSRGGPGIIHLLFADDVLLFAK 186 >gb|PNX92765.1| ribonuclease H [Trifolium pratense] Length = 1310 Score = 60.1 bits (144), Expect(6) = 2e-31 Identities = 42/113 (37%), Positives = 54/113 (47%), Gaps = 1/113 (0%) Frame = +3 Query: 342 LFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTN-ASLIWRSILKAVETLKESFK 518 + GKLVW I K V +L YL++ P I T S IW SI KA L + FK Sbjct: 825 MLGKLVWDIQQNSPKPWVLMLRSKYLQHQPF--INAPTQPGSPIWNSISKAKMILADGFK 882 Query: 519 YRIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDACSASPWNYDSFMT 677 YR+ D S F + LLG VD+V I D+Q+ KD W+ + T Sbjct: 883 YRVSDGSSLFWYSPWLSHKLLGTEVDYVAIQDSQIRIKDIYFNDSWHLNLLYT 935 Score = 53.9 bits (128), Expect(6) 
= 2e-31 Identities = 41/141 (29%), Positives = 65/141 (46%), Gaps = 2/141 (1%) Frame = +1 Query: 1270 IGYLKLHVDGSC--SNGIIGSEGLVRDD*GHWLGGFSSNDDQGDVLLDELFVICHGLSLL 1443 +G++ L+VDGSC +G G GL+R+ G W+ GFS + EL + HGL + Sbjct: 1151 VGFV-LNVDGSCLGDSGRAGFGGLIREGDGSWIIGFSGFLGISNNTFAELMAVFHGLKIA 1209 Query: 1444 LYIGKTRAM*IXXXXXXXXXXXXETSHASHAYASLLMKITSSMDCFQDLALLHVFREANT 1623 G R + ++ H YA+++ I + ++ L H RE N Sbjct: 1210 RERGYRRIH-CYSDSQTVVDAISKDLNSFHRYAAVIASIKDLLQLDWEVRLSHSLREGNA 1268 Query: 1624 SADWLAKLVNRIVGRLPFGTS 1686 AD+LAK+ + +L F S Sbjct: 1269 GADFLAKIGSANDDKLTFWES 1289 Score = 43.1 bits (100), Expect(6) = 2e-31 Identities = 22/66 (33%), Positives = 31/66 (46%), Gaps = 6/66 (9%) Frame = +1 Query: 751 DKWVWGAATFGLY*TN*TYQWLIA-KHRTWGDDDWSWIWKLMTDKKYNILFGCVC----- 912 D ++W G+Y + Y+WL+ K+ W+WIWKL +K L C C Sbjct: 958 DCFIWQGNIDGIYNASSGYKWLLQQKYNVPSIQSWNWIWKLQAPEKIKFLIWCACHHSIP 1017 Query: 913 TML*LH 930 TM LH Sbjct: 1018 TMYMLH 1023 Score = 32.0 bits (71), Expect(6) = 2e-31 Identities = 14/34 (41%), Positives = 20/34 (58%) Frame = +2 Query: 935 NMRGSSCCSYCNFWMMEYILHCLCDCPYGMKIWK 1036 NM SS C+ C+ E ++HCL DC +IW+ Sbjct: 1026 NMTSSSICTRCSN-NEETVIHCLRDCTTAKRIWE 1058 Score = 31.2 bits (69), Expect(6) = 2e-31 Identities = 13/46 (28%), Positives = 25/46 (54%) Frame = +3 Query: 1062 LMFTTIRSASAWVRKMASSYSIRLFLAFLWWLWCRRNNMVLRDTSV 1199 + F + + +W++ + + LFLA LWW W RN + + + S+ Sbjct: 1065 ISFFSSNNLESWLKIHSYGPAANLFLAGLWWNWRARNIVCVGNESI 1110 Score = 25.4 bits (54), Expect(6) = 2e-31 Identities = 9/23 (39%), Positives = 15/23 (65%) Frame = +3 Query: 1665 EITFWDFPPSKFEFVLASRCLGI 1733 ++TFW+ PP + E +L S L + Sbjct: 1283 KLTFWESPPVEMESILQSDALRV 1305 Score = 131 bits (329), Expect = 6e-28 Identities = 65/121 (53%), Positives = 86/121 (71%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDLEKA D++D+++L S L+ FGFP TI LIM CV +S+LS++ NG + F +RGL Sbjct: 535 KIDLEKAYDNVDWSYLRSCLRDFGFPPITIKLIMHCVSSSSLSLIWNGNRLPNFSPTRGL 594 Query: 183 
*QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGKLVW 362 QGD SP+LFV+CMEKL+L+I + V+ W+P+ V K+GP SHL F DDV LF K Sbjct: 595 RQGDPLSPYLFVICMEKLSLAIVEAVQDNCWKPIRVSKNGPCFSHLFFADDVLLFSKATC 654 Query: 363 S 365 S Sbjct: 655 S 655 >gb|PNY06444.1| ribonuclease H [Trifolium pratense] Length = 547 Score = 71.2 bits (173), Expect(5) = 3e-31 Identities = 45/120 (37%), Positives = 60/120 (50%) Frame = +3 Query: 327 TDDVFLFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLK 506 + + L GKLVW +L DKL V + YLK + + T S+IW ++ KA++ LK Sbjct: 55 SQNTVLLGKLVWELLQNPDKLWVNLFNDRYLKGQLPFNV-KVTGGSVIWNAVAKAMQLLK 113 Query: 507 ESFKYRIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDACSASPWNYDSFMTSVP 686 E F +IGD ESSF F L +V V I DT + KD WN S T +P Sbjct: 114 EGFTLKIGDGESSFWFDPWVLKERLCSVVPVVAIQDTDMKIKDVWMNGMWNLQSLYTPLP 173 Score = 52.8 bits (125), Expect(5) = 3e-31 Identities = 42/130 (32%), Positives = 59/130 (45%), Gaps = 2/130 (1%) Frame = +1 Query: 1264 PHIGYLKLHVDGSC--SNGIIGSEGLVRDD*GHWLGGFSSNDDQGDVLLDELFVICHGLS 1437 P G++ L+VDGS S G GL+RD+ G +L GF +LL EL I HGL Sbjct: 385 PVEGFVCLNVDGSLLGSTNTAGYGGLLRDNNGVFLLGFYGAVTVPSILLAELMAILHGLQ 444 Query: 1438 LLLYIGKTRAM*IXXXXXXXXXXXXETSHASHAYASLLMKITSSMDCFQDLALLHVFREA 1617 + G R S A H +A+ ++ I +D ++ + H RE Sbjct: 445 ICWENGYRRITCFSDSLQAVNLIRDGVS-AHHRFANEVVSIRQLLDRDWEIVVKHTLREG 503 Query: 1618 NTSADWLAKL 1647 N AD LAK+ Sbjct: 504 NACADVLAKM 513 Score = 41.6 bits (96), Expect(5) = 3e-31 Identities = 16/44 (36%), Positives = 22/44 (50%) Frame = +1 Query: 739 THIPDKWVWGAATFGLY*TN*TYQWLIAKHRTWGDDDWSWIWKL 870 +++PD W W AT G+Y Y WL +W WIW+L Sbjct: 189 SNLPDVWTWSNATSGVYSVKDAYNWLRKPTPLQDHVNWQWIWQL 232 Score = 33.5 bits (75), Expect(5) = 3e-31 Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 6/43 (13%) Frame = +3 Query: 1095 WVRKMASSYSIRLFLAFLWWLWCRRNNMVLRD------TSVTK 1205 W R M Y LF +W +WC RN+++ + TSVTK Sbjct: 311 WYRNMGKKYGT-LFFVTIWVVWCSRNDVIFNNSNDNIHTSVTK 352 Score = 26.9 bits (58), Expect(5) = 3e-31 Identities = 12/30 (40%), Positives = 15/30 (50%) Frame = 
+2 Query: 947 SSCCSYCNFWMMEYILHCLCDCPYGMKIWK 1036 +S C C E I HCL CP +IW+ Sbjct: 264 TSVCPRCTS-TAESIEHCLFSCPASARIWR 292 >ref|XP_015939884.1| uncharacterized protein LOC107465419 [Arachis duranensis] Length = 694 Score = 139 bits (349), Expect = 9e-31 Identities = 70/115 (60%), Positives = 86/115 (74%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDLEKA D +D+ FLESTL FGFP+ T+ LIM+CVRAS+LSI+ NG + +F RGL Sbjct: 89 KIDLEKAYDRVDWRFLESTLIAFGFPIITVNLIMNCVRASSLSIMWNGNRLDSFAPRRGL 148 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLF 347 QGD SP+LFVLCME+LA I KV +GVW+PV V + GP SHL+F DD+ LF Sbjct: 149 RQGDPMSPYLFVLCMERLACYISHKVVEGVWKPVSVTRSGPKFSHLMFADDLLLF 203 Score = 58.2 bits (139), Expect(3) = 3e-16 Identities = 35/102 (34%), Positives = 53/102 (51%) Frame = +3 Query: 381 DKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLKESFKYRIGDVESSF*FID 560 DK V +L YL+N + NAS +W+SI KA LK++F + IG ++ SF F + Sbjct: 356 DKPWVALLRAKYLRNEGVLDGHVPCNASHVWKSISKAFGALKDAFSWCIGSLDQSFWFDN 415 Query: 561 *TGMGLLGHLVDFVNISDTQLIFKDACSASPWNYDSFMTSVP 686 + G + V FV+ISD+ L +D WN + +P Sbjct: 416 WSIEGPIAQDVPFVHISDSDLTIRDVWKDGQWNLHDIFSIIP 457 Score = 49.7 bits (117), Expect(3) = 3e-16 Identities = 25/71 (35%), Positives = 36/71 (50%), Gaps = 3/71 (4%) Frame = +1 Query: 694 VKQMVDMYPPPSNPDTHIPDK--WVWGAATFGLY*TN*TYQWLIAKHRTWGD-DDWSWIW 864 VKQ ++ Y NPD + + W WG A+ LY Y WL + W + D+W W+W Sbjct: 460 VKQRLNAY----NPDLNAGESSGWSWGVASSRLYSARSGYSWLAKRKFDWNEHDNWLWVW 515 Query: 865 KLMTDKKYNIL 897 +L +KY L Sbjct: 516 RLHIPEKYKFL 526 Score = 27.7 bits (60), Expect(3) = 3e-16 Identities = 12/29 (41%), Positives = 16/29 (55%) Frame = +2 Query: 947 SSCCSYCNFWMMEYILHCLCDCPYGMKIW 1033 S+ C C E ILHCL +CP ++W Sbjct: 549 SNTCHRCQNGS-ESILHCLQECPSAKEVW 576 >gb|KYP56191.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan] gb|KYP56197.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan] Length = 148 Score = 126 bits (316), Expect = 3e-30 Identities = 65/117 (55%), 
Positives = 83/117 (70%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDLEKA D + +NFL++ L KFGFP I LIM V S+L++L NG+K F SRGL Sbjct: 26 KIDLEKAYDQISWNFLQAKLWKFGFPERIIKLIMWGVTNSSLTLLWNGSKLPPFAPSRGL 85 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LFVLCME+LAL I + ++G W P+H+ + GP +SHLLF DDV LF + Sbjct: 86 RQGDPLSPYLFVLCMERLALRINELNKEGCWNPIHLSQGGPPISHLLFADDVILFSQ 142 >dbj|GAU49954.1| hypothetical protein TSUD_180180 [Trifolium subterraneum] Length = 968 Score = 138 bits (347), Expect = 3e-30 Identities = 68/121 (56%), Positives = 87/121 (71%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 K+DLEKA D++++ FL+ L FGFP TI LIM CV +SNLS+L NG K +FK S GL Sbjct: 346 KLDLEKAFDNVNWEFLKICLHDFGFPDDTIRLIMHCVTSSNLSLLWNGNKMPSFKPSHGL 405 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGKLVW 362 QGD SP+LF+LCMEKL+++I D V+ W P+H+ +GP LSHLLF DDV LF K + Sbjct: 406 RQGDPLSPYLFILCMEKLSIAINDAVQHNAWTPIHILNNGPRLSHLLFADDVLLFTKAKY 465 Query: 363 S 365 S Sbjct: 466 S 466 >dbj|GAU50504.1| hypothetical protein TSUD_409790 [Trifolium subterraneum] Length = 902 Score = 137 bits (345), Expect = 5e-30 Identities = 69/125 (55%), Positives = 89/125 (71%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 K+DLEKA D++++ FL+S L FGFP TI LIM CV +SN S+L NG K FK S GL Sbjct: 302 KLDLEKAFDNVNWEFLKSCLHDFGFPDTTIQLIMHCVTSSNFSLLWNGNKMPHFKSSHGL 361 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGKLVW 362 QGD SP+LF+LCMEKL+++I + V++G W P+H+ +GP LSHLLF DDV LF K Sbjct: 362 RQGDPLSPYLFILCMEKLSIAINNAVQQGNWAPIHISDNGPRLSHLLFADDVLLFSKAKN 421 Query: 363 SILHY 377 S L + Sbjct: 422 SQLRF 426 Score = 77.4 bits (189), Expect(4) = 2e-21 Identities = 53/172 (30%), Positives = 83/172 (48%), Gaps = 7/172 (4%) Frame = +3 Query: 195 SWSPHLFVLCMEKLALSI--QDKVEKGV----WRPVHVYKDGPSLS-HLLFTDDVFLFGK 353 SW P +++ + +D KG+ W+ + K LS ++ L GK Sbjct: 536 
SWLPQSICDSIDQTTRNFIWRDSNNKGIHLVSWKKIARPKQHGGLSIRTARGQNISLLGK 595 Query: 354 LVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLKESFKYRIGD 533 LVW ++ +KL V +L Y+K + GN+++ S W SI++A LK+ F +R G Sbjct: 596 LVWDMVQSSNKLWVDLLSSKYVKGSTFLLSGNNSSGSPTWSSIIQAKNILKDGFSWRAGS 655 Query: 534 VESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDACSASPWNYDSFMTSVPP 689 SSF T +G LG LV +++I D QL KD S + + T +PP Sbjct: 656 GTSSFWSSHWTTLGQLGALVPYIDIHDLQLSIKDVLSTNSPHTHILYTQLPP 707 Score = 38.5 bits (88), Expect(4) = 2e-21 Identities = 19/60 (31%), Positives = 26/60 (43%), Gaps = 4/60 (6%) Frame = +1 Query: 745 IPDKWVWGAATFGLY*TN*TYQWLIAKHRTWGDDD----WSWIWKLMTDKKYNILFGCVC 912 I D ++W + G Y T Y WL++ + WSWIWKL +K F C Sbjct: 724 IDDAFIWTSNKNGSYTTKSGYNWLLSLQNLVTPHNPSLSWSWIWKLQLPEKIKFFFWLAC 783 Score = 32.3 bits (72), Expect(4) = 2e-21 Identities = 14/37 (37%), Positives = 19/37 (51%) Frame = +2 Query: 938 MRGSSCCSYCNFWMMEYILHCLCDCPYGMKIWKHFCF 1048 M S+ C+ C E LHC+ DC + + IW H F Sbjct: 798 MNLSATCARCGL-REETFLHCVRDCDFSISIWHHIGF 833 Score = 24.6 bits (52), Expect(4) = 2e-21 Identities = 11/41 (26%), Positives = 19/41 (46%) Frame = +3 Query: 1068 FTTIRSASAWVRKMASSYSIRLFLAFLWWLWCRRNNMVLRD 1190 F + A W++ ++ +F A +WW W N M L + Sbjct: 838 FFSSMDAHDWLKWGSTGSKAFIFSAGVWWSWRHCNLMCLNN 878 >gb|PNX71724.1| ribonuclease H, partial [Trifolium pratense] Length = 585 Score = 135 bits (339), Expect = 9e-30 Identities = 67/117 (57%), Positives = 88/117 (75%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 K+DLEKA D+++++FL+++LQ FGFP TI LIM CV +SNLSIL NG + +FK + GL Sbjct: 114 KLDLEKAFDNVNWDFLKNSLQDFGFPDITIRLIMHCVTSSNLSILWNGNQMPSFKPTHGL 173 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LF++CMEKL+++I + V W PVH+ DGP LSHLLF DDV LF K Sbjct: 174 RQGDPLSPYLFIICMEKLSIAIHNAVLNKTWDPVHISNDGPHLSHLLFADDVLLFTK 230 Score = 76.6 bits (187), Expect(2) = 3e-14 Identities = 43/115 (37%), Positives = 60/115 (52%) Frame = +3 Query: 342 
LFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLKESFKY 521 L GKLVW IL KDKL V + H Y I +H +S W +I++A L+E F + Sbjct: 404 LLGKLVWDILQSKDKLWVNIFSHRYDAGVKILHAIBHQGSSSTWSAIIRAKTVLREGFTW 463 Query: 522 RIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDACSASPWNYDSFMTSVP 686 R G SSF F T +G LV +++I D QL KD S++ + T++P Sbjct: 464 RAGSGSSSFWFCPWTVLGCFSKLVPYIDIHDLQLTVKDVISSNNPHSQILYTNLP 518 Score = 32.3 bits (72), Expect(2) = 3e-14 Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 2/49 (4%) Frame = +1 Query: 745 IPDKWVWGAATFGLY*TN*TYQWLIA--KHRTWGDDDWSWIWKLMTDKK 885 I D ++W G+Y Y W++ + WSWIWKL +K Sbjct: 536 IEDTFIWSNNKNGVYTAKSGYDWILTCTEQVQPSPHTWSWIWKLKVPEK 584 >ref|XP_020999538.1| uncharacterized protein LOC110281546 [Arachis duranensis] Length = 474 Score = 131 bits (330), Expect = 4e-29 Identities = 66/117 (56%), Positives = 84/117 (71%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDLEKA + +D+ FL TL+ F FP+PTI LIM+CV AS+LSIL NG + F SRGL Sbjct: 316 KIDLEKAYNRVDWRFLAHTLKSFDFPIPTINLIMNCVTASSLSILWNGNRLNGFTPSRGL 375 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LFVLCME+LA I V+ G+W P+ + + GP +SHL+F DD+ LF K Sbjct: 376 RQGDPMSPYLFVLCMERLACFISHHVDLGLWEPIAISRGGPRISHLMFVDDLLLFCK 432 >gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H [Medicago truncatula] Length = 1296 Score = 134 bits (338), Expect = 5e-29 Identities = 67/125 (53%), Positives = 88/125 (70%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 K+DLEKA D+++++FL S L FGFP + LIM CV ++N S+L NG K FK + GL Sbjct: 576 KLDLEKAFDNVNWDFLNSCLLDFGFPDIIVKLIMHCVSSANYSLLWNGNKMPPFKPTHGL 635 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGKLVW 362 QGD SP+LF+LCMEKL+++IQD V +G W P+H+ DGP +SHLLF DDV LF K Sbjct: 636 RQGDPLSPYLFILCMEKLSVAIQDAVLQGSWEPIHIINDGPQISHLLFADDVLLFTKAKS 695 Query: 363 SILHY 377 S L + Sbjct: 696 SQLQF 700 Score = 62.8 bits (151), Expect = 2e-06 Identities = 36/105 
(34%), Positives = 57/105 (54%) Frame = +3 Query: 333 DVFLFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLKES 512 ++ L GKLVW ++ +KL V +L Y T + ++N+S W SI++A + LK Sbjct: 863 NICLLGKLVWDMVQSTNKLWVNLLAKKYSSGTSLLEANVNSNSSPSWFSIIRAKDILKTG 922 Query: 513 FKYRIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDACSA 647 + +R G SSF F + + G LG LV ++I D L KD ++ Sbjct: 923 YSWRAGAGTSSFWFSNWSSHGYLGSLVPIIDIHDIHLTVKDVLTS 967 >gb|KYP64034.1| Retrovirus-related Pol polyprotein LINE-1, partial [Cajanus cajan] Length = 393 Score = 129 bits (325), Expect = 6e-29 Identities = 63/117 (53%), Positives = 84/117 (71%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 K+DLEKA D LD++FLE TL+ +GFP + LIM + +++LS+L NG K FK RGL Sbjct: 249 KLDLEKAYDRLDWDFLEQTLKLYGFPERIVSLIMHGITSTSLSLLWNGNKLDGFKPIRGL 308 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD +SP+LFVLCME L + IQ +VE+G W+P+ + + GP LSH+ F DDV LF K Sbjct: 309 CQGDPFSPYLFVLCMECLGIMIQQEVEEGTWKPIQLTRHGPKLSHIFFADDVLLFAK 365 >ref|XP_020981606.1| uncharacterized protein LOC110273076 [Arachis duranensis] Length = 1117 Score = 134 bits (336), Expect = 8e-29 Identities = 66/117 (56%), Positives = 85/117 (72%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDLEKA D +D+ FLE+TL +FGFP TI LI++CV +S+L++L NG + F RGL Sbjct: 346 KIDLEKAYDRVDWRFLEATLVRFGFPKATINLILNCVTSSSLAVLWNGNRLQNFNPKRGL 405 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LFVLCME LA I +V +G+W PV V ++GP LSHL+F DD+ LF K Sbjct: 406 RQGDPMSPYLFVLCMEMLACFISHRVSQGLWNPVAVSRNGPRLSHLMFADDLLLFCK 462 Score = 81.3 bits (199), Expect = 4e-12 Identities = 82/288 (28%), Positives = 122/288 (42%), Gaps = 10/288 (3%) Frame = +3 Query: 342 LFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLKESFKY 521 L GKLVW L+ +KL V VL H YL+N + ++S W++I+ A E LKE + Sbjct: 637 LLGKLVWDCLNNSEKLWVQVLKHKYLRNQSGMNGNSRNSSSATWKNIVSAYEHLKEGLHW 696 Query: 522 
RIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDACSASPWNYDSFMTSVPPPCE- 698 IGDV S + + T G L +LV +V+IS++ + D W DS T +P + Sbjct: 697 NIGDVHKSVWYDEWTPFGKLCNLVPYVHISESDFMVADLWKGVSWEVDSLTTPIPHEIKQ 756 Query: 699 --TNG*YVSSTL*S*Y-----AYSR*MGLGSCYLWALLD*LDLSMAYSQA*NLG***LEL 857 Y SST A S+ Y W L L+ + + Sbjct: 757 FICGLRYPSSTELEPQWEWWPAASKKYSAREGYRWLLKKALNWNANSN---------WNW 807 Query: 858 DLETHD*QKIQHFVWLCLHNALTTCVI*GAPHVAPIVIFG*WNISYIVSVTALMV*KFGS 1037 T+ +K + +WL LH+AL T H+A + N + L + Sbjct: 808 LWNTNIPEKFKFTMWLGLHDALPTETFRFKRHLASSDMCKRCNKAQETMEHCLRDCERSK 867 Query: 1038 IFVFVL--QVLMFTTIRSASAWVRKMASSYSIRLFLAFLWWLWCRRNN 1175 ++L +L TT + W RK A + + F A LWW+W R N Sbjct: 868 AIWYMLDPSILDSTTGTALEEWFRK-ALANNEASFGAGLWWVWRHRCN 914 >gb|PNX92520.1| ribonuclease H, partial [Trifolium pratense] Length = 865 Score = 133 bits (335), Expect = 8e-29 Identities = 66/121 (54%), Positives = 85/121 (70%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDLEKA D++D+N+L S L FGFP TI LIM CV +S LS++ NG + +F +RGL Sbjct: 513 KIDLEKAYDNVDWNYLRSCLHDFGFPPLTIKLIMHCVCSSTLSLIWNGQRLPSFSPTRGL 572 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGKLVW 362 QGD SP+LFVLCMEKL+L+I + V+ W+P+ + K+GP SHL F DDV LF K Sbjct: 573 RQGDPLSPYLFVLCMEKLSLAISEAVQNNSWKPIQISKNGPRFSHLFFADDVLLFSKATC 632 Query: 363 S 365 S Sbjct: 633 S 633 >gb|PNX59952.1| ribonuclease H, partial [Trifolium pratense] Length = 215 Score = 124 bits (311), Expect = 9e-29 Identities = 61/117 (52%), Positives = 83/117 (70%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 ++DLEK +++++ FL+S LQ FGFP T LIM CV +S SIL NG K FK + GL Sbjct: 30 ELDLEKTFNNVNWKFLQSCLQDFGFPDITTRLIMHCVTSSTYSILWNGNKMPPFKPTHGL 89 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LF+LCMEKL+++I + V++ W PVH++ + P +SHLLF DDV LF K Sbjct: 90 RQGDPLSPYLFILCMEKLSIAINNAVQRKAWDPVHIFDNSPRMSHLLFGDDVLLFTK 146 >gb|PNX92714.1| ribonuclease H [Trifolium pratense] Length = 1359 Score = 133 
bits (335), Expect = 1e-28 Identities = 67/125 (53%), Positives = 90/125 (72%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 K+DLEKA D++++ FL++ L FGFP T LIM CV +SNLSIL NG + +FK + GL Sbjct: 583 KLDLEKAFDNVNWTFLKNCLNDFGFPTITTNLIMHCVTSSNLSILWNGNRMPSFKPTHGL 642 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGKLVW 362 QGD SP+LF+LCMEKL+L+I + V+ G W+P+ + ++GP LSHLLF DDV LF K Sbjct: 643 RQGDPLSPYLFILCMEKLSLAINEAVDIGSWKPICISRNGPRLSHLLFADDVLLFSKAKK 702 Query: 363 SILHY 377 S L + Sbjct: 703 SQLRF 707 Score = 76.6 bits (187), Expect = 1e-10 Identities = 78/292 (26%), Positives = 121/292 (41%), Gaps = 13/292 (4%) Frame = +3 Query: 342 LFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLKESFKY 521 L GKLVW +L DKL V VL Y+ T I + + S W SI+KA L+ F + Sbjct: 873 LLGKLVWDLLQDSDKLWVRVLSDKYISGTRILS-SDTLSGSSTWNSIMKAKNVLRTGFVW 931 Query: 522 RIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDACSASPWNYDSFMTSVPP---- 689 R G SSF + + G LG +V +V+ DT L+ KD +S N T +P Sbjct: 932 RPGSGNSSFWYSHWSHFGPLGAIVPYVHYHDTALMVKDVFVSSTANLHLLYTQLPQEVVI 991 Query: 690 PCETNG*YVSSTL*S*YAYSR----*MGLGSCYLW--ALLD*LDLSMAYSQA*NLG***L 851 + +ST+ +S S YLW +L L + ++ +L Sbjct: 992 SLNSMKFSFNSTIEDTMIWSANKHGTYTTSSGYLWILSLRSQLQSNQSWKWIWSL----- 1046 Query: 852 ELDLETHD*QKIQHFVWLCLHNALTTCVI*GAPHVAPIVI---FG*WNISYIVSVTALMV 1022 H +KI+ WL HN++ T + +++ G + S++ V Sbjct: 1047 ------HLPEKIKFLFWLACHNSVPTLALLHHRNMSSSSACPRCGYFEESFLHCVRDCPT 1100 Query: 1023 *KFGSIFVFVLQVLMFTTIRSASAWVRKMASSYSIRLFLAFLWWLWCRRNNM 1178 K F F +W+++ S S+ +F A +WW W RN+M Sbjct: 1101 SK-NLWFSIGFSAPQFYDTTDNISWIKEGTSGSSLNIFAAGVWWAWRNRNSM 1151 >ref|XP_021666740.1| uncharacterized protein LOC110654914 [Hevea brasiliensis] Length = 653 Score = 132 bits (331), Expect = 1e-28 Identities = 68/117 (58%), Positives = 81/117 (69%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDL KA D +D+ FL+ TL FGFP+ I LIMSCV S LSIL NG++ F RGL Sbjct: 145 
KIDLAKAYDRVDWRFLQQTLHDFGFPVQIISLIMSCVTQSQLSILWNGSRLPAFAPMRGL 204 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LFVLCMEKL+L I+ KV W P+ + + GPS+SHLLF DDV LF K Sbjct: 205 RQGDPLSPYLFVLCMEKLSLLIETKVANHQWAPLSLSRGGPSISHLLFADDVILFSK 261 Score = 63.5 bits (153), Expect = 1e-06 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 7/149 (4%) Frame = +3 Query: 273 WRPVHVYKDGPSLS-HLLFTDDVFLFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGN 449 W V K L H +++ + GKLVWSIL +DK V VL YL ++ Sbjct: 411 WESVTSPKKAGGLGLHTARDNNIVMLGKLVWSILSNEDKPWVKVLCDRYLHGGSLFDARF 470 Query: 450 HTNASLIWRSILKAVETLKESFKYRIGDVESSF*FID-*TGMGLLGHLVDFVNISDTQLI 626 +S +WR+ILKA L+++F +RIG ++ + D G L LVD ++++D++L Sbjct: 471 SGTSSHVWRAILKASTFLRDAFMWRIGTGQNVALWEDRWLGAMPLRFLVDNISLADSELC 530 Query: 627 FKDACSASPWNYDS-----FMTSVPPPCE 698 D SAS +Y + F+T VP P + Sbjct: 531 VADIISAS-GSYSTKSAYHFITQVPYPVD 558 >dbj|GAU41508.1| hypothetical protein TSUD_302460 [Trifolium subterraneum] Length = 1075 Score = 132 bits (333), Expect = 2e-28 Identities = 66/117 (56%), Positives = 84/117 (71%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 K+DLEKA D+++++FL+S L FGFP TI LIM CV +SN S+L NG K +FK + GL Sbjct: 375 KLDLEKAFDNVNWDFLKSCLHDFGFPNITIRLIMHCVTSSNFSLLWNGNKLPSFKPTHGL 434 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LF+LCMEKL++SI V +G W P+ + GP LSHLLF DDV LF K Sbjct: 435 RQGDPLSPYLFILCMEKLSISINSVVHQGAWDPIRISNTGPHLSHLLFADDVLLFTK 491 Score = 68.9 bits (167), Expect = 3e-08 Identities = 38/98 (38%), Positives = 54/98 (55%) Frame = +3 Query: 342 LFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNASLIWRSILKAVETLKESFKY 521 L GKLVW ++ +KL V +L + Y+ I +H+ S W SI+ A LK+ F + Sbjct: 665 LLGKLVWDMVQSSNKLWVNLLSNKYVAGPEILHSNSHSTGSPTWSSIIHAKNVLKDGFSW 724 Query: 522 RIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKD 635 R G SSF F + +G LG LV +++I D QL KD Sbjct: 725 RAGSGSSSFWFSHWSPLGYLGTLVPYIDIHDLQLSVKD 762 >dbj|GAU37589.1| hypothetical protein TSUD_365100 [Trifolium 
subterraneum] Length = 648 Score = 131 bits (329), Expect = 2e-28 Identities = 64/117 (54%), Positives = 83/117 (70%) Frame = +3 Query: 3 KIDLEKACDHLDFNFLESTLQKFGFPMPTICLIMSCVRASNLSIL*NGAKTYTFKLSRGL 182 KIDLEKA D++++NFL S L FGFP T+ LIM CV +S+LS++ NG + F +RGL Sbjct: 134 KIDLEKAYDNVNWNFLRSCLHDFGFPQLTVKLIMHCVSSSSLSLIWNGKRLPNFSPTRGL 193 Query: 183 *QGDSWSPHLFVLCMEKLALSIQDKVEKGVWRPVHVYKDGPSLSHLLFTDDVFLFGK 353 QGD SP+LFVLCMEKL+L+I V+ W+P+ + K+GP SHL F DDV LF K Sbjct: 194 RQGDPLSPYLFVLCMEKLSLAISAAVQNNSWKPLQISKNGPRFSHLFFADDVLLFSK 250 Score = 61.6 bits (148), Expect = 4e-06 Identities = 47/167 (28%), Positives = 71/167 (42%) Frame = +3 Query: 282 VHVYKDGPSLSHLLFTDDVFLFGKLVWSILHYKDKLQVWVLFHNYLKNTPIWCIGNHTNA 461 + KDG L + + GKLVW I +K V ++ YL N + + Sbjct: 404 IQTKKDGGLGVRLARETNTAMLGKLVWDIQQNTNKPWVHMIKDKYLYNKMFFNTPR-SYG 462 Query: 462 SLIWRSILKAVETLKESFKYRIGDVESSF*FID*TGMGLLGHLVDFVNISDTQLIFKDAC 641 S IW SI KA + L E + +RI D SS + LLG VD+V I D+ L KD Sbjct: 463 SPIWNSISKAKQVLLEGYHFRISDGSSSLWYSPWLSKDLLGRKVDYVAIQDSHLRIKDVF 522 Query: 642 SASPWNYDSFMTSVPPPCETNG*YVSSTL*S*YAYSR*MGLGSCYLW 782 W + T + P ++S + + + + G C++W Sbjct: 523 INDAWQLNLLYTPLQPD------VITSIMNTHFILNN--GTSDCFIW 561