BLASTX nr result
ID: Astragalus22_contig00038115
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00038115 (843 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value
gb|KYP76185.1| Putative ribonuclease H protein At1g65750 [Cajanu...  254  2e-73
dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt...  252  1e-72
gb|KYP40438.1| Putative ribonuclease H protein At1g65750 family,...  248  4e-71
gb|KYP72147.1| LINE-1 reverse transcriptase isogeny [Cajanus cajan]  239  7e-71
gb|KYP72596.1| Putative ribonuclease H protein At1g65750 family ...  234  6e-69
ref|XP_020210568.1| uncharacterized protein LOC109795461 [Cajanu...  239  4e-68
gb|KYP45885.1| Putative ribonuclease H protein At1g65750 family ...  237  2e-67
gb|KYP65965.1| Putative ribonuclease H protein At1g65750 family ...  233  3e-66
gb|KYP61054.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu...  227  3e-66
gb|KYP48048.1| LINE-1 reverse transcriptase isogeny [Cajanus cajan]  219  7e-66
gb|KYP70239.1| Putative ribonuclease H protein At1g65750 family ...  231  1e-65
gb|KYP74374.1| Putative ribonuclease H protein At1g65750 family ...  226  2e-63
ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachi...  225  3e-63
gb|KYP53058.1| Putative ribonuclease H protein At1g65750 family,...  223  9e-63
gb|KYP33748.1| Putative ribonuclease H protein At1g65750 family ...  219  4e-61
gb|KYP57513.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu...  210  9e-61
gb|KYP69874.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu...
212 6e-59 gb|KYP49443.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu... 199 2e-56 ref|XP_016168765.1| uncharacterized protein LOC107611342 [Arachi... 202 2e-55 ref|XP_016206284.1| uncharacterized protein LOC107646622 [Arachi... 200 1e-54 >gb|KYP76185.1| Putative ribonuclease H protein At1g65750 [Cajanus cajan] Length = 1354 Score = 254 bits (649), Expect = 2e-73 Identities = 127/279 (45%), Positives = 174/279 (62%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL+WNF+RDTL D+G P + L+WHCISSP +Q+LWNGEAL F Sbjct: 579 WMAIKIDLEKAYDRLNWNFVRDTLVDIGLPQKLIELIWHCISSPSMQVLWNGEALEEFVP 638 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482 + + P + F + + + L + K P F Sbjct: 639 SRGIRQGDPISPYIFVLCMERLFHLIKIAEDHHLWKPIKLSKKGPPLSHLAFADDLILFS 698 Query: 481 RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302 QA IK L FC S KVS +KTRIF+S N+ + ++ +I LGFQ T DLGK Sbjct: 699 EASLDQAEIIKACLDNFCHSSGMKVSTEKTRIFFSKNIGWSVKNEISSSLGFQRTDDLGK 758 Query: 301 YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122 Y+GI + H+ V+K S + ++D + +RLS+WK ++LS A RLTLTKSVL A+PSY +Q+ + Sbjct: 759 YIGIKLHHERVSKRSLQSVMDHIKRRLSSWKTKTLSFAGRLTLTKSVLAAIPSYTMQTVL 818 Query: 121 IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 +PK +C DIDK CRSFIWG+++G+R+ H ++W +C PK Sbjct: 819 LPKQLCYDIDKSCRSFIWGQDSGKRRVHALAWETLCKPK 857 >dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum] Length = 1250 Score = 252 bits (643), Expect = 1e-72 Identities = 137/288 (47%), Positives = 183/288 (63%), Gaps = 8/288 (2%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL+W F+++TLED+G P + NL+W CIS+ +++LWNGEAL F Sbjct: 454 WMAIKIDLEKAYDRLNWEFVKETLEDIGVPRRMVNLIWSCISTSKMRVLWNGEALEEFSP 513 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*S---- 494 + + P L YLF I + Q + L + + P L+ + S Sbjct: 514 SRGIRQGDP-LSPYLFVL-------CIERLFQSINLAVDQNKLSPIKLSRGGPKISHLAY 565 Query: 
493 ----FAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGF 326 F QA+ IK +L TFC S QKVS +KT+IF+S NV +H+R ++ E GF Sbjct: 566 ADDLLLFGEATVSQAQNIKVILDTFCISSGQKVSPEKTKIFFSKNVGWHVRQEVSERCGF 625 Query: 325 QSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALP 146 T +LGKYLG+ I H ++ +F+FI+DKV QRLS WKA++LS A R+TL KSV+QALP Sbjct: 626 GWTDNLGKYLGVPILHNKASRATFQFIMDKVGQRLSNWKAKNLSFAGRVTLAKSVIQALP 685 Query: 145 SYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PKK 2 Y +QS ++PK +CD+IDK CRSFIWG+ RK H ISW K+C PKK Sbjct: 686 VYTMQSTLLPKSICDEIDKKCRSFIWGDTEESRKIHLISWDKICSPKK 733 >gb|KYP40438.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 1356 Score = 248 bits (632), Expect = 4e-71 Identities = 134/280 (47%), Positives = 173/280 (61%), Gaps = 1/280 (0%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL+WNFIRDTL D+G P N LVW CIS+P ++LWNGEAL F Sbjct: 565 WMAIKIDLEKAYDRLNWNFIRDTLTDIGLPQNFVELVWACISTPSSRVLWNGEALQEFHP 624 Query: 661 LEELGRVIPCLLIYLFY-AWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAF 485 + + P L YLF + F + + Q+L + + P F Sbjct: 625 SRGIRQGDP-LSPYLFVLCMERLFHIIEVAVAQKLWKPICLSKQGPPLSHLAFADDLILF 683 Query: 484 CRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLG 305 Q IK L FC S QKVSL+KTRIF+S NV + +R +I LGFQ T +LG Sbjct: 684 SEASLDQVEVIKACLELFCKSSGQKVSLEKTRIFFSKNVGWSVREEISSALGFQRTDNLG 743 Query: 304 KYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSC 125 KYLG+ I H V + + II+KVNQRLS+WKA++LS A RLTLTK VL LP Y +Q+ Sbjct: 744 KYLGVPIQHDRVNRRLYSSIINKVNQRLSSWKAKTLSFAGRLTLTKFVLVTLPMYTMQTA 803 Query: 124 IIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 +P+ +CDDIDK CRSF+WG + +++ H ++WS +C PK Sbjct: 804 FLPRKICDDIDKECRSFLWGHKGEQQRIHAVAWSVICKPK 843 >gb|KYP72147.1| LINE-1 reverse transcriptase isogeny [Cajanus cajan] Length = 628 Score = 239 bits (609), Expect = 7e-71 Identities = 120/279 (43%), Positives = 170/279 (60%) Frame = -3 Query: 841 
WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL+W FI++TL +G P N+ L+WHCISS +Q+LWNGE L F+ Sbjct: 181 WMAIKIDLEKAYDRLNWTFIKETLTMIGIPLNLVELIWHCISSSSMQVLWNGETLPEFKP 240 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482 + + P + F + + Q L + K P F Sbjct: 241 TRGIRQGDPLSPYIFVLCMERLFHLIEVAVCQELWKPIKQSKKGPAISHLAFADNLILFA 300 Query: 481 RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302 QA IK L +FC S KVS + TR+F+S NV ++++ +I LGFQ T +LGK Sbjct: 301 EASLDQAEIIKSCLDSFCLSSGMKVSEENTRVFFSKNVGWNVKSEISSSLGFQRTDNLGK 360 Query: 301 YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122 YLG+ + H V++ SF+ +++ +N+R+S+WKA++LS A RLTLTKSVL ALPSY +Q+ Sbjct: 361 YLGVQLHHTRVSRNSFQSVMNSINRRISSWKAKTLSFAGRLTLTKSVLAALPSYTMQTVF 420 Query: 121 IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 +P+ +CD+IDK RSF+WG+ R+ H I+W +C PK Sbjct: 421 LPRQLCDEIDKASRSFLWGDSRAHRRVHAIAWETICKPK 459 >gb|KYP72596.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 646 Score = 234 bits (597), Expect = 6e-69 Identities = 127/291 (43%), Positives = 178/291 (61%), Gaps = 12/291 (4%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668 WMA+K+DLEKAYDRL W+FI+DTLED+GFPS NLV CI++P +++LWNGE L F Sbjct: 10 WMALKVDLEKAYDRLEWSFIQDTLEDIGFPSTFINLVMACITTPKMRMLWNGEILDEFSP 69 Query: 667 -RFLEELGRVIPCLLIY----LFY-----AWKGFFT*LILKSEQRLGLLLSSH*KMP*DL 518 R + + + P + + LF+ KGF++ + L G LS H DL Sbjct: 70 SRGIRQGDPISPYIFVLCIERLFHIIECAVEKGFWSPIQLSKR---GPKLS-HLGFADDL 125 Query: 517 TSCLCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRE 338 F + Q I+ L FC S QKV+ +KT++F+S NV + +R ++ Sbjct: 126 V--------LFAEANVEQVEVIQTCLDLFCKSSGQKVNKEKTKVFFSKNVSWTVRNQLSS 177 Query: 337 CLGFQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVL 158 LG Q T DLGKYLG+ + HK VT ++ I+DKV R+S WK SLS+A R+T KSVL Sbjct: 178 SLGVQRTEDLGKYLGVPLHHKRVTTNTYSNILDKVRNRMSCWKRNSLSMAGRVTFAKSVL 237 Query: 157 
QALPSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 ALP+Y +Q+ ++PK +C+++DK+ R FIWGE + RK H ISW+ +C PK Sbjct: 238 NALPTYTMQTSLLPKTICEELDKLTRKFIWGENDHDRKIHTISWNTICQPK 288 >ref|XP_020210568.1| uncharacterized protein LOC109795461 [Cajanus cajan] Length = 1200 Score = 239 bits (609), Expect = 4e-68 Identities = 120/279 (43%), Positives = 170/279 (60%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL+W FI++TL +G P N+ L+WHCISS +Q+LWNGE L F+ Sbjct: 434 WMAIKIDLEKAYDRLNWTFIKETLTMIGIPLNLVELIWHCISSSSMQVLWNGETLPEFKP 493 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482 + + P + F + + Q L + K P F Sbjct: 494 TRGIRQGDPLSPYIFVLCMERLFHLIEVAVCQELWKPIKQSKKGPAISHLAFADNLILFA 553 Query: 481 RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302 QA IK L +FC S KVS + TR+F+S NV ++++ +I LGFQ T +LGK Sbjct: 554 EASLDQAEIIKSCLDSFCLSSGMKVSEENTRVFFSKNVGWNVKSEISSSLGFQRTDNLGK 613 Query: 301 YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122 YLG+ + H V++ SF+ +++ +N+R+S+WKA++LS A RLTLTKSVL ALPSY +Q+ Sbjct: 614 YLGVQLHHTRVSRNSFQSVMNSINRRISSWKAKTLSFAGRLTLTKSVLAALPSYTMQTVF 673 Query: 121 IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 +P+ +CD+IDK RSF+WG+ R+ H I+W +C PK Sbjct: 674 LPRQLCDEIDKASRSFLWGDSRAHRRVHAIAWETICKPK 712 >gb|KYP45885.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1192 Score = 237 bits (604), Expect = 2e-67 Identities = 124/279 (44%), Positives = 169/279 (60%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL+WNFI++TLED+GFP I L+W+CIS+ ++LWNGE L +F Sbjct: 659 WMAIKIDLEKAYDRLNWNFIKETLEDIGFPLKIIELIWNCISTAKFRMLWNGEMLESFSP 718 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482 + + P + F + + Q+L + P F Sbjct: 719 SRGIRQGDPISPYLFVLCMERLFHLINISVTQKLWKPIRLSRSGPELSHLAFADDLILFA 778 Query: 481 RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302 Q I+ L+ FC+S QK+S +KTRIF+S NV +++R +I 
GFQ +LGK Sbjct: 779 EARLDQVEIIQACLNLFCTSSGQKISQEKTRIFFSKNVNWNVRNEISSSFGFQRAENLGK 838 Query: 301 YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122 YLGI + H V + + II+KV QRL+ WKA+SLS A RLTLTKSVL ALPSY +Q+ Sbjct: 839 YLGIPLHHSRVNRATHSGIIEKVTQRLNNWKAKSLSFAGRLTLTKSVLTALPSYTMQTVW 898 Query: 121 IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 +P+ +CDDIDK R F+WG+ + +K H +SWS +C PK Sbjct: 899 LPRNICDDIDKKNRQFLWGDTSHNKKVHTVSWSVICQPK 937 >gb|KYP65965.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1043 Score = 233 bits (593), Expect = 3e-66 Identities = 123/278 (44%), Positives = 165/278 (59%) Frame = -3 Query: 838 MAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRFL 659 +AIKIDLEKAYDRL+W FI+DTLED+G PS +LVW CIS+ +Q+LWNGE L F Sbjct: 244 LAIKIDLEKAYDRLNWLFIKDTLEDIGLPSKFIDLVWSCISTASLQVLWNGEVLEAFSPS 303 Query: 658 EELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFCR 479 + + P + F + + Q+L + P F Sbjct: 304 RGIRQGDPISPYLFVLCMERLFHLIDITVTQQLWKPIRLSRGGPSLTHLAFADDLILFAE 363 Query: 478 CHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGKY 299 + Q I+ L+ FCSS QK+S +KTRIF+S NV +R +I GFQ +LGKY Sbjct: 364 ANMNQVEIIQSCLNHFCSSSGQKISQEKTRIFFSKNVARTVREEISSAFGFQRAENLGKY 423 Query: 298 LGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCII 119 LGI + H V ++++ I+DK+ QRLS WKA++LS A RLTLTKSVL ALPSY +Q + Sbjct: 424 LGIPLHHSRVNRDTYHGIMDKITQRLSNWKAKNLSFAGRLTLTKSVLAALPSYTMQMVRL 483 Query: 118 PKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 P+ +CD++DK CR F+WG+ RK H I WS +C PK Sbjct: 484 PRSICDEVDKKCRQFLWGDSEDCRKIHTIGWSMLCLPK 521 >gb|KYP61054.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 636 Score = 227 bits (578), Expect = 3e-66 Identities = 122/282 (43%), Positives = 169/282 (59%), Gaps = 3/282 (1%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668 WMAIKIDLEKAYDRL+W FI++TL D+G P+N LVW CISS ++++WNGEAL F Sbjct: 218 WMAIKIDLEKAYDRLNWKFIKETLIDIGLPNNFVELVWACISSGKLRMMWNGEALEEFLP 277 Query: 667 
-RFLEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SF 491 R + + + P L + + F + + + RL + P Sbjct: 278 SRGVRQGDPISPYLFVLCM---ERLFQLINMTIDHRLWKPIQLSRNGPMISHLAFADDIV 334 Query: 490 AFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSD 311 F Q I+ L+ FC S QKVS +KTRIF+S NV + +R +I GFQ T + Sbjct: 335 LFAEASLDQVEVIQGCLNVFCDSAGQKVSNEKTRIFFSKNVGHVVRSEISNAFGFQRTEN 394 Query: 310 LGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQ 131 LG YLG+ H V+ +++ IIDKVN RLS WKA++LS A R+TLTKSVL+ALPSY +Q Sbjct: 395 LGNYLGVPTHHSRVSHATYQSIIDKVNNRLSGWKAKNLSFAGRITLTKSVLEALPSYIMQ 454 Query: 130 SCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 + +PK VCD ++K R F+WG+ + + H I+W+ +C PK Sbjct: 455 TVSLPKTVCDALEKSSRGFLWGDNSEHHRPHAINWNTICLPK 496 >gb|KYP48048.1| LINE-1 reverse transcriptase isogeny [Cajanus cajan] Length = 364 Score = 219 bits (557), Expect = 7e-66 Identities = 121/289 (41%), Positives = 164/289 (56%), Gaps = 9/289 (3%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668 WMAIKIDLEKAYDRL WNF++DTL+D+G P NL+W I SP ++++WNGEAL F Sbjct: 66 WMAIKIDLEKAYDRLKWNFVKDTLQDIGLPQIFVNLIWASILSPRLRMVWNGEALEEFTP 125 Query: 667 -RFLEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLS------SH*KMP*DLTSC 509 R + + G + P L + + + S+Q + LS SH DL Sbjct: 126 SRGIRQGGPISPYLFVLCMERLFQLIS-AAVTSDQWKPIKLSRDGRPLSHLAFADDLV-- 182 Query: 508 LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329 F Q IK L FC+S QKVSL+KTRI++S NV + +R +I G Sbjct: 183 ------LFAEASINQVEIIKTCLDLFCASSGQKVSLEKTRIYFSKNVNHSIREEISSTFG 236 Query: 328 FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149 +Q +LGKYLGI H V ++ +I+ V++R WK +L RLTL KSVL + Sbjct: 237 YQCIDNLGKYLGIPAHHSRVCHRDYQGLIEHVSRR--GWKTSALLFMGRLTLCKSVLSTI 294 Query: 148 PSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PKK 2 PSY +QS +P+ CD+ID+ICR F+WG R+FH I W+KVC K+ Sbjct: 295 PSYTMQSVYLPRSTCDEIDRICRDFLWGGSRNNRRFHAIGWNKVCMAKE 343 >gb|KYP70239.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1157 Score = 231 
bits (590), Expect = 1e-65 Identities = 121/279 (43%), Positives = 167/279 (59%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL+WNFI++TLED+GFP I L+W+CIS+ ++LWNGE L +F Sbjct: 674 WMAIKIDLEKAYDRLNWNFIKETLEDIGFPLKIIELIWNCISTAKFRMLWNGEMLESFSP 733 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482 + + P + F + + Q+L + P F Sbjct: 734 SRGIRQGDPISPYLFVLCMERLFHLINISVTQKLWKPIRLSRSGPELSHLAFADDLILFA 793 Query: 481 RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302 Q I+ L+ FC+S QK+S +KTRIF+S NV +++ +I FQ +LGK Sbjct: 794 EARLDQVEIIQACLNLFCTSSGQKISQEKTRIFFSKNVNWNVINEISSSFSFQQAENLGK 853 Query: 301 YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122 YLGI + H V + ++ II+KV QRL+ WKA+SLS A RLTLTKS L ALPSY +Q+ Sbjct: 854 YLGIPLHHSRVNRATYSGIIEKVTQRLNNWKAKSLSFAGRLTLTKSFLTALPSYTMQTVW 913 Query: 121 IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 +P+ +CDDIDK R F+WG+ + +K H +SWS +C PK Sbjct: 914 LPRNICDDIDKKNRQFLWGDTSHNKKVHTVSWSVICQPK 952 >gb|KYP74374.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1421 Score = 226 bits (575), Expect = 2e-63 Identities = 118/279 (42%), Positives = 165/279 (59%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WM IKIDLEKAYDRL+WNF++DTL D+GFP N +L+W CISS +++LWNGEAL F Sbjct: 772 WMIIKIDLEKAYDRLNWNFVKDTLLDIGFPENFISLIWSCISSSKMRVLWNGEALEEFLP 831 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482 + + P + F + + +L + P F Sbjct: 832 SRGVRQGDPISPYIFVLCMERLFHLIEIAVNHQLWKPIRISRGGPKIAHLAFADDLLLFA 891 Query: 481 RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302 Q I+ L FCSS QKVS DKTRI +S NV + +R +I GF T +LGK Sbjct: 892 EASVDQVEIIQTCLDLFCSSSGQKVSQDKTRIHFSKNVSWRVREEISNKFGFLRTDNLGK 951 Query: 301 YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122 YLG+ I H+ V + F+ +++KVNQRLS+WKA++LS A R+TLT+SVL ALPSY +QS Sbjct: 952 
YLGVPIHHRRVNRVLFKGVVEKVNQRLSSWKAKTLSFAGRVTLTQSVLSALPSYLMQSVY 1011 Query: 121 IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 +P+ VCD++DK R F+W ++ + + H +SW + P+ Sbjct: 1012 LPRQVCDELDKHYRRFLWDDKENKHRLHAVSWEVISKPR 1050 >ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachis ipaensis] Length = 1901 Score = 225 bits (574), Expect = 3e-63 Identities = 122/288 (42%), Positives = 167/288 (57%), Gaps = 8/288 (2%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WM IKIDLEKAYDRL+WNFI++TL D+GFP N NL CIS+ +++ WNGE L F Sbjct: 1109 WMTIKIDLEKAYDRLNWNFIKETLMDIGFPQNFINLTLSCISTARMRVFWNGEELEEFSP 1168 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTS--------CL 506 + + P + Y+F I K Q + + P L C Sbjct: 1169 TRGIRQGDP-ISPYIFVL-------CIEKLSQLISAAVEHDFWKPIRLKKDGPPISHLCF 1220 Query: 505 CR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGF 326 F + QA I + L FC S QKVS DKTR+F+S NV +++R +I + F Sbjct: 1221 ADDIILFAEANVDQANIINKCLEAFCKSSGQKVSKDKTRVFFSRNVGHNVRTEISNVMQF 1280 Query: 325 QSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALP 146 T DL KYLG+ I H VTK +F II+K++ RL++WKA SLSLA R TL KSVL ++P Sbjct: 1281 TRTDDLRKYLGVPILHSKVTKHTFEGIINKLHVRLNSWKASSLSLAGRTTLVKSVLSSMP 1340 Query: 145 SYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PKK 2 Y + S ++P C+ ID+ICR+FIWG+ + +K H ++W K+C PK+ Sbjct: 1341 IYNMHSALLPTATCNSIDRICRNFIWGDTDQNKKVHLLNWKKICEPKQ 1388 >gb|KYP53058.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 1039 Score = 223 bits (568), Expect = 9e-63 Identities = 124/284 (43%), Positives = 165/284 (58%), Gaps = 9/284 (3%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL WNF++DTL+D+G P NL+W ISSP ++++WNGEAL F Sbjct: 491 WMAIKIDLEKAYDRLKWNFVKDTLQDIGLPQTFVNLIWASISSPRLRMVWNGEALEEFTP 550 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LI---LKSEQRLGLLLS------SH*KMP*DLTSC 509 E+ + P + YLF LI S Q + LS SH DL Sbjct: 551 
SREIRQGDP-ISPYLFVLCMERLFQLISAAANSNQWKPIKLSRDGPPLSHLAFADDLV-- 607 Query: 508 LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329 F Q IK L FC S QK SL+KT+I++S NV + +R +I G Sbjct: 608 ------LFAEASINQVEIIKTCLDLFCVSSGQKASLEKTKIYFSKNVNHSIREEISSAFG 661 Query: 328 FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149 +Q T +LGK+LGI H V ++ +I++V++RLS WK +LS A RLTL K+VL A+ Sbjct: 662 YQRTDNLGKFLGIPANHSRVCHRDYQGLIERVSRRLSGWKTSALSFAGRLTLCKTVLSAI 721 Query: 148 PSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKV 17 PSY +QS +P+ CD+ID+I R F+WG R+FH I W+KV Sbjct: 722 PSYTMQSVYLPRRTCDEIDRISRDFLWGGSRNNRRFHAIGWNKV 765 >gb|KYP33748.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1133 Score = 219 bits (557), Expect = 4e-61 Identities = 130/289 (44%), Positives = 169/289 (58%), Gaps = 9/289 (3%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL W+F++DTL D+G PS NLVW I+SP ++LWNGEAL F Sbjct: 533 WMAIKIDLEKAYDRLKWSFVKDTLLDIGLPSQFVNLVWASITSPKFRMLWNGEALEEFSP 592 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LI---LKSEQRLGLLLS------SH*KMP*DLTSC 509 + + P + YLF LI ++S+Q + LS SH DL Sbjct: 593 SHGIRQGDP-ISPYLFVLCMERLFQLITSTVESQQWRPIKLSRDGPLLSHLAFADDL--- 648 Query: 508 LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329 F Q I+ L C S QKVS++KTRIF+S NV + +R +I G Sbjct: 649 -----ILFAEATSDQVEVIQSCLDQLCGSSGQKVSIEKTRIFFSKNVSHVIRNEISTTFG 703 Query: 328 FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149 FQ TS+LGKYLGI H V + ++ II++VN+RLS WK +LS A RLTL KSVL A+ Sbjct: 704 FQCTSNLGKYLGIPAHHSRVCQRDYQEIIERVNKRLSGWKTSTLSFAGRLTLCKSVLSAI 763 Query: 148 PSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PKK 2 PSY +QS +D++CRSF+ GE N +R++H I WS VC PK+ Sbjct: 764 PSYTMQS----------VDRLCRSFLSGESNNQRRYHAIGWSTVCQPKE 802 >gb|KYP57513.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 520 Score = 210 bits (534), Expect = 9e-61 Identities = 116/266 (43%), Positives = 161/266 (60%), Gaps = 9/266 (3%) 
Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668 WMAIKIDLEKAYDRL W+F++DTL D+G P+ NLVW ISSP +++LWNGEAL F Sbjct: 264 WMAIKIDLEKAYDRLKWSFVKDTLLDIGLPNQFVNLVWVSISSPKLRMLWNGEALEEFVP 323 Query: 667 -RFLEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLS------SH*KMP*DLTSC 509 R + + + P L + T + S+Q + LS SH DL Sbjct: 324 SRGIRQGDPISPYLFVLCMERLFHLIT-TTVDSQQWKPIRLSRDGPLLSHLAFADDL--- 379 Query: 508 LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329 F Q I+ L+ FC+S QKVS++KTRI++S NV + +R ++ G Sbjct: 380 -----ILFAEATLDQVEVIQSCLNHFCASSGQKVSIEKTRIYFSKNVSHIVRNEVSSAFG 434 Query: 328 FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149 FQ T +LGKYLGI H V + ++ II++VN+RLS WK+ +LS A RLTL KSVL A+ Sbjct: 435 FQRTDNLGKYLGIPAHHSRVCRRDYQGIIERVNKRLSGWKSSTLSFAGRLTLCKSVLSAI 494 Query: 148 PSYAVQSCIIPKGVCDDIDKICRSFI 71 PSY +QS +P+ VCD++D++C +F+ Sbjct: 495 PSYTMQSVFLPRSVCDEVDRLCSNFL 520 >gb|KYP69874.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 956 Score = 212 bits (539), Expect = 6e-59 Identities = 123/288 (42%), Positives = 167/288 (57%), Gaps = 9/288 (3%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMA KIDLEKAYDRL W+F++DTL D+G P+ + N++W CISSP +++LWNGE L F Sbjct: 656 WMAFKIDLEKAYDRLKWDFVKDTLLDIGLPAQLVNIIWACISSPRMRMLWNGETLDEFLP 715 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILK---SEQRLGLLLS------SH*KMP*DLTSC 509 ++ + P + YLF LI K +++ + L+ SH DL Sbjct: 716 SRDVRQGDP-ISPYLFVLCIERLFQLITKEVEAKRWKPIRLAKDGPPLSHLAFADDL--- 771 Query: 508 LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329 F QA I+ L FC+S QKVSL+KT+IF+S NV + +R I LG Sbjct: 772 -----ILFSEASMNQAEIIRDCLDRFCASSGQKVSLEKTKIFFSKNVAHTVRDDISSGLG 826 Query: 328 FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149 FQ T++LGKYLGI H V + ++ +I+ VN+RLS WKA +LS A RLTL KSV++A+ Sbjct: 827 FQRTNNLGKYLGIPAHHSRVCRRDYQNVINCVNKRLSGWKASTLSFAGRLTLCKSVIEAI 886 Query: 148 PSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 PSY + DK+C 
SF+WG+ RK H ISW +C PK Sbjct: 887 PSYTSK----------QFDKLCMSFLWGDSPTSRKIHAISWKTICMPK 924 >gb|KYP49443.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 548 Score = 199 bits (506), Expect = 2e-56 Identities = 124/285 (43%), Positives = 162/285 (56%), Gaps = 6/285 (2%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL W FI+DTLED+G P +VW CIS+P + +LWNGE L +F Sbjct: 105 WMAIKIDLEKAYDRLKWKFIKDTLEDIGLPQQFVEMVWACISTPSMSMLWNGEKLEDFTP 164 Query: 661 LEELGRVIPCLLIYLFY-AWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFA- 488 + + + P L YLF + F + + +L + P SCL +FA Sbjct: 165 SKGIRQGDP-LSPYLFVLCMERVFHLIEIAVIHKLWKPIKLSKGGP--PLSCL---AFAD 218 Query: 487 ----FCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQS 320 F Q I+Q L FC S QKVSL+KTRIF+S NV + ++ +I GFQ Sbjct: 219 DLILFSEASMDQVEIIQQCLDIFCGSLGQKVSLEKTRIFFSKNVGWAVKNEISNAFGFQR 278 Query: 319 TSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSY 140 T +LGKYLG+ I H V + R + DKVNQRL++WK R+LS+ RLTLTKSVL A+PSY Sbjct: 279 TDNLGKYLGVSIHHDRVNRRLLRSVKDKVNQRLNSWKTRNLSVTGRLTLTKSVLAAIPSY 338 Query: 139 AVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 +Q+ P SF+ + G RK H +W K+ PK Sbjct: 339 TMQTVFFPD-----------SFVM-KLTGLRKVHVKAWQKIYKPK 371 >ref|XP_016168765.1| uncharacterized protein LOC107611342 [Arachis ipaensis] Length = 917 Score = 202 bits (513), Expect = 2e-55 Identities = 113/279 (40%), Positives = 153/279 (54%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662 WMAIKIDLEKAYDRL FI++TL D+G P N NL+ CI + +++LWNGE L F Sbjct: 315 WMAIKIDLEKAYDRLKECFIKETLADIGLPQNFVNLILSCILTARMRVLWNGEELEEFTP 374 Query: 661 LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482 + GF+ + LK + P C F Sbjct: 375 SRAVDH--------------GFWKPIRLKKDG------------PPISHLCFADDIILFA 408 Query: 481 RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302 + QA I + L FC S Q VS +KTR+ +S NV + +R ++ L F T DLGK Sbjct: 409 
EANLEQANVINKCLEAFCDSSGQSVSKEKTRVIFSKNVGHTVRAELSNILQFSRTDDLGK 468 Query: 301 YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122 YLGI I H V+K +F II+K++ RL++WKA SLSLA R+TL K VL ++P Y +Q + Sbjct: 469 YLGIPILHSRVSKHAFEGIINKLHARLNSWKASSLSLAGRVTLVKYVLSSMPLYNMQYAV 528 Query: 121 IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 + C+ ID ICR+F+WG +K H +SW +VC PK Sbjct: 529 LSSTTCNTIDCICRNFLWGNTEQTKKIHLLSWKRVCEPK 567 >ref|XP_016206284.1| uncharacterized protein LOC107646622 [Arachis ipaensis] Length = 1460 Score = 200 bits (509), Expect = 1e-54 Identities = 112/291 (38%), Positives = 168/291 (57%), Gaps = 12/291 (4%) Frame = -3 Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668 +MAIKIDLEKAYD L+W FIRDTL + P N+ +L+ HC SS +++LWNG +F Sbjct: 322 YMAIKIDLEKAYDLLNWKFIRDTLIEARLPENLVDLISHCYSSAEMKVLWNGIPSNSFTP 381 Query: 667 -RFLEELGRVIPCLLIYL---------FYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DL 518 R + + + P L + F + F+ ++L R G LS Sbjct: 382 SRGIRQGDPMSPYLFVLCIERLSQIISFAVNQNFWEPMVLN---RGGPKLSH-------- 430 Query: 517 TSCLCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRE 338 C F + Q ++ +L FC QKV+ K +++SDN+ + + ++ + Sbjct: 431 -LCFADDIVLFGKASMEQVEVVRGILDLFCKCSGQKVNYFKFCVYFSDNMCFARKKELSD 489 Query: 337 CLGFQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVL 158 LG + T+++GKYLG+ + H KE F+FI+D++ RLS+WKA +LSLA R+TLT+S L Sbjct: 490 ALGMRLTNNMGKYLGVPLLHGRSKKEDFQFILDRMANRLSSWKATNLSLAGRVTLTQSAL 549 Query: 157 QALPSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5 ++PSY +Q+ +P +CD IDKICR+F+WG + RK H +SW KVC PK Sbjct: 550 ASIPSYVMQTMKLPLSICDSIDKICRNFLWGSVSSGRKPHLMSWEKVCLPK 600