BLASTX nr result
ID: Glycyrrhiza28_contig00024202
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza28_contig00024202
         (722 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

XP_013710279.1 PREDICTED: uncharacterized protein LOC106414114 [...  134   3e-32
KHN24231.1 Putative ribonuclease H protein [Glycine soja]            124   2e-30
KYP65965.1 Putative ribonuclease H protein At1g65750 family [Caj...  128   3e-30
KYP40438.1 Putative ribonuclease H protein At1g65750 family, par...  128   4e-30
XP_016164673.1 PREDICTED: uncharacterized protein LOC107607211 [...  128   4e-30
KYP76862.1 Putative ribonuclease H protein At1g65750, partial [C...  126   5e-30
KYP37594.1 Putative ribonuclease H protein At1g65750 family, par...  125   1e-29
XP_015935830.1 PREDICTED: uncharacterized protein LOC107461787 [...  126   2e-29
KYP76185.1 Putative ribonuclease H protein At1g65750 [Cajanus ca...  125   4e-29
KYP73155.1 Putative ribonuclease H protein At1g65750 family [Caj...  120   1e-28
GAU18772.1 hypothetical protein TSUD_80610 [Trifolium subterraneum]  122   1e-28
KYP56524.1 Putative ribonuclease H protein At1g65750 family [Caj...  123   2e-28
KYP65942.1 Putative ribonuclease H protein At1g65750 family [Caj...  121   2e-28
AAC63844.1 putative non-LTR retroelement reverse transcriptase [...  122   5e-28
XP_015936169.1 PREDICTED: uncharacterized protein LOC107462117 [...  121   7e-28
KYP42324.1 Putative ribonuclease H protein At1g65750 family [Caj...  120   2e-27
XP_018510856.1 PREDICTED: uncharacterized protein LOC103844431 [...  120   2e-27
XP_019094473.1 PREDICTED: uncharacterized protein LOC109129898 [...
120 2e-27 AID60103.1 hypothetical protein [Brassica napus] 119 4e-27 GAU26239.1 hypothetical protein TSUD_224300 [Trifolium subterran... 117 1e-26 >XP_013710279.1 PREDICTED: uncharacterized protein LOC106414114 [Brassica napus] Length = 1895 Score = 134 bits (337), Expect = 3e-32 Identities = 72/203 (35%), Positives = 109/203 (53%), Gaps = 5/203 (2%) Frame = -2 Query: 595 ETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIR 416 E ++ ++WR P+R R+ LW V + ++TN RKRR+L D G C +C +E+++H++R Sbjct: 1560 EALYNRVWRLVAPERVRVFLWLVSHQVIMTNMERKRRHLSDNGVCSLCKNGDETILHVLR 1619 Query: 415 DCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQ--NDGSWPTLFGTAVYLAWQT 242 DCP+ +W + S FFS PLL W+ N+ + N WPT+F V+ W+ Sbjct: 1620 DCPAAAGLWT-KSVMPSRQHRFFSLPLLEWLYDNLASDRSGNGSQWPTIFAVTVWWCWKW 1678 Query: 241 RNELVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGRHPQ---WTCPDQ 71 R VF + + R++ + + + NK S R S +GGR + W CP+ Sbjct: 1679 RCGYVFGD----IGKCRDRVQYVRDKAREVMDANKILSKR--SVAGGRVEKQIAWKCPES 1732 Query: 70 GWYKLNCDGTVSGFGGMAGCGGV 2 GWYKLN DG G G+A GGV Sbjct: 1733 GWYKLNTDGAARGNPGLATAGGV 1755 >KHN24231.1 Putative ribonuclease H protein [Glycine soja] Length = 317 Score = 124 bits (310), Expect = 2e-30 Identities = 70/210 (33%), Positives = 105/210 (50%), Gaps = 3/210 (1%) Frame = -2 Query: 628 YRILDNNFRSPETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCG 449 YR + + +F +W W GP+R RILLWK+ E L+TN R R + ++ CP C Sbjct: 119 YRKVAGFSNDKDLLFNLIWSWKGPERMRILLWKIANEGLLTNKSRVTRAMAESSECPRCH 178 Query: 448 LEEESVIHLIRDCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQN--DGSWPTL 275 L+ ES++H +RDC +QVW L G +SL+ F + W+ +N+ QN + +W Sbjct: 179 LQPESILHCLRDCFYAKQVWNTLSG-NSLNHLFCAHDCPQWLVSNLRSPQNCEENNWALF 237 Query: 274 FGTAVYLAWQTRNELVFQQK*TTVTQLVFRIKSMASNIEDSIR-ENKRNSSRGISYSGGR 98 F + W+ RNE++F + + ++LV I MA + S+ ENK + Sbjct: 238 FVITISFIWKARNEMIFSNRTQSASELVGHIPRMALEVIQSLSVENKIH----------- 286 Query: 97 HPQWTCPDQGWYKLNCDGTVSGFGGMAGCG 8 W P G +KLNCD V G A G Sbjct: 287 ---WHLPPPGKFKLNCDAAVDNMGNAAIVG 313 >KYP65965.1 Putative ribonuclease H protein 
At1g65750 family [Cajanus cajan] Length = 1043 Score = 128 bits (322), Expect = 3e-30 Identities = 71/199 (35%), Positives = 111/199 (55%), Gaps = 3/199 (1%) Frame = -2 Query: 589 VFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDC 410 +F+ +WRW GP+R R+LLWK+ +L+TN+ R + L D C +C + E +H++RDC Sbjct: 709 IFKMIWRWKGPERVRVLLWKIAHNSLLTNACRFKLGLSDNPSCSLCMHDTEDTLHVLRDC 768 Query: 409 PSMQQVWVCLKGISSLSDPFFSQPLLIWVETNI--GGIQNDGSWPTLFGTAVYLAWQTRN 236 + VW L G S+ + F+ L W+ N+ G + +G W T F A+ W N Sbjct: 769 SFAKVVWRKLLG-STSDEHIFTDELHAWLVRNLSRSGSRWEG-WQTCFALALDSLWHRCN 826 Query: 235 ELVFQQK*TTVTQLVFRIKSMASNIEDSIR-ENKRNSSRGISYSGGRHPQWTCPDQGWYK 59 +++FQ T+ QL+ +IK+ S++ S+ E ++ S R QW CP + +K Sbjct: 827 QVLFQNSQTSSDQLIAKIKARISSLSSSVSLEIQQFSLRQPPLIVTPEYQWCCPPRSLFK 886 Query: 58 LNCDGTVSGFGGMAGCGGV 2 LNCDG+VS G A CGG+ Sbjct: 887 LNCDGSVSQARG-ASCGGI 904 >KYP40438.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 1356 Score = 128 bits (321), Expect = 4e-30 Identities = 70/214 (32%), Positives = 112/214 (52%), Gaps = 5/214 (2%) Frame = -2 Query: 628 YRILDNNFRSPETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCG 449 + + D R P +F+ + +W GP+R R LW+V +L TN WR R L +G CP+C Sbjct: 1018 HSLRDTTDRIP--LFQIICKWKGPERLRCFLWRVAHSSLCTNEWRAYRGLTQSGNCPVCN 1075 Query: 448 LEEESVIHLIRDCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPTLFG 269 E E++IH++RDC +++W + G L + FF+ PLL W++ N+ + D W F Sbjct: 1076 NESETIIHILRDCIEAKEIWRAM-GTEGLLNEFFNLPLLTWLQENLTHV--DPRWCLSFV 1132 Query: 268 TAVYLAWQTRNELVFQQK*TTVTQLVFRIKSMASN-----IEDSIRENKRNSSRGISYSG 104 + W+ RN +VFQQ T++V IK +++ + + ++++ + Sbjct: 1133 IIMDSLWRARNTIVFQQGNFHKTRIVGEIKGRVDEMTKVFLKEVVSDRRQDALVSVG--- 1189 Query: 103 GRHPQWTCPDQGWYKLNCDGTVSGFGGMAGCGGV 2 W P +G KLNCDG V G +A CGGV Sbjct: 1190 -----WNFPPEGILKLNCDGVVDG-NSIAACGGV 1217 >XP_016164673.1 PREDICTED: uncharacterized protein LOC107607211 [Arachis ipaensis] Length = 1901 Score = 128 bits (321), Expect = 4e-30 Identities = 75/213 (35%), Positives = 109/213 (51%), Gaps = 
4/213 (1%) Frame = -2 Query: 628 YRILDNNFRSPETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCG 449 Y++ N +P FR +W W GP+R R LW V A++TNS ++RR+L + CP C Sbjct: 1561 YQLNMENQHAPNKNFRLVWNWQGPERIRTFLWLVTHNAILTNSEKRRRHLTNDDTCPRCR 1620 Query: 448 LEEESVIHLIRDCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPTLFG 269 EES IH++RDCP +W L + S FF+ L W+ N+ +N W LFG Sbjct: 1621 SHEESTIHVLRDCPYAMSIWNRLIPPNGRSS-FFNTELNEWLYQNLTTNKN---WNCLFG 1676 Query: 268 TAVYLAWQTRNELVFQQK----*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGG 101 A+ W RN+LVF + T V Q+ R + S S++ K + +G Sbjct: 1677 VALSSIWYLRNKLVFNGESAHVNTAVNQIKARSEEFLSLTRSSLKPQKSQA------AGE 1730 Query: 100 RHPQWTCPDQGWYKLNCDGTVSGFGGMAGCGGV 2 +W+CP++G K+N DG+ G A CGGV Sbjct: 1731 SLIRWSCPEEGCVKVNVDGSWFGHTRNAACGGV 1763 >KYP76862.1 Putative ribonuclease H protein At1g65750, partial [Cajanus cajan] Length = 538 Score = 126 bits (317), Expect = 5e-30 Identities = 72/199 (36%), Positives = 104/199 (52%), Gaps = 3/199 (1%) Frame = -2 Query: 589 VFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDC 410 VF +W+W GP+R R LLW+V E+LVTN WR RR L CP+C E E+ +H++RDC Sbjct: 287 VFNLIWKWCGPERVRCLLWRVAHESLVTNDWRSRRGLTTDPLCPICKRERETTLHVLRDC 346 Query: 409 PSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPTLFGTAVYLAWQTRNEL 230 + +W+ L + D S +L W+E N+ + + W F + W+ RN Sbjct: 347 LFAKSIWLSLYNDTPGFD-LISNSILDWLEHNLS--RGNKGWSITFAVTLDANWKARNTF 403 Query: 229 VFQQK*TTVTQLVFRIKSMASNIED--SIRENKR-NSSRGISYSGGRHPQWTCPDQGWYK 59 VFQQ T L+ I+ + + + S+ N+ + I Y G W P QG+ K Sbjct: 404 VFQQFQLNTTLLLGEIRGRSRELSNRYSLTANRGVPPPQSILYIG-----WKVPLQGYLK 458 Query: 58 LNCDGTVSGFGGMAGCGGV 2 LNCDG V+ +A CGGV Sbjct: 459 LNCDGAVN-TSRVASCGGV 476 >KYP37594.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 522 Score = 125 bits (314), Expect = 1e-29 Identities = 67/207 (32%), Positives = 106/207 (51%) Frame = -2 Query: 622 ILDNNFRSPETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLE 443 +++NN + ++ +W+W GP+R R LLW+V +L TN WR R L G CP+C ++ Sbjct: 189 
LVNNN----QALYSAIWKWKGPQRIRCLLWRVAQNSLCTNEWRVSRGLATIGLCPICNID 244 Query: 442 EESVIHLIRDCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPTLFGTA 263 +E+++H++RDC + +W L G L + F+ + W+ +N+ I + SW T F A Sbjct: 245 QENIVHILRDCYCVVAIWQRLFG--DLDNDFYHPTVTQWLGSNL--INVEESWATTFAVA 300 Query: 262 VYLAWQTRNELVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGRHPQWT 83 + W+ RN VF Q Q+V IK + + + I + W+ Sbjct: 301 IDSIWKARNMTVFNQLPFNPNQIVCEIKGRTGELTKVNSIGPKPTHYNIDTN---LISWS 357 Query: 82 CPDQGWYKLNCDGTVSGFGGMAGCGGV 2 P G KLNCDG V+ +A CGGV Sbjct: 358 RPPPGVLKLNCDGAVAA-SSIASCGGV 383 >XP_015935830.1 PREDICTED: uncharacterized protein LOC107461787 [Arachis duranensis] Length = 1370 Score = 126 bits (316), Expect = 2e-29 Identities = 73/212 (34%), Positives = 111/212 (52%), Gaps = 3/212 (1%) Frame = -2 Query: 628 YRILDNNFRSPETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCG 449 Y+++ + FR +WRW GP+R R LW ++TNS RKRR+L + CP C Sbjct: 937 YQVIMEEQHTQNQNFRLVWRWQGPERIRTFLWLATHNVILTNSERKRRHLTNDDSCPRCR 996 Query: 448 LEEESVIHLIRDCPSMQQVWVCL---KGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPT 278 EES IH++RDC + +W L GI+S FF+ L W+ N ++++ W Sbjct: 997 CHEESTIHVLRDCFYAKSIWRKLFPPIGINS----FFNTDLNEWLLQN---LKSNNKWSC 1049 Query: 277 LFGTAVYLAWQTRNELVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGR 98 LFG AV W RN+LVF + VT V +I++ + ++ N + R SG Sbjct: 1050 LFGVAVSTMWYLRNKLVFNGESVLVTTAVNQIRARSEEFGRVVQTNL--TLRNNHNSGAS 1107 Query: 97 HPQWTCPDQGWYKLNCDGTVSGFGGMAGCGGV 2 + +W+ P++G+ K+N DG+ A CGGV Sbjct: 1108 NIRWSRPEKGYIKVNVDGSWFSHKNNAACGGV 1139 >KYP76185.1 Putative ribonuclease H protein At1g65750 [Cajanus cajan] Length = 1354 Score = 125 bits (313), Expect = 4e-29 Identities = 69/210 (32%), Positives = 107/210 (50%), Gaps = 1/210 (0%) Frame = -2 Query: 628 YRILDNNFRSPET-VFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMC 452 Y L + R T +F+K+W+WPGP+R R LLW++ ++L TN+WR R L CP C Sbjct: 1031 YASLTPSLRIESTDLFKKIWKWPGPERIRCLLWRISQDSLCTNAWRLSRFLTSDARCPRC 1090 Query: 451 GLEEESVIHLIRDCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPTLF 272 ++ E+ +H++RDC ++VW + G + 
FF+ L IW+E+N+ + +W F Sbjct: 1091 HMDRETTLHVLRDCTFAREVWRGIMG-EDVDQNFFNSQLTIWLESNLD--TSAPNWIHEF 1147 Query: 271 GTAVYLAWQTRNELVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGRHP 92 + W+ RN F Q T Q+V +IK + + R + + H Sbjct: 1148 ALTLDSLWKNRNNFTFCQIVTNPLQVVCKIKGRMHVLSSTTFCKPRTLYQKDNRKS--HI 1205 Query: 91 QWTCPDQGWYKLNCDGTVSGFGGMAGCGGV 2 +WT P KLNCDG V+ A CGG+ Sbjct: 1206 RWTPPPSNSLKLNCDGAVAD-NARASCGGI 1234 >KYP73155.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 354 Score = 120 bits (300), Expect = 1e-28 Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 3/208 (1%) Frame = -2 Query: 616 DNNFRSPETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEE 437 D+N +P +F+ LW+W G +R R+ LW+V E+L+TN R R+L + CP+C + E Sbjct: 31 DSNPCAP--IFKVLWKWQGTERIRLFLWRVAHESLMTNEARFGRDLTTSPICPICMQDVE 88 Query: 436 SVIHLIRDCPSMQQVWVCLKGISSLSDPF---FSQPLLIWVETNIGGIQNDGSWPTLFGT 266 + +H++RDC +QVW + S +S P F + L+ + + + N WP F Sbjct: 89 NTMHVLRDCIFARQVWSSIPRGSCISQPTGSNFQEWLIFHLTRSRTELMN---WPLSFAI 145 Query: 265 AVYLAWQTRNELVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGRHPQW 86 + W RNE+VFQQ + +QLV ++ + A +I +S ++ Y+ W Sbjct: 146 TIDALWNRRNEVVFQQSSLSASQLVVKVTNCAKSIINS--STPFDAGAQSDYTRITR-NW 202 Query: 85 TCPDQGWYKLNCDGTVSGFGGMAGCGGV 2 CP G+ KLN DG VS + G + CGG+ Sbjct: 203 VCPPSGFIKLNGDGAVS-YSGTSSCGGL 229 >GAU18772.1 hypothetical protein TSUD_80610 [Trifolium subterraneum] Length = 482 Score = 122 bits (305), Expect = 1e-28 Identities = 75/238 (31%), Positives = 115/238 (48%), Gaps = 10/238 (4%) Frame = -2 Query: 685 FPFFSLLIFAEPYIQQLHEYRILDNNFRSP---ETVFRKLWRWPGPKRYRILLWKVCLEA 515 FP + L I + Y Y ++N + +F K+W W GP R + LWK+ Sbjct: 144 FPCWKLSI--DGYFSLKTAYEFMENQHQEDLYINPIFEKVWHWKGPNRIKAFLWKLSQGR 201 Query: 514 LVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDCPSMQQVWVCLKGISSLSDPFFSQPL 335 L+TN R+ RN+ ++ CP C ES++H +RDC ++ W + + FFS L Sbjct: 202 LLTNEERRHRNMTNSDLCPRCQDYPESIMHCLRDCEDAREFWTNIIN-PEVWSKFFSIGL 260 Query: 334 LIWVETNIG--GIQNDG-SWPTLFGTAVYLAWQTRNELVFQQK*TTVTQLVFRIKSMASN 164 W++ N+ I NDG 
+W FG AV W+ RN LVF L+F+I + S+ Sbjct: 261 NNWLDWNLSNDNIGNDGNNWSIFFGVAVNELWKDRNSLVFSNISGIDRNLLFKINTQVSS 320 Query: 163 IED--SIREN--KRNSSRGISYSGGRHPQWTCPDQGWYKLNCDGTVSGFGGMAGCGGV 2 I + S ++N R ++ S W P GW+K+N DG+ + G CGG+ Sbjct: 321 IINLHSFQKNLVTRQPGEVVAVS------WKPPLDGWHKVNVDGSFNTISGSTACGGL 372 >KYP56524.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1146 Score = 123 bits (308), Expect = 2e-28 Identities = 73/199 (36%), Positives = 108/199 (54%), Gaps = 3/199 (1%) Frame = -2 Query: 589 VFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDC 410 V+R + RW P+R R+ LW+V ++LVTNS+R R L + CP+C E+ +H++RDC Sbjct: 814 VYRDIARWQAPERLRMFLWRVAQDSLVTNSFRLYRGLTECAICPLCHAGLETAMHILRDC 873 Query: 409 PSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIG-GIQNDGSWPTLFGTAVYLAWQTRNE 233 + +VW L + + F W+ N+ G ++ +W LF AV W RN+ Sbjct: 874 HLVTRVWNVLLQGHLIPEFFRYASACDWILANLSYGCHSEPNWGILFAVAVDAFWYWRNK 933 Query: 232 LVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSG--GRHPQWTCPDQGWYK 59 +VF + ++QLVF+IK + I IR RGIS++ R +W P + K Sbjct: 934 VVFGEIEGDISQLVFQIKGRTNEI---IRVGFGGRPRGISHAAIQQRTIRWYPPPRDSCK 990 Query: 58 LNCDGTVSGFGGMAGCGGV 2 LNCDG VS G+A CGGV Sbjct: 991 LNCDGAVS--YGIASCGGV 1007 >KYP65942.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 457 Score = 121 bits (303), Expect = 2e-28 Identities = 72/212 (33%), Positives = 103/212 (48%), Gaps = 3/212 (1%) Frame = -2 Query: 628 YRILDNN-FRSPETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMC 452 Y LD+N + VF +WRW GP+R ++ WK +L+TN R+RR L CP C Sbjct: 111 YVCLDHNQIECNQAVFSTIWRWKGPERIKLRFWKTAHNSLLTNIARERRGLALENLCPRC 170 Query: 451 GLEEESVIHLIRDCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPTLF 272 E E+ +H +RDC ++ VW L + F + L W+ +N+ + + +W +F Sbjct: 171 HQEPETGLHALRDCVVVKNVWSHLAN-GGIPPNFINSNLGTWIVSNL--TRRNENWRLIF 227 Query: 271 GTAVYLAWQTRNELVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSR--GISYSGGR 98 + W+ RN L+F Q LV IK I + +E SS+ G Y G Sbjct: 228 AVTIDELWKARNALIFDQHHVNPCGLVVSIKRRCLEINRAYKECSTFSSKILGDPY-GSS 286 Query: 97 
HPQWTCPDQGWYKLNCDGTVSGFGGMAGCGGV 2 +W P G KLNCD V G G AGCGG+ Sbjct: 287 LIRWQPPPLGSIKLNCDRAVHGVGRKAGCGGI 318 >AAC63844.1 putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1231 Score = 122 bits (305), Expect = 5e-28 Identities = 67/194 (34%), Positives = 103/194 (53%) Frame = -2 Query: 586 FRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDCP 407 F ++W+ P+R R+ +W V ++TN R RR+L + C +C EE+++H++RDCP Sbjct: 903 FNRIWKLITPERVRVFIWLVSQNVIMTNVERVRRHLSENAICSVCNGAEETILHVLRDCP 962 Query: 406 SMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPTLFGTAVYLAWQTRNELV 227 +M+ +W L + + FFSQ LL W+ TN+ ++ G WPTLFG ++ AW+ R V Sbjct: 963 AMEPIWRRLLPLRRHHE-FFSQSLLEWLFTNMDPVK--GIWPTLFGMGIWWAWKWRCCDV 1019 Query: 226 FQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGRHPQWTCPDQGWYKLNCD 47 F ++ +L F IK MA + N G+ R +W P GW K+ D Sbjct: 1020 FGERKICRDRLKF-IKDMAEEVRRVHVGAVGNRPNGVRVE--RMIRWQVPSDGWVKITTD 1076 Query: 46 GTVSGFGGMAGCGG 5 G G G+A GG Sbjct: 1077 GASRGNHGLAAAGG 1090 >XP_015936169.1 PREDICTED: uncharacterized protein LOC107462117 [Arachis duranensis] Length = 1250 Score = 121 bits (304), Expect = 7e-28 Identities = 70/213 (32%), Positives = 104/213 (48%), Gaps = 4/213 (1%) Frame = -2 Query: 628 YRILDNNFRSPETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCG 449 YRIL+N+ + ++R +W+W GP+R + +W V E ++T S R+ R C C Sbjct: 905 YRILENHGEETDQIWRIIWKWRGPERIKCFIWLVVRERIMT-SHRRARIFGMNSSCHRCT 963 Query: 448 LEEESVIHLIRDCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIG---GIQNDGSWPT 278 EE+ IH++RDCP +VWV L + D FF P W+ N+ G G+W T Sbjct: 964 GVEENTIHMLRDCPVASRVWVKLIHHEHIHD-FFRAPFNAWIRWNLAMDLGTTKQGNWNT 1022 Query: 277 LFGTAVYLAWQTRNELVFQQK*TTVTQ-LVFRIKSMASNIEDSIRENKRNSSRGISYSGG 101 F + W+ RN+ +F Q L F +K + E +E +R + + Sbjct: 1023 QFLVTCWWLWKWRNQEIFNPPFQRPMQPLPFILKQVKLIQEAFKKEGQRKNKIETNIC-- 1080 Query: 100 RHPQWTCPDQGWYKLNCDGTVSGFGGMAGCGGV 2 W CP + W K+N DG G GMAGCGG+ Sbjct: 1081 ----WECPPEDWMKVNTDGAAKGNPGMAGCGGL 1109 >KYP42324.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 
1443 Score = 120 bits (301), Expect = 2e-27 Identities = 64/200 (32%), Positives = 106/200 (53%), Gaps = 4/200 (2%) Frame = -2 Query: 589 VFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDC 410 +F+ +W+WPGP+R R LW++ ++L TN+WR R + + CP+ E E+ H++RDC Sbjct: 1118 LFKMVWKWPGPERVRCFLWRLAHKSLCTNAWRLSRGITNDDGCPIFFSESETCTHILRDC 1177 Query: 409 PSMQQVW-VCLKGISSLSDPFFSQPLLIWVETNIGGIQNDGSWPTLFGTAVYLAWQTRNE 233 VW + L+G + + FF+ PL W+ TN+G + G W +F + W+TRN Sbjct: 1178 RFATTVWKILLQGKNDHN--FFTLPLHEWLATNLG--ETSGYWSKIFAIGLDSIWKTRNN 1233 Query: 232 LVFQQK*TTVTQLVFRIKSMASNIE---DSIRENKRNSSRGISYSGGRHPQWTCPDQGWY 62 VF Q+ + + + S++ + +++ + +G WT P G Sbjct: 1234 YVFNHVLNQPIQVACEVTGRVTELNKSTGSLQTTEFRTTQNTNMTG-----WTRPSPGSI 1288 Query: 61 KLNCDGTVSGFGGMAGCGGV 2 K+NCDGTV+ +A CGGV Sbjct: 1289 KINCDGTVAS-SKLAACGGV 1307 >XP_018510856.1 PREDICTED: uncharacterized protein LOC103844431 [Brassica rapa] Length = 1833 Score = 120 bits (301), Expect = 2e-27 Identities = 67/197 (34%), Positives = 98/197 (49%), Gaps = 2/197 (1%) Frame = -2 Query: 586 FRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDCP 407 + ++WR P+R R+ LW V + ++TN RKRR+L D G C +C E+++H +RDCP Sbjct: 1501 YDRVWRVIVPERVRVFLWLVSHQVIMTNMERKRRHLSDNGMCQLCKSGNETILHTLRDCP 1560 Query: 406 SMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQ--NDGSWPTLFGTAVYLAWQTRNE 233 + +W L S FF Q LL W+ N+ Q N WPT+F V+ W+ R Sbjct: 1561 ASMGLWRRLVD-PSRQQRFFDQSLLQWLYENLTSAQSANGERWPTMFALTVWWCWKWRCG 1619 Query: 232 LVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGRHPQWTCPDQGWYKLN 53 VF + ++ F +K + ++ NK + RH +W P GW KLN Sbjct: 1620 YVFGETGKCPDRVKF-VKDKTQEV--TLANNKLRLHLAVGLREERHIKWRRPSNGWCKLN 1676 Query: 52 CDGTVSGFGGMAGCGGV 2 DG G G+A GGV Sbjct: 1677 TDGASRGNPGLATAGGV 1693 >XP_019094473.1 PREDICTED: uncharacterized protein LOC109129898 [Camelina sativa] Length = 1738 Score = 120 bits (300), Expect = 2e-27 Identities = 70/202 (34%), Positives = 104/202 (51%), Gaps = 8/202 (3%) Frame = -2 Query: 586 FRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDCP 407 F +W P+R 
R+ LW+V + ++TN R RR++ D+ C +C EES++H++RDCP Sbjct: 1406 FACVWGVVAPERVRVFLWQVSHQVIMTNVERVRRHMGDSVVCKVCSGAEESILHVLRDCP 1465 Query: 406 SMQQVWVCLKGISSLSDP-FFSQPLLIWVETN--IGGIQNDGSWPTLFGTAVYLAWQTRN 236 ++ +W L + P FF Q LL W+ N +G +G W TLF V+ AW+ R Sbjct: 1466 AISGIWRRL--VPQRKQPEFFDQSLLPWLFRNLRVGLNSRNGHWSTLFSMTVWWAWKWRC 1523 Query: 235 ELVFQQK*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGR-----HPQWTCPDQ 71 VF ++ T L F ++D E R S ++ +GGR +W CP+ Sbjct: 1524 SDVFGERRTCRDMLKF--------VKDMAEEVHRAHSLSVNTTGGRVGVEQLVKWVCPNV 1575 Query: 70 GWYKLNCDGTVSGFGGMAGCGG 5 GW KL DG G G+A GG Sbjct: 1576 GWVKLTTDGASRGNPGLAAAGG 1597 >AID60103.1 hypothetical protein [Brassica napus] Length = 620 Score = 119 bits (297), Expect = 4e-27 Identities = 62/200 (31%), Positives = 104/200 (52%), Gaps = 3/200 (1%) Frame = -2 Query: 595 ETVFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIR 416 E ++ ++WR P+R R+ LW V + ++TN RKRR++ + G CP+C +E+++H++R Sbjct: 285 EDLYSRVWRVTAPERVRVFLWLVTHQVIMTNMERKRRHISENGTCPLCKSGDETILHVLR 344 Query: 415 DCPSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGIQ--NDGSWPTLFGTAVYLAWQT 242 DCP+ +W L + + FF+ L W+ N+ + N WP+LF V+ W+ Sbjct: 345 DCPAAAGLWRKLV-LPTRQQRFFNLTLFEWLYENLANDKSVNGDQWPSLFALTVWWCWKW 403 Query: 241 RNELVFQQK*TTVTQLVFRIKSMASNIEDSIRENKR-NSSRGISYSGGRHPQWTCPDQGW 65 R VF + + + R++ + ++ I+ NK+ I R W P+ GW Sbjct: 404 RCGYVFGE----IGKCRDRVRFVKDKAQEVIKANKKVREPSAIGVHVERQIAWFVPENGW 459 Query: 64 YKLNCDGTVSGFGGMAGCGG 5 KLN DG G G+A GG Sbjct: 460 VKLNTDGASRGNPGLATAGG 479 >GAU26239.1 hypothetical protein TSUD_224300 [Trifolium subterraneum] Length = 1250 Score = 117 bits (294), Expect = 1e-26 Identities = 64/203 (31%), Positives = 104/203 (51%), Gaps = 7/203 (3%) Frame = -2 Query: 589 VFRKLWRWPGPKRYRILLWKVCLEALVTNSWRKRRNLDDTGFCPMCGLEEESVIHLIRDC 410 +F +W+W GP+R ++ LWK L+TN R RR + + C C L++ES++H+ RDC Sbjct: 920 LFNLVWKWRGPERIKLFLWKATHACLLTNMERLRRKMTASKVCSRCNLQDESLLHVFRDC 979 Query: 409 PSMQQVWVCLKGISSLSDPFFSQPLLIWVETNIGGI---QNDGSWPTLFGTAVYLAWQTR 239 + +W L + + F W+ TN+ G+ +++ +W F + W +R Sbjct: 
980 NFSKSIWQNL-NVQNRRSFFHENDWHQWLLTNLSGMVGSKDEATWSLKFAIILDKIWYSR 1038 Query: 238 NELVFQQK----*TTVTQLVFRIKSMASNIEDSIRENKRNSSRGISYSGGRHPQWTCPDQ 71 N +F K T + Q ++ ++ N++DS SSR I S +W P + Sbjct: 1039 NSFIFSHKEINIFTIIAQAASIMQFLSPNVDDS-------SSRQICNSSS--IRWERPPE 1089 Query: 70 GWYKLNCDGTVSGFGGMAGCGGV 2 + LNCDG V+G G+AGCGGV Sbjct: 1090 NFIALNCDGAVTGLTGLAGCGGV 1112
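
Each alignment in the report above carries a statistics line of the form "Score = ... bits (...), Expect = ... Identities = ... Positives = ... Gaps = ... Frame = ...", and the Gaps field is absent for ungapped hits. A minimal sketch of pulling those values out of the text with a regular expression might look like the following. The names `STATS_RE` and `parse_hsp_stats` are hypothetical and not part of BLAST or of any library, and the pattern only assumes the field layout visible in this report.

```python
import re

# Hypothetical pattern for the per-alignment statistics line as it appears
# in this report. The "Gaps" field is optional because ungapped alignments
# omit it.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hsp_stats(text):
    """Return one dict per statistics line found in `text`."""
    hits = []
    for m in STATS_RE.finditer(text):
        evalue = m.group("evalue")
        # Legacy reports may print very small values as "e-100".
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": int(m.group("ident")),
            "align_len": int(m.group("length")),
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps") or 0),
            "frame": int(m.group("frame")),
        })
    return hits


# Two statistics lines copied from the report above (one with gaps, one without).
sample = (
    "Score = 134 bits (337), Expect = 3e-32 Identities = 72/203 (35%), "
    "Positives = 109/203 (53%), Gaps = 5/203 (2%) Frame = -2 "
    "Score = 125 bits (314), Expect = 1e-29 Identities = 67/207 (32%), "
    "Positives = 106/207 (51%) Frame = -2"
)
stats = parse_hsp_stats(sample)
```

Because the pattern tolerates arbitrary whitespace between fields, it should work on both the flattened text shown here and a conventionally line-wrapped report, though it has only been checked against the lines quoted in `sample`.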