BLASTX nr result
ID: Panax24_contig00018034
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Panax24_contig00018034 (798 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

XP_012076974.1 PREDICTED: uncharacterized protein LOC105637914 [...   279   8e-88
XP_011003058.1 PREDICTED: putative nuclease HARBI1 [Populus euph...   271   1e-84
XP_017223139.1 PREDICTED: putative nuclease HARBI1 [Daucus carot...   268   5e-84
XP_009603754.1 PREDICTED: uncharacterized protein LOC104098663 [...   268   8e-84
XP_011007748.1 PREDICTED: uncharacterized protein LOC105113312 [...   268   2e-83
XP_002307739.2 hypothetical protein POPTR_0005s26390g [Populus t...   268   2e-83
AHL69788.1 hypothetical protein [Camellia sinensis]                   265   6e-83
XP_002281001.1 PREDICTED: uncharacterized protein LOC100247440 [...   265   2e-82
XP_010262020.1 PREDICTED: putative nuclease HARBI1 [Nelumbo nuci...   264   7e-82
XP_019249162.1 PREDICTED: putative nuclease HARBI1 [Nicotiana at...   261   4e-81
XP_009762418.1 PREDICTED: putative nuclease HARBI1 [Nicotiana sy...   258   5e-80
XP_010247371.1 PREDICTED: putative nuclease HARBI1 [Nelumbo nuci...   256   5e-79
XP_017643943.1 PREDICTED: uncharacterized protein LOC108484603 [...   253   7e-78
XP_016678760.1 PREDICTED: uncharacterized protein LOC107897721 [...   253   7e-78
KVH93992.1 Harbinger transposase-derived nuclease [Cynara cardun...   251   2e-77
XP_004144012.1 PREDICTED: putative nuclease HARBI1 [Cucumis sati...   249   8e-77
XP_012451022.1 PREDICTED: uncharacterized protein LOC105773564 [...   249   3e-76
XP_006342328.1 PREDICTED: uncharacterized protein LOC102590956 [...   246   3e-75
XP_016723331.1 PREDICTED: uncharacterized protein LOC107935269 [...
245 1e-74 ABC86705.1 R111 [Coffea arabica] CDP03132.1 unnamed protein prod... 244 2e-74 >XP_012076974.1 PREDICTED: uncharacterized protein LOC105637914 [Jatropha curcas] Length = 528 Score = 279 bits (714), Expect = 8e-88 Identities = 151/296 (51%), Positives = 189/296 (63%), Gaps = 53/296 (17%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYGFFQELDT---------IDMDPSSIKRRRIDEI------PEI 203 MEISS PF++Q++ S+FY FQ++DT ++ D KR+ +E P Sbjct: 1 MEISSFPFLNQDEISHFYSLFQDMDTSATGTGAFGVNNDLKKRKRKGDEEDGDGNKDPFQ 60 Query: 204 EASKQYPVK-----EIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYN 368 + KQ +K +I+++L++LDEEEK QQ W ++SQQ +LF SN ++ MN+Y+ Sbjct: 61 QHDKQDDLKKSAMRDILASLLMLDEEEKQEQQQWVIDSQQDRTLFDSNYKRKVEVMNDYS 120 Query: 369 IHFQDHYDN---------------------------------NXXXXXXXXXXXXHRRLW 449 Q+HY + HRRLW Sbjct: 121 SQLQNHYSDLDEMDHSRTKRARRSASTVAALAIENAASGDSAQVSGSGGAGGSGQHRRLW 180 Query: 450 VKDRSKAWWDLCNSQDFPDEEFKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQR 629 VKDRSK WW+ C+ DFP+EEFKK+FRMSKATF MIC+ELDSVVTKK+TMLR AIPVRQR Sbjct: 181 VKDRSKDWWERCSHPDFPEEEFKKSFRMSKATFAMICDELDSVVTKKNTMLRDAIPVRQR 240 Query: 630 VAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 VAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVC+AIR+VLMPKFLQWPDE ++ Sbjct: 241 VAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRNVLMPKFLQWPDEERL 296 >XP_011003058.1 PREDICTED: putative nuclease HARBI1 [Populus euphratica] Length = 520 Score = 271 bits (692), Expect = 1e-84 Identities = 148/288 (51%), Positives = 181/288 (62%), Gaps = 45/288 (15%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYGFFQELDTIDMDPSS---IKRRRIDEIPE------IEASKQY 221 MEIS PF++QE+ S+ Y FQ++D+ + +S +KRRR D + I++S Sbjct: 1 MEISPFPFLNQEEISHLYSLFQDIDSTTLATNSNAHLKRRRKDGEEDESGWKKIKSSNNN 60 Query: 222 PV-KEIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYNIHFQDHY--- 389 V +I+S+LILLDEEEK Q W +ESQ + N +Q MN+Y + H+ Sbjct: 61 SVMSDILSSLILLDEEEKQEYQQWAIESQLDKAALDWNHKQKVVAMNDYRSDLEAHFSDL 120 Query: 390 --------------------------------DNNXXXXXXXXXXXXHRRLWVKDRSKAW 473 +++ HRRLWVKDRSK W Sbjct: 121 
DEMDHTRTKRIRRAASEVATAAAVTETALASSESSGPGGGGSGQQPQHRRLWVKDRSKDW 180 Query: 474 WDLCNSQDFPDEEFKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRL 653 WD CN DFPDEEF+KAFRMSKATFD+IC ELDS VTKK+TMLR AIPVRQR+AVCIWRL Sbjct: 181 WDKCNHPDFPDEEFRKAFRMSKATFDLICMELDSAVTKKNTMLRDAIPVRQRIAVCIWRL 240 Query: 654 ATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 ATGEPLRLVSKRFGLGISTCHKLVLEVC+AI++VLM KF+QWPDE KM Sbjct: 241 ATGEPLRLVSKRFGLGISTCHKLVLEVCSAIKNVLMQKFVQWPDEEKM 288 >XP_017223139.1 PREDICTED: putative nuclease HARBI1 [Daucus carota subsp. sativus] Length = 484 Score = 268 bits (685), Expect = 5e-84 Identities = 148/254 (58%), Positives = 172/254 (67%), Gaps = 11/254 (4%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYGFFQELDTIDMDPSSIKRRRIDEIPEIEASKQYPVKEIVSTL 248 MEI S PF+SQ+++SN +GFFQE + +MD + K+R+ + E E + YP + V +L Sbjct: 1 MEIGSHPFLSQDEFSNLFGFFQEFEGFEMDSFNSKKRQ--KGGEFEGFEGYPGQGFVGSL 58 Query: 249 ILLDEEEKSGQQNWKVESQQHS---------SLFQSNTQQNSHTMNEYNIHFQDHYDNNX 401 E SG + + V S Q +LF S T + D Sbjct: 59 DFGGERAGSGGEIFGVGSGQEQVLKKVGDSVTLFDDQYGGVSRTGELRVKRARQEGDGGV 118 Query: 402 XXXXXXXXXXXH--RRLWVKDRSKAWWDLCNSQDFPDEEFKKAFRMSKATFDMICEELDS 575 RRLWVKDRSKAWWDLC+S +FP+EEFKKAFRMSKATFDMICEELDS Sbjct: 119 GGNGVQGSGGQQQQRRLWVKDRSKAWWDLCSSPEFPEEEFKKAFRMSKATFDMICEELDS 178 Query: 576 VVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRSV 755 VTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIR+V Sbjct: 179 AVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRNV 238 Query: 756 LMPKFLQWPDENKM 797 LMPK+LQWP+ENKM Sbjct: 239 LMPKYLQWPNENKM 252 >XP_009603754.1 PREDICTED: uncharacterized protein LOC104098663 [Nicotiana tomentosiformis] XP_016435589.1 PREDICTED: uncharacterized protein LOC107761818 [Nicotiana tabacum] Length = 501 Score = 268 bits (685), Expect = 8e-84 Identities = 147/269 (54%), Positives = 177/269 (65%), Gaps = 26/269 (9%) Frame = +3 Query: 69 MEISSIPFISQEDY-SNFYGFFQELDTIDMDPSSI---------KRRRIDEI---PEIEA 209 MEISS PF + EDY SNF+ FFQ+ D D ++ K+R+ D+ P +E Sbjct: 1 
MEISSFPFPTPEDYPSNFFSFFQDFDLPTTDNTTATAFASEPLPKKRKTDDFDFDPIVEE 60 Query: 210 SKQYPVKEIVSTLILLDEEEKSGQQNWKVESQQ------HSSLFQSNTQQNSHTMNEYNI 371 V++I+S + D EE+ N+ + S Q + S+F+ + QQ + + Sbjct: 61 GSLKTVEDILSKFLGFDNEEEKNNTNFDLSSHQPTQPLANQSVFEFSNQQTMESAKPLAV 120 Query: 372 HFQDHYDNNXXXXXXXXXXXX-------HRRLWVKDRSKAWWDLCNSQDFPDEEFKKAFR 530 + + N+ RRLWVKDRSKAWW+ CNS DFP+EEFKKAFR Sbjct: 121 NNKRGRQNSAEFISTSEDSTGGNSQPLQQRRLWVKDRSKAWWEQCNSPDFPEEEFKKAFR 180 Query: 531 MSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGIST 710 MSKATFDMIC+EL+SVVTKKDTMLR AIPVRQRVAVCIWRLATGEPLR VSKRFGLGIST Sbjct: 181 MSKATFDMICDELESVVTKKDTMLRQAIPVRQRVAVCIWRLATGEPLREVSKRFGLGIST 240 Query: 711 CHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 CHKLVLEVC AIRSVLMPKFLQWPDENK+ Sbjct: 241 CHKLVLEVCTAIRSVLMPKFLQWPDENKL 269 >XP_011007748.1 PREDICTED: uncharacterized protein LOC105113312 [Populus euphratica] Length = 523 Score = 268 bits (685), Expect = 2e-83 Identities = 145/291 (49%), Positives = 182/291 (62%), Gaps = 48/291 (16%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYG-FFQELDTIDMDPS-----SIKRRRIDE---------IPEI 203 MEIS PF++QE+ S+ Y FQ++D + + ++KR R D + + Sbjct: 1 MEISPFPFLNQEEISHLYSSLFQDMDNNSLTSNDNINANLKRGRKDGEDDEDKLGGLRKA 60 Query: 204 EASKQYPVKEIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYNIHFQD 383 +++ + +I+++LILLDEEEK Q W +ESQ + F N ++ MN+Y H Q Sbjct: 61 KSNNNSLMSDILTSLILLDEEEKQEHQQWAIESQHDKAAFDRNHKRKLEAMNDYRSHLQT 120 Query: 384 HYDN---------------------------------NXXXXXXXXXXXXHRRLWVKDRS 464 H+ + + RRLWVKDRS Sbjct: 121 HFSDLDEMDSSGTKRHGRCASLTAAAVAVAVTETAQASSEQSGSGGGSGQQRRLWVKDRS 180 Query: 465 KAWWDLCNSQDFPDEEFKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCI 644 K WW+ CN DFP+EEF+KAFRMSKATFDMIC ELDSVVTKK+TMLR AIPVRQRVAVCI Sbjct: 181 KDWWEKCNHPDFPEEEFRKAFRMSKATFDMICVELDSVVTKKNTMLRDAIPVRQRVAVCI 240 Query: 645 WRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 WRLATGEPLRLVSKRFGLGISTCHKLVLEVC+AIR+VLMPKFLQWP+E+K+ Sbjct: 241 WRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRNVLMPKFLQWPNEDKL 291 >XP_002307739.2 hypothetical protein 
POPTR_0005s26390g [Populus trichocarpa] EEE94735.2 hypothetical protein POPTR_0005s26390g [Populus trichocarpa] Length = 517 Score = 268 bits (684), Expect = 2e-83 Identities = 147/285 (51%), Positives = 180/285 (63%), Gaps = 42/285 (14%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYGFFQELDTIDMDPSS---IKRRRIDEIPE------IEASKQY 221 MEIS PF++QE+ S+ Y FQ++D+ + +S +KRRR D + I++S Sbjct: 1 MEISPFPFLNQEEISHLYSLFQDIDSNTLATNSNAHLKRRRKDGEEDESGWKKIKSSNNN 60 Query: 222 PV-KEIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYNIHFQDHYDN- 395 V +I+S+LILLDEEEK Q W +ESQ + N +Q MN+Y + ++ + Sbjct: 61 SVMSDILSSLILLDEEEKQEYQQWAIESQLDKAALDWNHKQKVVAMNDYRSDLEANFSDL 120 Query: 396 -------------------------------NXXXXXXXXXXXXHRRLWVKDRSKAWWDL 482 + HRRLWVKDRSK WWD Sbjct: 121 DEMDHTRTKRARRAASEVATAAAVTETALPSSGPGGGGSGQQQQHRRLWVKDRSKDWWDK 180 Query: 483 CNSQDFPDEEFKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATG 662 CN DFPDEEF+KAFRMSKATFD+IC ELDS VTKK+TMLR AIPVRQR+AVCIWRLATG Sbjct: 181 CNHPDFPDEEFRKAFRMSKATFDLICMELDSAVTKKNTMLRDAIPVRQRIAVCIWRLATG 240 Query: 663 EPLRLVSKRFGLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 EPLRLVSKRFGLGISTCHKLVLEVC+AI++VLM KF+QWPDE KM Sbjct: 241 EPLRLVSKRFGLGISTCHKLVLEVCSAIKNVLMQKFVQWPDEEKM 285 >AHL69788.1 hypothetical protein [Camellia sinensis] Length = 468 Score = 265 bits (677), Expect = 6e-83 Identities = 136/243 (55%), Positives = 175/243 (72%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYGFFQELDTIDMDPSSIKRRRIDEIPEIEASKQYPVKEIVSTL 248 ME+S F++ +DY+NFY FQE T + D + KRRR +E + ++ + ++V+TL Sbjct: 1 MEVSPFSFLNPDDYTNFYSIFQE--TQEEDIITKKRRRTEE----QVPEEVVLNDVVATL 54 Query: 249 ILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYNIHFQDHYDNNXXXXXXXXXX 428 + LDEE+KS QQ + +SQ ++ + + T + +N Sbjct: 55 LFLDEEQKSEQQQYH-KSQSMNNDYSNQFHDLDDTQQLKTKRTRCEVSDNSAQLGGGSGG 113 Query: 429 XXHRRLWVKDRSKAWWDLCNSQDFPDEEFKKAFRMSKATFDMICEELDSVVTKKDTMLRL 608 HRRLWVK+RSKAWW+ CN DFP+EEF++AFRMS+ATFD+IC+ELDS V KKDTMLR+ Sbjct: 114 Q-HRRLWVKNRSKAWWEHCNHPDFPEEEFRRAFRMSRATFDLICDELDSAVNKKDTMLRM 172 Query: 609 
AIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRSVLMPKFLQWPDE 788 AIPVRQRVAVCIWRLATGEPLR+VSKRFGLGISTCHKLVLEVC+AI+SVLMPKFLQWPDE Sbjct: 173 AIPVRQRVAVCIWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKSVLMPKFLQWPDE 232 Query: 789 NKM 797 +K+ Sbjct: 233 SKL 235 >XP_002281001.1 PREDICTED: uncharacterized protein LOC100247440 [Vitis vinifera] Length = 509 Score = 265 bits (677), Expect = 2e-82 Identities = 140/271 (51%), Positives = 174/271 (64%), Gaps = 40/271 (14%) Frame = +3 Query: 102 EDYSNFYGFFQELDTIDMDPSS---------IKRRRIDEIPEIEASKQYPVKEIVSTLIL 254 +D+S+FY FFQ+ D S+ KRRR + +KEI+++L+L Sbjct: 8 DDFSHFYSFFQDSGNSIPDDSAGGGGSGGGGRKRRR--GAGGGSGDESGALKEILASLLL 65 Query: 255 LDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYNIHFQDHY--------------- 389 LDEEEK Q+ W +++Q +LF+ N ++ + MNEY + + Sbjct: 66 LDEEEKLDQEKWALDNQNERALFEENHRKRAQAMNEYRADMVERFGEMEESESSRVKRAR 125 Query: 390 ----------------DNNXXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEEFKK 521 +N HRRLWVKDRSKAWW+ C+ DFP+E+F++ Sbjct: 126 RSASTVAAGVAAAAASSDNVGSESSSQPVGHHRRLWVKDRSKAWWEWCSHPDFPEEDFRR 185 Query: 522 AFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLG 701 AFRMS+ATFDMIC+ELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLG Sbjct: 186 AFRMSRATFDMICDELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLG 245 Query: 702 ISTCHKLVLEVCAAIRSVLMPKFLQWPDENK 794 ISTCHKLVLEVCAAI++VLMPKFLQWPD+ K Sbjct: 246 ISTCHKLVLEVCAAIKTVLMPKFLQWPDDEK 276 >XP_010262020.1 PREDICTED: putative nuclease HARBI1 [Nelumbo nucifera] Length = 527 Score = 264 bits (674), Expect = 7e-82 Identities = 141/291 (48%), Positives = 183/291 (62%), Gaps = 48/291 (16%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYGFFQE-LDTIDMDPSSI-----------KRRRI------DEI 194 + + S+P +QEDYS F+ FQ+ L+ ++ +P++ ++R++ DE Sbjct: 4 INVGSLPLPAQEDYSYFFSLFQDVLNEMNTNPTAATTTGRNVGDNWRKRKVSEGDSADEE 63 Query: 195 PEIEASKQYPVKEIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYNIH 374 + + VKEI+S+ +LL+E+EK QQ W + QQ + +SN +Q S M +Y + Sbjct: 64 EITTPNNKNRVKEILSSYLLLEEQEKLDQQEWSNDYQQERQIIESNYKQRSQAMLDYYNN 123 Query: 375 
FQDHYDN------------------------------NXXXXXXXXXXXXHRRLWVKDRS 464 Q +Y + HRRLWVKDRS Sbjct: 124 LQGYYSDLEESDDQLRTKRSRLSASAVAAAAVASTSDATIRAGGGAATGHHRRLWVKDRS 183 Query: 465 KAWWDLCNSQDFPDEEFKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCI 644 +AWWD CN DFP+EEF++AFRM +ATFDMICEEL+SVV K+DTMLR AIPVRQRVAVCI Sbjct: 184 RAWWDRCNHPDFPEEEFQRAFRMGRATFDMICEELNSVVAKEDTMLRAAIPVRQRVAVCI 243 Query: 645 WRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 WRLATGEPLRLVSKRFGLGISTCHKLVLEVC+AI++VLMPKFLQWPDE K+ Sbjct: 244 WRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDEEKL 294 >XP_019249162.1 PREDICTED: putative nuclease HARBI1 [Nicotiana attenuata] OIS99914.1 hypothetical protein A4A49_14737 [Nicotiana attenuata] Length = 502 Score = 261 bits (667), Expect = 4e-81 Identities = 147/271 (54%), Positives = 176/271 (64%), Gaps = 28/271 (10%) Frame = +3 Query: 69 MEISSIPFISQEDY-SNFYGFFQELD---------TIDMDPSSIKRRRIDEIPE---IEA 209 MEISS PF + EDY SNF+ FFQ+ D T+ +K+R+ D+ +E Sbjct: 1 MEISSFPFPTPEDYPSNFFSFFQDFDFPTTDNTTATVFASEPLLKKRKTDDFDFDSIVEE 60 Query: 210 SKQYPVKEIVSTLILLDEEEKSGQQNWKVESQQ------HSSLFQSNTQQ---------N 344 V++I+S + D EE+ + S Q + S+F+ N Q+ N Sbjct: 61 GSLKTVEDILSKFLGFDNEEEKINTQFDFSSHQPTQPLVNQSVFELNQQKMDSVKPLTGN 120 Query: 345 SHTMNEYNIHFQDHYDNNXXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEEFKKA 524 + + + F +++ RRLWVKDRSKAWW+ CNS DFP+EEFKKA Sbjct: 121 NKRSRQNSAEFISTSEDSTGGGNSQPPLQ-QRRLWVKDRSKAWWEQCNSPDFPEEEFKKA 179 Query: 525 FRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGI 704 FRMSKATFDMIC+EL+SVVTKKDTMLR AIPVRQRVAVCIWRLATGEPLR VSKRFGLGI Sbjct: 180 FRMSKATFDMICDELESVVTKKDTMLRQAIPVRQRVAVCIWRLATGEPLREVSKRFGLGI 239 Query: 705 STCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 STCHKLVLEVC AIRSVLMPKFLQWPDENKM Sbjct: 240 STCHKLVLEVCTAIRSVLMPKFLQWPDENKM 270 >XP_009762418.1 PREDICTED: putative nuclease HARBI1 [Nicotiana sylvestris] XP_016515645.1 PREDICTED: putative nuclease HARBI1 [Nicotiana tabacum] Length = 502 Score = 258 bits (660), Expect = 5e-80 Identities = 145/270 (53%), Positives = 173/270 (64%), 
Gaps = 27/270 (10%) Frame = +3 Query: 69 MEISSIPFISQEDY-SNFYGFFQELDTIDMDPSSI---------KRRRIDEIPE---IEA 209 MEISS PF +QEDY SNF+ FFQ+ D D ++ K+R+ D+ +E Sbjct: 1 MEISSFPFPTQEDYPSNFFSFFQDFDFPATDNTTATAFASEPLPKKRKTDDFDFDSIVEE 60 Query: 210 SKQYPVKEIVSTLILLDEEEKSGQQNWKVESQQ------HSSLFQSNTQQNSHTMNEYNI 371 V++I+S + D EE+ + S Q + S+F+ + QQ + Sbjct: 61 GSLKTVEDILSKFLGFDNEEEKINTQFDFSSHQPTQPLANQSVFEFSNQQKMESAKPLTG 120 Query: 372 HFQDHYDNNXXXXXXXXXXXX--------HRRLWVKDRSKAWWDLCNSQDFPDEEFKKAF 527 + + N+ RRLWVKDRSKAWW+ CNS DFP+EEFKKAF Sbjct: 121 NNKRSRQNSAEFISTSEDSTGGDNSQPLQQRRLWVKDRSKAWWEQCNSPDFPEEEFKKAF 180 Query: 528 RMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGIS 707 RMSK TFDMIC+EL+SVVTKKDTMLR AIPVRQRVAVCIWRLATGEPLR VSKRFGLGIS Sbjct: 181 RMSKGTFDMICDELESVVTKKDTMLRQAIPVRQRVAVCIWRLATGEPLREVSKRFGLGIS 240 Query: 708 TCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 TCHKLVLEVC AIRSVLMPKF+QWPDENKM Sbjct: 241 TCHKLVLEVCTAIRSVLMPKFVQWPDENKM 270 >XP_010247371.1 PREDICTED: putative nuclease HARBI1 [Nelumbo nucifera] Length = 526 Score = 256 bits (655), Expect = 5e-79 Identities = 143/286 (50%), Positives = 171/286 (59%), Gaps = 46/286 (16%) Frame = +3 Query: 78 SSIPFISQEDYSNFYGFFQ----ELDTIDMDPSSIKRRRIDEIP------EIEASKQYPV 227 SS+P +QEDYS FY FQ E++T ++ ++RR D E Sbjct: 8 SSLPLPTQEDYSFFYSLFQDALSEMNTNTTTGNNSRKRRRDNTNDEADEGETTTGNNKNF 67 Query: 228 KEIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYNIHFQDHY------ 389 KEI+++ +LL+E+EK QQ QQ L +SN +Q S M Y + QD+Y Sbjct: 68 KEILTSYLLLEEQEKLDQQESARAYQQERQLLESNHKQRSQAMLNYYNNLQDYYSDLEES 127 Query: 390 ------------------------------DNNXXXXXXXXXXXXHRRLWVKDRSKAWWD 479 D HRRLWVKDRS+AWWD Sbjct: 128 DEQLRTKRSRLSASAVAAATASVASAKANSDAPIQSGGAGATAGHHRRLWVKDRSQAWWD 187 Query: 480 LCNSQDFPDEEFKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLAT 659 CN DFP+EEF+KAFRM +ATFDMICEEL+SVV K+DTMLR AIPV QRVAVCIWRLAT Sbjct: 188 RCNHPDFPEEEFRKAFRMGRATFDMICEELNSVVAKEDTMLRAAIPVHQRVAVCIWRLAT 247 Query: 660 GEPLRLVSKRFGLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 
GEPLRLVSKRFGLGISTCHKLVLEVC+AI++VLMPKFLQWPDE + Sbjct: 248 GEPLRLVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDEEAL 293 >XP_017643943.1 PREDICTED: uncharacterized protein LOC108484603 [Gossypium arboreum] Length = 510 Score = 253 bits (646), Expect = 7e-78 Identities = 141/275 (51%), Positives = 176/275 (64%), Gaps = 32/275 (11%) Frame = +3 Query: 69 MEISSIPFISQED------YSNFYGFFQELDTIDMDPSSIKRRRIDEIPEIEASKQYPVK 230 MEI S F+S ED Y++ + +F +++T + ++ KR R D E S++ Sbjct: 1 MEIGSFTFLSPEDFSFNNNYNSDFSWFLDMET-GFNRNTKKRGRKD-FEESLVSEKSGFG 58 Query: 231 EIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEY------NIHFQDHYD 392 +I+S++++LDEE K Q W S Q S+ FQ+N + N MN Y ++ D+ Sbjct: 59 DILSSILMLDEEAKQEQYQWVTNSDQDSAFFQANYKGNVQEMNGYFENQFSEMNQLDNSS 118 Query: 393 N--------------------NXXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEE 512 N N RRLWVKDRSK WW CN DFPDEE Sbjct: 119 NKRARKSGSPAAAATVSAGSDNVGPSQSGRGSGQQRRLWVKDRSKDWWTKCNHPDFPDEE 178 Query: 513 FKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRF 692 FK+AFRMSKATF+M+CEEL+ VTKK+TMLR AIPVRQRVAVCIWRLATGEPLR+VSKRF Sbjct: 179 FKRAFRMSKATFNMVCEELEPAVTKKNTMLRDAIPVRQRVAVCIWRLATGEPLRMVSKRF 238 Query: 693 GLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 GLGISTCHKLVLEVCAAI++VLMPKF+QWPDE+KM Sbjct: 239 GLGISTCHKLVLEVCAAIKTVLMPKFVQWPDEHKM 273 >XP_016678760.1 PREDICTED: uncharacterized protein LOC107897721 [Gossypium hirsutum] Length = 510 Score = 253 bits (646), Expect = 7e-78 Identities = 141/275 (51%), Positives = 176/275 (64%), Gaps = 32/275 (11%) Frame = +3 Query: 69 MEISSIPFISQED------YSNFYGFFQELDTIDMDPSSIKRRRIDEIPEIEASKQYPVK 230 MEI S F+S ED Y++ + +F +++T + ++ KR R D E S++ Sbjct: 1 MEIGSFTFLSPEDFSFNNNYNSDFSWFLDMET-GFNRNTKKRGRKD-FEESLVSEKSGFG 58 Query: 231 EIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEY------NIHFQDHYD 392 +I+S++++LDEE K Q W S Q S+ FQ+N + N MN Y ++ D+ Sbjct: 59 DILSSILMLDEEAKQEQYQWVTNSDQDSAFFQANYKGNVQEMNGYFENQFSEMNQLDNSS 118 Query: 393 N--------------------NXXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEE 512 N N RRLWVKDRSK WW CN DFPDEE Sbjct: 119 
NKRARKSGSPAAAATVSAGSDNVGPSQSGSGSGQQRRLWVKDRSKDWWTKCNHPDFPDEE 178 Query: 513 FKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRF 692 FK+AFRMSKATF+M+CEEL+ VTKK+TMLR AIPVRQRVAVCIWRLATGEPLR+VSKRF Sbjct: 179 FKRAFRMSKATFNMVCEELEPAVTKKNTMLRDAIPVRQRVAVCIWRLATGEPLRMVSKRF 238 Query: 693 GLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 GLGISTCHKLVLEVCAAI++VLMPKF+QWPDE+KM Sbjct: 239 GLGISTCHKLVLEVCAAIKTVLMPKFVQWPDEHKM 273 >KVH93992.1 Harbinger transposase-derived nuclease [Cynara cardunculus var. scolymus] Length = 478 Score = 251 bits (641), Expect = 2e-77 Identities = 133/247 (53%), Positives = 171/247 (69%), Gaps = 4/247 (1%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYGFFQELDTIDM---DPSSIKRRRIDEIPEIEASKQYPVKEIV 239 MEISSIPF++QE+YS FY FFQE DT + D + KRR+I+E ++S +KEI+ Sbjct: 1 MEISSIPFLNQEEYSYFYSFFQEFDTNNTAIDDIQTNKRRKINETTTGDSSS---LKEIL 57 Query: 240 STLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMN-EYNIHFQDHYDNNXXXXXX 416 T+ +DE+E S + + + T +++ + + ++ Sbjct: 58 DTISFMDEQEIPDFDFQMPNLDFGSEMAEPERGRIRKTPPPKFDATVTEEWSSSQGGGGG 117 Query: 417 XXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEEFKKAFRMSKATFDMICEELDSVVTKKDT 596 RRLWVK+RSK WWD NS + PDEEFKKAFRMSK+TF+MIC+ELD+ VTKKDT Sbjct: 118 GGGGGPQRRLWVKERSKGWWDYYNSDECPDEEFKKAFRMSKSTFNMICDELDAAVTKKDT 177 Query: 597 MLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRSVLMPKFLQ 776 MLR+AIPVRQRVAVC++RLATG+PLR VS RFGLGISTCHKLVLEVCAAIR+VLMPKFLQ Sbjct: 178 MLRMAIPVRQRVAVCLYRLATGDPLRTVSSRFGLGISTCHKLVLEVCAAIRNVLMPKFLQ 237 Query: 777 WPDENKM 797 WPD+ ++ Sbjct: 238 WPDDERL 244 >XP_004144012.1 PREDICTED: putative nuclease HARBI1 [Cucumis sativus] KGN66264.1 hypothetical protein Csa_1G589710 [Cucumis sativus] Length = 483 Score = 249 bits (637), Expect = 8e-77 Identities = 134/253 (52%), Positives = 166/253 (65%), Gaps = 10/253 (3%) Frame = +3 Query: 69 MEISSIPFISQEDYSNFYGFFQELD----TIDMDPSSIKRRRIDEIPEIEASKQY----- 221 MEISS PF++QE++ + F E+D T +++P+S KRRR D + S + Sbjct: 1 MEISSFPFLNQEEFLPIFNLFSEMDNPTATFNVNPTSKKRRRSDPNSDDFNSFSFTDEND 60 Query: 222 
-PVKEIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEYNIHFQDHYDNN 398 P + + L + + QQNW +++Q+ +N S + + + Sbjct: 61 DPTADPLLKLPCWFDPQPESQQNWLMDAQKPKP---TNDFHLSDQIPKKPRRASPENPSP 117 Query: 399 XXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEEFKKAFRMSKATFDMICEELDSV 578 RRLWVKDRSK WWD CN DFPDEEF++AFRMSK+TFDMIC+ELDS Sbjct: 118 VKNTPAGGGGTQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKSTFDMICKELDST 177 Query: 579 VTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRSVL 758 V KKDTMLR+AIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVC+AIR VL Sbjct: 178 VMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVL 237 Query: 759 MPKFLQWPDENKM 797 MPKFL WPDE+K+ Sbjct: 238 MPKFLNWPDESKL 250 >XP_012451022.1 PREDICTED: uncharacterized protein LOC105773564 [Gossypium raimondii] Length = 510 Score = 249 bits (635), Expect = 3e-76 Identities = 139/275 (50%), Positives = 173/275 (62%), Gaps = 32/275 (11%) Frame = +3 Query: 69 MEISSIPFISQED------YSNFYGFFQELDTIDMDPSSIKRRRIDEIPEIEASKQYPVK 230 MEI S F+S ED Y++ + +F +++T + ++ KR R D E S++ Sbjct: 1 MEIGSFTFLSPEDFSFNNNYNSDFSWFLDMET-GFNGNTKKRSRKD-FEESLVSEKSGFG 58 Query: 231 EIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEY------NIHFQDHYD 392 +I+S++++LDEE K Q W S Q + FQ+N N MN Y ++ D+ Sbjct: 59 DILSSILMLDEEAKQEQYQWVTNSDQDRAFFQANYNGNVQEMNGYFENQFSEMNQLDNSS 118 Query: 393 N--------------------NXXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEE 512 N N RRLWVKDRSK WW CN DFPDEE Sbjct: 119 NKRARKSGSPAAAATVSAGSDNVGPSQSGSGSGQQRRLWVKDRSKDWWTKCNHPDFPDEE 178 Query: 513 FKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRF 692 FK+AFRMSKATF+M+CEEL+ V KK+TMLR AIPVRQRVAVCIWRLATGEPLR+VSKRF Sbjct: 179 FKRAFRMSKATFNMVCEELEPAVMKKNTMLRDAIPVRQRVAVCIWRLATGEPLRMVSKRF 238 Query: 693 GLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 GLGISTCHKLVLEVCAAI++VLMPKF+QWPDE+KM Sbjct: 239 GLGISTCHKLVLEVCAAIKTVLMPKFVQWPDESKM 273 >XP_006342328.1 PREDICTED: uncharacterized protein LOC102590956 [Solanum tuberosum] Length = 506 Score = 246 bits (628), Expect = 3e-75 Identities = 140/271 (51%), Positives = 173/271 (63%), Gaps = 
30/271 (11%) Frame = +3 Query: 69 MEISSIPFISQEDYS--NFYGFFQELD---TIDMDPSSI---------KRRRIDEIP--- 197 MEISS F +QEDY NF+ FFQ+ D T D ++ K+RR+D+ Sbjct: 1 MEISSFLFPNQEDYPSPNFFSFFQDFDFPATTDTTAAAAAPIAAEPLPKKRRVDDFDFDL 60 Query: 198 --EIEASKQYPVKEIVSTLILLDEEEKSGQQNWKVESQQ---HSSLFQSNTQQNSHTMNE 362 +E V++I++ + D+EE + SQ+ + S+F + QQ+ MNE Sbjct: 61 EQVVEQGSLKSVEDILNKFLGFDKEEDKTELKLDWSSQEQFANQSVFDFSNQQSGLIMNE 120 Query: 363 Y--------NIHFQDHYDNNXXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEEFK 518 N + + RRLWVKDRSKAWW+ CNS DFP+EEFK Sbjct: 121 KVKPMAGTNNKRSRQNSAEFIPTSEEESQPQQQRRLWVKDRSKAWWEQCNSPDFPEEEFK 180 Query: 519 KAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGL 698 KAFRM++ATFDMICEEL+SVVTKKDTMLR AIPVRQRVAVCIWRLATGEPLR VSKRFGL Sbjct: 181 KAFRMTRATFDMICEELESVVTKKDTMLRQAIPVRQRVAVCIWRLATGEPLREVSKRFGL 240 Query: 699 GISTCHKLVLEVCAAIRSVLMPKFLQWPDEN 791 GISTCHKLVLEVC AI+ VLMPKF+QWP+++ Sbjct: 241 GISTCHKLVLEVCTAIKGVLMPKFVQWPNDD 271 >XP_016723331.1 PREDICTED: uncharacterized protein LOC107935269 [Gossypium hirsutum] Length = 510 Score = 245 bits (625), Expect = 1e-74 Identities = 137/275 (49%), Positives = 172/275 (62%), Gaps = 32/275 (11%) Frame = +3 Query: 69 MEISSIPFISQED------YSNFYGFFQELDTIDMDPSSIKRRRIDEIPEIEASKQYPVK 230 MEI F+S ED Y++ + +F +++T + ++ KR R D E S++ Sbjct: 1 MEIGPFTFLSPEDFSFNNNYNSDFSWFLDMET-GFNGNTKKRGRKD-FEESLVSEKSGFG 58 Query: 231 EIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHTMNEY------NIHFQDHYD 392 +I+S++++LDEE K Q W Q + FQ+N + N MN Y ++ D+ Sbjct: 59 DILSSILMLDEEAKQEQYQWVTNPDQDRAFFQANYKGNVQEMNGYFENQFSEMNQLDNSS 118 Query: 393 N--------------------NXXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEE 512 N N RRLWVKDRSK WW CN DFPDEE Sbjct: 119 NKRARKSGSPAAAATVSAGSDNVGPSQSGSGSGQQRRLWVKDRSKDWWTKCNHPDFPDEE 178 Query: 513 FKKAFRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRF 692 FK+AFRMSKATF+M+CEEL+ V KK+TMLR AIPVRQRVAVCIWRLATGEPLR+VSKRF Sbjct: 179 FKRAFRMSKATFNMVCEELEPAVMKKNTMLRDAIPVRQRVAVCIWRLATGEPLRMVSKRF 238 Query: 693 GLGISTCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 
GLGISTCHKLVLEVCAAI++VLMPKF+QWPDE+KM Sbjct: 239 GLGISTCHKLVLEVCAAIKTVLMPKFVQWPDESKM 273 >ABC86705.1 R111 [Coffea arabica] CDP03132.1 unnamed protein product [Coffea canephora] Length = 498 Score = 244 bits (622), Expect = 2e-74 Identities = 144/271 (53%), Positives = 172/271 (63%), Gaps = 28/271 (10%) Frame = +3 Query: 69 MEISSI--PFI--SQEDY--SNFYGFFQELDTIDMDPS--------------SIKRRRID 188 MEISS P+ S EDY SN + FFQELD I + + S KR+R+D Sbjct: 1 MEISSFSSPYSGSSSEDYLSSNLFSFFQELDPIAIPTNNTSSSSSSQSNINLSKKRKRVD 60 Query: 189 EIPEIEASKQYPVKEIVSTLILLDEEEKSGQQNWKVESQQHSSLFQSNTQQNSHT----M 356 P + +++ V+T + D + K +Q + E SL + +S + M Sbjct: 61 HEPTSSS-----IQDFVNTFLTFDSDNKQQEQQQEDEFLLFPSLASFSQSPSSTSPMREM 115 Query: 357 NEYNIHFQDHYD----NNXXXXXXXXXXXXHRRLWVKDRSKAWWDLCNSQDFPDEEFKKA 524 E I ++ RRLWVKDRSKAWW+ CNS DFP+EEF+KA Sbjct: 116 KEGAIGTASESKRARRSSPEVEAPEAGTSSQRRLWVKDRSKAWWEHCNSPDFPEEEFRKA 175 Query: 525 FRMSKATFDMICEELDSVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGI 704 FRMSKATFDMIC+EL+SVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLR VSKRFGLGI Sbjct: 176 FRMSKATFDMICDELESVVTKKDTMLRLAIPVRQRVAVCIWRLATGEPLREVSKRFGLGI 235 Query: 705 STCHKLVLEVCAAIRSVLMPKFLQWPDENKM 797 STCHKLVLEVC+AIR+VLMPKFLQWP+E M Sbjct: 236 STCHKLVLEVCSAIRNVLMPKFLQWPNEENM 266