BLASTX nr result
ID: Mentha29_contig00012460
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00012460
         (1244 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EPS70565.1| hypothetical protein M569_04195, partial [Genlise...  348   4e-93
gb|EYU22465.1| hypothetical protein MIMGU_mgv1a007882mg [Mimulus...  337   6e-90
ref|XP_004237084.1| PREDICTED: uncharacterized protein LOC101260...  327   5e-87
ref|XP_006602372.1| PREDICTED: uncharacterized protein LOC100809...  323   7e-86
ref|XP_006602371.1| PREDICTED: uncharacterized protein LOC100809...  323   9e-86
ref|XP_006363529.1| PREDICTED: uncharacterized protein LOC102591...  323   9e-86
ref|XP_006602373.1| PREDICTED: uncharacterized protein LOC100809...  323   1e-85
ref|XP_006363531.1| PREDICTED: uncharacterized protein LOC102591...  322   3e-85
ref|XP_002534525.1| nuclease, putative [Ricinus communis] gi|223...  318   2e-84
ref|XP_006355984.1| PREDICTED: uncharacterized protein LOC102591...  317   9e-84
ref|XP_007153671.1| hypothetical protein PHAVU_003G055000g [Phas...  315   2e-83
ref|XP_003532034.1| PREDICTED: uncharacterized protein LOC100779...  314   6e-83
ref|XP_006585969.1| PREDICTED: uncharacterized protein LOC100779...  313   8e-83
ref|XP_006484311.1| PREDICTED: uncharacterized protein LOC102614...  311   4e-82
ref|XP_002314727.2| hypothetical protein POPTR_0010s10515g [Popu...  305   3e-80
ref|XP_007226366.1| hypothetical protein PRUPE_ppa022484mg [Prun...  301   4e-79
ref|XP_002517391.1| nuclease, putative [Ricinus communis] gi|223...  293   1e-76
ref|XP_002266599.2| PREDICTED: uncharacterized protein LOC100255... 
289 2e-75 ref|XP_002305325.1| RNase H domain-containing family protein [Po... 283 1e-73 ref|XP_006438469.1| hypothetical protein CICLE_v10031867mg [Citr... 282 2e-73 >gb|EPS70565.1| hypothetical protein M569_04195, partial [Genlisea aurea] Length = 245 Score = 348 bits (892), Expect = 4e-93 Identities = 170/242 (70%), Positives = 203/242 (83%), Gaps = 5/242 (2%) Frame = +1 Query: 307 EKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGLKNA 486 +KE F+VVRKGDL+GVY+SL+DCQAQVGTSICDPPVSVFK SMPKDTE YL S GLKNA Sbjct: 4 DKEAFFVVRKGDLIGVYKSLNDCQAQVGTSICDPPVSVFKCNSMPKDTEKYLMSCGLKNA 63 Query: 487 LYSIRASDLTDDLFGTLMACPLQPSSRGGEAPAKKKLNKD-----LQSDYGKSCTLEFDG 651 LYSIRASD+T++LFG L +CP+Q SRG + K + NK L S YG+SCTLEFDG Sbjct: 64 LYSIRASDITEELFGALESCPVQVPSRGETSIHKSESNKKRPQGTLWSQYGRSCTLEFDG 123 Query: 652 ASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRV 831 ASKGNPG+AGAGA+LR DDGS+ICRL EG+G+ATCN AEY+ ILG++YA KGFTN+R Sbjct: 124 ASKGNPGQAGAGAVLRSDDGSLICRLREGLGVATCNVAEYRAFILGLKYALGKGFTNVRA 183 Query: 832 KGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGA 1011 +GDSKLVCMQIQGLW+VK++N+ NL+NEAK+L+ F SFQI+HVLRDLNS+AD QANL Sbjct: 184 RGDSKLVCMQIQGLWRVKNQNISNLFNEAKKLKDSFMSFQIIHVLRDLNSEADEQANLAV 243 Query: 1012 QL 1017 +L Sbjct: 244 KL 245 >gb|EYU22465.1| hypothetical protein MIMGU_mgv1a007882mg [Mimulus guttatus] Length = 392 Score = 337 bits (864), Expect = 6e-90 Identities = 164/243 (67%), Positives = 197/243 (81%), Gaps = 4/243 (1%) Frame = +1 Query: 307 EKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGLKNA 486 +KE F+VVRKGDL+GVY+SL DCQAQVGTSICDPP+SV+K SMPK+TE YL S GLKNA Sbjct: 4 DKEAFFVVRKGDLIGVYKSLKDCQAQVGTSICDPPISVYKGSSMPKETEKYLVSSGLKNA 63 Query: 487 LYSIRASDLTDDLFGTLMACPLQ----PSSRGGEAPAKKKLNKDLQSDYGKSCTLEFDGA 654 LYSIRASDLT+DLFGTL+ACP+Q S E +KK+ + L SDY + CTLEFDGA Sbjct: 64 LYSIRASDLTEDLFGTLVACPVQLPSVKSETSNEPVSKKRSHDALSSDYERFCTLEFDGA 123 Query: 655 SKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVK 834 SKGNPG+AGAGAILR DGS +CR+ EG+GIATCN AEY+ ILG++YA 
KGFT+++V+ Sbjct: 124 SKGNPGQAGAGAILRSVDGSFVCRMREGLGIATCNVAEYRAFILGLKYALRKGFTSVQVR 183 Query: 835 GDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQ 1014 GDSKLVCMQI+GLWKV++ N+ Y EAK L+ RF +F+I HVLRDLNS+AD QANLG Sbjct: 184 GDSKLVCMQIEGLWKVRNPNIATWYEEAKNLKDRFVNFKITHVLRDLNSEADVQANLGVD 243 Query: 1015 LAE 1023 L E Sbjct: 244 LPE 246 >ref|XP_004237084.1| PREDICTED: uncharacterized protein LOC101260715 [Solanum lycopersicum] Length = 369 Score = 327 bits (839), Expect = 5e-87 Identities = 171/288 (59%), Positives = 212/288 (73%), Gaps = 39/288 (13%) Frame = +1 Query: 295 QLTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQG 474 Q+ +++ F+VVRKG+LVGVY++LSDCQ QVG+SICDPPVSV+K +MPKDTE+YL S G Sbjct: 79 QMKEDRDGFFVVRKGNLVGVYKNLSDCQTQVGSSICDPPVSVYKGYAMPKDTEDYLLSCG 138 Query: 475 LKNALYSIRASDLTDDLFGTLMACPLQ--------PSSRGG--EAPAKKKLNKDLQSDY- 621 LKNALYSIRA+DLT+DLFGTL+ CP Q SS+GG E KK+ + S+Y Sbjct: 139 LKNALYSIRAADLTEDLFGTLVPCPFQHMLVSQQPSSSKGGMPEHMTKKRSQDVMWSEYA 198 Query: 622 ----------------------------GKSCTLEFDGASKGNPGKAGAGAILRYDDGSV 717 G+SCTLEFDGASKGNPG AGAGAILR DDGS Sbjct: 199 DVAVISNDDSLTKHVKLDDHKGVQAPLSGQSCTLEFDGASKGNPGLAGAGAILRADDGSF 258 Query: 718 ICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENL 897 ICRL EG+G+AT N AEY+ ILG+ YA KGFT+IRV+GDSKLVCMQIQGLWKVK++N+ Sbjct: 259 ICRLREGLGVATNNAAEYRAIILGLNYALSKGFTSIRVQGDSKLVCMQIQGLWKVKNQNI 318 Query: 898 LNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQED 1041 +LY +AK+L+ RF SF+I+HVLR+ NSDADAQAN+ +LA+G+IQE+ Sbjct: 319 SSLYEQAKQLKDRFLSFRIIHVLRESNSDADAQANIAVELADGQIQEE 366 >ref|XP_006602372.1| PREDICTED: uncharacterized protein LOC100809644 isoform X2 [Glycine max] Length = 351 Score = 323 bits (829), Expect = 7e-86 Identities = 168/284 (59%), Positives = 210/284 (73%), Gaps = 30/284 (10%) Frame = +1 Query: 286 DPKQLTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLR 465 +P+ + EK+ FYVVRKGD+VG+Y SL+D QAQVG+S+C+PPVSVFK S+ KDTE YL Sbjct: 68 EPEAMKQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVFKGYSLSKDTEEYLV 127 Query: 
466 SQGLKNALYSIRASDLTDDLFGTLMACPLQ-PSSR---GGEAPAKKK------------- 594 S GLKNALY+IRA+DL +DLFG L+ CPLQ PS++ + +KK+ Sbjct: 128 SHGLKNALYTIRATDLKEDLFGMLVPCPLQEPSTKESTSNKDVSKKRSLGVLGQDEKVIS 187 Query: 595 ---LNKDLQSDYG----------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCE 735 L K ++ D+ ++C +EFDGASKGNPGKAGAGAILR +DGS+ICRL E Sbjct: 188 EDPLRKQVKLDHAAVAEAPLHATQTCFVEFDGASKGNPGKAGAGAILRANDGSLICRLRE 247 Query: 736 GVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNE 915 GVGIAT N AEY+ ILGM+YA +KGFT IR++GDSKLVCMQI G WKVK+ENL LYN Sbjct: 248 GVGIATNNAAEYRAMILGMKYALKKGFTGIRIQGDSKLVCMQIDGSWKVKNENLSTLYNV 307 Query: 916 AKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQEDCI 1047 AKEL+ +F SFQI HVLR+ NSDADAQANL LA+G++QE+C+ Sbjct: 308 AKELKDKFSSFQISHVLRNFNSDADAQANLAINLADGQVQEECV 351 >ref|XP_006602371.1| PREDICTED: uncharacterized protein LOC100809644 isoform X1 [Glycine max] Length = 352 Score = 323 bits (828), Expect = 9e-86 Identities = 168/285 (58%), Positives = 210/285 (73%), Gaps = 31/285 (10%) Frame = +1 Query: 286 DPKQLTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLR 465 +P+ + EK+ FYVVRKGD+VG+Y SL+D QAQVG+S+C+PPVSVFK S+ KDTE YL Sbjct: 68 EPEAMKQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVFKGYSLSKDTEEYLV 127 Query: 466 SQGLKNALYSIRASDLTDDLFGTLMACPLQ-PSSR---GGEAPAKKK------------- 594 S GLKNALY+IRA+DL +DLFG L+ CPLQ PS++ + +KK+ Sbjct: 128 SHGLKNALYTIRATDLKEDLFGMLVPCPLQEPSTKESTSNKDVSKKRSLGVLGQDEQKVI 187 Query: 595 ----LNKDLQSDYG----------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLC 732 L K ++ D+ ++C +EFDGASKGNPGKAGAGAILR +DGS+ICRL Sbjct: 188 SEDPLRKQVKLDHAAVAEAPLHATQTCFVEFDGASKGNPGKAGAGAILRANDGSLICRLR 247 Query: 733 EGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYN 912 EGVGIAT N AEY+ ILGM+YA +KGFT IR++GDSKLVCMQI G WKVK+ENL LYN Sbjct: 248 EGVGIATNNAAEYRAMILGMKYALKKGFTGIRIQGDSKLVCMQIDGSWKVKNENLSTLYN 307 Query: 913 EAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQEDCI 1047 AKEL+ +F SFQI HVLR+ NSDADAQANL LA+G++QE+C+ Sbjct: 308 VAKELKDKFSSFQISHVLRNFNSDADAQANLAINLADGQVQEECV 352 >ref|XP_006363529.1| 
PREDICTED: uncharacterized protein LOC102591092 isoform X1 [Solanum tuberosum] Length = 367 Score = 323 bits (828), Expect = 9e-86 Identities = 169/286 (59%), Positives = 208/286 (72%), Gaps = 37/286 (12%) Frame = +1 Query: 295 QLTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQG 474 Q+ +++ F+VVRKG+LVGVY++LSDCQ QVG+SICDPPVSV+K +MPKDTE YL S G Sbjct: 79 QMKEDRDGFFVVRKGNLVGVYKNLSDCQTQVGSSICDPPVSVYKGYAMPKDTEEYLLSCG 138 Query: 475 LKNALYSIRASDLTDDLFGTLMACPLQP--SSRGG--EAPAKKKLNKDLQSDYG------ 624 LKNALYSIRA+DLT+DLFGTL+ CP Q SS+GG E KK+ + S+Y Sbjct: 139 LKNALYSIRAADLTEDLFGTLVPCPFQQPSSSKGGIPEHMTKKRSQDVMWSEYTDAAGSA 198 Query: 625 ---------------------------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVIC 723 +SCTLEFDGASKGNPG AGAGA+LR DDGS IC Sbjct: 199 VISNDDSLRKHVKLDDHKGDQALPSGQQSCTLEFDGASKGNPGLAGAGAVLRADDGSFIC 258 Query: 724 RLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLN 903 RL EG+G+AT N AEY+ ILG+ YA KGFT+IRV+GDSKLVCMQIQGLWKVK++N+ Sbjct: 259 RLREGLGVATNNAAEYRAIILGLNYALSKGFTSIRVQGDSKLVCMQIQGLWKVKNQNIST 318 Query: 904 LYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQED 1041 LY +AK+L+ RF SF+I+HVLR+ NSDADAQAN+ +LA G+IQE+ Sbjct: 319 LYEQAKQLKDRFLSFRIIHVLRESNSDADAQANIAVELANGQIQEE 364 >ref|XP_006602373.1| PREDICTED: uncharacterized protein LOC100809644 isoform X3 [Glycine max] Length = 351 Score = 323 bits (827), Expect = 1e-85 Identities = 167/284 (58%), Positives = 207/284 (72%), Gaps = 30/284 (10%) Frame = +1 Query: 286 DPKQLTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLR 465 +P+ + EK+ FYVVRKGD+VG+Y SL+D QAQVG+S+C+PPVSVFK S+ KDTE YL Sbjct: 68 EPEAMKQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVFKGYSLSKDTEEYLV 127 Query: 466 SQGLKNALYSIRASDLTDDLFGTLMACPLQ-PSSRGG----------------------- 573 S GLKNALY+IRA+DL +DLFG L+ CPLQ PS++ Sbjct: 128 SHGLKNALYTIRATDLKEDLFGMLVPCPLQEPSTKESTSNKDVSKKRSLGVLGQDEQKVI 187 Query: 574 -EAPAKKKLNKDLQSD-----YGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCE 735 E P +K++ D + + +C +EFDGASKGNPGKAGAGAILR +DGS+ICRL E Sbjct: 188 
SEDPLRKQVKLDHAAVAEAPLHATTCFVEFDGASKGNPGKAGAGAILRANDGSLICRLRE 247 Query: 736 GVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNE 915 GVGIAT N AEY+ ILGM+YA +KGFT IR++GDSKLVCMQI G WKVK+ENL LYN Sbjct: 248 GVGIATNNAAEYRAMILGMKYALKKGFTGIRIQGDSKLVCMQIDGSWKVKNENLSTLYNV 307 Query: 916 AKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQEDCI 1047 AKEL+ +F SFQI HVLR+ NSDADAQANL LA+G++QE+C+ Sbjct: 308 AKELKDKFSSFQISHVLRNFNSDADAQANLAINLADGQVQEECV 351 >ref|XP_006363531.1| PREDICTED: uncharacterized protein LOC102591092 isoform X3 [Solanum tuberosum] gi|565395816|ref|XP_006363532.1| PREDICTED: uncharacterized protein LOC102591092 isoform X4 [Solanum tuberosum] Length = 288 Score = 322 bits (824), Expect = 3e-85 Identities = 168/282 (59%), Positives = 206/282 (73%), Gaps = 37/282 (13%) Frame = +1 Query: 307 EKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGLKNA 486 +++ F+VVRKG+LVGVY++LSDCQ QVG+SICDPPVSV+K +MPKDTE YL S GLKNA Sbjct: 4 DRDGFFVVRKGNLVGVYKNLSDCQTQVGSSICDPPVSVYKGYAMPKDTEEYLLSCGLKNA 63 Query: 487 LYSIRASDLTDDLFGTLMACPLQP--SSRGG--EAPAKKKLNKDLQSDYG---------- 624 LYSIRA+DLT+DLFGTL+ CP Q SS+GG E KK+ + S+Y Sbjct: 64 LYSIRAADLTEDLFGTLVPCPFQQPSSSKGGIPEHMTKKRSQDVMWSEYTDAAGSAVISN 123 Query: 625 -----------------------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCE 735 +SCTLEFDGASKGNPG AGAGA+LR DDGS ICRL E Sbjct: 124 DDSLRKHVKLDDHKGDQALPSGQQSCTLEFDGASKGNPGLAGAGAVLRADDGSFICRLRE 183 Query: 736 GVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNE 915 G+G+AT N AEY+ ILG+ YA KGFT+IRV+GDSKLVCMQIQGLWKVK++N+ LY + Sbjct: 184 GLGVATNNAAEYRAIILGLNYALSKGFTSIRVQGDSKLVCMQIQGLWKVKNQNISTLYEQ 243 Query: 916 AKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQED 1041 AK+L+ RF SF+I+HVLR+ NSDADAQAN+ +LA G+IQE+ Sbjct: 244 AKQLKDRFLSFRIIHVLRESNSDADAQANIAVELANGQIQEE 285 >ref|XP_002534525.1| nuclease, putative [Ricinus communis] gi|223525106|gb|EEF27855.1| nuclease, putative [Ricinus communis] Length = 262 Score = 318 bits (816), Expect = 2e-84 Identities = 158/259 (61%), Positives = 199/259 (76%), Gaps = 9/259 (3%) 
Frame = +1 Query: 298 LTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGL 477 + EK+ F+VVRKGD+VGVY+S +DCQAQVG+S+CDPPVSV+K S+ KDTE YL S+GL Sbjct: 1 MEQEKDAFFVVRKGDVVGVYKSFTDCQAQVGSSVCDPPVSVYKGYSLSKDTEEYLVSRGL 60 Query: 478 KNALYSIRASDLTDDLFGTLMACPLQP---SSRGGEAPAKKKLNKDLQSDY------GKS 630 +NALY+IRA DL +DLFGTL+ CP Q S+ G P +K D Q++ S Sbjct: 61 QNALYAIRAQDLKEDLFGTLVPCPFQETDGSASGLTDPLRKHAKLDNQTEAQALYYDDDS 120 Query: 631 CTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEK 810 C LEFDGASKGNPG AGAGA+LR DG +ICRL EG+G T N AEY+ ILGM+YA +K Sbjct: 121 CILEFDGASKGNPGPAGAGALLRTTDGRIICRLREGLGQVTNNVAEYRAMILGMKYALKK 180 Query: 811 GFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDAD 990 G+T IRV+GDSKLVC Q+QGLWKVK +++ NLY +AK+L+ +F SFQI HVLR LNS+AD Sbjct: 181 GYTKIRVQGDSKLVCSQVQGLWKVKHKDMTNLYEQAKQLKDKFASFQISHVLRALNSEAD 240 Query: 991 AQANLGAQLAEGEIQEDCI 1047 AQANL QLA+G++QE+C+ Sbjct: 241 AQANLAIQLADGQVQEECL 259 >ref|XP_006355984.1| PREDICTED: uncharacterized protein LOC102591820 [Solanum tuberosum] Length = 613 Score = 317 bits (811), Expect = 9e-84 Identities = 180/365 (49%), Positives = 231/365 (63%), Gaps = 47/365 (12%) Frame = +1 Query: 88 MISSFHGRCSTILSRASNLVSSSPSCGYIH-SCKASFXXXXXXXXXXXX------VQCYX 246 M S FH + IL+R S+LV S CG+ S K S V CY Sbjct: 1 MNSLFHACSTAILTRTSHLVVKSSICGFPSLSWKTSVGHARIRKVDSNLYLNRVSVCCYS 60 Query: 247 XXXXXXXXXXXXXDPK--QLTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSV 420 ++ E++ F+VVRKGDLVGVY++LSDCQ QVG+SICDPPVSV Sbjct: 61 SKKHSGDSSPSQNSDSTPEMKEERDGFFVVRKGDLVGVYKNLSDCQTQVGSSICDPPVSV 120 Query: 421 FKSCSMPKDTENYLRSQGLKNALYSIRASDLTDDLFGTLMACPLQP--SSRGGEAP--AK 588 +K +MPKDTE YL S GLKNALYSIRA+DLT+DLFGTL+ CP Q SS+ G + K Sbjct: 121 YKGYAMPKDTEEYLLSCGLKNALYSIRAADLTEDLFGTLVPCPFQQPSSSKSGTSDHLPK 180 Query: 589 KKLNKDLQSDYG----------------------------------KSCTLEFDGASKGN 666 K+ + + S+Y +SCTLEFDGASKGN Sbjct: 181 KRPQEAVWSEYADAVGSTVVSNDSARKHVKLEQQKGDQVLALPSGQRSCTLEFDGASKGN 240 Query: 667 PGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSK 846 
PG+AGAGA++R DDGS+ RL EG+G+AT N AEY+ ILG+++A +GFT+IRV+GDSK Sbjct: 241 PGQAGAGAVIRADDGSMTLRLREGLGVATSNHAEYRAFILGLKHALREGFTSIRVQGDSK 300 Query: 847 LVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEG 1026 LVCMQIQGLWKVK++N+ ++ +AK+L+ RF SF+I+HVLR+ NSDAD QANL +L EG Sbjct: 301 LVCMQIQGLWKVKNQNIAVVFEQAKQLKERFLSFRIIHVLRESNSDADQQANLAVELPEG 360 Query: 1027 EIQED 1041 +IQE+ Sbjct: 361 QIQEE 365 >ref|XP_007153671.1| hypothetical protein PHAVU_003G055000g [Phaseolus vulgaris] gi|561027025|gb|ESW25665.1| hypothetical protein PHAVU_003G055000g [Phaseolus vulgaris] Length = 359 Score = 315 bits (808), Expect = 2e-83 Identities = 166/284 (58%), Positives = 205/284 (72%), Gaps = 32/284 (11%) Frame = +1 Query: 286 DPKQLTMEKEE--FYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENY 459 +P+ MEKE+ FYVVRKGD+VG+Y SL+D QAQVG+S+C+PPVSV+K S+ KDTE Y Sbjct: 74 EPEAPVMEKEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVYKGYSLSKDTEEY 133 Query: 460 LRSQGLKNALYSIRASDLTDDLFGTLMACPLQ-PSSRGG--------------------- 573 L S GLKNALY+IRA+DL +DLFG L+ CP Q PS++ G Sbjct: 134 LASHGLKNALYTIRAADLKEDLFGMLIPCPFQEPSTKEGTSNMDVPKKRSLRVPGQDEKA 193 Query: 574 --EAPAKKKLN------KDLQSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRL 729 E P +KK+ + S ++CTLEFDGASKGNPGK+GAGA+LR DGS+ICRL Sbjct: 194 VSEDPLRKKVKLEHNAVAEAPSHSTRTCTLEFDGASKGNPGKSGAGAVLRAIDGSLICRL 253 Query: 730 CEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLY 909 EGVG+AT N AEY+ ILGM+YA +KGFT IR++GDSKLVCMQI G WKVK+ENL LY Sbjct: 254 REGVGVATNNAAEYRAMILGMKYALKKGFTGIRIQGDSKLVCMQIDGSWKVKNENLSTLY 313 Query: 910 NEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQED 1041 AKEL+ +F SFQI HVLR+ NSDADAQANL LA+G++QE+ Sbjct: 314 KVAKELKDKFSSFQINHVLRNFNSDADAQANLAINLADGQVQEE 357 >ref|XP_003532034.1| PREDICTED: uncharacterized protein LOC100779114 isoform X1 [Glycine max] Length = 356 Score = 314 bits (804), Expect = 6e-83 Identities = 162/280 (57%), Positives = 202/280 (72%), Gaps = 30/280 (10%) Frame = +1 Query: 298 LTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGL 477 + EK+ 
FYVVRKGD+VG+Y SL+D QAQVG+S+C+PPVSV+K S+ KDTE YL S GL Sbjct: 77 MEQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVYKGYSLSKDTEEYLVSHGL 136 Query: 478 KNALYSIRASDLTDDLFGTLMACPLQ-PSSRGG-----------------------EAPA 585 KNALY+IRA+DL +DLFG L+ CP Q PS++ G E P Sbjct: 137 KNALYTIRATDLKEDLFGMLVPCPFQEPSTKEGTSNKDVSKQRSLGVLAQDEKVISEDPF 196 Query: 586 KKKLN------KDLQSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGI 747 +K++ + S ++C +EFDGASKGNPGKAGAGAILR +DGS+ICR+ EGVGI Sbjct: 197 RKQVKLEYAEVAEAPSHATRTCFVEFDGASKGNPGKAGAGAILRANDGSLICRVREGVGI 256 Query: 748 ATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKEL 927 AT N AEY+ ILGM+YA +KGFT I ++GDSKLVCMQI G WKVK+ENL LYN AKEL Sbjct: 257 ATNNAAEYRAMILGMKYALKKGFTGICIQGDSKLVCMQIDGSWKVKNENLFTLYNVAKEL 316 Query: 928 ESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQEDCI 1047 + +F SFQI HVLR+ NSDADAQANL L +G++QE+C+ Sbjct: 317 KDKFSSFQISHVLRNFNSDADAQANLAINLVDGQVQEECV 356 >ref|XP_006585969.1| PREDICTED: uncharacterized protein LOC100779114 isoform X2 [Glycine max] Length = 357 Score = 313 bits (803), Expect = 8e-83 Identities = 162/281 (57%), Positives = 202/281 (71%), Gaps = 31/281 (11%) Frame = +1 Query: 298 LTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGL 477 + EK+ FYVVRKGD+VG+Y SL+D QAQVG+S+C+PPVSV+K S+ KDTE YL S GL Sbjct: 77 MEQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVYKGYSLSKDTEEYLVSHGL 136 Query: 478 KNALYSIRASDLTDDLFGTLMACPLQ-PSSRGG------------------------EAP 582 KNALY+IRA+DL +DLFG L+ CP Q PS++ G E P Sbjct: 137 KNALYTIRATDLKEDLFGMLVPCPFQEPSTKEGTSNKDVSKQRSLGVLAQDEQKVISEDP 196 Query: 583 AKKKLN------KDLQSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVG 744 +K++ + S ++C +EFDGASKGNPGKAGAGAILR +DGS+ICR+ EGVG Sbjct: 197 FRKQVKLEYAEVAEAPSHATRTCFVEFDGASKGNPGKAGAGAILRANDGSLICRVREGVG 256 Query: 745 IATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKE 924 IAT N AEY+ ILGM+YA +KGFT I ++GDSKLVCMQI G WKVK+ENL LYN AKE Sbjct: 257 IATNNAAEYRAMILGMKYALKKGFTGICIQGDSKLVCMQIDGSWKVKNENLFTLYNVAKE 316 Query: 925 LESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQEDCI 1047 L+ +F SFQI 
HVLR+ NSDADAQANL L +G++QE+C+ Sbjct: 317 LKDKFSSFQISHVLRNFNSDADAQANLAINLVDGQVQEECV 357 >ref|XP_006484311.1| PREDICTED: uncharacterized protein LOC102614852 [Citrus sinensis] Length = 558 Score = 311 bits (797), Expect = 4e-82 Identities = 164/295 (55%), Positives = 207/295 (70%), Gaps = 25/295 (8%) Frame = +1 Query: 232 VQCYXXXXXXXXXXXXXXDPKQLTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPP 411 +QCY +P Q+ K+EF+VVRKGDLVGVY+S ++CQAQ+G+SIC PP Sbjct: 59 LQCYSSSAKKPRSRKLKTEP-QMKQGKDEFFVVRKGDLVGVYKSFTECQAQLGSSICHPP 117 Query: 412 VSVFKSCSMPKDTENYLRSQGLKNALYSIRASDLTDDLFGTLMACPLQ-PSSR------- 567 VSV+K ++PK TE YL S GLKNALY+IRA+DLT+DLFGTLM C LQ P+S+ Sbjct: 118 VSVYKGNALPKGTEEYLASHGLKNALYTIRAADLTEDLFGTLMPCTLQDPTSKKRPQDPI 177 Query: 568 ----GGEA--------PAKKKLNKDLQSD-----YGKSCTLEFDGASKGNPGKAGAGAIL 696 G E P +K + DL ++ Y +SC +EFDGASKGNPG AGA A+L Sbjct: 178 EPEIGYELGSTSVLADPLRKHVKLDLDAESKAASYHRSCIIEFDGASKGNPGPAGAAAVL 237 Query: 697 RYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLW 876 R DDGS+IC+L EGVGIAT N AEY+G ILG++YA EKGF+NIRV+GDSKLVCMQ+ G W Sbjct: 238 RTDDGSLICKLREGVGIATSNVAEYRGLILGLKYALEKGFSNIRVQGDSKLVCMQVAGSW 297 Query: 877 KVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQED 1041 K K + + L EA+ L+ +F SFQI HVLR+LNS+ADAQA L LA+GE+ E+ Sbjct: 298 KTKHQGMAKLCGEARRLKDKFLSFQISHVLRNLNSEADAQATLAVGLADGEVAEE 352 >ref|XP_002314727.2| hypothetical protein POPTR_0010s10515g [Populus trichocarpa] gi|550329518|gb|EEF00898.2| hypothetical protein POPTR_0010s10515g [Populus trichocarpa] Length = 364 Score = 305 bits (780), Expect = 3e-80 Identities = 160/287 (55%), Positives = 201/287 (70%), Gaps = 35/287 (12%) Frame = +1 Query: 286 DPKQLTM---EKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTEN 456 DP+ T+ E + F+VVRKGD+VGVY++ +DCQAQVG+SICDPPVSV+K S+ KD+E Sbjct: 76 DPQPATVMDHENDAFFVVRKGDVVGVYKNFADCQAQVGSSICDPPVSVYKGYSLSKDSEA 135 Query: 457 YLRSQGLKNALYSIRASDLTDDLFGTLMACPLQ-PSSRGGEA---PAKKKLNKDL----- 609 YL S GL+NALY++RA+DL +DLFG LM CP Q P+S E KK+ + L Sbjct: 136 
YLVSHGLQNALYTVRAADLKEDLFGVLMPCPFQQPASSDAETLKNDTKKRSREVLGSEIT 195 Query: 610 -----------------------QSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVI 720 Q+ +SC LEFDGASKGNPG+AGAGA+LR DDGS+I Sbjct: 196 DTAGSASMMSKHANLDNQAECQAQNSNSRSCLLEFDGASKGNPGQAGAGAVLRTDDGSLI 255 Query: 721 CRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLL 900 CRL EG+GIAT N AEY+ +LGM+YA +KG+T I+VKGDSKLVCMQIQG WK K N+ Sbjct: 256 CRLREGLGIATNNMAEYRAILLGMKYALQKGYTKIQVKGDSKLVCMQIQGSWKAKHVNIT 315 Query: 901 NLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLAEGEIQED 1041 NL EAK+L++ F SF I HVLR+ NS+ADAQANL LA+GE+QE+ Sbjct: 316 NLCTEAKKLKNSFLSFHISHVLREFNSEADAQANLAVHLADGEVQEE 362 >ref|XP_007226366.1| hypothetical protein PRUPE_ppa022484mg [Prunus persica] gi|462423302|gb|EMJ27565.1| hypothetical protein PRUPE_ppa022484mg [Prunus persica] Length = 521 Score = 301 bits (771), Expect = 4e-79 Identities = 151/254 (59%), Positives = 191/254 (75%), Gaps = 8/254 (3%) Frame = +1 Query: 304 MEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGLKN 483 +EK+ FYVVRKGD+VGVY+S SDCQAQ+ +SI DPPVSV+K S+PK+TE YL S GL N Sbjct: 86 LEKDAFYVVRKGDIVGVYKSFSDCQAQLSSSIFDPPVSVYKGYSLPKETEEYLGSCGLTN 145 Query: 484 ALYSIRASDLTDDLFGTLMACPLQPSSRGGEAPAKKKLNKDLQSDYGKS--------CTL 639 A+Y+I A+DL DD+FG LM CP Q G + A L K ++ D+ CTL Sbjct: 146 AIYTIAAADLKDDIFGKLMHCPFQEVI-GSPSIADDPLRKHVKIDHSTQSLPLDSGFCTL 204 Query: 640 EFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFT 819 EFDGASKGNPG AGAGA+LR DDGS+IC+L EG+G+ T N AEY+ ILG++YA +KGFT Sbjct: 205 EFDGASKGNPGLAGAGAVLRADDGSLICKLHEGLGVRTNNVAEYRALILGLKYALKKGFT 264 Query: 820 NIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQA 999 IRVKGDSKLVCMQ+QGLWKV+++N+ +L E KEL+ +F SF+I HVLR+LNS+ADAQA Sbjct: 265 KIRVKGDSKLVCMQVQGLWKVRNQNMSDLCEEVKELKDKFLSFEISHVLRELNSEADAQA 324 Query: 1000 NLGAQLAEGEIQED 1041 NL +L ++ D Sbjct: 325 NLAVRLTGEAVKSD 338 >ref|XP_002517391.1| nuclease, putative [Ricinus communis] gi|223543402|gb|EEF44933.1| nuclease, putative [Ricinus communis] Length = 255 Score = 293 bits (749), 
Expect = 1e-76 Identities = 146/249 (58%), Positives = 188/249 (75%), Gaps = 3/249 (1%) Frame = +1 Query: 307 EKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGLKNA 486 EK+ FYVVRKGD+VG+Y+SL DCQAQVG+S+C+P VSVFK + KD E+YL S G+K+A Sbjct: 4 EKDVFYVVRKGDVVGIYKSLRDCQAQVGSSVCNPSVSVFKGYGLAKDAEDYLVSHGIKDA 63 Query: 487 LYSIRASDLTDDLFGTLMACPLQ-PSSRGGEAPAKKKLNKDLQSDYGK--SCTLEFDGAS 657 +SI A+D+ DLFG L+ CP Q P+ G+A K K + G SC LEFDGAS Sbjct: 64 AFSIHATDVQPDLFGKLVPCPFQQPAFSEGKALNKDSSPKSSRGVLGSMSSCILEFDGAS 123 Query: 658 KGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKG 837 KGNPG AGAGA+LR +DGS++C L EG+G AT N AEY+ ILG+++A KGF +IRV+G Sbjct: 124 KGNPGPAGAGAVLRAEDGSMVCLLREGLGTATNNVAEYRAVILGLKHALRKGFKHIRVRG 183 Query: 838 DSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQL 1017 DS LV MQI+GLWK+KS+N+ +L EAKEL+++F SFQI HVLR+ NS+AD QANL L Sbjct: 184 DSNLVVMQIKGLWKIKSQNVADLCKEAKELKNKFLSFQIEHVLREFNSEADTQANLAVNL 243 Query: 1018 AEGEIQEDC 1044 +G+I+EDC Sbjct: 244 KDGQIEEDC 252 >ref|XP_002266599.2| PREDICTED: uncharacterized protein LOC100255243 [Vitis vinifera] Length = 453 Score = 289 bits (740), Expect = 2e-75 Identities = 149/249 (59%), Positives = 188/249 (75%), Gaps = 9/249 (3%) Frame = +1 Query: 307 EKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPPVSVFKSCSMPKDTENYLRSQGLKNA 486 EK+ F+VVRKGD+VGVY++ SDCQAQVG+SICDPPVSV+K +PKDTE YL S+GL+NA Sbjct: 4 EKDAFFVVRKGDVVGVYKTFSDCQAQVGSSICDPPVSVYKGYYLPKDTEEYLVSRGLRNA 63 Query: 487 LYSIRASDLTDDLFGTLMACPLQPSSRGGEA---PAKKKLNKD------LQSDYGKSCTL 639 LY+IRA+DL +DLFG LM C Q + P K+ + D L SD +SC + Sbjct: 64 LYTIRAADLKEDLFGKLMPCAFQGAVESRPITTDPLKEHIKLDRVEAQALFSDC-RSCVV 122 Query: 640 EFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFT 819 EFDGASKGNPG AGA A+LR D G VICR+ EG+G+AT N AEY+ ILG++YA +KG+T Sbjct: 123 EFDGASKGNPGPAGAAAVLRSDSGRVICRVREGLGLATNNVAEYQAMILGLKYALKKGYT 182 Query: 820 NIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQA 999 +IRV+GDSKLVCMQ+QGLWK +++N+ L EAK+L++ F S +I HVLR LNS+ADAQA Sbjct: 183 
SIRVQGDSKLVCMQVQGLWKARNKNMSILCKEAKKLKNEFLSVEINHVLRGLNSEADAQA 242 Query: 1000 NLGAQLAEG 1026 NL LA G Sbjct: 243 NLAVHLAGG 251 >ref|XP_002305325.1| RNase H domain-containing family protein [Populus trichocarpa] gi|222848289|gb|EEE85836.1| RNase H domain-containing family protein [Populus trichocarpa] Length = 257 Score = 283 bits (724), Expect = 1e-73 Identities = 144/254 (56%), Positives = 184/254 (72%), Gaps = 7/254 (2%) Frame = +1 Query: 307 EKEEFYVVRKGDLVGVYRSLSDCQAQV-GTSICDPPVSVFKSCSMPKDTENYLRSQGLKN 483 EK+ FYVVRKGD++GVY + SDCQ Q +S+C+P VSVFK +PK+ + YL S GL N Sbjct: 4 EKDAFYVVRKGDIIGVYNNFSDCQLQAQSSSVCNPSVSVFKGYGLPKEAKEYLSSHGLNN 63 Query: 484 ALYSIRASDLTDDLFGTLMACPLQPSSRGGEAPA------KKKLNKDLQSDYGKSCTLEF 645 A YSI+A D+ +DLFG L+ CP Q + A K+L + L+S SC LEF Sbjct: 64 AAYSIQAPDVQNDLFGKLLPCPFQEPASSFRAKELDNNFPPKRLPQPLESI--PSCILEF 121 Query: 646 DGASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNI 825 DGASKGNPG AGAGA+LR +DGS++CRL EG+GIAT N AEY+ +LG+++A +KGF I Sbjct: 122 DGASKGNPGPAGAGAVLRAEDGSMVCRLREGLGIATNNVAEYRAVLLGLKHALKKGFKYI 181 Query: 826 RVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANL 1005 V+GDS LVCMQIQGLWK+K++NL +L EAKEL+ F SFQI HV R+ N +AD QANL Sbjct: 182 CVQGDSNLVCMQIQGLWKLKNQNLADLCKEAKELKDMFTSFQIKHVPREFNFEADVQANL 241 Query: 1006 GAQLAEGEIQEDCI 1047 A L +G+I+EDCI Sbjct: 242 AANLRDGQIEEDCI 255 >ref|XP_006438469.1| hypothetical protein CICLE_v10031867mg [Citrus clementina] gi|557540665|gb|ESR51709.1| hypothetical protein CICLE_v10031867mg [Citrus clementina] Length = 372 Score = 282 bits (722), Expect = 2e-73 Identities = 149/271 (54%), Positives = 188/271 (69%), Gaps = 25/271 (9%) Frame = +1 Query: 232 VQCYXXXXXXXXXXXXXXDPKQLTMEKEEFYVVRKGDLVGVYRSLSDCQAQVGTSICDPP 411 +QCY +P Q+ K+EF+VVRKGDLVGVY+S ++CQAQ+G+SIC PP Sbjct: 59 LQCYSSSAKKPRSRKLKTEP-QMKQGKDEFFVVRKGDLVGVYKSFTECQAQLGSSICHPP 117 Query: 412 VSVFKSCSMPKDTENYLRSQGLKNALYSIRASDLTDDLFGTLMACPLQ-PSSR------- 567 VSV+K ++PK TE YL S GLKNALY+IRA+DLT+DLFG+LM C LQ P+S+ Sbjct: 118 
VSVYKGNALPKGTEEYLASHGLKNALYTIRAADLTEDLFGSLMPCTLQDPTSKKRPQDTI 177 Query: 568 ----GGEA--------PAKKKLNKDLQSD-----YGKSCTLEFDGASKGNPGKAGAGAIL 696 G E P +K + DL ++ Y +SC +EFDGASKGNPG AGA A+L Sbjct: 178 EPEIGYELGSTSVLADPLRKHVKLDLDAESKAASYHRSCVIEFDGASKGNPGPAGAAAVL 237 Query: 697 RYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLW 876 R DDGS+IC+L EGVGIAT N AEY+G ILG++YA EKGF+NIRV+GDSKLVCMQ+ G W Sbjct: 238 RTDDGSLICKLREGVGIATSNVAEYRGLILGLKYALEKGFSNIRVQGDSKLVCMQVAGSW 297 Query: 877 KVKSENLLNLYNEAKELESRFHSFQIMHVLR 969 K K + + L EA+ L+ +F SFQI HVLR Sbjct: 298 KTKHQGMAKLCGEARRLKDKFLSFQISHVLR 328