BLASTX nr result

ID: Mentha22_contig00005988 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00005988
         (980 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS70565.1| hypothetical protein M569_04195, partial [Genlise...   345   2e-92
gb|EYU22465.1| hypothetical protein MIMGU_mgv1a007882mg [Mimulus...   337   6e-90
ref|XP_004237084.1| PREDICTED: uncharacterized protein LOC101260...   319   1e-84
ref|XP_006363531.1| PREDICTED: uncharacterized protein LOC102591...   315   1e-83
ref|XP_006363529.1| PREDICTED: uncharacterized protein LOC102591...   315   1e-83
ref|XP_006602372.1| PREDICTED: uncharacterized protein LOC100809...   310   4e-82
ref|XP_006602371.1| PREDICTED: uncharacterized protein LOC100809...   310   6e-82
ref|XP_006602373.1| PREDICTED: uncharacterized protein LOC100809...   310   7e-82
ref|XP_007153671.1| hypothetical protein PHAVU_003G055000g [Phas...   306   6e-81
ref|XP_002534525.1| nuclease, putative [Ricinus communis] gi|223...   306   1e-80
ref|XP_003532034.1| PREDICTED: uncharacterized protein LOC100779...   301   3e-79
ref|XP_006585969.1| PREDICTED: uncharacterized protein LOC100779...   300   5e-79
ref|XP_006484311.1| PREDICTED: uncharacterized protein LOC102614...   300   8e-79
ref|XP_007226366.1| hypothetical protein PRUPE_ppa022484mg [Prun...   298   3e-78
ref|XP_002314727.2| hypothetical protein POPTR_0010s10515g [Popu...   292   1e-76
ref|XP_007044529.1| RNase H family protein, putative isoform 3 [...   292   2e-76
ref|XP_002266599.2| PREDICTED: uncharacterized protein LOC100255...   286   9e-75
gb|EXC04052.1| Uncharacterized protein L484_011032 [Morus notabi...   283   7e-74
ref|XP_002517391.1| nuclease, putative [Ricinus communis] gi|223...   280   5e-73
ref|XP_006438469.1| hypothetical protein CICLE_v10031867mg [Citr...   279   1e-72

>gb|EPS70565.1| hypothetical protein M569_04195, partial [Genlisea aurea]
          Length = 245

 Score =  345 bits (884), Expect = 2e-92
 Identities = 169/242 (69%), Positives = 202/242 (83%), Gaps = 5/242 (2%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           +KE F+VVRKGDL+GVY+SL+DCQ QVGTSICDPPVSVFK  SMPKDTE YL S GLKNA
Sbjct: 4   DKEAFFVVRKGDLIGVYKSLNDCQAQVGTSICDPPVSVFKCNSMPKDTEKYLMSCGLKNA 63

Query: 535 LYSIRASDLTDDLFGTLVACPLQPSSRGGEAPAKKKLNKD-----LQSDYGKSCTLEFDG 371
           LYSIRASD+T++LFG L +CP+Q  SRG  +  K + NK      L S YG+SCTLEFDG
Sbjct: 64  LYSIRASDITEELFGALESCPVQVPSRGETSIHKSESNKKRPQGTLWSQYGRSCTLEFDG 123

Query: 370 ASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRV 191
           ASKGNPG+AGAGA+LR DDGS+ICRL EG+G+ATCN AEY+  ILG++YA  KGFTN+R 
Sbjct: 124 ASKGNPGQAGAGAVLRSDDGSLICRLREGLGVATCNVAEYRAFILGLKYALGKGFTNVRA 183

Query: 190 KGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGA 11
           +GDSKLVCMQIQGLW+VK++N+ NL+NEAK+L+  F SFQI+HVLRDLNS+AD QANL  
Sbjct: 184 RGDSKLVCMQIQGLWRVKNQNISNLFNEAKKLKDSFMSFQIIHVLRDLNSEADEQANLAV 243

Query: 10  QL 5
           +L
Sbjct: 244 KL 245


>gb|EYU22465.1| hypothetical protein MIMGU_mgv1a007882mg [Mimulus guttatus]
          Length = 392

 Score =  337 bits (863), Expect = 6e-90
 Identities = 164/241 (68%), Positives = 196/241 (81%), Gaps = 4/241 (1%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           +KE F+VVRKGDL+GVY+SL DCQ QVGTSICDPP+SV+KG SMPK+TE YL S GLKNA
Sbjct: 4   DKEAFFVVRKGDLIGVYKSLKDCQAQVGTSICDPPISVYKGSSMPKETEKYLVSSGLKNA 63

Query: 535 LYSIRASDLTDDLFGTLVACPLQ----PSSRGGEAPAKKKLNKDLQSDYGKSCTLEFDGA 368
           LYSIRASDLT+DLFGTLVACP+Q     S    E  +KK+ +  L SDY + CTLEFDGA
Sbjct: 64  LYSIRASDLTEDLFGTLVACPVQLPSVKSETSNEPVSKKRSHDALSSDYERFCTLEFDGA 123

Query: 367 SKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVK 188
           SKGNPG+AGAGAILR  DGS +CR+ EG+GIATCN AEY+  ILG++YA  KGFT+++V+
Sbjct: 124 SKGNPGQAGAGAILRSVDGSFVCRMREGLGIATCNVAEYRAFILGLKYALRKGFTSVQVR 183

Query: 187 GDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQ 8
           GDSKLVCMQI+GLWKV++ N+   Y EAK L+ RF +F+I HVLRDLNS+AD QANLG  
Sbjct: 184 GDSKLVCMQIEGLWKVRNPNIATWYEEAKNLKDRFVNFKITHVLRDLNSEADVQANLGVD 243

Query: 7   L 5
           L
Sbjct: 244 L 244


>ref|XP_004237084.1| PREDICTED: uncharacterized protein LOC101260715 [Solanum
           lycopersicum]
          Length = 369

 Score =  319 bits (817), Expect = 1e-84
 Identities = 168/277 (60%), Positives = 204/277 (73%), Gaps = 39/277 (14%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           +++ F+VVRKG+LVGVY++LSDCQ QVG+SICDPPVSV+KG +MPKDTE+YL S GLKNA
Sbjct: 83  DRDGFFVVRKGNLVGVYKNLSDCQTQVGSSICDPPVSVYKGYAMPKDTEDYLLSCGLKNA 142

Query: 535 LYSIRASDLTDDLFGTLVACPLQ--------PSSRGG--EAPAKKKLNKDLQSDY----- 401
           LYSIRA+DLT+DLFGTLV CP Q         SS+GG  E   KK+    + S+Y     
Sbjct: 143 LYSIRAADLTEDLFGTLVPCPFQHMLVSQQPSSSKGGMPEHMTKKRSQDVMWSEYADVAV 202

Query: 400 ------------------------GKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRL 293
                                   G+SCTLEFDGASKGNPG AGAGAILR DDGS ICRL
Sbjct: 203 ISNDDSLTKHVKLDDHKGVQAPLSGQSCTLEFDGASKGNPGLAGAGAILRADDGSFICRL 262

Query: 292 CEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLY 113
            EG+G+AT N AEY+  ILG+ YA  KGFT+IRV+GDSKLVCMQIQGLWKVK++N+ +LY
Sbjct: 263 REGLGVATNNAAEYRAIILGLNYALSKGFTSIRVQGDSKLVCMQIQGLWKVKNQNISSLY 322

Query: 112 NEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
            +AK+L+ RF SF+I+HVLR+ NSDADAQAN+  +LA
Sbjct: 323 EQAKQLKDRFLSFRIIHVLRESNSDADAQANIAVELA 359


>ref|XP_006363531.1| PREDICTED: uncharacterized protein LOC102591092 isoform X3 [Solanum
           tuberosum] gi|565395816|ref|XP_006363532.1| PREDICTED:
           uncharacterized protein LOC102591092 isoform X4 [Solanum
           tuberosum]
          Length = 288

 Score =  315 bits (808), Expect = 1e-83
 Identities = 166/275 (60%), Positives = 201/275 (73%), Gaps = 37/275 (13%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           +++ F+VVRKG+LVGVY++LSDCQ QVG+SICDPPVSV+KG +MPKDTE YL S GLKNA
Sbjct: 4   DRDGFFVVRKGNLVGVYKNLSDCQTQVGSSICDPPVSVYKGYAMPKDTEEYLLSCGLKNA 63

Query: 535 LYSIRASDLTDDLFGTLVACPLQP--SSRGG--EAPAKKKLNKDLQSDYG---------- 398
           LYSIRA+DLT+DLFGTLV CP Q   SS+GG  E   KK+    + S+Y           
Sbjct: 64  LYSIRAADLTEDLFGTLVPCPFQQPSSSKGGIPEHMTKKRSQDVMWSEYTDAAGSAVISN 123

Query: 397 -----------------------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCE 287
                                  +SCTLEFDGASKGNPG AGAGA+LR DDGS ICRL E
Sbjct: 124 DDSLRKHVKLDDHKGDQALPSGQQSCTLEFDGASKGNPGLAGAGAVLRADDGSFICRLRE 183

Query: 286 GVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNE 107
           G+G+AT N AEY+  ILG+ YA  KGFT+IRV+GDSKLVCMQIQGLWKVK++N+  LY +
Sbjct: 184 GLGVATNNAAEYRAIILGLNYALSKGFTSIRVQGDSKLVCMQIQGLWKVKNQNISTLYEQ 243

Query: 106 AKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
           AK+L+ RF SF+I+HVLR+ NSDADAQAN+  +LA
Sbjct: 244 AKQLKDRFLSFRIIHVLRESNSDADAQANIAVELA 278


>ref|XP_006363529.1| PREDICTED: uncharacterized protein LOC102591092 isoform X1 [Solanum
           tuberosum]
          Length = 367

 Score =  315 bits (808), Expect = 1e-83
 Identities = 166/275 (60%), Positives = 201/275 (73%), Gaps = 37/275 (13%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           +++ F+VVRKG+LVGVY++LSDCQ QVG+SICDPPVSV+KG +MPKDTE YL S GLKNA
Sbjct: 83  DRDGFFVVRKGNLVGVYKNLSDCQTQVGSSICDPPVSVYKGYAMPKDTEEYLLSCGLKNA 142

Query: 535 LYSIRASDLTDDLFGTLVACPLQP--SSRGG--EAPAKKKLNKDLQSDYG---------- 398
           LYSIRA+DLT+DLFGTLV CP Q   SS+GG  E   KK+    + S+Y           
Sbjct: 143 LYSIRAADLTEDLFGTLVPCPFQQPSSSKGGIPEHMTKKRSQDVMWSEYTDAAGSAVISN 202

Query: 397 -----------------------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCE 287
                                  +SCTLEFDGASKGNPG AGAGA+LR DDGS ICRL E
Sbjct: 203 DDSLRKHVKLDDHKGDQALPSGQQSCTLEFDGASKGNPGLAGAGAVLRADDGSFICRLRE 262

Query: 286 GVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNE 107
           G+G+AT N AEY+  ILG+ YA  KGFT+IRV+GDSKLVCMQIQGLWKVK++N+  LY +
Sbjct: 263 GLGVATNNAAEYRAIILGLNYALSKGFTSIRVQGDSKLVCMQIQGLWKVKNQNISTLYEQ 322

Query: 106 AKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
           AK+L+ RF SF+I+HVLR+ NSDADAQAN+  +LA
Sbjct: 323 AKQLKDRFLSFRIIHVLRESNSDADAQANIAVELA 357


>ref|XP_006602372.1| PREDICTED: uncharacterized protein LOC100809644 isoform X2 [Glycine
           max]
          Length = 351

 Score =  310 bits (795), Expect = 4e-82
 Identities = 165/275 (60%), Positives = 201/275 (73%), Gaps = 30/275 (10%)
 Frame = -2

Query: 736 DPKKLTMEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLR 557
           +P+ +  EK+ FYVVRKGD+VG+Y SL+D Q QVG+S+C+PPVSVFKG S+ KDTE YL 
Sbjct: 68  EPEAMKQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVFKGYSLSKDTEEYLV 127

Query: 556 SQGLKNALYSIRASDLTDDLFGTLVACPLQ-PSSR---GGEAPAKKK------------- 428
           S GLKNALY+IRA+DL +DLFG LV CPLQ PS++     +  +KK+             
Sbjct: 128 SHGLKNALYTIRATDLKEDLFGMLVPCPLQEPSTKESTSNKDVSKKRSLGVLGQDEKVIS 187

Query: 427 ---LNKDLQSDYG----------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCE 287
              L K ++ D+           ++C +EFDGASKGNPGKAGAGAILR +DGS+ICRL E
Sbjct: 188 EDPLRKQVKLDHAAVAEAPLHATQTCFVEFDGASKGNPGKAGAGAILRANDGSLICRLRE 247

Query: 286 GVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNE 107
           GVGIAT N AEY+  ILGM+YA +KGFT IR++GDSKLVCMQI G WKVK+ENL  LYN 
Sbjct: 248 GVGIATNNAAEYRAMILGMKYALKKGFTGIRIQGDSKLVCMQIDGSWKVKNENLSTLYNV 307

Query: 106 AKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
           AKEL+ +F SFQI HVLR+ NSDADAQANL   LA
Sbjct: 308 AKELKDKFSSFQISHVLRNFNSDADAQANLAINLA 342


>ref|XP_006602371.1| PREDICTED: uncharacterized protein LOC100809644 isoform X1 [Glycine
           max]
          Length = 352

 Score =  310 bits (794), Expect = 6e-82
 Identities = 165/276 (59%), Positives = 201/276 (72%), Gaps = 31/276 (11%)
 Frame = -2

Query: 736 DPKKLTMEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLR 557
           +P+ +  EK+ FYVVRKGD+VG+Y SL+D Q QVG+S+C+PPVSVFKG S+ KDTE YL 
Sbjct: 68  EPEAMKQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVFKGYSLSKDTEEYLV 127

Query: 556 SQGLKNALYSIRASDLTDDLFGTLVACPLQ-PSSR---GGEAPAKKK------------- 428
           S GLKNALY+IRA+DL +DLFG LV CPLQ PS++     +  +KK+             
Sbjct: 128 SHGLKNALYTIRATDLKEDLFGMLVPCPLQEPSTKESTSNKDVSKKRSLGVLGQDEQKVI 187

Query: 427 ----LNKDLQSDYG----------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLC 290
               L K ++ D+           ++C +EFDGASKGNPGKAGAGAILR +DGS+ICRL 
Sbjct: 188 SEDPLRKQVKLDHAAVAEAPLHATQTCFVEFDGASKGNPGKAGAGAILRANDGSLICRLR 247

Query: 289 EGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYN 110
           EGVGIAT N AEY+  ILGM+YA +KGFT IR++GDSKLVCMQI G WKVK+ENL  LYN
Sbjct: 248 EGVGIATNNAAEYRAMILGMKYALKKGFTGIRIQGDSKLVCMQIDGSWKVKNENLSTLYN 307

Query: 109 EAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
            AKEL+ +F SFQI HVLR+ NSDADAQANL   LA
Sbjct: 308 VAKELKDKFSSFQISHVLRNFNSDADAQANLAINLA 343


>ref|XP_006602373.1| PREDICTED: uncharacterized protein LOC100809644 isoform X3 [Glycine
           max]
          Length = 351

 Score =  310 bits (793), Expect = 7e-82
 Identities = 164/275 (59%), Positives = 198/275 (72%), Gaps = 30/275 (10%)
 Frame = -2

Query: 736 DPKKLTMEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLR 557
           +P+ +  EK+ FYVVRKGD+VG+Y SL+D Q QVG+S+C+PPVSVFKG S+ KDTE YL 
Sbjct: 68  EPEAMKQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVFKGYSLSKDTEEYLV 127

Query: 556 SQGLKNALYSIRASDLTDDLFGTLVACPLQ-PSSRGG----------------------- 449
           S GLKNALY+IRA+DL +DLFG LV CPLQ PS++                         
Sbjct: 128 SHGLKNALYTIRATDLKEDLFGMLVPCPLQEPSTKESTSNKDVSKKRSLGVLGQDEQKVI 187

Query: 448 -EAPAKKKLNKDLQSD-----YGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCE 287
            E P +K++  D  +      +  +C +EFDGASKGNPGKAGAGAILR +DGS+ICRL E
Sbjct: 188 SEDPLRKQVKLDHAAVAEAPLHATTCFVEFDGASKGNPGKAGAGAILRANDGSLICRLRE 247

Query: 286 GVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNE 107
           GVGIAT N AEY+  ILGM+YA +KGFT IR++GDSKLVCMQI G WKVK+ENL  LYN 
Sbjct: 248 GVGIATNNAAEYRAMILGMKYALKKGFTGIRIQGDSKLVCMQIDGSWKVKNENLSTLYNV 307

Query: 106 AKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
           AKEL+ +F SFQI HVLR+ NSDADAQANL   LA
Sbjct: 308 AKELKDKFSSFQISHVLRNFNSDADAQANLAINLA 342


>ref|XP_007153671.1| hypothetical protein PHAVU_003G055000g [Phaseolus vulgaris]
           gi|561027025|gb|ESW25665.1| hypothetical protein
           PHAVU_003G055000g [Phaseolus vulgaris]
          Length = 359

 Score =  306 bits (785), Expect = 6e-81
 Identities = 163/277 (58%), Positives = 198/277 (71%), Gaps = 32/277 (11%)
 Frame = -2

Query: 736 DPKKLTMEKEE--FYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENY 563
           +P+   MEKE+  FYVVRKGD+VG+Y SL+D Q QVG+S+C+PPVSV+KG S+ KDTE Y
Sbjct: 74  EPEAPVMEKEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVYKGYSLSKDTEEY 133

Query: 562 LRSQGLKNALYSIRASDLTDDLFGTLVACPLQ-PSSRGG--------------------- 449
           L S GLKNALY+IRA+DL +DLFG L+ CP Q PS++ G                     
Sbjct: 134 LASHGLKNALYTIRAADLKEDLFGMLIPCPFQEPSTKEGTSNMDVPKKRSLRVPGQDEKA 193

Query: 448 --EAPAKKKLN------KDLQSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRL 293
             E P +KK+        +  S   ++CTLEFDGASKGNPGK+GAGA+LR  DGS+ICRL
Sbjct: 194 VSEDPLRKKVKLEHNAVAEAPSHSTRTCTLEFDGASKGNPGKSGAGAVLRAIDGSLICRL 253

Query: 292 CEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLY 113
            EGVG+AT N AEY+  ILGM+YA +KGFT IR++GDSKLVCMQI G WKVK+ENL  LY
Sbjct: 254 REGVGVATNNAAEYRAMILGMKYALKKGFTGIRIQGDSKLVCMQIDGSWKVKNENLSTLY 313

Query: 112 NEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
             AKEL+ +F SFQI HVLR+ NSDADAQANL   LA
Sbjct: 314 KVAKELKDKFSSFQINHVLRNFNSDADAQANLAINLA 350


>ref|XP_002534525.1| nuclease, putative [Ricinus communis] gi|223525106|gb|EEF27855.1|
           nuclease, putative [Ricinus communis]
          Length = 262

 Score =  306 bits (783), Expect = 1e-80
 Identities = 155/250 (62%), Positives = 190/250 (76%), Gaps = 9/250 (3%)
 Frame = -2

Query: 724 LTMEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGL 545
           +  EK+ F+VVRKGD+VGVY+S +DCQ QVG+S+CDPPVSV+KG S+ KDTE YL S+GL
Sbjct: 1   MEQEKDAFFVVRKGDVVGVYKSFTDCQAQVGSSVCDPPVSVYKGYSLSKDTEEYLVSRGL 60

Query: 544 KNALYSIRASDLTDDLFGTLVACPLQP---SSRGGEAPAKKKLNKDLQSDY------GKS 392
           +NALY+IRA DL +DLFGTLV CP Q    S+ G   P +K    D Q++         S
Sbjct: 61  QNALYAIRAQDLKEDLFGTLVPCPFQETDGSASGLTDPLRKHAKLDNQTEAQALYYDDDS 120

Query: 391 CTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEK 212
           C LEFDGASKGNPG AGAGA+LR  DG +ICRL EG+G  T N AEY+  ILGM+YA +K
Sbjct: 121 CILEFDGASKGNPGPAGAGALLRTTDGRIICRLREGLGQVTNNVAEYRAMILGMKYALKK 180

Query: 211 GFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDAD 32
           G+T IRV+GDSKLVC Q+QGLWKVK +++ NLY +AK+L+ +F SFQI HVLR LNS+AD
Sbjct: 181 GYTKIRVQGDSKLVCSQVQGLWKVKHKDMTNLYEQAKQLKDKFASFQISHVLRALNSEAD 240

Query: 31  AQANLGAQLA 2
           AQANL  QLA
Sbjct: 241 AQANLAIQLA 250


>ref|XP_003532034.1| PREDICTED: uncharacterized protein LOC100779114 isoform X1 [Glycine
           max]
          Length = 356

 Score =  301 bits (770), Expect = 3e-79
 Identities = 159/270 (58%), Positives = 193/270 (71%), Gaps = 30/270 (11%)
 Frame = -2

Query: 724 LTMEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGL 545
           +  EK+ FYVVRKGD+VG+Y SL+D Q QVG+S+C+PPVSV+KG S+ KDTE YL S GL
Sbjct: 77  MEQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVYKGYSLSKDTEEYLVSHGL 136

Query: 544 KNALYSIRASDLTDDLFGTLVACPLQ-PSSRGG-----------------------EAPA 437
           KNALY+IRA+DL +DLFG LV CP Q PS++ G                       E P 
Sbjct: 137 KNALYTIRATDLKEDLFGMLVPCPFQEPSTKEGTSNKDVSKQRSLGVLAQDEKVISEDPF 196

Query: 436 KKKLN------KDLQSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGI 275
           +K++        +  S   ++C +EFDGASKGNPGKAGAGAILR +DGS+ICR+ EGVGI
Sbjct: 197 RKQVKLEYAEVAEAPSHATRTCFVEFDGASKGNPGKAGAGAILRANDGSLICRVREGVGI 256

Query: 274 ATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKEL 95
           AT N AEY+  ILGM+YA +KGFT I ++GDSKLVCMQI G WKVK+ENL  LYN AKEL
Sbjct: 257 ATNNAAEYRAMILGMKYALKKGFTGICIQGDSKLVCMQIDGSWKVKNENLFTLYNVAKEL 316

Query: 94  ESRFHSFQIMHVLRDLNSDADAQANLGAQL 5
           + +F SFQI HVLR+ NSDADAQANL   L
Sbjct: 317 KDKFSSFQISHVLRNFNSDADAQANLAINL 346


>ref|XP_006585969.1| PREDICTED: uncharacterized protein LOC100779114 isoform X2 [Glycine
           max]
          Length = 357

 Score =  300 bits (769), Expect = 5e-79
 Identities = 159/271 (58%), Positives = 193/271 (71%), Gaps = 31/271 (11%)
 Frame = -2

Query: 724 LTMEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGL 545
           +  EK+ FYVVRKGD+VG+Y SL+D Q QVG+S+C+PPVSV+KG S+ KDTE YL S GL
Sbjct: 77  MEQEKDAFYVVRKGDVVGIYNSLADSQAQVGSSVCNPPVSVYKGYSLSKDTEEYLVSHGL 136

Query: 544 KNALYSIRASDLTDDLFGTLVACPLQ-PSSRGG------------------------EAP 440
           KNALY+IRA+DL +DLFG LV CP Q PS++ G                        E P
Sbjct: 137 KNALYTIRATDLKEDLFGMLVPCPFQEPSTKEGTSNKDVSKQRSLGVLAQDEQKVISEDP 196

Query: 439 AKKKLN------KDLQSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVG 278
            +K++        +  S   ++C +EFDGASKGNPGKAGAGAILR +DGS+ICR+ EGVG
Sbjct: 197 FRKQVKLEYAEVAEAPSHATRTCFVEFDGASKGNPGKAGAGAILRANDGSLICRVREGVG 256

Query: 277 IATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKE 98
           IAT N AEY+  ILGM+YA +KGFT I ++GDSKLVCMQI G WKVK+ENL  LYN AKE
Sbjct: 257 IATNNAAEYRAMILGMKYALKKGFTGICIQGDSKLVCMQIDGSWKVKNENLFTLYNVAKE 316

Query: 97  LESRFHSFQIMHVLRDLNSDADAQANLGAQL 5
           L+ +F SFQI HVLR+ NSDADAQANL   L
Sbjct: 317 LKDKFSSFQISHVLRNFNSDADAQANLAINL 347


>ref|XP_006484311.1| PREDICTED: uncharacterized protein LOC102614852 [Citrus sinensis]
          Length = 558

 Score =  300 bits (767), Expect = 8e-79
 Identities = 159/288 (55%), Positives = 202/288 (70%), Gaps = 25/288 (8%)
 Frame = -2

Query: 790 VQCYXXXXXXXXXXXXXSDPKKLTMEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPP 611
           +QCY             ++P+ +   K+EF+VVRKGDLVGVY+S ++CQ Q+G+SIC PP
Sbjct: 59  LQCYSSSAKKPRSRKLKTEPQ-MKQGKDEFFVVRKGDLVGVYKSFTECQAQLGSSICHPP 117

Query: 610 VSVFKGCSMPKDTENYLRSQGLKNALYSIRASDLTDDLFGTLVACPLQ-PSSR------- 455
           VSV+KG ++PK TE YL S GLKNALY+IRA+DLT+DLFGTL+ C LQ P+S+       
Sbjct: 118 VSVYKGNALPKGTEEYLASHGLKNALYTIRAADLTEDLFGTLMPCTLQDPTSKKRPQDPI 177

Query: 454 ----GGEA--------PAKKKLNKDLQSD-----YGKSCTLEFDGASKGNPGKAGAGAIL 326
               G E         P +K +  DL ++     Y +SC +EFDGASKGNPG AGA A+L
Sbjct: 178 EPEIGYELGSTSVLADPLRKHVKLDLDAESKAASYHRSCIIEFDGASKGNPGPAGAAAVL 237

Query: 325 RYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLW 146
           R DDGS+IC+L EGVGIAT N AEY+G ILG++YA EKGF+NIRV+GDSKLVCMQ+ G W
Sbjct: 238 RTDDGSLICKLREGVGIATSNVAEYRGLILGLKYALEKGFSNIRVQGDSKLVCMQVAGSW 297

Query: 145 KVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
           K K + +  L  EA+ L+ +F SFQI HVLR+LNS+ADAQA L   LA
Sbjct: 298 KTKHQGMAKLCGEARRLKDKFLSFQISHVLRNLNSEADAQATLAVGLA 345


>ref|XP_007226366.1| hypothetical protein PRUPE_ppa022484mg [Prunus persica]
           gi|462423302|gb|EMJ27565.1| hypothetical protein
           PRUPE_ppa022484mg [Prunus persica]
          Length = 521

 Score =  298 bits (762), Expect = 3e-78
 Identities = 149/246 (60%), Positives = 188/246 (76%), Gaps = 8/246 (3%)
 Frame = -2

Query: 718 MEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKN 539
           +EK+ FYVVRKGD+VGVY+S SDCQ Q+ +SI DPPVSV+KG S+PK+TE YL S GL N
Sbjct: 86  LEKDAFYVVRKGDIVGVYKSFSDCQAQLSSSIFDPPVSVYKGYSLPKETEEYLGSCGLTN 145

Query: 538 ALYSIRASDLTDDLFGTLVACPLQPSSRGGEAPAKKKLNKDLQSDYGKS--------CTL 383
           A+Y+I A+DL DD+FG L+ CP Q    G  + A   L K ++ D+           CTL
Sbjct: 146 AIYTIAAADLKDDIFGKLMHCPFQEVI-GSPSIADDPLRKHVKIDHSTQSLPLDSGFCTL 204

Query: 382 EFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFT 203
           EFDGASKGNPG AGAGA+LR DDGS+IC+L EG+G+ T N AEY+  ILG++YA +KGFT
Sbjct: 205 EFDGASKGNPGLAGAGAVLRADDGSLICKLHEGLGVRTNNVAEYRALILGLKYALKKGFT 264

Query: 202 NIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQA 23
            IRVKGDSKLVCMQ+QGLWKV+++N+ +L  E KEL+ +F SF+I HVLR+LNS+ADAQA
Sbjct: 265 KIRVKGDSKLVCMQVQGLWKVRNQNMSDLCEEVKELKDKFLSFEISHVLRELNSEADAQA 324

Query: 22  NLGAQL 5
           NL  +L
Sbjct: 325 NLAVRL 330


>ref|XP_002314727.2| hypothetical protein POPTR_0010s10515g [Populus trichocarpa]
           gi|550329518|gb|EEF00898.2| hypothetical protein
           POPTR_0010s10515g [Populus trichocarpa]
          Length = 364

 Score =  292 bits (748), Expect = 1e-76
 Identities = 155/280 (55%), Positives = 194/280 (69%), Gaps = 35/280 (12%)
 Frame = -2

Query: 736 DPKKLTM---EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTEN 566
           DP+  T+   E + F+VVRKGD+VGVY++ +DCQ QVG+SICDPPVSV+KG S+ KD+E 
Sbjct: 76  DPQPATVMDHENDAFFVVRKGDVVGVYKNFADCQAQVGSSICDPPVSVYKGYSLSKDSEA 135

Query: 565 YLRSQGLKNALYSIRASDLTDDLFGTLVACPLQ-PSSRGGEA---PAKKKLNKDL----- 413
           YL S GL+NALY++RA+DL +DLFG L+ CP Q P+S   E      KK+  + L     
Sbjct: 136 YLVSHGLQNALYTVRAADLKEDLFGVLMPCPFQQPASSDAETLKNDTKKRSREVLGSEIT 195

Query: 412 -----------------------QSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVI 302
                                  Q+   +SC LEFDGASKGNPG+AGAGA+LR DDGS+I
Sbjct: 196 DTAGSASMMSKHANLDNQAECQAQNSNSRSCLLEFDGASKGNPGQAGAGAVLRTDDGSLI 255

Query: 301 CRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLL 122
           CRL EG+GIAT N AEY+  +LGM+YA +KG+T I+VKGDSKLVCMQIQG WK K  N+ 
Sbjct: 256 CRLREGLGIATNNMAEYRAILLGMKYALQKGYTKIQVKGDSKLVCMQIQGSWKAKHVNIT 315

Query: 121 NLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
           NL  EAK+L++ F SF I HVLR+ NS+ADAQANL   LA
Sbjct: 316 NLCTEAKKLKNSFLSFHISHVLREFNSEADAQANLAVHLA 355


>ref|XP_007044529.1| RNase H family protein, putative isoform 3 [Theobroma cacao]
           gi|508708464|gb|EOY00361.1| RNase H family protein,
           putative isoform 3 [Theobroma cacao]
          Length = 288

 Score =  292 bits (747), Expect = 2e-76
 Identities = 153/276 (55%), Positives = 194/276 (70%), Gaps = 38/276 (13%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           EK+ FYVVRKGD+VGVY+S +DC+ QVG SICDPPVSV+KG S+ KDT+ YL S GLKNA
Sbjct: 4   EKDAFYVVRKGDVVGVYKSFADCRAQVGPSICDPPVSVYKGYSLTKDTKEYLVSCGLKNA 63

Query: 535 LYSIRASDLTDDLFGTLVACPLQ-PSSRGGEAP----AKKKLNKDLQSDYG--------- 398
           LY++RA+D+ +DLFG L+ C  Q P+S  GE      AKK+    L+S+YG         
Sbjct: 64  LYTVRAADVKEDLFGLLMPCSFQEPASSKGETSHMDAAKKRSQDMLKSEYGGLGALGSIA 123

Query: 397 ------------------------KSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLC 290
                                   +SC LEFDGASKGNPG AGA A+LR D G VIC+L 
Sbjct: 124 VADPVSKHIKLDPYAEVQIASSNCQSCILEFDGASKGNPGPAGAAAVLRTDTGKVICKLR 183

Query: 289 EGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYN 110
           EG+GIATCN AEY+  ILG+++A  KG+++I V+GDSKLVCMQ+QGLWKVK E++  LY 
Sbjct: 184 EGLGIATCNAAEYRAVILGLKHALRKGYSSICVRGDSKLVCMQMQGLWKVKHEHMSELYE 243

Query: 109 EAKELESRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
           +AK+L+++F SFQI HVLR+LN++ADAQANL   LA
Sbjct: 244 QAKKLKNKFLSFQINHVLRELNAEADAQANLAVNLA 279


>ref|XP_002266599.2| PREDICTED: uncharacterized protein LOC100255243 [Vitis vinifera]
          Length = 453

 Score =  286 bits (732), Expect = 9e-75
 Identities = 147/247 (59%), Positives = 187/247 (75%), Gaps = 9/247 (3%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           EK+ F+VVRKGD+VGVY++ SDCQ QVG+SICDPPVSV+KG  +PKDTE YL S+GL+NA
Sbjct: 4   EKDAFFVVRKGDVVGVYKTFSDCQAQVGSSICDPPVSVYKGYYLPKDTEEYLVSRGLRNA 63

Query: 535 LYSIRASDLTDDLFGTLVACPLQPSSRGGEA---PAKKKLNKD------LQSDYGKSCTL 383
           LY+IRA+DL +DLFG L+ C  Q +         P K+ +  D      L SD  +SC +
Sbjct: 64  LYTIRAADLKEDLFGKLMPCAFQGAVESRPITTDPLKEHIKLDRVEAQALFSDC-RSCVV 122

Query: 382 EFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFT 203
           EFDGASKGNPG AGA A+LR D G VICR+ EG+G+AT N AEY+  ILG++YA +KG+T
Sbjct: 123 EFDGASKGNPGPAGAAAVLRSDSGRVICRVREGLGLATNNVAEYQAMILGLKYALKKGYT 182

Query: 202 NIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQA 23
           +IRV+GDSKLVCMQ+QGLWK +++N+  L  EAK+L++ F S +I HVLR LNS+ADAQA
Sbjct: 183 SIRVQGDSKLVCMQVQGLWKARNKNMSILCKEAKKLKNEFLSVEINHVLRGLNSEADAQA 242

Query: 22  NLGAQLA 2
           NL   LA
Sbjct: 243 NLAVHLA 249


>gb|EXC04052.1| Uncharacterized protein L484_011032 [Morus notabilis]
          Length = 369

 Score =  283 bits (724), Expect = 7e-74
 Identities = 149/270 (55%), Positives = 179/270 (66%), Gaps = 32/270 (11%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           EKE FYVVRKGD+VGVYRSL+DCQ QVG+SICDPPVSVFKG  +PK+T+ YL S+GLK+A
Sbjct: 94  EKEAFYVVRKGDVVGVYRSLNDCQAQVGSSICDPPVSVFKGHRLPKETQEYLTSRGLKDA 153

Query: 535 LYSIRASDLTDDLFGTLVACPLQ----PSSRGGEAPAKKKLNK----------------- 419
           +Y+IR  D+ + LFG +V C L     P+S  G  P+K    K                 
Sbjct: 154 IYTIRVEDMKEGLFGPIVQCALPVKEVPTSSIGVTPSKDTSKKRSQLVAGLETAEGIGYI 213

Query: 418 -----------DLQSDYGKSCTLEFDGASKGNPGKAGAGAILRYDDGSVICRLCEGVGIA 272
                      D  S   KSC L FDGASKGNPG+AGAGA+L  DDG +IC+LCEG+G  
Sbjct: 214 SASTSDLPNVRDSSSLSSKSCFLMFDGASKGNPGRAGAGAVLLADDGRMICKLCEGLGET 273

Query: 271 TCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLWKVKSENLLNLYNEAKELE 92
           T N AEY+  +LG+RYA +KGF  I V GDSKLVC+Q+QG WKV+ E +  LY E K LE
Sbjct: 274 TNNVAEYRALLLGLRYAHKKGFNRISVLGDSKLVCLQVQGSWKVRDEKISKLYKEVKALE 333

Query: 91  SRFHSFQIMHVLRDLNSDADAQANLGAQLA 2
           + F SFQI HVLRD N +ADAQANL   LA
Sbjct: 334 NNFLSFQINHVLRDRNKEADAQANLATTLA 363


>ref|XP_002517391.1| nuclease, putative [Ricinus communis] gi|223543402|gb|EEF44933.1|
           nuclease, putative [Ricinus communis]
          Length = 255

 Score =  280 bits (717), Expect = 5e-73
 Identities = 142/240 (59%), Positives = 180/240 (75%), Gaps = 3/240 (1%)
 Frame = -2

Query: 715 EKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPPVSVFKGCSMPKDTENYLRSQGLKNA 536
           EK+ FYVVRKGD+VG+Y+SL DCQ QVG+S+C+P VSVFKG  + KD E+YL S G+K+A
Sbjct: 4   EKDVFYVVRKGDVVGIYKSLRDCQAQVGSSVCNPSVSVFKGYGLAKDAEDYLVSHGIKDA 63

Query: 535 LYSIRASDLTDDLFGTLVACPLQ-PSSRGGEAPAKKKLNKDLQSDYGK--SCTLEFDGAS 365
            +SI A+D+  DLFG LV CP Q P+   G+A  K    K  +   G   SC LEFDGAS
Sbjct: 64  AFSIHATDVQPDLFGKLVPCPFQQPAFSEGKALNKDSSPKSSRGVLGSMSSCILEFDGAS 123

Query: 364 KGNPGKAGAGAILRYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKG 185
           KGNPG AGAGA+LR +DGS++C L EG+G AT N AEY+  ILG+++A  KGF +IRV+G
Sbjct: 124 KGNPGPAGAGAVLRAEDGSMVCLLREGLGTATNNVAEYRAVILGLKHALRKGFKHIRVRG 183

Query: 184 DSKLVCMQIQGLWKVKSENLLNLYNEAKELESRFHSFQIMHVLRDLNSDADAQANLGAQL 5
           DS LV MQI+GLWK+KS+N+ +L  EAKEL+++F SFQI HVLR+ NS+AD QANL   L
Sbjct: 184 DSNLVVMQIKGLWKIKSQNVADLCKEAKELKNKFLSFQIEHVLREFNSEADTQANLAVNL 243


>ref|XP_006438469.1| hypothetical protein CICLE_v10031867mg [Citrus clementina]
           gi|557540665|gb|ESR51709.1| hypothetical protein
           CICLE_v10031867mg [Citrus clementina]
          Length = 372

 Score =  279 bits (714), Expect = 1e-72
 Identities = 147/271 (54%), Positives = 189/271 (69%), Gaps = 25/271 (9%)
 Frame = -2

Query: 790 VQCYXXXXXXXXXXXXXSDPKKLTMEKEEFYVVRKGDLVGVYRSLSDCQDQVGTSICDPP 611
           +QCY             ++P+ +   K+EF+VVRKGDLVGVY+S ++CQ Q+G+SIC PP
Sbjct: 59  LQCYSSSAKKPRSRKLKTEPQ-MKQGKDEFFVVRKGDLVGVYKSFTECQAQLGSSICHPP 117

Query: 610 VSVFKGCSMPKDTENYLRSQGLKNALYSIRASDLTDDLFGTLVACPLQ-PSSR------- 455
           VSV+KG ++PK TE YL S GLKNALY+IRA+DLT+DLFG+L+ C LQ P+S+       
Sbjct: 118 VSVYKGNALPKGTEEYLASHGLKNALYTIRAADLTEDLFGSLMPCTLQDPTSKKRPQDTI 177

Query: 454 ----GGEA--------PAKKKLNKDLQSD-----YGKSCTLEFDGASKGNPGKAGAGAIL 326
               G E         P +K +  DL ++     Y +SC +EFDGASKGNPG AGA A+L
Sbjct: 178 EPEIGYELGSTSVLADPLRKHVKLDLDAESKAASYHRSCVIEFDGASKGNPGPAGAAAVL 237

Query: 325 RYDDGSVICRLCEGVGIATCNFAEYKGAILGMRYAAEKGFTNIRVKGDSKLVCMQIQGLW 146
           R DDGS+IC+L EGVGIAT N AEY+G ILG++YA EKGF+NIRV+GDSKLVCMQ+ G W
Sbjct: 238 RTDDGSLICKLREGVGIATSNVAEYRGLILGLKYALEKGFSNIRVQGDSKLVCMQVAGSW 297

Query: 145 KVKSENLLNLYNEAKELESRFHSFQIMHVLR 53
           K K + +  L  EA+ L+ +F SFQI HVLR
Sbjct: 298 KTKHQGMAKLCGEARRLKDKFLSFQISHVLR 328

