BLASTX nr result

ID: Mentha29_contig00006264 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00006264
         (1192 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40501.1| hypothetical protein MIMGU_mgv1a008493mg [Mimulus...   400   e-109
ref|XP_006358582.1| PREDICTED: uncharacterized protein LOC102600...   349   2e-93
ref|XP_004245868.1| PREDICTED: uncharacterized protein LOC101261...   345   2e-92
ref|XP_006485398.1| PREDICTED: uncharacterized protein LOC102629...   340   9e-91
ref|XP_006436792.1| hypothetical protein CICLE_v10031769mg [Citr...   340   9e-91
ref|XP_006436791.1| hypothetical protein CICLE_v10031769mg [Citr...   340   9e-91
ref|XP_002282079.1| PREDICTED: uncharacterized protein LOC100243...   321   4e-85
ref|XP_007039584.1| Uncharacterized protein isoform 3 [Theobroma...   315   2e-83
ref|XP_007039582.1| Uncharacterized protein isoform 1 [Theobroma...   315   2e-83
ref|XP_002528762.1| conserved hypothetical protein [Ricinus comm...   314   4e-83
ref|XP_007211018.1| hypothetical protein PRUPE_ppa020378mg [Prun...   310   8e-82
ref|XP_006368453.1| hypothetical protein POPTR_0001s02940g [Popu...   304   4e-80
ref|XP_004511767.1| PREDICTED: uncharacterized protein LOC101498...   302   2e-79
ref|XP_007039583.1| Uncharacterized protein isoform 2 [Theobroma...   293   1e-76
ref|XP_003538768.1| PREDICTED: uncharacterized protein LOC100784...   293   1e-76
ref|XP_003611437.1| hypothetical protein MTR_5g014010 [Medicago ...   290   6e-76
ref|XP_004148518.1| PREDICTED: uncharacterized protein LOC101208...   286   9e-75
ref|XP_006485401.1| PREDICTED: uncharacterized protein LOC102629...   286   2e-74
ref|XP_003516643.1| PREDICTED: uncharacterized protein LOC100779...   285   2e-74
ref|XP_007156635.1| hypothetical protein PHAVU_002G004700g [Phas...   284   5e-74

>gb|EYU40501.1| hypothetical protein MIMGU_mgv1a008493mg [Mimulus guttatus]
          Length = 371

 Score =  400 bits (1028), Expect = e-109
 Identities = 197/338 (58%), Positives = 236/338 (69%)
 Frame = +3

Query: 111  SNTQKKKRHKPSYSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSLNQP 290
            S+ +K+KR KP Y T FP E   +SLPS+K DF RLI                  S +QP
Sbjct: 3    SDGKKRKRPKP-YETQFPVELFLNSLPSSKPDFCRLIAVVSIAAAVAVACNFVATSFSQP 61

Query: 291  PKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGDINR 470
            PKPFCD+T               YCE CP +G CY+GKL C  G+RK    CV DGD+++
Sbjct: 62   PKPFCDTTSDPDGSPFD------YCEPCPENGECYDGKLKCIDGYRKHVNLCVRDGDVDK 115

Query: 471  AAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMPARQ 650
            AA KLSKWVE  +C A AQ LCSGTGKCW   D L + LD Y + D++ +D++IY PA+Q
Sbjct: 116  AAMKLSKWVEVRLCEAYAQLLCSGTGKCWVSKDELFNELDNYNLGDNHRVDESIYAPAKQ 175

Query: 651  KAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALIAGC 830
            +A++ I S LET+R D G+EE KCPE LV HYKPLSCV +Q +IKHALL +     + GC
Sbjct: 176  RAIQNIHSLLETKRDDYGIEEFKCPESLVNHYKPLSCVVQQWLIKHALLSILTFLSLVGC 235

Query: 831  ILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLLPKE 1010
            I I+ RAYQRH LSVRAE LYHEVCDILEEKP+ S+  +GE EPW+VAS LRDHLL PKE
Sbjct: 236  IFIANRAYQRHHLSVRAEHLYHEVCDILEEKPLESRRVNGECEPWIVASWLRDHLLSPKE 295

Query: 1011 RKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            RKDP LWRKVEEL+ EDSR+DQYPKL+KGESKVVWEWQ
Sbjct: 296  RKDPLLWRKVEELIQEDSRIDQYPKLLKGESKVVWEWQ 333


>ref|XP_006358582.1| PREDICTED: uncharacterized protein LOC102600075 [Solanum tuberosum]
          Length = 397

 Score =  349 bits (895), Expect = 2e-93
 Identities = 179/355 (50%), Positives = 228/355 (64%), Gaps = 9/355 (2%)
 Frame = +3

Query: 87   PILRMASES---NTQKKKRHKPSYSTL------FPAEPSSDSLPSTKADFSRLIXXXXXX 239
            P  R +S S   N  K K    S S L       P +PSS+  PS+K++FSR I      
Sbjct: 5    PRTRPSSRSPNPNPTKNKTSSSSSSRLSTSSRSIPLQPSSNLFPSSKSEFSRFIAVVVVA 64

Query: 240  XXXXXXXXXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDH 419
                         LN  PKPFCDS                +CE CP +GVC+EGKL C H
Sbjct: 65   SAVAFSCNYVFTYLNSQPKPFCDSNSDFDDSLSD------FCEPCPLNGVCHEGKLECAH 118

Query: 420  GFRKQGKRCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYG 599
            G+R+ G  CVED  IN AAKKLSK VE  +C    Q+ C+GTG  W   + L + +++  
Sbjct: 119  GYRRLGNLCVEDSSINEAAKKLSKLVEGLLCEGHTQYSCTGTGTVWVQGNQLWEKVNESK 178

Query: 600  ISDDYDLDKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCI 779
            I D+Y L +A+Y  A ++AMEA+R  LETR  D G+EELKCP +LV HY P+SC  +Q I
Sbjct: 179  IMDEYGLSEAVYAHAMKRAMEALRKVLETRLNDHGIEELKCPPLLVLHYTPVSCRIQQWI 238

Query: 780  IKHALLLVPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGE 959
            + HALLLVPACAL+ GC+    +  +R+ LSV+AEQ+Y+E CD+LEEK + ++S +GE E
Sbjct: 239  LDHALLLVPACALLLGCVFTLLKFRRRYYLSVKAEQIYNEACDVLEEKAVSARSMTGEHE 298

Query: 960  PWVVASLLRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            PWVVASLLRDHLL PKERKDP LW+KVE+LV EDSR+++YPK+VKGE KVVWEWQ
Sbjct: 299  PWVVASLLRDHLLSPKERKDPMLWKKVEQLVQEDSRLERYPKMVKGECKVVWEWQ 353


>ref|XP_004245868.1| PREDICTED: uncharacterized protein LOC101261143 [Solanum
            lycopersicum]
          Length = 398

 Score =  345 bits (886), Expect = 2e-92
 Identities = 176/356 (49%), Positives = 231/356 (64%), Gaps = 10/356 (2%)
 Frame = +3

Query: 87   PILRMASESNTQKKKRHKPSYSTL----------FPAEPSSDSLPSTKADFSRLIXXXXX 236
            P  R +S+S   K  ++K S S+            P +PSS+  PS+K++FSRLI     
Sbjct: 5    PRTRPSSQSPNPKPTKNKTSSSSSASRPSTSSRSIPLQPSSNLFPSSKSEFSRLIAVVVV 64

Query: 237  XXXXXXXXXXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCD 416
                          LN  PKPFCDS                 CE CP +GVC EGKL C 
Sbjct: 65   ASAVAFSCNYVFTYLNSQPKPFCDSNSGFDDSLTDL------CEPCPLNGVCREGKLECA 118

Query: 417  HGFRKQGKRCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKY 596
            HG+R+ G  CVED +IN  AKKLSK VE  +C   AQ+ C+GTG  W   + L + +++ 
Sbjct: 119  HGYRRLGNLCVEDSNINETAKKLSKLVEGLLCEEHAQYSCTGTGTIWVQGNQLWEKVNES 178

Query: 597  GISDDYDLDKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQC 776
             I D+Y L++A+Y  A ++AMEA+R  LETR  D G+EELKCP +LV HY P+SC  ++ 
Sbjct: 179  KIMDEYGLNEAVYAHAMKRAMEALRKVLETRLNDHGIEELKCPPLLVLHYTPVSCRIQRW 238

Query: 777  IIKHALLLVPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEG 956
            I++HALLLVPACAL+ GC+    +  +R+ LSV+AE +Y+E CD+LEEK + ++S +GE 
Sbjct: 239  ILEHALLLVPACALLLGCVFTLLKLRRRYHLSVKAEHIYNEACDVLEEKAMSARSMTGEH 298

Query: 957  EPWVVASLLRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            EPWVVASLLRDHLL PKERKDP LW+KVE+LV EDSR+++YPK+VKGE KVVWEWQ
Sbjct: 299  EPWVVASLLRDHLLSPKERKDPMLWKKVEQLVQEDSRLERYPKMVKGECKVVWEWQ 354


>ref|XP_006485398.1| PREDICTED: uncharacterized protein LOC102629601 isoform X1 [Citrus
            sinensis]
          Length = 396

 Score =  340 bits (871), Expect = 9e-91
 Identities = 169/350 (48%), Positives = 222/350 (63%), Gaps = 12/350 (3%)
 Frame = +3

Query: 111  SNTQKKKRHKP------------SYSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXX 254
            S++ KKKR KP            S S  +  EP     PS K D  RLI           
Sbjct: 2    SSSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPS-KQDLLRLITVVAIASSVAL 60

Query: 255  XXXXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQ 434
                    LN   KPFCDS                 CE CP++G C++GKL C HG+RK 
Sbjct: 61   TCNYLANFLNSTSKPFCDSNLLLDSPQSPTDS----CEPCPSNGECHQGKLECFHGYRKH 116

Query: 435  GKRCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDY 614
            GK CVEDGDIN  A +LS+WVE+ +C A AQFLC GTG  W   +++ ++L+ + +   +
Sbjct: 117  GKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIF 176

Query: 615  DLDKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHAL 794
            +LD  +Y+  +++ ME +   LE+R    G++ELKCPE+L  HYKPLSC   Q +  HAL
Sbjct: 177  ELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHAL 236

Query: 795  LLVPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVA 974
            ++VP C+L+ GC+L+  + ++R   ++R E+LYH+VC+ILEE  ++SKS +GE EPWVVA
Sbjct: 237  IIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVA 296

Query: 975  SLLRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            S LRDHLLLPKERKDP +W+KVEELV EDSRVDQYPKL+KGESKVVWEWQ
Sbjct: 297  SRLRDHLLLPKERKDPVIWKKVEELVQEDSRVDQYPKLLKGESKVVWEWQ 346


>ref|XP_006436792.1| hypothetical protein CICLE_v10031769mg [Citrus clementina]
            gi|568863997|ref|XP_006485400.1| PREDICTED:
            uncharacterized protein LOC102629601 isoform X3 [Citrus
            sinensis] gi|557538988|gb|ESR50032.1| hypothetical
            protein CICLE_v10031769mg [Citrus clementina]
          Length = 359

 Score =  340 bits (871), Expect = 9e-91
 Identities = 169/350 (48%), Positives = 222/350 (63%), Gaps = 12/350 (3%)
 Frame = +3

Query: 111  SNTQKKKRHKP------------SYSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXX 254
            S++ KKKR KP            S S  +  EP     PS K D  RLI           
Sbjct: 2    SSSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPS-KQDLLRLITVVAIASSVAL 60

Query: 255  XXXXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQ 434
                    LN   KPFCDS                 CE CP++G C++GKL C HG+RK 
Sbjct: 61   TCNYLANFLNSTSKPFCDSNLLLDSPQSPTDS----CEPCPSNGECHQGKLECFHGYRKH 116

Query: 435  GKRCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDY 614
            GK CVEDGDIN  A +LS+WVE+ +C A AQFLC GTG  W   +++ ++L+ + +   +
Sbjct: 117  GKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIF 176

Query: 615  DLDKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHAL 794
            +LD  +Y+  +++ ME +   LE+R    G++ELKCPE+L  HYKPLSC   Q +  HAL
Sbjct: 177  ELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHAL 236

Query: 795  LLVPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVA 974
            ++VP C+L+ GC+L+  + ++R   ++R E+LYH+VC+ILEE  ++SKS +GE EPWVVA
Sbjct: 237  IIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVA 296

Query: 975  SLLRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            S LRDHLLLPKERKDP +W+KVEELV EDSRVDQYPKL+KGESKVVWEWQ
Sbjct: 297  SRLRDHLLLPKERKDPVIWKKVEELVQEDSRVDQYPKLLKGESKVVWEWQ 346


>ref|XP_006436791.1| hypothetical protein CICLE_v10031769mg [Citrus clementina]
            gi|568863995|ref|XP_006485399.1| PREDICTED:
            uncharacterized protein LOC102629601 isoform X2 [Citrus
            sinensis] gi|557538987|gb|ESR50031.1| hypothetical
            protein CICLE_v10031769mg [Citrus clementina]
          Length = 391

 Score =  340 bits (871), Expect = 9e-91
 Identities = 169/350 (48%), Positives = 222/350 (63%), Gaps = 12/350 (3%)
 Frame = +3

Query: 111  SNTQKKKRHKP------------SYSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXX 254
            S++ KKKR KP            S S  +  EP     PS K D  RLI           
Sbjct: 2    SSSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPS-KQDLLRLITVVAIASSVAL 60

Query: 255  XXXXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQ 434
                    LN   KPFCDS                 CE CP++G C++GKL C HG+RK 
Sbjct: 61   TCNYLANFLNSTSKPFCDSNLLLDSPQSPTDS----CEPCPSNGECHQGKLECFHGYRKH 116

Query: 435  GKRCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDY 614
            GK CVEDGDIN  A +LS+WVE+ +C A AQFLC GTG  W   +++ ++L+ + +   +
Sbjct: 117  GKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIF 176

Query: 615  DLDKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHAL 794
            +LD  +Y+  +++ ME +   LE+R    G++ELKCPE+L  HYKPLSC   Q +  HAL
Sbjct: 177  ELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHAL 236

Query: 795  LLVPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVA 974
            ++VP C+L+ GC+L+  + ++R   ++R E+LYH+VC+ILEE  ++SKS +GE EPWVVA
Sbjct: 237  IIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVA 296

Query: 975  SLLRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            S LRDHLLLPKERKDP +W+KVEELV EDSRVDQYPKL+KGESKVVWEWQ
Sbjct: 297  SRLRDHLLLPKERKDPVIWKKVEELVQEDSRVDQYPKLLKGESKVVWEWQ 346


>ref|XP_002282079.1| PREDICTED: uncharacterized protein LOC100243743 [Vitis vinifera]
            gi|297742158|emb|CBI33945.3| unnamed protein product
            [Vitis vinifera]
          Length = 383

 Score =  321 bits (822), Expect = 4e-85
 Identities = 165/335 (49%), Positives = 211/335 (62%), Gaps = 2/335 (0%)
 Frame = +3

Query: 126  KKRHKPSYSTLFPA--EPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSLNQPPKP 299
            K  H PS S+   A  EP  +  PS K +  +L+                   L++  KP
Sbjct: 14   KSTHSPSSSSSLNALMEPPENFFPS-KPELFKLLAVIAIATSVAALCNYVVTILSRHSKP 72

Query: 300  FCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGDINRAAK 479
            FCD+                 CE CP++  CY+G + C  G+RK GK C+EDGDIN  AK
Sbjct: 73   FCDTNADSQYLPSDL------CEPCPSNAECYQGMMECVRGYRKHGKLCIEDGDINETAK 126

Query: 480  KLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMPARQKAM 659
            KL+  +E HVC   AQFLC GTG  W   D + +++D+  + ++  L+ AI M  +Q+AM
Sbjct: 127  KLANRIETHVCEGYAQFLC-GTGSVWVQEDEVWNDVDELKMMENLGLENAIDMHTKQRAM 185

Query: 660  EAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALIAGCILI 839
            E I   LET+   +G++ELKCP +L  HYKP SC  +Q I  HAL+L+P C L+ G IL+
Sbjct: 186  EMIDGLLETKINHRGIKELKCPNLLAEHYKPFSCRVQQWISNHALVLMPICGLLVGSILL 245

Query: 840  SCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLLPKERKD 1019
              R  QR  LS RAE+LY+++CDILEE  +++K G GEGEPWVV S LRDHLLLPKERKD
Sbjct: 246  LRRIRQRRNLSARAEELYNQICDILEENAMMTKGGDGEGEPWVVVSWLRDHLLLPKERKD 305

Query: 1020 PFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            P LWRKVEELV EDSR+D+YPKLVKGESKVVWEWQ
Sbjct: 306  PLLWRKVEELVQEDSRLDRYPKLVKGESKVVWEWQ 340


>ref|XP_007039584.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508776829|gb|EOY24085.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 350

 Score =  315 bits (807), Expect = 2e-83
 Identities = 163/348 (46%), Positives = 210/348 (60%), Gaps = 10/348 (2%)
 Frame = +3

Query: 111  SNTQKKKRHKPSYSTLFPAEPSSDSLPS----------TKADFSRLIXXXXXXXXXXXXX 260
            S++  KKR KP +++   +  S  SL S          +K +F RLI             
Sbjct: 2    SSSTPKKRPKPKHNSPSKSSTSKSSLNSILEPPQSLFPSKGEFFRLIAVLAIASSVALSC 61

Query: 261  XXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGK 440
                       KPFCDS                 CE CP++G CYEGKL C HG+R+ GK
Sbjct: 62   NFFATFFTSTSKPFCDSNLDSIDSLSDS------CEPCPSNGECYEGKLECIHGYRRHGK 115

Query: 441  RCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDL 620
             CVED DIN  AKK SKW+E  +C A AQ LC GT   W    ++ ++LD + +  ++  
Sbjct: 116  LCVEDKDINETAKKFSKWLEVRLCEAYAQSLCYGTVTVWAREHDIWNDLDGHELMQNFGP 175

Query: 621  DKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLL 800
            D A Y+ A+++ ME I   LETR    G++E+KCP+ L  +YKP +C  RQ I  HAL++
Sbjct: 176  DNATYLYAKRRVMETIVKLLETRINSHGIQEVKCPDSLAEYYKPFTCRIRQLISNHALII 235

Query: 801  VPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASL 980
            VP CA + G  ++    +Q+  LS R E+LYH+VCD+LEEK + SKS +G GE WVVAS 
Sbjct: 236  VPVCAGLVGFAMLFWNVHQKRCLSARVEELYHQVCDMLEEKALRSKSVNGGGESWVVASW 295

Query: 981  LRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            LRDHLL P+ERKDP LW+KVEELV EDSRVD+YPKLVKGESKVVWEWQ
Sbjct: 296  LRDHLLFPRERKDPHLWKKVEELVQEDSRVDRYPKLVKGESKVVWEWQ 343


>ref|XP_007039582.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776827|gb|EOY24083.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 388

 Score =  315 bits (807), Expect = 2e-83
 Identities = 163/348 (46%), Positives = 210/348 (60%), Gaps = 10/348 (2%)
 Frame = +3

Query: 111  SNTQKKKRHKPSYSTLFPAEPSSDSLPS----------TKADFSRLIXXXXXXXXXXXXX 260
            S++  KKR KP +++   +  S  SL S          +K +F RLI             
Sbjct: 2    SSSTPKKRPKPKHNSPSKSSTSKSSLNSILEPPQSLFPSKGEFFRLIAVLAIASSVALSC 61

Query: 261  XXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGK 440
                       KPFCDS                 CE CP++G CYEGKL C HG+R+ GK
Sbjct: 62   NFFATFFTSTSKPFCDSNLDSIDSLSDS------CEPCPSNGECYEGKLECIHGYRRHGK 115

Query: 441  RCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDL 620
             CVED DIN  AKK SKW+E  +C A AQ LC GT   W    ++ ++LD + +  ++  
Sbjct: 116  LCVEDKDINETAKKFSKWLEVRLCEAYAQSLCYGTVTVWAREHDIWNDLDGHELMQNFGP 175

Query: 621  DKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLL 800
            D A Y+ A+++ ME I   LETR    G++E+KCP+ L  +YKP +C  RQ I  HAL++
Sbjct: 176  DNATYLYAKRRVMETIVKLLETRINSHGIQEVKCPDSLAEYYKPFTCRIRQLISNHALII 235

Query: 801  VPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASL 980
            VP CA + G  ++    +Q+  LS R E+LYH+VCD+LEEK + SKS +G GE WVVAS 
Sbjct: 236  VPVCAGLVGFAMLFWNVHQKRCLSARVEELYHQVCDMLEEKALRSKSVNGGGESWVVASW 295

Query: 981  LRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            LRDHLL P+ERKDP LW+KVEELV EDSRVD+YPKLVKGESKVVWEWQ
Sbjct: 296  LRDHLLFPRERKDPHLWKKVEELVQEDSRVDRYPKLVKGESKVVWEWQ 343


>ref|XP_002528762.1| conserved hypothetical protein [Ricinus communis]
            gi|223531765|gb|EEF33584.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 373

 Score =  314 bits (805), Expect = 4e-83
 Identities = 157/341 (46%), Positives = 211/341 (61%)
 Frame = +3

Query: 102  ASESNTQKKKRHKPSYSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSL 281
            +S +N ++K    PS S      P ++  PS K +F RLI                   +
Sbjct: 3    SSSTNKRRKPNLSPSSSPTLLTGPPNNLFPS-KEEFVRLIAVLAIASSVAFTCNLIATYI 61

Query: 282  NQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGD 461
            N   KPFCDS                +C  CP +G C +GKL C  G+RK    C+EDGD
Sbjct: 62   NPSTKPFCDSNTDSFSE---------FCVPCPENGECTQGKLECAEGYRKHRNICIEDGD 112

Query: 462  INRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMP 641
            IN  AKKLS+WVE+H+C A AQ+LC G G  WF  +++  +LD + + +++  D A Y+ 
Sbjct: 113  INERAKKLSEWVENHLCEAYAQYLCDGIGTIWFQDNDIWYDLDGHQLMENFQPDNATYIY 172

Query: 642  ARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALI 821
            A++KAME I   LE R    G +ELKCP+++  HYKP +C  RQ I  HA ++   C+L+
Sbjct: 173  AKRKAMEMIVRLLEIRTNSHGNKELKCPDLVAEHYKPFTCRFRQWISNHAFVIASLCSLV 232

Query: 822  AGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLL 1001
             G +L+  +  +R  LS R E+LYH+VC++LEE  ++SK  +GE + WVVAS LRDHLLL
Sbjct: 233  VGAVLLLRKLQRRWYLSARGEELYHQVCEVLEENALMSKQSNGECDSWVVASQLRDHLLL 292

Query: 1002 PKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            PKERKDP LW++VE+LV EDSRVD+YPKLVKGESKVVWEWQ
Sbjct: 293  PKERKDPVLWKRVEQLVQEDSRVDRYPKLVKGESKVVWEWQ 333


>ref|XP_007211018.1| hypothetical protein PRUPE_ppa020378mg [Prunus persica]
            gi|462406753|gb|EMJ12217.1| hypothetical protein
            PRUPE_ppa020378mg [Prunus persica]
          Length = 380

 Score =  310 bits (794), Expect = 8e-82
 Identities = 166/348 (47%), Positives = 217/348 (62%), Gaps = 6/348 (1%)
 Frame = +3

Query: 99   MASESNTQKKKRHKPSYSTLFPA-----EPSSDSLPSTKADFSRLIXXXXXXXXXXXXXX 263
            M+S S  + K + K S  +   +     EPS +  PS K +FSRL               
Sbjct: 1    MSSTSKKRPKPKPKRSPESSLSSIASTLEPSQNFFPS-KEEFSRLTVALAIAASVALTLN 59

Query: 264  XXXGSLNQP-PKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGK 440
                +L  P  KPFCDS+                CE CP++G C++GK+ C  GF+K+GK
Sbjct: 60   FLSSTLINPHSKPFCDSSLDSLDFLPDS------CEPCPSNGQCFQGKMECLQGFKKRGK 113

Query: 441  RCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDL 620
             C+EDGDIN  AKKL++ VE  +CGA AQFLC GT   W   +++ ++LDK  + +    
Sbjct: 114  LCIEDGDINETAKKLAERVEIRLCGALAQFLCYGTETIWVEENDIWNDLDKRELLEHVP- 172

Query: 621  DKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLL 800
            D AIYM  +++ ME +   L+TR   +GV+ELKCP++L  HYKP SC  RQ I +HALL+
Sbjct: 173  DNAIYMYTKERTMETVNRMLDTRTSSRGVKELKCPDMLAEHYKPFSCRIRQWISEHALLI 232

Query: 801  VPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASL 980
            +  CAL+ G   I  + ++R  LS R ++LY +VC++LEEK  +SKS + E EPWVVAS 
Sbjct: 233  LRVCALLVGSTFILWKLHRRRCLSTRVDELYQQVCEVLEEKAFMSKSVNSECEPWVVASR 292

Query: 981  LRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            LRD LLLPKERKDP LW+KVEELV EDS VD YPKLVKGESKVVWEWQ
Sbjct: 293  LRDRLLLPKERKDPVLWKKVEELVQEDSHVDCYPKLVKGESKVVWEWQ 340


>ref|XP_006368453.1| hypothetical protein POPTR_0001s02940g [Populus trichocarpa]
            gi|550346367|gb|ERP65022.1| hypothetical protein
            POPTR_0001s02940g [Populus trichocarpa]
          Length = 384

 Score =  304 bits (779), Expect = 4e-80
 Identities = 155/326 (47%), Positives = 200/326 (61%)
 Frame = +3

Query: 147  YSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSLNQPPKPFCDSTXXXX 326
            Y+     EP  +  PS K +F RLI                   ++   KPFCD++    
Sbjct: 21   YTISSKIEPPHNLFPS-KQEFLRLIAVLAIASSVALTCNFIANYIDHSTKPFCDTSLDSS 79

Query: 327  XXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGDINRAAKKLSKWVEDH 506
                        CE CP +G C +GKL C  G+RK    C+EDGD+   AKKL + VE+H
Sbjct: 80   DSLSNS------CEPCPRNGECNQGKLECARGYRKHRNTCIEDGDVYERAKKLLEGVENH 133

Query: 507  VCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMPARQKAMEAIRSALET 686
            +C A A FLC GTG  W   D++L++LD + +  +Y  D  +Y   + KAME I   L+T
Sbjct: 134  LCEAYADFLCYGTGIMWVQEDDILNDLDGHQLLKNYSSDNPVYAYTKMKAMETISEELQT 193

Query: 687  RRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALIAGCILISCRAYQRHL 866
            R    G +E KCP++LV HYKP +C  RQ I +HAL++VP CAL+ G   +  +  +R  
Sbjct: 194  RTNPNGKKEFKCPDLLVEHYKPFTCHLRQWISEHALVIVPVCALVVGFAFLVWKIRRRWY 253

Query: 867  LSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLLPKERKDPFLWRKVEE 1046
            LS R E+LYH+VCDILEE+ ++SK  + E EPWVVAS LRDHLL PKERKD  LW+KVE+
Sbjct: 254  LSTRGEELYHQVCDILEERALMSKRVNAECEPWVVASRLRDHLLSPKERKDFVLWKKVED 313

Query: 1047 LVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            LV EDSRVD+YPKLVKGESKVVWEWQ
Sbjct: 314  LVREDSRVDRYPKLVKGESKVVWEWQ 339


>ref|XP_004511767.1| PREDICTED: uncharacterized protein LOC101498686 [Cicer arietinum]
          Length = 391

 Score =  302 bits (773), Expect = 2e-79
 Identities = 156/344 (45%), Positives = 211/344 (61%), Gaps = 1/344 (0%)
 Frame = +3

Query: 96   RMASESNTQKKKRHKPSYSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXG 275
            + +   ++  K +   S  ++   EP  +  PS K +F RLI                  
Sbjct: 14   KSSGRGSSSVKMKSSSSSISIIEKEPPPNLFPS-KHEFPRLIVVITVASLVAWTCNLLFT 72

Query: 276  SLNQPP-KPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVE 452
            SL  PP KPFCDS                 CE CP++G C +GKL C  G++K G  CVE
Sbjct: 73   SLLHPPTKPFCDSNLNSYDFFPDN------CEPCPSNGECNDGKLECLSGYQKHGNLCVE 126

Query: 453  DGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAI 632
            DGDIN +A+K+ + VE H+CG  AQ+LCSGTG  W   D+L +  +  G   +   D A+
Sbjct: 127  DGDINESARKIVEKVEHHLCGEYAQYLCSGTGSIWVHDDDLWNYFEPVGNVKE---DNAL 183

Query: 633  YMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPAC 812
            Y   +QKA + +   LE R    G++E KCP++LV HYK  +C  RQ I +H ++++P C
Sbjct: 184  YKYTKQKAFDTMDKLLEMRLNSHGMKEFKCPDLLVEHYKSYACRFRQWITQHIIVVLPIC 243

Query: 813  ALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDH 992
            A++ GC ++   A ++  +S R E+LY++VC+ILEE  + SKS +GE EPWVVAS LRDH
Sbjct: 244  AMLVGCTILFTNARRKLRMSRRVEELYNKVCEILEENALTSKSVNGECEPWVVASRLRDH 303

Query: 993  LLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            LLLP+ERKDP LW+KVEELV EDSR+D+YPKLVKGESKVVWEWQ
Sbjct: 304  LLLPRERKDPLLWKKVEELVQEDSRIDRYPKLVKGESKVVWEWQ 347


>ref|XP_007039583.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508776828|gb|EOY24084.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 378

 Score =  293 bits (750), Expect = 1e-76
 Identities = 154/343 (44%), Positives = 202/343 (58%), Gaps = 10/343 (2%)
 Frame = +3

Query: 111  SNTQKKKRHKPSYSTLFPAEPSSDSLPS----------TKADFSRLIXXXXXXXXXXXXX 260
            S++  KKR KP +++   +  S  SL S          +K +F RLI             
Sbjct: 2    SSSTPKKRPKPKHNSPSKSSTSKSSLNSILEPPQSLFPSKGEFFRLIAVLAIASSVALSC 61

Query: 261  XXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGK 440
                       KPFCDS                 CE CP++G CYEGKL C HG+R+ GK
Sbjct: 62   NFFATFFTSTSKPFCDSNLDSIDSLSDS------CEPCPSNGECYEGKLECIHGYRRHGK 115

Query: 441  RCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDL 620
             CVED DIN  AKK SKW+E  +C A AQ LC GT   W    ++ ++LD + +  ++  
Sbjct: 116  LCVEDKDINETAKKFSKWLEVRLCEAYAQSLCYGTVTVWAREHDIWNDLDGHELMQNFGP 175

Query: 621  DKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLL 800
            D A Y+ A+++ ME I   LETR    G++E+KCP+ L  +YKP +C  RQ I  HAL++
Sbjct: 176  DNATYLYAKRRVMETIVKLLETRINSHGIQEVKCPDSLAEYYKPFTCRIRQLISNHALII 235

Query: 801  VPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASL 980
            VP CA + G  ++    +Q+  LS R E+LYH+VCD+LEEK + SKS +G GE WVVAS 
Sbjct: 236  VPVCAGLVGFAMLFWNVHQKRCLSARVEELYHQVCDMLEEKALRSKSVNGGGESWVVASW 295

Query: 981  LRDHLLLPKERKDPFLWRKVEELVLEDSRVDQYPKLVKGESKV 1109
            LRDHLL P+ERKDP LW+KVEELV EDSRVD+YPKLVK E  +
Sbjct: 296  LRDHLLFPRERKDPHLWKKVEELVQEDSRVDRYPKLVKVEGSL 338


>ref|XP_003538768.1| PREDICTED: uncharacterized protein LOC100784375 isoform X1 [Glycine
            max]
          Length = 381

 Score =  293 bits (749), Expect = 1e-76
 Identities = 151/326 (46%), Positives = 202/326 (61%), Gaps = 2/326 (0%)
 Frame = +3

Query: 153  TLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSLNQPP-KPFCDSTXXXXX 329
            +L   EP  + LPS K DF RL+                  SL  PP KPFCD+      
Sbjct: 22   SLMGREPPQNLLPS-KHDFPRLVLVIALASLVAWTCNFLFTSLFHPPSKPFCDTNLHSPD 80

Query: 330  XXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGDINRAAKKLSKWVEDHV 509
                       C+ CP++G C +GKL C  G+++ G  C EDGDIN +A+KL + VE H+
Sbjct: 81   YFLDI------CQPCPSNGECNDGKLECHQGYQRHGNLCAEDGDINESARKLLERVEHHL 134

Query: 510  CGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMPARQKAMEAIRSALETR 689
            C   AQFLC+GTG  W   D+L +  +  G   +  +D A+Y   +Q+A+E +   LETR
Sbjct: 135  CEKYAQFLCTGTGIIWVHEDDLWNYFEPVG---NVKVDNALYNYTKQRAVETMGKLLETR 191

Query: 690  -RGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALIAGCILISCRAYQRHL 866
                 G++E KCP+ L  HYKP +C  RQ I +H L+++P CA++ GC  +     Q+  
Sbjct: 192  LNSSHGMKEFKCPDQLAEHYKPYTCCIRQWISQHILVVLPICAMLVGCTALCWNVRQKLS 251

Query: 867  LSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLLPKERKDPFLWRKVEE 1046
            +S R E+LY +VC+ILE+  + SKS +GE EPWVVAS LRDHLLLP+ERK+P LW+K+EE
Sbjct: 252  MSRRVEELYDKVCEILEDNALTSKSANGECEPWVVASRLRDHLLLPRERKNPLLWKKLEE 311

Query: 1047 LVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            LV EDSR+D+YPKLVKGESKVVWEWQ
Sbjct: 312  LVQEDSRIDRYPKLVKGESKVVWEWQ 337


>ref|XP_003611437.1| hypothetical protein MTR_5g014010 [Medicago truncatula]
            gi|355512772|gb|AES94395.1| hypothetical protein
            MTR_5g014010 [Medicago truncatula]
          Length = 374

 Score =  290 bits (743), Expect = 6e-76
 Identities = 153/335 (45%), Positives = 205/335 (61%), Gaps = 1/335 (0%)
 Frame = +3

Query: 123  KKKRHKPSYSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSLNQPP-KP 299
            K  R     S++   EP  + LPS K +F +L+                  S   P  KP
Sbjct: 7    KSSREVKLKSSIIDKEPPPNLLPS-KHEFPKLLLVLTVASLVAWSSNLLFTSFLHPSTKP 65

Query: 300  FCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGDINRAAK 479
            FCD+                 CE CP++G C +GKL C  G++K G  CVEDGDIN +A+
Sbjct: 66   FCDTNSLHNHFPDS-------CEPCPSNGECNDGKLECLRGYQKHGNLCVEDGDINDSAR 118

Query: 480  KLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMPARQKAM 659
            K++  VE H+CG  AQFLCSGTG  W   D+L + ++     ++     A+Y   +QKA 
Sbjct: 119  KIADTVERHLCGEYAQFLCSGTGSIWVHDDDLWNYIEPV---ENVKEGNALYNYTKQKAF 175

Query: 660  EAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALIAGCILI 839
            + +   LE R    G++E KCP+ LV  YKP +C  RQ I +H L+++P CA++ GC+++
Sbjct: 176  DMMDKLLEMRLTTHGMKEFKCPDSLVEQYKPYACRLRQWITQHILVVLPICAMLVGCMIL 235

Query: 840  SCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLLPKERKD 1019
                 ++  +S R E+LY++VC+ILEE  + SKS +GE EPWVVAS LRDHLLLP+ERKD
Sbjct: 236  FWNVRRKLRVSRRVEELYNKVCEILEENALTSKSVNGECEPWVVASRLRDHLLLPRERKD 295

Query: 1020 PFLWRKVEELVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            P LW+KVEELV EDSRVD+YPKLVKGESKVVWEWQ
Sbjct: 296  PLLWKKVEELVQEDSRVDRYPKLVKGESKVVWEWQ 330


>ref|XP_004148518.1| PREDICTED: uncharacterized protein LOC101208017 [Cucumis sativus]
          Length = 404

 Score =  286 bits (733), Expect = 9e-75
 Identities = 143/320 (44%), Positives = 194/320 (60%), Gaps = 1/320 (0%)
 Frame = +3

Query: 168  EPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSLN-QPPKPFCDSTXXXXXXXXXX 344
            EP  D  PS K D + LI                   L+ + P PFCD+           
Sbjct: 43   EPPRDFFPS-KDDLAALITVLIIACFVFVSCNFFVSRLSSRHPIPFCDTDADSSDFISDV 101

Query: 345  XXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGDINRAAKKLSKWVEDHVCGACA 524
                  CE CP HG C +GKL C HG+RK G+ C+EDG IN A  KLS+W+E H+C A A
Sbjct: 102  ------CEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLSEWLESHLCEANA 155

Query: 525  QFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMPARQKAMEAIRSALETRRGDQG 704
            +FLC G G  W   +++ D+LD   + +    D    M A+ KA+E I   L+TR+   G
Sbjct: 156  KFLCDGIGIVWVKENDIWDDLDGKELVESIGSDNTTLMYAKSKALETIGGLLQTRQNSLG 215

Query: 705  VEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALIAGCILISCRAYQRHLLSVRAE 884
            ++ELKCP++L   YKP +C  R  +++HA +++P   L+ GC  +  + Y+R  L+ RAE
Sbjct: 216  IKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAE 275

Query: 885  QLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLLPKERKDPFLWRKVEELVLEDS 1064
             LY++VC+ILEE  + S   SG+ E WVVAS LRDHLLLP+ER++P LW+KVEELV EDS
Sbjct: 276  DLYNQVCEILEENALTSTRNSGQCESWVVASRLRDHLLLPRERRNPLLWKKVEELVQEDS 335

Query: 1065 RVDQYPKLVKGESKVVWEWQ 1124
            R+D+YP+LVKG+ K VWEWQ
Sbjct: 336  RIDRYPRLVKGDGKEVWEWQ 355


>ref|XP_006485401.1| PREDICTED: uncharacterized protein LOC102629601 isoform X4 [Citrus
            sinensis]
          Length = 352

 Score =  286 bits (731), Expect = 2e-74
 Identities = 143/322 (44%), Positives = 195/322 (60%), Gaps = 12/322 (3%)
 Frame = +3

Query: 111  SNTQKKKRHKP------------SYSTLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXX 254
            S++ KKKR KP            S S  +  EP     PS K D  RLI           
Sbjct: 2    SSSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPS-KQDLLRLITVVAIASSVAL 60

Query: 255  XXXXXXGSLNQPPKPFCDSTXXXXXXXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQ 434
                    LN   KPFCDS                 CE CP++G C++GKL C HG+RK 
Sbjct: 61   TCNYLANFLNSTSKPFCDSNLLLDSPQSPTDS----CEPCPSNGECHQGKLECFHGYRKH 116

Query: 435  GKRCVEDGDINRAAKKLSKWVEDHVCGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDY 614
            GK CVEDGDIN  A +LS+WVE+ +C A AQFLC GTG  W   +++ ++L+ + +   +
Sbjct: 117  GKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIF 176

Query: 615  DLDKAIYMPARQKAMEAIRSALETRRGDQGVEELKCPEILVGHYKPLSCVARQCIIKHAL 794
            +LD  +Y+  +++ ME +   LE+R    G++ELKCPE+L  HYKPLSC   Q +  HAL
Sbjct: 177  ELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHAL 236

Query: 795  LLVPACALIAGCILISCRAYQRHLLSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVA 974
            ++VP C+L+ GC+L+  + ++R   ++R E+LYH+VC+ILEE  ++SKS +GE EPWVVA
Sbjct: 237  IIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVA 296

Query: 975  SLLRDHLLLPKERKDPFLWRKV 1040
            S LRDHLLLPKERKDP +W+KV
Sbjct: 297  SRLRDHLLLPKERKDPVIWKKV 318


>ref|XP_003516643.1| PREDICTED: uncharacterized protein LOC100779650 [Glycine max]
          Length = 377

 Score =  285 bits (730), Expect = 2e-74
 Identities = 149/325 (45%), Positives = 199/325 (61%), Gaps = 1/325 (0%)
 Frame = +3

Query: 153  TLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSLNQPPKPFCDSTXXXXXX 332
            +L   EP  + LPS K DF RL+                   L  P KPFCD        
Sbjct: 22   SLMGREPPQNLLPS-KHDFPRLVLVVALASLVAWTCNF----LFTPSKPFCDPNLHSPDY 76

Query: 333  XXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGDINRAAKKLSKWVEDHVC 512
                      CE CP++G C +GKL C  G+++ G  CVEDGDIN +A+KL + VE H+C
Sbjct: 77   FSDI------CEPCPSNGECNDGKLKCLQGYQRHGNLCVEDGDINESARKLLERVEHHLC 130

Query: 513  GACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMPARQKAMEAIRSALETR- 689
               AQFLC+GTG  W   D+L +  +  G   +  +D A+Y   +QKA E +   L+TR 
Sbjct: 131  EEYAQFLCTGTGTIWVREDDLWNYFEPVG---NVKVDNALYKYTKQKAFETMGKLLDTRL 187

Query: 690  RGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALIAGCILISCRAYQRHLL 869
                G++E KCP+ L  HYK  +C  RQ I +H L+++P CA++ GC  +     Q+  +
Sbjct: 188  NSSHGMKEFKCPDQLAEHYKSYACCIRQWISQHILVVLPICAMLVGCTALFWSVRQKLCM 247

Query: 870  SVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLLPKERKDPFLWRKVEEL 1049
            S R E+LY++VC+ILEE  + SKS +GE EPWVV+S LRDHLLLP+ERK+P LW+KVE++
Sbjct: 248  SRRIEELYNKVCEILEENALTSKSANGECEPWVVSSRLRDHLLLPRERKNPLLWKKVEKM 307

Query: 1050 VLEDSRVDQYPKLVKGESKVVWEWQ 1124
            V EDSR+D+YPKLVKGESKVVWEWQ
Sbjct: 308  VQEDSRIDRYPKLVKGESKVVWEWQ 332


>ref|XP_007156635.1| hypothetical protein PHAVU_002G004700g [Phaseolus vulgaris]
            gi|561030050|gb|ESW28629.1| hypothetical protein
            PHAVU_002G004700g [Phaseolus vulgaris]
          Length = 383

 Score =  284 bits (727), Expect = 5e-74
 Identities = 148/326 (45%), Positives = 201/326 (61%), Gaps = 2/326 (0%)
 Frame = +3

Query: 153  TLFPAEPSSDSLPSTKADFSRLIXXXXXXXXXXXXXXXXXGSL-NQPPKPFCDSTXXXXX 329
            +L   EP  + LPS K D  RL+                  SL   P KPFCD+      
Sbjct: 24   SLMGREPPQNLLPS-KHDLPRLVLVLALASLVAWTCNFLFTSLLRSPSKPFCDTNFHSPD 82

Query: 330  XXXXXXXXXXYCEACPAHGVCYEGKLTCDHGFRKQGKRCVEDGDINRAAKKLSKWVEDHV 509
                       CE CP++G C +GKL C  G+++ G  CVEDGDI+++A+K+ + VE H+
Sbjct: 83   YFPDA------CEPCPSNGECNDGKLECLQGYQRHGNLCVEDGDISQSARKIVERVERHL 136

Query: 510  CGACAQFLCSGTGKCWFGVDNLLDNLDKYGISDDYDLDKAIYMPARQKAMEAIRSALETR 689
            C   AQFLCSGTG  W   D L ++       ++  +D A++   +Q+A+E +   LETR
Sbjct: 137  CEGYAQFLCSGTGPMWVPEDVLWNHFQPV---ENVKVDNALHNYTKQRAVETMGKLLETR 193

Query: 690  -RGDQGVEELKCPEILVGHYKPLSCVARQCIIKHALLLVPACALIAGCILISCRAYQRHL 866
                 G++E KCP++L  HYKP +C  RQ + +H L+++P CA++ GCI +     ++  
Sbjct: 194  LNNSHGMKEFKCPDLLAVHYKPYTCCIRQWVSQHILVVLPICAMLVGCITLFWSIRRKLS 253

Query: 867  LSVRAEQLYHEVCDILEEKPIVSKSGSGEGEPWVVASLLRDHLLLPKERKDPFLWRKVEE 1046
            +S R E+LY +VC+ILE+  + SKS +GE EPW VAS LRDHLLLP+ERK+P LWRKVEE
Sbjct: 254  MSRRVEELYDKVCEILEDNALTSKSANGECEPWFVASRLRDHLLLPRERKNPLLWRKVEE 313

Query: 1047 LVLEDSRVDQYPKLVKGESKVVWEWQ 1124
            LV EDSR+D YPKLVKGESKVVWEWQ
Sbjct: 314  LVQEDSRIDCYPKLVKGESKVVWEWQ 339

