BLASTX nr result

ID: Paeonia23_contig00003736 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00003736
         (1757 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002881741.1| hypothetical protein ARALYDRAFT_903376 [Arab...   407   e-110
ref|XP_006411346.1| hypothetical protein EUTSA_v10016808mg [Eutr...   403   e-109
ref|NP_181612.1| uncharacterized protein [Arabidopsis thaliana] ...   402   e-109
ref|XP_006371317.1| hypothetical protein POPTR_0019s09020g [Popu...   402   e-109
ref|XP_002270952.1| PREDICTED: uncharacterized protein LOC100265...   395   e-107
ref|NP_191202.1| uncharacterized protein [Arabidopsis thaliana] ...   395   e-107
ref|XP_002878085.1| hypothetical protein ARALYDRAFT_907085 [Arab...   395   e-107
ref|XP_007218249.1| hypothetical protein PRUPE_ppa008050mg [Prun...   390   e-106
ref|XP_006435962.1| hypothetical protein CICLE_v10031884mg [Citr...   389   e-105
ref|XP_004141508.1| PREDICTED: uncharacterized protein LOC101215...   389   e-105
ref|XP_002520894.1| conserved hypothetical protein [Ricinus comm...   389   e-105
ref|XP_006402987.1| hypothetical protein EUTSA_v10006036mg [Eutr...   388   e-105
ref|XP_006291135.1| hypothetical protein CARUB_v10017250mg [Caps...   383   e-103
ref|XP_007011394.1| Uncharacterized protein TCM_045580 [Theobrom...   380   e-102
ref|XP_004167240.1| PREDICTED: uncharacterized protein LOC101225...   377   e-101
ref|XP_003526791.1| PREDICTED: uncharacterized protein LOC100819...   374   e-100
ref|XP_006358158.1| PREDICTED: uncharacterized protein LOC102588...   372   e-100
ref|XP_004235216.1| PREDICTED: uncharacterized protein LOC101251...   370   1e-99
ref|XP_007136404.1| hypothetical protein PHAVU_009G042200g [Phas...   367   9e-99
gb|EXC21106.1| hypothetical protein L484_017118 [Morus notabilis]     365   3e-98

>ref|XP_002881741.1| hypothetical protein ARALYDRAFT_903376 [Arabidopsis lyrata subsp.
            lyrata] gi|297327580|gb|EFH58000.1| hypothetical protein
            ARALYDRAFT_903376 [Arabidopsis lyrata subsp. lyrata]
          Length = 377

 Score =  407 bits (1045), Expect = e-110
 Identities = 221/405 (54%), Positives = 270/405 (66%), Gaps = 4/405 (0%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTF---LSSRFQSKPP 440
            M +PS  + IQG ++ HY   + +     + SA+ + +  +   S      S    S  P
Sbjct: 1    MVRPSDFKAIQGFIRLHYTRVNPVTIGRSNPSALSSPAIPSNGVSQLQPKFSFHSLSSKP 60

Query: 441  TSSNWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLN 620
            TS+N GLSQ+LS  K                      L+PK            LG  ++N
Sbjct: 61   TSTNVGLSQILSSPK----------------------LNPKLQ--------QALGLPRVN 90

Query: 621  LN-SSGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRNSM 797
            ++ +S FR  S+K+S   K +GNFA+ V+DKPV AV S  +RYR+A+GL +DAFWK+NS+
Sbjct: 91   VSFASSFRLVSNKSSGFRKIDGNFARKVVDKPVKAVSSTFARYREAIGLHVDAFWKKNSL 150

Query: 798  XXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAINP 977
                          WR+MFGIA +FVGLSEGMAKYGFLALSSAIVAFSGLY+RSRF INP
Sbjct: 151  VVFGAGGVFVCIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFSGLYLRSRFTINP 210

Query: 978  DKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPI 1157
            DKVYR+ M K+NT+A ILE+MGAPLSG+ LRAYVMSGGG+T K FKPT+RSKRCFL+FP+
Sbjct: 211  DKVYRMTMRKINTAAEILEVMGAPLSGSDLRAYVMSGGGITFKKFKPTIRSKRCFLLFPV 270

Query: 1158 RGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISELRD 1337
            +GSERKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDE EY+VGGGLIS LRD
Sbjct: 271  QGSERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYRVGGGLISVLRD 330

Query: 1338 PVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLESK 1472
            PVVKAMAATK                      RKH+EEIEKLE +
Sbjct: 331  PVVKAMAATKEFDNLDRIEEEEDAERELEEAERKHREEIEKLEKE 375


>ref|XP_006411346.1| hypothetical protein EUTSA_v10016808mg [Eutrema salsugineum]
            gi|557112515|gb|ESQ52799.1| hypothetical protein
            EUTSA_v10016808mg [Eutrema salsugineum]
          Length = 377

 Score =  403 bits (1036), Expect = e-109
 Identities = 227/408 (55%), Positives = 272/408 (66%), Gaps = 6/408 (1%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTFLSSRFQ----SKP 437
            M KPS  R + G ++ HY   + + + G S     +S A   ++   L  RF     S  
Sbjct: 1    MVKPSDFRSVHGFIRLHYSRVNPV-TIGRSNPPPLSSPAIPSNSVPQLQPRFSFHSLSSK 59

Query: 438  PTSSNWGLS-QLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGK 614
            PTS+N G S Q+LS  K                      L+PK            LG  +
Sbjct: 60   PTSTNVGFSSQVLSCPK----------------------LNPKLQ---------ALGLPR 88

Query: 615  LNLN-SSGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRN 791
            +N+N SS FR  S+K+S   K +GNFA+ V+D+PV AV S  +RYR+A+GL IDAFWK+N
Sbjct: 89   VNVNYSSAFRLVSTKSSGFRKVDGNFARKVVDRPVKAVSSTFARYREAIGLHIDAFWKKN 148

Query: 792  SMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAI 971
            S+              WR+MFGIA +FVGLSEGMAKYGFLALSSAIVAFSGLY+RSRF I
Sbjct: 149  SLILFGAGGVFVCIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFSGLYLRSRFTI 208

Query: 972  NPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIF 1151
            NPDKVYR+ M K+NT+A ILE+MGAPLSG+ LRAYVMSGGG+T K FKPT+RSKRCFL+F
Sbjct: 209  NPDKVYRMTMRKINTAAEILEVMGAPLSGSDLRAYVMSGGGITFKKFKPTIRSKRCFLLF 268

Query: 1152 PIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISEL 1331
            P++GSERKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDE EY+VGGGLISEL
Sbjct: 269  PVQGSERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYRVGGGLISEL 328

Query: 1332 RDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLESKS 1475
            RDPVVKAMAATK                      RKH+EEIEKLE +S
Sbjct: 329  RDPVVKAMAATKEFDNLDRIEEEEDAERELEEAERKHREEIEKLEKES 376


>ref|NP_181612.1| uncharacterized protein [Arabidopsis thaliana]
            gi|17473709|gb|AAL38308.1| unknown protein [Arabidopsis
            thaliana] gi|20148507|gb|AAM10144.1| unknown protein
            [Arabidopsis thaliana] gi|330254786|gb|AEC09880.1|
            uncharacterized protein AT2G40800 [Arabidopsis thaliana]
          Length = 377

 Score =  402 bits (1033), Expect = e-109
 Identities = 219/405 (54%), Positives = 268/405 (66%), Gaps = 4/405 (0%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTF---LSSRFQSKPP 440
            M KPS  + IQG ++ HY   + +     + SA+ + +  +   S      S    S  P
Sbjct: 1    MVKPSDFKAIQGFIRLHYTRVNPVTIGRSNPSALSSPAIPSNGVSQLQPKFSFHSLSSKP 60

Query: 441  TSSNWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLN 620
            TS N GL Q+LS  K                      L+PK            LG  ++N
Sbjct: 61   TSKNVGLYQILSSPK----------------------LNPKLQ--------QALGLPRVN 90

Query: 621  LN-SSGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRNSM 797
            ++ +S FR  S+K+S   K +G+FA+ V+DKPV AV S  +RYR+A+GL IDAFWK+NS+
Sbjct: 91   VSFASAFRLVSTKSSGFRKVDGSFARKVVDKPVKAVSSTFARYREAIGLHIDAFWKKNSL 150

Query: 798  XXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAINP 977
                          WR+MFGIA +FVGLSEGMAKYGFLALSSAIVAFSGLY+RSRF INP
Sbjct: 151  VVFGAAGVFVCIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFSGLYLRSRFTINP 210

Query: 978  DKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPI 1157
            DKVYR+ M K+NT+A ILE+MGAPLSG+ LRAYVMSGGG+T K FKPT+RSKRCFL+FP+
Sbjct: 211  DKVYRMTMRKINTAAEILEVMGAPLSGSDLRAYVMSGGGITFKKFKPTIRSKRCFLLFPV 270

Query: 1158 RGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISELRD 1337
            +GSE+KGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDE EY++GGGLIS LRD
Sbjct: 271  QGSEQKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYRIGGGLISVLRD 330

Query: 1338 PVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLESK 1472
            PVVKAMAATK                      RKH+EEIEKLE +
Sbjct: 331  PVVKAMAATKEFDNLDRIEEEEDAERELQEAERKHREEIEKLEKE 375


>ref|XP_006371317.1| hypothetical protein POPTR_0019s09020g [Populus trichocarpa]
            gi|550317070|gb|ERP49114.1| hypothetical protein
            POPTR_0019s09020g [Populus trichocarpa]
          Length = 378

 Score =  402 bits (1032), Expect = e-109
 Identities = 238/405 (58%), Positives = 271/405 (66%), Gaps = 13/405 (3%)
 Frame = +3

Query: 291  RVIQGLLKFHYQHPHRL----LSDGGSVSAIQNSSAKTYSTS----TFLSSR---FQSKP 437
            R+IQGLLK HY H        L+     S I  S  + YS +    T L+S+   F SKP
Sbjct: 7    RIIQGLLKLHYNHVSTSKPSPLTTPSLASRISTSFTRPYSNASSNFTPLNSQIPCFTSKP 66

Query: 438  PTSSNWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKL 617
             TSSN GLSQ LS +K +                             S+FS  G  F   
Sbjct: 67   -TSSNLGLSQFLSCTKPN-----------------------------SSFSKNGSFF--- 93

Query: 618  NLNSSGFRYFSSKTSNI--SKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRN 791
                 G R FS K S+    + +GNFAK +++KP +AV SA SRYR+A+GLQIDAF KRN
Sbjct: 94   ----YGVRQFSFKGSSDLGKRVDGNFAKKLLEKPATAVTSAFSRYREALGLQIDAFLKRN 149

Query: 792  SMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAI 971
            S+              WR+MFGIA++FV LSEGMAKYGFLALSSAIVAFSGLYIRSR  I
Sbjct: 150  SLFLIGAGGVIICALLWRIMFGIANTFVSLSEGMAKYGFLALSSAIVAFSGLYIRSRITI 209

Query: 972  NPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIF 1151
            NPDKVYR+AMTKLNTSA ILE+MGAPL+GT LRAYVMSGGGL LKNFKPT+RSKRCFLIF
Sbjct: 210  NPDKVYRMAMTKLNTSAGILEVMGAPLTGTVLRAYVMSGGGLVLKNFKPTVRSKRCFLIF 269

Query: 1152 PIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISEL 1331
            PI+GSERKGLVSVEVKKKKGQYDM+LLA+DIPMA+GPDQR+FLIGDE EYKVGGGLISEL
Sbjct: 270  PIQGSERKGLVSVEVKKKKGQYDMRLLAVDIPMASGPDQRLFLIGDEEEYKVGGGLISEL 329

Query: 1332 RDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLE 1466
            RDPVVKAMAA+K                      RKH+EEIEKLE
Sbjct: 330  RDPVVKAMAASKEFDDLDQIEEEEDAEKELQEAERKHREEIEKLE 374


>ref|XP_002270952.1| PREDICTED: uncharacterized protein LOC100265611 [Vitis vinifera]
          Length = 382

 Score =  395 bits (1015), Expect = e-107
 Identities = 225/354 (63%), Positives = 250/354 (70%), Gaps = 6/354 (1%)
 Frame = +3

Query: 423  FQSKPPTSSNWGLSQLLSPSKTSNLPP--SQINYYPSRIFTNSSHLSPKTHLKPSAFSPA 596
            F  KP  S   G S  LSPS     P   S    + SR+   S  +  K   KP    P 
Sbjct: 28   FHDKPSISLPAGGS--LSPSSAIRGPDPLSTTLCFSSRV--ESFQIQSKRLSKPYPIPPI 83

Query: 597  -GLGFG-KLNLNSSGFRYFSSKTSNISKA--NGNFAKSVIDKPVSAVRSAVSRYRDAVGL 764
               GFG +L  NSSG RYFSS   N+ KA  N NF K+ +D P+ ++RSA  RYR+AVGL
Sbjct: 84   FSSGFGARLYANSSGLRYFSSGGWNLGKAQTNANFPKAFLDLPLRSLRSAFYRYREAVGL 143

Query: 765  QIDAFWKRNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSG 944
            QI+AFWKRN +              WR MFGIA +FVGLSEGMAKYGFLALS++IVAFSG
Sbjct: 144  QIEAFWKRNYVFLLGAGGVVLCAVLWRAMFGIATTFVGLSEGMAKYGFLALSASIVAFSG 203

Query: 945  LYIRSRFAINPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTL 1124
            LYIRSR  INPDKVYRIAM KLNTSA ILE+MGAPL+GT LRAYVMSGGGL+LK FKPTL
Sbjct: 204  LYIRSRLTINPDKVYRIAMRKLNTSAGILEVMGAPLTGTDLRAYVMSGGGLSLKKFKPTL 263

Query: 1125 RSKRCFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYK 1304
            RSKRCFLIFPIRGSER+GLVS+EVKKKKG+YDMKLLA+DIPMATGPDQR+FLIGDE EYK
Sbjct: 264  RSKRCFLIFPIRGSERRGLVSIEVKKKKGEYDMKLLAVDIPMATGPDQRLFLIGDEEEYK 323

Query: 1305 VGGGLISELRDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLE 1466
            VGGGLISELRDPVVKAMAATK                      RKH+EEIEKLE
Sbjct: 324  VGGGLISELRDPVVKAMAATKEFEELDQIEEEEDAERELQEAERKHREEIEKLE 377


>ref|NP_191202.1| uncharacterized protein [Arabidopsis thaliana]
            gi|7594521|emb|CAB88046.1| putative protein [Arabidopsis
            thaliana] gi|63003794|gb|AAY25426.1| At3g56430
            [Arabidopsis thaliana] gi|114213521|gb|ABI54343.1|
            At3g56430 [Arabidopsis thaliana]
            gi|332646000|gb|AEE79521.1| uncharacterized protein
            AT3G56430 [Arabidopsis thaliana]
          Length = 434

 Score =  395 bits (1014), Expect = e-107
 Identities = 223/412 (54%), Positives = 276/412 (66%), Gaps = 11/412 (2%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGS-VSAIQNSSAKTYSTSTFL-SSRFQ--SKP 437
            M KPS  + +  L++ HY   +   + G S +SA+ + +  +   S     S F   S  
Sbjct: 1    MVKPSDFKAVHKLIQLHYSRVNLFTTTGWSKLSALSSPALPSNGVSQLQPKSGFHTFSSR 60

Query: 438  PTSSNWGLSQLLSPSKTSNLPP-SQINYYPSRIFTNSSHLS---PKTHLKPSAFSPAGLG 605
            PTS N+GLSQ+L  +  S L P +  + + SR  + +  LS   P   L P      G+ 
Sbjct: 61   PTSKNFGLSQILPSNGVSQLQPKTSFHSFLSRPTSKNVGLSQILPSPKLVPG-LQNCGVA 119

Query: 606  FGKLNLNS---SGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDA 776
              K  +N    S FR FSS  S   K +GNFA+ V+DKP+ AV S  +RYR A+GL +DA
Sbjct: 120  LVKPRVNMNFVSAFRLFSS--SGFRKVDGNFARKVVDKPIKAVSSTFARYRMALGLHVDA 177

Query: 777  FWKRNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIR 956
            FWK+N++              WRVMFGIA +FVGLSEGMAKYGFLALSSAIVAF+GLY+R
Sbjct: 178  FWKKNNLLVFGAGAVFVCIFLWRVMFGIASTFVGLSEGMAKYGFLALSSAIVAFAGLYLR 237

Query: 957  SRFAINPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKR 1136
            +RF INPDKVYRI M KLNT+A +LE+MGAPL+G+ LRAYVMSGGG+T K FKPT+R+KR
Sbjct: 238  ARFTINPDKVYRITMRKLNTAADVLEVMGAPLAGSDLRAYVMSGGGITFKKFKPTIRNKR 297

Query: 1137 CFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGG 1316
            CFL+FP++GSERKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDE EY+VGGG
Sbjct: 298  CFLLFPVQGSERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYRVGGG 357

Query: 1317 LISELRDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLESK 1472
            LISELRDPVVKAMAATK                      RK +EEIE  E++
Sbjct: 358  LISELRDPVVKAMAATKEFDNLDRIEEEEDAERELQEAERKEREEIELQEAE 409


>ref|XP_002878085.1| hypothetical protein ARALYDRAFT_907085 [Arabidopsis lyrata subsp.
            lyrata] gi|297323923|gb|EFH54344.1| hypothetical protein
            ARALYDRAFT_907085 [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  395 bits (1014), Expect = e-107
 Identities = 221/412 (53%), Positives = 278/412 (67%), Gaps = 11/412 (2%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGS-VSAIQNSSAKTYSTSTFLS-SRFQS--KP 437
            M KPS  + +  L++ HY   + +   G S +SA  + +  +   S   + S F S    
Sbjct: 1    MVKPSDFKAVHKLIQLHYSRVNLVSKTGRSKLSAFSSPTLPSNGVSQLQAKSGFHSFSSR 60

Query: 438  PTSSNWGLSQLLSPSKTSNLPP-SQINYYPSRIFTNSSHLS---PKTHLKPSAFSPAGLG 605
            PT+ N+GLSQ+L  +  S L P +  + + SR  + +  LS   P +     +    G+ 
Sbjct: 61   PTAKNFGLSQILPSNGVSQLQPKTSFHSFLSRPTSKNLGLSQILPSSPKLVPSLQNCGVA 120

Query: 606  FGKLNLN---SSGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDA 776
              K  +N   +S FR FSS  S   K +GNFA+ V+DKP+ AV S  +RYR A+GL IDA
Sbjct: 121  LVKPRVNVNFASAFRLFSS--SGFRKIDGNFARKVVDKPIQAVSSTFARYRMALGLHIDA 178

Query: 777  FWKRNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIR 956
            FWK+N++              WR+MFGIA +FVGLSEGMAKYGFLALSSAIVAF+GLY+R
Sbjct: 179  FWKKNNLLVFGAGAVFVCIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFAGLYLR 238

Query: 957  SRFAINPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKR 1136
            +RF INPDKVYRI M KLNT+A +LE+MGAPL+G+ LRAYVMSGGG+T K FKPT+R+KR
Sbjct: 239  ARFTINPDKVYRITMRKLNTAADVLEVMGAPLAGSDLRAYVMSGGGITFKKFKPTIRNKR 298

Query: 1137 CFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGG 1316
            CFL+FP++GSERKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDEVEY+VGGG
Sbjct: 299  CFLLFPVQGSERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEVEYRVGGG 358

Query: 1317 LISELRDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLESK 1472
            LISELRDPVVKAMAATK                      RK +EEIE  E++
Sbjct: 359  LISELRDPVVKAMAATKEFDNLDRIEEEEDAERELQEAERKEREEIELQEAE 410


>ref|XP_007218249.1| hypothetical protein PRUPE_ppa008050mg [Prunus persica]
            gi|462414711|gb|EMJ19448.1| hypothetical protein
            PRUPE_ppa008050mg [Prunus persica]
          Length = 347

 Score =  390 bits (1003), Expect = e-106
 Identities = 226/399 (56%), Positives = 258/399 (64%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTFLSSRFQSKPPTSS 449
            M+KPS+ R+IQGLL+FH+    +        S+   SS +TY ++     +F S P    
Sbjct: 1    MSKPSS-RIIQGLLRFHHNQIAK-------PSSPLTSSRRTYHSNNGPLPQFPSHPKPVV 52

Query: 450  NWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLNLNS 629
              G S                                         S  GLG G      
Sbjct: 53   GGGGSS----------------------------------------SGPGLGLGL----- 67

Query: 630  SGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRNSMXXXX 809
             G R+FS K  N SK N   AK V DKP+SA  SA SRY++A+GLQI+AFWKRN++    
Sbjct: 68   -GLRFFSFKPPNFSKVN---AKKVFDKPLSAATSAFSRYQEAIGLQIEAFWKRNNLVLLG 123

Query: 810  XXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAINPDKVY 989
                      WRVMFGIA +FVGLSEGMAKYGFLALSSAIVAF+GL+IRSRF INPDKVY
Sbjct: 124  VGALVVCALLWRVMFGIASTFVGLSEGMAKYGFLALSSAIVAFAGLHIRSRFTINPDKVY 183

Query: 990  RIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPIRGSE 1169
            RIAM +LNTSA ILE+MGAPLSG+ LRAYVMSGGG+TLK FKPT RSKRCFLIFP+RGSE
Sbjct: 184  RIAMRRLNTSAGILEVMGAPLSGSDLRAYVMSGGGVTLKKFKPTFRSKRCFLIFPVRGSE 243

Query: 1170 RKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISELRDPVVK 1349
            RKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDE EYKVGGGLI+ELRDPVVK
Sbjct: 244  RKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYKVGGGLIAELRDPVVK 303

Query: 1350 AMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLE 1466
            AMAATK                      RKH+EEIEKLE
Sbjct: 304  AMAATKEFDSLDQIEEEEDAERELQEAERKHREEIEKLE 342


>ref|XP_006435962.1| hypothetical protein CICLE_v10031884mg [Citrus clementina]
            gi|568865534|ref|XP_006486129.1| PREDICTED:
            uncharacterized protein LOC102626917 [Citrus sinensis]
            gi|557538158|gb|ESR49202.1| hypothetical protein
            CICLE_v10031884mg [Citrus clementina]
          Length = 368

 Score =  389 bits (1000), Expect = e-105
 Identities = 227/407 (55%), Positives = 265/407 (65%), Gaps = 8/407 (1%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTFLSSRFQSKPPTSS 449
            M+KPS   +I GLL+ H+ H  R          +  SS    S S+   S F S+   SS
Sbjct: 1    MSKPSKA-IINGLLRLHFDHLLR----------VNPSSKSKPSLSSAFYSHFSSRG--SS 47

Query: 450  NWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLNLNS 629
            N+                           T++SH+   T L PS F+ A LG  ++  NS
Sbjct: 48   NFT--------------------------TSTSHIH-STKL-PSKFTSANLGLAQILSNS 79

Query: 630  S--------GFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWK 785
                     GFR+FS K+    K NGNF K V +KP S V S  SRYR+A+GLQIDAF+K
Sbjct: 80   RKPNVKISPGFRFFSFKSEFGQKLNGNFTKKVFEKPASVVSSTFSRYREAIGLQIDAFFK 139

Query: 786  RNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRF 965
             N +              WR+MFGIA++FVG+SEGMAKYGFLALS+AIVAF+GLYIRSRF
Sbjct: 140  GNYLLLFGAGGVVVCMLLWRIMFGIANTFVGISEGMAKYGFLALSTAIVAFAGLYIRSRF 199

Query: 966  AINPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFL 1145
             INPDKVYR+AM KLNTSA ILE+MGAPLSGTSLRAYVMSGGG+T+KNFKP  R+KRCFL
Sbjct: 200  TINPDKVYRMAMRKLNTSAGILEVMGAPLSGTSLRAYVMSGGGITMKNFKPRFRNKRCFL 259

Query: 1146 IFPIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLIS 1325
            IFPIRGSERKGLVSVEVKKKKGQ+D KLLAIDIPM +GPDQR+FLIGDE EYKVG GLI+
Sbjct: 260  IFPIRGSERKGLVSVEVKKKKGQHDTKLLAIDIPMKSGPDQRLFLIGDEEEYKVGDGLIA 319

Query: 1326 ELRDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLE 1466
            ELRDPVVKAMAATK                      RKH+EEI+KLE
Sbjct: 320  ELRDPVVKAMAATKEFDDLDRIEDEEDAERELQEAERKHREEIKKLE 366


>ref|XP_004141508.1| PREDICTED: uncharacterized protein LOC101215996 [Cucumis sativus]
          Length = 377

 Score =  389 bits (998), Expect = e-105
 Identities = 223/403 (55%), Positives = 267/403 (66%), Gaps = 3/403 (0%)
 Frame = +3

Query: 273  AKPSTGRVIQGLL--KFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTFLSSRFQSKPPTS 446
            +KPS  RV++GLL  +      H  ++   +        A ++S++    S+F SKP  S
Sbjct: 6    SKPSQ-RVLEGLLTLRLRLHFTHHPITHSNANPLRDPFIAHSFSSAPSFQSKFPSKP-IS 63

Query: 447  SNWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLNLN 626
            SN GLSQ L               Y  ++   +S L  K +   SA              
Sbjct: 64   SNVGLSQFL---------------YSPKLTAGNSSLVTKLNAHHSA-------------- 94

Query: 627  SSGFRYFSSKTSNIS-KANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRNSMXX 803
             S FR+FS K      + NGNFAK VIDKP +AV SA SRYR+A+GLQI+AF+KRN +  
Sbjct: 95   -SRFRFFSVKIPRFGGQINGNFAKKVIDKPAAAVSSAFSRYREAIGLQIEAFFKRNYLVL 153

Query: 804  XXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAINPDK 983
                        W++MFGIA++FVGLSEGMAKYGFLALSSAIVAF+GLY+RSRF +NPD+
Sbjct: 154  LGFAAALICALLWKIMFGIANTFVGLSEGMAKYGFLALSSAIVAFTGLYMRSRFTVNPDR 213

Query: 984  VYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPIRG 1163
            VYR+AM KLNTSA ILE+MGAPL+G+ LRAYVMSGGG TLKNF P  RSKRCFLIFPIRG
Sbjct: 214  VYRMAMRKLNTSAGILEVMGAPLTGSDLRAYVMSGGGFTLKNFAPNRRSKRCFLIFPIRG 273

Query: 1164 SERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISELRDPV 1343
            SERKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIG+E EYK+GGGLISELRDPV
Sbjct: 274  SERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGNEEEYKIGGGLISELRDPV 333

Query: 1344 VKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLESK 1472
            VKAMAA K                      RK++EEIEKLE +
Sbjct: 334  VKAMAAVKEFDDLDRIEEKEDAERELQEAERKNREEIEKLEKE 376


>ref|XP_002520894.1| conserved hypothetical protein [Ricinus communis]
            gi|223540025|gb|EEF41603.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 364

 Score =  389 bits (998), Expect = e-105
 Identities = 228/401 (56%), Positives = 271/401 (67%), Gaps = 2/401 (0%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLL-KFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTFLSSRFQSKPPTS 446
            M+KPS  R+I GLL K H+ +   +LS+   ++   ++   T ++    +  F SKP TS
Sbjct: 1    MSKPSQ-RIINGLLLKLHHSN---ILSNSKPITKPYSNGFSTLNSRIPFNPSFTSKP-TS 55

Query: 447  SNWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLNLN 626
            SN GLSQ LS +           YY S                 S+ +     +G     
Sbjct: 56   SNVGLSQFLSRANPY--------YYSS-----------------SSLAKQSCWYGVK--- 87

Query: 627  SSGFRYFSSKTSNISKA-NGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRNSMXX 803
                R+FS KTSN+ K  NG+FA+ V++KP +      SRYR+A+GLQIDAF KRN +  
Sbjct: 88   ----RHFSLKTSNLGKTVNGDFARKVLEKPATTF----SRYREAIGLQIDAFCKRNVLLL 139

Query: 804  XXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAINPDK 983
                        WR+MFGIA++FVGLSEGMAKYGFLALSSAIVAF+GLYIRSR  +NPD+
Sbjct: 140  VGAGGVIVCALLWRIMFGIANTFVGLSEGMAKYGFLALSSAIVAFAGLYIRSRITVNPDR 199

Query: 984  VYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPIRG 1163
            VYRIAM KLNTSAAILE+MGAPL+GT LRAYVMSGGG+TLKNFKP LRSKRCFLIFPIRG
Sbjct: 200  VYRIAMRKLNTSAAILEVMGAPLTGTELRAYVMSGGGVTLKNFKPRLRSKRCFLIFPIRG 259

Query: 1164 SERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISELRDPV 1343
            SERKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDE EYKVGGGLI+ELRDPV
Sbjct: 260  SERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEDEYKVGGGLIAELRDPV 319

Query: 1344 VKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLE 1466
            VKAMAA+K                      RKH+EE+EKLE
Sbjct: 320  VKAMAASKEFDDLDDIEEAEDAERELEEAERKHREEMEKLE 360


>ref|XP_006402987.1| hypothetical protein EUTSA_v10006036mg [Eutrema salsugineum]
            gi|557104086|gb|ESQ44440.1| hypothetical protein
            EUTSA_v10006036mg [Eutrema salsugineum]
          Length = 378

 Score =  388 bits (996), Expect = e-105
 Identities = 218/407 (53%), Positives = 265/407 (65%), Gaps = 6/407 (1%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTFL-SSRFQS--KPP 440
            M KPS  + +  L++ HY   + +      +    + +  +   S F   S F S    P
Sbjct: 1    MVKPSDFKGVYRLIQLHYSRANPVTFSRSKLFTSSSPAFPSNGVSQFQPKSSFHSISSRP 60

Query: 441  TSSNWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLN 620
            TS++ GLSQ+LS  K   +P  QI                            GL   K  
Sbjct: 61   TSTSLGLSQILSRPKL--VPNLQI---------------------------CGLAMAKPR 91

Query: 621  LNS---SGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRN 791
            +N+   S FR FSS  S   K +GNFA+ V+DKP+ AV S   RYR+A+GL +DAFWK+N
Sbjct: 92   VNTNFASAFRLFSS--SGFRKVDGNFARKVVDKPIKAVSSTFGRYREALGLHVDAFWKKN 149

Query: 792  SMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAI 971
            ++              WR+MFGIA +FVGLSEGMAKYGFLALSSAIVAF+GLY+R+RF I
Sbjct: 150  NLVVFGAVGVFVCIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFAGLYLRARFTI 209

Query: 972  NPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIF 1151
            NPDKVYRIAM KLNT+A ILE+MGAPL+G+ LRAYVMSGGG+TLK FKPT+RSKRCFL+F
Sbjct: 210  NPDKVYRIAMRKLNTAADILEVMGAPLAGSDLRAYVMSGGGITLKKFKPTIRSKRCFLLF 269

Query: 1152 PIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISEL 1331
            P++G+ERKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDE EY+VGGGLISEL
Sbjct: 270  PVQGAERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYRVGGGLISEL 329

Query: 1332 RDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLESK 1472
            RDPVVKAMAA K                      RK +EEIEKLE +
Sbjct: 330  RDPVVKAMAAAKEFDNLDRIEEEEDAERELQEAERKQREEIEKLEKE 376


>ref|XP_006291135.1| hypothetical protein CARUB_v10017250mg [Capsella rubella]
            gi|482559842|gb|EOA24033.1| hypothetical protein
            CARUB_v10017250mg [Capsella rubella]
          Length = 444

 Score =  383 bits (984), Expect = e-103
 Identities = 220/412 (53%), Positives = 275/412 (66%), Gaps = 15/412 (3%)
 Frame = +3

Query: 270  MAKPSTGRVIQGLLKFHYQHPHRLLSDGGSVSAIQ------NSSAKTYSTSTFLSSRFQS 431
            M KPS  + +  L++ H    + + +    +SA+       N  ++    S F S  F S
Sbjct: 1    MVKPSDFKAVHRLIQLHCSRVNVVATGRSKLSALSFPALPSNGVSQLQPKSGFHS--FSS 58

Query: 432  KPPTSSNWGLSQLLSPSKTSNLPP-SQINYYPSRIFTNSSHL-----SPKTHLKPSAFSP 593
            +P T  N+GLSQ+L  +  S L P +  + + SR  + +  L     SPK  L PS    
Sbjct: 59   RP-TLKNFGLSQILPSNGVSQLKPKTSFHSFLSRPTSKNVGLFQILSSPK--LVPS-LQN 114

Query: 594  AGLGFGKLNLN---SSGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGL 764
             G+   K  +N   +S FR FSS  S   K +GNFA+ V+DKP+ AV S  +RYR A+GL
Sbjct: 115  CGVALVKPRVNMNFASAFRLFSS--SGFRKVDGNFARKVVDKPIQAVSSTFARYRMALGL 172

Query: 765  QIDAFWKRNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSG 944
             IDAFWK+N++              WR+MFGIA +FVGLSEGMAKYGFLALSSAIVAF+G
Sbjct: 173  HIDAFWKKNNLLVFGAGAVFVCIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFAG 232

Query: 945  LYIRSRFAINPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTL 1124
            LY+R+RF INPDKVYRI M KLNT+A +LE+MGAPL+G+ LRAYVMSGGG+T K FKP++
Sbjct: 233  LYLRARFTINPDKVYRITMRKLNTAADVLEVMGAPLAGSDLRAYVMSGGGITFKRFKPSI 292

Query: 1125 RSKRCFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYK 1304
            R+KRCFL+FP++GSERKGLVSVEVKKKKGQYDMKLLA+DIPMA+GPDQR+FLIGDE EY+
Sbjct: 293  RNKRCFLLFPVQGSERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYR 352

Query: 1305 VGGGLISELRDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEK 1460
            VGGGLISELRDPVVKAMAATK                      RK +EE E+
Sbjct: 353  VGGGLISELRDPVVKAMAATKEFDNLDRIEEEEDAERELQEAERKQREEEER 404


>ref|XP_007011394.1| Uncharacterized protein TCM_045580 [Theobroma cacao]
            gi|508728307|gb|EOY20204.1| Uncharacterized protein
            TCM_045580 [Theobroma cacao]
          Length = 448

 Score =  380 bits (975), Expect = e-102
 Identities = 222/396 (56%), Positives = 262/396 (66%), Gaps = 1/396 (0%)
 Frame = +3

Query: 282  STGRVIQGLLKFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTFLSSRFQSKPPTSSNWGL 461
            S+ R+I   LK H  + HR+     +  AI +SS+   +    LSS      PTSSN GL
Sbjct: 11   SSSRLIYAFLKLH-SNRHRV-----NPFAIPSSSSSCSNGFHQLSSN-----PTSSNAGL 59

Query: 462  SQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLNLNSSGFR 641
            SQ L  S   + P    N Y +R               P+ FSP+GL            R
Sbjct: 60   SQFLFRSAHQSTPLR--NRYIAR---------------PNPFSPSGL------------R 90

Query: 642  YFSSKTSNIS-KANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRNSMXXXXXXX 818
            +FS K SN   K  G+F K+    P +A RS +SRYR+A+GL ++AF+K+N +       
Sbjct: 91   FFSFKPSNFGQKFGGSFTKNAFQNPANAFRSTLSRYREAIGLHLEAFFKKNYLILFGAGG 150

Query: 819  XXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAINPDKVYRIA 998
                   WR+MFGIA+SF+GLSEGMAKYGFLALS+AIV+F+GLY RSRF INPDKVYR+A
Sbjct: 151  VLLCVLLWRIMFGIANSFIGLSEGMAKYGFLALSTAIVSFAGLYFRSRFTINPDKVYRMA 210

Query: 999  MTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPIRGSERKG 1178
            M +LNT+A ILE+MGAPL+GT LRAYVMSGGGLT+KNFK  LRSKRCFLIFPIRGSERKG
Sbjct: 211  MRRLNTAAGILEVMGAPLTGTELRAYVMSGGGLTVKNFKLKLRSKRCFLIFPIRGSERKG 270

Query: 1179 LVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISELRDPVVKAMA 1358
            LVSVEVKK KGQY MKLLA+DIPMA+GPDQR+FLIGDE EYKVGGGLISELRDPVVKAMA
Sbjct: 271  LVSVEVKKNKGQYVMKLLAVDIPMASGPDQRLFLIGDEEEYKVGGGLISELRDPVVKAMA 330

Query: 1359 ATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLE 1466
            ATK                      RKH+EEIEKLE
Sbjct: 331  ATKEFDDLDQIEEEEDAERELQEAERKHREEIEKLE 366


>ref|XP_004167240.1| PREDICTED: uncharacterized protein LOC101225862 [Cucumis sativus]
          Length = 378

 Score =  377 bits (967), Expect = e-101
 Identities = 218/404 (53%), Positives = 266/404 (65%), Gaps = 4/404 (0%)
 Frame = +3

Query: 273  AKPSTGRVIQGLL--KFHYQHPHRLLSDGGSVSAIQNSSAKTYSTSTFLSSRFQSKPPTS 446
            +KPS  RV++GLL  +      H  ++   +        A ++S++    S+F SKP  S
Sbjct: 6    SKPSQ-RVLEGLLTLRLRLHFTHHPITHSNANPLRDPFIAHSFSSAPSFQSKFPSKP-IS 63

Query: 447  SNWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLSPKTHLKPSAFSPAGLGFGKLNLN 626
            SN GLSQ L               Y  ++   +S L  K +   SA              
Sbjct: 64   SNVGLSQFL---------------YSPKLTAGNSSLVTKLNAHHSA-------------- 94

Query: 627  SSGFRYFSSKTSNIS-KANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQIDAFWKRNSMXX 803
             S FR+FS K      + NGNFAK VIDKP +AV SA SRYR+A+GLQI+AF+KRN +  
Sbjct: 95   -SRFRFFSVKIPRFGGQINGNFAKKVIDKPAAAVSSAFSRYREAIGLQIEAFFKRNYLVL 153

Query: 804  XXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAINPDK 983
                        W++MFGIA++FVGLSEGMAKYGFLALSSAIVAF+GLY+RSRF +NPD+
Sbjct: 154  LGFAAALICALLWKIMFGIANTFVGLSEGMAKYGFLALSSAIVAFTGLYMRSRFTVNPDR 213

Query: 984  VYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPIRG 1163
            VYR+AM KLNTSA ILE+MGAPL+G+ LRAYVMSGGG TLKNF P  RSKRCFLIFPIRG
Sbjct: 214  VYRMAMRKLNTSAGILEVMGAPLTGSDLRAYVMSGGGFTLKNFAPNRRSKRCFLIFPIRG 273

Query: 1164 SERKGLVSVEVKKKKGQ-YDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISELRDP 1340
            SERKGLVSVEVK+++ + YDMKLLA+DIPMA+GPDQR+FLIG+E EYK+GGGLISELRDP
Sbjct: 274  SERKGLVSVEVKRRRARFYDMKLLAVDIPMASGPDQRLFLIGNEEEYKIGGGLISELRDP 333

Query: 1341 VVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLESK 1472
            VVKAMAA K                      RK++EEIEKLE +
Sbjct: 334  VVKAMAAVKEFDDLDRIEEKEDAERELQEAERKNREEIEKLEKE 377


>ref|XP_003526791.1| PREDICTED: uncharacterized protein LOC100819345 [Glycine max]
          Length = 339

 Score =  374 bits (959), Expect = e-100
 Identities = 200/313 (63%), Positives = 235/313 (75%), Gaps = 1/313 (0%)
 Frame = +3

Query: 540  NSSHLSPKTHLKPSAFSPAGLGFG-KLNLNSSGFRYFSSKTSNISKANGNFAKSVIDKPV 716
            +SS  S  +    SAF P+   F  +  L +S FR+FSS   N  K    FA+ V DKP 
Sbjct: 31   DSSFASKPSVSNASAFRPSSPVFHPQQRLGNSTFRFFSS---NFDKG---FAQKVFDKPA 84

Query: 717  SAVRSAVSRYRDAVGLQIDAFWKRNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMA 896
            +AV SA SRYR+A+GLQI+AF+KRN++              WR++FGIA+ FVGLSEGMA
Sbjct: 85   AAVTSAFSRYREAIGLQIEAFFKRNTLFLWGAGGVVLCAVLWRILFGIANLFVGLSEGMA 144

Query: 897  KYGFLALSSAIVAFSGLYIRSRFAINPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAY 1076
            KYGFLALSSAIVAF+GLYIR+R  INPDKVYR+AMTKLNTSA ILE+MGAPLSGT LRAY
Sbjct: 145  KYGFLALSSAIVAFTGLYIRTRLTINPDKVYRMAMTKLNTSAGILEVMGAPLSGTDLRAY 204

Query: 1077 VMSGGGLTLKNFKPTLRSKRCFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMAT 1256
            +MSGGGLT+K FKP++RS+RCFLIFPIRGSE+KGLVSVEVKKKKGQYDMKLLA+D+PMA+
Sbjct: 205  IMSGGGLTVKKFKPSVRSRRCFLIFPIRGSEKKGLVSVEVKKKKGQYDMKLLAVDVPMAS 264

Query: 1257 GPDQRIFLIGDEVEYKVGGGLISELRDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXR 1436
            GPDQR+FLIGDE EY+VGGGLIS+LRDPVV+AMAATK                      R
Sbjct: 265  GPDQRLFLIGDEEEYRVGGGLISDLRDPVVRAMAATKEFDDLDEIEEEEDAERERQETER 324

Query: 1437 KHQEEIEKLESKS 1475
            K +EEIEKLE  S
Sbjct: 325  KEREEIEKLEKSS 337


>ref|XP_006358158.1| PREDICTED: uncharacterized protein LOC102588510 [Solanum tuberosum]
          Length = 370

 Score =  372 bits (956), Expect = e-100
 Identities = 210/375 (56%), Positives = 248/375 (66%), Gaps = 9/375 (2%)
 Frame = +3

Query: 369  IQNSSAKTYSTSTF----LSSRFQSKPPTSSNWGLSQLLSPSKTSNLPPSQINYYPSRIF 536
            +  SS+  +  S F    LS   Q   P  ++ G+S  LS + +  +    +N + S   
Sbjct: 1    MSKSSSSKFVLSNFYHYILSKPHQFSSPNVTSSGISHFLS-NHSKIIEKPAVNQWVSN-- 57

Query: 537  TNSSHLS----PKTHLKPSA-FSPAGLGFGKLNLNSSGFRYFSSKTSNISKANGNFAKSV 701
            T  +H S    P+    PS    P G       L   GFRYFS K+S +        K+V
Sbjct: 58   TQRTHFSSSPFPRVLQNPSKKLDPDGSFLWNRKL--LGFRYFSLKSSGLG-----LGKNV 110

Query: 702  IDKPVSAVRSAVSRYRDAVGLQIDAFWKRNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGL 881
            +  PV A +    RY+ AVGLQ++AFWKRNSM              WR++FGIA +F+GL
Sbjct: 111  LKNPVEAAKKTALRYKGAVGLQMEAFWKRNSMVLFGAAGIMVCILLWRILFGIATTFIGL 170

Query: 882  SEGMAKYGFLALSSAIVAFSGLYIRSRFAINPDKVYRIAMTKLNTSAAILEIMGAPLSGT 1061
            SEGMAKYGFLALSSAIVAF+GLY+RSRF INPDKVYR+AM +LNT A ILE+MGAPLSGT
Sbjct: 171  SEGMAKYGFLALSSAIVAFAGLYLRSRFTINPDKVYRMAMRRLNTEAGILEVMGAPLSGT 230

Query: 1062 SLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAID 1241
             LRAYVMSGGG+TLKNFKP  R KRCFLIFPIRGSERKGLVSVEVK K+GQYDMKLLA+D
Sbjct: 231  DLRAYVMSGGGITLKNFKPRFRGKRCFLIFPIRGSERKGLVSVEVKNKQGQYDMKLLAVD 290

Query: 1242 IPMATGPDQRIFLIGDEVEYKVGGGLISELRDPVVKAMAATKXXXXXXXXXXXXXXXXXX 1421
            IPMA+GPDQR+FLIGDE EY++GGGLI+ELRDPVVKAMAATK                  
Sbjct: 291  IPMASGPDQRLFLIGDEEEYRIGGGLIAELRDPVVKAMAATKEFEDRDDLEDEEDAEREL 350

Query: 1422 XXXXRKHQEEIEKLE 1466
                RKHQEEIEKLE
Sbjct: 351  QEAERKHQEEIEKLE 365


>ref|XP_004235216.1| PREDICTED: uncharacterized protein LOC101251760 isoform 1 [Solanum
            lycopersicum]
          Length = 367

 Score =  370 bits (949), Expect = 1e-99
 Identities = 213/368 (57%), Positives = 245/368 (66%), Gaps = 11/368 (2%)
 Frame = +3

Query: 396  STSTFLSSRFQ----SKPPTSS--NWGLSQLLSPSKTSNLPPSQINYYPSRIFTNSSHLS 557
            S+S F+ S F     SKP   S  N G+S  LS        P+ +N + S   T  +H S
Sbjct: 5    SSSKFVLSNFYHYILSKPHQFSTPNVGISHFLSNHSKIIQKPA-VNQWVSN--TQRTHFS 61

Query: 558  ----PKTHLKPSA-FSPAGLGFGKLNLNSSGFRYFSSKTSNISKANGNFAKSVIDKPVSA 722
                P+    PS    P G       L   GFRYFS K+S +        K+V+  PV A
Sbjct: 62   SSPFPRVLQNPSKKLDPDGSFLWNRKL--LGFRYFSLKSSGLG-----LGKNVLKNPVEA 114

Query: 723  VRSAVSRYRDAVGLQIDAFWKRNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKY 902
             +    RY+ AVGLQ++AFWKRNSM              WR++FGIA +F+GLSEGMAKY
Sbjct: 115  AKKTTLRYKGAVGLQMEAFWKRNSMVLFGAAGIMVCILLWRILFGIATTFIGLSEGMAKY 174

Query: 903  GFLALSSAIVAFSGLYIRSRFAINPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVM 1082
            GFLALSSAIVAF+GLY+RSRF INPDKVYR+AM +LNT A ILE+MGAPLSGT LRAYVM
Sbjct: 175  GFLALSSAIVAFAGLYLRSRFTINPDKVYRMAMRRLNTEAGILEVMGAPLSGTDLRAYVM 234

Query: 1083 SGGGLTLKNFKPTLRSKRCFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGP 1262
            SGGG+TLKNFKP  R KRCFLIFPIRGSERKGLVSVEVK K+GQYDMKLLA+DIPMA GP
Sbjct: 235  SGGGVTLKNFKPRFRGKRCFLIFPIRGSERKGLVSVEVKNKQGQYDMKLLAVDIPMAAGP 294

Query: 1263 DQRIFLIGDEVEYKVGGGLISELRDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKH 1442
            DQR++LIGDE EY+VGGGLI+ELRDPVVKAMAATK                      RKH
Sbjct: 295  DQRLYLIGDEEEYRVGGGLIAELRDPVVKAMAATKEFEERDDLEDEEDAERELQEAERKH 354

Query: 1443 QEEIEKLE 1466
            QEEIEKLE
Sbjct: 355  QEEIEKLE 362


>ref|XP_007136404.1| hypothetical protein PHAVU_009G042200g [Phaseolus vulgaris]
            gi|561009491|gb|ESW08398.1| hypothetical protein
            PHAVU_009G042200g [Phaseolus vulgaris]
          Length = 332

 Score =  367 bits (942), Expect = 9e-99
 Identities = 192/293 (65%), Positives = 223/293 (76%)
 Frame = +3

Query: 588  SPAGLGFGKLNLNSSGFRYFSSKTSNISKANGNFAKSVIDKPVSAVRSAVSRYRDAVGLQ 767
            SP+ +   +  L +S FR+FSS   N  K    F + V DKP  AV+SA SRYR+A+GLQ
Sbjct: 41   SPSPIFHPQQRLGNSTFRFFSS---NFDKG---FVQKVFDKPAVAVKSAFSRYREAIGLQ 94

Query: 768  IDAFWKRNSMXXXXXXXXXXXXXXWRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGL 947
            I+AF KRN +              WR++FG+A+ FVG+SEG+AKYGFLALSSAIVAF+GL
Sbjct: 95   IEAFCKRNYLFLLGAAGVILCGVLWRILFGVANLFVGISEGLAKYGFLALSSAIVAFTGL 154

Query: 948  YIRSRFAINPDKVYRIAMTKLNTSAAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLR 1127
            YIRSRF INPDKVYR+AMT+LNTSA ILE+MGAPLSGT LRAY+MSGGGLTLK FKP +R
Sbjct: 155  YIRSRFTINPDKVYRMAMTRLNTSAGILEVMGAPLSGTELRAYIMSGGGLTLKKFKPGIR 214

Query: 1128 SKRCFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKV 1307
            SKRCFLIFPIRGSE+KGLV+VEVKKK GQYDMKLLA+D+PMA+GPDQR++LIGDE EYKV
Sbjct: 215  SKRCFLIFPIRGSEKKGLVNVEVKKKNGQYDMKLLAVDVPMASGPDQRLYLIGDEQEYKV 274

Query: 1308 GGGLISELRDPVVKAMAATKXXXXXXXXXXXXXXXXXXXXXXRKHQEEIEKLE 1466
            GGGLISELRDPVVKAMAATK                      RK +EEIEKLE
Sbjct: 275  GGGLISELRDPVVKAMAATKEFDALDEIEEEEDAERERLETERKQREEIEKLE 327


>gb|EXC21106.1| hypothetical protein L484_017118 [Morus notabilis]
          Length = 378

 Score =  365 bits (937), Expect = 3e-98
 Identities = 201/329 (61%), Positives = 226/329 (68%), Gaps = 25/329 (7%)
 Frame = +3

Query: 555  SPKTHLKPSAFSPAGLGFGKLNLNSSGFRYFSSKTSNISKANGNFAKSVIDKPVSAVR-- 728
            SP     P AFS +G G G         R+FS + S + K N NFAK + +KP SAV   
Sbjct: 54   SPTHRTNPHAFS-SGAGLG--------LRFFSFRASELGKGNANFAKKIFEKPASAVAAT 104

Query: 729  -------------------SAV----SRYRDAVGLQIDAFWKRNSMXXXXXXXXXXXXXX 839
                               SAV    SRYR+A+GLQI+AF +RN +              
Sbjct: 105  FSRYREALGLQIKIFEKPASAVAATFSRYREALGLQIEAFCRRNYLFLLGAGAVMACALL 164

Query: 840  WRVMFGIAHSFVGLSEGMAKYGFLALSSAIVAFSGLYIRSRFAINPDKVYRIAMTKLNTS 1019
            WR+MFGIA SFVG SEGMAKYGFLALSSAIVAF+GLY+RSRF INPD+VYR AM KLNTS
Sbjct: 165  WRIMFGIASSFVGFSEGMAKYGFLALSSAIVAFAGLYVRSRFTINPDRVYRTAMRKLNTS 224

Query: 1020 AAILEIMGAPLSGTSLRAYVMSGGGLTLKNFKPTLRSKRCFLIFPIRGSERKGLVSVEVK 1199
            A ILE+MGAPLSG+ LRAYV SGGGLT+KNFKP +RSKRCFLIFPIRGSERKGLVSVEVK
Sbjct: 225  AGILEVMGAPLSGSDLRAYVTSGGGLTVKNFKPRIRSKRCFLIFPIRGSERKGLVSVEVK 284

Query: 1200 KKKGQYDMKLLAIDIPMATGPDQRIFLIGDEVEYKVGGGLISELRDPVVKAMAATKXXXX 1379
            KKKGQYDMKLLA+DIPMA+GPDQR+FL+GDE EYKVGGGLISELRDPVV AM+A K    
Sbjct: 285  KKKGQYDMKLLAVDIPMASGPDQRLFLVGDEEEYKVGGGLISELRDPVVSAMSAAKEFDD 344

Query: 1380 XXXXXXXXXXXXXXXXXXRKHQEEIEKLE 1466
                              RKH+EEIEKLE
Sbjct: 345  LDQIEEEEDTERELQEAERKHREEIEKLE 373

