BLASTX nr result

ID: Akebia22_contig00003327 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00003327
         (1642 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006435962.1| hypothetical protein CICLE_v10031884mg [Citr...   409   e-111
ref|XP_007011394.1| Uncharacterized protein TCM_045580 [Theobrom...   406   e-110
ref|XP_006411346.1| hypothetical protein EUTSA_v10016808mg [Eutr...   402   e-109
ref|XP_007218249.1| hypothetical protein PRUPE_ppa008050mg [Prun...   397   e-108
ref|XP_002881741.1| hypothetical protein ARALYDRAFT_903376 [Arab...   397   e-107
ref|XP_002270952.1| PREDICTED: uncharacterized protein LOC100265...   397   e-107
ref|XP_004141508.1| PREDICTED: uncharacterized protein LOC101215...   395   e-107
ref|XP_006402987.1| hypothetical protein EUTSA_v10006036mg [Eutr...   394   e-107
gb|EYU20136.1| hypothetical protein MIMGU_mgv1a008727mg [Mimulus...   392   e-106
ref|XP_006842580.1| hypothetical protein AMTR_s00077p00157770 [A...   392   e-106
gb|EXC21106.1| hypothetical protein L484_017118 [Morus notabilis]     392   e-106
ref|XP_006371317.1| hypothetical protein POPTR_0019s09020g [Popu...   392   e-106
ref|XP_006358158.1| PREDICTED: uncharacterized protein LOC102588...   389   e-105
ref|NP_181612.1| uncharacterized protein [Arabidopsis thaliana] ...   389   e-105
ref|XP_004235216.1| PREDICTED: uncharacterized protein LOC101251...   389   e-105
ref|XP_002520894.1| conserved hypothetical protein [Ricinus comm...   387   e-105
ref|NP_191202.1| uncharacterized protein [Arabidopsis thaliana] ...   386   e-104
ref|XP_002878085.1| hypothetical protein ARALYDRAFT_907085 [Arab...   385   e-104
ref|XP_004167240.1| PREDICTED: uncharacterized protein LOC101225...   384   e-104
ref|XP_006291135.1| hypothetical protein CARUB_v10017250mg [Caps...   384   e-104

>ref|XP_006435962.1| hypothetical protein CICLE_v10031884mg [Citrus clementina]
            gi|568865534|ref|XP_006486129.1| PREDICTED:
            uncharacterized protein LOC102626917 [Citrus sinensis]
            gi|557538158|gb|ESR49202.1| hypothetical protein
            CICLE_v10031884mg [Citrus clementina]
          Length = 368

 Score =  409 bits (1052), Expect = e-111
 Identities = 220/338 (65%), Positives = 254/338 (75%), Gaps = 7/338 (2%)
 Frame = -3

Query: 1193 RFSSNFTQKQHLPTPFVGNSKLNGAFPSG-------FSNSRRLSPQISPKISALSSRGLG 1035
            R SSNFT      T  + ++KL   F S         SNSR+ + +ISP          G
Sbjct: 44   RGSSNFTTS----TSHIHSTKLPSKFTSANLGLAQILSNSRKPNVKISP----------G 89

Query: 1034 FRYISFKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXX 855
            FR+ SFKS    K+NGNFTK +  KP         RYREA+GLQI+AF+K NY       
Sbjct: 90   FRFFSFKSEFGQKLNGNFTKKVFEKPASVVSSTFSRYREAIGLQIDAFFKGNYLLLFGAG 149

Query: 854  XXXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRI 675
                   LWRIMFGIANTFVG+SEGMAKYGFL LS+AIV FAGLY+RSRFTINPD+VYR+
Sbjct: 150  GVVVCMLLWRIMFGIANTFVGISEGMAKYGFLALSTAIVAFAGLYIRSRFTINPDKVYRM 209

Query: 674  AMRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERK 495
            AMRKLNTSAGILE+MGAPLSGT LRAYVMSGGG+ +KNFK +   KRCFLIFP+RGSERK
Sbjct: 210  AMRKLNTSAGILEVMGAPLSGTSLRAYVMSGGGITMKNFKPRFRNKRCFLIFPIRGSERK 269

Query: 494  GLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAM 315
            GLVSVEVKK+KGQ+D KLLA+DIPM +GPDQR FLIGDEEEYKVG GLI+ELRDP+VKAM
Sbjct: 270  GLVSVEVKKKKGQHDTKLLAIDIPMKSGPDQRLFLIGDEEEYKVGDGLIAELRDPVVKAM 329

Query: 314  AAEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLEK 201
            AA KEF+ LD+ ED+EDAEREL++AERK  EEI+KLEK
Sbjct: 330  AATKEFDDLDRIEDEEDAERELQEAERKHREEIKKLEK 367


>ref|XP_007011394.1| Uncharacterized protein TCM_045580 [Theobroma cacao]
            gi|508728307|gb|EOY20204.1| Uncharacterized protein
            TCM_045580 [Theobroma cacao]
          Length = 448

 Score =  406 bits (1043), Expect = e-110
 Identities = 221/348 (63%), Positives = 254/348 (72%), Gaps = 1/348 (0%)
 Frame = -3

Query: 1232 FHEISFKPTSPNFRFSSNFTQKQHLPTPFVGNSKLNGAFPSGFSNSRRLSPQISPKISAL 1053
            FH++S  PTS N   S    +  H  TP + N  +  A P+ FS S              
Sbjct: 45   FHQLSSNPTSSNAGLSQFLFRSAHQSTP-LRNRYI--ARPNPFSPS-------------- 87

Query: 1052 SSRGLGFRYISFKSSDTS-KINGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNY 876
                 G R+ SFK S+   K  G+FTK     P         RYREA+GL +EAF+K+NY
Sbjct: 88   -----GLRFFSFKPSNFGQKFGGSFTKNAFQNPANAFRSTLSRYREAIGLHLEAFFKKNY 142

Query: 875  XXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTIN 696
                          LWRIMFGIAN+F+GLSEGMAKYGFL LS+AIV+FAGLY RSRFTIN
Sbjct: 143  LILFGAGGVLLCVLLWRIMFGIANSFIGLSEGMAKYGFLALSTAIVSFAGLYFRSRFTIN 202

Query: 695  PDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFP 516
            PD+VYR+AMR+LNT+AGILE+MGAPL+GT+LRAYVMSGGGL +KNFKLKL  KRCFLIFP
Sbjct: 203  PDKVYRMAMRRLNTAAGILEVMGAPLTGTELRAYVMSGGGLTVKNFKLKLRSKRCFLIFP 262

Query: 515  VRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELR 336
            +RGSERKGLVSVEVKK KGQY MKLLAVDIPM +GPDQR FLIGDEEEYKVGGGLISELR
Sbjct: 263  IRGSERKGLVSVEVKKNKGQYVMKLLAVDIPMASGPDQRLFLIGDEEEYKVGGGLISELR 322

Query: 335  DPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLEKGSP 192
            DP+VKAMAA KEF+ LDQ E++EDAEREL++AERK  EEIEKLEKG P
Sbjct: 323  DPVVKAMAATKEFDDLDQIEEEEDAERELQEAERKHREEIEKLEKGLP 370


>ref|XP_006411346.1| hypothetical protein EUTSA_v10016808mg [Eutrema salsugineum]
            gi|557112515|gb|ESQ52799.1| hypothetical protein
            EUTSA_v10016808mg [Eutrema salsugineum]
          Length = 377

 Score =  402 bits (1032), Expect = e-109
 Identities = 223/396 (56%), Positives = 268/396 (67%), Gaps = 17/396 (4%)
 Frame = -3

Query: 1331 SVQNSFRQHSSSIFPS--GFSNARNLTS---------EIHSKLCFHEISFKPTSPNFRFS 1185
            SV    R H S + P   G SN   L+S         ++  +  FH +S KPTS N    
Sbjct: 9    SVHGFIRLHYSRVNPVTIGRSNPPPLSSPAIPSNSVPQLQPRFSFHSLSSKPTSTNV--- 65

Query: 1184 SNFTQKQHLPTPFVGNSKLNGAFPSGFSNSRRLSPQISPKISALS------SRGLGFRYI 1023
                                     GFS+     P+++PK+ AL       +    FR +
Sbjct: 66   -------------------------GFSSQVLSCPKLNPKLQALGLPRVNVNYSSAFRLV 100

Query: 1022 SFKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXXXXXX 843
            S KSS   K++GNF + +V++P         RYREA+GL I+AFWK+N            
Sbjct: 101  STKSSGFRKVDGNFARKVVDRPVKAVSSTFARYREAIGLHIDAFWKKNSLILFGAGGVFV 160

Query: 842  XXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRK 663
               LWRIMFGIA+TFVGLSEGMAKYGFL LSSAIV F+GLY+RSRFTINPD+VYR+ MRK
Sbjct: 161  CIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFSGLYLRSRFTINPDKVYRMTMRK 220

Query: 662  LNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVS 483
            +NT+A ILE+MGAPLSG+DLRAYVMSGGG+  K FK  +  KRCFL+FPV+GSERKGLVS
Sbjct: 221  INTAAEILEVMGAPLSGSDLRAYVMSGGGITFKKFKPTIRSKRCFLLFPVQGSERKGLVS 280

Query: 482  VEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEK 303
            VEVKK+KGQYDMKLLAVDIPM +GPDQR FLIGDEEEY+VGGGLISELRDP+VKAMAA K
Sbjct: 281  VEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYRVGGGLISELRDPVVKAMAATK 340

Query: 302  EFEALDQKEDDEDAERELKDAERKDLEEIEKLEKGS 195
            EF+ LD+ E++EDAEREL++AERK  EEIEKLEK S
Sbjct: 341  EFDNLDRIEEEEDAERELEEAERKHREEIEKLEKES 376


>ref|XP_007218249.1| hypothetical protein PRUPE_ppa008050mg [Prunus persica]
            gi|462414711|gb|EMJ19448.1| hypothetical protein
            PRUPE_ppa008050mg [Prunus persica]
          Length = 347

 Score =  397 bits (1020), Expect = e-108
 Identities = 205/281 (72%), Positives = 231/281 (82%)
 Frame = -3

Query: 1043 GLGFRYISFKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXX 864
            GLG R+ SFK  + SK+N    K + +KP         RY+EA+GLQIEAFWKRN     
Sbjct: 66   GLGLRFFSFKPPNFSKVNA---KKVFDKPLSAATSAFSRYQEAIGLQIEAFWKRNNLVLL 122

Query: 863  XXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRV 684
                      LWR+MFGIA+TFVGLSEGMAKYGFL LSSAIV FAGL++RSRFTINPD+V
Sbjct: 123  GVGALVVCALLWRVMFGIASTFVGLSEGMAKYGFLALSSAIVAFAGLHIRSRFTINPDKV 182

Query: 683  YRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGS 504
            YRIAMR+LNTSAGILE+MGAPLSG+DLRAYVMSGGG+ LK FK     KRCFLIFPVRGS
Sbjct: 183  YRIAMRRLNTSAGILEVMGAPLSGSDLRAYVMSGGGVTLKKFKPTFRSKRCFLIFPVRGS 242

Query: 503  ERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIV 324
            ERKGLVSVEVKK+KGQYDMKLLAVDIPM +GPDQR FLIGDEEEYKVGGGLI+ELRDP+V
Sbjct: 243  ERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYKVGGGLIAELRDPVV 302

Query: 323  KAMAAEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLEK 201
            KAMAA KEF++LDQ E++EDAEREL++AERK  EEIEKLEK
Sbjct: 303  KAMAATKEFDSLDQIEEEEDAERELQEAERKHREEIEKLEK 343


>ref|XP_002881741.1| hypothetical protein ARALYDRAFT_903376 [Arabidopsis lyrata subsp.
            lyrata] gi|297327580|gb|EFH58000.1| hypothetical protein
            ARALYDRAFT_903376 [Arabidopsis lyrata subsp. lyrata]
          Length = 377

 Score =  397 bits (1019), Expect = e-107
 Identities = 227/394 (57%), Positives = 273/394 (69%), Gaps = 11/394 (2%)
 Frame = -3

Query: 1349 RTSYSSSVQNSFRQHSSSIFPS--GFSNARNLTS---------EIHSKLCFHEISFKPTS 1203
            R S   ++Q   R H + + P   G SN   L+S         ++  K  FH +S KPTS
Sbjct: 3    RPSDFKAIQGFIRLHYTRVNPVTIGRSNPSALSSPAIPSNGVSQLQPKFSFHSLSSKPTS 62

Query: 1202 PNFRFSSNFTQKQHLPTPFVGNSKLNGAFPSGFSNSRRLSPQISPKISALSSRGLGFRYI 1023
             N   S      Q L +P + N KL  A            P+++  +S  SS    FR +
Sbjct: 63   TNVGLS------QILSSPKL-NPKLQQALGL---------PRVN--VSFASS----FRLV 100

Query: 1022 SFKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXXXXXX 843
            S KSS   KI+GNF + +V+KP         RYREA+GL ++AFWK+N            
Sbjct: 101  SNKSSGFRKIDGNFARKVVDKPVKAVSSTFARYREAIGLHVDAFWKKNSLVVFGAGGVFV 160

Query: 842  XXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRK 663
               LWRIMFGIA+TFVGLSEGMAKYGFL LSSAIV F+GLY+RSRFTINPD+VYR+ MRK
Sbjct: 161  CIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFSGLYLRSRFTINPDKVYRMTMRK 220

Query: 662  LNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVS 483
            +NT+A ILE+MGAPLSG+DLRAYVMSGGG+  K FK  +  KRCFL+FPV+GSERKGLVS
Sbjct: 221  INTAAEILEVMGAPLSGSDLRAYVMSGGGITFKKFKPTIRSKRCFLLFPVQGSERKGLVS 280

Query: 482  VEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEK 303
            VEVKK+KGQYDMKLLAVDIPM +GPDQR FLIGDEEEY+VGGGLIS LRDP+VKAMAA K
Sbjct: 281  VEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYRVGGGLISVLRDPVVKAMAATK 340

Query: 302  EFEALDQKEDDEDAERELKDAERKDLEEIEKLEK 201
            EF+ LD+ E++EDAEREL++AERK  EEIEKLEK
Sbjct: 341  EFDNLDRIEEEEDAERELEEAERKHREEIEKLEK 374


>ref|XP_002270952.1| PREDICTED: uncharacterized protein LOC100265611 [Vitis vinifera]
          Length = 382

 Score =  397 bits (1019), Expect = e-107
 Identities = 213/311 (68%), Positives = 237/311 (76%), Gaps = 9/311 (2%)
 Frame = -3

Query: 1097 SRRLS-PQISPKI------SALSSRGLGFRYISFKSSDTSKI--NGNFTKTLVNKPXXXX 945
            S+RLS P   P I      + L +   G RY S    +  K   N NF K  ++ P    
Sbjct: 71   SKRLSKPYPIPPIFSSGFGARLYANSSGLRYFSSGGWNLGKAQTNANFPKAFLDLPLRSL 130

Query: 944  XXXXXRYREAVGLQIEAFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYG 765
                 RYREAVGLQIEAFWKRNY              LWR MFGIA TFVGLSEGMAKYG
Sbjct: 131  RSAFYRYREAVGLQIEAFWKRNYVFLLGAGGVVLCAVLWRAMFGIATTFVGLSEGMAKYG 190

Query: 764  FLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMS 585
            FL LS++IV F+GLY+RSR TINPD+VYRIAMRKLNTSAGILE+MGAPL+GTDLRAYVMS
Sbjct: 191  FLALSASIVAFSGLYIRSRLTINPDKVYRIAMRKLNTSAGILEVMGAPLTGTDLRAYVMS 250

Query: 584  GGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGPD 405
            GGGL LK FK  L  KRCFLIFP+RGSER+GLVS+EVKK+KG+YDMKLLAVDIPM  GPD
Sbjct: 251  GGGLSLKKFKPTLRSKRCFLIFPIRGSERRGLVSIEVKKKKGEYDMKLLAVDIPMATGPD 310

Query: 404  QRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKDL 225
            QR FLIGDEEEYKVGGGLISELRDP+VKAMAA KEFE LDQ E++EDAEREL++AERK  
Sbjct: 311  QRLFLIGDEEEYKVGGGLISELRDPVVKAMAATKEFEELDQIEEEEDAERELQEAERKHR 370

Query: 224  EEIEKLEKGSP 192
            EEIEKLEKG+P
Sbjct: 371  EEIEKLEKGAP 381


>ref|XP_004141508.1| PREDICTED: uncharacterized protein LOC101215996 [Cucumis sativus]
          Length = 377

 Score =  395 bits (1015), Expect = e-107
 Identities = 223/366 (60%), Positives = 255/366 (69%), Gaps = 3/366 (0%)
 Frame = -3

Query: 1289 PSGFSNARNLTSEIHSKLCFHEISFKPTSPNFRFSSNFTQKQHLPTPFV--GNSKLNGAF 1116
            P   SNA  L     +       SF+   P+   SSN    Q L +P +  GNS L    
Sbjct: 29   PITHSNANPLRDPFIAHSFSSAPSFQSKFPSKPISSNVGLSQFLYSPKLTAGNSSLVTKL 88

Query: 1115 PSGFSNSRRLSPQISPKISALSSRGLGFRYISFKSSDTS-KINGNFTKTLVNKPXXXXXX 939
             +  S SR                   FR+ S K      +INGNF K +++KP      
Sbjct: 89   NAHHSASR-------------------FRFFSVKIPRFGGQINGNFAKKVIDKPAAAVSS 129

Query: 938  XXXRYREAVGLQIEAFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYGFL 759
               RYREA+GLQIEAF+KRNY              LW+IMFGIANTFVGLSEGMAKYGFL
Sbjct: 130  AFSRYREAIGLQIEAFFKRNYLVLLGFAAALICALLWKIMFGIANTFVGLSEGMAKYGFL 189

Query: 758  GLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMSGG 579
             LSSAIV F GLY+RSRFT+NPDRVYR+AMRKLNTSAGILE+MGAPL+G+DLRAYVMSGG
Sbjct: 190  ALSSAIVAFTGLYMRSRFTVNPDRVYRMAMRKLNTSAGILEVMGAPLTGSDLRAYVMSGG 249

Query: 578  GLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQR 399
            G  LKNF      KRCFLIFP+RGSERKGLVSVEVKK+KGQYDMKLLAVDIPM +GPDQR
Sbjct: 250  GFTLKNFAPNRRSKRCFLIFPIRGSERKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQR 309

Query: 398  FFLIGDEEEYKVGGGLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKDLEE 219
             FLIG+EEEYK+GGGLISELRDP+VKAMAA KEF+ LD+ E+ EDAEREL++AERK+ EE
Sbjct: 310  LFLIGNEEEYKIGGGLISELRDPVVKAMAAVKEFDDLDRIEEKEDAERELQEAERKNREE 369

Query: 218  IEKLEK 201
            IEKLEK
Sbjct: 370  IEKLEK 375


>ref|XP_006402987.1| hypothetical protein EUTSA_v10006036mg [Eutrema salsugineum]
            gi|557104086|gb|ESQ44440.1| hypothetical protein
            EUTSA_v10006036mg [Eutrema salsugineum]
          Length = 378

 Score =  394 bits (1012), Expect = e-107
 Identities = 215/369 (58%), Positives = 259/369 (70%), Gaps = 1/369 (0%)
 Frame = -3

Query: 1304 SSSIFPSGFSNARNLTSEIHSKLCFHEISFKPTSPNFRFSSNFTQKQHLPT-PFVGNSKL 1128
            SS  FPS      N  S+   K  FH IS +PTS +   S   ++ + +P     G +  
Sbjct: 35   SSPAFPS------NGVSQFQPKSSFHSISSRPTSTSLGLSQILSRPKLVPNLQICGLAMA 88

Query: 1127 NGAFPSGFSNSRRLSPQISPKISALSSRGLGFRYISFKSSDTSKINGNFTKTLVNKPXXX 948
                 + F+++ RL                      F SS   K++GNF + +V+KP   
Sbjct: 89   KPRVNTNFASAFRL----------------------FSSSGFRKVDGNFARKVVDKPIKA 126

Query: 947  XXXXXXRYREAVGLQIEAFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKY 768
                  RYREA+GL ++AFWK+N               LWRIMFGIA+TFVGLSEGMAKY
Sbjct: 127  VSSTFGRYREALGLHVDAFWKKNNLVVFGAVGVFVCIFLWRIMFGIASTFVGLSEGMAKY 186

Query: 767  GFLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVM 588
            GFL LSSAIV FAGLY+R+RFTINPD+VYRIAMRKLNT+A ILE+MGAPL+G+DLRAYVM
Sbjct: 187  GFLALSSAIVAFAGLYLRARFTINPDKVYRIAMRKLNTAADILEVMGAPLAGSDLRAYVM 246

Query: 587  SGGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGP 408
            SGGG+ LK FK  +  KRCFL+FPV+G+ERKGLVSVEVKK+KGQYDMKLLAVDIPM +GP
Sbjct: 247  SGGGITLKKFKPTIRSKRCFLLFPVQGAERKGLVSVEVKKKKGQYDMKLLAVDIPMASGP 306

Query: 407  DQRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKD 228
            DQR FLIGDEEEY+VGGGLISELRDP+VKAMAA KEF+ LD+ E++EDAEREL++AERK 
Sbjct: 307  DQRLFLIGDEEEYRVGGGLISELRDPVVKAMAAAKEFDNLDRIEEEEDAERELQEAERKQ 366

Query: 227  LEEIEKLEK 201
             EEIEKLEK
Sbjct: 367  REEIEKLEK 375


>gb|EYU20136.1| hypothetical protein MIMGU_mgv1a008727mg [Mimulus guttatus]
          Length = 364

 Score =  392 bits (1007), Expect = e-106
 Identities = 214/352 (60%), Positives = 256/352 (72%), Gaps = 7/352 (1%)
 Frame = -3

Query: 1235 CFHEISFKPT---SPNFRFSSNF--TQKQHLPTPFVGNSKLNGAFPSGFSNSRRLSPQIS 1071
            CFH+I+ KP+   +P     S+F  +  + +  P   +S L+  F S   N+ R+  ++ 
Sbjct: 13   CFHQIASKPSISGNPTSSGISHFLTSNPKLIRNPTSRSSLLSSHFSSS-PNAPRVFGKLD 71

Query: 1070 P--KISALSSRGLGFRYISFKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGLQIE 897
            P  K S      LGFRY  F S++    + NF K  +  P         RY+EAV L IE
Sbjct: 72   PRAKFSPPPGTYLGFRY--FSSANREMGSKNFAKKAMKNPALTLKSALARYKEAVVLHIE 129

Query: 896  AFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYV 717
            AFWKRNY              LWR++FGIA+TF+G SEGMAKYGFL LSSAIV F GLY 
Sbjct: 130  AFWKRNYLVVLGAGGFVVCILLWRVLFGIASTFIGFSEGMAKYGFLALSSAIVAFTGLYF 189

Query: 716  RSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGK 537
            RSRFTINPD+VYR+AMR+LNTSAGILE+MGAPL+GTDLRAYVMSGGG+ LKNF+  L  K
Sbjct: 190  RSRFTINPDKVYRMAMRRLNTSAGILEVMGAPLTGTDLRAYVMSGGGISLKNFQPGLRSK 249

Query: 536  RCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGG 357
            RCFLIFP++GSERKGLVSVE KK+KGQYDMKLLAVDIPM +GPDQR FLIGDEEEY++GG
Sbjct: 250  RCFLIFPIQGSERKGLVSVEAKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYRIGG 309

Query: 356  GLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLEK 201
            GLISELRDP+VKA+AA KEFEA D+KED+EDA REL +AE K+ EEIEKLEK
Sbjct: 310  GLISELRDPVVKALAAAKEFEARDEKEDEEDAARELLEAEIKNQEEIEKLEK 361


>ref|XP_006842580.1| hypothetical protein AMTR_s00077p00157770 [Amborella trichopoda]
            gi|548844666|gb|ERN04255.1| hypothetical protein
            AMTR_s00077p00157770 [Amborella trichopoda]
          Length = 306

 Score =  392 bits (1007), Expect = e-106
 Identities = 205/294 (69%), Positives = 233/294 (79%)
 Frame = -3

Query: 1076 ISPKISALSSRGLGFRYISFKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGLQIE 897
            + P+I + S   LG          + K+N   +K   +KP         RY EAVGLQ+E
Sbjct: 23   LKPRIGSFSFGSLG---------SSGKVNAGLSK-FFDKPLSAIGSTFSRYPEAVGLQME 72

Query: 896  AFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYV 717
            AFWKRN               LWR+MFGIA+ FVGLSEGMAKYGFL L+SAIV F GLY+
Sbjct: 73   AFWKRNSLVLLGAFGLGVCILLWRVMFGIASMFVGLSEGMAKYGFLALASAIVAFTGLYL 132

Query: 716  RSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGK 537
            RSRFTINPD+VYRIAMRKLNTSAGILE+MGAPLSGTD+RAYVMSGGGLRLK+FK +LGGK
Sbjct: 133  RSRFTINPDKVYRIAMRKLNTSAGILEVMGAPLSGTDVRAYVMSGGGLRLKSFKPRLGGK 192

Query: 536  RCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGG 357
            RCFLIFP+RGSERKGLVSVEVKK++GQYDMKLLAVD+PMT+GPDQR FLIGDEEEYKVGG
Sbjct: 193  RCFLIFPIRGSERKGLVSVEVKKKQGQYDMKLLAVDVPMTSGPDQRLFLIGDEEEYKVGG 252

Query: 356  GLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLEKGS 195
            GLISELRDPIVKAMAA KEFE +DQKE++ED +REL++AERK  EEIE  EKGS
Sbjct: 253  GLISELRDPIVKAMAAAKEFEDIDQKEEEEDEKRELQEAERKRQEEIENPEKGS 306


>gb|EXC21106.1| hypothetical protein L484_017118 [Morus notabilis]
          Length = 378

 Score =  392 bits (1006), Expect = e-106
 Identities = 215/333 (64%), Positives = 241/333 (72%), Gaps = 28/333 (8%)
 Frame = -3

Query: 1115 PSGFSNS--RRLSPQISPKISALSS-RGLGFRYISFKSSDTSKINGNFTKTLVNKPXXXX 945
            PSG SN+   R SP       A SS  GLG R+ SF++S+  K N NF K +  KP    
Sbjct: 42   PSGSSNAFLPRASPTHRTNPHAFSSGAGLGLRFFSFRASELGKGNANFAKKIFEKPASAV 101

Query: 944  XXXXXRYREAVGLQI-------------------------EAFWKRNYXXXXXXXXXXXX 840
                 RYREA+GLQI                         EAF +RNY            
Sbjct: 102  AATFSRYREALGLQIKIFEKPASAVAATFSRYREALGLQIEAFCRRNYLFLLGAGAVMAC 161

Query: 839  XXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKL 660
              LWRIMFGIA++FVG SEGMAKYGFL LSSAIV FAGLYVRSRFTINPDRVYR AMRKL
Sbjct: 162  ALLWRIMFGIASSFVGFSEGMAKYGFLALSSAIVAFAGLYVRSRFTINPDRVYRTAMRKL 221

Query: 659  NTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSV 480
            NTSAGILE+MGAPLSG+DLRAYV SGGGL +KNFK ++  KRCFLIFP+RGSERKGLVSV
Sbjct: 222  NTSAGILEVMGAPLSGSDLRAYVTSGGGLTVKNFKPRIRSKRCFLIFPIRGSERKGLVSV 281

Query: 479  EVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEKE 300
            EVKK+KGQYDMKLLAVDIPM +GPDQR FL+GDEEEYKVGGGLISELRDP+V AM+A KE
Sbjct: 282  EVKKKKGQYDMKLLAVDIPMASGPDQRLFLVGDEEEYKVGGGLISELRDPVVSAMSAAKE 341

Query: 299  FEALDQKEDDEDAERELKDAERKDLEEIEKLEK 201
            F+ LDQ E++ED EREL++AERK  EEIEKLEK
Sbjct: 342  FDDLDQIEEEEDTERELQEAERKHREEIEKLEK 374


>ref|XP_006371317.1| hypothetical protein POPTR_0019s09020g [Populus trichocarpa]
            gi|550317070|gb|ERP49114.1| hypothetical protein
            POPTR_0019s09020g [Populus trichocarpa]
          Length = 378

 Score =  392 bits (1006), Expect = e-106
 Identities = 226/377 (59%), Positives = 262/377 (69%), Gaps = 3/377 (0%)
 Frame = -3

Query: 1319 SFRQHSSSIFPSGFSNARNLTSEIHSKL-CFHEISFKPTSPNFRFSSNFTQKQHLPTPFV 1143
            S     S+ F   +SNA +  + ++S++ CF   + KPTS N   S            F+
Sbjct: 32   SLASRISTSFTRPYSNASSNFTPLNSQIPCF---TSKPTSSNLGLSQ-----------FL 77

Query: 1142 GNSKLNGAFPSGFSNSRRLSPQISPKISALSSRGLGFRYISFK-SSDTSK-INGNFTKTL 969
              +K N +F                  S   S   G R  SFK SSD  K ++GNF K L
Sbjct: 78   SCTKPNSSF------------------SKNGSFFYGVRQFSFKGSSDLGKRVDGNFAKKL 119

Query: 968  VNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGL 789
            + KP         RYREA+GLQI+AF KRN               LWRIMFGIANTFV L
Sbjct: 120  LEKPATAVTSAFSRYREALGLQIDAFLKRNSLFLIGAGGVIICALLWRIMFGIANTFVSL 179

Query: 788  SEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGT 609
            SEGMAKYGFL LSSAIV F+GLY+RSR TINPD+VYR+AM KLNTSAGILE+MGAPL+GT
Sbjct: 180  SEGMAKYGFLALSSAIVAFSGLYIRSRITINPDKVYRMAMTKLNTSAGILEVMGAPLTGT 239

Query: 608  DLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVD 429
             LRAYVMSGGGL LKNFK  +  KRCFLIFP++GSERKGLVSVEVKK+KGQYDM+LLAVD
Sbjct: 240  VLRAYVMSGGGLVLKNFKPTVRSKRCFLIFPIQGSERKGLVSVEVKKKKGQYDMRLLAVD 299

Query: 428  IPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEKEFEALDQKEDDEDAEREL 249
            IPM +GPDQR FLIGDEEEYKVGGGLISELRDP+VKAMAA KEF+ LDQ E++EDAE+EL
Sbjct: 300  IPMASGPDQRLFLIGDEEEYKVGGGLISELRDPVVKAMAASKEFDDLDQIEEEEDAEKEL 359

Query: 248  KDAERKDLEEIEKLEKG 198
            ++AERK  EEIEKLEKG
Sbjct: 360  QEAERKHREEIEKLEKG 376


>ref|XP_006358158.1| PREDICTED: uncharacterized protein LOC102588510 [Solanum tuberosum]
          Length = 370

 Score =  389 bits (1000), Expect = e-105
 Identities = 225/385 (58%), Positives = 265/385 (68%), Gaps = 1/385 (0%)
 Frame = -3

Query: 1349 RTSYSSSVQNSFRQHSSSIFPSGFSNARNLTSEIHSKLCFHEISFKPTSPNFRFSSNFTQ 1170
            ++S S  V ++F  +  S  P  FS+    +S I   L  H    +  + N ++ SN TQ
Sbjct: 3    KSSSSKFVLSNFYHYILSK-PHQFSSPNVTSSGISHFLSNHSKIIEKPAVN-QWVSN-TQ 59

Query: 1169 KQHLPTPFVGNSKLNGAFPSGFSNSRRLSPQISPKISALSSRGL-GFRYISFKSSDTSKI 993
            + H  +           FP    N    S ++ P  S L +R L GFRY S KSS     
Sbjct: 60   RTHFSS---------SPFPRVLQNP---SKKLDPDGSFLWNRKLLGFRYFSLKSSGLG-- 105

Query: 992  NGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXXXXXXXXXLWRIMFG 813
                 K ++  P         RY+ AVGLQ+EAFWKRN               LWRI+FG
Sbjct: 106  ---LGKNVLKNPVEAAKKTALRYKGAVGLQMEAFWKRNSMVLFGAAGIMVCILLWRILFG 162

Query: 812  IANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKLNTSAGILEI 633
            IA TF+GLSEGMAKYGFL LSSAIV FAGLY+RSRFTINPD+VYR+AMR+LNT AGILE+
Sbjct: 163  IATTFIGLSEGMAKYGFLALSSAIVAFAGLYLRSRFTINPDKVYRMAMRRLNTEAGILEV 222

Query: 632  MGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSVEVKKQKGQY 453
            MGAPLSGTDLRAYVMSGGG+ LKNFK +  GKRCFLIFP+RGSERKGLVSVEVK ++GQY
Sbjct: 223  MGAPLSGTDLRAYVMSGGGITLKNFKPRFRGKRCFLIFPIRGSERKGLVSVEVKNKQGQY 282

Query: 452  DMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEKEFEALDQKED 273
            DMKLLAVDIPM +GPDQR FLIGDEEEY++GGGLI+ELRDP+VKAMAA KEFE  D  ED
Sbjct: 283  DMKLLAVDIPMASGPDQRLFLIGDEEEYRIGGGLIAELRDPVVKAMAATKEFEDRDDLED 342

Query: 272  DEDAERELKDAERKDLEEIEKLEKG 198
            +EDAEREL++AERK  EEIEKLEKG
Sbjct: 343  EEDAERELQEAERKHQEEIEKLEKG 367


>ref|NP_181612.1| uncharacterized protein [Arabidopsis thaliana]
            gi|17473709|gb|AAL38308.1| unknown protein [Arabidopsis
            thaliana] gi|20148507|gb|AAM10144.1| unknown protein
            [Arabidopsis thaliana] gi|330254786|gb|AEC09880.1|
            uncharacterized protein AT2G40800 [Arabidopsis thaliana]
          Length = 377

 Score =  389 bits (1000), Expect = e-105
 Identities = 212/355 (59%), Positives = 255/355 (71%)
 Frame = -3

Query: 1265 NLTSEIHSKLCFHEISFKPTSPNFRFSSNFTQKQHLPTPFVGNSKLNGAFPSGFSNSRRL 1086
            N  S++  K  FH +S KPTS N          Q L +P + N KL  A           
Sbjct: 42   NGVSQLQPKFSFHSLSSKPTSKNVGLY------QILSSPKL-NPKLQQALGL-------- 86

Query: 1085 SPQISPKISALSSRGLGFRYISFKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGL 906
                 P+++   S    FR +S KSS   K++G+F + +V+KP         RYREA+GL
Sbjct: 87   -----PRVNV--SFASAFRLVSTKSSGFRKVDGSFARKVVDKPVKAVSSTFARYREAIGL 139

Query: 905  QIEAFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAG 726
             I+AFWK+N               LWRIMFGIA+TFVGLSEGMAKYGFL LSSAIV F+G
Sbjct: 140  HIDAFWKKNSLVVFGAAGVFVCIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFSG 199

Query: 725  LYVRSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKL 546
            LY+RSRFTINPD+VYR+ MRK+NT+A ILE+MGAPLSG+DLRAYVMSGGG+  K FK  +
Sbjct: 200  LYLRSRFTINPDKVYRMTMRKINTAAEILEVMGAPLSGSDLRAYVMSGGGITFKKFKPTI 259

Query: 545  GGKRCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYK 366
              KRCFL+FPV+GSE+KGLVSVEVKK+KGQYDMKLLAVDIPM +GPDQR FLIGDEEEY+
Sbjct: 260  RSKRCFLLFPVQGSEQKGLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEEEYR 319

Query: 365  VGGGLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLEK 201
            +GGGLIS LRDP+VKAMAA KEF+ LD+ E++EDAEREL++AERK  EEIEKLEK
Sbjct: 320  IGGGLISVLRDPVVKAMAATKEFDNLDRIEEEEDAERELQEAERKHREEIEKLEK 374


>ref|XP_004235216.1| PREDICTED: uncharacterized protein LOC101251760 isoform 1 [Solanum
            lycopersicum]
          Length = 367

 Score =  389 bits (999), Expect = e-105
 Identities = 216/355 (60%), Positives = 252/355 (70%), Gaps = 10/355 (2%)
 Frame = -3

Query: 1232 FHEISFKP---TSPNFRFSSNFTQKQHLPTPFVGNSKLNGAFPSGFSNS------RRLSP 1080
            +H I  KP   ++PN   S   +    +      N  ++    + FS+S      +  S 
Sbjct: 15   YHYILSKPHQFSTPNVGISHFLSNHSKIIQKPAVNQWVSNTQRTHFSSSPFPRVLQNPSK 74

Query: 1079 QISPKISALSSRGL-GFRYISFKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGLQ 903
            ++ P  S L +R L GFRY S KSS          K ++  P         RY+ AVGLQ
Sbjct: 75   KLDPDGSFLWNRKLLGFRYFSLKSSGLG-----LGKNVLKNPVEAAKKTTLRYKGAVGLQ 129

Query: 902  IEAFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGL 723
            +EAFWKRN               LWRI+FGIA TF+GLSEGMAKYGFL LSSAIV FAGL
Sbjct: 130  MEAFWKRNSMVLFGAAGIMVCILLWRILFGIATTFIGLSEGMAKYGFLALSSAIVAFAGL 189

Query: 722  YVRSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLG 543
            Y+RSRFTINPD+VYR+AMR+LNT AGILE+MGAPLSGTDLRAYVMSGGG+ LKNFK +  
Sbjct: 190  YLRSRFTINPDKVYRMAMRRLNTEAGILEVMGAPLSGTDLRAYVMSGGGVTLKNFKPRFR 249

Query: 542  GKRCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKV 363
            GKRCFLIFP+RGSERKGLVSVEVK ++GQYDMKLLAVDIPM AGPDQR +LIGDEEEY+V
Sbjct: 250  GKRCFLIFPIRGSERKGLVSVEVKNKQGQYDMKLLAVDIPMAAGPDQRLYLIGDEEEYRV 309

Query: 362  GGGLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLEKG 198
            GGGLI+ELRDP+VKAMAA KEFE  D  ED+EDAEREL++AERK  EEIEKLEKG
Sbjct: 310  GGGLIAELRDPVVKAMAATKEFEERDDLEDEEDAERELQEAERKHQEEIEKLEKG 364


>ref|XP_002520894.1| conserved hypothetical protein [Ricinus communis]
            gi|223540025|gb|EEF41603.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 364

 Score =  387 bits (994), Expect = e-105
 Identities = 200/279 (71%), Positives = 228/279 (81%), Gaps = 1/279 (0%)
 Frame = -3

Query: 1031 RYISFKSSDTSK-INGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXX 855
            R+ S K+S+  K +NG+F + ++ KP          YREA+GLQI+AF KRN        
Sbjct: 88   RHFSLKTSNLGKTVNGDFARKVLEKPATTFSR----YREAIGLQIDAFCKRNVLLLVGAG 143

Query: 854  XXXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRI 675
                   LWRIMFGIANTFVGLSEGMAKYGFL LSSAIV FAGLY+RSR T+NPDRVYRI
Sbjct: 144  GVIVCALLWRIMFGIANTFVGLSEGMAKYGFLALSSAIVAFAGLYIRSRITVNPDRVYRI 203

Query: 674  AMRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERK 495
            AMRKLNTSA ILE+MGAPL+GT+LRAYVMSGGG+ LKNFK +L  KRCFLIFP+RGSERK
Sbjct: 204  AMRKLNTSAAILEVMGAPLTGTELRAYVMSGGGVTLKNFKPRLRSKRCFLIFPIRGSERK 263

Query: 494  GLVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAM 315
            GLVSVEVKK+KGQYDMKLLAVDIPM +GPDQR FLIGDE+EYKVGGGLI+ELRDP+VKAM
Sbjct: 264  GLVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEDEYKVGGGLIAELRDPVVKAM 323

Query: 314  AAEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLEKG 198
            AA KEF+ LD  E+ EDAEREL++AERK  EE+EKLEKG
Sbjct: 324  AASKEFDDLDDIEEAEDAERELEEAERKHREEMEKLEKG 362


>ref|NP_191202.1| uncharacterized protein [Arabidopsis thaliana]
            gi|7594521|emb|CAB88046.1| putative protein [Arabidopsis
            thaliana] gi|63003794|gb|AAY25426.1| At3g56430
            [Arabidopsis thaliana] gi|114213521|gb|ABI54343.1|
            At3g56430 [Arabidopsis thaliana]
            gi|332646000|gb|AEE79521.1| uncharacterized protein
            AT3G56430 [Arabidopsis thaliana]
          Length = 434

 Score =  386 bits (991), Expect = e-104
 Identities = 213/371 (57%), Positives = 261/371 (70%), Gaps = 17/371 (4%)
 Frame = -3

Query: 1265 NLTSEIHSKLCFHEISFKPTSPNFRFSSNFTQKQHLPTPFVGNSKLNGAFPSGFSN--SR 1092
            N  S++  K  FH  S +PTS NF  S      Q LP+  V   +   +F S  S   S+
Sbjct: 43   NGVSQLQPKSGFHTFSSRPTSKNFGLS------QILPSNGVSQLQPKTSFHSFLSRPTSK 96

Query: 1091 RL-------SPQISPKIS----ALSSRGLGFRYIS----FKSSDTSKINGNFTKTLVNKP 957
             +       SP++ P +     AL    +   ++S    F SS   K++GNF + +V+KP
Sbjct: 97   NVGLSQILPSPKLVPGLQNCGVALVKPRVNMNFVSAFRLFSSSGFRKVDGNFARKVVDKP 156

Query: 956  XXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGM 777
                     RYR A+GL ++AFWK+N               LWR+MFGIA+TFVGLSEGM
Sbjct: 157  IKAVSSTFARYRMALGLHVDAFWKKNNLLVFGAGAVFVCIFLWRVMFGIASTFVGLSEGM 216

Query: 776  AKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRA 597
            AKYGFL LSSAIV FAGLY+R+RFTINPD+VYRI MRKLNT+A +LE+MGAPL+G+DLRA
Sbjct: 217  AKYGFLALSSAIVAFAGLYLRARFTINPDKVYRITMRKLNTAADVLEVMGAPLAGSDLRA 276

Query: 596  YVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSVEVKKQKGQYDMKLLAVDIPMT 417
            YVMSGGG+  K FK  +  KRCFL+FPV+GSERKGLVSVEVKK+KGQYDMKLLAVDIPM 
Sbjct: 277  YVMSGGGITFKKFKPTIRNKRCFLLFPVQGSERKGLVSVEVKKKKGQYDMKLLAVDIPMA 336

Query: 416  AGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAE 237
            +GPDQR FLIGDEEEY+VGGGLISELRDP+VKAMAA KEF+ LD+ E++EDAEREL++AE
Sbjct: 337  SGPDQRLFLIGDEEEYRVGGGLISELRDPVVKAMAATKEFDNLDRIEEEEDAERELQEAE 396

Query: 236  RKDLEEIEKLE 204
            RK+ EEIE  E
Sbjct: 397  RKEREEIELQE 407


>ref|XP_002878085.1| hypothetical protein ARALYDRAFT_907085 [Arabidopsis lyrata subsp.
            lyrata] gi|297323923|gb|EFH54344.1| hypothetical protein
            ARALYDRAFT_907085 [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  385 bits (989), Expect = e-104
 Identities = 219/396 (55%), Positives = 266/396 (67%), Gaps = 18/396 (4%)
 Frame = -3

Query: 1337 SSSVQNSFRQHSSSIFPSGFSNARNLTSEIHSKLCFHEISFKPTSPNFRFSSNFTQKQHL 1158
            S + ++     SS   PS      N  S++ +K  FH  S +PT+ NF  S      Q L
Sbjct: 25   SKTGRSKLSAFSSPTLPS------NGVSQLQAKSGFHSFSSRPTAKNFGLS------QIL 72

Query: 1157 PTPFVGNSKLNGAFPS----------GFSNSRRLSPQISPKIS----ALSSRGLGFRYIS 1020
            P+  V   +   +F S          G S     SP++ P +     AL    +   + S
Sbjct: 73   PSNGVSQLQPKTSFHSFLSRPTSKNLGLSQILPSSPKLVPSLQNCGVALVKPRVNVNFAS 132

Query: 1019 ----FKSSDTSKINGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXXX 852
                F SS   KI+GNF + +V+KP         RYR A+GL I+AFWK+N         
Sbjct: 133  AFRLFSSSGFRKIDGNFARKVVDKPIQAVSSTFARYRMALGLHIDAFWKKNNLLVFGAGA 192

Query: 851  XXXXXXLWRIMFGIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRIA 672
                  LWRIMFGIA+TFVGLSEGMAKYGFL LSSAIV FAGLY+R+RFTINPD+VYRI 
Sbjct: 193  VFVCIFLWRIMFGIASTFVGLSEGMAKYGFLALSSAIVAFAGLYLRARFTINPDKVYRIT 252

Query: 671  MRKLNTSAGILEIMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERKG 492
            MRKLNT+A +LE+MGAPL+G+DLRAYVMSGGG+  K FK  +  KRCFL+FPV+GSERKG
Sbjct: 253  MRKLNTAADVLEVMGAPLAGSDLRAYVMSGGGITFKKFKPTIRNKRCFLLFPVQGSERKG 312

Query: 491  LVSVEVKKQKGQYDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAMA 312
            LVSVEVKK+KGQYDMKLLAVDIPM +GPDQR FLIGDE EY+VGGGLISELRDP+VKAMA
Sbjct: 313  LVSVEVKKKKGQYDMKLLAVDIPMASGPDQRLFLIGDEVEYRVGGGLISELRDPVVKAMA 372

Query: 311  AEKEFEALDQKEDDEDAERELKDAERKDLEEIEKLE 204
            A KEF+ LD+ E++EDAEREL++AERK+ EEIE  E
Sbjct: 373  ATKEFDNLDRIEEEEDAERELQEAERKEREEIELQE 408


>ref|XP_004167240.1| PREDICTED: uncharacterized protein LOC101225862 [Cucumis sativus]
          Length = 378

 Score =  384 bits (987), Expect = e-104
 Identities = 219/367 (59%), Positives = 254/367 (69%), Gaps = 4/367 (1%)
 Frame = -3

Query: 1289 PSGFSNARNLTSEIHSKLCFHEISFKPTSPNFRFSSNFTQKQHLPTPFV--GNSKLNGAF 1116
            P   SNA  L     +       SF+   P+   SSN    Q L +P +  GNS L    
Sbjct: 29   PITHSNANPLRDPFIAHSFSSAPSFQSKFPSKPISSNVGLSQFLYSPKLTAGNSSLVTKL 88

Query: 1115 PSGFSNSRRLSPQISPKISALSSRGLGFRYISFKSSDTS-KINGNFTKTLVNKPXXXXXX 939
             +  S SR                   FR+ S K      +INGNF K +++KP      
Sbjct: 89   NAHHSASR-------------------FRFFSVKIPRFGGQINGNFAKKVIDKPAAAVSS 129

Query: 938  XXXRYREAVGLQIEAFWKRNYXXXXXXXXXXXXXXLWRIMFGIANTFVGLSEGMAKYGFL 759
               RYREA+GLQIEAF+KRNY              LW+IMFGIANTFVGLSEGMAKYGFL
Sbjct: 130  AFSRYREAIGLQIEAFFKRNYLVLLGFAAALICALLWKIMFGIANTFVGLSEGMAKYGFL 189

Query: 758  GLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKLNTSAGILEIMGAPLSGTDLRAYVMSGG 579
             LSSAIV F GLY+RSRFT+NPDRVYR+AMRKLNTSAGILE+MGAPL+G+DLRAYVMSGG
Sbjct: 190  ALSSAIVAFTGLYMRSRFTVNPDRVYRMAMRKLNTSAGILEVMGAPLTGSDLRAYVMSGG 249

Query: 578  GLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSVEVKKQKGQ-YDMKLLAVDIPMTAGPDQ 402
            G  LKNF      KRCFLIFP+RGSERKGLVSVEVK+++ + YDMKLLAVDIPM +GPDQ
Sbjct: 250  GFTLKNFAPNRRSKRCFLIFPIRGSERKGLVSVEVKRRRARFYDMKLLAVDIPMASGPDQ 309

Query: 401  RFFLIGDEEEYKVGGGLISELRDPIVKAMAAEKEFEALDQKEDDEDAERELKDAERKDLE 222
            R FLIG+EEEYK+GGGLISELRDP+VKAMAA KEF+ LD+ E+ EDAEREL++AERK+ E
Sbjct: 310  RLFLIGNEEEYKIGGGLISELRDPVVKAMAAVKEFDDLDRIEEKEDAERELQEAERKNRE 369

Query: 221  EIEKLEK 201
            EIEKLEK
Sbjct: 370  EIEKLEK 376


>ref|XP_006291135.1| hypothetical protein CARUB_v10017250mg [Capsella rubella]
            gi|482559842|gb|EOA24033.1| hypothetical protein
            CARUB_v10017250mg [Capsella rubella]
          Length = 444

 Score =  384 bits (986), Expect = e-104
 Identities = 217/385 (56%), Positives = 265/385 (68%), Gaps = 17/385 (4%)
 Frame = -3

Query: 1304 SSSIFPSGFSNARNLTSEIHSKLCFHEISFKPTSPNFRFSSNFTQKQHLPTPFVGNSKLN 1125
            S+  FP+  SN     S++  K  FH  S +PT  NF  S      Q LP+  V   K  
Sbjct: 32   SALSFPALPSNG---VSQLQPKSGFHSFSSRPTLKNFGLS------QILPSNGVSQLKPK 82

Query: 1124 GAFPSGFSN--SRRL-------SPQISPKIS----ALSSRGLGFRYIS----FKSSDTSK 996
             +F S  S   S+ +       SP++ P +     AL    +   + S    F SS   K
Sbjct: 83   TSFHSFLSRPTSKNVGLFQILSSPKLVPSLQNCGVALVKPRVNMNFASAFRLFSSSGFRK 142

Query: 995  INGNFTKTLVNKPXXXXXXXXXRYREAVGLQIEAFWKRNYXXXXXXXXXXXXXXLWRIMF 816
            ++GNF + +V+KP         RYR A+GL I+AFWK+N               LWRIMF
Sbjct: 143  VDGNFARKVVDKPIQAVSSTFARYRMALGLHIDAFWKKNNLLVFGAGAVFVCIFLWRIMF 202

Query: 815  GIANTFVGLSEGMAKYGFLGLSSAIVTFAGLYVRSRFTINPDRVYRIAMRKLNTSAGILE 636
            GIA+TFVGLSEGMAKYGFL LSSAIV FAGLY+R+RFTINPD+VYRI MRKLNT+A +LE
Sbjct: 203  GIASTFVGLSEGMAKYGFLALSSAIVAFAGLYLRARFTINPDKVYRITMRKLNTAADVLE 262

Query: 635  IMGAPLSGTDLRAYVMSGGGLRLKNFKLKLGGKRCFLIFPVRGSERKGLVSVEVKKQKGQ 456
            +MGAPL+G+DLRAYVMSGGG+  K FK  +  KRCFL+FPV+GSERKGLVSVEVKK+KGQ
Sbjct: 263  VMGAPLAGSDLRAYVMSGGGITFKRFKPSIRNKRCFLLFPVQGSERKGLVSVEVKKKKGQ 322

Query: 455  YDMKLLAVDIPMTAGPDQRFFLIGDEEEYKVGGGLISELRDPIVKAMAAEKEFEALDQKE 276
            YDMKLLAVDIPM +GPDQR FLIGDEEEY+VGGGLISELRDP+VKAMAA KEF+ LD+ E
Sbjct: 323  YDMKLLAVDIPMASGPDQRLFLIGDEEEYRVGGGLISELRDPVVKAMAATKEFDNLDRIE 382

Query: 275  DDEDAERELKDAERKDLEEIEKLEK 201
            ++EDAEREL++AERK  EE E+ ++
Sbjct: 383  EEEDAERELQEAERKQREEEERKQR 407


Top