BLASTX nr result

ID: Akebia24_contig00011262 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00011262
         (2370 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXC19998.1| Methyltransferase-like protein 7B [Morus notabilis]    401   e-109
ref|XP_004298421.1| PREDICTED: uncharacterized protein LOC101302...   398   e-108
ref|XP_006483695.1| PREDICTED: uncharacterized protein LOC102616...   379   e-102
ref|XP_007046058.1| RING/U-box superfamily protein, putative iso...   362   2e-99
ref|XP_004135450.1| PREDICTED: uncharacterized protein LOC101203...   358   2e-97
ref|XP_006575835.1| PREDICTED: uncharacterized protein LOC100789...   356   2e-95
ref|XP_006438717.1| hypothetical protein CICLE_v10031680mg [Citr...   335   7e-89
ref|XP_007046059.1| RING/U-box superfamily protein, putative iso...   327   8e-89
ref|XP_004238816.1| PREDICTED: uncharacterized protein LOC101246...   315   6e-83
ref|XP_006355117.1| PREDICTED: uncharacterized protein LOC102606...   305   5e-80
ref|XP_007157605.1| hypothetical protein PHAVU_002G083400g [Phas...   304   1e-79
ref|XP_004490108.1| PREDICTED: uncharacterized protein LOC101512...   304   1e-79
ref|XP_004238817.1| PREDICTED: uncharacterized protein LOC101246...   299   3e-78
ref|XP_006573120.1| PREDICTED: uncharacterized protein LOC100779...   291   1e-75
ref|XP_003613854.1| hypothetical protein MTR_5g041750 [Medicago ...   286   3e-74
ref|XP_007046060.1| Mandelonitrile lyase, related, putative isof...   264   4e-70
ref|XP_004983150.1| PREDICTED: uncharacterized protein LOC101770...   272   6e-70
ref|XP_006661763.1| PREDICTED: uncharacterized protein LOC102712...   270   3e-69
ref|XP_007223213.1| hypothetical protein PRUPE_ppa009352mg [Prun...   266   4e-68
gb|ABB47573.1| expressed protein [Oryza sativa Japonica Group] g...   264   2e-67

>gb|EXC19998.1| Methyltransferase-like protein 7B [Morus notabilis]
          Length = 675

 Score =  401 bits (1031), Expect = e-109
 Identities = 233/457 (50%), Positives = 273/457 (59%), Gaps = 4/457 (0%)
 Frame = -1

Query: 1677 EIKRVKSERNWFLLVPKNIAK--TGSLCCVAARPHVSNTASREWSMGPHEPYWRSNMSFS 1504
            E+KRV      +L V    AK  TGSLCCVAARPH S+T SR+WS+GPHEPYW++N SFS
Sbjct: 221  EVKRVLKPGGLYLFVEHVAAKAKTGSLCCVAARPHGSHTTSRDWSVGPHEPYWQTNSSFS 280

Query: 1503 PPISRWEHRFQNEGLPYGEHRFQTEGLPYXXXXXXXXXXXXXXSNSKESRISVRDDHFPN 1324
            PP +RW+ +FQ EGL YG H    +G+P               SNSKESR  VR +   N
Sbjct: 281  PPPARWDFQFQTEGLQYGPH----DGIP--------PYGSSTSSNSKESRSWVRGNQLYN 328

Query: 1323 HS-VSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMRESASGPLVFTPTMEGTS 1147
            H   SDGAG + SS SD  Q  QWTP  IQ  ++DDF S   R    GP  F   MEGTS
Sbjct: 329  HHYASDGAGMFLSSSSDLSQGPQWTPPAIQEIRVDDFESAQRRGMTLGPPSFKTNMEGTS 388

Query: 1146 ALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHE 967
              P            SE +      +             SKPIHP+SFP+  +P  +  +
Sbjct: 389  ENPDSGGSTSFRSDSSESEP--KSRSSSHRTFSSRRSFMSKPIHPLSFPRQTSPG-EVSD 445

Query: 966  IIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXE-AFQ 790
            I V G          +   + Q     WSS SSS+DF DV                   +
Sbjct: 446  IAVAG--------LPEFDASAQRDAHSWSSASSSLDFADVSETFESETCNRSCNPSDGLR 497

Query: 789  CGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICA 610
            CGLC++ LSQRSPWSSRRIVRSGDMPVTGVLSC HVFHA+CLE TTPK+ ++DPPCP+CA
Sbjct: 498  CGLCERLLSQRSPWSSRRIVRSGDMPVTGVLSCCHVFHAECLEQTTPKMRRNDPPCPLCA 557

Query: 609  RSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQVGDCVEGALHASPRNSM 430
            R EE N     EQ    R  +N  PRL+PF EDG SR W C QVGDCVEGALH  PRNSM
Sbjct: 558  RLEEENCP---EQRSCSR-SRNSFPRLKPFNEDGPSRPWGCVQVGDCVEGALHVPPRNSM 613

Query: 429  MLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQF 319
            +LLNRSR+KK+LS KGNSSKE P KLKKSGS+SSQQF
Sbjct: 614  LLLNRSRIKKNLSLKGNSSKEFPGKLKKSGSYSSQQF 650


>ref|XP_004298421.1| PREDICTED: uncharacterized protein LOC101302161 [Fragaria vesca
            subsp. vesca]
          Length = 453

 Score =  398 bits (1023), Expect = e-108
 Identities = 228/442 (51%), Positives = 271/442 (61%), Gaps = 5/442 (1%)
 Frame = -1

Query: 1620 AKTGSLCCVAARPHVSNTASREWSMGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHR 1441
            AKTGSLCCVAARPH S+TASR+WS+GP EP WR+N SFSPP SRW+ RFQ+E LPYG H 
Sbjct: 15   AKTGSLCCVAARPHESHTASRDWSVGPDEPCWRTNTSFSPPPSRWDFRFQSEELPYGLH- 73

Query: 1440 FQTEGLPYXXXXXXXXXXXXXXSNSKESRISVRDDHFPN--HSVSDGAGSYFSSPSDSFQ 1267
               +G+                 NS+ SR  VR +H  N  +S SDGAG + SS SD  Q
Sbjct: 74   ---DGVQLYESSTSS--------NSRGSRGRVRGNHLYNLHYSASDGAGLFLSSSSDHSQ 122

Query: 1266 THQWTPSPIQGGKIDDFMSGTMRESASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDA 1087
              QWTP  IQ   IDD+ +   R  A G   F PTMEGTS +             SE +A
Sbjct: 123  GPQWTPPAIQEISIDDYETTKRRGPAFGKSSFRPTMEGTSEIQDTRGSTSSRSDSSESEA 182

Query: 1086 MVNRHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHEIIVTGN-FLNRSTPSTQHFG 910
             +                 SKPI+P+SFP  QT  R+  +  V G+   + +TP      
Sbjct: 183  TIKASLSSHHTFASRRSFMSKPIYPLSFPS-QTLPREASDFTVAGSPDFDAATP------ 235

Query: 909  ATQGGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXEA--FQCGLCDKFLSQRSPWSSRR 736
              Q    RWSS SSS+DFTDV                +  F+CGLCD+FLSQRSPWSSRR
Sbjct: 236  --QRDGHRWSSASSSVDFTDVSESFEAEISIRHFSNMSEGFRCGLCDRFLSQRSPWSSRR 293

Query: 735  IVRSGDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICARSEEINASSPLEQSVVFR 556
            IVRSGDMP+ GVLSC HVFHA+CLE TTPK  K DPPCP+CAR EE N S   EQ    R
Sbjct: 294  IVRSGDMPIAGVLSCCHVFHAECLEQTTPKTRKSDPPCPLCARLEEENLS---EQQSFSR 350

Query: 555  LKKNGLPRLRPFQEDGSSRHWACGQVGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNS 376
            L+ N  PRLRP  +DG SR W C QVGDCVEGALHA PRNSM+LLNRSR+KK+LS KGNS
Sbjct: 351  LRAN-FPRLRPISDDGPSRPWGCAQVGDCVEGALHAPPRNSMLLLNRSRIKKNLSLKGNS 409

Query: 375  SKEPPYKLKKSGSFSSQQFEGE 310
            SKE P KL++S ++++Q   G+
Sbjct: 410  SKEFPGKLRRSATYATQHLSGK 431


>ref|XP_006483695.1| PREDICTED: uncharacterized protein LOC102616602 [Citrus sinensis]
          Length = 439

 Score =  379 bits (974), Expect = e-102
 Identities = 223/445 (50%), Positives = 268/445 (60%), Gaps = 3/445 (0%)
 Frame = -1

Query: 1635 VPKNIAKTGSLCCVAARPHVSNTASREWSMGPHEPYWRSNMSFSPPISRWEHRFQNEGLP 1456
            +PK IAKTGSLCCVAARPH SN  SR+WSMGPHEPYW++N SFSPP SRW+ RFQ+EGLP
Sbjct: 1    MPK-IAKTGSLCCVAARPHGSNATSRDWSMGPHEPYWQTNTSFSPPPSRWDFRFQSEGLP 59

Query: 1455 YGEHRFQTEGLPYXXXXXXXXXXXXXXSNSKESRISVRDD--HFPNHSVSDGAGSYFSSP 1282
            YG +    +G  +               NSKES+  VR +  +   +S SD AG + SS 
Sbjct: 60   YGSN----DGFHFYGSSTSS--------NSKESKSWVRGNLPYNNQNSASDSAGLFLSSS 107

Query: 1281 SDSFQTHQWTPSPIQGGKIDDFMSGTMRESASGPLVFTPTMEGTSALPYXXXXXXXXXXX 1102
            SD  Q  QWTP  IQ   ID + + T R+  S    FTP +EGTSA PY           
Sbjct: 108  SDLSQGPQWTPPAIQEITIDGYETPTRRDPVSTQFSFTPAIEGTSANPYSRGSTSSRSDS 167

Query: 1101 SEYDAMVNRHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHEIIVTGNFLNRSTPST 922
            SE +  V                 SKPI+P+SFP  QT +R+  +   T   L+    ST
Sbjct: 168  SESEPKVKSCISSHCNFSSRLSFMSKPIYPLSFPT-QTSNREAIDSAST--VLSEDDTST 224

Query: 921  QHFGATQGGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWS 745
              + A      RWSS SSS+DF DV                  F+CGLC++FLSQRSPWS
Sbjct: 225  PQWEAH-----RWSSASSSVDFADVSEPFESESFGQSYVPSDTFKCGLCERFLSQRSPWS 279

Query: 744  SRRIVRSGDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICARSEEINASSPLEQSV 565
            SRRIVRSGDMPV GVLSC HVFHA+CLE TTPK  K DP CPIC R +E N  SP +Q  
Sbjct: 280  SRRIVRSGDMPVVGVLSCRHVFHAECLEQTTPKTQKSDPSCPICLRLQEEN--SPDQQ-- 335

Query: 564  VFRLKKNGLPRLRPFQEDGSSRHWACGQVGDCVEGALHASPRNSMMLLNRSRLKKHLSSK 385
            VF   KN  PRLR   ++G SR W C   G CVEGA H  PRN+++LLNR+R+KK+LS K
Sbjct: 336  VFSRLKNSFPRLRQSCDNGQSRPWGCPLAGGCVEGASHVPPRNTVLLLNRNRVKKNLSLK 395

Query: 384  GNSSKEPPYKLKKSGSFSSQQFEGE 310
            GNSSKE P KL+K+G+ SSQ F G+
Sbjct: 396  GNSSKEFPGKLRKTGACSSQLFNGK 420


>ref|XP_007046058.1| RING/U-box superfamily protein, putative isoform 1 [Theobroma cacao]
            gi|508709993|gb|EOY01890.1| RING/U-box superfamily
            protein, putative isoform 1 [Theobroma cacao]
          Length = 411

 Score =  362 bits (928), Expect(2) = 2e-99
 Identities = 213/416 (51%), Positives = 246/416 (59%), Gaps = 3/416 (0%)
 Frame = -1

Query: 1548 MGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHRFQTEGLPYXXXXXXXXXXXXXXSN 1369
            MGPHEPYWR+N SFSPP SRW+  FQ EGL YG H    +G+                 N
Sbjct: 1    MGPHEPYWRTNTSFSPPPSRWDFHFQPEGLSYGSH----DGIQLYGSATSS--------N 48

Query: 1368 SKESRISVRDDHFPNH--SVSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMRE 1195
            SKESR  VR +   NH  S SDGAG + SSPSD  Q  QWTP  IQ    DD+ + T R+
Sbjct: 49   SKESRGWVRGNLLYNHQYSTSDGAGLFLSSPSDLSQGPQWTPPAIQEITADDYETTTRRD 108

Query: 1194 SASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSKPIH 1015
               G L F   +EG  +              SE +AMV                 SKPIH
Sbjct: 109  QVVGQLPFASIVEGILSTADSGVSTSSHSDSSESEAMVKPCLSSHRNFSNRRYFMSKPIH 168

Query: 1014 PISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXXXX 835
            P+SFPK  TP  +  +  V G   + +TP        Q    RWSS SSS DF DV    
Sbjct: 169  PLSFPKG-TPTTEASDSAVAGFSDDAATP--------QRDAHRWSSASSSNDFADVSEPF 219

Query: 834  XXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLEH 658
                          F+CGLC++FLSQRSPWSSRRIVRS DMPV GVLSC HVFHA+CLE 
Sbjct: 220  ESEIFNRSFIPSDGFKCGLCERFLSQRSPWSSRRIVRSSDMPVAGVLSCRHVFHAECLEQ 279

Query: 657  TTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQV 478
            TTPK  K+DPPCPIC R EE N+    E+ V+ RL +NGLPRLRPF EDG SR W C QV
Sbjct: 280  TTPKTRKNDPPCPICVRLEEQNSP---EKQVISRL-RNGLPRLRPFSEDGPSRTWGCAQV 335

Query: 477  GDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQFEGE 310
            GDCVEGALHA PR++M+LLNRSR+KK+L  KGNSSKE P KL+KSGS S Q F G+
Sbjct: 336  GDCVEGALHAPPRSTMLLLNRSRMKKNLFVKGNSSKEFPGKLRKSGSSSLQLFGGK 391



 Score = 30.4 bits (67), Expect(2) = 2e-99
 Identities = 13/22 (59%), Positives = 17/22 (77%)
 Frame = -3

Query: 313 GRSIEQEAVECSNLTAGPALKR 248
           G+SI+Q AV CS   AGP++KR
Sbjct: 390 GKSIDQGAVGCSKTIAGPSVKR 411


>ref|XP_004135450.1| PREDICTED: uncharacterized protein LOC101203618 [Cucumis sativus]
            gi|449532609|ref|XP_004173273.1| PREDICTED:
            uncharacterized LOC101203618 [Cucumis sativus]
          Length = 436

 Score =  358 bits (920), Expect(2) = 2e-97
 Identities = 215/435 (49%), Positives = 256/435 (58%), Gaps = 6/435 (1%)
 Frame = -1

Query: 1611 GSLCCVAARPHVSNTASREWSMGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHRFQT 1432
            GSLCCVAARPH SN ASR+WS+GPHEP+W +N SFSPP SRW+ +FQ+EGLP+G H    
Sbjct: 2    GSLCCVAARPHGSNAASRDWSLGPHEPFWHTNTSFSPPPSRWDIQFQSEGLPHGWH---- 57

Query: 1431 EGLPYXXXXXXXXXXXXXXSNSKESRISVRDDH--FPNHSVSDGAGSYFSSPSDSFQTHQ 1258
                               SNSKESR  +R ++  + ++S SDGAG + SSPSD  Q  Q
Sbjct: 58   --------DAVQLYGSSTSSNSKESRSWIRGNNHLYTHNSASDGAGLFLSSPSDISQGPQ 109

Query: 1257 WTPSPIQGGKIDDFMSGTMRESASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVN 1078
            WTP  IQ   ID + + T R+ +     F P  EG S  P            SE +  V 
Sbjct: 110  WTPPAIQEINIDGYETATKRDPSLRTFSFWPAAEGNSENPDSGSSTFSQSDSSETEPTVK 169

Query: 1077 RHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHEIIVTGNF-LNRSTPSTQHFGATQ 901
              +             SKPIHP++ P  QT   +  E    G    + STP        Q
Sbjct: 170  LRSSSNWNFTSRRSFMSKPIHPLAIPM-QTSSGEAFESTNLGFAEFDSSTP--------Q 220

Query: 900  GGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRS 724
                RWSS SSSIDF DV                 +F+CGLC++FLSQRSPWSSRRIVRS
Sbjct: 221  RDNQRWSSASSSIDFADVSEPLESDFYFKSSCRSDSFRCGLCERFLSQRSPWSSRRIVRS 280

Query: 723  GDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKK- 547
             DMPV GVLSC HVFHA+CL+ TTPK  K DPPCP+C + E  N  SP EQ    RL+  
Sbjct: 281  TDMPVAGVLSCRHVFHAECLDQTTPKTCKSDPPCPLCLKHE--NDRSP-EQRTNSRLRNA 337

Query: 546  NGLPRLRP-FQEDGSSRHWACGQVGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSK 370
            N LPR RP   EDG SR W C QVGDCVEGALHA PRNSM+ +NR+R  K+LS KGNSSK
Sbjct: 338  NSLPRPRPSTSEDGPSRPWGCAQVGDCVEGALHAPPRNSMLFVNRNR-SKNLSFKGNSSK 396

Query: 369  EPPYKLKKSGSFSSQ 325
            E P KL+KSGS+SS+
Sbjct: 397  EFPGKLRKSGSYSSR 411



 Score = 27.3 bits (59), Expect(2) = 2e-97
 Identities = 11/21 (52%), Positives = 15/21 (71%)
 Frame = -3

Query: 310 RSIEQEAVECSNLTAGPALKR 248
           R  +QE V CS  +AGP++KR
Sbjct: 416 RPFDQEFVGCSRTSAGPSMKR 436


>ref|XP_006575835.1| PREDICTED: uncharacterized protein LOC100789831 [Glycine max]
          Length = 534

 Score =  356 bits (914), Expect(2) = 2e-95
 Identities = 207/442 (46%), Positives = 256/442 (57%), Gaps = 5/442 (1%)
 Frame = -1

Query: 1623 IAKTGSLCCVAARPHVSNTASREWSMGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEH 1444
            IAKTGSLCCVA+RPH SN  SR+WSMGP+EPYWR+N S+SPP +RW+ RFQ+EGLP    
Sbjct: 96   IAKTGSLCCVASRPHESNAGSRDWSMGPNEPYWRTNSSYSPPPTRWDFRFQSEGLP---- 151

Query: 1443 RFQTEGLPYXXXXXXXXXXXXXXSNSKESRISVRDDHFPN--HSVSDGAGSYFSSPSDSF 1270
                    Y              S  KESR  VR +H  +  +S SD  G + SSPSD  
Sbjct: 152  --------YDVNDGVQLYGSSTSSIDKESRGWVRGNHLYDLHYSASDDTGIFLSSPSDLS 203

Query: 1269 QTHQWTPSPIQGGKIDDFMSGTMRES--ASGPLVFTPTMEGTSALPYXXXXXXXXXXXSE 1096
            Q  QWTP  IQ   ID++ + T ++S  +   + FTP  EGTS  P            SE
Sbjct: 204  QGPQWTPPAIQEISIDNYETSTRKDSHPSVDRVSFTPNKEGTSVNPNSGGSTSSQSESSE 263

Query: 1095 YDAMVNRHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQH 916
             ++                   SKPIHP+SF    T  RD  +  VT +F    T +   
Sbjct: 264  SESTAKSRLSSQRNFSNLRSFMSKPIHPMSF-NDLTTTRDAFDPAVT-DFTEFDTSTPLR 321

Query: 915  FGATQGGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSR 739
             G       RWSS SSS +F D+                  F+CGLC++FL+QRSPWSSR
Sbjct: 322  DGH------RWSSASSSQEFADITESFELETPGRSHFLSDGFRCGLCERFLTQRSPWSSR 375

Query: 738  RIVRSGDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICARSEEINASSPLEQSVVF 559
            RIVRSGDMP  GVL C H FHA+CLE TTPK  K DPPCP+C + EE N+    +Q    
Sbjct: 376  RIVRSGDMPTIGVLPCCHAFHAECLEQTTPKTQKSDPPCPVCVKLEEENSP---DQRGHL 432

Query: 558  RLKKNGLPRLRPFQEDGSSRHWACGQVGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGN 379
            RL + G PRL+  ++DG SR W C QVGDCVEGALHA PRN+M+LLNR+R+KK+LS KGN
Sbjct: 433  RL-RTGFPRLKSSRDDGPSRPWGCVQVGDCVEGALHAPPRNTMLLLNRNRIKKNLSLKGN 491

Query: 378  SSKEPPYKLKKSGSFSSQQFEG 313
              KE P K++K+G+FSS  F G
Sbjct: 492  IGKEFPGKMRKNGTFSSHLFSG 513



 Score = 22.7 bits (47), Expect(2) = 2e-95
 Identities = 11/22 (50%), Positives = 14/22 (63%)
 Frame = -3

Query: 313 GRSIEQEAVECSNLTAGPALKR 248
           G S + EAV  S  TAGP++ R
Sbjct: 513 GSSADGEAVGSSKATAGPSVWR 534


>ref|XP_006438717.1| hypothetical protein CICLE_v10031680mg [Citrus clementina]
            gi|557540913|gb|ESR51957.1| hypothetical protein
            CICLE_v10031680mg [Citrus clementina]
          Length = 411

 Score =  335 bits (858), Expect = 7e-89
 Identities = 200/416 (48%), Positives = 243/416 (58%), Gaps = 3/416 (0%)
 Frame = -1

Query: 1548 MGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHRFQTEGLPYXXXXXXXXXXXXXXSN 1369
            MGPHEPYW++N SFSPP SRW+ RFQ+EGLPYG +    +G  +               N
Sbjct: 1    MGPHEPYWQTNTSFSPPPSRWDFRFQSEGLPYGSN----DGFHFYGSSTSS--------N 48

Query: 1368 SKESRISVRDD--HFPNHSVSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMRE 1195
            SKES+  VR +  +   +S SD AG + SS SD  Q  QWTP  IQ   ID + + T R+
Sbjct: 49   SKESKSWVRGNLPYNNQNSASDSAGLFLSSSSDLSQGPQWTPPAIQEITIDGYETPTRRD 108

Query: 1194 SASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSKPIH 1015
              S    FTP +EGTSA PY           SE +  V                 SKPI+
Sbjct: 109  PVSTQFSFTPAIEGTSANPYSRGSTSSRSDSSESEPKVKSCISSHCNFSSRLSFMSKPIY 168

Query: 1014 PISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXXXX 835
            P+SFP  QT +R+  +   T   L+    ST  + A      RWSS SSS+DF DV    
Sbjct: 169  PLSFPT-QTSNREAIDSAST--VLSEDDTSTPQWEAH-----RWSSASSSVDFADVSEPF 220

Query: 834  XXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLEH 658
                          F+CGLC++FLSQRSPWSSRRIVRSGDMPV GVLSC HVFHA+CLE 
Sbjct: 221  ESESFGQSYVPSDTFKCGLCERFLSQRSPWSSRRIVRSGDMPVVGVLSCRHVFHAECLEQ 280

Query: 657  TTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQV 478
            TTPK  K DP CPIC R +E N  SP +Q  VF   KN  PRLR   ++G SR W C   
Sbjct: 281  TTPKTQKSDPSCPICLRLQEEN--SPDQQ--VFSRLKNSFPRLRQSCDNGQSRPWGCPLA 336

Query: 477  GDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQFEGE 310
            G CVEGA H  PRN+++LLNR+R+KK+LS KGNSSKE P KL+K+G+ SSQ F G+
Sbjct: 337  GGCVEGASHVPPRNTVLLLNRNRVKKNLSLKGNSSKEFPGKLRKTGACSSQLFNGK 392


>ref|XP_007046059.1| RING/U-box superfamily protein, putative isoform 2 [Theobroma cacao]
            gi|508709994|gb|EOY01891.1| RING/U-box superfamily
            protein, putative isoform 2 [Theobroma cacao]
          Length = 382

 Score =  327 bits (837), Expect(2) = 8e-89
 Identities = 200/414 (48%), Positives = 232/414 (56%), Gaps = 1/414 (0%)
 Frame = -1

Query: 1548 MGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHRFQTEGLPYXXXXXXXXXXXXXXSN 1369
            MGPHEPYWR+N SFSPP SRW+  FQ EGL YG H    +G+                SN
Sbjct: 1    MGPHEPYWRTNTSFSPPPSRWDFHFQPEGLSYGSH----DGI--------QLYGSATSSN 48

Query: 1368 SKESRISVRDDHFPNHSVSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMRESA 1189
            SKESR               G G             QWTP  IQ    DD+ + T R+  
Sbjct: 49   SKESR---------------GWGP------------QWTPPAIQEITADDYETTTRRDQV 81

Query: 1188 SGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSKPIHPI 1009
             G L F   +EG  +              SE +AMV                 SKPIHP+
Sbjct: 82   VGQLPFASIVEGILSTADSGVSTSSHSDSSESEAMVKPCLSSHRNFSNRRYFMSKPIHPL 141

Query: 1008 SFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDV-XXXXX 832
            SFPK  TP  +  +  V G   + +TP        Q    RWSS SSS DF DV      
Sbjct: 142  SFPK-GTPTTEASDSAVAGFSDDAATP--------QRDAHRWSSASSSNDFADVSEPFES 192

Query: 831  XXXXXXXXXXEAFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLEHTT 652
                      + F+CGLC++FLSQRSPWSSRRIVRS DMPV GVLSC HVFHA+CLE TT
Sbjct: 193  EIFNRSFIPSDGFKCGLCERFLSQRSPWSSRRIVRSSDMPVAGVLSCRHVFHAECLEQTT 252

Query: 651  PKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQVGD 472
            PK  K+DPPCPIC R EE N+    E+ V+ RL +NGLPRLRPF EDG SR W C QVGD
Sbjct: 253  PKTRKNDPPCPICVRLEEQNSP---EKQVISRL-RNGLPRLRPFSEDGPSRTWGCAQVGD 308

Query: 471  CVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQFEGE 310
            CVEGALHA PR++M+LLNRSR+KK+L  KGNSSKE P KL+KSGS S Q F G+
Sbjct: 309  CVEGALHAPPRSTMLLLNRSRMKKNLFVKGNSSKEFPGKLRKSGSSSLQLFGGK 362



 Score = 30.4 bits (67), Expect(2) = 8e-89
 Identities = 13/22 (59%), Positives = 17/22 (77%)
 Frame = -3

Query: 313 GRSIEQEAVECSNLTAGPALKR 248
           G+SI+Q AV CS   AGP++KR
Sbjct: 361 GKSIDQGAVGCSKTIAGPSVKR 382


>ref|XP_004238816.1| PREDICTED: uncharacterized protein LOC101246630 isoform 1 [Solanum
            lycopersicum]
          Length = 406

 Score =  315 bits (807), Expect = 6e-83
 Identities = 189/416 (45%), Positives = 235/416 (56%), Gaps = 4/416 (0%)
 Frame = -1

Query: 1548 MGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHRFQTEGLPYXXXXXXXXXXXXXXSN 1369
            MGPHEPYWR+N SFSP  SRW+ RFQ E L +G +    +G+                 N
Sbjct: 1    MGPHEPYWRTNSSFSPAPSRWDFRFQPETLSFGSN----DGVQLYGSSASS--------N 48

Query: 1368 SKESRISVRDDHFPNHS--VSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMRE 1195
            S++SR  VR +   NH   +SDG G+Y SSPSD     QWTP  IQ   IDDF  GT R 
Sbjct: 49   SRDSRSWVRGNQLANHQYVISDGVGAYCSSPSDISPAQQWTPPAIQEINIDDF--GTSRR 106

Query: 1194 SA-SGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSKPI 1018
             A + P  F+PTMEG S               S+ D++   H+              KPI
Sbjct: 107  DAITRPFSFSPTMEGASIARDGRGSTSSRSDSSDCDSITKSHSSYRSFPSRRFFMS-KPI 165

Query: 1017 HPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXXX 838
            HP+SFP   TP R+  + +  G FL     ++Q          R SS S S+D T+    
Sbjct: 166  HPLSFPTETTPRREAIDSLSAG-FLEFDASTSQR------DKHRLSSASGSLDLTEASES 218

Query: 837  XXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLE 661
                           F+CGLC++FLSQRSPWSSRRIVRSGDMPV GVLSC HVFHA+CLE
Sbjct: 219  FHSDFLSKPCNPSDCFRCGLCERFLSQRSPWSSRRIVRSGDMPVAGVLSCRHVFHAECLE 278

Query: 660  HTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQ 481
              TPK  K DPPCPICA+ EE   SSP EQ V  +      PRL+PF E+G S+ W C  
Sbjct: 279  QATPKSCKSDPPCPICAKLEE--GSSP-EQRVFSKF----FPRLKPFSEEGPSKPWGCAH 331

Query: 480  VGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQFEG 313
             GDCVEGALH   R +M+ LN++R++K+LS KG+S K+ P KL+K+ +FSSQ F G
Sbjct: 332  SGDCVEGALHGPSRGTMLSLNKNRIRKNLSLKGDSGKDFPGKLRKTNTFSSQLFIG 387


>ref|XP_006355117.1| PREDICTED: uncharacterized protein LOC102606114 [Solanum tuberosum]
          Length = 405

 Score =  305 bits (782), Expect = 5e-80
 Identities = 188/416 (45%), Positives = 233/416 (56%), Gaps = 4/416 (0%)
 Frame = -1

Query: 1548 MGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHRFQTEGLPYXXXXXXXXXXXXXXSN 1369
            MGPHEPYWR+N SFSP  SRW+ RFQ E L +G +    +G+                 N
Sbjct: 1    MGPHEPYWRTNSSFSPAPSRWDFRFQPETLSFGSN----DGVQLYGSSASS--------N 48

Query: 1368 SKESRISVRDDHFPNHS--VSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMRE 1195
            S++SR  VR +   NH   +SDG G+Y SSPSD     QWTP  IQ   IDDF  GT R 
Sbjct: 49   SRDSRSWVRGNQLANHQYLISDGVGAYCSSPSDISPAQQWTPPAIQEINIDDF--GTSRR 106

Query: 1194 SA-SGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSKPI 1018
             A + P  F+PTMEG S               S+ D++   H+              KPI
Sbjct: 107  DAITRPFTFSPTMEGASIARDGRGSTSSRSDSSDCDSITKSHSSYRSFPSRRFFMS-KPI 165

Query: 1017 HPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXXX 838
            HP+SFP  +T  R+  + +  G FL     ++Q          R SS S S+D T+    
Sbjct: 166  HPLSFPT-ETSRREAIDSLSAG-FLEFDASTSQR------DKHRLSSASGSLDLTEASES 217

Query: 837  XXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLE 661
                           F+CGLC++FLSQRSPWSSRRIVRS DMPV GVLSC HVFHA+CLE
Sbjct: 218  FQSDFLSKPCNPSDGFRCGLCERFLSQRSPWSSRRIVRSEDMPVAGVLSCRHVFHAECLE 277

Query: 660  HTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQ 481
              TPK  K DPPCPICA+ EE   SSP EQ V  +      PRL+ F E+G S+ W C  
Sbjct: 278  QATPKSCKSDPPCPICAKLEE--GSSP-EQRVFSKF----FPRLKTFSEEGPSKPWGCAH 330

Query: 480  VGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQFEG 313
             GDCVEGALH   R +M+ LN++R++K+LS KGNS KE P KL+K+ +FSSQ F G
Sbjct: 331  SGDCVEGALHGPSRGTMLSLNKNRIRKNLSLKGNSVKEFPGKLRKTNTFSSQLFIG 386


>ref|XP_007157605.1| hypothetical protein PHAVU_002G083400g [Phaseolus vulgaris]
            gi|561031020|gb|ESW29599.1| hypothetical protein
            PHAVU_002G083400g [Phaseolus vulgaris]
          Length = 411

 Score =  304 bits (778), Expect = 1e-79
 Identities = 188/418 (44%), Positives = 232/418 (55%), Gaps = 6/418 (1%)
 Frame = -1

Query: 1548 MGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEH-RFQTEGLPYXXXXXXXXXXXXXXS 1372
            MGP+EP+WR+N SFSPP +RW+ RFQ+EG+PY  +   Q  G                 S
Sbjct: 1    MGPNEPFWRTNSSFSPPPTRWDFRFQSEGIPYSANDSIQLYG-------------SSTSS 47

Query: 1371 NSKESRISVRDDHFPN--HSVSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMR 1198
            N KESR  VR +H  +  +S SDG G   SSPSD  Q  QWTP  IQ   ID++ +   +
Sbjct: 48   NDKESRGWVRGNHLYDLHYSASDGTGILLSSPSDLSQGPQWTPPTIQEISIDNYETSARK 107

Query: 1197 ES--ASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSK 1024
            +   ++G   FTPT EGTS  P            SE ++    H              SK
Sbjct: 108  DHHPSAGRTSFTPTKEGTSVNPNSGCSTSSLSESSESESTTKSHLSCQRNFSNLRSFISK 167

Query: 1023 PIHPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDVX 844
            PIHP+SF   +T  RD  +  VT +F    T +    G        WSS SSS +F DV 
Sbjct: 168  PIHPMSFNDLKTT-RDAFDPAVT-DFTEFDTSTPLRDGQC------WSSASSSQEFADVT 219

Query: 843  XXXXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADC 667
                             F+CGLC++FL QRSPWSSRRIVRSGDMP  GVL C H FH +C
Sbjct: 220  ESFELEPPGGPHFPSDGFRCGLCERFLLQRSPWSSRRIVRSGDMPTIGVLPCCHAFHCEC 279

Query: 666  LEHTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWAC 487
            LE TTPK  K DPPCP+C + EE N  SP  +   F    NG PR +  + DG SR W C
Sbjct: 280  LEQTTPKTRKIDPPCPVCVKLEEEN--SPDHRG--FLRLTNGFPRHKSSRGDGPSRPWGC 335

Query: 486  GQVGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQFEG 313
             QVGDCVEGALHA PRN+M +LN   L+K+LS KG+ SKE P K++K+G+FSSQ F G
Sbjct: 336  AQVGDCVEGALHAPPRNAMFMLN---LRKNLSLKGSLSKEFPGKVRKNGTFSSQLFSG 390


>ref|XP_004490108.1| PREDICTED: uncharacterized protein LOC101512464, partial [Cicer
            arietinum]
          Length = 472

 Score =  304 bits (778), Expect = 1e-79
 Identities = 193/487 (39%), Positives = 251/487 (51%), Gaps = 50/487 (10%)
 Frame = -1

Query: 1623 IAKTGSLCCVAARPHVSNTASREWSMGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEH 1444
            IAKTGSLCCVA+RPH SN  SR+WSMGPHEPYWR+N S+SPP  RW+ +FQ+EGLPY  +
Sbjct: 2    IAKTGSLCCVASRPHGSNADSRDWSMGPHEPYWRTNTSYSPPPPRWDFKFQSEGLPYSLN 61

Query: 1443 RFQTEGLPYXXXXXXXXXXXXXXSNSKESRISVRDDHFPN--HSVSDGAGSYFSSPSDSF 1270
                +G+                 N K+SR  VR +H  +  +SVSDG G + SSP  S 
Sbjct: 62   ----DGVQLYDGSSTSS-------NGKDSRTWVRGNHLYDLHYSVSDGTGIFLSSPCPSE 110

Query: 1269 QTH--QWTPSPIQGGKIDDFMSGTMRE--------------------------------- 1195
              H  QWTP  IQ    DD+   T ++                                 
Sbjct: 111  LRHGPQWTPPAIQEISFDDYELVTRKDFVVFSFCRWVRELLRHHPGVLPSSFKHHPVAPP 170

Query: 1194 ----------SASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXX 1045
                       + G + F PT EGTS               SE ++  N           
Sbjct: 171  SGYNNHLDPHPSLGRISFAPTKEGTSKNLNNGSSISTQSESSESES-TNSQLSFHRTFSN 229

Query: 1044 XXXXXSKPIHPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQG--GTLRWSSPS 871
                 SKPI+P+SFP          ++  T +  + + P    F  +     + R S+ S
Sbjct: 230  HRSFVSKPIYPLSFP----------DLTTTRDAFDPAFPDFTGFDTSNPLKDSQRSSNAS 279

Query: 870  SSIDFTDVXXXXXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLS 694
            SS D  DV                  F+CGLC++FLS RSPWSSRRIVRS DMP TGVL 
Sbjct: 280  SSQDSVDVTESFELETPALPHTHSEGFRCGLCERFLSNRSPWSSRRIVRSRDMPATGVLP 339

Query: 693  CGHVFHADCLEHTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQE 514
            C HVFHA+CLE TTPK  K DPPCP+C + +E ++    +Q  V RL +NG PR +   E
Sbjct: 340  CCHVFHAECLEQTTPKTRKIDPPCPLCVKLDEQHSP---DQRSVLRL-RNGFPRFKSVCE 395

Query: 513  DGSSRHWACGQVGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSF 334
            DG SR  +C Q  DCVEG LHA P N+M+LLNR+ ++++LS +GN SK  P K++K+ ++
Sbjct: 396  DGPSRTRSCSQADDCVEGPLHAPPHNTMLLLNRNHIRRNLSLRGNLSKAFPGKVRKTETY 455

Query: 333  SSQQFEG 313
            SSQ F G
Sbjct: 456  SSQLFSG 462


>ref|XP_004238817.1| PREDICTED: uncharacterized protein LOC101246630 isoform 2 [Solanum
            lycopersicum]
          Length = 372

 Score =  299 bits (766), Expect = 3e-78
 Identities = 180/400 (45%), Positives = 223/400 (55%), Gaps = 4/400 (1%)
 Frame = -1

Query: 1548 MGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHRFQTEGLPYXXXXXXXXXXXXXXSN 1369
            MGPHEPYWR+N SFSP  SRW+ RFQ E L +G +    +G+                 N
Sbjct: 1    MGPHEPYWRTNSSFSPAPSRWDFRFQPETLSFGSN----DGVQLYGSSASS--------N 48

Query: 1368 SKESRISVRDDHFPNHS--VSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMRE 1195
            S++SR  VR +   NH   +SDG G+Y SSPSD     QWTP  IQ   IDDF  GT R 
Sbjct: 49   SRDSRSWVRGNQLANHQYVISDGVGAYCSSPSDISPAQQWTPPAIQEINIDDF--GTSRR 106

Query: 1194 SA-SGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSKPI 1018
             A + P  F+PTMEG S               S+ D++   H+              KPI
Sbjct: 107  DAITRPFSFSPTMEGASIARDGRGSTSSRSDSSDCDSITKSHSSYRSFPSRRFFMS-KPI 165

Query: 1017 HPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXXX 838
            HP+SFP   TP R+  + +  G FL     ++Q          R SS S S+D T+    
Sbjct: 166  HPLSFPTETTPRREAIDSLSAG-FLEFDASTSQR------DKHRLSSASGSLDLTEASES 218

Query: 837  XXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLE 661
                           F+CGLC++FLSQRSPWSSRRIVRSGDMPV GVLSC HVFHA+CLE
Sbjct: 219  FHSDFLSKPCNPSDCFRCGLCERFLSQRSPWSSRRIVRSGDMPVAGVLSCRHVFHAECLE 278

Query: 660  HTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQ 481
              TPK  K DPPCPICA+ EE   SSP EQ V  +      PRL+PF E+G S+ W C  
Sbjct: 279  QATPKSCKSDPPCPICAKLEE--GSSP-EQRVFSKF----FPRLKPFSEEGPSKPWGCAH 331

Query: 480  VGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPP 361
             GDCVEGALH   R +M+ LN++R++K+LS KG+S K+ P
Sbjct: 332  SGDCVEGALHGPSRGTMLSLNKNRIRKNLSLKGDSGKDFP 371


>ref|XP_006573120.1| PREDICTED: uncharacterized protein LOC100779481 isoform X1 [Glycine
            max] gi|571434172|ref|XP_006573121.1| PREDICTED:
            uncharacterized protein LOC100779481 isoform X2 [Glycine
            max]
          Length = 371

 Score =  291 bits (744), Expect = 1e-75
 Identities = 176/387 (45%), Positives = 218/387 (56%), Gaps = 5/387 (1%)
 Frame = -1

Query: 1548 MGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYGEHRFQTEGLPYXXXXXXXXXXXXXXSN 1369
            MGP+EPYWR+N SFSPP +RW+ RFQ+EGL YG +    +G+                 N
Sbjct: 1    MGPNEPYWRTNSSFSPPPTRWDFRFQSEGLSYGVN----DGVQLYGSSTSE--------N 48

Query: 1368 SKESRISVRDDHFPN--HSVSDGAGSYFSSPSDSFQTHQWTPSPIQGGKIDDFMSGTMRE 1195
             KESR  VR +H  +  +S SDG G + SSPSD  Q  QWTP  IQ   ID++ + T ++
Sbjct: 49   DKESRGWVRGNHLYDLHYSASDGTGIFLSSPSDLSQGPQWTPPAIQEISIDNYETSTRKD 108

Query: 1194 S--ASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDAMVNRHAXXXXXXXXXXXXXSKP 1021
            S  + G + FTP  EGTS   Y           SE ++    H              SKP
Sbjct: 109  SHPSVGRVSFTPNKEGTSVNHYCGGSTSSQSESSESESTTKSHLSSERNFANLRSFMSKP 168

Query: 1020 IHPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXX 841
            IHP+SF    T  RD  +  VT +F    T +    G       RWSS SSS +F DV  
Sbjct: 169  IHPMSF-NDLTTTRDAFDPAVT-DFTEFDTSTPLRDGQ------RWSSASSSQEFADVTE 220

Query: 840  XXXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCL 664
                            F+CGLC++FLSQRSPWSSRRIVRSGDMP  GVL C H FHA+CL
Sbjct: 221  SFELETPGRSHFLSDGFKCGLCERFLSQRSPWSSRRIVRSGDMPTIGVLPCCHAFHAECL 280

Query: 663  EHTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACG 484
            E  TPK  K DPPCP+C + EE   +SP +Q    RL + G PRL+  ++DG SR W C 
Sbjct: 281  EQATPKTRKSDPPCPVCVKLEE---NSP-DQRSHLRL-RTGFPRLKSSRDDGPSRPWGCV 335

Query: 483  QVGDCVEGALHASPRNSMMLLNRSRLK 403
            QVGDCVEGALHA P N+M+LLNR+R+K
Sbjct: 336  QVGDCVEGALHAPPGNTMLLLNRNRIK 362


>ref|XP_003613854.1| hypothetical protein MTR_5g041750 [Medicago truncatula]
            gi|355515189|gb|AES96812.1| hypothetical protein
            MTR_5g041750 [Medicago truncatula]
          Length = 447

 Score =  286 bits (732), Expect = 3e-74
 Identities = 185/446 (41%), Positives = 239/446 (53%), Gaps = 7/446 (1%)
 Frame = -1

Query: 1629 KNIAKTGSLCCVAARPHVSNTASREWSMGPHEPYWRSNMSFSPPISRWEHRFQNEGLPYG 1450
            + IAKTGSLCCVA+RPH S+  SREWS+GPHEPYWR+N S+SPP SRW+ RFQ+EGLPY 
Sbjct: 31   EKIAKTGSLCCVASRPHGSSADSREWSLGPHEPYWRTNTSYSPPPSRWDFRFQSEGLPYS 90

Query: 1449 EHRFQTEGLPYXXXXXXXXXXXXXXSNSKESRISVRDDHFPN--HSVSDGAGSYFSSP-- 1282
                   G  Y               N KESR  VR +H  +  +SV+DG G + SSP  
Sbjct: 91   ---LSDGGQLYDGSSTSS--------NGKESRTWVRGNHLYDLHYSVADGTGIFVSSPCP 139

Query: 1281 SDSFQTHQWTPSPIQGGKIDDFMSGTMRE--SASGPLVFTPTMEGTSALPYXXXXXXXXX 1108
            SD  Q  QW P  IQ    DD+ + T ++   + G + FTPT EGTS  PY         
Sbjct: 140  SDLSQGPQWMPPAIQEISFDDYTAVTRKDFHPSLGRISFTPTKEGTSQNPYNRGSTSSES 199

Query: 1107 XXSEYDAMVNRHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHEIIVTGNFLNRSTP 928
              SE ++  N                SKPIHP+SFP   T  RD  +  V+ ++    T 
Sbjct: 200  ESSESESTTNSQLSFQRNFSNHRSFISKPIHPLSFPDLTTA-RDAFDHAVS-DYTGFDTS 257

Query: 927  STQHFGATQGGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXE-AFQCGLCDKFLSQRSP 751
            +          + R S+ SSS D  D+                  F+C LC+KF+SQRSP
Sbjct: 258  NRLR------DSQRSSNASSSQDSADITESFDLETPAHLHTQSDEFRCSLCEKFMSQRSP 311

Query: 750  WSSRRIVRSGDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICARSEEINASSPLEQ 571
            WSSRRIVRSGDMP  GVL C HVFHA+CL+  TPK  K +PPCP+C + EE    SP ++
Sbjct: 312  WSSRRIVRSGDMPAAGVLPCRHVFHAECLDQATPKTRKIEPPCPVCVKLEE--QYSPDQR 369

Query: 570  SVVFRLKKNGLPRLRPFQEDGSSRHWACGQVGDCVEGALHASPRNSMMLLNRSRLKKHLS 391
             VV RL +N  P+   F+ D                        +SM LLNR+R++K+LS
Sbjct: 370  GVV-RL-RNSFPK---FKSD------------------------DSMFLLNRNRIRKNLS 400

Query: 390  SKGNSSKEPPYKLKKSGSFSSQQFEG 313
             +GN S + P K++K+G + SQ F G
Sbjct: 401  MRGNLSNQFPGKVRKTGGYPSQLFTG 426


>ref|XP_007046060.1| Mandelonitrile lyase, related, putative isoform 3 [Theobroma cacao]
            gi|508709995|gb|EOY01892.1| Mandelonitrile lyase,
            related, putative isoform 3 [Theobroma cacao]
          Length = 299

 Score =  264 bits (675), Expect(2) = 4e-70
 Identities = 143/240 (59%), Positives = 164/240 (68%), Gaps = 1/240 (0%)
 Frame = -1

Query: 1026 KPIHPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDV 847
            KPIHP+SFPK  TP  +  +  V G   + +TP        Q    RWSS SSS DF DV
Sbjct: 53   KPIHPLSFPKG-TPTTEASDSAVAGFSDDAATP--------QRDAHRWSSASSSNDFADV 103

Query: 846  XXXXXXXXXXXXXXXE-AFQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHAD 670
                              F+CGLC++FLSQRSPWSSRRIVRS DMPV GVLSC HVFHA+
Sbjct: 104  SEPFESEIFNRSFIPSDGFKCGLCERFLSQRSPWSSRRIVRSSDMPVAGVLSCRHVFHAE 163

Query: 669  CLEHTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWA 490
            CLE TTPK  K+DPPCPIC R EE N+    E+ V+ RL +NGLPRLRPF EDG SR W 
Sbjct: 164  CLEQTTPKTRKNDPPCPICVRLEEQNSP---EKQVISRL-RNGLPRLRPFSEDGPSRTWG 219

Query: 489  CGQVGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQFEGE 310
            C QVGDCVEGALHA PR++M+LLNRSR+KK+L  KGNSSKE P KL+KSGS S Q F G+
Sbjct: 220  CAQVGDCVEGALHAPPRSTMLLLNRSRMKKNLFVKGNSSKEFPGKLRKSGSSSLQLFGGK 279



 Score = 30.4 bits (67), Expect(2) = 4e-70
 Identities = 13/22 (59%), Positives = 17/22 (77%)
 Frame = -3

Query: 313 GRSIEQEAVECSNLTAGPALKR 248
           G+SI+Q AV CS   AGP++KR
Sbjct: 278 GKSIDQGAVGCSKTIAGPSVKR 299


>ref|XP_004983150.1| PREDICTED: uncharacterized protein LOC101770678 [Setaria italica]
          Length = 425

 Score =  272 bits (695), Expect = 6e-70
 Identities = 172/435 (39%), Positives = 224/435 (51%), Gaps = 20/435 (4%)
 Frame = -1

Query: 1611 GSLCCVAARPHVSNTASREWS-MGPHEPYWRSNMSFSPPISR-WEHRFQNEGLPYGEHRF 1438
            GSLCCVAARPH ++TASREWS +G  +P WR+N  FSPP+SR WE+R  +EGL YG H  
Sbjct: 2    GSLCCVAARPHGTSTASREWSSIGRSDPPWRTNAGFSPPLSRGWEYRINSEGLSYGSHGD 61

Query: 1437 QTEGLPYXXXXXXXXXXXXXXSNSKESRISVRDDHFPN---HSVSDGAGSYFSSPSDSFQ 1267
                  Y               NSKE+  S   +  P    +S S+GA SYF+SP  SFQ
Sbjct: 62   SGVAANYGSSLSS---------NSKEASRSWERNELPQEHRYSTSEGAISYFNSPDVSFQ 112

Query: 1266 THQWTPSPIQGGKIDDFMSGTMRESASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYDA 1087
             H      ++   +D++M    R S + P+      EG S               SEYD 
Sbjct: 113  NHHAMLPMLRDSSVDEYM----RVSVAEPIGALLLSEGISG-QQNSGGSTSRSDGSEYDI 167

Query: 1086 MVNRHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHEIIVTGNFLN----------- 940
            +   ++             SKPIHP+SFP+H    + T   I + +  N           
Sbjct: 168  VPKSYSSTPRNFPSRRSFLSKPIHPLSFPEHALEAQGTQSPIASASSNNPLRSEFKGKGE 227

Query: 939  -RSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXEAFQ---CGLCDK 772
             RS     +   + G +  WS+ +SS+D TD+                  Q   C LC++
Sbjct: 228  LRSPGPMDYASGSHGESGNWSA-ASSMDLTDLSEQPEAERAGAQRSNNVMQKTRCDLCER 286

Query: 771  FLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICARSEEIN 592
            FL++RSPW SRRIVR+GD+PV GVL C HV+HA+CLE TTPK  KHDPPCP+C +     
Sbjct: 287  FLTKRSPWGSRRIVRTGDLPVAGVLPCSHVYHAECLERTTPKGQKHDPPCPVCDKL---- 342

Query: 591  ASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQVGDCVEGALHASPRNSMMLLNRS 412
            A    E   + RL KNG PRLR   E G SR W+C   GDCV GA+     NS+ LL RS
Sbjct: 343  AGKDTEHWSICRL-KNGFPRLRSLGE-GPSRVWSCAHAGDCVAGAVQIPRSNSIALLTRS 400

Query: 411  RLKKHLSSKGNSSKE 367
              K+H SSKG+  K+
Sbjct: 401  GHKRHASSKGDPGKD 415


>ref|XP_006661763.1| PREDICTED: uncharacterized protein LOC102712663 [Oryza brachyantha]
          Length = 428

 Score =  270 bits (689), Expect = 3e-69
 Identities = 167/436 (38%), Positives = 223/436 (51%), Gaps = 21/436 (4%)
 Frame = -1

Query: 1611 GSLCCVAARPHVSNTASREWS-MGPHEPYWRSNMSFSPPISR-WEHRFQNEGLPYGEHRF 1438
            GSLCCVAARPH ++TASREWS +G ++P WR+N  FSPP+SR WE+   +EGL YG    
Sbjct: 2    GSLCCVAARPHGASTASREWSSIGRNDPLWRTNAGFSPPLSRRWEYCINSEGLSYGSQGD 61

Query: 1437 QTEGLPYXXXXXXXXXXXXXXSNSKESRISVRDDHFP----NHSVSDGAGSYFSSPSDSF 1270
                  Y               NSKE   S      P     +S S+GA SYF+SP  +F
Sbjct: 62   SGAAAHYGSSLSS---------NSKEPSRSWERSELPLDHHRYSTSEGAISYFNSPDVTF 112

Query: 1269 QTHQWTPSPIQGGKIDDFMSGTMRESASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYD 1090
              H      +Q   ID++M    R S + P+      EG S               SEYD
Sbjct: 113  HNHHIMLPMLQDSSIDEYM----RVSVAEPIGALLLSEGISGQQNSGGSTSSRSDGSEYD 168

Query: 1089 AMVNRHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHEIIVTGNFLN---------- 940
             +   ++             SKPIHP+SFP+H    ++T   +   +  N          
Sbjct: 169  IVPKSYSSTPRNFPSRRSFLSKPIHPLSFPEHALEGQETDSPVANASSSNPMPSEFKAIG 228

Query: 939  --RSTPSTQHFGATQGGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXEAF---QCGLCD 775
              RS+    +   + G +  WS+ +SS+D TD+                     +C LC+
Sbjct: 229  EIRSSGLMDYASGSHGESANWSA-ASSMDLTDLSERPETERSGPLRSNNIMDRTRCDLCE 287

Query: 774  KFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICARSEEI 595
            + LS+RSPW SRRI+R+GD+PV GVL C H++HA+CLE TTPK  KHDPPCP+C R    
Sbjct: 288  RLLSKRSPWGSRRIIRTGDLPVAGVLPCSHIYHAECLERTTPKGQKHDPPCPVCDRL--- 344

Query: 594  NASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQVGDCVEGALHASPRNSMMLLNR 415
             A    EQ  + RL +NG PRLR   E G SR W+C Q GDCV GA+     +S+ LL+R
Sbjct: 345  -AGKDTEQWSICRL-RNGFPRLRSLGE-GPSRVWSCAQAGDCVAGAVQIPRASSISLLSR 401

Query: 414  SRLKKHLSSKGNSSKE 367
            S  K+H +SKG S K+
Sbjct: 402  SGHKRHATSKGESGKD 417


>ref|XP_007223213.1| hypothetical protein PRUPE_ppa009352mg [Prunus persica]
            gi|462420149|gb|EMJ24412.1| hypothetical protein
            PRUPE_ppa009352mg [Prunus persica]
          Length = 296

 Score =  266 bits (679), Expect = 4e-68
 Identities = 141/241 (58%), Positives = 163/241 (67%), Gaps = 2/241 (0%)
 Frame = -1

Query: 1026 KPIHPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFGATQGGTLRWSSPSSSIDFTDV 847
            KPIHP+SFP  QTP R+  ++ + G F      + Q  G       RWSS SSSIDF DV
Sbjct: 46   KPIHPLSFPA-QTPPREASDLTLAG-FTEFDAATPQRDGH------RWSSASSSIDFADV 97

Query: 846  XXXXXXXXXXXXXXXEA--FQCGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHA 673
                            +  F+CGLC++FLSQRSPWSSRRIVRSGDMPVTGVLSC HVFHA
Sbjct: 98   SESFEAEISGRPCNNMSDGFRCGLCERFLSQRSPWSSRRIVRSGDMPVTGVLSCCHVFHA 157

Query: 672  DCLEHTTPKVHKHDPPCPICARSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHW 493
            +CLE TTPK  K+DPPCP+CAR EE N    L +   F   + G PRLRP  +DGSSR W
Sbjct: 158  ECLEQTTPKTRKNDPPCPLCARLEEEN----LPEQQGFSRLRTGFPRLRPISDDGSSRPW 213

Query: 492  ACGQVGDCVEGALHASPRNSMMLLNRSRLKKHLSSKGNSSKEPPYKLKKSGSFSSQQFEG 313
             C QVGDCVEGALHA PRNSM+LLNRSR+KK+LS KGN  KE P KL+KSGS+S Q   G
Sbjct: 214  GCTQVGDCVEGALHAPPRNSMLLLNRSRIKKNLSLKGNLGKEFPGKLRKSGSYSFQHLSG 273

Query: 312  E 310
            +
Sbjct: 274  K 274


>gb|ABB47573.1| expressed protein [Oryza sativa Japonica Group]
            gi|125531860|gb|EAY78425.1| hypothetical protein
            OsI_33515 [Oryza sativa Indica Group]
          Length = 433

 Score =  264 bits (674), Expect = 2e-67
 Identities = 174/442 (39%), Positives = 227/442 (51%), Gaps = 27/442 (6%)
 Frame = -1

Query: 1611 GSLCCVAARPHVSNTASREWS-MGPHEPYWRSNMSFSPPISR-WEHRFQNEGLPYGEHRF 1438
            GSLCCVA+RPH ++TASREWS +G  +P WR+N  FSPP+SR WE+R  +EGL YG    
Sbjct: 2    GSLCCVASRPHGASTASREWSSIGRSDPLWRTNAGFSPPLSRRWEYRINSEGLSYGSQGD 61

Query: 1437 QTEGLPYXXXXXXXXXXXXXXSNSKE-SRISVRDDHFPNH---SVSDGAGSYFSSPSDSF 1270
                  Y               NSKE SR   R D  P+H   S S+GA SYF+SP  +F
Sbjct: 62   SGAAAHYGSSLSS---------NSKEPSRSWERSDVPPDHHRYSTSEGAISYFNSPDVTF 112

Query: 1269 QTHQWTPSPIQGGKIDDFMSGTMRESASGPLVFTPTMEGTSALPYXXXXXXXXXXXSEYD 1090
            Q H      +Q   ID++M    R S + P+      EG S               SEYD
Sbjct: 113  QNHHIMLPMLQDSGIDEYM----RVSVAEPIGALLLSEGISGQQNSGGSTSSRSDGSEYD 168

Query: 1089 AMVNRHAXXXXXXXXXXXXXSKPIHPISFPKHQTPDRDTHEIIVTGNFLNRSTPSTQHFG 910
             +   ++             SKPIHP+SFP+H    ++T   +   +    S+P    F 
Sbjct: 169  IVPKSYSSTPRNFPSRRSFLSKPIHPLSFPEHALEGQETDSPVANAS---TSSPMPSEFK 225

Query: 909  A-----------------TQGGTLRWSSPSSSIDFTDVXXXXXXXXXXXXXXXEAF---Q 790
            A                 + G +  WS+ +SS+D TD+                     +
Sbjct: 226  AIGEIRPSGLMDYAYASGSHGESANWSA-ASSMDLTDLSERHDAERSGPLRSNNIMDRTR 284

Query: 789  CGLCDKFLSQRSPWSSRRIVRSGDMPVTGVLSCGHVFHADCLEHTTPKVHKHDPPCPICA 610
            C LC++ LS+RSPW SRRIVR+GD+PV GVL C HV+HA+CLE TTPK  KHDPPCP C 
Sbjct: 285  CDLCERLLSKRSPWGSRRIVRTGDLPVAGVLPCCHVYHAECLERTTPKGQKHDPPCPACD 344

Query: 609  RSEEINASSPLEQSVVFRLKKNGLPRLRPFQEDGSSRHWACGQVGDCVEGALHASPRNSM 430
            R     +    EQ  + RL +NG PRLR   E G SR W+C Q GDCV GA+     +S+
Sbjct: 345  RL----SGKDTEQWSICRL-RNGFPRLRSLGE-GPSRVWSCAQAGDCVAGAVQIPRASSI 398

Query: 429  MLLNRSRLKK-HLSSKGNSSKE 367
             LL+RS  K+ H +SKG S K+
Sbjct: 399  SLLSRSGHKRHHAASKGESGKD 420

