BLASTX nr result

ID: Catharanthus23_contig00007212 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00007212
         (1961 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006356629.1| PREDICTED: uncharacterized protein LOC102585...   416   e-113
ref|XP_004245248.1| PREDICTED: uncharacterized protein LOC101254...   416   e-113
ref|XP_002284674.1| PREDICTED: uncharacterized protein LOC100247...   395   e-107
ref|XP_002308629.2| hypothetical protein POPTR_0006s26160g [Popu...   390   e-105
ref|XP_002324258.2| hypothetical protein POPTR_0018s00980g [Popu...   375   e-101
ref|XP_006476169.1| PREDICTED: uncharacterized protein LOC102623...   364   7e-98
ref|XP_006450586.1| hypothetical protein CICLE_v10008522mg [Citr...   362   4e-97
gb|EMJ28917.1| hypothetical protein PRUPE_ppa006815mg [Prunus pe...   352   3e-94
ref|XP_002516598.1| conserved hypothetical protein [Ricinus comm...   346   2e-92
ref|XP_004291408.1| PREDICTED: uncharacterized protein LOC101296...   334   9e-89
gb|EXB60468.1| hypothetical protein L484_014922 [Morus notabilis]     332   4e-88
gb|ESW09089.1| hypothetical protein PHAVU_009G099100g [Phaseolus...   284   1e-73
gb|EOY26721.1| Pre-mRNA cleavage complex 2 protein Pcf11, putati...   276   2e-71
gb|EOY26719.1| Pre-mRNA cleavage complex 2 protein Pcf11, putati...   276   2e-71
gb|EOY26718.1| Pre-mRNA cleavage complex 2 protein Pcf11, putati...   276   2e-71
ref|XP_003603497.1| hypothetical protein MTR_3g108290 [Medicago ...   274   9e-71
ref|XP_004138874.1| PREDICTED: uncharacterized protein LOC101212...   272   4e-70
ref|XP_004160865.1| PREDICTED: uncharacterized protein LOC101229...   268   6e-69
ref|XP_006601110.1| PREDICTED: uncharacterized protein LOC100804...   267   1e-68
gb|EOY26722.1| Pre-mRNA cleavage complex 2 protein Pcf11, putati...   265   5e-68

>ref|XP_006356629.1| PREDICTED: uncharacterized protein LOC102585748 [Solanum tuberosum]
          Length = 407

 Score =  416 bits (1070), Expect = e-113
 Identities = 228/372 (61%), Positives = 272/372 (73%), Gaps = 10/372 (2%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQLTSDVISESYFHSD---NKHKNNSFFNIPGLFVGFNPKGSESD 577
            ML+KRTRSHQK   MG L SD IS+SYF SD    KHK+NSFFN+PG+FVG NPKGSESD
Sbjct: 1    MLKKRTRSHQKVHTMGHLMSDGISDSYFQSDVLVRKHKSNSFFNVPGVFVGLNPKGSESD 60

Query: 578  SVRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDVKQPGKVLR 757
            SVRSPTSPLDFRVFSNLGNPFRS  S   G +K W   KVGL I+D+LD ++KQ GKV R
Sbjct: 61   SVRSPTSPLDFRVFSNLGNPFRSSTSEGAGANKTWGCTKVGLGIVDSLDDEMKQSGKVFR 120

Query: 758  ASDSKNILFGPQMRIKA--LKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVL 931
            +SDSKNILFG QMRIK    +S +  DS E PKSLPKN+ IFP   +K S L+K SSDV+
Sbjct: 121  SSDSKNILFGTQMRIKTHDFQSCV-DDSLEEPKSLPKNISIFPHTLSKSSNLRKGSSDVV 179

Query: 932  FEIGDA--QCGLKPSFRPCSLDSTKSGSHLSRLAKDNSGSKSFVSENG-NDIVSSAVQIN 1102
            F IGDA  +  L  +FR CSLDS +S S  + LA   + + +F SEN  N +VS    + 
Sbjct: 180  FGIGDALSEHELSRNFRSCSLDSGRSSSRFASLA---NRTVAFGSENAINPVVSHTKCVR 236

Query: 1103 GGSKLSNSLDAEQHSALAS-IGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIYGD 1276
            G SKL N     + S + + +GS   L+G+IS S+IELSEDYTCVR  GPN KVTHI+ D
Sbjct: 237  GCSKLGNPAGGAKLSPIPTPVGSNTSLVGSISASDIELSEDYTCVRTRGPNAKVTHIFCD 296

Query: 1277 CILECHDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLDGEDIYMY 1456
            CILECH+NEL NF KN  + TVL   T+SS+++TS+PSSDFL FC SCKK+LDG+DIYMY
Sbjct: 297  CILECHNNELPNFCKNANEKTVLPEVTDSSEVLTSFPSSDFLRFCSSCKKRLDGKDIYMY 356

Query: 1457 RGEKAFCSWNCR 1492
            RGEKAFCS +CR
Sbjct: 357  RGEKAFCSLDCR 368


>ref|XP_004245248.1| PREDICTED: uncharacterized protein LOC101254717 [Solanum
            lycopersicum]
          Length = 406

 Score =  416 bits (1069), Expect = e-113
 Identities = 228/372 (61%), Positives = 271/372 (72%), Gaps = 10/372 (2%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQLTSDVISESYFHSD---NKHKNNSFFNIPGLFVGFNPKGSESD 577
            ML+KRTRSHQK Q MG L SD IS+SYF  D    KHKNNSFFN+PG+FVGFNPKGSESD
Sbjct: 1    MLKKRTRSHQKVQTMGHLMSDGISDSYFQPDVFVRKHKNNSFFNVPGVFVGFNPKGSESD 60

Query: 578  SVRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDVKQPGKVLR 757
            SVRSPTSPLDFRVFSNLGNPFRS  S   G +K W   KVGL I+D+LD ++K  GKV R
Sbjct: 61   SVRSPTSPLDFRVFSNLGNPFRSSTSEGAGANKTWGCTKVGLGIVDSLDDEMKHSGKVFR 120

Query: 758  ASDSKNILFGPQMRIKA--LKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVL 931
            +SDSKNILFG QMRIKA   +S +  DS E PKSLPKN+ IFP   +K S L+K SSDV+
Sbjct: 121  SSDSKNILFGTQMRIKAHDFQSCV-DDSLEEPKSLPKNISIFPHTLSKSSNLRKGSSDVV 179

Query: 932  FEIGDA--QCGLKPSFRPCSLDSTKSGSHLSRLAKDNSGSKSFVSENG-NDIVSSAVQIN 1102
            F IGDA  +     +FR CSLDS +S S  + LA   + + +  SEN  N +VS    + 
Sbjct: 180  FGIGDALSEHEYSRNFRSCSLDSGRSSSRFASLA---NRTVAVGSENAINPVVSQTKCVR 236

Query: 1103 GGSKLSNSLDAEQHSALAS-IGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIYGD 1276
            G SKL N     + S + + +GS   L+G+IS S+I+LSEDYTCVR  GPN KVTHI+ D
Sbjct: 237  GCSKLGNPAGGAKLSPIPTPVGSNTSLVGSISASDIQLSEDYTCVRTRGPNAKVTHIFCD 296

Query: 1277 CILECHDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLDGEDIYMY 1456
            CILECH+NEL NF KN  + TVL   T+SS+++TS+PSSDFL FC SCKKKLDG+DIYMY
Sbjct: 297  CILECHNNELPNFCKNANEKTVLPEVTDSSEVLTSFPSSDFLRFCSSCKKKLDGKDIYMY 356

Query: 1457 RGEKAFCSWNCR 1492
            RGEKAFCS +CR
Sbjct: 357  RGEKAFCSLDCR 368


>ref|XP_002284674.1| PREDICTED: uncharacterized protein LOC100247517 [Vitis vinifera]
          Length = 411

 Score =  395 bits (1016), Expect = e-107
 Identities = 229/376 (60%), Positives = 261/376 (69%), Gaps = 14/376 (3%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQLT-SDVISESYFHSD---NKHKNNSFFNIPGLFVGFNPKG-SE 571
            MLRKR+RS QKDQHMG  T +D +SE YF SD    KHK NSFF++PGLFVG N KG S+
Sbjct: 1    MLRKRSRSFQKDQHMGHPTMADAVSELYFQSDVMGQKHKGNSFFSVPGLFVGLNYKGLSD 60

Query: 572  SDSVRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDVKQPGKV 751
            SDSVRSPTSPLDFRVFSNLG+PFRSPRSS +G HK WD +KVGLSIID+LD   K  GKV
Sbjct: 61   SDSVRSPTSPLDFRVFSNLGSPFRSPRSSQDGQHKSWDCSKVGLSIIDSLDDGGKLSGKV 120

Query: 752  LRASDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVL 931
            L +S+SK ILFGPQMRIK   S  H + F+  KSLPKN   FP    K S  QK  SDV+
Sbjct: 121  LGSSESKTILFGPQMRIKTPNSPSHINFFDGSKSLPKNYASFPHTQIK-SRPQKRDSDVV 179

Query: 932  FEIGDAQCGLKPS----FRPCSLDSTKSGSHLSRLAK--DNSGSKSFVSENGNDIVSSAV 1093
            FEI +    L+P      R CSLDS++S S L+ L K   N  S +    N    VSS  
Sbjct: 180  FEIEETP--LEPEAFGRIRSCSLDSSRSFSSLTNLTKRQSNLSSGNLCPGNMTTQVSSPP 237

Query: 1094 QINGGS-KLSNSLDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHI 1267
            QI GG+    N L  + +S  AS+GSG GLIG++S SEIELSEDYTCV  HGPNPK THI
Sbjct: 238  QILGGNPNPDNFLPMKLNSIPASVGSGQGLIGSLSASEIELSEDYTCVISHGPNPKTTHI 297

Query: 1268 YGDCILECHDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKL-DGED 1444
            YGDCILECH N+LAN  KN+E         E S   T YPS+DFLS CYSCKKKL +G+D
Sbjct: 298  YGDCILECHSNDLANHNKNDEHKIGSPLIVECSDNSTPYPSNDFLSICYSCKKKLEEGKD 357

Query: 1445 IYMYRGEKAFCSWNCR 1492
            IYMYRGEKAFCS NCR
Sbjct: 358  IYMYRGEKAFCSLNCR 373


>ref|XP_002308629.2| hypothetical protein POPTR_0006s26160g [Populus trichocarpa]
            gi|550337113|gb|EEE92152.2| hypothetical protein
            POPTR_0006s26160g [Populus trichocarpa]
          Length = 411

 Score =  390 bits (1002), Expect = e-105
 Identities = 225/374 (60%), Positives = 259/374 (69%), Gaps = 12/374 (3%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQLT-SDVISESYFHSDN---KHKNNSFFNIPGLFVGFNPKG-SE 571
            MLRKRTRS QKDQ MGQLT SD  SES+F SDN    HK NSFF +PGLFVG + KG S+
Sbjct: 1    MLRKRTRSLQKDQQMGQLTMSDSGSESHFQSDNMGHNHKANSFFTVPGLFVGSSLKGLSD 60

Query: 572  SDSVRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDVKQPGKV 751
             DSVRSPTSPLDFR+FSN+GNP +SPRSSH G  K WD NKVGLSI+D+LD D K  GKV
Sbjct: 61   CDSVRSPTSPLDFRMFSNIGNPSKSPRSSHGGQRKSWDCNKVGLSIVDSLDDDGKGSGKV 120

Query: 752  LRASDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVL 931
            LR+S+SKNILFGP++R K       TDSF+APKSLP+N  IFP    K  +L K SSDVL
Sbjct: 121  LRSSESKNILFGPRVRSKTPNFQSRTDSFQAPKSLPRNFAIFPRTLTKSPLL-KGSSDVL 179

Query: 932  FEIGDAQCGLKP--SFRPCSLDSTKSGSHLSRLAKDNS--GSKSFVSENGNDIVSSAVQI 1099
            FEIG+     +P    R CSLDS +S S LSRLA  NS   S +F  +N         Q+
Sbjct: 180  FEIGEDPSDSEPFGKIRSCSLDSCRSFSSLSRLAGQNSKASSGNFCLDNVT-TRGECPQL 238

Query: 1100 NGGSKLSNSL-DAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIYG 1273
             GGS  SN+  +        S+ SGNG IG++S SEIELSEDYTCV  HGPNPK THIYG
Sbjct: 239  FGGSPNSNNFSNTNLTFTPMSVSSGNGFIGSLSASEIELSEDYTCVISHGPNPKTTHIYG 298

Query: 1274 DCILECHDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLD-GEDIY 1450
            DCILEC  N+L+NFGKN      L      SK+  S+PS  FLSFCY C KKLD G+DIY
Sbjct: 299  DCILECQSNDLSNFGKNEAKEIGLPQAVTCSKIPGSFPSEVFLSFCYYCNKKLDEGKDIY 358

Query: 1451 MYRGEKAFCSWNCR 1492
            +YRGEKAFCS +CR
Sbjct: 359  IYRGEKAFCSLSCR 372


>ref|XP_002324258.2| hypothetical protein POPTR_0018s00980g [Populus trichocarpa]
            gi|550317758|gb|EEF02823.2| hypothetical protein
            POPTR_0018s00980g [Populus trichocarpa]
          Length = 415

 Score =  375 bits (964), Expect = e-101
 Identities = 219/380 (57%), Positives = 255/380 (67%), Gaps = 18/380 (4%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQLT-SDVISESYFHSDNK----HKNNSFFNIPGLFVGFNPKG-S 568
            MLRKRTRS +KDQ  GQLT SD  SESYF  DN     HK NSFF +PGLFVG + KG S
Sbjct: 1    MLRKRTRSLKKDQQTGQLTMSDSGSESYFQPDNNMGHSHKANSFFTVPGLFVGLSHKGLS 60

Query: 569  ESDSVRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDV----- 733
            + DSVRSPTSPLD R+FSN+GNP +S RSSH G  K WD NKVGLSI+D+LD D      
Sbjct: 61   DCDSVRSPTSPLDSRMFSNIGNPHKSLRSSHGGQQKSWDCNKVGLSILDSLDDDDDDDDG 120

Query: 734  KQPGKVLRASDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQK 913
            K  GKVL++S+SKNILFGP++R K      HTD F+APKSLP+N  IFP    K S LQK
Sbjct: 121  KGYGKVLQSSESKNILFGPRVRSKTANFQSHTDPFQAPKSLPRNFAIFPRTLTK-SPLQK 179

Query: 914  ASSDVLFEIGDAQCGLKP--SFRPCSLDSTKSGSHLSRLAKDN--SGSKSFVSENGNDIV 1081
             SSDVLFEIG+     +     R CSLDS +S S +SRLA  N  + S +F   N    V
Sbjct: 180  DSSDVLFEIGEGPFESETFGRIRSCSLDSCRSFSSMSRLAGQNLKASSLNFSLHNITTQV 239

Query: 1082 SSAVQINGGSKLSNSLDAEQHSALA-SIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPK 1255
                Q+ GGS  +N+      +    S  SGNG I ++S SEIELSEDYTCV  HGPNPK
Sbjct: 240  DCPPQLLGGSSNTNNFSNTNLTYTPMSASSGNGFISSLSASEIELSEDYTCVISHGPNPK 299

Query: 1256 VTHIYGDCILECHDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLD 1435
             THIYG CILECH N+ +NFGKN E    L      SK+ +S+PS DFLSFCY C KKLD
Sbjct: 300  TTHIYGGCILECHSNDFSNFGKNKEKEIGLAQAATCSKIPSSFPSEDFLSFCYYCNKKLD 359

Query: 1436 -GEDIYMYRGEKAFCSWNCR 1492
             G+DIY+YRGEKAFCS +CR
Sbjct: 360  EGKDIYIYRGEKAFCSLSCR 379


>ref|XP_006476169.1| PREDICTED: uncharacterized protein LOC102623549 [Citrus sinensis]
          Length = 399

 Score =  364 bits (935), Expect = 7e-98
 Identities = 214/415 (51%), Positives = 270/415 (65%), Gaps = 10/415 (2%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQL-TSDVISESYFHSDNKHKNNSFFNIPGLFVGFNPKG-SESDS 580
            MLRKRTRS +K+Q M  L T + ++ES+F+S+N    NS FN+PGLFVG +PKG S++DS
Sbjct: 1    MLRKRTRSVEKEQQMSHLKTPESVAESFFNSENL-TGNSLFNVPGLFVGLSPKGLSDTDS 59

Query: 581  VRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDVKQPGKVLRA 760
            VRSPTSPLDFR FSNLGN FRSP+S+H   HK WDT+KVGLSIID+L +D+K   KVLR 
Sbjct: 60   VRSPTSPLDFRAFSNLGNSFRSPKSAHYEQHKSWDTSKVGLSIIDSLRNDMKPSSKVLR- 118

Query: 761  SDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVLFEI 940
            S+SKNI+FGPQMRIK   S  + +SF+APKSLPKN  IFPC   K S+LQK +SDV+ EI
Sbjct: 119  SESKNIIFGPQMRIKTPNSQTNINSFDAPKSLPKNYAIFPCTQIK-SLLQKGNSDVVLEI 177

Query: 941  GDAQCGLKPSF---RPCSLDSTKSGSHLSRLAKDNS--GSKSFVSENGNDIVSSAVQING 1105
            G+        F   R CSLDS +S   L+      S   S++F  E      SS + + G
Sbjct: 178  GETPFEEHEPFGKTRSCSLDSCRSFPALAGFTDCGSIMSSENFGFEKLACQESSPLMVGG 237

Query: 1106 GSKLSNSLDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIYGDCI 1282
              + +N LD++ +    SIGSGNG   ++S SEIELSEDYT V  HGPNP+ THIYGDCI
Sbjct: 238  SPRSNNFLDSKVNLMSTSIGSGNGFTESLSASEIELSEDYTRVVSHGPNPRTTHIYGDCI 297

Query: 1283 LECHDNELANFGKNNEDGT--VLLPTTESSKLITSYPSSDFLSFCYSCKKKLDGEDIYMY 1456
            LEC  N+ ++  KN  +G+  V++ TT+       YPS DFLSFC SC KKL+G+DIY+Y
Sbjct: 298  LECRTNDQSDDYKNEAEGSDGVMIITTQ-------YPSDDFLSFCCSCNKKLEGKDIYIY 350

Query: 1457 RGEKAFCSWNCRXXXXXXXXXXXXXXXXXXXXXXXXXXPSSNCEEISEPSLFIST 1621
            RGEKAFCS +CR                           S +C E+SE   FI+T
Sbjct: 351  RGEKAFCSADCR------AQEILIDEEMEKDINSESSPKSDDCGELSETCFFITT 399


>ref|XP_006450586.1| hypothetical protein CICLE_v10008522mg [Citrus clementina]
            gi|557553812|gb|ESR63826.1| hypothetical protein
            CICLE_v10008522mg [Citrus clementina]
          Length = 399

 Score =  362 bits (928), Expect = 4e-97
 Identities = 213/415 (51%), Positives = 269/415 (64%), Gaps = 10/415 (2%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQL-TSDVISESYFHSDNKHKNNSFFNIPGLFVGFNPKG-SESDS 580
            MLRKRTRS +K+Q M  L T + ++ES+F+S+N  K NS FN+PGLFVG +PKG S++DS
Sbjct: 1    MLRKRTRSVEKEQQMSHLKTPESVAESFFNSENL-KGNSLFNVPGLFVGLSPKGLSDTDS 59

Query: 581  VRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDVKQPGKVLRA 760
            VRSPTSPLDFR FSNLGN FRSP+S+H   HK WDT+KVGLSIID+L +D+K   KVLR 
Sbjct: 60   VRSPTSPLDFRAFSNLGNSFRSPKSAHYEQHKSWDTSKVGLSIIDSLRNDMKPSSKVLR- 118

Query: 761  SDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVLFEI 940
            S+SKNI+FGPQMRIK   S  + +SF+APKSLPKN  IFPC   K S+LQ  +SDV+ EI
Sbjct: 119  SESKNIIFGPQMRIKTPNSQTNINSFDAPKSLPKNYAIFPCTQIK-SLLQTGNSDVVLEI 177

Query: 941  GDAQCGLKPSF---RPCSLDSTKSGSHLSRLAKDNS--GSKSFVSENGNDIVSSAVQING 1105
            G+        F   R CSLDS +S   L+      S   S++F  E      SS + + G
Sbjct: 178  GETPFEEHEPFGKTRSCSLDSCRSFPVLAGFTDCGSIMSSENFGFEKLACQESSPLMVGG 237

Query: 1106 GSKLSNSLDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIYGDCI 1282
              + +N  D++ +    SIGSGNG   ++S SEIELSEDYT V  HGPNP+ THIYGDCI
Sbjct: 238  SPRSNNFSDSKVNLMSTSIGSGNGFTESLSASEIELSEDYTRVVSHGPNPRTTHIYGDCI 297

Query: 1283 LECHDNELANFGKNNEDGT--VLLPTTESSKLITSYPSSDFLSFCYSCKKKLDGEDIYMY 1456
            LEC  N+ ++  KN  +G+  V++ TT+       YPS DFLSFC SC KKL+G+DIY+Y
Sbjct: 298  LECRTNDQSDDYKNEAEGSDGVMIITTQ-------YPSDDFLSFCCSCNKKLEGKDIYIY 350

Query: 1457 RGEKAFCSWNCRXXXXXXXXXXXXXXXXXXXXXXXXXXPSSNCEEISEPSLFIST 1621
            RGEKAFCS +CR                           S +C E+SE   FI+T
Sbjct: 351  RGEKAFCSADCR------SQEILIDEEMEKDINSESSPKSDDCGELSETCFFITT 399


>gb|EMJ28917.1| hypothetical protein PRUPE_ppa006815mg [Prunus persica]
          Length = 394

 Score =  352 bits (904), Expect = 3e-94
 Identities = 207/372 (55%), Positives = 251/372 (67%), Gaps = 10/372 (2%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQH-MGQLT-SDVISESYFHSDNKHKNNSFFNIPGLFVGFNPKGS-ESD 577
            MLRKR+RS QKDQH MG L  +D  S+   H+    K+NSFF++PGLFVG + KG  +SD
Sbjct: 1    MLRKRSRSIQKDQHQMGHLPIADAGSDVLGHNP---KSNSFFSVPGLFVGLSSKGLIDSD 57

Query: 578  SVRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDVKQPGKVLR 757
            SVRSPTSPLDFRVFSNLGNPFRSPRS+ +G  + W ++KVGLSIID+ D DVK  GKV R
Sbjct: 58   SVRSPTSPLDFRVFSNLGNPFRSPRSNSDGQQRSWGSSKVGLSIIDSFDDDVKFSGKVPR 117

Query: 758  ASDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVLFE 937
            +S+SKNILFGP MRIK   S  +T+SF +PKSLPKN  +FP    K S L+K SSDVLFE
Sbjct: 118  SSESKNILFGPGMRIKTPDSQSNTNSFASPKSLPKNYAVFPHSKIK-SPLEKGSSDVLFE 176

Query: 938  IGDAQCGLKP--SFRPCSLDSTKSGSHLSRLAKDNSGSKSFVSENGNDIVSSAVQ---IN 1102
            IG++    +     R CSLDS ++ S LS L+  N  S S     GN  + S      I 
Sbjct: 177  IGESPTEPESFGKIRSCSLDSGRAFSTLSGLSNLNPNSTS-----GNFCMGSLTTQPFIG 231

Query: 1103 GGSKLSNSLDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIYGDC 1279
            G   L+  ++        SIGS NGL+G++S SEIELSEDYTCV  HG NPK THI+GDC
Sbjct: 232  GSPNLATQMNT------GSIGSSNGLVGSLSASEIELSEDYTCVISHGANPKKTHIFGDC 285

Query: 1280 ILECHDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKL-DGEDIYMY 1456
            IL CH N+L+NFGKN            S      YPS++FLSFCY C KKL +G+DIY+Y
Sbjct: 286  ILGCHSNDLSNFGKNEGKEIGFARPGTSLGNFVQYPSNNFLSFCYYCNKKLEEGKDIYIY 345

Query: 1457 RGEKAFCSWNCR 1492
            RGEKAFCS +CR
Sbjct: 346  RGEKAFCSLSCR 357


>ref|XP_002516598.1| conserved hypothetical protein [Ricinus communis]
            gi|223544418|gb|EEF45939.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 435

 Score =  346 bits (888), Expect = 2e-92
 Identities = 211/392 (53%), Positives = 256/392 (65%), Gaps = 22/392 (5%)
 Frame = +2

Query: 383  RRVCGC------GTMLRKRTRSHQKDQHMGQLT-SDVISESYFHSD---NKHKNNSFFNI 532
            +R CG       G MLRKRTRS QKDQ MG LT SD  S+    SD     HK  SFFN+
Sbjct: 13   KRGCGVPNRRFLGVMLRKRTRSLQKDQQMGPLTMSDSGSQFNSQSDCLGYNHKRTSFFNV 72

Query: 533  PGLFVGFNPKG-SESDSVRSPTSPLDFRVFSNLGNP-FRSPRSSHEGHHKIWDTNKVGLS 706
            PGLFVG +PKG S+ DSVRSPTSPLD R+FSNLGN  +RSPRSS  GH K WD +KVGLS
Sbjct: 73   PGLFVGLSPKGMSDCDSVRSPTSPLDLRLFSNLGNSSYRSPRSSQNGHQKSWDCSKVGLS 132

Query: 707  IIDTLDH---DVKQPGKVLRASDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIF 877
            I+++LD    D K  GKVLR+S+SKNILFG ++RIK     ++ +SFEAPKSLP+N  I 
Sbjct: 133  IVNSLDDEDDDTKVSGKVLRSSESKNILFGQKVRIKTPTFQVNANSFEAPKSLPRNFAIL 192

Query: 878  PCGNAKPSILQKASSDVLFEIGDAQCGLKP--SFRPCSLDSTKSGSHLSRLAKDNSG--S 1045
            P    K S LQK  S V+FEIG+A    +     R CSLDS KS S LSRLA  NS    
Sbjct: 193  PHSYTKSS-LQKGCSKVIFEIGEAPTEPEHFGKIRSCSLDSCKSFSTLSRLANRNSNVIC 251

Query: 1046 KSFVSENGNDIVSSAVQINGGSKLSNSLDAEQHSALASIGSGNGLIGTIS-SEIELSEDY 1222
             +F   N     SS +Q +GGS   ++        L   GS +G +G++S SEIELSEDY
Sbjct: 252  GNFPLNNVATGTSSPLQFSGGSPPQSNNSLHMDLNLPPAGSTSGFVGSLSASEIELSEDY 311

Query: 1223 TCVRIHGPNPKVTHIYGDCILECHDNELANFGKNNEDGTVLLPTTESSKLITS-YPSSDF 1399
            TCV  HGPN K THIYGDC+LEC+ NE    GK      + +P   +S +I S +PS+DF
Sbjct: 312  TCVISHGPNAKKTHIYGDCVLECYSNE----GKE-----IRMPQAITSSIIPSPFPSNDF 362

Query: 1400 LSFCYSCKKKLD-GEDIYMYRGEKAFCSWNCR 1492
            L+FCY C ++LD G+DIY+YRGEKAFCS +CR
Sbjct: 363  LNFCYYCNRRLDGGKDIYIYRGEKAFCSLSCR 394


>ref|XP_004291408.1| PREDICTED: uncharacterized protein LOC101296169 [Fragaria vesca
            subsp. vesca]
          Length = 403

 Score =  334 bits (856), Expect = 9e-89
 Identities = 200/375 (53%), Positives = 248/375 (66%), Gaps = 13/375 (3%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQL----TSDVISESYFHSD---NKHKNNSFFNIPGLFVGFNPKG 565
            MLRKRTRS QKDQ   Q+     S+  SES+F SD      K+N FF IPGLFVG  P G
Sbjct: 1    MLRKRTRSTQKDQDQHQMGHLPISNTGSESHFRSDVLGPNPKSNPFFTIPGLFVGLGPIG 60

Query: 566  -SESDSVRSPTSPLDFRVFSNLGNPFRSPRSSHEGHHKIWDTNKVGLSIIDTLDHDVKQP 742
             ++SDS+RSPTSPLDFRVFSNLG+PFRSPRS  +GH + W ++KVGLSIID+ D DVK  
Sbjct: 61   LTDSDSIRSPTSPLDFRVFSNLGSPFRSPRSPLDGHKRSWGSSKVGLSIIDSFDDDVKCS 120

Query: 743  GKVLRASDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASS 922
            GKV R+S+SKNILFGP MRIK   S  +T+S  +P+SLPKN  IFP    K S LQ++SS
Sbjct: 121  GKVPRSSESKNILFGPGMRIKTRDSRSNTNSIGSPRSLPKNYAIFPHSKVK-SPLQESSS 179

Query: 923  DVLFEIGDAQCGLKP--SFRPCSLDSTKSGSHLSRLAKDNSGS-KSFVSENGNDIVSSAV 1093
            DV+FEIG+     +     R CS DS ++ S LS L+K N  S ++F  EN  +      
Sbjct: 180  DVVFEIGETPSEPESFGKIRSCSFDSARTFSTLSGLSKLNPNSTRNFCLENVTN-----P 234

Query: 1094 QINGGSKLSNSLDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIY 1270
            Q  GGS  S +L       + S GSGN  +G++S SEIELSEDYTCV  HG NPK THI+
Sbjct: 235  QFIGGSPNSATL-----MNVGSTGSGNEFVGSLSASEIELSEDYTCVISHGANPKTTHIF 289

Query: 1271 GDCILECHDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKL-DGEDI 1447
            GDCIL CH  +L+   +N + G        S      YPS++FLSFC+ C K+L +G+DI
Sbjct: 290  GDCIL-CHSEDLSKSFENEKKGIGSPQLATSLGSFVQYPSNNFLSFCHYCNKELEEGKDI 348

Query: 1448 YMYRGEKAFCSWNCR 1492
            Y+YRGEKAFCS +CR
Sbjct: 349  YIYRGEKAFCSLSCR 363


>gb|EXB60468.1| hypothetical protein L484_014922 [Morus notabilis]
          Length = 431

 Score =  332 bits (851), Expect = 4e-88
 Identities = 208/393 (52%), Positives = 260/393 (66%), Gaps = 31/393 (7%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQH-MGQ--LTSDVISESYFHSD----NKHKNNSFFNIPGLFVGFNPKG 565
            MLRKRTRS QKDQH MG   +T+      +FHSD    N  K NSF    GL VG +PKG
Sbjct: 1    MLRKRTRSIQKDQHQMGHQPITNSGSESFFFHSDILNNNNPKRNSF---SGLLVGLSPKG 57

Query: 566  ----SESDSVRSPTSPLDFRVFSNLGNPF----RSPRSSHE-GHHKIWD-TNKVGL-SII 712
                ++ DSVRSPTSPLDF++FS+LGNPF    ++ RSSHE G  + W  + KVGL SII
Sbjct: 58   LATSTDCDSVRSPTSPLDFKLFSSLGNPFFRSSKATRSSHENGQQRSWGGSTKVGLISII 117

Query: 713  DTLDHDVKQPGKVLRASDSKNILFGPQMRIKALKS-LIHTDSFEAPKSLPKNVGIFPCGN 889
            D+LD D+K PGKVLR+S+SKNILFGP+ R+K   S   +T+SFE+PKSLPKN  IFP  +
Sbjct: 118  DSLDDDIKFPGKVLRSSESKNILFGPKFRVKTSTSGQANTNSFESPKSLPKNYAIFPHSS 177

Query: 890  AKPSILQKASSDVLFEIGDAQCGLKP-----SFRPCSLDSTKSGSHLSRLAKDNSGSKSF 1054
                 L+K SSDVLFEIG++   L+P       R CSLDS ++ S+        S S +F
Sbjct: 178  KTKPPLEKGSSDVLFEIGESP--LEPPDSLGQIRSCSLDSCRTMSN-----SPISTSMNF 230

Query: 1055 VSENG-NDIVSSAVQINGGSKLSNSLDAEQHSAL-ASIGSGNGLIGTIS-SEIELSEDYT 1225
              EN     VSS+ Q  GGS  SN +   + S +  S+GSGNG IG++S SEIELSEDYT
Sbjct: 231  CLENNVTTQVSSSPQFFGGSPNSNRISGTKLSTIPVSLGSGNGFIGSLSASEIELSEDYT 290

Query: 1226 CVRIHGPNPKVTHIYGDCILECHDNELANFGKNNEDGTVL---LPTTESSKLITSYPSSD 1396
            CV  HGPNPK THI+GDCILE    +L+NF    +D   +    P  +++++   YPS+ 
Sbjct: 291  CVISHGPNPKTTHIFGDCILETESCDLSNFAAKADDNKEIGFSQPIGKNTRISAPYPSNY 350

Query: 1397 FLSFCYSCKKKL-DGEDIYMYRGEKAFCSWNCR 1492
            FLSFCYSC KKL DG+DIY+YRGEKAFCS +CR
Sbjct: 351  FLSFCYSCNKKLEDGKDIYIYRGEKAFCSLSCR 383


>gb|ESW09089.1| hypothetical protein PHAVU_009G099100g [Phaseolus vulgaris]
          Length = 423

 Score =  284 bits (726), Expect = 1e-73
 Identities = 183/385 (47%), Positives = 229/385 (59%), Gaps = 23/385 (5%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQH-MGQLTS-DVISESYFHSD-----NKHKNNSFFNIPGLFVGFNPKG 565
            MLRKR RS QK+QH M  LT  +  SE Y  +      N  K +S FN+P L+VG  PKG
Sbjct: 1    MLRKRNRSMQKEQHHMSNLTQCEANSEHYSQTHHALGRNNIKGHSIFNVPCLYVGLGPKG 60

Query: 566  S-ESDSVRSPTSPLDFRVFSNLGNPFRSPRSS-HEGHHKIWDTNKVGLSIIDTLDHDVKQ 739
              +SDSVRSPTSPLD RV SNLGNP R PRSS HEGH + WD  KVGL I+++L+   + 
Sbjct: 61   LLDSDSVRSPTSPLDARVLSNLGNPVRKPRSSPHEGHPRSWDCCKVGLGIVESLEDCSRF 120

Query: 740  PGKVLRASDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQ--- 910
             GK+L++ +SK +   PQM IKA    IH D  E  KSLPK+    P G    S+     
Sbjct: 121  SGKILQSPESKRVSVSPQMMIKASNCQIHRDFLEGSKSLPKDFCKAPYGPKNRSVTTHKG 180

Query: 911  KASSDVLFEIGDAQCGLKPSF----RPCSLDSTKSGSHLS--RLAKDNSGSKSFVSENGN 1072
            ++ S VLFEIG++  GL+       R CSLDS      LS   ++  +S + SF  ++ N
Sbjct: 181  ESESTVLFEIGES--GLEHELFRRTRSCSLDSCSQLKKLSGLNISFSDSDTDSFAVKDVN 238

Query: 1073 DIVSSAVQINGGSKLSNSLDAEQ-HSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGP 1246
              +SS     GGS+ SN+    + ++   SI S N  I ++S SEIELSEDYTCV  +GP
Sbjct: 239  FQLSSPPHFIGGSQNSNTFPPTKFNTNTLSISSSNEFIKSLSASEIELSEDYTCVISYGP 298

Query: 1247 NPKVTHIYGDCILECHDNELANFGKNNEDGTV--LLPTTESSKLITSYPSSDFLSFCYSC 1420
            NPK THI+GDCILE H N      KN E      + P          YPSSDFLSFC+ C
Sbjct: 299  NPKTTHIFGDCILETHSNAFKIHYKNEEKEKEKGVNPVANRLGSPNPYPSSDFLSFCHHC 358

Query: 1421 KKKL-DGEDIYMYRGEKAFCSWNCR 1492
             KKL +G+DIY+Y GEKAFCS  CR
Sbjct: 359  NKKLEEGKDIYIYGGEKAFCSLTCR 383


>gb|EOY26721.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 5
            [Theobroma cacao]
          Length = 403

 Score =  276 bits (706), Expect = 2e-71
 Identities = 166/361 (45%), Positives = 222/361 (61%), Gaps = 14/361 (3%)
 Frame = +2

Query: 452  GQLTSDVISESYFHSDN---KHKNNSFFNIPGLFVGFNPKGS-ESDSVRSPTSPLDFRVF 619
            G + +D  SESYF SD    +H ++S FNIPG  VGF+ KGS +SD VRSPTSPLD RVF
Sbjct: 3    GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62

Query: 620  SNLGNPF--RSPRSSHE-GHHKIWDTNKVGLSIIDTLDHDVKQPGKVLRASDSKNILFGP 790
            +N  NPF  RSPRSS + G+ K WD +K+GL I++ L  ++K  G+ L +   KNI+FGP
Sbjct: 63   ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122

Query: 791  QMRIKALKSLIHTDSFEA----PKSLPKNVGIFPCGNAKPSILQKASSDVLFEIGDAQCG 958
            Q++ K   S  ++  F        SLP+N  I      +        S ++F  G+ +  
Sbjct: 123  QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNTNSGGSSLVF--GNEEVP 180

Query: 959  LKPSFRPCSLDSTKSGSHLSRLAKDNSGSKSFVSENGN-DIVSSAVQINGGSKLSNSLDA 1135
            L+P     S  S  S S ++     N  S+SF SENG   + SS++ I    ++ +SL +
Sbjct: 181  LEPK----SDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLS 236

Query: 1136 EQHSALASIGSGNGLIGTISS-EIELSEDYTCVRIHGPNPKVTHIYGDCILECHDNELAN 1312
            +  S    +G     IG++S+ EIELSEDYTC+  HGPNPK THI+GDCILECH+ EL N
Sbjct: 237  KPSSLPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTN 293

Query: 1313 FGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLD-GEDIYMYRGEKAFCSWNC 1489
            F K  E  T +    +S +  T YPS +FLSFCYSC+KKL+  EDIYMYRGEKAFCS++C
Sbjct: 294  FDKKAEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYMYRGEKAFCSFDC 353

Query: 1490 R 1492
            R
Sbjct: 354  R 354


>gb|EOY26719.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 3
            [Theobroma cacao]
          Length = 404

 Score =  276 bits (706), Expect = 2e-71
 Identities = 166/361 (45%), Positives = 222/361 (61%), Gaps = 14/361 (3%)
 Frame = +2

Query: 452  GQLTSDVISESYFHSDN---KHKNNSFFNIPGLFVGFNPKGS-ESDSVRSPTSPLDFRVF 619
            G + +D  SESYF SD    +H ++S FNIPG  VGF+ KGS +SD VRSPTSPLD RVF
Sbjct: 3    GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62

Query: 620  SNLGNPF--RSPRSSHE-GHHKIWDTNKVGLSIIDTLDHDVKQPGKVLRASDSKNILFGP 790
            +N  NPF  RSPRSS + G+ K WD +K+GL I++ L  ++K  G+ L +   KNI+FGP
Sbjct: 63   ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122

Query: 791  QMRIKALKSLIHTDSFEA----PKSLPKNVGIFPCGNAKPSILQKASSDVLFEIGDAQCG 958
            Q++ K   S  ++  F        SLP+N  I      +        S ++F  G+ +  
Sbjct: 123  QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNTNSGGSSLVF--GNEEVP 180

Query: 959  LKPSFRPCSLDSTKSGSHLSRLAKDNSGSKSFVSENGN-DIVSSAVQINGGSKLSNSLDA 1135
            L+P     S  S  S S ++     N  S+SF SENG   + SS++ I    ++ +SL +
Sbjct: 181  LEPK----SDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLS 236

Query: 1136 EQHSALASIGSGNGLIGTISS-EIELSEDYTCVRIHGPNPKVTHIYGDCILECHDNELAN 1312
            +  S    +G     IG++S+ EIELSEDYTC+  HGPNPK THI+GDCILECH+ EL N
Sbjct: 237  KPSSLPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTN 293

Query: 1313 FGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLD-GEDIYMYRGEKAFCSWNC 1489
            F K  E  T +    +S +  T YPS +FLSFCYSC+KKL+  EDIYMYRGEKAFCS++C
Sbjct: 294  FDKKAEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYMYRGEKAFCSFDC 353

Query: 1490 R 1492
            R
Sbjct: 354  R 354


>gb|EOY26718.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 2
            [Theobroma cacao]
          Length = 394

 Score =  276 bits (706), Expect = 2e-71
 Identities = 166/361 (45%), Positives = 222/361 (61%), Gaps = 14/361 (3%)
 Frame = +2

Query: 452  GQLTSDVISESYFHSDN---KHKNNSFFNIPGLFVGFNPKGS-ESDSVRSPTSPLDFRVF 619
            G + +D  SESYF SD    +H ++S FNIPG  VGF+ KGS +SD VRSPTSPLD RVF
Sbjct: 3    GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62

Query: 620  SNLGNPF--RSPRSSHE-GHHKIWDTNKVGLSIIDTLDHDVKQPGKVLRASDSKNILFGP 790
            +N  NPF  RSPRSS + G+ K WD +K+GL I++ L  ++K  G+ L +   KNI+FGP
Sbjct: 63   ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122

Query: 791  QMRIKALKSLIHTDSFEA----PKSLPKNVGIFPCGNAKPSILQKASSDVLFEIGDAQCG 958
            Q++ K   S  ++  F        SLP+N  I      +        S ++F  G+ +  
Sbjct: 123  QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNTNSGGSSLVF--GNEEVP 180

Query: 959  LKPSFRPCSLDSTKSGSHLSRLAKDNSGSKSFVSENGN-DIVSSAVQINGGSKLSNSLDA 1135
            L+P     S  S  S S ++     N  S+SF SENG   + SS++ I    ++ +SL +
Sbjct: 181  LEPK----SDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLS 236

Query: 1136 EQHSALASIGSGNGLIGTISS-EIELSEDYTCVRIHGPNPKVTHIYGDCILECHDNELAN 1312
            +  S    +G     IG++S+ EIELSEDYTC+  HGPNPK THI+GDCILECH+ EL N
Sbjct: 237  KPSSLPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTN 293

Query: 1313 FGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLD-GEDIYMYRGEKAFCSWNC 1489
            F K  E  T +    +S +  T YPS +FLSFCYSC+KKL+  EDIYMYRGEKAFCS++C
Sbjct: 294  FDKKAEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYMYRGEKAFCSFDC 353

Query: 1490 R 1492
            R
Sbjct: 354  R 354


>ref|XP_003603497.1| hypothetical protein MTR_3g108290 [Medicago truncatula]
            gi|355492545|gb|AES73748.1| hypothetical protein
            MTR_3g108290 [Medicago truncatula]
          Length = 424

 Score =  274 bits (701), Expect = 9e-71
 Identities = 193/387 (49%), Positives = 238/387 (61%), Gaps = 25/387 (6%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQH-MGQLT-SDVISESYFHSDNKHKN---NSFFNIPGLFVGFNPKGS- 568
            MLRKR+RS QKDQH MG LT SD  S+ Y  S    +N   N  FN+P LFVG  PKG  
Sbjct: 1    MLRKRSRSIQKDQHQMGHLTNSDTNSDHYAQSHALGRNIKGNPIFNVPCLFVGLGPKGLL 60

Query: 569  ESDSVRSPTSPLDFRVFSNLGNPFRSPRSSH-EGHHKIWDTNKVGLSIIDTLD--HDVKQ 739
            +SDSVRSPTSPLD RV SN GNP R+ RSS  EG+ + WD+ KVGLSI+++L+  +  + 
Sbjct: 61   DSDSVRSPTSPLDTRVLSNSGNPVRNLRSSLLEGNQRSWDSCKVGLSIVESLEDCNCSRF 120

Query: 740  PGKVLRASDSKNILFGPQMRIKALKSLIHTDSFEAP-KSLPKNVG-IFPCGNAKPSILQK 913
             GK+L++ DSK I   PQ  IK        DSFE+  KSLPK+ G + PC     S++QK
Sbjct: 121  CGKILQSLDSKGISLSPQSMIKTPICETCMDSFESSSKSLPKDFGKVVPCVE-DGSVIQK 179

Query: 914  AS--SDVLFEIGDAQCGLKPSF---RPCSLDSTKSGSHLSRLA--KDNSGSKSFVSENGN 1072
                S+VLFEIG+        F   R CSLDS KS      LA  K +S    F  ++  
Sbjct: 180  GECESNVLFEIGETSLEHDEPFGRTRSCSLDSCKSMKADFGLATSKTDSDIDDFAMKDVT 239

Query: 1073 DIVSSAVQINGGSKLSNS-LDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGP 1246
              VSS+    GGS+ SN+ + AE  S   SI S + ++ ++S SEIELSEDYTCV  HGP
Sbjct: 240  VQVSSSPHFIGGSQNSNAFIPAESKSNTLSICSSSEILKSLSASEIELSEDYTCVISHGP 299

Query: 1247 NPKVTHIYGDCILECH-DNELANFGKNNE---DGTVLLPTTESSKLITSYPSSDFLSFCY 1414
            NPK THI+GD ILE H D  + N  KN E   +  V L   + S+    YPSS FLSFC+
Sbjct: 300  NPKTTHIFGDYILETHPDLSIKNHFKNEENEKEKGVTLMGNKLSQTPNQYPSSAFLSFCH 359

Query: 1415 SCKKKLD-GEDIYMYRGEKAFCSWNCR 1492
             C KKLD G+DIY+YRGEKAFCS  CR
Sbjct: 360  HCDKKLDEGKDIYIYRGEKAFCSLTCR 386


>ref|XP_004138874.1| PREDICTED: uncharacterized protein LOC101212300 [Cucumis sativus]
          Length = 399

 Score =  272 bits (695), Expect = 4e-70
 Identities = 171/368 (46%), Positives = 218/368 (59%), Gaps = 6/368 (1%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQLTSDVISESYFHSDNKHKNNSFFNIPGLFVGFNPKGSESDSVR 586
            MLRKRTRS QKDQ+     +   S S  H+    K +S F    LF G +PKG ESDS +
Sbjct: 1    MLRKRTRSVQKDQYRMNQMNVPCSGSELHT----KCSSIFKRSHLFTGLSPKGLESDSAK 56

Query: 587  SPTSPLDFRVFSNLGNPFRSPRSS-HEGHHKIWDTNKVGLSIIDTLDHD-VKQPGKVLRA 760
            SPTSPLDF V S+LGNP RSPRSS +EGH K WD++KVGLSIID+L++D  K  GKVLR+
Sbjct: 57   SPTSPLDFWVLSSLGNPLRSPRSSSNEGHRKNWDSSKVGLSIIDSLNNDDSKLFGKVLRS 116

Query: 761  SDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVLFEI 940
            SDSK  LFGP+   K        +  + PKSLPKN  IF     K   +++ +SDV+FEI
Sbjct: 117  SDSKTALFGPRSVAKKSNCPPQANLIQGPKSLPKNYAIFQVPKTKTP-MEQGNSDVIFEI 175

Query: 941  GDAQCGLKPSFRPC-SLDSTKSGSHLSRLAKDNSGSKSFVSENG-NDIVSSAVQINGGSK 1114
            G+     +P      S DS ++ +  S +   +  S S  +E+  +  +    +++    
Sbjct: 176  GETPLECEPFGNYSRSFDSYRAFAPRSVINGHSVSSSSTTTESAASPCLGEEPRVSEKYP 235

Query: 1115 LSNSLDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIYGDCILEC 1291
            L+        S    +   NG    +S SEIELSEDYTCV  HGPNPK THI+GDCIL C
Sbjct: 236  LTKPC-----STSLGLSCDNGSNKPLSASEIELSEDYTCVISHGPNPKTTHIFGDCILGC 290

Query: 1292 HDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLD-GEDIYMYRGEK 1468
            H N L++  +N           +S    TSY  +DFLS CYSC KKLD G+DIY+YRGEK
Sbjct: 291  HSNYLSSSSENEMKEMEFPRPLKSLNTSTSYSLTDFLSMCYSCHKKLDEGKDIYIYRGEK 350

Query: 1469 AFCSWNCR 1492
            AFCS  CR
Sbjct: 351  AFCSLTCR 358


>ref|XP_004160865.1| PREDICTED: uncharacterized protein LOC101229906 [Cucumis sativus]
          Length = 399

 Score =  268 bits (685), Expect = 6e-69
 Identities = 170/368 (46%), Positives = 217/368 (58%), Gaps = 6/368 (1%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQHMGQLTSDVISESYFHSDNKHKNNSFFNIPGLFVGFNPKGSESDSVR 586
            MLRKRTRS QKDQ+     +   S S  H+    K +S F    LF G +PKG ESDS +
Sbjct: 1    MLRKRTRSVQKDQYRMNQMNVPCSGSELHT----KCSSIFKRSHLFTGLSPKGLESDSAK 56

Query: 587  SPTSPLDFRVFSNLGNPFRSPRSS-HEGHHKIWDTNKVGLSIIDTLDHD-VKQPGKVLRA 760
            SPTSPLDF V S+LGNP RSPRSS +EGH K WD++KVGLSIID+L++D  K  GKVLR+
Sbjct: 57   SPTSPLDFWVLSSLGNPLRSPRSSSNEGHRKNWDSSKVGLSIIDSLNNDDSKLFGKVLRS 116

Query: 761  SDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVLFEI 940
            SDSK  LFGP+   K        +  + PKSLPKN  IF     K   +++ +SDV+FEI
Sbjct: 117  SDSKTALFGPRSVAKKSNCPPQANLIQGPKSLPKNYAIFQVPKTKTP-MEQGNSDVIFEI 175

Query: 941  GDAQCGLKPSFRPC-SLDSTKSGSHLSRLAKDNSGSKSFVSENG-NDIVSSAVQINGGSK 1114
            G+     +P      S DS ++ +  S +   +  S S  +E+  +  +    +++    
Sbjct: 176  GETPLECEPFGNYSRSFDSYRAFAPRSVINGHSVSSSSTTTESAASPCLGEEPRVSEKYP 235

Query: 1115 LSNSLDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTHIYGDCILEC 1291
            L+        S    +   NG    +S SEIELSEDYTCV  HG NPK THI+GDCIL C
Sbjct: 236  LTKPC-----STSLGLSCDNGSNKPLSASEIELSEDYTCVISHGLNPKTTHIFGDCILGC 290

Query: 1292 HDNELANFGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLD-GEDIYMYRGEK 1468
            H N L++  +N           +S    TSY  +DFLS CYSC KKLD G+DIY+YRGEK
Sbjct: 291  HSNYLSSSSENEMKEMEFPRPLKSLNTSTSYSLTDFLSMCYSCHKKLDEGKDIYIYRGEK 350

Query: 1469 AFCSWNCR 1492
            AFCS  CR
Sbjct: 351  AFCSLTCR 358


>ref|XP_006601110.1| PREDICTED: uncharacterized protein LOC100804101 [Glycine max]
          Length = 399

 Score =  267 bits (682), Expect = 1e-68
 Identities = 185/378 (48%), Positives = 225/378 (59%), Gaps = 16/378 (4%)
 Frame = +2

Query: 407  MLRKRTRSHQKDQH-MGQLT-SDVISESYFHSDNKHKNNSFFNIPGLFVGFNPKGS-ESD 577
            MLRKRTRS QKDQH  GQ+  SD  SES+    N  K+NS FN P LFVG   KG  +SD
Sbjct: 1    MLRKRTRSIQKDQHHTGQMAISDTNSESHALGSNG-KSNSIFNSPLLFVGMGHKGLLDSD 59

Query: 578  SVRSPTSPLDFRVFSNLGNPFRSPRS-SHEGHHKIWDTNKVGLSIIDTLDHDVKQPGKVL 754
            SV+SPTSPLDF   SNL NPFR+P S S+EG H+ W+  KVGLSIID+L+   K  GK+L
Sbjct: 60   SVKSPTSPLDFGFLSNLSNPFRTPSSLSNEGQHRSWNCAKVGLSIIDSLEECSKFSGKIL 119

Query: 755  RASDSKNILFGPQMRIKALKSLIHTDSFEAPKSLPKNVGIFPCGNAKPSILQKASSDVLF 934
            +AS+SK     P M  KA K   + DS +A KSLPK+     C     SI  K  S VL 
Sbjct: 120  QASESKKTSLCPPMITKAPKCKSYMDSAQASKSLPKDFCKITC-TQNGSIFPKGESTVLS 178

Query: 935  EIGDAQC-----GLKPSFRPCSLDSTKSGSHLSRLAKDN--SGSKSFVSENGNDIVSSAV 1093
            EIG+A       G   SF   SLDS     +LS L   +  S S++F  +     + S  
Sbjct: 179  EIGEAPLEYESFGKTVSF---SLDSCSPIRNLSGLTGSDFDSDSENFALKQ----MCSPP 231

Query: 1094 QINGGSKLSNS--LDAEQHSALASIGSGNGLIGTIS-SEIELSEDYTCVRIHGPNPKVTH 1264
               GGS+ +    L +E HS   +  S N  I ++S SEIELSEDYTCV  HG NPK TH
Sbjct: 232  HFIGGSQNNTKFLLPSEVHSNPVAAVSSNEFIESLSASEIELSEDYTCVISHGSNPKTTH 291

Query: 1265 IYGDCILECHDNELANFGKNNEDGTVL-LPTTESSKLITSYPSSDFLSFCYSCKKKL-DG 1438
            I+ DCILE H N+     K  E+GT L L +       + YPS DFLS C+ C KKL DG
Sbjct: 292  IFCDCILESHVNDSERHYKAEEEGTGLPLFSVNILHTPSQYPSHDFLSVCHHCNKKLEDG 351

Query: 1439 EDIYMYRGEKAFCSWNCR 1492
            +DIY+YRGEK+FCS +CR
Sbjct: 352  KDIYIYRGEKSFCSLSCR 369


>gb|EOY26722.1| Pre-mRNA cleavage complex 2 protein Pcf11, putative isoform 6
            [Theobroma cacao]
          Length = 401

 Score =  265 bits (677), Expect = 5e-68
 Identities = 163/361 (45%), Positives = 220/361 (60%), Gaps = 14/361 (3%)
 Frame = +2

Query: 452  GQLTSDVISESYFHSDN---KHKNNSFFNIPGLFVGFNPKGS-ESDSVRSPTSPLDFRVF 619
            G + +D  SESYF SD    +H ++S FNIPG  VGF+ KGS +SD VRSPTSPLD RVF
Sbjct: 3    GNVMADPDSESYFQSDTLGLRHISSSLFNIPGFLVGFSTKGSSDSDMVRSPTSPLDLRVF 62

Query: 620  SNLGNPF--RSPRSSHE-GHHKIWDTNKVGLSIIDTLDHDVKQPGKVLRASDSKNILFGP 790
            +N  NPF  RSPRSS + G+ K WD +K+GL I++ L  ++K  G+ L +   KNI+FGP
Sbjct: 63   ANFSNPFSVRSPRSSSQSGYQKKWDCSKMGLGIVNLLADEIKSDGEDLDSPKRKNIIFGP 122

Query: 791  QMRIKALKSLIHTDSFEA----PKSLPKNVGIFPCGNAKPSILQKASSDVLFEIGDAQCG 958
            Q++ K   S  ++  F        SLP+N  I      +        S ++F  G+ +  
Sbjct: 123  QVKTKFPSSSRYSHEFLGNSMKSNSLPRNYIISQLSKDRKPNTNSGGSSLVF--GNEEVP 180

Query: 959  LKPSFRPCSLDSTKSGSHLSRLAKDNSGSKSFVSENGN-DIVSSAVQINGGSKLSNSLDA 1135
            L+P     S  S  S S ++     N  S+SF SENG   + SS++ I    ++ +SL +
Sbjct: 181  LEPK----SDSSRLSPSFIASTKNCNLSSRSFCSENGTTSLNSSSLPIGRALQVDDSLLS 236

Query: 1136 EQHSALASIGSGNGLIGTISS-EIELSEDYTCVRIHGPNPKVTHIYGDCILECHDNELAN 1312
            +  S    +G     IG++S+ EIELSEDYTC+  HGPNPK THI+GDCILECH+ EL N
Sbjct: 237  KPSSLPIPVGHS---IGSLSAHEIELSEDYTCIISHGPNPKTTHIFGDCILECHNTELTN 293

Query: 1313 FGKNNEDGTVLLPTTESSKLITSYPSSDFLSFCYSCKKKLD-GEDIYMYRGEKAFCSWNC 1489
            F K  E  T +    +S +  T YPS +FLSFCYSC+KKL+  EDIY+  GEKAFCS++C
Sbjct: 294  FDKKAEPETKVSQLDKSPETSTPYPSDEFLSFCYSCQKKLEKDEDIYI--GEKAFCSFDC 351

Query: 1490 R 1492
            R
Sbjct: 352  R 352

