BLASTX nr result

ID: Catharanthus23_contig00005207 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005207
         (1140 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006338117.1| PREDICTED: serine/arginine repetitive matrix...   187   7e-45
ref|XP_006342376.1| PREDICTED: uncharacterized protein DDB_G0271...   181   6e-43
ref|XP_004239320.1| PREDICTED: uncharacterized protein LOC101263...   173   1e-40
ref|XP_004239319.1| PREDICTED: uncharacterized protein LOC101263...   165   3e-38
ref|XP_004243710.1| PREDICTED: uncharacterized protein LOC101253...   161   4e-37
ref|XP_004239321.1| PREDICTED: uncharacterized protein LOC101263...   161   5e-37
ref|XP_002278077.1| PREDICTED: uncharacterized protein LOC100264...   155   3e-35
emb|CAN82449.1| hypothetical protein VITISV_006434 [Vitis vinifera]   154   5e-35
gb|EOY30257.1| Damaged dna-binding 2, putative isoform 1 [Theobr...   144   9e-32
ref|XP_006474735.1| PREDICTED: uncharacterized protein LOC102616...   139   2e-30
gb|EOY30258.1| Damaged dna-binding 2, putative isoform 2 [Theobr...   135   3e-29
ref|XP_002281841.1| PREDICTED: uncharacterized protein LOC100260...   130   8e-28
ref|XP_004141345.1| PREDICTED: uncharacterized protein LOC101215...   128   4e-27
gb|EXB38953.1| hypothetical protein L484_027388 [Morus notabilis]     128   5e-27
gb|EOY30055.1| Damaged dna-binding 2, putative isoform 1 [Theobr...   126   2e-26
ref|XP_002308520.1| MTD1 family protein [Populus trichocarpa] gi...   125   2e-26
ref|XP_002516147.1| conserved hypothetical protein [Ricinus comm...   125   3e-26
ref|XP_002324201.2| MTD1 family protein [Populus trichocarpa] gi...   122   3e-25
gb|EMJ03691.1| hypothetical protein PRUPE_ppa010604mg [Prunus pe...   117   9e-24
gb|AGV54556.1| hypothetical protein [Phaseolus vulgaris] gi|5610...   115   4e-23

>ref|XP_006338117.1| PREDICTED: serine/arginine repetitive matrix protein 2-like [Solanum
            tuberosum]
          Length = 257

 Score =  187 bits (475), Expect = 7e-45
 Identities = 126/273 (46%), Positives = 150/273 (54%), Gaps = 1/273 (0%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGNGDNSSEFG-GADRLKV 382
            MSI L RNNS+          HRI  SGF  + MT   IY      NS EFG G  R + 
Sbjct: 1    MSIVLGRNNSSD---------HRIEPSGFATHGMTSIPIY------NSPEFGIGDPRDQE 45

Query: 383  AEEVRVGDXXXXXXXIGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLP 562
             +             IGRNSD +S A RSS  G  EEVQS FK GG  LDNLE LEEVLP
Sbjct: 46   DDRSSSFSSSSSSSSIGRNSD-DSPAGRSSSDGDGEEVQSSFKPGG--LDNLETLEEVLP 102

Query: 563  MKRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIR 742
            +KR               LADAAS SS+K+IVKPE+AYTRKRKNLLA++NF+ KN   + 
Sbjct: 103  IKRSISKFYAGKSKSFTSLADAASISSVKEIVKPEDAYTRKRKNLLAHNNFFGKNRSYLP 162

Query: 743  RSSSGAISKRHSNSRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVRRSPNND 922
               +G I KR  NSRS+  LAA++ C++           P              R   N+
Sbjct: 163  GMGNGGIYKRPINSRSSSALAASVSCSDSNYSSKSLNSSPSSPCLSRPPLPPQTRRYRNE 222

Query: 923  SISSPPEQMFSPWRSFSLSDLQGAAASPNISGI 1021
            S  SPPEQ  + WRSFSLSDLQGAAA+P++ GI
Sbjct: 223  SSLSPPEQKLNAWRSFSLSDLQGAAATPSLMGI 255


>ref|XP_006342376.1| PREDICTED: uncharacterized protein DDB_G0271670-like [Solanum
            tuberosum]
          Length = 264

 Score =  181 bits (458), Expect = 6e-43
 Identities = 128/280 (45%), Positives = 153/280 (54%), Gaps = 8/280 (2%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLKVA 385
            MSIA ERN++           H+I R GF  + M    IY      NS + G  +R+  A
Sbjct: 1    MSIAFERNSTPD---------HQIERPGF-VHGMDFVPIY------NSPDLGVGERMVQA 44

Query: 386  EEVRVGDXXXXXXX--IGRNSDTESSAE-RSSDGGVN----EEVQSKFKGGGGALDNLEA 544
            ++    D         IGRNSD    A   SSDGG      EEVQS FK G  ALDNLE+
Sbjct: 45   KQEDEDDRTSSSSSSSIGRNSDDSPPAGGSSSDGGRGDGDGEEVQSPFKPG--ALDNLES 102

Query: 545  LEEVLPMKRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDK 724
            LEEVLP+KR               LADA SCSS+KD+VKPENAYTRKRKNLLA+SNF+DK
Sbjct: 103  LEEVLPIKRGISSFYAGKSKSYTSLADAVSCSSLKDMVKPENAYTRKRKNLLAHSNFFDK 162

Query: 725  NHITIRRSSSGAISKRHSNSRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVR 904
            N     R++SG + KR  NSRS+L L A   C+E                C         
Sbjct: 163  NRNHFPRNNSGGLYKRPINSRSSLALGAISSCSESNNSSESLNSNASSPRCSLPPLPPQS 222

Query: 905  RSPNNDSISSPPEQMFSPWRSFSLSDLQGAAA-SPNISGI 1021
            R  + +  SSPPEQ  SPWRSFSLSDLQGAAA +P++ GI
Sbjct: 223  RRYSIEPSSSPPEQKLSPWRSFSLSDLQGAAAGTPSLMGI 262


>ref|XP_004239320.1| PREDICTED: uncharacterized protein LOC101263316 isoform 2 [Solanum
            lycopersicum]
          Length = 252

 Score =  173 bits (439), Expect = 1e-40
 Identities = 124/273 (45%), Positives = 151/273 (55%), Gaps = 1/273 (0%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLKVA 385
            MSI L RNNS+          HRI  SGF A++MT   IY      NS EFG  D     
Sbjct: 1    MSIVLGRNNSSD---------HRIKPSGFTAHEMTSMPIY------NSPEFGIGDPRD-- 43

Query: 386  EEVRVGDXXXXXXXIGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPM 565
            +             IGRNSD +S A RSS  G  EEVQS FK GG  LDNL  LEEVLP+
Sbjct: 44   DRSSSFSSSSSSSSIGRNSD-DSPAGRSSSDGDGEEVQSSFKPGG--LDNLGTLEEVLPI 100

Query: 566  KRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRR 745
            KR               LADAAS SS+K+IVKPE+AYTRKRKNLLA++NF+ KN   +  
Sbjct: 101  KRSISKFYAGKSKSFTSLADAASISSVKEIVKPEDAYTRKRKNLLAHNNFFGKNRSYLPG 160

Query: 746  SSSGAISKRHSNSRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDS 925
              +  I KR  NSRS+  LAA++ C++           P   +       Q RR  N  S
Sbjct: 161  MGNRGIYKRPINSRSSSALAASVSCSDSSESLNSSPSSP--CLTRPPLPPQTRRYRNESS 218

Query: 926  ISSPPEQMFSPWRSFSLSDLQG-AAASPNISGI 1021
            + SPP++  + WRSFSLSDLQG AAA+P++ GI
Sbjct: 219  L-SPPDRKLNAWRSFSLSDLQGAAAAAPSLMGI 250


>ref|XP_004239319.1| PREDICTED: uncharacterized protein LOC101263316 isoform 1 [Solanum
            lycopersicum]
          Length = 262

 Score =  165 bits (418), Expect = 3e-38
 Identities = 124/283 (43%), Positives = 151/283 (53%), Gaps = 11/283 (3%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLKVA 385
            MSI L RNNS+          HRI  SGF A++MT   IY      NS EFG  D     
Sbjct: 1    MSIVLGRNNSSD---------HRIKPSGFTAHEMTSMPIY------NSPEFGIGDPRD-- 43

Query: 386  EEVRVGDXXXXXXXIGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPM 565
            +             IGRNSD +S A RSS  G  EEVQS FK GG  LDNL  LEEVLP+
Sbjct: 44   DRSSSFSSSSSSSSIGRNSD-DSPAGRSSSDGDGEEVQSSFKPGG--LDNLGTLEEVLPI 100

Query: 566  K----------RXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNF 715
            K          R               LADAAS SS+K+IVKPE+AYTRKRKNLLA++NF
Sbjct: 101  KYVDFSLLFVRRSISKFYAGKSKSFTSLADAASISSVKEIVKPEDAYTRKRKNLLAHNNF 160

Query: 716  WDKNHITIRRSSSGAISKRHSNSRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXX 895
            + KN   +    +  I KR  NSRS+  LAA++ C++           P   +       
Sbjct: 161  FGKNRSYLPGMGNRGIYKRPINSRSSSALAASVSCSDSSESLNSSPSSP--CLTRPPLPP 218

Query: 896  QVRRSPNNDSISSPPEQMFSPWRSFSLSDLQG-AAASPNISGI 1021
            Q RR  N  S+ SPP++  + WRSFSLSDLQG AAA+P++ GI
Sbjct: 219  QTRRYRNESSL-SPPDRKLNAWRSFSLSDLQGAAAAAPSLMGI 260


>ref|XP_004243710.1| PREDICTED: uncharacterized protein LOC101253102 [Solanum
            lycopersicum]
          Length = 264

 Score =  161 bits (408), Expect = 4e-37
 Identities = 121/281 (43%), Positives = 148/281 (52%), Gaps = 9/281 (3%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGS---GNGDNSSEFGGADRL 376
            MSIA ERN++           H+I R GF  + MT   IY S   G G++  +    D  
Sbjct: 1    MSIAFERNSTPD---------HQIERPGF-MHGMTFVPIYNSPDLGVGESMVQVKRED-- 48

Query: 377  KVAEEVRVGDXXXXXXXIGRNSDTESSAERSSDGGV-----NEEVQSKFKGGGGALDNLE 541
               E+ R          IGRNSD    A  SS  G       EEVQS FK G  ALDNLE
Sbjct: 49   ---EDDRTSSSSSSS--IGRNSDDSPLAGGSSSNGCPGEGDGEEVQSPFKPG--ALDNLE 101

Query: 542  ALEEVLPMKRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWD 721
            +LEEVLP+KR               LADA SCSS+KD+VK ENAY+RKRKNLLA+SNF+ 
Sbjct: 102  SLEEVLPIKRGISSFYAGKSKSYTSLADAVSCSSLKDMVKAENAYSRKRKNLLAHSNFFG 161

Query: 722  KNHITIRRSSSGAISKRHSNSRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQV 901
            KN     R++S  + KR  +SRS+L L AT  C+E                         
Sbjct: 162  KNRNHFPRNNSCGLYKRPISSRSSLALGATSSCSESNNSSESLNSNASSPHFSLPPLPPQ 221

Query: 902  RRSPNNDSISSPPEQMFSPWRSFSLSDLQGAAA-SPNISGI 1021
             R  + +  SSPP+Q  SPWRSFSLSDLQGAAA +P++ GI
Sbjct: 222  PRRYSIEPSSSPPDQKLSPWRSFSLSDLQGAAAGTPSLMGI 262


>ref|XP_004239321.1| PREDICTED: uncharacterized protein LOC101263316 isoform 3 [Solanum
            lycopersicum]
          Length = 225

 Score =  161 bits (407), Expect = 5e-37
 Identities = 120/273 (43%), Positives = 143/273 (52%), Gaps = 1/273 (0%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLKVA 385
            MSI L RNNS+          HRI  SGF A++MT   IY      NS EFG  D     
Sbjct: 1    MSIVLGRNNSSD---------HRIKPSGFTAHEMTSMPIY------NSPEFGIGDPRD-- 43

Query: 386  EEVRVGDXXXXXXXIGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPM 565
            +             IGRNSD +S A RSS  G  EEVQS FK GG  LDNL  LEEVLP+
Sbjct: 44   DRSSSFSSSSSSSSIGRNSD-DSPAGRSSSDGDGEEVQSSFKPGG--LDNLGTLEEVLPI 100

Query: 566  KRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRR 745
            KR               LADAAS SS+K+IVKPE+AYTRKRKNLLA++NF+ KN   +  
Sbjct: 101  KRSISKFYAGKSKSFTSLADAASISSVKEIVKPEDAYTRKRKNLLAHNNFFGKNRSYLPG 160

Query: 746  SSSGAISKRHSNSRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDS 925
              +  I KR  NSRS+  LAA+                               R   N+S
Sbjct: 161  MGNRGIYKRPINSRSSSALAAS------------------------------TRRYRNES 190

Query: 926  ISSPPEQMFSPWRSFSLSDLQG-AAASPNISGI 1021
              SPP++  + WRSFSLSDLQG AAA+P++ GI
Sbjct: 191  SLSPPDRKLNAWRSFSLSDLQGAAAAAPSLMGI 223


>ref|XP_002278077.1| PREDICTED: uncharacterized protein LOC100264608 [Vitis vinifera]
          Length = 275

 Score =  155 bits (392), Expect = 3e-35
 Identities = 110/264 (41%), Positives = 139/264 (52%), Gaps = 8/264 (3%)
 Frame = +2

Query: 275  IGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLKVA-EEVRVGDXXXXXXXIGRNSDTE 451
            I RSGF  + M+C SI+ S     +  F    R     EE   G        IGRNSD  
Sbjct: 13   IERSGF-VHGMSCISIFDS---PEAGVFSSDRRFPSGVEEREEGLDSCSSSSIGRNSDAS 68

Query: 452  SSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXXXXXXXLADAA 631
              +    D G   EVQS +KG    L+ ++ALE+VL +K+               LAD +
Sbjct: 69   GGSSEGEDSG-ETEVQSSYKG---PLETMDALEDVLVVKKSISKFYNGKSKSFTSLADVS 124

Query: 632  SCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRH-SNSRSTLTLAA 808
            + SS+KD+ KPENAY +KRKNLLAYSNFWDKN     RS++G ISKR   +SRSTL LA 
Sbjct: 125  ASSSVKDLAKPENAYAKKRKNLLAYSNFWDKNRNCPWRSNAGGISKRPLISSRSTLALAV 184

Query: 809  TMGCAE----XXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDSISSPP-EQMFSPWRSFS 973
            TM  +E                           Q ++S NN   SSPP +Q F PWRSFS
Sbjct: 185  TMSSSESGNYCDDSNCSSNLSSSHSPSLPPLHPQAKKSSNNAPSSSPPSQQKFPPWRSFS 244

Query: 974  LSDLQGA-AASPNISGITVNHRQE 1042
            LSDLQG  AA+P I+G+  N+ +E
Sbjct: 245  LSDLQGMDAATPGITGLAGNNNRE 268


>emb|CAN82449.1| hypothetical protein VITISV_006434 [Vitis vinifera]
          Length = 275

 Score =  154 bits (390), Expect = 5e-35
 Identities = 111/264 (42%), Positives = 140/264 (53%), Gaps = 8/264 (3%)
 Frame = +2

Query: 275  IGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLKVA-EEVRVGDXXXXXXXIGRNSDTE 451
            I RSGF  + M+C SI+ S     +  F    R     EE   G        IGRNSD  
Sbjct: 13   IERSGF-VHGMSCISIFDS---PEAGVFXXDRRFPSGVEEREEGLDSCSSSSIGRNSDAS 68

Query: 452  SSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXXXXXXXLADAA 631
              +    D G   EVQS +KG    L+ ++ALE+VL +K+               LAD +
Sbjct: 69   GGSSEGEDSG-ETEVQSSYKG---PLETMDALEDVLVVKKSISKFYNGKSKSFTSLADVS 124

Query: 632  SCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRH-SNSRSTLTLAA 808
            + SS+KD+ KPENAY +KRKNLLAYSNFWDKN     RS++G ISKR   +SRSTL LA 
Sbjct: 125  ASSSVKDLAKPENAYAKKRKNLLAYSNFWDKNRNCPWRSNAGGISKRPLISSRSTLALAV 184

Query: 809  TMGCAE----XXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDSISSPP-EQMFSPWRSFS 973
            TM  +E                           Q ++S NN   SSPP +Q F PWRSFS
Sbjct: 185  TMSSSESGNYCXDSNCSSNLSSSHSPSLPPLHPQAKKSSNNAPSSSPPSQQKFPPWRSFS 244

Query: 974  LSDLQGA-AASPNISGITVNHRQE 1042
            LSDLQG  AA+P I+G+  N+ +E
Sbjct: 245  LSDLQGMDAATPGITGLAGNNNRE 268


>gb|EOY30257.1| Damaged dna-binding 2, putative isoform 1 [Theobroma cacao]
          Length = 288

 Score =  144 bits (362), Expect = 9e-32
 Identities = 110/280 (39%), Positives = 141/280 (50%), Gaps = 3/280 (1%)
 Frame = +2

Query: 200  KEMSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLK 379
            K MS+  ERN++  +          I RSGF  + M C S+YGS    N     G  RL 
Sbjct: 25   KTMSLVFERNDNTNS----------IRRSGF-IHGMECISVYGSPEEKNE----GRRRLS 69

Query: 380  VAEEVRVGDXXXXXXX-IGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEV 556
             A+E    D        IGRNSD    +    +     E QS+ KG    LD ++ALEEV
Sbjct: 70   SADEREEEDSRSCSSSSIGRNSDVSDGSSSDGEDSTEAEAQSELKG---PLDTMDALEEV 126

Query: 557  LPMKRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHIT 736
            LP++R               LADAA+ SSIKD  KP+N Y +KRKNLLA+S+   KNH  
Sbjct: 127  LPVRRGISKFYNGKSKSFTSLADAAAASSIKDFAKPDNPYNKKRKNLLAHSSLLFKNHNH 186

Query: 737  IRRSSSGAISKRHSN-SRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVRRSP 913
              RSS   ISKR +N SRST+ L  T+G ++                C      Q ++S 
Sbjct: 187  PLRSSGSEISKRLTNSSRSTVALGTTLGSSDSNSISSLP------STCLPPLHPQCKKST 240

Query: 914  NNDSISSPPEQMFSPWRSFSLSDLQ-GAAASPNISGITVN 1030
               S SSP  +   P RSFSLSDLQ  AAA+PNI+G+ V+
Sbjct: 241  TIRS-SSPTTRPNPPCRSFSLSDLQFVAAATPNITGLAVH 279


>ref|XP_006474735.1| PREDICTED: uncharacterized protein LOC102616005 [Citrus sinensis]
          Length = 244

 Score =  139 bits (350), Expect = 2e-30
 Identities = 112/272 (41%), Positives = 140/272 (51%), Gaps = 3/272 (1%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTC-HSIYGSGNGDNSSEFGGADRLKV 382
            MSIALERNNS           + I RS F    M C  S+Y   +   +  F G  R  V
Sbjct: 1    MSIALERNNS-----------NPIQRSKF----MQCVSSVY---DPPETEAFTGDRRFLV 42

Query: 383  AEEVRVGDXXXXXXXIGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLP 562
             EE    +       IGRNSD    +E  SDG  ++EVQS +KG    LD L ALE+VLP
Sbjct: 43   GEE---REDSSSTSSIGRNSDV---SEVPSDGEDSDEVQSSYKG---PLDTLNALEQVLP 93

Query: 563  MKRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIR 742
            +KR               LAD +S SSIK++ KPE+ YTRKRKNLLA++N +DKNH    
Sbjct: 94   IKRGISSFYNGKSKSFTSLADVSSASSIKELAKPEDPYTRKRKNLLAHNNLFDKNHNHQF 153

Query: 743  RSSSGAISKRHSN-SRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVRRSPNN 919
            +S+    SK+ +N  RS + L  TM   +                       Q ++SP+N
Sbjct: 154  KSNGRGASKKPANCGRSAMVLGMTMKSCDMNHRGDSDSIASSHLHHLPPLHPQGKKSPSN 213

Query: 920  DSISSPPEQMFSPWRSFSLSDLQ-GAAASPNI 1012
             S   PP +  SPWRSFSLSDLQ  AAASPNI
Sbjct: 214  GS-PPPPLRRNSPWRSFSLSDLQCVAAASPNI 244


>gb|EOY30258.1| Damaged dna-binding 2, putative isoform 2 [Theobroma cacao]
          Length = 240

 Score =  135 bits (340), Expect = 3e-29
 Identities = 99/245 (40%), Positives = 125/245 (51%), Gaps = 3/245 (1%)
 Frame = +2

Query: 305  MTCHSIYGSGNGDNSSEFGGADRLKVAEEVRVGDXXXXXXX-IGRNSDTESSAERSSDGG 481
            M C S+YGS    N     G  RL  A+E    D        IGRNSD    +    +  
Sbjct: 1    MECISVYGSPEEKNE----GRRRLSSADEREEEDSRSCSSSSIGRNSDVSDGSSSDGEDS 56

Query: 482  VNEEVQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXXXXXXXLADAASCSSIKDIVK 661
               E QS+ KG    LD ++ALEEVLP++R               LADAA+ SSIKD  K
Sbjct: 57   TEAEAQSELKG---PLDTMDALEEVLPVRRGISKFYNGKSKSFTSLADAAAASSIKDFAK 113

Query: 662  PENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRHSN-SRSTLTLAATMGCAEXXXX 838
            P+N Y +KRKNLLA+S+   KNH    RSS   ISKR +N SRST+ L  T+G ++    
Sbjct: 114  PDNPYNKKRKNLLAHSSLLFKNHNHPLRSSGSEISKRLTNSSRSTVALGTTLGSSDSNSI 173

Query: 839  XXXXXXXPPRGICXXXXXXQVRRSPNNDSISSPPEQMFSPWRSFSLSDLQ-GAAASPNIS 1015
                        C      Q ++S    S SSP  +   P RSFSLSDLQ  AAA+PNI+
Sbjct: 174  SSLP------STCLPPLHPQCKKSTTIRS-SSPTTRPNPPCRSFSLSDLQFVAAATPNIT 226

Query: 1016 GITVN 1030
            G+ V+
Sbjct: 227  GLAVH 231


>ref|XP_002281841.1| PREDICTED: uncharacterized protein LOC100260963 [Vitis vinifera]
            gi|147857682|emb|CAN82883.1| hypothetical protein
            VITISV_008557 [Vitis vinifera]
          Length = 281

 Score =  130 bits (328), Expect = 8e-28
 Identities = 104/276 (37%), Positives = 135/276 (48%), Gaps = 10/276 (3%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGN---GDNSSEFGGADRL 376
            MSIAL+R+++            RI  SGF  + M+C SI+ S     GD     GG    
Sbjct: 1    MSIALDRSSN------------RIEGSGF-MHGMSCISIFESPELLTGDRRFPAGGEMAA 47

Query: 377  KVAEEVRVGDXXXXXXXIGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEV 556
            K  E     D       IG+NSD    +    D G   EVQS +K     LD++ ALEEV
Sbjct: 48   KAEEREEELDSCSSSSSIGKNSDVSGMSSDQEDSG-ETEVQSSYKR---PLDSMNALEEV 103

Query: 557  LPMKRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHIT 736
            LP++R               LADA++ +S KD+ KPENAY R+R+NLLAY++  DKN   
Sbjct: 104  LPLRRGISRFYNGKSKSFTSLADASTSASCKDLAKPENAYNRRRRNLLAYNHVLDKNRNF 163

Query: 737  IRRSSSGAISKR-HSNSRSTLTLAATM------GCAEXXXXXXXXXXXPPRGICXXXXXX 895
              RS+ G ISK+  + SRSTL LA  M        +E            P  +       
Sbjct: 164  PLRSNGGGISKKLAATSRSTLALAVAMSSSDSNNSSEDLNSSLNCISRSP-SLLLPPLHP 222

Query: 896  QVRRSPNNDSISSPPEQMFSPWRSFSLSDLQGAAAS 1003
            Q R   NN S SSPP++  S WRS+SL+DLQ  A S
Sbjct: 223  QARLYHNNVS-SSPPQRNLSAWRSYSLADLQQCATS 257


>ref|XP_004141345.1| PREDICTED: uncharacterized protein LOC101215519 [Cucumis sativus]
          Length = 262

 Score =  128 bits (322), Expect = 4e-27
 Identities = 82/205 (40%), Positives = 113/205 (55%), Gaps = 6/205 (2%)
 Frame = +2

Query: 428  IGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXXXX 607
            IGRNSD         D G N+EVQS +KG    LD +++LEEVLP+++            
Sbjct: 66   IGRNSDQSDD----EDNGENDEVQSSYKG---PLDMMDSLEEVLPVRKGISKFYSGKSKS 118

Query: 608  XXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRH-SNS 784
               LADA+S +S+K+I KPENAY++KR+NL+AY+  W+KN     +++ G ISKR  S+S
Sbjct: 119  FTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPISSS 178

Query: 785  RSTLTLAATM-----GCAEXXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDSISSPPEQM 949
            +S+L LA  M       +E           PPR         Q R S NN     PP++ 
Sbjct: 179  KSSLALAVAMSSSESNSSEDSNCSSYSSSPPPR----PPLHPQSRPSNNNFPSMVPPQKT 234

Query: 950  FSPWRSFSLSDLQGAAASPNISGIT 1024
            FS WRS+SL+DLQ  A   N + +T
Sbjct: 235  FSTWRSYSLADLQECATFANKANLT 259


>gb|EXB38953.1| hypothetical protein L484_027388 [Morus notabilis]
          Length = 264

 Score =  128 bits (321), Expect = 5e-27
 Identities = 103/278 (37%), Positives = 130/278 (46%), Gaps = 3/278 (1%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLKVA 385
            MSIAL+ N               I RS F  + + C SIY S      +E    DR ++ 
Sbjct: 1    MSIALQSNGG-----------DAIRRSRF-IHGVPCVSIYDSSEPKVFAE----DRRRLE 44

Query: 386  EEVRVGDXXXXXXXIGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPM 565
             E            IGRNSD    +    D    +EVQS FKG    LD ++ALEEVLP+
Sbjct: 45   RE----SDSCSSTSIGRNSDLSGGSSDGEDSA-EDEVQSSFKG---PLDTMDALEEVLPI 96

Query: 566  KRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRR 745
            KR               LADA+S SSIKD  KPEN Y +KRKNLLA+ + WDKNH    +
Sbjct: 97   KRGISKFYSGKSKSFTSLADASSVSSIKDFAKPENPYNKKRKNLLAHGSLWDKNHNQPLK 156

Query: 746  SSSGAISKRHSN-SRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVRRSPNND 922
            +  G  SKR ++ +RS   L  T+  +                 C         +     
Sbjct: 157  NIGGGTSKRPASCNRSASVLCETLRSSATNVNCDDSSSISTSPSCNLPPLHPHGKRSPTI 216

Query: 923  SISSPPEQMFSPWRSFSLSDLQGAAAS--PNISGITVN 1030
              SSPP Q  SP RSFSLSDLQ  AAS  PNI+G+ ++
Sbjct: 217  GTSSPPRQ--SPRRSFSLSDLQSVAASSTPNINGLIIS 252


>gb|EOY30055.1| Damaged dna-binding 2, putative isoform 1 [Theobroma cacao]
            gi|508782800|gb|EOY30056.1| Damaged dna-binding 2,
            putative isoform 1 [Theobroma cacao]
          Length = 247

 Score =  126 bits (316), Expect = 2e-26
 Identities = 91/204 (44%), Positives = 116/204 (56%), Gaps = 6/204 (2%)
 Frame = +2

Query: 428  IGRNSDTESSAERSSDGGVNEE--VQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXX 601
            IGRNSD  S   RSSDGG  EE  VQS +KGG   LD +++LE+VLPM+R          
Sbjct: 54   IGRNSDDASG--RSSDGGACEENEVQSSYKGG---LDMMDSLEQVLPMRRGISNFYNGKS 108

Query: 602  XXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRHSN 781
                 LADA+S SSIKDI KPENAYTR+R+NLLA ++ WDKN        +  + +  S+
Sbjct: 109  KSFTSLADASSTSSIKDIAKPENAYTRRRRNLLAINHAWDKNR-------NKRLIRPISS 161

Query: 782  SRSTLTLAATMGCAE--XXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDSISSPPE--QM 949
            S+STL LA  M  +E              PR         Q R S NN + SSPP+  + 
Sbjct: 162  SKSTLALAVAMSSSESISSTSEDSTSTSSPR---LPPLHPQTRTSFNN-TPSSPPKSSRN 217

Query: 950  FSPWRSFSLSDLQGAAASPNISGI 1021
            FS WRSFSL+D++  A +P+ S I
Sbjct: 218  FSNWRSFSLADVREYATNPDCSSI 241


>ref|XP_002308520.1| MTD1 family protein [Populus trichocarpa]
           gi|118483800|gb|ABK93792.1| unknown [Populus
           trichocarpa] gi|222854496|gb|EEE92043.1| MTD1 family
           protein [Populus trichocarpa]
          Length = 239

 Score =  125 bits (315), Expect = 2e-26
 Identities = 82/188 (43%), Positives = 102/188 (54%), Gaps = 1/188 (0%)
 Frame = +2

Query: 428 IGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXXXX 607
           IG+NSD     E   DG    EVQS +KG    LD++EALEEVLP++R            
Sbjct: 46  IGKNSDLTDGGE---DGLEENEVQSAYKG---TLDSMEALEEVLPIRRGISNFYNGKSKS 99

Query: 608 XXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRHSNSR 787
              L+DA+S  SIKDI KPENAYTRKR+NLLA+S+ W+K      R  SG   +  SNS+
Sbjct: 100 FTSLSDASSSPSIKDIAKPENAYTRKRRNLLAFSHVWEKTRSFPYR--SGIAKRPISNSK 157

Query: 788 STLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDSISSP-PEQMFSPWR 964
           STL LA  M  +E             +            R+ +N+  S P P Q FSPWR
Sbjct: 158 STLALAVAMSSSESISSASEDSTSTSKSPPNLPPLHPRSRASHNNLTSLPSPRQNFSPWR 217

Query: 965 SFSLSDLQ 988
           SFSL+DLQ
Sbjct: 218 SFSLADLQ 225


>ref|XP_002516147.1| conserved hypothetical protein [Ricinus communis]
            gi|223544633|gb|EEF46149.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 262

 Score =  125 bits (314), Expect = 3e-26
 Identities = 83/196 (42%), Positives = 104/196 (53%), Gaps = 4/196 (2%)
 Frame = +2

Query: 428  IGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXXXX 607
            IG+NSD  S+ E   D     EVQS FKG    LD ++ALEE L M+R            
Sbjct: 58   IGKNSDLSSNGENCED---ENEVQSAFKG---TLDAMDALEEALSMRRGISKFYNGKSKS 111

Query: 608  XXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRH-SNS 784
               LA+A+S S IK+I KPENAYTR+R+NLLA+++ WDKN     RS+ G ISKR  S+S
Sbjct: 112  FTSLAEASSSSCIKEITKPENAYTRRRRNLLAFNHVWDKNRSFPHRSNGGGISKRPISSS 171

Query: 785  RSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQVRRS---PNNDSISSPPEQMFS 955
            +STL LA  M  +E                          RS    NN +    P Q FS
Sbjct: 172  KSTLALAVAMSSSESISSASEDSTSSSMSNTPTHLPPLHPRSRTYHNNLASLPSPRQNFS 231

Query: 956  PWRSFSLSDLQGAAAS 1003
            PWRSFS++DLQ  A +
Sbjct: 232  PWRSFSVADLQQCATT 247


>ref|XP_002324201.2| MTD1 family protein [Populus trichocarpa]
           gi|550318329|gb|EEF02766.2| MTD1 family protein [Populus
           trichocarpa]
          Length = 254

 Score =  122 bits (306), Expect = 3e-25
 Identities = 84/194 (43%), Positives = 106/194 (54%), Gaps = 7/194 (3%)
 Frame = +2

Query: 428 IGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXXXX 607
           IG++SD     E   DG    EVQS +KG   ALD++E LEEVLP++R            
Sbjct: 59  IGKDSDLSGGGE---DGLDENEVQSAYKG---ALDSMEGLEEVLPIRRGISKFYDGKSKS 112

Query: 608 XXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRH-SNS 784
              L+DA+S  SIKDI KPENA+TRKR+NLLA+++FW+KN     R+    ISKR  S+S
Sbjct: 113 FTILSDASSSPSIKDIAKPENAFTRKRRNLLAFNHFWEKNRGFPHRN---GISKRPISSS 169

Query: 785 RSTLTLAATMGCAE------XXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDSISSPPEQ 946
           +STL LA  M  +E                 PP          + R S NN +    P Q
Sbjct: 170 KSTLALAVAMSSSESISSASEDSNSTSTSKSPPH---LPPLHPRSRASHNNLASLPSPRQ 226

Query: 947 MFSPWRSFSLSDLQ 988
            FSPWRSFSL+DLQ
Sbjct: 227 SFSPWRSFSLADLQ 240


>gb|EMJ03691.1| hypothetical protein PRUPE_ppa010604mg [Prunus persica]
          Length = 243

 Score =  117 bits (293), Expect = 9e-24
 Identities = 93/280 (33%), Positives = 135/280 (48%), Gaps = 6/280 (2%)
 Frame = +2

Query: 206  MSIALERNNSAKNSYNHDHDHHRIGRSGFGANKMTCHSIYGSGNGDNSSEFGGADRLKVA 385
            M IAL+RN    N        H           M C S++     D+S   G A   ++ 
Sbjct: 1    MPIALDRNGGGGNMIQRPRFIHG----------MPCLSMH-----DSSENKGFAQHRRLE 45

Query: 386  EEVRVGDXXXXXXXIGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPM 565
            +++           +GRNSD+   +    D G   E+QS +KG    LD ++ LEEVLP+
Sbjct: 46   QDL----DSCSSSSVGRNSDSSDGSSEGDDSG-EAEIQSSYKG---PLDTMDQLEEVLPV 97

Query: 566  KRXXXXXXXXXXXXXXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRR 745
            KR               L D +S SS+KD+ KP+N + +KRKNLLA+SN+ + N+     
Sbjct: 98   KRGISMFYSGKSKSFTSLEDVSSVSSVKDLEKPKNRFMKKRKNLLAHSNYRNCNN---PL 154

Query: 746  SSSGAISKRHSN-SRSTLTLAATMGCAEXXXXXXXXXXXPPRGICXXXXXXQV----RRS 910
             ++GA+ +  +N SR +  L   +  +            PP   C       +    +RS
Sbjct: 155  KNNGAVKRPTANSSRGSFLLGENLSSS----------ISPPPTSCLPPLHPPLHPDSKRS 204

Query: 911  PNNDSISSPPEQMFSPWRSFSLSDLQ-GAAASPNISGITV 1027
            P N S S PP +  SPWRSFSLSDLQ  AAA+PNI+G+ +
Sbjct: 205  PGNGS-SPPPLRRNSPWRSFSLSDLQCVAAATPNITGLEI 243


>gb|AGV54556.1| hypothetical protein [Phaseolus vulgaris] gi|561034505|gb|ESW33035.1|
            hypothetical protein PHAVU_001G037900g [Phaseolus
            vulgaris]
          Length = 239

 Score =  115 bits (287), Expect = 4e-23
 Identities = 87/214 (40%), Positives = 108/214 (50%), Gaps = 12/214 (5%)
 Frame = +2

Query: 428  IGRNSDTESSAERSSDGGVNEEVQSKFKGGGGALDNLEALEEVLPMKRXXXXXXXXXXXX 607
            IGRNSD   S+ERS++GG NE V+S ++G    L ++E LEEVLP++R            
Sbjct: 26   IGRNSDV--SSERSAEGGENE-VESVYRG---PLHSMETLEEVLPIRRSISKFYGGKSKS 79

Query: 608  XXXLADAASCSSIKDIVKPENAYTRKRKNLLAYSNFWDKNHITIRRSSSGAISKRH-SNS 784
               LAD AS  S KDI KPENAYTRKR+NL+A +N  DKN     R   GAI KR  S S
Sbjct: 80   FTSLADVASSPSAKDIAKPENAYTRKRRNLMALNNVLDKNRSYPLRFIGGAICKRSISLS 139

Query: 785  RSTLTLAATMG--------CAEXXXXXXXXXXXPPRGICXXXXXXQVRRSPNNDSISSPP 940
            RS L LA  M          +E            P  +          R  +    SSP 
Sbjct: 140  RSNLALAVAMNNSDSSSSITSEEDSGSSSNSIPSPSSLSSLPALHPRSRVASGACPSSPS 199

Query: 941  EQMFSPWRSFSLSDLQ---GAAASPNISGITVNH 1033
             Q  S WRSFSL+DLQ     AA+  IS  ++ +
Sbjct: 200  LQNLSSWRSFSLADLQQHCAIAATMKISSTSIGN 233

