BLASTX nr result

ID: Zingiber25_contig00011653 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00011653
         (1496 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EEC76661.1| hypothetical protein OsI_14623 [Oryza sativa Indi...   382   e-103
ref|NP_001052042.1| Os04g0115500 [Oryza sativa Japonica Group] g...   380   e-103
ref|XP_002276294.1| PREDICTED: uncharacterized protein LOC100250...   377   e-102
ref|XP_004975007.1| PREDICTED: uncharacterized protein LOC101752...   374   e-101
ref|XP_006473648.1| PREDICTED: uncharacterized protein LOC102626...   367   9e-99
ref|XP_002510523.1| nucleic acid binding protein, putative [Rici...   366   1e-98
gb|EXC00976.1| hypothetical protein L484_016042 [Morus notabilis]     365   2e-98
ref|XP_004291367.1| PREDICTED: uncharacterized protein LOC101308...   363   8e-98
ref|XP_004291366.1| PREDICTED: uncharacterized protein LOC101308...   363   8e-98
ref|XP_006435171.1| hypothetical protein CICLE_v10001255mg [Citr...   362   3e-97
ref|XP_003550423.1| PREDICTED: uncharacterized protein LOC100816...   358   3e-96
ref|XP_004162557.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   357   7e-96
gb|AFW57693.1| hypothetical protein ZEAMMB73_127678 [Zea mays]        357   7e-96
ref|XP_003545035.1| PREDICTED: uncharacterized protein LOC100782...   355   2e-95
ref|XP_002301980.1| zinc finger family protein [Populus trichoca...   353   8e-95
gb|EOY15015.1| C2H2-like zinc finger protein [Theobroma cacao]        353   1e-94
gb|ESW32664.1| hypothetical protein PHAVU_001G007100g [Phaseolus...   353   1e-94
ref|XP_006390280.1| hypothetical protein EUTSA_v10018514mg [Eutr...   348   4e-93
ref|NP_001183873.1| hypothetical protein [Zea mays] gi|238015166...   347   6e-93
ref|XP_004238353.1| PREDICTED: uncharacterized protein LOC101261...   345   2e-92

>gb|EEC76661.1| hypothetical protein OsI_14623 [Oryza sativa Indica Group]
          Length = 478

 Score =  382 bits (981), Expect = e-103
 Identities = 228/402 (56%), Positives = 260/402 (64%), Gaps = 20/402 (4%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRS--TGEQVHDPA------SRL---GRAFCGSSICAIRDVVHGNTRVV 305
            +SWE  K+LLSCRS  +  +VHDPA      SRL   G   CG+S+CAIRDVV   +   
Sbjct: 111  SSWEQLKSLLSCRSATSAARVHDPAAPSSALSRLRSHGAGACGASLCAIRDVVDAASSA- 169

Query: 306  HRXXXXXXXXXXXXXXXXQNETAQLARSAR--HRPAPPVASLNCGRGGYYSPLPKLSGCY 479
                                +T  L RS+R  HR A    S + G GG+++ L  LSGCY
Sbjct: 170  --------SAASTAAASLDRDTTPLTRSSRRAHRAA---TSSSGGGGGHHASLRGLSGCY 218

Query: 480  ECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGRNIV 659
            EC A++V+  SRRYPRPRE LCACS CGEVFTK ++LE HQ  RH VSELGPEDSGRNIV
Sbjct: 219  ECRAINVEPMSRRYPRPRE-LCACSQCGEVFTKADSLEHHQAIRHAVSELGPEDSGRNIV 277

Query: 660  EIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGACHNH 839
            EIIFKSSW K DRPIC+I+RILKVHN  RTVARFE YR AV++R            C   
Sbjct: 278  EIIFKSSWQKRDRPICQIDRILKVHNAARTVARFEAYRDAVRTR------------CRAT 325

Query: 840  RSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFA--RA 1013
             +R AADGNELLRFH  +L+CPLG +G+TSLC +         ACGVCA IRHGFA    
Sbjct: 326  DARAAADGNELLRFHPAALACPLGLNGATSLCDDDD-------ACGVCAAIRHGFAPWAG 378

Query: 1014 NRPHGVRTTASSGRAHDCGPMTDATEDR--LRAMLVCRVIAGRVRRPGDDPAAED---AY 1178
              P GVRTTASSGRAHDCG    A +     RAMLVCRVIAGRVRR  DD  AE+   A+
Sbjct: 379  AHPLGVRTTASSGRAHDCGAAAVAAQQAGGCRAMLVCRVIAGRVRRNDDDGGAEEEEGAF 438

Query: 1179 DSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRAL 1304
            DSVA  EA    A   YGNLEEL VANPRAILPCFVVIYR +
Sbjct: 439  DSVAGDEA----ASSVYGNLEELFVANPRAILPCFVVIYRVV 476


>ref|NP_001052042.1| Os04g0115500 [Oryza sativa Japonica Group]
            gi|113563613|dbj|BAF13956.1| Os04g0115500 [Oryza sativa
            Japonica Group] gi|222628268|gb|EEE60400.1| hypothetical
            protein OsJ_13566 [Oryza sativa Japonica Group]
          Length = 480

 Score =  380 bits (977), Expect = e-103
 Identities = 227/402 (56%), Positives = 258/402 (64%), Gaps = 20/402 (4%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST--GEQVHDPA------SRL---GRAFCGSSICAIRDVVHGNTRVV 305
            +SWE  K+LLSCRS     +VHDPA      SRL   G   CG+S+CAIRDVV   +   
Sbjct: 113  SSWEQLKSLLSCRSATAAARVHDPAAPSSALSRLRSHGAGACGASLCAIRDVVDAASSA- 171

Query: 306  HRXXXXXXXXXXXXXXXXQNETAQLARSAR--HRPAPPVASLNCGRGGYYSPLPKLSGCY 479
                                +T  L RS+R  HR A    S + G GG+++ L  LSGCY
Sbjct: 172  --------SAASTAAASLDRDTTPLTRSSRRAHRAA---TSSSGGGGGHHASLRGLSGCY 220

Query: 480  ECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGRNIV 659
            EC A++V+  SRRYPRPRE LCACS CGEVF K ++LE HQ  RH VSELGPEDSGRNIV
Sbjct: 221  ECRAINVEPMSRRYPRPRE-LCACSQCGEVFNKADSLEHHQAIRHAVSELGPEDSGRNIV 279

Query: 660  EIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGACHNH 839
            EIIFKSSW K DRPIC+I+RILKVHN  RTVARFE YR AV++R            C   
Sbjct: 280  EIIFKSSWQKRDRPICQIDRILKVHNAARTVARFEAYRDAVRTR------------CRAT 327

Query: 840  RSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFA--RA 1013
             +R AADGNELLRFH  +L+CPLG +G+TSLC +         ACGVCA IRHGFA    
Sbjct: 328  AARAAADGNELLRFHPAALACPLGLNGATSLCDDDD-------ACGVCAAIRHGFAPWAG 380

Query: 1014 NRPHGVRTTASSGRAHDCGPMTDATEDR--LRAMLVCRVIAGRVRRPGDDPAAED---AY 1178
              P GVRTTASSGRAHDCG    A +     RAMLVCRVIAGRVRR  DD  AE+   A+
Sbjct: 381  AHPLGVRTTASSGRAHDCGAAAAAAQQAGGCRAMLVCRVIAGRVRRNDDDGGAEEEEGAF 440

Query: 1179 DSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRAL 1304
            DSVA  EA    A   YGNLEEL VANPRAILPCFVVIYR +
Sbjct: 441  DSVAGDEA----ASSVYGNLEELFVANPRAILPCFVVIYRVV 478


>ref|XP_002276294.1| PREDICTED: uncharacterized protein LOC100250572 [Vitis vinifera]
          Length = 419

 Score =  377 bits (967), Expect = e-102
 Identities = 223/405 (55%), Positives = 253/405 (62%), Gaps = 22/405 (5%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA------SRLGRAFCGSSICAIRDVVHGNTRVVHRXX 317
            +SW   KNLL+C+   G QVHDP+      S+LG + CGS IC+ RDVVHGNTRVVHR  
Sbjct: 52   SSWNQIKNLLTCKQIEGSQVHDPSKNPGGYSKLGSS-CGS-ICSFRDVVHGNTRVVHRAD 109

Query: 318  XXXXXXXXXXXXXX---------QNETAQLARSARHRPAPPVASLNCGRGGYYSPLPKLS 470
                                    + T  L+ S R   +    S + G         KLS
Sbjct: 110  NSPESSSVGQETGLLSRKTVSGSTSSTRSLSSSVRSNASATYTSSSRGM-----QFRKLS 164

Query: 471  GCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGR 650
            GCYECH + VD +  RYP PR T+CACS+CGEVF K E+LELHQ  RH VSELGPEDSGR
Sbjct: 165  GCYECHMI-VDPN--RYPSPRTTICACSECGEVFPKTESLELHQAVRHAVSELGPEDSGR 221

Query: 651  NIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGAC 830
            NIVEIIFKSSWLK D PICKIERILKVHN  RT+ RFE+ R AVK RA  ++        
Sbjct: 222  NIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEECRDAVKVRANNNT-------- 273

Query: 831  HNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGF-A 1007
                 RCAADGNELLRFHCT+L+C LG+ GS+SLC +  G       CGVC IIRHGF  
Sbjct: 274  -KKNPRCAADGNELLRFHCTTLTCALGSRGSSSLCGSVPG-------CGVCTIIRHGFQG 325

Query: 1008 RANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDDPAAED----- 1172
            +A    GVRTT SSGRAHDC P TD      RAMLVCRVIAGRV+R  DD   ED     
Sbjct: 326  KAGEAKGVRTTDSSGRAHDCLPCTDGR----RAMLVCRVIAGRVKRMADDAPDEDGASAG 381

Query: 1173 AYDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
            +YDSVA       G  G Y NLE+L V NPRAILPCFVVIY+ALD
Sbjct: 382  SYDSVA-------GYSGIYSNLEDLFVFNPRAILPCFVVIYKALD 419


>ref|XP_004975007.1| PREDICTED: uncharacterized protein LOC101752639 [Setaria italica]
          Length = 480

 Score =  374 bits (959), Expect = e-101
 Identities = 221/404 (54%), Positives = 250/404 (61%), Gaps = 21/404 (5%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST--GEQVHDPAS-----RL---GRAFCGSSICAIRDVVHGNTRVVH 308
            +SWE  K+LLSCRS     +VHDPA+     RL   G   CG+S+CA+RDVV   +    
Sbjct: 113  SSWEQVKSLLSCRSATAAARVHDPAAPSALARLRGSGAGACGASLCAMRDVVDAASSAA- 171

Query: 309  RXXXXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASLNCGRGGYYSPLPKLSGCYECH 488
                               +TA L R   HR     +S     GG +S L  LSGCYEC 
Sbjct: 172  ------------ASASADRDTAPLNRRRAHRAGSSSSSSAAAGGGSHSSLRGLSGCYECR 219

Query: 489  AVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGRNIVEII 668
            A++V+  SRRYPRPRE LCAC  CGEVFTK ++LE HQ  RH VSELGPEDSGRNIVEII
Sbjct: 220  AINVEPMSRRYPRPRE-LCACPQCGEVFTKADSLEHHQAIRHAVSELGPEDSGRNIVEII 278

Query: 669  FKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGACHNHRSR 848
            FKSSW K DRPIC I+RILKVHN PRTVARFE YR AV+SR            C    +R
Sbjct: 279  FKSSWQKRDRPICHIDRILKVHNAPRTVARFEAYRDAVRSR------------CRATAAR 326

Query: 849  CAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFARANRPH- 1025
             AADGNELLRFH   L+C LG SG+TSLC+          ACGVC  IRHGFA     H 
Sbjct: 327  AAADGNELLRFHSAPLACALGLSGATSLCAGA-------AACGVCTAIRHGFAPWVGAHQ 379

Query: 1026 -GVRTTASSGRAHDCG------PMTDATEDRLRAMLVCRVIAGRVRRPGDDPAA---EDA 1175
             GVRTTASSGRAHDCG         ++     RAMLVCRVIAGRVRR GD  +A   E  
Sbjct: 380  LGVRTTASSGRAHDCGGSESVQAAANSNNGGCRAMLVCRVIAGRVRRDGDATSAAGEEGP 439

Query: 1176 YDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
            +DSVA  +A    +   YGNLEEL VANPRAILPCFVVIYR L+
Sbjct: 440  FDSVAGEDA---ASSSVYGNLEELFVANPRAILPCFVVIYRVLE 480


>ref|XP_006473648.1| PREDICTED: uncharacterized protein LOC102626182 [Citrus sinensis]
          Length = 407

 Score =  367 bits (941), Expect = 9e-99
 Identities = 222/404 (54%), Positives = 259/404 (64%), Gaps = 22/404 (5%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA-------SRLGRAFCGSSICAIRDVVHGNTRVVHRX 314
            +SW+  KNLL+C+   G +VHDPA       SRLG +   SSIC+ +DVVHGNT+VVHR 
Sbjct: 42   SSWDQIKNLLTCKQIEGSKVHDPAAKNVNGYSRLGSSC--SSICSFKDVVHGNTKVVHRA 99

Query: 315  XXXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASLNCGRGGY--YSP------LPKLS 470
                             ET  L+R A +  +   AS++    G   YS         KLS
Sbjct: 100  DNSPESSTLG------QETGLLSRKAVNGSSTRSASVSARSNGCRAYSSSSRGMQFRKLS 153

Query: 471  GCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGR 650
            GCYECH + +D S  R+P PR T+CACS CGEVF K E+LELHQ  RH VSELGPEDS R
Sbjct: 154  GCYECHTI-IDPS--RFPSPRTTICACSQCGEVFPKIESLELHQAVRHAVSELGPEDSSR 210

Query: 651  NIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGAC 830
            NIVEIIFKSSWLK D P+CKIERILKVHN  RT+ RFED R AVK+RAL  +        
Sbjct: 211  NIVEIIFKSSWLKQDSPMCKIERILKVHNTQRTIQRFEDCRDAVKTRALNST-------- 262

Query: 831  HNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFAR 1010
                 RCAADGNELLRFHCT+LSC LG+ GS++LC +  G       C VC IIRHGF +
Sbjct: 263  -RKNPRCAADGNELLRFHCTTLSCNLGSRGSSTLCGSVPG-------CSVCTIIRHGF-Q 313

Query: 1011 ANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDD-PAAED----- 1172
                 GVRTTASSGRAHD   + + T++R RAMLVCRVIAGRVRR  DD P+ ED     
Sbjct: 314  GKECKGVRTTASSGRAHD--SLKNCTDER-RAMLVCRVIAGRVRRVTDDAPSEEDSVSGG 370

Query: 1173 AYDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRAL 1304
            +YDSVA       G  G Y NLEEL V NPRAILPCFVVIY++L
Sbjct: 371  SYDSVA-------GYTGVYSNLEELFVFNPRAILPCFVVIYKSL 407


>ref|XP_002510523.1| nucleic acid binding protein, putative [Ricinus communis]
            gi|223551224|gb|EEF52710.1| nucleic acid binding protein,
            putative [Ricinus communis]
          Length = 406

 Score =  366 bits (940), Expect = 1e-98
 Identities = 224/408 (54%), Positives = 254/408 (62%), Gaps = 25/408 (6%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA--------SRLGRAFCGSSICAIRDVVHGNTRVVHR 311
            +SW+  KNLL+C+   G  VHDP+        S+LG +   SSIC+ RD+VHGNTRVVHR
Sbjct: 33   SSWDQIKNLLTCKQIEGSSVHDPSKNSNNIGYSKLGSSC--SSICSFRDIVHGNTRVVHR 90

Query: 312  XXXXXXXXXXXXXXXXQNE-------TAQLARSARHRPAPPVAS-LNCGRGGYYSPLPKL 467
                             +        T  L  S R       +S ++  RG  +    KL
Sbjct: 91   ADNSPESSTVGQETGLLSRKATGGSSTRTLGGSGRSNGGATYSSHVSSSRGMQFR---KL 147

Query: 468  SGCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSG 647
            SGCYECH + VD S  RYP PR T+C C+ CGEVF K E+LELHQ  RH VSELGPEDSG
Sbjct: 148  SGCYECHMI-VDPS--RYPAPRTTICTCAQCGEVFPKTESLELHQKVRHAVSELGPEDSG 204

Query: 648  RNIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGA 827
            RNIVEIIFKSSWLK D PICKIERILKVHN  RT+ RFED R AVK+RAL  +       
Sbjct: 205  RNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEDCRDAVKTRALNST------- 257

Query: 828  CHNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFA 1007
                  RCAADGNELLRFHCT+LSC LGA GS+SLC +       I  CGVC IIRHGF 
Sbjct: 258  --KKNPRCAADGNELLRFHCTTLSCSLGARGSSSLCGS-------IPCCGVCTIIRHGF- 307

Query: 1008 RANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDD--PAAEDA-- 1175
            +     GVRTTASSGRAHD   +   T+ R RAMLVCRVIAGRV+R  DD  PA E+A  
Sbjct: 308  QGKECKGVRTTASSGRAHD--SLLGCTDGR-RAMLVCRVIAGRVKRVADDTPPAEEEALA 364

Query: 1176 ----YDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
                YDSVA       G  G Y NLEEL V NPRAILPCFVVIY AL+
Sbjct: 365  AAGSYDSVA-------GYAGIYSNLEELFVFNPRAILPCFVVIYSALE 405


>gb|EXC00976.1| hypothetical protein L484_016042 [Morus notabilis]
          Length = 419

 Score =  365 bits (938), Expect = 2e-98
 Identities = 220/403 (54%), Positives = 248/403 (61%), Gaps = 20/403 (4%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA-------SRLGRAFCGSSICAIRDVVHGNTRVVHRX 314
            +SW+  KNLL+C+   G QVHDP+       S+LG +   SSIC+ RDVVHGNTRVVHR 
Sbjct: 46   SSWDQIKNLLTCKQIEGSQVHDPSKNGVVGYSKLGSSC--SSICSFRDVVHGNTRVVHRA 103

Query: 315  XXXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASLNCGRGGYYSP-------LPKLSG 473
                             E   L+R A +  +    S     G  Y+          KLSG
Sbjct: 104  DNSPESSSLG------QEAGLLSRKAVNGSSSSTRSARSNCGAAYTSSSSRGMQFRKLSG 157

Query: 474  CYECHAVSVDSSSRRYP-RPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGR 650
            CYECH + VD S  RYP  PR T+CAC  CGEVF K E+LELHQ  RH VSELGPEDSGR
Sbjct: 158  CYECHMI-VDPS--RYPITPRSTICACPQCGEVFPKIESLELHQAVRHAVSELGPEDSGR 214

Query: 651  NIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGAC 830
            NIVEIIFKSSWLK D PI KIERILKVHN  RT+ RFED R AVK+RAL  +        
Sbjct: 215  NIVEIIFKSSWLKKDNPIFKIERILKVHNTQRTIQRFEDCRDAVKARALGST-------- 266

Query: 831  HNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGF-- 1004
                 RCAADGNELLRFHCT+LSC LGA G++SLC +  G       CGVC IIRHGF  
Sbjct: 267  -RKNPRCAADGNELLRFHCTTLSCALGARGASSLCGSVPG-------CGVCTIIRHGFQG 318

Query: 1005 --ARANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDDPAAEDAY 1178
                A    GV TTASSGRAHD    TD      RAMLVCRVIAGRV+R  DD A E+  
Sbjct: 319  KNGGAGEGKGVWTTASSGRAHDSLKCTDGR----RAMLVCRVIAGRVKRVADDAAPEEDS 374

Query: 1179 DSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
             S+A +     G  G Y NLEEL V NPRAILPCFVVIY+AL+
Sbjct: 375  VSLASSYDSVAGYAGIYSNLEELTVFNPRAILPCFVVIYKALE 417


>ref|XP_004291367.1| PREDICTED: uncharacterized protein LOC101308537 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 412

 Score =  363 bits (933), Expect = 8e-98
 Identities = 222/409 (54%), Positives = 253/409 (61%), Gaps = 26/409 (6%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA------SRLGRAFCGSSICAIRDVVHGNT--RVVHR 311
            +SW+HFKNLL+C+   G +VHDP+      S+LG + CGS IC+ +DVVHGNT  RVVHR
Sbjct: 34   SSWDHFKNLLTCKQIEGSRVHDPSKNAVGYSKLGSS-CGS-ICSFKDVVHGNTTSRVVHR 91

Query: 312  XXXXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASL-NCGRGGYYSP--------LPK 464
                              ET  L+R    R     AS  N G G  Y+           K
Sbjct: 92   ADNSPDSSSLG------QETGLLSRKVASRSESGSASRSNRGGGAGYTSSNSSRGMQFRK 145

Query: 465  LSGCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDS 644
            LSGCYECH + VD S  RYP PR T+C+C  CGE+F K E+LELHQ  RH VSELG EDS
Sbjct: 146  LSGCYECHYI-VDPS--RYPVPRSTICSCPHCGEIFPKLESLELHQSIRHAVSELGVEDS 202

Query: 645  GRNIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGG 824
            GRNIVEIIFKSSWLK D PIC+IERILKV N  RT+ RFED R AVK+RAL  +      
Sbjct: 203  GRNIVEIIFKSSWLKKDSPICRIERILKVQNTQRTIQRFEDCRDAVKNRALNSA------ 256

Query: 825  ACHNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGF 1004
                   RCAADGNELLRFHCTS+SCPLGA GST+LC +  G       CGVC  IRHGF
Sbjct: 257  ---RKNPRCAADGNELLRFHCTSISCPLGARGSTNLCGSVPG-------CGVCTTIRHGF 306

Query: 1005 -----ARANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDDPAAE 1169
                        GVRTTASSGRAHD    TD      RAMLVCRVIAGRVRR  +D A E
Sbjct: 307  QGHGGGEVVGSKGVRTTASSGRAHDSLHCTDGR----RAMLVCRVIAGRVRRVAEDAAEE 362

Query: 1170 DAYDSVAVAEAEAD---GACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
            +   + + A    D   G  G + NLEEL+V NPRAILPCFVVIY+ALD
Sbjct: 363  EGVTASSSAGGTYDSVAGYAGVFSNLEELVVFNPRAILPCFVVIYKALD 411


>ref|XP_004291366.1| PREDICTED: uncharacterized protein LOC101308537 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 431

 Score =  363 bits (933), Expect = 8e-98
 Identities = 222/409 (54%), Positives = 253/409 (61%), Gaps = 26/409 (6%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA------SRLGRAFCGSSICAIRDVVHGNT--RVVHR 311
            +SW+HFKNLL+C+   G +VHDP+      S+LG + CGS IC+ +DVVHGNT  RVVHR
Sbjct: 53   SSWDHFKNLLTCKQIEGSRVHDPSKNAVGYSKLGSS-CGS-ICSFKDVVHGNTTSRVVHR 110

Query: 312  XXXXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASL-NCGRGGYYSP--------LPK 464
                              ET  L+R    R     AS  N G G  Y+           K
Sbjct: 111  ADNSPDSSSLG------QETGLLSRKVASRSESGSASRSNRGGGAGYTSSNSSRGMQFRK 164

Query: 465  LSGCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDS 644
            LSGCYECH + VD S  RYP PR T+C+C  CGE+F K E+LELHQ  RH VSELG EDS
Sbjct: 165  LSGCYECHYI-VDPS--RYPVPRSTICSCPHCGEIFPKLESLELHQSIRHAVSELGVEDS 221

Query: 645  GRNIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGG 824
            GRNIVEIIFKSSWLK D PIC+IERILKV N  RT+ RFED R AVK+RAL  +      
Sbjct: 222  GRNIVEIIFKSSWLKKDSPICRIERILKVQNTQRTIQRFEDCRDAVKNRALNSA------ 275

Query: 825  ACHNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGF 1004
                   RCAADGNELLRFHCTS+SCPLGA GST+LC +  G       CGVC  IRHGF
Sbjct: 276  ---RKNPRCAADGNELLRFHCTSISCPLGARGSTNLCGSVPG-------CGVCTTIRHGF 325

Query: 1005 -----ARANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDDPAAE 1169
                        GVRTTASSGRAHD    TD      RAMLVCRVIAGRVRR  +D A E
Sbjct: 326  QGHGGGEVVGSKGVRTTASSGRAHDSLHCTDGR----RAMLVCRVIAGRVRRVAEDAAEE 381

Query: 1170 DAYDSVAVAEAEAD---GACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
            +   + + A    D   G  G + NLEEL+V NPRAILPCFVVIY+ALD
Sbjct: 382  EGVTASSSAGGTYDSVAGYAGVFSNLEELVVFNPRAILPCFVVIYKALD 430


>ref|XP_006435171.1| hypothetical protein CICLE_v10001255mg [Citrus clementina]
            gi|557537293|gb|ESR48411.1| hypothetical protein
            CICLE_v10001255mg [Citrus clementina]
          Length = 426

 Score =  362 bits (928), Expect = 3e-97
 Identities = 220/404 (54%), Positives = 258/404 (63%), Gaps = 22/404 (5%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRSTG-EQVHDPA-------SRLGRAFCGSSICAIRDVVHGNTRVVHRX 314
            +SW+  KNLL+C+     +VHDPA       SRLG +   SSIC+ +DVVHGNT+VVHR 
Sbjct: 61   SSWDQIKNLLTCKQIEVSKVHDPAAKNVNGYSRLGSSC--SSICSFKDVVHGNTKVVHRA 118

Query: 315  XXXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASLNCGRGGY--YSP------LPKLS 470
                             ET  L+R A +  +   AS++    G   YS         KLS
Sbjct: 119  DNSPESSTLG------QETGLLSRKAVNGSSTRSASVSARSNGCRTYSSSSRGMQFRKLS 172

Query: 471  GCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGR 650
            GCYECH + +D S  R+P PR T+CACS CGEVF K E+LELHQ  RH VSELGPEDS R
Sbjct: 173  GCYECHTI-IDPS--RFPSPRTTICACSQCGEVFPKIESLELHQAVRHAVSELGPEDSSR 229

Query: 651  NIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGAC 830
            NIVEIIFKSSWLK + P+CKIERILKVHN  RT+ RFED R AVK+RAL  +        
Sbjct: 230  NIVEIIFKSSWLKQNSPMCKIERILKVHNTQRTIQRFEDCRDAVKTRALNST-------- 281

Query: 831  HNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFAR 1010
                 RCAADGNELLRFHCT+LSC LG+ GS++LC +  G       C VC IIRHGF +
Sbjct: 282  -RKNPRCAADGNELLRFHCTTLSCNLGSRGSSTLCGSVPG-------CSVCTIIRHGF-Q 332

Query: 1011 ANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDD-PAAED----- 1172
                 GVRTTASSGRAHD   + + T++R RAMLVCRVIAGRVRR  DD P+ ED     
Sbjct: 333  GKECKGVRTTASSGRAHD--SLKNCTDER-RAMLVCRVIAGRVRRVTDDAPSEEDSVSGG 389

Query: 1173 AYDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRAL 1304
            +YDSVA       G  G Y NLEEL V NPRAILPCFVVIY++L
Sbjct: 390  SYDSVA-------GYTGVYSNLEELFVFNPRAILPCFVVIYKSL 426


>ref|XP_003550423.1| PREDICTED: uncharacterized protein LOC100816726 [Glycine max]
          Length = 417

 Score =  358 bits (919), Expect = 3e-96
 Identities = 213/405 (52%), Positives = 251/405 (61%), Gaps = 25/405 (6%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA------SRLGRAFCGSSICAIRDVVHGNTRVVHRXX 317
            +SW+  KNLL+C+   G +VHDP+      S+LG +   SSIC+ RDVVHGNTRVVHR  
Sbjct: 36   SSWDQIKNLLTCKQIEGSRVHDPSKVVSGYSKLGSSC--SSICSFRDVVHGNTRVVHRSD 93

Query: 318  XXXXXXXXXXXXXX---------QNETAQLARSARHRPAPPVASLNCGRGGYYSPLPKLS 470
                                      T     SA+        S +  RG  +    KLS
Sbjct: 94   NSSPESSSLGQETNGLLTRKPVTTTTTTTTRSSAKSHGGATYTSSSSSRGMQFR---KLS 150

Query: 471  GCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGR 650
            GCYECH + +D S  R P  R T+CACS CGEVF K E+LELHQ  RH VSELGPEDSGR
Sbjct: 151  GCYECHMI-IDPS--RLPIARSTVCACSHCGEVFPKMESLELHQAVRHAVSELGPEDSGR 207

Query: 651  NIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGAC 830
            NIVEIIFKSSWLK D PICKIERILKVHN  RT+ RFE+ R  VK+RAL  +        
Sbjct: 208  NIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEECRDTVKNRALGST-------- 259

Query: 831  HNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFAR 1010
                 RCAADGNELLRFHCT+L+C LGA GS+SLC++  G      +CGVC IIRHGF  
Sbjct: 260  -KKNPRCAADGNELLRFHCTTLTCALGARGSSSLCASVHG------SCGVCTIIRHGFQG 312

Query: 1011 AN---------RPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDDPA 1163
             +         +  GVRTTASSGRAHD     DAT    RAMLVCRVIAGRV+R  +D  
Sbjct: 313  GSCGGGSGDHGKAKGVRTTASSGRAHDSVVCGDATR---RAMLVCRVIAGRVKRVVEDAP 369

Query: 1164 AEDAYDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYR 1298
            +E+ + SVA  ++ A G  G Y NLEEL+V NP+AILPCFVVIY+
Sbjct: 370  SEEEHVSVASYDSVA-GYAGIYSNLEELVVFNPKAILPCFVVIYK 413


>ref|XP_004162557.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101231096
            [Cucumis sativus]
          Length = 424

 Score =  357 bits (916), Expect = 7e-96
 Identities = 216/412 (52%), Positives = 249/412 (60%), Gaps = 30/412 (7%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRSTG-EQVHDPA------SRLGRAFCGSSICAIRDVVHGNTRVVHRXX 317
            +SW+H KNL++C+     +V +P       S+LG +   SSIC+ RDVVHGN +VVHR  
Sbjct: 35   SSWDHIKNLITCKQVEVSRVQEPGKRSPAYSKLGSSC--SSICSFRDVVHGNAKVVHRAD 92

Query: 318  XXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASLNCGRGGYYS---------PLPKLS 470
                              +    S+R   AP  A    G  G  S          L KLS
Sbjct: 93   NSPESSSVGQETRLLTRKSANGSSSRSLTAPTPARTKNGGSGSASYNSSSSRGIQLRKLS 152

Query: 471  GCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGR 650
            GCYECH + VD S  R P PR ++C C  CGEVF K E+LELHQ+ RH VSELG EDSGR
Sbjct: 153  GCYECHTI-VDPS--RLPIPRSSICPCPQCGEVFPKIESLELHQLVRHAVSELGXEDSGR 209

Query: 651  NIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGAC 830
            NIVEIIFKSSWLK DRPICKIERILKVHN  RT+ RFED R AVK+RAL       G   
Sbjct: 210  NIVEIIFKSSWLKKDRPICKIERILKVHNTQRTIQRFEDCRDAVKTRAL-------GSTX 262

Query: 831  HNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGF-A 1007
             N   RCAADGNELLRFHC++L C LG+ GST LC +       I ACGVC +IRHGF +
Sbjct: 263  KN--PRCAADGNELLRFHCSALFCDLGSRGSTGLCGS-------IPACGVCTVIRHGFQS 313

Query: 1008 RANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDDPAA------- 1166
            +   P GVRTTASSGRAHD     D  + R RAMLVCRVIAGRV+R  DD AA       
Sbjct: 314  KPGGPPGVRTTASSGRAHD---SFDCGDGRRRAMLVCRVIAGRVKRISDDAAATGTTTTT 370

Query: 1167 ---EDAYDSVAVAEAEADGA---CGAYGNLEELLVANPRAILPCFVVIYRAL 1304
               E+   S A A A  D      G Y NLEEL++ NP+AILPCFVVIY AL
Sbjct: 371  ATEEENVVSAAAAAASYDSVSRHSGMYSNLEELVIFNPKAILPCFVVIYEAL 422


>gb|AFW57693.1| hypothetical protein ZEAMMB73_127678 [Zea mays]
          Length = 512

 Score =  357 bits (916), Expect = 7e-96
 Identities = 221/420 (52%), Positives = 250/420 (59%), Gaps = 39/420 (9%)
 Frame = +3

Query: 165  WEHFKNLLSCRST--GEQVHDPAS-----RL---GRAFCGSSICAIRDVVHGNTRVVHRX 314
            WE  K+LLSCRS     +VHDPA+     RL   G   CG+S+CA+RDVV   +      
Sbjct: 117  WEQVKSLLSCRSVTAAARVHDPAAPSALARLRGSGAGTCGASLCAMRDVVDAASSAASSA 176

Query: 315  XXXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASLNCGRGG-YYSPLPKLSGCYECHA 491
                             +TA L R   HR A   +S + G G  ++S L  LSGCYEC A
Sbjct: 177  AASASASG-------DRDTAPLNRRRAHRGA---SSSSAGAGSSHHSSLRGLSGCYECRA 226

Query: 492  VSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGRNIVEIIF 671
            ++V+  SRRYPRPRE LCAC  CGEVFTK +TLE HQ  RH VSELGPEDSGRNIVEIIF
Sbjct: 227  INVEPMSRRYPRPRE-LCACPQCGEVFTKADTLEHHQAIRHAVSELGPEDSGRNIVEIIF 285

Query: 672  KSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGACHNHRSRC 851
            KSSW K DRPIC I+RILKVHN PRTVARFE YR AV+SR     A           +R 
Sbjct: 286  KSSWHKRDRPICHIDRILKVHNAPRTVARFEAYRDAVRSRCRAVVA-----------ARA 334

Query: 852  AADGNELLRFHCTSLSCPLGASGSTSLC--SNTGGTSSPIVACGVCAIIRHGFA--RANR 1019
            AADGNELLRFH   L+C LG  G+T+LC  +      +P   CGVC  IRHGFA      
Sbjct: 335  AADGNELLRFHSAPLACALGLGGATALCCAAADADADAPAPPCGVCTAIRHGFAPWLGAH 394

Query: 1020 PHGVRTTASSGRAHDCG--------PMTDATED---------RLRAMLVCRVIAGRVRRP 1148
            P GVRTTASSGRAHDC         P    T+            RAMLVCRVIAGRVRR 
Sbjct: 395  PLGVRTTASSGRAHDCASDNNNPSPPPPHQTQSAASDVSNAAACRAMLVCRVIAGRVRRD 454

Query: 1149 GDDPAA-------EDAYDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
            GD   +       E  +DSVA  +A    +   YGNLEEL VANPRAILPCFVVIYR LD
Sbjct: 455  GDGDTSAAAGEDQEGPFDSVAGEDA---ASSSVYGNLEELFVANPRAILPCFVVIYRVLD 511


>ref|XP_003545035.1| PREDICTED: uncharacterized protein LOC100782665 [Glycine max]
          Length = 414

 Score =  355 bits (912), Expect = 2e-95
 Identities = 212/397 (53%), Positives = 243/397 (61%), Gaps = 17/397 (4%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRSTGE-QVHDPASRLGRAFCGSS---ICAIRDVVHGNTRVVHRXXXXX 326
            +SW+  KNLL+C+   E +VHDP+   G +  GSS   IC+ RDVVHGNTRVVHR     
Sbjct: 43   SSWDQIKNLLTCKQMEESRVHDPSKITGYSKLGSSCSSICSFRDVVHGNTRVVHRSDNSS 102

Query: 327  XXXXXXXXXXXQNETAQLARSARHRPAPPVASLNCGRGGYYS---PLPKLSGCYECHAVS 497
                          T +   +   R A       C      S      KLSGCYECH + 
Sbjct: 103  PESSSLGQETNGLLTRKPVTTTTTRSAKSNGGATCTSSSSSSRGMQFRKLSGCYECHMI- 161

Query: 498  VDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGRNIVEIIFKS 677
            +D S  R P  R T+CACS CGEVF K E+LELHQ  RH VSELGPEDSGRNIVEIIFKS
Sbjct: 162  IDPS--RLPIARSTVCACSHCGEVFPKMESLELHQAVRHAVSELGPEDSGRNIVEIIFKS 219

Query: 678  SWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGACHNHRSRCAA 857
            SWLK D PICKIERILKVHN  RT+ RFE+ R  VK+RAL  +             RCAA
Sbjct: 220  SWLKKDNPICKIERILKVHNTQRTIQRFEECRDTVKNRALGST---------KKNPRCAA 270

Query: 858  DGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFARA-------N 1016
            DGNELLRFHCT+L+C LGA GS+SLC++  G       C VC IIRHGF           
Sbjct: 271  DGNELLRFHCTTLTCALGARGSSSLCASVPG-------CSVCTIIRHGFQGGCGGGGDHA 323

Query: 1017 RPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDDPAAED---AYDSV 1187
            R  GVRTTASSGRAHD     DAT    RAMLVCRVIAGRV+R  +D  +E+   +YDSV
Sbjct: 324  RAKGVRTTASSGRAHDSVVCGDATR---RAMLVCRVIAGRVKRVVEDAPSEEEHVSYDSV 380

Query: 1188 AVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYR 1298
            A       G  G Y NLEEL+V NP+AILPCFVVIY+
Sbjct: 381  A-------GYAGIYSNLEELVVFNPKAILPCFVVIYK 410


>ref|XP_002301980.1| zinc finger family protein [Populus trichocarpa]
            gi|222843706|gb|EEE81253.1| zinc finger family protein
            [Populus trichocarpa]
          Length = 405

 Score =  353 bits (907), Expect = 8e-95
 Identities = 216/407 (53%), Positives = 250/407 (61%), Gaps = 24/407 (5%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA------SRLGRAFCGSSICAIRDVVHGNTRVVHRXX 317
            +SW+  KNLL+C+   G +VHDP+      S+LG +   SSIC+ +DVVHGNTRVVHR  
Sbjct: 36   SSWDQIKNLLTCKQIEGSRVHDPSKNPIGYSKLGSSC--SSICSFKDVVHGNTRVVHRAD 93

Query: 318  XXXXXXXXXXXXXXQNETAQLARSARHRPAPPVASLN----------CGRGGYYSPLPKL 467
                            ET  L+R      +    SL           C          KL
Sbjct: 94   NSPESSTLG------QETGLLSRKGVSTGSSSTRSLTSSGRSNSGVTCSSSSRGMQFRKL 147

Query: 468  SGCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSG 647
            SGCYECH + VD S  RYP  R T+ AC+ CGEVF K E+LELHQ  RH VSELGPEDSG
Sbjct: 148  SGCYECHMI-VDPS--RYPSARTTISACTQCGEVFPKIESLELHQKVRHAVSELGPEDSG 204

Query: 648  RNIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGA 827
            RNIVEIIFKSSWLK D PICKIERILKVHN  RT+ RFED R AVK+RAL  +       
Sbjct: 205  RNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEDCRDAVKTRALNST------- 257

Query: 828  CHNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFA 1007
                  RCAADGNELLRFHCT+L+C LG+ GS+SLC +       I  CGVC IIRHGF 
Sbjct: 258  --KKNPRCAADGNELLRFHCTTLTCSLGSLGSSSLCGS-------IPVCGVCTIIRHGF- 307

Query: 1008 RANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRR-------PGDDPAA 1166
            +     GV TTASSGRAHD   +   T+ R RAMLVCRVIAGRV+R       P +D A+
Sbjct: 308  QGIECKGVSTTASSGRAHD--SLWGCTDGR-RAMLVCRVIAGRVKRVAEDALPPEEDGAS 364

Query: 1167 EDAYDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
              +YDSVA       G  G Y +LEEL V NPRAILPCFVVIY+AL+
Sbjct: 365  AGSYDSVA-------GGAGIYSSLEELSVFNPRAILPCFVVIYKALE 404


>gb|EOY15015.1| C2H2-like zinc finger protein [Theobroma cacao]
          Length = 413

 Score =  353 bits (906), Expect = 1e-94
 Identities = 214/408 (52%), Positives = 249/408 (61%), Gaps = 25/408 (6%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA----------SRLGRAFCGSSICAIRDVVHGNTRVV 305
            +SW+  KNLL+C+   G +VHDP+          S+LG +   +SIC+ RDVVHGNTRVV
Sbjct: 33   SSWDQIKNLLTCKQVEGSKVHDPSKNNPPHHHGYSKLGSSC--NSICSFRDVVHGNTRVV 90

Query: 306  HRXXXXXXXXXXXXXXXXQ-------NETAQLARSARHRPAPPVASLNCGRGGYYSPLPK 464
            HR                        + T  L+ S R   +    + +  R   +    K
Sbjct: 91   HRADNSPESSTVGQETGLLRRKAANGSSTRSLSGSTRSNTSTTYTTSSSSRAMQFR---K 147

Query: 465  LSGCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDS 644
            LSGCYECH + VD S  RYP  R T+ ACS CGEVF K E+LELHQ  RH VSELGPEDS
Sbjct: 148  LSGCYECHMI-VDPS--RYPSSRTTISACSQCGEVFPKIESLELHQAVRHAVSELGPEDS 204

Query: 645  GRNIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGG 824
            GRNIVEIIFKSSWLK D PICKIERILKVHN  RT+ RFED R AVK+RAL  +      
Sbjct: 205  GRNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEDCRDAVKTRALNST------ 258

Query: 825  ACHNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGF 1004
                   RCAADGNELLRFHCT+L+C LGA GS+SLC    G       CGVC IIR GF
Sbjct: 259  ---RKNPRCAADGNELLRFHCTTLTCSLGARGSSSLCGAVPG-------CGVCTIIRQGF 308

Query: 1005 ------ARANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDDPAA 1166
                  A      GVRTTASSGRAHD    +   +D  RAMLVCRVIAGRV+R  DD   
Sbjct: 309  QNKGGSAAVADFKGVRTTASSGRAHD----SLNCKDGRRAMLVCRVIAGRVKRVTDDAPL 364

Query: 1167 EDAYDSVAVAEAEADGA-CGAYGNLEELLVANPRAILPCFVVIYRALD 1307
            E+   SV+    ++  A  G Y NLEEL+V NPRAILPCFVVIY+AL+
Sbjct: 365  EEDNSSVSAGSYDSLAAYAGVYSNLEELVVFNPRAILPCFVVIYKALE 412


>gb|ESW32664.1| hypothetical protein PHAVU_001G007100g [Phaseolus vulgaris]
          Length = 408

 Score =  353 bits (905), Expect = 1e-94
 Identities = 211/394 (53%), Positives = 249/394 (63%), Gaps = 14/394 (3%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA---SRLGRAFCGSSICAIRDVVHGNTRVVHRXXXXX 326
            +SW+  KNLL+C+   G +VHDP+   SR+G +   SSIC+ RDVVHGNTRVVHR     
Sbjct: 39   SSWDQIKNLLTCKQMEGSRVHDPSKGYSRIGSSC--SSICSFRDVVHGNTRVVHRSDNSS 96

Query: 327  XXXXXXXXXXX----QNETAQLARSARHRPAPPVASLNCGRGGYYSPLPKLSGCYECHAV 494
                           +  TA    +A    A    + +  RG  +    KLSGCYECH +
Sbjct: 97   PESSSLGQETGLLTRKPVTASTRSTAAKSNAGTTYTSSSSRGMQFR---KLSGCYECHMI 153

Query: 495  SVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGRNIVEIIFK 674
             +D S  R P  R T+CACS CGEVF K E+LELHQ  RH VSEL PEDSGRNIVEIIFK
Sbjct: 154  -IDPS--RLPVARSTVCACSHCGEVFPKMESLELHQAVRHAVSELEPEDSGRNIVEIIFK 210

Query: 675  SSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGACHNHRSRCA 854
            SSWLK D PICKIERILKVHN  RT+ RFE+ R  VK+RAL  +             RCA
Sbjct: 211  SSWLKKDNPICKIERILKVHNTQRTIQRFEECRDTVKNRALSST---------KKNPRCA 261

Query: 855  ADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFARAN-----R 1019
            ADGNELLRFHCT+L+C LGA GS+SLC++  G       CGVC IIRHGF         +
Sbjct: 262  ADGNELLRFHCTTLTCALGARGSSSLCASVPG-------CGVCTIIRHGFQGGGGGDHAK 314

Query: 1020 PHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRRPGDD-PAAEDAYDSVAVA 1196
              GV TTASSGRAHD     D T    RAMLVCRVIAGRV+R  +D P +E+ + SVA  
Sbjct: 315  VKGVGTTASSGRAHDSVVCGDGTR---RAMLVCRVIAGRVKRVAEDTPPSEEEHVSVASY 371

Query: 1197 EAEADGACGAYGNLEELLVANPRAILPCFVVIYR 1298
            ++ A G  G Y NLEEL+V NP+AILPCFVVIY+
Sbjct: 372  DSVA-GYAGIYSNLEELVVFNPKAILPCFVVIYK 404


>ref|XP_006390280.1| hypothetical protein EUTSA_v10018514mg [Eutrema salsugineum]
            gi|312282391|dbj|BAJ34061.1| unnamed protein product
            [Thellungiella halophila] gi|557086714|gb|ESQ27566.1|
            hypothetical protein EUTSA_v10018514mg [Eutrema
            salsugineum]
          Length = 459

 Score =  348 bits (892), Expect = 4e-93
 Identities = 221/431 (51%), Positives = 258/431 (59%), Gaps = 48/431 (11%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPA--SRLGRAFCG-----------SSICAIRDVVHGNT 296
            +SW+  KNLL+C+   G +VHDP+  S+ G +              SSIC+ RDV HGNT
Sbjct: 54   SSWDQIKNLLTCKQIEGSRVHDPSKNSQSGPSMTTHLSPSKLSSSCSSICSFRDVAHGNT 113

Query: 297  RVVHRXXXXXXXXXXXXXXXXQNETAQLARS-ARHRPAPPVASL------NCGRGGYYSP 455
            RVVHR                 +ET  L R   +H  +    SL      + G G Y S 
Sbjct: 114  RVVHRADHSPDVANSATPA--DSETRLLTRKPGQHGSSSSSRSLISGSARSNGSGSYTSS 171

Query: 456  ---------LPKLSGCYECHAVSVDSSSRRYP-RPRETLCACSDCGEVFTKPETLELHQI 605
                       KLSGCYECH + VD S  RYP  PR  +CACS CGEVF K E+LELHQ 
Sbjct: 172  STTSFRAMQFRKLSGCYECHMI-VDPS--RYPISPR--VCACSQCGEVFPKLESLELHQA 226

Query: 606  TRHGVSELGPEDSGRNIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVK 785
             RH VSELGPEDSGRNIVEIIFKSSWLK D PICKIERILKVHN  RT+ RFED R AVK
Sbjct: 227  VRHAVSELGPEDSGRNIVEIIFKSSWLKKDSPICKIERILKVHNTQRTIQRFEDCRDAVK 286

Query: 786  SRALPHSATGGGGACHNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPI 965
            +RAL  +            +RCAADGNELLRFHCT+L+C LG+ GS+SLCSN       +
Sbjct: 287  ARALQTT---------RKDARCAADGNELLRFHCTTLTCSLGSRGSSSLCSN-------L 330

Query: 966  VACGVCAIIRHGF----ARANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAG 1133
              CGVC +IRHGF           GVRTTASSGRA D   +   ++D  R MLVCRVIAG
Sbjct: 331  PTCGVCNVIRHGFQGKSGAGGATGGVRTTASSGRADD---LLRCSDDARRVMLVCRVIAG 387

Query: 1134 RVRR---PGDDPAAED---AYDSVAVAEAEADGA-------CGAYGNLEELLVANPRAIL 1274
            RV+R   P D PA E    A D++AV  + + GA        G Y NLEEL+V NPRAIL
Sbjct: 388  RVKRVDLPADSPATEQKSPAEDNLAVGVSSSSGAFDSVAVNAGVYSNLEELVVYNPRAIL 447

Query: 1275 PCFVVIYRALD 1307
            PCFVVIY+ L+
Sbjct: 448  PCFVVIYKVLE 458


>ref|NP_001183873.1| hypothetical protein [Zea mays] gi|238015166|gb|ACR38618.1| unknown
            [Zea mays] gi|414588164|tpg|DAA38735.1| TPA: hypothetical
            protein ZEAMMB73_389881 [Zea mays]
          Length = 459

 Score =  347 bits (891), Expect = 6e-93
 Identities = 219/409 (53%), Positives = 248/409 (60%), Gaps = 26/409 (6%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST--GEQVHDPASRLGRAFCGSSICAIRDVVHGNTRVVHRXXXXXXX 332
            +SWE  K+LLSCR+     +VHDP S L R   G+S+CA+RDVV  +             
Sbjct: 95   SSWEQVKSLLSCRTATAAARVHDP-SALAR-LRGASLCAMRDVVLSSAS----------- 141

Query: 333  XXXXXXXXXQNETAQLARSARHRPAPPVASLNCGRGGYYSPLPKLSGCYECHAVSVDSSS 512
                       +TA L R  R  P+   AS   G   ++S L  LSGCYEC A+ V+  S
Sbjct: 142  --------GDRDTAPLNR--RRAPSSSAAS-GAGSSSHHSSLRGLSGCYECRAIDVEPMS 190

Query: 513  RR-YPRPRETLCACSDCGEVFTKPETLELHQITRHGVSELGPEDSGRNIVEIIFKSSWLK 689
            RR YPRPRE LCAC  CGEVFT  ++LE HQ  RH VSELGPEDSGRNIV+IIF SSW K
Sbjct: 191  RRRYPRPRE-LCACPQCGEVFTMADSLEHHQAIRHAVSELGPEDSGRNIVDIIFNSSWQK 249

Query: 690  SDRPICKIERILKVHNPPRTVARFEDYRAAVKSRALPHSATGGGGACHNHRSRCAADGNE 869
              RPIC I+RILKVHN PRTVARFE YR AV+SR+   +A           +R AADGNE
Sbjct: 250  RGRPICHIDRILKVHNAPRTVARFEAYRDAVRSRSRCRAA-----------ARVAADGNE 298

Query: 870  LLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACGVCAIIRHGFA--RANRPHGVRTTA 1043
            LLRFH   L+C LG  G+T+LC   GG SSP   CGVC  IRHGFA      P GVRTTA
Sbjct: 299  LLRFHSAPLACALGLGGATALCC-AGGASSP---CGVCTAIRHGFAPWAGAHPLGVRTTA 354

Query: 1044 SSGRAHDCG-------------PMTDATEDRLRAMLVCRVIAGRVRRPGDDPAAEDA--- 1175
            SS RAHDCG             P  DA     RAMLVCRVIAGRVRR G   A  +A   
Sbjct: 355  SSARAHDCGCSASPSSVQQTQQPANDAA--ACRAMLVCRVIAGRVRRGGGGGATSNAADE 412

Query: 1176 -----YDSVAVAEAEADGACGAYGNLEELLVANPRAILPCFVVIYRALD 1307
                 +DSVAV +A +      YGNLEEL VANPRAILPCFVVIYR LD
Sbjct: 413  DQEGPFDSVAVEDAVSS---SMYGNLEELFVANPRAILPCFVVIYRVLD 458


>ref|XP_004238353.1| PREDICTED: uncharacterized protein LOC101261325 [Solanum
            lycopersicum]
          Length = 451

 Score =  345 bits (886), Expect = 2e-92
 Identities = 218/431 (50%), Positives = 251/431 (58%), Gaps = 48/431 (11%)
 Frame = +3

Query: 159  TSWEHFKNLLSCRST-GEQVHDPASR--------------LGRAFCGSSICAIRDVVHGN 293
            +SW+ FKNLL+C+      VHDPAS+                +    SSIC+ RDVVHGN
Sbjct: 56   SSWDQFKNLLTCKQIENSTVHDPASKNPPPSAAAAAPAGAYSKLSSCSSICSFRDVVHGN 115

Query: 294  TRVVHRXXXXXXXXXXXXXXXXQNETAQLA--RSARHRPAP----PVASLNCGRGGYYS- 452
            TRVVHR                  ET  L+  +++ H P+      +A  N      Y+ 
Sbjct: 116  TRVVHRADNSPESSSLG------QETRLLSKNKTSHHDPSSSSSRSLARSNGNGSSTYTT 169

Query: 453  -----PLPKLSGCYECHAVSVDSSSRRYPRPRETLCACSDCGEVFTKPETLELHQITRHG 617
                    KLSGCYECH + VD S  RYP PR T+CAC DCGEVF K E+LE HQ  +H 
Sbjct: 170  SSRGMQFRKLSGCYECHMI-VDPS--RYPLPRSTICACPDCGEVFPKIESLEHHQAVKHA 226

Query: 618  VSELGPEDSGRNIVEIIFKSSWLKSDRPICKIERILKVHNPPRTVARFEDYRAAVKSRAL 797
            VSELGPEDS RNIVEIIFKSSWLK D PICKIERILKVHN  RT+ RFED R AVK  A+
Sbjct: 227  VSELGPEDSSRNIVEIIFKSSWLKKDNPICKIERILKVHNTKRTIQRFEDCRDAVKIHAV 286

Query: 798  PHSATGGGGACHNHRSRCAADGNELLRFHCTSLSCPLGASGSTSLCSNTGGTSSPIVACG 977
               ATG          RCAADGNELLRF+CTSL+C LGA GS+SLC +  G       CG
Sbjct: 287  ---ATG------KKNPRCAADGNELLRFYCTSLTCVLGARGSSSLCGSVPG-------CG 330

Query: 978  VCAIIRHGFARANRPHGVRTTASSGRAHDCGPMTDATEDRLRAMLVCRVIAGRVRR-PGD 1154
            VC  IRHGF + N+  GVRTTASSGRAHDC     +   R RAM+VCRVIAGRV++  G 
Sbjct: 331  VCTTIRHGF-QGNKTSGVRTTASSGRAHDC---LGSGMARRRAMIVCRVIAGRVKQNVGA 386

Query: 1155 DPAAED--------------------AYDSVAVAEAEADGACGAYGNLEELLVANPRAIL 1274
             P  E+                     Y+SVA       G  G Y NLEEL V NPRAIL
Sbjct: 387  SPTKEEENCSGSGSGSGSGLSATGSTIYNSVA-------GHVGIYSNLEELYVFNPRAIL 439

Query: 1275 PCFVVIYRALD 1307
            PCFVVIY AL+
Sbjct: 440  PCFVVIYEALE 450

