BLASTX nr result

ID: Zingiber25_contig00033847 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00033847
         (1845 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   504   e-140
gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus pe...   501   e-139
ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   498   e-138
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   496   e-137
gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 ...   493   e-137
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   492   e-136
ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   491   e-136
ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   491   e-136
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   483   e-133
gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus...   482   e-133
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   478   e-132
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   475   e-131
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   473   e-130
gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]     471   e-130
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   467   e-129
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...   461   e-127
ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217...   459   e-126
ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr...   452   e-124
ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido...   436   e-119
gb|AAL59980.1| unknown protein [Arabidopsis thaliana]                 435   e-119

>ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
            gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase
            II, putative [Ricinus communis]
          Length = 478

 Score =  504 bits (1299), Expect = e-140
 Identities = 264/475 (55%), Positives = 323/475 (68%), Gaps = 2/475 (0%)
 Frame = +1

Query: 136  NSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRIADEQNN 315
            +SCCAICE SN ASIC  CVNYRLNEYS L+KSL   R   YS+L + L  K  AD+Q N
Sbjct: 5    SSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKADDQLN 64

Query: 316  WISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKKRTDVLD 495
            W   QNEKL  L ++L   K + ++ KAK  + S+DL +KY  LE +   ++K R D L+
Sbjct: 65   WRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRVDQLE 124

Query: 496  KFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDA-KKDGSNSMCDQICN 672
            K+  +LI  Q   +MAITSEL H  SV VKQICK+FP R+V  +  KKDGS+   DQICN
Sbjct: 125  KYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYDQICN 184

Query: 673  ARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQRASYWD 852
            ARLPRGLDPHS+P E+LAASLGYM+QLLNL+V  LAAP LHNSGFAGSCSRIWQR SYW+
Sbjct: 185  ARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWN 244

Query: 853  ACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXXXXFNYS 1032
            A P S+S EYPLFIPRQ +C++ GEN W++RSSSNFGVAS+               FNY+
Sbjct: 245  ARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSSFNYN 304

Query: 1033 LASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXXXXXXXX 1212
             ASPHS+E H DLQKGISL+KKSVAC+T Y YN+L LD+P+EASTFE F K         
Sbjct: 305  SASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATLSSSK 364

Query: 1213 XXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILS-TVDDNDFSNP 1389
                +  SLKMACS S KQ Q+L  S+WN          +       L+  ++DN+  N 
Sbjct: 365  EVRSV-FSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNLRNS 423

Query: 1390 NSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPKK 1554
             +SF+   E+ D  K +SL +GWD+VEHPTF PP S+TEDVEHW RAM+ID  KK
Sbjct: 424  ATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478


>gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica]
            gi|462422646|gb|EMJ26909.1| hypothetical protein
            PRUPE_ppa005050mg [Prunus persica]
          Length = 479

 Score =  501 bits (1289), Expect = e-139
 Identities = 267/482 (55%), Positives = 330/482 (68%), Gaps = 3/482 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M RKSSN  CAICE SNLAS+CA CVNYRL EY++ +K+L   R S YS+L + L  K  
Sbjct: 2    MNRKSSN--CAICESSNLASVCAICVNYRLTEYNSSLKALKSRRDSLYSRLTEALVAKGK 59

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   QNEKL +L ++L   K + V+ KAK+ + S DLK K   LE A  +++K 
Sbjct: 60   ADDQLNWRVLQNEKLVRLREKLRCNKEQLVQGKAKIEKTSYDLKVKSGVLESALAVLEKN 119

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAK-KDGSNSM 654
            R + L+KF  + I  Q   +MAITSE  H+QSV++KQICK+FP R+V  DAK KD S   
Sbjct: 120  RAEQLEKFYPNFICTQNLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKDASGGQ 179

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQICNA LPRGLDPHSVP E+LAASLGYM+QLLNL+V  LAAP LHNSGFAGSCSRIWQ
Sbjct: 180  YDQICNACLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLAAPALHNSGFAGSCSRIWQ 239

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYWDA P S+S EYPLFIPRQ++C++ GEN WS+RSSSNFGVAS+             
Sbjct: 240  RDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIDSERKPHLDSSGS 299

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNY+ AS HS+E H DLQ+GISLLKKSVACIT Y YN L LD+PSEASTFE F K   
Sbjct: 300  SSFNYTSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLA 359

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDDN 1374
                      +  SLKMACS S KQ QQL  S+WN          +++   ++  T +  
Sbjct: 360  TLSSSKEVHSV-FSLKMACSRSCKQVQQLNKSVWN-VNSAISSTTLLDSAHAMTMTKNLY 417

Query: 1375 DFSNPN--SSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVP 1548
            +++ P   +S +C+ E+ D  K +SL EGWD+VEHPTF PP S++ED+EHW RAM+ID  
Sbjct: 418  EYNLPTYATSSLCSTELSDSGKNESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFIDAK 477

Query: 1549 KK 1554
            +K
Sbjct: 478  RK 479


>ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera]
            gi|302141899|emb|CBI19102.3| unnamed protein product
            [Vitis vinifera]
          Length = 478

 Score =  498 bits (1282), Expect = e-138
 Identities = 269/482 (55%), Positives = 327/482 (67%), Gaps = 3/482 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M RK+S+  C+ICE SNLASICA CVNYRLNEY+  +KS    R S Y +L + L  K  
Sbjct: 1    MTRKTSS--CSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   QNEKL +L ++L + K + ++ KAKV + SNDLK KY  LE A  +++K 
Sbjct: 59   ADDQINWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDA-KKDGSNSM 654
            R + L+KF  +LI  Q    MAITSE FH+QSV++KQICK+FP R+VN D  KKDGS+  
Sbjct: 119  RVEQLEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRP 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQICN RLPR LDPHSVP ++LAASLGYM+QLLNL+V  LAAP LHNSGFAGSCSRIWQ
Sbjct: 179  YDQICNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYW+  P S+S EYPLFIPRQ+ C++ GEN WS RSSSNFG+AS+             
Sbjct: 239  RESYWNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGS 298

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNYS AS HS+E H DLQKGISLLKKSVAC+TTY Y+ L LD+P+EASTFE F K   
Sbjct: 299  SSFNYSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLA 358

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILST--VD 1368
                      +  SLKMACS S KQ QQL  S+WN          ++E   ++  T  + 
Sbjct: 359  ILSSSKEVRSV-FSLKMACSRSCKQVQQLNKSIWN-MNSAISSSTLLESAHTLPMTRNIF 416

Query: 1369 DNDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVP 1548
            DN+  N  +SF+ T EM D  K +SL E WD+VEH  F PP S+TED+EHW RAM ID  
Sbjct: 417  DNNLPNSAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDAT 476

Query: 1549 KK 1554
            KK
Sbjct: 477  KK 478


>ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca
            subsp. vesca]
          Length = 478

 Score =  496 bits (1277), Expect = e-137
 Identities = 267/480 (55%), Positives = 328/480 (68%), Gaps = 3/480 (0%)
 Frame = +1

Query: 124  RKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRIAD 303
            +KSSN  CAICE SNLASICA CVNYRLN+Y+  +K+L   R   YS+L D L  K  AD
Sbjct: 4    KKSSN--CAICENSNLASICAVCVNYRLNDYNNSLKALKSRRDLLYSRLSDALVAKGKAD 61

Query: 304  EQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKKRT 483
            +Q NW   Q+EKL +L ++L   K + V+ KAK+ + S DLK KY  LE A  +++K R 
Sbjct: 62   DQLNWRILQDEKLVRLREKLRRNKEQLVQGKAKIEKTSYDLKVKYGVLESALSMLEKNRA 121

Query: 484  DVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAK-KDGSNSMCD 660
            + L+KF  +LI  Q   +MAITSE  H+QSV++KQICK+FP R+V  DAK K+GS    D
Sbjct: 122  EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKEGSGGQYD 181

Query: 661  QICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQRA 840
            QICNA LPRGLDPHSVP E+LAASLGYM+QLLNL+V  L AP LHNSGFAGSCSRIWQR 
Sbjct: 182  QICNASLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLGAPALHNSGFAGSCSRIWQRD 241

Query: 841  SYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXXXX 1020
            SYWDA P S+S EYPLFIPRQ++C++ GEN WS+RSSSNFGVAS+               
Sbjct: 242  SYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIESERKPRLDSSGSSS 301

Query: 1021 FNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXXXX 1200
            FNYS AS HS+E H DLQ+GISLLKKSVACIT Y YN L LD+PSEASTFE F K     
Sbjct: 302  FNYSSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLSTL 361

Query: 1201 XXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVD--DN 1374
                    +  SLKMACS S KQ QQL  S+WN          +++   ++  T +  +N
Sbjct: 362  SSSKEVHSV-FSLKMACSRSCKQVQQLNKSVWN-VNSAISSTTLLDSAHTMTMTKNFYEN 419

Query: 1375 DFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPKK 1554
            +  N  +SF+ + EM D  K +   EGWD+VEHPT  PP S++ED+EHW RAM+IDV K+
Sbjct: 420  NIPNYATSFLSSTEMSDVGKNECTIEGWDLVEHPTLPPP-SQSEDIEHWTRAMFIDVTKR 478


>gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao]
          Length = 479

 Score =  493 bits (1270), Expect = e-137
 Identities = 268/482 (55%), Positives = 328/482 (68%), Gaps = 3/482 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M++K+SN  CAIC+ SN ASICA CVNYRLNEY++L+KSL   R   YSKL + L  KR 
Sbjct: 2    MSKKASN--CAICDNSNRASICAVCVNYRLNEYNSLLKSLKSRRDFLYSKLDEVLAAKRK 59

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   QNEKL  L+++L   K +  + KAK+   S DLK KY  LE A  +++K 
Sbjct: 60   ADDQLNWKILQNEKLTDLKEKLRRSKEQLAQGKAKIERVSYDLKVKYGVLESARGMLEKN 119

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAK-KDGSNSM 654
            R + L+KF  +LI  Q    MAITSE  H+QSV++KQICK+FP R+VN D + +DGS   
Sbjct: 120  RVEKLEKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVNLDGEGRDGSCGQ 179

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             D ICN  LPRGLDPHSVP EQLAASLGYM+QLLNL+V  LAAP LHNSGFAGSCSRIWQ
Sbjct: 180  YDLICNVGLPRGLDPHSVPSEQLAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQ 239

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYW+A P S+S EYPLFIPRQ++C++ G+N W++RSSSNFGVAS+             
Sbjct: 240  RDSYWNARPSSRSNEYPLFIPRQNYCSTSGDNSWTDRSSSNFGVASMESERRPRLDSSGS 299

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNYS AS H++E H DLQ GISLLKKSVACIT + YN L LD+P+EASTFE F K   
Sbjct: 300  NSFNYSSASSHTVETHKDLQIGISLLKKSVACITAFCYNSLCLDVPTEASTFEAFSKLLA 359

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDDN 1374
                      +  SLKMACS S KQAQQL  S+WN          ++E    +  T + +
Sbjct: 360  TLSSTKEVRSV-FSLKMACSRSSKQAQQLNKSVWN-VNSAMSSSMLLESAHMLPLTKNLS 417

Query: 1375 DFSNPNS--SFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVP 1548
            D + P+S  SF+   EM D  K +SL E WD+VEHPTF PP S+TEDVEHW RAM+ID  
Sbjct: 418  DHNLPSSAASFLFATEMPDIGKNESLIEEWDLVEHPTFPPPPSQTEDVEHWTRAMFIDAT 477

Query: 1549 KK 1554
            K+
Sbjct: 478  KR 479


>ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina]
            gi|567883029|ref|XP_006434073.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883031|ref|XP_006434074.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883033|ref|XP_006434075.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536194|gb|ESR47312.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536195|gb|ESR47313.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536196|gb|ESR47314.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536197|gb|ESR47315.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
          Length = 478

 Score =  492 bits (1266), Expect = e-136
 Identities = 260/481 (54%), Positives = 323/481 (67%), Gaps = 2/481 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M +K+SN  CAICE SN ASICA CVNYRL+E + L+KSL   R + Y +L + L  K  
Sbjct: 1    MNKKASN--CAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   QNEKL  L ++L   K +  + K K+ + S DLK +Y  L+ A  +++K 
Sbjct: 59   ADDQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDA-KKDGSNSM 654
            R + L+KF  ++I  Q   +MAI SEL H+QSV++KQICK+FP R+VN D  ++DGS+  
Sbjct: 119  RAEQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQIC ARLP+GLDPHSVP E+LAASLGYM+QLLNL+V  LA PVLHNSGFAGSCSRIWQ
Sbjct: 179  YDQICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYWDA P S+S EYPLFIPRQ++C++ GEN W++RSSSNFGVAS+             
Sbjct: 239  RDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRS 298

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNY+ AS HS+E H DLQKGISLLKKSVAC+T Y YN L LD+P+EASTFE F K   
Sbjct: 299  ASFNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLA 358

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQ-SILSTVDD 1371
                      +  SLKMACS S KQ Q+L  S+WN          +       I   + D
Sbjct: 359  TLSLSKEVRSV-FSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSD 417

Query: 1372 NDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPK 1551
            N+  +  +SF+   EM D  K +SL +GWD+VEHPTF PP S+TEDVEHW RAM ID  K
Sbjct: 418  NNLPSSAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATK 477

Query: 1552 K 1554
            K
Sbjct: 478  K 478


>ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|566157047|ref|XP_006386388.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|566157050|ref|XP_006386389.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|222843996|gb|EEE81543.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344610|gb|ERP64185.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344611|gb|ERP64186.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 475

 Score =  491 bits (1264), Expect = e-136
 Identities = 256/474 (54%), Positives = 315/474 (66%), Gaps = 1/474 (0%)
 Frame = +1

Query: 136  NSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRIADEQNN 315
            +SCCAICE SN ASIC  CVNYRLNEY  L+KSLN  R S YSKL   L  K  AD+Q N
Sbjct: 5    SSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQFN 64

Query: 316  WISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKKRTDVLD 495
            W   QNEKL    ++L   K +  + KAKV + S DLK K   LE A  +++K R + L+
Sbjct: 65   WRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQLE 124

Query: 496  KFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAKKDGSNSMCDQICNA 675
            KF  +LI  Q   +MAITSEL H+QSV++KQICK+FP R+VN D +++ S    DQICNA
Sbjct: 125  KFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGERNFSGQY-DQICNA 183

Query: 676  RLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQRASYWDA 855
            RLPRGLDPHSV  E+LAASLGYM+QLLNL+   LAAP LHN+GFAGSCSRIWQR SYW+A
Sbjct: 184  RLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSYWNA 243

Query: 856  CPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXXXXFNYSL 1035
            CP S+S EYPLFIPRQ++C++  EN W+++SSSNFGVAS+               FNYS 
Sbjct: 244  CPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNSFNYSS 303

Query: 1036 ASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXXXXXXXXX 1215
             SPHS+E H DLQKG+SLLKKSVAC+T Y YN+L LD+PS+ STFE F K          
Sbjct: 304  VSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKE 363

Query: 1216 XXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQ-SILSTVDDNDFSNPN 1392
               + N LKMACS S KQ Q+L  S+WN          +       ++    DN+  N  
Sbjct: 364  VRSVFN-LKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNLPNSA 422

Query: 1393 SSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPKK 1554
            +SF+    + D  K +S  +GWD+VEHPTF PP S+ ED+EHW RAM+ID  KK
Sbjct: 423  ASFLFATGISD-GKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475


>ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus
            sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED:
            uncharacterized protein LOC102626964 isoform X2 [Citrus
            sinensis]
          Length = 478

 Score =  491 bits (1263), Expect = e-136
 Identities = 259/481 (53%), Positives = 323/481 (67%), Gaps = 2/481 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M +K+SN  CAICE SN ASICA CVNYRL+E + L+KSL   R + Y +L + L  K  
Sbjct: 1    MNKKASN--CAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   QNEKL  L ++L   K +  + K K+ + S DLK +Y  L+ A  +++K 
Sbjct: 59   ADDQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKVRYAILDSARSMMEKN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDA-KKDGSNSM 654
            R + L+KF  ++I  Q   +MAI SEL H+QSV++KQICK+FP R+VN D  ++DGS+  
Sbjct: 119  RAEQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQIC ARLP+GLDPHSVP E+LAASLGYM+QLLNL+V  LA P+LHNSGFAGSCSRIWQ
Sbjct: 179  YDQICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPILHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYWDA P S+S EYPLFIPRQ++C++ GEN W++RSSSNFGVAS+             
Sbjct: 239  RDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRS 298

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNY+ AS HS+E H DLQKGISLLKKSVAC+T Y YN L LD+P+EASTFE F K   
Sbjct: 299  TSFNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLA 358

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQ-SILSTVDD 1371
                      +  SLKMACS S KQ Q+L  S+WN          +       I   + D
Sbjct: 359  TLSSSKEVRSV-FSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSD 417

Query: 1372 NDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPK 1551
            N+  +  +SF+   EM D  K +SL +GWD+VEHPTF PP S+TEDVEHW RAM ID  K
Sbjct: 418  NNLPSSAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATK 477

Query: 1552 K 1554
            K
Sbjct: 478  K 478


>ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoformX1 [Glycine
            max]
          Length = 475

 Score =  483 bits (1243), Expect = e-133
 Identities = 260/480 (54%), Positives = 322/480 (67%), Gaps = 1/480 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            MARK+SN  CAICE SN ASIC+ CVNYRLNEY+  +K L   R S Y KL + L +K  
Sbjct: 1    MARKTSN--CAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
             D+Q NW   Q+EKL +L+++L   K +  + +AK+   S DLK KY  LE A   ++K 
Sbjct: 59   GDDQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDA-KKDGSNSM 654
            R + L+KF  +LI  Q   ++AITSEL H++SV++KQICK+FP R+V  +  ++DG +  
Sbjct: 119  RVEQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQICNARLPR LDPHSVP E+L+ SLGYM+QLLNL++  LAAP LHNSGFAGSCSRIWQ
Sbjct: 179  YDQICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYWDA P S+S EYPLFIPRQ++C++ GEN WS RSSSNFGVASV             
Sbjct: 239  RDSYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGS 298

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNYSLAS HS++ H DLQKGISLLKKSV CIT Y YN L LD+PSEASTFE F K   
Sbjct: 299  TSFNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLA 358

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDDN 1374
                      +  SLKMA S + KQ QQL  S+WN          ++E   S+ +T  +N
Sbjct: 359  TLASSKEVRSV-FSLKMARSRTCKQVQQLNKSVWN-MNSAISSTTLLESAHSVPTTRIEN 416

Query: 1375 DFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPKK 1554
               +   SF+  A++ D  K + L EGWDIVEHPTF PP S++EDVEHW RAM+ID   K
Sbjct: 417  YLPSSTGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 475


>gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris]
            gi|561014256|gb|ESW13117.1| hypothetical protein
            PHAVU_008G169200g [Phaseolus vulgaris]
          Length = 476

 Score =  482 bits (1241), Expect = e-133
 Identities = 262/481 (54%), Positives = 328/481 (68%), Gaps = 2/481 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            MARK+SN  CAICE SN ASIC+ CVNYRLNEY+  +KSL   R S YSKL + L +K  
Sbjct: 1    MARKTSN--CAICENSNQASICSICVNYRLNEYNTSLKSLKDRRDSLYSKLSEVLVQKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
             D+Q N+I  QNEKL +L+++L   K +  + +AK+   S DLK KY  LE A   ++K 
Sbjct: 59   GDDQENYIVLQNEKLARLKEKLHRSKEQVTQGRAKIETVSADLKHKYGLLESALSTLEKN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAK-KDGSNSM 654
            R + L+KF  +LI  Q   ++AITSE  H+QSV++KQICK+FP R+V  + + +DG +  
Sbjct: 119  RVEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEIRDGCSGQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQICNARLPR LDPHSVP E+L+ASLGYM+QLLNL+V  LAAP LHNSGFAGSCSRIWQ
Sbjct: 179  YDQICNARLPRALDPHSVPSEELSASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWS-NRSSSNFGVASVXXXXXXXXXXXX 1011
            R SYWDA P S+S EYPLFIPRQ++C++ GEN WS ++SSSNFGVAS+            
Sbjct: 239  RDSYWDARPSSRSNEYPLFIPRQNYCSTAGENSWSTDKSSSNFGVASMESEKRNRLDSSG 298

Query: 1012 XXXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXX 1191
               FNYSLAS HS++ H DLQKGISLLKKSVACIT Y YN L LD PSEASTFE+F K  
Sbjct: 299  NSNFNYSLASLHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFESFAKLL 358

Query: 1192 XXXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDD 1371
                       +  SLKMA S + KQ QQL  S+WN          ++E   S+ +T  +
Sbjct: 359  ATLSSSKEVRSV-FSLKMAQSRTCKQVQQLNKSVWN-MNSVISSTTLLESAHSVPTTRIE 416

Query: 1372 NDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPK 1551
            N   +  +SF+   ++ D  K + L EGWDI+EHPTF PP S++EDVEHW RAM+ID  +
Sbjct: 417  NYLPSSTASFLYATDLND-GKNECLIEGWDIIEHPTFPPPPSQSEDVEHWTRAMFIDAKR 475

Query: 1552 K 1554
            K
Sbjct: 476  K 476


>ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max]
          Length = 474

 Score =  478 bits (1230), Expect = e-132
 Identities = 260/480 (54%), Positives = 321/480 (66%), Gaps = 1/480 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            MARK+SN  CAICE SN ASIC+ CVNYRLNEY+  +K L   R S YSKL + L +K  
Sbjct: 1    MARKTSN--CAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYSKLSEVLVRKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
             D+Q NW   Q+EKL +L+++L   K +  + +AK+  +S DLK KY  LE A   ++K 
Sbjct: 59   GDDQANWRVLQHEKLARLKEKLRQGKEQVTQGRAKIETKSADLKLKYGLLESALSTLEKN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAKK-DGSNSM 654
            R + L+KF  +LI  Q   ++AITSE  H+QSV++KQICK+FP R+V  + ++ DG    
Sbjct: 119  RVEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERGDGCCGQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQICNARLPR LDP SVP E+L+ SLGYM+QLLNLIV  LAAP LHNSGFAGSCSRIWQ
Sbjct: 179  FDQICNARLPRALDPRSVPSEELSTSLGYMVQLLNLIVHNLAAPALHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYWDA P S+S EYPLFIPRQ++C++GGEN WS RSSSNFGVAS+             
Sbjct: 239  RDSYWDARPSSRSNEYPLFIPRQNYCSTGGENSWSERSSSNFGVASMESERRHRLDSSGS 298

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNYSLAS HS++ H DLQKGISLLKKSVACIT Y YN L LD+PSEASTFE F K   
Sbjct: 299  SSFNYSLASSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLA 358

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDDN 1374
                      +  SLKM  S + KQ QQL  S+WN          ++E   S+ +T  +N
Sbjct: 359  TLSSSKEVRSV-FSLKMPRSRTCKQVQQLNKSVWN-MNSAISSTTLLESAHSVPTTRIEN 416

Query: 1375 DFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPKK 1554
               +  +SF+   +     K + L EGWDIVEHPTF PP S++EDVEHW RAM+ID  +K
Sbjct: 417  YLPSATASFLYATDSDG--KNECLVEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 474


>ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|550344612|gb|ERP64187.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 506

 Score =  475 bits (1222), Expect = e-131
 Identities = 256/505 (50%), Positives = 315/505 (62%), Gaps = 32/505 (6%)
 Frame = +1

Query: 136  NSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRIADEQNN 315
            +SCCAICE SN ASIC  CVNYRLNEY  L+KSLN  R S YSKL   L  K  AD+Q N
Sbjct: 5    SSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQFN 64

Query: 316  WISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKKRTDVLD 495
            W   QNEKL    ++L   K +  + KAKV + S DLK K   LE A  +++K R + L+
Sbjct: 65   WRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQLE 124

Query: 496  KFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAKKDGSNSMCDQICNA 675
            KF  +LI  Q   +MAITSEL H+QSV++KQICK+FP R+VN D +++ S    DQICNA
Sbjct: 125  KFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGERNFSGQY-DQICNA 183

Query: 676  RLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQRASYWDA 855
            RLPRGLDPHSV  E+LAASLGYM+QLLNL+   LAAP LHN+GFAGSCSRIWQR SYW+A
Sbjct: 184  RLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSYWNA 243

Query: 856  CPFSQ-------------------------------SKEYPLFIPRQSFCTSGGENLWSN 942
            CP S+                               S EYPLFIPRQ++C++  EN W++
Sbjct: 244  CPSSRRYFDWKSLCFGISVAKFELLLLSELNILCACSNEYPLFIPRQNYCSTSSENSWTD 303

Query: 943  RSSSNFGVASVXXXXXXXXXXXXXXXFNYSLASPHSLENHTDLQKGISLLKKSVACITTY 1122
            +SSSNFGVAS+               FNYS  SPHS+E H DLQKG+SLLKKSVAC+T Y
Sbjct: 304  KSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVACVTAY 363

Query: 1123 WYNILDLDMPSEASTFETFGKXXXXXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNE 1302
             YN+L LD+PS+ STFE F K             + N LKMACS S KQ Q+L  S+WN 
Sbjct: 364  CYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEVRSVFN-LKMACSRSCKQVQKLNKSVWNV 422

Query: 1303 XXXXXXXXXIVEGEQ-SILSTVDDNDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPT 1479
                     +       ++    DN+  N  +SF+    + D  K +S  +GWD+VEHPT
Sbjct: 423  NSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGISD-GKNESFIDGWDLVEHPT 481

Query: 1480 FSPPLSETEDVEHWLRAMYIDVPKK 1554
            F PP S+ ED+EHW RAM+ID  KK
Sbjct: 482  FPPPPSQVEDIEHWTRAMFIDATKK 506


>ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum
            tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED:
            uncharacterized protein LOC102590673 isoform X2 [Solanum
            tuberosum]
          Length = 483

 Score =  473 bits (1216), Expect = e-130
 Identities = 247/479 (51%), Positives = 314/479 (65%), Gaps = 7/479 (1%)
 Frame = +1

Query: 139  SCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRIADEQNNW 318
            SCC ICE SNL S+C  CVNYRLNEYS ++KSL   R++   KL + L  K  AD+Q +W
Sbjct: 6    SCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGKLSEILLAKGKADDQLSW 65

Query: 319  ISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKKRTDVLDK 498
               +NEKL +L ++L   K +  + KAK+ + S+DLK +YE L  A  +++K R + L+K
Sbjct: 66   RVPRNEKLARLREKLRQQKEQISQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQLEK 125

Query: 499  FNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDA-KKDGSNSMCDQICNA 675
            F  +LI  Q   +MAITSEL H+QSV+VKQICK+FP R+V  D  KKDGS+   D ICNA
Sbjct: 126  FYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSICNA 185

Query: 676  RLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQRASYWDA 855
            RLP+GLDPHSVP ++L+ASLGYM+QLLNL++  + AP LHNSGFAGSCSRIWQR SYWDA
Sbjct: 186  RLPKGLDPHSVPSDELSASLGYMVQLLNLVIRCVCAPALHNSGFAGSCSRIWQRDSYWDA 245

Query: 856  CPFSQSKEYPLFIPRQSFCTSGGENLWSNRS------SSNFGVASVXXXXXXXXXXXXXX 1017
             P S+S EYPLFIPRQ+FC+SGGE  W +RS      SSNFGV S+              
Sbjct: 246  RPSSRSGEYPLFIPRQNFCSSGGEASWYDRSCSNSGTSSNFGVTSMESDRKPRLDSSSSS 305

Query: 1018 XFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXXX 1197
             FNY+ AS HS+E H DLQKGI+LLKKSVACIT Y YN L L++P+EASTFETF +    
Sbjct: 306  SFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARLLAT 365

Query: 1198 XXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDDND 1377
                     +  SLKM+ S + KQ Q L  S+WN          +  G   +L    +N 
Sbjct: 366  LSSSKEVRSV-FSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGHVPVLRNTFENA 424

Query: 1378 FSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPKK 1554
              + + + I   E+ D  + ++L E WD++EHP F PP S TEDVEHW RAM+ID  KK
Sbjct: 425  LPSSSGNLIYATEVSDARRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATKK 483


>gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]
          Length = 478

 Score =  471 bits (1213), Expect = e-130
 Identities = 257/482 (53%), Positives = 317/482 (65%), Gaps = 3/482 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M RKS++  CA+CE SNL SIC+ CVNYRL ++  ++KS    R S YS+L + L  K  
Sbjct: 1    MNRKSTS--CALCENSNLPSICSICVNYRLADHYNILKSNKSHRDSLYSRLEEVLLAKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q  W   QNEKL KL ++    K + V+ KAKV     DLK K   LE A  +++  
Sbjct: 59   ADDQVGWRMSQNEKLAKLREKHRRSKERLVQGKAKVERMHYDLKVKSGVLEAARSMLENN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDA-KKDGSNSM 654
            R + L+KF  + I  Q   +MAITSE  H+QSV++KQICK+FP R+V  D  +K+GS   
Sbjct: 119  RMEQLEKFYPNFICTQTLGHMAITSERLHKQSVVIKQICKLFPHRRVIIDGERKNGSAEQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQICNARLPRG+DPHSV  E+L ASLGYM+QLLNLIV ILAAP LHNSGFAGS SRIWQ
Sbjct: 179  YDQICNARLPRGVDPHSVASEELGASLGYMVQLLNLIVRILAAPALHNSGFAGSNSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYWDA P S+S EYPLFIPRQ++C++  EN WS+RSSSNFGV S+             
Sbjct: 239  RDSYWDARPSSRSNEYPLFIPRQNYCSTSVENSWSDRSSSNFGVTSIESERKVRLDSSGS 298

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNYS ASPHS+E H DLQKGISLLKKSVACITTY YN L LD+PSEASTFE F K   
Sbjct: 299  NSFNYSSASPHSIETHKDLQKGISLLKKSVACITTYCYNSLCLDVPSEASTFEAFAKLLA 358

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILS--TVD 1368
                      +  S+K ACS S KQ QQL  S+WN          +++   ++ S   + 
Sbjct: 359  TLSSSKELRSV-CSIKSACSRSNKQVQQLNKSVWN-VNSAFASTTLLDSAHTVASMKNIG 416

Query: 1369 DNDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVP 1548
            +N+  NP +SF+   E  D  K + + EGWD++EHPTF PP S+ EDVEHW RAM+ID  
Sbjct: 417  ENNLPNPATSFLYATES-DAGKNEFIIEGWDLIEHPTFPPPPSQCEDVEHWTRAMFIDAT 475

Query: 1549 KK 1554
            KK
Sbjct: 476  KK 477


>ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula]
            gi|355516236|gb|AES97859.1| hypothetical protein
            MTR_5g061040 [Medicago truncatula]
          Length = 501

 Score =  467 bits (1201), Expect = e-129
 Identities = 256/489 (52%), Positives = 320/489 (65%), Gaps = 14/489 (2%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            MARKS+N  CAICE  N  SIC+ CVNYRLNEY++ +KSL   R S YSKL + L +K  
Sbjct: 1    MARKSTN--CAICENLNQPSICSVCVNYRLNEYNSSLKSLKERRDSLYSKLSEVLVRKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
             D+Q NW   ++EKL +  ++L + K +  + +AK+   S DLK KY  LE A  +++K 
Sbjct: 59   GDDQTNWRVLRHEKLARSREKLRHNKEQVTQGRAKIQAMSADLKLKYGVLESALSMLEKN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAKK-DGSNSM 654
            R + L+KF  +LI  Q   ++AITSE  H+QSV++KQICK+FP R+V  + +K D  +  
Sbjct: 119  RVEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQICNARLPR LDPHSVP E+L+ASLGYM+QLLNL+   LAAP LHNSGFAGSCSRIWQ
Sbjct: 179  YDQICNARLPRALDPHSVPSEELSASLGYMVQLLNLVAHNLAAPALHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSK-------------EYPLFIPRQSFCTSGGENLWSNRSSSNFGVASV 975
            R SYWDA P S+SK             EYPLFIPRQ++C++ GEN WS +SSSNFGVAS+
Sbjct: 239  RDSYWDARPSSRSKNFFNLKYSLFFSNEYPLFIPRQNYCSTSGENSWSEKSSSNFGVASM 298

Query: 976  XXXXXXXXXXXXXXXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPS 1155
                           FNYSLAS HS+++H DLQKGISLLKKSVACIT Y YN L  D+PS
Sbjct: 299  ESDRRPRLDSSGSSSFNYSLASSHSVQSHKDLQKGISLLKKSVACITAYCYNSLCFDIPS 358

Query: 1156 EASTFETFGKXXXXXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIV 1335
            EASTFE F K             +  SLKMA S + KQ QQL  S+WN          ++
Sbjct: 359  EASTFEAFAKLLATLSSSKEVRSV-FSLKMARSRTCKQVQQLNKSVWN-MNSANSSTTLL 416

Query: 1336 EGEQSILSTVDDNDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVE 1515
            E   S+ +T  +N   N  +SF+   +  D  K + L EGWDIVEHPT  PP S++EDVE
Sbjct: 417  ESTHSVPTTRIENYMPNSAASFLYPTDSSDR-KSECLIEGWDIVEHPTLPPPPSQSEDVE 475

Query: 1516 HWLRAMYID 1542
            HW RAM+ID
Sbjct: 476  HWTRAMFID 484


>ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum
            lycopersicum]
          Length = 481

 Score =  461 bits (1187), Expect = e-127
 Identities = 246/486 (50%), Positives = 315/486 (64%), Gaps = 7/486 (1%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M RK+S  CC ICE SNL S+C  CVNYRLNEYS ++KSL   R++   +L + L  K  
Sbjct: 1    MTRKTS--CCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGQLSEILLAKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q +W   +NEKL +L ++L   K +  + KAK+ + S+DLK +YE L  A  +++K 
Sbjct: 59   ADDQLSWRVPRNEKLARLREKLRQQKEQVSQGKAKIEKMSHDLKVQYELLGSATRMLEKN 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDA-KKDGSNSM 654
            R + L+KF  +LI  Q   +MAITSEL H+QSV+VKQICK+FP R+V  D  KKDGS+  
Sbjct: 119  RAEQLEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             D ICNARLP+GLDPHSVP ++L+ASLGYM+QLLNL+V  + AP LHNSGFAGSCSRIWQ
Sbjct: 179  YDSICNARLPKGLDPHSVPSDELSASLGYMVQLLNLVVRCVCAPALHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRS------SSNFGVASVXXXXXXX 996
            R SYWDA P S+S EYPLFIPRQ+FC+SGGE  W +RS      SSNFGV S+       
Sbjct: 239  RDSYWDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSSSNSGTSSNFGVTSMESDRKPR 298

Query: 997  XXXXXXXXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFET 1176
                    FNY+ AS HS+E H DLQKGI+LLKKSVACIT Y YN L L++P+EASTFET
Sbjct: 299  LDSSSSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFET 358

Query: 1177 FGKXXXXXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSIL 1356
            F +             +  SLKM+ S + KQ Q L  S+WN          +  G   + 
Sbjct: 359  FARLLATLSSSKEVRSV-FSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGH--VP 415

Query: 1357 STVDDNDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMY 1536
                +    +   + +   E+ +  + ++L E WD++EHP F PP S TEDVEHW RAM+
Sbjct: 416  RNTFEKSLPSSGGNLMYATEVSNVGRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMF 475

Query: 1537 IDVPKK 1554
            ID  KK
Sbjct: 476  IDATKK 481


>ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217421 [Cucumis sativus]
            gi|449524750|ref|XP_004169384.1| PREDICTED:
            uncharacterized LOC101217421 [Cucumis sativus]
          Length = 476

 Score =  459 bits (1181), Expect = e-126
 Identities = 245/480 (51%), Positives = 309/480 (64%), Gaps = 1/480 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M RK  N  CAICE SN ASIC  CVN RLN+Y++ +KSL   R   YS+L D L  K  
Sbjct: 1    MNRKFCN--CAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   +NEKL  L ++L   + +  + KA++  +S DL+ KY  LE A  +++K+
Sbjct: 59   ADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQLKYAMLESARSVLEKQ 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAKKD-GSNSM 654
            R + L+K   DLI  +   +MAITSE  H+QSV++KQ+CK+FP R+V    +K+ G    
Sbjct: 119  RLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEP 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             DQICN  LPR LDPHSV P +L+ASLGYM+QLLNL+V  LAAP LH SGFAGSCSRIWQ
Sbjct: 179  FDQICNVSLPRSLDPHSVEPYELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASVXXXXXXXXXXXXX 1014
            R SYW+ACP S+S EYP+F+PRQS+C++ GEN WS++SSSNFGVAS+             
Sbjct: 239  RDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLEN 298

Query: 1015 XXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXXX 1194
              FNYS ASPHS+E+H DLQKGI+LLKKSVAC+T Y YN L LD+PSEASTFE F K   
Sbjct: 299  RSFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLA 358

Query: 1195 XXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDDN 1374
                      +  SLKMA S S K  Q+   S WN             G   I+ T  ++
Sbjct: 359  TLSSSKEVRSV-FSLKMASSRSTKHIQKPIKSTWN-VNSIASSMLFESGHSQIMKTNYES 416

Query: 1375 DFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPKK 1554
            +  +  SS++   E  D  K  S  EGWD+VEHPTF PP S+ ED+EHW RAM ID  K+
Sbjct: 417  NLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ 476


>ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum]
            gi|557098297|gb|ESQ38733.1| hypothetical protein
            EUTSA_v10028627mg [Eutrema salsugineum]
          Length = 474

 Score =  452 bits (1162), Expect = e-124
 Identities = 245/481 (50%), Positives = 318/481 (66%), Gaps = 2/481 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M ++SSN  CAICE +N ASIC+ CVNYRL EYS L+KSL   R + YSKL + LE K  
Sbjct: 1    MIKRSSN--CAICENTNRASICSVCVNYRLIEYSTLLKSLKTRRDALYSKLSELLEAKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   QNEKL  L+  L   K +  + KAK+  +S DLK KY  L+ A   +++ 
Sbjct: 59   ADDQKNWKLIQNEKLSGLKNNLRRNKEQVTQGKAKIERESRDLKLKYGVLDSARSTLERI 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAK-KDGSNSM 654
            R + ++K+  +LI  Q   +MAI+SE  H+QSV++KQ+CK+FP R+V+ D + ++GS   
Sbjct: 119  RVEQVEKYFPNLICTQSLGHMAISSERLHKQSVVMKQVCKLFPQRRVSFDGESQNGSVGQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             + ICN+RLP+GLDPHS+P E+LAASLG M+QLLNL+V  LAAP LHNSGFAGSCSRIWQ
Sbjct: 179  YNLICNSRLPKGLDPHSIPSEELAASLGLMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASV-XXXXXXXXXXXX 1011
            R SYWDA P ++S EYPLFIPRQ++C++  EN W++++SSNFGVAS+             
Sbjct: 239  RDSYWDARPSTRSNEYPLFIPRQNYCSTSVENSWTDKNSSNFGVASMESDRKEARLDSTG 298

Query: 1012 XXXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXX 1191
               FNYS ASPHS+E+H DLQKGI+LLKKSVAC+T Y YN L L++P EASTFE F K  
Sbjct: 299  RNSFNYSSASPHSVESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLL 358

Query: 1192 XXXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDD 1371
                       +  SLKMA S S KQAQQL  S+WN          I+E    +      
Sbjct: 359  ATLSSSKEVRSV-FSLKMASSRSCKQAQQLNKSIWN--AHSVISSSILESSH-LPRNASY 414

Query: 1372 NDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPK 1551
            N   N  +S++   E+ +  K   +  GWD+VEHP + PP S++EDVEHW RAM+ID  K
Sbjct: 415  NQDPNSAASYLSGTELSEIRKSNDM-NGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKK 473

Query: 1552 K 1554
            K
Sbjct: 474  K 474


>ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana]
            gi|23297675|gb|AAN13006.1| unknown protein [Arabidopsis
            thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA
            polymerase II protein [Arabidopsis thaliana]
          Length = 473

 Score =  436 bits (1120), Expect = e-119
 Identities = 236/481 (49%), Positives = 314/481 (65%), Gaps = 2/481 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M ++SSN  CAIC+ +N   IC  CVN+RL EY+ L+KSL   R S  S+  + LE K  
Sbjct: 1    MTKRSSN--CAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   QNEK+ KL+++L   K    + K K+   S+DLK KY  L+ A   ++K 
Sbjct: 59   ADDQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKT 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAK-KDGSNSM 654
            R + ++K+  +LI  Q   +MAI+SE  H+QSV+VKQICK+FPLR+V+ D + ++GS   
Sbjct: 119  RVEQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             D ICN+RLP GLDPHS+P E+LA SLGYM+QLLNL+V  LAAP LH+SGFAGSCSRIWQ
Sbjct: 179  YDVICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASV-XXXXXXXXXXXX 1011
            R SYWD    ++S EYPLFIPR+++C++  EN W++++SSNFGVAS+             
Sbjct: 239  RDSYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPG 298

Query: 1012 XXXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXX 1191
               F YS ASPHS+E+H DLQKGI+LLKKSVAC+T Y YN L L++P EASTFE F K  
Sbjct: 299  SNSFKYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLL 358

Query: 1192 XXXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDD 1371
                       +  SLKMA S S KQAQQL  S+WN          ++E      +T  +
Sbjct: 359  ATLSSSKEVRSV-FSLKMASSRSGKQAQQLNKSIWN--AHSVISSSLLESAHLPRNTSYN 415

Query: 1372 NDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPK 1551
             D ++P +S++   E+    +  +   GWD+VEHP + PP S++EDVEHW RAM+ID  K
Sbjct: 416  QDPNSP-ASYLSATEL--STRKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKK 472

Query: 1552 K 1554
            K
Sbjct: 473  K 473


>gb|AAL59980.1| unknown protein [Arabidopsis thaliana]
          Length = 473

 Score =  435 bits (1118), Expect = e-119
 Identities = 236/481 (49%), Positives = 314/481 (65%), Gaps = 2/481 (0%)
 Frame = +1

Query: 118  MARKSSNSCCAICEGSNLASICAPCVNYRLNEYSALMKSLNIVRQSYYSKLFDFLEKKRI 297
            M ++SSN  CAIC+ +N   IC  CVN+RL EY+ L+KSL   R S  S+  + LE K  
Sbjct: 1    MTKRSSN--CAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGK 58

Query: 298  ADEQNNWISDQNEKLKKLEQRLTYLKAKRVEDKAKVAEQSNDLKSKYESLELAFEIIKKK 477
            AD+Q NW   QNEK+ KL+++L   K    + K K+   S+DLK KY  L+ A   ++K 
Sbjct: 59   ADDQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKT 118

Query: 478  RTDVLDKFNTDLIYNQRSAYMAITSELFHRQSVIVKQICKIFPLRKVNSDAK-KDGSNSM 654
            R + ++K+  +LI  Q   +MAI+SE  H+QSV+VKQICK+FPLR+V+ D + ++GS   
Sbjct: 119  RVEQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQ 178

Query: 655  CDQICNARLPRGLDPHSVPPEQLAASLGYMIQLLNLIVPILAAPVLHNSGFAGSCSRIWQ 834
             D ICN+RLP GLDPHS+P E+LA SLGYM+QLLNL+V  LAAP LH+SGFAGSCSRIWQ
Sbjct: 179  YDVICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQ 238

Query: 835  RASYWDACPFSQSKEYPLFIPRQSFCTSGGENLWSNRSSSNFGVASV-XXXXXXXXXXXX 1011
            R SYWD    ++S EYPLFIPR+++C++  EN W++++SSNFGVAS+             
Sbjct: 239  RDSYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPG 298

Query: 1012 XXXFNYSLASPHSLENHTDLQKGISLLKKSVACITTYWYNILDLDMPSEASTFETFGKXX 1191
               F YS ASPHS+E+H DLQKGI+LLKKSVAC+T Y YN L L++P EASTFE F K  
Sbjct: 299  SNSFMYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLL 358

Query: 1192 XXXXXXXXXXXIRNSLKMACSSSEKQAQQLKSSMWNEXXXXXXXXXIVEGEQSILSTVDD 1371
                       +  SLKMA S S KQAQQL  S+WN          ++E      +T  +
Sbjct: 359  ATLSSSKEVRSV-FSLKMASSRSGKQAQQLNKSIWN--AHSVISSSLLESAHLPRNTSYN 415

Query: 1372 NDFSNPNSSFICTAEMIDYVKPKSLEEGWDIVEHPTFSPPLSETEDVEHWLRAMYIDVPK 1551
             D ++P +S++   E+    +  +   GWD+VEHP + PP S++EDVEHW RAM+ID  K
Sbjct: 416  QDPNSP-ASYLSATEL--STRKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKK 472

Query: 1552 K 1554
            K
Sbjct: 473  K 473


Top