BLASTX nr result

ID: Stemona21_contig00008443 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00008443
         (1939 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus pe...   600   e-169
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   599   e-168
gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 ...   598   e-168
ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   597   e-168
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   592   e-166
ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   590   e-166
ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   583   e-164
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   577   e-162
ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   574   e-161
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   574   e-161
gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus...   571   e-160
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   565   e-158
gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]     565   e-158
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   558   e-156
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...   550   e-153
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   548   e-153
ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido...   541   e-151
gb|AAL59980.1| unknown protein [Arabidopsis thaliana]                 541   e-151
ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr...   538   e-150
ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217...   524   e-146

>gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica]
            gi|462422646|gb|EMJ26909.1| hypothetical protein
            PRUPE_ppa005050mg [Prunus persica]
          Length = 479

 Score =  600 bits (1548), Expect = e-169
 Identities = 306/479 (63%), Positives = 369/479 (77%), Gaps = 7/479 (1%)
 Frame = -3

Query: 1748 MTGRSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKA 1569
            M  R S +CAICE SNLA+VCA CVN+RL EYN +LK+LKS RDSL+S+L   L+A+ KA
Sbjct: 1    MMNRKSSNCAICESSNLASVCAICVNYRLTEYNSSLKALKSRRDSLYSRLTEALVAKGKA 60

Query: 1568 DEQTSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKR 1389
            D+Q +WRVLQ EK ++++E+L+  +++L Q +AK+++ S DLKVK G+LESA + L+K R
Sbjct: 61   DDQLNWRVLQNEKLVRLREKLRCNKEQLVQGKAKIEKTSYDLKVKSGVLESALAVLEKNR 120

Query: 1388 AEVLEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSY 1209
            AE LEK+YPN I TQ+LG++AI  ERLHKQSVVIKQIC+L P R+V    + KD S G Y
Sbjct: 121  AEQLEKFYPNFICTQNLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKDASGGQY 180

Query: 1208 DQICNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQR 1029
            DQICNA LPRGLDPHSVPSEELAASLGYM+QLLNLVV +LAAP+LHN+GFAGSCSRIWQR
Sbjct: 181  DQICNACLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLAAPALHNSGFAGSCSRIWQR 240

Query: 1028 DSYWDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXX 849
            DSYWDARPSS+S EYPLFIPRQN+CS+ GENSWS+RSSSNFGVAS++SERKP+LD +   
Sbjct: 241  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIDSERKPHLDSSGSS 300

Query: 848  XXXXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLAT 669
                      SVE+HKDLQ+ +SLLKKSVACITAY YNS+ LDVPSEASTFEAFA+LLAT
Sbjct: 301  SFNYTSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLAT 360

Query: 668  LSSS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNN 510
            LSSS       +LK  CSRS K+ Q+LNK                 +H   +T N    N
Sbjct: 361  LSSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHAMTMTKNLYEYN 420

Query: 509  LSNSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            L   A S L STE++  G++ESLVEGWD+VEHPT PPPPSQ ED+EHWTRAMF DA +K
Sbjct: 421  LPTYATSSLCSTELSDSGKNESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFIDAKRK 479


>ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca
            subsp. vesca]
          Length = 478

 Score =  599 bits (1545), Expect = e-168
 Identities = 303/479 (63%), Positives = 372/479 (77%), Gaps = 7/479 (1%)
 Frame = -3

Query: 1748 MTGRSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKA 1569
            MT + S +CAICE SNLA++CA CVN+RL +YN +LK+LKS RD L+S+L+  L+A+ KA
Sbjct: 1    MTNKKSSNCAICENSNLASICAVCVNYRLNDYNNSLKALKSRRDLLYSRLSDALVAKGKA 60

Query: 1568 DEQTSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKR 1389
            D+Q +WR+LQ EK ++++E+L++ +++L Q +AK+++ S DLKVK G+LESA S L+K R
Sbjct: 61   DDQLNWRILQDEKLVRLREKLRRNKEQLVQGKAKIEKTSYDLKVKYGVLESALSMLEKNR 120

Query: 1388 AEVLEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSY 1209
            AE LEK+YPNLI TQSLG++AI  ERLHKQSVVIKQIC+L P R+V    + K+GS G Y
Sbjct: 121  AEQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKEGSGGQY 180

Query: 1208 DQICNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQR 1029
            DQICNA LPRGLDPHSVPSEELAASLGYM+QLLNLVV +L AP+LHN+GFAGSCSRIWQR
Sbjct: 181  DQICNASLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLGAPALHNSGFAGSCSRIWQR 240

Query: 1028 DSYWDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXX 849
            DSYWDARPSS+S EYPLFIPRQN+CS+ GENSWS+RSSSNFGVAS+ESERKP LD +   
Sbjct: 241  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIESERKPRLDSSGSS 300

Query: 848  XXXXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLAT 669
                      SVE+HKDLQ+ +SLLKKSVACITAY YNS+ LDVPSEASTFEAFA+LL+T
Sbjct: 301  SFNYSSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLST 360

Query: 668  LSSS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNN 510
            LSSS       +LK  CSRS K+ Q+LNK                 +H   +T N   NN
Sbjct: 361  LSSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHTMTMTKNFYENN 420

Query: 509  LSNSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            + N A SFL STE++  G++E  +EGWD+VEHPTL PPPSQ ED+EHWTRAMF D TK+
Sbjct: 421  IPNYATSFLSSTEMSDVGKNECTIEGWDLVEHPTL-PPPSQSEDIEHWTRAMFIDVTKR 478


>gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao]
          Length = 479

 Score =  598 bits (1541), Expect = e-168
 Identities = 301/479 (62%), Positives = 373/479 (77%), Gaps = 7/479 (1%)
 Frame = -3

Query: 1748 MTGRSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKA 1569
            M  + + +CAIC+ SN A++CA CVN+RL EYN  LKSLKS RD L+SKL+ VL A++KA
Sbjct: 1    MMSKKASNCAICDNSNRASICAVCVNYRLNEYNSLLKSLKSRRDFLYSKLDEVLAAKRKA 60

Query: 1568 DEQTSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKR 1389
            D+Q +W++LQ EK   +KE+L++ +++L+Q +AK++RVS DLKVK G+LESA   L+K R
Sbjct: 61   DDQLNWKILQNEKLTDLKEKLRRSKEQLAQGKAKIERVSYDLKVKYGVLESARGMLEKNR 120

Query: 1388 AEVLEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSY 1209
             E LEK+YPNLI TQSLG +AI  ERLHKQSVVIKQIC+L P R+VN  GE +DGS G Y
Sbjct: 121  VEKLEKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVNLDGEGRDGSCGQY 180

Query: 1208 DQICNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQR 1029
            D ICN  LPRGLDPHSVPSE+LAASLGYM+QLLNLVV +LAAP+LHN+GFAGSCSRIWQR
Sbjct: 181  DLICNVGLPRGLDPHSVPSEQLAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQR 240

Query: 1028 DSYWDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXX 849
            DSYW+ARPSS+S EYPLFIPRQN+CS+ G+NSW++RSSSNFGVAS+ESER+P LD +   
Sbjct: 241  DSYWNARPSSRSNEYPLFIPRQNYCSTSGDNSWTDRSSSNFGVASMESERRPRLDSSGSN 300

Query: 848  XXXXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLAT 669
                      +VE+HKDLQ  +SLLKKSVACITA+ YNS+ LDVP+EASTFEAF++LLAT
Sbjct: 301  SFNYSSASSHTVETHKDLQIGISLLKKSVACITAFCYNSLCLDVPTEASTFEAFSKLLAT 360

Query: 668  LSSS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNN 510
            LSS+       +LK  CSRS K+AQ+LNK                 +HM  +T N   +N
Sbjct: 361  LSSTKEVRSVFSLKMACSRSSKQAQQLNKSVWNVNSAMSSSMLLESAHMLPLTKNLSDHN 420

Query: 509  LSNSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            L +SAASFL++TE+   G++ESL+E WD+VEHPT PPPPSQ EDVEHWTRAMF DATK+
Sbjct: 421  LPSSAASFLFATEMPDIGKNESLIEEWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKR 479


>ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus
            sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED:
            uncharacterized protein LOC102626964 isoform X2 [Citrus
            sinensis]
          Length = 478

 Score =  597 bits (1540), Expect = e-168
 Identities = 300/476 (63%), Positives = 372/476 (78%), Gaps = 7/476 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            + + +CAICE SN A++CA CVN+RL E N  LKSLKS RD+L+ +L+ VL+A+ KAD+Q
Sbjct: 3    KKASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +WRVLQ EK   ++E+L++ +++LSQ + K+++ S DLKV+  +L+SA S ++K RAE 
Sbjct: 63   LNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKVRYAILDSARSMMEKNRAEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPN+I TQSLG++AI  E LHKQSVVIKQIC+L P R+VN  GE +DGS+G YDQI
Sbjct: 123  LEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            C ARLP+GLDPHSVPSEELAASLGYM+QLLNLVV +LA P LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPILHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXX 840
            WDARPSS+S EYPLFIPRQN+CS+ GENSW++RSSSNFGVAS+ESER+P LD +      
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSTSFN 302

Query: 839  XXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSS 660
                   SVE+HKDLQK +SLLKKSVAC+TAY YNS+ LDVP+EASTFEAFA+LLATLSS
Sbjct: 303  YTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATLSS 362

Query: 659  S-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLSN 501
            S       +LK  CSRS K+ Q+LN+                 +HM  +T N   NNL +
Sbjct: 363  SKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNNLPS 422

Query: 500  SAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            SAASFLY+TE++  G++ESL++GWD+VEHPT PPPPSQ EDVEHWTRAM  DATKK
Sbjct: 423  SAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina]
            gi|567883029|ref|XP_006434073.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883031|ref|XP_006434074.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883033|ref|XP_006434075.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536194|gb|ESR47312.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536195|gb|ESR47313.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536196|gb|ESR47314.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536197|gb|ESR47315.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
          Length = 478

 Score =  592 bits (1527), Expect = e-166
 Identities = 298/476 (62%), Positives = 370/476 (77%), Gaps = 7/476 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            + + +CAICE SN A++CA CVN+RL E N  LKSLKS RD+L+ +L+ VL+A+ KAD+Q
Sbjct: 3    KKASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +WRVLQ EK   ++E+L++ +++LSQ + K+++ S DLK +  +L+SA S ++K RAE 
Sbjct: 63   LNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKNRAEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPN+I TQSLG++AI  E LHKQSVVIKQIC+L P R+VN  GE +DGS+G YDQI
Sbjct: 123  LEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            C ARLP+GLDPHSVPSEELAASLGYM+QLLNLVV +LA P LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXX 840
            WDARPSS+S EYPLFIPRQN+CS+ GENSW++RSSSNFGVAS+ESER+P LD +      
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSASFN 302

Query: 839  XXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSS 660
                   SVE+HKDLQK +SLLKKSVAC+TAY YNS+ LDVP+EASTFEAFA+LLATLS 
Sbjct: 303  YTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATLSL 362

Query: 659  S-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLSN 501
            S       +LK  CSRS K+ Q+LN+                 +HM  +T N   NNL +
Sbjct: 363  SKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNNLPS 422

Query: 500  SAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            SAASFLY+TE++  G++ESL++GWD+VEHPT PPPPSQ EDVEHWTRAM  DATKK
Sbjct: 423  SAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
            gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase
            II, putative [Ricinus communis]
          Length = 478

 Score =  590 bits (1522), Expect = e-166
 Identities = 296/476 (62%), Positives = 364/476 (76%), Gaps = 7/476 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            + S  CAICE SN A++C  CVN+RL EY+  LKSLKS RD L+S+L+ VL+A+ KAD+Q
Sbjct: 3    KKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +WRV Q EK   ++E+L + +++L Q +AK +++S+DL  K GLLES+ S L+K R + 
Sbjct: 63   LNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRVDQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEKY+PNLI TQSLG++AI  E LH  SV +KQIC+L P R+V   GE KDGS+G YDQI
Sbjct: 123  LEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLPRGLDPHS+PSEELAASLGYM+QLLNLVV +LAAP+LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXX 840
            W+ARPSS+S EYPLFIPRQ +CS+ GENSW++RSSSNFGVAS+ESER+  LD +      
Sbjct: 243  WNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSSFN 302

Query: 839  XXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSS 660
                   SVE+HKDLQK +SL+KKSVAC+TAYGYN + LDVP+EASTFEAFA+LLATLSS
Sbjct: 303  YNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATLSS 362

Query: 659  S-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLSN 501
            S       +LK  CSRS K+ Q+LNK                 +H  H+T N   NNL N
Sbjct: 363  SKEVRSVFSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNLRN 422

Query: 500  SAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            SA SFL++ EI+  G++ESL++GWD+VEHPT PPPPSQ EDVEHWTRAMF DATKK
Sbjct: 423  SATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478


>ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera]
            gi|302141899|emb|CBI19102.3| unnamed protein product
            [Vitis vinifera]
          Length = 478

 Score =  583 bits (1504), Expect = e-164
 Identities = 297/476 (62%), Positives = 364/476 (76%), Gaps = 7/476 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            R + SC+ICE SNLA++CA CVN+RL EYN +LKS K  RDSL+ +L+ VL+A+ KAD+Q
Sbjct: 3    RKTSSCSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +WRVLQ EK  +++E+L+  +++    +AKV+++SNDLK+K GLLESA S L+K R E 
Sbjct: 63   INWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNRVEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPNLI TQ+LG +AI  ER HKQSVVIKQIC+L P R+VN  GE KDGS+  YDQI
Sbjct: 123  LEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPYDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CN RLPR LDPHSVPS+ELAASLGYM+QLLNLVV +LAAP+LHN+GFAGSCSRIWQR+SY
Sbjct: 183  CNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQRESY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXX 840
            W+ RPSS+S EYPLFIPRQN CS+ GENSWSERSSSNFG+AS+ES+RKP L+ +      
Sbjct: 243  WNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSSSFN 302

Query: 839  XXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSS 660
                   SVE+HKDLQK +SLLKKSVAC+T Y Y+S+ LDVP+EASTFEAFA+LLA LSS
Sbjct: 303  YSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAILSS 362

Query: 659  S-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLSN 501
            S       +LK  CSRS K+ Q+LNK                 +H   +T N   NNL N
Sbjct: 363  SKEVRSVFSLKMACSRSCKQVQQLNKSIWNMNSAISSSTLLESAHTLPMTRNIFDNNLPN 422

Query: 500  SAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            SAASFLY+TE++  G++ESL+E WD+VEH   PPPPSQ ED+EHWTRAM  DATKK
Sbjct: 423  SAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478


>ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max]
          Length = 474

 Score =  577 bits (1488), Expect = e-162
 Identities = 303/477 (63%), Positives = 363/477 (76%), Gaps = 8/477 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            R + +CAICE SN A++C+ CVN+RL EYN +LK LK  RDSL+SKL+ VL+ + K D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYSKLSEVLVRKGKGDDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +WRVLQ EK  ++KE+L+Q +++++Q RAK++  S DLK+K GLLESA STL+K R E 
Sbjct: 63   ANWRVLQHEKLARLKEKLRQGKEQVTQGRAKIETKSADLKLKYGLLESALSTLEKNRVEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPNLI TQSLG+VAI  ERLHKQSVVIKQIC+L P R+V   GE  DG  G +DQI
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERGDGCCGQFDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLPR LDP SVPSEEL+ SLGYM+QLLNL+V +LAAP+LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CNARLPRALDPRSVPSEELSTSLGYMVQLLNLIVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXX 840
            WDARPSS+S EYPLFIPRQN+CS+GGENSWSERSSSNFGVAS+ESER+  LD +      
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTGGENSWSERSSSNFGVASMESERRHRLDSSGSSSFN 302

Query: 839  XXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSS 660
                   SV++HKDLQK +SLLKKSVACITAY YNS+ LDVPSEASTFEAFA+LLATLSS
Sbjct: 303  YSLASSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLATLSS 362

Query: 659  S-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNN-LS 504
            S       +LK   SR+ K+ Q+LNK                    AH  P  R  N L 
Sbjct: 363  SKEVRSVFSLKMPRSRTCKQVQQLNK---SVWNMNSAISSTTLLESAHSVPTTRIENYLP 419

Query: 503  NSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            ++ ASFLY+T+  + G++E LVEGWDIVEHPT PPPPSQ EDVEHWTRAMF DA +K
Sbjct: 420  SATASFLYATD--SDGKNECLVEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 474


>ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|566157047|ref|XP_006386388.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|566157050|ref|XP_006386389.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|222843996|gb|EEE81543.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344610|gb|ERP64185.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344611|gb|ERP64186.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 475

 Score =  574 bits (1480), Expect = e-161
 Identities = 296/476 (62%), Positives = 359/476 (75%), Gaps = 7/476 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            + S  CAICE SN A++C  CVN+RL EY   LKSL S RDSL+SKL+ VL+A+ KAD+Q
Sbjct: 3    KKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +WRV Q EK    +E+L + +++L+Q +AKV+++S DLK K G+LESA + L+K R E 
Sbjct: 63   FNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPNLI TQSLG++AI  E LHKQSVVIKQIC+L P R+VN  GE     +G YDQI
Sbjct: 123  LEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGERN--FSGQYDQI 180

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLPRGLDPHSV SEELAASLGYM+QLLNLV  +LAAP+LHNAGFAGSCSRIWQRDSY
Sbjct: 181  CNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSY 240

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXX 840
            W+A PSS+S EYPLFIPRQN+CS+  ENSW+++SSSNFGVAS+ESER+P+LD        
Sbjct: 241  WNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNSFN 300

Query: 839  XXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSS 660
                   SVE+HKDLQK +SLLKKSVAC+TAY YN + LDVPS+ STFEAFA+LL+TLSS
Sbjct: 301  YSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSS 360

Query: 659  S-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLSN 501
            S       NLK  CSRS K+ Q+LNK                 +H   +  N   NNL N
Sbjct: 361  SKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNLPN 420

Query: 500  SAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            SAASFL++T I + G++ES ++GWD+VEHPT PPPPSQVED+EHWTRAMF DATKK
Sbjct: 421  SAASFLFATGI-SDGKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475


>ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoformX1 [Glycine
            max]
          Length = 475

 Score =  574 bits (1479), Expect = e-161
 Identities = 298/477 (62%), Positives = 361/477 (75%), Gaps = 8/477 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            R + +CAICE SN A++C+ CVN+RL EYN +LK LK  RDSL+ KL+ VL+ + K D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGDDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +WRVLQ EK  ++KE+L+Q +++++Q RAK++ +S DLK+K GLLESA STL+K R E 
Sbjct: 63   ANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRVEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPNLI TQSLG+VAI  E LHK+SVVIKQIC+L P R+V   GE +DG +G YDQI
Sbjct: 123  LEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLPR LDPHSVPSEEL+ SLGYM+QLLNLV+ +LAAP+LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXX 840
            WDARPSS+S EYPLFIPRQN+CS+ GENSWSERSSSNFGVASVESER+  LD +      
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTSFN 302

Query: 839  XXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSS 660
                   SV++HKDLQK +SLLKKSV CITAY YNS+ LDVPSEASTFEAFA+LLATL+S
Sbjct: 303  YSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATLAS 362

Query: 659  S-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNN-LS 504
            S       +LK   SR+ K+ Q+LNK                    AH  P  R  N L 
Sbjct: 363  SKEVRSVFSLKMARSRTCKQVQQLNK---SVWNMNSAISSTTLLESAHSVPTTRIENYLP 419

Query: 503  NSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            +S  SFLY+ ++ + G++E L+EGWDIVEHPT PPPPSQ EDVEHWTRAMF DA  K
Sbjct: 420  SSTGSFLYAADL-SDGKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 475


>gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris]
            gi|561014256|gb|ESW13117.1| hypothetical protein
            PHAVU_008G169200g [Phaseolus vulgaris]
          Length = 476

 Score =  571 bits (1471), Expect = e-160
 Identities = 299/478 (62%), Positives = 364/478 (76%), Gaps = 9/478 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            R + +CAICE SN A++C+ CVN+RL EYN +LKSLK  RDSL+SKL+ VL+ + K D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKSLKDRRDSLYSKLSEVLVQKGKGDDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             ++ VLQ EK  ++KE+L + +++++Q RAK++ VS DLK K GLLESA STL+K R E 
Sbjct: 63   ENYIVLQNEKLARLKEKLHRSKEQVTQGRAKIETVSADLKHKYGLLESALSTLEKNRVEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPNLI TQSLG+VAI  ERLHKQSVVIKQIC+L P R+V   GE +DG +G YDQI
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEIRDGCSGQYDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLPR LDPHSVPSEEL+ASLGYM+QLLNLVV +LAAP+LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWS-ERSSSNFGVASVESERKPYLDPAIXXXX 843
            WDARPSS+S EYPLFIPRQN+CS+ GENSWS ++SSSNFGVAS+ESE++  LD +     
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTAGENSWSTDKSSSNFGVASMESEKRNRLDSSGNSNF 302

Query: 842  XXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLS 663
                    SV++HKDLQK +SLLKKSVACITAY YNS+ LD PSEASTFE+FA+LLATLS
Sbjct: 303  NYSLASLHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFESFAKLLATLS 362

Query: 662  SS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNN-L 507
            SS       +LK   SR+ K+ Q+LNK                    AH  P  R  N L
Sbjct: 363  SSKEVRSVFSLKMAQSRTCKQVQQLNK---SVWNMNSVISSTTLLESAHSVPTTRIENYL 419

Query: 506  SNSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
             +S ASFLY+T++   G++E L+EGWDI+EHPT PPPPSQ EDVEHWTRAMF DA +K
Sbjct: 420  PSSTASFLYATDL-NDGKNECLIEGWDIIEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 476


>ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula]
            gi|355516236|gb|AES97859.1| hypothetical protein
            MTR_5g061040 [Medicago truncatula]
          Length = 501

 Score =  565 bits (1457), Expect = e-158
 Identities = 299/487 (61%), Positives = 360/487 (73%), Gaps = 21/487 (4%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            R S +CAICE  N  ++C+ CVN+RL EYN +LKSLK  RDSL+SKL+ VL+ + K D+Q
Sbjct: 3    RKSTNCAICENLNQPSICSVCVNYRLNEYNSSLKSLKERRDSLYSKLSEVLVRKGKGDDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
            T+WRVL+ EK  + +E+L+  +++++Q RAK++ +S DLK+K G+LESA S L+K R E 
Sbjct: 63   TNWRVLRHEKLARSREKLRHNKEQVTQGRAKIQAMSADLKLKYGVLESALSMLEKNRVEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPNLI TQSLG+VAI  ERLHKQSVVIKQIC+L P R+V   GE  D  +G YDQI
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQYDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLPR LDPHSVPSEEL+ASLGYM+QLLNLV  +LAAP+LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSASLGYMVQLLNLVAHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSK-------------EYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESER 879
            WDARPSS+SK             EYPLFIPRQN+CS+ GENSWSE+SSSNFGVAS+ES+R
Sbjct: 243  WDARPSSRSKNFFNLKYSLFFSNEYPLFIPRQNYCSTSGENSWSEKSSSNFGVASMESDR 302

Query: 878  KPYLDPAIXXXXXXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEAST 699
            +P LD +             SV+SHKDLQK +SLLKKSVACITAY YNS+  D+PSEAST
Sbjct: 303  RPRLDSSGSSSFNYSLASSHSVQSHKDLQKGISLLKKSVACITAYCYNSLCFDIPSEAST 362

Query: 698  FEAFARLLATLSSS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMA 540
            FEAFA+LLATLSSS       +LK   SR+ K+ Q+LNK                     
Sbjct: 363  FEAFAKLLATLSSSKEVRSVFSLKMARSRTCKQVQQLNK---SVWNMNSANSSTTLLEST 419

Query: 539  HVTPNARYNN-LSNSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWT 363
            H  P  R  N + NSAASFLY T+ ++  +SE L+EGWDIVEHPTLPPPPSQ EDVEHWT
Sbjct: 420  HSVPTTRIENYMPNSAASFLYPTD-SSDRKSECLIEGWDIVEHPTLPPPPSQSEDVEHWT 478

Query: 362  RAMFTDA 342
            RAMF DA
Sbjct: 479  RAMFIDA 485


>gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]
          Length = 478

 Score =  565 bits (1456), Expect = e-158
 Identities = 292/476 (61%), Positives = 355/476 (74%), Gaps = 7/476 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            R S SCA+CE SNL ++C+ CVN+RL ++   LKS KS RDSL+S+L  VLLA+ KAD+Q
Sbjct: 3    RKSTSCALCENSNLPSICSICVNYRLADHYNILKSNKSHRDSLYSRLEEVLLAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
              WR+ Q EK  K++E+ ++ +++L Q +AKV+R+  DLKVK G+LE+A S L+  R E 
Sbjct: 63   VGWRMSQNEKLAKLREKHRRSKERLVQGKAKVERMHYDLKVKSGVLEAARSMLENNRMEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPN I TQ+LG++AI  ERLHKQSVVIKQIC+L P R+V   GE K+GS   YDQI
Sbjct: 123  LEKFYPNFICTQTLGHMAITSERLHKQSVVIKQICKLFPHRRVIIDGERKNGSAEQYDQI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLPRG+DPHSV SEEL ASLGYM+QLLNL+V  LAAP+LHN+GFAGS SRIWQRDSY
Sbjct: 183  CNARLPRGVDPHSVASEELGASLGYMVQLLNLIVRILAAPALHNSGFAGSNSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXX 840
            WDARPSS+S EYPLFIPRQN+CS+  ENSWS+RSSSNFGV S+ESERK  LD +      
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTSVENSWSDRSSSNFGVTSIESERKVRLDSSGSNSFN 302

Query: 839  XXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSS 660
                   S+E+HKDLQK +SLLKKSVACIT Y YNS+ LDVPSEASTFEAFA+LLATLSS
Sbjct: 303  YSSASPHSIETHKDLQKGISLLKKSVACITTYCYNSLCLDVPSEASTFEAFAKLLATLSS 362

Query: 659  S-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLSN 501
            S       ++K+ CSRS K+ Q+LNK                 +H      N   NNL N
Sbjct: 363  SKELRSVCSIKSACSRSNKQVQQLNKSVWNVNSAFASTTLLDSAHTVASMKNIGENNLPN 422

Query: 500  SAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
             A SFLY+TE +  G++E ++EGWD++EHPT PPPPSQ EDVEHWTRAMF DATKK
Sbjct: 423  PATSFLYATE-SDAGKNEFIIEGWDLIEHPTFPPPPSQCEDVEHWTRAMFIDATKK 477


>ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|550344612|gb|ERP64187.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 506

 Score =  558 bits (1438), Expect = e-156
 Identities = 296/507 (58%), Positives = 359/507 (70%), Gaps = 38/507 (7%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            + S  CAICE SN A++C  CVN+RL EY   LKSL S RDSL+SKL+ VL+A+ KAD+Q
Sbjct: 3    KKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +WRV Q EK    +E+L + +++L+Q +AKV+++S DLK K G+LESA + L+K R E 
Sbjct: 63   FNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPNLI TQSLG++AI  E LHKQSVVIKQIC+L P R+VN  GE     +G YDQI
Sbjct: 123  LEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYDQI 180

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLPRGLDPHSV SEELAASLGYM+QLLNLV  +LAAP+LHNAGFAGSCSRIWQRDSY
Sbjct: 181  CNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSY 240

Query: 1019 WDARPSSQ-------------------------------SKEYPLFIPRQNFCSSGGENS 933
            W+A PSS+                               S EYPLFIPRQN+CS+  ENS
Sbjct: 241  WNACPSSRRYFDWKSLCFGISVAKFELLLLSELNILCACSNEYPLFIPRQNYCSTSSENS 300

Query: 932  WSERSSSNFGVASVESERKPYLDPAIXXXXXXXXXXXXSVESHKDLQKALSLLKKSVACI 753
            W+++SSSNFGVAS+ESER+P+LD               SVE+HKDLQK +SLLKKSVAC+
Sbjct: 301  WTDKSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVACV 360

Query: 752  TAYGYNSIGLDVPSEASTFEAFARLLATLSSS-------NLKNTCSRSGKEAQRLNKXXX 594
            TAY YN + LDVPS+ STFEAFA+LL+TLSSS       NLK  CSRS K+ Q+LNK   
Sbjct: 361  TAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEVRSVFNLKMACSRSCKQVQKLNKSVW 420

Query: 593  XXXXXXXXXXXXXXSHMAHVTPNARYNNLSNSAASFLYSTEIATPGRSESLVEGWDIVEH 414
                          +H   +  N   NNL NSAASFL++T I + G++ES ++GWD+VEH
Sbjct: 421  NVNSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGI-SDGKNESFIDGWDLVEH 479

Query: 413  PTLPPPPSQVEDVEHWTRAMFTDATKK 333
            PT PPPPSQVED+EHWTRAMF DATKK
Sbjct: 480  PTFPPPPSQVEDIEHWTRAMFIDATKK 506


>ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum
            lycopersicum]
          Length = 481

 Score =  550 bits (1416), Expect = e-153
 Identities = 281/482 (58%), Positives = 355/482 (73%), Gaps = 13/482 (2%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            R +  C ICE SNL +VC  CVN+RL EY+  LKSLK  R++L  +L+ +LLA+ KAD+Q
Sbjct: 3    RKTSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGQLSEILLAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             SWRV + EK  +++E+L+Q ++++SQ +AK++++S+DLKV+  LL SA   L+K RAE 
Sbjct: 63   LSWRVPRNEKLARLREKLRQQKEQVSQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            LEK+YPNLI TQ+LG++AI  E LHKQSVV+KQIC+L P R+V   G+ KDGS+G YD I
Sbjct: 123  LEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CNARLP+GLDPHSVPS+EL+ASLGYM+QLLNLVV  + AP+LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CNARLPKGLDPHSVPSDELSASLGYMVQLLNLVVRCVCAPALHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERS------SSNFGVASVESERKPYLDPA 858
            WDARPSS+S EYPLFIPRQNFCSSGGE SW +RS      SSNFGV S+ES+RKP LD +
Sbjct: 243  WDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSSSNSGTSSNFGVTSMESDRKPRLDSS 302

Query: 857  IXXXXXXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARL 678
                         S+E+HKDLQK ++LLKKSVACITAY YN++ L+VP+EASTFE FARL
Sbjct: 303  SSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARL 362

Query: 677  LATLSSS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNAR 519
            LATLSSS       +LK + SR+ K+ Q LNK                     HV  N  
Sbjct: 363  LATLSSSKEVRSVFSLKMSGSRASKQVQPLNK---SVWNVDSAGSSSTLMESGHVPRNTF 419

Query: 518  YNNLSNSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDAT 339
              +L +S  + +Y+TE++  GR+E+L+E WD++EHP  PPPPS  EDVEHWTRAMF DAT
Sbjct: 420  EKSLPSSGGNLMYATEVSNVGRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDAT 479

Query: 338  KK 333
            KK
Sbjct: 480  KK 481


>ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum
            tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED:
            uncharacterized protein LOC102590673 isoform X2 [Solanum
            tuberosum]
          Length = 483

 Score =  548 bits (1411), Expect = e-153
 Identities = 280/477 (58%), Positives = 354/477 (74%), Gaps = 13/477 (2%)
 Frame = -3

Query: 1724 CAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQTSWRV 1545
            C ICE SNL +VC  CVN+RL EY+  LKSLK  R++L  KL+ +LLA+ KAD+Q SWRV
Sbjct: 8    CGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGKLSEILLAKGKADDQLSWRV 67

Query: 1544 LQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEVLEKYY 1365
             + EK  +++E+L+Q ++++SQ +AK++++S+DLKV+  LL SA   L+K RAE LEK+Y
Sbjct: 68   PRNEKLARLREKLRQQKEQISQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQLEKFY 127

Query: 1364 PNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQICNARL 1185
            PNLI TQ+LG++AI  E LHKQSVV+KQIC+L P R+V   G+ KDGS+G YD ICNARL
Sbjct: 128  PNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSICNARL 187

Query: 1184 PRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSYWDARP 1005
            P+GLDPHSVPS+EL+ASLGYM+QLLNLV+  + AP+LHN+GFAGSCSRIWQRDSYWDARP
Sbjct: 188  PKGLDPHSVPSDELSASLGYMVQLLNLVIRCVCAPALHNSGFAGSCSRIWQRDSYWDARP 247

Query: 1004 SSQSKEYPLFIPRQNFCSSGGENSWSERS------SSNFGVASVESERKPYLDPAIXXXX 843
            SS+S EYPLFIPRQNFCSSGGE SW +RS      SSNFGV S+ES+RKP LD +     
Sbjct: 248  SSRSGEYPLFIPRQNFCSSGGEASWYDRSCSNSGTSSNFGVTSMESDRKPRLDSSSSSSF 307

Query: 842  XXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLS 663
                    S+E+HKDLQK ++LLKKSVACITAY YN++ L+VP+EASTFE FARLLATLS
Sbjct: 308  NYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARLLATLS 367

Query: 662  SS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLS 504
            SS       +LK + SR+ K+ Q LNK                  H+  V  N   N L 
Sbjct: 368  SSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGHVP-VLRNTFENALP 426

Query: 503  NSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            +S+ + +Y+TE++   R+E+L+E WD++EHP  PPPPS  EDVEHWTRAMF DATKK
Sbjct: 427  SSSGNLIYATEVSDARRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATKK 483


>ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana]
            gi|23297675|gb|AAN13006.1| unknown protein [Arabidopsis
            thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA
            polymerase II protein [Arabidopsis thaliana]
          Length = 473

 Score =  541 bits (1394), Expect = e-151
 Identities = 282/480 (58%), Positives = 360/480 (75%), Gaps = 8/480 (1%)
 Frame = -3

Query: 1748 MTGRSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKA 1569
            MT RSS +CAIC+ +N   +C  CVNHRL EYN  LKSLK+ RDSL S+ N +L ++ KA
Sbjct: 1    MTKRSS-NCAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKA 59

Query: 1568 DEQTSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKR 1389
            D+Q +WR++Q EK  K+K++L+  ++ ++Q + K++R S+DLKVK G+L+SA STL+K R
Sbjct: 60   DDQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTR 119

Query: 1388 AEVLEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSY 1209
             E +EKY+PNLI TQSLG++AI+ ERLHKQSVV+KQIC+L P+R+V+  GE+++GS   Y
Sbjct: 120  VEQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQY 179

Query: 1208 DQICNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQR 1029
            D ICN+RLP GLDPHS+PSEELA SLGYM+QLLNLVV +LAAP+LH++GFAGSCSRIWQR
Sbjct: 180  DVICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQR 239

Query: 1028 DSYWDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERK-PYLDPAIX 852
            DSYWD R S++S EYPLFIPR+N+CS+  ENSW++++SSNFGVAS+ES+RK P LD    
Sbjct: 240  DSYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGS 299

Query: 851  XXXXXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLA 672
                       S+ESH+DLQK ++LLKKSVAC+TAY YNS+ L+VP EASTFEAFA+LLA
Sbjct: 300  NSFKYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLA 359

Query: 671  TLSSS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYN 513
            TLSSS       +LK   SRSGK+AQ+LNK                    AH+  N  YN
Sbjct: 360  TLSSSKEVRSVFSLKMASSRSGKQAQQLNK----SIWNAHSVISSSLLESAHLPRNTSYN 415

Query: 512  NLSNSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
               NS AS+L +TE++T  R  + + GWD+VEHP  PPPPSQ EDVEHWTRAMF DA KK
Sbjct: 416  QDPNSPASYLSATELST--RKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473


>gb|AAL59980.1| unknown protein [Arabidopsis thaliana]
          Length = 473

 Score =  541 bits (1394), Expect = e-151
 Identities = 282/480 (58%), Positives = 360/480 (75%), Gaps = 8/480 (1%)
 Frame = -3

Query: 1748 MTGRSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKA 1569
            MT RSS +CAIC+ +N   +C  CVNHRL EYN  LKSLK+ RDSL S+ N +L ++ KA
Sbjct: 1    MTKRSS-NCAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKA 59

Query: 1568 DEQTSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKR 1389
            D+Q +WR++Q EK  K+K++L+  ++ ++Q + K++R S+DLKVK G+L+SA STL+K R
Sbjct: 60   DDQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTR 119

Query: 1388 AEVLEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSY 1209
             E +EKY+PNLI TQSLG++AI+ ERLHKQSVV+KQIC+L P+R+V+  GE+++GS   Y
Sbjct: 120  VEQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQY 179

Query: 1208 DQICNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQR 1029
            D ICN+RLP GLDPHS+PSEELA SLGYM+QLLNLVV +LAAP+LH++GFAGSCSRIWQR
Sbjct: 180  DVICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQR 239

Query: 1028 DSYWDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERK-PYLDPAIX 852
            DSYWD R S++S EYPLFIPR+N+CS+  ENSW++++SSNFGVAS+ES+RK P LD    
Sbjct: 240  DSYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGS 299

Query: 851  XXXXXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLA 672
                       S+ESH+DLQK ++LLKKSVAC+TAY YNS+ L+VP EASTFEAFA+LLA
Sbjct: 300  NSFMYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLA 359

Query: 671  TLSSS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYN 513
            TLSSS       +LK   SRSGK+AQ+LNK                    AH+  N  YN
Sbjct: 360  TLSSSKEVRSVFSLKMASSRSGKQAQQLNK----SIWNAHSVISSSLLESAHLPRNTSYN 415

Query: 512  NLSNSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
               NS AS+L +TE++T  R  + + GWD+VEHP  PPPPSQ EDVEHWTRAMF DA KK
Sbjct: 416  QDPNSPASYLSATELST--RKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473


>ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum]
            gi|557098297|gb|ESQ38733.1| hypothetical protein
            EUTSA_v10028627mg [Eutrema salsugineum]
          Length = 474

 Score =  538 bits (1385), Expect = e-150
 Identities = 278/477 (58%), Positives = 362/477 (75%), Gaps = 8/477 (1%)
 Frame = -3

Query: 1739 RSSGSCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQ 1560
            + S +CAICE +N A++C+ CVN+RL EY+  LKSLK+ RD+L+SKL+ +L A+ KAD+Q
Sbjct: 3    KRSSNCAICENTNRASICSVCVNYRLIEYSTLLKSLKTRRDALYSKLSELLEAKGKADDQ 62

Query: 1559 TSWRVLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEV 1380
             +W+++Q EK   +K  L++ +++++Q +AK++R S DLK+K G+L+SA STL++ R E 
Sbjct: 63   KNWKLIQNEKLSGLKNNLRRNKEQVTQGKAKIERESRDLKLKYGVLDSARSTLERIRVEQ 122

Query: 1379 LEKYYPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQI 1200
            +EKY+PNLI TQSLG++AI+ ERLHKQSVV+KQ+C+L P R+V+  GE+++GS G Y+ I
Sbjct: 123  VEKYFPNLICTQSLGHMAISSERLHKQSVVMKQVCKLFPQRRVSFDGESQNGSVGQYNLI 182

Query: 1199 CNARLPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSY 1020
            CN+RLP+GLDPHS+PSEELAASLG M+QLLNLVV +LAAP+LHN+GFAGSCSRIWQRDSY
Sbjct: 183  CNSRLPKGLDPHSIPSEELAASLGLMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1019 WDARPSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERK-PYLDPAIXXXX 843
            WDARPS++S EYPLFIPRQN+CS+  ENSW++++SSNFGVAS+ES+RK   LD       
Sbjct: 243  WDARPSTRSNEYPLFIPRQNYCSTSVENSWTDKNSSNFGVASMESDRKEARLDSTGRNSF 302

Query: 842  XXXXXXXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLS 663
                    SVESH+DLQK ++LLKKSVAC+TAY YNS+ L+VP EASTFEAFA+LLATLS
Sbjct: 303  NYSSASPHSVESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLS 362

Query: 662  SS-------NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLS 504
            SS       +LK   SRS K+AQ+LNK                    +H+  NA YN   
Sbjct: 363  SSKEVRSVFSLKMASSRSCKQAQQLNK----SIWNAHSVISSSILESSHLPRNASYNQDP 418

Query: 503  NSAASFLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            NSAAS+L  TE++   +S  +  GWD+VEHP  PPPPSQ EDVEHWTRAMF DA KK
Sbjct: 419  NSAASYLSGTELSEIRKSNDM-NGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 474


>ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217421 [Cucumis sativus]
            gi|449524750|ref|XP_004169384.1| PREDICTED:
            uncharacterized LOC101217421 [Cucumis sativus]
          Length = 476

 Score =  524 bits (1349), Expect = e-146
 Identities = 267/472 (56%), Positives = 345/472 (73%), Gaps = 7/472 (1%)
 Frame = -3

Query: 1727 SCAICEGSNLAAVCAPCVNHRLGEYNGTLKSLKSLRDSLHSKLNGVLLARKKADEQTSWR 1548
            +CAICE SN A++C  CVN RL +YN +LKSL++ RD L+S+L+ VL+A+ KAD+Q +WR
Sbjct: 7    NCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWR 66

Query: 1547 VLQKEKHMKMKERLQQLEKKLSQDRAKVKRVSNDLKVKRGLLESAFSTLKKKRAEVLEKY 1368
            V + EK   ++E+L++  ++L Q +A+++  S DL++K  +LESA S L+K+R E LEK 
Sbjct: 67   VTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQLKYAMLESARSVLEKQRLEQLEKA 126

Query: 1367 YPNLIHTQSLGYVAINFERLHKQSVVIKQICRLLPMRKVNSAGETKDGSNGSYDQICNAR 1188
            YP+LI T++LG++AI  ERLHKQSVVIKQ+C+L P R+V   GE + G    +DQICN  
Sbjct: 127  YPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICNVS 186

Query: 1187 LPRGLDPHSVPSEELAASLGYMLQLLNLVVPSLAAPSLHNAGFAGSCSRIWQRDSYWDAR 1008
            LPR LDPHSV   EL+ASLGYM+QLLNLVV  LAAP+LH +GFAGSCSRIWQRDSYW+A 
Sbjct: 187  LPRSLDPHSVEPYELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNAC 246

Query: 1007 PSSQSKEYPLFIPRQNFCSSGGENSWSERSSSNFGVASVESERKPYLDPAIXXXXXXXXX 828
            PSS+S EYP+F+PRQ++CS+ GENSWS++SSSNFGVAS+ESERKP L             
Sbjct: 247  PSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRSFNYSSA 306

Query: 827  XXXSVESHKDLQKALSLLKKSVACITAYGYNSIGLDVPSEASTFEAFARLLATLSSS--- 657
               S+ESHKDLQK ++LLKKSVAC+TAYGYNS+ LDVPSEASTFEAFA+LLATLSSS   
Sbjct: 307  SPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEV 366

Query: 656  ----NLKNTCSRSGKEAQRLNKXXXXXXXXXXXXXXXXXSHMAHVTPNARYNNLSNSAAS 489
                +LK   SRS K  Q+  K                    + +      +NL +SA+S
Sbjct: 367  RSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSMLFESGH--SQIMKTNYESNLPSSASS 424

Query: 488  FLYSTEIATPGRSESLVEGWDIVEHPTLPPPPSQVEDVEHWTRAMFTDATKK 333
            +LY+TE +  G+++S +EGWD+VEHPT PPPPSQ ED+EHWTRAM  DATK+
Sbjct: 425  YLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ 476


Top