BLASTX nr result

ID: Rauwolfia21_contig00011445 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00011445
         (2044 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   703   0.0  
ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   698   0.0  
gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus pe...   698   0.0  
ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   697   0.0  
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   694   0.0  
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   692   0.0  
gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 ...   689   0.0  
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   685   0.0  
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...   685   0.0  
ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   682   0.0  
gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus...   679   0.0  
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   677   0.0  
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   676   0.0  
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   669   0.0  
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   669   0.0  
gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]     649   0.0  
ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr...   629   e-177
ref|XP_004514262.1| PREDICTED: uncharacterized protein LOC101503...   613   e-173
ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido...   612   e-172
gb|AAL59980.1| unknown protein [Arabidopsis thaliana]                 612   e-172

>ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera]
            gi|302141899|emb|CBI19102.3| unnamed protein product
            [Vitis vinifera]
          Length = 478

 Score =  703 bits (1814), Expect = 0.0
 Identities = 356/479 (74%), Positives = 404/479 (84%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            MTRKT++C +ICE SNLASICA CVNYRLNEYNT+LKS + RRD+LY RLSEVL+ KGKA
Sbjct: 1    MTRKTSSC-SICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKA 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W V QNEKLAR+REKLR  KE+ + GKAK+EKMS +LK+KY LLESAMS+LEKN+
Sbjct: 60   DDQINWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            VEQLEKFYPNLIC QNLG MAITSER +KQSV+IKQICKLFP RRVN++GEKKDG    Y
Sbjct: 120  VEQLEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+ RLPR LDPHSVP++ELAASLGYMVQLLNLVV+N+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            +SYW+ RPSSRS EYPLFIPR   C+  GE SWS++SSSNFG+ASMES RKP L+     
Sbjct: 240  ESYWNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSS 299

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      S+ETHKDLQKGISLLKKSVAC+T  CY+SL L+VP EASTFEAFA+LLA 
Sbjct: 300  SFNYSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAI 359

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            LSSSKE+RSVFSLKMA SRS KQVQQLNKS+WN++SA+SSSTL+ESAH L  TRN  DNN
Sbjct: 360  LSSSKEVRSVFSLKMACSRSCKQVQQLNKSIWNMNSAISSSTLLESAHTLPMTRNIFDNN 419

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +P+SAASFLY  E SD GK E+ +E WDLVEH  FPPPPSQTED+EHWTRAM IDATKK
Sbjct: 420  LPNSAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478


>ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus
            sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED:
            uncharacterized protein LOC102626964 isoform X2 [Citrus
            sinensis]
          Length = 478

 Score =  698 bits (1802), Expect = 0.0
 Identities = 353/479 (73%), Positives = 403/479 (84%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M +K + C AICE+SN ASICA CVNYRL+E NT LKSL+SRRDALY RLSEVL+ KGKA
Sbjct: 1    MNKKASNC-AICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKA 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W V QNEKL  +REKLR  KE+L QGK KIEK S +LKV+Y +L+SA S++EKN+
Sbjct: 60   DDQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKVRYAILDSARSMMEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
             EQLEKFYPN+IC Q+LG+MAI SE L+KQSV+IKQICKLFP RRVN++GE++DG  GQY
Sbjct: 120  AEQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC ARLP+GLDPHSVP+EELAASLGYMVQLLNLVV N+  P LHNSGFAGSCSRIWQR
Sbjct: 180  DQICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPILHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            DSYWDARPSSRS EYPLFIPR  +C+  GE SW+D+SSSNFGVASMES R+P LD     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRST 299

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      S+ETHKDLQKGISLLKKSVAC+TA CYNSL L+VPAEASTFEAFA+LLAT
Sbjct: 300  SFNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLAT 359

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            LSSSKE+RSVFSLKMA SRS KQVQ+LN+SVWN++SA+SS+TL+ESAH    T+N  DNN
Sbjct: 360  LSSSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNN 419

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +PSSAASFLY  E SD GK E+ ++GWDLVEHPTFPPPPSQTEDVEHWTRAM IDATKK
Sbjct: 420  LPSSAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica]
            gi|462422646|gb|EMJ26909.1| hypothetical protein
            PRUPE_ppa005050mg [Prunus persica]
          Length = 479

 Score =  698 bits (1802), Expect = 0.0
 Identities = 353/479 (73%), Positives = 408/479 (85%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M RK++ C AICESSNLAS+CA CVNYRL EYN++LK+L+SRRD+LYSRL+E L+ KGKA
Sbjct: 2    MNRKSSNC-AICESSNLASVCAICVNYRLTEYNSSLKALKSRRDSLYSRLTEALVAKGKA 60

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W V QNEKL R+REKLR +KE+L+QGKAKIEK S +LKVK  +LESA++VLEKN+
Sbjct: 61   DDQLNWRVLQNEKLVRLREKLRCNKEQLVQGKAKIEKTSYDLKVKSGVLESALAVLEKNR 120

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
             EQLEKFYPN IC QNLG+MAITSERL+KQSV+IKQICKLFP RRV V+ ++KD   GQY
Sbjct: 121  AEQLEKFYPNFICTQNLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKDASGGQY 180

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+A LPRGLDPHSVP+EELAASLGYMVQLLNLVV N+ APALHNSGFAGSCSRIWQR
Sbjct: 181  DQICNACLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLAAPALHNSGFAGSCSRIWQR 240

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWSD+SSSNFGVAS++S RKPHLD     
Sbjct: 241  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIDSERKPHLDSSGSS 300

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      S+ETHKDLQ+GISLLKKSVACITA CYNSL L+VP+EASTFEAFA+LLAT
Sbjct: 301  SFNYTSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLAT 360

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            LSSSKE+ SVFSLKMA SRS KQVQQLNKSVWNV+SA+SS+TL++SAHA+  T+N  + N
Sbjct: 361  LSSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHAMTMTKNLYEYN 420

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +P+ A S L   E SD+GK E+ VEGWDLVEHPTFPPPPSQ+ED+EHWTRAMFIDA +K
Sbjct: 421  LPTYATSSLCSTELSDSGKNESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFIDAKRK 479


>ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
            gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase
            II, putative [Ricinus communis]
          Length = 478

 Score =  697 bits (1799), Expect = 0.0
 Identities = 356/476 (74%), Positives = 401/476 (84%), Gaps = 4/476 (0%)
 Frame = -2

Query: 1845 KTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKADDQ 1666
            K ++CCAICE+SN ASIC  CVNYRLNEY+T LKSL+SRRD LYSRLSEVL+ KGKADDQ
Sbjct: 3    KKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKADDQ 62

Query: 1665 KSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQVEQ 1486
             +W V QNEKLA +REKL   KE+LIQ KAK EKMS +L  KY LLES+ S LEKN+V+Q
Sbjct: 63   LNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRVDQ 122

Query: 1485 LEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQYDTI 1306
            LEK++PNLIC Q+LG+MAITSE L+  SV +KQICKLFP RRV VEGEKKDG  GQYD I
Sbjct: 123  LEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYDQI 182

Query: 1305 CSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQRDSY 1126
            C+ARLPRGLDPHS+P+EELAASLGYMVQLLNLVVHN+ APALHNSGFAGSCSRIWQRDSY
Sbjct: 183  CNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1125 WDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXXXXX 958
            W+ARPSSRS EYPLFIPR  +C+  GE SW+D+SSSNFGVASMES R+  LD        
Sbjct: 243  WNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSSFN 302

Query: 957  XXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLATLSS 778
                   S+ETHKDLQKGISL+KKSVAC+TA  YN L L+VPAEASTFEAFA+LLATLSS
Sbjct: 303  YNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATLSS 362

Query: 777  SKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNNIPS 598
            SKE+RSVFSLKMA SRS KQVQ+LNKSVWNV+S +SSSTLMESAHA H T+N  DNN+ +
Sbjct: 363  SKEVRSVFSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNLRN 422

Query: 597  SAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            SA SFL+  E SDAGK E+ ++GWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK
Sbjct: 423  SATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478


>ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina]
            gi|567883029|ref|XP_006434073.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883031|ref|XP_006434074.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883033|ref|XP_006434075.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536194|gb|ESR47312.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536195|gb|ESR47313.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536196|gb|ESR47314.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536197|gb|ESR47315.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
          Length = 478

 Score =  694 bits (1790), Expect = 0.0
 Identities = 351/479 (73%), Positives = 401/479 (83%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M +K + C AICE+SN ASICA CVNYRL+E NT LKSL+SRRDALY RLSEVL+ KGKA
Sbjct: 1    MNKKASNC-AICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKA 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W V QNEKL  +REKLR  KE+L QGK KIEK S +LK +Y +L+SA S++EKN+
Sbjct: 60   DDQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
             EQLEKFYPN+IC Q+LG+MAI SE L+KQSV+IKQICKLFP RRVN++GE++DG  GQY
Sbjct: 120  AEQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC ARLP+GLDPHSVP+EELAASLGYMVQLLNLVV N+  P LHNSGFAGSCSRIWQR
Sbjct: 180  DQICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            DSYWDARPSSRS EYPLFIPR  +C+  GE SW+D+SSSNFGVASMES R+P LD     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSA 299

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      S+ETHKDLQKGISLLKKSVAC+TA CYNSL L+VPAEASTFEAFA+LLAT
Sbjct: 300  SFNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLAT 359

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            LS SKE+RSVFSLKMA SRS KQVQ+LN+SVWN++SA+SS+TL+ESAH    T+N  DNN
Sbjct: 360  LSLSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNN 419

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +PSSAASFLY  E SD GK E+ ++GWDLVEHPTFPPPPSQTEDVEHWTRAM IDATKK
Sbjct: 420  LPSSAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca
            subsp. vesca]
          Length = 478

 Score =  692 bits (1786), Expect = 0.0
 Identities = 350/479 (73%), Positives = 406/479 (84%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            MT K ++ CAICE+SNLASICA CVNYRLN+YN +LK+L+SRRD LYSRLS+ L+ KGKA
Sbjct: 1    MTNKKSSNCAICENSNLASICAVCVNYRLNDYNNSLKALKSRRDLLYSRLSDALVAKGKA 60

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W + Q+EKL R+REKLR +KE+L+QGKAKIEK S +LKVKY +LESA+S+LEKN+
Sbjct: 61   DDQLNWRILQDEKLVRLREKLRRNKEQLVQGKAKIEKTSYDLKVKYGVLESALSMLEKNR 120

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
             EQLEKFYPNLIC Q+LG+MAITSERL+KQSV+IKQICKLFP RRV V+ ++K+G  GQY
Sbjct: 121  AEQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKEGSGGQY 180

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+A LPRGLDPHSVP+EELAASLGYMVQLLNLVV N+ APALHNSGFAGSCSRIWQR
Sbjct: 181  DQICNASLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLGAPALHNSGFAGSCSRIWQR 240

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWSD+SSSNFGVAS+ES RKP LD     
Sbjct: 241  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIESERKPRLDSSGSS 300

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      S+ETHKDLQ+GISLLKKSVACITA CYNSL L+VP+EASTFEAFA+LL+T
Sbjct: 301  SFNYSSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLST 360

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            LSSSKE+ SVFSLKMA SRS KQVQQLNKSVWNV+SA+SS+TL++SAH +  T+N  +NN
Sbjct: 361  LSSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHTMTMTKNFYENN 420

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            IP+ A SFL   E SD GK E  +EGWDLVEHPT  PPPSQ+ED+EHWTRAMFID TK+
Sbjct: 421  IPNYATSFLSSTEMSDVGKNECTIEGWDLVEHPTL-PPPSQSEDIEHWTRAMFIDVTKR 478


>gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao]
          Length = 479

 Score =  689 bits (1779), Expect = 0.0
 Identities = 349/479 (72%), Positives = 401/479 (83%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M  K  + CAIC++SN ASICA CVNYRLNEYN+ LKSL+SRRD LYS+L EVL  K KA
Sbjct: 1    MMSKKASNCAICDNSNRASICAVCVNYRLNEYNSLLKSLKSRRDFLYSKLDEVLAAKRKA 60

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W + QNEKL  ++EKLR  KE+L QGKAKIE++S +LKVKY +LESA  +LEKN+
Sbjct: 61   DDQLNWKILQNEKLTDLKEKLRRSKEQLAQGKAKIERVSYDLKVKYGVLESARGMLEKNR 120

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            VE+LEKFYPNLIC Q+LG MAITSERL+KQSV+IKQICKLFP RRVN++GE +DG  GQY
Sbjct: 121  VEKLEKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVNLDGEGRDGSCGQY 180

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+  LPRGLDPHSVP+E+LAASLGYMVQLLNLVVHN+ APALHNSGFAGSCSRIWQR
Sbjct: 181  DLICNVGLPRGLDPHSVPSEQLAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQR 240

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            DSYW+ARPSSRS EYPLFIPR  +C+  G+ SW+D+SSSNFGVASMES R+P LD     
Sbjct: 241  DSYWNARPSSRSNEYPLFIPRQNYCSTSGDNSWTDRSSSNFGVASMESERRPRLDSSGSN 300

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      ++ETHKDLQ GISLLKKSVACITA CYNSL L+VP EASTFEAF++LLAT
Sbjct: 301  SFNYSSASSHTVETHKDLQIGISLLKKSVACITAFCYNSLCLDVPTEASTFEAFSKLLAT 360

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            LSS+KE+RSVFSLKMA SRSSKQ QQLNKSVWNV+SA+SSS L+ESAH L  T+N  D+N
Sbjct: 361  LSSTKEVRSVFSLKMACSRSSKQAQQLNKSVWNVNSAMSSSMLLESAHMLPLTKNLSDHN 420

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +PSSAASFL+  E  D GK E+ +E WDLVEHPTFPPPPSQTEDVEHWTRAMFIDATK+
Sbjct: 421  LPSSAASFLFATEMPDIGKNESLIEEWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKR 479


>ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum
            tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED:
            uncharacterized protein LOC102590673 isoform X2 [Solanum
            tuberosum]
          Length = 483

 Score =  685 bits (1768), Expect = 0.0
 Identities = 353/485 (72%), Positives = 399/485 (82%), Gaps = 10/485 (2%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            MT KT+ CC ICE+SNL S+C  CVNYRLNEY+T LKSL+ RR+AL  +LSE+LL KGKA
Sbjct: 1    MTLKTS-CCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGKLSEILLAKGKA 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ SW V +NEKLAR+REKLR  KE++ QGKAKIEKMS +LKV+YELL SA  +LEKN+
Sbjct: 60   DDQLSWRVPRNEKLARLREKLRQQKEQISQGKAKIEKMSHDLKVQYELLGSATRMLEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
             EQLEKFYPNLIC QNLG+MAITSE L+KQSV++KQICKLFP RRV ++G+KKDG  GQY
Sbjct: 120  AEQLEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D+IC+ARLP+GLDPHSVP++EL+ASLGYMVQLLNLV+  VCAPALHNSGFAGSCSRIWQR
Sbjct: 180  DSICNARLPKGLDPHSVPSDELSASLGYMVQLLNLVIRCVCAPALHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKS------SSNFGVASMESVRKPHL 979
            DSYWDARPSSRS EYPLFIPR  FC+  GE SW D+S      SSNFGV SMES RKP L
Sbjct: 240  DSYWDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSCSNSGTSSNFGVTSMESDRKPRL 299

Query: 978  D--XXXXXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAF 805
            D               SIETHKDLQKGI+LLKKSVACITA CYN+L LEVPAEASTFE F
Sbjct: 300  DSSSSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETF 359

Query: 804  ARLLATLSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTR 625
            ARLLATLSSSKE+RSVFSLKM+ SR+SKQVQ LNKSVWNVDSA SSSTLMES H +   R
Sbjct: 360  ARLLATLSSSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGH-VPVLR 418

Query: 624  NALDNNIPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFI 445
            N  +N +PSS+ + +Y  E SDA + EN +E WDL+EHP FPPPPS TEDVEHWTRAMFI
Sbjct: 419  NTFENALPSSSGNLIYATEVSDARRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFI 478

Query: 444  DATKK 430
            DATKK
Sbjct: 479  DATKK 483


>ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum
            lycopersicum]
          Length = 481

 Score =  685 bits (1768), Expect = 0.0
 Identities = 353/485 (72%), Positives = 398/485 (82%), Gaps = 10/485 (2%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            MTRKT+ CC ICE+SNL S+C  CVNYRLNEY+T LKSL+ RR+AL  +LSE+LL KGKA
Sbjct: 1    MTRKTS-CCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGQLSEILLAKGKA 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ SW V +NEKLAR+REKLR  KE++ QGKAKIEKMS +LKV+YELL SA  +LEKN+
Sbjct: 60   DDQLSWRVPRNEKLARLREKLRQQKEQVSQGKAKIEKMSHDLKVQYELLGSATRMLEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
             EQLEKFYPNLIC QNLG+MAITSE L+KQSV++KQICKLFP RRV ++G+KKDG  GQY
Sbjct: 120  AEQLEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D+IC+ARLP+GLDPHSVP++EL+ASLGYMVQLLNLVV  VCAPALHNSGFAGSCSRIWQR
Sbjct: 180  DSICNARLPKGLDPHSVPSDELSASLGYMVQLLNLVVRCVCAPALHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKS------SSNFGVASMESVRKPHL 979
            DSYWDARPSSRS EYPLFIPR  FC+  GE SW D+S      SSNFGV SMES RKP L
Sbjct: 240  DSYWDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSSSNSGTSSNFGVTSMESDRKPRL 299

Query: 978  D--XXXXXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAF 805
            D               SIETHKDLQKGI+LLKKSVACITA CYN+L LEVPAEASTFE F
Sbjct: 300  DSSSSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETF 359

Query: 804  ARLLATLSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTR 625
            ARLLATLSSSKE+RSVFSLKM+ SR+SKQVQ LNKSVWNVDSA SSSTLMES    H  R
Sbjct: 360  ARLLATLSSSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESG---HVPR 416

Query: 624  NALDNNIPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFI 445
            N  + ++PSS  + +Y  E S+ G+ EN +E WDL+EHP FPPPPS TEDVEHWTRAMFI
Sbjct: 417  NTFEKSLPSSGGNLMYATEVSNVGRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFI 476

Query: 444  DATKK 430
            DATKK
Sbjct: 477  DATKK 481


>ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|566157047|ref|XP_006386388.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|566157050|ref|XP_006386389.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|222843996|gb|EEE81543.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344610|gb|ERP64185.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344611|gb|ERP64186.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 475

 Score =  682 bits (1759), Expect = 0.0
 Identities = 346/476 (72%), Positives = 400/476 (84%), Gaps = 4/476 (0%)
 Frame = -2

Query: 1845 KTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKADDQ 1666
            K ++CCAICE+SN ASIC  CVNYRLNEY T LKSL SRRD+LYS+LS VL+ KGKADDQ
Sbjct: 3    KKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQ 62

Query: 1665 KSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQVEQ 1486
             +W V QNEKLA  REKL  +KE+L QGKAK+EK+S++LK K  +LESA +VLEKN++EQ
Sbjct: 63   FNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQ 122

Query: 1485 LEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQYDTI 1306
            LEKFYPNLIC Q+LG+MAITSE L+KQSV+IKQICKLFP RRVNV+GE+   F GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYDQI 180

Query: 1305 CSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQRDSY 1126
            C+ARLPRGLDPHSV +EELAASLGYMVQLLNLV HN+ AP LHN+GFAGSCSRIWQRDSY
Sbjct: 181  CNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSY 240

Query: 1125 WDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXXXXX 958
            W+A PSSRS EYPLFIPR  +C+   E SW+DKSSSNFGVASMES R+PHLD        
Sbjct: 241  WNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNSFN 300

Query: 957  XXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLATLSS 778
                   S+ETHKDLQKG+SLLKKSVAC+TA CYN L L+VP++ STFEAFA+LL+TLSS
Sbjct: 301  YSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSS 360

Query: 777  SKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNNIPS 598
            SKE+RSVF+LKMA SRS KQVQ+LNKSVWNV+SA+SSS L+ESAHAL   +N  DNN+P+
Sbjct: 361  SKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNLPN 420

Query: 597  SAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            SAASFL+    SD GK E+F++GWDLVEHPTFPPPPSQ ED+EHWTRAMFIDATKK
Sbjct: 421  SAASFLFATGISD-GKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475


>gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris]
            gi|561014256|gb|ESW13117.1| hypothetical protein
            PHAVU_008G169200g [Phaseolus vulgaris]
          Length = 476

 Score =  679 bits (1753), Expect = 0.0
 Identities = 348/480 (72%), Positives = 409/480 (85%), Gaps = 5/480 (1%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M RKT+ C AICE+SN ASIC+ CVNYRLNEYNT+LKSL+ RRD+LYS+LSEVL+ KGK 
Sbjct: 1    MARKTSNC-AICENSNQASICSICVNYRLNEYNTSLKSLKDRRDSLYSKLSEVLVQKGKG 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ++++V QNEKLAR++EKL   KE++ QG+AKIE +S +LK KY LLESA+S LEKN+
Sbjct: 60   DDQENYIVLQNEKLARLKEKLHRSKEQVTQGRAKIETVSADLKHKYGLLESALSTLEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            VEQLEKFYPNLIC Q+LG++AITSERL+KQSV+IKQICKLFP RRV +EGE +DG  GQY
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEIRDGCSGQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+ARLPR LDPHSVP+EEL+ASLGYMVQLLNLVVHN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPHSVPSEELSASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCN-IGETSWS-DKSSSNFGVASMESVRKPHLD--XX 970
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWS DKSSSNFGVASMES ++  LD    
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTAGENSWSTDKSSSNFGVASMESEKRNRLDSSGN 299

Query: 969  XXXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLA 790
                       S++THKDLQKGISLLKKSVACITA CYNSL L+ P+EASTFE+FA+LLA
Sbjct: 300  SNFNYSLASLHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFESFAKLLA 359

Query: 789  TLSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDN 610
            TLSSSKE+RSVFSLKMA SR+ KQVQQLNKSVWN++S +SS+TL+ESAH++ TTR  ++N
Sbjct: 360  TLSSSKEVRSVFSLKMAQSRTCKQVQQLNKSVWNMNSVISSTTLLESAHSVPTTR--IEN 417

Query: 609  NIPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
             +PSS ASFLY  + +D GK E  +EGWD++EHPTFPPPPSQ+EDVEHWTRAMFIDA +K
Sbjct: 418  YLPSSTASFLYATDLND-GKNECLIEGWDIIEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 476


>ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoformX1 [Glycine
            max]
          Length = 475

 Score =  677 bits (1746), Expect = 0.0
 Identities = 344/479 (71%), Positives = 405/479 (84%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M RKT+ C AICE+SN ASIC+ CVNYRLNEYNT+LK L+ RRD+LY +LSEVL+ KGK 
Sbjct: 1    MARKTSNC-AICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKG 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W V Q+EKLAR++EKLR  KE++ QG+AKIE MS +LK+KY LLESA+S LEKN+
Sbjct: 60   DDQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            VEQLEKFYPNLIC Q+LG++AITSE L+K+SV+IKQICKLFP RRV +EGE++DG  GQY
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+ARLPR LDPHSVP+EEL+ SLGYMVQLLNLV+HN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWS++SSSNFGVAS+ES R+  LD     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGST 299

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      S++THKDLQKGISLLKKSV CITA CYNSL L+VP+EASTFEAFA+LLAT
Sbjct: 300  SFNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLAT 359

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            L+SSKE+RSVFSLKMA SR+ KQVQQLNKSVWN++SA+SS+TL+ESAH++ TTR  ++N 
Sbjct: 360  LASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTR--IENY 417

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +PSS  SFLY  + SD GK E  +EGWD+VEHPTFPPPPSQ+EDVEHWTRAMFIDA  K
Sbjct: 418  LPSSTGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 475


>ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max]
          Length = 474

 Score =  676 bits (1743), Expect = 0.0
 Identities = 345/479 (72%), Positives = 404/479 (84%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M RKT+ C AICE+SN ASIC+ CVNYRLNEYNT+LK L+ RRD+LYS+LSEVL+ KGK 
Sbjct: 1    MARKTSNC-AICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYSKLSEVLVRKGKG 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W V Q+EKLAR++EKLR  KE++ QG+AKIE  S +LK+KY LLESA+S LEKN+
Sbjct: 60   DDQANWRVLQHEKLARLKEKLRQGKEQVTQGRAKIETKSADLKLKYGLLESALSTLEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            VEQLEKFYPNLIC Q+LG++AITSERL+KQSV+IKQICKLFP RRV +EGE+ DG  GQ+
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERGDGCCGQF 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+ARLPR LDP SVP+EEL+ SLGYMVQLLNL+VHN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPRSVPSEELSTSLGYMVQLLNLIVHNLAAPALHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWS++SSSNFGVASMES R+  LD     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTGGENSWSERSSSNFGVASMESERRHRLDSSGSS 299

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      S++THKDLQKGISLLKKSVACITA CYNSL L+VP+EASTFEAFA+LLAT
Sbjct: 300  SFNYSLASSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLAT 359

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            LSSSKE+RSVFSLKM  SR+ KQVQQLNKSVWN++SA+SS+TL+ESAH++ TTR  ++N 
Sbjct: 360  LSSSKEVRSVFSLKMPRSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTR--IENY 417

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +PS+ ASFLY  +    GK E  VEGWD+VEHPTFPPPPSQ+EDVEHWTRAMFIDA +K
Sbjct: 418  LPSATASFLYATDSD--GKNECLVEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 474


>ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|550344612|gb|ERP64187.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 506

 Score =  669 bits (1725), Expect = 0.0
 Identities = 345/507 (68%), Positives = 400/507 (78%), Gaps = 35/507 (6%)
 Frame = -2

Query: 1845 KTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKADDQ 1666
            K ++CCAICE+SN ASIC  CVNYRLNEY T LKSL SRRD+LYS+LS VL+ KGKADDQ
Sbjct: 3    KKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQ 62

Query: 1665 KSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQVEQ 1486
             +W V QNEKLA  REKL  +KE+L QGKAK+EK+S++LK K  +LESA +VLEKN++EQ
Sbjct: 63   FNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQ 122

Query: 1485 LEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQYDTI 1306
            LEKFYPNLIC Q+LG+MAITSE L+KQSV+IKQICKLFP RRVNV+GE+   F GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYDQI 180

Query: 1305 CSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQRDSY 1126
            C+ARLPRGLDPHSV +EELAASLGYMVQLLNLV HN+ AP LHN+GFAGSCSRIWQRDSY
Sbjct: 181  CNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSY 240

Query: 1125 WDARPSSR--------------------------------SEYPLFIPRPTFCNI-GETS 1045
            W+A PSSR                                +EYPLFIPR  +C+   E S
Sbjct: 241  WNACPSSRRYFDWKSLCFGISVAKFELLLLSELNILCACSNEYPLFIPRQNYCSTSSENS 300

Query: 1044 WSDKSSSNFGVASMESVRKPHLD--XXXXXXXXXXXXXSIETHKDLQKGISLLKKSVACI 871
            W+DKSSSNFGVASMES R+PHLD               S+ETHKDLQKG+SLLKKSVAC+
Sbjct: 301  WTDKSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVACV 360

Query: 870  TACCYNSLGLEVPAEASTFEAFARLLATLSSSKEMRSVFSLKMASSRSSKQVQQLNKSVW 691
            TA CYN L L+VP++ STFEAFA+LL+TLSSSKE+RSVF+LKMA SRS KQVQ+LNKSVW
Sbjct: 361  TAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEVRSVFNLKMACSRSCKQVQKLNKSVW 420

Query: 690  NVDSAVSSSTLMESAHALHTTRNALDNNIPSSAASFLYPVEPSDAGKYENFVEGWDLVEH 511
            NV+SA+SSS L+ESAHAL   +N  DNN+P+SAASFL+    SD GK E+F++GWDLVEH
Sbjct: 421  NVNSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGISD-GKNESFIDGWDLVEH 479

Query: 510  PTFPPPPSQTEDVEHWTRAMFIDATKK 430
            PTFPPPPSQ ED+EHWTRAMFIDATKK
Sbjct: 480  PTFPPPPSQVEDIEHWTRAMFIDATKK 506


>ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula]
            gi|355516236|gb|AES97859.1| hypothetical protein
            MTR_5g061040 [Medicago truncatula]
          Length = 501

 Score =  669 bits (1725), Expect = 0.0
 Identities = 344/489 (70%), Positives = 404/489 (82%), Gaps = 17/489 (3%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M RK+ T CAICE+ N  SIC+ CVNYRLNEYN++LKSL+ RRD+LYS+LSEVL+ KGK 
Sbjct: 1    MARKS-TNCAICENLNQPSICSVCVNYRLNEYNSSLKSLKERRDSLYSKLSEVLVRKGKG 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ +W V ++EKLAR REKLR +KE++ QG+AKI+ MS +LK+KY +LESA+S+LEKN+
Sbjct: 60   DDQTNWRVLRHEKLARSREKLRHNKEQVTQGRAKIQAMSADLKLKYGVLESALSMLEKNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            VEQLEKFYPNLIC Q+LG++AITSERL+KQSV+IKQICKLFP RRV +EGEK D   GQY
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+ARLPR LDPHSVP+EEL+ASLGYMVQLLNLV HN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPHSVPSEELSASLGYMVQLLNLVAHNLAAPALHNSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS--------------EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASME 1000
            DSYWDARPSSRS              EYPLFIPR  +C+  GE SWS+KSSSNFGVASME
Sbjct: 240  DSYWDARPSSRSKNFFNLKYSLFFSNEYPLFIPRQNYCSTSGENSWSEKSSSNFGVASME 299

Query: 999  SVRKPHLD--XXXXXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAE 826
            S R+P LD               S+++HKDLQKGISLLKKSVACITA CYNSL  ++P+E
Sbjct: 300  SDRRPRLDSSGSSSFNYSLASSHSVQSHKDLQKGISLLKKSVACITAYCYNSLCFDIPSE 359

Query: 825  ASTFEAFARLLATLSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESA 646
            ASTFEAFA+LLATLSSSKE+RSVFSLKMA SR+ KQVQQLNKSVWN++SA SS+TL+ES 
Sbjct: 360  ASTFEAFAKLLATLSSSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSANSSTTLLEST 419

Query: 645  HALHTTRNALDNNIPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEH 466
            H++ TTR  ++N +P+SAASFLYP + SD  K E  +EGWD+VEHPT PPPPSQ+EDVEH
Sbjct: 420  HSVPTTR--IENYMPNSAASFLYPTDSSDR-KSECLIEGWDIVEHPTLPPPPSQSEDVEH 476

Query: 465  WTRAMFIDA 439
            WTRAMFIDA
Sbjct: 477  WTRAMFIDA 485


>gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]
          Length = 478

 Score =  649 bits (1673), Expect = 0.0
 Identities = 335/479 (69%), Positives = 390/479 (81%), Gaps = 4/479 (0%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            M RK+ T CA+CE+SNL SIC+ CVNYRL ++   LKS +S RD+LYSRL EVLL KGKA
Sbjct: 1    MNRKS-TSCALCENSNLPSICSICVNYRLADHYNILKSNKSHRDSLYSRLEEVLLAKGKA 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQ  W + QNEKLA++REK R  KE L+QGKAK+E+M  +LKVK  +LE+A S+LE N+
Sbjct: 60   DDQVGWRMSQNEKLAKLREKHRRSKERLVQGKAKVERMHYDLKVKSGVLEAARSMLENNR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            +EQLEKFYPN IC Q LG+MAITSERL+KQSV+IKQICKLFP RRV ++GE+K+G   QY
Sbjct: 120  MEQLEKFYPNFICTQTLGHMAITSERLHKQSVVIKQICKLFPHRRVIIDGERKNGSAEQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC+ARLPRG+DPHSV +EEL ASLGYMVQLLNL+V  + APALHNSGFAGS SRIWQR
Sbjct: 180  DQICNARLPRGVDPHSVASEELGASLGYMVQLLNLIVRILAAPALHNSGFAGSNSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNIG-ETSWSDKSSSNFGVASMESVRKPHLD--XXX 967
            DSYWDARPSSRS EYPLFIPR  +C+   E SWSD+SSSNFGV S+ES RK  LD     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTSVENSWSDRSSSNFGVTSIESERKVRLDSSGSN 299

Query: 966  XXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLAT 787
                      SIETHKDLQKGISLLKKSVACIT  CYNSL L+VP+EASTFEAFA+LLAT
Sbjct: 300  SFNYSSASPHSIETHKDLQKGISLLKKSVACITTYCYNSLCLDVPSEASTFEAFAKLLAT 359

Query: 786  LSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNN 607
            LSSSKE+RSV S+K A SRS+KQVQQLNKSVWNV+SA +S+TL++SAH + + +N  +NN
Sbjct: 360  LSSSKELRSVCSIKSACSRSNKQVQQLNKSVWNVNSAFASTTLLDSAHTVASMKNIGENN 419

Query: 606  IPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +P+ A SFLY  E SDAGK E  +EGWDL+EHPTFPPPPSQ EDVEHWTRAMFIDATKK
Sbjct: 420  LPNPATSFLYATE-SDAGKNEFIIEGWDLIEHPTFPPPPSQCEDVEHWTRAMFIDATKK 477


>ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum]
            gi|557098297|gb|ESQ38733.1| hypothetical protein
            EUTSA_v10028627mg [Eutrema salsugineum]
          Length = 474

 Score =  629 bits (1623), Expect = e-177
 Identities = 324/477 (67%), Positives = 390/477 (81%), Gaps = 5/477 (1%)
 Frame = -2

Query: 1845 KTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKADDQ 1666
            K ++ CAICE++N ASIC+ CVNYRL EY+T LKSL++RRDALYS+LSE+L  KGKADDQ
Sbjct: 3    KRSSNCAICENTNRASICSVCVNYRLIEYSTLLKSLKTRRDALYSKLSELLEAKGKADDQ 62

Query: 1665 KSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQVEQ 1486
            K+W + QNEKL+ ++  LR +KE++ QGKAKIE+ SR+LK+KY +L+SA S LE+ +VEQ
Sbjct: 63   KNWKLIQNEKLSGLKNNLRRNKEQVTQGKAKIERESRDLKLKYGVLDSARSTLERIRVEQ 122

Query: 1485 LEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQYDTI 1306
            +EK++PNLIC Q+LG+MAI+SERL+KQSV++KQ+CKLFP RRV+ +GE ++G VGQY+ I
Sbjct: 123  VEKYFPNLICTQSLGHMAISSERLHKQSVVMKQVCKLFPQRRVSFDGESQNGSVGQYNLI 182

Query: 1305 CSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQRDSY 1126
            C++RLP+GLDPHS+P+EELAASLG MVQLLNLVVHN+ APALHNSGFAGSCSRIWQRDSY
Sbjct: 183  CNSRLPKGLDPHSIPSEELAASLGLMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1125 WDARPSSRS-EYPLFIPRPTFCNIG-ETSWSDKSSSNFGVASMESVRK-PHLD--XXXXX 961
            WDARPS+RS EYPLFIPR  +C+   E SW+DK+SSNFGVASMES RK   LD       
Sbjct: 243  WDARPSTRSNEYPLFIPRQNYCSTSVENSWTDKNSSNFGVASMESDRKEARLDSTGRNSF 302

Query: 960  XXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLATLS 781
                    S+E+H+DLQKGI+LLKKSVAC+TA CYNSL LEVP EASTFEAFA+LLATLS
Sbjct: 303  NYSSASPHSVESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLS 362

Query: 780  SSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDNNIP 601
            SSKE+RSVFSLKMASSRS KQ QQLNKS+WN  S +SSS L  S    H  RNA  N  P
Sbjct: 363  SSKEVRSVFSLKMASSRSCKQAQQLNKSIWNAHSVISSSILESS----HLPRNASYNQDP 418

Query: 600  SSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
            +SAAS+L   E S+  K  N + GWDLVEHP +PPPPSQ+EDVEHWTRAMFIDA KK
Sbjct: 419  NSAASYLSGTELSEIRK-SNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 474


>ref|XP_004514262.1| PREDICTED: uncharacterized protein LOC101503483 isoform X1 [Cicer
            arietinum]
          Length = 427

 Score =  613 bits (1581), Expect = e-173
 Identities = 311/424 (73%), Positives = 363/424 (85%), Gaps = 4/424 (0%)
 Frame = -2

Query: 1689 VKGKADDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSV 1510
            ++GK DDQ +W V Q+EKLAR+REKLR  KE++ QG+AK+E +S +LK+KY +LESA+S+
Sbjct: 7    LQGKGDDQTNWRVVQHEKLARLREKLRHSKEQVSQGRAKVEALSADLKLKYGVLESALSM 66

Query: 1509 LEKNQVEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDG 1330
            LEKN+VEQLEKFYPNLIC Q+LG++AITSERL+KQSV+IKQICKLFP RRV +EGE++D 
Sbjct: 67   LEKNRVEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERRDD 126

Query: 1329 FVGQYDTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCS 1150
              GQYD IC+ARLPR LDPHSVP+E L+ SLGYMVQLLNLVVHN+ APALHNSGFAGSCS
Sbjct: 127  CSGQYDQICNARLPRALDPHSVPSEALSTSLGYMVQLLNLVVHNLAAPALHNSGFAGSCS 186

Query: 1149 RIWQRDSYWDARPSSRS-EYPLFIPRPTFCNI-GETSWSDKSSSNFGVASMESVRKPHLD 976
            RIWQRDSYWDARPSSRS EYPLFIPR  +C+  GE SWSDKSSSNFGVASMES R+P LD
Sbjct: 187  RIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESDRRPRLD 246

Query: 975  --XXXXXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFA 802
                           S++THKDLQKGISLLKKSVACITA CYNSL L+VP EASTFEAFA
Sbjct: 247  SSGSSSFNYSLGSSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPIEASTFEAFA 306

Query: 801  RLLATLSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRN 622
            +LLATLSSSKE+RSVFSLKMA SR+ KQVQQLNKSVWN++SA+SS+TL+ESAH++ TTR 
Sbjct: 307  KLLATLSSSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTR- 365

Query: 621  ALDNNIPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFID 442
             ++N +PSSAASFLYP + SD GK E  +EGWD+VEHPT PPPPSQ+EDVEHWTRAMFID
Sbjct: 366  -IENYMPSSAASFLYPTDSSD-GKSECLIEGWDIVEHPTLPPPPSQSEDVEHWTRAMFID 423

Query: 441  ATKK 430
            A +K
Sbjct: 424  AKRK 427


>ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana]
            gi|23297675|gb|AAN13006.1| unknown protein [Arabidopsis
            thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA
            polymerase II protein [Arabidopsis thaliana]
          Length = 473

 Score =  612 bits (1577), Expect = e-172
 Identities = 317/480 (66%), Positives = 386/480 (80%), Gaps = 5/480 (1%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            MT++++ C AIC+++N   IC  CVN+RL EYNT LKSL++RRD+L SR +E+L  KGKA
Sbjct: 1    MTKRSSNC-AICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKA 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQK+W + QNEK++++++KL+ +KE + QGK KIE+ S +LKVKY +L+SA S LEK +
Sbjct: 60   DDQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            VEQ+EK++PNLIC Q+LG+MAI+SERL+KQSV++KQICKLFP+RRV+ +GE ++G V QY
Sbjct: 120  VEQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC++RLP GLDPHS+P+EELA SLGYMVQLLNLVVHN+ APALH+SGFAGSCSRIWQR
Sbjct: 180  DVICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNIG-ETSWSDKSSSNFGVASMESVRK-PHLD--XX 970
            DSYWD R S+RS EYPLFIPR  +C+   E SW+DK+SSNFGVASMES RK P LD    
Sbjct: 240  DSYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGS 299

Query: 969  XXXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLA 790
                       SIE+H+DLQKGI+LLKKSVAC+TA CYNSL LEVP EASTFEAFA+LLA
Sbjct: 300  NSFKYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLA 359

Query: 789  TLSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDN 610
            TLSSSKE+RSVFSLKMASSRS KQ QQLNKS+WN  S +SSS L+ESA   H  RN   N
Sbjct: 360  TLSSSKEVRSVFSLKMASSRSGKQAQQLNKSIWNAHSVISSS-LLESA---HLPRNTSYN 415

Query: 609  NIPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
              P+S AS+L   E S   +  N + GWDLVEHP +PPPPSQ+EDVEHWTRAMFIDA KK
Sbjct: 416  QDPNSPASYLSATELST--RKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473


>gb|AAL59980.1| unknown protein [Arabidopsis thaliana]
          Length = 473

 Score =  612 bits (1577), Expect = e-172
 Identities = 317/480 (66%), Positives = 386/480 (80%), Gaps = 5/480 (1%)
 Frame = -2

Query: 1854 MTRKTNTCCAICESSNLASICAPCVNYRLNEYNTNLKSLRSRRDALYSRLSEVLLVKGKA 1675
            MT++++ C AIC+++N   IC  CVN+RL EYNT LKSL++RRD+L SR +E+L  KGKA
Sbjct: 1    MTKRSSNC-AICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKA 59

Query: 1674 DDQKSWMVFQNEKLARMREKLRLHKEELIQGKAKIEKMSRELKVKYELLESAMSVLEKNQ 1495
            DDQK+W + QNEK++++++KL+ +KE + QGK KIE+ S +LKVKY +L+SA S LEK +
Sbjct: 60   DDQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTR 119

Query: 1494 VEQLEKFYPNLICAQNLGYMAITSERLYKQSVIIKQICKLFPMRRVNVEGEKKDGFVGQY 1315
            VEQ+EK++PNLIC Q+LG+MAI+SERL+KQSV++KQICKLFP+RRV+ +GE ++G V QY
Sbjct: 120  VEQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQY 179

Query: 1314 DTICSARLPRGLDPHSVPTEELAASLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 1135
            D IC++RLP GLDPHS+P+EELA SLGYMVQLLNLVVHN+ APALH+SGFAGSCSRIWQR
Sbjct: 180  DVICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQR 239

Query: 1134 DSYWDARPSSRS-EYPLFIPRPTFCNIG-ETSWSDKSSSNFGVASMESVRK-PHLD--XX 970
            DSYWD R S+RS EYPLFIPR  +C+   E SW+DK+SSNFGVASMES RK P LD    
Sbjct: 240  DSYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGS 299

Query: 969  XXXXXXXXXXXSIETHKDLQKGISLLKKSVACITACCYNSLGLEVPAEASTFEAFARLLA 790
                       SIE+H+DLQKGI+LLKKSVAC+TA CYNSL LEVP EASTFEAFA+LLA
Sbjct: 300  NSFMYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLA 359

Query: 789  TLSSSKEMRSVFSLKMASSRSSKQVQQLNKSVWNVDSAVSSSTLMESAHALHTTRNALDN 610
            TLSSSKE+RSVFSLKMASSRS KQ QQLNKS+WN  S +SSS L+ESA   H  RN   N
Sbjct: 360  TLSSSKEVRSVFSLKMASSRSGKQAQQLNKSIWNAHSVISSS-LLESA---HLPRNTSYN 415

Query: 609  NIPSSAASFLYPVEPSDAGKYENFVEGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 430
              P+S AS+L   E S   +  N + GWDLVEHP +PPPPSQ+EDVEHWTRAMFIDA KK
Sbjct: 416  QDPNSPASYLSATELST--RKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473

