BLASTX nr result

ID: Catharanthus22_contig00012930 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00012930
         (2480 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   554   e-155
gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus pe...   553   e-154
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...   552   e-154
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   552   e-154
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   550   e-153
ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   550   e-153
gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus...   548   e-153
ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   546   e-152
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   545   e-152
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   544   e-152
gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 ...   543   e-151
ref|XP_006434071.1| hypothetical protein CICLE_v10001001mg [Citr...   543   e-151
ref|XP_006596316.1| PREDICTED: uncharacterized protein LOC100776...   542   e-151
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   542   e-151
ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   541   e-151
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   532   e-148
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   528   e-147
ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr...   522   e-145
ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido...   516   e-143
gb|AAL59980.1| unknown protein [Arabidopsis thaliana]                 516   e-143

>ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera]
            gi|302141899|emb|CBI19102.3| unnamed protein product
            [Vitis vinifera]
          Length = 478

 Score =  554 bits (1428), Expect = e-155
 Identities = 285/411 (69%), Positives = 329/411 (80%), Gaps = 4/411 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            MTRKT++C ++CE SNLASIC  CVNYRLNEYNT+LKS   RRD+LY RLSEVLVAKGKA
Sbjct: 1    MTRKTSSC-SICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKA 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V QN                 + GKAK+EKMS DLK+KY LLESA S+L KNR
Sbjct: 60   DDQINWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VEQLEKF+PNLICTQNLG MAITSER +KQSV++KQ+CKLFP RR+N++G+KKDG    Y
Sbjct: 120  VEQLEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICN RLPR LDPHSVP++ELA SLGYMVQLLNLVV+N+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            +SYW+ RPSSRS EYPLFIPR   C+  GE SWS++SSSNFG+ASMES RKP LE     
Sbjct: 240  ESYWNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSS 299

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       +ETH++LQKGISLLKKSVAC+T YCY+SLCL+VP EASTFEAFA+LLA 
Sbjct: 300  SFNYSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAI 359

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFP 1470
            LSSSKE+RSVFSLKM  SRS KQVQQLNK++ N++SA+SSS L+ESAH  P
Sbjct: 360  LSSSKEVRSVFSLKMACSRSCKQVQQLNKSIWNMNSAISSSTLLESAHTLP 410


>gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica]
            gi|462422646|gb|EMJ26909.1| hypothetical protein
            PRUPE_ppa005050mg [Prunus persica]
          Length = 479

 Score =  553 bits (1424), Expect = e-154
 Identities = 285/409 (69%), Positives = 331/409 (80%), Gaps = 4/409 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M RK++ C A+CESSNLAS+C  CVNYRL EYN++LK+L +RRD+LYSRL+E LVAKGKA
Sbjct: 2    MNRKSSNC-AICESSNLASVCAICVNYRLTEYNSSLKALKSRRDSLYSRLTEALVAKGKA 60

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V QN                 +QGKAKIEK S DLKVK  +LESA +VL KNR
Sbjct: 61   DDQLNWRVLQNEKLVRLREKLRCNKEQLVQGKAKIEKTSYDLKVKSGVLESALAVLEKNR 120

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
             EQLEKF+PN ICTQNLG+MAITSERL+KQSV++KQ+CKLFP RR+ V+  +KD   GQY
Sbjct: 121  AEQLEKFYPNFICTQNLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKDASGGQY 180

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICNA LPRGLDPHSVP+EELA SLGYMVQLLNLVV N+ APALHNSGFAGSCSRIWQR
Sbjct: 181  DQICNACLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLAAPALHNSGFAGSCSRIWQR 240

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWSD+SSSNFGVAS++S RKPHL+     
Sbjct: 241  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIDSERKPHLDSSGSS 300

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       +ETH++LQ+GISLLKKSVACITAYCYNSLCL+VP+EASTFEAFA+LLAT
Sbjct: 301  SFNYTSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLAT 360

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHA 1464
            LSSSKE+ SVFSLKM  SRS KQVQQLNK++ NV+SA+SS+ L++SAHA
Sbjct: 361  LSSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHA 409


>ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum
            lycopersicum]
          Length = 481

 Score =  552 bits (1423), Expect = e-154
 Identities = 292/414 (70%), Positives = 329/414 (79%), Gaps = 10/414 (2%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            MTRKT+ CC +CE+SNL S+C  CVNYRLNEY+T LKSL  RR+AL  +LSE+L+AKGKA
Sbjct: 1    MTRKTS-CCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGQLSEILLAKGKA 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ S  V +N                  QGKAKIEKMS DLKV+YELL SA  +L KNR
Sbjct: 60   DDQLSWRVPRNEKLARLREKLRQQKEQVSQGKAKIEKMSHDLKVQYELLGSATRMLEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
             EQLEKF+PNLICTQNLG+MAITSE L+KQSV+VKQ+CKLFP RR+ ++GDKKDG  GQY
Sbjct: 120  AEQLEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D+ICNARLP+GLDPHSVP++EL+ SLGYMVQLLNLVV  VCAPALHNSGFAGSCSRIWQR
Sbjct: 180  DSICNARLPKGLDPHSVPSDELSASLGYMVQLLNLVVRCVCAPALHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKS------SSNFGVASMESVRKPHL 1125
            DSYWDARPSSRS EYPLFIPR  FC+  GEASW D+S      SSNFGV SMES RKP L
Sbjct: 240  DSYWDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSSSNSGTSSNFGVTSMESDRKPRL 299

Query: 1126 E--XXXXXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAF 1299
            +                IETH++LQKGI+LLKKSVACITAYCYN+LCLEVPAEASTFE F
Sbjct: 300  DSSSSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETF 359

Query: 1300 ARLLATLSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAH 1461
            ARLLATLSSSKE+RSVFSLKM  SR+SKQVQ LNK++ NVDSA SSS L+ES H
Sbjct: 360  ARLLATLSSSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGH 413


>ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca
            subsp. vesca]
          Length = 478

 Score =  552 bits (1422), Expect = e-154
 Identities = 282/408 (69%), Positives = 331/408 (81%), Gaps = 4/408 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            MT K ++ CA+CE+SNLASIC  CVNYRLN+YN +LK+L +RRD LYSRLS+ LVAKGKA
Sbjct: 1    MTNKKSSNCAICENSNLASICAVCVNYRLNDYNNSLKALKSRRDLLYSRLSDALVAKGKA 60

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  + Q+                 +QGKAKIEK S DLKVKY +LESA S+L KNR
Sbjct: 61   DDQLNWRILQDEKLVRLREKLRRNKEQLVQGKAKIEKTSYDLKVKYGVLESALSMLEKNR 120

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
             EQLEKF+PNLICTQ+LG+MAITSERL+KQSV++KQ+CKLFP RR+ V+  +K+G  GQY
Sbjct: 121  AEQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKEGSGGQY 180

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICNA LPRGLDPHSVP+EELA SLGYMVQLLNLVV N+ APALHNSGFAGSCSRIWQR
Sbjct: 181  DQICNASLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLGAPALHNSGFAGSCSRIWQR 240

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWSD+SSSNFGVAS+ES RKP L+     
Sbjct: 241  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIESERKPRLDSSGSS 300

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       +ETH++LQ+GISLLKKSVACITAYCYNSLCL+VP+EASTFEAFA+LL+T
Sbjct: 301  SFNYSSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLST 360

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAH 1461
            LSSSKE+ SVFSLKM  SRS KQVQQLNK++ NV+SA+SS+ L++SAH
Sbjct: 361  LSSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAH 408


>ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum
            tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED:
            uncharacterized protein LOC102590673 isoform X2 [Solanum
            tuberosum]
          Length = 483

 Score =  550 bits (1416), Expect = e-153
 Identities = 290/414 (70%), Positives = 328/414 (79%), Gaps = 10/414 (2%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            MT KT+ CC +CE+SNL S+C  CVNYRLNEY+T LKSL  RR+AL  +LSE+L+AKGKA
Sbjct: 1    MTLKTS-CCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGKLSEILLAKGKA 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ S  V +N                  QGKAKIEKMS DLKV+YELL SA  +L KNR
Sbjct: 60   DDQLSWRVPRNEKLARLREKLRQQKEQISQGKAKIEKMSHDLKVQYELLGSATRMLEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
             EQLEKF+PNLICTQNLG+MAITSE L+KQSV+VKQ+CKLFP RR+ ++GDKKDG  GQY
Sbjct: 120  AEQLEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D+ICNARLP+GLDPHSVP++EL+ SLGYMVQLLNLV+  VCAPALHNSGFAGSCSRIWQR
Sbjct: 180  DSICNARLPKGLDPHSVPSDELSASLGYMVQLLNLVIRCVCAPALHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKS------SSNFGVASMESVRKPHL 1125
            DSYWDARPSSRS EYPLFIPR  FC+  GEASW D+S      SSNFGV SMES RKP L
Sbjct: 240  DSYWDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSCSNSGTSSNFGVTSMESDRKPRL 299

Query: 1126 E--XXXXXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAF 1299
            +                IETH++LQKGI+LLKKSVACITAYCYN+LCLEVPAEASTFE F
Sbjct: 300  DSSSSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETF 359

Query: 1300 ARLLATLSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAH 1461
            ARLLATLSSSKE+RSVFSLKM  SR+SKQVQ LNK++ NVDSA SSS L+ES H
Sbjct: 360  ARLLATLSSSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGH 413


>ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
            gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase
            II, putative [Ricinus communis]
          Length = 478

 Score =  550 bits (1416), Expect = e-153
 Identities = 286/406 (70%), Positives = 326/406 (80%), Gaps = 4/406 (0%)
 Frame = +1

Query: 259  KTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKADDQ 438
            K ++CCA+CE+SN ASIC  CVNYRLNEY+T LKSL +RRD LYSRLSEVLVAKGKADDQ
Sbjct: 3    KKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKADDQ 62

Query: 439  KSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNRVEQ 618
             +  V QN                 IQ KAK EKMS DL  KY LLES+RS L KNRV+Q
Sbjct: 63   LNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRVDQ 122

Query: 619  LEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQYDTI 798
            LEK+FPNLICTQ+LG+MAITSE L+  SV VKQ+CKLFP RR+ VEG+KKDG  GQYD I
Sbjct: 123  LEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYDQI 182

Query: 799  CNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQRDSY 978
            CNARLPRGLDPHS+P+EELA SLGYMVQLLNLVVHN+ APALHNSGFAGSCSRIWQRDSY
Sbjct: 183  CNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 979  WDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXXXXX 1146
            W+ARPSSRS EYPLFIPR  +C+  GE SW+D+SSSNFGVASMES R+  L+        
Sbjct: 243  WNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSSFN 302

Query: 1147 XXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLATLSS 1326
                    +ETH++LQKGISL+KKSVAC+TAY YN LCL+VPAEASTFEAFA+LLATLSS
Sbjct: 303  YNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATLSS 362

Query: 1327 SKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHA 1464
            SKE+RSVFSLKM  SRS KQVQ+LNK++ NV+S +SSS L+ESAHA
Sbjct: 363  SKEVRSVFSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHA 408


>gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris]
            gi|561014256|gb|ESW13117.1| hypothetical protein
            PHAVU_008G169200g [Phaseolus vulgaris]
          Length = 476

 Score =  548 bits (1412), Expect = e-153
 Identities = 291/451 (64%), Positives = 349/451 (77%), Gaps = 11/451 (2%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M RKT+ C A+CE+SN ASIC  CVNYRLNEYNT+LKSL +RRD+LYS+LSEVLV KGK 
Sbjct: 1    MARKTSNC-AICENSNQASICSICVNYRLNEYNTSLKSLKDRRDSLYSKLSEVLVQKGKG 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ++ +V QN                  QG+AKIE +S DLK KY LLESA S L KNR
Sbjct: 60   DDQENYIVLQNEKLARLKEKLHRSKEQVTQGRAKIETVSADLKHKYGLLESALSTLEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VEQLEKF+PNLICTQ+LG++AITSERL+KQSV++KQ+CKLFP RR+ +EG+ +DG  GQY
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEIRDGCSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICNARLPR LDPHSVP+EEL+ SLGYMVQLLNLVVHN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPHSVPSEELSASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCN-IGEASWS-DKSSSNFGVASMESVRKPHLE--XX 1134
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWS DKSSSNFGVASMES ++  L+    
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTAGENSWSTDKSSSNFGVASMESEKRNRLDSSGN 299

Query: 1135 XXXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLA 1314
                        ++TH++LQKGISLLKKSVACITAYCYNSLCL+ P+EASTFE+FA+LLA
Sbjct: 300  SNFNYSLASLHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFESFAKLLA 359

Query: 1315 TLSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFPAMV*VLVP 1494
            TLSSSKE+RSVFSLKM  SR+ KQVQQLNK++ N++S +SS+ L+ESAH+ P      + 
Sbjct: 360  TLSSSKEVRSVFSLKMAQSRTCKQVQQLNKSVWNMNSVISSTTLLESAHSVPT---TRIE 416

Query: 1495 TLFPS--GEFIYFF*L----HYCLLEHFNLL 1569
               PS    F+Y   L    + CL+E ++++
Sbjct: 417  NYLPSSTASFLYATDLNDGKNECLIEGWDII 447


>ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus
            sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED:
            uncharacterized protein LOC102626964 isoform X2 [Citrus
            sinensis]
          Length = 478

 Score =  546 bits (1408), Expect = e-152
 Identities = 279/411 (67%), Positives = 330/411 (80%), Gaps = 4/411 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M +K + C A+CE+SN ASIC  CVNYRL+E NT LKSL +RRDALY RLSEVLVAKGKA
Sbjct: 1    MNKKASNC-AICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKA 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V QN                  QGK KIEK S DLKV+Y +L+SARS++ KNR
Sbjct: 60   DDQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKVRYAILDSARSMMEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
             EQLEKF+PN+ICTQ+LG+MAI SE L+KQSV++KQ+CKLFP RR+N++G+++DG  GQY
Sbjct: 120  AEQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D IC ARLP+GLDPHSVP+EELA SLGYMVQLLNLVV N+  P LHNSGFAGSCSRIWQR
Sbjct: 180  DQICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPILHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYWDARPSSRS EYPLFIPR  +C+  GE SW+D+SSSNFGVASMES R+P L+     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRST 299

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       +ETH++LQKGISLLKKSVAC+TAYCYNSLCL+VPAEASTFEAFA+LLAT
Sbjct: 300  SFNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLAT 359

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFP 1470
            LSSSKE+RSVFSLKM  SRS KQVQ+LN+++ N++SA+SS+ L+ESAH FP
Sbjct: 360  LSSSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFP 410


>ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoform X1 [Glycine
            max]
          Length = 475

 Score =  545 bits (1404), Expect = e-152
 Identities = 286/450 (63%), Positives = 348/450 (77%), Gaps = 10/450 (2%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M RKT+ C A+CE+SN ASIC  CVNYRLNEYNT+LK L +RRD+LY +LSEVLV KGK 
Sbjct: 1    MARKTSNC-AICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKG 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V Q+                  QG+AKIE MS DLK+KY LLESA S L KNR
Sbjct: 60   DDQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VEQLEKF+PNLICTQ+LG++AITSE L+K+SV++KQ+CKLFP RR+ +EG+++DG  GQY
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICNARLPR LDPHSVP+EEL+ SLGYMVQLLNLV+HN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWS++SSSNFGVAS+ES R+  L+     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGST 299

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       ++TH++LQKGISLLKKSV CITAYCYNSLCL+VP+EASTFEAFA+LLAT
Sbjct: 300  SFNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLAT 359

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFPAMV*VLVPT 1497
            L+SSKE+RSVFSLKM  SR+ KQVQQLNK++ N++SA+SS+ L+ESAH+ P      +  
Sbjct: 360  LASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPT---TRIEN 416

Query: 1498 LFPS--GEFIYFF*L----HYCLLEHFNLL 1569
              PS  G F+Y   L    + CL+E ++++
Sbjct: 417  YLPSSTGSFLYAADLSDGKNECLIEGWDIV 446


>ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max]
          Length = 474

 Score =  544 bits (1401), Expect = e-152
 Identities = 280/411 (68%), Positives = 331/411 (80%), Gaps = 4/411 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M RKT+ C A+CE+SN ASIC  CVNYRLNEYNT+LK L +RRD+LYS+LSEVLV KGK 
Sbjct: 1    MARKTSNC-AICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYSKLSEVLVRKGKG 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V Q+                  QG+AKIE  S DLK+KY LLESA S L KNR
Sbjct: 60   DDQANWRVLQHEKLARLKEKLRQGKEQVTQGRAKIETKSADLKLKYGLLESALSTLEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VEQLEKF+PNLICTQ+LG++AITSERL+KQSV++KQ+CKLFP RR+ +EG++ DG  GQ+
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERGDGCCGQF 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICNARLPR LDP SVP+EEL+ SLGYMVQLLNL+VHN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPRSVPSEELSTSLGYMVQLLNLIVHNLAAPALHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWS++SSSNFGVASMES R+  L+     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTGGENSWSERSSSNFGVASMESERRHRLDSSGSS 299

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       ++TH++LQKGISLLKKSVACITAYCYNSLCL+VP+EASTFEAFA+LLAT
Sbjct: 300  SFNYSLASSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLAT 359

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFP 1470
            LSSSKE+RSVFSLKM  SR+ KQVQQLNK++ N++SA+SS+ L+ESAH+ P
Sbjct: 360  LSSSKEVRSVFSLKMPRSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVP 410


>gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao]
          Length = 479

 Score =  543 bits (1399), Expect = e-151
 Identities = 278/411 (67%), Positives = 328/411 (79%), Gaps = 4/411 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M  K  + CA+C++SN ASIC  CVNYRLNEYN+ LKSL +RRD LYS+L EVL AK KA
Sbjct: 1    MMSKKASNCAICDNSNRASICAVCVNYRLNEYNSLLKSLKSRRDFLYSKLDEVLAAKRKA 60

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  + QN                  QGKAKIE++S DLKVKY +LESAR +L KNR
Sbjct: 61   DDQLNWKILQNEKLTDLKEKLRRSKEQLAQGKAKIERVSYDLKVKYGVLESARGMLEKNR 120

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VE+LEKF+PNLICTQ+LG MAITSERL+KQSV++KQ+CKLFP RR+N++G+ +DG  GQY
Sbjct: 121  VEKLEKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVNLDGEGRDGSCGQY 180

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICN  LPRGLDPHSVP+E+LA SLGYMVQLLNLVVHN+ APALHNSGFAGSCSRIWQR
Sbjct: 181  DLICNVGLPRGLDPHSVPSEQLAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQR 240

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYW+ARPSSRS EYPLFIPR  +C+  G+ SW+D+SSSNFGVASMES R+P L+     
Sbjct: 241  DSYWNARPSSRSNEYPLFIPRQNYCSTSGDNSWTDRSSSNFGVASMESERRPRLDSSGSN 300

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       +ETH++LQ GISLLKKSVACITA+CYNSLCL+VP EASTFEAF++LLAT
Sbjct: 301  SFNYSSASSHTVETHKDLQIGISLLKKSVACITAFCYNSLCLDVPTEASTFEAFSKLLAT 360

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFP 1470
            LSS+KE+RSVFSLKM  SRSSKQ QQLNK++ NV+SA+SSS L+ESAH  P
Sbjct: 361  LSSTKEVRSVFSLKMACSRSSKQAQQLNKSVWNVNSAMSSSMLLESAHMLP 411


>ref|XP_006434071.1| hypothetical protein CICLE_v10001001mg [Citrus clementina]
            gi|557536193|gb|ESR47311.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
          Length = 421

 Score =  543 bits (1398), Expect = e-151
 Identities = 278/414 (67%), Positives = 329/414 (79%), Gaps = 4/414 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M +K + C A+CE+SN ASIC  CVNYRL+E NT LKSL +RRDALY RLSEVLVAKGKA
Sbjct: 1    MNKKASNC-AICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKA 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V QN                  QGK KIEK S DLK +Y +L+SARS++ KNR
Sbjct: 60   DDQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
             EQLEKF+PN+ICTQ+LG+MAI SE L+KQSV++KQ+CKLFP RR+N++G+++DG  GQY
Sbjct: 120  AEQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D IC ARLP+GLDPHSVP+EELA SLGYMVQLLNLVV N+  P LHNSGFAGSCSRIWQR
Sbjct: 180  DQICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYWDARPSSRS EYPLFIPR  +C+  GE SW+D+SSSNFGVASMES R+P L+     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSA 299

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       +ETH++LQKGISLLKKSVAC+TAYCYNSLCL+VPAEASTFEAFA+LLAT
Sbjct: 300  SFNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLAT 359

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFPAMV 1479
            LS SKE+RSVFSLKM  SRS KQVQ+LN+++ N++SA+SS+ L+ESAH FP  V
Sbjct: 360  LSLSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITV 413


>ref|XP_006596316.1| PREDICTED: uncharacterized protein LOC100776426 isoform X2 [Glycine
            max]
          Length = 452

 Score =  542 bits (1397), Expect = e-151
 Identities = 277/411 (67%), Positives = 331/411 (80%), Gaps = 4/411 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M RKT+ C A+CE+SN ASIC  CVNYRLNEYNT+LK L +RRD+LY +LSEVLV KGK 
Sbjct: 1    MARKTSNC-AICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKG 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V Q+                  QG+AKIE MS DLK+KY LLESA S L KNR
Sbjct: 60   DDQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VEQLEKF+PNLICTQ+LG++AITSE L+K+SV++KQ+CKLFP RR+ +EG+++DG  GQY
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICNARLPR LDPHSVP+EEL+ SLGYMVQLLNLV+HN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYWDARPSSRS EYPLFIPR  +C+  GE SWS++SSSNFGVAS+ES R+  L+     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGST 299

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       ++TH++LQKGISLLKKSV CITAYCYNSLCL+VP+EASTFEAFA+LLAT
Sbjct: 300  SFNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLAT 359

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFP 1470
            L+SSKE+RSVFSLKM  SR+ KQVQQLNK++ N++SA+SS+ L+ESAH+ P
Sbjct: 360  LASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVP 410


>ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina]
            gi|567883029|ref|XP_006434073.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883031|ref|XP_006434074.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883033|ref|XP_006434075.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536194|gb|ESR47312.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536195|gb|ESR47313.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536196|gb|ESR47314.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536197|gb|ESR47315.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
          Length = 478

 Score =  542 bits (1396), Expect = e-151
 Identities = 277/411 (67%), Positives = 328/411 (79%), Gaps = 4/411 (0%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M +K + C A+CE+SN ASIC  CVNYRL+E NT LKSL +RRDALY RLSEVLVAKGKA
Sbjct: 1    MNKKASNC-AICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKA 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V QN                  QGK KIEK S DLK +Y +L+SARS++ KNR
Sbjct: 60   DDQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
             EQLEKF+PN+ICTQ+LG+MAI SE L+KQSV++KQ+CKLFP RR+N++G+++DG  GQY
Sbjct: 120  AEQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D IC ARLP+GLDPHSVP+EELA SLGYMVQLLNLVV N+  P LHNSGFAGSCSRIWQR
Sbjct: 180  DQICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASMESVRKPHLE--XXX 1137
            DSYWDARPSSRS EYPLFIPR  +C+  GE SW+D+SSSNFGVASMES R+P L+     
Sbjct: 240  DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSA 299

Query: 1138 XXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLAT 1317
                       +ETH++LQKGISLLKKSVAC+TAYCYNSLCL+VPAEASTFEAFA+LLAT
Sbjct: 300  SFNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLAT 359

Query: 1318 LSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFP 1470
            LS SKE+RSVFSLKM  SRS KQVQ+LN+++ N++SA+SS+ L+ESAH FP
Sbjct: 360  LSLSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFP 410


>ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|566157047|ref|XP_006386388.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|566157050|ref|XP_006386389.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|222843996|gb|EEE81543.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344610|gb|ERP64185.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344611|gb|ERP64186.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 475

 Score =  541 bits (1394), Expect = e-151
 Identities = 277/410 (67%), Positives = 330/410 (80%), Gaps = 4/410 (0%)
 Frame = +1

Query: 259  KTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKADDQ 438
            K ++CCA+CE+SN ASIC  CVNYRLNEY T LKSLN+RRD+LYS+LS VL+AKGKADDQ
Sbjct: 3    KKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQ 62

Query: 439  KSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNRVEQ 618
             +  V QN                  QGKAK+EK+S+DLK K  +LESAR+VL KNR+EQ
Sbjct: 63   FNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQ 122

Query: 619  LEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQYDTI 798
            LEKF+PNLICTQ+LG+MAITSE L+KQSV++KQ+CKLFP RR+NV+G++   F GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGERN--FSGQYDQI 180

Query: 799  CNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQRDSY 978
            CNARLPRGLDPHSV +EELA SLGYMVQLLNLV HN+ AP LHN+GFAGSCSRIWQRDSY
Sbjct: 181  CNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSY 240

Query: 979  WDARPSSRS-EYPLFIPRPTFCNIG-EASWSDKSSSNFGVASMESVRKPHLEXXXXXXXX 1152
            W+A PSSRS EYPLFIPR  +C+   E SW+DKSSSNFGVASMES R+PHL+        
Sbjct: 241  WNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNSFN 300

Query: 1153 XXXXXX--IETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLATLSS 1326
                    +ETH++LQKG+SLLKKSVAC+TAYCYN LCL+VP++ STFEAFA+LL+TLSS
Sbjct: 301  YSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSS 360

Query: 1327 SKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAHAFPAM 1476
            SKE+RSVF+LKM  SRS KQVQ+LNK++ NV+SA+SSS L+ESAHA   M
Sbjct: 361  SKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLM 410


>ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula]
            gi|355516236|gb|AES97859.1| hypothetical protein
            MTR_5g061040 [Medicago truncatula]
          Length = 501

 Score =  532 bits (1371), Expect = e-148
 Identities = 276/424 (65%), Positives = 328/424 (77%), Gaps = 17/424 (4%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            M RK+ T CA+CE+ N  SIC  CVNYRLNEYN++LKSL  RRD+LYS+LSEVLV KGK 
Sbjct: 1    MARKS-TNCAICENLNQPSICSVCVNYRLNEYNSSLKSLKERRDSLYSKLSEVLVRKGKG 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQ +  V ++                  QG+AKI+ MS DLK+KY +LESA S+L KNR
Sbjct: 60   DDQTNWRVLRHEKLARSREKLRHNKEQVTQGRAKIQAMSADLKLKYGVLESALSMLEKNR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VEQLEKF+PNLICTQ+LG++AITSERL+KQSV++KQ+CKLFP RR+ +EG+K D   GQY
Sbjct: 120  VEQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICNARLPR LDPHSVP+EEL+ SLGYMVQLLNLV HN+ APALHNSGFAGSCSRIWQR
Sbjct: 180  DQICNARLPRALDPHSVPSEELSASLGYMVQLLNLVAHNLAAPALHNSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS--------------EYPLFIPRPTFCNI-GEASWSDKSSSNFGVASME 1104
            DSYWDARPSSRS              EYPLFIPR  +C+  GE SWS+KSSSNFGVASME
Sbjct: 240  DSYWDARPSSRSKNFFNLKYSLFFSNEYPLFIPRQNYCSTSGENSWSEKSSSNFGVASME 299

Query: 1105 SVRKPHLE--XXXXXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAE 1278
            S R+P L+                +++H++LQKGISLLKKSVACITAYCYNSLC ++P+E
Sbjct: 300  SDRRPRLDSSGSSSFNYSLASSHSVQSHKDLQKGISLLKKSVACITAYCYNSLCFDIPSE 359

Query: 1279 ASTFEAFARLLATLSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESA 1458
            ASTFEAFA+LLATLSSSKE+RSVFSLKM  SR+ KQVQQLNK++ N++SA SS+ L+ES 
Sbjct: 360  ASTFEAFAKLLATLSSSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSANSSTTLLEST 419

Query: 1459 HAFP 1470
            H+ P
Sbjct: 420  HSVP 423


>ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|550344612|gb|ERP64187.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 506

 Score =  528 bits (1360), Expect = e-147
 Identities = 276/441 (62%), Positives = 330/441 (74%), Gaps = 35/441 (7%)
 Frame = +1

Query: 259  KTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKADDQ 438
            K ++CCA+CE+SN ASIC  CVNYRLNEY T LKSLN+RRD+LYS+LS VL+AKGKADDQ
Sbjct: 3    KKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQ 62

Query: 439  KSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNRVEQ 618
             +  V QN                  QGKAK+EK+S+DLK K  +LESAR+VL KNR+EQ
Sbjct: 63   FNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQ 122

Query: 619  LEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQYDTI 798
            LEKF+PNLICTQ+LG+MAITSE L+KQSV++KQ+CKLFP RR+NV+G++   F GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGERN--FSGQYDQI 180

Query: 799  CNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQRDSY 978
            CNARLPRGLDPHSV +EELA SLGYMVQLLNLV HN+ AP LHN+GFAGSCSRIWQRDSY
Sbjct: 181  CNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSY 240

Query: 979  WDARPSSR--------------------------------SEYPLFIPRPTFCNI-GEAS 1059
            W+A PSSR                                +EYPLFIPR  +C+   E S
Sbjct: 241  WNACPSSRRYFDWKSLCFGISVAKFELLLLSELNILCACSNEYPLFIPRQNYCSTSSENS 300

Query: 1060 WSDKSSSNFGVASMESVRKPHLE--XXXXXXXXXXXXXXIETHRELQKGISLLKKSVACI 1233
            W+DKSSSNFGVASMES R+PHL+                +ETH++LQKG+SLLKKSVAC+
Sbjct: 301  WTDKSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVACV 360

Query: 1234 TAYCYNSLCLEVPAEASTFEAFARLLATLSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLC 1413
            TAYCYN LCL+VP++ STFEAFA+LL+TLSSSKE+RSVF+LKM  SRS KQVQ+LNK++ 
Sbjct: 361  TAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEVRSVFNLKMACSRSCKQVQKLNKSVW 420

Query: 1414 NVDSAVSSSNLIESAHAFPAM 1476
            NV+SA+SSS L+ESAHA   M
Sbjct: 421  NVNSAISSSALLESAHALQLM 441


>ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum]
            gi|557098297|gb|ESQ38733.1| hypothetical protein
            EUTSA_v10028627mg [Eutrema salsugineum]
          Length = 474

 Score =  522 bits (1345), Expect = e-145
 Identities = 269/406 (66%), Positives = 324/406 (79%), Gaps = 5/406 (1%)
 Frame = +1

Query: 259  KTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKADDQ 438
            K ++ CA+CE++N ASIC  CVNYRL EY+T LKSL  RRDALYS+LSE+L AKGKADDQ
Sbjct: 3    KRSSNCAICENTNRASICSVCVNYRLIEYSTLLKSLKTRRDALYSKLSELLEAKGKADDQ 62

Query: 439  KSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNRVEQ 618
            K+  + QN                  QGKAKIE+ SRDLK+KY +L+SARS L + RVEQ
Sbjct: 63   KNWKLIQNEKLSGLKNNLRRNKEQVTQGKAKIERESRDLKLKYGVLDSARSTLERIRVEQ 122

Query: 619  LEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQYDTI 798
            +EK+FPNLICTQ+LG+MAI+SERL+KQSV++KQ+CKLFP RR++ +G+ ++G +GQY+ I
Sbjct: 123  VEKYFPNLICTQSLGHMAISSERLHKQSVVMKQVCKLFPQRRVSFDGESQNGSVGQYNLI 182

Query: 799  CNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQRDSY 978
            CN+RLP+GLDPHS+P+EELA SLG MVQLLNLVVHN+ APALHNSGFAGSCSRIWQRDSY
Sbjct: 183  CNSRLPKGLDPHSIPSEELAASLGLMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 979  WDARPSSRS-EYPLFIPRPTFCNIG-EASWSDKSSSNFGVASMESVRKP---HLEXXXXX 1143
            WDARPS+RS EYPLFIPR  +C+   E SW+DK+SSNFGVASMES RK            
Sbjct: 243  WDARPSTRSNEYPLFIPRQNYCSTSVENSWTDKNSSNFGVASMESDRKEARLDSTGRNSF 302

Query: 1144 XXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLATLS 1323
                     +E+HR+LQKGI+LLKKSVAC+TAYCYNSLCLEVP EASTFEAFA+LLATLS
Sbjct: 303  NYSSASPHSVESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLS 362

Query: 1324 SSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAH 1461
            SSKE+RSVFSLKM SSRS KQ QQLNK++ N  S +SSS ++ES+H
Sbjct: 363  SSKEVRSVFSLKMASSRSCKQAQQLNKSIWNAHSVISSS-ILESSH 407


>ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana]
            gi|23297675|gb|AAN13006.1| unknown protein [Arabidopsis
            thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA
            polymerase II protein [Arabidopsis thaliana]
          Length = 473

 Score =  516 bits (1329), Expect = e-143
 Identities = 269/409 (65%), Positives = 323/409 (78%), Gaps = 5/409 (1%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            MT++++ C A+C+++N   IC  CVN+RL EYNT LKSL  RRD+L SR +E+L +KGKA
Sbjct: 1    MTKRSSNC-AICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKA 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQK+  + QN                  QGK KIE+ S DLKVKY +L+SARS L K R
Sbjct: 60   DDQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VEQ+EK+FPNLICTQ+LG+MAI+SERL+KQSV+VKQ+CKLFPLRR++ +G+ ++G + QY
Sbjct: 120  VEQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICN+RLP GLDPHS+P+EELAVSLGYMVQLLNLVVHN+ APALH+SGFAGSCSRIWQR
Sbjct: 180  DVICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNIG-EASWSDKSSSNFGVASMESVRK-PHLE--XX 1134
            DSYWD R S+RS EYPLFIPR  +C+   E SW+DK+SSNFGVASMES RK P L+    
Sbjct: 240  DSYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGS 299

Query: 1135 XXXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLA 1314
                        IE+HR+LQKGI+LLKKSVAC+TAYCYNSLCLEVP EASTFEAFA+LLA
Sbjct: 300  NSFKYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLA 359

Query: 1315 TLSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAH 1461
            TLSSSKE+RSVFSLKM SSRS KQ QQLNK++ N  S +SSS L+ESAH
Sbjct: 360  TLSSSKEVRSVFSLKMASSRSGKQAQQLNKSIWNAHSVISSS-LLESAH 407


>gb|AAL59980.1| unknown protein [Arabidopsis thaliana]
          Length = 473

 Score =  516 bits (1329), Expect = e-143
 Identities = 269/409 (65%), Positives = 323/409 (78%), Gaps = 5/409 (1%)
 Frame = +1

Query: 250  MTRKTNTCCALCESSNLASICVPCVNYRLNEYNTNLKSLNNRRDALYSRLSEVLVAKGKA 429
            MT++++ C A+C+++N   IC  CVN+RL EYNT LKSL  RRD+L SR +E+L +KGKA
Sbjct: 1    MTKRSSNC-AICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKA 59

Query: 430  DDQKSLMVFQNXXXXXXXXXXXXXXXXXIQGKAKIEKMSRDLKVKYELLESARSVLGKNR 609
            DDQK+  + QN                  QGK KIE+ S DLKVKY +L+SARS L K R
Sbjct: 60   DDQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTR 119

Query: 610  VEQLEKFFPNLICTQNLGYMAITSERLYKQSVIVKQLCKLFPLRRLNVEGDKKDGFIGQY 789
            VEQ+EK+FPNLICTQ+LG+MAI+SERL+KQSV+VKQ+CKLFPLRR++ +G+ ++G + QY
Sbjct: 120  VEQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQY 179

Query: 790  DTICNARLPRGLDPHSVPTEELAVSLGYMVQLLNLVVHNVCAPALHNSGFAGSCSRIWQR 969
            D ICN+RLP GLDPHS+P+EELAVSLGYMVQLLNLVVHN+ APALH+SGFAGSCSRIWQR
Sbjct: 180  DVICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQR 239

Query: 970  DSYWDARPSSRS-EYPLFIPRPTFCNIG-EASWSDKSSSNFGVASMESVRK-PHLE--XX 1134
            DSYWD R S+RS EYPLFIPR  +C+   E SW+DK+SSNFGVASMES RK P L+    
Sbjct: 240  DSYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGS 299

Query: 1135 XXXXXXXXXXXXIETHRELQKGISLLKKSVACITAYCYNSLCLEVPAEASTFEAFARLLA 1314
                        IE+HR+LQKGI+LLKKSVAC+TAYCYNSLCLEVP EASTFEAFA+LLA
Sbjct: 300  NSFMYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLA 359

Query: 1315 TLSSSKEMRSVFSLKMVSSRSSKQVQQLNKTLCNVDSAVSSSNLIESAH 1461
            TLSSSKE+RSVFSLKM SSRS KQ QQLNK++ N  S +SSS L+ESAH
Sbjct: 360  TLSSSKEVRSVFSLKMASSRSGKQAQQLNKSIWNAHSVISSS-LLESAH 407

