BLASTX nr result

ID: Panax24_contig00027653 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Panax24_contig00027653
         (639 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KZN10149.1 hypothetical protein DCAR_002805 [Daucus carota subsp...   207   4e-58
XP_017224679.1 PREDICTED: uncharacterized protein LOC108200914 [...   207   4e-58
XP_017234830.1 PREDICTED: uncharacterized protein LOC108208804 [...   175   7e-47
KZN04973.1 hypothetical protein DCAR_005810 [Daucus carota subsp...   175   7e-47
CAN74679.1 hypothetical protein VITISV_006858 [Vitis vinifera]        154   9e-40
XP_010652052.1 PREDICTED: uncharacterized protein LOC100254466 [...   154   1e-39
OMP06649.1 Zinc finger, CW-type [Corchorus olitorius]                 127   2e-30
OMO63756.1 Zinc finger, CW-type [Corchorus capsularis]                125   2e-29
XP_002321024.2 hypothetical protein POPTR_0014s12740g [Populus t...   121   4e-28
EOX94983.1 CW-type Zinc Finger, putative isoform 1 [Theobroma ca...   117   1e-26
XP_011033585.1 PREDICTED: uncharacterized protein LOC105132026 [...   116   2e-26
KHG10306.1 MORC family CW-type zinc finger protein 4 [Gossypium ...   116   2e-26
XP_017630345.1 PREDICTED: uncharacterized protein LOC108473352 i...   116   2e-26
XP_016694781.1 PREDICTED: uncharacterized protein LOC107911477 [...   116   2e-26
XP_017982041.1 PREDICTED: uncharacterized protein LOC18613498 is...   114   1e-25
XP_017982033.1 PREDICTED: uncharacterized protein LOC18613498 is...   114   1e-25
KHG00169.1 MORC family CW-type zinc finger protein 4 [Gossypium ...   113   2e-25
KHG00168.1 MORC family CW-type zinc finger protein 4 [Gossypium ...   113   2e-25
XP_017613876.1 PREDICTED: uncharacterized protein LOC108459015 [...   113   2e-25
OAY61780.1 hypothetical protein MANES_01G215600 [Manihot esculenta]   113   2e-25

>KZN10149.1 hypothetical protein DCAR_002805 [Daucus carota subsp. sativus]
          Length = 1672

 Score =  207 bits (527), Expect = 4e-58
 Identities = 127/226 (56%), Positives = 150/226 (66%), Gaps = 19/226 (8%)
 Frame = +2

Query: 2    SGLVGXXXXXXXXXXXXPFENYPGEGDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDE 181
            SGLVG            P    PGEGDTKS KIR KRE +Q+FSKASKK+K ++    +E
Sbjct: 796  SGLVGQKQRHKRKDKSKPHAT-PGEGDTKSLKIRNKRENNQEFSKASKKLKASSD-HIEE 853

Query: 182  DRKSD----------------SINETGKNRHKYDDLPKDSKRE--VFVRNPEDQTQFTSD 307
            + KSD                SI +TGK+R KYDD PK+SKR+  V VRN ED+TQF SD
Sbjct: 854  EWKSDNGGAALKVGHSSSSGLSIKKTGKHRQKYDDHPKESKRDLKVSVRNSEDRTQFPSD 913

Query: 308  AGLLHAEKYIDRDMKKRK-VNEYQDSQLYTTSLASGGHRLEEHRDYMEGTSESSHRKEKK 484
              LLH E YID D+KKRK +NEY D Q YTTS  + GHR E HRD+ME TSES+HR+EKK
Sbjct: 914  ERLLHTE-YIDGDVKKRKKINEYHDIQPYTTSHITEGHRPENHRDFMEETSESNHREEKK 972

Query: 485  ARVSKSGGKESSITKGSGGIDKKGRSVKNQQSVADLGNNLSLRRLD 622
            ARVSKSGGKE S++KGS G+DKK RS KNQQ+   L N L  R +D
Sbjct: 973  ARVSKSGGKERSMSKGS-GVDKKSRSSKNQQTEVALENGLFDRSMD 1017


>XP_017224679.1 PREDICTED: uncharacterized protein LOC108200914 [Daucus carota subsp.
            sativus] XP_017224686.1 PREDICTED: uncharacterized
            protein LOC108200914 [Daucus carota subsp. sativus]
          Length = 1688

 Score =  207 bits (527), Expect = 4e-58
 Identities = 127/226 (56%), Positives = 150/226 (66%), Gaps = 19/226 (8%)
 Frame = +2

Query: 2    SGLVGXXXXXXXXXXXXPFENYPGEGDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDE 181
            SGLVG            P    PGEGDTKS KIR KRE +Q+FSKASKK+K ++    +E
Sbjct: 812  SGLVGQKQRHKRKDKSKPHAT-PGEGDTKSLKIRNKRENNQEFSKASKKLKASSD-HIEE 869

Query: 182  DRKSD----------------SINETGKNRHKYDDLPKDSKRE--VFVRNPEDQTQFTSD 307
            + KSD                SI +TGK+R KYDD PK+SKR+  V VRN ED+TQF SD
Sbjct: 870  EWKSDNGGAALKVGHSSSSGLSIKKTGKHRQKYDDHPKESKRDLKVSVRNSEDRTQFPSD 929

Query: 308  AGLLHAEKYIDRDMKKRK-VNEYQDSQLYTTSLASGGHRLEEHRDYMEGTSESSHRKEKK 484
              LLH E YID D+KKRK +NEY D Q YTTS  + GHR E HRD+ME TSES+HR+EKK
Sbjct: 930  ERLLHTE-YIDGDVKKRKKINEYHDIQPYTTSHITEGHRPENHRDFMEETSESNHREEKK 988

Query: 485  ARVSKSGGKESSITKGSGGIDKKGRSVKNQQSVADLGNNLSLRRLD 622
            ARVSKSGGKE S++KGS G+DKK RS KNQQ+   L N L  R +D
Sbjct: 989  ARVSKSGGKERSMSKGS-GVDKKSRSSKNQQTEVALENGLFDRSMD 1033


>XP_017234830.1 PREDICTED: uncharacterized protein LOC108208804 [Daucus carota subsp.
            sativus] XP_017234831.1 PREDICTED: uncharacterized
            protein LOC108208804 [Daucus carota subsp. sativus]
            XP_017234832.1 PREDICTED: uncharacterized protein
            LOC108208804 [Daucus carota subsp. sativus]
            XP_017234833.1 PREDICTED: uncharacterized protein
            LOC108208804 [Daucus carota subsp. sativus]
            XP_017234834.1 PREDICTED: uncharacterized protein
            LOC108208804 [Daucus carota subsp. sativus]
          Length = 1539

 Score =  175 bits (443), Expect = 7e-47
 Identities = 100/198 (50%), Positives = 130/198 (65%), Gaps = 15/198 (7%)
 Frame = +2

Query: 74   EGDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSDS---------------INE 208
            EGD KS K+R   E +++  KA KK+K   V + DED KSD+               IN+
Sbjct: 712  EGDFKSLKMRSSGEKNEETFKAPKKLKAGGV-QIDEDWKSDNGAAALKVGCSSNSFPINK 770

Query: 209  TGKNRHKYDDLPKDSKREVFVRNPEDQTQFTSDAGLLHAEKYIDRDMKKRKVNEYQDSQL 388
            + + +HK+D  PKDSK    + N ED TQF+SDA LLH E + DRD+KKRK++EYQ SQL
Sbjct: 771  SQELQHKHDGYPKDSKS---LGNSEDWTQFSSDASLLHTENFTDRDVKKRKISEYQGSQL 827

Query: 389  YTTSLASGGHRLEEHRDYMEGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKKGRSVK 568
            Y TS ++ GH LE H+DYME T+ES+HRKEKKARV +S GKES ++KGSG ID K   +K
Sbjct: 828  YATSRSNEGHHLEHHKDYMEETNESNHRKEKKARVPRSEGKESFMSKGSGRIDNKEICLK 887

Query: 569  NQQSVADLGNNLSLRRLD 622
            +QQ+ AD  +    R LD
Sbjct: 888  DQQAGADPEDGHFRRSLD 905


>KZN04973.1 hypothetical protein DCAR_005810 [Daucus carota subsp. sativus]
          Length = 1563

 Score =  175 bits (443), Expect = 7e-47
 Identities = 100/198 (50%), Positives = 130/198 (65%), Gaps = 15/198 (7%)
 Frame = +2

Query: 74   EGDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSDS---------------INE 208
            EGD KS K+R   E +++  KA KK+K   V + DED KSD+               IN+
Sbjct: 736  EGDFKSLKMRSSGEKNEETFKAPKKLKAGGV-QIDEDWKSDNGAAALKVGCSSNSFPINK 794

Query: 209  TGKNRHKYDDLPKDSKREVFVRNPEDQTQFTSDAGLLHAEKYIDRDMKKRKVNEYQDSQL 388
            + + +HK+D  PKDSK    + N ED TQF+SDA LLH E + DRD+KKRK++EYQ SQL
Sbjct: 795  SQELQHKHDGYPKDSKS---LGNSEDWTQFSSDASLLHTENFTDRDVKKRKISEYQGSQL 851

Query: 389  YTTSLASGGHRLEEHRDYMEGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKKGRSVK 568
            Y TS ++ GH LE H+DYME T+ES+HRKEKKARV +S GKES ++KGSG ID K   +K
Sbjct: 852  YATSRSNEGHHLEHHKDYMEETNESNHRKEKKARVPRSEGKESFMSKGSGRIDNKEICLK 911

Query: 569  NQQSVADLGNNLSLRRLD 622
            +QQ+ AD  +    R LD
Sbjct: 912  DQQAGADPEDGHFRRSLD 929


>CAN74679.1 hypothetical protein VITISV_006858 [Vitis vinifera]
          Length = 1671

 Score =  154 bits (390), Expect = 9e-40
 Identities = 99/222 (44%), Positives = 129/222 (58%), Gaps = 27/222 (12%)
 Frame = +2

Query: 53   PFENYPGEGDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSDS----------- 199
            P E Y   GDTK+SK++ K  TDQD  +ASKK+K   +  TDED  SD            
Sbjct: 830  PLECYSDGGDTKNSKMKNKSGTDQDCVRASKKIKIEGMHSTDEDWTSDHGGTNGKVHLSS 889

Query: 200  -----INETGKNRHKYDDLP--KDSKRE------VFVRNPEDQTQFTSDAGLLHAEKYID 340
                 +N    N  K+ +    KD+K E      V VR P++Q + +SD G L+  KY  
Sbjct: 890  SNGLPVNVVSNNHFKHSERTSSKDTKYEAKDNIQVTVRKPKEQVRVSSDDGSLNVGKYDS 949

Query: 341  RDM--KKRKVNEYQDSQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGK 511
            RD+  KKRKV E QD+++Y++SL S GH LE+   ++ E  SES HRKEKKARVSKS GK
Sbjct: 950  RDIVAKKRKVKECQDTEIYSSSLPSTGHHLEDSGAFVKEEFSESDHRKEKKARVSKSEGK 1009

Query: 512  ESSITKGSGGIDKKGRSVKNQQSVADLGNNLSLRRLDAIDSV 637
            E   +K SG  DKK  S++ QQ   DLG+ LS R LD +DS+
Sbjct: 1010 EFIASKSSGRTDKKVSSMRTQQQGQDLGSVLSQRSLDGVDSL 1051


>XP_010652052.1 PREDICTED: uncharacterized protein LOC100254466 [Vitis vinifera]
          Length = 1742

 Score =  154 bits (389), Expect = 1e-39
 Identities = 100/222 (45%), Positives = 129/222 (58%), Gaps = 27/222 (12%)
 Frame = +2

Query: 53   PFENYPGEGDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSDSINETGK----- 217
            P E Y   GDTK+SK++ K  TDQD  +ASKK+K   +  TDED  SD     GK     
Sbjct: 852  PLECYSDGGDTKNSKMKNKSGTDQDCVRASKKIKIEGMHSTDEDWTSDHGGTNGKVHLSS 911

Query: 218  -----------NRHKYDDLP--KDSKRE------VFVRNPEDQTQFTSDAGLLHAEKYID 340
                       N  K+ +    KD+K E      V VR P++Q + +SD G L+  KY  
Sbjct: 912  SNGLPANVVSNNHFKHSERTSSKDTKYEAKDNIQVTVRKPKEQVRVSSDDGSLNVGKYDS 971

Query: 341  RDM--KKRKVNEYQDSQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGK 511
            RD+  KKRKV E QD+++Y++SL S GH LE+   ++ E  SES HRKEKKARVSKS GK
Sbjct: 972  RDIVAKKRKVKECQDTEIYSSSLPSTGHHLEDSGAFVKEEFSESDHRKEKKARVSKSEGK 1031

Query: 512  ESSITKGSGGIDKKGRSVKNQQSVADLGNNLSLRRLDAIDSV 637
            E   +K SG  DKK  S++ QQ   DLG+ LS R LD +DS+
Sbjct: 1032 EFIASKSSGRTDKKVSSMRTQQQGQDLGSVLSQRSLDGVDSL 1073


>OMP06649.1 Zinc finger, CW-type [Corchorus olitorius]
          Length = 1719

 Score =  127 bits (320), Expect = 2e-30
 Identities = 85/208 (40%), Positives = 122/208 (58%), Gaps = 21/208 (10%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSDSINE-------------TGK 217
            GD K+SK++ KR TDQD  +ASKK+KT  +   DED   +   +              GK
Sbjct: 828  GDGKTSKMKGKRSTDQDSLRASKKIKTENMHVADEDWAFEHTGKGGPSTSNGFPTMLVGK 887

Query: 218  NRHKYDDLP-KDSK-----REVFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNEYQ 376
            N+ K+D+   KDSK     ++   + P+D+ Q +   G L        ++ +KRKV+E  
Sbjct: 888  NQPKHDERSYKDSKLDKARQQASGKRPKDKLQVSLTDGSLDLVNCDGGEVSRKRKVDECI 947

Query: 377  DSQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKK 553
            D+QLYT S+ S G+ L++ R ++ E  SE+ +R++KKARVSKSGGK+SS +K SG  +KK
Sbjct: 948  DNQLYTGSIQSMGNHLQDSRMFVKEEFSENDYRRDKKARVSKSGGKDSSASKSSGKTEKK 1007

Query: 554  GRSVKNQQSVADLGNNLSLRRLDAIDSV 637
             R  KN QS  D G+ LS R LD  DS+
Sbjct: 1008 SRHAKNHQSGLDPGSTLSQRSLDGTDSL 1035


>OMO63756.1 Zinc finger, CW-type [Corchorus capsularis]
          Length = 1693

 Score =  125 bits (313), Expect = 2e-29
 Identities = 83/208 (39%), Positives = 121/208 (58%), Gaps = 21/208 (10%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSDSINE-------------TGK 217
            GD K+SK++ KR  DQD  +ASKK+KT ++   DED   +   +              GK
Sbjct: 828  GDGKTSKMKGKRSNDQDSLRASKKIKTESMHVADEDWAFEHTGKGGPSTSNGFPTVLVGK 887

Query: 218  NRHKYDDLP-KDSK-----REVFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNEYQ 376
            N+ K+D+   KDSK     ++   + P+D+ Q +   G L        ++ +KRKV+E  
Sbjct: 888  NQPKHDERSYKDSKLDKSRQQTSGKRPKDKLQVSLTDGSLDLVNCDGGEVSRKRKVDECI 947

Query: 377  DSQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKK 553
            D+QLY  S+ S G+ L++ R ++ E  SE+ +R++KKARVSKSGGK+SS +K SG  +KK
Sbjct: 948  DNQLYVGSIQSMGNHLQDSRMFVKEEFSENDYRRDKKARVSKSGGKDSSASKSSGKTEKK 1007

Query: 554  GRSVKNQQSVADLGNNLSLRRLDAIDSV 637
             R  KN QS  D G+ LS R LD  DS+
Sbjct: 1008 SRHAKNHQSGLDPGSTLSQRSLDGTDSL 1035


>XP_002321024.2 hypothetical protein POPTR_0014s12740g [Populus trichocarpa]
            EEE99339.2 hypothetical protein POPTR_0014s12740g
            [Populus trichocarpa]
          Length = 1643

 Score =  121 bits (303), Expect = 4e-28
 Identities = 81/211 (38%), Positives = 114/211 (54%), Gaps = 24/211 (11%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSD----------------SINE 208
            G +K SK + KR+ DQD  +ASKK++T       ED  SD                ++  
Sbjct: 818  GGSKRSKGKGKRDPDQDCFRASKKIRTEGF---PEDWTSDHGGAIEKVGPPSSNGLAMAS 874

Query: 209  TGKNRHKYDDLPKDSKR-------EVFVRNPEDQTQFTSDAGLLHAEKYIDRDMKKRKVN 367
            +GKN  KY+D    + +       ++  +NP++  + + D G +      DRD KKRKV 
Sbjct: 875  SGKNPPKYNDCTSKNMKHDQKDWAQLSSKNPKEDVRASLDNGSVDMANCDDRDTKKRKVK 934

Query: 368  EYQDSQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGI 544
            E  D+QLY  SL + GH L++      E  SE+ +RK KK RVS+S GKE+S +K +G  
Sbjct: 935  ESHDAQLYRDSLPNTGHHLQDSNIMAKEEFSENDYRKVKKPRVSRSEGKEASGSKSNGRT 994

Query: 545  DKKGRSVKNQQSVADLGNNLSLRRLDAIDSV 637
            DKKG   KNQQ   DLG+ LS R LD +DS+
Sbjct: 995  DKKGSHRKNQQLRHDLGSTLSQRSLDGVDSL 1025


>EOX94983.1 CW-type Zinc Finger, putative isoform 1 [Theobroma cacao] EOX94984.1
            CW-type Zinc Finger, putative isoform 1 [Theobroma cacao]
            EOX94985.1 CW-type Zinc Finger, putative isoform 1
            [Theobroma cacao] EOX94986.1 CW-type Zinc Finger,
            putative isoform 1 [Theobroma cacao]
          Length = 1680

 Score =  117 bits (292), Expect = 1e-26
 Identities = 80/210 (38%), Positives = 115/210 (54%), Gaps = 23/210 (10%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRK---------------------S 193
            GD K+SK++ KR TDQD  +ASKK+KT ++   DED                        
Sbjct: 811  GDDKTSKMKGKRVTDQDSLRASKKIKTESLHLADEDWVFEHAVKGGPSTSNGLPTTLVGK 870

Query: 194  DSINETGKNRHKYDDLPKDSKREVFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNE 370
            D    + ++ H+   L KD +++ +V+  +D+ Q +   G L        ++ +KRKV+E
Sbjct: 871  DQPKHSERSSHRDSKLDKD-RQQAYVKRLKDKVQVSLTDGSLDMANCDGGEISRKRKVDE 929

Query: 371  YQDSQLYTTSLASGGHRLEEHR-DYMEGTSESSHRKEKKARVSKSGGKESSITKGSGGID 547
              D QL T SL S G+ L++ R    E  SE+ +R+EKKARVSKSGGK+SS +K SG ++
Sbjct: 930  CIDCQLNTGSLQSMGNNLQDSRVSVKEEFSENDYRREKKARVSKSGGKDSSASKSSGKLE 989

Query: 548  KKGRSVKNQQSVADLGNNLSLRRLDAIDSV 637
            KK R  KN +S  D    LS R LD  DS+
Sbjct: 990  KKSRHTKNHRSGQDPDITLSQRSLDGTDSL 1019


>XP_011033585.1 PREDICTED: uncharacterized protein LOC105132026 [Populus euphratica]
          Length = 1660

 Score =  116 bits (291), Expect = 2e-26
 Identities = 78/211 (36%), Positives = 112/211 (53%), Gaps = 24/211 (11%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSD----------------SINE 208
            G +K SK + KR+ DQD  +ASKK++        ED  SD                ++  
Sbjct: 835  GGSKRSKGKGKRDPDQDCFRASKKIRAEGF---PEDWMSDHGGAIEKVGPPSSNGLAMAS 891

Query: 209  TGKNRHKYDDLPKDSKR-------EVFVRNPEDQTQFTSDAGLLHAEKYIDRDMKKRKVN 367
            +GKN  KY+D    + +       ++  +NP++  + + D G +      DRD KKRKV 
Sbjct: 892  SGKNPPKYNDCTSKNMKHDLKDWAQLSAKNPKEDVRASLDNGFVDIGNCDDRDTKKRKVK 951

Query: 368  EYQDSQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGI 544
            E  D+QLY  SL + GH  ++      E  SE+ +RK KK RVS+S GKE+S +K +G  
Sbjct: 952  ESHDAQLYQDSLPNTGHHHQDSNIMAKEEFSETDYRKVKKPRVSRSEGKEASGSKSNGRT 1011

Query: 545  DKKGRSVKNQQSVADLGNNLSLRRLDAIDSV 637
            DKKG   KNQQ   DLG+ +S R LD +DS+
Sbjct: 1012 DKKGSHRKNQQLRHDLGSTVSQRSLDGVDSL 1042


>KHG10306.1 MORC family CW-type zinc finger protein 4 [Gossypium arboreum]
          Length = 1637

 Score =  116 bits (290), Expect = 2e-26
 Identities = 77/205 (37%), Positives = 121/205 (59%), Gaps = 18/205 (8%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDED--------RKSDSINET--GKNRH 226
            GD ++S ++ KR T+QD  +ASKK+K  +    DED          S+ +  T  GK++ 
Sbjct: 804  GDARTSNMKGKRTTEQDSLRASKKIKVESSRLADEDWMFEHAGKSTSNGLPNTSVGKDQP 863

Query: 227  K------YDDLPKDSKREVFVRNPEDQTQFT-SDAGLLHAEKYIDRDMKKRKVNEYQDSQ 385
            K      Y D     +++V  + P+++     +D  L  A        +KR+V++  +SQ
Sbjct: 864  KNSEGSSYKDSSDKDRQQVSGKRPKNKVGVPLTDGSLDLANCDGGAVSRKREVDDCINSQ 923

Query: 386  LYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKKGRS 562
            L+T S  S G+ L+E+R ++ E   E+ +R+EKKAR SKSGGK+SS +K SG ++KKGR 
Sbjct: 924  LFTDSFQSMGNYLQENRVFVKEEFCENDYRREKKARASKSGGKDSSASKSSGTLEKKGRH 983

Query: 563  VKNQQSVADLGNNLSLRRLDAIDSV 637
             KN+QS  DLG ++S +RLD +DS+
Sbjct: 984  TKNRQSGQDLGRSMSQQRLDGMDSL 1008


>XP_017630345.1 PREDICTED: uncharacterized protein LOC108473352 isoform X1 [Gossypium
            arboreum] XP_017630346.1 PREDICTED: uncharacterized
            protein LOC108473352 isoform X1 [Gossypium arboreum]
            XP_017630347.1 PREDICTED: uncharacterized protein
            LOC108473352 isoform X1 [Gossypium arboreum]
            XP_017630348.1 PREDICTED: uncharacterized protein
            LOC108473352 isoform X1 [Gossypium arboreum]
            XP_017630349.1 PREDICTED: uncharacterized protein
            LOC108473352 isoform X1 [Gossypium arboreum]
          Length = 1657

 Score =  116 bits (290), Expect = 2e-26
 Identities = 77/205 (37%), Positives = 121/205 (59%), Gaps = 18/205 (8%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDED--------RKSDSINET--GKNRH 226
            GD ++S ++ KR T+QD  +ASKK+K  +    DED          S+ +  T  GK++ 
Sbjct: 804  GDARTSNMKGKRTTEQDSLRASKKIKVESSRLADEDWMFEHAGKSTSNGLPNTSVGKDQP 863

Query: 227  K------YDDLPKDSKREVFVRNPEDQTQFT-SDAGLLHAEKYIDRDMKKRKVNEYQDSQ 385
            K      Y D     +++V  + P+++     +D  L  A        +KR+V++  +SQ
Sbjct: 864  KNSEGSSYKDSSDKDRQQVSGKRPKNKVGVPLTDGSLDLANCDGGAVSRKREVDDCINSQ 923

Query: 386  LYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKKGRS 562
            L+T S  S G+ L+E+R ++ E   E+ +R+EKKAR SKSGGK+SS +K SG ++KKGR 
Sbjct: 924  LFTDSFQSMGNYLQENRVFVKEEFCENDYRREKKARASKSGGKDSSASKSSGTLEKKGRH 983

Query: 563  VKNQQSVADLGNNLSLRRLDAIDSV 637
             KN+QS  DLG ++S +RLD +DS+
Sbjct: 984  TKNRQSGQDLGRSMSQQRLDGMDSL 1008


>XP_016694781.1 PREDICTED: uncharacterized protein LOC107911477 [Gossypium hirsutum]
            XP_016694782.1 PREDICTED: uncharacterized protein
            LOC107911477 [Gossypium hirsutum] XP_016694783.1
            PREDICTED: uncharacterized protein LOC107911477
            [Gossypium hirsutum]
          Length = 1663

 Score =  116 bits (290), Expect = 2e-26
 Identities = 80/206 (38%), Positives = 121/206 (58%), Gaps = 19/206 (9%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDED--------RKSDSINET--GKNRH 226
            GD ++S ++ KR T+QD  +ASKK+K  +    DED         +S+ +  T  GK++ 
Sbjct: 808  GDARTSNMKGKRTTEQDSLRASKKIKVESSRLADEDWMFEHAGKSRSNGLPNTSVGKDQP 867

Query: 227  K------YDDLPKDSKRE-VFVRNPEDQTQFT-SDAGLLHAEKYIDRDMKKRKVNEYQDS 382
            K      Y D   D  R+ V  + P+ +     +D  L  A        +KR+V++  +S
Sbjct: 868  KNSEGSSYKDSKSDKDRQQVSGKRPKTKVGVPLTDGSLDLANCDGGAVSRKREVDDCINS 927

Query: 383  QLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKKGR 559
            QLYT S  S G+ L+E+R ++ E   E+ +R+EKKAR SKSGGK+SS +K  G ++KKGR
Sbjct: 928  QLYTDSFQSMGNYLQENRVFVKEEFCENDYRREKKARASKSGGKDSSASKSCGTLEKKGR 987

Query: 560  SVKNQQSVADLGNNLSLRRLDAIDSV 637
              KN+QS  DLG++LS +RLD +DS+
Sbjct: 988  HTKNRQSGQDLGSSLSQQRLDGMDSL 1013


>XP_017982041.1 PREDICTED: uncharacterized protein LOC18613498 isoform X2 [Theobroma
            cacao]
          Length = 1680

 Score =  114 bits (285), Expect = 1e-25
 Identities = 79/210 (37%), Positives = 114/210 (54%), Gaps = 23/210 (10%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRK---------------------S 193
            GD K+SK++ KR TDQD  +ASKK+KT ++   DED                        
Sbjct: 811  GDDKTSKMKGKRVTDQDSLRASKKIKTESLHLADEDWVFEHAVKGGPSTSNGLPTTLVGK 870

Query: 194  DSINETGKNRHKYDDLPKDSKREVFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNE 370
            D    + ++ H+   L KD +++ + +  +D+ Q +   G L        ++ +KRKV+E
Sbjct: 871  DQPKHSERSSHRDSKLDKD-RQQAYGKRLKDKVQVSLTDGSLDMANCDGGEISRKRKVDE 929

Query: 371  YQDSQLYTTSLASGGHRLEEHR-DYMEGTSESSHRKEKKARVSKSGGKESSITKGSGGID 547
              D QL T SL S G+ L++ R    E  SE+ +R+EKKARVSKSGGK+SS +K SG ++
Sbjct: 930  CIDCQLNTGSLQSMGNNLQDSRVSVKEEFSENDYRREKKARVSKSGGKDSSASKSSGKLE 989

Query: 548  KKGRSVKNQQSVADLGNNLSLRRLDAIDSV 637
            KK R  KN +S  D    LS R LD  DS+
Sbjct: 990  KKSRHTKNHRSGQDPDITLSQRSLDGTDSL 1019


>XP_017982033.1 PREDICTED: uncharacterized protein LOC18613498 isoform X1 [Theobroma
            cacao]
          Length = 1701

 Score =  114 bits (285), Expect = 1e-25
 Identities = 79/210 (37%), Positives = 114/210 (54%), Gaps = 23/210 (10%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRK---------------------S 193
            GD K+SK++ KR TDQD  +ASKK+KT ++   DED                        
Sbjct: 832  GDDKTSKMKGKRVTDQDSLRASKKIKTESLHLADEDWVFEHAVKGGPSTSNGLPTTLVGK 891

Query: 194  DSINETGKNRHKYDDLPKDSKREVFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNE 370
            D    + ++ H+   L KD +++ + +  +D+ Q +   G L        ++ +KRKV+E
Sbjct: 892  DQPKHSERSSHRDSKLDKD-RQQAYGKRLKDKVQVSLTDGSLDMANCDGGEISRKRKVDE 950

Query: 371  YQDSQLYTTSLASGGHRLEEHR-DYMEGTSESSHRKEKKARVSKSGGKESSITKGSGGID 547
              D QL T SL S G+ L++ R    E  SE+ +R+EKKARVSKSGGK+SS +K SG ++
Sbjct: 951  CIDCQLNTGSLQSMGNNLQDSRVSVKEEFSENDYRREKKARVSKSGGKDSSASKSSGKLE 1010

Query: 548  KKGRSVKNQQSVADLGNNLSLRRLDAIDSV 637
            KK R  KN +S  D    LS R LD  DS+
Sbjct: 1011 KKSRHTKNHRSGQDPDITLSQRSLDGTDSL 1040


>KHG00169.1 MORC family CW-type zinc finger protein 4 [Gossypium arboreum]
          Length = 1643

 Score =  113 bits (283), Expect = 2e-25
 Identities = 79/207 (38%), Positives = 117/207 (56%), Gaps = 20/207 (9%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSD-------SINE-----TGKN 220
            GD K+SK++ KR TDQD  ++SKK+K +++   DED   +       S N       GK+
Sbjct: 791  GDAKTSKMKSKRTTDQDSLRSSKKIKGDSLHLADEDCMFEHGGMGGPSTNNGLPTTLGKD 850

Query: 221  RHK-----YDDLPKDSKRE-VFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNEYQD 379
            + K     Y+ L  D +R+ +  + P+D+   +   G L        ++ +KRKV+E  D
Sbjct: 851  QPKHSECSYNVLKSDKERQQISGKRPKDKVHPSLTDGSLDLVNCNGGEVSRKRKVDECID 910

Query: 380  SQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKKG 556
             QLYT  L   G+  ++ R +  E  SE+ +R+EKKARVSKSGGK+SS  K SG ++KK 
Sbjct: 911  GQLYTGFLQGFGNHFQDSRVFTKEDVSENEYRREKKARVSKSGGKDSSAGKSSGKLEKKS 970

Query: 557  RSVKNQQSVADLGNNLSLRRLDAIDSV 637
            R  K  Q+  DLG++L  R LD  DS+
Sbjct: 971  RHTKGHQTGQDLGSSLPQRSLDVPDSL 997


>KHG00168.1 MORC family CW-type zinc finger protein 4 [Gossypium arboreum]
          Length = 1654

 Score =  113 bits (283), Expect = 2e-25
 Identities = 79/207 (38%), Positives = 117/207 (56%), Gaps = 20/207 (9%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSD-------SINE-----TGKN 220
            GD K+SK++ KR TDQD  ++SKK+K +++   DED   +       S N       GK+
Sbjct: 802  GDAKTSKMKSKRTTDQDSLRSSKKIKGDSLHLADEDCMFEHGGMGGPSTNNGLPTTLGKD 861

Query: 221  RHK-----YDDLPKDSKRE-VFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNEYQD 379
            + K     Y+ L  D +R+ +  + P+D+   +   G L        ++ +KRKV+E  D
Sbjct: 862  QPKHSECSYNVLKSDKERQQISGKRPKDKVHPSLTDGSLDLVNCNGGEVSRKRKVDECID 921

Query: 380  SQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKKG 556
             QLYT  L   G+  ++ R +  E  SE+ +R+EKKARVSKSGGK+SS  K SG ++KK 
Sbjct: 922  GQLYTGFLQGFGNHFQDSRVFTKEDVSENEYRREKKARVSKSGGKDSSAGKSSGKLEKKS 981

Query: 557  RSVKNQQSVADLGNNLSLRRLDAIDSV 637
            R  K  Q+  DLG++L  R LD  DS+
Sbjct: 982  RHTKGHQTGQDLGSSLPQRSLDVPDSL 1008


>XP_017613876.1 PREDICTED: uncharacterized protein LOC108459015 [Gossypium arboreum]
          Length = 1660

 Score =  113 bits (283), Expect = 2e-25
 Identities = 79/207 (38%), Positives = 117/207 (56%), Gaps = 20/207 (9%)
 Frame = +2

Query: 77   GDTKSSKIRKKRETDQDFSKASKKVKTNAVLRTDEDRKSD-------SINE-----TGKN 220
            GD K+SK++ KR TDQD  ++SKK+K +++   DED   +       S N       GK+
Sbjct: 808  GDAKTSKMKSKRTTDQDSLRSSKKIKGDSLHLADEDCMFEHGGMGGPSTNNGLPTTLGKD 867

Query: 221  RHK-----YDDLPKDSKRE-VFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNEYQD 379
            + K     Y+ L  D +R+ +  + P+D+   +   G L        ++ +KRKV+E  D
Sbjct: 868  QPKHSECSYNVLKSDKERQQISGKRPKDKVHPSLTDGSLDLVNCNGGEVSRKRKVDECID 927

Query: 380  SQLYTTSLASGGHRLEEHRDYM-EGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKKG 556
             QLYT  L   G+  ++ R +  E  SE+ +R+EKKARVSKSGGK+SS  K SG ++KK 
Sbjct: 928  GQLYTGFLQGFGNHFQDSRVFTKEDVSENEYRREKKARVSKSGGKDSSAGKSSGKLEKKS 987

Query: 557  RSVKNQQSVADLGNNLSLRRLDAIDSV 637
            R  K  Q+  DLG++L  R LD  DS+
Sbjct: 988  RHTKGHQTGQDLGSSLPQRSLDVPDSL 1014


>OAY61780.1 hypothetical protein MANES_01G215600 [Manihot esculenta]
          Length = 1661

 Score =  113 bits (283), Expect = 2e-25
 Identities = 73/208 (35%), Positives = 114/208 (54%), Gaps = 15/208 (7%)
 Frame = +2

Query: 59   ENYPGEGDTKSSKIRKKRETDQDFSKASKKVKTNAVLR---TDEDRKSDSINETGKNR-- 223
            +N    GDT  SK++ KR+ +QD  +ASKK+KT  + +   +D     + +  +  NR  
Sbjct: 834  DNCSDGGDTTQSKMKGKRDLEQDILRASKKMKTEGLPQDWMSDHHVTIEKVGPSSSNRLP 893

Query: 224  --HKYDDLPKDSKR-------EVFVRNPEDQTQFTSDAGLLHAEKYIDRDM-KKRKVNEY 373
                  ++PK + R       +V  R P+D+   + D   +   K +DR++ KKRKV   
Sbjct: 894  SMPSGKNMPKTNSRTSSMDQIQVSARKPKDEIPISMDDVTMDMGKQVDREVGKKRKVKGS 953

Query: 374  QDSQLYTTSLASGGHRLEEHRDYMEGTSESSHRKEKKARVSKSGGKESSITKGSGGIDKK 553
             D Q    +L++ GH L+   ++    SE+  RKEKKAR+S+S GKESS +KG+   DKK
Sbjct: 954  CDGQANQGTLSNTGHNLQAKEEF----SENEFRKEKKARISRSDGKESSASKGNSKSDKK 1009

Query: 554  GRSVKNQQSVADLGNNLSLRRLDAIDSV 637
                KN+Q   D+G+ +S R LD +DS+
Sbjct: 1010 SSHRKNRQPGKDVGSTVSQRSLDGVDSL 1037

