EP4010004A2 - Compositions and methods for modulation of gene expression - Google Patents

Compositions and methods for modulation of gene expression

Info

Publication number
EP4010004A2
EP4010004A2 EP20851042.0A EP20851042A EP4010004A2 EP 4010004 A2 EP4010004 A2 EP 4010004A2 EP 20851042 A EP20851042 A EP 20851042A EP 4010004 A2 EP4010004 A2 EP 4010004A2
Authority
EP
European Patent Office
Prior art keywords
dbd
gene
nucleic acid
terminus
seq
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP20851042.0A
Other languages
German (de)
French (fr)
Other versions
EP4010004A4 (en
Inventor
Fyodor Urnov
John A. Stamatoyannopoulos
Joycelynn PEARL
Matthew WILKEN
Christie CIARLO
Shon GREEN
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Altius Institute for Biomedical Sciences
Original Assignee
I Altius Institute For Biomedical Sciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by I Altius Institute For Biomedical Sciences filed Critical I Altius Institute For Biomedical Sciences
Publication of EP4010004A2 publication Critical patent/EP4010004A2/en
Publication of EP4010004A4 publication Critical patent/EP4010004A4/en
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705Receptors; Cell surface antigens; Cell surface determinants
    • C07K14/70503Immunoglobulin superfamily
    • C07K14/7051T-cell receptor (TcR)-CD3 complex
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705Receptors; Cell surface antigens; Cell surface determinants
    • C07K14/70503Immunoglobulin superfamily
    • C07K14/70521CD28, CD152
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705Receptors; Cell surface antigens; Cell surface determinants
    • C07K14/70503Immunoglobulin superfamily
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705Receptors; Cell surface antigens; Cell surface determinants
    • C07K14/70596Molecules with a "CD"-designation not provided for elsewhere
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/03Fusion polypeptide containing a localisation/targetting motif containing a transmembrane segment
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/70Fusion polypeptide containing domain for protein-protein interaction
    • C07K2319/71Fusion polypeptide containing domain for protein-protein interaction containing domain for transcriptional activaation, e.g. VP16
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/80Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/80Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
    • C07K2319/81Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor containing a Zn-finger domain for DNA binding
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/001Vector systems having a special element relevant for transcription controllable enhancer/promoter combination
    • C12N2830/005Vector systems having a special element relevant for transcription controllable enhancer/promoter combination repressible enhancer/promoter combination, e.g. KRAB
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses

Definitions

  • Modulating gene expression has been a strategy for enhancing the success of cancer and infectious disease therapies.
  • cell therapies such as CAR T cell therapies
  • immune checkpoint inhibitors there exists a need for agents that can modulate the expression of target genes, such as, immune checkpoint inhibitors.
  • the present disclosure provides engineered polypeptides comprising DNA binding domains and repressor domains for repressing a target gene.
  • the present disclosure provides polypeptides, compositions thereof, and methods for suppressing expression of a target gene such as PDCD1, CTLA4, LAG3, or TIM-3.
  • the polypeptides disclosed herein include a DNA binding domain (DBD) that binds to a sequence of the target gene and a transcriptional repressor domain that suppresses expression of the target gene.
  • DBD DNA binding domain
  • the transcriptional repressor domain may be a known transcriptional repressor or may be a novel transcriptional repressor disclosed herein.
  • sequences of novel transcriptional repressor domains that are conjugated to a heterologous DNA binding domain. As shown herein, these novel transcriptional repressor domains mediate suppression of expression of a target gene bound by the heterologous DNA binding domain.
  • FIGS. 1A-1C illustrate the locations in the PDCD1 gene to which the DBDs of the indicated recombinant polypeptides were designed to bind. Recombinant polypeptides that repressed expression of PDCD1 in at least 50% of cells treated with the recombinant polypeptides are indicated by clear arrows .
  • Recombinant polypeptides that repressed expression of PDCD1 in less than 50% of the cells treated with the recombinant polypeptides are indicated by solid arrows .
  • the orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand.
  • Arrows having the orientation are designed to bind to the sense strand.
  • FIG. 2 shows the fold change in number of PD-1 expressing cells 2 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 3 shows effect of dose of mRNA encoding the recombinant polypeptide, pAL040 and pAL043, on the percent of CD3+ T cells expressing PD-1 3 days after transfection.
  • FIG. 4 shows the fold change in number of PD-1 -positive cells at the indicated number of days post-transfection of mRNA encoding the indicated recombinant polypeptide relative to control.
  • FIGS. 5 A and 5B show that PD-1 repression with pAL043 in anti-CD 19 CAR-T cells is sustained after in vivo expansion and clearance of CD 19-positive NALM-6 B-ALL tumor model in NOD SCID Gamma (NSG) mice.
  • FIG. 6 illustrates the locations in the TIM3 gene at which the DBDs of the indicated recombinant polypeptides bind.
  • Recombinant polypeptides that repressed expression of TIM3 in at least 50% of the cells are indicated by unfilled arrows .
  • Recombinant polypeptides that repressed expression of TIM3 in less than 50% of the cells are indicated by filled arrows
  • FIG. 7 shows the fold change in number of cells expressing TIM3 at 2 days, 5 days, 8 days, or 14 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 8 shows the fold change in number of cells expressing TIM3 at 3 days or 6 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 9 illustrates the locations in the CTLA4 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of CTLA4 in at least
  • FIG. 10 shows the fold change in number of cells expressing CTLA4 at 3 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 11 illustrates the locations in the LAG3 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of LAG3 in at least 50% of the cells are indicated by unfdled arrows . Recombinant polypeptides that repressed expression of LAG3 in less than 50% of the cells are indicated by fdled arrows
  • FIG. 12 shows the fold change in number of cells expressing LAG3 at 2 days, 7 days, or 12 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 13 shows the fold change in number of cells expressing LAG3 at 2 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIGS. 14A and 14B show multiplexing of recombinant polypeptides to simultaneously suppress expression of PD-1, LAG3, and TIM3 is a single cell.
  • FIGS. 15A-15C illustrates specificity of the recombinant polypeptides as indicated by lack of significant off-target effect as measured by RNA-seq.
  • FIG. 16 shows characterization of repression of TIM3 expression by the listed candidate transcriptional repressors.
  • FIG. 17 shows characterization of repression of LAG3, TIM3, or PD-1 expression by the listed candidate transcriptional repressors.
  • FIG. 18 shows characterization of repression of TIM3 expression by the listed candidate transcriptional repressors.
  • FIG. 19 shows a schematic of an anti-CD 19 CAR-T cell in which expression of PD1, TIM3, and LAG3 has been repressed using the engineered polypeptides (pAL043+TL8188+TL8222) described herein.
  • FIG. 20 shows flow cytometry data confirming repression of PD1, TIM3, and LAG3 expression in the multiplex-treated CAR-T cells.
  • FIG. 21 provides an overview of in vivo leukemia xenograft model and treatment using indicated CAR-T cells.
  • FIG. 22 demonstrates that multiplexed repression of immune checkpoint genes is sustained in vivo.
  • FIG. 23 demonstrates that multiplexed repression of immune checkpoint genes enhances CAR-Ts ability to resist tumor re-challenge.
  • FIG. 24 shows expansion of CAR-Ts in the mouse blood.
  • FIG. 25 TALE-KRAB split system.
  • FIG. 26 Large-scale analysis of functional domains enabled by split encoding of DNA targeting and functional activities.
  • FIG. 27 Repression of TIM3 expression using TALE-KRAB split system.
  • FIGS. 28 and 29 Control of gene expression using CIPHR logic gates.
  • the present disclosure provides recombinant polypeptides, compositions and methods for suppressing target gene expression for therapeutic purposes.
  • engineered polypeptides comprising a DNA-binding domain (DBD) and a transcription repressor.
  • the DBD mediates binding of the disclosed polypeptides to a sequence in the target gene.
  • the target gene may be PDCD1, LAG3, TIM3, or CTLA4.
  • regions in these target genes have been identified that can be targeted for repression of expression of these gene when these regions are bound by the polypeptides disclosed herein. These regions may be located in the target gene within an expression control region, such as, a coding region, a non-coding region, such as, a regulatory region (e.g., promoter region) or an intron.
  • an expression control region such as, a coding region, a non-coding region, such as, a regulatory region (e.g., promoter region) or an intron.
  • novel transcriptional repressors that are conjugated to a heterologous DNA binding domain and mediate suppression of expression of a target gene bound by the DNA binding domain.
  • split systems for modulating gene expression where the DBD and the functional domain are provided as separated polypeptides and are assembled using dimerization of a heterodimer pair, where the DBD and the functional domain are each fused to a member of the heterodimer pair.
  • polypeptide derived in the context of a polypeptide refers to a polypeptide that has a sequence that is based on that of a protein from a particular source (e.g., Xanthomonas or Legionella).
  • a polypeptide derived from a protein from a particular source may be a variant of the protein from the particular source.
  • a polypeptide derived from a protein from a particular source may have a sequence that is modified with respect to the protein's sequence from which it is derived.
  • a polypeptide derived from a protein from a particular source shares at least 30% sequence identity with, at least 40% sequence identity with, at least 50% sequence identity with, at least 60% sequence identity with, at least 70% sequence identity with, at least 80% sequence identity with, or at least 90% sequence identity with the protein from which it is derived.
  • the term “modular” as used herein in the context of a DNA binding domain e.g., a modular animal pathogen derived nucleic acid binding domain (MAP-NBD) indicates that the plurality of repeat units present in the DBD can be rearranged and/or replaced with other repeat units and can be arranged in an order such that the DBD binds to the target nucleic acid.
  • MAP-NBD modular animal pathogen derived nucleic acid binding domain
  • any repeat unit in a modular nucleic acid binding domain can be switched with a different repeat unit.
  • modularity of the DNA binding domains disclosed herein allows for switching the target nucleic acid base for a particular repeat unit by simply switching it out for another repeat unit.
  • modularity of the DNA binding domains disclosed herein allows for swapping out a particular repeat unit for another repeat unit to increase the affinity of the repeat unit for a particular target nucleic acid.
  • the modular nature of the DNA binding domains disclosed herein enables the development of genome editing complexes that can precisely target any nucleic acid sequence of interest.
  • polypeptide refers to a polymeric form of amino acids of any length, which can include genetically coded and non-genetically coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified polypeptide backbones.
  • the terms include fusion proteins, including, but not limited to, fusion proteins with a heterologous amino acid sequence, fusion proteins with heterologous and homologous leader sequences, with or without N-terminus methionine residues; immunologically tagged proteins; and the like.
  • the terms refer to a polymeric form of amino acids of any length which include genetically coded amino acids.
  • the terms refer to a polymeric form of amino acids of any length which include genetically coded amino acids fused to a heterologous amino acid sequence.
  • heterologous refers to two components that are defined by structures derived from different sources.
  • a “heterologous” polypeptide may include operably linked amino acid sequences that are derived from different polypeptides (e.g., a DBD and a functional domain, e.g., a transcriptional repressor, derived from different sources).
  • a “heterologous” polynucleotide may include operably linked nucleic acid sequences that can be derived from different genes.
  • heterologous nucleic acids include expression constructs in which a nucleic acid comprising a coding sequence is operably linked to a regulatory element (e.g., a promoter) that is from a genetic origin different from that of the coding sequence (e.g., to provide for expression in a host cell of interest, which may be of different genetic origin than the promoter, the coding sequence or both).
  • a regulatory element e.g., a promoter
  • heterologous can refer to the presence of a nucleic acid (or gene product, such as a polypeptide) that is of a different genetic origin than the host cell in which it is present.
  • operably linked refers to linkage between molecules to provide a desired function.
  • operably linked in the context of nucleic acids refers to a functional linkage between nucleic acid sequences.
  • a nucleic acid expression control sequence such as a promoter, signal sequence, or array of transcription factor binding sites
  • the expression control sequence affects transcription and/or translation of the second polynucleotide.
  • operably linked refers to a functional linkage between amino acid sequences (e.g., different domains) to provide for a described activity of the polypeptide.
  • a “target nucleic acid,” “target sequence,” or “target site” is a nucleic acid sequence that defines a portion of a nucleic acid to which a binding molecule, such as, the DBD disclosed herein will bind.
  • the target nucleic acid may be present in an isolated form or inside a cell.
  • a target nucleic acid may be present in a region of interest.
  • a “region of interest” may be any region of cellular chromatin, such as, for example, a gene or a non-coding sequence within or adjacent to a gene, in which it is desirable to bind an exogenous molecule.
  • a region of interest can be present in a chromosome, an episome, an organellar genome (e.g., mitochondrial, chloroplast), or an infecting viral genome, for example.
  • a region of interest can be within the coding region of a gene, within transcribed non-coding regions such as, for example, promoter sequences, leader sequences, trailer sequences or introns, or within non-transcribed regions, either upstream or downstream of the coding region.
  • a region of interest can be as small as a five nucleotide pair or up to 200 nucleotide pairs in length, or any integral value of nucleotide pairs.
  • exogenous molecule is a molecule that is not normally present in a cell but can be introduced into a cell by one or more genetic, biochemical or other methods.
  • An exogenous nucleic acid can be present in an infecting viral genome, a plasmid or episome introduced into a cell.
  • Methods for the introduction of exogenous molecules into cells include, but are not limited to, lipid-mediated transfer (i.e., liposomes, including neutral and cationic lipids), electroporation, direct injection, cell fusion, particle bombardment, calcium phosphate co-precipitation, DEAE-dextran-mediated transfer and viral vector-mediated transfer.
  • an “endogenous” molecule is one that is normally present in a particular cell at a particular developmental stage under particular environmental conditions.
  • an endogenous nucleic acid can comprise a chromosome, the genome of a mitochondrion, chloroplast or other organelle, or a naturally-occurring episomal nucleic acid.
  • Additional endogenous molecules can include proteins, for example, transcription factors and enzymes.
  • a “gene,” for the purposes of the present disclosure, includes a DNA region encoding a gene product, as well as all DNA regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control region.
  • Gene expression refers to the conversion of the information, contained in a gene, into a gene product.
  • a gene product can be the direct transcriptional product of a gene (e.g., mRNA, tRNA, rRNA, antisense RNA, ribozyme, structural RNA, shRNA, RNAi, miRNA or any other type of RNA) or a protein produced by translation of a mRNA.
  • Gene products also include RNAs which are modified, by processes such as capping, polyadenylation, methylation, and editing, and proteins modified by, for example, methylation, acetylation, phosphorylation, ubiquitination, ADP-ribosylation, myristylation, and glycosylation.
  • conjugating refers to an association of two entities, for example, of two molecules such as two proteins, two domains (e.g., a binding domain and a transcription repressor domain), or a protein and an agent, e.g., a protein binding domain and a small molecule.
  • the association can be, for example, via a direct or indirect (e.g., via a linker) covalent linkage or via non-covalent interactions. In some embodiments, the association is covalent. In some embodiments, two molecules are conjugated via a linker connecting both molecules.
  • the two proteins may be conjugated via a polypeptide linker, e.g., an amino acid sequence connecting the C-terminus of one protein to the N- terminus of the other protein.
  • a polypeptide linker e.g., an amino acid sequence connecting the C-terminus of one protein to the N- terminus of the other protein.
  • conjugated proteins may be expressed as a fusion protein.
  • an effective amount refers to an amount of a biologically active agent that is sufficient to elicit a desired biological response.
  • an effective amount of a polypeptide comprising a transcriptional repressor may refer to the amount of the polypeptide that is sufficient to induce repression of expression from a gene specifically bound by the polypeptide.
  • an agent e.g., a recombinant polynucleotide
  • the effective amount of an agent e.g., a recombinant polynucleotide, may vary depending on various factors as, for example, on the desired biological response, the specific allele, genome, target site, cell, or tissue being targeted, and the agent being used.
  • strand refers to a nucleic acid made up of nucleotides covalently linked together by covalent bonds, e.g., phosphodiester bonds.
  • DNA usually exists in a double-stranded form, and as such, has two complementary strands of nucleic acid referred to herein as the “top” and “bottom” strands or the “Watson” and “Crick” strands.
  • Watson strand refers to 5' to 3' top strand (5'
  • Crick strand refers to 3' to 5' bottom strand (3' 5').
  • the assignment of a strand as being a top or bottom strand is arbitrary and does not imply any particular orientation, function or structure.
  • complementary strands of a chromosomal DNA may be interchangeably referred to as “top” and “bottom” strands, “plus” and “minus” strands, the “first” and “second” strands, the “coding” and “noncoding” strands, the “Watson” and “Crick” strands, or the “sense” and “antisense” strands.
  • on-target repression refers repression of expression of a gene containing the genomic sequence that is the target of the recombinant polypeptide comprising the DBD and the transcription repressor.
  • the DBD determines the specificity of the polypeptide for the binding the target site.
  • An on-target repression site refers to a nucleic acid sequence that includes the DNA sequence specifically bound by the DBD of the recombinant polypeptide.
  • off-target repression refers to repression of expression of a gene containing the genomic sequence that is not the target of the recombinant polypeptide comprising the DBD and the transcription repressor but is repressed due to non-specific binding of the DBD of the recombinant polypeptide.
  • domain refers to a part of a protein sequence that may exist and function independently of the rest of the protein chain.
  • these recombinant polypeptides function as transcriptional repressors by virtue of the DBD that mediates binding to a target gene and a repressor domain that suppresses target gene expression upon binding of the polypeptide to the target gene.
  • the recombinant polypeptides disclosed herein may also be referred to as transcriptional repressors.
  • sequences provided herein may be specified to be at least 30%, 40%, 50%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, or a 100% identical to another sequence provided herein.
  • Percent identity between a pair of sequences may be calculated by multiplying the number of matches in the pair by 100 and dividing by the length of the aligned region, including gaps. Identity scoring only counts perfect matches and does not consider the degree of similarity of amino acids to one another. Only internal gaps are included in the length, not gaps at the sequence ends.
  • Conservative amino acid substitutions refers to substitution of amino acid residues within the following groups: 1) L, I, M, V, F; 2) R, K; 3) F, Y, H, W, R; 4) G, A, T, S; 5) Q, N; and 6) D, E.
  • Conservative amino acid substitutions may preserve the activity of the protein by replacing an amino acid(s) in the protein with an amino acid with a side chain of similar acidity, basicity, charge, polarity, or size of the side chain.
  • Guidance for substitutions, insertions, or deletions may be based on alignments of amino acid sequences of proteins from different species or from a consensus sequence based on a plurality of proteins having the same or similar function.
  • the terms “patient” or “subject” are used interchangeably to refer to a human or a non-human animal (e.g., a mammal).
  • the terms “treat”, “treating”, treatment” and the like refer to a course of action (such as administering a polypeptide comprising a DBD fused to a heterologous transcription repressor domain or a nucleic acid encoding the polypeptide) initiated after a disease, disorder or condition, or a symptom thereof, has been diagnosed, observed, and the like so as to eliminate, reduce, suppress, mitigate, or ameliorate, either temporarily or permanently, at least one of the underlying causes of a disease, disorder, or condition afflicting a subject, or at least one of the symptoms associated with a disease, disorder, condition afflicting a subject.
  • prevent refers to a course of action (such as administering a polypeptide comprising a DBD fused to a heterologous functional domain or a nucleic acid encoding the polypeptide) initiated in a manner (e.g., prior to the onset of a disease, disorder, condition or symptom thereof) so as to prevent, suppress, inhibit or reduce, either temporarily or permanently, a subject's risk of developing a disease, disorder, condition or the like (as determined by, for example, the absence of clinical symptoms) or delaying the onset thereof, generally in the context of a subject predisposed to having a particular disease, disorder or condition. In certain instances, the terms also refer to slowing the progression of the disease, disorder or condition or inhibiting progression thereof to a harmful or otherwise undesired state.
  • the phrase “therapeutically effective amount” refers to the administration of an agent to a subject, either alone or as a part of a pharmaceutical composition or as a companion therapy and either in a single dose or as part of a series of doses, in an amount that is capable of having any detectable, positive effect on any symptom, aspect, or characteristics of a disease, disorder or condition when administered to a patient.
  • the therapeutically effective amount can be ascertained by measuring relevant physiological effects.
  • the recombinant polypeptides include a DBD that mediates binding to a sequence in a target gene and a heterologous transcriptional repressor.
  • the DBD includes a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the target gene, where binding of the recombinant polypeptide to the nucleic acid sequence results in decreased expression of the target gene.
  • a recombinant polypeptide disclosed herein may include from N- to C- terminus: a N-cap region, a DBD comprising a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the target gene, a C-cap region, an optional linker, and a transcription repressor domain.
  • the transcriptional repressor domain may be at the N- terminus of the recombinant polypeptide instead of the C-terminus.
  • the RUs may have the sequence (X 1-11 X 12 X 13 X 14-33, 34, or 35 ) z (SEQ ID NO: 453), where X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*
  • Any suitable RU such as those based upon the RUs from Xanthomonas transcription activator- like effector (TALE) systems, Ralstonia solanacearum (modular Ralstonia nucleic acid binding domain; RNBD), or an animal pathogen (e.g., Legionella quateirensis, Legionella maceachernii, Burkholderia, Paraburkholderia, or Francisella) (modular animal pathogen nucleic acid binding domain; MAP-NBD) may be used for binding to the regions of the target genes provided herein.
  • TALE Xanthomonas transcription activator- like effector
  • RNBD modular Ralstonia solanacearum
  • an animal pathogen e.g., Legionella quateirensis, Legionella maceachernii, Burkholderia, Paraburkholderia, or Francisella
  • MAP-NBD modular animal pathogen nucleic acid binding domain
  • the arrangement of the RUs in the DBD may be based upon the sequence identified in the target gene to which binding of the recombinant polypeptide results in decreased expression of the target gene. These sequences identified in PDCD-1 gene, TIM3 gene, CTLA4 gene, and LAG3 gene, and the corresponding DBDs are described in detail below.
  • PDCD-1 programmed cell death 1 gene
  • PD-1 gene is also known as PD-1 gene and encodes a cell surface membrane protein of the immunoglobulin superfamily, which is also referred to as PDCD-1 or PD-1.
  • PD- 1 binds to the ligands PD-L1 and PD-L2.
  • the PD-l/PD-1 ligands pathway plays a role in immunosuppression.
  • Recent studies have shown that PD-L1 and PD-L2 are widely expressed on various cancer cells (Keir ME, et al. Annu Rev Immunol. 2008; 26 (677-704)).
  • Expression of PD-ligands prevents cancer cells from being killed by T cells and promotes cancer progression. Targeting the PD-1 pathway has been recognized as an effective immunotherapy for different cancers (Ostrand-Rosenberg S, et al. J Immunol. 2014; 193(8): 3835- 41).
  • TIM3 T-Cell Immunoglobulin Mucin Receptor 3 gene is also referred to as Hepatitis A Virus Cellular Receptor 2 (HAVCR2) and encodes a cell surface membrane protein of the immunoglobulin superfamily of the same name.
  • HAVCR2 Hepatitis A Virus Cellular Receptor 2
  • CTLA4 gene (Cytotoxic T-Lymphocyte Associated Protein 4) encodes an immunoglobulin superfamily protein of the same name which transmits an inhibitory signal to T cells.
  • LAG3 (Lymphocyte-Activation Gene 3) gene encodes the Lymphocyte Activating 3 protein which is also known as LAG3 protein.
  • CTLA-4, PD-1, LAG-3, and TIM3 are known immune checkpoint proteins. The pathways involving LAG3 and TIM3 are recognized in the art to constitute immune checkpoint pathways similar to the CTLA-4 and PD-1 dependent pathways (see e.g. Pardoll, 2012. Nature Rev Cancer 12:252-264; Mellman et al. 2011. Nature 480:480-489).
  • a DBD may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene.
  • the plurality of RUs may be a number of repeat units sufficient to bind to a target sequence, which number may range from 7 to 40. In certain aspects, the number of RUs may range from 9 to 35. In certain aspects, the number of RUs may range from 12 to 30, 14 to 25, or 16 to 25.
  • the recombinant polypeptides disclosed herein all reduce the expression of the target gene in at least 50% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the target gene.
  • the recombinant polypeptides disclosed herein all reduce the expression of the target gene in at least 80% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the target gene.
  • recombinant polypeptides that bind to sequences in the PDCD-1 gene that have been identified to be present in regions of the gene that when bound by the recombinant polypeptides comprising a transcriptional repressor domain lead to suppression of PD-1 expression from the PDCD-1 gene.
  • FIG. 1A The sequences in the PDCD-1 gene that were tested to determine repression by a transcriptional repressor domain bound to the sequence are pictorially depicted in Figure 1A.
  • the analysis of repression by the disclosed recombinant polypeptides that are designed to bind to these sequences identified certain regions that provide repression of PDCD-1 expression in at least 50% of the cells expressing these recombinant polypeptides. These regions are depicted in Figures 1B-1C and include regions 1-4. In regions 1, 2, 3, the anti-sense strand of the PDCD-1 gene was successfully targeted to significantly repress expression of PD-1. In region 4, the sense strand was identified as the region of the PDCD-1 gene that can be successfully target for repression. In addition, certain sequences in the sense strand in region 1 were also identified a region that can be successfully targeted for repression. [0076] Region 1:
  • Table 1 illustrates the identification of region 1 which includes sequences that can be targeted for repression.
  • region 1 which includes sequences that can be targeted for repression.
  • the indicated recombinant polypeptides that included RUs arranged from N-terminus to C-terminus to bind to the listed target sequence, repressed expression of PD- 1 by at least 80% as compared to a negative control.
  • the location of these target sequences when aligned reveals a region (Region 1) in minus strand of the PDCD-1 gene that may be targeted for repressing PDCD-1 expression.
  • the alignment of the target sequences also reveals the minimal sequences within Region 1 that can be targeted for binding by the DBD for repressing PDCD-1 expression.
  • a recombinant polypeptide that suppresses expression of PD 1 receptor encoded by the PDCD 1 gene may include a DNA binding domain (DBD) and a transcriptional repressor domain.
  • the DBD may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: TGGTGGGGCTGCTCCAGGCATGCAGATCCCACAGGCGCCCTGG (SEQ ID NO: 1). As explained in the Examples section of the application, this sequence corresponds to Region 1 in the PDCD1 gene.
  • the RUs may include the sequence (X 1-11 X 12 X 13 X 14-33, 34, or 35 ) z (SEQ ID NO: 453), where X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*,
  • the RUs are ordered from N-terminus to the C-terminus to bind to the sequence: GGGGCTGCTCC (SEQ ID NO:2), wherein the first RU at the N-terminus binds to the G at the 5’ end of the sequence and the last RU at the C-terminus binds to the C at the 3’ end of the sequence.
  • the X 12 X 13 in the RUs from N-terminus to C-terminus may be NH, NH, NH, HD, NG, NH, HD, NG, HD, and HD.
  • the DBD may include at least an additional RU at the N-terminus such that the DBD binds to the nucleic acid sequence TGGGGCTGCTCC (SEQ ID NO:3), wherein X 12 X 13 in the additional RU is NG, HG, KG, or RG for recognition of the T.
  • the RUs are ordered from N-terminus to the C-terminus to bind to the sequence: GGTGGGGCTGCTCC (SEQ ID NO:4), wherein the first RU at the N-terminus binds to the G at the 5’ end of the sequence and the last RU at the C-terminus binds to the C at the 3’ end of the sequence.
  • the DBD comprises at least fourteen RUs, wherein X 12 X 13 in the RUs from N-terminus to C-terminus are NH, NH, NG, NH, NH, NH, HD, NG, NH, HD, NG, HD, and HD.
  • the DBD comprises three additional RU at the N-terminus such that the DBD binds to the nucleic acid sequence TGGTGGGGCTGCTCC (SEQ ID NO:5). In certain aspects, the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence GGTGGGGCTGCTCCAGG (SEQ ID NO:6).
  • the RUs are arranged from N-terminus to C-terminus to bind to the sequence: GC AGAT CCC AC AGGCGC (SEQ ID NO:7).
  • the RUs are arranged from N-terminus to C-terminus to bind to the sequence: CCCACAGGCGCCCTGG (SEQ ID NO: 8).
  • the RUs are arranged from N-terminus to C-terminus to bind to the sequence: GGGGCTGCTCCAGGCATGC (SEQ ID NO: 9).
  • the RUs may be arranged from N-terminus to C-terminus to bind to a sequence that is a complement of a sequence in region 1.
  • the complementary sequence may be the sequence: GGAGCAGCCCC (SEQ ID NO: 105).
  • the DBD that binds to the complementary sequence may include RUs ordered from N-terminus to C-terminus to bind to the sequence: GGAGCAGCCCCACCAGAGT (SEQ ID NO: 106).
  • Table 2 illustrates the identification of region 2 which includes sequences that can be targeted for repression.
  • region 2 which includes sequences that can be targeted for repression.
  • the indicated recombinant polypeptides that included RUs arranged from N-terminus to C-terminus to bind to the listed target sequence, repressed expression of PD- 1 by at least 80% as compared to a negative control.
  • the location of these target sequences when aligned reveals a region (Region 2) in the minus strand of the PDCD-1 gene that may be targeted for repressing PDCD-1 expression.
  • the alignment of the target sequences also reveals the minimal sequence that can be targeted for binding by the DBD for repressing PDCD-1 expression.
  • the DBD includes a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
  • the DBD includes a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: CCTCCCCCAGCACTGCCTCTGTCACTCTCGCCCACGT (SEQ ID NO: 454).
  • the nucleic acid sequence is present within the sequence: CCTCCCCCAGCACTGCCTCTGTCACTCTCGCCCACGT (SEQ ID NO: 454).
  • all of the eight DBD-repressor domains that bound to a nucleic acid sequence within this sequence repressed expression of PD-1 in at least 50% of the cells treated with the DBD-repressor domain as compared to mock treated cells.
  • the DBD includes a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GCCTCTGTCACTCTCGCCCAC (SEQ ID NO: 444).
  • GCCTCTGTCACTCTCGCCCAC SEQ ID NO: 444
  • the RUs are ordered from N-terminus to C-terminus of the DBD to bind to the nucleic acid sequence TCTGTCACTCTCG (SEQ ID NO: 11).
  • the DBD comprises at least thirteen RUs, wherein X 12 X 13 in the RUs from N-terminus to C-terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, and NH.
  • the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence GCCTCTGTCACTCTCG (SEQ ID NO: 12).
  • the DBD further comprises three additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence GCCTCTGTCACTCTCGCCC (SEQ ID NO: 13).
  • the DBD comprises at least nineteen RUs, wherein X 12 X 13 in the RUs from N-terminus to C-terminus are NH, HD, HD, NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, NH, HD, HD, and HD.
  • the DBD further comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence TCTGTCACTCTCGCCCAC (SEQ ID NO: 14).
  • the DBD comprises at least eighteen RUs, wherein X 12 X 13 in the RUs from N-terminus to C- terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, NG, NH, HD, HD, HD, NI, and HD.
  • the DBD comprises thirteen RUs ordered from N-terminus to C-terminus of the DBD to bind to the nucleic acid sequence: CCCCCAGCACTGC (SEQ ID NO: 15). In certain aspects, the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCTCCCCCAGCACTGC (SEQ ID NO: 16). In certain aspects, the DBD further comprises an additional RU at the C-terminus such that the DBD binds to the nucleic acid sequence: CCTCCCCCAGCACTGCC (SEQ ID NO: 17).
  • Table 3 illustrates the identification of region 3 which includes sequences that can be targeted for repression.
  • the indicated recombinant polypeptides that included RUs arranged from N-terminus to C-terminus to bind to the listed target sequence, repressed expression of PD- 1 by at least 80% as compared to a negative control.
  • the location of these target sequences when aligned reveals a region (Region 3) in the minus strand in the PDCD-1 gene that may be targeted for repressing PDCD-1 expression.
  • the alignment of the target sequences also reveals the minimal sequence that can be targeted for binding by the DBD for repressing PDCD-1 expression.
  • the DBD includes at least nine RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
  • the DBD comprises ten RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: TCCGCTCACC (SEQ ID NO:20). In certain aspects, the DBD comprises nine additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: TCCGCTCACCTCCGCCTGA (SEQ ID NO:21). In certain aspects, the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCCGCTCACC (SEQ ID NO:22).
  • the DBD comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCCGCTCACCTCCGC (SEQ ID NO:23). In certain aspects, the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: TTCCCTTCCGCTCACC (SEQ ID NO:24). [0101] In certain aspects, the DBD comprises twelve RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: GGGACAGTTTCC (SEQ ID NO:25).
  • the DBD further comprises four additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: GGGACAGTTTCCCTTC (SEQ ID NO:26). In certain aspects, the DBD further comprises five additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: GACCTGGGACAGTTTCC (SEQ ID NO: 27).
  • the DBD comprises eleven RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: CAACCTGACCT (SEQ ID NO:28). In certain aspects, the DBD comprises nine additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CAACCTGACCTGGGACAGTT (SEQ ID NO:29) In certain aspects, the DBD comprises five additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCAACCTGACCT (SEQ ID NO:30).
  • Table 4 illustrates the identification of region 4 which includes sequences that can be targeted for repression.
  • region 4 includes sequences that can be targeted for repression.
  • the indicated recombinant polypeptides that included RUs arranged from N-terminus to C-terminus to bind to the listed target sequence, repressed expression of PD- 1 by at least 80% as compared to a negative control.
  • the location of these target sequences when aligned reveals a region (Region 4) in the plus strand of the PDCD-1 gene that may be targeted for repressing PDCD-1 expression.
  • the alignment of the target sequences also reveals the minimal sequence that can be targeted for binding by the DBD for repressing PDCD-1 expression.
  • the DBD includes at least nine RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GCCGCCTTCTCCACTGCTCAGGCGGAGGT (SEQ ID NO:31). [0106] As explained in the Examples section of the application, this sequence corresponds to Region 4. [0107] In certain aspects, the DBD comprises RUs arranged from N-terminus to C-terminus such that the DBD binds to the nucleic acid sequence: GCCGCCTTCTCCACT (SEQ ID NO:32).
  • the DBD comprises RUs arranged from N-terminus to C-terminus such that the DBD binds to the nucleic acid sequence: CCACTGCTCAGGCG (SEQ ID NO:33). In certain aspects, the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: TCTCCACTGCTCAGGCG (SEQ ID NO:34). In certain aspects, the DBD further comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CCACTGCTCAGGCGGAGGT (SEQ ID NO:35).
  • the present disclosure provides additional recombinant polypeptides for repressing PDCD1 expression.
  • the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: CCCAGGTCAGGTTGAAG (SEQ ID NO:63).
  • the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: GGCCAGGGCGCCTGT (SEQ ID NO:36). In certain aspects, the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: CTGCATGCCTGGAGCAG (SEQ ID NO:37).
  • the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: GCTCCCGCCCCCTCTTCCT (SEQ ID NO:38). In certain aspects, the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: CTTCCTCCACATCCACG (SEQ ID NO:39).
  • the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: CCTCCACATCCACGTGGGC (SEQ ID NO:40).
  • the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to a sequence present in a target sequence listed in Table 9 and shown to have a PD-1 suppression of at least 50%. As noted herein the RUs may range from 7 to 40 in number.
  • the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to the target sequence listed in Table 9 and shown to have a PD-1 suppression of at least 50%.
  • the recombinant polypeptides disclosed herein all reduce the expression of PDCD1 gene in at least 50% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the target gene.
  • the recombinant polypeptides disclosed herein all reduce the expression of the PDCD1 gene in at least 80% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the PDCD1 gene.
  • a recombinant polypeptide of the present disclosure may include a DBD and a transcriptional repressor domain, the DBD comprising a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within one of the following sequences:
  • TTCCCTTCCGCTCACC (SEQ ID NO:24);
  • GCCTCTGTCACTCTCGCCC (SEQ ID NO: 13);
  • GGTGGGGCTGCTCCAGG SEQ ID NO:6
  • TCTCCACTGCTCAGGCG SEQ ID NO:34
  • TCCGCTCACCTCCGCCTGA SEQ ID NO:21
  • GCCTCTGTCACTCTCG (SEQ ID NO: 12);
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA
  • the DBD comprises at least fourteen RUs, at least sixteen, or at least seventeen RUs and optionally, up to 25 RUs.
  • DBD binds to the nucleic acid sequence selected from SEQ ID NOs.: 5, 27, 24, 32, 13, 63, 6, 34, 21, 23, 26, 12, 7, 16, 17, 14, and 29.
  • recombinant polypeptides that bind to sequences in the TIM3 gene that have been identified to be present in regions of the gene that when bound by the recombinant polypeptides comprising a transcriptional repressor domain lead to suppression of TIM3 expression from the TIM3 gene.
  • a recombinant polypeptide that suppresses expression of TIM3 encoded by the TIM3 gene may include a DNA binding domain (DBD) and a transcriptional repressor.
  • the DBD may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within the sequence: GGCAGTGTTACTATAAGAATCACTGGCAATCAGACACCCGGGTG (SEQ ID NO: 41) or a complement thereof.
  • the DBD comprises RUs that bind to the nucleic acid sequence TGTTACTATA (SEQ ID NO:42). In certain aspects, the DBD comprises an additional RU at the C- terminus such that the DBD binds to the nucleic acid sequence TGTTACTATA (SEQ ID NO:43). In certain aspects, the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence CAGTGTTACTATAA (SEQ ID NO:44). the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence GGCAGTGTTACTATAA (SEQ ID NO:45).
  • the DBD comprises RUs that bind to the nucleic acid sequence TCAGACACCCGGGTG (SEQ ID NO:46). In certain aspects, the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence CAATCAGACACCCGGGTG (SEQ ID NO:47). In certain aspects, the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence TGGCAATCAGACACCCGGGTG (SEQ ID NO:48).
  • a recombinant polypeptide that represses TIM3 expression may bind to a sequence that is a complement of
  • the DBD comprises RUs that are ordered to bind to the sequence TGCCAGTGATT (SEQ ID NO:50). In certain aspects, the DBD comprises eight additional RUs at the C-terminus such that the DBD binds to the sequence TGCCAGTGATTCTTATAGT (SEQ ID NO:51). In certain aspects, the DBD comprises RUs that are ordered to binds to the sequence TGATTGCCAGTGATT (SEQ ID NO:52). In certain aspects, the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence TGTCTGATTGCCAGTGATT (SEQ ID NO:53).
  • the present disclosure provides additional recombinant polypeptides for repressing TIM3 expression.
  • the DBD of such a recombinant polypeptide may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of TIM3 gene, wherein the nucleic acid sequence is: TACACACAT (SEQ ID NO:54).
  • the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence ACACTACACACAT (SEQ ID NO:55).
  • the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence TGCCACACTACACACAT (SEQ ID NO:56).
  • the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to a sequence present in a target sequence listed in Table 10 and shown to have a TIM3 suppression of at least 50%. As noted herein the RUs may range from 7 to 40 in number. In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to the target sequence listed in Table 10 and shown to have a TIM3 suppression of at least 50%.
  • the recombinant polypeptides disclosed herein all reduce the expression of TIM3 gene in at least 50% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the TIM3.
  • the recombinant polypeptides disclosed herein all reduce the expression of the TIM3 gene in at least 80% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the TIM3 gene.
  • a recombinant polypeptide of the present disclosure may include a DBD and a transcriptional repressor domain, the DBD comprising a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within one of the following sequences:
  • GGCAGTGTTACTATAA SEQ ID NO:45
  • TGCCAGTGATTCTTATAGT SEQ ID NO:51
  • TGCCACACTACACACAT SEQ ID NO:56
  • TGTCTGATTGCCAGTGATT SEQ ID NO:53
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA
  • the DBD comprises at least fourteen RUs, at least sixteen, or at least seventeen RUs and optionally, up to 25 RUs.
  • DBD binds to the nucleic acid sequence selected from SEQ ID NOs:45, 51, 48, 56, and 53.
  • recombinant polypeptides that bind to sequences in the LAG3 gene that have been identified to be present in regions of the gene that when bound by the recombinant polypeptides comprising a transcriptional repressor domain lead to suppression of LAG3 expression from the LAG3 gene.
  • the recombinant polypeptide that binds to this region may include a DBD in which the RUs are ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the LAG3 gene, wherein the nucleic acid sequence is present within the sequence: GCCGTTCTGCTGGTCTCTGGGCCTTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO: 57).
  • the DBD comprises RUs that bind to the sequence TCTGCTGGTCT (SEQ ID NO:58). In certain aspects, the DBD comprises five additional RUs at the N-terminus such that the DBD binds to the sequence GCCGTTCTGCTGGTCT (SEQ ID NO:59). In certain aspects, the DBD comprises two additional RUs at the C-terminus such that the DBD binds to the sequence GCCGTTCTGCTGGTCTCT (SEQ ID NO:60). In certain aspects, the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGC (SEQ ID NO:61).
  • the DBD comprises an additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGCC (SEQ ID NO:62). In certain aspects, the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGCCTTC (SEQ ID NO:63).
  • the DBD comprises RUs that bind to the sequence TCTCTGGGCCTTCA (SEQ ID NO:64). In certain aspects, the DBD comprises two additional RUs at the N-terminus such that the DBD binds the sequence GGTCTCTGGGCCTTCA (SEQ ID NO:65). In certain aspects, the DBD comprises three additional RUs at the C-terminus such that the DBD binds the sequence GGTCTCTGGGCCTTCACCC (SEQ ID NO:66). In certain aspects, the DBD comprises an additional RUs at the N-terminus such that the DBD binds the sequence TGGTCTCTGGGCCTTCACC (SEQ ID NO:67).
  • the DBD comprises RUs that bind to the sequence TTCACCCCTGTG (SEQ ID NO:68). In certain aspects, the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCG (SEQ ID NO:69). In certain aspects, the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCGGCCT (SEQ ID NO:70). In certain aspects, the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO:71).
  • the DBD of such a recombinant polypeptide may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of LAG3 gene, wherein the nucleic acid sequence is: TGCTCTGTCTGC (SEQ ID NO:72).
  • the DBD comprises two additional RUs at the C-terminus such that the DBD binds to the sequence TGCTCTGTCTGCTC (SEQ ID NO:73).
  • the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the sequence TTTGCTCTGTCTGCTC (SEQ ID NO:74).
  • the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to a sequence present in a target sequence listed in Table 12 and shown to have a LAG3 suppression of at least 50%. As noted herein the RUs may range from 7 to 40 in number. In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to the target sequence listed in Table 12 and shown to have a LAG3 suppression of at least 50%.
  • the recombinant polypeptides disclosed herein all reduce the expression of LAG3 gene in at least 50% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the LAG3.
  • the recombinant polypeptides disclosed herein reduce the expression of the LAG3 gene in at least 80% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the TIM3 gene.
  • a recombinant polypeptide of the present disclosure may include a DBD and a transcriptional repressor domain, the DBD comprising a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within one of the following sequences:
  • TTCACCCCTGTGCCCGGCCTTCC SEQ ID NO:71
  • TCTGCTGGTCTCTGGGCCTTC SEQ ID NO:450
  • TTTGCTCTGTCTGCTC SEQ ID NO:74
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA
  • the DBD comprises at least fourteen RUs, at least sixteen, or at least seventeen RUs and optionally, up to 25 RUs.
  • DBD binds to the nucleic acid sequence selected from SEQ ID NOs: 65, 448, 60, 59, 71, 449, 450, and 74.
  • recombinant polypeptides that bind to sequences in the CTLA4 gene that have been identified to be present in regions of the gene that when bound by the recombinant polypeptides comprising a transcriptional repressor domain lead to suppression of CTLA4 expression from the CTLA4 gene.
  • the DBD of the recombinant polypeptide may include at least nine RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the CTLA4 gene, wherein the nucleic acid sequence is present in the sequence: ACATATCTGGGATCAAAGCT (SEQ ID NO:75); ATATAAAGTCCTTGAT (SEQ ID NO:76); or TTCTATTCAAGTGCC (SEQ ID NO:77).
  • the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to a sequence present in a target sequence listed in Table 11 and shown to have a CTLA4 suppression of at least 50%. As noted herein the RUs may range from 7 to 40 in number. In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to the target sequence listed in Table 11 and shown to have a CTLA4 suppression of at least 50%.
  • the DBD may be extended at the N-terminus, the C-terminus, or both to increase the number of RUs that contact the nucleic acid sequence is present in the sequence of SEQ ID NOs: 75-77.
  • the DBD may include at least 10, at least 12, at least 13, at least 14, at least 16, or more and up to 20, 25, 35, or 40 RUs. Repeat Units
  • the repeat unit may have the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), where X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*,
  • Any suitable RU such as those based upon the RUs from Xanthomonas transcription activator- like effector (TALE) systems, Ralstonia solanacearum (modular Ralstonia nucleic acid binding domain; RNBD), or an animal pathogen (e.g., Legionella quateirensis, Legionella maceachernii, Burkholderia, Paraburkholderia, or Francisella) (modular animal pathogen nucleic acid binding domain; MAP-NBD) may be arranged to bind to the nucleotide sequences in the target genes as disclosed herein.
  • TALE Xanthomonas transcription activator- like effector
  • RNBD modular Ralstonia solanacearum
  • an animal pathogen e.g., Legionella quateirensis, Legionella maceachernii, Burkholderia, Paraburkholderia, or Francisella
  • MAP-NBD modular animal pathogen nucleic acid binding domain
  • the DNA binding domains of the disclosed recombinant polypeptides may be engineered to include 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or more, e.g., up to 30, 40 or 50 repeat units arranged in a N-terminal to C-terminal direction to bind to a predetermined 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 nucleotide length nucleic acid sequence, such as, a sequence disclosed herein.
  • DNA binding domains may be engineered to include 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more or more, e.g., up to 30, 40 or 50 repeat units that are specifically ordered or arranged to bind to target nucleic acid sequences of length 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more or more, e.g., up to 30, 40 or 50, respectively.
  • the RUs are contiguous.
  • half-RUs may be used in the place of one or more RUs.
  • the last RU in a DBD may be a half RU.
  • the RUs and the half-RU, if present, are derived from Xanthomonas TALE.
  • X 1-11 is at least 80%, at least 90%, or 100% identical to LTPEQVVAIAS (SEQ ID NO: 458), LTPAQVVAIAS (SEQ ID NO: 459), LTPDQVVAIAN (SEQ ID NO: 460), LTPDQVVAIAS (SEQ ID NO: 461), LTPYQVVAIAS (SEQ ID NO: 462), LTREQVVAIAS (SEQ ID NO: 463), or LSTAQVVAIAS (SEQ ID NO: 464).
  • X 14-20 or 21 or 22 is at least 80%, at least 90%, at least 95%, or 100% identical to GGKQALETV QRLLPVLCQDHG (SEQ ID NO:79),
  • the RU is at least 80%, at least 90%, at least 95%, or 100% identical to:
  • X 12 X 13 is repeat variable diresdue (RVD) and is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X
  • the DBD may include a N-cap region at N-terminus of the recombinant polypeptide which N-cap region is derived from the N-cap region of a Xanthomonas TALE protein.
  • the DBD may include a N-cap region at the N-terminus which may be present immediately adjacent the first RU.
  • the N-cap region at the N-terminus which may be linked to the first RU via a linker.
  • An N-cap region may be any length, e.g., may comprise from about 0 to about 136 amino acid residues in length.
  • An N-terminal cap may be about 5, about 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 60, about 70, about 80, about 90, about 100, about 110, about 120, or about 130 amino acid residues in length.
  • the DBD comprises a N-cap region comprising an amino acid sequence at least 80% (e.g., at least 90%, at least 95%, or 100%) identical to the amino acid sequence:
  • the N-cap region is from TALE proteins like those expressed in Burkholderia, Paraburkholderia, or Xanthomonas.
  • the N-cap regions may be derived from N-cap domain used in conjunction with DNA binding domains disclosed in US20180010152.
  • the N-cap regions may be derived from the N-terminal regions disclosed in US20150225465, e.g., SEQ ID NOs.:7, 8, or 9 disclosed therein.
  • the N-cap region may include the amino acid residues from position 1 (N) through position 137 (M) of the naturally occurring Xanthomonas TALE protein (numbered backwards with N(l) being the residue immediately adjacent the first RU:
  • This amino acid sequence includes a M added to the N-terminus which is not present in the wild type N-cap region of a Xanthomonas TALE protein.
  • This amino acid sequence is generated by deleting amino acids N+288 through N+137 of the N-terminus region of a TALE protein, adding a M, such that amino acids N+136 through N+l of the N-terminus region of the TALE protein are present.
  • the N-terminus can be truncated such that the fragment of the N-terminus includes amino acids from position 1 (N) through position 120 (K) of the naturally occurring Xanthomonas spp. -derived protein as follows:
  • the N-cap region can be truncated such that the fragment of the N-terminus includes amino acids from position 1 (N) through position 115 (S) of the naturally occurring Xanthomonas spp. -derived protein as follows:
  • the N-cap region can be truncated may include amino acids from position 1 (N) through position 110 (H) of the naturally occurri ng Xanthomonas spp. -derived protein as follows: HHEALVGHGFTHAHIVALSQHPAALGTVAVKY QDMIAALPEATHEAIVGVGKQW SGARALEAL LTVAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHAWRNALTGAPLN (SEQ ID NO: 447).
  • the DBD may include a C-cap region at C-terminus of the recombinant polypeptide which C-cap region is derived from the C-cap region of a Xanthomonas TALE protein.
  • the C-cap region at the C-terminus which may be present immediately adjacent the last RU or the last half-RU, if present.
  • the C-cap region at the C-terminus which may be linked to the last RU or the last half-RU, if present, via a linker.
  • a C-cap may be any length and may comprise from about 0 to about 278 amino acid residues in length.
  • a C-terminal cap may be about 5, about 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 60, about 80, about 100, about 150, about 200, or about 250 amino acid residues in length.
  • the DBD comprises a C-cap region comprising an amino acid sequence at least 80% (e.g., at least 90%, at least 95%, or 100%) identical to the amino acid sequence: SIVAQLSRPDPALAALTNDHLVALACLGGRPALDAVKKGLPHAPALIKRTNRRIPERTSHRVA (SEQ ID NO:452).
  • the C-cap region is from TALE proteins like those expressed in Burkholderia, Parahurkholderia, or Xanthomonas. [0194] In some aspects, the C-Cap region can be positions 1 (S) through position 278 (Q) of the naturally occurring Xanthomonas spp. -derived protein as follows:
  • the predetermined N-terminus to C-terminus order of the plurality of RUs of the DNA binding domain determines the corresponding predetermined target nucleic acid sequence to which the recombinant polypeptides may bind.
  • the RUs and at least one or more half RU are specifically ordered to target the genomic locus or gene of interest.
  • the natural TALE-binding sites always begin with a thymine (T), which may be specified by a cryptic signal within the non-repetitive N-cap region of the TALE polypeptide; in some cases this region may be referred to as repeat 0.
  • T thymine
  • TALE binding sites do not necessarily have to begin with a thymine (T) and recombinant polypeptides disclosed herein may target DNA sequences that begin with T, A, G or C.
  • the recombinant polypeptides disclosed herein may target DNA sequences that begin with T and hence include a RU that contains a RVD that mediated binding to T.
  • the tandem repeat of TALE RUs ends with a half-length repeat or a stretch of sequence that may share identity with only the first 20 amino acids of a repetitive full length TALE RU and this half repeat may be referred to as a half-monomer, a half RU, or a half repeat.
  • DBD may be engineered to include X number (e.g., 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26) full length RUs that are specifically ordered or arranged to target nucleic acid sequences of X+2 length (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23,
  • the last RU in the DBD may be a half repeat.
  • the half repeat may comprise the amino acid sequence X 1-11 X 12 X 13 X 14-19 ,20 or 21 (SEQ ID NO: 471), wherein X 1-11 is a chain of 11 contiguous amino acids, X 14-19 ,20 or 21 is a chain of 7, 8 or 9 contiguous amino acids, and X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); (e) NV or HN for
  • X 1-11 is at least 80% identical, at least 90% identical, or 100% identical to LTPEQVVAIAS (SEQ ID NO:458).
  • X 14-20 or 21 or 22 is at least 80% identical to GGRPALE (SEQ ID NO: 472).
  • a recombinant polypeptide disclosed herein may include from N- to C-terminus, a N-cap region, a DBD comprising a plurality of RUs, a C-cap region, an optional linker, and a transcription repressor domain.
  • the RUs are derived from a TALE protein
  • the recombinant polypeptide may be referred to as TALE-TF.
  • the recombinant polypeptides, such as, TALE- TFs, of the present disclosure can further include a linker connecting the DBD or the C-cap region, if present, to the repressor domain.
  • the linker can serve to provide flexibility between the TALE protein and the repressor domain, allowing for the repressor domain (e.g., KRAB to efficiently inhibit transcriptional machinery).
  • a linker used herein can be a short flexible linker comprising an amino acid sequence comprising 0 residues, 1-3 residues, 4-7 residues, 8-10 residues, 10-12 residues, 5-20 residues, 12-15 residues, or 1-15 residues.
  • Linkers can include, but are not limited to, residues such as glycine, methionine, aspartic acid, alanine, lysine, serine, leucine, threonine, tryptophan, or any combination thereof.
  • the linker can have the amino acid sequence of GGGGGMDAKSLTAWS (SEQ ID NO: 109).
  • a Xanthomonas spp. -derived repeat units can have a sequence of LTPDOVVAIASNHGGKOALETVORLLPVLCODHG (SEQ ID NO: 438) comprising an RVD of NH, which recognizes guanine.
  • a Xanthomonas spp. -derived repeat units can have a sequence of LTPDOVVAIASNGGGKOALETVORLLPVLCODHG (SEQ ID NO: 439) comprising an RVD of NG, which recognizes thymidine.
  • -derived repeat units can have a sequence of LTPDQVVAIASNIGGKQALETV QRLLPVLCQDHG (SEQ ID NO: 440) comprising an RVD of NI, which recognizes adenosine.
  • a Xanthomonas spp. -derived repeat units can have a sequence of LTPDOVVAIASHDGGKOALETVORLLPVLCODHG (SEQ ID NO: 441) comprising an RVD of HD, which recognizes cytosine.
  • the RUs and one or both N-Cap and C-Cap regions may be derived from a transcription activator like effector-like protein (TALE-like protein) of Ralstonia solanacearum.
  • TALE-like protein transcription activator like effector-like protein
  • Repeat units derived from Ralstonia solanacearum can be 33-35 amino acid residues in length.
  • the repeat can be derived from the naturally occurring Ralstonia solanacearum TALE-like protein.
  • the RUs may have the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), where X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is RVD and is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA,
  • X 1-11 may include a stretch of amino acids at least 80%, at least 90%, or a 100% identical to the X 1-11 residues of the following RUs from Ralstonia.
  • X 14-33 or 34 or 35 may include a stretch of 20, 21, or 22 amino acids at least 80%, at least 90%, or a 100% identical to the X 14-33 or 34 or 35 residues of the following RUs from Ralstonia:
  • a Ralstonia solanacearum- repeat unit can have at least 80% sequence identity with any one of the Ralstonia RUs provided herein.
  • the DBD may include a N-cap region at the N-terminus which may be present immediately adjacent the first RU or may be linked to the first RU via a linker.
  • an DBD of the present disclosure can have the full length naturally occurring N-terminus of a naturally occurring Ralstonia solanacearum-derived protein.
  • any truncation of the full length naturally occurring N-terminus of a naturally occurring Ralstonia solanacearum-derived protein can be used at the N-terminus of a DBD of the present disclosure.
  • amino acid residues at positions 1 (H) to position 137 (F) of the naturally occurring Ralstonia solanacearum-derived protein N-terminus can be used as the N-cap region.
  • the truncated N-terminus from position 1 (H) to position 137 (F) can have a sequence as follows:
  • the naturally occurring N-terminus of Ralstonia solanacearum can be truncated to any length and used as the N-cap of the engineered DNA binding domain.
  • the naturally occurring N-terminus of Ralstonia solanacearum can be truncated to include amino acid residues at position 1 (H) to position 120 (K) as follows:
  • the naturally occurring N-terminus of Ralstonia solanacearum can be truncated to amino acid residues at positions 1 to 50, 1 to 70, 1 to 100, 1 to 120, 1 to 130, 10 to 40, 60 to 100, or 100 to 120 and used as the N-cap of the engineered DNA binding domain.
  • N-cap region derived from Xanthomonas TALE the amino acid residues are numbered backward from the first repeat unit such that the amino acid (H in this case) of the N-cap adjacent the first RU is numbered 1 while the N-terminal amino acid of the N-cap is numbered 137 (and is F in this case) or 120 (and is K in this case).
  • the N-cap referred to as the amino terminus or the “NH2” domain, can recognize a guanine.
  • the N-cap can be engineered to bind a cytosine, adenosine, thymidine, guanine, or uracil.
  • an DBD of the present disclosure can include a plurality of RUs followed by a final single half-repeat also derived from Ralstonia solanacearum.
  • the half repeat can have 15 to 23 amino acid residues, for example, the half repeat can have 19 amino acid residues.
  • the half-repeat can have a sequence as follows: LSTAQVVAIACISGQQALE (SEQ ID NO:229).
  • an DBD of the present disclosure can have the full length naturally occurring C-terminus of a naturally occurring Ralstonia solanacearum-derived protein as a C-cap region that is conjugated to the last RU.
  • any truncation of the full length naturally occurring C-terminus of a naturally occurring Ralstonia solanacearum-derived protein can be used as the C-cap.
  • the DBD can comprise amino acid residues at position 1 (A) to position 63 (S) as follows:
  • AIEAHMPTLRQASHSLSPERVAAIACIGGRSAVEAVRQGLPVKAIRRIRREKAPVAGPPPAS (SEQ ID NO:230) of the naturally occurring Ralstonia solanacearum- derived protein C-terminus.
  • the naturally occurring C-terminus of Ralstonia solanacearum can be truncated to any length and used as the C-cap of the DBD.
  • the naturally occurring C-terminus of Ralstonia solanacearum can be truncated to amino acid residues at positions 1 to 63 and used as the C-terminus of the DBD.
  • the naturally occurring C-terminus of Ralstonia solanacearum can be truncated amino acid residues at positions 1 to 50 and used as the C-cap of the DBD.
  • the naturally occurring C-terminus of Ralstonia solanacearum can be truncated to amino acid residues at positions 1 to 63, 1 to 50, 1 to 70, 1 to 100, 1 to 120, 1 to 130, 10 to 40, 60 to 100, or 100 to 120 and used as the C-cap of the DBD.
  • TABLE 7 shows N-Cap, C-Cap, and half-repeats derived from Ralstonia.
  • the present disclosure provides DNA binding domains in which the repeat units can be derived from a Legionellales bacterium, a species of the genus of Legionella, such as L. quateirensis or L. maceachernii, the genus of Burkholderia, the genus of Paraburkholderia, or the genus of Francisella.
  • the RUs may have the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), where X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, HN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, HA, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*,
  • X 1-11 may include a stretch of amino acids at least 80%, at least 90%, or a 100% identical to the X 1-11 residues of the following RUs from animal pathogens, Legionella, Burkholderia, Paraburkholderia, or Francisella.
  • X 14-33 or 34 or 35 may include a stretch of 20, 21, or 22 amino acids at least 80%, at least 90%, or a 100% identical to the X 14-33 or 34 or 35 residues of the RUs from animal pathogens, Legionella (e.g ., L. quateirensis or L. maceachernii), Burkholderia, Paraburkholderia, or Francisella listed in Table 8.
  • Table 8 Repeat Unit Sequences
  • Residues X 12 X 13 of the RU may include base contacting residues (BCR) as listed in the table 8 and may be chosen based upon the target nucleic acid sequence.
  • the last RU in the DBD may be a half RU.
  • the half RU may include a sequence that is at least 80%, at least 90%, at least 95% or a 100% identical to the half RU from L. quateirensis (FNAEQIVRMVSX 12 X 13 GGSKNU) (SEQ ID NO:333).
  • the half RU may include a sequence that is at least 80%, at least 90%, at least 95% or a 100% identical to the half RU from Francisella (YNKKQIVLIASX 12 X 13 SGG) (SEQ ID NO:334).
  • the polypeptide comprises an N-cap region, where the C-terminus (i.e., the last amino acid) of the N-cap region is covalently linked to the N-terminus (i.e., the first amino acid) of the first RU of the DBD either directly or via a linker.
  • the N-cap region is the N-terminus of L. quateirensis protein and may have an amino acid sequence that is at least 80% (e.g., at least 85%, at least 90%, 95%, or 99%, or a 100%) identical to the amino acid sequence:
  • the N-cap region comprises a fragment of SEQ ID NO:335.
  • the N-cap region is a N-terminal domain or a fragment thereof from TALE proteins like those expressed in Burkholderia, Paraburkholderia, or Xanthomonas.
  • the polypeptide comprises a C-cap region, where the N-terminus (i.e., the first amino acid) of the C-terminal domain is covalently linked to the C-terminus (i.e., the last amino acid) of the last RU or the half-repeat unit, if present, in the DBD either directly or via a linker.
  • the C-cap region is the C-terminal domain of L. quateirensis protein and may have an amino acid sequence that is at least 80% (e.g., at least 85%, at least 90%, 95%, or 99%, or a 100%) identical to the amino acid sequence:
  • the C-cap region comprises a fragment of SEQ ID NO:336, such as a fragment having the amino acid sequence
  • the C-cap region domain is a C-terminal domain or a fragment thereof from TALE proteins like those expressed in Burkholderia, Paraburkholderia, or Xanthomonas.
  • the present disclosure provides DNA binding domains in which the repeat units, the N-cap, and the C-ap can be derived from any one of Ralstonia solanacearum, Xanthomonas spp., Legionella quateirensis, Burkholderia, Paraburkholderia, or Francisella.
  • the present disclosure provides a DNA binding domain wherein the plurality of repeat units are selected from any one of the RUs as provided herein and can further comprise an N-cap and/or C-cap as provided herein.
  • repressor repressor domain
  • transcriptional repressor domain refers to a portion of the recombinant polypeptide as disclosed herein which portion decreases expression of a gene when the recombinant polypeptide is bound to the target gene.
  • the repressor domain comprises Kriippel-associated box (KRAB) protein.
  • the repressor domain comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, or MeCP2.
  • the repressor domain comprises an amino acid sequence at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or a 100% to the amino acid sequence set forth in one of SEQ ID NOs:84-101.
  • the repressor domain includes a KRAB domain comprising an amino acid sequence that is at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or a 100% to the amino acid sequence set forth in SEQ ID NO:338:
  • the N-cap region or the C-cap region included in the disclosed DBD may include a nuclear localization sequence (NLS) to facilitate entry into the nucleus of a cell, e.g., an animal cell, such as, a human cell.
  • NLS nuclear localization sequence
  • the polypeptide may be produced in a host cell and expressed with a translocation signal at the N-terminus which translocation signal may be cleaved during translocation.
  • the RUs may be linked C-terminus to N-terminus with no additional amino acids separating immediately adjacent RUs.
  • immediately adjacent RUs may be separated by a spacer sequence of at least one amino acid.
  • the spacer sequence includes at least 2, 3, 4, 5, 6, or 7 amino acids, or up to 5, or up to 10 amino acids.
  • the spacer sequence may include amino acids that have small side chains.
  • the spacer sequence is a flexible linker.
  • a DBD of the present disclosure can comprise between 2 to 50 RUs, e.g., between 5 and 36, between 9 and 36, between 9 and 40, between 12 and 30, between 5 to 10, between 10 to 15, between 15 to 20, between 20 to 25, between 25 to 30, between 30 to 35 animal pathogen-derived repeat domains, or between 35 to 40 animal pathogen-derived repeat domains.
  • a MAP- NBD described herein can comprise up to 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22,
  • a recombinant polypeptide as disclosed herein can be linked to a fluorophore, such as Hydroxycoumarin, methoxycoumarin, Alexa fluor, aminocoumarin, Cy2, FAM, Alexa fluor 488, Fluorescein FITC, Alexa fluor 430, Alexa fluor 532, HEX, Cy3, TRITC, Alexa fluor 546, Alexa fluor 555, R-phycoerythrin (PE), Rhodamine Red-X, Tamara, Cy3.5, Rox, Alexa fluor 568, Red 613, Texas Red, Alexa fluor 594, Alexa fluor 633, Allophycocyanin, Alexa fluor 633, Cy5, Alexa fluor 660, Cy5.5, TruRed, Alexa fluor 680, Cy7, GFP, or mCHERRY.
  • a fluorophore such as Hydroxycoumarin, methoxycoumarin, Alexa fluor, aminocoumarin, Cy2, FAM, Alexa fluor 488, Flu
  • a recombinant polypeptide as disclosed herein can be linked to a biotinylation reagent.
  • a recombinant polypeptide labeled with an imaging moiety as disclosed herein may be used to image binding and/locabzation of the recombinant polypeptide to a site in the genome of a cell.
  • the polypeptides and the nucleic acids described herein may be present in a pharmaceutical composition comprising a pharmaceutically acceptable excipient.
  • the polypeptides and the nucleic acids are present in a therapeutically effective amount in the pharmaceutical composition.
  • a therapeutically effective amount can be determined based on an observed effectiveness of the composition.
  • a therapeutically effective amount can be determined using assays that measure the desired effect in a cell, e.g., in a reporter cell line in which expression of a reporter is modulated in response to the polypeptides of the present disclosure.
  • the pharmaceutical compositions can be administered ex vivo or in vivo to a subject in order to practice the therapeutic and prophylactic methods and uses described herein.
  • compositions of the present disclosure can be formulated to be compatible with the intended method or route of administration; exemplary routes of administration are set forth herein.
  • Suitable pharmaceutically acceptable or physiologically acceptable diluents, carriers or excipients include, but are not limited to, nuclease inhibitors, protease inhibitors, a suitable vehicle such as physiological saline solution or citrate buffered saline.
  • the pharmaceutical composition may include a plurality of the polypeptides provided herein.
  • the composition may include two, three, four, or more of the polypeptides provided herein, wherein the polypeptides all bind to sequences in regulatory region of the same gene or sequences in regulatory regions of different genes.
  • the composition may include a plurality of polypeptides that bind to a sequence of a target gene as disclosed herein (e.g., PD1, TIM3, or LAG3 gene).
  • the composition may include a first polypeptide that binds to regulatory region of a first gene and a second polypeptide that binds to regulatory region of a second gene, where the first and second genes are independently selected from PD1, TIM3, and LAG3.
  • the composition may include a first polypeptide that binds to regulatory region of PD1 gene, a second polypeptide that binds to regulatory region of TIM3 gene, and a third polypeptide that binds to regulatory region of LAG3 gene.
  • the composition may include a plurality of polypeptides that bind to regulatory region of PD1 gene, a plurality of polypeptides that bind to regulatory region of TIM3 gene, and a plurality of polypeptides that bind to regulatory region of LAG3 gene.
  • compositions comprising the disclosed polypeptides, and nucleic acids encoding the disclosed polypeptides can be delivered into a target cell by any suitable means, including, for example, by injection, infection, transfection, and vesicle or liposome mediated delivery.
  • a mRNA or a vector encoding the polypeptides disclosed herein may be injected, transfected, or introduced via viral infection into a target cell, where the cell is ex vivo or in vivo.
  • Any vector systems may be used including, but not limited to, plasmid vectors, retroviral vectors, lentiviral vectors, adenovirus vectors, poxvirus vectors; herpesvirus vectors and adeno-associated virus vectors, etc.
  • the nucleic acids encoding the polypeptides may be carried on the same vector or on different vectors.
  • Non-viral vector delivery systems include DNA plasmids, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer.
  • Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell.
  • Vectors suitable for introduction of polynucleotides as described herein include described herein include non-integrating lentivirus vectors (IDLV).
  • IDLV non-integrating lentivirus vectors
  • Non-viral vector delivery systems include electroporation, lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipidmucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA.
  • Primary cells may be isolated and used ex vivo for reintroduction into the subject to be treated.
  • Suitable primary cells include peripheral blood mononuclear cells (PBMC), and other blood cell subsets such as, but not limited to, CD4+ T cells or CD8+ T cells.
  • the cell may be a CART cell.
  • Suitable cells also include stem cells such as, by way of example, embryonic stem cells, induced pluripotent stem cells, hematopoietic stem cells, neuronal stem cells, mesenchymal stem cells, muscle stem cells and skin stem cells.
  • the stem cells may be isolated from a subject to be treated or may be derived from a somatic cell of a subject to be treated using the polypeptides disclosed herein.
  • the cells into which the polypeptides of the present disclosure or a nucleic acid encoding a polypeptide of the present disclosure may be an animal cell, e.g., from a human needing treatment.
  • the polypeptide of the present disclosure is only transiently present in a target cell.
  • the polypeptide is expressed from a nucleic acid that expressed the polypeptide for a short period of time, e.g., for up to 1 day, 3 days, 1 week, 3 weeks, or 1 month.
  • adenoviral based systems may be used.
  • Adeno-associated virus (“AAV”) vectors can also be used to transduce cells with nucleic acids encoding the polypeptide of the present disclosure, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures.
  • rAAV recombinant adeno- associated virus vectors
  • rAAV recombinant adeno- associated virus vectors
  • nucleic acids encoding the polypeptides disclosed herein can be delivered using a gene therapy vector with a high degree of specificity to a particular tissue type or cell type.
  • a viral vector is typically modified to have specificity for a given cell type by including a sequence encoding a ligand expressed as a fusion protein with a viral coat protein on the viruses' outer surface.
  • the ligand is chosen to have affinity for a receptor known to be present on the cell type of interest.
  • gene therapy vectors can be delivered in vivo by administration to an individual patient.
  • administration involves systemic administration (e.g., intravenous, intraperitoneal, intramuscular, subdermal, or intracranial infusion), direct injection (e.g., intrathecal), or topical application, as described below.
  • vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the vector or which have been modified by expression of the polypeptide of the present disclosure encoded by the vector.
  • the nucleic acid encoding the polypeptides provided herein may be codon optimized to enhance expression of the polypeptide in the target cell.
  • the sequence of the nucleic acid can be varied to provide codons that are known to be highly used in animal cells, such as, human cells to enhance production of the polypeptide in a human cell.
  • silent mutations may be made in the nucleotide sequence encoding a polypeptide disclosed herein for codon optimization in mammalian cells.
  • a method of suppressing expression of PDCD-1 gene in a cell comprising introducing into the cell the recombinant polypeptide that comprises the DBD and the transcriptional repressor domain as provided herein, where the DBD binds to a target nucleic acid sequence present in the PDCD-1 gene and the transcriptional repressor suppresses expression of the PDCD-1 gene.
  • a method of suppressing expression of TIM3 gene in a cell comprising introducing into the cell the recombinant polypeptide that comprises the DBD and the transcriptional repressor domain as provided herein, where the DBD binds to a target nucleic acid sequence present in the TIM3 gene and the transcriptional repressor suppresses expression of the TIM3 gene.
  • a method of suppressing expression of LAG3 gene in a cell comprising introducing into the cell the recombinant polypeptide that comprises the DBD and the transcriptional repressor domain as provided herein, where the DBD binds to a target nucleic acid sequence present in the LAG3 gene and the transcriptional repressor suppresses expression of the LAG3 gene.
  • the polypeptide is introduced as a nucleic acid encoding the polypeptide.
  • the nucleic acid is a deoxyribonucleic acid (DNA).
  • the nucleic acid is a ribonucleic acid (RNA).
  • the sequence of the nucleic acid is codon optimized for expression in a human cell.
  • the cell is an animal cell. In certain aspects, the cell is a human cell. In certain aspects, the cell is a cancer cell. In certain aspects, the cell is an ex vivo cell.
  • the introducing comprises administering the polypeptide or a nucleic acid encoding the polypeptide to a subject.
  • the administering comprises parenteral administration.
  • the administering comprises intravenous, intramuscular, intrathecal, or subcutaneous administration.
  • the administering comprises direct injection into a site in a subject.
  • the administering comprises direct injection into a tumor.
  • the introducing may induce a repression of expression of the target gene for a period of at least 2 days, at least 3 days, at least 9 days, at least at least 15 days, at least 1 month, at least 6 months, at least 1 year to up to 5 years.
  • the introducing may suppress expression of gene expression by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, or more.
  • the introducing may be repeated to maintain suppression of target gene expression.
  • the introducing may be performed as a combination therapy with, for example, a cancer therapy.
  • the combination therapy may involve introducing the recombinant polypeptide into the cell prior to, concurrently with, or after administration of cancer therapy.
  • An animal cell can include a cell from a marine invertebrate, fish, insects, amphibian, reptile, or mammal.
  • a mammalian cell can be obtained from a primate, ape, equine, bovine, porcine, canine, feline, or rodent.
  • a mammal can be a primate, ape, dog, cat, rabbit, ferret, or the like.
  • a rodent can be a mouse, rat, hamster, gerbil, hamster, chinchilla, or guinea pig.
  • a bird cell can be from a canary, parakeet or parrots.
  • a reptile cell can be from a turtle, lizard or snake.
  • a fish cell can be from a tropical fish.
  • the fish cell can be from a zebrafish (e.g., Danio rerio).
  • a worm cell can be from a nematode (e.g., C. elegans).
  • An amphibian cell can be from a frog.
  • An arthropod cell can be from a tarantula or hermit crab.
  • a mammalian cell can also include cells obtained from a primate (e.g., a human or a non-human primate).
  • a mammalian cell can include an epithelial cell, connective tissue cell, hormone secreting cell, a nerve cell, a skeletal muscle cell, a blood cell, an immune system cell, or a stem cell.
  • Exemplary mammalian cells can include, but are not limited to, 293A cell line, 293FT cell line, 293F cells , 293 H cells, HEK 293 cells, CHO DG44 cells, CHO-S cells, CHO-K1 cells, Expi293FTM cells, Flp-InTM T-RExTM 293 cell line, Flp-InTM-293 cell line, Flp-InTM-3T3 cell line, Flp-InTM-BHK cell line, Flp-InTM-CHO cell line, Flp-InTM-CV-l cell line, Flp-InTM-Jurkat cell line, FreeStyleTM 293-F cells, FreeStyleTM CHO-S cells, GripTiteTM 293 MSR cell line, GS-CHO cell line, HepaRGTM cells, T-RExTM Jurkat cell line, Per.C6 cells, T-RExTM-293 cell line, T-RExTM-CHO cell line, T-RExTM-HeLa cell line, NC-HI
  • a target cell is a cancerous cell, e.g., in a human.
  • Cancer can be a solid tumor or a hematologic malignancy.
  • the solid tumor can include a sarcoma or a carcinoma.
  • Exemplary sarcoma target cell can include, but are not limited to, cell obtained from alveolar rhabdomyosarcoma, alveolar soft part sarcoma, ameloblastoma, angiosarcoma, chondrosarcoma, chordoma, clear cell sarcoma of soft tissue, dedifferentiated liposarcoma, desmoid, desmoplastic small round cell tumor, embryonal rhabdomyosarcoma, epithelioid fibrosarcoma, epithelioid hemangioendothelioma, epithelioid sarcoma, esthesioneuroblastoma, Ewing sarcoma, extrarenal rhabdoid tumor, extraskeletal myxoid chondrosarcoma, extraskeletal osteosarcoma, fibrosarcoma, giant cell tumor, hemangiopericytoma, infantile fibrosarcoma, inflammatory myofibroblastic
  • Exemplary carcinoma target cell can include, but are not limited to, cell obtained from anal cancer, appendix cancer, bile duct cancer (i.e., cholangiocarcinoma), bladder cancer, brain tumor, breast cancer, cervical cancer, colon cancer, cancer of Unknown Primary (CUP), esophageal cancer, eye cancer, fallopian tube cancer, gastroenterological cancer, kidney cancer, liver cancer, lung cancer, medulloblastoma, melanoma, oral cancer, ovarian cancer, pancreatic cancer, parathyroid disease, penile cancer, pituitary tumor, prostate cancer, rectal cancer, skin cancer, stomach cancer, testicular cancer, throat cancer, thyroid cancer, uterine cancer, vaginal cancer, or vulvar cancer.
  • CUP Unknown Primary
  • the cancerous cell can comprise cells obtained from a hematologic malignancy.
  • Hematologic malignancy can comprise a leukemia, a lymphoma, a myeloma, a non-Hodgkin's lymphoma, or a Hodgkin's lymphoma.
  • the hematologic malignancy can be a T-cell based hematologic malignancy.
  • the hematologic malignancy can be a B-cell based hematologic malignancy.
  • Exemplary B-cell based hematologic malignancy can include, but are not limited to, chronic lymphocytic leukemia (CLL), small lymphocytic lymphoma (SLL), high-risk CLL, a non-CLL/SLL lymphoma, prolymphocytic leukemia (PLL), follicular lymphoma (FL), diffuse large B-cell lymphoma (DLBCL), mantle cell lymphoma (MCL), Waldenstrom's macroglobulinemia, multiple myeloma, extranodal marginal zone B cell lymphoma, nodal marginal zone B cell lymphoma, Burkitt's lymphoma, non-Burkitt high grade B cell lymphoma, primary mediastinal B-cell lymphoma (PMBL), immunoblastic large cell lymphoma, precursor B -lymphoblastic lymphoma, B cell prolymphocytic leukemia, lymphoplasmacytic lymphoma, s
  • Exemplary T-cell based hematologic malignancy can include, but are not limited to, peripheral T-cell lymphoma not otherwise specified (PTCL-NOS), anaplastic large cell lymphoma, angioimmunoblastic lymphoma, cutaneous T-cell lymphoma, adult T-cell leukemia/lymphoma (ATLL), blastic NK-cell lymphoma, enteropathy-type T-cell lymphoma, hematosplenic gamma-delta T-cell lymphoma, lymphoblastic lymphoma, nasal NK/T-cell lymphomas, or treatment-related T-cell lymphomas.
  • PTCL-NOS peripheral T-cell lymphoma not otherwise specified
  • anaplastic large cell lymphoma angioimmunoblastic lymphoma
  • ATLL adult T-cell leukemia/lymphoma
  • blastic NK-cell lymphoma enteropathy-type T-cell lymphoma
  • a cell can be a tumor cell line.
  • Exemplary tumor cell line can include, but are not limited to, 600MPE, AU565, BT-20, BT-474, BT-483, BT-549, Evsa-T, Hs578T, MCF-7, MDA-MB- 231, SkBr3, T-47D, HeLa, DU145, PC3, LNCaP, A549, H1299, NCI-H460, A2780, SKOV-3/Luc, Neuro2a, RKO, RKO-AS45-1, HT-29, SW1417, SW948, DLD-1, SW480, Capan-1, MC/9, B72.3, B25.2, B6.2, B38.1, DMS 153, SU.86.86, SNU-182, SNU-423, SNU-449, SNU-475, SNU-387, Hs 817.T, LMH, LMH/2A, SNU-398, PLHC-1, HepG2/SF
  • polypeptides disclosed herein are produced using a suitable method including recombinant and non-recombinant methods (e.g., chemical synthesis).
  • a polypeptide is chemically synthesized
  • the synthesis may proceed via liquid-phase or solid-phase.
  • Solid-phase peptide synthesis allows the incorporation of unnatural amino acids and/or peptide/protein backbone modification.
  • Various forms of SPPS such as Fmoc and Boc, are available for synthesizing polypeptides of the present disclosure. Details of the chemical synthesis are known in the art (e.g., Ganesan A. 2006 Mini Rev. Med. Chem. 6:3-10; and Camarero J.A. et al. 2005 Protein Pept Lett. 12:723-8).
  • the polypeptide may be produced as an intracellular protein or as a secreted protein, using any suitable construct and any suitable host cell, which can be a prokaryotic or eukaryotic cell, such as a bacterial (e.g., E. coli) or a yeast host cell, respectively.
  • eukaryotic cells that are used as host cells for production of the polypeptides include insect cells, mammalian cells, and/or plant cells.
  • mammalian host cells may include human cells (e.g., HeLa, 293, H9 and Jurkat cells); mouse cells (e.g., NIH3T3, L cells, and C127 cells); primate cells (e.g., Cos 1, Cos 7 and CV1) and hamster cells (e.g., Chinese hamster ovary (CHO) cells).
  • human cells e.g., HeLa, 293, H9 and Jurkat cells
  • mouse cells e.g., NIH3T3, L cells, and C127 cells
  • primate cells e.g., Cos 1, Cos 7 and CV1
  • hamster cells e.g., Chinese hamster ovary (CHO) cells.
  • the polypeptide disclosed herein are produced in CHO cells.
  • a variety of host-vector systems suitable for the expression of a polypeptide may be employed according to standard procedures known in the art. See, e.g., Sambrook et al. 1989 Current Protocols in Molecular Biology Cold Spring Harbor Press, New York; and Ausubel et al. 1995 Current Protocols in Molecular Biology, Eds. Wiley and Sons. Methods for introduction of genetic material into host cells include, for example, transformation, electroporation, conjugation, calcium phosphate methods and the like. The method for transfer can be selected so as to provide for stable expression of the introduced polypeptide -encoding nucleic acid.
  • the polypeptide -encoding nucleic acid can be provided as an inheritable episomal element (e.g., a plasmid) or can be genomically integrated.
  • an inheritable episomal element e.g., a plasmid
  • genomically integrated e.g., a variety of appropriate vectors for use in production of a polypeptide of interest are commercially available.
  • Vectors can provide for extrachromosomal maintenance in a host cell or can provide for integration into the host cell genome.
  • the expression vector provides transcriptional and translational regulatory sequences and may provide for inducible or constitutive expression where the coding region is operably-linked under the transcriptional control of the transcriptional initiation region, and a transcriptional and translational termination region.
  • the transcriptional and translational regulatory sequences may include, but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, translational start and stop sequences, and enhancer or activator sequences. Promoters can be either constitutive or inducible, and can be a strong constitutive promoter (e.g., T7).
  • nucleic acids encoding the polypeptides disclosed herein.
  • a nucleic acid encoding the polypeptides disclosed herein is operably linked to a promoter sequence that confers expression of the polypeptide.
  • the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell.
  • the nucleic acid is a deoxyribonucleic acid (DNA).
  • the nucleic acid is a ribonucleic acid (RNA).
  • a vector comprising the nucleic acid encoding the polypeptides for binding a target nucleic acid as described herein. In certain aspects, the vector is a viral vector.
  • a host cell comprising the nucleic acid or the vector encoding the polypeptides disclosed herein is provided. In certain aspects, a host cell comprising the polypeptides disclosed herein is provided. In certain aspects, a host cell that expresses the polypeptide is also disclosed.
  • the present disclosure also provides recombinant polypeptide comprising a DNA binding domain and a transcriptional repressor domain, wherein the DNA binding domain and the transcriptional repressor domain are heterologous, wherein the transcriptional repressor domain comprises an amino acid sequence at least 80% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
  • the transcriptional repressor domain comprises an amino acid sequence at least 85% identical, at least 90% identical, at least 95% identical, or a 100% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
  • the DNA binding domain may be a zinc finger protein (ZFP), a transcription activator-like effector (TALE), or a guide RNA.
  • ZFP zinc finger protein
  • TALE transcription activator-like effector
  • the DNA binding domain may be a DBD as disclosed herein that binds to a target sequence provided herein.
  • the DNA binding domain may bind to a target nucleic acid sequence in a gene.
  • the target nucleic acid sequence may be present in a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the BCL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
  • the present disclosure also provides a nucleic acid encoding the recombinant polypeptide.
  • the nucleic acid may be operably linked to a promoter sequence that confers expression of the polypeptide.
  • the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell.
  • the nucleic acid is a deoxyribonucleic acid (DNA).
  • the nucleic acid is a ribonucleic acid (RNA).
  • the present disclosure also provides a vector comprising the nucleic acid disclosed herein.
  • the vector may be a viral vector.
  • the present disclosure also provides a host cell comprising the nucleic acid or the vector disclosed herein.
  • the host cell may include the polypeptide.
  • the host cell may express the polypeptide.
  • composition comprising the polypeptide and a pharmaceutically acceptable excipient.
  • the pharmaceutical composition may include the nucleic acid or the vector and a pharmaceutically acceptable excipient.
  • a method of suppressing expression of an endogenous gene in a cell may include introducing into the cell the recombinant polypeptide, wherein the DBD of the polypeptide binds to a target nucleic acid sequence present in the endogenous gene and the heterologous transcriptional repressor domain suppresses expression of the endogenous gene.
  • the recombinant polypeptide is introduced as a nucleic acid encoding the polypeptide.
  • the nucleic acid may be a deoxyribonucleic acid (DNA) or RNA.
  • the nucleic acid may be codon optimized for expression in a human cell.
  • the target gene may be a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINA1 gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
  • the cell may be an animal cell.
  • the cell may be a human cell.
  • the cell may be a cancer cell.
  • the cell may be an ex vivo cell or an in vivo cell.
  • the introducing may include administering the polypeptide or a nucleic acid encoding the polypeptide to a subject.
  • the administering may include parenteral administration.
  • the administering may include intravenous, intramuscular, intrathecal, or subcutaneous administration.
  • the administering may include direct injection into a site in a subject.
  • the administering may include direct injection into a tumor.
  • a DBD and a functional domain are provided as separate polypeptides instead of a single polypeptide and are assembled into a functional complex using dimerization of a heterodimer pair, where the DBD and the functional domain are each fused to a member of the heterodimer pair.
  • indirect dimerization may also be utilized by using a fused polypeptide comprising two individual members of a heterodimer pair that act as a bridge to bring a DBD and a functional domain together, as explained in detail below.
  • split systems find use in screens for a DBD or a functional domain by, e.g., using a DBD fused to a first member of a heterodimer pair and screening a plurality of candidate functional domains each fused to a second member of the heterodimer pairs and vice versa.
  • control of modulation of gene expression may be achieved by having the DBD and functional domain expression on board (e.g., constitutive expression) a cell as separate polypeptides and assembling a functional DBD and functional domain complex by introducing a bridging construct into the cell, when modulation of gene expression is desired.
  • the bridging construct may be expressed transiently thereby modulating gene expression transiently.
  • control of modulation of gene expression may be achieved by disrupting the DBD and functional domain complex by introducing a disruptor comprising a heterodimer pair or an individual member of a heterodimer pair as explained below.
  • the individual components of a split system may be introduced into a cell as nucleic acids encoding the individual components or as polypeptides or a combination thereof.
  • the split systems may be used for modulating gene expression in any cell such as a mammalian cell having a target site at which the DBD binds. Examples of such cells are provided herein, e.g., in the preceding sections of the application.
  • the heterodimer pairs of the split system include: 37A, 37B; 13A, 13B; DHD37-BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B; DHD154-A, DHD-154B; 37A, 9B; 13A, 37B; 13A, DHD150- B; 37A, DHD37-BBB-B; and DHD37-BBB-A, 37B, where each of 37A, 37B, 13A, 13B, DHD37-BBB- A, DHD37-BBB-B, DHD150-A, DHD150-B, DHD154-A, and DHD-154B, are the individual members of the listed heterodimer pairs.
  • the term first member and second member refers to either of the individual members of a listed heterodimer pair.
  • 37A and the numeral “1” are used herein interchangeably and in the context of a member of a heterodimer pair refer to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
  • DSDEHLKKLKTFLENLRRHLDRLDKHIKQLRDILSENPEDERVKDVIDLSERSVRIVKTVIKIFEDS VRKKE (SEQ ID NO: 473), and is capable of binding to 37B, 9B, and DHD37-BBB-B.
  • 37B and “1” are used herein interchangeably and in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
  • GSDDKELDKLLDTLEKILQTATKIIDDANKLLEKLRRSERKDPKVVETYVELLKRHEKAV KELLEIAKTHAKKVE (SEQ ID NO: 474), and is capable of binding to 37A, 13A, and DHD37-BBB-A.
  • the term “13 A” and the numeral “9” are used herein interchangeably and in the context of a member of a heterodimer pair refer to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
  • GTKEDILERQRKIIERAQEIHRRQQEILEELERIIRKPGSSEEAMKRMLKLLEESLRLLKELL ELSEESAQLLYEQR (SEQ ID NO: 475), and is capable of binding to 13B, 37B, and DHD150-B.
  • 13B and 9' are used herein interchangeably and in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
  • DHD37-BBB-A in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
  • DHD37-BBB-B in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
  • GDVKELTKILDTLTKILETATKVIKDATKLLEEHRKSDKPDPRLIETHKKLVEEHETLVRQHKELA EEHLKRTR (SEQ ID NO: 478), and is capable of binding to DHD37-BBB-A and 37A.
  • DHD150-A in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence: GDVKELTKILDTLTKILETATKVIKDATKLLEEHRKSDKPDPRLIETHKKLVEEHETLVRQHKELA EEHLKRTR (SEQ ID NO: 478), and is capable of binding to DHD150-B.
  • GDVKELTKILDTLTKILETATKVIKDATKLLEEHRKSDKPDPRLIETHKKLVEEHETLVRQHKELA EEHLKRTR SEQ ID NO: 478
  • DHD150-B in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence: DNEEIIKEARRVVEEYKKAVDRLEELVRRAENAKHASEKELKDIVREILRISKELNKVSERLIELW ERSQERAR (SEQ ID NO: 479), and is capable of binding to DHD150-A and 13A.
  • DHD154-A and “DHD-154-A” in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
  • TAEELLEVHKKSDRVTKEHLRVSEEILKVVEVLTRGEVSSEVLKRVLRKLEELTDKLRRVTEEQR RVVEKLN (SEQ ID NO: 480), and is capable of binding to DHD-154-B.
  • DHD154-B and “DHD-154-B” in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
  • DLEDLLRRLRRLVDEQRRLVEELERVSRRLEKAVRDNEDERELARLSREHSDIQDKHDKLAREIL EVLKRLLERTE SEQ ID NO: 481
  • the present disclosure provides two or more nucleic acids encoding one or more of the members of the heterodimer pairs.
  • the nucleic acid encoding a fusion protein comprising a DBD and a member of a heterodimer pair and another nucleic acid encoding a fusion protein comprising a functional domain and a member of the heterodimer pair are provided.
  • a plurality of nucleic acids encode (i) polypeptides that dimerize via direct dimerization, comprising: (A) a DBD fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair, or (B) a DBD fused to a second member of a heterodimer pair and a functional domain fused to a first member of the heterodimer pair, wherein the first and second members of the heterodimer pair bind to each other thereby directly dimerizing the DBD and the functional domain, and wherein the heterodimer pair is selected from one of the following heterodimer pairs: 37A, 37B; 13A, 13B; DHD37-BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B; DHD154-A, DHD-154B; 37A, 9B; 13A, 37B
  • the DBD in (i) (A) or (i) (B) may be fused to a first member of a first heterodimer pair and the functional domain is a first functional domain fused a second member of the first heterodimer pair and to a first member of a second heterodimer pair, and may be used with a second functional domain fused to a second member of the second heterodimer pair, wherein the members of the first heterodimer pair mediate dimerization of the DBD and the first functional domain and members of the second heterodimer pair mediate dimerization of the first functional domain and the second functional domain.
  • the DBD is fused to a first member of a first heterodimer pair and to a first member of a second heterodimer pair
  • the functional domain is fused a second member of the first heterodimer pair the system further comprising a second functional domain fused to a second member of the second heterodimer pair, wherein the members of the first heterodimer pair mediate assembly of the DBD and the first functional domain and members of the second heterodimer pair mediate assembly of the DBD and the second functional domain.
  • a plurality of nucleic acids are provided, where the plurality of nucleic acids encode (ii) polypeptides that dimerize indirectly via a bridging construct, comprising: (A) a DBD fused to a first member of a first heterodimer pair; a bridging construct comprising a second member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or (B) a DBD fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or (C) a DBD fused to a second member of a first heterodimer pair; a bridging construct comprising a first
  • the DBD may bind to a target nucleic acid sequence present in an endogenous gene in a cell.
  • the functional domain may be an enzyme, a transcriptional activator, a transcriptional repressor, or a DNA nucleotide modifier.
  • the enzyme may be a nuclease, a DNA modifying protein, or a chromatin modifying protein.
  • the nuclease may be a cleavage domain or a half cleavage domain.
  • the cleavage domain or half-cleavage domain may be a type IIS restriction enzyme.
  • the type IIS restriction enzyme may be Fokl or Bfil.
  • the chromatin modifying protein may be lysine- specific histone demethylase 1 (LSD1).
  • the transcriptional activator may be VP16, VP64, p65, p300 catalytic domain, TET1 catalytic domain, TDG, Ldbl self-associated domain, SAM activator (VP64, p65, HSF1), or VPR (VP64, p65, Rta).
  • the transcriptional repressor may be KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, MeCP2, or a novel transcriptional repressor as disclosed herein.
  • the DNA nucleotide modifier may be an adenosine deaminase.
  • the target nucleic acid sequence may be within a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
  • the DBD may be a transcription activator-like effector (TALE).
  • a DBD fused to a member of a heterodimer pair fused to a member of a heterodimer pair, a functional domain fused to a member of a heterodimer pair, a bridging construct comprising a member of a heterodimer pair fused to another member, such as those described in the preceding paragraphs and further described below and those encoded by the plurality of nucleic acids described above.
  • a DBA and a functional domain is as set forth in (i)(A) or (i)(B).
  • a DBD, a bridging construct, and a functional domain is as set forth in (ii)(A), (ii)(B), or (ii)(C).
  • host cells that include (a) nucleic acids encoding the polypeptides as set forth in (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in (ii)(A), (ii)(B), or
  • host cells that include host cells that include (a) the polypeptides as set forth in (i)(A) or (i)(B); or (b) the polypeptides as set forth (ii)(A), (ii)(B), or (ii)(C).
  • kits comprising: (a) nucleic acids encoding the polypeptides as set forth in (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in (ii)(A), (ii)(B), or (ii)(C).
  • kit comprising: (a) a first vector comprising a nucleic acid encoding the DBD set forth in (i)(A); and (b) a second vector comprising a nucleic acid encoding the functional domain set forth in (i)(A); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in
  • kits comprising: a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(A); a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(A); and a third vector comprising a nucleic acid encoding the functional domain set forth in
  • compositions comprising the nucleic acids disclosed herein or the polypeptides disclosed herein.
  • the pharmaceutical composition may also include a pharmaceutically acceptable excipient.
  • the pharmaceutical composition may include (a) nucleic acids encoding the polypeptides as set forth in (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in (ii)(A), (ii)(B), or (ii)(C).
  • the pharmaceutical composition may include (a) a first vector comprising a nucleic acid encoding the DBD set forth in (i)(A); and (b) a second vector comprising a nucleic acid encoding the functional domain set forth in (i)(A); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in (i)(B); and (b) a second vector comprising a nucleic acid encoding the functional domain set forth in (i)(B).
  • the pharmaceutical composition may include: (a) a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(A); (b) a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(A); and (c) a third vector comprising a nucleic acid encoding the functional domain set forth in (ii)(A); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(B); (b) a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(B); and (c) a third vector comprising a nucleic acid encoding the functional domain set forth in (ii)(B); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(C); (b) a second vector comprising a nucleic acid
  • the pharmaceutical composition may include the DBD and a functional domain or a DNA binding domain, a functional domain and a bridging construct as provided herein and a pharmaceutically acceptable excipient.
  • the pharmaceutical composition may include the host cell as provided herein and a pharmaceutically acceptable excipient.
  • the split systems of DBD and functional domains and heterodimer pairs may be used in a method for modulating expression from a target gene in a cell.
  • the method may include (i) introducing into the cell a first nucleic acid encoding a DNA binding domain fused to a first member of a heterodimer pair and a second nucleic acid encoding a functional domain fused to a second member of the heterodimer pair; or (ii) introducing into the cell a first nucleic acid encoding a DNA binding domain fused to a second member of a heterodimer pair and a second nucleic acid encoding a functional domain fused to a first member of the heterodimer pair; or (iii) introducing into the cell a DNA binding domain fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair; or (iv) introducing into the cell a DNA binding domain fused to a second
  • the heterodimer pair may be selected from one of the following heterodimer pairs: 37A, 37B; 13A, 13B; DHD37-BBB-A, DHD37- BBB-B; DHD150-A, DHD150-B; DHD154-A, DHD-154B; 37A, 9B; 13A, 37B; 13A, DHD150-B; 37A, DHD37-BBB-B; and DHD37-BBB-A, 37B, wherein the DBD dimerizes with the functional domain via dimerization of the members of the heterodimer pair and wherein binding of the DBD to a target nucleic acid sequence in the target gene results in modulation of expression of the target gene via the functional domain dimerized to the DBD.
  • the method may be used for screening a candidate DBD or a candidate functional domain or for ranking DBDs or functional domains based on specificity, activity, and the like.
  • the modulation of expression of the target gene may be assessed to determine whether a DBD is specific for the target gene and/or whether the functional domain is active in repressing or activating expression of the target gene.
  • the split systems of DBD and functional domains and heterodimer pairs may be used in a method for modulating expression from a target gene in a cell, where the method includes introducing into a cell expressing a DNA binding domain (DBD) fused to a first member of a first heterodimer pair and a functional domain fused to a second member of a second heterodimer pair, a bridging construct comprising a second member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct; or introducing into a cell expressing a DNA binding domain (DBD) fused to a second member of a first heterodimer pair and a functional domain fused to a second member of a second heterodimer pair, a bridging construct comprising a first member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct;
  • Such a system may be used for fine tuning control of modulation of gene expression by controlling expression of the different components required for modulating gene expression.
  • non-cognate heterodimer pair refers to a heterodimer pair whose members bind to each other with an affinity that is lower than the affinity with which members of a “cognate heterodimer pair” bind.
  • 37A,37B is a cognate heterodimer pair while 37A, 9B form a non-cognate heterodimer pair, since the binding affinity between 37A and 37B is higher than that between 37A and 9B.
  • cognate heterodimer pairs examples include 37A, 37B; 13A, 13B; DHD37- BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B; and DHD154-A, DHD-154B. While members of a “non-cognate heterodimer” bind to each other, members that are not part of a “non-cognate heterodimer” or a “cognate heterodimer” do not significantly bind to each other and are not considered as members of a heterodimer pair.
  • the fusion polypeptides such as, DBD fused to a member of a heterodimer pair may be such that the C-terminus of the DBD is fused to the N-terminus of a member of a heterodimer pair and the N-terminus of the functional domain is fused to the C-terminus of a member of a heterodimer pair.
  • one or more components of the system may be expressed transiently while other component(s) are expressed stably. Stable and transient expression in a cell may be achieved by methods known in the art, such as, transient transfection, gene integration, constitutive and inducible promoters and the like.
  • Anti CD19 CAR-T cell manufacturing Primary T cells were thawed and activated with CD3/CD28 Dynabeads and cultured for 48 hours prior to electroporation with either no mRNA (control) or mRNA encoding the TALE-TFs against PD1. At 24 hours post electroporation T cells were transduced with a lentivirus vector encoding a 3 rd generation anti CD 19 CAR construct on Retronectin at an MOI of 5 to 10. After 24 hours the virus and beads were removed and T cells expanded in RPMI + 10%FBS + IL- 2 for up to 5 days.
  • Co-culture (killing) assay CAR-T cells and control T cells were incubated with CD 19- expressing NALM-6 cells or NALM-6 cells engineered to express PDL-1 (the ligand for PD-1) or NALM-6 cells in which the target antigen CD 19 was knocked out using TALENs (CD 19 KO) at an effector-to-target (E:T) ratio of 1: 1 in a 96-well round bottom culture plate for 16 hours at 37degrees with 5% C02. After 16 hours of incubation, specific target cell killing was measured by release of lactate dehydrogenase (LDH) into the supernatant (Promega kit #) or by flow cytometry analysis.
  • LDH lactate dehydrogenase
  • RNA samples were electroporated with TALETFs (either single or multiplexed) in triplicate. Cells were harvested at 2 days post-transfection for RNA extraction and parallel analysis of expression using flow cytometry. Total RNA was extracted from these samples and from control T cells electroporated without mRNA using Qiagen miRNeasy extraction kit. Total RNA samples were constructed into libraries using Illumina's TruSeq Stranded Total RNA Plurality of nucleic acids Prep Gold kit. Libraries were then sequenced using Illumina's Hiseq 4000 platform with 2x76bp read length to a depth of 25-50 million reads per sample. Reads were aligned using STAR paired alignment (RNA-STAR 2.3.1), mapped to the GRCh38 human genome assembly, and differential gene expression analysis was performed using edgeR.
  • TALETFs either single or multiplexed
  • Functional domains were added by Infusion cloning (Takara Bio; catalog # 638909) onto the C-terminal end of the TAF.
  • Functional domain constructs contained a 15 amino acid linker domain (GGGGGMDAKSFTAWS) (SEQ ID NO: 109) and either an epigenetic-functional domain (e.g. - KRAB) or heterodimer protein (e.g. - 9' of the 9:9' pair).
  • Table 9 TL11107 PDCD1 PROMOTER - GAGCGGAAGGGAAACTGTC 357 ⁇ 50%
  • FIG. 1A illustrates the locations in the PDCD1 gene to which the DBDs of the indicated recombinant polypeptides were designed to bind.
  • Recombinant polypeptides that repressed expression of PDCD1 in at least 50% of cells treated with the recombinant polypeptides are indicated by clear arrows .
  • Recombinant polypeptides that repressed expression of PDCD1 in less than 50% of the cells treated with the recombinant polypeptides are indicated by solid arrows .
  • the orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand. Arrows having the orientation are designed to bind to the sense strand.
  • the analysis of repression by the disclosed recombinant polypeptides that are designed bind to these sequences identified certain regions that provide repression of PDCD-1 expression in at least 50% of the cells expressing these recombinant polypeptides. These regions are depicted in FIGS. 1B-1C and include regions 1-4. In regions 1, 2, 3, the anti-sense strand of the PDCD-1 gene was successfully targeted to significantly repress expression of PD-1. In region 4, the sense strand was identified as the region of the PDCD-1 gene that can be successfully target for repression. In addition, certain sequences in the sense strand in region 1 were also identified a region that can be targeted for repression.
  • Tables 1-4 illustrate the sequences present in each of Regions 1-4 that can be targeted for repression.
  • FIG. 2 shows the fold change in number of PD-1 expressing cells 2 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 3 shows effect of dose of mRNA encoding the recombinant polypeptide, pAF040 and pAF043, on the percent of CD3+ T cells expressing PD-1, 3 days after transfection.
  • CD3+ T cells were activated with beads and electroporated 48 hours post activation according to standard process with varying concentration of TAFE-TF mRNA from 3ng to 2ug per transfection (250,000 T cells per condition). PD-1 expression by flow was measured on day 3 post transfection.
  • FIG. 4 shows the fold change in number of PD-1 -positive cells at the indicated number of days post-transfection of mRNA encoding the indicated recombinant polypeptide relative to control, which are cells electroporated without repressor mRNA.
  • PD-1 repression is durable for about 2 weeks in culture and after freeze-thaw.
  • FIGS. 5 A and 5B show that PD-1 repression with pAF043 in anti-CD 19 CAR-T cells is sustained after in vivo expansion and clearance of CD19-positive NAFM-6 B-AFF tumor model in NSG mice.
  • targeting the sequence GGCCAGGGCGCCTGT (SEQ ID NO: 36) by TAFE-TF TL11084 also significantly suppressed PD-1 expression.
  • FIG. 6 illustrates the locations in the TIM3 gene at which the DBDs of the indicated recombinant polypeptides bind.
  • Recombinant polypeptides that repressed expression of TIM3 in at least 50% of the cells are indicated by unfilled arrows .
  • Recombinant polypeptides that repressed expression of TIM3 in less than 50% of the cells are indicated by filled arrows .
  • the orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand.
  • Arrows having the orientation are designed to bind to the sense strand.
  • FIG. 7 shows the fold change in number of cells expressing TIM3 at 2 days, 5 days, 8 days, or 14 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 8 shows the fold change in number of cells expressing TIM3 at 3 days or 6 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 9 illustrates the locations in the CTLA4 gene at which the DBDs of the indicated recombinant polypeptides bind.
  • Recombinant polypeptides that repressed expression of CTLA4 in at least 50% of the cells are indicated by unfdled arrows .
  • Recombinant polypeptides that repressed expression of CTLA4 in less than 50% of the cells are indicated by filled arrows .
  • the orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand. Arrows having the orientation are designed to bind to the sense strand.
  • FIG. 10 shows the fold change in number of cells expressing CTLA4 at 3 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • EXAMPLE 4 Identification of TALE-TFs for LAG3 repression
  • This example illustrates identification of TALE-TFs that significantly repress LAG3 expression.
  • Figure 11 provides a pictorial map of all of the regions in the LAG3 gene that were tested for identifying TAFE-TFs that significantly repress LAG3 expression. The results are provided in Table 12 below:
  • FIG. 11 illustrates the locations in the LAG3 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of LAG3 in at least
  • FIG. 12 shows the fold change in number of cells expressing LAG3 at 2 days, 7 days, or 12 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIG. 13 shows the fold change in number of cells expressing LAG3 at 2 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
  • FIGS. 14A and 14B show multiplexing of recombinant polypeptides to simultaneously suppress expression of PD-1, LAG3, and TIM3 is a single cell.
  • FIGS. 15A-15C illustrates specificity of the recombinant polypeptides as indicated by lack of significant off-target effect as measured by RNA-seq.
  • Anti-CD19 CAR-T cells were treated with epiTFs against PD-1, LAG3, and TIM3 and then used against a B-Cell Acute Lymphoblastic Leukemia (B-ALL) xenograft model in Non-obese Diabetic, NOD Scid Gamma (NSG) mice.
  • B-ALL B-Cell Acute Lymphoblastic Leukemia
  • CAR-T cells were manufactured using lentivirus delivery of a 3rd generation anti-CD 19 CAR containing FMC63 scFv, CD28 and 4-1BB co-stimulatory domains, and a truncated EGFRtag (Lenti- EF1a-CD19-EGFRt-3rd-CAR Vector, Creative Biolabs).
  • Primary human T cells were activated with Dynabeads as previously described and transfected by electroporation with repressor mRNA at 48 hours post activation.
  • Transfected cells along with no-mRNA transfected controls were allowed to recover for 24 hours after electroporation and then transduced with lentivirus encoding the CAR on RetroNectin (Takara Bio) according to manufacturer's protocol at an MOI of 5 and in the absence of serum.
  • RetroNectin RetroNectin
  • CAR-T cells were allowed to expand in media with IL-2 until day 11 post activation when they were washed with PBS and administered to mice.
  • CAR-T cells Prior to using in animals, CAR-T cells were analyzed by flow cytometry for CAR expression (via EGFR staining) and expression of immune checkpoint genes (PD-1, LAG3, and TIM3).
  • mice Animal experiments were conducted at the Fred Hutchinson Cancer Center, Comparative Medicine department (Seattle, WA) according to an approved IACCUC protocol.
  • Female NSG mice aged 6-8 weeks were implanted intravenously with 5x10 5 NAFM-6-luc-GFP tumor cells (human B-AFF cancer cells expressing CD 19) and tumors were measured by total bioluminescent flux using a Xenogen Imaging System (Perkin Elmer).
  • Each experimental arm contained 5 mice.
  • At 4 days post tumor implantation mice were imaged and randomized into treatment arms based on baseline tumor burden.
  • mice On day 5 post implantation mice were dosed intravenously with 250,000 anti-CD19 CAR-T cells either treated or untreated with repressor mRNA.
  • Peripheral blood was collected via retroorbital bleeding at weekly intervals into EDTA-coated tubes at room temperature.
  • Red blood cell lysis was performed using (1X RBC Lysis Buffer, eBiosciences Cat. # 333-57) according to manufacturer's protocol.
  • Flow cytometry was performed as previously described. At 3 weeks post initial dosing mice were re-challenged with 5x10 5 NAFM-6-luc-GFP tumor cells to test for persistence and activity of circulating CAR-T cells in the blood.
  • FIG. 19 shows a schematic of an anti-CD 19 CAR-T cell in which expression of PD1, TIM3, and LAG3 has been repressed using the engineered polypeptides (pAL043+TL8188+TL8222) described herein.
  • FIG. 20 shows flow cytometry data confirming repression of PD1, TIM3, and LAG3 expression in the multiplex-treated CAR-T cells.
  • FIG. 21 provides an overview of in vivo leukemia xenograft model and treatment using indicated CAR-T cells.
  • FIG. 22 demonstrates that multiplexed repression of immune checkpoint genes is sustained in vivo. Flow cytometry showed persistent repression of immune checkpoint genes at 1 week post dosing CAR-Ts into mice.
  • FIG. 23 demonstrates that multiplexed repression of immune checkpoint genes enhances CAR-Ts ability to resist tumor re-challenge. Tumor burden as measured by total flux (bioluminescence) showed all mice were initially “cured” of leukemia in all treatment arms, but upon re-challenge with leukemia cells only the mice treated with CAR-Ts in which all 3 immune checkpoint genes were repressed were able to completely resist tumor formation. This indicates superior persistence and resistance to exhaustion.
  • FIG. 24 shows expansion of CAR-Ts in the mouse blood.
  • Flow cytometry data showed expansion of CAR-T cells in the mouse blood (measured as human CD3+ T cells). After the re-challenge the multiplex-treated T cells expanded the best, in line with their enhanced proliferative capacity and resistance to exhaustion.
  • FIG. 16 shows characterization of repression of TIM3 expression by the listed candidate transcriptional repressors.
  • FIG. 17 shows characterization of repression of LAG3, TIM3, or PD-1 expression by the listed candidate transcriptional repressors.
  • FIG. 18 shows characterization of repression of TIM3 expression by the listed candidate transcriptional repressors.
  • CTBP1 [0370] CTBP1:
  • ZNF45 [0381] MTKSKEAVTFKDVAVVFSEEEFQFFDFAQRKFYRDVMFENFRNVVSVGHQSTPDGFPQ LEREEKLWMMKMATQRDNSSGAKNLKEMETLQEVGLRYLPHEELFCSQIWQQITRELIKYQDS VVNIQRTGCQLEKRDDLHYKDEGFSNQSSHLQVHRVHTGEKP (SEQ ID NO: 89)
  • ARQKLLGRSW SVPVIRHLFAPLKDYFACE SEQ ID NO:98
  • SSTFs synthetic split TFs
  • Fig. 26 Large-scale analysis of functional domains enabled by split encoding of DNA targeting and functional activities. Top panel. The DNA binding domain of the TIM3 repressor TL8188 was selected to screen additional functional domains.
  • TL8188-DBD was fused to heterodimer 37A, and a plurality of nucleic acids of functional domains was fused to heterodimer 37B. Both constructs were transiently expressed in primary human T cells by RNA electroporation, and fraction of cells with TIM3 repressed (%TIM3 negative cells in TL8188-treated cells relative to no RNA control) was evaluated periodically for 26 days by cell surface antibody staining and flow cytometry. Cells with greater fluorescence intensity than unstained control were considered TIM3+. Middle panel. Domains containing KRAB showed more durable repression, or relatively slow kinetics of decay, for several different KRAB domains. Bottom panel. Domains from methyl-DNA binding proteins showed less durable repression, or relatively fast kinetics of decay.
  • TIM3 expression was assayed using flow cytometry and plotted as %TIM3+cells at Day 2 post transfection with an mRNA encoding TIM3 targeting DBD (from TL8188) fused to one member of a heterodimer pair and an mRNA encoding another member of the heterodimer pair fused to a KRAB domain.
  • Cognate pairs 13A,13B; 37A,37B; DHD37-BBB-A, DHD37-BBB-B; DHD150A, DHD150B; DHD154A, DHD154B mediate dimerization and repression.
  • Non-cognate pairs 13,37; 13,DHD150; 37, DHD37-BBB; and 37, DHD150 also mediated dimerization and repression. See Fig. 27.
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor domain, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C- terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence:
  • DBD DNA binding domain
  • RUs repeat units
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV
  • HD high definition, NG, NH, HD, NG, HD, and HD.
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C- terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence:
  • DBD DNA binding domain
  • RUs repeat units
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1- 11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e)
  • the DBD comprises at least thirteen RUs, wherein X 12 X 13 in the RUs from N-terminus to C-terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, and NH.
  • the DBD comprises at least eighteen RUs, wherein X 12 X 13 in the RUs from N-terminus to C-terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, NG, HD, NG, NH, HD, HD, HD, NI, and HD.
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence:
  • DBD DNA binding domain
  • RUs repeat units
  • each of the RU comprises the sequence X 1- 11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence:
  • DBD DNA binding domain
  • RUs repeat units
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1- 11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GCCGCCTTCTCCACTGCTCAGGCGGAGGT (SEQ ID NO:31), wherein each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
  • a recombinant polypeptide comprising : a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GGCCAGGGCGCCTGT (SEQ ID NO:36);
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1- 11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of th
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within the sequence:
  • DBD DNA binding domain
  • RUs repeat units
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C);
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within the sequence:
  • DBD DNA binding domain
  • RUs repeat units
  • each of the repeat unit comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of TIM3 gene, wherein the nucleic acid sequence is: TACACACAT (SEQ ID NO:54), wherein each of the repeat unit comprises the sequence X 1- 11 X 12 X 13 X 14-33, 34, o ( r S 35 EQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from:(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, KI
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the LAG3 gene, wherein the nucleic acid sequence is present within the sequence:
  • DBD DNA binding domain
  • RUs repeat units
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from:
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of LAG3 gene, wherein the nucleic acid sequence is: TGCTCTGTCTGC (SEQ ID NO:72), wherein each of the repeat unit comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X 12 X 13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI
  • a recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the CTLA4 gene, wherein the nucleic acid sequence is:
  • each of the RU comprises the sequence X 1-11 X 12 X 13 X 14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X 14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
  • X 12 X 13 is selected from:
  • the DBD comprises a C-cap region comprising an amino acid sequence at least 80% identical to the amino acid sequence set forth in SEQ ID NO: 452, wherein the recombinant polypeptide comprises from N-terminus to C-terminus: the N-cap region, the plurality of RUs, and the C-cap region.
  • X 1-11 is a chain of 11 contiguous amino acids
  • X 14-20 or 21 or 22 is a chain of 7, 8 or 9 contiguous amino acids
  • X 12 X 13 is selected from:(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e)
  • nucleic acid of clause 99 wherein the nucleic acid is operably linked to a promoter sequence that confers expression of the polypeptide.
  • nucleic acid of clause 99 or 100 wherein the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell.
  • RNA ribonucleic acid
  • a vector comprising the nucleic acid of any of clauses 99-103.
  • a host cell comprising the nucleic acid of any of clauses 99-103 or the vector of clause 104 or 105.
  • a pharmaceutical polypeptide comprising the polypeptide of any of clauses 1-98 and a pharmaceutically acceptable excipient.
  • a pharmaceutical polypeptide comprising the nucleic acid of any of clauses 99-103 or the vector of clause 104 or 105 and a pharmaceutically acceptable excipient.
  • a method of suppressing expression of PDCD-1 gene in a cell comprising: introducing into the cell the recombinant polypeptide of any one of clauses 1-48, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the PDCD-1 gene and the transcriptional repressor domain suppresses expression of the PDCD-1 gene.
  • a method of suppressing expression of TIM3 gene in a cell comprising: introducing into the cell the recombinant polypeptide of any one of clauses 49-64, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the TIM3 gene and the transcriptional repressor domain suppresses expression of the TIM3 gene.
  • a method of suppressing expression of LAG3 gene in a cell comprising: introducing into the cell the recombinant polypeptide of any one of clauses 65-82, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the LAG3 gene and the transcriptional repressor domain suppresses expression of the LAG3 gene.
  • a method of suppressing expression of CTLA4 gene in a cell comprising: introducing into the cell the recombinant polypeptide of any one of clause 83, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the CTLA4 gene and the transcriptional repressor domain suppresses expression of the CTLA4 gene.
  • nucleic acid is a deoxyribonucleic acid (DNA).
  • nucleic acid is a ribonucleic acid (RNA).
  • the transcriptional repressor domain comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, or MeCP2.
  • the transcriptional repressor domain comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, or MeCP2.
  • a recombinant polypeptide comprising a DNA binding domain and a transcriptional repressor domain, wherein the DNA binding domain and the transcriptional repressor domain are heterologous, wherein the transcriptional repressor domain comprises an amino acid sequence at least 80% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
  • the target nucleic acid sequence is in a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HAVCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the BCL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
  • nucleic acid of clause 135 or 136 wherein the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell.
  • RNA ribonucleic acid
  • a vector comprising the nucleic acid of any of clauses 135-138.
  • a host cell comprising the nucleic acid of any of clauses 135-139 or the vector of clause 140 or 141.
  • a host cell comprising the polypeptide of any of clauses 128-134.
  • a pharmaceutical composition comprising the polypeptide of any of clauses 128-134 and a pharmaceutically acceptable excipient.
  • a pharmaceutical composition comprising the nucleic acid of any of clauses 135-139 or the vector of clause 140 or 141 and a pharmaceutically acceptable excipient.
  • a method of suppressing expression of an endogenous gene in a cell comprising: introducing into the cell the recombinant polypeptide of any one of clauses 128-134, wherein the DBD of the polypeptide binds to a target nucleic acid sequence present in the endogenous gene and the heterologous transcriptional repressor domain suppresses expression of the endogenous gene.
  • nucleic acid is a deoxyribonucleic acid (DNA).
  • nucleic acid is a ribonucleic acid (RNA).
  • the gene is a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HAVCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
  • polypeptides that dimerize via direct dimerization comprising:
  • A a DNA binding domain (DBD) fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair, or
  • B a DNA binding domain (DBD) fused to a second member of a heterodimer pair and a functional domain fused to a first member of the heterodimer pair, wherein the first and second members of the heterodimer pair bind to each other thereby directly dimerizing the DBD and the functional domain, wherein the heterodimer pair is selected from one of the following heterodimer pairs:
  • DBD DNA binding domain
  • DHD37-BBB-A DHD37-BBB-B;
  • DHD154-A DHD-154B
  • polypeptides that dimerize indirectly via a bridging construct comprising:
  • A a DNA binding domain (DBD) fused to a first member of a first heterodimer pair; a bridging construct comprising a second member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or
  • B a DNA binding domain (DBD) fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or
  • C a DNA binding domain (DBD) fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a second member of a second heterodimer pair; and a functional domain fused to a first member of the second heterodimer pair, wherein the DBD and the functional domain dimerize indirectly via the bridging construct, wherein the first and second heterodimer pairs are different and are selected from the following heterodimer pairs:
  • DHD37-BBB-A DHD37-BBB-B;
  • DHD154-A DHD-154B
  • nuclease is a cleavage domain or a half-cleavage domain.
  • the transcriptional repressor comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, MeCP2, or a transcriptional repressor provided in clauses 128-134.
  • the target nucleic acid sequence is within a PDCD 1 gene, a CTLA4 gene, a LAG
  • a host cell comprising: (a) nucleic acids encoding the polypeptides as set forth in clause 162 (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in clause 162 (ii)(A), ( ⁇ )(B), or (ii)(C). 181.
  • a host cell comprising: (a) the polypeptides as set forth in clause 162 (i)(A) or (i)(B); or (b) the polypeptides as set forth in clause 162 (ii)(A), (ii)(B), or (ii)(C).
  • a kit comprising:
  • nucleic acids encoding the polypeptides as set forth in clause 162 (ii)(A), (ii)(B), or (ii)(C).
  • a kit comprising:
  • a kit comprising:
  • a pharmaceutical composition comprising:
  • nucleic acids encoding the polypeptides as set forth in clause 162 (ii)(A), (ii)(B), or (ii)(C).
  • a pharmaceutical composition comprising:
  • a pharmaceutical composition comprising:
  • a pharmaceutical composition comprising the DBD and a functional domain or a DNA binding domain, a functional domain and a bridging construct of clause 178 and a pharmaceutically acceptable excipient.
  • a pharmaceutical composition comprising the host cell of clause 180 or 181 and a pharmaceutically acceptable excipient.
  • a method for modulating expression from a target gene in a cell comprising:
  • DHD37-BBB-A DHD37-BBB-B;
  • DHD154-A DHD-154B
  • DBD DNA binding domain
  • a method of modulating expression of a target gene in a cell comprising:
  • a bridging construct comprising a second member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct
  • a bridging construct comprising a first member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct
  • a bridging construct comprising a second member of the first heterodimer pair fused to a second member of the second heterodimer pair or a nucleic acid encoding the bridging construct, wherein the DBD and the functional domain dimerize indirectly via the bridging construct, wherein binding of the DBD to a target nucleic acid sequence in a target gene in the cell results in in modulation of expression of the target gene via the functional domain dimerized to the DBD via the bridging construct, wherein the first and second heterodimer pairs are different and are selected from the following heterodimer pairs:
  • DHD37-BBB-A DHD37-BBB-B;
  • DHD154-A DHD-154B

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Immunology (AREA)
  • Zoology (AREA)
  • Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Toxicology (AREA)
  • Wood Science & Technology (AREA)
  • Cell Biology (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Plant Pathology (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Peptides Or Proteins (AREA)

Abstract

The present disclosure provides polypeptides, compositions thereof, and methods for suppressing expression of a target gene such as PDCD1, CTLA4, LAG3, or TIM-3. The polypeptides disclosed herein include a DNA binding domain (DBD) that binds to a sequence of the target gene and a transcriptional repressor domain that suppresses expression of the target gene. The transcriptional repressor domain may be a known transcriptional repressor or may be a novel transcriptional repressor disclosed herein. Also disclosed herein are novel transcriptional repressors that are conjugated to a heterologous DNA binding domain and mediate suppression of expression of a target gene bound by the DNA binding domain. Also disclosed herein are split systems for modulating gene expression where the DBD and the functional domain are provided as separated polypeptides and are assembled using dimerization of a heterodimer pair, where the DBD and the functional domain are each fused to a member of the heterodimer pair.

Description

COMPOSITIONS AND METHODS FOR MODULATION OF GENE EXPRESSION
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority benefit of U.S. Provisional Application No. 62/884,028, filed August 7, 2019, U.S. Provisional Application No. 62/898,434, filed September 10, 2019, and U.S. Provisional Application No. 62/937,011, filed November 18, 2019, the disclosures of which are incorporated herein by reference in their entirety.
INCORPORATION OF SEQUENCE LISTING
[0002] The sequence listing named “ALTI-727WO Seq Listing_ST25” which was created on August 4, 2020 and is 219KB in size, is hereby incorporated by reference in its entirety.
BACKGROUND
[0003] Modulating gene expression has been a strategy for enhancing the success of cancer and infectious disease therapies. In particular, cell therapies, such as CAR T cell therapies, can suffer from dampened immunogenicity by inhibition via an immune checkpoint inhibitor. Thus, there exists a need for agents that can modulate the expression of target genes, such as, immune checkpoint inhibitors. The present disclosure provides engineered polypeptides comprising DNA binding domains and repressor domains for repressing a target gene.
SUMMARY
[0004] The present disclosure provides polypeptides, compositions thereof, and methods for suppressing expression of a target gene such as PDCD1, CTLA4, LAG3, or TIM-3. The polypeptides disclosed herein include a DNA binding domain (DBD) that binds to a sequence of the target gene and a transcriptional repressor domain that suppresses expression of the target gene. The transcriptional repressor domain may be a known transcriptional repressor or may be a novel transcriptional repressor disclosed herein.
[0005] Also disclosed herein are sequences of novel transcriptional repressor domains that are conjugated to a heterologous DNA binding domain. As shown herein, these novel transcriptional repressor domains mediate suppression of expression of a target gene bound by the heterologous DNA binding domain.
[0006] Also disclosed herein are split systems for modulating gene expression where the DBD and the functional domain are provided as separated polypeptides and are assembled using dimerization of a heterodimer pair, where the DBD and the functional domain are each fused to a member of the heterodimer pair. BRIEF DESCRIPTION OF THE DRAWINGS [0007] FIGS. 1A-1C illustrate the locations in the PDCD1 gene to which the DBDs of the indicated recombinant polypeptides were designed to bind. Recombinant polypeptides that repressed expression of PDCD1 in at least 50% of cells treated with the recombinant polypeptides are indicated by clear arrows . Recombinant polypeptides that repressed expression of PDCD1 in less than 50% of the cells treated with the recombinant polypeptides are indicated by solid arrows . The orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand.
Arrows having the orientation are designed to bind to the sense strand.
[0008] FIG. 2 shows the fold change in number of PD-1 expressing cells 2 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
[0009] FIG. 3 shows effect of dose of mRNA encoding the recombinant polypeptide, pAL040 and pAL043, on the percent of CD3+ T cells expressing PD-1 3 days after transfection.
[0010] FIG. 4 shows the fold change in number of PD-1 -positive cells at the indicated number of days post-transfection of mRNA encoding the indicated recombinant polypeptide relative to control.
[0011] FIGS. 5 A and 5B show that PD-1 repression with pAL043 in anti-CD 19 CAR-T cells is sustained after in vivo expansion and clearance of CD 19-positive NALM-6 B-ALL tumor model in NOD SCID Gamma (NSG) mice.
[0012] FIG. 6 illustrates the locations in the TIM3 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of TIM3 in at least 50% of the cells are indicated by unfilled arrows . Recombinant polypeptides that repressed expression of TIM3 in less than 50% of the cells are indicated by filled arrows
[0013] FIG. 7 shows the fold change in number of cells expressing TIM3 at 2 days, 5 days, 8 days, or 14 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells. [0014] FIG. 8 shows the fold change in number of cells expressing TIM3 at 3 days or 6 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
[0015] FIG. 9 illustrates the locations in the CTLA4 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of CTLA4 in at least
50% of the cells are indicated by unfilled arrows . Recombinant polypeptides that repressed expression of CTLA4 in less than 50% of the cells are indicated by filled arrows
[0016] FIG. 10 shows the fold change in number of cells expressing CTLA4 at 3 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells. [0017] FIG. 11 illustrates the locations in the LAG3 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of LAG3 in at least 50% of the cells are indicated by unfdled arrows . Recombinant polypeptides that repressed expression of LAG3 in less than 50% of the cells are indicated by fdled arrows
[0018] FIG. 12 shows the fold change in number of cells expressing LAG3 at 2 days, 7 days, or 12 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
[0019] FIG. 13 shows the fold change in number of cells expressing LAG3 at 2 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
[0020] FIGS. 14A and 14B show multiplexing of recombinant polypeptides to simultaneously suppress expression of PD-1, LAG3, and TIM3 is a single cell.
[0021] FIGS. 15A-15C illustrates specificity of the recombinant polypeptides as indicated by lack of significant off-target effect as measured by RNA-seq.
[0022] FIG. 16 shows characterization of repression of TIM3 expression by the listed candidate transcriptional repressors.
[0023] FIG. 17 shows characterization of repression of LAG3, TIM3, or PD-1 expression by the listed candidate transcriptional repressors.
[0024] FIG. 18 shows characterization of repression of TIM3 expression by the listed candidate transcriptional repressors.
[0025] FIG. 19 shows a schematic of an anti-CD 19 CAR-T cell in which expression of PD1, TIM3, and LAG3 has been repressed using the engineered polypeptides (pAL043+TL8188+TL8222) described herein.
[0026] FIG. 20 shows flow cytometry data confirming repression of PD1, TIM3, and LAG3 expression in the multiplex-treated CAR-T cells.
[0027] FIG. 21 provides an overview of in vivo leukemia xenograft model and treatment using indicated CAR-T cells.
[0028] FIG. 22 demonstrates that multiplexed repression of immune checkpoint genes is sustained in vivo.
[0029] FIG. 23 demonstrates that multiplexed repression of immune checkpoint genes enhances CAR-Ts ability to resist tumor re-challenge.
[0030] FIG. 24 shows expansion of CAR-Ts in the mouse blood.
[0031] FIG. 25 TALE-KRAB split system.
[0032] FIG. 26 Large-scale analysis of functional domains enabled by split encoding of DNA targeting and functional activities. [0033] FIG. 27 Repression of TIM3 expression using TALE-KRAB split system. [0034] FIGS. 28 and 29 Control of gene expression using CIPHR logic gates.
DETAILED DESCRIPTION
[0035] The present disclosure provides recombinant polypeptides, compositions and methods for suppressing target gene expression for therapeutic purposes. In particular, described herein are engineered polypeptides comprising a DNA-binding domain (DBD) and a transcription repressor. The DBD mediates binding of the disclosed polypeptides to a sequence in the target gene. The target gene may be PDCD1, LAG3, TIM3, or CTLA4.
[0036] Certain regions in these target genes have been identified that can be targeted for repression of expression of these gene when these regions are bound by the polypeptides disclosed herein. These regions may be located in the target gene within an expression control region, such as, a coding region, a non-coding region, such as, a regulatory region (e.g., promoter region) or an intron.
[0037] These regions as well as the polypeptides that bind to these regions are provided herein.
[0038] Also disclosed herein are novel transcriptional repressors that are conjugated to a heterologous DNA binding domain and mediate suppression of expression of a target gene bound by the DNA binding domain.
[0039] Also disclosed herein are split systems for modulating gene expression where the DBD and the functional domain are provided as separated polypeptides and are assembled using dimerization of a heterodimer pair, where the DBD and the functional domain are each fused to a member of the heterodimer pair.
DEFINITIONS
[0040] As used herein, the term “derived” in the context of a polypeptide refers to a polypeptide that has a sequence that is based on that of a protein from a particular source (e.g., Xanthomonas or Legionella). A polypeptide derived from a protein from a particular source may be a variant of the protein from the particular source. For example, a polypeptide derived from a protein from a particular source may have a sequence that is modified with respect to the protein's sequence from which it is derived. A polypeptide derived from a protein from a particular source shares at least 30% sequence identity with, at least 40% sequence identity with, at least 50% sequence identity with, at least 60% sequence identity with, at least 70% sequence identity with, at least 80% sequence identity with, or at least 90% sequence identity with the protein from which it is derived. [0041] The term “modular” as used herein in the context of a DNA binding domain, e.g., a modular animal pathogen derived nucleic acid binding domain (MAP-NBD) indicates that the plurality of repeat units present in the DBD can be rearranged and/or replaced with other repeat units and can be arranged in an order such that the DBD binds to the target nucleic acid. For example, any repeat unit in a modular nucleic acid binding domain can be switched with a different repeat unit. In some aspects, modularity of the DNA binding domains disclosed herein allows for switching the target nucleic acid base for a particular repeat unit by simply switching it out for another repeat unit. In some embodiments, modularity of the DNA binding domains disclosed herein allows for swapping out a particular repeat unit for another repeat unit to increase the affinity of the repeat unit for a particular target nucleic acid. Overall, the modular nature of the DNA binding domains disclosed herein enables the development of genome editing complexes that can precisely target any nucleic acid sequence of interest.
[0042] The terms “polypeptide,” “peptide,” and “protein”, used interchangeably herein, refer to a polymeric form of amino acids of any length, which can include genetically coded and non-genetically coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified polypeptide backbones. The terms include fusion proteins, including, but not limited to, fusion proteins with a heterologous amino acid sequence, fusion proteins with heterologous and homologous leader sequences, with or without N-terminus methionine residues; immunologically tagged proteins; and the like. In specific aspects, the terms refer to a polymeric form of amino acids of any length which include genetically coded amino acids. In particular aspects, the terms refer to a polymeric form of amino acids of any length which include genetically coded amino acids fused to a heterologous amino acid sequence.
[0043] The term “heterologous” refers to two components that are defined by structures derived from different sources. For example, in the context of a polypeptide, a “heterologous” polypeptide may include operably linked amino acid sequences that are derived from different polypeptides (e.g., a DBD and a functional domain, e.g., a transcriptional repressor, derived from different sources). Similarly, in the context of a polynucleotide encoding a chimeric polypeptide, a “heterologous” polynucleotide may include operably linked nucleic acid sequences that can be derived from different genes. Other exemplary “heterologous” nucleic acids include expression constructs in which a nucleic acid comprising a coding sequence is operably linked to a regulatory element (e.g., a promoter) that is from a genetic origin different from that of the coding sequence (e.g., to provide for expression in a host cell of interest, which may be of different genetic origin than the promoter, the coding sequence or both). In the context of recombinant cells, “heterologous” can refer to the presence of a nucleic acid (or gene product, such as a polypeptide) that is of a different genetic origin than the host cell in which it is present. [0044] The term “operably linked” refers to linkage between molecules to provide a desired function.
For example, “operably linked” in the context of nucleic acids refers to a functional linkage between nucleic acid sequences. By way of example, a nucleic acid expression control sequence (such as a promoter, signal sequence, or array of transcription factor binding sites) may be operably linked to a second polynucleotide, wherein the expression control sequence affects transcription and/or translation of the second polynucleotide. In the context of a polypeptide, “operably linked” refers to a functional linkage between amino acid sequences (e.g., different domains) to provide for a described activity of the polypeptide.
[0045] A “target nucleic acid,” “target sequence,” or “target site” is a nucleic acid sequence that defines a portion of a nucleic acid to which a binding molecule, such as, the DBD disclosed herein will bind. The target nucleic acid may be present in an isolated form or inside a cell. A target nucleic acid may be present in a region of interest. A “region of interest” may be any region of cellular chromatin, such as, for example, a gene or a non-coding sequence within or adjacent to a gene, in which it is desirable to bind an exogenous molecule. A region of interest can be present in a chromosome, an episome, an organellar genome (e.g., mitochondrial, chloroplast), or an infecting viral genome, for example. A region of interest can be within the coding region of a gene, within transcribed non-coding regions such as, for example, promoter sequences, leader sequences, trailer sequences or introns, or within non-transcribed regions, either upstream or downstream of the coding region. A region of interest can be as small as a five nucleotide pair or up to 200 nucleotide pairs in length, or any integral value of nucleotide pairs.
[0046] An “exogenous” molecule is a molecule that is not normally present in a cell but can be introduced into a cell by one or more genetic, biochemical or other methods. An exogenous nucleic acid can be present in an infecting viral genome, a plasmid or episome introduced into a cell. Methods for the introduction of exogenous molecules into cells are known to those of skill in the art and include, but are not limited to, lipid-mediated transfer (i.e., liposomes, including neutral and cationic lipids), electroporation, direct injection, cell fusion, particle bombardment, calcium phosphate co-precipitation, DEAE-dextran-mediated transfer and viral vector-mediated transfer.
[0047] By contrast, an “endogenous” molecule is one that is normally present in a particular cell at a particular developmental stage under particular environmental conditions. For example, an endogenous nucleic acid can comprise a chromosome, the genome of a mitochondrion, chloroplast or other organelle, or a naturally-occurring episomal nucleic acid. Additional endogenous molecules can include proteins, for example, transcription factors and enzymes.
[0048] A “gene,” for the purposes of the present disclosure, includes a DNA region encoding a gene product, as well as all DNA regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control region.
[0049] “Gene expression” refers to the conversion of the information, contained in a gene, into a gene product. A gene product can be the direct transcriptional product of a gene (e.g., mRNA, tRNA, rRNA, antisense RNA, ribozyme, structural RNA, shRNA, RNAi, miRNA or any other type of RNA) or a protein produced by translation of a mRNA. Gene products also include RNAs which are modified, by processes such as capping, polyadenylation, methylation, and editing, and proteins modified by, for example, methylation, acetylation, phosphorylation, ubiquitination, ADP-ribosylation, myristylation, and glycosylation.
[0050] The terms “conjugating,” “conjugated,” and “conjugation” refer to an association of two entities, for example, of two molecules such as two proteins, two domains (e.g., a binding domain and a transcription repressor domain), or a protein and an agent, e.g., a protein binding domain and a small molecule. The association can be, for example, via a direct or indirect (e.g., via a linker) covalent linkage or via non-covalent interactions. In some embodiments, the association is covalent. In some embodiments, two molecules are conjugated via a linker connecting both molecules. For example, in some embodiments where two proteins are conjugated to each other, e.g., a binding domain and a cleavage domain of an engineered nuclease, to form a protein fusion, the two proteins may be conjugated via a polypeptide linker, e.g., an amino acid sequence connecting the C-terminus of one protein to the N- terminus of the other protein. Such conjugated proteins may be expressed as a fusion protein.
[0051] The term “effective amount,” as used herein, refers to an amount of a biologically active agent that is sufficient to elicit a desired biological response. For example, in some aspects, an effective amount of a polypeptide comprising a transcriptional repressor may refer to the amount of the polypeptide that is sufficient to induce repression of expression from a gene specifically bound by the polypeptide. As will be appreciated by the skilled artisan, the effective amount of an agent, e.g., a recombinant polynucleotide, may vary depending on various factors as, for example, on the desired biological response, the specific allele, genome, target site, cell, or tissue being targeted, and the agent being used.
[0052] The term “strand” as used herein refers to a nucleic acid made up of nucleotides covalently linked together by covalent bonds, e.g., phosphodiester bonds. In a cell, DNA usually exists in a double-stranded form, and as such, has two complementary strands of nucleic acid referred to herein as the “top” and “bottom” strands or the “Watson” and “Crick” strands. Watson strand refers to 5' to 3' top strand (5'
3'), whereas Crick strand refers to 3' to 5' bottom strand (3' 5'). The assignment of a strand as being a top or bottom strand is arbitrary and does not imply any particular orientation, function or structure. In certain cases, complementary strands of a chromosomal DNA may be interchangeably referred to as “top” and “bottom” strands, “plus” and “minus” strands, the “first” and “second” strands, the “coding” and “noncoding” strands, the “Watson” and “Crick” strands, or the “sense” and “antisense” strands. The nucleotide sequences of the coding strand of several mammalian chromosomal regions (e.g., BACs, assemblies, chromosomes, etc.) are known, and may be found in NCBI’s GenBank database, for example. [0053] As used herein, the term, “on-target” repression refers repression of expression of a gene containing the genomic sequence that is the target of the recombinant polypeptide comprising the DBD and the transcription repressor. The DBD determines the specificity of the polypeptide for the binding the target site. An on-target repression site refers to a nucleic acid sequence that includes the DNA sequence specifically bound by the DBD of the recombinant polypeptide.
[0054] As used herein, the term, “off-target” repression refers to repression of expression of a gene containing the genomic sequence that is not the target of the recombinant polypeptide comprising the DBD and the transcription repressor but is repressed due to non-specific binding of the DBD of the recombinant polypeptide.
[0055] As used herein, the term “domain” or “protein domain” refers to a part of a protein sequence that may exist and function independently of the rest of the protein chain. In the context of the recombinant polypeptides disclosed herein, these recombinant polypeptides function as transcriptional repressors by virtue of the DBD that mediates binding to a target gene and a repressor domain that suppresses target gene expression upon binding of the polypeptide to the target gene. The recombinant polypeptides disclosed herein may also be referred to as transcriptional repressors.
[0056] The sequences provided herein may be specified to be at least 30%, 40%, 50%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, or a 100% identical to another sequence provided herein. Percent identity between a pair of sequences may be calculated by multiplying the number of matches in the pair by 100 and dividing by the length of the aligned region, including gaps. Identity scoring only counts perfect matches and does not consider the degree of similarity of amino acids to one another. Only internal gaps are included in the length, not gaps at the sequence ends.
[0057] Percent Identity = (Matches x 100)/Length of aligned region (with gaps)
[0058] The phrase “conservative amino acid substitution” refers to substitution of amino acid residues within the following groups: 1) L, I, M, V, F; 2) R, K; 3) F, Y, H, W, R; 4) G, A, T, S; 5) Q, N; and 6) D, E. Conservative amino acid substitutions may preserve the activity of the protein by replacing an amino acid(s) in the protein with an amino acid with a side chain of similar acidity, basicity, charge, polarity, or size of the side chain. Guidance for substitutions, insertions, or deletions may be based on alignments of amino acid sequences of proteins from different species or from a consensus sequence based on a plurality of proteins having the same or similar function. [0059] The terms “patient” or “subject” are used interchangeably to refer to a human or a non-human animal (e.g., a mammal).
[0060] The terms “treat”, “treating”, treatment” and the like refer to a course of action (such as administering a polypeptide comprising a DBD fused to a heterologous transcription repressor domain or a nucleic acid encoding the polypeptide) initiated after a disease, disorder or condition, or a symptom thereof, has been diagnosed, observed, and the like so as to eliminate, reduce, suppress, mitigate, or ameliorate, either temporarily or permanently, at least one of the underlying causes of a disease, disorder, or condition afflicting a subject, or at least one of the symptoms associated with a disease, disorder, condition afflicting a subject.
[0061] The terms “prevent”, “preventing”, “prevention” and the like refer to a course of action (such as administering a polypeptide comprising a DBD fused to a heterologous functional domain or a nucleic acid encoding the polypeptide) initiated in a manner (e.g., prior to the onset of a disease, disorder, condition or symptom thereof) so as to prevent, suppress, inhibit or reduce, either temporarily or permanently, a subject's risk of developing a disease, disorder, condition or the like (as determined by, for example, the absence of clinical symptoms) or delaying the onset thereof, generally in the context of a subject predisposed to having a particular disease, disorder or condition. In certain instances, the terms also refer to slowing the progression of the disease, disorder or condition or inhibiting progression thereof to a harmful or otherwise undesired state.
[0062] The phrase “therapeutically effective amount” refers to the administration of an agent to a subject, either alone or as a part of a pharmaceutical composition or as a companion therapy and either in a single dose or as part of a series of doses, in an amount that is capable of having any detectable, positive effect on any symptom, aspect, or characteristics of a disease, disorder or condition when administered to a patient. The therapeutically effective amount can be ascertained by measuring relevant physiological effects.
Recombinant Polypeptides
[0063] As noted above, the recombinant polypeptides include a DBD that mediates binding to a sequence in a target gene and a heterologous transcriptional repressor. The DBD includes a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the target gene, where binding of the recombinant polypeptide to the nucleic acid sequence results in decreased expression of the target gene.
[0064] In certain aspects, a recombinant polypeptide disclosed herein may include from N- to C- terminus: a N-cap region, a DBD comprising a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the target gene, a C-cap region, an optional linker, and a transcription repressor domain. In certain aspects, the transcriptional repressor domain may be at the N- terminus of the recombinant polypeptide instead of the C-terminus.
[0065] The RUs may have the sequence (X 1-11X12X13X14-33, 34, or 35)z (SEQ ID NO: 453), where X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent, and wherein z=7-40, 7-35, or 7-25. [0066] Any suitable RU such as those based upon the RUs from Xanthomonas transcription activator- like effector (TALE) systems, Ralstonia solanacearum (modular Ralstonia nucleic acid binding domain; RNBD), or an animal pathogen (e.g., Legionella quateirensis, Legionella maceachernii, Burkholderia, Paraburkholderia, or Francisella) (modular animal pathogen nucleic acid binding domain; MAP-NBD) may be used for binding to the regions of the target genes provided herein. The arrangement of the RUs in the DBD may be based upon the sequence identified in the target gene to which binding of the recombinant polypeptide results in decreased expression of the target gene. These sequences identified in PDCD-1 gene, TIM3 gene, CTLA4 gene, and LAG3 gene, and the corresponding DBDs are described in detail below.
[0067] PDCD-1 (programmed cell death 1) gene is also known as PD-1 gene and encodes a cell surface membrane protein of the immunoglobulin superfamily, which is also referred to as PDCD-1 or PD-1. PD- 1 binds to the ligands PD-L1 and PD-L2. The PD-l/PD-1 ligands pathway plays a role in immunosuppression. Recent studies have shown that PD-L1 and PD-L2 are widely expressed on various cancer cells (Keir ME, et al. Annu Rev Immunol. 2008; 26 (677-704)). Expression of PD-ligands prevents cancer cells from being killed by T cells and promotes cancer progression. Targeting the PD-1 pathway has been recognized as an effective immunotherapy for different cancers (Ostrand-Rosenberg S, et al. J Immunol. 2014; 193(8): 3835- 41).
[0068] TIM3 (T-Cell Immunoglobulin Mucin Receptor 3) gene is also referred to as Hepatitis A Virus Cellular Receptor 2 (HAVCR2) and encodes a cell surface membrane protein of the immunoglobulin superfamily of the same name.
[0069] CTLA4 gene (Cytotoxic T-Lymphocyte Associated Protein 4) encodes an immunoglobulin superfamily protein of the same name which transmits an inhibitory signal to T cells.
[0070] LAG3 (Lymphocyte-Activation Gene 3) gene encodes the Lymphocyte Activating 3 protein which is also known as LAG3 protein. [0071] CTLA-4, PD-1, LAG-3, and TIM3 are known immune checkpoint proteins. The pathways involving LAG3 and TIM3 are recognized in the art to constitute immune checkpoint pathways similar to the CTLA-4 and PD-1 dependent pathways (see e.g. Pardoll, 2012. Nature Rev Cancer 12:252-264; Mellman et al. 2011. Nature 480:480-489).
[0072] Unless stated otherwise, all nucleic acid sequences are written from 5’ to 3’ and all polypeptide sequences are from N-terminus to C-terminus. As indicated herein, a DBD may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene. The plurality of RUs may be a number of repeat units sufficient to bind to a target sequence, which number may range from 7 to 40. In certain aspects, the number of RUs may range from 9 to 35. In certain aspects, the number of RUs may range from 12 to 30, 14 to 25, or 16 to 25.
[0073] In certain aspects, the recombinant polypeptides disclosed herein all reduce the expression of the target gene in at least 50% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the target gene. In certain aspects, the recombinant polypeptides disclosed herein all reduce the expression of the target gene in at least 80% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the target gene.
PDCD-1 Repressors
[0074] Provided herein are recombinant polypeptides that bind to sequences in the PDCD-1 gene that have been identified to be present in regions of the gene that when bound by the recombinant polypeptides comprising a transcriptional repressor domain lead to suppression of PD-1 expression from the PDCD-1 gene.
[0075] The sequences in the PDCD-1 gene that were tested to determine repression by a transcriptional repressor domain bound to the sequence are pictorially depicted in Figure 1A. The analysis of repression by the disclosed recombinant polypeptides that are designed to bind to these sequences identified certain regions that provide repression of PDCD-1 expression in at least 50% of the cells expressing these recombinant polypeptides. These regions are depicted in Figures 1B-1C and include regions 1-4. In regions 1, 2, 3, the anti-sense strand of the PDCD-1 gene was successfully targeted to significantly repress expression of PD-1. In region 4, the sense strand was identified as the region of the PDCD-1 gene that can be successfully target for repression. In addition, certain sequences in the sense strand in region 1 were also identified a region that can be successfully targeted for repression. [0076] Region 1:
[0077] Table 1 illustrates the identification of region 1 which includes sequences that can be targeted for repression. As can be seen from Table 1, the indicated recombinant polypeptides, that included RUs arranged from N-terminus to C-terminus to bind to the listed target sequence, repressed expression of PD- 1 by at least 80% as compared to a negative control. The location of these target sequences when aligned reveals a region (Region 1) in minus strand of the PDCD-1 gene that may be targeted for repressing PDCD-1 expression. The alignment of the target sequences also reveals the minimal sequences within Region 1 that can be targeted for binding by the DBD for repressing PDCD-1 expression.
Table 1: Region 1
[0078] Accordingly, in certain aspects, a recombinant polypeptide that suppresses expression of PD 1 receptor encoded by the PDCD 1 gene may include a DNA binding domain (DBD) and a transcriptional repressor domain. The DBD may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: TGGTGGGGCTGCTCCAGGCATGCAGATCCCACAGGCGCCCTGG (SEQ ID NO: 1). As explained in the Examples section of the application, this sequence corresponds to Region 1 in the PDCD1 gene.
[0079] The RUs may include the sequence (X 1-11X12X13X14-33, 34, or 35)z (SEQ ID NO: 453), where X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent, and wherein z=7-40, 7-35, or 7-25. [0080] In certain aspects, the RUs are ordered from N-terminus to the C-terminus to bind to the sequence: GGGGCTGCTCC (SEQ ID NO:2), wherein the first RU at the N-terminus binds to the G at the 5’ end of the sequence and the last RU at the C-terminus binds to the C at the 3’ end of the sequence. In certain aspects, the X12X13 in the RUs from N-terminus to C-terminus may be NH, NH, NH, NH, HD, NG, NH, HD, NG, HD, and HD.
[0081] In certain aspects, the DBD may include at least an additional RU at the N-terminus such that the DBD binds to the nucleic acid sequence TGGGGCTGCTCC (SEQ ID NO:3), wherein X12X13 in the additional RU is NG, HG, KG, or RG for recognition of the T.
[0082] In certain aspects, the RUs are ordered from N-terminus to the C-terminus to bind to the sequence: GGTGGGGCTGCTCC (SEQ ID NO:4), wherein the first RU at the N-terminus binds to the G at the 5’ end of the sequence and the last RU at the C-terminus binds to the C at the 3’ end of the sequence. In certain aspects, the DBD comprises at least fourteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NH, NH, NG, NH, NH, NH, NH, HD, NG, NH, HD, NG, HD, and HD. In certain aspects, the DBD comprises three additional RU at the N-terminus such that the DBD binds to the nucleic acid sequence TGGTGGGGCTGCTCC (SEQ ID NO:5). In certain aspects, the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence GGTGGGGCTGCTCCAGG (SEQ ID NO:6).
[0083] In certain aspects, the RUs are arranged from N-terminus to C-terminus to bind to the sequence: GC AGAT CCC AC AGGCGC (SEQ ID NO:7).
[0084] In certain aspects, the RUs are arranged from N-terminus to C-terminus to bind to the sequence: CCCACAGGCGCCCTGG (SEQ ID NO: 8).
[0085] In certain aspects, the RUs are arranged from N-terminus to C-terminus to bind to the sequence: GGGGCTGCTCCAGGCATGC (SEQ ID NO: 9).
[0086] In certain aspects, the RUs may be arranged from N-terminus to C-terminus to bind to a sequence that is a complement of a sequence in region 1. In certain aspects, the complementary sequence may be the sequence: GGAGCAGCCCC (SEQ ID NO: 105). In certain aspects, the DBD that binds to the complementary sequence may include RUs ordered from N-terminus to C-terminus to bind to the sequence: GGAGCAGCCCCACCAGAGT (SEQ ID NO: 106).
[0087] Region 2:
[0088] Table 2 illustrates the identification of region 2 which includes sequences that can be targeted for repression. As can be seen from Table 2, the indicated recombinant polypeptides, that included RUs arranged from N-terminus to C-terminus to bind to the listed target sequence, repressed expression of PD- 1 by at least 80% as compared to a negative control. The location of these target sequences when aligned reveals a region (Region 2) in the minus strand of the PDCD-1 gene that may be targeted for repressing PDCD-1 expression. The alignment of the target sequences also reveals the minimal sequence that can be targeted for binding by the DBD for repressing PDCD-1 expression.
Table 2: Region 2
[0089] In some aspects, the DBD includes a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
[0090] CCTCCCCCAGCACTGCCTCTGTCACTCTCGCCCACGTGGATGTGG (SEQ ID NO: 10) [0091] As explained herein, this sequence is the sequence of Region 2.
[0092] In some aspects, the DBD includes a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: CCTCCCCCAGCACTGCCTCTGTCACTCTCGCCCACGT (SEQ ID NO: 454). As shown in the Examples section of the application, all of the eight DBD-repressor domains that bound to a nucleic acid sequence within this sequence, repressed expression of PD-1 in at least 50% of the cells treated with the DBD-repressor domain as compared to mock treated cells.
[0093] In some aspects, the DBD includes a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GCCTCTGTCACTCTCGCCCAC (SEQ ID NO: 444). As shown in the Examples section of the application, all of the three DBD-repressor domains (pAL040, TL11128, and TL11129) that bound to a nucleic acid sequence within this sequence, repressed expression of PD-1 in at least 80% of the cells treated with the DBD-repressor domain as compared to mock treated cells.
[0094] In certain aspects, the RUs are ordered from N-terminus to C-terminus of the DBD to bind to the nucleic acid sequence TCTGTCACTCTCG (SEQ ID NO: 11). In certain aspects, the DBD comprises at least thirteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, and NH. In certain aspects, the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence GCCTCTGTCACTCTCG (SEQ ID NO: 12). In certain aspects, the DBD further comprises three additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence GCCTCTGTCACTCTCGCCC (SEQ ID NO: 13).
In certain aspects, the DBD comprises at least nineteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NH, HD, HD, NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, NH, HD, HD, and HD. In certain aspects, the DBD further comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence TCTGTCACTCTCGCCCAC (SEQ ID NO: 14). In certain aspects, the DBD comprises at least eighteen RUs, wherein X12X13 in the RUs from N-terminus to C- terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, NG, NH, HD, HD, HD, NI, and HD.
[0095] In certain aspects, the DBD comprises thirteen RUs ordered from N-terminus to C-terminus of the DBD to bind to the nucleic acid sequence: CCCCCAGCACTGC (SEQ ID NO: 15). In certain aspects, the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCTCCCCCAGCACTGC (SEQ ID NO: 16). In certain aspects, the DBD further comprises an additional RU at the C-terminus such that the DBD binds to the nucleic acid sequence: CCTCCCCCAGCACTGCC (SEQ ID NO: 17).
[0096] Region 3:
[0097] Table 3 illustrates the identification of region 3 which includes sequences that can be targeted for repression. As can be seen from Table 3, the indicated recombinant polypeptides, that included RUs arranged from N-terminus to C-terminus to bind to the listed target sequence, repressed expression of PD- 1 by at least 80% as compared to a negative control. The location of these target sequences when aligned reveals a region (Region 3) in the minus strand in the PDCD-1 gene that may be targeted for repressing PDCD-1 expression. The alignment of the target sequences also reveals the minimal sequence that can be targeted for binding by the DBD for repressing PDCD-1 expression.
Table 3 : Region 3
[0098] In certain aspects, the DBD includes at least nine RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
[0099] CCCTTCAACCTGACCTGGGACAGTTTCCCTTCCGCTCACCTCCGCCTGA (SEQ ID NO: 19). As explained in the Examples section of the application, this sequence corresponds to region 3 of the PDCD1 gene.
[0100] In certain aspects, the DBD comprises ten RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: TCCGCTCACC (SEQ ID NO:20). In certain aspects, the DBD comprises nine additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: TCCGCTCACCTCCGCCTGA (SEQ ID NO:21). In certain aspects, the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCCGCTCACC (SEQ ID NO:22). In certain aspects, the DBD comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCCGCTCACCTCCGC (SEQ ID NO:23). In certain aspects, the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: TTCCCTTCCGCTCACC (SEQ ID NO:24). [0101] In certain aspects, the DBD comprises twelve RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: GGGACAGTTTCC (SEQ ID NO:25). In certain aspects, the DBD further comprises four additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: GGGACAGTTTCCCTTC (SEQ ID NO:26). In certain aspects, the DBD further comprises five additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: GACCTGGGACAGTTTCC (SEQ ID NO: 27).
[0102] In certain aspects, the DBD comprises eleven RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: CAACCTGACCT (SEQ ID NO:28). In certain aspects, the DBD comprises nine additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CAACCTGACCTGGGACAGTT (SEQ ID NO:29) In certain aspects, the DBD comprises five additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCAACCTGACCT (SEQ ID NO:30).
[0103] Region 4:
[0104] Table 4 illustrates the identification of region 4 which includes sequences that can be targeted for repression. As can be seen from Table 4, the indicated recombinant polypeptides, that included RUs arranged from N-terminus to C-terminus to bind to the listed target sequence, repressed expression of PD- 1 by at least 80% as compared to a negative control. The location of these target sequences when aligned reveals a region (Region 4) in the plus strand of the PDCD-1 gene that may be targeted for repressing PDCD-1 expression. The alignment of the target sequences also reveals the minimal sequence that can be targeted for binding by the DBD for repressing PDCD-1 expression.
Table 4: Region 4
[0105] In other aspects, the DBD includes at least nine RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GCCGCCTTCTCCACTGCTCAGGCGGAGGT (SEQ ID NO:31). [0106] As explained in the Examples section of the application, this sequence corresponds to Region 4. [0107] In certain aspects, the DBD comprises RUs arranged from N-terminus to C-terminus such that the DBD binds to the nucleic acid sequence: GCCGCCTTCTCCACT (SEQ ID NO:32).
[0108] In certain aspects, the DBD comprises RUs arranged from N-terminus to C-terminus such that the DBD binds to the nucleic acid sequence: CCACTGCTCAGGCG (SEQ ID NO:33). In certain aspects, the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: TCTCCACTGCTCAGGCG (SEQ ID NO:34). In certain aspects, the DBD further comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CCACTGCTCAGGCGGAGGT (SEQ ID NO:35).
[0109] In addition to the recombinant polypeptides that bind to a sequence in Regions 1-4 of PDCD1, the present disclosure provides additional recombinant polypeptides for repressing PDCD1 expression. In certain aspects, the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: CCCAGGTCAGGTTGAAG (SEQ ID NO:63). In certain aspects, the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: GGCCAGGGCGCCTGT (SEQ ID NO:36). In certain aspects, the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: CTGCATGCCTGGAGCAG (SEQ ID NO:37). In certain aspects, the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: GCTCCCGCCCCCTCTTCCT (SEQ ID NO:38). In certain aspects, the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: CTTCCTCCACATCCACG (SEQ ID NO:39). In certain aspects, the DBD of the recombinant polypeptide includes at least nine RUs ordered from N- terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence: CCTCCACATCCACGTGGGC (SEQ ID NO:40). [0110] In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to a sequence present in a target sequence listed in Table 9 and shown to have a PD-1 suppression of at least 50%. As noted herein the RUs may range from 7 to 40 in number. In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to the target sequence listed in Table 9 and shown to have a PD-1 suppression of at least 50%.
[0111] In certain aspects, the recombinant polypeptides disclosed herein all reduce the expression of PDCD1 gene in at least 50% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the target gene.
[0112] In certain aspects, the recombinant polypeptides disclosed herein all reduce the expression of the PDCD1 gene in at least 80% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the PDCD1 gene. Accordingly, in certain aspects, a recombinant polypeptide of the present disclosure may include a DBD and a transcriptional repressor domain, the DBD comprising a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within one of the following sequences:
[0113] TGGTGGGGCTGCTCC (SEQ ID NO:5);
[0114] GACCTGGGACAGTTTCC (SEQ ID NO:27);
[0115] TTCCCTTCCGCTCACC (SEQ ID NO:24);
[0116] GCCGCCTTCTCCACT (SEQ ID NO:32);
[0117] GCCTCTGTCACTCTCGCCC (SEQ ID NO: 13);
[0118] CCCAGGTCAGGTTGAAG (SEQ ID NO:63);
[0119] GGTGGGGCTGCTCCAGG (SEQ ID NO:6);
[0120] TCTCCACTGCTCAGGCG (SEQ ID NO:34);
[0121] TCCGCTCACCTCCGCCTGA (SEQ ID NO:21);
[0122] CCCTTCCGCTCACCTCCGC (SEQ ID NO:23);
[0123] GGGACAGTTTCCCTTC (SEQ ID NO:26);
[0124] GCCTCTGTCACTCTCG (SEQ ID NO: 12);
[0125] GCAGATCCCACAGGCGC (SEQ ID NO:7);
[0126] CCTCCCCCAGCACTGC (SEQ ID NO: 16);
[0127] CCTCCCCCAGCACTGCC (SEQ ID NO: 17);
[0128] TCTGTCACTCTCGCCCAC (SEQ ID NO: 14); and [0129] CAACCTGACCTGGGACAGTT (SEQ ID NO:29),
[0130] wherein each of the RU comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent.
[0131] In certain aspects, the DBD comprises at least fourteen RUs, at least sixteen, or at least seventeen RUs and optionally, up to 25 RUs.
[0132] In certain aspects, DBD binds to the nucleic acid sequence selected from SEQ ID NOs.: 5, 27, 24, 32, 13, 63, 6, 34, 21, 23, 26, 12, 7, 16, 17, 14, and 29.
TIM3 Repressors
[0133] Provided herein are recombinant polypeptides that bind to sequences in the TIM3 gene that have been identified to be present in regions of the gene that when bound by the recombinant polypeptides comprising a transcriptional repressor domain lead to suppression of TIM3 expression from the TIM3 gene.
[0134] The sequences in the TIM3 gene that were tested to determine repression by a transcriptional repressor domain bound to the sequence are pictorially depicted in Figure 6. The analysis of repression by the disclosed recombinant polypeptides that are designed bind to these sequences identified certain regions that provide repression of TIM3 expression in at least 50% of the cells expressing these recombinant polypeptides. One such region is depicted in Figure 6. As explained in the Examples section, in this region, the anti-sense strand of the TIM3 gene as well as the sense strand was successfully targeted to significantly repress expression of TIM3.
[0135] The analysis of repression by the disclosed recombinant polypeptides that are designed bind to these sequences identified certain regions that provide repression of TIM3 expression in at least 50% of the cells expressing these recombinant polypeptides. One such region is demarcated in FIG. 6. In this region both sense and anti-sense strand of the TIM3 gene was successfully targeted to significantly repress expression of TIM3-1. The following Table illustrates the sequences present in this region of TIM3 that can be successfully targeted for repression.
Table 5:
[0136] As evident from Table 5, the sequences to which TL9337 and TL8188 as well as the sequence between these two sequences (indicated in bold font) can be targeted for TIM3 suppression. This anti- sense sequence of TIM3 is listed in Table 5. The sequences to which TL8189 and TL9339 bind define a region in the sense strand that can be targeted for TIM3 suppression. The sequence of this sense strand is complementary to the anti-sense sequence listed in Table 5.
[0137] In certain aspects, a recombinant polypeptide that suppresses expression of TIM3 encoded by the TIM3 gene may include a DNA binding domain (DBD) and a transcriptional repressor. The DBD may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within the sequence: GGCAGTGTTACTATAAGAATCACTGGCAATCAGACACCCGGGTG (SEQ ID NO: 41) or a complement thereof.
[0138] In certain aspects, the DBD comprises RUs that bind to the nucleic acid sequence TGTTACTATA (SEQ ID NO:42). In certain aspects, the DBD comprises an additional RU at the C- terminus such that the DBD binds to the nucleic acid sequence TGTTACTATA (SEQ ID NO:43). In certain aspects, the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence CAGTGTTACTATAA (SEQ ID NO:44). the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence GGCAGTGTTACTATAA (SEQ ID NO:45).
[0139] In certain aspects, the DBD comprises RUs that bind to the nucleic acid sequence TCAGACACCCGGGTG (SEQ ID NO:46). In certain aspects, the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence CAATCAGACACCCGGGTG (SEQ ID NO:47). In certain aspects, the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence TGGCAATCAGACACCCGGGTG (SEQ ID NO:48).
[0140] In another aspect, a recombinant polypeptide that represses TIM3 expression may bind to a sequence that is a complement of
GGCAGTGTTACTATAAGAATCACTGGCAATCAGACACCCGGGTG (SEQ ID NO:41) may bind to the sequence: TGTCTGATTGCCAGTGATTCTTATAGT (SEQ ID NO:49). In certain aspects, the DBD comprises RUs that are ordered to bind to the sequence TGCCAGTGATT (SEQ ID NO:50). In certain aspects, the DBD comprises eight additional RUs at the C-terminus such that the DBD binds to the sequence TGCCAGTGATTCTTATAGT (SEQ ID NO:51). In certain aspects, the DBD comprises RUs that are ordered to binds to the sequence TGATTGCCAGTGATT (SEQ ID NO:52). In certain aspects, the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence TGTCTGATTGCCAGTGATT (SEQ ID NO:53).
[0141] In addition to the recombinant polypeptides that bind to a sense or anti-sense sequence in the region of TIM3 identified herein, the present disclosure provides additional recombinant polypeptides for repressing TIM3 expression. In certain aspects, the DBD of such a recombinant polypeptide may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of TIM3 gene, wherein the nucleic acid sequence is: TACACACAT (SEQ ID NO:54). In certain aspects, the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence ACACTACACACAT (SEQ ID NO:55). In certain aspects, the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence TGCCACACTACACACAT (SEQ ID NO:56).
[0142] In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to a sequence present in a target sequence listed in Table 10 and shown to have a TIM3 suppression of at least 50%. As noted herein the RUs may range from 7 to 40 in number. In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to the target sequence listed in Table 10 and shown to have a TIM3 suppression of at least 50%.
[0143] In certain aspects, the recombinant polypeptides disclosed herein all reduce the expression of TIM3 gene in at least 50% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the TIM3.
[0144] In certain aspects, the recombinant polypeptides disclosed herein all reduce the expression of the TIM3 gene in at least 80% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the TIM3 gene. Accordingly, in certain aspects, a recombinant polypeptide of the present disclosure may include a DBD and a transcriptional repressor domain, the DBD comprising a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within one of the following sequences:
[0145] GGCAGTGTTACTATAA (SEQ ID NO:45); [0146] TGCCAGTGATTCTTATAGT (SEQ ID NO:51);
[0147] TGGCAATCAGACACCCGGGTG (SEQ ID NO:48);
[0148] TGCCACACTACACACAT (SEQ ID NO:56); or [0149] TGTCTGATTGCCAGTGATT (SEQ ID NO:53),
[0150] wherein each of the RU comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent.
[0151] In certain aspects, the DBD comprises at least fourteen RUs, at least sixteen, or at least seventeen RUs and optionally, up to 25 RUs.
[0152] In certain aspects, DBD binds to the nucleic acid sequence selected from SEQ ID NOs:45, 51, 48, 56, and 53.
LAG3 Repressors
[0153] Provided herein are recombinant polypeptides that bind to sequences in the LAG3 gene that have been identified to be present in regions of the gene that when bound by the recombinant polypeptides comprising a transcriptional repressor domain lead to suppression of LAG3 expression from the LAG3 gene.
[0154] The sequences in the LAG3 gene that were tested to determine repression by a transcriptional repressor domain bound to the sequence are pictorially depicted in Figure 11. The analysis of repression by the disclosed recombinant polypeptides that are designed bind to these sequences identified certain regions that provide repression of LAG3 expression in at least 50% of the cells expressing these recombinant polypeptides. One such region is depicted in Figure 11. The following Table illustrates the sequences present in this region of LAG3 that can be successfully targeted for repression.
Table 6: LAG3 Repressors
[0155] In certain aspects, the recombinant polypeptide that binds to this region may include a DBD in which the RUs are ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the LAG3 gene, wherein the nucleic acid sequence is present within the sequence: GCCGTTCTGCTGGTCTCTGGGCCTTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO: 57).
[0156] In certain aspects, the DBD comprises RUs that bind to the sequence TCTGCTGGTCT (SEQ ID NO:58). In certain aspects, the DBD comprises five additional RUs at the N-terminus such that the DBD binds to the sequence GCCGTTCTGCTGGTCT (SEQ ID NO:59). In certain aspects, the DBD comprises two additional RUs at the C-terminus such that the DBD binds to the sequence GCCGTTCTGCTGGTCTCT (SEQ ID NO:60). In certain aspects, the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGC (SEQ ID NO:61). In certain aspects, the DBD comprises an additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGCC (SEQ ID NO:62). In certain aspects, the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGCCTTC (SEQ ID NO:63).
[0157] In certain aspects, the DBD comprises RUs that bind to the sequence TCTCTGGGCCTTCA (SEQ ID NO:64). In certain aspects, the DBD comprises two additional RUs at the N-terminus such that the DBD binds the sequence GGTCTCTGGGCCTTCA (SEQ ID NO:65). In certain aspects, the DBD comprises three additional RUs at the C-terminus such that the DBD binds the sequence GGTCTCTGGGCCTTCACCC (SEQ ID NO:66). In certain aspects, the DBD comprises an additional RUs at the N-terminus such that the DBD binds the sequence TGGTCTCTGGGCCTTCACC (SEQ ID NO:67).
[0158] In certain aspects, the DBD comprises RUs that bind to the sequence TTCACCCCTGTG (SEQ ID NO:68). In certain aspects, the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCG (SEQ ID NO:69). In certain aspects, the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCGGCCT (SEQ ID NO:70). In certain aspects, the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO:71).
[0159] In addition to the recombinant polypeptides that bind to a sequence in the region of LAG3 identified herein, the present disclosure provides additional recombinant polypeptides for repressing LAG3 expression. In certain aspects, the DBD of such a recombinant polypeptide may include a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of LAG3 gene, wherein the nucleic acid sequence is: TGCTCTGTCTGC (SEQ ID NO:72). the DBD comprises two additional RUs at the C-terminus such that the DBD binds to the sequence TGCTCTGTCTGCTC (SEQ ID NO:73). In certain aspects, the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the sequence TTTGCTCTGTCTGCTC (SEQ ID NO:74).
[0160] In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to a sequence present in a target sequence listed in Table 12 and shown to have a LAG3 suppression of at least 50%. As noted herein the RUs may range from 7 to 40 in number. In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to the target sequence listed in Table 12 and shown to have a LAG3 suppression of at least 50%.
[0161] In certain aspects, the recombinant polypeptides disclosed herein all reduce the expression of LAG3 gene in at least 50% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the LAG3.
[0162] In certain aspects, the recombinant polypeptides disclosed herein reduce the expression of the LAG3 gene in at least 80% of the cells transfected with a nucleic acid encoding the recombinant polypeptides while cells not transfected with a nucleic acid encoding the recombinant polypeptides do not show a significant decrease in the expression of the TIM3 gene. Accordingly, in certain aspects, a recombinant polypeptide of the present disclosure may include a DBD and a transcriptional repressor domain, the DBD comprising a plurality of RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within one of the following sequences:
[0163] GGTCTCTGGGCCTTCA (SEQ ID NO:65);
[0164] TCTGCTGGTCTCTGGGCC (SEQ ID NO:448);
[0165] GCCGTTCTGCTGGTCTCT (SEQ ID NO:60);
[0166] GCCGTTCTGCTGGTCT (SEQ ID NO:59);
[0167] TTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO:71);
[0168] TGGTCTCTGGGCCTTCACCC (SEQ ID NO:449);
[0169] TCTGCTGGTCTCTGGGCCTTC (SEQ ID NO:450); or [0170] TTTGCTCTGTCTGCTC (SEQ ID NO:74),
[0171] wherein each of the RU comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent.
[0172] In certain aspects, the DBD comprises at least fourteen RUs, at least sixteen, or at least seventeen RUs and optionally, up to 25 RUs.
[0173] In certain aspects, DBD binds to the nucleic acid sequence selected from SEQ ID NOs: 65, 448, 60, 59, 71, 449, 450, and 74.
CTLA4 Repressors
[0174] Provided herein are recombinant polypeptides that bind to sequences in the CTLA4 gene that have been identified to be present in regions of the gene that when bound by the recombinant polypeptides comprising a transcriptional repressor domain lead to suppression of CTLA4 expression from the CTLA4 gene.
[0175] The sequences in the CTLA4 gene that were tested to determine repression by a transcriptional repressor domain bound to the sequence are pictorially depicted in Figure 9.
[0176] In certain aspects, the DBD of the recombinant polypeptide may include at least nine RUs ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the CTLA4 gene, wherein the nucleic acid sequence is present in the sequence: ACATATCTGGGATCAAAGCT (SEQ ID NO:75); ATATAAAGTCCTTGAT (SEQ ID NO:76); or TTCTATTCAAGTGCC (SEQ ID NO:77).
[0177] In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to a sequence present in a target sequence listed in Table 11 and shown to have a CTLA4 suppression of at least 50%. As noted herein the RUs may range from 7 to 40 in number. In certain aspects, the RUs of the recombinant polypeptide may be arranged from N-terminus to C-terminus to bind to the target sequence listed in Table 11 and shown to have a CTLA4 suppression of at least 50%. [0178] In certain aspects, the DBD may be extended at the N-terminus, the C-terminus, or both to increase the number of RUs that contact the nucleic acid sequence is present in the sequence of SEQ ID NOs: 75-77. In certain aspects, the DBD may include at least 10, at least 12, at least 13, at least 14, at least 16, or more and up to 20, 25, 35, or 40 RUs. Repeat Units
[0179] As noted above, the repeat unit may have the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), where X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent.
[0180] Any suitable RU such as those based upon the RUs from Xanthomonas transcription activator- like effector (TALE) systems, Ralstonia solanacearum (modular Ralstonia nucleic acid binding domain; RNBD), or an animal pathogen (e.g., Legionella quateirensis, Legionella maceachernii, Burkholderia, Paraburkholderia, or Francisella) (modular animal pathogen nucleic acid binding domain; MAP-NBD) may be arranged to bind to the nucleotide sequences in the target genes as disclosed herein.
[0181] In certain aspects, the DNA binding domains of the disclosed recombinant polypeptides may be engineered to include 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or more, e.g., up to 30, 40 or 50 repeat units arranged in a N-terminal to C-terminal direction to bind to a predetermined 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 nucleotide length nucleic acid sequence, such as, a sequence disclosed herein. In certain aspects, DNA binding domains may be engineered to include 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more or more, e.g., up to 30, 40 or 50 repeat units that are specifically ordered or arranged to bind to target nucleic acid sequences of length 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more or more, e.g., up to 30, 40 or 50, respectively. In certain embodiments the RUs are contiguous. In some embodiments, half-RUs may be used in the place of one or more RUs. In some aspects, the last RU in a DBD may be a half RU.
DBD derived from Xanthomonas TALE
[0182] In certain aspects, the RUs and the half-RU, if present, are derived from Xanthomonas TALE. In certain aspects, X 1-11 is at least 80%, at least 90%, or 100% identical to LTPEQVVAIAS (SEQ ID NO: 458), LTPAQVVAIAS (SEQ ID NO: 459), LTPDQVVAIAN (SEQ ID NO: 460), LTPDQVVAIAS (SEQ ID NO: 461), LTPYQVVAIAS (SEQ ID NO: 462), LTREQVVAIAS (SEQ ID NO: 463), or LSTAQVVAIAS (SEQ ID NO: 464). In certain aspects, X14-20 or 21 or 22 is at least 80%, at least 90%, at least 95%, or 100% identical to GGKQALETV QRLLPVLCQDHG (SEQ ID NO:79),
GGKQALATV QRLLPVLCQDHG (SEQ ID NO: 467), GGKQALETV QRVLPVLCQDHG (SEQ ID NO: 468), or GGKQALETV QRVLPVLCQDHG (SEQ ID NO: 468). In certain aspects, the RU is at least 80%, at least 90%, at least 95%, or 100% identical to:
LTPEQVVAIASX12X13GGKQALETVQRLLPVLCQDHG (SEQ ID NO: 470), X12X13 is repeat variable diresdue (RVD) and is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent.
[0183] In certain aspects, the DBD may include a N-cap region at N-terminus of the recombinant polypeptide which N-cap region is derived from the N-cap region of a Xanthomonas TALE protein. In certain aspects, the DBD may include a N-cap region at the N-terminus which may be present immediately adjacent the first RU. In certain aspects, the N-cap region at the N-terminus which may be linked to the first RU via a linker.
[0184] An N-cap region may be any length, e.g., may comprise from about 0 to about 136 amino acid residues in length. An N-terminal cap may be about 5, about 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 60, about 70, about 80, about 90, about 100, about 110, about 120, or about 130 amino acid residues in length. In certain aspects, the DBD comprises a N-cap region comprising an amino acid sequence at least 80% (e.g., at least 90%, at least 95%, or 100%) identical to the amino acid sequence:
DYKDHDGDYKDHDIDYKDDDDKMAPKKKRKVGIHRGVPMVDLRTLGYSQQQQEKIKPKVRST VAQHHEALVGHGFTHAHIVALSQHPAALGTVAVKYQDMIAALPEATHEAIVGVGKQWSGARA LEALLTVAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHAWRNALTGAPLETPN (SEQ ID NO:339)
[0185] In certain aspects, the N-cap region is from TALE proteins like those expressed in Burkholderia, Paraburkholderia, or Xanthomonas. In certain aspects, the N-cap regions may be derived from N-cap domain used in conjunction with DNA binding domains disclosed in US20180010152. In certain aspects, the N-cap regions may be derived from the N-terminal regions disclosed in US20150225465, e.g., SEQ ID NOs.:7, 8, or 9 disclosed therein.
[0186] In some aspects, the N-cap region may include the amino acid residues from position 1 (N) through position 137 (M) of the naturally occurring Xanthomonas TALE protein (numbered backwards with N(l) being the residue immediately adjacent the first RU:
MVDLRTLGYSQQQQEKIKPKVRSTVAQHHEALVGHGFTHAHIVALSQHPAALGTVAVKY QDMI AALPEATHEAIVGVGKQWSGARALEALLTVAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHA WRNALTGAPLN (SEQ ID NO: 107). [0187] This amino acid sequence includes a M added to the N-terminus which is not present in the wild type N-cap region of a Xanthomonas TALE protein. This amino acid sequence is generated by deleting amino acids N+288 through N+137 of the N-terminus region of a TALE protein, adding a M, such that amino acids N+136 through N+l of the N-terminus region of the TALE protein are present.
[0188] In some embodiments, the N-terminus can be truncated such that the fragment of the N-terminus includes amino acids from position 1 (N) through position 120 (K) of the naturally occurring Xanthomonas spp. -derived protein as follows:
KPKVRSTVAQHHEALVGHGFTHAHIVALSQHPAALGTVAVKYQDMIAALPEATHEAIVGVGKQ WSGARALEALLTVAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHAWRNALTGAPLN (SEQ ID NO: 301).
[0189] In some aspects, the N-cap region can be truncated such that the fragment of the N-terminus includes amino acids from position 1 (N) through position 115 (S) of the naturally occurring Xanthomonas spp. -derived protein as follows:
STVAQHHEALVGHGFTHAHIVALSQHPAALGTVAVKYQDMIAALPEATHEAIVGVGKQWSGAR ALEALLTVAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHAWRNALTGAPLN (SEQ ID NO:
321).
[0190] In some aspects, the N-cap region can be truncated may include amino acids from position 1 (N) through position 110 (H) of the naturally occurri ng Xanthomonas spp. -derived protein as follows: HHEALVGHGFTHAHIVALSQHPAALGTVAVKY QDMIAALPEATHEAIVGVGKQW SGARALEAL LTVAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHAWRNALTGAPLN (SEQ ID NO: 447).
[0191] In certain aspects, the DBD may include a C-cap region at C-terminus of the recombinant polypeptide which C-cap region is derived from the C-cap region of a Xanthomonas TALE protein. In certain aspects, the C-cap region at the C-terminus which may be present immediately adjacent the last RU or the last half-RU, if present. In certain aspects, the C-cap region at the C-terminus which may be linked to the last RU or the last half-RU, if present, via a linker.
[0192] A C-cap may be any length and may comprise from about 0 to about 278 amino acid residues in length. A C-terminal cap may be about 5, about 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 60, about 80, about 100, about 150, about 200, or about 250 amino acid residues in length. In certain aspects, the DBD comprises a C-cap region comprising an amino acid sequence at least 80% (e.g., at least 90%, at least 95%, or 100%) identical to the amino acid sequence: SIVAQLSRPDPALAALTNDHLVALACLGGRPALDAVKKGLPHAPALIKRTNRRIPERTSHRVA (SEQ ID NO:452).
[0193] In certain aspects, the C-cap region is from TALE proteins like those expressed in Burkholderia, Parahurkholderia, or Xanthomonas. [0194] In some aspects, the C-Cap region can be positions 1 (S) through position 278 (Q) of the naturally occurring Xanthomonas spp. -derived protein as follows:
SIVAQLSRPDPALAALTNDHLVALACLGGRPALDAVKKGLPHAPALIKRTNRRIPERTSHRVADH AQVVRVLGFFQCHSHPAQAFDDAMTQFGMSRHGLLQLFRRVGVTELEARSGTLPPASQRWDRI FQASGMKRAKPSPTSTQTPDQASFHAFADSFERDFDAPSPTHEGDQRRASSRKRSRSDRAVTGPS AQQSFEVRAPEQRDAFHFPFSWRVKRPRTSIGGGFPDPGTPTAADFAASSTVMREQDEDPFAGA ADDFPAFNEEEFAWFMEFFPQ (SEQ ID NO: 108).
[0195] In certain aspects, the predetermined N-terminus to C-terminus order of the plurality of RUs of the DNA binding domain determines the corresponding predetermined target nucleic acid sequence to which the recombinant polypeptides may bind. As used herein the RUs and at least one or more half RU are specifically ordered to target the genomic locus or gene of interest. In plant genomes, such as Xanthomonas, the natural TALE-binding sites always begin with a thymine (T), which may be specified by a cryptic signal within the non-repetitive N-cap region of the TALE polypeptide; in some cases this region may be referred to as repeat 0. In animal genomes, TALE binding sites do not necessarily have to begin with a thymine (T) and recombinant polypeptides disclosed herein may target DNA sequences that begin with T, A, G or C. In certain aspects, the recombinant polypeptides disclosed herein may target DNA sequences that begin with T and hence include a RU that contains a RVD that mediated binding to T. The tandem repeat of TALE RUs ends with a half-length repeat or a stretch of sequence that may share identity with only the first 20 amino acids of a repetitive full length TALE RU and this half repeat may be referred to as a half-monomer, a half RU, or a half repeat. Therefore, it follows that the length of the DNA sequence being targeted by DBD derived from TALEs is equal to the number of full RUs plus two. Thus, for example, DBD may be engineered to include X number (e.g., 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26) full length RUs that are specifically ordered or arranged to target nucleic acid sequences of X+2 length (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23,
24, 25, 26, 27, or 28 nucleotides, respectively), with the first RU binding “T” and the last RU being a half-repeat.
[0196] As noted herein, in certain aspects, the last RU in the DBD may be a half repeat. The half repeat may comprise the amino acid sequence X 1-11X12X13 X14-19 ,20 or 21 (SEQ ID NO: 471), wherein X 1-11 is a chain of 11 contiguous amino acids, X14-19 ,20 or 21 is a chain of 7, 8 or 9 contiguous amino acids, and X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent. In certain aspects, X 1-11 is at least 80% identical, at least 90% identical, or 100% identical to LTPEQVVAIAS (SEQ ID NO:458). In certain aspects, X14-20 or 21 or 22 is at least 80% identical to GGRPALE (SEQ ID NO: 472).
[0197] As noted herein, a recombinant polypeptide disclosed herein may include from N- to C-terminus, a N-cap region, a DBD comprising a plurality of RUs, a C-cap region, an optional linker, and a transcription repressor domain. In cases, where the RUs are derived from a TALE protein, the recombinant polypeptide may be referred to as TALE-TF. The recombinant polypeptides, such as, TALE- TFs, of the present disclosure can further include a linker connecting the DBD or the C-cap region, if present, to the repressor domain. The linker can serve to provide flexibility between the TALE protein and the repressor domain, allowing for the repressor domain (e.g., KRAB to efficiently inhibit transcriptional machinery). A linker used herein can be a short flexible linker comprising an amino acid sequence comprising 0 residues, 1-3 residues, 4-7 residues, 8-10 residues, 10-12 residues, 5-20 residues, 12-15 residues, or 1-15 residues. Linkers can include, but are not limited to, residues such as glycine, methionine, aspartic acid, alanine, lysine, serine, leucine, threonine, tryptophan, or any combination thereof. The linker can have the amino acid sequence of GGGGGMDAKSLTAWS (SEQ ID NO: 109). [0198] In certain aspects, a Xanthomonas spp. -derived repeat units can have a sequence of LTPDOVVAIASNHGGKOALETVORLLPVLCODHG (SEQ ID NO: 438) comprising an RVD of NH, which recognizes guanine. A Xanthomonas spp. -derived repeat units can have a sequence of LTPDOVVAIASNGGGKOALETVORLLPVLCODHG (SEQ ID NO: 439) comprising an RVD of NG, which recognizes thymidine. A Xanthomonas spp. -derived repeat units can have a sequence of LTPDQVVAIASNIGGKQALETV QRLLPVLCQDHG (SEQ ID NO: 440) comprising an RVD of NI, which recognizes adenosine. A Xanthomonas spp. -derived repeat units can have a sequence of LTPDOVVAIASHDGGKOALETVORLLPVLCODHG (SEQ ID NO: 441) comprising an RVD of HD, which recognizes cytosine.
DBD derived from Ralstonia
[0199] In certain aspects, the RUs and one or both N-Cap and C-Cap regions may be derived from a transcription activator like effector-like protein (TALE-like protein) of Ralstonia solanacearum. Repeat units derived from Ralstonia solanacearum can be 33-35 amino acid residues in length. In some embodiments, the repeat can be derived from the naturally occurring Ralstonia solanacearum TALE-like protein.
[0200] As noted herein, the RUs may have the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), where X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is RVD and is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent. In certain aspects,X 1-11 may include a stretch of amino acids at least 80%, at least 90%, or a 100% identical to the X 1-11 residues of the following RUs from Ralstonia. In certain aspects, X14-33 or 34 or 35 may include a stretch of 20, 21, or 22 amino acids at least 80%, at least 90%, or a 100% identical to the X14-33 or 34 or 35 residues of the following RUs from Ralstonia:
[0201] In certain aspects, a Ralstonia solanacearum- repeat unit can have at least 80% sequence identity with any one of the Ralstonia RUs provided herein.
[0202] In certain aspects, the DBD may include a N-cap region at the N-terminus which may be present immediately adjacent the first RU or may be linked to the first RU via a linker. In some aspects, an DBD of the present disclosure can have the full length naturally occurring N-terminus of a naturally occurring Ralstonia solanacearum-derived protein. In some aspects, any truncation of the full length naturally occurring N-terminus of a naturally occurring Ralstonia solanacearum-derived protein can be used at the N-terminus of a DBD of the present disclosure. For example, in some embodiments, amino acid residues at positions 1 (H) to position 137 (F) of the naturally occurring Ralstonia solanacearum-derived protein N-terminus can be used as the N-cap region. In particular embodiments, the truncated N-terminus from position 1 (H) to position 137 (F) can have a sequence as follows:
FGKLVALGYSREQIRKLKQESLSEIAKYHTTLTGQGFTHADICRISRRRQSLRVVARNYPELAAAL PELTRAHIVDIARQRSGDLALQALLPVATALTAAPLRLSASQIATVAQYGERPAIQALYRLRRKLT RAPLH (SEQ ID NO:227). In some embodiments, the naturally occurring N-terminus of Ralstonia solanacearum can be truncated to any length and used as the N-cap of the engineered DNA binding domain. For example, the naturally occurring N-terminus of Ralstonia solanacearum can be truncated to include amino acid residues at position 1 (H) to position 120 (K) as follows:
KQESLSEIAKYHTTLTGQGFTHADICRISRRRQSLRVVARNYPELAAALPELTRAHIVDIARQRSG DLALQALLPVATALTAAPLRLSASQIATVAQYGERPAIQALYRLRRKLTRAPLH (SEQ ID NO:228) and used as the N-cap of the DBD. The naturally occurring N-terminus of Ralstonia solanacearum can be truncated amino acid residues to include positions 1 to 115 and used at the N-cap of the engineered DNA binding domain. The naturally occurring N-terminus of Ralstonia solanacearum can be truncated to amino acid residues at positions 1 to 50, 1 to 70, 1 to 100, 1 to 120, 1 to 130, 10 to 40, 60 to 100, or 100 to 120 and used as the N-cap of the engineered DNA binding domain. As noted for N-cap region derived from Xanthomonas TALE, the amino acid residues are numbered backward from the first repeat unit such that the amino acid (H in this case) of the N-cap adjacent the first RU is numbered 1 while the N-terminal amino acid of the N-cap is numbered 137 (and is F in this case) or 120 (and is K in this case).
[0203] In some embodiments, the N-cap, referred to as the amino terminus or the “NH2” domain, can recognize a guanine. In some embodiments, the N-cap can be engineered to bind a cytosine, adenosine, thymidine, guanine, or uracil.
[0204] In some embodiments, an DBD of the present disclosure can include a plurality of RUs followed by a final single half-repeat also derived from Ralstonia solanacearum. The half repeat can have 15 to 23 amino acid residues, for example, the half repeat can have 19 amino acid residues. In particular embodiments, the half-repeat can have a sequence as follows: LSTAQVVAIACISGQQALE (SEQ ID NO:229).
[0205] In some embodiments, an DBD of the present disclosure can have the full length naturally occurring C-terminus of a naturally occurring Ralstonia solanacearum-derived protein as a C-cap region that is conjugated to the last RU. In some embodiments, any truncation of the full length naturally occurring C-terminus of a naturally occurring Ralstonia solanacearum-derived protein can be used as the C-cap. For example, in some embodiments, the DBD can comprise amino acid residues at position 1 (A) to position 63 (S) as follows:
AIEAHMPTLRQASHSLSPERVAAIACIGGRSAVEAVRQGLPVKAIRRIRREKAPVAGPPPAS (SEQ ID NO:230) of the naturally occurring Ralstonia solanacearum- derived protein C-terminus. In some embodiments, the naturally occurring C-terminus of Ralstonia solanacearum can be truncated to any length and used as the C-cap of the DBD. For example, the naturally occurring C-terminus of Ralstonia solanacearum can be truncated to amino acid residues at positions 1 to 63 and used as the C-terminus of the DBD. The naturally occurring C-terminus of Ralstonia solanacearum can be truncated amino acid residues at positions 1 to 50 and used as the C-cap of the DBD. The naturally occurring C-terminus of Ralstonia solanacearum can be truncated to amino acid residues at positions 1 to 63, 1 to 50, 1 to 70, 1 to 100, 1 to 120, 1 to 130, 10 to 40, 60 to 100, or 100 to 120 and used as the C-cap of the DBD.
[0206] TABLE 7 shows N-Cap, C-Cap, and half-repeats derived from Ralstonia.
DBD derived from Animal Pathogens
[0207] In some embodiments, the present disclosure provides DNA binding domains in which the repeat units can be derived from a Legionellales bacterium, a species of the genus of Legionella, such as L. quateirensis or L. maceachernii, the genus of Burkholderia, the genus of Paraburkholderia, or the genus of Francisella.
[0208] As noted herein, the RUs may have the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), where X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, HN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, HA, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H*, HA, KA, N*, NA, NC, NS, RA, or S*for recognition of A or T or G or C, wherein (*) means that the amino acid at X13 is absent. In certain aspects, X 1-11 may include a stretch of amino acids at least 80%, at least 90%, or a 100% identical to the X 1-11 residues of the following RUs from animal pathogens, Legionella, Burkholderia, Paraburkholderia, or Francisella. In certain aspects, X14-33 or 34 or 35 may include a stretch of 20, 21, or 22 amino acids at least 80%, at least 90%, or a 100% identical to the X14-33 or 34 or 35 residues of the RUs from animal pathogens, Legionella (e.g ., L. quateirensis or L. maceachernii), Burkholderia, Paraburkholderia, or Francisella listed in Table 8. Table 8: Repeat Unit Sequences
[0209] Residues X12X13 of the RU may include base contacting residues (BCR) as listed in the table 8 and may be chosen based upon the target nucleic acid sequence.
[0210] In certain aspects, the last RU in the DBD may be a half RU. In certain aspects, the half RU may include a sequence that is at least 80%, at least 90%, at least 95% or a 100% identical to the half RU from L. quateirensis (FNAEQIVRMVSX12X13GGSKNU) (SEQ ID NO:333). In certain aspects, the half RU may include a sequence that is at least 80%, at least 90%, at least 95% or a 100% identical to the half RU from Francisella (YNKKQIVLIASX12X13SGG) (SEQ ID NO:334).
[0211] In certain aspects, the polypeptide comprises an N-cap region, where the C-terminus (i.e., the last amino acid) of the N-cap region is covalently linked to the N-terminus (i.e., the first amino acid) of the first RU of the DBD either directly or via a linker. In certain aspects, the N-cap region is the N-terminus of L. quateirensis protein and may have an amino acid sequence that is at least 80% (e.g., at least 85%, at least 90%, 95%, or 99%, or a 100%) identical to the amino acid sequence:
[0212] MPDLELNFAIPLHLFDDETVFTHDATNDNSQASSSYSSKSSPASANARKRTSRKEMSGPP SKEPANTKSRRANSQNNKLSLADRLTKYNIDEEFYQTRSDSLLSLNYTKKQIERLILYKGRTSAV QQLLCKHEELLNLISPDG (SEQ ID NO:335). In certain aspects, the N-cap region comprises a fragment of SEQ ID NO:335. In certain aspects, the N-cap region is a N-terminal domain or a fragment thereof from TALE proteins like those expressed in Burkholderia, Paraburkholderia, or Xanthomonas.
[0213] In certain aspects, the polypeptide comprises a C-cap region, where the N-terminus (i.e., the first amino acid) of the C-terminal domain is covalently linked to the C-terminus (i.e., the last amino acid) of the last RU or the half-repeat unit, if present, in the DBD either directly or via a linker. In certain aspects, the C-cap region is the C-terminal domain of L. quateirensis protein and may have an amino acid sequence that is at least 80% (e.g., at least 85%, at least 90%, 95%, or 99%, or a 100%) identical to the amino acid sequence:
[0214] ALVKEYFPVFSSFHFTADQIVALICQSKQCFRNLKKNHQQWKNKGLSAEQIVDLILQETP PKPNFNNTSSSTPSPSAPSFFQGPSTPIPTPVLDNSPAPIFSNPVCFFSSRSENNTEQYLQDSTLDLDS QLGDPTKNFNVNNFWSLFPFDDVGYHPHSNDVGYHLHSDEESPFFDF (SEQ ID NO: 336).
[0215] In certain aspects, the C-cap region comprises a fragment of SEQ ID NO:336, such as a fragment having the amino acid sequence
ALVKEYFPVFSSFHFTADQIVALICQSKQCFRNLKKNHQQWKNKGLSAEQIVDLILQETPPKP (SEQ ID NO: 337). In certain aspects, the C-cap region domain is a C-terminal domain or a fragment thereof from TALE proteins like those expressed in Burkholderia, Paraburkholderia, or Xanthomonas.
Mixed DNA Binding Domains
[0216] In some embodiments, the present disclosure provides DNA binding domains in which the repeat units, the N-cap, and the C-ap can be derived from any one of Ralstonia solanacearum, Xanthomonas spp., Legionella quateirensis, Burkholderia, Paraburkholderia, or Francisella. For example, the present disclosure provides a DNA binding domain wherein the plurality of repeat units are selected from any one of the RUs as provided herein and can further comprise an N-cap and/or C-cap as provided herein.
Repressor Domain
[0217] The terms “repressor,” “repressor domain,” and “transcriptional repressor domain” are used herein interchangeably to refer to a portion of the recombinant polypeptide as disclosed herein which portion decreases expression of a gene when the recombinant polypeptide is bound to the target gene. In certain aspects, the repressor domain comprises Kriippel-associated box (KRAB) protein. In other aspects, the repressor domain comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, or MeCP2. In certain aspects, the repressor domain comprises an amino acid sequence at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or a 100% to the amino acid sequence set forth in one of SEQ ID NOs:84-101. In certain aspects, the repressor domain includes a KRAB domain comprising an amino acid sequence that is at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or a 100% to the amino acid sequence set forth in SEQ ID NO:338:
RTLVTFKDVFVDFTREEWKLLDTAQQIVYRNVMLENYKNLV SLGY QLTKPDVILRLEKGEEP .
Additional Features of the DBD
[0218] In certain aspects, the N-cap region or the C-cap region included in the disclosed DBD may include a nuclear localization sequence (NLS) to facilitate entry into the nucleus of a cell, e.g., an animal cell, such as, a human cell. In certain aspects, the polypeptide may be produced in a host cell and expressed with a translocation signal at the N-terminus which translocation signal may be cleaved during translocation.
[0219] In certain aspects, the RUs may be linked C-terminus to N-terminus with no additional amino acids separating immediately adjacent RUs. In certain aspects, immediately adjacent RUs may be separated by a spacer sequence of at least one amino acid. In certain aspects, the spacer sequence includes at least 2, 3, 4, 5, 6, or 7 amino acids, or up to 5, or up to 10 amino acids. The spacer sequence may include amino acids that have small side chains. In certain aspects, the spacer sequence is a flexible linker.
[0220] In some embodiments, a DBD of the present disclosure can comprise between 2 to 50 RUs, e.g., between 5 and 36, between 9 and 36, between 9 and 40, between 12 and 30, between 5 to 10, between 10 to 15, between 15 to 20, between 20 to 25, between 25 to 30, between 30 to 35 animal pathogen-derived repeat domains, or between 35 to 40 animal pathogen-derived repeat domains. In certain aspects, a MAP- NBD described herein can comprise up to 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22,
23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 animal pathogen-derived repeat domains.
Imaging Moieties
[0221] A recombinant polypeptide as disclosed herein can be linked to a fluorophore, such as Hydroxycoumarin, methoxycoumarin, Alexa fluor, aminocoumarin, Cy2, FAM, Alexa fluor 488, Fluorescein FITC, Alexa fluor 430, Alexa fluor 532, HEX, Cy3, TRITC, Alexa fluor 546, Alexa fluor 555, R-phycoerythrin (PE), Rhodamine Red-X, Tamara, Cy3.5, Rox, Alexa fluor 568, Red 613, Texas Red, Alexa fluor 594, Alexa fluor 633, Allophycocyanin, Alexa fluor 633, Cy5, Alexa fluor 660, Cy5.5, TruRed, Alexa fluor 680, Cy7, GFP, or mCHERRY. A recombinant polypeptide as disclosed herein can be linked to a biotinylation reagent. In certain aspects, a recombinant polypeptide labeled with an imaging moiety as disclosed herein may be used to image binding and/locabzation of the recombinant polypeptide to a site in the genome of a cell.
Compositions
[0222] In certain aspects, the polypeptides and the nucleic acids described herein may be present in a pharmaceutical composition comprising a pharmaceutically acceptable excipient. In certain aspects, the polypeptides and the nucleic acids are present in a therapeutically effective amount in the pharmaceutical composition. A therapeutically effective amount can be determined based on an observed effectiveness of the composition. A therapeutically effective amount can be determined using assays that measure the desired effect in a cell, e.g., in a reporter cell line in which expression of a reporter is modulated in response to the polypeptides of the present disclosure. The pharmaceutical compositions can be administered ex vivo or in vivo to a subject in order to practice the therapeutic and prophylactic methods and uses described herein.
[0223] The pharmaceutical compositions of the present disclosure can be formulated to be compatible with the intended method or route of administration; exemplary routes of administration are set forth herein. Suitable pharmaceutically acceptable or physiologically acceptable diluents, carriers or excipients include, but are not limited to, nuclease inhibitors, protease inhibitors, a suitable vehicle such as physiological saline solution or citrate buffered saline.
[0224] The pharmaceutical composition may include a plurality of the polypeptides provided herein. For example, the composition may include two, three, four, or more of the polypeptides provided herein, wherein the polypeptides all bind to sequences in regulatory region of the same gene or sequences in regulatory regions of different genes. For example, the composition may include a plurality of polypeptides that bind to a sequence of a target gene as disclosed herein (e.g., PD1, TIM3, or LAG3 gene). Alternatively, the composition may include a first polypeptide that binds to regulatory region of a first gene and a second polypeptide that binds to regulatory region of a second gene, where the first and second genes are independently selected from PD1, TIM3, and LAG3. The composition may include a first polypeptide that binds to regulatory region of PD1 gene, a second polypeptide that binds to regulatory region of TIM3 gene, and a third polypeptide that binds to regulatory region of LAG3 gene. The composition may include a plurality of polypeptides that bind to regulatory region of PD1 gene, a plurality of polypeptides that bind to regulatory region of TIM3 gene, and a plurality of polypeptides that bind to regulatory region of LAG3 gene.
Delivery
[0225] The polypeptides disclosed herein, compositions comprising the disclosed polypeptides, and nucleic acids encoding the disclosed polypeptides can be delivered into a target cell by any suitable means, including, for example, by injection, infection, transfection, and vesicle or liposome mediated delivery.
[0226] In certain aspects, a mRNA or a vector encoding the polypeptides disclosed herein may be injected, transfected, or introduced via viral infection into a target cell, where the cell is ex vivo or in vivo. Any vector systems may be used including, but not limited to, plasmid vectors, retroviral vectors, lentiviral vectors, adenovirus vectors, poxvirus vectors; herpesvirus vectors and adeno-associated virus vectors, etc. When two or more polypeptides according to present disclosure are introduced into the cell, the nucleic acids encoding the polypeptides may be carried on the same vector or on different vectors. Non-viral vector delivery systems include DNA plasmids, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. Vectors suitable for introduction of polynucleotides as described herein include described herein include non-integrating lentivirus vectors (IDLV). [0227] Non-viral vector delivery systems include electroporation, lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipidmucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA.
[0228] Primary cells may be isolated and used ex vivo for reintroduction into the subject to be treated. Suitable primary cells include peripheral blood mononuclear cells (PBMC), and other blood cell subsets such as, but not limited to, CD4+ T cells or CD8+ T cells. In certain aspects, the cell may be a CART cell. Suitable cells also include stem cells such as, by way of example, embryonic stem cells, induced pluripotent stem cells, hematopoietic stem cells, neuronal stem cells, mesenchymal stem cells, muscle stem cells and skin stem cells. In certain aspects, the stem cells may be isolated from a subject to be treated or may be derived from a somatic cell of a subject to be treated using the polypeptides disclosed herein.
[0229] In certain aspects, the cells into which the polypeptides of the present disclosure or a nucleic acid encoding a polypeptide of the present disclosure may be an animal cell, e.g., from a human needing treatment.
[0230] In certain aspects, the polypeptide of the present disclosure is only transiently present in a target cell. For example, the polypeptide is expressed from a nucleic acid that expressed the polypeptide for a short period of time, e.g., for up to 1 day, 3 days, 1 week, 3 weeks, or 1 month. In applications where transient expression of the polypeptide of the present disclosure is desired, adenoviral based systems may be used. Adeno-associated virus (“AAV”) vectors can also be used to transduce cells with nucleic acids encoding the polypeptide of the present disclosure, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures. In certain aspects, recombinant adeno- associated virus vectors (rAAV) such as replication-deficient recombinant adenoviral vectors may be used for introduction of nucleic acids encoding the polypeptides disclosed herein.
[0231] In certain aspects, nucleic acids encoding the polypeptides disclosed herein can be delivered using a gene therapy vector with a high degree of specificity to a particular tissue type or cell type. A viral vector is typically modified to have specificity for a given cell type by including a sequence encoding a ligand expressed as a fusion protein with a viral coat protein on the viruses' outer surface. The ligand is chosen to have affinity for a receptor known to be present on the cell type of interest.
[0232] In certain aspects, gene therapy vectors can be delivered in vivo by administration to an individual patient. In certain aspects, administration involves systemic administration (e.g., intravenous, intraperitoneal, intramuscular, subdermal, or intracranial infusion), direct injection (e.g., intrathecal), or topical application, as described below. Alternatively, vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the vector or which have been modified by expression of the polypeptide of the present disclosure encoded by the vector.
[0233] In certain aspects, the nucleic acid encoding the polypeptides provided herein may be codon optimized to enhance expression of the polypeptide in the target cell. For example, the sequence of the nucleic acid can be varied to provide codons that are known to be highly used in animal cells, such as, human cells to enhance production of the polypeptide in a human cell. For example, silent mutations may be made in the nucleotide sequence encoding a polypeptide disclosed herein for codon optimization in mammalian cells.
Methods for Gene Suppression in Target Cells
[0234] In some aspects, described herein is a method of suppressing expression of PDCD-1 gene in a cell, the method comprising introducing into the cell the recombinant polypeptide that comprises the DBD and the transcriptional repressor domain as provided herein, where the DBD binds to a target nucleic acid sequence present in the PDCD-1 gene and the transcriptional repressor suppresses expression of the PDCD-1 gene.
[0235] In some aspects, described herein is a method of suppressing expression of TIM3 gene in a cell, the method comprising introducing into the cell the recombinant polypeptide that comprises the DBD and the transcriptional repressor domain as provided herein, where the DBD binds to a target nucleic acid sequence present in the TIM3 gene and the transcriptional repressor suppresses expression of the TIM3 gene.
[0236] In some aspects, described herein is a method of suppressing expression of LAG3 gene in a cell, the method comprising introducing into the cell the recombinant polypeptide that comprises the DBD and the transcriptional repressor domain as provided herein, where the DBD binds to a target nucleic acid sequence present in the LAG3 gene and the transcriptional repressor suppresses expression of the LAG3 gene.
[0237] In certain aspects, the polypeptide is introduced as a nucleic acid encoding the polypeptide. In certain aspects, the nucleic acid is a deoxyribonucleic acid (DNA). In certain aspects, the nucleic acid is a ribonucleic acid (RNA). In certain aspects, the sequence of the nucleic acid is codon optimized for expression in a human cell.
[0238] In certain aspects, the cell is an animal cell. In certain aspects, the cell is a human cell. In certain aspects, the cell is a cancer cell. In certain aspects, the cell is an ex vivo cell.
[0239] In certain aspects, the introducing comprises administering the polypeptide or a nucleic acid encoding the polypeptide to a subject. In certain aspects, the administering comprises parenteral administration. In certain aspects, the administering comprises intravenous, intramuscular, intrathecal, or subcutaneous administration. In certain aspects, the administering comprises direct injection into a site in a subject. In certain aspects, the administering comprises direct injection into a tumor.
[0240] In certain aspects, the introducing may induce a repression of expression of the target gene for a period of at least 2 days, at least 3 days, at least 9 days, at least at least 15 days, at least 1 month, at least 6 months, at least 1 year to up to 5 years. In certain aspects, the introducing may suppress expression of gene expression by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, or more. In certain aspects, the introducing may be repeated to maintain suppression of target gene expression. In certain aspects, the introducing may be performed as a combination therapy with, for example, a cancer therapy. The combination therapy may involve introducing the recombinant polypeptide into the cell prior to, concurrently with, or after administration of cancer therapy.
[0241] An animal cell can include a cell from a marine invertebrate, fish, insects, amphibian, reptile, or mammal. A mammalian cell can be obtained from a primate, ape, equine, bovine, porcine, canine, feline, or rodent. A mammal can be a primate, ape, dog, cat, rabbit, ferret, or the like. A rodent can be a mouse, rat, hamster, gerbil, hamster, chinchilla, or guinea pig. A bird cell can be from a canary, parakeet or parrots. A reptile cell can be from a turtle, lizard or snake. A fish cell can be from a tropical fish. For example, the fish cell can be from a zebrafish (e.g., Danio rerio). A worm cell can be from a nematode (e.g., C. elegans). An amphibian cell can be from a frog. An arthropod cell can be from a tarantula or hermit crab.
[0242] A mammalian cell can also include cells obtained from a primate (e.g., a human or a non-human primate). A mammalian cell can include an epithelial cell, connective tissue cell, hormone secreting cell, a nerve cell, a skeletal muscle cell, a blood cell, an immune system cell, or a stem cell.
[0243] Exemplary mammalian cells can include, but are not limited to, 293A cell line, 293FT cell line, 293F cells , 293 H cells, HEK 293 cells, CHO DG44 cells, CHO-S cells, CHO-K1 cells, Expi293F™ cells, Flp-In™ T-REx™ 293 cell line, Flp-In™-293 cell line, Flp-In™-3T3 cell line, Flp-In™-BHK cell line, Flp-In™-CHO cell line, Flp-In™-CV-l cell line, Flp-In™-Jurkat cell line, FreeStyle™ 293-F cells, FreeStyle™ CHO-S cells, GripTite™ 293 MSR cell line, GS-CHO cell line, HepaRG™ cells, T-REx™ Jurkat cell line, Per.C6 cells, T-REx™-293 cell line, T-REx™-CHO cell line, T-REx™-HeLa cell line, NC-HIMT cell line, PC12 cell line, primary cells (e.g., from a human) including primary T cells, primary hematopoietic stem cells, primary human embryonic stem cells (hESCs), and primary induced pluripotent stem cells (iPSCs).
[0244] In some cases, a target cell is a cancerous cell, e.g., in a human. Cancer can be a solid tumor or a hematologic malignancy. The solid tumor can include a sarcoma or a carcinoma. Exemplary sarcoma target cell can include, but are not limited to, cell obtained from alveolar rhabdomyosarcoma, alveolar soft part sarcoma, ameloblastoma, angiosarcoma, chondrosarcoma, chordoma, clear cell sarcoma of soft tissue, dedifferentiated liposarcoma, desmoid, desmoplastic small round cell tumor, embryonal rhabdomyosarcoma, epithelioid fibrosarcoma, epithelioid hemangioendothelioma, epithelioid sarcoma, esthesioneuroblastoma, Ewing sarcoma, extrarenal rhabdoid tumor, extraskeletal myxoid chondrosarcoma, extraskeletal osteosarcoma, fibrosarcoma, giant cell tumor, hemangiopericytoma, infantile fibrosarcoma, inflammatory myofibroblastic tumor, Kaposi sarcoma, leiomyosarcoma of bone, liposarcoma, liposarcoma of bone, malignant fibrous histiocytoma (MFH), malignant fibrous histiocytoma (MFH) of bone, malignant mesenchymoma, malignant peripheral nerve sheath tumor, mesenchymal chondrosarcoma, myxofibrosarcoma, myxoid liposarcoma, myxoinflammatory fibroblastic sarcoma, neoplasms with perivascular epitheioid cell differentiation, osteosarcoma, parosteal osteosarcoma, neoplasm with perivascular epitheioid cell differentiation, periosteal osteosarcoma, pleomorphic liposarcoma, pleomorphic rhabdomyosarcoma, PNET/extraskeletal Ewing tumor, rhabdomyosarcoma, round cell liposarcoma, small cell osteosarcoma, solitary fibrous tumor, synovial sarcoma, or telangiectatic osteosarcoma.
[0245] Exemplary carcinoma target cell can include, but are not limited to, cell obtained from anal cancer, appendix cancer, bile duct cancer (i.e., cholangiocarcinoma), bladder cancer, brain tumor, breast cancer, cervical cancer, colon cancer, cancer of Unknown Primary (CUP), esophageal cancer, eye cancer, fallopian tube cancer, gastroenterological cancer, kidney cancer, liver cancer, lung cancer, medulloblastoma, melanoma, oral cancer, ovarian cancer, pancreatic cancer, parathyroid disease, penile cancer, pituitary tumor, prostate cancer, rectal cancer, skin cancer, stomach cancer, testicular cancer, throat cancer, thyroid cancer, uterine cancer, vaginal cancer, or vulvar cancer.
[0246] Alternatively, the cancerous cell can comprise cells obtained from a hematologic malignancy. Hematologic malignancy can comprise a leukemia, a lymphoma, a myeloma, a non-Hodgkin's lymphoma, or a Hodgkin's lymphoma. In some cases, the hematologic malignancy can be a T-cell based hematologic malignancy. Other times, the hematologic malignancy can be a B-cell based hematologic malignancy. Exemplary B-cell based hematologic malignancy can include, but are not limited to, chronic lymphocytic leukemia (CLL), small lymphocytic lymphoma (SLL), high-risk CLL, a non-CLL/SLL lymphoma, prolymphocytic leukemia (PLL), follicular lymphoma (FL), diffuse large B-cell lymphoma (DLBCL), mantle cell lymphoma (MCL), Waldenstrom's macroglobulinemia, multiple myeloma, extranodal marginal zone B cell lymphoma, nodal marginal zone B cell lymphoma, Burkitt's lymphoma, non-Burkitt high grade B cell lymphoma, primary mediastinal B-cell lymphoma (PMBL), immunoblastic large cell lymphoma, precursor B -lymphoblastic lymphoma, B cell prolymphocytic leukemia, lymphoplasmacytic lymphoma, splenic marginal zone lymphoma, plasma cell myeloma, plasmacytoma, mediastinal (thymic) large B cell lymphoma, intravascular large B cell lymphoma, primary effusion lymphoma, or lymphomatoid granulomatosis. Exemplary T-cell based hematologic malignancy can include, but are not limited to, peripheral T-cell lymphoma not otherwise specified (PTCL-NOS), anaplastic large cell lymphoma, angioimmunoblastic lymphoma, cutaneous T-cell lymphoma, adult T-cell leukemia/lymphoma (ATLL), blastic NK-cell lymphoma, enteropathy-type T-cell lymphoma, hematosplenic gamma-delta T-cell lymphoma, lymphoblastic lymphoma, nasal NK/T-cell lymphomas, or treatment-related T-cell lymphomas.
[0247] In some cases, a cell can be a tumor cell line. Exemplary tumor cell line can include, but are not limited to, 600MPE, AU565, BT-20, BT-474, BT-483, BT-549, Evsa-T, Hs578T, MCF-7, MDA-MB- 231, SkBr3, T-47D, HeLa, DU145, PC3, LNCaP, A549, H1299, NCI-H460, A2780, SKOV-3/Luc, Neuro2a, RKO, RKO-AS45-1, HT-29, SW1417, SW948, DLD-1, SW480, Capan-1, MC/9, B72.3, B25.2, B6.2, B38.1, DMS 153, SU.86.86, SNU-182, SNU-423, SNU-449, SNU-475, SNU-387, Hs 817.T, LMH, LMH/2A, SNU-398, PLHC-1, HepG2/SF, OCI-Lyl, OCI-Ly2, OCI-Ly3, OCI-Ly4, OCI-Ly6, OCI-Ly7, OCI-LylO, OCI-Lyl 8, OCI-Lyl9, U2932, DB, HBL-1, RIVA, SUDHL2, TMD8, MEC1, MEC2, 8E5, CCRF-CEM, MOLT-3, TALL- 104, AML- 193, THP-1, BDCM, HL-60, Jurkat, RPMI 8226, MOLT-4, RS4, K-562, KASUMI-1, Daudi, GA-10, Raji, JeKo-1, NK-92, and Mino.
Methods of Production of Polypeptides
[0248] In certain embodiments, the polypeptides disclosed herein are produced using a suitable method including recombinant and non-recombinant methods (e.g., chemical synthesis).
A. Chemical Synthesis
[0249] Where a polypeptide is chemically synthesized, the synthesis may proceed via liquid-phase or solid-phase. Solid-phase peptide synthesis (SPPS) allows the incorporation of unnatural amino acids and/or peptide/protein backbone modification. Various forms of SPPS, such as Fmoc and Boc, are available for synthesizing polypeptides of the present disclosure. Details of the chemical synthesis are known in the art (e.g., Ganesan A. 2006 Mini Rev. Med. Chem. 6:3-10; and Camarero J.A. et al. 2005 Protein Pept Lett. 12:723-8).
B. Recombinant Production
[0250] Where a polypeptide is produced using recombinant techniques, the polypeptide may be produced as an intracellular protein or as a secreted protein, using any suitable construct and any suitable host cell, which can be a prokaryotic or eukaryotic cell, such as a bacterial (e.g., E. coli) or a yeast host cell, respectively. In certain aspects, eukaryotic cells that are used as host cells for production of the polypeptides include insect cells, mammalian cells, and/or plant cells. In certain aspects, mammalian host cells are used and may include human cells (e.g., HeLa, 293, H9 and Jurkat cells); mouse cells (e.g., NIH3T3, L cells, and C127 cells); primate cells (e.g., Cos 1, Cos 7 and CV1) and hamster cells (e.g., Chinese hamster ovary (CHO) cells). In specific embodiments, the polypeptide disclosed herein are produced in CHO cells.
[0251] A variety of host-vector systems suitable for the expression of a polypeptide may be employed according to standard procedures known in the art. See, e.g., Sambrook et al. 1989 Current Protocols in Molecular Biology Cold Spring Harbor Press, New York; and Ausubel et al. 1995 Current Protocols in Molecular Biology, Eds. Wiley and Sons. Methods for introduction of genetic material into host cells include, for example, transformation, electroporation, conjugation, calcium phosphate methods and the like. The method for transfer can be selected so as to provide for stable expression of the introduced polypeptide -encoding nucleic acid. The polypeptide -encoding nucleic acid can be provided as an inheritable episomal element (e.g., a plasmid) or can be genomically integrated. A variety of appropriate vectors for use in production of a polypeptide of interest are commercially available.
[0252] Vectors can provide for extrachromosomal maintenance in a host cell or can provide for integration into the host cell genome. The expression vector provides transcriptional and translational regulatory sequences and may provide for inducible or constitutive expression where the coding region is operably-linked under the transcriptional control of the transcriptional initiation region, and a transcriptional and translational termination region. In general, the transcriptional and translational regulatory sequences may include, but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, translational start and stop sequences, and enhancer or activator sequences. Promoters can be either constitutive or inducible, and can be a strong constitutive promoter (e.g., T7).
[0253] Also provided herein are nucleic acids encoding the polypeptides disclosed herein. In certain aspects, a nucleic acid encoding the polypeptides disclosed herein is operably linked to a promoter sequence that confers expression of the polypeptide. In certain aspects, the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell. In certain aspects, the nucleic acid is a deoxyribonucleic acid (DNA). In certain aspects, the nucleic acid is a ribonucleic acid (RNA). Also provided herein is a vector comprising the nucleic acid encoding the polypeptides for binding a target nucleic acid as described herein. In certain aspects, the vector is a viral vector.
[0254] In certain aspects, a host cell comprising the nucleic acid or the vector encoding the polypeptides disclosed herein is provided. In certain aspects, a host cell comprising the polypeptides disclosed herein is provided. In certain aspects, a host cell that expresses the polypeptide is also disclosed. Recombinant Polypeptides Comprising Novel Transcription Repressor Domains
[0255] The present disclosure also provides recombinant polypeptide comprising a DNA binding domain and a transcriptional repressor domain, wherein the DNA binding domain and the transcriptional repressor domain are heterologous, wherein the transcriptional repressor domain comprises an amino acid sequence at least 80% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
[0256] In certain aspects, the transcriptional repressor domain comprises an amino acid sequence at least 85% identical, at least 90% identical, at least 95% identical, or a 100% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
[0257] The DNA binding domain may be a zinc finger protein (ZFP), a transcription activator-like effector (TALE), or a guide RNA. In certain aspects, the DNA binding domain may be a DBD as disclosed herein that binds to a target sequence provided herein.
[0258] In certain aspects, the DNA binding domain may bind to a target nucleic acid sequence in a gene. The target nucleic acid sequence may be present in a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the BCL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene. [0259] The present disclosure also provides a nucleic acid encoding the recombinant polypeptide. The nucleic acid may be operably linked to a promoter sequence that confers expression of the polypeptide. [0260] In certain aspects, the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell. In certain aspects, the nucleic acid is a deoxyribonucleic acid (DNA). In certain aspects, the nucleic acid is a ribonucleic acid (RNA).
[0261] The present disclosure also provides a vector comprising the nucleic acid disclosed herein. In certain aspects, the vector may be a viral vector.
[0262] The present disclosure also provides a host cell comprising the nucleic acid or the vector disclosed herein. In certain aspects, the host cell may include the polypeptide. In certain aspects, the host cell may express the polypeptide.
[0263] Also provided herein is a pharmaceutical composition comprising the polypeptide and a pharmaceutically acceptable excipient. The pharmaceutical composition may include the nucleic acid or the vector and a pharmaceutically acceptable excipient.
[0264] Also provided herein is a method of suppressing expression of an endogenous gene in a cell. The method may include introducing into the cell the recombinant polypeptide, wherein the DBD of the polypeptide binds to a target nucleic acid sequence present in the endogenous gene and the heterologous transcriptional repressor domain suppresses expression of the endogenous gene. [0265] In certain aspects, the recombinant polypeptide is introduced as a nucleic acid encoding the polypeptide. The nucleic acid may be a deoxyribonucleic acid (DNA) or RNA. The nucleic acid may be codon optimized for expression in a human cell.
[0266] The target gene may be a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINA1 gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
[0267] The cell may be an animal cell. The cell may be a human cell. The cell may be a cancer cell. The cell may be an ex vivo cell or an in vivo cell.
[0268] In certain aspects, the introducing may include administering the polypeptide or a nucleic acid encoding the polypeptide to a subject. The administering may include parenteral administration. The administering may include intravenous, intramuscular, intrathecal, or subcutaneous administration. The administering may include direct injection into a site in a subject. The administering may include direct injection into a tumor.
Split Systems for modulating gene expression
[0269] Split systems for modulating gene expression are provided. In certain aspects, a DBD and a functional domain are provided as separate polypeptides instead of a single polypeptide and are assembled into a functional complex using dimerization of a heterodimer pair, where the DBD and the functional domain are each fused to a member of the heterodimer pair. In certain aspects, indirect dimerization may also be utilized by using a fused polypeptide comprising two individual members of a heterodimer pair that act as a bridge to bring a DBD and a functional domain together, as explained in detail below.
[0270] These split systems find use in screens for a DBD or a functional domain by, e.g., using a DBD fused to a first member of a heterodimer pair and screening a plurality of candidate functional domains each fused to a second member of the heterodimer pairs and vice versa.
[0271] These split systems find use in providing additional control in modulation of gene expression by a DBD:: functional domain complex. In certain aspects, control of modulation of gene expression may be achieved by having the DBD and functional domain expression on board (e.g., constitutive expression) a cell as separate polypeptides and assembling a functional DBD and functional domain complex by introducing a bridging construct into the cell, when modulation of gene expression is desired. The bridging construct may be expressed transiently thereby modulating gene expression transiently. In certain aspects, control of modulation of gene expression may be achieved by disrupting the DBD and functional domain complex by introducing a disruptor comprising a heterodimer pair or an individual member of a heterodimer pair as explained below.
[0272] As would be understood by the skilled person, the individual components of a split system may be introduced into a cell as nucleic acids encoding the individual components or as polypeptides or a combination thereof.
[0273] The split systems may be used for modulating gene expression in any cell such as a mammalian cell having a target site at which the DBD binds. Examples of such cells are provided herein, e.g., in the preceding sections of the application.
[0274] The heterodimer pairs of the split system include: 37A, 37B; 13A, 13B; DHD37-BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B; DHD154-A, DHD-154B; 37A, 9B; 13A, 37B; 13A, DHD150- B; 37A, DHD37-BBB-B; and DHD37-BBB-A, 37B, where each of 37A, 37B, 13A, 13B, DHD37-BBB- A, DHD37-BBB-B, DHD150-A, DHD150-B, DHD154-A, and DHD-154B, are the individual members of the listed heterodimer pairs. As used herein, the term first member and second member refers to either of the individual members of a listed heterodimer pair.
[0275] The term “37A” and the numeral “1” are used herein interchangeably and in the context of a member of a heterodimer pair refer to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
DSDEHLKKLKTFLENLRRHLDRLDKHIKQLRDILSENPEDERVKDVIDLSERSVRIVKTVIKIFEDS VRKKE (SEQ ID NO: 473), and is capable of binding to 37B, 9B, and DHD37-BBB-B.
[0276] The terms “37B” and “1” are used herein interchangeably and in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
[0277] GSDDKELDKLLDTLEKILQTATKIIDDANKLLEKLRRSERKDPKVVETYVELLKRHEKAV KELLEIAKTHAKKVE (SEQ ID NO: 474), and is capable of binding to 37A, 13A, and DHD37-BBB-A. [0278] The term “13 A” and the numeral “9” are used herein interchangeably and in the context of a member of a heterodimer pair refer to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
[0279] GTKEDILERQRKIIERAQEIHRRQQEILEELERIIRKPGSSEEAMKRMLKLLEESLRLLKELL ELSEESAQLLYEQR (SEQ ID NO: 475), and is capable of binding to 13B, 37B, and DHD150-B.
[0280] The terms “13B” and “9'” are used herein interchangeably and in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
[0281] GTEKRLLEEAERAHREQKEIIKKAQELHRRLEEIVRQSGSSEEAKKEAKKILEEIRELSKRS LELLREILYLSQEQKGSLVPR (SEQ ID NO: 476), and is capable of binding to 13A.
[0282] The term “DHD37-BBB-A” in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
DEEDHLKKLKTHLEKLERHLKLLEDHAKKLEDILKERPEDSAVKESIDELRRSIELVRESIEIFRQS VEEEE (SEQ ID NO: 477), and is capable of binding to DHD37-BBB-B and 37B.
[0283] The term “DHD37-BBB-B” in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
GDVKELTKILDTLTKILETATKVIKDATKLLEEHRKSDKPDPRLIETHKKLVEEHETLVRQHKELA EEHLKRTR (SEQ ID NO: 478), and is capable of binding to DHD37-BBB-A and 37A.
[0284] The term “DHD150-A” in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence: GDVKELTKILDTLTKILETATKVIKDATKLLEEHRKSDKPDPRLIETHKKLVEEHETLVRQHKELA EEHLKRTR (SEQ ID NO: 478), and is capable of binding to DHD150-B.
[0285] The term “DHD150-B” in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence: DNEEIIKEARRVVEEYKKAVDRLEELVRRAENAKHASEKELKDIVREILRISKELNKVSERLIELW ERSQERAR (SEQ ID NO: 479), and is capable of binding to DHD150-A and 13A.
[0286] The terms “DHD154-A” and “DHD-154-A” in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
TAEELLEVHKKSDRVTKEHLRVSEEILKVVEVLTRGEVSSEVLKRVLRKLEELTDKLRRVTEEQR RVVEKLN (SEQ ID NO: 480), and is capable of binding to DHD-154-B.
[0287] The terms “DHD154-B” and “DHD-154-B” in the context of a member of a heterodimer pair refers to a polypeptide comprising an amino acid sequence that is at least 80% identical (e.g., at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical) to the amino acid sequence:
DLEDLLRRLRRLVDEQRRLVEELERVSRRLEKAVRDNEDERELARLSREHSDIQDKHDKLAREIL EVLKRLLERTE (SEQ ID NO: 481), and is capable of binding to DHD-154-A.
[0288] In certain aspects, the present disclosure provides two or more nucleic acids encoding one or more of the members of the heterodimer pairs. In certain aspects, the nucleic acid encoding a fusion protein comprising a DBD and a member of a heterodimer pair and another nucleic acid encoding a fusion protein comprising a functional domain and a member of the heterodimer pair are provided.
[0289] In certain aspects, a plurality of nucleic acids are provided, where the plurality of nucleic acids encode (i) polypeptides that dimerize via direct dimerization, comprising: (A) a DBD fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair, or (B) a DBD fused to a second member of a heterodimer pair and a functional domain fused to a first member of the heterodimer pair, wherein the first and second members of the heterodimer pair bind to each other thereby directly dimerizing the DBD and the functional domain, and wherein the heterodimer pair is selected from one of the following heterodimer pairs: 37A, 37B; 13A, 13B; DHD37-BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B; DHD154-A, DHD-154B; 37A, 9B; 13A, 37B; 13A, DHD150- B; 37A, DHD37-BBB-B; and DHD37-BBB-A, 37B.
[0290] In certain aspects, the DBD in (i) (A) or (i) (B) may be fused to a first member of a first heterodimer pair and the functional domain is a first functional domain fused a second member of the first heterodimer pair and to a first member of a second heterodimer pair, and may be used with a second functional domain fused to a second member of the second heterodimer pair, wherein the members of the first heterodimer pair mediate dimerization of the DBD and the first functional domain and members of the second heterodimer pair mediate dimerization of the first functional domain and the second functional domain. In certain aspects, the DBD is fused to a first member of a first heterodimer pair and to a first member of a second heterodimer pair, and the functional domain is fused a second member of the first heterodimer pair the system further comprising a second functional domain fused to a second member of the second heterodimer pair, wherein the members of the first heterodimer pair mediate assembly of the DBD and the first functional domain and members of the second heterodimer pair mediate assembly of the DBD and the second functional domain.
[0291] In certain aspects, a plurality of nucleic acids are provided, where the plurality of nucleic acids encode (ii) polypeptides that dimerize indirectly via a bridging construct, comprising: (A) a DBD fused to a first member of a first heterodimer pair; a bridging construct comprising a second member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or (B) a DBD fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or (C) a DBD fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a second member of a second heterodimer pair; and a functional domain fused to a first member of the second heterodimer pair, wherein the DBD and the functional domain dimerize indirectly via the bridging construct, wherein the first and second heterodimer pairs are different and are selected from the following heterodimer pairs: 37A, 37B; 13A, 13B; DHD37-BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B; DHD154-A, DHD- 154B; 37A, 9B; 13A, 37B; 13A, DHD150-B; 37A, DHD37-BBB-B; and DHD37-BBB-A, 37B. For example, the DBD may be fused to 37A, the bridging construct may be a fusion of 37B and 13 A, and the functional domain fused to 13B.
[0292] As described in the specification, the DBD may bind to a target nucleic acid sequence present in an endogenous gene in a cell. The functional domain may be an enzyme, a transcriptional activator, a transcriptional repressor, or a DNA nucleotide modifier. The enzyme may be a nuclease, a DNA modifying protein, or a chromatin modifying protein. The nuclease may be a cleavage domain or a half cleavage domain. The cleavage domain or half-cleavage domain may be a type IIS restriction enzyme. The type IIS restriction enzyme may be Fokl or Bfil. The chromatin modifying protein may be lysine- specific histone demethylase 1 (LSD1). The transcriptional activator may be VP16, VP64, p65, p300 catalytic domain, TET1 catalytic domain, TDG, Ldbl self-associated domain, SAM activator (VP64, p65, HSF1), or VPR (VP64, p65, Rta). The transcriptional repressor may be KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, MeCP2, or a novel transcriptional repressor as disclosed herein. The DNA nucleotide modifier may be an adenosine deaminase. The target nucleic acid sequence may be within a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene. The DBD may be a transcription activator-like effector (TALE). The DBD may be a novel DBD as provided herein.
[0293] Also provided herein are a DBD fused to a member of a heterodimer pair, a functional domain fused to a member of a heterodimer pair, a bridging construct comprising a member of a heterodimer pair fused to another member, such as those described in the preceding paragraphs and further described below and those encoded by the plurality of nucleic acids described above. [0294] In certain aspects, a DBA and a functional domain is as set forth in (i)(A) or (i)(B). In certain aspects, a DBD, a bridging construct, and a functional domain is as set forth in (ii)(A), (ii)(B), or (ii)(C). [0295] Also provided herein are host cells that include (a) nucleic acids encoding the polypeptides as set forth in (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in (ii)(A), (ii)(B), or
(h)(C).
[0296] Also provided herein are host cells that include host cells that include (a) the polypeptides as set forth in (i)(A) or (i)(B); or (b) the polypeptides as set forth (ii)(A), (ii)(B), or (ii)(C).
[0297] Also provided herein is a kit comprising: (a) nucleic acids encoding the polypeptides as set forth in (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in (ii)(A), (ii)(B), or (ii)(C). [0298] Also provided herein is a kit comprising: (a) a first vector comprising a nucleic acid encoding the DBD set forth in (i)(A); and (b) a second vector comprising a nucleic acid encoding the functional domain set forth in (i)(A); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in
(i)(B); and (b) a second vector comprising a nucleic acid encoding the functional domain set forth in
(i)(B).
[0299] Also provided herein is a kit comprising: a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(A); a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(A); and a third vector comprising a nucleic acid encoding the functional domain set forth in
(ii)(A); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(B); (b) a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(B); and (c) a third vector comprising a nucleic acid encoding the functional domain set forth in (ii)(B); or a first vector comprising a nucleic acid encoding the DBD set forth (ii)(C); a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(C);and a third vector comprising a nucleic acid encoding the functional domain set forth in (ii)(C).
[0300] Also disclosed are pharmaceutical compositions comprising the nucleic acids disclosed herein or the polypeptides disclosed herein. The pharmaceutical composition may also include a pharmaceutically acceptable excipient. In certain aspects, the pharmaceutical composition may include (a) nucleic acids encoding the polypeptides as set forth in (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in (ii)(A), (ii)(B), or (ii)(C).
[0301] In certain aspects, the pharmaceutical composition may include (a) a first vector comprising a nucleic acid encoding the DBD set forth in (i)(A); and (b) a second vector comprising a nucleic acid encoding the functional domain set forth in (i)(A); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in (i)(B); and (b) a second vector comprising a nucleic acid encoding the functional domain set forth in (i)(B). [0302] In certain aspects, the pharmaceutical composition may include: (a) a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(A); (b) a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(A); and (c) a third vector comprising a nucleic acid encoding the functional domain set forth in (ii)(A); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(B); (b) a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(B); and (c) a third vector comprising a nucleic acid encoding the functional domain set forth in (ii)(B); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in (ii)(C); (b) a second vector comprising a nucleic acid encoding the bridging construct set forth in (ii)(C); and (c) a third vector comprising a nucleic acid encoding the functional domain set forth in (ii)(C). [0303] In certain aspects, the pharmaceutical composition may include the DBD and a functional domain or a DNA binding domain, a functional domain and a bridging construct as provided herein and a pharmaceutically acceptable excipient. In certain aspects, the pharmaceutical composition may include the host cell as provided herein and a pharmaceutically acceptable excipient.
[0304] The split systems of DBD and functional domains and heterodimer pairs may be used in a method for modulating expression from a target gene in a cell. The method may include (i) introducing into the cell a first nucleic acid encoding a DNA binding domain fused to a first member of a heterodimer pair and a second nucleic acid encoding a functional domain fused to a second member of the heterodimer pair; or (ii) introducing into the cell a first nucleic acid encoding a DNA binding domain fused to a second member of a heterodimer pair and a second nucleic acid encoding a functional domain fused to a first member of the heterodimer pair; or (iii) introducing into the cell a DNA binding domain fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair; or (iv) introducing into the cell a DNA binding domain fused to a second member of a heterodimer pair and a functional domain fused to a first member of the heterodimer pair. The heterodimer pair may be selected from one of the following heterodimer pairs: 37A, 37B; 13A, 13B; DHD37-BBB-A, DHD37- BBB-B; DHD150-A, DHD150-B; DHD154-A, DHD-154B; 37A, 9B; 13A, 37B; 13A, DHD150-B; 37A, DHD37-BBB-B; and DHD37-BBB-A, 37B, wherein the DBD dimerizes with the functional domain via dimerization of the members of the heterodimer pair and wherein binding of the DBD to a target nucleic acid sequence in the target gene results in modulation of expression of the target gene via the functional domain dimerized to the DBD.
[0305] In certain aspects, the method may be used for screening a candidate DBD or a candidate functional domain or for ranking DBDs or functional domains based on specificity, activity, and the like. The modulation of expression of the target gene may be assessed to determine whether a DBD is specific for the target gene and/or whether the functional domain is active in repressing or activating expression of the target gene. [0306] The split systems of DBD and functional domains and heterodimer pairs may be used in a method for modulating expression from a target gene in a cell, where the method includes introducing into a cell expressing a DNA binding domain (DBD) fused to a first member of a first heterodimer pair and a functional domain fused to a second member of a second heterodimer pair, a bridging construct comprising a second member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct; or introducing into a cell expressing a DNA binding domain (DBD) fused to a second member of a first heterodimer pair and a functional domain fused to a second member of a second heterodimer pair, a bridging construct comprising a first member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct; or introducing into a cell expressing a DNA binding domain (DBD) fused to a first member of a first heterodimer pair and a functional domain fused to a first member of a second heterodimer pair, a bridging construct comprising a second member of the first heterodimer pair fused to a second member of the second heterodimer pair or a nucleic acid encoding the bridging construct, wherein the DBD and the functional domain dimerize indirectly via the bridging construct, wherein binding of the DBD to a target nucleic acid sequence in a target gene in the cell results in in modulation of expression of the target gene via the functional domain dimerized to the DBD via the bridging construct, wherein the first and second heterodimer pairs are different and are selected from the following heterodimer pairs: 37A, 37B; 13A, 13B; DHD37-BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B; DHD154-A, DHD-154B; 37A, 9B; 13A, 37B; 13A, DHD150-B; 37A, DHD37-BBB-B; and DHD37-BBB-A, 37B.
[0307] Such a system may be used for fine tuning control of modulation of gene expression by controlling expression of the different components required for modulating gene expression.
[0308] Also provided is a method of reversing modulation of expression of a target gene in a cell expressing a DNA binding domain (DBD) fused to a first member of a non-cognate heterodimer pair and a functional domain fused to a second member of the non-cognate heterodimer pair, wherein the DBD binds to a target nucleic acid sequence in a target gene and the functional domain dimerized to the DBD via dimerization of the members of the heterodimer pair modulates expression of the target gene, the method comprising introducing into the cell a disruptor which binds to either the first member or the second member with a higher binding affinity than the binding affinity between the first and second members, wherein non-cognate heterodimer pairs and the corresponding disruptor are selected from one of the following combinations:
[0309] As used herein, the term “non-cognate heterodimer pair” refers to a heterodimer pair whose members bind to each other with an affinity that is lower than the affinity with which members of a “cognate heterodimer pair” bind. For example, 37A,37B is a cognate heterodimer pair while 37A, 9B form a non-cognate heterodimer pair, since the binding affinity between 37A and 37B is higher than that between 37A and 9B. Examples of cognate heterodimer pairs include 37A, 37B; 13A, 13B; DHD37- BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B; and DHD154-A, DHD-154B. While members of a “non-cognate heterodimer” bind to each other, members that are not part of a “non-cognate heterodimer” or a “cognate heterodimer” do not significantly bind to each other and are not considered as members of a heterodimer pair.
[0310] In certain aspect, the fusion polypeptides, such as, DBD fused to a member of a heterodimer pair may be such that the C-terminus of the DBD is fused to the N-terminus of a member of a heterodimer pair and the N-terminus of the functional domain is fused to the C-terminus of a member of a heterodimer pair. In certain aspects, one or more components of the system may be expressed transiently while other component(s) are expressed stably. Stable and transient expression in a cell may be achieved by methods known in the art, such as, transient transfection, gene integration, constitutive and inducible promoters and the like.
EXAMPLES
[0311] These examples are provided for illustrative purposes only and not to limit the scope of the claims provided herein.
Materials and Methods
[0312] TALE backbone sequences:
[0313] N-Cap:
DYKDHDGDYKDHDIDYKDDDDKMAPKKKRKVGIHRGVPMVDLRTLGYSQQQQEKIKPKVRST VAQHHEALVGHGFTHAHIVALSQHPAALGTVAVKYQDMIAALPEATHEAIVGVGKQWSGARA LEALLTVAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHAWRNALTGAPLETPN (SEQ ID NO:339)
[0314] Repeat Unit: LTPDQVVAIASX11X12GGKQALETVQRLLPVLCQDHG (SEQ ID NO:340) [0315] Half repeat unit: LTPEQVVAIAS X11X12GG (SEQ ID NO:341) [0316] RVD= X11X12; X11X12=NH for binding G; NG for binding T; NI for binding A; and HD for binding C.
[0317] C-Cap:
RPALESIVAQLSRPDPALAALTNDHLVALACLGGRPALDAVKKGLPHAPALIKRTNRRIPERTSH RVA (SEQ ID N: 342)
[0318] Flexible linker between C-Cap and KRAB: GAGGGGGMDAKSLTAW S (SEQ ID N0 343) [0319] KRAB:
RTLVTFKDVFVDFTREEWKLLDTAQQIVYRNVMLENYKNLV SLGY QLTKPDVILRLEKGEEP (SEQ ID NO:338)
[0320] Anti CD19 CAR-T cell manufacturing: Primary T cells were thawed and activated with CD3/CD28 Dynabeads and cultured for 48 hours prior to electroporation with either no mRNA (control) or mRNA encoding the TALE-TFs against PD1. At 24 hours post electroporation T cells were transduced with a lentivirus vector encoding a 3rd generation anti CD 19 CAR construct on Retronectin at an MOI of 5 to 10. After 24 hours the virus and beads were removed and T cells expanded in RPMI + 10%FBS + IL- 2 for up to 5 days.
[0321] Co-culture (killing) assay: CAR-T cells and control T cells were incubated with CD 19- expressing NALM-6 cells or NALM-6 cells engineered to express PDL-1 (the ligand for PD-1) or NALM-6 cells in which the target antigen CD 19 was knocked out using TALENs (CD 19 KO) at an effector-to-target (E:T) ratio of 1: 1 in a 96-well round bottom culture plate for 16 hours at 37degrees with 5% C02. After 16 hours of incubation, specific target cell killing was measured by release of lactate dehydrogenase (LDH) into the supernatant (Promega kit #) or by flow cytometry analysis.
[0322] Animal model: Human B-Acute Lymphoblastic Leukemia (ALL) NALM-6 cells expressing CD19 were implanted intra-venously into NOD SCID Gamma (NSG) mice at 0.5 million cells per mouse. 5 days later when tumor engraftment was detectable by in vivo imaging, mice were injected intra- venously with 2.5 million anti-CD19 CAR+ T cells either treated or untreated with the anti PD-1 TALE- TF pAL043. Mice were bled once per week after infusion and blood was processed for flow cytometry to detect human CD3+ T cells, CAR-T cells and measure expression of PD-1.
[0323] Off Target Analysis: CD3+ cells were electroporated with TALETFs (either single or multiplexed) in triplicate. Cells were harvested at 2 days post-transfection for RNA extraction and parallel analysis of expression using flow cytometry. Total RNA was extracted from these samples and from control T cells electroporated without mRNA using Qiagen miRNeasy extraction kit. Total RNA samples were constructed into libraries using Illumina's TruSeq Stranded Total RNA Plurality of nucleic acids Prep Gold kit. Libraries were then sequenced using Illumina's Hiseq 4000 platform with 2x76bp read length to a depth of 25-50 million reads per sample. Reads were aligned using STAR paired alignment (RNA-STAR 2.3.1), mapped to the GRCh38 human genome assembly, and differential gene expression analysis was performed using edgeR.
[0324] Synthetic Repressor Design and Assembly. TAF monomers were cloned and assembled into full length TAFs with modifications to established methods (T. Cermak et al. , Nucleic Acids Res 39, e82 (2011); T. Sakuma et al., Genes Cells 18, 315-326 (2013) into a pVAX-based plasmid and included an N-terminal 3x-FFAG tag and SV40 nuclear localization signal. Functional domains were selected by literature search for evidence of transcriptional repressive function and annotated DNA- binding domains removed in silico before synthesis and incorporation into TAF or heterodimer constructs. Functional domains were added by Infusion cloning (Takara Bio; catalog # 638909) onto the C-terminal end of the TAF. Functional domain constructs contained a 15 amino acid linker domain (GGGGGMDAKSFTAWS) (SEQ ID NO: 109) and either an epigenetic-functional domain (e.g. - KRAB) or heterodimer protein (e.g. - 9' of the 9:9' pair).
[0325] Obligate heterodimers. Mutually orthogonal heterodimer pairs listed in Table 14 were designed and synthesized. Heterodimer sequences were appended to sequences encoding TAF-DBDs or effector domains via colinear placement in plasmids used for in vitro RNA transcription. Heterodimer epigenetic domain constructs for screening were designed with a T7 promoter, NFS (nuclear localization signal), heterodimer protein (e.g. - 9' of the 9:9' pair), the 15 amino acid linker (see above), and the functional domain (e.g. - KRAB); and generated as double-stranded DNA (Integrated DNA Technologies; gBlocks Gene Fragments).
EXAMPLE 1
Identification of TALE-TFs for PDCD1 repression [0326] This example illustrates identification of TALE-TFs that significantly repress PD-1 expression. Figure 1 provides a pictorial map of all of the regions in the PDCD-1 gene that were tested for identifying TAFE-TFs that significantly repress PD-1 expression. The results are provided in Table 9 below:
[0327] Table 9: TL11107 PDCD1 PROMOTER - GAGCGGAAGGGAAACTGTC 357 <50%
100 +100 23 EPITF chr2:24185888 C
9-241858910 PLUS
TL11113 PDCD1 PROMOTER - CAGGTTGAAGGGAGGGTGC 358 <50%
100 +100 29 EPITF chr2:24185891
4-241858934 PLUS
TL11115 PDCD1 PROMOTER - GAAGGGAGGGTGCCCGCCC 359 <50%
100 +100 31 EPITF chr2:24185892 C
0-241858941 PLUS
TL11116 PDCD1 PROMOTER - GCCCGCCCCTTGCTC 360 <50%
100 +100 32 EPITF chr2:24185893
1-241858947 PLUS
TL11117 PDCD1 PROMOTER - GCCCGCCCCTTGCTCCC 361 <50%
100 +100 33 EPITF chr2:24185893
1-241858949 PLUS
TL11118 PDCD1 PROMOTER - TGCTCCCGCCCCCTC 362 <50%
100 +100 34 EPITF chr2:24185893
1-241858952 PLUS
TL11121 PDCD1 PROMOTER - GGAGGAAGAGGGGGCGG 363 <50%
100 +100 37 EPITF chr2:24185894
7-241858965 MINUS
TL11122 PDCD1 PROMOTER - GGATGTGGAGGAAGAGGGG 364 <50%
100 +100 38 EPITF chr2:24185895 G
0-241858971 MINUS
TL11878 PDCD1 PROMOTER - TGAAGGGAGGGTGCCCG 365 <50%
100 +100 30 EPITF chr2:24185891
9-241858937
[0328] FIG. 1A illustrates the locations in the PDCD1 gene to which the DBDs of the indicated recombinant polypeptides were designed to bind. Recombinant polypeptides that repressed expression of PDCD1 in at least 50% of cells treated with the recombinant polypeptides are indicated by clear arrows . Recombinant polypeptides that repressed expression of PDCD1 in less than 50% of the cells treated with the recombinant polypeptides are indicated by solid arrows . The orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand. Arrows having the orientation are designed to bind to the sense strand.
[0329] The analysis of repression by the disclosed recombinant polypeptides that are designed bind to these sequences identified certain regions that provide repression of PDCD-1 expression in at least 50% of the cells expressing these recombinant polypeptides. These regions are depicted in FIGS. 1B-1C and include regions 1-4. In regions 1, 2, 3, the anti-sense strand of the PDCD-1 gene was successfully targeted to significantly repress expression of PD-1. In region 4, the sense strand was identified as the region of the PDCD-1 gene that can be successfully target for repression. In addition, certain sequences in the sense strand in region 1 were also identified a region that can be targeted for repression. Tables 1-4 illustrate the sequences present in each of Regions 1-4 that can be targeted for repression. [0330] FIG. 2 shows the fold change in number of PD-1 expressing cells 2 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
[0331] FIG. 3 shows effect of dose of mRNA encoding the recombinant polypeptide, pAF040 and pAF043, on the percent of CD3+ T cells expressing PD-1, 3 days after transfection. CD3+ T cells were activated with beads and electroporated 48 hours post activation according to standard process with varying concentration of TAFE-TF mRNA from 3ng to 2ug per transfection (250,000 T cells per condition). PD-1 expression by flow was measured on day 3 post transfection.
[0332] FIG. 4 shows the fold change in number of PD-1 -positive cells at the indicated number of days post-transfection of mRNA encoding the indicated recombinant polypeptide relative to control, which are cells electroporated without repressor mRNA. PD-1 repression is durable for about 2 weeks in culture and after freeze-thaw.
[0333] FIGS. 5 A and 5B show that PD-1 repression with pAF043 in anti-CD 19 CAR-T cells is sustained after in vivo expansion and clearance of CD19-positive NAFM-6 B-AFF tumor model in NSG mice. [0334] In addition to regions 1-4, targeting the sequence GGCCAGGGCGCCTGT (SEQ ID NO: 36) by TAFE-TF TL11084 also significantly suppressed PD-1 expression.
EXAMPLE 2
Identification of TALE-TFs for TIM3 repression [0335] This example illustrates identification of TALE-TFs that significantly repress TIM3 expression. Figure 6 provides a pictorial map of all of the regions in the TIM3 gene that were tested for identifying TAFE-TFs that significantly repress TIM3 expression. The results are provided in Table 10 below:
[0336] Table 10:
[0337] FIG. 6 illustrates the locations in the TIM3 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of TIM3 in at least 50% of the cells are indicated by unfilled arrows . Recombinant polypeptides that repressed expression of TIM3 in less than 50% of the cells are indicated by filled arrows . The orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand.
Arrows having the orientation are designed to bind to the sense strand.
[0338] FIG. 7 shows the fold change in number of cells expressing TIM3 at 2 days, 5 days, 8 days, or 14 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells. [0339] FIG. 8 shows the fold change in number of cells expressing TIM3 at 3 days or 6 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
EXAMPLE 3
Identification of TALE-TFs for CTLA4 repression [0340] This example illustrates identification of TALE-TFs that significantly repress CTLA4 expression. Figure 9 provides a pictorial map of all of the regions in the CTFA4 gene that were tested for identifying TAFE-TFs that significantly repress CTFA4 expression. The results are provided in Table 11 below:
[0341] Table 11:
[0342] FIG. 9 illustrates the locations in the CTLA4 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of CTLA4 in at least 50% of the cells are indicated by unfdled arrows . Recombinant polypeptides that repressed expression of CTLA4 in less than 50% of the cells are indicated by filled arrows . The orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand. Arrows having the orientation are designed to bind to the sense strand.
[0343] FIG. 10 shows the fold change in number of cells expressing CTLA4 at 3 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
EXAMPLE 4 Identification of TALE-TFs for LAG3 repression [0344] This example illustrates identification of TALE-TFs that significantly repress LAG3 expression. Figure 11 provides a pictorial map of all of the regions in the LAG3 gene that were tested for identifying TAFE-TFs that significantly repress LAG3 expression. The results are provided in Table 12 below:
[0345] Table 12:
[0346] FIG. 11 illustrates the locations in the LAG3 gene at which the DBDs of the indicated recombinant polypeptides bind. Recombinant polypeptides that repressed expression of LAG3 in at least
50% of the cells are indicated by unfdled arrows . Recombinant polypeptides that repressed expression of LAG3 in less than 50% of the cells are indicated by fdled arrows . The orientation of the arrows indicates the DNA strand to which the recombinant polypeptide is designed to bind. Arrows having the orientation are designed to bind to the anti-sense strand. Arrows having the orientation are designed to bind to the sense strand.
[0347] FIG. 12 shows the fold change in number of cells expressing LAG3 at 2 days, 7 days, or 12 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
[0348] FIG. 13 shows the fold change in number of cells expressing LAG3 at 2 days after transfection of mRNA encoding the indicated recombinant polypeptides into CD3+ T cells.
EXAMPLE 5
Multiplexing of TALE-TFs for PDCD1, TIM3, and LAG3 repression [0349] FIGS. 14A and 14B show multiplexing of recombinant polypeptides to simultaneously suppress expression of PD-1, LAG3, and TIM3 is a single cell.
[0350] FIGS. 15A-15C illustrates specificity of the recombinant polypeptides as indicated by lack of significant off-target effect as measured by RNA-seq.
[0351] Anti-CD19 CAR-T cells were treated with epiTFs against PD-1, LAG3, and TIM3 and then used against a B-Cell Acute Lymphoblastic Leukemia (B-ALL) xenograft model in Non-obese Diabetic, NOD Scid Gamma (NSG) mice.
[0352] CAR-T cells were manufactured using lentivirus delivery of a 3rd generation anti-CD 19 CAR containing FMC63 scFv, CD28 and 4-1BB co-stimulatory domains, and a truncated EGFRtag (Lenti- EF1a-CD19-EGFRt-3rd-CAR Vector, Creative Biolabs). Primary human T cells were activated with Dynabeads as previously described and transfected by electroporation with repressor mRNA at 48 hours post activation. Transfected cells along with no-mRNA transfected controls were allowed to recover for 24 hours after electroporation and then transduced with lentivirus encoding the CAR on RetroNectin (Takara Bio) according to manufacturer's protocol at an MOI of 5 and in the absence of serum. At 24 hours post transduction beads and virus were removed and CAR-T cells were allowed to expand in media with IL-2 until day 11 post activation when they were washed with PBS and administered to mice. Prior to using in animals, CAR-T cells were analyzed by flow cytometry for CAR expression (via EGFR staining) and expression of immune checkpoint genes (PD-1, LAG3, and TIM3).
[0353] Animal experiments were conducted at the Fred Hutchinson Cancer Center, Comparative Medicine department (Seattle, WA) according to an approved IACCUC protocol. Female NSG mice aged 6-8 weeks were implanted intravenously with 5x105 NAFM-6-luc-GFP tumor cells (human B-AFF cancer cells expressing CD 19) and tumors were measured by total bioluminescent flux using a Xenogen Imaging System (Perkin Elmer). Each experimental arm contained 5 mice. At 4 days post tumor implantation mice were imaged and randomized into treatment arms based on baseline tumor burden. On day 5 post implantation mice were dosed intravenously with 250,000 anti-CD19 CAR-T cells either treated or untreated with repressor mRNA. Peripheral blood was collected via retroorbital bleeding at weekly intervals into EDTA-coated tubes at room temperature. Red blood cell lysis was performed using (1X RBC Lysis Buffer, eBiosciences Cat. # 333-57) according to manufacturer's protocol. Flow cytometry was performed as previously described. At 3 weeks post initial dosing mice were re-challenged with 5x105 NAFM-6-luc-GFP tumor cells to test for persistence and activity of circulating CAR-T cells in the blood.
[0354] FIG. 19 shows a schematic of an anti-CD 19 CAR-T cell in which expression of PD1, TIM3, and LAG3 has been repressed using the engineered polypeptides (pAL043+TL8188+TL8222) described herein.
[0355] FIG. 20 shows flow cytometry data confirming repression of PD1, TIM3, and LAG3 expression in the multiplex-treated CAR-T cells. Flow cytometry, performed on CAR-T cells prior to infusion, showed repression of all three targeted immune checkpoint genes in the multiplex-treated CAR-T cells. [0356] FIG. 21 provides an overview of in vivo leukemia xenograft model and treatment using indicated CAR-T cells.
[0357] FIG. 22 demonstrates that multiplexed repression of immune checkpoint genes is sustained in vivo. Flow cytometry showed persistent repression of immune checkpoint genes at 1 week post dosing CAR-Ts into mice. [0358] FIG. 23 demonstrates that multiplexed repression of immune checkpoint genes enhances CAR-Ts ability to resist tumor re-challenge. Tumor burden as measured by total flux (bioluminescence) showed all mice were initially “cured” of leukemia in all treatment arms, but upon re-challenge with leukemia cells only the mice treated with CAR-Ts in which all 3 immune checkpoint genes were repressed were able to completely resist tumor formation. This indicates superior persistence and resistance to exhaustion.
[0359] FIG. 24 shows expansion of CAR-Ts in the mouse blood. Flow cytometry data showed expansion of CAR-T cells in the mouse blood (measured as human CD3+ T cells). After the re-challenge the multiplex-treated T cells expanded the best, in line with their enhanced proliferative capacity and resistance to exhaustion.
EXAMPLE 6
Identification of novel transcriptional repressors [0360] FIG. 16 shows characterization of repression of TIM3 expression by the listed candidate transcriptional repressors.
[0361] FIG. 17 shows characterization of repression of LAG3, TIM3, or PD-1 expression by the listed candidate transcriptional repressors.
[0362] FIG. 18 shows characterization of repression of TIM3 expression by the listed candidate transcriptional repressors.
[0363] The sequences of the candidate transcriptional repressors are as follows:
[0364] MBD2:
[0365] MRAHPGGGRCCPEQEEGESAAGGSGAGGDSAIEQGGQGSALAPSPVSGVRREGARGGG RGRGRWKQ AGRGGGV CGRGRGRGRGRGRGRGRGRGRGRPP SGGSGLGGDGGGCGGGGSGGG GAPRREPVPFPSGSAGPGPRGPRATESGKRMSKLQKNKQRLRNDPLNQNKGKPDLNTTLPIRQT ASIFKQPVTKVTNHPSNKVKSDPQRMNEQPRQLFWEKRLQGLSASDVTEQIIKTMELPKGLQGV GPGSNDETLLSAVASALHTSSAPITGQVSAAVEKNPAVWLNTSQPLCKAFIVTDEDIRKQEERVQ QVRKKLEEALMADILSRAADTEEMDIEMDSGDEA (SEQ ID NO: 81)
[0366] MBD3:
[0367] MRVRYDSSNQVKGKPDLNTALPVRQTASIFKQPVTKITNHPSNKVKSDPQKAVDQPRQL FWEKKLSGLNAFDIAEELVKTMDLPKGLQGVGPGCTDETLLSAIASALHTSTMPITGQLSAAVEK NPGVWLNTTQPLCKAFMVTDEDIRKQEELVQQVRKRLEEALMADMLAHVEELARDGEAPLDK ACAEDDDEEDEEEEEEEPDPDPEMEHV (SEQ ID NO: 82)
[0368] MeCP2:
[0369] MASSPKKKRKVEASVQVKRVLEKSPGKLLVKMPFQASPGGKGEGGGATTSAQVMVIKR PGRKRKAEADPQAIPKKRGRKPGS VVAAAAAEAKKKAVKES SIRS V QETVLPIKKRKTRETV SIE VKEVVKPLLVSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSSASSPPKKEHHHHHHHAESPKAP
MPLLPPPPPPEPQSSEDPISPPEPQDLSSSICKEEKMPRAGSLESDGCPKEPAKTQPMVAAAATTTT
[0370] CTBP1:
[0371] MGSSHLLNKGLPLGVRPPIMNGPLHPRPLVALLDGRDCTVEMPILKDVATVAFCDAQST QEIHEKVLNEAVGALMYHTITLTREDLEKFKALRIIVRIGSGFDNIDIKSAGDLGIAVCNVPAASV EETADSTLCHILNLYRRATWLHQALREGTRVQSVEQIREVASGAARIRGETLGIIGLGRVGQAVA LRAKAFGFNVLFYDPYLSDGVERALGLQRVSTLQDLLFHSDCVTLHCGLNEHNHHLINDFTVKQ MRQGAFFVNTARGGFVDEKAFAQAFKEGRIRGAAFDVHESEPFSFSQGPFKDAPNFICTPHAAW YSEQASIEMREEAAREIRRAITGRIPDSFKNCVNKDHFTAATHWASMDPAVVHPEFNGAAYRYP PGVVGVAPTGIPAAVEGIVPSAMSFSHGFPPVAHPPHAPSPGQTVKPEADRDHASDQF (SEQ ID NO: 84)
[0372] ZNF283:
[0373] MESRSVAQAGVQWCDFGSFQAPPPGFTFFSCFSFFSSWDYSSGFSGFCASPIEESHGAFIS SCNSRTMTDGFVTFRDVAIDFSQEEWECFDPAQRDFYVDVMFENYSNFVSFDFESKTYETKKIF SENDIFEINFSQWEMKDKSKTFGFEASIFRNNWKCKSIFEGFKGHQEGYFSQMIISYEKIPSYRKS KSFTPHQRIHNTE (SEQ ID NO: 85)
[0374] ZNF283+B:
[0375] MESRSVAQAGVQWCDFGSFQAPPPGFTFFSCFSFFSSWDYSSGFSGFCASPIEESHGAFIS SCNSRTMTDGFVTFRDVAIDFSQEEWECFDPAQRDFYVDVMFENYSNFVSFGYQFTKPDVIFRF EKGEEPIFRNNWKCKSIFEGFKGHQEGYFSQMIISYEKIPSYRKSKSFTPHQRIHNTE (SEQ ID NO:86)
[0376] ZNF133:
[0377] MAFRDVAVDFTQDEWRLLSPAQRTLYREVMLENY SNLV SLGISFSKPELITQLEQGKET WREEKKCSPATCPDPEPELYLDPFCPPGFSSQKFPMQHVLCNHPPWIFTCLCAEGNIQPGDPGPG DQEKQQQASEGRPWSDQAEGPEGEGAMPLFGRTKKRTLGAFSRPPQRQPVSSRNGLRGVELEAS PAQSGNPEETDKLLKRIEVLGFGTV (SEQ ID NO: 87)
[0378] ZNF140:
[0379] MSQGSVTFRDVAIDFSQEEWKWLQPAQRDLYRCVMLENYGHLVSLGLSISKPDVVSLLE QGKEPWLGKREVKRDLFSVSESSGEIKDFSPKNVIYDDSSQYLIMERILSQGPVYSSFKGGWKCK DHTEMLQENQGCIRKVTVSHQEALAQHMNISTVERP (SEQ ID NO: 88)
[0380] ZNF45: [0381] MTKSKEAVTFKDVAVVFSEEEFQFFDFAQRKFYRDVMFENFRNVVSVGHQSTPDGFPQ LEREEKLWMMKMATQRDNSSGAKNLKEMETLQEVGLRYLPHEELFCSQIWQQITRELIKYQDS VVNIQRTGCQLEKRDDLHYKDEGFSNQSSHLQVHRVHTGEKP (SEQ ID NO: 89)
[0382] ZNF274:
[0383] MASRLPTAW SCEPVTFEDVTLGFTPEEWGLLDLKQKSLYREVMLENYRNLV SVEHQLS
KPDVVSQLEEAEDFWPVERGIPQDTIPEYPELQLDPKLDPLPAESPLMNIEVVEVLTLNQEVAGPR
NAQIQALYAEDGSLSADAPSEQVQQQGKHPGDPEAARQRFRQFRYKDMTGPREALDQLRELCH
QWLQPKARSKEQILELLVLEQFLGALPVKLRTWVESQHPENCQEVVALVEGVTWMSEEEVLPA
GQPAEGTTCCLEVTAQQEEKQEDAAICPVTVLPEEPVTFQDVAVDFSREEWGLLGPTQRTEYRD
VMLETFGHLV S VGWETTLENKELAPNSDIPEEEPAPSLKV QES SRDCALS STLEDTLQGGV QEV Q
DTVLKQMESAQEKDLPQKKHFDNRESQANSGALDTNQVSLQKIDNPESQANSGALDTNQVLLH
KIPPRKRLRKRDSQVKSMKHNSRVKIHQKSCERQKAKEGNGCRKTFSRSTKQITFIRIHKGSQV
(SEQ ID NO:90)
[0384] TRIM28D:
[0385] GVKRSRSGEGEV SGLMRKVPRV SLERLDLDLTADSQPPVFKVFPGSTTEDYNLIVIERGA
AAAATGQPGTAPAGTPGAPPFAGMAIVKEEETEAAIGAPPTATEGPETKPVFMAFAEGPGAEGP
RFASPSGSTSSGFEVVAPEGTSAPGGGPGTFDDSATICRVCQKPGDFVMCNQCEFCFHFDCHFPA
FQDVPGEEWSCSFCHVFPDFKEEDGSFSFDGADSTGVVAKFSPANQRKCERVFFAFFCHEPCRP
FHQFATD STF SFDQPGGTFDFTFIRARFQEKFSPPY S SPQEFAQD V GRMFKQFNKFTEDKAD V Q S
IIGFQRFFETRMNEAFGDTKFSAVFVEPPPMSFPGAGFSSQEFSGGPGDGPGVKRSRSGEGEVSGF
MRKVPRVSFERFDFDFTADSQPPVFKVFPGSTTEDYNFIVIERGAAAAATGQPGTAPAGTPGAPP
FAGMAIVKEEETEAAIGAPPTATEGPETKPVFMAFAEGPGAEGPRFASPSGSTSSGFEVVAPEGTS
APGGGPGTFDDSATICRVCQKPGDFVMCNQCEFCFHFDCHFPAFQDVPGEEWSCSFCHVFPDFK
EEDGSFSFDGADSTGVVAKFSPANQRKCERVFFAFFCHEPCRPFHQFATDSTFSFDQPGGTFDFT
FIRARFQEKFSPPYSSPQEFAQDVGRMFKQFNKFTEDKADVQSIIGFQRFFETRMNEAFGDTKFS
AVFVEPPPMSFPGAGFSSQEFSGGPGDGP (SEQ ID NO:91)
[0386] CBX5-phos:
[0387] MGKKTKRTADDDDDEDEEEYVVEKVFDRRVVKGQVEYFFKWKGFSEEHNTWEPEKNF DCPEFISEFMKKYKKMKEGENNKPREKSESNKRKSNFSNSADDIKSKKKREQSNDIARGFERGFE PEKIIGATDSCGDFMFFMKWKDTDEADFVFAKEANVKCPQIVIAFYEERFTWHAYPEDAENKEK ETAKS (SEQ ID NO: 92)
[0388] CBX5:
[0389] MGKKTKRTAD S S S SEDEEEYVVEKVLDRRVVKGQ VEYLLKWKGF SEEHNTWEPEKNL DCPELISEFMKKYKKMKEGENNKPREKSESNKRKSNFSNSADDIKSKKKREQSNDIARGFERGLE PEKIIGATDSCGDLMFLMKWKDTDEADLVLAKEANVKCPQIVIAFYEERLTWHAYPEDAENKEK ETAKS (SEQ ID NO:93)
[0390] SUV39H2:
[0391] MAAVGAEARGAWCVPCLVSLDTLQELCRKEKLTCKSIGITKRNLNNYEVEYLCDYKVV
KDMEYYLVKWKGWPDSTNTWEPLQNLKCPLLLQQFSNDKHNYLSQVKKGKAITPKDNNKTLK
PAIAEYIVKKAKQRIALQRWQDELNRRKNHKGMIFVENTVDLEGPPSDFYYINEYKPAPGISLVN
EATFGCSCTDCFFQKCCPAEAGVLLAYNKNQQIKIPPGTPIYECNSRCQCGPDCPNRIVQKGTQYS
LCIFRTSNGRGWGVKTLVKIKRMSFVMEYVGEVITSEEAERRGQFYDNKGITYLFDLDYESDEFT
VDAARYGNVSHFVNHSCDPNFQVFNVFIDNFDTRFPRIAFFSTRTINAGEEFTFDYQMKGSGDIS
SDSIDHSPAKKRVRTV CKCGAVTCRGYFN (SEQ ID NO:94)
[0392] IKZF:
[0393] MNYFESMGFPGTFYPVIKEETNHSEMAEDFCKIGSERSFVFDRFASNVAKRKSSMPQKF FGDKGFSDTPYDSSASYEKENEMMKSHVMDQAINNAINYFGAESFRPFVQTPPGGSEVVPVISP MYQFHKPFAEGTPRSNHSAQDSAVENFFFFSKAKFVPSEREASPSNSCQDSTDTESNNEEQRSGF IYFTNHIAPHARNGFSFKEEHRAYDFFRAASENSQDAFRVVSTSGEQMKVYKCEHCRVFFFDHV MYTIHMGCHGFRDPFECNMCGYHSQDRYEFSSHITRGEHRFHMS (SEQ ID NO:95)
[0394] ATF7IP:
[0395] RSKSEDMDNVQSKRRRYMEEEYEAEFQVKITAKGDINQKLQKVIQWLLEEKLCALQCA
VFDKTLAELKTRVEKIECNKRHKTVLTELQAKIARLTKRFEAAKEDLKKRHEHPPNPPVSPGKTV
NDVNSNNNMSYRNAGTVRQMLESKRNVSESAPPSFQTPVNTVSSTNLVTPPAVVSSQPKLQTPV
TSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILHVPVAVSSQPQLLQSHPGTLVTN
QPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAV
PTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDAS
VSKKAADSTSQCGKATGSDSSGVIDLTMDDEESGASQDPKKLNHTPVSTMSSSQPVSRPLQPIQP
APPLQPSGVPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPAPPAQA
PLRGTVMQAPAVRQVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTVHHRPPQVHTEPPRPV
HPAPLPEAPQPQRLPPEAASTSLPQKPHLKLARVQSQNGIVLSWSVLEVDRSCATVDSYHLYAYH
EEPSATVPSQWKKIGEVKALPLPMACTLTQFVSGSKYYFAVRAKDIYGRFGPFCDPQSTDVISST
QSS (SEQ ID NO:96)
[0396] DNMT3A-DNMT3L:
[0397] IRVLSLFDGIATGLLVLKDLGIQVDRYIASEVCEDSITVGMVRHQGKIMYVGDVRSVTQK HIQEWGPFDLVIGGSPCNDLSIVNPARKGLYEGTGRLFFEFYRLLHDARPKEGDDRPFFWLFENV VAMGV SDKRDISRFLESNPVMIDAKEV SAAHRARYFWGNLPGMNRPLASTVNDKLELQECLEH GRIAKFSKVRTITTRSNSIKQGKDQHFPVFMNEKEDILWCTEMERVFGFPVHYTDVSNMSRLARQ RLLGRSWSVPVIRHLFAPLKEYFACVSSGNSNANSRGPSFSSGLVPLSLRGSHNPLEMFETVPVW RRQPVRVLSLFEDIKKELTSLGFLESGSDPGQLKHVVDVTDTVRKDVEEWGPFDLVYGATPPLG HTCDRPPSWYLFQFHRLLQYARPKPGSPRPFFWMFVDNLVLNKEDLDVASRFLEMEPVTIPDVH GGSLQNA VRVW SNIP AIRS SRHWALV SEEELSLLAQNKQ S SKLA AKWPTKLVKN CFLPLREYFK YFSTELTSSL (SEQ ID NO:97)
[0398] DNMT3B:
[0399] MKGDTRHLNGEEDAGGREDSILVNGACSDQSSDSPPILEAIRTPEIRGRRSSSRLSKREVSS
LLSYTQDLTGDGDGEDGDGSDTPVMPKLFRETRTRSESPAVRTRNNNSVSSRERHRPSPRSTRGR
QGRNHVDE SP VEFP ATRSLRRRATAS AGTPWP SPP S S YLTIDLTDDTEDTHGTPQ S S STPY ARLAQ
DSQQGGMESPQVEADSGDGDSSEYQDGKEFGIGDLVWGKIKGFSWWPAMVVSWKATSKRQA
MSGMRWVQWFGDGKFSEVSADKLVALGLFSQHFNLATFNKLVSYRKAMYHALEKARVRAGK
TFPSSPGDSLEDQLKPMLEWAHGGFKPTGIEGLKPNNTQPVVNKSKVRRAGSRKLESRKYENKT
RRRTADDSATSDYCPAPKRLKTNCYNNGKDRGDEDQSREQMASDVANNKSSLEDGCLSCGRK
NPVSFHPLFEGGLCQTCRDRFLELFYMYDDDGYQSYCTVCCEGRELLLCSNTSCCRCFCVECLE
VLVGTGTAAEAKLQEPWSCYMCLPQRCHGVLRRRKDWNVRLQAFFTSDTGLEYEAPKLYPAIP
AARRRPIRVLSLFDGIATGYLVLKELGIKVGKYVASEVCEESIAVGTVKHEGNIKYVNDVRNITK
KNIEEWGPFDLVIGGSPCNDLSNVNPARKGLYEGTGRLFFEFYHLLNYSRPKEGDDRPFFWMFE
NVVAMKVGDKRDISRFLECNPVMIDAIKVSAAHRARYFWGNLPGMNRPVIASKNDKLELQDCL
EYNRIAKLKKVQTITTKSNSIKQGKNQLFPVVMNGKEDVLWCTELERIFGFPVHYTDVSNMGRG
ARQKLLGRSW SVPVIRHLFAPLKDYFACE (SEQ ID NO:98)
[0400] ZNF-657-Krab:
[0401] SQGRVTFEDVTVNFTQGEWQRLNPEQRNLYRDVMLENY SNLV SVGQGETTKPDVILRL EQGKEPWLEEEEVLGSGRAE (SEQ ID NO: 99)
[0402] ZNF-554-Krab:
[0403] SQELVTFEDVSMDFSQEEWELLEPAQKNLYREVMLENYRNVVSLEALKNQCTDVGIKE GPLSPAQTSQVTSLSSWTGYLLFQPVASSHLEQREALWIEEKGTPQASCS (SEQ ID NO: 100)
[0404] ZNF-324-Krab:
[0405] MAFEDVAVYFSQEEWGLLDTAQRALYRRVMLDNFALVASLGLSTSRPRVVIQLERGEEP WVPSGTDTTLSRTTYRRRNPGSWSLTEDRDV S (SEQ ID NO: 101)
EXAMPLE 7
Split transcriptional repressors
[0406] Modularity is a hallmark of transcription factors. Split encoding of DNA targeting and functional activities on separate molecules, as exemplified in RNA-guided systems such as Cas/CRISPR, offers substantial potential for flexibility and scale. We reasoned that if synthetic repressors could be decomposed into separately delivered T-DBDs (TALE-DBDs) and repressor domains that assembled in situ, it would be possible to screen large numbers of functional alternatives to KRAB by delivering them to the same target site. It would also open new avenues for implementing complex combinatorial cell engineering programs.
[0407] Orthogonal protein heterodimer pairs (Z. Chen et a ., Nature 565, 106-111, 2019) offer an attractive system for ordered protein-protein pairing. However, the ability of such pairs to function in the complex environment of human cells is unknown. We first tested whether T-DBD and KRAB domains could be split and efficiently assembled following electroporation as separate molecules. We designed modified synthetic repressors that incorporated one half of an orthogonal protein heterodimer pair (see Table 14) after the C-terminal residue of the PD-1 synthetic repressor T-DBD. On a separately encoded molecule, we engineered its cognate half upstream of the N-terminal residue of KRAB. Introduction of either the separately encoded T-DBD/heterodimer or heterodimer/KRAB proteins alone showed no effects on PD-1 gene expression (Fig. 25, left). By contrast, parallel electroporation of separate mRNAs encoding each molecule produced potent repression nearly indistinguishable from that of the same T- DBD/KRAB synthetic repressor encoded by a single chain polypeptide (Fig. 25, right). As such, an obligate heterodimer pair can enable the DNA binding and functional domains of synthetic transcription factors to be split and separately delivered in a flexible, and potentially highly scalable, manner.
[0408] Next, we leveraged synthetic split TFs (SSTFs) to explore the functional impacts on both the potency and the kinetics expression of a wide range of candidate repressor domains extracted from native human TFs by delivering them to a target site in the TIM3 promoter targeted by the DBD of TF8188 (Fig. 26, Top panel). We co-delivered TF8188-DBD-1 mRNA into primary human T-cells together with mRNA encoding each (separately) of 77 candidate repressive domains listed in Table 13, fused to the 37B heterodimer (Fig. 26, Top panel), and assayed TIM3 expression by flow cytometry over a 26 day interval. We identified numerous highly active repressive domains that differed chiefly in their temporal kinetics of repression (Fig. 26, middle and bottom panels). Some SSTFs displayed an immediate sharp decline in repression at 5 days and complete loss by 2 weeks (Fig. 26, bottom panel). In contrast, different KRAB domain homologs from human zinc finger proteins exhibited a relatively slow kinetic profile of de- repression that extended to at least 26 days (Fig. 26, medium panel). The relative potency of different domains was similar but not identical across genes. Further, the spatial presentation of functional domains, whether fused to the heterodimer at the C or N-terminus, altered the repressive efficacy of at least one domain (MBD2), but not others (KRAB, CTBP1, and MECP2) (data not shown ). Notably, we observed only modest repressive activity for the DNMT3A-3F dual domain when combined with the DNA binding domain of TF8188, pAF043, or TF8222. [0409] Fig. 26. Large-scale analysis of functional domains enabled by split encoding of DNA targeting and functional activities. Top panel. The DNA binding domain of the TIM3 repressor TL8188 was selected to screen additional functional domains. TL8188-DBD was fused to heterodimer 37A, and a plurality of nucleic acids of functional domains was fused to heterodimer 37B. Both constructs were transiently expressed in primary human T cells by RNA electroporation, and fraction of cells with TIM3 repressed (%TIM3 negative cells in TL8188-treated cells relative to no RNA control) was evaluated periodically for 26 days by cell surface antibody staining and flow cytometry. Cells with greater fluorescence intensity than unstained control were considered TIM3+. Middle panel. Domains containing KRAB showed more durable repression, or relatively slow kinetics of decay, for several different KRAB domains. Bottom panel. Domains from methyl-DNA binding proteins showed less durable repression, or relatively fast kinetics of decay.
[0410] The above results thus show that SSTFs can be used to deliver different functional activities to the same keyhole site (or any other targeted site) at scale, and indicate that different classes of repressive domains encoded within native TFs may confer different functions that are reflected chiefly in the kinetics of repression as a function of cell proliferation time.
[0411] Table 13: List of genes from which candidate repressor domains were selected for screening. EXAMPLE 8
Cognate and Non-Cognate Heterodimer Pairs
[0412] TIM3 expression was assayed using flow cytometry and plotted as %TIM3+cells at Day 2 post transfection with an mRNA encoding TIM3 targeting DBD (from TL8188) fused to one member of a heterodimer pair and an mRNA encoding another member of the heterodimer pair fused to a KRAB domain. Cognate pairs: 13A,13B; 37A,37B; DHD37-BBB-A, DHD37-BBB-B; DHD150A, DHD150B; DHD154A, DHD154B mediate dimerization and repression. Non-cognate pairs 13,37; 13,DHD150; 37, DHD37-BBB; and 37, DHD150 also mediated dimerization and repression. See Fig. 27.
[0413] Integration of CIPHR logic gates with T cell transcriptional repressors. Engineered T cell therapies are promising therapeutic modalities, but their efficacy for treating solid tumors is limited at least in part by T cell exhaustion. Immune checkpoint genes including PD-1, CTLA4, LAG3, and TIM3 are believed to play critical roles in modulating T cell exhaustion. To put the transcription of such proteins under the control of the CIPHR logic gates, we took advantage of potent and selective transcriptional repressors of immune checkpoint genes in primary T cells that combine sequence-specific transcription activator-like effector (TALE) DNA binding domains with the Kriippel-associated box (KRAB) repressor domain; this repression activity is preserved in split systems pairing a DNA recognition domain fused with a monomer of a heterodimer pair with a functional domain fused to the complementary monomer of the heterodimer pair.
[0414] We reasoned that this system could be exploited to engineer programmable therapeutic devices by placing the coupling of separate TALE and KRAB polypeptides fused to monomers (and hence the repression function of the combined molecule) under control of CIPHR gates, such that their proximity could be controlled by logic operations. Use of a repressive domain effectively reverses the logic of CIPHR gates when expression level of the target gene is measured as the output.
[0415] To test the feasibility of this concept, we used a TALE-KRAB fusion engineered to repress TIM3, and thus potentially attenuate T cell exhaustion. We used the all-by-all interaction specificity of a set of four heterodimer pairs (1-1’, 2-2’, 4-4’, and 9-9’) in this TALE-KRAB setting to design a NOT gate, with 1 fused to TALE, 9’ fused to KRAB, and the 1’-9 linker protein as the input. In this scheme,1’-9 brings KRAB to the promoter region bound by the TALE, therefore triggering repression of TIM3 (Fig. 29, Top panel). Taking advantage of the interaction between 9 and 1’, we built an OR gate with 9- TALE and 1’-KRAB fusions; TIM3 is repressed in the absence of inputs, but upon addition of either 9’ or 1, the weaker 9: 1’ interaction is outcompeted in favor of the stronger 9:9' and 1:1' interactions, restoring TIM3 expression (Fig. 29, Bottom panel). These results suggest that the combination of CIPHR and TALE-KRAB systems could be directly applied to add signal processing capabilities to adoptive T cell therapy. [0416] Table 14. Sequences of the heterodimer members.
[0417] While specific embodiments of the present invention have been shown and described herein, it will be apparent to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby. [0418] For reasons of completeness, certain aspects of the polypeptides, composition, and methods of the present disclosure are set out in the following numbered clauses:
1. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor domain, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C- terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence:
TGGTGGGGCTGCTCCAGGCATGCAGATCCCACAGGCGCCCTGG (SEQ ID NO: 1) wherein each of the RU comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
2. The recombinant polypeptide of clause 1, wherein the RUs are ordered from N-terminus to the C-terminus to bind to the sequence: GGGGCTGCTCC (SEQ ID NO:2), wherein the first RU at the N-terminus binds to the G at the 5’ end of the sequence and the last RU at the C-terminus binds to the C at the 3’ end of the sequence.
3. The recombinant polypeptide of clause 2, wherein the X12X13 in the RUs from N-terminus to C-terminus are NH, NH, NH, NH, HD, NG, NH, HD, NG, HD, and HD.
4. The recombinant polypeptide of clause 2 or 3, wherein the DBD comprises at least an additional RU at the N-terminus such that the DBD binds to the nucleic acid sequence TGGGGCTGCTCC (SEQ ID NO:3), wherein X12X13 in the additional RU is NG, HG, KG, or RG for recognition of the T.
5. The recombinant polypeptide of clause 1, wherein the RUs are ordered from N-terminus to the C-terminus to bind to the sequence: GGTGGGGCTGCTCC (SEQ ID NO:4), wherein the first RU at the N-terminus binds to the G at the 5’ end of the sequence and the last RU at the C-terminus binds to the C at the 3’ end of the sequence.
6. The recombinant polypeptide of clause 5, wherein the DBD comprises at least fourteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NH, NH, NG, NH, NH, NH, NH,
HD, NG, NH, HD, NG, HD, and HD.
7. The recombinant polypeptide of clause 5 or 6, wherein the DBD comprises three additional RU at the N-terminus such that the DBD binds to the nucleic acid sequence TGGTGGGGCTGCTCC (SEQ ID NO:5).
8. The recombinant polypeptide of clause 5, wherein the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence GGTGGGGCTGCTCCAGG (SEQ ID NO:6).
9. The recombinant polypeptide of clause 1, wherein the RUs are arranged from N-terminus to C-terminus to bind to the sequence: GCAGATCCCACAGGCGC (SEQ ID NO:7). 10. The recombinant polypeptide of clause 1, wherein the RUs are arranged from N-terminus to C-terminus to bind to the sequence: CCCACAGGCGCCCTGG (SEQ ID NO:8).
11. The recombinant polypeptide of clause 1, wherein the RUs are arranged from N-terminus to C-terminus to bind to the sequence: GGGGCTGCTCCAGGCATGC (SEQ ID NO:9).
12. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C- terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence:
CCTCCCCCAGCACTGCCTCTGTCACTCTCGCCCACGTGGATGTGG (SEQ ID NO: 10), wherein each of the RU comprises the sequence X1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X1- 11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
13. The recombinant polypeptide of clause 12, wherein the RUs are ordered from N-terminus to C-terminus of the DBD to bind to the nucleic acid sequence TCTGTCACTCTCG (SEQ ID NO: 11).
14. The recombinant polypeptide of clause 13, wherein the DBD comprises at least thirteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, and NH.
15. The recombinant polypeptide of clause 13 or 14, wherein the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence GCCTCTGTCACTCTCG (SEQ ID NO: 12).
16. The recombinant polypeptide of clause 15, wherein the DBD further comprises three additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence GCCTCTGTCACTCTCGCCC (SEQ ID NO: 13).
17. The recombinant polypeptide of clause 16, wherein the DBD comprises at least nineteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NH, HD, HD, NG, HD, NG, NH,
NG, HD, NI, HD, NG, HD, NG, HD, NH, HD, HD, and HD.
18. The recombinant polypeptide of clause 13 or 14, wherein the DBD further comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence TCTGTCACTCTCGCCCAC (SEQ ID NO: 14).
19. The recombinant polypeptide of clause 18, wherein the DBD comprises at least eighteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, NG, NH, HD, HD, HD, NI, and HD.
20. The recombinant polypeptide of clause 12, wherein the DBD comprises thirteen RUs ordered from N-terminus to C-terminus of the DBD to bind to the nucleic acid sequence: CCCCCAGCACTGC (SEQ ID NO: 15).
21. The recombinant polypeptide of clause 20, wherein the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCTCCCCCAGCACTGC (SEQ ID NO: 16). 22. The recombinant polypeptide of clause 21, wherein the DBD further comprises an additional RU at the C-terminus such that the DBD binds to the nucleic acid sequence: CCTCCCCCAGCACTGCC (SEQ ID NO: 17).
23. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence:
CCCAGGTCAGGTTGAAG (SEQ ID NO: 18), wherein each of the RU comprises the sequence X1- 11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at Xi3 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
24. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD1 gene, wherein the nucleic acid sequence is present within the sequence:
CCCTTCAACCTGACCTGGGACAGTTTCCCTTCCGCTCACCTCCGCCTGA (SEQ ID NO: 19), wherein each of the RU comprises the sequence X1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X1- 11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
25. The recombinant polypeptide of clause 24, wherein the DBD comprises ten RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: TCCGCTCACC (SEQ ID NO:20).
26. The recombinant polypeptide of clause 25, wherein the DBD comprises nine additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: TCCGCTCACCTCCGCCTGA (SEQ ID NO:21).
27. The recombinant polypeptide of clause 25, wherein the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCCGCTCACC (SEQ ID NO:22).
28. The recombinant polypeptide of clause 27, wherein the DBD comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCCGCTCACCTCCGC (SEQ ID NO:23). 29. The recombinant polypeptide of clause 27, wherein the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: TTCCCTTCCGCTCACC (SEQ ID NO:24).
30. The recombinant polypeptide of clause 24, wherein the DBD comprises twelve RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: GGGACAGTTTCC (SEQ ID NO:25).
31. The recombinant polypeptide of clause 30, wherein the DBD further comprises four additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: GGGACAGTTTCCCTTC (SEQ ID NO: 26).
32. The recombinant polypeptide of clause 30, wherein the DBD further comprises five additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: GACCTGGGACAGTTTCC (SEQ ID NO: 27).
33. The recombinant polypeptide of clause 24, wherein the DBD comprises eleven RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: CAACCTGACCT (SEQ ID NO:28).
34. The recombinant polypeptide of clause 33, wherein the DBD comprises nine additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CAACCTGACCTGGGACAGTT (SEQ ID NO: 29).
35. The recombinant polypeptide of clause 33, wherein the DBD comprises five additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCAACCTGACCT (SEQ ID NO:30).
36. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GCCGCCTTCTCCACTGCTCAGGCGGAGGT (SEQ ID NO:31), wherein each of the RU comprises the sequence X 1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
37. The recombinant polypeptide of clause 36, wherein the DBD comprises RUs arranged from N-terminus to C-terminus such that the DBD binds to the nucleic acid sequence: GCCGCCTTCTCCACT (SEQ ID NO:32).
38. The recombinant polypeptide of clause 36, wherein the DBD comprises RUs arranged from N-terminus to C-terminus such that the DBD binds to the nucleic acid sequence: CCACTGCTCAGGCG (SEQ ID NO:33).
39. The recombinant polypeptide of clause 38, wherein the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: TCTCCACTGCTCAGGCG (SEQ ID NO:34). 40. The recombinant polypeptide of clause 38, wherein the DBD further comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CCACTGCTCAGGCGGAGGT (SEQ ID NO:35).
41. A recombinant polypeptide comprising : a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GGCCAGGGCGCCTGT (SEQ ID NO:36);
CTGCATGCCTGGAGCAG (SEQ ID NO:37); GCTCCCGCCCCCTCTTCCT (SEQ ID NO:38); CTTCCTCCACATCCACG (SEQ ID NO:39); or CCTCCACATCCACGTGGGC (SEQ ID NO:40), wherein each of the RU comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X1- 11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
42. The recombinant polypeptide of any one of clauses 1-41, wherein the DBD comprises at least 11 RUs.
43. The recombinant polypeptide of any one of clauses 1-41, wherein the DBD comprises at least 13 RUs.
44. The recombinant polypeptide of any one of clauses 1-41, wherein the DBD comprises at least 15 RUs.
45. The recombinant polypeptide of any one of clauses 1-41, wherein the DBD comprises at least 17 RUs.
46. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises up to 40 RUs.
47. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises additional RUs at the N-terminus that bind to the nucleotides present upstream of the nucleic acid sequence.
48. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises additional RUs at the C-terminus that bind to the nucleotides present downstream of the nucleic acid sequence.
49. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within the sequence:
GGCAGTGTTACTATAAGAATCACTGGCAATCAGACACCCGGGTG (SEQ ID NO:41) or a complement thereof, wherein each of the RU comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, andwherein the transcriptional repressor domain suppresses expression of TIM3 encoded by the TIM3 gene.
50. The recombinant polypeptide of clause 49, wherein the DBD comprises RUs that bind to the nucleic acid sequence TGTTACTATA (SEQ ID NO:42).
51. The recombinant polypeptide of clause 50, wherein the DBD comprises an additional RU at the C-terminus such that the DBD binds to the nucleic acid sequence TGTTACTATAA (SEQ ID NO:43).
52. The recombinant polypeptide of clause 50 or 51, wherein the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence CAGTGTTACTATAA (SEQ ID NO: 44).
53. The recombinant polypeptide of clause 52, wherein the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence GGCAGTGTTACTATAA (SEQ ID NO:45).
54. The recombinant polypeptide of clause 49, wherein the DBD comprises RUs that bind to the nucleic acid sequence TCAGACACCCGGGTG (SEQ ID NO:46).
55. The recombinant polypeptide of clause 54, wherein the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence CAATCAGACACCCGGGTG (SEQ ID NO: 47).
56. The recombinant polypeptide of clause 54, wherein the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence TGGCAATCAGACACCCGGGTG (SEQ ID NO:48).
57. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within the sequence:
TGTCTGATTGCCAGTGATTCTTATAGT (SEQ ID NO:49), wherein each of the repeat unit comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of TIM3 encoded by the TIM3 gene.
58. The recombinant polypeptide of clause 57, wherein the DBD comprises RUs that are ordered to bind to the sequence TGCCAGTGATT (SEQ ID NO: 50).
59. The recombinant polypeptide of clause 58, wherein the DBD comprises eight additional RUs at the C-terminus such that the DBD binds to the sequence TGCCAGTGATTCTTATAGT (SEQ ID NO:51).
60. The recombinant polypeptide of clause 57, wherein the DBD comprises RUs that are ordered to binds to the sequence TGATTGCCAGTGATT (SEQ ID NO:52). 61. The recombinant polypeptide of clause 60, wherein the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence TGTCTGATTGCCAGTGATT (SEQ ID NO:53).
62. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of TIM3 gene, wherein the nucleic acid sequence is: TACACACAT (SEQ ID NO:54), wherein each of the repeat unit comprises the sequence X1- 11X12X13X14-33, 34, o (rS 35EQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from:(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid atX13 is absent, and wherein the transcriptional repressor domain suppresses expression of TIM3 encoded by the TIM3 gene.
63. The recombinant polypeptide of clause 62, wherein the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence ACACTACACACAT (SEQ ID NO:55).
64. The recombinant polypeptide of clause 63, wherein the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence TGCCACACTACACACAT (SEQ ID NO:56).
65. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the LAG3 gene, wherein the nucleic acid sequence is present within the sequence:
GCCGTTCTGCTGGTCTCTGGGCCTTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO: 57), wherein each of the RU comprises the sequence X1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of LAG3 encoded by the LAG3 gene.
66. The recombinant polypeptide of clause 65, wherein the DBD comprises RUs that bind to the sequence TCTGCTGGTCT (SEQ ID NO: 58).
67. The recombinant polypeptide of clause 66, wherein the DBD comprises five additional RUs at the N-terminus such that the DBD binds to the sequence GCCGTTCTGCTGGTCT (SEQ ID NO:59).
68. The recombinant polypeptide of clause 67, wherein the DBD comprises two additional RUs at the C-terminus such that the DBD binds to the sequence GCCGTTCTGCTGGTCTCT (SEQ ID NO:60). 69. The recombinant polypeptide of clause 66, wherein the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGC (SEQ ID NO: 61).
70. The recombinant polypeptide of clause 69, wherein the DBD comprises an additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGCC (SEQ ID NO: 62).
71. The recombinant polypeptide of clause 70, wherein the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGCCTTC (SEQ ID NO:63).
72. The recombinant polypeptide of clause 65, wherein the DBD comprises RUs that bind to the sequence TCTCTGGGCCTTCA (SEQ ID NO:64).
73. The recombinant polypeptide of clause 72, wherein the DBD comprises two additional RUs at the N-terminus such that the DBD binds the sequence GGTCTCTGGGCCTTCA (SEQ ID NO:65).
74. The recombinant polypeptide of clause 73, wherein the DBD comprises three additional RUs at the C-terminus such that the DBD binds the sequence GGTCTCTGGGCCTTCACCC (SEQ ID NO:66).
75. The recombinant polypeptide of clause 74, wherein the DBD comprises an additional RUs at the N-terminus such that the DBD binds the sequence TGGTCTCTGGGCCTTCACC (SEQ ID NO:67).
76. The recombinant polypeptide of clause 65, wherein the DBD comprises RUs that bind to the sequence TTCACCCCTGTG (SEQ ID NO:68).
77. The recombinant polypeptide of clause 76, wherein the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCG (SEQ ID NO:69).
78. The recombinant polypeptide of clause 77, wherein the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCGGCCT (SEQ ID NO: 70).
79. The recombinant polypeptide of clause 78, wherein the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO:71).
80. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of LAG3 gene, wherein the nucleic acid sequence is: TGCTCTGTCTGC (SEQ ID NO:72), wherein each of the repeat unit comprises the sequence X1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids, X12X13 is selected from: (a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of LAG3 encoded by the LAG3 gene. 81. The recombinant polypeptide of clause 80, wherein the DBD comprises two additional RUs at the C-terminus such that the DBD binds to the sequence TGCTCTGTCTGCTC (SEQ ID NO:73).
82. The recombinant polypeptide of clause 81, wherein the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the sequence TTTGCTCTGTCTGCTC (SEQ ID NO: 74).
83. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the CTLA4 gene, wherein the nucleic acid sequence is:
ACATATCTGGGATCAAAGCT (SEQ ID NO:75),
ATATAAAGTCCTTGAT (SEQ ID NO:76), or TTCTATTCAAGTGCC (SEQ ID NO: 77), wherein each of the RU comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of CTLA4 encoded by the CTLA4 gene.
84. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises up to 40 RUs.
85. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises up to 35 RUs.
86. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises up to 30 RUs.
87. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises up to 25 RUs.
88. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises up to 20 RUs.
89. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises additional RUs at the N-terminus that bind to the nucleotides present upstream of the nucleic acid sequence.
90. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises additional RUs at the C-terminus that bind to the nucleotides present downstream of the nucleic acid sequence.
91. The recombinant polypeptide of any one of the preceding clauses, wherein the transcriptional repressor domain is conjugated to the C-terminus of the DBD. 92. The recombinant polypeptide of any one of the preceding clauses, wherein the chain of 11 contiguous amino acids is at least 80% identical to LTPDQVVAIAS (SEQ ID NO:78).
93. The recombinant polypeptide of any one of the preceding clauses, wherein the chain of 20, 21, or 22 contiguous amino acids is at least 80% identical to GGKQALETVQRLLPVLCQDHG (SEQ ID NO: 79).
94. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises a N-cap region comprising an amino acid sequence at least 80% identical to the amino acid sequence set for the in SEQ ID NO: 339.
95. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises a C-cap region comprising an amino acid sequence at least 80% identical to the amino acid sequence set forth in SEQ ID NO: 452, wherein the recombinant polypeptide comprises from N-terminus to C-terminus: the N-cap region, the plurality of RUs, and the C-cap region.
96. The recombinant polypeptide of any one of the preceding clauses, wherein the DBD comprises a half-repeat comprising the amino acid sequence X1-11X12X13 X14-19 ,20, or 22 (SEQ ID NO: 471), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-20 or 21 or 22 is a chain of 7, 8 or 9 contiguous amino acids, X12X13 is selected from:(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A); (c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and (e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent.
97. The recombinant polypeptide of clause 96, wherein X 1-11 is at least 80% identical to LTPEQWAIAS (SEQ ID NO: 458).
98. The recombinant polypeptide of clause 96 or 97, wherein X14-20 or 21 or 22 is at least 80% identical to GGRPALE (SEQ ID NO:472).
99. A nucleic acid encoding the recombinant polypeptide of any of clauses 1-98.
100. The nucleic acid of clause 99, wherein the nucleic acid is operably linked to a promoter sequence that confers expression of the polypeptide.
101. The nucleic acid of clause 99 or 100, wherein the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell.
102. The nucleic acid of any one of clauses 99-101, wherein the nucleic acid is a deoxyribonucleic acid (DNA).
103. The nucleic acid of any one of clauses 99-101, wherein the nucleic acid is a ribonucleic acid (RNA).
104. A vector comprising the nucleic acid of any of clauses 99-103.
105. The vector of clause 104, wherein the vector is a viral vector.
106. A host cell comprising the nucleic acid of any of clauses 99-103 or the vector of clause 104 or 105.
107. A host cell that expresses the polypeptide of any of clauses 1-98.
108. A pharmaceutical polypeptide comprising the polypeptide of any of clauses 1-98 and a pharmaceutically acceptable excipient.
109. A pharmaceutical polypeptide comprising the nucleic acid of any of clauses 99-103 or the vector of clause 104 or 105 and a pharmaceutically acceptable excipient.
110. A method of suppressing expression of PDCD-1 gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of clauses 1-48, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the PDCD-1 gene and the transcriptional repressor domain suppresses expression of the PDCD-1 gene.
111. A method of suppressing expression of TIM3 gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of clauses 49-64, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the TIM3 gene and the transcriptional repressor domain suppresses expression of the TIM3 gene.
112. A method of suppressing expression of LAG3 gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of clauses 65-82, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the LAG3 gene and the transcriptional repressor domain suppresses expression of the LAG3 gene.
113. A method of suppressing expression of CTLA4 gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of clause 83, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the CTLA4 gene and the transcriptional repressor domain suppresses expression of the CTLA4 gene.
114. The method of any one of clauses 110-113, wherein the polypeptide is introduced as a nucleic acid encoding the polypeptide.
115. The method of clause 114, wherein the nucleic acid is a deoxyribonucleic acid (DNA).
116. The method of clause 114, wherein the nucleic acid is a ribonucleic acid (RNA).
117. The method of any of clauses 110-116, wherein the sequence of the nucleic acid is codon optimized for expression in a human cell.
118. The method of any of clauses 110-116, wherein the transcriptional repressor domain comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, or MeCP2.
119. The method of any one of clauses 110-118, wherein the cell is an animal cell.
120. The method of any one of clauses 110-118, wherein the cell is a human cell.
121. The method of any one of clauses 110-120, wherein the cell is a cancer cell.
122. The method of any one of clauses 110-121, wherein the cell is an ex vivo cell.
123. The method of any one of clauses 110-121, wherein the introducing comprises administering the polypeptide or a nucleic acid encoding the polypeptide to a subject.
124. The method of clause 123, wherein the administering comprises parenteral administration.
125. The method of clause 123, wherein the administering comprises intravenous, intramuscular, intrathecal, or subcutaneous administration.
126. The method of clause 123, wherein the administering comprises direct injection into a site in a subject.
127. The method of any of clause 123, wherein the administering comprises direct injection into a tumor.
128. A recombinant polypeptide comprising a DNA binding domain and a transcriptional repressor domain, wherein the DNA binding domain and the transcriptional repressor domain are heterologous, wherein the transcriptional repressor domain comprises an amino acid sequence at least 80% identical to any one of the sequences set out in SEQ ID NOs: 84-101. 129. The recombinant polypeptide of clause 128, wherein the transcriptional repressor domain comprises an amino acid sequence at least 85% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
130. The recombinant polypeptide of clause 128, wherein the transcriptional repressor domain comprises an amino acid sequence at least 90% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
131. The recombinant polypeptide of clause 128, wherein the transcriptional repressor domain comprises an amino acid sequence at least 95% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
132. The recombinant polypeptide of any one of clauses 128-131, wherein the DNA binding domain comprises zinc finger protein (ZFP), a transcription activator-like effector (TALE), or a guide RNA.
133. The recombinant polypeptide of any one of clauses 128-132, wherein the DNA binding domain binds to a target nucleic acid sequence in a gene and optionally, wherein the DNA binding domain is the DBD of any one of clauses 1-98.
134. The recombinant polypeptide of clause 133, wherein the target nucleic acid sequence is in a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HAVCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the BCL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
135. A nucleic acid encoding the recombinant polypeptide of any of clauses 128-134.
136. The nucleic acid of clause 135, wherein the nucleic acid is operably linked to a promoter sequence that confers expression of the polypeptide.
137. The nucleic acid of clause 135 or 136, wherein the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell.
138. The nucleic acid of any one of clauses 135-137, wherein the nucleic acid is a deoxyribonucleic acid (DNA).
139. The nucleic acid of any one of clauses 135-137, wherein the nucleic acid is a ribonucleic acid (RNA).
140. A vector comprising the nucleic acid of any of clauses 135-138.
141. The vector of clause 140, wherein the vector is a viral vector.
142. A host cell comprising the nucleic acid of any of clauses 135-139 or the vector of clause 140 or 141.
143. A host cell comprising the polypeptide of any of clauses 128-134.
144. A host cell that expresses the polypeptide of any of clauses 128-134.
145. A pharmaceutical composition comprising the polypeptide of any of clauses 128-134 and a pharmaceutically acceptable excipient.
146. A pharmaceutical composition comprising the nucleic acid of any of clauses 135-139 or the vector of clause 140 or 141 and a pharmaceutically acceptable excipient.
147. A method of suppressing expression of an endogenous gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of clauses 128-134, wherein the DBD of the polypeptide binds to a target nucleic acid sequence present in the endogenous gene and the heterologous transcriptional repressor domain suppresses expression of the endogenous gene.
148. The method of clause 147, wherein the recombinant polypeptide is introduced as a nucleic acid encoding the polypeptide.
149. The method of clause 148, wherein the nucleic acid is a deoxyribonucleic acid (DNA).
150. The method of clause 148, wherein the nucleic acid is a ribonucleic acid (RNA).
151. The method of any of clauses 148-150, wherein the sequence of the nucleic acid is codon optimized for expression in a human cell.
152. The method of any of clauses 147-151, wherein the gene is a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HAVCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
153. The method of any one of clauses 147-152, wherein the cell is an animal cell.
154. The method of any one of clauses 147-152, wherein the cell is a human cell.
155. The method of any one of clauses 147-152, wherein the cell is a cancer cell.
156. The method of any one of clauses 147-152, wherein the cell is an ex vivo cell.
157. The method of any one of clauses 147-155, wherein the introducing comprises administering the polypeptide or a nucleic acid encoding the polypeptide to a subject.
158. The method of clause 157, wherein the administering comprises parenteral administration.
159. The method of clause 157, wherein the administering comprises intravenous, intramuscular, intrathecal, or subcutaneous administration.
160. The method of clause 157, wherein the administering comprises direct injection into a site in a subject.
161. The method of any of clause 157, wherein the administering comprises direct injection into a tumor.
162. A plurality of nucleic acids encoding:
(i) polypeptides that dimerize via direct dimerization, comprising:
(A) a DNA binding domain (DBD) fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair, or
(B) a DNA binding domain (DBD) fused to a second member of a heterodimer pair and a functional domain fused to a first member of the heterodimer pair, wherein the first and second members of the heterodimer pair bind to each other thereby directly dimerizing the DBD and the functional domain, wherein the heterodimer pair is selected from one of the following heterodimer pairs:
37A, 37B;
13 A, 13B;
DHD37-BBB-A, DHD37-BBB-B;
DHD150-A, DHD150-B;
DHD154-A, DHD-154B;
37 A, 9B; 13 A, 37B;
13 A, DHD150-B;
37A, DHD37-BBB-B; and
DHD37-BBB-A, 37B; or
(ii) polypeptides that dimerize indirectly via a bridging construct, comprising:
(A) a DNA binding domain (DBD) fused to a first member of a first heterodimer pair; a bridging construct comprising a second member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or
(B) a DNA binding domain (DBD) fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or
(C) a DNA binding domain (DBD) fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a second member of a second heterodimer pair; and a functional domain fused to a first member of the second heterodimer pair, wherein the DBD and the functional domain dimerize indirectly via the bridging construct, wherein the first and second heterodimer pairs are different and are selected from the following heterodimer pairs:
37A, 37B;
13 A, 13B;
DHD37-BBB-A, DHD37-BBB-B;
DHD150-A, DHD150-B;
DHD154-A, DHD-154B;
37A, 9B;
13 A, 37B;
13 A, DHD150-B;
37A, DHD37-BBB-B; and
DHD37-BBB-A, 37B.
163. The plurality of nucleic acids of clause 162, wherein the DBD in (i) (A) or (i) (B) is fused to a first member of a first heterodimer pair and the functional domain is a first functional domain fused a second member of the first heterodimer pair and to a first member of a second heterodimer pair, the system further comprising a second functional domain fused to a second member of the second heterodimer pair, wherein the members of the first heterodimer pair mediate dimerization of the DBD and the first functional domain and members of the second heterodimer pair mediate dimerization of the first functional domain and the second functional domain.
164. The plurality of nucleic acids of clause 163, wherein the DBD is fused to a first member of a first heterodimer pair and to a first member of a second heterodimer pair, and the functional domain is fused a second member of the first heterodimer pair the system further comprising a second functional domain fused to a second member of the second heterodimer pair, wherein the members of the first heterodimer pair mediate assembly of the DBD and the first functional domain and members of the second heterodimer pair mediate assembly of the DBD and the second functional domain. 165. The plurality of nucleic acids of any one of clauses 162-164, wherein the DBD binds to a target nucleic acid sequence present in an endogenous gene in a cell.
166. The plurality of nucleic acids of any one of clauses 162-165, wherein the functional domain comprises an enzyme, a transcriptional activator, a transcriptional repressor, or a DNA nucleotide modifier.
167. The plurality of nucleic acids of clause 166, wherein the enzyme is a nuclease, a DNA modifying protein, or a chromatin modifying protein.
168. The plurality of nucleic acids of clause 167, wherein the nuclease is a cleavage domain or a half-cleavage domain.
169. The plurality of nucleic acids of clause 168, wherein the cleavage domain or half cleavage domain comprises a type IIS restriction enzyme.
170. The plurality of nucleic acids of clause 169, wherein the type IIS restriction enzyme comprises Fokl or Bfil.
171. The plurality of nucleic acids of clause 167, wherein the chromatin modifying protein is lysine -specific histone demethylase 1 (LSD1).
172. The plurality of nucleic acids of clause 166, wherein the transcriptional activator comprises VP 16, VP64, p65, p300 catalytic domain, TET1 catalytic domain, TDG, Ldbl self-associated domain, SAM activator (VP64, p65, HSF1), or VPR (VP64, p65, Rta).
173. The plurality of nucleic acids of clause 168, wherein the transcriptional repressor comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, MeCP2, or a transcriptional repressor provided in clauses 128-134.
174. The plurality of nucleic acids of clause 166, wherein the DNA nucleotide modifier is adenosine deaminase.
175. The plurality of nucleic acids of any of clauses 165-174, wherein the target nucleic acid sequence is within a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
176. The plurality of nucleic acids of any of clauses 162-175, wherein the DBD comprises a transcription activator-like effector (TALE).
177. The plurality of nucleic acids of any of clauses 162-176, wherein the DBD comprises a DBD as set out in any one of clauses 1-98.
178. A DNA binding domain and a functional domain or a DNA binding domain, a functional domain and a bridging construct encoded by the plurality of nucleic acids of nucleic acids of any one of clauses 162-177.
179. A DNA binding domain and a functional domain as set forth in clause 162 (i)(A); or
(i)(B); or a DNA binding domain, a bridging construct, and a functional domain as set forth in clause 162
(ii)(A), (ii)(B), or (ii)(C).
180. A host cell comprising: (a) nucleic acids encoding the polypeptides as set forth in clause 162 (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in clause 162 (ii)(A), (ϋ)(B), or (ii)(C). 181. A host cell comprising: (a) the polypeptides as set forth in clause 162 (i)(A) or (i)(B); or (b) the polypeptides as set forth in clause 162 (ii)(A), (ii)(B), or (ii)(C).
182. A kit comprising:
(a) nucleic acids encoding the polypeptides as set forth in clause 162 (i)(A) or (i)(B); or
(b) nucleic acids encoding the polypeptides as set forth in clause 162 (ii)(A), (ii)(B), or (ii)(C).
183. A kit comprising:
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (i)(A); and
(b) a second vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (i)(A); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (i)(B); and
(b) a second vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (i)(B).
184. A kit comprising:
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (ii)(A);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in clause 162 (ii)(A);and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (ii)(A); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (ii)(B);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in clause 162 (ii)(B); and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (ii)(B); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (ii)(C);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in clause 162 (ii)(C);and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (ii)(C).
185. A pharmaceutical composition comprising:
(a) nucleic acids encoding the polypeptides as set forth in clause 162 (i)(A) or (i)(B); or
(b) nucleic acids encoding the polypeptides as set forth in clause 162 (ii)(A), (ii)(B), or (ii)(C).
186. A pharmaceutical composition comprising:
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (i)(A); and
(b) a second vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (i)(A); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (i)(B); and
(b) a second vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (i)(B).
187. A pharmaceutical composition comprising:
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (ii)(A);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in clause 162 (ii)(A);and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (ii)(A); or (a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (ii)(B);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in clause 162 (ii)(B); and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in clause 162
(ii)(B); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in clause 162 (ii)(C);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in clause 162 (ii)(C);and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in clause 162 (ii)(C).
188. A pharmaceutical composition comprising the DBD and a functional domain or a DNA binding domain, a functional domain and a bridging construct of clause 178 and a pharmaceutically acceptable excipient.
189. A pharmaceutical composition comprising the host cell of clause 180 or 181 and a pharmaceutically acceptable excipient.
190. A method for modulating expression from a target gene in a cell, the method comprising:
(i) introducing into the cell a first nucleic acid encoding a DNA binding domain fused to a first member of a heterodimer pair and a second nucleic acid encoding a functional domain fused to a second member of the heterodimer pair; or
(ii) introducing into the cell a first nucleic acid encoding a DNA binding domain fused to a second member of a heterodimer pair and a second nucleic acid encoding a functional domain fused to a first member of the heterodimer pair; or
(iii) introducing into the cell a DNA binding domain fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair; or
(iv) introducing into the cell a DNA binding domain fused to a second member of a heterodimer pair and a functional domain fused to a first member of the heterodimer pair, wherein the heterodimer pair is selected from one of the following heterodimer pairs:
37A, 37B;
13 A, 13B;
DHD37-BBB-A, DHD37-BBB-B;
DHD150-A, DHD150-B;
DHD154-A, DHD-154B;
37A, 9B;
13 A, 37B;
13 A, DHD150-B;
37A, DHD37-BBB-B; and
DHD37-BBB-A, 37B, wherein the DNA binding domain (DBD) dimerizes with the functional domain via dimerization of the members of the heterodimer pair and wherein binding of the DBD to a target nucleic acid sequence in the target gene results in modulation of expression of the target gene via the functional domain dimerized to the DBD.
191. A method of modulating expression of a target gene in a cell, the method comprising:
(i) introducing into a cell expressing a DNA binding domain (DBD) fused to a first member of a first heterodimer pair and a functional domain fused to a second member of a second heterodimer pair, a bridging construct comprising a second member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct; or
(ii) introducing into a cell expressing a DNA binding domain (DBD) fused to a second member of a first heterodimer pair and a functional domain fused to a second member of a second heterodimer pair, a bridging construct comprising a first member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct; or
(iii) introducing into a cell expressing a DNA binding domain (DBD) fused to a first member of a first heterodimer pair and a functional domain fused to a first member of a second heterodimer pair, a bridging construct comprising a second member of the first heterodimer pair fused to a second member of the second heterodimer pair or a nucleic acid encoding the bridging construct, wherein the DBD and the functional domain dimerize indirectly via the bridging construct, wherein binding of the DBD to a target nucleic acid sequence in a target gene in the cell results in in modulation of expression of the target gene via the functional domain dimerized to the DBD via the bridging construct, wherein the first and second heterodimer pairs are different and are selected from the following heterodimer pairs:
37A, 37B;
13 A, 13B;
DHD37-BBB-A, DHD37-BBB-B;
DHD150-A, DHD150-B;
DHD154-A, DHD-154B;
37A, 9B;
13 A, 37B;
13 A, DHD150-B;
37A, DHD37-BBB-B; and
DHD37-BBB-A, 37B.
192. A method of reversing modulation of expression of a target gene in a cell expressing a DNA binding domain (DBD) fused to a first member of a non-cognate heterodimer pair and a functional domain fused to a second member of the non-cognate heterodimer pair, wherein the DBD binds to a target nucleic acid sequence in a target gene and the functional domain dimerized to the DBD via dimerization of the members of the heterodimer pair modulates expression of the target gene, the method comprising introducing into the cell a disruptor which binds to either the first member or the second member with a higher binding affinity than the binding affinity between the first and second members, wherein non cognate heterodimer pairs and the corresponding disruptor are selected from one of the following combinations:

Claims

CLAIMS What is claimed is:
1. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor domain, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
TGGTGGGGCTGCTCCAGGCATGCAGATCCCACAGGCGCCCTGG (SEQ ID NO: 1) wherein each of the RU comprises the sequence X 1-11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids,
X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
2. The recombinant polypeptide of claim 1, wherein the RUs are ordered from N-terminus to the C-terminus to bind to the sequence: GGGGCTGCTCC (SEQ ID NO:2), wherein the first RU at the N-terminus binds to the G at the 5’ end of the sequence and the last RU at the C-terminus binds to the C at the 3’ end of the sequence.
3. The recombinant polypeptide of claim 2, wherein the X12X13 in the RUs from N-terminus to C-terminus are NH, NH, NH, NH, HD, NG, NH, HD, NG, HD, and HD.
4. The recombinant polypeptide of claim 2 or 3, wherein the DBD comprises at least an additional RU at the N-terminus such that the DBD binds to the nucleic acid sequence TGGGGCTGCTCC (SEQ ID NO:3), wherein X12X13 in the additional RU is NG, HG, KG, or RG for recognition of the T.
5. The recombinant polypeptide of claim 1, wherein the RUs are ordered from N-terminus to the C-terminus to bind to the sequence: GGTGGGGCTGCTCC (SEQ ID NO:4), wherein the first RU at the N-terminus binds to the G at the 5’ end of the sequence and the last RU at the C-terminus binds to the C at the 3’ end of the sequence.
6. The recombinant polypeptide of claim 5, wherein the DBD comprises at least fourteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NH, NH, NG, NH, NH, NH, NH, HD, NG, NH, HD, NG, HD, and HD.
7. The recombinant polypeptide of claim 5 or 6, wherein the DBD comprises three additional RU at the N-terminus such that the DBD binds to the nucleic acid sequence TGGTGGGGCTGCTCC (SEQ ID NO:5).
8. The recombinant polypeptide of claim 5, wherein the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence GGTGGGGCTGCTCCAGG (SEQ ID NO:6).
9. The recombinant polypeptide of claim 1, wherein the RUs are arranged from N-terminus to C-terminus to bind to the sequence: GCAGATCCCACAGGCGC (SEQ ID NO:7).
10. The recombinant polypeptide of claim 1, wherein the RUs are arranged from N-terminus to C-terminus to bind to the sequence: CCCACAGGCGCCCTGG (SEQ ID NO:8).
11. The recombinant polypeptide of claim 1, wherein the RUs are arranged from N-terminus to C-terminus to bind to the sequence: GGGGCTGCTCCAGGCATGC (SEQ ID NO:9).
12. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
CCTCCCCCAGCACTGCCTCTGTCACTCTCGCCCACGTGGATGTGG (SEQ ID NO: 10) wherein each of the RU comprises the sequence X1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
13. The recombinant polypeptide of claim 12, wherein the RUs are ordered from N-terminus to C-terminus of the DBD to bind to the nucleic acid sequence TCTGTCACTCTCG (SEQ ID NO: 11).
14. The recombinant polypeptide of claim 13, wherein the DBD comprises at least thirteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, and NH.
15. The recombinant polypeptide of claim 13 or 14, wherein the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence GCCTCTGTCACTCTCG (SEQ ID NO: 12).
16. The recombinant polypeptide of claim 15, wherein the DBD further comprises three additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence GCCTCTGTCACTCTCGCCC (SEQ ID NO: 13).
17. The recombinant polypeptide of claim 16, wherein the DBD comprises at least nineteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NH, HD, HD, NG, HD, NG, NH,
NG, HD, NI, HD, NG, HD, NG, HD, NH, HD, HD, and HD.
18. The recombinant polypeptide of claim 13 or 14, wherein the DBD further comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence TCTGTCACTCTCGCCCAC (SEQ ID NO: 14).
19. The recombinant polypeptide of claim 18, wherein the DBD comprises at least eighteen RUs, wherein X12X13 in the RUs from N-terminus to C-terminus are NG, HD, NG, NH, NG, HD, NI, HD, NG, HD, NG, HD, NG, NH, HD, HD, HD, NI, and HD.
20. The recombinant polypeptide of claim 12, wherein the DBD comprises thirteen RUs ordered from N-terminus to C-terminus of the DBD to bind to the nucleic acid sequence: CCCCCAGCACTGC (SEQ ID NO: 15).
21. The recombinant polypeptide of claim 20, wherein the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCTCCCCCAGCACTGC (SEQ ID NO: 16).
22. The recombinant polypeptide of claim 21, wherein the DBD further comprises an additional RU at the C-terminus such that the DBD binds to the nucleic acid sequence: CCTCCCCCAGCACTGCC (SEQ ID NO: 17).
23. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
CCCAGGTCAGGTTGAAG (SEQ ID NO: 18), wherein each of the RU comprises the sequence X 1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T); (d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
24. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
CCCTTCAACCTGACCTGGGACAGTTTCCCTTCCGCTCACCTCCGCCTGA (SEQ ID
NO: 19), wherein each of the RU comprises the sequence X1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
25. The recombinant polypeptide of claim 24, wherein the DBD comprises ten RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: TCCGCTCACC (SEQ ID NO:20).
26. The recombinant polypeptide of claim 25, wherein the DBD comprises nine additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: TCCGCTCACCTCCGCCTGA (SEQ ID NO:21).
27. The recombinant polypeptide of claim 25, wherein the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCCGCTCACC (SEQ ID NO:22).
28. The recombinant polypeptide of claim 27, wherein the DBD comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCCGCTCACCTCCGC (SEQ ID NO:23).
29. The recombinant polypeptide of claim 27, wherein the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: TTCCCTTCCGCTCACC (SEQ ID NO:24).
30. The recombinant polypeptide of claim 24, wherein the DBD comprises twelve RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: GGGACAGTTTCC (SEQ ID NO:25).
31. The recombinant polypeptide of claim 30, wherein the DBD further comprises four additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: GGGACAGTTTCCCTTC (SEQ ID NO: 26).
32. The recombinant polypeptide of claim 30, wherein the DBD further comprises five additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: GACCTGGGACAGTTTCC (SEQ ID NO: 27).
33. The recombinant polypeptide of claim 24, wherein the DBD comprises eleven RUs ordered from N-terminus to C-terminus to bind to the nucleic acid sequence: CAACCTGACCT (SEQ ID NO:28).
34. The recombinant polypeptide of claim 33, wherein the DBD comprises nine additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CAACCTGACCTGGGACAGTT (SEQ ID NO: 29).
35. The recombinant polypeptide of claim 33, wherein the DBD comprises five additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: CCCTTCAACCTGACCT (SEQ ID NO:30).
36. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence: GCCGCCTTCTCCACTGCTCAGGCGGAGGT (SEQ ID NO:31), wherein each of the RU comprises the sequence X 1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X 1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
37. The recombinant polypeptide of claim 36, wherein the DBD comprises RUs arranged from N-terminus to C-terminus such that the DBD binds to the nucleic acid sequence: GCCGCCTTCTCCACT (SEQ ID NO:32).
38. The recombinant polypeptide of claim 36, wherein the DBD comprises RUs arranged from N-terminus to C-terminus such that the DBD binds to the nucleic acid sequence: CCACTGCTCAGGCG (SEQ ID NO:33).
39. The recombinant polypeptide of claim 38, wherein the DBD further comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence: TCTCCACTGCTCAGGCG (SEQ ID NO:34).
40. The recombinant polypeptide of claim 38, wherein the DBD further comprises five additional RUs at the C-terminus such that the DBD binds to the nucleic acid sequence: CCACTGCTCAGGCGGAGGT (SEQ ID NO:35).
41. A recombinant polypeptide comprising : a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the PDCD 1 gene, wherein the nucleic acid sequence is present within the sequence:
GGCCAGGGCGCCTGT (SEQ ID NO:36);
CTGCATGCCTGGAGCAG (SEQ ID NO:37);
GCTCCCGCCCCCTCTTCCT (SEQ ID NO:38);
CTTCCTCCACATCCACG (SEQ ID NO:39); or
CCTCCACATCCACGTGGGC (SEQ ID NO: 40), wherein each of the RU comprises the sequence X 1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of PD1 receptor encoded by the PDCD1 gene.
42. The recombinant polypeptide of any one of claims 1-41, wherein the DBD comprises at least 11 RUs.
43. The recombinant polypeptide of any one of claims 1-41, wherein the DBD comprises at least 13 RUs.
44. The recombinant polypeptide of any one of claims 1-41, wherein the DBD comprises at least 15 RUs.
45. The recombinant polypeptide of any one of claims 1-41, wherein the DBD comprises at least 17 RUs.
46. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises up to 40 RUs.
47. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises additional RUs at the N-terminus that bind to the nucleotides present upstream of the nucleic acid sequence.
48. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises additional RUs at the C-terminus that bind to the nucleotides present downstream of the nucleic acid sequence.
49. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within the sequence: GGCAGTGTTACTATAAGAATCACTGGCAATCAGACACCCGGGTG (SEQ ID NO:41) or a complement thereof, wherein each of the RU comprises the sequence X1- 11X12X13X14-33, 34, or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of TIM3 encoded by the TIM3 gene.
50. The recombinant polypeptide of claim 49, wherein the DBD comprises RUs that bind to the nucleic acid sequence TGTTACTATA (SEQ ID NO:42).
51. The recombinant polypeptide of claim 50, wherein the DBD comprises an additional RU at the C-terminus such that the DBD binds to the nucleic acid sequence TGTTACTATAA (SEQ ID NO:43).
52. The recombinant polypeptide of claim 50 or 51, wherein the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence CAGTGTTACTATAA (SEQ ID NO: 44).
53. The recombinant polypeptide of claim 52, wherein the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence GGCAGTGTTACTATAA (SEQ ID NO:45).
54. The recombinant polypeptide of claim 49, wherein the DBD comprises RUs that bind to the nucleic acid sequence TCAGACACCCGGGTG (SEQ ID NO:46).
55. The recombinant polypeptide of claim 54, wherein the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence CAATCAGACACCCGGGTG (SEQ ID NO: 47).
56. The recombinant polypeptide of claim 54, wherein the DBD comprises three additional RUs at the N-terminus such that the DBD binds to the nucleic acid sequence TGGCAATCAGACACCCGGGTG (SEQ ID NO:48).
57. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind a nucleic acid sequence of the TIM3 gene, wherein the nucleic acid sequence is present within the sequence:
TGTCTGATTGCCAGTGATTCTTATAGT (SEQ ID NO:49), wherein each of the repeat unit comprises the sequence X1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of TIM3 encoded by the TIM3 gene.
58. The recombinant polypeptide of claim 57, wherein the DBD comprises RUs that are ordered to bind to the sequence TGCCAGTGATT (SEQ ID NO: 50).
59. The recombinant polypeptide of claim 58, wherein the DBD comprises eight additional RUs at the C-terminus such that the DBD binds to the sequence TGCCAGTGATTCTTATAGT (SEQ ID NO:51).
60. The recombinant polypeptide of claim 57, wherein the DBD comprises RUs that are ordered to binds to the sequence TGATTGCCAGTGATT (SEQ ID NO:52).
61. The recombinant polypeptide of claim 60, wherein the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence TGTCTGATTGCCAGTGATT (SEQ ID NO:53).
62. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of TIM3 gene, wherein the nucleic acid sequence is: TACACACAT (SEQ ID NO:54), wherein each of the repeat unit comprises the sequence X1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of TIM3 encoded by the TIM3 gene.
63. The recombinant polypeptide of claim 62, wherein the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence ACACTACACACAT (SEQ ID NO:55).
64. The recombinant polypeptide of claim 63, wherein the DBD comprises four additional RUs at the N-terminus such that the DBD binds to the sequence TGCCACACTACACACAT (SEQ ID NO:56).
65. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the LAG3 gene, wherein the nucleic acid sequence is present within the sequence: GCCGTTCTGCTGGTCTCTGGGCCTTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO:57), wherein each of the RU comprises the sequence X1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of LAG3 encoded by the LAG3 gene.
66. The recombinant polypeptide of claim 65, wherein the DBD comprises RUs that bind to the sequence TCTGCTGGTCT (SEQ ID NO:58).
67. The recombinant polypeptide of claim 66, wherein the DBD comprises five additional RUs at the N-terminus such that the DBD binds to the sequence GCCGTTCTGCTGGTCT (SEQ ID NO:59).
68. The recombinant polypeptide of claim 67, wherein the DBD comprises two additional RUs at the C-terminus such that the DBD binds to the sequence GCCGTTCTGCTGGTCTCT (SEQ ID NO:60).
69. The recombinant polypeptide of claim 66, wherein the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGC (SEQ ID NO: 61).
70. The recombinant polypeptide of claim 69, wherein the DBD comprises an additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGCC (SEQ ID NO: 62).
71. The recombinant polypeptide of claim 70, wherein the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence TCTGCTGGTCTGGGCCTTC (SEQ ID NO:63).
72. The recombinant polypeptide of claim 65, wherein the DBD comprises RUs that bind to the sequence TCTCTGGGCCTTCA (SEQ ID NO:64).
73. The recombinant polypeptide of claim 72, wherein the DBD comprises two additional RUs at the N-terminus such that the DBD binds the sequence GGTCTCTGGGCCTTCA (SEQ ID NO:65).
74. The recombinant polypeptide of claim 73, wherein the DBD comprises three additional RUs at the C-terminus such that the DBD binds the sequence GGTCTCTGGGCCTTCACCC (SEQ ID NO:66).
75. The recombinant polypeptide of claim 74, wherein the DBD comprises an additional RUs at the N-terminus such that the DBD binds the sequence TGGTCTCTGGGCCTTCACC (SEQ ID NO:67).
76. The recombinant polypeptide of claim 65, wherein the DBD comprises RUs that bind to the sequence TTCACCCCTGTG (SEQ ID NO:68).
77. The recombinant polypeptide of claim 76, wherein the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCG (SEQ ID NO:69).
78. The recombinant polypeptide of claim 77, wherein the DBD comprises four additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCGGCCT (SEQ ID NO: 70).
79. The recombinant polypeptide of claim 78, wherein the DBD comprises three additional RUs at the C-terminus such that the DBD binds to the sequence TTCACCCCTGTGCCCGGCCTTCC (SEQ ID NO:71).
80. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising a plurality of repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of LAG3 gene, wherein the nucleic acid sequence is:
TGCTCTGTCTGC (SEQ ID NO:72), wherein each of the repeat unit comprises the sequence X1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G); (b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of LAG3 encoded by the LAG3 gene.
81. The recombinant polypeptide of claim 80, wherein the DBD comprises two additional RUs at the C-terminus such that the DBD binds to the sequence TGCTCTGTCTGCTC (SEQ ID NO:73).
82. The recombinant polypeptide of claim 81, wherein the DBD comprises two additional RUs at the N-terminus such that the DBD binds to the sequence TTTGCTCTGTCTGCTC (SEQ ID NO: 74).
83. A recombinant polypeptide comprising: a DNA binding domain (DBD) and a transcriptional repressor, the DBD comprising at least nine repeat units (RUs) ordered from N-terminus to C-terminus of the DBD to bind to a nucleic acid sequence of the CTLA4 gene, wherein the nucleic acid sequence is: ACATATCTGGGATCAAAGCT (SEQ ID NO:75),
ATATAAAGTCCTTGAT (SEQ ID NO:76), or TTCTATTCAAGTGCC (SEQ ID NO: 77), wherein each of the RU comprises the sequence X1-11X12X13X14-33 or 34 or 35 (SEQ ID NO: 455), wherein: X1-11 is a chain of 11 contiguous amino acids, X14-33 or 34 or 35 is a chain of 20, 21 or 22 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent, and wherein the transcriptional repressor domain suppresses expression of CTLA4 encoded by the CTLA4 gene.
84. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises up to 40 RUs.
85. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises up to 35 RUs.
86. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises up to 30 RUs.
87. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises up to 25 RUs.
88. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises up to 20 RUs.
89. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises additional RUs at the N-terminus that bind to the nucleotides present upstream of the nucleic acid sequence.
90. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises additional RUs at the C-terminus that bind to the nucleotides present downstream of the nucleic acid sequence.
91. The recombinant polypeptide of any one of the preceding claims, wherein the transcriptional repressor domain is conjugated to the C-terminus of the DBD.
92. The recombinant polypeptide of any one of the preceding claims, wherein the chain of 11 contiguous amino acids is at least 80% identical to UTPDQVVAIAS (SEQ ID NO:78).
93. The recombinant polypeptide of any one of the preceding claims, wherein the chain of 20, 21, or 22 contiguous amino acids is at least 80% identical to GGKQALETVQRLLPVLCQDHG (SEQ ID NO: 79).
94. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises a N-cap region comprising an amino acid sequence at least 80% identical to the amino acid sequence set for the in SEQ ID NO: 339.
95. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises a C-cap region comprising an amino acid sequence at least 80% identical to the amino acid sequence set forth in SEQ ID NO: 452, wherein the recombinant polypeptide comprises from N-terminus to C-terminus: the N-cap region, the plurality of RUs, and the C-cap region.
96. The recombinant polypeptide of any one of the preceding claims, wherein the DBD comprises a half-repeat comprising the amino acid sequence X 1-11X12X 13X14-19, 20, or 21 (SEQ ID NO: 471), wherein: X 1-11 is a chain of 11 contiguous amino acids,
X14-20 or 21 or 22 is a chain of 7, 8 or 9 contiguous amino acids,
X12X13 is selected from:
(a) NH, HH, KH, NK, NQ, RH, RN, SS, NN, SN, or KN for recognition of guanine (G);
(b) NI, KI, RI, HI, or SI for recognition of adenine (A);
(c) NG, HG, KG, or RG for recognition of thymine (T);
(d) HD, RD, SD, ND, KD, or YG for recognition of cytosine (C); and
(e) NV or HN for recognition of A or G; and (f) H* , HA, KA, N* , NA, NC, NS, RA, or S* for recognition of A or T or G or C, wherein (* ) means that the amino acid at X13 is absent.
97. The recombinant polypeptide of claim 96, whereinX1-11 is at least 80% identical to LTPEQVVAIAS (SEQ ID NO: 458).
98. The recombinant polypeptide of claim 96 or 97, wherein X14-20 or 21 or 22 is at least 80% identical to GGRPALE (SEQ ID NO:472).
99. A nucleic acid encoding the recombinant polypeptide of any of claims 1-98.
100. The nucleic acid of claim 99, wherein the nucleic acid is operably linked to a promoter sequence that confers expression of the polypeptide.
101. The nucleic acid of claim 99 or 100, wherein the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell.
102. The nucleic acid of any one of claims 99-101, wherein the nucleic acid is a deoxyribonucleic acid (DNA).
103. The nucleic acid of any one of claims 99-101, wherein the nucleic acid is a ribonucleic acid (RNA).
104. A vector comprising the nucleic acid of any of claims 99-103.
105. The vector of claim 104, wherein the vector is a viral vector.
106. A host cell comprising the nucleic acid of any of claims 99-103 or the vector of claim 104 or 105.
107. A host cell that expresses the polypeptide of any of claims 1-98.
108. A pharmaceutical composition comprising the polypeptide of any of claims 1-98 and a pharmaceutically acceptable excipient.
109. A pharmaceutical composition comprising the nucleic acid of any of claims 99-103 or the vector of claim 104 or 105 and a pharmaceutically acceptable excipient.
110. A method of suppressing expression of PDCD-1 gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of claims 1-48, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the PDCD-1 gene and the transcriptional repressor domain suppresses expression of the PDCD-1 gene.
111. A method of suppressing expression of TIM3 gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of claims 49-64, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the TIM3 gene and the transcriptional repressor domain suppresses expression of the TIM3 gene.
112. A method of suppressing expression of LAG3 gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of claims 65-82, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the LAG3 gene and the transcriptional repressor domain suppresses expression of the LAG3 gene.
113. A method of suppressing expression of CTLA4 gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of claim 83, wherein the recombinant polypeptide binds to a target nucleic acid sequence present in the CTLA4 gene and the transcriptional repressor domain suppresses expression of the CTLA4 gene.
114. The method of any one of claims 110-113, wherein the polypeptide is introduced as a nucleic acid encoding the polypeptide.
115. The method of claim 114, wherein the nucleic acid is a deoxyribonucleic acid (DNA).
116. The method of claim 114, wherein the nucleic acid is a ribonucleic acid (RNA).
117. The method of any of claims 110-116, wherein the sequence of the nucleic acid is codon optimized for expression in a human cell.
118. The method of any of claims 110-116, wherein the transcriptional repressor domain comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF -beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, or MeCP2.
119. The method of any one of claims 110-118, wherein the cell is an animal cell.
120. The method of any one of claims 110-118, wherein the cell is a human cell.
121. The method of any one of claims 110-120, wherein the cell is a cancer cell.
122. The method of any one of claims 110-121, wherein the cell is an ex vivo cell.
123. The method of any one of claims 110-121, wherein the introducing comprises administering the polypeptide or a nucleic acid encoding the polypeptide to a subject.
124. The method of claim 123, wherein the administering comprises parenteral administration.
125. The method of claim 123, wherein the administering comprises intravenous, intramuscular, intrathecal, or subcutaneous administration.
126. The method of claim 123, wherein the administering comprises direct injection into a site in a subject.
127. The method of any of claim 123, wherein the administering comprises direct injection into a tumor.
128. A recombinant polypeptide comprising a DNA binding domain and a transcriptional repressor domain, wherein the DNA binding domain and the transcriptional repressor domain are heterologous, wherein the transcriptional repressor domain comprises an amino acid sequence at least 80% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
129. The recombinant polypeptide of claim 128, wherein the transcriptional repressor domain comprises an amino acid sequence at least 85% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
130. The recombinant polypeptide of claim 128, wherein the transcriptional repressor domain comprises an amino acid sequence at least 90% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
131. The recombinant polypeptide of claim 128, wherein the transcriptional repressor domain comprises an amino acid sequence at least 95% identical to any one of the sequences set out in SEQ ID NOs: 84-101.
132. The recombinant polypeptide of any one of claims 128-131, wherein the DNA binding domain comprises zinc finger protein (ZFP), a transcription activator-like effector (TALE), or a guide RNA.
133. The recombinant polypeptide of any one of claims 128-132, wherein the DNA binding domain binds to a target nucleic acid sequence in a gene and optionally, wherein the DNA binding domain is the DBD of any one of claims 1-98.
134. The recombinant polypeptide of claim 133, wherein the target nucleic acid sequence is in a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HAVCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the BCL11A gene, a CELE gene, a TGFER1 gene, a SERPINA1 gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
135. A nucleic acid encoding the recombinant polypeptide of any of claims 128-134.
136. The nucleic acid of claim 135, wherein the nucleic acid is operably linked to a promoter sequence that confers expression of the polypeptide.
137. The nucleic acid of claim 135 or 136, wherein the sequence of the nucleic acid is codon optimized for expression of the polypeptide in a human cell.
138. The nucleic acid of any one of claims 135-137, wherein the nucleic acid is a deoxyribonucleic acid (DNA).
139. The nucleic acid of any one of claims 135-137, wherein the nucleic acid is a ribonucleic acid (RNA).
140. A vector comprising the nucleic acid of any of claims 135-138.
141. The vector of claim 140, wherein the vector is a viral vector.
142. A host cell comprising the nucleic acid of any of claims 135-139 or the vector of claim 140 or 141.
143. A host cell comprising the polypeptide of any of claims 128-134.
144. A host cell that expresses the polypeptide of any of claims 128-134.
145. A pharmaceutical composition comprising the polypeptide of any of claims 128-134 and a pharmaceutically acceptable excipient.
146. A pharmaceutical composition comprising the nucleic acid of any of claims 135-139 or the vector of claim 140 or 141 and a pharmaceutically acceptable excipient.
147. A method of suppressing expression of an endogenous gene in a cell, the method comprising: introducing into the cell the recombinant polypeptide of any one of claims 128-134, wherein the DBD of the polypeptide binds to a target nucleic acid sequence present in the endogenous gene and the heterologous transcriptional repressor domain suppresses expression of the endogenous gene.
148. The method of claim 147, wherein the recombinant polypeptide is introduced as a nucleic acid encoding the polypeptide.
149. The method of claim 148, wherein the nucleic acid is a deoxyribonucleic acid (DNA).
150. The method of claim 148, wherein the nucleic acid is a ribonucleic acid (RNA).
151. The method of any of claims 148-150, wherein the sequence of the nucleic acid is codon optimized for expression in a human cell.
152. The method of any of claims 147-151, wherein the gene is a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HAVCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
153. The method of any one of claims 147-152, wherein the cell is an animal cell.
154. The method of any one of claims 147-152, wherein the cell is a human cell.
155. The method of any one of claims 147-152, wherein the cell is a cancer cell.
156. The method of any one of claims 147-152, wherein the cell is an ex vivo cell.
157. The method of any one of claims 147-155, wherein the introducing comprises administering the polypeptide or a nucleic acid encoding the polypeptide to a subject.
158. The method of claim 157, wherein the administering comprises parenteral administration.
159. The method of claim 157, wherein the administering comprises intravenous, intramuscular, intrathecal, or subcutaneous administration.
160. The method of claim 157, wherein the administering comprises direct injection into a site in a subject.
161. The method of any of claim 157, wherein the administering comprises direct injection into a tumor.
162. A plurality of nucleic acids encoding:
(i) polypeptides that dimerize via direct dimerization, comprising:
(A) a DNA binding domain (DBD) fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair, or
(B) a DNA binding domain (DBD) fused to a second member of a heterodimer pair and a functional domain fused to a first member of the heterodimer pair, wherein the first and second members of the heterodimer pair bind to each other thereby directly dimerizing the DBD and the functional domain, wherein the heterodimer pair is selected from one of the following heterodimer pairs:
37A, 37B;
13 A, 13B;
DHD37-BBB-A, DHD37-BBB-B; DHD150-A, DHD150-B;
DHD154-A, DHD-154B;
37A, 9B;
13 A, 37B;
13 A, DHD150-B;
37A, DHD37-BBB-B; and DHD37-BBB-A, 37B; or
(ii) polypeptides that dimerize indirectly via a bridging construct, comprising:
(A) a DNA binding domain (DBD) fused to a first member of a first heterodimer pair; a bridging construct comprising a second member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or
(B) a DNA binding domain (DBD) fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a first member of a second heterodimer pair; and a functional domain fused to a second member of the second heterodimer pair; or
(C) a DNA binding domain (DBD) fused to a second member of a first heterodimer pair; a bridging construct comprising a first member of the first heterodimer pair fused to a second member of a second heterodimer pair; and a functional domain fused to a first member of the second heterodimer pair, wherein the DBD and the functional domain dimerize indirectly via the bridging construct, wherein the first and second heterodimer pairs are different and are selected from the following heterodimer pairs:
37A, 37B;
13 A, 13B;
DHD37-BBB-A, DHD37-BBB-B;
DHD150-A, DHD150-B;
DHD154-A, DHD-154B;
37 A, 9B;
13 A, 37B;
13 A, DHD150-B;
37A, DHD37-BBB-B; and DHD37-BBB-A, 37B.
163. The plurality of nucleic acids of claim 162, wherein the DBD in (i) (A) or (i) (B) is fused to a first member of a first heterodimer pair and the functional domain is a first functional domain fused a second member of the first heterodimer pair and to a first member of a second heterodimer pair, the system further comprising a second functional domain fused to a second member of the second heterodimer pair, wherein the members of the first heterodimer pair mediate dimerization of the DBD and the first functional domain and members of the second heterodimer pair mediate dimerization of the first functional domain and the second functional domain.
164. The plurality of nucleic acids of claim 163, wherein the DBD is fused to a first member of a first heterodimer pair and to a first member of a second heterodimer pair, and the functional domain is fused a second member of the first heterodimer pair the system further comprising a second functional domain fused to a second member of the second heterodimer pair, wherein the members of the first heterodimer pair mediate assembly of the DBD and the first functional domain and members of the second heterodimer pair mediate assembly of the DBD and the second functional domain.
165. The plurality of nucleic acids of any one of claims 162-164, wherein the DBD binds to a target nucleic acid sequence present in an endogenous gene in a cell.
166. The plurality of nucleic acids of any one of claims 162-165, wherein the functional domain comprises an enzyme, a transcriptional activator, a transcriptional repressor, or a DNA nucleotide modifier.
167. The plurality of nucleic acids of claim 166, wherein the enzyme is a nuclease, a DNA modifying protein, or a chromatin modifying protein.
168. The plurality of nucleic acids of claim 167, wherein the nuclease is a cleavage domain or a half-cleavage domain.
169. The plurality of nucleic acids of claim 168, wherein the cleavage domain or half-cleavage domain comprises a type IIS restriction enzyme.
170. The plurality of nucleic acids of claim 169, wherein the type IIS restriction enzyme comprises Fokl or Bfil.
171. The plurality of nucleic acids of claim 167, wherein the chromatin modifying protein is lysine -specific histone demethylase 1 (LSD1).
172. The plurality of nucleic acids of claim 166, wherein the transcriptional activator comprises VP 16, VP64, p65, p300 catalytic domain, TET1 catalytic domain, TDG, Ldbl self-associated domain, SAM activator (VP64, p65, HSF1), or VPR (VP64, p65, Rta).
173. The plurality of nucleic acids of claim 168, wherein the transcriptional repressor comprises KRAB, Sin3a, LSD1, SUV39H1, G9A (EHMT2), DNMT1, DNMT3A-DNMT3L, DNMT3B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, Rb, MeCP2, or a transcriptional repressor provided in claims 128-134.
174. The plurality of nucleic acids of claim 166, wherein the DNA nucleotide modifier is adenosine deaminase.
175. The plurality of nucleic acids of any of claims 165-174, wherein the target nucleic acid sequence is within a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
176. The plurality of nucleic acids of any of claims 162-175, wherein the DBD comprises a transcription activator-like effector (TALE).
177. The plurality of nucleic acids of any of claims 162-176, wherein the DBD comprises a DBD as set out in any one of claims 1-98.
178. A DNA binding domain and a functional domain or a DNA binding domain, a functional domain and a bridging construct encoded by the plurality of nucleic acids of nucleic acids of any one of claims 162-177.
179. A DNA binding domain and a functional domain as set forth in claim 162 (i)(A); or (i)(B); or a DNA binding domain, a bridging construct, and a functional domain as set forth in claim 162 (h)(A), (ii)(B), or (ii)(C).
180. A host cell comprising: (a) nucleic acids encoding the polypeptides as set forth in claim 162 (i)(A) or (i)(B); or (b) nucleic acids encoding the polypeptides as set forth in claim 162 (ii)(A), (h)(B), or (ii)(C).
181. A host cell comprising: (a) the polypeptides as set forth in claim 162 (i)(A) or (i)(B); or (b) the polypeptides as set forth in claim 162 (ii)(A), (ii)(B), or (ii)(C).
182. A kit comprising:
(a) nucleic acids encoding the polypeptides as set forth in claim 162 (i)(A) or (i)(B); or
(b) nucleic acids encoding the polypeptides as set forth in claim 162 (ii)(A), (ii)(B), or (ii)(C).
183. A kit comprising:
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (i)(A); and
(b) a second vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (i)(A); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (i)(B); and
(b) a second vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (i)(B).
184. A kit comprising:
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (ii)(A);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in claim 162 (ii)(A);and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (ii)(A); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (ii)(B);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in claim 162 (ii)(B); and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (ii)(B); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (ii)(C);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in claim 162 (ii)(C);and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (ii)(C).
185. A pharmaceutical composition comprising:
(a) nucleic acids encoding the polypeptides as set forth in claim 162 (i)(A) or (i)(B); or
(b) nucleic acids encoding the polypeptides as set forth in claim 162 (ii)(A), (ii)(B), or (ii)(C).
186. A pharmaceutical composition comprising:
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (i)(A); and
(b) a second vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (i)(A); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (i)(B); and
(b) a second vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (i)(B).
187. A pharmaceutical composition comprising:
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (ii)(A);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in claim 162 (ii)(A);and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (ii)(A); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (ii)(B);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in claim 162 (ii)(B); and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in claim 162
(ii)(B); or
(a) a first vector comprising a nucleic acid encoding the DBD set forth in claim 162 (ii)(C);
(b) a second vector comprising a nucleic acid encoding the bridging construct set forth in claim 162 (ii)(C);and
(c) a third vector comprising a nucleic acid encoding the functional domain set forth in claim 162 (ii)(C).
188. A pharmaceutical composition comprising the DBD and a functional domain or a DNA binding domain, a functional domain and a bridging construct of claim 178 and a pharmaceutically acceptable excipient.
189. A pharmaceutical composition comprising the host cell of claim 180 or 181 and a pharmaceutically acceptable excipient.
190. A method for modulating expression from a target gene in a cell, the method comprising:
(i) introducing into the cell a first nucleic acid encoding a DNA binding domain fused to a first member of a heterodimer pair and a second nucleic acid encoding a functional domain fused to a second member of the heterodimer pair; or
(ii) introducing into the cell a first nucleic acid encoding a DNA binding domain fused to a second member of a heterodimer pair and a second nucleic acid encoding a functional domain fused to a first member of the heterodimer pair; or
(iii) introducing into the cell a DNA binding domain fused to a first member of a heterodimer pair and a functional domain fused to a second member of the heterodimer pair; or
(iv) introducing into the cell a DNA binding domain fused to a second member of a heterodimer pair and a functional domain fused to a first member of the heterodimer pair, wherein the heterodimer pair is selected from one of the following heterodimer pairs:
37A, 37B; 13 A, 13B;
DHD37-BBB-A, DHD37-BBB-B;
DHD150-A, DHD150-B;
DHD154-A, DHD-154B;
37A, 9B;
13 A, 37B;
13 A, DHD150-B;
37A, DHD37-BBB-B; and
DHD37-BBB-A, 37B, wherein the DNA binding domain (DBD) dimerizes with the functional domain via dimerization of the members of the heterodimer pair and wherein binding of the DBD to a target nucleic acid sequence in the target gene results in modulation of expression of the target gene via the functional domain dimerized to the DBD.
191. A method of modulating expression of a target gene in a cell, the method comprising:
(i) introducing into a cell expressing a DNA binding domain (DBD) fused to a first member of a first heterodimer pair and a functional domain fused to a second member of a second heterodimer pair, a bridging construct comprising a second member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct; or
(ii) introducing into a cell expressing a DNA binding domain (DBD) fused to a second member of a first heterodimer pair and a functional domain fused to a second member of a second heterodimer pair, a bridging construct comprising a first member of the first heterodimer pair fused to a first member of the second heterodimer pair or a nucleic acid encoding the bridging construct; or
(iii) introducing into a cell expressing a DNA binding domain (DBD) fused to a first member of a first heterodimer pair and a functional domain fused to a first member of a second heterodimer pair, a bridging construct comprising a second member of the first heterodimer pair fused to a second member of the second heterodimer pair or a nucleic acid encoding the bridging construct, wherein the DBD and the functional domain dimerize indirectly via the bridging construct, wherein binding of the DBD to a target nucleic acid sequence in a target gene in the cell results in in modulation of expression of the target gene via the functional domain dimerized to the DBD via the bridging construct, wherein the first and second heterodimer pairs are different and are selected from the following heterodimer pairs:
37A, 37B; 13 A, 13B;
DHD37-BBB-A, DHD37-BBB-B;
DHD150-A, DHD150-B;
DHD154-A, DHD-154B;
37A, 9B;
13 A, 37B;
13 A, DHD150-B;
37A, DHD37-BBB-B; and
DHD37-BBB-A, 37B.
192. A method of reversing modulation of expression of a target gene in a cell expressing a
DNA binding domain (DBD) fused to a first member of a non-cognate heterodimer pair and a functional domain fused to a second member of the non-cognate heterodimer pair, wherein the DBD binds to a target nucleic acid sequence in a target gene and the functional domain dimerized to the DBD via dimerization of the members of the heterodimer pair modulates expression of the target gene, the method comprising introducing into the cell a disruptor which binds to either the first member or the second member with a higher binding affinity than the binding affinity between the first and second members, wherein non-cognate heterodimer pairs and the corresponding disruptor are selected from one of the following combinations:
193. The method of any one of claims 190-192, wherein the functional domain comprises an enzyme, a transcriptional activator, a transcriptional repressor, or a DNA nucleotide modifier.
194. The method of any one of claims 190-193, wherein the target nucleic acid sequence is within a PDCD 1 gene, a CTLA4 gene, a LAG3 gene, a TET2 gene, a ETLA gene, a HA VCR2 gene, a CCR5 gene, a CXCR4 gene, a TRA gene, a TRE gene, a E2M gene, an albumin gene, a HEE gene, a HEA1 gene, a TTR gene, a NR3C1 gene, a CD52 gene, an erythroid specific enhancer of the ECL11A gene, a CELE gene, a TGFER1 gene, a SERPINAl gene, a HEV genomic DNA in infected cells, a CEP290 gene, a DMD gene, a CFTR gene, or an IL2RG gene.
195. The method of any one of claims 190-194, wherein the DBD comprises a transcription activator-like effector (TALE).
EP20851042.0A 2019-08-07 2020-08-06 Compositions and methods for modulation of gene expression Pending EP4010004A4 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201962884028P 2019-08-07 2019-08-07
US201962898434P 2019-09-10 2019-09-10
US201962937011P 2019-11-18 2019-11-18
PCT/US2020/045174 WO2021026336A2 (en) 2019-08-07 2020-08-06 Compositions and methods for modulation of gene expression

Publications (2)

Publication Number Publication Date
EP4010004A2 true EP4010004A2 (en) 2022-06-15
EP4010004A4 EP4010004A4 (en) 2023-09-13

Family

ID=74502663

Family Applications (1)

Application Number Title Priority Date Filing Date
EP20851042.0A Pending EP4010004A4 (en) 2019-08-07 2020-08-06 Compositions and methods for modulation of gene expression

Country Status (2)

Country Link
EP (1) EP4010004A4 (en)
WO (1) WO2021026336A2 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230287391A1 (en) * 2020-08-14 2023-09-14 The Governing Council Of The University Of Toronto Krab fusion repressors and methods and compositions for repressing gene expression
CN114195902B (en) * 2021-10-18 2023-06-20 翌圣生物科技(上海)股份有限公司 Recombinant protein structural domain enhanced TET enzyme and whole genome DNA methylation detection method
CN114600837B (en) * 2022-04-15 2023-05-02 润康生物医药(苏州)有限公司 Animal model for granulocytopenia, construction method thereof and application of ikzf1 and cmyb in construction model

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6919204B2 (en) * 2000-09-29 2005-07-19 Sangamo Biosciences, Inc. Modulation of gene expression using localization domains
US20110239315A1 (en) * 2009-01-12 2011-09-29 Ulla Bonas Modular dna-binding domains and methods of use
EP2780460B1 (en) * 2011-11-16 2018-07-11 Sangamo Therapeutics, Inc. Modified dna-binding proteins and uses thereof
EA039384B1 (en) * 2012-08-29 2022-01-20 Сангамо Байосаенсез, Инк. Zinc finger protein for modulating expression of bcl11a gene and method of modulating expression of globin gene
WO2014059173A2 (en) * 2012-10-10 2014-04-17 Sangamo Biosciences, Inc. T cell modifying compounds and uses thereof
HRP20211706T1 (en) * 2013-11-13 2022-02-04 The Children's Medical Center Corporation Nuclease-mediated regulation of gene expression
WO2016019144A2 (en) * 2014-07-30 2016-02-04 Sangamo Biosciences, Inc. Gene correction of scid-related genes in hematopoietic stem and progenitor cells
US20190255107A1 (en) * 2015-10-09 2019-08-22 The Brigham And Women's Hospital, Inc. Modulation of novel immune checkpoint targets
UA126373C2 (en) * 2015-12-04 2022-09-28 Новартіс Аг Compositions and methods for immunooncology
AU2019256517A1 (en) * 2018-04-18 2020-11-12 Altius Institute For Biomedical Sciences Animal pathogen-derived polypeptides and uses thereof for genetic engineering

Also Published As

Publication number Publication date
WO2021026336A3 (en) 2021-03-11
EP4010004A4 (en) 2023-09-13
WO2021026336A2 (en) 2021-02-11

Similar Documents

Publication Publication Date Title
JP6995151B2 (en) synTac polypeptide and its use
US20210077566A1 (en) Peptides Having Anti-Inflammatory Properties
EP4010004A2 (en) Compositions and methods for modulation of gene expression
JP6849600B6 (en) Chimeric antigen receptor, composition and method
JP7048494B2 (en) Conditional active chimeric antigen receptor for modified T cells
CN108948177B (en) Fusion molecules and IL-15 variants
IL265920B1 (en) Rationally-designed synthetic peptide shuttle agents for delivering polypeptide cargos from an extracellular space to the cytosol and/or nucleus of a target eukaryotic cell, uses thereof, methods and kits relating to same
EA030147B1 (en) Bispecific t cell activating antigen binding molecules
CN107847583A (en) PEGylated Interleukin 10 for treating cancer
CN106795548A (en) Found and Production conditions active biological protein in identical eukaryotic produces host
KR20210126073A (en) Compositions and methods for the identification of antigen-specific T cells
KR102653906B1 (en) Biased IL2 mutein methods and compositions
KR20220104204A (en) PRAME TCR receptor and uses thereof
Lim et al. Cell type preference of a novel human derived cell-permeable peptide dNP2 and TAT in murine splenic immune cells
JP2023511274A (en) IL2 orthologs and usage
EP3390454A1 (en) A recombinant bi-specific polypeptide for coordinately activating tumor-reactive t-cells and neutralizing immune suppression
US20210115093A1 (en) Animal pathogen-derived polypeptides and uses thereof for genetic engineering
JP2021536256A (en) Conditional active chimeric antigen receptor for modified T cells
Futami et al. Design of cytotoxic ribonucleases by cationization to enhance intracellular protein delivery
US20220290188A1 (en) Compositions and Methods for Modulation of Gene Expression
Giang et al. Affibody-based hBCMA x CD16 dual engagers for NK cell-mediated killing of multiple myeloma cells
CN114181315B (en) Endosome escape peptide and application thereof
AU2021356243A1 (en) Compositions and methods for simultaneously modulating expression of genes
KR20230111187A (en) PRAME specific T-cell receptors and uses thereof
CN114790225A (en) Novel endosome escape peptide and application thereof

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20220210

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

RAP3 Party data changed (applicant data changed or rights of an application transferred)

Owner name: ALTIUS INSTITUTE FOR BIOMEDICAL SCIENCES

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20230814

RIC1 Information provided on ipc code assigned before grant

Ipc: C12N 9/22 20060101ALI20230808BHEP

Ipc: C07K 14/705 20060101ALI20230808BHEP

Ipc: C07K 14/725 20060101ALI20230808BHEP

Ipc: C12N 15/85 20060101ALI20230808BHEP

Ipc: C12N 15/00 20060101ALI20230808BHEP

Ipc: C07K 19/00 20060101ALI20230808BHEP

Ipc: C07K 14/195 20060101ALI20230808BHEP

Ipc: A61K 38/00 20060101AFI20230808BHEP