Chemically modified proteins are invaluable tools for studying the molecular details of biological processes, and they also hold great potential as new therapeutic agents. Several methods have been developed for the site-specific modification of proteins, one of the most widely used being expressed protein ligation (EPL) in which a recombinant α-thioester is ligated to an N-terminal Cys-containing peptide. Despite the widespread use of EPL, the generation and isolation of the required recombinant protein α-thioesters remain challenging. We describe here a new method for the preparation and purification of recombinant protein α-thioesters using engineered versions of naturally split DnaE inteins. This family of autoprocessing enzymes is closely related to the inteins currently used for protein α-thioester generation, but they feature faster kinetics and are split into two inactive polypeptides that need to associate to become active. Taking advantage of the strong affinity between the two split intein fragments, we devised a streamlined procedure for the purification and generation of protein α-thioesters from cell lysates and applied this strategy for the semisynthesis of a variety of proteins including an acetylated histone and a site-specifically modified monoclonal antibody.
The Chlamydomonas reinhardtii chloroplast-localized poly(A)-binding protein RB47 is predicted to contain a non-conserved linker (NCL) sequence flanked by highly conserved N- and C-terminal sequences, based on the corresponding cDNA. RB47 was purified from chloroplasts in association with an endoribonuclease activity, however, protein sequencing failed to detect the NCL. Furthermore, while recombinant RB47 including the NCL did not display endoribonuclease activity in vitro, versions lacking the NCL displayed strong activity. Both full-length and shorter forms of RB47 could be detected in chloroplasts, with conversion to the shorter form occurring in chloroplasts isolated from cells grown in the light. This conversion could be replicated in vitro in chloroplast extracts in a light-dependent manner, where epitope tags and protein sequencing showed that the NCL was excised from a full-length recombinant substrate, together with splicing of the flanking sequences. The requirement for endogenous factors and light differentiates this protein splicing from autocatalytic inteins, and may allow the chloroplast to regulate the activation of RB47 endoribonuclease activity. We speculate that this protein splicing activity arose to post-translationally repair proteins that had been inactivated by deleterious insertions or extensions.
An intein from Halobacterium salinarum can be isolated as an unspliced precursor protein with exogenous exteins after Escherichia coli over-expression. The intein promotes protein splicing and uncoupled N-terminal cleavage in vitro, conditional on incubation with NaCl or KCl at concentrations greater than 1.5 M. The protein splicing reaction also is conditional on reduction of a disulfide bond between two active site cysteines. Conditional protein splicing under these relatively mild conditions may lead to advances in intein-based biotechnology applications and hints at the possibility that this H. salinarum intein could serve as a switch to control extein activity under physiologically relevant conditions.
Inteins, also called protein introns, are self-splicing mobile elements found in all domains of life. A bioinformatic survey of genomic data highlights a biased distribution of inteins among functional categories of proteins in both bacteria and archaea, with a strong preference for a single network of functions containing replisome proteins. Many non-orthologous, functionally equivalent replicative proteins in bacteria and archaea carry inteins, suggesting a selective retention of inteins in proteins of particular functions across domains of life. Inteins cluster not only in proteins with related roles, but also in specific functional units of those proteins, like ATPase domains. This peculiar bias does not fully fit the models describing inteins exclusively as parasitic elements. In such models, evolutionary dynamics of inteins is viewed primarily through their mobility with the intein homing endonuclease (HEN) as the major factor of intein acquisition and loss. Although the HEN is essential for intein invasion and spread in populations, HEN dynamics does not explain the observed biased distribution of inteins among proteins in specific functional categories. We propose that the protein splicing domain of the intein can act as an environmental sensor that adapts to a particular niche and could potentially increase the chance of the intein becoming fixed in a population. We argue that selective retention of some inteins might be beneficial under certain environmental stresses, to act as panic buttons that reversibly inhibit specific networks, consistent with the observed intein distribution.
Inteins are intervening proteins that undergo an autocatalytic splicing reaction that ligates flanking host protein sequences termed exteins. Some intein-containing proteins have evolved to couple splicing to environmental signals; this represents a new form of posttranslational regulation. Of particular interest is RadA from the archaeon Pyrococcus horikoshii, for which long-range intein-extein interactions block splicing, requiring temperature and single-stranded DNA (ssDNA) substrate to splice rapidly and accurately. Here, we report that splicing of the intein-containing RadA from another archaeon, Thermococcus sibericus, is activated by significantly lower temperatures than is P. horikoshii RadA, consistent with differences in their growth environments. Investigation into variations between T. sibericus and P. horikoshii RadA inteins led to the discovery that a nonconserved region (NCR) of the intein, a flexible loop where a homing endonuclease previously resided, is critical to splicing. Deletion of the NCR leads to a substantial loss in the rate and accuracy of P. horikoshii RadA splicing only within native exteins. The influence of the NCR deletion can be largely overcome by ssDNA, demonstrating that the splicing-competent conformation can be achieved. We present a model whereby the NCR is a flexible hinge which acts as a switch by controlling distant intein-extein interactions that inhibit active site assembly. These results speak to the repurposing of the vestigial endonuclease loop to control an intein-extein partnership, which ultimately allows exquisite adaptation of protein splicing upon changes in the environment.IMPORTANCE Inteins are mobile genetic elements that interrupt coding sequences (exteins) and are removed by protein splicing. They are abundant elements in microbes, and recent work has demonstrated that protein splicing can be controlled by environmental cues, including the substrate of the intein-containing protein. Here, we describe an intein-extein collaboration that controls temperature-induced splicing of RadA from two archaea and how variation in this intein-extein partnership results in fine-tuning of splicing to closely match the environment. Specifically, we found that a small sequence difference between the two inteins, a flexible loop that likely once housed a homing endonuclease used for intein mobility, acts as a switch to control intein-extein interactions that block splicing. Our results argue strongly that some inteins have evolved away from a purely parasitic lifestyle to control the activity of host proteins, representing a new form of posttranslational regulation that is potentially widespread in the microbial world.
Biologics, such as antibody-drug conjugates, are becoming mainstream therapeutics. Consequently, methods to functionalize biologics without disrupting their native properties are essential for identifying, characterizing, and translating candidate biologics from the bench to clinical practice. Here, we present a method for site-specific, carboxy-terminal modification of single-chain antibody fragments (scFvs). ScFvs displayed on the surface of yeast were isolated and functionalized by combining intein-mediated expressed protein ligation (EPL) with inverse electron-demand Diels-Alder (IEDDA) cycloaddition using a styrene-tetrazine pair. The high thiol concentration required to trigger EPL can hinder the subsequent chemoselective ligation reactions; therefore, the EPL reaction was used to append styrene to the scFv, limiting tetrazine exposure to damaging thiols. Subsequently, the styrene-functionalized scFv was reacted with tetrazine-conjugated compounds in an IEDDA cycloaddition to generate functionalized scFvs that retain their native binding activity. Rapid functionalization of yeast surface-derived scFv in a site-directed manner could find utility in many downstream laboratory and pre-clinical applications.
- Proceedings of the National Academy of Sciences of the United States of America
- Published over 1 year ago
The facile rearrangement of “S-acyl isopeptides” to native peptide bonds viaS,N-acyl shift is central to the success of native chemical ligation, the widely used approach for protein total synthesis. Proximity-driven amide bond formation via acyl transfer reactions in other contexts has proven generally less effective. Here, we show that under neutral aqueous conditions, “O-acyl isopeptides” derived from hydroxy-asparagine [aspartic acid-β-hydroxamic acid; Asp(β-HA)] rearrange to form native peptide bonds via anO,N-acyl shift. This process constitutes a rare example of anO,N-acyl shift that proceeds rapidly across a medium-size ring (t1/2∼ 15 min), and takes place in water with minimal interference from hydrolysis. In contrast to serine/threonine or tyrosine, which formO-acyl isopeptides only by the use of highly activated acyl donors and appropriate protecting groups in organic solvent, Asp(β-HA) is sufficiently reactive to formO-acyl isopeptides by treatment with an unprotected peptide-αthioester, at low mM concentration, in water. These findings were applied to an acyl transfer-based chemical ligation strategy, in which an unprotectedN-terminal Asp(β-HA)-peptide and peptide-αthioester react under aqueous conditions to give a ligation product ultimately linked by a native peptide bond.
Harnessing and controlling self-assembly is an important step in developing proteins as novel biomaterials. With this goal, here we report the design of a general genetically programmed system that covalently concatenates multiple distinct protein domains into specific assembled arrays. It is driven by iterative intein-mediated Native Chemical Ligation (NCL) under mild native conditions. The system uses a series of initially inert recombinant protein fusions that sandwich the protein modules to be ligated between one of a number of different affinity tags and an intein protein domain. Orthogonal activation at opposite termini of compatible protein fusions, via protease and intein cleavage, coupled with sequential mixing directs an irreversible and traceless stepwise assembly process. This gives total control over the composition and arrangement of component proteins within the final product, enabled the limits of the system - reaction efficiency and yield - to be investigated and led to the production of “functional” assemblies.
Semisynthesis of proteins via expressed protein ligation is a widely applicable method, even more so because of the possibility of ligation at non-cysteine sites using β-mercapto amino acids that can be converted to the corresponding native amino acids by desulfurization. A drawback of this ligation- desulfurization approach is the removal of any unprotected native cysteine residues within the ligated protein segments. Here, we show that the phenacyl (PAc) moiety can be successfully used to protect cysteines within recombinantly generated protein segments. As such, this group was selectively appended onto cysteine side chains within bacterially expressed polypeptides following intein cleavage, which reveals a rather sensitive thioester at the C-terminus. The PAc group proved to be compatible with native chemical ligation, radical desulfurization, and reverse-phase HPLC conditions, and was smoothly removed at the end. The utility of the PAc protecting group was then demonstrated by the ‘traceless’ semisynthesis of two proteins containing one or two native cysteines: human small heat shock protein Hsp27 and murine prion protein.
A sizeable fraction of the selenoproteome encodes oxidoreductases possessing a thioredoxin fold, a structural motif that is shared among a diverse group of enzymes. In these oxidoreductases, the active site is comprised of a cysteine and a selenocysteine separated by one to two amino acids. In a subset of these selenoproteins, such as human SELENOH, SELENOM, SELENOT, SELENOV, SELENOW, and SELENOF, this redox motif is positioned immediately after the first β-sheet in a short loop, and is essential for interactions with its substrate or partners. Here, we describe the preparation of a representative member of this group, SELENOM, by selenocysteine-driven expressed protein ligation. The preparation employs a peptide bond formation between two protein fragments expressed recombinantly in E. coli. This method can be employed to prepare other selenoproteins.