Multi-component protein complexes play crucial roles during cellular function and development. Here we describe a method used to isolate native protein complexes from Drosophila embryos after in vivo crosslinking followed by purification of the crosslinked complexes for subsequent structure-function analysis.
Many cellular processes are controlled by multisubunit protein complexes. Frequently these complexes form transiently and require native environment to assemble. Therefore, to identify these functional protein complexes, it is important to stabilize them in vivo before cell lysis and subsequent purification. Here we describe a method used to isolate large bona fide protein complexes from Drosophila embryos. This method is based on embryo permeabilization and stabilization of the complexes inside the embryos by in vivo crosslinking using a low concentration of formaldehyde, which can easily cross the cell membrane. Subsequently, the protein complex of interest is immunopurified followed by gel purification and analyzed by mass spectrometry. We illustrate this method using purification of a Tudor protein complex, which is essential for germline development. Tudor is a large protein, which contains multiple Tudor domains – small modules that interact with methylated arginines or lysines of target proteins. This method can be adapted for isolation of native protein complexes from different organisms and tissues.
Isolation of multisubunit protein assemblies and DNA- or RNA-protein complexes is performed to identify protein complexes, genomic loci recognized by DNA-binding regulatory proteins or RNA targets of RNA binding proteins. Different methods allow genome-wide identification of DNA sites recognized by transcription factors or chromatin proteins (ChIP-seq)1 and RNA targets associated with a given RNA-binding protein (CLIP-seq)2. The libraries of RNA-derived cDNAs or DNA targets are then deeply sequenced. These methods use chemical or UV-induced cross-linking to stabilize the complexes followed by immunoprecipitation (IP) with an antibody against a protein component of the studied complex.
During development of an organism, many protein complexes form transiently. Therefore, it is crucial to analyze the composition and function of these complexes in vivo to understand the molecular mechanisms that control development. Such an in vivo analysis would be superior to in vitro approach since it is virtually impossible to reproduce native concentrations of the interacting components and cellular biochemical environment in vitro. Here we demonstrate an in vivo approach that we successfully use to isolate large protein complexes from Drosophila embryos. In this method, protein complexes in the living embryos are crosslinked with a low concentration of formaldehyde and subsequently protein complexes of interest are isolated by IP with an antibody against a known component of the complexes followed by gel purification of the complexes and mass spectrometry analysis to identify unknown complex components. Since formaldehyde is able to permeate the cell membrane and has a crosslinking range of 2.3-2.7 Å3, protein complexes can be crosslinked in vivo and the complex components are likely to be close to each other. In this article we describe this method using the isolation of Tudor (Tud) protein complex as an example. Tud is a germline protein which is essential for germline development4-7. This protein contains 11 Tud domains known to interact with methylated arginines or lysines of other polypeptides8-10.
Previously, we have generated a transgenic Drosophila line which expresses HA-tagged functional Tud5 and therefore, specific anti-HA antibody is used to pull down Tud complex after crosslinking.
In addition to protein-protein crosslinks, formaldehyde can generate nucleic acid-protein crosslinks and is used in ChIP-seq experiments. Furthermore, in Drosophila, in vivo crosslinking with formaldehyde has allowed the identification of an RNA target of Vasa RNA helicase protein11.
While in this article we describe a method for in vivo crosslinking and purification of protein complexes from Drosophila embryos, this method can be adapted for other organisms and tissues.
1. Preparing Large Apple Juice-agar Plates
2. Drosophila Embryo Collection and Pre-crosslinking Treatment
3. In vivo Crosslinking of Collected Embryos
4. Protein Sample Preparation and Immunoprecipitation
The effectiveness of the crosslinking and the successful purification of the crosslinked Tud protein complex were analyzed by SDS-PAGE on a 3% – 7% step gel (illustrated in Figure 1) followed by western blot (Figure 2).
The purpose of using the 3% – 7% step gel is based on the effective separation of crosslinked Tud protein complex from the remaining uncrosslinked Tud protein and concentration of the complex. Under our in vivo crosslinking conditions mentioned above, a large proportion of Tud protein could still remain uncrosslinked. This free Tud protein would copurify with the crosslinked complex during IP until separation by SDS-PAGE. Though large in size, uncrosslinked Tud protein (285 kDa) migrates across the 3% – 7% border without obvious retention. Whereas crosslinked Tud protein complex with larger molecular weight would have difficulty migrating across the gel border, resulting in the complex being "trapped" and concentrated at the border or "tailing" near the border and therefore, separating from the free Tud protein.
The inclusion of a proper control is critical to validate the specificity of this crosslinking protocol. Therefore, we used embryos expressing HA-tagged GFP protein14 to monitor the extent of the crosslinking. We have performed pilot experiments of different formaldehyde concentrations (0.1% – 2%) on embryos expressing HA-tagged GFP and part of the results (0.1% – 0.5%) are shown here (Figure 3). It is clear that the absolute majority of GFP protein in the treated embryos (up to 0.5% formaldehyde) remained uncrosslinked, which validated that our crosslinking protocol does not crosslink target protein to random surrounding molecules. In particular, the 0.2% formaldehyde treatment (Figure 3, lane 5 and 6) did not show any crosslinked products when compared to uncrosslinked control (Figure 3, lane 1 and 2) despite the extended exposure time of the film, which further justified our choice of 0.2% formaldehyde in this protocol. However, the GFP control embryos should always be crosslinked in parallel with the Tud complex crosslinking under the same conditions and control GFP protein samples should be run on the 3% – 7% step gel together with Tud complex samples following IP. In addition, both Tud complex and the corresponding GFP control 3% – 7% gel areas should be excised and submitted for mass spectrometry analysis to determine false positive candidates that will be present in both Tud complex band and the GFP control.
The effectiveness of the in vivo crosslinking was determined by comparing free Tud protein and Tud complex from uncrosslinked control and crosslinked embryos samples respectively (Figure 2A). Free Tud protein from uncrosslinked control embryos almost exclusively migrated across the 3% – 7% border with minimal amount retained near the border area (Figure 2A, lane 1). Contrary to that, the crosslinked embryos showed the presence of large molecular weight Tud crosslinked complex at the 3% – 7% gel border (Figure 2A, lane 2). This clearly showed that our crosslinking procedure successfully crosslinked approximately 50% of total Tud protein with its in vivo interacting partners (compare Figure 2A lane 1 and 2).
The competitive elution with HA-peptide confirmed the successful purification and strengthened the specificity of the purified Tud protein complex (Figure 2B). The elution fraction by HA-peptide contained the complex concentrated around the 3% – 7% border area and also free Tud protein (Figure 2B, lane 1). The following SDS sample buffer eluted mostly free Tud remaining on the beads and minimal amount of Tud complex (Figure 2B, lane 2).
Figure 1. Diagram of the 3% – 7% step gel used for SDS-PAGE. The 3% – 7% step gel is manually cast in such a way that it contains three layers. From top to the bottom, those layers are 3% stacking gel, 3% separating gel and 7% separating gel. To work with the Mini-PROTEAN system (Bio-Rad) used in this manuscript, the height of the 3% separating gel was made to be approximately 1 cm as depicted in the diagram.
Note: The 3% stacking gel and the 3% separating gel layers are extremely fragile so use extra caution when handling the gel to prevent distortion of these layers.
Figure 2. Western blot results of crosslinked embryos and control embryos lysates and purification of Tudor protein complex by IP. Panel A, lane 1, crude protein lysate of uncrosslinked control embryos; lane 2, crude protein lysate from formaldehyde crosslinked embryos. Panel B, lane 1, elution by HA-peptide after IP from crosslinked embryo lysate; lane 2, elution by 2 x SDS sample buffer after the initial HA-peptide elution. SDS-PAGE was run at 200 V for 3.5 hr. Anti-HA antibody (1:1,000) was used in western blot.
Figure 3. Western blot results of crosslinked embryos and control embryos expressing HA- tagged GFP. Lane 1, crude protein lysate of uncrosslinked control embryos; lane 2, same as lane 1 but twice the amount; lane 3, crude protein lysate of crosslinked embryos using 0.1% formaldehyde; lane 4, same as lane 3 but twice the amount; lane 5, crude protein lysate of crosslinked embryos using 0.2% formaldehyde; lane 6, same as lane 5 but twice the amount; lane 7, crude protein lysate of crosslinked embryos using 0.3% formaldehyde; lane 8, same as lane 7 but twice the amount. Lane 9, crude lysate of crosslinked embryos using 0.5% formaldehyde. SDS-PAGE was run at 200 V for 45 min in a 4% – 15% gradient gel. Anti-HA antibody (1:1,000) was used in western blot.
Formaldehyde has been commonly used as a crosslinking reagent for identifying protein-protein and protein-nucleic acid interactions. Its good solubility and cell membrane permeability, together with the compatibility with downstream mass spectrometry procedures, make formaldehyde an ideal candidate agent for intracellular crosslinking applications3,15-17. In particular, it was successfully used to identify mRNAs associated with Vasa, a critical germ cell RNA helicase in Drosophila11. In addition, formaldehyde has one of the shortest spacer arms (2.3 – 2.7 Å) among commercially available crosslinkers, which implies that only molecules that exist in close proximity would be crosslinked under proper conditions, strengthening the specificity of the interacting partners of the target protein. For the reasons stated above, formaldehyde was chosen as the crosslinking agent in our protocol. However, this should not exclude the usefulness of other crosslinking agents in in vivo application for Drosophila embryos. However, we have tried to use dithiobis(succinimidyl propionate) (DSP), an N-hydroxysuccinimide (NHS) ester family crosslinker frequently used in combination with formaldehyde in ChIP assays17,18, to replace formaldehyde in our study, but no apparent Tud complex was obtained with this crosslinker On the other hand, DSP might be effective for other protein complexes which will need to be determined on case-by-case basis. In addition to in vivo crosslinking approach, in vitro crosslinkers from bismaleimidohexane (BMH) family have been shown to be effective in capturing protein–protein interactions in Drosophila embryos14,19, however, in these experiments the crosslinking occurred after embryo extracts were made.
The crosslinking time, temperature and the formaldehyde concentration are the three major factors that influence crosslinking efficiency. Our protocol has been tested using pilot experiments for each individual factor, therefore, it has been optimized for the crosslinking of Tud protein and its interacting partners in Drosophila embryos in vivo, while maintaining the balance between the specificity of the crosslinked complex and the risk of crosslinking nonspecific molecules. Nonspecific crosslinking leads to the presence of false positives in the complex and must be carefully controlled with an unrelated protein control, for example GFP, which was used in our experiments. In addition, the optimal crosslinking conditions may vary with different target molecules. Therefore, using our protocol as a starting point, the optimal crosslinking time, temperature and the formaldehyde concentration will have to be determined for a given protein complex of interest.
Once the optimal crosslinking conditions are established it is critical to follow them precisely each time the embryos are crosslinked to minimize nonspecific crosslinking. Additional crucial step is washing the beads after IP with high concentration of NaCl (1 M) and two detergents (IGEPAL CA-630 and Tween-20) to reduce nonspecific binding of unrelated proteins from embryonic extracts to the beads.
Our protocol depends on the high affinity between the target protein and the antibody used to pull down the protein complex during IP. Therefore, the antibody's epitope in the target protein should not be destroyed by crosslinking and must be accessible for the antibody during IP. In addition, the antibody-target protein interaction should withstand the stringent washes aimed at the reduction of nonspecific background proteins. These requirements for successful IP might prevent one from employing a particular antibody-epitope combination, however, it should be possible to find a suitable antibody-epitope pair for a protein of interest. In particular, HA-tag epitope and the corresponding antibody have been successfully used in this protocol.
The described protocol can be used for the isolation of native protein complexes during different developmental stages and in different tissues. Although we demonstrate this method for Drosophila, our protocol can be adapted for other organisms.
The authors have nothing to disclose.
We thank Jordan Davis, Yanyan Lin, Eric Schadler and Jimiao Zheng for their technical help with this study. This work was supported by NSF CAREER grant MCB-1054962 to A.L.A.
Drosophila agar | Lab Scientific (http://www.labscientific.com/) | FLY-8020-1 | Do not autoclave the water and agar mix for prolonged period of time as it will cause the apple juice plates to become fragile |
Methyl 4-hydroxybenzoate | Sigma (http://www.sigmaaldrich.com/united-states.html) | H5501 | Other name: TEGOSEPT |
Population cage | Flystuff (http://www.flystuff.com/) | 59-104 | |
Fine nylon mesh | Flystuff (http://www.flystuff.com/) | 57-102 | When making the collection basket, cut the mesh slightly larger than the opening of the falcon tube cap to ensure a tight seal |
Dounce homogenizer | Sigma (http://www.sigmaaldrich.com/united-states.html) | D8938-1SET | Chill the homogenizer on ice and prerinse with cold lysis buffer before use to prevent protein degradation |
Protease inhibitor cocktail | Roche (http://www.rocheusa.com/portal/usa) | 4693132001 | PBS could be used to prepare concentrated protease inhibitor stock solution |
Anti-HA agarose beads | MBL international (http://www.mblintl.com/) | 561-8 | The kit also includes HA-peptide and spin columns |
HA-peptide | MBL international (http://www.mblintl.com/) | 561-8 | Prepare to 2 mg/ml with PBS |
Spin Column | MBL international (http://www.mblintl.com/) | 561-8 | Spin columns are included as part of the kit |
Isopropanol | Fisher Scientific (www.fishersci.com/) | BP26324 | |
Triton X-100 | Fisher Scientific (www.fishersci.com/) | BP151-500 | |
Heptane | Fisher Scientific (www.fishersci.com/) | H350-4 | |
PBS | Invitrogen (https://www.lifetechnologies.com/us/en/home.html) | AM9625 | Dilute from 10 X to 1 X with nanopure water before use |
Formaldehyde | Fisher Scientific (www.fishersci.com/) | BP531-500 | |
Glycine | BioRad (www.bio-rad.com/) | 161-0724 | |
SDS | BioRad (www.bio-rad.com/) | 161-0301 | |
Urea | BioRad (www.bio-rad.com/) | 161-0730 | |
Phenylmethanesulfonyl fluoride | Sigma (http://www.sigmaaldrich.com/united-states.html) | P7626-1G | Prepare 200 mM stock solution in isopropanol then dilute to working concentration of 2 mM in lysis buffer |
IGEPAL CA-630 | Sigma (http://www.sigmaaldrich.com/united-states.html) | I8896-50ML | |
Tween 20 | Fisher Scientific (www.fishersci.com/) | BP337-100 | |
15-ml tubes | USA Scientific (http://www.usascientific.com/) | 1475-0511 | |
Bleach | Clorox brand | Dilute with equal volume of nanopure water to make 50% bleach | |
50-ml tubes | BD Biosciences (http://www.bdbiosciences.com/home.jsp) | 352098 | |
Top layer sieve | Fisher Scientific (www.fishersci.com/) | 04-884-1AK | |
Bottom layer sieve | Fisher Scientific (www.fishersci.com/) | 04-884-1BA |