"Integration by Design: Potential deleterious retrovirus insertions fueled investigation into the mechanism of insertion site selection."
Suzanne Sandmeyer*
Department of Biological Chemistry, College of Medicine, University of California, Irvine, CA 92697-1700
*E-mail: sbsandme@uci.edu
As goes history, so goes research: this year, activity in areas of
retrovirus research related only indirectly have provoked events that are
notable when considered together. Last summer it was reported that a patient
in one X-linked severe combined immunodeficiency retroviral vector gene
therapy trial had developed leukemia. Now disquietingly, there has been
a second such event, and a third patient is reported to have a vector insertion
near the same gene (LMO2) as observed in the other two individuals
(1). Meanwhile, in a basic research laboratory, experiments
have moved us another step closer to understanding the mechanics of insertion
specificity for retrovirus-type integrases (IN). As reported in this issue
of PNAS, investigators have produced active retroviruslike elements with
synthetic insertion specificities (2). Dan Voytas and
colleagues at Iowa State University (Ames) study the Saccharomyces
long terminal repeat (LTR)-retrotransposon Ty5, which targets
heterochromatic regions (3). Now, in an elegant
adaptation of the two-hybrid system, the 6-aa Ty5 targeting domain (TD)
was exchanged for two heterologous domains shown to mediate interaction
of their respective proteins with protein partners. When domains from those
partners were produced fused to the LexA DNA-binding domain, targeting
to LexA-binding sites was observed. Although integration specificity in
the system was by no means absolute, these results are of interest to genetic
engineers and future gene therapists.
Interest in the integration patterns of retroviruses is longstanding.
Despite the potential danger of deleterious activating or even inactivating
insertions, retroviruses present compelling advantages as therapy vectors
(reviewed in ref. 4). Early investigations of oncogenic
retrovirus insertion sites in transformed cells
showed that insertions were linked to activation of f lanking oncogenes
or DNaseI hypersensitive sites, leading to the notion that insertion into
open chromatin was favored (reviewed in ref. 5; see also
refs. 6 and 7). The potential for deleterious retrovirus vector insertions
fueled investigation into the mechanistic basis of insertion site selection.
Development of PCR assays with which significant numbers of retrovirus
integration sites could be mapped showed that genomes are broadly accessed
by retroviruses, but that there are decidedly nonrandom patterns as well
(8). More recently, large numbers of HIV type 1 (HIV-1)
insertions have been mapped and
compared with genomewide transcription patterns to globally probe
the relationship between gene expression and retrovirus integration (9).
These experiments showed that HIV-1 insertion favors transcribed regions.
Nonetheless, the basis of the preference for transcribed regions has been
elusive, and examination of at least one transcribed region for effects
of transcriptional activity on integration activity have not shown a positive
correlation (10).
At the heart of retroviral integration is the IN. It is a member
of the D,D(35)E transposaseyIN superfamily named after its conserved catalytic
triad of amino acids. Because of its central role in the retrovirus lifecycle,
the function and structure of this enzyme has been studied extensively
(reviewed in refs. 1115). Retroviral IN mediates a
strand transfer of LTR DNA 39' OH ends to staggered positions in the host
DNA (16, 17). Combined evidence of many types shows
a retroviral IN with three physically distinct domains. An N-terminal domain
includes three a-helices and a zinc-binding motif. This domain has been
implicated in dimeriza-tion and in binding the LTR ends. The central domain
contains the conserved catalytic triad D,D(35)E. Members of this triad
coordinate a divalent metal cation, probably Mg 2+ in
vivo (15) and are essential for catalytic activity.
The
C-terminal domain contributes to oligomerization, has nonspecific
DNA-binding activity and is physically similar to the SH3 protein interaction
domain. No full-length IN structure has yet been determined at high resolution.
In vivo a retroviral preintegration complex composed of IN
bound to the ends of the full-length DNA mediates
integration into host DNA. Isolation first of preintegration complexes
from infected cells and then production of
active, recombinant IN allowed examination of the effect of different
target features on integration in vitro. A generalization that has
emerged from studies conducted in several laboratories is that bending
of DNA favors integration (18), as do hairpin structures
(19). The former occurs in nucleosomes, which, contrary
to expectations, were found to act as preferred targets over nonnucleosomal
DNA, both in vitro and in vivo (2022).
The relatively global distribution of retrovirus integration sites
stands in interesting contrast to the distinctive
insertion preferences of their LTR-retrotransposon cousins, the
Pseudoviridiae
(e.g., Ty1 and Ty5 copialike elements) (23) and the
Metaviridae
(e.g., Tf1 and Ty3 gypsylike elements) (24). IN proteins
encoded by these elements have the zinc-binding motif, the highly conserved
residues of the central domain and the poorly conserved C-terminal domain.
The IN proteins of the Pseudoviridae and the
Metaviridae
differ from each other in the C-terminal do-main where the Pseudoviridae
have a conserved GKGY motif (23), and the Metaviridae
have a conserved GPFyY motif. Some members of the Metaviridae also
have a chromodomain (24).
As a group, the yeast LTR retrotransposons have notable insertion
preferences. The specificity of Ty5 for heterochromatin is discussed further
below. In budding yeast, the Pseudoviridae Ty1, 2, and 4 reside
mostly within 750 bp of the 59 ends of tRNA genes (25, 26).
In
vivo insertions fall along a gradient beginning at about -80 bp from
the 5' coding end of the tRNA gene and extending upstream. Integration
appears to rise
and fall in a pattern which could correlate with some feature of
the nucleosome (27). The pattern of integration of the
Metaviridae
element Ty3 is even more restricted. The gene-proximal strand transfer
in this case occurs within one or two nucleotides of tRNA gene transcription
initiation sites. In vivo it is likely that transcription factors TFIIIB
and TFIIIC are essential for Ty3 targeting (2830).
Furthermore, it has been shown that yeast elements Ty14 target other genes
transcribed by RNA polymerase III with similar patterns to those
observed f lanking tRNA genes (27, 30).
In
vitro, Ty3 targeting to the U6 gene requires only TATA-binding protein
and Brf1 (29).
Observation of highly specific integration in yeast helped to motivate a series of experiments to confer novel insertion specificities on retrovirus IN proteins (reviewed in refs. 31 and 32). Recombinant retroviral IN has been expressed as a fusion with relatively compact DNA-binding domains including lambda repressor (33), LexA DNA-binding domain (34, 35), and the DNA-binding domain of Zif268 (36). Recombinant proteins have been shown to target in vitro integration to the respective DNA-binding sites of the fusion proteins. Disappointingly, these chimeric IN species, appear to be incompatible with high levels of infectious virus. Presumably this is caused by some failure to structurally accommodate the heterologous domain. To circumvent some of these problems, a strategy involving trans expression of IN has been used. In this variation, a fusion of HIV-1 structural protein p6 to an IN-LexA targeting domain directs IN to the virion and complements catalytically defective IN contributed from Gag-Pol (37, 38). However, there are no naturally occurring LexA-binding sites in mammalian cells, and targeting to synthetic sites has not yet been reported.
Ty5 is distinct among the yeast elements. Originally identified as a degenerate element at the ends of Saccharomyces cerevisiae chromosomes (39), the Voytas laboratory recovered an active copy from Saccharomyces paradoxus and transferred it into S. cerevisiae (3). In this context, they showed that Ty5 inserted into heterochromatic DNA (40). Mutations in Sir3p or Sir4p that disrupted silencing of telomeric DNA also resulted in loss of targeting to silenced regions (41). The pieces of the puzzle fell quickly into place. A targeting domain of 6 aa (TD), virtually at the C-terminus of Ty5 IN, was mapped, which was required for targeting (42) and which mediated interactions with a large C-terminal portion of Sir4p (43).
In the current article (2), the Voytas laboratory
accomplishes design-based integration. The strategy is outlined
in Fig. 1.
Many questions remain. For example, how does Ty5 access the DNA after docking at Sir4p? What is the distribution of the majority of (nontarget plasmid) Ty5 integrations? Do nonplasmid insertions default to random, to native Rad53p direction in the case of the Rad9p-based TD, or do natural, as yet unidentified, functions continue to operate on the Ty5 IN? Is it possible to generate integration that is more highly restricted, perhaps through the use of phage panning or slightly larger domains?
The experiments by Voytas suggest many new avenues for genome exploration. The occurrence of a compact and independent interaction domain in a retroviral-type IN of course poses the question of whether other such domains exist. In the case of Ty3, interactions between the N-terminal domain and TFIIIC subunit Tfc1p have been documented in vitro and are consistent with in vivo results (44). Ty3 also has a relatively extended C-terminal domain that could interact with targeting proteins including TFIIIB subunits, but this has not been demonstrated. It seems likely that the S. cerevisiae Pseudoviridae element Ty1 will be targeted by some feature of chromatin which distinguishes regions directly upstream of tRNA genes (27). An alignment of Metaviridae element IN C-terminal domains recently resulted in the identification of a chromodomain motif (24). Tf1, a Schizosaccharomyces pombe element of this class has been shown to insert in inter-ORF spaces, apparently with preference for the region within 100300 bp from the ORF initiation codon (45, 46). Results of recent experiments suggest that Tf1 integration is actually targeted through interaction of the chromodomain with histone H3 methylated at K4 (H. Levin, National Institutes of Health, Bethesda, personal communication). These observations are exciting because they not only hint at the subtlety and diversity of integration specificity, but suggest that integration can be used to learn about chromatin structure as well as to manipulate the genome.
It is not clear to what extent retroviral proteins will be shown
to interact with specific proteins for targeting in the manner observed
for the yeast LTR ret-rotransposons. The C-terminal domain of characterized
retroviral IN proteins has an SH3 structure and the SH3 motif mediates
a wide variety of protein interactions albeit mostly having to do with
signal transduction (47). In addition, it has been shown
that several chromatin-related
proteins enhance retroviral integration in vitro and potentially
in
vivo; one such case is INI1 (48), and another
is LEDGFyp75 (49). The recent findings in yeast
are likely to encourage further exploration for proteins that contribute
to the loosely defined preference of at least some retroviruses for insertion
into transcriptionally active regions and into particular hotspots.
What are the lessons that could be applied to better laboratory retrovirus
vectors, or even make safer therapeutic vectors? One observation, so obvious
it can hardly be considered a lesson, is that relatively subtle changes
are likely to be better tolerated by the virion. A second point is that
the known structure of the C-terminal domain of retroviral IN might be
used to identify positions actually within the IN, which are compatible
with replacements or insertions of small TD cassettes. The Ty5 study underscores
the findings from in vitro targeting studies with retroviral IN,
namely that the C-terminal domain can deliver active IN to the integration
site. Finally, although proteinprotein mediation of IN docking does not
have the reassuring simplicity of an IN that binds unique DNA sequences,
it offers the rich combinatorial complexity of the
natural proteome.
Clearly, much work remains to explore the mechanisms, implications, and applications of targeted retroviral integration. Integration by design in a model organism from the Voytas laboratory hints at the possibilities.
References:
1. Kaiser, J. (2003) Science 299, 991.
2. Zhu, Y., Dai, J., Fuerst, P. G. & Voytas, D. F. (2003) Proc. Natl. Acad. Sci. USA 100, 58915895.
3. Zou, S., Ke, N., Kim, J. M. & Voytas, D. F. (1996) Genes Dev. 10, 634645.
4. Galimi, F. & Verma, I. M. (2002) Curr. Top. Microbiol. Immunol. 261, 245254.
5. Sandmeyer, S. B., Hansen, L. J. & Chalker, D. L. (1990) Annu. Rev. Genet. 24, 491518.
6. Scherdin, U., Rhodes, K. & Breindl, M. (1990) J. Virol. 64, 907912.
7. Mooslehner, K., Karls, U. & Harbers, K. (1990) J. Virol. 64, 30563058.
8. Withers-Ward, E. S., Kitamura, Y., Barnes, J. P. & Coffin, J. M. (1994) Genes Dev. 8, 14731487.
9. Schroder, A. R., Shinn, P., Chen, H., Berry, C., Ecker, J. R. & Bushman, F. (2002) Cell 110, 521529.
10. Weidhaas, J. B., Angelichio, E. L., Fenner, S. & Coffin, J. M. (2000) J. Virol. 74, 83828389.
11. Haren, L., Ton-Hoang, B. & Chandler, M. (1999) Annu. Rev. Microbiol. 53, 245281.
12. Hindmarsh, P. & Leis, J. (1999) Microbiol. Mol. Biol. Rev. 63, 836843.
13. Wlodawer, A. (1999) Adv. Virus Res. 52, 335350.
14. Craigie, R. (2001) J. Biol. Chem. 276, 2321323216.
15. Rice, P. A. & Baker, T. A. (2001) Nat. Struct. Biol. 8, 302307.
16. Fujiwara, T. & Mizuuchi, K. (1988) Cell 54, 497 504.
17. Craigie, R. & Mizuuchi, K. (1985) Cell 41, 867 876.
18. Mu¨ller, H.-P. &Varmus, H. E. (1994) EMBO J. 13, 47044714.
19. Katz, R. A., Gravuer, K. & Skalka, A. M. (1998) J. Biol. Chem. 273, 2419024195.
20. Pryciak, P. M., Mu¨ller, H.-P. & Varmus, H. E. (1992) Proc. Natl. Acad. Sci. USA 89, 92379241.
21. Pryciak, P. M. & Varmus, H. E. (1992) Cell 69, 769780.
22. Pruss, D., Bushman, F. D. & Wolffe, A. P. (1994) Proc. Natl. Acad. Sci. USA 91, 59135917.
23. Peterson-Burch, B. D. &Voytas, D. F. (2002) Mol. Biol. Evol. 19, 18321845.
24. Malik, H. S. & Eickbush, T. H. (1999) J. Virol. 73, 51865190.
25. Ji, H., Moore, D. P., Blomberg, M. A., Braiterman, L. T., Voytas, D. F., Natsoulis, G. & Boeke, J. D. (1993) Cell 73, 10071018.
26. Kim, J. M., Vanguri, S., Boeke, J. D., Gabriel, A. & Voytas, D. F. (1998) Genome Res. 8, 464478.
27. Devine, S. E. & Boeke, J. D. (1996) Genes Dev. 10, 620633.
28. Kirchner, J., Connolly, C. M. & Sandmeyer, S. B. (1995) Science 267, 14881491.
29. Yieh, L., Kassavetis, G., Geiduschek, E. P. & Sand-meyer, S. B. (2000) J. Biol. Chem. 275, 2980029807.
30. Chalker, D. L. & Sandmeyer, S. B. (1992) Genes Dev. 6, 117128.
31. Bushman, F. D. (2002) Curr. Top. Microbiol. Immunol. 261, 165177.
32. Holmes-Son, M. L., Appa, R. S. & Chow, S. A. (2001) Adv. Genet. 43, 3369.
33. Bushman, F. (1994) Proc. Natl. Acad. Sci. USA 91, 92339237.
34. Goulaouic, H. & Chow, S. A. (1996) J. Virol. 70, 3746.
35. Katz, R. A., Merkel, G. & Skalka, A. M. (1996) Virology 217, 178190.
36. Bushman, F. D. & Miller, M. D. (1997) J. Virol. 71, 458464.
37. Holmes-Son, M. L. & Chow, S. A. (2000) J. Virol. 74, 1154811556.
38. Holmes-Son, M. L. & Chow, S. A. (2002) Mol. Ther. 5, 360370.
39. Voytas, D. F. & Boeke, J. D. (1992) Nature 358, 717.
40. Zou, S. & Voytas, D. F. (1997) Proc. Natl. Acad. Sci. USA 94, 74127416.
41. Zhu, Y., Zou, S., Wright, D. A. & Voytas, D. F. (1999) Genes Dev. 13, 27382749.
42. Gai, X. & Voytas, D. F. (1998) Mol. Cell 1, 10511055.
43. Xie, W., Gai, X., Zhu, Y., Zappulla, D. C., Sternglanz, R. & Voytas, D. F. (2001) Mol. Cell. Biol. 21, 66066614.
44. Aye, M., Dildine, S. L., Claypool, J. A., Jourdain, S. & Sandmeyer, S. B. (2001) Mol. Cell. Biol. 21, 78397851.
45. Behrens, R., Hayles, J. & Nurse, P. (2000) Nucleic Acids Res. 28, 47094716.
46. Singleton, T. L. & Levin, H. L. (2002) Eukaryot. Cell 1, 4455.
47. Mayer, B. J. (2001) J. Cell Sci. 114, 12531263.
48. Kalpana, G. V., Marmon, S., Wang, W., Crabtree, G. R. & Goff, S. P. (1994) Science 266, 20022006.
49. Cherepanov, P., Maertens, G., Proost, P., Devreese,
B., Van Beeumen, J., Engelborghs, Y., De Clercq, E. & Debyser, Z. (2003)
J. Biol. Chem. 278, 372381.
1. Frenster JH, "Model of Single-Stranded Integration of Oncogenic Viral Genomes".
2. Hovsepian JA, and Frenster JH, "RNA-Induced Melting of DNA during Selective Gene Transcription".
3. Frenster JH, "Ultrastructural Probes of Active
DNA Sites, and the RNA Activators of DNA".
Top of Page - Euchromatin Network - Current Research - Forums - Other Sites - Future Events -