Genetic mutations can be benign or cancerous

A new method to differentiate between them could lead to better treatments

By Ryan Layer

June 12, 2022

Most of the roughly 40 trillion cells of your body have nearly identical copies of your genome — the DNA inherited from your parents, containing instructions for everything from converting food to energy to fighting off infections. Healthy cells become cancerous through harmful mutations in the genome. If a cell’s genome is damaged by ultraviolet light, for example, it can result in mutations that tell the cell to grow uncontrollably and form a tumor.

Identifying the genetic changes that cause healthy cells to become malignant can help doctors select therapies that specifically target the tumor. For example, about 25% of breast cancers are HER2-positive, meaning the cells in this type of tumor have mutations that cause them to produce more of a protein called HER2 that helps them grow. Treatments that specifically target HER2 have dramatically increased survival rates for this type of breast cancer.

Scientists can now readily read cell DNA to identify mutations. The challenge is that the human genome is massive, and mutations are a normal part of evolution. The human genome is long enough to fill a 1.2 million-page book, and any two people can have about 3 million genetic differences. Finding one cancer-driving mutation in a tumor is like finding a needle in a stack of needles.

I am a computer scientist who explores large and complex genetic data sets to answer fundamental questions about biology and disease. My research team and I recently published a study using DNA from thousands of healthy people to help identify disease-causing mutations by using the principle of natural selection.

Using big data to find cancerous mutations

When determining what type of cancer mutation a patient has, the gold standard is to compare two samples from the patient: one from the tumor and one from healthy tissue (typically blood). Since both samples came from the same person, most of their DNA is identical; focusing only on the genetic regions that differ between the two drastically narrows the search for a possible cancer-causing mutation.
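As a rough illustration of this tumor-versus-healthy comparison, the Python sketch below treats each sample as a set of variants and keeps only those found in the tumor. The variant representation, the function name `tumor_only_variants` and the sample calls are all invented for illustration; real pipelines work from sequencing reads and must account for errors and coverage.

```python
# A variant is modeled as (chromosome, position, reference base, alternate base).
# This representation and the calls below are hypothetical examples.
Variant = tuple[str, int, str, str]


def tumor_only_variants(tumor: set[Variant], normal: set[Variant]) -> set[Variant]:
    """Return variants seen in the tumor but not in the matched healthy tissue."""
    return tumor - normal


tumor_calls = {
    ("chr1", 1000, "A", "G"),  # inherited: also present in healthy tissue
    ("chr2", 5000, "T", "C"),  # present only in the tumor
    ("chr7", 250, "G", "A"),   # inherited: also present in healthy tissue
}
normal_calls = {
    ("chr1", 1000, "A", "G"),
    ("chr7", 250, "G", "A"),
}

candidates = tumor_only_variants(tumor_calls, normal_calls)
print(sorted(candidates))
```

Only the variant on chr2 survives the comparison, which is the narrowing effect described above.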

The problem is that healthy tissue isn’t always collected from patients, for reasons ranging from clinical costs to narrow research protocols.

One way to get around this is to look at massive public DNA databases. Since cancer-driving mutations are detrimental to survival, natural selection tends to eliminate them over time in successive generations. Of all the mutations in a tumor, the ones that occur less frequently in a given population are more likely to be harmful than changes that are shared by many people. By counting how often a mutation occurs in these databases, researchers can distinguish between genetic changes that are common and likely benign and those that are rare and potentially cancerous.
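The counting idea can be sketched in a few lines of Python. Everything specific here is an assumption made for illustration: the function names, the made-up counts and especially the 0.1% cutoff, which is not a value taken from the study.

```python
def allele_frequency(observed_alleles: int, total_alleles: int) -> float:
    """Fraction of sampled chromosomes that carry the variant."""
    if total_alleles <= 0:
        raise ValueError("total_alleles must be positive")
    return observed_alleles / total_alleles


def classify(frequency: float, rare_threshold: float = 0.001) -> str:
    """Label a variant by population frequency (the threshold is an assumed cutoff)."""
    if frequency < rare_threshold:
        return "rare, potentially harmful"
    return "common, likely benign"


# Hypothetical counts: (alleles carrying the variant, alleles sampled in the database).
database_counts = {
    "variant_A": (1500, 5000),
    "variant_B": (1, 5000),
    "variant_C": (0, 5000),
}

labels = {
    name: classify(allele_frequency(seen, total))
    for name, (seen, total) in database_counts.items()
}
for name, label in labels.items():
    print(name, label)
```

A variant carried by 30% of sampled chromosomes is labeled common, while one seen once or never is flagged as a candidate worth a closer look.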

Image: One cancer-driving mutation can lead to a cascade of other mutations that lead to uncontrollable cell division. (Credit: National Cancer Institute on Wikimedia Commons)

Given the power of this approach, there has been a recent surge of projects to collect and share the DNA sequences from hundreds to thousands of individuals. These projects include the 1000 Genomes Project, Simons Genome Diversity Project, gnomAD and All of Us. There will likely be many more in the future.

Estimating how likely a mutation is to cause disease from how frequently it appears in a population is common for small genetic changes called single-nucleotide variants (SNVs). An SNV affects just one position in the 3-billion-nucleotide human genome. It could, for example, switch one thymine (T) to a cytosine (C).

Most researchers and clinical pathologists use a catalog of variants that have been detected across thousands of samples. If an SNV identified in a tumor is not listed in the catalog, we can assume that it’s rare and possibly drives cancer. This works well for SNVs because detection of these mutations is usually accurate, with few false negatives.

However, this process breaks down for genetic changes across longer strands of DNA called structural variants (SVs). SVs are more complex because they include the addition, removal, inversion or duplication of sequences. Compared to much simpler SNVs, SVs have higher error rates in detection. False negatives are relatively frequent, resulting in incomplete catalogs that make comparing mutations against them difficult. Finding a tumor SV that isn’t listed in a catalog could mean that it’s rare and a cancer-driving candidate, or that it was missed when the catalog was created.

Focusing on verification

My colleagues and I solved these problems by moving from a process focused on detection to one that focuses on verification. Detection is difficult — it requires processing complex data to determine if there is enough evidence to support the existence of a mutation. On the other hand, verification limits decision-making just to whether or not the evidence at hand supports the existence of a specific event. Instead of looking for a needle in a stack of needles, we are now simply considering whether the needle we have is the one we want.

Our method applies this strategy by searching through raw data from thousands of DNA samples for any evidence supporting a specific SV. Looking only at the data flanking the target variant makes the search efficient, and if no such evidence turns up, we can confidently conclude that the target variant is rare and potentially disease-causing.
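A toy version of verification might look like the Python sketch below. It is not the published implementation: it assumes the target SV is a deletion, scans a small in-memory list instead of an index, and uses an invented record type (`DiscordantPair`), an invented function (`supporting_evidence`) and an assumed 500-base flanking window.

```python
from collections import Counter
from typing import NamedTuple


class DiscordantPair(NamedTuple):
    """A read pair whose two ends map unexpectedly far apart (hypothetical record)."""
    sample: str
    chrom: str
    left_end: int   # mapped position of the upstream read
    right_end: int  # mapped position of the downstream read


def supporting_evidence(pairs, chrom, start, end, window=500):
    """Count, per sample, the pairs whose ends flank the queried deletion."""
    counts = Counter()
    for pair in pairs:
        if pair.chrom != chrom:
            continue
        near_left = start - window <= pair.left_end <= start
        near_right = end <= pair.right_end <= end + window
        if near_left and near_right:
            counts[pair.sample] += 1
    return counts


# Invented data for three samples.
pairs = [
    DiscordantPair("sample1", "chr1", 900, 2100),
    DiscordantPair("sample1", "chr1", 950, 2050),
    DiscordantPair("sample2", "chr1", 980, 2200),
    DiscordantPair("sample3", "chr1", 5000, 9000),  # unrelated to either query
]

seen = supporting_evidence(pairs, "chr1", 1000, 2000)
unseen = supporting_evidence(pairs, "chr1", 20000, 21000)
print(dict(seen))    # evidence found in two samples
print(dict(unseen))  # no evidence in any sample
```

The first query finds supporting pairs in two of the three samples, suggesting an inherited variant. The second finds none, which is the signal that a variant is rare.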

Using our method, we scanned the SVs identified in prior cancer studies and found that thousands of SVs previously associated with cancers also appear in normal healthy samples. This indicates that these variants are more likely to be benign, inherited sequences rather than disease-causing ones.

Most importantly, our method performed just as well as the traditional strategy that requires both tumor and healthy samples, opening the door to reducing the cost and increasing the accessibility of high-quality cancer mutation analysis.

My team and I are exploring ways to expand our searches to include large collections of tumors from different types of cancers, such as breast and lung. Determining which organ a tumor originated from is critical to prognosis and treatment because it can indicate whether the cancer has metastasized. Because most tumors have specific mutational signatures, recovering evidence of an SV within a specific tumor sample could help identify the patient’s tumor type and lead to faster treatment.

Figure 1 of the paper (Chowdhury et al. in Nature Methods). a,b, The STIX indexing and query process for three samples and a polymorphic deletion. a, A small number of the alignments that tile the genomes are discordant (designated by a dotted line connecting read pairs) because of either an SV or other nonspecific causes (for example, mapping artifacts). b, Discordant alignments are extracted from all samples and indexed using GIGGLE. Query SVs are mapped to alignments that reside in both regions and are aggregated and returned. The first query returns three alignments in two samples and the second returns zero alignments. c, The distribution of evidence depths for a deletion searched in the SGDP cohort through the http://stix.colorado.edu interface.

This article is republished from The Conversation under a Creative Commons license. Read the original article.

Ryan Layer

Ryan Layer is an assistant professor of computer science at the University of Colorado Boulder.
