Random gene sets can predict breast cancer survival better than cancer-related signatures
Posted Jan 10 2012 12:07pm
Tumours are bundles of cells that grow and divide uncontrollably, and their genes are deployed in unusual ways. By analysing the genes from different tumour samples, scientists have tried to pin down the chaotic events that lead to cancer. They seem to be making headway. Dozens of papers have reported “gene expression signatures” that predict a patient’s odds of surviving cancer, and new ones come out every month.
These signatures purportedly hint at how healthy cells transform into tumours in the first place. If, for example, the genes in question are involved in wound healing, this tells you that the healing process is somehow involved in a tumour’s progression. On this view, the collections of genes reveal deeper truths about the disease they’re associated with.
This idea sounds reasonable, but David Venet from the Université Libre de Bruxelles has thrown a big spanner into the works. He has shown that completely random sets of genes can predict the odds of surviving breast cancer better than published signatures.
Venet found three signatures that are completely unconnected to cancer. Instead, these collections of genes were associated with laughing at jokes after lunch, with the experience of social defeat in mice, and with the positioning of skin cells. All of them were associated with breast cancer outcomes.
It got worse. Venet collected 47 breast cancer signatures from published papers and compared them to sets of random genes. The random sets were at least as strongly associated with breast cancer outcomes as 60% of the published ones. In fact, you can randomly select a group of 100 genes or more, and be 90% sure of finding a statistically significant link with breast cancer. Venet wrote, “Investigators are bound to find an association however whimsical their marker is.”
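For readers who like to see the logic spelled out, the comparison can be sketched in a few lines of Python. This is a minimal illustration, not the procedure from Venet's paper: the function name `empirical_p`, the `association` callback and the toy gene weights are all made up for this example. The idea is simply to ask how often a random gene set of the same size scores at least as well as the signature being tested.

```python
import random

def empirical_p(signature, all_genes, association, n_random=1000, seed=0):
    """Share of random gene sets (same size as `signature`) whose
    association score is at least as high as the signature's own.

    `association` is a hypothetical callback that maps a list of genes
    to a number; larger means a stronger link with patient outcome.
    """
    rng = random.Random(seed)
    observed = association(signature)
    as_good = sum(
        association(rng.sample(all_genes, len(signature))) >= observed
        for _ in range(n_random)
    )
    # Add one to numerator and denominator so the estimate is never zero.
    return (as_good + 1) / (n_random + 1)

# Toy data (an assumption for illustration): 100 genes, and gene g
# contributes a score of g to any set that contains it.
all_genes = list(range(100))

def toy_association(genes):
    return sum(genes)

strong_signature = [95, 96, 97, 98, 99]   # the five best genes
ordinary_signature = [48, 49, 50, 51, 52] # middling genes

p_strong = empirical_p(strong_signature, all_genes, toy_association)
p_ordinary = empirical_p(ordinary_signature, all_genes, toy_association)
print(p_strong, p_ordinary)
```

A signature only looks special if its value here is small. If random sets routinely match it, as with the middling set in the toy data, its link with outcome says little about the biology it was named after.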
Venet’s study was described as a “must-read” by F1000 member Jinfeng Liu from Genentech Inc. The results may seem unbelievable, but there is a simple reason for them. The activities of thousands of genes across a breast cancer cell’s genome are related to how quickly that cell proliferates (grows and divides). And that is related to a patient’s prognosis.
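A toy simulation makes this explanation concrete. The sketch below is not real data and not the analysis from the paper. It assumes a single hidden "proliferation" value per patient, a patient risk that follows it, and a made-up share of genes (`frac_linked`) whose activity tracks it. All names and numbers are hypothetical. It then counts how many random gene sets end up significantly correlated with risk.

```python
import math
import random

def pearson(xs, ys):
    """Plain Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)

def fraction_significant(n_patients=200, n_genes=1000, frac_linked=0.5,
                         set_size=100, n_sets=100, seed=0):
    """Fraction of random gene sets whose average expression is
    significantly correlated with patient risk in simulated data."""
    rng = random.Random(seed)
    # Hidden proliferation rate per patient; risk follows it plus noise.
    proliferation = [rng.gauss(0, 1) for _ in range(n_patients)]
    risk = [p + rng.gauss(0, 1) for p in proliferation]
    # A linked gene tracks proliferation; an unlinked gene is pure noise.
    loadings = [rng.uniform(0.3, 1.0) if rng.random() < frac_linked else 0.0
                for _ in range(n_genes)]
    expression = [[load * p + rng.gauss(0, 1) for p in proliferation]
                  for load in loadings]
    # Large-sample approximation of the |r| needed for p < 0.05.
    threshold = 1.96 / math.sqrt(n_patients)
    hits = 0
    for _ in range(n_sets):
        genes = rng.sample(range(n_genes), set_size)
        score = [sum(expression[g][i] for g in genes) / set_size
                 for i in range(n_patients)]
        if abs(pearson(score, risk)) > threshold:
            hits += 1
    return hits / n_sets

with_proliferation = fraction_significant(frac_linked=0.5)
without_proliferation = fraction_significant(frac_linked=0.0)
print(with_proliferation, without_proliferation)
```

When many genes follow proliferation, almost any random handful of them carries the same signal, so nearly every random set passes the significance test. Set `frac_linked` to zero and the hits should fall back to roughly the false-positive rate you would expect by chance.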
As an analogy, you could find hundreds of things that correlate with a person’s wellbeing and lifespan: the number of Apple products they own, whether they have university degrees, how many cars they have, and so on. But this doesn’t mean that these things improve our health; instead, they reflect how wealthy we are, our lifestyle choices, and our access to good healthcare.
Gene signatures may be relatively useless at illuminating the causes of cancer, but the team stresses that they can still help doctors – after all, they’re still related to prognosis. Writing in The Scientist, the study’s lead author Vince Detours says, “Smoke does not drive fire, yet it is a powerful indicator of when and where a fire is burning.”
Detours also aims a blow at scientific publishers who have let studies of genetic signatures proliferate uncontrollably. He wrote:
It took us four years and six rejections to get this work finally published in a computational biology journal – not the most efficient venue to reach the oncology community. Meanwhile, a steady stream of studies confounded by proliferation rates has appeared.
This has to be said; one can no longer stay silent about the rather limited self-correction capability of the top tier publishing system (Cell, Nature Genetics, PNAS, etc.), which promoted these studies in the first place.