Most brain science papers are neurotrash: Official

Don't believe everything you read

A group of academics from Oxford, Stanford, Virginia and Bristol universities have looked at a range of subfields of neuroscience and concluded that most of the results are statistically worthless.

The researchers found that most structural and volumetric MRI studies are very small and have minimal power to detect differences between compared groups (for example, healthy people versus those with mental health diseases). Their paper also stated that, specifically, a clear excess of "significance bias" (too many results deemed statistically significant) has been demonstrated in studies of brain volume abnormalities, and similar problems appear to exist in fMRI studies of the blood-oxygen-level-dependent response.

The team, researchers at Stanford Medical School, Virginia, Bristol and the Human Genetics dept at Oxford, looked at 246 neuroscience articles published in 2011 and excluded papers where the test data was unavailable. They found that the papers' median statistical power - the probability that a study will identify an effect when there is an effect there to be found - was just 21 per cent. What that means in practice is that if you were to run one of the experiments five times, you'd expect to find the effect only about once.
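The "one in five" arithmetic can be checked with a rough sketch. It assumes each repeat of an experiment is an independent trial that detects a real effect with probability equal to the study's power; the function names are illustrative and do not come from the paper.

```python
# Sketch only: treats each run of an experiment as an independent trial
# that detects a genuinely present effect with probability `power`.

def expected_detections(power: float, runs: int) -> float:
    """Average number of runs that detect the effect."""
    return power * runs


def chance_of_at_least_one(power: float, runs: int) -> float:
    """Probability that at least one of `runs` attempts detects the effect."""
    return 1.0 - (1.0 - power) ** runs


MEDIAN_POWER = 0.21  # median power reported in the article

if __name__ == "__main__":
    print(round(expected_detections(MEDIAN_POWER, 5), 2))
    print(round(chance_of_at_least_one(MEDIAN_POWER, 5), 2))
```

Under that assumption, five runs yield about one detection on average, and there is still roughly a three-in-ten chance that all five runs miss an effect that is really there.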

A further survey of studies using fMRI brain scanners - which have long filled the popular media with dramatic claims - found that their statistical power was just 8 per cent.

Low statistical power causes three problems, the authors said. Firstly, there is a low probability of finding true effects; secondly, there is a low probability that a statistically significant finding reflects a true effect; and thirdly, the magnitude of the effect is exaggerated when a positive is discovered.
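The second problem can be illustrated with the positive predictive value (PPV), the share of statistically significant findings that reflect a true effect. The sketch below uses the formula PPV = (power x R) / (power x R + alpha), where R is the pre-study odds that a tested effect is real. The value R = 0.25 (one real effect for every four nulls tested) is an assumption picked purely for illustration, as is the conventional alpha of 0.05.

```python
# Sketch only: PPV = (power * R) / (power * R + alpha).
# R (pre-study odds that a probed effect is real) is an assumed value
# here, not a figure taken from the article.

def positive_predictive_value(power: float,
                              pre_study_odds: float,
                              alpha: float = 0.05) -> float:
    """Probability that a statistically significant result is a true effect."""
    true_positives = power * pre_study_odds
    return true_positives / (true_positives + alpha)


ASSUMED_ODDS = 0.25  # hypothetical: 1 real effect per 4 null effects tested

if __name__ == "__main__":
    # 0.80 is the conventional design target; 0.21 and 0.08 are the
    # power figures quoted in the article.
    for power in (0.80, 0.21, 0.08):
        ppv = positive_predictive_value(power, ASSUMED_ODDS)
        print(f"power={power:.2f} -> PPV={ppv:.2f}")
```

With those assumed odds, a well-powered study (80 per cent) gives a PPV of 0.80, while power of 21 per cent gives roughly 0.51 and power of 8 per cent roughly 0.29. Different pre-study odds shift the numbers, but the ordering stays the same.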

There were further problems that led them to believe the power is even lower than they suggest. They noted:

[T]he summary effect size estimates that we used to determine the statistical power of individual studies are themselves likely to be inflated owing to bias — our excess of significance test provided clear evidence for this. Therefore, the average statistical power of studies in our analysis may in fact be even lower than the 8–31% range we observed.

Publishing is a highly competitive enterprise, with certain kinds of findings more likely to be published than others. Research that produces novel results, statistically significant results (that is, typically p < 0.05) and seemingly "clean" results is more likely to be published. As a consequence, researchers have strong incentives to engage in research practices that make their findings publishable quickly, even if those practices reduce the likelihood that the findings reflect a true (that is, non-null) effect.

The paper is titled Power failure: Why small sample size undermines the reliability of neuroscience and is published in the May 2013 edition of the journal Nature Reviews Neuroscience. The conclusions have wide implications for the field.

Button et al note that advances in computer processing have made crunching large data sets faster and easier, but the statistical rigour hasn't kept pace. They call for research to be fundamentally redesigned to maintain the credibility of neuroscience.

These dramatic advances in the flexibility of research design and analysis have occurred without accompanying changes to other aspects of research design, particularly power. For example, the average sample size has not changed substantially over time despite the fact that neuroscientists are likely to be pursuing smaller effects.

The increase in research flexibility and the complexity of study designs combined with the stability of sample size and search for increasingly subtle effects has a disquieting consequence: a dramatic increase in the likelihood that statistically significant findings are spurious. This may be at the root of the recent replication failures in the preclinical literature and the correspondingly poor translation of these findings into humans.

Kate Button, one of the authors behind the paper, has a nice article at the Guardian explaining the issues.

"The current reliance on small, low-powered studies is wasteful and inefficient, and it undermines the ability of neuroscience to gain genuine insight into brain function and behaviour. It takes longer for studies to converge on the true effect, and litters the research literature with bogus or misleading results," writes Button.

Demand for brain science has increased from policy wonks and other pseuds looking for a "neuroscientific explanation" to settle their turf war; from journalists, eager to fill pages with brightly coloured pictures and grabby headlines; and from academics chasing after publications.

And it isn't just the brain boffins who are cutting corners and making improbable exaggerations. Neuroscience evangelist journalist Jonah Lehrer resigned from The New Yorker magazine and later parted ways with WiReD last year after he admitted making up quotes that he had attributed to Bob Dylan. ®
