5,000 Down, 95,000 to Go: U.K. Biobank Releases First Brain-Imaging Data

Inside a nondescript building just south of Manchester, England, a new person squeezes into an MRI tube every 36 minutes. This machine subserves a single purpose: to conduct the largest imaging study in human history. The brain-scanning portion of the U.K. Biobank epidemiological study aims to image 100,000 healthy people within five years. The goal is to uncover early warning signs of neurological and other diseases. In the September 19 Nature Neuroscience, the researchers, led by Stephen Smith at the University of Oxford in England, published data from the first 5,000 volunteers scanned.

This initial release appeared to be a practice run at the formidable task of distilling massive amounts of complex imaging data into digestible findings. The researchers made connections between the structure and function of different brain regions on the one hand, and a myriad of health and lifestyle measures for each participant that are also being recorded on the other. While this initial analysis only looked for baseline correlations, the real meat of the project will come later, when people in the cohort start to develop diseases such as AD. Then the researchers will re-examine the imaging and health data to dig for early harbingers of disease. Ultimately, other researchers who tap this open-access data in the future can analyze it however they choose, but the present paper is a first stab at parsing this ever-growing data set.

“The ambition to generate the quantity of imaging data described in this paper is simply phenomenal, and this description of the process and results from the first 5,000 scans marks a step change in imaging sciences,” commented Simon Lovestone, also from the University of Oxford. While not involved with the current paper, Lovestone plans to utilize U.K. Biobank data. “Given that the U.K. Biobank data, including the imaging data reported here, is readily available within the spirit of open access, its uses will be limited only by the scientific imagination of the research community,” he wrote. Other AD scientists note that it is also limited by not testing for known molecular pathology markers, which could prove invaluable later on for determining when AD started, in whom, and why.

U.K. Biobank is a broad prospective epidemiological study that aims to uncover connections between various diseases and health and lifestyle factors that precede them. Researchers are collecting extensive phenotypic data, using exhaustive health and lifestyle questionnaires, whole body and cardiac imaging, blood samples, genetic testing, and medical records.

Its scope is massive. Half a million participants, aged 40-69 years, will be tracked for the emergence of a variety of health outcomes, including neurodegenerative disease. Of these volunteers, 100,000 will also undergo structural, functional, and diffusion MRI of the brain (described in the current study), whole body and heart imaging also by MRI, low-dose X-rays of bone and joints, and ultrasound of the carotid arteries. The researchers packed as many different brain imaging modalities as possible into a 36-minute window to allow them to scan 18 participants per day, seven days a week, and complete the imaging phase by 2022.

Notably, participants will not undergo PET scans of any kind, an omission some neurodegenerative disease researchers lament. Because PET scans can capture Aβ and tau pathology, which precede structural changes in the brain and the appearance of cognitive symptoms, their usefulness in studying presymptomatic Alzheimer’s and other tauopathies is matched only by CSF analysis. The U.K. Biobank will not collect CSF either.

While this first release covers 5,000 participants, first author Karla Miller told Alzforum that as of this writing, more than 11,000 people have been scanned in Manchester, with two more locations coming online soon. Costs for the brain-imaging portion of the study—funded by the U.K. Medical Research Council and Wellcome Trust—are projected to reach £40 million, Miller said.

What does the first batch of data show? Taken together, different types of structural MRI revealed the basic shape and size of different parts of the brain, the appearance of white-matter lesions, and deposits of iron, which are associated with aging and neurodegeneration. Diffusion MRI revealed the integrity of microstructures in the brain, including tracts of white matter that connect different regions. Finally, neural activity and connections between different brain regions were assessed using functional MRI, conducted as participants rested or were immersed in a particular task.

In an attempt to make biological sense out of this sprawling mass of data, Miller and colleagues drew all the imaging data down to 2,501 image-derived phenotypes (IDPs). Each IDP represents a single parameter, such as the volume of a specific brain region, the connection strength between two parts of the brain, or the level of neural activity that flares up when a participant completes a certain task. The researchers then looked for associations between these IDPs and other health data stored in the Biobank. The researchers divided these 1,100 health data measures into 11 categories, which included lifestyle factors such as exercise and diet, physical attributes such as body-mass index and bone density, and scores on cognitive tests.

Divvying Up Deep Data. The strength of thousands of connections (middle) between different brain networks (left) could be correlated to various health-related phenotypes (right). [Image courtesy of Miller et al., Nature Neuroscience, 2016.]

After adjusting for age, sex, and head size, the researchers found strong correlations between various IDPs and other measures. For example, performance on the digit-symbol substitution test, which wanes in people with dementia, correlated with thalamic volume as well as the integrity of three white-matter tracts: the corona radiata, the superior thalamic radiation, and the posterior fornix. Scores on a reaction-time test correlated with the size of the left putamen, while fluid intelligence scores correlated with the strength of neural activity when participants performed a simple shape-matching task. Neurodegeneration in the amygdala and hippocampus—as inferred from elevated iron deposition in those structures—correlated with elevated body-mass index. This latter association confirmed what previously had been reported in the Austrian Stroke Prevention Study, suggesting the U.K. Biobank data set can corroborate previous findings (see Pirpamer et al., 2016).

What do correlations like these really mean? Miller said it’s impossible to know at this point, and that researchers should use the data to form hypotheses for testing, rather than drawing causal relationships from this observational study. As participants in the cohort develop AD, PD, FTD, or ALS, researchers can look for further associations between these diseases and various IDPs, and then plan further experiments to explain the relationships and identify risk/protective factors for diseases.

Ultimately, Miller said that similarly to any single nucleotide polymorphism in a genome-wide association study, no single measure in the U.K. Biobank will likely hold dramatic sway over disease risk. Rather, composites of multiple measures may emerge as robust risk factors for disease. To get a sense of what these composite measures could be, the researchers analyzed the imaging and other data to uncover “modes” of common characteristics among different people in the cohort. Each of the nine modes that emerged represented a cluster of IDPs and other health factors that significantly covaried among certain participants. In six of the modes, age was one such covariate. “Each of these [six] modes reflects a different aspect of the aging process,” Miller said. This underscores just how heterogeneous the aging process is, she added. That the researchers pulled out nine modes is a testament to the large size and depth of the data set. A similar analysis of data from the Human Connectome Project, which correlated brain imaging with 280 lifestyle and demographic variables in 461 healthy adults, only uncovered one mode (see Smith et al., 2015).

The modes brought out some curious correlations. For example, mode 7 linked low bone density and poor cognitive scores to multiple brain structure and diffusion MRI measures, including total brain volume. These relationships evoke previous findings linking weakening bones to cognitive decline and AD (see Yaffe et al., 1999Tan et al., 2005). In mode 9, lifestyle factors such as eating more cheese or spending less time outdoors in the summer or winter correlated with cognitive measures such as fluid intelligence, as well as with a bevy of functional and structural MRI measures. Assigning meaning to these associations is tricky, Miller said, as some factors may simply be a proxy for socioeconomic status or education level. For example, people who eat more cheese may have a higher economic status, while spending time outdoors may reflect occupations that include heavy physical labor, which was also negatively associated with cognitive and imaging measures in this mode.

The researchers will monitor incident health outcomes, then see how those relate to the baseline brain imaging and non-imaging measures. While researchers will have indefinite access to participants’ medical records through the National Health Service, most participants will not be required to undergo regularly scheduled checkups or tests beyond the baseline measures. Rather, the occurrence of future health problems (such as dementia) will only be recorded when patients come to the doctor on their own volition, Miller said. Based on the demographics of the Biobank participants, the researchers estimated that by the time imaging is completed in 2022, around 1,800 people in the imaging cohort will show symptoms of AD, and 1,200 will develop PD. Five years later, those numbers are projected to rise to 6,000 and 2,800 for AD and PD, respectively, while 50-100 may have motor neuron disease. The large size of the cohort will facilitate discovery of presymptomatic markers for these diseases, particularly for rare ones such as ALS, Miller believes. The Biobank plans to scan 10,000 participants at least one more time, but Miller said researchers have yet to decide when to initiate this longitudinal substudy component.

U.K. Biobank may be the largest brain-imaging game in town, but other European studies play in the same league. The German National Cohort (GNC) has imaged the brains of 10,000 of its target of 30,000 participants, according to Fabian Bamberg of the University of Tuebingen, who heads the MRI portion of that study (see Bamberg et al., 2015). In addition to collection of health data, whole-body and cardiac imaging, the GNC’s brain-imaging protocol includes MRI measurements similar to the U.K. Biobank’s, with the exclusion of diffusion MRI and task-related functional MRI. Bamberg told Alzforum that researchers from the two projects are working together closely to harmonize the measurements they have in common. Similar in scope to the GNC study, the Rhineland Study is in the process of imaging 30,000 participants from the Bonn area, aged 30 and over, with plans to track their health outcomes for the remainder of their lifespans (see Breteler et al., 2014). The Maastricht Study, which so far has run more than 5,000 of its projected 10,000 participants through the MRI scanner, focuses specifically on factors associated with Type 2 diabetes and cardiovascular disease (see Schram et al., 2014). Authors of the U.K. Biobank study hope to synchronize their data with those from these other imaging studies in the future.

“U.K. Biobank will be a tremendous resource for studies on neurodegeneration, both for data discovery or for replication,” wrote Betty Tijms and Pieter-Jelle Visser of VU Medical Center in Amsterdam in a joint comment to Alzforum. “Because data collection is not focused on a single disease, it will be possible to find common mechanisms across different brain disorders.” However, they also note the shortcomings of the data set in terms of neurodegenerative disease research, such as the lack of PET scans and absence of participants older than 80.

While Miller considers U.K. Biobank’s broad scope a strength, other researchers criticized the £40 million price tag and its lack of focus on specific diseases. “This is a lot of normative data but no findings,” commented John Hardy of University College London. “Obtaining longitudinal data, especially in the context of PET imaging of amyloid and tau, would really make this a valuable resource.”

While the inclusion of PET imaging and/or CSF biomarker data in a cohort of this size would be a boon to neurodegeneration research, Miller explained that MRI is noninvasive and captures a broad range of information about the brain’s structure, function, and connectivity, while PET scans are expensive, trickier to execute, and expose participants to more ionizing radiation than X-rays. Bamberg gave similar reasons for excluding PET scans from the GNC study. Nick Fox of University College London agreed that PET scans or CSF biomarker studies would have added cost. “Essentially the tradeoff for U.K. Biobank is numbers versus depth—they have the power of very large numbers but that means that expensive, invasive, or lengthy assessments may not be justifiable,” Fox wrote to Alzforum.

Smaller imaging studies that do collect PET scans and CSF biomarkers are focused specifically on neurodegenerative disease, such as the Alzheimer’s Neuroimaging Initiative (ADNI), the Australian Imaging, Biomarker, and Lifestyle study, the Harvard Aging Brain Study, or the Mayo Clinic Study of Aging. Michael Weiner of the University of California, San Francisco, who heads ADNI, told Alzforum that despite its lack of AD-specific measures, the broad-based U.K. Biobank study is exciting for the neurodegenerative disease field.

However, he added that with large data sets come sizable hurdles in data interpretation. “In my view, one of the biggest challenges to medical science is to find more robust ways to reduce false positive and false negative results from analysis of large data sets,” Weiner wrote. “Nevertheless, this recent report from the U.K. Biobank is a ‘tour de force’ and all of us in the field look forward to seeing more exciting discoveries coming from this project.”

Primary Reference:

Miller KL, Alfaro-Almagro F, Bangerter NK, Thomas DL, Yacoub E, Xu J, Bartsch AJ, Jbabdi S, Sotiropoulos SN, Andersson JL, Griffanti L, Douaud G, Okell TW, Weale P, Dragonu I, Garratt S, Hudson S, Collins R, Jenkinson M, Matthews PM, Smith SM. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat Neurosci. 2016 Sep 19. [PubMed].

To view commentaries, primary articles and linked stories, go to the original posting on Alzforum.org here.

Copyright 1996–2016 Biomedical Research Forum, LLC. All Rights Reserved.

topic-biomarkers topic-clinical
Share this: