The role of UBQLN2 in disease: a protocol for a systematic review and meta- analysis.

Bethany Waddington1,2, Dr Jenna Gregory 2,3, Dr George Gorrie4 and Dr Thimo Kurz1,2.

1Molecular, Cell and Systems Biology, University of Glasgow, Davidson Building, Glasgow, G12 8QQ, UK 2 Euan MacDonald Centre for Motor Neurone Disease Research, University of Edinburgh, Chancellor’s Building, Edinburgh, EH16 4SB, UK. 3 Centre for Clinical Brain Sciences, University of Edinburgh, Chancellor’s Building, Edinburgh, EH16 4SB, UK. 4 Department of Neurology, Institute of Neurological Sciences, Queen Elizabeth University Hospital, Glasgow, UK.

*Correspondence to [email protected]

Keywords UBQLN2, disease, systematic review, meta-analysis

1. INTRODUCTION

There are 5 Ubiquilin within the family in humans, named UBQLN 1-4 and UBQLNL. Whilst UBQLN3 and UBQLNL are restricted to the testis, the remaining members are widely expressed (Marı´n, 2014), with UBQLN2 most prominently expressed in the brain and nervous system, spleen, heart, liver and pancreas, amongst other tissues. (Zhang and Saunders, 2011). UBQLN2 is composed of 3 key domains: The Associated Domain (UBA), the central domain (STI Like), and the Ubiquitin Like Domain (UBL) (Walters et al., 2004). Within the central domain lies the unique PxxP motif, a region which contains nearly all the disease-causing UBQLN2 mutations, but with unknown function in the activity of UBQLN2 (Deng et al., 2011). UBQLN2 has been found to contribute to numerous neurodegenerative diseases and accumulations (Deng et al., 2011; Hjerpe et al., 2016; Williams et al., 2012; Zeng et al., 2015). Of recent interest is the role of UBQLN2 mutations in the development of Amyotrophic Lateral Sclerosis (Deng et al., 2011).

Amyotrophic Lateral Sclerosis (ALS) is a fatal neurodegenerative disease characterised by the loss of motor neurones. A major characteristic of ALS and other forms of neurodegenerative diseases is the presence of aggregates in the affected areas (Al-Chalabi et al., 2012; Blokhuis et al., 2013). However, the nature and content of these aggregates can vary between individuals, creating problems discerning between causal and coincidental pathology. Nonetheless, this inability to efficiently clear mutated, damaged or aggregated proteins points to a dysfunction in the homeostatic proteolytic systems within the motor neurones.

Protein degradation is crucial in post-mitotic cells, such as neurones; without proliferation the toxic proteins or aggregations will not be diluted out (Hara et al., 2006; Komatsu et al., 2006). Therefore, in order to ensure efficient progression through the system, numerous proteins and complexes are formed and utilised. Of interest in this case is Ubquillin-2 (UBQLN2), a protein known to cause X-linked juvenile- and adult-onset ALS (Deng et al., 2011). Of note is the presence of UBQLN2 positive aggregations in non-mutant-UBQLN2 patients; ALS patients with no genetic mutation in UBQLN2 still present with aggregations positive for UBQLN2 (Deng et al., 2011; Williams et al., 2012). This reaffirms the belief that UQBLN2 activity is central to the cellular homeostasis and protein degradation within motor neurones. The complexity of the formation and clearance of these aggregates is yet to be understood, although much evidence points to an impairment in protein degradation pathways as human spinal cord autopsy of UBQLN2 patients found skein-like inclusions containing UBQLN2, Ubiquitin, p62, TDP-43, Fus and OPTN (Deng et al., 2011; Williams et al., 2012). Together, this data provides the justification that further research is needed into the mechanistic actions of UBQLN2, and its impact in the development of ALS.

As no other systematic review and meta-analysis has been carried out investigating UBQLN2, this review aims to collate the existing literature regarding UBLQN2 in all diseases. If appropriate, quantitative analysis will be carried out using studies of the same pathway. Gathering a broad range of data allows the possibility of grouping, and potentially analysing if appropriate, data sets based on the disease implicated, model used, or pathways involved. In using this primarily wide spanning, qualitative approach, techniques, methodologies and results could be extrapolated from the better- established research, to provide suggestions and tools for moving forward in the elusive fields of neurodegenerative disease and ALS. 2. APPROACH

A systematic review will be performed to assess the contribution of UBQLN2 mutations to the development, progression and severity of disease in both preclinical models and clinical cases. The search strategy will include clinical cases and human post-mortem, animal models and cellular models providing the model and mutation is clearly stated. Individual meta-analyses will be consequently performed for the outcomes measured and further sub-group analysis carried out based on their suggested cause of phenotype, if appropriate. 3. OBJECTIVES

The overarching aim of this project is to elucidate the role of UBQLN2 mutations to the development of disease in pre-clinical models and clinical cases and provide a summary of the current data in the field. This summary will comprise of the methodology, including the mutations assessed and how these models or mutations were studied or created (Overexpression models for instance); and interventions in the treatment of disease caused by mutations in UBQLN2, including drug interventions, siRNA, or lentiviral rescue for instance. This overarching qualitative summary will be used to generate tools such as a frequency distribution model summarising the models and mutations used, and their phenotype. A more quantitative analysis will be performed retrospectively using standardised mean differences if appropriate.

Specifically, we will gather insight into the contribution of UBQLN2 mutations to the development and severity of disease across all 3 platforms of research (Human, Animal and Cellular), where human denotes all clinical and post-mortem research, animal as whole animal or post-mortem/cellular subgroups and cellular as immortalised cell lines, including hiPSC work. In human cases, the primary outcome measure will be survival, with duration, age of onset, cognition, locomotion, behavioural, biochemical and anatomical as secondary objectives, where appropriate. Where animal models are concerned, again the primary objective will be survival, followed by duration, age of onset, cognition, locomotion, behavioural, biochemical and anatomical as secondary outcomes. Dependant on the experiments carried out, the primary outcome from the cellular models will be survival, followed by biochemical or electrophysiological as the secondary outcomes, but the cell line used must be clearly stated. The secondary aims of the pre-clinical models will be pooled together, whilst the primary outcome of mortality kept separate. From the results gathered a final set of outcomes will be assessed, to see whether any common suggestions of binding partners, modifiers or indicators of stress cause or contribute to the phenotype observed.

4. METHODS

4.1 Search Methods

Sources: databases: 1. PubMed, 2. MEDLINE and 3. EMBASE.

Search Date:

There will be no publication date or language restrictions.

4.2 Search strategy

Search terms:

PubMed:

(“UBQLN2” or “UBQLN” or “Ubiquilin-2” or “Ubuqillin 2” or “Ubquilin2” or “Ubiquilin” or “PLIC2” or “PLIC-2” or “DSK2” or “DSK-2” or “ALS15”) NOT (“UBQLN1” or “UBQLN3” or “UBQLN4” or “Ubiquilin- 1” or “Ubiquilin-3” or “Ubiquilin-4”or “Ubiquilin 1” or “Ubiquilin 3” or “Ubiquilin 4”).

MEDLINE:

(UBQLN2 or UBQLN or Ubiquilin-2 or Ubiquilin 2 or Ubiquilin2 or Ubiquilin or PLIC2 or PLIC-2 or DSK2 or DSK-2 or ALS15) not (UBQLN1 or UBQLN3 or UBQLN4 or Ubiquilin-1 or Ubiquilin-3 or Ubiquilin-4 or Ubiquilin 1 or Ubiquilin 3 or Ubiquilin 4).

EMBASE:

(UBQLN2 or UBQLN or Ubiquilin-2 or Ubiquilin 2 or Ubiquilin2 or Ubiquilin or PLIC2 or PLIC-2 or DSK2 or DSK-2 or ALS15) not (UBQLN1 or UBQLN3 or UBQLN4 or Ubiquilin-1 or Ubiquilin-3 or Ubiquilin-4 or Ubiquilin 1 or Ubiquilin 3 or Ubiquilin 4).

4.3 Screening

The Systematic Review Facility online screening tool (app.syrf.org.uk) will be used by two independent authors in order to screen the title and abstract of each paper identified from our predetermined criteria, with regards to the inclusion and exclusion criteria. Duplicates will be removed during screening, but reviews kept to check for additional studies. A third, independent reviewer will assess 10% of the selected studies to assess inter-observer variability and bias and concordance will be set at <0.66. A quality score will be awarded, and the data extracted as outlined below.

4.4 Eligibility

4.4.1 Inclusion Criteria

All models or cases of disease where mutations in UBQLN2 or UBQLN were the disease-causing mutation, or where levels of wild-type UBQLN2 or UBQLN were manipulated. Types of models included are: Immortal cell, e/iPS derived cell lines, primary cell cultures, whole animal models, post- mortem tissue (animal and human), clinical cases; Endogenous and exogenous expression; all known disease-causing patient mutations will be accepted in clinical cases or as Knock-In mutations, alongside UBQLN2 and UBQLN amplification or silencing in animal and cellular models.

4.4.2 Exclusion Criteria

No data No controls or inappropriate control Case reports Letters and comments Reviews Any mutations/knock-ins or knock-outs of UBQLN 1,3 or 4 specifically.

4.5 Quality Score Checklist

CAMRADES study checklist, adapted as below. One point will be awarded for each of the seven points observed. The median number of scored items will be taken, and the interquartile range calculated.

1. Peer reviewed 2. Blinded assessment of outcomes 3. Sample size calculation 4. Appropriate controls 5. Appropriate statistics 6. Statement potential conflict of interest 7. Cohort definition clearly stated

4.6 Data Extraction

The nature of the data extracted will vary dependent on the type of study. Thus, the outcome measures outlined in 3.2 should be applied to each extraction point accordingly.

• Citation information (Author and year of publication) • Model Status (Cell, Animal, Human) and sub-group (cell line, animal or brain region etc) • Disease status (ALS, FTD, HD, PD, Other) • Mutation status • Source (exogenous vs endogenous) • Modification (Up-regulation, down-regulation or physiological expression level) • Pathway Involved (Autophagy, Stress Granule, UPS, LLPS, Protein Trafficking, Other) • Description of Data (Survival, Duration, Age Onset, Locomotion, Behavioural, Cognition, Biochemical, Anatomical, Electrophysiological) • Value of comparison for data (Days/moths survival for example) • Individual comparisons identified where the outcome is compared with that of a suitable control. • Mean outcome • Sample size disease: control and standard deviation (where control groups are either wild- type UBQLN2/UBQLN or rescued mutation or isogenic pair) • Most conservative n number used if n-number varies (eg, if n=6-12, then a n=6 will be recorded) • Direction will be added to data which does not conform to the “bigger is better” scenario, denoted by -1.

4.7 Statistical Analysis

Given the heterogeneity of disease models assessed, effect sizes for outcome measures will be expressed as standardised mean differences (SMD) to enable scale-relevant inter-study comparisons. SMDs will be compared using Hedges’ g-statistic, to account for bias from small sample sizes, using a random effects model. Survival will be assessed using odds ratios as described previously (Vesterinen et al., 2014). SMDs and odds ratios will be included on separate Forest plots with 95% confidence intervals. Heterogeneity will be assessed for all outcome measures using I2 values, a funnel plot and Egger’s regression test will be used to assess for evidence of publication bias. Differences between groups with respect to non-quantitative data and semi-quantitative quality scoring will be explored by plotting frequency distributions and where appropriate by partitioning heterogeneity followed by plotting grouped data and analysis by 2-way ANOVA. All data (quantitative and qualitative) will be summarised in tables and provided as a freely available resource for the research field.

Conflicts of Interest

No conflicts of interest to declare

References

Al-Chalabi A, Jones A, Troakes C, King A, Al-Sarraj S, vanden Berg LH. 2012. The genetics and neuropathology of amyotrophic lateral sclerosis. Acta Neuropathol 124:339–352.

Blokhuis AM, Groen EJN, Koppers M, van den Berg LH, Pasterkamp RJ. 2013. Protein aggregation in amyotrophic lateral sclerosis. Acta Neuropathol. 125:777–794

Deng HX, Chen W, Hong S-T et al. 2011. Mutations in UBQLN2 cause dominant X-linked juvenile and adult-onset ALS and ALS/dementia. Nature 477:211–215.

Hara, T, Nakamura, K, Matsui, M, Yamamoto, A, Nakahara, Y, Suzuki-Migishima, R, Yokoyama, M, Mishima, K, Saito, I, Okano, H & Mizushima, N. 2006. Suppression of basal autophagy in neural cells causes neurodegenerative disease in mice. Nature, 441, 885-889.

Hjerpe R, Bett JS, Keuss MJ, Solovyova A, McWilliams TG, Johnson C, Sahu I, Varghese J, Wood N, Wightman M, et al. 2016. UBQLN2 Mediates Autophagy-Independent Protein Aggregate Clearance by the Proteasome. Cell. 2016 Aug 11;166(4):935-949

Komatsu, M, Waguri, S, Chiba, T, Murata, S, Iwata, J, Tanida, I, Ueno, T, Koike, M, Uchiyama, Y, Kominami, E & Tanaka, K. 2006. Loss of autophagy in the central nervous system causes neurodegeneration in mice. Nature, 441, 880-884.

Marı´n, I. 2014. The ubiquilin gene family: evolutionary patterns and functional insights. BMC Evol. Biol. 14, 63.

Vesterinen HM, Sena ES, Egan KJ, Hirst TC, Churolov L, Currie GL, Antonic A, Howells DW, Macleod MR. 2014. Meta-analysis of Data From Animal Studies: A Practical Guide. J Neurosci Methods 15;221:92-102.

Walters KJ, Goh AM, Wang Q, Wagner G, Howley PM. 2004. Ubiquitin family proteins and their relationship to the proteasome: a structural perspective. Biochim Biophys Acta 1695:73–87.

Williams KL, Warraich ST, Yang S et al. 2012. UBQLN2/ubiquilin 2 mutation and pathology in familial amyotrophic lateral sclerosis. Neurobiol Aging 33:2527 e2523-2510.

Zeng L, Wang B, Merillat SA, Minakawa EN, Perkins MD, Ramani B, Tallasken-Greene SJ, Carmo Costa M, Albin RL, Paulson HL. 2015. Differential recruitment of UBQLN2 to nuclear inclusions in the polyglutamine diseases HD and SCA3. Neurobiol Dis 82:281-288.

Zhang C, Saunders AJ. 2009. An Emerging Role for Ubiquilin 1 in Regulating Protein Quality Control System and in Disease Pathogenesis. Discov Med 8(40):18-22.