CHARACTERIZATION OF EARLY LIFE EXPOSURE TO ENVIRONMENTAL CHEMICALS AND ITS IMPACTS ON HEALTH School of Science and Technology Örebro University Sweden

Lisanna Sinisalu Supervisors from Örebro University: Tuulia Hyötyläinen, Leo Yeung Examiner: Ingrid Ericson Jogsten Spring 2020

List of abbreviations

12-epiCA 3α, 7α, 12β-trihydroxy-5β-cholan-24-oic acid 12-oxo-LCA 12-oxolithocholic acid 18 18O2-PFHxS Perfluoro-1-hexane [ O2] sulfonic acid 4:2FTSA 4:2 fluorotelomer sulfonic acid 6:2 Cl-PFESA 6:2 chlorinated polyfluoroalkyl ether sulfonic acid 6:2FTSA 6:2 fluorotelomer sulfonic acid 7-oxo-DCA 7-oxodeoxycholic acid 7-oxo-HDCA 7-oxohyodeoxycholic acid 8:2 Cl-PFESA 8:2 chlorinated polyfluoroalkyl ether sulfonic acid 8:2FTSA 8:2 fluorotelomer sulfonic acid BA CA CA-d4 Deuterated cholic acid CD Celiac disease CDCA CDCA-d4 Deuterated chenodeoxycholic acid CE Cholesterol ester Cer Ceramide CYP7A1 Cholesterol-7α-hydroxylase DCA DCA-d4 Deuterated deoxycholic acid DG Diglyceride DHCA 3α,7α-dihydroxycholestanoic acid ECF Electrochemical fluorination FA Fatty acid FFA Free fatty acid FOSA Perfluorooctanesulfonamide GCA Glycocholic acid GCA-d4 Deuterated glycocholic acid GCDCA Glycochenodeoxycholic acid GCDCA-d4 Deuterated glycochenodeoxycholic acid GDCA Glycodeoxycholic acid GDHCA Glycodehydrocholic acid GHCA Glycohyocholic acid GHDCA Glycohyodeoxycholic acid GL Glycerolipid GLCA Glycolithocholic acid GLCA-d4 Deuterated glycolithocholic acid GP Glycerophospholipid GUDCA Glycoursodeoxycholic acid GUDCA-d4 Deuterated glycoursodeoxycholic acid HCA HDCA HPLC High-performance liquid chromatography ISTD Internal standard LCA

1

LCA-d4 Deuterated lithocholic acid LC-qTOF Liquid chromatography quadrupole time-of-flight LPC Lysophosphatidylcholine LPE Lysophosphatidylethanolamine L-PFBS Linear-perfluorobutane sulfonic acid L-PFDoS Linear -perfluorododecane sulfonic acid L-PFDS Linear-perfluorodecane sulfonic acid L-PFHpS Linear-perfluoroheptane sulfonic acid L-PFHxS Linear-perfluorohexane sulfonic acid L-PFNS Linear-perfluorononanoic sulfonic acid L-PFOS Linear-perfluorooctane sulfonic acid L-PFPeS Linear-perfluoropentane sulfonic acid 13 M2-6:2FTSA 1H, 1H, 2H, 2H-perfluoro-1-[1,2- C2] octane sulfonic acid 13 M2-8:2FTSA 1H, 1H, 2H, 2H-perfluoro-1-[1,2- C2] decane sulfonic acid 13 M2PFDA Perfluoro-n-[1,2- C2] decanoic acid 13 M2PFDoA Perfluoro-n-[1,2- C2] dodecanoic acid 13 M2PFHxA Perfluoro-n-[1,2- C2] hexanoic acid 13 M2PFHxDA Perfluoro-n-[1,2- C2] hexadecanoic acid 13 M2PFTeDA Perfluoro-n-[1,2- C2] tetradecanoic acid 13 M2PFUndA Perfluoro-n-[1,2- C2] undecanoic acid 13 M3PFBS Sodium perfluoro-1-[2,3,4- C3] butane sulfonic acid 13 M3PFPeA Perfluoro-n-[1,2,3- C3] pentanoic acid 13 M4PFBA Perfluoro-n-[1,2,3,4- C4] butanoic acid 13 M4PFHpA Perfluoro-n-[1,2,3,4- C4] heptanoic acid 13 M4PFOA Perfluoro-n-[ C8] octanoic acid 13 M4PFOS Sodium perfluoro-[ C8] octane sulfonic acid 13 M5PFNA Perfluoro-n-[ C9] nonanoic acid 13 M8FOSA-M Perfluoro-1-[ C8] octanesulfonamide NAFLD Non-alcoholic fatty-liver disease PC Phosphatidylcholine PC ether Phosphatidylcholine ether PE Phosphatidylethanolamine PE ether Phosphatidylethanolamine ether PFAS Per- and polyfluoroalkyl substances PFBA Perfluorobutanoic acid PFBS Perfluorobutane sulfonic acid PFDA Perfluorodecanoic acid PFDoDA Perfluorododecanoic acid PFDoDS Perfluorododecane sulfonic acid PFDS Perfluorodecane sulfonic acid PFHpA Perfluoroheptanoic acid PFHpS Perfluoroheptane sulfonic acid PFHxA Perfluorohexanoic acid PFHxDA Perfluorohexadecanoic acid PFHxS Perfluorohexane sulfonic acid PFNA Perfluorononanoic acid PFNS Perfluorononane sulfonic acid PFOA Perfluorooctanoic acid

2

PFOS Perfluorooctane sulfonic acid PFPeA Perfluoropentanoic acid PFPeS Perfluoropentan sulfonic acid PFTDA Perfluorotetradecanoic acid PFTeDA Perfluorotetradecanoic acid PFTriDA Perfluorotridecanoic acid PFUnDA Perfluoroundecanoic acid PI Phosphatidylinositols POPs Persistent organic pollutants PS Phosphatidyleserine RSD Relative standard deviation SM Sphingomyelins SP Sphingolipid ST Sterol lipid T1D Type 1 diabetes T2D Type 2 diabetes TCA TCA-d4 Deuterated taurocholic acid TCDCA Taurochenodeoxycholic acid TCDCA-d9 Deuterated taurochenodeoxycholic acid TDCA Taurodeoxycholic acid TDHCA Taurohyodeoxycholic acid TG Triglyceride TG mufa Triglyceride Monounsaturated fatty acids TG pufa Triglyceride Polyunsaturated fatty acids TG sfa Triglyceride Saturated fatty acids THCA Taurodeoxycholic acid THDCA Taurohyodeoxycholic acid TLCA Taurolithocholic acid TLCA-d4 Deuterated taurolithocholic acid TUDCA Tauroursodeoxycholic acid TUDCA-d4 Deuterated tauroursodeoxycholic acid TαMCA α-tauromuricholic acid TβMCA β-tauromuricholic acid TωMCA ω-tauromuricholic acid UDCA UDCA-d4 Deuterated ursodeoxycholic acid UHPLC- qTOF/MS Ultra-performance liquid chromatography quadrupole time-of-flight mass spectrometry UNEP United Nations Environment Programme αMCA α- βMCA β-murocholic acid ωMCA ω-murocholic acid

3

Abstract

The exposome is a complex study that includes both endogenous and exogenous markers related to different diseases as part of the research. Connecting both endogenous and exogenous markers can help explain many associations between environmental chemicals and metabolic profiles, which can be stressors for causing different diseases. This study investigates the impact of per- and polyfluorinated alkyl substances (PFAS) exposure during pregnancy on lipid profiles and measures the concentrations in blood and also in infants cord blood at birth. In addition, we are interested in lipids and bile acids (BA) levels in blood and if these three groups of analytes of interest are associated with each other. The infant blood-based samples come from a Chinese cohort and the maternal samples are from a Finnish cohort. The samples were extracted for PFAS and BA and for lipid analysis. Statistical analysis was performed for both cohorts. The main interest was to investigate the interactions between the identified compounds. The results also include concentrations to show the variety of values measured. Type 2 diabetes (T2D) and non-alcoholic fatty-liver disease (NAFLD) are usually not acquired early in life. Our results show a strong positive correlation between PFASs and triglycerides (TGs), as well as strong correlation between PFASs and conjugated primary and secondary BA particularly in cord blood samples, which can indicate higher risk of developing T2D or NAFLD. In maternal samples, the interactions between PFASs, BAs and lipids were much lower and showed less significant correlation. This indicates that the impact of PFAS exposure is much stronger during fetal development than in adult life. Acknowledgement

The author of this study is thankful for support and help from Professor Tuulia Hyötyläinen and Dr. Leo Yeung. In addition, the author would like to thank Dr. Samira Salihovic and laboratory technician Daniel Duberg from Örebro University for guidance and help throughout the project. Professor Jia-Yin Dai and his students and staff from Chinese Academy of Sciences, the Institute of Zoology have given practical help and are being thanked for the support and kindness. The author is especially thankful for Jinghua Wang, from Chinese Academy of Sciences, for lots of practical help and for sharing a part of her research with this study. In addition, to FinnBrain cohort project for the cooperation. The author is grateful for Mobily grant provided by The Swedish Foundation for International Cooperation in Research and Higher Education (STINT) for the Joint China Sweden Mobility Programme 2018 (Dnr: CH2018-7805).

4

Table of content

List of abbreviations ...... 1 Abstract ...... 4 Acknowledgement ...... 4 Appendix list ...... 7 1. Introduction ...... 8 1.1 The exposome ...... 8 1.2 Per- and polyfluoroalkyl substances (PFASs) ...... 8 1.3 Bile acids ...... 9 1.4 Lipidomics ...... 10 1.5 Impacts on health ...... 10 The aim ...... 11 2. Materials and Methods ...... 12 2.1 Materials ...... 12 Chemicals ...... 12 China ...... 12 Sweden ...... 12 2.2 Methods...... 13

2.2.1 One-phase extraction method MMC (MeOH/MTBE/CHCl3) ...... 13 2.2.2 Matyash extraction method (MTBE/MeOH) ...... 14 2.2.3 Bile acids and PFASs analysis ...... 14 2.2.4 Lipid analysis ...... 14 2.2.5 Quality control ...... 14 3. Experimental part ...... 15 3.1 Samples from China ...... 15 3.1.1 Samples ...... 15 3.1.2 Sample preparation for the Chinese samples ...... 15 3.2 Samples from Finland ...... 15 3.2.1 Samples ...... 15 3.2.2 Sample preparation for Turku samples ...... 16 3.3 Sample analysis ...... 16 3.3.1 Lipid analysis ...... 16 3.3.2 Bile acid and PFAS analyses ...... 17 3.4 Data Pre-Processing ...... 17

5

3.5 Statistical analyses ...... 18 4. Results ...... 18 4.1 Method development ...... 18 4.2 Chinese samples ...... 19 4.2.1 Lipid profiles ...... 19 4.2.2 Bile acids ...... 20 4.2.3 The exposome ...... 21 4.2.4 Statistical analyses ...... 23 4.3 Finnish samples – FinnBrain project ...... 29 4.3.1 Lipid profiles ...... 29 4.3.2 Bile acids ...... 29 4.3.3 Exposome ...... 30 4.3.4 Statistical analyses ...... 31 5. Discussion ...... 37 Conclusions ...... 39 References ...... 40 Appendix ...... 45

6

Appendix list

1. Figure 20 – Normalisation results after log transformation and auto-scaling on Chinese blood samples in measured variables.

2. Figure 21 – Normalisation results after log transformation and auto-scaling on Chinese blood samples. Presented 50 samples.

3. Figure 22 – Normalisation results after log transformation and auto-scaling done on FinnBrain blood samples in measured variables.

4. Figure 23 - Normalisation results after log transformation and auto-scaling on FinnBrain blood samples. Presented 50 samples.

5. Table 8 – Correlation heat map between all measured and identified compounds in Chinese blood samples. Identified compounds are lipids, PFASs and BAs.

6. Table 9 – Correlation heat map between all measured and identified compounds in FinnBrain blood samples. Identified compounds are lipids, PFASs and BAs.

7. Table 10 - Lipid median concentrations with minimum and maximum values in blood samples from Wuhan and Beijing, China.

8. Table 11 – BA median concentrations with minimum and maximum values in blood samples from Wuhan and Beijing, China.

9. Table 12 – PFAS median concentrations with minimum and maximum values in blood samples from Wuhan and Beijing, China.

10. Table 13 – Non-target analysis results as median values in blood samples from Wuhan and Beijing, China, with minimum and maximum values.

11. Table 14 – Lipid median concentrations with minimum and maximum values for FinnBrain blood samples.

12. Table 15 – BA median concentrations with minimum and maximum values for all FinnBrain blood samples.

13. Table 16 – PFAS median concentrations for FinnBrain blood samples with minimum and maximum values.

14. Table 17 – Non-target analysis results as median values for all Finnbrain blood samples with minimum and maximum values.

7

1. Introduction

1.1 The exposome Human health and the quality of living is an important research topic. Significant development in the healthcare has been contributed by new discoveries in research. A large part of the medical science focuses on preventing diseases with a broad interest of looking into different possible factors, which may cause or induce several diseases. Nowadays scientists do not only focus on endogenous or exogenous stressors separately but connect these two to be able to see correlations on a larger combined profile. One of the largely growing terms for this complex study is called the exposome. The exposome consists of both environmental exposure and metabolites as biomarkers for different diseases. The exogenous compounds that can trigger negative health effects can be all different environmental pollutants. These compounds come from industries or are released and leached from objects that are used in everyday life. For example, some substances are released to the environment as water soluble compounds from factories or are emitted to the air as volatile compounds. Other compounds are leached from various materials or released to the environment in other ways. All biochemical substances in the human organism are considered as endogenous compounds. They are usually produced by the organism, which include different metabolites or also food that is ingested and is broken down to simpler compounds (Miller, 2013; Sarigiannis, 2019). 1.2 Per- and polyfluoroalkyl substances (PFASs) One specific group of environmentally relevant chemicals are the per- and polyfluoroalkyl substances (PFASs) that have been widely used for more than 60 years. Perfluoroalkyl substances are fully fluorinated surfactants with all hydrogens replaced with fluorine atoms. Polyfluoroalkyl substances are instead partially fluorinated, containing both hydrogen and fluorine atoms in the molecule. The hydrocarbon segment in polyfluoroalkyl substances is known to have a better solubility in commonly used solvents and due to that the partially fluorinated surfactants have an advantage over the fully fluorinated ones (Kissa, 2001). These substances do not occur naturally but are a diverse group of man-made chemicals. The productions of fluorinated surfactants first started in 1938 with the invention of Teflon. After that, in the 1950s, the production of PFAS chemical began to a larger scale. There are two methods mainly used to synthesize PFAS chemicals – telomerization and electrochemical fluorination (ECF). The telomerization yields only linear isomers, but ECF can produce up to 30% branched isomers in addition to linear isomers (Kissa, 2001). PFAS chemicals have been widely used for many of their unique purposes in industries and also in households. They are extremely persistent and due to having hydro- and lipophobic properties, they have been greatly used as surfactants on many materials. These substances degrade very slowly and because of that have a greater change to bioaccumulate in the nature and in living organisms and have been found everywhere around the world. They can travel long distances due to their properties and as of early 2000s, PFASs have even been found in the eastern Arctic marine species, far from the main production areas (Kissa, 2001; Tomy et al., 2004).

8

The first detection of PFAS chemicals in human blood was in the 1970s from workers in a PFAS chemical production factory. Elevated levels of perfluorooctanic acid (PFOA) were found in their urine (Ubel, Sorenson and Roach, 1980). Another fluorinated substance, perfluorooctane sulfonic acid (PFOS), was found in the general populations of the U.S. in 1998. Pooled samples from blood banks were analysed and showed significant detections of PFOS (3M, 1999). The toxicological testing has been mainly done on rats and mice to understand the absorption, distribution, metabolism, and excretion of fluorinated substances. When introduced to the organism, PFASs are well absorbed in the gastrointestinal tract. These substances are also distributed to the kidney, liver, and serum. Significant levels have also been found in lungs and heart, as well as in spleen, skin, and testis. Much lower levels from other mentioned organs have been found in the brain and in adipose tissue (DeWitt, 2015). PFASs were shown to also transfer from mother to the fetus through placenta and in addition, after birth, with the breastmilk (Croes et al., 2012). 1.3 Bile acids Bile acids (BAs) are endogenous chemicals that are actively synthesized in the liver in parenchymal cells from cholesterol molecules and are stored in the gall bladder as in the bile. They are then released to the intestinal lumen, where they help with digestion of food (Lefebvre et al., 2009). BAs are also metabolites that are part of many biochemical reactions. Not only do they help with the digestion and absorption of lipids, cholesterol, and fat-soluble vitamins but they also regulate the homeostasis of lipid and glucose metabolism (Prawitt, Caron and Staels, 2011; Haeusler et al., 2013). An average person has a pool of 2 grams of BAs in the organism that cycles around 12 times per day. The flux is regulated by the liver and the intestine by transporting ca 24 grams of BAs daily. Most of the BAs are moving in circulation and are being reabsorbed. About 5% of the circulated BAs move to duodenum and are then excreted with feces. The loss is rebalanced in the liver with new synthesized BAs (Lefebvre et al., 2009). BAs are divided into two groups - primary and secondary bile acids. Primary BAs such as cholic acid (CA) and chenodeoxycholic acid (CDCA) are directly synthesized from cholesterol in the liver. Bile acids get conjugated in the liver with taurine and . Secondary BAs are bio-transformed from unconjugated primary BAs by the gut microbiota, mainly to deoxycholic acid (DCA) and lithocholic acid (LCA). Conjugated and unconjugated BAs are both in circulation for liver uptake, where the secondary BAs are re-conjugated with taurine and glycine. These movements keep the BAs pool balanced and regulated (Di Ciaula et al., 2017). The key enzyme that catalyses the first steps of cholesterol synthesized to BAs is cholesterol- 7α-hydroxylase (CYP7A1) (Prawitt, Caron and Staels, 2011). It has been speculated that CYP7A1 may be down-regulated by PFASs, which suppresses the synthesis of BAs if high levels of PFASs are introduced to or accumulated in the human organism (Beggs et al., 2016). This can then cause increased levels of reabsorbed BAs. Especially PFOS and PFHxS have shown to decrease the levels of BA excretion. Current knowledge also suggest that BAs and

9

PFAS are being reabsorbed in a similar manner and there is a possible observed association between the levels of BAs, PFAS and lipids (Salihović et al., 2019). 1.4 Lipidomics Lipidomics is a discipline that is increasingly being used in health-related research for finding biomarkers to predict and detect different diseases. It studies lipids with analytical chemistry principles and presents the cellular lipid profiles on a wide scale. Lipids have an essential part in the human organism and have many important roles. For example in cellular functions, lipids are part of cellular membranes and they also act as vital energy depots as described in more detail in literature (Vance and Vance, 2008; Hyötyläinen and Orešič, 2015; Yang and Han, 2016). Fatty acids are usually built up with a straight alkyl chain and a carboxyl end-group. The number of carbons in the fatty acid chain varies and double bonds in the chain determine if the lipid is saturated (containing no double bonds) or unsaturated (containing one or many double bonds) (National-Research-Council, 1989). Lipids are divided into smaller classes mainly based on their structures but also because of their functions. The main lipid classes found in human plasma are sterol lipids, sphingolipids, glycerolipids and glycerophospholipids (Risé et al., 2007). 1.5 Impacts on health Previous studies suggest that many serious health problems and diseases are related to high levels of lipids in the blood. Changes in lipid profiles can already be seen in infants and small children years before the onset of any disease. For example, altered lipid profiles have been detected in children later developing type 2 diabetes (T2D) (Alderete et al., 2019) or non- alcoholic fatty-liver disease (NAFLD), both of which are indicated by having abnormal concentrations of specific lipids in the blood (Orešič, Hyötyläinen, et al., 2013). Cord blood is often used as a sample matrix when analysing blood from new-borns. This shows new-borns blood profile and also reflects directly, which environmental pollutants have been transferred from mother to child through the umbilical cord. Another group of diseases that has been linked with lipid abnormalities are autoimmune diseases such as type 1 diabetes (T1D) and celiac disease (CD). T1D is a serious autoimmune disease that has been increasing in the younger populations during the last decade. It is mostly known to be caused by destruction of insulin-secreting pancreatic beta-cells (Atkinson, Eisenbarth and Michels, 2014; Lamichhane et al., 2019). In addition, there is a strong suggestion that PFASs, especially PFOS and PFOA are tightly connected with the increased risk of developing T1D, particularly in individuals that have a genetic risk (Mcglinchey et al., 2019). Both PFOS and PFOA have been listed as persistent organic pollutants (POPs). These substances are regulated, PFOS from year 2009 and PFOA, its salts and related compounds from year 2019, by the Stockholm Convention (Templeton et al., 2019). PFAS exposure during pregnancy can increase the risk of developing T1D in the offspring (Mcglinchey et al., 2019). CD is, similarly to T1D, an autoimmune disorder and causes strong gluten intolerance (Caio et al., 2019). CD diagnoses have recently increased in the developed countries and it is suggested that the triggers might be, in addition to gluten, also environmental risk factors (Assa et al.,

10

2017). The connection between PFASs and CD has been less studied than the interaction between T1D and PFASs, but it has been hypothesized that early life PFAS exposure might support the development of CD in long term, and that the PFAS exposure is linked with changes in lipid and bile acid metabolism (Sinisalu et al., 2020). The aim The aim of this study is to investigate the impact of prenatal PFAS exposure on concentrations of BAs and lipids using both infant blood-based samples (cord blood) as well as maternal samples during pregnancy. A comparison was done between two cities in China (Beijing and Wuhan, cord blood) and Finland (Turku, maternal samples). Specific objectives: 1. Further investigate the interplay of PFASs and BAs and their association with lipid profiles; 2. Develop methods using smaller sample volumes and more efficient analysis, that can reduce the total analysis time and gives the ability to analyse many analytes of interest with one extraction step; 3. Study the concentrations of BAs and lipids in relation to prenatal exposure to PFASs in cord blood of infants collected from two cities in China; 4. Study the concentrations of BAs and lipids in relation to exposure to PFASs in blood samples of mothers during pregnancy.

11

2. Materials and Methods

Since parts of the experiments were carried out in China and Sweden, the chemicals used are given from respective sources. 2.1 Materials

Chemicals China Methanol HPLC Grade (MeOH) from Fisher chemical (Hampton, New Hampshire, the United States of America). Chloroform A.R. (CHCl3) with 0.3-1.0% ethanol (C2H5OH) from Beijing Chemical Works (Beijing, China). Reagent alcohol – methyl alcohol 90%, ethyl alcohol 4.5% and isopropyl alcohol 5% from Fisher chemical (Hampton, New Hampshire, the United States of America). Hexane HPLC Grade (C6H14) from Fisher chemical (Hampton, New Hampshire, the United States of America). Methylene chloride (CH2Cl2) from Fisher chemical (Hampton, New Hampshire, the United States of America). Methyl tert-butyl ether HPLC Grade (C5H12O) from Fisher chemical (Hampton, New Hampshire, the United States of America). Sweden Methanol LC-MS Grade 99.9% (MeOH) from Fisher chemical (Hampton, New Hampshire, the United States of America), chloroform (CHCl3) for HPLC ≥99.9% from Sigma-Aldrich (Saint Louis, Missouri, the United States of America). Contains amylenes as stabilizer. Ethanol 99.7% (C2H5OH) from Solveco AB (Rosenberg, Sweden). Sodium chloride reagent grade from Scharlau (Barcelona, Spain). Acetonitrile LC/MS Grade 99.99% from Fisher chemical (Hampton, New Hampshire, the United States of America). LC-MS Grade water (H2O) from Fisher chemical (Hampton, New Hampshire, the United States of America). Tert-butyl methyl ether (C5H12O) for HPLC Grade ≥99.8% from Sigma-Aldrich (Saint Louis, Missouri, the United States of America). N-Hexane (CH3(CH2)4CH3) from Merch (Darmstadt, Germany). Dichloromethane (CH2Cl2) from Honeywell (Charlotte, North-Carolina, the United States of America). 2-Propanol (C3H8O) LC-MS ≥99.9% from Honeywell (Charlotte, North-Carolina, the United States of America). Ammonium acetate (C2H7NO2) ≥99.0% solid form from Sigma- Aldrich (Saint Louis, Missouri, the United States of America). Formic Acid (CH2O2) from Fisher chemical (Hampton, New Hampshire, the United States of America). Quality control (QC) of pooled human blood samples from donors from Örebro University Hospital, Sweden. PFAS internal and calibration standards M8FOSA-M, M2PFHxDA, M2PFTeDA, M2- 6:2FTSA, M2-8:2FTSA, M3PFBS, M3PFPeA, M4PFHpA, M4PFBA, M2PFHxA, M4PFOA, M5PFNA, M2PFDA, M2PFUndA, M2PFDoA, 18O2-PFHxS, M4PFOS, PFBA, PFPeA, PFHxA, PFHpA, PFOA, PFNA, PFDA, PFUnDA, PFDoDA, PFTrDA, PFTeDA, PFHxDA, PFODA, L-PFBS, L-PFHxS, L-PFOS, L-PFDS, FOSA, 4:2FTSA, 6:2FTSA, 8:2FTSA, L- PFHpS, L-PFDoS, L-PFNS and L-PFPeS were purchased from Wellington Laboratories 13 18 (Guelph, Ontario, Canada). Internal standards were C-/ O2 labelled PFAS compounds. BA calibration standards were purchased from different companies. 7H-dodecafluoroheptanoic acid was purchased from ABCR (Karlsruhe, Germany). CA, CDCA, DCA, DHCA, GCA,

12

GCDCA, LCA, TCA, TCDCA, TDCA, TDHCA, THCA, THDCA, TLCA, and TUDCA were purchased from Sigma-Aldrich (Saint Louis, Missouri, the United States of America), HDCA, HCA, αMCA, βMCA, ωMCA, 7-oxo-HDCA, 7-oxo-DCA, 12-oxo-LCA, TαMCA, TβMCA, TωMCA, GDHCA, GHCA, and GHDCA were purchased from Steraloids (Newport, RI,U.S.A), GLCA and GUDCA from Calbiochem (Gibbstown, NJ, the United States of America), and GDCA and UDGA from Fluka (Buchs, Switzerland). Internal standards CA-d4, LCA-d4, UDCA-d4, CDCA-d4, DCA-d4, GCA-d4, GLCA-d4, GUDCA-d4 and GCDCA-d4 were purchased from Qmx laboratories Ltd. (Essex, UK). TCDCA-d9, TCA-d4, TUDCA-d4 and TLCA-d4 were purchased from Cambridge Isotope Laboratories, Inc. (Massachusetts, the United States of America). All PFAS and bile acids standards were prepared in methanol and were stored at 4 °C. Lipid internal standards CE (17:0) and TG (17:0/17:0/17:0) (Triheptadecanoin) were purchased from Larodan Research Grade Lipids, Inc. (Monroe, the United States of America). Ceramide (d18:1/17:0), LysoPC (17:0), PC (16:0d31-18:1), PC (17:0), PE (17:0) and SM (17:0) were purchased from Avanti Polar Lipids, Inc. (Alabama, the United States of America). Calibration standards 1,3-Dilinolein (DG(18:1/18:1)), Cholesteryl Linoleate (CE(18:1)), Cholesteryl Palmitate (CE(16:0)) and TG (18:0/18:0/18:0) were purchased from Larodan Research Grade Lipids, Inc. (Monroe, the United States of America). Cer (d18:0/18:1(9z)), Cer (d18:1/18:1(9z)), LysoPC (18:0), LysoPC (18:1), LysoPE (18:1), PC(16:0/16:0), PC (16:0- 18:1), PC (18:0/18:0), PE (16:0-18:1), SM (d18:1/18:1(9z)) and TG (16:0/16:0/16:0) were purchased from Avanti Polar Lipids, Inc. (Alabama, the United States of America). 2.2 Methods We tested two methods for extraction and compared the results with two previously validated methods for lipids, BAs and PFASs. The two methods tested were the Matyash method (MTBE) and the one-phase extraction (MMC method) (Gil et al., 2018).

The chosen method used in China was one-phase MMC (MeOH/MTBE/CHCl3) extraction method, which main focus is on using methanol/methyl tert-butyl ether/chloroform (MeOH/MTBE/CHCl3) 1.33:1:1 solution mixture for extraction. This method gave us the best recovery out of two methods tested and good quality results.

2.2.1 One-phase extraction method MMC (MeOH/MTBE/CHCl3) In total 40 µl of serum sample was added to Eppendorf tube and mixed with 270 µl of MeOH/MTBE/CHCl3 (1.33:1:1, v/v/v) solution mixture. A volume of 10 µl of BA and PFAS internal standard (200 ng/mL PFASs and 1000 ng/mL bile acids in acetonitrile) and 12 µl of lipid internal standard (25 ppm lipids in chloroform and methanol 2:1 solution) were added to the tube and vortexed for 20 seconds. The Eppendorf tubes were then incubated on a shaker at 900 rpm for one hour at room temperature. The tubes were then centrifuged at 9600 relative centrifugal force (RCF) for 10 minutes at room temperature. In total, 250 µl of the supernatant was transferred to a glass vial and the extracted sample was evaporated to dryness. After that, 200 µl of CHCl3/MeOH (2:1) solution mixture was added and the sample was divided into two parts. A volume of 50 µl was taken for lipid analysis and the remaining 150 µl was taken for BA and PFAS analysis. The sample was evaporated to dryness and 40 µl of methanol/water (40%/60%) was then added.

13

2.2.2 Matyash extraction method (MTBE/MeOH) In total, 40 µl of serum sample was added to Eppendorf tube and mixed with 100 µl of methanol, 10 µl of BA and PFAS internal standard (200 ng/mL PFASs and 1000 ng/mL bile acids in acetonitrile) and 12 µl of lipid internal standard (25 ppm lipids in chloroform and methanol 2:1 solution). An additional 335 µl of MTBE was added to the tube and vortexed for 20 seconds. The tubes were then incubated at 900 rpm for one hour at room temperature. A total volume of 85 µl of LC-grade water was added, vortexed for 10 seconds and after that the tubes were incubated for additional 10 minutes. After incubation, the sample tubes were centrifuged at 9600 RCF for 10 minutes at room temperature. In total, 300 µl of the upper phase was collected to glass vial and evaporated with nitrogen to dryness. A volume of 200 µl of CHCl3/MeOH (2:1) was then added and the sample was divided to 150 µl for BA and PFAS analysis and 50 µl for lipid analysis. The 150 µl of sample was evaporated to dryness again and after that 40 µl of methanol/water (40%/60%) solution mixture was added. 2.2.3 Bile acids and PFASs analysis An aliquot of 40 µl of serum sample was transferred to an Eppendorf tube. A volume of 10 µl of BA+PFAS internal standard (200 ng/mL PFASs and 1000 ng/mL Bile acids in acetonitrile) was added to the tube with 50 µl of acetonitrile and vortexed for 5 seconds. The sample extraction was ultrasonicated for 3 minutes and then centrifuged at 10000 RCF for 5 minutes at room temperature. In total, 80 µl of the sample was then transferred to a glass vial and evaporated to dryness. The sample was resuspended with 40 µl of methanol/water (40%/60%) solution mixture. 2.2.4 Lipid analysis A volume of 10 µl of serum sample, 10 µl of 0.9% NaCl solution and 120 µl of lipid internal standard (2.5 ppm lipids in chloroform and methanol 2:1 solution) were added to an Eppendorf tube and vortexed until milk-like colour appeared in the tube. The samples were then put into ice for 30 minutes. After that, the tubes with samples were centrifuged at 9400 RCF for 3 minutes. In total, 60 µl of CHCl3/MeOH (2:1) solution mixture was added to a new glass vial and 60 µl of the lower layer of the sample extraction in the Eppendorf tube as added to the glass vial as well. Samples were then ready for analysis. 2.2.5 Quality control All used lab equipment was cleaned with ethanol, hexane, and dichloromethane in series. Glassware was burned for 12 hours at 450°C to remove contaminants before use. QC-pooled blood samples from Örebro University medical faculty were used to assess the quality of the analysis. A volume of 10 µl of extracted samples were pooled together to use as quality control. Extraction blanks were included after every 20 samples for additional quality control. Internal standards were used for quality assurance and quantification. Calibration curves were used for quantitative measurements and quality control. One ppm of lipid standard was used as quality control throughout the instrumental analysis. Methanol blanks were used for the instrument between injections for detecting any internal contamination. Batch standard mixture solution was used to ensure the quality of sample preparation and instrumental status.

14

3. Experimental part

First step of the experimental part was to validate the extraction method used for Chinese samples in Beijing, China. The aim was to use an effective method that includes all analytes of interest in one extraction step with no additional steps needed. 3.1 Samples from China 3.1.1 Samples The cord plasma samples were collected from two cities (Beijing and Wuhan) in China in 2018. We obtained n=100 samples from Wuhan and n=104 samples from Beijing, i.e. in total 204 samples from China. The Beijing samples were kept in -40°C in the blood bank facility until they were transferred to the Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Science, where they were kept at -80°C. Samples from Wuhan were kept at -80°C in their blood bank reserve. After transportation to the institute, the samples were kept at -80°C. 3.1.2 Sample preparation for the Chinese samples Samples were prepared according to the MeOH/MTBE/CHCl3 method, justification is given in the results section on method development. First, 40 µl of the serum sample was transferred to an Eppendorf tube. Then, 270 µl of MeOH/MTBE/CHCl3 (1.33:1:1, v/v/v), 10 µl of BA and PFAS internal standard mixture (200 ng/mL PFAS and 1000 ng/mL bile acids in acetonitrile) and 12 µl of lipid internal standard mixture (25 ppm lipids in chloroform and methanol 2:1 solution) were added to the serum sample. The sample tube was vortexed for 20 seconds and then incubated in the room temperature at 900 rpm on a shaker for one hour. After incubation, the samples were centrifuged at 9600 RCF at room temperature for ten minutes. In total, 250 µl of the, supernatant, was collected to a new Eppendorf tube and evaporated with nitrogen at room temperature to dryness. The extracted Chinese serum samples were transported to Örebro University, Örebro, Sweden. All samples were prepared for lipidomics analysis by resuspending CHCl3/MeOH 2:1 solution into the dried vials and transferring the liquid to LC vials. After running the samples on LC- qTOF/MS (Agilent Technologies, Santa Clara, CA, he United States of America), additional evaporation to dryness was done to prepare samples for bile acid analysis. Methanol/water 70%/30% solution was added. The resuspended samples were run on LC-qTOF/MS (Agilent Technologies, Santa Clara, CA, he United States of America). 3.2 Samples from Finland 3.2.1 Samples The blood plasma samples were collected from Turku, Finland. The samples were taken from pregnant women from FinnBrain Birth cohort project. All pregnancies were as far as week 24. In total, we obtained 200 samples. The samples were kept at -80°C in Turku and transported in dry ice and kept at -80°C at Örebro University until further experiments.

15

3.2.2 Sample preparation for Turku samples All samples were randomized before extraction. Bile acids and PFAS extraction. A volume of 40 µl of serum sample was taken for BA and PFAS analyses and added to Eppendorf tube. A volume of 10 µl of BA and PFAS internal standard (200 ng/mL PFASs and 1000 ng/mL bile acids in acetonitrile) was added to the tube with 80 µl of ACN and vortex for 5 seconds. The tube was ultrasonicated for 3 minutes and then centrifuged at 10000 RCF for 5 minutes. After centrifugation, 90 µl of the supernatant was added to a glass vial with an insert and 10 µl of the sample was collected to a separate glass vial for pooled samples. The 90 µl of sample was evaporated to dryness and resuspended with 60 µl of MeOH:H2O mixture solution (70%:30%). Lipid extraction. The lipid extraction work was done on ice blocks to prevent lipid degradation. A volume of 10 µl of serum sample was added to Eppendorf tube with 10 µl of 0.9% NaCl solution and 120 µl of lipid internal standard (2.5 ppm lipids in chloroform and methanol 2:1 solution). The tube was vortexed until milk-like colour and put on ice for 30 minutes. After that, the sample was centrifuged at 9400 RCF for three minutes. In total, 60 µl of CHCl3:MeOH (2:1) was added to a glass vial with an insert. A volume of 60 µl of the lower layer of the sample in the Eppendorf tube was added to the same glass vial with an insert. In total, 50 µl of the leftover lower layer was added to a glass flask for pooled samples. External calibration curves were made for both lipid and BA and PFAS analysis. The calibration curves were run twice, once in the beginning of the run and once in the end of the run. For lipids, the linearity was received with 7 points from 0.1-5 ppm from a mixture of 17 lipids in a calibration standard mixture. The calibration curve points were 0.1 ppm, 0.5 ppm, 1 ppm, 2 ppm, 3 ppm, 4 ppm and 5 ppm. For bile acids and PFASs, the linearity was received with 7 points from 0-60/320 (PFAS/BA) ng/ml. The starting point of 0 included only internal standard and no additional calibration standards. The curve points were made from bile acid and PFAS calibration mixtures. The points were, 0 ng/ml, 0.04 ng/ml, 0.1 ng/ml, 2 ng/ml, 4 ng/ml, 20 ng/ml and 60 ng/ml for PFASs and 0 ng/ml, 10 ng/ml, 20 ng/ml, 40 ng/ml, 80 ng/ml, 160 ng/ml and 320 ng/ml for BAs. The results from calibration curve run were showing similar peak intentions and an average point value was calculated with the two values from the results for every curve point, which were then used for further calculations. 3.3 Sample analysis 3.3.1 Lipid analysis Samples were analysed on a UHPLC-qTOF/MS (Agilent Technologies, Santa Clara, CA, the United States of America) with separation using a Acquity UPLC®, BEH C18 (2.1x100 mm, particle size 1.7 µm) (Waters corporation, Milford, MA, the United States of America) column kept at 50°C. The column was protected with C18 pre-column (Waters Corporation, Wexford, Ireland). Mobile phases used for the sample run were A: 10 mM NH4Ac and 0.1% formic acid in water and B: 10 mM NH4Ac and 0.1% formic acid in acetonitrile/isopropanol (1:1). NH4Ac is used as ionization agent. The samples were kept at 10°C during the whole run and 1 µl of the sample volume was injected. The flow rate was set on 0.4 ml/min and the gradient for the run started with 65%A and 35%B with a change after two minutes to 20%A and 80%B, which followed the change to 100%B from minute 7-14. The whole acquisition was 21 minutes with 14 min gradient program and additional 7 minutes of equilibration time including clean-up and

16

stabilization before start of next sample. Maximum pressure limit in the binary pump was set on 850 bar. Dual jet stream electrospray (dual ESI) ion source was used and the reference mass solution included masses between 121.050873 m/z and 922.009798 m/z. The ion polarity was on positive mode. The acquisition mass range was set between 100-1700 m/z and the instrument was in extended dynamic range. The capillary voltage and the nozzle voltage were kept at 3643 V and 1500 V. The N2 pressure was set on 21 psi, with the sheath gas flow as 11 L/min with the nebulizer temperature at 379°C. The data was acquired with MassHunter B.06.01 software (Agilent Technologies, Santa Clara, CA, the United States of America). 3.3.2 Bile acid and PFAS analyses Bile acids and PFASs were analysed on a UHPLC-qTOF/MS (Agilent Technologies, Santa Clara, CA, the United States of America) with Acquity UPLC®, BEH C18 (2.1 x 100 mm, particle size 1.7 µm) (Waters Corporation, Milford, MA, the United States of America) column at 50°C with a C18 pre-column for column protection (Waters Corporation, Wexford, Ireland). Mobile phases used for the sample analysis were A: 2mM NH4Ac in H2O:MeOH (70:30) and B: 2mM NH4Ac in MeOH. NH4Ac was used as ionization agent. The samples were kept at 10°C during the whole sample acquisition and 1 µl of the sample volume was injected. The flow rate was set to 0.4 ml/min and the gradient started with 95%A and 5%B with a change after 1.5 minute to 70%A and 30%B, which followed a change after 4.5 minutes to 30%A and 70%B, the last change was after 7.5 minutes with 100%B until the end of run. Gradient program was 18 minutes long. 13 minutes for sample analysis and 5 minutes for clean-up, including equilibration in the end of the run. Maximum pressure limit in the binary pump was set on 850 bar. Dual jet stream electrospray (dual ESI) ion source was used and the ion polarity was on negative mode. The capillary voltage and the nozzle voltage were kept at 4500 V and 1500 V. The N2 pressure was set on 21 psi, with the sheath gas flow as 11 L/min and temperature at 379°C for the nebulizer. The data was acquired with MassHunter B.06.01 software (Agilent Technologies, Santa Clara, CA, the United States of America). 3.4 Data Pre-Processing Lipids were processed separately from BA and PFAS data. Mass spectrometry data was pre- processed with MZmine 2.53 (Pluskal et al., 2010). Peak detection with a noise level of 1000 was performed first, following with ADAP chromatogram builder including group intensity threshold to be as noise level (1000), minimum highest intensity 10 and m/z tolerance 0.009 m/z or 8 ppm. Next, chromatogram deconvolution was performed with local minimum search as algorithm with a 70% chromatographic threshold, 0.05 minutes of minimum tR range, 5% minimum relative height, 2250 minimum absolute height, 1 as minimum ratio of peak top/edge and peak duration range in minutes from 0.08 to 5.00. Isotopic peak grouper was done next with m/z tolerance of 0.05 m/z or 5 ppm, tR tolerance was set on 0.05 minutes and a maximum charge of 2. For the alignment of peak lists, a Join alignment algorithm was applied with m/z tolerance as 0.006 or 10.0 ppm and a weight of m/z as 2 with a tR tolerance of 0.1 and a weight of tR 1. Filtering with feature list rows filter was done next with 3 steps. First step rows that match with all criteria were kept with a tR range from 2 to 12 and m/z of 369-1200. Second filtering step removed rows that match with all criteria with a tR range of 2-4 and m/z of 800- 1200. Third filtering step removed rows that match with criteria with a tR range of 4-8 and m/z of 370-500. Next, gap filling with peak finder was done with m/z tolerance of 0.006 m/z or

17

10.0 ppm, tR tolerance of 0.1 minutes and with intensity tolerance as 50%. Last, the identification with a custom data base was done to identify the peak list with compound names. 3.5 Statistical analyses Data processing was done in Microsoft Excel. Lipidomics data was processed separately from BA and PFAS data. Both datasets were filtered by removing compounds that were present at high concentrations in extraction blanks. Instrumental blanks were included but no internal contamination was detected. The ratio for that was calculated with median of all samples peak areas divided by median of extraction blanks peak areas. The samples were then sorted based on the ratio from lowest to highest, and for the compounds having the ratio less than 3 were removed. After sorting, the peak areas of the detectable compounds were then normalised with internal standards (ISTD). If ISTD was not available for a specific compound, then ISTD with closest retention time (tR) was used for normalisation. Calibration curves were calculated for normalised values and used to calculate concentrations for identified compounds. Additional sorting was done for lipid samples. Relative standard deviation (RSD) was calculated for all QC samples and pooled samples. The RSD values were sorted from smallest to biggest and the compounds that had the RSD higher than 40% were removed from the list. Statistical analyses were done by using MetaboAnalyst 4.0 (Chong, Wishart and Xia, 2019). The data matrix contained 196 samples and 45 measured variables in total in the Chinese dataset and 198 samples and 42 measured variables in total in the Finnish dataset. No additional sample normalisation was done in the system, but the data was normalised by Log transformation and the data went through auto-scaling. The results are presented in Figure 20- 23, Cytoscape 3.8.0 (Shannon et al., 2003) with Metscape 3.1.3 (Karnovsky et al., 2012) and Correlation Calculation 1.0.1 (Basu et al., 2017) were used for additional statistical analysis for building correlation networks with the Chinese samples both Beijing and Wuhan and with the Finnish samples. The results are presented in Figures 11 and 19. 4. Results

In this study, the focus was mainly on the evaluation of interactions between the identified compounds and here, the concentrations of individual compounds were less relevant. Quantitative measurements of individual compounds are still included and presented to show the variation of the concentrations in the study subjects. 4.1 Method development

The two methods, MeOH/MTBE/CHCl3 and MTBE/MeOH (explained in the 2.2 Methods), which have been previously tested (Gil et al., 2018), were evaluated in this study,. The data was processed using raw values and corrected with a percentage of organic solvent volume used for the analysis. The results are presented as averages of overall peak area intensities and as well as the internal standards ISTD peak areas shown in Figure 1. Both methods were tested using the same amount of ISTD and the same sample volume. It is clear to see that ISTD recovery was much lower for the MTBE/MeOH method. Averages of overall peak areas are significantly higher using the MeOH/MTBE/CHCl3 method. Based on these results, the MeOH/MTBE/CHCl3 method was chosen for the preparation of Chinese samples in China.

18

Figure 1 – Comparison of the results from method development with two methods - MTBE/MeOH and MeOH/MTBE/CHCl3. The left graph on the figure presents the average values of ISTD peak area intensities measured for both methods. The right graph on the figure presents the overall averages of peak area intensities, which show a clear result of the MeOH/MTBE/CHCl3 giving much better recovery. Based on this comparison, this method was chosen for Chinese samples preparation.

A 3-day stability test was also conducted to evaluate any lipid degradation over time. QC- pooled samples were extracted according to the lipid extraction method (2.2.4 Lipid analysis) and evaporated to dryness. In total, 12 replicate samples were extracted for the test. Nine samples were kept in room temperature according to the plan, to test the stability in room temperature – Three for 1 day, three for 2 days and three for 3 days. Additional three samples were kept in the freezer from day one as comparison. After the test was over, all 12 samples were analysed by LC-qTOF and processed the data. No specific changes were detected. This test was needed to confirm stability of compounds of interest, if confronted with problems of sample transportation from China to Sweden. 4.2 Chinese samples Initially, 204 samples were analysed. However, due to insufficient volume of sample for extraction, in total 8 samples were removed, and 196 samples were analysed. 4.2.1 Lipid profiles A total number of 2880 of lipids were detected in the cord plasma samples, and 287 lipids, including four ISTDs, were identified with the help of a custom data base library. After sorting and cleaning the data (further explained in Chapter 4.2.4 Statistical analyses), 191 lipids were kept for further analysis. The lipid groups identified were free fatty acids (FFA), fatty acids (FA) from non-target approach analysis; while glycerolipids (GL), glycerophospholipids (GP), sterol lipids (ST) and sphingolipids (SP) from targeted lipid analysis. All measured lipid concentrations were median normalised and summed together as groups for further analysis. The lipids were grouped for statistical analysis based on their origin and named as Cer

19

(Ceramides), LPC (Lysophosphatidylcholines), LPE (Lysophosphatidylethanolamines), PC (Phosphatidylcholines), PC ether (Phosphatidylcholine ethers), PE (Phosphatidylethanolamines), PI (Phosphatidylinositols), PS (Phosphatidylserines), SM (Sphingomyelins), TG mufa (Triglyceride Monounsaturated fatty acids), TG pufa (Triglyceride Polyunsaturated fatty acids) and TG sfa (Triglyceride Saturated fatty acids). Median concentrations for lipid groups of Beijing and Wuhan samples were calculated separately. The error bars on the graphs present the maximum and minimum concentrations measured for both Beijing and Wuhan samples, representing possible biological variation of the lipids. The results are presented in Figure 2. Lipids are divided into different graphs based on concentration. Measured concentrations vary from 0.046 ng/mL to 958 ng/mL (Table 2).

Figure 2 A, B, C – Median values for all lipid groups with error bars presenting the maximum and minimum values measured. Beijing and Wuhan samples are presented in two different colours. Results are divided into three graphs due to measured values vary and different scales were needed.

4.2.2 Bile acids A total number of 7639 compounds were detected, and 12 bile acids, including one novel BA 12-epiCA, were identified in cord plasma samples. The identified BAs were HCA (primary BA), CA, (primary BA), 7-oxo-DCA (secondary BA), TαMCA (primary BA, conjugated with taurine), GHCA (primary BA, conjugated with glycine), GCA (primary BA, conjugated with glycine), TCDCA (primary BA, conjugated with taurine), GCDCA (primary BA, conjugated with glycine), GLCA (secondary BA, conjugated with glycine) and 12-epiCA (3α, 7α, 12β- trihydroxy-5β-cholan-24-oic acid, novel secondary BA, epimerized). Median concentrations were calculated for all measured BAs and are presented in Figure 3. Error bars on the graphs represent measured maximum and minimum values. Values vary from 0.018 ng/mL to 2115 ng/mL (Table 11).

20

Figure 3 A,B – Bile acid median values with error bars presenting maximum and minimum values measured for both Beijing and Wuhan samples. The results are divided into different graphs due to measured values vary. Beijing and Wuhan samples are presented with different colours.

4.2.3 The exposome The cord plasma samples were extracted and analysed to detect environmental toxins with both a targeted and a non-targeted approach. Target PFAS analysis in the same samples have been measured and reported in another study (Wang et al., 2020). The detected and identified PFASs were PFOA, PFNA, PFDA, PFUnDA, PFTriDA, PFHxS, PFHpS, PFOS, 6:2 Cl-PFESA and 8:2 Cl-PFESA. Calculated median values for detectable PFAS are presented in Figure 4. In addition to PFASs, two environmental pollutants butylparaben and benzylparaben; nine free fatty acids C16:1, C22:6, C18:2 and C20:5, linolenic acid, oleic acid, palmitic acid, stearic acid and arachidic acid were identified with the non-target analysis approach. The results are all presented in Figure 5. Error bars on the graph are presenting minimum and maximum concentrations measured for the Beijing and Wuhan samples.

21

Figure 4 A,B,C – PFAS median values with error bars presenting maximum and minimum values for samples from Beijing and Wuhan presented in different colours. Results are divided into different scales.

Figure 5 A,B,C,D,E – Non-target analysis results including free fatty acids and parabens. Error bars present minimum and maximum values. Beijing and Wuhan samples are presented with different colours and results are divided into different scales.

All median values from PFAS and identified analytes from non-target analysis with minimum and maximum values for both Beijing and Wuhan samples are presented in Tables 12 and 13.

22

4.2.4 Statistical analyses 4.2.4.1 T-test The T-test is used for predicting differences in two groups. It analyses if the means of two groups are different from each other and can be statistically used for dividing samples into two separate groups. Significant level is a risk percentage given for the test to make type 1 error. This gives the probability that the test would not be able to detect the difference. Type 1 error is usually interpreted as false positive. The p-values are calculated probabilities that describe the ability to reject the null hypothesis, which means that there is a difference in the samples between two groups. A t-test was used to assess significant differences in concentrations of detectable analytes between samples from Wuhan and Beijing with significance level set with p<0.05. The p values obtained from the t-test were transformed by -log10 and plotted in Figure 6. In total, 39 significant features out of 45 variables were detected. Seven variables with the lowest p-values are named and circled on the graph. In addition, five compounds of interest are named on the graph; two novel chlorinated polyfluorinated ether sulfonates and triglycerides (monounsaturated, polyunsaturated, and saturated). Significance line runs on y-axis at 1.25. Seven compounds that stayed under the significant line are, in order, HCA, 12-epiCA, oleic acid, palmitic acid, Cer, LPC and PE. The full list of significant features is presented in Table 1. The novel PFAS compounds were found in much higher concentrations in Wuhan samples, which are presented as group number one (1). The difference between two groups (Beijing and Wuhan) based on 6:2 Cl-PFESA and 8:2 Cl-PFESA compounds are presented in Figure 7.

Figure 6 – Graphical presentation of the p-values obtained from the T-test after -log10 between blood samples from Wuhan and Beijing analysed for markers of early life exposure.

Compounds on the x-axis are in the order as they are set in the file used for the statistical analysis.

23

Figure 7 – Two groups (Beijing and Wuhan) from Chinese samples compared by two different compounds – 6:2 Cl-PFESA and 8:2 Cl-PFESA

Table 1 - The full list of the 39 significant important features detected in the t-test between Beijing and Wuhan.

24

4.2.4.2 Correlations A correlation heat map between all measured variables in Chinese samples is presented in Table 8. The scale is set to from -0.8 as negative correlation to 0.8 as positive correlation, perfect correlation is presented as 1. The correlations between compounds separately are presented in the Table 2 for PFASs vs. BAs, Table 3 for PFASs vs. Lipid classes and Table 4 for BAs vs. Lipid classes. In Table 2, significant correlations were observed between PFAS and BA compounds. The strongest positive correlations are PFOA, PFNA, PFDA, PFUnDA, PFHxS, PFOS and 6:2 Cl- PFESA from the PFASs list with GHCA, CA and GLCA from the BAs list. Relatively strong positive correlation is also observed between TαMCA and GCA. The novel BA 12-epiCA has negative correlation with most of the PFAS compound. PFTriDA, PFHpS and 8:2 Cl-PFESA have the lowest positive correlation with the compounds presented.

Table 2 – Measured values of PFASs compared with measured values of BAs in Chinese blood samples. Blue colour represents negative correlation and red colour represents positive correlation. White colour shows neutral correlation.

In Table 3, a correlation table between identified PFASs and lipid groups is shown. Very strong positive correlations are observed between most of the PFASs and all three TGs groups. PFTriDA and 8:2 Cl-PFESA and PFTriDA have the lowest positive correlation with TGs. Cer, LPC, PE, PI and PS are lipid groups that are in a negative correlation with all PFASs, with one exception, LPC, which has a low positive correlation with one PFAS, 8:2 Cl-PFESA. Lipid groups PC and PC ether have fairly low positive correlations with PFASs. SM and LPE have a significant positive correlation with all PFASs.

Table 3 - Measured values of PFASs compared with measured values of lipids in Chinese blood samples. Blue colour represents negative correlation and red colour represents positive correlation. White colour shows neutral correlation.

The correlations between all detected and identified BAs and lipid groups are presented in the Table 4. There are two BAs that stand out with a strong positive correlation with TGs – CA and GLCA. Three BAs, 7-oxo-DCA, HCA and 12-epiCA have low correlation with TGs. The rest of the bile acids have significant positive correlation with all three presented TG groups. Lipid groups Cer, PC and PC ether are fairly neutral compounds with few exceptions. LPC and

25

LPE have a low positive correlation with BAs. PE, PI, and PS have quite strong negative correlation with all BAs. Lipid group SM has a significant positive correlation with all BAs but not as strong as TGs.

Table 4 - Measured values of BAs compared with measured values of lipids in Chinese blood samples. Blue colour represents negative correlation and red colour represents positive correlation. White colour shows neutral correlation.

In Figure 8, the correlations between the top 24 compounds with mono- and polyunsaturated and saturated triglycerides are shown. These 24 compounds have either the strongest positive or negative correlations with chosen compound. Strong positive correlations are observed with most of the PFASs. Especially high correlation with PFOS, PFOA, PFNA and PFHxS. The BA that is strongly correlated with TGs is CA, which is one of the primary BAs that is synthesized first, directly in the liver.

Figure 8 – Triglycerides strongest positive or negative correlations with 24 compounds in the Chinese blood samples. Blue colour represents negative correlation and red colour represents positive correlation.

In Figure 9, the correlations between the top 24 compounds that show the strongest correlations, with four PFAS compounds, PFOA, PFNA, PFHxS and PFOS, are shown. All these PFASs are strongly correlated with each other but also with all TGs. In addition, we can see from the patterns, that three BAs (CA, GLCA and GHCA) have respectively strong correlation with these PFASs.

26

Figure 9 – Strongest positive or negative correlations of PFOA, PFNA, PFHxS and PFOS, with 24 compounds in the Chinese blood samples. Blue colour represents negative correlation and red colour represents positive correlation.

In Figure 10, the correlations between the top 24 compounds, that show the strongest correlations, and three BAs are shown. GLCA and CA have very strong correlation with previously presented PFASs – PFOA, PFNA, PFOS and PFHxS. These two BAs are also in a significant correlation with all three TG groups. GHCA is mainly correlating with different BAs, but is also in respectively strong correlation with PFOA, PFNA and PFOS.

27

Figure 10 – Strongest positive or negative correlations of GLCA, CA and GHCA, with 24 compounds in the Chinese blood samples. Blue colour represents negative correlation and red colour represents positive correlation.

Cytoscape 3.8.0 was used to build an additional correlation network showing the relationship between different compounds and the results are presented in Figure 11.

Figure 11 – Correlation network with all measured values for the whole sample list from China. On the network we can see compounds coloured based on their grouping – light blue as PFASs, light green as BAs, yellow as lipids and purple as compounds detected with a non-target approach. Dark blue lines represent negative correlation and if the line is bolder, the stronger is the negative correlation. Red lined represent positive correlation and similarly to blue lines, the bolder the line, the stronger is the positive correlation between compounds. The grey lines are presenting neutral correlation between all compounds. No correlation was detected if there are no lines between compounds.

28

4.3 Finnish samples – FinnBrain project In total, two samples were removed during the data processing, due to technical difficulties during instrumental analysis. 4.3.1 Lipid profiles In total, 1134 compounds were detected from plasma samples, and 278 lipids, including 8 ISTD, from the whole list were identified with a help of the same custom data base library used for the Chinese samples. The lipid groups identified were FFA, FA, GL, GP, ST, and SP. All lipid concentrations were median normalised and summed together as groups based on their origin for statistical analysis. The groups created for further analysis were Cer, LPC, PC, PC ether, PE, PE ether, PI, PS, SM, TG mufa, TG pufa and TG sfa. Median values were calculated for compounds and are presented with minimum and maximum values as error bars in Figure 12. Values are divided into three different scales due to values varying from 0.273 ng/mL to 1466 ng/ml. The median concentrations with minimum and maximum values for FinnBrain lipid analysis are presented in Table 14.

Figure 12 A,B,C – Lipid median values with error bars presenting maximum and minimum values measured for all compounds in Finnish samples. Results are divided into different scales.

4.3.2 Bile acids BA and PFAS extraction and instrumental analysis was performed simultaneously. In total, 1142 compounds were detected from plasma samples and 20 BAs were identified. The identified BAs were CA (primary BA), DCA (primary BA), CDCA (primary BA), GCA (primary BA, conjugated with glycine), GDCA (primary BA, conjugated with glycine), GCDCA (primary BA, conjugated with glycine), GHDA (primary BA, conjugated with glycine), TαMCA (primary BA, conjugated with taurine), TCA (primary BA, conjugated with taurine), TDCA (primary BA, conjugated with taurine), TCDCA (primary BA, conjugated with taurine), HDCA (secondary BA), LCA (secondary BA), UDCA (secondary BA), GHDCA (secondary BA, conjugated with glycine), GLCA (secondary BA, conjugated with glycine), GUDCA (secondary BA, conjugated with glycine), THDCA (secondary BA, conjugated with

29

taurine), TLCA (secondary BA, conjugated with taurine), TUDCA (secondary, conjugated with taurine). Median concentrations were calculated for all identified BAs and are presented in Figure 13 with error bars as minimum and maximum values. The compounds have been spread out on three different scales due to concentrations varying from 0-189 ng/mL.

Figure 13 A,B,C,D – BA median values with error bars presenting minimum and maximum values measured for all BAs in Finnish sample list. Results are divided into different scales.

4.3.3 Exposome In total, five PFASs were identified from the 1142 compounds detected. In addition to five PFASs, one paraben and one amino acid and free fatty acids were identified in the non-target analysis to find environmental toxins and other substances. The five identified PFASs were PFHxS, PFNA, PFOA, PFOS and PFPeA. The other identified substances were propylparaben, tyrosine, stearic acid, palmitic acid, myristic acid and linolenic acid. All median concentrations with minimum and maximum values for both PFASs and identified analytes from non-target analysis results are presented in the Figure 14 and Figure 15. With one exception, PFPeA was excluded, due to high blank contamination and issues with ISTD.

30

Figure 14 A,B – PFAS median values with error bars as minimum and maximum values for all compounds in Finnish samples. Results are divided into different scales.

Figure 15 A,B,C – Median values for non-target analysis with error bars as minimum and maximum values for all compounds. Results are divided into different scales.

4.3.4 Statistical analyses 4.3.4.1 Correlation The correlation heat map between all measured variables in Finnish samples are presented in Table 9. The scale is set similarly to that of the Chinese samples, from -0.8 as negative correlation to 0.8 as positive correlation, where 1 shows the perfect correlation. The

31

correlations between compound groups separately are presented in the Table 5 for PFASs vs BAs, Table 6 for PFASs vs. Lipid classes and Table 7 for BAs vs. Lipid classes. In Table 5, significant correlations are observed between half of the PFASs and BAs. CA, DCA, GLCA, HDCA, LCA and UDCA have quite neutral correlation with PFASs. PFNA has the strongest positive correlation with BAs, especially with taurine conjugated BAs. GDCA, GUDCA, TαMCA, TCA, TDCA, TDCA, THDCA, TLCA and TUDCA have all the strongest positive correlation with all PFASs, but the correlations are much lower when comparing the Finnish results with the Chinese results.

Table 5 - Measured values of PFASs compared with measured values of BAs in Finnish blood samples. Blue colour represents negative correlation and red colour represents positive correlation. White colour shows neutral correlation.

In Table 6, correlations are observed between four PFASs and twelve lipid groups. Cer, LPS, PE ether and SM have the strongest positive correlation with all PFASs presented in this table. PC, PC ether, and TG pufa have significant correlation with PFASs. The most neutral correlation is between PE, TG mufa, TG sfa and all PFASs.

Table 6 - Measured values of PFASs compared with measured values of lipids in Finnish blood samples. Blue colour represents negative correlation and red colour represents positive correlation. White colour shows neutral correlation.

In Table 7, correlations are observed between all identified BAs and the lipid groups. Most of the compounds do not have strong correlation with each other, with some exceptions. Two lipid groups, LPC and TG sfa, have the strongest positive correlation with most of the BAs. The strongest with CA, DCA, GDCA, GHDCA, GUDCA and TDCA. Another significant correlation is between GHDCA and lipid group PS. Cer, PC, PC ether, PE ether, PI, and TG pufa have mostly negative correlation with all BAs. PE, SM and TG mufa have little correlation with most of the BAs.

32

Table 7 - Measured values of BAs compared with measured values of lipids in Finnish blood samples. Blue colour represents negative correlation and red colour represents positive correlation. White colour shows neutral correlation.

In Figure 16, correlations between top 24 compounds that show the strongest correlations with four BAs are presented. The highest positive correlation is between these BAs and all the rest of BAs. As shown above, PFNA is one of the PFASs that is showing significant positive correlation with all four BAs. Other two PFASs, PFOA and PFOS have a bit lower correlation, but still significant with these BAs. One lipid group, TG sfa, has a low positive correlation with presented compounds and rest of the lipids have either negative correlation or do not show any strong correlations and thus, are not shown here. There is one compound, TDCA, that has a strong positive correlation with most of the compounds, although in the Table 7 we can see that TDCA is mostly in negative correlation with lipid groups.

33

Figure 16 – Strongest positive or negative correlations of TCA, TCDCA, TDCA and TUDCA, with 24 compounds in the Finnish blood samples. Blue colour represents negative correlation and red colour represents positive correlation.

Correlation patterns of top 24 compounds with PFASs are presented in the Figure 17. All four compounds have strong positive correlation with each other and much lower positive correlation with all the other compounds. All four PFASs have very similar correlation patterns with a few exceptions. Lipid groups SM, Cer, LPC and PE ether, have the strongest positive correlations with PFASs from all the lipid groups, as also seen in Table 6. In addition, all four compounds have significant negative correlation with one detected environmental chemical – propylparaben.

33

Figure 17 – Strongest positive or negative correlations of PFNA, PFOA, PFOS and PFHxS, with 24 compounds in the Finnish blood samples. Blue colour represents negative correlation and red colour represents positive correlation.

Correlation patterns of top 24 compounds with lipid groups Cer, LPC, PE ether, SM and TG sfa are presented in Figure 18. These lipid groups are chosen based on having significant numbers on the correlation heat maps and are presented separately to show more specific correlations. Cer, LPC, PE ether and SM have strong positive correlations with other lipid groups but also with all PFAS substances, as also seen in Table 6. One group, TG sfa, is in very neutral, slightly negative correlation with PFASs, but instead is in quite strong positive correlation with almost all BAs. In addition to TG sfa, LPC has a significant positive correlation with most of the BAs.

34

Figure 18 – Strongest positive or negative correlations of Cer, LPC, PE ether, SM and TG sfa, with 24 compounds in the Finnish blood samples. Blue colour represents negative correlation and red colour represents positive correlation.

35

Cytoscape 3.8.0 was used to build an additional correlation network showing the relationship between different compounds and the results are presented in Figure 19.

Figure 19 - Correlation network with all measured values for the whole sample list from Finland. Compounds are coloured in the same order on the correlation network as with China samples on the Figure 11 – light blue as PFASs, light green as BAs, yellow as lipids and purple as compounds detected with non-target approach. Dark blue lines are presenting negative correlation and if the line is bolder, the stronger is the negative correlation. Red lined are showing positive correlation and similarly to blue lines, the bolder the line, the stronger is the positive correlation between compounds. Light grey lines present neutral correlation between all compounds. No correlation was detected if there are no lines between compounds.

36

5. Discussion

PFAS exposure at high concentrations have been indicated to increase the risk of development of several adverse health outcomes. One of the mentioned diseases is diabetes, and PFAS exposure has been linked with both type 1 and type 2 (T1D, T2D). T1D is an autoimmune disease that has been increasing worldwide in young children under the age of 15 and has been predicted to increase more rapidly in children younger than 5 years old (Onkamo et al., 1999; Harjutsalo, Sjöberg and Tuomilehto, 2008; Patterson et al., 2009). Previous studies have shown that PFAS exposure during pregnancy has been affecting these children who have a genetic risk with specific changes in their lipid profiles (Lamichhane et al., 2019; Mcglinchey et al., 2019) corresponding to the lipid profiles that have been identified in the cord blood of children who later develop T1D (La Torre et al., 2013; Orešič et al., 2013). There was a significant interaction between the genetic risk factors and PFAS levels, showing that the impact is caused by a combinatory of effects. For example, SM levels (one group of the phospholipids) were down regulated by PFAS exposure and that may indicate an increased risk of developing T1D in long term in those children that have a genetic risk for T1D (Lamichhane et al., 2019). In another study, low levels of lipid groups PCs and PEs has been shown to increase the risk for developing T1D at young age (La Torre et al., 2013). In addition, for healthy fetus development, choline intake is demanded; if choline intake is not sufficient during pregnancy, as a consequence low levels of TGs in the blood and low levels of phospholipids in the cord blood is proposed (Sanford C. Garner, Mei-Heng Mar, 1995; Yamaguchi et al., 2007). Interestingly, in this study we observed a positive association between PFAS levels and SM in cord blood samples from China (Figure 2), which is an opposite trend of an association that has been shown to be linked with the T1D risk. Lipid group PE had a strong negative correlation with both PFASs and BAs, which could be an indication of increasing the risk of developing T1D (Tables 3 and 4). The overall incidence of T1D is much lower in China than it is in the Nordic countries and the Chinese population does not have the same genetic risk factors (Xia et al., 2019). Thus, our results show the importance of genetic risk factors in exposome studies if investigating the development of T1D (Cerolsaletti, Hao and Greenbaum, 2019). In addition, if comparing our results with a previous study by Conway, Innes and Long (2016), the risk of developing T1D is lower than developing T2D, based on the measured levels of PFOA in cord- blood samples. T2D, on the other hand, is strongly connected with high lipid concentrations in blood, particularly links with TGs; moreover, T2D is typically linked with obesity. Non-alcoholic fatty-liver disease (NAFLD), a comorbidity of T2D, has similarly to T2D been associated with high levels of lipids, especially TGs, as well as BAs in blood. Previous research also suggests that PFASs are positively correlated with lipid groups or alter the regulation of several lipid pathways (Alderete et al., 2019; Jin et al., 2020). In addition, PFAS exposure, particularly PFOA for example, during pregnancy has been proven to cause lower birth weights (Ashley- Martin et al., 2017), With this hypothesis, and results presented in literature, we expected a rise in total lipid concentrations with high PFAS concentrations, and a decrease in BAs total concentrations. Many studies have demonstrated how PFAS disturb regular metabolic pathways. The most investigated and monitored PFASs, related to observing the relationship between PFASs exposure and diabetes, have been PFOA, PFOS, PFNA and PFHxS, which are also the PFASs that show stronger correlations with lipid groups in this study and are detected

37 in higher concentrations than other PFASs. Several studies have indicated that high concentrations of PFASs found in plasma can cause alterations in glucose homeostasis and lipid and amino acid pathways, which are linked with both T2D and NAFLD (Orešič, Hyötyläinen, et al., 2013; Alderete et al., 2019; Salihović et al., 2019). In this study we detected very high correlation between TGs and PFASs compounds in the cord plasma samples (Table 3), which can be an indication of having a higher risk of developing T2D, based on the previously mentioned studies. In addition to positive correlation between PFASs and lipids, we would expect lower levels of unconjugated BAs. In the cord blood based samples we can see strong negative correlation with some of the lipid groups, but still significant positive correlation with TGs and also with SM (Table 4). Correlations between BAs and PFASs are also significant and positive. One BA that shows surprisingly high positive correlation with both PFASs and TGs is CA, one of the primary BA, as well as GLCA, which is conjugated secondary BA. High levels of TGs and lower levels of ether lipids and phospholipids in our cord blood samples can be an indication for a higher risk of developing NAFLD (Orešič, Hyötyläinen, et al., 2013). In the maternal samples, we detected rather low correlations between PFASs and most of the BAs, as well as between lipid groups and PFASs. TGs, that showed very high correlations with PFASs in the cord plasma, do not show any significant correlations in maternal samples. Most of the taurine conjugated BA, which have been associated as one of the markers in developing T2D (Wewalka et al., 2014) are showing the highest correlations with PFASs. Our results indicate that the impact of PFAS exposure on lipid metabolism is much stronger during fetal development than in adult life. It should also be noted that the bile acid metabolism in the fetus is remarkably different when compared with the adults. Fetus liver starts to produce bile acids very early during the pregnancy, however its functions are yet very poor, and the elimination of compounds is not efficient when compared to adults. Thus, the accumulation of cholephilic compounds, that are usually excreted into bile, is increased (Macias, Marin and Serrano, 2009). In the maternal samples we observed much weaker association, both on bile acids and in lipids, thus can conclude that the PFAS exposure cause much stronger impacts during the early development. Previous studies have hypothesized how the BA synthesis from cholesterol may be down- regulated by PFAS compounds (Beggs et al., 2016; Chiang, 2017). Because BAs are involved in the regulation of glucose metabolism and lipid metabolism, it is crucial to maintain balanced BA pool in the organism. Particularly, TG metabolism is regulated by BAs through farnesoid X receptor, which is linking the overall BA metabolism with lipid metabolism (Bjork, Butenhoff and Wallace, 2011; Zhang et al., 2015; Behr et al., 2020) If the BA synthesis initiated by CYP7A1 is altered, it may cause increased levels of cholesterol in the liver (Beggs et al., 2016; Chiang, 2017). This can be an additional trigger and increase the risk of developing T2D. In this study we found that most of the conjugated secondary and primary BAs are in positive correlation with PFASs in the cord blood samples. This may indicate that synthesis of BA can be down-regulated. In addition to PFAS, and BAs, with the non-target approach, we detected and identified mainly free fatty acids but also three parabens (butylparaben, benzylparaben and propylparaben) and one amino acid (tyrosine). Parabens are commonly used as preservatives in cosmetics and pharmaceuticals (PubChem, 2020a). Butylparaben is also widely used in personal care but mostly as a food preservative and antifungal agent in food industry (PubChem, 2020b).

38

Two novel PFASs that were detected in cord blood samples 6:2 Cl-PFESA and 8:2 Cl-PFESA were found in relatively high concentrations when compared with the results of a previous study (Pan et al., 2017). These compounds have been used as alternative for PFOS in Chinese metal plating industry for about 30 years and are mostly found in the environment close to the factories in China (Wang et al., 2013; Ruan et al., 2015; Shi et al., 2015). The novel BA, 12-epiCA, found in cord blood samples, is a CA (cholic acid) that is epimerized into 12-epiCA (Lin et al., 2020). During the epimerization, hydroxyl groups are catalysed by hydroxysteroid dehydrogenases (Macdonald et al., 1976; MacDonald et al., 1977). 12-epiCA is relatively new BA to the research, but it is believed that it is one of the common human BAs (Zhu et al., 2018). In this study we identified the novel BA only in the cord blood samples and found quite low levels of 12-epiCA. The compound is in the most neutral, slightly negative correlation with all the other compounds identified (Table 8), which can indicate that the novel BA is not affected by PFAS exposure. Conclusions

We collected novel data on bile acids in cord blood samples from a Chinese cohort and observed several interactions between PFAS, BA and lipids in infants. The data was compared to maternal blood samples from a Finnish cohort. The results indicated that PFAS exposure is linked with increased levels of those lipids that have been previously associated with T2D and NAFLD, particularly TGs. Moreover, the impact of prenatal exposure has much less impact on adults (mothers) and seems to be much more harmful on infants and the following first years after birth. Further investigation is needed to be able to fully characterise the impact of exposure by combining the data with health outcome data. The FinnBrain cohort includes a follow-up study on the children health. The study will be continued including cord blood samples from the FinnBrain cohort project. Next step is to evaluate the impacts of the exposure on the health of these children and the health outcomes. The data has been collected. In addition, the study can continue by identifying the unknown compounds found in both Chinese and Finnish cohorts.

39

References

3M (1999) Perfluorooctane Sulfonate: Current Summary of Human Sera, Health and Toxicology Data 3M.

Alderete, T. L. et al. (2019) ‘Perfluoroalkyl substances, metabolomic profiling, and alterations in glucose homeostasis among overweight and obese Hispanic children: A proof- of-concept analysis’, Environment International. Elsevier Ltd, 126, pp. 445–453. doi: 10.1016/j.envint.2019.02.047.

Ashley-Martin, J. et al. (2017) ‘Maternal Concentrations of Perfluoroalkyl Substances and Fetal Markers of Metabolic Function and Birth Weight’, American Journal of Epidemiology, 185(3), pp. 185–193. doi: 10.1093/aje/kww213.

Assa, A. et al. (2017) ‘Anthropometric measures and prevalence trends in adolescents with coeliac disease: A population based study’, Archives of Disease in Childhood. BMJ Publishing Group, 102(2), pp. 139–144. doi: 10.1136/archdischild-2016-311376.

Atkinson, M. A., Eisenbarth, G. S. and Michels, A. W. (2014) ‘Type 1 diabetes’, The Lancet. Lancet Publishing Group, pp. 69–82. doi: 10.1016/S0140-6736(13)60591-7.

Basu, S. et al. (2017) ‘Sparse network modeling and metscape-based visualization methods for the analysis of large-scale metabolomics data’, Bioinformatics, 33(10), pp. 1545–1553. doi: 10.1093/bioinformatics/btx012.

Beggs, K. M. et al. (2016) ‘The role of hepatocyte nuclear factor 4-alpha in perfluorooctanoic acid- and perfluorooctanesulfonic acid-induced hepatocellular dysfunction’, Toxicology and Applied Pharmacology. Academic Press Inc., 304, pp. 18–29. doi: 10.1016/j.taap.2016.05.001.

Behr, A. C. et al. (2020) ‘Activation of human nuclear receptors by perfluoroalkylated substances (PFAS)’, Toxicology in Vitro. Elsevier Ltd, 62. doi: 10.1016/j.tiv.2019.104700.

Bjork, J. A., Butenhoff, J. L. and Wallace, K. B. (2011) ‘Multiplicity of nuclear receptor activation by PFOA and PFOS in primary human and rodent hepatocytes’, Toxicology. Toxicology, 288(1–3), pp. 8–17. doi: 10.1016/j.tox.2011.06.012.

Caio, G. et al. (2019) ‘Celiac disease: A comprehensive current review’, BMC Medicine. BioMed Central Ltd. doi: 10.1186/s12916-019-1380-z.

Cerolsaletti, K., Hao, W. and Greenbaum, C. J. (2019) ‘Genetics coming of age in type 1 diabetes’, Diabetes Care. American Diabetes Association Inc., pp. 189–191. doi: 10.2337/dci18-0039.

Chiang, J. Y. L. (2017) ‘Recent advances in understanding bile acid homeostasis’, F1000Research. Faculty of 1000 Ltd. doi: 10.12688/f1000research.12449.1.

Chong, J., Wishart, D. S. and Xia, J. (2019) ‘Using MetaboAnalyst 4.0 for Comprehensive and Integrative Metabolomics Data Analysis’, Current Protocols in Bioinformatics. John Wiley and Sons Inc., 68(1). doi: 10.1002/cpbi.86.

40

Di Ciaula, A. et al. (2017) ‘Bile acids and cancer: Direct and environmental-dependent effects’, Annals of Hepatology. Fundacion Clinica Medica Sur, 16, pp. S87–S105. doi: 10.5604/01.3001.0010.5501.

Conway, B., Innes, K. E. and Long, D. (2016) ‘Perfluoroalkyl substances and beta cell deficient diabetes’, Journal of Diabetes and its Complications. Elsevier Inc., 30(6), pp. 993– 998. doi: 10.1016/j.jdiacomp.2016.05.001.

Croes, K. et al. (2012) ‘Persistent organic pollutants (POPs) in human milk: A biomonitoring study in rural areas of Flanders (Belgium)’, Chemosphere. doi: 10.1016/j.chemosphere.2012.06.058.

DeWitt, J. (2015) Toxicological Effects of Perfluoroalkyl and Polyfluoroalkyl Substances, Molecular and Integrative Toxicology. Edited by R. R. Dietert. Springer International Publishing (Molecular and Integrative Toxicology). doi: 10.1007/978-3-319-15518-0.

Gil, A. et al. (2018) ‘One- vs two-phase extraction: re-evaluation of sample preparation procedures for untargeted lipidomics in plasma samples’, Analytical and Bioanalytical Chemistry. Springer Verlag, 410(23), pp. 5859–5870. doi: 10.1007/s00216-018-1200-x.

Haeusler, R. A. et al. (2013) ‘Human insulin resistance is associated with increased plasma levels of 12a-hydroxylated bile acids’, Diabetes. Diabetes, 62(12), pp. 4184–4191. doi: 10.2337/db13-0639.

Harjutsalo, V., Sjöberg, L. and Tuomilehto, J. (2008) ‘Time trends in the incidence of type 1 diabetes in Finnish children: a cohort study’, The Lancet. Elsevier, 371(9626), pp. 1777– 1782. doi: 10.1016/S0140-6736(08)60765-5.

Hyötyläinen, T. and Orešič, M. (2015) ‘Optimizing the lipidomics workflow for clinical studies - practical considerations’, Analytical and Bioanalytical Chemistry. Springer Verlag, 407(17), pp. 4973–4993. doi: 10.1007/s00216-015-8633-2.

Jin, R. et al. (2020) ‘Perfluoroalkyl substances and severity of nonalcoholic fatty liver in Children: An untargeted metabolomics approach’, Environment International. Elsevier Ltd, 134, p. 105220. doi: 10.1016/j.envint.2019.105220.

Karnovsky, A. et al. (2012) ‘Metscape 2 bioinformatics tool for the analysis and visualization of metabolomics and gene expression data’, Bioinformatics, 28(3), pp. 373–380. doi: 10.1093/bioinformatics/btr661.

Kissa, E. (2001) Fluorinated Surfactants and Repellents, Second Edition, Marcel Dekker, New York.

Lamichhane, S. et al. (2019) ‘Cord-blood lipidome in progression to islet autoimmunity and type 1 diabetes’, Biomolecules, 9(1), pp. 1–9. doi: 10.3390/biom9010033.

Lefebvre, P. et al. (2009) ‘Role of bile acids and bile acid receptors in metabolic regulation’, Physiological Reviews. American Physiological Society, pp. 147–191. doi: 10.1152/physrev.00010.2008.

Macias, R. I. R., Marin, J. J. G. and Serrano, M. A. (2009) ‘Excretion of biliary compounds during intrauterine life’, World Journal of Gastroenterology. Baishideng Publishing Group

41

Inc, pp. 817–828. doi: 10.3748/wjg.15.817.

Mcglinchey, A. et al. (2019) ‘Prenatal exposure to perfluoroalkyl substances modulates neonatal serum phospholipids , increasing risk of type 1 diabetes’, pp. 1–33.

Miller, G. (2013) The Exposome: A Primer, The Exposome: A Primer. Elsevier Inc. doi: 10.1016/C2013-0-06870-3.

National-Research-Council (1989) Diet and Health: Implications for Reducing Chronic Disease Risk., National Academies Press (US). Available at: https://www.ncbi.nlm.nih.gov/books/NBK218759/ (Accessed: 6 May 2020).

Onkamo, P. et al. (1999) ‘Worldwide increase in incidence of Type I diabetes - The analysis of the data on published incidence trends’, Diabetologia. Diabetologia, 42(12), pp. 1395– 1403. doi: 10.1007/s001250051309.

Orešič, M., Gopalacharyulu, P., et al. (2013) ‘Cord serum lipidome in prediction of islet autoimmunity and type 1 diabetes’, Diabetes. American Diabetes Association, 62(9), pp. 3268–3274. doi: 10.2337/db13-0159.

Orešič, M., Hyötyläinen, T., et al. (2013) ‘Prediction of non-alcoholic fatty-liver disease and liver fat content by serum molecular lipids’, Diabetologia, 56(10), pp. 2266–2274. doi: 10.1007/s00125-013-2981-2.

Patterson, C. C. et al. (2009) ‘Incidence trends for childhood type 1 diabetes in Europe during 1989-2003 and predicted new cases 2005-20: a multicentre prospective registration study’, The Lancet. Elsevier Limited, 373(9680), pp. 2027–2033. doi: 10.1016/S0140- 6736(09)60568-7.

Pluskal, T. et al. (2010) ‘MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data’, BMC Bioinformatics. BioMed Central, 11(1), p. 395. doi: 10.1186/1471-2105-11-395.

Prawitt, J., Caron, S. and Staels, B. (2011) ‘Bile acid metabolism and the pathogenesis of type 2 diabetes’, Current Diabetes Reports, pp. 160–166. doi: 10.1007/s11892-011-0187-x.

PubChem, D. (2020a) Benzyl 4-hydroxybenzoate | C14H12O3 - PubChem, National Center for Biotechnology Information. Available at: https://pubchem.ncbi.nlm.nih.gov/compound/7180#section=Use-and-Manufacturing (Accessed: 9 May 2020).

PubChem, D. (2020b) Butylparaben | C11H14O3 - PubChem, National Center for Biotechnology Information. Available at: https://pubchem.ncbi.nlm.nih.gov/compound/7184#section=Overview&fullscreen=true (Accessed: 9 May 2020).

Risé, P. et al. (2007) ‘Fatty acid composition of plasma, blood cells and whole blood: Relevance for the assessment of the fatty acid status in humans’, Prostaglandins Leukotrienes and Essential Fatty Acids. Churchill Livingstone, 76(6), pp. 363–369. doi: 10.1016/j.plefa.2007.05.003.

Salihović, S. et al. (2019) ‘Simultaneous determination of perfluoroalkyl substances and bile

42

acids in human serum using ultra-high-performance liquid chromatography–tandem mass spectrometry’, Analytical and Bioanalytical Chemistry. doi: 10.1007/s00216-019-02263-6.

Sanford C. Garner, Mei-Heng Mar, S. H. Z. (1995) Choline Distribution and Metabolism in Pregnant Rats and Fetuses are Influenced by the Choline Content of the Maternal Diet, J Nutr. doi: 10.1093/jn/125.11.2851.

Sarigiannis, D. A. (2019) ‘The Exposome: A New Tool for Improved Health Risk Assessment’, in Management of Emerging Public Health Issues and Risks. Elsevier, pp. xxiii–xlv. doi: 10.1016/b978-0-12-813290-6.02002-3.

Shannon, P. et al. (2003) ‘Cytoscape: A software Environment for integrated models of biomolecular interaction networks’, Genome Research, 13(11), pp. 2498–2504. doi: 10.1101/gr.1239303.

Sinisalu, L. et al. (2020) ‘Early-life exposure to perfluorinated alkyl substances modulates lipid metabolism in progression to celiac disease’, medRxiv. Cold Spring Harbor Laboratory Press, p. 2020.04.02.20051359. doi: 10.1101/2020.04.02.20051359.

Templeton, J. et al. (2019) Earth Negotiations Bulletin A Reporting Service for Environment and Development Negotiations COPs FINAL. Available at: http://enb.iisd.org/chemical/cops/2019/ (Accessed: 28 May 2020).

Tomy, G. T. et al. (2004) ‘Fluorinated Organic Compounds in an Eastern Arctic Marine Food Web’, Environmental Science & Technology, 38(24), pp. 6475–6481. doi: 10.1021/es049620g.

La Torre, D. et al. (2013) ‘Decreased cord-blood phospholipids in young age-at-onset type 1 diabetes’, Diabetes. American Diabetes Association, 62(11), pp. 3951–3956. doi: 10.2337/db13-0215.

Ubel, F. A., Sorenson, S. D. and Roach, D. E. (1980) ‘Health status of plant workers exposed to fluorochemicals-a preliminary report’, American Industrial Hygiene Association Journal, 41(8), pp. 584–589. doi: 10.1080/15298668091425310.

Vance, J. E. and Vance, D. E. (2008) Biochemistry Of Lipids, Lipoproteins And Membranes, Biochemistry of Lipids, Lipoproteins and Membranes. Elsevier. doi: 10.1016/B978-0-444- 53219-0.X5001-6.

Wang, J. (2020) ‘Longitudinal Trends in Prenatal Exposure (1998-2018) to Emerging and Legacy Per- and Polyfluoroalkyl Substances (PFASs) in Beijing, China’.

Wewalka, M. et al. (2014) ‘Fasting serum taurine-conjugated bile acids are elevated in type 2 diabetes and do not change with intensification of Insulin’, Journal of Clinical Endocrinology and Metabolism. Endocrine Society, 99(4), pp. 1442–1451. doi: 10.1210/jc.2013-3367.

Xia, Y. et al. (2019) ‘Incidence and trend of type 1 diabetes and the underlying environmental determinants’, Diabetes/Metabolism Research and Reviews. John Wiley and Sons Ltd, 35(1), p. e3075. doi: 10.1002/dmrr.3075.

Yamaguchi, K. et al. (2007) ‘Inhibiting triglyceride synthesis improves hepatic steatosis but exacerbates liver damage and fibrosis in obese mice with nonalcoholic steatohepatitis’,

43

Hepatology. John Wiley & Sons, Ltd, 45(6), pp. 1366–1374. doi: 10.1002/hep.21655.

Yang, K. and Han, X. (2016) ‘Lipidomics: Techniques, Applications, and Outcomes Related to Biomedical Sciences’, Trends in Biochemical Sciences. Elsevier Ltd, pp. 954–969. doi: 10.1016/j.tibs.2016.08.010.

Zhang, L. et al. (2015) ‘Persistent organic pollutants modify gut microbiota–host metabolic homeostasis in mice through aryl hydrocarbon receptor activation’, Environmental Health Perspectives. Public Health Services, US Dept of Health and Human Services, 123(7), pp. 679–688. doi: 10.1289/ehp.1409055.

44

Appendix

Figure 20 – Normalisation results after log transformation and auto-scaling on Chinese blood samples in measured variables.

45

Figure 21 – Normalisation results after log transformation and auto-scaling on Chinese blood samples. Presented 50 samples.

46

Figure 22 – Normalisation results after log transformation and auto-scaling done on FinnBrain blood samples in measured variables.

47

Figure 23 - Normalisation results after log transformation and auto-scaling on FinnBrain blood samples. Presented 50 samples.

48

Table 8 – Correlation heat map between all measured and identified compounds in Chinese blood samples. Identified compounds are lipids, PFASs and BAs.

7-oxo-DCAHCA TaMCA GHCA GHDCA THDCA GCA CA TCDCA GCDCA 12-epiCA GLCA C161 C226 C182 C205 ButylparabBenzylpar Linolenic aoleic acid palmitic acstearic acidArachidic aPFOA PFNA PFDA PFUnDA PFTriDA PFHxS PFHpS PFOS 62 Cl-PFES82 Cl-PFESCer LPC LysoPE181PC PC_ether PES PI P SM TG_mufa TG_pufa TG_sfa 7-oxo-DCA 1,000 0,528 0,290 0,368 0,095 0,012 0,275 0,355 0,005 -0,003 -0,040 0,339 0,208 0,336 0,334 0,017 -0,235 0,050 0,321 0,041 -0,237 -0,238 -0,181 0,182 0,175 0,189 0,229 0,244 0,148 0,139 0,196 0,203 0,223 -0,143 0,146 0,145 0,010 -0,047 -0,183 -0,014 -0,058 0,209 0,065 0,003 0,045 HCA 0,528 1,000 0,241 0,300 0,088 0,043 0,241 0,358 -0,097 0,023 0,016 0,165 0,102 0,190 0,174 0,019 -0,071 0,131 0,150 -0,010 -0,109 -0,139 -0,070 0,140 0,160 0,159 0,142 0,143 0,093 0,076 0,138 0,154 0,163 -0,125 0,066 0,114 0,007 -0,040 -0,076 0,047 0,005 0,133 -0,024 -0,008 0,023 TaMCA 0,290 0,241 1,000 0,829 0,409 0,458 0,482 0,433 0,373 0,271 -0,039 0,375 0,288 0,480 0,376 -0,259 -0,370 0,018 0,388 -0,086 -0,226 -0,450 -0,257 0,377 0,368 0,291 0,294 0,200 0,329 0,252 0,341 0,306 0,301 -0,193 0,093 0,127 0,000 -0,069 -0,284 -0,129 -0,190 0,211 0,233 0,207 0,230 GHCA 0,368 0,300 0,829 1,000 0,467 0,425 0,639 0,573 0,415 0,459 -0,072 0,449 0,384 0,568 0,470 -0,389 -0,466 0,010 0,506 0,049 -0,213 -0,471 -0,276 0,465 0,449 0,391 0,395 0,289 0,413 0,319 0,444 0,364 0,344 -0,182 0,053 0,118 -0,021 -0,073 -0,384 -0,213 -0,272 0,259 0,346 0,321 0,329 GHDCA 0,095 0,088 0,409 0,467 1,000 0,475 0,332 0,276 0,423 0,444 -0,094 0,383 0,223 0,355 0,305 -0,158 -0,190 0,074 0,294 0,056 -0,109 -0,292 -0,049 0,259 0,246 0,178 0,177 0,127 0,216 0,158 0,247 0,182 0,146 0,006 0,003 0,020 -0,109 -0,060 -0,255 -0,194 -0,201 0,141 0,230 0,208 0,224 THDCA 0,012 0,043 0,458 0,425 0,475 1,000 0,263 0,273 0,584 0,398 -0,055 0,276 0,266 0,371 0,295 -0,209 -0,221 0,031 0,272 -0,036 -0,183 -0,337 -0,200 0,232 0,249 0,149 0,134 0,108 0,239 0,203 0,245 0,173 0,091 -0,015 -0,065 -0,027 -0,202 -0,163 -0,297 -0,266 -0,285 0,178 0,217 0,177 0,227 GCA 0,275 0,241 0,482 0,639 0,332 0,263 1,000 0,486 0,270 0,381 -0,045 0,307 0,295 0,388 0,330 -0,367 -0,396 0,052 0,361 0,031 -0,126 -0,355 -0,229 0,394 0,397 0,395 0,362 0,214 0,364 0,274 0,389 0,338 0,229 -0,084 0,041 0,223 -0,010 -0,050 -0,211 -0,201 -0,201 0,218 0,290 0,287 0,292 CA 0,355 0,358 0,433 0,573 0,276 0,273 0,486 1,000 0,328 0,441 -0,061 0,495 0,408 0,606 0,532 -0,468 -0,542 0,011 0,598 0,121 -0,237 -0,513 -0,273 0,595 0,589 0,504 0,514 0,351 0,516 0,366 0,563 0,472 0,321 -0,075 -0,066 0,080 0,026 0,000 -0,293 -0,239 -0,285 0,350 0,519 0,496 0,527 TCDCA 0,005 -0,097 0,373 0,415 0,423 0,584 0,270 0,328 1,000 0,739 -0,071 0,283 0,255 0,371 0,297 -0,138 -0,213 0,030 0,285 0,111 -0,126 -0,284 -0,187 0,188 0,182 0,107 0,092 0,168 0,223 0,141 0,157 0,103 0,068 0,046 0,023 -0,021 -0,299 -0,173 -0,422 -0,409 -0,401 0,198 0,215 0,175 0,167 GCDCA -0,003 0,023 0,271 0,459 0,444 0,398 0,381 0,441 0,739 1,000 -0,103 0,364 0,240 0,376 0,268 -0,249 -0,271 0,005 0,286 0,068 -0,083 -0,306 -0,175 0,235 0,241 0,167 0,159 0,217 0,288 0,209 0,244 0,153 0,126 0,081 -0,042 0,033 -0,156 -0,085 -0,297 -0,325 -0,341 0,264 0,270 0,284 0,221 12-epiCA -0,040 0,016 -0,039 -0,072 -0,094 -0,055 -0,045 -0,061 -0,071 -0,103 1,000 -0,099 -0,041 -0,055 0,017 0,086 0,050 0,011 -0,026 0,081 0,098 0,102 0,035 -0,041 -0,038 0,021 0,014 -0,050 -0,041 -0,019 -0,045 -0,044 0,086 -0,019 -0,066 -0,095 -0,012 0,026 -0,059 0,032 0,030 -0,031 -0,034 -0,073 -0,018 GLCA 0,339 0,165 0,375 0,449 0,383 0,276 0,307 0,495 0,283 0,364 -0,099 1,000 0,394 0,627 0,477 -0,353 -0,540 -0,067 0,537 0,048 -0,291 -0,533 -0,203 0,533 0,531 0,409 0,435 0,340 0,526 0,397 0,503 0,430 0,243 -0,076 -0,097 0,147 0,091 0,037 -0,273 -0,198 -0,270 0,313 0,477 0,465 0,440 C161 0,208 0,102 0,288 0,384 0,223 0,266 0,295 0,408 0,255 0,240 -0,041 0,394 1,000 0,527 0,828 -0,085 -0,286 0,377 0,748 0,411 -0,340 -0,339 -0,268 0,349 0,370 0,326 0,343 0,176 0,352 0,265 0,439 0,324 0,154 -0,030 -0,198 -0,074 0,054 0,002 -0,237 -0,172 -0,231 0,237 0,448 0,359 0,412 C226 0,336 0,190 0,480 0,568 0,355 0,371 0,388 0,606 0,371 0,376 -0,055 0,627 0,527 1,000 0,763 -0,410 -0,727 -0,106 0,856 0,071 -0,488 -0,642 -0,230 0,630 0,690 0,565 0,588 0,437 0,610 0,465 0,638 0,527 0,376 -0,106 -0,093 0,101 0,133 0,140 -0,318 -0,230 -0,294 0,392 0,607 0,559 0,542 C182 0,334 0,174 0,376 0,470 0,305 0,295 0,330 0,532 0,297 0,268 0,017 0,477 0,828 0,763 1,000 -0,118 -0,465 0,226 0,920 0,283 -0,458 -0,487 -0,347 0,452 0,520 0,471 0,476 0,283 0,450 0,352 0,546 0,444 0,305 -0,135 -0,154 0,019 0,156 0,107 -0,276 -0,147 -0,198 0,257 0,536 0,436 0,469 C205 0,017 0,019 -0,259 -0,389 -0,158 -0,209 -0,367 -0,468 -0,138 -0,249 0,086 -0,353 -0,085 -0,410 -0,118 1,000 0,612 0,375 -0,307 0,068 0,134 0,552 0,230 -0,597 -0,615 -0,488 -0,507 -0,392 -0,598 -0,496 -0,590 -0,409 -0,313 -0,004 0,033 -0,243 -0,113 -0,069 0,062 0,182 0,215 -0,313 -0,522 -0,540 -0,501 Butylparab -0,235 -0,071 -0,370-0,466 -0,190 -0,221 -0,396 -0,542 -0,213 -0,271 0,050 -0,540 -0,286 -0,727 -0,465 0,612 1,000 0,270 -0,622 0,030 0,442 0,616 0,238 -0,619 -0,671 -0,537 -0,538 -0,426 -0,604 -0,448 -0,596 -0,465 -0,372 -0,014 0,041 -0,241 -0,274 -0,232 0,071 0,103 0,121 -0,324 -0,581 -0,580 -0,526 Benzylpara 0,050 0,131 0,018 0,010 0,074 0,031 0,052 0,011 0,030 0,005 0,011 -0,067 0,377 -0,106 0,226 0,375 0,270 1,000 0,043 0,152 -0,038 0,162 -0,027 -0,193 -0,184 -0,081 -0,083 -0,147 -0,225 -0,173 -0,125 -0,039 -0,115 -0,050 -0,025 -0,080 -0,016 -0,115 -0,019 -0,033 0,021 -0,135 -0,098 -0,102 -0,017 Linolenic a 0,321 0,150 0,388 0,506 0,294 0,272 0,361 0,598 0,285 0,286 -0,026 0,537 0,748 0,856 0,920 -0,307 -0,622 0,043 1,000 0,308 -0,442 -0,553 -0,305 0,586 0,638 0,556 0,559 0,360 0,564 0,430 0,629 0,494 0,333 -0,058 -0,168 0,052 0,153 0,103 -0,306 -0,219 -0,285 0,333 0,645 0,548 0,579 oleic acid 0,041 -0,010 -0,086 0,049 0,056 -0,036 0,031 0,121 0,111 0,068 0,081 0,048 0,411 0,071 0,283 0,068 0,030 0,152 0,308 1,000 0,214 0,305 0,175 0,111 0,079 0,072 0,084 0,040 0,031 0,053 0,097 0,015 -0,059 -0,035 -0,302 -0,288 -0,176 -0,142 -0,190 -0,282 -0,290 0,001 0,127 0,110 0,078 palmitic ac -0,237 -0,109 -0,226-0,213 -0,109 -0,183 -0,126 -0,237 -0,126 -0,083 0,098 -0,291 -0,340 -0,488 -0,458 0,134 0,442 -0,038 -0,442 0,214 1,000 0,497 0,354 -0,154 -0,278 -0,218 -0,237 -0,356 -0,236 -0,152 -0,262 -0,234 -0,198 0,029 -0,057 -0,131 -0,234 -0,174 -0,006 -0,122 -0,111 -0,064 -0,241 -0,191 -0,226 stearic acid -0,238 -0,139 -0,450-0,471 -0,292 -0,337 -0,355 -0,513 -0,284 -0,306 0,102 -0,533 -0,339 -0,642 -0,487 0,552 0,616 0,162 -0,553 0,305 0,497 1,000 0,445 -0,537 -0,577 -0,441 -0,463 -0,363 -0,587 -0,433 -0,599 -0,462 -0,326 0,000 -0,042 -0,278 -0,168 -0,107 0,168 0,089 0,135 -0,307 -0,534 -0,469 -0,507 Arachidic a -0,181 -0,070 -0,257-0,276 -0,049 -0,200 -0,229 -0,273 -0,187 -0,175 0,035 -0,203 -0,268 -0,230 -0,347 0,230 0,238 -0,027 -0,305 0,175 0,354 0,445 1,000 -0,267 -0,291 -0,244 -0,239 -0,129 -0,376 -0,355 -0,339 -0,298 -0,130 0,020 -0,061 -0,109 0,012 0,028 0,155 0,093 0,127 -0,065 -0,310 -0,219 -0,300 PFOA 0,182 0,140 0,377 0,465 0,259 0,232 0,394 0,595 0,188 0,235 -0,041 0,533 0,349 0,630 0,452 -0,597 -0,619 -0,193 0,586 0,111 -0,154 -0,537 -0,267 1,000 0,829 0,683 0,675 0,375 0,736 0,612 0,761 0,666 0,364 -0,068 -0,122 0,123 0,142 0,047 -0,162 -0,167 -0,225 0,324 0,605 0,573 0,592 PFNA 0,175 0,160 0,368 0,449 0,246 0,249 0,397 0,589 0,182 0,241 -0,038 0,531 0,370 0,690 0,520 -0,615 -0,671 -0,184 0,638 0,079 -0,278 -0,577 -0,291 0,829 1,000 0,863 0,868 0,544 0,776 0,671 0,855 0,813 0,498 -0,103 -0,111 0,192 0,157 0,071 -0,177 -0,178 -0,228 0,305 0,584 0,559 0,536 PFDA 0,189 0,159 0,291 0,391 0,178 0,149 0,395 0,504 0,107 0,167 0,021 0,409 0,326 0,565 0,471 -0,488 -0,537 -0,081 0,556 0,072 -0,218 -0,441 -0,244 0,683 0,863 1,000 0,929 0,616 0,649 0,577 0,767 0,817 0,588 -0,151 -0,053 0,203 0,137 0,092 -0,099 -0,131 -0,161 0,245 0,442 0,436 0,404 PFUnDA 0,229 0,142 0,294 0,395 0,177 0,134 0,362 0,514 0,092 0,159 0,014 0,435 0,343 0,588 0,476 -0,507 -0,538 -0,083 0,559 0,084 -0,237 -0,463 -0,239 0,675 0,868 0,929 1,000 0,627 0,661 0,594 0,761 0,802 0,574 -0,169 -0,066 0,173 0,127 0,091 -0,153 -0,145 -0,194 0,283 0,466 0,448 0,414 PFTriDA 0,244 0,143 0,200 0,289 0,127 0,108 0,214 0,351 0,168 0,217 -0,050 0,340 0,176 0,437 0,283 -0,392 -0,426 -0,147 0,360 0,040 -0,356 -0,363 -0,129 0,375 0,544 0,616 0,627 1,000 0,425 0,428 0,505 0,476 0,557 -0,171 -0,020 0,141 0,020 0,059 -0,153 -0,192 -0,196 0,259 0,207 0,195 0,169 PFHxS 0,148 0,093 0,329 0,413 0,216 0,239 0,364 0,516 0,223 0,288 -0,041 0,526 0,352 0,610 0,450 -0,598 -0,604 -0,225 0,564 0,031 -0,236 -0,587 -0,376 0,736 0,776 0,649 0,661 0,425 1,000 0,699 0,770 0,605 0,362 -0,041 -0,095 0,160 0,163 0,120 -0,196 -0,190 -0,243 0,297 0,588 0,537 0,532 PFHpS 0,139 0,076 0,252 0,319 0,158 0,203 0,274 0,366 0,141 0,209 -0,019 0,397 0,265 0,465 0,352 -0,496 -0,448 -0,173 0,430 0,053 -0,152 -0,433 -0,355 0,612 0,671 0,577 0,594 0,428 0,699 1,000 0,742 0,562 0,406 -0,086 -0,087 0,064 0,124 0,069 -0,180 -0,147 -0,175 0,259 0,461 0,404 0,426 PFOS 0,196 0,138 0,341 0,444 0,247 0,245 0,389 0,563 0,157 0,244 -0,045 0,503 0,439 0,638 0,546 -0,590 -0,596 -0,125 0,629 0,097 -0,262 -0,599 -0,339 0,761 0,855 0,767 0,761 0,505 0,770 0,742 1,000 0,762 0,488 -0,090 -0,131 0,132 0,185 0,090 -0,155 -0,142 -0,185 0,269 0,582 0,555 0,546 62 Cl-PFES 0,203 0,154 0,306 0,364 0,182 0,173 0,338 0,472 0,103 0,153 -0,044 0,430 0,324 0,527 0,444 -0,409 -0,465 -0,039 0,494 0,015 -0,234 -0,462 -0,298 0,666 0,813 0,817 0,802 0,476 0,605 0,562 0,762 1,000 0,569 -0,210 -0,076 0,189 0,127 0,031 -0,106 -0,124 -0,119 0,237 0,372 0,365 0,347 82 Cl-PFES 0,223 0,163 0,301 0,344 0,146 0,091 0,229 0,321 0,068 0,126 0,086 0,243 0,154 0,376 0,305 -0,313 -0,372 -0,115 0,333 -0,059 -0,198 -0,326 -0,130 0,364 0,498 0,588 0,574 0,557 0,362 0,406 0,488 0,569 1,000 -0,270 0,056 0,184 0,102 0,044 -0,113 -0,070 -0,060 0,134 0,216 0,198 0,176 Cer -0,143 -0,125 -0,193 -0,182 0,006 -0,015 -0,084 -0,075 0,046 0,081 -0,019 -0,076 -0,030 -0,106 -0,135 -0,004 -0,014 -0,050 -0,058 -0,035 0,029 0,000 0,020 -0,068 -0,103 -0,151 -0,169 -0,171 -0,041 -0,086 -0,090 -0,210 -0,270 1,000 0,128 -0,034 -0,151 -0,037 0,123 -0,071 -0,025 0,264 0,001 0,041 0,072 LPC 0,146 0,066 0,093 0,053 0,003 -0,065 0,041 -0,066 0,023 -0,042 -0,066 -0,097 -0,198 -0,093 -0,154 0,033 0,041 -0,025 -0,168 -0,302 -0,057 -0,042 -0,061 -0,122 -0,111 -0,053 -0,066 -0,020 -0,095 -0,087 -0,131 -0,076 0,056 0,128 1,000 0,602 -0,166 -0,084 0,061 0,017 0,080 0,072 -0,360 -0,303 -0,350 LysoPE18 0,145 0,114 0,127 0,118 0,020 -0,027 0,223 0,080 -0,021 0,033 -0,095 0,147 -0,074 0,101 0,019 -0,243 -0,241 -0,080 0,052 -0,288 -0,131 -0,278 -0,109 0,123 0,192 0,203 0,173 0,141 0,160 0,064 0,132 0,189 0,184 -0,034 0,602 1,000 0,070 -0,033 0,216 0,024 0,083 0,137 -0,092 -0,069 -0,088 PC 0,010 0,007 0,000 -0,021 -0,109 -0,202 -0,010 0,026 -0,299 -0,156 -0,012 0,091 0,054 0,133 0,156 -0,113 -0,274 -0,016 0,153 -0,176 -0,234 -0,168 0,012 0,142 0,157 0,137 0,127 0,020 0,163 0,124 0,185 0,127 0,102 -0,151 -0,166 0,070 1,000 0,735 0,568 0,720 0,700 -0,029 0,268 0,296 0,213 PC_ether -0,047 -0,040 -0,069 -0,073 -0,060 -0,163 -0,050 0,000 -0,173 -0,085 0,026 0,037 0,002 0,140 0,107 -0,069 -0,232 -0,115 0,103 -0,142 -0,174 -0,107 0,028 0,047 0,071 0,092 0,091 0,059 0,120 0,069 0,090 0,031 0,044 -0,037 -0,084 -0,033 0,735 1,000 0,502 0,546 0,518 0,025 0,169 0,234 0,067 PE -0,183 -0,076 -0,284 -0,384 -0,255 -0,297 -0,211 -0,293 -0,422 -0,297 -0,059 -0,273 -0,237 -0,318 -0,276 0,062 0,071 -0,019 -0,306 -0,190 -0,006 0,168 0,155 -0,162 -0,177 -0,099 -0,153 -0,153 -0,196 -0,180 -0,155 -0,106 -0,113 0,123 0,061 0,216 0,568 0,502 1,000 0,667 0,723 -0,218 -0,214 -0,122 -0,215 PI -0,014 0,047 -0,129 -0,213 -0,194 -0,266 -0,201 -0,239 -0,409 -0,325 0,032 -0,198 -0,172 -0,230 -0,147 0,182 0,103 -0,033 -0,219 -0,282 -0,122 0,089 0,093 -0,167 -0,178 -0,131 -0,145 -0,192 -0,190 -0,147 -0,142 -0,124 -0,070 -0,071 0,017 0,024 0,720 0,546 0,667 1,000 0,883 -0,127 -0,157 -0,126 -0,171 PS -0,058 0,005 -0,190 -0,272 -0,201 -0,285 -0,201 -0,285 -0,401 -0,341 0,030 -0,270 -0,231 -0,294 -0,198 0,215 0,121 0,021 -0,285 -0,290 -0,111 0,135 0,127 -0,225 -0,228 -0,161 -0,194 -0,196 -0,243 -0,175 -0,185 -0,119 -0,060 -0,025 0,080 0,083 0,700 0,518 0,723 0,883 1,000 -0,244 -0,238 -0,207 -0,242 SM 0,209 0,133 0,211 0,259 0,141 0,178 0,218 0,350 0,198 0,264 -0,031 0,313 0,237 0,392 0,257 -0,313 -0,324 -0,135 0,333 0,001 -0,064 -0,307 -0,065 0,324 0,305 0,245 0,283 0,259 0,297 0,259 0,269 0,237 0,134 0,264 0,072 0,137 -0,029 0,025 -0,218 -0,127 -0,244 1,000 0,195 0,227 0,209 TG_mufa 0,065 -0,024 0,233 0,346 0,230 0,217 0,290 0,519 0,215 0,270 -0,034 0,477 0,448 0,607 0,536 -0,522 -0,581 -0,098 0,645 0,127 -0,241 -0,534 -0,310 0,605 0,584 0,442 0,466 0,207 0,588 0,461 0,582 0,372 0,216 0,001 -0,360 -0,092 0,268 0,169 -0,214 -0,157 -0,238 0,195 1,000 0,916 0,924 TG_pufa 0,003 -0,008 0,207 0,321 0,208 0,177 0,287 0,496 0,175 0,284 -0,073 0,465 0,359 0,559 0,436 -0,540 -0,580 -0,102 0,548 0,110 -0,191 -0,469 -0,219 0,573 0,559 0,436 0,448 0,195 0,537 0,404 0,555 0,365 0,198 0,041 -0,303 -0,069 0,296 0,234 -0,122 -0,126 -0,207 0,227 0,916 1,000 0,855 TG_sfa 0,045 0,023 0,230 0,329 0,224 0,227 0,292 0,527 0,167 0,221 -0,018 0,440 0,412 0,542 0,469 -0,501 -0,526 -0,017 0,579 0,078 -0,226 -0,507 -0,300 0,592 0,536 0,404 0,414 0,169 0,532 0,426 0,546 0,347 0,176 0,072 -0,350 -0,088 0,213 0,067 -0,215 -0,171 -0,242 0,209 0,924 0,855 1,000

Table 9 – Correlation heat map between all measured and identified compounds in FinnBrain blood samples. Identified compounds are lipids, PFASs and BAs.

48

Table 10 - Lipid median concentrations with minimum and maximum values in blood samples from Wuhan and Beijing, China.

Table 11 – BA median concentrations with minimum and maximum values in blood samples from Wuhan and Beijing, China.

Table 12 – PFAS median concentrations with minimum and maximum values in blood samples from Wuhan and Beijing, China.

Table 13 – Non-target analysis results as median values in blood samples from Wuhan and Beijing, China, with minimum and maximum values.

Table 14 – Lipid median concentrations with minimum and maximum values for FinnBrain blood samples.

Table 15 – BA median concentrations with minimum and maximum values for all FinnBrain blood samples.

49

Table 16 – PFAS median concentrations for FinnBrain blood samples with minimum and maximum values.

Table 17 – Non-target analysis results as median values for all Finnbrain blood samples with minimum and maximum values.

50