Universidade de Lisboa -...

36
Universidade de Lisboa Faculdade de Ciências Departamento de Biologia Vegetal A new variant in the interleukin-6 receptor (IL-6R) gene increases gene expression and asthma risk Queensland Institute of Medical Research Joana Nunes Mexia Allen Revez Dissertação Mestrado em Biologia Molecular e Genética 2013

Transcript of Universidade de Lisboa -...

Universidade de Lisboa Faculdade de Ciências

Departamento de Biologia Vegetal

 

 

A new variant in the interleukin-6 receptor (IL-6R) gene increases gene expression and asthma risk

Queensland Institute of Medical Research

Joana Nunes Mexia Allen Revez

Dissertação

Mestrado em Biologia Molecular e Genética

2013  

Universidade de Lisboa Faculdade de Ciências

Departamento de Biologia Vegetal

 

 

A new variant in the interleukin-6 receptor (IL-6R) gene increases gene expression and asthma risk

Queensland Institute of Medical Research

Joana Nunes Mexia Allen Revez

Dissertação orientada pelo Prof. Doutor Manuel C. Gomes e pelo Doutor Manuel Ferreira

Mestrado em Biologia Molecular e Genética

2013

 

ii

Índice

RESUMO EM LÍNGUA PORTUGUESA DA DISSERTAÇÃO III

DISSERTAÇÃO 1 NOTA 2 RESUMO 3 ABSTRACT 4 INTRODUCTION 5 METHODS 6

Study participants 6 sIL-6R serum protein measurements 7 IL6R mRNA measurements 8 IL6R SNP genotyping 9 Statistical analyses 10

RESULTS 12 Common variants affecting sIL-6R serum levels independently of rs4129267 12 Replication of the new association between rs12083537 and sIL-6R serum levels 13 Association between rs12083537 with IL6R mRNA levels 14 Association between rs12083537 and asthma risk 14

DISCUSSION 16 ACKNOWLEDGEMENTS 18 REFERENCES 19

ANEXOS S1  

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Resumo da dissertação iii  

Resumo em língua Portuguesa da Dissertação intitulada: “A new variant in the

interleukin-6 receptor (IL-6R) gene increases gene expression and asthma risk.”

A interleucina-6 (IL-6) é uma citocina envolvida na asma alérgica, que atua nas suas células-

alvo através de um recetor constituído por pelo menos duas subunidades membranares: uma

glicoproteína de associação ao ligando (IL-6R) e uma glicoproteína transdutora de sinal

(gp130). Embora a gp130 se expresse na grande maioria das células, apenas algumas são

diretamente estimuladas pela IL-6, porque a expressão da IL-6R membranar (mIL-6R) é

limitada a hepatócitos e determinadas células imunes. No entanto, a IL-6R também existe sob

forma solúvel (sIL-6R), que tanto pode ser formada por clivagem da mIL-6R, como por

splicing alternativo. Uma vez ligada à IL-6, a sIL-6R pode associar-se à gp130, estimulando

as células por uma via alternativa, também viável em células que não expressam mIL-6R.

A variante rs2228145, no gene IL6R, encontra-se na região que codifica para o local de corte

da mIL-6R e é considerada o principal determinante genético dos níveis de sIL-6R,

representando cerca de 19% da sua variabilidade. Portadores do alelo de risco têm elevados

níveis de sIL-6R e maior risco de asma. No entanto, a percentagem de variação nos níveis de

sIL-6R explicada geneticamente é estimada em 70%, muito superior aos 19%

supramencionados, pelo que é possível que outras variantes no IL6R, ou próximo deste,

também contribuam para a sua variabilidade independentemente da rs2228145, e

possivelmente afetem o risco de asma.

Para testar esta hipótese, neste estudo (1) efetuámos uma imputação exaustiva de variantes

comuns na região do IL6R; (2) testámos a associação entre as variantes obtidas e os níveis de

sIL-6R de 360 indivíduos; (3) replicámos o estudo numa amostra independente de 354

indivíduos para as variantes que apresentaram associação mais forte; e (4) testámos a

associação das variantes replicadas (a) com os níveis de transcrito do IL6R (em dois estudos

independentes; N = 851 e N = 5 191) e (b) com o risco de asma (N = 47 514). Segue-se uma

descrição mais pormenorizada de todos estes pontos.

Numa primeira fase, medimos em duplicado as concentrações serológicas de sIL-6R de 360

asmáticos já genotipados. Após eliminar variabilidades técnicas, calculámos a média dos

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Resumo da dissertação iv  

níveis corrigidos de sIL-6R para cada indivíduo e normalizámos a distribuição global das

medições. Não se verificaram efeitos significativos do sexo ou da idade.

Tendo o 1000 Genomes Project como referência, imputámos 452 variantes no IL6R (± 50 kb)

dos 360 asmáticos, e testámos a sua associação com os níveis de sIL-6R, bem como a de

outras 19 variantes diretamente genotipadas nestes indivíduos.

A variante que apresentou maior grau de associação (P = 0.0005) com os níveis de sIL-6R

ajustados pelo rs4129267, uma variante correlacionada (r2 = 0.99) com a rs2228145, foi a

rs12083537, e a associação manteve-se significativa (P = 0.0496) após correção pelo número

de variantes testadas. Três variantes independentes demonstraram estar significativamente

associadas com os níveis de sIL-6R ajustados pelas variantes rs4129267 e rs12083537, mas

qualquer destas associações deixou de ser significativa depois de corrigir pelo número de

variantes testadas.

Para confirmar a associação entre a variante rs12083537 e os níveis de sIL-6R, replicámos o

estudo de associação em 354 indivíduos não relacionados com os anteriores. Após

ajustamento pelos efeitos do rs4129267, detetámos uma associação significativa entre o

rs12083537 e os níveis de sIL-6R (P = 0.033), sendo a direção do efeito a mesma que a da

análise prévia. Estes resultados confirmam portanto que rs12083537, ou uma variante

correlacionada com esta, regula os níveis serológicos de sIL-6R.

Depois de combinar as duas amostras onde se efetuaram os estudos de associação (N = 714),

o rs12083537 passou a explicar 2.2% da variabilidade dos níveis de sIL-6R ajustados pelo

rs4129267 (P = 6x10-5), sendo que antes do ajuste explicava 0.15%. O alelo rs12083537:G

reduziu os níveis de sIL-6R de forma semelhante nas três classes genotípicas do rs4129267.

Para ajudar a compreender o mecanismo molecular pelo qual o rs12083537 afeta os níveis de

sIL-6R, estudámos em seguida os níveis de RNA mensageiro (mRNA) do IL6R num grupo de

gémeos adolescentes (N = 851) para determinar se esta variante influenciava os níveis da

transcrição. Não foi detetada associação significativa entre o rs12083537 e os níveis de

mRNA, quer com uma sonda para a região 3’UTR, quer com uma sonda para o exão 9. Por

outro lado, detetámos uma associação significativa entre a variante rs4129267 e os níveis de

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Resumo da dissertação v  

transcrito do IL6R. Contraintuitivamente, o alelo rs4129267:T, que aumenta os níveis de sIL-

6R, demonstrou associação com baixos níveis de transcrito do IL6R.

Para garantir que a ausência de associação entre o rs12083537 e os níveis de mRNA do IL6R

não se deveu a baixo poder de deteção da nossa amostra, realizámos uma análise semelhante à

anterior num estudo independente e maior (N = 5 191). Neste estudo, após correção pelo

número de sondas testadas, não foi detetada uma associação significativa entre a variante e os

níveis de mRNA do IL6R. A sonda para a qual a associação foi mais forte é complementar à

região 3’UTR (P = 0.0031); o alelo rs12083537:A que aumenta os níveis de sIL-6R esteve

associado com uma redução dos níveis de transcrito do IL6R.

Por fim, como o alelo rs2228145:C aumenta não só os níveis de sIL-6R mas também o risco

de asma, testámos a hipótese de o rs12083537 também ser um fator genético de risco para a

asma. A associação entre esta variante e asma foi testada em 47 514 indivíduos de três

estudos independentes, dos quais 16 705 eram asmáticos diagnosticados por um médico. O

alelo rs12083537:A foi detetado com mais frequência em asmáticos do que nao-asmáticos nos

três estudos individuais, sendo a associação global entre o rs12083537 e o risco de asma fraca

mas estatisticamente significativa (OR = 1.039, 95% I.C. = 1.002 – 1.078, P = 0.0419).

A variante rs12083537 localizada no intrão 1 do IL6R, a 2.9 kb do exão 1, revelou estar

significativamente associada com os níveis de sIL-6R independentemente da variante

rs4129267, e a associação foi confirmada quando replicámos a análise. A variante localiza-se

numa região do gene que, em pelo menos quatro linhas celulares, incluindo fibroblastos

pulmonares e queratinócitos, tem elevado grau de metilação (H3K4Me1). Este tipo de

modificação de histonas está associada a enhancers e promotores, sendo portanto plausível

que o rs12083537 influencie a transcrição do IL6R. No entanto, não foi detetada associação

significativa entre o rs12083537 e os níveis de mRNA do IL6R, pelo que o mecanismo de

ação da variante permanece por elucidar.

Também verificámos que o alelo rs4129267:T, que aumenta os níveis de sIL-6R, está

associado a baixos níveis de mRNA do IL6R. As medições para as quais se verificou esta

associação foram feitas com uma sonda que se liga ao exão 9, o qual pode ser removido por

splicing alternativo. Assim, as medições em causa correspondem apenas aos níveis de mRNA

que não sofreu splicing. Tendo em conta que o alelo rs4129267:T está associado a um

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Resumo da dissertação vi  

aumento do splicing do exão 9, é portanto de esperar que os níveis da isoforma que não sofreu

splicing diminuam na presença deste alelo.

Por fim, verificámos que a variante rs12083537 está significativamente associada com o risco

de asma. Com base na nossa análise, cada alelo rs12083537:A aumenta os níveis serológicos

de sIL-6R cerca de 2.4 vezes e o risco de asma1.04 vezes.

Em conclusão, este estudo demonstra que pelo menos duas variantes genéticas influenciam os

níveis serológicos de sIL-6R. Como tal, estudos de resposta clínica ao tocilizumab, um

anticorpo monoclonal para o IL-6R já aprovado para o tratamento da artrite reumatoide,

devem avaliar a contribuição das diversas variantes que regulam os níveis de sIL-6R.

Pontuações de risco genético cumulativo poderão ser úteis para explicar variabilidade na

respostas clínica ao tocilizumab.

 

A new variant in the interleukin-6 receptor (IL-6R) gene increases gene expression and asthma risk.

 

Joana Allen Revez

Supervised by: Manuel Ferreira PhD Manuel Carmo Gomes PhD  

Queensland Institute of Medical Research 2013

Molecular Biology and Genetics

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 2

Nota

O conteúdo desta dissertação foi parcialmente submetido para publicação à revista Genes and Immunity em maio de 2013, estando identificado como A new regulatory variant in the interleukin-6 receptor gene associates with asthma risk, sob a autoria de Joana A. Revez, Lisa Bain, Brett Chapman, Joseph E. Powell, Rick Jansen, David L. Duffy, Joyce Y. Tung, AAGC collaborators, Brenda W. Penninx, Peter M. Visscher, Eco J.C. de Geus, Dorret I. Boomsma, David A. Hinds, Nicholas G. Martin, Grant W. Montgomery, Manuel A.R. Ferreira. O manuscrito encontra-se presentemente sob revisão.

Neste trabalho, eu, Joana Nunes Mexia Allen Revez, estive envolvida na análise de associação entre as 714 variantes genéticas e os níveis de sIL-6R da amostra de 360 asmáticos. Participei em todo o processo de replicação desse mesmo estudo, desde a medição dos níveis de sIL-6R, à análise estatística dos resultados. Também participei na análise dos resultados de associação entre a variante rs12083537 e (a) os níveis de transcrito do IL6R e (b) o risco de asma.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 3

Resumo

O principal determinante genético dos níveis de IL-6R solúvel é a variante rs2228145, que se

encontra no local de corte da proteína IL-6R. Por cada alelo Ala, os níveis de sIL-6R

aumentam por ~20 ng/ml e o risco de asma aumenta 1.09 vezes. No entanto, esta variante não

explica a totalidade da componente hereditária dos níveis de sIL-6R. É portanto possível que

outras variantes no gene IL6R também contribuam para variações nos níveis de sIL-6R e

influenciem o risco de asma. Para testar esta hipótese, imputámos 471 variantes comuns no

IL6R e testámos a sua associação com os níveis serológicos de sIL-6R em 360 indivíduos.

Uma variante intrónica (rs12083537) apresentou associação com os níveis de sIL-6R

independentemente da variante rs4129267 (P = 0.0005). Observámos uma associação

significativa e consistente ao replicar o estudo em 354 indivíduos (P = 0.033). Cada cópia do

alelo rs12083537:A aumentou os níveis de sIL-6R em 2.4 ng/ml. Na análise dos níveis de

mRNA em dois estudos independentes não foram identificadas associações significativas

entre a variante rs12083537 e os níveis de transcrição do IL6R. Por outro lado, o alelo

rs12083537:A aumentou o risco de asma 1.04 vezes (P = 0.0419) na análise de 16 705

asmáticos e 30 908 controlos. Pontuações de risco genético poderão portanto ser úteis para

explicar variabilidade na resposta clínica ao tocilizumab, um anticorpo monoclonal contra o

IL-6R.

Palavras-chave: alergia, eQTL, expressão, doença

 

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 4

Abstract

The main genetic determinant of soluble IL-6R levels is the missense variant rs2228145,

which maps to the cleavage site of IL-6R. For each Ala allele, sIL-6R serum levels increase

by ~20 ng/ml and asthma risk by 1.09-fold. However, this variant does not explain the total

heritability for sIL-6R levels. Additional independent variants in IL6R may therefore

contribute to variation in sIL-6R levels and influence asthma risk. To test this hypothesis, we

imputed 471 common variants in IL6R, which were then tested for association with sIL-6R

serum levels in 360 individuals. An intronic variant (rs12083537) was associated with sIL-6R

levels independently of rs4129267 (P = 0.0005). A significant and consistent association for

rs12083537 was observed in a replication panel of 354 individuals (P = 0.033). Each

rs12083537:A allele increased sIL-6R serum levels by 2.4 ng/ml. On the one hand, analysis of

mRNA levels in two cohorts did not identify significant associations between rs12083537 and

IL6R transcription levels. On the other hand, results from 16 705 asthmatics and 30 809

controls showed that the rs12083537:A allele increased asthma risk by 1.04-fold (P = 0.0419).

Genetic risk scores based on IL6R regulatory variants may prove useful in explaining

variation in clinical response to tocilizumab, an anti-IL-6R monoclonal antibody.

Keywords: allergy, eQTL, expression, disease

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 5

Introduction

Asthma is a chronic lung disease characterized by airway obstruction, inflammation and

hyperresponsiveness.1 It is caused by a combination of both genetic and environmental factors

but their interaction is complex and still not fully understood.

IL-6 is a cytokine known to be involved in allergic asthma2-5 and it binds to its target cells

through a complex of at least two distinct membrane-bound proteins, an 80 kD ligand binding

glycoprotein (IL-6R) and a 130 kD signal transducing glycoprotein (gp130).6, 7 Although

gp130 is expressed on most cells, expression of the membrane-bound form of IL-6R (mIL-

6R) is limited to hepatocytes and some immune cells, such as neutrophils, monocytes, CD4+

T cells and B cells;8, 9 as such, only these cells are directly stimulated by IL-6. However, IL-

6R also exists in a soluble form (sIL-6R) that is either produced by proteolytic cleavage of

mIL-6R or by alternative splicing.10-13 sIL-6R can associate with free IL-6 and this complex is

then recognized by the membrane-bound gp130, hence providing an alternative IL-6

signalling pathway available to cells that do not express mIL-6R, a process known as trans-

signalling.7

The main genetic determinant of sIL-6R levels is thought to be the non-synonymous variant

rs2228145 in exon 9,14 which maps to amino acid position 358 on the main cleavage site of

IL-6R.15 Specifically, individuals that carry the Ala358 allele have increased sIL-6R levels14

– which in turn are associated with increased levels of allergic inflammation in the lung16 –

and also increased risk of asthma.17 However, given that this variant explains ~19% of the

variation in sIL-6R levels18 but the total heritability of this trait has been estimated at ~70%,19

we hypothesised that additional variants in or near the IL6R gene contribute to sIL-6R

variation independently of rs2228145 and may also influence asthma risk.

To test this hypothesis, in this study we (1) used data from the 1000 Genomes Project20 to

comprehensively impute common variants in or near the IL6R gene; (2) tested these variants

for association with sIL-6R levels measured in 360 individuals; (3) followed-up the most

associated variants in an independent sample of 354 individuals; and (4) tested the replicated

variants for association with IL6R transcript levels in two independent cohorts (N = 851 and 5

191) and asthma risk (N = 47 514).

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 6

Methods

Study participants

Genetic variants in or near the IL6R gene were tested for association with sIL-6R protein

levels, mRNA levels and asthma risk in the cohorts described below.

Association with IL-6R protein levels – discovery cohort. To find new variants in the IL6R

gene that affect sIL-6R levels, we first studied 360 unrelated asthmatic subjects from the

1995-1998 Asthma and Allergy study, which is described in detail elsewhere.24 Briefly, 3 073

subjects were recruited from 802 families that were registered on the Australian Twin

Registry and had at least one twin who previously reported a history of wheezing in studies

conducted at Queensland Institute of Medical Research (QIMR) and by collaborators

elsewhere in Australia. Participants completed a questionnaire that was designed to validate

the diagnosis of asthma and to obtain data on respiratory symptoms, environmental exposures

and family history of asthma. In addition, participants underwent clinical testing, including

lung function and skin prick tests. For the present study, we selected 360 unrelated individuals

with available DNA and a serum sample, and who answered “Yes, told to me by a doctor” to

the question “Have you ever had asthma?”. Demographics and clinical characteristics for this

discovery sample are summarised in Supplementary Table 1. These individuals were

included in a recent genome-wide association study (GWAS) of asthma.17

Association with IL-6R protein levels – replication cohort. To confirm associations between

SNPs and IL-6R protein levels observed in the discovery sample, we studied an additional

sample of 354 unrelated asthmatics recruited through the QIMR Asthma Genetics Study

between 2011 and 2013. In this study, individuals with a self-reported doctor-diagnosis of

asthma were recruited from the general community through media appeals (newspapers,

radio, television and social media) to participate in a study of genetic risk factors for asthma.

Participation included completing a brief online questionnaire about asthma symptoms,

medication and severity, and the provision of a blood sample for genetic and immune function

testing. Blood was collected at a local pathology laboratory and shipped to QIMR by

overnight courier. Demographics and clinical characteristics for this replication sample are

summarised in Supplementary Table 2.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 7

Association with IL6R mRNA levels. To provide some insight into the molecular mechanisms

underlying SNP-protein associations, SNPs confirmed to associate with sIL-6R protein levels

were then tested for association with mRNA levels measured in 851 individuals from 262

families who participated in a genome-wide gene-expression study reported recently.25

Briefly, adolescent monozygotic and dizygotic twins, their siblings, and their parents have

been recruited over a 21-year period into an ongoing study of the genetic and environmental

factors influencing pigmented nevi and the associated risk of developing skin cancer and

cognition (Twin Moles study)26. Results obtained in this cohort were confirmed by analysing

expression levels from an additional 5 191 individuals who participated in two large-scale

longitudinal studies: the Netherlands Twin Register (N = 3 170) and the Netherlands Study of

Depression and Anxiety (N = 2 021); henceforth we refer to this as the NTR-NESDA eQTL

study. The individual NTR and NESDA cohorts are described in detail elsewhere.27, 28

Association with asthma risk. SNPs associated with sIL-6R protein levels were also tested for

association with asthma risk in 21 039 unrelated individuals of European ancestry who

participated in two independent studies. First, 5 967 individuals who participated in the

Australian asthma GWAS,17 including 2 110 doctor-diagnosed asthmatics and 3 857 controls.

Sample ascertainment and patient characteristics for this study are described in detail in

Ferreira.17 Second, 15 072 customers of 23andMe, Inc., a personal genetics company, who

had been genotyped as part of the 23andMe Personal Genome Service®. We selected 4 230

cases and 10 842 controls of European ancestry who had taken a survey about asthma. Cases

(mean age 48, 54% female, 55% with onset of asthma at or before age 16) gave positive

responses to the question "Have you ever been diagnosed by a doctor with asthma or

bronchial asthma?", and also checked a box indicating that they had ever had "allergic rhinitis

(stuffed or dripping nose caused by allergies)". Controls (mean age 49, 39% females) gave

negative responses to both questions. Association results from these 6 340 cases and 14 699

controls were meta-analysed with publicly available results from 10 365 asthmatics and 16

110 controls included in the GABRIEL GWAS,29 for a combined total sample size of 47 514

individuals, including 16 705 cases and 30 809 controls.

sIL-6R serum protein measurements

A 40 mL venous blood sample was collected from all consenting subjects. After spinning 10

mL of blood in a serum tube at 805g for 10 min, the serum layer was extracted and stored at -

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 8

80ºC until analysis. ELISA kits (R&D Systems, Minneapolis, MN, USA) were used to

measure sIL-6R levels according to manufacturer’s procedures. The optical density was

determined using a PowerWave XS2 microplate spectrophotometer (BioTek Instruments,

Inc.) at both 450 and 540 nm wavelengths. Results and standard curves were acquired with

Gen5 2.0 Data Analysis Software. sIL-6R levels were measured in duplicate for each sample.

IL6R mRNA measurements

Whole blood for expression profiling was collected from 851 individuals from the Twin

Moles study and processed as described in detail by Powell et al.25 Briefly, total RNA was

extracted from PAXgeneTM tubes using the QIAGEN whole blood gene RNA purification kit.

RNA quality and concentration was estimated with Agilent Bioanalyzer and subsequently

converted to cDNA, amplified and purified using the Ambion Illumina TotalPrep RNA

Amplification Kit (Ambion). Expression profiles were generated by hybridising 750 ng of

cRNA to Illumina HumanHT-12 v4.0 Beadchip according to Illumina whole-genome gene

expression direct hybridization assay guide (Illumina Inc, San Diego, USA). Beadchips were

then washed and stained and subsequently scanned to obtain fluorescence intensities. Samples

were scanned using an Illumina Bead Array Reader. Samples were randomised across chips

and chip positions, with check for balance across families, sex and generation. For this study,

we restricted our analysis to two probes targeting the IL6R gene, specifically ILMN_1754753

on the 3'-end and ILMN_1696394 on exon 9 (Supplementary Table 5).

Gene expression association results were confirmed by analysing a further 5 191 individuals

from the NTR-NESDA eQTL study. In this study, venous blood samples were drawn in the

morning after an overnight fast; heparinised whole blood was transferred within 20 minutes of

sampling into PAXgeneTM tubes and stored at -20°C. RNA extraction and analysis was

performed at the Rutgers University Cell and DNA Repository (USA). RNA was extracted

using Qiagen Universal liquid handling system (PAXgene extraction kits following the

manufacturer's protocol). Total RNA was measured by spectroscopy (Trinean DropSense) to

determine purity and concentration, while RNA fidelity was measured by the Agilent

Bioanalyzer analysis. RNA samples were hybridized to Affymetrix U219 array plates

(GeneTitan), which contains 530 467 probes, each 25 bases in length. Array hybridization,

washing, staining, and scanning were carried out in an Affymetrix GeneTitan System

following the manufacturer’s protocol. Non-uniquely mapping probes (hg19) and probes

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 9

containing a polymorphic SNP based on snp137 (UCSC) were removed. Expression values

were obtained using RMA normalization implemented in Affymetrix Power Tools (APT, v

1.12.0). Thirty-two probes targeting IL6R were selected for analysis (Supplementary Table

4).

IL6R SNP genotyping

For association with sIL-6R protein levels – discovery cohort. Genome-wide SNP data for the

360 individuals were obtained using Illumina 610-Quad Beadchip as part of the Australian

Asthma Genetics Consortium GWAS, which is described in detail elsewhere.17 For this study,

we selected 254 post-quality control (minor allele frequency [MAF] > 1%, call rate > 95%,

Hardy-Weinberg equilibrium P-value > 10-6) SNPs in or within 1 000 kb of IL6R, including

rs4129267. To expand the genetic coverage of IL6R, these 254 directly genotyped SNPs were

then used to impute 530 common variants (MAF >= 1%) from the 1000 Genomes Project

(March 2012 release, 1,092 samples of all ancestries) using Impute2.30 After quality control

(MAF > 1%, imputation information score > 0.3), there were 471 SNPs located in or within

50 kb of IL6R available for analysis, of which 19 had been directly genotyped.

For association with sIL-6R protein levels – replication cohort. Genotypes for two SNPs

(rs4129267 and rs12083537) were obtained for the 354 individuals with Sequenom

MassARRAY iPLEX platform; 12.5 ng of DNA isolated from buffy coats with a salt

precipitation method were used per assay. SNP sequences were downloaded from the

National Centre for Biotechnology Information (http://www.ncbi.nlm.nih.gov/) and cross

checked. Design of the PCR and iPLEX Extension assays was done using the Sequenom

Design Suite (https://mysequenom.com/). The assays were then manually adjusted to increase

the level of multiplexing. SNPs were typed using iPLEXTM Gold chemistry and analysed

using a Sequenom MassARRAY Compact Mass Spectrometer (Sequenom Inc, San Diego,

CA, USA). The PCR, SAP and iPLEX reactions were performed using half the volume of

reagents recommended in the manufacturer’s specifications. The post-PCR products were

spotted on a Sequenom SpectroChip 2, and the data was processed and analysed using

Sequenom MassARRAY TYPER 4.0 software. The primers used to amplify both SNPs are

included in Supplementary Table 6.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 10

For association with IL6R mRNA levels. DNA samples from the Twin Moles study (N = 851)

were genotyped by the Scientific Services Division at deCODE Genetics, Iceland, using the

Illumina 610-Quad Beadchip. Genotypes were called with the Illumina BeadStudio software.

Full details of genotyping procedures are given in Medland et al.31 DNA extraction for the

NTR-NESDA eQTL study (N = 5 191) has been described before.27 Genotyping was

performed on multiple SNP array platforms, including Perlegen 5.0, Illumina 370, Illumina

660, Illumina Omni Express 1M and Affymetrix 6.0. Standard pre-imputation quality control

filters were applied first within and then between chip platforms; subsequently, data were

merged into a single dataset. Genome-wide SNP imputation was done with Impute2 30 using

the 1000 Genomes phase I Interim Jun 2011 release. This included the rs12083537 variant

analysed in this study, which was imputed with high confidence (information score > 0.9).

For association with asthma risk. We accessed individual-level genotype data for a single

SNP (rs12083537) from the Australian GWAS (N = 5 967) and the 23andMe cohort (N = 15

072). This SNP was imputed in the Australian GWAS with high confidence (information

score 0.97), using Impute2 and genotype data from the combined 1000 Genomes (60

individuals with northern and western European ancestry from the Centre d’Etude du

Polymorphisme Humain collection [CEU], March, 2010 release) and HapMap 3 (955

individuals from 11 populations, February, 2009 release) projects as the reference panel. In

the 23andMe cohort, rs12083537 was imputed with high confidence (r2 = 0.95) using the

August 2010 release of the 1000 Genomes reference haplotypes.

Statistical analyses

Quality control of sIL-6R ELISA data. Each sample was measured in duplicate (two assays),

with up to 80 serum samples included per ELISA plate. No outlier observations (located six

standard deviations from the mean) were observed for either assay. The same procedure was

used for the discovery and replication cohorts to estimate and remove the effects of both

technical and biological effects on sIL-6R levels. Briefly, we (1) used linear regression to

adjust for assay and plate effects; (2) normalized the distribution using a rank-based inverse-

normal transformation; and (3) tested and, if significant (P < 0.05), adjusted for the effects of

age and sex including these variables in a multiple regression model. Subsequent association

analyses were carried out using the residuals obtained from (3).

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 11

Association analyses. Individual SNPs in IL6R, directly genotyped or imputed, were tested for

association with sIL-6R protein levels (discovery and replication cohorts), mRNA levels and

asthma risk. Association with protein levels was tested using multiple linear regression,

assuming an additive model for the SNP effect; for imputed SNPs, we used the inferred allelic

dosage. Analyses were performed before and after adjustment for rs4129267, a SNP highly

correlated with the missense variant rs2228145 (r2 = 0.99), the main genetic determinant of

sIL-6R protein serum levels.21 We permuted sIL-6R levels (100 000 replicates) and repeated

the association analysis to obtain empirical association P-values corrected for the number of

SNPs tested. Association with mRNA levels in the Twin Moles study was tested using

MACH2QTL,32, 33 a software that performs a linear regression analysis based on imputed

dosages and which takes into account the family structure of the data. In the NTR-NESDA

eQTL study, association with mRNA levels was tested using a linear mixed model to correct

for family structure (as random effects) as well as sex, age, body mass index, smoking status,

technical covariates and ancestry (as fixed effects). Lastly, association with asthma risk was

tested using logistic regression, with the dependent variable being asthma status, i.e. asthmatic

or control. The analysis included covariates to adjust for sample origin in the Australian

GWAS and, in the 23andMe cohort, for age, sex and ancestry. Fixed-effects meta-analysis of

results from the Australian GWAS, 23andMe and the publicly available GABRIEL GWAS

was performed with METAL.34

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 12

Results

Common variants affecting sIL-6R serum levels independently of rs4129267

To find new common variants associated with sIL-6R levels and possibly with asthma risk,

we first measured sIL-6R serum levels in 360 asthmatics (Supplementary Table 1) who had

been previously genotyped with Illumina 610K SNP arrays.17 sIL-6R measurements were

performed in duplicate, with good agreement between assays (r = 0.95). Assay and plate

effects explained respectively 0.7% (P = 0.013) and 13.6% (P = 3.3x10-20) of the sample

variation in sIL-6R levels and so these effects were removed prior to association analyses.

Duplicate measurements were then averaged and the overall distribution normalised using a

rank-based inverse-normal transformation. There were no significant effects of age (P =

0.256) or sex (P = 0.932) on sIL-6R levels.

The main genetic determinant of sIL-6R levels is thought to be the non-synonymous variant

rs2228145,14 which is in high linkage disequilibrium (r2 = 0.99) with rs4129267, a SNP that

was directly genotyped in all individuals. We confirmed that the rs4129267 SNP has a

significant impact on sIL-6R availability, increasing serum levels by 19.7 ng/ml for each copy

of the T allele and explaining 30% of the inter-individual variation (P = 1.8x10-29,

Supplementary Figure 1). However, given that the heritability of sIL-6R levels has been

estimated at ~70%,19 we hypothesised that additional variants in or near the IL6R gene

contribute to variation in sIL-6R serum levels independently of rs4129267.

To test this hypothesis, we imputed 452 common variants in IL6R (± 50 kb) in the 360

asthmatics using data from the 1 000 Genomes Project as the reference panel. These variants,

as well as 19 additional IL6R SNPs that were present in the Illumina 610K array, were then

tested for association with sIL-6R levels, after adjustment for the effect of rs4129267.

The variant with strongest association with rs4129267-adjusted sIL-6R levels was rs12083537

(uncorrected P = 0.0005, Supplementary Table 2 and Supplementary Figure 2); the

association remained significant after correcting for the number of SNPs tested through 100

000 permutations (P = 0.0496). The rs12083537 variant is in low linkage disequilibrium with

rs4129267 (r2 = 0.03) and has a minor allele (G) frequency of 0.23. An additional 3

independent (r2 < 0.1) SNPs had a significant (uncorrected P < 0.05) association with sIL-6R

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 13

levels after adjusting for both rs4129267 and rs12083537 (Supplementary Table 2).

However, these associations did not remain significant after correction for multiple SNP

testing (P > 0.05).

Replication of the new association between rs12083537 and sIL-6R serum levels

To confirm that the observed association between rs12083537 and sIL-6R levels was real and

not a technical or statistical artefact, we performed a replication study using the same

experimental procedure in an additional set of 354 unrelated asthmatics (Supplementary

Table 3), genotyped for both rs4129267 and rs12083537.

After adjusting for the effects of rs4129267, rs12083537 was significantly associated with

sIL-6R levels (P = 0.033), with the same direction of effect observed in the discovery analysis

(Supplementary Figure 3). These results thus confirm that rs12083537 is a new quantitative

trait locus for sIL-6R serum protein levels.

After combining data from the discovery and replication cohorts (combined N = 714),

rs12083537 explained 2.2% of the variation in sIL-6R levels after adjustment for the effects

of rs4129267 (P = 6x10-5). Before adjustment for rs4129267, rs12083537 explained 0.15% of

the variation in sIL-6R levels. The rs12083537:G minor allele decreased sIL-6R levels

similarly in the three rs4129267 genotype classes (Figure 1).

 

Figure 1. sIL-6R serum levels as a

function of rs4129267 and rs12083537

genotype (N = 714). The proportion of

variation in sIL-6R levels explained by

rs12083537 (r2) is shown for each

rs4129267 genotype class, together with

the estimated regression coefficient (β).

 

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 14

Association between rs12083537 with IL6R mRNA levels

The main genetic determinant of sIL-6R levels, rs2228145, is located on the main cleavage

site of IL-6R and so it is thought to affect protein levels through a post-transcriptional

mechanism.15 To provide some insight into the molecular mechanisms underlying the new

rs12083537 association with sIL-6R protein levels, we investigated if this variant influenced

IL6R mRNA levels measured in whole blood in a cohort of adolescent twins and their

relatives (N = 851). There was no significant association between rs12083537 and IL6R

mRNA levels in this cohort, either measured by a probe targeting the 3' UTR (beta = -0.050

for the G allele, SE = 0.062, P = 0.4234) or exon 9 (beta = -0.062 for the G allele, SE = 0.063,

P = 0.3209). On the other hand, we observed a significant association between rs4129267 (a

proxy for rs2228145) and IL6R gene transcription (exon 9 probe, beta = -0.120 for T allele,

SE = 0.051, P = 0.0187). Counterintuitively, the rs4129267:T allele that increases sIL-6R

protein levels was associated with decreased mRNA expression.

To confirm that the lack of association between rs12083537 and mRNA levels was not due to

low power or incomplete coverage of IL6R expression patterns, we performed a similar

analysis in a larger independent cohort (N = 5 191) with IL6R gene expression levels in whole

blood measured by 32 individual Affymetrix probes, targeting exons 5, 7, 8 and the 3’ UTR.

In this cohort, the rs12083537 variant was also not associated with IL6R transcription levels

after a Bonferroni correction for the number of probes tested (P < 0.0016, Supplementary

Table 4). The strongest association was with a probe mapping to the 3’UTR region (P =

0.0031), with the rs12083537:A allele that increases sIL-6R protein levels being associated

with decreased mRNA expression, as noted above for rs4129267.

Association between rs12083537 and asthma risk

The rs2228145:C allele that increases sIL-6R levels14 also increases asthma risk.17 We

therefore tested the hypothesis that rs12083537 is also a genetic risk factor for asthma.

Specifically, we hypothesised that, as for rs2228145, the rs12083537:A allele that is

associated with higher sIL-6R levels would be predisposing for asthma. Association results

were available for 47 514 individuals from three independent studies, including 16 705

doctor-diagnosed asthmatics and 30 809 controls. A consistent predisposing effect was

observed for the rs12083537:A allele in all three studies (Table 1). Overall, the association

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 15

between rs12083537 and asthma risk was weak but statistically significant (OR = 1.039, 95%

C.I. = 1.002 – 1.078, P = 0.0419).

Study   N cases   N Controls   OR*   SE   P-value  AAGC   2110   3857   1.095   0.0528   0.0846  

23andMe   4230   10842   1.036   0.0333   0.2888  GABRIEL   10365   16110   1.028   0.0249   0.2726  

All samples   16705   30809   1.039   0.0186   0.0419  

Table  1.  Association between rs12083537 and asthma risk in three independent studies.  

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 16

Discussion

Recently, Ferreira et al.17 identified a variant (rs4129267) in the IL6R gene with a modest (OR

= 1.09) but genome-wide significant association with asthma risk. This variant is in near

complete LD with the exon 9 missense variant rs2228145, and is associated with a 1.4-fold

increase in sIL-6R levels for each copy of the T allele.21 The goal of this study was to test the

hypotheses that (1) other variants in IL6R regulate gene expression and (2) that these also

associate with asthma risk.

A single variant (rs12083537) located in intron 1 of IL6R, 2.9 kb away from exon 1, was

found to associate with sIL-6R levels independently of rs4129267 and after correction for

multiple SNP testing. A significant and consistent association for this variant was then

observed in our replication panel. Furthermore, at the time of submission, we also noted that

R. Ferreira et al.22 independently reported two new genetic associations for sIL-6R levels, one

of which (rs1386821) is correlated with rs12083537 (r2 = 0.94). The alleles that increase sIL-

6R levels for these two SNPs (rs12083537:A and rs1386821:T) are in phase and so our results

are consistent with those of R. Ferreira et al. Together, these results establish rs12083537, or a

variant in LD with it, as a new regulatory variant for sIL-6R serum levels.

The rs12083537 variant is located in a region with high H3K4Me1 lysine methylation in four

cell lines, including lung fibroblasts and keratinocytes.23 This histone modification is

associated with enhancers and promoters and so it is plausible that rs12083537 may influence

gene transcription. However, analysis of mRNA levels extracted from whole blood in two

independent cohorts did not identify any significant associations between rs12083537 and

IL6R transcription levels, after correcting for multiple testing. Thus, the molecular mechanism

underlying the association between rs12083537 and sIL-6R levels remains to be elucidated.

Gene expression studies in individual immune cell types or upon allergen stimulation may

help characterise the function of this variant in greater detail.

Interestingly, we observed that the rs4129267:T allele that is associated with increased sIL-6R

protein levels21 was associated with decreased IL6R mRNA levels, which may appear

contradicting. The Illumina probe for which we observed an association with rs4129267 maps

to exon 9, which is skipped in the main differentially spliced IL6R isoform. Therefore, this

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 17

probe measures the levels of the full length isoform, which necessarily decrease if splicing of

exon 9 increases. Given that the rs4129267:T allele, or an allele associated with it (e.g.

rs2228145:C), strongly increases the splicing of exon 9,22 it is in fact not surprising that

consequently this allele is associated with decreased mRNA levels of the full length isoform.

Unlike our results, R. Ferreira22 did not observe a significant effect of rs2228145 on the levels

of the full length isoform. However, as the authors noted, splicing of exon 9 is a rare event.

Therefore, the lack of association in that study likely reflected the low power provided by a

small sample size (N = 88) to detect small differences in the levels of the full length isoform

between genotype classes.

Lastly, we found rs12083537 to be significantly associated with asthma risk. Based on our

analysis, each rs12083537:A allele increased sIL-6R serum levels by 2.4 ng/ml and increased

asthma risk by 1.04-fold. For comparison, each rs4129267:T allele increased sIL-6R serum

levels by 19.7 ng/ml and increases asthma risk by 1.09-fold.17 The association between

rs12083537 and asthma risk was consistent across the three cohorts analysed but only

marginally statistically significant in the overall analysis. Replication of our results by well

powered studies is therefore warranted; over 23 000 cases and 23 000 controls (or, for

example, 18 000 cases and 36 000 controls) are required to achieve 80% power to detect (α =

0.05) the effect we observed.

In summary, our study demonstrates that at least two independent genetic variants influence

sIL-6R serum levels. Consequently, pharmacogenetic studies of clinical response to

tocilizumab, an anti-IL-6R monoclonal antibody approved to treat rheumatoid arthritis34 and

suggested to treat asthma,16, 17 should consider the effects of not only the major genetic

determinant of sIL-6R levels (rs2228145) but also of other confirmed regulatory variants such

as rs12083537. For this purpose, multi-SNP scores that summarise an individual's combined

genetic risk of having high sIL-6R levels may prove useful.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 18

Acknowledgements

I would like to thank all those without whom I would not have been able to complete this

dissertation. First and foremost, I would like to express my deepest gratitude to my supervisor

Dr. Manuel Ferreira. For the opportunity to work on this project, for everything he taught me,

for his excellent guidance and for all the help in writing this manuscript. I would also like to

thank Prof. Dr. Manuel Carmo Gomes for all the help provided to the completion of this

dissertation. I thank my family for all the support and encouragement that brought me here.

Finally, I would like to express my gratitude to Prof. Dr. Grant Montgomery, for the

opportunity to work in his group, and I would like to thank all the group members, especially

Lisa Bain, for all the patience and caring while guiding me in the laboratory.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 19

References

1. Murphy DM, O'Byrne PM. Recent advances in the pathophysiology of asthma. Chest 2010;

137(6): 1417-26.

2. Doganci A, Sauer K, Karwot R, Finotto S. Pathological role of IL-6 in the experimental

allergic bronchial asthma in mice. Clin Rev Allergy Immunol 2005; 28(3): 257-70.

3. Neveu WA, Allard JL, Raymond DM, Bourassa LM, Burns SM, Bunn JY et al. Elevation of

IL-6 in the allergic asthmatic airway is independent of inflammation but associates with loss

of central airway function. Respir Res 2010; 11: 28.

4. Rincon M, Irvin CG. Role of IL-6 in asthma and other inflammatory pulmonary diseases. Int J

Biol Sci 2012; 8(9): 1281-90.

5. Wang J, Homer RJ, Chen Q, Elias JA. Endogenous and exogenous IL-6 inhibit aeroallergen-

induced Th2 inflammation. J Immunol 2000; 165(7): 4051-61.

6. Hibi M, Murakami M, Saito M, Hirano T, Taga T, Kishimoto T. Molecular cloning and

expression of an IL-6 signal transducer, gp130. Cell 1990; 63(6): 1149-57.

7. Taga T, Hibi M, Hirata Y, Yamasaki K, Yasukawa K, Matsuda T et al. Interleukin-6 triggers

the association of its receptor with a possible signal transducer, gp130. Cell 1989; 58(3): 573-

81.

8. Chalaris A, Rabe B, Paliga K, Lange H, Laskay T, Fielding CA et al. Apoptosis is a natural

stimulus of IL6R shedding and contributes to the proinflammatory trans-signaling function of

neutrophils. Blood 2007; 110(6): 1748-55.

9. Oberg HH, Wesch D, Grussel S, Rose-John S, Kabelitz D. Differential expression of CD126

and CD130 mediates different STAT-3 phosphorylation in CD4+CD25- and CD25high

regulatory T cells. Int Immunol 2006; 18(4): 555-63.

10. Horiuchi S, Koyanagi Y, Zhou Y, Miyamoto H, Tanaka Y, Waki M et al. Soluble interleukin-

6 receptors released from T cell or granulocyte/macrophage cell lines and human peripheral

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 20

blood mononuclear cells are generated through an alternative splicing mechanism. Eur J

Immunol 1994; 24(8): 1945-8.

11. Lust JA, Donovan KA, Kline MP, Greipp PR, Kyle RA, Maihle NJ. Isolation of an mRNA

encoding a soluble form of the human interleukin-6 receptor. Cytokine 1992; 4(2): 96-100.

12. Mullberg J, Schooltink H, Stoyan T, Gunther M, Graeve L, Buse G et al. The soluble

interleukin-6 receptor is generated by shedding. Eur J Immunol 1993; 23(2): 473-80.

13. Rose-John S, Heinrich PC. Soluble receptors for cytokines and growth factors: generation and

biological function. Biochem J 1994; 300 ( Pt 2): 281-90.

14. Galicia JC, Tai H, Komatsu Y, Shimada Y, Akazawa K, Yoshie H. Polymorphisms in the IL-6

receptor (IL-6R) gene: strong evidence that serum levels of soluble IL-6R are genetically

influenced. Genes Immun 2004; 5(6): 513-6.

15. Mullberg J, Oberthur W, Lottspeich F, Mehl E, Dittrich E, Graeve L et al. The soluble human

IL-6 receptor. Mutational characterization of the proteolytic cleavage site. J Immunol 1994;

152(10): 4958-68.

16. Doganci A, Eigenbrod T, Krug N, De Sanctis GT, Hausding M, Erpenbeck VJ et al. The IL-

6R alpha chain controls lung CD4+CD25+ Treg development and function during allergic

airway inflammation in vivo. J Clin Invest 2005; 115(2): 313-25.

17. Ferreira MA, Matheson MC, Duffy DL, Marks GB, Hui J, Le Souef P et al. Identification of

IL6R and chromosome 11q13.5 as risk loci for asthma. Lancet 2011; 378(9795): 1006-14.

18. Rafiq S, Frayling TM, Murray A, Hurst A, Stevens K, Weedon MN et al. A common variant

of the interleukin 6 receptor (IL-6r) gene increases IL-6r and IL-6 levels, without other

inflammatory effects. Genes Immun 2007; 8(7): 552-9.

19. Raggi P, Su S, Karohl C, Veledar E, Rojas-Campos E, Vaccarino V. Heritability of renal

function and inflammatory markers in adult male twins. Am J Nephrol 2010; 32(4): 317-23.

20. Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE et al. An

integrated map of genetic variation from 1,092 human genomes. Nature 2012; 491(7422): 56-

65.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 21

21. Melzer D, Perry JR, Hernandez D, Corsi AM, Stevens K, Rafferty I et al. A genome-wide

association study identifies protein quantitative trait loci (pQTLs). PLoS Genet 2008; 4(5):

e1000072.

22. Ferreira RC, Freitag DF, Cutler AJ, Howson JM, Rainbow DB, Smyth DJ et al. Functional

IL6R 358Ala Allele Impairs Classical IL-6 Receptor Signaling and Influences Risk of Diverse

Inflammatory Diseases. PLoS Genet 2013; 9(4): e1003444.

23. Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F et al. An integrated

encyclopedia of DNA elements in the human genome. Nature 2012; 489(7414): 57-74.

24. Ferreira MA, O'Gorman L, Le Souef P, Burton PR, Toelle BG, Robertson CF et al. Variance

components analyses of multiple asthma traits in a large sample of Australian families

ascertained through a twin proband. Allergy 2006; 61(2): 245-53.

25. Powell JE, Henders AK, McRae AF, Caracella A, Smith S, Wright MJ et al. The Brisbane

Systems Genetics Study: genetical genomics meets complex trait genetics. PLoS One 2012;

7(4): e35430.

26. Zhu G, Montgomery GW, James MR, Trent JM, Hayward NK, Martin NG et al. A genome-

wide scan for naevus count: linkage to CDKN2A and to other chromosome regions. Eur J

Hum Genet 2007; 15(1): 94-102.

27. Boomsma DI, Willemsen G, Sullivan PF, Heutink P, Meijer P, Sondervan D et al. Genome-

wide association of major depression: description of samples for the GAIN Major Depressive

Disorder Study: NTR and NESDA biobank projects. Eur J Hum Genet 2008; 16(3): 335-42.

28. Willemsen G, Vink JM, Abdellaoui A, den Braber A, van Beek JH, Draisma HH et al. The

Adult Netherlands Twin Register: twenty-five years of survey and biological data collection.

Twin Res Hum Genet 2013; 16(1): 271-81.

29. Moffatt MF, Gut IG, Demenais F, Strachan DP, Bouzigon E, Heath S et al. A large-scale,

consortium-based genomewide association study of asthma. N Engl J Med 2010; 363(13):

1211-21.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

A new variant in the IL6R gene increases gene expression and asthma risk 22

30. Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for

the next generation of genome-wide association studies. PLoS Genet 2009; 5(6): e1000529.

31. Medland SE, Nyholt DR, Painter JN, McEvoy BP, McRae AF, Zhu G et al. Common variants

in the trichohyalin gene are associated with straight hair in Europeans. Am J Hum Genet 2009;

85(5): 750-5.

32. Li Y, Willer C, Sanna S, Abecasis G. Genotype imputation. Annu Rev Genomics Hum Genet

2009; 10: 387-406.

33. Li Y, Willer CJ, Ding J, Scheet P, Abecasis GR. MaCH: using sequence and genotype data to

estimate haplotypes and unobserved genotypes. Genet Epidemiol 2010; 34(8): 816-34.

34. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide

association scans. Bioinformatics 2010; 26(17): 2190-1.

 

A new variant in the interleukin-6 receptor (IL-6R) gene increases gene expression and asthma risk.

Joana Allen Revez

Supervised by: Manuel Ferreira PhD Manuel Carmo Gomes PhD

Queensland Institute of Medical Research 2013

Molecular Biology and Genetics

Supplemental Data

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Anexos S2

 

Supplementary Figure 1. Association between sIL-6R serum levels (after adjusting for

technical covariates) and rs4129267 genotype in the discovery cohort (N = 360). The

association was tested using linear regression; the figure shows the proportion of variation in

sIL-6R levels explained by rs4129267 (r2), the association P-value (p) and the corresponding

regression line.

Supplementary Figure 2. Association between sIL-6R serum levels (after adjusting for

technical covariates and the effects of rs4129267) and rs12083537 genotype in the discovery

cohort (N = 360). The association was tested using linear regression; the figure shows the

proportion of variation in sIL-6R levels explained by rs12083537 (r2), the association P-value

(p) and the corresponding regression line.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Anexos S3

Supplementary Figure 3. Association between sIL-6R serum levels (after adjusting for

technical covariates and the effects of rs4129267) and rs12083537 genotype in the replication

cohort (N = 354). The association was tested using linear regression; the figure shows the

proportion of variation in sIL-6R levels explained by rs12083537 (r2), the association P-value

(p) and the corresponding regression line.

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Anexos S4

Supplementary Table 1. Demographics and clinical characteristics of 360 asthmatics

included in the cohort used to discover novel SNP associations with sIL-6R serum levels.

rs4129267 genotype CC CT TT All

N 112 170 78 360

Eczema (% yes, no, unknown) 44.6,55.4,0 40,59.4,0.6 35.9,64.1,0 40.6,59.2,0.3

Atopy (% SPT+, SPT-, unknown) 63.4,21.4,15.2 71.2,14.7,14.1 66.7,12.8,20.5 67.8,16.4,15.8

Asthma onset (% ≤16, >16, unknown) 51.8,17,31.2 62.4,9.4,28.2 57.7,11.5,30.8 58.1,12.2,29.7

IgE, SD (IU/mL) 128.4,4.1 162.3,4.3 147.8,4.1 148.3,4.2

Eosinophils, SD (x109/L) 0.31,0.3 0.34,0.34 0.31,0.25 0.32,0.31

Family history (% yes, no, unknown) 90.2,9.8,0 92.9,7.1,0 88.5,11.5,0 91.1,8.9,0

Age (mean, SD, range) 27,13.9,7-69 26,14.7,7-71 28,14.6,8-73 27,14.4,7-73

Sex (% female) 52.7 55.9 51.3 53.9

Smoker ever (% yes, no, unknown) 24.1,75.9,0 21.8,77.6,0.6 28.2,70.5,1.3 23.9,75.6,0.6

Smoker current (% yes, no, unknown) 14.3,38.4,47.3 6.5,52.9,40.6 12.8,47.4,39.7 10.3,47.2,42.5

Medication ever (% yes, no, unknown) 97.3,2.7,0 96.5,3.5,0 97.4,2.6,0 96.9,3.1,0

Medication current (% yes, no, unknown) 78.6,21.4,0 82.4,17.6,0 80.8,19.2,0 80.8,19.2,0

Steroids ever (% yes, no, unknown) 21.4,78.6,0 22.4,77.6,0 23.1,76.9,0 22.2,77.8,0

Steroids current (% yes, no, unknown) 40.2,35.7,24.1 50.6,24.7,24.7 39.7,29.5,30.8 45,29.2,25.8

Hospital ever (% yes, no, unknown) 23.2,74.1,2.7 29.4,66.5,4.1 29.5,66.7,3.8 27.5,68.9,3.6

sIL-6R levels, SD (ng/mL) 54.99,19.92 74.17,25.66 93.99,28.68 72.5,28.4

 

 

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Anexos S5

Supplementary Table 2. Independent SNPs (r2 < 0.1) with strongest evidence for association

with sIL-6R serum levels in the discovery cohort (N = 360).

SNP Effect

allele Effect allele

frequency Variance

explained, r2 Beta SE

Uncorrected P-value

Corrected P-value*

After adjustment for rs4129267 rs12083537 G 0.23 0.031 -0.259 0.073 0.0005 0.0496

rs138227751 A 0.07 0.018 0.322 0.116 0.0058 0.3827 rs115224285 T 0.01 0.017 0.646 0.244 0.0085 0.4899 rs191793127 T 0.03 0.011 -0.712 0.321 0.0273 0.8489 rs72698116 G 0.01 0.008 -0.745 0.370 0.0449 0.9490

rs147763778 A 0.05 0.008 0.339 0.170 0.0472 0.9551 After adjustment for rs4129267 and rs12083537

rs116059394 G 0.05 0.013 -0.386 0.160 0.0161 0.6947 rs138227751 A 0.07 0.011 0.255 0.114 0.0263 0.8391 rs115224285 T 0.01 0.009 0.506 0.241 0.0364 0.9139

* Empirical P-values corrected for the total number of SNPs tested (N = 471) through 100,000 permutations.

Supplementary Table 3. Demographics and clinical characteristics of 354 asthmatics

included in the cohort used to replicate SNP associations with sIL-6R serum levels.

rs4129267 genotype CC CT TT All

N 136 160 58 354

Age (mean, SD, range) 49,17.57,13-87 50,16.08,10-86 52,15.14,9-90 50,16.51,9-90

Sex (% female) 63.2 68.8 74.1 67.5

Hay fever (% yes, no, unknown) 61.8,36,2.2 55.6,42.5,1.9 60.3,39.7,0 58.8,39.5,1.7

Eczema (% yes, no, unknown) 34.6,64.7,0.7 36.9,61.9,1.2 37.9,62.1,0 36.2,63,0.8

Family history (% yes, no, unknown) 64.7,31.6,3.7 68.8,30.6,0.6 72.4,24.1,3.4 67.8,29.9,2.3

Asthma onset (% ≤16, >16, unknown) 60.3,33.1,6.6 59.4,30.6,10 56.9,34.5,8.6 59.3,32.2,8.5

sIL-6R levels, SD (ng/mL) 49.68,13.04 71.78,16.27 88.25,20.92 65.99,21.31

 

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Anexos S6

Supplementary Table 4. Association results between rs12083537 (effect allele: G) and IL6R

mRNA levels measured by 32 individual Affymetrix probes, using RNA extracted from

whole blood (N = 5 191).

Probe Locus Start, bp End, bp Sequence P-value T

11741958_a_at;264;163 Exon 5 154407569 154407594 TACAGACTACGGTTTGAGCTCAGAT 0.4850 0.70 11741958_a_at;314;329 Exon 5 154407576 154407601 TACGGTTTGAGCTCAGATATCGGGC 0.1068 1.61 11736510_a_at;323;134 Exon 7 154420604 154420629 CAGGAGTCCTCCAGCTGAGAACGAG 0.7989 0.25 11741958_a_at;582;132 Exon 7 154420604 154420629 CAGGAGTCCTCCAGCTGAGAACGAG 0.8337 0.21 11736510_a_at;201;497 Exon 7 154420613 154420638 TCCAGCTGAGAACGAGGTGTCCACC 0.3149 -1.01 11741958_a_at;202;497 Exon 7 154420613 154420638 TCCAGCTGAGAACGAGGTGTCCACC 0.8479 -0.19 11736510_a_at;484;717 Exon 8 154422428 154422453 GATTCTGCAAATGCGACAAGCCTCC 0.6635 0.44 11741958_a_at;483;717 Exon 8 154422428 154422453 GATTCTGCAAATGCGACAAGCCTCC 0.8186 0.23 11736510_a_at;609;718 3'-end 154437649 154437674 AAGACAAGCATGCATCCGCCGTACT 0.4188 -0.81 11741958_a_at;610;718 3'-end 154437649 154437674 AAGACAAGCATGCATCCGCCGTACT 0.6768 0.42 11741958_a_at;705;156 3'-end 154437682 154437707 CAGCTGGTCCCGGAGAGGCCTCGAC 0.4211 -0.80 11736510_a_at;58;615 3'-end 154437760 154437785 GGGTCTGACAATACCTCGAGCCACA 0.6003 -0.52

11736510_a_at;659;168 3'-end 154437779 154437804 GCCACAACCGACCAGATGCCAGGGA 0.9457 0.07 11736510_a_at;390;362 3'-end 154437807 154437832 ACGGAGCCCTTATGACATCAGCAAT 0.8324 -0.21 11736510_a_at;46;386 3'-end 154437834 154437859 AGACTACTTCTTCCCCAGATAGCTG 0.8752 0.16 11741959_x_at;218;13 3'-end 154438769 154438794 ATCAAAACGGTTTTACTGCAGCTTT 0.6968 0.39

11741959_x_at;586;417 3'-end 154438780 154438805 TTTACTGCAGCTTTGTTTGTTGTCA 0.5539 0.59 11741959_x_at;7;323 3'-end 154438789 154438814 GCTTTGTTTGTTGTCAGCTGAACCT 0.1111 1.59

11741959_x_at;109;540 3'-end 154438797 154438822 TGTTGTCAGCTGAACCTGGGTAACT 0.8790 -0.15 11741959_x_at;505;200 3'-end 154438903 154438928 CTGCTGACTGTTTTCTCTTGAGAGG 0.4423 -0.77 11741959_x_at;548;21 3'-end 154438916 154438941 TCTCTTGAGAGGGTGGAATATCCAA 0.5571 -0.59

11741959_x_at;361;694 3'-end 154438932 154438957 AATATCCAATATTCGCTGTGTCAGC 0.4901 0.69 11741959_x_at;267;582 3'-end 154438945 154438970 CGCTGTGTCAGCATAGAAGTAACTT 0.9922 -0.01 11741959_x_at;260;360 3'-end 154438986 154439011 AGCACCATAACTTTGTTTAGCCCAA 0.3022 1.03 11736509_x_at;495;603 3'-end 154439660 154439685 GTTCCTTGAGTTGATTCAGCTCTGC 0.2138 1.24 11736509_x_at;553;369 3'-end 154439733 154439758 ACTTCAGCTGACTTTTCTGTCCGAG 0.0031 2.96 11736509_x_at;545;104 3'-end 154439782 154439807 GGTTACCCAGTTAGCTCTCAAGTTA 0.9530 0.06 11736509_x_at;551;225 3'-end 154439795 154439820 GCTCTCAAGTTATCAGGGTATTCCA 0.5074 0.66 11736509_x_at;308;15 3'-end 154439837 154439862 AAATCAGCCGTGTAACCATGGACCC 0.4184 -0.81

11736509_x_at;372;285 3'-end 154439953 154439978 CCATGTTTTGTTTACGGTTTTCCAG 0.5067 -0.66 11736509_x_at;61;280 3'-end 154439990 154440015 ACTGTCTTCAGTAAGCCGTGATTTT 0.4114 -0.82

11736509_x_at;40;550 3'-end 154440035 154440060 GATTTTAGACCCTATTGCTGCTTGA 0.9373 -0.08

 

 

 

Joana N. M. Allen Revez Mestrado em Biologia Molecular e Genética  

Anexos S7

Supplementary Table 5. DNA sequence targeted by the two Illumina probes used to measure

IL6R transcription levels in whole-blood (N = 851).

Probe Locus Sequence

ILMN_1754753 3'-end GACCCTATTGCTGCTTGAGGCAACTCATCTTAGGTTGGCAAAAAGGCAGG

ILMN_1696394 Exon 9 TCTTCAGTACCACTGCCCACATTCCTGGTTGCTGGAGGGAGCCTGGCCTT

Supplementary Table 6. Primers used to amplify the rs4129267 and rs12083537 SNPs in the

replication cohort (N = 354) using Sequenom MassARRAY.

SNP Primer Sequence

rs4129267 Reverse ACGTTGGATGTGTGGTCTAGTCCCTACTTG

Forward ACGTTGGATGACTTGCTCAGCTTGGAGTGG

Extension TGAGTGGGGTCAATTCT

rs12083537 Reverse ACGTTGGATGTCTACAAATACCCAGTGAGG

Forward ACGTTGGATGCAGGCACACTGATCCTGAC

Extension TCCCAATTAGGGGAGTATTTGTTTACCC