Pulmonology Pulmonology
Rev Port Pneumol 2015;21:299-306 - Vol. 21 Num.6 DOI: 10.1016/j.rppnen.2015.07.006
Cluster analysis in phenotyping a Portuguese population
C.C.. Loureiroa,,b, , P.. Sa-Coutoc, A.. Todo-Bomb,d, J.. Bousquete,f
a Pneumology Unit, Hospitais da Universidade de Coimbra, Centro Hospitalar e Universit??rio de Coimbra, Coimbra, Portugal
b Centre of Pneumology, Faculty of Medicine, University of Coimbra, Portugal
c Center for Research and Development in Mathematics and Applications, Department of Mathematics, University of Aveiro, Aveiro, Portugal
d Immunoallergology Unit, Hospitais da Universidade de Coimbra, Centro Hospitalar e Universit??rio de Coimbra, Coimbra, Portugal
e University Hospital Arnaud de Villeneuve, Montpellier, France
f Inserm, CESP Centre for Research in Epidemiology and Population Health, U1018, Respiratory and Environmental Epidemiology Team, Villejuif, France
Related article:

Unbiased cluster analysis using clinical parameters has identified asthma phenotypes. Adding inflammatory biomarkers to this analysis provided a better insight into the disease mechanisms. This approach has not yet been applied to asthmatic Portuguese patients.


To identify phenotypes of asthma using cluster analysis in a Portuguese asthmatic population treated in secondary medical care.


Consecutive patients with asthma were recruited from the outpatient clinic. Patients were optimally treated according to GINA guidelines and enrolled in the study. Procedures were performed according to a standard evaluation of asthma. Phenotypes were identified by cluster analysis using Ward's clustering method.


Of the 72 patients enrolled, 57 had full data and were included for cluster analysis. Distribution was set in 5 clusters described as follows: cluster (C) 1, early onset mild allergic asthma; C2, moderate allergic asthma, with long evolution, female prevalence and mixed inflammation; C3, allergic brittle asthma in young females with early disease onset and no evidence of inflammation; C4, severe asthma in obese females with late disease onset, highly symptomatic despite low Th2 inflammation; C5, severe asthma with chronic airflow obstruction, late disease onset and eosinophilic inflammation.


In our study population, the identified clusters were mainly coincident with other larger-scale cluster analysis. Variables such as age at disease onset, obesity, lung function, FeNO (Th2 biomarker) and disease severity were important for cluster distinction.

Asthma, Phenotypes, Cluster analysis

Asthma is heterogeneous and cluster approaches have been developed to better define phenotypes.1, 2, 3 Although variable selection and analysis methodology differ, some studies have reached comparable results. Identification of each corresponding endotype requires the definition of their clinical characteristics, biomarkers, lung physiology, genetic aspects, disease history and therapeutic response.4 According to Wenzel, the first step to better understand asthma is to define phenotypes by cluster analysis.5

The objective of this observational study was to cluster an asthmatic Portuguese population, treated in secondary medical care, combining clinical, inflammatory, lung function and severity parameters.

MethodsStudy design

Consecutive patients with asthma were recruited from the outpatient clinic, using written informative material. They were included if aged between 18 and 79 years and excluded in the presence of any of the following: cystic fibrosis, interstitial lung disease, auto-immune disease, neoplastic disease, untreated cardiac failure.

Asthma was defined on the basis of a relevant symptom history, plus one or more of the following: history of airway reversibility to salbutamol according to GINA guidelines,6 positive test for airway hyperresponsiveness using methacholine.7

Asthma control was measured under optimized treatment, with validated asthma questionnaires (asthma control test (ACT), control asthma and allergic rhinitis test (CARAT), asthma life quality (ALQ), and severity of asthma score (SOA)).

Severe exacerbations were defined as events requiring urgent action to prevent a serious outcome and at least one of the following: use of systemic corticosteroids or an increase from a stable maintenance dose, for at least 3 days, and/or hospitalization or emergency room visit due to asthma requiring systemic corticosteroid treatment.8

The risk of future adverse events was evaluated considering loss of control, number of exacerbations in the previous year, number of systemic corticosteroids in the previous year, accelerated decline in lung function, and side-effects of treatment. A low basal FEV1 (% predicted value) and low reversibility were also taken into account.

Lung function was assessed with spirometry and plethysmography according to ATS criteria;9, 10 skin prick tests (SPT) were performed using standardized allergens and/or specific serum IgE, FeNO measurement was done using a CLD 88 SP (EcoMedics®) analyzer before any forced expiratory manoeuvers11 and sputum induction and collection was performed using hypertonic saline 4.5% if stable asthma, delivered via an ultrasonic nebulizer.12 Induced sputum was collected by a trained nurse, stored on ice and processed within two hours after expectoration. Sputum processing and immunophenotypical analysis of sputum cells was performed according to laboratory procedures.

If the need to exclude other diagnoses occurred, specific procedures were scheduled (Table E1).

Data processing

Variables were selected according to previous studies1, 2 and included demographic data; comorbidities; evaluation of disease control, quality of life and risk assessment; lung function and blood biomarkers (Table 1). Patients with missing data were excluded. Summary statistics were reported as mean and standard deviation values for continuous variables and as percentages and counts for categorical variables. Geometric mean was reported for total serum IgE.

Table 1. Variables selected for cluster analysis (continuous, binary and composed).

NumberVariable nameType of dataKey
Demographic and comorbidities – Questionnaire data
1GenderBinary0 – Female/1 – Male
4Onset under 12 yBinary0 – Yes/1 – No
5Years of diseaseContinuous 
6ComorbiditiesComposite: atopy; rhinitis; polyposis; sinusitis; any smoke exposure; pneumonia history; NSAID's HS; GERD; α1-AT deficit; Bronchiectasic; COPD; Other obstructive disease; Aspergillus (IgEpos)Presence: 1 Absence: 0

Minimum: 0

Maximum: 13
Control and severity of disease, risk of adverse events, and quality of life
7Control of disease and patient at riskComposite: hospitalization (in previous year); Severe exacerbation (in previousyear); Disease control; % patients at risk; OCSPresence: 1 Absence: 0

Minimum: 0

Maximum: 5
Demographic and comorbidities – Questionnaire data
8Medication usedComposite: Imunotherapy; ICS dose; LABA; Tiotropium; Montelukast; Omalizumab; Aminophylline; nasal CSPresence: 1 Absence: 0

Minimum: 0

Maximum: 8
9No. of allergiesComposite: Mites; Pollens; Fungus; Cockroach; CatPresence: 1 Absence: 0

Minimum: 0

Maximum: 5
10Risk of adverse eventsComposite: hospitalizations >0 (previous year); OCS≥2 (previous year); FEV1<80%; ΔFEV1 after BD<200 ml; adverse effects (1 = yes)Minimum: 0

Maximum: 5
13CARAT (Rhinitis)Continuous 
14CARAT (Asthma)Continuous 
Lung function
16Bronchial labilityComposite: fixed obstruction; bronchial reversibility; fixed obstruction despite BD responsePresence: 1 Absence: 0

Minimum: 0

Maximum: 3
17Basal FEV1 (%predicted)Continuous 
18Basal FEV1/CVF (%predicted)Continuous 
19Basal FEV25%–75% (%predicted)Continuous 
20Basal RV (%predicted)Continuous 
21Blood eosinophils (%):ContinuousLog scale
22Seric IgE (mmol/L):ContinuousLog scale

ACT: asthma control test; ALQ: asthma life quality; CARAT: Control of Allergic Rhinitis and Asthma Test; CS: corticosteroids;; ICS: inhaled corticosteroids; LABA: long-acting b-agonists; SOA: severity asthma score; OCS: oral corticosteroids.

Clinically redundant variables (or with correlation values above 0.9 in module) were reduced. Binary questionnaire data and data with a spectrum of responses were transformed into “composite variables” (Table 1) to capture multiple questions in a ranked ordinal scale.2, 13, 14

A cluster analysis of 22 variables (Table 1) was applied to identify groups of patients with the same characteristics. Ward's minimum-variance hierarchical clustering method was performed using an agglomerative approach, and the linkage measure was the squared Euclidean distance with standardization in z scores, according to Moore et al.1 By visual inspection of dendrogram, no single member, small clusters or observations with large distances from all other observations were observed. Therefore, no formal method for outliers’ detection was used. Analysis of variance (ANOVA or Kruskal–Wallis) or contingency table tests (Person chi-square or Fisher) were used to compare differences between clusters. All statistical analyses were performed using SPSS® Software, version 20.0 (SPSS, Inc., Chicago, IL), and p-values under 0.05 were considered significant.


Of the 72 patients enrolled, 57 had completed data for the considered variables (79.2% of total sample).

The optimum number of clusters was estimated by visual inspection of the dendrogram (Figure 1) and by representation of the difference between consecutive clusters (Fig. E1). A second cluster method was used (two-step cluster approach) to ensure that Ward's cluster solutions were not bias due to small sample size or by the measurement level of the considerable variables. For both clustering methods, the five cluster solution was the one that, from a clinical perspective, best explained the results.

dendrogram obtained using ward's method.

Figure 1. Dendrogram obtained using Ward's method.

Elements were then distributed in five clusters. From variables that showed significance in distribution the following stand out: gender, age, BMI, number of exacerbations in previous year, disease severity, disease control, scores in ACT, CARAT, ALQ, SOA, ICS dose, fixed obstruction, basal FEV1, blood eosinophils (Table 2).

Table 2. Main characteristics obtained for total sample and each of five clusters.


Qualitative (n, %)

Quantitative (M ± SD)
Total (n = 57)C1 (n = 6)C2 (n = 15)C3 (n = 10)C4 (n = 19)C5 (n = 7)Statistical analysis
Female42 (73.7)2 (4.8)12 (28.6)10 (23.8)18 (42.9)0 (0.0)Fisher = 29.5
Male15 (26.3)4 (26.7)3 (20.0)0 (20.0)1 (6.7)7 (46.7)p < 0.001
Age (years)45.6 ± 18.023.0 ± 6.853.7 ± 15.925.9 ± 8.449.9 ± 11.964.3 ± 8.3F(4,52) = 19.0

p < 0.001
BMI (kg/m2)27.8 ± 6.021.2 ± 1.828.1 ± 4.726.9 ± 7.030.3 ± 6.827.3 ± 1.4F(4,52) = 3.2

p = 0.020
Years of disease (years)24.0 ± 14.012.8 ± 8.731.8 ± 18.514.3 ± 7.424.4 ± 13.029.7 ± 13.8F(4,52) = 3.8

p = 0.009
Disease onset under age 12 Y
No27 (47.4)2 (7.4)6 (22.2)1 (3.7)11 (40.7)7 (25.9)Fisher = 15.4
Yes30 (52.6)4 (13.3)9 (30.0)9 (30.0)8 (26.7)0 (0.0)p = 0.003
IgE sensitization
No18 (31.6)0 (0.0)5 (27.8)2 (11.1)9 (50.0)2 (11.1)Fisher = 5.3
Yes39 (68.4)6 (15.4)10 (25.6)8 (20.5)10 (25.6)5 (12.8)p = 0.255
Any smoke exposure
No36 (63.2)4 (11.1)10 (27.8)7 (19.4)13 (36.1)2 (5.6)Fisher = 3.9
Yes21 (36.8)2 (9.5)5 (23.8)3 (14.3)6 (28.6)5 (23.8)p = 0.432
Pneumonia history
No36 (63.2)5 (13.9)10 (27.8)6 (16.7)11 (30.6)4 (11.1)Fisher = 1.6
Yes21 (36.8)1 (4.8)5 (23.8)4 (19.0)8 (38.1)3 (14.3)p = 0.857
No10 (17.5)0 (0.0)5 (10.0)1 (10.0)4 (40.0)0 (0.0)Fisher = 4.6
Yes47 (82.5)6 (12.8)10 (21.3)9 (19.1)15 (31.9)7 (14.9)p = 0.282
Anxiety (yes)13 (22.8)2 (15.4)5 (38.5)1 (7.7)5 (38.5)0 (0.0)Fisher = 4.3

p = 0.361
Depression (yes)6 (10.5)0 (0.0)3 (50.0)0 (0.0)3 (50.0)0 (0.0)Fisher = 3.4

p = 0.482
No. of comorbidities3.7 ± 1.83.5 ± 1.03.6 ± 1.43.6 ± 1.43.3 ± 1.45.6 ± 3.0F(4,52) = 2.6

p = 0.046
Disease control
Yes10 (17.5)4 (40.0)5 (50.0)1 (10.0)0 (0.0)0 (0.0)Fisher = 15.7
No47 (82.5)2 (4.3)10 (21.3)9 (19.1)19 (40.4)7 (14.9)p = 0.001
Asthma control test (ACT)19.4 (4.35)23.5 ± 1.421.8 ± 2.119.6 ± 4.017.7 ± 3.214.8 ± 6.7F(4,52) = 7.6

p < 0.001
CARAT (Rhinitis)6.7 (3.3)6.2 ± 1.89.3 ± 2.45.5 ± 2.16.0 ± 3.75.6 ± 4.0F(4,52) = 3.7

p = 0.010
CARAT (Asthma)11.7 (4.5)15.7 ± 2.515.5 ± 3.411.1 ± 3.48.7 ± 3.19.4 ± 5.0F(4,52) = 9.5

p < 0.001
Asthma life quality (ALQ)11.7 ± 3.87.7 ± 3.39.6 ± 2.911.7 ± 3.514.2 ± 2.712.9 ± 4.0F(4,52) = 7.5

p < 0.001
Severe asthma score (SOA)10.3 ± 3.94.8 ± 2.58.7 ± 2.710.3 ± 4.011.6 ± 2.515.0 ± 2.8F(4,52) = 9.4

p < 0.001
Therapeutics (GINA)
Until step33 (5.3)0 (0.0)2 (66.6)1 (33.3)0 (0.0)0 (0.0) 
Step314 (24.6)5 (35.7)1 (7.1)4 (28.6)4 (28.6)0 (0.0)Fisher = 22.1
Step437 (64.9)1 (2.7)12 (32.4)5 (13.5)14 (37.8)5 (13.5)p = 0.003
Step53 (5.3)0 (0.0)0 (0.0)0 (0.0)1 (33.3)2 (66.7) 
High-dose ICS
No35 (61.4)6 (17.1)13 (37.1)7 (20.0)9 (25.7)0 (0.0)Fisher = 20.6
Yes22 (38.6)0 (0.0)2 (9.1)3 (13.6)10 (45.5)7 (31.8)p < 0.001
OCS (last year)
026 (45.6)5 (19.2)11 (42.3)4 (15.4)6 (23.1)0 (0.0)Fisher = 20.9
118 (31.6)1 (5.6)3 (16.7)5 (27.8)7 (38.9)2 (11.1)p = 0.013
26 (10.5)0 (0.0)0 (0.0)0 (0.0)4 (66.7)2 (33.3) 
≥37 (12.3)0 (0.0)1 (14.3)1 (14.3)2 (28.6)3 (42.9) 
Severe exacerbation (previous year)
No27 (47.4)5 (18.5)11 (40.7)2 (7.4)8 (29.6)1 (3.7)Fisher = 12.9
Yes30 (52.6)1 (3.3)4 (13.3)8 (26.7)11 (36.7)6 (20.0)p = 0.009
FEV1 (%)
Baseline92.6 ± 26.4114.0 ± 27.288.0 ± 24.0114.7 ± 13.887.5 ± 24.366.1 ± 16.2F(4,52) = 6.7

p < 0.001
FEV1/CVF (%)
Baseline76.7 ± 17.296.3 ± 5.771.1 ± 7.993.7 ± 15.473.4 ± 15.356.3 ± 6.6F(2,52) = 14.9

p < 0.001
Fixed obstruction
No39 (68.4)6 (15.4)10 (25.6)10 (25.6)13 (33.3)0 (0.0)Fisher = 21.4
Yes18 (31.6)0 (0.0)5 (27.8)0 (0.0)6 (33.3)7 (38.9)p < 0.001
Non-severe24 (42.1)6 (25.0)7 (29.2)6 (25.0)5 (20.8)0 (0.0)Fisher = 16.8
Severe33 (57.9)0 (0.0)8 (24.2)4 (12.1)14 (42.4)7 (21.2)p = 0.001
FeNO (ppb):
<3520 (60.6)2 (10.0)4 (20.0)5 (25.0)8 (40.0)1 (5.0)Fisher = 4.8
≥3513 (39.9)3 (23.1)3 (23.1)2 (15.4)2 (15.4)3 (23.1)p = 0.320
Serum IgE (log (mmol/L))2.2 ± 0.62.7 ± 0.22.3 ± 0.72.1 ± 0.71.9 ± 0.42.5 ± 0.4F(4,52) = 3.2

p = 0.019
Blood eosinophils (%):3.3 ± 3.33.3 ± 1.61.5 ± 1.31.8 ± 1.73.9 ± 1.77.7 ± 6.8F(4,52) = 7.0

p < 0.001
Sputum eosinophils (%)24.0 ± 26.644.5 ± 48.8 (n = 2)21.3 ± 25.6 (n = 7)5.8 ± 6.3 (n = 4)24.3 ± 30.3 (n = 11)32.0 ± 21.4 (n = 6)n.a.
Sputum neutrophils (%)50.8 ± 30.838.0 ± 53.7 (n = 2)59.4 ± 31.8 (n = 7)36.3 ± 32.1 (n = 4)50.6 ± 32.1 (n = 11)55.2 ± 25.7 (n = 6)n.a.

*p value from analysis of variance or Chi-square analysis between five clusters.

High-dose ICS: dose equivalent to over 750 μg fluticasone propionate daily.

Definition of abbreviations: BMI: body mass index; CARAT: control of allergic rhinitis and asthma test; CS: corticosteroids; FeNO: fractional concentration of nitric oxide in exhaled air; ICS: inhaled corticosteroids; LABA: long-acting b-agonists; NSAID HS: nonsteroidal anti-inflammatory drugs hypersensitivity; OCS: oral corticosteroids.

Sample characteristics obtained for total sample and each of five clusters are depicted in Table 2.

Cluster description

Cluster1 (n = 6), early onset mild allergic asthma, with eosinophilic inflammation: it was the youngest and the least severe group, with a male prevalence. Subjects showed the lowest grade of obesity, the best lung function and disease control and the lowest health care recurrence, despite elevated FeNO values.

Cluster2 (n = 15), moderate allergic asthma, long evolution, female prevalence, mixed inflammation: although the older age and the high BMI, questionnaires showed good disease control, low impact of disease (both in life quality and lung function) and low severity.

Regarding biomarkers, there was no blood eosinophilia and sputum neutrophil percentage was the highest of all groups.

Cluster3 (n = 10), allergic brittle asthma, young females, early onset, no evidence of inflammation: mostly with disease onset before the age of 12, atopy was present in 80% of them. Mean BMI was lower, compared to C2. Lung function was normal in all evaluated parameters and disease was more frequently classified as non-severe. We found a good score in the ACT questionnaire, with normal lung function and no evidence of eosinophilic inflammation, despite high hospitalizations and severe exacerbation rates, suggesting a brittle phenotype.15

Cluster 4 (n = 19), severe asthma in obese females, late onset, mixed inflammation, highly symptomatic: this was the most prevalent and obese group, with a marked female prevalence, the majority having disease onset after the age of 12. Atopy was less frequent, compared with other groups, and depression-anxiety had more protagonists. None of these subjects showed disease control, reporting the worst quality of life of all groups.

Despite the high grade of symptoms and therapeutics, with frequent use of OCS and emergency care, lung function was not very impaired. There was a low grade of Th2 inflammation as we found a low percentage of blood eosinophils, with FeNO values frequently inferior to 35 ppb.

Cluster5 (n = 7), severe asthma with chronic airflow obstruction, late onset, long evolution, eosinophilic inflammation: subjects were all male, with age of disease onset above 12 years old, and presented a mean BMI of over 25 kg/m2. The mean age was the highest, with long disease evolution and more comorbidities (in particular IgE sensitization, smoke exposure and NSAID's HS). The ACT and CARAT scores were low, and ALQ and SOA were high, with frequent severe exacerbations, hospitalizations and use of OCS. Most of them showed CT scan abnormalities, air trapping being the most prevalent. Lung function evaluation showed the worst FEV1 out of all groups, with fixed obstruction in all patients (two with COPD overlap). The blood eosinophils count was the highest noted and FeNO was frequently over 35 ppb. Sputum analysis showed a mixed inflammation.


According to Moore et al.,1 the algorithm approach using 5 clusters is sufficient for the classification of disease severity. We have here confirmed many results of Moore et al. but found other parameters of interest such as age, weight, disease control or severity, quality of life and blood eosinophilia. In other studies,2, 3 some of these characteristics were used in the cluster analysis, suggesting our data relevance.

Cluster discussion

Cluster1 (early onset mild allergic asthma, eosinophilic inflammation) overlapped with the less severe groups from other large-scale studies: C1 from Haldar et al.,2 C1 from Moore et al.1 and C1 from Wu et al.3

Cluster2 (moderate allergic asthma, long evolution, female prevalence, mixed inflammation), appears to overlap with some of the characteristics from C5 in Moore et al.1 (female prevalence, older age, high BMI). However there is differentiation in other variables (questionnaires showed good disease control, low impact of disease and low grade of severity).

Once asthmatic phenotypes are not static, with intrinsic and extrinsic factors acting as modifiers, it appears that C2 may be an evolution of mild allergic asthma, probably due to weight gain. This could modify the immunocellular response of early-onset allergic asthma into a mixed phenotype obesity related.

Cluster3 (allergic brittle asthma, young females, early onset, no evidence of inflammation) shared some of the characteristics with C3 from Haldar et al.2 (early onset asthma, symptom predominance) and with C1 from Moore et al.1 (mild allergic asthma). Despite the good score in the ACT questionnaire, a discordant score in CARAT (rhinitis) was noticed, showing a possible correlation to aeroallergen exacerbation triggers, responsible for the exacerbation rates, as described in the literature. The mean age of this group, as well as the female prevalence, may suggest some correlation of exacerbations to hormonal changes.

Cluster 4 (severe asthma in obese females, late onset, mixed inflammation, highly symptomatic) overlapped with C2 from Haldar et al.2 and C3 from Moore et al.1 (non-Th2 asthma-obesity related). The discordance between the high-intensity treatment, low disease control and relatively preserved lung function suggests a relation to factors, such as mechanics and psychogenics, and to concurrent comorbidities (like GERD or sedentary lifestyle). Another factor that might explain insensitivity to CS and better response to obesity-targeted treatment, already described in this phenotype,16 is the noticed low grade of Th2 inflammation. This gains importance as it points to a pathobiologic disease mechanism different from the Th2-eosinophilic phenotype.

Cluster5 (severe asthma with chronic airflow obstruction, late onset, long evolution, eosinophilic inflammation) presented the highest severity grade, corresponding to C5 of Moore et al.1 (severe patients with fixed obstruction) and C4 of Haldar et al.2 (asthma with predominant inflammation). Coincident with cluster results of a more recent study3 we verified blood eosinophilia and high FeNO values, most having eosinophilic or mixed sputum phenotype, with high blood eosinophils count, according to current data regarding severe patients.

This cluster also shares many of the characteristics found by the TENOR study17 for patients with asthma and chronic airway obstruction, being older, with frequent history of smoke exposure and many years of disease evolution, showing eosinophilic inflammation. Consistently, it is known that one of the mechanisms for persistent inflammation in asthmatics despite CS treatment is the reduction of HDAC2.18 Other pathobiologic mechanisms may be involved in this group as it had the highest prevalence of NSAID's HS.

According to Wenzel's questions regarding phenotype distinction5 (age of disease onset, eosinophilic inflammation and allergy), C4 and C5 (more severe groups) had disease onset after 12 years of age, and C3 and C2 before that age. C1 and C5 (less severe and most severe groups) showed an eosinophilic inflammation, and allergy was present in all groups.

We can then match our clusters to Wenzel endotypes:19 C1 is clearly associated with Th2 response; C4 with non-Th2 response; C2 and C5 with a mixed response with Th2 prevalence. Possible exogenous factors could be related with the mixed response in C5 (such as smoke exposure) and in C2 (weight could have modified the original endotype of these females) (Figure 2).

adaptation of obtained clusters to wenzel endotypes.

Figure 2. Adaptation of obtained clusters to Wenzel endotypes.

Limitations and strengths

Cluster analysis requires a large number of patients and this study may give the impression of being underpowered. However, both methods used to assess cluster analysis (Ward method and two-step method) empirically validate our cluster results.

Another limitation is the applicability of our results to all clinical scenarios: our approach, applied to patients treated in a secondary health care unit, may not apply in primary health care.


The results of this cluster analysis are consistent with those from other larger-scale studies. C1, early-onset mild allergic asthma, was recognized in previous Moore, Haldar and Wu clusters; C2 and C3, female clusters with symptom predominant early-onset disease in Haldar and Wu clusters; C4, a female obese cluster less allergic with late onset disease, in Moore clusters; C5, a late-onset eosinophilic cluster associated with nasal polyposis and mixed inflammatory cellular phenotype, in Moore, Haldar and Wu clusters.

Variables such as age at disease onset, obesity, lung function, FeNO (as Th2 biomarker) and severity were important for cluster characterization and distinction.

Authors’ contribution statements

CCL contributed to the study design, collected clinical data, contributed to the interpretation and global integration of the results, and wrote the manuscript draft; PSC performed the cluster analysis and statistical data analysis, and contributed to the interpretation of the results; ATB contributed to the study design and manuscript revision; JB contributed to the discussion on severe asthma and the writing of the paper.

Ethical disclosuresProtection of human and animal subjects

The authors declare that no experiments were performed on humans or animals for this study.

Confidentiality of data

The authors declare that no patient data appear in this article.

Right to privacy and informed consent

The authors declare that no patient data appear in this article.

Conflict of interest

The authors stated that there are no conflicts of interest regarding the publication of this article.


PSC's work was supported by Portuguese funds through the CIDMA – Center for Research and Development in Mathematics and Applications, and the FCT within the project UID/MAT/04106/2013.

We thank Anna Bedbrook for writing assistance.

Appendix A. Supplementary data

Supplementary material associated with this article can be found in the online version available at doi:10.1016/j.rppnen.2015.07.006.

Appendix A. Supplementary data

The following are the supplementary data to this article:

Received 28 May 2015

Accepted 17 July 2015

Corresponding author. cchloureiro@gmail.com

Identification of Asthma Phenotypes Using Cluster Analysis in the Severe Asthma Research Program. Am J Respir Crit Care Med. 2010; 181:315-23.
Cluster Analysis and Clinical Asthma Phenotypes. Am J Respir Crit Care Med. 2008; 178:218-24.
Unsupervised phenotyping of Severe Asthma Research Program participants using expanded lung data. J Allergy Clin Immunol. 2014; 133:1280-8.
Asthma endotypes: a new approach to classification of disease entities within the asthma syndrome. J Allergy Clin Immunol. 2011; 127:355-60.
Complex phenotypes in asthma: current definitions. Pulm Pharmacol Ther. 2013; 26:710-5.
From the Global Strategy for Asthma Management and Prevention, Global Initiative for Asthma (GINA) 2012. Available from: http://www.ginasthma.org/ [accessed 02.02.14].
Airway responsiveness: standardized challenge testing with pharmacological, physical and sensitizing stimuli in adults. Report of the Working Party Standardization of Lung Function Tests, European Community for Steel and Coal. Official Statement of the European Respiratory Society. Eur Respir J Suppl. 1993; 16:53-83.
An official American Thoracic Society/European Respiratory Society statement: asthma control and exacerbations standardizing endpoints for clinical asthma trials and clinical practice. Am J Respir Crit Care Med. 2009; 59-99.
Standardisation of spirometry. Eur Respir J. 2005; 26:319-38.
Standardisation of the measurement of lung volumes. Eur Respir J. 2005; 26:511-22.
Recommendations for standardized procedures for the on-line and off-line measurement of exhaled lower respiratory nitric oxide and nasal nitric oxide in adults and children-1999. Am J Respir Crit Care Med. 1999; 160:2104-17.
Methods of sputum processing for cell counts, immunocytochemistry and in situ hybridisation. Eur Respir J. 2002; 20:19s-23s.
A clustering technique for summarising multivariate data. Behav Sci. 1967; 12:153-5.
A robust and scalable clustering algorithm for mixed type attributes in large database environment. Proceedings of the 7th ACM SIGKDD international conference in knowledge discovery and data mining. San Francisco, CA: Association for Computing Machinery; 2001. 263-8.
Brittle asthma. Thorax. 1998; 53:315-21.
Asthma and obesity. Clin Exp Allergy. 2013; 43:8-21.
Design and baseline characteristics of the epidemiology and natural history of asthma: Outcomes and Treatment Regimens (TENOR) study: a large cohort of patients with severe or difficult-to-treat asthma. Ann Allergy Asthma Immunol. 2004; 92:32-9.
Histone acetylation and deacetylation: importance in inflammatory lung diseases. Eur Respir J. 2005; 25:552-63.
Asthma phenotypes: the evolution from clinical to molecular approaches. Nat Med. 2012; 18:716-25.
NOTICE Undefined index: nombre (includes_ws/librerias/html/item.php[1239])
Corresponding author. cchloureiro@gmail.com ( Loureiro cchloureiro@gmail.com)
Rev Port Pneumol 2015;21:299-306 - Vol. 21 Num.6 DOI: 10.1016/j.rppnen.2015.07.006
Cookies Policy
To improve our services and products, we use cookies (own or third parties authorized) to show advertising related to client preferences through the analyses of navigation customer behavior. Continuing navigation will be considered as acceptance of this use. You can change the settings or obtain more information by clicking here.