Skip to main content

Development and validation of a nomogram model for prediction of stroke-associated pneumonia associated with intracerebral hemorrhage



We aimed to establish risk factors for stroke-associated pneumonia (SAP) following intracerebral hemorrhage (ICH) and develop an efficient and convenient model to predict SAP in patients with ICH.


Our study involved 1333 patients consecutively diagnosed with ICH and admitted to the Neurology Department of the First Affiliated Hospital of Wenzhou Medical University. The 1333 patients were randomly divided (3:1) into the derivation cohort (n = 1000) and validation Cohort (n = 333). Variables were screened from demographics, lifestyle-related factors, comorbidities, clinical symptoms, neuroimaging features, and laboratory tests. In the derivation cohort, we developed a prediction model with multivariable logistic regression analysis. In the validation cohort, we assessed the model performance and compared it to previously reported models. The area under the receiver operating characteristic curve (AUROC), GiViTI calibration belt, net reclassification index (NRI), integrated discrimination index (IDI) and decision curve analysis (DCA) were used to assess the prediction ability and the clinical decision-making ability.


The incidence of SAP was 19.9% and 19.8% in the derivation (n = 1000) and validation (n = 333) cohorts, respectively. We developed a nomogram prediction model including age (Odds Ratio [OR] 1.037, 95% confidence interval [CI] 1.020–1.054), male sex (OR 1.824, 95% CI 1.206–2.757), multilobar involvement (OR 1.851, 95% CI 1.160–2.954), extension into ventricles (OR 2.164, 95% CI 1.456–3.215), dysphagia (OR 3.626, 95% CI 2.297–5.725), disturbance of consciousness (OR 2.113, 95% CI 1.327–3.362) and total muscle strength of the worse side (OR 0.93, 95% CI 0.876–0.987). Compared with previous models, our model was well calibrated and showed significantly higher AUROC, better reclassification ability (improved NRI and IDI) and a positive net benefit for predicted probability thresholds between 10% and 73% in DCA.


We developed a simple, valid, and clinically useful model to predict SAP following ICH, with better predictive performance than previous models. It might be a promising tool to assess the individual risk of developing SAP for patients with ICH and optimize decision-making.

Peer Review reports


Stroke-associated pneumonia (SAP) was defined as pneumonia complicating the first 7 days after stroke onset in nonventilated patients [1]. As a frequently encountered complication after stroke, SAP developed in 6.5-33% patients with stroke [2,3,4,5,6,7]. SAP increases the economic burden on stroke patients [8], lengthens the hospitalization duration, and is associated with a poor prognosis by increasing the risk of a negative and fatal outcome [9]. Therefore, prompt identification of risk factors associated with SAP and making preventive interventions may help reduce the risk of developing SAP and thus benefiting the patients.

Several risk factors related to the development of SAP have been identified in previous studies. These include older age, male, hypertension, diabetes, atrial fibrillation [10], previous history of chronic obstructive pulmonary disease (COPD) [11], dysphagia [12], pre-stroke dependence [11], intracerebral hemorrhage (ICH) [13], higher admission National Institutes of Health Stroke Scale (NIHSS) score [6], lower Glasgow Coma Scale score (GCS) [14], infratentorial location [11], extension of hemorrhage into ventricles [11], hematoma volume [11], stroke-induced immunodepression syndrome [12], and increased C-reactive protein [15]. Based on these risk factors, some prediction models or scoring systems were constructed to help identify patients at elevated risk of developing SAP [6, 7, 10, 11, 14].

Studies have shown that patients with ICH have a higher risk of developing SAP (8.5-16.9%) [7, 11]compared to patients with ischemic stroke (6.5-12.2%) [6, 7, 10, 13, 14]. However, most of these studies only included patients with ischemic stroke. There are few studies on SAP following ICH. One is ISAN score, which was developed from a multi-center registry including both patients with ischemic stroke and ICH and incorporated parameters including pre-stroke dependence, sex, age, and NIHSS score [7]. Because this model does not include ICH-specific variables, it did not perform well in ICH patients, with a AUROC of 0.71 (0.66 to 0.77) compared to 0.78 (0.76 to 0.80) in the patients with ischemic stroke [7]. Another one was ICH-associated pneumonia score (ICH-APS). It developed two prediction models with or without hematoma volume as an included variable (ICH-APS-B and ICH-APS-A, respectively). Other variables in the prediction models included age, current smoking, excess alcohol consumption, COPD, pre-stroke dependence, admission GCS score, admission NIHSS score, dysphagia, location of ICH, and intraventricular extension [11]. This scoring model is relatively complex, in which the GCS and NIHSS scores overlap to some extent and may have collinearity. Different parts of the NIHSS score have different impacts on SAP development. For example, paralysis and dysphasia probably contributed more to SAP development than ataxia and loss of visual field. Direct use of NIHSS score without considering the specific factors may weaken some critical factors’ influence on SAP.

We aimed to establish risk factors for SAP following ICH from more specific and simplified clinical variables and to build a more efficient and convenient model to predict SAP in patients with ICH.


Study design and source of data

This retrospective cohort study involved 1333 patients consecutively diagnosed with ICH and admitted to the Neurology Department of the First Affiliated Hospital of Wenzhou Medical University from January 2010 to December 2019. Primary inclusion criteria for the study were adult patients (aged ≥ 18 years) diagnosed with ICH confirmed by head CT imaging [16]. Exclusion criteria included (1) crucial clinical data incomplete or missing (e.g., patients without an initial CT performed within 72 h post-ICH); (2) lung infection developed before onset of ICH; (3) severe mental or cognitive impairment before ICH and unable to cooperate with the examination.

We sequentially numbered the 1333 patients included according to the admission date and designated it as the overall cohort dataset. Using the “train_test_split” method in Python, we randomly divided the overall cohort dataset into derivation and validation cohorts at a ratio of 3:1, with 1000 patients in the derivation cohort and 333 patients in the validation cohort. This study obtained approval by the Ethics Committee of the First Affiliated Hospital of Wenzhou Medical University (No.2020 − 185). Informed consent was obtained verbally through telephone interviews with the participants or their legal representatives and how their data will be collected, used, and protected were explained to them. To protect privacy, patients’ identifying information (e.g., names, addresses) were removed from the dataset throughout the research to ensure participants’ anonymity and the data is stored securely using encryption and access controls.


We collected data from the electronic medical records system of the First Affiliated Hospital of Wenzhou Medical University at the time of initial admission, including demographics, life style-associated factors, comorbidities, clinical symptoms, neuroimaging characteristics, and laboratory examinations. Demographic variables included sex aand age at the time of ICH. Lifestyle factors included current smoking and alcohol intake status. The subject’s medical history was notable for comorbidities that comprised hypertension, diabetes, hyperlipidemia, ischemic heart disease, hyperuricemia, and COPD. Variables associated with the clinical symptoms included pre-stroke dependence (modified Rankin Scale [mRS] ≥ 2), presence of dysphagia (assessed with a swallow test by a physical therapist), the status of consciousness (classified as sopor, somnolence or coma), total muscle strength of the worse side (ranging from 0 to10), vomiting after ICH, admission GCS and NIHSS. As for the status of consciousness, somnolence is characterized as a state of strong desire for sleep, or sleeping for unusually long periods but can be arousable by minor stimulation to obey, answer, or respond. Sopor is defined as a condition of abnormally deep sleep that the patients can only be arousable by repeated strong or painful stimulation to respond. Coma is defined as a deep state of prolonged unconsciousness in which a person cannot be awakened, fails to respond normally to stimuli or exhibits reflex responses.

In addition, neuroimaging characteristics of ICH were recorded, including multilobar involvement, deep region involvement, extension of hemorrhage into ventricles, and lesion volume. Volumetric assessment of the lesion location was determined using the ABC/2 method [17]. Based on previous research indicating that hematoma volume greater than 20ml is an important factor associated with poor prognosis [18, 19], we categorized hematoma volume into two groups using 20ml as the boundary. Laboratory examinations at initial admission included red blood cells, platelet, albumin, blood glucose, and creatinine.

The predictive outcome of our study was whether patients developed SAP following ICH. SAP was defined as a range of pulmonary infections affecting the lower respiratory tract that developed within the first 7 days after stroke onset according to the recommendations of the Pneumonia in Stroke Consensus Group. It was diagnosed on a basis of clinical and laboratory indices of respiratory tract infection (e.g., fever, new purulent sputum, cough, bronchial breath sounds or worsening gas exchange), and supported by typical chest radiographs findings [1].

Model development and validation

We randomly divided the study cohort into a derivation and validation cohort at a 3:1 ratio. The candidate variables in the derivation cohort were screened for collinearity using the linear regression test. Variance inflation factor (VIF) > 5 was considered the existence of collinearity. We performed a univariable analysis of the candidate variables in the derivation cohort with simple logistic regression and chose variables with P-value < 0.2 for multivariable analysis. A total of 16 predictive variables were entered into the multivariable logistic regression for model development using the forward stepwise method based on likelihood ratio test. A nomogram model for prediction was constructed, based on the findings of the logistic regression, with the model’s goodness of fit assessed using the Hosmer-Lemeshow test.

In the validation cohort, we assessed the model performance and compared it to previously reported prediction models (details of these baseline models were displayed in Supplementary Tables 2,3) with the area under the receiver operating characteristic curve (AUROC) and accuracy (GiViTI calibration belt), decision curve analysis (DCA)、net reclassification index (NRI) and integrated discrimination index (IDI). The GiViTI calibration belt reveals the relationship between predicted and observed probabilities, including 80% and 95% confidence intervals [20, 21]. DCA was employed to assess the clinical utility, especially the capacity to enhance decision-making, of the prediction models by quantifying the net benefits at various threshold probabilities [22]. NRI and IDI reflect the ability of a new model to appropriately reassign people into different risk strata [23].

Statistical analysis

Continuous variables, such as age and total muscle strength on the worse side, were evaluated in terms of mean ± standard deviation (SD) or median and interquartile range (IQR) as appropriate, while categorical variables were expressed as counts and percentages. Univariable analysis of candidate variables was performed using univariate logistic regression. Variables with a two-sided P-value < 0.2 were entered into the multivariate logistic regression to build the prediction model using the forward stepwise method. The multivariable analysis results were presented using a Forest plot with the ‘forest’ package. P-value < 0.05 was considered statistically significant in multivariate logistic regression and the GiViTI calibration test. Univariable analysis and multivariable logistic regression were performed using IBM SPSS Statistics 25.0 software (IBM Corporation, NY, USA). The AUROC, GiViTI calibration belt, DCA analyses, NRI and IDI were performed using R version 4.2.1 (R Project for Statistical Computing) with the pROC, givitiR, rmda, and PredictABEL libraries (

During statistical analysis, missing data of hematoma volume (22/1333,1.7%) and blood glucose (3/1333,0.23%) were considered missing completely at random (MCAR) [9]. As the missing data was less than 5%, we employed pairwise deletion, an indirect (passive) method, to maximize data utilization.


A total of 1350 patients diagnosed with ICH met the primary inclusion criteria. After reviewing the medical records, we excluded 12 patients without an initial CT performed within 72 h post-ICH, one patient who had severe mental or cognitive disorders before ICH and could not cooperate with the examination, and 4 patients whose lung infection developed before the cerebral hemorrhage. Ultimately, 1333 eligible participants were included in the study for analysis, including 1000 patients in the derivation cohort and 333 patients in the validation cohort (Table 1; Fig. 1).

Table 1 Baseline characteristics
Fig. 1
figure 1

Flow chart of patient recruitment. Flow-chart concisely outlines sequential application of inclusion and exclusion criteria that ultimately lead to the definition of the final study cohort

SAP: stroke-associated pneumonia, ICH: intracerebral hemorrhage

Characteristics and univariable analysis of the derivation cohort

Table 2 displays the characteristics of the study cohort. The derivation cohort comprised 199 (19.9%, n = 199/1000) patients with SAP and 801 (80.1%, n = 801/1000) without SAP. The median age of patients with SAP is 62 (IQR 55–72), which was significantly older than those without SAP (median 59, IQR 52–68, p < 0.001, 95%CI 1.016–1.045). Patients with SAP exhibited a higher likelihood of experiencing impaired consciousness (50.8%, n = 101/199 vs. 16.1%, n = 129/801, p < 0.001, 95%CI 3.838–7.511) and dysphagia (49.7%, n = 99/199 vs. 12.0%, n = 96/801, p < 0.001, 95%CI 5.122–10.320) compared to those without SAP. Patients with SAP demonstrated a significantly lower median total muscle strength on the worse side compared to those without SAP (median 5, IQR 0–8 vs. median 8, IQR 5–9, p < 0.001, 95%CI 0.791–0.866). Multilobar involvement (23.6%, n = 47/199 vs. 11.4%, n = 91/801, p < 0.001, 95%CI 1.628–3.575), lesion volume ≥ 20mL (25.0%, n = 49/196 vs. 11.1%, n = 87/786, p < 0.001, 95%CI 1.808–3.966) and extension of hemorrhage into ventricles (36.2%, n = 72/199 vs. 18.7%, n = 150/801, p < 0.001, 95%CI 1.753–3.454) occurred more frequently in patients with SAP compared to those without SAP. Pre-stroke dependence was present in 7.0% (n = 14/199) of patients with SAP, while only 2.6% (n = 21/801) of those without SAP (p = 0.004, 95%CI 1.403–5.632). In the GCS score, patients with SAP had a greater probability of scoring ≤ 14 than those without SAP (55.3%, n = 110/199 vs. 21.2%, n = 170/801, p < 0.001, 95%CI 3.309–6.361). In addition, patients with SAP exhibited markedly elevated levels of blood glucose and creatinine compared to those without SAP.

Table 2 Univariable analysis

Multivariable analysis and construction of prediction model for SAP

A total of 16 variables with p < 0.2 were entered into the multivariate logistic analysis, among which NHISS was excluded from further analysis due to the existence of collinearity (VIF = 6.899, Supplementary Table 1). These included age, sex (male), smoking, COPD, pre-stroke dependence (mRS ≥ 2), dysphagia, disturbance of consciousness, total muscle strength of the worse side, GCS (≤ 14), multilobar involvement, deep region involvement, extension into ventricles, lesion volume(≥ 20mL), albumin(< 40 g/L), blood glucose (> 6.1mmol/L) and creatinine(> 97μmol/L). The outcomes of the multivariable logistic analysis are shown in Fig. 2A. Variables retained in our model included age (Odds Ratio [OR] 1.037, 95% confidence interval [CI] 1.020–1.054, P < 0.001), sex (OR 1.824, 95% CI 1.206–2.757, P = 0.004), multilobar involvement (OR 1.851, 95% CI 1.160–2.954, P = 0.01 ), extension into ventricles (OR 2.164, 95% CI 1.456–3.215, P < 0.001), dysphagia (OR 3.626, 95% CI 2.297–5.725, P < 0.001), disturbance of consciousness (OR 2.113, 95% CI 1.327–3.362, P = 0.002) and total muscle strength of the worse side (OR 0.93, 95% CI 0.876–0.987, P = 0.017). Upon analyzing the data using logistic regression, we have developed a nomogram that predicts the individualized risk of developing SAP during hospitalization (Fig. 2B). The AUROC of the model was 0.793. The Hosmer–Lemeshow test was not significant(P = 0.203).

Fig. 2
figure 2

Prediction of SAP probability using results of multivariable analysis. A, Forest plot based on the results of multivariable analysis. B, Prediction nomogram based on the results of the multivariable analysis conducted on the entire cohort

OR: Odds Ratio, 95% CI: 95% confidence interval

The application of the nomogram was as follows: based on the nomogram, we first calculate the score for each prediction indicator and then sum them to obtain the total score. The resulting total score can be used to estimate the individualized risk of developing SAP during hospitalization. For example, a male patient (24 points) was 70-year-old (63 points), had a disorder of consciousness (30 points), and had the symptom of dysphagia (52 points). Total muscle strength of the worse side was 0 (30 points), and only one lobe was involved (0 points). The hematoma didn’t extend into the ventricle (0 points). The cumulative score of the prediction indicators was 24 + 63 + 30 + 52 + 30 + 0 + 0 = 199 points, and the corresponding predicted risk for him to develop SAP was about 60% (Fig. 2B).

Evaluation and validation of model performance

As shown in Fig. 3, the GiviTI calibration belt demonstrated that our and ISAN models were well calibrated with no significant deviation between the predicted and actual probabilities (P = 0.837 and P = 1.000, respectively). However, the ICH-A and ICH-B models showed a significant dissimilarity between the predicted and actual probabilities (P < 0.001 and P < 0.001, respectively), which means these models were not well calibrated.

Fig. 3
figure 3

Calibration Belts in the Validation cohort. (A) our prediction model, (B) the ISAN model, (C) the ICH-A model, (D) the ICH-B model. The calibration bands with 80% and 95% confidence levels are shown in light and dark gray, respectively. The bottom-right table shows the predicted probability ranges. P > 0.05 were deemed that the model fit was good

To evaluate the discrimination of the prediction model, we validated the AUROC of our model in the validation cohort and compared it to the ISAN model [7], ICH-APS-A [11] and ICH-APS-B models [11] (Fig. 4A). The AUROC of the validation group (0.8116, 95% CI 0.7499–0.8733) was significantly higher than that of the ISAN and ICH-APS-A/B models (ISAN model: 0.693, 95% CI 0.62–0.766; ICH-A model: 0.7167, 95% CI 0.6448–0.7886; ICH-B model: 0.7228, 95% CI 0.6514–0.7941). Delong’s test revealed a significant difference in AUROC between our model and any of the three previous models (P < 0.05) (Table 3), indicating that our model performs significantly better than these previous models in distinguishing between patients who will develop SAP or not Moreover, our model showed improved performance of reclassification than the ISAN, ICHA and ICHB models, with an NRI of 0.4384 (P < 0.01), 0.5236 (P < 0.01) and 0.4784 (P < 0.01), and an IDI of 0.1279 (P < 0.01), 0.2098 (P < 0.01) and 0.1841 (P < 0.01), respectively (Table 3). Both NRI and IDI quantifies the improvement in a model’s ability to discriminate between two groups or classes (e.g., diseased vs. non-diseased) compared to a baseline or reference model. The NRI calculates the proportion of individuals whose risk classification is improved minus the proportion whose classification is worsened by the new model, compared to the baseline previous models; while the IDI calculates the difference in the average predicted probabilities of the positive and negative classes between the two models. Therefore, these results indicated that our model had a substantially improvement in predictive accuracy over previous models.

Fig. 4
figure 4

ROC and decision curves in the Validation cohort. A, ROC curves of our model (red curve), ISAN model (orange curve), and ICH-A model (green curve) and ICH-B models (blue curve) of SAP in the validation cohort ROC: receiver operating characteristic, AUC: area under the curve; B, Decision curves of our model (red curve), ISAN model (orange curve), and ICH-A model (green curve) and ICH-B models (blue curve) in the validation cohort

DCA: decision curve analysis

Table 3 Performance metrics

In DCA (Fig. 4B), the curve for our model showed a positive net benefit for the threshold probabilities between 10% and 73% compared to the strategies of assuming that all or none of the patients had SAP (i.e., treat-all or treat-none strategies). Moreover, the net benefit of our model is significantly better than any of the three previous models within the threshold probabilities of 10 − 73%.


In this study, we developed a simple, valid, and clinically useful model to predict the probability of developing SAP for individual patients with ICH. This nomogram model incorporated simple risk factors, including male gender, older age, symptoms of dysphagia, disturbance of consciousness, weaker muscle strength, ICH with multilobar involvement and extension into ventricles. All the variables are easy to collect immediately after diagnosis of ICH. With this model, clinicians can quickly calculate the risk of individual patients developing SAP, which may help with effective management decisions. Thus, those with a high risk of developing SAP may benefit from more intensive care, preventative interventions, and earlier treatment.

Several risk factors for developing SAP have been identified in previous studies. Among them, older age is a well-recognized risk factor for pneumonia [24] and has been consistently found in most of the studies for SAP, either in ischemic or hemorrhagic stroke [10, 11]. This could be attributed to the gradual decline in people’s immune function with increasing age, making older individuals more susceptible to infections, including SAP [25]. Male sex is included in our and the ISAN model but not the ICH-APS model. This is probably because current smoking and excess alcohol consumption, which were more frequent in males, are already incorporated as risk factors in the ICH-APS model [7, 11]. Pre-stroke dependence is consistently included in the ISAN and ICH-APS model models, although they were defined as ≥ 2 in the ISAN model [7] while as ≥ 3 in the ICH-APS model [11]. In this study, although more patients with SAP had pre-stroke dependence (mRS ≥ 2), it was not included in the final model after regression analysis. A possible explanation might be that we directly included ‘total muscle strength of the worse side’ as a variable, which reflects not only the affected limbs of the index stroke but also the pre-stroke disability.

Regarding variables associated with stroke symptoms, compared to the ISAN and ICH-APS model, which included an overall NHISS score, our model independently included muscle strength (represented as total muscle strength of the worse side), dysphagia and disturbance of consciousness. We chose the three variables because they were theoretically critical neurological signs associated with SAP as both decreased limb muscle strength and disturbed consciousness can impair the patient’s ability to change positions, leading to pulmonary fluid retention and accumulation of secretions and so increasing the risk of pneumonia, while dysphagia may result in aspiration and increase the risk of SAP. They were consistently recognized as risk factors in previous studies for SAP as well [11, 26, 27]. In addition, they are simpler to evaluate and have a lower collinear relationship to the other variables compared to the NHISS score.

As an ICH-specific characteristic, ‘extension into ventricles’ is a significant and independent factor contributing to both morbidity and mortality [28] and its inclusion as a risk factor in both ICH-APS model and ours is not surprising. As for other hemorrhage-specific characteristics, the hematoma volume is included in the ICH-APS-B model, while multilober involvement is in our model. To some extent, these two variables are similar, as a hemorrhage involving multiple lobes usually have a larger volume than those within a lobe. As for ‘multilobe involvement’, a hemorrhage involving multiple lobes usually have a larger volume than those within a lobe. Regarding the mechanism, this was probably associated with decreased degree of systemic immune regulation due to a large hematoma [29]. However, compared to calculating the volume of ICH, multilobe involvement is easier to determine.

Except for convenience, the more important aim of our study is to improve the model performance for the prediction of SAP compared to existing SAP prediction models developed for ICH. As for the calibration, our model and ISAN model obtained good agreement between observed outcomes and predictions, while the ICH-APS-A and ICH-APS-B models were not well calibrated with significant underestimation. Regarding discrimination, our nomogram model performed significantly better than the ISAN and ICH-APS-A/B models, with an AUROC of 0.8116 versus 0.6930 and 0.7167/0.7228 in our validation sample. As AUROC may not be the optimal measure for evaluating models aimed at predicting future risk or stratifying individuals into risk categories [30] and not sensitive in comparing models with a well-performed baseline model [31], we used NRI and IDI as complement measures to test the discrimination ability [32, 33]. The results further supported that our model offered significant improvement over previous models. Although a model with better discrimination and calibration has theoretically been a better guide to clinical management [34], they are not enough to evaluate whether the prediction model improves clinical decision-making. Therefore, we conducted DCA to assess the clinical decision-making ability of the models [35]. As a result, our model demonstrated a positive net benefit for predicted probability thresholds between 10% and 73%. All these analyses further demonstrated that our model was a more accurate prediction model to identify patients with ICH at higher risk of developing SAP. This allows healthcare providers to tailor the allocation of medical resources to take more intensive care or interventions for these high-risk patients to prevent the progression of SAP or reduce its severity, thus contributing to an overall improved prognosis of the population with ICH.

Our study had several limitations. First, our study included only patients admitted into the neurology department, and those who died in the emergency department or were admitted to the intensive care unit were not included. This will inevitably result in selection bias and the underestimation of outcome events (SAP). Therefore, our model is more applicable to ICH patients with mild to moderate stroke severity compared to those with severe conditions. However, this problem also existed in the previous studies, as the median of NIHSS score of the study subjects was 9 for the ICH-APS-A/B models (IQR 3–16) and 4 for the ISAN model (IQR 2–9). Although the severity of our study population is comparable to these previous studies, caution should be exercised when interpreting the results due to the common selection bias in the patient populations of all three studies. Further studies should be conducted to validate the predictive performance of the models for those with severe stroke. Second, as a retrospective study, 12 patients were excluded from the study due to without an initial CT performed within 72 h post-ICH. This might also lead to the objective existence of selection bias. Third, other clinical data not considered in the existing model may have a confounding effect and impact the pneumonia risk. For example, the patient’s medication history, such as use of glucocorticoids or immunosuppressive drugs, may result in an immunosuppressive state and influence the risk of developing SAP. Lastly, this is a single-center study with internal verification conducted on patients in a tertiary hospital. Therefore, it is not clear whether the model can be applied to external patients, especially those from secondary and primary healthcare institutions. To address these limitations, in future investigations, we should carry out a large-sized prospective study, including not only patients admitted to the neurology department but also those who experienced mortality in the emergency department or were admitted to the intensive care unit, and conduct external validation at both secondary and primary healthcare institutions.


In summary, male patients with older age, multilobar involvement, the extension of ICH into ventricles, dysphagia, disturbance of consciousness, and worse muscle strength were at higher risk of developing SAP following ICH. The nomogram model obtained in this study is convenient and shows better predictive performance than previous models. Therefore, it might be a promising tool to assess the individual risk of developing SAP for patients with ICH and thus facilitate preventive measures. Nevertheless, a further improved prediction model can be achieved through a larger-sized well-designed prospective research with external validation in the future.

Data Availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.


  1. Smith CJ, Kishore AK, Vail A, Chamorro A, Garau J, Hopkins SJ, Di Napoli M, Kalra L, Langhorne P, Montaner J, et al. Diagnosis of Stroke-Associated Pneumonia: recommendations from the Pneumonia in Stroke Consensus Group. Stroke. 2015;46(8):2335–40.

    Article  PubMed  Google Scholar 

  2. Teh WH, Smith CJ, Barlas RS, Wood AD, Bettencourt-Silva JH, Clark AB, Metcalf AK, Bowles KM, Potter JF, Myint PK. Impact of stroke-associated pneumonia on mortality, length of hospitalization, and functional outcome. Acta Neurol Scand. 2018;138(4):293–300.

    Article  CAS  PubMed  Google Scholar 

  3. Chumbler NR, Williams LS, Wells CK, Lo AC, Nadeau S, Peixoto AJ, Gorman M, Boice JL, Concato J, Bravata DM. Derivation and validation of a clinical system for predicting pneumonia in acute stroke. Neuroepidemiology. 2010;34(4):193–9.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Kishore AK, Vail A, Chamorro A, Garau J, Hopkins SJ, Di Napoli M, Kalra L, Langhorne P, Montaner J, Roffe C, et al. How is pneumonia diagnosed in clinical stroke research? A systematic review and meta-analysis. Stroke. 2015;46(5):1202–9.

    Article  PubMed  Google Scholar 

  5. Emsley HCA, Hopkins SJ. Acute ischaemic stroke and infection: recent and emerging concepts. Lancet Neurol. 2008;7(4):341–53.

    Article  PubMed  Google Scholar 

  6. Hoffmann S, Malzahn U, Harms H, Koennecke HC, Berger K, Kalic M, Walter G, Meisel A, Heuschmann PU. Development of a clinical score (A2DS2) to predict pneumonia in acute ischemic stroke. Stroke. 2012;43(10):2617–23.

    Article  PubMed  Google Scholar 

  7. Smith CJ, Bray BD, Hoffman A, Meisel A, Heuschmann PU, Wolfe CD, Tyrrell PJ, Rudd AG. Intercollegiate stroke Working Party G: can a novel clinical risk score improve pneumonia prediction in acute stroke care? A UK multicenter cohort study. J Am Heart Assoc. 2015;4(1):e001307.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Katzan IL, Dawson NV, Thomas CL, Votruba ME, Cebul RD. The cost of pneumonia after acute stroke. Neurology. 2007;68(22):1938–43.

    Article  CAS  PubMed  Google Scholar 

  9. Koennecke HC, Belz W, Berfelde D, Endres M, Fitzek S, Hamilton F, Kreitsch P, Mackert BM, Nabavi DG, Nolte CH, et al. Factors influencing in-hospital mortality and morbidity in patients treated on a stroke unit. Neurology. 2011;77(10):965–72.

    Article  PubMed  Google Scholar 

  10. Huang GQ, Lin YT, Wu YM, Cheng QQ, Cheng HR, Wang Z. Individualized prediction of Stroke-Associated Pneumonia for patients with Acute ischemic stroke. Clin Interv Aging. 2019;14:1951–62.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Ji R, Shen H, Pan Y, Du W, Wang P, Liu G, Wang Y, Li H, Zhao X, Wang Y, et al. Risk score to predict hospital-acquired pneumonia after spontaneous intracerebral hemorrhage. Stroke. 2014;45(9):2620–8.

    Article  PubMed  Google Scholar 

  12. Hoffmann S, Harms H, Ulm L, Nabavi DG, Mackert BM, Schmehl I, Jungehulsing GJ, Montaner J, Bustamante A, Hermans M, et al. Stroke-induced immunodepression and dysphagia independently predict stroke-associated pneumonia - the PREDICT study. J Cereb Blood flow Metabolism: Official J Int Soc Cereb Blood Flow Metabolism. 2017;37(12):3671–82.

    Article  Google Scholar 

  13. Westendorp WF, Vermeij JD, Hilkens NA, Brouwer MC, Algra A, van der Worp HB, Dippel DW, van de Beek D, Nederkoorn PJ. Development and internal validation of a prediction rule for post-stroke infection and post-stroke pneumonia in acute stroke patients. Eur Stroke J. 2018;3(2):136–44.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Ji R, Shen H, Pan Y, Wang P, Liu G, Wang Y, Li H, Wang Y. Novel risk score to predict pneumonia after acute ischemic stroke. Stroke. 2013;44(5):1303–9.

    Article  CAS  PubMed  Google Scholar 

  15. Kalra L, Smith CJ, Hodsoll J, Vail A, Irshad S, Manawadu D. Elevated C-reactive protein increases diagnostic accuracy of algorithm-defined stroke-associated pneumonia in afebrile patients. Int J Stroke: Official J Int Stroke Soc. 2019;14(2):167–73.

    Article  Google Scholar 

  16. Hemphill JC 3rd, Greenberg SM, Anderson CS, Becker K, Bendok BR, Cushman M, Fung GL, Goldstein JN, Macdonald RL, Mitchell PH, et al. Guidelines for the management of spontaneous intracerebral hemorrhage: a Guideline for Healthcare Professionals from the American Heart Association/American Stroke Association. Stroke. 2015;46(7):2032–60.

    Article  PubMed  Google Scholar 

  17. Kothari RU, Brott T, Broderick JP, Barsan WG, Sauerbeck LR, Zuccarello M, Khoury J. The ABCs of measuring intracerebral hemorrhage volumes. Stroke. 1996;27(8):1304–5.

    Article  CAS  PubMed  Google Scholar 

  18. He XW, Chen MD, Du CN, Zhao K, Yang MF, Ma QF. A novel model for predicting the outcome of intracerebral hemorrhage: based on 1186 patients. J Stroke Cerebrovasc Diseases: Official J Natl Stroke Association. 2020;29(8):104867.

    Article  Google Scholar 

  19. Maeshima S, Ueyoshi A, Matsumoto T, Boh-oka S, Yoshida M, Itakura T, Dohi N. Unilateral spatial neglect in patients with cerebral hemorrhage: the relationship between hematoma volume and prognosis. J Clin Neurosci. 2002;9(5):544–8.

    Article  PubMed  Google Scholar 

  20. Nattino G, Finazzi S, Bertolini G. A new test and graphical tool to assess the goodness of fit of logistic regression models. Stat Med. 2016;35(5):709–20.

    Article  PubMed  Google Scholar 

  21. Nattino G, Finazzi S, Bertolini G. A new calibration test and a reappraisal of the calibration belt for the assessment of prediction models based on dichotomous outcomes. Stat Med. 2014;33(14):2390–407.

    Article  PubMed  Google Scholar 

  22. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, Pencina MJ, Kattan MW. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology. 2010;21(1):128–38.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Lin JS, Evans CV, Johnson E, Redmond N, Coppola EL, Smith N. Nontraditional risk factors in Cardiovascular Disease Risk Assessment. JAMA 2018, 320(3).

  24. Torres A, Cilloniz C, Niederman MS, Menendez R, Chalmers JD, van der Wunderink RG. Poll T: Pneumonia. Nat Rev Dis Primers. 2021;7(1):25.

    Article  PubMed  Google Scholar 

  25. Goplen NP, Wu Y, Son YM, Li C, Wang Z, Cheon IS, Jiang L, Zhu B, Ayasoufi K, Chini EN et al. Tissue-resident CD8(+) T cells drive age-associated chronic lung sequelae after viral pneumonia. Sci Immunol 2020, 5(53).

  26. Westendorp WF, Dames C, Nederkoorn PJ, Meisel A. Immunodepression, Infections, and functional outcome in ischemic stroke. Stroke. 2022;53(5):1438–48.

    Article  CAS  PubMed  Google Scholar 

  27. Patel UK, Kodumuri N, Dave M, Lekshminarayanan A, Khan N, Kavi T, Kothari R, Lunagariya A, Jani V. Stroke-Associated Pneumonia: a retrospective study of risk factors and outcomes. Neurologist. 2020;25(3):39–48.

    Article  PubMed  Google Scholar 

  28. Hinson HE, Hanley DF, Ziai WC. Management of intraventricular hemorrhage. Curr Neurol Neurosci Rep. 2010;10(2):73–82.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Illanes S, Liesz A, Sun L, Dalpke A, Zorn M, Veltkamp R. Hematoma size as major modulator of the cellular immune system after experimental intracerebral hemorrhage. Neurosci Lett. 2011;490(3):170–4.

    Article  CAS  PubMed  Google Scholar 

  30. Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115(7):928–35.

    Article  PubMed  Google Scholar 

  31. Pencina MJ, D’Agostino RB, Pencina KM, Janssens AC, Greenland P. Interpreting incremental value of markers added to risk prediction models. Am J Epidemiol. 2012;176(6):473–81.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Pencina MJ, D’Agostino RB, Sr., D’Agostino RB Jr., Vasan RS. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat Med. 2008;27(2):157–72. discussion 207 – 112.

    Article  PubMed  Google Scholar 

  33. Leening MJ, Vedder MM, Witteman JC, Pencina MJ, Steyerberg EW. Net reclassification improvement: computation, interpretation, and controversies: a literature review and clinician’s guide. Ann Intern Med. 2014;160(2):122–31.

    Article  PubMed  Google Scholar 

  34. Olchanski N, Cohen JT, Neumann PJ, Wong JB, Kent DM. Understanding the value of Individualized Information: the impact of poor calibration or discrimination in Outcome Prediction Models. Med Decis Making. 2017;37(7):790–801.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26(6):565–74.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


Not applicable.


This work was supported by the National Natural Science Foundation of China (No. 82001363), and Wenzhou Science and Technology Project (Y2020060).

Author information

Authors and Affiliations



WXS and XHQ designed and administrated this study. WY and CYT drafted this manuscript. WXS, CRM and XYC revised this manuscript. ZH, CYT, and WY were responsible for the statistical analysis. All the authors contributed to the data collection. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Huiqin Xu or Xinshi Wang.

Ethics declarations

Ethics approval and consent to participate

All methods were carried out in accordance with the Declaration of Helsinki. This study obtained approval by the Ethics Committee of the First Affiliated Hospital of Wenzhou Medical University (No.2020 − 185). Verbal informed consent was obtained from the subject or the subject’s legally authorized representative through tele-interview.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Chen, Y., Chen, R. et al. Development and validation of a nomogram model for prediction of stroke-associated pneumonia associated with intracerebral hemorrhage. BMC Geriatr 23, 633 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: