Comparison of four histological scoring systems for autoimmune hepatitis to improve diagnostic sensitivity
Article information
Abstract
Background/Aims
The histological criteria in the 1999 and 2008 scoring systems proposed by the International Autoimmune Hepatitis Group (IAIHG) have their inherent limitations in diagnosing autoimmune hepatitis (AIH). In this study, we evaluated the histology components of four scoring systems (1. revised original scoring system [“1999 IAIHG”], 2. simplified scoring system [“2008 IAIHG”], 3. modified histologic criteria [“2017 UCSF”], and 4. a new histologic criteria proposed by the International AIH Pathology Group [“2022 IAHPG”]) in AIH patients.
Methods
Medical records and liver biopsies were retrospectively reviewed for 68 patients from two independent medical institutions, diagnosed with AIH based on the 1999 IAIHG system between 2006 and 2016. The histological features were reviewed in detail, and the four histological scoring systems were compared.
Results
Out of the 68 patients, 56 (82.4%) patients met the “probable” or “definite” AIH criteria of the 2008 IAIHG system, and the proportion of histologic score 2 (maximum) was 40/68 (58.8%). By applying the 2017 UCSF criteria, the number of histology score 2 increased to 60/68 (88.2%), and “probable” or “definite” AIH cases increased to 61/68 (89.7%). Finally, applying the 2022 IAHPG histology score resulted in the highest number of cases with histologic score 2 (64/68; 94.1%) and with a diagnosis of “probable” or “definite” AIH (62/68; 91.2%).
Conclusions
The recently proposed UCSF/IAHPG histological criteria increased the histology score of AIH. Substituting the histology component of the 2008 IAIHG system with the 2022 IAHPG criteria increased the sensitivity for diagnosing AIH (≥“Probable AIH”) from 82.4% to 91.2%.
Graphical Abstract
INTRODUCTION
The diagnosis of autoimmune hepatitis(AIH) is made based on the clinical presentation, laboratory findings (e.g., elevated liver enzymes, serum immunoglobulin G [IgG], and the presence of autoantibodies), the histopathological findings, and the exclusion of other etiologies [1]. The diagnosis of AIH can at times be difficult considering the heterogeneity of the clinical features and the large number of differential diagnoses [2]. Diagnostic scoring systems have been proposed by the International Autoimmune Hepatitis Group (IAIHG)—originally in 1993, revised in 1999 and then simplified in 2008—with the initial aim of selecting a homogeneous group of AIH patientsfor research [3-5]. These scoring systems have eventually made their way into clinical practice, and the revised criteria (hereafter referred to as “1999 IAIHG”) and the simplified criteria (hereafter referred to as “2008 IAIHG”) are commonly used in routine clinical care [3,4].
Liver biopsy is an essential part of the diagnosis of AIH, and this is reflected in the histology components of the 1999 IAIHG and 2008 IAIHG systems [3,4,6]. For example, in the 2008 IAIHG system, in which a minimum of 6 points is necessary to call a case as at least “probable” AIH, liver histology is allocated up to 2 out of a total of 8 points [4]. However, the histological criteria stated in this system has not been prospectively validated, and the diagnostic utility of hepatocyte rosette formation and emperipolesis has been questioned [7,8]. To this end, modificationsin the his tological component of the 2008 IAIHG criteria have been recently proposed by pathologists [7,8]. In 2017, Balitzer et al. [7] proposed a modified histological criteria for AIH (hereafter referred to as “2017 UCSF”), based on the necroinflammatory activity and the degree of plasma cell infiltration, and excluding rosettes and emperipolesis. This modified histologic criteria increased the histology score for AIH cases, and applying the 2017 UCSF histologic criteria to the 2008 IAIHG system increased the diagnostic sensitivity of AIH [7]. Recently, a group of liver pathologists(International Autoimmune Hepatitis Pathology Group) published a consensus statement for the histological diagnosis of AIH (hereafter referred to as“2022 IAHPG”) [8].
In this study, we compared the four scoring systems—1999 IAIHG, 2008 IAIHG, 2008 IAIHG with 2017 UCSF histology criteria (2008 IAIHG+2017 UCSF), and 2008 IAIHG with 2022 IAHPG criteria (2008 IAIHG+2022 IAHPG)—in a retrospective cohort of AIH patientsfrom two institutionsin Korea.
MATERIALS AND METHODS
Case selection
Liver biopsy cases diagnosed between 2006 and 2016 at Seoul National University Hospital and Seoul National University Bundang Hospital were retrieved from the pathology database of each institution, by searching the keyword “autoimmune hepatitis” in pathology reports. Cases containing the terms “typical AIH”, “consistent with AIH”, “suggestive of AIH”, and “the possibility of AIH could be considered” were included in the search. The medical records for each patient were reviewed and the following information was recorded: age, sex, body massindex (BMI), viral hepatitisstatus, alcohol consumption history, medication history, symptoms at initial presentation, biochemical status at the time of biopsy, serum IgG level, the presence of auto-antibodies (antinuclear antibody [ANA], anti-smooth muscle antibody [SMA], anti-liver kidney microsome [LKM] type1 antibody, anti-mitochondrial antibody [AMA]), co-morbidities, and extrahepatic autoimmune disorders including Sjögren syndrome and autoimmune thyroiditis. Cases with other etiologies (e.g., alcoholic hepatitis, viral hepatitis, toxic hepatitis, primary biliary cholangitis), overlap syndrome and those with uncertain etiology were excluded. Liver biopsies with fewer than 6 portal tracts were excluded. The disease course and response to immunosuppressive therapy were also evaluated. Complete response was defined as the normalization of serum transaminase and immunoglobulin levels below the upper normal limit within 6 months of initial treatment. Finally, 68 patients with a final diagnosis of AIH based on the clinical features, laboratory findings and pathology were included in the study. All cases were interpreted as “definite” or “probable” AIH, based on the pretreatment aggregates scores of the 1999 IAIHG system [3]. This study was approved by the Institutional Review Board of Seoul National University Hospital and Seoul National University Bundang Hospital (#H-2208-098-1350), and informed consent was waived due to the retrospective nature of the study.
Histopathological evaluation
Hematoxylin and eosin and Masson trichrome-stain slides of liver biopsies were reviewed by two liver pathologists (S.A and H.K). The necroinflammatory grade was determined for each case using the Ishak grading scheme as follows: interface activity (A0-A4), confluent necrosis (B1-B6), lobular necroinflammatory activity (C0-C4), and portal inflammation (D0-D4) [7,9]. The degree of fibrosis was recorded using the Batts-Ludwig scheme (scale 0-4) [7,10]. The presence of hepatocyte rosettes, emperipolesis, and the predominant inflammatory cells were evaluated. Rosettes were defined as hepatocytes arranged around a clearly identifiable luminal space [7]. Emperipolesis was defined as the presence of lymphocytes or plasma cells within the cytoplasm of hepatocytes [7]. A plasma cell cluster was defined as an aggregate of more than 5 plasma cells in one focus [7]. The presence of bile duct injury, bile duct loss and steatosis were also evaluated. The histological features were reviewed independently, without knowledge of the other pathologist’s interpretation results, and after completing the first round of independent histopathological slide review, discrepant cases were reviewed together at a multiheaded microscope.
Application of AIH scoring systems
For each case, the histology scores of the 1999 IAIHG system (maximum 5 points) and 2008 IAIHG system (maximum 2 points) were calculated [3,4]. We then evaluated the modified histology scores of the 2017 UCSF and 2022 IAHPG proposals [7,8]. The four scoring systems are summarized in Table 1. For the 2022 IAHPG system, we categorized “likely AIH” as 2 points, “possible AIH” as 1 point and “unlikely AIH” as 0 point, and applied these scoresto the 2008 IAIHG system.
RESULTS
Patient characteristics
The clinical features of the 68 patients are summarized in Table 2. All patients were Korean and 55 (80.9%) were female. The mean age at the time of diagnosis was 58 years (range: 20–87), and 16 (23.5%) patients had BMI >25 kg/m2. Thirty-two (47.1%) patients presented with acute presentation or aggravation. Thirty-six (52.9%) patients were symptomatic at the time of diagnosis (including jaundice, nausea, and abdominal discomfort), and one (1.5%) patient displayed hepatic decompensation. Serum IgG levels were elevated in 53 (77.9%) patients, and ANA was detected in 65 (95.6%) cases. Three patients with ANA negativity showed increased serum IgG levels. Anti-SMA was detected in 20 (35.1%) out of 57 tested patients, and anti-LKM type 1 antibody was negative in all 33 tested patients. All patients were negative for AMA.
Sixteen patients also had extrahepatic autoimmune disease, including Sjögren’s syndrome (11.8%), systemic lupus erythematosus (7.4%), rheumatoid arthritis (1.5%), polymyositis (1.5%), and autoimmune thyroid disease (1.5%). All patients were treated with immunosuppressive therapy: the initial treatment was prednisolone for 52 (76.5%) patients, and prednisolone and azathioprine for 16 (23.5%) patients. Out of the 66 patients with available follow up information, 53 (80.3%) patients showed complete response to initial treatment, and 13 (19.7%) showed incomplete response for initial treatment. Glucocorticoid-induced side effects were identified in 20 (30.3%) patients. Co-morbidities included hypertension (35.3%), diabetes mellitus(13.2%), and osteoporosis(35.3%).
Histopathological findings
The histopathological features of 68 patients are summarized in Table 3. Sixty-three (92.6%) cases showed at least moderate portal activity (≥D2), and 60 (88.2%) cases demonstrated at least mild/moderate interface activity (≥A2) (Fig. 1A). Confluent necrosis (≥B2) was identified in 27 (39.7%) cases. Lobular necroinflammatory activity (≥C2) was observed in 65 (95.6%) cases. Fibrosis of any degree was present in 65 (95.6%) cases. Bridging fibrosis and cirrhosis was observed in 15 (22.1%) and 16 (23.5%) patients, respectively. The predominant inflammatory cell type was lymphoplasmacytic in 60 (88.2%) cases, and lymphocytic in 8 (11.8%) cases. None of the cases showed predominant neutrophilic or eosinophilic infiltration. At least one plasma cell was present in the portal tracts of all but one case, and plasma cell clusters in the portal tracts were identified in 51 (75.0%) patients. Lobular plasma cell infiltration was observed in 59 (86.8%) patients. Emperipolesis and hepatocellular rosettes were identified in 49 (72.1%) and 50 (73.5%) cases, respectively (Fig. 1B, C). Co-existing steatosis was identified in 18 (26.5%); 18 of these cases showed mild macrovesicular steatosis and one case demonstrated moderate macrovesicular steatosis. Perivenular and perisinusoidal fibrosis was identified in 3 (4.4%) and 2 (2.9%) cases, respectively. None of the cases demonstrated features of steatohepatitis. Bile duct damage of more than mild degree was identified in 3 (4.4%) cases; however, bile duct loss was not observed in any patient.
Comparison of four scoring systems
Based on the clinical and histological findings of each case, the AIH scores were calculated using the four different scoring methods. The results are summarized in Figure 2A, 2B and Table 4. For the 1999 IAIHG (revised original scoring system), the pretreatment criteria was applied. All patients met the definite or probable criteria by the 1999 IAIHG system (60.3% “definite”, 39.7% “probable”). The histologic score component ranged from 0 to 5 (the maximum histology score) with a mean value of 4.43. By this system, 50 (73.5%) cases were given the maximum histologic score (5).
Using the 2008 IAIHG system, 56 (82.4%) cases met the probable (total score ≥6) or definite (≥7) criteria for AIH (61.8% “definite”, 20.6% “probable”). The histological scores were “1” for 28 (41.2%) and “2” for 40 (58.8%) cases, and the mean score was 1.59.
By applying the 2017 UCSF system to the 2008 IAIHG system (2008 IAIHG+2017 UCSF), the number of casesthat qualified for the maximum histology score (“2”) increased from 40 (58.8%) to 60 (88.2%) cases, and the mean histologic score increased from 1.59 to 1.87. Accordingly, when the 2017 UCSF histology criteria was applied to the histology component of the 2008 IAIHG system, the number of cases that met the definite or probable criteria for AIH increased to 61 (89.7%), with an increase of “definite” casesfrom 61.8% to 64.7%.
Finally, by the recently proposed 2022 IAHPG criteria (2008 IAIHG+2022 IAHPG), the number cases that qualified for the maximum histology score (“2”) was the highest at 64 (94.1%) cases, with the highest mean histologic score of 1.94. By applying the 2022 IAHPG criteria to the 2008 system, 62 (91.2%) cases met the probable or definite criteria for AIH, with 45 (66.2%) cases being classified as “definite AIH”. The proportion of definite AIH (total score ≥7) was the highest among fourscoring systems.
For the thirty-two cases with acute presentation or aggravation, 21 (65.6%) cases met the probable or definite AIH category by 2008 IAIHG system (Table 5). However, by applying 2017 UCSF system, the number of cases that qualified for the maximum histology score (“2”) increased from 21 (65.6%) to 29 (90.6%) cases, and there was an increase in cases that met the probable or definite AIH category from 81.3% to 90.6%. Finally, by applying 2022 IAHPG system, the number of cases with the maximum histology score was the highest (96.9%), and the proportion of probable or definite AIH category was the highest (93.8%) (Fig. 2C, D).
DISCUSSION
Although the 1999 IAIHG scoring system is regarded as the gold standard for the diagnosis of AIH, it was originally intended for research purposes and is too complicated to use in everyday clinical practice. To this end, the Simplified criteria was proposed in 2008 to provide a diagnostic criteria that was easier to use; however, it has been recognized that the histology component of the 2008 IAIHG system may lead to underscoring of potential AIH cases, and proposals have been put forth to increase the diagnostic sensitivity of this system [7,8,11-14]. In this retrospective study, we compared four AIH scoring systems in 68 Korean patients who were diagnosed as AIH based on the 1999 IAIHG system, and focused on the histological score component. As expected, the 2008 IAIHG system showed the lowest sensitivity (82.4% for “Probable or definite AIH”) among the fourscoring systems.
By substituting the histological criteria of the 2008 IAIHG system with the 2022 IAHPG histologic score, the sensitivity of diagnosing “definite AIH” and “at least probable AIH” increased from 61.8% and 82.4%, respectively (2008 IAIHG) to 66.2% and 91.2%, respectively (2008 IAIHG+2022 IAHPG). When the histological component was analyzed separately, the proportion of cases with the maximum histological scores was highest by the 2022 IAHPG method (94.1%), and lowest by the 2008 IAIHG system (58.8%).
Liver biopsy is an essential component in making a diagnosis of AIH, and each AIH scoring system accordingly contains a histology score [6]. The 1999 IAIHG system emphasizes the importance of moderate/severe interface hepatitis (3 points out of a maximum of 5 histology points), and the total histology score is the result of a simple addition of component scores. Therefore, compared to the 2008 system in which potential AIH cases without hepatocytic rosettes or emperipolesis are regarded insufficient for a diagnosis of “typical” AIH, such cases could qualify for a diagnosis of “definite AIH” by the 1999 system as long as other clinical or pathological features are present, yielding a total score of >15 points. In this regard, the 1999 IAIHG system is more sensitive compared to the 2008 IAIHG system. However, both IAIHG systems do not take into account the acute hepatitis presentation of AIH; indeed, the histological features of AIH vary according to disease status, and do not always demonstrate the classical feature of a chronic hepatitis patternwith portal lymphoplasmacytic infiltration and interface hepatitis [15,16]. AIH with acute presentation mostly demonstrates histological features of acute lobular hepatitis, and such cases would be less likely to meet the histological criteria for AIH according to the 1999 and 2008 IAIHG systems, due to the lack of interface hepatitis. In fact, acute hepatitis patterns of AIH are being increasingly recognized, and the differential diagnosis between AIH and other causes of acute hepatitis, such as acute viral hepatitis and drug/toxin-induced liver injury, is often difficult but at the same time, very important [15,17]. It is therefore crucial for pathologists to be aware of this type of AIH, and applying the 2022 IAHPG definitions for the lobular hepatitis pattern of AIH would serve as a useful guiding tool in identifying the acute form of AIH [8].
For the 2008 IAIHG “simplified scoring” system, the histological score accounts for a maximum of 2 out of 8 points [4]. A histological score of “2” requires the presence of interface hepatitis and hepatocytic rosettes and emperipolesis, and if any one of these features are absent, a score of 1 is given for a chronic hepatitis picture [4]. However, recent studies have questioned the utility of including hepatocytic rosettes and emperipolesis as prerequisites for a diagnosis of “typical” AIH [7,18]. The reported frequency of hepatocyte rosette formation and emperipolesis has markedly varied, with ranges of 19–75% and 19–80%, respectively [7,15,18-22]. They are difficult to interpret in daily diagnostic practice; identifying emperipolesis, especially, is a time-consuming task for pathologists, and also prone to interobserver variability in interpretation. Moreover, hepatocytic rosettes and emperipolesis lack sensitivity and specificity for AIH, as hepatocyte rosettes are seen in during regeneration after various types of injury, and emperipolesis is seen in severe lobular injury [7,18,20,21]. Thus, these two histological features were excluded from the 2017 UCSF and 2022 IAHPG histology scores [7,8].
The presence of hyaline globules in Kupffer cells has been suggested to serve as a diagnostic clue for AIH, and a modification of the 2008 IAIHG histology component has been proposed by Gurung et al., which requires the presence of Kupffer cell hyaline globules in addition to a prominent plasmacytic infiltration to qualify as a “typical” AIH [23]. However, this needs further validation; a few other studies demonstrated that this feature was not correlated with serum IgG levels, autoantibody status or histological activity/fibrosis, and moreover, its identification requires additional stains such as periodic acid-Schiff post-diastase (D-PAS) stain and CD68 [24,25]. Another interesting finding which requires further validation is the higher number of apoptotic lymphocytes in portal tractsin AIH compared to other liver diseases [26].
The limitations of this study are as follows. It is a retrospective study from two institutions on a relatively small number of cases. In addition, a control group with other etiologies was not included, and only typical cases confirmed by the 1999 IAIHG system were included in thisstudy. Therefore, the specificity of each diagnostic system was not assessed. Nevertheless, this is a uniform cohort of Korean AIH patients and, to our knowledge, this is the first study to directly compare the histological scoring systems, including the most recent 2022 IAHPG consensus criteria.
In summary, recently proposed 2017 UCSF and 2022 IAHPG criteria increased the histological scores of AIH cases, and substituting the histological component of the 2008 IAIHG system with these newly proposed histological criteria increased the diagnostic sensitivity for AIH. Therefore, the recently proposed histologic criteria are expected to resolve the low diagnostic sensitivity of 2008 simplified scoring system, which warrantsfurther investigation.
Notes
Authors’ contributions
Conception: H.K., S-H.J.; Data acquisition: S.A., S-H.J., EJ.C., K.L., G.K., H.K.; Data Analysis: S.A., H.K.; Interpretation: S.A.,EJ.C., H.K. S-H.J.; Original Draft: S.A.; Critical revision: H.K. S-H.J. All authors approved for submission of the final version of the manuscript.
Conflicts of Interest
The authors have no conflictsto disclose.
Acknowledgements
This study was supported by the research fund of Seoul National University Bundang Hospital (SNUBH 02-2018-011, S.A.) and The Research Supporting Program of The Korean Association for the Study of the Liver and The Korean Liver Foundation (H.K.).
Abbreviations
AIH
autoimmune hepatitis
IgG
immunoglobulin G
IAIHG
International Autoimmune Hepatitis Group
IAHPG
International Autoimmune Hepatitis Pathology Group
ANA
antinuclear antibody
SMA
smooth muscle antibody
anti-LKM
anti-liver kidney microsome antibody type 1
AMA
anti-mitochondrial antibody
References
Article information Continued
Notes
Study Highlights
• Although the 2008 International Autoimmune Hepatitis Group (IAIHG) simplified scoring system is commonly used in clinical practice, the histological component in this scoring system has low sensitivity for diagnosing autoimmune hepatitis.
• Substituting the histological component of the 2008 scoring system by modified histological scores (2017 UCSF or 2022 IAHPG) increased the diagnostic sensitivity, especially for AIH cases with acute hepatitis patterns.