Glaucoma is a group of eye conditions in which the optic nerve head (ONH) and retinal ganglion cells are damaged, resulting in a reduction of the sufferer's visual field (VF). Part of what makes glaucoma so dangerous is that the rate of VF impairment is typically slow and that loss usually begins in the peripheral vision. Consequently, many patients do not present until very late, when the amount of VF lost can seriously undermine their quality of life (e.g. by having their driving licence revoked or being at increased risk of a fall). Most methods of glaucoma treatment involve reducing the intraocular pressure, and treatment must be adapted to the rate of disease progression, which makes monitoring VF progression very important. Information on VF progression is also used to assess whether a patient requires further intervention, depending on whether the condition is progressing quickly enough to have a tangible effect within their expected lifetime.1 Furthermore, VF measurements (and intraocular pressure) are by and large the only accepted endpoints in evaluations of new therapies for glaucoma; the Advanced Glaucoma Intervention Study (AGIS), Collaborative Initial Glaucoma Treatment Study (CIGTS) and Early Manifest Glaucoma Trial (EMGT) are examples of major trials in which VF progression was the chief endpoint. This article describes the most important current methods of assessing glaucomatous VF progression and compares their strengths and weaknesses. In addition, practical advice on monitoring functional loss is given and current research topics are discussed.
Measuring the Functional Progression of Glaucoma
One of the most important methods of diagnosing glaucoma and monitoring its progression is measurement of the functional progression of the disease. Structural methods, which assess the health of the ONH, can also be used, but these are not the focus of this article. Perimetry is the means by which a patient's VF is mapped and is the only means of measuring functional progression. The Humphrey® Field Analyzer (HFA; Carl Zeiss Meditec, Dublin, CA) and the Octopus (Haag-Streit, Köniz, Switzerland) are commonly used perimeters, especially in a tertiary or referral setting where the goal is to monitor VFs in patients with glaucoma or at risk of developing glaucoma, so this article focuses on these instruments. Standard automated perimetry (SAP) has been the gold standard for measuring the VF since its introduction in the 1980s, but a number of other automated perimetry methods have been developed to try to displace it, including short-wavelength automated perimetry (SWAP), Pulsar (or flicker) perimetry and frequency-doubling perimetry (FDP or FDT).2,3 However, research is still ongoing and, for now, SAP remains the primary method for detecting VF progression and is hence the subject of this article. As with structural methods, alternative functional methods are best used alongside, rather than instead of, SAP.4
The full-threshold method of measuring thresholds in SAP employs a ‘staircase’ technique. This method provides a measurement of defect severity at each location in the VF but can take quite a long time (about 15 minutes per eye), during which it can be difficult to keep a patient’s attention. Faster methods have therefore been devised to reduce test time. One such technique for the HFA is the Swedish Interactive Threshold Algorithm (SITA) Standard strategy, which seeks to reduce test time by incorporating the knowledge that the threshold at each point is unlikely to be independent of neighbouring locations.5 The chief advantage of SITA Standard is that it can halve the duration of testing for many patients. An equivalent, though different, strategy for the Octopus perimeter is known as the ‘Dynamic’ strategy, which shares many of SITA Standard’s advantages. Even faster methods have been devised to enhance the speed of testing further (such as SITA Fast and Tendency-Oriented Perimetry for the Humphrey and Octopus respectively), but these may sacrifice sensitivity in identifying VF defects and there is some evidence that their results are less repeatable.6,7
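To make the staircase idea concrete, the sketch below implements a generic 4–2 dB staircase against a noiseless simulated observer; it illustrates the principle only, not the proprietary SITA or Dynamic strategies, and the function name, starting value and observer model are assumptions chosen for clarity.

```python
def staircase_threshold(true_threshold_db, start_db=25):
    """Minimal sketch of a generic 4-2 dB staircase (not the proprietary SITA algorithm).

    dB values follow the perimetric convention: higher dB means a dimmer stimulus,
    so a stimulus is 'seen' when its dB value does not exceed the patient's threshold.
    The 'patient' here is a noiseless simulated observer, for illustration only.
    """
    intensity = start_db
    last_response = None
    for step in (4, 2):                        # 4 dB steps, then 2 dB after a reversal
        while True:
            seen = intensity <= true_threshold_db          # simulated response
            if last_response is not None and seen != last_response:
                last_response = seen
                break                          # response reversal: switch to the finer step
            last_response = seen
            intensity += step if seen else -step           # dimmer if seen, brighter if not
    return intensity                           # estimate after the second reversal

print(staircase_threshold(true_threshold_db=27))           # estimates a value near 27 dB
```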
Figure 1: Humphrey® Field Analyser Greyscale Representations of Six Different Eyes
A point for consideration when reviewing perimetry results is how to assess their reliability, as poor-quality results can lead to misdiagnosis. All of the tests described above evaluate the reliability of the test itself by recording false negatives, false positives and fixation losses. These measures give an indication of patient attentiveness, how ‘trigger-happy’ the patient is and the patient’s fixation performance, respectively. Major clinical trials have used these measures to assess VF reliability,8–10 yet the criteria applied to them remain arbitrary and vary between trials. For instance, the AGIS and CIGTS trials used a scoring system under which patients could theoretically pass with false positive or false negative rates in excess of 33 %,8,9 whilst the EMGT did not consider the false negative rate at all.10 In addition, other reliability criteria, such as the total number of questions asked8 (longer tests imply greater uncertainty in measuring the threshold) and short-term fluctuation values, have been used.8,9 There is no single ‘perfect’ reliability index for assessing progression, but if a VF is unreliable then this will hinder the ability to detect progression. Furthermore, there is evidence to suggest that the reliability indices do not provide accurate insight into a patient’s performance.11 Instructions to the patient, correction of spherical ametropia and patient attention all have a significant bearing on the result, and reliability indices should not be relied upon exclusively.4 In addition, the psychophysical nature of the tests means that learning effects need to be taken into consideration, as patients often improve in their ability to undertake perimetric testing with experience. Although seemingly wasteful, discounting initial visual fields remains good practice when assessing glaucoma progression. There are several good publications about the practicalities of good VF testing with SAP and the interpretation of results, a comprehensive one being by Henson.12 However, even after following all the recommendations for good practice, the clinician is often still left with variable VF measurements upon which to make a decision. It is this ‘noise’ in the measurements of VF thresholds that makes measuring progression far from straightforward, particularly in areas of damage, where variability is even greater.6,13,14
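As a concrete illustration of how reliability indices might be screened, the following sketch flags a test as unreliable when any index exceeds a cut-off; the cut-off values are examples only (the 33 % figure echoes the rates mentioned above), since, as noted, the criteria used in trials and clinics are largely arbitrary and vary.

```python
def is_unreliable(false_pos_rate, false_neg_rate, fixation_loss_rate,
                  fp_limit=0.33, fn_limit=0.33, fl_limit=0.20):
    """Flag a visual field test as unreliable when any index exceeds its cut-off.

    The cut-off values are illustrative only; as discussed in the text, the
    criteria applied in trials and clinics are largely arbitrary and vary.
    """
    return (false_pos_rate > fp_limit or
            false_neg_rate > fn_limit or
            fixation_loss_rate > fl_limit)

# Example: a 'trigger-happy' patient with 40 % false positives is flagged.
print(is_unreliable(false_pos_rate=0.40, false_neg_rate=0.05,
                    fixation_loss_rate=0.10))   # True
```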
Summary Methods of Detecting Functional Progression in Glaucoma
The output produced from SAP can be confusing as it contains a huge amount of information, and it is not always obvious how large the changes are from one VF assessment to the next. In spite of this, many clinicians judge progression ‘manually’, using their experience to compare SAP printouts. However, there is good evidence that this leads to inconsistent decisions, since agreement between clinicians in judging progression is mediocre at best.15,16 Furthermore, it has been shown that, when clinicians use software, their concordance increases substantially.15 Thus, it is vital that quantitative methods are utilised for measuring progression. Detecting VF progression requires a thorough understanding and use of the available software and analysis methods, and these will now be reviewed.
Global indices are often utilised to assess the visual field. An important feature of any such statistic is that it can be put into context; consequently, every perimetry machine has a normative database of VF thresholds against which measured thresholds can be compared at a given location and for a given age, so that a significant loss can be determined.
Figure 2: A Demonstration of the Differences Between (A) Event and (B) Trend-based Analyses for One Point in Consecutive Visual Fields
At each point, the total deviation (TD) value represents the difference between the measured threshold and the expected VF sensitivity for a healthy eye of the given age. Summary measures, such as the mean defect or mean deviation (MD), take an average of all TD values in the test to produce a single statistic of global VF sensitivity compared with that of an age-matched person with ‘normal’ vision.
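As a worked illustration of these definitions, the sketch below computes TD values and a simple unweighted MD from measured thresholds and hypothetical age-corrected normal values; commercial perimeters weight locations by their normal variability when computing MD, which is omitted here for simplicity.

```python
import numpy as np

# Hypothetical measured thresholds (dB) and age-corrected normal values (dB)
# for a handful of VF locations; real tests use 50 or more locations.
measured = np.array([28.0, 25.0, 14.0, 30.0, 22.0])
normal_for_age = np.array([31.0, 30.0, 29.0, 31.0, 30.0])

total_deviation = measured - normal_for_age     # TD: difference from expected sensitivity
mean_deviation = total_deviation.mean()         # simple (unweighted) MD

print(total_deviation)   # [ -3.  -5. -15.  -1.  -8.]
print(mean_deviation)    # -6.4
```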
The problem with using MD as a measure of VF damage is that it only summarises the defectiveness of the whole VF, when, in glaucoma, the size and position of localised damage are likely to be of most interest (see Figure 1). The TD plot partially resolves this issue, but it does not differentiate the effects of glaucoma from the overall sensitivity loss associated with worsening media opacity from cataract. The pattern deviation (PD) plot was devised to assess the location of defects whilst also separating what is likely to be glaucomatous from general VF worsening caused by cataract. Pattern standard deviation (PSD) then measures the amount of variability in the PD values and so attempts to distinguish focal from diffuse loss. PSD is not a good indicator of overall damage in advanced disease, though it can still be helpful when used alongside the MD.17 However, neither PSD nor MD provides spatial information about the VF defect.
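The same illustration can be extended to PD and PSD: a simplified pattern deviation can be approximated by subtracting an estimate of the general height of the field, and a PSD-like index by the spread of the resulting values. The actual Statpac calculations use location-specific weighting and empirically derived general-height estimates, so the sketch below is a conceptual approximation only.

```python
import numpy as np

total_deviation = np.array([-3.0, -5.0, -15.0, -1.0, -8.0])

# Approximate the 'general height' of the field by an upper percentile of the TD
# values, so that a uniform depression from cataract is removed while focal loss remains.
general_height = np.percentile(total_deviation, 85)
pattern_deviation = total_deviation - general_height

# A simplified, unweighted PSD-like index: the spread of the pattern deviation values.
psd_like = pattern_deviation.std(ddof=1)

print(np.round(pattern_deviation, 1))   # focal defect stands out from the rest
print(round(float(psd_like), 2))        # larger spread indicates more focal loss
```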
The Visual Field Index (VFI) is a relatively new summary measure that seeks to quantify glaucomatous damage in newer models of the Humphrey Field Analyser. Visual fields with no discernible defect are scored 100 %, with 0 % signifying perimetric blindness. For each defective point in the pattern or TD plot, the VFI computes a weighted average, operating on the principle that the very central part of the VF is of greatest importance and so is weighted more heavily. Because the VFI attempts to measure only damage from the PD values, it has some immunity to the confounding effects of cataract, and because it prioritises the very central VF, it may give a better estimate of actual functional loss.18 Interestingly, however, there is evidence that the VFI over-estimates the percentage of functionally useful VF that experts judge to remain in an eye; in short, the VFI may be somewhat misleading in representing how well a patient can see.19 Thus, though the VFI is useful, MD should still be considered when evaluating VF loss. A better index, in terms of VF disability, could potentially be gleaned by combining measures from the two eyes,20,21 but no specific measures combining monocular VFs have yet been tested and clinically accepted. The integrated visual field (also known as the best sensitivity method), which takes the better sensitivity from either eye at each point, is one such method that has so far shown promising results when compared with conventional binocular tests that are not routinely available to the clinician.22–24
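To make the ‘central points weighted more heavily’ principle concrete, the sketch below computes a VFI-like score as a weighted average of per-point scores; the scores and weights are hypothetical stand-ins rather than the proprietary VFI definition.

```python
import numpy as np

# Hypothetical per-point scores: 1.0 = normal, 0.0 = perimetrically blind.
# In the real VFI these are derived from the pattern/total deviation probabilities.
point_scores = np.array([1.0, 1.0, 0.4, 0.9, 0.0, 1.0])

# Hypothetical weights: locations nearer fixation matter more for everyday vision,
# so central locations receive larger weights than peripheral ones.
weights = np.array([3.0, 3.0, 2.0, 2.0, 1.0, 1.0])

vfi_like = 100.0 * np.average(point_scores, weights=weights)
print(round(float(vfi_like), 1))   # weighted percentage of remaining field, e.g. 80.0
```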
Overall, though summary measures are useful in providing a single figure representing how badly an individual’s VF has been degraded by glaucoma, a universal problem for all of these measures is that they waste data and ignore important spatial information. Furthermore, it is difficult to tell how changes in these measures relate to visual disability: what exactly does a drop in VFI from 100 to 97 % mean? Criteria must be devised to indicate whether glaucomatous progression is taking place at a dangerous rate or not. Pointwise scoring criteria (examining each VF location separately) that amount to counting defective points have been used for determining progression, though usually only in clinical trials, as they appear to have acceptable diagnostic specificity (the probability of correctly classifying stable eyes as non-progressing). Precise details of the methods used for evaluating VF progression in some major clinical trials can be found in the papers referenced here.8,10,25–28 However, better methods for detecting change, namely event- and trend-based analyses, have now been incorporated into software for the analysis of progression.
Table 1: Methods of Detecting Glaucomatous Progression
Figure 3: A Simplified Graphical Representation of Table 2 from the Paper by Chauhan et al.4
Event and Trend-based Analyses
There is often much debate about whether to use event-based or trend-based methods. The main difference between the two lies in how they treat measurements taken between the first and last VF, which gives each approach its specific properties. Event-based analysis involves establishing a baseline and comparing every subsequent test against it (see Figure 2a). Any significant difference between the baseline and the latest reading is considered to be due to progression.
One method of this kind is the ‘glaucoma change probability’ (GCP) method, sometimes called ‘glaucoma probability analysis’, which is available in the Statpac 2 software for the HFA. In short, the pointwise VF sensitivities from an average of two baseline readings are compared with subsequent VF tests using the glaucoma change probability map (GCPM), which considers the amount of ‘normal’ variability one would expect for the baseline values. Points that show significant deviation from baseline in three consecutive tests are marked as progressing (see Figure 2a). Heijl et al. used an adapted version of the GCPM for the EMGT.26 The GCP method is very sensitive for detecting glaucomatous progression27 and there is evidence to suggest that its results agree well with expert opinion on progression.10
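The logic of an event-based, GCP-style analysis can be sketched as follows: each follow-up field is compared point by point against the mean of two baseline fields, and a point is flagged once its deterioration exceeds an assumed test–retest variability limit in three consecutive tests. The 4 dB limit and the data below are invented for illustration; the real GCPM derives its limits empirically from stable glaucoma eyes.

```python
import numpy as np

def flag_progressing_points(baselines, follow_ups, variability_db=4.0, n_consecutive=3):
    """Event-based sketch in the spirit of glaucoma change probability analysis.

    baselines:  (2, n_points) array of two baseline VFs (dB)
    follow_ups: (n_tests, n_points) array of subsequent VFs (dB)
    A point is flagged when it falls below baseline by more than the assumed
    test-retest variability limit in n_consecutive consecutive follow-ups.
    The 4 dB limit is a hypothetical stand-in for the empirical GCPM limits.
    """
    baseline = np.mean(baselines, axis=0)
    worse = (baseline - follow_ups) > variability_db        # shape (n_tests, n_points)
    flagged = np.zeros(baseline.shape[0], dtype=bool)
    for start in range(worse.shape[0] - n_consecutive + 1):
        flagged |= worse[start:start + n_consecutive].all(axis=0)
    return flagged

baselines = np.array([[30, 28, 26], [29, 27, 25]], dtype=float)
follow_ups = np.array([[29, 22, 25],
                       [30, 21, 24],
                       [28, 20, 25]], dtype=float)
print(flag_progressing_points(baselines, follow_ups))   # [False  True False]
```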
Trend-based analyses (see Figure 2b) are an evolving process in which all VFs are analysed using linear regression to assess the rate at which the measurement is changing over time and the significance of that change. Trend-based analyses can be applied to summary measures such as VFI or MD, and are performed in the Statpac (HFA) and EyeSuite (Octopus) Progression Analysis software. Recently, this approach has been advocated using the VFI in particular,29 but linear regression of summary measures has been criticised as being relatively insensitive for detecting VF progression.30,31 An alternative method is pointwise linear regression (PLR), implemented in the PROGRESSOR (Medisoft, Leeds, UK), EyeSuite (Haag-Streit, Köniz, Switzerland) and Peridata (Peridata Software GmbH, Huerth, Germany) software, which estimates the rate of progression at each point in the VF. A common statistical criterion used to assess whether progression is significant for a single point is a rate worse than -1 dB/year with a p value of less than 0.05. Other criteria are examined in detail by Gardiner and Crabb.32
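A minimal PLR can be written directly with standard statistical tools: sensitivity at each location is regressed on time, and the point is labelled as progressing if the slope is worse than -1 dB/year with p < 0.05, the criterion quoted above. The series below are invented for illustration, and a two-sided p value is used for simplicity.

```python
import numpy as np
from scipy import stats

# Hypothetical series: test dates in years and sensitivities (dB) at two VF points.
years = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0])
point_a = np.array([28.0, 27.0, 26.5, 25.0, 24.5, 23.0, 22.5])   # deteriorating
point_b = np.array([27.0, 28.0, 26.0, 27.5, 26.5, 27.0, 28.0])   # stable but noisy

def plr_is_progressing(times, sensitivities, slope_limit=-1.0, alpha=0.05):
    """Pointwise linear regression criterion: slope worse than -1 dB/year, p < 0.05."""
    fit = stats.linregress(times, sensitivities)
    return (fit.slope < slope_limit and fit.pvalue < alpha), fit.slope, fit.pvalue

for label, series in (("A", point_a), ("B", point_b)):
    progressing, slope, p = plr_is_progressing(years, series)
    print(f"point {label}: slope = {slope:.2f} dB/year, p = {p:.3f}, progressing = {progressing}")
```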
Because it uses all the previous fields in its diagnosis, trend analysis has statistical advantages and disadvantages compared with event-based analyses. First, event-based analyses generally require fewer VFs and less time to produce definitive results, so they may detect rapid deterioration in the VF more quickly than trend analysis.27 However, trend analysis is an important diagnostic of progression because it estimates the rate of VF progression, whereas event analyses focus only on a significant magnitude of difference between two values. Furthermore, trend analyses have been shown to have higher diagnostic sensitivity than event analyses.27 On the other hand, trend analyses often take longer to establish progression and the inference changes with each new measurement. For example, in Figure 2b, progression would be diagnosed after 3.5 years, but after 4 years the eye is again considered not to be progressing (though after 4.5 years progression is once again diagnosed). For all their perceived differences, event- and trend-based methods are quite similar: the resolution of the analysis is at a single point, and both can, with the aid of the graphics used in software, highlight the spatial location of change that an experienced clinician can act upon. Still, there remains no general consensus on the number of points that constitutes ‘real’ progression, whether contiguous points should be required to show this behaviour, or whether changes should be maintained in subsequent fields. For a comprehensive summary of the evidence base on the different methods of assessing glaucomatous VF progression, readers are directed to the review by Ernest et al.,28 and a practical summary of the differences between all the methods discussed is given in Table 1.
The Frequency of Visual Field Testing
Another important aspect of detecting progression is how regularly VFs should be measured. Chauhan and colleagues make suggestions based on the power (the probability of correctly diagnosing a progressing patient) associated with the number of tests taken.4 If too few VFs are taken from a given patient, detecting glaucomatous progression is unlikely because of the inherent variability of VF measurements. A more accurate estimate of the rate of progression is obtained by taking more readings, because the underlying signal is more likely to be detected amidst the variability. For instance, it would be foolish to conclude that there are no fish in a lake on the basis of one unsuccessful fishing trip: confidence in that conclusion depends on how long the fishing trip lasted and how many fish were in the lake. The duration of the fishing trip is analogous to sample size – if you collect more data you have greater power to detect change. The number of fish in the lake is analogous to the effect size – you have greater power to find a large effect than a small one. Chauhan et al. suggest that, as a minimum, six VFs should be taken in the patient’s first 2 years of monitoring, with the number of subsequent tests per year chosen on the basis of the progression rate and time-scale thereafter (see Figure 3). Of course, taking more VFs in a period of time will increase the number of false positive diagnoses, i.e. decrease specificity,36 though specificity can be improved by using more conservative progression criteria.32
Using simulation, Gardiner and Crabb found that, in highly variable eyes with thresholds decreasing by 2 dB per year, three VF tests per year was the optimum number in terms of sensitivity and specificity.36 Nouri-Mahdavi et al. point out that, in clinical practice, it is often difficult, time-consuming and costly to carry out so many VFs each year and hence recommend measuring VFs once every 6 months.37 However, there is a substantial loss in sensitivity when fewer than three VFs are taken per year.4,36 Some attention has been given to the prudent idea of varying the intervals between VF tests to optimise the detection of progression.38,39 One novel approach that should appeal to the interested reader varies the length of the interval between subsequent tests depending on the outcome of previous test results.40 To shift the discussion away from a purely statistical one, it should be emphasised that a real commitment should be made to carry out an adequate amount of testing in newly diagnosed patients; doing the odd VF test in every patient, say once a year or less, is probably as bad as doing no VF testing at all. The key is, perhaps, to stratify patients into those that will benefit from more frequent testing; however, accurate risk profiling of progression awaits further research.
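The trade-off between testing frequency and the power to detect change can be explored with a small simulation in the spirit of the studies cited above (though not a reproduction of them): noisy series with a true slope of -2 dB/year are generated for different numbers of tests per year, and the proportion in which a significantly negative slope is found is reported. The noise level, follow-up length and criterion are assumptions chosen for illustration.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)

def detection_rate(tests_per_year, years=2.0, true_slope=-2.0,
                   noise_sd=1.5, n_sims=2000, alpha=0.05):
    """Fraction of simulated noisy series in which a significantly negative slope
    is detected by linear regression (an illustrative power estimate only)."""
    n_tests = int(tests_per_year * years) + 1
    times = np.linspace(0.0, years, n_tests)
    hits = 0
    for _ in range(n_sims):
        series = true_slope * times + rng.normal(0.0, noise_sd, n_tests)
        fit = stats.linregress(times, series)
        if fit.slope < 0 and fit.pvalue < alpha:
            hits += 1
    return hits / n_sims

for k in (1, 2, 3):
    print(f"{k} tests/year over 2 years: detection rate ~ {detection_rate(k):.2f}")
```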
Figure 4: Screenshots from a) Statpac 2’s Glaucoma Probability Analysis for the HFA, b) EyeSuite Analysis Software for the Octopus Perimeter, c) PROGRESSOR and d) Peridata’s Boxplot Trend Analysis
Software for Glaucoma Monitoring
The importance of using the correct software to aid the diagnosis of VF progression is stressed again here. Software should not be used in isolation from clinical judgement or from other measures and risk factors, but it is essential in aiding the clinician to make an appropriate assessment of VF progression. Statpac 2 for the Humphrey perimeter provides GCP analysis as well as all the summary indices described earlier in this review (though no PLR analysis). The EyeSuite Progression Analysis software for the Octopus, meanwhile, has its own range of output, including all the summary measures described above bar the VFI, a program combining structural and functional defects, trend analyses of MD and PLR, but it does not yet feature a GCP analysis program. PROGRESSOR implements PLR analysis using colour-coded bar charts for each VF location, whilst Peridata allows PLR and can also display coloured diagrams of defect depth and three-dimensional hills of vision (both can be used with the HFA and Octopus perimeters) (see Figure 4).
Discussion
This review has summarised the main approaches for assessing glaucomatous progression, and the following conclusions can be made. First, SAP remains the current gold standard for measuring functional progression; methods of detecting ‘structural’ progression at the ONH should be used in conjunction with SAP where possible, but never instead of functional methods of measuring progression. Second, both trend- and event-based methods are useful for monitoring and detecting progression, and should be employed according to the clinical objective. Third, reliability indicators beyond those provided by perimeters need to be taken into account when assessing whether an obtained VF is sufficiently accurate. Finally, an adequate number of VF examinations should be taken in the first 2 years of monitoring a patient in order to maximise the probability of detecting glaucomatous progression. In the near future, new methods may allow more accurate monitoring of glaucoma progression. For example, ongoing research aims to determine how structural defects can be better linked to functional ones to aid diagnosis, with promising results thus far.41,42 Incorporating Bayesian statistics into the monitoring of VF progression – using prior knowledge of patients with glaucoma or structural information – is another interesting research topic.43,44 Many researchers have used computer simulation to compare methods,27,32,45 which provides a means of benchmarking different techniques in the absence of a true gold standard for VF progression, so the future may see this approach used increasingly. New modelling methods that handle the problem of detecting VF progression better than standard linear modelling are also being investigated.46,47
Finally, little is known about how VF defects, at different stages of glaucoma, affect patients’ abilities to perform everyday visual tasks. Being able to link the measurements taken in the clinic to what patients visually ‘can’ and ‘cannot do’ would be enormously helpful. In particular, the VF component of legal fitness to drive is drawing much investigation21,48,49 because it is closely related to an individual’s quality of life.50 In this respect, the future benchmark for the success of new glaucoma treatments in preventing progression could be aligned to a measurable reduction in visual disability rather than imperceptible changes on a clinical chart. One conclusion of which we can be certain is that, for the moment, detecting visual field progression is non-trivial and quantitative tools are essential to aid clinical decisions.