Keynote Lectures — Importance of Tumour Measurements

doi:10.1102/1470-7330/00/010028+07

Open access
Published: 05 May 2015

Keynote Lectures — Importance of Tumour Measurements

Tuesday 10 October 2000, 09.15–10.45

Cancer Imaging volume 1, pages 28–34 (2000)Cite this article

3563 Accesses
Metrics details

Cancer imaging — the significance of the findings

Rodney H. Reznek

Academic Department of Radiology, St Bartholomew’s Hospital, London, UK
Rodney H. Reznek

Introduction

Over the past several years, the technological advances in the field of body imaging have been almost too numerous to catalogue. Each modality goes through a cyclical pattern of evolution. In the earliest phase of this evolution, most research is descriptive and anecdotal in nature. As a modality becomes established, it enters the second phase, in which it is touted as being superior to all prior conventional modalities. The third phase represents a backlash effect in which the shortcomings of a new technique and its inferiority to prior modalities are stressed. Finally, the technical development of a new modality reaches a plateau, the literature reflects an equality with earlier techniques. It is during this phase that the true cost-effectiveness of a new imaging technique and its impact on patient care are established. All new imaging modalities go through these phases. Most academic radiology departments are composed of single modality advocates who fail to see the interrelationships among the available imaging techniques. The radiologists must be prepared to offer unbiased aid to the referring clinicians in choosing the most cost-effective procedure from the radiological armamentarium. Nowhere is this more relevant than in imaging patients who have cancer where the range of anatomy and pathology to be imaged is infinite. Close co-operation between the clinician and radiologist is essential and a clear understanding of the purpose of the imaging is mandatory.

The aims of imaging

Diagnostic imaging fulfils several functions in patients with cancer. These include making a diagnosis; staging the disease; monitoring response; detecting recurrence and research applications.

Diagnosis

It is only infrequently that straightforward imaging provides a sufficiently specific diagnosis on which treatment can be based. However, the application of image-guided biopsy techniques has revolutionized the ease with which cytology or histology can be obtained. Few anatomic sites are now inaccessible to the skilled radiologist using imaging guidance. The choice of the most appropriate form of imaging guidance will vary from institution to institution depending on the skill and preference of individual radiologists and also on the site of the disease.

Staging the disease

Increasingly, imaging techniques are being applied to provide a more refined and accurate staging of the disease. However, to do this requires a detailed knowledge of the sensitivities, specificities and accuracy of individual imaging techniques as they relate to assessing the stage of each individual pathological process. These will be discussed in detail below. A detailed knowledge of the advantages and limitations of each imaging technology as they apply to the assessment of each individual stage is required. In order to do this, close collaboration between clinician and radiologist is often essential. For example, the simple assessment of the possibility of liver metastases requires a knowledge of the most appropriate technique, its accuracies, pitfalls and shortcomings in assessing focal pathology.

Monitoring response and detecting recurrence

Increasingly, with the more effective use of radiotherapy and chemotherapeutic agents, imaging is used to assess the response to therapy and to detect recurrent disease. This usually relies on a straightforward assessment of a change in size although occasionally a change in the characteristics of a mass lesion can also provide information as to a changing response. It is recognized, however, that this is a crude form of assessment to response and that imaging is but one facet in the overall assessment of the patient’s response to therapy. Frequently, changes in the imaging appearance result from the effect of radiotherapy or chemotherapy and a detailed knowledge of these appearances is required both by the radiologist and by the oncologist. Similarly, an appreciation of the phenomenon of a residual ‘sterile’ mass is also necessary together with possible imaging strategies for evaluating this residual mass.

Research applications

The very high accuracy and reproducibility of many imaging techniques make it extremely well suited to Phase II trials in which the oncologist is assessing the biological activity of new treatments. In Phase III trials, when comparing the results of different treatments, survival is usually the final arbiter. If the size of the patient group is large enough, sophisticated staging is not needed as the stage will be randomized out. But in practice, the groups tend to be small and one of the prognostic variables, namely, the varying stage of the diseases can be removed from the study by achieving more accurate staging through CT. Another application in Phase III trial is in advanced disease where there is no obvious difference in survival and one is looking not for survival but an increase in biological activity. So, in this sort of study of patients with advanced disease, where the end point of the study is a response rate, not survival, imaging becomes a very valuable tool because of its accuracy in monitoring the disease.

Choice of the appropriate technique The choice of the most appropriate technique for each particular aim in assessing patients with cancer, depends on several factors: these include an assessment of the sensitivity and specificity, the availability of any technique, the cost and the cost-effectiveness.

Sensitivity and specificity

When selecting a diagnostic test, one of the most important considerations is the accuracy of the test. Diagnostic accuracy is best described in terms of sensitivity and specificity. Stated simply, sensitivity is the ability of the test to recognize disease and specificity is the ability of the test to recognize normality. These concepts are both illustrated with the help of a Binary Table that depicts the correlation between test interpretation and the presence or absence of disease in the population under study.

The Binary Table categorizes patients into four mutually exclusive outcomes:

(1)
positive test result in the disease present, true positive (TP);
(2)
positive test result in the disease absent, false positive (FP);
(3)
negative test result in the disease present, false negative (FN); and
(4)
negative test result in the disease absent, true negative (TN).

The sensitivity of the test is the proportion of patients with disease who have a positive test result. In other words, it is the ability of the test to recognize disease.

$${\rm{Sensitivity}} = {{{\rm{TP}}} \over {{\rm{TP}} + {\rm{FN}}}}$$

The specificity of a test is the proportion of patients without disease who have a negative test result. In other words, it is the ability of the test to recognize normality or the absence of a particular disease.

$${\rm{Specificity}} = {{{\rm{TN}}} \over {{\rm{TN}} = {\rm{FP}}}}$$

The accuracy of the test is of less value than the sensitivity and specificity because it lumps together positive and negative results.

The positive predictive value of a test indicates the probability of whether the disease is actually present if the test is positive.

$$\matrix{{{\rm{Positive}}\,{\rm{predictive}}\,{\rm{value}}\,{\rm{(PPV)}} = {{{\rm{TP}}} \over {{\rm{TP}} + {\rm{FP}}}}} \hfill \cr {{\rm{Negative}}\,{\rm{predictive}}\,{\rm{value}}\,{\rm{(NPV)}} = {{{\rm{TN}}} \over {{\rm{TN}} + {\rm{FN}}}}} \hfill \cr }$$

Receiver-Operator-Characteristics analysis

The sensitivity and specificity of a test depend on the criteria chosen for interpretation. As the criteria for calling a test result positive are made more stringent, specificity improves at the expense of sensitivity. Conversely, as the criteria are relaxed, sensitivity improves while specificity diminishes. This relationship can be demonstrated on a receiver-operator-characteristics (ROC) curve. This curve is generated by plotting the sensitivity (true-positive rate) this is 1 — for specificity (false-positive rate) for the different interpretation criteria. The fundamental principle illustrated by the ROC curve is that there is an inherent limit to the diagnostic accuracy of a test. Once this limit has been reached, the interpreter can only improve sensitivity at the expense of specificity and vice versa. The ROC curve can be used to select the ‘best’ cut-off criteria for positivity, taking the pre-test probability and the relative cost (in terms of patient outcome) of false-positive and false-negative test results into account. Additionally, ROC curves are useful in comparing the performance of different tests, because they allow for a wide range of different positivity (criteria).

Interobserver agreement (kappa test)

Altman (see further reading) describes well how to measure interobserver agreement, using as data the assessments of 85 xeromammograms by two radiologists (A and B) where the xeromammogram reports are given as one of four results: normal; benign disease; suspected cancer; cancer.

A measure of agreement is required between radiologist A and radiologist B rather than a test of association such as might be undertaken using the χ² test.

As Altman points out, the simplest approach is to count how many exact agreements were observed between A and B, which from Table 1 is 54/85 = 0.64. However, the disadvantages with this method of merely quoting a 64% measure of agreement are that it does not take into account where the agreements occurred and also the fact that one would expect a certain amount of agreement between radiologist A and radiologist B purely by chance, even if they were guessing their assessments.

Table 1 Interobserver basic data for assessment of 85 xeromammograms by two radiologists, after Altman and taken from a larger study by Boyd et al.

Full size table

The expected frequencies along the diagonal of Table 1 are given in Table 2 from which it is seen for these data that the number of agreements expected by chance is 26.2 which is 31% of the total, i.e. 26.2/85. What the kappa test gives is the answer to the question of how much better the radiologists were than 0.31.

Table 2 Calculation of the expected frequencies for the kappa test, after Altman

Full size table

The maximum agreement is 1.00 and the kappa statistic gives the radiologists’ agreement as a proportion of the possible scope for performing better than chance, which is 1.00 2− 0.31.

$$\kappa = (0.64 - 0.31)/(1.00 - 0.31) = 0.47$$

There are no absolute definitions for interpreting κ but it has been suggested that the guidelines in Table 3 can be followed, which in the example considered here means that there was moderate agreement between radiologist A and radiologist B.

Table 3 Guidelines for the interpretation of the κ statistic (REF)

Full size table

Stage migration (‘Will Rogers’ effect)

An important impact of the use of sophisticated techniques to stage patients with cancer is the apparent continuous improvement in cancer survival rates reported over the last 25 years. Although this is quickly and easily attributable to earlier diagnosis and new and more effective treatments, the effect of more accurate staging may to some extent explain these improved results. Feinstein et al. found that a 1977 cohort of patients who had undergone lung cancer treatment survived significantly longer in each of three TNM subcategories than a cohort managed in the 1950s and 1960s; a finding which is not surprising. When, however, he staged the recent cohort on clinical grounds only — without the benefit of ultrasonography, CT and nuclear medicine — these survival differences disappeared. It was apparent that the improved survival rates were mainly an artefact of better staging; patients in the lower stages with clinically occult (usually nodal) disease were being identified with better imaging and were being placed in a more advanced stage (‘stage migration’). Better staging leads to benefit to all; in the lower stages, patients with occult metastases would be removed with benefit to those stages; in the higher stages, those patients with a lower tumour burden would be added to those with a higher burden, with improvement in survival rates. Thus while individual prognosis did not change overall, survival in each stage improved. The stage migration phenomenon occurs when comparisons are made between groups of patients who have undergone less or more thorough staging techniques and as such is likely to occur when the comparisons are made over a time period which spans the introduction of new technology. It has been noted with numerous tumours including metastatic germ cell tumours and gastric cancers.

Diagnostic procedure

The diagnostic accuracy of most techniques is high but irrespective, figures for accuracy are readily available. The diagnostic impact is not limited to a change in diagnosis or prognosis but includes the ease with which the diagnosis is reached, reduction in the number of invasive investigations, and reduction of the time spent in hospital. It is self-evident that by achieving percutaneous biopsies, and diagnosing and staging tumours accurately without numerous more invasive investigations (including surgery) that most forms of imaging can be of benefit to the patient. As regards therapeutic impact there are several studies showing that imaging substantially alters the patient’s management. The effect on patient outcome is a great deal more difficult to evaluate than any of these other factors. In the short term, it has a very obvious impact by reducing the number of invasive test, by reducing the time in hospital and by avoiding surgery. The long-term effect, such as an improvement in the rate of cure, or the rate of survival or even the relief of symptoms brought about by the imaging technique is a great deal more difficult to evaluate.