Abstract
Background
Early detection of disease outbreaks enables public health officials to implement disease control and prevention measures at the earliest possible time. A time periodic geographical disease surveillance system based on a cylindrical spacetime scan statistic has been used extensively for disease surveillance along with the SaTScan software. In the purely spatial setting, many different methods have been proposed to detect spatial disease clusters. In particular, some spatial scan statistics are aimed at detecting irregularly shaped clusters which may not be detected by the circular spatial scan statistic.
Results
Based on the flexible purely spatial scan statistic, we propose a flexibly shaped spacetime scan statistic for early detection of disease outbreaks. The performance of the proposed spacetime scan statistic is compared with that of the cylindrical scan statistic using benchmark data. In order to compare their performances, we have developed a spacetime power distribution by extending the purely spatial bivariate power distribution. Daily syndromic surveillance data in Massachusetts, USA, are used to illustrate the proposed test statistic.
Conclusion
The flexible spacetime scan statistic is well suited for detecting and monitoring disease outbreaks in irregularly shaped areas.
Background
The anthrax terrorist attacks in 2001, the severe acute respiratory syndrome (SARS) outbreak in 2002, and a concern about pandemic influenza have motivated many public health departments to develop early disease outbreak detection systems. Early detection of disease outbreaks enables public health officials to implement disease control and prevention measures at the earliest possible time. For an infectious disease, improvement in detection time by even one day might enable public health officials to control the disease before it becomes widespread. In many cities such as New York City [1], Washington, D.C. [2], Boston [3,4], Denver, and Minneapolis, realtime, geographic, early outbreak detection system have been implemented. For a welldefined geographical area, standard disease surveillance uses purely temporal methods that seek anomalies in time series data without using spatial information [5]. The increased need for geographical cluster detection has coincided with an increasing availability of spatial data [6]. Investigators ask whether the geographical cluster is unlikely to have arisen by chance given random variations from the background incidence, according for the multiple comparisons inherent in the many possible cluster locations and size evaluated. Scan statistics are tools to answer such questions [7,8]. Increasingly, there is interest in the prospective surveillance of new data as it becomes available in order to detect a localized disease outbreak as early as possible. Particularly in light of the perceived threat of bioterrorism and newly emerging infectious diseases, there has been a spate of recent interest in the development of geographic surveillance systems that can detect changes in spatial patterns of disease [9]. Recently, a time periodic geographical disease surveillance system based on a cylindrical spacetime scan statistic was proposed by Kulldorff and colleagues [10,11].
Several different approaches to the statistical assessment of potential geographic clustering in either pointor areabased disease data have been developed [12,13]. Almost all of these purely spatial approaches are retrospective, in the sense that they describe statistical tests that are designed to be carried out once, on a set of data that has been collected from the recent past [9]. In particular, the circular spatial scan statistic [8] has been used extensively for the detections and evaluation of purely spatial disease clusters along with the SaTScan software [14]. For example, as part of their cancer surveillance initiative, the New York State Department of Health used the spatial scan statistic to look at the geographical variation of breast, lung, prostate, and colorectal cancer incidence in New York State, finding various statistically significant clusters but no local hotspots with greatly elevated risk [15]. However, as the statistic uses a circular scanning window with variable size to define the potential cluster area, it is difficult to correctly detect some noncircular clusters such as those along a river [16]. Recently, spatial scan statistics for irregular shaped clusters have been proposed, using the same likelihood ratio test formulation as before. The spatial scan statistics proposed by Duczmal and Assunção [17], Patil and Taillie [18], Tango and Takahashi [16], Assunção et al. [19] and Kulldorff et al. [20] are aimed at detecting irregularly shaped clusters which may not be detected by the circular spatial scan statistic. Due to the unlimited geometric freedom of cluster shapes, some of these statistics run the risk of detecting quite large and very peculiarly shaped clusters. The flexible spatial scan statistic [16], which has been used along with the FleXScan software [21], has a parameter K as the preset maximum length of neighbors to be scanned, to avoid detecting clusters with a very peculiar shape.
In this paper, we propose a flexibly shaped spacetime scan statistic ("flexible spacetime scan statistic" hereafter) for the early detection of disease outbreaks. It is based on the flexible purely spatial scan statistic [16] and the prospective spacetime scan statistic [10]. The performance of our proposed spacetime scan statistic is compared with that of the cylindrical scan statistic, using the benchmark data provided by Kulldorff et al. [22]. In order to evaluate its performance we propose a spacetime power distribution by extending the purely spatial bivariate power distribution [16]. Daily syndromic surveillance data in Massachusetts, USA, are used to illustrate the proposed method with real data.
The flexible spacetime scan statistic
Consider the situation where an entire study area is divided into m regions (for example, counties, ZIP codes, enumeration districts, etcetera), and each region is periodically reporting the number of cases of a disease or syndrome under study. We assume that, under the null hypothesis of no clustering, the number of cases N_{id }is a Poisson random variable with the observed value n_{id }and the expected values μ_{id }in each region i(i = 1,...,m) at time d, where μ_{id }is proportional to its population size, or a covariateadjusted population at risk. Since we are only interested in detecting clusters that are alive (active) at the current time t_{P}, we only consider 'alive' clusters that are present in the following T time intervals:
where T is a prespecified maximum temporal length of the cluster.
A time periodic geographical disease surveillance system based on a cylindrical spacetime scan statistic has already been proposed by Kulldorff [10]. The cylindrical spacetime scan statistic uses a cylindrical window in three dimensions where the base of the cylinder represents space and the height represents time. As with the purely spatial scan statistic, the cylindrical spacetime scan statistic imposes a circular base Z on each centroid of regions for each of T time intervals. For each of centroids, the radius of the circle is varied from zero up to a preset maximum radius, for example, so that the window never includes more than 50% of the total population at risk [8]. In this paper, we use a preset maximum number of regions K to be included in the cluster as an upperbound of the radius. If the base contains the centroid of a region, then that whole region is included in the base. In total, a very large number of different but overlapping circular bases are created, each with a different set of neighboring regions and each being a possible candidate area containing a disease outbreak. Let Z_{ik}, k = 1,...,K, denote the base composed by the region i and the (k  1)nearest neighbors to i. Then, all the cylindrical windows to be scanned by the cylindrical scan statistic are the cylinders with the base in the set
and the heights in the set
On the other hand, a flexible spacetime scan statistic which we propose in this paper imposes a three dimensional prismatic window with an arbitrarily shaped base Z. For any given region i, we create the set of arbitrarily shaped bases consisting of k connected regions (1 ≤ k ≤ K) including i. To avoid detecting a cluster of unlikely peculiar shape, the connected regions are restricted as the subset of the Knearest neighbors to the region i, where K = 1 implies the region i itself. Let Z_{ik(j)}, j = 1,...,j_{ik }denote the jth window which is a set of k regions connected starting from the region i, where j_{ik }is the number of j satisfying Z_{ik(j) }⊆ Z_{iK }for k = 1,...,K. Then, all the windows to be scanned are the prisms whose base is included in the set
with height in the set . In other words, for any given region i, the cylindrical scan statistic consider K concentric circles for the base, whereas the flexible scan statistic consider K concentric circles plus all the sets of connected regions including the single region i, whose centroids are located within the Kth largest concentric circle.
Define L(W) as the likelihood under the alternative hypothesis that there is a cluster in the spacetime window W(∈ ), where (or ) and L_{0 }the likelihood under the null hypothesis. Then, conditioning on the observed total number of cases, N, the definition of the spacetime scan statistic S is the maximum likelihood ratio over all possible windows W,
Let n_{W }be the number of cases in window W . For the Poisson model, let μ_{W }be the expected number in window W under the null hypothesis, so that μ_{G }= N for G, the entire study space in three dimensions. It can then be shown that
if n_{W }> μ_{W}, and L(W)/L_{0 }= 1 otherwise. The window for which the likelihood ratio is maximized identifies the most likely cluster (MLC) [8]. To find the distribution of the log likelihood ratio (LLR) under the null hypothesis, Monte Carlo hypothesis testing [23] is required. pvalue of the test is based upon the null distribution of LLR with large number B of Monte Carlo replications of data sets generated under the null hypothesis, i.e.,
where LLR_{v }and LLR* is the value of the test statistic for the vth Monte Carlo replicate and that for the observed data, respectively, and I(·) is the indicator function.
Syndromic surveillance in Massachusetts
We applied the prospective flexible spacetime scan statistic to daily syndromic surveillance data in eastern Massachusetts mimicking a real time surveillance system. The data came from an electronic medical record system used by Harvard Vanguard Medical Associates [3,24]. We used the rash and respiratory data during August 1–30, 2005. The data are geographically aggregated to ZIP codes. The number of ZIP codes used were different for each syndrome, for example cases of the rash were analyzed in 252 ZIP codes and respiratory in 385. Note that for the flexible spacetime scan statistic, the ZIP code whose data does not exist, was treated like a ravine. For example, assume that ZIP codes i_{1 }and i_{2}, i_{2 }and i_{3 }are adjacent each other, respectively, but i_{1 }and i_{3 }are not adjacent. If the data of i_{2 }does not exist under the situation, then it is assumed that i_{1 }and i_{3 }are not directly connected.
Based on the prior daily data for over a year in MA, the expected number of cases were calculated as the predicted means from a generalized linear mixed model (GLMM) as developed by Kleinman et al, adjusted for seasonal effect, day of week, etc, these are the same expectations used in the actual real time surveillance system [25]. We set K = 20 as the maximum length of the geographical window, and the maximum temporal length to be T = 7 days. The number of replications for the Monte Carlo procedure was set to B = 999. In disease outbreak detection, the recurrence interval (RI) is often used as an alternative to the pvalue [14]. The measure reflects how often a cluster will be observed by chance, assuming that analyzes are repeated on a regular basis with a periodicity equal to the period of the study. For daily surveillance such as this analysis, the pvalue of 0.001 corresponds to the RI of 1,000 days, i.e., 2.7 years, and an alpha level of 0.0027 corresponds to one expected false alarm every year.
The results of analysis during August 1–30 by the flexible and the cylindrical spacetime scan statistics are given in Tables 1, 2 and Figure 1. The tables show results for the days with p < 0.0054, which corresponds to the RI of at least 6 months. When looking at rash outbreaks (Table 1), both tests detected the same cluster with a single ZIP code 01951 on August 7, with the same temporal length (6 days) and the same RI (2.7 years). Note that the clusters detected by both tests from August 8 to 10 are not signals of an outbreak because the number of cases on August 8 must be 0, and on August 9 and 10, the number of cases of the cluster was decreasing. For respiratory syndrome (Table 2), each test detected a different cluster with the same RI of 2.7 years on August 12. The cluster detected by the flexible scan statistic contained 12 ZIP codes, while that from the cylindrical scan statistic contained 18 ZIP codes, with 11 ZIP codes detected in common. On August 13 and 14, the flexible scan statistic detected significant clusters with larger RIs, 333 days and 250 days respectively, while the cylindrical scan statistic detected clusters with short RIs, 91 days and 30 days respectively. The flexible scan statistic also detected a cluster on August 15 (RI = 1.4 years) with a temporal length of 6 days, while the cylindrical scan statistic detected a cluster with a temporal length of 5 days (RI = 200 days). For the 6 days from August 12 to 17 (results on August 16 and 17 are not shown in Table 2 because of shorter RIs), the cylindrical scan statistic kept detecting the same cluster, while the flexible scan statistic detected a similar but slightly different cluster each day. However, we should acknowledge the similar lack of evidence in Table 2 for a continued outbreak on August 13 to 14, because the number of additional cases on those days is very close to the expected number of additional cases. On the other hand, there is some evidence for an excess of cases on August 15 (23 additional cases), although the estimated relative risk is substantially reduced.
Table 1. Detected outbreaks of Rash based on daily syndromic surveillance data in eastern Massachusetts during August 1–30, 2005.
Table 2. Detected outbreaks of Respiratory based on daily syndromic surveillance data in eastern Massachusetts during August 1–30, 2005.
Figure 1. Detected outbreaks of Rash and Reepiratory in eastern Massachusetts during August 1–30, 2005, by the cylindrical scan statistic ((a) and (b)) and the flexible scan statistic ((a), (c) and (d)).
Statistical power, sensitivity and positive predictive value
In this section, we compare the flexible and cylindrical spacetime scan statistics, using benchmark data from 176 New York City ZIP codes ([14,22]). This benchmark data has been described in detail elsewhere [22], and here we only give a brief overview. Based on 2002 numbers, the total population is 8,003,510. The benchmark data sets contain a number of randomly located of cases of a hypothetical disease or syndrome, generated either under the null model with no outbreaks or under one of eight different alternative models with an outbreak in one of four different locations and with either a high or modest excess risk. For each of the null and alternative models, three different sets of data sets were generated, with 31, 32, and 33 days, respectively. For each of the null models, 9,999 random data sets were generated. For each of the alternative models, 1,000 random data sets were generated.
For each data set, the total number of randomly allocated cases was 100 times the number of days (i.e., 3,100 cases in the data sets containing 31 days). The number 100 was chosen to reflect the occurrence rate of certain syndromes common to the NYC emergency department(ED)based syndromic surveillance system. Under the null model, each person living in NYC is equally likely to contract the disease, and the time of each case is assigned with equal probability to any given day. Thus, each case was randomly assigned to ZIP code i and day d with probability proportional to μ_{id }= pop_{i}, where pop_{i }is the population of ZIP code i. For the alternative models, one or more ZIP codes were assigned an increased risk on Day 31 and, when applicable, on Days 32 and 33 as well. For these ZIP code and day combinations, μ_{id }was multiplied by an assigned relative risk. For all other ZIP code and day combinations, μ_{id }did not change. Each case was then randomly assigned with probability proportional to the new set of μ_{id }to generate data under the alternative models.
Eight alternative models were evaluated, based on four different outbreak areas of length s* and total population pop* therein, with either high or medium relative risk (RR) [22] (Figure 2).
Figure 2. NYC 176 ZIP codes area and assumed clusters (i) Cluster A, (ii) Cluster A5, (iii) The Rockaways, and (iv) Hudson River.
1. Cluster A: a single ZIP code area in Brooklyn (circular area)
s* = 1, pop* = 85, 089, RR: high = 9.91, medium = 5.66
2. Cluster A5: the same ZIP code with 4 neighboring ZIP codes (noncircular area)
s* = 5, pop* = 318, 754, RR: high = 4.47, medium = 3.06
3. The Rockaways, 5 ZIP codes area (noncircular area)
s* = 5, pop* = 106, 738, RR: high = 8.48, medium = 5.01
4. Hudson River: 20 ZIP codes areas along the shore of the Hudson River (noncircular area)
s* = 20, pop* = 827, 382, RR: high = 2.97, medium = 2.24
A maximum length of the geographic window K = 20 was used for the flexible scan statistic, while the cylindrical scan statistic used a maximum of either K = 20 or a 50 % of the population at risk. A period of T = 3 days was used as the maximum temporal length of the cluster. We did not use the options to include purely temporal clusters (see details in [14]).
Standard statistical power
First of all, we estimated the standard statistical power, which is the probability that the null hypothesis is rejected at the α = 0.05 significance level, without considering the overlap between the detected and real clusters. The random data sets generated under the null model were used to get the critical values of the scan statistics. For α = 0.05, this is defined as the 500th highest log likelihood ratio when raning those value from all the 9,999 simulated data sets. The estimated power was then calculated is the proportion of the 1,000 random data sets that had a higher log likelihood ratio than the critical value obtained from the null data sets. The results are shown in Table 3. In general, the cylindrical spacetime scan statistic has higher power for the three more compact clusters, while the flexible spacetime scan statistic have higher power for the long and narrow the Hudson River cluster. On Day 33 of the high excess risk outbreaks, both methods have very high power.
Table 3. Standard power of the prospective spacetime scan statistics – flexible and cylindrical – at different days of the outbreak
Spacetime power distribution
In order to compare the performance of the cluster detection tests, the standard power has been derived in the same manner as for usual hypothesis tests. However, it should be noted that standard statistical power reflect the 'power to reject the null hypothesis for whatever reasons,' while the probability of both rejecting the null hypothesis and accurately identifying the true cluster is a different matter altogether.
In order to compare the performance of purely spatial cluster detection tests, Tango and Takahashi [16] proposed a spatial bivariate power distribution P_{0}(l, s  s*) based on Monte Carlo simulation where l is the length of the significant MLC, while s is the number of regions identified out of the true cluster with s* regions.
where L and S denote the random variable of l and s under the specified model, respectively, and l ≥ 1 and 0 ≤ s ≤ s*. In a similar manner, we propose a spacetime trivariate power distribution for a spacetime cluster detection test based on Monte Carlo simulation where the temporal length of the true cluster is denoted t*:
where U denotes the random variable of t and 1 ≤ t ≤ T.
In Tables 4, 5 and 6, we show the estimated trivariate power distribution P(l, s, t  s*, t*) × 1,000 for (a) Cluster A (s* = 1) on Day 31 (t* = 1) (b) Cluster A5 (s* = 5) on Day 33 (t* = 3) and (c) the Rockaways cluster (s* = 5) on Day 33 (t* = 3), in all cases with high excess risk.
Table 4. Spacetime power distribution P_{1}(l, s, t  s*, t*) for the Cluster A (s* = 1) on Day 31 (t* = 1) with high risk (RR= 9. 91), where t is a temporal length of detected cluster. The mark "*" is the powers of accurate detection.
Table 5. Spacetime power distribution P_{1}(l, s, t  s*, t*) for the Cluster A5 (s* = 5) on Day 33 (t* = 3) with high risk (RR = 4. 47), where t is a temporal length of detected cluster, and the raw all cells of which have zero powers of both tests is not shown. The mark "*" is the powers of accurate detection.
Table 6. Spacetime power distribution P_{1}(l, s, t  s*, t*) for the Rockaways (s* = 5) on Day 33 (t* = 3) with high risk (RR = 8. 48), where t is a temporal length of detected cluster, and the raw all cells of which have zero powers of both tests is not shown. The mark "*" is the powers of accurate detection.
This trivariate power distribution provides us with a detailed description of the spacetime cluster detection tests performance. For the outbreak in cluster A with a single ZIP code, the cylindrical scan statistic has higher power to detect the cluster with complete accuracy, with P_{1}(l = 1, s = 1, t = 1  s*, t*) = 697/1000, compared to 315/1000 for the flexible. Moreover, the flexible scan statistic has a heavier tail in the (s, t) = (1, 3) column than the cylindrical one. However the cylindrical scan detected some large clusters including several with l ≥ 15. For outbreaks in the noncircular shaped A5 and Rockaway clusters, the flexible scan statistic has higher power for complete accurate detection. Indeed, the cylindrical scan statistic cannot detect these clusters with complete accuracy since they are not circular, so that the power for complete accuracy is zero. Moreover, note that for cluster A5, the flexible scan statistic is more likely to include all the five areas in the true cluster (797 + 12 = 809/1000 versus 601 + 12 = 613/1000), and it is also more likely to avoid including any of the ZIP codes outside the true cluster (12 + 74 + 2 + 287 + 3 = 378/1000 versus 37 + 1 + 301 + 7 = 346/1000). For the Rockaway cluster, the flexible scan statistic is again more likely to include all the five areas in the true cluster (667 + 4 + 1 = 672 versus 1 + 0 + 1 = 1), but the cylindrical scan statistic avoids the ZIP codes outside the cluster more often (2 + 8 + 52 + 1 + 876 + 6 + 1 + 0 + 0 + 0 = 946/1000 versus 0 + 0 + 6 + 0 + 181 + 1 + 0 + 571 + 2 + 0 = 761/1000). Tables 5 and 6 show that the temporal accuracy of the detected cluster is very good for both methods. For example, for cluster A5, the flexible scan has P_{1}(+, +, 3  s*, t*) = ∑_{l }∑_{s}P_{1}(l, s, 3  s*, t*) = (15 + 171 + 797)/1000 = 0.983 while the cylindrical scan has P_{1}(+, +, 3  s*, t*) = (41 + 338 + 601)/1000 = 0.980.
The complexity of the threedimensional trivariate power distributions suggests that we need some summary measure. Since the temporal accuracy is very similar, we focus on the geographical accuracy. We will compute the extended power of spatial cluster detection tests, as developed by Takahashi and Tango [26]. We will also define and compute geographical sensitivity and false positive rates.
The extended power
We can consider two types of spatial misclassifications when applying the cluster detection test (CDT). One is a false negative test result (FN) in which the CDT misses a region included in the true cluster. Sensitivity is 1  FN rate. The other is a false positive test result (FP) in which the CDT incorrectly detects a region that is not present in the true cluster. The numbers of FNs and FPs for geographical detection are s*  s and l  s, respectively.
The extended power is based on the bivariate distribution P_{0}(l, s  s*) and penalties introduced for the FPs and FNs of the geographical detection as
where W(l, s; w^{}, w^{+}) is a weight function such that
and w^{ }and w^{+ }are the predefined penalties for the FNs and FPs (per region), respectively. This power includes the following three special powers:
1. The standard power as I(0, 0).
2. The power to detect the geographical true cluster accurately as I(1, 1).
3. The power for which the MLC includes all the regions within the true cluster as I(1, 0).
Takahashi and Tango [26] also proposed the profile of the extended power as
where r = w^{+}/w^{ }with w^{ }= 1/s*, because it is difficult to set the value of w^{ }and w^{+ }in advance. Figure 3 shows the plots of the profile Q(r  s*) against r (0 ≤ r ≤ 1) for flexible and cylindrical scan statistics applied to (a) the cluster A5 and (b) the Rockaways, both on Day 33 with high risk, based upon Tables 5 and 6. Figure 3(a) shows the flexible scan statistic has higher extended power when r = 0 i.e. penalties for the FP w^{+ }= 0, I(1/5, 0) = 0.978 for the flexible and 0.954 for the cylindrical, while the extended power of cylindrical scan statistic is higher for large r, as I(1/5, 1/5) = 0.765 for the flexible and 0.862 for the cylindrical. On the other hand, Figure 3(b) shows the flexible scan statistic is more uniformly powerful than the cylindrical one for the Rockaways cluster, I(1/5, 0) = 0.958 and I(1/5, 1/5) = 0.913 for the flexible, and I(1/5, 0) = 0.885 and I(1/5, 1/5) = 0.872 for the cylindrical, respectively.
Figure 3. Profile of the extended power Q(r  s*) for flexible and cylindrical scan statistics applied to the cluster (a) Cluster A5, and (b) The Rockaways.
Sensitivity and positive predictive value
As other measures of accuracy of cluster detection tests, we shall consider sensitivity and positive predictive value [27,28]. These measures can be defined in terms of either the number of regions or the population. First, we define sensitivity of cluster detection tests as the probability of detecting the regions that actually constitute the cluster, i.e, proportion of the number of regions correctly detected from the true cluster, s/s*. We shall present the expected value:
Positive predictive value (PPV) of cluster detection tests is defined in a similar manner as the proportion of the number of true regions in the detected cluster, i.e, s/l under l > 0, and the expected value is presented:
Based upon the population, we can define the following sensitivity TP_{2 }and positive predictive value PP_{2}:
All these summary measures are better the larger they are with 100 being the optimal.
Table 7 shows the sensitivity and PPV of the flexible and cylindrical spacetime scan statistics for each cluster with a high relative risk. For cluster A, the cylindrical scan statistic has higher PPV and higher sensitivity than the flexible one. For cluster A5 and the cylindrical has higher PPV on all days and higher sensitivity on day 31, but the flexible scan statistic has higher sensitivity on days 32 and 33. The same is true for the Rockaway cluster. For the Hudson River cluster, the flexible scan statistic has higher PPV than the cylindrical. The flexible scan has higher sensitivity than for the cylindrical with the same upper constant K = 20 on the number of regions in the detected cluster, but lower sensitivity compared to the cylindrical scan with a 50% upper limit on the cluster size. Note though, that this difference in sensitivity is less than the difference in PPV that goes the other way.
Table 7. Sensitivity and positive predictive value (PPV) of the flexible and cylindrical spacetime scan statistics.
Conclusion
In this paper, we have proposed a flexible spacetime scan statistic to detect arbitrarily shaped disease outbreaks. We have also presented a trivariate power distribution which is useful for evaluating the performance of cluster detection tests, informing us about the spatial and temporal accuracy of the detected clusters in addition to the standard statistical power.
For the benchmark data evaluated in this paper, the cylindrical scan statistic performs better for the small single zipcode cluster, although by the third day of the outbreak both methods are almost perfect. For the small irregular shaped clusters, A5 and Rockaways, the cylindrical performs better on the first day of the outbreak, but as more data accumulates, the flexible scan statistic has certain advantages in determining the precise size and shape of the outbreak. For the large and narrow Hudson River cluster, the flexible scan statistic performs better than the cylindrical one, with slightly higher standard power, much higher PPV and slightly higher or lower sensitivity depending on the type of cylindrical method used. Results may be different for other types of regular and irregularly shaped disease outbreaks, but the four examples used in this paper gives some sense of the proposed methods performance.
For early detection, timeliness is much more important than geographical accuracy. When monitoring an occurring outbreak, on the other hand, geographical accuracy becomes critical and is then the key objective since we already know the outbreak is there. Our results suggest that we may use both the cylindrical and flexible scan statistic for disease outbreak detection, but for different purposes. Specifically, for detecting new outbreak that, one may want to use the cylindrical scan statistic. That is especially if we expect the outbreak to start locally, within a reasonably small and compact area containing only a few ZIPcodes. On the other hand, once the outbreak has spread to a larger area, and we want to monitor that spread, one may want to use the flexible scan statistic, with its ability to accuratly determine the precise geographical extent of irregular shaped outbreaks. This is especially true ones the outbreak has left its local area of origin.
To evaluate the performance of spacetime scan statistic, we applied the extended power for purely spatial cluster detection test (8), which is defined as the weighted sum of the bivariate power distribution wherein the weight is given by the geometric mean of (1penalty for the false negatives) and (1penalty for the false positives), including the standard power as a special case. Also we applied the profile Q(r  s*) proposed by Takahashi and Tango [26]. This plot gave us a detailed description regarding power of cluster detection tests. Needless to say, it is possible to extend it to spacetime version if we could consider the penalties for temporal false negatives and false positives, but we leave this problem for future work. Also, for the profile of the extended power, we chose to use a fixed cost of w^{ }= 1/s* for false negatives and a smaller or equal cost for false positives. For more general situations, we could plot the full bivariate extended power function on the unit square.
Similarly to the flexible spatial scan statistic in the purely spatial situation, the flexible spacetime scan statistics proposed in this paper has a limitation of cluster size, because of the limitation of the speed of computation. The proposed scan statistic works well for small to moderate sized clusters. Although we set the maximum length of the geographical window to K = 20, this is not large enough to detect the 20 ZIP codes of the Hudson River cluster accurately because this cluster is too long to be the subset of the 20th nearest neighbors of any region. Computation time depends on the size of the data set and K. Indeed, for the August 11 analysis of respiratory syndrome data in Massachusetts, with 385 ZIP codes, a maximum temporal length of T = 7 days, a maximum spatial size of K = 20, and with 999 Monte Carlo replications, the flexible spacetime scan statistic took 87.7 minutes to run on a 3.06GHz Pentium 4 computer, while the cylindrical spacetime scan statistic took only 9.8 minutes.
A limitation of length may also prevent the analysis to present large clusters of unlikely and very peculiar shapes. These undesirable properties produced by maximum likelihood ratio might suggest the use of different criterion for model selection, including some penalized likelihood [20,29]. Also, for larger cluster seizes, the method is not practically feasible and a more efficient algorithm is needed.
In this paper, we considered the right cylinder or right prism of the cluster model, as an expansion of the cylindrical spacetime scan statistic for a prospective disease surveillance by Kulldorff [10]. This does not allow the scanning window to adjust itself as the disease outbreak grows or shrinks geographically over time. Recently, Iyengar has suggested using a square pyramid shape window which can model either growth (or shrinkage) and movement of the disease cluster [30]. For the proposed flexible spacetime scan statistic, if we could consider the flexibility in both space and time, that is, evaluating all connected subsets within a cylinder instead of in (4), we can detect more arbitrarily shaped clusters in spacetime. For such an expansion, an efficient computational algorithm will be needed for the scanning process, as well as a more sophisticated mechanism for the interpretation of such complicatedly shaped clusters. The implementation and importance of such methods for disease surveillance and monitoring, is an issue for future research.
Authors' contributions
KT, MK and TT developed the statistical methodology and designed the study. KT, MK and KY analyzed and interpreted the syndromic surveillance data. KT programmed the methods, did the power calculations and wrote the first draft of the manuscript. All authors participated in the interpretation of the results, revised the manuscript, and approved the final version.
Acknowledgements
The authors thank Allyson Abrams for comments concerning the syndromic surveillance data from Massachusetts, and Dr. Tetsuji Yokoyama for advice about C++ programming.
This research was partly founded by a Modeling Infectious Disease Agent Study (MIDAS) grant (No. U01GM076672) from the National Institute of General Medical Science, National Institutes of Health, USA, and a scientific grant (No. H16Kenkou039) from the Ministry of Health, Labour and Welfare, Japan.
References

Heffernan R, Mostashari F, Das D, Karpati A, Kulldorff M, Weiss D: Syndromic surveillance in public health practice, New York City.
Emerging Infectious Diseases 2004, 10:858864. PubMed Abstract  Publisher Full Text

Lombardo J, Burkom H, Elbert E, Magruder S, Lewis SH, Loschen W, Sari J, Sniegoski C, Wojcik R, Pavlin J: A systems overview of the electronic surveillance system for the early notification of communitybased epidemics (ESSENCE II).
Journal of Urban Health 2003, 80(2 suppl.1):i32i42. PubMed Abstract

Lazarus R, Kleinman K, Dashevsky I, Adams C, Kludt P, DeMaria A, Platt R: Use of automated ambulatorycare encounter records for detection of acute illness clusters, including potential bioterrorism events.
Emerg Infect Dis 2002, 8(8):753760. PubMed Abstract  Publisher Full Text

Platt R, Bocchino C, Caldwell B, Harmon R, Kleinman K, Lazarus R, Nelson AF, Nordin JD, Ritzwoller P: Syndromic surveillance using minimum transfer of identifiable data: the example of the National Bioterrorism Syndromic Surveilance Demonstration Program.
Journal of Urban Health 2003, 80(2 suppl.1):i25i31. PubMed Abstract

Sonesson C, Bock D: A review and discussion of prospective statistical surveillance in public health.
Journal of the Royal Statistical Society, Series A 2003, 166:521.

Lawson AB, Kleinman K, Eds: Spatial & Syndromic Surveillance for Public Health. Chichester: Wiley; 2005.

Naus J, Wallenstein S: Temporal surveillance using scan statistics.
Statistics in Medicine 2006, 25:311324. PubMed Abstract  Publisher Full Text

Kulldorff M: A spatial scan statistic.
Communications in Statistics – Theory and Methods 1997, 26:14811496.

Rogerson PA, Yamada I: Monitoring change in spatial patterns of disease: comparing univariate and multivariate cumulative sum approaches.
Statistics in Medicine 2004, 23:21952214. PubMed Abstract  Publisher Full Text

Kulldorff M: Prospective time periodic geographical disease surveillance using a scan statistic.
Journal of the Royal Statistical Society, Series A 2001, 164:6172.

Kulldorff M, Heffernan R, Hartman J, Assunção R, Mostashari F: A spacetime permutation scan statistic for disease outbreak detection.
PLoS Medicine 2005, 2(3):e59. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Lawson AB, Biggeri A, Böhning D, Lesaffre E, Viel JF, Bertollini R, Eds: Disease Mapping and Risk Assessment for Public Health. New York: Wiley; 1999.

Lawson AB: Statistical Methods in Spatial Epidemiology. 2nd edition. Chichester: Wiley; 2006.

Kulldorff M, Information Management Services, Inc: SaTScan version 7.0: software for the spatial and spacetime scan statistics. [http://www.satscan.org/] webcite
2007.

Kulldorff M: Scan statistics for geographical disease surveillance: an overview. In Spatial & Syndromic Surveillance for Public Health. 2nd edition. Edited by Lawson AB, Kleinman K. Chichester: Wiley; 2005:115131.

Tango T, Takahashi K: A flexibly shaped spatial scan statistic for detecting clusters.
International Journal of Health Geographics 2005., 4(11) PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Duczmal L, Assunção R: A simulated annealing strategy for the detection of arbitrarily shaped spatial clusters.

Patil GP, Taillie C: Upper level set scan statistic for detecting arbitrarily shaped hotspots.

Assunção R, Costa M, Tavares A, Ferreira S: Fast detection of arbitrarily shaped disease clusters.
Statistics in Medicine 2006, 25:723742. PubMed Abstract  Publisher Full Text

Kulldorff M, Huang L, Pickle L, Duczmal L: An elliptic spatial scan statistic.
Statistics in Medicine 2006, 25:39293943. PubMed Abstract  Publisher Full Text

Takahashi K, Yokoyama T, Tango T: FleXScan version 2.0: Software for the Flexible Spatial Scan Statistic. [http://www.niph.go.jp/soshiki/gijutsu/index_e.html] webcite

Kulldorff M, Zhang Z, Hartman J, Heffernan R, Huang L, Mostashari F: Benchmark data and power calculations for evaluating disease outbreak detection methods.
Morbidity and Mortality Weekly Report 2004, 53(Supplement 1):144151. PubMed Abstract  Publisher Full Text

Dwass M: Modified randomization tests for nonparametric hypotheses.

Lazarus R, Kleinman K, Dashevsky I, DeMaria A, Platt R: Using automated medical records for rapid identification of illness syndromes (syndromic surveillance): the example of lower respiratory infection.
BMC Public Health 2001, 1:9. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Kleinman K, Lazarus R, Platt R: A generalized linear mixed models approach for detecting incident clusters of disease in small areas, with an application to biological terrorism.
American Journal of Epidemiology 2004, 159:217224. PubMed Abstract  Publisher Full Text

Takahashi K, Tango T: An extended power of cluster detection tests.
Statistics in Medicine 2006, 25:841852. PubMed Abstract  Publisher Full Text

Forsberg L, Bonetti M, Jeffery C, Ozonoff A, Pagano M: Distancebased methods for spatial and spatiotemporal surveillance. In Spatial & Syndromic Surveillance for Public Health. 2nd edition. Edited by Lawson AB, Kleinman K. Chichester: Wiley; 2005:115131.

Huang L, Kulldorff M, Gregorio D: A spatial scan statistic for survival data.
Biometrics 2007, 63:109118. PubMed Abstract  Publisher Full Text

Duczmal L, Kulldorff M, Huang L: Evaluation of spatial scan statistics for irregularly shaped clusters.
Journal of Computational and Graphical Statistics 2006, 15(2):428442.

Iyengar VS: Spacetime clusters with flexible shapes.
Morbidity and Mortality Weekly Report 2005, 54(Supplement):7176. PubMed Abstract  Publisher Full Text