Geographic patterns of cancer death rates in the U.S. have customarily been presented by county or aggregated into state economic or health service areas. Herein, we present the geographic patterns of cancer death rates in the U.S. by congressional district. Many congressional districts do not follow state or county boundaries. However, counties are the smallest geographical units for which death rates are available. Thus, a method based on the hierarchical relationship of census geographic units was developed to estimate age-adjusted death rates for congressional districts using data obtained at county level. These rates may be useful in communicating to legislators and policy makers about the cancer burden and potential impact of cancer control in their jurisdictions.
Mortality data were obtained from the National Center for Health Statistics (NCHS) for 1990–2001 for 50 states, the District of Columbia, and all counties. We computed annual average age-adjusted death rates for all cancer sites combined, the four major cancers (lung and bronchus, prostate, female breast, and colorectal cancer) and cervical cancer. Cancer death rates varied widely across congressional districts for all cancer sites combined, for the four major cancers, and for cervical cancer. When examined at the national level, broad patterns of mortality by sex, race and region were generally similar with those previously observed based on county and state economic area.
We developed a method to generate cancer death rates by congressional district using county-level mortality data. Characterizing the cancer burden by congressional district may be useful in promoting cancer control and prevention programs, and persuading legislators to enact new cancer control programs and/or strengthening existing ones. The method can be applied to state legislative districts and other analyses that involve data aggregation from different geographic units.
Cancer death rates presented by geographic boundaries such as state and county, state economic areas, and health service areas have been useful in monitoring temporal trends in allocating public health resources [1,2], and in some instances, in generating etiological hypotheses. These rates are less useful for communicating to legislators and policy makers whose jurisdictions are not defined by state or county boundaries. There have been no published studies that attempted to measure cancer death rates within congressional districts.
Public policy and legislation play a critically important role in efforts to reduce the burden of cancer. For example, the American Cancer Society estimates that in 2006 about 170,000 of the 564,830 cancer deaths are expected to be caused by tobacco use alone . Policy measures that are proven to reduce smoking prevalence include excise taxes and funding for state comprehensive tobacco control programs [4-6]. Declines in smoking prevalence among men as a result of public health efforts have had a major influence on the declines in cancer mortality in the last decade.
We present a method to calculate cancer death rates according to congressional district that may be useful in advocating for legislative initiatives and funding for cancer research and prevention programs.
Results and discussion
Maps of cancer death rates by congressional district were prepared for men and women, for all races combined, and for African Americans, non-Hispanic whites, and Hispanics (Figures 1, 2, 3, 4, 5); Hispanics are not mutually exclusive of whites and African Americans. Regional patterns of cancer mortality for African Americans and non-Hispanic whites were compared to previously published maps based on counties and state economic areas . Although maps of cancer mortality by congressional district were also prepared for Hispanics, regional patterns are difficult to interpret because of insufficient data to calculate rates for most parts of the country. When examined at the national level, broad patterns of mortality for African Americans and non-Hispanic whites by sex and region were consistent with those previously observed . Geographic variations in cancer death rates may reflect, in part, regional variations in risk factors such as smoking and obesity, early detection and screening, and access to and utilization of medical services.
Figure 1. All cancers combined death rates per 100,000 person-years by congressional district (age-adjusted 2000 US population), 1990–2001.
Figure 2. Lung cancer death rates per 100,000 person-years by congressional district (age-adjusted 2000 US population), 1990–2001.
Figure 3. Colorectal cancer death rates per 100,000 person-years by congressional district (age-adjusted 2000 US population), 1990–2001.
Figure 4. Prostate, female breast cancer death rates per 100,000 person-years by congressional district (age-adjusted 2000 US population), 1990–2001.
Figure 5. Cervical cancer death rates per 100,000 person-years by congressional district (age-adjusted 2000 US population), 1990–2001.
Figure 1 shows geographic patterns of death rates for all cancer sites combined by congressional district in the United States. In men, rates range from 186.3 in Utah congressional district #3 to 343.7 in District of Columbia (Table 1) and in women, from 123.4 in Utah congressional district #1 to 217.4 in Pennsylvania congressional district #2 (Table 2). Generally, the patterns for all cancer sites combined are strikingly similar to those for lung cancer (Figure 2), reflecting the importance of lung cancer as a cause of cancer death, and the strong association of lung and cancers of several other sites with tobacco smoking. Lung cancer death rates in all races combined range from 35.7 in Utah congressional district #1 to 130.3 in Kentucky congressional district #5 for men and from 14.8 in Utah congressional district #3 to 57.9 in Kentucky congressional district #5 for women. Lung cancer death rates are the highest in congressional districts in Appalachia and the south among non-Hispanic white men and in the Midwest and the south among African American men. In contrast, among women, rates are the highest in congressional districts in the Midwest among African Americans and in the west, Appalachia, and the coastal south among non-Hispanic whites. Historically, smoking was more common in the south among men and in the west among women, especially among whites . Although patterns of lung cancer mortality in the 1990's primarily reflect smoking patterns in the 1950's and 1960's, the burden of death from all cancers and lung cancer by congressional district can be used to illustrate the importance of tobacco control measures as well as to document local needs for cancer treatment and associated services.
Table 1. Age-adjusted death rates, all cancers combined, for US men by congressional district (CD), 1990–2001
Table 2. Age-adjusted death rates, all cancers combined, for US women by congressional district (CD), 1990–2001
Historically, female breast cancer death rates have been elevated in the Northeastern and North Central regions; North-South differences have diminished over time as female breast cancer death rates decreased in the Northeast but increased in the South . For all races combined, female breast cancer death rates vary from 20.6 in Hawaii to 39.4 in District of Columbia. Among African American women, breast cancer death rates are highest in congressional districts in the south, Midwest, and west coast, while among non-Hispanic whites, breast cancer mortality is highest in congressional districts in the Northeast and west coast (Figure 4, right panel). Patterns of breast cancer mortality partly reflect the influence of known risk factors as well as access to and utilization of cancer screening and treatment. Important cancer control measures include access to mammography for the uninsured and under-insured, and availability of Medicaid coverage for diagnosis and treatment.
Colorectal cancer death rates are highest overall in the Northeast and parts of the South and Midwest. Generally, death rates range from 18.4 in Texas congressional district #15 to 37.1 in Pennsylvania congressional district #1 for men and from 11.3 in Texas congressional district #15 to 24.1 in District of Columbia for women (Figure 3). Although a strong geographic pattern for colorectal cancer mortality has existed since the 1950's, the reasons are not well-understood . The current priority for colorectal cancer control is to increase the proportion of individuals over 50 who receive recommended screening tests. Illustrating colorectal cancer mortality by legislative district may be influential in encouraging legislative support for mandated insurance coverage of colorectal screening tests and for programs to provide testing for the uninsured and under-insured.
For all races combined, prostate cancer death rates range from 23.8 in Texas congressional district #15 and Hawaii to 58.2 in District of Columbia. Generally, rates are highest in congressional districts in the mid-Atlantic and Southern coastal areas, reflecting in large part the higher proportion of the African American men in the population of these areas (Figure 4, left panel). Death rates for African American men are more than twice the rates for non-Hispanic white men, reflecting higher incidence, later stage at diagnosis and poorer survival among African American men. Among non-Hispanic whites, rates are highest in congressional districts in the Rocky Mountain region; high rate (40.2) is observed in Hispanics in Texas congressional district #13. A recent study suggested that 10% to 30% of the geographic variation in prostate cancer death rates might relate to variations in access to medical care . Although cancer control measures for prostate cancer are less well-defined than measures for some other cancer sites, illustrating prostate cancer mortality by congressional district may be helpful in advocating for funding of research on the prevention, early detection and treatment of prostate cancer and highlighting the importance of access to medical care for African American men.
Mortality from cervical cancer in all races combined is highest in congressional districts in Appalachia, in the South and parts of the Southwest, with rates ranging from 1.4 in Minnesota congressional district #2 to 5.7 in New York congressional district #16 (Figure 5). Among African American women, rates are highest in congressional districts in the south and southeast, among non-Hispanic whites, rates are highest in congressional districts in Appalachia, and in Hispanics rates are highest in congressional districts in the coastal parts of California and Texas and in Colorado congressional district #3. Important cancer control measures include access to Pap tests for the uninsured and under-insured, and availability of Medicaid coverage for diagnosis and treatment.
The cancer mortality patterns by congressional district are generally similar to the patterns seen using other geographic boundaries. However, the patterns by congressional district may be useful to cancer control advocates to illustrate the importance of cancer control measures (prevention, early detection, and treatment) for their constituents. The method can be applied to state legislative districts and other analyses that involve data aggregation from different geographic units. Further research is needed to validate the estimates using mortality data geocoded to the lower geographic level such as block.
Death rates for U.S. states and counties
Mortality data were obtained from the National Center for Health Statistics (NCHS). We computed annual average age-adjusted death rates for all cancer sites combined, the four major cancers (lung and bronchus, prostate, female breast, and colorectal cancer) and cervical cancer from 1990–2001 for 50 states, District of Columbia, and all counties using SEER*Stat . Death rates, counts (number of deaths), and populations for counties were directly obtained for men and women, for all races combined, and for African Americans, non-Hispanic whites, and Hispanics. Except for the years of 1990 and 2000, the intercensal populations computed by the Census Bureau were used to obtain the total populations for the study time period. Since county designation for Alaska and Hawaii was not available from NCHS, death rates for Alaska and Hawaii reflect state rates. Rates were standardized to the 2000 U.S. population and expressed per 100,000 person-years.
Death rates for U.S. congressional districts
There are 436 (excluding Puerto Rico) federal congressional districts in the U.S. . Among these, eight congressional districts followed state boundaries or their equivalent (Alaska, District of Columbia, Delaware, Montana, North Dakota, South Dakota, Vermont, and Wyoming). Further, since county-specific mortality data were not provided for Hawaii in SEER*Stat, we assigned the state death rate to both congressional districts. For congressional districts whose boundaries did not follow state and county boundaries (n = 426), death rates were calculated by assigning county-level age-adjusted death rates to census block and then aggregating death rates over blocks by congressional district using GIS  and SAS . By doing so, we assume that blocks within a county have same death rates.
There are three major areal interpolation methods (area weighting, surface smoothing, and dasymetric technique) for generating estimates for target zones from data available for source zones when the two geographic units are not comparable. Areal weighting assumes that data are homogeneously distributed across geographic units, which is generally unrealistic; it also involves the direct superimposition of source zones and target zones , which often leads to a lot of geographic boundary-line discrepancies . Surface smoothing models data available for source zones as a continuous surface across the adjacent zones, assuming that the density declines with distance, taking into account the proximity of neighboring centroids [16,17]. Dasymetric technique uses ancillary information to refine uneven data distributions across geographic units. Land cover from remote sensing  and the street layer [15,19] have been used as subzone ancillary information. A recent study uses parish level (the lowest administrative unit) population data to derive weights . However, there is no universal rule to construct areal interpolation, and the best solution depends on various factors: the variables of interest, the spatial relationships between source zones and target zones, and the availability of ancillary information related to both.
In this study, we constructed a dasymetric method based on the hierarchical spatial relationships between blocks and counties and between blocks and congressional districts. Generally, congressional district and county share census block as a common basic spatial unit (Table 3) [21,22]. We used block level sex- and race- specific population to devise a dasymetric approach that assigns county-level measures such as cancer death rates to census block and then aggregates census blocks at the congressional district level, using block population as a weighting factor. We did not use area weighting because of its unrealistic homogeneity assumption and boundary-line discrepancies associated with direct superimposition of two incomparable geographic units. Surface smoothing gives reliable estimates when smoothness is the real property of the density. However, the occurrence of cancer rarely follows a smooth distance-decay surface because major risk factors that affect cancer occurrence do not have smooth paths from the centroid to its adjacent neighboring centroids.
Table 3. The hierarchical spatial relationships between blocks and counties and between blocks and congressional districts
To make the calculations, the following steps were taken:
1. The number of people living within each census block by sex and race was determined from the 2000 U.S. census (covering 42 states, 426 congressional districts). Therefore, block population is sex- and race- specific.
2. Block population was spatially assigned to congressional districts by block centroids.
3. The age-adjusted cancer death rates for counties by sex and race were assigned to block by county FIPS (Federal Information Processing Standards) codes; FIPS codes are a standardized set of numeric or alphabetic codes issued by the National Institute of Standards and Technology (NIST) to ensure uniform identification of geographic entities through all federal government agencies .
4. Cancer death rate for each congressional district by sex and race was calculated by aggregating sex- and race- specific cancer death rates over blocks. Taking non-Hispanic white men as an example, suppose that ri was the age-adjusted cancer death rate for block i (obtained from the corresponding county rate calculated from SEER*Stat). Suppose that aij was the population of block i within district j, and that the population for district j, , were known. Then the aggregated cancer death rate for district j, pj, was the summation of ri, weighted by the proportion of block population within the district,. Other sex- and race-specific cancer death rates were calculated similarly.
5. The number of cancer deaths for each congressional district by sex and race was calculated by aggregating the sex- and race- specific number of cancer deaths over blocks. The number of cancer deaths for a block was the product of crude death rate for the block (inherited from the corresponding county, which is the number of deaths for the county divided by the county population) and the block population. Again, taking non-Hispanic white men as an example, suppose that ni and ci were the number of deaths and the population for the county to which block i belongs, the crude death rate for block i was . Given aij was the population of block i within district j, then the number of deaths for block i within district j was aij, and the aggregated number of deaths for district j was . Other sex- and race- specific number of cancer deaths were calculated in a similar way.
6. The aggregated cancer death rates and the number of cancer deaths for the congressional districts (n = 426) from step 4 & 5 were exported back to GIS and linked with the other ten congressional districts (Alaska, District of Columbia, Delaware, Montana, North Dakota, South Dakota, Vermont, Wyoming, and two Hawaii districts) for producing maps. The estimates of the number of deaths were not presented separately. Instead, they were used as the criteria when mapping death rates across congressional districts. Death rates based on the small number of deaths (< 20) for the study time period were considered not reliable and thus excluded.
7. Maps were generated using ArcGIS . For all cancer sites combined and for each cancer site, the maps for all races combined were created by categorizing the rates into five groups. Cut points for the lowest and highest groups are approximately the 10th and 90th percentiles, except for cervical cancer which are 20th and 80th percentiles. Intervening groups are set at equal length between the lower bound cut point of 90th or 80th and the upper bound of 10th or 20th. Thus each interval represents the same absolute change over the middle range of rates, while the most extreme rates fall into the first and fifth categories. For each cancer site, to allow comparison among ethnic subgroups, the cut points for all races combined are used for race specific maps if rates are in the same range as those for all races combined. When the race specific rates fall out of the range of rates for all races combined, cut points for the exceeded portion are equally set at the length of rates in the highest category for all races combined. Cancer death rates based on the small number of deaths (< 20) are considered unstable and congressional districts with such rates are marked with hatches.
In describing the cancer burden by congressional district, we used direct age adjustment instead of indirect age adjustment because direct method is more statistically correct when the rates are being compared . Direct age-adjusted death rates describe the cancer death rate each congressional district would have if it had the age-sex-race distribution of the U.S. in the year 2000. In so far as congressional districts have age-sex-race compositions different from the U.S. in 2000, the need for resources to eliminate disparities between districts might be more or less than that suggested by the results described in this paper.
The author(s) declare that they have no competing interests.
YH, EMW, and AJ conceived the analysis and wrote the final version of the manuscript. LWP provided technical support on the method and critically revised the manuscript. MJT conceptualized and critically revised the manuscript.
The views and opinions expressed in this article do not necessarily reflect those of the National Cancer Institute.
We gratefully acknowledge Dr. Lance A Waller from Rollins School of Public Health at Emory University for his comments and suggestions on the early version of the manuscript.
Devesa SS, Grauman DJ, Bolt WJ, Pennello GA, Hoover RN, Fraumeni JFJ: Atlas of cancer mortality in the United States, 1950-94. Bethesda, MD: National Institutes of Health, National Cancer Institute , NIH Publication No. 99-4564; 1999.
Freeman HP, Wingrove BK: Excess Cervical Cancer Mortality: A Marker for Low Access to Health Care in Poor Communities. Rockville, MD: National Cancer Institute, Center to Reduce Cancer Health Disparities , NIH Pub No. 05-5282; 2005.
J Public Health Policy 1994, 15(1):26-36. PubMed Abstract
Am J Public Health 1997, 87(7):1126-1130. PubMed Abstract
Am J Public Health 1992, 82(1):94-96. PubMed Abstract
National Cancer Institute Cancer Statistics Branch DCCPS Surveillance Research Program: Surveillance, Epidemiology, and End Results (SEER) Program (www.seer.cancer.gov) SEER*Stat Database: Incidence - SEER 9 Regs Public-Use, Nov 2003 Sub (1973-2001). Bethesda, MD; 2004.
US Census Bureau: 108th Congressional District Summary Files, United States Census 2000. [http://www.census.gov/Press-Release/www/2003/108th.html] webcite
Annals of Regional Science 1992, 26:67-78. Publisher Full Text
Environment and Planning A 2005, 37(1):127-139. Publisher Full Text
International Journal of Geographical Information Systems 1996, 10(8):973-989. Publisher Full Text
Computers Environment and Urban Systems 1995, 19(4):287-306. Publisher Full Text
US Census Bureau: TIGER®, TIGER/Line® and TIGER-Related Products. [http://www.census.gov/geo/www/tiger/tgrcd108/spblk108.txt] webcite
Stat Med 1995, 14(5-7):615-627. PubMed Abstract