Readme file
Series A: Statistics in Society
Effects of neighbourhood demographic shifts on findings of environmental
injustice: a New York City case-study by M. Talih and R. D. Fricker,
Jr, Journal of the Royal Statistical Society, Series A,
Statistics in Society, Volume 165 (2002), Part 2, pages 375 - 397.
Included are 2 tab-delimited data files:
- Dataset1.tab (446.7 KB) : Tract-level Census data, manufacturing zones
& geographic info.
- Dataset2.tab (5.6 KB) : Toxics Release Inventory (TRI) site location
info.
Questions about the datasets should be addressed to makram.talih@yale.edu
--
Dataset1.tab
- Tab delimited text file with select variables from the 1970, 1980,
and 1990 Census.
- Also included are geographic coordinates and manufacturing information
at the tract level.
- Coordinates are expressed in kilometres from a fixed origin for the
North American Datum (NAD) 83, New York State/Long Island plane.
- Manufacturing information is based on zoning maps provided by the
NYC Department of City Planning.
First line of the file contains the variable (column) names.
First column contains the identifier for each row. This is just the
tract's unique identifier.
Number of columns (excluding row names): 31
Number of rows (excluding header row): 1462
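As an illustration of the layout just described, the following minimal Python
sketch reads such a file using only the standard library. The function name
read_tab_file is hypothetical. The handling of a header that is one field
shorter than the data rows is an assumption, since this readme does not say
whether the identifier column is named in the header line.

```python
import csv


def read_tab_file(handle):
    """Read a tab-delimited table whose first column holds row identifiers.

    Returns (names, rows): the variable names, and a dict mapping each row
    identifier to a {variable name: string value} dict.
    """
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)
    records = [record for record in reader if record]
    if not records:
        return header, {}
    # Assumption: the header either names every column (identifier included)
    # or omits the identifier column and is one field shorter than the rows.
    if len(header) == len(records[0]) - 1:
        names = header
    else:
        names = header[1:]
    rows = {record[0]: dict(zip(names, record[1:])) for record in records}
    return names, rows
```

If Dataset1.tab matches the description above, calling read_tab_file on
open("Dataset1.tab", newline="") should give 31 names and 1462 rows. Values
are returned as strings and still need converting to numbers.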
Variable definitions:
Tract Census 1990 tract number.
First 5 digits correspond to the county FIPS code.
If the sequence of remaining digits is of length 6, then a "decimal"
point is implied before the last 2 digits:
the core tract number has at most 4 digits.
In Talih & Fricker (2002), we used the 1990 tract geography in order
to obtain the maps of the clusters. Hence, using the tract comparability
files, we disaggregated the data from each combined 70-80-90 tract to
its component 1990 tracts.
Adj.Area Area (in the 1990 Census) of tract, with parks and cemeteries
excluded. Tracts with zero adjusted area consist only of parks and/or
cemeteries.
1436 tracts have non-zero adjusted area.
Area is expressed in square kilometres.
Easting Kilometres East from a fixed origin for the New York State/Long
Island plane.
Northing Kilometres North from a fixed origin for the New York State/Long
Island plane.
The tract centroid is the block centroid within the tract that minimizes
the maximum distance to the other block centroids within the same
tract.
Tracts with zero adjusted area have not been given coordinate values.
M1/M2/M3 Type of Manufacturing within the tract, as determined from
the NYC-DCP zoning maps.
M1=light, M2=medium, M3=heavy. Some tracts have a mixture of manufacturing
activities.
A zero entry in all three columns indicates a non-manufacturing tract.
md.rnt70 1970: Median Rent
md.rnt80 1980: Median Rent (in 1970 Dollars)
md.rnt90 1990: Median Rent (in 1970 Dollars)
md.val70 1970: Median Value of Owner-Occupied Units
mn.val80 1980: Mean Housing Unit Value (in 1970 Dollars)
md.val90 1990: Median Housing Unit Value (in 1970 Dollars)
med.hi69 1969: Median Household Income
med.hi79 1979: Median Household Income (in 1970 Dollars)
med.hi89 1989: Median Household Income (in 1970 Dollars)
hisp.pr70 1970: % Hispanic
wh.pr70 1970: % Not Hispanic: White
bk.pr70 1970: % Not Hispanic: Black
HISP.PR80 1980: % Hispanic
WH.PR80 1980: % Not Hispanic: White
BK.PR80 1980: % Not Hispanic: Black
HISP.PR90 1990: % Hispanic
WH.PR90 1990: % Not Hispanic: White
BK.PR90 1990: % Not Hispanic: Black
total70 1970: Total Population
TOTAL80 1980: Total Population
TOTAL90 1990: Total Population
density70 1970: Population Density per sq. km.
DENSITY80 1980: Population Density per sq. km.
DENSITY90 1990: Population Density per sq. km.
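
The Tract identifier rule given at the top of this list (5-digit county FIPS
code, then a core number with an implied decimal when 6 digits remain) can be
sketched in Python as follows. The function names are hypothetical, and
treating any remainder that is not 6 digits long as a plain core number is an
assumption. The identifiers in the usage note are illustrative only: they are
built from the New York state code 36 and county and tract numbers listed
under TRACT COMPARABILITY, not copied from the data file.

```python
def parse_tract_id(tract_id):
    """Split a tract identifier into (county FIPS, core number, suffix)."""
    digits = str(tract_id).strip()
    if not digits.isdigit() or len(digits) <= 5:
        raise ValueError(f"unexpected tract identifier: {tract_id!r}")
    county, rest = digits[:5], digits[5:]
    if len(rest) == 6:
        # Implied decimal: 4-digit core number followed by a 2-digit suffix.
        core, suffix = rest[:4], rest[4:]
    else:
        # Assumption: any other length carries no suffix.
        core, suffix = rest, ""
    return county, int(core), suffix


def tract_label(tract_id):
    """Format the tract number with its decimal suffix, if it has one."""
    _, core, suffix = parse_tract_id(tract_id)
    return f"{core}.{suffix}" if suffix else str(core)
```

For example, parse_tract_id("36047045597") returns ("36047", 455, "97") and
tract_label("36047045597") returns "455.97".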
--
Note on conversions to a fixed (1990 Census) tract geography:
Essentially, the data for each combined 70-80-90 tract are split among
its component 1990 tracts. This affects neither the income measures nor
the percentages, because the demographics over the combined tract are
assumed "uniform". Counts for a component tract are therefore proportional
to those of the combined tract, with proportionality constant equal to
the ratio of the area of the component tract to the area of the combined
tract. As a result, total population counts are occasionally non-integer.
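
The area-proportional split described in this note can be sketched in Python
as follows. The function name split_by_area and the numbers in the usage note
are made up for illustration. The sketch also assumes that the area of the
combined tract equals the sum of the areas of its component tracts, which
this readme does not state explicitly.

```python
def split_by_area(combined_count, component_areas):
    """Allocate a combined-tract count to component tracts by area share.

    component_areas maps each component tract to its area; the combined
    area is assumed to be the sum of the component areas.
    """
    combined_area = sum(component_areas.values())
    if combined_area <= 0:
        raise ValueError("combined tract has no area to allocate over")
    return {
        tract: combined_count * area / combined_area
        for tract, area in component_areas.items()
    }
```

With a combined population of 1000 and component areas of 1.0 and 3.0 square
kilometres, the components receive 250.0 and 750.0 people. Less round inputs
produce the non-integer counts mentioned above.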
--
Notes on Combined 70-80-90 Census Files [07/07/2000]
Prepared by Jennifer Pace -- RAND
RACE & INCOME VARIABLES
1990 CENSUS
Downloaded from www.census.gov
Census of Population and Housing 1990, STF 3A
Racial Splits that were already given:
Non-Hispanic: White, Black, Asian, American Indian, Other Race
Hispanic: White, Black, Asian, American Indian, Other Race
Percentages were calculated as the raw number divided by the sum of all
categories.
Median Household Income, Median Home Value, and Median Rent were also
given. However, these numbers were not used in the combined file (1970-1990),
because they changed when tracts were combined.
See Section on COMPUTING MEDIANS.
1980 CENSUS
RAND data facility CF-273
Census of Population and Housing 1980, STF 3A
Breakdowns Given ->
Total: White
Black
American Indian -------|
Eskimo |
Aleut |
Japanese |
Chinese |
Filipino | Combined to get
Korean | Total Asian/American Indian
Asian Indian |
Vietnamese |
Hawaiian |
Guamanian |
Samoan |
Other Asian -------|
Other Race, Hispanic
Other Race, Not Hispanic
Hispanic White, Hispanic Black, Hispanic Asian/American Indian, Hispanic
Other Race
Subtracted these Hispanic numbers from the Total numbers to get Non-Hispanic
numbers
Percentages calculated as the raw numbers divided by the sum of all
categories.
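
A minimal Python sketch of the two steps just described (subtracting Hispanic
counts from race totals, then dividing by the sum of all categories). The
function name, the category labels and the counts in the usage note are
hypothetical, and reporting Hispanic as one combined category is an
assumption based on the variables in Dataset1.tab.

```python
def race_percentages(total_by_race, hispanic_by_race):
    """Return percentage shares for Hispanic and each non-Hispanic race."""
    categories = {"Hispanic": sum(hispanic_by_race.values())}
    for race, total in total_by_race.items():
        # Non-Hispanic count = race total minus Hispanic members of that race.
        categories[f"Not Hispanic: {race}"] = total - hispanic_by_race.get(race, 0)
    denominator = sum(categories.values())
    if denominator == 0:
        return {name: None for name in categories}
    return {
        name: 100.0 * count / denominator
        for name, count in categories.items()
    }
```

For totals of 600 White, 300 Black and 100 Other Race, of whom 100, 50 and
50 are Hispanic, this gives 20% Hispanic, 50% Not Hispanic: White and
25% Not Hispanic: Black.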
Median Household Income and Median Rent were also given.
However, these numbers were not used in the combined file (1970-1990),
because they changed when tracts were combined. See Section on COMPUTING
MEDIANS.
1970 CENSUS
RAND data facility CF-036
1970 Census of Population and Housing, 4th Count A
1970 Census of Population and Housing, 5th Count
From 5th Count:
Breakdowns Given ->
Total: White
Black
Indian ----|
Japanese |
Chinese | Combined to get Other Race
Filipino |
Other Race ----|
Income given in two groups: Family Income and Unrelated Individuals
Income. I combined these two categories to get Household Income. Rent
and Value categories were also given. See COMPUTING MEDIANS.
From 4th Count A:
Up to three rows per tract. 1= Total, 2= White, 3= Black.
Summed over the age categories to get population for each row.
Then looked at the Spanish indicator* to subtract out the
Hispanic population in each row. This gave me Hispanic White,
Hispanic Black, and Total Hispanic.
Hispanic Other Race = Hisp Tot - (Hisp Wh + Hisp Black)
These Hispanic Numbers were then merged onto the 5th Count data. Non-Hispanic
Numbers were computed from the 5th Count race variables, subtracting
out the Hispanic numbers.
* Spanish indicator refers to people of Puerto Rican birth or parentage
or Spanish speaking: took the maximum of these two categories.
ADJUSTING FOR INFLATION:
All Dollar Values have been converted to 1970 Dollars.
- 1990 values were divided by 3.366
- 1980 values were divided by 1.993
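
The conversion above amounts to a single division, sketched here in Python.
The names DEFLATORS and to_1970_dollars are hypothetical, and the factor of
1.0 for 1970 is an assumption implied by the target year rather than stated
in these notes.

```python
# Divisors taken from the notes above; the 1970 entry is an assumed identity.
DEFLATORS = {1970: 1.0, 1980: 1.993, 1990: 3.366}


def to_1970_dollars(value, census_year):
    """Convert a dollar value from the given census year to 1970 dollars."""
    return value / DEFLATORS[census_year]
```

For example, a 1990 value of $3,366 corresponds to about $1,000 in 1970
dollars.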
COMPUTING MEDIANS
Because aggregate values were not given, means could not be computed
in most cases. Medians were calculated in the following way:
After tracts were combined -> look at the count of people in each
category. For example, in 1970 there are 9 rent categories:
Less than $40
$ 40 - $ 59
$ 60 - $ 79
$ 80 - $ 99
$100 - $149
$150 - $199
$200 - $249
$250 - $299
$300 or more
If the middle person is in category 6 ($150 - $199), the median rent
is recorded as $175. For the "Less than" categories, the value
is half of the upper bound ($20 in this case). For the "or more"
categories, the value is the lower bound ($300 in this case). Otherwise
the value is halfway between the lower and upper bounds of the category.
This process of calculating medians may be altered later.
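
The procedure above can be sketched in Python as follows. This is an
illustration under stated assumptions, not the code used to build the files:
the names are hypothetical, the counts in the usage note are made up, the
"middle person" is taken to be the point where the cumulative count first
reaches half of the total, and each upper bound is written as the next
category's lower bound (so $150 - $199 becomes 150 to 200) in order to
reproduce the $175 quoted above.

```python
# 1970 rent categories as (lower, upper); None marks an open-ended side.
RENT_BINS_1970 = [
    (None, 40), (40, 60), (60, 80), (80, 100), (100, 150),
    (150, 200), (200, 250), (250, 300), (300, None),
]


def binned_median(bins, counts):
    """Return the representative value of the category holding the median."""
    total = sum(counts)
    if total == 0:
        return None
    cumulative = 0
    for (lower, upper), count in zip(bins, counts):
        cumulative += count
        # Assumption: the middle person is reached at half of the total.
        if cumulative >= total / 2:
            if lower is None:
                return upper / 2        # "Less than": half the upper bound
            if upper is None:
                return float(lower)     # "or more": the lower bound
            return (lower + upper) / 2  # otherwise: midpoint of the bounds
    return None
```

With counts of [0, 0, 0, 0, 10, 30, 5, 0, 0], the middle person falls in
category 6 and binned_median(RENT_BINS_1970, counts) returns 175.0.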
Note: For Value of Housing Unit the median is given for 1970 and 1990,
but the MEAN is given for 1980 (median could not be calculated)
TRACT COMPARABILITY:
Note: Deleted all tracts with the suffix "99" - these are
boat-houses.
Linking several years of data and tract changes...
Source- 1) Census of Population and Housing, 1980: 1980-1970 Tract
Comparability File (for 1970 -> 1980 changes)
2) 1980 -> 1990 All non-matching tracts appear to be splits.
These are obvious changes.
County 70-80-90 # 1970 Tract 1980 Tract 1990 Tract
-----------------------------------------------------------------------------------
047 3 3.01 3.01, 3.02
047 455 455 455.97, 455.98
047 491 491, 493 491, 493 491, 493
047 546 546 546.98
047 579 579, 589 579, 589 579, 589
047 598 598, 626 598, 626 598, 626
047 600 600 600.97, 600.98
047 606 606 606.97
047 610 610.01, 610.02 610.01, 610.02 610.01, 610.97
047 616 616 616.97, 616.98
047 628 628 628.98
047 666 666 666.98
047 758 758 758.98
047 882 882, 884 882, 884 882, 884
047 910 910, 1132 910, 1132 910, 1132
047 916 916, 918 916, 918 916, 918
047 1040 1040, 1070 1070 1070
047 1190 1190 1190.97
047 1202 1202 1202.97, 1202.98
081 248 248, 250 248, 250 248, 250
081 456 456 456.98
081 641 641.01
081 664 664 664.98
081 716 716 716.98
081 769 769.01, 769.02 769.01, 769.02 769.02, 769.97, 769.98
081 773 773 773.97, 773.98
081 803.01 803.01, 837 803.01, 837 803.01, 837
081 964 964, 972, 992 964, 972, 992 964, 972, 992
081 1072 1072.01 1072.01, 1072.02 1072.01, 1072.02
081 1113 1113, 1123 1113, 1123 1113, 1123
081 1267 1267 1267.98
--
Dataset2.tab
- Tab delimited text file with coordinates of TRI site locations,
geocoded from most recent street address and TIGER/Line files (Topologically
Integrated Geographic Encoding and Referencing), which are provided
by the US Census Bureau.
- Coordinates are expressed in kilometres from a fixed origin for the
North American Datum (NAD) 83, New York State/Long Island plane.
- The TRI database is available through the US Environmental Protection
Agency data repository.
First line of the file contains the variable (column) names.
First column contains the identifier for each row. This is just the
facility's unique identifier.
Number of columns (excluding row names): 2
Number of rows (excluding header row): 150
Variable definitions:
Easting Kilometres East from a fixed origin for the New York State/Long
Island plane.
Northing Kilometres North from a fixed origin for the New York State/Long
Island plane.
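
Because both files express coordinates in kilometres on the same plane,
straight-line distances between tract centroids and TRI sites can be computed
directly. The Python sketch below is one possible use of the data and is not
claimed to be the distance measure used in Talih & Fricker (2002). The
function name is hypothetical, and representing a tract without coordinates
as None is an assumption.

```python
import math


def nearest_site_distance(tract_xy, sites_xy):
    """Distance in km from a tract centroid to the closest TRI site.

    tract_xy is an (easting, northing) pair, or None for tracts with zero
    adjusted area, which have no coordinates in Dataset1.tab.
    """
    if tract_xy is None or not sites_xy:
        return None
    easting, northing = tract_xy
    return min(
        math.hypot(easting - site_e, northing - site_n)
        for site_e, site_n in sites_xy
    )
```

For a centroid at (0, 0) and sites at (3, 4) and (6, 8), the function
returns 5.0.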
Dataset archive: Fricker.zip (194 KB)