You can view Eastwide maps generated from these data and an example study using the data at:
FIA Units obtain mill information from the various States to support periodic estimates of wood processing activity. The NE and SO regions and Texas provided us with information on mill type, name, and address. We used a geocoding service provided by ETAK, Inc. (now Tele Atlas North America) to estimate the latitude and longitude of each mill and, from these, to identify the county and the Census tract and block identifiers corresponding to that location. Note that many if not most addresses referred to Post Office Box numbers. If the geocoding service could not identify an actual street address, it provided a location based on the ZIP code. Therefore, a single point in the coverage can be shared by multiple mills. Put another way, the coverage frequently contains "stacks" of mills with the same location. Data from the North Central Station were obtained more recently and were handled somewhat differently, as noted below in the "Lon, Lat" section.
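To gauge how often mills share a location, the "stacks" can be found by grouping records on their coordinates. The following is a minimal sketch, not part of the distributed data set: it assumes the mill records have already been read into Python dictionaries, and the keys `mill_id`, `lon`, and `lat` as well as the sample values are hypothetical names chosen for illustration.

```python
from collections import defaultdict


def find_stacks(mills, ndigits=6):
    """Return {(lon, lat): [mill ids]} for locations shared by two or more mills.

    Coordinates are rounded to `ndigits` decimals so that tiny floating-point
    differences do not split a stack. Key names are assumptions.
    """
    by_location = defaultdict(list)
    for mill in mills:
        lon, lat = mill.get("lon"), mill.get("lat")
        if lon is None or lat is None:
            continue  # mills without coordinates cannot belong to a stack
        key = (round(lon, ndigits), round(lat, ndigits))
        by_location[key].append(mill["mill_id"])
    return {loc: ids for loc, ids in by_location.items() if len(ids) > 1}


# Hypothetical records, invented for illustration only.
sample = [
    {"mill_id": 1, "lon": -83.5, "lat": 35.1},
    {"mill_id": 2, "lon": -83.5, "lat": 35.1},  # same point as mill 1
    {"mill_id": 3, "lon": -90.2, "lat": 32.3},
    {"mill_id": 4, "lon": None, "lat": None},   # no location assigned
]
stacks = find_stacks(sample)
print(stacks)
```

A large stack usually signals a ZIP-code-level location rather than a true cluster of mills, so it may be worth inspecting before any distance-based analysis.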
The mill locations are provided in several formats, all unprojected ("geographic projection"): ArcInfo (uncompressed) Export and ArcView Shape files. Each supplier's data are provided as a separate coverage as well as in a single coverage consisting of mills from all four sources.
So what if I don't have ArcView or ArcInfo? Most GIS programs have the ability to import these formats, but GIS programs are expensive. The publisher of ArcView and ArcInfo offers a free downloadable program called ArcExplorer for viewing and analyzing files with these formats.
RECORDS and VARIABLES: The ArcInfo coverage (eastmill.e00) contains minimal information on each mill, represented as points. More detailed information on each mill is available in the dBase IV file "eastmill.dbf." Each record in a coverage corresponds to one mill. Record numbers created for each mill ("mill-id") uniquely identify the mill and match the mill-id variable in the dBase file. Records also contain an identifier for the source that provided them. The included location data represent information on 8,423 mills from the geographic area defined by the North Central, Northeastern, and Southern Research Stations of the U.S. Forest Service, plus all of West Texas. Data reporting quality varies by state, so state-by-state comparisons should be approached with caution. In particular, many mills in Minnesota had no address information and could not be assigned a location, although these mills are still listed in the dBase table. Errors can arise from several sources, including errors in mailing addresses and in geocoding. While we've tried to minimize our own procedural errors and correct a few of the more obvious errors from other sources, errors doubtless remain in some of the records, over and above the imprecision inherent in geocoding in general and in accepting Post Office Box addresses in particular.
ArcView shapefiles begin with several GIS-standard variables describing shape, perimeter, and internal record ID, but these are of limited use in this case. More generally useful are the remaining variables in the shapefiles, and the corresponding fields in the dBase table. Each mill has one row of data, and each row in the dBase file contains fifteen variables (text and numerical). The tabular data can be joined to any of the coverages using the mill-id. Indeed, the shapefiles were generated by joining the dBase file to the ArcInfo point coverage and then subsetting the resulting theme based on region_id (see below). In order, the variables in the table and shapefiles are:
Mill-id Assigned by J. Prestemon, this identifies a unique mill with a number that will never be repeated.
Lon, Lat This is our best estimate of the mill longitude and latitude, respectively. At worst, no coordinates are provided. This was the case for many mills with no address data. Nearly all of the non-addressed mills, however, were located to the county level (hence have FIPS codes--see below). The best estimates of longitude and latitude were GPS coordinates from Missouri, where state officials have reportedly provided the North Central Research Station with GPS coordinates. The next best coordinates, with one exception, were obtained by submitting mill address information to Etak, Inc. (now Tele Atlas N.A.), in January - May, 1999. The quality of the latitude and longitude data when evaluated by Etak, Inc. can be judged by MAT_TYPE, explained below. Finally, the exception to the Etak results was for the North Central Station’s mills. In most cases in that Station, their FIA Unit provided latitude and longitude coordinates to J. Prestemon and J. Pye. These coordinates were those of the town nearest the mill. In cases where Etak match codes were higher than 3, the North Central Station's estimate of the mill latitude and longitude was taken as the best estimate of the mill's location. These are the locations provided in the accompanying data set.
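The order of preference described for Lon and Lat can be written as a small decision function. This is only a sketch of the rule as stated here, not the procedure actually used to build the data set. The function and parameter names are hypothetical, and using the Station's coordinates when no Etak coordinates exist at all is an assumption added for completeness.

```python
def best_coordinates(gps=None, etak=None, mat_type=None, station=None):
    """Pick the preferred (lon, lat) pair for one mill, or None if there is none.

    gps      -- GPS coordinates (reported for Missouri mills), if any
    etak     -- coordinates returned by the Etak geocoder, if any
    mat_type -- Etak match code (MAT_TYPE) for the geocoded address
    station  -- nearest-town coordinates from the North Central FIA Unit, if any
    All names are illustrative assumptions.
    """
    if gps is not None:
        return gps  # GPS readings are treated as the best estimate
    # Match codes higher than 3 are ZIP-level or ambiguous matches.
    # Assumption: a missing Etak result is treated the same way.
    etak_is_coarse = etak is None or mat_type is None or mat_type > 3
    if etak_is_coarse and station is not None:
        return station  # fall back on the Station's town coordinates
    return etak  # may be None when nothing is available


# Hypothetical coordinates, invented for illustration only.
print(best_coordinates(etak=(-93.0, 45.0), mat_type=4, station=(-93.2, 45.1)))
print(best_coordinates(etak=(-93.0, 45.0), mat_type=2, station=(-93.2, 45.1)))
```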
Region_ID Identifiers of who supplied the mill address data. Note that some mills identified as originating from one region were actually located in another region. This occurred where the physical locations of mills were different from the company headquarters of the mill. Regions and their included states were as follows:
Name Name of the forest products business.
Type_New Kind of mill, recategorizations based on identifiers provided by the four regions. Six types were identified:
Town The town corresponding to the mailing address of the physical location of the mill.
State The state corresponding to the mailing address of the physical location of the mill.
MAT_ZIP The ZIP code corresponding to the mailing address of the physical location of the mill.
MAT_TYPE Codes include:
| Code | Match type | Description |
|------|------------|-------------|
| 0 | Non-match | Address could not be matched to any database. |
| 1 | Block Face | Street segment exact address match. |
| 2 | Near match | Address matched to a single street segment but the exact address number was not found. |
| 3 | Zip+2 Centroid | A point representing the aggregate of all geocoded ZIP+4's for the ZIP+2 of the input address. |
| 4 | 5-Digit ZIP Centroid | A point representing the aggregate of all geocoded ZIP+4's for the 5-digit ZIP of the input address. |
| 5 | 3-Digit ZIP (SCF) Centroid | A point representing the aggregate of all geocoded ZIP+4's for the 3-digit ZIP of the input address. |
| 6 | Ambiguous match | Address matched to more than one street segment; a centroid of all segments is returned. |
Note: The ZIP+2, 5-digit ZIP, and 3-digit ZIP centroids are the geographic centers of the ZIP codes that share the first seven digits (the five-digit ZIP code plus the first two digits of the four-digit extension), the first five digits, or the first three digits, respectively.
Mills with match code 0 (non-match) have no latitude or longitude, except for some cases in the North Central Station states. Match code 1 (exact match) is likely to yield precise coordinates, within a city block or so of the exact location. Successively higher codes indicate less and less precise geographic coordinates. Only 19 mills had matches as poor as code 5, and only 6 had match code 6; even for these, Etak produced coordinates.
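For analyses that depend on positional accuracy, the MAT_TYPE codes can be used to label or filter records. The sketch below is illustrative only: the lookup table copies the code names listed above, while the dictionary key `mat_type`, the function names, and the choice to treat codes 1 and 2 as "street level" are assumptions.

```python
# Match-type names copied from the MAT_TYPE code list.
MAT_TYPE_LABELS = {
    0: "Non-match",
    1: "Block Face",
    2: "Near match",
    3: "Zip+2 Centroid",
    4: "5-Digit ZIP Centroid",
    5: "3-Digit ZIP (SCF) Centroid",
    6: "Ambiguous match",
}

# Assumption: codes 1 and 2 both place the mill on a specific street segment.
STREET_LEVEL_CODES = {1, 2}


def describe_match(code):
    """Return the match-type name for a MAT_TYPE code."""
    return MAT_TYPE_LABELS.get(code, "Unknown code")


def street_level_only(mills):
    """Keep only mills whose address matched a single street segment."""
    return [m for m in mills if m.get("mat_type") in STREET_LEVEL_CODES]


# Hypothetical records, invented for illustration only.
sample = [
    {"mill_id": 1, "mat_type": 1},
    {"mill_id": 2, "mat_type": 4},
    {"mill_id": 3, "mat_type": 2},
    {"mill_id": 4, "mat_type": 0},
]
print([m["mill_id"] for m in street_level_only(sample)])
print(describe_match(4))
```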
FIPS_Code The format of this code is SSCCC, where SS refers to the state FIPS code and CCC refers to the county FIPS code.
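Splitting a FIPS_Code into its state and county parts follows directly from the SSCCC layout. The sketch below assumes the code may arrive either as text or as an integer; if a spreadsheet or database stored it as a number, the leading zero of state codes below 10 is lost and has to be restored. The function name is hypothetical.

```python
def split_fips(code):
    """Split an SSCCC FIPS code into (state, county) strings.

    Accepts text or an integer; pads to five digits so that codes whose
    leading zero was dropped in numeric storage are still parsed correctly.
    """
    text = str(code).strip().zfill(5)
    if len(text) != 5 or not text.isdigit():
        raise ValueError(f"not a five-digit FIPS code: {code!r}")
    return text[:2], text[2:]


print(split_fips("37183"))  # state 37, county 183
print(split_fips(1001))     # stored as a number; padded back to "01001"
```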
CEN_TRACT This is the census tract code provided by Etak, Inc., when the mill address was geocoded. The code begins with a two-character state abbreviation and is followed by:
Street The mailing street or P.O. Box or rural route of the physical location of the mill.
Precise_Type The reporting agency’s classifier of the mill. Most of the codes are fairly self-explanatory; for exact definitions, contact the Station mill data managers or the Texas Forest Service.
FIA Survey Year The year that the mill address information was recorded. It varies by state.
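The join on mill-id and the subsetting by Region_ID described above can be reproduced outside a GIS. The following is a minimal sketch in plain Python, assuming the point records and the attribute table have already been read into lists of dictionaries. The keys `mill_id` and `region_id`, the function names, and the sample values (including the region identifiers "A" and "B") are hypothetical and do not reflect the actual contents of the files.

```python
def join_on_mill_id(points, attributes):
    """Attach attribute-table fields to each point record sharing its mill id."""
    attrs_by_id = {row["mill_id"]: row for row in attributes}
    joined = []
    for point in points:
        merged = dict(point)  # copy so the input records are not modified
        merged.update(attrs_by_id.get(point["mill_id"], {}))
        joined.append(merged)
    return joined


def subset_by_region(rows, region_id):
    """Keep only the rows supplied by one region."""
    return [row for row in rows if row.get("region_id") == region_id]


# Hypothetical data, invented for illustration only.
points = [
    {"mill_id": 1, "lon": -83.5, "lat": 35.1},
    {"mill_id": 2, "lon": -90.2, "lat": 32.3},
]
attributes = [
    {"mill_id": 1, "name": "Example Sawmill", "region_id": "A"},
    {"mill_id": 2, "name": "Sample Pulp Co.", "region_id": "B"},
]
joined = join_on_mill_id(points, attributes)
print(subset_by_region(joined, "A"))
```

Mills listed in the table without a point location simply do not appear in the joined result, which mirrors how unlocated mills are present in the dBase file but absent from the coverages.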
Entire Eastwide coverage:
Regional shapefiles with attributes, in Zip format:
- ArcInfo point coverage zipped 151 KB or unzipped 1.3 MB - just the point locations of all mills, in ArcInfo 7 uncompressed export format
- dBase table zipped 361 KB or unzipped 2.5 MB - containing mill names and other attributes
- Excel workbook uncompressed 2,193 KB - containing mill names and other attributes
If you only need information on Texas mills, try searching the Directory of Forest Products Industries of Texas.
created by: John M. Pye