B. Kochtubajda, C. Humphrey and M. Johnson 1
University of Alberta and Alberta Research Council 1
Edmonton, Alberta, Canada
paper presented at the 21st Annual Conference of the International Association for Social Science Information Service and Technology IASSIST 95 May 9-12, 1995, Quebec City.
Abstract 1. Introduction 2. Objectives 3. Approach and Work Plan 3.1 The Data Archive 3.2 Data retrieval assumptions 3.2.1 Aircraft data 3.2.2 Polarization radar data 3.2.3 Surface hailfall and rainfall data 3.2.4 Upper air data (LIMEX) 3.2.5 Supporting data sources 3.3 Transfer of digital data to compact disk 3.4 Documentation describing the data 4. Conclusions Acknowledgements References
A valuable meteorological data archive collected by the Alberta Research Council over the course of the Hail Studies Project in central Alberta was in jeopardy of becoming unusable as the digital data stored on magnetic tape degrade over time, and expertise in the data collection, calibration, and interpretation becomes scarce. An archiving project has been put in place to rescue it. The overall goal of this project is to preserve the digital radar, aircraft, upper air and surface precipitation data along with supporting calibrations and documentation; to transfer this archive to the University of Alberta; and to make the archive available to the scientific community.
There were three distinct operations carried out to ensure the long-term preservation of the archive; retrieval of the digital data and all supporting (secondary) data sources; transfer of digital data from magnetic tape to compact disk; and the collection and preparation of relevant documentation describing the data. The archive will provide researchers with a documented dataset to support further research in radar meteorology, climate change, hydrology, cloud physics, mesoscale meteorology and severe weather phenomena.
1. Introduction
The acquisition of atmospheric data is an expensive endeavour, and the data are
usually irreplaceable. The subsequent research uses of good data are often
not contemplated by the original data collectors. For example, data can be
re-analyzed to test new hypotheses, or can be used for comparative analyses
with other geographic areas. Data may become unusable when supporting
documentation is lost or destroyed, or when the physical media on which these
data are stored become unreadable through degradation over time, or through the
lack of equipment capable of reading the physical medium due to its obsolete
format.
The Alberta Hail Studies Project (1956-1985) was established to study hailstorm
physics and dynamics and to design and test means for suppressing hail.
Central to these activities was the Alberta Research Council's (ARC) radar
facility located at the Red Deer Industrial Airport in central Alberta (Figure
1). A vast amount of data were collected from several platforms to conduct
research into precipitation mechanisms, severe storm development, hail
suppression, hydrology and microwave propagation.
Since the termination of the Alberta Hail Project in 1986, numerous research
projects have demonstrated the value of the Alberta dataset. During the period
1990-1994, 23 archive-based publications appeared in refereed journals and
conference proceedings and 4 scientific reports were prepared. There have also
been nine graduate theses (2 Ph.D., 7 MSc) completed at 3 universities during
this period. The areas of study have included radar meteorology, cloud physics,
hydrology/hydrometeorology, computer science, instrumentation, and synoptic,
dynamic and mesoscale meteorology. Scientific research and collaborations
based on this dataset continue to this day.
Recognizing that this valuable meteorological data archive was in jeopardy of
becoming unusable as the digital data stored on magnetic tape degraded over
time, and the expertise familiar with the data collection, calibration, and
interpretation became scarce, the University of Alberta, with the participation
of the Alberta Research Council and the Atmospheric Environment Service,
initiated an effort to save this unique dataset.
2. Objectives
The specific objectives of this project were:
1. to transfer the computer-readable radar, aircraft, upper air and surface
data from the existing short-term storage medium (magnetic tape) to an
archival medium (CD ROM).
2. to collect all available supporting (secondary) data sources and develop the
necessary documentation to describe the computer-readable data files.
3. to coordinate these efforts with the University of Alberta Data Library and
develop appropriate mechanisms to make the archive available to the scientific
community.
3. Approach and Work Plan
There were three distinct operations carried out to ensure the long-term
preservation of the archive; retrieval of the digital data and all supporting
(secondary) data sources; transfer of digital data from magnetic tape to
compact disk; and the collection and preparation of relevant documentation
describing the data.
3.1 The Data Archive
The radar facility near Red Deer consists of a unique polarization-diversity
S-band (10 cm) weather radar, a standard C-band (5 cm) weather radar, and an
X-band (3 cm) radar used to track aircraft through a transponder system. With
the addition of computer interfaces in 1974, a systematic archive of radar data
was initiated. This archive now includes close to 200 Gb of data, representing
approximately 12, 000 hours of multi-parameter radar data. In addition to the
radar archive, an extensive archive of aircraft, surface precipitation, and
upper air data has also been collected. Approximately 18 Gb of data were
recorded between 1983 and 1985 aboard an instrumented research aircraft flying
through convective storms and cumulus clouds. Also, quantitative
precipitation reports (hail and rain) were obtained from approximately 500
ground stations within the radar coverage, between 1974 and 1985 and in 1989.
These data were stored on several media, including 800, 1600, 6250 bpi magnetic
tape and 8 mm cassettes. Comprehensive radar, aircraft and upper-air software
packages are also available for data analysis and display.
3.2 Retrieval of the digital data and all supporting data sources
A number of assumptions were made before the digital data were
recovered. To allow for the broadest range of research with the data,
unprocessed aircraft and radar data would be provided to the user. This would
facilitate a quicker retrieval of the data and (given the time and budgetary
constraints of the project) would result in more of the data being recovered.
To maximize the research potential, we decided to archive all data types and
work backward by year, from 1985 towards 1974, thus ensuring a complete
multi-platform data set.
3.2.1 Aircraft data
Physical experiments, designed to explore the potential of hail suppression,
and rain augmentation through airborne glaciogenic seeding in convective cells,
were conducted in central Alberta between 1983-1985, as described by Humphries
et al. (1986). These studies emphasized in-situ aircraft measurements to
investigate natural and artificially modified precipitation processes. The
primary observational platform used in these studies was the Intera/Alberta
Research Council cloud physics instrumented research aircraft, a Cessna 441
Conquest, pressurized twin-engine turboprop. Data from the instruments were
managed by a computer based data system which provided data acquisition,
recording, and real-time calculations and display (Johnson et al., 1987).
Approximately 300 magnetic tapes (copied to 1600bpi) were processed for all the
research flights conducted from 1983 to 1985. Data were segmented into unique
files for each hour of the day. The file names follow the ISO 9660 level 1
standard, and are of the form YYMMDDHH.ADB where YY - year; MM - month (numeric
to help in sorting); DD - day; HH - hour of the day; and ADB represents the
file extension identifier for Aircraft Data Block.
Aircraft data were recorded with Coordinated Universal Time. Yearly index files
summarizing the amount of information collected for each hour of the research
flight were prepared. The first line provides a brief description of the
purpose of each research flight, including the date, start and end time of data
collection (hh:mm UTC) and the type of study carried out. The subsequent lines
describe the filename, file size (in KBytes and MBytes) as well as the number
of records collected (including the 2-D cloud particle imagery).
3.2.2 Polarization radar data
The S-band polarization-diversity radar, installed in 1967, operates at 2.88
GHz and has a parabolic reflector antenna with a 6.67 m diameter dish that
produces a 1.15° beamwidth in both azimuth and elevation. The radar
sweeps out a helical volume scan rotating at 48° s-1 (7.5 s
per revolution) and rises 1° in elevation for every 360° in
azimuth, up to a maximum elevation of 8 or 20° selected depending on
the proximity of the storms to the radar. The radar system records data with
an approximate azimuthal resolution of 1°, and 147 range gates, from 3
km to a distance of 157 km from the radar, with a range resolution of 1.05 km
per range gate (English et al, 1991).
The radar can transmit any polarization, but has usually transmitted left-hand
circular (LHC) polarization with 450 kW peak power. The receiver circuitry
digitally records four measurements from each range and azimuth bin. These are
the RHC co-polar signal power, the LHC cross-polar signal power, and the
correlation and phase difference between the LHC and RHC signals.
Approximately 240 S-band radar tapes (previously copied to 6250bpi) have been
examined for the period 1980 to 1985. Radar data were extracted into files
containing one complete 3D volume scan, representing either 1.5 minutes for the
8° scan, or 3 minutes for the 20° scan. A quality control report
was produced with each data file containing the azimuth and elevation time
histories of the volume scan. The file naming convention adopted for the radar
data uses all 11 characters (YYMMDDHH.MMR) where YY - year ; MM - month
(numeric to help in sorting); DD - day; HH - hour of day (24 hour clock); MM -
minute of first data in file; and R - represents the data type (S for S-band,
C for C-band, Q for quality control reports). Radar data were recorded using
Mountain Daylight time.
3.2.3 Surface hailfall and rainfall data
Volunteer observations of precipitation events were used to supplement
quantitative surface measurements obtained by specially equipped vehicles,
which were directed beneath thunderstorms. After each storm, telephone surveys
were conducted to collect hail and rain reports. Report cards were also
received by mail from volunteer farmers. This information was useful to develop
hail climatologies and correlate hailstone characteristics with crop damage.
The surface data collection includes the digital hail and rain report files
(YYHAIL.DAT, YYRAIN.DAT), from the telephone surveys for the period 1957 to
1985 (except those files missing from 70-73); selected time-resolved hail and
rain truck observations (YYMOBILE.DAT); and daily precipitation measurements
collected by approximately 500 volunteer farmers during the months of June,
July and August for the period 1975 to 1983 (YYPRECIP.DAT).
3.2.4 Upper air data (LIMEX)
A mesoscale upper-air study, the Limestone
Mountain Experiment (LIMEX-85) was carried out over
the foothills and mountains of southwestern Alberta during July, 1985 (Strong,
1989). The objectives of the field experiment were focused on understanding
mesoscale convective processes, orographic effects, and interactions with
synoptic processes, with particular emphasis on severe storm forecasting
applications.
The archive data includes soundings every two hours from nine upper-air sites
with an average spacing of 50 km, continuous SODAR profiles, research aircraft
soundings at 20-km intervals, surface data from eight automated systems, and an
extensive cloud photo set. The compressed upper air data and accompanying
analysis software are currently archived on 2 high density 1.44 MB diskettes.
3.2.5 Supporting data sources
An equally important component of the retrieval process was the collection of
the secondary (supporting) data sources including aircraft mission scientist
notes, radar plot summaries, various operational log books, checklists,
calibration notes, cloud photographs and video tapes. The data were boxed and
transferred from the Alberta Research Council to the Meteorology Division at
the University of Alberta. Subsequently, the boxes were itemized and given a
location identifier. The contents of each box were stratified into one of five
categories (photos and maps; data summaries and timelines; operational logs,
checklists; and calibrations and data. A listing has been prepared which
stratifies the data according to the type of information and the year of
collection. A subset of the listing (for 1983-1984), reproduced in Table 1,
illustrates the variety and richness of the materials available.
3.3 Transfer of digital data from magnetic tape to compact disk
The procedures used to produce the aircraft and radar data compact disks are
depicted in Figures 2a and 2b. A series of programs was used to copy
unprocessed aircraft data from tape to file (COPYADB), and to generate hourly
flight files (TIMESEL). These hourly files were backed up on a series of 8mm
EXABYTE data tapes and high capacity SONY compact tapes. Complete 3D volume scan data files and associated quality control reports were generated
from the radar tapes (TAPEDD) and these were also backed up on EXABYTE and SONY
tapes.
Compact disks were produced using a Pinnacle Micro RCD-1000 recordable compact
disk recorder with Macintosh authoring software and a Quantum 1 GB Fast Scuzzi
3 hard drive for preparing a CD image. The requirements on the hard drive
included an average seek time of 12 milliseconds or faster, a transfer rate of
1.2 MB per second or better, and an intelligent calibration feature (ie:
thermal recalibration not performed during a continuous read to avoid a write
interruption which would render the CD invalid). The ISO 9660 level 1
standard was selected as the format for the CD-ROMs. This standard allows
the same CD to be read and interpreted on Mac, MS-DOS, UNIX, VAX/VMS, and other
computer platforms. This includes the restriction of file names to 8
characters, with a 3 character "extension".
An inventory of the compact disks produced is summarized in
Table 2. There are
62 CDs in total, including 10 research aircraft data CDs ; 47 CDs containing
the S-band polarization data from 287 days between 1979-85, 1989 and 1991;
4 CDs containing the co-incident C-band radar data from 44 days during the
period 1979-85, 1989 and 1991;and 1 CD
containing the surface precipitation files, dataset documentations, and the
radar and aircraft software source codes. Each aircraft CD contains a series of
sequential hourly data files and 3 text folders (MAC, DOS, and UNIX) containing
the file summaries and a disclaimer.
The directory structure of a radar CD is
described in Figure 3.
A radar CD contains a series of daily files (YYMMDD).
Each file contains 4 sub-directories (DATA, QUALITY, CALIB, LOGS). The DATA
sub-directory contains the sequence of radar volume scans. The quality control
reports for each radar file are located in the QUALITY sub-directory. The
calibration text files and the daily radar and transmitter log files are found
in the CALIB and LOGS sub-directories, respectively.
3.4 Documentation describing the data
A series of documents have been gathered and/or prepared to describe the
various datasets. Aircraft experiment descriptions including study objectives;
flight procedures; aircraft description and instrumentation list; 2D image
processing; aircraft tag descriptions; daily flight assessments (instrument
evaluations); and sensor calibration files accompany the digital aircraft
files.
Digital radar and transmitter logs for the period 1977-1985; as well as
descriptions of the radar characteristics and scan protocols; data structures;
and calibration files accompany the digital radar files. The primary
documentation for the surface hail and rain dataset is a coding sheet
describing the file format. The daily farmer precipitation files from 1975-1983
have an accompanying text file.
An extensive bibliography of Hail Project related papers has been compiled.
The original hard copies are currently stored in the Meteorology Division at
the University of Alberta. Software packages to analyze and display radar, and
upper air data, developed at Uof Essex and at AES in Saskatoon, respectively,
have been obtained and can be shared by users. A summary of the archive as it
is currently configured is presented in Appendix 1.
4. Conclusions
Potential users of the archive have indicated that the radar and ground
measurements dataset will be used to continue severe hailstorm and rainstorm
studies; to provide input to distributed hydrologic models; to carry out
radar-based precipitation climatology studies; and to validate numerical models
being developed during MAGS, BOREAS, GCIP, or BASE. The aircraft archive will
be used to improve our understanding of the chemical composition of cloud water
and the processes which affect it, and for icing research. The LIMEX upper air
dataset would be used for the atmospheric correction of NOAA-AVHRR data in
estimating regional evaporation, as well as in moisture budget estimates and
the evaluation of evapotranspiration studies.
A documented archive of radar, aircraft, surface and upper air data has been
provided from which further research in these areas can be carried out. The
retention and preservation of this archive at the University of Alberta Data
Library will ensure the continued accessibility and long-term survivability of
these datasets.
Acknowledgements
This project was financially sponsored by the University of Alberta, Alberta
Research Council, and the Atmospheric Environment Service. Mr. S. Kozak and Mr.
F. Bergwall assisted in the data retrieval.
The authors also acknowledge the contributions and support of Drs. EP
Lozowski, GW Reuter, and TY Gan (UoA), Dr. AR Holt (UoEssex), Dr. E. Torlaschi
(UQAM), Dr. DC Rogers (Colorado State Univ), Drs. PE Merilees, GA Isaac, P.
Joe, TW Krauss, GS Strong, Mr. J. Eley and R. Lawford (AES), Dr. BL Barge,
and Mr. CF Richmond (ARC), Dr. RG Humphries (Boreas Consulting), and Mr. D.
Andres (Trillium Engineering and Hydrographics Inc).
References
English, M., B. Kochtubajda, F.D. Barlow, A.R. Holt, and R. McGuinness,
1991: Radar measurement of rainfall by differential propagation phase.
Atmosphere-Ocean, 29, 357-380.
Humphries, R.G., M. English, and J.H. Renick, 1986: Weather modification
research in Alberta Canada. 10th Conf. Planned and inadvertent weather
modification, Arlington, AMS, 357-361.
Johnson, M.R., L.E. Lilie, and B. Kochtubajda, 1987: A data structure
for acquisition, analysis, and display of meteorological data. 6th AMS Symp.
on Met. Obs. and Instr., New Orleans, AMS, 397-400.
Strong, G.S., 1989: LIMEX-85: 1. Processing of data sets from an Alberta
Mesoscale Upper-air Experiment. Climatological Bulletin, 23, 98-118.
Table 2: Summary of Data Compact Disks (revised 95 Aug 04)
Data Type Year No. Files No. CDs Total MB
Aircraft
1983 349 5 2947.5
1984 226 3 1706.8
1985 166 2 1284.8
Radar (S-band)
1979 67 8 2922.3
1980 13 3 1433.1
1981 9 1 467.9
1982 20 3 1657.8
1983 48 8 3681.5
1984 56 9 4775.2
1985 51 11 5093.3
1989 10 2 1226.1
1991 13 2 673.5
Radar (coincident C-band)
1979 17 2 338.8
1980-85 15 1 166.5
1989+91 12 1 401.9
Back to the Home Page