An Explanation of the Hurricane Research Division Archive of Post-Season Processed Hurricane Data Written by Ed Rahn The primary data in this archive are held in files with the extension ".APF", standing for "ASCII ProFile". Each such file is just what the name implies - an ASCII file holding data that give a profile of the storm during one aircraft pass. This is a single pass, either inward or outward, not a complete penetration. The APF file contains 299 data lines (no header) with explanatory text at the bottom of the file. Each line contains data for one radius. The variables contained, reading from left to right, are time (in seconds relative to a zerotime, defined as 0000Z on the morning of a starting day, month and year) aircraft latitude and longitude, ZP (height in meters of a standard pressure surface), radial, tangential and vertical winds (in meters per sec.), temperature and dewpoint (degrees Celcius) and liquid water content (grams per kilogram). Vertical wind and liquid water are missing from data sets taken by Air Force aircraft, since the data systems on board these aircraft are unable to measure them. All the data in the APF files are integers, with implied decimal points - the position of the decimal point for each variable is given in the text at the bottom of the file. All these data are expressed as functions of distance from the storm center. The original data are functions of time - one second data are the inputs to the analysis, except for the special case of Air Force data, which will be explained below. A track, in the form of beta splines, is fit to the storm motion, and the data are recomputed and averaged as functions of distance from the (moving) storm center. Data are lumped into 300 "bins", which are actually imaginary rings concentric to the storm center. The width of bins, and their spacing, are controlled by the quantity DR, the half-width of a bin. The bins are 1/2 DR apart, meaning that there is considerable overlap of bins, and a single one second datum will be averaged into several bins, although with variable weighting. If DR is 1.0 (kilometers) the total bin width is 2.0, the nominal bin radius (from the storm center) increases by 0.5 km from one bin to the next, the closest bin is 0.5 km from the center, and the farthest, 150 km. However, due to a miscalculation during the writing of the original software, which for consistency we were forced to propagate through later versions of the code, the outermost bin is always empty. For this reason, the last bin is not included in any .APF file; that is why the real total number of bins is the conspicuously non-round number of 299. All data are weighted as they are placed in bins; the weighting decreases linearly from 1.0 for a datum at exactly the nominal bin radius, to 0.0 at exactly plus or minus DR. After all the data for a pass has been placed in bins, the data in each bin is normalized: the sum of the data is divided by the sum of the weighting factors. The data we receive from the Air Force are ten-second averages. Interpolation is performed to convert this into one-second data, which are then input into the analysis as if they were one-second data originally. The data held in all the APF files for a single storm are also held in a single file with the extension ".RPFL" - standing for Real ProFiLe. The data is in ASCII coded real numbers with explicit decimal points, with more significant figures for most variables than the APF files. As a result, each data line is 95 characters long. The APF files are more convenient to work with, (only 80 characters per line maximim) but the RPFL files are more accurate. Each data line in an RPFL file has the same variables in the same order as an AFL file, except that the bin number is absent. The structure of the file is as follows: The first 4 lines are headers giving data for the whole storm; the storm name, starting data (the 3rd line holds the date as integer month day and year at the start of the line, which is thus readable by a simple FORTRAN list-directed READ) and value of DR. Then for each profile in the storm there are a blank line, 4 header lines giving information for that profile, and 299 data lines. For a storm with N profiles the total number of lines is 4 + (N * (1 + 4 + 299)). There are other files associated with this archive, of which the index files (with extensions of .IDX) are the most important. There is one IDX file for each storm. Each IDX file contains in the first line the number of profiles (ie., the number of .APF files), the number of flights, the value of DR for the storm (there is always only ONE value of DR for a storm - when there is not, the storm must be split into "logical" storms) and the starting day in integer month, day and year. Succeeding lines hold data on each flight: the flight ID, the global pass numbers of the first and last passes for that flight, and the standard pressure for the flight. There is a final line holding the storm name as used in this archive, which always includes the year in two digit form. For example, Hurricane Zachary of 1992 is designated ZACHARY92. Since the archive begins with 1977, we can continue with two digits to designate the year until 2077, at which time we will have to go to four digits, for example AARON2077. Stay tuned for this :-) There is also one .NDX file for each storm. It is like the .IDX file except that there is one line of data per profile, rather than one per flight. The IDX file is pretty much necessary to use the archive, and the .NDX file, while not indispensible, contains some very useful summary information. There is only one starting date for an entire storm - meaning all the times in seconds in all APF, RPFL, TRK and POS files for a storm are all relative to the same zero time. There is no hard and fast rule determining the starting date - it is simply whatever is convenient for the processing, and is not necessarily the starting date assigned by the National Hurricane Center. A flight ID contains the date of the flight; the next character is an alphabetic character designating the aircraft (H for NOAA42, I for NOAA43 and U for any Air Force aircraft) and there is a final numeral to indicate which flight during that day this flight is, if the aircraft (or any aircraft for the Air Force) flew more than once during that day. Thus, if a flight is the fourth for the Air Force on Feb. 29, 1992, the flight ID will be 920229U4. However, there is a slight complication here - if there is a change in altitude during the flight, the flight may have to be split into two (or very rarely, three) logical flights, which can add one more character at the end; thus the flight ID can be up to 9 characters long. One or more track files (files holding spline coefficients, with the extension of ".TRK" at the end) are necessary for the processing of data. The spline coefficients describe the motion of the storm center - this is necessary information to place data in bins, since the bins move with the storm. The spline coefficients are necessary if users of this archive wish to translate the winds (which are storm-relative) into Earth-relative winds. The simplest case is that in which there is one global track file which was used to process all flights in a storm - i.e., one track file that covers the entire period during which aircraft were flying the storm. For the convenience of users of this archive, there are also files with the extension ".POS". A POS file holds a list of positions and velocities for the storm center; each POS file is calculated using the spline coefficients from the TRK file with the corresponding name. All TRK files are calculated by fitting splines to a set of centers and as a matter of good form, the center file from which each TRK file was generated is included in the archive. In summary: In the simplest case, for each storm there will be one CTR file holding all the centers used for final processing, one TRK file generated from this file, one POS file generated using the TRK file, and one IDX file that contains information useful for interpreting the data. These files will have names identical except for the extensions. Life however, is not always simple. For some storms in which flights were at different altitudes, there is more than one track; for example, one track for processing of flights at 700mb and another for flights at 850mb. Other storms have been divided into two or more logical flights, with separate CTR, TRK, POS and IDX files for each. In several cases, especially those of the earlier storms, due to misunderstandings, disk crashes, or whatever, some files have not been preserved. Some TRK files, and a few CTR files, are re-creations of the originals, which have been lost. In such cases, there is explanatory text at the end of the files. For a storm in which the simplest case does not apply, there is a README.ASCII file in the individual storm subdirectory, explaining the circumstances. By the way, since there is explanatory text at the end of APF, TRK and CTR files, a program reading one of these cannot simply read until the end-of-file is detected, it must know where the end of the data is. An APF file has 299 data lines; the structure of the following text is fixed - consult any APF file if you need to know what it is. A CTR file has data lines until -9999.90 appears as an end-of-data marker. The zerotime appears in the fourth line after this, in integer month, day and year. The structure of the TRK file is variable. There is one line at the beginning holding the number of splines, the start and end times (seconds relative to the zerotime) and the time interval per spline. There are at least two lines for the latitude coefficients, and two lines for the longitude coefficients; if the spline number is 11 to 15, there are three lines each, if the spline number is 16 to 20 (20 is the maximum allowed) there are four lines each. The explanatory text has the same format as for CTR files; the zerotime is held in the fourth line after the end of the data. Some of the storm subdirectories have further subdirectories called .MISC. These hold data used during processing, but which are of absolutely no use to you. Ignore them. There is a subdirectory (directly below the one you are now in) called .CODE which has FORTRAN code that may be of use to you in working with the data in this archive. The file reading subroutines at least may be of interest, and we will have graphical routines to display the data available in the not too distant future. If you have questions about the code or the data in the archive, send them to Ed Rahn (rahn@aoml.noaa.gov) We have tried to check this archive for errors as thoroughly as possible, but if you notice any that have slipped by us, we would appreciate hearing from you. There are also graphical radar composites in existence for many of the storms in this archive, but they are not included in it as of this time. A great deal of work needs to be done to translate these into a standard graphical format. We hope to have them available here in the near future. At the moment, all the files in the entire archive are ASCII. Acknowledgements The project of post-season analysis of storms is the brainchild of Dr. Hugh E. Willoughby, who has supervised the project during its entire lifetime. The early software development was done by Hugh Willoughby, Marcy Chelmow and Jeanne Clos. William Patrick Barry made later additions to the software, and Ed Rahn has done most of the software development since the late Eighties. The track fitting is done using beta splines developed by Dr. Katsuyuki V. Ooyama. For the first few years Hugh Willoughby did the actual data processing. Ed Rahn took over this in early 1987, and has processed several of the storms dating from before that time.