Readme file

SERIES C 
Applied Statistics

Unsupervised classification of chemical compounds, by P.Guttierrez Toscano and F.H.C.Marriott,
Journal of the Royal Statistical Society, Series C, Applied Statistics, Volume 48, (1999)

Address for correspondence: F.H.C.Marriott, Department of Statistics, University of Oxford, 1 South Parks Road, Oxford, OX1 3TG, UK.
E-mail: marriott@stats.ox.ac.uk 

The data set described in the paper, kindly supplied by Glaxo- Wellcome, consists of 960 `fingerprints', each of 1024 bits. The file contains these data, preceded by 17 lines of text. Note that the fingerprints, in the absence of a key, contain no information about the structure of the chemicals they represent.

If you are using Windows 95 then open a command prompt window. From this window it is possible to use DOS commands such as PKUNZIP. If PKUNZIP is not in your path then use the CD command to change to the appropriate directory. If you are unsure of the location of PKUNZIP use Find from the Windows 95 Start menu. Earlier versions of PKUNZIP may be unable to handle the file.

To unzip the file Toscano.ZIP enter the command PKUNZIP A:DATA.ZIP A file called S960_102.db2 is created in the current directory.

Journals

SERIES A
Statistics in Society

SERIES B
Statistical Methodology

SERIES C
Applied Statistics

SERIES D
The Statistician