
|
Readme file
SERIES C
Applied Statistics
Unsupervised classification of chemical compounds, by
P.Guttierrez Toscano and F.H.C.Marriott,
Journal of the Royal Statistical Society, Series C, Applied Statistics, Volume 48,
(1999)
Address for correspondence: F.H.C.Marriott, Department of Statistics, University of
Oxford, 1 South Parks Road, Oxford, OX1 3TG, UK.
E-mail: marriott@stats.ox.ac.uk
The data set described in the paper, kindly supplied by Glaxo- Wellcome, consists of
960 `fingerprints', each of 1024 bits. The file contains these data, preceded by 17 lines
of text. Note that the fingerprints, in the absence of a key, contain no information about
the structure of the chemicals they represent.
If you are using Windows 95 then open a command prompt window. From this window it is
possible to use DOS commands such as PKUNZIP. If PKUNZIP is not in your path then use the
CD command to change to the appropriate directory. If you are unsure of the location of
PKUNZIP use Find from the Windows 95 Start menu. Earlier versions of PKUNZIP may be unable
to handle the file.
To unzip the file Toscano.ZIP enter the command PKUNZIP A:DATA.ZIP A file called
S960_102.db2 is created in the current directory.
|
Journals SERIES
A
Statistics
in Society
SERIES B
Statistical
Methodology
SERIES C
Applied Statistics
SERIES D
The
Statistician

|