BioSPP: Statistical Methods

Point Pattern Data

The data that are typically available are the spatial locations of the recorded objects, and a label denoting object type. The location data are recorded as triples (x,y,z), that is, coordinates in 3d space. The labels, l, take integer values {1,2,3,...}, where, say 1="PML", 2="RNA Polymerase II", 3-"Acetylated Histone" etc, so we might have a data file formatted as follows:

x₁,y₁,z₁,l₁

x₂,y₂,z₂,l₂

x₃,y₃,z₃,l₃

and so on. Other information such as the volume of object observed at each location may also be available, but we ignore such information here.

The extracted data can be displayed graphically as here or here.

Here is a typical data file. The x,y,z coordinates are in columns 6,7 and 8.

REMARK     1.inp
REMARK
ATOM       1 XO   RED     1        14.541 13.893   2.392 1.00 5.88
ATOM       2 XN   BLU      2        10.169 15.487   2.013 1.00 0.85
ATOM       3 XN   BLU      3        15.689 18.437   2.266 1.00 1.03
ATOM       4 XN   BLU      4          9.835    8.068   2.423 1.00 0.88
ATOM       5 XN   BLU      5        13.642    8.988   1.775 1.00 0.49
ATOM       6 XN   BLU      6        19.754 16.116   1.769 1.00 0.39
ATOM       7 XN   BLU      7        14.886 10.409   2.551 1.00 0.67
ATOM       8 XN   BLU      8        15.179 12.477   2.653 1.00 0.74
ATOM       9 XN   BLU      9        19.597 13.290   2.651 1.00 0.58
ATOM     10 XN   BLU    10        18.648 17.766   2.728 1.00 0.78
ATOM     11 XN   BLU    11        13.198    8.388   3.096 1.00 0.34
ATOM     12 XN   BLU    12        11.366 13.011   3.176 1.00 0.63
ATOM     13 XN   BLU    13        12.874 16.799   3.347 1.00 0.60

The coordinate system is typically arbitrary; scales may differ from image to image, and so aggregation of pattern data from different images must be carried out with great care.

In biological reality, the points lie within the cell nucleus, and thus are spatially restricted. For some parts of the statistical analysis it is important to model this aspect explicitly, so a proxy for the cell nucleus, typically an ellipsoid, is also inferred from the image data (see below).

Fitted ellipsoid model of cell nucleus: PML foci coloured green

On to page 3:

Pages 1 2 3 4 5 6 7

Back to the BioSPP Home Page