| Contact Us | Search | Join | Site Map | Home
About TS | News & Info | Meetings | Membership | Publications | Resources | Student & Postdoc Corner | Career Center

Letter about the HCD

August 24, 2000

Hew,

At the Teratology Society meeting I promised to send you my thoughts on the MARTA/MTA Historical Control Data project. Here they are.

First of all, let me say that I have changed my views since we last spoke. After playing some more with the search functions of the system I realised that the database is a lot more useful than I thought. Your demonstration didn't really do it justice (because of the obvious technical difficulties in doing the presentation to such a large group by remote connection). Also, like many others present, I may have forgotten what Teratology is all about! The major function of the database is of course the compilation of fetal abnormality data, which it does better than ever, thanks to the standardised terminology.

Before getting down to specifics, we should reflect on how background control data (or "hysterical data", as I like to call them) are used in the interpretation of results from regulatory toxicology studies. In practice, the incidence data and numerical litter data are evaluated very differently. We usehistorical incidence data to help assess whether a given occurrence in a treated group is consistent with a chance event; the parameter may be group-based (eg. 3abortions in a given group) or litter-based (eg. 3 fetuses from 2 litters with spina bifida). In this case, we are trying to compensate for the inadequacy of the sample size in our concurrent control group for assessing low-frequency events. This is the strong point of the database (see below). When assessing
numerical litter data, historical values are most often used to check for abnormal responses in the concurrent control group. Results from treated groups are usually only compared with historical data when the concurrent controls are considered abnormal. Also, the confidence attributed to historical parameters depends largely on their variability in comparison with that of the concurrent control. The pertinence of historical data is greatest when faced with highly variable concurrent control data. Very heterogeneous historical data (eg. maternal body weight and food consumption) are of little interest.

All that we need from the database for the purposes of assessing the significance of low frequency events (such as malformations) is a good estimate of the background incidence in the population. The database answers this question in seconds, with the added advantage of calculations on a litter-, fetus- or group-basis. With this information, we can estimate the probability of a given scenario by assuming a Poisson distribution. The same is true for higher frequency events, for which a binomial distribution may be assumed.

In order to make use of historical data in assessing numerical litter values, estimates of both the central tendency and dispersion are needed. All that we can calculate from the database at present is the mean and range. We cannot even assess the variability of the data in order to evaluate their reliability. The median would be better as a measure of central tendency than the mean, since the underlying distribution may not normally distributed. Better still, a comparison of the mean and the median would allow some estimation of the symmetry of the distribution. For me, the range is next to useless, since by definition it is most influenced by the extreme values. In a way, we are using the most abnormal members of the population as our standard. Many statisticians would actually consider excluding these extreme values as outliers. I am not sure how this could be improved in the database without greatly complicating the data input. Interquartile or percentile ranges might be one solution. Otherwise, you could require the users to enter the SD for each parameter and then estimate the population variance from that (there goes my argument about non-normal distributions!); but that would probably be too much effort for everyone concerned.

I agreed at the time with the comments from the floor during your demo that it is a shame not to make use of data from preliminary studies, but now I realise that the inclusion of under-sized groups would invalidate some of your parameters by disproportionately influencing the calculations (such as max % affected fetuses per group).

Therefore, overall, I think that the database is a very worthy initiative, which I certainly intend to use a reference in my work. Unfortunately, our laboratory is not in a position to enter data. First, as we have discussed, our management is paranoid about data confidentiality (for no good reason). Second, almost all of our data are derived using unique strains from a French breeder (namely the OFA rat and INRA rabbit) that are practically not used elsewhere, so our data are little interest to other labs. In addition, our data acquisition system (Grosse) already compiles historical data without having to re-enter the values.

I hope that all of this is of interest to you. Please don't hesitate to contact me if you have any questions.

Paul Barrow




Rev. Saturday, June 21, 2008; at 12:21:12 AM EDT
The Society thanks our generous corporate Sponsors.
© Copyright Teratology Society 2012.
Please read our disclaimer.