Use of data mining techniques to investigate disease risk classification as a proxy for compromised Biosecurity of cattle herds in Wales

Research output: Journal Publications and Reviews (RGC: 21, 22, 62)21_Publication in refereed journal

22 Scopus Citations
View graph of relations



Original languageEnglish
Article number24
Journal / PublicationBMC Veterinary Research
Publication statusPublished - 2008
Externally publishedYes


Background: Biosecurity is at the forefront of the fight against infectious diseases in animal populations. Few research studies have attempted to identify and quantify the effectiveness of biosecurity against disease introduction or presence in cattle farms and, when done, they have relied on the collection of on-farm data. Data on environmental, animal movement, demographic/husbandry systems and density disease determinants can be collated without requiring additional specific on-farm data collection activities, since they have already been collected for some other purposes. The aim of this study was to classify cattle herds according to their risk of disease presence as a proxy for compromised biosecurity in the cattle population of Wales in 2004 for riskbased surveillance purposes. Results: Three data mining methods have been applied: logistic regression, classification trees and factor analysis. Using the cattle holding population in Wales, a holding was considered positive if at least bovine TB or one of the ten most frequently diagnosed infectious or transmissible nonnotifiable diseases in England and Wales, according to the Veterinary Investigation Surveillance Report (VIDA) had been diagnosed in 2004. High-risk holdings can be described as open large cattle herds located in high-density cattle areas with frequent movements off to many locations within Wales. Additional risks are associated with the holding being a dairy enterprise and with a large farming area. Conclusion: This work has demonstrated the potential of mining various livestock-relevant databases to obtain generic criteria for individual cattle herd biosecurity risk classification. Despite the data and analytical constraints the described risk profiles are highly specific and present variable sensitivity depending on the model specifications. Risk profiling of farms provides a tool for designing targeted surveillance activities for endemic or emerging diseases, regardless of the prior amount of information available on biosecurity at farm level. As the delivery of practical evidencebased information and advice is one of the priorities of Defra's new Animal Health and Welfare Strategy (AHWS), data-driven models, derived from existing databases, need to be developed that can then be used to inform activities during outbreaks of endemic diseases and to help design surveillance activities. © 2008 Ortiz-Pelaez and Pfeiffer; licensee BioMed Central Ltd.