Both footrot and foot integrity scoring scales were more consistent within observers than
between. The weighted kappa values between observers for both footrot and integrity scoring scales ranged from moderate to substantial. There was disagreement between observers with both observer bias and different thresholds between score values. The between observer thresholds were different for scores 1 and 2 for footrot (using photographs and videos) and for all scores for integrity (both walls and soles). The within observer agreement was higher with weighted kappa values ranging from substantial to almost perfect. Within observer thresholds were also more consistent than between observer thresholds. Scoring using photographs was less variable than scoring using video clips or feet.
Latent class modelling is a useful method for exploring components of disagreement within and between observers and this information could be used when developing a scoring system to improve reliability.
For more of the article, please click here.
Article made possible through the contribution of Alessandro Foddai, Jasmeet Kaler, Sam Mason, Laura E Green and BioMed Central.