Hi, yes, I’m interested. updates would be good.
Thanks for making this available, I’ve been looking through it and there it would seem there are quite a few errors, the extra-fast and extra-slow times are easy enough to spot but mistakes within normal parameters aren’t. If possible (!) it might be a good idea to scrape RP data and compare the two.
The date format is a little awkward, if it were along the lines of 2010-12-10 (for example) it would make re-ordering easier and enable a unique numbering of each entry (per spreadsheet) – useful when correcting mistakes.
Probably the most serious omission from the data is the class of race, I’m not sure how you can make accurate standard times without this data. If you take an average from all races you are not accounting for the different average class peculiar to each course. It would also be useful (at least from a research perspective) to have age, weight, OR and winning distance data (if possible!).
Whatever, it’s an admirable and inspirational project, I wish you well.