Recall: I am cancelling the link for 48 hours while someone tries to verify.
In future cases, would it be allowed under the FOI rules to include in any new request at least one of the previously released pieces of data, so that an alignment verification check could be run? Or even request all of the original data together with the new, so you would have a fully coherent superset of the original data set?I have now received a new data file.
I requested that the data be aligned with previously released data. QMUL say that they are providing me with the information as requested. I cannot verify that to be the case. I believe, however, that these data are aligned with the individual results in the data released after Alem Matthees's successful tribunal hearing.
https://www.dropbox.com/s/ur8h533uio0x9ej/PACE_July2019.xlsx?dl=0
In future cases, would it be allowed under the FOI rules to include in any new request at least one of the previously released pieces of data, so that an alignment verification check could be run? Or even request all of the original data together with the new, so you would have a fully coherent superset of the original data set?
Not only should it be allowed, it should be mandatory. But in fact it should be mandatory for the authors not to play silly buggers and pretend that they don't have, somewhere, a single file with all the data in it, which is how everyone works, because it's simple and obvious. (People who have never seen scientific data are often surprised by just how simple it looks, unless it's coming from particle detectors at CERN, and even then, the main complexity is the number of rows.)In future cases, would it be allowed under the FOI rules to include in any new request at least one of the previously released pieces of data, so that an alignment verification check could be run? Or even request all of the original data together with the new, so you would have a fully coherent superset of the original data set?
Many of us might think they should do anywayOf course, it's just about theoretically possible that there is no single-file copy of the data left, but at that point, the entire research team should hand back their PhDs and get jobs in McDonald's.
visualising the combination of uniform and customer interface skills ! lolNot only should it be allowed, it should be mandatory. But in fact it should be mandatory for the authors not to play silly buggers and pretend that they don't have, somewhere, a single file with all the data in it, which is how everyone works, because it's simple and obvious. (People who have never seen scientific data are often surprised by just how simple it looks, unless it's coming from particle detectors at CERN, and even then, the main complexity is the number of rows.)
Of course, it's just about theoretically possible that there is no single-file copy of the data left, but at that point, the entire research team should hand back their PhDs and get jobs in McDonald's.
£5M. Biggest trial of its kind. Trial meeting notes show they planned for it. No excuses.Not only should it be allowed, it should be mandatory. But in fact it should be mandatory for the authors not to play silly buggers and pretend that they don't have, somewhere, a single file with all the data in it, which is how everyone works, because it's simple and obvious. (People who have never seen scientific data are often surprised by just how simple it looks, unless it's coming from particle detectors at CERN, and even then, the main complexity is the number of rows.)
Of course, it's just about theoretically possible that there is no single-file copy of the data left, but at that point, the entire research team should hand back their PhDs and get jobs in McDonald's.
I assume missing cells are dropouts? Reasons for dropping out are still withheld?Thanks to @sTeamTraen who says released data are probably in the correct order & has merged them with #Matthees data.
New data https://www.dropbox.com/s/hqp4drejh6e3lue/PACE_July2019.xlsx?dl=0 …
Merged files https://www.dropbox.com/s/hvwl4dayvnzgqch/PACE-AugMerged.xls?dl=0 …
CSV https://www.dropbox.com/s/tzevyxbu0o6ucnj/PACE-AugMerged.csv?dl=0 …
Folder link https://www.dropbox.com/sh/f3nfolkh1hlw9kg/AACb78M_jA3Q_NoBbnbQsWvXa?dl=0
EQ_index is EuroQol-5D-3L (EQ_index in dataset).
The questionnaire asks about 5 items of health (Mobility, Pain/discomfort, Self-care, Anxiety/depression, Usual activities) scored on 3 levels (score 1, 2, 3) + a VAS of current health state (scored 0-100). EQ asks about your health today. The questionnaire used in PACE has also added a question about comparing how you feel now with 1 year ago.
The data in the dataset is the summary index score, and not the health state score (which replicates the individual item scores - eg 12132). The summary index score is normally used for cost-utility analyses. Euroqol summary index scores [in this dataset] appear to be <1.0 (although some are negative) - each country requires its own method of calculation based on age and sex.
The Euroqol user guide (section 4, page 11) provides more info on its calculation from health states obtained from the questionnaire itself: https://euroqol.org/wp-content/uploads/2016/09/EQ-5D-3L_UserGuide_2015.pdf
I assume higher scores on the index indicate better quality of life.
Hospital Anxiety and Depression Scale is a 14-item scale and asks how you feel over the past week.
Each item scores 0-3. Overall scores between 0 and 21 - presumably because anxiety and depression qs are split (7 questions each).
(HAANT and HADET in dataset).
Work and Social Adjustment Scale (WSAT). 5 items scored on 0-8 scale.
Scores between 0 and 40. No time specified - about how you feel now.
Borg scale is scored between 6 and 20. On the laminated sheet given to pts, 7 is scored "Very, very light", 11="Fairly light", 13="Somewhat hard", 19="Very, very hard".
Measures effort perception after the step test.
(STBOR in dataset)
[ETA: This webpage provides more context: https://www.hsph.harvard.edu/nutritionsource/borg-scale/]
Patient Health Questionnaire records physical (somatic) symptoms in past 4 weeks.
15-item scale. Each item scores 0-2. Scored out of 30.
(PSTOT in dataset)
PACE trial protocol has copies of the questionnaires used, but provides no info on how the scores are calculated.
Not only should it be allowed, it should be mandatory. But in fact it should be mandatory for the authors not to play silly buggers and pretend that they don't have, somewhere, a single file with all the data in it, which is how everyone works, because it's simple and obvious. (People who have never seen scientific data are often surprised by just how simple it looks, unless it's coming from particle detectors at CERN, and even then, the main complexity is the number of rows.)
Of course, it's just about theoretically possible that there is no single-file copy of the data left, but at that point, the entire research team should hand back their PhDs and get jobs in McDonald's.
And here's my summary of the variables, from earlier in this thread:
Zero chance that the data provided to peer reviewers had those problems.Even if they have provided the data in the same order as the Matthees data release, it is shockingly bad data management not to provide any fields on which to do an external check of any merge, such as trial arm. It shows they really don't care about the data. Makes me so cross.![]()
Zero chance that the data provided to peer reviewers had those problems.
Even if they have provided the data in the same order as the Matthees data release, it is shockingly bad data management not to provide any fields on which to do an external check of any merge, such as trial arm. It shows they really don't care about the data. Makes me so cross.![]()
Can someone explain the improvements to me? On the third tab.
Funny you should say that, I think setting up McPsych is the overall plan.Of course, it's just about theoretically possible that there is no single-file copy of the data left, but at that point, the entire research team should hand back their PhDs and get jobs in McDonald's.