So tell us what you want to record and how. That’s what we’re looking for, how would you approach this? What would you do? You’re the boss
@bobbler and you are in control here

Take my suggestion and modify it to work for you
sorry I thought I had but didn't mention that I'm thinking like the app/visible type thing, mainly ticking a few boxes on a scale for not too many features, but being able to select those first. that has a shortened funcap monthly too.
I can give you (an) answers to a question of name 3 things you guess you can do, as I have done again in the next paras, but I'm pointing out there is another level too. I can't think that creatively on my abilities other than going by what I've tried. I have so much on my plate asking me to self-assess from all sorts of angles and it is about the hardest level of thinking you can get when there are lots of different overlapping conditions and of course ME being situation-dependent. So I do have insight into how blinking hard this is in reality. And how complex it is when something mightn't be a 'cure' or even 'specific to ME' but a case of fishing out benefit/risk or whatnot for things that might help and exist already.
but making sure that it can be done retrospectively (within limits, depending on what the data is you mightn't remember if you had a headache 5 days ago) if you don't get it done on that day before falling asleep then wake up and its past eg midnight cut-off.
and that if anything has to be specifically taken (eg hr) on a certain day then it does it automatically at a certain time as a back-up, so if there is eg a better time for it to be taken because of what someone has that day re: sleep-wake or whatnot then (depending on how relevant that bit it) the person can either add to it or override it. BUt it is there in case you forget.
PS I'm assuming here that the measure would be long-term, so 6months AND 12months, 18mnths (and maybe more) because that is when the crash will happen for something we though worked but was eg like a stimulant that just allowed pushing boundaries and then bit in bum with a deterioration longer term. So this longer term takes the pressure off having an over-productive month or where it lucked out with other cumualtive stuff.
BUT that the measures would only be eg for a 2 week period around those points, so people aren't having to be stuck doing an app or whatever for months on end.
As it is , when you talk about measures like shower or toothbrushing I'd be talking about a month accounting period maybe for frequency. I do think it would be interesting to add the app type data to these however because how long you can/have to shower for and how well I'd feel after are key, and because standing and brushing teeth for a consistent time each day would be a whole new ballgame vs struggling through whilst sitting when I can (but the issue is that just lenght of time of brushing could also be misleading)
It gets really complicated once I start thinking about what we'd use that would be useful and then consistently measurable instead of only thinking from the angle of what is important to me and filling in certain things.
Because for example I'd be tempted to suggest my third one would be an on-screen cognitive or reaction type test test. But then up crops issues such as at what time of day would I do that would be impacted by sleep and many other things. And if I'd only do it on every few days for example. And then me having to think whether that or if I'm lying or sitting whilst doing it really matters or is overkill if it is being done eg over a few weeks repeatedly.
I suspect the 'feet up' % is actually the more insightful over the number of steps too as a measure of health. Because if I'm ill or in PEM other symptoms might make me move as well as me feeling awful and unable to move. But I will have an overwhelming need to get my head down when I can. It probably accounted better for the extra strength but not changing the underlying ME/CFS re: steroids.
And then I have to go back to the thinking out loud of when it is trying to solve the whole conundrum instead of just 'testing whether that one thing makes a difference, and if it does then it is a clue in the bank' when I start thinking about all the cumulative stuff causing variation.
I'm not trying to frustrate, actually help by talking it around because you do have to keep coming back to the original 'research question' to check it doesn't get distracted into 'what info can you get' or 'what is the illness vs what is something helping'. This is important stuff.
I think what these measures are is something you are approx on the right track with. I do think that we need to bear in mind a few caveats as I've been trying to think out loud around
1. these things need to remember to focus on what matters with regard differentiation for the actual thing that is being tested. Just my experience over the years of working in eg differentiating things (and yes also many years of market research/insight - strange how those two areas overlapped, but basically I've done a lot of 'how do we find out/measure/validly rank' type stuff), and having to keep eye on prize as to 'what do we need/matters' rather than just ways of differentiating/ranking/getting a figure. My steroid bit goes into that somewhat - lots might change but only certain things maybe are really differentiating at a certain time period. A similar point might be that if we went with hr or something then whether it helped ME or not that (or other drugs) might might it go up anyway in some people.
2. there are some things worth trying where we can't predict where the benefit will be, but it is significant none the less which is why I included the b12 in my examples of thinking through some things in actually context/history. I haven't thought through how we accommodate for that.
I think we need to be open to the possibility that we live with such a symptom load that even if we anticipate x will mean we could do more or change something, if we realised that issue we didn't talk about anymore of a constant thick headache disappearing or breathing better (not having noticed it before) then who knows if it is 'unrelated' but it is worth having if we've done the off-on enough times so it isn't just coincidence. I'm not sure even the longer tranche of questions/symptoms would anticipate that because we mightn't realise our headache was a 4/5 and not a 2/5 because our frame of reference had been skewed by it being there so constantly (and getting worse from there) but once it disappears and the idea of it coming back then it becomes more than a 2/5?
It probably feels like those 2 points are the polar opposite of each other.
The normal approach for both of these would be to run exploratory research so we go into such things with a sense of where the 'good and bad effects' are likely to lie/appear. SO if something makes you more clear thinking it can make sure it includes that.