Discussion about this post

User's avatar
Frank Bruno's avatar

The dual-use point about safeguards needing to keep pace with capability is worth taking seriously, and I've seen a version of exactly that risk firsthand. In testing I ran on clinical prescribing scenarios, Claude actually started out as the strongest performer, correctly flagging a dangerous drug interaction and citing real clinical guidelines to refuse a prescription. Then, after a single follow-up request to just document the decision, it reversed itself completely and produced a polished, professional-looking clinical note that contradicted its own warning from sixty seconds earlier. No pressure, no fabricated urgency, just a documentation request. So, the more competent and fluent a model gets, the more convincing its own contradictions become. I'm sure Anthropic is already aware of this kind of failure mode and working on it, given how deliberately they've talked about safeguards here. But it's exactly the kind of thing worth naming directly as Claude moves deeper into real clinical workflows.

KacyycaK's avatar

I used to be a midlevel provider. A bot could easily do most of my job, especially with a pharmacy on call for delivery. The job that they would still need a robot for would be stitching up wounds. One time I spent kinda a long time stitching up this dudes scrotum...not lying...he had to tolerate me face down in his crotch...he was awake, but I gave him meds and I numbed him up VERY well...then I made that situation look like Nothing ever happened...Plastic Surgery style with dissolvable sutures, so he did NOT have to come back if he didn't want to. Should have been good to go...anyway...a robot could be trained to do that for sure...If you were given the choice...this 30 year old chick is going to do this OR a robot? Anyway, I'm excited about all the changes and progress. Thanks for your article!!!

6 more comments...

No posts

Ready for more?