Discussion about this post

User's avatar
Robert Long's avatar

Minimum refueling time is a very interesting hook and illustration - literal racing dynamics!

Expand full comment
Oscar Delaney's avatar

Nice idea! This could all be worked out later in an implementation phase if this idea gains traction, but I wonder if there would be some difficulty in specifying when the testing period starts. To use your (excellent) refueling analogy, I suppose it is easy to measure exactly when the car comes to a halt, or something similar, whereas I wonder if AI companies seeking to avoid the spirit of the minimum testing period would say that they start testing once the base model is trained, but continue doing RL post-training alongside the safety testing, so that the safety testing window isn't actually adding time until the release.

Expand full comment
1 more comment...

No posts