Replicating Anthropic's Feature Steering Introspection on 7B Parameter Models

2 pointsposted 3 months ago
by vuciv

No comments yet