Replicating Anthropic's Feature Steering Introspection on 7B Parameter Models

2 pointsposted 12 hours ago
by vuciv

No comments yet