Will Anthropic open-source the training code of their SAE interpretability effort?
➕
Plus
5
Ṁ485
2028
14%
this year, fully
29%
this year, significantly incomplete
19%
next year
23%
not before 2028
14%
Other

We mean the code used for producing Scaling Interpretability blog post.

Get
Ṁ1,000
and
S3.00
© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules