Poster

Beyond Top Activations: Efficient and Reliable Crowdsourced Evaluation of Automated Interpretability

Tuomas Oikarinen ⋅ Ge Yan ⋅ Akshay R. Kulkarni ⋅ Tsui-Wei Weng

Abstract
