INDEX

Explanations

internet and technology

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

챠

-0.08

熛

-0.08

�

-0.07

 empez

-0.07

看

-0.07

Ⲗ

-0.07

俅

-0.07

铕

-0.07

<AM

-0.07

닢

-0.07

POSITIVE LOGITS

erc

0.07

-cart

0.07

正確

0.07

 privately

0.07

don

0.07

±

0.07

 Stmt

0.07

泸州

0.06

劣势

0.06

blems

0.06

Activations Density 0.481%

internet and technology

No Comments

No Known Activations

internet and technology

No Comments

No Known Activations