Explanations

caught

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_3/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.3.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token             Logit
" sábado"         -0.07
" delimited"      -0.07
" yasal"          -0.06
" birinin"        -0.06
" millenn"        -0.06
" shave"          -0.06
" antivirus"      -0.06
" konum"          -0.06
".description"    -0.06
"šla"             -0.06

Positive Logits

Token             Logit
" caught"         0.13
" catching"       0.11
" catches"        0.10
" catcher"        0.09
" Caught"         0.09
"Caught"          0.09
" catch"          0.08
" цей"            0.07
"-catching"       0.07
" Respond"        0.07
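
The logit lists above pair each token with a score for this feature. A common way to produce such lists, assumed here rather than confirmed by this page, is to take the feature's decoder direction and compute its dot product with each token's unembedding vector, then sort. The sketch below uses made-up 3-dimensional vectors and a four-token vocabulary; `logit_effects`, `direction`, and `unembed` are hypothetical names, and the numbers are not the dashboard's values.

```python
def logit_effects(decoder_direction, unembed, vocab):
    """Rank tokens by the dot product of a feature direction with each
    token's unembedding vector (highest score first)."""
    scores = []
    for token, vector in zip(vocab, unembed):
        score = sum(d * u for d, u in zip(decoder_direction, vector))
        scores.append((token, score))
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy data: one unembedding vector per token (d_model = 3).
vocab = [" caught", " catching", " shave", " sábado"]
unembed = [
    [1.0, 0.0, 0.5],
    [0.8, 0.1, 0.4],
    [-0.3, 0.2, 0.0],
    [-0.5, -0.1, 0.0],
]
direction = [0.1, 0.0, 0.05]  # assumed decoder direction for the feature

ranked = logit_effects(direction, unembed, vocab)
top_token = ranked[0][0]      # most promoted token
bottom_token = ranked[-1][0]  # most suppressed token
```

The head of `ranked` corresponds to a "positive logits" list and the tail to a "negative logits" list. Leading spaces are part of the token, which is why " Caught" and "Caught" appear as separate entries.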

Activations Density 0.008%
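
To make the density figure concrete: assuming it means the fraction of token positions on which the feature has a non-zero activation, 0.008% corresponds to roughly 8 activations per 100,000 tokens. A minimal sketch of that calculation on toy data follows; `activation_density` is a hypothetical helper, not part of any dashboard API.

```python
def activation_density(activations):
    """Return the fraction of token positions where the feature fires
    (activation strictly greater than zero)."""
    total = len(activations)
    if total == 0:
        raise ValueError("need at least one token position")
    fired = sum(1 for a in activations if a > 0)
    return fired / total


# Toy data: 100,000 token positions, with the feature firing on 8 of them.
acts = [0.0] * 100_000
for i in range(8):
    acts[i] = 1.5

density = activation_density(acts)
formatted = f"{density:.3%}"
print(formatted)
```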

No Known Activations