INDEX

Explanations

Code and documentation

np_max-act · gemini-2.0-flash

tokens that are parts of file paths, filenames, or other code/documentation structural markup.

oai_token-act-pair · gpt-5-mini Triggered by @vetterc0

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 müzik

-0.06

帮助

-0.06

 receptor

-0.06

_TEST

-0.06

opers

-0.06

났

-0.06

lığı

-0.06

Vault

-0.05

 привод

-0.05

{:.

-0.05

POSITIVE LOGITS

 sagen

0.07

weg

0.06

 member

0.06

гар

0.06

со

0.06

 khỏ

0.06

 collagen

0.06

 جر

0.06

 Например

0.06

ogra

0.06

Activations Density 1.195%

Code and documentation

tokens that are parts of file paths, filenames, or other code/documentation structural markup.

No Comments

No Known Activations