Explanations

philosophical arguments

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

 mystery: -0.06
 parser: -0.06
 PRESS: -0.06
HP: -0.06
，则: -0.06
 choice: -0.06
over: -0.06
 Modern: -0.06
 situation: -0.06
""": -0.06

Positive Logits

 mimeType: 0.07
_cum: 0.07
/*@: 0.07
 Koreans: 0.07
Utf: 0.07
 πάνω: 0.06
massage: 0.06
_signature: 0.06
 cậu: 0.06
iants: 0.06

Activations Density 0.014%

philosophical arguments

No Known Activations
