INDEX

Explanations

Intensifying adjectives

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

�

-0.06

温

-0.06

์โ

-0.06

 визнача

-0.06

dod

-0.06

انس

-0.06

א

-0.06

 бра

-0.06

вала

-0.06

 budeme

-0.06

POSITIVE LOGITS

_DIFF

0.07

 locking

0.07

_exp

0.07

 french

0.06

seq

0.06

!")

0.06

ButtonTitles

0.06

 perceived

0.06

ela

0.06

 kernel

0.06

Activations Density 0.047%

Intensifying adjectives

No Comments

No Known Activations