INDEX

Explanations

Protecting from danger

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

PA

-0.07

 کسب

-0.07

-pencil

-0.06

गर

-0.06

ucson

-0.06

cht

-0.06

 fictional

-0.06

تی

-0.06

(PARAM

-0.06

ogg

-0.06

POSITIVE LOGITS

 democr

0.07

 />)↵

0.07

 Nüfus

0.07

educ

0.06

_allow

0.06

(express

0.06

_supp

0.06

.nr

0.06

.Fat

0.06

 DataView

0.06

Activations Density 0.089%

Protecting from danger

No Comments

No Known Activations

Protecting from danger

No Comments

No Known Activations