INDEX

Explanations

divide

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 heater

-0.06

_TEX

-0.06

 вероят

-0.06

发展

-0.06

_maps

-0.06

 Heater

-0.06

 competitiveness

-0.06

 způsobem

-0.06

Question

-0.06

 heating

-0.06

POSITIVE LOGITS

HOR

0.07

Select

0.07

_large

0.07

nEnter

0.06

 Pornhub

0.06

_ISO

0.06

 seiz

0.06

############

0.06

select

0.06

 devour

0.06

Activations Density 0.002%

divide

No Comments

No Known Activations

divide

No Comments

No Known Activations