INDEX

Explanations

기

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.7.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

"Mej"            -0.07
"declaration"    -0.06
".Section"       -0.06
"alara"          -0.06
".msg"           -0.06
"Tag"            -0.06
" clips"         -0.06
"efd"            -0.06
"Checksum"       -0.06
" circulating"   -0.06

Positive Logits

"하기"           0.09
"ाप"             0.07
" bridge"        0.07
" alınması"      0.07
"omon"           0.07
" highways"      0.07
"infinity"       0.07
" başka"         0.07
" kullanımı"     0.07
"각"             0.06

Activations Density: 0.016%
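To make the density figure concrete, here is a minimal sketch. It assumes that density means the fraction of sampled token positions on which the feature's activation is above zero; the page itself does not state this definition. The function name activation_density and the sample data are hypothetical and chosen only for illustration. Under that assumption, 0.016% corresponds to roughly 1 active position in every 6,250.

```python
def activation_density(acts, threshold=0.0):
    """Fraction of positions whose activation exceeds `threshold`.

    Assumed definition; the dashboard may compute density differently.
    """
    active = sum(1 for a in acts if a > threshold)
    return active / len(acts)


# Hypothetical sample: 100,000 token positions, 16 of them active.
acts = [0.0] * 100_000
for i in range(16):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")
```

With these made-up numbers the result is 16 / 100,000 = 0.00016, which matches the 0.016% shown on the page.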

No Known Activations