INDEX

Explanations

Ruling/governing classes

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 datePicker

-0.07

 preferences

-0.07

uby

-0.07

VARCHAR

-0.06

JavaScript

-0.06

$("#

-0.06

 Reconstruction

-0.06

avg

-0.06

Rev

-0.06

Username

-0.06

POSITIVE LOGITS

σμός

0.07

 '">'

0.07

 polož

0.06

 가진

0.06

.pnl

0.06

].

0.06

 watering

0.06

 mãe

0.06

 تهیه

0.06

 wast

0.06

Activations Density 0.041%

Ruling/governing classes

No Comments

No Known Activations