INDEX

Explanations

twe

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 summon

-0.07

=re

-0.07

Pane

-0.07

 Melissa

-0.06

Ug

-0.06

stral

-0.06

avourites

-0.06

 persistent

-0.06

.annot

-0.06

官

-0.06

POSITIVE LOGITS

 performans

0.07

 세상

0.07

$body

0.07

 hành

0.06

 partager

0.06

šit

0.06

.getDescription

0.06

-hooks

0.06

 نرم

0.06

.backward

0.06

Activations Density 0.018%

twe

No Comments

No Known Activations