INDEX

Explanations

estead

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ableOpacity

-0.07

 avail

-0.06

bst

-0.06

safe

-0.06

 accounted

-0.06

‌دهد

-0.06

 xrange

-0.06

чает

-0.06

 Squad

-0.06

 Παρ

-0.06

POSITIVE LOGITS

Alexander

0.06

 Servers

0.06

/resources

0.06

Pictures

0.06

singleton

0.06

 Theatre

0.06

esine

0.06

collections

0.06

"));
↵

0.06

 fatty

0.06

Activations Density 0.000%

estead

No Comments

No Known Activations

estead

No Comments

No Known Activations