INDEX

Explanations

guide

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

方案

-0.06

nj

-0.06

----------------------------------------------------------------------------

-0.06

 Successfully

-0.06

기는

-0.06

.Upload

-0.06

.std

-0.06

aternity

-0.06

org

-0.06

्रच

-0.06

POSITIVE LOGITS

��

0.06

 cinnamon

0.06

gly

0.06

 muster

0.06

qi

0.06

.GetService

0.06

 selenium

0.06

ант

0.06

.execute

0.06

ература

0.06

Activations Density 0.057%

guide

No Comments

No Known Activations

guide

No Comments

No Known Activations