Explanations

The

np_max-act-logits · gemini-2.0-flash

Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard): Various

Features: 131,072

Data Type: float32

Hook Name: blocks.27.hook_resid_post

Architecture: standard

Context Size: 1,024

Dataset: monology/pile-uncopyrighted

Negative Logits

vtk

-0.07

doz

-0.07

Separ

-0.07

„V

-0.06

elo

-0.06

める

-0.06

egt

-0.06

 zeal

-0.06

�

-0.06

Positive Logits

 Previous

0.07

.Tasks

0.07

 htmlspecialchars

0.07

_observer

0.06

 monopol

0.06

 cleanliness

0.06

leurs

0.06

serviceName

0.06

.categories

0.06

 Fallon

0.06

Activations Density 0.036%

The

No Known Activations
