INDEX

Explanations

parent

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

(defun

-0.07

Md

-0.07

rooms

-0.07

_instruction

-0.06

_SW

-0.06

WORDS

-0.06

_negative

-0.06

평

-0.06

_predictions

-0.06

POSITIVE LOGITS

ction

0.08

lovak

0.06

 Strength

0.06

 крови

0.06

 experiment

0.06

 сті

0.06

iph

0.06

.Rollback

0.06

 Messenger

0.06

.tagName

0.05

Activations Density 0.009%

parent

No Comments

No Known Activations

parent

No Comments

No Known Activations