INDEX

Explanations

import

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

:length

-0.06

遭

-0.06

aur

-0.06

ousing

-0.06

ful

-0.06

 argent

-0.06

 crime

-0.06

 влад

-0.06

 включ

-0.06

递

-0.05

POSITIVE LOGITS

addEventListener

0.07

 PHYS

0.07

AUTHORIZED

0.07

 Broadway

0.07

 (*)(

0.07

HBO

0.06

_bonus

0.06

DialogContent

0.06

.Features

0.06

 Foto

0.06

Activations Density 0.023%

import

No Comments

No Known Activations

import

No Comments

No Known Activations