Explanations

institution

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

-0.06  "aio"
-0.06  " voiture"
-0.06  " thor"
-0.06  "्वय"
-0.06  "ervo"
-0.06  "Fly"
-0.06  " tahmin"
-0.06  " لف"
-0.06  "_refer"
-0.06  " free"

Positive Logits

0.10  " institutions"
0.10  " Institution"
0.09  " institution"
0.08
0.08  " Institutions"
0.07  " Instructions"
0.07  "规定"
0.07  "услов"
0.07  ":/"
0.07  "GC"

Activations Density: 0.008%

institution

No Known Activations
