INDEX

Explanations

colon, quotation mark

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 sincerity

-0.07

uples

-0.07

Svc

-0.06

_factors

-0.06

数据

-0.06

.friend

-0.06

 repaired

-0.06

 ngại

-0.06

 до

-0.06

Spatial

-0.06

POSITIVE LOGITS

buff

0.07

DONE

0.07

Vs

0.07

 Baltimore

0.06

 McLaren

0.06

 steep

0.06

 backpack

0.06

sudo

0.06

 matcher

0.06

이크

0.06

Activations Density 0.033%

colon, quotation mark

No Comments

No Known Activations

colon, quotation mark

No Comments

No Known Activations