INDEX

Explanations

is are

np_max-act · gemini-2.0-flash

This neuron fires on domain-specific scientific jargon—multisyllabic, technical terms typical of academic writing.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ış

-0.06

 kênh

-0.06

σα

-0.06

illi

-0.06

probe

-0.06

olland

-0.06

 návrh

-0.06

 со

-0.06

_PARAMETER

-0.06

_SIGNAL

-0.06

POSITIVE LOGITS

>::

0.07

.dateTimePicker

0.07

.full

0.07

 tuberculosis

0.07

 nike

0.06

 Шев

0.06

 League

0.06

 něk

0.06

 теж

0.06

specs

0.06

Activations Density 0.242%

is are

This neuron fires on domain-specific scientific jargon—multisyllabic, technical terms typical of academic writing.

No Comments

No Known Activations

is are

This neuron fires on domain-specific scientific jargon—multisyllabic, technical terms typical of academic writing.

No Comments

No Known Activations