Explanations

duplication or repetition
np_max-act · gemini-2.0-flash

questions related to software installation and configuration issues.
oai_token-act-pair · gpt-4o-mini · Triggered by @xinyanhu8

The neuron activates on numeric quantifiers and related words indicating counts or ordinals (e.g., “two,” “another,” “second”).
oai_token-act-pair · o4-mini · Triggered by @xinyanhu8

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token           Logit
" Joint"        -0.08
" unlike"       -0.07
" joints"       -0.07
" stiffness"    -0.06
" Monsters"     -0.06
"ювання"        -0.06
"students"      -0.06
"sig"           -0.06
" pint"         -0.06

Positive Logits

Token           Logit
"าษฎ"           0.07
"OKIE"          0.06
"Inform"        0.06
" Kurul"        0.06
"tml"           0.06
"tplib"         0.06
"optim"         0.06
"clone"         0.06
"ccoli"         0.06
"fw"            0.06
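
Dashboards of this kind typically obtain per-token logit effects by projecting a feature's decoder direction through the model's unembedding matrix, then listing the largest and smallest entries. The page itself does not state how its values were produced, so the following is only a minimal sketch under that assumption. The names `logit_effects` and `top_and_bottom`, and the toy 2-dimensional weights and 4-token vocabulary, are hypothetical and not taken from the SAE above.

```python
def logit_effects(decoder_direction, unembedding, vocab):
    """Project one feature's decoder direction through an unembedding matrix.

    decoder_direction: list of d_model floats (hypothetical toy values below).
    unembedding: d_model rows, each with len(vocab) columns.
    Returns (token, score) pairs sorted from most positive to most negative.
    """
    effects = []
    for j, token in enumerate(vocab):
        score = sum(
            decoder_direction[i] * unembedding[i][j]
            for i in range(len(decoder_direction))
        )
        effects.append((token, score))
    return sorted(effects, key=lambda pair: pair[1], reverse=True)


def top_and_bottom(effects, k):
    """Split sorted effects into the k most positive and k most negative."""
    positive = effects[:k]
    negative = effects[-k:][::-1]  # most negative first
    return positive, negative


# Toy example: d_model = 2, vocabulary of 4 tokens (all values are made up).
vocab = ["a", "b", "c", "d"]
decoder_direction = [1.0, -1.0]
unembedding = [
    [0.5, 0.0, -0.5, 0.1],
    [0.0, 0.5, 0.2, 0.1],
]

effects = logit_effects(decoder_direction, unembedding, vocab)
positive, negative = top_and_bottom(effects, k=1)
print("positive:", positive)
print("negative:", negative)
```

With real weights the same projection would run over the full vocabulary. The small magnitudes in the tables above (at most 0.08 in absolute value) would then suggest this feature has only a weak direct effect on any single output token.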

Activations Density 0.234%
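
Activation density is commonly read as the fraction of token positions on which the feature fires at all. The sketch below assumes that definition, with "fires" meaning an activation strictly above zero. The function name `activation_density` and the sample values are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`."""
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical activations for one feature over 1,000 token positions:
# the feature fires on two of them.
sample = [0.0] * 1000
sample[10] = 1.7
sample[500] = 0.4

print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.234% would correspond to the feature firing on roughly 2 to 3 of every 1,000 tokens.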
