INDEX

Explanations

Sports game summaries

np_max-act · gemini-2.0-flash

The neuron consistently flags play‐by‐play football commentary language—especially descriptions of defensive actions, player positions, and in‐game movements.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 referees

-0.06

ASF

-0.06

 лак

-0.06

inions

-0.06

 stressing

-0.06

cdr

-0.06

 basal

-0.06

폐

-0.06

 governmental

-0.06

 Bach

-0.06

POSITIVE LOGITS

pr

0.06

_scroll

0.06

(It

0.06

$view

0.06

 NSArray

0.06

RUN

0.06

ivement

0.06

_cs

0.06

Paid

0.06

vyk

0.06

Activations Density 0.015%

Sports game summaries

The neuron consistently flags play‐by‐play football commentary language—especially descriptions of defensive actions, player positions, and in‐game movements.

No Comments

No Known Activations