Explanations

code/logs

np_max-act · gemini-2.0-flash

The neuron fires on tokens that appear in code‐example output or printed/error message sections, i.e. program output lines rather than code.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" груп"          -0.06
"pv"             -0.06
"_visibility"    -0.06
".WriteHeader"   -0.06
"Th"             -0.06
"Colorado"       -0.06
"_SCHED"         -0.06
"Rex"            -0.06
"аки"            -0.06
"_SUP"           -0.06

Positive Logits

"orno"           0.07
"-ready"         0.07
"<style"         0.07
" majority"      0.06
"ağın"           0.06
"-back"          0.06
" fetus"         0.06
" bone"          0.06
"、↵"            0.06
"street"         0.06
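
One common way logit tables like these are produced is to project the feature's decoder direction through the model's unembedding matrix and rank tokens by the resulting score. The page does not state its method, so treat that as an assumption. A minimal pure-Python sketch with toy numbers follows; `top_logit_effects`, the vocabulary, and every value in it are hypothetical and are not taken from this feature:

```python
def top_logit_effects(decoder_direction, unembed_rows, vocab, k=2):
    """Return (k most negative, k most positive) token scores for one feature.

    Each score is the dot product of the feature's decoder direction with
    the unembedding vector of a token (one row per token here).
    """
    effects = []
    for token, row in zip(vocab, unembed_rows):
        score = sum(d * u for d, u in zip(decoder_direction, row))
        effects.append((token, score))
    ranked = sorted(effects, key=lambda pair: pair[1])
    negative = ranked[:k]          # most negative first
    positive = ranked[-k:][::-1]   # most positive first
    return negative, positive


# Toy 2-dimensional example (hypothetical values).
vocab = ["a", "b", "c", "d"]
unembed_rows = [[0.1, 0.0], [-0.2, 0.1], [0.0, 0.2], [0.3, -0.1]]
decoder_direction = [1.0, -0.5]

negative, positive = top_logit_effects(decoder_direction, unembed_rows, vocab)
```

Under this reading, scores as small and as uniform as the ones listed (all near ±0.06) would suggest the feature has little direct effect on any particular next-token prediction.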

Activations Density 0.123%
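
Activation density is usually read as the fraction of sampled tokens on which the feature has a nonzero activation. The page does not define the term, so that reading is an assumption. A small sketch of the calculation, with a hypothetical helper name and toy data:

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`."""
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy sample: 1 firing token out of 1,000 (hypothetical values).
toy = [0.0] * 999 + [2.5]
density = activation_density(toy)

# Under the same reading, a density of 0.123% over a 1,024-token context
# corresponds to a little over one firing token per context on average.
expected_per_context = 0.00123 * 1024
```

A density this low is consistent with a sparse feature that fires on a narrow class of tokens.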
