EXPLANATION TYPE
np_max-act-logits
Description
Attempts to replicate Anthropic's autointerp used for their attribution graphs paper's features.
Author
Neuronpedia
URL
https://github.com/hijohnnylin/automated-interpretability/blob/4463a9fab7d4828bfd4c33194e64856b95377166/neuron_explainer/explanations/explainer.py#L811-L1135Settings
Activations shown = 24 tokens around max act. Shows top 10 logits. Shows model the max activating token too.
Recent Explanations
be
gpt-5-mini
that are not likely to be well-known in the target
LLAMA3.1-8B-IT
19-RESID-POST-AA
INDEX 111683
pronouns and commas
Method used: 3 — top logits are pronouns/punctuation, so "pronouns and commas"
gpt-5-mini
therefore u cannot complete the quest so ur only option is
GEMMA-2-2B
18-GEMMASCOPE-RES-16K
INDEX 15327
the
gpt-5
, art easels and the like.↵There are
GEMMA-2-2B
18-GEMMASCOPE-RES-16K
INDEX 1339
foundational, initial, basic
gemini-2.5-flash
, gentle, strong (both physically and emotionally), selfless
GEMMA-3-12B-IT
31-GEMMASCOPE-2-RES-65K
INDEX 2156
punctuation and symbols
gemini-2.5-flash-lite
`return` statement is generally preferred for its concisen
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261859
made
gemini-2.5-flash-lite
Link Chain**↵↵Crafted from genuine 92
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261386
German descriptive words
gemini-2.5-flash-lite
reformation:** Religiöse Konflikte und eine Kr
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 262032
ellipsis
gemini-2.5-flash-lite
↵↵Seraphina was...remarkable. She was forty
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261747
b
gemini-2.5-flash-lite
:↵↵**1. bcache (Recommended - Most
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 262116
console logs
gemini-2.5-flash-lite
}↵ }↵ return true;↵
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261683
local
gemini-2.5-flash-lite
Vicuñas were once nearly extinct due to excessive hunting
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261906
conversational turns
gemini-2.5-flash-lite
is a test.<end_of_turn>↵<start_of_turn>model↵The Am
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 262079
proper nouns
gemini-2.5-flash-lite
** Arrive at Schiphol Airport (AMS).
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261710
technical language
gemini-2.5-flash-lite
(↵ 'id' => '_video_
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 262036
clothing and fabrics
gemini-2.5-flash-lite
|↵| **Scott Perry
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 262136
say ":"
gemini-2.5-flash-lite
' },↵ { id: '4', text
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261603
non-standard characters
gemini-2.5-flash-lite
about pepe. They just don't *get*
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261690
instructions and code
gemini-2.5-flash-lite
else.↵↵Example 1:↵<prompt>
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261792
code and logic
gemini-2.5-flash-lite
this route can suffer from low yields, particularly in the
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261989
positive attributes
gemini-2.5-flash-lite
-quality, comfortable, and stylish pajama sets – exactly
GEMMA-3-4B-IT
33-GEMMASCOPE-2-TRANSCODER-262K
INDEX 261961