INDEX

Explanations

1. MAX_ACTIVATING_TOKENS: `]`, `.`, `)`.2. TOKENS_AFTER_MAX_ACTIVATING_TOKEN: `Computational`.3. TOP_POSITIVE_LOGITS: `ized`, `is`, `ated`, `ти`, `ित`, `та`, `es`, `ai`, `ী`, `ται`.4. TOP_ACTIVATING_TEXTS: * "...`"`Computational models of the human body..." * "...track record. (See "Technologies & Tools" at the end). " "...preventing a `RecursionError`. " "...retain moisture during cooking. (See "Brining Recipe" at the end). "The `TOKENS_AFTER_MAX_ACTIVATING_TOKEN` list starts with "Computational".The `TOP_POSITIVE_LOGITS` list contains suffixes like "-ized", "-ated", and some South Asian/Greek script characters which often appear as word endings or in scientific/technical terms.The `TOP_ACTIVATING_TEXTS` show examples like "Computational models", "Technologies & Tools", "RecursionError", and also "Brining recipe".Putting it together:The neuron seems to activate strongly when closing punctuation (`]`, `.`, `)`) is followed by words related to computational concepts or technical domains. The positive logits suggest it's looking for words that end in common suffixes or are part of technical jargon."Computational" itself is a key word. The neuron might be capturing concepts related to computational tasks, technical specifications, or scientific processes.Let's refine:- The activation often follows punctuation.- The token that follows is often computational or technical.- The positive logits are suffixes or parts of technical terms.A phrase like "computational concepts" or "technical explanations" might fit.Given `TOKENS_AFTER_MAX_ACTIVATING_TOKEN` is `Computational`, and `TOP_POSITIVE_LOGITS` includes things like `ized` and `ated` (common in scientific/technical terms), and `TOP_ACTIVATING_TEXTS` include "Computational models", "Technologies", "RecursionError".This points towards detecting computational or technical terms, especially those following a completion or end marker (like the punctuation).Considering:- `Computational` is the key token after punctuation.- Suffixes like `-ized`, `-ated` are common in technical English.Let's check the prompt rules:- Concise (3-20 words).- Find patterns.- Specific.- Not capitalizing unless proper noun.- No "This neuron detects/predicts".- No "tokens" or "patterns"."Computational concepts" is 2 words."Computational terms" is 2 words."Computational and technical terms" is 4 words.The presence of `ти`, `ित`, `та`, `ai`, `ী`, `ται` in positive logits might suggest a broader scope, including terms that are transliterated or from related technical fields, not just English. However, "Computational" and "-ized/-ated" are strong English indicators.The most striking element is `Computational` appearing right after punctuation.Let's try to capture that."Computational terms after punctuation"? Too specific?"Computational technical descriptions"? 3 words."Computational or technical context"? 3 words.How about focusing on the function? What do these together represent?The examples are about "computational models", "technologies", "security track record", "RecursionError", "Brining Recipe".These are all descriptions of processes, tools, or technical subjects.Let's consider the phrase structure: `PUNCTUATION` + `COMPUTATIONAL` + `SUFIXED_WORD`.The neuron seems to be looking for the continuation* of a technical or computational discourse after a natural break or end.Perhaps it's about identifying structured or defined components in text.Consider what "Computational" can preface: "models", "techniques", "analysis", "logic", "science".The logits `ized` and `ated` hint at descriptive terms.So, it detects situations where a computational concept is followed by a descriptive or defining word."computational description" - 2 words. This seems to fit.It detects computational topics being described.Let's look at the texts again:"Computational models of the human body" -> `Computational` followed by `models`."frameworks, libraries, and languages with a good security track record." -> Not directly `computational` but technical."preventing a `RecursionError`." -> `Recursion` is computational."Rinsing the turkey..." -> Less computational, seems to be a general pattern.The rule is "a majority should" match."Computational models" - fits."good security track record" - technical, not strictly computational, but related."RecursionError" - computational."Brining Recipe" - not computational.This makes it tricky. The "Brining Recipe" example is an outlier if we focus only on "computational". However, the prompt itself is for explaining neuron behavior. Some neurons are broader.Let's re-evaluate the logits: `ized`, `is`, `ated`, `ти`, `ित`, `та`, `es`, `ai`, `ী`, `ται`.These include English suffixes and non-English script characters. The non-English script might be from transliterations of technical terms, or from different languages discussing technical topics.For example, `ти` could be part of words like `автоматический` (automatic) in Russian. `is` and `es` are common English word endings. `ai` is common in tech (AI).If we consider `Computational` as the anchor, and the logits as potential continuations or related concepts:`computational` + `ized` -> `computerized` (word)`computational` + `ai` -> `computational AI``computational` + `es` -> `computational processes` (word)`computational` + `ти` -> possibly a Russian technical term prefix? Or suffix?The texts:"Computational models""security track record""RecursionError""Brining Recipe"This neuron might be about "technical descriptors" or "defined concepts in technical contexts." It seems broader than just "computational".However, the most prominent token after punctuation is "Computational".Let's consider the possibility that the neuron is specifically about detecting the token "Computational" or words that look very similar to it in structure or context. Given:- `Computational` is the key token following punctuation.- `TOP_POSITIVE_LOGITS` include `ized`, `ated`

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

page

1.34

package

1.31

Ш

1.31

png

1.30

pedido

1.27

pets

1.22

password

1.21

КА

1.18

pita

1.18

POSITIVE LOGITS

ized

1.74

is

1.54

ated

1.44

ти

1.44

ित

1.39

та

1.34

es

1.32

ai

1.30

ী

1.29

ται

1.28

Activations Density 0.277%