INDEX

Explanations

This neuron is strongly associated with the word "deck" and its variations. Looking at the `TOKENS_AFTER_MAX_ACTIVATING_TOKEN`, we see common continuations like "and", "off", and words related to cleaning ("cleaner") or structural elements ("jo"). The `TOP_ACTIVATING_TEXTS` reinforce this by providing contexts like "build a 10x12 foot deck", "Deck & Patio Cleaning", "preferred method for deck joists", "used for decks or patios", "Wood Decking", "beautiful decks", "Use a deck cleaner", "under decks and porches". All these suggest the neuron is activated by discussions related to outdoor wooden structures, particularly decks.The `TOP_POSITIVE_LOGITS` seem to be an artifact of a multilingual model or a specific training setup, potentially unrelated to the core semantic meaning associated with "deck" in English texts. The task is to find a pattern within the English activating texts and tokens.The dominant pattern is the word "deck" and phrases or contexts where it appears, often associated with construction, maintenance, or placement.Therefore, a concise explanation would be:"deck"

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Childhood

0.50

 एंड्रॉयड

0.49

ЕТ

0.49

settes

0.49

 والدہ

0.49

migration

0.48

 Centrale

0.47

scp

0.47

 Learning

0.47

 নয়

0.47

POSITIVE LOGITS

をお

0.46

 truyền

0.45

እ

0.44

ఉ

0.43

 mehreren

0.43

 pemb

0.43

xm

0.43



0.43

lia

0.42

 fasci

0.42

Activations Density 0.001%