Explanations

These lists contain elements from Korean, Chinese, and Japanese languages, often functioning as grammatical particles or descriptors. The `TOKENS_AFTER_MAX_ACTIVATING_TOKEN` list includes words like "factors," "content," "view," "aspect," "model," "main," and "method."

The pattern suggests the neuron is recognizing descriptors or particles commonly found in lists or enumerations across East Asian languages, often preceding explanatory terms like "factors," "content," or "aspects."

Therefore, a good short explanation would be: East Asian descriptive particles before list items
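One rough way to check the script claim in this explanation is to tag each token by the Unicode block of its characters. The sketch below is an illustration only, not part of the dashboard or its auto-interp pipeline: the function name `scripts_in` and the three-way Hangul/Kana/Han labelling are assumptions made here, and the token list is copied from the Positive Logits section further down.

```python
import unicodedata

# Tokens copied from the Positive Logits section of this page.
POSITIVE_LOGIT_TOKENS = [
    " 것으로", " 것이", " 것도", " 것은", "もので",
    "ような", "こと", "ことは", "ものです", "場合は",
]

def scripts_in(token):
    """Return the set of coarse script labels found in a token.

    Hypothetical helper: labels are derived from Unicode character names,
    which is a heuristic and not a language detector.
    """
    scripts = set()
    for ch in token.strip():  # ignore the leading-space token marker
        name = unicodedata.name(ch, "")
        if name.startswith("HANGUL"):
            scripts.add("Hangul")
        elif name.startswith(("HIRAGANA", "KATAKANA")):
            scripts.add("Kana")
        elif name.startswith("CJK UNIFIED IDEOGRAPH"):
            scripts.add("Han")
        else:
            scripts.add("Other")
    return scripts

for token in POSITIVE_LOGIT_TOKENS:
    print(repr(token), sorted(scripts_in(token)))
```

Run on the ten Positive Logits tokens, this tags the first four as Hangul and the remaining six as Kana, with 場合は also containing Han characters. Script is only a proxy for language, so the result should be read as a sanity check on the explanation and not as a verdict on it.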

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Negative Logits

ة    0.59
的需求    0.54
由于    0.52
াগত    0.50
看着    0.50
 odors    0.49
elected    0.48
ing    0.47
使用的    0.46
造成的    0.46

Positive Logits

 것으로    0.91
 것이    0.89
 것도    0.86
 것은    0.85
もので    0.82
ような    0.79
こと    0.78
ことは    0.75
ものです    0.75
場合は    0.74

Activations Density 0.002%
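
To give the density figure some scale, it can be combined with the prompt count listed under Configuration. The sketch below assumes that the density is measured over all tokens of the dashboard prompts (238,145 prompts of 512 tokens each); the page does not state this, so the result is an order-of-magnitude estimate at best. The function names are made up for the example.

```python
# Values copied from this page.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
ACTIVATION_DENSITY_PERCENT = 0.002  # displayed value, presumably rounded

def total_tokens(num_prompts, tokens_per_prompt):
    """Total number of token positions across all dashboard prompts."""
    return num_prompts * tokens_per_prompt

def estimated_active_tokens(total, density_percent):
    """Token positions with nonzero activation, ASSUMING the density
    is a fraction of all dashboard tokens."""
    return total * density_percent / 100.0

total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"{total:,} tokens in total")
print(f"about {active:,.0f} tokens with nonzero activation")
```

Under that assumption the prompts cover 121,930,240 tokens, of which roughly 2,400 activate the feature. Because the displayed density is rounded to one significant figure, the true count could differ noticeably.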