INDEX
Explanations
Thoughts and feelings
The neuron fires on key content words that denote someone’s focus or objective—nouns like “mind,” “goal,” “thing,” or “strength” that signal what’s central in the discourse.
New Auto-Interp
Negative Logits
/login
-0.07
itou
-0.07
etus
-0.07
PKK
-0.07
arrests
-0.07
请输入
-0.07
lyon
-0.07
greens
-0.06
Sergei
-0.06
Quint
-0.06
POSITIVE LOGITS
__(↵
0.07
rep
0.06
_READ
0.06
sağlar
0.06
establish
0.06
wherein
0.06
-window
0.06
Objective
0.06
lọc
0.06
์โ
0.06
Activations Density 0.023%