INDEX
Explanations
European
language related to trust and communication in relationships.
This neuron fires on the main topic words or keywords (usually nouns) that introduce what a piece of text is about.
New Auto-Interp
Negative Logits
Kle
-0.07
These
-0.06
海
-0.06
_lm
-0.06
replied
-0.06
certainty
-0.06
Pat
-0.06
Hernandez
-0.06
What
-0.06
obstruction
-0.05
POSITIVE LOGITS
<message
0.07
ms
0.07
м
0.06
union
0.06
волос
0.06
_qs
0.06
oseconds
0.06
abant
0.06
ativ
0.06
(od
0.06
Activations Density 0.224%