INDEX
Explanations
The neuron fires on polite acknowledgements of permission or offers—words like “kindly,” “allowed,” and “provided” that express gratitude or granted access.
New Auto-Interp
Negative Logits
hablar
-0.06
ampions
-0.06
ือก
-0.06
もう
-0.06
-IS
-0.06
_patch
-0.06
газ
-0.06
ヤ
-0.06
<quote
-0.06
зміни
-0.06
POSITIVE LOGITS
kindly
0.07
-----------------------------------------------------------------------------↵
0.07
específ
0.06
andro
0.06
contato
0.06
Massage
0.06
searchText
0.06
(NO
0.06
Fecha
0.06
}//
0.06
Activations Density 0.008%