Explanations
Possibility
The neuron detects phrases expressing limitless possibility, especially variants of “anything is possible.”
Negative Logits
Token        Logit
lun          -0.07
utils        -0.07
main         -0.06
zorunda      -0.06
ToJson       -0.06
rosse        -0.06
enn          -0.06
SU           -0.06
excelente    -0.06
lose         -0.06
Positive Logits
Token        Logit
REG          0.07
บาท          0.06
里           0.06
yapmaya      0.06
�            0.06
_FREE        0.06
nouve        0.06
제품         0.06
니           0.06
_stylesheet  0.06
Activations Density 0.008%
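
The density figure can be read as the share of token positions on which the neuron fires. The sketch below illustrates that reading under stated assumptions: that density means the percentage of positions with activation above a threshold, and that the threshold is zero. The function name `activation_density` and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of token positions on which
    the neuron is active, expressed as a percentage.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: the neuron fires on 8 of 100,000 token positions.
sample = [0.0] * 99_992 + [1.5] * 8
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.008% corresponds to roughly 8 active positions per 100,000 tokens, which is in line with a neuron that responds only to a narrow family of phrases.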