Explanations
The neuron detects mentions of “booby traps.”
Negative Logits
Token         Logit
undler        -0.07
Riyadh        -0.06
للت           -0.06
.Bunifu       -0.06
.say          -0.06
/MPL          -0.06
_kill         -0.06
upil          -0.06
Link          -0.06
پیشنهاد       -0.06
Positive Logits
Token          Logit
fas            0.07
! ↵            0.07
âm             0.06
prev           0.06
Events         0.06
details        0.06
maid           0.06
有             0.06
.getElements   0.06
hacking        0.06
Activations Density 0.000%