INDEX
Explanations
This neuron detects mentions of “quality of life.”
New Auto-Interp
Negative Logits
uştur
-0.08
_teacher
-0.07
كات
-0.06
signIn
-0.06
.lines
-0.06
haf
-0.06
.dy
-0.06
باعث
-0.06
�
-0.06
müdür
-0.06
POSITIVE LOGITS
Gtk
0.07
responseBody
0.06
Come
0.06
Sebastian
0.06
Ubergraph
0.06
lerle
0.06
stringWithFormat
0.06
attendees
0.06
СРСР
0.06
ivel
0.06
Activations Density 0.178%