INDEX
Explanations
This neuron detects mentions of studies or data in humans, i.e. occurrences of the word "humans" (often with a numerical value).
New Auto-Interp
Negative Logits
ственного
-0.06
Btn
-0.06
rette
-0.06
Assistance
-0.06
Ing
-0.06
jištění
-0.06
Playback
-0.06
Vic
-0.06
022
-0.06
आश
-0.06
POSITIVE LOGITS
searchBar
0.07
{$0.07
�
0.06
posing
0.06
germany
0.06
cùng
0.06
/>↵
0.06
.Has
0.06
لكل
0.06
making
0.06
Activations Density 0.011%