INDEX
Explanations
The neuron activates on phrases describing animals’ social behavior, especially mentions of living in groups, herds, or communal social structures.
New Auto-Interp
Negative Logits
Geoff
-0.07
.curr
-0.06
Porn
-0.06
unny
-0.06
clients
-0.06
oured
-0.06
treasures
-0.06
ederation
-0.06
vehicle
-0.06
izontal
-0.06
POSITIVE LOGITS
ez
0.08
¡
0.07
brighter
0.06
tweaking
0.06
olması
0.06
initially
0.06
Formation
0.06
formation
0.06
(QL
0.06
implements
0.06
Activations Density 0.013%