Explanations
solitude
This neuron responds to words and phrases indicating solitary or isolated living situations (e.g., “alone in a shack”).
Negative Logits
Token        Logit
(task        -0.07
Rh           -0.06
楽し         -0.06
vẫn          -0.06
hostage      -0.06
quienes      -0.06
Joh          -0.06
HashTable    -0.06
userInput    -0.06
Fade         -0.06
Positive Logits
Token        Logit
liked        0.07
μέσα         0.07
earn         0.07
مول          0.07
قة           0.06
seventh      0.06
Suppliers    0.06
months       0.06
press        0.06
формування   0.06
Activations Density: 0.008%