Explanations
The neuron activates on occurrences of the word “life” and on short phrases about life itself.
Negative Logits
angel       -0.07
Now         -0.07
кто         -0.06
Crisis      -0.06
Geography   -0.06
Hearts      -0.06
overcome    -0.06
اند         -0.06
undy        -0.06
Comput      -0.06
Positive Logits
oluşan      0.07
life        0.07
bool        0.06
/b          0.06
možné       0.06
cultiv      0.06
ató         0.06
énom        0.06
.number     0.06
eniz        0.06
Activations Density 0.024%
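
The page does not define how the density figure is computed. One plausible reading, assumed here and not confirmed by the source, is the fraction of sampled tokens on which the neuron's activation exceeds zero. Under that assumption, a density of 0.024% corresponds to roughly 24 firing tokens per 100,000. The function names and the zero threshold below are hypothetical, chosen only for illustration.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the neuron
    fires at all. The source page does not state its definition.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


def expected_firings(density_percent: float, n_tokens: int) -> float:
    """Expected number of firing tokens for a density given in percent."""
    return density_percent / 100.0 * n_tokens


if __name__ == "__main__":
    # Toy activations: one firing token out of four.
    print(activation_density([0.0, 0.0, 0.0, 1.5]))
    # The reported density applied to a sample of 100,000 tokens.
    print(expected_firings(0.024, 100_000))
```

A density this low would be consistent with a neuron that fires only in a narrow context, such as the word “life”, though the page itself draws no such conclusion.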