Explanations
This neuron primarily fires on keywords related to the long-term aspects of various topics.
Negative Logits
illard     -0.78
IRO        -0.77
ICAN       -0.74
leck       -0.72
ILLE       -0.71
Compass    -0.67
ILA        -0.64
unin       -0.61
CB         -0.61
ONSORED    -0.60
Positive Logits
itud        1.27
itudinal    1.17
sword       1.17
lasting     1.10
term        1.06
term        1.04
sighted     1.01
lasting     1.01
shore       1.01
overdue     1.00
Activations Density: 2.601%
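
As a rough illustration of what a density figure like this usually measures, the sketch below computes the fraction of tokens on which a neuron's activation is above a threshold. The page does not say how its density was computed, so the definition used here (activation strictly greater than zero), the function name `activation_density`, and the sample values are all assumptions made for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumed definition: a token counts as "active" when its activation
    is strictly greater than the threshold (zero by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    firing = sum(1 for a in activations if a > threshold)
    return firing / len(activations)


# Hypothetical activations for eight tokens; the neuron fires on two of them.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.4, 0.0]
print(f"{activation_density(sample):.3%}")  # prints 25.000%
```

Under this definition, a density of 2.601% would mean the neuron is active on roughly 26 of every 1,000 sampled tokens.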