INDEX
Explanations
occurrences of the word "there" in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
355
+0.14
0.8%
261
+0.14
0.8%
398
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
283
+0.14
0.13
216
+0.14
0.11
335
+0.13
0.12
Negative Logits
Decided
-1.61
ably
-1.59
ynchron
-1.45
ually
-1.44
{}{-1.43
---|
-1.42
![
-1.41
/*!
-1.41
soever
-1.40
oath
-1.40
POSITIVE LOGITS
Īĺ
2.85
ĥ½
2.62
»¿
2.57
Ĩ
2.49
¼
2.49
ĸ´
2.49
Ļª
2.47
½
2.47
ĭ
2.46
Ń
2.40
Activations Density 0.088%