INDEX
Explanations
phrases and expressions related to expectations and forecasts
Neuron Alignment
Index    Value    % of L₁
444      +0.12    0.7%
117      +0.12    0.7%
500      +0.11    0.6%
Correlated Neurons
Index    Pearson Corr.    Cosine Sim.
117      +0.12            0.04
148      +0.12            0.01
444      +0.11            0.01
Negative Logits
Token    Logit
§        -2.96
Ļª       -2.79
Ļ        -2.72
IJ       -2.65
Īĺ       -2.61
¡        -2.59
«        -2.57
ĥ½       -2.53
¤        -2.52
Ń        -2.50
Positive Logits
Token      Logit
result     1.65
latest     1.54
soon       1.46
fresh      1.45
white      1.44
goodbye    1.43
fare       1.42
dark       1.41
passage    1.40
however    1.40
Activations Density 0.254%