INDEX
Explanations
phrases mentioning events that happened over weekends
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1145
+0.12
0.4%
1306
+0.11
0.4%
169
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
220
+0.12
0.03
1993
+0.11
0.03
1306
+0.11
0.02
Negative Logits
lunedì
-0.60
Décembre
-0.59
Luglio
-0.56
Quels
-0.56
🕗
-0.55
préc
-0.55
Février
-0.54
renfer
-0.53
ouvre
-0.52
réuss
-0.52
POSITIVE LOGITS
weekend
1.42
weekend
1.31
Weekend
1.25
Weekend
1.24
WEEKEND
1.16
weekends
1.15
Wochenende
0.78
Saturday
0.77
keramik
0.73
Saturday
0.73
Activations Density 0.034%