INDEX
Explanations
mentions of locations, specifically Toronto and associated entities
Neuron Alignment
Index   Value   % of L₁
200     +0.15   0.5%
920     +0.13   0.4%
1023    +0.13   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
200     +0.15      0.03
489     +0.13      0.03
920     +0.13      0.02
Negative Logits
Token            Logit
câte             -0.62
يتيمه            -0.58
ագրություններ    -0.55
Și               -0.54
Și               -0.53
հղումներ         -0.50
Może             -0.50
Vezi             -0.50
păr              -0.49
Bardzo           -0.48
Positive Logits
Token        Logit
Toronto      1.40
Toronto      1.29
toronto      1.01
TORONTO      0.97
Canadian     0.96
Ontario      0.95
Canada       0.91
Canadians    0.91
toronto      0.89
Canadá       0.87
Activations Density: 0.044%