INDEX
Explanations
phrases related to physical locations or positioning
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
32
+0.11
0.3%
1053
+0.11
0.3%
897
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1053
+0.11
0.08
395
+0.11
0.08
2016
+0.10
0.08
Negative Logits
éről
-0.50
része
-0.46
ároz
-0.46
واسطة
-0.45
Források
-0.45
engkapi
-0.44
gosta
-0.42
éhez
-0.42
beiter
-0.42
edema
-0.41
POSITIVE LOGITS
sappi
0.89
lamborghini
0.80
swarovski
0.77
effe
0.77
vespa
0.77
cioc
0.76
peculi
0.75
sopr
0.74
eiffel
0.72
discogs
0.72
Activations Density 0.284%