INDEX
Explanations
information related to traffic rules and regulations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.14
0.4%
2019
+0.13
0.4%
1438
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
305
+0.14
0.02
191
+0.13
0.02
974
+0.10
0.02
Negative Logits
motherfucker
-0.70
bougie
-0.66
nasel
-0.63
caucasian
-0.63
spartan
-0.63
wurde
-0.61
hobo
-0.59
duffel
-0.59
xPos
-0.59
mercurial
-0.59
POSITIVE LOGITS
kafe
1.03
silikon
1.00
keramik
0.93
panik
0.93
optik
0.90
antik
0.88
maksi
0.87
seksi
0.86
konsult
0.85
kosme
0.84
Activations Density 0.044%