INDEX
Explanations
references to legal and criminal codes
Neuron Alignment
Index   Value   % of L₁
1499    +0.12   0.4%
872     +0.12   0.3%
1343    +0.11   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1499    +0.12      0.04
1597    +0.12      0.02
811     +0.11      0.03
Negative Logits
stdarg      -0.67
paisley     -0.63
indescri    -0.60
Zunanje     -0.60
hairc       -0.60
exasper     -0.58
cushi       -0.57
sophistic   -0.57
Genau       -0.57
reluct      -0.56
Positive Logits
teras       0.89
marte       0.87
sement      0.83
kasa        0.81
kafe        0.81
torba       0.81
silikon     0.80
karton      0.79
seksi       0.77
potest      0.76
Activations Density 0.137%