INDEX
Explanations
terms related to regulations and regulatory environments
Neuron Alignment
Index   Value   % of L₁
1870    +0.13   0.5%
544     +0.13   0.5%
410     +0.13   0.5%
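One plausible reading of the Neuron Alignment columns is that Value is a neuron's weight in the feature's direction and % of L₁ is that weight's share of the direction's L₁ norm. The sketch below illustrates that reading only; the function name `top_aligned`, the ranking by absolute value, and the toy vector are assumptions for illustration and do not come from the dashboard.

```python
import numpy as np

def top_aligned(direction, k=3):
    """Return (index, value, percent_of_l1) for the k largest-magnitude entries.

    Assumption: "% of L1" means 100 * |value| / sum(|direction|).
    """
    direction = np.asarray(direction, dtype=float)
    l1 = np.abs(direction).sum()
    if l1 == 0:
        return []  # an all-zero direction has no meaningful shares
    # Stable sort so ties keep their original index order.
    order = np.argsort(-np.abs(direction), kind="stable")[:k]
    return [
        (int(i), float(direction[i]), float(100 * abs(direction[i]) / l1))
        for i in order
    ]

# Hypothetical toy direction, not data from the dashboard.
toy = [0.5, -0.25, 0.25, 0.0]
rows = top_aligned(toy, k=2)
```

Under this reading, a value of 0.13 at 0.5% would imply an L₁ norm of roughly 26, though both figures are rounded.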
Correlated Neurons
Index   Pearson Corr.   Cos Sim.
976     +0.13           0.03
544     +0.13           0.03
1127    +0.13           0.03
Negative Logits
Token        Logit
sophie       -0.62
smar         -0.60
crê          -0.60
dante        -0.58
nicolas      -0.58
impractica   -0.56
impra        -0.56
embra        -0.55
olivia       -0.55
resis        -0.55
Positive Logits
Token         Logit
regulation    1.34
Regulation    1.24
regulation    1.22
regulatory    1.18
Regulation    1.17
regulations   1.14
regulated     1.14
regulate      1.13
regulators    1.13
regulator     1.13
Activations Density: 0.099%