INDEX
Explanations
references to oral health and related treatments
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.21
1.2%
27
+0.14
0.8%
23
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
8
+0.21
0.01
27
+0.14
0.00
23
+0.12
0.01
Negative Logits
+/
-1.69
arma
-1.68
ês
-1.68
ĻĤ
-1.59
aria
-1.56
usalem
-1.46
âĪĴ/âĪĴ
-1.45
Ĩ
-1.42
nae
-1.41
watson
-1.39
POSITIVE LOGITS
iles
1.66
[(\[
1.48
iating
1.48
hest
1.48
gers
1.47
forecast
1.43
ils
1.41
shred
1.39
shovel
1.36
![
1.36
Activations Density 0.094%