INDEX
Explanations
mentions of drug addiction and related terms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1464
+0.14
0.5%
31
+0.11
0.4%
370
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1464
+0.14
0.03
1809
+0.11
0.03
1935
+0.09
0.03
Negative Logits
trovo
-0.56
disreg
-0.56
bismuth
-0.56
FSH
-0.54
hydrochlor
-0.53
tolu
-0.51
carbonic
-0.51
seclu
-0.51
pollut
-0.51
nõ
-0.51
POSITIVE LOGITS
addiction
0.95
addicts
0.82
addict
0.82
Addiction
0.80
addicted
0.79
Addiction
0.78
addictions
0.70
addictive
0.65
heroin
0.62
drug
0.61
Activations Density 0.107%