INDEX
Explanations
mentions of HIV/AIDS, patients, infections, and related healthcare issues
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1624
+0.16
0.5%
1350
+0.12
0.4%
1741
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1624
+0.16
0.02
1742
+0.12
0.01
1296
+0.12
0.01
Negative Logits
виправивши
-0.61
wata
-0.56
República
-0.49
nakalista
-0.49
ntö
-0.46
marshaller
-0.46
Gazetteer
-0.45
minecraftforge
-0.44
ponto
-0.43
Pek
-0.42
POSITIVE LOGITS
HIV
1.21
HIV
1.18
AIDS
1.03
AIDS
0.96
VIH
0.88
Wtf
0.85
hiv
0.78
rodriguez
0.77
Lmao
0.76
intersper
0.72
Activations Density 0.039%