INDEX
Explanations
terms related to infectious diseases and their characteristics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
377
+0.13
0.7%
150
+0.12
0.6%
464
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
377
+0.13
0.04
415
+0.12
0.02
374
+0.11
-0.00
Negative Logits
âĢĤ
-1.90
resis
-1.50
reira
-1.49
iento
-1.48
lap
-1.45
Advertisement
-1.43
ranged
-1.40
âĢĬ
-1.40
ÂĹ
-1.39
illet
-1.37
POSITIVE LOGITS
?",
1.75
?”
1.65
?_
1.57
role
1.45
Tigers
1.43
lon
1.42
cliffe
1.41
?’
1.36
abases
1.34
asp
1.34
Activations Density 1.041%