INDEX
Explanations
information pertaining to vaccines and medical treatments
Neuron Alignment
Index   Value   % of L₁
1741    +0.23   0.7%
1535    +0.15   0.5%
2019    +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1325    +0.23      0.04
51      +0.15      0.03
1044    +0.12      0.04
Negative Logits
idać       -0.83
saluti     -0.80
jména      -0.80
ypeł       -0.73
jectures   -0.72
sappi      -0.69
pensieri   -0.69
:"-        -0.68
dovr       -0.67
tramonto   -0.66
Positive Logits
meantime    0.77
same        0.67
absence     0.63
midst       0.56
same        0.55
meanwhile   0.54
broadest    0.54
context     0.53
case        0.53
absence     0.52
Activations Density: 0.198%