INDEX
Explanations
words and phrases related to computer security breaches and negotiation activities
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
136
+0.07
0.2%
1601
+0.07
0.2%
851
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.07
0.04
850
+0.07
0.04
706
+0.07
0.04
Negative Logits
roberto
-0.56
oxford
-0.55
رائع
-0.55
Or
-0.54
,
-0.53
.
-0.52
laura
-0.52
Re
-0.52
लिए
-0.51
сада
-0.51
POSITIVE LOGITS
sappi
1.22
dimenti
1.21
pernic
1.18
soggior
1.16
incess
1.13
pecuni
1.12
dichi
1.10
saper
1.09
credere
1.08
dimentic
1.08
Activations Density 0.159%