INDEX
Explanations
terms related to cyber security and associated threats
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.17
1.0%
148
+0.15
0.9%
111
+0.14
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
384
+0.17
0.02
232
+0.15
0.01
79
+0.14
0.02
Negative Logits
į
-2.12
ĨĴ
-2.03
Ľ
-1.87
Ļª
-1.86
ĸ´
-1.83
Ļ
-1.70
present
-1.66
·¸
-1.62
ľ
-1.62
½
-1.61
POSITIVE LOGITS
crime
1.74
ató
1.67
punk
1.66
notes
1.65
prints
1.59
urges
1.59
bour
1.57
ips
1.54
tracts
1.54
waves
1.53
Activations Density 1.594%