INDEX
Explanations
terms and phrases related to user login and authentication processes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.27
1.5%
281
+0.11
0.7%
45
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
82
+0.27
0.01
45
+0.11
0.01
17
+0.11
0.00
Negative Logits
Ĭ
-2.40
´
-2.29
Ļ
-2.19
IJ
-1.97
-1.95
-1.95
↵ ↵
-1.95
↵
-1.95
↵ ↵
-1.95
↵
-1.95
POSITIVE LOGITS
credentials
1.99
ership
1.93
mistake
1.75
notification
1.71
controller
1.70
service
1.67
experience
1.65
server
1.59
realm
1.57
opportunity
1.57
Activations Density 0.174%