INDEX
Explanations
expressions related to financial cost, fear of change, and discussions about the best option
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1253
+0.14
0.4%
1978
+0.12
0.4%
674
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
875
+0.14
0.03
942
+0.12
0.06
1176
+0.11
0.02
Negative Logits
fte
-1.05
„,
-1.02
tew
-1.00
profi
-1.00
inder
-0.99
bett
-0.98
fto
-0.98
secon
-0.98
laun
-0.98
wien
-0.96
POSITIVE LOGITS
etc
0.57
somehow
0.57
LEGGI
0.56
setDisabled
0.54
ēju
0.52
ביוגרפיה
0.52
etc
0.51
MIDDLEWARE
0.50
توضیحات
0.50
wohnung
0.49
Activations Density 0.922%