Explanations
phrases related to technology or medicine
Neuron Alignment
Index    Value    % of L₁
437      +0.12    0.5%
545      +0.12    0.5%
68       +0.12    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1537     +0.12       0.03
437      +0.12       0.02
1235     +0.12       0.03
Negative Logits
MessageBoxIcon    -0.66
Allister          -0.51
InputModule       -0.51
curacies          -0.50
Solución          -0.50
SwitchCompat      -0.50
Cormack           -0.49
Història          -0.48
FlatAppearance    -0.48
🕗                -0.48
Positive Logits
cool        1.30
cool        1.28
COOL        1.27
Cool        1.25
Cool        1.19
cools       1.18
coolness    1.16
COOL        1.12
cooler      1.00
cooling     0.98
Activations Density: 0.064%