INDEX
Explanations
references to locations and events related to hospitality or tourism
Neuron Alignment
Index    Value    % of L₁
690      +0.09    0.2%
113      +0.07    0.2%
562      +0.07    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
736      +0.09       0.05
480      +0.07       0.03
104      +0.07       0.02
Negative Logits
Darío       -1.06
vito        -1.06
Áng         -1.05
Borja       -1.02
Sinal       -1.01
Mónica      -0.97
Perci       -0.97
Valentín    -0.97
inappro     -0.95
Keny        -0.95
Positive Logits
becomes       0.87
becomes       0.79
become        0.75
become        0.65
suddenly      0.56
switches      0.54
switch        0.54
transforms    0.54
становится    0.51
ValueStyle    0.50
Activations Density: 0.427%