INDEX
Explanations
the word 'possibility' or phrases related to considerations and evaluations of potential circumstances
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
869
+0.15
0.5%
871
+0.12
0.4%
168
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
869
+0.15
0.03
2030
+0.12
0.02
871
+0.11
0.02
Negative Logits
guardare
-0.56
dimenti
-0.55
Perché
-0.54
migli
-0.53
Più
-0.52
avete
-0.52
Vedi
-0.51
Più
-0.51
Perché
-0.51
gius
-0.49
POSITIVE LOGITS
possibility
0.95
Possibility
0.94
possibility
0.92
Possibility
0.78
Possibilities
0.73
possib
0.69
possibilities
0.68
posibilidad
0.68
pylab
0.67
fays
0.60
Activations Density 0.062%