Explanation: references to Libya and related entities
Neuron Alignment
Index   Value   % of L₁
50      +0.17   0.9%
313     +0.12   0.6%
1974    +0.12   0.6%
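
The "% of L₁" column is not defined on the page. A minimal sketch, assuming it means the absolute value of one entry divided by the L₁ norm (sum of absolute values) of the whole vector; the function names and the toy vector below are hypothetical. Under that assumption the listed rows imply an L₁ norm of roughly 19 to 20 (0.17 / 0.009 ≈ 18.9, 0.12 / 0.006 = 20), which is consistent given that both columns are rounded.

```python
def percent_of_l1(value, weights):
    """Share of the vector's L1 norm carried by one entry, in percent.

    Assumption: "% of L1" = |value| / sum(|w| for w in weights) * 100.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the percentage is undefined")
    return 100.0 * abs(value) / l1_norm


def implied_l1_norm(value, percent):
    """Back out the L1 norm implied by one (value, percent) row."""
    return abs(value) / (percent / 100.0)


# Toy vector (hypothetical, not taken from the page).
toy_weights = [0.5, -0.25, 0.25]
toy_share = percent_of_l1(0.5, toy_weights)

# L1 norm implied by the rows listed in the table above.
implied_from_row_50 = implied_l1_norm(0.17, 0.9)
implied_from_row_313 = implied_l1_norm(0.12, 0.6)
```

The two implied norms differ slightly only because the displayed value and percentage are each rounded to two and one decimal places.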
Correlated Neurons
Index   P. Corr.   Cos Sim.
1516    +0.17      0.02
755     +0.12      0.02
406     +0.12      0.01
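
The table reports both a Pearson correlation ("P. Corr.") and a cosine similarity ("Cos Sim."), and the two can disagree substantially. The page does not say which vectors each metric is computed over, so the sketch below only illustrates the general relationship: Pearson correlation is the cosine similarity of the mean-centered vectors. The function names and sample vectors are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation: cosine similarity after subtracting each mean."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Hypothetical vectors: perfectly anti-correlated, yet pointing in a
# broadly similar direction because every entry is positive.
u = [1.0, 2.0, 3.0]
v = [3.0, 2.0, 1.0]

corr_uv = pearson_correlation(u, v)
cos_uv = cosine_similarity(u, v)
```

For these vectors the correlation is -1 while the cosine similarity is 10/14, so a gap between the two columns is not by itself a sign of an error.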
Negative Logits
Token                Logit
<bos>                -2.70
[token not shown]    -0.79
ⓧ                    -0.72
amass                -0.72
/**                  -0.70
shuddered            -0.70
/***                 -0.68
implore              -0.68
<?                   -0.67
harmonize            -0.65
Positive Logits
Token       Logit
Libya       1.03
Libya       1.01
Libyan      0.98
roul        0.93
rafraî      0.86
rempliss    0.83
représ      0.82
rafra       0.80
dégust      0.79
quoique     0.78
Activations Density 0.041%