INDEX
Explanations
phrases related to political controversy and policy decisions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1978
+0.10
0.3%
1919
+0.08
0.2%
1372
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.10
0.04
434
+0.08
0.04
1420
+0.08
0.04
Negative Logits
GEBURTSDATUM
-0.66
Accès
-0.57
Mə
-0.55
providedIn
-0.55
Largeur
-0.55
Poids
-0.53
ագրություններ
-0.53
Personensuche
-0.51
Longueur
-0.50
Exemples
-0.50
POSITIVE LOGITS
photographe
0.72
getty
0.72
curé
0.71
élève
0.70
gti
0.68
jaya
0.68
Docteur
0.67
vainqueur
0.66
gué
0.65
jajaja
0.64
Activations Density 0.121%