INDEX
Explanations
phrases or words related to the concept of "freedom"
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.07
0.3%
1092
+0.07
0.3%
1506
+0.07
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1219
+0.07
0.03
543
+0.07
0.03
1128
+0.07
0.03
Negative Logits
<bos>
-1.37
/*
-0.89
/**
-0.80
public
-0.75
/*
-0.74
-0.73
//
-0.71
#
-0.70
,
-0.68
.
-0.67
POSITIVE LOGITS
freedom
2.10
Freedom
2.10
FREEDOM
2.07
freedom
2.00
Freedom
1.96
Minang
1.91
affor
1.88
accla
1.88
stockholm
1.88
bandung
1.84
Activations Density 0.120%