INDEX
Explanations
phrases related to technological advancements and changes over time
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1166
+0.10
0.3%
674
+0.09
0.2%
1385
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1181
+0.10
0.05
114
+0.09
0.04
1166
+0.08
0.06
Negative Logits
sophistic
-0.86
unspeak
-0.83
indestru
-0.83
inconce
-0.79
shenan
-0.78
intrigu
-0.76
indescri
-0.75
hentai
-0.75
downvotes
-0.74
impra
-0.73
POSITIVE LOGITS
anymore
1.06
<bos>
0.85
enää
0.63
otheby
0.59
lagi
0.52
AssemblyCulture
0.49
SourceChecksum
0.48
sumpay
0.48
traditional
0.48
Jereo
0.48
Activations Density 0.470%