INDEX
Explanations
text related to sci-fi and technological terms
Neuron Alignment
Index   Value   % of L₁
2034    +0.14   0.4%
184     +0.10   0.3%
752     +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1959    +0.14      0.06
106     +0.10      0.04
1445    +0.10      0.05
Negative Logits
ideolog     -0.77
reger       -0.77
guarn       -0.69
notor       -0.69
lapto       -0.69
patr        -0.68
€€          -0.67
controver   -0.65
acred       -0.64
sappi       -0.64
Positive Logits
Therefore     0.59
therefore     0.57
Fortunately   0.56
Thankfully    0.56
leyici        0.56
Luckily       0.53
This          0.52
idxs          0.52
Therefore     0.51
Hence         0.51
Activations Density 0.388%