INDEX
Explanations
information about researching or exploring a specific topic
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1150
+0.11
0.3%
1056
+0.10
0.3%
872
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1056
+0.11
0.05
597
+0.10
0.03
549
+0.09
0.04
Negative Logits
kön
-0.84
akus
-0.76
ekos
-0.73
kosme
-0.72
uefa
-0.72
kooper
-0.69
kollek
-0.68
kriminal
-0.68
minimalis
-0.68
akut
-0.67
POSITIVE LOGITS
researching
0.71
search
0.65
googling
0.65
searched
0.64
searching
0.64
online
0.63
googled
0.62
0.60
researched
0.57
onnaissance
0.56
Activations Density 0.261%