Explanations
phrases related to boats and water activities
Neuron Alignment
Index   Value   % of L₁
101     +0.14   0.5%
874     +0.13   0.4%
1103    +0.12   0.4%
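The "% of L₁" column is not defined on this page. One plausible reading, stated here as an assumption, is that it gives each neuron's absolute weight as a share of the L₁ norm (sum of absolute values) of the whole alignment vector. A minimal sketch of that calculation follows; the function name `l1_share` and the toy vector are hypothetical and are not the feature's actual weights.

```python
def l1_share(weights):
    """Return each weight's absolute value as a percentage of the L1 norm.

    Assumption: "% of L1" means 100 * |w_i| / sum_j |w_j|.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; shares are undefined")
    return [100.0 * abs(w) / l1_norm for w in weights]


# Hypothetical toy vector, chosen only so the arithmetic is easy to follow.
toy_weights = [0.5, -0.25, 0.25]
toy_shares = l1_share(toy_weights)
```

Under this reading, a value of +0.14 at 0.5% would imply an L₁ norm of roughly 28 to 32 for the full vector, which is consistent with the other two rows once rounding is taken into account.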
Correlated Neurons
Index   P. Corr.   Cos Sim.
908     +0.14      0.02
1895    +0.13      0.02
1351    +0.12      0.02
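Reading "P. Corr." as Pearson correlation and "Cos Sim." as cosine similarity (the page does not spell out the abbreviations, nor which vectors are compared), the two measures differ only in mean-centering: Pearson correlation is the cosine similarity of the mean-centered vectors. The sketch below illustrates that relationship with hypothetical helper names and toy inputs. It is not the dashboard's own code.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation, computed as cosine similarity after mean-centering."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Toy inputs (hypothetical): b is a shifted copy of a, so the two are
# perfectly correlated even though their directions differ.
a = [1.0, 2.0, 3.0]
b = [11.0, 12.0, 13.0]
toy_pearson = pearson_correlation(a, b)
toy_cosine = cosine_similarity(a, b)
```

Because the two columns are likely computed over different quantities (for example, activations versus weight directions), a correlation of +0.14 alongside a cosine similarity of 0.02 is not contradictory.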
Negative Logits
krab        -0.90
klap        -0.84
ananas      -0.75
kombi       -0.74
drap        -0.74
hek         -0.73
dora        -0.73
vola        -0.73
apparente   -0.72
impon       -0.71
Positive Logits
boat        1.27
boat        1.26
boats       1.12
Boat        1.10
boats       1.08
Boat        1.06
Boats       0.92
Boats       0.90
BOAT        0.89
boating     0.82
Activations Density: 0.093%