INDEX
Explanations
phrases that include the preposition "of"
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.15
0.8%
159
+0.12
0.7%
98
+0.11
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
404
+0.15
0.04
457
+0.12
0.03
81
+0.11
0.03
Negative Logits
perspective
-1.49
iscus
-1.45
js
-1.42
surroundings
-1.42
ring
-1.41
yards
-1.41
dimensions
-1.38
footsteps
-1.38
properties
-1.38
compartments
-1.37
POSITIVE LOGITS
ľĵ
1.73
MOESM
1.66
ĥ½
1.65
Ń
1.62
rese
1.51
said
1.49
Fig
1.48
---|---|---
1.47
anco
1.41
↵
1.41
Activations Density 0.153%