INDEX
Explanations
phrases related to education and communication in a classroom setting
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1150
+0.13
0.4%
453
+0.11
0.3%
1314
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
16
+0.13
0.07
1415
+0.11
0.02
1889
+0.08
0.04
Negative Logits
swarovski
-1.28
hairc
-1.25
increa
-1.24
guarante
-1.21
encomp
-1.19
affor
-1.18
disagre
-1.16
hilux
-1.15
scrat
-1.15
tupperware
-1.14
POSITIVE LOGITS
<bos>
0.97
information
0.70
ProtoMessage
0.70
information
0.68
SourceChecksum
0.67
Paglinawan
0.64
discussion
0.63
reporting
0.63
ieteur
0.63
informative
0.63
Activations Density 0.881%