INDEX
Explanations
phrases related to product reviews or evaluations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
587
+0.07
0.2%
1056
+0.07
0.2%
1253
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1056
+0.07
0.04
1129
+0.07
0.04
1399
+0.07
0.03
Negative Logits
reluct
-1.17
inappro
-1.11
accla
-0.99
inev
-0.98
unlaw
-0.98
inconce
-0.97
practition
-0.97
impra
-0.96
encomp
-0.95
impractica
-0.94
POSITIVE LOGITS
how
0.56
rodillas
0.54
mistrzost
0.54
脚注の使い方
0.53
twimg
0.53
fick
0.52
ExecuteAsync
0.51
why
0.51
RectangleBorder
0.51
animity
0.49
Activations Density 0.215%