INDEX
Explanations
phrases that indicate time or sequential events, particularly those using the word "next."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.27
1.4%
650
+0.13
0.6%
776
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
405
+0.27
0.04
650
+0.13
0.04
776
+0.12
0.04
Negative Logits
<bos>
-2.27
/***
-0.77
/**
-0.67
<?
-0.64
emplace
-0.62
-0.61
///**
-0.59
declare
-0.58
mobilize
-0.58
ⓧ
-0.58
POSITIVE LOGITS
vespa
1.11
chrysler
1.11
mitsubishi
1.08
opport
1.08
imposs
1.08
pican
1.07
peugeot
1.07
accla
1.07
suspic
1.06
bosch
1.05
Activations Density 0.060%