INDEX
Explanations
instances of phrases indicating starting or initiating actions
New Auto-Interp
Negative Logits
.scalablytyped
-0.18
اÙĪÙĦÛĮÙĨ
-0.18
Starting
-0.17
åħ¥åı£
-0.17
Starting
-0.17
(first
-0.16
第ä¸Ģ
-0.15
starting
-0.15
Entry
-0.15
first
-0.14
POSITIVE LOGITS
begin
0.52
begin
0.47
.begin
0.43
Begin
0.41
begins
0.40
Begin
0.40
began
0.39
BEGIN
0.37
BEGIN
0.35
begun
0.35
Activations Density 0.019%