INDEX
Explanations
the word "beginning"
occurrences of the phrase "at the beginning"
Negative Logits
incinn      -0.70
aths        -0.68
hemat       -0.61
verbs       -0.60
OUP         -0.56
ternity     -0.56
avorite     -0.56
dain        -0.55
dyed        -0.54
sacrific    -0.53
Positive Logits
stages      1.14
of          1.04
thereof     0.99
phases      0.87
nings       0.81
phase       0.79
liest       0.77
OF          0.75
of          0.75
stage       0.75
Activations Density 0.030%