INDEX
Explanations
adjectives describing enjoyment or desirability
occurrences of the word "to"
New Auto-Interp
Negative Logits
afety
-0.75
calling
-0.72
hent
-0.71
arity
-0.71
vich
-0.70
urances
-0.66
uli
-0.62
urance
-0.62
eon
-0.61
hur
-0.59
POSITIVE LOGITS
behold
1.32
contemplate
1.21
begin
1.03
navigate
1.00
contend
0.96
ggles
0.95
undertake
0.93
pursue
0.92
administer
0.92
acquire
0.91
Activations Density 0.157%