INDEX
Explanations
mentions of personal experiences and emotions
New Auto-Interp
Negative Logits
sucks
-0.39
Appears
-0.36
pires
-0.36
requires
-0.36
)?
-0.35
WATCH
-0.34
Goo
-0.34
depends
-0.34
=================================
-0.34
comes
-0.34
POSITIVE LOGITS
enthusi
0.39
tended
0.37
earlier
0.35
cumbers
0.35
arranging
0.35
initially
0.34
Was
0.34
hoped
0.33
beforehand
0.32
Had
0.32
Activations Density 4.708%