INDEX
Explanations
sentences expressing personal emotions and experiences
NEGATIVE LOGITS
thereby     -0.86
according   -0.77
aimed       -0.76
Allows      -0.76
LEASE       -0.73
namely      -0.72
thood       -0.71
perse       -0.71
require     -0.69
nesty       -0.69
POSITIVE LOGITS
slightest   1.27
rest        1.24
whole       1.19
guy         1.18
entirety    1.17
coolest     1.17
proverbial  1.11
entire      1.09
same        1.08
smallest    1.07
Activations Density 2.424%