INDEX
Explanations
expressions of gratitude or requests in a polite manner
expressions of desire or preference
New Auto-Interp
Negative Logits
idious
-0.64
ccording
-0.63
onut
-0.62
VERTISEMENT
-0.62
ulty
-0.62
livious
-0.60
rift
-0.59
ascus
-0.59
abal
-0.58
ashtra
-0.57
POSITIVE LOGITS
clarification
0.86
to
0.84
assurances
0.79
thereto
0.71
nothing
0.68
someone
0.67
somebody
0.66
something
0.65
seeing
0.61
luck
0.60
Activations Density 0.080%