INDEX
Explanations
phrases signaling a conversational tone or indicating emphasis
expressions indicating acknowledgment or familiarity
Negative Logits
Token       Logit
utenberg    -0.90
stad        -0.75
omal        -0.74
iets        -0.72
otom        -0.71
aez         -0.70
OGR         -0.70
anmar       -0.69
isc         -0.67
pex         -0.66
Positive Logits
Token       Logit
terday      0.79
lege        0.76
how         0.71
whats       0.69
WHAT        0.67
ledged      0.67
darn        0.65
what        0.65
abella      0.64
why         0.63
Activations Density 0.040%