INDEX
Explanations
dialogue with dynamic emotional expressions and occasional exclamations
New Auto-Interp
Negative Logits
imes
-0.62
habit
-0.58
wrists
-0.58
barr
-0.56
degraded
-0.54
wasteful
-0.53
favourable
-0.52
accumulated
-0.51
warr
-0.51
patronage
-0.51
POSITIVE LOGITS
atown
0.68
Somebody
0.67
Nobody
0.67
Everyone
0.65
Sometimes
0.65
cue
0.64
Everybody
0.64
/"
0.64
Then
0.63
Everybody
0.61
Activations Density 9.894%