INDEX
Explanations
phrases with a sense of finality or emphasis
punctuation marks, particularly periods and exclamation points
New Auto-Interp
Negative Logits
representatives
-0.77
affili
-0.74
substantive
-0.72
policies
-0.69
eligibility
-0.69
assessments
-0.68
acknowled
-0.66
commitments
-0.66
contributions
-0.65
aggreg
-0.65
POSITIVE LOGITS
wav
1.01
jpg
0.97
gif
0.97
Sounds
0.90
cue
0.87
Yeah
0.85
Yep
0.84
Literally
0.84
huh
0.83
Nope
0.82
Activations Density 0.477%