INDEX
Explanations
symbols or punctuation that signify emphasis or emotional weight
New Auto-Interp
Negative Logits
cius
-0.68
Percent
-0.67
Unsure
-0.67
aida
-0.67
aniel
-0.64
claimer
-0.63
inguishable
-0.63
bane
-0.62
FORE
-0.61
pires
-0.61
POSITIVE LOGITS
it
0.70
behavi
0.68
behav
0.65
newsletters
0.63
Advertisements
0.62
observations
0.62
aspirations
0.62
feelings
0.62
plans
0.61
notebooks
0.61
Activations Density 0.172%