INDEX
Explanations
exclamatory expressions or interjections
punctuation marks, specifically exclamation marks and periods
New Auto-Interp
Negative Logits
necess
-0.82
accomp
-0.78
favoring
-0.72
surpr
-0.68
plagued
-0.68
exting
-0.67
ittered
-0.67
administ
-0.67
enriched
-0.66
targ
-0.65
POSITIVE LOGITS
Didn
1.11
You
1.09
Alright
1.08
Anyway
1.08
There
1.07
Looks
1.05
Lots
1.04
Somebody
1.03
Sorry
1.03
Seriously
1.03
Activations Density 0.263%