INDEX
Explanations
personal pronouns and expressions of personal experience or opinion
New Auto-Interp
Negative Logits
uter
-0.15
Presence
-0.14
oods
-0.14
iasco
-0.14
iae
-0.14
skup
-0.14
utta
-0.14
rise
-0.13
Toll
-0.13
//~
-0.13
POSITIVE LOGITS
frequently
0.23
often
0.18
oft
0.18
encounter
0.17
encounters
0.17
witness
0.16
personally
0.16
commonly
0.15
increasingly
0.15
firm
0.15
Activations Density 0.130%