INDEX
Explanations
the word 'so' with varying intensities
repeated use of the phrase "so-called."
Negative Logits
Token          Logit
theless        -0.63
works          -0.63
eviction       -0.59
glances        -0.58
wiser          -0.57
expectancy     -0.57
Mens           -0.55
geist          -0.55
silhouette     -0.54
Contemporary   -0.54
Positive Logits
Token          Logit
oths           1.25
othes          1.20
bered          1.14
apy            1.07
othe           1.04
oooo           1.01
oner           0.96
bs             0.95
ooo            0.94
iled           0.93
Activations Density: 0.113%