INDEX
Explanations
adjectives or adverbs indicating a high degree or intensity
the word "so" indicating emphasis or intensity
New Auto-Interp
Negative Logits
wreck
-0.60
glances
-0.60
eviction
-0.59
acre
-0.57
ropolitan
-0.55
gallery
-0.54
theless
-0.54
{:-0.53
haven
-0.53
idon
-0.53
POSITIVE LOGITS
bered
1.19
oths
1.12
othes
1.11
oooo
1.05
apy
1.04
ooo
1.02
othe
1.01
oner
0.99
zin
0.99
aps
0.96
Activations Density 0.126%