INDEX
Explanations
words related to physical discomfort or distress
gerunds or present participles in the text
New Auto-Interp
Negative Logits
bal
-0.72
omet
-0.63
onso
-0.63
rgb
-0.62
isible
-0.62
ŀ
-0.62
Neg
-0.59
Aerospace
-0.59
saf
-0.58
§
-0.58
POSITIVE LOGITS
havoc
0.89
tons
0.88
redients
0.88
ulate
0.83
river
0.74
itcher
0.72
bilt
0.72
noises
0.70
Squid
0.69
stocks
0.69
Activations Density 0.125%