INDEX
Explanations
words containing the sequence "dist" or "dist" followed by a number
words related to distress or negative emotional states
Negative Logits
tes        -0.87
swick      -0.82
glers      -0.82
theless    -0.76
ton        -0.72
ggle       -0.70
phone      -0.70
LOAD       -0.69
FORE       -0.68
FIELD      -0.65
Positive Logits
anced      1.18
ribut      1.17
illery     1.01
ancing     1.00
dist       0.95
aste       0.95
ributes    0.94
ances      0.88
enfranch   0.88
antly      0.86
Activations Density: 0.007%