INDEX
Explanations
occurrences of the word "work" and its variants, indicating a focus on the concept of work or labor-related contexts
New Auto-Interp
Negative Logits
clid
-0.17
592
-0.16
e
-0.16
urge
-0.15
irm
-0.15
lor
-0.14
mot
-0.14
quires
-0.14
thal
-0.14
ERCHANT
-0.14
POSITIVE LOGITS
ktop
0.19
Ãłnh
0.18
INGTON
0.18
hest
0.18
wart
0.17
ombat
0.16
ingga
0.16
akit
0.15
robe
0.15
าย
0.15
Activations Density 0.011%