INDEX
Explanations
words related to a controversial labor practice known as "karoshi," which is linked to excessive work leading to serious health issues or even death
references to specific cultural or ethnic identities
New Auto-Interp
Negative Logits
Moroc
-0.86
Accountability
-0.81
Own
-0.81
Yard
-0.80
Weather
-0.79
Lot
-0.77
Fn
-0.76
Heard
-0.76
Breed
-0.76
Notting
-0.75
POSITIVE LOGITS
ensis
1.07
acters
0.93
berries
0.77
bons
0.76
iles
0.74
igans
0.74
Äĵ
0.73
ettes
0.72
otic
0.72
istics
0.71
Activations Density 0.370%