INDEX
Explanations
keywords related to work and care
mentions of work and care-related activities or responsibilities
New Auto-Interp
Negative Logits
disappearing
-0.73
circling
-0.65
popping
-0.64
flo
-0.62
ruining
-0.61
turning
-0.60
Newsp
-0.60
collapsing
-0.60
seizing
-0.60
headlines
-0.59
POSITIVE LOGITS
uate
0.90
dress
0.88
ociate
0.88
enance
0.82
itate
0.78
ative
0.76
efficiently
0.76
aintain
0.75
properly
0.75
adequately
0.75
Activations Density 0.246%