INDEX
Explanations
phrases relating to daily routines and work-life balance
New Auto-Interp
Negative Logits
avra
-0.15
oust
-0.14
ka
-0.14
ضÙħ
-0.14
umar
-0.14
itary
-0.14
acea
-0.14
353
-0.14
uite
-0.13
Ã¥l
-0.13
POSITIVE LOGITS
work
0.75
work
0.57
Work
0.52
-work
0.50
Work
0.48
_work
0.47
lavoro
0.44
ä»ķäºĭ
0.44
trabajo
0.43
trabalho
0.43
Activations Density 0.190%