INDEX
Explanations
phrases related to employment or tasks being carried out
New Auto-Interp
Negative Logits
Royale
-0.64
roared
-0.64
zeb
-0.64
VIDEOS
-0.63
ennes
-0.62
uden
-0.62
np
-0.62
LET
-0.62
invade
-0.61
oras
-0.61
POSITIVE LOGITS
working
3.41
Working
2.46
Working
2.44
working
2.41
collaborating
1.76
WORK
1.69
worked
1.60
work
1.57
functioning
1.48
cooperating
1.41
Activations Density 0.038%