INDEX
Explanations
words related to employment and working
references to employment or labor-related activities
New Auto-Interp
Negative Logits
ylon
-0.72
anamo
-0.72
Flavoring
-0.69
wcs
-0.68
Cricket
-0.65
EStream
-0.65
asar
-0.64
————
-0.62
Bubble
-0.61
Predators
-0.60
POSITIVE LOGITS
bench
1.16
ethic
1.15
station
1.06
flows
1.05
manship
1.04
hops
0.98
work
0.91
mate
0.85
horse
0.84
fare
0.78
Activations Density 0.069%