INDEX
Explanations
phrases related to hours worked or specified in a context
Negative Logits
dinand      -0.94
emort       -0.89
mson        -0.77
\\\\\\\\    -0.75
tymology    -0.73
tarian      -0.73
Sov         -0.72
Lex         -0.71
sonian      -0.71
illac       -0.71
Positive Logits
hift        1.20
pring       1.20
cale        0.99
mith        0.95
ilver       0.94
ourcing     0.88
hare        0.87
pread       0.87
Ago         0.86
creen       0.86
Activations Density: 0.086%