INDEX
Explanations
references to labor practices and professional responsibilities
New Auto-Interp
Negative Logits
FIX
-0.17
fix
-0.16
aim
-0.16
abee
-0.16
trying
-0.15
clock
-0.15
let
-0.15
appen
-0.15
figures
-0.14
Appearance
-0.14
POSITIVE LOGITS
pro
0.22
effect
0.18
effectively
0.18
-effect
0.18
hol
0.18
marshal
0.17
better
0.17
mange
0.17
Effect
0.17
effect
0.16
Activations Density 0.399%