INDEX
Explanations
phrases related to job suitability and cultural fit in the workplace
New Auto-Interp
Negative Logits
заб
-0.16
ihan
-0.15
embali
-0.14
avors
-0.14
awns
-0.14
useful
-0.13
idal
-0.13
머
-0.13
illeg
-0.13
Useful
-0.13
POSITIVE LOGITS
fit
0.75
-fit
0.62
fits
0.61
Fit
0.60
fit
0.60
match
0.57
Fit
0.56
_fit
0.51
.fit
0.50
FIT
0.49
Activations Density 0.128%