INDEX
Explanations
expressions of appreciation and positive feedback regarding quality of work or performance
Negative Logits
ặt          -0.16
jez         -0.15
zi          -0.15
@js         -0.14
igy         -0.13
енка        -0.13
COPE        -0.13
247         -0.13
_utilities  -0.13
iran        -0.13
Positive Logits
job         1.52
job         1.25
Job         1.24
Job         1.14
-job        1.11
JOB         1.10
jobs        1.06
_job        1.01
.job        0.95
(job        0.91
Activations Density 0.143%