INDEX
Explanations
instances of positions being appointed or replaced
New Auto-Interp
Negative Logits
patch
-0.85
Ïģ
-0.79
λ
-0.76
iday
-0.74
pack
-0.74
pling
-0.74
psc
-0.70
fal
-0.70
olver
-0.68
iland
-0.67
POSITIVE LOGITS
appointments
1.02
appoint
0.92
ees
0.90
appointment
0.89
appointed
0.86
ointed
0.85
eering
0.84
ineligible
0.84
ancies
0.82
lies
0.80
Activations Density 14.615%