INDEX
Explanations
words related to the concept of supervision or overseeing
terms related to supervision and oversight
Negative Logits
Token    Logit
UES      -0.74
fixes    -0.71
grave    -0.70
tics     -0.69
tr       -0.69
ml       -0.68
Sov      -0.66
bis      -0.66
anne     -0.66
utter    -0.64
Positive Logits
Token          Logit
supervision    1.17
supervised     0.97
Instruct       0.88
probation      0.86
superv         0.84
oversee        0.78
rador          0.73
vised          0.71
confir         0.69
manship        0.69
Activations Density 0.016%