Explanations
verbs related to supervision and management
Negative Logits
Token              Logit
\                  -0.81
(no token shown)   -0.76
1                  -0.72
отношению          -0.71
p                  -0.69
0                  -0.68
M                  -0.68
N                  -0.68
N                  -0.68
h                  -0.67
Positive Logits
Token              Logit
doubtnut           1.48
myſelf             1.34
themſelves         1.30
Monfieur           1.26
ſever              1.26
ſelf               1.22
poffible           1.22
Theſe              1.20
leſs               1.20
pleaſure           1.19
Activations Density: 0.018%