INDEX
Explanations
references to personal or possessive pronouns
Negative Logits
vault         -0.17
(es           -0.16
iaux          -0.15
esus          -0.15
ials          -0.14
abled         -0.14
ibli          -0.14
Vault         -0.14
skipping      -0.14
(s            -0.13
POSITIVE LOGITS
presenter     0.19
present       0.17
present       0.17
_CHAN         0.17
presentation  0.16
presenting    0.16
Treat         0.15
Present       0.15
présent       0.15
Present       0.15
Activations Density: 0.023%