INDEX
Explanations
occurrences of the name "Manuel" and variations related to manual actions or descriptions
New Auto-Interp
Negative Logits
contr
-0.19
cord
-0.18
e
-0.17
veteran
-0.16
mented
-0.16
first
-0.16
res
-0.15
aget
-0.15
Paper
-0.14
Contr
-0.14
POSITIVE LOGITS
pek
0.20
Linh
0.18
fbe
0.17
loy
0.16
ossip
0.16
.UnitTesting
0.15
teri
0.15
é̏
0.15
porte
0.14
DISCLAIM
0.14
Activations Density 0.007%