INDEX
Explanations
Instances of the word "de".
Negative Logits
myſelf        -1.50
Theſe         -1.48
Monfieur      -1.47
Anſ           -1.43
itſelf        -1.41
themſelves    -1.40
pleaſure      -1.36
ſeveral       -1.35
himſelf       -1.35
cauſe         -1.34
Positive Logits
de            3.75
De            2.53
De            2.16
DE            1.77
de            1.64
де            1.58
des           1.39
del           1.37
di            1.26
du            1.18
Activations Density 0.068%