INDEX
Explanations
frequent occurrences of the word "de" in various contexts
Followed by a preposition
"de" followed by nouns
New Auto-Interp
Negative Logits
faſt
-0.66
raiſ
-0.63
pleaſure
-0.60
slutt
-0.59
chrétien
-0.58
ſtand
-0.56
ainfi
-0.56
ſever
-0.56
ſta
-0.54
ſelf
-0.54
POSITIVE LOGITS
de
1.34
De
0.91
of
0.91
di
0.88
De
0.86
OF
0.79
DE
0.78
von
0.75
ของ
0.75
де
0.72
Activations Density 0.069%