Explanations
occurrences of the word "Nach."
Negative Logits
Token       Logit
¶Į          -0.18
BITS        -0.17
hlen        -0.16
atcher      -0.15
stakes      -0.15
ASSES       -0.15
630         -0.14
otto        -0.14
.dy         -0.14
änder       -0.14
Positive Logits
Token       Logit
fol         0.24
ward        0.24
dem         0.23
ts          0.20
wards       0.20
-effects    0.20
weis        0.19
tk          0.18
tm          0.18
completion  0.18
Activations Density 0.009%