INDEX
Explanations
occurrences of the word "Nach", indicating a focus on sequences or events following a certain point in time
New Auto-Interp
Negative Logits
atcher
-0.16
.dy
-0.16
ê³³
-0.15
layers
-0.15
hlen
-0.14
stal
-0.14
hic
-0.14
_matched
-0.14
stakes
-0.14
incontri
-0.14
POSITIVE LOGITS
fol
0.23
dem
0.19
tk
0.19
weis
0.17
ts
0.17
td
0.16
noon
0.16
tm
0.16
dem
0.16
ting
0.15
Activations Density 0.008%