INDEX
Explanations
variations of the preposition "för" (meaning "for") in different contexts
Negative Logits
arias    -0.16
mar      -0.16
aria     -0.15
MOVE     -0.15
ανδ      -0.15
prow     -0.15
419      -0.14
Dent     -0.14
fid      -0.14
-live    -0.14
Positive Logits
lj       0.29
rf       0.27
ret      0.25
dda      0.24
re       0.24
dd       0.24
rs       0.24
rete     0.23
rl       0.23
rest     0.22
Activations Density: 0.007%