INDEX
Explanations
elements and references related to Danish political and social structures
New Auto-Interp
Negative Logits
nakne
-0.18
ragaz
-0.17
\Bridge
-0.15
porrf
-0.15
pornofil
-0.15
Ha
-0.14
pornost
-0.14
Ī
-0.14
Lor
-0.14
Bog
-0.14
POSITIVE LOGITS
var
0.25
hav
0.22
fik
0.20
blev
0.20
er
0.19
ville
0.18
overt
0.18
dro
0.17
vil
0.17
lod
0.16
Activations Density 0.014%