INDEX
Explanations
phrases indicating citizenship or national identity
New Auto-Interp
Negative Logits
RetVal
-0.16
hors
-0.15
agate
-0.15
erable
-0.15
agle
-0.15
üme
-0.14
èĤ¯å®ļ
-0.14
sume
-0.14
ogle
-0.14
inflate
-0.14
POSITIVE LOGITS
maken
0.26
gaan
0.26
doen
0.26
komen
0.25
laten
0.24
worden
0.24
kunnen
0.23
zien
0.23
hebben
0.22
willen
0.21
Activations Density 0.026%