INDEX
Explanations
mentions of Singapore and related locations
New Auto-Interp
Negative Logits
iners
-0.18
berger
-0.16
oday
-0.16
dif
-0.15
insk
-0.15
sole
-0.15
ysa
-0.15
adge
-0.15
uf
-0.14
enburg
-0.14
POSITIVE LOGITS
ans
0.20
an
0.18
osl
0.17
-based
0.17
prene
0.17
ersen
0.16
Sing
0.16
apolis
0.15
-bound
0.15
stan
0.15
Activations Density 0.008%