INDEX
Explanations
the term "urban" in various contexts
New Auto-Interp
Negative Logits
aler
-0.17
erland
-0.16
OME
-0.16
udent
-0.16
aus
-0.15
÷
-0.15
ikan
-0.15
oons
-0.14
autop
-0.14
Blick
-0.14
POSITIVE LOGITS
ization
0.30
ized
0.27
ites
0.25
isation
0.24
/sub
0.23
izing
0.23
ite
0.23
icity
0.22
izations
0.20
ised
0.19
Activations Density 0.011%