INDEX
Explanations
references to local music scenes and their cultural significance
Negative Logits
urum      -0.22
Zus       -0.15
upos      -0.14
iesta     -0.14
ignum     -0.14
pec       -0.13
which     -0.13
„M        -0.13
owie      -0.13
табли     -0.13
Positive Logits
już       0.23
jeszcze   0.23
przez     0.22
tu        0.21
na        0.21
bow       0.20
też       0.20
sobie     0.20
także     0.20
jednak    0.20
Activations Density: 0.051%