INDEX
Explanations
words related to a specific cultural or geographical identity
references to the Kansai region or its people
New Auto-Interp
Negative Logits
kefeller
-0.92
ADS
-0.78
ptive
-0.76
PRESS
-0.74
Canaver
-0.73
buds
-0.73
bable
-0.72
nces
-0.70
TY
-0.68
ERY
-0.68
POSITIVE LOGITS
alis
1.01
hee
0.98
hip
0.94
hin
0.94
igans
0.93
uman
0.90
omal
0.90
laughter
0.87
haw
0.83
thood
0.83
Activations Density 0.015%