INDEX
Explanations
words related to locations or spaces away from central areas, or at edges
words related to edges or boundaries
Negative Logits
ROCK      -0.64
POWER     -0.64
Destroy   -0.63
ATK       -0.60
SUP       -0.59
bourg     -0.58
OTA       -0.58
bait      -0.57
hypnot    -0.57
DIRECT    -0.57
Positive Logits
inges      4.07
inged      1.71
rium       1.19
agher      1.15
periphery  0.94
outset     0.87
opian      0.85
irez       0.84
unal       0.82
eteria     0.80
Activations Density: 0.034%