INDEX
Explanations
references to the concept of "home."
New Auto-Interp
Negative Logits
sect
-0.77
alias
-0.75
roup
-0.74
conn
-0.73
erness
-0.72
etting
-0.69
thodox
-0.69
href
-0.68
azi
-0.67
Files
-0.67
POSITIVE LOGITS
trophies
0.82
prizes
0.74
NX
0.72
podium
0.70
bragging
0.68
trophy
0.67
Ribbon
0.67
Wii
0.67
victory
0.66
$
0.65
Activations Density 0.009%