INDEX
Explanations
mentions and references to the word "gold."
references to the word 'gold' and its related contexts
New Auto-Interp
Negative Logits
ATIONS
-0.80
ROR
-0.78
zee
-0.78
ally
-0.73
========
-0.73
Consent
-0.71
Alive
-0.69
reperto
-0.68
Homeless
-0.66
Stras
-0.65
POSITIVE LOGITS
vertisement
1.20
smith
1.05
medal
1.02
coins
1.00
fish
0.96
medals
0.89
stone
0.88
jewelry
0.88
mine
0.88
mund
0.86
Activations Density 0.017%