INDEX
Explanations
references to the precious metal "gold"
mentions of gold
Negative Logits
ATIONS      -0.80
zee         -0.77
Consent     -0.74
ally        -0.74
ROR         -0.74
Stras       -0.70
reperto     -0.68
Debor       -0.68
mble        -0.65
========    -0.65
Positive Logits
vertisement  1.10
medal        1.05
coins        1.03
smith        0.98
fish         0.97
jewelry      0.92
medals       0.91
stone        0.88
mine         0.84
coin         0.83
Activations Density 0.014%