INDEX
Explanations
mentions of the word "Gold" followed by a single character such as a number or a letter
repeated occurrences of the term "Gold 2"
New Auto-Interp
Negative Logits
ATIONS
-0.99
ROR
-0.68
reperto
-0.66
================================================================
-0.65
========
-0.63
abdom
-0.63
ufact
-0.62
ABLE
-0.61
aneously
-0.59
informational
-0.59
POSITIVE LOGITS
vertisement
1.19
smith
1.15
finger
1.07
rush
1.07
mund
1.06
stein
1.03
sch
1.02
wyn
0.98
fields
0.95
stone
0.94
Activations Density 0.030%