INDEX
Explanations
mentions of the word "old" in various contexts
the word "gold" in various contexts
Negative Logits
Token        Logit
Downloadha   -0.73
BILITY       -0.72
senal        -0.71
FANTASY      -0.67
OPLE         -0.66
involved     -0.66
ESA          -0.65
Pwr          -0.64
dism         -0.61
FUL          -0.61
Positive Logits
Token        Logit
orf          1.18
ynam         1.10
ouble        1.05
roid         1.04
rums         1.03
ership       0.97
ritch        0.95
irect        0.94
ynamic       0.92
iesel        0.92
Activations Density: 0.019%