INDEX
Explanations
references to the term "Golden," which appears in various contexts throughout the document
New Auto-Interp
Negative Logits
noch
-0.17
igr
-0.17
les
-0.17
ual
-0.16
apon
-0.16
iah
-0.15
vin
-0.15
ata
-0.15
shint
-0.15
ivals
-0.15
POSITIVE LOGITS
rod
0.32
retrie
0.28
rule
0.19
ratio
0.19
opportunity
0.18
golden
0.18
baum
0.18
berg
0.18
Retrie
0.18
eye
0.17
Activations Density 0.008%