INDEX
Explanations
references to the word "golden" or its variations in different contexts
New Auto-Interp
Negative Logits
igr
-0.18
noch
-0.17
les
-0.16
iah
-0.15
ual
-0.15
vin
-0.15
759
-0.15
apon
-0.15
¸ı
-0.14
ivals
-0.14
POSITIVE LOGITS
rod
0.32
retrie
0.28
rule
0.21
berg
0.19
opportunity
0.19
rule
0.19
eye
0.18
Rule
0.18
-rule
0.18
Retrie
0.17
Activations Density 0.008%