INDEX
Explanations
instances of the word "run" in multiple contexts
New Auto-Interp
Negative Logits
Majefty
-0.90
myſelf
-0.87
EGL
-0.82
Northampton
-0.81
Southgate
-0.81
ainfi
-0.80
Testi
-0.78
pleaſure
-0.77
Hygge
-0.76
houſe
-0.75
POSITIVE LOGITS
run
1.60
Run
1.48
runs
1.46
run
1.46
RUN
1.45
RUN
1.41
Run
1.39
runs
1.36
Runs
1.35
running
1.33
Activations Density 0.069%