INDEX
Explanations
phrases related to success or achieving a goal
instances of the word "hit" in various contexts
New Auto-Interp
Negative Logits
pires
-0.87
ç«
-0.72
inent
-0.71
agin
-0.68
æĢ
-0.67
æ©Ł
-0.66
orld
-0.66
otive
-0.64
pite
-0.62
cia
-0.62
POSITIVE LOGITS
ched
1.08
hit
0.82
achi
0.82
tle
0.80
boxes
0.78
ches
0.77
ted
0.73
puberty
0.73
arro
0.71
gerald
0.71
Activations Density 0.027%