INDEX
Explanations
words related to achieving success or making progress
gerunds and present participles related to actions
New Auto-Interp
Negative Logits
wa
-0.62
Bei
-0.62
Bai
-0.59
rium
-0.57
apples
-0.57
apple
-0.57
\<
-0.56
aple
-0.56
OTUS
-0.56
Toby
-0.56
POSITIVE LOGITS
kefeller
0.99
restling
0.89
backer
0.87
issance
0.79
IGH
0.74
Racer
0.73
itual
0.73
PAC
0.72
esh
0.72
iking
0.71
Activations Density 0.057%