INDEX
Explanations
words related to starting or initiating actions
instances of the word "begin" in various contexts
Negative Logits
aths      -0.71
allo      -0.70
rats      -0.69
visor     -0.67
otor      -0.67
houses    -0.67
ado       -0.67
acho      -0.65
oted      -0.65
tor       -0.65
Positive Logits
anew      1.11
"$:/      0.78
ITIES     0.72
WithNo    0.72
nings     0.71
icago     0.70
ende      0.70
20439     0.68
igmatic   0.68
mble      0.68
Activations Density: 0.040%