INDEX
Explanations
phrases related to beginnings or starting actions
New Auto-Interp
Negative Logits
Theſe
-0.62
juſt
-0.57
ſtre
-0.57
مض
-0.56
Chrift
-0.56
fwrite
-0.54
Hift
-0.54
Jefus
-0.53
ihnachten
-0.53
Efq
-0.53
POSITIVE LOGITS
Begin
0.89
begin
0.87
BeginContext
0.87
inici
0.86
はじめに
0.83
Commencez
0.80
begins
0.77
Begin
0.75
beginnen
0.74
began
0.74
Activations Density 0.180%