Explanations
phrases that suggest starting or initiating something significant
Negative Logits
ocale            -0.16
reste            -0.16
SystemService    -0.15
inha             -0.15
jít              -0.15
ër               -0.15
oose             -0.14
姓               -0.14
porto            -0.14
ctors            -0.14
Positive Logits
bang             0.29
Bang             0.22
bang             0.18
basics           0.18
followed         0.18
Bang             0.17
humble           0.17
premise          0.17
est              0.17
foundation       0.16
Activations Density 0.079%