INDEX
Explanations
commands or phrases indicating initiation or beginning actions
Negative Logits
שוליים        -0.75
actualité     -0.70
Errorf        -0.68
              -0.68
contentType   -0.66
tiker         -0.66
Waray         -0.65
Professions   -0.63
بوابة         -0.63
numberWith    -0.62
Positive Logits
start         2.21
started       2.03
starts        1.99
starting      1.86
Start         1.82
Starts        1.71
begin         1.68
START         1.63
Started       1.61
started       1.60
Activations Density 0.103%
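
The density figure above is commonly read as the fraction of sampled token positions on which the feature fires. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample data are assumptions for illustration, not the dashboard's actual implementation or data.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a feature counts as "firing" on a token when its activation
    is strictly greater than the threshold (zero by default).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: the feature fires on 1 of 1000 token positions.
sample = [0.0] * 999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.100%
```

Under this reading, a density of 0.103% would mean the feature is active on roughly one token in a thousand, which fits a feature that responds narrowly to "start"/"begin" tokens.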