INDEX
Explanations
the beginnings of events or actions
phrases indicating the beginning of events or stories
Negative Logits
nor            -0.65
congratulated  -0.64
wa             -0.64
cedented       -0.63
ib             -0.63
phe            -0.62
hold           -0.61
refresh        -0.60
orth           -0.60
wd             -0.59
Positive Logits
innoc          0.89
WithNo         0.79
genesis        0.77
actionDate     0.76
anew           0.74
Ò              0.73
misunder       0.72
abruptly       0.71
Magikarp       0.70
spontaneously  0.70
Activations Density 0.088%