Explanations
verbs that indicate announcements and confirmations regarding events or information
Negative Logits
warts     -0.15
ld        -0.15
xp        -0.14
宣        -0.14
NOTICE    -0.14
arring    -0.14
ken       -0.14
atif      -0.14
ldr       -0.13
Becker    -0.13
Positive Logits
during    0.17
uze       0.17
via       0.16
earlier   0.15
iese      0.15
uesday    0.15
unei      0.15
дап       0.14
于        0.14
upa       0.14
Activations Density 0.115%
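
As a rough illustration of how a density figure like the one above can be read, the sketch below assumes it is the fraction of token positions on which the feature's activation is nonzero. That definition, the function name `activation_density`, and the sample data are assumptions made for illustration; they are not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition (not confirmed by the source): a position counts as
    "active" when its activation is strictly greater than the threshold.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up sample: 20,000 token positions, 23 of which are active.
sample = [0.0] * 19977 + [1.0] * 23
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.115% means the feature fires on roughly 1 token in 870.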