INDEX
Explanations
mentions of places and events
New Auto-Interp
Negative Logits
xm
-0.17
upa
-0.15
573
-0.15
Logic
-0.15
ł
-0.14
iban
-0.14
ayo
-0.14
orus
-0.14
uch
-0.13
Sabb
-0.13
POSITIVE LOGITS
odash
0.19
ibrator
0.16
405
0.15
çª
0.15
519
0.15
oger
0.15
-Clause
0.15
_drv
0.14
strup
0.14
aters
0.14
Activations Density 0.190%