INDEX
Negative Logits
brib
0.29
policing
0.28
eiro
0.28
incênd
0.27
Confira
0.27
ισμό
0.27
confiscated
0.27
Owned
0.27
໊
0.27
refused
0.26
POSITIVE LOGITS
olika
0.27
ancien
0.26
朼
0.25
Am
0.25
subalgebra
0.24
сіі
0.24
Various
0.24
essi
0.24
alten
0.23
indicato
0.23
Activations Density 0.059%