INDEX
Negative Logits
irin
-0.08
ctor
-0.07
lette
-0.07
vald
-0.07
>::
-0.07
iros
-0.07
mers
-0.07
ação
-0.07
拔
-0.07
atórias
-0.07
POSITIVE LOGITS
lágr
0.09
lacag
0.08
എണ്ണം
0.08
мае
0.08
severely
0.08
কাপ
0.08
সে
0.08
页
0.08
significantly
0.08
deren
0.08
Activations Density 0.212%