INDEX
Negative Logits
erina
0.46
auf
0.41
()>
0.39
$>
0.39
入り
0.38
вход
0.38
harris
0.38
બંધ
0.37
('#0.37
ໃຫ້
0.37
POSITIVE LOGITS
```
0.48
certainly
0.47
Certainly
0.46
yes
0.46
Yes
0.45
Certainly
0.45
Below
0.44
Yes
0.43
Below
0.43
kyll
0.41
Activations Density 0.021%