INDEX
Negative Logits
adu
-0.18
backs
-0.16
atever
-0.15
inel
-0.15
fas
-0.14
ui
-0.14
220
-0.14
岸
-0.14
tings
-0.13
agon
-0.13
POSITIVE LOGITS
ers
0.24
Bridge
0.21
-based
0.18
istan
0.18
erry
0.18
Underground
0.18
etta
0.17
ale
0.17
subpackage
0.17
Metropolitan
0.17
Activations Density 0.013%