INDEX
Negative Logits
nowrap
-0.06
Welfare
-0.06
Cap
-0.06
.coll
-0.06
087
-0.06
_nl
-0.06
setProperty
-0.06
Pollution
-0.06
-consuming
-0.06
017
-0.06
POSITIVE LOGITS
team
0.08
classes
0.07
gadgets
0.07
zą
0.07
.urls
0.07
teammates
0.07
tribute
0.07
VES
0.07
Baseball
0.06
=tf
0.06
Activations Density 0.003%