INDEX
Negative Logits
Token       Logit
Pee         -0.08
policeman   -0.08
herence     -0.08
brief       -0.07
Brian       -0.07
Sean        -0.07
_dm         -0.07
Drum        -0.07
small       -0.07
(timer      -0.07
Positive Logits
Token       Logit
oxidation   0.14
oxid        0.13
oxid        0.09
Ox          0.09
oxidative   0.08
oxide       0.08
ox          0.07
�           0.07
oxide       0.07
+'_         0.07
Activations Density 0.017%