INDEX
Negative Logits
chern
-0.08
enthusiasts
-0.08
contenders
-0.08
.loading
-0.08
(combo
-0.08
erren
-0.08
nelly
-0.08
umers
-0.08
-loading
-0.07
.combo
-0.07
POSITIVE LOGITS
omn
0.11
GPT
0.09
...,
0.08
GPT
0.08
assistants
0.08
...,
0.08
ACE
0.08
assistant
0.08
reproduce
0.07
Assistant
0.07
Activations Density 0.443%