INDEX
NEGATIVE LOGITS
terrible     -0.07
Reese        -0.06
Pastor       -0.06
상           -0.06
اشة          -0.06
utilities    -0.06
(((          -0.06
潮           -0.06
γγ           -0.06
<State       -0.06
POSITIVE LOGITS
_seg         0.07
Execution    0.06
endeavors    0.06
.lr          0.06
spacecraft   0.06
fp           0.06
punk         0.06
upward       0.06
dönüş        0.06
!")          0.06
Activations Density 0.002%