INDEX
NEGATIVE LOGITS
needlessly  0.41
zoomSeekBar  0.40
喡  0.40
뭅  0.40
𓇼  0.39
柣  0.38
streamlines  0.37
બા  0.37
unruly  0.36
streamlined  0.36
POSITIVE LOGITS
0.92
0.91
0.86
0.86
0.80
0.78
0.77
0.77
0.77
0.75
Activations Density 0.038%