INDEX
Negative Logits
rious
-0.76
ACTED
-0.73
à¨
-0.73
rous
-0.70
lder
-0.67
brance
-0.65
ASED
-0.64
Countdown
-0.64
conspicuous
-0.62
urated
-0.62
POSITIVE LOGITS
etti
1.02
bach
1.02
iter
0.97
olini
0.92
inson
0.92
andowski
0.89
aunders
0.89
endale
0.85
andra
0.84
lyn
0.83
Activations Density 0.030%