INDEX
Negative Logits
ngth
-0.86
tery
-0.80
lihood
-0.79
andum
-0.78
ãĤ´ãĥ³
-0.73
FORM
-0.71
MENT
-0.71
×ķ
-0.70
ASED
-0.66
vasive
-0.65
POSITIVE LOGITS
oos
1.05
ornament
1.03
ed
1.03
ie
1.01
tip
0.96
hood
0.91
sie
0.91
ies
0.88
skin
0.86
edo
0.84
Activations Density 0.018%