INDEX
Explanations
instances of denial or negation
New Auto-Interp
Negative Logits
opsy
-0.14
rak
-0.14
icit
-0.14
ueur
-0.14
aciente
-0.14
âce
-0.14
ê³µ
-0.14
chia
-0.13
elman
-0.13
.shows
-0.13
POSITIVE LOGITS
FIT
0.17
Charleston
0.17
Hol
0.17
South
0.17
South
0.15
Clemson
0.15
Haley
0.15
FIT
0.15
cooker
0.15
(
0.14
Activations Density 0.000%