INDEX
Explanations
text related to decisions and opinions
the presence of a specific symbol or character in the text
Negative Logits
snail       -0.82
mainland    -0.69
Isle        -0.68
ric         -0.68
Mous        -0.64
antip       -0.64
DES         -0.63
DEM         -0.62
MET         -0.62
sacrific    -0.61
Positive Logits
Ŀ    1.73
¡    1.31
¦    1.23
°    1.19
ľ    1.16
©    1.15
«    1.13
Ķ    1.12
ĸ    1.10
¤    1.08
Activations Density 0.289%
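
A density figure like the one above is commonly read as the share of sampled tokens on which the feature fires. The source does not define the metric, so the sketch below assumes that reading: a token counts as active when its activation is strictly above a threshold of 0. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and used only for illustration.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of active tokens, as a percentage.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count tokens on which the feature is considered active.
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 3 active tokens out of 1000 sampled tokens.
sample = [0.0] * 997 + [0.4, 1.2, 0.05]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this assumed definition, a density of 0.289% would correspond to roughly 289 active tokens per 100,000 sampled.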