INDEX
Explanations
phrases related to consequences or outcomes
special characters, particularly the symbol "âĢ"
Negative Logits
dispers              -0.75
wagen                -0.72
confinement          -0.68
mamm                 -0.66
grips                -0.66
anwhile              -0.66
shroud               -0.65
BMC                  -0.65
guiActiveUnfocused   -0.64
contraception        -0.64
Positive Logits
¹   1.34
į   1.30
º   1.28
¤   1.26
ª   1.24
Ķ   1.20
¡   1.18
£   1.17
ĵ   1.17
Ń   1.16
Activations Density 0.093%
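
The "âĢ" symbol in the second explanation and the single-character tokens under the positive logits look like the printable stand-ins that a GPT-2 style byte-level BPE vocabulary uses for raw bytes. The source does not name the tokenizer, so that is an assumption. Under it, "âĢ" would be the bytes E2 80, which start the UTF-8 encoding of general punctuation such as curly quotes and dashes, and each positive-logit token would be a candidate third byte. The sketch below rebuilds the byte table and decodes each combination; the helper names `bytes_to_unicode` and `to_bytes` are chosen here for illustration.

```python
def bytes_to_unicode():
    """Byte -> printable character table, as in GPT-2 style byte-level BPE (assumed)."""
    # Bytes that are already printable map to themselves.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: displayed character -> raw byte value.
UNICODE_TO_BYTE = {c: b for b, c in bytes_to_unicode().items()}


def to_bytes(token: str) -> bytes:
    """Convert a displayed token string back to the raw bytes it stands for."""
    return bytes(UNICODE_TO_BYTE[ch] for ch in token)


prefix = to_bytes("âĢ")  # the symbol named in the explanation
positive_tokens = ["¹", "į", "º", "¤", "ª", "Ķ", "¡", "£", "ĵ", "Ń"]

for tok in positive_tokens:
    raw = prefix + to_bytes(tok)
    char = raw.decode("utf-8")
    # Print the code point rather than the character, since some are invisible.
    print(tok, raw.hex(), f"U+{ord(char):04X}")
```

If the assumption holds, every top positive token completes "âĢ" into a single valid character in the U+2000 block, which would be consistent with the feature firing on the first two bytes of a special character and predicting the third.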