Explanations
phrases or words with a symbol or character followed by 'âĢ'
instances of the phrase "I’m" or similar first-person expressions conveying personal sentiment
Negative Logits
Token                 Logit
Zup                   -0.71
Tid                   -0.69
whichever             -0.68
guiActiveUnfocused    -0.66
dispers               -0.66
scattering            -0.63
feeding               -0.62
Palest                -0.62
Yon                   -0.62
publicity             -0.60
Positive Logits
Token                 Logit
ª                     1.37
£                     1.22
Ķ                     1.19
¤                     1.17
¢                     1.16
¬                     1.16
ķ                     1.15
¼                     1.14
«                     1.13
¡                     1.11
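
The single-character tokens above, and the 'âĢ' in the first explanation, look like byte-level BPE symbols rather than literal text. The sketch below assumes the underlying model uses a GPT-2-style byte-to-character table, which this page does not state. Under that assumption, 'âĢ' stands for the bytes E2 80 (the first two bytes of many UTF-8 punctuation characters), and each positive-logit token is a candidate third byte. The helper names (`token_to_bytes`, `complete`) are hypothetical and exist only for this illustration.

```python
def bytes_to_unicode():
    """Byte -> printable character table in the style of GPT-2's byte-level BPE.

    Assumption: the model behind this dashboard uses this mapping.
    """
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            # Non-printable bytes are shifted to code points 256 and above.
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: displayed character -> raw byte value.
CHAR_TO_BYTE = {c: b for b, c in bytes_to_unicode().items()}


def token_to_bytes(token: str) -> bytes:
    """Convert a displayed token string back to the raw bytes it encodes."""
    return bytes(CHAR_TO_BYTE[ch] for ch in token)


# The context the feature is described as firing on.
PREFIX = token_to_bytes("âĢ")


def complete(token: str) -> str:
    """Append the token's bytes to PREFIX, decode as UTF-8, return code points."""
    text = (PREFIX + token_to_bytes(token)).decode("utf-8")
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


if __name__ == "__main__":
    print("prefix bytes:", PREFIX.hex())
    for tok in ["ª", "£", "Ķ", "¤", "¢", "¬", "ķ", "¼", "«", "¡"]:
        print(repr(tok), token_to_bytes(tok).hex(), complete(tok))
```

If the assumption holds, every top positive-logit token completes the E2 80 prefix into a single code point in the U+2000 block (general punctuation), which would fit a feature that activates partway through a multi-byte punctuation character and promotes its final byte.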
Activations Density: 0.111%