INDEX
Explanations
words related to exclamations or expressions of surprise or emphasis
names or references to characters, possibly in a narrative or dialogue context
New Auto-Interp
Negative Logits
senal
-0.76
BMC
-0.75
Presence
-0.70
Proceedings
-0.64
Kinder
-0.64
NCT
-0.62
Polk
-0.61
Scholar
-0.60
¿½
-0.60
Median
-0.59
POSITIVE LOGITS
oooo
1.45
mmmm
1.40
OOOO
1.33
aaaa
1.32
OOOOOOOO
1.28
mmm
1.24
ooo
1.22
AAAAAAAA
1.22
eeee
1.21
oooooooo
1.21
Activations Density 0.220%