INDEX
Explanations
exclamatory and emphatic punctuation, particularly related to strong emotional responses
Negative Logits
ingroup     -0.15
ptide       -0.15
ç̬          -0.15
acle        -0.14
PPER        -0.14
λÏį         -0.14
uraa        -0.14
šť          -0.13
ouz         -0.13
worrying    -0.13
Positive Logits
wing        0.14
standing    0.14
365         0.14
asi         0.14
Them        0.14
alk         0.14
stant       0.13
634         0.13
234         0.13
ammer       0.13
Activations Density: 0.187%