INDEX
Explanations
phrases related to verbal communication or speech
instances of punctuation, particularly quotation marks and commas
New Auto-Interp
Negative Logits
etheless
-0.90
Ĥª
-0.86
¬¼
-0.83
ibrary
-0.83
£ı
-0.83
İĭ
-0.82
»Ĵ
-0.75
itionally
-0.72
ĻĤ
-0.72
ramid
-0.71
POSITIVE LOGITS
says
1.18
whispered
1.06
said
1.02
muttered
1.01
said
1.01
replied
1.00
reads
0.99
exclaimed
0.98
murm
0.96
joked
0.94
Activations Density 0.100%