INDEX
Explanations
quotations that are the speaker's statements
New Auto-Interp
Negative Logits
etheless
-1.01
ingu
-0.77
£ı
-0.73
¬¼
-0.70
anuts
-0.66
»Ĵ
-0.66
Ĥª
-0.64
bably
-0.64
²¾
-0.62
osite
-0.62
POSITIVE LOGITS
said
1.23
said
1.22
says
1.21
reads
1.18
wrote
1.10
commented
1.03
recalls
1.00
writes
0.98
joked
0.97
remarked
0.97
Activations Density 0.390%