INDEX
Explanations
instances of humor and communication styles
Negative Logits
Naissance        -0.58
includegraphics  -0.55
RRect            -0.52
plona            -0.52
fotogra          -0.51
recens           -0.50
WaitGroup        -0.49
SPEC             -0.49
matsu            -0.48
Дата             -0.48
Positive Logits
uttered     1.08
spoken      1.00
words       0.95
utterances  0.88
uttering    0.88
speech      0.88
talking     0.81
Spoken      0.81
worded      0.80
peech       0.79
Activations Density 0.521%