INDEX
Explanations
verbs indicating communication or conversation
instances of the word "talk" and its variations
New Auto-Interp
Negative Logits
================
-0.55
||||
-0.55
sender
-0.54
CoC
-0.53
UPDATE
-0.52
HUD
-0.52
Cola
-0.51
======
-0.51
Answer
-0.51
ulia
-0.51
POSITIVE LOGITS
about
1.23
of
1.09
nostalg
1.05
About
0.93
about
0.91
ABOUT
0.87
extensively
0.86
fond
0.84
glow
0.78
bitterly
0.77
Activations Density 0.215%