INDEX
Explanations
phrases related to interpersonal communication, particularly dialogue and exchanges between people
dialogues and conversations in the text
New Auto-Interp
Negative Logits
fres
-0.81
travelling
-0.64
extrad
-0.64
knockout
-0.61
mosa
-0.60
vaccinations
-0.60
vable
-0.59
daily
-0.59
migration
-0.59
favoured
-0.58
POSITIVE LOGITS
Suddenly
1.02
"-
0.96
Fuck
0.89
Pause
0.88
Everyone
0.86
Again
0.86
Thankfully
0.85
Slow
0.85
Fortunately
0.84
Then
0.83
Activations Density 0.165%