INDEX
Explanations
formal communication fragments, including names and titles
New Auto-Interp
Negative Logits
oiler
-0.77
gore
-0.73
dunno
-0.70
hardcore
-0.69
legit
-0.68
partying
-0.68
Panic
-0.68
cops
-0.68
umat
-0.68
destro
-0.68
POSITIVE LOGITS
Honour
1.46
sir
1.28
gentlemen
1.24
respectfully
1.17
gentleman
1.14
Chairman
1.13
Chair
1.11
Thank
1.06
reetings
1.06
Gentle
1.06
Activations Density 0.671%