INDEX
Explanations
expressions of formal dialogue and requests related to authority or hierarchy
New Auto-Interp
Negative Logits
kasarigan
-0.80
guy
-0.62
dude
-0.59
viewDidLoad
-0.59
Guys
-0.58
Geplaatst
-0.57
dudes
-0.56
GUYS
-0.56
guys
-0.54
esternos
-0.54
POSITIVE LOGITS
sir
1.11
madam
0.91
sir
0.84
Sir
0.82
Sir
0.82
Excellency
0.77
madame
0.72
dear
0.72
gentlemen
0.71
signore
0.70
Activations Density 0.297%