Explanations
phrases related to giving instructions or commands
Negative Logits
VERTISEMENT  -0.84
ahime        -0.77
ibrary       -0.76
wcs          -0.76
ighed        -0.76
eco          -0.75
ledged       -0.74
inction      -0.72
iculty       -0.72
eton         -0.71
Positive Logits
sir          1.43
dear         1.26
gentlemen    1.18
darling      1.16
comrade      1.12
Mister       1.11
mate         1.09
buddy        1.05
please       1.01
huh          0.99
Activations Density 0.149%