INDEX
Explanations
greetings and salutations in communication
greetings and informal salutations
New Auto-Interp
Negative Logits
"},"
-0.88
exerted
-0.73
culosis
-0.72
)</
-0.68
sunk
-0.67
advert
-0.66
``
-0.66
empt
-0.64
survives
-0.63
denies
-0.63
POSITIVE LOGITS
guys
0.99
gentlemen
0.89
Guys
0.87
comrades
0.86
reetings
0.86
ladies
0.85
fellow
0.81
Exc
0.81
folks
0.78
readers
0.76
Activations Density 0.053%