Explanations
phrases urging action or encouraging participation from the audience
Negative Logits
Token      Logit
Wort       -0.16
ptron      -0.15
lue        -0.14
ehler      -0.14
кту        -0.14
оть        -0.14
_circle    -0.13
骑         -0.13
arp        -0.13
وات        -0.13
Positive Logits
Token         Logit
encouraged    0.42
invited       0.41
urged         0.31
encourage     0.29
invit         0.28
advised       0.28
encourages    0.28
invite        0.28
invitation    0.27
encour        0.27
Activations Density: 0.028%
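
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number is commonly computed, assuming density means the fraction of token positions where the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample counts are hypothetical and chosen only for illustration; they are not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 100,000 token positions, with the feature firing on 28.
sample = [0.0] * 99_972 + [1.0] * 28
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low suggests the feature fires on only a small share of tokens, which fits a narrow explanation such as the one given above.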