INDEX
Explanations
phrases related to statements of intent or action
expressions of strong opinions or significant actions related to leadership or events
New Auto-Interp
Negative Logits
vere
-0.68
ach
-0.65
[+
-0.63
Grey
-0.62
grades
-0.62
auder
-0.56
Vish
-0.55
urable
-0.54
Die
-0.53
ofer
-0.53
POSITIVE LOGITS
immediately
0.97
instantly
0.85
promptly
0.84
mediately
0.79
greeted
0.79
uddenly
0.75
astonished
0.74
assumed
0.74
faced
0.73
flo
0.73
Activations Density 0.280%