INDEX
Explanations
phrases related to strong emotions across various contexts
expressions of shock or strong emotions
New Auto-Interp
Negative Logits
divergence
-0.87
contribution
-0.74
similarity
-0.72
occurrence
-0.72
overlap
-0.71
snippet
-0.71
variability
-0.71
variation
-0.70
presentation
-0.70
originate
-0.69
POSITIVE LOGITS
ivated
1.16
alysed
1.16
ussed
1.14
azed
1.11
icent
1.10
rified
1.09
iably
1.09
ocked
1.06
assed
1.06
athed
1.06
Activations Density 0.270%