INDEX
Explanations
phrases that incite action or excitement
exclamatory or emphatic expressions often related to reactions or events
Negative Logits
Chin       -0.72
Cyr        -0.72
Aram       -0.68
Kaw        -0.67
Nass       -0.66
Chaff      -0.66
Chapman    -0.65
©¶æ        -0.65
Shap       -0.65
Beir       -0.64
Positive Logits
!:         1.41
!'         1.41
!.         1.36
!,         1.36
!          1.34
!'"        1.20
!".        1.14
!"         1.13
!",        1.12
!]         1.10
Activations Density 0.201%
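
As a rough illustration of how a density figure like the one above could be read, the sketch below assumes that "activation density" means the fraction of sampled token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; none of them come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: density is counted per token position, and a position
    counts as active when its activation is strictly above the threshold.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 2 of which activate the feature.
sample = [0.0] * 998 + [3.2, 1.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.201% would mean the feature fires on roughly 2 of every 1000 sampled tokens, which fits a feature that responds to a narrow pattern such as exclamation marks followed by closing punctuation.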