INDEX
Explanations
expressions of surprise or strong emotion
expressions of surprise or exclamation, often invoking a divine reference
New Auto-Interp
Negative Logits
BILITIES
-0.76
Reviewed
-0.69
20439
-0.67
Perspective
-0.67
CONCLUS
-0.66
FN
-0.65
BIL
-0.65
oult
-0.65
actionDate
-0.64
ilater
-0.61
POSITIVE LOGITS
Gaw
0.71
hurry
0.69
warts
0.66
Merlin
0.65
!'
0.65
amaz
0.65
!--
0.64
tta
0.64
!!!!!
0.64
rushes
0.62
Activations Density 0.042%