Explanations
phrases that express feelings of surprise or anticipation
Negative Logits
Token                 Logit
ewire                 -0.15
enderror              -0.15
_sensitive            -0.14
ableViewController    -0.14
اتÙĩ                  -0.14
incons                -0.14
ืà¸Ńà¸Ĥ                -0.13
XR                    -0.13
esen                  -0.13
gd                    -0.13
Positive Logits
Token                 Logit
surprise              1.02
surprises             0.88
Surprise              0.88
surprised             0.79
surpr                 0.77
surprising            0.71
unexpected            0.66
sur                   0.64
Sur                   0.62
Sur                   0.61
Activations Density 0.404%
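
For readers unfamiliar with the density figure, here is a minimal sketch of one way such a number could be computed. It assumes that density means the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the toy data are illustrative assumptions, not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy data: 1000 token positions, 4 of which activate the feature.
toy_activations = [0.0] * 996 + [0.3, 1.2, 0.7, 2.5]
density = activation_density(toy_activations)
print(f"{density:.3%}")  # prints 0.400%
```

Under this reading, a density of 0.404% would mean the feature fires on roughly 4 of every 1,000 tokens sampled.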