INDEX
Explanations
phrases emphasizing that something is more than what it appears to be
themes of superficiality versus deeper meaning in various contexts
New Auto-Interp
Negative Logits
unfocusedRange
-0.70
also
-0.67
unsus
-0.64
å§«
-0.64
ãģ¦
-0.63
ALSO
-0.63
concess
-0.61
éŃĶ
-0.60
Jury
-0.60
OTH
-0.60
POSITIVE LOGITS
anymore
0.90
alone
0.88
;
0.76
itself
0.76
superficial
0.70
decoration
0.69
':
0.69
but
0.68
.;
0.68
.
0.67
Activations Density 0.335%