Explanations
phrases related to critical commentary or sarcastic remarks
responses that convey strong opinions or judgments
Negative Logits
footing            -0.88
intended           -0.70
jointly            -0.66
utive              -0.66
intending          -0.65
sailing            -0.65
funded             -0.65
arij               -0.64
discontin          -0.62
artney             -0.62
Positive Logits
Anyway             1.24
³³³³³³³³           1.10
³³³³³³³³³³³³³³³³   1.07
Thankfully         1.02
³³³³               1.01
Moreover           0.99
Consider           0.98
Likewise           0.97
Unless             0.97
Nevertheless       0.97
Activations Density: 0.398%