Explanations
words related to irony
expressions of irony
Negative Logits
Token           Logit
eding           -0.87
erto            -0.81
hani            -0.77
Interstitial    -0.75
á               -0.75
lished          -0.75
tests           -0.75
league          -0.75
eder            -0.74
undy            -0.73
Positive Logits
Token           Logit
irony           1.09
ironic          1.03
twist           0.93
wink            0.81
reversal        0.81
sidel           0.81
paradox         0.80
ironically      0.78
juxtap          0.76
Osw             0.71
Activations Density: 0.013%