Explanations
unexpected or ironic situations and events
instances of irony or interesting contrasts in the text
Negative Logits
comprehension   -0.72
gulf            -0.65
commitment      -0.65
sewage          -0.62
division        -0.61
verbal          -0.60
"},"            -0.58
Submission      -0.58
kindred         -0.58
iens            -0.58
Positive Logits
ffe             0.71
situated        0.71
coinc           0.68
ironically      0.67
Ironically      0.67
haus            0.67
titled          0.66
identally       0.65
leck            0.64
pmwiki          0.63
Activations Density: 0.030%