INDEX
Explanations
instances where someone's actions lead to unexpected or ironic outcomes
the word "only" in various contexts
Negative Logits
insula      -0.77
ahime       -0.76
hement      -0.66
idon        -0.66
iosyncr     -0.59
endo        -0.59
lass        -0.59
sem         -0.59
intensity   -0.57
Code        -0.56
Positive Logits
marginally     0.94
kidding        0.93
incidentally   0.89
seconds        0.77
WARE           0.77
ĨĴ             0.72
lasts          0.69
Owner          0.68
ices           0.67
minutes        0.67
Activations Density: 0.062%