INDEX
Explanations
instances where an action is taken despite a known obstacle or contrary circumstances
instances of the word "despite."
New Auto-Interp
Negative Logits
ISE
-0.73
ahime
-0.72
lees
-0.72
allo
-0.70
ault
-0.69
isa
-0.68
esian
-0.66
ael
-0.66
=-=-=-=-=-=-=-=-
-0.63
isites
-0.62
POSITIVE LOGITS
ĸļ
0.92
having
0.79
acknowledging
0.76
insisting
0.72
spelling
0.71
seeming
0.71
math
0.70
citing
0.70
admitting
0.69
knowing
0.68
Activations Density 0.019%