INDEX
Explanations
instances where something happens or is true in spite of something else
instances of the word "despite."
New Auto-Interp
Negative Logits
ahime
-0.81
isa
-0.74
allo
-0.71
ISE
-0.71
isites
-0.70
ault
-0.69
eg
-0.66
oyd
-0.65
apeake
-0.64
oya
-0.63
POSITIVE LOGITS
ĸļ
0.98
acknowledging
0.85
admitting
0.80
insisting
0.80
having
0.78
denying
0.74
ignoring
0.72
knowing
0.72
pledging
0.69
mentioning
0.69
Activations Density 0.028%