INDEX
Explanations
instances where the concept of 'never' is referenced in context
instances of the word "never."
Negative Logits
Token       Logit
ahime       -0.90
pour        -0.87
antioxid    -0.78
iant        -0.71
åĤ          -0.67
achev       -0.67
CRIP        -0.63
uliffe      -0.63
PI          -0.63
ESCO        -0.61
Positive Logits
Token       Logit
theless     1.74
bothered    1.12
dreamed     0.99
ceases      0.95
existed     0.85
ceased      0.85
mind        0.84
heard       0.84
bothers     0.82
imagined    0.81
Activations Density: 0.059%