INDEX
Explanations
the word "never" occurring in sentences
phrases indicating repeated actions or events that did not occur
New Auto-Interp
Negative Logits
antioxid
-0.79
ahime
-0.74
=-=-=-=-=-=-=-=-
-0.67
Crusher
-0.67
OUR
-0.64
âĶģ
-0.61
equality
-0.60
Disclaimer
-0.59
ÃįÃį
-0.59
PI
-0.58
POSITIVE LOGITS
theless
1.59
bothered
1.21
ceases
1.13
mind
1.04
knew
1.02
imagined
1.02
ceased
1.02
doubted
1.02
dreamed
1.01
forgot
0.98
Activations Density 0.045%