Explanations
the word "someone"
references to the word "someone."
Negative Logits
Token       Logit
interest    -0.72
èª          -0.69
ean         -0.68
heny        -0.68
Chain       -0.68
ories       -0.63
icons       -0.62
osterone    -0.62
de          -0.60
Dep         -0.60
Positive Logits
Token       Logit
else        1.41
Else        1.23
Else        0.86
WithNo      0.86
else        0.82
20439       0.81
toget       0.79
etheless    0.78
ĪĴ          0.78
unlucky     0.75
Activations Density: 0.022%