INDEX
Explanations
the word "If" at the beginning of a sentence
New Auto-Interp
Negative Logits
ãĤª
-0.76
âĵĺ
-0.69
ãĥ³ãĤ¸
-0.69
ãĥŃ
-0.66
Variable
-0.62
hs
-0.60
blind
-0.59
forth
-0.58
oves
-0.57
iliar
-0.57
POSITIVE LOGITS
fy
1.08
you
0.99
anything
0.89
rame
0.86
ihad
0.82
anybody
0.77
unchecked
0.76
yip
0.76
anyone
0.76
ya
0.70
Activations Density 0.095%