Explanations
instances of the word "because" and its variations, indicating causality or reasoning
Negative Logits
(æĹ¥              -0.19
earer             -0.17
Ãły               -0.16
because           -0.16
ên                -0.16
agraph            -0.15
aeper             -0.15
because           -0.15
agna              -0.14
------+------+    -0.14
Positive Logits
otherwise         0.19
else              0.17
Otherwise         0.15
896               0.15
else              0.15
zá                0.14
aring             0.14
('*',             0.14
ELSE              0.14
muz               0.14
Activations Density 0.077%
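
The page does not say how the density figure is computed. A common reading, assumed here, is the fraction of token positions on which the feature's activation is above zero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all. The source page does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 77 of 100,000 token positions.
sample = [1.0] * 77 + [0.0] * 99_923
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.077% means the feature is active on roughly 77 of every 100,000 tokens.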