INDEX
Explanations
events and actions in a narrative context
New Auto-Interp
Negative Logits
ãĥ¥
-0.68
è£ħ
-0.64
quartered
-0.62
ãĤ´ãĥ³
-0.61
Cause
-0.61
alogy
-0.58
SourceFile
-0.58
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.54
ãĤ¢ãĥ«
-0.54
Pool
-0.53
POSITIVE LOGITS
however
0.84
somew
0.79
alas
0.74
though
0.68
we
0.65
lest
0.65
whenever
0.61
when
0.58
confronted
0.58
thankfully
0.57
Activations Density 11.028%