INDEX
Explanations
phrases related to causation
the word "thereby" and similar phrases indicating causation or consequence
New Auto-Interp
Negative Logits
Serge
-0.64
Atkinson
-0.64
Freddie
-0.62
cer
-0.61
Kle
-0.61
ten
-0.60
Kelvin
-0.60
Columb
-0.60
Food
-0.60
Pepper
-0.60
POSITIVE LOGITS
guiActiveUn
0.99
forth
0.94
hiba
0.84
ãĤ´ãĥ³
0.80
convol
0.80
guiActive
0.77
forfe
0.75
sidx
0.75
dwind
0.73
forfeit
0.73
Activations Density 0.007%