INDEX
Explanations
phrases indicating new information or reports
the repetition of the word "that" in various contexts
New Auto-Interp
Negative Logits
oses
-0.81
andem
-0.80
ocaust
-0.75
ocene
-0.73
izont
-0.72
pill
-0.71
ãĤ´ãĥ³
-0.70
aturally
-0.70
cosystem
-0.64
pec
-0.63
POSITIVE LOGITS
although
0.88
they
0.85
soever
0.74
there
0.70
'[
0.68
"[
0.65
he
0.63
whilst
0.62
we
0.61
she
0.60
Activations Density 0.200%