INDEX
Explanations
occurrences of summary statements and conclusions in the text
New Auto-Interp
Negative Logits
dana
-0.17
inta
-0.16
Gund
-0.15
è°
-0.14
iven
-0.14
稿
-0.14
ÏĤ
-0.14
anje
-0.13
Hel
-0.13
ettel
-0.13
POSITIVE LOGITS
osaur
0.17
onis
0.15
estation
0.15
oni
0.15
odyn
0.15
entence
0.15
chied
0.14
ucer
0.14
odesk
0.14
_OVERRIDE
0.14
Activations Density 0.232%