INDEX
Explanations
statements and quotes made by individuals in the text
Negative Logits
Token       Logit
nier        -0.16
luet        -0.16
Touches     -0.16
amework     -0.16
uestion     -0.15
Mention     -0.14
usting      -0.14
mention     -0.14
âĸłâĸł      -0.14
lue         -0.14
Positive Logits
Token       Logit
thunder     0.21
decl        0.21
memor       0.20
bl          0.19
imper       0.19
say         0.18
crow        0.18
beams       0.18
int         0.18
declared    0.17
Activations Density: 0.109%