INDEX
Explanations
references to themes and thematic elements in the text
New Auto-Interp
Negative Logits
aneous
-0.18
arian
-0.18
teen
-0.17
ty
-0.17
inet
-0.16
iegel
-0.15
ree
-0.15
hti
-0.15
nd
-0.15
land
-0.14
POSITIVE LOGITS
elves
0.24
æĿIJ
0.21
atically
0.19
park
0.19
562
0.18
atical
0.17
eted
0.16
gth
0.16
eting
0.16
Setter
0.16
Activations Density 0.017%