INDEX
Explanations
references to the character "Pyrrha" and some related entities
proper nouns and names of characters
New Auto-Interp
Negative Logits
Indiana
-0.85
tery
-0.82
draw
-0.74
ktop
-0.74
$$$$
-0.72
creen
-0.72
aton
-0.69
milo
-0.69
DT
-0.69
score
-0.68
POSITIVE LOGITS
Grimm
1.17
RW
0.90
Beacon
0.85
Arc
0.80
Weaver
0.77
Pyrrha
0.75
Cascade
0.75
semblance
0.74
Oz
0.73
Emerald
0.72
Activations Density 0.018%