INDEX
Explanations
specific pronouns followed by verbs
the repeated use of the word "its" in various contexts
New Auto-Interp
Negative Logits
tu
-0.74
Scully
-0.68
Texture
-0.67
Bron
-0.67
hog
-0.67
Show
-0.67
Condition
-0.66
vere
-0.66
AAAA
-0.66
picture
-0.65
POSITIVE LOGITS
own
1.66
predecessor
1.14
ELF
1.10
respective
1.07
predecessors
1.05
namesake
0.93
newfound
0.92
newest
0.91
flagship
0.90
self
0.89
Activations Density 0.124%