INDEX
Explanations
occurrences of the verb "have" and its variations, signaling possession or status
Negative Logits
plusplus    -0.20
wrote       -0.17
shared      -0.16
knew        -0.15
chose       -0.15
saw         -0.15
ehler       -0.14
withdrew    -0.14
aval        -0.14
fell        -0.14
Positive Logits
given            0.25
meant            0.24
implications     0.23
resulted         0.23
proven           0.23
been             0.23
led              0.22
ramifications    0.20
given            0.20
nothing          0.20
Activations Density: 0.130%