INDEX
Explanations
questions addressed to the reader
occurrences of second person pronouns and questions
New Auto-Interp
Negative Logits
76561
-0.83
ces
-0.77
ricting
-0.72
Par
-0.72
ItemTracker
-0.70
Making
-0.67
atlantic
-0.67
Heads
-0.66
ching
-0.66
haven
-0.66
POSITIVE LOGITS
accept
0.92
be
0.89
suffice
0.87
overwrite
0.84
succeed
0.82
recognise
0.81
avail
0.80
prescribe
0.79
abandon
0.79
decide
0.79
Activations Density 0.053%