INDEX
Explanations
mentions of futuristic concepts or technologies
punctuation that signifies the end of sentences
New Auto-Interp
Negative Logits
inclusion
-0.77
ominated
-0.73
interchange
-0.72
imperson
-0.72
reb
-0.71
burgh
-0.71
involuntary
-0.68
systematically
-0.68
nonexistent
-0.68
representation
-0.68
POSITIVE LOGITS
<|endoftext|>
1.69
®
1.43
Hopefully
1.40
Until
1.22
Regardless
1.16
Stay
1.15
Otherwise
1.14
UPDATE
1.12
;)
1.09
Unless
1.08
Activations Density 0.477%