Explanations
words related to future outcomes or events
references to outcomes or consequences
Negative Logits
books       -0.80
book        -0.74
PAC         -0.72
BOOK        -0.68
anecd       -0.66
gun         -0.66
guns        -0.66
emen        -0.64
toe         -0.63
lua         -0.63
Positive Logits
ity         1.21
ities       1.11
ITY         1.00
demise      0.94
aneously    0.89
winner      0.85
occupant    0.83
downfall    0.83
itous       0.81
bankruptcy  0.80
Activations Density 0.020%