INDEX
Explanations
phrases related to future events or outcomes
references to anticipated or future outcomes
New Auto-Interp
Negative Logits
toe
-0.75
emen
-0.74
manship
-0.74
lua
-0.73
books
-0.70
mans
-0.70
men
-0.70
erie
-0.68
gun
-0.68
rooms
-0.67
POSITIVE LOGITS
ity
1.05
ities
1.05
ITY
0.94
demise
0.87
downfall
0.86
aneously
0.85
ITIES
0.81
itous
0.80
occupant
0.76
successor
0.76
Activations Density 0.011%