INDEX
Explanations
phrases related to political figures and events
occurrences of the word "the" and references to significant political events or figures
New Auto-Interp
Negative Logits
/-
-0.76
nces
-0.70
ãĥ¼ãĥĨ
-0.69
ér
-0.68
ATURES
-0.67
}:
-0.67
TAIN
-0.64
payers
-0.63
ifact
-0.63
brate
-0.63
POSITIVE LOGITS
entirety
0.84
brink
0.79
guise
0.79
midst
0.78
equation
0.75
future
0.73
quest
0.72
afterlife
0.72
bargain
0.70
stead
0.70
Activations Density 0.530%