INDEX
Explanations
times or events related to political figures, particularly within a specific context of activities or mentions involving certain people or groups
proper nouns and names
New Auto-Interp
Negative Logits
Tud
-1.00
Rud
-0.97
OD
-0.90
Judd
-0.87
Tib
-0.87
Sud
-0.83
OCT
-0.83
Nost
-0.82
Treaty
-0.78
Rudolph
-0.78
POSITIVE LOGITS
el
1.25
75
1.24
EL
1.22
els
1.17
elman
1.07
ELS
1.04
elope
1.01
ela
1.01
azel
0.98
eling
0.97
Activations Density 0.296%