INDEX
Explanations
information related to significant historical events and their consequences
New Auto-Interp
Negative Logits
Majefty
-1.26
pleaſure
-1.21
Monfieur
-1.05
themſelves
-1.05
ſeveral
-1.05
ſtre
-1.03
Reſ
-1.02
ſever
-1.01
Diſ
-1.00
myſelf
-1.00
POSITIVE LOGITS
۱۹
0.97
२०
0.83
২০
0.82
一九
0.65
denas
0.60
jahr
0.60
years
0.60
'
0.60
itrile
0.59
Trump
0.58
Activations Density 0.887%