INDEX
Explanations
references to geographical locations and their global significance
New Auto-Interp
Negative Logits
↵
-0.70
-0.69
(
-0.64
,
-0.61
in
-0.60
-
-0.59
-0.57
"
-0.56
'
-0.55
to
-0.55
POSITIVE LOGITS
Efq
1.64
Majefty
1.40
―――――
1.38
Monfieur
1.35
ſelf
1.33
itſelf
1.32
myſelf
1.32
iſt
1.27
་་
1.26
ſelves
1.23
Activations Density 0.275%