INDEX
Explanations
references to medical professionals and healthcare settings
New Auto-Interp
Negative Logits
-0.59
↵
-0.59
(
-0.56
and
-0.54
se
-0.52
’
-0.50
y
-0.49
↵↵
-0.49
'
-0.49
.
-0.48
POSITIVE LOGITS
myſelf
1.25
ſelf
1.22
Jefus
1.20
Monfieur
1.20
Anſ
1.15
queſta
1.15
majánló
1.13
ſou
1.12
itſelf
1.12
ſind
1.11
Activations Density 0.251%