INDEX
Explanations
references to specific medical conditions or health-related terms
C macro definitions
programming identifiers or code keywords
New Auto-Interp
Negative Logits
b
-0.79
d
-0.73
(
-0.73
w
-0.71
p
-0.71
c
-0.70
m
-0.70
i
-0.68
t
-0.68
e
-0.67
POSITIVE LOGITS
itſelf
1.81
myſelf
1.80
ſelf
1.74
Shakspeare
1.64
་་
1.62
ſelves
1.60
iſt
1.59
Jefus
1.57
Monfieur
1.53
Houſe
1.52
Activations Density 0.579%