INDEX
Explanations
terms of respect and address such as "sir" or specific names like "john"
references to individuals, particularly names and titles of respect
Negative Logits
Clinic     -0.74
healed     -0.70
Piercing   -0.66
Dying      -0.66
Surgery    -0.66
Lens       -0.66
Camer      -0.65
PDATE      -0.64
Leban      -0.64
Remem      -0.64
Positive Logits
otaur      1.01
athan      1.01
sb         1.00
gers       0.99
atory      0.89
son        0.87
tyard      0.83
gling      0.83
izen       0.83
ster       0.83
Activations Density 0.033%