INDEX
Explanations
names or terms related to specific individuals, particularly with the occurrence of "hr" possibly indicating "ehr" which might be part of names or titles
the presence of specific character sequences or formats within the text
New Auto-Interp
Negative Logits
eers
-0.81
Hots
-0.75
e
-0.72
cens
-0.70
eer
-0.70
Grizzlies
-0.68
reversible
-0.68
BALL
-0.68
Samoa
-0.66
yip
-0.65
POSITIVE LOGITS
acht
0.96
lich
0.90
agan
0.89
azard
0.89
onds
0.88
anging
0.87
acking
0.87
ud
0.86
mann
0.85
uty
0.84
Activations Density 0.018%