INDEX
Explanations
words ending in "-hes", or the token "hes" itself
the presence of the word "hes" in various contexts
Negative Logits
congress     -0.64
Reviewer     -0.62
planning     -0.62
coff         -0.62
CAL          -0.61
recomm       -0.60
collusion    -0.60
link         -0.60
reference    -0.59
Doctors      -0.59
Positive Logits
hes          1.18
apeake       1.18
creen        1.01
omen         0.97
terday       0.92
borough      0.91
hens         0.89
ervative     0.89
ashore       0.88
piration     0.88
Activations Density 0.006%