INDEX
Explanations
words related to prosthetics
references to prosthetics and related devices
New Auto-Interp
Negative Logits
INESS
-0.90
hound
-0.70
Dick
-0.65
prevailing
-0.65
hower
-0.64
Finder
-0.64
ACTIONS
-0.63
seekers
-0.61
Grind
-0.60
ÃĽ
-0.60
POSITIVE LOGITS
hetics
1.27
hetic
1.19
heses
1.11
hesis
1.09
orius
1.09
prost
1.01
acles
0.95
ificial
0.95
acist
0.90
acet
0.89
Activations Density 0.030%