INDEX
Explanations
words associated with medical treatment
repeated syllables or phonetic patterns in words
New Auto-Interp
Negative Logits
lain
-0.69
HH
-0.69
icles
-0.62
deterrence
-0.58
manship
-0.58
innocence
-0.57
confir
-0.57
umbn
-0.57
igators
-0.56
sovereignty
-0.56
POSITIVE LOGITS
zzi
1.32
zzle
1.24
ffee
1.18
ppe
1.15
ven
1.13
ffe
1.11
pping
1.10
ppa
1.08
pp
1.07
pper
1.07
Activations Density 0.140%