INDEX
Explanations
specific mentions of the word 'Skidmore'
New Auto-Interp
Negative Logits
writ
-0.74
stricken
-0.68
sanity
-0.66
moot
-0.66
sympt
-0.64
slee
-0.64
âĸ¬âĸ¬
-0.62
limited
-0.60
normally
-0.60
peace
-0.60
POSITIVE LOGITS
yrim
1.36
ysc
1.30
ipper
1.26
ratch
1.23
oln
1.19
illet
1.18
ulpt
1.18
illed
1.17
irts
1.16
rill
1.15
Activations Density 0.015%