INDEX
Explanations
words containing the suffix "-ible" with high activation values
words related to conditions of capability or existence
Negative Logits
Token       Logit
Adren       -0.65
Ren         -0.63
stress      -0.63
preferring  -0.62
strip       -0.61
Wass        -0.59
stri        -0.59
Admir       -0.58
oscill      -0.54
opting      -0.54

Positive Logits
Token       Logit
ible        4.85
ibly        3.20
ibles       3.15
ibility     3.14
IBLE        2.73
ibilities   2.27
able        1.63
ibl         1.62
iable       1.31
uble        1.30
Activations Density: 0.008%