INDEX
Explanations
names or terms containing "amed" or similar phonetic patterns
words related to blame or shame
Negative Logits
awaru      -0.72
compr      -0.72
tremend    -0.70
livest     -0.66
angers     -0.65
agan       -0.65
tamp       -0.61
indict     -0.61
neg        -0.61
knife      -0.60
Positive Logits
ame        0.97
ames       0.94
tu         0.80
orie       0.78
eus        0.77
icative    0.76
amed       0.76
onde       0.76
icate      0.75
gaard      0.75
Activations Density 0.012%