INDEX
Explanations
words related to physical attributes or characteristics
words denoting various past participle verb forms
New Auto-Interp
Negative Logits
anches
-0.76
Trade
-0.74
involved
-0.74
nesota
-0.72
forums
-0.71
ledge
-0.70
estamp
-0.70
trade
-0.70
osponsors
-0.70
ideo
-0.69
POSITIVE LOGITS
gling
0.82
ness
0.77
tir
0.74
Brach
0.70
contra
0.70
glers
0.69
adoes
0.67
omn
0.66
irection
0.66
pup
0.65
Activations Density 0.120%