Explanations
words related to actions or attributes that are evaluative or judgmental
words that convey a sense of affirmation or positive acknowledgment
Negative Logits
Token    Logit
SE       -0.64
walk     -0.63
BUS      -0.63
falls    -0.63
Bee      -0.61
conf     -0.61
WS       -0.61
grace    -0.60
squid    -0.60
flock    -0.59
Positive Logits
Token    Logit
ative    4.50
atives   3.13
atively  2.84
ATIVE    2.25
ativity  2.01
ational  1.69
atory    1.65
ation    1.49
atorial  1.48
ations   1.48
Activations Density: 0.008%