INDEX
Explanations
verbs related to causing intense emotional reactions
instances of the word "implicate" or its variations
New Auto-Interp
Negative Logits
chal
-0.79
Ĥ¬
-0.75
kees
-0.74
grass
-0.72
liness
-0.71
lette
-0.65
flix
-0.65
cleaner
-0.64
grab
-0.63
Warwick
-0.63
POSITIVE LOGITS
osion
1.54
oded
1.47
ausible
1.43
icating
1.34
ications
1.28
icate
1.27
icates
1.27
oding
1.27
icit
1.25
anting
1.18
Activations Density 0.025%