INDEX
Explanations
words related to reactions or responses to stimuli
instances of the word "respond" and its variations in the context of reactions or replies
Negative Logits
dar                 -0.73
corn                -0.68
thing               -0.65
colo                -0.62
fre                 -0.62
borne               -0.61
Dracula             -0.60
=-=-=-=-=-=-=-=-    -0.58
fi                  -0.58
fusc                -0.58
Positive Logits
favorably            1.20
positively           1.14
ivated               1.03
angrily              1.02
thereto              1.01
negatively           0.96
harshly              0.94
enthusiastically     0.91
promptly             0.90
appropriately        0.90
Activations Density 0.050%
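
The density figure can be read as the share of token positions on which the feature fires. The sketch below shows one way to compute such a number, assuming density means the fraction of positions with activation above zero; the page does not state its definition. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature is
    active (activation > threshold). This is an illustrative definition only.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 5 of which activate.
sample = [0.0] * 9995 + [1.3, 0.7, 2.1, 0.4, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

With this made-up sample, 5 active positions out of 10,000 give a density of the same order as the value reported above.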