INDEX
Explanations
action words related to stimulus or reaction
concepts related to triggering mechanisms and stimuli
New Auto-Interp
Negative Logits
earchers
-0.79
bush
-0.69
âĶĢâĶĢ
-0.66
nudity
-0.65
sein
-0.63
sou
-0.61
bottleneck
-0.61
flowed
-0.59
grease
-0.58
crate
-0.58
POSITIVE LOGITS
gamer
0.71
Lists
0.70
imony
0.69
Claims
0.69
tv
0.69
areth
0.67
Instance
0.66
uli
0.64
atches
0.64
ICH
0.63
Activations Density 0.001%