INDEX
Explanations
words or phrases describing a location or a place
phrases indicating a continual state or characteristic
Negative Logits
soon        -0.82
ģ«          -0.68
regate      -0.67
yet         -0.67
illions     -0.66
ellar       -0.65
now         -0.65
))))        -0.64
ptoms       -0.64
actually    -0.63
Positive Logits
regarded    0.91
fascinated  0.88
shrouded    0.79
wary        0.78
skeptical   0.74
elusive     0.73
considered  0.72
admired     0.72
uneasy      0.72
reluctant   0.71
Activations Density 0.144%