INDEX
Explanations
verbs or actions related to strong and often drastic actions or decisions
action verbs indicating significant activities or events
New Auto-Interp
Negative Logits
been
-0.59
yond
-0.53
copy
-0.52
emate
-0.51
estate
-0.49
estone
-0.47
arb
-0.47
é¾įåĸļ士
-0.46
orest
-0.46
arsity
-0.46
POSITIVE LOGITS
yesterday
0.52
BART
0.50
showc
0.50
tremend
0.49
him
0.47
harshly
0.46
hes
0.46
nces
0.45
abruptly
0.45
briefly
0.45
Activations Density 0.756%