INDEX
Explanations
[action verbs expressed in the past tense related to various activities or events.]
instances of the word "took."
New Auto-Interp
Negative Logits
Smile
-0.66
lex
-0.66
Cong
-0.66
ler
-0.64
agre
-0.60
icing
-0.59
bie
-0.58
---------
-0.58
psey
-0.58
gom
-0.58
POSITIVE LOGITS
advantage
1.15
aways
1.11
aback
1.02
refuge
1.00
pains
0.95
precedence
0.92
place
0.89
care
0.86
heed
0.85
prising
0.84
Activations Density 0.090%