INDEX
Explanations
phrases related to actions, decisions, and consequences
the presence of conjunctions, particularly "and," indicating connections or additions in the text
New Auto-Interp
Negative Logits
tnc
-0.80
ļéĨĴ
-0.77
Interested
-0.75
inent
-0.74
incial
-0.71
culus
-0.70
ANC
-0.69
Enlarge
-0.69
cerning
-0.69
rared
-0.68
POSITIVE LOGITS
succeeded
1.12
deserve
1.01
rightly
0.94
reap
0.93
nobody
0.90
rightfully
0.89
waited
0.87
luckily
0.87
prevailed
0.87
rewarded
0.86
Activations Density 0.221%