INDEX
Explanations
phrases indicating decisions or actions being taken/not taken by individuals or groups
occurrences of the verb "have" and its various forms in different contexts
Negative Logits
Spend     -0.77
spends    -0.70
cares     -0.64
Discuss   -0.63
recalls   -0.62
loses     -0.62
[&        -0.61
hates     -0.60
detrim    -0.59
AAF       -0.58
Positive Logits
arisen    1.46
been      1.40
been      1.33
emerged   1.31
occurred  1.24
surfaced  1.22
ensued    1.20
kell      1.13
existed   1.09
flowed    1.08
Activations Density: 0.123%