INDEX
Explanations
phrases related to personal experiences involving various activities or events
occurrences of the word "had" in various contexts
New Auto-Interp
Negative Logits
bie
-0.69
bies
-0.67
bery
-0.63
Below
-0.63
owe
-0.62
Voters
-0.62
hack
-0.60
edy
-0.60
ethy
-0.58
yles
-0.56
POSITIVE LOGITS
been
1.06
gotten
0.99
undergone
0.94
begun
0.91
gone
0.89
iths
0.82
dealings
0.78
rontal
0.77
done
0.77
wandered
0.77
Activations Density 0.134%