INDEX
Explanations
references to actions taken or events that have occurred
the occurrence of the word "have."
Negative Logits
territ          -0.63
catentry        -0.63
ocol            -0.60
colonization    -0.60
Apart           -0.59
housing         -0.58
fireball        -0.55
settlement      -0.54
blending        -0.54
neigh           -0.53
Positive Logits
been            1.22
been            1.04
Been            0.96
undergone       0.93
gotten          0.92
gotten          0.92
taken           0.85
ĸļ              0.85
gone            0.84
done            0.82
Activations Density: 0.242%