INDEX
Explanations
instances of the word "have"
occurrences of the word "have"
New Auto-Interp
Negative Logits
catentry
-0.65
Apart
-0.57
ocol
-0.55
territ
-0.50
fireball
-0.50
colonization
-0.48
neigh
-0.48
smear
-0.48
osa
-0.47
persuasion
-0.47
POSITIVE LOGITS
been
1.19
been
1.02
Been
0.92
undergone
0.91
gotten
0.89
gone
0.83
taken
0.81
done
0.81
gotten
0.80
begun
0.79
Activations Density 0.435%