INDEX
Explanations
phrases related to attributing actions or qualities to individuals
forms of the verb "have"
New Auto-Interp
Negative Logits
hammer
-0.66
Apart
-0.62
esp
-0.61
catentry
-0.60
Voters
-0.60
Arc
-0.60
ocol
-0.59
eal
-0.59
tel
-0.59
oshi
-0.56
POSITIVE LOGITS
been
1.35
been
1.22
gotten
1.12
Been
1.02
undergone
1.02
done
0.93
gone
0.92
fallen
0.91
become
0.90
begun
0.87
Activations Density 0.461%