INDEX
Explanations
quantitative mentions of occurrences or instances
occurrences of the verb "had" in various contexts
New Auto-Interp
Negative Logits
hammer
-0.67
ugu
-0.60
omics
-0.58
lig
-0.57
ocol
-0.55
inance
-0.54
defense
-0.54
—-
-0.54
reciation
-0.53
taxp
-0.53
POSITIVE LOGITS
been
1.05
undergone
0.99
iths
0.93
hers
0.91
ĸļ
0.90
gone
0.89
begun
0.89
gotten
0.83
raltar
0.81
never
0.76
Activations Density 0.154%