INDEX
Explanations
occurrences of the verb "have" in various contexts
New Auto-Interp
Negative Logits
olta
-0.16
underwent
-0.16
apor
-0.16
otle
-0.15
raph
-0.15
tainment
-0.14
иÑģÑĮ
-0.13
tgl
-0.13
rze
-0.13
fried
-0.13
POSITIVE LOGITS
been
0.32
been
0.26
Been
0.23
BEEN
0.22
become
0.21
Been
0.21
sido
0.20
come
0.20
chosen
0.20
difficulty
0.19
Activations Density 0.147%