INDEX
Explanations
occurrences of the word "included"
Negative Logits
ero        -0.16
umont      -0.14
acco       -0.13
ninger     -0.13
jom        -0.13
eres       -0.13
Russo      -0.13
******/    -0.13
_aspect    -0.13
ertz       -0.13
Positive Logits
obo        0.18
922        0.16
Conte      0.15
oph        0.15
ipi        0.14
enant      0.14
722        0.14
ETIME      0.14
kle        0.14
aed        0.14
Activations Density: 0.041%