INDEX
Explanations
occurrences of the word "the"
New Auto-Interp
Negative Logits
èm
-0.16
Mention
-0.15
anky
-0.15
ulfilled
-0.14
ono
-0.14
zew
-0.14
idelity
-0.14
ampton
-0.14
herent
-0.13
dap
-0.13
POSITIVE LOGITS
same
0.20
ocratic
0.17
ocracy
0.16
same
0.16
opportunity
0.15
TabPage
0.14
mismo
0.14
ymoon
0.14
,DB
0.14
Babe
0.14
Activations Density 0.215%