INDEX
Explanations
words related to people's names
instances of the word "ze"
New Auto-Interp
Negative Logits
glim
-0.89
paced
-0.84
ials
-0.83
runner
-0.79
ially
-0.78
runners
-0.76
inarily
-0.74
cutting
-0.74
istrate
-0.73
selling
-0.72
POSITIVE LOGITS
lda
1.30
ppelin
1.21
zinski
1.01
ppel
0.84
ÅĤ
0.84
uble
0.84
itsch
0.82
cki
0.81
zza
0.81
hn
0.79
Activations Density 0.023%