INDEX
Explanations
the letter "z" followed by a word or part of a word
instances of the letter 'z'
New Auto-Interp
Negative Logits
tenance
-0.71
Palestin
-0.71
ModLoader
-0.71
mingham
-0.70
compe
-0.70
Sut
-0.69
Lumpur
-0.69
Staples
-0.68
disadvant
-0.68
Lauder
-0.66
POSITIVE LOGITS
ebra
1.30
eros
1.17
ooming
1.14
ipped
1.11
odiac
1.09
ither
1.07
ipping
1.06
ippers
1.05
ipper
1.04
ipp
1.01
Activations Density 0.015%