INDEX
Explanations
capital letters followed by periods (e.g., "E.")
occurrences of the letter "e"
NEGATIVE LOGITS
mallow      -0.72
GEAR        -0.70
Kats        -0.70
Showdown    -0.68
Kitchen     -0.68
Cats        -0.67
Painter     -0.67
姫          -0.67
Noir        -0.66
Kah         -0.65
POSITIVE LOGITS
tymology    1.20
lements     1.18
gypt        1.16
lev         1.15
ighth       1.11
ternally    1.11
ighty       1.08
lder        1.07
astern      1.05
resp        1.04
Activations Density 0.030%