INDEX
Explanations
mentions of a specific name, "Pekka"
the presence of the token "ek" in various contexts
New Auto-Interp
Negative Logits
ACTED
-0.74
appropriately
-0.69
AMERICA
-0.66
ãĥ¼ãĥĨ
-0.62
ãĤ¤
-0.61
diapers
-0.61
ÙĴ
-0.60
belts
-0.60
ãĥĩãĤ£
-0.59
Bland
-0.59
POSITIVE LOGITS
nown
1.38
kers
1.05
enstein
1.05
ansas
1.00
kies
0.95
yll
0.95
hov
0.95
anism
0.93
enzie
0.90
ker
0.90
Activations Density 0.017%