INDEX
Explanations
occurrences of the suffix "ster" in various contexts
New Auto-Interp
Negative Logits
mercial
-0.80
ĸļ
-0.76
UTERS
-0.73
ongyang
-0.72
ĪĴ
-0.70
ython
-0.70
ouston
-0.68
iferation
-0.68
onential
-0.67
isconsin
-0.66
POSITIVE LOGITS
oids
0.87
bucks
0.85
oid
0.84
iders
0.82
ious
0.78
ding
0.77
geist
0.77
holm
0.77
ized
0.76
Spit
0.74
Activations Density 0.011%