Explanations
occurrences of the word "Ham" and its variations in different contexts
Negative Logits
Token      Logit
omanip     -0.19
educ       -0.16
меÑĩ       -0.15
Minority   -0.15
eenth      -0.15
p          -0.15
.cum       -0.15
697        -0.15
649        -0.14
ents       -0.14
Positive Logits
Token      Logit
ilton      0.27
pered      0.24
pton       0.20
ham        0.18
ming       0.18
sters      0.18
Ham        0.17
pering     0.17
oen        0.17
ppy        0.16
Activations Density: 0.017%