INDEX
Explanations
phrases related to imitations or replicas
occurrences of the word "knock" and its variations
New Auto-Interp
Negative Logits
stice
-0.83
emale
-0.77
heny
-0.72
DISTRICT
-0.67
ivities
-0.65
ceed
-0.64
ional
-0.64
mberg
-0.64
xia
-0.63
heid
-0.63
POSITIVE LOGITS
down
1.13
down
0.92
about
0.90
knock
0.89
downs
0.86
unconscious
0.84
kn
0.83
senseless
0.80
offs
0.76
boxing
0.75
Activations Density 0.036%