INDEX
Explanations
instances of the word "Horn" with varying degrees of activation
occurrences of the word "Horn" and its variants, often in various contexts
New Auto-Interp
Negative Logits
ãĥ¥
-0.69
MIA
-0.67
amin
-0.65
ENCY
-0.64
erate
-0.64
Pepsi
-0.63
ufact
-0.63
ensional
-0.61
BILITIES
-0.61
perature
-0.61
POSITIVE LOGITS
et
0.99
obyl
0.96
worms
0.90
ado
0.89
sey
0.87
worm
0.87
iasis
0.85
entimes
0.83
beam
0.82
stein
0.82
Activations Density 0.044%