INDEX
Explanations
mentions of a specific word "Qu"
instances of the brand name 'Quaker'
New Auto-Interp
Negative Logits
AGES
-0.70
heart
-0.70
HAEL
-0.69
contrast
-0.69
manship
-0.67
deed
-0.66
hearts
-0.65
fulness
-0.64
intensity
-0.64
rabbits
-0.64
POSITIVE LOGITS
arantine
1.30
oted
1.09
ipped
1.08
inoa
1.06
otation
1.05
arie
1.05
artz
1.03
arters
1.03
icks
1.02
atern
1.01
Activations Density 0.018%