INDEX
Explanations
words related to food items, specifically pizza and related terms
occurrences of the word "pizza."
New Auto-Interp
Negative Logits
Niet
-0.73
newcom
-0.64
bystand
-0.62
ITNESS
-0.58
QC
-0.58
Qual
-0.58
Alam
-0.56
Territories
-0.56
Social
-0.56
CLASS
-0.56
POSITIVE LOGITS
hare
1.13
poons
1.04
etting
1.03
hips
1.02
hip
1.01
mith
1.01
cale
1.01
hell
0.99
avers
0.99
igmatic
0.98
Activations Density 0.176%