INDEX
Explanations
the name "Paolo" at varying degrees of strong activation
names and terms related to Italian cuisine or culture
New Auto-Interp
Negative Logits
writers
-0.67
present
-0.66
mark
-0.66
tags
-0.65
Condition
-0.62
hence
-0.62
��������
-0.62
Conditions
-0.62
correct
-0.59
nah
-0.59
POSITIVE LOGITS
olo
1.46
zzi
1.11
etooth
1.07
oaded
0.96
opsis
0.95
zzle
0.95
ogie
0.92
lette
0.91
Lens
0.91
zin
0.91
Activations Density 0.005%