INDEX
Explanations
the word "pizza" in various contexts
occurrences of the term "Mozzarella"
Negative Logits
spect          -0.68
Polar          -0.66
compl          -0.66
warming        -0.60
Span           -0.60
conspicuous    -0.59
Patriot        -0.59
©¶æ            -0.59
entitle        -0.58
Malays         -0.58
Positive Logits
arella         1.37
etta           1.10
arro           1.09
azz            1.07
erella         1.03
ombies         0.98
hou            0.95
eria           0.95
ucc            0.94
zz             0.93
Activations Density: 0.052%