INDEX
Explanations
references to chicken
Negative Logits
orsche      -0.87
INAL        -0.81
aylor       -0.78
raints      -0.76
Palestin    -0.74
oppable     -0.73
ibel        -0.72
unci        -0.72
DPR         -0.72
ilities     -0.71
Positive Logits
pox         1.16
breasts     0.95
meat        0.94
manure      0.92
fish        0.91
bones       0.91
thighs      0.89
chickens    0.85
wings       0.85
bone        0.84
Activations Density: 0.015%