INDEX
Explanations
references to pit bulls
references to pit bulls and their related contexts
New Auto-Interp
Negative Logits
IGH
-0.75
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.71
Lauder
-0.68
Idle
-0.65
lihood
-0.64
Feinstein
-0.62
Carbuncle
-0.61
issance
-0.61
ALTH
-0.61
RAFT
-0.59
POSITIVE LOGITS
iful
1.27
iless
1.19
ifully
1.18
cair
1.15
cher
1.04
adium
0.93
oco
0.89
Osw
0.89
bull
0.85
uit
0.83
Activations Density 0.024%