INDEX
Explanations
references to fouls in sports contexts
Negative Logits
ocobo         -0.72
akeru         -0.72
Downloadha    -0.72
edia          -0.71
_>            -0.69
udeau         -0.64
Airl          -0.64
itan          -0.64
btn           -0.63
1920          -0.63
Positive Logits
cery          0.96
terness       0.88
sie           0.84
smelling      0.79
s             0.79
nesses        0.79
mouth         0.77
ness          0.76
eners         0.72
ster          0.71
Activations Density 0.005%