Explanations
references to evaluations of creative works and the entertainment industry
Negative Logits
chten      -0.18
ugo        -0.17
ordes      -0.16
Dayton     -0.15
needed     -0.15
endeavor   -0.15
Needed     -0.15
chu        -0.14
umen       -0.14
roke       -0.14
Positive Logits
misdemean  0.22
onward     0.19
contrib    0.19
nees       0.18
intake     0.18
advert     0.18
Yorkshire  0.17
strap      0.17
innings    0.17
cracking   0.17
Activations Density: 0.440%