Explanations
mentions of the term "Oscar" in relation to awards or recognition in the film industry
Negative Logits
endra    -0.18
opard    -0.15
iry      -0.15
opian    -0.14
Advoc    -0.14
esh      -0.14
M        -0.14
OND      -0.14
oph      -0.14
aic      -0.14
Positive Logits
uder          0.17
aris          0.15
unken         0.15
典            0.15
alion         0.14
Corps         0.14
inge          0.14
rape          0.14
兽            0.14
こんにちは    0.14
Activations Density 0.023%