INDEX
Explanations
mentions of the Oscar awards and their associated terms
New Auto-Interp
Negative Logits
it
-0.44
campista
-0.44
T
-0.41
rmtree
-0.40
strict
-0.39
ről
-0.38
슷
-0.38
те
-0.38
New
-0.37
T
-0.37
POSITIVE LOGITS
Oscar
1.04
Oscar
1.03
Oscars
1.02
oscar
0.99
Óscar
0.96
oscar
0.94
فريبيس
0.78
Nobel
0.77
Oskar
0.77
esternos
0.70
Activations Density 0.002%