INDEX
Explanations
mentions of the Sundance Film Festival
mentions of film festivals, particularly Sundance and Cannes
New Auto-Interp
Negative Logits
posit
-0.79
vernment
-0.78
itably
-0.74
berus
-0.73
utherford
-0.73
iliary
-0.69
lectic
-0.67
acters
-0.66
urdue
-0.64
lly
-0.64
POSITIVE LOGITS
Sund
1.08
inguished
0.96
Sund
0.90
rained
0.84
ered
0.82
ance
0.81
icator
0.81
heim
0.78
azz
0.77
rum
0.76
Activations Density 0.022%