INDEX
Explanations
the names of theaters or venues
mentions of specific theaters or venues
New Auto-Interp
Negative Logits
£ı
-0.82
vironment
-0.78
rint
-0.77
istries
-0.77
aunder
-0.74
ulkan
-0.71
itor
-0.70
achev
-0.70
arah
-0.69
itude
-0.68
POSITIVE LOGITS
wright
0.97
Workshop
0.90
writers
0.84
writer
0.83
trou
0.82
wr
0.82
Theatre
0.81
marqu
0.79
plays
0.77
Pub
0.73
Activations Density 0.030%