INDEX
Explanations
season-related words and phrases
references to specific seasons and seasonal events
New Auto-Interp
Negative Logits
hai
-0.77
aware
-0.69
enei
-0.64
plete
-0.63
stract
-0.62
Fram
-0.62
weak
-0.61
mercial
-0.60
sten
-0.60
downed
-0.59
POSITIVE LOGITS
opener
0.92
finale
0.92
ally
0.87
premiere
0.82
ings
0.77
ticket
0.69
ı
0.67
eve
0.65
aments
0.65
igraph
0.64
Activations Density 0.036%