INDEX
Explanations
words related to sports events, particularly finals and championships
instances of the word "Final" or variations thereof, indicating championship events
New Auto-Interp
Negative Logits
relative
-0.74
maid
-0.68
behavior
-0.67
spr
-0.67
urat
-0.66
limit
-0.65
beh
-0.65
raved
-0.63
afort
-0.63
Sov
-0.62
POSITIVE LOGITS
ists
0.98
isers
0.90
izers
0.89
ist
0.84
FANTASY
0.84
izes
0.84
izer
0.82
izing
0.81
Fantasy
0.79
aneously
0.77
Activations Density 0.018%