INDEX
Explanations
words related to different sports
references to various sports and entertainment contexts
Negative Logits
pex      -0.65
thens    -0.64
Bei      -0.62
Kik      -0.62
graded   -0.62
inki     -0.61
LOC      -0.61
xit      -0.61
isher    -0.60
Krug     -0.60
Positive Logits
careers      0.99
journalism   0.99
circles      0.94
academia     0.91
folklore     0.90
royalty      0.87
renaissance  0.87
fandom       0.83
fraternity   0.81
lore         0.78
Activations Density: 0.316%