INDEX
Explanations
phrases related to fandom and enthusiasm for specific individuals or entities
Negative Logits
ffset      -0.16
exels      -0.14
UBLISH     -0.14
üzel       -0.14
IDES       -0.14
Dll        -0.14
tainment   -0.14
egasus     -0.14
ĩnh        -0.14
aliz       -0.13
Positive Logits
fans       0.98
fan        0.84
Fans       0.83
fans       0.77
Fans       0.71
Fan        0.69
fan        0.67
Fan        0.61
팬         0.56
fandom     0.52
Activations Density: 0.287%