INDEX
Explanations
mentions of fans or things related to fan experiences
references to fans and their sentiments
Negative Logits
Token          Logit
srfAttach      -0.70
LY             -0.67
Coch           -0.63
EDIT           -0.62
Kaplan         -0.62
ENCE           -0.61
ateral         -0.61
Prosecut       -0.58
Proceedings    -0.58
tein           -0.58
Positive Logits
Token          Logit
fans           1.12
Fans           1.07
Fans           1.02
atics          0.93
ervatives      0.87
atically       0.87
ervative       0.87
atical         0.84
rejoice        0.84
fan            0.83
Activations Density: 0.020%