INDEX
Explanations
mentions of a specific term or acronym, 'SM'
references to "SM" followed by numeric designations, indicating a focus on specific media or series
New Auto-Interp
Negative Logits
ãĥĥãĥĪ
-0.73
Ved
-0.72
icular
-0.71
Authorization
-0.71
Frie
-0.70
Lyme
-0.70
ãĤ¡
-0.69
Gamergate
-0.68
ça
-0.67
bart
-0.65
POSITIVE LOGITS
ITH
1.08
ART
0.98
ugg
0.97
ASH
0.95
iley
0.92
ooth
0.86
ould
0.86
oke
0.85
achine
0.85
amba
0.85
Activations Density 0.013%