INDEX
Explanations
mentions of celebrities, particularly Lady Gaga
mentions of specific celebrities, particularly Lady Gaga and Ariana Grande
Negative Logits
ģ             -0.86
æ³            -0.82
Sod           -0.72
âĢ¢âĢ¢âĢ¢âĢ¢  -0.72
Icar          -0.71
Shinra        -0.71
Pax           -0.71
Noct          -0.70
Raiders       -0.70
Seaf          -0.69
POSITIVE LOGITS
steen    0.97
udeau    0.91
hler     0.89
essen    0.85
awaru    0.84
psey     0.83
anooga   0.82
millenn  0.81
tremend  0.76
ende     0.76
Activations Density 0.020%