INDEX
Explanations
references to individuals or things being idolized or celebrated
references to idols and celebrity culture
New Auto-Interp
Negative Logits
hire
-0.95
es
-0.84
hid
-0.79
hou
-0.75
rooms
-0.74
yards
-0.74
berra
-0.71
oning
-0.70
cale
-0.70
externalActionCode
-0.69
POSITIVE LOGITS
inating
0.86
otive
0.84
ãĤ§
0.83
izable
0.81
inated
0.80
BALL
0.78
ãĥ¤
0.76
9999
0.76
yrinth
0.74
inatory
0.74
Activations Density 0.073%