INDEX
Explanations
references to celebrity culture and its impact on society
New Auto-Interp
Negative Logits
various
-0.50
flere
-0.48
antaranya
-0.47
لينك
-0.47
Various
-0.47
OGND
-0.47
Various
-0.47
específico
-0.46
specific
-0.46
Additional
-0.45
POSITIVE LOGITS
human
0.73
бывает
0.73
pareil
0.72
humans
0.70
hindsight
0.70
successful
0.70
democracies
0.70
great
0.69
Хоро
0.68
こういう
0.68
Activations Density 0.571%