INDEX
Explanations
names of individuals or characters
references to specific individuals, particularly those involved in creative projects
Negative Logits
federal        -0.77
NAACP          -0.74
federally      -0.74
iscopal        -0.74
defund         -0.73
POLIT          -0.73
DonaldTrump    -0.71
leftist        -0.70
olitics        -0.70
precincts      -0.69
Positive Logits
uda            0.81
Warp           0.79
Nebula         0.78
Rou            0.77
Robot          0.74
osaurus        0.73
Games          0.73
Nek            0.73
oshi           0.72
eki            0.72
Activations Density 0.658%