INDEX
Explanations
mentions of animated media
references to animated films or shows
New Auto-Interp
Negative Logits
orr
-0.84
yer
-0.83
mith
-0.82
hood
-0.76
ILA
-0.75
OA
-0.73
ills
-0.72
Availability
-0.70
lay
-0.70
anship
-0.70
POSITIVE LOGITS
GIF
0.97
ebted
0.81
gif
0.80
animated
0.79
ocument
0.78
atable
0.75
netflix
0.73
Animated
0.69
versions
0.69
cartoon
0.68
Activations Density 0.017%