INDEX
Explanations
references to specific television shows, episodes, events, and health-related topics
references to popular TV shows and segments
New Auto-Interp
Negative Logits
agara
-0.83
aff
-0.82
uga
-0.81
rum
-0.79
intent
-0.77
umph
-0.75
upp
-0.75
Äĩ
-0.75
internal
-0.75
bon
-0.74
POSITIVE LOGITS
Initiative
1.11
Shooter
1.00
Productions
0.97
Comics
0.96
Geek
0.95
Solutions
0.95
Tracker
0.94
Arcade
0.94
Cafe
0.93
Baseball
0.93
Activations Density 0.200%