INDEX
Explanations
references to current events or controversial topics mentioned in news articles
New Auto-Interp
Negative Logits
çīĪ
-0.73
OTOS
-0.68
Achievements
-0.67
Anxiety
-0.66
Bakr
-0.65
Ces
-0.64
Odyssey
-0.64
Emirates
-0.63
Calculator
-0.63
Administ
-0.63
POSITIVE LOGITS
skinned
1.27
colored
1.22
haired
1.17
washed
1.12
collar
1.12
legged
1.09
backed
1.08
eyed
1.07
bodied
1.06
oak
1.06
Activations Density 0.095%