INDEX
Explanations
instances of phrases related to publicly expressing an opinion or stance
instances of the phrase "came out" or variations of it
Negative Logits
olesterol         -0.67
nesota            -0.66
»Ĵ                -0.63
ille              -0.62
Bots              -0.61
Login             -0.61
cius              -0.60
Icar              -0.59
=-=-=-=-=-=-=-=-  -0.58
atomic            -0.58
Positive Logits
fitted    1.05
doors     0.83
swinging  0.83
wards     0.82
smart     0.79
door      0.75
board     0.75
rower     0.73
stronger  0.72
skirts    0.72
Activations Density 0.040%