INDEX
Explanations
content related to political and social issues with a focus on topics like medical procedures, cultural norms, opinions on beliefs, and government actions
New Auto-Interp
Negative Logits
contact
-0.80
estones
-0.78
sighting
-0.77
completion
-0.74
Finder
-0.74
pleted
-0.73
gauge
-0.71
availability
-0.70
hooting
-0.70
oother
-0.70
POSITIVE LOGITS
hypocrisy
1.63
hypocritical
1.51
disingen
1.41
arrogance
1.37
coward
1.37
misguided
1.34
irresponsible
1.34
hypoc
1.33
bigotry
1.32
hypocr
1.32
Activations Density 9.722%