INDEX
Explanations
phrases related to specific objects or concepts, such as "oxygen supplies" and "electricity grid"
specific nouns and topics related to health, environment, and legal issues
New Auto-Interp
Negative Logits
ulhu
-0.69
ggles
-0.66
agall
-0.61
âĺĨ
-0.61
ifice
-0.60
infeld
-0.58
umblr
-0.58
edy
-0.55
OHN
-0.55
vez
-0.54
POSITIVE LOGITS
pox
0.65
vale
0.57
discrimination
0.56
worth
0.54
ãĥ
0.53
iculture
0.53
tattoo
0.53
billboards
0.52
bestos
0.51
shaw
0.51
Activations Density 1.083%