INDEX
Explanations
phrases related to societal issues and impacts
items related to social or cultural values and criticisms
Negative Logits
ajor          -0.85
scribe        -0.80
wark          -0.79
uca           -0.78
oru           -0.77
instance      -0.77
alion         -0.76
ocument       -0.75
dule          -0.74
escription    -0.73
Positive Logits
lousy         1.05
sluggish      1.02
lack          1.01
penchant      1.00
inability     0.99
scant         0.98
lackluster    0.97
catchy        0.95
inexplicable  0.95
dismal        0.95
Activations Density: 0.380%