INDEX
Explanations
adjectives or nouns related to societal issues
terms related to classifications of various conditions, identities, and societal issues
New Auto-Interp
Negative Logits
ravings
-0.87
assies
-0.82
Rooms
-0.79
doms
-0.79
ispers
-0.78
xes
-0.78
apses
-0.76
votes
-0.76
acters
-0.74
flows
-0.73
POSITIVE LOGITS
spokesperson
0.87
reminder
0.84
brainer
0.83
sleeper
0.82
affair
0.81
unto
0.80
contender
0.78
entity
0.78
approximation
0.77
continuation
0.75
Activations Density 0.457%