INDEX
Explanations
keywords related to acceptance or feelings within various contexts
terms associated with acceptance and emotional responses to societal issues
Negative Logits
ouf        -0.67
owitz      -0.67
onies      -0.66
claimer    -0.62
orio       -0.61
istries    -0.61
Bravo      -0.59
Shining    -0.59
odore      -0.58
sit        -0.57
Positive Logits
universally    0.87
unanimously    0.81
ĸļ             0.75
psychiat       0.75
ynamic         0.74
inconsist      0.74
by             0.73
nergy          0.69
everywhere     0.69
ãĥ¼ãĥĨ          0.68
Activations Density 0.216%