INDEX
Explanations
items listed in a structured format containing a mix of varied information
lists and comparisons related to demographics and societal observations
New Auto-Interp
Negative Logits
unin
-0.54
insepar
-0.54
egu
-0.52
ĺħ
-0.51
Powered
-0.51
¬¼
-0.51
slightest
-0.51
ocument
-0.50
odor
-0.50
escap
-0.50
POSITIVE LOGITS
than
2.87
than
2.63
Than
2.18
compared
1.15
TH
0.91
Th
0.84
rather
0.81
then
0.80
worldly
0.79
THEN
0.75
Activations Density 1.491%