INDEX
Explanations
mentions of advanced or modern technology and its societal implications
terms related to issues of autonomy and control in society
Negative Logits
nicer           -0.76
lately          -0.74
ursday          -0.68
comparing       -0.67
similarities    -0.67
bullish         -0.65
contrasting     -0.65
sharper         -0.64
thicker         -0.64
bigger          -0.61
Positive Logits
âĢ              1.30
âĢ              1.14
indefinitely    1.04
unless          0.97
nor             0.93
nor             0.92
impunity        0.91
unless          0.91
forever         0.86
anymore         0.83
Activations Density 0.636%