INDEX
Explanations
phrases related to contrasting ideas or situations
phrases indicating contradiction or contrast in context
New Auto-Interp
Negative Logits
--+
-0.79
cellaneous
-0.69
icultural
-0.66
iqueness
-0.65
alach
-0.64
leck
-0.64
rez
-0.62
itable
-0.62
chairs
-0.61
kamp
-0.60
POSITIVE LOGITS
reality
1.31
actually
1.26
truth
1.21
actual
1.09
actually
1.06
Actually
1.01
Actually
1.00
reality
0.93
Reality
0.93
realities
0.92
Activations Density 0.291%