INDEX
Explanations
time-related phrases, specifically words related to past, present, and future events
the word "now" in various contexts
New Auto-Interp
Negative Logits
ascript
-0.86
habi
-0.79
insula
-0.76
ò
-0.75
advertisement
-0.74
æ©
-0.72
ked
-0.70
teasp
-0.70
drawn
-0.69
rongh
-0.69
POSITIVE LOGITS
we
1.17
everybody
1.03
they
1.02
adays
1.00
you
0.98
it
0.97
everyone
0.94
somebody
0.93
I
0.91
everything
0.87
Activations Density 0.113%