INDEX
Explanations
phrases indicating a continuous or ongoing action or state
the word "ever" indicating ongoing situations or experiences
New Auto-Interp
Negative Logits
Reviewed
-0.70
aucus
-0.69
ouri
-0.64
esses
-0.62
animous
-0.62
Enlarge
-0.62
OUR
-0.61
icio
-0.61
our
-0.60
orst
-0.60
POSITIVE LOGITS
since
1.15
thing
1.09
green
1.02
since
0.98
lasting
0.86
bilt
0.86
more
0.85
onward
0.82
last
0.77
theless
0.76
Activations Density 0.018%