INDEX
Explanations
words related to long-term activities or processes
topics related to safety, particularly concerning personal and societal issues
New Auto-Interp
Negative Logits
OULD
-0.75
hod
-0.61
umbledore
-0.59
))))
-0.59
ERE
-0.58
{:-0.57
=~
-0.57
)))
-0.56
rame
-0.55
pload
-0.55
POSITIVE LOGITS
lately
2.05
since
1.97
since
1.64
ever
1.35
recently
1.21
thus
1.05
Since
1.05
Since
1.03
recent
0.96
over
0.95
Activations Density 0.929%