INDEX
Explanations
mentions of actions or scenarios that involve making someone lazy
instances of competition or opportunities for attention
New Auto-Interp
Negative Logits
SEA
-0.80
natureconservancy
-0.79
Interstitial
-0.79
RT
-0.72
BF
-0.69
Feature
-0.68
Playoffs
-0.67
EVA
-0.65
ILE
-0.65
APD
-0.64
POSITIVE LOGITS
none
0.63
pex
0.62
no
0.62
ãĥĥãĥī
0.60
anooga
0.60
deter
0.60
resa
0.60
igible
0.59
unch
0.58
onga
0.58
Activations Density 0.000%