INDEX
Explanations
the presence of the word "cover" or its variations related to concealing or hiding something
instances of the word "cover" and its variations, indicating a focus on concealment or protection
New Auto-Interp
Negative Logits
----------
-0.66
bour
-0.64
EO
-0.64
friend
-0.61
ue
-0.61
********************************
-0.60
CARD
-0.60
nir
-0.59
--------------------------------------------------------
-0.58
================================================================
-0.58
POSITIVE LOGITS
ages
0.83
up
0.79
alls
0.76
bases
0.74
iday
0.71
usky
0.71
extensively
0.69
basics
0.68
cover
0.68
expense
0.66
Activations Density 0.049%