INDEX
Explanations
mentions of being physically covered or hidden by something
instances of the word "covered."
New Auto-Interp
Negative Logits
mage
-0.69
rw
-0.64
Sa
-0.64
assuming
-0.62
cgi
-0.61
Sabha
-0.61
rained
-0.59
lua
-0.59
correctness
-0.59
enc
-0.58
POSITIVE LOGITS
Coverage
0.79
krit
0.79
iday
0.78
Cover
0.77
ummer
0.72
COVER
0.71
bys
0.71
utical
0.71
ãĤ¹
0.70
alls
0.70
Activations Density 0.018%