Explanations
instances of the word "cover" in various contexts
Negative Logits
Token     Logit
ering     -0.16
swagen    -0.16
zing      -0.16
covered   -0.15
ãĥ³ãĥĸ    -0.15
èľľ       -0.15
riba      -0.15
scale     -0.14
epad      -0.14
alo       -0.14
Positive Logits
Token     Logit
gence     0.23
dale      0.21
alls      0.21
story     0.20
story     0.19
Story     0.17
plate     0.17
utra      0.17
iges      0.16
letter    0.16
Activations Density 0.011%