INDEX
Explanations
instances of the word "pass" in various forms and contexts
Negative Logits
lify     -0.21
tero     -0.20
eyi      -0.19
ean      -0.17
ead      -0.17
ting     -0.17
tings    -0.16
ty       -0.16
ey       -0.15
ted      -0.15
Positive Logits
ive      0.26
engers   0.26
ionate   0.25
enger    0.24
ported   0.24
ione     0.23
code     0.22
ages     0.21
arella   0.21
IVE      0.21
Activations Density: 0.008%