INDEX
Negative Logits
tys
-0.08
.entity
-0.07
(V
-0.07
ty
-0.07
envisioned
-0.07
electronic
-0.07
envis
-0.07
dig
-0.07
(scene
-0.07
entic
-0.07
POSITIVE LOGITS
errs
0.12
safest
0.12
precaution
0.12
conserv
0.11
hindsight
0.11
cautious
0.10
Conserv
0.10
safeguard
0.10
safe
0.10
conservative
0.10
Activations Density 0.042%