INDEX
Explanations
references to specific locations or events within a text
instances of the word "Back" in various contexts
New Auto-Interp
Negative Logits
©¶æ¥µ
-0.64
ccess
-0.63
utical
-0.63
ihad
-0.62
Osc
-0.62
tyr
-0.60
izo
-0.59
isen
-0.59
ellen
-0.59
constitu
-0.59
POSITIVE LOGITS
lash
1.20
stab
1.18
door
1.11
tracking
1.08
GROUND
1.06
yard
1.05
wards
1.04
stage
1.04
dated
1.02
pack
1.02
Activations Density 0.027%