INDEX
Explanations
mentions of leaves, perceived as a common or unusual element
Negative Logits
PDATE     -0.86
ALLY      -0.66
ENTION    -0.64
adjud     -0.64
ICAN      -0.63
alty      -0.63
anyahu    -0.63
cedes     -0.62
indemn    -0.61
USS       -0.61
Positive Logits
let       1.25
lets      1.11
leaf      1.04
leted     0.96
iard      0.92
worm      0.92
ho        0.92
overs     0.91
leaf      0.91
worker    0.86
Activations Density: 0.021%