INDEX
Explanations
mentions of body parts
instances of the word "ar" in various contexts
New Auto-Interp
Negative Logits
Lumpur
-0.87
zinski
-0.81
Showdown
-0.81
rome
-0.76
Dew
-0.76
worth
-0.73
cade
-0.70
Mull
-0.70
Dull
-0.67
Rasmussen
-0.66
POSITIVE LOGITS
ar
3.69
Ar
1.60
Ar
1.57
Archer
1.29
arch
1.28
AR
1.16
Ark
1.16
ank
1.08
arrow
1.06
bows
1.06
Activations Density 0.011%