INDEX
Explanations
phrases related to physical sensations or bodily characteristics
New Auto-Interp
Negative Logits
Educational
-0.66
Soros
-0.65
vernment
-0.65
Hammond
-0.64
Hof
-0.64
agall
-0.64
Nex
-0.63
AMI
-0.61
Developers
-0.61
Peg
-0.60
POSITIVE LOGITS
lings
1.17
bags
1.00
meat
0.98
ly
0.95
shed
0.93
thirst
0.92
lesh
0.91
roots
0.90
mares
0.89
grave
0.89
Activations Density 0.024%