INDEX
Explanations
words related to evaluation or judgment on a scale
the phrase "what makes" in various contexts
New Auto-Interp
Negative Logits
scrimmage
-0.67
ban
-0.67
nurs
-0.67
76561
-0.65
thia
-0.65
---------
-0.62
conditioning
-0.57
Witch
-0.55
Souls
-0.55
herry
-0.55
POSITIVE LOGITS
hift
1.13
sure
1.02
berra
0.81
paio
0.77
auri
0.76
elling
0.76
ailable
0.74
landfall
0.74
paces
0.73
ebin
0.73
Activations Density 0.129%