INDEX
Explanations
instances where actions or procedures are being carried out or recommended
instances of the word "apply" and its variations in various contexts
New Auto-Interp
Negative Logits
acters
-0.71
owship
-0.70
hoff
-0.68
footed
-0.67
airs
-0.63
OUS
-0.61
Island
-0.61
birds
-0.61
beck
-0.61
keeping
-0.60
POSITIVE LOGITS
pressure
0.92
arate
0.71
sunscreen
0.69
rals
0.66
aution
0.62
Anarchy
0.62
liber
0.61
enz
0.61
rigorous
0.60
muster
0.60
Activations Density 0.034%